The Bulletin of Symbolic Logic
Volume 1, Number 4, Dec. 1995
INNER MODELS AND LARGE CARDINALS
RONALD JENSEN
In this paper, we sketch the development of two important themes of
modern set theory,1 both of which can be regarded as growing out of work of
Kurt Gödel. We begin with a review of some basic concepts and conventions
of set theory.
§0. The ordinal numbers were Georg Cantor’s deepest contribution to
mathematics. After the natural numbers 0, 1, . . . , n, . . . comes the first
infinite ordinal number ù, followed by ù + 1, ù + 2, . . . , ù + ù, . . . and so
forth. ù is the first limit ordinal as it is neither 0 nor a successor ordinal.
We follow the von Neumann convention, according to which each ordinal
number α is identified with the set {í | í < α} of its predecessors. The ∈
relation on ordinals thus coincides with <. We have 0 = ∅ and α + 1 =
α ∪ {α}. According to the usual set-theoretic conventions, ù is identified
with the first infinite cardinal ℵ0 , similarly for the first uncountable ordinal
number ù1 and the first uncountable cardinal number ℵ1 , etc.2 We thus
arrive at the following picture:
0, 1, . . . , ù, ù + 1, . . . , ù + ù, . . . , ù1 , . . . , ù2 , . . .
k
k
k
ℵ0
ℵ1
ℵ2
The von Neumann hierarchy3 divides the class V of all sets into a hierarchy
of sets Vα indexed by the ordinal numbers. The recursive definition reads:
V0 = ∅; Vα+1 = P (Vα ) (where P (x) = {z | z ⊆ x} is the power set of x);
This article is based on the Gödel Lecture given at the meeting of the Association for Symbolic Logic at Berkeley in January, 1990. A German version of this paper, “Innere Modelle
und Große Kardinalzahlen,” was published in Jahresbericht der Deutschen MathematikerVereinigung, Jubiläumstagung 100 Jahre DMV (B. G. Teubner, Stuttgart, 1992), pp. 265–280.
This English translation was prepared by Andreas Blass. It is published by permission of B.
G. Teubner.
1
This paper and the lecture on which it is based are not intended as historical surveys. In
particular, no attempt has been made to give proper credit for many of the early contributions
to the subject.
2
ùí is the íth infinite ordinal number of cardinality greater than all its predecessors and is
identified with the corresponding cardinal number ℵí .
3
This hierarchy was introduced independently by Zermelo.
c 1995, Association for Symbolic Logic
1079-8986/95/0104-0001/$02.50
393
394
RONALD JENSEN
S
Vë = í<ë Ví for limit ordinals ë.4 We can represent this hierarchy by the
following picture.
❆
❆
❆
❆
✁
✁
✁
α
❆
❆
❆
❆
✁
✁
❆
❆ ✁
❆✁
✁
✁
✁ Vα
✁
The Vα increase in width as α increases. The ordinal numbers, here
drawn as a straight line, grow linearly, since in the transition from α to
α + 1 = α ∪ {α} exactly one new
S element is added. The axiom of foundation
in set theory asserts that V = α Vα .
This structure has two complementary motivations. The iteration can be
understood as a generalization of analysis, since the real numbers can be represented in P (ù), subsets and functions on R in PP (ù), etc. On the other
hand, set theory can be regarded as a relativization of finite combinatorics,
where the concepts “finite” and “infinite” are replaced by “set” and “proper
class.” From this point of view it is particularly easy to motivate the usual
axioms of set theory,5 for example the axiom of separation which says that
a subclass of a set is a set or the axiom of replacement which says that the
image of a set under a function is a set. It is perhaps not a coincidence that
the young Cantor was influenced by Kronecker as well as by Weierstraß.
§1. The most famous question in set theory is the continuum problem.
The continuum hypothesis (CH) asserts that the cardinal number Card(R)
of R has the smallest possible value ù1 . An equivalent formulation is
Card(P (ù)) = ù1 . The generalized continuum hypothesis (GCH) asserts
that Card(P (ùí )) = ùí+1 for all í. GCH implies the axiom of choice (AC).
Despite great efforts, Cantor did not succeed in deciding whether CH is true,
and neither did his successors. In this century, the suspicion grew that CH
might be independent of the axioms of set theory, i.e., that these axioms do
not suffice to prove either the continuum hypothesis or its negation. This suspicion was based on the apparent insufficiency of the axioms to adequately
characterize the power set operation. It proved to be correct, and the two
4
Urelements or non-sets are not needed in mathematics. If we want to have them, we must
modify the definition so that they occur in V0 .
5
By this I mean the axioms of Zermelo-Fraenkel (ZF), possibly extended by the axiom of
choice (ZFC).
INNER MODELS AND LARGE CARDINALS
395
research directions described in this article can be regarded as attempts to
understand power sets better.
The first progress came in 1939, when Gödel showed that GCH is consistent with the axioms. Then the unprovability of CH was shown by Cohen
in 1963. Thus the independence was proved. Gödel’s proof was the first
application of the concept of inner models. This concept arises from an attempt to circumscribe the amorphous power set operation: one replaces the
Vα -hierarchy with a new hierarchy Wα , defined just like Vα except that, in
the transition from Wα to Wα+1 , not all subsets of Wα are adjoined but only
those that are obtained by S
means of certain “well-understood” and clearly
specified operations. W = α Wα is then a “slice” of V , as indicated in the
following picture.
W
V
❡
❆
✁
✪
❡
✪
❡
α
❆
✁ Wα ✪
❡
✪ Vα
❡
✪
✁ ✪
❡ ❆
❡
✪
❡ ❆
✁ ✪
❡
✪
❡
✪
❆✪
✁
❡
We can hope that, for a suitable choice of these operations, W will formally
satisfy all the axioms of set theory. In S
this case we call W an inner model.6
Gödel defined the inner model L = α Lα of constructible sets. Lα+1 is
the collection of subsets of Lα that can be obtained from the members of
Lα ∪ {Lα } by constructive set operations. A pleasant property of this concept is its invariance with respect to different notions of constructivity. (For
constructivists disagree as to what “constructive” means.) The ordinal numbers form a non-constructive ingredient in the definition of constructibility,
and so we arrive at the same class L by using any “reasonable” notion of
constructivity. It follows that many versions of the hierarchy Lα are possible.
One frequently used definition sets Lα+1 = Def(Lα ) = the set of subsets of
Lα definable in first-order logic over the structure hLα , ∈i with the use of
arbitrary parameters from Lα .
Gödel showed that L is an inner model. He also showed that the definition
of L is absolute in every inner model W in the following sense. If we interpret
the definition of L in W rather than in V , then we obtain a class LW ⊆ W .
Absoluteness means that LW = L. In particular, it follows that LL = L.
6
The language of set theory is a first-order language with the predicates = and ∈. We
interpret this language in W by restricting all quantifiers to range over W while the atomic
formulas retain the same meaning in W that they have in V .
396
RONALD JENSEN
But then the statement V = L holds in L, for this means just that VL = LL ,
where LL = L and VL = {x | x = x}L = L. So the axioms of set theory
remain consistent if we adjoin the additional axiom V = L, called the axiom
of constructibility. Gödel deduced GCH from V = L. So GCH is consistent
relative to the other axioms. From GCH follows AC, which asserts that every
set can be well-ordered. Gödel went further and showed, as a consequence
of V = L, that the real continuum R has a definable well-ordering, and in
fact one whose definition can be formulated in second-order number theory.7
(The precise complexity of this definition is specified by its level, Σ12 , in the
projective hierarchy.)
Gödel’s discovery came shortly before the start of World War II and
constituted a coda for the development of naive set theory in the early
decades of this century. In the next twenty years, set theory was quiescent.
In the sixties, after Cohen’s discovery had revived interest in set theory, we
discovered a series of strong and applicable combinatorial principles that
follow from the axiom V = L. These principles, which can be regarded as
natural strengthenings of GCH, are used in the infinitary parts of algebra,
topology, and combinatorics. (The best known of these principles, called ♦,
permits constructions in which the ≥ ù2 subsets of ù1 are diagonalized in
ù1 steps.)
§2. According to Gödel’s theorem, the negation of CH is not provable.
It remained to show that CH is not provable. But this could not be shown
with the method of inner models, since precisely the success of this method
precluded such an application: By absoluteness, we have L = LW ⊆ W for
every inner model W . So V = L says that V is the only inner model. As
V = L is consistent and implies CH, we cannot prove the existence of an
inner model in which CH is false. To negate CH, we must apparently find an
“outer model” that extends V . But how is this possible if V already contains
all sets? The solution to this puzzle was given by Cohen’s forcing method.
Although forcing is somewhat outside the scope of this paper, I shall try to
give a sketch of it (which the reader may skip without loss of continuity).8
Instead of working with sets, one can consider their characteristic functions.
2
This suggests an alternative to the Vα hierarchy: We define a hierarchy
S Vα of2
2
2
partial characteristic functions. As before, we set V0 = ∅ and Vë = í<ë Ví
2
for limit ordinals ë. At successor ordinals,
set Vα+1
= the set of all
S we
2
2
2
f : Vα → 2. Finally, we define V = α Vα . Then V 2 is “essentially”
isomorphic to V , which can be made precise as follows. Each f ∈ V 2 is
defined on a subset Vα2 of V 2 . Extend f to a total function f ∗ on V 2 by
7
Second-order number theory is the theory of the structure hP (ù), ù, +, ·, ∈i. (Real
numbers can be identified with elements of P (ù).)
8
Cohen’s original version looks somewhat different from the version presented here, which
is based on a later analysis by Solovay.
INNER MODELS AND LARGE CARDINALS
397
setting f ∗ (x) = 0 wherever f(x) is undefined. Define an equality relation I
on V 2 by letting fIg iff f ∗ = g ∗ , and define a pseudo-∈ relation E by fEg
iff g ∗ (f) = 1. Then I is a congruence relation on hV 2 , Ei, and the quotient
structure hV 2 , Ei/I is isomorphic to V .
This analysis suggests a strategy for extending V 2 . 2 = {0, 1} is the
smallest Boolean algebra. If we carry out the same construction with another
B
B
complete Boolean algebra B in place of 2, we obtain a hierarchy
S VαB. (Vα+1
B
B
is then defined as the set of all f : Vα → B.) We set V = α Vα . Then
we have V 2 ⊆ V B because 2 ⊆ B. Somewhat more complicated definitions
than before give an “equality relation” I and a “pseudo-∈ relation” E. These
are, however, not relations in the usual sense but “B-valued relations,” i.e.,
binary functions whose values lie in B. In other words, we now assign to
each statement of the form x = y or x ∈ y (with x, y ∈ V B ) a truth value in
B. Then we can recursively define, for each statement ϕ of the language of
set theory, a truth value kϕk ∈ B. 9 (kϕk may lie strictly between 0 and 1.)
It turns out that all the axioms of set theory have truth value 1 (including AC
if we assume AC in V ) and the property of having truth value 1 is preserved
by logical deduction. However, a suitable choice of B makes kCHk = 0.
This shows that CH does not follow from the axioms. (The consistency of
CH with the axioms could also have been established by the forcing method,
for one can choose B so that kCHk = 1.)
Forcing also led to the discovery of useful combinatorial principles. The
best known of these is Martin’s Axiom (MA). The strength of MA depends
on the size of the continuum. MA is a trivial consequence of CH. If CH is
false, then MA has far-reaching consequences (but remains consistent relative to the axioms). MA is often a useful complement to the combinatorial
principles that hold in L. For example, the independence of various statements can be proved by showing that one direction follows from ♦ and the
other from MA and ¬CH.
§3. The forcing method proved to be an extraordinarily powerful tool
for obtaining independence results in set theory and led to a renaissance of
this field. It was an exciting time for the young mathematicians who flocked
into this field. Major classical problems were solved with gratifying regularity. In the long run, though, it is somewhat unsatisfying to prove only that
statements are unprovable. Increasingly, people asked whether the weakness
of the set-theoretic axioms could be alleviated by accepting new axioms. An
obvious candidate would be Gödel’s axiom of constructibility, V = L. This
9
For example, weTset k¬ϕk = ¬kϕk =the complement of kϕk in B ; kϕ ∧ øk = kϕk ∩
køk; k∀v ϕ(v)k = x∈V B kϕ(x)k. Once this definition is given, the B -valued relations are
uniquely characterized by the requirements that the axioms
S of equality logic and the axiom
of extensionality have truth value 1 and that kx ∈ yk ⊆ z∈dom(y) kx = zk for x, y ∈ V B .
398
RONALD JENSEN
axiom makes a clear statement about the nature of the mathematical universe. It is mathematically fruitful in that it solves many problems and leads
to interesting new concepts and theories. It is philosophically attractive for
adherents of “Ockham’s razor,” which says that one should avoid superfluous
existence assumptions. I personally find it a very attractive axiom. Nevertheless, it has been rejected by the majority of set theorists, beginning with
Gödel himself. One could perhaps summarize Gödel’s objection as follows.
The universe of sets is too large and varied to be fully grasped by the human
spirit. Over time, however, mathematical intuition improves and with it our
ability to describe this universe. The steps of this approximation process
are marked by the axioms that we recognize as true. The axiom of constructibility presents, modulo the ordinal numbers, a complete description
of sets and is thus a limiting principle that would halt the approximation
process. Instead, one should look for additional existence axioms, called
strong axioms of infinity. A typical axiom of this sort asserts the existence
of a large cardinal number. As an example, Gödel suggests the inaccessible
cardinals. As the universe satisfies all axioms of set theory, it should be so
big that even some initial segment Vκ also satisfies the axioms. If Vκ satisfies
the axioms (in the sense of second order logic10 ), then we call κ inaccessible.
But if an inaccessible cardinal exists, then this statement also should be true
in an initial segment Vô , i.e., there should be two inaccessible cardinals κ
and ô. By continuing this argument, we see that the inaccessible cardinals
are unbounded in the ordinal numbers. But then also this assertion should
hold in an initial segment, i.e., there is an inaccessible limit of inaccessible
cardinals, etc.
Inaccessible cardinals are, however, not an alternative to the axiom of
constructibility, for they are consistent with this axiom.11 For this we need
a principle saying that the universe is not only “long” but also “wide.” The
first large cardinal axiom proved to imply V 6= L asserts the existence of a
measurable cardinal. As the name implies, these cardinals originated from
questions in measure theory. κ is measurable if and only if there is a κcomplete, non-trivial, 0,1-valued measure defined on all subsets of κ. (Or,
equivalently, there is a κ-complete non-principal ultrafilter on κ.) The settheoretic content of this concept becomes clearer in the following definition.
Definition. κ is measurable if and only if there exist M and ð such that
(1) M is an inner model.
(2) ð : V → M is an elementary embedding.12
(3) κ is the critical point of ð (i.e., ð ↾ κ = id, ð(κ) > κ).
10
This means that Vκ is used as the domain of sets and Vκ+1 as the domain of classes.
If κ is inaccessible in V , then also in L.
12
That is, ϕ(x1 , . . . , xn ) is true in V if and only if ϕ(ð(x1 ), . . . , ð(xn )) is true in M .
11
INNER MODELS AND LARGE CARDINALS
399
In the late fifties, Dana Scott showed that the existence of a measurable
cardinal implies V 6= L. Later, it was shown that these cardinals have
interesting consequences for the real continuum, e.g., that every Σ12 set is
Lebesgue measurable and has the Baire property. (Consequently, there can
be no Σ12 well-ordering of R.)
We can generate a hierarchy of even stronger large cardinal principles by
imposing additional conditions on the model M in the definition above. For
example, we call κ α-strong iff Vα ⊆ M . We call κ strong if it is α-strong
for all α. κ is superstrong iff Vð(κ) ⊆ M . (This axiom is stronger than
the previous one in the sense of relative consistency , i.e., the existence of
superstrong cardinals implies the consistency of strong cardinals, but not
conversely.)
These cardinals are the first stages of a hierarchy that has been intensively
investigated in the last 20 years. Among the later stages are the compact
and huge cardinals. The strongest principle of this sort was proposed by
Reinhardt. We call κ a Reinhardt cardinal if M = V in the definition above.
At first, Reinhardt cardinals were greeted by many set theorists, including
the author of this article, as the embodiment of an intuitively especially
satisfactory existence principle. But after a few years Kunen proved that
these cardinals do not exist. This set an upper limit for the hierarchy.
§4. Strong axioms of infinity say that the universe of sets is very large,
and, as we have seen, they have important consequences for the real continuum. But one can also formulate axioms saying very directly that the
continuum is large. The most successful principles of this sort are the socalled axioms of determinacy.
Let ù ù be the set of all infinite sequences of natural numbers. To every set
A ⊆ ù ù we associate a game as follows: Player I chooses a number a0 ∈ ù;
Player II then chooses a1 ∈ ù; Player I chooses a2 ; etc. After ù moves, the
players have defined a sequence a = hai |i < ùi. Player I wins if a ∈ A;
otherwise II wins. We say that A is determined if one of the two players has a
winning strategy. A winning strategy for I, for example, is a function f, from
finite sequences of natural numbers to natural numbers, such that I always
wins if he plays a2i = f(a1 , a3 , . . . , a2i−1 ) for all i. Strategies can be coded
by real numbers, so the existence of a strategy for A is an existence postulate
about R. The simplest form of the axiom of determinacy (AD) asserts that
every A ⊆ ù ù is determined. But AD implies that R cannot be well-ordered,
so it cannot be accepted as an axiom. There are, however, various axioms
of definable determinacy, saying that a certain class of “nicely definable”
A ⊂ ù ù are determined. These seem to be consistent with the axiom of
choice (AC) and have been intensively studied in the last twenty years. The
axiom of projective determinacy (PD) says that the projective, i.e., Σ1ù , sets
400
RONALD JENSEN
are determined. Naturally, Σ1n -AD means that the Σ1n sets are determined.13
An especially attractive axiom is AD in L[R], which says that AD holds in
the inner model L[R] that is defined exactly like L except that L0 [R] = R.
This axiom is equivalent to the statement that every A ∈ P (ù ù ) ∩ L[R] is
determined. It turns out that under this assumption the model L[R] becomes
a paradise for analysts: Every set of real numbers is Lebesgue measurable
and has the Baire property. AC is false there, but if we assume AC in V then
L[R] still satisfies the weaker forms of AC—such as the axiom of dependent
choice—that are needed by analysts.
It was soon discovered that there is a connection between determinacy and
large cardinals. For example, AD implies the existence of an inner model
with a measurable cardinal. (In fact, this cardinal is ù1 in V .) In the other
direction, Σ11 -AD follows from the existence of a measurable cardinal. The
great hope of researchers in both fields was to deduce definable determinacy
from the existence of sufficiently large cardinals. Until recently, however,
this hope remained unfulfilled, and the three main areas of set theory—
inner models, large cardinals, and determinacy—went their separate ways.
Large cardinals probably attracted the most attention and were certainly
the subject of very deep investigations. Nevertheless, this area remained
in a certain sense the least satisfactory: It lacked—until recently—both the
deep structure theory and the occasional dramatic shifts of perspective of
the other two areas.
§5. The axiom of constructibility on the one hand and large cardinals and
determinacy on the other embody two radically different conceptions of the
universe of sets. How can these conceptions be justified? Most proponents
of V = L and similar axioms support their belief with a mild version of
Ockham’s razor. L is adequate for all of mathematics; it gives clear answers to
deep questions; it leads to interesting mathematics. Why should one assume
more? The proponents of strong axioms of infinity usually call themselves
Platonists and use this to support their belief. I do not understand, however,
why a belief in the objective existence of sets obligates one to seek ever
stronger existence postulates. Why isn’t Platonism compatible with the mild
form of Ockham’s razor cited above? Something else must be going on. I
would like to propose a—necessarily somewhat speculative—hypothesis.14
Could it be that the duality in modern set theory is nothing but a new
manifestation of an ancient conflict between two points of view—I almost
want to say two emotional states—which have always existed in mathematics?
I call them the arithmetical and geometrical points of view. I also call the first
13 1
Σ0Sconsists of the Borel sets. Σ1n+1 consists of projections of complements of Σ1n sets.
Σ1ù = n Σ1n .
14
The reader can skip the rest of §5 without loss of continuity.
INNER MODELS AND LARGE CARDINALS
401
one the Pythagorean point of view, for Pythagoras expressed it in its purest
form: Everything consists of numbers. In other words, every mathematical
structure can be interpreted in the natural structure of the positive integers.
This idea is naturally very attractive; it gives to all of mathematics the intuitive
clarity of the natural numbers. I conjecture that, if it could really be carried
out, it would still be the dominant point of view today. In reality, however,
the geometric point of view has been dominant since the rise of analysis.
Thus I also call it the Newtonian point of view.15 The Newtonian directs his
gaze to the real rather than the natural numbers. He is less impressed by their
clarity than by their boundless multiplicity. The real numbers constitute a
gigantic, unfathomable sea. For every principle that generates real numbers,
there must be a number not attainable by that principle. This excludes
the possibility of an interpretation of the real numbers within the natural
numbers.
If one accepts set theory, then there is no doubt that Cantor has refuted
Pythagoreanism in the strongest terms by showing that there are more real
than natural numbers. But Cantor also introduced the ordinal numbers,
which are in every sense the transfinite continuation of the natural numbers.
They share much of the intuitive clarity of the natural numbers. Thus Cantor,
who refuted the old Pythagoreanism, made possible a new Pythagoreanism
in which the ordinal numbers take over the role of the natural numbers.
In this sense, Gödel’s axiom of constructibility seems to me to embody an
entirely coherent Pythagorean picture of the world. And this picture cannot
be refuted, for Gödel showed that V = L is consistent if the other axioms
are. But this axiom provides—modulo the ordinal numbers—a complete
description of all sets and is therefore unacceptable for a Newtonian. For
him, there must be a real number not generated by constructible processes.
Since one cannot prove in set theory that such a number exists, one must
seek new axioms. Thus, the ancient conflict is fought in a new arena.
Whether or not one accepts this analysis, there is no doubt that the quest
for new axioms of infinity has led to very interesting mathematics. In addition, the proponents of these axioms have, through hard technical work,
assembled a series of impressive plausibility arguments.16 Nevertheless, I
doubt that one could, with the sort of evidence that we have, convert the
mathematical world to one or the other point of view.17 Deeply rooted
differences in mathematical taste are too strong and would persist.18
15
Of course one could also call it Leibnizian. But perhaps Newton thought more
geometrically.
16
See §7, §8.
17
The author confesses to being emotionally a Pythagorean.
18
In the course of time I have had numerous opportunities to discuss the alternatives in set
theory with mathematicians from other disciplines. I am always surprised how rapidly and
402
RONALD JENSEN
§6. Strong axioms of infinity should prevent the universe from being
“constructed from below” like L. But do they really prevent this? It turns
out that we can have a measurable cardinal in a very L-like model. The
measurability of κ means that there is a normal ultrafilter U in P (κ).19 Let
the hierarchy LUαS
be defined like Lα except that LUα+1 = Def(hLUα , ε, U ∩LUα i),
U
and set L = α LUα . Then LU is an inner model. In addition, κ is
measurable in LU , for U ∩ LU ∈ LU appears in LU to be a normal ultrafilter.
Silver and others have shown that LU is thoroughly similar to L. It satisfies
GCH. It has the combinatorial properties of L. (In fact, until now, no
purely combinatorial difference between L and LU has been discovered.) It
has a well-ordering of the reals definable in second-order arithmetic (in this
case Σ13 instead of Σ12 ). Even the absoluteness of L has an analog: LU has a
definition (using κ as a parameter) that is absolute in every inner model M
with LU ⊆ M .
But our definition of LU does not by any means show that this model can
be “built from below by well-understood operations,” for the object U is
explicitly used in the construction. That a construction from below is nevertheless possible to a certain extent was proved only later in the so-called
core model theory. This theory is somewhat complicated, but I shall try to
give a rough sketch of it. Let us assume not only that V 6= L but also that
there is a non-trivial elementary embedding ð : L → L.20 It turns out that
this assumption is equivalent to the existence of a certain set, denoted 0♯ . In
a very concrete way—which unfortunately cannot be described here—0♯ encodes a complete description of the structure of L together with a canonical
elementary embedding ð : L → L. The definition of 0♯ is absolute in every
inner model that contains 0♯ . Although it unfortunately cannot be justified
here, there is no doubt that 0♯ is a “well-understood” mathematical object.
If 0♯ does not exist, then the global structure of V does not deviate too much
from that of L. This is the main content of the so-called “covering theorem”:
If 0♯ does not exist then every uncountable set X of ordinal numbers has a
“covering set” Y ⊇ X with Y ∈ L and Card(Y ) = Card(X ). (From this
follows, for example, that every singular cardinal number has the same cardinal successor in L as in V .) But if 0♯ exists, then the structure of L-cardinals
is “wiped out” in V . (For example, every uncountable cardinal looks inaccessible in L.) So 0♯ may be regarded as the “next larger construction step”
after L. Then we form the inner model L[0♯ ], the constructible closure of
with what certainty they express an opinion. Their opinions are divided and reflect, in my
view, a pre-existing orientation.
19
U is normal if for every regressive function f : X → κ (where “regressive” means
f(î) < î for all î ∈ X ) with X ∈ U there is an ç < κ with {î | f(î) = ç} ∈ U . Normality
implies κ-completeness.
20
Non-trivial means ð 6= id.
INNER MODELS AND LARGE CARDINALS
403
0♯ .21 Now let us assume that L[0♯ ] is non-trivially elementarily embeddable in
itself by ð : L[0♯ ] → L[0♯ ]. This leads to a set 0♯♯ encoding L[0♯ ] and a canonical embedding ð. Then we form L[0♯♯] and so on. After ù steps, we have
a sequence 0♯ , 0♯♯ , . . . , 0(n) , . . . . We let 0(ù) encode this sequence and form
L[0(ù) ]. An embedding ð : L[0(ù) ] → L[0(ù) ] yields 0(ù+1) , and so on. When
we have 0(α) for all ordinal numbers α, we form the constructible closure L♯
of all the 0(α) .22 A non-trivial elementary embedding then yields a set 0(∞) ,
and so forth.23 We arrive at a sequence: 0♯ , 0♯♯ , . . . , 0(α) , . . . , 0(∞) , 0(∞+1) , . . . .
The terms in this sequence are called mice. With some effort, one can define
the class of all mice along with its natural ordering. (This ordering can be
much longer than ∞.) But then we can build the constructible closure of all
mice, which we denote by K.24 K is called the core model. K is an inner
model. The definition is absolute in the sense that KM = K for every inner
model M ⊇ K. K is L-like in the same sense as LU . What happens if there
is a non-trivial elementary embedding ð : K → K? A new mouse cannot
result, as all mice are already present in K. What we get is an inner model
LU , where U ∈ LU witnesses the measurability in LU of some ordinal κ. We
have P (κ) ∩ K = P (κ) ∩ LU . U can be obtained in a fairly direct way from
the embedding ð. So we have, after all, obtained LU “from below” via the
long march through the mice.
The model LU is uniquely determined by the number κ. Let us now choose
κ as small as possible. What happens if there is a non-trivial elementary
embedding ð : LU → LU ? We obtain a set 0† coding LU and a canonical
embedding ð. So, after all, 0† is “the next mouse.” It differs from the previous
mice in that, to obtain it, we needed an intermediate step marked not with
a mouse but with an inner model LU . This suggests the possibility of a
continuing process, producing inner models for stronger and stronger axioms
of infinity. How far can we go? Could it be that the directions mentioned
in §5, Newtonianism and Pythagoreanism, are compatible after all, with
Newtonianism indicating possibilities that then turn out to be realizable in a
Pythagorean universe? Progress so far has been relatively modest. The next
major step after a measurable cardinal is a strong cardinal. A few years ago,
Dodd and the author set themselves the goal of defining a core model for
a strong cardinal. Mitchell worked in the same direction but with a more
modest goal. The technical difficulties turned out to be very great. Dodd and
I barely got past the fundamental fine structure theory. Mitchell got closer
21 ♯
0 can be regarded as a set of ordinal numbers (even as a subset of ù). Then L[0♯ ] is
defined from 0♯ as LU was defined from U .
S
22 ♯
L = α L[0(α) ]. (Remark: 0(í) ∈ L[0(α) ] for í ≤ α.)
23
∞ is the class of all ordinal numbers and is also regarded as the “largest ordinal number.”
24
K is the union of all L[m] such that m is a mouse.
404
RONALD JENSEN
to his goal. Recently, Mitchell—building on a suggestion of Baldwin—
proposed a new definition of mice, which made immense simplifications
possible.
The theory is still very long, and not all the details have been written
down yet. Nevertheless, I believe I can claim that we now have an extended
core model K in which a strong cardinal can exist. How big K is depends,
naturally, on what there is. If there is no inner model with a measurable
cardinal, then the new K coincides with the old one. If there is such an inner
model but 0† does not exist, then K = LU , etc. We again have KM = K if M
is an inner model with K ⊆ M . K is again thoroughly L-like. The existence
of a strong cardinal in V ensures the existence of one in K. If there is no
elementary embedding ð : K → K, then again much of the global structure
of K is preserved in V .25 But if ð : K → K exists, then we obtain the next
mouse. So we can go further. But it turned out that one cannot go much
further without seriously modifying the program.
§7. A few years ago, Hugh Woodin, in the wake of a deep result of
Foreman, Magidor, and Shelah, proved the following theorem: If there is a
superstrong cardinal, then there is no well-ordering of R in L[R]. In particular, R has no well-ordering definable in second-order arithmetic. This result
was surprising, because the concept of a superstrong cardinal had previously
been regarded as a rather modest extension of the concept of measurability.
The theorem set off an avalanche of results that have fundamentally altered
the landscape of set theory. It also set a sharp limit for the conventional
core model theory, because the methods that we had been using for the
construction of these models always yielded a Σ13 well-ordering of R. Even
more serious for this theory was another consequence of Woodin’s proof:
In a possible modified core model theory, one would have to give up the
absoluteness theorem; if K contains a superstrong cardinal, then one cannot
exclude the possibility that KM 6= K for some inner model M ⊇ K.26
A rapid sequence of discoveries followed. Martin and Steel produced the
long-desired proof that determinacy axioms follow from strong axioms of
infinity. A typical result—due to Martin, Steel, and Woodin—is that the
existence of a superstrong cardinal implies AD in L[R]. An even weaker
cardinal—the so-called Woodin cardinal—played a key role in the new developments. Call a cardinal ô strong with respect to a class A if for each
25
But this can no longer be expressed by so strong and simple a statement as the covering
theorem.
26
Using forcing, Woodin showed that, given an inner model M with a superstrong cardinal,
one can consistently assume the existence of a strictly larger model M ′ and an elementary
embedding ð : M → M ′ . Now let M = K. Then V = K holds in M and hence also in M ′ .
Absoluteness would imply M ′ = K = M , a contradiction.
INNER MODELS AND LARGE CARDINALS
405
â there exist an inner model M , a class B, and an elementary embedding ð : hV, Ai → hM, Bi such that ô is the critical point, Vâ ⊆ M , and
B ∩ Vâ = A ∩ Vâ . If κ is inaccessible, A ⊆ Vκ , and ô < κ, then we can
relativize this concept to Vκ . We call κ a Woodin cardinal if for each A ⊆ Vκ
there exists a ô < κ that is strong with respect to A in Vκ . Martin and
Steel showed that Σ1n+1 -determinacy follows from the existence of n Woodin
cardinals and a measurable cardinal above them. Then Woodin showed that
the existence of ù Woodin cardinals and a measurable cardinal above them
implies AD in L[R]. Woodin also has interesting equiconsistency results:
Σ12 -determinacy is equiconsistent with the existence of a Woodin cardinal.
AD in L[R] is equiconsistent with the existence of ù Woodin cardinals.
§8. The two areas, large cardinals and determinacy, had come together.
What about inner models? At first glance, the core model program seemed
to have reached a dead end. For two essential attributes of this theory would
have to be sacrificed in any extension: a nicely definable well-ordering of
R and the absoluteness theorem. The second sacrifice seemed especially
serious to me. K is defined as the union of its construction steps, where a
construction step (e.g., a mouse) should be a “well understood” object. For
me, this requirement always meant that one can recognize in an arbitrary
inner model M whether a given element of M is a construction step. That
implies KM = K if K ⊆ M . But this conclusion is false if a superstrong (or
just a Woodin) cardinal occurs in K.
Fortunately, Martin, Mitchell, and Steel did not let these considerations
trouble them but set out to find a concept of mouse suitable for the continuation of the theory. The first step in this extended core model program was to
realize a Woodin cardinal in a core model. Martin and Steel first defined a
“model of type LU .” Assuming a Woodin cardinal κ, they defined an inner
model W . The construction of W is not really “from below” but uses a
set whose existence follows from the assumed property of κ. They showed
that, in W , κ remains a Woodin cardinal, CH holds, and R has a Σ13 wellordering. (So Σ12 -determinacy is false in W .) But the internal structure of
this model remained quite mysterious; for example we do not know whether
GCH holds in it. Mitchell and Steel then created a revised W using the fine
structure techniques of core model theory. Their W satisfies GCH and is
quite generally L-like. Finally, Steel managed to construct a genuine core
model K. But to carry out his construction, he needs third order set theory
and a large cardinal property (namely measurability) for ∞ (the class of all
ordinals).27 When one has this, one finds a first order definition of K that
27
Steel thinks that second order set theory will suffice and that the measurability assumption can be weakened.
406
RONALD JENSEN
is absolute in K. At the moment, only a rather weak analog of the covering
theorem has been proved.28
The mice studied in this extended theory are still “well understood” objects, but the definition of mice is of such high logical complexity that the
absoluteness requirement is no longer satisfied.29 The possibility that there
are models M with K ⊆ M and KM 6= K turned out to be less troublesome
than one had expected. The increasing complexity of the concept of mouse
also explains why we lose the nicely definable well-orderings of R. The existence of such a well-ordering depends on the fact that every mouse that
constructs a new real number is itself a countable object and so can be coded
by a real number. The earlier mice were so simple that one could define the
set of all such codes—together with their natural well-ordering—in second
order arithmetic. With increasing complexity, this is no longer possible.
Will it be possible to realize all reasonable large cardinals in “well understood” inner models? This goal is important not only for the reasons
indicated in §6 but also because of a consideration connected with Gödel’s
justification for strong axioms of infinity. A common plausibility argument
for these axioms is that they are—as far as we can see—linearly ordered
by the relation of relative consistency.30 Historically, the axioms of infinity have many different mathematical sources, and there is no prima facie
reason to suppose that they are linearly ordered. The existence of such
an order—insofar as we can confirm it—is surely a strong argument for regarding these axioms as successive approximations to a “final” universe. The
axioms in the sequence mentioned earlier—measurable, strong, superstrong,
etc.—arise from weaker and stronger variations of a single theme. So it is
not surprising that they are linearly ordered. We can regard this sequence
as a yardstick with which to compare other strong axioms of infinity. (It
turns out that such comparison is sensible even for some assertions that seem
to have nothing to do with large cardinals, for example the existence of a
saturated ideal on ù1 .) Very many open questions remain. Perhaps the best
known is whether compact and supercompact cardinals are equiconsistent.
If we go “down” a bit and consider principles of infinity that are weaker than
measurability yet strong enough to imply V 6= L, then the so-called Erdős
cardinals form an analogous yardstick. Here the core model is available,
and it frequently allows us to compare other principles with this yardstick.
28
By this I mean theorems saying that the global structure of K is preserved in V if K is
rigid with respect to elementary embeddings.
29
That the absoluteness condition held previously is due to the fact that the concept of
mouse was ∆1 in the parameter ù1 .
30
An axiom A is consistent relative to B in the axiom system of set theory (ZFC) if it is
provable in first order arithmetic that the consistency of ZFC+A follows from the consistency
of ZFC+B. Modulo equiconsistency, this relation is a partial order.
INNER MODELS AND LARGE CARDINALS
407
(Typical examples are the Chang conjecture31 and the existence of Jónsson
cardinals.32 ) The only strategy we have for attaining comparable results for
larger cardinals is a corresponding generalization of the inner model theory.
I would like to close with a general comment on the direction of research
in set theory. In recent years I have occasionally heard other mathematicians complain that set theory has turned inward, i.e., that set theorists are
occupied too much with their own structural questions and not enough with
applications. This reproach does not by any means apply to all set theorists, but this paper does little to refute it. I hope, however, to have made
it plausible that a subject should from time to time turn inward. The deep
inner problems have become steadily clearer in recent years, and the tools
needed to overcome them have been developed. There has been enormous
progress recently. If set theory now finds itself in a rather introspective phase,
I am convinced that it will emerge from this phase stronger than ever and of
greater use for other areas of mathematics.
REFERENCES
[1] K. Devlin, Constructability, Springer-Verlag, 1984.
[2] F. R. Drake, Set theory: An introduction to large cardinals, North-Holland, 1974.
[3] R. Jensen, Measures of order 0, unpublished manuscript.
[4] K. Kunen, Set theory: An introduction to independence proofs, North-Holland, 1980.
[5] D. A. Martin and J. R. Steel, A proof of projective determinacy, Journal of the
American Mathematical Society, vol. 2 (1989), pp. 71–125.
, Iteration trees, Journal of the American Mathematical Society, vol. 7 (1994),
[6]
pp. 1–73.
[7] W. Mitchell and J. R. Steel, Fine structure and iteration trees, Lecture notes in logic
3, Springer-Verlag, 1994.
[8] J. R. Steel, The core model iterability problem, unpublished manuscript.
[9] J. R. Steel, Inner models with many Woodin cardinals, Annals of Pure and Applied
Logic, vol. 65 (1993), pp. 185–209.
ALL SOULS COLLEGE
OXFORD OX1 4AL
GREAT BRITAIN
31
The Chang conjecture is the following model-theoretic assertion: If hA, B, . . . i is a model
for a countable language with Card(A) = ù2 and Card(B) = ù1 , then there is an elementary
¯ B̄, . . . i with Card(A)
¯ = ù1 and Card(B̄) = ù.
submodel hA,
32
κ is a Jónsson cardinal if every algebra A = h|A|, f1 , f2 , . . . , fn i of cardinality κ has a
proper subalgebra of the same cardinality.