Inner Models and Large Cardinals

Ronald Jensen

Inner Models and Large Cardinals

Ronald Jensen

1995, Bulletin of Symbolic Logic

visibility

…

description

15 pages

link

1 file

In this paper, we sketch the development of two important themes of modern set theory, both of which can be regarded as growing out of work of Kurt Gödel. We begin with a review of some basic concepts and conventions of set theory. §0. The ordinal numbers were Georg Cantor's deepest contribution to mathematics. After the natural numbers 0, 1, …, n, … comes the first infinite ordinal number ω, followed by ω + 1, ω + 2, …, ω + ω, … and so forth. ω is the first limit ordinal as it is neither 0 nor a successor ordinal. We follow the von Neumann convention, according to which each ordinal number α is identified with the set {ν ∣ ν α} of its predecessors. The ∈ relation on ordinals thus coincides with <. We have 0 = ∅ and α + 1 = α ∪ {α}. According to the usual set-theoretic conventions, ω is identified with the first infinite cardinal ℵ0, similarly for the first uncountable ordinal number ω1 and the first uncountable cardinal number ℵ1, etc. We thus arrive at the following picture...

The Bulletin of Symbolic Logic Volume 1, Number 4, Dec. 1995 INNER MODELS AND LARGE CARDINALS RONALD JENSEN In this paper, we sketch the development of two important themes of modern set theory,1 both of which can be regarded as growing out of work of Kurt Gödel. We begin with a review of some basic concepts and conventions of set theory. §0. The ordinal numbers were Georg Cantor’s deepest contribution to mathematics. After the natural numbers 0, 1, . . . , n, . . . comes the first infinite ordinal number ù, followed by ù + 1, ù + 2, . . . , ù + ù, . . . and so forth. ù is the first limit ordinal as it is neither 0 nor a successor ordinal. We follow the von Neumann convention, according to which each ordinal number α is identified with the set {í | í < α} of its predecessors. The ∈ relation on ordinals thus coincides with <. We have 0 = ∅ and α + 1 = α ∪ {α}. According to the usual set-theoretic conventions, ù is identified with the first infinite cardinal ℵ0 , similarly for the first uncountable ordinal number ù1 and the first uncountable cardinal number ℵ1 , etc.2 We thus arrive at the following picture: 0, 1, . . . , ù, ù + 1, . . . , ù + ù, . . . , ù1 , . . . , ù2 , . . . k k k ℵ0 ℵ1 ℵ2 The von Neumann hierarchy3 divides the class V of all sets into a hierarchy of sets Vα indexed by the ordinal numbers. The recursive definition reads: V0 = ∅; Vα+1 = P (Vα ) (where P (x) = {z | z ⊆ x} is the power set of x); This article is based on the Gödel Lecture given at the meeting of the Association for Symbolic Logic at Berkeley in January, 1990. A German version of this paper, “Innere Modelle und Große Kardinalzahlen,” was published in Jahresbericht der Deutschen MathematikerVereinigung, Jubiläumstagung 100 Jahre DMV (B. G. Teubner, Stuttgart, 1992), pp. 265–280. This English translation was prepared by Andreas Blass. It is published by permission of B. G. Teubner. 1 This paper and the lecture on which it is based are not intended as historical surveys. In particular, no attempt has been made to give proper credit for many of the early contributions to the subject. 2 ùí is the íth infinite ordinal number of cardinality greater than all its predecessors and is identified with the corresponding cardinal number ℵí . 3 This hierarchy was introduced independently by Zermelo. c 1995, Association for Symbolic Logic 1079-8986/95/0104-0001/$02.50 393 394 RONALD JENSEN S Vë = í<ë Ví for limit ordinals ë.4 We can represent this hierarchy by the following picture. ❆ ❆ ❆ ❆ ✁ ✁ ✁ α ❆ ❆ ❆ ❆ ✁ ✁ ❆ ❆ ✁ ❆✁ ✁ ✁ ✁ Vα ✁ The Vα increase in width as α increases. The ordinal numbers, here drawn as a straight line, grow linearly, since in the transition from α to α + 1 = α ∪ {α} exactly one new S element is added. The axiom of foundation in set theory asserts that V = α Vα . This structure has two complementary motivations. The iteration can be understood as a generalization of analysis, since the real numbers can be represented in P (ù), subsets and functions on R in PP (ù), etc. On the other hand, set theory can be regarded as a relativization of finite combinatorics, where the concepts “finite” and “infinite” are replaced by “set” and “proper class.” From this point of view it is particularly easy to motivate the usual axioms of set theory,5 for example the axiom of separation which says that a subclass of a set is a set or the axiom of replacement which says that the image of a set under a function is a set. It is perhaps not a coincidence that the young Cantor was influenced by Kronecker as well as by Weierstraß. §1. The most famous question in set theory is the continuum problem. The continuum hypothesis (CH) asserts that the cardinal number Card(R) of R has the smallest possible value ù1 . An equivalent formulation is Card(P (ù)) = ù1 . The generalized continuum hypothesis (GCH) asserts that Card(P (ùí )) = ùí+1 for all í. GCH implies the axiom of choice (AC). Despite great efforts, Cantor did not succeed in deciding whether CH is true, and neither did his successors. In this century, the suspicion grew that CH might be independent of the axioms of set theory, i.e., that these axioms do not suffice to prove either the continuum hypothesis or its negation. This suspicion was based on the apparent insufficiency of the axioms to adequately characterize the power set operation. It proved to be correct, and the two 4 Urelements or non-sets are not needed in mathematics. If we want to have them, we must modify the definition so that they occur in V0 . 5 By this I mean the axioms of Zermelo-Fraenkel (ZF), possibly extended by the axiom of choice (ZFC). INNER MODELS AND LARGE CARDINALS 395 research directions described in this article can be regarded as attempts to understand power sets better. The first progress came in 1939, when Gödel showed that GCH is consistent with the axioms. Then the unprovability of CH was shown by Cohen in 1963. Thus the independence was proved. Gödel’s proof was the first application of the concept of inner models. This concept arises from an attempt to circumscribe the amorphous power set operation: one replaces the Vα -hierarchy with a new hierarchy Wα , defined just like Vα except that, in the transition from Wα to Wα+1 , not all subsets of Wα are adjoined but only those that are obtained by S means of certain “well-understood” and clearly specified operations. W = α Wα is then a “slice” of V , as indicated in the following picture. W V ❡ ❆ ✁ ✪ ❡ ✪ ❡ α ❆ ✁ Wα ✪ ❡ ✪ Vα ❡ ✪ ✁ ✪ ❡ ❆ ❡ ✪ ❡ ❆ ✁ ✪ ❡ ✪ ❡ ✪ ❆✪ ✁ ❡ We can hope that, for a suitable choice of these operations, W will formally satisfy all the axioms of set theory. In S this case we call W an inner model.6 Gödel defined the inner model L = α Lα of constructible sets. Lα+1 is the collection of subsets of Lα that can be obtained from the members of Lα ∪ {Lα } by constructive set operations. A pleasant property of this concept is its invariance with respect to different notions of constructivity. (For constructivists disagree as to what “constructive” means.) The ordinal numbers form a non-constructive ingredient in the definition of constructibility, and so we arrive at the same class L by using any “reasonable” notion of constructivity. It follows that many versions of the hierarchy Lα are possible. One frequently used definition sets Lα+1 = Def(Lα ) = the set of subsets of Lα definable in first-order logic over the structure hLα , ∈i with the use of arbitrary parameters from Lα . Gödel showed that L is an inner model. He also showed that the definition of L is absolute in every inner model W in the following sense. If we interpret the definition of L in W rather than in V , then we obtain a class LW ⊆ W . Absoluteness means that LW = L. In particular, it follows that LL = L. 6 The language of set theory is a first-order language with the predicates = and ∈. We interpret this language in W by restricting all quantifiers to range over W while the atomic formulas retain the same meaning in W that they have in V . 396 RONALD JENSEN But then the statement V = L holds in L, for this means just that VL = LL , where LL = L and VL = {x | x = x}L = L. So the axioms of set theory remain consistent if we adjoin the additional axiom V = L, called the axiom of constructibility. Gödel deduced GCH from V = L. So GCH is consistent relative to the other axioms. From GCH follows AC, which asserts that every set can be well-ordered. Gödel went further and showed, as a consequence of V = L, that the real continuum R has a definable well-ordering, and in fact one whose definition can be formulated in second-order number theory.7 (The precise complexity of this definition is specified by its level, Σ12 , in the projective hierarchy.) Gödel’s discovery came shortly before the start of World War II and constituted a coda for the development of naive set theory in the early decades of this century. In the next twenty years, set theory was quiescent. In the sixties, after Cohen’s discovery had revived interest in set theory, we discovered a series of strong and applicable combinatorial principles that follow from the axiom V = L. These principles, which can be regarded as natural strengthenings of GCH, are used in the infinitary parts of algebra, topology, and combinatorics. (The best known of these principles, called ♦, permits constructions in which the ≥ ù2 subsets of ù1 are diagonalized in ù1 steps.) §2. According to Gödel’s theorem, the negation of CH is not provable. It remained to show that CH is not provable. But this could not be shown with the method of inner models, since precisely the success of this method precluded such an application: By absoluteness, we have L = LW ⊆ W for every inner model W . So V = L says that V is the only inner model. As V = L is consistent and implies CH, we cannot prove the existence of an inner model in which CH is false. To negate CH, we must apparently find an “outer model” that extends V . But how is this possible if V already contains all sets? The solution to this puzzle was given by Cohen’s forcing method. Although forcing is somewhat outside the scope of this paper, I shall try to give a sketch of it (which the reader may skip without loss of continuity).8 Instead of working with sets, one can consider their characteristic functions. 2 This suggests an alternative to the Vα hierarchy: We define a hierarchy S Vα of2 2 2 partial characteristic functions. As before, we set V0 = ∅ and Vë = í<ë Ví 2 for limit ordinals ë. At successor ordinals, set Vα+1 = the set of all S we 2 2 2 f : Vα → 2. Finally, we define V = α Vα . Then V 2 is “essentially” isomorphic to V , which can be made precise as follows. Each f ∈ V 2 is defined on a subset Vα2 of V 2 . Extend f to a total function f ∗ on V 2 by 7 Second-order number theory is the theory of the structure hP (ù), ù, +, ·, ∈i. (Real numbers can be identified with elements of P (ù).) 8 Cohen’s original version looks somewhat different from the version presented here, which is based on a later analysis by Solovay. INNER MODELS AND LARGE CARDINALS 397 setting f ∗ (x) = 0 wherever f(x) is undefined. Define an equality relation I on V 2 by letting fIg iff f ∗ = g ∗ , and define a pseudo-∈ relation E by fEg iff g ∗ (f) = 1. Then I is a congruence relation on hV 2 , Ei, and the quotient structure hV 2 , Ei/I is isomorphic to V . This analysis suggests a strategy for extending V 2 . 2 = {0, 1} is the smallest Boolean algebra. If we carry out the same construction with another B B complete Boolean algebra B in place of 2, we obtain a hierarchy S VαB. (Vα+1 B B is then defined as the set of all f : Vα → B.) We set V = α Vα . Then we have V 2 ⊆ V B because 2 ⊆ B. Somewhat more complicated definitions than before give an “equality relation” I and a “pseudo-∈ relation” E. These are, however, not relations in the usual sense but “B-valued relations,” i.e., binary functions whose values lie in B. In other words, we now assign to each statement of the form x = y or x ∈ y (with x, y ∈ V B ) a truth value in B. Then we can recursively define, for each statement ϕ of the language of set theory, a truth value kϕk ∈ B. 9 (kϕk may lie strictly between 0 and 1.) It turns out that all the axioms of set theory have truth value 1 (including AC if we assume AC in V ) and the property of having truth value 1 is preserved by logical deduction. However, a suitable choice of B makes kCHk = 0. This shows that CH does not follow from the axioms. (The consistency of CH with the axioms could also have been established by the forcing method, for one can choose B so that kCHk = 1.) Forcing also led to the discovery of useful combinatorial principles. The best known of these is Martin’s Axiom (MA). The strength of MA depends on the size of the continuum. MA is a trivial consequence of CH. If CH is false, then MA has far-reaching consequences (but remains consistent relative to the axioms). MA is often a useful complement to the combinatorial principles that hold in L. For example, the independence of various statements can be proved by showing that one direction follows from ♦ and the other from MA and ¬CH. §3. The forcing method proved to be an extraordinarily powerful tool for obtaining independence results in set theory and led to a renaissance of this field. It was an exciting time for the young mathematicians who flocked into this field. Major classical problems were solved with gratifying regularity. In the long run, though, it is somewhat unsatisfying to prove only that statements are unprovable. Increasingly, people asked whether the weakness of the set-theoretic axioms could be alleviated by accepting new axioms. An obvious candidate would be Gödel’s axiom of constructibility, V = L. This 9 For example, weTset k¬ϕk = ¬kϕk =the complement of kϕk in B ; kϕ ∧ øk = kϕk ∩ køk; k∀v ϕ(v)k = x∈V B kϕ(x)k. Once this definition is given, the B -valued relations are uniquely characterized by the requirements that the axioms S of equality logic and the axiom of extensionality have truth value 1 and that kx ∈ yk ⊆ z∈dom(y) kx = zk for x, y ∈ V B . 398 RONALD JENSEN axiom makes a clear statement about the nature of the mathematical universe. It is mathematically fruitful in that it solves many problems and leads to interesting new concepts and theories. It is philosophically attractive for adherents of “Ockham’s razor,” which says that one should avoid superfluous existence assumptions. I personally find it a very attractive axiom. Nevertheless, it has been rejected by the majority of set theorists, beginning with Gödel himself. One could perhaps summarize Gödel’s objection as follows. The universe of sets is too large and varied to be fully grasped by the human spirit. Over time, however, mathematical intuition improves and with it our ability to describe this universe. The steps of this approximation process are marked by the axioms that we recognize as true. The axiom of constructibility presents, modulo the ordinal numbers, a complete description of sets and is thus a limiting principle that would halt the approximation process. Instead, one should look for additional existence axioms, called strong axioms of infinity. A typical axiom of this sort asserts the existence of a large cardinal number. As an example, Gödel suggests the inaccessible cardinals. As the universe satisfies all axioms of set theory, it should be so big that even some initial segment Vκ also satisfies the axioms. If Vκ satisfies the axioms (in the sense of second order logic10 ), then we call κ inaccessible. But if an inaccessible cardinal exists, then this statement also should be true in an initial segment Vô , i.e., there should be two inaccessible cardinals κ and ô. By continuing this argument, we see that the inaccessible cardinals are unbounded in the ordinal numbers. But then also this assertion should hold in an initial segment, i.e., there is an inaccessible limit of inaccessible cardinals, etc. Inaccessible cardinals are, however, not an alternative to the axiom of constructibility, for they are consistent with this axiom.11 For this we need a principle saying that the universe is not only “long” but also “wide.” The first large cardinal axiom proved to imply V 6= L asserts the existence of a measurable cardinal. As the name implies, these cardinals originated from questions in measure theory. κ is measurable if and only if there is a κcomplete, non-trivial, 0,1-valued measure defined on all subsets of κ. (Or, equivalently, there is a κ-complete non-principal ultrafilter on κ.) The settheoretic content of this concept becomes clearer in the following definition. Definition. κ is measurable if and only if there exist M and ð such that (1) M is an inner model. (2) ð : V → M is an elementary embedding.12 (3) κ is the critical point of ð (i.e., ð ↾ κ = id, ð(κ) > κ). 10 This means that Vκ is used as the domain of sets and Vκ+1 as the domain of classes. If κ is inaccessible in V , then also in L. 12 That is, ϕ(x1 , . . . , xn ) is true in V if and only if ϕ(ð(x1 ), . . . , ð(xn )) is true in M . 11 INNER MODELS AND LARGE CARDINALS 399 In the late fifties, Dana Scott showed that the existence of a measurable cardinal implies V 6= L. Later, it was shown that these cardinals have interesting consequences for the real continuum, e.g., that every Σ12 set is Lebesgue measurable and has the Baire property. (Consequently, there can be no Σ12 well-ordering of R.) We can generate a hierarchy of even stronger large cardinal principles by imposing additional conditions on the model M in the definition above. For example, we call κ α-strong iff Vα ⊆ M . We call κ strong if it is α-strong for all α. κ is superstrong iff Vð(κ) ⊆ M . (This axiom is stronger than the previous one in the sense of relative consistency , i.e., the existence of superstrong cardinals implies the consistency of strong cardinals, but not conversely.) These cardinals are the first stages of a hierarchy that has been intensively investigated in the last 20 years. Among the later stages are the compact and huge cardinals. The strongest principle of this sort was proposed by Reinhardt. We call κ a Reinhardt cardinal if M = V in the definition above. At first, Reinhardt cardinals were greeted by many set theorists, including the author of this article, as the embodiment of an intuitively especially satisfactory existence principle. But after a few years Kunen proved that these cardinals do not exist. This set an upper limit for the hierarchy. §4. Strong axioms of infinity say that the universe of sets is very large, and, as we have seen, they have important consequences for the real continuum. But one can also formulate axioms saying very directly that the continuum is large. The most successful principles of this sort are the socalled axioms of determinacy. Let ù ù be the set of all infinite sequences of natural numbers. To every set A ⊆ ù ù we associate a game as follows: Player I chooses a number a0 ∈ ù; Player II then chooses a1 ∈ ù; Player I chooses a2 ; etc. After ù moves, the players have defined a sequence a = hai |i < ùi. Player I wins if a ∈ A; otherwise II wins. We say that A is determined if one of the two players has a winning strategy. A winning strategy for I, for example, is a function f, from finite sequences of natural numbers to natural numbers, such that I always wins if he plays a2i = f(a1 , a3 , . . . , a2i−1 ) for all i. Strategies can be coded by real numbers, so the existence of a strategy for A is an existence postulate about R. The simplest form of the axiom of determinacy (AD) asserts that every A ⊆ ù ù is determined. But AD implies that R cannot be well-ordered, so it cannot be accepted as an axiom. There are, however, various axioms of definable determinacy, saying that a certain class of “nicely definable” A ⊂ ù ù are determined. These seem to be consistent with the axiom of choice (AC) and have been intensively studied in the last twenty years. The axiom of projective determinacy (PD) says that the projective, i.e., Σ1ù , sets 400 RONALD JENSEN are determined. Naturally, Σ1n -AD means that the Σ1n sets are determined.13 An especially attractive axiom is AD in L[R], which says that AD holds in the inner model L[R] that is defined exactly like L except that L0 [R] = R. This axiom is equivalent to the statement that every A ∈ P (ù ù ) ∩ L[R] is determined. It turns out that under this assumption the model L[R] becomes a paradise for analysts: Every set of real numbers is Lebesgue measurable and has the Baire property. AC is false there, but if we assume AC in V then L[R] still satisfies the weaker forms of AC—such as the axiom of dependent choice—that are needed by analysts. It was soon discovered that there is a connection between determinacy and large cardinals. For example, AD implies the existence of an inner model with a measurable cardinal. (In fact, this cardinal is ù1 in V .) In the other direction, Σ11 -AD follows from the existence of a measurable cardinal. The great hope of researchers in both fields was to deduce definable determinacy from the existence of sufficiently large cardinals. Until recently, however, this hope remained unfulfilled, and the three main areas of set theory— inner models, large cardinals, and determinacy—went their separate ways. Large cardinals probably attracted the most attention and were certainly the subject of very deep investigations. Nevertheless, this area remained in a certain sense the least satisfactory: It lacked—until recently—both the deep structure theory and the occasional dramatic shifts of perspective of the other two areas. §5. The axiom of constructibility on the one hand and large cardinals and determinacy on the other embody two radically different conceptions of the universe of sets. How can these conceptions be justified? Most proponents of V = L and similar axioms support their belief with a mild version of Ockham’s razor. L is adequate for all of mathematics; it gives clear answers to deep questions; it leads to interesting mathematics. Why should one assume more? The proponents of strong axioms of infinity usually call themselves Platonists and use this to support their belief. I do not understand, however, why a belief in the objective existence of sets obligates one to seek ever stronger existence postulates. Why isn’t Platonism compatible with the mild form of Ockham’s razor cited above? Something else must be going on. I would like to propose a—necessarily somewhat speculative—hypothesis.14 Could it be that the duality in modern set theory is nothing but a new manifestation of an ancient conflict between two points of view—I almost want to say two emotional states—which have always existed in mathematics? I call them the arithmetical and geometrical points of view. I also call the first 13 1 Σ0Sconsists of the Borel sets. Σ1n+1 consists of projections of complements of Σ1n sets. Σ1ù = n Σ1n . 14 The reader can skip the rest of §5 without loss of continuity. INNER MODELS AND LARGE CARDINALS 401 one the Pythagorean point of view, for Pythagoras expressed it in its purest form: Everything consists of numbers. In other words, every mathematical structure can be interpreted in the natural structure of the positive integers. This idea is naturally very attractive; it gives to all of mathematics the intuitive clarity of the natural numbers. I conjecture that, if it could really be carried out, it would still be the dominant point of view today. In reality, however, the geometric point of view has been dominant since the rise of analysis. Thus I also call it the Newtonian point of view.15 The Newtonian directs his gaze to the real rather than the natural numbers. He is less impressed by their clarity than by their boundless multiplicity. The real numbers constitute a gigantic, unfathomable sea. For every principle that generates real numbers, there must be a number not attainable by that principle. This excludes the possibility of an interpretation of the real numbers within the natural numbers. If one accepts set theory, then there is no doubt that Cantor has refuted Pythagoreanism in the strongest terms by showing that there are more real than natural numbers. But Cantor also introduced the ordinal numbers, which are in every sense the transfinite continuation of the natural numbers. They share much of the intuitive clarity of the natural numbers. Thus Cantor, who refuted the old Pythagoreanism, made possible a new Pythagoreanism in which the ordinal numbers take over the role of the natural numbers. In this sense, Gödel’s axiom of constructibility seems to me to embody an entirely coherent Pythagorean picture of the world. And this picture cannot be refuted, for Gödel showed that V = L is consistent if the other axioms are. But this axiom provides—modulo the ordinal numbers—a complete description of all sets and is therefore unacceptable for a Newtonian. For him, there must be a real number not generated by constructible processes. Since one cannot prove in set theory that such a number exists, one must seek new axioms. Thus, the ancient conflict is fought in a new arena. Whether or not one accepts this analysis, there is no doubt that the quest for new axioms of infinity has led to very interesting mathematics. In addition, the proponents of these axioms have, through hard technical work, assembled a series of impressive plausibility arguments.16 Nevertheless, I doubt that one could, with the sort of evidence that we have, convert the mathematical world to one or the other point of view.17 Deeply rooted differences in mathematical taste are too strong and would persist.18 15 Of course one could also call it Leibnizian. But perhaps Newton thought more geometrically. 16 See §7, §8. 17 The author confesses to being emotionally a Pythagorean. 18 In the course of time I have had numerous opportunities to discuss the alternatives in set theory with mathematicians from other disciplines. I am always surprised how rapidly and 402 RONALD JENSEN §6. Strong axioms of infinity should prevent the universe from being “constructed from below” like L. But do they really prevent this? It turns out that we can have a measurable cardinal in a very L-like model. The measurability of κ means that there is a normal ultrafilter U in P (κ).19 Let the hierarchy LUαS be defined like Lα except that LUα+1 = Def(hLUα , ε, U ∩LUα i), U and set L = α LUα . Then LU is an inner model. In addition, κ is measurable in LU , for U ∩ LU ∈ LU appears in LU to be a normal ultrafilter. Silver and others have shown that LU is thoroughly similar to L. It satisfies GCH. It has the combinatorial properties of L. (In fact, until now, no purely combinatorial difference between L and LU has been discovered.) It has a well-ordering of the reals definable in second-order arithmetic (in this case Σ13 instead of Σ12 ). Even the absoluteness of L has an analog: LU has a definition (using κ as a parameter) that is absolute in every inner model M with LU ⊆ M . But our definition of LU does not by any means show that this model can be “built from below by well-understood operations,” for the object U is explicitly used in the construction. That a construction from below is nevertheless possible to a certain extent was proved only later in the so-called core model theory. This theory is somewhat complicated, but I shall try to give a rough sketch of it. Let us assume not only that V 6= L but also that there is a non-trivial elementary embedding ð : L → L.20 It turns out that this assumption is equivalent to the existence of a certain set, denoted 0♯ . In a very concrete way—which unfortunately cannot be described here—0♯ encodes a complete description of the structure of L together with a canonical elementary embedding ð : L → L. The definition of 0♯ is absolute in every inner model that contains 0♯ . Although it unfortunately cannot be justified here, there is no doubt that 0♯ is a “well-understood” mathematical object. If 0♯ does not exist, then the global structure of V does not deviate too much from that of L. This is the main content of the so-called “covering theorem”: If 0♯ does not exist then every uncountable set X of ordinal numbers has a “covering set” Y ⊇ X with Y ∈ L and Card(Y ) = Card(X ). (From this follows, for example, that every singular cardinal number has the same cardinal successor in L as in V .) But if 0♯ exists, then the structure of L-cardinals is “wiped out” in V . (For example, every uncountable cardinal looks inaccessible in L.) So 0♯ may be regarded as the “next larger construction step” after L. Then we form the inner model L[0♯ ], the constructible closure of with what certainty they express an opinion. Their opinions are divided and reflect, in my view, a pre-existing orientation. 19 U is normal if for every regressive function f : X → κ (where “regressive” means f(î) < î for all î ∈ X ) with X ∈ U there is an ç < κ with {î | f(î) = ç} ∈ U . Normality implies κ-completeness. 20 Non-trivial means ð 6= id. INNER MODELS AND LARGE CARDINALS 403 0♯ .21 Now let us assume that L[0♯ ] is non-trivially elementarily embeddable in itself by ð : L[0♯ ] → L[0♯ ]. This leads to a set 0♯♯ encoding L[0♯ ] and a canonical embedding ð. Then we form L[0♯♯] and so on. After ù steps, we have a sequence 0♯ , 0♯♯ , . . . , 0(n) , . . . . We let 0(ù) encode this sequence and form L[0(ù) ]. An embedding ð : L[0(ù) ] → L[0(ù) ] yields 0(ù+1) , and so on. When we have 0(α) for all ordinal numbers α, we form the constructible closure L♯ of all the 0(α) .22 A non-trivial elementary embedding then yields a set 0(∞) , and so forth.23 We arrive at a sequence: 0♯ , 0♯♯ , . . . , 0(α) , . . . , 0(∞) , 0(∞+1) , . . . . The terms in this sequence are called mice. With some effort, one can define the class of all mice along with its natural ordering. (This ordering can be much longer than ∞.) But then we can build the constructible closure of all mice, which we denote by K.24 K is called the core model. K is an inner model. The definition is absolute in the sense that KM = K for every inner model M ⊇ K. K is L-like in the same sense as LU . What happens if there is a non-trivial elementary embedding ð : K → K? A new mouse cannot result, as all mice are already present in K. What we get is an inner model LU , where U ∈ LU witnesses the measurability in LU of some ordinal κ. We have P (κ) ∩ K = P (κ) ∩ LU . U can be obtained in a fairly direct way from the embedding ð. So we have, after all, obtained LU “from below” via the long march through the mice. The model LU is uniquely determined by the number κ. Let us now choose κ as small as possible. What happens if there is a non-trivial elementary embedding ð : LU → LU ? We obtain a set 0† coding LU and a canonical embedding ð. So, after all, 0† is “the next mouse.” It differs from the previous mice in that, to obtain it, we needed an intermediate step marked not with a mouse but with an inner model LU . This suggests the possibility of a continuing process, producing inner models for stronger and stronger axioms of infinity. How far can we go? Could it be that the directions mentioned in §5, Newtonianism and Pythagoreanism, are compatible after all, with Newtonianism indicating possibilities that then turn out to be realizable in a Pythagorean universe? Progress so far has been relatively modest. The next major step after a measurable cardinal is a strong cardinal. A few years ago, Dodd and the author set themselves the goal of defining a core model for a strong cardinal. Mitchell worked in the same direction but with a more modest goal. The technical difficulties turned out to be very great. Dodd and I barely got past the fundamental fine structure theory. Mitchell got closer 21 ♯ 0 can be regarded as a set of ordinal numbers (even as a subset of ù). Then L[0♯ ] is defined from 0♯ as LU was defined from U . S 22 ♯ L = α L[0(α) ]. (Remark: 0(í) ∈ L[0(α) ] for í ≤ α.) 23 ∞ is the class of all ordinal numbers and is also regarded as the “largest ordinal number.” 24 K is the union of all L[m] such that m is a mouse. 404 RONALD JENSEN to his goal. Recently, Mitchell—building on a suggestion of Baldwin— proposed a new definition of mice, which made immense simplifications possible. The theory is still very long, and not all the details have been written down yet. Nevertheless, I believe I can claim that we now have an extended core model K in which a strong cardinal can exist. How big K is depends, naturally, on what there is. If there is no inner model with a measurable cardinal, then the new K coincides with the old one. If there is such an inner model but 0† does not exist, then K = LU , etc. We again have KM = K if M is an inner model with K ⊆ M . K is again thoroughly L-like. The existence of a strong cardinal in V ensures the existence of one in K. If there is no elementary embedding ð : K → K, then again much of the global structure of K is preserved in V .25 But if ð : K → K exists, then we obtain the next mouse. So we can go further. But it turned out that one cannot go much further without seriously modifying the program. §7. A few years ago, Hugh Woodin, in the wake of a deep result of Foreman, Magidor, and Shelah, proved the following theorem: If there is a superstrong cardinal, then there is no well-ordering of R in L[R]. In particular, R has no well-ordering definable in second-order arithmetic. This result was surprising, because the concept of a superstrong cardinal had previously been regarded as a rather modest extension of the concept of measurability. The theorem set off an avalanche of results that have fundamentally altered the landscape of set theory. It also set a sharp limit for the conventional core model theory, because the methods that we had been using for the construction of these models always yielded a Σ13 well-ordering of R. Even more serious for this theory was another consequence of Woodin’s proof: In a possible modified core model theory, one would have to give up the absoluteness theorem; if K contains a superstrong cardinal, then one cannot exclude the possibility that KM 6= K for some inner model M ⊇ K.26 A rapid sequence of discoveries followed. Martin and Steel produced the long-desired proof that determinacy axioms follow from strong axioms of infinity. A typical result—due to Martin, Steel, and Woodin—is that the existence of a superstrong cardinal implies AD in L[R]. An even weaker cardinal—the so-called Woodin cardinal—played a key role in the new developments. Call a cardinal ô strong with respect to a class A if for each 25 But this can no longer be expressed by so strong and simple a statement as the covering theorem. 26 Using forcing, Woodin showed that, given an inner model M with a superstrong cardinal, one can consistently assume the existence of a strictly larger model M ′ and an elementary embedding ð : M → M ′ . Now let M = K. Then V = K holds in M and hence also in M ′ . Absoluteness would imply M ′ = K = M , a contradiction. INNER MODELS AND LARGE CARDINALS 405 â there exist an inner model M , a class B, and an elementary embedding ð : hV, Ai → hM, Bi such that ô is the critical point, Vâ ⊆ M , and B ∩ Vâ = A ∩ Vâ . If κ is inaccessible, A ⊆ Vκ , and ô < κ, then we can relativize this concept to Vκ . We call κ a Woodin cardinal if for each A ⊆ Vκ there exists a ô < κ that is strong with respect to A in Vκ . Martin and Steel showed that Σ1n+1 -determinacy follows from the existence of n Woodin cardinals and a measurable cardinal above them. Then Woodin showed that the existence of ù Woodin cardinals and a measurable cardinal above them implies AD in L[R]. Woodin also has interesting equiconsistency results: Σ12 -determinacy is equiconsistent with the existence of a Woodin cardinal. AD in L[R] is equiconsistent with the existence of ù Woodin cardinals. §8. The two areas, large cardinals and determinacy, had come together. What about inner models? At first glance, the core model program seemed to have reached a dead end. For two essential attributes of this theory would have to be sacrificed in any extension: a nicely definable well-ordering of R and the absoluteness theorem. The second sacrifice seemed especially serious to me. K is defined as the union of its construction steps, where a construction step (e.g., a mouse) should be a “well understood” object. For me, this requirement always meant that one can recognize in an arbitrary inner model M whether a given element of M is a construction step. That implies KM = K if K ⊆ M . But this conclusion is false if a superstrong (or just a Woodin) cardinal occurs in K. Fortunately, Martin, Mitchell, and Steel did not let these considerations trouble them but set out to find a concept of mouse suitable for the continuation of the theory. The first step in this extended core model program was to realize a Woodin cardinal in a core model. Martin and Steel first defined a “model of type LU .” Assuming a Woodin cardinal κ, they defined an inner model W . The construction of W is not really “from below” but uses a set whose existence follows from the assumed property of κ. They showed that, in W , κ remains a Woodin cardinal, CH holds, and R has a Σ13 wellordering. (So Σ12 -determinacy is false in W .) But the internal structure of this model remained quite mysterious; for example we do not know whether GCH holds in it. Mitchell and Steel then created a revised W using the fine structure techniques of core model theory. Their W satisfies GCH and is quite generally L-like. Finally, Steel managed to construct a genuine core model K. But to carry out his construction, he needs third order set theory and a large cardinal property (namely measurability) for ∞ (the class of all ordinals).27 When one has this, one finds a first order definition of K that 27 Steel thinks that second order set theory will suffice and that the measurability assumption can be weakened. 406 RONALD JENSEN is absolute in K. At the moment, only a rather weak analog of the covering theorem has been proved.28 The mice studied in this extended theory are still “well understood” objects, but the definition of mice is of such high logical complexity that the absoluteness requirement is no longer satisfied.29 The possibility that there are models M with K ⊆ M and KM 6= K turned out to be less troublesome than one had expected. The increasing complexity of the concept of mouse also explains why we lose the nicely definable well-orderings of R. The existence of such a well-ordering depends on the fact that every mouse that constructs a new real number is itself a countable object and so can be coded by a real number. The earlier mice were so simple that one could define the set of all such codes—together with their natural well-ordering—in second order arithmetic. With increasing complexity, this is no longer possible. Will it be possible to realize all reasonable large cardinals in “well understood” inner models? This goal is important not only for the reasons indicated in §6 but also because of a consideration connected with Gödel’s justification for strong axioms of infinity. A common plausibility argument for these axioms is that they are—as far as we can see—linearly ordered by the relation of relative consistency.30 Historically, the axioms of infinity have many different mathematical sources, and there is no prima facie reason to suppose that they are linearly ordered. The existence of such an order—insofar as we can confirm it—is surely a strong argument for regarding these axioms as successive approximations to a “final” universe. The axioms in the sequence mentioned earlier—measurable, strong, superstrong, etc.—arise from weaker and stronger variations of a single theme. So it is not surprising that they are linearly ordered. We can regard this sequence as a yardstick with which to compare other strong axioms of infinity. (It turns out that such comparison is sensible even for some assertions that seem to have nothing to do with large cardinals, for example the existence of a saturated ideal on ù1 .) Very many open questions remain. Perhaps the best known is whether compact and supercompact cardinals are equiconsistent. If we go “down” a bit and consider principles of infinity that are weaker than measurability yet strong enough to imply V 6= L, then the so-called Erdős cardinals form an analogous yardstick. Here the core model is available, and it frequently allows us to compare other principles with this yardstick. 28 By this I mean theorems saying that the global structure of K is preserved in V if K is rigid with respect to elementary embeddings. 29 That the absoluteness condition held previously is due to the fact that the concept of mouse was ∆1 in the parameter ù1 . 30 An axiom A is consistent relative to B in the axiom system of set theory (ZFC) if it is provable in first order arithmetic that the consistency of ZFC+A follows from the consistency of ZFC+B. Modulo equiconsistency, this relation is a partial order. INNER MODELS AND LARGE CARDINALS 407 (Typical examples are the Chang conjecture31 and the existence of Jónsson cardinals.32 ) The only strategy we have for attaining comparable results for larger cardinals is a corresponding generalization of the inner model theory. I would like to close with a general comment on the direction of research in set theory. In recent years I have occasionally heard other mathematicians complain that set theory has turned inward, i.e., that set theorists are occupied too much with their own structural questions and not enough with applications. This reproach does not by any means apply to all set theorists, but this paper does little to refute it. I hope, however, to have made it plausible that a subject should from time to time turn inward. The deep inner problems have become steadily clearer in recent years, and the tools needed to overcome them have been developed. There has been enormous progress recently. If set theory now finds itself in a rather introspective phase, I am convinced that it will emerge from this phase stronger than ever and of greater use for other areas of mathematics. REFERENCES [1] K. Devlin, Constructability, Springer-Verlag, 1984. [2] F. R. Drake, Set theory: An introduction to large cardinals, North-Holland, 1974. [3] R. Jensen, Measures of order 0, unpublished manuscript. [4] K. Kunen, Set theory: An introduction to independence proofs, North-Holland, 1980. [5] D. A. Martin and J. R. Steel, A proof of projective determinacy, Journal of the American Mathematical Society, vol. 2 (1989), pp. 71–125. , Iteration trees, Journal of the American Mathematical Society, vol. 7 (1994), [6] pp. 1–73. [7] W. Mitchell and J. R. Steel, Fine structure and iteration trees, Lecture notes in logic 3, Springer-Verlag, 1994. [8] J. R. Steel, The core model iterability problem, unpublished manuscript. [9] J. R. Steel, Inner models with many Woodin cardinals, Annals of Pure and Applied Logic, vol. 65 (1993), pp. 185–209. ALL SOULS COLLEGE OXFORD OX1 4AL GREAT BRITAIN 31 The Chang conjecture is the following model-theoretic assertion: If hA, B, . . . i is a model for a countable language with Card(A) = ù2 and Card(B) = ù1 , then there is an elementary ¯ B̄, . . . i with Card(A) ¯ = ù1 and Card(B̄) = ù. submodel hA, 32 κ is a Jónsson cardinal if every algebra A = h|A|, f1 , f2 , . . . , fn i of cardinality κ has a proper subalgebra of the same cardinality.

Log In

Inner Models and Large Cardinals

Related papers

Related papers

Related topics