arXiv:0903.2211v2 [quant-ph] 17 Aug 2009
Many-Worlds
and Schrödinger’s First Quantum Theory
Valia Allori∗, Sheldon Goldstein†,
Roderich Tumulka‡, and Nino Zanghı̀§
August 5, 2009
Abstract
Schrödinger’s first proposal for the interpretation of quantum mechanics was
based on a postulate relating the wave function on configuration space to charge
density in physical space. Schrödinger apparently later thought that his proposal
was empirically wrong. We argue here that this is not the case, at least for a
very similar proposal with charge density replaced by mass density. We argue
that when analyzed carefully this theory is seen to be an empirically adequate
many-worlds theory and not an empirically inadequate theory describing a single
world. Moreover, this formulation—Schrödinger’s first quantum theory—can be
regarded as a formulation of the many-worlds view of quantum mechanics that is
ontologically clearer than Everett’s.
PACS: 03.65.Ta. Key words: Everett’s many-worlds view of quantum theory;
quantum theory without observers; primitive ontology; Bohmian mechanics; quantum nonlocality in the many-worlds view; nature of probability in the many-worlds
view; typicality.
∗
Department of Philosophy, Northern Illinois University, Zulauf Hall 920, DeKalb, IL 60115, USA.
E-mail:
[email protected]
†
Departments of Mathematics, Physics and Philosophy, Hill Center, Rutgers, The State University of New Jersey, 110 Frelinghuysen Road, Piscataway, NJ 08854-8019, USA. E-mail:
[email protected]
‡
Department of Mathematics, Rutgers University, Hill Center, 110 Frelinghuysen Road, Piscataway,
NJ 08854-8019, USA. E-mail:
[email protected]
§
Dipartimento di Fisica dell’Università di Genova and INFN sezione di Genova, Via Dodecaneso 33,
16146 Genova, Italy. E-mail:
[email protected]
1
1
Monstrosity
The ‘many world interpretation’ . . . may have something distinctive to say
in connection with the ‘Einstein Podolsky Rosen puzzle’, and it would be
worthwhile, I think, to formulate some precise version of it to see if this is
really so.
John S. Bell [9]
The many-worlds view of quantum mechanics is popular, but also controversial; it is very
radical and eccentric, but also inspiring. It is an incarnation of the desire to abolish the
vague division of the world, introduced by the Copenhagen interpretation, into system
and observer, or quantum and classical, and to obtain a fully precise formulation of
quantum mechanics, in which the axioms do not concern observers and observation but
reality. In the words of Hugh Everett [23], the inventor of the many-worlds view:
The Copenhagen Interpretation is hopelessly incomplete because of its a
priori reliance on classical physics . . . as well as a philosophic monstrosity
with a “reality” concept for the macroscopic world and denial of the same
for the microcosm.
We report here on some considerations on the many-worlds view of quantum mechanics inspired by Erwin Schrödinger’s [36] original interpretation of the wave function
ψ on configuration space as generating a continuous distribution of matter (or charge)
spread out in physical space. As we shall explain, Schrödinger’s original version of quantum mechanics may be regarded as a version of many-worlds—though some adherents of
many-worlds will presumably not regard it as such—that we think is worth considering.
It is a version that, in our opinion, qualifies as a “precise version of” many-worlds such
as Bell called for in the passage quoted above.
2
Duality
Let us describe Schrödinger’s first quantum theory in our own words. Think first about
classical mechanics. Matter consists of particles, moving along trajectories defined by
the equations of the theory. Alternatively, a classical theory could claim that instead of
consisting of particles, matter is continuously distributed in 3-space and mathematically
described by a function m(x, t), where x runs through physical 3-space, providing the
spatial density of matter at time t. We call this ontology the matter density ontology.
Such a theory would involve classical equations governing the m function. In the m
function we can find the macroscopic objects of our experience, such as tables and chairs,
by noting that at a certain time there is a region of space, with the shape of a table or
chair, in which the matter density is significantly higher than in the surroundings. In
such a theory, it would be wrong to say that matter consists of a large number (such as
1023 ) of particles, since there are no particles in the ontology, just a continuum of stuff.
Now combine the matter density ontology with non-classical equations. Specifically,
suppose that matter is continuously distributed with density m(x, t), but now suppose
2
that the m function is given by the following equation:
Z
N
X
2
m(x, t) =
mi d3 x1 · · · d3 xN δ 3 (x − xi ) ψt (x1 , . . . , xN ) .
(1)
i=1
Here, ψt is a wave function as in quantum mechanics, a function on R3N evolving
according to the usual Schrödinger equation
N
i~
X ~2
∂ψ
=−
∇2i ψ + V ψ ,
∂t
2m
i
i=1
(2)
and mi denotes the mass of particle i, i = 1, . . . , N.
The m function (1) is basically the natural density function in 3-space that one
can obtain from the |ψ|2 distribution in configuration space. The formula means that,
starting from |ψ|2 , one integrates out the positions of N − 1 particles to obtain a density
in 3-space. Since the number i of the particle that was not integrated out is arbitrary,
it gets averaged over. The weights mi are the masses associated with the variables xi ,
which may seem the most natural choice for defining the density of matter.
This provides, in fact, already the complete specification of a physical theory. In the
terminology of [3], this theory is called “Sm” (S for the Schrödinger equation and m for
the m function). It is closely related to—if not precisely the same as—the version of
quantum mechanics first proposed by Schrödinger [36]. After all, Schrödinger originally
regarded his theory as describing a continuous distribution of matter (or charge) spread
out in physical space in accord with the wave function on configuration space [37, p. 120]:
We had calculated the density of electricity at an arbitrary point in space
as follows. We selected one particle, kept the trio of co-ordinates that describes its position in ordinary mechanics fixed; integrated ψψ over all the
rest of the co-ordinates of the system and multiplied the result by a certain
constant, the “charge” of the selected particle; we did a similar thing for
each particle (trio of co-ordinates), in each case giving the selected particle
the same position, namely, the position of the point of space at which we
desired to know the electric density. The latter is equal to the algebraic sum
of the partial results.
This is just a verbal description of the formula (1), except with charges instead of
masses.1 Schrödinger soon rejected this theory because he thought that it rather clearly
1
If we replace the masses mi in (1) with the charges ei , as Schrödinger did, then the following
problem arises that is absent when using masses. If the wave function of a macroscopic body (say, a
piece of wood) is such that the Heisenberg position uncertainties of the atomic nuclei are of the order
of an Angstrom, i.e., of the order of the size of an atom, then the positive charge of a nucleus may
be smeared out over the same volume as the negative charge of the electrons, so that they may cancel
each other, leaving only a negligible remainder in the m function. In this case, the macroscopic body
would hardly be recognizable in the m function, and such an m function would not provide a plausible
image of our world. This problem notwithstanding, replacing the masses mi in (1) with the charges ei ,
or with the constant value 1, leads to theories which are empirically equivalent to Sm and similar to
Sm in all relevant respects. In particular, our conclusions about nonlocality (Section 5) and probability
(Section 7) for Sm apply equally to these theories.
3
conflicted with experiment. After all, the spreading of the matter density arising from
equation (2) would appear to contradict the familiar localized detection events for quantum particles, such as in the two-slit experiment. Moreover, given that there are no
particles in Sm, but instead matter is really continuous, one might think at first that
Sm is empirically refuted by the evidence for the existence of atoms. Yet, Schrödinger’s
rejection was perhaps a bit hasty, as we will see. Be that as it may, Schrödinger did in
fact create the first many-worlds theory, though he probably was not aware that he had
done so.
It is easy to see that Sm has a certain many-worlds character, since if ψ is the wave
function of Schrödinger’s cat then there will be two contributions to the m function, one
resembling a dead cat and the other a live cat. We will say more about this in Section 3.
For now note the duality: there exist two things, the wave function ψ and the matter
density function m. The latter represents the “primitive ontology” (PO) of the theory
[3], the elements of the theoretical picture that correspond to matter in 3-dimensional
space; the wave function tells the matter how to move. The notion of PO is closely
connected with what Bell called the “local beables”:
[I]n the words of Bohr, ‘it is decisive to recognize that, however far the
phenomena transcend the scope of classical physical explanation, the account
of all evidence must be expressed in classical terms’. It is the ambition of
the theory of local beables to bring these ‘classical terms’ into the equations,
and not relegate them entirely to the surrounding talk.
[7]
We note that the matter density m(x, t) (1), defined as it is on physical space, is given
by local beables, while the wave function ψ = ψ(x1 , . . . , xN ), defined on configuration
space, is not.
To introduce a PO for a theory means to be explicit about what space-time entities
the theory is fundamentally about. There are various possibilities for what type of
mathematical objects could represent the elements of the PO, including particle world
lines as in classical or Bohmian mechanics, world sheets as maybe suggested by string
theory, world points as in the GRW theory with the flash ontology [10, 39, 3], or, instead
of subsets of space-time, functions on space-time representing a field or a continuous
density of matter, as in the matter-density ontology of Sm (and GRWm [3]). The wave
function also belongs to the ontology of Sm, but not to the PO: physical objects in Sm
are made of m, not of ψ. Rather the role of ψ in this theory lies in the relation defined
by (1) between ψ and m. (That m is primitive and ψ is not should not be taken to
imply that, contrary to (1), ψ should be defined in terms of m.)
∗∗∗
Let us compare Sm to Bohmian mechanics [13, 6, 26]. The latter is a theory of
particles with trajectories Qi (t) ∈ R3 , guided by a wave function ψ. As in classical
mechanics, particles are points moving around in space, but the equation of motion is
highly non-classical. In this theory there is a wave–particle duality in the literal sense:
there is a wave, and there are particles. For understanding Bohmian mechanics it is
4
important to think of these two parts of reality, ψ and the Qi , in a particular way:
When one says, for example, that the pointer of an apparatus points to the value α
then one means that the particles of which the pointer consists are at the appropriate
positions corresponding to α, but one does not mean that the wave function lies in the
subspace of Hilbert space that can be associated with the description that the pointer
is pointing to α. To put this succinctly, one could say that the matter in Bohmian
mechanics consists of the particles, not of the wave function. The role of the wave
function, in contrast, is to tell the particles how to move. Indeed, the wave function ψ
occurs in the equation of motion for the particles,
dQi
~
ψ ∗ ∇i ψ
=
Im ∗ (Q1 (t), . . . , QN (t)) .
dt
mi
ψ ψ
(3)
Here, the wave function ψ = ψt evolves according to the Schrödinger equation (2). It
is consistent with these two equations that the configuration Q(t) = (Q1 (t), . . . , QN (t))
has probability distribution |ψt |2 at every time t.
Bohmian mechanics thus has the duality in common with Sm: In both theories,
there are mathematical variables specifying the distribution of matter in 3-dimensional
space—and not in 3N-dimensional configuration space. Bohmian mechanics specifies
this distribution by means of the actual configuration Q(t) = (Q1 (t), . . . , QN (t)), and
Sm of course by the m(·, t) function. A difference between Bohmian mechanics and Sm
is that the m function is a function of the quantum state ψ, whereas Q is not. Indeed,
in the initial value problem of Bohmian mechanics, we have to choose an initial value
for Q in addition to the initial value of ψ.
∗∗∗
The many-worlds view is often presented as asserting that there exists only the wave
function, which evolves unitarily, and nothing else. Let us call this view S0, according
to a notation pattern that indicates first how the wave function evolves (Schrödinger
equation) and then what the PO is (nothing). We believe it is useful to clearly distinguish
between S0 and Sm. Doing so affords a clear separation of the main issues for a manyworlds theory: the issue of whether a theory, in order to make clear sense as a physical
theory, needs to posit a PO in space and time from the issues of whether the existence of
parallel worlds is scientifically plausible, of whether the Bell inequality can be violated
by a local theory, and of whether such a theory can give rise to the appearance of
randomness.
∗∗∗
We have defined Sm using the Schrödinger picture, but it can be formulated as well
in the Heisenberg picture. To this end let
M(x) =
N
X
mi δ 3 (x − Q̂i )
i=1
5
(4)
be the mass density operator at x ∈ R3 , with Q̂i the triple of position operators associated with the i-th particle. Then (1) can be rewritten as
m(x, t) = hψt |M(x)|ψt i ,
(5)
and this expression can be transferred to the Heisenberg picture in the usual way by
setting
M(x, t) = exp(iHt/~) M(x) exp(−iHt/~) ,
(6)
so that
m(x, t) = hψ|M(x, t)|ψi .
(7)
However, it will be convenient for us to continue using the Schrödinger picture.
3
Parallelity
In Sm, apparatus pointers never point in a specific direction (except when a certain
direction in orthodox quantum theory would have probability more or less one), but
rather all directions are, so to speak, realized at once. As a consequence, it would seem
that its predictions do not agree with those of the quantum formalism. Still, it can be
argued that Sm does not predict any observable deviation from the quantum formalism:
there is, arguably, no conceivable experiment that could help us decide whether our
world is governed by Sm on the one hand or by the quantum formalism on the other.
Let us explain.
Whenever the wave function (as a function on configuration space!) consists of disjoint packets ψ1 , . . . , ψL ,
L
X
ψ=
ψℓ ,
(8)
ℓ=1
it follows that
m(x) =
L
X
mℓ (x) ,
(9)
ℓ=1
where mℓ (x) is defined in terms of ψℓ in the same way as m(x) in terms of ψ by (1).
Suppose further, as we shall henceforth do, that in particular the ψℓ represent macroscopically different states, as with Schrödinger’s cat. Then it is plausible that also in
the future the ψℓ will remain (approximately) disjoint (until Poincaré recurrence times),
so that
L
X
mℓ (x, t) ,
(10)
m(x, t) =
ℓ=1
with mℓ (·, t) defined in terms of ψℓ,t (the time-evolved ψℓ ), also for t in the future. Moreover, as long as ψℓ does not itself become a superposition of macroscopically different
states, mℓ behaves as expected of the macro-state of ψℓ and provides a reasonable and
recognizable story.
6
For example, for Schrödinger’s cat we have that ψ = ψ1 + ψ2 with ψ1 the wave
function of a live cat and ψ2 the wave function of a dead cat, and m1 (x, t) behaves like
the mass density of a live cat (up to an overall factor), while m2 (x, t) behaves like that
of a dead cat. Note that, by the linearity of the Schrödinger evolution, the live cat and
the dead cat, that is m1 and m2 , do not interact with each other, as they correspond to
ψ1 and ψ2 , which would in the usual quantum theory be regarded as alternative states
of the cat. The two cats are, so to speak, reciprocally transparent.
More generally, consider an (evolving) decomposition (8) associated with an orthogonal decomposition H = ⊕ℓ Hℓ of the Hilbert space H into subspaces Hℓ corresponding
to different macrostates [42]. Then the components of the corresponding decomposition
(9) should form independent families of correlated matter density associated with the
terms of the superposition, with no interaction between the families. The families can
indeed be regarded as comprising many parallel worlds, superimposed on a single spacetime. Metaphorically speaking, the universe according to Sm resembles the situation of
a TV set that is not correctly tuned, so that one always sees a mixture of several channels. In principle, one might watch several movies at the same time in this way, with
each movie conveying its own story composed of temporally and spatially correlated
events. Thus, in Sm reality is very different from what we usually believe it to be like.
It is populated with ghosts we do not perceive, or rather, with what are like ghosts from
our perspective, because the ghosts are as real as we are, and from their perspective
we are the ghosts. Put differently, within the one universe consisting of matter with
distribution m(·, t) in one space-time, there exist parallel worlds, many of which include
separate, somehow different copies of the same person.
So the “many worlds” here are the many contributions mℓ , and L is the number of
the different worlds. It is important to realize that the concept of a “world” does not
enter in the definition of the theory, which consists merely of the postulate that m(x, t)
means the density of matter together with the laws (2) and (1) for ψ and m. Instead,
the concept of a “world” is just a practical matter, relevant to comparing the m function
provided by the theory to our observations, that may well remain a bit vague. There is
no need for a precise definition of “world,” just as we can get along without a precise
definition of “table.”
While Sm has much in common with Everett’s many-worlds formulation of quantum
mechanics [22], there are some differences. In Sm, the “worlds” are explicitly realized
in the same space-time. Moreover, Sm has a clear PO upon which the existence and
behavior of the macroscopic counterparts of our experience can be grounded. Thus the
“preferred basis problem” does not arise for Sm. Everett’s view is essentially S0, as
his worlds are thought of as corresponding directly to the various parts ψℓ of the wave
function, with no intervening matter densities mℓ .
Since in Sm the wave function evolves according to the Schrödinger equation, it
never collapses. Let us make explicit that this is not in conflict with the collapse rule
of the quantum formalism (i.e., the algorithm for computing the statistics of outcomes
of quantum experiments) because the formalism talks about wave functions of quantum
objects whereas the ψ in the defining equations (2) and (1) is really a wave function
7
of the universe. In the quantum formalism, it seems meaningless to talk about a wave
function of the universe since the wave function of a system is only used for statistical
predictions of what an observer outside the system will see. In Sm, in contrast, the
wave function of the universe is not meaningless at all, as it governs the behavior of the
matter.
As a consequence of the relation between the mℓ and the ψℓ , each world mℓ looks
macroscopically like what most physicists would expect a world with wave function ψℓ
to look like macroscopically. This fact makes clear not only that tables and chairs can
be found in mℓ but also that the possible outcomes of experiments are the same as
in quantum mechanics; for example, particle detectors can only have integer numbers
of clicks. In particular, the empirical evidence for the granular structure of matter
(e.g., the existence of atoms, or the fact that electrons can be counted) is not in logical
contradiction with the continuous nature of matter as postulated in Sm.
Readers may worry that the following problem arises in Sm. Since with every nondeterministic “quantum measurement,” each world splits into several, the number of
worlds should increase exponentially with time. After adding very many contributions
mℓ , we may expect that m looks like random noise, or like mush. The worry is that
the separate stories corresponding to the mℓ then cannot be extracted any more from
an analysis of m. However, when we consider, not just the m function associated
with the present time, but also that in the past and in the future, then the reasonable
possibilities of splitting m into causally disconnected, branching, recognizable worlds mℓ
are presumably very limited, and should more or less correspond to a splitting (8) of the
wave function based on an orthogonal decomposition of H into macrostates.2 Thus,
while it would be a problem for Sm if m(x, t) were constant as a function of x and t, no
problem need arise if m(x, t) is highly intricate.
∗∗∗
We wish to address a question that is often raised against the many-worlds view: If
a conscious observer is in a superposition of very different brain states (say, having read
the figure “1” and having read the figure “2”), what is her or his conscious experience
like? Sm entails that there are two persons, i.e., two contributions to the m field, one
behaving like a person who has read “1” and the other like a person who has read “2”.
So far so good, but that is only a statement about the behavior of matter, and does
not strictly imply anything about the conscious experience. Since we cannot solve the
mind–body problem, or get to the bottom of the nature of consciousness, we invoke
an hypothesis of the kind that has always been implicitly used in physics, in particular
in classical physics: the assumption of a suitable psycho-physical parallelism implying
that a person has a conscious experience of the figure “1” whenever the person, more
precisely the person’s matter, is configured appropriately.
2
Appeal to causal disconnection and branching in the extraction of “worlds” from the quantum
state and its image on physical space has been discussed in the contemporary literature on the Everett
interpretation; see, e.g., [34, 43].
8
4
Reality
In Sm, the right way to understand the theory is to regard the m function as the basic
reality, and not ψ. The way Sm connects with the world of our experiences is analogous
to the way that Bohmian mechanics does. There, the connection is made through
the particles, not through the wave function. Insofar as a universe governed by Sm is
concerned, the essential nature of the wave function is defined by its evolution and its
relation to the m function.
Sm, but not S0, requires that the causally disconnected entities which constitute
worlds are part of or are realized in some precisely-defined, locally specifiable, spatiotemporal entity of a relatively familiar kind (the PO). And on this we disagree with
contemporary advocates of the Everett interpretation such as Simon Saunders, Hilary
Greaves, Max Tegmark, David Deutsch and David Wallace: we require, and they do
not, that worlds be instantiated in such a way. And this corresponds in turn to a
disagreement about whether anything like a PO is required in a physical theory.
We feel the need for a PO because we do not see how the existence and behavior
of tables and chairs and the like could be accounted for without positing a primitive
ontology—a description of matter in space and time. The aim of a fundamental physical
theory is, we believe, to describe the world around us, and in so doing to explain our
experiences to the extent of providing an account of their macroscopic counterparts, an
account of the behavior of objects in 3-space. Thus it seems that for a fundamental
physical theory to be satisfactory, it must involve, and fundamentally be about, “local
beables,” and not just a beable such as the wave function, which is non-local. In contrast,
if a law is, like Schrödinger’s equation, about an abstract mathematical object, like the
wave function ψ, living in an abstract space, like a Hilbert space, it seems necessary
that the law be supplemented with further rules or axioms in order to make contact
with a description in 3-space. For example, formulations of classical mechanics utilizing
configuration space R3N or phase space R6N (such as Euler–Lagrange’s or Hamilton’s)
are connected to a PO in 3-space (particles with trajectories) by the definitions of
configuration space and phase space.
This, at least, is how the matter seems to us. But to a proponent of S0 the existence
of many worlds is a direct consequence of the Schrödinger equation, and the very same
many worlds exist, for example, in a Bohmian universe, since Bohmian mechanics uses
the same wave function. Not so, however, for a proponent of Sm. In Sm the manyworlds character arises from the choice of primitive ontology and the law governing it.
A different choice, such as Bohm’s law (3) for a particle ontology, would retain the
single-world character.
∗∗∗
We are, in fact, not the first to ask about a PO in space and time for the many-worlds
view. Bell [8] suggested as a PO for the many-worlds view that each world consists of
particles with actual positions (like a classical or Bohmian world). In a genuine manyworlds theory based on this ontology, at every time t, every configuration Q ∈ R3N would
9
be realized in some world, in such a way that the distribution across the ensemble of all
worlds is |ψt |2 . However, Bell himself objected that the “other” worlds, other than the
one we are in, serve no purpose and should be discarded. In his words:
[I]t seems to me that this multiplication of universes is extravagant, and
serves no real purpose in the theory, and can simply be dropped without
repercussions.
It is worth noting that this objection does not apply to Sm, as there is no easy, clean,
and precise way of getting rid of all but one world in Sm. Bell, however, can remove
most worlds from his picture, and thus proposes the following for the one remaining
world:
instantaneous classical configurations [Q] are supposed to exist, and to
be distributed . . . with probability |ψ|2 . But no pairing of configurations
at different times, as would be effected by the existence of trajectories, is
supposed.
It is not clear what is meant by the last sentence, given that for every time t a configuration Q(t) is supposed to exist. What Bell presumably had in mind is that for every
time t the configuration Q(t) is chosen independently with distribution |ψt |2 . Let us call
this theory Sip (S for the Schrödinger equation, i for independent, and p for particle
ontology); in [3] it was called BMW for “Bell’s version of many-worlds.” So in Sip, the
PO consists of particles, as in Bohmian or classical mechanics, but their positions vary
with time in an utterly wild and discontinuous way. (Indeed, the path t 7→ Q(t) will
typically not even be a measurable function.)
Notwithstanding the step of removing most worlds, Sip still has a certain manyworlds character, which manifests itself when one considers a time interval. Within this
interval, the configuration Q(t) will visit all regions of configuration space in which ψ is
nonzero, and those regions more often that contain more of |ψ|2 . So in Sip, many worlds
exist, not at the same time, but one after another. For example, if after a quantum
measurement
the wave function of the system and the apparatus is a superposition
P
c
ψ
of
contributions
with the apparatus pointer pointing to different outcomes
α α α
α, then the actual outcome, the one corresponding to the positions of the particles
constituting the pointer, will be different at different times, and more often be a value
with greater weight |cα |2 . Taking into account the occasions in the past at which wave
packets split into several ones, we are led to conclude that there are also moments in
time within every second, according to Sip, in which dinosaurs are still around.
Against Sip, Bell objects that the history of our world, according to Sip, is unbelievably eccentric, implying that our memories are completely unreliable, as the past was
nothing like the way we remember it. It is worth noting that also this objection does
not apply to Sm, as the history of every single world in Sm is much like the way we
normally think of the history of our world.
Sip is related to Nelson’s stochastic mechanics [32, 25] (in the variant due to Davidson
[15] with arbitrary diffusion constant); in fact it can be regarded as the limiting case
10
of stochastic mechanics in which the diffusion constant tends to infinity. As another
drawback of Sip, chances seem low that this theory could ever be made relativistic,
given that it relies explicitly on the concept of simultaneity.
∗∗∗
Another comparison we should make is between Sm and GRWm, the Ghirardi–
Rimini–Weber (GRW) theory of spontaneous wave function collapse [24, 10] in the
version with a matter density ontology [12, 26]. GRWm shares with Sm the law (1) for m,
but uses a stochastic and nonlinear modification of the Schrödinger equation, according
to which macroscopic superpositions like Schrödinger’s cat spontaneously “collapse”
within a fraction of a second into one contribution or another, with probabilities very
close to those prescribed by the quantum formalism. As a consequence, only one of the
wave packets corresponding to different “worlds” remains large while the others fade
away, and the m function of GRWm is essentially just one of the mℓ contributing to
m in Sm. Thus, it seems reasonable to say that GRWm does not share the manyworlds character of Sm. (On the other hand, one might argue that even in GRWm,
other contributions mk , k 6= ℓ, still exist, however small they may be. Note, though,
that those contributions are not just reduced in size by the GRW collapses, but also
distorted, due to a large relative gradient of the tails of the Gaussian involved in the
collapse, so that their evolution is very much disturbed [46].)
5
Nonlocality
Bell’s theorem seems to show that every theory that agrees with the quantum formalism
must be nonlocal [5]. But Bell’s argument relies on the assumption that experiments
have unambiguous outcomes. That is a very normal kind of assumption, but one that is
inappropriate in theories with a many-worlds character, as Bell concedes in the passage
quoted in the beginning of this article. Because of its ontological clarity, Sm provides
an occasion to analyze the relevance of the many-worlds character to locality. So is Sm
a local theory or not?
We first observe that in the absence of interaction between two disjoint regions A
and B of space, experimenters in A have no way of influencing m|B , the matter density
in B. After all, if
ψ = ψ(q1 , . . . , qM , r1 , . . . , rN ) = ψ(~q, ~r)
is a wave function for which some variables are confined to A, q1 , . . . , qM ∈ A, and some
to B, r1 , . . . , rN ∈ B, then m|B depends on ψ only through the reduced density matrix
associated with B, ρB = trA |ψihψ|, where trA means the partial trace over the variables
~q = (q1 , . . . , qM ). Indeed,
Z
ρB (~r; ~s) = d3M ~q ψ ∗ (~q, ~r) ψ(~q, ~s)
(11)
11
and, for x ∈ B,
m(x) =
N
X
mM +j
j=1
Z
d3N ~r δ 3 (x − rj ) ρB (~r; ~r) .
(12)
Since, as is well known, ρB will not depend on any external fields at work in A, fields
that the experimenters may have set up to influence the matter governed by ψ, for as
long as there is no interaction between A and B, it follows that the same thing is true
of m|B . This shows that experimenters in A cannot influence m|B .
And yet, Sm is nonlocal. To see this, consider an Einstein–Podolsky–Rosen (EPR)
experiment, starting with two electrons in the singlet state, one in Alice’s lab A and the
other in Bob’s lab B. While there is no interaction between A and B, Alice and Bob
each perform a Stern–Gerlach experiment in the z direction. Now consider a time t just
after detectors have clicked on both sides. Recall that in ordinary quantum mechanics
the outcome has probability 12 to be (up, down) and probability 12 to be (down, up).
Hence, in Sm the wave function ψ = ψt of the EPR pair together with the detectors
(and other devices) splits into two macroscopically disjoint packets,
X
ψℓ = ψ(up,down) + ψ(down,up) ,
(13)
ψ=
ℓ
and correspondingly,
m=
X
mℓ = m(up,down) + m(down,up) .
(14)
ℓ
Thus the world in which Alice’s result is “up” is the same world as the one in which
Bob’s result is “down,” and it is this fact that is created in a nonlocal way.
To connect, and contrast, this nonlocality with the fact that Alice cannot influence
m|B , we note that the m function alone, while revealing that there are two worlds in A
(corresponding to the results “up” and “down”) and two worlds in B (corresponding to
the results “up” and “down”), does not encode the information conveying which world
in A is the same as which world in B. That is, the pairing of worlds cannot be read off
from m(·, t) even though it is an objective fact of Sm at time t, defined by means of the
wave function ψt .
Moreover, even though Alice cannot influence the PO in B, she can influence other
physical facts pertaining to B as follows. Consider now two options for Alice: she can
carry out a Stern–Gerlach experiment in either the z direction or the x direction (what
is often called measuring σz or σx ). Suppose further that t1 is a time at which a detector
in A has already clicked but the electron in B has not yet reached its Stern–Gerlach
magnet. Then the wave function ψ = ψt1 of the EPR pair and the detectors in A
together is either—if Alice chose the z direction—of the form
ψ=
+
√1
2
√1
2
↑, z = +1
A
↓, z = 0
B
“up”
↓, z = −1
A
↑, z = 0
B
“down”
12
(15)
(with the first two factors referring to spin and position of the EPR pair and the third
to the detectors in A), or—if Alice chose the x direction—of the form
ψ=
+
√1
2
√1
2
→, x = +1
A
←, z = 0
B
“right”
←, x = −1
A
→, z = 0
B
“left” .
(16)
Now suppose that at time t2 > t1 , the electron in Bob’s lab has passed through a Stern–
Gerlach magnetic field oriented in the z direction, but not yet reached the detectors.
Then the above expressions for ψ = ψt1 have to be modified as follows for ψ = ψt2 (of
the EPR pair and the detectors in A): using
and ← = √12 ↑ − ↓ ,
(17)
→ = √12 ↑ + ↓
we have that either—if Alice chose the z direction—
ψ=
+
√1
2
√1
2
↑, z = +1
A
↓, z = −1
B
“up”
↓, z = −1
A
↑, z = +1
B
“down”
(18)
or—if Alice chose the x direction—
ψ=
+
1
2
1
2
→, x = +1
←, x = −1
A
↑, z = +1
B
“right”
+ ↓, z = −1 B “left” .
(19)
P
As a consequence, at time t2 the decomposition m =
mℓ = m1 +m2 into worlds reads,
on the B side, either—if Alice chose the z direction—
A
↑, z = +1
− ↓, z = −1
B
m1 |B = 21 mz=−1 ,
B
m2 |B = 12 mz=+1
(20)
(with mz=−1 a unit bump centered at z = −1, etc.) or—if Alice chose the x direction—
m1 |B = 41 mz=−1 + 14 mz=+1 ,
m2 |B = 41 mz=−1 + 14 mz=+1 .
(21)
That is, while m(x) for x ∈ B is unaffected by Alice’s choice, each mℓ (x) is affected.
This is an example of an objective fact pertaining to region B that is influenced by
Alice’s choice, and illustrates that the nonlocality of Sm is even of the kind involving
instantaneous influences.3
The situation of nonlocality in Sm can be compared to that in the “many minds”
picture described by Albert and Loewer [1]. There, the PO is replaced by a collection of
purely mental events; Alice has many minds, some of which see “up” and some “down,”
and so does Bob, but no pairing is assumed that would specify which of Bob’s minds is in
3
It is interesting that Sm turns out to be nonlocal in situations that do not seem to require nonlocality
in a single-world framework: If both Alice and Bob choose the z direction, then the correlation can of
course be explained locally by means of a “hidden variable” (that was the point of Einstein, Podolsky
and Rosen [20]). If Alice chooses the x and Bob the z direction then the outcomes of both sides are
independent. So, if Alice can only choose between z and x then the statistics can be explained by two
independent hidden variables associated with the z and x directions.
13
the same world as which of Alice’s. This parallels the absence of pairing between Bob’s
worlds and Alice’s worlds in the m function. The clarity of Sm helps exemplify that this
fact alone does not imply locality: Even if one assumes the absence of a pairing in the
PO (or, for “many minds,” in the mental events replacing the PO), the non-primitive
ontology (i.e., the wave function) may define such a pairing nonetheless. For the same
reason, also “many minds” should be regarded as nonlocal. Of course, the nonlocality
of Sm is already suggested by the facts that Sm cannot be formulated purely in terms of
local variables (but needs the nonlocal variable ψ), and that EPR (or Bell) correlations
in Sm do not propagate through space at finite speed.
6
Relativity
Even though Sm is nonlocal, it can easily be made relativistic, at least formally, neglecting cut-offs and renormalization. We acknowledge that to go beyond a formal theory
such as sketched below to one that is well defined and physically adequate would of
course be a formidable challenge.
For any relativistic quantum theory, consider the Heisenberg picture with fixed state
vector ψ, let Tµν (x, t) be the stress-energy tensor operator for the space-time point (x, t),
and set
mµν (x, t) = hψ|Tµν (x, t)|ψi .
(22)
This tensor field on space-time is arguably the most obvious relativistic analog of the
formula (1), for what could in ordinary quantum mechanics be called the average mass
distribution. Indeed, in the nonrelativistic limit, mµν (x, t) should have time-time component
m00 (x, t) = m(x, t) c2
(23)
with m(x, t) as in (1), and all other components negligible. The theory with PO given
by (22) is relativistically invariant because of the relativistic invariance of the underlying
quantum theory and that of the operator-valued tensor field Tµν (x, t).
Other relativistic laws than (22) are conceivable. In fact, the concept of matter
density per se does not even select whether the relativistic analog of the m(·) function
should be a scalar, vector, or tensor field; the above choice of tensor field was inspired
by the relativistic concept of mass-energy, but, as mentioned before, the matter density
function need not be linked to masses.
7
Probability
In ordinary quantum mechanics, the outcome of a “quantum measurement” (say, a
Stern–Gerlach experiment) is regarded as random with certain probabilities. In Sm,
though, all possible outcomes are realized in different worlds, so it is not obvious how
it can make sense to talk of probabilities at all. What are these probabilities the probabilities of ?
14
This problem is often called the “incoherence problem,” applying to any many-worlds
interpretation of quantum mechanics, be it in the form originally put forward by Everett
[21, 22] or in other formulations (see, e.g., [16]). Moreover, most authors agree that once
this problem is solved, there is still the quantitative problem of showing that the probabilities agree with the quantum-mechanical ones; see, e.g., [30]. In recent years various
proposals have been put forward to solve these problems. David Deutsch has suggested
that probabilities can be understood in terms of rational action [17] and that one should
prove, via decision theory, that a “rational” agent who believes himself to be in a manyworlds universe, should nevertheless make decisions as if the quantum probabilities gave
the chances for the results of experiments in the usual way; in this regard, see also the
contribution of David Wallace [45]. Lev Vaidman has put forward his “sleeping pill”
argument to support the validity of the ignorance interpretation of probability [40, 41].
Recently, Simon Saunders and Wallace have considered a “semantic turn” in order to
ensure the truth of utterances typically made about quantum mechanical contingencies, including statements of uncertainty, by speakers living in a many-worlds universe
[44, 35]. See also [38, 30, 31, 4].
Since Sm is a many-worlds formulation of quantum mechanics—albeit with a precise primitive ontology—any of the proposals mentioned above about the meaning of
probabilities in a many-worlds setting can equally well be considered in Sm. We prefer, however, Everett’s approach [21, 22], that of denying that the incoherence problem
is a genuine problem (for more on this see Section 9) and appealing to typicality for
the quantitative problem. Typicality is a notion that goes back at least to Ludwig
Boltzmann’s mechanical analysis of the second law of thermodynamics [27], and that,
in recent years, has been used for explaining the emergence of quantum randomness in
Bohmian mechanics [19].4 According to Everett [22]:
We wish to make quantitative statements about the relative frequencies of
the different possible results . . . for a typical observer state; but to accom4
In Bohmian mechanics different histories of the world, corresponding to different initial configurations, are possible for the same wave function Ψ of the universe, and the observed frequencies may
agree with quantum mechanics in some histories but not in others. Thus a concept of “typical history”
is needed. The only known candidate for this concept that is time translation invariant is the one
given by the |Ψ|2 measure. This measure is equivariant [19], a property which expresses the mutual
compatibility of the Schrödinger evolution of the wave function and the Bohmian motion of the configuration. This measure is used in the following way: A property P is typical if it holds true for the
overwhelming majority of histories Q(t) of a Bohmian universe. More precisely, suppose that Ψt is the
wave function of a universe governed by Bohmian mechanics; a property P , which a solution Q(t) of
the guiding equation for the entire universe can have or not have, is called typical if the set S0 (P ) of
all initial configurations Q(0) leading to a history Q(t) with the property P has size very close to one,
Z
|Ψ0 (q)|2 dq = 1 − ε , 0 ≤ ε ≪ 1 ,
(24)
S0 (P )
with “size” understood relative to the |Ψ0 |2 distribution on the configuration space of the universe. For
instance, think of P as the property that a particular sequence of experiments yields results that look
random (accepted by a suitable statistical test), governed by the appropriate quantum distribution.
One can show, using the law of large numbers, that P is typical; see [19] for a thorough discussion.
15
plish this we must have a method for selecting a typical element from a
superposition of orthogonal states. . . .
The situation here is fully analogous to that of classical statistical mechanics,
where one puts a measure on trajectories of systems in the phase space by
placing a measure on the phase space itself, and then making assertions which
hold for “almost all” trajectories (such as ergodicity, quasi-ergodicity, etc).
This notion of “almost all” depends here also upon the choice of measure,
which is in this case taken to be Lebesgue measure on the phase space.
. . . [T]he choice of Lebesgue measure on the phase space can be justified by
the fact that it is the only choice for which the “conservation of probability”
holds, (Liouville’s theorem) and hence the only choice which makes possible
any reasonable statistical deductions at all.
In our case, we wish to make statements about “trajectories” of observers.
However, for us a trajectory is constantly branching (transforming from state
to superposition) with each successive measurement.
Let us explain how that works in Sm. It is useful to focus on the following statement:
The relative frequencies for the results of experiments that a typical observer
sees agree, within appropriate limits, with the probabilities specified by the
quantum formalism.
(25)
We elaborate on this statement below. The idea is that a derivation of this statement
amounts to a justification of our use of the quantum probabilities. For a discussion of
the idea of a typical observer in a different context see [28], [29, Chap. 5], where the
rule that we humans should see what a typical observer sees was called the “Copernican
principle.”
By what a “typical observer sees,” be it relative frequencies or any other sort of
behavior corresponding to some property P , we mean that P occurs in “most” worlds.
When this is true, we often also say that the behavior is typical, or that P typically
holds, or that P is typical. It is, of course, crucial here to specify exactly what is meant
by “most”—what is meant by saying that P is typical.
The sense of typical we have in mind is given by assigning to each world ml a weight
Z
µℓ = d3 x mℓ (x, t) .
(26)
We say that
A property P holds typically (or, for most worlds) if and only if the sum
of the weights µℓ , given by (26), of those worlds for which P holds is very
near the sum of the weights of all worlds.
(27)
In the next section we will discuss why we believe that this is a reasonable notion of
typicality, one such that we should expect to see what is typical. Let us now explore its
16
mathematical consequences.5
In terms of the decomposition (8) of the wave function ψ into macroscopically different contributions ψℓ , the weight can be expressed as follows, according to the definition
(1) of the m function:
Z
N
X
mi
(28)
µℓ = d3 x mℓ (x, t) = kψℓ k2
i=1
(recall that mi are the mass parameters associated with the N “particles”). That P
is, the
weights we associate with different worlds are the same weights, up to a factor
mi ,
as would usually be associated with different worlds in a many-worlds framework. We
can also rephrase the typicality of P in terms of the ψℓ : Let L be the set of all indices
ℓ, L = {1, . . . , L }, and L(P ) the set of those indices ℓ such that the world with index
ℓ has the property P . By (27), the property P holds typically if and only if
P
µℓ
ℓ∈L(P )
P
µℓ
= 1−ε,
0 ≤ ε ≪ 1.
(29)
ℓ∈L
Since the ψℓ do not overlap, and assuming kψk = 1 as usual, we have that
X
X
X
µℓ = kψk2
mi =
mi .
i
ℓ∈L
i
Thus, P holds typically if and only if6
X
kψℓ k2 = 1 − ε ,
0 ≤ ε ≪ 1.
(30)
ℓ∈L(P )
Everett showed that with the sense of typicality provided by the weights (26,28) the
law of large numbers yields (25). A simple example should suffice here. Consider an
observer performing a large number n of independent Stern–Gerlach experiments for
which quantum mechanics predicts “spin up” with probability p and “spin down” with
probability q = 1 − p. Let this n-part experiment begin at time t0 and end at time t; let
us focus on just one world at time t0 . Assume that the sequence of outcomes, such as
↑↓↓↑ . . . ↓↑↑↑ ,
(31)
gets recorded macroscopically, and thus in mℓ (·, t). The one world at time t0 splits into
L ≥ 2n worlds at time t,
ψ = ψ(t) =
L
X
ψℓ (t) ,
m(x, t) =
L
X
mℓ (x, t) .
(32)
ℓ=1
ℓ=1
5
One might be tempted to think that some of the worlds, those represented with less weight µℓ , are
somehow less real, corresponding perhaps to a lesser degree of existence (a “measure of existence” was
considered by Vaidman [41]). But we do not think that there can be different degrees of existence, and
we certainly see no basis for such a position in Sm.
6
Note the similarity between (24) and (30).
17
Now some of the worlds at time t feature a sequence in which the relative frequencies of
the outcomes agree, within appropriate limits, with the quantum probabilities p and q.
However, this is true only of some worlds, but not all. It is a property P that a world
may have or not have.
Is P typical? Let L(k) be the set of those ℓ such that the world mℓ features a
sequence of k spins up and n − k spins down; taken together, these worlds have weight
X n
X X
X
2
pk q n−k .
(33)
kψℓ k =
mi
µℓ =
mi
k
i
i
ℓ∈L(k)
ℓ∈L(k)
Since n is large, the weight is overwhelmingly concentrated on those worlds for which the
relative frequency k/n of “up” is close to p. This follows from the law of large numbers,
which ensures that, if we generated a sequence of n independent random outcomes, each
“up” with probability p or “down” with probability q, then the relative frequency of
“up” will be close to p with probability close to 1. Thus the total weight of the worlds
with k/n ≈ p is close to the total weight. This illustrates how (27) yields (25). The
upshot is that Sm is empirically equivalent to both orthodox quantum mechanics and
Bohmian mechanics.7
In both Sm and Bohmian mechanics, typicality is used for two purposes: prediction
and explanation. Namely, when deriving predictions from Bohmian mechanics we claim
that the typical behavior will occur, even if there are possible universes in which different
behavior occurs. For explanation of why the world looks the way it does, we say that
it is typical for a Bohmian world to look that way. Likewise in Sm: When we want
to make predictions, we know that a property like P will hold in some worlds mℓ and
not in others, so what we predict is the typical behavior—the one that occurs in most
worlds (with the weighted notion of “most”); and the explanation for why we see a
certain behavior is that it occurs in most worlds. Insofar as the typicality reasoning is
concerned, in Sm the world we are in plays the same role as the actual world in Bohmian
mechanics.8
7
We note the following subtlety about the empirical equivalence between Sm and Bohmian mechanics. Even though there is no experiment that could distinguish them, there exist contrived situations
in which Sm does not make the same empirical prediction as Bohmian mechanics, but rather makes
no empirical prediction at all. Namely, there exist contrived situations in which Sm provides no recognizable macroscopic objects, while any experimental test would, of course, require the existence of
such objects, in particular of pointers to register the outcome and of humans or other beings as experimenters. For example, suppose that 3-space is not R3 but a 3-torus (S 1 )3 , where S 1 denotes a circle (of
some large perimeter). Then some wave functions on configuration space (S 1 )3N are invariant under
translations of (S 1 )3 . Such wave functions can be obtained from any ψ by superposing all translates of
ψ. They would appear completely acceptable in Bohmian mechanics but would lead to a profoundly
problematical state of the PO in Sm, namely a constant m function. To see this, note that if two wave
functions are translates of each other then the m functions they give rise to are translates of each other
as well; as a consequence, a translation invariant wave function ψ (which may be very nontrivial) gives
rise to a translation invariant m function (which must be constant). Of course, this fact is not fatal to
the viability of Sm, as the wave function of the universe need not be translation invariant.
8
What is different about the use of typicality in the two theories is that while in Bohmian mechanics
typicality is used for explaining physical facts, in Sm it is used for explaining indexical facts. Indexical
18
8
Typicality
In this section, we address the following two questions: Should not the concept of
typicality (or that of “most” worlds) be based on the number of worlds, disregarding
the weights µℓ ? And, which reasons select (26) as the rule for determining the weights?
It would seem that counting would provide a better measure of typicality, maybe
even the only acceptable one. And counting would also seem to lead to rather different
predictions. After all, in the example above involving n ≫ 1 Stern–Gerlach experiments,
if the worlds are taken to be in one-to-one correspondence with the possible outcomes
(i.e., the sequences of ups and downs) then, by the law of large numbers, the worlds in
which the relative frequency k/n of “up” is approximately 1/2 far outnumber those in
which k/n ≈ p (provided p is sufficiently different from 1/2).
But counting worlds is not well defined; there is no fact of the matter as to how many
worlds have some property P . In this respect, “worlds” are
Pnot like beans (that can
be counted) but more like clouds. The decomposition ψ =
ψℓ is associated with an
orthogonal decomposition H = ⊕ℓ Hℓ of the Hilbert space H into subspaces Hℓ corresponding to different macrostates [42], a decomposition that is inevitably arbitrary, due
to the vagueness of the notion of “macroscopic” or, in other words, due to the arbitrariness of the boundaries between macrostates.9 Concretely, it is often unclear whether
two wave packets φ1 and φ2 should be regarded as “macroscopically different” or not; as
a consequence, it is then unclear whether ψ = φ1 + φ2 should be regarded as two worlds,
ψ1 = φ1 and ψ2 = φ2 , or as one world, ψ1 = φ1 + φ2 . Indeed, the decomposition of ψ into
“its macroscopically different contributions” ψℓ will usually depend on our interpretation of “macroscopically different.” For example, if ψ as a function of the center-of-mass
coordinate of a meter pointer is smeared out, into how many “macroscopically different”
parts should we divide it? And in the example above of n Stern–Gerlach experiments,
should we regard the worlds as corresponding to different outcomes (given as sequences
of ups and downs), or should we choose a finer decomposition by taking into account the
times when detectors clicked, and regard contributions to ψ that correspond to different
times (but the same sequence of ups and downs) as different worlds? For many purposes,
the ambiguity inherent in the notion “macroscopically different” is not a problem, but
for the purpose of counting worlds it is.
2
However, if we use the weights µℓ (or, equivalently, kψℓ kP
) then, while there is still
the same amount of arbitrariness in the decomposition ψ = ψℓ , the weight associated
with the property P ,
X
µ(P ) =
µℓ ,
ℓ∈L(P )
statements are statements referring to concepts like “here,” “now,” or “I.” A simple example of an
indexical statement is “there are five coins in my pocket.” In physics, once I am told all physical facts
about my universe, I may still need to be told where I am in this picture: which space-time location
corresponds to here-now, and furthermore, for any theory with many-worlds character, in which of the
worlds to find me. Those are indexical facts. The indexical fact to be explained here is that I find
myself in a world with property P .
9
The suggestion that world count is ill-defined has been discussed recently in [45].
19
is unambiguous, as a consequence of what Everett [21, 22] called
P the additivity of the
weights: When we further decompose a contribution ψℓ into ℓ′ ψℓ,ℓ′ then the norm
squares add according to
X
kψℓ k2 =
(34)
kψℓ,ℓ′ k2 .
ℓ′
This follows if the ψℓ,ℓ′ have disjoint supports in configuration space. (In fact, Everett
2
showed that the weights must be equal, up to an overall factor,
P to kψℓ k if they are
given by some fixed function f (kψℓ k) and additive, f (kψℓ k) = ℓ′ f (kψℓ,ℓ′ k), where ψℓ,ℓ′
are mutually orthogonal. However, he did not make explicit the connection between
additivity and the ambiguity of the notion of the macroscopically different.)
Thus, counting the worlds is not an option. But even in theories in which the
concept of world is precisely defined and thus allows us to count worlds, weights may
arise naturally in the form of multiplicities, associated with representing several worlds
with the same configuration by one world with multiplicity.
Another factor supporting the use of the weights µℓ (26) for the measure of typicality
is their quasi-equivariance, i.e., the two facts, analogous to the equivariance of the |ψ|2
distribution in Bohmian mechanics, that the weight µℓ of a world does not change
under the unitary time evolution unless it splits, and that when a world splits then
the sum of the weights after splitting is the same as the weight before splitting. The
former is a consequence of unitarity and the fact that µℓ is proportional to kψℓ k2 , and
the latter follows from the additivity mentioned above. Quasi-equivariance is relevant
since, as Everett says in the passage quoted above, “we wish to make statements about
“trajectories” of observers.” As a consequence of quasi-equivariance, memories and
records obey the following type of consistency: If it is typical at time t1 that, say,
between 33% and 34% of the outcomes of a certain experiment are “up” then it is
typical at time t2 > t1 that between 33% and 34% of the records of those outcomes are
records of “up.”
The notions of typicality and quasi-equivariance we have considered are just what
Everett considered after the passage quoted above:
To have a requirement analogous to the “conservation of probability” in the
classical case, we demand that the measure assigned to a trajectory at one
time shall equal the sum of the measures of its separate branches at a later
time. This is precisely the additivity requirement which we imposed and
which leads uniquely to the choice of square-amplitude measure. Our procedure is therefore quite as justified as that of classical statistical mechanics.
In other words, Everett’s assessment is that the most natural measure is indeed the one
using kψℓ k2 weights, and we agree. In addition, owing to Sm’s greater ontological clarity,
we believe that Everett’s analysis, when applied to Sm, becomes even more transparent
and compelling than for more standard versions of the many-worlds interpretation.
20
9
Uncertainty
Probabilities are often regarded as expressions of our lack of knowledge, of ignorance
and uncertainty; the typicality approach, however, does not directly involve uncertainty.
So let us make some remarks about the status of uncertainty in Sm.
Since Bohmian mechanics is a deterministic theory, it also gives rise to the question
about the meaning of probabilities. But in Bohmian mechanics this question is much
less problematical than in Sm; in a Stern–Gerlach experiment, for example, the outcome
depends on the initial wave function and the initial position of the particle, and while we
may know the wave function, we cannot know the position with sufficient precision to
infer the outcome (except when the initial wave function is an eigenstate of the relevant
spin operator). But in Sm we cannot be uncertain about what “the” outcome of a
Stern–Gerlach experiment will be, since we know that both outcomes will be realized.
If we believe we are living in a many-worlds universe, we should regard our feelings of
uncertainty about the future as sheer illusion.
As shocking as this may seem, however, it should not be held against many-worlds
theories. After all, modern physics has accustomed us to the illusory character of many
of our experiences; for example, according to the standard understanding of relativity
(at least among physicists), our feeling of the passage of time is also a sheer illusion
(“however persistent,” as Albert Einstein wrote to the widow of Michele Besso). Likewise, the fact that in Sm (as in any physical theory with a many-worlds character) there
is a severe gap between metaphysics and experiences—i.e., the fact that the reality is
very different from what our experiences suggest, in that the future is not as uncertain
as we normally imagine and that there are other worlds we do not see—need not conflict
with the goal of explaining our experiences. It seems not at all impossible to explain
why observers who believe in a many-worlds universe nevertheless behave as if they were
uncertain about the future; some approaches are presented in, e.g., [44, 38, 30, 31, 4, 35].
(In fact, we would presumably often feel uncertain in a many-worlds theory for the same
evolutionary reasons as for a single-world theory.) However, this problem should not
be confused with the problem of explaining the origin of physical probabilities, i.e., of
explaining the relative frequencies we see, a problem resolved by the typicality analysis.
10
Summary
We have shown that Schrödinger’s first interpretation of quantum mechanics, in which
the wave function is regarded as describing a continuous distribution of matter in space
and arguably the most naively obvious interpretation of quantum mechanics, has a
surprising many-worlds character. We have also shown that insofar as this theory makes
any consistent predictions at all, these are the usual predictions of textbook quantum
mechanics.
Funding. This work was supported by the National Science Foundation [DMS-0504504
to S.G.]; and Istituto Nazionale di Fisica Nucleare [to N.Z.].
21
Acknowledgments. We thank David Albert (Columbia University), Cian Dorr (Oxford), Detlef Dürr (LMU München), Adam Elga (Princeton), Ned Hall (Harvard), Barry
Loewer (Rutgers), Tim Maudlin (Rutgers), Travis Norsen (Marlboro), Daniel Victor
Tausk (São Paulo), David Wallace (Oxford), and Hans Westman (Sydney) for useful
discussions and comments on previous versions of this paper. We are particularly grateful to Travis Norsen for discussions on nonlocality.
References
[1] Albert, D.Z., Loewer, B.: Interpreting the many worlds interpretation. Synthese
77: 195–213 (1988).
[2] Allori, V.: Fundamental Physical Theories: Mathematical Structures Grounded on
a Primitive Ontology. Ph. D. thesis, Department of Philosophy, Rutgers University
(2007). Online http://www.niu.edu/∼ vallori/thesis4.pdf
[3] Allori, V., Goldstein, S., Tumulka R., Zanghı̀, N.: On the Common Structure of
Bohmian Mechanics and the Ghirardi–Rimini–Weber Theory. British Journal for
the Philosophy of Science 59: 353–389 (2008). arXiv:quant-ph/0603027
[4] Baker, D.: Measurement outcomes and probability in Everettian quantum mechanics. Studies in the History and Philosophy of Modern Physics 38: 153–69 (2007).
[5] Bell, J.S.: On the Einstein–Podolsky–Rosen Paradox. Physics 1: 195–200 (1964).
Reprinted as chapter 2 of [11].
[6] Bell, J. S.: On the Problem of Hidden Variables in Quantum Mechanics. Reviews
of Modern Physics 38: 447–452 (1966). Reprinted as chapter 1 of [11].
[7] Bell, J. S.: The Theory of Local Beables. Epistemological Letters 9: 11 (1976).
Reprinted in Dialectica 39: 85 (1985), and as chapter 7 of [11].
[8] Bell, J. S.: Quantum Mechanics for Cosmologists. In C. Isham, R. Penrose, and
D. Sciama (editors), Quantum Gravity 2, 611–637. Oxford: Clarendon Press (1981).
Reprinted as chapter 15 of [11].
[9] Bell, J. S.: Six possible worlds of quantum mechanics. Proceedings of the Nobel
Symposium 65: Possible Worlds in Arts and Sciences. Stockholm, August 11–15,
1986. Reprinted as chapter 20 of [11].
[10] Bell, J. S.: Are There Quantum Jumps? In C. W. Kilmister (editor) Schrödinger.
Centenary Celebration of a Polymath, 41–52. Cambridge: Cambridge University
Press (1987). Reprinted as chapter 22 of [11].
[11] Bell, J. S.: Speakable and Unspeakable in Quantum Mechanics. Cambridge: Cambridge University Press (1987).
22
[12] Benatti, F., Ghirardi, G.C., Grassi, R.: Describing the macroscopic world: closing
the circle within the dynamical reduction program. Foundations of Physics 25:
5–38 (1995).
[13] Bohm, D.: A Suggested Interpretation of the Quantum Theory in Terms of “Hidden” Variables, I and II. Physical Review 85: 166–193 (1952).
[14] Chalmers, D.: The Conscious Mind. Oxford: Oxford University Press (1996).
[15] Davidson, M.: A generalization of the Fényes–Nelson stochastic model of quantum
mechanics. Letters in Mathematical Physics 3: 271–277 (1979).
[16] Deutsch, D.: Quantum theory as a universal physical theory. International Journal
of Theoretical Physics 24: 1–41 (1985).
[17] Deutsch, D.: Quantum theory of probability and decisions. Proceedings of the Royal
Society of London A 455: 3129–3137 (1999). arXiv:quant-ph/9906015
[18] DeWitt, B., Graham, R.N. (ed.s): The Many-Worlds Interpretation of Quantum
Mechanics (Princeton Series in Physics). Princeton: Princeton University Press
(1973).
[19] Dürr, D., Goldstein, S., Zanghı̀, N.: Quantum Equilibrium and the Origin
of Absolute Uncertainty. Journal of Statistical Physics 67: 843–907 (1992).
arXiv:quant-ph/0308039
[20] Einstein, A., Podolsky, B., Rosen, N.: Can Quantum-Mechanical Description of
Physical Reality Be Considered Complete? Physical Review 47: 777–780 (1935).
[21] Everett, H.: The Theory of the Universal Wavefunction. Ph. D. thesis, Department
of Physics, Princeton University (1955). Reprinted as pp 3–140 of [18].
[22] Everett, H.: Relative State Formulation of Quantum Mechanics. Reviews of Modern
Physics 29: 454–462 (1957).
[23] Everett, H., in a letter to B. S. DeWitt (1957), quoted from Byrne, P.: The Many
Worlds of Hugh Everett. Scientific American December 2007, 98–105.
[24] Ghirardi, G.C., Rimini, A., Weber, T.: Unified Dynamics for Microscopic and
Macroscopic Systems. Physical Review D 34: 470–491 (1986).
[25] Goldstein, S.: Stochastic Mechanics and Quantum Theory. Journal of Statistical
Physics 47: 645–667 (1987).
[26] Goldstein, S.: Quantum Theory Without Observers. Physics Today, Part One:
March 1998, 42–46. Part Two: April 1998, 38–42.
23
[27] Goldstein, S.: Boltzmann’s Approach to Statistical Mechanics. In Bricmont, J.,
Dürr, D., Galavotti, M.C., Ghirardi, G.C., Petruccione, F., and Zanghı̀, N. (eds.),
Chance in Physics: Foundations and Perspectives (Lecture Notes in Physics 574).
Berlin: Springer-Verlag (2001). arXiv:cond-mat/0105242
[28] Gott, J.R.: Implications of the Copernican Principle for Our Future Prospects.
Nature 363: 315–319 (1993).
[29] Gott, J.R.: Time Travel in Einstein’s Universe. Boston: Houghton Mifflin (2001).
[30] Greaves, H.: Understanding Deutsch’s probability in a deterministic multiverse.
Studies in the History and Philosophy of Modern Physics 35: 423–456 (2004).
arXiv:quant-ph/0312136
[31] Lewis, P. J.: Uncertainty and probability for branching selves. Studies in
the History and Philosophy of Modern Physics 38: 1–14 (2007). Online
http://philsci-archive.pitt.edu/archive/00002636/
[32] Nelson, E.: Quantum Fluctuations. Princeton: Princeton University Press (1985).
[33] Saunders, S.: Relativism. In R. Clifton, ed., Perspectives on Quantum Reality,
125–142. Dordrecht: Kluwer (1995).
[34] Saunders, S.: Time, decoherence, and quantum mechanics. Synthese 102: 235–66
(1995).
[35] Saunders, S., Wallace, D.:
Branching and Uncertainty. British Journal for the Philosophy of Science 59:
293–305 (2008). Online
http://philsci-archive.pitt.edu/archive/00003811/
[36] Schrödinger, E.: Quantisierung als Eigenwertproblem (Vierte Mitteilung). Annalen
der Physik 81: 109–139 (1926). English translation in [37].
[37] Schrödinger, E.: Collected Papers on Wave Mechanics, translated by J. F. Shearer.
New York: Chelsea (1927).
[38] Tappenden, P.: Identity and Probability in Everett’s Multiverse. British Journal
for the Philosophy of Science 51: 99–114 (2000).
[39] Tumulka, R.: A Relativistic Version of the Ghirardi–Rimini–Weber Model. Journal
of Statistical Physics 125: 821–840 (2006). arXiv:quant-ph/0406094
[40] Vaidman, L.: On Schizophrenic Experiences of the Neutron or Why We should Believe in the Many-Worlds Interpretation of Quantum Theory. International Studies
in the Philosophy of Science 12: 245–261 (1998). arXiv:quant-ph/9609006
[41] Vaidman, L.: Many-Worlds Interpretation of Quantum Mechanics. In E. N.
Zalta (ed.), Stanford Encyclopedia of Philosophy (Fall 2008 Edition). Online
http://plato.stanford.edu/archives/fall2008/entries/qm-manyworlds/
24
[42] von Neumann, J.: Mathematical Foundations of Quantum Mechanics. Princeton:
Princeton University Press (1955). Translation of Mathematische Grundlagen der
Quantenmechanik, Berlin: Springer-Verlag (1932).
[43] Wallace, D.: Everett and Structure. Studies in History and Philosophy of Modern
Physics 34: 87–105 (2003).
[44] Wallace, D.: Epistemology quantized: circumstances in which we should come to
believe in the Everett interpretation. British Journal for the Philosophy of Science
57: 655–689 (2006).
[45] Wallace, D.: Quantum probability from subjective likelihood: Improving on
Deutsch’s proof of the probability rule. Studies in History and Philosophy of Modern
Physics 38: 311–332 (2007). arXiv:quant-ph/0312157
[46] Wallace, D., in Maudlin, T., private communication (2006).
25