Concepts in Theoretical Physics by Baumann PDF
Concepts in Theoretical Physics by Baumann PDF
Concepts in Theoretical Physics by Baumann PDF
Daniel Baumann
[email protected]
Contents
Preface 1
2 Quantum Mechanics 11
2.1 The Split Personality of Electrons . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.1 Particles or Waves? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.2 The Structure of Atoms . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.2 Principles of Quantum Mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.2.1 States are Vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.2.2 Observables are Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2.3 Measurements are Probabilistic . . . . . . . . . . . . . . . . . . . . . . . . 15
2.2.4 Collapse of the State Vector . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.2.5 The Uncertainty Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.2.6 Combining Systems: Entanglement . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Quantum Mechanics in Your Face . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.1 The GHZ Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.2 Completely Nuts, But True! . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.3.3 Quantum Reality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3 Statistical Mechanics 24
3.1 More is Different . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.2 The Distinction of Past and Future . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.3 Entropy and The Second Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.3.1 Things Always Get Worse . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.3.2 Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.3.3 Counting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
3.3.4 Arrow of Time . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.4 Entropy and Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.4.1 Maxwell’s Demon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.4.2 Szilard’s Engine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.4.3 Saving the Second Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
i
ii Contents
5 Particle Physics 49
5.1 The Standard Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5.1.1 A New Periodic Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5.1.2 Four Forces Bind Them All . . . . . . . . . . . . . . . . . . . . . . . . . . 51
5.2 From Fields to Particles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
5.3 From Symmetries to Forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
5.4 From Virtual Particles to Real Forces . . . . . . . . . . . . . . . . . . . . . . . . 54
5.4.1 Heisenberg’s Uncertainty Principle . . . . . . . . . . . . . . . . . . . . . . 54
5.4.2 Feynman Diagrams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
5.4.3 QED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
5.4.4 QCD and Confinement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
5.4.5 Why the Sun Shines∗ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
5.5 The Origin of Mass . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
5.5.1 An Analogy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
5.5.2 Beyond Cartoons∗ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5.5.3 Discovery of the Higgs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.6 Beyond the Standard Model∗ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
5.6.1 Dark Matter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
5.6.2 Supersymmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
6 General Relativity 66
6.1 The Happiest Thought . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
6.1.1 Fictitious Forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
6.1.2 Equivalence Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
6.1.3 Black Holes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
6.2 Gravity as Geometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Contents iii
6.2.1 Spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
6.2.2 Matter Tells Space How To Curve ... . . . . . . . . . . . . . . . . . . . . 73
6.2.3 ... Space Tells Matter How To Move. . . . . . . . . . . . . . . . . . . . . . 74
6.3 Gravitational Waves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
6.4 Quantum Gravity∗ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
7 Cosmology 78
7.1 The Big Bang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
7.1.1 Nucleosynthesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
7.1.2 Recombination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
7.1.3 Cosmic Microwave Background . . . . . . . . . . . . . . . . . . . . . . . . 82
7.1.4 Gravitational Instability . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
7.2 The Horizon Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
7.3 Inflation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
7.3.1 Solution of the Horizon Problem . . . . . . . . . . . . . . . . . . . . . . . 85
7.3.2 The Physics of Inflation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
7.4 From So Simple A Beginning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
7.4.1 Quantum Fluctuations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
7.4.2 CMB Anisotropies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
7.5 Breaking News: BICEP2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
7.5.1 B-modes from Gravitational Waves . . . . . . . . . . . . . . . . . . . . . . 89
7.5.2 Have They Been Seen? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
7.6 A Puzzle and A Mystery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
7.6.1 Dark Matter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
7.6.2 Dark Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
References 99
Preface
“If you decide you don’t have to get A’s, you can learn an enormous amount in college.”
Isidor Isaac Rabi.
This course is intended to be purely for fun and entertainment. I want to show you a few of
the wonderful things that lie ahead in your studies of theoretical physics. We will talk about
some of the most mind-boggling and beautiful aspects of modern physics. We will discuss the
physics of the very small (quantum mechanics and particle physics), the physics of the very
big (general relativity and cosmology), the very fast (special relativity), and the very complex
(statistical mechanics).
I will try to strike a balance between keeping things simple and staying honest. In particular,
I will make an effort to avoid oversimplifying analogies. Instead I will look for ways of giving
you a gist of the real deal without too much lying. This won’t always possible, either because
it would take too much time to present a detailed argument or because you haven’t yet been
exposed to the advanced mathematics used in modern physics. In that case, I will try to find
words and diagrams to describe the missing equations and I ask you to focus on the big picture,
rather than the technical details. You will have the opportunity to catch up on those details in
future courses.
I sincerely hope that these notes will be easy to read. A few words of advice: don’t obsess
about details! You don’t have to understand every line and every equation. You are welcome
to focus on the parts that you enjoy and ignore the rest. Having said that, many of the ideas
that I will present in these lectures are so beautiful, that it will be worth making some effort
to understand them. If you don’t understand an equation, just try to understand the words
surrounding it. In fact, if you want, you can view any equations in this course simply as art and
listen to my words to get a sense for their meanings.
To show you a bit more of the mathematics behind the ideas, I will include extra material as short
digressions. These parts will be separated from the main text by boxes. Most of this material
won’t be covered live in the lectures, but is for you to read in your own time.
Sections that are marked by a ‘∗’ are even more non-examinable than the rest of the course.
Acknowledgements. Thanks to Prof. David Tong. I have stolen shamelessly from his notes
for a previous version of the course. Whatever good jokes are in these notes are most likely his.
1
1 Principle of Least Action
You have all suffered through a course on Newtonian mechanics. You therefore all know how to
calculate the way things move: you draw a pretty picture; you draw arrows representing forces;
you add them all up; and then you use F = ma to figure out where things are heading next. All
of this is rather impressive—it really is the way the world works and we can use it to understand
things about Nature. For example, showing how the inverse square law for gravity explains
Kepler’s observations of planetary motion is one of the great achievements in the history of
science.
However, there’s a better way to do it. This better way was found about 150 years after
Newton, when classical mechanics was reformulated by some of the giants of mathematical
physics—people like Lagrange, Euler and Hamilton. This new way of doing things is better for
a number of reasons:
• Firstly, it’s elegant. In fact, it’s not just elegant: it’s completely gorgeous. In a way
that theoretical physics should be, and usually is, and in a way that the old Newtonian
mechanics really isn’t.
• Secondly, it’s more powerful. It gives new methods to solve hard problems in a fairly
straightforward manner. Moreover, it is the best way of exploiting the symmetries of
a problem (see Appendix A). And since these days all of physics is based on symmetry
principles, it is really the way to go.
• Finally, and most importantly, it is universal. It provides a framework that can be extended
to all other laws of physics, and reveals a deep relationship between classical mechanics
and quantum mechanics. This is the real reason why it’s so important.
In this lecture, I’ll show you the key idea that leads to this new way of thinking. It’s one of
the most profound results in physics. But, like many profound results, it has a rubbish name.
It’s called the principle of least action.
2
1.1 A New Way of Looking at Things 3
F~ = −∇V
~ . (1.1.2)
The potential V (~r) depends on ~r, but not on ~r˙ . Newton’s equation then reads
m~r¨ = −∇V
~ . (1.1.3)
This is a second-order differential equation, whose general solution has two integration con-
stants. Physically this means that we need to specify the initial position ~r(t1 ) and the initial
velocity ~r˙ (t1 ) of the particle to figure out where it is going to end up.
The integrand is called the Lagrangian L = 21 m~r˙ 2 − V (~r ). The minus sign is not a typo. It is
crucial that the Lagrangian is the difference between the kinetic energy (K.E.) and the potential
energy (P.E.) (while the total energy is the sum). Here is an astounding claim:
Let us prove it: you know how to find the extremum of a function—you differentiate and set
it equal to zero. But the action not a function, it is a functional—a function of a function. That
makes it a slightly different problem. You will learn how to solve problems of this type in next
year’s “methods” course, when you learn about “calculus of variations”. It is really not as fancy
as it sounds, so let’s just do it for our problem: consider a given path ~r(t). We ask how the
action changes when we change the path slightly
but in such a way that we keep the end points of the path fixed
~ · δ~r + O(δ~r 2 ) .
V (~r + δ~r ) = V (~r) + ∇V (1.1.8)
Since δ~r is infinitesimally small, we can ignore all terms of quadratic order and higher, O(δ~r 2 , δ~r˙ 2 ).
The difference between the action for the perturbed path and the unperturbed path then is
Z t2 h i
δS ≡ S[~r + δ~r ] − S[~r ] = dt m~r˙ · δ~r˙ − ∇V
~ · δ~r (1.1.9)
t1
Z t2 h i h it2
= dt −m~r¨ − ∇V
~ · δ~r + m~r˙ · δ~r . (1.1.10)
t1 t1
To go from the first line to the second line we have integrated by parts. This picks up a term
that is evaluated at the boundaries t1 and t2 . However, this term vanishes since the end points
are fixed, i.e. δ~r(t1 ) = δ~r(t2 ) = 0. Hence, we get
Z t2 h i
δS = dt −m~r¨ − ∇V
~ · δ~r . (1.1.11)
t1
The condition that the path we started with is an extremum of the action is
δS = 0 . (1.1.12)
This should hold for all changes δ~r(t) that we could make to the path. The only way this can
be true is if the expression in [· · · ] is zero. This means
m~r¨ = −∇V
~ . (1.1.13)
But this is just Newton’s equation (1.1.3)! Requiring that the action is an extremum is equivalent
to requiring that the path obeys Newton’s equation. It’s magical.
1.1 A New Way of Looking at Things 5
Comments:
• From the Lagrangian L(~r, ~r˙, t) we can define a generalized momentum and a generalized force,
~P ≡ ∂L and ~F ≡ ∂L . (1.1.14)
∂~r˙ ∂~r
In our simple example, L = 12 m~r˙ 2 − V (~r ), these definitions reduce to the usual ones: ~P = m~r˙
and ~F = −∇V
~ . (For more complicated examples, ~P and ~F can be less recognizable.) Newton’s
equation can then be written as
d~P ~
d ∂L ∂L
=F ⇔ = . (1.1.15)
dt dt ∂~r˙ ∂~r
1.1.3 Examples
To get a bit more intuition for this strange way of doing mechanics, let us discuss two simple
examples:
L = 12 m~r˙ 2 . (1.1.18)
In this case the least action principle implies that we want to minimize the kinetic energy
over a fixed time. It is reasonable to expect that the particle must take the most direct
route, which is a straight line:
But, do we slow down to begin with, and then speed up? Or, do we accelerate like mad
at the beginning and slow down near the end? Or, do we go at a uniform speed? The
average speed is, of course, fixed to be the total distance over the total time. So, if you do
anything but go at a uniform speed, then sometimes you are going too fast and sometimes
you are going too slow. But, the mean of the square of something that deviates around
an average, is always greater than the square of the mean. (If this isn’t obvious, please
think about it for a little bit.) Hence, to minimize the integrated kinetic energy—i.e. the
6 1. Principle of Least Action
At this point, you could complain: what is the big deal? I could easily do all of this with
F = ma. While this is true for the simple examples that I gave so far, once the examples
become more complicated, F = ma requires some ingenuity, while the least action approach
stays completely fool-proof. No matter how complicated the setup, you just count all kinetic
energies, subtract all potential energies and integrate over time. No messing around with vectors.
You will see many examples of the power of the least action approach in future years. I promise
you that you will fall in love with this way of doing physics!
Exericise.—You still don’t believe me that Lagrange wins over Newton? Then look at the following
example:
Derive the equations of motion of the two masses of the double pendulum.
Hint: Show first that the Lagrangian is
1
L = mr2 θ̇2 + (θ̇ + α̇)2 + θ̇(θ̇ + α̇) cos α − mg r [2 cos θ + cos(θ − α)] .
2
Another advantage of the Lagrangian method is that it reveals a deep connection between
symmetries and conservation laws. I describe this in Appendix A. You should read that on you
own.
1.2 Unification of Physics 7
Maxwell
z }| {
µν
L ∼ R −
|{z}
1
4 Fµν F + iψ̄γ µ Dµ ψ + |Dµ h|2 − V (|h|) + hψ̄ψ . (1.2.20)
| {z } | {z } | {z } |{z}
Einstein Yang−Mills Dirac Higgs Yukawa
Don’t worry if you don’t understand many of the symbols. You are not supposed to. View this
equation like art.
Let me at least tell you what each of the terms stands for:
• The first two terms characterize all fundamental forces in Nature: The term ‘Einstein’ de-
scribes gravity. Black holes and the expansion of the universe follow from it (see Lectures 6
and 7).
• The next term, ‘Maxwell’, describes electric and magnetic forces (which, as we will see
in Lecture 4, are just different manifestations of a single (electromagnetic) force). A
generalization of this, ‘Yang-Mills’, encodes the strong and the weak nuclear forces (we
will see what these are in Lecture 5).
• The next term, ‘Dirac’, describes all matter particles (collectively called fermions)—things
like electrons, quarks and neutrinos. Without the final two terms, ‘Higgs’ and ‘Yukawa’,
these matter particles would be massless.
The Lagrangian (1.2.20) is a compact way to describe all of fundamental physics. You can print
it on a T-shirt or write it on a coffee mug:
8 1. Principle of Least Action
• In the Newtonian approach, the intuition for how particles move goes something like this:
at each moment in time, the particle thinks “where do I go now?”. It looks around, sees the
potential, differentiates it and says “ah-ha, I go this way.” Then, an infinitesimal moment
later, it does it all again.
• The Lagrangian approach suggests a rather different viewpoint: Now the particle is taking
the path which minimizes the action. How does it know this is the right path? It is sniffing
around, checking out all paths, before it decides: “I think I’ll go this way”.
At some level, this philosophical pondering is meaningless. After all, we proved that the two
ways of doing things are completely equivalent. This changes when we go beyond classical
mechanics and discuss quantum mechanics. There we find that the particle really does sniff
out every possible path!
I have been lying to you. It is not true that a free particle only takes one path (the straight
line) from ~r(t1 ) to ~r(t2 ). Instead, according to quantum mechanics it takes all possible paths:
It can even do crazy things, like go backwards in time or first go to the Moon. According to
quantum mechanics all these things can happen. But they happen probabilistically. At the
deepest level, Nature is probabilistic and things happen by random chance. This is the key
insight of quantum mechanics (see Lecture 2).
The probability that a particle starting at ~r(t1 ) will end up at ~r(t2 ) is expressed in terms of
an amplitude A, which is a complex number that can be thought of as the square root of the
probability
Prob = |A|2 . (1.3.21)
To compute the amplitude, you must sum over all path that the particle can take, weighted by
a phase X
A= eiS/~ . (1.3.22)
paths
1.3 From Classical to Quantum (and back) 9
Here, the phase for each path is just the value of the action for the path in units of Planck’s
constant ~ = 1.05×10−34 J·s. Recall that complex numbers can be represented as little arrows in
a two-dimensional xy-plane. The length of the arrow represents the magnitude of the complex
number, the phase represents its angle relative to say the x-axis. Hence, in (1.3.22) each path
can be represented by a little arrow with phase (angle) given by S/~. Summing all arrows
and then squaring gives the probability. This is Feynman’s path integral approach to quantum
mechanics. Sometimes it is called sum over histories.
The way to think about this is as follows: when a particle moves, it really does take all
possible paths. However, away from the classical path (i.e. far from the extremum of the action),
the action varies wildly between neighbouring paths, and the sum of different paths therefore
averages to zero:
A B
C
A B
i.e. far from the classical path, the little arrows representing the complex amplitudes of each
path point in random directions and cancel each other out.
Only near the classical path (i.e. near the extremum of the action) do the phases of neigh-
bouring paths reinforce each other:
B
A
C B C
A
i.e. near the classical path, the little arrows representing the complex amplitudes of each path
point in the same direction and therefore add. This is how classical physics emerges from
quantum physics. It is totally bizarre, but it is really the way the world works.
electron source
B
In Newtonian physics, each electron either takes path A OR B. We then expect the image
behind the two slits just to be the sum of electrons going through slit A or B, i.e. we expect the
10 1. Principle of Least Action
electrons to build up a pair of stripes—one corresponding to electrons going through A, and one
corresponding to electrons going through B.
We don’t just see two strips, but a series of strips. In quantum physics, each electron seems to
take the paths A AND B simultaneously, and interferes with itself! (Since the electrons are fired
at the slits one by one, there is nobody else around to interfere with.) It may seem crazy, but
it is really the way Nature ticks. You will learn more about this in various courses on quantum
mechanics over the next few years. I will tell you a little bit more about the strange quantum
world in Lecture 2.
2 Quantum Mechanics
“If quantum mechanics hasn’t profoundly shocked you, you haven’t understood it.”
Niels Bohr
Today, I will tell you more about quantum mechanics—what weird thing it is and why
it is so weird. The importance of quantum mechanics cannot be overstated. It explains the
structure of atoms, it describes how elementary particles interact (Lecture 5) and it may even
be the ultimate origin of all structure in the universe (Lecture 7). And yet it is so mind-boggling
strange and bizarre that it is very difficult to come to terms with. To the point where it suggests
that, ultimately, our brains may not be best equipped to understand what’s going on! On the
other hand, if we just “shut up and calculate”, quantum mechanics simply works. In fact, it is
arguably the most successful scientific theory ever developed. In this lecture, I will try to give
you a glimpse of the wonderful world of quantum mechanics. This will be, by far, the most
challenging lecture of the course. Are you ready?
We would predict that the pattern behind the two slits simply to be the sum of the patterns
for each slit considered separately: if half the bullets were fired with only the left slit open and
then half were fired with just the right slit open, the result would be the same.
11
12 2. Quantum Mechanics
But, I showed you what happens in the actual experiment: The electrons, like bullets, strike
the target one at a time. We slowly see a pattern building up—yet, it is not the pattern that
we would expect if electrons behaved like bullets. Instead the pattern we get for electrons looks
more like the pattern we would get for waves:
With waves, if the slits were opened one at a time, the pattern would resemble that for particles:
two distinct peaks. But when the two slits are open at the same time, the waves pass through
both slits at once and interfere with each other: where they are in phase they reinforce each
other; where they are out of phase they cancel each other out. Electrons seem to do the same.
However, if each electron passes individually through one slit, what does it “interfere” with?
Although each electron arrives at the target at a single place and time, it seems that each has
passed through both slits at once. Remember that in Lecture 1 we claimed that electrons “sniff
out all possible paths.” In the double slit experiment we see this in action.
proton
electron
The electric force keeps an electron in orbit around a proton, just like the gravitational force
keeps the Earth in orbit around the Sun. According to this view of the atom, the electron
follows a single classical path with no restriction on the size or eccentricity of the orbit (which
only depend on the initial conditions).
In quantum mechanics, the answer is very different. The electron “sniffs out” many possible
orbits and its position smears out into ‘quantum orbits’:
2.2 Principles of Quantum Mechanics 13
Moreover, there are only a finite number of allowed quantum orbits, with specific energies. The
orbits and their energies are ‘quantized’. In this way, quantum mechanics explains the periodic
table of elements and therefore underlies all of chemistry. Having agreed that this is important
stuff, let’s see some of the mathematical details.
We will use |Ψi for the state vector.1 As a simple example consider a system that only has two
possible states: 0/1, up/down, on/off, left/right, dead/alive, etc. Such a system is also called a
1
The notation |Ψi goes back to Dirac. It is called a ket-vector or just ket. There are also bra-vectors, hΦ|,
which are the transposes (or Hermitian conjugates) of the ket-vectors. The inner product of a bra and a ket,
hΦ|Ψi, is a bra-ket, or bracket.
14 2. Quantum Mechanics
bit. The two states of the bit can be represented by the following two-component vectors
!
1
|↑i ≡ , (2.2.1)
0
!
0
|↓i ≡ . (2.2.2)
1
To have a physical example in mind, you may think of the spin of an electron relative to say the
z-axis.2 In fact, this will be our go-to example for the rest of this lecture.
In the classical world, the system has to be either in one state or the other. However, in the
quantum world, the state can be in a superposition of both states at the same time, i.e. we can
add the state vectors !
α
|Ψi = α |↑i + β |↓i = , (2.2.3)
β
where α and β are complex numbers satisfying |α|2 + |β|2 = 1. Such a state is called a qubit
(quantum bit). The possibility of a superposition of states is the origin of all the weirdness of
quantum mechanics.
More generally, the state vector can be the sum of many basis states
X
|Ψi = αi |i i , (2.2.4)
i
where each basis state | ii is weighted by a complete number αi and i |αi |2 = 1. The spectrum of
P
states can also be continuous and the corresponding vector space infinite-dimensional. For example,
this is the case for a particle with position x. In quantum mechanics, the state of the particle is a
superposition of all possible locations
Z
|Ψi = dx α(x)|x i , (2.2.5)
We have a different matrix M for each kind of measurement we can make on a system: energy,
position, momentum, spin, etc. The matrices corresponding to the spin of an electron along the
x-axis, y-axis and zaxis are
! ! !
0 1 0 −i 1 0
X= , Y= , Z= . (2.2.6)
1 0 i 0 0 −1
2
Do not try to visualize the electron as a spinning top. Particle spin is a purely quantum phenomenon. In
quantum mechanics, the spin can either be up, |↑i, or down, |↓i. To show that the electron can only have these
two spin states is hard.
2.2 Principles of Quantum Mechanics 15
The matrices “act” on the state vectors simply by matrix multiplication. In general, when a
matrix acts on a vector, it will change the direction of the vector. However, there are certain
vectors whose direction are the same after the matrix multiplication. These special vectors are
called eigenvectors. In our notation, this means
where |ii are the eigenvectors and mi the corresponding eigenvalues. The vectors (2.2.1) and
(2.2.2) are eigenvectors of Z with eigenvalues +1 and −1, i.e.
i.e. the possible outcomes of a measurement are the eigenvalues mi of the matrix M associated
with the observable. Since the measurement matrices are Hermitian, the eigenvalues are al-
ways real. You can easily check that the eigenvalues of all matrices in (2.2.6) are +1 and −1.
Measuring the spin of an electron along any axis will therefore always give +1 or −1.
In general, the outcome of a measurement will be probabilistic (see below). An important
exception is that
In other words, if the system is prepared in an eigenvector of M, i.e. |Ψi = |ii, you will always
measure the corresponding eigenvalue mi , with certainty! This is important, so let me say it
again: Although we will see that quantum mechanics is all about probabilities and uncertainties,
if the system is prepared in an eigenvector of a particular observable and you measure that
observable, you will always just get the eigenvalue. For example, if we prepare a system in the
state |↑i and decide to measure its spin along the z-axis, we will always get +1.
A special matrix H (called the Hamiltonian; see Appendix A) represents a measurement of energy.
The possible outcomes of the corresponding measurement are the energy eigenvalues Ei , and (2.2.7)
becomes the famous Schrödinger equation,
H| ii = Ei | i i . (2.2.9)
For a given system, you need to figure out the matrix H, then solve its eigenvalues. This is usually
hard! Sometimes, like for the hydrogen atom it can be done exactly. Most of the time (e.g. for
all other atoms) it is done in some approximation scheme. Solving the eigenvalue equation (2.2.9)
explains the discrete energy levels of atoms.
The measurement could give any one of the eigenvalues mi . Each with some probability. We
can expand any arbitrary state |Ψi in a basis of eigenvectors of M,
X
|Ψi = αi |ii , (2.2.10)
i
where αi are complex constants. The probability of the measurement giving the eigenvalue mi is
i.e. the probability of measuring mi is simply the square of the expansion coefficient αi . The
sum of all probabilities has to add up to 100%, so we have
X X
Prob(mi ) = |αi |2 = 1 . (2.2.12)
i i
Let’s see what this means for the spinning electron. If the system is prepared in the qubit
state (2.2.3) then
mi
|Ψi −−→ |ii ,
i.e. if the eigenvalue mi has been measured, then the state of the system after the measurement
is the corresponding eigenvector |ii. If we now were to repeat the measurement we would
get the same value mi with certainty. But, if we then perform a measurement of a different
observable, corresponding to a matrix N, the outcome will again be probabilistic, unless |ii is
also an eigenvector of N.
Again, the spinning electron makes this clear. If the system is prepared in the state (2.2.3),
then we measure spin-up or spin-down with probabilities |α|2 and |β|2 , respectively. Once the
measurement is made, the state vector |Ψi will collapse into one of the two states, | ↑i or | ↓i,
depending on which is measured. Every subsequent measurement of spin along the z-axis will
give the same value. However, if we then decide to measure a different quantity, say the spin
along the x-axis, the result will be probabilistic again. I warned you that this is strange stuff.
2.2 Principles of Quantum Mechanics 17
Let’s say you make a measurement of the spin along the z-axis and find +1. After that the state will
be
1
|Ψi = |↑i = . (2.2.16)
0
Check for yourself that the eigenvectors of the matrix X in (2.2.6) are
1 1 1 1
|→i ≡ √ , |←i ≡ √ . (2.2.17)
2 1 2 −1
next measurement of the spin along the z-axis would again be probabilistic, giving up or down with
equal probabilities. And so on ...
~
∆x∆p ≥ , (2.2.19)
2
where ~ ≈ 10−34 J · s is Planck’s constant. The reason we don’t notice these uncertainties in
everyday life is because ~ is so tiny. Similar uncertainty relations exist for other observables.
have two electrons each with spin-up or spin-down. We will label the electrons A and B. If A is
in the state up, |↑iA , and B is in the state down, |↓iB , then the combined state is
The left-hand side is called a tensor product. The simplified notation on the right-hand side
should be clear: the first arrow denotes the spin of A, the second arrow that of B. In total, there
are four possible combined states:
The state vector |Ψi may be a superposition of these four states. For example, imagine that the
system is prepared in the state
This is called an entangled state since it cannot be separated into the product of states for
the individual electrons. Not all superpositions of the states (2.2.21) are entangled states. An
example of a product state 3 is
|Ψi = √12 |↑↓i + √12 |↓↓i , (2.2.25)
which can be written as
|Ψi = √1 |↑iA + √1 |↓iA ⊗ |↓iB . (2.2.26)
2 2
The main feature of a product state is that each subsystem behaves independently of the other.
If we do an experiment on B, the result is exactly the same as if A did not exist. The same
is true for A of course. In contrast, in an entangled state measurements of A and B are not
independent.
The state (2.2.22) can arise if a particle without spin decays into two electrons. Since angular
momentum is conserved, the spins of the two electrons have to be anti-aligned. While in classical
physics the system then has to be in the state |↑↓i or |↓↑i, in quantum physics it can be in the
state |↑↓i and |↓↑i. The two electrons are then separated by a large distance (as large as you
wish). Say one of the electrons stays here, while the other is brought to the end of the universe.
We then measure the spin of the electron that stayed here. There is a 50% chance that the
result will be spin-up, and a 50% chance that it will be spin-down. However, once the spin of
the electron is measured, the spin of the far-away electron is determined instantly. By itself
this correlation of the measurements is not a problem. The same happens in classical physics.
Imagine a pair of gloves. Each glove is put into a box and the two boxes are then separated by
a large distance. If you open one of the boxes and find a left-hand glove, you know immediately
that the far-way glove is a right-hand glove. No problem. However, entangled states like (2.2.22)
are superpositions of correlated states. As we will see in the next section, such states can have
correlations that are purely quantum. Quantum gloves are actually left- and right-handed (and
everything in between) up until the moment you observe them. Moreover, the left-handed glove
3
In general, the set of product states takes the form
doesnt become left until you observe the right-handed one — at which moment both instantly
gain a definite handedness. This really upset Einstein. He called it “spooky action at a distance”.
A B C
It is conventional in the quantum mechanics literature to call them Alice (A), Bob (B) and
Charlie (C).5 They are sent to labs at three different locations. Every minute, they receive a
4
This section is based on a famous lecture by Sidney Coleman (http://www.physics.harvard.edu/about/video.html),
which itself was based on Greenberger, Horne, and Zeilinger (GHZ) (1990) “Bell’s theorem without inequalities”,
Am. J. Phys. 58 (12): 1131, and Mermin (1990) “Quantum mysteries revisited” Am. J. Phys. 58 (8): 731734.
5
Admittedly, A doesn’t look much like an Alice.
20 2. Quantum Mechanics
B C
Each scientist has a machine that performs measurements of the packages. The machine has
two settings, X or Y, and each measurement can give two outcomes, +1 and −1.
+1 -1
Alice, Bob and Charlie are told what they have to do:6
1. Choose the setting X or Y on the machine.
2. Insert the package into the machine.
3. Make a measurement.
4. Record whether the result is +1 or −1.
5. Go back to Step 1.
Each measurement is recorded until each scientist has a list that looks like this:
X X Y X Y Y X Y X X X Y ···
+1 −1 +1 −1 −1 +1 +1 +1 −1 −1 +1 −1 ···
After they each have made a bazillion measurements, Alice, Bob and Charlie get together and
start looking for correlations in their measurements. (Since their packages came from the same
source, it isn’t unreasonable to expect some correlations.) They notice the following: Whenever
one of them measured X, and the other two measured Y, the results always multiply to +1, i.e.7
XA YB YC = YA XB YC = YA YB XC = +1 . (2.3.27)
6
They are not told what’s in the packages: They could be blood samples, with the machine testing for high/low
glucose when the switch is on X, and high/low cholesterol when the switch is on Y. They could be elementary
particles with the machine measuring the spin along x or y. Or, the whole thing could just be a hoax with the
machine flashing up +1/ − 1 at random.
7
Here, the notation XA YB YC means that Alice measured X, while Bob and Charlie measured Y.
2.3 Quantum Mechanics in Your Face 21
Maybe this occurred because all three got the result +1; or perhaps one got +1 and the other
two got −1. Since the central station doesn’t know ahead of time which setting (X or Y) each
scientist will choose, it has to prepare each package with definite states for both property X and
property Y. The observed correlation in (2.3.27) is consistent with only the following 8 different
shipments from the central station:
XA YA + + − − − + − −
= XB YB = + + − − − + + +
XC YC + + + + + − − −
− + + − + + + −
+ − + − − − − + (2.3.28)
− + + − − − − +
Now, notice that (2.3.27) gives a prediction . . . if all three scientists measure X, the results
multiplied together must give +1. You can see this simply by multiplying the entries of the
first columns in the matrices in (2.3.28). Alternative, we can prove the result using nothing but
simple arithmetic:
In the first equality we used that the product is associative, while the second equality follows
from (±1)2 = +1. The final equality is a consequence of (2.3.27).
XA YB YC = YA XB YC = YA YB XC = +1 . (2.3.30)
XA XB XC = −1 . (2.3.31)
In classical physics, this shouldn’t happen, since (2.3.29) was a logical consequence of (2.3.27).
Something about our basic (classical) intuition for how the universe works is wrong!
definite assignments to all possible measurements. Instead we have to allow for the possibility
of a superposition of states and experimental outcomes being probabilistic. It turns out that
this special feature of quantum mechanics resolves the puzzle.
More concretely, Alice, Bob and Charlie were, in fact, measuring the spins of electrons along
the x- and y-axes. In this case, the measurement matrices are
! !
0 1 0 −i
X= , Y= . (2.3.32)
1 0 i 0
You can check that these matrices both have eigenvalues +1 and −1, corresponding to the
measurements. (But, X and Y do not have the same eigenvectors.) As before, we define two
special state vectors corresponding to an electron spinning up or down relative to the z-axis,
! !
1 0
|↑i ≡ , |↓i ≡ . (2.3.33)
0 1
These states are not eigenstates of X and Y. It is easy to see that acting with the matrix X on
the up-state turns it into a down-state and vice versa:
Similarly, acting with the matrix Y also exchanges up- and down-states (up to factors of i and
−i)
Now, you are told that the central station sent out the following entangled state
This corresponds to the superposition of two states: a state with all spins up
|↑↑↑i ≡ ,
|↓↓↓i ≡ .
2.3 Quantum Mechanics in Your Face 23
As before, I am using a notation where the arrows are ordered: the first arrow corresponds to
the spin of the first particle (the one sent to Alice), the second arrow corresponds to the spin of
the second particle (the one sent to Bob), etc. The measurement matrix XA therefore acts on
the first arrow of each state, XB on the second arrow, etc.
So, whenever exactly one scientist measures X, the results indeed multiply to give +1. However,
when all three scientists measure X, we get
h i
XA XB XC |Ψi = XA XB XC |↑↑↑i − |↓↓↓i
= (1)(1)(1)|↓↓↓i − (1)(1)(1)|↑↑↑i
= |↓↓↓i − |↑↑↑i
= −1 |Ψi . (2.3.43)
The classical expectation, eq. (2.3.29), is wrong. But, it makes total sense in the quantum world.
It was important that the spin states of the three particles weren’t independent, but that they
were “entangled” in the state (2.3.38)—a superposition of all spins up |↑↑↑i and all spins down
|↓↓↓i. No matter how far the scientists are separated in space, this entanglement of spin states
is reflected in their measurements. It is how the world works.
3 Statistical Mechanics
“Ludwig Boltzmann, who spent much of his life studying statistical mechanics, died in 1906
by his own hand. Paul Ehrenfest, carrying on the work, died similarly in 1933. Now it is
your turn to study statistical mechanics.”
David Goodstein
Today, I want to tell you about entropy and the Second Law of thermodynamics. This will
address deep questions about the world we live in, such as why we remember the past and not
the future. It will also have relations to information theory, computing and even the physics
of black holes. Along the way, we will encounter one of the most important formulas in all of
science:
24
3.1 More is Different 25
It contains about N = 1024 atoms. In fact, any macroscopic object contains such a stupendously
large number of particles. How do we describe such systems?
An approach that certainly won’t work, is to write down the equations of motion for all 1024
particles and solve them. Even if we could handle such computations, what would we do with
the result? The positions of individual particles are of little interest to anyone. We want answers
to much more basic questions about macroscopic objects. Is it wet? Is it cold? What colour is
it? What happens if we heat it up? How can we answer these kind of questions starting from
the fundamental laws of physics?
Statistical mechanics is the art of turning the microscopic laws of physics into a description
of the macroscopic everyday world. Interesting things happen when you throw 1024 particles
together. More is different: there are key concepts that are not visible in the underlying laws of
physics but emerge only when we consider a large collection of particles. A simple example is
temperature. This is clearly not a fundamental concept: it doesn’t make sense to talk about the
temperature of a single electron. But it would be impossible to talk about the world around us
without mention of temperature. Another example is time. What distinguishes the past from
the future? We will start there.
the law of gravity as an example. Take a movie of the planet going around a star. Now run
the movie in reverse. Does it look strange? Not at all. Any solution of Newton’s equations
can be run backward and it is still a solution. Whether the planet goes around the star one
way or the opposite way is just a matter of its initial velocity. The law of gravitation is time
reversible. Similarly, the laws of electricity and magnetism are time reversible. And so are all
other fundamental laws that are relevant for creating our everyday experiences.
So what distinguishes the past from the future? How do reversible microscopic laws give rise
to apparent irreversible macroscopic behaviour? To understand this, we must introduce the
concept of entropy.
Similarly, the natural tendency of coffee and milk is to mix, but not to unmix
These basic facts of life are summarized in the Second Law of Thermodynamics:2
To the physicists of the late 19th century the Second Law was a serious paradox. They knew
that the microscopic laws of physics are time reversible. So if entropy can increase, the laws
of physics say it must be able to decrease. Yet, experience says otherwise. Entropy always
increases.
3.3.2 Probability
This is where Ludwig Boltzmann’s genius came in. He realized is that the Second Law is not a
law in the same sense as Newton’s law of gravity or Faraday’s law of induction. It’s a probabilistic
law that has the same status as the following obvious claim: if you flip a coin a million times you
will not get a million heads. It simply won’t happen. But is it possible? Yes, it is—it violates
no law of physics. Is it likely? Not at all. Boltzmann’s formulation of the Second Law is just
that. Instead of saying entropy does not decrease, he said that
This is where the difference between 1 (or a few) and 1024 is important again. It is much more
likely for a handful of particles to spontaneously do crazy things than for 1024 particles. The
Second Law emerges for a large number of particles.
This also implies that if you wait around long enough, you will eventually see entropy decrease:
• By accident, particles and dust will come together and form a perfectly assembled bomb.
But, how long does it take for that to happen? A very long time. A lot longer than the
time to flip a million heads in a row, and even a lot longer than the age of the universe.
• Imagine I drop a bit of black ink into a glass of water. The ink spreads out and eventually
makes the water grey. Will a glass of grey water ever clear up and produce a small drop
of ink? Not impossible, but very unlikely.
• The air in this room is uniformly distributed. Is it possible that all air molecules sponta-
neously collect in one corner of the room, leaving the rest a vacuum? Not impossible, but
very unlikely.
2
The First Law is the conservation of energy.
28 3. Statistical Mechanics
3.3.3 Counting
Let us now get a bit more precise and less philosophical. First, we want to give definitions of
three related concepts: microstates, macrostates and statistical entropy.
As a concrete example, consider a collection of N particles in a box. If a particle is in the
left half of the box, we say that it is in state L. If it is in the right half, we call its state R. We
specify a microstate of the system by making a list of the states of each particle, whether its
left (L) or right (R). For instance, for N = 10 particles, a few possible microstates are
LLLLLLLLLL
RLLLLLLLLL
LRLLRRLLLL
···
The total number of possible microstates is 2N (two possibilities for each particle). For N = 1024
24
particles, this is a ridiculously large number, 210 . Luckily, we never need a list of all possible
microstates. All macroscopic properties only depend on the relative number of left and right
particles and not on the detail of which are left and right.
We can collect all microstates with the same numbers of L and R into a single macrostate,
labelled by one number
n ≡ NL − NR . (3.3.1)
How many microstates are in a given macrostate? Look at N = 10. For n = 10 (all left) and
n = −10 (all right), we only have 1 unique microstate each. For n = 8 (one right), we get 10
possible microstates since for 10 particles there are 10 ways of putting one particle on the right.
For n = 0 (equal number on the left and the right), we get 252 microstates. The complete
distribution of the number of microstates per macrostate is summarized in the following figure:
252
250
210
200
150
120
100
50 45
10
1
n
-10 -8 -6 -4 -2 0 2 4 6 8 10
It is easy to generalize this: let W (n) be the number of ways to have N particles with NL
particles on the left and NR particles on the right. The answer is
N! N!
W (n) = = N −n N +n . (3.3.2)
NL !NR ! ( 2 )!( 2 )!
3.3 Entropy and The Second Law 29
For very large N , your calculator won’t like evaluating the factorials. At this point, a normal
distribution is a very good approximation
2 /2N
W (n) ≈ 2N e−n . (3.3.3)
So far this was just elementary combinatorics. To this we now add the fundamental assump-
tion of statistical physics:
dS
≥0. (3.3.6)
dt
We are getting closer to a microscopic understanding of the Second Law.
it was being run forward or backward in time. At the level of individual particles, we can’t
distinguish the past from the future.
However, let’s look at the evolution from a more macroscopic perspective:
time
t = 200
t = 50
t=1
The box has N = 2,000 molecules in it, and starts at time t = 1 with 1,600 molecules on the
left-hand side and only 400 on the right. It’s not very surprising what happens: because there
are more molecules on the left, the total number of molecules that shift from left to right will
usually be larger than the number that shift from right to left. So after 50 seconds we see that the
numbers are beginning to equal out, and after 200 seconds the distribution is essentially equal.
This box clearly displays an arrow of time. Even if we hadn’t labelled the different distributions
in the figure with the specific times to which they correspond, you wouldn’t have any trouble
guessing that the bottom box came first and the top box came last. We’re not surprised when
the air molecules even themselves out, but we would be very surprised if they spontaneously
congregated all on one side of the box. The past is the direction of time in which things were
more segregated, while the future is the direction in which they have smoothed themselves out.
It’s exactly the same thing that happens when a ice cube melts or milk spreads out into a cup
of coffee.
It is easy to see that this is consistent with the Second Law. Using eqs. (3.3.2) and (3.3.5),
we can associate an entropy with the system at every moment in time. A plot of the evolution
looks as follows
1.0
0.9
entropy
0.8
0.7
0.6
The setup is the same as before: a box of gas divided in two by a wall with a hole. However,
this time the hole comes with a tiny door that can be opened and closed without exerting a
noticeable amount of energy. Each side of the box contains an equal number of molecules with
the same average speed (i.e. same temperature). We can divide the molecules into two classes:
those that move faster than the average speed—let’s call them red molecules—and those that
move slower than average—called blue. At the beginning the gas is perfectly mixed (equal
numbers of red and blue on both sides, i.e. maximum entropy). At the door sits a demon, who
watches the molecules coming from the left. Whenever he sees a red (fast-moving) molecule
approaching the hole, he opens the door. When the molecule is blue, he keeps the door shut. In
this way the demon ‘unmixes’ the red and blue molecules, the left side of the box gets colder
and the right side hotter. We could use this temperature difference to drive an engine without
putting any energy in: a perpetual motion machine. Clearly this looks like it violates the Second
Law. What’s going on?
Maxwell’s demon and its threat to the Second Law have been debated for more than a century.
To save the Second Law there has to be a compensating increase in entropy somewhere. There
is only one place the entropy could go: into the demon. So does the demon generate entropy in
carrying out his demonic task? The answer is yes, but the way that this work is quite subtle
32 3. Statistical Mechanics
and was understood only recently (by Szilard, Landauer and Bennett).4 The resolution relies
on a fascinating connection between statistical mechanics and information theory.5
A partition (now without a hole) is placed in the middle. The molecule is on one side and the
other side is empty
The demon measures and records on what side of the partition the gas molecule is, gaining
one bit of information. He then pushes in the piston that closed off the empty half of the box
In the absence of friction, this process doesn’t require any energy. Note the crucial role played
by information in this setup. If the demon didn’t know which half of the box the molecule was
in, he wouldn’t know which piston to push in. After removing the partition, the molecule will
push against the piston and the one-molecule gas “expands”
4
One of the giants in this story wrote a nice article about it: Bennett, “Demons, Engines and the Second
Law”, Scientific American 257 (5): 108-116 (1987).
5
William Bialek recently gave a nice lecture of the relationship between entropy and information. The video
can be found here: http://media.scgp.stonybrook.edu/video/video.php?f=20120419 1 qtp.mp4
6
Szilard, “On the Decrease of Entropy in a Thermodynamic System by the Intervention of Intelligent Beings.”
3.4 Entropy and Information 33
In this way we can use the system to do useful work (e.g. by driving an engine). Where did the
energy come from? From the heat Q of the surroundings (with temperature T ),
The work done when the gas expands from Vi = V to Vf = 2V is given by a standard formula
in thermodynamics:
V
f
∆W = kT log = kT log 2 . (3.4.7)
Vi
Recall that dW = F dx = p dV , where p is the pressure of the gas. The integrated work done is
therefore Z Vf
∆W = p dV .
Vi
Using the ideal gas law for the one-molecule gas, pV = kT , we can write this as
Vf
kT V
Z
f
∆W = dV = kT log .
Vi V Vi
This completes one cycle of operation. The whole process is repeatable. Each cycle would allow
extraction and conversion of heat from the surroundings into useful work in a cyclic process. The
demon seems to have created a perpetual motion machine of the second kind.7 In particular,
in each stage of the cycle the entropy decreases by ∆S = ∆Q/T (another classic formula of
thermodynamics). Using ∆Q = −∆W , we find
∆S = −k log 2 . (3.4.8)
gained one bit of recorded information. The demon needs to erase the information stored in his
mind in order for the process to be truly cyclic. However, Rolf Landauer had shown9 in 1961
that the erasure of information is necessarily an irreversible process.10 In particular, destroying
one bit of information increases the entropy of the world by at least
∆S ≥ k log 2 . (3.4.9)
So here is the modern resolution of Maxwell’s demon: the demon must collect and store infor-
mation about the molecule. If the demon has a finite memory capacity, he cannot continue to
cool the gas indefinitely; eventually, information must be erased. At that point, he finally pays
the entropy bill for the cooling he achieved. (If the demon does not erase his record, or if we
want to do the thermodynamic accounting before the erasure, then we should associate some
entropy with the recorded information.)
Stephen Hawking thought that this was crazy! If black holes had entropy, they also had a
temperature, and you could then show that they had to give off radiation. But everyone knows
that black holes are black! Hawking therefore set out to prove Bekenstein wrong. But he failed!
What he found12 instead is quite remarkable: Black holes aren’t black! They do give off radiation
and do carry huge amounts of entropy.
The key to understanding this is quantum mechanics: In quantum mechanics, the vacuum is
an interesting place. According to Heisenberg’s uncertainty relation, nothing can be completely
empty. Instead particle-antiparticle pairs can spontaneously appear in the vacuum. However,
they are only virtual particles, living only for a short time, before annihilating each other
annihilation
particle
anti-particle
Most pop science explanations of this effect are completely wrong (try googling it!), so it is worth giv-
ing you are rough outline of the correct argument. We start with the following version of Heisenberg’s
uncertainty principle
~
∆E∆t ≥ .
2
This means the following:
To measure the energy of a system with accuracy ∆E, one needs a time ∆t ≥ ~
2∆E .
In other words, to decrease the error in the measurement, we perform an average over a longer time
period. But this increases the uncertainty in the time to which this energy applies.
Now consider a particle-antiparticle pair with total energy E spontaneously appearing out nothing:
energy
time
vacuum vacuum
The story changes if the particle-antiparticle pair happens to be created close to the event
horizon13 of a black hole. In that case, one member of the pair may fall into the black hole and
12
Hawking, “Black Hole Explosions?”
13
The event horizon is the point of no return. Nothing, not even light, can escape from inside the event horizon:
see Lecture 6.
36 3. Statistical Mechanics
disappear forever. Missing its annihilation partner, the second particle becomes real :
Hawking Radiation
An observer outside the black hole will detect these particles as Hawking radiation.
Analyzing this process, Hawking was able to confirm Bekenstein’s guess (3.5.10). In fact, he
did much more than that. He derived an exact expression for the black hole entropy
1A
SBH = , (3.5.11)
4 `2p
where `p is the Planck length, the scale at with the effects of quantum mechanics and gravity
become equally important (see Lecture 6). In terms of the fundamental constants of quantum
mechanics (~), relativity (c) and gravity (G), the Planck length is
r
~G
`p = ≈ 1.6 × 10−35 m . (3.5.12)
c3
Eq. (3.5.11) is a remarkable formula: it links entropy and thermodynamics (l.h.s.) to quantum
gravity (r.h.s.). It is therefore the single most important clue we have about the reconciliation
of gravity with quantum mechanics.
What are those microstates? They are not apparent in classical gravity (where a black hole has
no hair). Ultimately, they must be states of quantum gravity. Some progress has been made on
this in string theory, our best candidate theory of quantum gravity. In 1996, Andy Strominger
and Cumrun Vafa derived the black hole entropy from a microscopic counting of the degrees
of freedom of string theory (which are strings and higher-dimensional membranes). They got
eq. (3.5.11) on the nose, including the all important factor of 1/4.
4 Electrodynamics and Relativity
The first time I experienced beauty in physics was when I learned how Einstein’s special relativity
is hidden in the equations of Maxwell’s theory of electricity and magnetism. Today, it is my
privilege to show this to you. In the end, we will speculate how gravity might fit into this.
+ + + + + +
At a distance r, there is a point charge q travelling to the right at speed u < v. The charges are
close enough together so that we may regard them as continuous line charges with densities ±λ
(= charge/length). The net current to the right is
I = 2λv . (4.1.1)
Because the positive and negative line charges cancel, there is no electrical force on q (in the
rest frame of the wire).
Now consider the same situation from the point of view of an observer comoving with the
charge q (i.e. an observer travelling to the right with speed u):
+ + +
The velocity of the positive charges is now smaller than the velocity of the negative charges. The
Lorentz contraction of the spacing between negative charges is more severe than that between
positive charges; therefore, the wire, in this frame, carries a net negative charge! There is now
an electric force on the charge. In the box below I show you how to compute this force in terms
37
38 4. Electrodynamics and Relativity
of the charge density in the original frame. Transforming this force back to the original frame
leads to
µ0 I
F = −quB , where B ≡ . (4.1.2)
2πr
We have derived the Lorentz force using only electrostatics and a sequence of relativistic trans-
formations. I encourage you to look at the details of the calculation. It is neat.
Let us call the original frame S and the new frame S 0 . By the Einstein velocity addition rule, the
velocities of the positive and negative charges in S 0 are
v∓u
v± = . (4.1.3)
1 ∓ vu/c2
If λ0 is the charge density of the positive line charge in its own rest frame, then
1
λ± = ±(γ± )λ0 , where γ± = q . (4.1.4)
1 − v±2 /c2
• Gauss’ Law
(M1) simply states that “electric charges produce electric fields”. Specifically, the diver-
~ is determined by the charge density ρ (= charge/volume).
gence of the electric field E
• No Monopoles
Unlike electric charges, we have never observed isolated magnetic charges (“monopoles”).
Magnetic charges always come in pairs of plus and minus (or North and South). (If you
cut a magnet into two pieces, you get two new magnets, each with their own North and
South poles.) The net magnetic charge density is therefore zero. (M2) embodies this fact,
~
by having zero in place of ρ in the equation for the divergence of the magnetic field B.
• Ampère’s Law
It was known well before Maxwell that an electric current produces a magnetic field—
e.g. Oersted discovered that a magnet feels a force when placed near a wire with a current
~ ×B
flowing through it. However, the equation that people used to describe this, ∇ ~ = µ0 J,
~
was wrong. Maxwell pointed out that the original form of Ampère’s law was inconsistent
with the conservation of electric charge, eq. (C), (do you see why?). To fix this he added
an extra term to (M4). This extra term implies that a time-varying electric field also
produces a magnetic field. This led to one of the most important discoveries in the history
of physics.
1
See Example Sheet 3 of your course on ‘Vector Calculus’.
40 4. Electrodynamics and Relativity
• Conservation of Charge
Charges and currents source electric and magnetic fields. But, a current is nothing but
moving charges, so it is natural to expect that ρ and J~ are related to each other. This
relation is the continuity equation (C). Consider a small volume V (bounded by a surface S)
containing a total charge Q.
The amount of charge moving out of this volume per unit time equals the current flowing
through the surface:
∂
Z
Q = − dS ~ · J~ . (4.2.13)
∂t
Eq. (C) embodies this locally (i.e. for every point in space),
∂ ∂
Z Z Z
Q= ~ ~
dV ρ = − dV ∇ · J = − dS ~ · J~ , (4.2.14)
∂t ∂t
where we used Stokes’ theorem in the last equality.
Maxwell’s equations are ugly! As we will see, this is because space and time are treated
separately, while we now know that Nature is symmetric in time and space. By uncovering
this hidden symmetry we will be able to write the four Maxwell equations in terms of a single
elegant equation. In the process, we will develop a unified relativistic description of electricity
and magnetism.
~
Now take the curl of the curl equation for the E-field, ~
i.e. ∇×(M30 ),
2~
~ × (∇
∇ ~ =−∂∇
~ × E) ~ = −µ0 0 ∂ E ,
~ ×B (4.2.15)
∂t ∂t2
where we have used (M40 ) in the final equality. We use a vector identity to manipulate the
l.h.s. of this expression,
~ × (∇
∇ ~ × E)
~ = ∇(
~ ∇
~ · E)
~ − ∇2 E
~ (4.2.16)
~ ,
= −∇2 E (4.2.17)
~
∂2E
µ0 0 ~ =0.
− ∇2 E (4.2.18)
∂t2
~
This partial differential equation is the wave equation for the E-field.2 Try substituting
~ x) = E ~ 0 sin 2π t x
E(t, − , (4.2.20)
T λ
where T and λ are constants. You will find that this is a solution of (4.2.18), if
λ 1
v≡ =√ , (4.2.21)
T µ0 0
The solution repeats with a time period T . Hence, (4.2.20) indeed describes a wave propagating
along the x-axis with speed v = λ/T given by (4.2.21).
As boring as eq. (4.2.21) looks, it is an absolutely remarkable result! Through the genius of
Einstein it led to the special theory of relativity.
2 ~
Similar manipulations for the curl equation for the B-field, ~
eq. (M40 ), give the wave equation for the B-field
~
∂2B
µ0 0 ~ =0.
− ∇2 B (4.2.19)
∂t2
42 4. Electrodynamics and Relativity
1
c≡ √ = 3 × 108 m/s . (4.2.22)
µ0 0
But, this is just the speed of light. Electromagnetic waves are light!
Eq. (4.2.22) looks innocent, but notice that I never mentioned the speed of the observer.
According to Maxwell’s equation,
This flies in the face of Newton’s law of velocity addition. The light emitted from a moving
spaceship travels at exactly the same speed as the light from a stationary source. This means
that you can never catch up with a light ray. No matter how fast you run after a light ray, it
will always recede from you at speed c. It would have been easy to dismiss this craziness as a
flaw of Maxwell’s theory. However, Einstein dared to accept this strange feature of light as a
fundamental principle of Nature and built his theory of relativity around it.
∆x ∆x0
c= = . (4.2.23)
∆t ∆t0
Eq. (4.2.23) can be rewritten as follows
In fact, even for objects not moving at the speed of light, observers will disagree on ∆x and ∆t,
but will always agree on the combination
More important than the fact that space and time are relative, is the fact that space and time
are related by a special symmetry that guarantees the invariance of ∆s2 . This symmetry is
called Lorentz symmetry.
As you know, an elegant way to make this symmetry manifest is to combine space and time
coordinates into a single four-dimensional spacetime coordinate
time
t , ~x ⇒ Xµ = . (4.2.26)
space
4.3 Relativity unifies Electricity and Magnetism 43
s2 ≡ ηµν X µ X ν , (4.2.27)
Eq. (4.2.27) uses the Einstein summation convention, so that repeated indices are summed
over. You should remember that the coordinates of different observers are related by a Lorentz
transformation
Xµ0 = Λν µ (v) Xν . (4.2.29)
You can think of these as “rotations” in the 4d spacetime, that leave s2 = X µ Xµ the same. This
is analogous to 3d spatial rotations
x0i = Rj i (θ) xj , (4.2.30)
~ ·B
∇ ~ =0. (4.3.31)
~ =∇
B ~ ×A
~ . (4.3.32)
~ ×E
Next, we use eq. (4.3.32), to rewrite Faraday’s law for the electric field, eq. (M3), ∇ ~ = −B,~˙
as
~˙ = 0 .
h i
~ × E
∇ ~ +A (4.3.33)
Since “curl grad = 0”, we can write E ~ +A~˙ as the gradient of a scalar function. The Maxwell
~ is written as
equation (M3) is hence solved if E
~ = −∇φ
E ~˙ .
~ −A (4.3.34)
44 4. Electrodynamics and Relativity
~ is not unique. We can add the gradient of any scalar function α(t, ~x) to
The choice of vector field A
~
A without changing the magnetic field B ~ =∇ ~ ×A~ (since “curl grad = 0”),
~ 7→ A
A ~0 = A
~ + ∇α
~ . (4.3.35)
~ = −∇φ
The electric field E ~ −A~˙ also remains unchanged under the transformation (4.3.35), if φ
simultaneously transforms as
φ 7→ φ0 = φ − α̇ . (4.3.36)
~ and B.
We hence have the freedom to pick any scalar α without changing E ~ This is equivalent to
~ ~
the freedom of fixing ∇ · A, since
~ ·A
∇ ~0 = ∇
~ ·A
~ + ∇2 α . (4.3.37)
A particularly useful choice is the so-called Lorenz gauge
∇ ~ = − φ̇ .
~ ·A (4.3.38)
c2
We have shown that the two Maxwell equations (M2) and (M3) are automatically satisfied if
the two vector fields E~ and B ~ are expressed in terms of a scalar field φ and a new vector field A.
~
But, a scalar and a vector can be combined into a four-vector. Just like t and ~x were combined
into X µ = (ct, ~x). Let’s try and define the vector potential
electricity
~
φ, A ⇒ Aµ = . (4.3.39)
magnetism
A single four-vector describes both electric and magnetic fields. As we will see in Lecture 5, in
quantum mechanics, Aµ is actually more fundamental than E ~ and B.
~
Aµ → A0µ = Aµ − ∂µ α , (4.3.40)
∂ µ Aµ = 0 . (4.3.42)
Two Maxwell equations are automatically satisfied, but what about the other two? (M1) and
(M4) have source terms: the charge density ρ (a scalar) and the current density J~ (a vector).
Again, a scalar and a vector make a four-vector
electricity
ρ , J~ ⇒ Jµ = . (4.3.43)
magnetism
4.3 Relativity unifies Electricity and Magnetism 45
As I show in the insert below, expressed in terms of the four-vectors Aµ and J µ , the Maxwell
equations (M1) and (M4), unify into a single pretty equation3
2Aµ = µ0 J µ , (M5)
2
where 2 is shorthand for η µν ∂µ ∂ν = c12 ∂t
∂
2 − ∇ . Notice that 2 is the same combination that
2
appears in the wave equation (4.2.18). Hence, there will be similar wave solutions for Aµ .
∇ ~ =−∂∇
~ ·E ~ − ∇2 φ = ρ ,
~ ·A (4.3.44)
∂t 0
or !
1 ∂2
2 ρ ∂ ~ + φ̇
~ ·A
−∇ φ= + ∇ . (4.3.45)
c2 ∂t2 0 ∂t c2
~ ×B
∇ ~ =∇
~ × (∇
~ × A) ~¨ − ∇
~ = µ0 J~ + 0 µ0 (−A ~ φ̇) . (4.3.47)
~ × (∇
Using the identity ∇ ~ ×A
~ ) = ∇(
~ ∇~ ·A
~ ) − ∇2 A,
~ we get
!
1 ∂2
2 ~ ~ ~ ~ ~ φ̇
− ∇ A = µ0 J − ∇ ∇ · A + 2 . (4.3.48)
c2 ∂t2 c
~ = µ0 J~ .
2A (4.3.49)
Hence,
2Aµ 7→ 20 Aµ0 = Λµ ν 2Aν . (4.3.53)
Combining (4.3.53) and (4.3.51), we see that if (M5) holds in the frame S 0 , then it also holds in
the frame S,
20 Aµ0 = µ0 J µ0 ⇔ 2Aµ = µ0 J µ . (4.3.54)
This is the defining feature of a theory that is consistent with Einstein’s relativity. Its equations
take the same form in all inertial frames.4
It is remarkable that Maxwell’s theory automatically was of that form and didn’t need any
fixing. This isn’t true for many other theories. Consider, for example, Coulomb’s law
d2 x 1 qQ
m 2
= . (4.3.55)
dt 4π0 x2
No matter how much massaging you do, you will never be able to write this as an equation
involving only four-vectors. It also easy to see that (4.3.55) violates the basic principle of
relativity that nothing travels faster than the speed of light. Changing the source at x = 0
instantly affects the solution everywhere in space. The famous Coulomb’s law is therefore only
an approximation. The exact result follows from Maxwell’s equations.
15 = 10 + 4 + 1 . (4.4.56)
4
These days we usually reverse the logic. When we come up with new theories, we don’t dare to write down
anything that doesn’t have Lorentz symmetry.
4.4 More Unification?∗ 47
Kaluza noticed that 15 is the number of independent components of a 5×5 symmetric matrix. A
5×5 symmetric matrix GM N has enough room to fit both the vector potential and the spacetime
metric5
gravity
G00 G01 G02 G03 G04
G10 G11 G12 G13 G14
G20 G21 G22 G23 G24 = (4.4.57)
G30 G31 G32 G33 G34
G40 G41 G42 G43 G44
electromagnetism
GM N = (4.4.58)
Kaluza went on to speculate about the physical meaning of the object GM N . He suggested
it may be the metric of a five-dimensional spacetime. More than that: he showed that the
equation of gravity in 5D could be split into the equations of Einstein’s theory gravity in 4D
and Maxwell’s theory of electromagnetism:
Einstein was intrigued that gravity in five dimensions unifies gravity and electromagnetism in
four dimensions.
Kaluza’s theory had an obvious flaw. The world isn’t five-dimensional! Or, is it? In 1926,
Oscar Klein pointed out that we wouldn’t notice that the world is five-dimensional if one of the
space dimensions, say x4 , is curled up into a small circle:
5
In fact, GM N has room to spare for an extra scalar field S(X). We will ignore this detail.
48 4. Electrodynamics and Relativity
Now there is enough room to unify all forces of Nature: gravity, electromagnetism, weak and
strong nuclear force all become one. Unfortunately, we haven’t yet found ways to test the theory
experimentally. It therefore remains an intriguing, but highly speculative, endeavor.
5 Particle Physics
This lecture is about particle physics, the study of the fundamental building blocks of Nature
and the forces between them. We call our best theory of particle physics the Standard Model
(in capital letters). This pompous name reflects our confidence in the theory. Recently, the
Large Hadron Collider (LHC) at CERN discovered the last puzzle piece of the Standard Model:
the Higgs particle. In this lecture, you will learn why the Higgs is so important.
49
50 5. Particle Physics
p = (uud) ,
n = (ddu) .
So we’ve reduced the 110 elements in the periodic table to just 3: an electron and two quarks.
In fact, there’s one further particle that we should add called the (electron) neutrino (νe ). It
doesn’t live inside atoms, but it is created in certain radioactive processes, most notably in beta
decay and inside the Sun. Right now, about 60 billion of them are streaming through every
square centimetre of your body every second. Virtually all of them sail straight through you,
which is why you’ve never seen or felt one.
These four particles give us a new periodic table:
charge mass
+ 23 u ∼4
− 13 d ∼8
−1 e 1
0 νe ∼0
That’s pretty nice. Four is a good number. Except that Nature did something very odd, and no
one really knows why. It repeated this pattern twice more. These copies (called “generations”)
have exactly the same properties as the first lot—which means the same charge, and the same
properties under other forces—but their masses are different.
charge mass mass mass
u c t
d s b
e μ τ
νe νμ ντ
Why does this happen? We have absolutely no idea. Not a whole lot would change if this wasn’t
the case.
The masses of these particles look rather random. Why these numbers? Notice that the
mass of the up quark is smaller than that of the down quark, while it is the opposite in the
other two generations. This is one of the most important inversions in science! It means that
neutrons are heavier than protons. As a consequence, if neutrons are left on their own, they
decay into protons in about 14 minutes. If this down/up mass ratio was inverted then protons
would decay into neutrons instead; the nuclei of atoms would be unstable and there wouldn’t
be any interesting chemistry to speak of. There would then also be no biology and no life. So
5.2 From Fields to Particles 51
it’s indeed an important property. But is it an important question?!! Should we try to answer
it, or is it just one of those fortunate accidents of life that we have to accept?
Finally, for each type of particle there is a corresponding anti-particle—i.e. particles with the
same mass, but opposite charge. This doubles the number of particle species. When a particle
and an anti-particle come in contact, they annihilate. This process was particularly efficient in
the early universe when the density of particles was very high. In fact, it is a big mystery why
there is any matter left at all. Something in the early universe must have created an imbalance
between matter and anti-matter, so that some matter survived the annihilation process. We
don’t know how that happened.
• Gravity
We experience gravity every day, so it may come as a surprise that we understand it the
least. For example, we don’t understand why gravity is so much weaker than all other
forces in Nature. A small magnet can lift a paper clip off the table, thereby overcoming the
gravitational pull of the entire Earth! Gravity is an extremely weak force. The electric force
between two electrons is a factor of 1040 times larger than their gravitational attraction.
This is why gravity can usually be completely ignored when discussing elementary particles
and atoms. Only when an enormous number of particles is combined, e.g. to form a planet
like the Earth, does gravity become important. But, fundamentally gravity is an extremely
weak force. And we have no idea why!
about Nature to the fledgling civilization that starts again. What would this be? Obviously it
can’t be complicated mathematics—it has to be something simple that can be explained in plain
language. Feynman’s answer was that he would tell them about atoms. Or, more precisely, he
would tell them that matter was discrete, made up of fundamental building blocks. (He would
then hope that they would be smart enough to come up with quantum mechanics.)
However, in the last few decades, we have come to appreciate that particles and atoms aren’t
the most fundamental objects in the universe. It is fields. One of the key breakthroughs of the
20th century was the realization that
there is a different kind of field for each particle in Nature.
An electron field, several quark fields, and so on. What happens is that ripples in this field get
tied up in knots by virtue of quantum mechanics. It is those ripples that we identify as particles.
weak bosons : W+ , W− , Z .
What do the SU (2) matrices of the weak force field act on? It has to be 2-component vectors
made out of the matter particles of the Standard Model. Looking at our periodic table, an
obvious guess is to pair the two quarks and the electron and the neutrino
! !
u e
Q= L= .
d νe
The SU (2) matrices of the weak force indeed act on these vectors. (The decay of the down quark
into the up quark—which happens when the neutron decays into the proton—can therefore be
thought of as a rotation!)
Similarly, the strong force is associated with matrices of type SU (3) (i.e. 3×3 unitary matrices
with determinant one). How many such matrices are there? Eight! (The Gell-Mann matrices.)
There are therefore eight force particles for the strong force,
gluons : g1 , g2 , · · · , g8 .
What do the SU (3) matrices of the strong force field act on? The strong force only acts on the
quarks, so we are looking for 3-component vectors with quarks as its entries. This time looking
at the periodic table doesn’t reveal an obvious pairing. Instead we have to postulate that each
quark comes in three flavors:
red, blue, and green .
(This property of quarks is called the colour charge and the theory that describes it is called
Quantum Chromo-Dynamics or QCD.) This triplet of colours can be arranged into a vector,
e.g. for the up and down quarks
uR dR
u = uB d = dB .
uG dG
The SU (3) matrices of the strong force act on these colour-vectors of quarks. They don’t act
on the leptons.
In this language, the electromagnetic force is associated with “matrices” of type U (1) (complex
numbers of unit norm). There is only one such matrix, and hence only one type of photon.
In summary, the forces of the Standard Model are determined by the matrices (or symmetry
groups)
U (1) × SU (2) × SU (3) .
1
For more on SU (n) matrices see Example Sheet 4, Problem 12 of your ‘Groups’ course.
54 5. Particle Physics
For each “matrix field” we have a “force particle”: a photon for the electric force, eight gluons
for the strong force, and three vector bosons for the weak force. We add these to our periodic
table:
matter (fermions)
u c t
quarks
s
forces (bosons)
d b
e μ τ
leptons
νe νμ ντ
time
vacuum vacuum
I will now show you that the same mechanism explains the forces of the Standard Model.
To warm up, consider a stationary electron with rest mass energy me c2 (even if the electron
is moving, you can always go to its rest frame). Now imagine that the electron spontaneously
emits a photon of energy Eγ . To conserve momentum, the electron has to recoil with momentum
equal and opposite to that of the photon. The combined electron+photon state has energy
E = me c2 + Eγ + KEe ,
| {z }
∆E
where KEe is the kinetic energy of the recoiling electron. In classical physics, this process is
clearly impossible since it violates the conservation of energy. However, in quantum mechanics,
5.4 From Virtual Particles to Real Forces 55
energy
time
In other words, a free electron can emit a virtual photon as long as it reabsorbs the photon and
returns to its original state within a time
~
∆t < .
2∆E
time
space
The vertical parts of the diagram correspond to the electron remaining at the same point. (If
the electron was moving with a steady speed, these lines would be tilted at an angle to the
vertical.) The loop in the middle corresponds to the virtual state.
Now let us consider two electrons approaching each other with constant speed.2 At some
moment t1 , one of the electrons emits a virtual photon. To conserve momentum, the electron
has to recoil. Moreover, according to the Heisenberg uncertainty principle, the virtual photon
must be reabsorbed within ∆t = t2 − t1 < 2∆E~
. However, this time, instead of the first electron
2
They may or may not be heading straight towards each other. On our schematic Feynman diagrams we only
have one space dimension and so both possibilities would look similar.
56 5. Particle Physics
reabsorbing the photon, the second electron can absorb the photon:
time
space
Provided this happens in such a way that the total energy of the two electrons before and after
the intermediate state is the same, no quantum rule of physics is violated. The second electron
recoils to the right because of the momentum it picks up from the photon. The Feynman diagram
of the process looks as if the particles have repelled each other. So in quantum mechanics we
can think of the electromagnetic force between two electrons as arising from the exchange of
virtual photons. To understand the origin of attractive forces in this way requires more quantum
mechanics than we want to get into right now. But that is details, the idea is the same.
5.4.3 QED
The photon exchange can happen more than once
+ + ...
Also, while in-flight, the photon may spontaneously turn into a electron-positron pair and back
Quantum mechanics associates a probability amplitude with each of these little diagrams.3
However, it turns out that in Quantum Electro-Dynamics (QED)
So to a good approximation we get away with only the single photon exchange diagram. In a
few years, you will be able to calculate the probability amplitude for that diagram and show
that it is equivalent to the Coulomb force.
3
Compare this to Lecture 1, where we saw that a particle has a probability amplitude for every possible path.
Here, there is a probability amplitude for every possible diagram that starts and ends with two electrons. Add
up all the amplitudes and square the result to get the probability for the process to happen.
5.4 From Virtual Particles to Real Forces 57
But there are crucial differences between numbers (photons) and matrices (gluons): first, there
is only one photon, but there are eight gluons. Second, being matrices these gluons can interact
with each other (while photons don’t). So we can have very complicated diagrams, like this one
This means that the Feynman diagram way of calculating things in QCD becomes completely
useless. Unlike QED, in QCD we can’t just draw a few simple diagrams and get a good ap-
proximation to the exact answer. The difference between QED and QCD is like the difference
between4
1 1 2 1 3
2 + (2) + (2) + · · · = 1
and
2 + 2 2 + 23 + · · · = ∞ .
Nobody has ever seen a free quark. Instead, quarks always come in pairs or triplets and
resist being separated. In contrast, when two electrically-charged particles separate, the electric
fields between them diminish quickly, allowing (for example) electrons to become unbound from
atomic nuclei. However, as two quarks separate, the interacting gluons form narrow tubes (or
strings), which tend to bring the quarks together as though they were some kind of rubber band.
4
In the first case (QED), keeping only the first few terms in the sum (i.e. the simplest diagrams) gives an
accurate approximation to the true answer. In QCD, you need to understand how to deal will all diagrams at
once.
58 5. Particle Physics
This is quite different in behaviour from electrical charge. The force between quarks does not
diminish as they are separated. In fact, it increases with distance. Because of this, it would take
an infinite amount of energy to separate two quarks; they are forever bound into protons and
neutrons.6 This phenomenon is called confinement. To prove confinement from the equations
of QCD is one of the most important open problems in theoretical physics and mathematics. In
fact, the Clay Institute will pay you $1 million7 if you solve it!
proton
5
http://www.physics.adelaide.edu.au/theory/staff/leinweber/VisualQCD/Nobel/
6
This also implies that when two quarks fly apart, as happens in particle accelerators, at some point it is
energetically favourable for a new quark-antiquark pair to spontaneously appear, rather than to allow the tube
to extend further. As a result, when quarks are produced in accelerators, instead of seeing the individual quarks
in detectors, scientists see “jets” of quark pairs (mesons) and triplets (baryons).
7
http://www.claymath.org/millennium/Yang-Mills Theory/
5.5 The Origin of Mass 59
An up quark turns into a down quark, while emitting a W + particle. The W + then decays
into a positron (the anti-particle of the electron) and a neutrino. This process is of fundamental
importance for life on Earth. Without it, the Sun wouldn’t be shining.
As you know, the Sun shines because of nuclear fusion—light nuclei combine into heavier nu-
clei, while releasing energy in the process. The simplest fusion process is two protons combining
into a helium nucleus. However, there is a puzzle. Protons are positively charged, so they don’t
like being pushed together. In fact, one can show that the temperature inside the Sun isn’t high
enough, and hence the protons aren’t moving fast enough, to overcome the electric repulsion.
The weak force comes to the rescue. Because of the W particle, one of the protons in the colli-
sion can convert into a neutron. The newly formed neutron and the remaining proton can get
very close, because the neutrons doesn’t carry electric charge. Freed from the electromagnetic
repulsion they can fuse together (as a result of the strong force) to form deuteron. This quickly
leads to helium formation, releasing life-giving energy in the process.
5.5.1 An Analogy
The Higgs mechanism isn’t easy to explain without some additional training in physics and
mathematics. I will start with a popular analogy. In §5.5.2, I will give a few more details.
Imagine that a room full of physicists, all talking to their nearest neighbours
When a famous scientist enters the room, the physicists cluster around him, slowing the
scientist’s progress:
In much the same way, the Higgs field becomes locally distorted whenever a particle moves
through it. The distortion—the clustering of the field around the particle—generates the parti-
cle’s mass.8
Higgs Boson
Now consider a rumour passing through our room full of uniformly spread physicists
Those near the door hear of it first and cluster together to get the details, then they turn and
move closer to their next neighbours who want to know about it too. A wave of clustering
passes through the room. Since the information is carried by clusters of people, and since it was
8
The idea comes directly from the physics of solids. Instead of a field spread throughout all space, a solid
contains a lattice of positively charged crystal atoms. When an electron moves through the lattice the atoms are
attracted to it, causing the electron’s effective mass to be as much as 40 times bigger than the mass of a free
electron. The postulated Higgs field in the vacuum is a sort of hypothetical lattice which fills our universe.
5.5 The Origin of Mass 61
clustering which gave extra mass to the famous scientist, then the rumour-carrying clusters also
have mass. The Higgs particle is just such a clustering in the Higgs field.9
Higgs Mechanism
In the early universe, the value of the Higgs field was zero everywhere. All the particles of the
Standard Model were therefore massless and travelled at the speed of light. As the universe
expanded it cooled. At some critical point it became cold enough for the Higgs field to condense.
Since then, the Higgs field has a constant value throughout space. The particles that interact
with the Higgs field became massive.
Some particles do not interact with the Higgs. This is the case for photons, gluons and
gravitons. The most likely path of these particles between two points is therefore a straight line:
massless particle
A light particle (e.g. the electron) has only a small probability to interact with the Higgs field.
This interaction deflects the particle:
Higgs
light particle
A heavy particle (e.g. the top quark or the W and Z bosons) is more likely to interact with the
Higgs field and therefore experiences more deflections:
heavy particle
Particles get their masses because interactions with the Higgs field force them to zig-zag their
way through space. All particles of the Standard Model get their masses this way. Different
particles get stuck in the Higgs condensate by different amounts. But why? What gives rise to
the vast difference in masses of the particles? We don’t know.
Higgs Boson
There is a particle for every field. The Higgs field is no different, so we expect a particle to
correspond to an excitation of the Higgs field. Unsurprisingly, this is called the Higgs particle,
or Higgs boson. It will be much easier to believe that the Higgs field exists, and that the
mechanism for giving other particles mass is true, if we actually see the Higgs particle itself.
The search for the Higgs boson has occupied experimental particle physicists for the past few
decades. Last year, they found it!
9
Again, there are analogies in the physics of solids. A crystal lattice can carry waves of clustering without
needing an electron to move and attract the atoms. These waves can behave as if they are particles. They are
called phonons.
62 5. Particle Physics
It accelerates protons and anti-protons around an underground ring seventeen miles in circum-
ference.
A number of accelerating structures boost the energy of the particles along the way until they
reach speeds close to the speed of light and energies close to 14 TeV (centre of mass)—15,000
times the energy in the mass of a proton. The beams travel in opposite directions in separate
beam pipes—two tubes kept at ultra-high vacuum. They are guided around the accelerator
ring by a strong magnetic field, achieved using superconducting electromagnets. These are
built from coils of special electric cable that operates in a superconducting state, efficiently
5.5 The Origin of Mass 63
conducting electricity without resistance or loss of energy. This requires cooling the magnets to
about −271◦ C or two degrees above absolute zero.
The particles collide at two locations in the ring, where two huge detectors—called ATLAS
and CMS—record the debris
About 600 million collisions occur per second! This results in 10 Petabytes of data per year
(= 20 km high stack of CDs!).
64 5. Particle Physics
On July 4th, ATLAS and CMS announced the discovery of a “Higgs-like” particle with a mass
133 times the mass of the proton (126 GeV):
1500
Events
1000
Data
500 S+B Fit
B Fit Component
0
110 120 130 140 150
Mass
They are still doing more checks, but we all know it is the Higgs. This completes the periodic
table of the Standard Model:
matter (fermions)
u c t
quarks
s
forces (bosons)
d b
e μ τ
leptons
Higgs boson
νe νμ ντ
5.6 Beyond the Standard Model∗ 65
There are reasons to believe that the answer is no. For example, cosmological observations
suggest that the particles of the Standard Model make up less than 15% of the total matter in
the universe. The majority is in some form of dark matter (see Lecture 7). But we have no idea
what the dark matter is. A satisfactory theory of particle physics should explain this.
5.6.2 Supersymmetry
A popular extension of the Standard Model is Supersymmetry (SUSY). For reasons that I don’t
have time to explain, SUSY overcomes a number of theoretical shortcomings of the Standard
Model.10 According to SUSY, every particle of the Standard Model has a hidden partner.
Essentially, SUSY proposes a doubling of our periodic table:
u c t t c u squarks
quarks
d s b
gauge bosons
b s d
gauginos
e μ τ τ μ e
sleptons
leptons
With some luck the LHC could discover a SUSY shadow world. Moreover, the lightest super-
symmetric partner of the Standard Model would be stable and could be the dark matter! So
things could fall into place. Or maybe they won’t—the next few years will tell. Stay tuned ...
10
Essentially, without SUSY it is hard to understand why the Higgs particle wouldn’t be much heavier than it
is found to be.
6 General Relativity
Today, we are going to talk about gravity as described by Einstein’s general theory of rela-
tivity.
We start with a simple question:
We notice that the mass m cancels, so the acceleration doesn’t depend on it. Heavier
objects feel a larger gravitational force, but they also experience more inertia. The two
effects exactly cancel. Einstein wouldn’t be Einstein if he didn’t question this seemingly
obvious fact. He noticed that the meaning of ‘mass’ on the left-hand side and the right-
hand side of (6.0.1) is very different. We should really distinguish between the two masses
by calling them something different:
mg M
mi a = G . (6.0.2)
r2
The inertial mass is labelled mi , while the gravitational mass is denoted mg . You should
think of mg more like charge in Coulomb’s law, F ∝ qQ/r2 , and you wouldn’t be tempted
to cancel mi and q in
1 qQ
mi a = . (6.0.3)
4π0 r2
66
67
This makes it clear that we should really think of mi and mg as distinct entities. The
gravitational mass mg is a source for the gravitational field (just like the charge q is a
source for an electric field), while the inertial mass mi characterizes the dynamical response
to any forces. It is a non-trivial result, that experimentally one finds
mi
= 1 ± 10−13 . (6.0.4)
mg
This equality of inertial and gravitational mass is responsible for the well-known fact that
objects with different masses fall at the same rate under gravity.
But, is there a deeper reason why the gravitational force is proportional to the inertial
mass?
68 6. General Relativity
Albert Einstein
The path of the particle appears to be curved in response to those forces. However, for both
of these forces we understand very well why they are proportional to the inertial mass mi . It’s
because these are “fictitious forces”, arising in a non-inertial frame (rotating with frequency ω ~ ).
To an outside observer in an inertial frame there is no force on the particle and it travels in a
straight line. Could gravity also be a fictitious force, arising only because we are in a non-inertial
frame?
In other words, there is no experiment you could perform that would distinguish a rocket ac-
celerating at g = 9.8 m/s2 in empty space, from a stationary rocket in the gravitational field of
the Earth:
6.1 The Happiest Thought 69
There are some obvious difficulties with trying to identify gravitation with acceleration. In
Cambridge, it feels as if we’re accelerating upwards. The people in New Zealand feel as if
they’re accelerating in the opposite direction. Why aren’t we getting further apart?! Moreover,
if two balls are dropped from different heights, the lower ball will accelerate faster in gravity due
to the 1/r2 law. This seems to give a way to experimentally determine the difference between
acceleration and gravitation.
However, this is not in conflict with the equivalence principle because of the careful choice
of the word local. Here, “local” means in a small enough region of spacetime, where “small”
is relative to the scale of variation of the gravitational field. The equivalence principle simply
states that at every point in spacetime, we can remove or create the gravitational field by going
to an accelerating frame.1 Patching together different frames at each point in spacetime gives
rise to the curvature of spacetime. But this is getting a bit ahead of ourselves ...
... instead, let’s follow the equivalence principle to derive a few interesting experimental
predictions:
• Bending of light
An immediate consequence of the equivalence principle is that light bends in a gravitational
field:
Imagine shinning a beam of light from one side of the rocket to the other. During this
time, the accelerating rocket moves forward, so the light reaches the second wall slightly
below the height that it left the first wall (see the figure, if the words aren’t clear enough).
By the equivalence principle, the same has to be true for the stationary rocket on the
surface of the Earth. Light has to bend in a gravitational field. And it does!
1
Recall from Lecture 4, that in special relativity we can remove or create a magnetic field by boosting to
another inertial frame. Gravity is similar and arises from going to a non-inertial frame.
70 6. General Relativity
General relativity’s prediction for the bending of light was first confirmed by Eddington,
who observed the deflection of starlight passing close to the Sun during a solar eclipse:2
Since then we have observed many examples of the gravitational bending of light, such as
gravitational lensing of entire galaxies by other galaxies along the line-of-sight:
• Gravitational redshift
Consider what happens when we shine the light along the rocket, rather than across. Let
the rocket have height h. It starts from rest, and moves with constant acceleration g.
Light emitted from the bottom of the rocket at t = 0 is received at the top at t = h/c. By
this time, the rocket is travelling at speed
v = gt = gh/c . (6.1.7)
2
To measure this effect one has to compare the relative positions of stars on the night sky (i.e. in the absence
of the Sun) to the shifted positions during the eclipse (i.e. in the presence of the Sun).
6.1 The Happiest Thought 71
Due to the Doppler effect, the received frequency f 0 is smaller than the emitted frequency f ,
by an amount
0
v gh
f =f 1− =f 1− 2 . (6.1.8)
c c
Since f = c/λ, the observed light is more red than the emitted light, λ0 > λ. We say that
the light is redshifted.
By the equivalence principle, the same effect must be observed for light “climbing out”
of a gravitational potential well.3 The gravitational field Φ gives rise to the acceleration
~
~a = −∇Φ. We fix the gravitational potential at the bottom of the rocket to be Φ ≡ 0. For
constant acceleration a = −g, we then find that the gravitational potential at the top of
the rocket is
Φ = gh . (6.1.9)
Comparing this to (6.1.8) gives the formula for gravitational redshift
0 Φ
f =f 1− 2 . (6.1.10)
c
From this we can derive our third, and most dramatic, prediction.
Φ −2
0 2 2 2 2Φ
(∆t ) = (∆t) 1 − 2 ≈ (∆t) 1 + 2 . (6.1.11)
c c
In the second equality we have assumed4 Φ c2 , which holds for a weak gravitational
field. This result (6.1.11) holds not just for light. In general, time goes slower at the
bottom of the rocket or, by the equivalence principle, closer to the Earth, ∆t < ∆t0 . If
you stay at the bottom of the rocket, or closer to the Earth, you will live longer than your
friend at the top of the rocket, or further from the Earth.5 If you choose to spend your
life on the ground floor, and never climb the stairs, you will live longer by a couple of
microseconds.6
We have set Φ ≡ 0 at infinity, so ∆t is the time interval measured by an observer far from any
gravitational sources. Observers close to gravitational sources will measure ∆t0 < ∆t. Something
special happens at the so-called Schwarzschild radius
2GM
rS = , (6.1.13)
c2
where the factor in (6.1.12) vanishes. Objects whose size is smaller than rS are black holes:
Notice that the Schwarzschild radius of an object depends on its mass. For the Earth, this
distance is 1 cm. For the Sun, it is about 1 km. This is how much the mass of the Earth or the
Sun would have to be compressed before they would form black holes.
Imagine you and your friend meet at infinity (or any other place sufficiently far from gravi-
tational sources). Your friend then decides to jump into a black hole. She sends back signals
at regular intervals ∆t0 (on her clock), which you measure at intervals ∆t (on your clock). As
she gets closer to rS her signals arrive less and less frequently, ∆t ∆t0 . In fact, at r = rS , we
get ∆t → ∞ for fixed ∆t0 . You will never see her crossing the event horizon at rS . Her image
will be infinitely redshifted and she will appear frozen in time (and space). What happens to
her at rS ? Nothing special. For a sufficiently large black hole7 she won’t feel anything when she
crosses rS . She might not even know that it happened. But she will feel it soon enough. As
she approaches the centre of the black hole the gravitational force on her feet will become much
larger than the force on her head. (I forgot to tell you that she decided to jump in feet first.)
She will get stretched more and more. The technical term for this is spaghettification. It is not
a happy ending.
6.2.1 Spacetime
We can write down a metric, akin to the Minkowski metric that you’ve met in special relativity.
But this time, the metric will depend on where in space we are and what matter is around.
7
There is a black hole at the centre of our galaxy. Its mass is 4.1 million solar masses (or 8.2 × 1036 kg) and
its radius is 6.25 light-hours. If the mass was uniformly distributed inside the event horizon, the density would
be that of water.
6.2 Gravity as Geometry 73
We’ve already seen what the time component should look like:
2 2Φ(t, ~x) 2 2
ds = 1 + c dt + · · · (6.2.14)
c2
Since time and space mix under Lorentz transformations, the space part should also vary with
space. This means that measuring sticks contract and expand at different points in space. In
general, the metric of spacetime is a 4 × 4 matrix, whose components can be any function of t
and ~x
ds2 = gµν (t, ~x)dX µ dX ν . (6.2.15)
When the metric varies in space and time, the spacetime is curved. In general relativity, the
dynamics of gravity is encoded in the curvature of spacetime.
where Curvature is a complicated function of the metric gµν . The left-hand side takes the metric
and computes a local measure of the curvature of spacetime. The right-hand side expresses how
all the matter around sources the bending of spacetime. In short:
“ matter tells spacetime how to curve ”
where Gµν is the Einstein tensor and Tµν is the stress-energy tensor. This doesn’t look so bad, until
I tell you what Gµν is in terms of the metric. First, we write
1
Gµν ≡ Rµν − Rgµν , (6.2.18)
2
where Rµν is the Ricci tensor and R = g µν Rµν is the Ricci scalar. The Ricci tensor is defined as
1 λσ
Γλµν ≡ g (∂µ gνσ + ∂ν gσµ − ∂σ gµν ) . (6.2.20)
2
You can imagine what kind of mess you get when you plug (6.2.20) into (6.2.19) into (6.2.18), and
finally all into (6.2.17).
It shouldn’t come as a surprise that the Einstein equation (6.2.17)—coupled non-linear PDEs
for the 10 independent components of the metric—is very hard to solve. Only in a few special
cases are explicit solutions known:
8
This is analogous to the Maxwell equations which tell us how to calculate the electric and magnetic fields
sourced by a given distribution of charges and currents.
74 6. General Relativity
• The Schwarzschild solution describes the static and spherically symmetry spacetime in the
vacuum around an object of mass M
2GM −1 2
2 2GM 2 2
ds = 1 − c dt − 1 − dr − r2 (dθ2 + sin2 θdφ2 ) . (6.2.21)
rc2 rc2
This is the spacetime around a black hole, or outside a spherical star.
• Allowing space to expand, but assuming it is curved the same way at every point and in
all directions, leads to the Friedmann-Robertson-Walker solution
where a(t) is a monotonically increasing function of time associated with the expansion of
space. This metric describes the Big Bang. More about that in Lecture 7.
It’s simplest to describe this in terms of the principle of least action (see Lecture 1). We consider
all possible paths between two points:
This doesn’t quite yet look like the usual integral over kinetic minus potential energy, but we
will get there. Paths which minimise this action are called geodesics.
6.3 Gravitational Waves 75
where we have pulled out dt from under the square root and written ~x˙ for d~x/dt. Expanding
the square root for small velocities, ~x˙ 2 c2 , and weak gravity, Φ c2 , we find9
Z t2 h
m ˙2 i
S≈ dt ~x − mΦ(~x) + · · · . (6.2.26)
t1 2
Comparing this to the familiar “kinetic energy minus potential energy” for the non-relativistic
Lagrangian, we see that the time dilation in the metric induces exactly the gravitational potential
energy
V (~x) = mΦ(~x) . (6.2.27)
At lowest order in an expansion in ~x˙ 2 /c2 and Φ/c2 , general relativity therefore exactly reproduces
Newtonian gravity. However, for large velocities, ~x˙ 2 ∼ c2 , or strong gravity, Φ ∼ c2 , general
relativity predicts corrections to the old results: these are the dots in (6.2.26).
9
We have dropped the rest mass energy mc2 since it doesn’t affect the dynamics.
76 6. General Relativity
The experiments that try to detect these gravitational waves are quite fascinating.
In the LIGO experiment, test masses are separated by a distance of about 1 km. When a
gravitational wave passes by, the space in between the test masses stretches and contracts.
Powerful lasers are used to measure this change in the separation between the test masses. How
much does a typical gravitational wave change the length between the masses? 10−20 m !!! This
is a factor of 106 times smaller than the size of an atomic nucleus! That LIGO is capable to
measure such incredibly small changes in distance is astounding.
In the future, these experiments might go into space:
There the test masses can be separated by enormous distances (up to about 1 million km),
giving the experiments access to lower frequency gravitational waves.
6.4 Quantum Gravity∗ 77
Quantum mechanics (QM) and general relativity (GR) are the two pillars of modern physics.
QM applies to very small objects, like atoms and elementary particles, where the effects of
gravity are miniscule. GR, on the other hand, deals with very large objects, such as stars,
galaxies and the universe. On those scales, quantum mechanics plays no significant role. In
most situations, we therefore use either QM or GR, but not both together. The interior of
black holes and the Big Bang are two important exceptions. In these situations large masses
occupy small regions of space. GR and QM are then equally important. A naive treatment of
quantum field theory (QFT) for gravitons (the particles associated with the gravitational field,
cf. Lecture 5) blows up in our faces. If we apply the standard QFT methods to gravity, we find
negative probabilities and other nonsense.
These problems can be traced back to the fact that, in the standard approach, interactions
between particles occur at points in spacetime. If the fundamental objects of the theory instead
are extended objects—such as strings and membranes—then the interactions are smeared out
over a finite region of spacetime:
This cures the pathologies of the standard approach. String theory is the only example of a
sensible theory of quantum gravity. Unfortunately, at the time of writing, string theory hasn’t
provided testable predictions. It therefore remains a speculative proposal.
7 Cosmology
“I’m astounded by people who want to ‘know’ the universe when it’s hard enough to find your
way around Chinatown.” Woody Allen
We are going to finish this course with a big topic: the entire universe. The study of the
structure and evolution of the universe is called cosmology. Interestingly, all of the topics that
we have studied in this course come together in cosmology. To describe the universe we need
relativity, quantum mechanics, statistical mechanics, particle physics, and more.
Imagine you look at the night sky. You randomly select a patch of the sky only a fraction of
the size of the full moon. To the naked eye it will look pitch black:
You then decide to look at the same region of the sky with the Hubble Space Telescope. But,
instead of just looking at it for an instant, the telescope collects light for about 1 million
seconds (= 12 days).
78
79
The result is one of the most stunning astronomical images ever produced:
Every object in the picture is an entire galaxy! A few thousand of them, each containing billions
of stars similar to our Sun. Many of the stars have planets around them like our Earth. But,
now remember that this tiny patch of the sky was selected at random. Any other randomly
selected region in the sky would look essentially the same. From this we can estimate that the
observable universe contains about 350 billion large galaxies and about 7 trillion dwarf galaxies.
The total number of stars is about 30 billion trillion.
Here is the latest image of the entire observable universe:
The white dots are galaxies, the surrounding sphere is the last-scattering surface of the cosmic
microwave background. The goal of this lecture is to tell you where these structures came from.
80 7. Cosmology
Modern cosmology started with the striking observation that the universe is expanding. This
led to the Big Bang theory.2 The idea is simple: Everything is getting further apart. In the
past everything was therefore closer together. That’s it. Well, there is a little bit more: when
things get compressed they tend to heat up. Similarly, the early universe was much hotter and
denser than it is today:
In the earliest moments, the average particle energies exceeded those produced in the LHC by
many orders of magnitude. Going even further back in time, at some point the equations of
general relativity become singular (the technical term for “blow up in your face”). We label
this event the time t = 0. Space and time lose their familiar meanings at and before t = 0. We
sometimes call t = 0 the Big Bang (singularity).
On the other hand, from 10−10 seconds to today, the history of the universe is based on well
understood and experimentally tested laws of particle physics, nuclear and atomic physics and
gravity. We are therefore justified to have some confidence about the events shaping the universe
during that time.3 Table 7.1 shows key events in the thermal history of the universe. In the
following, I want to highlight two important events:
• Recombination
1
http://www.youtube.com/watch?v=5U1-OmAICpU
2
Notice that what we usually call the Big Bang theory has nothing to say about how the universe started. Also,
do not think of the Big Bang as an explosion. The Big Bang occurred everywhere at the same time. Every point
is space is equivalent to every other point. There is no centre of the universe, although it looks as if everything
is moving away from us. But the same is true for any other observer in a galaxy far, far away.
3
I will indulge in some speculations about what happened at even earlier times in §7.3.
7.1 The Big Bang 81
Time Energy
−43 18
Planck Epoch? < 10 s 10 GeV
−34
Inflation? & 10 s . 1015 GeV
−10
Matter 6= anti-matter? < 10 s > 1 TeV
Electroweak phase transition 10−10 s 1 TeV
Protons/neutrons form 10−4 s 2
10 MeV
BBN: H, He, Li form 3 min 0.1 MeV
Redshift
Matter = radiation 104 yrs 1 eV 104
CMB: recombination 105 yrs 0.1 eV 1,100
Dark ages 105 − 108 yrs > 25
Reionization 108 yrs 25 − 6
First galaxies form ∼ 6 × 108 yrs ∼ 10
Dark energy takes over ∼ 109 yrs ∼2
Solar system formed 8 × 109 yrs 0.5
Albert Einstein born 14 × 109 yrs 1 meV 0
7.1.1 Nucleosynthesis
The early universe was too hot for stable atoms to exist. The matter was in the form of free
electrons and nuclei (protons and neutrons). Once in a while two protons would combine into
a helium nucleus. This process of Big Bang Nucleosynthesis (BBN) predicts the proportions of
the light elements4 with glorious precision: 75% H, 25% He, 10−5 % D and 10−10 % Li. We see
precisely those proportions in old gas clouds. This is one of the great successes of the Big Bang
theory.
380,000 yrs
Recombination
CMB
BBN Last-Scattering
Cosmic Microwave
Big Bang Nucleosynthesis Background
3 min
4
Heavy elements are later produced inside stars.
82 7. Cosmology
7.1.2 Recombination
The earliest we can see with light is 380,000 years after the Big Bang. Before this time, the
universe was still filled with the plasma of free electrons and nuclei. Light therefore couldn’t
propagate very far before bouncing off the charged electrons. Recombination marks the time
when the universe cooled enough to allow the formation of the first stable atoms. At that
moment, light started to stream freely. Today, some 13.7 billion years later, we receive the light
from that era as the so-called cosmic microwave background (CMB).5
5
Although the CMB photons started off very energetic at recombination, billions of years of cosmic expansion
has stretched their wavelength into the microwave range.
7.1 The Big Bang 83
time
A Last-Scattering B
Now consider two points A and B on the last-scattering surface separated by 180 degrees (i.e. op-
posite points on the sky). These points have their own past light cones which intersect with
the Big Bang singularity in finite time. Notice that the time between the singularity and last-
scattering is much smaller than the time between last-scattering and the present time. We see
that the past light cones of the points A and B do not overlap. But regions of spacetime can
only exchange information if they have overlapping past light cones. This means that the points
A and B have never been in causal contact:
time
NO CAUSAL CONTACT
A Last-Scattering B
The same applies to any two points in the CMB that are separated my more than 2 degrees.
It seems that the CMB is made out of many causally-disconnected patches. (In fact, 104 of
them.) Yet, we still observe an almost perfectly uniform temperature of the CMB, even for
widely separated points. How can that be? Who told these independent regions to share the
same temperature? This puzzle is called the horizon problem.
7.3 Inflation
7.3.1 Solution of the Horizon Problem
The horizon problem would be solved if we could somehow achieve that the effective time6
between the singularity and last-scattering is larger than the time between last-scattering and
the present—in other words, if there was extra time before what we usually associate with the
conventional Big Bang:
conventional
Big Bang
Last-Scattering
Reheating
INFLATION
At sufficiently early times, the past light cones of any points in the CMB would then have
overlapped and the whole CMB would have originated from a causally connected region of
space. The uniformity of the CMB would have been given a causal explanation.
A process that achieves this solution to the horizon problem is cosmological inflation. During
inflation the size of the universe grows exponentially:
If inflation lasts long enough, CMB patches on opposite sides of the sky would have been
close enough to communicate at primordial times. Because of the inflationary expansion the
6
Technical remark for full disclosure: The effective time is called conformal time. The physical time between
the singularity and recombination is always 380,000 years. However, it is the conformal time that is relevant for
the horizon problem.
86 7. Cosmology
observable universe originated from a much smaller region of space than a naive extrapolation
of the conventional Big Bang evolution suggests:
Inflation
1040
nsion
B ig B ang Expa
Standard
Radius of universe
1020
100
10-20
10-40
10-60
Measured in physical time, inflation lasted for only about 10−34 seconds.
energy density
time
Notice that this implies somewhat exotic physics. In particular, maintaining a constant energy
density requires that energy is created in order to compensate for the fact that the volume grows.
This is challenging to arrange, but it can be done if the early universe is filled by something like
the Higgs field (but it can’t be the Higgs field itself).
7.4 From So Simple A Beginning 87
energy density
inflation locally ends at
slightly different times
classical
density fluctuations
quantum
time
It is these fluctuations that are the source of the CMB fluctuations. To see this, imagine that
the universe is filled with small clocks that measure the amount of time left before a given region
of space decays. But, I told you in Lecture 2 that the uncertainty principle limits how well one
can keep track of time with a very small clock. As a result, inflation will end at slightly different
times in different regions. Hence, different regions of the universe grow by slightly different
factors and end up at slightly different densities. The CMB photons in high density regions
are more energetic than in low density regions. Voilá, the source of the CMB fluctuations is
explained.
4 K Blackbody
2.725 K Blackbody
10-18 2 K Blackbody
Brightness [W/m2/sr/Hz]
10-19
10-20
Rocket (COBRA)
Satellite (COBE: FIRAS)
Satellite (COBE: DMR)
10-21 Optical (CN transition)
Ground
Balloon
The mean temperature associated with the radiation is 2.725 K. In addition, different directions
on the sky show small variations in the temperature, at the level of 10−5 K. A key test of the
88 7. Cosmology
inflationary hypothesis comes from studying the statistics of these CMB fluctuations. Consider
two points in the CMB separated by an angle θ:
We want to know how correlated these two points are. The following plot shows this correlation
as a function of the separation angle θ:
WMAP
Acbar
5000
Power [μK2]
3000
1000
90 2 0.5 0.2
Angular Scale [degrees]
The points with error bars are data, the line is the quantum mechanical calculation (plus basic
fluid dynamics and gravity to evolve the fluctuations forward in time). Something is clearly
working. It is remarkable that we can now trace the origin of galaxies, some of the largest
objects we know, to quantum mechanics, the physics of the very small.
7.5 Breaking News: BICEP2 89
E-mode
(grad)
The polarization vectors are aligned radially or tangential around the hot and cold spots of the
CMB. Gravitational waves, on the other hand, produce a B-mode pattern:
B-mode
(curl)
This time the polarization vectors are arranged in a swirly pattern around hot and cold spots in
the CMB. Before March, only the E-mode pattern had been seen. Many consider the B-mode
pattern the ultimate test for inflation. Some have called it the “smoking gun”.
90 7. Cosmology
−55
0
−60
−65
−0.3
50 0 −50
Right ascension [deg.]
Can you see the swirly pattern? They also produced a beautiful measurement of the two-point
correlations of the polarization signal:
BICEP2 CBI
BICEP1 Boomerang
QUAD DASI
QUIET−Q WMAP
QUIET−W CAPMAP
g
sin
len
Multipole
But, not everybody is convinced yet. Doubts have been raised whether BICEP2 is really seeing
the primordial signal. Some believe that the team has underestimated the contamination from
dust in our galaxy. This dust also produces polarised microwave radiation which could mimic the
inflationary signal. We will know the answer within a year. The scientific life of a cosmologist
has rarely been as exciting.
7.6 A Puzzle and A Mystery 91
hot gas
dark matter
The picture shows two clusters of galaxies that have (relatively) recently passed right through
each other. It turns out that the large majority (about 90%) of ordinary matter in a cluster is
not in the galaxies themselves, but in hot X-ray emitting intergalactic gas. As the two clusters
passed through each other, the hot gas in each smacked into the gas in the other, while the
individual galaxies and the dark matter (presumed to be collisionless) passed right through. So,
the collision has swept out the ordinary matter from the clusters, displacing it with respect to
the dark matter.
We now believe that 85% of the matter in the universe is dark matter. But, we don’t know
what it is. However, there are many theoretical ideas with fancy names, such as axions, WIMPs,
neutralinos, etc. In addition, there are many future experimental tests that will help us to
narrow down the range of possibilities and hopefully detect dark matter particles directly. So,
dark matter is a puzzle, but one that we are quite optimistic we will solve.
You have probably heard that recently the expansion of the universe started accelerating
again. This is like inflation, but at much, much lower energies. What form of dark energy is
responsible for this?
92 7. Cosmology
Two important components are matter (dark or not) and radiation (photons or neutrinos). Matter
dilutes as the universe expands: ρm ∝ 1/a3 (t) (energy is constant, so this simply describes the growth
of the volume—three factors of a(t)). Whenever the universe is dominated by matter, we therefore
find
eq. (7.6.3)
ρm ∝ a−3 (t) −−−−−−→ a(t) ∼ t2/3 . (7.6.4)
Radiation also dilutes and in addition it redshifts. The energy density therefore decreases with an
extra factor of a(t), i.e. ρr ∝ 1/a4 (t). A universe dominated by radiation therefore evolves as
eq. (7.6.3)
ρr ∝ a−4 (t) −−−−−−→ a(t) ∼ t1/2 . (7.6.5)
A common feature of the solutions (7.6.4) and (7.6.5) is that the expansion slows down:
d2 a
<0. (7.6.6)
dt2
Another way of seeing this is to consult a second Einstein equation
1 d2 a 4πG
2
= − ρ + 3p . (7.6.7)
a dt 3
Here, p is pressure. For matter we have p = 0, while radiation has p = ρ/3. Since ρ > 0, it follows
that both matter and radiation slow down the expansion.
In the 90s, astronomers tried to measure the rate of deceleration in order to determine exactly how
much matter and radiation there was in the universe. To their surprise they found that the universe
isn’t slowing down at all. Instead the expansion is speeding up:
d2 a
>0. (7.6.8)
dt2
How can that be? According to eq. (7.6.7), it requires the universe to be filled with a negative
pressure fluid: p < −ρ/3. We call this fluid dark energy, but we have no idea what it is.
7.6 A Puzzle and A Mystery 93
However, when we use quantum mechanics to compute the size of the vacuum energy we find
this
ρquantum ≈ 10120 ρobs . (7.6.9)
This is the worst disagreement between theory and experiment in the history of science. It is
called the cosmological constant problem. In physics, we are not used to screwing up so badly.
Especially, in quantum mechanics and relativity we have been spoiled with incredible precise
predictions. So, we take this problem personally.
In fact, we do have an answer to the problem. Here it is: we postulate that in addition to the
large quantum contribution ρobs , there is an equally large, but negative, classical contribution
ρclassical . We add these two contributions to get the observed dark energy density
10−120 = − 2. 1568
| .{z
. . 3521} . . . + 2.1568 . . . 3523 . . . (7.6.11)
120 digits
I know, it looks ludicrous. The amount of fine-tuning seems ridiculous. Unfortunately, this is
the best we have come up with. Can you help us to find a better solution?
A Symmetries and Conservation Laws
In Lecture 1, we got so close to one of the most important results in all of physics that I
couldn’t resist the temptation to write it down for you. It is called Noether’s theorem and
relates symmetries to conservation laws.
If the potential only depends on the distance between the particles, then the Lagrangian
is
m 2
q̇1 + q̇22 − V (q1 − q2 ) .
L= (A.1.1)
2
It is instructive to rewrite this in terms of two new coordinates: q+ ≡ q1 + q2 and q− ≡
q1 − q2 . The Lagrangian does not depend on q+ ,
m 2 2
L= q̇+ + q̇− − V (q− ) . (A.1.2)
2
The Euler-Lagrange equation then implies
dP+ ∂V
=− =0 ⇒ P+ = mq̇+ = mq̇1 + mq̇2 = const. (A.1.3)
dt ∂q+
This is the conservation of momentum.
94
A.2 Noether’s Theorem 95
The Lagrangian does not depend on θ. The Euler-Lagrange equation then implies
dPθ ∂V
=− =0 ⇒ Pθ = mr2 θ̇ = const. (A.1.6)
dt ∂θ
This is the conservation of angular momentum.
The conservation laws in these examples might look like accidental properties of the potential.
In reality, they are a consequence of a deep principle.
• Example 1: Imagine taking the two particles and shifting them both by the same dis-
tance :
q1 7→ q1 + δq1 =
⇒ , (A.1.7)
q2 7→ q2 + δq2 =
where δqi is the difference between the new and the old coordinates. Since the potential
only depends on the difference of the coordinates, the Lagrangian doesn’t change. We call
the transformation (A.1.7) a translation symmetry of the system.
• Example 2: In the second example, the Lagrangian doesn’t change if we rotate the
particle position around the origin
q1 →
7 +q1 cos + q2 sin δq1 = +q2
⇒ , (A.1.8)
q2 →7 −q1 sin + q2 cos δq2 = −q1
where in the last expression we have assumed that is infinitesimal, so that sin ≈ and
cos ≈ 1. (Any continuous transformation can be built up from a sequence of infinitesimal
transformations.) We call the transformation (A.1.8) a rotation symmetry of the system.
i.e. each coordinate shifts by an amount proportional to , but the proportionality factor can
depend on where you are. This includes Example 1 (f1 = f2 = 1) and Example 2 (f1 = q2 ,
f2 = −q1 ). We can calculate how much L(q, q̇) changes under this transformation
X ∂L ∂L
δL = δ q̇i + δqi . (A.2.10)
∂ q̇i ∂qi
i
Now we do a bit of magic. Watch it carefully. First, we remember that Pi = ∂L/∂ q̇i . Thus the
P
first term is i Pi δ q̇i . Hold on to that while we look at the second term ∂L/∂qi δqi . Using the
Euler-Lagrange equation, ∂L/∂qi = dPi /dt, we get Ṗi δqi . Combining the terms, here is what
we get for the change of the Lagrangian
Xh i
δL = Pi δ q̇i + Ṗi δqi . (A.2.11)
i
96 A. Symmetries and Conservation Laws
What does all of this have to do with symmetry and conservation? First of all, by definition,
symmetry means that the Lagrangian is unchanged, δL = 0. So if (A.1.7) is a symmetry, then
d X
Pi δqi = 0 . (A.2.13)
dt
i
Q = P1 + P2 . (A.2.16)
That is just the conservation of momentum that we found before. But now we can say a
far more general thing:
Sweet!
Q = q2 P1 − q1 P2 = ~q × ~P . (A.2.17)
That is just the conservation of momentum that we found before. Again, there is a deeper
thing involved than just the angular momentum of a single particle:
whether we perform it today, tomorrow or in ten years. In the language of the Lagrangian
method, such a symmetry means that the Lagrangian has no explicit dependence on time.
This is a subtle point: The value of the Lagrangian may vary with time, but only because the
coordinates and velocities vary. Explicit time dependence means that the form of the Lagrangian
depends on time.
Abstractly, we allow for an explicit time dependence in the Lagrangian by adding time as a
coordinate,
L = L(qi , q̇i , t) . (A.3.20)
The total time derivative of this Lagrangian is
dL X ∂L ∂L ∂L
= q̇i + q̈i + , (A.3.21)
dt ∂qi ∂ q̇i ∂t
i
where the final term is only non-zero if the Lagrangian has an explicit time dependence. Let’s
examine the various terms in (A.3.21) using the Euler-Lagrange equations. The terms in the
square brackets can be written as
∂L ∂L d
q̇i + q̈i = Ṗi q̇i + Pi q̈i = (Pi q̇i ) . (A.3.22)
∂qi ∂ q̇i dt
We get
dL d X ∂L
= (Pi q̇i ) + . (A.3.23)
dt dt ∂t
i
Notice that even if there is no explicit time dependence in L, the Lagrangian will nevertheless
P
depend on time through the first term i (Pi q̇i ). There is no such thing as conservation of the
Lagrangian. However, inspection of eq. (A.3.23) reveals something interesting. If we define a
new quantity H—called the Hamiltonian—by
X
H≡ (Pi q̇i ) − L (A.3.24)
i
If the Lagrangian is invariant under time translations, then the Hamiltonian is conserved.
Example.—Let us evaluate the Hamiltonian for a simple example. Consider a single particle with
Lagrangian,
m
L = q̇ 2 − V (q) . (A.3.26)
2
The momentum is
∂L
P= = mq̇ . (A.3.27)
∂ q̇
Using eq. (A.3.24), we get
m
H = (mq̇)q̇ − q̇ 2 − V (q) (A.3.28)
2
m 2
= q̇ + V (q) . (A.3.29)
2
This is just the energy of the particle.
It turns out that all conservation laws in nature are related to symmetries through Noether’s
theorem. This includes the conservation of electric charge and the conservation of particles such
as protons and neutrons.
References
Quantum Mechanics
- Susskind and Friedman, Quantum Mechanics
- Zeilinger, Dance of the Photons
- Feynman, QED
Statistical Mechanics
- Carroll, From Eternity to Here
- Feynman, The Character of Physical Law
- Gleick, The Information
Particle Physics
- Sample, Massive: The Hunt for the God Particle
- Carroll, The Particle at the End of the Universe
General Relativity
- Thorne, Black Holes and Time Warps
- Ellis and Williams, Flat and Curved Space-Times
Cosmology
- Weinberg, The First Three Minutes
- Guth, The Inflationary Universe
- Overbye, Lonely Hearts of the Cosmos
Symmetry
- Zee, Fearful Symmetry
- Huang, Fundamental Forces of Nature
99