COURSE OF THEORETICAL PHYSICS
Volume 4
Second Edition
QUANTUM ELECTRODYNAMICS
Other titles in the COURSE OF THEORETICAL PHYSICS by
LANDAU and LIFSHITZ
Volume 1 Mechanics, 3rd Edition
Volume 2
The Classical Theory of Fields, 4th Edition
Volume 3
Quantum Mechanics (Non-relativistic Theory), 3rd Edition
Volume 5
Statistical Physics, Part 1, 3rd Edition
Volume 6
Fluid Mechanics, 2nd Edition
Volume 7
Theory of Elasticity, 3rd Edition
Volume 8
Electrodynamics of Continuous Media, 2nd Edition
Volume 9
Statistical Physics, Part 2
Volume 10 Physical Kinetics
QUANTUM ELECTRODYNAMICS
by
V. B. BERESTETSKII, E. M. LIFSHITZ
and
L. P. PITAEVSKII
Institute of Physical Problems, U.S.S.R. Academy of Sciences
Volume 4 of Course of Theoretical Physics
Second edition
Translated from the Russian by
J. B. SYKES and J. S. BELL
Butterworth-Heinemann
An Imprint of Elsevier
Butterworth-Heinemann is an imprint of Elsevier
Linacre House, Jordan Hill, Oxford OX2 8DP, UK
30 Corporate Drive, Suite 400, Burlington, MA 01803, USA
First edition 1971
Reprinted 1974
Second edition 1982
Reprinted 1989, 1994, 1996, 1997, 1999, 2002, 2004, 2006, 2007, 2008
Copyright © 1982, Elsevier Ltd. All rights reserved
No part of this publication may be reproduced, stored in a retrieval system
or transmitted in any form or by any means electronic, mechanical, photocopying,
recording or otherwise without the prior written permission of the publisher
Permissions may be sought directly from Elsevier's Science & Technology Rights
Department in Oxford, UK: phone (+44) (0) 1865 843830; fax (+44) (0) 1865 853333;
email:
[email protected]. Alternatively you can submit your request online by
visiting the Elsevier web site at http://elsevier.com/locate/permissions, and selecting
Obtaining permission to use Elsevier material
Notice
No responsibility is assumed by the publisher for any injury and/or damage to persons
or property as a matter of products liability, negligence or otherwise, or from any use
or operation of any methods, products, instructions or ideas contained in the material
herein. Because of rapid advances in the medical sciences, in particular, independent
verification of diagnoses and drug dosages should be made
British Library Cataloguing in Publication Data
Berestetskii, V B.
Quantum electrodynamics - 2nd ed.
(Course of theoretical physics; V4)
1. Quantum field theory
I. Title II. Liftshitz, E. M. III. Pitaevskii, L. P.
IV Berestetskii, V B. R elativistic quantum theory
V Series
ISBN: 978-0-7506-3371-0
For information on all Butterworth-Heinemann publications
visit our website at books.elsevier.com
Printed in the United States of America
Transferred to Digital Printing, 2010
Working together to grow
libraries in developing countries
www.elsevier.com | www.bookaid.org | www.sabre.org
ELSEVIER
POOK A™
Sabre Foundation
PREFACE TO THE SECOND EDITION
THE first edition of this volume of the Course of Theoretical Physics was published
in two parts (1971 and 1974) under the title "Relativistic Quantum Theory". It
contained not only the basic material on quantum electrodynamics but also chapters on weak interactions and certain topics in the theory of strong interactions.
The inclusion of those chapters now seems to us inopportune. The theory of strong
and weak interactions is undergoing a vigorous development founded on new
physical ideas, and the situation in this field is changing very rapidly, so that the
time for a consistent exposition of the theory has not yet arrived. In the present
edition, therefore, we have retained only quantum electrodynamics, and accordingly changed the title of the volume.
As well as a considerable number of corrections and minor changes, we have
made in this edition several more significant additions, including the operator
method of calculating the bremsstrahlung cross-section, the calculation of the
probabilities of photon-induced pair production and photon decay in a magnetic
field, the asymptotic form of the scattering amplitudes at high energies, inelastic
scattering of electrons by hadrons, and the transformation of electron-positron
pairs into hadrons.
A word regarding notation. We have reverted to the use of circumflexed letters
for operators, in line with the other volumes in the Course. No special notation is
used for the product of a 4-vector and a matrix vector yM, previously denoted by a
circumflexed letter; such products are now shown explicitly.
We have, alas, had to prepare this edition without the aid of Vladimir Berestetskiï,who died in 1977; but some of the added material mentioned above had been
put together previously, by the three authors jointly.
Our sincere thanks are offered to all readers who have given us their comments
on the first edition of the book, and in particular to J. S. Bell, V. P. Kraïnov, L. B.
Okun\ V. I. Ritus, M. I. Ryazanov and I. S. Shapiro.
July 1979
E . M . LlFSHlTZ
L . P . PlTAEVSKIÏ
This page intentionally left blank
FROM THE PREFACE TO
THE FIRST EDITION
IN ACCORDANCE with the general plan of this Course of Theoretical Physics, the
present volume deals with relativistic quantum theory in the broad sense: the
theory of all phenomena which depend upon the finite velocity of light, including
the whole of the theory of radiation.
This branch of theoretical physics is still far from completion, even as regards
its basic physical principles, and this is particularly true of the theory of strong and
weak interactions. But even quantum electrodynamics, despite the remarkable
achievements of the last twenty years, still lacks a satisfactory logical structure.
In the choice of material for this book we have considered only results which
appear to be reasonably firmly established. In consequence, of course, the greater
part of the book is devoted to quantum electrodynamics. We have tried to give a
realistic exposition, with emphasis on the physical hypotheses used in the theory,
but without going into details of justifications, which in the present state of the
theory are in any case purely formal.
In the discussion of specific applications of the theory, our aim has been not to
include the whole vast range of effects but to select only the most fundamental of
them, adding some references to original papers which contain more detailed
studies. We have often omitted some of the intermediate steps in the calculations,
which in this subject are usually very lengthy, but we have always sought to
indicate any non-trivial point of technique.
The discussion in this book demands a higher degree of previous knowledge on
the part of the reader than do the other volumes in the Course. Our assumption has
been that a reader whose study of theoretical physics has extended as far as the
quantum theory of fields has no further need of predigested material.
This book has been written without the direct assistance of our teacher, L. D.
Landau. Yet we have striven to be guided by the spirit and the approach to
theoretical physics which characterized his teaching of us and which he embodied
in the other volumes. We have often asked ourselves what would be the attitude of
Dau to this or that topic, and sought the answer prompted by our many years'
association with him.
Our thanks are due to V. N. Baler, who gave great help in compiling §§90 and
97, and to V. I. Ritus for great help in writing §101. We are grateful to B. É.
Meïerovich for assistance with calculations, and also to A. S. Kompaneets, who
made available his notes of L. D. Landau's lectures on quantum electrodynamics,
given at Moscow State University in the academic year 1959-60.
June 1967
V. B.
BERESTETSKII,
vii
E. M.
LIFSHITZ,
L. P.
PITAEVSKIÏ
This page intentionally left blank
CONTENTS
NOTATION
INTRODUCTION
l
§1. The uncertainty principle in the relativistic case
1
I. PHOTONS
§2.
§3.
§4.
§5.
§6.
§7.
§8.
§9.
Quantization of the free electromagnetic
Photons
Gauge invariance
The electromagnetic field in quantum theory
The angular momentum and parity of the photon
Spherical waves of photons
The polarization of the photon
A two-photon system
field
5
10
12
14
16
19
24
29
II. BOSONS
§10.
§11.
§12.
§13.
§14.
§15.
§16.
The wave equation for particles with spin zero
Particles and antiparticles
Strictly neutral particles
The transformations C, P and T
The wave equation for a particle with spin one
The wave equation for particles with higher integral spins
Helicity states of a particle
33
37
41
44
50
53
55
III. FERMIONS
§17.
§18.
§19.
§20.
§21.
§22.
§23.
§24.
§25.
§26.
§27.
§28.
§29.
§30.
§31.
Four-dimensional spinors
The relation between spinors and 4-vectors
Inversion of spinors
Dirac's equation in the spinor representation
The symmetrical form of Dirac's equation
Algebra of Dirac matrices
Plane waves
Spherical waves
The relation between the spin and the statistics
Charge conjugation and time reversal of spinors
Internal symmetry of particles and antiparticles
Bilinear forms
The polarization density matrix
Neutrinos
The wave equation for a particle with spin 3/2
62
64
68
73
75
80
84
88
91
94
99
101
106
111
115
Contents
X
IV. PARTICLES IN AN EXTERNAL FIELD
§32.
§33:
§34.
§35.
§36.
§37.
§38.
§39.
§40.
§41.
§42.
Dirac's equation for an electron in an external
field
Expansion in powers of 1/c
Fine structure of levels of the hydrogen atom
Motion in a centrally symmetric
field
Motion in a Coulomb
field
Scattering in a centrally symmetric
field
Scattering in the ultra-relativistic case
The continuous-spectrum wave functions for scattering in a Coulomb
An electron in the field of an electromagnetic plane wave
Motion of spin in an external
field
Neutron scattering in an electric
field
field
118
122
126
128
133
140
142
144
148
151
157
V. RADIATION
§43.
§44.
§45.
§46.
§47.
§48.
§49.
§50.
§51.
§52.
§53.
§54.
§55.
§56.
§57.
§58.
The electromagnetic interaction operator
Emission and absorption
Dipole radiation
Electric multipole radiation
Magnetic multipole radiation
Angular distribution and polarization of the radiation
Radiation from atoms: the electric type
Radiation from atoms: the magnetic type
Radiation from atoms: the Zeeman and Stark effects
Radiation from atoms: the hydrogen atom
Radiation from diatomic molecules: electronic spectra
Radiation from diatomic molecules: vibrational and rotational spectra
Radiation from nuclei
The photoelectric effect: non-relativistic case
The photoelectric effect: relativistic case
Photodisintegration of the deuteron
159
161
164
166
171
173
181
186
189
192
197
203
205
207
212
216
VI. SCATTERING OF RADIATION
§59.
§60.
§61.
§62.
§63.
The scattering tensor
Scattering by freely oriented systems
Scattering by molecules
Natural width of spectral lines
Resonance
fluorescence
221
231
237
240
244
VII. THE SCATTERING MATRIX
§64.
§65.
§66.
§67.
§68.
§69.
§70.
§71.
The scattering amplitude
Reactions involving polarized particles
Kinematic invariants
Physical regions
Expansion in partial amplitudes
Symmetry of helicity scattering amplitudes
Invariant amplitudes
The unitarity condition
247
252
256
258
264
268
274
278
VIII. INVARIANT PERTURBATION THEORY
§72.
§73.
§74.
§75.
§76.
The chronological product
Feynman diagrams for electron scattering
Feynman diagrams for photon scattering
The electron propagator
The photon propagator
283
286
292
295
300
Contents
xi
§77. General rules of the diagram technique
§78. Crossing invariance
§79. Virtual particles
304
311
312
IX. INTERACTION OF ELECTRONS
§80.
§81.
§82.
§83.
§84.
§85.
Scattering of an electron in an external
Scattering of electrons and positrons by an electron
Ionization losses of fast particles
Breit's equation
Positronium
The interaction of atoms at large distances
field
317
321
330
336
343
347
X. INTERACTION OF ELECTRONS WITH PHOTONS
§86.
§87.
§88.
§89.
§90.
§91.
§92.
§93.
§94.
§95.
§96.
§97.
§98.
§99.
§100.
§101.
Scattering of a photon by an electron
Scattering of a photon by an electron. Polarization effects
Two-photon annihilation of an electron pair
Annihilation of positronium
Synchrotron radiation
Pair production by a photon in a magnetic
field
Electron-nucleus bjemsstrahlung. The non-relativistic case
Electron-nucleus bremsstrahlung. The relativistic case
Pair production by a photon in the field of a nucleus
Exact theory of pair production in the ultra-relativistic case
Exact theory of bremsstrahlung in the ultra-relativistic case
Electron-electron bremsstrahlung in the ultra-relativistic case
Emission of soft photons in collisions
The method of equivalent photons
Pair production in collisions between particles
Emission of a photon by an electron in the field of a strong electromagnetic wave
354
359
368
371
376
386
389
400
410
413
419
426
431
438
444
449
XL EXACT PROPAGATORS AND VERTEX PARTS
§102.
§103.
§104.
§105.
§106.
§107.
§108.
§109.
§110.
§111.
§112.
Field operators in the Heisenberg representation
The exact photon propagator
The self-energy function of the photon
The exact electron propagator
Vertex parts
Dyson's equations
Ward's identity
Electron propagators in an external
Physical conditions for renormalization
Analytical properties of photon propagators
Regularization of Feynman integrals
field
456
459
465
468
472
476
478
481
487
493
4%
XII. RADIATIVE CORRECTIONS
§113.
§114.
§115.
§116.
§117.
§118.
§119.
§120.
§121.
§122.
§123.
§124.
Calculation of the polarization operator
Radiative corrections to Coulomb's law
Calculation of the imaginary part of the polarization operator from the Feynman integral
Electromagnetic form factors of the electron
Calculation of electron form factors
Anomalous magnetic moment of the electron
Calculation of the mass operator
Emission of soft photons with non-zero mass
Electron scattering in an external field in the second Born approximation
Radiative corrections to electron scattering in an external
field
Radiative shift of atomic levels
Radiative shift of mesic-atom levels
501
504
508
513
517
521
524
529
534
540
544
551
Contents
Xll
§125.
§126.
§127.
§128.
§129.
§130.
§131.
The relativistic equation for bound states
The double dispersion relation
Photon-photon scattering
Coherent, scattering of a photon in the field of a nucleus
Radiative corrections to the electromagnetic field equations
Photon splitting in a magnetic
field
Calculation of integrals over four-dimensional regions
552
559
566
573
575
585
592
XIII. ASYMPTOTIC FORMULAE OF QUANTUM
ELECTRODYNAMICS
§132.
§133.
§134.
§135.
§136.
§137.
Asymptotic form of the photon propagator for large momenta
The relation between unrenormalized and actual charges
Asymptotic form of the scattering amplitudes at high energies
Separation of the double-logarithmic terms in the vertex operator
Double-logarithmic asymptotic form of the vertex operator
Double-logarithmic asymptotic form of the electron-muon scattering amplitude
597
601
603
608
614
616
XIV. ELECTRODYNAMICS OF HADRONS
§138.
§139.
§140.
§141.
§142.
§143.
§144.
Electromagnetic form factors of hadrons
Electron-hadron scattering
The low-energy theorem for bremsstrahlung
The low-energy theorem for photon-hadron scattering
Multipole moments of hadrons
Inelastic electron-hadron scattering
Hadron formation from an electron-positron pair
INDEX
624
629
632
635
638
643
645
649
NOTATION
Four-dimensional
Four-dimensional tensor indices are denoted by Greek letters A, /x, v,...,
the values 0, 1,2, 3.
A 4-metric with signature ( +
) is used. The metric tensor is
taking
g^igoO = 1, gll = g22 = g33 = - 1).
Components of a 4-vector are stated in the form a* = (a 0 , a).
To simplify the formulae, the index is often omitted in writing the components
of a 4-vector.t The scalar products of 4-vectors are written simply as (ab) or ab\
ab = aj)* = a0b0 - a • b.
The 4-position-vector is JCM = (t, r). The 4-volume element is d4x.
The operator of differentiation with respect to the 4-coordinates is dM = dldx*1.
The antisymmetric unit 4-tensor is eX[LVp, with e0123 = -e 0 i23 = +1.
The four-dimensional delta function S (4) (a)=ô(a 0 )S(a).
Three-dimensional
Three-dimensional tensor indices are denoted by Latin letters i, Jc, I, . . . , taking
the values x, y, z.
Three-dimensional vectors are denoted by letters in bold type.
The three-dimensional volume element is d3x.
Operators
Operators are denoted by italic letters with circumflex.^
Commutators or anticommutators of two operators are written {/, g} ± = fg ± g/.
The transposed operator is /.
The Hermitian conjugate operator is / + .
Matrix elements
The matrix element of the operator F for a transition from initial state i to final
state / is Ffi or </|F|i).
t This way of writing the components is often used in recent literature. It is a compromise between
the limited resources of the alphabet and the demands of physics, and means, of course, that the reader
must be particularly attentive.
X However, to simplify the formulae, the circumflex is not written over spin matrices, and it is also
omitted when operators are shown in matrix elements.
xiii
Notation
XIV
The notation \i) is used as an abstract symbol for a state independently of any
specific representation in which its wave function may be expressed. The notation
(/| denotes a final ("complex conjugate") state.t
Correspondingly, (s\r) denotes the coefficients in the expression of a set of
states with quantum numbers r as superpositions of states with quantum numbers
s: |r> = 2|s><s|r>.
5
The reduced matrix elements of spherical tensors are </||F||/).
Dirac's equation
The Dirac matrices are y*\ with (70)2 = 1, (71)2 = (y2)2 = (y3)2 = - 1 . The matrix
<* 7°7> ß = 7°- The expressions in the spinor and standard representations are
(21.3), (21.16) and (21.20).
y5 = -iy°yly2y\(y5)2
= 1; see (22.18).
er*1" = 5(7 V ~ 7 V ) ; see (28.2).
Dirac conjugation is expressed by i/r = i//*y°.
The Pauli matrices are or = (ax, <ry, crz), defined in §20.
The 4-spinor indices are a, ß , . . . and a, ß , . . ., taking the values 1, 2 and i, 2.
The bispinor indices are i, k, I , . . . , taking the values 1, 2, 3, 4.
=
Fourier expansion
Three-dimensional:
fir) = | / ( k ) elk
r
^ 3 ,
/(k) = | / ( r ) e"'k
r
d3x,
and similarly for the four-dimensional expansion.
Units
Except where otherwise specified, relativistic units are used, with h = 1, c = 1.
In these units, the square of the unit charge is e2 = 1/137.
Atomic units have e = l , f i = l , m = l . I n these units, c = 137. The atomic units
of length, time and energy are h2jme2, h^/me4 and me4lh2\ the quantity Ry =
me4l2h2 is called a rydberg.
Ordinary units are given in the absolute (Gaussian) system.
Constants
Velocity of light c = 2.998 x 1010 cm/sec.
Unit charge* |e|=4.803xl0 10 CGS electrostatic units.
t This notation is due to Dirac.
t Throughout the book (except in Chapter XIV), e denotes the charge with the appropriate sign, so
that e = -\e\ for an electron.
Notation
Electron mass m = 9.11 x 10 28 g.
Planck's constant ft = 1.055 x 10~27 erg. sec.
Fine-structure constant a = e2/ftc; 1/a = 137.04.
Bohr radius ft2/me2 = 5.292 x 10"9 cm.
Classical electron radius re = e2lmc2 = 2.818 x 10-13 cm.
Compton wavelength of the electron hjmc = 3.862 x 10"" cm.
Electron rest energy mc2 = 0.511 x 106eV.
Atomic energy unit me4lh2 = 4.360 x 10" erg = 27.21 eV.
Bohr magneton \e\hl2mc = 9.274 x 10~21 erg/G.
Proton mass mp = 1.673 x 10"24g.
Compton wavelength of the proton hlmpc = 2.103 x 10~14cm.
Nuclear magneton \e\hl2mpc = 5.051 x 10"24 erg/G.
Mass ratio of muon and electron mjm - 2.068 x 102.
References to volumes in the Course of Theoretical Physics:
Mechanics = Vol. 1 {Mechanics, third English edition, 1976).
Fields = Vol. 2 (The Classical Theory of Fields, fourth English edition, 1975).
QM or Quantum Mechanics = Vol. 3 (Quantum Mechanics, third English
edition, 1977).
ECM = Vol. 8 (Electrodynamics of Continuous Media, English edition, 1960).
PK = Vol. 10 (Physical Kinetics, English edition, 1981).
All are published by Pergamon Press.
This page intentionally left blank
INTRODUCTION
§ 1. The uncertainty principle in the relativistic case
THE quantum theory described in Volume 3 (Quantum Mechanics) is essentially
non-relativistic throughout, and is not applicable to phenomena involving motion at
velocities comparable with that of light. At first sight, one might expect that the
change to a relativistic theory is possible by a fairly direct generalization of the
formalism of non-relativistic quantum mechanics. But further consideration shows
that a logically complete relativistic theory cannot be constructed without invoking
new physical principles.
Let us recall some of the physical concepts forming the basis of non-relativistic
quantum mechanics (QM, §1). We saw that one fundamental concept is that of
measurement, by which is meant the process of interaction between a quantum
system and a classical object or apparatus, causing the quantum system to acquire
definite values of some particular dynamical variables (coordinates, velocities,
etc.). We saw also that quantum mechanics greatly restricts the possibility that an
electront simultaneously possesses values of different dynamical variables. For
example, the uncertainties Aq and Ap in simultaneously existing values of the
coordinate and the momentum are related by the expression^ AqAp ~~ h; the
greater the accuracy with which one of these quantities is measured, the less the
accuracy with which the other can be measured at the same time.
It is important to note, however, that any of the dynamical variables of the
electron can individually be measured with arbitrarily high accuracy, and in an
arbitrarily short period of time. This fact is of fundamental importance throughout
non-relativistic quantum mechanics. It is the only justification for using the concept
of the wave function, which is a basic part of the formalism. The physical
significance of the wave function \p(q) is that the square of its modulus gives the
probability of finding a particular value of the electron coordinate as the result of a
measurement made at a given instant. The concept of such a probability clearly
requires that the coordinate can in principle be measured with any specified
accuracy and rapidity, since otherwise this concept would be purposeless and
devoid of physical significance.
The existence of a limiting velocity (the velocity of light, denoted by c) leads to
new fundamental limitations on the possible measurements of various physical
quantities (L. D. Landau and R. E. Peierls, 1930).
t As in QM, §1, we shall, for brevity, speak of an "electron", meaning any quantum system.
t In this section, ordinary units are used.
1
2
Introduction
§1
In QM, §44, the following relationship has been derived:
i/-u)ApAt-ft,
(1.1)
relating the uncertainty Ap in the measurement of the electron momentum and the
duration At of the measurement process itself; v and v' are the velocities of the
electron before and after the measurement. From this relationship it follows that a
momentum measurement of high accuracy made during a short time (i.e. with Ap
and At both small) can occur only if there is a large change in the velocity as a
result of the measurement process itself. In the non-relativistic theory, this showed
that the measurement of momentum cannot be repeated at short intervals of time,
but it did not at all diminish the possibility, in principle, of making a single
measurement of the momentum with arbitrarily high accuracy, since the difference
v' - v could take any value, no matter how large.
The existence of a limiting velocity, however, radically alters the situation. The
difference v' - v, like the velocities themselves, cannot now exceed c (or rather 2c).
Replacing v'- v in (l.T by c, we obtain
ApAf-ft/c,
(1.2)
which determines the highest accuracy theoretically attainable when the momentum is measured by a process occupying a given time At. In the relativistic theory,
therefore, it is in principle impossible to make an arbitrarily accurate and rapid
measurement of the momentum. An exact measurement (Ap ->0) is possible only in
the limit as the duration of the measurement tends to infinity.
There is reason to suppose that the concept of measurability of the electron
coordinate itself must also undergo modification. In the mathematical formalism of
the theory, this situation is shown by the fact that an accurate measurement of the
coordinate is incompatible with the assertion that the energy of a free particle is
positive. It will be seen later that the complete set of eigenfunctions of the
relativistic wave equation of a free particle includes, as well as solutions having the
"correct" time dependence, also solutions having a "negative frequency". These
functions will in general appear in the expansion of the wave packet corresponding
to an electron localized in a small region of space.
It will be shown that the wave functions having a "negative frequency"
correspond to the existence of antiparticles (positrons). The appearance of these
functions in the expansion of the wave packet expresses the (in general) inevitable
production of electron-positron pairs in the process of measuring the coordinates
of an electron. This formation of new particles in a way which cannot be detected
by the process itself renders meaningless the measurement of the electron coordinates.
In the rest frame of the electron, the least possible error in the measurement of
its coordinates is
&q~hlmc
(1.3)
This value (which purely dimensional arguments show to be the only possible one)
§1
The Uncertainty Principle in the Relativistic Case
3
corresponds to a momentum uncertainty Ap — mc, which in turn corresponds to the
threshold energy for pair production.
In a frame of reference in which the electron is moving with energy e, (1.3)
becomes
àq-chle.
(1.4)
In particular, in the limiting ultra-relativistic case the energy is related to the
momentum by e - cp, and
Aq~ft/p,
(1.5)
i.e. the error Aq is the same as the de Broglie wavelength of the particle.t
For photons, the ultra-relativistic case always applies, and the expression (1.5)
is therefore valid. This means that the coordinates of a photon are meaningful only
in cases where the characteristic dimensions of the problem are large in comparison with the wavelength. This is just the "classical" limit, corresponding to
geometrical optics, in which radiation can be said to be propagated along definite
paths or rays. In the quantum case, however, where the wavelength cannot be
regarded as small, the concept of coordinates of the photon has no meaning. We
shall see later (§4) that, in the mathematical formalism of the theory, the fact that
the photon coordinates cannot be measured is evident because the photon wave
function cannot be used to construct a quantity which might serve as a probability
density satisfying the necessary conditions of relativistic invariance.
The foregoing discussion suggests that the theory will not consider the time
dependence of particle interaction processes. It will show that in these processes
there are no characteristics precisely definable (even within the usual limitations of
quantum mechanics); the description of such a process as occurring in the course
of time is therefore just as unreal as the classical paths are in non-relativistic
quantum mechanics. The only observable quantities are the properties (momenta,
polarizations) of free particles: the initial particles which come into interaction, and
the final particles which result from the process (L. D. Landau and R. E. Peierls,
1930).
A typical problem as formulated in relativistic quantum theory is to determine
the probability amplitudes of transitions between specified initial and final states
{t -» +<») of a system of particles. The set of such amplitudes between all possible
states constitutes the scattering matrix or S-matrix. This matrix will embody all the
information about particle interaction processes that has an observable physical
meaning (W. Heisenberg, 1938).
There is as yet no logically consistent and complete relativistic quantum theory.
We shall see that the existing theory introduces new physical features into the
nature of the description of particle states, which acquires some of the features of
t The measurements in question are those for which any experimental result yields a conclusion
about the state of the electron; that is, we are not considering coordinate measurements by means of
collisions, when the result does not occur with probability unity during the time of observation.
Although the deflection of a measuring-particle in such cases may indicate the position of an electron,
the absence of a deflection tells us nothing.
4
Introduction
§1
field theory (see §10). The theory is, however, largely constructed on the pattern of
ordinary quantum mechanics. This structure of the theory has yielded good results
in quantum electrodynamics. The lack of complete logical consistency in this
theory is shown by the occurrence of divergent expressions when the mathematical
formalism is directly applied, although there are quite well-defined ways of eliminating these divergences. Nevertheless, such methods remain, to a considerable
extent, semiempirical rules, and our confidence in the correctness of the results is
ultimately based only on their excellent agreement with experiment, not on the
internal consistency or logical ordering of the fundamental principles of the theory.
CHAPTER I
PHOTONS
§ 2. Quantization of the free electromagnetic field
WITH the purpose of treating the electromagnetic field as a quantum object, it is
convenient to begin from a classical description of the field in which it is
represented by an infinite but discrete set of variables. This description permits the
immediate application of the customary formalism of quantum mechanics. The
representation of the field by means of potentials specified at every point in space
is essentially a description by means of a continuous set of variables.
Let A(r, t) be the vector potential of the free electromagnetic field, which
satisfies the "transversality condition"
divA = 0.
(2.1)
The scalar potential 4> = 0, and the fields E and H are
E=-À,
H = curl A.
(2.2)
Maxwell's equations reduce to the wave equation for A:
AA-d 2 A/df 2 = 0.
(2.3)
In classical electrodynamics (see Fields, §52) the change to the description by
means of a discrete set of variables is brought about by considering the field in a
large but finite volume V.t The following is a brief résumé of the argument.
The field in a finite volume can be expanded in terms of travelling plane waves,
and its potential is then represented by a series
A = 2(ake'k
r
k
+ aïe-|kr),
(2.4)
where the coefficients ak are functions of the time such that
ak-e-^,
o> = |k|.
(2.5)
The condition (2.1) shows that the complex vectors ak are orthogonal to the
corresponding wave vectors: ak • k = 0.
The summation in (2.4) is taken over an infinite discrete set of values of the
t We shall take V = 1, in order to reduce the number of factors in the formulae.
5
§2
Photons
6
wave vector (i.e. of its components kXJ kyy kz). The change to an integral over a
continuous distribution may be made by means of the expression d3k/(27r)3 for the
number of possible values of k belonging to the volume element d3k = dkxdkydkz in
k-space.
If the vectors ak are specified, the field in the volume considered is completely
determined. Thus these quantities may be regarded as a discrete set of classical
"field variables". In order to explain the transition to the quantum theory, however,
a further transformation of these variables is needed, whereby the field equations
take a form analogous to the canonical equations (Hamilton's equations) of
classical mechanics. The canonical field variables are defined by
Qk=
V(b) ( a k + a * ) '
(2.6)
and are evidently real. The vector potential is expressed in terms of the canonical
variables by
A = V(4TT) Y (Qk COS k • r - - Pk sin k • rV
k \
(o
(2.7)
)
To find the Hamiltonian H, we must calculate the total energy of the field,
^J"(E 2 + H 2 )d 3 *,
8i
and express it in terms of the Qk and Pk. When A is written as the expansion (2.7),
and E and H are found from (2.2), the result of the integration is
It
Each of the vectors Pk and Qk is perpendicular to the wave vector k, and
therefore has two independent components. The direction of these vectors determines the direction of polarization of the corresponding wave. Denoting the two
components of the vectors Qk and Pk (in the plane perpendicular to k) by Qka, Pka
(a = 1, 2), we can write the Hamiltonian as
H = 22(J>L + <u2QL).
k, a
(2.8)
Thus the Hamiltonian is the sum of independent terms, each of which contains
only one pair of quantities Qka, Pka. Each such term corresponds to a travelling
wave with a definite wave vector and polarization, and has the form of the
Hamiltonian for a one-dimensional harmonic oscillator. This expansion is therefore
often referred to as an oscillator expansion of the field.
§2
Quantization of the Free Electromagnetic Field
7
Let us now consider the quantization of the free electromagnetic field. The
classical description of the field given above makes the manner of transition to the
quantum theory obvious. We have now to use canonical variables (generalized
coordinates Qka and generalized momenta Pka) as operators, with the commutation
rule
PkaQka-QwaPka=-i;
(2.9)
operators with different values of k and a always commute. The potential A and,
according to (2.2), the fields E and H likewise become (Hermitian) operators.
The consistent determination of the Hamiltonian requires the calculation of the
integral
H=^-
f(E 2 + H2)d3x,
07T J
(2.10)
in which Ê and H are expressed in terms of Pka and Qka. However, the fact that the
latter do not commute is actually unimportant, since the products QkaPka appear
multiplied by cos k • r sin k • r, which becomes zero on integration over the whole
volume. The resulting expression for the Hamiltonian is therefore
H = £ 5 ( P L + <o2QL),
(2.11)
It, a
which is, as we might have expected, exactly the same in form as the classical
Hamiltonian.
The determination of the eigenvalues of this Hamiltonian involves no further
calculation, since it is equivalent to the familiar problem of the energy levels of
linear oscillators (QM, §23). We can therefore immediately write down the field
energy levels:
E = 2 ( N k û + i)û>,
(2.12)
where the N ^ are integers.
The further discussion of this formula will be left until §3; here we shall write
out the matrix elements of the quantities Qka, which can be done at once by means
of the known formulae for the matrix elements of the coordinates of an oscillator
(see QM, §23). The non-zero matrix elements are
<Nka\Qka\Nka - 1> = (Nka - l|Qka|Nka>
= V(Nka/2<o).
(2.13)
The matrix elements of the quantities Pka = Qka differ from those of Qka only by a
factor ±iû>.
In subsequent calculations, however, it will be more convenient to replace the
quantities Qka and Pka by the linear combinations a>QkQ ± iPka, which have non-zero
Photons
§2
matrix elements only for transitions Nka->Nka
tors
± 1. We therefore define the opera-
8
1
(<oQka + iP ka ),
(2.14)
1
the classical quantities cka, cj?a are the same, apart from a factor V(2ir/û>), as the
coefficients akan afa in the expansion (2.4). The matrix elements of these operators
are
<Nka - l|c ka |N ka > = (Nka\cL\Nka - l)
= VN k a .
(2.15)
The commutation rule for cka and c ka is obtained by using the definitions (2.14) and
the rule (2.9):
ckacka
(2.16)
ckacka — 1.
For the vector potential, we return to an expansion of the type (2.4), but with
operator coefficients, writing it in the form
(2.17)
À = 2 (^ka Aka + Cl^Ak),
where
Ako = V(4TT) -
?
eikr
(2.18)
The symbol e(a) denotes the unit vectors in the direction of polarization of the
oscillators; these vectors are perpendicular to the wave vector k, and for every k
there are two independent polarizations.
Similarly, for the operators Ê and H we write
E — 2J (CkaEka + CkaE£a),
k.a
(2.19)
H — 2J (c ka H ka + ckaH*„),
with
Eka = »û)Aka,
Hka = n x Eka,
n = k/a>.
(2.20)
The vectors Aka are mutually orthogonal, in the sense that
I Aka • Afa d x = — 8aa OU'.
J
(o
!
(2.21)
§2
Quantization of the Free Electromagnetic Field
9
For, if Aka and Ajfv belong to different wave vectors, then their product contains a
factor e'(k~k)'r, which gives zero on integration over the volume; if they differ only
in polarization, e(a) • e(a'** = 0, since the two independent directions of polarization
are mutually orthogonal. Similar arguments apply to the vectors Eka and Hka. They
are conveniently normalized by imposing the condition
^ j (Eka • Ek*a + Hka • Hk*v) d\ = a)8kk8aa.
(2.22)
Substituting the operators (2.19) in (2.10), and carrying out the integration by
means of (2.22), we obtain the field Hamiltonian expressed in terms of the
operators c, c + :
H = 2 2(o(ckacta + c ka c k J.
(2.23)
This operator is diagonal in the representation considered (the matrix elements of
the operators c and c* being given by (2.15)), and its eigenvalues are of course
(2.12).
In the classical theory, the field momentum is defined as the integral
P = -p- f E x H d3x.
4TT J
In changing to the quantum theory, we replace E and H by the operators (2.19), and
thus easily find
P = S2(PL + o>2QL)n,
k,a
(2.24)
in agreement with the familiar classical relationship between the energy and
momentum of plane waves. The eigenvalues of this operator are
P = ï k ( N k a + |).
k.a
(2.25)
The representation of operators by means of the matrix elements (2.15) is the
"occupation number representation", corresponding to the description of the state
of a system (the field) by specifying the quantum numbers Nka (the occupation
numbers). In this representation the field operators (2.19), and therefore the
Hamiltonian (2.11), act on the wave function of the system, expressed in terms of
the numbers N ka ; let this be 4>(Nka, t). The field operators (2.19) are not explicit
functions of the time. This corresponds to the customary Schrödinger representation of operators in non-relativistic quantum mechanics. The state of the system,
4>(Nka, 0» does depend on the time, and this dependence is governed by
Schrödinger's equation,
id&ldt = H$.
§3
Photons
10
This description of the field is, by its nature, relativistically invariant, since it is
based on the invariant Maxwell's equations. But this invariance is not explicitly
shown, primarily because the space coordinates and the time appear in the
description in a highly asymmetric manner.
In relativistic theory, it is convenient to put the description in a form which is
more obviously invariant. T o do so, we must use what is called the Heisenberg
representation, in which the explicit time dependence is transferred to the operators themselves (see QM, §13). Then the time and the coordinates will appear on an
equal footing in the expressions for the field operators, and the state of the system,
4>, will depend only on the occupation numbers.
For the operator A, the change to the Heisenberg representation amounts to
replacing the factor elk r in each term of the sum (2.17) by e,(k r " w0 , i.e. to regarding
the Akü as the time-dependent functions
Aka =
V(4TT)
^ ^
e-,M-k'r).
(2.26)
This is easily proved by noticing that the matrix element of the Heisenberg
operator for the transition i-+f must include a factor exp{-i(E, - Ef)t}, where E,and Ef are the energies of the initial and final states (see QM, §13). For a transition
in which N k decreases or increases by 1, this factor becomes e~lwt or eltot respectively, a condition which is satisfied by effecting the change mentioned above.
Henceforward, in discussing both the electromagnetic field and particle fields,
we shall always assume that the Heisenberg representation of operators is used.
§3. Photons
We shall now further analyse the field quantization formulae obtained in §2.
First of all, formula (2.12) for the field energy raises the following difficulty. The
lowest energy level of the field corresponds to the case where the quantum
numbers Nka of all the oscillators are zero; this is called the electromagnetic field
vacuum state. But, even in that state, each oscillator has a non-zero "zero-point
energy" equal to {to. Summation over an infinite number of oscillators then gives an
infinite result. Thus we meet with one of the "divergences" which are due to the
fact that the present theory is not logically complete and consistent.
So long as only the field energy eigenvalues are under discussion, we can
remove this difficulty by simply striking out the zero-point oscillation energy, i.e.
by writing the field energy and momentum ast
E = 2N kfl <o,
k.a
P = £Nkak.
k,a
(3.1)
t This procedure can be formally carried out without contradiction if we agree to regard the
products of operators in (2.10) as "normal" products, that is, as products in which the operators c + are
always placed to the left of the operators c. Then formula (2.23) becomes
§3
Photons
11
These formulae enable us to introduce the concept of radiation quanta or
photons, which is fundamental throughout quantum electrodynamics.t We may
regard the free electromagnetic field as an ensemble of particles each with energy co
{=tici)) and momentum k (=nhcolc). The relationship between the photon energy
and momentum is as it should be in relativistic mechanics for particles having zero
rest-mass and moving with the velocity of light. The occupation numbers Nka now
represent the numbers of photons having given momentum k and polarization e(a).
The polarization of the photon is analogous to the spin of other particles; the exact
properties of the photon in this respect will be discussed in §6 below.
It is easily seen that the whole of the mathematical formalism developed in §2 is
fully in accordance with the representation of the electromagnetic field as an
ensemble of photons; it is just the second quantization formalism, applied to the
system of photons.$ In this treatment (see QM, §64), the independent variables are
the occupation numbers of the states, and the operators act on functions of these
numbers. The particle "annihilation" and "creation" operators are of basic importance; they respectively decrease and increase by one the occupation numbers.
The cka and cla are operators of this kind: cka annihilates a photon in the state k, a,
and cia creates a photon in that state.
The commutation rule (2.16) corresponds to particles which obey Bose statistics. Photons, therefore, are bosons, as was to be expected, since the number of
photons that can be in any one state must be unrestricted. The significance of this
will be further discussed in §5.
The plane waves Aka (2.26) which appear in the operator A (2.17) as coefficients
of the photon annihilation operators may be treated as the wave functions of
photons having given momenta k and polarizations e(a). This corresponds to an
expansion of the (//-operator in terms of the wave functions of stationary states of a
particle in the non-relativistic second quantization formalism; however, unlike the
latter, the expansion (2.17) includes both particle annihilation and particle creation
operators. The meaning of this difference is explained in §12.
The wave function (2.26) is normalized by the condition
| ^ : ( | E k a | 2 + |H ka | 2 )d 3 x = cu.
(3.2)
This is the normalization to "one photon in the volume V = 1": the integral on the
left is the quantum-mechanical mean value of the photon energy in the state having
the given wave function. § The right-hand side of (3.2) is just the energy of a single
photon.
The "Schrödinger's equation" for the photon is represented by Maxwell's
equations. In the present case (when the potential A(r, t) satisfies the condition
t This concept is originally due to A. Einstein (1905).
$ The application of the second quantization method to the theory of radiation was first worked out
by P. A. M. Dirac (1927).
§ It should be noted that the factor 1/4-7T in the integral (3.2) is twice the usual factor 1/8 TT (2.10).
This is ultimately due to the fact that the vectors Ek«, Hk« are complex, whereas the field operators Ê, H
are Hermitian.
12
§4
Photons
(2.1)), this leads to the wave equation:
a 2 A/ar 2 -AA-o.
The "wave functions" of the photon, in the general case of arbitrary stationary
states, are complex solutions of this equation, whose time dependence is given by
the factor e~iwt.
In referring to the photon wave function, we must again emphasize that this can
not be regarded as the probability amplitude of the spatial localization of the
photon, in contrast to the fundamental significance of the wave function in
non-relativistic quantum mechanics. This is because, as has been shown, in §1, the
concept of the coordinates of the photon has no physical meaning. The mathematical aspect of this situation will be further discussed at the end of §4.
The components of the Fourier expansion of the function A(r, 0 with respect to
the coordinates form the wave function of the photon in the momentum representation; we denote this by A(k, t) = A(k) e lu>t. For example, in a state with a given
momentum k and polarization e(a), the wave function in the momentum representation is given simply by the coefficient of the exponential factor in (2.26):
A ka (k', a') =
V(4TT)
^ ^
ÔkkôQ a .
(3.3)
Since the momentum of a free particle is measurable, the wave function in the
momentum representation has a more profound physical significance than that in
the coordinate representation: it enables us to calculate the probabilities wka of
various values of the momentum and polarization of a photon in a specified state.
According to the general rules of quantum mechanics, wka is given by the square of
the modulus of the corresponding coefficient in the expansion of the function A(k')
in terms of the wave functions of states with given k and e(a):
wka
2 \L(k\ a') - \(k')
the proportionality coefficient depending on the way in which the functions are
normalized. Substitution of (3.3) gives
wka oc | e (a) • A(k)|2.
(3.4)
Summation over the two polarizations gives the probability that the photon
momentum is k:
wk a |A(k)|2.
(3.5)
§4. Gauge invariance
The field potential in classical electrodynamics is well known to be subject to an
arbitrary choice: the components of the 4-potential AM can undergo any gauge
Gauge Invariance
§4
13
transformation of the form
(4.1)
Ar^A^ + dM,
where \ is a n y function of coordinates and time (see Fields, §18).
For a plane wave, if we consider only transformations which do not change the
form of the potential (proportional to exp (-ik^xM)), the freedom of choice reduces
to the possibility of adding to the wave amplitude any 4-vector proportional to k*\
This arbitrariness in the potential persists in the quantum theory, of course,
where it relates to the field operators or to the wave functions of photons. In order
not to prejudice the choice of the potentials, we must replace (2.17) by the
corresponding expansion for the operator 4-potential,
Â^^ic^L
k,a
+ àLAtt),
(4.2)
where the wave functions A£a are 4-vectors of the form
At = V(4TT) - ^
e-**\
e.e»* = - 1 ,
or more concisely, omitting the four-dimensional vector indices,
Ak = V(41T)
V&>) €""' *** = " L
(4 3)
'
Here the 4-momentum kM = (co, k) (and so kx = cut-k-r), and e is the unit
polarization 4-vector.t
If we consider only gauge transformations which do not alter the dependence of
the function (4.3) on the coordinates and the time, the transformation must be
e» -» e^ + xKy
(4-4)
where \ = x(^) is an arbitrary function. Since the polarization is transverse, it is
always possible to choose a gauge such that the 4-vector e is
e M =(0,e),
e-k = 0;
(4.5)
this will be called the three-dimensionally transverse gauge. In invariant fourdimensional form, this condition becomes the condition of four-dimensional
transversality
ek = 0.
(4.6)
t The expression (4.3) is not in a fully relativistic-covariant (4-vector) form; this is because the
normalization to a finite volume V = l, used here, is not invariant. This is, however, of no fundamental
importance, and is entirely compensated by the advantages of the normalization used. We shall see later
that it allows a simple and straightforward deduction of actual physical quantities in the necessary
invariant form.
14
Photons
§5
It should be noticed that this condition (like the normalization condition
ee* = - 1 ) is preserved by the transformation (4.4), since k2 = 0. If the square of the
4-momentum of a particle is zero, its mass must also be zero. This demonstrates
the relationship between gauge invariance and the zero mass of the photon. Other
aspects of the relationship will be discussed in §14.
There can be no change in any measurable physical quantities under a gauge
transformation of the wave functions of photons concerned in a process. In
quantum electrodynamics this requirement of gauge invariance is of even greater
importance than in the classical theory. We shall see many examples of the fact
that gauge invariance is here, like relativistic invariance, a valuable heuristic
principle.
Gauge invariance is, in turn, closely related to the law of conservation of
electric charge. This aspect will be discussed in §43.
It has already been mentioned in §3 that the coordinate wave function of the
photon cannot be interpreted as the probability amplitude of its spatial localization.
Mathematically, this is shown by the impossibility of constructing from the wave
function any quantity which has even the formal properties of a probability
density. Such a quantity would have to be expressed as a positive-definite bilinear
combination of the wave function AM and its complex conjugate. Moreover, it
would have to satisfy certain conditions of relativistic covariance by being the time
component of a 4-vector. This is because the continuity equation, which expresses
the conservation of the number of particles, is given in four-dimensional form by
the vanishing of the divergence of the current 4-vector. The time component of the
current is here the particle localization probability density; see Fields, §29. On the
other hand, by the condition of gauge invariance, the 4-vector AM could appear in the
current only as the antisymmetric tensor F^ = d^Av - d„AM = -i(k^Av - kvA»). Thus
the current 4-vector would have to be a bilinear combination of F^ and F*v (and
the components of the 4-vector fcM). But such a 4-vector cannot be formed, since
every expression (such as kKF*vFvk) which satisfies the conditions stated is zero by
the transversality condition (kkFpk = 0), and in any case could not be positivedefinite, since it contains odd powers of the components k^.
§5. The electromagnetic field in quantum theory
The description of the field as an ensemble of photons is the only description
that fully accords with the physical significance of the electromagnetic field in
quantum theory. It replaces the classical description in terms of field strengths.
These appear in the mathematical formalism of the photon picture as second
quantization operators.
The properties of a quantum system are known to be similar to the classical
properties when the quantum numbers defining the stationary states of the system
are large. For a free electromagnetic field (in a given volume) this means that the
oscillator quantum numbers, i.e. the photon numbers N k a , must be large. In this
respect the fact that photons obey Bose statistics is of great importance. In the
mathematical formalism of the theory, the relationship of the Bose statistics to the
properties of the classical field is shown by the commutation rules for the operators
§5
The Electromagnetic Field in Quantum Theory
15
ck«, CL- When the Nkn are large, and the matrix elements of these operators are
therefore large also, we may neglect unity on the right-hand side of the commutation rule (2.16), obtaining
these operators thus become the commuting classical quantities cka and cfa, which
determine the classical field strengths.
The condition for the field to be quasi-classical needs to be made more precise,
however, since, if all the numbers N ka are large, the energy of the field is certainly
infinite on summation over all the states k, a, and the condition then becomes
meaningless.
A physically meaningful statement of the problem as to the conditions for a
quasi-classical field can be based on a consideration of values of the field averaged
over some short time interval At. If the classical electric field E (or magnetic field
H) is represented as a Fourier integral expansion with respect to the time, then,
when it is averaged over the time interval At, only those Fourier components
whose frequencies are such that o>At ^ 1 will make a significant contribution to the
mean value Ë, since otherwise the oscillating factor e~tu,t almost vanishes on
averaging. Thus, in determining the condition for the averaged field to be quasiclassical, we need consider only those quantum oscillators whose frequency
a) *s 1/At. It is sufficient that the quantum numbers of these oscillators should be
large.
The number of oscillators having frequencies between zero and a) — 1/At (for a
volume V = 1) is, in order of magnitude,!
(co/c) 3 ~l/(cAt) 3 .
(5.1)
The total field energy per unit volume is proportional to E1. Dividing this by the
number of oscillators and by some mean value of the energy of a single photon
(—ftû)),we find as the order of magnitude of the numbers of photons
Nk-ÉV/foo4.
With the condition that this number should be large, we obtain the inequality
|E|^>V(ftc)/(cAt) 2 .
(5.2)
This is the required condition, which allows the field averaged over time
intervals At to be treated as classical. We see that the field must reach a certain
strength, which increases as the averaging time At decreases. For variable fields,
this time must not, of course, exceed the time during which the field changes
appreciably. Thus variable fields, if sufficiently weak, can never be quasi-classical.
Only for static (time-independent) fields can we make At -*<», so that the right-hand
side of the inequality (5.2) tends to zero. Thus a static field is always classical.
t In this section, ordinary units are used.
16
Photons
§6
It has already been mentioned that the classical expressions for the electromagnetic field as a superposition of plane waves must be regarded in quantum
theory as operator expressions. These operators, however, have only a very limited
physical meaning. A physically meaningful field operator would have to give zero
field values in the photon vacuum state, whereas the mean value of the squared
field operator Ê2 in the ground state, which is the same as îhe zero-point energy of
the field apart from a factor, is infinite; by the "mean value" is meant the
quantum-mechanical mean value, i.e. the corresponding diagonal matrix element of
the operator. This infinity cannot be avoided even by any formal cancelling
operation (as was done for the field energy), since here this would have to be
carried out by means of some appropriate modification of the operators Ê and H
themselves (not their squares), which is impossible.
§6. The angular momentum and parity of the photon
The photon, like any other particle, can possess a certain angular momentum. In
order to determine the properties of this quantity for the photon, let us first recall
the relationship between the properties of the wave function of a particle and the
angular momentum of the particle, in the mathematical formalism of quantum
mechanics.
The angular momentum j of a particle consists of its orbital angular momentum
I and its intrinsic angular momentum or spin s. The wave function of a particle
having spin s is a symmetrical spinor of rank 2s, i.e. is a set of 2s + 1 components
which are transformed into definite combinations of one another when the coordinate axes are rotated. The orbital angular momentum is related to the way in
which the wave functions depend on the coordinates: states with orbital angular
momentum / correspond to wave functions whose components are linear combinations of the spherical harmonic functions of order I.
The consistent distinguishability of the spin and the orbital angular momentum
therefore requires that the "spin" and "coordinate" properties of the wave functions should be independent of each other: the dependence of the spinor components on the coordinates (at a given instant) must not be subject to any
additional restrictions.
In the momentum representation of the wave functions, their dependence on
the coordinates is replaced by their dependence on the momentum k. The photon
wave function (in the three-dimensionally transverse gauge) is the vector A(Jk). A
vector is equivalent to a spinor of rank 2, and in this sense the photon might be said
to have spin 1. But this vector wave function satisfies the transversality condition,
k • A(k) = 0, which is a further condition imposed on the function A(k). Consequently, this function cannot be arbitrarily specified as regards every component
of the vector at the same time, and therefore the orbital angular momentum and the
spin cannot be strictly distinguished.
The definition of the spin as the angular momentum of a particle at rest is also
inapplicable to the photon, because there is no rest frame for a photon, which
moves with the velocity of light.
§6
The Angular Momentum and Parity of the Photon
17
Thus only the total angular momentum of the photon has a meaning. It is,
moreover, obvious that this total angular momentum must be integral, since the
quantities describing the photon do not include any spinors of odd rank.
The state of a photon, like that of any particle, is also described by its parity,
which refers to the- behaviour of the wave function under inversion of the
coordinates (see QM, §30). In the momentum representation, the change of sign of
the coordinates is replaced by the change of sign of all the components of k. The
effect of the inversion operator P on a scalar function <£(k) is simply to produce
this change of sign: P<f>(k) - ^>(~k). When it is applied to a vector function A(k), we
must also take into account the fact that the reversal of the directions of the axes
changes the sign of all the components of the vector; hencet
PA(k) = - A ( - k ) .
(6.1)
Although the separation of the angular momentum of the photon into the orbital
angular momentum and the spin has no physical meaning, it is nevertheless
convenient to define a "spin" s and an "orbital angular momentum" / as formal
auxiliary quantities which express the transformation properties of the wave
function under rotations: the value s = 1 corresponds to the fact that the wave
function is a vector, and the value of / is the order of the spherical harmonics
which occur in the wave function. Here we are considering the wave functions of
states in which the photon angular momentum has a definite value; for a free
particle, these are spherical waves. The number /, in particular, defines the parity of
the photon state, which is
P=(-l)m.
(6.2)
In the same way, the angular momentum operator j may be represented as the
sum s +1. The operator j is related to the operator of an infinitesimal rotation of the
coordinates, or, in the present case, to the action of this operator on a vector field.
In the sum s + Î, the operator s acts on the vector index, transforming the
components of the vector into combinations of one another. The operator Î acts on
these components as functions of the momentum (or of the coordinates).
We may count the number of states (with a given energy) which are possible for
a given value j of the photon angular momentum, ignoring the trivial (2j + l)-fold
degeneracy with respect to the directions of the angular momentum.
When I and s are independent, this calculation is made by simply counting the
number of ways in which the angular momenta 1 and s can be added, according to
the rules of the vector model, so as to obtain the required value of j. For a particle
t We shall choose to define the parity of a state according to the effect of the inversion operator on a
polar vector, such as A (or the corresponding electric vector E = ia>A). This differs in sign from the
effect on the axial vector H = ik x A, since the direction of such a vector is unaltered by inversion:
PH(k) = H(-k).
18
Photons
§6
with spin 5 = 1, and a given non-zero value of j, this would give three states, with
the following values of / and the parity P:
l=L
P = ( - 1 ) I H = (-1)' + I;
l=j±l,
P = ( - l ) ' + l = (-!)''.
If j = 0, however, only one state is obtained, with / = 1 and parity P = +1.
In this calculation the condition that the vector A is transverse has not been
taken into account; all its three components have been assumed to be independent. We must therefore subtract, from the numbers of states found above,
the numbers of states which correspond to a longitudinal vector. This vector may
be written in the form k$(k), whence we see that its three components are
equivalent, as regards their transformation properties (under rotations), to a single
scalar </>.t We can therefore say that the extra state which is incompatible with the
transversality condition would correspond to the state of a particle having a scalar
wave function (spinor of rank 0), i.e. having "spin zero".$ The angular momentum j
of this state is therefore equal to the order of the spherical harmonics which occur
in 0. The parity of the state as a state of the photon is determined by the action of
the inversion operator on the vector function k</>:
P(k<t>) = -(-k)4>(-k) = (- D'kcMk),
and is therefore (- l)j. Thus we must subtract one from the number of states found
above which have the parity (-1) J , i.e. two for jV 0 and one for j = 0.
The conclusion is, then, that when the photon angular momentum j is non-zero
there is one even state and one odd state. When j = 0, no states exist. This means
that a photon cannot have zero angular momentum; j therefore takes only the
values 1,2,3,
The impossibility of j = 0 is evident a priori, since the wave
function of a state with zero angular momentum must be spherically symmetrical,
and this cannot be true for a transverse wave.
The following terminology is customary to denote the various states of the
photon. A photon with angular momentum j and parity (-1) J is called an electric
2'-pole (or Ej) photon; one with parity (-1) , + 1 is called a magnetic 2]-pole (or Mj)
photon. For example, an odd state with j = 1 corresponds to an electric dipole
photon, an even state with j = 2 to an electric quadrupole photon, and an even state
with j = 1 to a magnetic dipole photon.§
t This is because the transformation of a quantity under rotation is a transformation at a given point,
i.e. for a given value of k. Under such a transformation, k<Mk) is unchanged, i.e. it behaves as a scalar.
$ It should be again emphasized that this does not refer to a state of an actual particle. The
calculation given here is a formal one, and amounts mathematically to a classification of the set of
quantities which are transformed into combinations of one another, in terms of the irreducible
representations of the rotation group.
§ This nomenclature corresponds to the terminology of classical radiation theory; we shall see later
(§§46, 47) that the emission of electric and magnetic photons is governed by the electric and magnetic
moments of a system of charges.
§7
19
Spherical Waves of Photons
§ 7. Spherical waves of photons
Having ascertained the possible values of the photon angular momentum, we
must now determine the corresponding wave functions.t
Let us first consider the formal problem of determining vector functions which
are eigenfunctions of the operators j 2 and j 2 , without deciding as yet which of these
functions will appear in the desired photon wave functions, and without taking
account of the transversality condition.
We shall look for the functions in the momentum representation. In this
representation, the coordinate operator is f = idjdk (see QM, (15.12)). The orbital
angular momentum operator is
I = f x k = - i k x d/3k,
and therefore differs from the angular momentum operator in the coordinate
representation only in that r is replaced by k. The solution of the problem is thus
formally identical in the two representations.
Let the required eigenfunctions be denoted by Yim and referred to as spherical
harmonic vectors. They must satisfy the conditions
pYim - j(j + l)Yim,
jzYim = mYim,
(7.1)
the z-axis being in a specified direction in space. We shall show that these
conditions are satisfied by any function of the form aY,m, where a is any vector
formed from the unit vector n = k/co, and Yjm are the ordinary (scalar) spherical
harmonic functions. The latter will everywhere be defined as in QM, §28:
Y,m(n) = ( - l)*"-'"">i« V ( 2 4^(| X +lmD! l ) ! Pl",(C0S $) '**'
(? 2)
'
where 0 and cf> are the spherical polar angles of the direction n.t
The proof is based on the commutation rule
{lh ak}- = ieik\a\
(QMy (29.4)). The right-hand side may be written as - Siak, where s is the operator of
spin 1; the effect of this operator on a vector function is in fact given by
Siûk = -ieik\a\ (see QMy §57, Problem 2). Hence
liük - aJi = -Sjök,
t This problem was first discussed by W. Heitler (1936). The solution given here is due to V. B.
Berestetskil (1947).
t For future reference, the value of the function when 6 = 0 (n is along the z-axis) is
YJm(nz) = i' y]-£~- «mo.
(7.2a)
20
§7
Photons
and therefore
jtük = (I, + Si)ak = a k /,.
Consequently
j : (aY j m ) - a!2 Yjm,
J:(aY jm ) - af z Y im .
Since the spherical harmonic Yjm is the eigenfunction of the operators I2 and \z
which c o r r e s p o n d s to the respective eigenvalues j(j + l) and m, we arrive at
equations (7.1).
T h e three essentially different types of spherical harmonic vectors are obtained
by taking as the vector a the three following v e c t o r s : t
Vn
nxVn
V[j(j + Dr
V[j(j + i)r
'
K
'
The spherical harmonic vectors are thus defined as
Y{€) = —>—
Jm
V[j(j + D]
Y}r = n x Y } ^
V Y
P = (-1V-
)my
P=(-ir1;
YjJi = nYjm,
(7.4)
P=(-iy.
The parity P is also shown for each vector. The three vectors are orthogonal, Y$l
being longitudinal and Y $ and Y}^) transverse with respect to n.
The spherical harmonic vectors can be expressed in terms of the scalar
spherical harmonics: Y^ } in terms of spherical harmonics of the order I = j only,
and Yjm and Yjm in terms of those of order / = j ± 1. This is immediately evident on
comparing the parities shown in (7.4) with the parity (-1) / + 1 of a vector field in
terms of the order of the spherical harmonics concerned.
The spherical harmonic vectors of any one type are orthonormal:
| Y ^ - Y . V do = Sä.fimm..
(7.5)
For the vectors Yjm this is obvious from the normalization condition for the
T The operator V„ = |k|Vk, and acts on functions which depend only on the direction of n. In
spherical polar coordinates its two components are
The operator denoted below by A. is the angular part of the Laplacian operator:
—— — ( ■
m 6— \+
sin0 3 0 V
de)
l
d
sin2 6 ~d4?
Spherical Waves of Photons
§7
21
spherical harmonics Y,m. For the vectors Yjm the normalization integral is
jlJTT^V^
■ VmYfm-do = - j ^
j Y,VAnYimdo,
and, since AnYim = - j ( j + \)Yjm, equation (7.5) follows. The normalization for the
vectors Y)%} leads to a similar integral.
The spherical harmonic vectors (7.4) could also be derived without the direct
verification of equations (7.1) that has been carried out above, using only general
arguments concerning the transformational properties of functions. In §6, these
arguments were employed to show that a vector function n<t> corresponds to an
angular momentum j which is the same as the order of the spherical harmonics
occurring in c/>. If we put simply c/> = Yjm, the function n</> will also correspond to a
definite value m of the angular-momentum component. Thus we derive at once the
spherical harmonic vectors Y $ . But the discussion of transformational properties
in §6 is unaffected if the factor n in the product n</> is replaced by the vector Vn or
by n x Vn. This leads to the other two types of spherical harmonic vectors.
Let us now consider the photon wave functions. For an electric photon of type
Ej, the parity of the vector A(k) is (-1)'. The spherical harmonic vectors Y $ and
Yj„ possess this parity, but only the former satisfies the transversality condition.
For a magnetic photon of type Mj, the parity of the vector A(k) is (-1) J>1 ; only Y]%]
has this parity. The wave functions of a photon having a given angular momentum
j , component thereof m, and energy co, are therefore
Awim(k) = %
CO
5(|k| - co)Yjm(n),
(7.6)
where Yjm must be taken as Y ^ and Y ^ ) for electric and magnetic photons
respectively. The given value of the energy is taken into account by the factor
5(|k|-o>).
The functions (7.6) are normalized by the condition
j^—i
I wco'A*; m (k) • Awim(k) d3k = coô(o/ - cu)^ Smm .
(7.7)
For wave functions of the coordinate representation, the condition (7.7) is
equivalent to the conditibnt
^
| {E*Tm.(r) • Ewim(r) + H* i m (r) • Htt,m(r)} d'x = œô(œf - co)ÔirÔmm :
(7.8)
the integral on the left, when written in terms of the potentials, is
2^ J A*Tm-(r) • A^vnCiOcu'cü d3x,
t This condition is of the same type as (2.22). The factor 8(w' - to) on the right-hand side appears
because we are now considering a field (spherical wave) throughout infinite space instead of in the finite
volume V = 1.
22
§7
Photons
and with
Awj-m(r) =
- /
A wjm (k) e
ik.r
d'k
(2TT)3'
(7.9)
the integral over d3x gives the delta function (2îr) 3 ô(k'-k). This is eliminated by
integrating over d3/c, and the integral reduces to (7.7).
So far, we have assumed that the potentials are in the transverse gauge, for
which the scalar potential <£> = 0. In certain applications, however, other gauges of
the spherical wave may be more convenient.
The transformation of the potentials that can be conducted in the momentum
representation is
A -> A + n/(k),
$ -+ 0> + /(k),
where /(k) is an arbitrary function. In the present case we shall choose it so that
the new potentials are expressed in terms of the same spherical harmonics and
again have a definite parity. For an electric photon, these conditions limit the
choice of potentials to the following:
(k) = ^ J ô ( | k | - û ) ) ( Y B + C n y j m ) ,
CO
(7.10)
:4TT
oLV.(k) = ^72Ô(|k|-co)cyjm,
where C is an arbitrary constant. For a magnetic photon, this addition to A(m)(k)
would leave it without a definite parity, and (7.6) is therefore the only possible
choice under these conditions.
The probability that a photon having a definite angular momentum and parity
will be recorded as moving in a direction n which lies in the solid-angle element do
is, according to (3.5) and (7.6),
w(n) do
!(n)\2 do.
(7.11)
This is the expression for an E photon, but, since |Yj£}|2 = |Y$| 2 , the probability
distribution w(n) is the same for both types of photon.
The squared modulus |Y$| 2 is independent of the azimuthal angle </>, since the
factors e±im<i> in the spherical harmonic functions cancel. The probability distribution w(n) is therefore symmetrical about the z-axis. Moreover, since each of
the spherical harmonic vectors has a definite parity, their squared moduli are
unaffected by inversion, i.e. by the change of polar angle 0-*rr - d; this means that
the expansion of the function w(0) in Legendre polynomials will contain only those
of even order. The determination of the expansion coefficients is equivalent to a
calculation of the integrals of products of three spherical harmonic functions,
Spherical Waves of Photons
§7
23
followed by summation over components. These processes are effected by means
of the formulae derived in QM, §§107, 108, and the result is
w ( e ) = ( _ i r . ( ^ | o ( 4 n + 1)(;
i2n)(i
, 2n){i , 2n J
^
$) (?
^
Finally, we shall give the expressions for the components of the spherical
harmonic vectors as expansions in terms of spherical harmonic functions. To do so,
we shall use the "spherical components" of a vector, defined as in QM, §107. These
components /A of a vector f are
/o=
ft,
/ + i = - ^72 (/x + ify),
(7.13)
/-i = ^ (/x - ify).
In terms of the "spherical unit vectors",
e (0) =ie (2) ,
e(+l)=-77^(cu)+ie(y)),
V2'
e
V2(e
lV }
'
(714)
where e(x,y,z) are unit vectors in the direction of x, y and z, we have
f = E(-l)'"Y-Ae (A) ,
/ A =(-l)'- A f.e ( -'>*
= f • e(A).
(7.15)
The spherical components of the spherical harmonic vectors are expressed in
terms of 3j-symbols and spherical harmonic functions as follows:
(-lV*»WO. - - v j ^ ; 1 , _[ _i)v,*,-«+
(-l)'*'"*"1(Y!:,)» = -V(2j + l)( m , + A _J _j,)n»«. Ï (7.16)
W i ^ _i _ J ) w
These formulae are derived in the following way. Each of the three spherical
harmonic vectors is of the form Yjm = aY,m, where a is one of the three vectors
(7.3). Hence
Yim = 2</m'|a|jm>Y,m.,
§8
Photons
24
and the problem is equivalent to that of finding the matrix elements of the vector a
with respect to the eigenfunctions of the orbital angular momentum. According to
Q M (107.6), we have
I
m'
(lm'\aK\jm)=i(-\Y
>i«w>.
where j m a x is the larger of / and j . It is therefore sufficient to know the non-zero
reduced matrix elements (J||a||j). These are given by the formulae
<l-l||n||l> = </||n||/-l>*=iVl f
</||V n ||/-l) = / ( / - l ) V / ,
< / - l | | V j ) = i(/ + l)V/,
(7.17)
</||nxV n ||0 = iV[/(l + l)(2l + l)L
§8. The polarization of the photon
The polarization vector e acts for the photon as the "spin part" of the wave
function (with the limitations stated in §6 in connection with the concept of photon
spin).
The various cases which can occur with regard to the polarization of the photon
are identical with the possible types of polarization of a classical electromagnetic
wave (see Fields, §48).
Any polarization e can be represented as a superposition of two mutually
orthogonal polarizations e(1) and e(2) (e(l) • e(2)* = 0), chosen in some specified manner.
In the resolution
J2)
e = eie1 ' + e2t
(i)
(8.1)
the squares of the moduli of the coefficients e{ and e2 determine the probabilities
that the photon has polarization e(l) and e(2) respectively.
These polarizations may be taken to be two mutually perpendicular linear
polarizations. We can also resolve any polarization into two circular polarizations
having opposite directions of rotation. The vectors of the right-hand and left-hand
circular polarizations will be denoted by e(+1) and e ( l ) respectively; in coordinates £,
7), £, with the £-axis in the direction of the photon n = k/co,
j+n
^ e (e ( ^ ^+ i îe
e ( î î ),,),
e*<-!>' ' = ^ ( e ( * ) - i e ( 1 , ) ) .
(8.2)
The possibility that the photon has two different polarizations (for a given
momentum) is equivalent to the statement that each eigenvalue of the momentum is
doubly degenerate. This property is closely related to the fact that the mass of the
photon is zero.
I
§8
The Polarization of the Photon
25
A freely moving particle with non-zero mass always has a rest frame. The
intrinsic symmetry properties of the particle, as such, will evidently appear in this
particular frame of reference. Symmetry with respect to all possible rotations about
the centre (i.e. with respect to the entire spherical symmetry group) must be
considered. The property which describes the symmetry of the particle with
respect to this group is its spin s; this determines the degree of degeneracy, the
number of different wave functions which are transformed into linear combinations
of one another being 2s + 1. In particular, a particle having a vector (threecomponent) wave function has spin 1.
If the mass of the particle is zero, however, there is no rest frame, since it
moves with the velocity of light in every frame of reference. For such a particle,
there is always a distinctive direction in space, the direction of the momentum
vector k (the £-axis). In such a case there is clearly no symmetry with respect to
the whole group of rotations in three dimensions, but only axial symmetry about
the preferred axis.
When there is axial symmetry, only the helicity of the particle is conserved, i.e.
the component of its angular momentum along the £-axis, which we denote by A.t
If we also impose the condition of symmetry under reflections in planes passing
through the £-axis, the states differing in the sign of A will be mutually degenerate,
and when A^O there is therefore twofold degeneracy.t The state of a photon
having a definite momentum in fact corresponds to one type of these doubly
degenerate states. It is described by a "spin" wave function which is a vector e in
the ^T)-plane; the two components of this vector are transformed into combinations
of each other by any rotation about the £-axis and by any reflection in a plane
passing through that axis.
The various cases of the polarization of the photon are in a certain relationship
to the possible values of its helicity. The relationship can be deduced from the
formulae in QM (57.9), which connect the components of a vector wave function
with those of the equivalent spinor of rank two.§ Vectors e with only the component
ek - iev or ek + iev non-zero correspond to the components A = +1 or - 1 respectively; these are e = e(+l) and e = e {1) . In other words, the values A = +1 and - 1
correspond to right-hand and left-hand circular polarization of the photon. In §16
the same result will be derived by direct calculation of the eigenfunctions of the
spin component operator.
Thus the component of the photon angular momentum along the direction of its
motion can have only the two values ± 1 ; the value zero is not possible.
A state of the photon having a definite momentum and polarization is a pure
state, in the sense defined in QM, §14; it is described by a wave function, and
corresponds to a complete quantum-mechanical description of the state of the
particle (the photon). "Mixed" photon states are also possible, which correspond to
a less complete description by a density matrix only, not a wave function.
t This is to be distinguished from m, the component of the angular momentum in a specified
direction in space (the z-axis), which was used in §7.
$ This is the method of classifying the electron terms of the diatomic molecule (QM, §78).
§ It is the contravariant spinor components that correspond to the components of the wave function
as the probability amplitudes of various values of the angular momentum of the particle (which are here
considered).
26
Photons
§8
Let us consider a state of the photon which is mixed as regards its polarization,
but corresponds to a definite value of the momentum k. In such a state (called a
state of partial polarization), a "coordinate" wave function exists.
The polarization density matrix of the photon is a tensor paß of rank two, in a
plane perpendicular to the vector n (the ^TJ-plane; the suffixes a, ß take only two
values). This tensor is Hermitian:
Paß = P*a,
(8.3)
Pa« =Pll + p22= 1.
(8.4)
and is normalized by the condition
From (8.3), the diagonal components pu and p22 are real, and either is given in terms
of the other by (8.4). The component pn is complex, and p2i = p*2. The density
matrix therefore involves three real parameters.
If the polarization density matrix is known, we can find the probability that the
photon has any given polarization e. This probability is determined by the "projection" of the tensor paß on the direction of the vector e, i.e. by the quantity
PaßeUß.
(8.5)
For example, the components p u and p22 are the probabilities of linear polarizations
along the £ and TJ axes. The probability of the two circular polarizations is given by
taking the projections along the vectors (8.2):
J[l±i(pi2-P2i)]-
(8.6)
The properties of the tensor paß are essentially the same as those of the tensor
Jaß which describes partially polarized light in the classical theory (see Fields, §50).
Some of these properties are the following.
For a pure state with a definite polarization e, the tensor paß reduces to products
of components of the vector e:
Pjaß = eae*ß,
(8.7)
and the determinant |pa/3| = 0. In the opposite case of an unpolarized photon, all
directions of polarization are equally probable, i.e.
Paß=\8aßy
(8.8)
and |p«p| = i
In the general case, it is convenient to describe the partial polarization by means
of three real Stokes parameters £i, £2, &>t in terms of which the density matrix can
t These are not to be confused with the £-axis.
§8
The Polarization of the Photon
27
be written
n
Pa
_I/l +6
"-2U. + i6
£i-i&\
1-6 >
r89)
(8,9)
All three parameters take values between - 1 and +1. In the unpolarized state,
£i = & = £3 = 0; for a completely polarized phQton, Ç] + £2 + £3 = 1.
The parameter £3 describes the linear polarization along the £ or TJ axis; the
probability that the photon is linearly polarized along these axes is respectively
2(1 + £3) and 2d - £3). The values £3 = +1 and - 1 therefore correspond to complete
polarization in these directions.
The parameter £1 describes the linear polarization along directions at angles
<f> = ±\TT to the £-axis. The probability that the photon is linearly polarized along
This is easily shown by
these directions is respectively 2O + £i) and \{\-%\).
projecting the tensor paß on the directions e = (1, ±1)/V2.
Finally, the parameter £2 represents the degree of circular polarization: according to (8.6), the probability that the photon has right-hand or left-hand circular
polarization is respectively 2O + &) and 2(1 ~ ^2)- Since these two polarizations
correspond to helicities A = ± 1, it is clear that £2 is the mean value of the helicity of
the photon. Moreover, for a pure state with polarization e,
£2 = i e x e * - n .
(8.10)
The quantities £2 and V(£? + £3) are invariant under Lorentz transformations
(see Fields, §50).
We shall later encounter the problem of the behaviour of the Stokes parameters
under the operation of time reversal. It is easily seen that they are invariant. This
property is evidently independent of the state of polarization, and therefore need
be proved only for a pure state. In quantum mechanics, time reversal corresponds
to replacing the wave function by its complex conjugate (QM, §18). For a
plane-polarized wave, this implies the changest
k^-k,
e->-e*.
(8.11)
Under this transformation, the symmetrical part
\{eae% +eße*)
of the density matrix is unchanged, and therefore so are £1 and £3. The fact that &
is unchanged by this transformation is seen from (8.10), and is also evident from
the fact that £2 is the mean value of the helicity: the helicity is the component of
the angular momentum j in the direction of n, i.e. the product j • n, and both these
vectors change sign under time reversal.
t The change in the sign of e is necessary because time reversal changes the sign of the vector
potential of the electromagnetic field. The scalar potential, however, does not change sign, and the effect
of time reversal on the 4-vector e is therefore as follows:
(eo, e)->(eff, -e*).
(8.11a)
§8
Photons
28
In later calculations, we shall need the photon density matrix written in
four-dimensional form, i.e. as a certain 4-tensor p^,. For a polarized photon
described by the 4-vector e^ this tensor can naturally be defined as
p Ml = e^e*.
(8.12)
In the three-dimensionally transverse gauge, e = (0, e), and if one of the spatial
coordinate axes is taken to be along n the non-zero components of this 4-tensor are
the same as (8.7).
For an unpolarized photon the three-dimensionally transverse gauge corresponds to a tensor p^„ having components
Pik = 2(Sjk - n,nk),
poi = pio = poo = 0;
(8.13)
if one of the axes is in the direction of n, the result is again (8.8). It would,
however, be inconvenient to use the tensor p^v in this three-dimensional form. But
a gauge transformation can be applied, which for the density matrix is
Pu* -> Pw + XtMv + X»K>
where the x»
are
(8.14)
arbitrary functions. Putting
Xo = -l/4ci>,
Xi = ki/4o)2,
we obtain instead of (8.13) the simple four-dimensional expression
(8.15)
P^ = -\g^
The four-dimensional form of the density matrix for a partially polarized photon is
easily found by first writing the two-dimensional tensor (8.9) in three-dimensional
form:
Pik
= i(«iM !) + e?e?) + \ £,(« j»eF> + e JM'*) - \ iHe(M2) ~ eiM") + \Ue{Ml)
-
e?e?),
where e(1) and e(2) are unit vectors along the £ and TJ axes. The required generalization is obtained on replacing these 3-vectors by real space-like unit 4-vectors e(]\
e(2) which are orthogonal to each other and to the photon 4-momentum k:
e(l)2 = é>(2)2 = - l ,
*<V2) = 0,
ed)k = emk = 0 -
(8.16)
In one particular frame of reference, eiX) = (0, e(1)) and e{2) = (0, e(2)). Thus the
four-dimensional density matrix of the photon is
-\iHeye?-efe^ + \^ey-efe?).
(8.17)
A Two-photon
§9
System
29
The convenience of any specific choice of the 4-vectors e{l\ ei2) depends on the
conditions of the problem concerned.
It must be noted that the conditions (8.16) do not uniquely define the choice of
e(]) and ea\ If a 4-vector e^ satisfies these conditions, then so does any 4-vector
eM + xK' since k2 = 0. This non-uniqueness occurs because the density matrix is not
invariant under gauge transformations.
The first term in (8.17) corresponds to the unpolarized state. According to (8.15),
it can therefore be replaced by —{g^. This change is again equivalent to a certain
gauge transformation.
The following formal device is useful in calculations with 4-tensors of the form
(8.17) expressed in terms of two independent 4-vectors. We write the tensor (8.17)
in the form
and the coefficients p{ab) as a two-rowed matrix:
9
1^(2.)
p (22)j.
This, like any two-rowed Hermitian matrix, can be written in terms of four
independent two-rowed matrices: the Pauli matrices ax, cry, crz and the unit matrix 1.
The result is
p = !(l + t-cr),
É = (£i,& 6 ) ,
(8.18)
as is easily seen by direct comparison with (8.17), using the expressions (18.5) for
the Pauli matrices. The combination of the three quantities £i, &» & into a "vector"
£ is, of course, purely formal and is done only for convenience of notation.
PROBLEM
Write the photon density matrix for the case where the coordinate "axes" are the circular unit
vectors (8.2).
SOLUTION. The components p'Qß of the tensor relative to the new axes (a, ß = ±1) are obtained by
projecting the tensor (8.9) on the unit vectors (8.2):
P'u = Paße{:"*e{;l\
,1/
pi
l+ £
, = Paße{;"*e{ß-l\ . . . ;
-6+'*\
§9. A two-photon system
By arguments similar to those in §6, we can calculate the number of possible
states in a more complicated case, that of a system of two photons (L. Landau,
1948).
30
Photons
§9
We shall consider the photons in their centre-of-mass system; their momenta
are ki = - k 2 = k.t The wave function of the two-photon system (in the momentum
representation) can be written as a three-dimensional tensor of rank two Aik(n),
formed by a bilinear combination of the vector wave functions of the two photons;
each of the suffixes of this tensor corresponds to one of the photons (n being a unit
vector in the direction of k). The transversality of each photon is expressed by the
orthogonality of the tensor Aik to the vector n:
Ailnl=Q,
Alkn, = 0 .
(9.1)
An interchange of the photons corresponds to an interchange of the suffixes of
the tensor Aik and a simultaneous change in the sign of n. Since photons obey Bose
statistics, we have
A i k (-n) = A w (n).
(9.2)
The tensor Aik is not in general symmetrical with respect to its suffixes. It can
be resolved into symmetric and antisymmetric parts: Aik = sik + aik. The equation
(9.2), and the orthogonality conditions (9.1), must evidently apply to each part
separately. Hence we have
sik(-n) = sik(n),
(9.3)
alk(-n) = -alk(n).
(9.4)
Inversion of the coordinates does not affect the sign of the components of a tensor
of rank two, but changes the sign of n. From (9.3), therefore, the wave function sik
is symmetrical under inversion, i.e. it corresponds to even states of the two-photon
system, while the wave function aik corresponds to odd states.
An antisymmetric tensor of rank two is equivalent (dual) to a certain axial
vector a, whose components are given in terms of those of the tensor by
eik\ being the antisymmetric unit tensor; see Fields, §6. The orthogonality of the
tensor ak{ and the vectorn implies that the vectors a and n are parallel.X We can
therefore write a = nc/>(n), where </> is a scalar; according to (9.4), we must have
a ( - n ) = -a(n), and therefore
<M-n) = 4>(n).
This equation signifies that the scalar <$> can be formed linearly only from spherical
harmonic functions of even order L (including order zero).
t This frame of reference always exists except in the case of two photons moving in the same
direction. The total momentum ki + k: and the total energy o>i f o>2 of such photons are related in the
same way as those of a single photon, and there is therefore no frame of reference in which k, + k2 = 0.
t For aik = eikiQiy and the orthogonality condition gives a^ru = etkiaink = (n x a), = 0.
§9
A Two-photon
31
System
We see that the transformation properties of the antisymmetric tensor aik under
rotations are equivalent to those of a single scalar (cf. the second footnote to §6).
When the latter is assigned a "spin" zero, the angular momentum of the state is
found to be J = L. Thus the tensor aik corresponds to odd states of a photon
system with even angular momentum J.
Let us now consider the symmetric tensor sik. Since this is unaltered when n
changes sign, it corresponds to even states of the photon system. Hence all the
components sik can be expressed in terms of spherical harmonic functions of even
order L (including zero). It is well known that any symmetric tensor sik of rank two
can be expressed as the sum of a scalar sn and a symmetric tensor s'ik with zero
trace (s'a = 0).
The scalar su can be assigned a "spin" zero, and the angular momentum of the
corresponding states is therefore J = L, i.e. is even. The tensor s'ik has "spin" two
(see QM, §57). Adding this "spin" to the even "orbital angular momentum" L by
the law of addition of angular momenta, we find that for a given even J ^ 0 three
states are possible (with L = J ± 2, J), and for odd J ^ 1 two states (with L = J ± 1).
The exceptions are J = 0 with one state (L = 2) and J = 1 with one state (L = 2).
In these calculations, however, we have not yet included the condition that the
tensor sik is orthogonal to the vector n. We must therefore subtract, from the
numbers of states found above, the numbers of states corresponding to a symmetric tensor of rank 2 "parallel" to the vector n. Such a tensor, which we denote
by s"k, can be written as
s"k = riibk + nkbi,
where b is a certain vector. According to (9.3), this vector must be such that
b(-n) = -b(n). Thus the tensor s"k which gives the "unwanted" states is equivalent
to an odd vector. The latter must be expressible in terms only of spherical
harmonics of odd order L. Moreover, the vector has a "spin" one, and therefore,
for any even angular momentum JV 0, two states are possible (with L = J ± 1), and
for any odd J one state (with L = J); an exception is J = 0 with one state (L = 1).
Summarizing the results obtained, we obtain the following table giving the
numbers of possible even and odd states of a two-photon system (with zero total
momentum) for various values of the total angular momentum J:
J
0
1
2k
2k + 1
even
1
0
2
1
odd
1
0
1
0
(9.5)
where k is any positive integer (not zero). We see that for odd J there are no odd
states, and the value J = 1 cannot occur.t
The wave function Aik of the two-photon system determines the correlation
between the polarizations of the photons. The probability that both photons
t Another way of deriving these results is given in §70, Problem 1.
32
Photons
§9
simultaneously have definite polarizations ei and e2 is proportional to
Aikefiefk.
Thus, if the polarization ei of one photon is given, the polarization e2 of the other is
e2k
a
Aikefi.
(9.6)
In odd states of the system, Aik is equal to the antisymmetric tensor aikj and
e2-e* oc a*efi£fk = 0,
so that the polarizations of the two photons are orthogonal. For linear polarization
this means that the directions of polarization are perpendicular; for circular
polarization, that the directions of rotation are opposite.
An even state with J = 0 corresponds to a symmetric tensor which reduces to a
scalar,
Su, = constant x (8ik - riink).
From (9.6), therefore, we have ci = c?. For linear polarization this means that the
directions of polarization are parallel; for circular polarization, that the directions
of rotation are again opposite. The latter result is obvious, since when J = 0 the
sum of the components of the photon angular momenta in the same direction k
must always be zero, because the components in opposite directions k| and k2, i.e.
the helicities, are equal.
CHAPTER II
BOSONS
§ 10. The wave equation for particles with spin zero
IT HAS been shown in Chapter I how a quantum description of the free electromagnetic field can be constructed on the basis of the known properties of the
field in the classical limit and the concepts of ordinary quantum mechanics. The
resulting scheme for describing the field as a system of photons contains many
features which occur also in the relativistic quantum theory description of particles.
The electromagnetic field is a system having an infinite number of degrees of
freedom. For this system there is no law of conservation of number of particles
(photons), and its possible states include states with an arbitrary number of
particles.t In the relativistic theory, systems composed of any particles must in
general share this property. The conservation of number of particles in the
non-relativistic theory depends on the law of conservation of mass: the sum of the
(rest) masses of the particles is unaffected by their interactions, and the constancy
of the total mass in a system of electrons, say, implies that the number of electrons
is also unchanged. In relativistic mechanics, however, there is no law of conservation of mass; only the total energy of the system is conserved, which includes
the rest energy of the particles. The number of particles therefore need not be
conserved, and consequently every relativistic theory of particles must be a theory
of systems having an infinite number of degrees of freedom. That is to say, any
such theory of particles must be a field theory.
The second quantization formalism (QMy §§64, 65) is a satisfactory means of
describing systems with a variable number of particles. In the quantum description
of the electromagnetic field, the second quantization operator is the 4-potential A.
This is expressed in terms of the (coordinate) wave functions of the individual
particles (photons) and their creation and annihilation operators. The quantized
wave function operator has a similar role in the description of a system of particles.
To derive this operator, we must first know the form of the wave function of a
single free particle and the equation satisfied by this function.
The concept of a field of free particles is, it must be emphasized, only an aid to
the theory. Actual particles interact, and the task of the theory is to consider these
interactions. But any interaction is equivalent to a collision, before and after which
the system may be regarded as an ensemble of free particles. It has been remarked
in §1 that the only measurable objects are of this kind. We therefore use the fields
of free particles as a means of describing the initial and final states.
+In reality, of course, the number of photons changes only as a result of various interaction
processes.
33
34
Bosons
§10
Let us first consider the relativistic description of free particles having spin
zero. This case is mathematically simple, and illustrates most clearly the basic ideas
and typical features of the description.
The state of a free particle (with spin zero) can be completely defined by
specifying its momentum p only. The energy e of the particle! is given by
e 2 = p 2 + m2 (where m is the mass of the particle) or, in four-dimensional form,
p2=m2.
(10.1)
The laws of conservation of momentum and energy are well known to be
related to the homogeneity of space and time, i.e. to the symmetry with respect to
any parallel displacement of the 4-coordinate system. In the quantum description,
this requirement of symmetry means that, under such a transformation of the
4-coordinates, the wave function of a particle having a given 4-momentum must be
multiplied by a phase factor (of unit modulus). This can be true only for an
exponential function, with the exponent linear in the 4-coordinates. Thus the wave
function of the state of a free particle with a given 4-momentum pM = (e, p) must be
a plane wave:
constant x e~ip\
px = et - p • r;
(10.2)
the choice of sign of the exponent in the relativistic theory itself is arbitrary, and is
here made in accordance with the non-relativistic case.
The wave equation must have the functions (10.2) as particular solutions for
any 4-vector p which satisfies the condition (10.1). It must be linear, on account of
the principle of superposition: any linear combination of the functions (10.2) also
describes a possible state of the particle, and must therçfore also be a solution.
Finally, the equation must be of the lowest possible order; any higher order would
bring in redundant solutions.
The spin is the angular momentum of the particle in a frame of reference in
which the particle is at rest. If the spin of the particle is s, its wave function in the
rest frame is a three-dimensional spinor of rank 2s. To describe the particle in an
arbitrary frame of reference, its wave function must be expressed in terms of
four-dimensional quantities.
A particle with spin zero is described in the rest frame by a three-dimensional
scalar. This scalar, however, can have more than one four-dimensional "origin":
either a four-dimensional scalar i//, or as the fourth component of a (time-like)
4-vector i//^ of which only the component i//0 is non-zero in the rest frame.t
For a free particle, the only operator that can appear in the wave equation is the
4-momentum operator p. Its components are the operators of differentiation with
respect to coordinates and time:
PM = Î3M = ( Î J J , - Î V ) .
(10.3)
+We denote the energy of a single particle by e, to distinguish it from the energy E of a system of
particles.
tOr, similarly, as the time component of a 4-tensor of higher order; but this would lead to
higher-order equations.
§10
The Wave Equation for Particles with Spin Zero
35
The wave equation must be a differential relationship between the quantities if/
and i/r^ through the operator p. This relationship must, of course, be given by
relativistically invariant expressions. Such expressions are
m</V = P^
P ^ = m*l>>
(10.4)
where m is a dimensional constant characteristic of the particle.!
Substituting i//M from the first equation (10.4) in the second equation, we obtain
(p 2 -m 2 )i// = 0
(10.5)
(O. Klein, and V. A. Fock, 1926; W. Gordon, 1927). The explicit form of this
equation is
- d^4>
s
( - jfi + A)I/> = m%
(10.6)
Substitution of i// as the plane wave (10.2) gives p 2 = m 2 , from which it is evident
that m is the mass of the particle. We may note that the form of equation (10.5) is
in any case obvious a priori, since p 2 is the only scalar operator which can be
derived from p (and, for the same reason, a similar equation is satisfied by every
component of the wave function of a particle having any spin value, as will be seen
on several occasions below).
Thus a particle with spin zero is essentially described by a single (fourdimensional) scalar i/>, which satisfies the second-order equation (10.5). In the
first-order equations (10.4), the wave function is represented by the set of quantities i// and i//^, the 4-vector t//M being the 4-gradient of the scalar if/. In the rest
frame, the wave function of the particle is independent of the (space) coordinates,
and the space components of the 4-vector i//M are therefore zero, as they should be.
In order to continue with the second quantization procedure, it is useful to
express the energy and momentum of the particle as the space integrals of certain
combinations bilinear in if/ and i//*, which represent a kind of space density of these
quantities. We thus have to find an energy-momentum tensor T^v which corresponds to equation (10.5). In terms of this tensor the law of conservation of
energy and momentum is expressed by the equation
dMT? = 0.
(10.7)
Following the general procedure of field theory (see Fields, §32), we write down
a variational principle which would lead to equation (10.5). This principle must be
that the "action integral"
S = [ Ld4x
(10.8)
t The constants m are shown in (10.4) so that i/v and i// shall have the same dimensions. There
would be no point in using different constants n\\ and rr\2 in the two equations, since they could always
be made the same by redefining t|/ or i/v
§10
Bosons
36
of some real 4-scalar L, the Lagrangian density of the field,t should take a
minimum value. Using the scalar i// (and the operator d*1), we can construct a real
bilinear scalar expression of the form
L = drf* • d*$ - m 2 f ^
(10.9)
where m is a dimensional constant. Regarding i// and i//* as independent variables
describing the field ("generalized field coordinates" q), we easily see that
Lagrange's equation
a
dL
dx* dq, M
dL
(iai0)
dq
(where q^ = d^q) is in fact the same as the equation (10.5) for (//and i//*, m being
the mass of the particle. The sign of the expression (10.9) has been taken such that
the square of the time derivative, |dt///dt|2, appears in L with a positive sign;
otherwise, the action could not take a minimum value (cf. Fields, §27). The choice
of the numerical factor in L is arbitrary (and affects only the normalization factor
in (//).
The energy-momentum tensor can now be calculated from the formula
n =2 ^ ^ ; - ^ .
uo.li)
the summation being over all q. Substitution of (10.9) gives
TM„ = M>* • d„«fr + d„0* • drf - Lg»v\
(10.12)
these quantities are real (as they should be), since L is real. In particular,
^f1^"^^2^'
Tl
°" at äT + l F a r -
(1013)
(1014)
The 4-momentum of the field is given by the integral
P^JlVod 3 *,
(10.15)
i.e. Too and T0, act as the energy and momentum densities. The quantity Too is
essentially positive.
t The corresponding second-quantized operator L is called the Lagrangian of the field. To simplify
the terminology, we shall use this term for either the "quantized" or the 44non-quantized" Lagrangian
density, as convenient.
§11
Particles and Antiparticles
37
Formula (10.13) can be used for the normalization of the wave function. A plane
wave, normalized to "one particle in the volume V = 1", is
P
iX
*p=vh)€
(1016)
'
since for this function Too = e, and the total energy in the volume V = 1 is therefore
equal to the energy of a single particle.
The angular momentum, whose conservation is due to the isotropy of space,
can also be expressed as a space integral, but we shall not need this representation.
There is one further conservation law allowed by equations (10.4) in addition to
those arising directly from space-time symmetry. It is easily seen that these
equations and those for (//* lead to the equation
3^=0,
(10.17)
JV = m(^*i/v + i/>*</0
= i[^*a^-(d M ^*)^].
(10.18)
where
Thus j M acts as a current density 4-vector, and (10.17) is the equation of continuity
expressing the law of conservation of the quantity
Q = |jod 3 x,
(10.19)
where
ii= ,=i
' (+'f-f4
(io 2o
- >
It should be noted that }0 need not be positive. This shows that it cannot in
general be interpreted as the probability density of spatial localization of the
particle. The significance of the conservation law expressed by equation (10.17) will
be shown in §11.
§11. Particles and antiparticles
In accordance with the general procedure of the second quantization method,
we have to consider the expansion of an arbitrary wave function in terms of the
eigenfunctions of a complete set of possible states of a free particle, for instance in
plane waves «//p:
</> = 2 a P ^
«/>* = £
a
W
38
Bosons
§11
The coefficients a^,a% are then to be regarded as the annihilation and creation
operators dp, dp of particles in the corresponding states.!
Here, however, we immediately encounter a difference of principle as compared
with the non-relativistic theory. In a plane wave which is a solution of equation
(10.5), the energy e need satsify (for a given momentum p) only the condition
e2 = p2 + m2, i.e. it can have two values, ±V(p2„+ m2). Only positive values of e can
have the physical significance of the energy of a free particle. But the negative
values cannot be simply omitted: the general solution of the wave equation can be
obtained only by superposing all its independent particular solutions. This shows
that the interpretation of the expansion coefficients for i// and i//* in the second
quantization method must be somewhat different.
We may write the expansion in the form
*=?7<b fl, '* , '*' M,+ ?v<b a '~ !v *'""'
(IU)
where the first sum contains plane waves with positive "frequency", normalized
according to (10.16), and the second sum contains those with negative "frequency",
e always denoting the positive quantity +V(p 2 + m2). In the second quantization,
the coefficients aj,+) in the first sum are replaced as usual by the particle annihilation
operators dp. In the second sum, we note that, in the subsequent derivation of the
matrix elements, the time dependence of the terms will correspond to particle
creation, not annihilation: the factor eiet = (e~iet)* corresponds to one extra particle
with energy e in the final state (cf. the end of §2). Accordingly, the coefficients a(p")
are replaced by creation operators b% relating to other particles. If the summation
variable p in the second sum in (11.1) is replaced by - p in order to put the
exponential factor in the form e'l{p'r~Ft)y the (//-operators are obtained as
^ = Svè7) (ap ^ ,px " fb '^ ipx) ' 1
(11.2)
Thus all the operators dp, bp are multiplied by functions with the "correct" time
dependence (~-e~l€f), while the operators dp, bp are multiplied by the complex
conjugate functions. This makes it possible to interpret the former operators, in
accordance with the general rules, as annihilation operators for particles with
momentum p and energy e, and the latter as creation operators for these particles.
In this way we arrive at the concept of particles of two types which occur
simultaneously and on an equal footing. These are called particles and antipartides ; the significance of the names will be shown later. One type corresponds to the
operators dp, dp in the second quantization formalism, and the other type to bp, b p .
The two types of particle have the same mass, since their operators appear in the
same (//-operator.
t The I/J function is given the 4-momentum p as suffix, since we intend to denote the functions with
"negative frequency" by </f-p. The operators d and d + are given the three-dimensional momentum p as
suffix, since this entirely defines the state of an actual particle.
§11
Particles and Antiparticles
39
The reason for these results can also be examined from the point of view of the
requirements of relativistic invariance.
The Lorentz transformations are, mathematically, rotations of the four-dimensional coordinate system which change the direction of the time axis; together with
the purely spatial rotations which do not affect the time axis, they form the Lorentz
group of transformations.! All the Lorentz transformations have the property that
they leave the t axis within the corresponding light cone, and this expresses the
physical principle that there exists a maximum possible velocity of propagation of
signals.
In a purely mathematical sense, the simultaneous change of sign of all four
coordinates (Jour-dimensional inversion) is also a rotation, since the determinant of
this transformation is +1, like that of any rotational transformation. The time axis
is thereby carried from one light cone to the other. Although this means that such a
transformation is physically impossible (as a transformation of the frame of
reference), the only difference mathematically is that, because the metric is
pseudo-Euclidean, such a rotation cannot be effected continuously without allowing also a complex transformation of the coordinates.
It is reasonable to suppose that this difference is unimportant in relation to
four-dimensional invariance. Then any expression which is invariant under the
Lorentz transformations must be invariant under 4-inversion also. A precise
statement of this condition as applied to the scalar (//-operator will be given in §13,
but here it may be noted that the condition will certainly make necessary the
simultaneous presence in the i/f-operators of terms having both signs of e in the
exponents, since this sign is changed by the substitution t - » - 1 .
Let us return now to equations (11.2) and derive the commutation relations
between the operators dp, âp (and bp, bp). For photons (the operators cp, Cp), this was
done on the basis of the analogy with oscillators, that is, essentially from the
properties of the electromagnetic field in the classical limit. Here there is no such
analogy. In deriving the (Bose or Fermi) commutation rules between the operators, we
can be guided only by the form of the Hamiltonian constructed from these operators.
This Hamiltonian is obtained (see QM, §64) by substituting tp and i£+ in place of
ifr and I/J* in the integral / Tood3x.1: We then find
H=2e(â;âp+bpb;).
(11.3)
p
It is easily seen that a reasonable result is obtained for the eigenvalues of this
Hamiltonian only if the operators satisfy the Bose commutation rules:
{dp,dp+}_ = {bp,bp+}- = l
(11.4)
t The set of all three-dimensional (spatial) rotations is itself a group, which constitutes a subgroup of
the Lorentz group. The set of the Lorentz transformations is not itself a group, since the result of
successive Lorentz transformations may be a purely spatial rotation.
t In the non-relativistic theory, the conjugate operator i£* is by convention written to the left of <£.
Here, the order is of no importance, since the interchange of <£ andty*would cause only the interchange
of the equivalent operators ap and bp. However, once a particular order has been selected, the same
order must be used throughout.
40
Bosons
§11
(all other pairs of operators commute, including each particle operator dp, dp with
each antiparticle operator b^ b p ). For, in this case,
p
The eigenvalues of the products dpdp and bpbp are positive integers N p and N p , the
numbers of particles and antiparticles. The infinite additive constant 2 e (the
"energy of the vacuum") may again be simply omitted:
E = 2e(Np
p
+ Np);
(11.5)
cf. formula (3.1) and the footnote to it. This expression is essentially positive, and
corresponds to the idea of two types of actual particles. Similarly, we have for the
total momentum of the system
P = 2P(NP+NP).
p
(11.6)
If, instead of (11.4), we used the Fermi commutation rules (anticommutators
instead of commutators), we should obtain
Û=2,e(â;âp-6;6p+\),
p
and instead of (11.5) the physically meaningless expression X e(N p —Np), which is
not positive-definite and hence cannot represent the energy of a system of free
particles.
Particles with spin zero are therefore bosons.
Next, let us consider the integral Q (10.19). Replacing the functions i// and (//* in
j° by the operators t// and <//*, and carrying out the integration, we obtain
Q = 2(â;âP-bpb;)
= 2(â;âp-b;bp-\).
p
p
(iu)
The eigenvalues of this operator are (omitting the unimportant additive constant
2D
Q = S(NP-NP),
(11.8)
P
and are therefore equal to the differences between the total numbers of particles
and antiparticles.
So long as we are discussing free particles and ignoring any interaction between
them, the law of conservation of the quantity Q is, of course, largely conventional
(like those of total energy (11.5) and total momentum (11.6)): what is actually
§12
Strictly Neutral Particles
41
conserved is not only the sum Q but the numbers Np, N p individually. The nature
of the interaction decides whether the quantity Q is conserved. If Q is conserved
(i.e. if the operator Q commutes with the Hamiltonian of the interaction), the
formula (11.8) shows the limitation imposed by the conservation law on the
possible variation of the number of particles: only "particle-antiparticle" pairs can
be formed or disappear.
If a particle is electrically charged, its antiparticle must have a charge of the
opposite sign: if both had charges of the same sign, the creation or annihilation of
the particle-antiparticle pair would contravene a rigorous law of nature, the
conservation of total electric charge. We shall see later (§32) how the theory
automatically leads to this oppositeness of the charges (for interactions of particles
with an electromagnetic field).
The quantity Q is sometimes called the charge of the field of the particles
concerned. For electrically charged particles Q gives, in particular, the total
electric charge of the system in terms of the unit charge e. But particles and
antiparticles may also be electrically neutral.
Thus we see that the nature of the relativistic relation between the energy and
the momentum (the twofold root of the equation e2 = p 2 + m 2 ) , together with the
requirements of relativistic invariance, leads in the quantum theory to a new
principle of classification of particles: there can exist pairs of different particles
(particle and antiparticle) which are interrelated in the way described above. This
remarkable prediction was first made (for particles with spin 2) by Dirac in 1930,
before the discovery of the first antiparticle, the positron.t
§ 12. Strictly neutral particles
In the second quantization of the (//-function (11.1), the coefficients a(p+) and a(p_)
were treated as operators relating to different particles. This is not necessary,
however: as a particular case, the annihilation and creation operators in t// may
relate to the same particles, as for photons (cf. (2.17)). Then, denoting these
operators by c p and c p , we write the (//-operator as
i = 2^-)(cPe^
+ c;ein.
(12.1)
The field described by this operator corresponds to a system of particles of one
kind only, which may be said to be their own antiparticles.
The operator (12.1) is Hermitian (i//+ = 1//), and in this sense such a field has only
half as many "degrees of freedom" as a complex field for which the operators 1//
and i//+ are not the same.
In consequence, the field Lagrangian, expressed in terms of the Hermitian
operator 1//, must contain a further factor \ in comparison with (10.9):$
L=
l
2(dJ-d»ip-m2ip2).
(12.2)
t The antiparticle concept was extended to bosons by V. Weisskopf and W. Pauli (1934).
t This resembles the extra factor 2 in the operator (2.10) of the electromagnetic field energy density
(when the field is expressed in terms of the Hermitian operators Ê and H), in comparison with the
photon energy density (3.2) expressed in terms of the complex wave function; cf. the last footnote to §3.
42
Bosons
§12
The corresponding energy-momentum tensor is
TV„ = d^ • d„i/) - Lg^v,
(12.3)
and hence the energy density operator is
t00 = td^ldt)2-L
= 2[W/d02 + (V«£)2+m2«£2].
(12.4)
Substituting (12.1) in the integral / Too d*x, we obtain the field Hamiltonian:
H = l22^c;cp + cpc;).
(12.5)
P
This again shows that Bose quantization is necessary:
{c P ,c p + }-=l,
(12.6)
and the energy eigenvalues (again without the additive constant) are
E = 5>pN P .
(12.7)
p
Fermi quantization would lead to the absurd result that E is independent of Np.
The "charge" Q of this field is zero, as is evident from the fact that Q must
change sign when particles are replaced by antiparticles, whereas in the present
case there is no difference between the two. The current density 4-vector therefore
does not exist, since the expression
L = i[*+d^-(dJ+)*j,]
(12.8)
for the operator / of the conserved 4-vector is zero when tjß = </r+ (the vector ipd^
is not itself conserved). This, in turn, means that there is no special conservation
law restricting the possible changes in the number of particles. Such particles must
clearly be electrically neutral.
Particles of this kind are said to be strictly neutral, as opposed to electrically
neutral particles which are not their own antiparticles. Whereas the latter can be
annihilated (transformed into photons) only as pairs, strictly neutral particles can
be annihilated singly.
The structure of the ^-operator (12.1) is similar to that of the electromagnetic
field operators (2.17)-(2.20). In this sense we may say that photons are themselves
strictly neutral particles. For the electromagnetic field, the operators are Hermitian
because the fields are measurable physical quantities (in the classical limit) and are
therefore real. For the ^-operators of particles there is no such relation, since they
do not correspond to any quantities that are directly measurable.
The absence of a conserved current 4-vector is a general property of strictly
neutral particles, and does not require the spin to be zero; for instance, it occurs
§12
Strictly Neutral Particles
43
for photons also. Physically, it expresses the absence of the corresponding prohibitions on a change in the number of particles. There is a direct formal relation
between the absence of a conserved current and the fact that the field is real (the
operator i£ is Hermitian).
The Lagrangian of a complex field,
L = d^+ • d ^ - m 2 i £ + i £ ,
(12.9)
is invariant under multiplication of the (//-operator by any phase factor, i.e. under
the gauge transformations
4>^eia&
^^e-ia4>\
(12.10)
In particular, the Lagrangian is unchanged under the infinitesimal transformation
ip-+ip + i8a • <A, i/^-»i/^-i'8a • i£+.
(12.11)
When the "generalized coordinates" q undergo an infinitesimal change, the
change in the Lagrangian is
^-sdh+é*-)
(with summation over all q). The first term is zero, from the "equations of motion"
(Lagrange's equations). If the "coordinates" q are taken to be the operators <£ and
i// , and with 8<£ = i8a • t//, Si£+ = - i 8a • i£+, we obtain
Hence we see that the condition for the Lagrangian to be invariant (8L = 0) is
equivalent to the equation of continuity (dj* = 0) for the 4-vector
!* = iU+l±-*l±]
02.12)
It is easily shown that, with the Lagrangian (12.9), this formula yields the current
(12.8).
Thus, in the mathematical formalism of the theory, the existence of a conserved
current is related to the invariance of the Lagrangian under the gauge transformations (W. Pauli, 1941). The Lagrangian (12.2) of the strictly neutral field does
not possess this symmetry.
44
Bosons
§13
§ 13. The transformations C P and T
Unlike 4-inversion, three-dimensional (spatial) inversion is not reducible to any
rotations of the 4-coordinate system; its determinant is - 1 , not +1. The symmetry
properties of particles- with respect to inversion (the P transformation) are therefore not determined already by considerations of relativistic invariance.t
The inversion operation, as applied to a scalar wave function, is the transformation
W , r ) = ±W,-r),
(13.1)
where the plus and minus signs on the right correspond to true scalars and
pseudoscalars respectively.
Hence we see that two features of the behaviour of the wave function under
inversion must be distinguished. One of these relates to the coordinate dependence
of the wave function. In non-relativistic quantum mechanics, only this aspect was
considered; it leads to the concept of the parity of the state (which we shall here
call the orbital parity), describing the symmetry properties of the motion of the
particle. If the state has a definite orbital parity (+1 or - 1 ) , this means that
</,a,-r) = ±i//(t,r).
The other feature is the behaviour of the wave function at a given point (which
may conveniently be taken as the origin) under inversion of the coordinate axes.
This leads to the concept of the internal parity of the particle. The two signs in
(13.1) correspond to internal parity +1 and - 1 (for a particle with spin zero). The
total parity of a system of particles is given by the product of their internal parities
and the orbital parity of their relative motion.
The "internal" symmetry properties of various particles appear, of course, only
in their mutual transformation processes. In non-relativistic quantum mechanics,
the analogue of the internal parity is the parity of a bound state of a composite
system, such as a nucleus. In the relativistic theory, which makes no essential
distinction between composite and elementary particles, this internal parity is no
different from the internal parity of those particles which are regarded as elementary in the non-relativistic theory. In the non-relativistic case, where these particles
are regarded as unalterable, their internal symmetry properties are not observable,
and a discussion of these would therefore be devoid of physical significance.
In the second quantization formalism, the internal parity is expressed by the
behaviour of the (//-operators under inversion. Scalar and pseudoscalar fields
correspond to the transformation laws
P : *(t,r)-*±iÊ(f,-r).
(13.2)
The actual significance of the action of inversion on the (//-operator must be
t The Lorentz group together with spatial inversion is called the extended Loreniz group (in contrast
to the original group without P, which in this connection is called the proper Lorentz group). The
extended group includes all transformations which leave the t axis within the corresponding light cone.
§13
The Transformations C, P and T
45
formulated as a particular transformation of the particle annihilation and creation
operators, such as to lead to the result (13.2). It is easily seen that such transformations are
P: d p -+±d_ p ,
bp-+±b-p
(13.3)
(and the same for the conjugate operators). For, on making these changes in the
operator
ku r) = s V(b)(dp e'i<üt+ipr+^ *üi"~'p r)
(13 4)
-
and then changing the notation for the summation variable (p-> - p), we can bring it
to the form ±i£(f, - r ) . Thus, if iP(t, r) denotes the operator after the substitutions
(13.3), we have
<Par) = ±iMt,-r).
(13.5)
The transformation (13.3) is entirely reasonable, since inversion changes the sign of
the polar vector p, and particles with momentum p are therefore replaced by
particles with momentum - p .
In (13.3) the operators dp and bp are transformed either both with the upper sign
or both with the lower sign. In the second quantization formalism, this expresses
the fact that particles and antiparticles (with spin zero) have the same internal
parity, a result which is evident because they are described by the same (scalar or
pseudoscalar) wave functions.
The ^-operator (13.4) is also symmetrical under a transformation which has no
analogue in the non-relativistic theory, that of charge conjugation (the C transformation). If all the operators dp and bp are respectively interchanged:
C: d p ->b p ,
bp-*âp
(13.6)
(i.e. if particles and antiparticles are interchanged), then ^ becomes the chargeconjugate operator t/fC, where
^ c (t f r) = ^ a , r ) .
(13.7)
This equation expresses the symmetry of the concepts of particles and antiparticles
in the theory.
There is an unimportant formal arbitrariness in the definition of the chargeconjugation transformation. The significance of the transformation is unchanged if
an arbitrary phase factor is included in the definition (13.6):
âp^eiabpy
bp^e~iaâp.
4f^eia^\
*p+^e~ia&
This would lead to
46
Bosons
§13
and a twofold repetition of the transformation would again yield an identity
(i/z-n/0. All such definitions are equivalent, however. Since the properties of the
(/^-operators are unchanged on multiplication by a phase factor (cf. the end of §12),
we can simply write \peial2 in place of i//, and thus again obtain the definition of
charge conjugation (13.6), (13.7).
Since charge conjugation replaces a particle by its antiparticle, which is not
identical with it, no new properties of a particle or a system of particles, as such,
will in general arise.
An exception is formed by systems comprising equal numbers of particles and
antiparticles. The operator C transforms such a system into itself, and so in this
case the operator has eigenstates, corresponding to the eigenvalues C = ± 1 (since
C 2 = 1). To describe the charge symmetry, we may regard the particle and the
antiparticle as two different "charge states" of the same particle, differing in the
value of the charge quantum number Q = ± 1. The wave function of the system is
the product of an orbital function and a "charge" function, and must be symmetrical with respect to simultaneous interchange of all the variables (coordinate
and charge) of any pair of particles. The symmetry of the "charge" function
determines the charge parity of the system (see the Problem at the end of this
section), t
The concept of charge parity, which arises in a natural manner for "strictly
neutral" systems, must apply also to strictly neutral "elementary" particles. In the
second quantization formalism, this concept is represented by the equation
<£c = ± ^
(13.8)
where the plus and minus signs correspond to charge-even and charge-odd particles
respectively.
Relativistic invariance implies invariance under 4-in\nersion (see §11). For a
scalar field operator (in the sense of 4-rotations) this means that 4-inversion must
give
*(f,r)-»<£(-f,-r)
with the right-hand side always positive. In terms of transformations of the
operators dp, bp, the transformation of tj/(f, r) into i/>(-f, - r ) is obtained by interchanging the coefficients of e~ipx and etpx in (13.4), i.e. by making
dp->6;,
6p^â;.
(13.9)
Since a-operators are replaced by b -operators, this involves interchange of particles and antiparticles. We see that, in the relativistic theory, there is a natural
requirement of invariance under a transformation in which spatial inversion (P)
and time reversal (T) are accompanied by charge conjugation (C); this is called the
CPT theorem.t
t In this discussion we are considering a particle with spin zero. The treatment given here can be
immediately generalized to other spin values; see, for instance, §27, Problem.
t This theorem was enunciated by G. Lüders (1954) and W. Pauli (1955).
§13
The Transformations C, P and T
47
Here, however, it must be emphasized that, although the arguments given in
§§11 and 12 and the present section are a natural development of the ideas of
ordinary quantum mechanics and classical relativity theory, the results thus
obtained go beyond these both in form ((//-operators including both particle creation
and particle annihilation operators at the same time) and in content (particles and
antiparticles). They cannot therefore be regarded as logically necessary, but
embrace new physical principles whose correctness can be tested only by experiment.
If the operator (13.4) transformed by (13.9) is denoted by t//CPT(t, r), we can
write
* CPT (t,r) = * ( - t , - r ) .
(13.10)
Thus, if 4-inversion is formulated as the transformation (13.9), we thereby
establish also the formulation of the time-reversal transformation of the i/f-operator: together with the combined inversion transformation CP, it must give
(13.9). Using the definitions (13.3) and (13.6), we therefore find
T: <5 p -*±d! p ,
6p^±6t?9
(13.11)
where the signs ± correspond to those in (13.3). The significance of this transformation is obvious: time reversal not only changes motion with momentum p into
motion with momentum - p , but also interchanges initial and final states in the
matrix elements. The annihilation operators for particles with momentum p are
therefore replaced by creation operators for particles with momentum - p . Making
the substitutions (13.11) in (13.4) and changing the notation for the summation
variable ( p - > - p ) , we obtaint
< F a r ) = ± ^ ( - t , r).
(13.12)
This is similar to the general rule for time reversal in quantum mechanics: if a
certain state is described by the wave function ifß(t,r)9 then the "time-reversed"
state is described by the function i//*(—r, r). The change to the complex conjugate
function is necessary because the "correct" time dependence must be restored,
after being lost through the change in the sign of t (E. P. Wigner, 1932).
Since the transformation T (and therefore CPT) interchanges the initial and
final states, there are no eigenstates and eigenvalues, and therefore no new
properties of particles as such. The consequences as regards scattering processes
will be discussed in §§69 and 71.
Let us see how the current 4-vector operator j M (12.8) is affected -by the
transformations C, P and T. The transformation (13.2), together with (d0, 3,-)-*
(do,-d,-), gives
P:(/°j\r^(/°,-î)f.-n
(13.13)
+ If the operation T is defined without regard to the other transformations, there is the same
arbitrariness in the choice of the phase factor as occurs for the operation C. The requirement of CPT
symmetry implies that the phase factor can be chosen arbitrarily for only one of the transformations C
and T.
48
Borons
§13
as we should expect for a true 4-vector. The transformation (13.7) would give simply
C:(/°,j)t,r^(-/°,-jKr
(13.14)
if the operators i// and i//+ commuted. However, the non-commutativity of these
operators is due only to that of the operators dp and dp (or b? and bp) with the
same p, and from the commutation rules (11.4) the interchange of these operators
produces only terms independent of the occupation numbers, i.e. independent of
the state of the field. Omitting these terms as unimportant, as in (11.5), (11.6), we
return to (13.14), whose significance is evident: charge conjugation replaces particles by antiparticles and thus changes the sign of every component of the 4-current.
Since the operation of time reversal involves transposing the initial and final
states, it changes the order of the factors in a product of operators. For example,
Here, however, this is not important: since the (//-operators commute (in the sense
explained above), the result is unaffected by returning to the original order of
factors. Since also (d0, d ( )->(-3 0 , d,) under time reversal, the current transformation
rule is
T:(f 0 ,j), r ->(j°,-j)-, r .
(13.15)
The three-dimensional vector j changes sign, in accordance with its classical
significance.
Finally, for the CPT transformation,
CPT:(/°,j) f , r ->(-j°,-JK- r ,
(13.16)
in accordance with the significance of this operation as 4-inversion. Here it must be
emphasized that, since 4-inversion is a rotation of the 4-coordinate system, it does
not correspond to two types (true and pseudo) of 4-tensors of any rank.
So far, we have assumed that the particles are free; but parity quantum
numbers acquire real significance only when interacting particles are considered
and definite selection rules are imposed which allow or forbid specified processes.
Only conserved properties, however, can have this significance; that is, the eigenvalues of operators which commute with the Hamiltonian of the interacting
particles.
Because of relativistic invariance, the CPT transformation operator always
commutes with the Hamiltonian. For the C and P (and therefore T) transformations separately, experiment shows that the electromagnetic and strong
interactions are invariant, and the corresponding parity quantum numbers are
therefore conserved in these interactions. In a weak interaction, these conservation
laws do not hold.t
t The idea that parity might not be conserved in weak interactions was first put forward by T. D. Lee
and C. N. Yang (1956). The general notion that the laws of physics might not have P and T invariance
had previously been suggested by Dirac (1949).
§13
The Transformations C, P and T
49
Anticipating a little, we may mention that the operator of the interaction
between charged particles and the electromagnetic field is given by the product of
the operator 4-vectors A and /. Since charge conjugation changes the sign of j , the
invariance of the electromagnetic interaction under this transformation means that
the sign of A must also be changed. Thus photons are charge-odd particles.
This behaviour of the operators À is in accordance with the properties of the
4-potential in the classical theory: from the transformations
C:
(A 0 ,Â)->(-Ao,-Â) f . r ,
P:
(Â0,k)^(Âo,-k)u-r9
CPT: ( Â 0 , Â ) ^ ( - Â o , - Â K - r ,
it follows that
T:
(Âo,Â)^(Ao,-ÂKr,
in agreement with the classical rule for the transformation of the electromagnetic
field potentials under time reversal.
The requirement of CPT invariance does not impose any limitations on the
properties of the particles themselves, but it implies certain relations between those
of particles and antiparticles. Firstly, their masses must be equal, as is evident from
the relation described in §11 between 4-inversion and the basis of the concept of
particles and antiparticles. Next, it follows from CPT invariance that there is only a
difference of sign in the proportionality coefficients between the electric and magnetic
moment vectors and the particle and antiparticle spin vector. The magnetic moment
changes sign under the C and T transformations but (being an axial vector) is not
affected by the P transformation. Hence the CPT transformation, which converts a
particle into an antiparticle, does not change the sign of the magnetic moment; the spin
vector does change sign. The same applies to the electric moment, which is unchanged
by time reversal but changes sign under the C transformation and (being a polar
vector) under spatial inversion.
The requirements of P and T invariance (if complied with) restrict the properties of each particle, prohibiting the existence of an electric dipole moment: the
only vector that can be constructed from the ^-operators of an elementary particle
at rest is its spin operator vector, which is P-even and T-odd, and can therefore give
rise to a magnetic moment but not an electric moment. We must emphasize that
either P invariance.or T invariance is sufficient to invoke this prohibition.
PROBLEM
Determine the charge and spatial parities of a system of two particles with spin zero (particle and
antiparticle) and orbital angular momentum of relative motion /.
SOLUTION. Interchanging the coordinates of the particles is equivalent to inversion (about their
centre of mass), and therefore multiplies the orbital function by (-1)'; interchanging the charge variables
is equivalent to charge conjugation, and multiplies the "charge" factor in the wave function by the
required parity C. The condition C(-l) 1 = 1 gives
C = (-!)'.
50
§14
Bosons
The spatial parity P of the system is the product of the orbital parity and the internal parities of the two
particles. Since the particle and the antiparticle have the same internal parity, in this case P is equal to
the orbital parity:
P =(-!)'.
§ 14. The wave equation for a particle with spin one
A particle with spin one is described in its rest frame by a three-component
wave function, a three-dimensional vector; such a particle is often called a vector
particle. The four-dimensional origin of this vector may be as the three spatial
components of the space-like 4-vector i//M or the mixed components of the
antisymmetric 4-tensor ty*v of rank two; the time component </>° and the space
components t//'k are zero in the rest frame.t
The wave equation is a differential relation between the quantities ty* and <^M",
and will be written as the equations
tyvv = PiLfa-prfi»
2
im ^=p^^
(14.1)
(14.2)
with p = id (A. Proca, 1936). Applying the operator pM to both sides of equation
(14.2), we have
p^=0,
(14.3)
since i/v* is antisymmetric.
By substituting (14.1) in (14.2) to eliminate if/^ and using (14.3), we obtain
(p2-m2)</v=0,
(14.4)
whence it is again evident (cf. §10) that m is the mass of the particle. Thus a free
particle with spin one can be described by a single 4-vector i//*\ whose components
satisfy the second-order equation (14.4), and also the further condition (14.3),
which eliminates from if/* the part pertaining to spin zero.
In the rest frame, where i/fM is independent of the spatial coordinates, we find
that pVo = 0.- Since also p V o = m^o, it is seen that in the rest frame t/fo^O, as it
should be, and the ^ik are likewise zero.
A particle with spin one can have different internal parities, according as ty* is a
true vector or a pseudovector. In the former case
Pr = (*0.-*').
and in the latter case
P ^ = (-^°,f).
t Anticipating, we may mention that the ensemble of the 4-vector fa and the 4-tensor ^
corresponds to that of the 4-dimensional spinors of rank two Caß% 1700, Cß> where (aß and TJ^ are
symmetrical spinors changed into each other on inversion (§19).
§14
51
The Wave Equation for a Particle with Spin One
Equations (14.1), (14.2) can be derived from the variational principle, using the
Lagrangian
L = it/vi//""* - 2^"*(d^v - d„«/v) -^"{drf* - drf*) + m 2 «/^*.
(14.5)
The independent generalized coordinates are here represented by i/v, «//*, i^„, i//J„.t
To find the energy-momentum tensor, formula (10.11) is not entirely suitable
here, since it would lead to an unsymmetrical tensor requiring further symmetrization. Instead, we can use the formula
iT^V-g
- -
w
dg^
+
dgliV
,
(14.6)
in which L is assumed to be expressed in a form appropriate to any curvilinear
coordinates (see Fields, §94). If L contains only the components of the metric
tensor gM„ and not their derivatives with respect to the coordinates, the formula
becomes simply
^
2
V^
3(LV-g) =
dg»v
?
dL
dg^v
^v
(since d log g = - g^dg*1").
Since the differentiation in formula (14.6) is not with respect to the quantities
<Ko */V" these quantities need not be regarded as independent when applying the
formula; we may immediately make use of the relationship (14.1) to rewrite the
Lagrangian (14.5) as
L = -faMpg^g"
+ m2i*Î8^
(14.7)
Then
TV, = - t/vAiK* - </>*A</>Î + m2(i^iK + 0*«^) + g ^ i M ^ ' * - m Vî* A )- (14.8)
In particular, the energy density is given by the essentially positive expression
To* = i«Mrô + MÛ + m2(<M4 + Ml).
(14.9)
The conserved current density 4-vector is given by
jM = i ( ^ * ^ - ^ > * ) .
(14.10)
This can be obtained, in accordance with (12.12), by differentiating the Lagrangian
t If the variation were made with respect to fa only (assuming fav already expressed in terms of fa
by (14.1)), equation (14.3) would have to be imposed as an additional condition unrelated to the
variational principle.
52
§14
Bosons
(14.5) with respect to the derivative dMi/v In particular
j 0 =i(iA 0 k *^-^°Vt)
(14.11)
and is not an essentially positive quantity.
A plane wave normalized to one particle in the volume V = 1 is
^ = V ( k ) Ufi€'iP%
u u =
» ** "lf
(1412)
where wM is the unit polarization 4-vector, which, by (14.3), satisfies the condition
of four-dimensional transversality,
1*^=0.
(14.13)
For, on substituting the function (14.12) in (14.9) and (14.11), we obtain
Too=-2eV^* = ^
J°= 1.
Unlike the photon, a vector particle with non-zero mass has three independent
directions of polarization. The corresponding amplitudes are given in (16.21).
The density matrix for partially polarized vector particles is defined so that in a
pure state it reduces to the product
P^ = HMU*
(similarly to (8.7) for photons). According to (14.12) and (14.13), it satisfies the
conditions
P ^ = 0f
p£ = - l .
(14.14)
For unpolarized particles, p^v must have the form ag^ + bp^pv. When the
coefficients a and b are found from (14.14), the result is
P»v = -kg** -PvPJm2).
(14.15)
The quantization of the vector particle field is entirely analogous to the scalar
case, and there is no need to repeat the arguments. The ^-operators of the vector
field are
(d
«f : = z v^T) > * *
eipx+
^ « P «~*>»
where the suffix a labels the three independent polarizations.
(14.16)
§15
The Wave Equation for Particles with Higher Integral Spins
53
As in the scalar case, Bose quantization is necessary because the expression
(14.9) for Too is positive definite and the expression (14.11) for j° is not.
There is a close connection between the properties of strictly neutral vector and
electromagnetic fields. The neutral vector field is described by an Hermitian
(//-operator:
k = 2 V(W){êpaU{;) e~ipx + ap+a"(;>*eiPX)'
(1417)
The Lagrangian of this field is
L = } i M > * - ïfr'idjr
- cMv) + WM*-
(14.18)
The electromagnetic field corresponds to m = 0. The 4-vector ty* then becomes
the 4-potential A*\ and the 4-tensor iff*11' becomes the field tensor F*\ which is
related to the potential by the definition (14.1). Equation (14.2) becomes dv\\ß^v = 0,
corresponding to the second pair of Maxwell's equations. This does not imply the
condition (14.3), which therefore is no longer obligatory. Since the extra condition
has disappeared, there is no need to regard i/^ and ^ as independent "coordinates" in the Lagrangian, and (14.18) becomes
L = -\i^\
(14.19)
in agreement with the familiar classical expression for the Lagrangian of the
electromagnetic field. This Lagrangian, like the tensor i//M„ is invariant under any
gauge transformation of the "potentials" i//^. There is an evident connection
between this property and the zero mass: the Lagrangian (14.18) does not possess
the property, because of the term m2i//Mt£M.
§ 15. The wave equation for particles with higher integral spins
Since the wave equations (14.3), (14.4) follow immediately when the particle
mass and spin are given, the practical utilization of the Lagrangian involves not so
much the derivation of these equations as the establishment of expressions for the
field energy, momentum and charge.
To do so we can, as already mentioned, use in place of (14.5) the expression
(14.7), and the latter can be further transformed as follows. From (14.1), it can be
rewritten as
L = - (M/*)(a^n+idrfwr)+m 2 ^*
The last term is zero, by (14.3), and the one preceding it is a total derivative.
Omitting this, we obtain the Lagrangian
V = - ( a ^ W i / O + m2(//*i//^.
(15.1)
§15
Bosons
54
This has the same form as the Lagrangian (10.9) for a particle with spin zero, the
only difference being that the scalar if/ is replaced by the 4-vector i//^ and the sign is
changed. The change of sign occurs because i/^ is a space-like vector, so that
«/V^*1* < 0> whereas for a scalar particle ipij/* > 0.
On constructing the energy-momentum 4-tensor and the current 4-vector from
the Lagrangian (15.1), we obtain expressions of the same form as (10.12) and
(10.18) for the scalar field:
i ; , = - a^ A * • d A - d^x* • a^i//x - L'g^
U = -imh*k-i*MWkl
(15.2)
(15.3)
Thd difference between these and (14.8), (14.10) is again a total derivative. But it
has already been stressed that the local values of these quantities have no profound
physical significance. Only the volume integrals P^ (10.15) and Q (10.19) are
important, and these will be the same for either choice of T^v and jM.
This method of description can be immediately generalized to particles with any
(integral) spin. The wave function of a particle with spin s is an irreducible 4-tensor
of rank s, i.e. a tensor symmetrical in all its indices and vanishing on contraction
with respect to any pair of indices:
<>
/ ..„.,,.= *. .„.,...,
ifc.^.^O.
(15.4)
This tensor must satisfy the additional condition of 4-transversality:
P ^ . M . = 0,
(15.5)
and each of its components must satisfy the second-order equation
(p2-mV
=0.
(15.6)
In the rest frame, the condition (15.5) means that every component of the 4-tensor
whose indices include a zero must vanish. Thus the wave function in the rest frame
(i.e. in the non-relativistic limit) is equivalent, as it should be, to an irreducible
3-tensor of rank s, the number of independent components of which is 2s + 1.
The Lagrangian, the energy-momentum tensor and the current vector for a field
of particles with spin s differ from (15.1M15.3) only in that ^A is replaced by ^
The normalized plane wave is
ru
= y g ^ ii""" e-*\
«*,..«'"'■■■ = - 1,
(15.7)
the wave amplitude satisfying the conditions
u
M
pM=0.
There are 2s + 1 independent polarization states.
(15.8)
§16
Helicity States of a Particle
55
The quantization of the field is effected by an obvious generalization from the
cases of spin zero and one.
The procedure given above is entirely sufficient for the stated purpose: to
describe a field of free particles. The situation is different if it is proposed to
describe the interaction of the particles with an electromagnetic field. This interaction would have to be included in the Lagrangian in order to yield all the
equations without the need to impose additional conditions. In practice, however,
this description of the interaction is found to be applicable only for electrons, i.e.
particles with spin 2 (see §32). For other spin values, therefore, the problem is only
of methodological interest.
For any spin s > 1 (integral or half-integral), it proves impossible to formulate a
variational principle by means of a single (tensor or spinor) function whose rank
corresponds to the given spin. It is necessary to use additional tensor or spinor
quantities of lower rank. The Lagrangian is then so chosen that these auxiliary
quantities must be zero on account of the free-particle field equations which follow
from the variational principle.t
§ 16. Helicity states of a particle t
In the relativistic theory the orbital angular momentum 1 and the spin s of a
moving particle are not separately conserved. Only the total angular momentum
j = 1 + s is conserved. The component of the spin in any fixed direction (taken as the
z-axis) is therefore also not conserved, and cannot be used to enumerate the
polarization (spin) states of the moving particle.
The component of the spin in the direction of the momentum is conserved,
however: since 1 = r x p the product s • n is equal to the conserved product
j • n (n = p/|p|). This quantity is called the helicity ; it has already been mentionedin
§8 in relation to the photon. Its eigenvalues will be denoted by A (A = - s , . . . , +5),
and states of a particle having definite values of A will be called helicity states.
Let i//pA be the wave function (plane wave) describing the state of a particle with
definite values of p and A, and u(A)(p) its amplitude; to simplify the notation, we
shall omit the indices for the components of this function (4-tensor indices for a
particle with integral spin).
It has been shown in earlier sections that a wave function with more than 2s + 1
components is needed in order to give a relativistic description of particles with
non-zero (integral) spin. But the number of independent components remains equal
to 2s + 1; the "extra" components are eliminated by imposing additional conditions
which cause these components to vanish in the rest frame. In Chapter III this will
be shown for half-integral s also.
According to the formulae for transformation of the angular momentum (see
Fields, §14), the helicity is invariant under those Lorentz transformations which do
not alter the direction of p along which the angular momentum component is taken.
The number A therefore remains a good quantum number under such
t See M. Fierz and W. Pauli, Proceedings of the Royal Society A173,211,1939. The procedure indicated
above is carried out in this paper for particles with spin 3/2 and 2.
t The discussion in this section relates to particles with any spin (integral or half-integral).
56
§16
Bosons
transformations, and the symmetry properties of helicity states can b<e studied by
means of a frame of reference in which the momentum |p| < m (in the limit, the rest
frame). Then ippk reduces to a non-relativistic wave function with 2s + 1 components.
Let its amplitude be denoted by w(A)(n), the argument being the direction n = p/|p|
along which the angular momentum is quantized. The amplitude w(A) is an eigenfunction of the operator n • s:
(n-s)w (A) (n) = Aw(A)(n).
(16.1)
In the spinor representation, w(A) is a contravariant symmetrical spinor of rank 2s;
according to the correspondence formulae (QM, (57.2)), its components can also be
enumerated by the corresponding values of the spin component a along a fixed
z-axis.t
In the momentum representation, the wave functions of the states considered
are essentially the same as the amplitudes u(A)(p):
i//pA(k) = u(A)(k)S(2)(v - n) = u(A)(p)S(2)(v - n),
(16.2)
where the momentum as an independent variable is denoted by k, as contrasted
with its eigenvalue p, and v = k/|k|, as against n = p/|p|.t In the non-relativistic limit,
<Mv) = H>(A)(v)S(2)(v - n) - w(A)(n)Ô(2)(v - n).
(16.3)
This expression should be written in the more explicit form
t//nA(v,c7)=>v!A)(v)ô{2)(v-n),
showing the discrete independent variable o\
The helicity operator s • n commutes with the operators jz and j 2 , since the
angular momentum operator is related to an infinitesimal rotation of the coordinates, and the scalar product of two vectors is invariant under any rotation.
There exist, therefore, stationary states in which the particle simultaneously has
definite values of the angular momentum j , its component jz = m, and the helicity A.
Such states will be called spherical helicity states.
Let us determine the wave functions of these states in the momentum
t These arguments, like the possible values shown for A, apply to particles with non-zero mass. For
massless particles there is no rest frame, and the helicity can take only the two values A = ± s. This is
because of the fact already mentioned in §8, that the states of such a particle are classified by their
behaviour with respect to the axial-symmetry group, which allows only twofold degeneracy of levels (as
regards the properties of the wave equation, this means that in the limit as m -»0 the set of equations for
a particle with spin s separates into independent equations corresponding to massless particles with
spins s, s — l f . . . ) . For example, the photon has A = ± l , and the corresponding w(A) are the threedimensional vectors c(±I) (8.2).
t The delta function S(2) is defined so that
[ 8i2)(v-n)dov=
1.
The delta function which imposes a fixed value of the energy is omitted in (16.2), and similarly in (16.4)
below.
§16
Helicity States of a Particle
57
representation. This may be done by direct analogy with the formulae derived in
QM, §103 for the wave functions of a symmetrical top. They were obtained there
on the basis of the formulae for the transformation of wave functions under finite
rotations (QM, §58). These in turn were based solely on the symmetry properties
with respect to rotation, and are therefore applicable to functions in the momentum
representation just as much as to coordinate functions.
In addition to the coordinates x, y, z fixed in space (with respect to which the
functions i//JmA are written), we shall also use "moving" coordinates £, rj, £, with the
£-axis in the direction of v. Without repeating the argument (cf. the derivation of
QM, (103.8)), we can write
^JmA(k) = i|ig ) D^(v),
where ipfx is the wave function in the moving coordinates, describing the state of a
particle with a definite value of the ^-component of the angular momentum, j^ = A;
in the momentum representation, of course, this function is the same as the
amplitude u(A). The normalized wave function (see below) is
<fcmA<k) = ^
^
D^(v)u (A) (k).
(16.4)
Here, however, there is a question of the choice of phases, because of the
following non-uniqueness: a rotation of the coordinates £, rj, £ relative to x, y, z is
defined by three Eulerian angles a, ß, 7, whereas the direction of v, on which the
particle wave function can alone depend, is defined by the two spherical angles a = <f>
and ß = 6. It is thus necessary to agree on some definite choice of the angle 7. We shall
take 7 = 0 , defining Dfmiv) as
D{pm(v) = D{lU<t>, 0,0) = e1"* d&(0).
(16.5)
From QMy (58.21), the functions (16,5) are seen to satisfy the orthonormality
conditions:
/ D ^ ( v ) D ^ m i ( v ) ^ = 2JTÎ S''h *"-»""■
(16 6)
'
where dov = sin 6 dd d<f>. The orthogonality of the functions i/*irnÀ with respect to the
suffix À is ensured by the factor u(A). Thus the functions t//jmA are orthogonal in all
three suffixes, as they should be, and with the coefficient chosen in (16.4) they are
normalized by the condition
j\4>jmx\2dov=\.
(16.7)
Here we assume that the amplitudes u(A) are normalized to unity: H ( A V A ) * = 1.
Let us now consider the behaviour of the wave functions of helicity states
under inversion of the coordinates. The product of the polar vector v and the axial
vector j is a pseudoscalar. It is therefore obvious that inversion will change a state
58
§16
Bosons
with helicity A into one with helicity - A ; all that is necessary is to determine the
phase factors in these transformations.
Under inversion, v-> - v. The vector v is defined by the two angles cf> and 0, and
the transformation v -» - v is brought about by the changes </> -> <f> + IT, 0 -> TT - 0.
This determines the £-axis but leaves indefinite the position of the £ and TJ axes,
which depends also on the third Eulerian angle y\ the transformation of 0 and <f>
alone does not distinguish, in this sense, between reflection of the coordinates and
rotation of the £-axis. Expressed in terms of all three Eulerian angles, inversion is
the transformation
a = 4> -> <f> + 77,
ß = 0 - » 7r - 0,
y^TT
— y.
(16.8)
Hence, if D(pm(v) is defined as in (16.5) (i.e. with y = 0), and the transformation
v-> - v is regarded as being the result of inversion, then
D{i]m(-V)
= D(Pm((t> + TT, 7T - 0, TT).
(16.9)
From formulae QM (58.9), (58.16) and (58.18) we hence find
D^m(-v) =
e^d{pm(7T'd)eimU^)
= (^y-VmVJ>,m(0)
= (-irAD°U(^,0,o),
or
D ^ ( - v ) = (-iy- A D ( iU(v),
(16.10)
where j - A is an integer.
A similar formula for the spinor wik) can be obtained by noticing that its
components w ^ are the same, apart from a factor, as the functions
w(aA)(v)-D(AsJ(v)*.
(16.11)
For, by applying the transformation formulae QM (58.7) to the spin eigenfunctions
and taking the ^-component of the spin to have a definite value A (i.e. replacing i//im
by Sm A on the right-hand side of QM (58.7), we find that D(^(v) are the spin wave
functions corresponding to definite values of the z and £ components (a and A) of
the spin. The set of these functions with a = - 5 , . . . , + s forms, according to the
correspondence formulae (QM (57.6)), a covariant spinor of rank 2s. The components of the contravariant spinor, which according to the formulae QM (57.2)
correspond to the components w(aA), are transformed as the complex conjugates of
the components of the covariant spinor of the same rank.
From (16.10) and (16.11), we have
w<*>(-v) = ( - l ) s - V - A ) ( v ) ,
(16.12)
where s - A is an integer. The inversion operation applied to w(A), however, not
only changes v into -v but also multiplies w(A) by a common phase factor (the
§16
Helicity States of a Particle
59
"internal parity" of the particle), which we shall denote by TJ:
jV A ) (v) = T)W (A) (-V) = T,(-1) S ~ A W ( ~ A) (V).
(16.13)
For the relativistic amplitude u(A)(k), this transformation becomes
Pu(A)(k) = r)ßu(A)(-k)
= Tj(-irVA)(k),
(16.14)
where ß is a certain matrix which is a unit matrix with respect to the components
of M(A) which remain in the limit |p|->0. It is important to note that this matrix does
not depend on the quantum numbers of the state, and in this sense the difference
between (16.13) and (16.14) is unimportant.t
On applying (16.14) to (16.2), we obtain the law of transformation of the wave
functions of the states |nA):
P<Mv) = r , ( - i r A ^ n , - A ( v ) .
(16.15)
For spherical helicity states, using (16.10) and ( 16.12), we obtain the transformation
law
J%»A(V)
= r,(-iy> ; m ,- A (v).
(16.16)
The states $jm0 are transformed into themselves, according to (16.16), i.e. they
have a definite parity. If A ^ 0, however, only superpositions of states with opposite
helicities have a definite parity:
</>jm|A| = -^2 ^mX
±
^i»».-*)-
(16.17)
On inversion, these are transformed into themselves:
P*BA,(V) = ± r,(-iy->^ A | (v).
(16.18)
It should be noted that in this section we have arrived at a classification of
states of a free particle with a given angular momentum, using only conserved
quantities and without invoking the concept of the orbital angular momentum
(which was employed, for instance, in §§6 and 7 for classifying photon states).
As an example, let us consider the case of spin one. In the rest frame the
amplitudes u(A) (4-vectors) become the three-dimensional vectors e(A\ which here
take the place of the amplitudes w(A). The action of the operator of spin one on the
vector function e is given by the formula
(s,-e)k = -ie,-we,;
(À)
(16.19)
t For example, when s = 1 the amplitudes u are the 4-vectors (16.22); ß is then entirely a unit
matrix with respect to the 4-vector indices, ß^ = 5^. When s = i as we shall see in Chapter III, u u ) is a
bispinor, the phase factor TJ = i, and ß is the Dirac matrix y° (see (21.10)).
60
§16
Bosons
see QM, §57, Problem 2. Thus equation (16.1) becomes
m x e , A ' = Ae
(A)
_
v
(A)
(16.20)
The solutions of this equation (in £t)£ coordinates with the £-axis i n t h e direction of
n) are the same as the spherical unit vectors (7.14):t
e(0)=
e(H)=
f(0,0, 1),
T-^(1,±/,0).
(16.21)
In a frame of reference in which the particle has momentum p, the hèlicity state
amplitudes are the 4-vectors
M(o>,i =
/|pl _Le(0)
\m m
I
(16.22)
If e is a polar vector, then rj = - 1, and the functions (16.17), which are
three-dimensional vectors when s = 1, have the following parities:
*£]AI:P=(-1)\
^m)|Ai:P=(-Di+1,
^mo:P = ( - l ) j .
On comparing with the definition of the spherical harmonic vectors (7.4), we see
that these functions are identical (apart from phase factors) with Yjm\ Y}£\ Y $
respectively. After ascertaining the phase factors (by comparing values for 0 = 0 ,
say), we obtain the equations
(16.23)
where j is an integer; e (x) = n x e U ) are spherical unit vectors along axes £ \ TJ\ £
which are obtained from £, rj, £ by a rotation of 90° about the £-axis.
The last formula (16.23) is equivalent to the expression QM (58.23) for d^miB).
The first or second formula (16.23) leads to a simple expression for the functions
t The choice of phase factors is determined by the condition that the spin operator matrix elements
calculated with the eigenfunctions (16.21) must be in accordance with the general definitions in QM,
§§27 and 107.
§16
61
Helicity States of a Particle
di\.m. We have
« M V ^ ^ U = Y^-e^*
= —7—
e(±,)* • V Y
V[j(i + l ) ] e
The scalar product on the right can be written explicitly in the coordinates £, TJ, £,
with
l±_ J_\ ^ /_3
Vd£'aTj/
1_ J_\
\a0'sin0 d<j>r
With the definitions (7.2) of Yjm and (16.5), the result is
"»■■w-(-ir l V a+ (i m;w )! + i)( ± ^ + ^) i,r(co,>) '
ms0
(16.24)
CHAPTER III
FERMIONS
§17. Four-dimensional spinors
IN THE non-relativistic theory, a particle with arbitrary spin s is described by a
quantity with 2s + 1 components, a symmetrical spinor of rank 2s. These quantities
are, mathematically, realizations of the irreducible representations of the spatial
rotation group.
In the relativistic theory, this group is only a subgroup of the wider group of
four-dimensional rotations, the Lorentz group. It is therefore necessary to develop
the theory of four-dimensional spinors (4-spinors), as quantities which are realizations of the irreducible representations of the Lorentz group. This theory will be
given in §§17-19. In §§17 and 18 we shall consider only the proper Lorentz group,
which excludes spatial inversion; the latter will be dealt with in §19.
The theory of 4-spinors is analogous in structure of that of three-dimensional
spinors (B. L. van der Waerden, 1929; G. E. Uhlenbeck and O. Laporte, 1931).
A spinor £ a is a quantity having two components (a = 1,2); as components of
the wave functipn of a particle with spin 2, £] and £2 correspond to the respective
eigenvalues +1 and -2 of the z-component of the spin. Under any transformation
belonging to the (proper) Lorentz group, the two quantities £* and £2 are transformed into linear combinations of themselves:
* " = ^ , + <2* ; : }
e=ye+s£ .
(,7.D
The coefficients a, ß, 7, S are definite functions of the angles of rotation of the
4-coordinate system, and must satisfy the condition
a8-ßy = 1;
(17.2)
that is, the determinant of the binary transformation (17.1) is equal to unity, as are
the determinants of the coordinate transformations in the Lorentz group.
Because of the condition (17.2), the bilinear form ^ ! E 2 - ^ 2 H 1 (where £ a and
a
H are two spinors) is invariant under the transformation (17.1), and corresponds to
a particle with spin zero which "consists" of two particles with spin 2. In order to
write such invariant expressions in a natural way, the "covariant" components £ft
are used as well as the "contravariant" components £a of the spinor. Their
relationship is governed by the "metric spinor" g a ß :t
ta=gaße>
(17.3)
t The spinor indices will be denoted by the letters at the beginning of the Greek alphabet: a, 0,
62
Four-dimensional Spinors
§17
63
where
*«* = (_? £)•
<i7-4>
so that
«! = «2,
&= -*1.
(17.5)
Then the invariant ^ 1 H 2 -^ 2 E 1 becomes the scalar product £ a E a , and £ a S a =
- £ Ea
The properties so far stated are formally the same as those of three-dimensional
spinors. A difference arises, however, when complex-conjugate spinors are considered.
In the non-relativistic theory, the sum
<^y* + ^fy2*,
(17.6)
which determines the probability density for the localization of the particles in
space, must be a scalar, and the components i//a* must therefore be transformed as
the covariant components of a spinor; the transformation (17.1) must therefore be
unitary (a = 8*, ß = - 7*). In the relativistic theory, however, the particle density
is not a scalar, but is the time component of a 4-vector. The above-mentioned
condition therefore no longer applies, and the transformation coefficients need
satisfy no condition other than (17.2). The four complex quantities a, ß, 7, 8 under
the condition (17.2) alone are equivalent to 8 - 2 = 6 real parameters, in accordance
with the number of angles which define a rotation of the 4-coordinate system
(rotations in six coordinate planes).
Thus complex-conjugate binary transformations are quite different, and in the
relativistic theory there exist two types of spinors. A special notation is customary,
in order to distinguish these two types: the indices of spinors which are transformed by the complex conjugate formulae to (17.1) are written with dots over
them and are called dotted indices. Thus, by definition,
T|B~r*.
(17.7)
where the sign — denotes "is transformed as". The transformation formulae for a
"dotted" spinor are therefore
V ' = a * V + j3*Tj2,
Tj2' = 7 V + 8*n J .
(17.8)
The operations of raising and lowering the dotted indices are carried out in the
same way as for the undotted indices:
TN = T)4,
i72 = - V .
(17.9)
The behaviour of 4-spinors as regards spatial rotation is the same as that of
64
§18
Fermions
3-spinors, for which, as we know, i>* — t//ft. Accoiding to the definition (17.7), the
4-spinor r/a therefore behaves under rotations in the same way as the contravariant
3-spinor <//\ The covariant components r}( and 172 therefore correspond, as the
components of the wave function of a particle with spin 2, to the eigenvalues \ and
-2 of the spin component.
Spinors of higher rank are defined as sets of quantities which are transformed
as products of the components of a number of spinors of rank one. The indices of
these spinors of higher rank may be partly dotted and partly undotted. For
example, there exist three types of spinors of rank two:
In this respect, the statement of just the total rank of a spinor does not uniquely
define it; we shall therefore, where necessary, indicate the rank as a pair of
numbers (k, /), the numbers of undotted and dotted indices respectively.
Since the transformations (17.1) and (17.8) are algebraically independent, it is
not necessary to specify the sequence of dotted and undotted indices; in this sense
the spinors Caß and £ ßa , for example, are the same.
In order to be invariant, every spinor equation must have on each side the same
numbers of undotted and dotted indices, since otherwise the equation could not
remain valid when the frame of reference was changed. Here we must remember
that taking the complex conjugate implies interchanging dotted and undotted
indices. The relationship r)aß = (£ aß )* between two spinors is therefore invariant.
Spinors or their products can be contracted only.with respect to pairs of indices
of the same kind (dotted or undotted); summation with respect to two indices of
different kinds is not an invariant operation. Hence, from the spinor
ra\a2
a
kß)ß2
ßl
M 7
1Q\
which is symmetrical in all k undotted indices and in all / dotted indices, we can
obtain no spinor of lower rank (since contraction with respect to a pair of indices in
which the spinor is symmetrical gives zero). Thus we cannot construct from the
quantities (17.10) a smaller number of linear combinations of them which in turn
are transformed into linear combinations of themselves by every transformation in
the group. That is, the symmetrical 4-spinors are realizations of the irreducible
representations of the proper Lorentz group. Each irreducible representation is
specified by the pair of numbers (k, /).
Each spinor index takes two values, and there are therefore k + 1 essentially
different sets of numbers aj, a 2 , . . . , ak in (17.10) (containing 0, 1, 2 , . . . , k ones and
k, k - 1,. . . , 0 twos) and I + 1 sets of numbers ßu ß 2 , . . . ß/. The symmetrical spinor
of rank (k, /) thus has a total of (k + l)(l + 1) independent components, and this is
also the dimension of the corresponding irreducible representation.
§ 18. The relation between spinors and 4-vectors
The spinor £a/3, with one dotted and undotted index, has 2 x 2 = 4 independent
components, the same as the number of a 4-vector. It is therefore clear that both
§18
The Relation Between Spinors and 4- Vectors
65
are realizations of the same irreducible representation of the proper Lorentz group,
and that there must consequently be a certain relation between their components.
In order to ascertain this relation, let us first consider the corresponding relation
in the three-dimensional case, using the fact that 3-spinors and 4-spinors must
behave in the same manner with respect to purely spatial rotations.
For the three-dimensional spinor iffaß9 the correspondence formulae are as
shown in QMy §57; they will here be written as
where a*, ay, az are the components of a three-dimensional vector a. For the
four-dimensional case, the components ifßaß must be replaced by £a*, and ax, ay, az
must be taken to be the contravariant components a\ a2, a3 of a 4-vector. The
form of the expression for the fourth component a0 is evident from the fact, noted
in §17, that the quantity (17.6) must transform as a0. Hence a ^ ^ ' + E2*, the
coefficient of proportionality being determined so that the scalar £aߣaß is the same
as the scalar 2aMa^ = 2a2.
Thus we obtain the correspondence formulae
aMa'W2*), a° = &" + £*). *
The inverse formulae are
£" = fc = a3 + a°,
xi
l =-tn
x
= a -ia\
t22 = tn = a°-a\
21
]
2
£ = -£■*= a'-f«a J
(18.2)
with
UC* = 2a2.
(18.3)
Moreover
as is seen from the fact that the spinor CaßCyß, of rank two, is antisymmetric in the
indices a, 7, and is therefore proportional to the metric spinor.
The correspondence between the spinor Caß and the 4-vector is a particular case
of a general rule: any symmetrical spinor of rank (Jc, k) is equivalent to a symmetrical 4-tensor of rank k which is irreducible (i.e. which gives zero on contraction with respect to any pair of indices).
The relation between the spinor and the 4-vector may be written in a compact
66
§18
Fermions
form by means of the two-rowed Pauli matricest
<* = (î o)'
a =
> (? "<>)• ^ = (o -?)•
(185)
If the matrix of the quantities Caß (with the indices raised and the first undotted) is
symbolized by £, then formulae (18.2) become
£ = a-<7 + a°,
(18.6)
the second term denoting of course the product of a° and a unit matrix. The inverse
formulae are
a = | t r (£or),
a° = h r £
(18.7)
Using formulae (18.6), (18.7), we can determine the relation between the laws of
transformation of the 4-vector and the spinor, and thus express the law of
transformation of the spinor in terms of the parameters of rotations of the
4-coordinates.
We write the transformation of the spinor £a in the form
r ' = (B£) a ,
B = (a
g),
(18.8)
where B is a two-rowed matrix formed from the coefficients of the binary
transformation. Then the transformation of the dotted spinor is
T,^-(B*7]) ß = (T]B+)^,
(18.9)
and the transformation of the spinor £ a / 3 ^£ a Tj ß , of rank two, may be symbolized
ast £' = BÇB*. For the infinitesimal transformation B = 1 + A, where A is a small
matrix, we have as far as first-order quantities
£ W + (A£ + £A+).
(18.10)
Let us first consider the Lorentz transformation to a frame of reference moving
with an infinitesimal velocity ÔV (without change in direction of the space coordinate axes). Then the 4-vector a* = (a 0 , a) is transformed as follows:
a' = a - a°SV,
a 0 ' = a 0 - a -ÔV.
(18.11)
t To simplify the notation, matrix operators acting on spin variables are written without circumflexes.
t For the covariant components we have
«; = (B-|€)„ = uB-|)rt,
V . = (T,B*- , ) i ,
a
so that the product £ a E of two spinors remains invariant.
<18-8a)
67
The Relation Between Spinors and 4-Vectors
§18
We how make use of formulae (18.7). The transformation of a0 may be represented, firstly, as
a0' = a 0 - a - 8\ = a°-|tr(£«r • ÔV);
secondly, as
a0' = i t r r = a° + itr(À£ + £A+)
= a° + !tr£(À + À+).
These two expressions must be identically equal (i.e. equal for all values of £)•
Hence
X + X + = -<r-ÔV.
Treating the transformation of a in the same way, we find
<TÀ + À + <T=
-SV.
These equations, as equations for X, have the solution
A = A + = -i«r-ÔV.
Thus an infinitesimal Lorentz transformation of the spinor £a has the matrix
B = l-^cr-nSV,
(18.12)
where n is a unit vector in the direction of the velocity ÔV. From this we
can easily find the transformation for a finite velocity V. To do so, we recall
that a Lorentz transformation signifies (geometrically) a rotation of the 4-coordinates in the plane of t and n through an angle </> which is related to the velocity V
byt tanh <f> = V. An angle 8<f> = ÔV corresponds to an infinitesimal transformation,
and a rotation through a finite angle <f> is carried out by a <t>l8<j>-{old repetition of a
rotation through 8<f>. Raising the operator (18.12) to the power <j>l8<f> and taking the
limit 8<j> -* 0, we obtain
B = «-*♦■•*.
(18.13)
The mathematical significance of this operator is seen by noticing that, from the
properties of the Pauli matrices, all even powers of n • <r are equal to 1, and all odd
powers are equal to n • a. Since the expansions of the hyperbolic sine and cosine
contain respectively odd and even powers of the argument, we have finally
B = cosh i<£ - n • a sinh \<t>>
tanh 4> = V.
t The metric is pseudo-Euclidean in planes containing the time axis.
(18.14)
68
Fermions
§19
The matrices B of the Lorentz transformations are Hermitian: B = B*
Let us now consider an infinitesimal rotation of the space coordinates. The
three-dimensional vector a is transformed as follows:
a' = a - 0 9 x a ,
(18.15)
where 68 is the vector of the infinitesimal angle of rotation. The corresponding
transformation of a spinor may be found similarly. There is no need to do so,
however, since the behaviour of 4-spinors under spatial rotations is the same as
that of 3-spinors, and the transformation of the latter is known from the general
relationship between the spin operator and the operator of an infinitesimal rotation:
B = l+iia-88.
(18.16)
The change to a rotation through a finite angle 0 is made in the same way as that
from (18.12) to (18.14):
B = exp (\idn • a ) = cos \d + in • a sin \0,
(18.17)
where n is a unit vector along the axis of rotation. This matrix is unitary (B+ = B" 1 ),
as it should be fol* a spatial rotation.
§ 19. Inversion of spinors
The discussion (in QM) of the three-dimensional theory of spinors did not
consider their behaviour under the operation of spatial inversion, since in the nonrelativistic theory this would not have led to any new physical results. Here we
shall examine the point, however, in order to make clearer the subsequent analysis
of the inversion properties of 4-spinors.
The operation of inversion does not alter the sign of the spin vector, or of any
axial vector, and the spin component sz is therefore also unchanged in value.
Hence it follows that inversion can change each component of the spinor i//a only
into a multiple of itself:
i// a ^Pi// a ,
(19.1)
where P is a constant factor. On repeating the inversion, we return to the original
coordinates. For a spinor, however, a return to the original position can be
regarded in two different ways, as a rotation through 0° or 360°. These two
definitions are not equivalent with respect to spinors, since ifta changes sign on
rotation through 360°. Thus two alternative views of inversion are possible: one
where
P2=l,
P = ±l,
(19.2)
§19
Inversion of Spinors
69
and one where
P2=-l,
P=±i.
(19.3)
Here it is important to note that the concept of inversion must be defined in the
same way for all spinors. It is not permissible for different spinors to behave
differently under inversion (i.e. in accordance with both (19.2) and (19.3)), since in
that case it would not be possible to construct a scalar (or a pseudoscalar) from
every pair of spinors: if the spinor ifja were transformed according to (19.2), and <t>a
according to (19.3), then the quantity if/a<t>a would be multiplied by ±i under
inversion, instead of remaining constant (or simply changing sign).
It should be emphasized that (whatever the definition of inversion) the assignment of a particular parity P to a spinor has no absolute significance, since spinors
change sign on rotation through 27r, and this can always be carried out
simultaneously with inversion. The "relative parity" of two spinors, defined as the
parity of the scalar ifja<f>a formed from them, has absolute significance, however; on
rotation through 27r, both spinors change sign, and the indeterminacy therefore
does not influence the parity of this scalar.
Let us now go on to discuss four-dimensional spinors, first noting that inversion
changes the sign of only three coordinates x, y, z out of four x, y, z, t; it
therefore commutes with spatial rotations but not with transformations which
rotate the t-axis. If L is the Lorentz transformation to a frame of reference moving
with velocity V, then PL = UP, where Û is the transformation to a frame moving
with velocity - V .
Hence it follows that the components of the 4-spinor £a cannot be transformed
into multiples of themselves under inversion. If the inversion of the spinor £a were
given by the transformation (19.1) as before (i.e. if it were represented by a matrix
proportional to the unit matrix), it would commute with every Lorentz trans^
formation, and this certainly cannot be true, since the operations L and Û are not
the same when applied to £a.
Thus inversion must transform the components of the spinor £a into expressions involving other quantities. The latter can only be the components of some
other spinor r\a whose transformation properties are not the same as those of £a.
Since inversion does not affect the z-component of the spin (as mentioned above),
the components Ç1 and £2 can only become TJI and rji on inversion, these corresponding to the same values s2 = \ and sz = - 2 - If inversion is taken to be an
operation which gives identity when carried out twice, its effect may be expressed
by the formulae
r-Tfc,
Tfc->r.
(19.4)
For the covariant components £a and contravariant components TJ", these transformations change sign:
&->-T» d ,
V-+-Ê»
(19.4a)
since the lowering and raising of the same index lead to opposite signs (cf. (17.5)
70
Fermions
§19
and (17.9)).t If, however, inversion is taken in the sense such that P 2 = — 1, its
effect is given by
r-+")*>
Tfc-MT
(19.5)
or, equivalently,
&->-iV\
Và-*-iè<,
(19.5a)
There is a certain difference between the two definitions of inversion in that
with the second definition complex-conjugate spinors are transformed in the same
manner: if Ha = TJ *, Hd = £"*, then by (19.5) E« -» - iH°', H° -+ - iE«, i.e. the rule is the
same as for £a, 17". According to the definition (19.4), however, we should obtain
E a - * H a , H a ->E a , which is opposite in sign to the transformation of the spinors £,,
T)a. We shall return in §27 to some possible physical consequences of this
difference.
In the following, the definition (19.5) will be used.
The spinors £a and 7)à are, as we know, transformed in the same way by the
rotation subgroup. On taking the combinations
E°±tfc
(19.6)
we obtain quantities which are transformed under inversion according to (19.1)
with P = ± i. These combinations, however, do not behave as spinors under all the
transformations of the Lorentz group.
Thus the inclusion of inversion in the symmetry group makes necessary the
simultaneous treatment of a pair of spinors (£ a , TJ«); this is called a bispinor (of
rank one). The four components of a bispinor form a realization of one of the
irreducible representations of the extended Lorentz group.
The scalar product of two bispinors (£ a, TJ«) and (E a , H d ) can be formed in two
ways. The quantity
f B . + T|«H"
(19.7)
is unchanged by inversion, i.e. it is a true scalar. The quantity
rE.-TfcH*
(19.8)
is also invariant under rotations of the 4-coordinates, but changes sign under
inversion, i.e. it is a pseudoscalar.
^
A spinor of rank two, f "*, may also be defined in two ways. If it is defined by
the transformation rule
£«'~£"H* + E V f
(19.9)
t The definition (19.4) is, of course, to some extent arbitrary, since the quantities £ a and Tja are
independent. For instance, if tja is replaced by a new spinor tji = el5rjb, (19.4) is replaced by the equivalent
definition
§19
71
Inversion of Spinors
we obtain quantities which are transformed under inversion as follows:
(19.10)
C^U-
The 4-vector a* to which such a spinor is equivalent is transformed, according to
(18.1), by (a0, a)-»(a 0 ,-a), i.e. it is a true 4-vector, and the three-dimensional
vector a is a polar vector.
It is also possible, however, to define £aß thus:
£«0~£«H'-EV-
(19.11)
Thent
(19.12)
?*-+-£*.
Such a spinor corresponds to a 4-vector such that under inversion (a 0 , a)-»(-a 0 , a),
i.e. a 4-pseudovector (the three-dimensional vector a being an axial vector).
Symmetrical spinors of rank two, with indices of the same type, are defined by
r/3
_
r5ß
+
pBa9
^
_
VaHß
+
^
H Q
(19:i3)
On inversion they are transformed into each other:
r*-*-r, a 0.
(19.14)
The pair (£aß9 TJ^) forms a bispinor of rank two. It has 3 + 3 = 6 independent
components. The antisymmetric 4-tensor of rank two a*v also has this number of
independent components. There must therefore be a certain correspondence between the bispinor and the tensor; both are realizations of equivalent irreducible
representations of the extended Lorentz group.
Since the spinors Caß and t)aß are transformed independently by the proper
Lorentz group, we can construct from the components of the 4-tensor a*1" two
groups of quantities which are transformed only into combinations of one another
under any rotation of the 4-coordinates. This division is achieved as follows.
We define a three-dimensional polar vector p and a three-dimensional axial
vector a related to the components of the 4-tensor a*v by
(
0
px
py
Pz
m
(p, a),
(19.15)
-px
0 -a2
ay
-py
az
0 -ax
-pz - a y
ax
0
where (p, a) is a concise notation which we shall use in order to specify the
t It must be emphasized that the transformation rules (19.10) and (19.12), which differ in the sign on
the right, are not equivalent, since components of the same spinor appear on both sides (cf. the last
footnote),
72
§19
Fermions
components of such a tensor. Then aßV = ( - p , a), and, of the two quantities
a2 - p2 = {a^a^,
a• p=
le^a^a*",
the first is a scalar and the second a pseudoscalar; both are invariant under the
proper Lorentz group.-The squares of the three-dimensional vectors f± = p ± ia are
therefore also invariant. Thus any rotation in 4-space is equivalent, as regards the
vectors f*, to a "rotation" in 3-space, through angles which are in general complex;
the six angles of rotation in 4-space correspond to three complex "angles of
rotatign" of the three-dimensional coordinates. The operation of spatial inversion
changes the sign of p but not that of a, and converts the vectors f+ and -f" into
each other. The components of these vectors are the required two groups of
quantities formed from the components of the tensor a*1".
This also makes evident the correspondence between the components of the
4-tensor a*v and the spinors £ aß , 7)aß. Since the Lorentz group contains as a
subgroup the spatial rotations, the relations between the components of the spinor
and those of the three-dimensional vector must be the same as for three-dimensional
spinors:
f~x = kvii
- TJii),
/ y = 2Î(T)22 + T|ü),
n = -nu.
(19.16)
PROBLEM
Derive the general correspondence between spinors of even rank and 4-tensors.
SOLUTION. All spinors for which k + / is even are realizations of single-valued irreducible
representations of the extended Lorentz group, and are therefore equivalent to the 4-tensors which are
realizations of similar representations.t
A spinor of rank (k, k) can be defined so that it is transformed under inversion by
r3
yS
--*±u...7*....
(i)
Such a spinor is equivalent to a symmetrical irreducible 4-tensor of rank k, which is a true tensor or a
pseudotensor according to the sign in (1).
Spinors of ranks (k, I) and (/, k), forming a bispinor, are transformed under inversion by
k
i
When I = k + 2, the bispinor is equivalent to an irreducible 4-tensor a^^po... of rank k + 2, antisymmetric
in the indices [piv] and symmetric in all the other indices. The irreducibihty of this tensor signifies that it
gives zero on contraction with respect to any pair of indices and on dualization with respect to any
three indices (i.e. eklAVpa[^v\pa... = 0); the latter condition implies that the result is zero on taking the cyclic
sum over three indices, \LV and any one other.
When / = k + 4, the bispinor is equivalent to an irreducible 4-tensor a^H^iar.. of rank k + 4, having the
following properties: it is antisymmetric in the pairs of indices [A/n] and [vp], symmetric in all others,
symmetric for interchange of [Xfi] with [i>p], and gives zero on contraction with respect to any pair of
indices and on dualization with respect to any three indices.
Generally, when I = k -I- 2n, the bispinor is equivalent to an irreducible 4-tensor of rank k + 2n,
antisymmetric in n pairs of indices and symmetric in the other k indices. 4-tensors antisymmetric in
t Spinors of odd rank are realizations of fwo-valued representations of the group: a spatial rotation
through 360° changes the sign of spinors, so that two matrices of opposite sign correspond to each
element of the group.
§20
Dime's Equation in the Spinor Representation
73
larger numbers (threes, fours, etc.) of indices do not appear in this classification, for the obvious reason
that an antisymmetric tensor of rank 3 is equivalent (dual) to a pseudovector, and an antisymmetric
tensor of rank 4 reduces to a scalar (is proportional to the unit pseudotensor ex^p)\ antisymmetry in a
still greater number of indices is not possible in 4-space.
§ 20. Dirac's equation in the spinor representation
A particle with spin 2 is described, in its rest frame, by a two-component wave
function, i.e. a three-dimensional spinor. The "four-dimensional origin" of this may
be either an undotted or a dotted 4-spinor. Both these 4-spinors appear in the
description of the particle in an arbitrary frame of reference; we shall denote them
by r and rj a .t
For a free particle, the only operator which can appear in the wave equation is
(as shown in §10) the 4-momentum operator pM = îdM. In the spinor notation, this
4-vector corresponds to the operator spinor paßy with
P,,
=
P22 = Pz+Po,
Mi
p l = -P2i = p* -iPy.
P 22 = Pii = Po-Pz, 1
21
P = - P i i = P* + iPyJ
(20.1)
The wave equation is a linear differential relation between the components of
spinors, expressed by the operator paß. The requirement of relativistic invariance
leads to the equations
A aß ^ .
—»£<*
p
p r)ß - mÇ ,
(20.2)
PßaCa = wi?ß,
where m is a dimensional constant. There would be no meaning in using different
constants m\ and m2 here, or in changing the sign of m, since the equations could
still be reduced to the above form by an appropriate transformation of £ a or r)a.
By substituting tjß from the second equation (20.2) in the first, we can eliminate
one of the two spinors:
From (18.4), pa*pyß = p 2 ô", and thus we obtain
( p 2 - m 2 ) r = 0,
(20.3)
whence it is evident that m is the mass of the particle.
It should be noticed that the need to use the mass in the wave equation implies
the simultaneous consideration of two spinors (£ a and r)a): with only one of these,
it would not be possible to construct a relativistically invariant equation containing
t A three-dimensional spinor of rank one may also "originate" from 4-spinors of higher odd rank
which, in the rest frame, become antisymrtietric in one or more pairs of indices. These would, however,
lead to higher-order equations (cf. the third footnote to §10).
74
§20
Fermions
a dimensional parameter. The wave equation is necessarily invariant under spatial
inversion if the transformation of the wave function is defined by
**-►«*•.
P: *■->"?«
(20.4)
It is easily seen that the two equations (20.2) are interchanged by this substitution
(together with p"ß-*paß, which is evident from (20.1)). Two spinors which are
interchanged by inversion form a four-component quantity, a bispinor.
The relativistic wave equation given by (20.2) is called Dirac's equation, having
been first derived by Dirac in 1928. In order to analyse und apply this equation
further, let us consider various ways in which it may be written.
Using (18.6), we can rewrite equations (20.2) as
(Po + p-oO-n = m&
(Po ~ P * or)£ = mi).
(20.5)
Here the symbols £ and TJ denote two-component quantities, the spinors
(the first with upper and the second with lower indices). Here and below,
multiplication of the matrices a by any two-component quantity / means multiplication by the usual matrix rule:
(<rf)a = <raßfß.
(20.7)
The vertical column notation for / is in accordance with the multiplication of each
row of a by the column /.
For subsequent reference, the Pauli matrices may be written once more;
-Ci).
H?-.!). -G-Ï)-
Their fundamental properties are
Ojcrjc + flTkO-, = 26jk, 1
.
_
o-j<rk = iCikio-j + s,k ; J
(20.9)
see QM, §55.
We shall also give the wave equation satisfied by the complex-conjugate wave
function formed from the spinors
É*«« 1 *,* 2 *),
TÏ* = « , * ? ) .
(20.10)
Since all the operators p\ contain the factor i,p*= -p M . In taking the complex
§21
The Symmetrical Form of Dime's Equation
75
conjugate of both sides of equation (20.5), we must also use the fact that, since the
matrices a are Hermitian (cr* = cr),
(o/)î = a*flfî=/|crflB = (/*cr)a;
the resulting equations are
,•».+»->—«*,i
(20ll)
^*(Po-p-a) = -mf}*.J
Here it is conventionally implied that the operators p*1 act on the function to the
left of them. The writing of £* and -n* as horizontal rows is in accordance with the
matrix multiplication in these equations: the row / is multiplied by the columns of
the matrix a,
(/*<r)a=/S<rßa.
(20.12)
The inversion transformation for £*, TJ* is defined as the complex conjugate of
the transformation (20.4):
P:
f*-*-h|!,
*i!->-£•*.
(20.13)
§21. The symmetrical form of Dirac's equation
The spinor form of Dirac's equation is the most natural one, in the sense that its
relativistic invariance is immediately apparent. In applications of the equation,
however, other forms of the wave equation may be more convenient, which are
obtained by a different choice of the four independent components of the wave
function.
We shall denote the four-component wave function by the symbol $, with
components «ft (i = 1, 2, 3, 4). In the spinor representation, it is a bispinor:
M£>
(21.1)
But the independent components of ty can equally well be taken as any linearly
independent combinations of components of the spinors £ and ij.t We shall
arbitrarily limit the acceptable linear transformations by the one condition of
unitarity; such transformations leave unchanged the bilinear forms constructed
from t/r and ifr* (§28).
In the general case of an arbitrary choice of the components of </*, Dirac's
equation can be put in the form
t For brevity, the four-component quantity *ft will be referred to as a bispinor even in non-spinor
representations.
76
§21
Fermions
where y*1 (/x = 0, 1, 2, 3) are certain four-rowed matrices (Dirac matrices). We shall
usually write this equation in a symbolic form, omitting the matrix indices:
(yp-m).// = 0,
(21.2)
where
=
Po7°-P'7
y=
(y\y2,yi)-
yp = 7"PM
For example, the spinor form of the equation with the components of $ as in
(21.1) corresponds to the matricest
*•-(?*). - e i ) .
as is easily seen by writing the equations (20.5) as
\Po-p-o-
0
AT)/
\T?/
and comparing with (21.2).
In the general case, the matrices y need satisfy only conditions ensuring that
p 2 = m2. To find these conditions, we multiply equation (21.2) on the left by yp:
(y*P»){yvPM
= m(pM7^)i/f - mV.
Since p$v is a symmetrical tensor (all the operators pM commute), this equation
may be rewritten
and we must therefore have
y»yv + yvy» = 2g»\
(21.4)
Thus all the pairs of different matrices y*1 anticommute, and their squares are
(71)2 = (72)2 = (73)2 = - 1 ,
(7°)2 = 1.
(21.5)
Under an arbitrary unitary transformation of the components ty\ ip' = Lty, where
t Here and below, we use a compact two-rowed notation for four-rowed matrices. Each symbol in
(21.3) represents a two-rowed matrix.
§21
77
The Symmetrical Form of Dirac's Equation
17 is a unitary four-rowed matrix, the matrices y are transformed as follows:
y'=UyU-x=UyU\
(21.6)
so that the equation ( y p - m ) ^ = 0 becomes (y'fi - m)ty' = 0. The commutation
relations (21.4) remain unchanged, of course.
The matrix 70 (21.3) is Hermitian, and the matrices y are anti-Hermitian. These
properties are preserved under any unitary transformation (21.6), and we therefore
always havet
7+ = " 7 ,
7°+ = 7°.
(21.7)
The equation for the complex-conjugate function </f* may also be given. Taking
the complex conjugate of equation (21.2) and using the properties (21.7), we obtain
(-poyo-p"y-m)\ff*
= 0.
We commute $* by 7 ^ * = ^*y't and then multiply the whole equation on the right
by 70; since 77° = - 7°Y, we have in terms of a new bispinor
$ = 4,*y>,
4,* = ^ y
(21.8)
the result
$(yp + m) = 0.
(21.9)
As in (20.11), the operator p is here taken to act on the function to its left. The
function «/> is called the Dirac conjugate (or relativistically conjugate) function to «/>.
The factor 70 in its definition signifies that (in the spinor representation) it
interchanges the spinors £* and 17*; thus, in ^ = (TJ*, £*) the first spinor is undotted
(as in $) and the second is dotted. For this reason $ is a more natural "partner" of
tjf than if/* is; they appear together, for instance, in various bilinear combinations
(see §28).
The inversion transformation for the wave function may be written as
P:
«^-MW,
«Ä = -i«Ä7°.
(21.10)
In the spinor representation of iff, the matrix 70 interchanges the components £ and
T), as should happen on inversion. The invariance of Dirac's equation under the
transformation (21.10) in the general case is immediately obvious: changing p into
- p and ifß into 17V in equation (21.2), we have
(po7° + p - 7 - m ) 7 V = 0.
Multiplying this equation on the left by 70 and taking into account the fact that 70 and 7
anticommute, we return to the original equation.
t These equations may be written jointly in the form
78
§21
Fermions
Multiplying the equation (yp-m)ifj = 0 on the left by i£, and the equation
i//(yp + m) = 0 on the right by </>, and adding, we obtain
<Â7M(PM<Â) + (/v/0yM</> = (P^y^)
= 0,
where the parentheses indicate the function on which the operator p acts. This
equation is in the form of an equation of continuity, dj* = 0, so that
= (</,*</,, if,*y°yil,)
(21.11)
is the particle current density 4-vector. Its time component j°= if/*ip is positivedefinite.
Dirac's equation may be put in the form of an expression for the time
derivative:
idtfr/3* = ]fy,
(21.12)
where H is the Hamiltonian of the particle.t To obtain this form, we need only
multiply equation (21.2) on the left by y°. The resulting expression for the
Hamiltonian is
H = a - p + ßm,
(21.13)
where
<*=7°Y,
ß = 7°
(21-14)
is the customary notation for the matrices concerned.
It may be noted that
a«ak + aka, = 2fiikf
ßa + aß = 0,
ß 2 = 1,
(21.15)
i.e. all the matrices a, ß anticommute and their squares are unity; they are all
Hermitian. In the spinor representation,
«■(; -:> H : DIn the limit of small velocities the particle must be described, as in the
non-relativistic theory, by a single two-component spinor: on taking the limit p-»0,
e-^m in equations (20.5), we find £ = TJ, SO that the two spinors which form the
bispinor are equal. This, however, reveals a defect of the spinor form of Dirac's
t For a particle with spin zero, the wave equation was not capable of being written in this form: the
equation (10.5) for the scalar iff is of the second order in the time, while the first-order equations (10.4)
for the five-component quantity (i//, ^M) contain the time derivatives of only some of the components.
i
§21
The Symmetrical Form of Dirac's Equation
79
equation: in the limit, all four components of \jj are non-zero, although only two of
them are really independent. A more convenient representation of the wave
function \p would be one in which two of its components were zero in the limit.
Accordingly, we replace £ and TJ by linear combinations <f> and \ :
(21.17)
Then x = 0 for a particle at rest. This will be called the standard representation of
tp. On inversion, <f> and x a r e transformed as follows:
P:
<t>-+i<t>, X-+-ÎX-
(21.18)
The equations for 4> and \ are obtained by adding and subtracting equations
(20.5):
F
1
- PoX + P * <*4> = ™X- I
(21.19)
Hence we see that the standard representation corresponds to the matrices
Since the first and second components of £ and TJ are added separately in
(21.17), the components ty\ and fa correspond to the spin component eigenvalue +2
in both the standard and the spinor representation, and fa and ^4 to -{. In both
representations, therefore, the matrix 2X, where
= (0 a
(21.21)
in a three-dimensional spin operator: when &z acts on a bispinor containing only
the components >/M, fa, o r ^2, fa, this bispinor is multiplied by +2 or -{. In an
arbitrary representation, (21.21) may be written in the form
2 = - a 7 5 = -\iaxa;
(21.22)
the definition of 7 s is given in (22.14) below.
PROBLEMS
PROBLEM 1. Find the formulae giving the transformations of the wave function under an infinitesimal Lorentz transformation and an infinitesimal three-dimensional rotation.
80
Fermions
§22
SOLUTION. In the spinor representation of ifr, an infinitesimal Lorentz transformation gives
£' = ( l - l a - 5 V ) £
V = (1+^-8V)T,;
see (18.8), (18.8a)i (18.10). These formulae may be combined as
«KMl-ia-SVW.
(1)
Similarly, the transformation under an infinitesimal rotation is
♦' = (l + lil-8ew.
(2)
In this form the results are valid for any representation of ^ if a and X are matrices in that
representation.
It is easily verified that the matrices a and X are the components of an antisymmetric "matrix
4-tensor",
^ » k r V - r V ) = («.«);
the components are arranged as shown in (19.15). Using also the infinitesimal antisymmetric tensor
SeM" = (8V,Se), we have
<r^8e^ = 2 i ï - S e - 2 a - 8 V ,
and formulae (1) and (2) above may be combined as
♦'«(l+ia^fic^)^
(3)
PROBLEM 2. Write Dirac's equation in a representation such that it contains no imaginary
coefficients (E. Majorana, 1937).
SOLUTION. In the standard representation, the only imaginary quantities in the equation
are the matrices ay and iß. These may be eliminated by a transformation ty' = Ity such that the
imaginary matrix ay and the real matrix ß are interchanged. This is achieved by putting
then ai = UaxU = - ax, ay = ß, ai = - az, ß' = ay, and Dirac's equation becomes
in which all the coefficients are real
§ 22. Algebra of Dirac matrices
In calculations using Dirac's equation, the matrices y occur repeatedly without
reference to their specific form in any particular representation. The rules of operation
with these matrices are entirely given by the commutation relations
y Y + 7 Y = 2g^
Ox, v = 0,1,2,3),
which determine all their general properties.
(22.1)
§22
81
Algebra of Dirac Matrices
In this section we shall give various formulae and rules of the algebra of these
matrices which are useful in such calculations.
The "scalar product" of the matrices y with themselves is g^y^y" = 4. For
brevity we use the notation y^ = g^y" by analogy with the covariant components
of 4-vectors. Then
(22.2)
7*7* = 4.
If the matrices yM and y* are separated by one or more factors 7, then they can be
brought to adjoining positions by one or more interchanges using the rule (22.1), and
the summation over pt is then carried out by means of (22.2). This yields the formulae
7*7 V = -27"»
7M7 A 7V=4g A *,
(22.3)
= -27P7V>
7*7*7 W
7MYVYPYV
= 2(?VyV + 7V7V).
The factors y*\ etc., usually appear in combination with various 4-vectors as
"scalar products" with the latter,t
(22.4)
y a = 7**a^.
For such products, formulae (22.1) become
(ay)(by) + (by)(ay) = 2(ab)A
(ay)(ay) = a2,
(22.5)
J
and formulae (22.3) become
7^(07)7" = -2(07),
y„{ay){by)y>t
= 4(ab),
i
7*(û7)(&7)(c7)7' =
yMyXbyXcyXdyW
-2(cy)(by)(ay),
= 2[(dy)(ay)(by)(cy)
+
(22.6)
(cy)(by)(ay)(dy)].\
A frequent operation is taking the trace of the product of a number of
matrices y. Let us consider the quantities
J-MIMI•••»*■ sitr(7* t , 7 M 2 • • • 7*1").
(22.7)
On account of a familiar property of the trace of a product of matrices, this tensor
is symmetrical with respect to cyclic permutations of the indices /LL(, j x 2 , . . . , /iR.
t In this edition, no special notation is used for such products. Letters with circumflexes or with strokes
through them are often found with this meaning in the literature.
82
§22
Fermions
Since the matrices 7 have the same form in any frame of reference, the
quantities T are also independent of this frame, and they therefore form a tensor
which can be expressed entirely in terms of the metric tensor gM„ which h^s this
property.
From the tensor gMI/ of rank two, however, only tensors of even rank 0an be
constructed. Hence it follows immediately that the trace of the product of any odd
number of factors 7 is zero. In particular, the trace of each 7 is zero:t
tr 7* = 0.
(22.8)
The trace of a unit four-rowed matrix (which is implied on the right-hand side of
the commutation rule (22.1)) is 4. Thus, if we take the trace of both sides of (22.1),
we find
T14" = g91".
(22.9)
The trace of the four-matrix product is
T^vp
= gA*g,p
_ g A, g MP
(22.10)
+ gkpgv»m
This formula may be derived, for instance, by "pulling" the factor yA in tr 7A7*7,'7P
to the right by means of the relation (22.1); after each interchange one of the terms
in (22.10) appears:
and so on. After all the interchanges there remains on the right -T^vpk = -Tk$lvp9
which we take to the left-hand side. The trace of a product of six 7 can similarly be
reduced to the traces of four-factor products, and so on. For instance,
yA^vperr
=
g^f
V
P^ _ g * vj* P-fxrr ^ „ A p y / x ^ __ Qk<rT ^vfn + gkr'J'fiVpCr
( 2 2 1 1)
All the traces TXM are real, and they are non-zero only if each of the matrices
7°, y\... appears in the product an even number of times; both these results are
obvious from the above formulae. Hence we easily find that the trace is unchanged
when the order of the factors is reversed:
yAji..p<7
=
yap.../!*
( 2 2 \2)
As already mentioned, the factors 7 usually appear as "scalar" products with
various 4-vectors. In such cases, formulae (22.9) and (22.10), for example, become
ïtr(ay)(by) = ab,
}
i tr(ay)(by)(cy)(dy) = (ab)(cd) - (ac)(bd) + (ad)(bc). J
(22.13)
t The trace of a matrix is invariant under the transformations 7 = UyU~l. Thus the result (22.8) is
also evident from the expressions (21.3) for the matrices.
I
§22
83
Algebra of Dirac Matrices
The product 7°7 I 7 2 7 3 is of particular importance. There is a special notation for
it which is customarily used:
(22.14)
y*= -iy*y*y2y\
It is easily seen that
(75)2=1,
7 V + y V = 0,
(22.15)
i.e. the matrix y5 anticommutes with all the y*. For the matrices a and ß, the rules
are
,<*75 - 7 5 a = 0,
ßy5 + ysß = 0;
(22.16)
the commutability with a follows because a = 7°7 is a product of two matrices y*\
The matrix 7 5 is Hermitian, since
and hence
75+ = 7 5 ,
(22.17)
because the sequence 3210 is changed to 0123 by an even number of transpositions.
The form of this matrix in two particular representations is:
5 /-I
r = ( 0
spinor
0
5
standard
7 = (_ 1
0\
ly)>
(22.18)
-1\
ftV
The trace of the matrix y5 is zero:
tr7 5 = 0,
(22.19)
as can be seen directly from (22.18). The traces of the products 757^7V are also
zero. For the products of y5 with four factors 7^ we have
1 tr
7
5
7
V
7
teAw
(22.20)
N A = ek™ a,bvc<»
(22.21)
V =
Another formula is
7N
= iy5(ya)(yb)(yc),
which is valid for mutually normal 4-vectors a, b, c:
ab = ac = be = 0.
§23
Fermions
84
In some cases (for problems involving non-relativistic particles), it may be
necessary to calculate the traces of products which involve 7 0 and the threedimensional "vector" 7 separately. The only non-zero traces are those of products
containing even numbers of factors y° and 7. All the factors y° become unity, and
the traces of products with two and four factors 7 are respectively
4 tr(a • 7Kb • 7) = - a • b,
.}
J tr(a • 7)(b • 7)(c • Y)(d • 7) = (a • b)(c • d) - (a • c)(b • d) + (a • d)(b • c).
§23.
(22.22)
Plane waves
The state of a free particle having definite values of the momentum and energy
is described by a plane wave which may be written in the form
*'=vh)u>ripx-
(23
-°
The suffix p indicates the value of the 4-momentum; the wave amplitude up is a
suitably normalized bispinor.
In proceeding with second quantization we need
not only the wave functions (23.1) but also functions with a "negative frequency",
which arise in the relativistic theory because of the two-valuedness of the square
root ±V(p 2 + m2), as shown in §11. As in §11, we shall always take e to be the
positive quantity e = +V(p 2 + m2), so that the "negative frequency" is —e; on
changing also the sign of p, we obtain a function which may naturally be called ^/-p:
+-p=vh)u-pe'px-
(23 2)
-
The significance of these functions will be explained in §26; here we shall write
parallel formulae for i//p and i//_p.
The components of the bispinor amplitudes up and u-p satisfy the algebraic
equations
(yp-m)up
=0, }
(7P + m)u-p = 0 ,
(23.3)
which are obtained by substituting (23.1), (23.2) in Dirac's equation (this is
equivalent to replacing the operator p in that equation by ±p).t The relation
p 2 = m2 is then the condition for each such pair of equations to be compatible. We
t There are also similar equations obtained from Dirac's equation (21.9) for the complex-conjugate
function:
üp(yp-m)
= 0y
ü-p(yp + m) = 0.
(23.3a)
85
Plane Waves
§23
shall always normalize the bispinor amplitudes by the invariant conditions
üpUp = 2m,
(23.4)
ü-pU-p = - 2 m ,
where the bar over a letter denotes, as usual, Dirac conjugation: ü = u*y°. Multiplying
equations (23.3) on the left by ü±p, we obtain (ü±pyu±p)p = 2m2 = 2p2, whence
üpjUp = ü-p-yu-p = 2p.
(23.5)
It may be noted that the change from the formulae for up to those for u-p is made
by changing the sign of m.
The current density 4-vector is
J = ö±Py*l>±p = 27 û±pyu±p = pie,
(23.6)
i.e. j*1 = (1, v), where v = p/e is the velocity of the particle. Hence we see that the
functions 4ip are "normalized to one particle in the volume V = 1".
Equations (23.3) show that the components of the wave amplitude are related,
but the actual form of the relations depends, of course, on the specific representation of $. For the standard representation they are found as follows.
From equations (21.19) we have, for a plane wave,
(e-m)4>-p'<rx=0,)
(23.7)
(e + m)x - p • o"4> =0. J
From these we find the relation between <j> and x in two equivalent forms:'
4>=J^L*,
x=J~<t>;
(23.8)
their equivalence is evident on multiplying thefirstform on the left by p • cr/(e + m)
and using the results (p • cr)2 = p2 and e 2 - m 2 = p2, which gives the second form.
The common factor in 4> and x *s chosen to satisfy the normalization condition
(23,4). Thus we obtain for up, and correspondingly for M-p, the expressions
9
\V(e -m)(n- a)w/
p
\ V ( e + m)H>
/
the second formula is obtained from the first by changing the sign of m and
replacing w by (n • a)w\ Here n is a unit vector in the direction of p, and w is an
arbitrary two-component quantity subject only to the normalization condition
w*w = 1.
(23.10)
§23
Fermions
86
For ü = u*y (with 7 from (21.20)) we have
üP = (V(e + m)w*,-V(e - m)u>*(n • a)),
w_p = (V(e - m)w'*(n • a), - V(e + m)w'*)J
(23U)
and multiplication shows that in fact ü±pu±p = ± 2m.
In the rest frame (e = m), we have
up = V(2m)(£),
«-p = V(2m)(°,),
(23.12)
i.e. w is the three-dimensional spinor to which the amplitude of each wave reduces
in the non-relativistic limit. In the bispinor w_p, the first two components, not the
second two, vanish in the rest frame. This property of solutions of Dirac's equation
having ''negative frequencies1' is evident, since by putting p = 0 and replacing e by
-m in (23.7), we find </>=0.t
The amplitude of the plane wave contains one arbitrary two-component quantity. Thus, for a given momentum, there are two different independent states,
corresponding to the two possible values of the spin component. But the spin
component along an arbitrary z-axis cannot have a definite value. This is evident
because the Hamiltonian of a particle with definite p (i.e. the matrix H = a • p +
ßm) does not commute with the matrix 2 2 = -iotxOCy In accordance with the general
conclusions of §16, however, the helicity A (the component of the spin in the
direction of p) is conserved: the Hamiltonian commutes with the matrix n • 2.
Helicity states correspond to plane waves in which the three-dimensional spinor
w = w(A)(n) is an eigenfunction of the operator n • a:
\(n • a)w(A) = Aw(A).
(23.13)
The explicit form of these spinors is
(23.14)
w
{eKosie r J
where 0 and <f> are the polar angle and the azimuth of the direction of n relative to
fixed axes xyz.t
Another possible choice of the two independent states of a free particle with
given p, which is simpler but less clear, corresponds to the two values of the
z-component of the spin in the rest frame, which we denote by a. The spinors are
wl"b =(o)>
w(<7=4
=(i)-
<23-15>
t In the spinor representation £ = - TJ, instead of £ = TJ as in the rest frame for solutions having
positive frequencies"
t The solution of equation (23.13) can be multiplied by any phase factor, because of the possibility
of an arbitrary rotation about the direction of n.
t4
§23
87
Plane Waves
As the two linearly independent solutions with "negative frequency" we take plane
waves in which the three-dimensional spinors are
>v(<7)' = - <r,w(_<r) = 2<riwla);
(23.16)
the significance of this choice will be shown in §26.
We can also find a representation of a plane wave such that in any frame of
reference (not only in the rest frame) the wave has only two components corresponding to definite values of the same physical property—the spin component in
the rest frame (L. Foldy and S. A. Wouthuysen, 1950).
Starting from the amplitude up (23.9) in the standard representation, we seek a
unitary transformation to such a representation in the form
u'p=Uup,
U=
ewf\
where W is real; since y* = - 7 , it follows that U+ = I/"1. Expanding in series and
noting that (7 • n)2 = - 1 , we put
U = cos W + 7 • n sin W;
cf. the derivation of (18.14) from (18.13). The condition that the second pair of
components in the transformed amplitude u'p should be zero gives
tan W = |p|/(m + e),
so that
,,
=
m + g+(7-n)|p|
V[2£(e + m)]
In the new representation,
^ = V(2e)(£).
(23.17)
The Hamiltonian of the particle in this representation is
H' = U(a • p + j3m)ir* = ße,
(23.18)
where all the matrices ß,a and 7 belong to the standard representation. This
Hamiltonian commutes with the matrix
which is, in the new representation, the operator of a conserved quantity, the spin
in the rest frame.
88
Fermions
§24
§ 24. Spherical waves
The wave functions of states of a free particle (with spin 2) having definite
values j of the angular momentum are spinor spherical waves. To determine their
form, let us first state the corresponding formulae of the non-relativistic theory.
The non-relativistic wave function is a three-dimensional spinor
H»
For a state having definite values of the energy e (and therefore of the momentumt
p), the orbital angular momentum I, the total angular momentum j and its component m, the wave function is
(// = i?p/(r)nj/m(0,(/)).
(24.1)
The angular functions Üjim are three-dimensional spinors whose components (for
the two values j = / ± 2 which are possible for a given J) are
Ql+\Xm —
(24.2)
£ll-{Xm —
(see QM, §106, Problem). We shall call the nj(m spherical harmonic spinors. They
are normalized by the condition
j n ^ m H i ' i ' m ' do -
8jj'8||'ô m m '.
(24.3)
The radial functions Rpi are the common factor in the two components of the
spinor «//, and are given by
(24.4)
(QM, (33.10)). They are normalized by the condition
f r2RplRp,dr = 2TT8(p'-p).
Jo
t In this section, p denotes |p|.
(24.5)
Spherical Waves
§24
89
Returning now to the relativistic case, let us note first of all that separate laws
of conservation of spin and orbital angular momentum do not exist for a moving
particle: the operators s and I do not separately commute with the Hamiltonian.
But the parity of the state is still conserved (for a free particle). The quantum
number / therefore no longer refers to a definite value of the orbital angular
momentum, but it defines the parity of the state (see below).
Let us consider the required wave function (bispinor) in the standard representation:
-
(
!
)
■
Under rotations, <f> and \ behave like three-dimensional spinors. Their angular
dependence is therefore given by the same spherical harmonic spinors [lilm. Let
4> a Hjim, where I is a certain one of the two values j + \ and j - [. Under inversion
<Mr)-»i<M-r) (see (21.18)), and iljlm(-n) = (-\ynilm(n), so that
*(r)-W(-l)'<Mr).
The components #(r), under inversion, become - i ^ ( - r ) . In order that the state
should have a definite parity (i.e. that all the components should be multiplied by
the same factor on inversion), it is therefore necessary that the angular dependence
in x should be given by the spherical harmonic spinor CïjVm with the other of the
two possible values of /; since ,these two values differ by 1, (—l)r = - ( - 1 ) ' .
The radial dependence of cf> and x w*ll be given by the same functions Rpl and
Rpi> (with the values of I and /' which give the order of the spherical harmonics in
iljim). This is clear because each component of \\t satisfies the second-order
equation ( p 2 - m2)i// = 0, which for a given value of |p| becomes
(A + p2)i// = 0,
and this is formally identical with Schrödinger's non-relativistic equation for a free
particle.
Thus
(f> = AK p I fl i I m f
x = BRpVCliVm9
(24.6)
and it remains to determine the constant coefficients A and B. To do so, we
consider a distant region, where the spherical wave may be regarded as a plane
wave. According to the asymptotic formula (QM, (33.12)),
Kl
Ä
^{e K p f -^ n - e'l{pr-^%
(24.7)
sa that </> is the difference of two plane waves propagated in the directions
± n (n = r/r). For each plane wave, by (23.8),
X = —?—(±n • a)<£.
e+m
90
§24
Fermions
From the previous results (formulae (24.6)) it is obvious that (n • oOftjim = an jrm ,
where a is a constant. This constant is easily found by comparing the values of the
two sides of the equation when m = 2 and n is along the z-axis. Using (7.2a), we find
(n-a)[lHm = ï'-l[ln.m.
(24.8)
These formulae, on comparison with (24.6), show that
e+m
Finally, the coefficient A is determined by the normalization of i/r. If this is
specified by
j Vtnmtprrm- d'x = 2irô0.ô,rôMM.ô(p - p'),
(24.9)
we have
* * = 7 ^ Wfc-mUWW'
' =2i~l
(24 l0)
'
Thus, for given values of j and m (and of the energy e) there exist two states
differing in parity. The parity is uniquely defined by the number /, which takes the
values j ± i : on inversion, the bispinor (24.10) is multiplied by i(-l) 1 . The components of this bispinor, however, contain spherical harmonics of both orders I and
J', showing that the orbital angular momentum has no definite value.
When r->°o, the spherical waves (24.7) may be regarded as plane waves in any
small region of space, with momentum p = ± pn. It is therefore clear that the wave
functions in the momentum representation differ from (24.10) essentially only in
that the radial factors are absent and n denotes the direction of the momentum.
In order to make a direct change to the momentum representation, we must
carry out a Fourier transformation:
<Mp')= [<Kr)e~,y rd3x.
(24.11)
The integral is calculated by means of the expansion of a plane wave in spherical
waves:
With an expansion of this kind for e~'pr in (24.11) and using (24.5), we find that the
Fourier components of the function
<Kr) = J*p,e)fliim07r)
§25
91
The Relation Between the Spin and the Statistics
are
W)
= <^-s(p'-p)r'y,„ (£)/uita,(;) Yt(^)do.
The integral is equal to the coefficient of the spherical harmonic function in the
definition (24.2) of the spherical harmonic spinors, and together with the factor
yimip'lp') it yields the same spherical harmonic spinor, but with argument p'lp':
^(p') = ^ ô ( p ' - P ) i - ' n ) l m ( ^ ) .
Applying this result to the bispinor wave function (24.10), we obtain the
momentum representation
,
^
/ r V(e + m)i-'n i t w (p7p , )\
x(n> n\ I2??
< / W P ) = «<P - P ) ^ 7 ^ ( V ( e _ m)r >a, m (p'lp'))-
m n
^
(24 13)
"
The states \pjlm) are the same as the states |pjm|A|) (with |A| = 2) discussed in
§16: both have definite values of pjm and the parity. The spherical harmonic spinors
Qjim are therefore related in a certain way to the functions Di'J, (both with argument
pip). When p -*0, the wave functions (24.13) reduce to the three-dimensional spinors
n jim , the parity of which is P = T J ( - 1 ) ' (where rj = i is the "internal parity" of the
spinor). A comparison with the results of §16 gives the formula
< V = il yjy^
( v v ( ^ \ m ± w*D|«i)
(24.14)
where I = j1+5, and the w(A) are the three-dimensional spinors (23.14).
§25. The relation between the spin and the statistics
The second quantization of a field of particles with spin { (a spinor field) is
carried out in a similar way to that of a scalar field in §11.
Without repeating the arguments, we shall immediately write down expressions
for the field operators, which are exactly analogous to (11.2):
* = 2 V(kj ((V"^~iPX + ^>P^eipx\
â
$ - * V = 2 V(^j ( >p«
eipx +
(25.1)
M-P.-*«-**);
the summation is over all values of the momentum p and over or = ± |. The
antiparticle annihilation operators bp<7 (like the particle annihilation operators d pa )
appear as the coefficients of functions whose coordinate dependence ( e , p r )
§25
Fermions
92
corresponds to a state having momentum p.t
To calculate the Hamiltonian of the spinor field, it is not necessary to determine
the energy-momentum tensor (as we did for the scalar field), since in this case
there exists a particle Hamiltonian which can be used to derive the wave equation
(Dirac's equation) (21.12). The mean energy of the particle in a state with wave
function i// is the integral
= iffc°%d>x.
(25.2)
It should be noticed that the "energy density" (the integrand) is here not a
positive-definite quantity.
Replacing the functions i// and ^ in (25.2) by (//-operators, using the orthogonality of the wave functions with different p or a, and also using the relation
ü±p(Ty°u±p(T = le for the wave amplitudes, we obtain the field Hamiltonian in the
form
H = 2 e(dpadp<7 - bp(Tbpa).
(25.3)
Hence it is seen that in this case Fermi quantization must be used:
{âpa, dpa}+ = 1,
{bpaj bpa}+ = 1,
(25.4)
and all other pairs of operators d, d \ b, b + anticommute (see QM, §65), since then
the Hamiltonian (25.3) may be written
H = 2
^(dpadpcx + bpcrkpa -
1),
and the energy eigenvalues are (with the usual omission of an infinite ad'ditive
constant)
E = 2 s ( N p . + N p(7 ),
(25.5)
p.*
and are positive-definite, as they should be. With Bose quantization, we should
obtain from (25.3) the eigenvalues
2 e(Npa-Np(T),
which are not positive-definite and have no meaning.
t The two functions also correspond to the. same value a of the spin component in the rest frame;
for the functions i/r-Pl-<, this will be proved in §26 (see (26.10)).
§25
The Relation Between the Spin and the Statistics
93
An expression analogous to (25.5) is obtained for the momentum of the system,
i.e. the eigenvalues of the operator / i//*p*//d3jc:
P = E P ( N P . + NP(7).
(25.6)
/" = $y"h
(25.7)
p,cr
The 4-current operator is
and the "charge" operator of the field is found to be
Q = [ $y°4>d3x
P,<7
= S (â;aâpa - 6la6pa + 1);
(25.8)
p , <j
its eigenvalues are
Q = I(NprT-NpJ.
(25.9)
P,<7
Thus we again arrive at the concept of particles and antiparticles, and the whole
of the discussion of these in §11 is applicable.
But particles with spin \ are fermions, whereas those with spin zero are bosons.
An examination of the formal origin of this difference shows that it is due to the
different nature of the expressions for the "energy density" in the scalar and spinor
fields. In the scalar field the expression is positive-definite, and the terms d + d and
bb* therefore both have a positive sign in the Hamiltonian (11.3). If the energy
eigenvalues are positive, the replacement of bb* by b*b must occur without change
of sign, i.e. in accordance with the Bose commutation rule. For the spinor field,
however, the "energy density" is not a positive-definite quantity, and hence the
term bb* appears with the minus sign in the Hamiltonian (25.3); to obtain positive
eigenvalues, the replacement of bb* must be accompanied by a change of sign, i.e.
must occur in accordance with the Fermi commutation rule.
The form of the energy density is directly related to the transformation
properties of the wave function and to the requirements of relativistic invariance.
In this sense we may say that the relation between the spin and the statistics
obeyed by the particles is likewise a direct consequence of these requirements.
The fact that particles with spin 2 are fermions also leads to a general conclusion: all particles with half-integral spin are fermions, and those with integral
spin are bosons (including those with spin zero, as demonstrated in §ll).t
t The origin of the relation between the spin of a particle and the statistics which it obeys was first
elucidated by W. Pauli (1940).
94
Fermions
§26
This is evident because a particle with spin s may be regarded as "composed"
of 25 particles with spin \. When s is half-integral, 2s is odd; when 5 is integral, 2s
is even. A "composite" particle containing an even number of fermions is a boson,
and one containing an odd number of fermions is a fermion.t
If a system consists of particles of various kinds, then creation and annihilation
operators must be defined separately for each kind of particle. The operators
pertaining to different bosons, or to bosons and fermions, commute. Operators
pertaining to different fermions may be regarded, in the non-relativistic theory, as
either commuting or anticommuting (QM, §65). In the relativistic theory, which
allows transformations of particles into one another, the creation and annihilation
operators of different fermions must be regarded as anticommuting, like those
which pertain to different states of the same fermions.
PROBLEM
Find the Lagrangian of the spinor field.
SOLUTION. The Lagrangian corresponding to Dirac's equation is given by the real scalar expression
L = ii(^7MaM0 - 3 ^ • yMi//)-miM/.
(1)
Taking the components of ip and ip as the ''generalized coordinates" q, we easily see that the
corresponding Lagrange's equations (10.10) are the same as Dirac's equations for tp and ip. The overall
sign of the Lagrangian (like the common factor in it) is arbitrary. Since L involves the derivatives of ip
and îp liqearly, the action S = f L dAx can in any case have no minimum or maximum. The condition
8S - 0 here gives only a stationary point of the integral, not an extremum.
The Lagrangian of the spinor field is obtained by replacing ^ in (1) by the operator i//. Applying
formula (12.12) to this Lagrangian, we find the current operator (25.7).
§ 26. Charge conjugation and time reversal of spinors
The coefficients typ<T = up<Te~ipx which appear with the operators â^ in (25.1) are
the wave functions of free particles (electrons, say) having momenta p and
polarizations cr:
The coefficients i/>-p,-<7 of the operators b^ are to be regarded as the wave functions
of positrons having the same p and a. It is found, however, that the electron and
positron functions are expressed in different bispinor representations. This is
evident from the fact that \p and ip differ in their transformation properties and
their components satisfy different sets of equations. To eliminate this defect, it is
necessary to carry out a certain unitary transformation of the components i/>_p,-a,
t In this argument it is assumed that all particles with the same spin obey the same statistics
(whatever the way in which they are "compounded"). The truth of this assumption is seen by
analogous arguments. For example, if there existed fermions with spin zero, then a fermion with spin
zero and one with spin 1 would yield a particle with spin L which would be a boson, in contradiction with
the general result demonstrated for spin 5.
§26
95
Charge Conjugation and Time Reversal of Spinors
such that the new four-component function satisfies the same equation as i//p<7.t This
will be referred to as the wave function of the positron (with momentum p and
polarization a). Denoting the matrix of the required unitary transformation by ITC,
we may write
(26.1)
¥tt=Uc*-P^
The operation C whereby this function is obtained from ^-p-a is called charge
conjugation of the wave function (H, A. Kramers, 1937). This concept is, of
course, not restricted to its application to plane waves: for any function ^, there
exists a "charge-conjugate" function
(26.2)
ÙW9T)=Uc$(t9T),
which has the same transformation properties as if/ and satisfies the same equation.
The properties of the matrix Uc follow from this definition. If i/> is a solution of
Dirac's equation (yp - m)ty = 0, then ifj satisfies the equation
i//(yp + m) = 0,
or
(yp + m)^ = 0.
Multiplying this equation on the left by £/<>
Ucypiï + mUcîp = 0,
we apply the condition that the function Uc$ satisfies the same equation as i//:
(yp - m)Uc4f = 0.
A comparison of the two equations gives the following "commutation relation"
between Uc and the matrices y^it
UcV = - y ' l / c .
(26.3)
We shall further suppose that the wave functions are stated in the spinor or
standard representation; the general case of any representation will be considered
only at the end of this section. In these representations, we have
(y o...3 ) * = y c..3 f
7 >*
= -y.j
(26 4)
'
Then the condition (26.3) is satisfied by the matrix Uc = T)C7 2 7°. the constant TJC
being arbitrary. The condition C2 = 1 shows that |TJC|2= 1, and the matrix Uc is
t For particles with spin zero, this problem did not arise, since the scalar functions ^ and iff* satisfy the
same equation, and i//*p is identical with i/*p.
t From this there follows also the equation
UcY =y5Uc.
(26.3a)
Fermions
96
§26
therefore determined apart from a phase factor. We shall take t]C = 1 ; thus
Uc = y2y°=-ccr
(26.5)
Noting also that $ = i//*7° = yV* = YV*> we may write the effect of the operator C
as
Cty = y2y°«Ä = y V -
(26.6)
The explicit form of the transformation (26.6) for the spinor representation is
C: r-+-«V*>
Tfc->-£*,
(26.7a)
T»*--ir*.
(26.7b)
or, equivalently,
C: Ê.--WIÎ.
The charge-conjugation transformation for the plane waves $±p<r is easily
carried out by using the explicit expressions (23.9) for the plane waves and the
matrix Uc in the standard representation:
VC-(_1 7 ) .
(26.8,
'y
Since
(Tv&
p
=
" ~ — crcr
~ ~ ~ yv >
T
we have, if w(or)' is defined as in (23.16),
UcÜ-p-a
= Upay
UCU-pt-<r =
fipcr.
(26.9)
Thus
C(A-p^ = (/rpa,
(26.10)
so that the functions t/>-p - a which appear with the operators b?a in the (//-operators
(25.1) do in fact correspond to states of a particle having momentum p and
polarization or. We see also that the electron and positron states are described by
the same functions:
C = VS = </w
This is to be expected, since the functions ifßp(T embody information only as to the
momentum and polarization of the particle.
The operation of time reversal may be treated similarly. When the sign of the
time is changed, the wave function must change to the complex conjugate. In order
§26
97
Charge Conjugation and Time Reversal of Spinors
to obtain the "time-reversed" wave function (Ti/0 in the same representation as the
original i//, we must also perform some unitary transformation on the components
of i//* (or i//). Thus the action of the operator T on ^ can be written, similarly to
(26.2), as
fi/i(t,r)=l7 T ^(-t,r),
(26.11)
where UT is a unitary matrix.
Dirac's equation satisfied by if/ is
( i y ° ^ + Î 7 - V - m ) ^ a r ) = 0,
and the equation for i£ is
In the latter equation we change t into -t and multiply on the left by - l / j :
{iU^°it
" IUT^ * V )* ( ~ f > r ) "" mUr4>(-U R) = 0.
We want the function l/r</>(-t, r) to satisfy the same equation as i//(f, r):
(iy°yt + iy • v ) l/ T ^(-t, r) - m[/^(-r, r) = 0.
Comparing the two equations, we find that the matrix UT must satisfy the
conditions
UTy° = y°UT,
UTy = -yUT.
(26.12)
In the spinor and standard representations, these conditions are satisfied by the
matrixt
UT = iy>yly°.
(26.13)
Thus the effect of the operator T is given by
fiKt, r) = iy*yxy°i(-t, r) = iyVi//*(~', r).
(26.14)
The explicit form of this transformation for the spinor representation is
T: É"->-iÉÎ,
T»«-MV*.
(26.15a)
t The choice of the phase factor in (26.13) depends on that in (26.5); see the second footnote to §27.
§26
Fermions
98
or
Ça-+iÇa*,
T:
T, a -*-«T,*.
(26.15b)
In the standard representation,
ü
* = (o
(26 16)
-",)•
-
To find the effect on if/ of all three operations P, T and C, we write successively
f ^ a r ) = -iy!yV*(-f,r),
PftKt, r) = i V ( f ^) = 7°y ! y V * ( - t . ~r),
CPfi//(r, r) = ? 2 (7VY V ) * = 7 W ? V ( ~ ' . " r),
or
CPfi//(t >r ) = i"7V(-t,-r).
(26.17)
In the spinor representation,
CPT:
r->-£a.
Î?Ô^ÎÎ|«,
(26.18)
as is also easily seen directly from the transformation rules (20.4), (26.7) and
(26.15).t
The expressions given above for the matrices Uc and UT assume the spinor or
standard representation of ifr. Let us finally see which properties of these expressions are retained for any representation of i//.
If \\ß is subjected to a unitary transformation:
V = Ity,
Y = UyU~\
& = *'*7°# = W* = t?l/"\
(26.19)
then in the new representation we have
( W = U(Cifß) = UUc$ = UUc(4>'U) =
UUcVi'.
A comparison with the definition of the matrix Uc in the new representation,
( W = U'd\ shows that
Ub = UUcÜ.
(26.20)
The transformation (26.20) is the same as that of the matrices 7 only if U is real.
The expression (26.5) also is therefore valid only in representations which are a real
transformation of the spinor or standard representation.
t The notation CPT implies that the operatorsfcact in the sequence from right to left. The sign of (26.17)
and (26.18) depends on this sequence, since T does not commute with C and P (as regards their action on a
bispinor).
§27
Internal Symmetry of Particles and Antiparticles
99
The matrix (26.5) is unitary, and changes sign when transposed:
l/cl/c=l,
Uc= -Uc.
(26.21)
These properties are unaffected by the transformation (26.20), and are therefore
retained in any representation. The matrix (26.5) is also Hermitian ( l / c = Uc), but
this property is in general not preserved by the transformation (26.20).
The above discussion and formulae (26.21) apply likewise to the properties of
the matrix l/T.
In the second quantization formalism, the transformations C, P. T for ^-operators
must be formulated as transformation rules for the particle creation and annihilation
operators. These rules can be established (as in §13 for particles with spin zero) from
the condition that the transformed ^-operators may be written
ftr)=UÀr),
]
P
iP {Ur) = iy°4,(U-r)A
(26.22)
T
4, (Ur)=UTk-t,r).\
PROBLEM
Find the charge-conjugation operator in the Majorana representation (§21, Problem 2).
SOLUTION. The matrix [/ein the Majorana representation is obtained from the matrix Uc = -ay in the
standard representation by the transformation (26.20) with U = (ay + ß)/V2, and is Uc= -a y (where ay
and ß denote matrices in the standard representation). If primes denote quantities in the Majorana
representation, we have Cty' = UH^ß')* and since ß' = ay,
Cip' — a y ((/f'*a y ) = aydyi/f'* = r//*,
i.e. charge conjugation is the same as complex conjugation.
§ 27. Internal symmetry of particles and antiparticles
The wave function of a particle with spin i, in its rest frame, is a single
three-dimensional spinor, which will be denoted by 4>a. The behaviour of this
spinor under inversion is related to the concept of the internal parity of the particle.
However, as mentioned in §19, although the two possible laws of transformation of
three-dimensional spinors (<î>a -* ± i<&a) are not equivalent, there is no absolute
significance in assigning a particular parity to a spinor. We therefore cannot speak
of the internal parity of any one particle with spin i but we can refer to the relative
internal parity of two such particles.
From two (three-dimensional) spinors 4>(1) and 4>(2), a scalar <&^W2)a can be
formed. If this is a true scalar, the particles described by the spinors are said to
have the same internal parity; if it is a pseudoscalar, they are said to have opposite
internal parities.
We shall show that particles and antiparticles (with spin 2) have opposite parities
(V. B. Berestetskiï, 1948).
Firstly, if the operation C (26.7) is applied to both sides of the P transformation
100
§27
Fermions
(19.5) (in the spinor representation)
P:
T,ô-MÊa,
r->»l«.
(27.1)
we obtain
where the index c marks the components of the bispinor
if/c = (
c
J charge-conjugate to i// = ( ) .
Taking the complex conjugate and interchanging the indices, we find
P:
T)a-+i'r\
r^ivl
(27.2)
Thus charge-conjugate bispinors are transformed in the same manner by inversion.
Let ip{e) be the wave function of a particle (say an electron) and ift(p) that of the
antiparticle (a positron). The latter is a'bispinor which is the charge conjugate of a
"negative-frequency" solution of Dirac's equation. In the rest frame, each function
becomes a three-dimensional spinor:
£<e>a = <«(.<) = <j><e>a
£<?><* = r){p) — 4>(p)a.
According to (27.1), (27.2), these spinors are transformed as follows by inversion:
<D a ^/<D a ,
(27.3)
the same for both <t>{€) and 3>(p). The product <ï>(°<I>(p), however, changes sign, and
this proves the result stated above.
A strictly neutral particle is one which coincides with its antiparticle (§12). The
(//-operator of a field of such particles satisfies the condition
if(r,r) = ifc(t,r).
For particles with spin 2, this implies the conditions (in the spinor representation)!
t=
-irT+,
Tfc- -ifr.
(27.4)
Like any relation which expresses a physical property, these conditions are
invariant under the transformation CPT.t It is easily verified that they are in fact
t In the Majorana representation, strict neutrality implies simply that the operator $' is Hermitian;
see §26, Problem.
t More precisely, the transformation CPT must here be defined so as to leave relations such as
(27.4) unchanged. This is achieved by an appropriate choice of the phase factor in the definition of the
matrix l/ T ; see the third footnote to §26.
§28
101
Bilinear Forms
invariant not only with respect to CPT but also with respect to each of the three
transformations separately.
In §19, inversion of spinors was defined to be a transformation for which
P 2 = - l , arid this definition has been used so far. The result derived above
concerning the relative parity of particles and antiparticles is easily seen to be, as it
should be, independent of the way in which inversion is defined.
If inversion is defined by the condition P 2 = 1, then (27.1) becomes
P:
r - > t | * . *»<--> f".
(27.5)
The charge-conjugate function is then transformed according to
HCa
Ç
__v
~+c
-> - T ) a ,
~+c
_^w
T)a~> - £
tca
,
which differs in sign from (27.5). Accordingly, the three-dimensional spinors <ï> will
be transformed thus:
<!><*■>* -► $ ( < ? ) a ,
<i> ( p ) a - * -
<j> ( p ) a ,
and the product <&{eWp) will again be a pseudoscalar.
The only possible difference in the physical consequences of the two views of
inversion is that with the definition (27.5) the condition for a strictly neutral field
would not be invariant under this transformation (or the transformation CP),
which would alter the relative sign of the sides of the equations (27.4). Actually, no
strictly neutral particle with spin \ is known, and we cannot yet say whether this
difference between the two definitions of inversion has any real physical meaning.t
PROBLEM
Find the charge parity of positronium (a hydrogen-like system consisting of an electron and a
positron).
SOLUTION. The wave function of two fermions must be antisymmetric with respect to simultaneous
interchange of coordinates, spins and charge variables of the particles (cf. §13, Problem). The
interchange of the coordinates multiplies the function by (-1)', that of the spins by (-1) I+S , where S (=0
or 1) is the total spin of the system, and that of the charge variables by the required parity C. The
condition
(-iy(-l) ,+s C = - l
gives C = ( - l) / + s . Since the internal parities of the electron and the positron are opposite, the spatial parity
of the system is P - ( - l ) m . The combined parity CP = ( - l ) s + 1
§ 28. Bilinear forms
Let us consider the transformation properties of various bilinear forms which
can be constructed from components of the functions ip and </**. Such forms are of
great importance in quantum mechanics. They include the current density 4-vector
(21.11).
t The incomplete equivalence of the two definitions of inversion was first noted by G. Racah (1937).
102
§28
Fermions
Since \\f and t//* have four components each, a total of 4 x 4 = 16 independent
bilinear combinations can be formed from them. The classification of the transformation properties of these is evident from the ways listed in §19 of multiplying
any two bispinors (in this case i// and i//*). We can form a scalar (denoted by S), a
pseudoscalar (P), a mixed spinor of rank two equivalent to a true 4-vector Vß (four
independent quantities), a mixed spinor of rank two equivalent to a 4-pseudovector AM
(four quantities), and a bispinor of rank two equivalent to an antisymmetric 4-tensor
TßV (six quantities).
In a symmetrical form (for any representation of i//), these combinations may be
written
S = <W,
P=
ih$^
(28.1)
where
<r»v = l(y V " 7 V ) = (<*> «X);
(28.2)
the components in (28.2) are stated as in (19.15).t All the expressions given above
are real.
The fact that S is a scalar and P a pseudoscalar is evident from their spinor
representations:
which agree with (19.7) and (19.8). The fact that the V*1 form a vector is then evident
from Dirac's equation: multiplying the equation fvyMi/> = mif/ on the left by i/vwe
obtain
M . 7 ^ = mijhjr.
Since the right-hand side is a scalar, so is the left-hand side.
The rule whereby the quantities (28.1) are obtained is obvious: they are
constructed as if the matrices y* formed a 4-vector, y5 were a pseudoscalar, and
the i/f and ^ on either side together formed a scalar, t The non-existence of bilinear
forms which are symmetrical 4-tensors is seen from the spinor representation and
also from this rule: since the symmetrical combination of matrices is 7^7" + 7"7|i =
2g*1", any such form would reduce to a scalar.
The second-quantized bilinear forms are obtained by replacing the (//-functions
in (28.1) by i/r-operators. For greater generality, we shall assume that the two
(//-operators relate to fields of different particles, denoted by suffixes a and b. Let
t_For a unitary transformation of *p (change of representation), we have <//->lty,
ift-nfrU~ , and the invariance of the bilinear forms under such transformations is obvious.
t The "pseudoscalar" nature of y5 is itrtif in accordance with these rules, since
y = 24«A**y y * y y p .
y^UyU'\
§28
103
Bilinear Forms
us see how such operator forms are transformed under charge conjugation. We
havet
4fc = Uc$,
$c = U+ci,
(28.3)
and therefore, using (26.3) and (26.21),
= -iaUcUc^b
= ~ 4>aU+cy»Uc$b
= ky^bWhen the operators are restored to their original order (^ to the left of <£), the
Fermi commutation rules (25.4) show that the sign of the product is changed (and
moreover terms appear which are independent of the state of the field; we omit
these, as in the corresponding treatment in §13). Thus we have
$ïfô = $bia, &Y M # = ~ky*h.
Proceeding similarly with the other forms, we find the results for charge conjugationt
C
'.
Jab ~* •Jfco»
-»ab ~* *ba»
»4b ~~* ~~ » oa>
/yo
A\
The behaviour of these forms under time reversal may be ascertained similarly,
remembering (see §13) that this operation brings about a change in the order of the
operators, so that, for example,
(</Ub)T = il$l
t To derive the second of these equations from the first, we write
î>c=[UKfa0*))y°
= y0uty0î,
= -y°U+cy°4>
= y°y°*U+cj,
using (26.3), (26.21) and the fact that 7 0 is Hermitian.
t It should be noticed that, for bilinear forms constructed from (/»-functions (not «fr-operators), the
transformations (28.4) would have the opposite signs, since the return to the original order of the factors
ji and «// would not involve a change of sign.
104
§28
Fermions
Substituting here
i//T = UT$,
$T = -UT&
(28.5)
we obtain
Treating the other forms in the same way, we obtian
Sab -> &fl,
T:
Pab -> - A « ,
(A0, Â)ûb -> (- ° A)*,
( V°, V)ûb -> ( V°, - V ) t o
f a ; = (p, â)ûb -> (p, - â)>«,
(28.6)
where p and a are three-dimensional vectors equivalent to the components of T *v
as shown in (19.15).
Under spatial inversion we have, in accordance with the tensor properties,!
P:
£*->&*,
0
Pab -> -Pab9
(A , \)ab ->(-A°, A)*,
(V°,V) û b -^(V°,-VU
Tj£ = (p, â)ûb - * ( - p , â)«„.
(28.7)
Finally, the simultaneous application of all three operations leaves all the Saby Pab
and f {£ unchanged, and changes the sign of all the V£b and A£b, in agreement with
the fact that this transformation is a 4-inversion: since 4-inversion is equivalent to a
rotation of the 4-coordinates, it creates no difference between true tensors and
pseudotensors of any rank.
Let us now consider products of pairs of bilinear forms constructed from four
different functions i/ffl, </>*, ij>c, if/d. The result depends on which pairs of functions
are multiplied together. It is possible, however, to reduce any such product to
products of bilinear forms with specified pairs of factors (W. Pauli and M. Fierz,
1936). We shall derive the relationship on which this reduction is based.
If we take the set of four-rowed matrices
hy\y\iy*y5,i<rl"'>
(28.8)
where 1 is the unit matrix, arrange these 16 (=1 + 1 + 4 + 4 + 6) matrices in any
definite order and denote them by yA (A = 1 , . . . , 16), and also denote the same
matrices with lowered 4-tensor indices (pt, v) by yA, then they will have the
following properties:
try A = 0 ( y A * l ) ,
1
^A^
«
lfr^ A ^ - KA I
y yA - i ,
4 tr 7 ye - o B. J
(28.9)
t To avoid any misunderstanding, it should be mentioned that the transformations T and P also
involve a change in the arguments of the functions; the right-hand (transformed) sides in (28.6H28.7)
are respectively functions of xT = (-f, r), xp = (U -r), when the left-hand sides are functions of
x=(f,r).
Bilinear Forms
§28
105
The last of these shows that the matrices yA are linearly independent. Since their
number is equal to the number (4 x 4) of elements of a four-rowed matrix, the
matrices yA form a complete set in terms of which an arbitrary four-rowed matrix
T may be expressed:
r =Sc
A
7
A
,
cA = itry A r f
(28.10)
or, in explicit form, with matrix suffixes i, k - 1, 2, 3, 4,
r,k = 4 ZJ r/m7ml7AiJtA
Assuming, in particular, that in the matrix T only the element Fim is non-zero, we
obtain the required relation (the "completeness condition"):
fi«ôta,=si2 7A*7£i.
A
(28.11)
Multiplying both sides of this equation by ^kümty*, we have
( # V X * V ) = 4 S (ryA*i>b)Wy V ) .
A
(28-12)
This is one equation of the type mentioned above, reducing the product of two
scalar bilinear forms to products of forms involving other pairs of factors.t
Other equations of the same type may be obtained from (28.12) by the changes
using the expansion
7V
= Ç
CR7*,
CR = \ tr 7 V T K
(see the Problem).
Here we may also give for future reference, the relation for two-rowed matrices
which corresponds to (28.11). The complete set of linearly independent two-rowed
matrices aA (A = 1, 2, 3,4) is
1, o\, <7y, <rz.
(28.13)
These have the properties
tr<rA = 0
UT<TA<TB
( a A # 1),
=
8AB.
(28.14)
t It should be mentioned, to avoid any misunderstanding, that we are referring here to forms
constructed from »//-functions. The sign of the transformation would be opposite for forms constructed
from the anticommuting (//-operators.
106
§29
Fermions
The completeness condition is
= 2<raß • a Ô7 + 2ßa0Ö07
(28.15)
(a, ß, y, 8 = 1,2), or
«Ja/s • <*ày = ""2cra7 • o"^ + iSa-ySsß.
(28.16)
PROBLEM
Derive formulae analogous to (28.12) for products of pairs of bilinear forms, P, V, A, T.
SOLUTION. We use the notation
JP = (<Äay V)(£ c y V>,
Js = (*VX* V ) ,
Jv = ( * V * b X*CY**').
JA = (♦"iW^X^friO'V),
Jr =
Waia^b){Vi*^d),
and the same symbols with primes to denote the products with tyb and *ftd interchanged. The method
used above gives
4J£= Js+ Jv+ h+
4J{, = 4 J S - 2 J V
4Jf=6J s
-2J T
4JA = 4Js + 2Jv
4JP=
JA+ JP,
+2JU-4JP,
/s-
+ 6JP,
-2JA-4JP,
Jv + J T -
JA+ JP,
thefirstof these equations being the same as (28.12).
§ 29. The polarization density matrix
The dependence on the coordinates of the wave function $ which describes
free motion with momentum p (a plane wave) reduces to a common factor e , p r ,
and the amplitude up acts as a spin wave function. In such a state (a pure state), the
particle is completely polarized (see QM9 §59). In the non-relativistic theory this
means that the particle spin has a definite direction in space (more precisely, there
exists a direction in which the spin component has the definite value +2)- In the
relativistic theory, this description of a state in an arbitrary frame of reference is
not possible, because, as already mentioned in §23, the spin vector is not conserved. The term "pure state" signifies only that the spin has a definite direction in
the rest frame of the particle.
In a state of partial polarization, there is no definite amplitude, but only a
polarization density matrix p* (i, k = 1, 2, 3,4 being bispinor indices). We define this
matrix in such a way that in a pure state it reduces to products:
pik = UpMpk.
(29.1)
The Polarization Density Matrix
§29
107
Accordingly the matrix p is normalized by the condition
tr p = 2m ;
(29.2)
cf. (23.4).
In a pure state, the mean value of the spin is given by the quantity
= ±üpy°Xup.
(29.3)
The corresponding expression for a state of partial polarization is
s = ^tr(p7°X) = ^ t r ( p 7 5 7 ) .
The amplitudes up and üp satisfy the algebraic equations (7p -m)up
üP(yp - m) = 0. The matrix (29.1) therefore satisfies the equations
(yp -m)p
= p(yp - m) = 0.
(29.4)
= 0,
(29.5)
The density matrix must satisfy similar linear equations in the general case of a
state which is mixed (with respect to the spin); cf. the analogous argument in QM,
§14.
If we consider a free particle in its rest frame, the non-relativistic theory is
applicable. In that theory the state of partial polarization is completely defined by
three parameters, the components of the mean spin vector s (see QM, §59). It is
therefore clear that the same parameters will define the polarization state after any
Lorentz transformation, i.e. for a moving particle.
Let twice the mean spin vector in the rest frame be denoted by Ç; in a pure state
|Ç| = 1, in a mixed state |£| < 1. For a four-dimensional description of the polarization state it is convenient to define a 4-vector aß which in the rest frame is the same
as the three-dimensional vector £; since £ is an axial vector, aM is a 4-pseudovector.
This 4-vector is orthogonal to the 4-momentum in the rest frame (in which
fl" = (0, £), P* = (m, 0)); in any frame, we therefore have
a^=0.
(29.6)
a,a»-~ -Ç 2 .
(29.7)
In any frame, moreover,
The components of the 4-vector aM in a frame in which the particle is moving
§29
Fermions
108
with velocity v = p/e are found by a Lorentz transformation from the rest frame,
and are
Û° = ^ I I ,
ax = L ,
ai = ^ | .
(29.8)
where the suffixes || and 1 denote the components of the vectors £ and a parallel and
perpendicular to the direction of p.t These formulae may be expressed in vector form:
. - t + JälEL
*
m(e + m)
a« =
!LJ> = R^,
e m
«».f + k ^ .
*
ml
(29.9)
'
Let us first consider the unpolarized state (Ç = 0). The density matrix in this case
can contain as parameters only the 4-momentum p. The only form for such a
matrix which satisfies the equations (29.5) is
(29.10)
P=kyp + m)
(I. E. Tamm, 1930; H. B. G. Casimir, 1933). The constant coefficient is chosen in
accordance with the normalization condition (29.2).
In the general case of partial polarization (Ç # 0), we seek the density matrix in
the form
P = 4^- (7P + m)p'(yp + m),
(29.11)
which necessarily satisfies the equations (29.5). When Ç = 0, the auxiliary matrix p'
must become a unit matrix; since (yp + m)2 = 2m(yp + m), (29.11) is then the same
as (29.10). The matrix p' must also contain the 4-vector a linearly as a parameter,
i.e. must be of the form
p'=l-Ay5(ya);
(29.12)
the second term includes the "scalar" product of the pseudovector a and the
"matrix 4-pseudovector" y 5 y. To determine the coefficient A, we write the density
t As regards their transformation properties, the components of the mean spin vector s (like those of
any angular momentum) are, in relativistic mechanics, the space components of an antisymmetric tensor
A
S *\ The 4-vector ak is related to this tensor by the equations
SAM = — ex*vpa n
nK - - — p^^S
n
o
2m t
"»Pi»
a e o^vP».
It must be emphasized that, in an arbitrary frame of reference, the spatial part a of the 4-vector aA is not
the same as the vector 2s: we easily see that
2SII = — ( a i e - a 0 | p | ) = Ch
§29
The Polarization Density Matrix
109
matrix in the rest frame:
p=}m(l + 70)(l + A y V W + 7°)
= im(l+y°)(l + A T V Q
and calculate the mean spin by (29.4). Using the rules given in §22, we easily find that
the only non-zero term in the trace is
2s = 2 ^ t r ( p 7 5 7 )
= -iAtr[(7-Ü7l
Equating this to Ç, we have A = 1. The final expression for p is obtained by
substituting (29.12) in (29.11) and interchanging the factors p' and yp + m; since a
and p are orthogonal, the product yp anticommutes with ya:
(ya)(yp) = lap - (yp)(ya) = -
(yp)(ya),
and therefore commutes with y5(ya).
Thus the density matrix of a partially polarized electron is given by the
expression
P = l2(yp + m)(\-y5(ya))
(29.13)
(L. Michel and A. S. Wightman, 1955). If the matrix p is known, the 4-vector a
which describes the state can be found from
a'=^tr(p7V),
(29.14)
and the vector Ç is therefore also known.
The formulae for the density matrix of the positron are entirely analogous to
those of the electron. ïf the positron (with 4-momentum p) were described by a
positron amplitude a(ppos) and by a density matrix p(pos) defined on the basis of this
amplitude, then there would be no difference from the case of the electron, and the
matrix p(pos) would be given by the same formula (29.13). But, in actual calculations
of cross-sections of scattering processes involving positrons, it is necessary (as we
shall see below) to deal not with n(ppos) but with the "negative frequency" amplitudes
u-p. Accordingly, the polarization density matrix (which we denote by p(_)) must be
defined so as to reduce to u-pfi-pM for a pure state.
According to (26.1), the positron amplitude n(ppos)= Ucü-P. Conversely,
K-p = Lrcw(ppos),
û-p = U£u jr* = ujr'l/Jf;
Fermions
110
§29
cf. (28.3). If
pifc^n-p..-0-p.k,
pir s } = nSRos)fiS,s\
then these formulae give
p M = Ucßipos)Ut
(29.15)
Substituting for p(pos) the expression (29.13), we obtain, after some simple
rearrangements using (26.3) and (26.21),
pM = k 7 P - m ) ( l - 7 5 ( 7 û ) )
(29.16)
In particular, for an unpolarized state
P{) = kyp-m).
(29.17)
In referring to positron density matrices below, we shall intend the matrices p ( ) ,
and the index (-) will be omitted; the matrices p(pos) will not be needed in practice.
In various calculations it will often be necessary to average over spin states an
expression such as üFu (=üiFikuk) where F is a (four-rowed) matrix and u is the
bispinor amplitude of a state having a definite 4-momentum p. This averaging is
equivalent to replacing the products uküi by the density matrix pki of a partially
polarized state.
In particular, complete averaging over two independent spin states is equivalent
to changing to an unpolarized state, and, by (29.10), we have
-2 S üpF"p = 2 tr (yp + m)F.
polar.
(29.18)
Similarly, for wave functions with negative frequency,
\ 2
polar.
ö-pFn-p=hr(YP-m)F.
(29.19)
If summation over spin states replaces averaging, the result is doubled.
Let us see how the density matrix (29.13) tends to its non-relativistic limit. To
do so, we use the rest frame of the electron. In the standard representation of the
wave functions, the amplitudes up in this frame have two components in the limit,
and the density matrix must accordingly have two rows. For, in the rest frame,
p = im(7°+l)0-7Va
and we find from the expressions (21.20) and (22.18) for the matrices y
P = ( Jn0nr
I ),
Pnon-r = m ( l + CT • Ç),
( 29.20)
111
Neutrinos
§30
the zeros denoting two-rowed null matrices. If we use the normalization of the
density matrix to unity (tr pnon.r = 1), as is customary in the non-relativistic theory,
instead of the normalization to 2m, then the above expression must be divided by
2m, giving
(l+a-0
in agreement with QM, (59.4), (59.5).
Similarly, the non-relativistic limit for the positron density matrix is
P = L
W
),
Pnon-r /
Pnon-r =
~ m ( l + CT • Ç).
Finally, there is a simpler expression for the density matrix in the ultrarelativistic case. Putting in (29.8) |p| — e (and thereby neglecting small quantities of
order (m/e) 2 ), substituting these expressions in (29.13) or (29.16), and taking the
direction of p as the x-axis, we can write
P=5[e(y0-71)±m][l-75(^(7°-71K|| + Ç1-7l)],
where the upper sign refers to the electron and the lower sign to the positron.
When the product is expanded, the leading terms cancel, and those of the next
order give
P = ^ ( 7 ° - 7 , ) [ l - 7 5 ( ± ^ l l + Ci-7i)]
or, again writing 6 ( 7 ° - 71) in the form 7p,
P = kyp)[l - 75( ± ft + L • 7i)l.
(29.21)
This is the required expression for the density matrix in the ultra-relativistic
case. It should bç noticed that all the components of the polarization vector £
appear in this expression equivalently, as terms of the same order of magnitude. It
will be recalled that £| is the component of this vector parallel (if £n>0) or
antiparallel (if C\\ < 0) to the particle momentum. In particular, for a helicity state of
the particle, £| = 2A = ± 1 ; the density matrix then has an especially simple form,
p=\yp{\±2\y\
(29.22)
which is, as it should be, the same as the neutrino or antineutrino density matrix,
these being particles with zero mass and definite helicity (see §30).
§30. Neutrinos
We have seen in §20 that the necessity of two spinors (£ and rj) to describe a
particle with spin \ is due to the mass of the particle. This necessity disappears if the
mass is zero. The wave equation which describes such a particle can be derived from a
Fermions
112
§30
single spinor, say the dotted spinor rj :
P a % = 0,
(30.1)
(Po + p-<r)r,=0.
(30.2)
or, equivalently,
It has also been noted in §20 that the wave equation containing the mass m is
necessarily symmetrical with respect to inversion (the transformation (20.4)). When
the particle is described by a single spinor, this symmetry is lost, but it is not needed,
since symmetry with respect to inversion is not a universal property of Nature.
The energy and momentum of a particle with m = 0 are related by e = |p|. For a
plane wave (% ~ e~ipx), equation (30.2) therefore gives
(n-a)rjp = -TJP,
(30.3)
where n is a unit vector in the direction of p. A similar equation,
(n • a)îj-p = - Tj-p,
(30.4)
applies to a wave with "negative frequency" (rj-p — eipx).
The second-quantized ^-operator is
p
(30.5)
Hence it follows, as usual, that T)*P are the wave functions of the antiparticie.
The definition (20.1) of the operators p"* shows that pa** = - paß. The complex
conjugate spinor ij* therefore satisfies the equations p^t? J = 0, or, equivalently,
J W * = O.
We write TJ** = £ß, thus expressing the fact that complex conjugation changes the
dotted to the undotted spinor. The wave function of the antiparticie then satisfies the
equation
P\*^=0,
(30.6)
(Po-p-cr)| = 0.
(30.7)
(n-a)& = &.
(30.8)
or
Hence, for a plane wave,
§30
113
Neutrinos
But 2H • er is the operator which projects the spin on the direction of motion.
Equations (30.3) and (30.8) therefore signify that states of the particle with a definite
momentum are necessarily helicity states, for which the spin component in the
direction of motion has a definite value. If the particle spin is opposite to the
momentum (helicity -£), then the antiparticle spin is along the momentum (helicity
+Ö.
The neutrinos which occur in Nature appear to be particles possessing these
properties. The particle with helicity —2 is conventionally called the neutrino, and that
with helicity +2 the antineutrino.t
In connection with the fact that the neutrino states are not degenerate with respect
to spin directions, reference should be made to the comment in §8 that a massless
particle has only axial symmetry about the direction of the momentum. For a strictly
neutral particle (the photon), this symmetry includes both rotations about the axis and
reflections in planes passing through the axis. For the neutrino, there is no reflection
symmetry, leaving only the group of rotations about the axis, which conserve the
angular momentum component along the axis and do not change its sign. The
symmetry with respect to reflections exists only if the particle is at the same time
replaced by the antiparticle.
It should also be noted that the necessarily longitudinal polarization signifies that
the spin of the neutrino cannot be distinguished from its orbital momentum, just as
with the necessarily transverse photon fields (§6).
From one spinor TJ (or £), only four bilinear combinations can be constructed,
which together form the 4-vector
JM = (T)*T>,T,*aT,).
(30.9)
It is easily verified that the equations
(Po + p-cr)î) =0,
T)*(Po-p-<r) = 0
imply the continuity equation dj* = 0, so that j*1 acts as a particle current density
4-vector.
It is convenient to normalize neutrino plane waves in a manner similar to that used
in §23 for particles possessing mass:
^ = V ( b ) Upe~lPX> v~p=vfe)
Up PX
*' '
(3(U0)
the spinor amplitudes being normalized by the invariant condition
u?p(l,cr)H±p=2(e,p).
(30.11)
The particle density and particle current density are then j° = 1, j = p/e = n.
Since a free neutrino with a given momentum is always completely polarized, there
t The existence of neutrinos was theoretically predicted by W. Pauli (1931) in order to explain the
properties of ß-decay. Equation (30.1) wasfirstdiscussed by H. Weyl (1929). The neutrino theory based on
these equations was evolved by L. D. Landau, T. D. Lee and C N. Yang, and A. Salam (1957).
114
Fermions
§30
are no states which are mixed (with respect to the spin). It may nevertheless be
convienent to define a two-rowed polarization "density matrix" simply as the spinor
of rank two
Paß = uaw*,
(30.12)
for which tr p = 2e. An expression for this matrix can be obtained by noting that it
must satisfy the equations
(s + p • v)p = p(e + p • a) = 0.
Hence we have
p = e-p-a.
(30.13)
In the consideration of various interaction processes, neutrinos may appear
together with other particles having spin 2 and possessing mass, which are therefore
described by four-component wave functions. In such cases it is convenient to retain
uniformity of notation by formally defining for the neutrino also a "bispinor" wave
function having two components zero:
♦-0
But this form of i}/ is in general not preserved in another (non-spinor) representation.
This difficulty can be overcome by noting that in the spinor representation we have
identically
where £ is an arbitrary "makeweight" spinor which does not appear in the final result;
the matrix y5 is given by (22.18). The condition for a true "two-component" neutrino
will therefore be maintained when it is described by a four-component ^ in any
representation, if <// is taken to be the solution of Dirac's equation with m = 0:
(7P)* = 0f
(30.14)
subject to the additional condition |(1 + y5)^ = *l*> or
y V = *,
(30.15)
This condition may be taken into account by replacing ^ and <j>, wherever they
would occur, by the following expressions:
§31
The Wave Equation for a Particle with Spin 3/2
115
For example, using these expressions in t//7Mi//, the current density 4-vector may be
written in the form
j"=;|.Ml-75)7'1(l+7>
= ^ 7 * ( 1 + 75)i/>.
(30.17)
In the same way, the four-rowed neutrino density matrix becomes
P=l(\ + 75)(YP)(1 - 75) = i d + Y5)(7P)-
(30.18)
In the spinor representation it reduces, as it should, to the two-rowed matrix (30.13),
p=
/
0
0\
L - < r - p 0J-
The corresponding formulae for the antineutrino differ from those given above by
a change of the sign of y\
The neutrino is an electrically neutral particle, but, with the properties described
above, it is not a strictly neutral particle. Here it must be noted that the "neutrino field"
described by a two-component spinor is equivalent, as regards the number of possible
particle states (but not, of course, as regards its other physical properties) to a strictly
neutral field described by a four-component bispinor. Instead of states of particles and
antiparticles with definite helicity, we should here have the same number of states of
one particle with two possible values of the helicity, and the property of symmetry
with respect to inversion would automatically occur. However, the zero mass of the
"four-component" neutrino would be, so to speak, "accidental", since it would be
unrelated to the.symmetry properties of the wave equation describing the neutrino
(which allows also a non-zero mass). The various interactions of such a particle would
therefore necessarily imply the existence of a small but not zero rest mass.
§31. The wave equation for a particle with spin 3/2
A particle with spin 3/2 is described, in its rest frame, by a three-dimensional
symmetrical spinor of rank three (having 2s + 1 = 4 independent components).
Correspondingly, in an arbitrary frame of reference, its description may involve the
4-spinors £a/*7, r)aßy and £aßy9 \aßy, each of which is symmetrical in all the indices of one
kind (dotted or undotted). The spinors in each pair are interchanged by inversion.
In order that the 4-sponors £aßy and r\aßy should become in the rest frame 3-spinors
symmetrical in all three indices, they must satisfy the conditions
0*W
= O,
Paß^y = 0:
in the rest frame we have
pdß-+Po8ßa = m8l
(31.1)
116
Fermions
§31
as is seen from (20.1), and the conditions (31.1) therefore imply the equations
«2T,'% 7 = O,
s £ r P 7 = o,
where the primed letters denote the corresponding three-dimensional spinors. Thus
these spinors give zero on contraction with respect to the indices a, ß, which means
that they are symmetrical in these two indices, and hence in all three.
The differential relations between the spinors £ and 17 are
The conditions (31.1) ensure that the left-hand sides of (31.2) vanish on contraction with respect to any other pair of indices, and hence that these quantities are
symmetrical in ß, y or a, 5. In the rest frame, the three-dimensional spinors £' and TJ'
are the same according to (31.2). On eliminating r) or £ from (31.2), we find that each
component of the spinors £ and 17 satisfies the second-order equation
( p 2 - m 2 ) r ß 7 = 0.
(31.3)
The equations (31.1), (31.2) form a complete set of wave equations for a particle
with spin 3/2.t No further results would be obtained by using the spinors Ç and x- Their
structure is as follows:
mtaßy =
pa*Jit\
mXaßy — PaSCßy
The equations of particles with spin 3/2 can also be put in a different form, using the
vectorial properties of spinors (W. Rarita and J. Schwinger, 1941 ; A. S. Davydov and I.
E. Tamm, 1942). One four-dimensional vector index /x is assigned to a pair of spinor
indices aß. Thus the components £aßy of the spinor of rank three can be put into
correspondence with the components of the "mixed" quantities i/f£, which have one
vector and one spinor index. Similarly, the spinor r)ßay is correlated with i//£, and the
two spinors together correspond to a "vector" bispinor i//M (omitting the bispinor
index). The wave equation then becomes a "Dirac equation" for each of the vector
components 1/^:
(7P-m)</v=0,
(31.4)
7^=0.
(31.5)
with the added condition
Using the expressions for the matrices y*1 in the spinor representation and the
t See the paper by Fierz and Pauli, cited in §15, concerning the Lagrangian formulation of these
equations.
§31
The Wave Equation for a Particle with Spin 3/2
117
relations (18.6), (18.7) between the spinor and vector components, we can easily verify
that equation (31.4) implies equations (31.2), and that the condition (31.5) is equivalent
to the condition for the spinors £a/*7 and r)aßy to be symmetrical in the indices ßy or ßy.
Multiplying (31.4) by y* and using (31.5), we find
or, with the commutation rules for the matrices yß,
2g^p^-7^p,7^=0.
(31.6)
The second term in turn is zero by (31.5), and the first term gives
P^M=0.
(31.7)
This condition, which is implied by (31.4), (31.5), is easily seen to be equivalent to the
conditions (31.1).
Finally, yet another way of expressing the wave equation is to use quantities i/>iW (i,
k, / = 1, 2, 3, 4) with three bispinor indices, in which they are symmetrical (V.
Bargmann and E. P. Wigner, 1948). The set of these quantities is equivalent to the
components of all four spinors £, rj, £, \- The wave equation becomes a set of "Dirac
equations"
P^im^mki
= mil>lki.
(31.8)
We easily see that these equations yield the necessary number (four) of independent
components ifcw, and there is therefore no need to impose any further conditions. In the
rest frame, (31.8) becomes
yL&mki = fou,
according to which all the components with i, Jc, I = 3, 4 are zero (in the standard
representation), i.e. the fau reduce to the components of a three-dimensional spinor of
rank three.
The results given above have an obvious generalization for particles with any
half-integral spin 5. In the description by equations of the form (31.4), (31.5), the wave
function is a symmetrical 4-tensor of rank |(2s - 1) with one bispinor index. In the
description by equations of the form (31.8), the wave function has 2s bispinor indices
and is symmetrical in these.
CHAPTER IV
PARTICLES IN AN EXTERNAL FIELD
§ 32. Dirac's equation for an electron in an external field
T H E wave equations of free particles express essentially only those properties
which depend on the general requirements of space-time symmetry. Physical
processes involving the particles, however, depend on their interaction properties.
The description of the electromagnetic interactions of particles in relativistic
quantum theory can be effected by a generalization of the method used in classical
non-relativistic quantum theory.
This method, however, is applicable to the description of electromagnetic
interactions only of particles that are not capable of strong interactions. These
include electrons (and positrons), and the very wide domain of electron quantum
electrodynamics is therefore accessible to the existing theory. There are also
unstable particles, the muons, which are not capable of strong interactions; they
are described by the same quantum electrodynamics as regards phenomena occurring in times short in comparison with their lifetime (with respect to weak
interactions).
In this chapter we shall discuss problems of quantum electrodynamics which
fall within the scope of single-particle theory. These are problems in which the
number of particles is unchanged, and the interaction can be represented in terms
of an external electromagnetic field. Besides the conditions which ensure that the
external field may be regarded as given, there are conditions arising from "radiative
corrections" which also impose limits on the validity of such a theory.
The wave equation of an electron in a given external field can be derived in the
same way as in the non-relativistic theory (QM, §111). Let AM = (4>, A) be the
4-potential of the external electromagnetic field (A being the vector potential and <I>
the scalar potential). We obtain the desired equation on replacing the 4-momentum
operator p in Dirac's equation by p - eA, where e is the charge on the particle t:
[y(p-e>A)-m]<A = 0.
(32.1)
The corresponding Hamiltonian is found by making the same change in (21.13):
H = a • (p - e A) + ßm + ed>.
(32.2)
The invariance of Dirac's equation under gauge transformations of the electromagnetic field potential is shown by the fact that it is unchanged in form if the
transformation A-* A + ip\ (where \ *s a n arbitrary function) is accompanied by
t The charge together with its sign is meant, so that for the electron e = -|*|.
118
§32
Diràc's Equation for an Electron in an External Field
119
the following transformation of the wave function:!
i/f-»i//e**;
(32.3)
cf. the corresponding transformation for Schrödinger's equation in QM, §111.
The current density is expressed in terms of the wave function by the same
formula (21.11),
j = i/ryi//,
as when there is no external field. It is easily seen that, when the calculations used
in deriving (21.11) are repeated with the equation (32.1) (and the equation (32.4)
below), the external field disappears, and the continuity equation contains the same
expression for the current as previously.
Let us apply the operation of charge conjugation to equation (32.1). To do so,
we write the equation
$[y(p + eA) + m] = 0,
(32.4)
which is obtained as the complex conjugate of (32.1), in the same way as equation
(21.9) was derived in Chapter HI, and using the fact that the 4-vector A is real.
Putting this equation in the form
[y(p + eA) + m]ijj = Oi
multiplying it on the left by the matrix Uc and using the relations (26.3), we find
[y{p + eA) - m ](Ct/0 = 0.
(32.5)
Thus the charge-conjugate wave function satisfies an equation which differs
from the original equation by a change in the sign of the charge. The operation of
charge conjugation, however, corresponds to a change from particles to antiparticles. We see that, if the particles possess an electric charge, the signs of the
electron and positron charges are necessarily opposite.
Thefirst-orderequations (32.1) can be transformed into second-order equations
by applying to them the operator y(p - eA) + m :
[YV(PM
- eA^){pv - eAv) - m2]^ = 0.
The product y''y1' may be written as follows:
y V = K Y V + y V ) + 2(7M7' - y V )
t The transformation (32.3) with a function #(f, r) is sometimes called a local gauge transformation,
in contrast to the global gauge transformation (12.10) with a constant phase a.
120
Particles in an External Field
§32
where crMl is the antisymmetric "matrix 4-tensor" (28.2). On multiplying by o-*1" we
can antisymmetrize by the substitution
(pM ~ MM)(p(, - eA,)->£{(pM ~ eK)(P» ~ ^A,)}= \e{- A^p, + pvA» ~ P»AV + Avp^)
=
\ie{dvAli-dlxAv)
= -2JeFpV
(with FM„ = dßAt. - a^AM the electromagnetic field tensor). The result is the secondorder equation
[(p - eA)2 -m2-
YieF^W
= 0.
(32.6)
The product FM„cr^ may be written in three-dimensional form in terms of the
components
or"'' = (a, iX),
F ^ = (-E,H).
Then
[(p - M) 2 - m2 -(- eX • H - iea • E]i^ = 0,
(32.7)
or, in ordinary units,
[(H-f*) 1 -(»V + fA) , -»V + f s . H - l f « . E ] * - a «32.7a,
The occurrence in these equations of terms in the fields E and H is due to the spin
of the particle, and will be further discussed in §33.
The solutions of the second-order equation include, of course, "redundant"
solutions which do not satisfy the original first-order equation (32.1), being solutions of that equation with the opposite sign of m. The choice of the appropriate
solutions in particular cases is usually obvious and causes no difficulty. The
customary procedure is that, if </> is any solution of the second-order equation, then
a solution of the correct first-order equation is
if, = [y(p - eA) + m]<t>.
(32.8)
For, on multiplying this equation by y(p - eA) - m, we see that the right-hand side
vanishes if <£ satisfies (32.6).
It should be emphasized that the introduction of the external field into the
relativistic wave equation by replacing p by p - eA is not self-evident. In doing so,
we have essentially made use of a further principle: this substitution must be
applied to first-order equations. For this reason equation (32.6) contains additional
terms which would not appear if the substitution were made directly in the
second-order equation.
§32
Dirais
Equation for an Electron in an External Field
121
The stationary-state solutions of Dirac's equation in an external field may
include states of both the continuous spectrum and the discrete spectrum. As in the
non-relativistic theory, states of the continuous spectrum correspond to infinite
motion, in which the particle can be at infinity; it may there be regarded as a free
particle. Since the eigenvalues of the Hamiltonian of a free particle are ±V(p 2 +
m2), it is clear that the "continuous spectrum of energy eigenvalues is in the ranges
e^m and e^-m.
If -m < e <m, the particle cannot be at infinity, and the
motion is therefore finite and the state belongs to the discrete spectrum.
As in the case of free particles, the wave functions with "positive frequency"
(e > 0 ) and with "negative frequency" (e < 0 ) appear in a definite manner in the
second quantization procedure. For particles in an external field, there is a natural
generalization of this procedure, the plane waves in formulae (25.1) being replaced
by appropriately normalized eigenfunctions i//(n+) and ip^ of Dirac's equation,
corresponding to positive and negative frequencies (e(„+) and -e ( n -) ):
n
(32.9)
However, as the potential well becomes deeper, the energy levels may cross the
boundary e = 0, so that positive levels become negative (or vice versa when the
potential has the opposite sign). Nevertheless, for the sake of continuity we must
still regard these as electron (not positron) levels. That is, the electron states are to
be regarded as all those which approach the positive limit of the continuous
spectrum (e = m) when the field is removed with infinite slowness.
Although Dirac's equation for an electron in an external field does, as already
mentioned, yield solutions for a large class of problems in quantum electrodynamics, we must at the same time emphasize that the applicability of the
concept of an external field in the one-particle problem of relativistic theory is
nevertheless restricted, because of the spontaneous formation of electron-positron
pairs in sufficiently strong fields (see §§35 and 36).
We shall not deal in this book with the inclusion of an external field in the wave
equations for particles with spin other than i, since the topic has no immediate
physical significance: actual particles having such spins are hadrons, and their
electromagnetic interactions cannot be described by wave equations.
PROBLEM
Determine the electron energy levels in a constant magnetic field.
SOLUTION. The vector potential is Ax = A: = 0, Av = Hx (the field H being along the z-axis). The
components py, pz of the generalized momentum (as well as the energy) are conserved.
We use the second-order equation for the auxiliary function <f> (see (32.8)), and assume that <f> is an
eigenfunction of the operator X? (with eigenvalue a = ± 1) and of the operators py, pz. The equation for <f>
(
d2
1
- jp + (eHx - py)2 - eHa U = (e2 - m2 - p2z)<f>.
122
Particles in an External Field
§33
This equation is the same in form as Schrödinger's equation for a linear oscillator. The eigenvalues of e
are given by
e2 - m2 - p\ = \e\H(2n + 1) - eHa< n = 0, 1 , 2 , . . . ;
cf. QM, §112. The wave function i/>, which is to be determined from <f> according to (32.8), is not an
eigenfunction of the operator S z , in accordance with the fact that the spin is not conserved for a particle
in motion.
§33. Expansion in powers of 1/ct
We have seen in §21 that, in the non-relativistic limit (v ->0),
two components (x) of the bispinor i// = ( J vanish.
^A '
Hence \ ^ 4> when the electron velocity is small. This leads to an approximate
equation involving only the two-component quantity <f>y obtained by a formal
expansion of the wave function in powers of 1/c.
Dirac's equation for an electron in an external field may be written
ih ^ = {cot • (p - 1 - A ) + ßmc2 + e<ï> W
(33.1)
The relativistic energy of the particle includes also its rest energy mc2. This must
be excluded in arriving at the non-relativistic approximation, and we therefore
replace if/ by a function <//' defined as follows:
</, = (/,' e-imc2tlh.
Then
(ih j-+ mc2)ilß' = (cot- U--A\
+ßmc2 + e<t>]if>'.
Substituting i/>' = ( , ), we obtain the equations
^A
'
[ih j
r
*<!>)</>' = ca • ( p - i A ) * \
(iÄ^-6cI) + 2mc 2 ^ , = c a • ( p - ^ A y , .
(33.2)
(33.3)
The primes to <f> and \ will henceforth be omitted; this will cause no misunderstanding, since only the transformed function i// is used in the present section.
t In this section, ordinary units are used.
§33
Expansion in Powers of 1/c
123
In the first approximation, only the term 2mc2x is retained on the left-hand side
of (33.3), which gives
A
1
cr-(p-^A)^
2mc
(33.4)
(thus x ~ <t>lc). Substitution of this in (33.2) gives
1
{*Tr<*)+=M°'{HA))+For the Pauli matrices we have the relation
(a • a)(cr • b) = a • b + icr • a x b,
(33.5)
where a and b are arbitrary vectors (see (20.9)). In the present case, a = b =
p - eA/c, but the vector product a x b is not zero, since p and A do not commute:
[(p-^)x(p-i
A
) ] ^ i ^ A x V + VxAH
= i — curl A • 6.
c
Thus
where H = curl A is the magnetic field, and for <$> we obtain the equation
This is PaulVs equation. It differs from the non-relativistic Schrödinger's
equation by the last term in the Hamiltonian, which has the form of the potential
energy of a magnetic dipole in the extertial field (cf. QM9 §111). Thus, in the first
approximation (with respect to 1/c), the electron behaves as a particle having both
a charge and a magnetic moment
» = -£-**•
(33.8)
The gyromagnetic ratio elmc is twice its value for a magnetic moment due to
orbital motion.t
t This remarkable result was first derived by P. A. M. Dirac in 1928. The two-component wave
function satisfying equation (33.7) was introduced by W. Pauli (1927), before Dirac's discovery of his
equation.
Particles in an External Field
124
§33
The density p = i//*i/f = <f>*<f> + \*X- The second term must be omitted in the first
approximation, so that p = |</>|2, as would be expected for Schrödinger's equation.
The current density is
j -
=
Ci//*oti/f
C(<$)*<JX
+ X*v<t>)-
We substitute here, from (33.4),
A
A
2mc
\
2mc\
c
c
/
/^
and transform the products containing two factors a by means of (33.5) in the form
(<r • a)cr = a + i a x a ,
cr(a • a) = a + ia x a.
j= ^(</>V<^-0n^^
zm
mc
zm
(33.9)
The result is
(33.10)
in agreement with the expression in the non-relativistic theory, QM (115.4).
Let us now derive! the second approximation, continuing the expansion as far
as terms of order 1/c2, and assuming that there is only an electric external field
(A = 0).
First, we note that, when terms —1/c2 are included, the density is
This differs from the Schrödinger expression. In order to find (in the second
approximation) the wave equation corresponding to Schrödinger's equation, we
must replace <t> by another (two-component) function 4>Sch, for which the timeindependent integral would be of the form / |</>sch|2 d3x, as it should be for
Schrödinger's equation.
To obtain the required transformation, we write the condition
/ *£**&* d'x = j {*** + -j^rp (V</>* • cr)(a • V<J>)} d'x
t The method used here is due to V. B. Berestetskif and L. D. Landau (1949).
§33
Expansion in Powers of 1/c
125
and integrate by parts:
[ (V0* • a)(or • V<f>) d'x = - | 4>*(<r • V)(cr • V)</> d3x
= - [ </>*A</)d3x
(or the same with <f> and </>* interchanged). Thus
whence it is evident that
*»-( , + ä& 1 )*
*-( | -&ife')*»-
<33
-">
To simplify the notation we shall assume a stationary state, replacing the
operator ihdidt by the energy e (with the rest energy subtracted). In the next
approximation after (33.4) we have from (33.3)
This is to be substituted in (33.2) and cf> then replaced by <£Sch according to (33.11),
omitting all terms of higher order than 1/c2. A simple calculation leads to an
equation for </>Sch in the form e<f>sch = H</>sch, where the Hamiltonian is
2
r • pXK*r • P) - r W® + r<*>P)}H = £2m
- + e<t> -8m^X-L
T^n{(<r
c +
4m
c
The expression in the braces is transformed by means of the formulae
(a • p)4>(a • p) = Op2 + (a • p4>)(or • p)
= <J)p 2 +ift(a-E)(a-p),
p2<D - 4>p2 = - ft2A$ + 2iftE • p,
where E = -V4> is the electric field. The final expression for the Hamiltonian is
H ^ ^ + e O - ^ ^ - j ^ a - E x p - ^ V i d i v E .
r
8m c
2m
8m c 4m c
(33.12)
The last three terms are the required corrections of order 1/c2. The first of these
three terms is due to the relativistic dependence of the kinetic energy on the
momentum (the expansion of the difference cV(p 2 + m2c2)- mc2). The second,
which may be called the spin-orbit interaction energy, is the energy of the
126
Particles in an External Field
§34
interaction of the moving magnetic moment with the electric field.t The last is zero
except at points where there are charges creating the external field (for instance, in
the Coulomb field of a point charge Ze, A3> = -AirZeb{r)) (C. G. Darwin, 1928).
If the electric field is centrally symmetric, then
rdcD
r dr
and the spin-orbit interaction operator can be put in the form
eh
d4>
h2 dU g A
„_.
r X
=
-—rr-tf •
P~T~ ^—r-T"~r~I • s>
(33.13)
r
v
Amlelr
dr
2mlelr dr
'
where I is the orbital angular momentum operator, s = 5a is the electron spin
operator, and U = e<$> is the potential energy of the electron in the field.
§ 34. Fine structure of levels of the hydrogen atom
Let us determine the relativistic corrections to the energy levels of the
hydrogen atom—an electron in the Coulomb field of a fixed nucleus.$ The velocity
of the electron in the hydrogen atom is vie — a < 1. The required corrections can
therefore be calculated by the use of perturbation theory, averaging over the
unperturbed state (i.e. over the non-relativistic wave function) the relativistic terms
in the approximate Hamiltonian (33.12). For somewhat greater generality we shall
take the charge of the nucleus as Ze, assuming, however, that
Za<\.
The field of the nucleus is E = Zer/r 3 , and its potential satisfies the equation
A4> = -47rZeS(r). Substituting this in the last three terms in (33.12) and using the
fact that the electron charge is negative, we obtain the perturbation operator
V=- X
8m
+
- 4 ^ I . s + f^ô(r).
2r m
2m
(34.1)
Since, according to the non-relativistic Schrödinger's equation,
(where e 0 = -m Z 2 a 2 / 2 n 2 is the unperturbed level and n the principal quantum
t With the magnetic moment (33.8) and the velocity v = p/m, this energy becomes - j i - Ex v/2c. At
first sight this result may appear unlikely, because on changing to a frame of reference fixed to the
particle there arises a magnetic field H = E x v/c, in which the energy of the magnetic moment should be
-jji • H. The occurrence of the factor 2 (the 'Thomas h a l f ; L. H. Thomas, 1926) is due to the general
requirements of relativistic invariance together with the particular properties of the electron as a
"spinor" particle with the corresponding value of the gyromagnetic ratio (see §41).
t The effect of the motion of the nucleus on these corrections is a quantity of a higher order of
smallness, which will not be considered here.
§34
Fine Structure of Levels of the Hydrogen Atom
127
number), the mean value is
?
4m 2 (s 0 + ^ ) 2 .
=
This quantity, like the mean value of the second term in (34.1), is calculated by
means of the formulae (see QM, §36)
r"1 = maZIn2,
(34.2)
(maZ)2ln\l+\),
r* =
rr5 = (maZ) 3 /n 3 /(/+|)(/+l),J
the last of which is valid if l^ 0; the eigenvalue is
l-s = i [ j ( j + l ) - / ( / + D - | ]
iff*0,
1•s=0
if / = 0.
Finally, the third term is averaged by means of the formulae
.3/2
i//(0) = 0,
f*0. J
(34.3)
The result of a simple calculation using the above formulae may be written for
all cases (for all j and /) as
Ae =
Za)4/
m(Za)
2
1 __3\
(34.4)
Formula (34.4) gives the required relativistic correction to the energy of the
hydrogen levels; that is, it gives the fine-structure energy.! In the non-relativistic
theory, there is both degeneracy with respect to spin direction and Coulomb
degeneracy with respect to i The fine structure (spin-orbit interaction) removes
this degeneracy, but not completely: there remain levels with mutual double
degeneracy, having the same n and j but different f = j ± i The levels with the
maximum possible value for j for a given n,
î
Jmax
= /
J. i
Imax * 2
=
n —1
"
2y
are then not degenerate. Thus the sequence of hydrogen levels, with allowance for
t This formula, and the more exact formula (36.10), were first derived by A. Sommerfeld from the
old Bohr theory before the development of quantum mechanics.
128
Particles in an External Field
§35
the fine structure, is
lsi/2;
2S|/2, 2pi/2, 2p3/2i
3si/2, 3pi/ 2 , 3p 3 /2, 3d 3 /2, 3d 5 / 2 ;
The level with principal quantum number n is split into n fine-structure components.
In non-relativistic mechanics, the "accidental" degeneracy of the energy levels
in a Coulomb field is due to the existence of a conservation law peculiar to this
field, relating to a quantity A whose operator is
r
r
i
2ma
r
K
see QM, (36.30). This specific conservation law is also responsible for the twofold
degeneracy which remains in the relativistic case: the Hamiltonian H =
a • p + ßm - e2lr of Dirac's equation commutes with the operator
I - - • X + — 0(2 • I + 1)75(H - mß)
r
ma
^
(M. H. Johnson and B. A. Lippmann, 1950). In the non-relativistic limit, J-»X • A.
We shall see later (§123) that this remaining degeneracy is removed by "radiative corrections" (the Lamb shift), which are neglected in Dirac's equation for the
single-electron problem.
To anticipate, it may be mentioned here that the order of magnitude of these
corrections is mZ4a5 log(1/a). The second-order spin-orbit interaction correction
would be — m(Za)6, and the ratio of this to the radiative corrections is therefore
— Z 2 a/log(l/a). For hydrogen (Z = 1), this ratio is certainly small, and the exact
solution of Dirac's equation is therefore of no significance in that case, but it may
be significant as regards the electron energy levels in the field of a nucleus with
large Z (§36).
§35. Motion in a centrally symmetric field
Let us now consider the motion of an electron in a centrally symmetric electric
field.
Since the angular momentum and the parity (relative to the centre of the field,
which is taken as the origin) are conserved in a central field, the discussion in §24
regarding spherical waves of free particles is entirely applicable to the angular
dependence of the wave functions of such a motion; only the radial functions vary.
Accordingly, we shall seek the wave function of the stationary states (in the
I
§35
Motion in a Centrally Symmetric Field
129
standard representation) in the form
where l=j±\,
l' = 2j-l, and the power of - 1 is chosen for subsequent convenience.
Dime's equation in the standard representation yields the following equations
for <f> and x:
(.-m-U»-a.fc,|
(j52)
(e + m - l/)x = <r-p<M
where U(r) = e<$(r) is the potential energy of the electron in thefield.The result of
substituting the expressions (35.1) is calculated by evaluating the right-hand sides
of these equations.
Expressing the spherical harmonic spinor njrm in terms of nj/m by
jlm
(see (24.8)), we can write
(<r • p)x = - i(cr • p)(cr • r) ^ nilm.
Now transforming the product (cr • p)(<r • r) by means of the formula (33.5), and
expanding the vector operators, we have
(a • p)x = -i{p • r + ia • p x r}&nj(m
= {-div r - (r • V) - a • r x p} & ni/m
= -{g' + *g+7<r-î}n„ m ,
where î = r x p is the orbital angular momentum operator; the prime denotes
differentiation with respect to r. The eigenvalues of the product a • î = 2Î • s are
21 • s = j 2 - 1 2 - s2
= j(j + i ) - i ( / + i ) - 2
_ /f j -- Ji if/
L:
if I = Jj -—
1—J — § i f / = / + :
In order to be able to use the same formulae for both cases (/ = j ± 2), it is
130
§35
Particles in an External Field
convenient to write
K = - 0 ' + J 2 ) = - ( 1 + 1)
for j =
l+l
for j = 1-1
= +(j + \)=l
(35.3)
The number K takes all integral values except zero, the positive numbers corresponding to the case } = I -{, and the negative numbers to the case j = I + \. Then
1 • a = - ( 1 + K), and
(cr • p)x = - \g' +
1 - K
—^-gpjlm.
When this expression is substituted in the first equation (35.2), the spherical
harmonic spinor £lilm cancels from the two sides. Proceeding similarly with the
second equation, we finally obtain the following equations for the radial functions:
f' + ^r1f-(e
+ m-U)g
r
g' +
1 - K
= 0,
(35.4)
g + ( e - m - U)/=0,
or
(/r)' + y ( / r ) - ( c + m - t / ) g r = 0,
(35.5)
(gr)'-j(gr)
+ {e-m-U)fr
= 0.
Let us examine the behaviour of / and g at small distances, assuming that the
field U(r) increases more rapidly than 1/r as r->0. Then, for small r, equations
(35.4) become
r+ug = o,
g'-uf=o.
These have real solutions, of the form
/ = constant x sin
(35.6)
g = constant x cos ( f Udr + s \
where 5 is an arbitrary constant. These functions oscillate as r->0, but tend to no
limit. It is easy to see that this situation corresponds, in the non-relativistic theory,
to the "falF' of a particle to the centre.
First of all, we note that the smallness of r here places no restrictions on the
§35
131
Motion in a Centrally Symmetric Field
choice of solution, since there is no condition at r = 0 for the oscillatory function,
and so the choice of 8 remains arbitrary (the correct behaviour of the wave
function for large r can be ensured by an appropriate choice of 5, for any value of
e). This indeterminacy can be eliminated by regarding the potential with a singularity (at r = 0) as the limit, when r 0 -»0, of a potential cut off at some r0, i.e.
equal to U(r) for r>r0 and U(r0) for r < r0. With r0 finite, we of course obtain a
definite set of energy levels, but the energy of the ground state tends to -<» as
In the non-relativistic theory, this signifies a "fall" to the centre, since a particle
at a deep level is localized in a small region near r = 0. In the relativistic theory,
such a situation is impossible, since it would imply that the system was unstable
with respect to the spontaneous generation of electron-positron pairs. For, whereas
an energy exceeding 2m is needed to create such a pair in a vacuum, a smaller
energy is sufficient in a field. In the presence of an electron bound state with energy
e < m, a pair can be formed by expending an energy e + m < 2m, the result being a
free positron and an electron in a bound state. If, on the other hand, the bound
state energy is e < - m , such a field can create positrons (with energy - e > m )
spontaneously, without taking energy from an external source. In the field under
consideration, as r 0 ->0, there is an infinity of such "anomalous" levels with
e < - m . Thus fields whose potential 4>(r) increases more rapidly than 1/r as r-»0
cannot be dealt with by means of Dirac's theory. This applies to potentials of either
sign. Although a "fall" can occur, of course, only with attraction, the sign of
U = e<$> depends also on that of the charge, so that electron levels behave
anomalously in one case and positron levels in the other; in the latter case, the field
generates free electrons.
Let us next consider the behaviour of the wave functions at large distances. If
the field U(r) decreases sufficiently rapidly as r -x», it may be entirely neglected in
the equations when determining the asymptotic form of the wave functions at large
distances. When e > m, i.e. in the continuous spectrum, we then return to the
equation of free motion, so that the asymptotic form of the wave functions
(spherical waves) differs from that for a free particle only by the presence of
additional "phase shifts", whose values are determined by the form of the field at
short distances.t These shifts depend on the values of j and /; that is, on the
number K defined above (and also, of course, on the energy e). Denoting the phase
shifts by 6K and using the expresssion (24.7) for a free spherical wave, we can
therefore immediately write down the required asymptotic formula:
. , 2
v
1 /
V(e + m)njim sin (pr-\lir
+ 8K) \
r V(27jV- V(e - m)n j r m sin (pr - \l'ir + 8K))y
{ f)
^'
or, with the definition (35.1),
f)
g)
=
V 2 le ± m sin ,
i, , c .
V
(pr - \ITT + ÔK),
r V e cos
v
(35.8)
'
+
Cf. QM, §33. As in the non-relativistic theory, U(r) must decrease more rapidly than 1/r. The case
U — 1/r will be discussed separately in §36.
§35
Particles in an External Field
132
where p = V ( e 2 - m 2 ) . The common coefficient here corresponds to the normalization of the radial functions by (24.5).
The wave functions of the discrete spectrum (e < m) decrease exponentially as
r-»oo;
/ = - V ^ 4 f S = ^ e x p t - r V ( m 2 - e 2 )],
(35.9)
where A0 is a constant.
As in the non-relativistic theory, the phase shifts SK (more precisely, the
quantities elih« - 1) determine the scattering amplitudes in a given field, as will be
further discussed in §37. We shall not pause to investigate here the analytical
properties of these amplitudes (cf. QM, §128), but merely note than e2l&« again has,
as a function of energy, poles at the points corresponding to the levels for bound
states of the particle. The residue of e216* at such a pole is related in a certain
manner to the coefficient in the asymptotic expression for the corresponding wave
function of the discrete spectrum. This relation may be found by a generalization
of the non-relativistic formula, QM (128.17). The necessary calculations are
entirely similar to those in QM, §128.
Differentiation of equations (35.5) with respect to the energy gives
\
de /
r
de
de
V de )
r
de
de
ö
'
We multiply these two equations by rg and -rf respectively, and the two equations (35.5) by -rg and rf, and add all four term-by-term. After simplification, we
have
We now integrate with respect to r:
o
and take the limit as r-*«. The integral on the right tends to unity, by the
normalization condition. On the left-hand side, we use the fact that in the asymptotic region the functions / and g are related by
133
Motion in a Coulomb Field
§36
which is derived from (35.5) by neglecting terms in U and in Mr. The result is
^_lwmi.r{(m\.]
f + mL
\ de ) J
de
( ,j. I 0 )
= l.
This formula differs only in the coefficient (e + m replacing 2m) from the
corresponding non-relativistic formula (for the function x). There is therefore no
need to repeat the subsequent calculations; the final formula, valid near the point
e = eo (where F 0 is the energy level), is
^ = ( _ . 1 } i J^L J"^>,
(35.11)
e - eo* m + e0
where A0 is the coefficient in the asymptotic expression (35.9).
PROBLEM
Find the limiting form of the wave function for small r in a field U ~ r~\ s < 1.
S O L U T I O N . For a free particle we have, when r is small, f ~~ r\ g — r', so that / > g if J < I', and
f <g if I > /'. We make the assumption (which is confirmed by the result) that this relationship exists
also in the field considered here. If / < i (i.e. I = j - i, K = - 1 - 1), the term in g may be omitted from the
first equation (35.4), so that f ~~ rl as before. The second equation then gives g - r/U, i.e. g — r m _ i =
r'" s . The case I > I' is treated similarly. The result is
for/</\
f~~r\
g-rr~s;
for/>l\
/~r'\
g-rv.
§ 36. Motion in a Coulomb field
We shall begin the study of the properties of the motion in
case of a Coulomb field by considering the behaviour of the
short distances, taking the particular case of an attractive field:
For small r, the terms in e ± m may be omitted in equations
the very important
wave functions at
U = -Za/r.t
(35.5), leaving
(/r)' + f / r - ^ g r = 0,
(gr)'-^gr
+ ^fr
= 0.
The two functions fr and gr appear in an equivalent manner in these equations, and
we therefore seek them as equal powers of r: fr = ary, gr = bry. Substitution gives
a(y +
K)
- bZa = 0,
aZa + b(y -
K)
= 0,
whence
y2=K2-(Za)2.
(36.1)
t In ordinary units, U = - Ze2lr. In relativistic units, e2 is replaced by the dimensionless quantity a.
134
Particles in an External Field
§36
Let (Za) 2 <K 2 . Then 7 is real, and the positive value must be taken: the
corresponding solution either does not diverge at r = 0 or does so less rapidly than
the other. The choice may be justified by considering a potential which is "cut off"
(see §35) at a certain small r0 and then taking the limit r 0 ->0; cf. the analogous
discussion in QM, §35. Thus
Za
}
f = —■— g = constant x r u \
I
\
y+ K
7 - V(K 2 - Z V ) - V[(j + \)2 - Z V].J
(36.2)
Although the wave function may become infinite at r = 0 (if 7 < 1), the integral of
|(//|2 remains finite, of course.
If (Za)2 > K2, both values of 7 given by (36.2) are imaginary. The corresponding
solutions oscillate as r~] cos(|7J log r) when r ^ O , and this again corresponds to a
situation which is inadmissible in the relativistic theory, as already shown above.
Since K2^ 1, this means that a purely Coulomb field can be discussed in Dirac's
theory only if Za < 1, i.e. Z < 137.
Let us now give a qualitative description of the situation which arises when
Z > 137. In order to avoid an indeterminacy in the boundary condition at r = 0, we
must again consider a potential cut off at a distance r0 (I. Ya. Pomeranchuk and Ya.
A. Smorodinskiï, 1945). This has a direct physical significance as well as a formal
one. The charge Z > 137 can in practice be concentrated only into a "superheavy"
nucleus of finite radius. Let us therefore see how the configuration of levels varies
as Z increases for a given r0.
In the Coulomb field when not cut off, the energy e{ of the lowest level tends to
zero when Za = 1, and the e\(Z) curve terminates, the level e{ becoming imaginary
for Za > 1; see (36.10) below. In the cut-off field, for a given r0 ^ 0, the level ex
passes through zero only for some Z a > l . The value ei = 0 has no physical
distinctiveness, and when r0 5* 0 it also has no formal distinctiveness, the e\{Z)
curve not terminating there. When Z increases further, the levels continue to
descend, and at a certain "critical" value Z = Zc(r0) the energy ex reaches the
bottom ( - m ) of the lowest continuum of levels. As explained in §35, this means
that zero energy is required for the creation of a free positron. The critical value Zc
is therefore the maximum charge that the "bare" nucleus can have for a given r0.
When Z>ZC, the level e\<-m,
and the formation of two electron-positron
pairs becomes energetically favourable. The positrons go to infinity, carrying
kinetic energy 2 ( | e 1 | - m ) , and the two electrons occupy the level e\. This gives an
"ion" with an occupied K shell and a charge Zeff = Z - 2 (S. S. Gershteïn and Ya.
B. Zel'dovich, 1969). The system is stable for Z > Zc, up to values of Z for which
the limit -m reaches the next level.t
Lastly, it may be noted that, even for a point charge, the form of the potential at
short distances is affected by radiative corrections, but the resulting corrections to
Zca are only of the order of a.
t For example, if the nuclear charge is uniformly distributed in a sphere with radius ro =
1.2 x 10",2cm, the critical value Zc = 170, and the next level reaches the limit -m when Z = 185 (V. S.
Popov, 1970). A detailed account of the quantitative theory is in the review article by Ya. B. ZePdovich
and V. S. Popov, Soviet Physics Uspekhi 14, 673, 1972.
135
Motion in a Coulomb Field
§36
Let us now turn to the exact solution of the wave equation (C. G. Darwin, 1928;
W. Gordon, 1928).
(a) Discrete spectrum (e < m). We seek the functions / and g in the form
/ = V(m + e)e^p 1 '- , (Q 1 + Q2),
(36.3)
g = -V(m-e)«-V",(Qi-Q2).
with the notation
p = 2Ar,
A=V(m2-e2),
7 = V(K 2 -Z 2 a 2 ).
(36.4)
This is reasonable, since we already know the behaviour of the functions as p-»0
(36.2) and their exponential decrease (~e"?p) as p-*°°. Since, as p->°°, the functions / and g must have the same asymptotic behaviour, we must expect that
Q\ > Qi as p -* 00.
Substitution of (36.3) in (35.4) yields the equations
p(Qx + Q2Ï + (y + K)(Q{ + Q2) - pQ2 + ZaJ^^
(Q, - Q2) = 0,
* fit + e
P(QI - Q2)' + (7 - K)(Q, - Q2) + pQ2 - Za J^ziQi
y m—e
+ Q2) = 0,
where the prime denotes differentiation with respect to p. The sum and difference
of these give
PQ'1 + (T-^)Q, + ( K - ^ ) Q
2
= 0,
(36.5)
PQS+(T +
^ - P ) Q I + ( « + ^ ) Q . - 0 , ]
or, eliminating either Qf or Q2,
pQï + (2y + 1 - p)Qi - (7 - ^ ) Q , = 0,
pQ2' + (27 + l - p ) Q 2 - ( 7 + 1 - ^ ) 0 2 = 0,
where we have used the fact that y2-(Zael\)2
these equations which is finite when p = 0 is
Q, = AF(y-?f-t
= K2-(Zaml\)2.
The solution of
27 + l , p ) ,
(36.6)
Q2 = BF(7 + l - ^ p , 2 7 + l , p ) ,
Particles in an External Field
136
§36
where F(a, ß, 2) is the confluent hypergeometric function. Putting p = 0 in either of
the equations (36.5), we obtain the relation between the constants A and B:
B = - -—~
7- A.
K - Zam/A
(36.7)
The two hypergeometric functions in (36.6) must reduce to polynomials, since
otherwise they would increase as ep when p->°°, and the wave function itself
would increase as e~2P. The function F(a, ß, 2) reduces to a polynomial if a is a
negative integer or zero. We write
7 - Z a e / A = -n r .
(36.8)
If nr = 1 , 2 , . . . , the two hypergeometric functions reduce to polynomials. If nr = 0,
only one of them does so. In that case, 7 = Zaelk, and Zam/A = |K|, as is easily
verified. If K < 0 , the coefficient B (36.7) is zero, so that Q2 = 0 and the necessary
condition is satisfied. If K > 0 , however, then B = - A , and Q2 remains divergent
when nr = 0. Thus the following values are possible for the quantum number nr:
nr = 0 , 1 , 2 , . . .
= 1,2,3,...
for K <0,1
for
K>0.J
(36.9)
The definition (36.8) then yields the following expression for the discrete energy
levels:
H'V^-fz^wJ-
(3610)
In particular, the energy of the ls\ ground level (|K| = 1, nr = 0) is
e1 = m V [ l - ( Z a ) 2 ] .
For Za < 1, the leading terms in the expansion of (36.10) are
s
m
*
(Za) 2 f
2(|K| + n r ) 2 r
(Za)2ri
|K| + nrL|K|
3
11
4(|K| + n r )Jr
On writing nr + |K| = n ( = 1 , 2 , . . . ) and noting that \K\ = j + 5, we return to formula
(34.4), which was previously derived by means of perturbation theory. As already
mentioned at the end of §34, the further terms in this expansion have no
significance, since they are certainly exceeded by the radiative corrections. Formula (36.10) as it stands, however, is meaningful when Za — 1. The double
degeneracy of the levels shown by the approximate formula (34.4) exists also in the
exact formula, which involves only |K|, so that levels with the same j and different /
again coincide.
We have still to determine the common normalization factor A in the wave
function. The wave function of the discrete spectrum must, as usual, be normalized
§36
137
Motion in a Coulomb Field
2
3
by the condition J |i//| d x = 1; the corresponding condition on the functions / and g
is
j(f2 +
g2)r2dr=\.
The value of A is most simply determined from the asymptotic form of the
functions as r-»°c. Using the asymptotic formula
f(-"»2^'.P)-r(^ T 2V+i) ( " p ^
(see QM, (d. 14)), we find
/ - (- 1)"-A V(m + g ^ ^ V + l ) e~Kr(Ury+n'~lComparing this with (36.22) derived below, we can find A. Collecting together the
formulae, we can write out in full the final expressions for the normalized wave
functions:
f2 =
h 8
±(2Af2r
(m±s)r(27 + nr+l)
V
F(2y + \)[4m(Zaml\)(ZamlK
- K) • nr\\
x
{{^Y1
-
KW-M,,
K
}
{
2y + 1, 2Ar) + n r F(l - nr, 2y + 1, 2Ar)j,
(36.11)
where the upper signs refer to / and the lower signs to g.
(b) Continuous spectrum (e > m). There is no need to solve the wave equation
afresh for the states of the continuous spectrum. The wave functions for this case
are obtained from those of the discrete spectrum by the substitutions!
V(m - e ) - > - i V ( e - m),
A-*--îp,
- n r - » y - iZaelp;
(36.12)
see QM, §128 concerning the choice of sign in the analytical continuation of the
square root V(m - e). The normalization of the functions must, however, be done
again.
Making these substitutions in (36.11), we may write
gg = i V ( e - m ) J
x [e*F{y - iv, 2y + 1, -2ipr) + e~liF(y + \-iv,2y+
+ In the rest of this section, p denotes |p| = V ( e 2 - m~).
1, -2ipr)],
138
Particles in an External Field
§36
where A' is another normalization constant,
v = Zae/p,
e'2* =
y
7lv. ;
K-ivmle
(36.13)
the value of £ is real, since y2 + (Zaelp)2 = K2 + (Zamlp)2.
According to the formula
F(a,ß,z) = e 2 F(ß-a,ß,--z)
(see QM, (d. 10)), we have
F(y + 1 - ii/, 2y + 1, -2ipr) = e"2lprF(y + iv, 2y + 1, 2ipr)
= e"2iprF*(y - iV, 2y + 1, -2ipr).
Hence
/, g = 2iA'V(s ± m)(2pr)y] im, re{ei(pr+É)F(y - iV, 2y + 1, -2ipr)}.
(36.14)
The normalization coefficient A' is found by comparing the asymptotic expression
for this function with the general formula (35.7) for a normalized spherical wave. The
resulting expression for the wave functions of the continuous spectrum (which we
shall afterwards verify) ist
f e
7 g
'
= 2Z3 / \ / « ± £ , *e J r ( y + l + fr)|(2prr
V E
r(27+D
r
x im, re{eiipr+i)F(y - iv, 2y + 1, -2ipr)}.
(36.15)
The asymptotic expression for this function is derived by means of QM (d. 14),
where only the first term is now significant, the second decreasing as a higher
power of 1/r:
V2 le ±m
l
/, g = — y
sin, cos(pr 4- SK + v log 2pr -1 lu),
(36.16)
where
5K = £ - arg V(y + \ + h)-\iry
+ \ITT,
(36.17)
or
eMm
_K-ivmler(y+'\-iv)
y- w
I(Y
e,nU-y)
(36>,g)
+ 1 + 11/)
For future reference, we shall give an expression for the phase in the ultrat The wave functions for a repulsive field are obtained by changing the sign of Za, i.e. that of v.
§36
Motion in a Coulomb Field
139
relativistic case (e > m, v = Za):
y - IZCL T(y+
1+
iZà)
The expression (36.16) differs from (35.8) only by the logarithmic term in the
argument of the trigonometric function. As in Schrödinger's equation, the slowness
of the decrease of the Coulomb potential affects the phase of the wave, which
becomes a slowly varying function of r.
Analytical continuation into the region e < m gives, in place of (36.18),
€
K - Zam/A T(y + 1 - Zae/A) lV(/-y)
y - Zae/A T(y + 1 + Zae/A) *
"
2I6K
nf% 7 m
P
'
This expression has poles at the points where y + 1 - Zae/A = 1 - nr, nT = 1 , 2 , . . .
(poles of the gamma function in the numerator), and also at the point y - Zae =
- n r = 0 (if also K < 0); these points coincide with the discrete energy levels, as they
should.
Near any of the poles with n r ^ 0, we have
e "«
T ^ , « ,—r— T(y + 1 - Zae/A).
w
n r r(2y + 1 + nr)
The form of the gamma function near its pole is found by means of the familiar
formula r(z)r(l - z) = 7r/sin ÎTZ:
r(
V
+
i _ ^ i \ %
IL
A /
r(nr)sin7r(7+l-Zae/Ar
sin 7r(y + 1 - Zae/A) « IT cos rrnr • T " ( ~ r - ) * (e - e0)
= (-ir(7rZam 2 /A 3 )(e~e 0 ),
where e0 is the energy level. Thus we havet
*2,7>_/
(Zam/A-K) A3
1
n r !r(27 + 1 + nr) Zam e - e0
l V ^g"
iïï7
n*?n
At the end of §35 a formula (35.11) was derived which relates the residue of the
function em* at its pole to the coefficient in the asymptotic expression for the wave
function of the corresponding bound state. For a Coulomb field, however, the
formula (35.10) must be slightly modified, because the constant phase shift SK in
(35.7) is replaced in (36.16) by the sum bK + v\o%2pr. We must therefore replace
em« on the left-hand side of (35.11) by
exp(2i6K + 2iV log2pr)^e 2l ^(2iAr) 2(n ^ 7) .
+ This formula is easily seen to be valid even if nr = 0.
140
Particles in an External Field
§37
Using (36.21) and determining from (35.11) the coefficient A0 (which will now be a
power function of r), we find the asymptotic form of the normalized wave function
of the discrete spectrum:
This has already been used to determine the coefficient in (36.11).
§ 37. Scattering in a centrally symmetric field
The asymptotic expression for the wave function which describes the scattering
of particles in the field of a fixed centre of force may be writtent
i// =
Mepe
ipz
+ w;pÇ.
(37.1)
Here uep is the bispinor amplitude of the incident plane wave. The bispinor u'ep is a
function of the direction of scattering n', and for any given value of n' its form (but
not, of course, its normalization) is the same as that of the bispinor amplitude of
the plane wave propagated in the direction n\
We have seen in §24 that the bispinor amplitude of the plane wave is entirely
determined by specifying a two-component quantity, the three-dimensional spinor w,
which is the non-relativistic wave function in the rest frame of the particle. The flux
density is expressed in terms of the same spinor, and is proportional to w*w, with a
proportionality coefficient which depends only on the energy e and is therefore the
same for the incident and scattered particles. The scattering cross-section is
da = (w'*w7w*w) do or, if as in §24 the incident wave is normalized by the condition
w*w = 1,
da = w'*w' do.
We define the scattering operator / by
w' = fw.
(37.2)
Since the quantities H>, W' have two components, the operator thus defined is
exactly analogous to the operator scattering amplitude which appears in the
non-relativistic scattering theory taking account of spin (QM, §140). We can
therefore apply immediately the formulae derived there which express the operator
in terms of the phase shifts of the wave functions in the scattering field. It is only
necessary to transform these phase shifts by expressing 8f and 8i from QM, §140,
in terms of the phase shift 8K which appears in the relativistic formula (35.7). The
phases St and 8i referred to states with orbital angular momentum f and total
t In §§37 and 38 p denotes |p|, and e and p will be written separately as suffixes to the amplitude.
141
Scattering in a Centrally Symmetric Field
§37
angular momentum j = I + 2 and j = / -*• According to the definition (35.3), K =
- / - 1 for j = / + 2 and K = / for j = / - \. We must therefore make the changes
8|+->S_(i+i),
Sf-*S/
and remember that the suffix to 8 now represents the value of K. Thus we find
f=A+
A==
| - 2 [(I + IK«
2ip 1=0
B
=Uy
2iS
(37.3)
Bv<r,
— - l ) + /(e 2 i s <-l)]P,(cos0),
- e2*>)P\(cos 0),
(37.4)
(37.5)
where v is a unit vector in the direction of n x n'.
Since w is the spinor wave function in the rest frame, the polarization properties of the scattering are given in terms of / by the same formulae as in QM, §140.
For a Coulomb field, it is possible to express both functions A(0) and B(0) in
terms of one function. The calculation is briefly as follows.t
In a Coulomb field, the phases 5K are given by (36.18), which we, write in the
form
Ze22m\
my
.Ze
P
/
T(y - iv)
f(y + 1 + iv)
K
r
(37.6)
,M|K|-7).
_ _ein* w n e n K < 0 u s j n g t h e quantities thus defined
we can put the series (37.4), (37.5) in the form
eM = ei*K w h e n K > 0
a n d eM
A(0) = ^ G ( 0 ) - i ^ ^ F ( 0 ) ,
(37.7)
B(6) =--tanie
p
• G(0) + ^ - ^2 c o t | 0 • F(0),
p
where
G(6) = ii g l2Q(Pt + ?,_,),
F(6) = \i g IQiPi - P,_,).
(37.8)
In transforming the series B(0), we have used the following recurrence relations
between Legendre polynomials:
P i + ?,*_, = - c o t i f l - K P i - P , - , ) .
P i - P !_, = tan i 0 • l(P, + Pi-,)t R. L. Gluckstern and S. R. Lin, Journal of Mathematical Physics 5, 1594, 1964.
(37.9)
(37.10)
142
§38
Particles in an External Field
According to the identity
- [P/(cos 0) - P/_,(cos 0)] = /[P,(cos 0) + P/_,(cos 0)],
(1 + cos 0) -j
a cos 0
(37.11)
the functions F(0) and G(0) are related by
G
= (1 - cosfl) d£ê=- cotJe f-
<3712)
Thus A(0) and B(0) are expressed in terms of the single function F(0).t
§ 38. Scattering in the ultra-reiativistic case
We shall now discuss separately the scattering in the ultra-relativistic case
(e > m). In the first approximation, we neglect altogether the mass m in the wave
equation. It is convenient to use for if/ the spinor representation
H<>
since the equations for £ and t) are separable when m = 0:
-îcr-V£ = ( e - l / ) £
j
— i'cr • Vr) = — (e - L/)rj J
(38.1)
(the "neutrino" form, §30).
A helicity state of an electron polarized in the direction of p corresponds to
a wave function \fß = (^ j, and for polarization opposite to p we have <>
/ = ( J.
Since the equations for £ and TJ are separable, it is evident that this property is
unaffected by scattering. Thus helicity is conserved in the scattering of ultrarelativistic electrons. From considerations of symmetry (longitudinal polarization)
ft is obvious that there is no azimuthal asymmetry in the scattering of helical
(longitudinally polarized) particles. We can also say that the scattering crosssection of helical electrons is independent of the sign ôf the helicity; this follows
because a central field is invariant under inversion, while the sign of the helicity is
reversed.
In the ultra-relativistic case, formulae (37.3H37.5) may be considerably simplified (D. R. Yennie, D. G. Ravenhall and R. N. Wilson, 1954).
t The function F(0) cannot be expressed in a closed form in terms of the elementary functions, but
it can be written as a certain double integral; see the paper cited in the last footnote.
§38
Scattering in the Ultra-relativistic
Case
143
Let the incident electron be polarized, say in the direction of motion n. For a
plane wave with a definite value of n • o\ the spinor £ (=(</>+ x)/V2) is proportional to the same three-dimensional spinor w as appeared in the standard
representation of the wave. The relation between the spinor amplitudes of the
incident and scattered waves in the new representation is therefore given by the
same operator /.
As a result of the scattering, the polarization is rotated with the momentum to
the direction n'. The effect of the operator / on the spin wave function of the
electron therefore reduces to a rotation of the spin through the angle 0 (between n
and n) about the axis v. This rotation is itself equivalent to a rotation of the
coordinates about that axis but in the opposite direction, i.e. through an angle - 0 .
Hence it follows that the operator / must be the same (apart from a factor) as the
operator which transforms the wave function when the coordinates are changed in
the way described, i.e. the operator (18.17) with - 0 instead of 0. A comparison of
(37.3) with (18.17) shows that
B/A = - i tan 50.
(38.2)
Thus, in the ultra-relativistic limit,
/ = A(0)[1 - i tan \ 0 • v • cr].
(38.3)
The expression (37.4) for A(0) can also be simplified if a relation between 8K
and S-« which exists in the ultra-relativistic limit is used. To derive this relation, we
note that, when the terms in m are omitted, the equations (35.4) for the functions /
and g become invariant with respect to the changes
/-*g,
K-»-K,
g-»-/,
which do not affect the parameters of the particle or field itself. We must therefore
have fJgK = -g~Jf-K, and substitution of the asymptotic expressions gives
tan(pr - \lir + 8K) = -cot(pr - \VTT + Ô_K),
whence
e2iSK
=
e2i6.K
( 3 g
4 )
From this relation, and replacing the summation variable I by / - 1 in the first term
of the sum in (37.4), we find
A(0) = 2 ^ g K*2'5' ~ l)][Pi(cos 0) + Pi-i(cos • ) ] .
(38.5)
From (38.2) it follows that re(AB*) = 0. Hence, in the approximation considered, the cross-section is independent of the initial polarization of the particles,
144
Particles in an External Field
§39
and an unpolarized beam remains unpolarized after scattering (see QM, (140.8)(140.10)). We may also note that, when 0-*7r, the expression (38.5) for A(0) tends
to zero as (TT - 0)2 (since P/(— 1) = (-1)'). The cross-section
^ = | A\2 + \B\2 = | A(0)|2/cos2 \6
(38.6)
therefore tends to zero also. These properties do not occur, of course, in higher
approximations with respect to the small quantity m/e. In particular, analysis shows
that as 0->7T the cross-section tends to a limit proportional to (m/e)2.
For a Coulomb field in the ultra-relativistic case, the phases ÔK are independent
of the energy, as is seen from (36.19),t Hence, in a purely Coulomb field, the
scattering cross-section for e > m has the form
da = ^
e
(38.7)
do,
where T is a function of the angle only.
§ 39. The continuous-spectrum wave functions for
scattering in a Coulomb field
In later sections (§§95, 96) we shall consider various inelastic processes which
occur when ultra-relativistic electrons are scattered in the field of a heavy nucleus
(Za ~~ I). To calculate the relevant matrix elements, we need wave functions whose
asymptotic form (as r -► <») is the sum of a plane wave and a spherical wave.
We shall see that, in the ultra-relativistic case (electron energy e > m), the most
significant values of the momentum transfer from electron to nucleus in scattering
are q = | p ' - p | ~ m. These values of q correspond to impact parameters p — Ilq —
1/m, the electron being deflected through anglest
(39.1)
e-qlp-mle.
In terms of the coordinates r (distance from the centre) and i~r cos 0, this
represents the region
ps
r
sin 0 - 1/m,
p(r - z) = pr(l - cos 0) - 1,
(39.2)
and r — e/m2, so that the distances concerned are large.
We write Dirac's equation in the form
(e - U - mß + ia • V)^ = 0,
U = -Za/r,
(39.3)
and transform it into a second-order equation by applying the operator e - U +
t This is also evident directly from equations (38.1), since for a Coulomb field the energy e may be
eliminated from the equations by the substitution r->r le.
t In this section, p denotes |p|.
§39
The Continuous-spectrum
Wave Functions
145
mß - ia • V:
(A + p 2 - 2el/)t// = ( - i a • V17 - l72)«/f.
(39.4)
Since r §> Zale in the region considered, U < e. As a first approximation, the
right-hand side of (39..4) may be neglected. The remaining equation,
( A + p 2 + 2eZalr)*l/ = 0,
(39.5)
is of the same form as the non-relativistic Schrödinger's equation in a Coulomb
field:
(2kA+£r+v>=°-
(39 5a
- >
differing only in an obvious change in the notation for the parameters (the
''potential energy" containing an extra factor elm). We can therefore write down
immediately the solution which has the required asymptotic form (see QM, §136).
For example, the wave function which asymptotically comprises a plane wave
( <* e ,p r) and an outgoing spherical wave is
C =
e"Za€l2pr(l-iZaelp),
(39.6)
where F is the confluent hypergeometric function and u£p the constant bispinor
amplitude of the plane wave, normalized by the condition stated earlier (23.4):
WepWep = 2m.
(39.7)
The wave function (39.6) is normalized in such a way that the plane wave in its
asymptotic limit has the usual form,
■!OL
V(2e)
corresponding to "one particle in unit volume". Since p % e in the ultra-relativistic
case, we can write Zaelp « Za in (39.6):
*(«V=C^^el>"rF(îZaflfî(pr-p-r)),
C = ez"ßr(l-iZa).
(39.8)
It should be noted that, although we are considering distances so large that
pr> 1, the hypergeometric function in (39.8) cannot be replaced by its asymptotic
form: the argument of F is not pr but pr(l - c o s 0), which is not assumed large.t
t In QM, §135, we were concerned with arbitrarily large r, and this approximation was therefore
allowable for all values of 0.
146
Particles in an External Field
§39
In applications, the next approximation for i// is also needed, which has a spinor
structure different from (39.8) (the latter reducing to the factor uep). To calculate
this approximation, we write i// in the form
</> =
V(27) e
ip r
(u f p F + 4>).
On the right-hand side of (39.4) we now retain the term linear in [/, obtaining for </>
the equation
(A + 2/p • V - 2eU)<j> = ~iuepa -VU.
(39.9)
The solution of this may be found by noticing that the function F satisfies the
equation
(A + 2 i p - V - 2 e l T ) F = 0,
as may be seen by substituting (39.6) in (39.5). Applying the operator V to this
equation, we obtain
(A + 2ip • V - 2eU)VF = 2eFVU.
A comparison with (39.9) shows that
4> = - ^ ( a V ) u f p F
le
The final expressions for i//+) and for a similar function i// () whose asymptotic
form contains an ingoing spherical wave are
*" = V § 7 j e'? "{l ~ 2 7 a ' V ) F < i Z a ' l> *<pr-p • r»"M>,
^ ) = v ^ e , p r ( l " 2 7 a ' v ) F ( ~ ' Z a ' 1 ' " l ' ( p r + p,r))M£P'
(39.10)
C = enZal2T(l - iZa)
(W. H. Furry, 1934). We shall also write out the corresponding functions («/*-£,-p)
with "negative frequency", which are needed when dealing with processes which
involve positrons. These can be derived from the functions *\it9 by the substitutions
p - * - p , e - > - £ , with p = |p| unchanged; the parameter iZa of the hypergeometric
function therefore changes sign, as will be seen from the original expression (39.6),
where this parameter occurs in the form iZaelp. Thus we have
*-^» Ä v^ e "''' r ( l+ è a ' v ) F(aa,1, " l ^ r " p " ^))u ■••" F,
C = g-*Zo/2r(l + iZa).
(39.11)
§39
The Continuous-spectrum
Wave Functions
147
The following comment is necessary regarding the above calculations. Our
asymptotic condition is not in itself sufficient to provide a unique choice of the
solution of the wave equation; this is clear, since we can always add to *p any
outgoing Coulomb spherical wave without violating the condition. By writing the
solution of equation (39.5) in the form (39.6), we have tacitly presupposed the
choice of a solution finite at r = 0. This requirement was necessary in QM, §§135,
136, where we were considering solutions; valid in all space, of the exact
Schrödinger's equation.t In the present case, however, equation (39.5) applies only
to large distances, and therefore the choice of solution demands further
justification.
This is provided by the fact that large impact parameters p = r sin 0 correspond
to large orbital angular momenta / and small scattering angles 0: when p ~~ 1/m, we
have
{ — pp — pe — elm > 1,
and the angle 0 may be estimated by a quasi-classical procedure:
p J dr
p
e
Thus, in the expansion of if/ in terms of spherical waves the main contribution (in
this range of r and 0) will come from waves with these large values of I. But a
spherical wave with large / will certainly decrease to small values at distances from
the origin r < lie which are "classically inaccessible" (because of the centrifugal
barrier). Hence, if we "join" the solution of equation (39.5) to that of the exact
equation (39.4) at short distances r ~ ru where lie > rx > Zale, then the boundary
condition for the solution of equation (39.5) will be that it is small, and this justifies
our choice.
PROBLEM
For an attractive Coulomb field with Za<l, find the correction (of relative order Za) to the
non-relativistic wave function of the discrete spectrum.
SOLUTION. The electron velocity in a bound state is v ~~ Za, and therefore, for Za <£ 1, the wave
function is non-relativistic in the zero-order approximation, i.e.
<A =
M^non-r,
where i/fn0n-r is the Schrödinger function and u a bispinor of the form
with w a spinor describing the polarization state of the electron. In the next approximation, we write
<A = M^nonr+ i/^(1) and, substituting this in (39.4), obtain for i/r(l) the equation
(s*-w+TK-'ë('J)-<«*t In the method of solution given in QM, §135, this condition was satisfied by taking the particular
integral in the form (135.1) instead of as a general sum of integrals with different values of ß\ and 02.
148
§40
Particles in an External Field
where en is the non-relativistic discrete energy level. Here we have omitted terms of relative order
~~(Za)2; in the non-relativistic case, the important distances are of the order of the Bohr radius,
r - 1/mZa. The solution of this equation is ip = - (i/2m)otu • VtKonr, and therefore
§40. An electron in the field of an electromagnetic plane
Wave
Dirac's equation can be solved exactly for an electron moving in the field of an
electromagnetic plane wave (D. M. Volkov, 1937).
The field of a plane wave with wave 4-vector k (fr = 0) depends on the
4-coordinates only in the combination <f> = kx, so that the 4-potential is
A» = A"(<f>),
(40.1)
and satisfies the Lorentz gauge condition
d„AM = k^A»' = 0,
the prime denoting differentiation with respect to #. Since the constant term in A is
unimportant, we can omit the prime, writing the condition as
kA = 0.
(40.2)
We start from the second-order equation (32.6), in which the field tensor is
F^ = kßA'v - KA».
(40.3)
When expanding the square (id - eA)2 it must be remembered that, from (40.2),
d^(A^) = A^i/r. The result is
[-d2 - 2ie(Ad) + e2A2 - m2 - ie(yk)(yA')]if* = 0,
(40.4)
where d 2 = d^M.
We seek a solution of this equation in the form
* =e " ^ ) ,
(40.5)
where p is a constant 4-vector. This form of the function i// is unaltered by adding
to p any constant multiple of the vector k, if the function F(<f>) is appropriately
redefined. We can therefore, without loss of generality, impose one further condition on p. Let
p2 = m2.
(40.6)
Then, when the field is removed, the quantum numbers p* become the components
§40
An Electron in the Field of an Electromagnetic Plane Wave
149
of the free particle 4-momentum. The significance of the components of the
4-vector p, when the field is present, is more clearly seen in a particular frame of
reference chosen so that A0 = 0. Let the vector A in this frame be along the x'-axis
and k along the x3-axis; the electric field of the wave is then along x'3 the magnetic
field along x2, and the wave itself is propagated along x3. Then (40.5) will be an
eigenfunction of the operators
p, = I
dP
P2 =
W'
p
°- p 3 = , läP-äP>
with eigenvalues pu Pi, P0-P3; the operators themselves are easily seen to
commute with the Hamiltonian of Dirac's equation. Thus, in this frame of
reference, p1 and p2 are the components of the generalized momentum along the x1
and x2 axes; p°-p3 is the difference between the total energy and the x3component of the generalized momentum.
In substituting (40.5) in (40.4), we note that
d»F = k>lF',
d ^ F = k2F" = 0,
and obtain for F(<f>) the equation
2i{kp)F' + [~2e(pA) + e2A2 - ie(yk)(yA')]F = 0.
The integral of this equation is
0
where u/V(2p0) is an arbitrary constant bispinor; the reason for writing it in this
form will be shown below.
All powers of (yk)(yA) above the first are zero, since
(jk)(yA)(yk)(yA) = -(yk)(yk)(yA)(yA) + 2(kA)(yk)(yA) - -k2A2 = 0.
We can therefore write
so that 4ß becomes
* - [ I + «SS(*KH?S5*"
(407)
150
Particles in an External Field
§40
wheret
kx
^-''-ly^M-m*]**0
(40 8)
-
To determine the conditions to be imposed on the constant bispinor w, we must
suppose that the wave is "switched on" with infinite slowness, starting from
t = -<*. Then A-»0 when /oc->-oc, and ip must become the solution of the free
Dirac's equation. Consequently, u = u(p) must satisfy
(yp-m)u=0.
(40.9)
This condition rejects the "redundant" solutions of the second-order equation.
Since u is independent of time, the condition remains valid for finite kx. Thus u(p)
is the same as the bispinor amplitude of the free plane wave; we shall take it to be
normalized by the same condition (23.4): üu = 2m.
The foregoing arguments also show immediately the normalization of the wave
functions (40.7). The infinitely slow application of the field does not alter the
normalization integral. Hence it follows that the functions (40.7) satisfy the same
normalization condition,
| i/r* ^ d3x = J </yy V P d3x - (2ir)3S(p' - p),
(40.10)
as the free plane waves.
Let us calculate the current density corresponding to the functions (40.7), first
noting that
and hence obtaining by direct multiplication
r = *V*. = £{p' -M>
+ k
- ( ^ ) - g l ) } .
(40..1,
If the A*(<f>) are periodic functions, and their time-average value is zero, the mean
value of the current density is
^4(p"-2é)7k")-
<4012>
-
We can also find the kinetic momentum density in the state ifjp. The kinetic
t This S is the same as the classical action for a particle moving in the field of a wave; cf. Fields,
§47, Problem 2.
§41
151
Motion of Spin in an External Field
momentum operator is the difference p -eA = id - eA. A direct calculation gives
0*0^ - eA^P
= ^Py0(p" ~ eA'Wp
-p
eA +k y(kp)
2(kp)) +
K
8(kp)p0
kA
h
(40.13)
The time-average value of this 4-vector, denoted by q*\ is
«'- p '-|£ k '
(40 ,4)
'
Its square is
q2=m\,
m^m^l-^Ä
5
),
(40.15)
where m* acts as an "effective mass" of the electron in the field. A comparison of
(40.14) with (40.12) shows that
F = <r/Po.
(40.16)
The normalization condition (40.10), expressed in terms of the vector q, is
f rP tpdh = (27r) 3 ^ô(q'-q);
J
Po
(40.17)
this is most simply proved in the particular frame of reference mentioned above.
§41. Motion of spin in an external field
The quasi-classical approximation in Dirac's equation is reached in the same
way as in the non-relativistic theory. In the second-order equation (32.7a) we
substitutet
where S is a scalar and u a slowly varying bispinor. The usual condition of the
quasi-classical case is assumed to be satisfied: the momentum of the particle must
vary only slightly over distances of the order of the wavelength ft/|p|.
In the zero-order approximation with respect to h, we obtain the usual classical relativistic Hamilton-Jacobi equation for the action S. All the terms which
contain the spin (and are proportional to ft) are absent from the equations of
motion. The spin would appear only in the next approximation with respect to ft.
t Ordinary units will be used at first.
152
Particles in an External Field
§41
Thus the influence of the magnetic moment of the electron on its motion is always
of the same order of magnitude as the quantum corrections. This is to be expected,
since the spin is a purely quantum property and its magnitude is proportional to h.
We can therefore reasonably formulate the question of how the electron spin
will behave when the electron is executing a given quasi-classical motion in an
external field. The answer to this question is contained in the next approximation
with respect to h in Dirac's equation. We shall, however, use another method
whose significance is more obvious and which does not directly involve Dirac's
equation. It has the advantage of allowing a treatment of the motion of any particle,
including a particle which has an "anomalous" gyromagnetic ratio not describable
by Dirac's equation.
The objective is to derive an "equation of motion" for the spin when the
particle moves in any (given) manner. Let us first take the non-relativistic case.
The non-relativistic Hamiltonian of a particle in an external field is
H = H'-/x(T-H,
(41.1)
where H' includes all terms independent of the spin (see QM, §-111), and n is the
magnetic moment of the particle. This form of the Hamiltonian relates to any kind
of particle. For electrons, /x = ehllmc (the electron charge being e = - | e | ) , a n ^ f° r
nucléons /x also contains the "anomalous" partt
li' = M - ehllmc.
(41.2)
According to the general rules of quantum mechanics, the operator equation of
motion for the spin is obtained from the formula
l = ^(Hs-sH) = 2^(Ha~aH).
(41.3)
Substitution of (41.1) gives
or
l =^sxH.
(41.4)
We average this operator equation over the state of the quasi-classical wave
packet moving in a given path. This is equivalent to replacing the spin operator by
t When radiative corrections are taken into account the magnetic moment of the electron also
contains a very small "anomalous" part.
§41
Motion of Spin in an External Field
153
its mean value s, and the vector H by the function H(t), which represents the
change in the magnetic field at the position of the particle (or wave packet) as the
latter moves along its path. In the non-relativistic approximation (i.e. in terms of
Pauli's equation), s = 2<r is the spin operator of the particle in its rest frame, whose
mean value was denoted in §29 by 2Ç. Thus we obtain the equation
f
=^xH(().
(41.5)
This form of the equation is, in essence purely classical. It signifies that the
magnetic moment vector precesses about the direction of the field with angular
velocity -2/xH/fi, remaining constant in magnitude.!
Again in the non-relativistic case, the velocity v of the particle varies in
accordance with the equation
d\ldt = evxH/mc,
i.e. the vector v rotates about the direction of H with angular velocity -eHlmc. If
/LI' = 0, then ix = ehllmc, and this angular velocity is the same as the angular
velocity -2/mH/fi with which the vector Ç rotates; thus the polarization vector is at
a constant angle to the direction of motion. We shall see below that this result
remains valid in the relativistic case.
Let us now proceed to the relativistic generalization of equation (41.5). For a
covariant description of the polarization it is necessary to use the 4-vector a
defined in §29, and the equation of motion of the spin will determine its derivative
with respect to the proper time r.t
The form of this equation is given by considerations of relativistic invariance:
its right-hand side must be linear and homogeneous in the electromagnetic field
tensor FM" and in the 4-vector a^, and apart from these can include only the
4-velocity uM = p*7m. The only form of equation satisfying these conditions is
da^ldr = aF»vav + ßu»FvKuuaK,
where a and ß are constant coefficients. It
a^u* = 0 and the antisymmetry of the tensor
other expressions with the required properties
As u-*0, this equation must become the
uß = (1, 0), T = f, we have
(41.6)
is easily seen, from the condition
F*v (whence FßPu^uv = 0), that no
can be constructed.
same as (41.5). Putting a*1 = (0,£),
dildt = a £ x H .
A comparison with (41.5) shows that a = 2/UL.
t The classical equation (41.5) can be derived directly from the equation
dMldt = j i x H ,
where M is the angular momentum and |i the magnetic moment of the system; p. x H is the torque acting
on the system. Putting M = 2AC, ft = (M/2S)£ = M£> W^ have (41.5).
t From here onwards we again take c = 1, ft = 1.
§41
Particles in an External Field
154
To determine /3, we use the fact that a*u^ = 0. Differentiating this with respect
to T and using the classical equation of motion of a charge in a field,
mdu»ldT = eF»vuv,
(see Fields, §23), we obtain
^
da*
dr
M
du*
dr
e _
^ m
e _
m
*
Hence, on multiplying both sides of equation (41.6) by u^, using the equation
ußu* = 1 and cancelling the common factor F^u^a» we have
"- 2 ("-5sr)— 2 <*'
Thus the final relativistic equation of motion for the spin is
da*
^
= 2^iF*vav - 2fjL'u*FvKuvak
(41.7)
(V. Bargmann, L. Michel and V. L. Telegdi, 1959).t
We can change from the 4-vector a to the quantity Ç which directly represents
the polarization of the particle in its "instantaneous" rest frame. The relation
between a and Ç is given by formulae (29.7)-(29.9). First of all, from (41.7) we
necessarily have a^da*jdr = 0, and therefore a^a* = constant. Since a^a* = -Ç 2 ,
this is equivalent to the obvious result that the polarization Ç of the particle remains
unchanged in magnitude during its motion.
The equation which shows the change in direction of the polarization is
obtained by using three-dimensional notation in (41.7). The space components of
this equation are, in explicit form,
da 2/xm
„ 2/mm
2/x'e , _
2jx'e ,
w
- j - = —— a x H + —— (a • v)E
— - — v(a • E) + —— v(v • a x H) +
at
e
e
m
m
+ ^-iv(a-v)(vE).
Here we must substitute (29.9), using in the differentiation the equations p = ev,
g2 = p 2 +m 2 , and the equations of motion
dp/dt = eE + e v x H ,
deldt = ev • E.
(41.8)
A lengthy but elementary calculation leads to the result^
t This equation was first derived, in another form, by Ya. I. Frenkel' (1926).
t If the gyromagnetic ratio (Lande factor) g is used (as is often done) for charged particles, with
PL = g(el2m) • \ (= g(e/2mc) • \h)y this equation becomes
f-^-2
+2
7H > < H + à ( «- 2) 7f^<''• H >'^ + ^(^7W> ><(E><v, • »'*>
155
Motion of Spin in an External Field
§41
^J^m^^-m)
dt
e
*
Vj
e+m
'
»
2^ +2^
e+m
'
Ex
(41.9)
The variation of the direction of polarization relative to the direction of motion
is of more interest than the variation of its absolute position in space. We write
S-DÉj + Çx,
(41.10)
where n = v/u, and derive the equation for the component £| of the polarization in
the direction of motion. A calculation using (41.8), (41.9) leads to the resultt
f
=2rt-Hx„
+
f(^-M')!;1.E.
(41.11)
The problems at the end of this section include a number of examples of the
application of the above formulae. Here it may be noted that, in motion in a purely
magnetic field, the polarization of a particle having no anomalous magnetic moment
is at a constant angle to the velocity (£j = constant). Thus this result, already
mentioned previously for the non-relativistic case, is in fact a general one.
The conditions for the above formulae to be applicable can be stated more
precisely. The requirement specified initially, that the momentum of the particle
should vary sufficiently slowly, is equivalent to a certain condition that the fields E
and H should be small; in particular, the Larmor radius in the magnetic field
(—pleH) must be large compared with the wavelength of the particle. There is also,
however, another condition which must, strictly speaking, be fulfilled: the fields
must not vary too rapidly in space, and must vary only slightly within the
dimensions of the quasi-classical wave packet. That is, the field must vary only
slightly over distances of the order of the particle wavelength (1/p) and of the
Compton wavelength (l/m).t
In practical problems of motion in macroscopic fields, however, the condition of
slow variation is certainly satisfied, and only the condition of smallness remains.
In §33 we have derived the first relativistic corrections for the Hamiltonian of
an electron moving in an external field. For an electron in an electric field the
approximate Hamiltonian is (see (33.12))
H = H ' - ^ a - E x p/m,
p = - iV,
(41.12)
where H includes the terms which do not contain the spin. In our case, since the
t This equation can be obtained a little more directly by writing explicitly the time component of
equation (41.7).
t The latter requirement arises from the condition that the spread of velocities in the wave packet, in
its rest frame, must be small compared with c, since otherwise the non-relativistic formulae could not be
applied in this frame.
If the field varies too rapidly, the equations may contain significant additional terms in the
derivatives of the field with respect to the coordinates.
156
Particles in an External Field
§41
field varies slowly, we neglect the term in H ' which involves derivatives of E (i.e.
the term in div E); the small term in p4 may also be omitted, since it is unrelated to
the field effects in question here. Thus H \ in the absence of a magnetic field,
reduces to the non-relativistic Hamiltonian H' = p2/2m + eO.
Formula (41.12) can also be derived from (41.9) without making direct use of
Dirac's equation. This method will generalize it (in the quasi-classical case) to
particles with anomalous magnetic moment.
The equation of motion of the spin in an electric field, as far as first-order terms
in the velocity v, is obtained from (41.9) as
|
= (M + ^ x ( E x v ) = ^ + 2 ^ x ( E x v ) .
If we impose the condition that this equation should be derived quantummechanically by commuting the spin operator with the Hamiltonian (as in (4L3)),
then it is easily seen that we must put
U = H'-(,L'
+^ y
• E x p/m.
(41.13)
This is the required expression. If /LI' = 0, we again obtain (41.12). It should be
noted that the "normal" magnetic moment ellm is multiplied by an extra factor \ in
comparison with the anomalous moment jx/.t
PROBLEMS
PROBLEM 1. Determine the change of the direction of polarization of a particle when it moves in a
plane perpendicular to a uniform magnetic field (vlH).
SOLUTION. The right-hand side of equation (41.9) is reduced to its first term, and the vector Ç
therefore precesses about the direction of H (the z-axis) with angular velocity
2fxm +2jx'(e - rn)
e
H= -(f
+
2"')H.
The projection of £ on the xy-plane (denoted by CO rotates in that plane with the same angular velocity.
The vector v rotates in that plane with angular velocity -eVLIe, as can be seen from the equation of
motion p = ev = e\ x H. Hence £i rotates with angular velocity -2jx'H relative to the direction of v.
PROBLEM 2. The same as Problem 1, but for motion parallel to the magnetic field.
SOLUTION. When v and H are in the same direction, equation (41.9) reduces to
dt
e
so that Ç precesses about the common direction of v and H with angular velocity -2^mH/e.
PROBLEM 3. The same as Problem 1, but for motion in a uniform electric field.
SOLUTION. Let the field E be along the x-axis, and let the motion be in the xy-plane (with
py = constant). According to (41.9), the vector £ precesses about the z-axis with instantaneous angular
t This is the "Thomas half" mentioned in the last footnote to §33. Its origin is clearly shown by
the derivation given here.
157
Neutron Scattering in an Electric Field
§42
velocity
\e + m
W
e
We again resolve Ç into components Ci (in the xy-plane) and f:. Then
{it = £i cos <£,
^ • E = - f i sin (i> • uy/i\
From (41.11), Ci rotates relative to the direction of v with instantaneous angular velocity
§42. Neutron scattering in an electric field
In collisions between neutrons and nuclei, the scattering through large angles is
determined by the main interaction, the nuclear forces. In small-angle scattering,
however, it can be shown that the interaction of the magnetic moment of the
neutron with the electric field of the nucleus becomes important (J. Schwinger,
1948).
We shall assume that the neutron is non-relativistic, so that the interaction in
question is described by the approximate Hamiltonian (41.13). The magnetic
moment of an electrically neutral particle is wholly "anomalous" and the operator
H ' reduces in this case to the kinetic-energy operator:t
H = - ^ A +i^a«ExV.
2m
mc
(42.1)
Since the electromagnetic interaction of the neutron is small, the corresponding
scattering amplitude / cm may be calculated by the Born approximation:
/-=-2SP/«""-'"('^-BXV)«"-"^
(see QM, §126), or
/em = 2 ^ p * • Eq x p,
Eq = [ E(r) e , q
r
d3x,
(42.2)
where p and p' are the neutron momenta before and after scattering, and hq =
p' - p. In this form, the amplitude / cm is an operator with respect to the spin
variable.
Before continuing the calculation, we should note the following point. Formula
(42.1) has been derived in §41 for slowly varying fields (which in practice meant
neglecting terms in the Hamiltonian containing coordinate derivatives of the field).
As applied to the Coulomb field of the nucleus, this means that the wavelength h/p
t In this section, ordinary units are used, and m denotes the mass of the neutron.
158
Particles in an External Field
§42
must be small compared with the distances r—1/q which are important in the
integral Eq. Hence hq<p, so that the scattering angle 6 -hqlp <\. Thus the
required condition is in fact satisfied for small-angle scattering.
For a Coulomb field with potential 4> = Ze\r, the Fourier component of the field
is
4jrZe
E q =-iqcl> q = - i q — j - ;
see Fields, (51.5). Substitution in (42.2) gives
-
. IZeiL
,
For small scattering angles, hq - p0 and px p'«p 2 0v, where v is a unit vector in
the direction of p x p'. Thus
-
. 2Zeix
The nuclear scattering amplitude must be added to this expression. Owing to the
rapid decrease of the nuclear forces with increasing distance, this amplitude tends
for small angles to a finite (energy-dependent) complex limit, which we denote by
a. The total scattering amplitude is therefore
f = a + i(blO)a • v,
b= 2Ze^lch = 2Za/x/e.
(42.3)
We see that the electromagnetic scattering is indeed predominant at sufficiently
small angles.
The expression (42.3) is the same in form as that discussed in QMy §140. We can
therefore make direct use of the formulae derived there. The scattering crosssection summed over all possible final polarization states is
^ = M 2 + ^ + 2 b i m a ■ v • Ç,
do
u
(42.4)
where Ç is the initial polarization of the neutron beam (called P in QMy §140). If the
initial state is unpolarized (Ç = 0), then the polarization after scattering is
c=
2bd im a
| a | ! e ! + b!V-
This is a maximum when 0 = bl\a\y and £max = im al\a\.
, - ..
<415)
CHAPTER V
RADIATION
§43. The electromagnetic interaction operator
T H E interaction of electrons with an electromagnetic field can, as a rule, be treated
by means of perturbation theory. This is because the electromagnetic interaction is
comparatively weak, as is shown by the smallness of the corresponding dimensionless "coupling constant", viz. the fine-structure constant a = e2lhc = 1/137. The
smallness of this number is of fundamental importance in quantum electrodynamics.
In classical electrodynamics (see Fields, §28), the electromagnetic interaction is
described by the term
(43.1)
-epA^
in the Lagrangian density of the "field + charge" system (A being the 4-potential of
the field and j the particle current density 4-vector). The current density satisfies
the equation of continuity,
d,j" = 0,
(43.2)
which expresses the law of conservation of charge. According to Fields, §29, the
gauge invariance of the theory is closely related to this law: when Aß is replaced by
A» + d^x (4.1), a term -ej^d^x is added to the Lagrangian density (43.1), and this, by
(43.2), may be written as the 4-divergence -ed^xj* 1 ); it therefore disappears on
integration over dAx in the action S = / L d4x.
In quantum electrodynamics, the 4-vectors j and A are replaced by the
corresponding second-quantized operators. The current operator is expressed in
terms of the i/f-operators by j = i/ryt/r. The generalized "coordinates" q in the
Lagrangian
J Linter d*X = - t J (/Â)
d\
are represented by the values of ip9 i and A at each point in space. Since the
Lagrangian density is found to depend only on the "coordinates" q themselves
(and not on their derivatives with respect to x), the change to the Hamiltonian
density by formula (10.11) amounts simply to a change in the sign of the Lagrangian density.t Thus the electromagnetic interaction operator (the space integral of
t Independently of these arguments, it may be noted that, when only the first-order small correction
is considered, any small correction in the Lagrangian appears in the Hamiltonian with just a change of
sign (see Mechanics, §40).
159
160
Radiation
§43
the interaction Hamiltonian density) has the form
V = e J (/A) d3x.
(43.3)
The free electromagnetic field operator is the sum
 = 2 [cnAn(x) + cJA*(x)],
(43.4)
n
which contains the operators of photon creation and annihilation in various states
labelled by the suffix n. Each operator has matrix elements only for an increase or
decrease of the corresponding occupation number Nn by 1 (the other occupation
numbers remaining unchanged). The operator À therefore also has matrix elements
only for transitions in which the number of photons changes by 1. That is, only
processes of the emission or absorption of a single photon occur in the first
approximation of perturbation theory.
According to (2.15), the matrix elements are
<N„ - \\cn\Nn) = (Nn\c:\Nn - 1) = VN„.
(43.5)
If there are no photons (of type n) in the initial state of the field, then (l|Cn|0) = 1.
The matrix element of the operator (43.3) for photon emission is
Vfi(t) = ej(jfiA*)d\
(43.6)
where An(x) is the wave function of the emitted photon and jfi the matrix element
of the operator / for a transition of the emitter from the initial state i to the final
state /.t The 4-vector jfl = (p/„ j/i) is called the transition current.
Similarly, we obtain the matrix element for photon absorption:
Vfi(t) = ef(jfiAn)d>x.
(43.7)
This differs from (43.6) only by having A„(x) in place of A£(x).
The argument t of Vfi is shown in order to emphasize that the matrix element is
time-dependent. By separating the time factors in the wave functions, we can
change in the usual way to time-independent matrix elements:
Vfi(t)=Vfie-iiErE^)t,
(43.8)
where E„ Ef are the initial andfinalenergies of the emitting system, and the sign +
is for emission and absorption respectively of a photon co.
t The notation in (43.6) is slightly inconsistent. The suffixes in V/, refer to states of the whole system
"emitter +field",those in jfi to states of the emitter only.
!
§44
Emission and Absorption
161
The wave function of a photon with a definite momentum k and a definite
polarization is
A = V(4ir)
"
vè) c '"
(439)
(see (4.3); the time factor is omitted). Substituting in (43.6), we find the matrix
element for the emission of such a photon:
Vfi = e V(4TT) ^ ^
e*j£(k),
(43.10)
where j/,(k) is the transition current in the momentum representation, i.e. the
Fourier component
Jfi(k) = jifi(r)eikrd3x.
(43.11)
The corresponding formula for photon absorption is
Vfi
= e V(4TT) ^ ^
e,m-k).
(43.12)
The equation of conservation of current in the momentum representation is the
condition of 4-transversality of the transition currents:
kj$ = WOO ~ k • J/iOO '= 0.
(43.13)
The formulae given in this section do not assume any particular form of the
current operator, and are generally valid for electromagnetic processes involving
any charged particles. The existing theory allows the form of the current operator
to be determined (and hence, in principle, its matrix elements to be calculated) only
for electrons. For applications to systems of strongly interacting particles, including nuclei, a semi-phenomenological theory will be used, in which the transition
currents appear as empirically determined quantities subject only to the conditions
of space-time symmetry and to the equation of continuity.
§44.
Emission and absorption
The transition probability under the action of a perturbation V is given, in the
first approximation, by the well-known formulae of perturbation theory (QM, §42).
Let the initial and final states of the emitting system belong to the discrete
spectrum.t Then the probability (per unit time) of the transition i -*/ with emission
t This certainly implies that recoil is neglected, the emitter as a whole remaining at rest.
§44
Radiation
162
of a photon is
dw = 2TT\ Vfi\2ô(Ei -Ef-co)
dv,
(44.1)
where dv arbitrarily denotes the ensemble of quantities describing the state of the
photon and taking a continuous sequence of values; the photon wave function is
assumed normalized by the delta function "on the v scale".
If a photon having a definite angular momentum is emitted, the only continuous
variable is the frequency co. Integration of (44.1) with respect to dv = dco eliminates
the delta function, co being replaced by E,- - E/, and the transition probability is
w=27r|V /i | 2 .
(44.2)
If, however, we consider the emission of a photon having a given momentum k,
then dv = d3kl(2ir)3 = co2 dco doj{2ir)3. Here it is presupposed that the photon wave
function (plane wave) is "normalized to one photon in the volume V = V\ as
always in this book; dv is the number of states in the phase volume V d3k. Thus the
probability of emission of a photon with a given momentum is
dw = 2TT|Vfi\28(Ei -Ef-<o)
d3Jc/(27r)\
(44.3)
or, after integration over dw,
dw =-p-j | V « | V do.
(44.4)
In this we must substitute the matrix element Vfi from (43.10).
In subsequent sections we shall use these formulae to calculate the probability
of emission in various specific cases. Here we shall consider certain general
relations between radiative processes of various kinds.
If in the initial state of the field there is already a non-zero number N„ of the
photons in question, the matrix element for the transition is multiplied by
<Nn + l|c;|N n > = V(N n + l),
(44.5)
i.e. the transition probability is multiplied by Nn + 1. The 1 in this factor corresponds to the spontaneous emission which occurs even if Nn = 0. The term Nn
represents the stimulated or induced emission: we see that the presence of
photons in the initial state of the field stimulates the further emission of photons of
the same kind.
The matrix element Vif for the transition with the opposite change of state of
the system (/-» i) differs from Vfi in that (44.5) is replaced by
<N n -l|c n |N n ) = V N n
(and the other quantities are replaced by their complex conjugates). This opposite
transition is a transition of the system from the level Ef to the level E, with
§44
163
Emission and Absorption
absorption of a photon. Thus the photon emission and absorption probabilities for a
given pair of states i, / are related byt
wjwa = (N„ + 1)/N„,
(44.6)
an expression first derived by A. Einstein (1916).
The number of photons can be related to the intensity of the external radiation
incident on the system. Let
I kc dco do
(44.7)
be the radiation energy incident on unit area per unit time and having polarization
e, frequency in the range dco and wave-vector direction in the solid-angle element
do. These ranges correspond to k2 dk do/(27r)3 field oscillators, each having Nke
photons of the specified polarization. Hence the same energy (44.7) is given by the
product
c
k2 dkdo xr .
h(o3 vr J J
$ Nkthoj = Q ii Nke dco do.
n
\Z7T)
OTT C
From this we find the required relation:
Nke = ^ ^ I k e .
(44.8)
Let dw$] be the probability of spontaneous emission of a photon with
polarization e into the solid angle do, and let the indices (in) and (a) denote the
corresponding probabilities for induced emission and for absorption. According to
(44.6) and (44.8), these probabilities are related as follows:
d<> = dwW = dwJT • ^f-
Jke.
(44.9)
If the incident radiation is isotropic and unpolarized (Ike independent of the
directions of k and e), then the integration of (44.9) with respect to do and
summation with respect to e gives similar relations between the total probabilities
of radiative transitions (between given states i and / of the system):
w(a)
=
w(in)
=
w(sp)
2 2
TTJ^ j
( 4 4
1())
where / = 2 x 4irlkt is the total spectral intensity of the incident radiation.
If the states i and / of the emitting (or absorbing) system are degenerate, the
total probability of emission (or absorption) of the photons concerned is found by
summation over all mutually degenerate final states and averaging over all possible
t In the rest of this section, ordinary units are used.
§45
Radiation
164
initial states. Let the degrees of degeneracy (statistical weights) of states i and / be
gi and gf. For processes of spontaneous or induced emission, the states i are the
initial states, and for absorption the states /. Assuming in each case that all g, or gf
initial states are equally probable, we obviously have instead of (44.10) the relations
gfwM = giw™ = giw^] j 4 l
(44.11)
In the literature one frequently meets the Einstein coefficients, defined as
Aif = w{sp\
Bif = w(in)c/I, Bfi = w{a)cll
(44.12)
where I/c is the spatial spectral density of radiation energy. They are related by the
equations
gfBfi = giBif = giAif7T2c}lhœ\
(44.13)
§45. Dipole radiation
Let us apply the formulae derived above to the emission of a photon by an
electron (in general, a relativistic electron) moving in a given external field. In this
case the transition current is the matrix element of the operator
/ = ijiytjf,
in which the (//-operators are assumed expanded in terms of the wave functions of
stationary states of the electron in a given field (§32). The matrix element
<0« l/|j|l«0/> corresponds to a transition of the electron from state i to state /. This
change in the occupation numbers is brought about by the operator âfâ» and the
transition current is
iti = i/y»to = ( W i , *?«*)>
(45.1)
where ^ and i/// are the wave functions of the initial andfinalstates of the electron.
Let the wave function of the photon be chosen in the three-dimensionally
transverse gauge (the polarization 4-vector e = (0, e)). Then the product jfie* =
- j/i • e* in (43.10). Substituting Vfi in (44.4), we obtain the following expression for
the probability (per unit time) of emission of a photon with polarization e into the
solid-angle element do:
dw„ - e2(ü>/27T)|e* • }fi(k)\2 do,
(45.2)
j/,(k) = J - ^ a * •«-*•'d3x.
(45.3)
where
Dipole Radiation
§45
165
Summation with respect to the polarization of the photon is effected by
averaging over the directions of e (in a plane perpendicular to the given direction
n = k/co), and the result is then doubled because of the two independent possible
transverse polarizations of the photon.t Thus the result is
dwn = e2(col2iT)\n x j/f.(k)|2 do.
(45.4)
A very important case is that where the photon wavelength À is large compared
with the dimensions a of the radiating system. This usually means that the velocity
of the particles is small compared with that of light. In the first approximation in
a/À (corresponding to dipole radiation; cf. Fields, §67), the factor e lk r varies only
slightly in the region where «ft or i/// is appreciably different from zero, and it can be
replaced by unity in the transition current (45.3). This implies that the photon
momentum is neglected in comparison with the momenta of the particles in the
system.
In the same approximation, the integral j/,(0) may be replaced by its nonrelativistic value, which is simply the matrix element \fi of the electron velocity
with respect to the Schrödinger wave functions. In turn, this element \fi = - iW/,,
and etfi = d/„ where d is the dipole moment of the electron (in its orbital motion).
Thus we have the following formula for the probability of dipole radiation:
dwtn = (w3/27r)|e* . d/e|2 do.
(45.5)
(Here the direction of n occurs implicitly: the vector e must be perpendicular to n.)
Summation with respect to the polarizations gives
dwn = (co3/27r)|n x d /t f do.
(45.6)
Since these formulae are non-relativistic (as regards the electron), they can be
immediately generalized to any electron system by taking d/, as the matrix element
of the total dipole moment of the system.
Integrating (45.6) over all directions, we have the total probability of radiation:
w = (4o>3/3)|d/i|2,
(45.7)
w = (4a)3/3Äc3)|d/l|2.
(45.7a)
or, in ordinary units,
t In the averaging, we use the formula
^ t = l(6,k-n,7ik)
(45.4a)
or
(a • c)(b • e*) = |{a • b - (a • n)(b • n)}
= |[axn][bxn],
where a and b are constant vectors.
(45.4b)
166
§46
Radiation
The intensity I is found by multiplying the probability by H(D:
I = (4oi4/3c3)|d/i-|2.
(45.8)
This is directly analogous to the classical formula (see Fields, (67.11)) for the
intensity of dipole radiation from a system of periodically moving particles: the
intensity of radiation at frequency co5 = sco (where to is the frequency of the
particle motion and 5 an integer) is
IM=(4<o4J3c*)\ds\29
(45.9)
where d5 are the Fourier components of the dipole moment, i.e. the coefficients in
the expansion
d ( 0 = É dMe'iM.
(45.10)
The quantum formula (45.8) is got from (45.9) by replacing these Fourier components by the matrix elements of the corresponding transitions. This rule (which is
an expression of Bohr's correspondence principle) is a particular case of a general
relation between the Fourier components of classical quantities and the quantum
matrix elements in the quasi-classical case (see QM, §48). The radiation is quasiclassical for transitions between states having large quantum numbers; the transition energy hay = Ex - Ef is then small in comparison with the energies Ei and Ef
of the radiator. This, however, would not lead to any change in the form of (45.8),
which is valid for all transitions. This explains the fact (which is something of an
accident) that the correspondence principle for the radiation intensity is valid not
only in the quasi-classical but in the general quantum case.
§ 46. Electric multipole radiation
Instead of considering the emission of a photon in a given direction (i.e. with a
given momentum), let us now consider the emission of a photon with definite
values of the angular momentum j and its component m in some chosen direction z.
We have seen in §6 that such photons can be of two kinds, electric and magnetic.
Let us take first the emission of electric photons, and again assume that the
dimensions of the radiating system are small in comparison with the wavelength.
The calculations are conveniently carried out by means of the photon wave
functions in the momentum representation, i.e. by expressing the 4-vector A^(r) as
a Fourier integral. Then the matrix element is
Vfi^ejifMAWd'x
= ejd3x> jjKr) j0p
A*(k) <T' kr ;
for simplicity, we omit the suffixes oijm to the photon wave functions.
(46.1)
§46
Electric Multipole Radiation
167
For an Ej photon we take the wave function from (7.10), with the arbitrary
constant C having the value
The reason for this choice is to ensure that, in the spatial components of the wave
function (A), the terms containing spherical harmonics of order j - 1 cancel (as is
seen from formulae (7.16)). Then A will include only spherical harmonics of order
j + 1, and therefore the corresponding contribution to Vfi is (as will be clear from
the subsequent calculation) of a higher order of smallness (in a/A) than the
contribution from the component A0 = <ï>, which includes spherical harmonics of
the lower order j.
Thus we put
A"=(<M),
cl> = ^ ^ ^ ^ ô ( | k | - a ) ) y j m ( n )
(n = k/w). Substituting this expression in (46.1) and carrying out the integration
over d|k|, we obtain
• P/l(r) j doa e-ikrYfm(n).
Va = -eyjijl^-jd'x
(46.2)
To calculate the inner integral, we use the expansion (24.12), written in the form
e,kr = 4 7 r i 2
Nm=-I
i'g,(kr)Yfm(k/k)Y,m(r/r),
(46.3)
where
g.(fcr)= > /^;J ( + i(kr);
(46.4)
see QM, (34.3).t
Substitution of this expansion in (46.2) gives
| e-ikrYfm(n)
doB = 47ri-'gi(kr)Yfm(r/r);
the remaining terms are zero because of the orthogonality of the spherical harmonics. On account of the condition a/A « 1, only distances such that kr < 1 will be
important in the integral with respect to d3x. We can therefore replace the
t The normalization of the functions gi is such that their asymptotic form as kr-»» is
g,(Jcr)~(l/kr)sin(kr-^r).
(46.4a)
168
§46
Radiation
functions gj(kr) by the first terms of their expansions in powers of fcr:t
a(kr)-(kr)V(2j+l)!!.
(46.5)
The result is
v
\\m+{ii
-(
v/i - (-1)
i y
/ ( 2 j + l ) ( j + 1)
—.
u)}+~2
(2jTT)!!
(Q,
ie)
'" w)/i '
(
}
with the notation
( Q
^
=
V ^ T T / PrMr%rn{rlr) d3x
(46.7)
(Vj.-m = (-l),"mYPjm)- The quantities (46.7) are called the I'-pole electric transition
moments of the system, by analogy with the corresponding classical quantities
(Fields, §41).*
For an electron in an external field, pfi = i//*t//j, and the quantities (46.7) are then
calculated as the matrix elements of the classical quantity
Uj«
J
V 2 / + - r Y-
In the non-relativistic case (as regards the particle velocities), the transition
moment can in principle be calculated similarly for any system of N interacting
particles. The transition density is expressed in terms of the wave functions of the
system by
N
f
J
n=l
(46.8)
where the integral is taken over the whole of configuration space.§
The photon wave function used here corresponds (in the coordinate representation) to normalization by the delta function on the œ scale, as assumed in formula
(44.2). Substituting (46.6), we find the probability of Ej radiation:)]
< = ü o i Ä i t f 0 <"**'^WS-'")*!2-
<46-9)
t The power of kr is equal to the order of the function Yjm by which g, is multiplied. This justifies
the neglect of the terms in A which contain higher-order spherical harmonics.
t The multipole moments are defined without the factor €, since in this book the currents also are
defined without the charge factor.
§ A situation can occur where the transition probability vanishes according to the approximate
selection rules, valid only when the spin-orbit interaction of the electrons is neglected. Then, to obtain a
non-zero result, we must use the wave functions with the relativistic correction which takes account of
this interaction.
|| It might appear at first sight that, owing to the isotropy of space, the total probability of photon
emission ought not to depend on the value of m. The incorrectness of this conclusion is easily seen if we
notice that different final states of the system (for a given initial state) correspond to the emission of
photons with different values of m; cf. the rule (46.16) below.
I
§46
169
Electric Multipole Radiation
In particular, for j = 1 we have
wiï=^e2\(Q\?-m)fi\2.
(46.10)
The quantities Q ^ are related to the components of the electric dipole moment
vector by
eQ(ifo} = id»
eQ\% = +^(dx
(46.11)
± idy).
Summing (46.10) with respect to m, we naturally obtain the earlier formula (45.7)
for the total probability of dipole radiation.
The angular distribution of multipole radiation is given by formula (7.11). When
this is normalized to the total emission probability Wjm, we have
dwjm = |Y#(n)|2H> do
Wj,
JG' + D
|VnYim|2 do.
(46.12)
In particular, for j = 1,
Vio = î y ^ c o s ö ,
Yu±x = TiyJ—smd
• e±i4>,
where 0 and <>
/ are the polar angle and azimuth of the direction n relative to the
z-axis. On calculating the gradient, we find that the angular distribution of dipole
radiation with a definite value of m is given by
A
3 •2nJ
aw 10 = w,o£— sin' 0 do,
OTT
A
3 l + cos20,
dwl±l = w, ± i^
r
do.
OTT
L
,,, n ,
(46.13)
These expressions could also, of course, be obtained from formula (45.6) by putting
firstly (for m = 0) dx = dy = 0, d2 = d, secondly (for m = ± 1) dy = + idx = d/V2,
d2=0.
If the order of magnitude of the dimensions of the system (atom or nucleus) is
a, then that of the electric multipole moments is, in general, Q^ — a1. The
probability of multipole radiation is
wft-akika)21.
(46.14)
When the multipole order increases by one, the probability decreases by a factor
-(ka)2.
The laws of conservation of angular momentum and parity imply certain
selection rules which restrict the possible changes in the state of the radiating
system. If the initial angular momentum of the system is J„ then, after emission of
a photon with angular momentum j, the angular momentum of the system can have
Radiation
170
§46
only those values // which are in accordance with the angular momentum addition
rule (J, - J/ = j):
|/i-J/|*j*Ji+J/.
(46.15)
For given values of /, and Jf, the same rule (46.15) specifies the possible values
of the photon angular momentum j. But, since the probability of emission decreases
rapidly with increasing j, the emission occurs principally with the lowest possible
multipole order.
The components M, and Mf of the angular momenta J, and J/, and m of the
photon angular momentum, satisfy the relation
(46.16)
Mi-Ms = m,
which is obvious from the same law of addition of angular momenta.
The parities P, and Pf of the initial and final states of the radiating system must
be such that P/PPh = Pi, where Pph is the parity of the emitted photon. Since the
parities can have only the values ±1, this condition may also be written
P,P/ = PPh.
(46.17)
For an electric photon Pph = (-l) J , and the parity selection rule for electric multipole radiation is therefore
P,P/ = (-iy.
(46.18)
The selection rules for total angular momentum and for parity are entirely
rigorous and must be satisfied in emission by any systems. There may also be
other rules which are more restrictive and which arise from certain properties of
the structure of particular radiating systems. These latter rules must of necessity be
approximate to some extent; they will be discussed in later sections of this chapter.
The dependence of the emission probability on the quantum numbers m, M-, and
Mf is entirely determined by the tensor character of the multipole moments.. The
quantities QJm with a given j form a spherical tensor of rank j. The dependence of
its matrix elements on these quantum numbers is given by the formula
|(n///M/|Q,-m|niJIM1>|2=(^
^
J^jW/lQ>IMi>l 2
(46.19)
(see QM, (107.6)), where n conventionally denotes all the quantum numbers
specifying the state of the system, other than J and Af. The reduced matrix
elements on the right of (46.19) do not depend on m, M„ Mf. On substituting this
formula in (46.9), we obtain the required dependence, which is proportional to
Ih
\Mf
J'
m
Ji\\
-Mi)'
Magnetic Multipole Radiation
§47
171
here it is, of course, assumed that the emitter is not in an external field, and that the
transition frequency w is thus independent of M, and Mf.
Summing the probability over all values of Mf (for a given M;), we have the
total probability of emission of a photon of a given frequency from the initial level
n„ Jj of the system. It is obvious from the isotropy of space that this quantity must
also be independent of the initial value M,. The summation is carried out by means
of the formula
2 Kn/J/M/IQ,. - m MM,>| 2 = j j ^ Kn^llQUn,/,)!2
(46.20)
(see QM, (107.11)).
§47. Magnetic multipole radiation
The wave function of a magnetic photon is A*1 = (0, A), where A is given by (7.6).
Substitution in (46.1) gives for the transition matrix element
Vfl = ~ e ^
| d3x • \fi(r) J don • e~ik"Yff*(n).
(47.1)
The components of the vector Yjm) can be expressed in terms of the spherical
harmonics of order j , as shown in (7.16). Again using the expansion (46.3), we
obtain for the inner integral
| e-'k-'Yg)*(n) do0 = 47rrigi(kr)Yfc)*(r/r),
and, on substituting g, from (46.5),t
V/i =
" ei~l (2JTÏJ!! / J ' i ( r ) r J " W W r ) d'x.
Here we must substitute, in accordance with the definition (7.4),
YS l (r;r) =
V[j0l+|)]rxVyftn;
we then transform the integrand by means of the formula
r'j / i TxVYf m = - r x j / i . V(r'Y*,),
obtaining
V„ = (- , r ,i V (2 ' + ff + 1> g ^ l j e(QS»>„),„
t The current j must not be confused with the angular momentum j.
(47.2)
172
Radiation
§47
with the notation
(Qï% = J^Y V ^ T /
rx
hi ' WYim)
(47.3)
d*x.
These are called the 2}-pole magnetic transition moments.
Because of the analogy between the expressions (47.2) and (46.6)
emission probability, we obtain a formula which differs from (46.10) only in
electric moments are replaced by magnetic moments. Formula (46.12)
angular distribution also remains valid (as has already been mentioned
nection with (7.11)).
Let us analyse the form of (47.3) when j = 1. In this case, the functions
for the
that the
for the
in conare
and their gradients are simply the spherical unit vectors e{0\ ef n (7.14). The
quantities e(Q\Z})fi are therefore the spherical components of the vector
H/, = k J r x j / t d 3 x ,
(47.4)
which is similar in form to the classical magnetic moment (see Fields, §44). The
total probability of M l radiation is given in terms of this quantity by the formula
(in ordinary units)
w=(4co 3 /3ftc 3 )| fJ i /i | 2 .
(47.5)
We shall show how formula (47.4) is related to the usual non-relativistic quantum
expression for the magnetic moment operator.
The expression for the transition current is (see QM, §115)
J/i = - 2m W V * ' ~ ^Vt//*} + fs c u r WSM.
(47 6
->
where /i is the magnetic moment of the particle and s its spin. Hence
^
=
~ 4 ^ / * K r X V)*' d'X + Ûi / * l(r X V ) ^ d'x
+
Is \
T X curl(
^^<) d >
(47.7)
In the second term, we write
| «Mr x V)<//? d3x = - [ i//?(r x V)^ d3x + J c u r l ( n ^ ) d3x.
The last integral can be transformed into one over an infinitely distant surface, and
§48
Angular Distribution and Polarization of the Radiation
173
is zero. Thus the first two terms in (47.7) are equal. In the third term, we transform
the integral as follows (temporarily writing F = i/>?si//,):
( r x (V x F) d3x = <j> r x (dt x F) - [ (F x V) x r d3x.
The surface integral is zero, and in the last term
(F x V) x r = - F div r + F = - 2F.
Thus,
J r x c u r l F d 3 x = 2 J F d3x.
The expression for \kjx therefore becomes
^ = / ^ ^ L + ^s)^d3x,
(47.8)
where L = - ir x V is the particle orbital angular momentum operator. This is, as it
should be, the matrix element of the operator
A - £ t
+
£«,
(47.9)
which contains the operators of the orbital and intrinsic magnetic moments of the
particle.
The selection rules for magnetic multipole radiation are analogous to those for
the electric case: the rules (46.15), (46.16) again apply to the total angular momentum, and the parity rule is
P i P / = (-iy + 1 ,
(47.10)
which is obtained by substituting in (46.17) the parity of the Mj photon, Pph = ( - i y + l .
§48. Angular distribution and polarization of the radiation
The formulae derived in §§46 and 47 relate to the emission of a photon with
definite values of the angular momentum j and component thereof m. It was
accordingly assumed that the radiating system (a nucleus, say) has not only definite
values of the angular momentum J but also definite polarizations, i.e. values of M,
both before and after the emission.
Let us now consider the more general case of emission by a partially polarized
nucleus (whose dimensions are again assumed small in comparison with the
wavelength). The emitted photon again has a definite angular momentum j, but may
be partially polarized. Let us find the emission probability as a function of the
§48
Radiation
174
direction n of the photon. This probability must be expressed in terms of density
matrices which describe the polarization states of the nucleus and the photon.
For this purpose, we shall first write down the emission probability as a
function of the direction n and helicity A of the photon (A = ± 1), for the case
where the initial and final nuclei have definite values J„ M,- ; //, Mf.
The matrix element for emission of a photon with definite values j, m is
proportional to the matrix element of the (electric or magnetic) 2J-pole moment of
the nucleus:
(JfMf; jmlVlJM,) * (-irVfMAQ^JM).
(48.1)
The wave function of the emitted photon (in the momentum representation) is
proportional to Yjm(n) or YJJÏÎ*(n). The wave function of a photon whose momentum
is in the direction n and whose helicity is A is proportional to the polarization
vector e(A). The matrix element for emission of a photon n, A is found by
multiplying (48.1) by the projection of the wave function of the state |jm) on that of
the state |nA):
(JfMf, nA| V|JM> « (-l)m<J/M/|Qj)-m|J)M()e(A)* • Yjm.
According to (16.23), for photons of either type
e(A)* • Yjm(n) a D&(n).
(48.2)
The matrix element of the multipole moment can be expressed in the usual way in
terms of the reduced element. Thus we find the transition probability amplitude in
the form
<J / M / ;nA|V|J I M i )«(-l) J r M / + '"(_^
Jm
^JQD&OI),
(48.3)
where Q denotes <J/||Q||Jj).
We can now proceed to the general case of mixed polarization states. According
to the general rules of quantum mechanics, the transition probability is proportional
to the expressiont
2 (JfMf ; nA| V|WMW,; nA'| V'|J,Mj)* x
(m)
x <M,|p(l)| M JKMjp^M/XA V^A>f
t If the initial and final states of the system are described by the superpositions
n
then the matrix element is
m
(48.4)
§48
175
Angular Distribution and Polarization of the Radiation
where pu\ p{f\ p{y) are the density matrices of the initial nucleus, the final nucleus
and the emitted photon; the symbol (m) beneath the summation sign indicates that
the sum is taken over all the m-type quantities which occur twice (M„ M 5, M/, M/,
À,À'). Then (48.3) is to be substituted in (48.4).
Let w(n) do denote the probability of emission of a photon into the solid angle
do. The total probability of emission, in any direction and with any polarizations of
the photon and the final nucleus, is evidently independent of the initial polarization
state of the nucleus, is given by formulae already known, and is of no interest here.
We shall therefore arbitrarily normalize the probability w(n) to unity. The result ist
w(n) = < 2 i + 1 > M
+ 1
> £ (-ip-M,-MlD(i) Dfö x
Ö7T
J
>
xl
fZ\
j
Ji
\ ( J>
j
J i
\x
\-Mf
- m MJX-M'i
- m ' MS/
x<Mi|p(0|Mi)<M^p(/,|M/><A'|pw|A>;
it will be seen below that the normalization is correct. This formula can be.
transformed by using the series expansion QM (110.2) for the product of the two D
functions:
D O)
n(i)* _
f—l\A+™'n(J>
D(^
_>A. _LA)(<
= (-ir«?(2L+,)(<
Jm,
j-jDg,
where A = A - A ' , fx = m - m' and L takes integral values 2*2j. Thus we have finally
w(n) = (2j + y 27 ' + U S 2 (-l)2}>-M>-M>+m+\2L + 1) x
OTT
L
(m)
/i
j
L \/j
VA - A ' - A A m
j
L W J,
j
J , \ / J,
j
J,\
- m ' - / u i A - M , - m M, A - M } - m ' M',/
xDt'diKM.Ip^MlXMIlp^M/KA'lp^lA).
(48.5)
and its square is
K/|V|«)|2=
2
n. n', m, m'
V^VJva.aiH^-bÄ.
The case of mixed states is obtained by making the changes
so that
K/|v|i>|2-» S
n, n', m, m'
v^v^M*.
t In transforming the sign factor note that the numbers 2J„ 2J/, 2M„ 2M/ have the same parity; j and
m are integers, and A = ± 1.
176
§48
Radiation
As previously, 2(m) denotes summation over all m-type quantities which occur
twice. Here it must be noted that A and A' differ from the other quantities, since
they have only two values, A, A' = ± 1, corresponding to the two polarizations of
the photon, and not 2) + 1 values for any given j .
Formula (48.5) embodies all the necessary information about the angular distribution and polarization of the emitted photons, and also about the polarization of
the secondary nuclei (i.e. those which have emitted a photon). It is assumed that
the initial density matrix is given.
ANGULAR DISTRIBUTION
The angular distribution of the photons is obtained by summation over all
polarizations of the photon and the secondary nucleus. The averaging with respect
to polarizations is done by substituting the density matrices of the unpolarized
states:
<A|pw|A'> = J«AAS
(M,\pW\M',) = Y^T\
b
(48 6)
-
»M
after which the summation amounts to a multiplication by 2 for the photon or by
2Jf + 1 for the nucleus. Thus the summation is effected by simply making the
changes
<A|pw|A')->SAA.,
(48.7)
(MI\pW\hrf)-+8MfM-P
and the angular distribution is
w(n) = flJ + lX2Ji + l) y 2 (-ir' + I (2L + l)Dfe>(n) x
87r
T (m)
(j j L\(j
j
L \( Jf
\A -A OAm - m ' -pIK-Mf
j
-m
J , \ / Jf
MjK-M,
j
J-t\
- m' M\)
x<M,|p(l)|Mi>.
This formula can be considerably simplified by carrying out the summation over
m-type quantities. First of all, we note that
a \ O H - ^ A ; o)-
«M>
and therefore
=0
for odd L.
In the sum over L, therefore, only the terms with even L remain, and it involves
§48
177
Angular Distribution and Polarization of the Radiation
only even-order spherical harmonics D ^ . This result is obvious a priori, since, by
the conservation of parity, the probability must be unchanged by inversion, i.e. by
putting n -* - n.
Thus we have
Ä ( p ) .ßl±i22i^ ? ( 2 L + 1)(J
£/
u
Ji
L)DBWX
-m M , / \ - M , - m' M'J
\m -m' -IL)\-M,
(i)
x<Mi|p |M'i).
The normalization here is easily verified: with the formula
f D\tf(n) dol4ir = ÔL0 8Mo,
integration over all directions leaves only the term with L = 0, ix = 0, and the
formulae
Ii
\m
j^-l-M/
* °\ = K(-1V"
}
-m
0)
1
V(2j +1)'
- m AfJ =2JT+T'
tr >0)=1
'
then show that the integral is equal to unity.
The further summation with respect to m, m', Mf in the inner sum in w(n) is
effected by means of QM, (108.4). The final expression for the photon angular
distribution is
*(n) = (-l)'W 2 > + l M 2 J ' + 1)X
\5 en ( - i)LV(2L+i) (i -i o){J; Î
3?*B"«H
(48.9)
where
(48.10)
w
)
^ i * = (-l) 0l - M .
The inner sum in (48.9) is taken over all |/i| ^ L, and the outer sum over all even L
such that
L*s2j,
Lss2J f .
(48.11)
178
§48
Radiation
(These conditions result from the triangle rule which has to be satisfied by the
quantities in the 3j-symbols that appear in (48.9), (48.10).) The number of terms in
the sum is therefore usually small. For instance, when J, = 0 or |, only the term with
L = 0 remains, and the radiation is isotropic; this term is easily seen to equal 4, as it
should by the normalization condition. When J, = 1 or 3/2, or j = 1, the two terms
with L = 0 or 2 remain in the sum over L. If the density matrix p0) is diagonal
(Mi = Mi), then /x = 0, and the distribution function (48.9) becomes an expansion in
Legendre polynomials; according to (16.5) and QM, (58.23), the functions D&* are
the functions PL (cos 0). Finally, if
i.e. if the initial nucleus is unpolarized, then all the 0*^ are zero except 0*$ = l.t
The quantities ^LM are convenient characteristics of the polarization state of the
nucleus, and will be called polarization moments. Formula (48.10) defines them in
terms of the density matrix pM\i- The inverse formula expressing the density matrix
in terms of the polarization moments is easily verified:
'"»"£ VllTTrL<-1»"M'(-M' \ M K
< 48 ' 12 >
Let /L^ be a spherical tensor depending on the polarization state of the nucleus.
According to the general rules (see QM, (14.8)), its mean value in a state having the
density matrix pMM is
h. = 2 PMM</M'|/ L J/M).
(48.13)
Expressing the matrix elements of the fL$L in terms of the reduced element </||/L||/)
by means of the formula
= iH-i)}-M(_JM,
W\UM)
L
i)<J|/L|U>,
t Using the result that
/ J
\-M'
l
.1
° J \ - ( nJ-»
0 MJ-(_l)
V(2J + 1)5MM'
we have
&-«-*(-'*■ Ï i)«--^ + \2(_' M . I L)(-Jv I i)
= V(2J + 1) 8Lo S^O,
and the conclusion stated then follows from the definition (48.10).
§48
179
Angular Distribution and Polarization of the Radiation
and using the polarization moments as defined by (48.10), we obtain
j . = . <J\MJ)
UlL
V[(2L+1)(2J + 1)]
«
^
(4814)
K
}
PHOTON POLARIZATION
When the matrices p and pu\ as well as p (,) , are specified, formula (48.5)
determines the probability of a transition in which a photon is emitted, and the
nucleus left, in definite polarization states. Such states are essentially characteristic
not of the emission process as such, but of the detectors which record the photon
and the recoil nucleus and distinguish definite polarizations of these. There is
another and more natural formulation of the problem, in which the final state of the
"nucleus + photon" system is not specified from the start, and the polarization
density matrix of this state is to be determined, with only the direction of the
photon emission fixed.
The answer to this problem is given by the same formula (48.5). If this is written
as
iy)
w = w(n) S <M,; nA|p|M; ; nA'><Alp(y)|AXAf aPtf)|M/>f
(m)
(48.15)
then the expression (Mf; nA|p|M}; nA') is the required density matrix, since according to the general rules of quantum mechanics the probability w of a transition to a
specified state is given by its "projection" on the given piy\ pu\ The factor w(n) is
written in (48.15) so that this matrix shall be normalized by the usual condition,
2
(M/;nA|p|M/;nA)=l.
If we want the polarization of the photon alone, a summation over M/ = M} is
necessary:
<nA|p|nA'> = 2 (Mf; nk\p\Mf; nA').
Using a derivation exactly similar to that of (48.9), we obtain
x ? (-0VUL + 1)(| _'A, ffij
J
! J } 2 ni'DkM.
(48.16)
where A = A - A', and the summation is over all integral values of L which satisfy
the conditions (48.11).
In particular, circular polarization is determined by the Stokes parameter
6 = <nl|p|nl)-<n,-l|p|n,-l);
§48
Radiation
180
see §8, Problem. Because of the relation (48.8), all terms with even L in this
difference are zero, and the resulting formula for £2 differs from (48.9) only in that
the summation is over odd instead of even values of L.
SECONDARY NUCLEUS POLARIZATION
Finally, if we are interested only in the final polarization of the nuclei, we must
put p{y)-+ 8. If the integration with respect to directions of the photon is also carried
out, the density matrix of the secondary nucleus is
(Mf\p\M'f) = f w(n)(M,n\p\M'fn) do
= (2Ji + l) S
(-I)2*-*'-"'' x
* U -I M,)(-i -l
AJ;K"ÏM»-
The polarization moments calculated by means of this matrix are
9% = (M)J'+i'+L+'V[(2J, + \){2J, + 1)]{''
J
J
L
}^&.
(48.17)
If the initial nucleus is unpolarized, so is the final nucleus, but there exists a
correlation polarization, i.e. a polarization of the nucleus after emission in a
specified direction. Putting p(,)->S/(2J, + 1) (and correspondingly w(n) = l/47r) and
calculating as in the derivation of (48.9), we obtain for the density matrix describing
this polarization
(M/;n|p|M;;n) = (2J- + 1)(-1,^«^L2n(2L + 1)('1 _', £)(_'<,, J J ) x
x{j
J
> £}D&>(n).
(48.18)
The corresponding polarization moments are
9% = iL(-l)l+Ji+J'(2j + 1)V[(2L + 1)(2J, + 1)] x
*(i -i o){/1 *H>>-
(48i9)
Only even-order moments occur (which is also a consequence of the conservation
of parity already mentioned).
If the secondary nucleus in turn emits a photon, it will generate an anisotropic
distribution, being polarized. Since the polarization moments (48.19) depend on the
direction n of the photon emitted in the first decay, there is a certain correlation
between the directions of successively emitted photons (with an unpolarized
§49
Radiation from Atoms: The Electric Type
181
primary nucleus). Other correlation effects (of polarization, etc), in cascade emission can be treated similarly.!
PROBLEM
Find the relation between the polarization moments 0 ^ , 0 V and the mean values of the angular
momentum vector J and the quadrupole moment tensor Q».
SOLUTION. The reduced elements of the vector J and the tensor Qik are determined from
J1 = (J\\J\\J)2l(2J + \),
QZ = (J||Q||J)2/(2J + 1);
cf. QM, (107.10), (107.11). The operator 0 * >s expressed in terms of the angular momentum operators as
in QM (75.2):
Hence we find the mean value
rpr
0i
3Q 2
2
'-2J*(2J-l)
2p(J
2
iJ(4J
"
3)
-°L
+1X2J + 3)]
2J(2J-1)
}
The reduced matrix elements are
<J||J||J) = V[J(J + 1)(2J + 1)],
From (48.14) we now see that the polarization moments 0>iM are equal to the spherical components of
the vector
and the moments ^ M are equal to the spherical components of the tensor
r 10J(2J-1) VQ~k
L3(J + 1)(2J + 3)J Q
§49. Radiation from atoms: the electric typet
The energies of the outer electrons of an atom (which take part in optical
radiative transitions) have, as a rough estimate, the order of magnitude E ~ me4lh29
so that the radiated wavelengths A — hclE — h2l<xme2. The dimension of the atom is
a — h2lme2. Thus, in the optical spectra of atoms, we generally have the inequality
alk~~a<\. The ratio vie — a, where t; is the velocity of the optical electrons, has
a similar order of magnitude.
Thus, in the optical spectra of atoms, a condition is satisfied which means that
t A detailed account of these problems is given in the paper by A. Z. Dolginov, in: L. A. Sliv, ed.,
Gamma-luchi (Gamma Rays)> USSR Academy of Sciences, Moscow 1961, pp. 523-681.
t In §§49-51 and 53-55, ordinary units are used.
182
Radiation
§49
the probability of electric dipole radiation (if this is allowed by the selection rules)
considerably exceeds the probabilities of multipole transitions.! For this reason it
is electric dipole transitions which are the most important in atomic spectroscopy.
As has already been mentioned, such transitions are subject to strict selection
rules as regards the total angular momentum J of the atom and the parity P:$
| J ' - J | ^ 1 ^ J + J\
(49.1)
PP' = - 1 .
(49.2)
The inequality |J'- J\^ 1 signifies that the angular momentum J can change only
by 0 or ±1; also, the transition 0->0 is forbidden by the inequality J +.J'^ 1. The
parities of the initial and final states must be opposite.§
The probability of emission by the transition nJM-*n'J'M' is determined by
the corresponding matrix element of the dipole moment of the atom:
w(nJM -> n'J'M') = jjjp \(n'J'M'\d-m\nJM)\2y
(49.3)
a) = w(nJ->n7').
On summing (49.3) over all values of Af ' = M - m (with M given), we obtain the
total probability of emission with a given frequency from the atomic level n, J. The
summation is carried out by means of (46.20), and the result is
w(nJ -> nT) = ^ p 2 J T T \<n'J'\\dWnJ>\2-
< 49 ' 4 >
The squared modulus of the reduced matrix element is sometimes called the
transition line strength; it is symmetrical as between the initial and final states.
The observed radiation intensity is found by multiplying w by haj and by the
number NnJ of atoms in the source which are at the excitation level concerned. For
example, in a gas at temperature T this number is Nnj <* (2J + 1) exp(- EnjlT)\ the
factor 2J + 1 is the statistical weight of the level with angular momentum J.
Further deductions regarding transition probabilities in atomic spectra can be
obtained only for specific kinds of atomic states. We shall not here discuss methods
of calculating matrix elements where the degree of approximation has no clear
theoretical significance, but simply derive some relations valid for a fairly large
class of states (especially in light atoms) of the LS coupling type (see QAf, §72).
Such states are described not only by the total angular momentum but also by
definite values of the orbital angular momentum L and the spin S, which in this
case are conserved.
t Typical values of the dipole transition probability in the optical region of the spectra of atoms are of
the order of 108sec"'
t We shall now denote the quantum numbers of the initial and final states by unprimed and primed
letters respectively. The letters n, n' will denote all the quantum numbers which define the state of the
system, other than those shown explicitly.
§ The parity selection rule was first established by O. Laporte (1924).
§49
Radiation from Atoms: The Electric Type
183
Since the dipole moment is a purely orbital quantity, its operator commutes
with the spin operator, i.e. its matrix is diagonal with respect to the number S. For
the number L, the dipole moment is subject to the same selection rules as any
orbital vector (see QM, §29). Thus transitions between LS-type states are subject
to the following selection rules (in addition to (49.1), (49.2)):
S' - S = 0,
(49.5)
|L' - L| « 1 « L + L'.
(49.6)
It should again be stressed that these rules are approximate, and no longer apply
when the spin-orbit interaction is taken into account.
The rule (49.5), which forbids transitions between terms of different multiplicity, is valid not only for electric dipole transitions but for all electric transitions:
the electric multipole moments of all orders are orbital tensors, and therefore their
matrices are diagonal with respect to spin. For instance, for electric quadrupole
transitions, in addition to the general rules
|J' - J\ *s 2 ^ J + J\
PP' = 1,
(49.7)
in the case of LS coupling we have the further rules
S' - S = 0,
\L' - L\ ^ 2 ^ L + U.
(49.8)
The emission probability can be written in explicit form as a function of the
numbers S, J, J'. This is done immediately by means of the matrix elements of
spherical tensors in the addition of angular momenta. According to QM, (109.3), we
havet
Kn'L'SJl|d||nLS/)|2 = (2J + l)(2J'+l){y
£
^}2|<n'Ll|d||nL>|2.
(49.9)
Substitution of this in (49.4) gives
w(nLS/^nT'SJ') = ^ ( 2 J ' + l ) [ y
£
^}2|<n'Ll|d||nL)|2,
(49.10)
with co = o)(nLS -► n'L'S).$
A sum rule can be derived for these probabilities. The squares of the 6j-symbols
satisfy the summation formula (see QM, (108.7))
?(2',+»{y i' ?F-ÜTM-
(49U)
t The "angular momenta of sub-systems 1 and 2" in the formulae in QAf, §109, are here to be taken
as the orbital angular momentum and spin of the atom, whose interaction is neglected; the quantities f\lq
are represented by the orbital vector dq.
t In neglecting the spin-orbit interaction in the calculation of the matrix elements, we also neglect
the dependence of the frequencies on J and J\ i.e. the fine structure of the initial and final levels of the
atom,
184
§49
Radiation
Using this, we obtain from (49.10)
Ç w(nLSJ - n'L'SJ') = ^
<4912)
^ I T T \<n'LWinL>\2•
This quantity is thus found to be independent of the initial value of J.
For radiation from a gas whose temperature is much greater than the finestructure intervals in the atomic term nSL, the states with different / are uniformly
occupied, i.e. all values of J are equally probable. The probability that the atom is
at a level with some definite value of J is then
(2L+1)(2S + 1)'
(49.13)
i.e. is equal to the ratio of the statistical weight of the level to the total statistical
weight of the term nSL. Averaging the expressions (49.10) or their sums (49.12)
with respect to these probabilities is equivalent to multiplying by the factor (49.13).
This averaging will be denoted by a bar over the letter w. The total probability of
emission of all the lines in a spectral multiplet (formed by all possible transitions
between the fine-structure components of the two terms nSL and n'SL') is the sum
= 'Z'Z w(nLSJ -> n'L'SJ').
w(nLS-+n'L'S)
j
(49.14)
j'
Since, of course,
2 ( 2 J + 1) = (2S+1)(2L+1),
the result obtained for the total probability agrees with (49.12). Thus the relative
probability (which is the same thing as the relative intensity) of a single line is
wjnLSJ^n'L'SJ')
w(nLS-+n'L'S)
=
(2J + \)(2J' + 1) [V
2S +1
[J
J'
L
S\2
l)'
{
}
The analysis of the numerical values given by this formula shows that the
strongest lines in the multiplet are those for which AJ = AL (called main lines,
while the remaining components of the multiplet are called satellites). The intensity
of the main lines increases with the initial value of J.
Summation of the quantities (49.15) with respect to J' and J gives respectively
2
w(nLSJ-+n'L'SJ')
w(nLS -+n'L'S)
Y w(nLSJ ^n'L'SJ')
w(nLS -> n'L'S)
2J + 1
(2L + 1)(2S + 1)'
2Jf+{
* (2L + 1)(2S + 1)* J
(49.16)
§49
Radiation from Atoms: The Electric Type
185
Thus the total intensity of all the lines in a spectral multiplet having a common
initial or final level is proportional to the statistical weight of that common level.
We may also consider the hyperfine structure of atomic spectral lines. The
hyperfine splitting of atomic levels is due to the interaction of the electrons with
the spin of the nucleus if the latter is non-zero (see QM, §122). The total angular
momentum F of the atom (including the nucleus) consists of the total electron
angular momentum J and the angular momentum I of the nucleus. Each component
of the hyperfine structure of the level n, J has a different value of the quantum
number F.
The rigorous law of conservation of angular momentum now leads to a rigorous
selection rule for the total angular momentum F: for electric dipole radiation,
(49.17)
|F'-F|ssl*£F + F\
But, in view of the extreme weakness of the interaction of the electrons with the
spin of the nucleus, this interaction may be neglected in calculating the matrix
elements of the electric (and magnetic) moments of the electron shell of the atom.
Thus the previous selection rules regarding the electron angular momentum J and
the electron parity remain valid also. In particular, the latter selection rule prohibits
electric dipole transitions between hyperfine structure components of the same
term: all these levels have the same parity, whereas such transitions can occur only
between states of different parity.
Since the dipole moment pperator commutes with the nuclear spin, the dependence of the matrix elements on the numbers I and F can be found explicitly, the
calculations differing only by an obvious change of notation from those given
above for LS coupling. The probability of emission, summed over the final values
of the component of the total angular momentum F, is
w(nJIF -> n'J'IF') = j£p ^Tl
\(nTIF'ld\\nJIF)\2
(49.18)
o) = o)(nJ -+n'J'),
and the square of the reduced matrix element is
\(n'J'IF'\\d\\nJIF)\2 = (2F + 1)(2F' + 1){£
*'
' } V^MI|nJ>| 2 .
(49.19)
PROBLEM
The majority of the lines in the spectra of the alkali metals can be described as resulting from
transitions of a single outer (optical) electron in the self-consistent field of the rest of the atom, which
forms a configuration of closed shells; the state of the atom is governed by LS coupling. Under these
conditions, determine the relative intensities of the fine structure components of the spectral lines.
SOLUTION. The total angular momenta L and S = 2 of the atom are equal to the orbital angular
momentum and spin of the optical electron. The parity of the state is therefore (-\)L (the parity of the
closed configuration of the rest of the atom being positive). The parity selection rules therefore forbid
the dipole transition with L' = L, and so only transitions with L' - L = ± 1 are possible. The transitions
§50
Radiation
186
J=L +
nL
J=L--'
J=Ln! L-1
\
J--L
FlG. 1.
between components of the doublet levels n, L and n', L - 1 give only three lines, because of the
selection rule for J (Fig. 1). Their relative intensities (denoted by a, b, c) are most simply determined
from the rules (49.16), instead of using (49.15) directly. The ratios of total intensities of lines having each
initial (or final) level give two equations
b+c
2L
2L + 2'
a+b
2L
2L-V
whence
a:b:c
= (L+ 1)(2L - 1) : 1 : (L - 1)(2L + 1).
If L = 1, the lower level is unsplit, line c does not appear and alb = 2.
§ 50. Radiation from atoms: the magnetic type
The magnetic moment of an atom is equal, in order of magnitude, to the Bohr
magneton: /ut ~~ eh/mc. This differs by a factor a from the order of magnitude of the
electric dipole moment, d — ea — h2lme (since vie — a, we have ix — dvlc, as is to be
expected). Hence it follows that the probability of magnetic dipole (Ml) radiation
from the atom is about a2 times less than that of electric dipole radiation at the
same frequency. The magnetic radiation is therefore important in practice only for
transitions forbidden by the selection rules for the electric case.
The ratio of the probability of electric quadrupole (E2) radiation to that of Ml
radiation is, in order of magnitude,
£2
Ml
(ea2)Wlc7
172
a4mW
- ( E )
(50.1)
the quadrupole moment ~ea 2 , E ~ tflma1 is the energy of the atom, and AE the
change in energy in the transition. We see that, for medium atomic frequencies (i.e.
when AE ~ E), the probabilities of E2 and M1 radiation are of the same order of
magnitude (assuming, of course, that both are allowed by the selection rules). If,
however, AE < E (as for transitions between fine structure components of the
same term), then Ml radiation is more probable than E2.
The magnetic dipole transitions are subject to the rigorous selection rules
(50.2)
PP'=1.
(50.3)
Radiation from Atoms: The Magnetic Type
§50
187
For LS coupling, there are additional selection rules, which are even more
restrictive than in the electric case. This is because of a particular property of the
magnetic moment of the atom, which arises from the fact that all the particles
(electrons) in the system are identical: the magnetic moment operator of the atom
can be expressed in terms of the total orbital and spin angular momentum
operators:
|k = /jLo(l + 2S) = - jxo(J + S),
(50.4)
where /LL0 = \e\hl2mc is the Bohr magneton (see QM, §113). Owing to the conservation of total angular momentum, the operator J has only matrix elements
which are diagonal with respect to the energy, and in considering radiative transitions it is therefore sufficient to put p. = - jmoS.t
The angular momenta L and S are separately conserved when the spin-orbit
interaction is neglected. The spin operator is therefore diagonal in all the quantum
numbers n, S, L which belong to the unsplit term. In order for a transition to occur
at all, the number J must change. The selection rules are consequently
n' = n,
S' = S,
L' = L,
J'-J = ±l,
(50.5)
i.e. transitions are possible only between fine structure components of a single
term.
The emission probability can be calculated exactly in this case. By an appropriate change of notation in formula (49.10), we obtain
w(nLSJ^nLSJ')
= ^é(2J'+\){SJ
£
^}2|<S||S||S)|2.
The reduced spin matrix element with respect to the spin eigenfunctions is
(S\\S\\S) = V[S(S + 1)(2S + 1)];
(50.6)
see QM, (29.13). The 6/-symbol is
(S
U
J-\
S
L)2
1J
=
(L + S + J + 1)(L + S - J + 1 ) ( L - S + J)(S - L + J)
S(2S + 1)(2S + 2)(2J-1)2J(2J + 1)
(50.7)
see QM, §108, Table 10. The result is then
w(nLSJ -* nLS, J - 1) = | j ^ - | w(nLS, J - 1 -> nLSJ)
=jf^jhï(L+s+J+])(L+s-J+i)x
x(J + S- L)(J + L-S).
(50.8)
t An exception occurs in cases where the electronic angular momentum J of the atom is not
conserved: when the hyperfine structure is taken into account, when an external field is present, and so
on (see the Problems).
188
§50
Radiation
Transitions between hyperfine structure components of one level (whose
frequencies are in the radio wave range) cannot occur as electric dipole transitions,
since all the components have the same parity. E2 and M l transitions involve no
change of parity. But, owing to the relatively very small intervals in the hyperfine
structure, E2 radiation has a low probability compared with M l (cf. (50.1)), so that
these transitions occur as magnetic dipole transitions.
PROBLEMS
PROBLEM 1. Find the probability of an Ml transition between hyperfine structure components of a
single level.
SOLUTION. The transition probability is given by formulae (49.18), (49.19), in which the diagonal
reduced matrix element (nJ||ji||nJ) of the magnetic moment will now appear. Its value can be written
down immediately by noting that the total (not reduced) matrix element (nJM\fiz\nJM) determines the
splitting of the relevant level by the Zeeman effect (see QMy §113), and is - jiogM, where g is the Lande
factor. The reduced matrix element is (see QM, (29.7))
(nJ\\fi\\nJ) = ^ V[J(J + 1)(2J + l)l<n/MHfizln/M>
= -/iosV[J(J + l)(2J + l)].
The required probability is thus found to bet
w(nJIF -> nJI, F - 1) = ^Zr —
4 1 w(n/Jf F - 1 -> nJIF)
= 3ftc^2F°+ 1)F ( J * l
+ F
* 1)(J + l '
F
* 1)(F *J ~I ) ( F ~J
+ J)
'
This expression differs from (50.8) only by an obvious change of notation and the extra factor g2.
PROBLEM 2. Find the probability of an Ml transition between Zeeman components of a single
atomic level.
SOLUTION. This is a transition M-+M - 1 with the values of n and J unchanged; the transition
frequency is (see (51.3) below)ftco= jiogH, where g is the Lande factor. The matrix element of the
spherical component ^ut-i of the vector JJL is
\(nJ, M - l\^\nJM)\ - ^
^
j
^
IMWM>|
= - MogV&J - M + W + M))
(see QM, (27.12), and Problem 1). The transition probability is
»3
* = TJTi \(nJ, M - l|/i-.|nJM)|2
3ftV
t An interesting example is the transition between the hyperfine structure components of the ground
level (\si) of the hydrogen atom, where both El and £2 transitions are strictly forbidden, the latter by
the rule which prohibits a quadrupole transition with J + J'=l. This transition has a frequency
(o = 2ir x 1.42 x 109 sec"1 (wavelength A = 21 cm). Putting g = 2, I = i J = l F = 1, F' - 0, we obtain
w = 4a> Vo/3ftc3 = 2.85 x 1<T15 sec"'.
§51
189
Radiation from Atoms: The Zeeman and Stark Effects
§51. Radiation from atoms: the Zeeman and Stark effects
In an external magnetic field H (assumed weak), each atomic level with total
angular momentum J is split into 2J + 1 levels,
E M = E(0) + iaogMH,
(51.1)
where E(0) is the unperturbed level, /ut0 the Bohr magneton, g the Lande factor, and
M the component of J in the direction of the field (see QM, §113). Thus the
degeneracy with respect to directions of the angular momentum is entirely
removed.
Spectral lines resulting from transitions between two split levels are correspondingly split. The number of components of the line is determined by the
selection rule for the number M, according to which, for dipole radiation, we must
have
m = M - M ' = 0,±1.
(51.2)
In addition to this rule, the transitions with M = M' = 0 are forbidden if also
J' = J. This is seen immediately from the general expressions (QMy (29.7)) for the
matrix elements of an arbitrary vector.
The components resulting from transitions with m = 0 and m = ± 1 are called IT
and cr components respectively. Their frequencies are
ha>v = hœm + noH(g-g')M,
m
1
- g\M ± 1)]. J
hua = h(o + ^H[gM
(51.3)
In the particular case where g = g', we have
hcon = ha>{0\
hioa = ha)m + /uiogH,
(51.4)
whatever the value of M ; thus, in this case, the line is split into a triplet with an
undisplaced IT component and two a components lying symmetrically on either
side of it (called the "normal" Zeeman effect).
The total probability (for all directions) of emission of radiation is proportional
to the squared modulus |(n'J'M'|d_ m |nJM)| 2 . Hence, using formula (46.19) with
j = 1, we see that the relative probability of emission of each of the Zeeman
components of the spectral line is
(y
\M'
X
J
m
-M)
V
(51.5)
K
}
In the particular case of the "normal" Zeeman effect there are only three
components, each arising from transitions with all initial values of M for given m.
Since
£.(*r m - M ) "3
(516)
190
Radiation
§51
(see QM, (106.12)), the emission of all three components is equally probable in this
case.
The relative intensity of the Zeeman components when observed in a particular
direction (relative to the direction of the magnetic field applied to the source) is of
greater interest, however. According to (45.5), the probability of emission (and
therefore the line strength) in a given direction n is proportional to 2|e*-d/;|2,
where the summation is over the two independent polarizations e which are
possible for a given n.
For observation along the field (along the z-axis), this sum is
\(dX)fi\2+\(dy)fi\\
or, in spherical components,
|(di)/i|2 + |(rf-i)/«|2.
This means that only the two a components (m = ± 1) are observed in the
longitudinal direction (along the field). Their intensities are proportional to
(A/;,
< 51 - 7 >
il - M ) -
These lines have definite values of the component m of the angular momentum in
the direction of propagation, and either right-hand (m = 1) or left-hand (m = - 1)
circular polarization (see §8).
For observation in a direction perpendicular to the field (along the x-axis, say),
the intensity is proportional to the sum
\(dz)fi\2 + |Wy)/.f = \(do)fi\2 + M(d,)/i|2 + |(d-,)/,f}.
Thus two cr components and a TT component are observed in the transverse
direction, with respective intensities proportional to
\{MT\
±\
-M)
and
(M
o
-M);
(518)
the intensities of the a components are half as great as in the case of longitudinal
observation. The TT component is linearly polarized along the z-axis; the a
components, as observed in this direction, are linearly polarized along the y-axis.
The relative intensities of the Zeeman components are seen to be entirely
determined by the initial and final values of J and M, and not by any other
properties of the levels.
The selection rules forbid electric dipole transitions between Zeeman components of the same level, since all these have the same parity. Such transitions
occur as magnetic dipole transitions, for the same reason as was mentioned at the
end of §50 in respect of transitions between hyperfine structure components of a
§51
191
Radiation from Atoms: The Zeeman and Stark Effects
level. Because of the selection rule for the number M, the transitions occur only
between adjacent components (M' - M = ± l).t
The splitting of atomic levels in a weak electric field (the Stark effect), unlike
that in a magnetic field, does not cause complete removal of the degeneracy with
respect to directions of the angular momentum. All the levels except those with
M = 0 remain doubly degenerate, each corresponding to two states with angular
momentum components M and - M .
The calculating of the relative intensities of the Stark components of a spectral
line is exactly similar to that given above for the Zeeman effect.t It must be remembered that the intensity of the TT components includes contributions from the
transitions M-*M and -M-^>-M
(when M 7*0), and that of the a components
includes contributions from the transitions M -» M ± 1 and - M -> - (M ± 1).
Hence, for example, in transverse observation the intensities of the TT components
are proportional to
i(y
*
VM 0
J
V
-M/'
and those of the a components are proportional to the sums
u J'
i
J \2 u
J'
i
J\2
/ J'
2\M±1
+ 1 -M)
2 V - M T 1 ±1 MJ
\M±1
i
+1
J \2
-M/
(when all the numbers in the second row change sign, the 3j-symbols can at most
change sign, so that their squares are unaltered).
In an external field, even if it is weak, the total angular momentum J is no
longer strictly conserved; in a uniform field, only the angular momentum component
M is exactly conserved. Thus, in radiative transitions in a weak field, the conservation of angular momentum need not be rigorously maintained, and the atomic
spectra may contain lines which are forbidden by the usual selection rules.
The calculation of the intensities of these lines is equivalent to the calculation
of the corrections in the dipole moment matrix, which in turn requires the
determination of corrections to the wave functions of stationary states. In the first
approximation of perturbation theory (with respect to the weak external field), the
wave function includes "admixtures" of states which are connected to the initial
state by non-zero matrix elements of the perturbation (-E • d in the electric field):
the admixture of a state fa in a state i/^ is
-E-d2l
t The frequencies of these transitions are usually in the range corresponding to centimetre
wavelengths, and are observed in absorption and induced emission (electron paramagnetic resonance):
the absorbing atoms are in a strong constant magnetic field (which causes the Zeeman splitting) and a
weak radio-frequency field of the resonance frequency.
t This refers to the quadratic Stark effect, which occurs in all atoms except hydrogen (see QM, §76).
The field is assumed so weak that the level splitting which it causes is small even in comparison with the
fine structure intervals.
192
§52
Radiation
Thus the matrix element of the "forbidden" transition contains a term
- (E • d2i)d32
E} ~E2
which is not zero if transitions from the "intermediate" state 2 to the initial state 1
and final state 3 are allowed.
§ 52. Radiation from atoms: the hydrogen atom
The hydrogen atom is the only one for which the transition matrix elements can
be completely calculated in an analytical form (W. Gordon, 1929),
The parity of a state of the hydrogen atom is (-1)1, i.e. is uniquely determined
by the orbital angular momentum of the electron (the number J, which defines the
parity of the state, retains its significance for the exact relativistic wave functions,
i.e. when the spin-orbit interaction is taken into account). The parity selection rule
therefore strictly forbids electric dipole transitions without change of I; only
transitions with / -* / ± 1 are possible. There is, however, no restriction on the
change in the principal quantum number n.
The dipole moment of the hydrogen atom is equivalent to the position vector of
the electron: d = er. Since the electron wave function in the hydrogen atom is the
product of an angular part and the radial function Rni, the reduced matrix elements
of the position vector can also be written as the product
<n\ I - l||r||nl> = <l - 1||H|I> j Rnj-xrR«r2 dry
0
where </ - l||v||I) are the reduced matrix elements of the unit vector v in the
direction of r. These are
(I - 1||HID = <IMI - D * = i VI;
see QM, (29.14). Thus
<n\ I - \\\r\\nl) = -(nl\\r\\n', / - 1)= /V/ j A...,-,«*!-3 dr.
o
(52.1)
The non-relativistic radial functions of the discrete spectrum of the hydrogen
atom are given in QM, (36.13):t
Rnl = M
n (2l + l)\ V(n-I- ) 1)! ( 2 r ) ' <rr"'F(~ n + l + *'2l + 2'2r/n)-
(52 2)
'
t In the present section, atomic units are used. In ordinary units, the expressions given below for the
matrix elements of the coordinate are multiplied by h2/me2 (or by h2lmZe2 for a hydrogen-like ion with
atomic number Z).
Radiation from Atoms: The Hydrogen Atom
§52
193
The integral in (52.1), containing the product of two confluent hypergoemetric
functions, is calculated by means of the formulae in QM, §f.t The result is
(n,l-
\\\r\\nl) - i V/
m
_ {y V ( n _ , _ 1 ) ! ( „. _ ,)»
(n
+
. i\n + n'-2l-2
nT
-'
x | F ( - n + I + 1, - n' + I,21, - 4nn'l(n - n'f) + I - 1, - «' + I, 21, -4nn7(n - n')2)},
~ i^Tpf^-"
(52.3)
where the F(a, ß, 7, 2) are hypergeometric functions. Since the parameters a and ß
in this case are negative integers or zero, these functions reduce to polynomials.$
For reference, the following are the expressions obtained from (52.3) in some
particular cases (the values of / being indicated by the spectroscopic symbols
s,p, d,...):
2
2 6
2 _ 2 ' V ( n - l)(n - 2) "'
K2*||r||np>P =
5T+7"
(n + 2)
|<2pl|rl|nd)P =
2l9w
^
)
2
(
p
2)2n
(52.4)
7
" ,
Formula (52.3) is not applicable to transitions with no change in the principal
quantum number n (transitions between the fine structure components of a level).
In this case (n = n'), the integration is carried out by expressing the radial functions
in terms of generalized Laguerre polynomials:
2
l
l(n-l-l)\
ln(2r\
2l+l(2r\
(52.5)
In the integral
j i?n.,-,J?„,r3 drocj e''p2,+2L2n'++/(p)L2n'+-,L1(p) dp,
we replace one of the polynomials by its expression in terms of a generating function
(see QM, §d):
T 21+1/ ^
(" + ' ) !
p
■""(i)"""''(e''""t[)-
t In the notation used there, we have to calculate the integral J2?+2(~ n + / + 1, - n' + 0- This is done
by means of formulae (f.12)—(f. 16).
X Numerical tabulations of the matrix elements and transition probabilities for hydrogen are given
by H. A. Bethe and E, E. Salpeter, Handbuch der Physik 35, 88-436, Springer, Berlin, 1957.
194
§52
Radiation
After n — / - 1 integrations by parts, we obtain an integral of the form
JV p p n+ '(^)" ' \pLT,Up))dp,
0
in which we replace the Laguerre polynomial by its explicit form:
L:<P> = <-.r n ! "g( m « + k )tf.
After the differentiation in the sum, only three terms remain, and the integration is
then elementary. The result is simply
<n, I - \\\r\\nl) = iVl • \nV(n2-
I2).
(52.6)
The integral
1 Rn i-xRnlr3 dr = \ Xn\i-\(rxni) dr,
0
0
where xm = rR*h is the coefficient in the expansion of the function rxni in terms of
the orthogonal functions Xn\i-\ (n' = 1,2,...). The sum of the squared moduli of
these coefficients is equal to the integral of the square of the expanded function.!
Hence
2 |<nM-l||r||nl>| 2 = I (r2xlidr.
n'
(52.7)
J
o
Using the known expression for the mean square of r in the state nl (see QMy
(36.16)), we obtain the sum rule
2 \(n\ I - l||r||nl>|2 = \ln2[5n2 + 1 - 31(1 + 1)].
(52.8)
n'
For given n, / and large n\ the matrix element of the transition
decreases according to
nl-*n'l'
|<n'!,||r||n/)|2oc3/n'3,
(52.9)
as can be seen both from the particular expressions (52.4) and from the general
formula (52.3). This result is to be expected: the Coulomb levels of energy
E' = - l / 2 n ' 2 have an almost continuous distribution when n' is large, and the
t The summation is over states of both the discrete spectrum and the continuous spectrum.
§52
Radiation from Atoms: The Hydrogen Atom
195
f
probability of a transition to a level in the interval dE is proportional to the density
of these levels, which in turn is proportional to n'~\
The Stark effect in hydrogen has the peculiarity that the splitting is proportional
to the first power of the electric field (QM, §77). Here the field is assumed to be
both weak enough for perturbation theory to be applicable and such that the level
splitting is large compared with the fine structure of the levels. Under these
conditions, the magnitude of the angular momentum is not conserved, and the
levels have to be classified by the parabolic quantum numbers nu n2, m. The last of
these, the magnetic quantum number m, again determines the component of the
orbital angular momentum along the z-axis (the direction of the field), which is
conserved under the conditions stated (neglecting the spin-orbit interaction). It is
therefore governed by the ordinary selection rule
m'-m=0,±l.
(52.10)
There is no restriction on the changes in nx and n2.
The matrix elements of the dipole moment in parabolic coordinates can also be
calculated analytically. The resulting formulae, however, are very lengthy, and will
not be given here.t
PROBLEMS
PROBLEM 1. Find the Stark splitting of hydrogen levels when it is small compared with the fine
structure intervals (but large compared with the Lamb shift).
SOLUTION. Under the conditions stated, there remains a twofold degeneracy of the unperturbed
levels with / = j ± 5, and the Stark splitting is therefore again linear in the field. The splitting A is
determined from the secular equation
I
-A
- E(dz)\i\
A
A
^ -.„ . . ,
the suffixes 1 and 2 correspond to states with I = j ± 2 and a given magnetic quantum number m ; the
perturbation V = - Edz is diagonal in m and has no elements diagonal in J. The matrix element of the
orbital quantity dz is calculated by means of formulae (29.7) and (109.3) in QM:
<J. 1 - 1 . m|A|Jlm> = V [ j 0 . +
1X2J
^ 0 ] <j. I - 1H
<jj-iiidiijj> = - ( 2 j + i ) { ' : 1 j ] } < I - I I I
where we must put / = j + r, the quantity (/ - l||d||() is taken from (52.6), and the result is
PROBLEM 2. Determine the probability of photon emission in the transition 2sj->lsj of the
hydrogen atom (G. Breit and E. Teller, 1940).
SOLUTION. The process is strictly forbidden by parity for an El transition, and by the rule (46.15)
for an £ 2 transition. We have therefore to calculate the probability of an M1 transition, given by (47.5).
t These formulae and the corresponding numerical tabulations are to be found in the work by Bethe
and Salpeter cited above.
1%
§52
Radiation
In the present case (I = 0), however, the magnetic moment is a purely spin quantity, and its matrix
element is zero if the spin-orbit interaction is neglected, because of the mutual orthogonality of the
orbital wave functions with different principal quantum numbers. This means that, to obtain a result
other than zero, the PaulFs equation approximation is insufficient, and we must start from the complete
Dirac's equation..
In the standard representation of the wave functions, the transition current ist
jfi = ilßjaipi = <f>Jaxi + x1<*4>iAccording to (35.1), (24.2) and (24.8), the wave functions of the states with I = 0, j = 1 are
*
4irl-ig(rX*-n)w(m)/f
\x)
where n = r/r, and w(m) is a real three-dimensional unit spinor corresponding to the spin projection value m.
Thus
jfi = ^JT {//g.H>/cr(a • n)wi - g//,vv/(cr • n)awi}.
Substituting this in (47.4) and carrying out the integration over the directions of n, we find
fifi = - (el6i)wfKT x awj = - \ewfOwj
(from the Pauli matrix commutation rules, a x a = 2ia); here
I = J</*.'+/®)r 3 «fr.
o
(1)
The photon emission probability (47.5), summed over the values of m/, is
w = (4e V/27)Wi<rW 2 = 4 e V / 2 / 9 .
(2)
From (35.4) we have, with K = - 1.
5
e + m + alr
2m
\
r )Aml
in the second term, the exact function / is replaced by the non-relativistic radial function R. With the
approximation g = R'/2m, the integral
1 =
2 ^ / {RfRi)'r' dT = " 2^ / RfRif2 dT = °'
o
o
since Rf and R< are orthogonal. In the next approximation, using (3), we find
' Ä 2ÏT / (M»1
dr
+ 4 ^ / { R , f R i ( E i " €f) ~ 7 <*'*>'>r3 d r
o
Since, from the orthogonality of the exact functions </* and ^/, when K, = K/,
J(/i// + g*/)r 2 dr = 0,
t In this Problem, relativistic units are used.
(3)
(4)
§53
Radiation from Diatomic Molecules: Electronic Spectra
197
the first term in (4) may be written, after integration by parts, as
0
0
0
A calculation of the integral, with the functions
Rf = 2(ma)V m o \
Rt = (1/V2)(ma)3(l - \mar)eJlmar
(see QM, §36) and the energy difference
w = ei - Ef = 2ma2(l - 1/22) = gma2,
gives J = 23/2a2/9m. Hence the transition probability is (in ordinary units)
M =
' -3^W^
=
2:^737ft
= 5
•6X,0
SCC
'
The corresponding lifetime of the 2s\ state is very long, and in practice it is much more likely that
de-excitation will occur by the simultaneous emission of two photons; see the next-to-last footnote to
§59.
§53. Radiation from diatomic molecules: electronic spectra
The specific features of molecular spectra are mainly due to the partition of the
energy of the molecule into electronic, vibrational and rotational parts, each of the
latter two being small compared with the previous one. The level structure of
diatomic molecules has been described in detail in QM, Chapter XL Here we shall
consider the resulting pattern of the spectrum and the calculation of line strengths.t
Let us take first the general case, in which the electronic state of the molecule
(and therefore also, in general, the vibrational and rotational states) changes in the
transition. The frequencies of such transitions lie in the visible and ultra-violet
regions of the spectrum. They are spoken of collectively as the electronic spectrum
of the molecule. We shall always be considering electric dipole transitions; those of
other types are of little importance in molecular spectroscopy.
As with dipole transitions in any system, the following selection rule applies to
the total angular momentum J of the molecule:
| j ' - j r | * £ i « j + jr\
(53.1)
In the present case, the strict selection rule regarding the parity of the system
corresponds to a selection rule regarding the sign of the level. (In the customary
terminology of molecular spectroscopy, states having wave functions which do or
do not change sign on inversion, i.e. when the coordinates of the electrons and the
nuclei change sign, are called negative and positive states respectively.) Thus we
have the rigorous rule
+ ->-,
--»+.
(53.2)
t The discussion below is based on QM, §§78 and 82-88. For brevity, we shall not make constant
reference to those sections.
198
Radiation
§53
If the molecule consists of identical atoms (with nuclei of the same isotope), the
levels can be classified with respect to interchange of the coordinates of the nuclei:
symmetric (s) levels, with wave functions which do not change sign under this
transformation, and antisymmetric (a) levels, with functions which dd change sign.
Since the electron dipole moment operator is unaffected by this transformation, its
matrix elements are non-zero only for transitions without change of this symmetry:!
s^s,
a^>a.
(53.3)
This rule is not absolutely rigorous, however, since the existence of a given
symmetry property of a level depends on a certain definite value of the total spin I
of the nuclei in the molecule. Owing to the extreme weakness of the interaction
between the nuclear spins and the electrons, the spin I is very nearly conserved,
but not exactly. When this interaction is taken into account, I does not have a
definite value, the symmetry property (5 or a) is not conserved, and the selection
rule (53.3) no longer applies,
The electron terms of a molecule consisting of identical atoms are also described
by their parity (g or w), i.e. the behaviour of the wave functions when the electron
coordinates (measured from the centre of the molecule) change sign while the
coordinates of the nuclei remain unchanged. This property is closely related to the
nuclear symmetry and the sign of the rotational levels belonging to this term. The
levels which belong to an even (g) electron term can be s + or a - , and those
belonging to an odd (u) term can be s - or a +. The rules (53.2) and (53.3) therefore
give the further rule
g-»u,
M->g.
(53.4)
The rule (53.4) remains approximately valid for molecules consisting of different
isotopes of the same element. Since the nuclear charges are equal, we can consider
the electron term with fixed nuclei, and thus have a system of electrons in an
electric field which possesses a centre of symmetry at the midpoint of the line
joining the nuclei. The symmetry of the electron wave function with respect to
inversion in this point determines the parity of the term, and since the electric
dipole moment vector changes sign under this transformation, we arrive at (53.4).
The rule as derived in this way'is only approximate, because the nuclei have been
regarded as fixed, and it therefore ceases to be valid when the interaction between
the electron state and the rotation of the molecule is taken into account.
Further selection rules depend on specific assumptions concerning the relative
magnitude of the different interactions in the molecule (i.e. the type of coupling), and
therefore can only be approximate.
The majority of the electron terms in diatomic molecules belong to coupling
type a or b. Both these have the property that the coupling of the orbital angular
momentum with the axis (the electric interaction between the two atoms in the
molecule) is large compared with all other interactions. The quantum numbers A
t This rule is clearly valid for transitions of any multipole order.
§53
199
Radiation from Diatomic Molecules: Electronic Spectra
and S therefore exist, these being respectively the component of the orbital angular
momentum of the electrons along the axis of the molecule and the total spin of the
electrons. The operator of an orbital quantity, the electron orbital angular momentum, commutes with the spin operator, so that
S ' - S = 0 (cases a and b).
(53.5)
The change in A must satisfy the selection rule
A; - A = 0, ±1 (cases a and b),
(53.6)
and for transitions between states with A = 0 (2 terms) there is a further rule
2 + -»2 + ,
2"-*2~ (cases a and b).
(53.7)
(The states 2* and 2" differ as regards behaviour under reflection in a plane
through the axis of the molecule.) The rules (53.6), (53.7) are obtained by considering the molecule in a system of coordinates fixed to the nuclei (see QAf, §87); the
rule (53.6) is analogous to the selection rule for the magnetic quantum number in
atoms.
The coupling types a and b differ as regards the relation between the spin-axis
interaction energy and the rotation energy (the intervals between rotational levels).
In case a the former is greater, in case b it is much smaller. We shall now examine
these cases separately.
Case a. Here the quantum number 2 exists, which is the component of the total
spin along the axis of the molecule (and therefore so does the number ft = 1 + A,
the component of the total angular momentum). If both the initial state and the final
state belong to case a, then we have the rule
2 ' - X = 0 (case a),
(53.8)
which follows from the fact, already mentioned, that the dipole moment commutes
with the spin. From (53.6) and (53.8) it follows thatt
ft'-ft = 0,±l.
(53.9)
If ft = ft' = 0, then in addition to the general rule (53.1) the transitions with J' = J
are forbidden:t
J ' - J = ± l when ft = ft'= 0 (case a).
(53.10)
t This rule remains valid in case c also (where the coupling of the orbital angular momentum with
the axis is small compared with the spin-orbit coupling) although the numbers A and 1 do not separately
exist.
t This rule is analogous to the prohibition of atomic transitions with J = J' when M = M' = 0 (see
55IX but that rule was of possible interest only in the presence of an external field. Here the rule follows
immediately from formula (53.12) below; the 3j-symbol ( n
/ ' + J + 1 is od<f.
ft
ft)
is zero if J' = J, since the sum
§53
Radiation
200
Let us consider transitions between any two specified vibrational levels belonging to two different electron terms (of type a). When the fine structure of the
electron term is taken into account, each of these levels splits into several
components, the number of which, 2S + 1, must be the same for both, according to
the rule (53.5). According to the rule (53.8), each component of one level combines
with only one component of the other level, having the same value of X.
Let us next take a pair of levels with the same X; the values of Ct and CÏ can
differ (like A and A') by 0 or ±1. When rotation is taken into account, each level
splits into a series of levels with different values of J and J' in the ranges J ** |ft|,
J ' ^ | f i ' | . The dependence of the transition probabilities on these numbers can be
derived in a general form (H. Hönl and F. London, 1925).
The matrix element of the transition n AÜJMj -* n'A'ü'J'M ; (where n denotes the
characteristics of the electron term other than O and A) is
|<n'AÏÎ7'MHMnjMj)| =
-VK2J
+
1XU'+»](_';,,
\,
£)(_JM,
*
^Kn'AKlnA».
(53.1.)
where dq and dq are respectively the spherical components of the dipole moment
vector in the fixed coordinate system xyz and in the "moving" system £TJ£ with the
£-axis along the axis of the molecule. This formula is derived by means of QM,
(110.6). The matrix elements (n'A'|d q |nA) are independent of the rotational quantum numbers J,J\ and depend only on the characteristics of the electron terms
(and in this case are also independentt of the number 2 ; the numbers Ü' = A'+ 2
and H = A + 2 are therefore omitted in the notation for the matrix element.
The probability of the transition nAÜJ -»n'A'O'J' is proportional to the square
of the matrix element (53.11) after summation over Mj. Using the formula QM
(106.12):
ftK-M'j
q
Mj)
2J + V
we obtain
w(nAn/^n'A'n7,) = (2J'-+l)(jJ^,
fl^n
j^) B ( n \ n ; À \ À ) ,
(53.12)
where the coefficients B are independent of J and J' (we are, of course, neglecting
the relatively very small difference in the frequencies of transitions with different J
and J').t
t This can be shown in the same way as was done for the scalar / in QMy beginning of §29. In the
present case the operator of the vector quantity d commutes with that of the vector S, which is
conserved (in the zero-order approximation), and 2 is the component of S along the £-axis in the rotating
coordinate system in which the condition of commutability of d and S has to be considered.
t Each of the rotational levels J considered splits into two when A-doubling is taken into account;
one of the two is positive and the other negative. Thus, instead of one transition J -► J\ we have, using
the selection rule (53.2), two transitions: from the positive and negative components of the level / to the
negative and positive components, respectively, of the level /'. The probabilities of these transitions are
equal.
§53
Radiation from Diatomic Molecules: Electronic Spectra
201
If we sum (53.12) with respect to J \ then (because of the orthogonality of the
3j-symbols, QM (106.13)) the result is simply B(n\ n; A', A). Thus the total
probability of transitions from a rotational level J of the state Ü to all levels J' of
the state fT is independent of J.
Case b. Here the quantum number K exists, which is the angular momentum of
the molecule without regard to its spin, as well as the total angular momentum J.
The selection rules for K are the same as the general selection rules for any orbital
vector quantity (such as the electric dipole moment):
\K' - K| « 1 « K + K'
(case b),
(53.13)
together with the prohibition of a transition with K = K' when A = A' = 0 (corresponding to (53.10)):
K' - K = ± 1 when A = A' = 0.
(53.14)
Let us consider transitions between rotational components of specified vibrational levels of two electron states belonging to type b. The probabilities of such
transitions are given by the same formula (53.12), with K and A instead of J and (Î.
When the fine structure is taken into account (for S # 0 ) , each rotational level K
splits into 2S + 1 components with J = |K - S | , . . . , K + S, and so a multiplet
appears in place of the single line J -»/'. Since in this case we have addition of the
angular momenta K and S, which are free (i.e. not coupled to the axis of the
molecule), the formulae for the relative transition probabilities for the various lines
in the multiplet are the same as the corresponding formulae (49.15) for the fine
structure components of atomic spectra, where the corresponding angular momenta
(in the case of LS coupling) are L and S.
Thus we have examined the selection rules governing the possible spectral lines
in all the fundamental cases that can occur in diatomic molecules.
The group of lines arising from transitions between rotational components of
two given electronic-vibrational levels forms what is called in spectroscopy a band\
,the lines in a band are very close together, because the rotational intervals are
small. The frequencies of these lines are given by the differences
hcojj = constant + BJ(J + 1) - B'J'W + D,
(53.15)
where B and B' are the rotational constants in the two electronic states; in order to
avoid unnecessary complications, the electron terms are assumed to be singlets. For
J' = J, J ± 1, formula (53.15) is represented graphically (Fig. 2) by three parabolic
branches, whose points for integral J give the values of the frequencies. (The
arrangement of the branches in Fig. 2 corresponds to the case B' < B. If B' > B, their
open ends are towards small values of coy and the branch with J' = J - 1 is the
highest.)t The existence of a branch which passes through an apex is seen from the
diagram to cause the lines to become increasingly dense towards a certain limiting
position (the head of the band).
t The series of lines corresponding to transitions with J' = J + 1, J, J - 1 are called the P, Q and R
branches respectively.
202
Radiation
FIG.
§53
2.
K %
In connection with line strengths, mention should be made of the curious effect of
alternating intensities in certain bands of the electronic spectra of molecules
consisting of atoms of the same isotope (W. Heisenberg and F. Hund, 1927). The
symmetry conditions pertaining to the nuclear spins have the result that, in the
electron X terms, the rotational components with even and odd K have opposite
symmetry with respect to the nuclei, and therefore different nuclear statistical weights
gs and ga (see QMt §86). According to the rule (53.14), only J' = J ±\ is allowed in
transitions between the two different X terms, and according to the rule (53.4), one of
the X terms must be even and the other odd. The result is that, for a given value of
/' - /, transitions with successive values of J take place alternately between pairs of
symmetric levels and pairs of antisymmetric levels, as shown in Fig. 3 for the example
of the states Xg and X«. The observed line strength is proportional to the number of
molecules in the initial state concerned, and therefore to its statistical weight. Thus the
intensities of successive lines (J = 0,1,2,...) will be alternately greater and less, being
alternately proportional to g, and ga (this behaviour being superimposed on the
monotonie variation given by formula (53.12)).t
There are no exact selection rules concerning the change in the vibrational
quantum number in transitions between two different electron terms. There is,
however, a rule (Franck and Condon's principle) whereby the most probable
change in the vibrational state may be predicted. It is based on the fact that the
t Here we assume that all the states having different values of the total nuclear spin are uniformly
occupied.
I
§54
Radiation from Diatomic Molecules : Vibrational and Rotational Spectra
203
U(r)
U'(r)l
/
E
£'
FIG.
4.
motion of the nuclei is quasi-classical, because of their large mass (cf. the discussion
of pre-dissociation in QM, §90).t
In the integral which determines the matrix element of the transition between
vibrational states E and £ ' of electron terms U(r) and U'(r), the most important
range is the neighbourhood of the point r = r0 where
U(ro)-U'(r,) = E-E\
(53.16)
i.e. the momenta of the relative motion of the nuclei in the two states are the same,
p = p'. For a given value of E, the transition probability as a function of the final
energy E' increases as each of the differences E - U and E'— U' decreases, and is
a maximum when
E - l/(r 0 ) = E' - l/'(r 0 ) = 0,
(53.17)
i.e. when the "transition point" r0 (the root of equation (53.16)) coincides with the
classical turning point of the nuclei. (Fig. 4 illustrates graphically this relationship
between E and the most probable E'.) This can be intuitively expressed by saying
that the transition is most probable near the turning point of the nuclei, where they
spend a relatively large amount of time.
§ 54. Radiation from diatomic molecules: vibrational and
rotational spectra
The selection rules and formulae for transition probabilities given in §53 remain
valid for transitions in which the electronic state of the molecule is unchanged.t
Here we shall discuss only some particular features of these transitions.
First of all, the selection rule (53.4) prohibits all (dipole) transitions without
change of electronic state in molecules consisting of like atoms, since in such a
transition the parity of the electron term would remain unaltered. It follows from
the discussion in §53 that such transitions can be allowed only when the interaction
between the nuclear spins and the electrons is considered or, for molecules of
t Strictly speaking, it is also necessary that the vibrational quantum number should be sufficiently
large.
t Transitions in which the vibrational (and therefore the rotational) state changes form what is called
the vibrational spectrum of the molecule; this lies in the near infra-red (wavelengths <20 iim).
Transitions in which only the rotational state changes form the rotational spectrum in the far infra-red
(wavelengths >20fim).
204
Radiation
§54
different isotopes of the same element, because of the effect of the rotation on the
electronic state.
The calculation of the dipole moment matrix elements is reduced (by the
formulae in QM, §87) to a calculation in a coordinate system rotating with the
molecule. The wave function of the molecule in these coordinates is the product of
the wave function of the electrons for a given distance r between the nuclei and
the wave function of the vibrational motion of the nuclei in the effective field U(r)
of the electrons and the nuclei. When the influence of the motion of the nuclei on
the electronic state is entirely neglected, the initial and final electron wave functions for the transitions in question are the same. The integration over the electron
coordinates therefore gives, in the matrix element, simply the mean dipole moment
d of the molecule (which is obviously along its axis) as a function of the distance r.
Owing to the smallness of the vibrations, the function d(r) can be expanded in
powers of the vibrational coordinate q = r - r0. For transitions which involve a
change in the vibrational state, the zero-order term in the expansion does not occur
in the matrix element, because the wave functions are orthogonal for vibrational
motion in the same field l/(q), and this leaves the term which is proportional to q.
If the vibrations are regarded as harmonic, it follows from the known properties of
the linear oscillator (QM, §23) that the matrix elements are zero except for
transitions between adjacent vibrational states; thus, for the vibrational quantum
number t; we have the selection rule
v'-v
= ±\.
(54.1)
This rule is not valid, however, when the vibrations are not harmonic and the
subsequent terms in the expansion of the function d(q) are taken into account.
For a purely rotational transition (with no change in the vibrational state also),
the matrix element of the dipole moment component along the moving f-axis can
simply be equated to the mean dipole moment of the molecule, d = d(0).t The
probability of the transition J -* J - 1 is then
» | « / . U - l ) . ^ f j g j ,
(54.2)
and from this formula we can calculate not only the relative probabilities (as in
(53.12)) but also the absolute probabilities. Formula (54.2) is for case a; in case b, J
and H are to be replaced by K and A.
The frequencies of the purely rotational transitions are given by the differences
of the rotational energies BJ(J + 1):
hù>u-x = 2BJ.
Successive lines are at equal distances 2B.
t It is obvious from symmetry that d = 0 in a molecule consisting of like atoms.
(54.3)
§55
Radiation from Nuclei
205
§55. Radiation from nuclei
For y radiation from nuclei it is usually true that the dimensions of the system
(the radius R of the nucleus) are small compared with the wavelength of the
photon. But the intervals between the nuclear levels, and therefore the energy of
the 7 quantum, are generally small compared with the energy per nucléon in the
nucleus. The quantity R/À is thus not directly related to the velocity vie of the
nuclçons in the nucleus, and is in general considerably less than vfc. Accordingly,
the probability of Ml radiation is usually greater than that of E, / + 1 radiation (cf.
the beginning of §50).
The general selection rules for the total angular momentum (the "spin") of the
nucleus and for the parity are the same as for radiation from any system. The
characteristic feature of nuclear radiation is that transitions of high multipole order
commonly occur. Unlike atoms, whose radiation is usually of the electric dipole
type, nuclei undergo such transitions comparatively rarely at low energies, on
account of the selection rules.
If a radiative transition of a nucleus can be regarded as a single-particle
transition (a change of state of one nucléon while the state of the rest of the
nucleus remains unchanged), then there are additional selection rules regarding the
angular momentum of that nucléon, but in practice such "single-particle" selection
rules are found to be only approximately obeyed.
The selection rules for the isotopic spin are peculiar to nuclei. The component
T3 of the isotopic spin is determined by the atomic weight and atomic number of
the nucleus:
T^\{Z-N)
= Z-\A.
When the value of T3 is specified, the absolute magnitude of the isotopic spin can
take any value T ^ |T3|. The selection rule for the number T in radiative transitions
arises because the electric and magnetic moment operators of the nucleus, expressed in terms of the isotopic spin operators of the nucléons, are the sums of a scalar
and the x3-component of a vector in isotopic space; see QMy §116. Their matrix
elements are therefore zero unless
T ' - T = 0,±1.
(55.1)
This rule itself, however, imposes no special restrictions on transitions in light nuclei
(the only ones for which the isotopic spin can be said with reasonable accuracy to be
conserved); the low levels of these nuclei in fact include none with T > 1.
For El transitions, however, there is a further rule which arises because there
is no isotopic-scalar part for the electric dipole moment, and the operator of this
moment is simply the x3-component of the isotopic vector (see QM, §116). Hence,
if T3 = 0, transitions with AT = 0 are also forbidden, and so, in nuclei having equal
numbers of neutrons and protons (N = Z,A = 2Z), El transitions are possible only
if
T'-T = ±l
(T3 = 0).
(55.2)
§55
Radiation
206
The accuracy with which this rule is obeyed depends, of course, on the exactness
of conservation of the isotopic spin of the nucleus.
The probability of E l transitions in the nucleus is influenced also by the recoil
of the rest of the nucleus when a particular nucléon moves. The result is that the
protons contribute to the dipole moment with an effective charge e(\ - Z/A) instead
of e, and the neutrons with a charge -eZjA instead of zero (see QM, §118). The
decreased effective charge of the proton causes some reduction in the probability
of £ 1 transitions.
The energy levels of non-spherical nuclei have a rotational structure, and
therefore such nuclei show a characteristic rotational structure of the 7-ray
spectrum.
The symmetry of the field in which the nucléons move in a "fixed" nonspherical (axial) nucleus is the same as the symmetry of the field in which electrons
move in a "fixed" diatomic molecule consisting of like atoms (the point group C*h).
The symmetry properties of the levels of a non-spherical nucleus (and hence the
selection rules for the matrix elements) are therefore analogous to those of the
diatomic molecule (see QM, §119). In particular, electric dipole transitions within a
single rotation band (i.e. without change in the internal state of the nucleus) are
forbidden, as in a diatomic molecule of like atoms; cf. §54. Such transitions
therefore occur as E2 or M1 transitions. In the former the total angular momentum
J of the nucleus can change by 2 or 1, in the latter by 1.
According to (46.9), the probability of a quadrupole transition, summed over
values of the component M' of the total angular momentum of the nucleus in the
final state, is
WE2 = j^p
g KJ'nMiQ^-m|jnM)|2,
where J is the total angular momentum of the nucleus, H its component along the
axis of the nucleus, and m = M - M ' . By means of QM, (110.8) this sum can be
expressed in terms of the squares of given quantities, the diagonal (with respect to
the internal state of the nucleus) quadrupole transition moments Q2A, defined
relative to coordinates £TJ£ moving with the nucleus. Here A = Ct - H', so that in the
case considered (H' = O) only the component Q2o appears. The quantity
eQo = e j p«(2£2 - Ç2 - T,2) dÇ dr) d£ = -
2e(Q2Qh
is called simply the quadrupole moment of the nucleus. Hence
w E;( n^nj') = ^ Q j ( 2 J ' + 1 , ( _ J n I
J
n)\
Explicitly, we have
WE2(ÜJ -> n , / - 1 ) = ^
w rnr->n r n-
^
Ql
n2
( J
_
1 ) J ( J + 1)(2J +
i r
(J 2 -n 2 )[(j-i) 2 -n 2 ]
w E 2 (nj -* n , J - 2) - 4ÖJ-3 Qo ( j _
1)/(2
/ - 1X2J + I)*
(55.3)
§56
The Photoelectric Effect: Non-relativistic Case
207
The following remark is necessary concerning these formulae. They include
matrix elements calculated with wave functions of the form
</>JÜM
= constant x *nl>i&(n),
where \n is the wave function of the internal,state of the nucleus. These functions
correspond to values of the £-component of the angular momentum which are
definite in both magnitude and sign. In nuclei, however, states have only a definite
parity and a definite magnitude of the angular-momentum component (usually
taken as O). Hence, if Cl ^ 0, the initial and final wave functions would have to be
taken as combinations of the form
The products of the first terms and of the second terms will give the same value as
above for the quadrupole moment matrix elements, but the cross-products will lead
to non-vanishing integrals if 2 H ^ 2 . t Hence formula (55.3) is not strictly valid if
0 = 2 or 1. In these cases the transition probability contains an additional term
which cannot be expressed in terms of the mean value of the quadrupole moment.X
In a similar manner to the derivation of (55.3), we obtain for the M l transition
probability
where fi is the magnetic moment of the nucleus. This formula is not valid if Ü = \.
§ 56. The photoelectric effect: non-relativistic case
In §49-52 we have discussed radiative transitions (with emission or absorption
of a photon) between atomic levels of the discrete spectrum. The photoelectric
effect differs from such a photon absorption process only in that the final state
belongs to the continuous spectrum.
The cross-section for the photoelectric effect can be calculated in an exact
t For the matrix elements of 2'-pole moments, the integrands will include products of the form
The angle integral will not be zero if q' = - 2ft, and the range of values of q' is only from -I to +/; thus
we must have 2ft ^ /.
t This term in fact gives a significant correction only in the case ft = i when the coupling between
the rotation and the internal state of the nucleus is especially large (see QM, §119).
208
§56
Radiation
analytical form for the hydrogen atom and for a hydrogen-like ion (with atomic
number Z«137).
In the initial state, the electron is at a discrete level £, = - J (where I is the
ionization potential of the atom) and the photon has a definite momentum k. In the
final state, the electron has momentum p (and energy ef = e). Since p takes a
continuous series of values, cross-section for the photoelectric effect is
da = 2TT| Vfi\28(-I + co - e) d3p/(27r)3
(56.1)
(cf. (44.3)); the wave function of the final state of the electron is normalized to
"one particle per unit volume". The wave function of the photon is, as before,
normalized in the same way; in order to obtain the cross-section da, the probability
dw then has to be divided by the photon flux density (which is c/V = c when
V = 1), but when relativistic units are used this does not affect the form of (56.1).
As in (45.2), we choose the three-dimensionally transverse gauge for the photon.
Then
Vfi = - eA • j/i = - e V(4ir) ^J^)
Mfi
'
where
M/, = J <K*(<*'e)eik > d 3 x ,
(56.2)
with \\f = ifßi and ifß' = \\ßf the initial and final wave functions of the electron. Putting
in (56.1) d3p ->p2 d|p| do = e\p\ de do and integrating to remove the delta function of
e, we can write this formula as
da = e2^\Mfi
|2 do.
(56.3)
The calculations will be given in two cases, which differ as regards the magnitude
of the photon energy: co > I and co <§ m. Since I ~ me4Z2 < m, these two ranges
partly overlap (when I <<a<m), and so an examination of the two cases gives an
essentially complete description of the photoelectric effect.
We shall take first the case
a><m.
(56.4)
The electron velocity is then small in both the initial and the final state, and the
problem is therefore entirely non-relativistic as regards the electron. Accordingly,
we replace a in (56.2) by the non-relativistic velocity operator v = - iVfm (cf. §45).
We can also use the dipole approximation, putting e'k,r**l, i.e. neglecting the
§56
The Photoelectric Effect: Non-relativistic Case
209
momentum of the photon in comparison with that of the electron. Then
2lT(t)
(56.5)
We shall consider the photoelectric effect from the ground level of the hydrogen
atom (or of a hydrogen-like ion). Then
^ = (Ze2m)3/2e_Ze2wr.
(566)
V V
in ordinary units, me2 becomes l/a0, where a0 = h2lme2 is the Bohr radius.
The wave function if/' must be taken such that its asymptotic form comprises a
plane wave (e'p'r) together with an ingoing spherical wave; cf. QM, §136, where
this function was denoted by ij/^\ On account of the selection rule for I, a transition
from an s state can only be to a p state (in the dipole case). Thus, in the expansiont
*ï"} = ^ jÈ i'W + D e-iStRPi(r)P,(n • n,),
(56.7)
where n = p/p, n, = r/r, it is sufficient to retain the term with 1 = 1. Omitting
unimportant phase factors, we therefore have
_3_
^^(n-nOJUr).
2p
(56.8)
With the functions *p and </*' given by (56.6) and (56.8), we have
e • v„ = ^vCmp' 2 / / ( ° ' *l*mi ' e ) ^ ^ ^ ^
V(27r)(Zg2m)5/2/
pm
, f
J
o
2
_^mrj>
.
d x
° ' r*dr
w
According to QM (36.18), (36.24), the radial function is (in the units employed here)
_ tor ™ , • A ^. ^
„
V(%<ir)Ze2m I \ + v2
(
Rpi = —L—|
VKl-e" 2 "") p r e
^ ' pr)>
with
v = Ze2m/p (= Ze2lhv).
t In the rest of this section, p denotes |p|.
(56.9)
210
§56
Radiation
The integral can be calculated by means of the formula
[ e-Xzzy-xF{a, y, kz) dz = Y{y)k*-y{k - kya\
o
cf. QMy (f.3). Noticing also that
(v + l \
--2vcor« v
=
we obtain
e V/
' '
27/27ri/3(n - p)
Vpm(\
+
g-*"*»-1"
2 m
v ) V(\-e-2™y
The energy of ionization from the ground level of the hydrogen atom (or a
hydrogen-like ion) is I = Z2e4m/2. Hence
a>=^+I=^(\ + v2).
(56.10)
Using this relation, we obtain as the final expression for the cross-section for the
photoelectric effect with emission of an electron into the solid-angle element do
(
r \ 4 p-4i/ cot~! v
■t)
i_e-^(n-e)2do,
(56.11)
where a = h2lmZe2 = a0/Z (ordinary units are used here and below). The angular
distribution of the emitted electrons is governed by the factor (n • e)2. This has
maxima in the directions parallel to the direction of polarization of the incident
photons, and is zero in directions perpendicular to e, including the direction of
incidence. For unpolarized photons, formula (56.11) must be averaged over the
directions of e, which is equivalent to substituting
(n-e^knoXn)2
with no = k/k; see (45.4b).
Integration of formula (56.11) over all angles gives the total cross-section for the
photoelectric effect:
29
2
/ I \ 4 „-Avcot"1 v
(M. Stobbe, 1930).
The limiting value of <x as h<a -*■ I (i.e. as v -* <») is
a =^
aa* = ^ - f r ° = 0.23a§/Z2,
(56.13)
§56
The Photoelectric Effect: Non-relativistic Case
211
where e in the denominator is the base of natural logarithms. The cross-section for
the photoelectric effect tends to a constant limit near the threshold, as it must for a
reaction forming charged particles (see QM, §147).
The case in which h(o > I (still with ha> < mc2) corresponds to the Born
approximation {v = Ze2lhv < 1). Formula (56.12) becomes
^2JfaaizitT'
(56M
>
where I0 = e4ml2h2 is the ionization energy of the hydrogen atom.
The process inverse to the photoelectric effect is the radiative recombination of
an electron with an ion at rest. The cross-section artc for this process can be found
from that for the photoelectric effect (<rph) by means of the principle of detailed
balancing (QM, §144). This principle states that the cross-sections for the processes
i -* / and / -» i (with two particles in each of the states i and /) are related by
where ph pf are the momenta of the relative motion of the particles, and g„ gf the
spin statistical weights of the states i and /. Since g = 2 for the photon (which has
two possible directions of polarization), wefindfor the hydrogen atom ground state
CTree - (Tph • 2 k V ,
(56.15)
where p = mv is the momentum of the incident electron and k that of the emitted
photon.
PROBLEMS
PROBLEM 1. Derive formula (56.14) by direct use of the Born approximation in the non-relativistic
case.
SOLUTION. In the Born approximation, $' in (56.5) is simply the plane wave i£' = e , p r , and ^ is
again the function (56.6). Then
v/i = Vi/= — J *lfp*lf'd3x
_ P (Ze
m)
,-Zt2mr,
The Fourier component is given by (57.6b), and so
v/i = 8V7rp-3m3/2(Ze2)5/2n.
Substitution in (56.5) and integration over do leads to (56.14) (here, with sufficient accuracy, p2/2m « <*>).
PROBLEM 2. Determine the total cross-section for radiative recombination of a fast but nonrelativistic electron (I < mv2 < mc2) with a nucleus having charge Z <3 137.
SOLUTION. The cross-section for capture to the K shell (principal quantum number n = 1) is obtained
by substituting (56.14) in (56.15):
_rec _ 2 7T ^ 3
y i ( h \
§57
Radiation
212
where e = \mv2 is the energy of the incident electron, and hw ~ F. Among the other states of the
resulting atom, only s states are important: in the calculation of the matrix element in the Born
approximation, the important values are those of the wave function of the bound state when r is small
(as will be seen in §57), and when / > 0 these values are small compared with those for / = 0. It is
sufficient to take the first two terms in the expansion of <A in powers of r. For states with / = 0 and any n,
these terms are
—M-;)V(7raV)
i.e. they contain n only as a common factor n'V2\ this expression is obtained by expansion of QM,
(36.13). The total recombination cross-section is therefore
rec _
V*
rec
rtc V
n=l
-
r(1\~Tec
n=|H
The value of the zeta function is f(3) = 1.202.
§57.
The photoelectric effect: relativistic case
Let us now consider the case
a)>L
(57.1)
Here we have also e = co - 1 > I, and the influence of the Coulomb field of
the nucleus on the wave function i//' of the emitted electron can be taken into
account by means of perturbation theory. We shall write
1
^' = 7V(2?>
77^("'eipr+<A(1)).
(57.2)
The electron may be relativistic, and therefore the unperturbed function in (57.2) is
written as a relativistic plane wave (23.1).
Although the electron is non-relativistic in the initial state, its wave function i//
must nevertheless, for reasons to be explained below, include the relativistic
correction {—Ze2). This function is (cf. §39, Problem)
*-{x-&*-v)>7ifc)*~-
(57 3)
-
where i/wr is the non-relativistic bound-state function (56.6), and u is the bispinor
amplitude of the electron at rest, normalized by the usual condition üu = 2m.
We substitute the functions (57.2), (57.3) in the matrix element (56.2):t
M
<>=wh)l{a'^ ■e) [(* - £ *v v)"*~] ^K'"l),+
+ iÄ"'(-re)e""'<«/Wr}d 1 x.
(57.4)
t The function (57.3) has been derived for distances r - l / m Z e 2 , at which the relative order of
magnitude of the correction term is Ze2. But for the ground state (and for all 5 states) formula (57.3) is
valid for any r, since the derivative of the purely exponential function (56.6), and therefore the
correction term in (57.3), are always proportional to Ze2. This enables us to use formula (57.3) in the
present problem, where (as we shall see below) it is the small values of r that are important.
§57
The Photoelectric Effect: Relativistic Case
213
In order to derive the first term of the expansion of this quantity in powers of Ze2,
we can replace t/w r in the second term in the braces by the constant (Ze2m)3/2/vV
simply. The first term would vanish if treated in this way when p - k ï 0, and it is
for this reason that the first relativistic correction, proportional to Ze2, has to be
included in «//. When v ~ 1, this correction gives a contribution to the cross-section
that is of the same order as the contribution of the next term in the expansion of
«Anon-r in powers of Ze1.
In the first term in (57.4) we integrate by parts, transferring the action of the
operator V from i//n0IW to the exponential factor. The result is
(57.5)
where the vector suffix denotes the spatial Fourier component. As far as the Ze2
term we havet
-zt^mrs
SrrZe m
(e- z<m V*=
Z(p-k)
.."•
,p-k
(57.6)
To calculate the Fourier components i^0, we write down the equation satisfied
by the function i|/l):
( A + iy • V - m)^(,) = e ^ A J i i ' e * "
= - —7Ve'p",
r '
obtained by substituting (57.2) in (32.1). Applying the operator y°e + iy • V + m to
both sides, we find
(A + p V u = - Ze\y°e + iy • V + m)(y°u')- e i p r .
Multiplying this equation by e~ikr and integrating with respect to d3x, with the
t Taking the Fourier component of each side of the equation
-Ar
( A - À 2 ) — = -4ir6(r),
we obtain
(Ç),-îfc>
and differentiation with respect to A gives
214
§57
Radiation
usual integration by parts in the terms containing A and V, gives
(p2 - k2)^> = - Ze\y°e - y • k + m)(y°u')(±^
= - Ze2(2ey0 - y • (k - p))(7 V ) ^ ^ 5 In the last line we have used the fact that the amplitude u' satisfies the equation
(ey° - p • 7 - m)u' = 0, or (e?0 + p • 7 - m)ybu' = 0.
Hence
M =« V
= 4irZ«2fi' 2 f f l y ) ( k ( k , ~ ^ 7°.
(57.7)
Substituting (57.6) and (57.7) in the matrix element (57.5), we can write it as
..
M/i =
A>nm{Ze2mf>2
_, A
( e my /2 (k-p) ittAw '
where
A = a(7 • e) + (7 • e)7°(7 • b) + (7 • c)?0(7 • e),
a
1
(k^p?
g 1
rnFj?'
.
D
k-p
2m(k-p) 2 '
C
k-p
2m(k 2 -p 2 )'
The cross-section is
da =
Xk —\J£
a>(.
p) m (U'AU)(UAM') do,
where Ä = 7°A+y°; see §65. This expression has to be summed over final directions
and averaged over initial directions of the electron spin, using the rules given in §65
below and the polarization density matrices of the initial andfinalstates:
P = im(7°+1), p' = i ( 7 ° e - 7 - P + m);
in the initial state, p = 0 and e = m. The resulting expression is
.
d(r
=
16e2(Ze2m)5lpL
r. .
t r , ,A
/T—TP
mw(k-p)
(p ApA) do.
The calculation of the trace by means of the formulae (22.22) is purely algebraic,
§57
215
The Photoelectric Effect: Relativistic Case
and the result is
tr(p'ApA) = — — [ap - (b - c)(e + m)]2 + 4m(b • e)[(e + m)(c • e) + a(p • e)];
the vector e is assumed real, i.e. the photon is assumed to be linearly polarized..
The formula for the photoelectric effect cross-section will be put in its final
form by using the polar angle 6 and the azimuth <t> of the direction of p when the
direction of k is the z-axis and the plane of k and e is the xz plane (so that
p • e = |p| cos <t> sin 0). When <u > I, the conservation of energy may be written in
the form e -m = <o instead of e - m = w - 1 . We then easily see that
k 2 - p2 = - 2m(e - m),
(k- p)2 = 2e(e - m)(l - v cos 6),
where v = p/e is the velocity of the photoelectron. A simple calculation gives finally
.
4 2
75
t? 3 (l ~ t?2)3 sin 2 $
dcr = Z a r < [ 1 _ V ( 1 _ y 2 ) ] i ( 1 _ v c o s 0 ) 4 x
„f[i-V(i-u 2 )] 2 „
I
2(1 -V) 3 / 2
„r.
0 - w cos 0) + p -
[i-V(i-t?2)](i-t?cosfl)l
*—Y^p
2.i
.
^J cos2 <f> | do,
(57.8)
where re = e2lm.
In the ultra-relativistic case (e > m), the photoelectric effect cross-section has a
sharp peak at small angles 6 ~ V(l -1>2), i.e. the electrons are emitted predominantly in the direction of incidence of the photon. Near the maximum
l - u c o s 0 « 2 f ( l - t ; 2 ) + 02],
and the leading terms in (57.8) give
da - 4ZVr 2 ^_y+ey
dB d<f>.
(57.9)
The integration of (57.8) over angles is elementary but lengthy, and leads to the
following expression for the total cross-section (F. Sauter, 1931):
(57.10)
where the "Lorentz factor" y is used for brevity:
1
e _ m+w
2
V
(
l
ü
)
"
m
* m •
y TT7T—7Z = -~--—
=
.„,,.
(57.11)
216
§58
Radiation
In the ultra-relativistic case, this formula reduces to the simple expression
(j = 27rZVr*/y;
(57.12)
in the case I <o> <m, the limit of small y - 1 in (57.10) yields the already known
result (56.14).
§58. Photodisintegration of the deuteron
A distinctive property of the deuteron is that its binding energy is small in
comparison with the depth of the potential well. This enables reactions involving
the deuteron to be described without a detailed knowledge of the behaviour of
nuclear forces, using only the binding energy (see QM, §133). Here it is assumed
that the wavelengths of the colliding particles are large compared with the range a
of the action of nuclear forces.
This applies to the disintegration of the deuteron by y quanta having ka < 1. It
will also be assumed that pa < 1, where p is the momentum of relative motion of
the neutron and proton released; this is a stronger condition than ka < l.t
We start from the non-relativistic formula (56.5) for the photoelectric effect
cross-section, integrating over all directions:
e2p M
4TT
U
fl
where p is the momentum of relative motion of the proton and neutron,^ and m in
(56.5) has been replaced by their reduced mass M/2 (where M is the nucléon mass).
The matrix element is that of the proton velocity vp, since only the proton interacts
with the photon. Expressing vp in terms of the momentum p (vp = iv= p/M), we
have
~ w - - 3Mo>
^|p/i|2.
(58.D
The superscript (e) denotes that this formula corresponds to electric dipole transitions: epIM = e\p = d, so that epf-JM = ia>d/j.
The normalized wave function of the initial (ground) state of the deuteron
is
t The photon energy for which pa * 1 (with a = 1.5 x l(Tncm) is 15 MeV.
t In this section, p denotes |p|.
i
§58
Photodisintegration of the Deuteron
217
where I = 2.23 MeV is the binding energy (see QM, §133).t The wave function of
the final state can be taken to be that of free motion, i.e. the plane wave
i//' = e,p r.
(58.3)
The reason is that, in the theory under consideration, the "size of the deuteron" 1/K
is assumed large in comparison with the effective interaction radius a. The
interaction between the proton and the neutron therefore has to be taken into
account only in S states, and can be neglected in states with /^O, whose wave
functions are small at small distances. According to the selection rules, electric
dipole transitions between two S states (the ground state and an S state of the
continuous spectrum) are forbidden, and it is therefore possible in this case to
neglect the nucléon interaction in the final state.
Integration by parts gives the matrix element
V
K
4îrp
2TTF+^;
see the second footnote to §57.
Using also the equation
which expresses the conservation of energy, we finally obtain the photodisintegration cross-section (in ordinary units) as
a
M
_ 8TT
h2 Vl(fta>-I) 3 / 2
~T M—(M3—
a
(58 4)
-
(H. A. Bethe and R. Peierls, 1935). It has a maximum at hw = 21, and tends to zero
as h(o -* I orftco-» ».
The electric dipole absorption of the photon, described by formula (58.4), does
not, however, give the main contribution to the cross-section near the photoelectric
t This function can be made more accurate by including a correction due to thefinitenessof a, the
normalization coefficient in (58.2) being replaced by
v
2ir(l - a * ) '
see QM, (133.13). A factor l/(l - QK) accordingly appears in the cross-section formulae. This correction
is in fact quite large: for the ground state of the deuteron, QK =«0.4.
The deuteron ground state is 3Si, with a small "admixture" of 3Di due to the action of the tensor
nuclear forces (see QM, § 117). This admixture will be neglected, and therefore so will the tensor forces.
218
Radiation
§58
effect threshold (hu) close to I). This is because, in this range, the principal effect
must come from transitions to an S state, and these do not occur in electric dipole
absorption. Nor do they occur in electric quadrupole absorption, since, although
they do not then violate the parity selection rule, they are forbidden by the
selection rule for orbital angular momentum (the tensor forces are here neglected,
and L and S are therefore separately conserved). To calculate the photodisintegration cross-section near the threshold, we have therefore to consider magnetic
dipole absorption, for which the selection rules allow transitions between S states
(E. Fermi, 1935).
Replacing the electric moment in (58.1) by the magnetic moment, we have
a(m) = WMp ||i,i|2.
(58.5)
The magnetic moment of the orbital motion makes no contribution to n/*, since the
orbital angular momentum L has no matrix elements for transitions between S
states. The spin magnetic moment
|x = 2pipSp +2/j, n s„
= 2(jlp - /Lln)Sp +2/ÜLnS,
where S = sp + s„, and pip, nn are the magnetic moments of the proton and the
neutron. When the tensor nuclear forces are neglected, the total spin is conserved,
and its operator therefore yields no transitions. Hence
JI/j = 2(Sp)/,(/Xp -jLl,,).
In the same approximation (neglecting the tensor forces), the spin and coordinate.
variables are separable. The matrix element, like the wave functions, becomes a
product of a spin part and a coordinate part:
l&/i = 2(/xp - iXn)(spS'M'\sp\spSM) j i//'*W(r) d'x.
But the presence of spin-spin nuclear forces has the result that the wave equation
for the coordinate functions t//(r) includes the spin value S as a parameter. If
S' = S, then ift'(r) and \\>(r) are eigenfunctions of the same operator, and are
therefore orthogonal. Thus a photodisintegration from an initial 3S state can occur
only to a XS state of the continuous spectrum.
The square ||i/j|2 in (58.5) must, of course, be averaged over components M of
the spin S in the initial state. Thus the problem is to calculate the quantity
25^SK^S'M'|s p |5 p SM>| 2 ,
where sp = sn = 2, S = 1, S' = 0. The general rules for matrix elements in the
§58
Photodisintegration of the Deuteron
219
addition of angular momenta give
(25 + l) 1 (2S^l) l< ^ 1|Spl|5pS)|2 = { s l
ïfl<*>M*p>l2
= l\(sM\sP)f
(see QM, (107.11), (109.3)). The reduced matrix element is
<sPK||sp> = V[5p(sp + l)(2sp + 1)] = V(3/2).
Formula (58.5) then becomes
<r(m) = \a>Mp(tip - fx„)2| | «/r'*«/r d3x\\
(58.6)
The initial function ty is given by (58.2); the final function is
V—^Rt/Lr).
2p
This is the first term (/ = 0) in the expansion (56.7)Fof a function whose asymptotic
form comprises a plane wave and an ingoing spherical wave; an unimportant phase
factor has been omitted. Since the integration is taken over the region outside the
range of action of the nuclear forces, the radial function is
R,.(r) = 2 s i n <P/ + 8>.
The phase 5 is related to the value of the virtual level (Ij = 0.067 MeV) of the
proton + neutron system when S = 0:
cot 6 = Kt/p,
K, = V(Mh);
see QM, §133. Then
[ «/,"•> d*x = (2TT)3/2 - ^ im f e~Kr+ipr eiS dr
V3/2VK.
= (27r) i/2 — i m
pTT
e'
K — ip
After a simple algebraic reduction, we obtain the following expression for the
photodisintegration cross-section (in ordinary units):
Radiation
220
§58
When fao-* J, the cross-section tends to zero as V(fta> - / ) , in accordance with the
general properties of cross-sections near the reaction threshold (QM, §147).
The inverse process to photodisintegration is radiative capture of a proton by a
neutron. The capture cross-section (<JC) is obtained from the photoelectric effect
cross-section ((7ph) by means of the principle of detailed balancing; cf. the derivation of (56.15). The spin statistical weights of the neutron and the proton are
2 x 2 = 4; those of the deuteron (in a state with S = 1) and the photon are 3 x 2 = 6.
Hence
ac
3(M 2 „ ..
3(M2
,
aph
~ 2TV
- 2Me\ha>-I) **■
rcao,
'
(58 8)
CHAPTER VI
SCATTERING OF RADIATION
§ 59. The scattering tensor
T H E scattering of a photon by a system of electrons (which will be referred to
below as an atom) consists of the absorption of the initial photon k and the
simultaneous emission of another photon k\ The atom may be left either at its
initial energy level or at some other discrete energy level. In the former case the
photon frequency is unchanged (Rayleigh scattering); in the latter case the
frequency changes by
a>,-a> = E]-E2y
(59.1)
where Ex and E2 are the initial and final energies of the atom (Raman scattering)^
If the initial state of the atom is the ground state, then E2 > Ex in Raman scattering,
and so co'<co: the frequency is decreased by the scattering (the Stokes case). In
scattering by an excited atom, either the Stokes case or the anti-Stokes case
(cof > a)) may occur.
Since the electromagnetic perturbation operator has no matrix elements for
transitions in which two photon occupation numbers simultaneously change, the
scattering effect appears only in the second approximation of perturbation theory.
It must be regarded as taking place via certain intermediate states, which may be of
one of two types:
(I) The photon k is absorbed and the atom enters one of its possible states En ;
in the subsequent transition to the final state, the photon k' is emitted;
(II) The photon k' is emitted and the atom enters the state En ; in the transition
to the final state, the photon k is absorbed.
In this process, the matrix element is represented by the sum
„ \e>\ ~ e „
ei — e„ /
(see QM (43.7)), where the initial energy of the atom + photons system is %\ =
Ei + to, and the energies of the intermediate states are
rn = Eny
n = En + a> + o>\
t In this chapter, the suffixes 1 and 2 will denote quantities pertaining respectively to the initial and
final states of a scattering system.
221
Scattering of Radiation
222
§59
The V. are the matrix elements for the absorption of the photon k, and the V
are those for the emission of the photon k'; the initial state is excluded from the
summation over n, this being indicated by the prime to the summation sign. The
scattering cross-section is
da = 2n\V2i\^f,
(59.3)
where do' is a solid-angle element for the directions k'. The radiation energy dl'
scattered into the solid angle do' per unit time is expressed in terms of the intensity
(energy flux density) / of the incident radiation by
dL = I(co7o>) da.
We shall assume that the wavelengths of the initial and final photons are large
compared with the dimensions a of the scattering system. All transitions will
therefore be considered in the dipole approximation. If the photon states are
described by plane waves, this approximation is equivalent to replacing the factors
e l k r by unity. Then the wave functions of the photons are (in the three-dimensionally transverse gauge)
*- - v < 4 , r ) V ( b e"">
*" = v < 4 , r ) V(b) e ""'-
Under the conditions considered, the electromagnetic interaction operator may
be written as
V=-dÊ,
(59.4)
where Ê= - À is the field strength operator and d the dipole moment operator of
the atom (similarly to the classical expression for the energy of a small system in
an electric field; Fields, §42). The matrix elements are
Vn] = -iV(27rcu)(e • dnl),
V'2n = iV(27Tü>')(e'* • d2„).
Substituting these expressions in (59.2), (59.3), we find as the scattering crosssection (written in ordinary units)t
da.
l g ((*. • OW.. ■ «) + (fa ■ e)(d , • O i l W
ftü)„| = E„ — E i ,
O)' — CO = 0>i2.
The summation is over all possible states of the atom, including those of the
continuous spectrum (states 1 and 2 cannot appear in the sum, since the diagonal
t This formula was first derived by H. A. Kramers and W. Heisenberg (1925), before the development of quantum mechanics.
The Scattering
§59
Tensor
223
matrix elements d n and d22 are zero). The infinitesimal imaginary increments in the
denominators correspond to the usual rule for pole avoidance in perturbation
theory (see QM, §43): an infinitesimal negative imaginary part is added to the
energies En of the intermediate states over which the summation is carried out. The
avoidance rule is important when the poles of (59.5) with respect to the variable En
are in the region of the continuous spectrum; for example, if state 1 is the ground
state of the atom, this would occur for hù> exceeding the ionization threshold of the
atom.t
We shall use the notation^ (in ordinary units)
(n \ J v
F (dihn(dk)n\
, (dk)2n(di)n\ 1
\Cikh\ - j ; 2J\
Tc\+
1—?—Tï\ b
*
n L^nl ~* 0) — W
(X)n2 + CO — lUJ
<<Q6\
P9.t>)
where i, k = x, y, z are three-dimensional vector indices. Then formula (59.5) can be
written as
da = <»(6> + (ül2)3|(cÄ)2ieS*ft|2 do'lc4.
(59.7)
The notation (59.6) is justifiable in that this sum can in fact be represented as
the matrix element of a certain tensor. This is most easily seen by defining a vector
quantity b whose operator satisfies the equation
ib + cob = d.
Its matrix elements are
K
t>nl=
dnt
0) — (On\
,
.
ö2n =
Ain
T
,
CO + CO„2
so that
(Cik)2i = (Mi-dib k )2i.
(59.8)
The matrix elements (clk)2i will be called the radiation scattering tensor.
It follows from the above that the selection rules for scattering are the same as
the selection rules for the matrix elements of an arbitrary tensor of rank two. We
can see immediately that, if the system has a centre of symmetry (so that its states
can be classified by parity), transitions are possible only between states of the same
parity (including transitions without change of state). This rule is the opposite of
the parity selection rule for (electric dipole) emission, and so there is an alternate
prohibition: transitions allowed in emission are forbidden in scattering, and vice
versa.
We can resolve the tensor cik into irreducible parts:
cik = c°8ik + csik + cL
(59.9)
t In a molecule, the threshold for dissociation into atoms here takes the place of the ionization
threshold.
t Most of the results derived in §§59-61 below are due to G. Placzek (1931-1933).
Scattering of Radiation
§59
c° = tcu,
cfk = l(clk + c k j)-c°5i k ,
C?k =
(59.10)
2(Cik-Cki)
are respectively a scalar, a symmetric tensor (with zero trace) and an antisymmetric tensor. Their matrix elements are
(c°)2i = i S 7
" Wl v" W2 +
Adihniddm,
(59.11)
(Cfthl = 2 S 7
"nlw"n2+
J(^)2n<dk)nl + (4)2«(4) i .l] ~ (C°)2lO*,
(59.12)
V (cu n i-cu)(a) rt2 +a))
a
+ (Oi2 y » ( d j ) 2 n ( d k ) n l - (dk)2n(dj)n
_2(Q
2
i.
/Wnx
(cunl-co)(a)rt2 + a))
T
the symbols indicating pole avoidance are omitted, for brevity.
Let us consider some properties of the scattering tensor in the limiting cases of
low and high photon frequencies.t
For Rayleigh scattering (coî2 = 0), the antisymmetric part of the tensor vanishes
as (o->0, because of the factor to in front of the sum in (59.13). The scalar and
symmetric parts of the scattering tensor, however, tend to finite limits as a>->0.
The cross-section is therefore proportional to co4 when cu is small.
In the opposite case, when the frequency co is large compared with all the
frequencies co„i, a)n2 which are important in (59.6) (but of course the wavelength is
still much greater than a), we must arrive at the formulae of the classical theory.
The first term in the expansion of the scattering tensor in powers of l/o> is
— 2 [(dk)2n(di)n\ - (di)2n(dk)n{] = —{dkdx - dxdk)iu
(O
(On
and is zero, since the operators dt and dk commute. The next term in the expansion
is
(Cik)2\ = —Î 2 [0>2n{dk)2n(di)n\ ~ (di)2nü)n\(dk)n\]
(*>
n
= 7-ï(<Wi -
didk)2\.
Using the definition d = 2 er (with the summation over all the electrons in the
atom) and the commutation rules for momenta and coordinates, we obtain
<C*)II = - ^ * I *
ÏÏÏO)
(c*) 2 i = 0,
(59.14)
t The case of resonance (when eu is close to one of the frequencies Ü>„I and a)«2) will be discussed in
§63.
§59
The Scattering Tensor
225
where Z is the total number of electrons in the system, and m the electron mass.
Thus, in the limit of high frequencies, there remains in the scattering tensor only
the scalar part, and scattering takes place without change in the state of the system,
i.e. the scattering is entirely coherent (see below). The scattering cross-section in
this case is
da = r2eZ2|e;*-e|2do',
(59.15)
where re = e2lm. After summing over polarizations of the final photon, we have
da = r2,Z2{l-(e-n')2}d<>'
= r2eZ2 sin2 0- do',
(59.16)
which is in fact the same as the classical Thomson's formula (Fields, (80.7)); 0 is
the angle between the direction of scattering and the polarization vector of the
incident photon.
Let us consider the scattering of radiation by an assembly of N identical atoms
situated in a region small compared with the wavelength. The corresponding
scattering tensor is equal to the sum of the tensors for scattering by each atom. It
must, however, be remembered that the wave functions (which are used to
calculate the dipole moment matrix elements) for several identical atoms taken
together are not simply equal functions. The wave functions are essentially defined
only to within an arbitrary phase factor, which is different for each atom. The
scattering cross-section has to be averaged over the phase factor of each atom
separately.
The scattering tensor (cik)2\ of each atom includes a factor el{*x~*2>, where <f>\ and
<t>2 are the phases of the wave functions of the initial and final states. For Raman
scattering, the states 1 and 2 are different, and this factor is not equal to unity. In
the squared modulus
\e?ek 2 (c*)2i|2,
where the sum is over all N atoms, the products of terms pertaining to different
atoms will include phase factors which vanish on independent averaging over the
phases of the atoms, and only the squared modulus of each term remains. This
means that the total cross-section for scattering by N atoms is found by taking N
times the cross-section for scattering by one atom; the scattering is incoherent.
If, however, the initial andfinalstates of the atom are the same, then the factors
eK*i-*2> = i# xhe amplitude for scattering by the assembly of atoms is N times that
for scattering by one atom, and the scattering cross-section consequently differs by
a factor N 2 ; the scattering is coherent.! If the atomic energy level is not
degenerate, Rayleigh scattering is therefore entirely coherent. But if the energy
level is degenerate, there will also be incoherent Rayleigh scattering arising from
t The factor Z2 in formulae (59.15) and (59.16) has the same origin: the cross-section for scattering
by Z electrons in one atom is Z 2 times that for scattering by one electron.
§59
Scattering of Radiation
226
the transitions of the atom between various mutually degenerate states. This is a
purely quantum effect; in the classical theory, any scattering without change of
frequency is coherent.
The coherent scattering tensor is given by the diagonal matrix element (cik){U
and will be denoted by aik, omitting for brevity the index which shows the state of
the atom. According to (59.6),
T7i +
aik(cü) = (Cifc)ii = 2J
T
77i I
(59.17)
This expression may also be written as
a^H^I-Zfr+i-S \
mco2 {
{ p M k
\ 4- (Pk^(P^1],
m r ki-a)-iO
conl + co - iOjJ
(59.18)
using the limiting form (59.14). Here p is the total momentum of the electrons in the
atom. The equivalence of the two forms is easily seen by noting that the matrix
elements of the momentum and the dipole moment are connected by
epjm
= i(Dlnd\n
and using the same relationships as in the derivation of (59.14).
If the sum and difference E\ ± CJ are not equal to any of the energy levels En of
the atom (including the continuous spectrum), the terms iO in the denominators
may be omitted. Since pfn = pnU the tensor aik is then seen to be Hermitiant:
ctik = at
(59.19)
This means that its scalar and symmetric parts are real, and its antisymmetric part
is imaginary. The latter is certainly zero if the atom is in a non-degenerate state; the
wave function of such a state is real, and therefore the diagonal matrix elements
are also real.
The tensor aik is related to the polarizability of the atom in an external electric
field. To show this relation, let us calculate the correction to the mean value of the
dipole moment of the system when the latter is placed in an external electric field
kEe- |wr + E*e , w ).
(59.20)
This can be done by using a well-known formula of perturbation theory (QM, §40).
If the system is subjected to a perturbation
t This result depends on the neglect of the natural line width, and therefore of the possible
absorption of the incident radiation; see §62.
111
The Scattering Tensor
§59
then thefirstvordercorrection to the diagonal matrix elements of a quantity / is
fW(f\
1 K)
_. _ V1 IT
f\nFn\
T'lUi.i-û>-iO
fn\P\n
1
-iut
, |
Oni + co + iO]
J\nF
\n
.
Jn\Fn\
1 'not 1
l(on] + <D-iO ù>nl-û> + îOJ
J*
The perturbation V must be regarded as being applied with infinite slowness from
t = -oo, so that in the first term <o is to be interpreted as co + iO, and in the second
term as w - iO; the imaginary increments in the denominators have been written
accordingly.
In the present case
P = -id-E,
and the correction to the diagonal matrix element of the dipole moment is found to
be
dVi) = kde- |W + d*eia,r),
(59,21)
where d is a vector whose components are
di = aWEk.
(59.22)
The expression for the tensor aftXio) differs from (59.17) for a,* by a change in the
sign of the imaginary part in the denominator of the second term. By definition,
atXco) is the polarizability tensor of the atom in a field of frequency a>. For
frequencies such that the imaginary parts in the denominators can be omitted and
the tensor aik is Hermitian, aik and a\V are identical. In particular, when œ = 0 the
formula (59.22) becomes QM, (76.4), and the expression QM, (76.5) for the static
polarizability tensor is the same as c*jk(0) from (59.17). Note also that, if state 1 is
the ground state,t all o)n\ > 0 and the avoidance rule in the first and second terms in
(59.17) is important only when o> > 0 and co < 0 respectively. In that case,
aÄ(oi) «a^Clcol).
(59.23)
The formulae of scattering theory implicitly have a> > 0; the tensor aik is then the same
as the polarizability tensor.
We shall need not only the cross-section but also the photon scattering amplitude /. As usual in perturbation theory, this is equal, apart from a normalization
factor, to minus the matrix element (59.2). Choosing this factor so as to express the
cross-section (59.7) in the form da = |/|2 do\ we have as the elastic scattering
amplitude
/^CUWÏ*«*.
(59.24)
According to the optical theorem (see (71.10) below), the imaginary part of the
t Only this case (which will be assumed henceforward) allows a completely rigorous treatment,
because of the finite lifetime of the excited states; see §62 below.
Scattering of Radiation
228
§59
forward scattering amplitude (without change in momentum and polarization)
determines the total cross-section at for all possible elastic and inelastic processes
for a given initial state of the photon:
crr =
(4TT/O))
im(co 2 a jk ^t) = 47ra)(a* - a£)ef ek\ll
(59.25)
Thus the total cross-section is determined by the anti-Hermitian part of the
scattering tensor.
The formula (59.25) has a simple classical significance. The electric field E does
work Xe\ • E = E • d on the system of charges per unit time. Expressing the field in
the form (59.20), and the dipole moment in the form (59.21), (59.22), and averaging
this work with respect to time, we find
\a)\E\2e*ek(aik-
afc)/2i,
with E = eR On the other hand, if E is the incident radiation field, the mean energy
flux density in it is |JE|2/87r, and the energy absorbed by the atom is |E|2O-,/8T7.
Equating the two expressions, we find (59.25).
If the angular momentum J of the ground state of the atom is zero, then by
spherical symmetry aik = aöiky and
crt =
4TT(O
im a.
(59.26)
For a system having angular momentum, a similar relation holds for quantities
averaged over the spatial directions of the angular momentum (see §60).
For photon energies above the ionization threshold of the atom, the principal
contribution to the total cross-section vt comes from the ionization process (the
absorption of a photon in the photoelectric effect). The scattering cross-section is a
quantity of higher order in e2; compare, for instance, (56.13) and (59.16).
If, however, the photon energy is below the ionization threshold (but not too
close to resonance, i.e. to any of the discrete excitation frequencies of the atom),
then the cross-section (which in this case reduces to the scattering cross-section),
and therefore the imaginary part of the amplitude, are of a higher order of
smallness than the real part of the amplitude. Neglecting the former, we again
obtain (59.19). The situation is different in the neighbourhood of resonance, where
the cross-section increases; this case will be discussed in §62.
As well as scattering, the two-photon processes which occur in second-order
perturbation theory also include double emission, i.e. the simultaneous emission of
two quanta by an atom.
The expression for the probability of this process differs from (59.5) only by the
changes o> ->-cu, e-*e* (emission of a photon co, instead of absorption) and by the
extra factor
d3k/(27r)3 = a>2do)do/(27r)3,
the number of quantum states of the emitted photon in given ranges of the
frequency o> and the directions of k; the frequency of the second photon is
§59
The Scattering Tensor
229
determined from a> by the equation to + to' = o>i2. The emission probability per unit
time is therefore!
dw = \(bik)2Xe'*e% | 2 ( 2 ^ W d ° d ° ' dû>'
(5927)
where
^ ^ " ^ L ^ i
+ a i - Î O + ai.t + c o ' - î o J
differs from ( c ^ i in (59.6) only by a change in the sign of co. Summing this
expression over the polarizations of the photons and integrating over their directions of emission,^ we obtain
dw=|^|(M2l|2dio.
(59.28)
The probability of the emission of two photons <o and o/ is usually very small in
comparison with that of the emission of a single photon with frequency <o + a/. An
exception occurs in cases where the selection rules forbid the latter process but
allow the former, such as transitions between two states with J = 0, where all
processes of single-photon emission are strictly forbidden. Another example is the
transition from the first excited state (2s ±) of the hydrogen atom to the ground state
(Isi), which is forbidden for both El and Ml radiation; see §52, Problem 2.§
If the atom is in the field of an incident flux of photons o>, k, there is not only
spontaneous double emission, with the probability (59.27), but also induced double
emission, in which the field causes emission of a similar photon and a photon co', k'.
The probability of this process differs from that of spontaneous emission by a
factor Nke, the number density of incident photons with given k and e. The incident
photon flux density is
dl = cNkt d3k/(27r)3 = Nke(co2/87rV) d<o do.
Expressing Nke in terms of dl and dividing the probability of the process by dl, we
obtain the cross-section
da = f£\{bik)ne'*et\2do'.
(59.29)
Similarly, if the atom is in a field of photons a/, k', the incidence of a photon <o, k
causes induced Raman scattering, whose cross-section is proportional to the
density of photons o/, k'.
t In the rest of this section, ordinary units are used.
t This operation amounts to complete averaging over the directions of e by eiét = 18,*, followed by
multiplication by 2 x 2 x 4rr x ATT.
§ The lifetime of the 2s\ level for double emission is 0.15 sec.
230
Scattering of Radiation
§59
The calculation of the tensors (cik)\2 or (bik)\2 for specific atoms requires that of
sums of the form
(M
^))2,=?^^i0'
(59 30)
'
with £ taking the values E\±ho) or Ei±fta>'. To simplify the notation, let us
discuss a hydrogen atom. We write the sum (59.30) as the integral
(Mß>)2, = J <//f(r) d,G(r, r') d'k«//,(r') d'x dh',
(59.31)
G(r,r';£) = 2 | ^ f ^ .
(59.32)
where
Let the operator H - E, where H is the Hamiltonian of the atom, act on the
function G. Since Hiftn = E„\fß„, we obtain
( H - E ) G = 2«Mr)«/'Ur').
n
The sum is the delta function S(r-r'), since the set of functions \\ßn is complete.
Thus the function G satisfies the equation
(È - E)G(r, r'; E) = ô(r - r'),
(59.33)
i.e. it is the Green's function of Schrödinger's equation; the avoidance rule in
(59.32) decides which solution of this equation is to be taken. Thus the problem of
calculating the sum (59.30) reduces tofindingthe Green's function of the atom. An
exact solution of equation (59.33) is, however, possible only if we know the exact
solutions of the homogeneous Schrödinger's equation, i.e. in practice only for the
hydrogen atom.t
PROBLEM
Calculate the probability of elastic scattering of a (non-relativistic) electron by an almost monochromatic standing light wave (P. L. Kapitza and P. A. M. Dirac, 1933).
SOLUTION. The standing wave may be regarded as a combination of photons with momenta k and
- k (and equal polarizations). The scattering of the electron may be regarded as the absorption of a
photon k and induced emission of a photon - k , so that the electron momentum p is changed by 2hk and
rotated (without change of magnitude) through an angle 6 such that |p| sin \Q = ftw/c. The probability of
this process can be obtained from the Thomson scattering cross-section (59.15),
da = rl\ë* • e| 2 do'
= rl do\
t See L. Hostler, Journal of Mathematical Physics 5, 591, 1964. The application of this Green's
function to calculate the scattering amplitude for the hydrogen atom is given by Ya. I. Granovskif,
Soviet Physics JETP 29, 333, 1969.
I
§60
Scattering by Freely Oriented Systems
231
by multiplying by the flux density of photons with momentum k and the number of photons with
momentum - k .
The flux density of photons having frequencies in the range du> fs clLda>/2ftû>, where ILdco is the
energy density in the standing wave in the spectrum interval dw ; the factor 2 appears because the energy
of the wave is equally divided between the photons moving in opposite directions. The momenta k of all
the photons forming the standing wave are parallel to a certain direction n (the "direction" of the
standing wave). In other words, the energy density as a function of the frequency and direction of the
photons n' is LLn = [/w6(2)(n' - n). Accordingly, the number of - k photons is
cf. (44.8). The electron scattering probability per unit time is then found to be
2irV
f -_
.
w = — r m I U2u d(û.
The factor w~* is taken outside the integral, since the non-monochromaticity Aa> is assumed small. The
value of the integral is inversely proportional to Aa> (for a given total intensity).
§ 60. Scattering by freely oriented systems
If an atomic energy level is not degenerate, the polarizability and intensity of
coherent scattering are determined by the same tensor aik = (cik)n- If the level is
degenerate, however, the observed values of these quantities are averaged over all
states belonging to the level in question. The polarizability must be defined as the
mean value
a * = (Cjk)n.
The observed scattering intensity is determined by the mean values
(Cik)n(c| m )ii.
The relation between the polarizability and the scattering is therefore more indirect.
Although each of the quantities (cik)n may be complex, their mean values (in the
absence of absorption, with a* an Hermitian tensor) are real, since on averaging we
can choose arbitrarily the set of independent wave functions (corresponding to a
given degenerate level), and we can always ensure that all the functions are real.
For free atoms or molecules (not in an external field), the degeneracy of levels
is usually due to an angular momentum which is freely oriented in space. Let the
initial state in scattering have angular momentum Ju and the final state J2. As usual,
the scattering cross-section must be averaged over all values of the component Mi,
and summed over the values of M2. After the averaging, the cross-section is
independent of M2, and the summation is therefore equivalent to multiplying by
2/2+I. Thus the averaged scattering cross-section is
d<7 = û>a>'H^e/*ek4eSdo',
(60.1)
232
Scattering of Radiation
§60
where
<" iklm
^ T
X
, 1
(c,-jc)2i(cim)fi
= (2/ 2 +l)(c l k ) 2 i(C| m )! 1 1 ;
(60.2)
the bar with index 1 signifies averaging over M\.
For Rayleigh scattering, states 1 and 2 belong to the same energy level (a>i2 = 0).
If only coherent scattering is considered, then states 1 and 2 must coincide
completely, so that Mi = M2. In that case the summation over M2, and hence the
factor 2J2 + 1 in (60.2), no longer appear:
CCiklm — ( C j * ) l l ( C / m ) î 1 -
(60.3)
The result of the averaging can be written down without further calculation by
using the fact that averaging over Mi is equivalent to averaging over all orientations
of the system, after which the mean value can only be expressed in terms of the
unit tensor Sik, and the only non-zero mean values are those of products of
components of either the scalar, the symmetric or the antisymmetric part of the
scattering tensor; it is clear that the unit tensor cannot yield expressions with the
symmetry properties of cross-products. Thus
,,(21) _ r;0 o c
C iklm — VuOikOim
» ,,(21)5 , r(2\)a
+ C iklm + C iklm »
(60.4)
where
GS, = (2J2+1)I(?CTI.
c ^ I = (2J2+l)(cfk)2l(cfm)!, ,
(60.5)
cg/i- = (2J2 + lKc&)2i<cgl.)},|1.
The scattering cross-section (and therefore the scattering intensity) for a freely
oriented system is therefore a sum of three independent parts, which will be
referred to as scalar, symmetric and antisymmetric scattering.
Each of the three terms in (60.4) can be expressed in terms of one independent
quantity: the scalar scattering is expressed in terms of G2i, and for the symmetric
and antisymmetric scattering we have
C)Um — loG2l(ôijôkm + 8jm8kl ~ 35|Jkô/m),
G^ = (2J 2 +l)(cy 2 i(c? k ) 21 ;
Ciklm — hGi\(babkm
~
(60.6)
àimSkl),
G2fl, = (2/ 2 +i)(cr k ) 2 .(c3c)2.;
the combinations of unit tensors are derived from the symmetry properties, and the
common factor is then found by contracting with respect to the pairs of indices i, /
and Je, m.
1
§60
Scattering by Freely Oriented Systems
233
On substituting (60.4)-(60.6) in (60.1), we obtain for the scattering cross-section
da = oKo'3{G2i|e'* • e|2 + foG^l + |e' • e|2 - Je'* • e|2) + *G!i(l - |e' • e|2)} do'.
(60.7)
This formula shows explicitly the angular dependences and polarization properties
of the scattering.
The total cross-section for scattering in any direction, summed over the
polarization of the final photon and averaged over the polarization and direction of
incidence of the initial photon, is easily obtained directly from (60.1) by noting that
e*ek = \Sik
if the averaging is over both the polarization and the direction of propagation of the
photon; summation over these would give a corresponding result larger by a factor
2 x 477. The result is
87T
,3
(21)
er = - y a>cü;3c&V
= ^<**'\3G°2l
+ G'2l + Gh).
(60.8)
It has already been mentioned that the selection rules for scattering are the
same as those for the matrix elements of an arbitrary tensor of rank two. Because
of the separation of the scattering intensity into three independent parts, it is
convenient to state the rules for each part separately.
The selection rules for symmetric scattering are the same as those for electric
quadrupole radiation, since the latter is likewise determined by an irreducible
symmetric tensor (the quadrupole moment tensor). For antisymmetric scattering,
the selection rules are the same as those for magnetic dipole radiation, since both
are determined by an axial vector (an antisymmetric tensor is equivalent, or dual,
to an axial vector).t There is a difference here, however, in that the diagonal matrix
elements, which in the case of emission give the mean values of the electric or
magnetic moments (and do not correspond to radiative transitions), are important
in the case of scattering, since they relate to coherent scattering.
For scalar scattering the selection rules are the same as those for the matrix
elements of a scalar. This means that only transitions between states of the same
symmetry are possible. In particular, the values of the total angular momentum J
and its component M must be the same, and the matrix elements diagonal in M are
independent of M ; see QM, (29.3). For Rayleigh scattering, therefore, states 1 and
2 must coincide completely (as regards M as well as energy), and so scalar
Rayleigh scattering is entirely coherent. Conversely, since in scalar scattering all
states always combine with themselves, it follows that in coherent scattering there
is always a scalar part.
t This refers, of course, to the selection rules based on symmetry, and not due to the specific form
of the axial vector in the case of emission; the magnetic moment vector includes a spin part, whereas in
scattering we have the matrix elements of orbital (coordinate) quantities.
Scattering of Radiation
234
§60
For a system freely oriented in space, the polarizability tensor also must be
averaged over the directions of the angular momentum Ju in the same way as the
scattering cross-section has been averaged above. The averaging is very simply
carried out: we evidently have
aiks(c,k)n =(c°)n 8ik.
The symmetric and antisymmetric parts of the scattering tensor vanish on
averaging, since 5* is the only isotropic tensor of rank two.
It has been mentioned that the diagonal matrix elements of a scalar are
independent of Mi. The mark of averaging of (c% may therefore be omitted, and
this quantity calculated for any Mu so that the polarizability is
a* = (c°)u8,k.
(60.9)
For the same reason, the averaging sign may be omitted in the quantity G?i, which
determines the scalar part of the coherent scattering:
GÎ^KP^F 1 -^^;
(60.10)
the factor 2J2+1 is omitted in accordance with (60.3). Thus there is a
simple relation between the mean polarizability and the scalar part of the coherent
scattering: both are determined by the quantity
(C%,H2-T^I<U2.
„ 0)n\ — O)
(6o.li)
PROBLEMS
PROBLEM 1. Find the angular distribution and the degree of depolarization in the scattering of
linearly polarized radiation.
SOLUTION. Let 0 be the angle between the direction of scattering n' and the direction of
polarization e of the incident radiation. The scattered radiation has two independent components,
polarized one in the plane of n' and e (intensity h) and one perpendicularly to this plane (intensity /:);
the degree of depolarization is I2//1. The intensities h and Î2 are given by (60.7) with the appropriate
directions of e'.
In scalar scattering, the radiation remains completely polarized in the same plane {h = 0), and the
angular distribution of intensity is
/ = I sin2 d.
Here and below, the expressions for / = I\ + h are normalized so as to give unity on averaging over
directions.
In symmetric scattering
I = &6 + sin2 0),
hi h = 3/(3 + sin2 0).
In antisymmetric scattering
I == i(l + cos2 6),
hlh = 1/cös2 Ö.
PROBLEM 2. The same as Problem 1, but for the scattering of natural light.
§60
235
Scattering by Freely Oriented Systems
SOLUTION. Formula (60.7) can be applied to natural (unpolarized) incident light by the substitution
eirt-*2(S.jc-rtinic),
which corresponds to averaging over the direction of polarization e with a given direction of incidence n.
The scattered light will be partly polarized, and from considerations of symmetry it is evident that its
two independent components will be linearly polarized in the scattering plane of n and n' (intensity /|)
and perpendicularly to this plane (intensity h). The scattering angle between n and n' will be denoted by
For scalar scattering
J = h + 1|| = l( 1 + COS2 #),
III h = COS2 #.
For symmetric scattering
/ = à(13 + cos2 #),
îf/h = (6 +cos 2 *)/7.
For antisymmetric scattering
I = 1(2 + sin2 *),
Itlh = 1 -f sin2 #.
PROBLEM 3. For scattering of circularly polarized radiation, determine the reversal factor (the ratio
of the intensity of the component circularly polarized in the "reverse" direction to that of the
component polarized in the original direction).
SOLUTION. For circularly polarized incident radiation, the angular distribution and the degree of
depolarization (JR//±) are the same as in the scattering of natural light.
Let the vector e of the incident radiation have components (1/V2)(1, i, 0) in coordinates such that the
xz-plane is the scattering plane and the z-axis is along n. Then the polarization vectors for the reverse
and original circularly polarized components of the scattered radiation are
e' = —TZ (cos d, - i, - sin d)
and
e' = TTT (COS d, i, - sin -d).
Calculation of the intensity by means of (60.7) gives the reversal factors P for the three types of
scattering:
Do
, 4u
P =tan 2#,
where # is the scattering angle.
DJ
13 + cos2fl + 10cosfl
P =7T1
r~Z—TK
â>
13 + cos d - 10 cos d
Dfl
P
l-cos 4 iO
- 1
•4i.>
1 - sin 3d
PROBLEM 4. Calculate the cross-section for scattering of a low-frequency photon by a hydrogen
atom in the ground state.
SOLUTION. A low-frequency photon can undergo only elastic scattering. Since the orbital angular
momentum J of the hydrogen atom in the.ground state is zero, the selection rules (neglecting the spin-orbit
interaction) allow only scalar scattering. The static polarizabÜity of the atom is (in ordinary units)
a - (9/2)(ft2/me2)3; see QMy §76, Problem 4. Substitution in (60.8) gives the required cross-section:
a, = 54<rr(ü>lc)\h2lme2f.
PROBLEM 5. Calculate the cross-section for elastic scattering of y rays by a deuteron (H. A. Bethe
and R. E. Peierls, 1935).
SOLUTION. The wave functions of the deuteron ground state and of its continuous-spectrum states
(the dissociated deuteron) are
see (58.2), (58.3). The matrix element of the dipole moment is dpo = -ieppo/Mcopo and has been calculated
236
§60
Scattering of Radiation
in §58:
_
I K
4 Trie
p
with the frequencies ü>po = (p2 + K 2 ) / M
The polarizability tensor is
" 13 J ^ o ^ ? | d o p | ( 2 ^ " 2 M ^ r k The first term is due to the virtual excitation of the internal degrees of freedom of the deuteron, and is
written in the form (60.11). The second term is due to the action of the wave field on the translational
motion of the deuteron as a whole. Since this motion is quasi-classical, the corresponding part of the
scattering tensor is given by (59.14), with m replaced by the deuteron mass 2M.
The calculation of a* depends on that of the integral
■j
4
j
(zJ+i)Jt(zJ+i)'-7!r
2 = P/K
? = M <*/« 2 = <"/'•
'
We have
VA dÀ/J»=r
s[dk
where
J
' - 2 _ J < r 2 + Aï)[(z1+l>l-yJr
When y < 1, the integrand has poles at the points iA, i V ( l + 7), i V ( l - 7) in the upper half-plane of the
complex variable z ; the integral Jo can be calculated from the residues at these poles. The result is
J.
1 r(i ++Y ) } / 2 , ( i - Y / 2
-r[-rr- -w~
p
i\i
w+7)y
The total scattering cross-section is expressed in terms of a* by (60.8), and is (in ordinary units)
8TT/
e2 V\
4
i + T ^ 10 + y)m + O - y)m]\
37 37
for y = hull < 1.
For 7 > 1 the scattering amplitude (above the deuteron dissociation threshold) is found from that for
7 < 1 by analytical continuation; it has an imaginary part, which must be positive (in accordance with the
avoidance rule in (59.17)):
8ir / e2 \ 2 | ,
4
2 ,
. .0/2 . . 2
3
for 7 > 1.
When 7 > 1 we have or = (87r/3)(e2/Mc2)\ which agrees, as it should, with (non-relativistic) scattering by
a free proton.
The angular distribution of radiation is
do- = <r-4(l+cos 2 0)do/47T,
where 0 is the scattering angle. If the scattering amplitude is defined by (59.24), we have
im/(0) =
3Mc3
J
{0Ty>L
According to the optical theorem (59.26), this quantity must equal OXTIMTT, where <jt is the total
I
§61
Scattering by Molecules
237
cross-section for photodissociation (58.4). The elastic scattering cross-section is of a higher order (~~eA)
than the dissociation cross-section (~e 2 ; see (58.4)), and therefore at is equal to the dissociation
cross-section. For the same reason, in the approximation considered, the scattering amplitude was found
to be real for y < 1 (i.e. below the dissociation threshold).
§61. Scattering by molecules
The specific properties of molecular scattering are due to the same properties of
molecules as form the basis of the theory of molecular spectra, namely the
possibility of treating separately the state of the electrons with the nuclei fixed and
the motion of the nuclei in a given effective field of the electrons.
Let the frequency o> of the incident radiation be less than the energy <oe of the
first electron excitation. Then the electron terms will not be excited in the
scattering process. The scattering will be either Rayleigh scattering, or Raman
scattering due to the excitation of rotational or vibrational levels.
Let us further assume that the electron ground term of the molecule is not
degenerate (and has no fine structure). That is, we assume that the total spin of the
electrons and the component of their total orbital angular momentum along the axis
of the molecule (for molecules of the symmetrical-top type) are both zero.
For diatomic molecules this means that the electron ground term must be *2.
These conditions are known to be satisfied for the ground states of most molecules. t
Finally, we shall assume the frequency eu large compared with the intervals in
the nuclear (rotational and vibrational) structure of the ground term, and the
difference (oe - œ to be in a similar relation to the nuclear structure of the excited
term. Thus the frequency of the incident radiation must be sufficiently far from
resonances. These conditions make it possible, in calculating the scattering tensor,
to ignore at first the motion of the nuclei and to discuss the problem with a given
configuration of the nuclei.
In such a problem, the scattering tensor is the same as the polarizability tensor,
oLik = (Cik)n, and can in principle be calculated from the general formula (59.17), in
which the summation is over all excited electron terms. The quantities a* thus
obtained will be functions of the coordinates q of the nuclear configuration (the
energies and wave functions of the electron terms depend on these coordinates as
parameters). Since the state is not degenerate, the tensor a^iq) is real, and
therefore symmetrical.
The tensor aik(q) is the electronic polarizability of a given nuclear configuration
in the molecule. To solve an actual problem of scattering, we have also to take into
account the motion of the nuclei in the initial and final states. Let ^,,(q) and ^(q)
be the nuclear wave functions of these states, sx and s2 being the sets of vibrational
and rotational quantum numbers. The required scattering tensor is the matrix
t The results given below are, however, valid (to a certain approximation) also for cases where
degeneracy of the electron ground term is due to a non-zero spin, the spin-orbit interaction being small
(so that the resulting fine structure may be neglected). In this approximation, states with different spin
directions do not combine, and in this sense they behave as if they were not degenerate. The molecule
O2, with ground term 3 2, is of this type.
Scattering of Radiation
238
§61
element of the tensor aik(q) with respect to these functions:
<S2|«ik|si> = J ^f2(q)aik(q)i//Sl(q) dq.
(61.1)
Because the tensor a*(q) is symmetrical, so is the tensor (61.1) (whether S\ and s2
are the same or not). Thus we conclude that, under the conditions stated, there will
be no antisymmetric part in either Rayleigh or Raman scattering. The scattering
will include only scalar and symmetric parts.
The scalar part <x°(q) of the polarizability is independent of the orientation of
the molecule, and depends only on the internal configuration of the atoms within it.
Let v denote the set of vibrational quantum numbers of the molecule, and r the set
of rotational numbers other than the magnetic number m. Then the matrix elements
are
{v2r2m2\a*\vxrxmx) = <v2i«°|i;1>ôrir2ôm|m2.
(61.2)
The diagonality with respect to the numbers r and m is true of any scalar. The
particular property of (61.2) is that here the elements do not depend on these
numbers at all. Thus the scalar scattering occurs only for purely vibrational
transitions and does not depend on the rotational state.
The symmetric scattering is determined by the matrix elements of the tensor
a?*. Its components in a fixed coordinate system xyz are expressed in terms of the
components ä'>k> in a system £TJ£ moving with the molecule by
ai = ^ â k - D A ,
(61.3)
where the D;, are the direction cosines of the new axes relative to the old. The
quantities 5,V do not depend on the orientation of the molecule, and the DVl do not
depend on the internal coordinates. Hence
(v2r2m2\a\k\vxrxmx) = ^
{v^ähM{r2m^Dv^k\rxm^.
The sum of the squared moduli of these quantities over r2, m2, i, k is easily seen to
bet
2
2 \{v2r2m2\a>ik\vxrxmx)\2 = V \(v2\äh]vx)f.
t In transforming the sum we use the equation
S2
(nmilDoDktlnmàirimilDu'IXg'lnmi)
= (rimi 2 DuDkgDirDkAr\mn = (r.imi|5irSOT<|rimi) = 6 f r6 a s
which expresses the unitarity of the matrix Dik.
(61.4)
§61
239
Scattering by Molecules
This means that the total intensity of scattering with transitions from a given
vibrational-rotational level vx, rx to all rotational levels of the vibrational state v2 is
independent of rv
For molecules of the symmetrical-top type, we can go further and derive a
relation between the scattering intensity and the rotational quantum numbers for
every transition v\rx -► v2r2. In this case the numbers r are the angular momentum J
and its component k along the axis of the molecule. We replace the Cartesian
components of a5ik by the corresponding spherical tensor of rank two, denoting its
components by aA (A = 0, ± 1, ±2). According to QM, (110.7), the squared moduli of
its matrix elements are
\(v2J2k2m2\aK\vxJxkxmx)\2
- (y.+1X2J,+i)(_£ I *)'(_£ I ^)W,| C ,>r,
where äx(q) is the spherical polarization tensor relative to axes fixed in the
molecule, and A' = k2 —fei-Summing over m2 and A = m 2 -mi (with m fixed), we
obtain (cf. QM, (110.8))
2j<U2J 2 fc 2 m 2 |^
l
fyMävM2.
(61.5)
This quantity determines the intensity of scattering with the vibrational-rotational
transition v\J{k\-+ v2J2k2. Since the matrix elements (I>2|«A'|I>I) do not depend on the
rotation of the molecule, this also defines the dependence of the intensity on Ju h
and on ku k2. The right-hand side of (61.5), it may be noted, involves only one
spherical component of the polarizability tensor.
Summation of (61.5) over J2 and k2 givest
E
À
2
J2» *2»
m
2
Kü2/2k2m2|aA|üiJikimi)|2 = 2 ItolâxWI2»
*'
and we return to the sum rule (61.4).
A special case of the symmetrical top is the rotator, a linear molecule (or, as a
particular instance, a diatomic molecule). The angular momentum component along
the axis of such a molecule is zero (in a non-degenerate electronic state with zero
electronic orbital angular momentum).t In this case, therefore, we must put
k1 = k2 = 0in(61.5).
Finally, let us consider the question of the selection rules in vibrational Raman
t In the summation over J2 with given ki, A' (and k2 = k\ -I- A'), we have
?<»♦.)(_* v ÏÏ-1
according to QAf, (106.13). The summation over k2 (or, equivalently, over A' - k2-k\) is then effected
for given k\.
t Here we do not include effects due to the interaction between the vibrations and the rotation of the
molecule (see QM, §104).
240
Scattering of Radiation
§62
scattering, together with the cognate question of vibrational emission (or absorption) spectra of molecules.t
For scattering, the problem is simply to find the conditions under which there
are non-zero matrix elements of the tensor aik(q) with respect to the vibrational
wave functions iK(q); the scalar a 0 (for scalar scattering) and the irreducible
symmetrical tensor afk (for symmetric scattering) have to be considered separately.
A corresponding role in emission (or absorption) is played by the matrix elements
of the vector d(q), the dipole moment of the molecule averaged over the electronic
state with a given position of the nuclei. This has already been stated in §54 for
diatomic molecules.
The vibrations of a polyatomic molecule are classified according to types of
symmetry, the irreducible representations Da of the corresponding point group,
where a numbers the representation (see QM, §100). These representations also
define the symmetry of wave functions of vibrational states of the molecule (see
QM, §101). The symmetry of the wave functions of the first vibrational state
(quantum number va = 1) is the same as the symmetry Da of the vibration type; the
symmetry of the higher states (va > 1) is given by the representations [Dvaa]y which
are symmetric products of va representations Da. Finally, the symmetry of states in
which different vibrations a and b are simultaneously excited is given by the direct
product [Dvaa] x [D&].t The selection rules for the various quantities (scalar, vector,
tensor) with respect to types of symmetry are found as described in QM, §97.
The selection rules resulting from the symmetry properties of the molecule are
rigorous. There are also approximate rules based on the assumption that the
vibrations are harmonic and that the functions <*,*((?) or d(q) can be expanded in
powers of the vibrational coordinates q. These are a consequence of the known
selection rule for a harmonic oscillator, according to which the matrix elements of
the oscillator coordinate q are zero except for transitions in which the change in
the vibrational quantum number At; = ± 1 .
§ 62. Natural width of spectral lines
So far, in the study of emission and scattering of radiation, we have regarded all
the levels of the system (an atom, say), as being strictly discrete. But in fact excited
levels have a certain probability of decay by emission, and therefore a finite
lifetime. According to the general principles of quantum mechanics, this has the
result that the levels become quasi-discrete, with a certain small but finite width
(see QM, §134); they can be written in the form E - 21T, where T(=r/A) is the total
probability (per unit time) of all possible processes of "decay" of the state
concerned.
Let us consider how this situation affects the process of emission (V. Weisskopf
and E. Wigner, 1930). It is evident that, because of the finite width of the levels, the
emitted radiation will not be strictly monochromatic: its frequencies will be spread
t These spectra lie in the infra-red, and are usually observed as absorption spectra.
t The symmetry properties of the vibrational wave functions are, of course, independent of the
specific form of the vibrational potential energy, and in particular are independent of the assumption
made in QAf, §101, that the vibrations are harmonic.
§62
Natural Width of Spectral Lines
241
over a range Ao> — T (=T/fi). But, in order to measure the frequency distribution of
the photons with this accuracy, the time needed is T > L/Aa> — 1/I\ During this time
the level will almost certainly decay by emission. We therefore have to deal with
the determination of the total probability of emission of a photon of a given
frequency, not with the probability per unit time. We shall calculate this total
probability, first of all, for a transition of an atom from some excited level E\ -21T1
to the ground level E2> which has an infinite lifetime and is therefore strictly
discrete.
Let ^ be the wave function of the atom and the photon field, and H = H(0)+ V
the Hamiltonian of the system, where V is the atom-field interaction operator. We
shall seek a solution of Schrödinger's equation
i ^ = (H ( 0 ) +V)*
01
(62.1)
in the form of an expansion in terms of the wave functions of the unperturbed
states of the system:
* = 2 a¥im? = 2 aAt) e-"'*?.
v
v
(62.2)
For the coefficients av(t) we obtain the equations
' % = 2 <"l Vk><V exp{i(g„ - «,)*}.
(62.3)
or „•
Let | v) be a state with energy %v = E2 + w, in which the atom is at the ground
level Ei and there is one quantum with a definite frequency w; this state will be
symbolized by |o>2>. At the initial instant, the system is in the state |1), the atom
being excited to the level Ex, with no photons present. Thus, for t = 0 we must have
a,= l,
a,=0
for|i/')*|l>.
(62.4)
The solution of equation (62.3) with this initial condition will give (with the
appropriate normalization of the wave functions) the probability that at time t there
has been a transition 1 -* 2 of the atom with emission of a photon in the frequency
range da>: it is |a«,,2(0|2 d<o. We are interested in the ultimate probability as *-►<»:
dw = k2(oo)|2 deo.
(62.5)
In order to clarify the problem, it may be recalled that, in finding the ordinary
emission probability (per unit time) with a transition l-»2 (neglecting the level
width), equation (62.3) has to be solved with all the a„(f) on the right-hand side
replaced, to a first approximation, by the values (62.4). The solution thus obtained
is then examined for large t; cf. QM, §42. We can now describe this procedure
more precisely; it relates to times short in comparison with the lifetime of the
excited level, and the large values of t concerned are large compared with
l/(Ei - E2) but small compared with 1/IY
242
§62
Scattering of Radiation
In our present case, where times comparable with 1/T| are considered, the
function a\{t) decreases in time according to
a,(0 = *" !iv .
(62.6)
The functions av(t) for states \v') which can result from emission by the atom
increase with time, however. If the transition from a given level Ex can occur to
various atomic levels (as well as to E 2 ), there will be many increasing functions
av(t), each corresponding to a state in which the atom is at a certain level and there
is one photon with the appropriate energy. Nevertheless, there still remains on the
right of (62.3) only the term with |i/) = |l): since the matrix elements are zero
except for transitions in which the number of photons with some one energy
changes by 1, they are certainly zero for transitions between states containing one
photon each, with different energies.
Thus we have for a^t) the equation
i ^
= <co2|V|l)e l<E ^- E ' )c a 1
= <a>2| V|l) exp{i(co - û>l2)f - UV},
(62 J)
where co\2 = E\ - E2. Integration, with the condition aw2(0) = 0, gives
a . ^ M V l l )
1
"
6
^ ^ - ^ ! " ^
O) ""* ù)\2 + ill
i
(62.8)
Hence the probability dw (62.5) is
dw = |(a>2|V|l)2 , „ * +
(0) ~ (On)
.1 r *
+4I 1
Since the width T\<o)\29 we can put o> = cüi2 in the factor |(co2|V|l)|2. Then the
quantity 27r|(a>2|V|l)|2 is the ordinary probability (per unit time) for the emission of
a photon with frequency <o\2 and other properties besides the frequency, such as
the direction of motion and the polarization, whose existence has so far been
ignored in order to simplify the notation. The dependence of the probability on
these characteristics is entirely determined by the factor |(o>2|V|l)|2. Thus the
allowance for the level width does not affect the polarization properties or the
angular distribution of the radiation.
The sum
r ^ 2 = 2 T r S |<cu2|V|l>|2,
(62.9)
taken over the polarizations and directions of motion of the photon, is the usual
total probability of emission. It is also the part of the width of the level £1 (the
partial width) which is due to the transition 1 -► 2, as distinct from the total width
1
§62
243
Natural Width of Spectral Lines
T|, which is made up of contributions from all possible modes of "decay" of the
quasi-stationary state considered.t
By a similar summation of the probability dw, we obtain the following final
formula for the frequency distribution of the emitted radiation:
<62l0)
""'"'^-t'Kirî-
where w, = T^2I^\ is the total relative probability of the transition l-»2. This is a
dispersion-type distribution. The shape of the spectral line that is given by formula
(62.10) is that which occurs for an isolated atom at rest, and is called the natural
shaped
Now let the level E2 of the atom be also an excited level with a finite width T2.
We shall assume for simplicity that this width is due to transitions of the atom to
the ground state EQ with the emission of one photon; the final result (62.12) will not
depend on this assumption. The decay of state 1 can then be regarded as an
emission of two photons, discussed in §59. The matrix element for this process (not
yet taking account of the finite lifetime of state 2) is
(62.n)
^^.isçŒ&pgm.
fco-" £ 2 » w ~*~ t\)
the state 2 in (59.2) becomes state 0, and in the sum over n the only remaining term
corresponding to the atom in st^te 2 is the one which is large by resonance when a/
is close to E2 - E0. If we now take account of the finite lifetime of state 2, this
simply changes E2 into E2-\iY2 in (62.11), giving
<wo|y«ii)=<7,0Wf>Hffl>.
CQ—
C2+
(0 + 2*1 2
Substituting this value of the matrix element in the equation for a^iiO (which
differs from (62.7) only as regards notation), we obtain by a derivation exactly
similar to that of (62.8)
n
,fooW
<a>a>;0lV|ü>2)<co2|Vll)
(û) - û>20 + 2'I 2){<t> + 0> — COio+2'Il)
The probability of emission of the photons u> and o>' is
dw = |aWo(°°)|2dû>dtt)'
_ rt->2 r 2 -o
2
day dot'
2
i-n 2TT [(«' - <o 20 ) +im(<»+<»'- Û>,O) +*nr
an
n\
^
lAl>
t Formulae (62.6) and (62.9) can, of course, also be obtained by solving the equation for a\{t)
analogous to (62.7).
We may note that transitions to states of the continuous spectrum, causing a finite level width, do not
necessarily involve the emission of photons. Highly excited (X-ray) levels can decay with emission of an
electron and formation of a positive ion in the ground state (the Auger effect).
t As distinct from the broadening caused by the interaction of the atom with other atoms (collision
broadening) or by the presence of atoms in the source which move with various velocities (Doppler
broadening).
§63
Scattering of Radiation
244
This expression has sharp peaks at o>' =» <u2o and at o> « o>I2, as it should.
The shape of the spectral line corresponding to the transition 1 —> 2 is obtained
by integrating (62.12) with respect to a/; the range of integration can extend from
-oo to + ». The integral is most simply calculated by closing the contour of
integration with an infinite semicircle in the upper half of the complex o>'-plane, and
is given by the sum of the residues of the integrand at the poles
(Of = CUIQ— 01 + 2 l T i .
(*>' = CÜ20 + 2 ^ 2 ,
The result is
j
Fi + T2
d w = w, - ^
27T
da)
,},Lr
,
r
A
(0>-CUi 2 ) + 4 ( l i + r 2 )
_x
(62.13)
tr0%
where vvr = Vx^i-dTi^i
is the total probability of the double transition 1 ->2->0.t
The line shape (62.13) differs from (62.10) only in that Y\ is replaced by Tx + T2:
the line width is equal to the sum of the widths of the initial and final states.
The line width is not, in general, equal to the probability T\^2 of the transition
l-*2 itself, i.e. is not proportional to the line intensity as in the classical theory.
Since Ti + T2 > r ^ » the line can have a large width with a relatively small intensity.
§63. Resonance fluorescence
The allowance for the finite width of the levels in problems of radiation
scattering is important when the frequency o> of the incident radiation is close to
one of the "intermediate" frequencies (on\ or o>2n; this is called resonance fluoréscence (V. Weisskopf, 1931).
Let us consider Rayleigh scattering by a system (an atom, say) in the ground
state, so that the initial and final levels are the same and are strictly discrete. Let
the frequency of the radiation be close to a certain frequency o>„i, where the level n
is an excited level and is therefore quasi-discrete.
This problem could be solved by the method shown in §62, but there is no need
to do so, since it is exactly analogous to the problem of non-relativistic resonance
scattering at a quasi-discrete level (QM, §134). According to the results derived
there, the scattering amplitude must contain a pole factor
1
<o-{En-tirH-Exy
When |o> - o>ni| > r„, on the other hand, the result must tend to the non-resonance
formula (59.5). It is therefore clear that the required scattering cross-section is
obtained by simply replacing En by En-\iTn
in (59.5); the sum over n can be
t In more complex cases, wt is the total probability of all cascades which begin with the transition
1 -► 2 and finish at the level 0.
§63
Resonance Fluorescence
245
restricted to the resonance terms:
E(d 2 n -e'*)(d„,-e)
dcr= 7
vt , l r t ' o>4do'.
LtL
( < 0 „ i - u ) ) + 4I n
(63.1)
The summation is over all states (having different angular-momentum components
M„) corresponding to the resonance level E„; the states 1 and 2 belong to the same
level (the ground level), but may differ in the values Mi and M2.
The cross-section (63.1) has its maximum value when o> = ionl, and this value is,
in order of magnitude, <rmM « <t>4d4/r2. Since the probability of the spontaneous
transition n -» 1, and hence the width T„, ~û>3d3, this value is
a max ~l/o> 2 ~X 2 ,
(63.2)
of the order of the square of the wavelength and independent of the fine structure
constant, as compared with typical values ~ r2 outside the resonance region.
It must be emphasized that, since the atom is at a strictly discrete level (the
ground level) before and after the scattering, the frequencies of the primary and
secondary photons are exactly the same. If the incident radiation is monochromatic, the scattered line will therefore be monochromatic also. If the incident
radiation has a spectral intensity distribution J(<o) which varies only slightly over
the width T„, the intensity of scattered radiation will be proportional to
I((o„i)d(t>
,„
STTTrT2
^
(63.3)
(<o-ü>n,) + Jn
Thus the shape of the scattered line will be the same as the natural shape for
spontaneous emission from the level En.
The cross-section (63.1) corresponds to the scattering tensor
^
(di)2„(dk)n\
.
i 1: 7
r ^■
(On
0)„|-0>-2lln
((c,k)2i
c , f c ) 2 .= ^M"
(63.4)
In particular, the polarizability tensor is
2 (d,)i„(dk)n\
a,k = (cik)„ = f*—
i.r .
(63.5)
It can be seen immediately that the addition of an imaginary part to the energy
levels of the intermediate excited states makes the polarizability tensor no longer
Hermitian, even at frequencies below the ionization threshold. It contains an
imaginary part which is directly related to the absorption of radiation.
After absorbing a photon, the atom will sooner or later return to the ground
state, emitting one or more photons. The absorption cross-section, viewed in this way,
246
Scattering of Radiation
§63
is just the total cross-section at for all possible scattering processes.! On the other
hand, according to the optical theorem (59.25), the cross-section can be expressed in
terms of the anti-Hermitian part of the polarizability tensor.
Substituting in (59.25) the tensor aik from (63.5), we find the following formula
for the cross-section for absorption of a photon with frequency o> close to coni:
.„=4^|d,,.eP^ [(M _^ )} + j r , r
(63 . 6 )
In the limit as Tn ->0, the last factor tends to the delta function 8(co - co„i), in
accordance with the fact that in this case only a photon having one particular
frequency can be absorbed. Let radiation with a spectral and angular energy flux
density Ikt (cf. (44.7)) be incident on the atom. Then the flux density of number of
photons is (IkJco) dco do, and the probability of absorption is
dwa = aa(IkJ(A)) dco do.
(63.7)
If the function ht(o)) varies only slightly over the width Tn, then we have after the
integration over frequencies
dwa = 4TT2 X |dm • e|2Ike((onl) do.
According to (45.5),
dwSp = T~^ ld"i # e*| 2 do
= f£S|d*i-e| 2 do
is the probability of spontaneous emission of a photon having the frequency cu„i;
thus we return to formula (44.9).
t This discussion, it must be emphasized, refers to absorption by a system in its stable ground state.
The problem would have to be stated differently for an excited state, because of the finite duration of
the experiment.
CHAPTER VII
THE SCATTERING MATRIX
§ 64 The scattering amplitude
THE general problem concerning collisions is to find, for a given initial state of the
system (an assembly of free particles), the probabilities of various possible final
states (other assemblies of free particles). If |i) denotes the initial state, the result
of the collision can be represented as the superposition
2 \fXf\S\i),
(64.1)
/
in which the summation is taken over the various possible final states |/). The
coefficients in this expansion (f|S|i) (or, more concisely, S/,), form the scattering
matrix or S-matrix. The squares \Sfi\2 give the probabilities of transitions to
particular states |/).
If there were no interaction between the particles, the state of the system would be
unchanged, corresponding to a unit S-matrix (absence of scattering). It is convenient to separate this unit matrix in all cases, writing the scattering matrix in the
form
Sfi = 8fi + i(27t)A8{4)(Pf
- Pt)Tfh
(64.2)
where Tfi is another matrix. In the second term we have written separately the
four-dimensional delta function which expresses the law of conservation of the
4-momentum (P, and Pf being the sums of the 4-momenta of all the particles in the
initial and final states); the other factors are included for subsequent convenience.
In the non-diagonal matrix elements, the first term in (64.2) does not appear, and so,
for the transition i->/, the elements of the matrices S and T are related by
Sri = i(27r)4S(4)(P/ - PJTfi.
(64.3)
The matrix elements Tfi which remain after separation of the delta function will be
called the scattering amplitudes.
When the moduli \Sfi\ are squared, the square of the delta function appears, and
is to be interpreted as follows. The delta function comes from the integral
Ô(V/ - Pô = J^y j e«*r«» d<x.
(64.4)
If another such integral is calculated with Pf = Pf (since one delta function is
247
248
The Scattering Matrix
§64
already present), and if the integration is taken over some large but finite volume V
and a time interval f, the result is Vf/(27r)4.t Thus we can write
|S/l|2 = (27r)4fi(4)(P/-P,0|T/i|2Vt.
Dividing by t, we obtain the transition probability per unit time:
wf^ = (2TT)4Ô(4)(P/ - Pi)\Tfi\2V.
(64.5)
Each of the free particles, initial and final, is described by its own wave
function—a plane wave having some amplitude u (a bispinor for an electron, a
4-vector for a photon, and so on). The structure of the scattering amplitude Tfi is of
the form
T,i = i i î i i î . . . Q i i i i i 2 . . M
(64.6)
where on the left we have the amplitudes of wave functions of final particles, and
on the right those of initial particles; Q is some matrix relating to the indices of the
wave amplitude components of all the particles.
The most important cases are those where the initial state comprises only one
or two particles. Then we have respectively the decay of one particle or the
collision of two particles.
Let us first consider the decay of a particle into any number of other particles
having momenta p'a in an element II d3p'aof momentum space; the suffix a labels
the particles in thefinalstate, so that 2 p'a = P/. The number of states in this element
and in the normalization volume^ V is
nvd 3 pi/(27r) 3 .
a
The expression (64.5) must be multiplied by this quantity:
dw = (2TT)4Ô(4)(P/ - Pi)\Tfi\2V n Ï0*.
(64.7)
The wave functions used in calculating the matrix element must be normalized
to "one particle in the volume V". For an electron, e.g., the wave function is the
plane wave (23,1); for a particle with spin one it is (14.12); for a photon it is (4.3).
All these functions include the factor 1/V(2EV), where e is the energy of the
particle. Henceforth, however, it will be convenient to omit such factors in the
wave functions, and include them in the expression for the probability. Thus the
t This can be shown in a différent way byfirstcalculating the integral over each coordinate in (64.4)
for a finite range and then making the limits tend to infinity by means of QM, (42.4):
.. sin2 ag
lim —jnr* = Tro(a).
*~ £a
$ For greater clarity, in the calculations in thii section, we shall not take V to be unity.
§64
249
The Scattering Amplitude
electron plane wave will be
ilß = ue'ip\
üu=2m,
(64.8)
and the photon wave
A = V(4îr)ee-fa,
ee* = - 1 ,
ek = 0.
(64.9)
The scattering amplitude calculated with these functions will be denoted by Mfi
to distinguish it from Tfi. Evidently
Mr
Tfi =
(2eV...2e;V...) , / 5 ;
(64 10)
'
the denominator contains one factor V(2eV) for each initial orfinalparticle.
In particular, the decay probability is, instead of (64.7),
dw = (2TT)4§(4)(P/ - P.OIM/,-!2 j ; f i r? fv?i M
ze a VZ7T; zefl
(64.11)
where e is the energy of the decaying particle; as we should expect, the normalization volume does not appear in this formula.t
Formula (64.11) can be given a more definite form by eliminating the delta
functions, if the decay produces two particles (with momenta p(, pi and energies ei,
si). In the rest frame of the decaying particle pî = - p i = p', e[ + e£ = m. We have
d w =
^
, M / l | 2
^ 4 ^
The first delta function is eliminated by integration over d3pu the differential d3pî
is written as
d3p' = p'2d|p'|d°
= |p'| do g ^2d(si + a0
c |
T
c2
(6412)
The validity of this is easily seen by noting that E\2- ml2= ei 2 - m'22 = p'2. The
integration over e\ + E'2 eliminates the second delta function, and the result is
^ = 32^|M / / | 2 |p1do'.
(64.13)
Let us now consider a collision of two particles (having momenta pi and p2 and
t If thefinalparticles include N which are identical, a factor 1/N! must be inserted when integrating
over their momenta to obtain the total probability; this factor takes into account the identity of states
which differ only by an interchange of the particles.
250
The Scattering Matrix
§64
energies e\ and £2), in which they are transformed into any number of particles
having momenta p^. Instead of (64.11) we now have
The quantity that is of interest in this case is, however, not the probability but
the cross-section der. The cross-section invariant under the Lorentz transformations is obtained from dw on dividing by
j = I/Ve,e2,
(64.14)
/ = V[(plP2)2-m?m^];
(64.15)
where I denotes the 4-scalar
see Fields, §12.t In the centre-of-mass system (pi = -p2 = p)
I = |p|(ei + e2),
(64.16)
so that
which is the same as the usual definition of the flux density of colliding particles, vx
and v2 being their velocities.$ Thus the cross-section is
da = (27r)46(4>(P/ - P»)|M„|2 JJ U J^ffc-
(64.18)
This formula can be put into its final form by eliminating the delta function for
the case where in the final state also there are only two particles. Let us consider
the process in the centre-of-mass system, and let e = e\ + e2 = e[ + EPI be the total
energy; pi = -p2 = p and p5 = -~P2^p; be the initial and final momenta. The delta
function is eliminated in the same way as in the derivation of (64.13), and the result
tFor future reference, another form of / is
I 2 = i[s - (m, + m2)2)[s - (m, - m2)2],
(64.15a)
where s =(pi + P2)2.
t In an arbitrary frame of reference,
i = ^V[(v1-V2)2-(v,xv2)2].
This expression is the same as the ordinary flux density whenever vj is parallel to V2; then j = |vj - V2I/V.
§64
The Scattering Amplitude
251
is
"^M^'l2^'
(64 19)
-
in the particular case of elastic scattering, where the nature of the particles is
unchanged in the collision, |p'| = |p|.
This formula can be written in yet another form by using the invariant quantity
t = (P\ ~ Pi)2 = rni + mi2 - 2(p!p',)
= m i + mî2 - 2e,e [ + 2|pi||pi| cos 0,
(64.20)
where 8 is the angle between pi and pi. In the centre-of-mass system the momenta
|pi| = |p| and |pi| = |p'| are determined only by the total energy e, and when e is
given we have
dt = 2|p||p'| d cos e.
(64.21)
Hence, in (64.19),
do>=
- d ^ c o s ö = ^ ^ ,
2|p||p I
where <f> is the azimuth of pi relative to pi.t Thus
d<-j^|M„r£f,
(64.22)
where I is again the invariant (64.16). The azimuth <£, and therefore the crosssection in the form (64.22), are invariant under those Lorentz transformations
which do not change the direction of relative motion of the particles. If the
cross-section is independent of the azimuth, formula (64.22) takes the particularly
simple form
da = ^\Mfi\2^
(64.23)
If one of the colliding particles is sufficiently heavy (and its state is unaltered by
the collision), it acts only as a fixed source of a constant field in which the other particle
is scattered. Since the energy (though not the momentum) of the system is conserved
in a constant field, in this treatment of the collision process we can write the S-matrix
elements in the form
Sfi = i • 2ir8(Ef - E{)Tfi.
(64.24)
t Since the correct sign of the differential in such cases is obvious, we shall henceforward write
simply dt for d(-f), and so on.
252
The Scattering Matrix
§65
In the expression |S/,| 2 , the square of the one-dimensional delta function must be
interpreted as
[ôiEf-Eitf^j-SiEf-Eùt.
LIT
Now, as in the derivation of (64.11), we change to the amplitude Mfi instead of Tfh
and obtain the following expression for the probability of a process in which one
particle is scattered in a constant field and produces in the final state a certain
number of other particles:
Here e (= Et) is again the energy of the initial particle, p'a and e'Q the momenta and
energies of the final particles. The scattering cross-section is found by dividing dw
by the flux density j = i>/V, where v = |p|/e is the velocity of the particle that
undergoes scattering. The normalization volume again disappears, and the result is
dCT = 2 . « ( E , -
£
)|M,,|'^n^7.
(64.25)
In the particular case of elastic scattering, there is only one particle in the final
state, with the same energy and the same momentum (in absolute value). Writing
d V ^ p ' 2 d | p 1 do'= | p y d e ' d o '
and eliminating S ( e ' - e) by integrating with respect to e\ we find the cross-section
in the form
d(j = T 7 ^ | M / l | 2 d o ' .
(64.26)
Finally, if the external field is time-dependent, such as the field of a system of
particles executing a given motion, the S-matrix also lacks the delta function of
energy. Then S/,-= iT/,- and, after the change from Tfi to Mfi by (64.10), the
probability of (e.g.) a process in which the field creates a given set of particles is
"»•wn&fk-
<".27)
§65. Reactions involving polarized particles
In this section we shall show by means of simple examples how the state of
polarization of the particles concerned in the reaction is taken into account when
calculating the scattering cross-section.
§65
Reactions Involving Polarized Particles
253
Let there be one electron in the initial state and one in the final state. Then the
form of the scattering amplitude is
Mfi = û'Au («fiJAftiik),
(65.1)
where u and u' are the bispinor amplitudes of the initial and final electrons, and A
is some matrix, which depends on the momenta and polarizations of the other
particles (if any) which take part in the reaction.
The scattering cross-section is proportional to |M/,|2, and
(ü'Aw)* = u'y°*A*u*
= u*AVV
= üAu\
(65.2)
wheret
Ä = 7°A V .
Thus
\Mri\2 = {ü'Au)(üÄu')
s n;.fi£AUM,ömÄmi-.
(65.3)
If the initial electron is in a mixed (partially polarized) state with density matrix
p, and if we wish to find the cross-section for a process in which the final electron
is in a specified polarization state p \ the products of the bispinor amplitude
components must be changed as follows: ui-ü&->?!-*, u{üm -+pim. Then
\Mfi\2 = tr(p'ApÄ).
(65.4)
The density matrices are given by formula (29.13):
P=kyp
+ m)(l-y\ya))
(65.5)
and similarly for p'.
If the initial electron is unpolarized, then
P = i(yp + m).
(65.6)
Substituting this expression is equivalent to averaging over the polarizations of the
electron. If it is desired to determine the cross-section for scattering with any
t Since the matrix Ä has to be constructed, we shall note here, for future reference, the following
easily verified equations:
7
_ : / ï 7 V . . . / = /...7Y.)
y
(65 ,2a)
254
§65
The Scattering Matrix
polarization of the final electron, we must also put p' = \{yp' + m), and double the
result; this operation is equivalent to summation over the polarizations of the
electron. Thus we have
22
polar
\Mfl\2=]2tr{yp'+m)A(yp
+ m)Äl
(65.7)
where the sum is taken over initial and final polarizations, and the factor { converts
one summation into an averaging.
The density matrix p' in (65.4) is a secondary quantity which essentially
represents the properties of the detector as selecting one or the other polarization
of the final electron, not the properties of the scattering process as such. There is
the question of the polarization state of the electron resulting from the scattering
process itself. If p a ) is the density matrix of this state, then the probability of
detecting an electron in the state p' is obtained by projecting p a ) on p', i.e. by taking
the trace tr (p a ) p'). This will be proportional to the corresponding cross-section, i.e. to
|M/j|2. A comparison with (65.4) shows that
p^-ApA.
(65.8)
Since we know that p a ) must have the form (65.5) with some 4-vector a a ) , we need
only determine the latter. This could be done by means of formula (29.14), but it is
even simpler to proceed as follows.
We have seen in §29 that the components of the 4-vector a can be expressed in
terms of those of the 3-vector Ç which is (twice) the mean value of the electron spin
in its rest frame. The polarization states of the electrons are entirely determined by
these vectors, and it is convenient to express the scattering cross-section also in
terms of them. The square |M/j|2 will clearly be linear in each of the vectors Ç and £'
which relate to the initial and final electrons, and its form as a function of Ç' will be
|Mfl|2 = a + ß - £ \
(65.9)
where a and ß are themselves linear functions of Ç.
The vector Ç' in (65.9) is the particular polarization of the final electron that is
selected by the detector. The vector Ç(/), corresponding to the density matrix p{J\ is
easily found as follows. According to the above argument,
iM/ip-trCp'p^).
Since this quantity is relativistically invariant, it may be calculated in any frame of
reference. In the rest frame of the final electron we have, by (29.20),
p'p(/)^(l + a - n ( l + ^ - 0 Hence
|M / ( f~l+£.£»,
§65
Reactions involving Polarized Particles
255
and from a comparison with (65.9)
S^ß/a.
(65.10)
Thus the calculation of the cross-section as a function of the parameter Ç' also
gives the polarization Ç(/).
In more complex cases, when there is more than one initial or final electron, the
calculations are similar to the foregoing.
For instance, if there are two electrons both initially and finally, the form of the
scattering amplitude is
Mfii =(fiiAii,)(ÖJBii2)+ (fiJCii,)(öiDii2),
where uu u2 are the bispinor amplitudes of the initial electrons, and ni, u2 those of
the final electrons. The square \Mfi\2 includes terms of the forms
\ü[Au]\2\üf2Bu2\2
and
{ü\Aux)(üiBu2){ü[Cu\)*(ü\Du2)*.
The former reduce to products of two traces like (65.4); the latter reduce to traces
having the form
tr (p\APxCp'2Bp2D).
Positrons are described by amplitudes with "negative frequency" u(-p). For
reactions involving positrons, the only difference from the preceding analysis is
that the expressions to be used for the density matrices differ from (65.5), (65.6) as
regards the sign of m; cf. (29.16), (29.17).
Let us now consider the polarization states of photons participating in the
reaction.
The polarization of each initial photon appears linearly in the scattering amplitude in the form of a 4-vector e, and that of each final photon as e*. In each case
the 4-tensor e^e* occurs in the cross-section (i.e. in the square |M/j|2). To obtain an
arbitrary partially polarized state, this tensor must be replaced by the fourdimensional density matrix, the 4-tensor p^:
e^î-^p,*,,.
(65.11)
In particular, for an unpolarized photon, according to (8.15),
P»v = -ig*»*
(65.12)
Thus averaging over polarizations of the photon is equivalent to contracting in
|M/j|2 with respect to the corresponding two tensor indices /LL, i/.t
If summation over the photon polarizations is desired, not averaging, then we
t The expression (65.12) as it were reduces the averaging over the two actually possible polarizations of the photon to one over the four independent directions of the four-vector e.
256
The Scattering Matrix
§66
must replace e^e% by a quantity twice as large:
<Ve*-> -&u>-
(65.13)
The density matrix of the polarized photon is given by formula (8.17). The
choice of the 4-vectors e(1), e(2) which appear in this expression is usually governed
by the particular conditions of the problem. In some cases they may be related to
certain spatial directions in a given frame of reference; in other cases, it is more
convenient to relate them to the 4-vectors which characterize the problem, namely
the 4-momenta of the particles.
In (8.17) the polarization of the photon is described by the Stokes parameters,
which form the "vector" £ = (£i> 6> &)• As with the electron, it is necessary to
distinguish the polarization ^ of the final photon as such from the polarization f'
that is selected by the detector. If the square of the scattering amplitude is known
as a function of the parameter (•':
|M fl | 2 =a + ß.E\
then the polarization | ( / ) = ß/a, exactly as in (65.10).
§66. Kinematic invariants
Let us consider some kinematic relations for scattering processes in which
there are only two particles, both in the initial state and in the final state. The
relations in question are deduced from the general conservation laws alone, and are
therefore valid for all particles and all laws of interaction.
The law of conservation of 4-momentum, in a general form that does not specify
which are the initial and which the final particles, is
qi + q2 + (?3+q4 = 0.
(66.1)
Here ±qa are the momentum 4-vectors; two of them pertain to the incident
particles and two to the scattered particles, the momenta for the latter being -qQ.
Thus for two of the qa the time component q°>0, and for two q2<0.
The law of charge conservation must be satisfied as well as that of 4-momentum
conservation. Here the charge may be interpreted not only as the electric charge
but as any other conserved quantity whose sign is opposite for particles and
antiparticles.
For given types of particles concerned in the process, the squares of the
4-vectors qa are the squares of the particle masses, which are fixed (ql=ml).
Three different reactions occur, according to the values taken by the time components q°a and the values of the charges. These reactions may be written
(I) l + 2-*3 + 4,
(II) l + 3-*2 + 4,
(III) 1+4-+2 + 3J
i
(66.2)
257
Kinematic Invariants
Here the numbers refer to the particles, and the bar over a number denotes the
corresponding antiparticle. The change from one reaction to another, i.e. the
transfer of a particle to the opposite side of the formula, corresponds to a change in
sign of the corresponding time component q°a and in the sign of the charge (i.e. a
replacement of the particle by its antiparticle). The reactions inverse to (66.2) are
also possible, of course.
The three processes (66.2) are referred to as three cross-channels of a single
general reaction.
The following are some examples. If particles 1 and 3 are electrons, and 2 and 4
are photons, then channel I represents the scattering of a photon by an electron;
channel III is the same as channel I, since the photon is strictly neutral. Channel II
is the conversion of an electron-positron pair into two photons. If all four particles
are electrons, then channel I is the scattering of an electron by an electron, and
channels II and III the scattering of a positron by an electron. If particles 1 and 3
are electrons, and 2 and 4 are muons, then channel I is the scattering of e by /x,
channel III the scattering of e by ß9 and channel II the conversion of a pair el into
a pair fiß.
In the discussion of scattering processes, the invariant quantities which can be
constructed from the 4-momenta are particularly important. The invariant scattering amplitudes are functions of these quantities (§70).
Two independent invariants can be constructed from four 4-momenta, since,
according to (66.1), only three of the 4-vectors qa are independent. Let these be qu
q2, qi. From them, six invariants can be constructed: the three squares q}9 q\> q\ and
the three products qxq2y qxq^ q2q^ But the first three are the given squares of the
masses, and the second three satisfy one relation which follows from the equationt
(q\ + q2+qi)2 = q24 = ml
In order to increase the symmetry it is, however, convenient to consider not
two but three invariants, which may be taken as
s=(q\
+ qi)2 = (qi+q4)\
2
]
2
t = (qi + 43) = (<Ï2 + <Ï4) ,
2
(66.3)
2
H = ( q i + q4) = (q 2 +q3) .
These are easily seen to be related by
s + t + u = h,
(66.4)
h = m] + m\+ m] + mi
(66.5)
where
t In the general case of a reaction involving n (^4) particles, the number of functionally independent invariant quantities is 3n - 10. There are altogether An quantities, the components of the n
4-momenta qu, between which there are n functional relations ql=ml and four given by the conservation law lqa - 0. Arbitrary values can be assigned to six quantities, in accordance with the number
of parameters which define the general Lorentz transformation (a general four-dimensional rotation).
The number of independent invariants is therefore An - n - 4 - 6 = 3n - 10.
258
§67
The Scattering Matrix
In the principal channel (I), the invariant s has a simple physical significance. It
is the square of the total energy of the colliding particles (1 and 2) in their
centre-of-mass system (for pi + p2 = 0, s = (ei + e2)2). In channel II, the invariant t
has a similar significance, and in channel III the invariant u. The three channels are
therefore often called s, t and u channels.
It is easy to express each of the invariants s, t and u in terms of the energies
and momenta of the colliding particles in each channel. Let us consider the s
channel. In the centre-of-mass system of particles 1 and 2, the time and space
components of the 4-vectors qQ are
q\ = Pi = (eu Ps),
<Ï3 = - P 3 = ( - £ 3 , ~Pi)>
Qi = Pi = (e2, "P*)> ]
Q*
=
~ P * = ( - C 4 , Ps); ]
(66.6)
the suffix s in p s and p'5 indicates that these momenta refer to the reaction in the 5
channel. Then
S = E),
£S = E\ + 62 = ^3 + 64;
(66.7)
45ps = [s - ( m j + m2)2][5 - ( m r m2)2],
4sp',2 = [s - (m3 + m4)2][s - (m3 - m4)2];
2f = fi - s +4p 5 • pi — ( m 2 - m 2 )(m 2 - m 4 ),
2u = fi - s - 4p, • pi + - (m ? - m\)(m\-
(66.8)
(66.9)
ml)
For elastic scattering (mi = m3, m2 = m4), we have |ps| = |pi|, and hence ei = e3,
£2 = £4. Instead of (66.9), the simpler formulae
t = "(Ps - pi)2 = -2p 2 (l - cos e s ),
M = -2p 2 (l + cos 0S) + (e, - e2)2
(66.10)
are then obtained, where 0S is the angle between p s and p's. The invariant -t is here
the square of the (three-dimensional) momentum transfer in the collision.
Similar formulae for the other channels are found by a straightforward change of
notation. For the t channel we must interchange 5 and t, and 2 and 3, in
(66.6M66.10); for the u channel, we interchange s and w, and 2 and 4.
§67. Physical regions
When considering the scattering amplitudes as functions of the independent
variables 5, t, u (which are related only by s + t + u = h), we encounter the need to
distinguish regions in which their values are physically permissible from those in
which they are not. Values which can correspond to a physical process of
scattering must satisfy certain conditions which follow from the law of con-
259
Physical Regions
§67
servation of 4-momentum and the fact that the square of each of the 4-vectors qa is
a given quantity m2,.
The product of two 4-momenta
(67.1)
PaPb ** mamb.
Hence
(qa + Qb)2 = (pa + Pb?^* (ma + mb)\
if Qa = Pa, qb = Pb (or qa = - pa, qb = - p b ); or
(qa + qb)2 = (p<, - Pb)2 < (ma - mb)2,
if qa = pa, qj, = -pb. Hence, for a reaction in the s channel,
(mi + m2)2 ^ J ? (m3 + m4)2,
(rr^-ma)^* « ( m 2 - m 4 ) 2 ,
2
(m i - m4) 5=
M
(67.2)
2
^ (m2 - m3) ,
and similarly in the t and u channels.
To determine the remaining conditions, we form a 4-vector L which is dual to
the product of any three of the 4-vectors qa, say
LK = <V„pqïq2qÇ.
(67.3)
In the rest frame of particle 1, say, we have q\ = (q?, 0). Then L has only the spatial
components L; = emiq^q^ql Thus L is a space-like vector, and L 2 «£ 0 in every frame
of reference. Expanding L 2 , we obtain the condition
qt
qiqa
qiqj
qiq\
q\
q2q3
qsqi
qrti
q\
0.
(67.4)
This can be expressed in terms of the invariants s, t, u in a form which is the same
for all channels:
stu ** as + bt + CM,
(67.5)
where
ah={m\m\-
m]ml){m] + ml-m]-
ml),
bh = (m\m\- mlml)(m] +m]-ml-
ml),
ch = {m]ml- mlm])(m] + ml-m\-
m])
(T. W. B. Kibble, 1960).
(67.6)
260
The Scattering Matrix
§67
For a graphical representation of the regions of variation of s, t and w, it is
convenient to use triangular coordinates in a plane, called the Mandelstam plane
(S. Mandelstam, 1958). The coordinate axes are three straight lines which intersect
to form an equilateral triangle. The coordinates s, f, u are measured along
directions perpendicular to these three lines; the directions towards the interior of
the triangle are reckoned positive, as shown by the arrows in Fig. 5. Thus each
point in the plane has corresponding values of s9 t and u which are represented
(with the appropriate signs) by the lengths of the perpendiculars to the three axes. The
condition s +1 + u = h is satisfied on account of a known theorem of geometry, h
being equal to the altitude of the triangle.t
Let us consider the important case where the principal channel (s) corresponds
to elastic scattering. Then the masses of the particles are equal in pairs:
nij = W3 s rtt> 1TI2 = rn* = ix.
(67.7)
Let m > JA. The condition (67.5) has
h = 2(m2 + /i,2),
a = c = 0,
b = (m 2 - /x2)2,
so that
sut^(m 2 -V) 2 t.
(67.8)
The boundary of the region defined by this inequality comprises the straight line
t = 0 and the hyperbola
su = (m2 - /x2)2,
(67.9)
whose two branches lie in the sectors u <0, s < 0 and s >0, u >0; the axes s = 0
u=0
s=0
FlG. 5.
t For example, if the point P in Fig. 5 is joined to the three vertices A, B, C of the triangle, the
latter is divided into three triangles with altitudes s, f and u ; equating the sum of their areas to that of
the triangle ABC, we obtain the required relation. The proof is similar when P lies outside the triangle
ABC.
1
§67
Physical Regions
261
and u = 0 are the asymptotes of the hyperbola. Instead of (67.8) we can write
f>0,
su>(m2-^2)2
f<0,
su<(m 2 -/x 2 ) 2 .
or
Moreover, according to the conditions (67.2)-we must apply the inequality s>
(m + /x)2 in the s channel and u>(m + /x)2 in the u channel; the remaining
inequalities are then necessarily satisfied. We thus find that channels I, II, III (s, t,
u) correspond to the shaded regions in Fig. 6, which are called physical regions.
If /x = 0 (particles 2 and 4 are photons), the lower branch of the hyperbola
touches the axis t = 0, and the physical regions are as shown in Fig. 7.
If m = fi, the boundaries of the region (67.8) degenerate to the coordinate axes,
and the physical regions are the three sectors shown in Fig. 8.
In the general case of four different masses, the equation
(67.10)
stu = as + bt + cu
defines a third-order curve whose branches are the boundaries of the physical regions
u=0
u=(m+(.*)2
s=(m+fxp
s=(m-^i)
u = (m-(z)
FIG.
u=0
6.
s=0
t=o
FIG.
7.
262
§67
The Scattering Matrix
FIG.
8.
(a)
(b)
FIG.
9.
of the three channels, as shown in Fig. 9. Let mi 2* m2 2* m3 s» m4. Then a 2» b 2* c,
a > 0 , b > 0 . The curve (67.10) meets the coordinate axes at points on the line
as + bt + cu = 0 (see the broken lines in Fig. 9). This line is as shown in Fig. 9a and 9b,
depending on the sign of c. If c < 0, the physical region of the u channel includes part
of the area of the coordinate triangle. In this case, therefore, the quantities s, t and u
may all be positive at the same time. All three branches of the boundary curve have the
appropriate coordinate axes as asymptotes; this may be seen by eliminating one of the
variables from (67.10) by means of the relation s + t + u = h, and then making one of
the other variables tend to infinity. In general, the conditions (67.2) yield nothing in
addition to the limits defined by equation (67.10). The straight lines which correspond
to the equality signs in (67.2) do not intersect the physical regions shown by the shaded
areas in Fig. 9; some of them touch the boundaries of these regions, corresponding to
extreme values of the variable s, f or u in the corresponding channel.
When the mass of one of the particles exceeds the sum of the masses of the other
three (mi > m2 + m3 + m4), a fourth reaction channel is possible, corresponding to the
disintegration
(IV)l-*2 + 3 + 4.
(67.11)
§67
263
Physical Regions
For this channel, in the rest frame of the disintegrating particle,
q, = (m,,0),
qi = ( " 6 2 , ~P2>,
q4 = (~e4, -P4),
^3 = ( - £ 3 , -P3>,
^2 + €3 + e 4 = m i ,
P2 + P3 + P4 = 0.
The invariants are
s = m] +
m\-2m\E2,
t = m? + m3-2mi£3,
(67.12)
« = m î + m4-2mi£ 4 .
We then have from (67.1)
(m3 + m4)2 ^ s ^ ( m i - m2)2,
(m 2 +m 4 ) 2 ^f ^ ( m , - m 3 ) 2 ,
(67.13)
(m2 + m 3 ) 2 ^ M ^ ( m i - m4)2.
Thus all three invariants are positive, and the physical region of the disintegration
channel is within the coordinate triangle.
PROBLEMS
PROBLEM 1. Find the physical regions for the case of three equal masses: mi = m, mi = m 3 = m4 s M
(for example, the reaction K + TT -» IT + 7r).
SOLUTION. Equation (67.10) becomes
sfw = / L i 2 ( m 2 - p t 2 ) 2 ,
(1)
with
s + f + u = 3jLt2 + m 2
Regions I, II and III are bounded by curves of the same shape, with s > 0, t < 0, u < 0 f or region I, and so on.
If m > 3fi, equation ( 1 ) also has a branch in the form of a closed curve with s > 0, t > 0, u > 0, which bounds
the region of channel IV (Fig. 10).
The Scattering Matrix
264
§68
PROBLEM 2. The same as Problem 1, but for the case m i = m, m 2 = \i, m 3 = m4 = 0, m > /x (for example,
the reaction /x + y -» e + v).
SOLUTION. The condition (67.5) becomes
stu s? m2it25,
with 5 -I-1 + u = m2 + Lt2. The physical regions are bounded by the axis s = 0 ?nd the two branches of the
hyperbola tu = m2/x2 (Fig. 11).
PROBLEM 3. The same as Problem 1, but for the case mi = m3 s m, m2 = 0, rru = ti, with m > 2p, (for
example, the reaction p + 7 -> p + 7r°).
SOLUTION. The boundary equation (67.10) becomes
stu = a(s + u) + bt,
ah = m V ,
bh = m4(2m2 - ji 2 ),
h = 2m2 + ^L\
Elimination of u gives
,.+ (Lz« + J _ h
For a given value of s, this is a quadratic in t. If s > (m + p) 2 (the region of the s channel), there are two
negative values of t for each value of s. If s = (m + IL)\ these two roots of the quadratic coincide at
t = -m^2l(m +p,). The boundary of the s channel region is then as shown in Fig. 12. The lower branch of the
boundary tends asymptotically to the axis u = 0, and the upper branch crosses this axis at the point
f = M 4 /0* 2 -m 2 ).
The u channel region is symmetrical with the s channel region; the t channel region is situated as shown
in Fig. 12.
§68. Expansion in partial amplitudes
An important step in the analysis of a reaction of the form
a + b^c
+d
(68.1)
is the expansion of the scattering amplitude in partial amplitudes, each of which
1
§68
Expansion in Partial Amplitudes
FIG.
265
12.
corresponds (for a given total energy e) to a definite value of the total angular
momentum of the particles J in their centre-of-mass system.t
These partial amplitudes are, therefore, elements of the S-matrix in the angular
momentum representation:
(eJ'M'\S\eJM).
Since the angular momentum J and its component M along a specified z-axis are
conserved, the S-matrix is diagonal with respect to these numbers (and also with
respect to the energy e). Because of the isotropy of space, the diagonal elements are
independent of the value of M For given J, M and e, the scattering matrix is still a
matrix with respect to the spin quantum numbers; the elements of this matrix will be
written in a more concise form:
(eJM\'\S\eJM\)
^ <A'|S'(e)|A>,
(68.2)
where A and A' are the sets of spin quantum numbers. These can most naturally be
taken to be the helicities of the particles. The helicity, unlike the spin component along
an arbitrary axis in space, is conserved for a free particle, and it commutes with both
the momentum and the angular momentum of the particle (§16). The helicities may
therefore be used in both the momentum and the angular momentum representation of
the scattering matrix.
The elements of the S-matrix with respect to the helicity indices will be called the
helicity scattering amplitudes, and A, A' will be taken to include the helicities of the
initial and final particles respectively:
A = (Aa, At,),
A' = (Ac, Ad).
In the momentum representation, the scattering matrix elements are defined with
respect to the states |enA) (where n = p/|p| is the direction of the momentum of relative
motion in the centre-of-mass system); in the angular momentum representation, they
t Most of the results in §§68 and 69 are due to M. Jacob and G. C. Wick (1959).
The Scattering Matrix
266
§68
are defined with respect to the states |eJMA). They can be related by means of the
expansions
\JMk) = | |nA)<nA|JMA> dom,
(68.3)
where the integration is over the directions n; the energy e is, for brevity, omitted from
the state symbols. Since this transformation is unitary (see QM, § 12), the coefficients
of the inverse transformation are
<JAfA|nA> = <nA|JMA>*.
(68.4)
By the general rule of matrix transformation, the same coefficients give the relation
between the S-matrix elements in the two representations:
<n'A'|S|nA> = 2 (n'A'lJMA'XJMA'lSlJMAXJMAlnA).
IM
(68.5)
The coefficients in the expansion (68.3) are easily found by means of the results of
§16. Let the wave functions of all states be expressed in the momentum representation, i.e. as functions of the direction of the momentum (for a given energy); this
direction, as an independent variable, will be denoted by v to distinguish it from the
direction n as a quantum number of the state. In this representation, the wave function
has the form (16.2):
4>nM = uM8{2)(v-n).
(68.6)
When (68.6) is substituted in the expansion (68.3), the latter reduces to a single term:
^MA=<VA|JMA)U (A) .
(68.7)
The helicities Aa and Ab of the two particles are defined as the components of their
spins in the directions of their respective momenta. If the momenta are pa = p,
Pb = -p, then these directions are n for thefirstparticle and - n for the second particle.
If now the system is regarded as a single particle with helicity A in the direction n, then
A = Aa - \b. Its wave function (in the momentum representation) can be written,
according to (16.4), in the form
<Wv) = u ( A » v ) V ^ T '
(68 8)
'
Comparison of (68.7) and (68.8), with the variable v replaced by n, gives the required
coefficients:
<nA|/MA) = V ^ h T D ^ ( n ) -
(68,9)
§68
Expansion in Partial Amplitudes
267
Substituting these coefficients in (68.5), we have
<n'À'|S|nÀ> = S ^ T ^
A = Aa — A&,
D%(n')D(lC(n)(k'\SJ\k)9
(68.10)
A' = Ac — Ad,
with the abbreviated notation (68.2). If the direction n is taken as that of the z-axis,
then
D%(n) = 5AM,
and (68.10) becomes
<n'À'|S|nÀ> = 2 ^ T ^ DA'AOI'XÀ'IS'IÀ).
(68.11)
We see that the expansion in partial amplitudes has the functions D{\>\ as
coefficients. For a reaction of the form (68.1), it is convenient to define the scattering
amplitude / in such a way that the cross-section (in the centre-of-mass system) is
da = |<n'A'|/|nA>|2 do';
(68.12)
by comparison with (64.19), we can relate this amplitude to the matrix element Mfi. The
expansion of the amplitude in partial amplitudes may be written
<n'A'|/|nA> = 2 <2J + \)D%(nlD^(n)(kf\fJ\Kh
JM
(68.13)
or, taking the z-axis in the direction n,
<n'A'|/|nA> = £ <2J + l)D(A(n')<A'|/J|A).
(68.14)
This is a generalization of the usual expansion in partial amplitudes for the scattering
of spinless particles; see QMy (123.14). Since D&} = PL(COS 0), (68.14) reduces in the
case of zero spins to an expansion in Legendre polynomials
/W-2(2L+1)/LPL(COSO).
L
The cross-section (68.12) is valid when all the particles have definite helicities. If
they are in mixed polarization states, the cross-section is found by averaging the
product
< AcAd l/l AaAb><A ^A i|/| A iA t>*
over the polarization density matrices of the particles,
(Aalp^lA^Afclp^lAiXAap^AcXAilp^Xd);
268
The Scattering Matrix
§69
see the first footnote to §48. For example, in a reaction between unpolarized particles
a, b to form unpolarized particles c, d, we have
x(AcAd|/J|AûAb)<XfAd|/r|AûAb)*D(/)A(n')D^*(n');
(68.15)
the 2-axis is along n, and the first summation is over Aa, Ab, Ac, Ad. Using QM, (58.19) for
the function DA A* and then the expansion QM, (110.2), we have finally
d(r
= (2s
n< ++it°>«
+ n 2 (-DA-A(2J + l)(2J'+l)x
\){2s + 1) t D '
a
u
h
x (AcAd|/J|AûAb)<AcA(/|/J]A<1Ab>* x
X
?
( 2 L + 1 )
(A
-A
O)(A'
-A' O ) P L ( C O S Ö ) '
(6816)
where 6 is the angle between n' and the z-axis; the summation with respect to L is over
all integers which can occur when J and J' are added vectorially. The expansion of the
scattering amplitude in partial amplitudes gives a full expression of all properties of
the angular distribution of scattering that are due to the symmetry with respect to
spatial rotations. But it does not explicitly reveal the properties that are due to the
symmetry with respect to spatial inversion. The P invariance (if possessed by the
interaction) leads to certain relations between the various helicity amplitudes (see
§69).
§69. Symmetry of helicity scattering amplitudes
The conditions imposed by the symmetry with respect to the transformations P, T,
C (if, of course, the particle interaction process in question in fact possesses such
symmetry) lead to certain relations between the helicity scattering amplitudes, and
therefore reduce the number of independent amplitudes.t
To establish these relations, we shall first determine the symmetry properties of
the helicity states of a system of two particles.
Let us consider the particles in their centre-of-mass system. One particle has
momentum pi = p and helicity At with respect to the direction of p; the other has
momentum p 2 = - p and helicity A2 with respect to the direction of - p . If the helicity is
defined with respect to the same direction, that of p, its values are k\ and -A 2 , and the
particles will thus be described by plane waves with amplitudes MpA,) and Up~À2>- The
two-particle system is described by a (multi-component) function u{pK]Kl) formed from
the products of the amplitudes w(pA,) and w(p~X2).
Let us next regard the system as a single particle with helicity A = Aj - \2 in the
direction n = p/|p|; we can then write the wave function (in the momentum
t This number does not, of course, depend on the specific representation of the matrix S\ and is the same
for any choice of the spin variables.
I
§69
Symmetry of Helicity Scattering Amplitudes
269
representation, i.e. as a function of n) for a state with definite values of J, M, Ai, A2 (and
of the total energy e):
</OMA,A2 =
up»D%Un) V ^ T >
A = A, - A2;
(69.1)
cf. (68.8). Since A is the component of the total angular momentum in the direction of
p, we must have
|A|*J.
(69.2)
According to (16.14), under inversion
Ptt (x,A2) (n) = T?i-r?2M(XlX2)(-n)
= TÏ,TÏ2(-1),|+,2"A,+X2M(~X,"A2)(II),
(69.3)
where TJI and 172 are the internal parities of the particles. Using also (16.10), wefindthe
transformation law for the functions (69.1):
PI/OMA.A,
= Tî.T^-ir'^-Wx,,-^
(69.4)
If the two particles are identical, the question arises of the symmetry with respect
to their interchange. This interchange implies interchanging their momenta and their
spins. To show the significance of this operation as applied to the function (69.1), we
note that its definition contains an asymmetry, in that the angular momenta of the two
particles are projected on the direction of the same vector p, = p, the momentum of the
first particle. After the interchange, this vector is replaced by p2 = -p, and the
components of the angular momenta ji and j 2 along this vector are - Ai and A2 (instead
of Ai and - A2 along p). The result of applying the particle interchange operator Pt2 to
the function (69.1) may therefore be written
P,2^MX,A 2 =
u<-A>-A'>(-n)DttU-n) V H ^ T »
where again A = Ai - A2. Then, using (69.3) and (16.10), we find that
Pl2^MX,X2 = (-l) 2S- >/MA 2 A„
(69.5)
where s\ = s2 = s.
For identical particles, the permissible states must be either symmetric (for
bosons) or antisymmetric (for fermions) with respect to interchange. Since the former
case occurs when the particle spin s is integral and the latter case when it is
half-integral, in either case the permissible helicity states of the two-particle system
can be written as linear combinations
[l+(-l) 2 T 1 2 ]^MA,A 2 ,
270
The Scattering Matrix
§69
or, according to (69.5),
l/OMA^ + t-OViMA^,.
(69.6)
It is noteworthy that this combination is the same for both bosons and fermions.
For a particle-antiparticle system, the result of the interchange is expressed by the
same formula (69.5), but, unlike the case of identical particles, states of either
symmetry under interchange are here permissible, i.e. both combinations
tfr = ^MA1A2±(-D>;MA2A1
(69.7)
can occur. These states have certain charge parities C. The operation of charge
conjugation may be regarded as the result of a total interchange of all variables (spin
and charge) of the two particles, followed by reverse interchange of the spin variables
(helicities). The result of thefirstoperation must be the same as that of interchange in a
system of two identical particles. Hence it is clear that, with the upper sign in (69.7)
(which is the same as the sign in the state (69.6) permissible for identical particles), the
system will be charge-even, and with the lower sign charge-odd:
G / r = ± </r.
Finally, let us consider the operation of time reversal. The wave function of a
particle at rest with spin s and component thereof a is transformed according to
f ^ = (-Ds">s,-.;
see QM, (60.2). The wave function of two particles in their centre-of-mass system may
also be regarded (in respect ot its transformation properties) as that of a "particle" at
rest, with angular momentum J and component thereof M. The helicities Ai, A? are
unchanged: time reversal changes the sign of the momentum and angular momentum
vectors, and the products j • p are therefore unaffected. Hence
t^MA.A^C-l/^jMA.Ar
(69.8)
We can now write down immediately the symmetry relations for the helicity
amplitudes.
If the interaction is P -invariant, then for the reaction a + b-+c + d the amplitudes
of the transitions
|A«À>)->|ÀcÀd) and P|AflAb)->P|AcAd)
must be the same (for given J and e). Hence, using (69.4), we find
<AcAd|S'|AaAb> = M * (-l)'e+'<-*-*< -
Ac,
- A,|SJ| - A„, - Ab).
(69.9)
§69
Symmetry of Helicity Scattering Amplitudes
271
If states with definite parities, i.e. the combinations
where A|, A2 = A„,Ah or Ac, A<*, are chosen instead of those with definite helicities, then
the amplitudes of transitions in which parity is not conserved are zero.
Time reversal transforms each state in accordance with (69.8), and also interchanges initial and final states. Thus T invariance leads to the relations
<AcAd|SJ(e)|AaA>> = <ÀaÀ*|SJ(e)|ÀcÀ,>.
(69.10)
These two amplitudes, however, pertain to different processes, the direct and
reverse reactions. These two processes are essentially equivalent only in the case of
elastic scattering, and (69.10) is then a relation between helicity amplitudes for the
same reaction.
In elastic scattering of two identical particles, the number of different amplitudes is
further reduced because of the symmetry with respect to interchange. We have seen
that, for a given J, the states which occur are either all symmetric or all antisymmetric
in Ai, A2. The conservation of angular momentum therefore implies that of the
symmetry with respect to interchange of helicities.
A similar situation occurs in the elastic scattering of a particle by its antiparticle, or
the conversion of one particle-antiparticle pair into another, i.e. a reaction
a + ä -> b + b. For given /, there are both symmetric and antisymmetric states with
regard to Ai, A2, but they correspond to different values of the charge parity of the
system. Hence it follows that, if the interaction of the particles is C-invariant, so that
the charge parity is conserved, transitions between states of different symmetry with
regard to Ai, A2 are forbidden.t It must be emphasized, however, that there is a
difference from the case of identical particles, in which states of one symmetry are
entirely absent for any given J. In the "particle-antiparticle" case, only transitions
between states of different symmetry are forbidden; the states themselves exist for
every J.
Because of the universal CPT invariance, the existence of T invariance implies
that of CP invariance. The latter brings about the equality of amplitudes for two
reactions, one obtained from the other by replacing all particles by antiparticles (and
changing the sign of the helicities):
(AcAdlS'lAaAb) = (Ac-AjlS'lAâA*),
(69.11)
where Aa = - A„ and so on.t
The number of independent amplitudes is the same for all the cross-channels of
t A similar prohibition can also arise from isotopic invariance of the interaction of non-identical
particles. For instance, transitions between states of different symmetry with regard to A i, À2 are forbidden,
to the extent that this invariance holds, in the scattering of a neutron by a proton.
Î Since these two amplitudes relate to different reactions, interference between which is not possible,
the phase factor in (69.11) would have no significance, and can be taken as unity. Only the equality of
cross-sections which follows from (69.11) is actually meaningful.
272
§69
The Scattering Matrix
one generalized reaction, and therefore this number can be determined from any
channel. For example, the elastic scattering a + b -* a + b and the annihilation
a + ä-*b + b are described by the same number of independent amplitudes. The
restrictions imposed by T invariance in thefirstcase are equivalent to those imposed
by C invariance in the second case.
Let us also consider a reaction in which one particle disintegrates into two:
a-+b + c. In the centre-of-mass system (the rest frame for particle a), we have
Pb = -p c . Scalar multiplication of the equation j a = \b + j c by pb gives
Aa = A,-A c
(69.12)
(the helicity Afl of particle a is defined as the component of its spin in the direction of
the momentum of one of the secondary particles). This relation can be regarded as a
consequence of the additional symmetry present in the process considered, namely
the axial symmetry about the directions of pb and pc. If the spin sa of particle a is less
than sb + sc, the relation (69.12) reduces the number of possible sets of values of Aa, Ab,
Ac and therefore the number of independent helicity amplitudes of the disintegration.
The total angular momentum J is then equal to the spin sa of the primary particle, and
is consequently fixed.
The P invariance in the disintegration is expressed by the relation
<AbAc|SJ|Aa> = ^
Va
( _ i r a - ^ e ( _ kbf_ A c | S i|
_ Afl>f
(69.13)
where we have used (69.4) and also the transformation (16.16) for the wave function of
a single particle.
If the primary particle is strictly neutral, further limitations arise if C parity is
conserved. Three cases are to be distinguished here. If the disintegration products are
also strictly neutral, we must have Ca = CbCc ; this condition either prohibits the
disintegration altogether, or is satisfied and causes no further restriction. If the
particles b and c are different, then C invariance implies a relation between the
amplitudes of the different processes a-+b + c and a-*b + c. Finally, for the
disintegration a -► b + b, there is a restriction because, for a given charge parity C and
a given total angular momentum J = sfl, the system may be in states either symmetric
or antisymmetric with respect to fhe helicities, depending on the parity of the number J
and on the sign of C.
CP invariance implies the equality of amplitudes for the disintegrations a^>b + c
and ä-+b + c:
<A,Ac|SJ|Au) = (Ab-Ac-|S/|Aä),
(69.14)
where Aâ = -Afl and so on; i.e. it implies equal probabilities of disintegration for the
particle and the antiparticle. If the particle can disintegrate in more than one way
(through various channels), this equality applies to each channel. This conclusion, it
must be emphasized, is based on the existence of CP invariance, which is not a
universal property of Nature. Only CPT invariance is universal, and this by itself
§69
Symmetry of Helicity Scattering Amplitudes
273
would lead only to the equation
(kbkc\SJ\ka) = (kà\SJ\\B\ch
in which the right-hand side refers to the process inverse to disintegration. We shall
see later (§71) that the condition of CPT* invariance, together with unitarity
requirements, does lead to a relation, although a more restricted one, between the
disintegration probabilities for the particle and the antiparticle.
PROBLEMS
PROBLEM 1. Using (69.6), obtain a classification of the possible states of a two-photon system.
SOLUTION. In this case Ai, A2 = ± 1. For even J (>0), according to (69.6), three states symmetric in Ai,
A2 are allowed:
(a)
4fJMll,
( b ) l/OM.-l.-l,
(C) lfr/MI.-l + ^JM.-l.l.
For odd J (>1), one antisymmetric state is allowed:
(d)
I//JMI.-1 ~ </>JM,-l,I.
States (c) and (d) also have a definite parity (+1): according to (69.4),
P(*JMI.-I±*JM.-I.I)=
±(-1)J(*/MI.-I±^M.-I,I);
the factor ± ( - \)J = 1, since the upper sign refers to even values of J and the lower sign to odd values. States
(a) and (b) themselves have no definite parity, but even and odd states are obtained by taking the
combinations
(a') ^jMii + «fcjM.-i.-i,
(b') ^ M I I - ^ ; M , - I . - I .
When / = 0, only Ai = A2 is allowed by the condition |Ai - A2I ^ J, so that state (c) does not occur, leaving one
even and one odd state, (a') and (b'). Finally, if J = 1, state (d), which is the only possible state for odd J, is
forbidden because it has A = 2 > J. Thus we arrive at the table (9.5) for the permissible states.
PROBLEM 2. In the non-relativistic approximation, the total angular momentum / of the system is
found by adding the spin S and the orbital angular momentum L. For a system of two particles,findthe
relation between the states \JLSM) and IJMA1A2).
SOLUTION. According to the rule for constructing wave functions when adding angular momenta,
we have
*JLSM = l^s^s^criailSMs^LM^MLMslJMh
0)
where *fßsa are the eigenfunctions of the spin s with component cr along a fixed z-axis, ^ L M L those of the
orbital angular momentum L with component Mc> the expression in the braces corresponds to the
addition of s\ and 52 to give S, after which S is added to L to give J; the summation is over all m-type
indices. Let all functions be expressed in the momentum representation, as functions of the direction n
of the momentum p = pi, and let the functions i//J<r be expressed in terms of the functions i/rlA of the
helicity states by means of QM, (58.7):
^2"2 = Ç D-^lOt/V-Ar
For the function $LMU we have
iltLML = Y L M L ( I I )
274
§70
The Scattering Matrix
using QM (58.25) and the definition (16.5). Substituting these functions in equation (1), and twice using the
expansion QM, (110.1), together with the orthogonality of the Clebsch-Gordan coefficients (QM, (106.13)),
we obtain the expansion
1/OI.SM = 2J ^MA,A 2 <JMA,A 2 |JLSM> >
(2)
where
«frjMA.A, = 1/TnA^n. A,D%(ll) J ~ ^ ,
V 477
A = A, - A2,
and the coefficients are
</MA,A2|JLSM)=(-0L(-l),,",2fV[(2L+l)(2S+l)](^
_£
_*)(£
*
^).
(3)
Since the transformation (2) is unitary, we have
<JLSM|JMA,A:> = (JMA|A2|JLSM>*.
§ 70. Invariant amplitudes
In the helicity amplitudes, a particular frame of reference is used, namely the
centre-of-mass system. But, in order to calculate the scattering amplitudes by means
of invariant perturbation theory (and also to examine their general analytical
properties), it is convenient to write them in an explicitly invariant form.
If the particles concerned in the reaction have no spin, the scattering amplitude
depends only on the invariant products of the 4-momenta of the particles. For a
reaction of the form
a + b-*c + d,
(70.1)
these invariants may be taken as any two of the quantities s, f, u defined in §66. Then
the scattering amplitude reduces to a single function Mfi = f(s, t).
If the particles have spins, then, besides the kinematic invariants s, f, u, there are
also invariants which can be constructed from the wave amplitudes of the particles
(bispinors, 4-tensors, etc.). The scattering amplitudes must then have the form
Mfi = 2fn(s,t)Fn,
n
(70.2)
where the Fn are invariants which depend linearly on the wave amplitudes of all the
particles concerned (and also on their 4-momenta). The coefficients /„(s, 0 are called
invariant amplitudes.
By choosing the wave amplitudes in such a way as to correspond to particles with
definite helicities, we obtain definite values of the invariants Fn = F„(A„ A/). Then the
helicity scattering amplitudes are linear homogeneous combinations of the invariant
amplitudes /„. Hence we see that the number of independent functions /„(s, t) is equal
to the number of independent helicity amplitudes. Since the latter number is easily
determined, as shown in §69, this makes easier the construction of the invariants F„,
their number being known in advance.
Let us consider some examples, assuming in every case that the interaction is
T-invariant and P-invariant. The latter property implies that the invariants F„ must be
true scalars, not pseudoscalars.
§70
Invariant Amplitudes
275
To find the number of invariants (that is, the number of independent helicity
amplitudes), we note that the total number of elements of the matrix SJ (i.e. of
different sets Ab A2, Ai, À 2) is in this case four: Ai = Ai = 0, A2, À 2 = ±1 When the P
invariance is taken into account, the number of independent elements is reduced to
two, and this is unchanged by the inclusion of T invariance.
The two independent invariants may be taken as
F, = M'M,
F2 = ü'(yK)u,
(70.3)
where u = w(p), u' = u(p') are the bispinor amplitudes of the initial and final fermions;
K = k + k\ where k and k' are the 4-momenta of the initial and final bosons.t
The T invariance of the quantities (70.3) is evident if we note that under time
reversal the products ü'u and ü'y^u are transformed according to the same rule (28.6)
as the operators i//t£ and i/ry^, whose matrix elements they are: ü'u is invariant, and
the 4-vector ü'yu is transformed according to
ü'y°u -» ü'y°uy
ü'yu -> - ü'yu.
The 4-momenta are transformed similarly: (K°, K)->(K°, -K), and the scalar
product F2 = K^(ü'y^u) is therefore invariant.
ELASTIC SCATTERING OF TWO IDENTICAL PARTICLES WITH SPIN 2
To find the number of independent helicity amplitudes, it is convenient to start
from linear combinations of the helicity states:
<//,g = (/>++ + i/r__,
4/3g = <//+_ + <//-+,
fag
= i//++ - if/—.
\\ßu = 1//+- - (//_+,
where the suffixes ± denote the values ±2 of the helicities of the two particles. The
states lg, 2g, 3g are even, and u is odd, with respect to interchange of the particles.
The transitions g<->n are forbidden, so that there remain 1 6 - 6 = 10 matrix elements, when the interchange symmetry is taken into account. The functions ifß\g
and t//3g, and ij/2g have opposite parities with respect to the inversion P ; the
prohibition of transitions between them reduces the number of independent amplitudes to six. Lastly, the T invariance equalizes the amplitudes of the transitions
lg-*3g and 3g->lg, leaving only five independent amplitudes. The five independent invariants may be taken as
F, = {ü\ux){ü'2u2)>
F2 - {ü\y5ux)(ü'2y5u2)>
F 3 = (MI7 M U,)(Ü2YMH2),
F4 =
(ü\y^y5ul)(Ü2y^y5u2),
(70.4)
F 5 = (Öicr^ü1)(Ö20r^M2),
t Atfirstsight, there might appear to be another invariant of the form ü'apJtWu (with the matrices ov„
defined by (28.2)), but this is easily seen to reduce to the invariants (70.3) by means of the conservation law
k' = p + k - p' and the equations (yp)u = m«, ö'(yp') = mû' satisfied by the bispinor amplitudes.
§70
The Scattering Matrix
276
where ux, u2 are the bispinor amplitudes of the initial particles and w|, u'2 those of
the final particles. Interchange of the initial (or of the final) particles gives no new
invariants: the invariants obtained can be expressed in terms of the previous ones
(§28, Problem). But the expression (70.2), with the F„ given by (70.4), does not
explicitly take account of the requirement that interchange of two identical fermions must change the sign of the scattering amplitude. An expression which
satisfies this condition may be written
M/l- = [(fi;M,)(öiu2)/i(r,u)"(ö5w,)(öiM2)/i(M,0]+ • * ••
(70.5)
When pi and pi (or p\ and p2) are interchanged, the kinematic invariants s -> s, t -> u,
u -* f, so that the condition is necessarily satisfied.
ELASTIC SCATTERING OF A PHOTON BY PARTICLES
WITH SPIN 0 OR [
The amplitude of this process is conveniently expressed by means of the
space-like unit 4-vectors e(l), e{2) which satisfy the conditions
e
(D2=e(2)2=
<W 2 ,
M
ewk = emk = 0,
= 0,
)
e{{)k' = e{2)kf = 0; \
(70.6)
for each of the two photons, these 4-vectors can be the unit 4-vectors by means of
which an invariant description of their polarization properties is obtained (§8).
Let k and k' be the initial and final 4-momenta of the photon; p and p' those of
the scattering particle. The 4-vectors
P = P +P
-KX
=rf
,
N A = eA^PMq,Kp,
J
(70.7)
where
K = k + k\
q = p - p ' = k'-k,
are evidently orthogonal to one another and also to the 4-vectors K and q, and
therefore to k and k'. Being orthogonal to the time-like 4-vector K (K2 = 2kk' >0),
they must themselves be space-like: in a frame of reference for which K=*0, it
follows from K P = 0 that P 0 = 0 and hence P 2 < 0 . Normalizing P and N by
putting
NA
e^^rjf^,
2
V(-N y
(2U=
P
ee^^-jf-^.
2
V(-p y
(70.8)
we obtain a pair of 4-vectors which have all the required properties. It may be
noted that e(2) is a true vector and e (l) a pseudovector.
§70
277
Invariant Amplitudes
The photon scattering amplitude may be written
Mfi = F A ^l*e„
(70.9)
in terms of the polarization 4-vectors e and e' of the initial and final photons.
The photon helicity has only two values, ±1. Hence, for the scattering of a
photon by a particle with spin zero, the number of independent helicity amplitudes
is the same as for the mutual scattering of particles with spin 0 and \, namely two.
The tensor FKlk in (70.9) has to be constructed from the particle 4-momenta only. It
can be written
F A * = / 1 6 ( I ) A 6 ( , ) ' 4 +/ 2 € ( 2 ) A 6 ( 2 ) M ,
(70.10)
where fx and f2 are invariant amplitudes. It should be noted that no term containing
a product eimei2)lA can appear in FA*\ since this product is a pseudotensor and
would give a pseudoscalar on substitution in (70.9).
Lastly, let us consider the scattering of a photon by a particle with spin i To
find the number of independent helicity amplitudes, we note that the total number
of elements of the matrix SJ in this case is sixteen; the helicity of each of two
initial and two final particles has two values. The condition of P invariance reduces
this number to eight, and that of T invariance brings it down to six.
Here, we write the tensor FXtx in the form
FA, = GteVe™ + e?e <2>) + GMl)e?} + e[2)e^) +
-+ G2(e!'>e<2>- e i ^ i V G j f c W - *?*?),
(70.11)
where G0 and G3 are true scalars, G\ and Gi are pseudoscalars, and all four are
bilinear in the bispinor fermion amplitudes ü(p') and u(p), being of the form
G„ = ü(p')Q„M(p).
(70.12)
The general form of the matrices (with respect to the bispinor indices) Q„ is
Qo = /. + MyK),
Q, = 75(/3 + MyK)),\
Q2 = Y U + U(yK)),
Q3 = h + HyK\
\
(70.13)
where K = k + k'. The coefficients / 1 , . . . ,/g are invariant amplitudes, in this case
eight in number (instead of the correct value of six), because the condition of T
invariance has not yet been imposed.
Time reversal interchanges the initial and final 4-momenta of the particles, and
also changes the sign of their space components:
(ko, k)~(k0, - k'),
(Po, p)~(pô, - p').
(70.14)
The photon polarization 4-vectors are transformed according to
(e 0 ,e)~(e 0 *,-e'*)
(70.15)
278
The Scattering Matrix
§71
(cf. (8.11a)); hence
(eo*e0, eTe^ ei*ek)->(eo*eo, - *o*«i, ^ex).
By virtue of the last transformation, the condition of invariance of the scattering
amplitude (70.9) is equivalent to
(Foo, F,o, Fik) -> (Foo, -F 0 „ Fki).
On the other hand, the changes (70.14) imply
(Ko, K) -* (Ko, - K),
(q0, q) -> (-q 0 , q),
(P0, P) -> (Po, - P),
(No, N) -> (N0, - N),
so that
(e^2\^2))^(e^\-^2)).
(70.16)
Hence, from (70.11), we must have
Go, if3 -» Go, i f 3,
G2-> - G 2 .
Under time reversal,
ü'y5u -> - w'y5w,
ö'y5(yK)w -> ö'y5(Y*Oii,
as is evident from the transformation laws for pseudoscalar and pseudovector
bilinear forms (28.6). From (70.12), (70.13) it is now evident that, because of the T
invariance of the scattering amplitude,
/ 3 = /6 = 0.
(70.17)
§71. The unitarity condition
The scattering matrix must be unitary: SS* = 1, or in terms of matrix elements,
n
where the suffix n labels the possible intermediate states.t This is the most general
property of the S-matrix, which ensures that the orthonormality of the states is
preserved in the reaction; cf. QM, §§125 and 144. In particular, the diagonal
elements of equation (71.1) simply express the fact that the sum of the transition
t The actual meaning of 5/j in (71.1) depends, of course, on the specific choice of quantum numbers
and on the normalization of the wave functions of the system. It must be defined so that 2 / 8,/ = 1.
§71
The Unitarity Condition
279
probabilities from a given initial state to all final states is unity:
S|Sn,|2=l.
n
Substituting in (71.1) the matrix elements in the form (64.2), we obtain
Tfi - VIf =
I(2TT) 4
2 8{4\Pf - Pn)TfnT*n
n
= i(2ir) 4 2 S(4)(P/ " Pn)T*fTn,
n
(71.2)
The two equivalent forms on the right are obtained by writing the unitarity
condition respectively as SS+ = l and S*S = I, with opposite orders of the factors S
and S\
It should be noticed that the left-hand side of the equation is linear in the matrix
elements of T, but the right-hand side is quadratic. If the interaction contains a
small parameter (e.g. the electromagnetic interaction), the left-hand side is therefore of the first order of smallness and the right-hand side is of the second order.
The latter may consequently be neglected in a first approximation; then
(71.3)
Tfi = T*h
i.e. the matrix T is Hermitian.
In order to make the unitary condition (71.2) more specific, we must understand
precisely what is meant by the summation over n. Let us do this for a two-particle
collision, assuming that the conservation laws allow only elastic scattering. Then all
the intermediate states in (71.2) are likewise "two-particle" states. Summation over
these signifies integration over the intermediate momenta p", p2, and summation
over the spin quantum numbers (for example, the helicities) of the two particles,
which we denote by A":
V2d3p'jd3p'iY
n
J
(2TT)6
.£■
Eliminating the delta functions in the same way as in §64, we obtain the "twoparticle" unitarity condition in the form
Tfi
" ^ = ( 2 ^ 7 ? ^ / T/"Pr"e'e'2'do">
where p is the momentum and e the total energy in the centre-of-mass system. The
normalization volume does not appear after changing from the amplitudes T/t to M/,
in accordance with (64.10):
Mfi
~ Wlf = jÙf ? ^ / Mf^"do"-
<71-4)
280
The Scattering Matrix
§71
Let the elastic scattering amplitude be defined so that
da = |<n'A'|/|nA)|2 do',
(71.5)
where n and n' are the directions of the initial and final momenta, A and A' the
initial and final spin quantum numbers. Comparison with (64.19) shows that
<n'A'|/|nA) = ^ - M / j ,
07T£
(71.6)
and the unitarity condition (72.4) becomes
<o'Al/|nA>-(nA|/|nfAf>* = ^ 2
f <nV|/|n'V')<nA|/|n'Vr do\
(71.7)
which generalizes the familiar formula of the non-relativistic theory, QM (125.8).
The "amplitude of zero-angle elastic scattering" is the diagonal matrix element
Tu, in which the final states of the particles are the same as their initial states.t For
this amplitude the unitarity condition (71.2) becomes
2 im Tu =
(2TT)4
2 |Tifl|2ô(4)(Pi - Pn).
n
(71.8)
The right-hand side of this equation differs only by a factor from the total
cross-section for all possible processes of scattering from the given initial state i.
For let this cross-section be denoted by crf; then summation of the probability (64.5)
over states / and division by the flux density j gives
CTf=^^2l^n|28(4)(P-Pn),
J
n
whence
(2V/j)imT, = crf.
The normalization volume is eliminated by putting T« = MJ(2ei V • 2e2V) (where ex
and e2 are the energies of the particles in the centre-of-mass system) and substituting j from (64.17):
imM« = 2|p|ea,.
(71.9)
This formula expresses the optical theorem. If the elastic scattering amplitude
(71.6) is used, the theorem takes the customary form
im<nA|/|nA) = |p|crr/4iT;
(71.10)
cf. QM, (142.10).
t It must be stressed that the matrix elements of T are concerned, not those of S; that is, the
diagonal element is taken after subtracting the unit matrix from S.
§71
The Unitarity Condition
281
If the S-matrix is given in the angular-momentum representation (partial amplitudes), it is diagonal with respect to J, and the unitarity condition can therefore
be written separately for each value of J.
For example, if only elastic scattering is possible, the unitarity condition is
2<A'|S J |À"><À|S J |À'")* = ÔAA'.
A"
(7L11)
Because of the T invariance, the elastic scattering matrix is symmetric (cf. (69.10)),
and hence can be reduced to diagonal form. The unitarity condition then requires
that the diagonal elements should be of unit modulus, and they are customarily
written in the form
Sj=exp(2iô jB ),
(71.12)
where the 8Jn are real constants, depending on the energy (the suffix n labelling the
diagonal elements for a given J). In the general case, when the number N of
independent amplitudes exceeds the order of the (square) matrix S\ the coefficients
of the transformation which diagonalizes S3 depend on J and E (these coefficients
then comprise not only the principal values of the matrix but also independent
quantities equivalent to the original N quantities). If, however, the number N is
equal to the order of the matrix SJ (and therefore to the number of its principal
values), the diagonalization coefficients are universal constants, and the diagonalizing states have definite parities (but not, of course, definite helicities).
The condition (71.11), expressed in terms of the partial amplitudes (A'|/J|A), is
<Al/ J |A)^(A|/ / |Ar = 2i|p|2<Al/i|A''><A|/;|AT,
A"
(71.13)
as is easily seen by substituting the expansion (68.13) in (71.7) and using the
orthonormality of the D functions. If there is T invariance, the matrix (A'|/J|A) is
symmetric, and (71.13) becomes
im<A'|/J|A) = |p|<Al/J/JlA>.
(71.14)
If the matrix is diagonalized, the diagonal elements are
/
"= ^(*2"'n~1)
=
^ ^
(7U5)
Finally, we may mention some consequences which follow from the unitary
condition together with the requirement of CPT invariance. The latter shows that
T/i = TV,
(71.16)
where f and / are states which differ from i and / in that all the particles are
replaced by antiparticles (and helicities are reversed, and also angular momentum
282
The Scattering Matrix
§71
components if spherical waves are used). In particular, for the diagonal elements,
It therefore follows from (71.8) and (71.9) that the total cross-section for all
possible processes (with a given initial state) is the same for reactions of particles
and of antiparticles.
In particular, the total disintegration probabilities (i.e. the lifetimes) of the
particle and the antiparticle are equal. These results, together with the equality of
particle and anti-particle masses (§11), are most important consequences of the
CPT invariance of the interactions. A similar statement for each possible disintegration channel separately would require CP invariance also (see the end of §69).
PROBLEM
From the unitarity condition, find the relation between the phases of the partial amplitudes for
photoproduction of pions from nucléons (y + N-*ir + N) and elastic scattering of pions by nucléons
(7T + N - > 7 T + N ) , using the fact that TTN scattering depends on strong interactions but photoproduction
and yN scattering depend on an electromagnetic interaction.
SOLUTION. Let the partial amplitudes be denoted by
(irN\S\yN) = S„y,
(yN\S\yN)
= SYT,
(TTN\S\TTN) = Sm
the suffix J and the helicity suffixes being omitted. Photoproduction is a first-order process with respect
to the charge e, and yN scattering a second-order process; hence SW7 — e, Syy - 1 ~ e2. The amplitude S™
is not small. The conditions (71.1) give, as far as terms in e,
S„yS*y + S„„S*n - S„nS*„ = 1 ;
(2)
on the right-hand side of (2), 1 denotes a unit matrix in the spin variables. Because of T invariance the
matrix S™ is symmetric, and S71T = Si,7. Let us take the matrix S™ in diagonal form, i.e. with respect to
pion states having definite parities; then it follows from (2) that the diagonal elements have the form e2t*w
with various constants 5*. Then (1) gives for each element of the matrix S*y
SJS*y=
-em\
whence
S W7 = ±|S,ry|lV6V
Thus the phase of the partial amplitude for photoproduction (in a state having a definite parity) is
determined by the phase of elastic TTN scattering.
CHAPTER VIII
INVARIANT PERTURBATION THEORY
§72. The chronological product
T H E probabilities of various processes in collisions between particles whose
interaction may be regarded as small are calculated by means of perturbation
theory. In its ordinary form (in non-relativistic quantum mechanics), however, the
formalism of this theory has the defect of not exhibiting explicitly the conditions of
relativistic invariance. Although, when this formalism is applied to relativistic
problems, the final result will satisfy these conditions, the calculations are considerably complicated by the non-invariant form of the intermediate expressions.
The present chapter will deal with the development of a consistent relativistic
perturbation theory free from this defect, first established by R. P. Feynman
(1948-1949).
With a view to a second-quantization description of the system, let 4> denote its
wave function in the "space" of occupation numbers for the various states of free
particles. The Hamiltonian of the system is H = H 0 + V, where V is the interaction
operator. Let <!>„ be the eigenfunctions of the unperturbed Hamiltonian, each
corresponding to certain definite values of all the occupation numbers. Any
function $ can be expanded as O = 2 C„4>n. Then the exact wave equation
id<tydt = (H 0 + V)(D
(72.1)
becomes a set of equations for the coefficients Cn :
i t ^ V ^ e ' ^ ' C .
m
(72.2)
where Vnm are the time-independent matrix elements of the operator V, and En the
energy levels of the unperturbed system (cf. QM9 §40).
By definition, the operator V does not depend explicitly on the time. The
quantities
V nm (f)= Vnmei{E»-E»\
(72.3)
on the other hand, may be regarded as matrix elements of the time-dependent
operator
ViO^e^o'Ve-1"«'.
(72.4)
This is said to be an operator in the interaction representation, as opposed to the
283
284
Invariant Perturbation Theory
§72
original time-independent Schrödinger operator V.t Now denoting the wave function in this new representation by the same letter 3>, we can write equations (72.2)
symbolically as
/<!> = V(r)<D.
(72.5)
The change in the wave function in this representation is due entirely to the action
of the perturbation, i.e. it corresponds to processes which result from the interaction of the particles.
If 4>(f ) and <!>(£ + 8t) are the values of <ï> at two successive instants, (72.5) shows
that
<&(f+ Sf) = [ l - i 5 t • V(t)]$(t)
Accordingly the value of <ï> at any instant tf can be expressed in terms of its value
at some initial instant t, (<tf) by
*a/)=(l|l«""'^,-))*ai),
(72.6)
where the product Y\ is t n e limit of the product over all the infinitesimal intervals
8ta between U and tf. If V(t) were an ordinary function, this limit would reduce
simply to
'/
exp(-i j V(t)df),
but this result depends on the commutativity of the factors pertaining to different
instants, which is assumed in changing from the product in (72.6) to the summation
in the exponent. For the operator V(t) there is no such commutativity, and the
reduction to an ordinary integral is not possible.
We can write (72.6) in the symbolic form
</
<D(t,) = Texp{-i | V(t)dt}<t>(til
(72J)
<i
where T is the chronological operator, implying a certain "chronological" sequence
t It must be emphasized that the definition (72.4) makes use of the unperturbed Hamiltonian Ho- In
this it differs from the Heisenberg representation of operators, where
VH(t) = e^Ve "'*;
see QM, §13.
The Chronological Product
§72
285
of time instants in the successive factors of the product (72.6). In particular, putting
U -* -3c, tf -> +°°, we have
4>(+oo) = Sd>(-oo),
(72.8)
S = T e x p j - i | V(t)dt}.
(72.9)
where
The significance of writing the formally exact solution of the wave equation in
the form (72.7)-(72.9) is that it easily leads to the series in powers of the
perturbation
S = %{JJT
\ dU f dt2... f dtk • T{V(t,)V(t 2 )... V(tk)}.
(72.10)
Here, in each term, the fcth power of the integral is written as a fc-fold integral, and
the symbol T signifies that in each range of values of the variables ti, t2,..., t* the
corresponding operators must be put in chronological order, with the value of t
increasing from right to left.t
It is evident from the definition (72.8) that, if the system was in a state 4>; (an
assembly of free particles) before the collision, the probability amplitude for a
transition to a state <î>/ (another assembly of free particles) is the matrix element Sp.
Thus these form the S-matrix.
The electromagnetic interaction operator has already been given in §43:
-ej(jÂ)
d'x.
(72.11)
Substitution of this in (72.9) gives
S = Texp[-ie[(/A)d 4 x}.
(72.12)
It is important to note that the operator (72.12) is relativistically invariant. This is
seen from the facts that the integrand is a scalar, the integration over dAx is
invariant, and the time-ordering operation is invariant. The last point, however,
needs further explanation.
The order of two time instants tx and f2, i.e. the sign of t2-tu is independent of
the frame of reference chosen if these instants relate to world points X\ and x2
separated by a time-like interval: (x2-X\)2>0. In such a case the invariance of
t The derivation of the rules of relativistic perturbation theory by means of the expansion (72.10) is
due to F. J. Dyson (1949).
286
Invariant Perturbation Theory
§73
time-ordering necessarily follows. But if ( x 2 - X i ) 2 < 0 (a space-like interval), we
may have both t2> t\ and t2< t\ in different frames of referenced Now two such
points correspond to events between which there can be no causal connection. It is
therefore evident that the operators of two physical quantities relating to such
points must commute, since the non-commutativity of operators signifies, physically, that the corresponding quantities cannot be measured simultaneously, and this
presupposes a physical connection between the two measurements. Thus tpe
time-ordering of the product remains invariant in this case also: though a Lorentz
transformation may reverse the sequence of time instants, the factors commute and
can therefore be restored to their chronological order.t
It is easy to see that the definition of the S-matrix given in this section
necessarily satisfies the unitarity condition. Writing S as the chronological product
in (72.6) and using the fact that V is Hermitian, we find that S* is given by the
product of similar factors, exp[îôta • V(ta)] (with the opposite sign of the exponent),
in the reverse of the chronological order. Thus all the factors cancel in pairs when
S is multiplied by S \
It should be noted that the unitarity of the operator S is ensured in this case
because the Hamiltonian is Hermitian. The unitarity condition is actually more
general than the assumptions on which the theory given here is based. It must be
satisfied even in a quantum-mechanical description which makes no use of the
concepts of the Hamiltonian and the wave functions.
§73. Feynman diagrams for electron scattering
We shall show by means of specific examples how the scattering matrix
elements are calculated. These examples will facilitate the subsequent formulation
of the general rules of invariant perturbation theory.
The current operator / contains the product of two electron (//-operators. Hence
processes might occur in the first order of perturbation theory which involve (in the
initial and final states) only three particles: two electrons (the operator j) and one
photon (the operator A). It is easily seen, however, that such processes cannot
occur between free particles, being forbidden by the laws of conservation of
energy and momentum. If p\ and p 2 are the 4-momenta of the electrons, and k that
of the photon, the conservation of 4-momentum would be represented by k =
p 2 - P\ or k = pi + p\. But such equations are impossible, since for a photon k2 = 0,
whereas the square ( p 2 ± p 0 2 is certainly not zero: if we calculate this invariant in
t Instead of using the terms "time-like" and "space-like", we often refer briefly to regions
respectively inside and outside the light cone: all points x separated from a point x' by an interval such
that (x - xf > 0 lie within a double cone having its vertex at x'; points for which (x - x')2 < 0 lie outside
this cone.
X This statement needs refinement to avoid misunderstanding in its application to the product
V(t\)V(ti). • . . Since the operator V itself is not gauge-invariant (it varies with Â), the factors V(fi),
V(f2),..., though commuting in one gauge of the potential, may be non-commutative in some other
gauge. The statements made above must therefore be formulated as asserting the possibility of choosing
a gauge for the potential in which V(fi) and V(f2) commute outside the light cone. This reservation
clearly has no effect on the invariance of the S-matrix: the scattering amplitudes, which are actual
physical quantities, cannot depend on the gauge of the potential, a result which formally follows from
the gauge invariance of the action integral (§43).
§73
Feynman Diagrams for Electron Scattering
287
the rest frame of one of the electrons, we have
(p 2 ±Pi) 2 = 2(m 2 ±p,p 2 )
= 2(m2±£i£2^Pi
P2)
= 2m(m ± £2),
and, since e2 > w, it follows that
(p2 + Pi) 2 >0,
(p2-p02<0.
(73.1)
Thus the first non-vanishing (non-diagonal) elements of the S-matrix can appear
only in the second order of perturbation theory. All the relevant processes are
comprised in the second-order operator obtained by expanding the expression
(72.12):
S(2) = - ^ I J d4x d V •
T(Hx)Aß(x)iv(x')Av(xt)).
Since the electron and photon operators commute, the T product can be resolved
into two:
S (2) = - | j J j d4xd4xf
T(r(x)r(xf))T(Â,(x)ÂAxf)).
(73.2)
As a first example, let us consider elastic scattering of two electrons. In the
initial state there are two electrons with 4-momenta px and p 2 , in the final state two
electrons with other 4-momenta p 3 and p 4 . It is also assumed that all the electrons
are in definite spin states; the spin variable indices will be everywhere omitted, for
brevity.
Since there are no photons in either state, the required matrix element of the T
product of the photon operators is the diagonal element ( 0 | . . . |0), where |0) denotes
the photon vacuum state. This value of the T product averaged over the vacuum is
(for each pair of indices /x, v) \ definite function of the coordinates of the two
points x and x'. Since 4-space is homogeneous, the coordinates can appear only as
the difference x - x\ The tensor
DßU(x - x') - /<0|TAM(x)Ay(x,)|0)
(73.3)
is called the photon propagation function or photon propagator It will be calculated in §76.
For the T product of the electron operators, we have to calculate the matrix
element
<34|Tj'(x)r(x')|12>,
(73.4)
where the symbols |12), |34) denote states in which pairs of electrons have the
§73
Invariant Perturbation Theory
288
corresponding momenta. This element also can be represented as a vacuum
expectation value, by using the obvious relation
<2|F|l> = <0|a2Faf|0>,
where F is any operator, df the creation operator for the first electron and d2 the
annihilation operator for the second electron. Hence, instead of (73.4), we can
calculate the quantity
(73.5)
(0\a3a4TU»(x)r(x'))aUï\Oh
the indices 1,2,... being abbreviations for pu p 2 , . . . .
Each of the two current operators is a product, / = i//yi//, and each of the
i/f-operators is a sum:
i = 2 (M'P + *;<M.
$ = 2 (â;iP + bpiÂ-p);
p
(73.6)
p
the second term in each expression contains the positron operators, which in the
present case "do not act". Hence the product ]^(x)jv{xf) is a sum of terms, each
containing the product of two operators dp and two dp. These operators must
annihilate electrons 1 and 2, and create electrons 3 and 4. They must therefore be
the operators au d2, d^, at, which are said to contract with the "external"
operators df, d2, ai, d4 in (73.5) and cancel according to the equations
<0|apa;|0>= 1.
(73.7)
Four terms result, according to the «//-operators from which au d2, d3, dj in
(73.5) are taken:
i
2
i
1
Z
'
2
'
(73.5) = a 3 a 4 (07^X^7> f )aîar + a,a4(^y^)(^yv^)aUt
i
i
i
i
>
i
i
« — -
+ a&AfyWWyWaîaî
l
1
«
i
'—'
'
■
«
■
»
+
l
' — Z
+ a&AfyWWyWaUi,
I
I
«
.
(73.8)
where i// = ^(x), ty1 = iK*')> and the brackets join operators which contract, i.e.
those from which a pair of operators d, d* is taken for the cancellation according to
(73.7). In each term we can bring the conjugate operators together in pairs (â\âU
etc.) by successive interchanges of au â2,..., and the mean value of their product
is then equal to the product of the mean values (73.7). Since all these operators
anticommute (1, 2, 3, 4 being different states),! we find that the matrix element
(73.4) is
<34|Tj* (x)j*(*0|12> = (^y^iWiyW)
+ (f^hWiJ^i)
-
- (fc7 M ^2)(*47>î) ^ (*47 M *iX*37>2).
(73.9)
t Because of this anticommutativity, the operators f(x) and j(x') may here be considered to
commute (in the calculation of the matrix element), and the T product symbol may therefore be omitted.
§73
Feynman Diagrams for Electron Scattering
289
The sign of the entire sum is arbitrary, and depends on the order of the "external"
electron operators in (73.5). This is in accordance with the fact that the sign of the
matrix element for scattering of identical fermions is itself arbitrary. The relative
sign of the various terms in (73.9), of course, does not depend on the order of the
external operators.
The two terms in each line of (73.9) differ only by a simultaneous interchange of
the indices fx, v and the arguments x, x'. This interchange clearly does not affect
the matrix element (73.3), in which the order of factors is still established by the
symbol T. Hence when (73.3) is multiplied by (73.9) and integrated over d4xd4x',
the four terms in (73.9) give two pairs of equal results, and the matrix element is
therefore
Sfl = ie2jJ
d4x d4x' D,v(x - x'){($4y^2)My"*l,\) - (^7^.)(^W>i)};
(73.10)
the factor 2 has now disappeared.
The electron wave functions are the plane waves (64.8). The expression in the
braces is therefore
{...} = (047^2X037"«,) e-«"2-P4>*-«P.-Pj>*' _
- (Ü47,Au1)(Ü37,,W2) e - ^ - ^ - ^ - p , ) * '
= {(047^2X037^,) e - ^ - p ^ - p . » « / * - (Ü^Um^Ui)
e-«0»|-P4)+(l»3-P2)]tf2} e-«P.+P2-P|-P4>Xf
where X = 2U+ *'), Ç = x-x'. The integration over d*xd*x' is replaced by one
over d4Ç d4X. The integral over d*X gives a delta function, so that pi + p2 = Ps + PAThen, changing from the matrix S to the matrix M (§64), we have finally for the
scattering amplitude
Mfi = e 2 {(«47""2)D^(P4- P2)(U37"".)-(Ö47 >1 W.)^(P4-P.)(Ö37 ,, "2)}.
(73.11)
Here we have used the photon propagation function in the momentum representation:
Dtlv(k) = JD^v(Oeik'd4l
(73.12)
Each of the two terms in the amplitude (73.11) can be symbolically represented
by means of a Feynman diagram : the first term by
e'^Y'u^D^kMüjY'u,) =
|k
(73.13)
290
Invariant Perturbation Theory
§73
Each point of intersection of lines (a vertex of the diagram) has a corresponding
factor 7. The "incoming" continuous lines towards a vertex represent the initial
electrons, which are associated with the factors u, the bispinor amplitudes of the
corresponding electron states. The "outgoing" continuous lines leaving a vertex are
the final electrons, and correspond to the factors ü. When the diagram is "read",
these factors are written from left to right in the order of movement along the
continuous lines against the direction of the arrows. The two vertices are joined by
a broken line which represents a virtual (intermediate) photon "emitted" at one
vertex and "absorbed" at the other, and corresponds to the factor -iD^ik). The
4-momentum of the virtual photon k is determined by the "conservation of
4-momentum" at the vertex: the total momenta of the incoming and outgoing lines
are equal. In this case k = p} - p 3 = p 4 ~P2. As well as the factors mentioned, the
whole diagram is also assigned a factor {-ief (the exponent being the number of
vertices in the diagram), and then represents a term in iMfi. Similarly, the second
term in (73.11) is represented by the diagram
P
e2(C4Yuu1)D:JV(k,)(Ü3Yvu2)
=
'^
^
\ k*
Pl
(73.14)
with k' = pi - p 4 = P3 - P2. It does not matter whether the diagram is read from the
end of p 3 or from that of p4. The resulting expressions are equal, because the tensor
DM„ is symmetrical. The choice of direction for the virtual photon line is also
immaterial: a change in its direction simply reverses the sign of k, which does not
matter, since the functions Dßl/(k) are even (see §76).
The lines corresponding to the initial and final particles are called the external
lines or free ends of the diagram. The diagrams (73.13) and (73.14) differ by the
interchange of two electron free ends (p3 and p4). This interchange of two fermions
reverses the sign of the diagram, in accordance with the fact that the two terms
appear with opposite signs in the amplitude (73.11).
We shall everywhere use Feynman diagrams in this momentum representation,
but they can also be associated with the terms in the scattering amplitude in the
original coordinate representation (the integrals (73.10)). Here the electron amplitudes are replaced by the corresponding coordinate wave functions, and the
propagators are in the coordinate representation. Each vertex corresponds to one
of the variables of integration (x or x' in (73.10)); the factors assigned to the lines
that meet at a vertex are taken as functions of the corresponding variable.
Let us now consider the mutual scattering of an electron and a positron; their
initial momenta will be denoted by p . and p+ respectively, and their final momenta
by p . and p.+.
The positron creation and annihilation operators appear in the (//-operators
(73.6) together with the electron annihilation and creation operators respectively.
Whereas in the previous case the operator tp annihilated the two initial particles
and (// created the two final particles, here these operators act oppositely with
§73
291
Feynman Diagrams for Electron Scattering
regard to electrons and positrons- The conjugate function i/>(-p+) will therefore
now describe the initial positron, and i/>(-p+) the final positron, both being
functions of the 4-momentum with reversed sign. Taking account of this difference,
we obtain the scattering amplitudet
Mfi = -e?(fi(pl)y^(pO)D^(p--pL)(fi(-p + )7^(-p;))^
+ e 2 (ö(-p + )7^(p-))D^(p- + p+)(ö(p'-)7^(-p'+)).
(73.15)
The two terms in this expression are represented by the following diagrams:
(7.3.16)
The rules for constructing the diagrams are altered only as regards the positrons.
The incoming and outgoing continuous lines are again associated with factors u and
ü respectively. Now, however, the incoming lines correspond tofinalpositrons and
the outgoing lines to initial positrons, the momenta of all the positrons being taken
with reversed sign.
The difference between the two diagrams (73.16) should be noted. In the first
diagram, lines of the initial and final electrons meet at one vertex, and those of the
two positrons at the other. In the second diagram, initial electron and positron lines
meet at one vertex, and final lines at the other. The upper vertex represents
annihilation of a pair with emission of a virtual photon; the lower vertex represents
the creation of a pair from this photon.
This difference affects the properties of the virtual photons in the two diagrams.
In the first diagram ("scattering" type), the 4-momentum of the virtual photon is
the difference between those of the two electrons (or positrons); hence k 2 <0 (cf.
(73.1)). In the second diagram ("annihilation" type), k' = p_ + p+, and hence k'2 >0.
Here it should be noted that for a virtual photon we always have k2 # 0, unlike a
real photon, for which k2 = 0.
If the colliding particles $re not identical and also not a particle and its
antiparticle (for instance, an electron and a muon), then the scattering amplitude is
represented by a single diagram:
p(o);
(73.17)
t The sign of the whole amplitude is definite in the scattering of non-identical particles, being
determined by the fact that in (73.5) the "external" operators must be arranged so that the two electron
operators are both at the ends: <0|a'b'... b*a+\0) (or both in the middle); this condition ensures the
"same sign" of the initial and final vacuum states. The sign of the amplitude can also be verified from
the non-relativistic limit: we shall see later (§81) that, in this limit, the second term in (73.15) tends to
zero, and the first term tends to the Born amplitude of Rutherford scattering.
292
Invariant Perturbation Theory
§74
There can be no annihilation or exchange type diagram in this case. The same
result can be obtained analytically by writing the current operator as the sum of
electron and muon currents:
and taking, in the product j(*Xx)jlvXx')> the matrix elements of terms which give the
required annihilations and creations of particles.
Let us now consider first-order processes, which, as mentioned at the beginning
of this section, are forbidden by the conservation of 4-momentum. The matrix
elements of the operator
Sil) = -ie [ j(x)Â(x)d4x
(73.18)
for such transitions correspond to the creation or annihilation of three real particles
(two electrons and one photon) at "the same point x". They occur by the
contraction of the operators i//(x) and <//(x) at the same point, and are expressed,
for example in the case of photon emission, by integrals of the form
Sfi = -iej
Ûx)Hx)(yA*(x))
d4x,
which vanish because the integrand includes the factor exp[-f(pi - p2 - k)x] with a
non-zero exponent. In the language of Feynman diagrams, this means that diagrams
with three free ends such as
I
I*
y\
(73.19)
are zero.
For the same reason, second-order processes involving six particles in the initial
state (or in the final state) are impossible. In the matrix element Sfl for such a
transition, the integral over dAx d V would separate into a product of two vanishing
integrals over d4x and d V of products of three wave functions taken at the same
point. In other words, the corresponding diagram would separate into two independent diagrams of the type (73.19).
§ 74. Feynman diagrams for photon scattering
Let us now consider another second-order effect: the scattering of a photon by
an electron (the Compton effect). In the initial state let the photon and the electron
§74
Feynman Diagrams for Photon Scattering
293
have 4-momenta k\ and p{9 and in the final state k2 and p2 (and also definite
polarizations, which will be omitted for brevity).
The photon matrix element is
<2|TA|1(x)A,(x')|l> = <0|c2TAM(x)A,(x')c||0),
(74.1)
where
k
Contraction of the external and internal operators gives
(74.1) = CiA^A'vC^ + c2A^Afvct
= AfcAl, + A1|iA?1,\
(74.2)
where we have used the commutativity of the operators cx and c 2 ; for the same
reason, the symbol T can here be omitted.
The electron matrix element is
<2|Tr(x)f(x')|l> = (0\a2T(W*)WyW)atfi)>
(74.3)
This involves four ^-operators. Only two are concerned with the annihilation of
electron 1 and the creation pf electron %, and will be contracted with the operators
df and d2. These may be <//, tp or i£', ^ (but not i£, i£ or ty\ $': the creation and
annihilation of two real electrons and one real photon at the same point x or x'
would give an expression equal to zero). By carrying out the two possible ways of
contraction, we obtain two terms in the matrix element (74.3). These will first be
written on the assumption that t > t':
(74.3) = a^y^)(<Â V < / ^ r + aAfaWiiy'l'laî.
(74.4)
tp'ât-^â\â\^\.
In the first term the contracted operators are d 2 ^-»d 2 d 2 (£ 2 ,
Since the operators d2d2 and â\â\ are diagonal and appear at the end of the
products, they can be replaced by the vacuum expectation value, i.e. unity. To
make a similar transformation in the second term of (74.4), the operator d 2 must
first be "pulled" to the left, and â\ to the right. This is done by means of the
commutation rules for the operators dp, dpi
(74.5)
{up, $}+ = <ÄP, {dp, <£}+ = i/>p.J
Then (74.4) becomes
<0|(fcY^)(*'YTi) - (*y^iX*27> # )|0>,
t > t'\
(74.6)
294
§74
Invariant Perturbation Theory
only the operator factors are averaged, of course. Similarly, for t < t\ we obtain an
expression differing by the interchange of /J, and v and of the primed and unprimed
symbols:
<0| - (i//y>i)(i£ 2 y^) + (<Â27WO(^Y^i)|0>,
t < V.
(74.7)
The two expressions (74.6) and (74.7) can be written as one by using the
chronological product of the (//-operators:
Til(x)$k(x()=ipi(x)$k(x^
= -<Mx')ifc(x),
t'<t;}
V>U J
(74.8)
where i and k are bispinor indices. Then the first and second terms in (74.6), (74.7)
can be combined in the form
tÂ27^<0|Ti// • i//|0>y>', + t ^ ? " « ) ^ ' •
IÂ|0)T^I,
(74.9)
where i// • \\i denotes the matrix i//,i//k.
It should be noted that, in the natural definition (74.8), the operator products are
taken with opposite signs for t < V and t > t'. In this respect it differs from the
definition of the T product which has been used for the operators A and j. This
difference arises because the fermion operators ip and \\ß anticommute outside the
light^cone, unlike the commuting boson operators A and the bilinear operators
j = i//yi//.t This procedure ensures the relativistic invariance of the definition (74.8).
A formal proof of the commutation rules for the (//-operators will be given in §75.$
We shall define the electron propagation function or electron propagator, a
bispinor of rank two, as
Gik(x - x ' ) = -i<0|T^(jc)fe(jc')|0>.
(74.10)
Then the electron matrix element becomes
<2|TrU)r(x')|l> = i$iy»GyvVx + Wy'Gy^x.
(74.11)
On multiplication by the photon matrix element (74.1) and integration over d4x d V ,
the two terms in (74.11) give the same result, and so we have
S/i = - ie2 [ [ d4x d V i2(x)yILG(x - x')7yi//,(x') x
x {A2*M(x)Au(x') + Af,(x')AlM(x)}.
(74.12)
t The ^-operators themselves, it will be recalled, correspond to no measurable physical quantities,
and therefore need not commute outside the light cone.
t The T product of any number of ^-operators may be defined similarly. It is equal to the product of
all the operators arranged in order of increasing time from right to left, the sign being determined by the
parity of the interchange needed to obtain this order from the order shown under the T product symbol.
Accordingly, this sign changes when any two «//-operators are interchanged; for example,
TÄ(x)e(x') = -T(f k U')Ä(x).
I
§75
295
The Electron Propagator
Substituting the plane waves (64.8), (64.9) for the electron and photon wave
functions and separating the delta function as in (73.10), we obtain finally the
scattering amplitude
Mfl = -47re2û2{(ye^G(pi + kd(yei) + (yeùG(p[-k2)(yemuu
(74.13)
where e\, e2 are the photon polarization 4-vectors and G(p) the electron propagator
in the momentum representation.
The two terms in this expression are represented by the following Feynman
diagrams:
k2
j«,
v
4 7 r e 2 G 2 ( y e l ) G (f) (^e,) u, =
/
\
>
«
fsP, + k,
\
(74.14)
The broken-line free ends of the diagrams correspond to real photons; the
incoming lines (initial photon) are associated with a factor V(47r)e, and the
outgoing lines (final photon) with a factor V(47r)e*, where e is the polarization
4-vector. In the first diagram, the initial photon is absorbed together with the initial
electron, and the final photon is emitted together with the final electron. In the
second diagram, the final photon is emitted together with the annihilation of the
initial electron, and the initial photon is absorbed together with the creation of the
final electron.
The continuous internal line joining the two vertices represents a virtual
electron whose 4-momentum is determined by the conservation of the 4-momentum
at the vertices. This line is associated with a factor iG(J). Unlike the 4-momentum
of a real particle, that of the virtual electron has a square which is not equal to m2.
If the invariant f2 is considered, for example, in the rest frame of the electron, we
easily find that
f = (Pi + M 2 > m2,
/' 2 = (p, - k2)2 < m2.
(74.15)
§75. The electron propagator
The propagation functions or propagators defined in §§73 and 74 are of
fundamental importance in the formalism of quantum electrodynamics. The photon
propagator D^ is a basic characteristic of the interaction of two electrons, as is
Invariant Perturbation Theory
296
§75
shown by its position in the electron scattering amplitude, in which it is multiplied
by the transition currents of the two particles. The electron propagator plays a
similar part in the electron-photon interaction.
Let us now calculate the actual values of the propagators, taking first the
electron propagator. Let the operator yp - m, where pM = id^ act on the function
Gik(x-x')
= -KOlTi/zKxJ^U^O),
(75.1)
i and k being bispinor indices. Since tp(x) satisfies Dirac's equation (yp - m)^f(x) = 0,
we find that the result is zero at all points x, except those for which t = t\ The reason is
that G ( x - x ' ) tends to different limits as t->f' + 0 and f - » t ' - 0 : according to the
definition (74.8) these limits are respectively
-i<0|^(r,t)A(rM)|0>
and +i<0|^k(r', t)^(r, t)|0>,
and, as we shall see, they are not the same on the light cone. This causes an additional
delta-function term to appear in the derivative dG/dt:
. / , T^Mx>)
dG
0^ + o ( t - 0 ( G , ^ o - G , ^ - o ) .
(75.2)
Since the derivative with respect to t appears in the operator yp - m in the form
iy° djdty we therefore have
(yp - m)ikGkl(x - x') = 8(t - 07*<0|{iMr, 0 , <Mr\ 0}+|0>.
(75.3)
The antiçommutator is calculated as follows. On multiplying the operators
<£(r, t) and <£(r', t) (see (73.6)) and using the rules for interchange of the fermion
operators dp, bp, we find
{<Mr, 0 , <Mr\ OK = S tMr)<Mr') + ^i(r)*. P i k (r')],
p
(75.4)
where i//±p(r) are wave functions without the time factor; as in §§73 and 74, the
polarization indices are omitted for brevity. The set of all functions </>±p(r), which
are eigenfunctions of the electron Hamiltonian, forms a complete set of normalized
functions, and according to the general properties of these (cf. QAf, (5.12)) we have
2 iMmW)
p
+ ^pAr)^Ar9)] = M(r-r').
(75.5)
The sum on the right-hand side of (75.4) differs from that in (75.5) in that <}/t is
replaced by (*lt*y\, and its value is 7? k 6(r-r'). Thus
{.Mr, 0 , «K(r\ OK = 5(r - r')y?k.
(75.6)
From this formula it follows, in particular, that the operators tjf and ip anti-
The Electron Propagator
§75
297
commute outside the light cone, as stated in §74. When (x -x'f < 0 there is always
a frame of reference in which t - t'\ if then rï r', the anticommutator (75.6) is in
fact zero.
Substituting (75.6) in (75.3) (and omitting the bispinor indices), we have finallyt
(yp - m)G(x - x') = S(4)(x - x').
(75.7)
Thus the electron propagator satisfies Dirac's equation with a delta function on
the right-hand side. Mathematically speaking, therefore, it is the Green's function
for Dirac's equation.
We shall later be concerned not with the function G(£) itself (£ = x - x'), but with
its Fourier components:
G(p) = JG(OeipidAÇ
(75.8)
(the propagator in the momentum representation). Taking the Fourier component
of each side of (75.7), we find that G(p) satisfies the algebraic equations
(7p-m)G(p)=l,
(75.9)
G(p) = 2 Ç ^ .
(75.10)
the solution of which is
The four components of the 4-vector p in G(p) are independent variables, not
related by p2 = Po~p 2 = m2. Writing the denominator in (75.10) as Po-(p 2 + m2), we
see that G(p), as a function of p0 for given p2, has two poles at p0 = ±e, where
e = V(p2 + m2). Thus, in the integration with respect to p0 in the integral
G(a = (2^/e- i p *G(p)d 4 p
=
(àfjd3p ' e" "f df>0 ' e_iP0T G(P)
(75U)
(where r = t — t'), the question of avoiding the poles arises; until this is decided, the
expression (75.10) remains essentially indeterminate.
To settle this question, we go back to the original definition (75.1), and
substitute in it the ^-operators as the sums (73.6), noting that the only non-zero
vacuum expectation values are those of the following products of creation and
annihilation operators:
<0|apap+|0)=l,
(0|bp*>p+|0)=l.
t The explicit form, including the bispinor indices, is
(yp - m )uGtk(x - x') = 5(4,(x - x')6ik.
(75.7a)
298
Invariant Perturbation Theory
§75
(Since in the vacuum state there are no particles, a particle has to be "created" by
the operator âp or bp before it can be "annihilated" by dp or b9.) The result is
Ga(jc - x') = -i 2 «Mr, 0«Mr\ O
p
= - i 2 e"M"0 ^(r)^ k (r')
for f - r' >0;
P
(75.12)
Gik(x - x') = » 2 <Ê_M(r\ f')^-p,i(r, 0
P
= i £ eh(,-,,) ^-p,,(r)t/;_p.k(r')
for t - t' <0.
For t > f only the electron terms contribute to G, and for t < t' only the positron
terms.
If the summation over p is replaced by an integration over d3p» a comparison of
(75.12) and (75.11) shows that the integral
Jé?-ip°TG(p)dpo
(75.13)
must have a phase factor e~iET for T > 0 and eUr for T <0. This can be achieved by
passing above the pole p0 = e and below p0 = - e in the plane of the complex
variable p0:
-\v
■^V
(75.14)
For, when T > 0, the path of integration is closed by an infinite semicircle in the
lower half-plane, so that the value of the integral (75.13) is given by the residue at
the pole po = +e; when T < 0 , the path is closed in the upper half-plane, and the
integral is given by the residue at the pole p0 = - e. The desired result is thus
obtained in each case.
This rule for avoiding the poles (Feynman's rule) can be differently stated as
follows: the integration is everywhere along the real axis, but the mass m of the
particle is given an infinitesimal negative imaginary part:
m-*m- iO.
We then have
e-*V[p 2 + (m-iO) 2 ]
= V[p2 + m 2 -iO]
= e - /O.
(75.15)
§75
The Electron Propagator
299
The poles p0 = ± e are therefore moved off the real axis:
- e + iO
—!
.
■
(75.16)
+ c-iO
and the integration along this axis is equivalent to integration along the path
(75.14).t Using the rule (75.15), we can write the propagator (75.10) in the form
G(p)=
^PV".ft.
p -m +i0
(75.17)
The rule of integration with displaced poles can be proved by means of the
relation
1
x + iO
(75.18)
=P--i7TÔ(x),
x
which is to be taken in the sense that multiplication by any function /(x) and
integration gives
I l&L dx = P jtMdx.
— oo
lV/(0 )
(75.19)
—oc
(the symbol P denoting the principal value).
The Green's function (75.10) is the product of the bispinor factor yp + m and a
scalar,
G (0) (p)=l/(p 2 -m 2 ).
(75.20)
The corresponding coordinate function G{0)(£) is evidently a solution of the
equation
(p2 - m2)G(0)(x - x') = 8(4)(x - x'),
(75.21)
i.e. it is the Green's function of the equation (p2 - m2)tj/ — 0. In this sense we can
say that G(0)(x -x') is the scalar-partide propagator. It is easily seen by calculation,
in the same manner as above, that the scalar field propagation function can be
expressed in terms of the «//-operators (11.2) by
G(0)(x - x') = -i<0|T«KxW+(x')|0>,
(75.22)
which is analogous to the definition (75.1). The chronological product is defined (as
t It is useful to note that the rule for moving the poles corresponds to an infinitesimal damping of
G(x - x') with respect to |T| = |I - t'\: if the value of po at the displaced poles is written as -(e - i8) and
+ (f -i5), with S-»+0, then the time factor in the integral (75.13) becomes exp(-ie|r|- S|T|).
300
Invariant Perturbation Theory
§76
for all boson operators) by
T$(x)4,+(x') = i(x)4t+(x'),
= ti^(x')tMx),
r>f';i
t <t\ I
(75.23)
with the same sign for both t > t' and t < f'.
§ 76. The photon propagator
Hitherto we have been concerned (in §§43 and 74) with the explicit form of the
electromagnetic field operator A only in finding the matrix elements with respect to
a change in the number of real photons. For this purpose it was sufficient to use the
representation (§2) of the free field potentials in terms of transverse plane waves.
This representation, however, does not give a complete description of every
field, as is clear from the fact that the scattering diagrams (73.13), (73.14) must also
take account of the Coulomb interaction of the electrons. The latter is described by
the scalar potential 3> and certainly cannot be reduced to an exchange between
transverse virtual photons (describable by a vector potential such that div A = 0).t
Thus we have as yet essentially no complete definition of the operators A, and
without this it is impossible to carry out a direct calculation of the photon
propagator by means of the formula
D^(x -x') = i<0|TA^(jc)A,(x')|0).
(76.1)
On the other hand, the fact that the potentials are not gauge-invariant deprives of
much of their physical meaning the operators which would be needed for a
complete quantization of the electromagnetic field.
These difficulties, however, are purely formal, not physical, and can be avoided
by using certain general properties of the propagator, which are evident from the
requirements of relativistic invariance and gauge invariance.
The most general 4-tensor of rank two which depends only on the 4-vector
£ = x - x' is
D»AO = g».D{è2) - d^D{i\e\
(76.2)
where D and D (l) are scalar functions of the invariant £2.t This tensor is necessarily
symmetrical.
t With the condition div A = 0, Maxwell's equations lead to the following equations for A and $:
d<t>
DA = -4irj + V — ,
dt
A * = -4<7rp.
In this gauge, the potential <I> satisfies the static Poisson's equation; cf. (76.13) below with Doo in the
same gauge.
t These functions are different in the three ranges of values of the argument which are not mutually
interchanged by Lorentz transformations: the regions outside the light cone (£' <0), and within its two
parts ( £ 2 > 0 ; & > 0 , £o<0).
§76
301
The Photon Propagator
In the momentum representation, we correspondingly have
D^(k) = D(k2)g^ + k,KDil)(k2).
(76.3)
where D{k2), D(l\k2) are the Fourier components of the functions D(£2), D (0 (£ 2 ).
The photon propagation function, in physical quantities (scattering amplitudes),
is multiplied by the transition currents of two electrons, i.e. it appears in combinations of the form (j^)2\D^(jv)^; see, for instance, (73.13). But, because of the
conservation of current (3 M j M =0), the matrix elements J21 = «/^Y^i satisfy the
condition of 4-transversality,
Mn2i=0,
(76.4)
where k = p2~P\; cf. (43.13). It is therefore clear that all physical results are
unchanged by the substitution
( 76 - 5 )
D^ -+ D M , + XILK + XvK>
where the *M are any functions of k and k0. This arbitrariness in the choice of D^
corresponds to that in the field potential gauge.
The arbitrary gauge transformation (76.5) can violate the relativistically invariant form D ^ assumed in (76.3) if the quantities \n do not make up a 4-vector.
But, even considering only relativistically invariant forms of the propagator, we see
that the choice of the function D{l)(k2) in (76.3) is entirely arbitrary; it does not
affect any physical results, and can be made in any convenient manner (L. D.
Landau, A. A. Abrikosov and I. M. Khalatnikov, 1954).
Thus the determination of the propagation function amounts to that of a single
gauge-invariant function D(k2). If we take a given value of k2, and the z-axis in the
direction of k, the transformations (76.5) will not affect the components Dxx = Dyy =
-D(k2). It is therefore sufficient to calculate the component Dxx, using any gauge
for the potentials.
We shall use a gauge in which div A = 0 and the operator A is given by the
expansion (2.17), (2.18):
(c k a e ( a ) e~ikx + cLeM*
eikx\
a> = Ikl;
(76.6)
the index a = 1, 2 labels the polarizations. The only non-zero vacuum expectation
values of products of the operators c, c* are (0|ckaCka|0) = 1. Then, by the definition
(76.1), we have
D Ä (|) = ^ | ^ ^ ( 2 e i a ) e l r ) * ) e - M T , + , k - 1 .
(76.7).
where i, k are three-dimensional vector indices; the summation over k has been
replaced by an integration over d3fc/(27r)\ The absolute value of r = t - V appears in
the exponent, because the operator product in (76.1) is chronological.
Invariant Perturbation Theory
302
§76
It is evident from (76.7) that the integrand without the factor e ,k * is the
component of the three-dimensional Fourier expansion of the function D,k(r, 0«
For Dxx - -D, it is
(2iri/a>) e-^ 2 K°T = (27ri/a>) <TiwW.
a
To find Dxx(k2) we now have to represent this function as a Fourier integral over
time. The appropriate formula is
M €-Mn = - -L f
<o
4
J
e-V dko.
2ir J ko-kl + »0
—x
As explained in §75, this integration is understood to be taken along a contour
passing below the pole k0 = |k| = o) and above the pole ko = _ N = _ w ; for T > 0 the
value of the integral is determined by the residue at the pole k0 = +a>, and for T < 0
by that at k0 = - w.
Thus we have finally
D(k2) = 47r/(k2 + /O).
(76.8)
The term +i0 in the denominator which results from this proof is in accordance
with the rule (75.15), JO being subtracted from the (zero) mass of the photon. It is
evident from (76.8) that the corresponding coordinate function D(£2) satisfies the
equation
- d^D(x - x') = 4TTS(4)(X - x\
(76.9)
i.e. it is the Green's function of the wave equation.
We shall generally take D (0 = 0, i.e. use the propagation function
DM„ = g^D(k2) = p ^ g,.
(76.10)
(the Feynman gauge).
There are also other gauges which may be advantageous in certain applications.
Putting D(,) = -Dlk2, we obtain the propagator in the form
(the Landau gauge), with D^Jk" = 0. This choice is similar to the Lorentz gauge for
potentials (Aßk" = 0).
The propagator gauge conditions Duk' = 0, D0jk' = 0 are analogous to the threedimensional gauge condition div A = 0 for the potentials. Together with
Dxx = -D = -4nlk\
§76
The Photon Propagator
303
D
< 76 - 12)
these conditions give
- = -Ä(5«-f)-
In order to obtain this Da, we must apply to the propagator (76.10) the transformation (76.5), putting
-
*°~
4TTÙ)
_
2(a>2-k2)k2'
Xi
Atrkj
~2(o) 2 -k 2 )k 2,
The remaining components DM„ are then found to be
Doo = -47r/k2,
Do,=0.
(76.13)
This is called the Coulomb gauge (E. E. Salpeter, 1952). Dw is here the Fourier
component of the Coulomb potential.
Finally, the propagator gauge in which
Du =
p-TlUi ~ *TT),
A>, = Doo = 0,
(76.14)
is analogous to the potential gauge condition O = 0. This is a convenient form for
use in non-relativistic problems (I. E. Dzyaloshinskil and L. P. Pitaevskiï, 1959).
All the above expressions relate to the momentum representation of the
propagator. In some cases, it is convenient to use the mixed frequency-coordinate
representation, i.e. the function
D^(w, r) = f D^(û), k) eik" d3kl(2ir)\
In the Feynman gauge (76.10)
D^w, r) = g>tvD(o>, r),
where
rv
* = A4 7 rf J e_ikr . d'k
D(a>,r)
w2 k5+ 0(27r)3
=
o
'Vr}<oi-k^i0kdk
or, changing k to -k in the second term in the integrand,
™
,
i f e'krkdk
irr J a>£ - k + tO
(76.15)
304
Invariant Perturbation Theory
§77
The integration here is carried out by closing the contour of integration with an
infinite semicircle in the upper half-plane of the complex variable k, and amounts to
taking the residue at the pole k = |o>| + iO. The final result is
D(co,r) = - - e , ' M r .
(76.16)
The following comment may be made regarding this expression. The process
described by the diagrams (73.13) and (73.14) may be intuitively regarded as the
scattering of electron 2 in the field due to electron 1 (or vice versa). The function
(76.16) corresponds to the usual "retarded" potential <* elior (see Fields, (64.1),
(64.2)) only when to > 0. The sign of CD, however, depends on the arbitrary choice of
the direction of the arrow of k in the diagram. The above-mentioned property of
D(oj, r) signifies that in quantum electrodynamics the source of the field is to be
regarded as the particle which loses energy, i.e. emits a virtual photon.
To conclude this section, let us also consider the problem of the propagator for
particles with spin 1 and non-zero mass. There is then no arbitrariness of the gauge,
and the choice of the propagator is unambiguous.
Substituting the i/f-operators (14.16) in the definition
G^ = -i<0|T^(jc)^:(jc')|0>f
(76.17)
we obtain an expression that differs from (76.7) only in that the sum over
polarizations in the integrand is replaced by
a
Summation over polarizations is equivalent to averaging and multiplication by 3,
the number of independent polarizations. Averaging gives the density matrix of
unpolarized particles (14.15). Thus we find for the propagator of vector particles
The propagators (75.17) and (76.18) have similar structures: the denominator
contains the difference p2-m2,
and the numerator is (apart from a factor) the
density matrix of unpolarized particles with a given spin.
§77. General rules of the diagram technique
The calculation of the scattering matrix elements that has been given for some
simple cases in §§73 and 74 contains all the fundamental features of the general
method. There is no particular difficulty in deriving the corresponding general rules
for calculating the matrix elements in any order of perturbation theory.
As has already been mentioned, the matrix element of the scattering operator Ê
for the transition between any initial and final states is equal to the vacuum
§77
General Rules of the Diagram Technique
305
expectation value of the operator obtained by multiplying S on the right by the
creation operators of all the initial particles and on the left by the annihilation
operators of all the final particles,
This treatment puts the S-matrix element in the following form in the nth order
of perturbation theory:
</|S(1i> = ^ < 0 | . . . M t / . - - f l i / . . - c , , x
x J d 4 *,... d4xnT{Û-ieyA{)i!>ô...
(<M-i>yAJtM}cu . . . aïi... i>r,... |0>;
(77.1)
the suffixes li, 2i,... label the initial particles (positrons, electrons and photons
separately), and the suffixes 1/, 2 / , . . . label the final particles. The suffixes 1,2,...
to the operators i£ and  signify that <£i = 4*(x\) and so on. The operators i£ and A
which appear here are linear combinations of the creation and annihilation operators of the corresponding particles in various states. Thus we obtain expressions
for the matrix elements which are the vacuum expectation values of the products
of the particle creation and annihilation operators and of their linear combinations.
The calculation of such expectation values is effected by means of the following
results, which constitute Wick's theorem (G. C. Wick, 1950).
(1) The vacuum expectation value of the product of any number of boson
operators c+ and c is equal to the sum of the products of all possible expectation
values of these operators taken in pairs (contraction). In each pair, the factors must
be placed in the same order as in the original product.
(2) For the fermion operators d+, d, b+, b (of the same or different particles),
the rule is the same except that each term appears in the sum with positive or
negative sign according to the parity of the number of interchanges of fermion
operators needed to bring together all the operators that are averaged in pairs.
The expectation value must obviously be zero unless the product contains a
factor d+, b+, c+ for each factor d, b, c. Then only pairs of operators (d, d + ),...,
pertaining to the same states are to be contracted, and moreover only those pairs in
which d*, etc., is to the right of d, etc.: the particle is first created and then
annihilated (whereas <0|a+a|0) = 0, etc.).
If each pair (d, d+), etc. appears only once in the product, Wick's theorem is
obviously true, the expectation value then reducing to a single product of pairwise
expectation values. Its validity is also evident when all the annihilation operators in
the product are to the right of the creation operators; this is called a normal
product. The expectation value is then zero. Wick's theorem is now easily proved
by induction for the general case where one pair of operators appears k times in
the product, as follows.
Let us consider the expectation value (0|.. cc* .. |0>, in which the pair of boson
operators appears k times; the argument is entirely similar for fermion operators. If
we interchange the factors c and c* in one pair, the commutation rules give
<0| ..cc\.
|0> = <0|.. c+c.. |0> + <0|.. 1.. |0>.
(77.2)
306
Invariant Perturbation Theory
§77
The expectation value (0|. . 1 . . |0) contains k - 1 pairs, and Wick's theorem is
assumed to be valid for it. If the expectation value <0|.. cc* . . |0) is expanded by
Wick's theorem, it differs from (0| . . c V . . |0) by just the term
<0|..
1 . J0><0|ccf|0> = <0|.. l . . | 0 > ;
in the expansion of (0| . . c c . . |0), the corresponding term (0|. . 1 . . |0)(0|c 4 c|0) is
zero. Hence it follows from (77.2) that, if Wick's theorem is valid for a matrix
element (0| . . c c. . |0), it is still valid when c and c + are interchanged. Since the
theorem is known to be valid for one particular order of factors (the normal order),
it is therefore true in every case.
Since Wick's theorem is valid for products of operators d, b , . . . , it is also true
for all products which contain the linear combinations <£, i//, A of d, b , . . . , as well
as the latter operators themselves. On applying this theorem to the matrix element
(77.1), we bring it to the form of a sum of terms, each term being the product of a
number of pairwise expectation values. The latter will include contractions of the
operators t//, i//, Â with "external" operators—those which create the initial particles or annihilate the final particles. These contractions are expressed in terms of
the wave functions of the initial and final particles by the formulae
<0|ACp|0> = Ap,
<0|cpA|0> = A*, ]
<O|<K|O) = 0p,
<0|ap^|0> = t^*
<o|bpt//|o> = iKP,
<o|<fa;|o> = ^ -p»
(77.3)
where Ap and i//p are the photon and electron wave functions with momentum p
(the polarization indices are omitted for brevity, as in §§73 and 74). Contractions of
the "internal" operators in the T product will also occur. Since the sequence of
factors in each contracted pair is preserved when applying Wick's theorem, the
chronological sequence of operators is preserved in these contractions, and they
are therefore replaced by the corresponding propagators.!
Each term of the sum obtained from the matrix element by applying Wick's
theorem is represented by a particular Feynman diagram. In the nth-order diagram
there are n vertices, each corresponding to one of the variables of integration (the
4-vectors xu x2,...).
Three lines meet at each vertex, two being continuous
(electron lines) and one broken (photon line); these correspond to the electron
operators t// and ty and the photon operator A as functions of the same variable x.
The operator i/> corresponds to the incoming line and if/ to the outgoing line.
By way of illustration, we shall give some examples of the correlation between
the terms of the matrix element in the third approximation and the diagrams.
t The following comment must be made regarding this last statement. In proving Wick's theorem,
we have made use of the commutation rules for the operators c and c \ which are meaningful only for
real ("transverse") photons. The "external" operators cî> Cf do in fact, of course, correspond to such
(initial and final) photons, but the operators A (which appear within the T product) describe, as shown
in §76, not only transverse photons. The situation here is similar to that in the calculation of D^„ (§76).
Owing to the relativistic and gauge invariance, it is sufficient to prove the theorem for those products
(i.e. components of the tensors (0|TA^A,—10» which are determined by the transverse parts of the
potentials. The theorem is then valid for all products.
General Rules of the Diagram Technique
§77
307
Omitting the integral sign and the symbol T, and also the operator symbols, the
factors -iey, and the arguments of the operators, we can symbolically write these
terms as
(a)
($A<J,) (4-A'D (.J,A|)
(b)
(4-A4-) (vJ,A<[<) ($A<(,)
(77.4)
(c)
(.;A-J/)
(}A<;)
(.pA4,)
(d)
(^A-;)
(>;A^)
( W )
, 1
^ 1
.
For clarity, the electron and photon contractions are shown by continuous and
broken lines as in the diagrams. The direction of the arrows from i/r to \j* for the
electron contractions is the same as in the diagrams. For the internal photon
contractions the direction is immaterial (the photon propagator is an even function
of x -x').
The terms thus obtained include equivalent terms which differ only in that the
vertices are renumbered, i.e. that the correlation between the vertices and the
variables is changed, or simply that the variables of integration are renamed. The
number of such interchanges is n !; this cancels the factor 1/n ! in (77.1), and there is
then no need to consider diagrams differing only by interchange of vertices. This
has already been noted in §§73 and 74. For example, there are two equivalent
diagrams in the second approximation:
( + A + ) (<J/A<|0
=
/-A
(77.5)
In (77.4) and (77.5) only internal contractions which correspond to internal
diagram lines are shown (virtual electrons and photons). The operators still free are
contracted with external operators, and this establishes a correlation between the
free ends of the diagrams and certain initial and final particles. Then 1// (contracting
with operators âf or bt) gives the final electron line or the initial positron line, and
ip (contracting with at or bf) gives the initial electron line or the final positron line.
308
Invariant Perturbation Theory
§77
The free operator A (contracting with ct or cf) can correspond to either an initial
or a final photon. Thus we obtain sets of several topologically identical diagrams
(i.e. diagrams having the same number of lines arranged in the same way), differing
only by interchanges of initial and final particles between incoming.and outgoing
free ends.
Each such interchange is clearly equivalent to a certain interchange of the
external operators d, b , . . . in (77.1). It is therefore evident that, if the initial
particles or the final particles include identical fermions, diagrams which differ by
an odd number of interchanges of free ends must have opposite signs.
An uninterrupted sequence of continuous lines in the diagrams constitutes an
electron line along which the arrows maintain a constant direction. Such a line may
have two free ends or form a closed loop. For example, the diagram
('W) QÊAij,)
=
O*
has a loop with two vertices. The maintenance of direction along the electron line is
the graphical expression of the conservation of charge: the "incoming" charge at
each vertex is equal to the "outgoing" charge.
The arrangement of the bispinor indices along the continuous electron line
corresponds to writing the matrices from left to right in motion contrary to the
arrows. The bispinor indices of different electron lines can never become confused.
Along an open line, the sequence of indices terminates at the free ends with
electron (or positron) wave functions; in a closed loop, the sequence of indices is
itself closed, and the loop corresponds to the trace of the product of the matrices
found on it. This trace must be taken with negative sign, as is easily seen. A loop
with k vertices corresponds to a set of k contractions:
(•W)
(.}A+) .. • (•; A •})
(or to another which is equivalent, differing only in an interchange of the vertices).
In the (k - l)th contraction the operators \p and <// are together in the order ($ to
the right of i/0 in which they must appear in the electron propagator. The operators
at the ends are brought together by an even number of interchanges with other
(//-operators, and are then in the order i//i//.
Since
<0|T^>|0) = - < 0 | T ^ | 0 >
(see the second footnote to §74), the replacement of this contraction by the
corresponding propagator means a change in the sign of the whole expression.
In general, the change to the momentum representation is made in an exactly
similar manner to that in §§73 and 74. As well as the general law of conservation of
4-momentum, "conservation laws" must also be satisfied at each vertex. But all
these laws may not suffice to determine uniquely the momenta of all the internal
lines in the diagram. In such cases, there remain integrations over dApl{2ir)A for all
§77
General Rules of the Diagram Technique
309
the undetermined internal momenta; these integrations extend throughout p-space,
including p0 from -<* to +00.
In the above discussion it has been assumed that the perturbation is represented
by the interaction between those particles which are "actively" concerned in the
reaction (i.e. between particles whose state is altered as a result of the process). A
similar treatment can be given for the case where there is an external electromagnetic field, i.e. a field generated by "passive" particles, whose state is not altered in
the process.
Let A{e\x) be the 4-potential of the external field. It appears in the Lagrangian
of the interaction together with the photon operator A, as the sum À + A(f) (which
is multiplied by the current operator j). Since A{€) does not involve any operators, it
cannot contract with other operators. Thus only external lines in Feynman
diagrams can correspond to an external field.
If A{€) is expressed as a Fourier integral:
A(e)(x) = f A{t\q) e~iqx d4ql(2ir)\
(77.6)
A{e\q)=(
A(e\x)eiqxd4x,
the expressions for the matrix elements in the momentum representation will
contain the 4-vector q together with the 4-momenta of other external lines
corresponding to real particles. Each such external-field line can be correlated with
a factor AU)(q), and the line is to be regarded as "incoming" (in accordance with
the sign of the exponent in the factor e~tQX which accompanies A{e\q) in the Fourier
integral; an "outgoing line" would be correlated with a factor A(f)*(q)). If the
4-momenta of all the external-field lines are not uniquely defined (for given
4-momenta of all the real particles) by the law of conservation of 4-momentum,
then there remain integrations over d4q/(27r)4 for all the "free" q and over all the
other undetermined 4-momenta of the diagram lines.
If the external field is independent of time, then
AM(q) = 27rÔ(q°)AU)(q),
(77.7)
where A(e)(q) is the three-dimensional Fourier component:
A(€)(q) - j A(°(r) e~* r d3x.
(77.8)
In this case the external line is correlated with A(f)(q) and assigned a 4momentum qM ~ (0, q); the energies of the electron lines which (together with the
field line) meet at a vertex will be equal by virtue of the conservation law.
Integration over d3p/(2ir)3 is necessary for the other "free" three-dimensional
momenta p of the internal lines. The amplitude Mfi thus calculated determines, for
example, the scattering cross-section by (64.25).
We may give a list of final rules for the diagram technique whereby an
310
Invariant Perturbation Theory
§77
expression may be obtained for the scattering amplitude (or rather for iMfi) in the
momentum representation.
(1) The nth approximation of perturbation theory corresponds to diagrams with
n vertices, each of which is the meeting point of one incoming and one outgoing
electron line (continuous) and one photon line (broken). The amplitude of the
scattering process involves all the diagrams having free ends (external lines) equal
in number to the initial and final particles.
(2) Each incoming continuous external line is associated with the amplitude
u(p) of an initial electron or u(-p) of a final positron (where p is the 4-momentum
of the particle). Each outgoing continuous line is associated with the amplitude
ü(p) of a final electron or ii(-p) of an initial positron.
(3) Each vertex is associated with a 4-vector — iey91.
(4) Each incoming broken external line is associated with the amplitude
y/(4ir)e^ of an initial photon, and each such outgoing line with the amplitude
V(47r)e* of a final photon, where e is the polarization 4-vector. The vector index /LL
is the same as the index of the matrix y* at the corresponding vertex, so that the
scalar product ye or ye* is obtained.
(5) Each continuous internal line is associated with a factor iG(p), and each
broken internal line with a factor -îD M „(p). The tensor indices /m, v are the same as
the indices of the matrices y*\ yv at the vertices joined by the broken line.
(6) The arrows have a constant direction along any continuous sequence of
electron lines, and the arrangement of the bispinor indices along them corresponds
to writing the matrices from left to right in motion contrary to the arrows. A closed
electron loop corresponds to the trace of the product of the matrices found on it.
(7) At each vertex, the 4-momenta of the lines which meet there satisfy a
conservation law, i.e. the sum of the momenta of the incoming lines is equal to the
sum of the momenta of the outgoing lines. The momenta of the free ends are given
quantities (subject to the general conservation law), with momentum - p assigned
to the positron line. Integration over d4pl(27r)4 is carried out for the momenta of
internal lines which remain undetermined after application of the conservation laws
at every vertex.
(8) An incoming free end corresponding to an external field is associated with a
factor A{e\q); the 4-vector q is related to the 4-momenta of the other lines by the
conservation law at the vertex. If the field is constant, the line is associated with a
factor A(e)(q), and integration over d3pl(2ir)3 is carried out for the three-dimensional momenta of internal lines which remain undetermined.
(9) An additional factor - 1 is included in iMfl for each closed electron loop in
the diagram and for each pair of positron external lines if these are the beginning
and end of a single sequence of continuous lines. If the initial particles or the final
particles include more than one electron or positron, the diagrams differing by an
odd number of interchanges of identical particles (i.e. of the corresponding external
lines) must have opposite signs.
To clarify the last rule, it may be added that diagrams having the same
continuous lines, i.e. diagrams which would be identical after removal of all photon
lines, must always have the same sign. When identical fermions are present, the
sign of the amplitude as a whole is arbitrary.
§78
Crossing Invariance
311
§78. Crossing invariance
The representation of the scattering amplitudes Mfi by Feynman integrals
reveals the following noteworthy symmetry property of these amplitudes.
Any of the incoming external lines in a Feynman diagram may be regarded
(without changing the.direction of its arrow) as either an initial particle or a final
antiparticle, and any outgoing line as either a final particle or an initial antiparticle.
When the change is made from particle to antiparticle, the significance of the
4-momentum p assigned to the line also changes: p = pe for the electron (say), and
p = ~pp for the positron. The polarization assigned to the particle is also changed.
Since an incoming external line must correspond to an amplitude u and an outgoing
one to w*, we have u = ue for the electron and u = u% for the positron; and the
change from M t o w * signifies a change in the sign of the spin component (or the
helicity) of the particle.
For the photon, a strictly neutral particle, this is simply a change from emission
to absorption or vice versa: an external photon line with momentum k corresponds
either to the absorption of a photon with momentum ka = k, or to the emission of a
photon with momentum ke = -k and the opposite helicity.
This change in the significance of the external lines is equivalent to a change
from one cross-channel of the reaction to others. Hence it follows that the same
amplitude, as a function of the momenta of the free ends of the diagrams, describes
every channel of the reaction.t Only the meaning of the arguments of the function
varies with the channel: the change from particle to antiparticle implies Pi-*-pf,
where p, is the 4-momentum of the initial particle (in one channel) and pf the
4-momentum of the final particle (in the other channel). This property of the
scattering amplitude is called crossing symmetry or crossing invariance.
In terms of the invariant amplitudes defined in §70 as functions of the kinematic
invariants, we can say that these functions will be the same for all channels, but for
each channel their arguments will take values in the corresponding physical region.
Thus the Feynman integrals determine the invariant amplitudes as analytic functions; their values in the various physical regions are the analytical continuation of
a function specified in one region. Since the integrands in the Feynman integrals
have singularities, so do the invariant amplitudes, and their singularities can be
determined from the expressions for the integrals, using the rule of pole avoidance.
If the invariant amplitudes are calculated for any one channel from the Feynman
integrals, their analytical continuation to the other channels will necessarily take
account of these singularities.
It should be emphasized that crossing invariance goes beyond the properties of
the scattering matrix which follow from the general requirements of space-time
symmetry. The latter imply the equality of amplitudes for processes which differ by
the interchange of initial and final states and the replacement of all particles by
antiparticles (with the momenta p of all particles unchanged and the signs of their
angular momentum components reversed). This is the condition of CPT int If a particular channel is forbidden by the conservation of 4-momentum, the transition probability
is necessarily zero because of the delta function which appears as a factor in (64.5).
312
Invariant Perturbation Theory
§79
variance.t Crossing invariance, however, allows this transformation not only for all
the particles at once but also for any one particle.
§79. Virtual particles
The internal lines in the Feynman diagrams play a role in invariant perturbation
theory analogous to that of the intermediate states in the "ordinary" theory, but the
nature of these states is different in the two theories. In the ordinary theory the
(three-dimensional) momentum is conserved in the intermediate states, but the
energy is not, and for this reason they are said to be virtual states. In the invariant
theory, the momentum and the energy appear on an equal footing: in the intermediate states, the whole 4-momentum is conserved (this results from the fact that
the integration in the S-matrix elements is over both coordinates and time, thus
ensuring the invariance of the theory). But the relation between energy and
momentum which holds for real particles and is expressed by the equation p2 = m2
is no longer satisfied in the intermediate states, which are therefore spoken of as
intermediate virtual particles. The relation between the momentum and energy of a
virtual particle may be anything required by the conservation of 4-momentum at
the vertices.
Let us consider a diagram consisting of two parts I and II, joined by a single
line. Ignoring the internal structure of these parts, we can represent the diagram in
the schematic form
(the lines shown may be either continuous or broken lines). By the general
conservation law, the sums of the 4-momenta of the external lines for parts I and II
are equal. Because of the conservation at each vertex, they are also equal to the
4-momentum p of the internal line joining parts I and IL Thus this momentum is
uniquely defined, and there is therefore no integration with respect to it in the
matrix element.
The quantity p 2 may be either positive or negative, depending on the reaction
channel. There is always a channel in which p 2 >0.t Then the virtual particle is
entirely analogous, as regards its formal properties, to a real particle with real mass
M = Vp 2 . Its rest frame can be defined, its spin determined, and so on.
The tensor structure of the photon propagator (76.11) is the same as that of the
density matrix of an unpolarized particle with spin 1 and non-zero mass:
P ^ = -3(g,i,
-p»pJm2)
t The formal description of the change from one of these reactions to the other by reversing the
signs of all the 4-momenta in the Feynman diagrams corresponds to the significance of the operation
CPT as 4-inversion.
t For example, the channel (if it is allowable on energy grounds) in which all the free ends of part I
correspond to initial particles and those of part II to final particles. Then p = P, (the sum of the
4-momenta of all the initial particles), and in the centre-of-mass system p = (P?% 0), so that p2 > 0 .
313
Virtual Particles
§79
(see (14.15)). For a virtual particle the propagator (a quantity obtained from a
quadratic combination of the field operators) plays a role analogous to that of the
density matrix for a real particle. Thus a virtual photon, like a real photon, must be
assigned spin 1. But, unlike the two independent polarizations of the real photon,
all three polarizations are possible for the virtual photon, which is a "particle" with
finite mass.
The electron propagation function is
G oc yp + m,
where m is the mass of the real electron, the "mass" of the virtual particle being
M = Vp2. Putting
7P
2M
(7P
*
2M
(7P
"
*'
(
)
we see that the first term corresponds to the density matrix of a particle with mass
M and spin \, and the second term to that of a similar "antiparticle"; cf. (29.10) and
(29.17). Since the particle and the antiparticle have different internal parities (§27),
we conclude that the same spin \ must be assigned to the virtual electron, but that
no definite parity can be assigned to it.
A characteristic feature of the diagram (79.1) is that it can be cut into two
unconnected parts by dividing only one internal line.t This line corresponds, in
such cases, to a one-particle intermediate state, i.e. a state having only one virtual
particle. The scattering amplitude corresponding to such a diagram contains the
characteristic factor (which does not undergo integration)
1
p2-m2+*0'
arising from the internal line p; m is the electron mass for an electron line and zero
for a photon line. Thus the scattering amplitude has poles at the values of p for
which the virtual particle would become a physical one (p 2 = m2). This situation is
similar to the one in non-relativistic quantum mechanics, where the scattering
amplitude has poles for energy values corresponding to bound states of the system
of colliding particles (QM, §128).
Let us consider the diagram (79.1) for the reaction channel in which all the free
ends on the right correspond to initial particles, and all those on the left to final
particles; then p 2 > 0 . Then we can say that, in the intermediate state, all the initial
particles are converted into one virtual particle. This is possible only if such a
conversion would not contradict the necessary conservation laws (not including the
conservation of 4-momentum), namely the conservation of angular momentum,
charge, charge parity, etc. This is the necessary condition for the occurrence of
what are called pole diagrams. If these exist for one reaction channel, they exist
also for the remaining channels, because of crossing invariance.
t This property occurs for the diagrams of almost all processes in the first non-vanishing approximation.
314
Invariant Perturbation Theory
§79
For example, the conservation laws mentioned do not preclude the formation of
a virtual electron by e + y -> e. This corresponds to a pole of the Compton effect
amplitude (and therefore a pole of the other channel of this reaction, namely
two-photon annihilation of an electron-positron pair). The formation of a virtual
photon by e~ + e* -*y corresponds to a pole of the amplitude for the scattering of
an electron by a positron, and therefore that of an electron by an electron. Two
photons can give neither a virtual electron nor a virtual photon: the conversion
7 + 7 -► e is forbidden by the conservation of charge or angular momentum, and
7 + 7 ^ 7 by that of charge parity. Accordingly, the photon-photon scattering
amplitude cannot involve pole diagrams.
The origin of the pole singularities of the scattering amplitudes, which has been
discussed above on the basis of Feynman integrals, is really more general and is
not dependent on perturbation theory. We shall show that such singularities arise
simply as a consequence of the unitarity condition (71.2).
Let us assume that the intermediate states n which appear in (71.2) include a
one-particle state. The contribution of this state is
(Tp - T$)(one-p) =
Î(2TT)4
2 f «(4)(P/ - p)TfnTtn 7 Ä ,
where p and À are the 4-momentum and helicity of the intermediate particle. The
integration over d3p is replaced by one over dAp (in the range p 0 = e >0):
d'p-*2e8(p2-M2)d4p,
where M is the mass of the intermediate particle. The integration eliminates the
delta function 8{4\Pf - p); we then change from the amplitudes Tfi to Mfi by (64.10),
obtaining
(Mfi - M£)(one-p) = 27Ti8(p2 - M2) 2 MfnM*.
A
(79.3)
Assuming T and P invariance, we have (apart from a phase factor) Mif = M/v,
where the states i\ /' differ from i, / only in the sign of the particle helicities (with
the same momenta). Taking the sum of equation (79.3) and the corresponding
equation for Mrt'- Mff>, we have
im M jTe'p) = -
ITHP2
- M2)R,
(79.4)
where
M/i = M„ + M/r,
R = -2(MfnMfn
+ MrnM?n).
A
Hence it follows that M/„ as an analytic function of p2 = P* = P% has a pole at
315
Virtual Particles
§79
p - M . According to (75.18) the pole part is
Mr»
= pi_*,
(79.5)
+ i()-
Real transitions to a one-particle state are possible only for one value of P] = P),
namely M2. Thus we in fact obtain the scattering amplitude structure corresponding to a diagram of the form (79.1).
Finally, let us consider an important property of diagrams containing closed
electron loops. This property is easily derived by applying the concept of charge
parity to a virtual photon: a virtual photon, like a real photon, must be assigned a
definite (negative) charge parity.t
If a diagram contains a closed loop (with number of vertices N > 2), the
amplitude for the process concerned must include not only that diagram but also
another which differs only in the direction of traversal of the loop (if N = 2, there
is evidently no distinguishable "direction of traversal"). If these loops are "cut
out" along the broken lines which come to them, we obtain two loops, W\ and Uu:
i
i
/
i
i
vil^
\
/
wiT ^
\
(79 6)
'
which may be regarded as diagrams determining the amplitude for the process of
conversion of one set of photons (real or virtual) into another; the number N is the
sum of the numbers of initial and final photons. But the conservation of charge
parity forbids the conversion of an even number of photons into an odd number.
When N is odd, therefore, the sum of the expressions corresponding to the loops
(79.6) must be zero. The total contribution to the scattering amplitude from two
diagrams containing these loops as constituent parts is consequently zero also, a
result known as Furry's theorem (W. H. Furry, 1937).
Thus, in constructing the amplitude for a given process, we can ignore diagrams
containing loops with an odd number of vertices.
This cancellation of diagrams occurs for the following reason. A closed electron
loop corresponds to an expression (with given momenta ku k 2 ,..., kN of the
photon lines)
\d*p
tr[(yel)G(p)(ye2)G(p + k.)...],
(79.7)
where p, p + ki,... are the momenta of the electron lines (which are not completely
determined after the conservation laws have been applied at the vertices). Let the
operation of charge conjugation be applied to all matrices y*1 and G, replacing them
t This follows from the same arguments as were given at the end of §13 for a real photon,
concerning the electromagnetic interaction operator acting at each vertex.
316
Invariant Perturbation Theory
§79
by Ucy^Uc and U cGUc. The expression (79.7) is then unchanged, since the trace
of a product of matrices is unaffected by such a transformation. According to
(26.3),
UcVl/c = - f ,
(79.8)
and hence
Uc]G(p)Uc = = p ^ =
<5(-p).
(79.9)
But the replacement of G(p) by the transposed matrix with the sign of p changed is
clearly equivalent to a change in the direction of traversal of the loop, all the
arrows being reversed. Thus this transformation changes one loop into the other,
and there is a factor (-1) N from the change (79.8) at each vertex. Hence
n, = (-l) N n„,
(79.10)
i.e. the contributions from the two loops are the same when the number of vertices
is even, but equal and opposite when this number is odd.
CHAPTER IX
INTERACTION OF ELECTRONS
§80. Scattering of an electron in an external field
ELASTIC scattering of an electron in a constant external field is a simple process
which occurs even in the first approximation of perturbation theory (the first Born
approximation). It corresponds to a diagram with one vertex:
(80.1)
where p and p ' are the initial and final 4-momenta of the electron, and q = p ' - p.
Since the electron energy is conserved in scattering in a constant field (e = e'), we
have q = (0, q).t
The corresponding scattering amplitude is
Mp = -eü(p')[yAw(q)]u(p),
(80.2)
where A(e)(q) is the component of the spatial Fourier resolution of the external
field. The scattering cross-section is, according to (64.26),
da^Tjr-ilMjifdo'.
167T
(80.3)
For an electrostatic field, A(e) = (A ( o\0), and hence
M/,= -efi(p')7°M(P)Ai €) (q)
= - eu*(p')u{p)AVXq).
(80.4)
In the non-relativistic case, the bispinor amplitudes u(p) of the plane waves reduce
to the non-relativistic (two-component) amplitudes. For scattering without change
of polarization, u' = uy and u*u = 2m by the normalization condition chosen. Thus
da =
%u«>"
do\
t When there is an external field, such a diagram is, of course, not forbidden by the law of
conservation of 4-momentum, as the diagram (73.19) with a real photon was: q2, unlike the square of the
4-momentum of a real photon, need not be zero, and the component with the necessary q is
automatically taken from the Fourier integral which represents the external field.
317
§80
Interaction of Electrons
318
where U(q) = eAtfXq) is the Fourier component of the potential energy of the
electron in the field; this expression is the same as the,familiar Born's formula
(QM, (126.4)).
In the general relativistic case, the cross-section for scattering of unpolarized
electrons is obtained by averaging |M/j|2 over initial polarizations and summing over
final polarizations, i.e. by taking the quantity
\ 2 \Mfi\\
polar.
where the summation is over the spin directions of the initial and final electrons;
the factor 2 changes one of these summations into an averaging. According to the
rules given in §65, we obtain
1
2
y
>olar.
pol
|M/l|2 = 2trp'(7A" ) )p(7A ( ' ) *)
= i|A^(q)| 2 tr (m + yp')y\m
+ yp)y°.
To calculate the trace, we note that y°(yp)y° = (yp), where p = (e, - p ) , and
therefore
îtr (m + yp')y°(m + yp)y° = I tr (m + yp')(m + yp)
= m2 + p'p
= e 2 + m2 + p • p'
= 2e2-iq2.
Hence the cross-section is
^_£3A£!$9Ïe,(,_iU,
AIT
\
(80.5)
4e )
For a field due to a static distribution of charge with density p(r), we have
Af >(q) = 4îrp(q)/q2,
(80.6)
where p(q) is the Fourier transform of the distribution p(r) (the form factor). In
particular, for the Coulomb field of a point charge Ze we have p(q) = Ze. The
cross-section is then
da = d 0 . « W
(,_.£)
(80 .7)
(N. F. Mott, 1929). The quantity q2 = 4p2 sinMfl, where 6 is the scattering angle. The
angular dependence of the quantity preceding the parenthesis is therefore that of a
§80
Scattering of an Electron in an External Field
319
Rutherford cross-section:
.
. 4(Ze 2 )V
acrRu = do
4
q
= do ( Z ^ : ) 4 C "sin" 4 ^e;
(80.8)
in the non-relativistic limit, e2lp4-+ \lm2v4. Thust
da = daRu(\-v2
sin2 \0).
(80.9)
In the ultra-relativistic case, the angular distribution differs from the non-relativistic case in that there is much less backward scattering: as 0-*7r, do7da Ru -» m 2 /e 2 .
In the ultra-relativistic case, formula (80.7) gives for small-angle scattering
da
(80.10)
= ^^do>.
Although this formula has been derived in the Born approximation (i.e. on the
assumption that Ze2 < 1), it remains valid (for angles 0 ^ m/e) even if Ze1 ~ 1. This
can be seen by using the ultra-relativistic wave function if/^ (39.10), which is exact
as regards Ze2. This solution, which is valid in the range (39.2), of course remains
valid in the asymptotic range r-**>. Here
F ~ 1 + constant x
e
i{pr
~*'r\
a
' V F - 1 - cos d - 02 « 1,
e
so that the correction term remains small, as it should. The wave function of the
form eip r F, which has the same form as the non-relativistic function (with an obvious
change of parameters), has the same asymptotic expression, and therefore the
cross-section is given by the Rutherford formula.
To calculate the scattering cross-section for electrons with any polarization, we
could use the density matrix (29.13), following the general procedure. In this case,
however, the result can be more readily obtained by expressing the bispinor
amplitudes u(p') and w(p) in the form (23.9). Multiplication gives
M*(p')w(p) = w'*{e + tn +{e - m)(n' • <r)(n • v)}w,
or, using (33.5),
w*(p')"(p) = vv'*/vv,
(80.11)
t The difference between der and dorRU shown by this formula is specific to particles with spin i In
the scattering of particles with spin 0, if their motion in the electromagnetic field is described by the
wave equation, the result is der = daRu. At first sight it might appear puzzling that the factor expressing
this purely quantum effect does not contain h. However, it must be remembered that the
condition for the Born approximation to be valid (e~ltiv<£ 1) is contrary to the condition for quasiclassical motion in a Coulomb field, and therefore formula (80.9) cannot be taken to the classical limit.
320
Interaction of Electrons
§80
wheret
/ = A + Bvcr,
A = (e + m) + (e - m) cos 0,
B = - i(e - m) sin 0,
(80.12)
v = n x n'/sin 0.
The two-component quantity (three-dimensional spinor) w is the non-relativistic
spin wave function of the electron. The change to the partially polarized states is
therefore made by replacing the products wQw$ (where a, ß are spinor indices) by
the non-relativistic two-rowed density matrix paß. Thus we must put
\Mfi\2-+e2\A{0e)(q)\2 tr p(A - Bv • u)p'{A + Bv • or),
where
p H a + a-O,
p' = iO + «r-D.
and Ç, Ç' are the vectors of the initial polarization and the final polarization selected
by the detector. The result of calculating the trace is
where dor0 is the scattering cross-section for unpolarized electrons.
Expressing the quantity in the braces in (80.13) in the form {1 +£(/) • £'}, w e find
the polarization Ç(/) of the final electron itself, as opposed to the detected
polarization £ (see §65):$
£/> = (A2-lBl2)C + 2|Bp(vQv + 2A|B|vX^
(gQ> M )
We see that the scattered electrons are polarized only if the incident electrons are
polarized. This is a general property of the first Born approximation; cf. QM, §140.
In the non-relativistic case (e -» m), (80.14) gives ÇV) = Ç, i.e. the electron retains
its polarization on scattering, a natural consequence of the neglect of spin-orbit
interaction.,
In the opposite (ultra-relativistic) case, we have
A = e(l + cos 0),
B = - i€ sin 0,
in accordance with the general formula (38.2).
t The definition of / used here differs by a factor from that in §§37 and 38.
r Formula (80.14) corresponds to that derived in QM, §140, Problem 1, and is obtained from it by
taking A real and B imaginary.
§81
Scattering of Electrons and Positrons by an Electron
321
If the incident electron has a definite helicity (Ç = 2An, A = ±|), (80.14) gives
after a simple calculation
Ç(/) = 2An'.
Thus the electron remains helical after scattering, with the same value (À) of the
helicity.
This property occurs because, as already mentioned in §38, when the mass is
neglected Dirac's equation in the spinor representation separates into two independent equations for the functions £ and rj. The result has also a more general
significance, since the current
J = (£*£ + i7*T),
É*«rÊ-î|*«rt|),
and therefore the electromagnetic perturbation operator V = ejÀ, do not contain
mixed terms in £ and rj, and thus have no matrix elements for transitions between £
states and T) states. Hence it follows that, if an ultra-relativistic electron has a
definite helicity (i.e. if either tj or £ is non-zero), this helicity is conserved in
interaction processes in an approximation corresponding to completely neglecting
the electron mass.
§81. Scattering of electrons and positrons by an electron
Let us consider the scattering of an electron by an electron, in which two
electrons with 4-momenta pi,P2 collide and emerge with 4-momenta pî,P2- The
conservation of 4-momentum is expressed by
Pl + P2 = Pi + P2-
(81.1)
We shall use the kinematic invariants of §66, defined by
s =(Pi + P2)2 = 2(m2 + p,p2), 1
t = (Pi-p5) 2 = 2(m 2 - P l pi),
y
w=(Pi-P2r = 2(m 2 -p,p9,
j
s + t + u=4m 2 .
[fi 1.1)
The process in question is represented by the two Feynman diagrams (73.13),
(73.14), and its amplitude ist
M,, = 4 i r e 2 { y ( f i f r * ^
(81.3)
t This form of Mfi is in accordance with the general expression (70.5). In the first non-vanishing
approximation of perturbation theory, only one of the five invariant amplitudes is non-zero: / 3 (f,u) =
4<rre2lL
§81
Interaction of Electrons
322
According to the rules given in §65 for the states of initial and final particles
described by polarization density matrices pu p i , . . . , we make the change
|M/,| 2 ^ 16TTVlptr(p27>27") tr (piyMp,7»<) + jptr
(p\y*p2yv)tr(P27MPI7.)-
- ^ tr (p27MP27"pîyiiPi7J ~ JH tr (PI7* 1 P27 , 'P27MPI7,)]-
(81-4)
For the scattering of unpolarized electrons (without regard to their polarization
after scattering), we must put for all the density matrices p = i ( 7 p + m), and
multiply the result by 2 x 2 = 4 (averaging over the polarizations of the two initial
electrons, and summation over the polarizations of the two final electrons). The
scattering cross-section is given by formula (64.23), in which, by (64.15a), 72 =
\s(s - 4m2). It may be written
da = dt
Aire*
s(s - 4 m T,
) {fit,
u) + git, u) + fiu, t) + g(u, r)},
fit, u) = j^p tr [(7P2 + m)7"(7P2 + m)7"] tr [(7p{ + m)yiliypx + m)yv],
(81.5)
git, u) = - j ^ tr [iyp'2 + m)7M(7p2 + m)7"(7p{ + m)y^(ypl + m)7„].
In f(t, u) the traces are first calculated (using (22.9), (22.10)), followed by summation over /x and i>;t in g(t, u) the summation over p, and v is taken first, using
formulae (22.6). The result is
fit, u) = p [(p,P2)2 + iPxPif + 2m2(m2
~piP[)],
git, u) = — ( p , p 2 - 2m2)(p,p2),
or, in terms of the invariants (81.2),
/(r,u) = p [ i ( s 2 + u 2 ) + 4 m 2 ( f - m 2 ) ] ,
(81.6)
git, u) = giu, t) = ^ ils - m2)i\s - 3m2).
t The following formula is given for future reference:
4 tr (yp\ + m)y*(yp2 + m)yy = g*v(m2- p\p2) + ptpi + pïpÇ.
§81
Scattering of Electrons and Positrons by an Electron
323
Thus the cross-section is
da = T\ ^ 4 ^ {p tiU2 + u2) + Am\t - m2)] 4+ p[i(s 2 + t2) + 4m 2 ( M -m 2 )] + ^ ( ! 5 - m 2 ) ( i s - 3 m 2 ) } ,
(81.7)
where re = e2/m.
In the centre-of-mass system, we have
= 4e2,
t = - 4p2 sin2|0,
u = - 4p2 cos2 |e,
= - 2p2 d cos 0 = (P2/TT) do,
-dt
(81.8)
where |p| and e are the magnitude of the momentum and energy of the electrons,
which are unchanged in the scattering, and 0 is the scattering angle. In the
non-relativistic case (e Ä m),t we obtain for the cross-section
2
da = r,
um4 dt /1 , 1
5— ( Ti. + - î
pz
,2 \2
\r
u
_
1\
7"
tu)
W V \sinM0 + cos4\B
sin2 £0 cos 2 \e) d°
a x2
4(l + 3cos 2 0)
do (non-relativistic),
\mtr/
sin40
(81.9)
where v = 2p/m is the relative velocity of the electrons, in accordance with the
non-relativistic theory (see QM, §137). In the general case of arbitrary velocities,
formula (81.7) with the substitution (81.8) can easily be brought to the form
r
da - r]mY.+4pV
[Lsin
- 440^ -snr0
^ + M
) (l
4 2^0/J
1 do
\e 2 r+ .pv
V + -sm
(81.10)
(C. M0ller, 1932). In the ultra-relativistic case (p2= e2),
do- = r\ ^V ( 3 t c . ° L g ) do (ultra-relativistic).
e
4 sin 0
(81.11)
In the laboratory system, where one of the electrons (say electron 2) is at rest
before the collision, the cross-section can be expressed in terms of the quantity
m
m
'
t The velocity v is assumed small (v <^ 1) but such that the condition for perturbation theory to be
applicable is still satisfied: el\v ( = e2lhv)<€ 1.
§81
Interaction of Electrons
324
the energy (in units of m) transferred by the incident electron (electron 1) to
electron 2.t The invariants are
s=2m(m + £\),
f=-2m2A,
u = -2m(ei - m - mA).
(81.13)
Substitution of these expressions in (81.7) gives the following formula for the
energy distribution of the secondary electrons (called 8 electrons) formed in the
scattering of fast primary electrons:
A
<> 2 da [ ( y - l ) V
2 7 2 + 2 y - l , %\
^ =27rre73T(Ai(7_1_A)i-A(7_/_A)+l),
(81.14)
where 7 = exlm. The quantities m A and m (y - 1 - A) are the kinetic energies of the
two electrons after the collision; the identity of the two particles is shown here by
the symmetry of the formula with respect to these quantities. If the term "recoil
electron" is arbitrarily applied to the electron with the smaller energy, A takes
values from 0 to 2(7 - 1). When A is small, formula (81.14) becomes
A
o -2 y2 <*A 2-rrr] dA A
da = liTfe -^ZTl^T = "^f"^
A ^ y - 1.
(81.15)
This formula, if expressed in terms of the velocity of the incident electron
(ui = |pi|/ei), retains the same form in the non-relativistic case. Its form is naturally,
therefore, the same as that of the result given by the non-relativistic theory (cf.
QM, (148.17)).
Let us now consider the scattering of a positron by an electron (H. J. Bhabha,
1936). This is another cross-channel of the same general reaction as the electronelectron scattering. If p_, p+ are the initial momenta of the electron and positron, and
P-yPl their final momenta, the change from one case to the other is made by the
substitutions
Pi->-p+>
P2-*P-,
Pî-*-p+.
P2-+P--
The kinematic invariants (81.2) become
s=(p.-pl)\
r = (p,-p;) 2 ,
n=(p_ + p+)2.
(81.16)
If ee scattering is the s channel, le scattering is the u channel of the reaction. The
square of the scattering amplitude, expressed in terms of s, t and w, remains as
before; in the denominator of (81.5), s must be replaced by u. Thus the crosssection for scattering of a positron by an electron is, instead of (81.7),
da = r\ ? ™ 2 A (72 [fa 2 + u2) + Am\t - m2)] +
+ pfl(s 2 + t2) + 4m2(M-m2)] + ^( 1 2 s-m 2 )(is-3m 2 )}.
(81.17)
t The kinematic relations for elastic collisions in various frames of reference are given in Fields,
§13.
§81
325
Scattering of Electrons and Positrons by an Electron
In the centre-of-mass system, the values of the invariants s, t, u differ from
(81.8) by the interchange of s and u:
5 = -4p 2 cosM0,
f = -4p 2 sinMö,
w=4e2.
(81.18)
In the non-relativistic limit, formula (81.17) reduces to Rutherford's formula:
(-^-!l
2 v 2
J
(non-relativistic),
-^TT^
(81.19)
mir/ sin iQ
where v = 2p/m. This comes from the first term in the braces in (81.17), which
originates from the "scatterings-type diagram (see §73). The contributions from the
"annihilation" diagram (the second term in (81.17)) and from its interference with
the scattering diagram (the third term) vanish in the non-relativistic limit.t
In the general case of arbitrary velocities, the contributions of all three terms in
(81.17) are of the same order of magnitude; the first term predominates only at
small angles, because of the factor t~2 « sin~458. Combining like terms, we can write
the cross-section for scattering of a positron by an electron (in the centre-of-mass
system) in the form
. r] m 2 f(e 2 + p2)2 1
f.4|
16 B I p
sin 20
der = do — —j {
+
8e4-m4
1 ,
Y~1
■ 1 In +
p e sin £0
i2£^-iä!^ sin »J, + S $ , i n M , ] .
e
e
e
J
(81.20)
The symmetry with respect to 0 and TT - 0 which is typical of scattering
involving identical particles does not, of course, occur when a positron is scattered
by an electron. In the ultra-relativistic limit, the expression (81.20) differs from the
electron-electron cross-section only by the factor cos 4 20:
ddee = cos 4 20 dcre€ (ultra-relativistic).
(81.21)
In the laboratory system, where one of the particles (say the electron) is at rest
before the collision, we again define
A =
£ ± ^
m
£i_H£L
m
=
(8122)
i.e. the energy transferred by the positron to the electron. As in (81.13), we now
have
s = - 2m(e+ - m - m A),
t = - 2m2A,
u = 2m(m + e+).
Substitution of these expressions in (81.17) easily gives the following formula for
t See (83.4) and (83.20) for the passage to the non-relativistic limit in the scattering and annihilation
terms in the scattering amplitude. The latter term contains a factor l/c\ and therefore tends to zero.
326
§81
Interaction of Electrons
the energy distribution of the secondary electrons:
2
da
fy 2
^-2^73i{^
2y 2 + 4 7 + l 1 3 7 2 + 6 7 + 4
V^T~I+ (7 + i)5
27
1
21
(7Tï7A + (7?l7 A r
(81.23)
where y = e+/m; A varies from 0 to y - 1. When A <è y - 1, (81.23) leads to the same
formula (81.15) as for electron scattering.
The polarization effects in the scattering of electrons or positrons are calculated
by the general rules given in §65. In all but special cases, the resulting formulae are
lengthy. Here we shall give only some comments.t
In the approximation considered (the first non-vanishing approximation of
perturbation theory), the cross-section contains no terms linear in the polarization
vectors of the initial or final particles. As in the non-relativistic theory (QM, §140),
such terms are forbidden in consequence of the requirement for the scattering
matrix to be Hermitian. The scattering cross-section is therefore unchanged if only
one of the colliding particles is polarized; and unpolarized particles do not become
polarized as a result of scattering.
The same conditions prohibit correlation terms in the cross-section which
contain the products of the polarizations of three of the particles (initial and final)
concerned in the process. The cross-section does, however, contain double and
quadruple correlation terms. In the scattering of unlike particles (electron and
positron, electron and muon), these terms vanish in the non-relativistic limit, since
there is no spin-orbit interaction. In collisions of like particles, however, there are
correlation terms even in the non-relativistic case, because of exchange effects.
PROBLEMS
PROBLEM 1. Determine the scattering cross-section for polarized electrons in the non-relativistic
case.
SOLUTION. In the non-relativistic case, the bispinor amplitudes in the standard representation have
two components, and the density matrices are the two-rowed matrices (29.20). In the scattering
amplitude (81.3), the only non-zero terms are those with \i = v = 0, which contain matrices y° that are
diagonal (in the standard representation). Instead of (81.4) we have
-16irV.4m4.4[i!+iï-.i(l+CI.b)],
the summation being over the polarizations of the final electrons. Hence the scattering cross-section is
d<r=d<ro 1
( -i+w^ | -4
where 0 is the scattering angle in the centre-of-mass system and dao the scattering cross-section (81.9)
for unpolarized particles. For completely polarized electrons, this formula is the same as the result in
t For further details see the paper by W. H. McMaster, Reviews of Modern Physics 33, 8, 1961.
§81
Scattering of Electrons and Positrons by an Electron
327
QM, §137, Problem, with |£i| = |^| = 1, Çi'£2 = cosa, where a is the angle between the directions of
polarization of the electrons.
For the scattering of positrons by electrons, there is no dependence on the polarization in this
approximation (da = dao); this is easily seen by noticing that, in the non-relativistic limit, different pairs
of components are non-zero in the electron and positron amplitudes up and a p .
PROBLEM 2. In the non-relativistic case, determine the polarization of scattered electrons in the
scattering of an unpolarized beam by a polarized target.
SOLUTION. We can calculate the scattering cross-section for given initial polarization £ and
detected final polarization Ci; only the polarization of onefinalelectron is detected. By the same method
as in Problem 1, we find
A
^A \\
Y> Y
2C0S
da = *d<ro[ !-<;,•&
9(1 "COS 0)1
,+ 3cos*e
J.
The polarization vector of the scattered electron is therefore
rU)
Cl = _ f c
2cos 6(1 -cos 0)
l + scos'e •
PROBLEM 3. In the non-relativistic case, determine the probability of spin reversal of a completely
polarized electron scattered by an unpolarized electron.
SOLUTION. We similarly find the cross-section for given polarizations £i and £i:
fi -ur r 2 cos 0(1 +cos 0)1
der-2dcro[l+<;,•<;,
, + 3 c o s 2 e J.
A~-±A~
Putting \\ • £i = - 1 , we then find the probability of reversal of the spin direction:
da _ (1-cosfl) 2
d<7o 2(1 + 3 cos2 6Y
PROBLEM 4. Determine the ratio of the scattering cross-sections for helical electrons with parallel
and antiparallel spins, in the ultra-relativistic case.
SOLUTION. In (81.4) we must put, according to (29.22),
Pi =5(yp.)(l - 2A,y 5 ),
p2 = kypi)d
pi = 27pi»
p2 = 27P2,
-2À2Y 5 ),
where Ai, À2 = ±2. The traces are calculated by the formulae given in §22; in particular,
tr ( y ^ a ) ? 1 ^ ) - / " ) tr (y\yc)yAyd)y.)
= i\e>»k»apbk)(e^cad*)
= 2(SS5Î - 6prba)apbxcadr
= 2(acKM)~2(ad)(bc).
The result is
Since the momenta of the colliding electrons (in the centre-of-mass system) are opposite, antiparallel
spins correspond to like helicities (Ai - A 2), and parallel spins to unlike helicities (Ai = - A2). Substituting s, f,
u from (81.8) (with p 2 ~ e2), we find the required ratio:
da 1\ld*U= s(l + 6 cos2 0 + cos4 0).
This has its least value, J, when 0 = |ir.
(1)
§81
Interaction of Electrons
328
P R O B L E M 5. The same as Problem 4, but for the scattering of positrons by electrons.
S O L U T I O N . In this case we have to calculate, instead of (81.4),
|M/i|2 -♦ 1 6 i r V | p tr (p' - y V - 7") tr (p+7^P + 7*) ~ ^
t r
(P-y W P + T Ü P W*) + • • • p
the remaining terms are obtained by interchanging p+ and p-. The density matrices are
P- = kyp-)(l " 2A-73),
p . = hyp+)(\ + 2A.7 5 ),
pi = l7PH
P- = 27P-»
where A+, A- = ± 2 (and for the positron, as for the electron, A+ = 2 denotes that the spin is parallel to the
momentum). The result of the calculation is
,2
-7-«
dt
1 ..2
2 ,
2— +
\
V
#
2
7— +
u2
- 4 A + A-
tu I
(s2-u2
3— +
\
V
s2-t2
2s2\
u1
tu 1
7—+
.
Hence we find for the ratio of cross-sections the same value as formula (1), Problem 4.
P R O B L E M 6. Determine the cross-section for scattering of muons by electrons.
S O L U T I O N . The process is described by the one diagram (73.17). Instead of (81.5) we have
ire4dt
AireAdt
a;/(MO,
[s-(m + /ir][s-(m-/ir]
(1)
1
/ ( ' . M) = ^ p tr [(7p; + ii)yk(yp„ + M)7*] tr [<7Pi + m)?A(7Pe + m)yv]\
p«, pM and pi, p i are the initial and final 4-momenta of the electron and the muon, and m, /1 their masses.
The invariants are
« =(Pr + P*)2 = m 2 + /A2 + 2p«p,i;
1 =(P*-Pi)2= 2(m2-pep;)
= 2(/x 2 ~p M p;),
2
w =(P*-P^) = n\2 +
ii2-2pepl>
2
s + f+ u = 2(m + /12).
The result of the calculation is
/ = p {(P<P*)2 + (PePi)2 + km 2 + ,i 2 )t}
= p{i(52-fu2)4(m2 + fi2)(2t-m2-/x2)}.
(2)
Formulae (1) and (2) give the solution of the problem. In the centre-of-mass system,
e4 do
da = 8(g, + €jysinMfl
I(M|i +
^
* (€C€M + P* C ° S 0 ) ' ~ 2 ( m ' + ^
sif|2
2
^
where do = 2ir sin 0do; £«, eM are the energies of the electron and the muon; p = s]- m 2 - el~ p2. If
p2<*/x2, we return to formula (80.9) for scattering by a fixed centre of Coulomb force. In the
ultra-relativistic case (p 2 > jx2),
.
e4 1 + cos 4 ifl .
4 i „ — do.
8p sin he
fl<r = -z—9 — .
§81
Scattering of Electrons and Positrons by an Electron
329
In the laboratory system (where the electron is at rest before the collision),
„2\2
,
,
2
A
.
(ezV daA L
ml A A
2 A
dc7 = 27r — -T77ll""VaT—
+ X-7A 1,
2 2
M
Amax 2e* /'
\m) v A \
where e» is the energy of the incident muon, and vM = p^/e^ is its velocity; m A = ei - m = EM - £^ is the
energy of the recoil electron; and
A
A
2
P*
" M " m 2 + M! + 2me|1
is the maximum value of A.
PROBLEM 7. Determine the ratio of cross-sections for the mutual scattering of helical electrons and
muons with parallel and with antiparallel spins, in the ultra-relativistic case (e^ > /i, ee>m).
SOLUTION.t As in Problem 4, we find
daîî/daîi = c o s 4 k
where 0 is the scattering angle in the centre-of-mass system.
PROBLEM 8. Determine the cross-section for the conversion of an electron pair into a muon pair (V.
B. Berestetskiï and I. Ya. Pomeranchuk, 1955).
SOLUTION. This is another cross-channel of the reaction to which fie scattering belongs. In this
channel,
t = (p, + p,) 2 ,
s = (Pe - p»)\
w = (p, - pM)2,
where p„ pt are the 4-momenta of the electron and the positron, and p^, pM those of the muon and the
antimuon. The reaction threshold corresponds to an energy 2\x. of the electron pair (in the
centre-of-mass system), so that we must have t > 4 j x \ In the laboratory system, where the electron is at
rest before the collision and the positron has energy e+,
t = 2m(e+ + m) « 2me+,
so that we must have e+ > e,, where the threshold energy et=2fi 2 /rn; here and below, all approximations allowed by the inequality /x > m are made.
The differential cross-section is (instead of formulae (1) and (2), Problem 6)
.
4ire4ds
~ W
£t4>
.
y [{(S2 + M2) + l^t
- /].
For given t, the quantity s takes values between the limits determined by the equations su = pi4,
s + t + u == 2fi2, i.e.
^ - it - W[t(t - A112)] « s « ti2 - \t + W[t(t - V ) ] .
An elementary integration gives
'-^T'(-¥)V¥)-
'•-«-'-
in the laboratory system, t = 2me+. This formula is not valid in the immediate neighbourhood of the
threshold: when e + - e , - j x e \ the muons formed cannot be regarded as free particles; when the
Coulomb interaction between them is taken into account, the cross-section tends not to zero but to a
constant value as e+->et (see QM, §147).
The cross-section (1) has its maximum value when £+ = 1.7c Its maximum value is about 20 times
less than the cross-section for two-photon annihilation at the same energy.
t Another method of solving this problem is given at the end of §144.
330
Interaction of Electrons
§82
§ 82. Ionization losses of fast particles
Let us consider a collision between a fast relativistic particle and an atom,
accompanied by excitation or ionization of the atom. Such inelastic collisions for the
non-relativistic case have been discussed in QM, §§148-150; here we shall derive
the relativistic generalization of the formulae obtained in QM (H. A. Bethe, 1933).
The velocity of the particle incident on the atom is assumed large in comparison
with those of the atomic electrons; thus we always assume that Za < 1, i.e. that the
atomic number is fairly small. This condition ensures the applicability of the Born
approximation to the process under consideration. The solution of the problem
depends to some extent on whether the fast particle is light (electron or positron) or
heavy (meson, proton, a-particle, etc.). The second case is the simpler, and will be
taken here.
Let p = (e, p) and p' = (e', p') be the initial andfinalmomenta of the fast particle
in the laboratory system, where the atom is at rest before the collision; the
difference q = p'-p gives the energy and momentum transferred to the atom by
the particle. The range of possible momentum transfers can be divided into two
parts:
(I) q2/m < m,
(II) q2/m > I,
(82.1)
where m is the electron mass and I is a mean energy of the atom, its ionization
potential. The two parts overlap with I < q2/m < m ; this allows an exact joining of
the results for each part separately. The momentum transfer will be said to be
respectively small and large in the two parts of the range.
SMALL MOMENTUM TRANSFER
In this range, the atomic electrons may be regarded as non-relativistic in both
the initial and the final state of the atom.
The amplitude of the process is
Mp} = e2JM-q)JvpP(q)D»v(q),
(82.2)
where Jn0 is the transition 4-current of the atom from the initial state (0) to the final
state (n), and Jpp is the transition 4-current of the fast particle; these currents here
replace ü'yuy which would occur, for example, in the scattering amplitude of two
"elementary" particles such as the electron and muon in (73.17); cf. also (139.3).
The transition currents are taken in the momentum representation (see (43.11)). The
cross-section of the process in the laboratory system is
d<rn = 2*6(6 - e ' - <o„0) [M^l2 2\p\?2e'(2<ir)i'
(823)
where a)n0 = En- E0 is the transition frequency between the states of the atom. The
final state may belong to either the discrete or the continuous spectrum, corresponding to excitation and ionization of the atom respectively. In the law of
§82
331
Ionization Losses of Fast Particles
conservation of energy (represented by the delta function in (82.3)), the recoil
energy of the atom is neglected; this is certainly permissible for a small momentum
transfer.
The photon propagator is here conveniently taken in the gauge (76.14), in which
only its space components are non-zero:
u)
q \
(o I
Then only the space components of the transition currents in (82.2) are likewise
needed.
The atomic transition current J„o(q) is here the Fourier component of the usual
non-relativistic expression:
J«o(q) = ~
j « H q ' r 0Mty* - *!>*nV*l>o) d'x,
(82.5)
where if/o and ipn are the atomic wave functions; for simplicity, the sign of
summation over the electrons in the atom will be omitted henceforward, i.e. the
formulae will be written as if there were only one electron in the atom. Integrating
by parts in the first term, we can rewrite this expression as a matrix element:
Ji.o(q) = l ( v « " i , w r + « " l v r v U
(82.6)
where v = -(i/m)V is the electron velocity operator.
Since the momentum lost by the scattered particle is relatively small (|q| <? |p|),
the transition current for this particle can be replaced simply by the diagonal
element
Jpp(0) = 2pz,
(82.7)
corresponding to classical motion in a straight line (cf. (99.5)); a factor z is included
to take account of a possible difference between the particle charge ze and the
electron charge e.
Since q is small, so is the angle of deviation d of the particle. The longitudinal
and transverse components of q (relative to p) are
-<ïl « (dplde)ü)n0 = (onolvy
qx ~ |p|#,
(82.8)
and hence q • p « - 6a>„o.
Substitution of (82.4M82.8) in (82.2) gives
M(n)=
_±!IZL/n | ^ ( q . v ^ - | q r + e - l q r q - v ) + (p-ve"lqr+€-,qrp-v)|oy
In the first term, since
q - v / + / q - v = 2i/,
332
§82
Interaction of Electrons
w h e r e f = e~i<K r (see QM, §149), the matrix element of this operator is 2i(/)„ 0 =
2ü)„o/rtO- In the second term, e~'q r can be taken as unity, since q is small. Then
Mp
=
_8^{e(e-iqr)/i0_
/p
. rnOÙJno}
The squared modulus is
= 6 4 ^ ( y 2 ) 2 {e2\(e-^rU2
W?
+ 2(q • rn0)(p • rnQ)e<ono + (p • rn0)Wn0};
(82.9)
here, in the second term, we have put e",q r % 1 — iq • r, but this cannot be done in
the first term, for a reason explained in the next footnote but two.
The energy lost by the fast particle in its inelastic collisions with atomst is given
by
K = 2J^odan=j^2J^o\M^\2do\
(82.10)
where the summation is over all possible final states of the atom, and the
integration is over the directions of the scattered particle; this quantity will be
called the effective retardation (KIS is known as the energy loss cross-section).
The integration in (82.10) can be carried out in two stages, by averaging over the
azimuth of the direction of p' relative to p and then integrating over do' ** 2ird d#,
where d is the small scattering angle. The first stage makes the change
q • r„o-» qjXno = - (o>„0/u)Xno,
where xn0 is the matrix element of one of the Cartesian coordinates of the atomic
electrons.t The integration over d# can be replaced by integration with respect to
q2, since
- ql = - ü)no + q ^ - <i>no + —r + P # ~
5— + P #
(82.11)
and therefore 2 # d # = d|q2|/p2, M being the mass of the fast particle. The result is
K
= 4.(»V 2 / {K«-"- -).O|J «? - «A W» (*£ + J,)} gjf.
(82.12)
The lower limit of the integration with respect to q2 is
k 2 U = (M 2 /pVno.
(82.13)
t These are often called ionization losses, although they are due to excitation as well as ionization of
atoms.
t It does not matter which coordinate: after the summation over the directions of the angular
momentum of the atom in thefinalstate, which is implied below, the matrix element does not depend on
the direction of the x-axis.
§82
Ionization Losses of Fast Particles
333
As the upper limit, we take a value |q2|t such that
(82.14)
K\q\lm<m,
which thus lies in the overlap of ranges I and II (82.1).
The integration and summation in (82.12) are carried out in the same way as was
done for the non-relativistic case in QM, §149. The entire range of integration is
divided into two parts: (a) from |q2|mi„ to \q\ and (b) from |q2|0 to |q2|i, where |q2|0 is
such that
JM/|pHV|q 2 |o«ma;
(82.15)
the quantity ma on the right is of the order of the momenta of the atomic electrons. In
part (a) we can put e~,qT = 1 - iq • r, and the contribution of this part to K is
l<i2lo
47r(ze2)22 J
{pû) n o|x„o| 2 i^--rû>noko| 2 i^p]d|q 2 |
l<J2lmin
4-7T(ze2)2 ^
,
|2 r.
|q2lop2
21
In the second term, the integration can be extended to infinity.
The summation is carried out by means of the formula
2û>n0|xn0|2 = Z/2m,
n
(82.16)
where Z is the number of electrons in the atom; see QM, (149.10). The result can
be written
where I is an average energy of the atom, defined by
2
log
J=J!
w
no|x„ 0 | 2 lOg W„o
2 ^nol^nol
n
= ^ L S^oW 2 loga> B0 .
(82.18)
In part (b), (82.11) shows that |q2| * p2#2, i.e. |q2| is independent of the particular
final state n of the atom, and the limits of integration are also independent of n.
The summation over n can therefore be taken inside the integral in (82.12). In the
334
§82
Interaction of Electrons
first term, the summation is carried out by means of the formula
2 |(^ iqr ) n o| 2 w n0 = (Z/2m)q2
(82.19)
n
(see QM, (149.5)), and the integral ist
mvz
1^ |o
The integral of the second term in (82.12) over this part of the range makes a
negligible contribution to K.
Adding the last formula to (82.17), we find as the contribution to K from the
whole range of small momentum transfers
LARGE MOMENTUM TRANSFER
Let us now consider collisions with a momentum transfer which is large
compared with the momentum of the atomic electrons (q2>ml).
Here we can
evidently neglect the binding of the electrons in the atom, regarding them as free.
Accordingly, the collision between the fast particle and the atom may be taken as
an elastic collision with each of the Z atomic electrons. Because of the high speed
of the particle, the atomic electrons may be assumed to be originally at rest.
Let m A denote the energy transferred from the fast particle to an atomic
electron, and let dcrA be the cross-section for elastic scattering with this energy
transfer. The differential effective retardation by the whole atom is then
dK = ZmAdaA.
(82.21)
The maximum energy which can be transferred to an electron at rest by the impact
of a particle with mass M > m is
A
mûmax
2
2 m p2
2
2m p 2
2
" m + M + 2ms ~ M + 2me'
where e and p are the energy and momentum of the incident particle; see Fields,
(13.13). We shall also suppose that the energy e, though ultra-relativistic (e > M), is
nevertheless such that
e<M2lm.
t The logarithmic divergence of the integral at the upper limit is the reason why e
expanded in powers of q in the first term in (82.12).
(82.22)
,q r
' cannot be
§82
lonization Losses of Fast Particles
335
Then even the maximum energy transfer
m Amax - 2mp 2 /M 2 = 2mv2y2
(y = ejM = 1/V(1 - i;2))
(82.23)
is small in comparison with the initial kinetic energy of the incident particle
(mAmax < e - M). Correspondingly, the momentum transfer q is always small in
comparison with the initial momentum p of the particle. This enables us to regard
the motion of the particle as being unaltered by the collision, i.e. the particle itself
as having infinite mass. Then the scattering, cross-section is found by simply
transforming the cross-section (80.7) for electron scattering by a fixed centre to the
laboratory system, in which the electron is initially at rest. This is easily done by
noting that, in the approximation used,
- q2 « q2 = 4p2 sin2 id,
do' = Trd|q2|/p2,
and the relative velocity is v in both systems. Formula (80.7) becomes
vl
V 4mV/ k I
The energy transfer A is expressed in terms of the same invariant q2: - q2 = 2m2A,
and therefore t
.
2TT(ze2)2 / ,
2
A \dA
.
™*„* I 1 " " A—)~ÄT(82.24)
m v \
Amax/ A
The contribution to the effective retardation from this range of momentum
transfers is found by integrating (82.21) from the limit |q2|, defined above-» to
!q2|max = 2m2Amax. The result is
dcr
* =
2ir(zc 2 ) 2 Z/, 2Amaxm2
: T oe
m
i
^ ~\ ~W\ — r
(82 25)
-
Finally, adding the contributions (82.20) and (82.25), we have for the total
ionization losses by the fast heavy particle (in ordinary units)
=
4nZ(ze2)2
^-rg/d-.V)"?)-
(82 26)
-
In the non-relativistic case, this reduces to QM, (150.10):
K=
4irZ{ze2)2.
2mv2
T - ^ log —=—,
mv
I
(82.27)
t In this expression, of course, no account is taken of the specific effects of strong interactions when
the heavy particle is a hadron. Such effects (corresponding to the hadron form factor) are, however,
important only when |<j2| « ]/Af2, and such momentum transfers are excluded by the condition (82.22).
336
§83
Interaction of Electrons
and in the ultra-relativistic case
47rZ(ze2)2 /,
2mc2
,\
(82.28)
The retardation depends only on the velocity of the fast particle, and not on its
mass. The decrease of the retardation with increasing velocity (82.27) changes to a
slow (logarithmic) increase in the ultra-relativistic range.
PROBLEMS
PROBLEM 1. Determine the effective retardation of a relativistic electron.
SOLUTION. The contribution of the range of small momentum transfers is again given by (82.20).
For large momentum transfers, (82.24) must be replaced by (81.14), which includes exchange, effects.
Integrating A • dan, over dA from |q2|i/2m2 to 2(7 - 1) and adding to (82.20), we get
2-nZe' f,
mV-l)(y-l)c4
(2
1\.
„ , 1 , (y-1) 2 ]
,n
where
y = (l-t>V)-'.
In the non-relativistic case, we get the formula given in QM, §149, Problem, and in the ultra-relativistic
case (7 > 1)
2-irZe*/, m V y ' n
,~
PROBLEM 2. The same as Problem 1, but for a positron.
SOLUTION. For dai in the range of large momentum transfers, the expression (81.23) must be used,
the upper limit for A being y - 1. In the ultra-relativistic case, the result is
2irZe*(t
2m 2 cV
23\
§83. Brett's equation
In classical electrodynamics, a system of interacting particles can be described
by means of a Lagrangian function depending only on the coordinates and
velocities of the particles, and correct as far as terms ~ l / c 2 (Fields, §65). This is
because radiation appears only as an effect of order 1/c3.
In the quantum theory, this corresponds to the possibility of describing the
system by Schrödinger's equation including second-order terms. For an electron
moving in an external electromagnetic field such an equation has been derived in
§33. We shall now derive a similar equation describing a system of interacting
particles.
We start from the relativistic expression for the scattering amplitude for two
particles. In the non-relativistic approximation, this becomes the usual Born
BreiVs Equation
§83
337
amplitude, proportional to the Fourier component of the potential of electrostatic
interaction of two charges. By calculating the amplitude as far as second-order
terms, we can establish the form of the corresponding potential, taking account of
terms ~ l / c \
Let usfirstassume that the two particles are different, with masses mi and m2 (say
an electron and muon): Then the scattering process is represented by a single diagram,
^
Pi > ^
p
i
w
The corresponding amplitude is
Mfi =
e\û\y»ul)Dllv(q)(ûWuî),
q = pi — pi = Pi-p'ï,
(83.1)
here it is assumed that the charges have the same sign. If the signs are different, e2
becomes -e2.
The subsequent calculations are considerably simplified if the photon propagator DM„ is chosen not in the ordinary gauge but in the Coulomb gauge (76.12),
(76.13):t
DM=-*£,
Do,=0,
O» = q ! . j v - i o ( 8 » - ^ ) -
(832)
Then the scattering amplitude is
Mfl = * 2 { ( ö i Y % ) ( ö * A ^
(83.3)
If all terms in 1/c are neglected, the second term in the braces vanishes, and the
first term gives
Mfi = -2m, • 2m2(vvi0)'*vvi0))(vvr*vv^0))Lr(q),
(83.4)
U(q) = 4Tre2lq\
(83.5)
where
and w\°\ w{2°\... denote the spinor (two-component) amplitudes of the non-relativistic plane waves, as defined in §23. The function U(q) is the Fourier component
of the Coulomb interaction potential energy, U(r) = e2lr.
In the next approximation (with respect to 1/c), the "Schrödinger" wave
function of the free particle 4>Sch (normalized by the integral / |4>Sch|2 d3x) satisfies
+ In this section, factors of c will be written in all formulae, and factors of h in the final formulae.
338
§83
Interaction of Electrons
the equation
IJ(0)
_ /_
( ±
H
VsehM
e - m c 2 ) 42\>i>Sch,
s
(83.6)
Zm
8m c
which includes the next term in the expansion of the relativistic expression for the
kinetic energy. The (spinor) amplitude of this plane wave will be denoted by w,
which tends to w(0) as 1/c ->0. The required scattering amplitude must be expressed
in terms of these amplitudes, in order to determine from its form the "Schrödinger"
interaction potential of the particles in the approximation considered.
In accordance with formula (33.11), the bispinor amplitude u of the free particle
can be expressed in terms of the "Schrödinger" amplitude w, with sufficient
accuracy, by
u
w(2
This formula gives
m )
(<'-^/"'c;>n
\ (or • p/2mc)w /
,83.7)
v
'
öiy0Ui = u\*ux
l
8mjc
4m\c
J
ü[yu\ = u[*au\
= (l/c)w;*{a(<r • pi) + (a • p[)a}w{
= (l/c)w{*{icr x q + 2pi + q}wi,
where q = pi - pi = p2 - pi. The corresponding expressions for (Ü2Y°M2) and (Ö27U2)
differ in that the suffix 1 is replaced by 2 and q by - q .
We now substitute these expressions in (83.3). Since the product (ü[yu\)(Ü2yu2)
already contains the factor 1/c2, the term a>2lc2 in the denominator of Dik may be
neglected. The scattering amplitude is then
Mfi = -2mi • 2m 2 (wi*W2*l/(p,,p 2 ,q)w, w2),
where
in
\ A „2P
1
1 , (q • Pi)(q * P2)
l/(p„P2,q) = 4Tre ^ - ^ - ^ - h
* £„£/
P1P2
mim2q2
îarqxp!
4m2c2q2
uri-gxp,
2m\m2c q2
ia 2 q X p 2 [
4m 2 c q
i<r ; -qxp, + ( a r q ) ( g , - q ) _ « r ^ l
4mim?cV
4mim 2 c J
2mjm2c q
(83.8)
§83
339
Breit's Equation
the suffixes 1 and 2 to the Pauli matrices indicate the spinor indices on which they
act, <7i acting on w\ and a 2 on vv2.
The function l/(pi,p2, q) is the particle interaction operator in the momentum
representation. It is then related to the operator Û(pi,p 2 , r) in the coordinate
representation by
f e"i('i-r'+'*
r2)
Ü(p,,p 2 ,r)e i(Pl r'+P2 r2) d3Xi d3x2
=
+ P2 ~ PÎ - pQUipu P2, q).
(2TT) 3 5( PI
(83.10)
If the operator Û is simply a function U(r)(r = n - r 2 ) , then U(p,,p 2 , q) is
independent of pi and p2, and formula (83.10) reduces to the usual definition of the
Fourier component:
j V ' q rU(r)d3x
= U(q).
Hence it is clear that, to find L7(pi, p2, r), we must calculate the integral
|e i < , r Lr( P l ,p 2 ,q)dV(27r) 3 ,
and then replace pi and p2 by the operators pi = -iV\, p2 = -iV 2 , writing these to the
right of all the other factors.
The required integrals are found by differentiation of the formula
f
iar 47T
d3q
1
For example, taking the gradient gives
f^-^TMV-IV1-1?.
qz
J
r r
(2TT)
Next, with a and b constant vectors, we have
J
( 5 ? 2,Va dr))e VD a q / 7 ( 5 ? '
£
?
the resulting integral, after integration by parts, reduces to (83.12), so that
f4ir(a-qXb-q) . i , , A
J
q4
€
= i(
(2^r7 *
-.mJLLE
' r
(83.12)
340
§83
Interaction of Electrons
Finally,
Mir(«-yb- q ) e „. r d^
J
q
1
r
(2ir)
In expanding the derivatives, it must be remembered that these expressions include
the delta function Ô(r). To separate this, we note that, after averaging over the
directions of r,
- (a • V)(b • V)^ = - Ka • b)A \ = ^f (a • b)S(r).
Now expanding the derivatives in the usual manner, we find
f 47r(a • q)(b • q)
J
J
^*
iq r
q
d3 q
1f
^-?(a.b-3
- (a • r)(b • r)l 4TT
p
] + Ta-bÔ(r); (83.14)
on averaging over the directions of r, the first term vanishes and only the
delta-function term remains.
Using these formulae, we obtain the following final expression for the particle
interaction operator:
fT„
Ä
,
e2 ire'h*
ire2h2 I/ 11 __,. 11 \
e'
e2
+
L
A
fi/
,
, r • (r ■ pi)p21
e2h
A
4l^rXP2,a2-2m^cV{rXp,,a2-rXp2,<F,}
+ 4mim
T^-t 2c [( g JrJ ^ ~ 3 ( g ' ' r)r^ 2 ' ^ - ¥3 »» '** g <J 4
+
(83-15)
The total Hamiltonian of the two-particle system in this approximation is
H = H{0) + H f + Û,
(83.16)
where H(0) is the free-particle Hamiltonian (83.6).
TWO ELECTRONS
If the two particles are identical (two electrons), then the scattering amplitude
includes a second term which is represented by the "exchange" diagram
§83
341
BreiVs Equation
There is, however, no need to calculate the contribution of this term to the
interaction operator. The reason is that the description of a system of identical
particles by means of Schrödinger's equation can be achieved with an interaction
operator similar to that for non-identical particles, if the solutions of the equation
are appropriately symmetrized. In particular, for particle scattering this symmetrization will automatically take account of the contributions to the amplitude
which correspond to the two Feynman diagrams.
Thus the Hamiltonian of the two-electron system is obtained from formulae
(83.15), (83.16) by simply putting m, = m2:t
Û =
2 ^ (P' + ^ ~ 8^? (P' + ^
+ Ü(
P " P 2 ' r) '
e2h
+ 4 m 2 c i r i {-(^1 + 2<T2) • r x p, + (<r2 + 2cr,) • r x p2} +
+
1 {Ûïi^ -3(g| ' T2 'r)-T°>'°>5(r)}-
(8317)
'
The presence of terms in 5(r) does not, of course, imply that there is a particularly
strong interaction. The value of all the correction terms after integration is of the
same order, and according to the sense of the expansion used they are all to be
regarded as small compared with the first term (the Coulomb interaction).
The different groups of terms in the interaction operator (83.17) are of different
types. Thefirstthree terms have a purely orbital origin. The next term is linear in the
spin operators of the particles, and corresponds to the spin-orbit interaction. The last
term, which is quadratic in the spin operators, describes the spin-spin interaction.$
ELECTRON AND POSITRON
The electron-positron system needs special consideration. The scattering amplitude in this case consists of two terms:
Mp = -« 2 [fl(p:)7 M «(pO]D^(p.-p:)[flep + )7 F ii(-pi)] +
+ e 2 [ö(-p,)7^(P-)]D^(p- + p,)[ö(p:)7vM(-p;)];
(83.18)
the first term corresponds to the scattering diagram and the second to the annihilation diagram. Since the wave function of the "electron + positron" system need
t The wave equation with the Hamiltonian (83.17) was first derived by G. Breit (1929); a consistent
quantum-mechanical derivation was given by L. D. Landau (1932).
t This interaction has been mentioned in CM, §72, in connection with the fine structure of the
atomic levels, and the spin-spin interaction between the electrons and the nucleus is considered in QM,
§121, in connection with the hyperfine structure of levels. In particular, the formula QM (121.9)
corresponds to the delta-function term in the spin-spin interaction operator.
342
§83
Interaction of Electrons
not be antisymmetric, the two terms make independent contributions to the
interaction operator.
The first term (which has the same structure as the amplitude (83.1)) leads, of
course, to an operator differing only in sign from (83.17). Let us now consider the
transformation of the second term.
Here we use the photon propagator in the ordinary gauge:
U v
»
" "F g ^ ~ cu 2 /c 2 -k 2
g
^
In the present case k = p+ + p_, and since the particles are "almost non-relativistic", we have
o> V
= ( 6+ + e_)2/c2 - 4m 2 c 2 > (p+ + p.) 2 ^ k2.
(83.19)
For the photon propagator it is therefore sufficient to write
D^-(7r/m 2 c 2 )g^.
This already contains a factor 1/c2. It is therefore sufficient to take the amplitudes
u(p) in the zero-order approximation:
ii(p-) = V(2m) (W0-0)),
ii(-p + ) = V(2m)
(*m),
where w ( -\ w(0) are the three-dimensional spinors which appear in (23.12); the index
(0) will henceforward be omitted. With these amplitudes we have
« ( - P * ) 7 ° K ( P - ) = u*(-p+)u(p.)
= 0,
fi(""P+)7w(p-) = u*(-p + )au(p_) = 2m(w*crw_).
On substitution of these expressions, the "annihilation" term in the scattering
amplitude becomes
M(ann> =
_ £2 _ ^ (2m )2( w*a w-)( W^CTW').
(83.20)
It is not yet possible, however, to draw from this any immediate conclusions as to
the form of the interaction operator. Firstly, the spinors w in terms of which the
amplitudes w(-p+) are expressed are not yet literally positron spinors. The positron
amplitudes are got from n(-p+) by charge conjugation, and according to (26.6) the
corresponding spinors (which we denote by w+) are related to w by w+ = cryw*,
whence
w* = cryw+ = -w+ory,
w = -o-yWÎ.
(83.21)
Secondly, the scattering amplitude must be brought to a form in which the
§84
343
Positronium
electron spinors (w_ and w!) are contracted, and likewise the positron spinors (w+
and w+). This is achieved by means of the formula
(w*orw-)(w!*orw') = \(wL*w-)(w*w') - 5(vv:*crvv-)(vv*aYiO,
(83.22)
which follows from (28.17)
Finally, expressing w and w' in terms of w+ and vv+ by (83.21), we easily find
(83.23)
(w*aw') = -(w+*crw+).
Substituting (83.23) in (83.22) and then in (83.20), we obtain the final expression
for the annihilation part of the scattering amplitude:
M£ nn) = - 4m 2 (wL*wl* y^rp
(3 + cr+ • a_)l w_w+),
the matrices cr_ and cr+ acting on w_ and w+ respectively. The expression in the
square brackets is the interaction operator in the momentum representation. The
corresponding coordinate operator is
u
(r)=
ii^p °+a+ • a-) ô(r)'
r=r
"r+
(83.24)
(J. Pirenne, 1947; V. B. Berestetskiï and L. D. Landau, 1949). The total electronpositron interaction operator is -Û + L7(ann), with Û given by (83.17).
§84. Positronium
The results obtained in §83 can be applied to positronium, a hydrogen-like
system consisting of an electron and a positron.
In the centre-of-mass system, the electron and positron momentum operators in
positronium are p- = -p+ = p, where p = -iftV is the operator of the momentum of
relative motion corresponding to the relative position vector r = r- - r+. The total
Hamiltonian for positronium ist
.
n 2 e2 *
H = * - - - + Vi+ V 2 + V3,
m r
F{^+
^»}.
V2 = 6MoAî-S,
V3 = 6^ ^{{è'rf'r)
+ In ordinary units.
- IS2} + Air^m ~ 2)fi(r).j
(84.1)
344
Interaction of Electrons
§84
Here /LIO = ehjlmc is the Bohr magneton, hi = r x p is the orbital angular momentum
operator, S = 2(CF+ + CT-) the total spin operator of the system, whose square
S2 = 2(3 + <r+ • or_). Vi includes all the purely orbital correction terms, V2 the spinorbit interaction, and V3 the spin-spin and "annihilation" interactions.
The "unperturbed" Hamiltonian
H = p2/m - e2lr
naturally differs from the Hamiltonian of the hydrogen atom only in that the
electron mass is replaced by the reduced mass jm. The energy levels of positronium
therefore have absolute values which are half those of hydrogen:
E = -me*l4h2n\
(84.2)
where n is the principal quantum number.
The remaining terms in (84.1) cause a splitting of the levels (84.2), i.e. the
appearance of a fine structure. The resulting levels are classified primarily by the
values of the total angular momentum j. We also see that the particle spin operators
appear in the Hamiltonian (84.1) only through the sum S. This means that the
Hamiltonian commutes with the squared total spin operator S2, i.e. the value of the
total spin continues to be conserved in the approximation considered (the second
approximation with respect to 1/c). The energy levels of positronium can therefore
be classified by the total spin, which takes values S = 0 and S = 1. The levels with
spin 0 are called parapositronium levels, and those with spin 1 orthopositronium
levels.
It must be emphasized that the conservation of the total spin in positronium is
actually exact, and does not depend on any particular approximation with respect
to 1/c; it follows from the CP invariance of electromagnetic interactions. Positronium is a strictly neutral system, and its states therefore have definite charge
parity and combined parity. The latter is equal to (- l) s + 1 (see §27, Problem); since S
can take only two values, 0 and 1, the conservation of combined parity is equivalent to
that of total spin.
When S = 0 the total angular momentum j is equal to the orbital angular
momentum, but when S = 1 and j is given, the number / can take the values j, j ± 1,
so that in general each level (n,j) of orthopositronium is split into three. Since the
values / = j and / = j ± 1 correspond to opposite parities, the Hamiltonian has no
matrix elements between these states. But the perturbation operator (the first term
in V3) in general has non-diagonal elements between states with / = j + 1 and
/ = j - 1; the number I then, of course, no longer has the strict significance of an
orbital angular momentum.
The Zeeman effect in positronium has some unusual features (V. B. Berestetskil
and I. Ya. Pomerachuk, 1949).
The orbital magnetic moment of positronium is always zero: since in positronium r+ x p+ = r_ x p_, we have the operator
|Xf = /"-o(r+ x p+ - r_ x p_) = 0.
§84
Positronium
345
The spin magnetic moment operator is
(ILS = /x0(<r+ - a );
(84.3)
it is not proportional to the total spin operator S = j(cr+ + o-_), and the operators S2
and \K2 do not commute. The states with definite values of the total spin S and its
component Sz are therefore not, in general, eigenstates for the magnetic moment.
States with given S kand Sz are described by spin functions xss, having the form
X\i = a+a-,
*,,-, = ß + ß-,
Xio = :^j(a + /3- + u!-ß+),
(84.4)
Xoo = ^ 7 2 ( a + ß - - a - ß + ) ,
where a and ß are the spin functions of one particle corresponding to spin
projections +2 and - 2 ; the suffixes + and - indicate that the function belongs to
the positron and the electron respectively. The first two spin functions, *n and
X1.-1, are also eigenfunctions of the operator ixz, corresponding to the eigenvalue
zero. The functions xw and #00 are not eigenfunctions of /xz, but the following
combinations are eigenfunctions:
^72(Xio + Xoo) = <*+ß-,
^ 2 (Xio ~ Afoo) = oußH
(84.5)
It is easy to see that the only non-zero matrix elements (S'Sz|ji,z|SSz) calculated
from the functions (84.4) are
<00|M*|10> = <10|/iz|00> = 2/uio.
(84.6)
In weak magnetic fields (when iioH <^ A, where A is the difference between the
level energies with S = 0 and S = 1) the initial approximation for the calculation of
the Zeeman splitting is formed by states with definite values of the total spin. In the
first approximation, this splitting is given by the mean value of the perturbation
energy operator
V„ = -ßzH.
But all the diagonal matrix elements of the operator ßz, and therefore VH, as
calculated from the functions (84.4), are zero. Thus, in weak fields, there is no
linear Zeeman effect in positronium.
In the opposite limiting case of strong fields (/x0H > A), we can neglect the spin
interaction which brings about definite values of S. The components of the split
level will then correspond to states with definite values of ixz = ±2/x0 (described by
the functions (84.5)), and the displacement of these components will be ±2/x0H.
§84
Interaction of Electrons
346
PROBLEMS
PROBLEM 1. Determine the fine structure of the levels of parapositronium (V. B. Berestetskif,
1949).t
SOLUTION. The required level splitting energy is given by the mean values of the correction terms
in the Hamiltonian (84.1), calculated by means of the wave functions of the unperturbed states with
different values of j = I ( = 0 , 1 , . . . , n - 1). When S = 0, the only non-zero contributions come from Vi
and the second term in V3.
The unperturbed wave functions, which we denote by «/>, satisfy Schrödinger's equation^
P2* = - A 0 = ( E + | W
E = -l/4n 2 .
Hence
P V = P 2 ( E + ^ = ( E + ^) *-*A±
+ 2(Vjy<r<lß)
= (E+J)^ + 4ir«(r)^+^.
The mean value is
= (E + 1) + 47r|<M0)|2 + J J - ^ d r d o .
The integral is equal to —/ |*A(0)|2 do ; since i/f(0) = 0 except when I =0, and the wave functions of S
states are spherically symmetric, the integral is -4TT|I/K0)|2 and cancels with the second term.
Using the orbital angular momentum operator I = r x p, we can write
v v
d r
T
r d r r
r
r)w
\
The other required mean value is therefore
J**p-<r-Wp*d J *=-J**JÜfd J *
= ^(E + ^-4ir|^(0)| 2 -(+l)r';
if J = 0, the last term does not appear.
According to the familiar formulae in the theory of the hydrogen atom (QM (36.14), (36.16)), with the
electron mass m replaced by im, we have
2n2'
8irrT ""
73 =
4nJl(! + l)(2J + l)
2n\2l +l)'
(,
"0)-
From these formulae, we find the required energy levels of parapositronium:
p
^
=
_
l
4?
_ 2m*4 I / _ i
a
H\
"F2ÏP\2l + l 32« I
t The fine structure of orthopositronium has been discussed by A. A. Sokolov and V. N. Tsytovich,
Zhurnal éksperimentaVnoi i teoreticheskol fiziki 24, 253, 1953.
t In the calculation it is convenient to use atomic units.
§85
The Interaction of Atoms at Large Distances
347
PROBLEM 2. Determine the difference between the energies of the ground states (n = 1,1 =0) of
orthopositronium and parapositronium.
SOLUTION. The dependence of the energy on the total spin S when I = 0 arises only from the mean
value of the second term in V3; the first term gives zero on averaging over angles in the spherically
symmetric S state.t The ground level of orthopositronium (3Si) lies above that of parapositronium ('So)
by an amount
E( 3 S,)-E( l So) = ^ a 2 ^ - 8 . 2 x l O - 4 e V .
§85. The interaction of atoms at large distances
Attractive forces act between two neutral atoms at a distance r apart which is
large compared with the dimensions of the atoms themselves. The usual quantummechanical calculation of these forces (see QM9 §89) is, however, inapplicable at
very large distances, because this calculation considered only the electrostatic
interaction, i.e. retardation effects are ignored. Such a treatment is valid only if the
distance r is small in comparison with the characteristic wavelengths A0 of the
interacting atoms. In this section we shall give a calculation not subject to that
limitation.
The procedure is much the same as in §83: the amplitude of elastic scattering
(i.e. scattering without change of internal state) for two different atoms is calculated in the first non-vanishing approximation. The resulting expression is compared with the amplitude which would result if the interaction between the atoms
were described by the potential energy U(r).
In the latter case, the first non-vanishing S-matrix element describing the
process in question would be the first-approximation element
Sfi = - i j ^i*(r,)^5*(r2)U(r)^i(r,)^2(r2) d3*i <*3x2 x
x f exp{-i(e, + e 2 ~ eî - e'2)t) dt.
(85.1)
Here \J/{9 if/2 and i/r{, ^2 are the time-independent parts of the wave functions (plane
waves), describing the translational motion of the two atoms with initial and final
momenta; ei, e2 and ei, e2 are the kinetic energies of this motion; the coordinates rj
and r2 of the atoms as a whole can be regarded as the coordinates of their
nuclei, and the distance r = |ri — r2|. The time integral in (85.1) gives, as usual, the delta
function which expresses the law of conservation of energy. For convenience in the
subsequent comparison, however, it is better to consider formally the limiting case of
atoms of infinite mass; for given momenta, this limit corresponds to zero energies e. Or
we can say that the times considered are small in comparison with the periods 1/s.
t The averaging over angles must precede the integration over r, as is evident from the manner of
calculation of the integral (83.14) which leads to the first term in V3.
Interaction of Electrons
348
§85
Then (85.1) becomes
Sfi = - it jj
M*M*U(r)M2
d3x, d3x2,
(85.2)
where t is the time integration range.
The actual calculation of the elastic-scattering amplitude, under these assumptions, can be divided into two stages. We first average the S-operator over the
wave functions of the unchanged (ground) states of the two atoms (for given
coordinates ri and r2 of their nuclei) and over the photon vacuum: no photons are
present at the beginning and end of the process. We then obtain a quantity which is
a function of the distance between the nuclei, and which we denote by (S(r)).t In
order to find the required transition matrix element, we have then to calculate the
integral
Sfi = jf
^i*^2*<S(r))^,^2 d3x,d3x2.
(85.3)
Comparison with (85.2) shows that, if (S(r)) is obtained in the form (S(r)) =
-itU(r), then the function U(r) is the required energy of interaction of the atoms.
Since we are here concerned with a collision not of elementary particles but of
more complicated systems, namely atoms, which may be excited in the intermediate states, the usual formal rules of the diagram technique are not directly
applicable, and we shall begin from the expression of the S-operator as the
expansion (72.10).
In the interaction of atoms, the important field components are those whose
frequencies are of the order of atomic frequencies or less. The corresponding
wavelengths are large compared with atomic dimensions. The electromagnetic
interaction operator can therefore be taken in the form
V-=-Ê(r,)-di-Ê(r2)-d 2 f
(85.4)
where di, d2 are the dipole moment operators of the atoms (i.e. the time-dependent
or Heisenberg operators) and Ê(r) is the electric field operator at the positions of
the corresponding atoms.
The mean values of the dipole moment of the atom in its stationary states are
zero (QM, §75). Hence it follows that a non-zero amplitude occurs only in the
fourth approximation of perturbation theory, i.e. as the matrix element of the
operator
S(4) = {^f j dt,...
| dU. T{ V(t,) V(t2) V(f3) V(t4)}:
(85.5)
in lower orders, every term in the product of operators V will contain at least one
of the operators di and d2 in the first degree, and on averaging over the state of the
corresponding atom the result is zero.
t In place of the more lengthy notation for a diagonal matrix element, indicating the states of the
atom and of the photon field.
§85
349
The Interaction of Atoms at Large Distances
Let us now average the operator (85.5) over the photon vacuum. According to
Wick's theorem, the expectation value of the product of four field operators E is
the sum of products of pairwise expectation values (contractions). The division into
pairs can be made in three ways, which may be represented by the diagrams
1
2
0
o
o—
3
0
4
1
2
\\
3
/
/
>\/
P
//
\
1
?
I
l
I
\
]
4
À
3
2
?
!
I
I
|
A
(85.6)
4
where the broken lines represent contractions and the numbers correspond to the
arguments tu f2, t3, f4. Moreover, spatial coordinates ri or r2 may correspond to
each point, with two points having n and two r2, since otherwise, in the relevant
term of the sum, one of the operators di and d2 will appear in the first degree, giving
zero on averaging with respect to the state of the atom. It is clear that there must
be one n and one r2 at the ends of each line, since otherwise the diagram (i.e. the
corresponding term in the matrix element) will reduce to a product of independent
functions of n and of r2 instead of being a function of the difference r i - r 2 ; such
terms do not pertain to scattering.t In accordance with these conditions, the
arguments n and r2 can be assigned to the four points in the diagram in four ways.
Using also the commutativity of the operators di and d2 and averaging over the
states of each atom, we find that all the 3 x 4 = 12 terms thus obtained are equal,
differing only in the naming of the variables of integration. The result is
(S(r)) = \fdtx...f
dU. <T(E<(r„ f ,)Efc(r2, f2))> x
x <T(E,(r2, U)Em{TU r4))>mdii(fi)dim(t4))><T(d2k(t2)d2i(r3))>,
(85.7)
where i, k, /, m are three-dimensional vector indices.
To calculate the quantities
DUxx ~ x2) = <T(E,(Xi)Efc(x2))>
< 85 ' 8 )
we use the gauge in which the scalar potential 4> = 0. Then E = - dkldt, and we
have
D*U. - x2) = ^ L <T(Ai(xi)Ak(x2))>
where x = xx - x 2 and Dik(x) is the photon propagator in this gauge.t
+ They give corrections, of no interest here, to the energy eigenvalues of each atom.
t The first derivative dDlk(t)ldt has a finite discontinuity at t = 0. The second derivative, i.e. the
function D u ( 0 , therefore includes a delta-function term ~~8<4)(X2- xi). This term, however, is zero for
all ri * ri and is of no interest here.
§85
Interaction of Electrons
350
We shall find it more convenient to use the propagator Dik(co, r) in the mixed
<o-r representation, related to Dik(ty r) by
Dik(f,r) = | DÄk(cü,r) e~iü>t dcollir,
with
Dfk(U r) = - i | co2Dik(w, r) <T/ù" dcollir.
(85.9)
The quantities
aik(t. - t2) = «<T(A(ri)dk(f2))>
(85.10)
can be expressed as a Fourier integral
00
ocik(t) = j e-*" aik((o) d(ol2n.
—oü
Putting for convenience t2 = 0, fi = t, and using the definition of the T product, we
can write
a ik ((o)= |
e"aik(0*
o
= i | « M <4k(0)d,(t)> dt + ij e^iMOMO))
dt.
(85.11)
0
-oo
The mean values (with respect to the ground state of the atom) which appear here
can be expressed in terms of the matrix elements of the dipole moment:
(dk(0)di(t)) =
(di(t)dk(0)) =
n
2(dk)on(diUei^\
,
n
Z(di)on(dkUe-i'0-'ot.
For convergence of the integrals in (85.11) it is necessary to take <o in the first
integral as to - «0, and in the second as <o + iO. Carrying out the integrations, we
obtain
/
x _ \ ? / (^i)on(dk)no
otik{(o) = 2J I
, (dk)on(dj)nQ \
TK^
7
JKh
<Q< \~)\
(85.12)
§85
The Interaction of Atoms at Large Distances
351
If the ground state is an S state, this tensor is simply a scalar, aik((o) = a8ik,
where
a((o) = ^2ldon| 2 (
^+
-1
JK\
(85.13)
If, however, the atom has an angular momentum, the same result is obtained on
averaging over the directions of this angular momentum, and it will be assumed that
this has been done; we are, of course, interested in the interaction of atoms
averaged over their mutual orientations.
Comparison of (85.12) with (59.17) shows that alfc(co) is the same as the tensor
for coherent scattering of a photon of frequency co by an atom. According to
(59.23), a(cu) for co > 0 is the polarizability of the atom. Its values for cu < 0 are
expressed in terms of those for w > 0 by means of the relation a(-<o) = a(<o),
which is obvious from (85.13).
Substitution of these expressions in (85.7) gives
A*
<S(r»-iJ aii...
j* dfl»
2diOi
2 w
ai4
-r dü
-—
——dw
-z—
x
Z7T
LIT
ITT ZTT
x a,(ni)a2(n2)a>iDjk(ü)i, r)(ü\Dlk{h>2, r) x
x exp{-icuiU - h) - i<D2(h - t4) - iù{(t, - U) - iü2(t2 - h)},
where r = n - r2, and we have used the fact that Djk(a), r) is an even function of r.
The integration over three times gives delta functions (whereby -Hi = (l2 = <o2 =
o>i), and that over the fourth time gives a factor t:
(S(r)) = -itU(r),
where
U{r) = \i f
to4a,(to)a2(to)[Dik(to,r)]2dto/27r.
(85.14)
This formula gives the energy of interaction of two atoms at any distance large
compared with the atomic dimensions a. We have now to find and insert an explicit
expression for Dlk(û>, r).
Comparison of the expressions (76.14) and (76.8) shows that
kik
k
Dik(to,k)= - ( s i k - ^ ) D
(o),k),
Interaction of Electrons
352
§85
where D(<o, k) is given by (76.8). In the co - r representation, the relationship is
correspondingly
Da(co,r) = -(**+^J^)D(<»>rl
<8515)
Substitution of D(co, r) from (76.16), and carrying out the differentiations, gives
D
r)=
i+
^- h( 4"^)
r
W r
CÜ
r
+
/J r
Then, substituting this expression in (85.14), we find by a simple calculation,
using the fact that a(co) is even, the final expression for the interaction energy of
the atoms:
l/(r) = - L i \ <o*aMai{ü))e2i"r\\ + — 7 ^ z- 7 ^ 3 + r V 4I d<u.
-rrr J
L cur (cor) (cor) (cor) J
o
(85.17)
This general result can be simplified in the limiting cases of "small" distances
{a<r < A0) and "large" distances (r ^> A0).
When r < A0, the important values in the integral are (see below) œ ~ o>0, where
cuo^ c/A0 are the atomic frequencies, and therefore cor<\. Then only the last term
in the bracket need be retained, and the exponential may be replaced by unity.
Writing the integral as one from - » to °o (with a view to the subsequent
calculations), we find
3
l/(r) = ^
' f
ai(üi)a2(oi)dcü.
(85.18)
The interaction law at these distances proves to be 1/r6, as it should. The integral in
(85.18) is easily calculated, after substitution of a(ce) from (85.13), by closing the
contour of integration with an infinite semicircle in the lower half of the complex
o>-plane; the integral is determined from the residues of the integrand at the poles
o) = Ü>„O ~ cuo. Assuming (to simplify the result) that the two atoms are identical, we
find (in ordinary units)
U(r)=-^^n!äM_
(85.19)
the same as the familiar London's formula (see QM, §89, Problem).
In the limit of large distances (r > A0), the important values in the integral are
o> ^ c\r < CJQ; when <o ** Ù>0, the integral is made small by the rapidly oscillating
§85
The Interaction of Atoms at Large Distances
353
factor exp2io>r. We can therefore replace the polarizabilities ai(o>) and a2(<o) by
their static values c*i(0) and a2(0). The integration is then elementary. (To ensure
convergence, r in the exponential is to be replaced by r + iO.) The final result is (in
ordinary units)
4ir
r
(H. B. G. Casimir and D. Polder, 1948).t
+ The derivation given here is due to I. E. Dzyaloshinskif.
CHAPTER X
INTERACTION OF ELECTRONS WITH PHOTONS
§ 86. Scattering of a photon by an electron
T H E conservation of 4-momentum in the scattering of a photon by a free electron
(the Compton effect ) is expressed by the equation
(86.1)
p + k = p' + k\
where p and k are the 4-momenta of the electron and the photon before the
collision, and p' and k' their 4-momenta after the collision. The kinematic invariants defined in §66 are
s = (p +fc)2= (p' + kf = m 2 + 2pk = m2 + 2p'k\ 1
t = (P -p'f
M
= ( k ' - k)2 = 2 ( m 2 - p p ' ) = - 2 k k \
= (p - k')2 = ( p ' - fc)2 =
m
2
- 2pk' = m 2 - 2p'k,
s + f + w=2m2.
i
The process in question is represented by the two Feynman diagrams (74.14),
and its amplitude is
Mfi = - 4ir« 2 e;*^(ö l Q My ii),
(86.3)
where
Q1" = ^ p
y^(7P + 7k + m ) 7 v + j ^ j p
yv(yp - yk'+
m
)YM-
<86-4)
Here e, e' are the polarization 4-vectors of the initial and final photons; u, u' the
bispinor amplitudes of the initial and final electrons.
According to the rules given in §65, for arbitrary polarization states of the
particles \Mfl\2 is replaced by
iM/ip^lôirVtrip^a'Q^Wp^On
(86.5)
where p (f) , p{e)' are the density matrices of the initial and final electrons, piy\ pKy)l
those of the photons. The photon (tensor) indices are written explicitly, but the
electron (bispinor) indices are not. The trace symbol refers to the latter indices, as
does the superscript plus in the definiton QßV = y°Qlvy°.
354
i
§86
355
Scattering of a Photon by an Electron
Let us consider the scattering of an unpolarized photon by an unpolarized
electron, without regard to their polarizations after the scattering. The averaging
with respect to the polarizations of all particles is given by the density matrices:
PS = pfc* = "
te*,
PU) = 2<7P + m),
p(e)' = \{yp' + m);
the change to summation over the polarizations of the final particles involves a
further multiplication by 2 x 2 = 4.
From formula (64.23), in which we must now put I2 = \(s -m 2 ) 2 (see (64.15a)),
we fini the cross-section
TTP*
At
From (65.2a), Q^A = QxM- Separating the terms which differ only by the changes
k «-> - kf (and accordingly s <->M), we can put the cross-section in the form
d<T = dt
77c
4
2 2
( 5 - m ) t/ (S ' U) + g(5 ' U) + f(U* S) + g( "' 5^'
with the notation
fU, u) =
g ( 5 M) =
'
_ m y tr{(7p' + m)7"(YP + yk + m)7"(yp + m)7„(7p + 7k + m)7^},
4(5-m 2 )(M-m 2 ) t r { ( 7 P ' +
m)
^^p
+
Vk + m)"y"(7P + m)yM x
x(7p-7k' + m)7,};
this notation takes account of the fact that the result will depend only on the
invariant quantities.
The summation over p. and v is effected by means of formulae (22.6); then,
omitting terms which contain an odd number of factors 7, we obtain
f(s, ") = ,
m ytr{(7p')(7P
+ yk)(yp)(yp + yk) + 4m\yp + yk)(yk - yp')+
+ m2(7p)(7p') + 4m4}.
The trace is calculated by means of formulae (22.13); expressing all quantities in
terms of the invariants s and u, we easily obtain
fis, u) = {s _ 2 m y {4m4 - (5 - m2)(u - m2) + 2m2(s - m2)}.
Similarly,
g(5>M) =
(s-m2iXu-m2){4m2 +
(5
~m2)
+
( M
-
m
^
356
Interaction of Electrons with Photons
§86
The cross-section is thus
,
Q
+
2
m2dt
( ^
+
\{ m2
^ \
,
m2 y ,
1 ( ^ +^ ) 1
(86 . 6)
where re = e2lm. This formula expresses the cross-section in terms of invariant
quantities, and can easily be used to express it in terms of the collision parameters
in any specified frame of reference.
Let us do this for the laboratory system, in which the electron is at rest before
the collision: p = (m, 0). Here
s - m 2 = 2mco,
u - m 2 = -2mco'.
(86.7)
Squaring the equation of conservation of 4-momentum in the form p + k- k' = p \
we have
pfc-pk'-kk' = 0,
whence (in the laboratory system)
m(co - co') - ÜHD'O - cos # ) = 0,
where fl is the angle of scattering of the photon. This equation gives the relation
between the photon energy change and the scattering angle:
oj
<i)
m
The invariant t is
t = - 2kk' = - 2WÛ)'(1 - cos
#).
For a given energy <o we find, using (86.8),
at = 2<u'2 d cos # = (l/tr)a)'2 do' (do' = 2TT sin # dd).
Substitution of these expressions in (86.6) gives the following formula for the
scattering cross-section in the laboratory system:
da=jr2(«:y(i2.+«I.
\<û I \<t)
b)
sin *
A
I
do>
(86.9)
(O. Klein and Y. Nishina, 1929; I. E. Tamm, 1930).
Since the angle § is unambiguously related to o>' by (86.8), the cross-section can
§86
Scattering of a Photon by an Electron
357
be expressed in terms of the energy co' of the scattered photon:
* , . „ ; * * q i + -:+(«_ UV _ 2„ ( J. _ I) j,
(86.10)
with ay' varying in the range
w
1 + 2ü>lm
-w'^tu.
(86.11)
When a» < m, we can put a>' ~ a> in (86.9), and the result is, as it should be, the
classical non-relativistic Thomson's formula
der = ir 2 (l + cos 2 d) do';
(86.12)
see Fields, (78.7).
To calculate the total cross-section, we return to formula (86.6). The invariants
s, t, u there take values satisfying the inequalities
s^m2,
fssO,
us^m\
(86.13)
These have already been derived in §67; the corresponding physical region is I in
Fig. 7 (§67). They are also easily obtained directly from the expressions for the
invariants in the centre-of-mass system. Here p + k = 0, and the energies e of the
electron and <o of the photon are related by e = V(co2 + m2). The invariants are
s =(e + (x>Y = m2 + 2O)(Û> + e), 1
u = m2 - 2o)(e + a) cos 0),
(86.14)
2
t = -2û> (l-cos0),
where 6 is the scattering angle (the angle between p and p' or between k and k').
The three inequalities (86.13) then result from the conditions w ^ O and -1*£
cos 6 «s 1.
For a given s (i.e. a given energy of the particles), the integration with respect
to t can be replaced by one with respect t o u = 2 m 2 - s - t over the range
m4ls^u^2m2-s.
Using instead of s and u the quantities
x=(s-m2)/m\
y = (m 2 -M)/m 2 ,
we obtain
•¥ \ [(H),+H+SG-f)]<*
*/(* + !)
(86.15)
358
Interaction of Electrons with Photons
§86
and after the elementary integration
The leading terms in the expansion for x < 1 (the non-relativistic case) are
(86.17)
<i = ~^{\-x).
The first term is the classical Thomson cross-section. In the opposite, ultrarelativistic, case (x > 1), the expansion of (86.16) gives
a = 2-rrrl £ (log x + {).
(86.18)
In the laboratory system,
x = 2o>/m,
(86.19)
so that formulae (86.16H86.18) give immediately the photon energy dependence of
the cross-section for scattering by an electron at rest. Figure 13 shows <r as a
function of w/m.
In the ultra-relativistic case, the cross-section decreases with increasing energy
both in the laboratory system (tr « Û>-' log a>) and in the centre-of-mass system
(x"=4û>2/m2, <r « a)"2 log w). But the angular distribution in the ultra-relativistic
case has quite different forms in these two frames of reference.
In the laboratory system, the differential cross-section has a sharp peak in the
forward direction. In a narrow cone d s£ V(m/o>) we have o>' ~ CD and the cross10
—
0-8
06
q
8
5r2
3 •
04
02
0
001 002 005 01 0 2 0 5
FIG.
1
13.
2
5
10 20
50 100
§87
359
Scattering of a Photon by an Electron
section dcr/do' — r2., reaching the value r2 as #-*0. Outside this cone, the crosssection decreases, and in the range ü2>mlo> (where w ' « m/(l - c o s #)) we have
do' " 2 r ' < ü ( l - cos #)'
i.e. the cross-section is reduced by a factor ~w/m.
In the centre-of-mass system, on the other hand, the differential cross-section
has a peak in the backward direction. For IT - 6 < 1 we have from (86.14)
s-m2
m
_ 4<o2
m
m2-u _ , , <o2,
m
m
2
The largest term in the cross-section (86.6) is
dor « 8UT2.
m 2 dt
4(s-m z )(m z -ur
whence
^ =H1
+ (T_
d
°9)V/m!.
(86.20)
The cross-section dcrldo' ~ r« in a narrow cone ir - 6 *£ m/w; outside this cone it is
reduced by a factor of the order of ~o)2/m2.
§87. Scattering of a photon by an electron. Polarization effects
We shall now go back to the original formulae of §86 and show how the
calculations must be made in order to take account of the polarization of the initial
and final photons and electrons.
The density matrix of the photon can be expressed, according to (8.17), by
means of a pair of unit 4-vectors e(]\ el2) which satisfy the conditions (8.16). In the
present case, these vectors can be taken to be, for both photons, the 4-vectors
defined in §70t
e(1) = N/V(-N 2 ),
e(2) = P/V(-P 2 ),
(87.1)
P x = (pA + p' x )- Kk(pK + p'K)IK\
(87.2)
where
k
k
t An alternative procedureqxis=to consider
from
start a specified frame of reference (say the
k'k-k
=pthe
-p'\
laboratory system) and take for each photon as e0), em purely spatial unit vectors e = (0, e) which are
orthogonal to the photon momenta and to each other. In that case, however, the calculations will be
entirely in three-dimensional form, and the result will not be invariant.
§87
Interaction of Electrons with Photons
360
The quantities Q*1* in (86.5) are given by (86.4). They may be regarded as
components of a 4-tensor (in the sense that they form a 4-tensor after being
contracted with spinors as the quantities ü'Q^u). All the components of a 4-tensor
can be obtained by projecting it on four mutually orthogonal 4-vectors, for instance
on P, N, q and K defined above. Since the tensors p{JÏ,pl$ contain only components along P and N, we need in fact only the components of Q^ along these
4-vectors. In other words, it is sufficient to find in Q^ the terms of the form
Q,„ = Q o M V » + efe?)
+ Q,(e<M2) + efe?)
- iQ2(e(;)et2) - efe?)
-
+ Q3(e!Ml) - e?e?);
(87.3)
the remaining terms would disappear on substitution in (86.5). The quantities Q0
and Qi are scalars in the same sense that QM„ is a 4-tensor; they therefore contain
the matrices 7 only in the "invariant" combinations yK, etc. In the same sense, Q\
and Q2 are pseudoscalars (N is a pseudovector), and hence must contain the matrix
By direct projection of the tensor QM„ we find
etc. In the calculation it is convenient first to express Q»v in terms of the mutually
orthogonal 4-vectors P, N, q, K:
There then remain some purely algebraic calculations using the formulae given
in §22. It is also possible to make changes in Q*v which do not affect the result
after the subsequent construction of the product ü'Q^u. For instance, since
ü\yp + yp')u = 2mö'u,
0
V(7<Ï)M
=
ö'(Y5(YP) + (yp')Y5)w =2mö'75M,
we can make in Q*v the changes
JP + YP'-»2m,
y\yq)-*2my5.
(87.4)
The detailed calculations are omitted here; the final result ist
Qo = - ma+,
Qx = \ia+y5(yK),
Q2 = - ma+y\
Q3 = ma+ + \a-(yK),
(87.5)
t The expression (87.3) with the values (87.5) corresponds to the formulae (70.11M70.13) derived in
§70 from general considerations. Besides the equations /} = A = 0 which follow from T invariance,
another invariant amplitude (Ji) is here zero also. This is a property of the approximation of perturbation
theory used here, and would not occur in higher approximations.
§87
Scattering of a Photon by an Electron
361
where
a± =
1 ^ 1
5±
5.
s- m
u- m
In the subsequent calculations, it is convenient to apply to Q^ the same formal
treatment as has been described in §8 for the photon density matrix: the four
components of the tensor (87.3) in the directions e(l), e{2) are combined to form a
two-rowed matrix Q which is then expanded in terms of Pauli matrices. Similarly
to (8.18), we obtain
Q = Qo + Q • or,
Q = (Q lf Q2, Q3).
(87.6)
The components of the tensor Q^ = y°Qtvy° in (86.5) are easily seen from (87.3),
(87.5) and the rules (65.2a) to be obtained from those of QßV on replacing Q0, Q i , . . .
by Oo, Qu . • •, where
Oo=Qo,
Qi = - Q i ,
Q2 = - Q 2 ,
Q3=Q3,
(87.7)
and simultaneously interchanging the indices /x, vA In matrix form,
Q = Qo + Q - * .
(87.8)
Let us now define more precisely the sense of the 4-vectors e(1), ei2) in relation
to the polarization of the photons. For each photon, the independent directions of
polarization will be determined by the components of the 3-vectors e(l), e(2) transverse to the photon momentum k.t It is easily seen that, in both the centre-of-mass
system and the laboratory system (in which the initial electron is at rest), the vector
P is in the plane of k and k', and N perpendicular to that plane. The direction e 0) is
therefore that of the polarization perpendicular to the plane of scattering, and e(2) is
that of the polarization in the plane of scattering. It must also be noted that the
Stokes parameters £i, &> & are defined with respect to the axes xyz, which form a
right-handed set with the z-axis in the direction of k. It is easily seen that for the
initial photon the vectors N, P x , k form such a set, and for the final photon the
vectors N, - P I , k' (where P± and Pi are the components of P perpendicular to k and
k' respectively). A change of sign e(2) in the photon density matrix (8.17) is
equivalent to a change of sign of £i and £2. The density matrices of the initial and
final photons, referred to the unit 4-vectors e(1) and e{2\ are therefore
(87.9)
t For the matrix Q»v in the original form (86.4) we should have simply Q„„ = Q,K. This property,
however, is lost as a result of transformations such as (87.4).
t The longitudinal components of e, like the time components of the 4-vectors ef can here be simply
ignored; this is permissible, owing to gauge invariance.
362
Interaction of Electrons with Photons
§87
The tensor trace
P AM V
pppW
is now calculated as the trace of the matrix product of the matrices (87.6)-(87.9),
using (33.5). The result is
\Mfi\2 = 8TTV tr{(pw'QapwQ0 + Pi€)'Q • p('}Q) +
+ (I + r ) • (p(e)'QoP(f)Q + P(e)'Qp(f)Oo) - i ( | -1') • P(0'Q x P(e)Q +
+ (I • * Vc)'QoP(f)Qo - P(e)'Q • P ( € ) Q) +
+ P(e)'(C • Q)p{e\i • Q) + p(f >'« • Q)P W (C • Q) - »È x *' • (p(e)'Qop(<)Q - P U ) / QP ( 0 OO)}.
(87.10)
SCATTERING BY UNPOLARIZED ELECTRONS
We shall complete the calculation of the cross-section for the scattering of
polarized photons by an unpolarized electron, summed over polarizations of the
final electron. To do so, we must put in (87.10)
P{e) = \(yp + m),
p(e)' = K7P' + m),
double the result, and substitute it in place of |M/j|2 in the formula (64.22) for the
cross-section:
where <f> is the azimuth in the centre-of-mass or laboratory system. Some of the
terms in (87.10) are identically zero; the calculation of the other terms gives the
final result (with the notation (86.15))
+<-H) + ^KH)K-7) +
+
««[(H),+(H)+a)Here da is the scattering cross-section for unpolarized photons given by (86.9); the
factor 2 appears because there is no summation over the polarizations of the final
photon in (87.11).
§87
363
Scattering of a Photon by an Electron
In the laboratory system, formula (87.11) becomes
.'\2
da = H ( ^ - ) do'{F0 + F3(6 + 6) + F„«,{i + Fafctf + F » ^ } ,
do' = sin tiddd<}>,
(87.12)
where
o = —; +
sin'' -9,
to
tu
Fii = 2 cos d,
F3 = sin2 #,
(87.13)
2
) cos d,
= \(t)
f-^ + —
<o 1
F33 = 1 + cos #
(U. Fano, 1949). Although (87.12) shows no explicit dependence on the azimuth <f>
of the scattering plane, there is an implicit dependence, since the parameters £1, £2,
$3 are defined with respect to the axes xyz, which are fixed to the scattering plane.
The x-axis is the same for both photons and perpendicular to the scattering plane:
x||kxk',
and the y-axes are in that plane:
y||kx(kxk'),
y'Uk'x(kxk').
Taking the sum of cross-sections differing in the sign of (■' (i.e. putting £' = 0 and
doubling the result), we obtain the total cross-section (summed over polarizations
of the final photon) for scattering of a polarized photon by an unpolarized electron.
Denoting this cross-section by dcr(£), we have
d(T{%) = \r\{to'ltofFdo\
(87.14)
where
F = F0 + Ç3Fi = -, + —
CO
(O
-(\-b)sin2d.
(87.15)
We see that the scattering cross-section for photons polarized perpendicular to the
scattering plane ( 6 = 1 ) is greater than that for photons polarized in the scattering
plane (& = - 1). The cross-section is independent of circular polarization and of the
parameter £1. The scattering cross-section is therefore equal to that for unpolarized
photons if there is no linear polarization relative to the x and y axes (£3 = 0) or even
if there is polarization relative to directions at 45° to these axes.
The cross-section for scattering of unpolarized photons with detection of a
polarized photon has similar properties. This cross-section, which we denote by
dcr(f'), is obtained from (87.12) by putting £ = 0:
_
iJli.jl
dvig)-rv =
irKto'ltofF'
do',
F' = Fo + £F 3 .
(87.16)
364
Interaction of Electrons with Photons
§87
From formula (87.12) it is also possible to deduce the polarization of the
secondary photon itself; we shall denote the parameters of this polarization by £tf)
to distinguish them from the detected polarization £'. According to the rules given
in §65, the quantities £K} are equal to the ratios of the coefficients of the £\ to the
term independent of £':
î? = (FnlF)Çu
& = (F22/F)£2,
#> = (F3 + F33fe)/F.
(87.17)
In particular, for the scattering of an unpolarized photon
tf> = #> = <>,
tf>-
. ,+ sin?*
.i v
(ojo) + eu /co — sin v
(87.18)
Here ffi > 0, i.e. the secondary photon is polarized perpendicular to the scattering
plane. Circular polarization of the secondary photon occurs only if the primary
photon is circularly polarized: ffi # 0 only if £2 ^ 0.
Let us consider the case of complete linear polarization of the incident photon
(6 = 0, £?+£ 3 = 1), and find the cross-section for scattering with detection of a
linearly polarized secondary photon. Expressing the parameters £, and £\ in terms
of the components of the photon polarization vectors e and e\ we obtain the
following expression for the scattering cross-section:
d* = lrl(^)2(^
+ - - 2 + 4cos2®)do\
(87.19)
where © is the angle between the directions of polarization of the incident and
scattered photons.t
According to this formula, the cross-section behaves quite differently when the
polarizations e and e' are perpendicular and when they are parallel. Distinguishing
these two cases by the suffixes JL and ||, we have in the non-relativistic limit (co <^ m,
ù/
Ä
0))
daL = 0,
dcr\\ = r\ cos2 0 do\
(87.20)
in agreement with the classical formulae. In the opposite, ultra-relativistic, case we
have a) > m, a>/ass m/(l - c o s #). Here the two ranges of large and small angles
(large and small co/co') must be distinguished:
m do
for tr
# 2 £>m/<o;
> mlw ; i
ddL = dan ^irl — do'^ \r\ —r—° — ior
1
"
ù)
û>(l — COSv)
it)
do\i = 0,
da\\ = r] cos 0 do'
for d < m/o>.
(87.21)
t Formula (87.19) itself could be more simply derived by writing from the start e = (0, e), e' = (0, e')
in the scattering amplitude (86.3) and continuing the calculation of the squared amplitude in threedimensional form (i.e. separating the time and space components of the 4-vectors).
On averaging cos 2 0 = (e • e')2 over the directions of e and e' (using (45.4a)), and doubling the
cross-section (to sum over e'), we of course return to (86.9).
§87
365
Scattering of a Photon by an Electron
We see that the scattering cross-section has its classical value at very small angles.
The approximate equality of da± and da^ at angles which are not very small
signifies that in this range, in the ultra-relativistic case, the scattered radiation is
unpolarized; but it must be emphasized that this conclusion applies specifically to a
linearly polarized incident photon. From (87.17) it is evident that, for a circularly
polarized photon in the ultra-relativistic case, ffi ^ & • cos d.
SCATTERING BY POLARIZED ELECTRONS
For polarized electrons, the calculation of the traces in formula (87.10) becomes
very laborious, though not difficult in principle. Here we shall give only some of the
final results of the calculation.!
In general, the cross-section depends both on the polarization parameters £ and
£' of the initial and final photons, and on the polarizations of the initial and final
electrons, described by vectors £ and £'. The dependence on each of these
parameters is linear. The cross-section has the form
da = idcr«, 6') + W(aiVai)2 do'{t • Çfe + f • « i + g • *'& + g' • Ç& + GM'k + •••},
(87.22)
where dcr(£, £') is the cross-section (87.12). All the terms which contain products of
two polarization parameters have been written out in (87.22). Terms containing
products of three or four parameters have been omitted; they are unimportant as
regards correlations between the polarizations of only two particles, and disappear
when the polarization parameters of the other two particles are equated to zero. The
following are the values of some of the coefficients in the laboratory system:
f = —m (1 - cos d)(k cos # + k'),
f = - — (1 - cos d)(k + k' cos #),
m
»-"^a-cosd^Ckcosd
+ kO-a + cosd) (o "*f,„(k~fc')1,
m
L
— to + Im
J
g' = -^(l-cosfl)[(k + k'cosfl)-(l + cosfl)
m
L
" + "'
o) — (Û + Zm
(87.23)
Ck-IO.].
J
The cross-section (87.22) contains no term of the form G • Ç. This signifies that
the polarization of the electron does not affect the total cross-section (summed
over (•' and £') for the scattering of unpolarized photons. There is also no term of
the form G' • £'. This signifies that, in the scattering of unpolarized photons, the
recoil electron is unpolarized.
We see also that the terms bilinear in the polarizations of the electron and
photon contain only the parameters & and $ which correspond to circular
t Further details may be found in the review articles by H. A. Tolhoek, Reviews of Modern Physics
28, 277, 1956; W. H. McMaster, ibid. 33, 8, 1961.
366
Interaction of Electrons with Photons
§87
polarization of the photon. The polarization vectors Ç and Ç' of the electrons appear
in the form of scalar products f • Ç, etc., which contain only the projections of these
vectors on the scattering plane. Hence, for example, the cross-section for scattering of a polarized photon by a polarized electron,
d<r{& 0 = dor(l) 4- \r2e(<o'la>)2C2f • £ do\
(87.24)
differs from dcr(Ç) only in that the photon is circularly polarized and the electrons
have a non-zero projection of the mean spin on the scattering plane. For the same
reason, the recoil electron is polarized only if the photon is circularly polarized; the
resulting electron polarization vector is then in the scattering plane:
f^fcg/F.
(87.25)
SYMMETRY RELATIONS
Finally, we shall show that the qualitative properties of the polarization effects
in the scattering of photons by electrons follow from the general requirements of
symmetry.
The parameter £2 of circular polarization is a pseudoscalar (see §8). Hence, from
the requirement of P invariance, terms <*£2 (or <x££) in the scattering cross-section
could occur only as the product of £2 with some pseudoscalar formed from the
available vectors k and k'.t But a pseudoscalar cannot be formed from two polar
vectors. It therefore follows that no such terms can appear in the cross-section.
The parameters £i and £3 of linear polarization are related to the components of
the two-dimensional (in a plane perpendicular to k) symmetric tensor
Saß=Wß
+ p{&)
In the present case, one of the polarization axes is taken to lie along the vector
v = k x k\ and the other lies in the plane of k and k' (along k x v for one photon and
along k ' x v for the other). Terms «£i could occur in the cross-section only as
products Saßva(k' x v)^ (or, equivalently, Saßvak'ß)y etc. But, since v is an axial vector, k
a polar vector, and Saß a true tensor, such products are not invariant with respect to
inversion. There are therefore also no terms <* £, (or « £j) in the cross-section. Terms
a
& (or a ^ ) , however, occur as products Saßvavßj etc., and are not forbidden by
considerations of symmetry.
Terms in the cross-section that are proportional to the electron polarization Ç
are not forbidden by parity: such terms could arise from the products £• v of two
axial vectors. They must, however, be absent in the first non-vanishing approximation of perturbation theory, here considered, because the scattering matrix
t We are considering the process in the laboratory system, where p = 0, p' = k - k\ It is evident that
the relevant consequences of the symmetry requirements (the presence or absence of particular terms in
the cross-section) will not depend on the choice of the frame of reference.
§87
Scattering of a Photon by an Electron
367
is Hermitian in that approximation (§71). Owing to this property, the square of the
scattering amplitude (and therefore the cross-section) is unchanged when the initial
and final states are interchanged. At the same time the cross-section must be
invariant under time reversal, i.e. interchange of the initial and final states together
with a change of sign of the momentum and angular momentum vectors of all the
particles; the Stokes parameters £i, £2, £3 are then unaltered (see §8). On combining
these two requirements, we find that in the approximation considered the crosssection must be unchanged by a change of sign of all the momenta and angular
momenta without interchange of the initial and final states, i.e. by the transformation
k-^-k,
k'-*-k',
£-*-£,
£'-♦-£'
(87.26)
with £ and £' unaltered.
The transformation (87.26) changes the sign of the product Ç • v, and such terms
therefore cannot appear in the cross-section. It must be emphasized, however, that
this prohibition is not a consequence of strict requirements of symmetry, and may
therefore no longer apply in higher approximations of perturbation theory.
Among the terms of the binary correlation between the polarizations of the
photons, only those of the form £i£3 and £2£3 are forbidden by parity, and none of
those of the photon-electron correlation are forbidden. But all terms of the form
£i&> £i£> 6£ are forbidden in the first approximation by the requirement of
invariance under the transformation (87.26). For instance, terms of the form £i£ 2
and £i£ could be formed (so far as parity is concerned) as scalars such as eiSQßkfaPß
and Saßk'aVßt, • k, but such combinations change sign under the transformation
(87.26).
The allowed correlation terms of'the form £2£ can be formed as products of the
tyPe 6£ • k- The electron polarization vectors appear in them only as projections on
the scattering plane.
Finally, a number of relations between the coefficients in the allowed terms
result from the requirements of crossing symmetry. Reaction channels which differ
by an interchange of initial and final photons correspond to the same process—
scattering of a photon by an electron. The squared modulus of the amplitude, and
therefore the scattering cross-section, must consequently be invariant under a
transformation which expresses the change from one of these channels to the
other:
k **-k\
e ++ e'*
with the electron momenta and polarizations unchanged. In three-dimensional
form, this transformation is
a>«+-a>\
k<-*-k',
(87.27)
The change in the sign of £2 is evident from the expression £2 = ie x e* • n, in which
the vector e x e* changes sign when e and e* are interchanged, while the vector
368
§88
Interaction of Electrons with Photons
n = k/o) is unchanged when k<-> - k, u>+* - a>. The transformation (87.27) does not
affect the electron momenta and therefore leaves the laboratory system unaltered.
Hence the cross-section (87.22) cannot change its form under this transformation,
and in fact the formulae (87.12), (87.22), (87.23) comply with this requirement.
§88. Two-photon annihilation of an electron pair
The annihilation of an electron and a positron (with 4-momenta p_ and p+) to
form two photons (k\ and kî) corresponds to two diagrams
k, _ — .
,
p.
k2
k
p+
.
m
■
1 — - -
J
— ■
P_
p+
(88.1)
These differ from the diagrams for scattering of a photon by an electron as follows:
P'-*~P+> fc —> — ki,
p-*p~,
(88.2)
k'-+k2..
The two processes are two cross-channels of the same (generalized) reaction. After
the changes (88.2), the kinematic invariants (86.2) become
s = (p--k,) 2 ,
t = (p_ + p+)2 = (k, + k2)2,
(88.3)
u = (P--* 2 ) 2 .
If the photon scattering is the s channel, then the annihilation is the t channel.
The quantity |M/,|2 for annihilation (averaged over polarizations of the electrons
and summed over those of the photons), when expressed in terms of the invariants
s and u, is the same as corresponding quantity for scattering, only the meaning of
the invariants being changed.! In the formula (64.23) for the cross-section, the
change s *-*■ t is needed in the coefficients of |M/(|2, and I 2 is now, according to
(64.15a), equal to Jf(t-4m 2 ). Making the appropriate alterations in formula (86.6),
we find the annihilation cross-section
m2 ds
2
I m
,
L\s-m
2
m
\
u-m)
1 (s - m 2 , u-m2\]
/M
A.
The physical region of the annihilation channel is region II in Fig. 7 (§67). For
given t (given energy in the centre-of-mass system), the range of variation of s is
t This takes account of the fact that the photons and the electrons have the same number of
independent polarizations (two), and it is therefore immaterial which correspond to the averaging of
|M/(| and which to the summation.
§88
Two-photon Annihilation of an Electron Pair
369
determined by the equation of the boundary su = m4. Together with the relation
s + t + u = 2m2, this gives
-l2t-W[t(t-4m2)]^s-m2^-\t
+ W[t(t-4m2)l
(88.5)
The integration of (88.4) is elementary; the result must be divided by two to
take account of the identity of the two final particles (the photons). Thus we have
g =
2 ^ 1 ) ((T2 + T~b
log
VT - V(r - i) "(T + 1 ) V [ T ( T " 1)3 }>
(886)
where T = Ulm2 (P. A. M. Dirac, 1930).
In the non-relativistic limit (r -*• 1), this gives
<r = My/(T-\).
(88.7)
In the ultra-relativistic case (T -> <»),
<r = (Wr2JT)(\og 4T - 1).
(88.8)
In the laboratory system, in which one particle (say the electron) is at rest
before the collision, the invariant T is
T = kl + 7),
7 = ejm.
(88.9)
Formulae (88.6)-(88.8) give as the dependence of the total cross-section on the
energy of the incident positron
- ^ P ^ l ° 8 [ ? + V<=
, -l)]-v^}.
(88.10)
In particular, in the non-relativistic limitt
cr = irr2elv+ (non-relativistic),
(88.11)
where v+ is the velocity of the positron.
In the centre-of-mass system the electron, the positron and the two photons have
equal energies, e = <o. The invariants are
m 2 - s =2e(£ - | p | cos 0),
m2-u =2e(e + |p|cos Ö),
t=4e 2 ,
(88.12)
where 6 is the angle between the momentum of the electron and that of one of the
t This formula becomes inapplicable, however, when v+^a and the Coulomb interaction of the
components of the pair cannot be neglected; cf. the end of §94.
370
Interaction of Electrons with Photons
§88
photons. Substituting in (88.4), we find the angular distribution of the annihilation
photons:
,
r]m2 f£ 2 + p2(l + sin2 0)
2p4 sin4 6 ] A
«*Q \-\\
da = -rn\—2
—*
T-;
, t r a — r z v \do(88.13)
4e|p| L e - p ' c o s ' 0
(e - pz cos 2 0)ZJ
In the ultra-relativistic case this has symmetrical maxima in the directions 0 = 0
and 0 = IT. Near 0 = 0, we have
to * i w l \
U (ultra-relativistic).
2e (0 + m le )
(88.14)
The total cross-section is obtained from (88.6):
^ =^ i ^ [ ^ l o g | ^ - 2 ( 2 - ,
2
) ] ,
(88.15)
where v = |p|/e = V ( e 2 - m2)/e is the velocity of the colliding particles.
We shall not discuss here the details of the polarization effects in annihilation^
but merely consider certain qualitative features of these effects in the limiting cases
where the velocity v of the colliding particles is large or small. The process will be
considered in the centre-of-mass system.
In the limit v -> 0, only the state with orbital angular momentum of relative
motion Z = 0 gives a non-zero contribution to the cross-section. But the S state of
the electron + positron system has negative parity (§27, Problem). In odd states of a
two-photon system, their polarizations are orthogonal (§9). The same must therefore be true of the annihilation photons in the non-relativistic case.
If the electron and positron are polarized, their annihilation is possible (again in
the non-relativistic case) only if their spins are antiparallel: since the annihilation
occurs in the S state, the total angular momentum of the system is equal to the
total spin of the particles, which is 1 when the spins are parallel. The two-photon
system, however, has no state with total angular momentum 1 (see §9).
In the ultra-relativistic limit (v -» 1), the annihilation of a longitudinally polarized
(helical) electron and positron is possible only when their helicities have opposite
signs.$ In this limit, helical particles behave as neutrinos (see the end of §80), and
the electron and positron undergoing annihilation must be analogous to a neutrino
and an antineutrino, whence the result stated follows.
The annihilation of an electron and a positron with the same helicity occurs, in
the ultra-relativistic case, only when terms containing m are taken into account. The
amplitude of this process differs, in order of magnitude, by a factor m/e from that
of the annihilation of a pair with parallel spins; the cross-section accordingly differs
by a factor (m/e) 2 .
t See W. H. McMaster, Reviews of Modern Physics 33, 8, 1961.
% Since the directions of the particle momenta are also opposite (in the centre-of-mass system),
helicities of opposite sign correspond to parallel spins.
§89
Annihilation of Positronium
371
PROBLEM
Find the cross-section for the formation of an electron pair in the collision of two photons (G. Breit
and J. A. Wheeler, 1934).
SOLUTION. This is the process inverse to the two-photon annihilation of an electron pair. The
squared amplitudes are the same for the two processes, and their relationship to the cross-section differs
only in that here I2 = (kik2)2 = it 2 . Hence
A
A
«CTform = ÖO"ann
t~4m*
'
.
In the centre-of-mass system (t = 4e2 = 4<o2),
d(T(orm
=
V
do"ann,
where v is the velocity of the components of the pair. In integrating to obtain the total cross-section, the
result is not to be divided by 2 (as in the case of annihilation), because the two final particles (electron
and positron) are not identical. Hence, in the centre-of-mass system,
CTform
=
2\)
Œann
= We{\ - i>2){(3 - v4) log Y^
- 2t>(2 - v2)].
(1)
In an arbitrary frame of reference K, in which the two photons k\ and Jc2 are moving in opposite
directions, we have (from the invariance of kifo)
__
2
0>lû)2 = û> ,
where eu is the energy of the photons in the centre-of-mass system. Since this energy is equal to that of
the pair components, we have a> = e = m/V(l - v2). To change to the frame K, we must therefore put in
(1)
t? = V ( l - m2/cüi(ü2).
§89. Annihilation of positronium
Owing to the conservation of momentum, the annihilation of the electron and
positron in positronium must be accompanied by the emission of at least two
photons. Such a decay is possible (in the ground state), however, only for
parapositronium. In §9 we have shown that the total angular momentum of a
two-photon system cannot be 1. Hence orthopositronium in the 3Si state cannot
decay into two photons. Moreover, since positronium in the 3S\ state is a chargeodd system (see §27, Problem), Furry's theorem (§79) shows that it cannot decay
into any even number of photons. In the ! S 0 state, on the other hand, positronium is
charge-even, and the decay of parapositronium into any odd number of photons is
therefore forbidden.
The main process which determines the lifetime of positronium is therefore
two-photon annihilation for parapositronium and three-photon annihilation for
orthopositronium (I. Ya. Pomeranchuk, 1948). The decay probability can be related
to the cross-section for annihilation of a free pair.
The electron and positron momenta in positronium are ~~me2lhy i.e. small
compared with mc. Hence, in calculating the probability of annihilation, we can
372
Interaction of Electrons with Photons
§89
take the limit of two particles at rest at the origin. Let &2y be the cross-section for
two-photon annihilation of a free pair, averaged over the spin directions of both
particles. In the non-relativistic limit, according to (88.11),t
ä2y = 7T(e2lmc2)2clv,
(89.1)
where v is the relative velocity of the particles. The annihilation probability w2y is
obtained on multiplying â2y by the flux density u|i/f(0)|2. Here i/f(r) is the wave
function, normalized to unity, of the positronium ground state:
*(r) = ^
^
e~ri\
a = 2ft2/me2;
(89.2)
the Bohr radius a for positronium is twice that for the hydrogen atom, because its
reduced mass is half as great. This probability, however, corresponds to the initial
state averaged over spins, whereas in positronium, of the four possible spin states
of a two-particle system, only one (with total spin 0) can undergo two-photon
annihilation. Hence the mean decay probability w2y is related to the parapositronium decay probability vv0 by w2y = Jw0, and so
vvo = 4 | i K O ) | W 2 , W
(89.3)
Substituting the values from (89.1), (89.2), we obtain for the lifetime of parapositronium
T0 = 2ft/mcV = 1.23 x 1(T10 sec.
(89.4)
It should be noticed that the level width T0 = hlr0 is small compared with the
level energy
|Egr| = m e W = m c 2 a 2 / 4 .
For this reason positronium may be regarded as a system in a quasi-stationary
state.
Similarly we find that the decay probability for orthopositronium is related to
the spin-averaged cross-section for three-photon annihilation of a free pair by
Wi = l * 3 7 H | * ( 0 ) | W 3 7 U
(89.5)
the statistical weight of a state with spin 1 being i Anticipating, we may mention
that
t^M^JJ^.
7
5v
\mc1/
(89.6)
The lifetime of orthopositronium is therefore
T
«=Ä)^ = 1 - 4 x l 0 " , s e c '
t Formulae (89.1M89.7) are written in ordinary units.
(89 7
->
§89
373
Annihilation of Positronium
The inequality T i ^ E ^ I is here, of course, satisfied even more markedly than for
parapositronium.
Let us now calculate the cross-section for three-photon annihilation of a free
pair (A. Ore and J. L, Powell, 1949). According to (64.18), the cross-section in the
centre-of-mass system is expressed in terms of the squared amplitude by
=
da
iy
r
—AT
ô k + k
<i
2 + k3)S(coi + co2 + co3 - 2m) 0
41
.^
r
z—,
(27T) 2O>I • 2Ù>2 • 2CÜ3
(89.8)
where, according to (64.16), I = 2m \mv = m v, v being the relative velocity
(assumed small) of the positron and the electron; ki, k2, k3 and (ou o>2, O>3 are the
wave vectors and frequencies of the photons formed; the delta functions express
the laws of conservation of energy and momentum. Because of these laws, the
three frequencies <oi, <o2, <Ü3 must be represented by the lengths of the sides of a
triangle with perimeter 2m. Thus the magnitudes of the momenta ki, k2, k3 and the
angles between them are entirely determined by specifying two frequencies.
The three-photon annihilation corresponds to the diagram
2
k,
— ^ - - - j — . — p_
—
k2
k3
— - - - 1 — — -
P +
and a further five diagrams obtained from it by interchanging th£ photons ki, k2,
ky. The amplitude may be written
Mfl = (47r)3/2e(A3)*eï)*et,)*û(- p+)Qx,"u{p-),
(89.9)
Q ^ = S7ÄG(krP+)7^(p--k,)7f,
(89.10)
where
int.
the sum being taken over all interchanges of the photon numbers 1, 2, 3 together
with corresponding simultaneous interchanges of the tensor indices A, jx, v. The
squared modulus of the amplitude, averaged over the polarizations of the electron
and the positron and summed over those of the photons, is
4 2 \Mfl\2 = (4TT)3 trti+Q^p-Q^l
polar.
(89.11)
where
P- = 2(yp - + m),
p+ =
i(yp+-m).
The matrices Q*"" differ from the matrices QAM" in that the order of the factors is
374
Interaction of Electrons with Photons
§89
reversed in each term of the sum. In the limiting case considered, where the
electron and positron velocities are small, their 3-momenta p_ and p+ may be taken
as zero, putting p= p+ = (m, 0). Then the electron Green's functions are
G(p
-
k,)
"(p--k,)2-m2
= 2 ^
'
etc., and the density matrices reduce to
p? = i m ( y ° ± l ) .
A large number of terms arise on carrying out the multiplication in (89.11), but
the number that need to be calculated can be greatly reduced by making full use of
the symmetry with respect to interchanges of photons. For example, it is sufficient
to multiply out the six terms in Qk^p (89.10) each with only one term in QkßV. In the
six traces then remaining, we can again select certain parts which are transformed
into one another by various interchanges of photons. The products of the 4-vectors
p, ki, k2, ^3 which occur when the traces are expanded can all be expressed in terms
of the frequencies a>i, to2, co3. Since p = ( m , 0), we have pk\ = m<ou
The
products k\k2,... are determined from the equation of conservation of 4-momentum: 2p = k\ + k2 + k3; for example, writing this equation in the form 2p-Jc 3 =
k\ + k2 and squaring, we have
k1k2 = 2 m ( m - û > 3 ) , . . . .
(89.12)
The result of the calculation, which is still fairly lengthy, is
i 2 |M,p-(4-)»««. i6f(2i^i) 1 + (a^y + (a^ö) 2 ].
pSïar.
L\
k>2^3 /
V ü)iu) 3 /
V CüiCÜ2 / J
Substituting this expression in (89.8), we obtain the differential cross-section for
three-photon annihilation:
27
IT m
v
L\
Û^ 2 U) 3
/
V (OIÜ)3
/
\
CUICÜ2 /
x S(k, + k2 + k3)fi((o, + a>2 + a> 3 - 2m)
1
J
x
d k l d k2 d k
\
(89.13)
The delta functions have still to be eliminated. The first is removed by
integrating over d3k3, and we then write
d 3 ki (Pkj^
ATTQ)}
doi\ •
2TTÜ)2
d(cos Ö12) dcu2,
where Bn is the angle between k, and k2; it is assumed that the integration has
already been performed over the directions of k, and the azimuth of k2 relative to
§89
375
Annihilation of Positronium
ki. Differentiating the equation
Û>3 = V(û>î + (02 + 2(O\0)2 COS #12),
we find
d COS 0i2 = (0)3/0)1(02) d(tii.
The second delta function is removed by integrating over dcjj. The resulting
cross-section for annihilation with formation of photons having specified energies
is
1 Se6 \(nLz^)2+(nL^i)2+
6 vm2 t \ (OiO)2 /
\ (Oi(03 /
däyy = -
(HL^I)2}
\ O)2(03 / J
dùh dû)2;
(89 .i4)
the factor 1/6 has been included in order to take account of the identity of the
photons in the subsequent integration over frequencies (cf. the third footnote to
§64).
Each of the frequencies (ot, o>2, 0)3 can take values between 0 and m; the latter
can be reached by two frequencies when the third is zero. For given o>i, the
frequency (y2 varies between m-oj\ and m. Integrating (89.14) over do)2 between
these limits, we obtain the spectral distribution of decay photons:
dv^ = (8e73i>m3)F((o,) <fo)i,
nv x o)i(m -0)1) , 2m -o>i , f2m(m -(oQ 2m(m-o)i) 2 ] 1 m-o)i
(2m-o>i)i+"-aTr~+L
^
(2m-o),) 5 J l 0 g m
F(û>,)=
The function F((oi) increases monotonically from zero when (01 = 0 to unity when
toi = m, and is shown graphically in Fig. 14.
The total annihilation cross-section is obtained by integrating (89.14) over both
F(o),)
02
04
FIG.
06
14.
08
■10
376
Interaction of Electrons with Photons
§90
frequencies:
V}y7 =
4e6
3vm 2
^
J0
m
n
0)\0)2
J
d^i dù)2
The value of the integral is (7r - 9 ) / 3 , and we thus return to formula (89.6).
§90. Synchrotron radiation
According to the classical theory (Fields, §74), an ultra-relativistic electron
moving in a constant magnetic field H emits a quasi-continuous spectrum with a
maximum at the frequency
ù)-coo(elm)\
(90.1)
where
o>0 = i>|e|H/|p|
Ä
|e|H/e
(90.2)
is the frequency of revolution of an electron having energy e in a circular orbit (in
a plane perpendicular to the field).! We shall assume that the longitudinal velocity
of the electron (parallel to H) is zero, as can always be achieved by a suitable choice
of the frame of reference.
Quantum effects in synchrotron radiation originate in two ways: from the
quantization of the motion of the electron, and from the quantum recoil when a
photon is emitted. The latter is determined by the ratio hale, and this must be
small if the classical theory is applicable. It is therefore convenient to use the
parameter
where H 0 = m2l\e\h (= m2c3l\e\h) = 4.4 x 1013 G. In the classical case, \ ~ M e « 1.
In the opposite limit (^ > 1), the energy of the emitted photon hco — s, and (as we
shall see below) the significant region of the spectrum extends to frequencies at
which the electron energy after the emission is
e'-mHolH.
(90.4)
If the electron remains ultra-relativistic, the field must satisfy the condition
HIH0<èl.
(90.5)
The quantization of the electron motion itself is expressed by the ratio hcoole;
hcoo is the interval between adjacent energy levels for motion in a magnetic field.
t In this section we shall put c = 1 but retain factors of h.
§90
Synchrotron Radiation
311
Since
hioole = (H/Ho)(m/e)2,
it follows from (90.5) that ÄCÜ0 < e, i.e. the motion of the electron is quasi-classical
for all values of x- T n a t »s> t n e non-commutativity between the operators of
dynamical variables of the electron (quantities of order ho)0le) may be neglected,
while the non-commutativity of these operators with those of the photon field
(quantities of order hale) is not neglected.t
The quasi-classical wave functions of stationary states of an electron in an
external field can be put in the symbolic form
* = y/QÏÏ)U(P)
<r(i
"°*'<Mr)'
(906)
where </>(r)~ exp(iS/ft) are the quasi-classical wave functions of a spinless particle
(S(r) being its classical action); u(p) is the operator bispinor
"(P) = /
W(H + m)w
obtained from the bispinor plane-wave amplitude u(p) (23.9) on replacing p and e
by the operators^
p = P - e A = -iftV-eA,
H=V(p 2 + m2),
where P is the generalized momentum of the particle in a field with vector potential
A(r). The order of the operator factors in ip is immaterial, since their noncommutativity is neglected, and the spin state of the electron is determined by the
three-dimensional spinor w.
In order to calculate the probability of photon emission in the quasi-classical
case, it is more convenient to start not from thefinalformula (44.3) of perturbation
theory but from a formula in which the integration with respect to time has not yet
been carried out. For the total (over all time) differential probability we have§
00
d* = £ Kf 0p,
op = J Vfl(t) dt
(90.7)
— 00
t The full solution of the quantum problem of synchrotron radiation was first given by N. P.
Klepikov (1954), and the first quantum correction to the classical formula by A. A. Sokolov, N. P.
Klepikov and I. M. Ternov (1952). The derivation given here, which explicitly makes use of the fact that
the motion is quasi-classical, is due to V. N. Baîer and V. M. Katkov (1967). A similar method had been
used earlier by J. Schwinger (1954) to derive the first quantum correction in the radiation intensity.
t In this section, unlike Chapter IV, the generalized momentum is denoted by the capital letter P,
while p denotes the ordinary (kinetic) momentum.
§ Putting Vn(t) = V/i exp(ia>/jt), we find a/, = 2TTV/,-6(û>/i), and, since the squared delta function is to be
taken as [S(o>)] -> (t/27r)6(ü>), where t is the total observation time (cf. the derivation of (64.5)), we obtain
from (90.7) the formula (44.3) for the probability per unit time.
378
Interaction of Electrons with Photons
§90
(cf. QM, (41.2)); the summation is over final states of the electron.
Using (90.6), we can write the matrix element V/j(0 for emission of a photon o>,
k in the operator form
V/,U)
V{2f^))rfe
V(2H)\€
(C
a)
V(2H)
*
'
where the operators in the square brackets act to the left; the photon field is taken
in the three-dimensionally transverse gauge. The factors cxp(±iHtlh) convert the
Schrödinger operators between them into explicitly time-dependent operators of
the Heisenberg representation. We can write Vfi(t) in the form
V/,(0 = ^ | ^ < / | Q ( 0 | i ) e i where Q(t) denotes the Heisenberg operator
Q(t) = J&ÊL (a. e«)6-«k.K„ J I K P L ,
V(2H)
V(2H)
(90.8)
and the matrix element is taken with respect to the functionsfa,fa.
The summation in (90.7) is taken over allfinalwave functionsfa,an d is effected
by means of the equation
^4>*f{T')<f,f(r) = 8(T'-T),
which expresses the completeness of the set of functionsfa.The result is
dw = ^ 0 J A . J A 2 . e"-("-^<(|0+(r2)Q(f,)|i).
(90.9)
If the integration is over a sufficiently long time interval, fi and t2 can be replaced
by new variables
T = t 2 -tl,
t=kU+h\
and in the integral over t the integrand may be regarded as the probability of
emission per unit time. Multiplying by h<o, we obtain the intensity
dl = - ^ d\ [ e~™(i\Q\t +i<r)Q(f ~h)\i) dr.
4TT
J
(90.10)
An ultra-relativistic electron radiates into a narrow cone at angles 0 — m/e
relative to its velocity v. The emission in a given direction n = k/co therefore occurs
over a section of the path in which v turns through an angle ~-m/e. This section is
traversed in a time T such that T|V| ** TCO0- m/e < \. This region gives the principal
§90
379
Synchrotron Radiation
contribution to the integral over r. In the subsequent calculations, we shall
therefore expand all quantities in powers of O>0T. It may, however, be necessary to
retain more than just the leading term in the expansion, because of cancellations
which occur since 1 - n • v -~ 02 ~ (m/s) 2 .
If the operator Q*Q is reduced to a product of operators which commute (to
the necessary degree of accuracy), the taking of the diagonal matrix element
( ï | . . . |î) is equivalent to replacing these operators by the classical (time-dependent)
values of the corresponding quantities. This is achieved in the following way.
According to the foregoing discussion, in the expression for Q(t) only the
non-commutativity of the electron operators with the photon field operator
e x p ( - ik • f(0) need be taken into account. We have
p * - * • ' = g- ik -*(p-ftk),
H(p)e-
ikm
ik f
* = e- ' H(p-hk).
1
>
(90.11)
These formulae follow because e~tke is the displacement operator in momentum
space. Using (90.11), we can take the operator e'lk'Ht) out on the left in
(90.8), and write 0 ( 0 in the form
Q{t) = e^'WRit),
v
R(t)=
U {p
j ? (tt-e*)-7^,
V(2H')
V(2H)
(90.12)
where H ' = H - fto>, p' = p - hk.
Then
Q2Öi=Ä2eik •''«-*•'*!;
(90.13)
here and henceforward, the suffixes 1 and 2 denote the values of quantities at the
times ti = t — 2T and t2-t + 2r. It remains to calculate the product of the two
non-commuting operators elk *2 and e~ik*h. This product itself may be regarded as
commuting with the remaining factors.
We write
£(T) = e - ^ e l k ' * ' e - | k - \
(90.14)
this being the combination of operators which appears in (90.10). The operator eiArlh
is a time-shift operator, and so
£ l k * * 2 = ßifol* £ l k * i
e-iÛrl*
Substituting this in (90.14) and noting that eik't{ is a displacement operator in
momentum space, we find
L ( T ) = exp{i[H - M r / f t } e x p H H ^ - hk)rlh}.
(90.15)
Differentiating (90.15) with respect to T and again using the properties of the
380
§90
Interaction of Electrons with Photons
time-shift operator, we havet
dLldr = (i/ft) exp{/(H - hco)rlh}[H - hco - H(fa - hk)] x
x exp{-iH(p, - hk)rlh} = (Hh)[H -hoy- H(p 2 - ftk)]L(r).
(90.16)
Having thus made use of the non-commutativity of the operators, we can
replace all the operators by the corresponding classical quantities (the Hamiltonian
H by the electron energy e). We have identically
e(p2~hk) = [(p2-hk)2 + m2]m
= [(e-hoj)2 +
2h((oe-k-p2)]m.
The difference
cos — k • p2 = o>e(l - n • v2)
is small, since from the above analysis 1 - v • n — (m/s)2. As far as the first order in
this difference,
e(p 2 - hk)^ e' + (ele')h(u) - k • v2),
where e' = e - too. From (90.16), we now find the differential equation for L(T):
indLldr = (ele')h(œ - k • v2)L.
(90.17)
This equation is to be solved with the obvious initial condition L(0) = 1. Since
T
o
v2dT = r 2 -ri,
we have
L(T) = exp{i(e/e')(k • r 2 -k • r, - WT)}.
(90.18)
So far, no use has been made of the specific form of the electron trajectory.
Now expressing r2-ri in (90.18) in terms of pi by means of the equation of motion
of the electron in the plane perpendicular to the field H (see Fields, §21):
eHT\
Pi . eHr , pi x H /,
r ; -r, = JLs.n — + ^ ( l - c o s — ) ,
and expanding in powers of T gives
k • ( r j - r , ) - Û>T « «r|(n • v , - 1) + T £HJJL£Ü- T
2
^},
(90.19)
where in the last term we have put n • vi *» 1.
t Because of the conservation of energy, the Heisenberg operators H(pi) and Ufa) are the same; we
therefore omit the arguments of H in such cases. However, H (pi - hk) is of course not the same as
Ä(fr-Äk).
Synchrotron Radiation
§90
381
We next transform the remaining factors in (90.13). A direct expansion of the
product in R(t), using the matrix a from (21.20), leads to'
R(t) = w*fe* • (A + iB x <r)Wi,
. - , / l 1\ e + e'
\e + m
e +m/
(90.20)
le
where p'(0 = p(0~^k; terms of higher order in m/e are omitted. Thus we have
finally
(90.21)
RfJRj = tr |(1 + Ç, • <r)(A2 - ÎB2 x a) • e • |(1 + & • a)(A, + iB, x a) • e*.
The factors Kl + Ç' or) are two-rowed polarization density matrices of the initial
and final electron.
Let us consider the radiation intensity summed over the polarizations of the
photon and of the final electron, and averaged over the polarizations of the initial
electron. These operations give, after a simple calculation,!
U««-^(—>*^),(7)'
With sufficient accuracy we can put
Vi • V2 = V — JTV + JTV • V
1
= 1
m
1 2
2
f-2<0oT .
Substitution of these expressions in (90.21) and thence in (90.10) gives
e2
(11 = — -A—5 o) 2 day don x
00
x
J (^ + £ ^ < "« T J ) e "p{- ! T 1 ('- n - v + ^»)} d T -
—00
O0-22'
t This calculation makes use also of the following result. In the summation over e,
2 (vi ' *)(V2 • e*) = vi • V2 - (vi • n)(v2 • n).
c
On substituting (90.21) in (90.10), we can integrate by parts, noting that
(v1.n)exp(-ii7k.r1) = ^ ^ e x p ( - , f 7 k . r 1 ) ,
and similarly for V2 • n. Consequently, in the remaining integration vi • n and v2 • n can be replaced by
unity.
§90
Interaction of Electrons with Photons
382
This formula shows the frequency and angular distribution of the radiation intensity.
To find the frequency distribution, we integrate over don. If the direction of v is
taken as the polar axis, with an angle # between n and v, then
n • v = v cos #,
don = sin d dd d<f>,
and
f
ficure
) ,
lire'
/ia)Ts\
f
/
io>Te\)
fexp(—) - exp(—-)j.
J e x p ( — n • T ) do. = ^
When this is substituted in (90.22), only the first term need be retained, since the
second term yields a faster varying exponential (with a factor 1 + v « 2 instead of the
small 1 - v = m2/2e2). Hence
dl
ie2<o f (m1 , e2+e'2
f icrre /,
\
, T2 2\1 .
According to the integral representation of the Airy function 4> (see QM, §b), the
first term reduces to the integral of the Airy function, and the second term to its
derivative. The final result is
X
x = ( W e » 2 ' 3 = (m 2 /e 2 )(ecu/eW 3
(90.24)
(A. I. Nikishov and V. I. Ritus, 1967). The frequency distribution has a maximum
when x ~ l ; for ^ < 1 we find (90.1), and for x > l (90.4). In the classical limit,
ho) ^ e' = e, x = ((olü)Q?(mle)2; the second term in the round brackets is small, and
(90.23) becomes the classical formula (Fields, (74.13)).
Figure 15 shows diagrams of the frequency distribution for various values of xThe quantity
1
dl
3/cl/2d(ü>/o)c)
is plotted against ct>/a>c, where
The quantity Jci is the classical value of the total radiation intensity; cf. Fields,
(74.2).
§90
383
Synchrotron Radiation
To calculate the total radiation intensity, (90.23) must be integrated with respect
to to from 0 to e. We change to integration with respect to x, noting that
-'('"ï+M'
ha)
and x therefore varies from 0 to °°. With two integrations by parts in the first term
in (90.23), we find
1
- ~ ww J ( l + W
0
*(x)x dx-
(90 25)
-
Figure 16 shows a graph of the function I(x)lh\When x<^l, the important region in the integral is x — 1. Expanding the
integrand in powers of x a n d integrating by means of the formula
j x'V(x) dx = - ^
o
3{Av'mT(\v + l)T(\v +1),
we obtain
55V3
/ =/ „ ( l - ^ *
+
48*'-...).
(90.26)
When x ^ 1» the important region is that in which *x3'2 ~ 1, i.e., x «^ 1. In the first
approximation, we can therefore replace $'(*) by 4>'(0) = - 3l/6r(j)/2Vir, and the
§90
Interaction of Electrons with Photons
384
1U
09
08
07
06
I
o«
04
03
02
01
05
15
10
3X
FIG.
16.
integration then leads to the result
1
m%eW
243ft2
V X)
m
(90.27)
Synchrotron emission causes the occurrence of a polarization of electrons
moving in the field (A. A. Sokolov and I. M. Ternov, 1963). To discuss this, we
have to find the probability of a radiative transition with spin reversal.
Putting in (90.21) Ç, = -£/ ■ Ç, |£| = 1, we have
RÎK, = B, • B 2 - (e* • B,)(e • B2) - (e* • B, x Ç)(e • B2 x Q - i(C ■ e*)(e • B, x B2).
Summation over polarizations of the photon gives, after a simple calculation,
2 i?!R, = (B, • B2)(l - ({ • n)2) + ({ • n)(n • B.XÇ • B2) +
c
+ (£ • n)(n • B2)(£ • B,) - i(£- n(n • Q) • B, x B2.
(90.28)
We shall assume that \ ^ 1 anc* s e e ^ ° n ly the principal term in the expansion of
the probability in powers of ft. Since the expression (90.28) (with B given by (90.20))
already contains ft2, all the remaining quantities e\ including those in the exponent
in (90.18), can be replaced by e.
§90
385
Synchrotron Radiation
With the expansions
Bl ,£(.-„ +!T * + „a).
B2 =
27l n - v - i T V + v 7>
r2 - ri =
TV
+ 24 v,
and substituting (90.28) in (90.21) and thence in (90.10), we find the differential
transition probability per unit time (dw = dIlho>). The integration over d3k is carried
out by means of the formula
J
d\
„.„ ,
4ir
(90.29)
where in this case
Xo = T,
x = r2 - r,,
X -Xo
X -T ^ +
l2
j
The result of the calculation is
W=
«^[m)
Wo
f(l + zVl2)4?"Ï2?+(? + Ï2?) ( ^ v) -?^o^ V><V J'
where z = rcooWm and the contour of integration passes below the real axis and is
closed in the lower half-plane. After this integration we finally obtain for the total
probability of a radiative transition with spin reversal
w =
5V3<* h2 /e\5
,/.
lt0,
8V3
ifyfey-K-W-^HO'
where fn = ^ • v, C± = i' H/H. This formula is valid for both electrons (e < 0) and
positrons (e > 0).
The probability (90.30) is independent of the sign of the longitudinal polarization
£ll but depends on that of £x. The polarization resulting from the emission is
therefore transverse.! For electrons, the probability of a transition from a state
with the spin parallel to the field (£x = 1) to a state with the spin antiparallel to the
field is greater than that of the opposite transition. The radiative polarization of the
electrons is therefore antiparallel to the field, and the degree of polarization in a
t This is also evident from the fact that the axial vector of the resultant polarization must be along
H, which is the only axial vector occurring in the problem.
386
Interaction of Electrons with Photons
§91
stationary state is (when fj = 0)
w(£i = - l ) - w ( £ i = 0
wtf± = - l ) + w ( £ ± = l )
=
8V3
15
U
^'
Positrons are polarized, to the same degree, parallel to the field.
§91. Pair production by a photon in a magnetic field
The production of an electron-positron pair by a photon in a magnetic field, and
synchrotron radiation, are two cross-channels of the same reaction. The amplitude
Mfi of the pair production process is therefore found from the synchrotron
radiation amplitude by simply making the changes
e , p - > - e+, - p + ;
e',p'-» e_,p_;
<u, k - * - co,-k.
(91.1)
Here e-, p_ and e+, p+ are the electron and positron energies and momenta; e, p and
E\ p' the initial and final energies and momenta of the electron in synchrotron
radiation. In terms of angles and magnitudes, the momenta are transformed
according to
IPMP+I>
IP'MP-I,
0->7T-0+,
» # ->e-,
*->*-*,
(91.2)
where d± are the angles between p ± and k, <f> the angle between the kp+ and kp_
planes.
For synchrotron radiation, the cross-section is given in terms of the amplitude
byt
dff = M
l ''| ! 8J^ S(e -*'-"> 1 ^
;
< 9U)
see (64.25). The delta function is eliminated by integrating with respect to e'. Since,
in the present case, p' and k are independent variables, and
d3p' = | p V de' do\
d'k = <o2 d<odok,
we have simply to substitute
8(e - e' - a)) d*p' d*k -> co2|p'N' dok do1 dio.
Then
da = |M/f|2 gQ^jpi àok do da>.
t In this section we again put h = 1 as well as c = 1.
I
(91.4)
§91
Pair Production by a Photon in a Magnetic Field
387
For pair production by a photon, the cross-section is given in terms of the
amplitude by
d
da = \Mfi\2 r - ^ — S(o) - 6 + - g _) d'P(;
>-
or, after elimination of the delta function,
dv = IM/,12 J^%1O\ZTT)
do+ do_ de+.
(91.5)
it)
Comparison with (91.4) shows that, to obtain the pair production cross-section
from the synchrotron radiation cross-section, we have to make in (91.4) the
changes (91.1), multiply by
(p2+/û>2) dejda),
(91.6)
and replace do' dok by do+ do-.
In the ultra-relativistic case a> > m,t this can be done in the formulae derived in
§90. Here it is assumed that both particles in the pair are ultra-relativistic; it is
easily verified that all the approximations used in §90 then remain valid.
In particular, the probability of pair production by an unpolarized photon,
summed over the electron and positron spin projections and integrated over the
directions of emergence of the electron, is found on making the changes (91.1) in
(90.22) (or rather in the expression for dJ/dw), with d*k = o>2 dw don replaced by
dW-
.
00
e2 d3p+ f ( m2
el + el
2
2\ v
x e x p { ^ ( l - n . v + +^wl)}dr.
(91.7)
where a>o+ = |e|H/e+, and n is a unit vector parallel to the photon momentum, which
lies in the plane perpendicular to the magnetic field. The integration is carried out in
the same way as in §90, and (since (91.7) depends only on the angle between n and
v+) it does not matter whether we integrate over do+ or over don. The result can
therefore be obtained directly by analogy with (90.23):
00
dw =
v T ^ {/ ^ d* + (f " K Vjc)*'<*>}»
(918)
t More precisely, we must have eu sin d > m, where d is the angle between k and H; when d = 0, no
pairs are formed. In what follows, we shall take d = fa.
388
§91
Interaction of Electrons with Photons
where now
x = (m3ù>l\e\He+e-)\
}
\
K = \e\H(olm3 (= ft2|e|Hû)/mV). J
(91.9)
The total pair production probability per unit time is found by integrating (91.8)
with respect to e+; in view of the obvious symmetry in e+ and e- = <o - e+, it is
sufficient to take twice the integral from 0 to \u. Changing variables from e+ to x and
integrating by parts in the first term in (91.8), we obtain
?H
V^
f
J
f(x3/2-4/K)1/2
3(X3/2-2/K)<J>'(X)1
73
<Pix) ül ili
I
P
-x \x -4lK)ili\dx
(9U0)
(A. I. Nikishov and V. I. Ritus, 1967).
In the limit of weak fields (K <^ 1), values of x near the lower limit are important
in the integral (91.10). Since these values are large, we can use the asymptotic
expression for the Airy function,
$(*)~2P*expHx 3 / 2 );
see QM, §b. With the variable of integration V = X 3 / 2 - 4 / K , and putting y = 0
wherever possible, we find by calculation
W=
^re_8/3K'
(9U1)
K<L
The exponential decrease of the probability as K -* 0 corresponds to the impossibility of pair production in the classical limit.
In the opposite limit of strong fields (K > 1), only the second term in (91.10) is
important, and it is governed by the range of x for which 'XW~HK<\.
In that
range, <ï>'(x) may be replaced by $'(0) = - 3l,6r(20/2irl With the value of the integral
| y"'(y - i r _ l dy = I V - jOroo/IV),
we find
W
- 2* W^/6)
^
=
0.38|e|'H/mK "',
K>l.
The function mw(K)/|e|3H has a maximum value of 0.11 at K « 11.
(91.12)
§92
Electron-Nucleus
Brems Strahlung
389
§92. Electron-nucleus bremsstrahlung. The non-relativistic case
This section and those following are concerned with the important phenomenon
of bremsstrahlung, the radiation emitted in a collision between particles. We shall
first consider a non-relativistic collision between an electron and a nucleus,
assuming that the nucleus remains at rest; that is, we consider radiation from the
scattering of an electron in the Coulomb field of a fixed centre (A. Sommerfeld,
1931).
We begin from formula (45.5) for the probability of dipole radiation:
dw = (co3/27T)|e* . d/i|2 dok.
(92.1)
In the present case, the initial and final states of the electron belong to the
continuous spectrum, and the photon frequency
cü=(l/2m)(p 2 -p' 2 ),
(92.2)
where p = mv and p' = mv' are the initial and final momenta of the electron. If the
initial and final wave functions of the electron are normalized to "one particle per
unit volume' 1 ( V = 1) the expression (92.1), on multiplication by d3p7(27r)3 and
division by the incident flux density D/V = D, will give the cross-section do\ P ' for
emission of a photon k into the solid angle dok with scattering of the electron into
the range of states d3pf. Replacing the matrix element of the dipole moment d = er
by that of the momentum:
we can write the expression for the cross-section in the formt
<*ov = {2°fmp
|e* ' P/.I2 dok d'p\
(92.3)
where
P/i = J tWi
à?x = - ii J 0fV<fo d3x.
For «/>, and ty we must use the exact wave functions in an attractive Coulomb field,
whose asymptotic form consists of a plane wave and a spherical wave. The
spherical wave must be ingoing in «/// and outgoing in ^ (see QM, §136). These
functions are
ifc = A, el>rF(iv, 1, i(pr - p T ) ) ,
iiff = Af eip' r F ( - i V , 1, -i(p'r + p' • r)),
t In this section, p and p' denote |p| and |p'| respectively.
V = Ze2mlp ;
v' = Ze2mlp', I
(92.4)
390
Interaction of Electrons with Photons
§92
with the normalization factors
A, = ew"/2r(l - iv),
Af = e*v'l2Y(\ + iv').
(92.5)
Since
VFOV 1, i(pr - p • r)) = i(pr/r - p)F'
r \dp)v'
we can write the gradient v>, as
' * - * * - * « * • ' £ (ff).On multiplication by t/rf and integration, the first term vanishes, because «/f, and if/f
are orthogonal. The matrix element pfi is therefore
p/, = iAiAfpdJldp,
(92.6)
where J denotes the integral
J = j ~-^ F(iV, 1, i{p'r + p' • r))F(iv, 1, i(pr- p • r)) d3x,
(92.7)
q = p' - p.
The symbol ô/ap has been taken outside the integral, with the understanding that, in
the differentiation of J, the quantities v, v', q are to be regarded as independent
parameters, v and q being expressed in terms of p only after the differentiation.
The integral is calculated by replacing the confluent hypergeometric functions
by their expressions as contour integrals. Here we shall give only the result:t
J = BF0V.ii/, l,z)
B = 4TTe-""(-q 2 -2q • p)"''(q2- 2q • pT'O^r^'"
(92.8)
__ 2 q 2 (pp; + p-p')-2(q-p)(q*p')
(q'-2q.p')(q 2 + 2q.p)
Here F(iV, iv, 1, z) is the complete hypergeometric function.
After differentiating in (92.6), we can put q = p' - p; then
(92.9)
tThe calculations are given by A. Nordsieck, Physical Review 93, 785, 1954.
§92
Electron-Nucleus Bremsstrahlung
391
(z < 0). Also,
- q 2 - 2 q . p = q 2 -2q.p' = p 2 -p' 2 >0.
The matrix element is thus finally found to be
- AA
8TTJ g"™
Pfi AiAf
~ (p-p')\p+p')\p-p')
i\
(p+p'\
-i(v+v')
X
x (1 - 2)'("+*'>-,[«VpqF(z) + (Î - z)F'(z)(p'p- pp')],
(92.10)
where we have put for brevity
F(z) = F(iV, iV, l,z).
(92.11)
The cross-section is obtained by substituting (92.10) in (92.3), but the general
formula is very lengthy and obscure. We shall therefore go on immediately to
calculate the spectral distribution of the radiation, i.e. to integrate the cross-section
over the directions of the photon and the final electron.
The integration over dok and the summation over the polarizations of the photon
are equivalent to averaging over all directions e and multiplication by 2 x ATT, i.e. to
the substitution
e^î dok->(8-Tr/3)ôjk.
The cross-section is then
= ^\Pfi\2da>dov:
(92.12)
The value of |p/,|2 is calculated by using (92.9)-(92.11) and the formula
|r(l - iV)|2 = 7ri//sinh iTv.
The result is
,
|2
IP/i|
32Tr(Ze2)2m3
P(P + P'Y(P - P')4(l - e-^Xe2"" - 1) X
x | Y ^ \F\2- z\Ff + \i(v + v') j 4 ^ (FF'* - F*F')V
(92.13)
To integrate the cross-section (92.12) over dop=2ir sinddfl, we change from
the variable d (the scattering angle) to
(P-PT
'
PP
392
§92
Interaction of Electrons with Photons
In order to integrate with respect to z, we transform the expression in the braces in
(92.13) as follows. According to the differential equation of the hypergeometric
function (see QM, (e.2)), we have
z(l - z)F" + [1 - (1 + iv + iv')z]F' + vv'F = 0,
z(l - z)F"* + [1 - (1 - iv - iv')z)F'* + vv'F* = 0.
Multiplying these equations by F* and F respectively and adding, we obtain
(1
" z)\lzz(FF*+
F
'* F) ~ 2 z ' F '' 2 + i ( > itz' ) z ( F '* F - F ' F *) + Y=-Z lFl2] = °-
Hence the expression in the braces in (92.13) is seen to be
{• • } = -\-^z(F'F*
(92.14)
+ FF'*),
and the integration is immediate.
Collecting the above formulae, we find as the final expression for the bremsstrahlung emission cross-section in the frequency range do>t
A
647T2 ^ 2 _2
d<r„= —
m2C2
1
p'
Z «r. ^ 7 7 -
(1
_ t-u+^u.
/
d |c.^x,2\dû>
fon
...
_ D ( " 5 lF(^)l j —> (92-15)
where
v = Zamclp = Ze2lhv,
F(|) = F(iv', iv, 1, a
v' = Ze2lhv',
p' = V(p 2 - Imhai),
£ = - 4pp'/(p " P'f.
Let us consider the limiting case where both velocities v and v' are so large that
v < 1, i/ < 1 (but, of course, still with t> <^ 1, so that Za < v < 1; this is possible only
if Z is small). To calculate the derivative F'(£) in this case, we use the formula
^ F ( a , 0 , y , z ) = ^ F ( a + 1,0 + 1,7 + 1,2),
which is easily obtained by simple differentiation of the hypergeometric series.
Then
F ' Œ - f r - i V F O , 1,2,*)
«(w7fllog(l-fl;
the last equation is evident from a direct comparison of the corresponding series.
t Formulae (92.15M92.25) are given in ordinary units.
Electron-Nucleus
§92
Bremsstrahlung
393
For the function F(£) itself, we have simply
F(É)«F(0,0,1,*)=1.
Then, from (92.15),
d*„ = iZ are p log ^ ^ 7 — ,
2
Ze /ftu«U,
(92.16)
2
Ze lhv'<\.
The smallness of v and »>' is just the condition for the Born approximation to be
valid in the case of Coulomb interaction. Formula (92.16) itself can therefore be
more simply obtained directly by means of perturbation theory (see Problem 1).
Now let a fast electron (v <^ 1) lose a considerable fraction of its energy by
radiation, so that v' < v and v' may not be small. Then
-£~4p'/p
=4vlv'<*\t
F(£)~F(ii/, 0 , 1 , | ) = 1 ,
F'(0 = - vv'F{\ + iV, 1,2, Ç) « -
vv\
and the cross-section is
*'-*TZW4)
Ze2lhv<\,
dco
l
1 - exp[- 2rrZe2lhvf] <o
Ze2lhv'^l.
(92.17)
When v'<^ 1, this formula yields the same limiting expression,
d(Tw = 3 Z z a r , —5
tr
to
,
as (92.16) does when v'<v. Hence formulae (92.16) and (92.17) jointly cover the
whole range of v' (when v <l).
When <o-+<oo, where ha>Q = 2mv29 the velocity v'-+Q and i / - » » . In this limiting
case, (92.17) gives
d
„=12|£Z3
«'O
mv 2 '
(92.18)
Thus dajdio tends to a finite limit as cu -♦ co0. This can be explained in a general
manner by arguments similar to those given in QM, §147. The physical reason is
that the frequency co0 is the limit only of the continuous bremsstrahlung spectrum.
The electron can also go to a bound state with emission of a frequency oi > co0. But
highly excited bound states in a Coulomb field have properties almost the same as
those of the free states near their limit. Hence the boundary between the con-
394
Interaction of Electrons with Photons
§92
tinuous and the discrete spectrum is not essentially a physically distinctive point.
Let us now consider the case where both parameters v, v'> 1. The motion of
both the initial and the final electrons is then quasi-classical. If the condition
hü)<p2l2m is also satisfied, the matrix element too is quasi-classical. Then the
formula of quantum mechanics must become the result given by the classical
theory (see Fields, §70). We shall, however, suppose that p2/2m —fta>,so that we
need an asymptotic expression for the function F(£) when v, v'->oo and £ ~ 1; a
more exact condition will be stated below, (92.24).
To derive this expression, we start from the integral representation of the
hypergeometric function, QMy (e.3), writing it as
F(iPv\iv\ho = ^ff
t^-W-ty^ii-tçr" dt'9
(92.19)
c
where
p = !//„',
0<p<l,
so that
£ = -4p/(l-p) 2 .
(92.20)
The contour of integration is taken as shown in Fig. 17, passing along part of the
real axis and avoiding the points t = 0 and t = 1.1"
When v, v> 1, the value of the integrand is small on the lower part of this
contour, and may be neglected. On passing downwards round the point t = 0, the
integrand is multiplied by the small factor exp(-27rpi/), and on passing upwards
round t = 1 it is multiplied by exp(27rpi>'). The integral
F=
27TÎ
je^dtlU
/ ( 0 = ilog ( 1 ^ f ) p f ( P 1 ^^ ) ,
(92.21)
may be calculated by the saddle-point method. The saddle point t0 is given by the
condition /'(to) = 0, whence t0 = 2O - p). At this point, however, the derivative /"(to)
is also zero, so that we must write
/(t)~/(t 0 ) + liar3,
t--o
T = f-to,
t--i
FIG.
17.
t For the hypergeometric function F(a, ß, y, £), the contour is to be chosen so that the function
V(t) =
e'ta-y+l(t-iy-"
returns to its initial value on passing round the contour. When y is integral (here, y - 1), the contour
chosen satisfies this condition.
§92
395
Electron-Nucleus Bremsstrahlung
where
/(to) = 2TTP + i(l + p) log | ^ ,
a = ^ f"\U) =
^p^j.
The coefficient lit of the exponential in the integrand may be written
i/r « i/r0 —r/rg;
we cannot here take simply the term l/t0, as this would reduce to zero the
derivative d|F(£)|2/d£ in (92.15). Thus we have, after an obvious substitution in the
integrals,
F
~2*rto(L-)" i e X p { ~ , r p ' / + ' / / ( ' o ) } > <
x
00
00
ei dx+
{ - / "' laey»
—oo
J xeii"<**}••
(9222)
—oc
The two integrals here are, respectively,
00
2 J cos |x3 dx = 3a/sp(2)»
0
2 f xsinix 3 dx = 3 1/6 $.
o
The derivative F'(£) is calculated similarly; according to (92.19), it is given by an
integral which differs from (92.21) only in that the coefficient 1/if of the exponential
is replaced by v'l(\ - £0- A simple calculation then leads to the result
-A.F(.)P
Çl
dV
=
(I-P)2*24V3TT P
•
Finally, substitution of this in (92.15) gives, with the necessary accuracy, the
simple expression
,
167T _ im c dot
d<r„ = -T57T 2Z2ar] —-j
.
J
P
(x)
/n-
~~.
(92.23)
The condition for this to be valid, i.e. for the asymptotic formula (92.22) to be valid,
is that the second term in (92.22) should be much less than thefirst:(1 - p)v > 1, or,
expressing the parameters of the hypergeometric function in terms of physical
quantities,
ha> > (hvIZe2) • \mv\
(92.24)
Interaction of Electrons with Photons
396
§92
This inequality is compatible with the quasi-classicality condition hu> <\mv2, i.e.
\- p<\. When the latter condition is satisfied, the result must be the same as the
corresponding formula in the classical theory, since on multiplication byfta>the
expression (92.23) becomes the classical formula (Fields, (70.22)) for the "effective
retardation" in the high-frequency limit.t In order to go to the classical formulae
throughout the range ( l - p ) v - l , v>\, we should have to find the asymptotic
form of the hypergeometric function when the saddle point is close to the
singularity t = 0; we shall not deal with this here, since the final result is obvious.
All the formulae given above refer to an attractive Coulomb field. The crosssection for emission in a repulsive field is obtained from (92.15) by changing the
signs of v and v'. Then, in particular, the limiting Born formula (92.16) remains
unaltered, but in the limit v < 1, v'-*<x> we have instead of (92.18)
,
1287r„3 2 2 / c \ 3
/ V(2mc2)>nZa\hd(i>
d„„ = — ZWr](-)
e x p ( - V ( ^ M ) ^ ,
,M,.,
(92.25)
i.e. the differential cross-section tends exponentially to zero as Ù>->IO0. This result
also is reasonable: in a repulsive field, there are no bound states, and the frequency
o>o is the true boundary of the radiation spectrum.
PROBLEMS
PROBLEM 1. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic
collision of two particles having different values of the ratio elm.
SOLUTION. The dipole moment of two particles with charges eu e2 and masses mi, m2, in their
centre-of-mass system, is
\m\
mi)
where fi - m\mij{m\ + mi), r = n - r2. Hence
m2/
\m\
\m\
mi)
r
The matrix element is
dp P =
Ù)
2 (d)p P ,
(t) = ( p 2 - p'2)/2/m,
where p = ftv, p' = JAY' are the momenta of relative motion, and it is calculated from the plane wavesj
by means of the formula
t The agreement of the formula (92.23) for hto der» with the classical formula, when the one condition
(92.24) is satisfied, is to some extent accidental. In the classical formula, the difference between v and v' is
beyond the accuracy postulated, and there is no reason to identify vo in Fields, (70.22) with v specifically; if
vo is identified with v\ there is no longer agreement with (92.23).
t The replacement of two particles by a single particle having the reduced mass is, of course,
permissible only in the non-relativistic case.
§92
Electron-Nucleus Bremsstrahlung
397
The result is
.
e]el ( e\
e2\2 v' /x2
day . .
m
dorkp = — r — - — I 7 7 7 (e • q)(c* • q ) — <V dok.
After summation over polarizations, the angular distribution of the radiation is given by a factor
sin2 0 , where 0 is the angle between the direction of the photon k and the vector q, which lies in the
scattering plane (see (45.4a))..
After integration over the directions of the photon we have
■
i6 2 it e\
\mi
e2\2 v'da)
sin Odd
mi) v W v .+ v -2vv cos 0
where 0 is the scattering angle. Finally, integration with respect to 0 gives
,
16 2 li*\
da» = T6i€2l
\m\
* 2 \ 2 1 1 V + V' dto
- 7 log
-,—.
mi) v
v - v to
For radiation in the field of a fixed centre of Coulomb force, this formula is equivalent to (92.16).
PROBLEM 2. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic
collision of two electrons.t
SOLUTION. In this case there is no dipole radiation, and we must therefore consider quadrupole
radiation. In the classical theory, the spectral distribution of the total intensity of quadrupole radiation is
given by
where D* = I eOxiXk - r28ik) is the quadrupole moment tensor of a system of charges.* For two
electrons we have, in their centre-of-mass system,
Dik = {eOxiXk - r25ik),
r = n - r2.
In the quantum theory, the Fourier components must be replaced by the matrix elements (cf. the
discussion of dipole radiation in §45), and, with appropriate normalization of the wave functions (plane
waves) and division by the photon energy o>, we obtain the cross-section for emission of radiation with
scattering of the electrons into the range of states d*p':
where v = 2p/m is the initial velocity of relative motion; the emitted frequency o> = ( p 2 - p,2)lm.
The operator D,* is calculated by threefold commutation of the operator Dik with the Hamiltonian
m
r
and is§
t The collision velocity v satisfies the conditions a<e2lhv<\.
The classical case (e2lhv > 1) is
discussed in Fields, §71, Problem.
t This formula is obtained from Fields (71.5) in the same way as (67.11) in that book is derived from
(67.8).
§ This expression is analogous to the classical form
l
*
1
fi - 4 * 3 1"* Xi « J. * X k „ oXiXk
Dik = ^ 7 16ppk + 6 p p i - 9 - p ~ p • r - p & k p • rl,
which would be obtained on differentiating Dik and using the classical equation of motion:
5mr=e2r/r\
§92
Interaction of Electrons with Photons
398
Since the two particles (electrons) are identical, the matrix elements are calculated from the wave
functions
^ = V2<eiP'r±e
Pr)
^=V2(eiP
'
r±e i9
~ '">'
where the signs -I- and - correspond to total spins 0 and 1 of the electrons (interchange of the electrons
corresponds to changing the sign of r).
The lengthy calculations lead to the following formula for the spectral distribution of the radiation:
«
A
2I17
3x2
, 12(2-x)4-7(2-x)V-3x4
._, 1 1 V ( l - x )
J
where x = ü>le and e = p2lm is the initial energy of relative motion of the electrons; the cross-section is
averaged over values of the total spin of the electrons. The cross-section for energy loss by radiation is
\i
(oder* = 8.1 are
(B. K. Fedyushin, 1952).
PROBLEM 3. Determine the energy of the radiation resulting from the emission by a nucleus of a
non-relativistic electron in the 5 state.
SOLUTION. The wave function of the emitted electron is an outgoing spherical s wave normalized
to unit total flux:
1
eipr
V(4TTV) r '
see QAf, (33.14). As the wave function of the final state of the electron (after emission of the photon),
we choose the plane wave
The transition matrix element is
P/.=(Pi/)*=(J*?P<M3x)
=
, P'
f ,-ip'-r+lpf <^X
V(4irv) J
-V
47T
r
p'
v p' - p
V 7^
IT
y'
the integral is calculated by means of (57.6a). The radiation energy is given by (45.8) on multiplying by
d3p7(27r)3 and integrating over the directions of p' (which is equivalent to multiplying by 4ir). The
spectral distribution of the emitted energy is then
dE„ = (2e V3/37Tt;) dco.
When u) -♦(), thefinalvelocity v' of the electron tends to t\ and the formula agrees, as it should, with the
non-relativistic limit of the classical result; see Fields, §69, Problem. The total emitted energy is (in
ordinary units)
*-£-(D'2
where e = \mv is the initial energy of the electron.
§92
Electron-Nucleus
Brems Strahlung
399
PROBLEM 4. Determine the energy of the radiation resulting from the reflection of a non-relativistic
electron from an infinitely high potential barrier.
SOLUTION. Let the electron be moving perpendicularly to the barrier. Although the photon may be
emitted in any direction, in the non-relativistic case the photon momentum is small compared with the
electron momentum, and we may therefore suppose that the reflected electron also is moving perpendicularly to the plane of the barrier. Let the barrier be at x = 0, and the electron be moving on the
side where x > 0. The wave functions of the stationary states of one-dimensional motion, normalized by
S(p/2ir) (p = px), have the form of stationary waves (see Qhf, §21):
\\t\-2 sin px,
\\ff = 2 sin p'jc.
The matrix element of the operator p = px is
Pfi = - 4i I sin P*~r
sin px dx
P -P
integrals of this form are to be understood as the limit, as S -♦ + 0, of the values obtained by including a
factor e-ÄX in the integrand.
The energy radiated in a single reflection of the electron is found from (45.8) by multiplying by
dp* = d(olv' and dividing by VI2TT (the flux density of the wave approaching the barrier in the initial
function ty):
dE» = ~z—r \Pfi\
3m ' '
Q
vv
—
- -z— e2vv' do).
(1)
At low frequencies (o)<e = \mv2) we have v'Ä V9 and formula (1) becomes the classical formula (Fields
(69.5)), which has to be integrated over angles, using the fact that v = iAt>, where Av is the change in
velocity of the electron on reflection; this is as it should be, since the condition for the collision time to
be small (Fields, (69.1)) is always satisfied in reflection from a barrier. The quantum formula (1),
however, also gives the total emitted energy (in ordinary units):
■ /
dEw .
16
v2
~5—
dû) = zr— ae ~i.
do)
9ir
c
PROBLEM 5. Determine the bremsstrahlung energy in the scattering of a slow electron by an atom.
SOLUTION. With the condition pa ^ 1 (where a denotes the atomic dimensions), the scattering by
the atom is isotropic and does not depend on the electron energy; see QM, §132. The wave functions of
the initial and final states of the electron may be written
^ = e i p r + /e ,p 7r,
*/ = e,pr'r + /e"*'r/r,
where / is the constant real scattering amplitude. These expressions pertain to the asymptotic range of
distances rt>a, which in the present case is the important range: r—\lp>a.
The matrix element
calculated from these functions is
P/J =(27r//a>)(v-v');
the integrals are calculated as in Problem 3. Substituting this expression in (92.12), we obtain the
cross-section for radiation with scattering of the electron in the direction p\ in ordinary units,
***** =3trpc^
Tzfzt (v - v')2 dcrei — ,
(1)
where doei = / 2 do^ is the differential cross-section for elastic scattering. When ha> <Zp2l2m, we can take
400
§93
Interaction of Electrons with Photons
p ^ p \ and this formula then becomes, as it should, the non-relativistic expression for the emission of soft
photons; see §98.t
Integrating (1) over the directions of p \ we obtain
d<r. = ^ - ( r
2
+
v'V^,
3TTC p
(2)
&
where <rc\ = 4ir/2 is the total elastic scattering cross-section. Lastly, multiplying by hw and integrating with
respect to o> from 0 to p2/2m = e, we obtain the "effective retardation"
— I htoda» =77— a0eie(u/c)2.
J
43 ir
(3)
§93. Electron-nucleus bremsstrahlung. The relativistic case
Let us consider the electron-nucleus bremsstrahlung for the case of relativistic
electron velocities.$ We shall assume that the condition for the Born approximation
to be valid is satisfied for both the initial (v) and the final (t>') velocity of the
electron: Ze2lhv<\, Ze2lhv'<\. The charge on the nucleus must be such that
Za<\.
As in §92, we shall neglect the recoil of the nucleus, so that the latter acts only
as the source of an external field; the justifiability of this treatment is discussed in
§97.
According to (91.4), the cross-section for bremsstrahlung is given in terms of
the amplitude by
(à?|M/i|2 iSfd0k do'd<0'
(93,1}
d(T=
In the first non-vanishing approximation, the matrix element Mfi corresponds to
two diagrams:
k\
/ q
\
q\
/
/k
\
/
(93.2)
The free end q corresponds to the external field, so that q= p'-p + k is the
4-vector of momentum transfer to the nucleus. Since the recoil is neglected, the
time component q° = 0.
According to the diagrams (93.2),
M* = - e2AWq)V(47r)e*ü'(^ f ^
y° + / ^ f c ^ 7M)«.
(93.3)
t The fact that4'factorization" of the cross-section (the separation of the factor crci) occurs here for
any o> is to some extent accidental, arising because the scattering amplitude is independent of the energy.
t The majority of the results given below were first derived by H. A. Bethe and W. Heitler (1934)
and independently by F. Sauter (1934).
§93
Electron-Nucleus Bremsstrahlung
401
The intermediate 4-momenta are / = p - k, /' = p' + k. We shall use the notation
/ 2 - m 2 = -2kp=-2Kû>,
/ ' 2 - m 2 = 2kp's2K'o).
(93.4)
A{>e) is the scalar potential of the external field; for a purely Coulomb field,
AMq) = 4irZe/q2.
(93.5)
Substitution in (93.1) gives for the cross-section
da = j £ t e ^„(Ö'Q'MXWQ V ) dok do' do,
(93.6)
where
V
7
7
2WK'
2O)K'
7
7
2o)K
7
'
7
7
2o)K
7
'
Disregarding polarization effects, we average the cross-section over the directions
of spin of the initial electron and sum over the polarizations of the final electron
and photon. This is equivalent to the substitution
e*ev(û'Q,iu)(ûQvu')-+-12tr Q^yp + m)Q»(yp' + m).
The trace is calculated by means of the standard formulae (§22). The calculations
are somewhat simplified by using the equation
y\yp)y° = yp,
where p = (e, -p) if p = (e, p). Moreover, the number of terms to be calculated can
be reduced by using the symmetry with respect to the change p +*p', k-^-k,
q^>-q; this simply interchanges cyclically the factors in the product of matrices,
leaving the trace unaltered.
The result is the following expression for the differential cross-section for
bremsstrahlung in which a photon of a given frequency is emitted in a given
direction and the secondary electron travels in a given direction:!
t Here and in the rest of §93, p, p' and q denote the magnitudes of the three-dimensional vectors:
P = IPI» P ' = IP'I, <ï = |q|.
402
§93
Interaction of Electrons with Photons
A
Z2(xr
^
[KK
+
P'™4
dù)
A
A ' v
m
\K
^.n^l.)^(lL
m
\K
K)
ml
\K
+
K
!
\K
K
i
(93.7)
JL)\
K
))
where K = e - n • p, K' = e' — n • p \ n = k/co, q = p' + k - p.
By means of simple transformations, this expression can be put in a form
somewhat more convenient for analysis:
da = —z—£ — - — r sin 0 d0 sin 0 dö dé x
z u eu pq
x {^75 ( 4 e 2 - q2) sin2 0' + j ^ ( 4 e ' 2 - q2) sin2 0 +
+ ^ ( p 2 s i n 2 0 + p ' 2 s i n 2 0 ^ - ^ ( 2 e 2 + 26,2-q2)sin0sin0'coS(/>],
KK
KK
J
(93.8)
where
K = e - p cos 0,
K' = e ' - p ' c o s 0',
q2 = p2 + pf2 + <o2-2p<D cos 0 + 2p'û> cos 0 ' - 2 p p ' ( c o s 0 cos 0' + sin 0 sin 0'cos </>);
0, 0' are the angles between k and p, p' respectively; <f> is the angle between the
plane of k and p and that of k and p \
The integration of (93.8) over the directions of the photon and the secondary
electron is fairly lengthy. It leads to the following formula for the spectral
distribution of the radiation :t
*72„r2da>p'(4
^r,p2 + p'2 m2 f, e' ., e
IV \
aaw = Z are
i -z — Zee —r~7T~ + m 1 ' ~s+ ' ~ïi
;)+
p 2 p ' I p3
p' 3 p p 7
co p 13
, r [See' , a)2 t 2 a . 2 a , 2 ,v , w2co /. ee' + p 2 ,,£e' + p ' 2 \ l l
A
(93.9)
where
. ee' + p p ' - m 2
r
L
= log—T—^7——5,
ee — pp — m
t The integration over the directions of the secondary electron only can also be completed in an
analytical form; see R. L. Gluckstern and M. H. Hull, Jr., Physical Review 90, 1030, 1953.
Reference may also be made to the review paper by H. W. Koch and J. W. Motz, Reviews of Modern
Physics 31, 920, 1959, in which the bremsstrahlung formulae are represented graphically.
Electron-Nucleus Brems Strahlung
§93
403
The permissible values of the frequency in these formulae are limited only by the
condition imposed on the final velocity of the electron (Ze2lv'< 1): the electron
must not lose almost all its energy. As œ ->0, the emission cross-section diverges as
(MOD; this illustrates a general rule which will be discussed in §98.
In the non-relativistic limit (p <?m), the photon momentum is small compared
with the electron momentum, since
Hence q2 — (p'-p) 2 - Putting in (93.8) e = e ' = m and neglecting p, p' and o> in
comparison with m, we find
da = - Z2ar] — ^ \ sin 0 dd sin 0' d0' d<f> x
IT
a)
pq
x (p2 sin2 0 + p'2 sin2 0' - 2pp' sin 0 sin 0' cos <f>),
or
d a Ä ^ n X q f ^ ^ ,
(93.10)
in accordance with the Born-approximation formula derived in §92, Problem 1.
Correspondingly, the spectral distribution of the radiation is given by the formula
(92.16) already derived.t
In the ultra-relativistic case, when both the initial and the final energies of the
electron are large (e, e'>m), the angular distribution of photons and secondary
electrons is very unusual. For small angles 0, 0', the quantities K, K' which appear
in the denominators of formula (93.8) are
K~\e(£+e2),
K'~ie'(^+0'2),
(93.11)
and become very small in the range 0 ^ m/e. In this range the magnitude of the
vector q is also small (q ~ m). Thus, in the ultra-relativistic case, the photon and
the secondary electron move forwards in a narrow cone with aperture angle ~m/e.
A quantitative formula for the angular distribution in the ultra-relativistic case is
easily obtained from (93.8) by substituting for K, K' from (93.11), replacing p, p' in
all other places by e, e\ and neglecting q2 in comparison with e2. With the
convenient notation
S = E0/m,
S' = e'07m,
(93.12)
t The derivation of this formula by taking the limit in (93.9) is somewhat laborious, however,
because of the cancellation of various terms.
404
§93
Interaction of Electrons with Photons
we can put (93.8) in the form
da = -Z2ar)^K^-8d8
eq
IT
f
Ô2
2 1
1(1 + ô )
■ 8' d8' d</> x
o)
8'2
22
(1 + Ô' )
82 + 8'2
ô*2
lee' (1 + 82)(\ + 8'2)
-(H)<r^W-
CM.»)
Putting q2 = (n x q)2 + (n • q)2 (n = k/to), we can easily find that for small angles
^ = (82+8'2-188'cos
m
<t>)+m2(^^-l-^)\
\ le
le
}
(93.14)
When 5 — ô'— 1, the second term in (93.14) is small compared with the first. The
two terms become comparable at even smaller angles, where ô - m/e. Although q
here becomes particularly small (q — m2le <?m), the integrated contribution from
this region to the cross-section is still small compared with that from the whole
region S *s 1 (the ratio of the contributions is easily seen to be m2le2). But q can
also reach values q — m2/e when 5 ~ S' ~ 1 if
|S-6'|«m/c,
(93.15)
4>^mle.
The contribution from this region is of the same order as the whole integral
cross-section, or may even be the principal term in it (see below).
The integration of (93.13) with respect to <j> and S' gives the angular distribution
of photons (with given frequency), regardless of the direction of the secondary
electron: t
j
7 22 22 dio
d<r
= o8Z
«r —
x
rre
+
e'
e'
Ô dô
7 n T
g ^ xw
4S2
1,
l0ß
lee'
Ue
«'
+ +2
1662 H
U7 7-öTwJ ^"2b 7 -(rro?J|-
,al
1Ä .
(9316)
Integrating with respect to 8, we find the spectral distribution of the radiation in
the ultra-relativistic case:
da. = 4 Z V ' , ^ ( A + - - ^ ( l o g ^ : - i V
to e \e
e 3 / \ ° mco 2/
(93.17)
this formula can also, of course, be obtained directly from (93.9).
The presence of the logarithm of a large quantity (the ratio se'I mo* ~ e'/m > 1
t The integration over </> from 0 to 2v is taken first. That over 5' is conveniently replaced by integration
over the difference |A| = \8' - 8|, dividing the range into parts, from 0 to some Ao and from Ao to °o, where A0
satisfies the inequalities m / e ^ A o ^ l . In each region, appropriate approximations are possible in the
integrand.
§93
405
Electron-Nucleus Brems Strahlung
even if <o ~ e) should be noted. If this quantity is so large that its logarithm is also
large, the logarithmic terms become the principal ones in these formulae. The
logarithm arises from integration in the range (93.15).t Thus, in the logarithmic
approximation, i.e. when the terms not containing a large logarithm are neglected,
the secondary electron moves at an angle ~(m/e)2 to the direction of incidence.
Finally, we shall give the limiting formula for the region near the hard end of
the spectrum, when the ultra-relativistic electron radiates almost all its energy:
ay « e > e'. From (93.9) we easily find
.
.„2
idiù\eax
e' + p'
mV.
2
e'+ p'
e'l
/M
10 .
Formulae (93.17) and (93.18) together cover the whole range of values of o> for an
ultra-relativistic initial electron, and agree for <o ^ e > e'> m. If the secondary
electron is non-relativistic (p' <m), then
d(Tu> =
2ZWjV[2m(£-(t,)]^.
(93.19)
POLARIZATION EFFECTS
Polarization effects in bremsstrahlung can be studied by the general method
described in §65. The choice of the 4-vectors e(1), e(2) is here particularly simple.
Since there is only one frame of reference (the rest frame of the nucleus) which is
of practical importance, it is sufficient to put e{X) = (0, e(1)), e{2) = (0, e(2)), where e(,),
e(2) are unit vectors perpendicular to k, one lying in the plane of £ and p and the
other perpendicular to that plane.
We shall not give here either the fairly lengthy calculations or their quantitative
results, but merely note some qualitative properties of the polarization effects.t
These properties can be derived by means of various symmetry relations, as was
done for the Compton effect in §87.
The theory under consideration corresponds to the first non-vanishing approximation of perturbation theory. In this approximation the cross-section cannot
contain a term proportional only to the polarization vector Ç of the initial electron
or Ç' of the final electron. The absence of a term <*£ means that the total emission
cross-section (summed over the polarizations of the photon and the secondary
electron) is independent of the polarization of the incident electron.
Of the terms proportional to only the photon polarization parameters £{, ££> &>
t This is easily seen by considering the rsmge of integration in which 4> and A = 6 ' - S satisfy the
conditions mle <*A, 4> < 1. In this range, q2/m2=* A2 + <£252, and the terms in the braces in (93.13) are
proportional to <t>2 or A2 (becoming zero when 4> =0 and A = 0). Integrals of the form
f <ft2d<HA
JWTsW?
or
f A2<tydA
J(AV5V)2
diverge logarithmically; they are "cut off" at the limits of the above-mentioned range of the variables.
t For a fuller discussion of these effects, see W. H. McMaster, Reviews of Modern Physics 33, 8,
1961, and V. N. Baïer, V. M. Katkov, and V. S. Fadin, Radiation from Relativistic Electrons (Izluchenie
relyativistskikh ilektronov)^ Atomizdat, Moscow, 1973.
406
Interaction of Electrons with Photons
§93
the term °c££ is absent. Thus a photon radiated by an unpolarized electron is not
circularly polarized. Here, however, there is a difference from the corresponding
result for the Compton effect: in the latter case such terms were forbidden by
spatial parity because of the impossibility of constructing a pseudoscalar from the
only two available independent vectors, k and k'. For bremsstrahlung, there are
three independent momenta p, p' and k, and these suffice to construct the pseudoscalar k • p x p \ A term of the form ^ k • p x p' does not violate spatial parity, and
therefore, strictly speaking, need not be zero; but it is not invariant under a change
of sign of all the momenta (cf. (87.26)), and is consequently absent in the first Born
approximation.
The existence of the pseudoscalar k • p x p' also has the result that, as well as
the term proportional to £3, a term proportional to ^i is also allowed in the
cross-section, unlike the case of the Compton effect. This term arises as a product
of the form
Saßva(kxv)ßk
pxp'
(where v = k x r l , which is invariant both under spatial inversion and under a
change of sign of all the momenta. Thus the emitted photon has linear polarization of both kinds (both along the axes e(1) and e(2), and in the "diagonal" directions
at 45° to these axes). This refers, however, only to conditions where the direction
of motion of the secondary electron is also recorded. On integration over all
directions of p \ the term *£i in the cross-section vanishes. This is evident from
symmetry, since after the integration the two non-coincident "diagonal" directions
become equivalent, and there can therefore be no preferential polarization along
one of them, such as occurs when Ç\ ï 0.
The degree of linear polarization is independent of the state of polarization of
the incident electron: the correlation terms of the form £}£ and £3^ in the
cross-section are forbidden in the first Born approximation. The term Ç&, however,
is allowed, so that the photon radiated by a polarized electron is circularly
polarized (Ya. B. Zel'dovich, 1952).
SCREENING
The formulae derived above are for a purely Coulomb field. If radiation in a
collision not with a "bare" nucleus but with an atom is considered, allowance must
be made for the screening of the nuclear field by the electrons, which reduces the
cross-section. For this purpose we must include the atomic form factor F(q) in the
potential A{€\q) of the external field; see QMy §139. According to QM (139.2), this
is done by writing Z-F(q)
instead of Z. We shall show under what conditions
screening is important.
A given value of q in the form factor corresponds to distances r—llq in the
spatial distribution of the electron charges in the atom. The form factor becomes
almost equal to Z (total screening) when q ^ 1/a, where a is the dimension of the
atom.
In the ultra-relativistic case, as we have seen, an important contribution to the
emission cross-section comes from the range of values of q near its minimum
§93
Electron-Nucleus Bremsstrahlung
407
possible value for given initial and final energies of the electron. In the ultrarelativistic case,
<imin =
p-p'~0)
= V(e2-m2)-V(e'2-m2)-(e-e')
= m2w/2ee'.
(93.20)
Screening is important if qmin^ 1/a, or
ee'lma) > am.
(93.21)
This condition is always satisfied for sufficiently large energies of the incident
electron.
If Qmin*^ 1/û ("total screening") we can immediately write down, with logarithmic accuracy, the spectral distribution of the radiation. The argument of the
logarithm in (93.17) is just the left-hand side of the inequality ee'lm<o > am. When
the inequality is satisfied, the integral over q which leads to this logarithm is cut off
at a quantity of the order of the right-hand side of the inequality. According to the
Thomas-Fermi model a ~ a0Z~1/3, where a 0 ~ lime2 is the Bohr radius (see QM,
§70); then am ~ l/aZ1/3. Thus, when there is total screening, the logarithm in (93.17)
should be replaced by log(llaZ m ).
ENERGY LOSS
The energy
rgy lost as radiation by the electron is expressed by the "effective
retardation"
e-m
K r a d = J a>dcrm.
(93.22)
0
The calculation of the integral, with daa from (93.17), givest
2
4m 2 ,log e- +p
Krad = 7Z2 arriee fl2e
( 3+
^ ep
+ 6
_(8e-f6p)m'
P . > m 26t zo gl 2± l£±_E4- l+, W F / 2 £ ( £ + £ ) \ l
e
3ep
m
3ep
m
33 ep
ep \ m
/J
(93.23)
where the function F(^) is Spence's function (131.19).
In the non-relativistic case, (93.23) becomes
Krad = 16Z2ar2m/3,
(93.24)
t Although formula (93.17) is inapplicable near the upper limit, this fact is unimportant, since the
integral converges.
408
Interaction of Electrons with Photons
§93
since F(£) = £ when (■< 1; see (131.23). This formula can, of course, be obtained
by direct integration of the non-relativistic Born formula (92.16).
In the ultra-relativistic case,
Krad = 4Z2ar]e (log ^ - 1 ) ;
(93.25)
when Ç> 1, F(£) « | log2 £; see (131.20). The two log2 terms in (93.23) can then be
omitted.
The ratio Kraa/e is also called the cross-section for energy loss by radiation. It
increases logarithmically when e is large. This increase no longer occurs, however,
when screening is taken into account. For total screening, Krad/e tends to a constant
limit = 4ZW2log(l/aZ"3).
For a collision with an atom, it must also be remembered that some radiation
originates from the electrons, as well as that from the nucleus. We shall see later
(§97) that, in the ultra-relativistic case, the electron-electron emission cross-section
differs from the electron-nucleus cross-section only in that the factor Z2 is absent.
Hence the presence of Z atomic electrons can be approximately allowed for by
replacing Z2 by Z(Z + 1).
On passing through a medium containing N atoms per unit volume, a fast
electron loses its energy, on average, over a distance
irad ~ elNKTad ~ [Z2aNr2 logd/aZ"3)]"1,
(93.26)
called the radiation length.
COHERENCE LENGTH
Formula (93.20) may be given a different and more general interpretation. Fpr
the expressions derived above to be valid, it is necessary that the external field in
which the electron moves should vary only slightly (in the direction of motion) over
distances
Jcoh ~ l/flmin ~ ee'lm2(o (= ee'/c3m2ü));
(93.27)
this is called the coherence lengthA The value (93.27) obtained in the Born
approximation is actually quite generally valid for ultra-relativistic particles: it is
easily derived in the opposite case of quasi-classical motion also, since from
(90.22)$ we see at once that the important times for radiation at small angles to the
direction of motion are
T ~ e'le(o(\ - v) ~ e'e/m2o),
corresponding to a section of the trajectory with length CT ~ /COh.
+ The discussion here is due to M. L. Ter-Mikaélyan (1953).
t The derivation of (90.22) was based only on the smallness of the curvature of the trajectory, and in
that sense does not depend on the fact that a magnetic field was specifically under consideration in §90.
409
Electron-Nucleus Brems Strahlung
§93
For a given frequency co, the coherence length increases with the electron
energy. However, the formulae obtained for bremsstrahlung at an isolated atom
can be valid also for passage through a medium only'if there is no secondary
photon emission or electron scattering over a distance equal to the coherence
length. The condition for no secondary photon emission is Jcoh^'rad, but the
condition for no electron scattering is violated^ much sooner: there is repeated
scattering of the electron by the atomic nuclei in the medium over a distance ~ /rad.
To arrive at a quantitative condition, we return to formula (90.22) before the
integration with respect to time in the exponent and write this as
f, + T
-i^r
tj + T
\ (l-n-v)dt~-^{(l-i>)r
+ 12
j 0 2 dt},
(93.28)
where 6 is the small angle between v and n which results from scattering by nuclei.
For Coulomb scattering, 0 changes by small jumps, and its time variation is
therefore a slow "diffusion in angle". The mean square deflection of the electron
over t - tj is, in order of magnitude,
Ô*~(t-U)llcouu
where /Coui is the mean free path for Coulomb collisions, given by
1/lcoul - (NZ2e4le)
\Og(XmJXmin),
where #max and #min are the maximum and minimum angles of scattering in one
collision for which the process may still be regarded as Rutherford scattering (cf.
PK, §41).t The value of Xmin is determined by the atomic dimensions a, over which
the field of the nucleus is screened: xmin~~ 1/pa. Large scattering angles are limited
(for an ultra-relativistic electron) by distances of the order of the nuclear radius R:
Xmax - 1/pJR. If we put R - 1.5 x 10",3Zl/3 cm - reZll\ we find
/coul
s
as
~ NZVlog(l/aZ"3) ~ 7 F ,rad-
(93 29)
-
The second term in (93.28), which covers a time T ~ /coh, is now estimated as
6 T ~ (ü>e/e')'coh/'coul~ /coh/afrad.
For the bremsstrahlung formulae to be valid that were derived without allowance
for multiple scattering, this term must be much less than unity. Hence we find the
condition
U^afrad,
(93.30)
which is stronger than /coh <l lrad (L. D. Landau and I. Ya. Pomeranchuk, 1953).
t The mean free path is determined by the transport cross-section a, = I (1 - cos x) d<r(x)- For the
scattering of ultra-relativistic electrons by a Coulomb centre, the cross-section da(x) is given by (80.10).
410
§94
Interaction of Electrons with Photons
§94. Pair production by a photon in the field of a nucleus
The formation of an electron-positron pair in a collision between a photon and
a nucleus (Z + y -> Z + e~ + e+) and electron-nucleus bremsstrahlung (Z + e~ -+Z +
e~ + y) are two cross-channels of the same reaction. Rules have already been
formulated in §91 for the transformation of formulae from the latter case to the
former. Applying these rules to (93.8), we find the following expression for the
differential cross-section for pair production by an unpolarized photon, averaged
over the polarizations of the components of the pairit
.
der =
.
.
.
Z2ar] m2p+p-dè+ .
z—- — \ 4 sin 0+ d0+ • sin 0_ d0_ a<£ x
Z7T
(i) q
x te (4e- - q2) sin2 0+ + ^ i (4s2 - q2) sin2 0. K—
(, K +
--^-
K+ K—
(pi sin2 0+ + p2_ sin2 0 _ ) - ^ ^ ( 2 e 2 + + 2e2 - q2) sin 0+ sin 0_ cos <f>],
K + K—
J
(94.1)
K± = e±-p±cos ö±,
q2 = (p+ + p- - k)2,
e+ + e- = û>
(H. A. Bethe and W. Heitler, 1934).
A similar transformation derives from (93.9) the energy distribution of the
components of the pair:
dat+ = Z2ar] ^
û)
\
P-
de À- \ - 2e+e- ^P^I
+
p+p-
pi
+L \ - ^ +
L 3p+p_ p+p-
e+e.-p+p. + m
P+P-J
-£T(elel+plpl-mh+e-)-
c±~*P±
Since the above formulae are based on the Born approximation, they are valid
under the conditions Ze2lv± < 1. The symmetry of (94.1) and (94.2) with respect to
the electron and the positron is itself a consequence of the Born approximation,
and would not occur in higher approximations.
In the ultra-relativistic case (e± > m), the electron and the positron are emitted at
angles 0± ~ m/e± relative to the direction of the incident photon. The angular
t Polarization effects in pair production by a photon are discussed in the papers already quoted in
893 in connection with bremsstrahlung.
i
§94
Pair Production
by a Photon
411
distribution is given by a formula similar to (93.13):
, _ 8 _2
da
z
2m
4
e+e-
J
f
g2
51
f
,
co2
Ô2+ + ô2_
- ^ ^ ^ ^ M " â T ^ - ( T T W " 2 e + e _ ( i + 5i+)(i + si) +
(94.3)
with
K = 81 + 81 + 28+ 8-cos $ + m2(^^
+ K^)2.
(94.4)
The energy distribution in this case is
da = 4Z2ar2€—y(el
(o
+ e_ + 3e+e-)(log——- - ^ Kultra-relativistic case).
\
mco LJ
(94.5)
Integration of (94.5) over E+ from m to w gives the total cross-section for pair
production by a photon having a given energy:!
a = ?Z 2 ar 2 (log ~ ™ ) ,
« » m.
(94.6)
As with bremsstrahlung, the logarithmic term in the ultra-relativistic crosssection arises from the range of values q — m2le. This now corresponds to angles
for which
|8+-8-|«m/e,
|7r-<^|^m/e,
instead of <f>^mle as in (93.15). Thus, in the logarithmic approximation, the
directions of the electron and the positron are at small angles to the direction of the
photon and are almost coplanar with the direction of the photon but on opposite
sides of it.
Near the reaction threshold (o>->2m), the Born approximation is invalid. The
derivation of a quantitative formula in this case would require an exact calculation
of the Coulomb interaction of the three charged particles (the nucleus and the pair)
in the final state. The symmetry with respect to the electron (which is attracted to
the nucleus) and the positron (which is repelled from the nucleus) is then, of
course, lost.
If
Za<j2^<\9
*
(O
(94.7)
t Since the integral converges at both limits, the inapplicability of formula (94.5) for small values of
e± - m is not important.
Interaction of Electrons with Photons
412
§94
the Born approximation is still valid. At non-relativistic energies of the pair,
co ~ 2m > p±, and therefore q *** cu. In (94.1) we can everywhere put e± = K± = m,
(o = 2m, and this formula then reduces to
d(T
?!
2
= ^rrf2^r(pl
o47T
rn
sin2 0+ + pi sin2 Ö-) do+ do- de+.
(94.8)
After integration over angles,
m
= ^ | ^ (CÜ - 2m)V[(e+ - m)(e- - m)] de,.
(94.9)
Finally, integration over e+ from m to o> - m gives the total cross-section
If the relative velocity (v0) of the components of the pair formed is small, their
Coulomb interaction must be taken into account (A. D. Sakharov, 1948). This
interaction becomes important when vo is of the order of (or less than) the
velocities of the particles in the bound state of the electron and positron (positronium):
vo^a.
(94.11)
Let us consider the process in the centre-of-mass system of the pair. Virtual
momenta ~m are important in the diagrams which represent the process in this
system; that is, distances — 1/m between the electron and the positron are important. The wave function ip(r) of their relative motion changes appreciably only over
distances r — \lmvo~ lima, which are large compared with 1/m. The allowance for
the interaction of the particles therefore amounts to the inclusion of a factor i/**(0)
in the transition matrix element. The differential cross-section is accordingly
multiplied by |^(0)|2, i.e. by
27ralvo
1=- eT » 5 i
(94 12)
'
see Qhf, (136.11). The relative velocity of the two particles is the velocity of one
particle in the rest frame of the other. Comparing the values of the invariant p+rf>t
here and in the laboratory system (the rest frame of the nucleus), we find
whence v0 may be found. If p+ and p_ are similar in magnitude and direction, vo is
§95
413
Exact Theory of Pair Production
given by the approximate formula
t^ = p 2 0 2 + ( p + - p - ) V
(94.13)
valid for VQ< 1; here p = i(p+ + p ) , e = \(e+ + £-), and -d is the angle between p+
and p_.
The correction to the cross-section according to (94.12) and (94.13) causes an
anomaly in the correlation between the momenta of the electron and positron
formed: it has a narrow maximum at p+ « p..
§95. Exact theory of pair production in the ultra-relativistic
case
In §§93 and 94 we have discussed bremsstrahlung and pair production by a
photon in the relativistic case, using the Born approximation, for which the
condition Za < 1 must always be satisfied. In §§95 and 96 we shall describe a theory
of these processes which is not subject to the limitation just mentioned, i.e. is valid
even if Za — 1 (H. A. Bethe and L. C. Maximon, 1954). We shall assume that both
the particles (the initial and final electrons, or the constituents of the pair) are
ultra-relativistic, with energy e > m.
We have seen that in the ultra-relativistic case both particles move at small
angles (0, 0' or 0+, 0_) to the direction of the photon: 0 ^ m/s. This property is
preserved in the exact (with respect to Za) theory, and we shall therefore consider
just this range of angles.
The momentum transfer to the nucleus in this range is q — m. This means that in
the wave functions the important values of the impact parameter are p ~~ \lq~~ 1/m,
i.e. "large" distances. At such distances the wave function derived in §39 can be used.
The calculations for pair production are as follows.
The pair production cross-section is similar in form to the photoelectric effect
cross-section (cf. (56.1), (56.2)):
der =
2TT <?V(4TT)
1
V^hô)
Mn
8(a) - e+ - e.)
n
i
(95.1)
where
Af,n
= liï-X("'*)*ik'f*%.-P.dh.
(95.2)
Here i/^.p. is the wave function of the electron, and ^(_V+i-p+ the wave function
with negative energy -e+ and momentum -p+.
The function i/^"^, which pertains to a particle in the final state, must have an
asymptotic form which includes (besides the plane wave) an ingoing spherical
wave; this is indicated by the superscript (-). According to (39.10), this wave
§95
Interaction of Electrons with Photons
414
function ist
^• p - = v ^ T j C'P ~"{l ~ l~TI^)Fi~iv>u ~i{p-r + p * r))w( P- } 'I
C - ^ e ^ T d + iV),
(95.3)
y = Za.
The function «/>-+e)+_p+ must have an asymptotic form which includes an outgoing
spherical wave (indicated by the superscript (+)), since it denotes the wave
function of an "initial state with negative energy". The asymptotic form of the
wave function of the positron, obtained from $(-2*-v+, then has an ingoing wave, as
is correct for a final particle. According to (39.11), this function is
*-•►•-'►= wh) e~i?+"(l +
l
~fï7)Fi~iv'u
i(p+r + p+
■ r ) ) M ( -P + ) ' (95.4)
C ( + ) = «- w W T(l + iV).
The terms ~l/e in (95.3) and (95.4) have to be included because of the matrix
structure of M/, (95.2). The matrix element (a)/, is a vector whose direction is close
to that of k. The leading term in (a • e)/i is therefore small, and the correction terms
are of the same order of magnitude as that term.
Substituting (95.3) and (95.4) in (95.2) and neglecting terms —1/e+e-, we find
Mfl = 2 v £ ~ T ) "*<P-M(e * «)I + (« * «)(« • I+) + (« • I-Xe • a)}«(-p + ),
(95.5)
where
N = C^C^ = Wsinh vv,
I+ = ^ - IV1'« rF*VF+d'x,
2e + J
(95.6)
(95.7)
I = 2 r / ^ , q r(VF_)*F+d3x,
q = p+ + p_ - k;
F- and F+ are used for brevity to denote the hypergeometric functions which
appear in (95.3) and (95.4). The integrals I, I+, I- satisfy one identical relation: from
fv(e _ i , ' r F*F + )d 3 x=0,
t In this section, p- = |p±|, q = |q|.
§95
Exact Theory of Pair Production
415
qI + 2e + I + + 2e-I- = 0.
(95.8)
we have
We average |M/j|2 over polarizations of the incident photon, and sum over directions of the electron and positron spins.t This is done by the tensor substitution
eiet -»i(5ik - n^k),
n = k/w,
and changing the bispinor products according to
u±ü±->2p± = (e ± 7°-p± • 7 + m).
Putting also a = y°7, we find
|M/,| 2 ->(N 2 /2e + e-){tr p _Q . p + Q - tr p_(n • Q)p+(n • Q)},
Q = 7 j - 7<>7(7 . 1 + ) _ 7 o ( 7 . ,
)7
Q = yl* - 7°(7 * Iî)7 - 7°7(7 ' I-)The final result, obtained after making the appropriate approximations, for the
ultra-relativistic case at small angles
0 ± ~ m / e <t\,
(95.9)
will be given here. We define the auxiliary vectors
1
m
8 ± = — (p
r ± )i,
"
(95.10)
m ~
where the suffix 1 denotes the component perpendicular to the direction of k. Then
|M„P-iN ! {^|fp + 2
m 6+
+ I+
2e +
where we have used the fact that I ~ el±lq ~ eljm
terms of higher order in m/e are omitted.
The integrals I± may be expressed as
'
/ = j^-^F(-iv,
-IT
^ + I '2
+ 2 I 2e.
(95.11)
(as is seen from (95.8)), and
2e ± dp±*
1, i(p + r+ p+ • T))F(iv, 1, i(p_r+ p_ • r)) dyx.
(95.12)
t Calculations with allowance for the polarizations of all the particles are given by H. Olsen and L.
C. Maximon, Physical Review 114, 887, 1959, and in the book by Bâter et ai cited in §93.
416
§95
Interaction of Electrons with Photons
The integral J can be written in terms of the complete hypergeometric function: t
/V-2p+-q> F(-iv, iv, l,z),
J=ÛZ
_
V-2p_.qj
(95.13)
q2(P+P- ~ P+ • P-) + 2(p+ • q)(p- • q)
2
(<r-2p + -q)((T-2p--q)
The differentiation with respect to p± must be carried out with q fixed, only
thereafter putting q = p+ + p_ - k. The result, after making the approximations
corresponding to the ultra-relativistic case and the conditions (95.9), is
■=7ÄfëjlH*"a*™ + ' S »"«(«I, - m«j}.
(95.14)
with, for brevity, the notation
*±
=
1
m
I + ÔT
(95.15)
F(z) = F(-iV,iV,l,2),
F(z) being a real function. The integral I is then found immediately from (95.8).
Substituting the values of the integrals in (95.11) and thence in (95.1), we find
the required cross-section:
n
d*.±(^
)
IT \sinh 1TV
Z2ar] - ? - , S+ dÔ+ • 5- d5- • d<f> de+ x
x JF2(z)[-2e+e-(82+£2+ + 81?-) + <o\Sl + 8l)Ç+Ç- +
+ 2(eî'+ el)ô+ô-Ç+Ç- cos d>] +
+
'%* ^
^H-^+M«^+sie)+û>2d+SUDU-
- 2(el + e*)S+S-£+f- cos </>]}.
(95.16)
When v -> 0,
7TV
sinh TTv
■*!,
F(z)-»1,
F'(z)«v 2 ->0.
The expression (95.16) then reduces, as it should, to Bethe and Heitler's formula
(94.3), which corresponds to the Born approximation. It also reduces to this
formula for any v if the angles of emission of the pair satisfy the conditions
|Ô+-Ô-|<31,
|TT-4>|<1.
t The calculations are given in Nordsieck's paper quoted in (92.
Exact Theory of Pair Production
§95
417
For then q <m, so that the second term in the braces in (95.16) can be omitted
because of the extra factor (q/m)4 as compared with the first term, and in the first
term we have (since 1 - z ~ q2lm2< l)t
F(2)-*F(1)^F(-IV,IV,1,1)
1
T(l - iv)T(\ + iv)
sinh ITV
1TV
(95.17)
so that the similar factor in front of the braces is cancelled.
Let us now consider the integration of the cross-section over the directions of
emission of the pair. The integration over angles is divided into two regions I and
II, in which we have respectively
(I) 1 - 2 > 1 - 2 , ,
(II) l - 2 < l - 2 , ,
where Z\ is a certain value such that 1 > 1 - Z\ > (m/e)2. Since in region II 1 - z < 1,
q2<m2, it follows from the above discussion that in this region do- » daB s dav^,
where daB is the cross-section in the Born approximation. The integral over angles
is therefore
do-e+ = I do- = I dar + I do-y^o
i
ii
= (do-t+)B + j(d<r- dav^\
(95.18)
i
where {dat+)s is the Born cross-section (94.5) integrated over angles.
In region I we have
q2lm2 « 8l + Ô2. + 28+8- cos d>.
We shall change from the variables Ô+, 8-, d> to £+, £_, z. A direct calculation of the
Jacobian for this transformation gives
%ml (£+£-) sindV
where
\-z
= (q2lm2)M= £+ + £_ - 2£+£- + 2V[M_(1 - £+)(l - £_)] cos 4>.
t This value of the function can be obtained from QM, (e.7), which relates hypergeometric functions
with arguments z and 1 - z.
§95
Interaction of Electrons with. Photons
418
Expressing sin cf> and cos <f> in terms of the other quantities by means of this
equation and substituting in (95.16), we obtain after some simple algebra
da = A de +
2d£+ dg- dz
ÏTÎ75
[z(l-z)-(l-z)(^ + ^ l ) 2 - z ( ^ - f . ) 1
+ ^ ^ [ ( ^ , + e2-)Z + 26,6-(^ + ^ - l ) 2 ] } ,
=
/ Try \2Z2ar2e
\sinh 7ri^/ 27rcu
Finally, we replace £+ and £_ in terms of new "spherical" variables x
anc
* </^
£+ + £- - 1 = V z sin x cos i//,
£ + - £ - = V ( l - z) sin x sin i//,
0 « x ^ 2^,
0 ^ i// ^ 2TT,
2 d£+ d£_ -> V[z(l - z)] sin # cos x dx dip.
These ranges of variation of x a n d <A correspond to the range 0 to 1 for £+ and £_,
i.e. to the range 0 to °o for S+ and 8- (or, equivalently, 0+ and 0-); the rapid
convergence of the integral allows the range of variation of the angles to be
extended in this way. After the transformation, the root in the denominator
/ is elementary, and the
becomes V[z(l - z)] cos x\ the integration over x an( * <>
result is
da = 2A-27rdz(ei + e! + ^ + 6 - ) r | ^ + -^F / 2 (z)lde + .
1 20
118
^
^T~ I
^V
I
1 14
I
1 12
I
!
1 10
—
I
1 08
1 06
.,..__
|
116
f (v)
I
I
01
02
v
FIG.
18.
03
04
Brems Strahlung in the Ultra-relativistic Case
§96
419
An extra factor 2 has been included because the integration over z is to be taken
from 0 to Zu whereas, when the azimuth <j> varies from 0 to -rr and from v to 2TT,
each value of z occurs twice.
The integration over z is effected by means of formula (92.14), which, for
v' = - v (and F(z) accordingly real), becomes
F2 . z „l22 _ \ d
, + ^ F ' = -^f-(zFF').
1-z
v
v dz
The integral of this expression is Z\F(z\)F\z\)jv2. The value of ZiF(zi)«s F(l) is
taken from (95.17), and the limit of F'(z\-* 1) is given byt
\F'(Z)
= F(1-W,
l + ii/,2,z)
~-[log(l-z) + 2 / ( , ) ] ^ ^ ,
where
f(p) = \[V(l + iv) + ¥(1 - iv) - 2^(1)]
= v2t„,„L1)ïy
„=i n(n + v )
(95.19)
nz) = r(z)/r(2).
Substituting the above expressions in (95.18), we obtain as thefinalformula
dat+ = 4Z2ar2(e2 + e2. + \e+e-)\\og ^ ^ - -[ - /(aZ)l %
(95.20)
The total cross-section for pair production by a photon with energy co is
*=?Z2ar2[log^-^-/(«Z)].
(95.21)
We see that the only change in these formulae is that a universal function f(aZ) of
the atomic number is subtracted from the logarithm. Figure 18 shows a graph of
this function. For v < 1, f(v) « \2v2.
§96. Exact theory of bremsstrahlung in the ultra-relativistic
case
The matrix element for the bremsstrahlung process is
Mfi = j *ft*(a • e*) e-ik'VtV d3x;
(96.1)
t The derivation of this formula is in the Appendix to the paper by H. Davies, H. A. Bethe and L. C.
Maximon, Physical Review 93, 788, 1954.
420
Interaction of Electrons with Photons
§96
the wave functions of the initial electron (e, p) and the final electron (e\ p') include
respectively outgoing and ingoing spherical waves in their asymptotic forms. The
calculation of this integral is similar to that of the matrix element (95.2), but will not
be given here. Instead, we shall describe another way of calculating the bremsstrahlung cross-section, based on the fact that the process is quasi-classical, and
not using the explicit form of the wave functions of the electron in the field of the
nucleus (and in this sense independent of the precise form of the field potential)(V.
N. Baïer and V. M. Katkov, 1968).
In the brems Strahlung process, the nucleus transfers to the electron and the
photon a momentum q = p' + k - p. As in the pair production problem, we must
distinguish two ranges of values of the transfer qx which is transverse relative to p:
(I) m^ql><am2le2,
(II) q x ~ <ûm2le2<Zm.
(96.2)
It is evident that in region I the emission cross-section is equal to the Born value:
for these values of qx, the recoil momentum of the nucleus is unimportant, as will
be shown in §98 (see the derivation of the condition (98.10)). In region I, the
cross-section for the process is therefore the product of the exact cross-section for
electron scattering in the field of the nucleus at rest and an emission probability
which is independent of the form of the field. But since, according to (80.10), the
exact cross-section for scattering at small angles in a Coulomb field is equal to the
Born value, so is the cross-section for the whole process in region I.
Thus only region II need be considered. Small momentum transfers correspond
to the passage of the electron at large distances from the nucleus: p ~ 1/qi ~ elm2.
But, at these distances, the motion of the electron is certainly quasi-classical, as is
easily seen by direct application of the usual quasi-classical condition, QM (46.7), to
the ultra-relativistic equation (39.5).
Since the motion is quasi-classical, we can use the method already applied in
§90 for synchrotron radiation. The expression (90.7) is in this case the probability of
emission when the electron passes the nucleus once.
Formula (90.18) remains valid for the function L used in §90; the only
difference is in the form of the quasi-classical electron path r = r(0, which is used
to calculate the difference r 2 -ri.
At large impact parameters, the field of the nucleus may be regarded as weak.
In the zero-order approximation, the path is a straight line passing at a distance p
from the centre. In the next approximation, we have as the equation of motion (cf.
Mechanics, §20)
dp =
dt
pdU
r dry
where p is a vector lying in the xy-plane and perpendicular to the initial momentum
of the electron, and r on the right-hand side is to be taken as the zero-order
function:
r ~ V ( p 2 + u 2 t 2 ) « v V + f2).
§96
Bremsstrahlung in the Ultra-relativistic case
421
Hence
,,,
f dU dt
(96.3)
The velocity v(f) = p(fj/e, where the energy e depends on the magnitude but not
the direction of p, may be regarded as constant with sufficient accuracy. A further
integration then gives
r(t) - r, = v,(* - U) - j j [p(t') - p,] dt'.
(96.4)
We shall take ti = -°°, so that the quantities pi = p(-°°) s p and v = p/e are the
initial momentum and velocity of the electron.
We can put the probability (90.7) in the form
dw = \a(p)\2dikK2Tr)\
(96.5)
where
a(p) = e
Vï / R(t) exp {' 7 M "k * r(0) } d*'
^ ) = V^)a'e*Vfe'
(%.6)
and e' = e - o>, p'(0 = p ( 0 - k - The classical function p(t) is given by (96.3). If p
denotes the initial momentum of the particle, we have for a Coulombfield(U = - p/r,
v = Za)
p(,)=p
-^[Wr?) +1 ]-
and
e
p e
In terms of the change of momentum in classical scattering,
A = p(oo) - p ( - oo) = - 2pWp2,
(%.7)
we can rewrite these formulae as
r(o = (p+!A)i+A va'+p5).
6
Z£
(96.8)
422
Interaction of Electrons with Photons
§96
Now using formula (90.20) for R(t) and the expressions (96.8) for p(t) and r(t),
we can calculate the integral with respect to time in (96.6). The integration is carried
out by replacing the variable t by
è =
e
--,(<ot-k-r(t))
and using the formula
/
where Kx is the Macdonald function. There is, however, no need to complete these
calculations, since we want the expression a(p) only for small values of A (A <è m),
an independent parameter. Then we obtain
a(p)=w|DwIAxK1(x),
(96.9)
where
X= p-p-U-n-v),
n = k/co, and D is some function of p, e and k (but not of p), whose precise form is
unimportant.t Since, in the ultra-relativistic case, the photon is emitted at a small
angle 0 to the direction of the electron velocity, we have
X^Pjiioil-v+W)
or
X = p ^ T (1 + S2),
Zee
6 = Oelm.
(96.10)
It has already been mentioned that (96.5) is the probability of photon emission
in a single passage of an electron past a nucleus at impact parameter p. The
cross-section for the emission of a photon with given frequency and direction is
obtained by multiplying by tT1 dpx dpy « dpx dpy = d2p and integrating with respect
to the impact parameter:
^ = ^,||a(p)|2d2p.
(96.11)
t The spinors wt and wf may be taken as constant in the integration, i.e. the change in the electron's
polarization in its classical ultra-relativistic motion may be neglected. This can be seen from the
equations derived in §41.
§96
Brems Strahlung in the Ultra-relativistic Case
423
However, it should not be thought that this formula without the integration over
d2p would also give the directional distribution of the final electrons. The deviation
of the electron in motion in a classical path, which is uniquely determined by the
external field, is certainly not the same as the indeterminate quantum-mechanical
deviation (and the limit p'(°°) of the classical function p'(0 is therefore also
different from the actual final momentum of the electron). Consequently, in order
to obtain the angular distribution of the electrons, we must re-expand their wave
function in plane waves.
It is seen from (96.11) that a(p) is the amplitude of photon emission in a passage
at impact parameter p. The expressions (96.5) and (96.6), however, define this
amplitude only to within a phase factor, which is easily seen to be e~lk p: on
account of the time-independent term rx(oo) = p in r(t), this constant factor must
be present in Vfi(t)y and may be taken outside the time integral. Since it is not an
operator, it is not affected by the process of commutation, and the amplitude for
the emission process is thus
<Tikpa(p),
(96.12)
where a(p) is given by (96.9).
Now let the electron be described, as z -* - a>, by a plane wave with momentum
p along the z-axis. This means that the wave function of the electron as z -» - <*>, as
a function of x and y, reduces to a constant, which can be taken as unity. Then the
wave function of the electron which has passed through the field is, for z -> <»,t
00
«K°°) = S(p) = exp{- i j U{xy y, z) dz}.
(96.13)
— 00
According to the significance of the transition amplitude (96.12), the wave function
of an electron which has passed through the field and emitted a photon is
<T,kpa(p)S(p).
(96.14)
The amplitude for emission in which the electron goes to a state with definite
momentum p' is given by the corresponding Fourier component of (96.14), i.e. by
a(qi) = J V * peHk^(p)S(p)d2p
= |e-^ p a(p)S(p)d 2 p,
(96-15)
where q± is the transverse component of the momentum transfer to the nucleus; cf.
t See QM (131.4); we have in mind the analogy between equation (39.5) (in which we put p 2 ~ e2)
and the non-relativistic Schrödinger's equation (39.5a). Bearing in mind the difference in the significance
of the coefficients in these equations, it is easily seen that in our case the conditions QM (131.1) for the
formula QM (131.4) to be valid are in fact satisfied. The fact that this formula is not valid for arbitrarily
large z is unimportant, for the same reasons as in QM §131.
424
§96
Interaction of Electrons with Photons
QM (131.7). The cross-section for scattering with a given transfer qi is
dCT= 0(<
i >*>i ! (Slfr
<9616)
Let us now calculate S(p). In the case of a Coulomb field considered here, the
integral in the exponent diverges, in accordance with the phase divergence in
Coulomb scattering. The integral must therefore be first calculated between finite
limits:
R
R
= - 2v[\og(R + V(R2 + p2)) - log p]
^ - 2v log 2JR + 2v log p
(R > p). The first term, which is a constant, is unimportant, and therefore
S(p) = e" 2iHogp -p- 2, \
(96.17)
Substituting (96.9) and (96.17) in (96.15) and integrating over the directions of
the vector p in the xy-plane, we find
00
a(qx) « v j p-2,'K,0r>JI(q1p)p dp,
o
(96.18)
where Jj is the Bessel function. The factors not involving v = Za have not been
written here.
We see that the dependence of the amplitude a(qx) (and therefore of the
cross-section (96.16)) on v is contained in a separate factor. On the other hand,
when v -* 0, the cross-section must tend to its value in the Born approximation. It is
therefore immediately clear that the cross-section will differ from the Born value
only by a factor which is independent of the electron polarization and hence does
not influence the polarization effects.
The integral (96.18) can be expressed in terms of the hypergeometric function
by means of the formula
J x xKx(ax)J{(bx)xdx=—
2xfl3-x
{l
+
~p)
F
(^ '
2
' '^Tpj-
0
This gives
a(qx) « v(\ - i»(iq) 2i T 2 (l - iv)F(ivt 1 - iV, 2, 2),
(96.19)
§96
Bremsstrahlung in the Ultra-relativistic Case
425
where
(1 + S2)2,
2 = 1 - Amîf,î
8 = eBlm ;
(96.20)
here we have used the fact that, in region II (see (96.2)), the component of the
vector q parallel to p is
4
2
-q
2
- q ï ~ ^ 0 + Ô2)2.
(96.21)
This is easily proved, since in that region the angles between the momenta p, p' and
k satisfy the conditions (93.15).
The hypergeometric function in (96.19) can be reduced to the function F(z) in
(95.15) by means of the formula
F(a,b + \,c + \,z) = -^F(a,b,c,z)
+
^^F'(a,b,c,z).
The final result is then
da = dcr B -p^[F 2 (z) + ^ ^ F ' 2 ( z ) ] ,
(96.22)
where dvB is the Born cross-section (93.13) (H. A. Bethe and L. C. Maximon,
1954). When q >m2le, we have z « 1, and the whole coefficient of daB tends to
unity; in this sense formula (96.22), which has been derived for region II, is
automatically satisfied for all q ** m. When q *£ m2/e and the correction factor in
(96.22) is different from unity, the vectors p, p' and k are almost coplanar, and the
quantities 5 and 5' are almost equal; this has already been taken into account in
(96.22). Thus q2 in the expression (96.20) for z can be written as
2
2
2
-2-, = S2 + 8'2 - 288' cos <t> + ^ ^ 5 (1 + S2)2,
(96.23)
m
4e e
i.e. we can put 8 = 8' in the second term in (93.14), but not in the first term, which
does not contain a mall coefficient (~m2/e2).
To find the cross-section integrated over angles, there is no need to repeat the
integration: we can proceed as follows (H. Olsen, 1955). Various directions of p'
(for a given energy e') correspond to degeneracy of thefinalstate of the electron. It
is evident that the result of summing over states which belong to one degenerate
level is independent of how the complete set of these states is chosen. We can
therefore use, in summing over directions of p', the set of functions i//£y instead of
the tyty which are needed in calculating the differential cross-section, i.e. we can
define the bremsstrahlung matrix element as
M?/ = | *$*<« • e*)e-,fc-y.y a°x.
426
§97
Interaction of Electrons with Photons
This integral is easily seen to be the same as (M ft)* if the parameters of the wave
functions in the latter are changed as follows:
p+, p+, 6+ -H> - p, - p, - e ;
p-, p_, e- -> p', p', e';
k-> - k
and the sign of the integration variables is reversed: r - > - r .
Hence it is clear that the brems Strahlung cross-section integrated over angles
can be obtained from the integral pair production cross-section (95.20), on multiplication by
p+de+
£+ds +
(cf. (91.6)) and replacement of e+ by ~e, e- by e'. Thus we have
d<r = 4ZW5 Si ( i + i - ?)[log 2H: - ' - / ( o Z ) ] *£.
(96.24)
We see that the corrections to the Born formulae for the integral brems Strahlung
and pair production cross-sections are given by the same function /(aZ).
Formula (96.24), which does not depend on any limitations on the value of Za,
allows a passage to the classical limit (h ->0, Za -><»). In this limit, we must also put
e =*e\ Bearing in mind the asymptotic expression ¥ ( z ) « l o g z as |z|-**> and the
value ¥(1) = - C (where C is Euler's constant), we find the effective retardation
( ■ » d ^ l ^ f l o g ^ p - i - c ] ^ .
(96.25)
This expression, which does not contain ft, is the classical frequency distribution of
the bremsstrahlung intensity.
§97. Electron-electron bremsstrahlung in the
ultra-relativistic case
Electron-electron bremsstrahlung is represented by eight Feynman diagrams:
four diagrams
1 k
1
\
. — 1 —
"■
11
"
*
1
^
1
•
1
1I
1
p2
p'
n
—
-
1
—
1
1
1
]1 .
—
1
P.
1
1
1
—
P2
0
1
1
1
—
1
(97.1a)
1 — ' —
ik
^2
(97.1b)
§97
Electron-Electron
Bremsstrahlung
427
and four "exchange" diagrams obtained from those shown by interchanging p\ and
P2. Here we shall give the results of the calculations for the ultra-relativistic case
(G. Altarelli and F. Buccella, 1964; V. N. Baïer, V. S. Fadin and V. A. Khoze,
1966).t
In the laboratory system (the rest frame of one of the initial electrons, say the
second), the emission cross-section integrated over the directions of the photon can
be written as a sum da = da{]) + daa\ where
da(1> = 4 a r ^ ^ ^ ( - ^ a)
e \e-co
m }
m2
. 3m
+ -r
2a>
^=^-|)(log^^^-i);
e
3/\
mco
2/
m t m 2 \ , le
+ — 5 log
a) 4co /
m
. ) 2 2mdiù \IA
da{2)(2 = jar]
3 - \[4
IA
a)
eu t \
+
m
\
m }
5m2] i
-5—7 \ f or œ 2* -2my
öa>J
<an i\
(97.3)
(O
.3
_ 2
m /L4(u
2a>
^ . 2CÜ 4co2l
2+
~ . 2m
2+
(0
(97.2)
co
m \
—1
r
5- > for to ^ im
m m )
/0-7 /l^
(97.4)
(e being the initial energy of the first electron).
These formulae are accurate as far as terms of relative order m/e. To this
accuracy, it is found that the contributions to the cross-section from different
diagrams do not interfere, and in this sense da{]) and dcr(2) correspond to emission
by each of the two electrons: the fast electron and the recoil electron respectively,
diagrams (97.1a) and (97.1b).
The "exchange" diagrams give the same contribution to the cross-section as do
the "direct" diagrams. Since the electrons are identical, the total contribution from
the direct and exchange diagrams has to be halved, and we may therefore consider
only the contribution of the direct diagrams and ignore the identity of the particles.
For electron-positron collisions the exchange diagrams are replaced by annihilation
diagrams, but their relative contribution is of order mis and therefore negligible.
Hence the bremsstrahlung cross-sections are the same, to the accuracy indicated,
in electron-electron and electron-positron collisions.
For eu > m, the ratio
da{2)
m^
t
37,'~,*1>
i.e. the emission from the recoil electron is small compared with that from the fast
electron; when this ratio becomes of the order of m/e, formula (97.3) is of course
no longer meaningful. When a> <m, on the other hand, the two parts of the
t The calculations are given in the book by Barer et ai cited in §93.
428
Interaction of Electrons with Photons
§97
cross-section are almost comparable:
j
(i)
is
2 d(o.
2e
d(TK) = far; — log
,
«)
mw
d*™ = $arl^
0>
a> <3 m.
(97.5)
log ^ ,
O)
For formulae (97.2M97.5) to be valid, it is necessary that at least one of the
electrons should remain ultra-relativistic after emission of radiation, i.e. the photon
frequency must be sufficiently far from the hard boundary of the spectrum (the
maximum frequency a w that can be emitted). The final energy of the electrons is
least, and the photon energy greatest, when both electrons move, after emission, in
the direction of the photon at equal speeds. The conservation laws then give
e + m = a w + 2e ',
|p| = a w + 2|p'|.
Hence, eliminating e' and p', we have
(e + m - o w ) 2 - (|p| - o w ) 2 = 4m 2
and
,%
_
m
(e~m)
(Qn <L\
(97,6)
" - " - m + e-lpr
When e > m, a w « e. Thus formulae (97.2)-(97.4) are valid if
Wmax - o > ~ e ~ f c » > m .
(97.7)
The cross-section (97.2) for emission by the fast electron is exactly equal to that
for electron-nucleus bremsstrahlung when the nucleus has Z = 1 (formula (93.17)).
This agreement is not fortuitous, and can be explained by considering the
significance of recoil in the emission process.
In deriving (93.17) we neglected the recoil of the fixed particle (the nucleus),
replacing it by a constant external field. This was equivalent to neglecting the time
component of the momentum transfer 4-vector q = p'-p + k (the recoil energy).
We shall show that, in the ultra-relativistic case, this treatment is permissible for
electron-electron as well as electron-nucleus bremsstrahlung.
We write
- q2 = - (E' + a) - e) 2 + (p| + a» - p,)2 + (pi -
Pl)
2
,
(97.8)
where the subscripts indicate the components of the vectors p' and p (the initial and
final electron momenta) parallel and perpendicular to the direction of the photon k.
In the ultra-relativistic case the angles 0, 0' between k and p, p' respectively are
§97
429
Electron-Electron Brems Strahlung
small: 6 ^ mie, 6' ^ mle'. Hence
|pi|~|p|0~m,
(97 9)
p--w-é—ir-fl-
-
and similarly for pi and pjj.
Neglecting recoil, we have e' + o> - e = 0; the term pj + w - pj ~ m2/e, and so
-q2~(pl-pi)2~m2.
(97.10)
The energy of (electron-electron) recoil is
q0= e' + <o-e~q2l2m ~ m.
(97.11)
The change in pi due to the change in e' is negligible. The change in q2 with
allowance for recoil, which we donote by Aq2, is therefore given by the first two
terms in (97.8). Using (97.9), we have
A<i 2 «( e ' + a > - E ) ( - ^ - ^ + ^ + ^ )
\ e
E
e
e/
2
~ m • mle.
Comparison with (97.10) shows that Aq 2 «|q 2 |, and the neglect of the recoil is
therefore justified.!
The fact that the fast particle emits into a narrow cone (with aperture angle
—mle) in the direction of its motion enables us to deduce the cross-section in the
centre-of-mass system by a simple conversion of the cross-section (97.2) from the
laboratory system.$
In the centre-of-mass system the two electrons emit in the same manner, each
in the direction of its motion. (It may be noted that this gives an intuitive
explanation of the absence of interference between the radiation from the two
particles.) The energy E of the ultra-reiativistic electron in this system is related to
its energy e in the laboratory system by 2E2 = me ; the respective photon frequencies Cl and a) are related by w/e = H/E. These equations are easily obtained by
comparing the values of the invariants (P1P2) and (p\k) in the two systems. The
cross-section for emission by each electron in the centre-of-mass system is therefore
d(7(1, = d(7(2)
A
2dft£-n/
E
, E-CÏ
2\/.
4E\E-(l)
1\
,Q7 n ^
t This conclusion is, of course, valid a fortiori in the case of electron-nucleus bremsstrahlung, for
which the recoil energy qo^q2l2M - m2IM, where M is the mass of the nucleus.
t In general such a conversion is not possible, because the contribution to the spectrum in a given
frequency range d<o comes from photons emitted in quite different directions.
430
Interaction of Electrons with Photons
§97
For (97.12) to be valid it is also necessary that the photon frequency should not
be close to the boundary of the spectrum. For an ultra-relativistic particle, the
above-mentioned transformation gives immediately, when <omax ~ e,
nmax « <omaxE/e « E.
(97.13)
Thus, in the centre-of-mass system, the electrons can emit only half of their total
energy 2E. A direct calculation of nmax is easily performed by noting that, after the
emission of such a photon, the electrons will move (in that system) at equal speeds
in the direction opposite to that of the photon. We have
2E = 2E' + nmax,
2|p'| = nmax,
whence
nmax = p2/E = E-m 2 /E,
(97.14)
and in the ultra-relativistic case again (97.13). Thus formula (97.12) is applicable
under the condition
n m a x -n~E-n>m.
(97.15)
We shall now give some formulae for emission in the centre-of-mass system in
the opposite limiting case, near the boundary of the spectrum, whent
nmax-n«m.
(97.16)
Since in this case the recoil is very important, the results differ from those for
scattering by a fixed centre and are also different for electron-electron and
electron-positron scattering (V. N. Baier, V. S. Fadin and V. A. Khoze, 1967).
In electron-electron scattering, besides the squares of the diagrams (97.1), there
is also a contribution to the emission cross-section near the boundary of the
spectrum from products (interference terms) of the direct and exchange diagrams,
in which a given initial particle emits, for example, the product of the second
diagram (97.1a) and the diagram
lk
P2 _ _ ^ — _ J
I
, — p
l
P*
-
I
m
P
This is because, near the boundary, the final particles have similar momenta and
there is no reason for the exchange terms to be small. The final result for the
t The result obtained in the Born approximation is, of course, as usual valid only if the relative
velocity of thefinalelectrons is large in comparison with a. If not, the interaction of the particles in the
final state has to be taken into account.
§98
431
Emission of Soft Photons in Collisions
cross-section is
v
n
, 2 [E(n
max -n)p
da = 2ar?
Tdil
T^.
(97,17)
Umax
M
In electron-positron scattering, a logarithmically large contribution to the emission cross-section comes from the squares of annihilation diagrams, in which there
is emission by the initial particles:
k
i
- p.
_ i
lk
r— — - P..
" p . '—
t
-p;
, ( -,—.—-
-P'
P:
A
L-^_
• — r
-
• P.
(97.18)
i
m
«
i
• P;
The squares of other diagrams are significant when the accuracy is not logarithmic,
but the interference terms are small. The final result is
da = 2arAm^zM(logm+l\^L,
m
\
m
/ ilmax
(97.19)
Thus the emission in electron-positron scattering is logarithmically large in comparison with that in electron-electron scattering.
§98. Emission of soft photons in collisions
Let da0 be the cross-section for a given process of scattering of charged
particles, which may be accompanied by the emission of a certain number of
photons. Together with this process, we shall consider another which differs from it
only in that one extra photon is emitted. If the frequency co of this photon is
sufficiently small (the necessary conditions will be formulated below), the crosssection da for the second process is related in a simple manner to da0.
When co is small, we can neglect the influence of the emission of this quantum
on the scattering process. The cross-section da can therefore be represented as a
product of two independent factors, the cross-section dao and the probability dw of
emission of a single photon in the collision. The emission of a soft photon is a
quasi-classical process; the probability is therefore the same as the classically
calculated number of quanta emitted in the collision, i.e. the same as the classical
intensity (total energy) of emission dl, divided by co (= h<o). Thus
da = da0dll(o.
(98.1)
We shall show how this formula can be derived from the general rules of the
diagram technique (J. M. Jauch and F. Rohrlich, 1954).
The diagrams for the process involving an additional photon are obtained from
those for the original process by adding an external photon line which t4branches
432
§98
Interaction of Electrons with Photons
off" from some (external or internal) electron line, i.e. by replacing
■
p
.
\
by
(98.2)
■—
p
■ *
P-k
It is easily seen that the most important diagrams will be those in which this change
is made in external electron lines. For, if p is the momentum of the external line
(p2 = m2), then for small k we have also (p-k) 2 s =m 2 , i.e. the factor G(p-k)
added to the diagram is near its pole.
For an initial electron line p the change (98.2) amounts to the following change
in the reaction amplitude:
u(p)-eV(4ir)G(p - k)(ye*)u(p)
«-eV(41r)^^(7c*)u(p).
Since (yp)(ye*) = 2pe* - (ye*)(yp) and 7pu(p) = mu(p), we obtain the following
rule:
u(p)-+- « V ( 4 T T ) ^ M(p).
(98.3)
Similarly, for a final electron line p \ the replacement of
k
\
by
P'
\
P'
^_
P'+k
in the diagram implies the change
ü(p') - e V(4*)ü(p')
{
j^
(98.4)
in the amplitude.
In the rest of the diagram we can everywhere neglect the changes in the
momenta of the lines as a result of the emission of the photon k. Here it is assumed
that the photon energy o> is always small in comparison with the energies of all the
particles participating in the reaction (and in comparison with those of the hard
photons, if any, that are emitted).
Let the cross-section da0 refer, say, to the scattering of an electron by a fixed
nucleus (with possible emission of hard photons). The amplitude of this process,
which will be conventionally called "elastic", is
M}f>=û(p')Mii(p).
§98
Emission of Soft Photons in Collisions
433
Making the successive substitutions (98.3) and (98.4) and adding the results, we
obtain the bremsstrahlung amplitude for emission of the same hard photons
together with a soft photon k:t
M^-Mjr.V^^-ttg).
(98.5)
Accordingly, the cross-section is
ip'e) (pe)
d'k
da = dvt\ • Aire1
3
(P'k) (Pk) (2ir) 2tu*
(98.6)
Summation over polarizations of the photon k gives
^-•te-&]'&**
(987)
In terms of three-dimensional quantities, this formula becomes^
/ v'xn
da = a 11 ;
Vl-V'D
v x n \2d<udok A_
•=
I -r—j— dtrei,
1 - v n / 4ir<u
, ofi fi.
(98.8)
where n = k/<u, and v and v' are the initial and final velocities of the electron. We
see that the coefficient of datX is in fact the same as the classical intensity of
emission (cf. Fields (69.4)), divided by eu, as already asserted in formula (98.1).
The condition for the above formulae to be applicable is that not only is <u small
compared with e but also the momentum transfer q to the nucleus is large
compared with the change 5q in this quantity due to the emission of the soft
photon. We have
Sq = ( p ' - p - k ) - ( p ' - p ) „ = o
= 5p'-k,
where |5p'| — <od\p'\lde ~ <alv and |k| = <u. In the non-relativistic case (t> < 1), we
therefore obtain the condition
W|q|v <^ 1.
(98.9)
For scattering by a Coulomb potential (or by any potential that decreases slowly
with increasing distance) |q| ~ 1/p (where p is the impact parameter), and so this
condition can also be written as <UT < 1, where T ~ pfv is the characteristic time of
the collision.
t It should be noted that the difference term in this formula arises naturally from gauge invariance:
the reaction amplitude must be unchanged when the polarization 4-vector e is replaced by e + constant x
k.
t To derive (98.8) it is convenient to return to (98.6), putting p = (e, ev), pk = ea>(l - v • ■ ) , . . . , e =
(0, e), and then summing over polarizations by means of (45.4a).
434
Interaction of Electrons with Photons
§98
In the ultra-relativistic case, the photons are emitted chiefly in directions near v
and v\ as is seen from the denominators in (98.8). If the electron scattering angle 6
is small, the directions of all three vectors p, p \ n are close together. Then
|Sq| = | S p # H k |
and, since |q| ~~ edy we obtain the condition
(98.10)
d>-^j.
e e
Because the formulae (98.5)-(98.8) are quasi-classical, they are valid for emission by any charged particles, not necessarily electrons as assumed in the derivation. In general, when several such particles take part in the reaction, formula (98.5)
must be put in the form
M„ = M | f e V ( 4 ^ ) E z ( ^ - ^ ) ,
(98.11)
where the summation is over all the particles (with charges Ze); formulae (98.6)(98.8) are changed similarly.
In particular, in the non-relativistic case
M = M jjo e V < 4 7 r > 2
(O
Z(v'
- v) • e*.
(98.12)
mi I
(98.13)
For two particles, this formula becomes
o)
q = m(v' - v),
\ nt\
m = mxm1\{m\ + m2),
where v and v' are the relative velocities of the particles before and after the
collision. From this, on integrating |M/j|2 over the directions of emission of the photon
and summing over the directions of polarization of the photon, we find the
nonrelativistic frequency distribution of the radiation:
3ir \mi
m 2 / <*>
The above results can be generalized to the case of simultaneous emission of
435
Emission of Soft Photons in Collisions
§98
several soft photons. For each photon there is an additional factor in M/;, similar to
the coefficient of MJ?0 (98.5). This is easily seen directly for the example of two
photons, say. The lines of the two emitted photons have to be added on external
electron lines, and in two different orders, so that a diagram with external line p is
replaced by two diagrams with the lines
k
k
2 \
\
k
1 \
.
^
p-k,-k2
\
.—^
p-k,
.
k
1 \
\
and
.
^
p-k,-k2
p
2 \
\
i—^
p-k2
—
P
respectively. They contain the factors
1
1
2(pk, + pk2)2pk,
.
1
1
2(pk, + pk2)2pk2
(the denominators of the electron propagators) respectively, and their sum is
_1 1__
2pki 2pk2'
i.e. it is the product of two independent factors relating to the first and the second
photon. Then, in the sum of all the diagrams, the terms combine (because of gauge
invariance) to give the product of differences
(p'el
\p%
pefWp'ef
pkj\p%
pe\*\
pk2)'
The cross-section for the process separates into factors in accordance with the
factorization of the amplitude. The soft photons are therefore emitted independently. The cross-section for emission of n soft photons can be written
da = daA dwx... dwn,
(98.14)
where dwt, dw2,... are the probabilities of individual emission of the photons k(,
k2,
When this formula is integrated over afiniterange of values of the variables
(frequencies and directions), the same for all quanta, a factor 1/n ! must be included
in order to take account of the identity of the photons.
If the emission cross-section (98.1) is integrated over frequencies in some finite
range from a>, to co2, the resulting expression is
der ~ a log(ü>2/a)|) dat\i
(98.15)
cf. (98.8). Here it is assumed that both frequencies are soft, and the possible values
of oij are therefore limited by the condition for the method to be applicable. With
logarithmic accuracy, however, we can put o>2~ e, where e is the initial energy of
the emitting particle. The values of co\ have no lower limit, but on letting o>i-»0 we
436
Interaction of Electrons with Photons
§98
see that the cross-section for emission of all possible soft quanta is infinite. Let us
investigate the significance of this "infra-red catastrophe" (F. Bloch and A.
Nordsieck, 1937).
When
alog(e/<0!)^U
(98.16)
we have da ^ dat\. This, however, means that perturbation theory is inapplicable,
and da cannot be calculated as a quantity of a higher order of smallness than dac].
Thus in this case the small parameter must be taken as a log(£/a>i), not a.
The derivation of formulae (98.5) and (98.6) from perturbation theory is therefore invalid at sufficiently low frequencies. The classical formula for the intensity
dl (Fields (69.4)), on the other hand, becomes more nearly correct as co decreases.
Hence formula (98.1) remains valid if its meaning is made somewhat more classical.
In this formula it has been assumed that one photon is emitted. Then the energy
lost by the particle as radiation is equal to to and the ''relative energy loss
cross-section" is co dale or
daeidlle.
(98.17)
In reality, for sufficiently small co, the emission probability is not small, and the
probability of emission of two or more photons is greater, not less, than the
probability for one photon. Under these conditions, the expression (98.17) remains
valid but the classical intensity dl determines, instead of the probability of
emission of one photon, the mean number of emitted photons
dri = d//co,
(98.18)
or, in a finite range of frequencies,
n=
f
dl/co.
(98.19)
Since the soft photons are emitted in a statistically independent manner (this
being true in every approximation of perturbation theory), Poisson's formula can
be applied to the process of multiple emission: the probability w(n) that n photons
are emitted is given in terms of the mean number n by
w(n) = n n e-*/n!.
(98.20)
The cross-section for a process of scattering with emission of photons may be
written
do-= do-er w(n).
(98.21)
Since 2 w(n) = 1, do-ci is the total cross-section for scattering accompanied by the
§98
Emission of Soft Photons in Collisions
437
emission of any soft radiation. This is evident from a classical treatment; according
to perturbation theory, however, dat\ is the purely elastic scattering cross-section.
But perturbation theory is inapplicable here. Thus we find that dae\ calculated by
perturbation theory as the elastic scattering cross-section actually includes the
emission of any soft photons. The true value of the purely elastic scattering
cross-section is zero: as wi-»0, the mean number n-><», and according to (98.20)
the probability of emission of any finite number of photons vanishes.!
PROBLEMS*
PROBLEM 1. Find the spectral distribution of soft quanta emitted in ultra-relativistic electronnucleus bremsstrahlung.
SOLUTION. Integration of (98.8) over dok gives
da = aF(|)(do)/o)) daei,
(1)
where
^^[^î^log^W^-,!))-!],
(2)
p being the electron momentum and 0 the scattering angle. In the ultra-relativistic case, the most
important range of angles is
m2<M)le3^e<mle;
(3)
the lower limit is given by the condition (98.10), and the upper limit is discussed below. Here
£ « e 0 / 2 m « l , so that
F(|)~(8/3ir)£ 2 ,
and the electron-nucleus elastic scattering cross-section is (see (80.10))
dae,«4ZV,^4.
e 6
(4)
The integral
a) J 0
diverges logarithmically; it is cut off below at angles 0~~m2u>/e3 and above at { — 1, i.e. at angles
0~~mle. When £-*<*>, F-(4/7r)log£ and the integral converges. Thus we have, with logarithmic
accuracy,
da^ZV2-^ log A
(5)
which agrees with the logarithmic part of formula (93.17) (where we must put e « e')- Non-logarithmic
accuracy can be achieved only by going beyond the quasi-classical range.
t We shall return to a more detailed discussion of this in §130, in connection with radiative
corrections.
t The following applications of formula (98.7) are due to V. N. Baler and V. M. GalitskiT (1964).
438
Interaction of Electrons with Photons
§99
PROBLEM 2. For a collision between two ultra-relativistic electrons, determine (in the centre-ofmass system) the cross-section for simultaneous emission of two soft photons in opposite directions at
small angles to the electron momenta.
SOLUTION. Photons moving in opposite directions are emitted by différent electrons, each in the
direction of its motion. The cross-section for simultaneous emission is
,
da=datraF(^aF(0—,
(x)2 I
(0\
(6)
£ = (e/m)sin20,
where e is the energy of each electron, 8 the scattering angle in the centre-of-mass system; 0 is the
same for each electron. No factor 2 is needed in the cross-section, since the photons are certainly
emitted in different directions. The cross-section for elastic scattering of the electrons through small
angles in the centre-of-mass system, in the ultra-relativistic case, is the same as (4); cf. (81.11). Unlike
(1), the cross-section (6) behaves as Odd when 0->O, and the integral therefore converges. On the one
hand, this enables us to extend the integration to 0 = 0, without any difficulty that the method might
cease to be applicable. On the other hand, the main contribution to the integrated cross-section now
comes from the region 0 ~ m/e, not 0 <^ m/e, and so the exact expression (2) has to be used. The result
of integrating the cross-section over scattering angles is
a(7W|W2= — [5 + 2£(3)]r,a
7T
=
û)l
0>2
5 . 9 ^ ^ ^
an
a>2
the value of the Riemann zeta function being £(3) = 1.202.
§99. The method of equivalent photons
Let us compare two processes described by the diagrams
(n)
i
, V
1.
(99.D
where the circles represent the whole of the internal parts of the diagrams. Diagram
(a) represents a collision between a photon k (k2 = 0) and a particle having 4momentum q (and mass m; q2 = m2). The system resulting from the collision is a
particle or group of particles having total 4-momentum Q. Diagram (b) represents a
collision between the same particle q and another particle having 4-momentum p
and mass M (p2 = M2). After the collision, the latter particle has 4-momentum p \
and the same system Q is formed. The second process may be regarded as a
collision between the particle q and a virtual photon emitted by the particle p and
having momentum k = p-p' (k 2 <0). If \k\2 is small, the virtual photon is not
greatly different from a real photon. Such a situation is evidently possible in
collisions of very fast particles: the electromagnetic field of a charged particle
moving with v — 1 is almost transverse, and therefore has properties similar to
§99
The Method of Equivalent Photons
439
those of the field of a light wave. Under these conditions, the cross-section for
process (b) can be expressed in terms of that for process (a).t
We shall thus suppose that the particle M is ultra-relativistic, with energy (in
the rest frame of the particle m) e > Af. If the masses of the colliding particles m
and M are different, we shall take the case where m < Af.
The amplitude of process (a), which involves a real photon, can be written
M^ = -eV(47r)(^/^),
(99.2)
where e^ is the photon polarization 4-vector and JM the transition current corresponding to the vertex (the circle) in the diagram. The amplitude of process (b) is
Mfi = Ze2^(jjn,
(99.3)
where jM is the transition current of the particle m (the lower vertex in the diagram),
and Ze is the charge on this particle. The current J is a function of k = Q - q, and
is therefore not the same in the two cases, since k2 = 0 in (99.2) and k2 ^ 0 in (99.3).
But if, in the second case,
|k 2 |«m 2 ,
(99.4)
we can here also take J for k2 = 0.
The change in the momentum of the particle M when a virtual photon is
emitted, p - p' = k, is small in comparison with its initial momentum |p| * e ; w e can
therefore put p = p' in the transition current j. That is, the motion of the particle Af
may be regarded as uniform motion in a straight line. Since such a motion is
quasi-classical, the corresponding current is independent of the spin of the particle:*
j* = 2p*.
(99.5)
The condition for the current to be transverse (jk = 0) now gives E<O - pxkx = 0,
the x-axis being taken in the direction of p. Hence
a) = vkx,
(99.6)
where v = pje is the velocity of the particle Af. Since
- k2 = - to2 + k\ + ki ~ (o2(\ - v2) + ki,
(99.7)
where kx is the component of the vector k transverse to the x-axis, the condition
t The method given below is due to C. F. von Weizsäcker and E. J. Williams (1934); the basic idea
had been stated earlier by E. Fermi (1924).
t When the wave functions are normalized to one particle in unit volume, the current jM =(l,v),
where v is the velocity. We have, however, decided (§64) to omit the normalization factor 1/V(2e) in the
wave functions. Accordingly, j" must include a further factor 2e, and this gives the expression (99.5).
440
Interaction of Electrons with Photons
§99
(99.4) is equivalent to the inequality |kj <è m and to a considerably weaker
inequality for eu: <o <è m/V(l - v2).
From the condition for the current / to be transverse (Jk = 0) we have, using
(99.6),
J — II. A.
L
'
l
We therefore obtain for the scalar product \J
jj = 2(J0e - JxPx)
Ä2£(ki.Ji +
^ M
(99 . 8)
The product Je in (99.2) can be expanded by taking the polarization 4-vector of
the real photon in the three-dimensionally transverse gauge: ek = - e * k = 0,
whence ex Ä - ei • kjo). Then
/e = - e 1 - ( j 1 - ^ J x ) .
(99.9)
The expressions (99.8) and (99.9) are proportional if the second terms in the
parentheses are negligible. Since the current J pertains to the upper vertex of the
diagram (99.1b), it does not depend on the direction of p; hence Jx and J ± must be
taken to be quantities of the same order. For the terms in question to be negligible,
therefore, we must have |kj<^û> and o> < e 2 |kJ/M 2 ; these conditions are compatible with the previous ones on k± and co.
Assuming that in (99.9) the photon is polarized in the plane of x and k (so that
e±||k_L) and noting that the conditions stated imply that el *** e2 = 1, we now have
M, = M K > ^ £ ^ M .
~ K
(t)
(99I0)
In accordance with the previous discussion, the following conditions are here
assumed satisfied:
|kjj <* <o <* m-y,
o)/72<|ki|<m,
(99.11)
(99.12).
with the notation
y = e/M = l/V(l-t> 2 ).
From this we can find the relation between the corresponding cross-sections.
According to the general formula (64.18) we have (in the rest frame of the
§99
The Method of Equivalent Photons
441
particle m)
der, = |MK)|2(2TT)4S(4)(P/ -
Pu-^dpo,
where dpQ represents the statistical weights of the particles Q. Using (99.10) and
(99.7), we find
d<7 = dovn(k)d 3 p',
(99.13)
where
Here dur is the cross-section for process (a), resulting from a collision between a
real photon and a particle at rest, in which a system of particles Q is formed which
have momenta in certain ranges; der refers to the process (b) of formation of the
same system Q when a fast particle (of mass M ) collides with the same particle at
rest, loses momentum p - p' = k, and remains in the range d3p' of values of p'. The
factor n(k) in (99.13) may be interpreted as the number density (in k-space) of the
photons equivalent to the electromagnetic field of the fast particle.
The integration over d*p' is equivalent to one over d3k = du>d2kL. On integrating over d2fcj., we obtain the cross-section for a process in which the total energy E
of the system of particles Q lies in a given range dE = do> (E - m = e - e' = a>,
where e and e' are the initial andfinalenergies of the particle M). Integration over
the directions of k± signifies averaging over the directions of polarization of the
incident photon (and multiplying by 2TT). The result is
d<7 = n(û>)dcr,dû>,
(99.15)
where
n(w) = J n(k)2irk± dkL
=
2ZV f
k\dk±
™ J (kl +(û2ly2)2'
The integral over dkL diverges when k± is large, but the divergence is only
logarithmic. This enables us (within the range of validity of the method) to obtain a
result in the logarithmic approximation: it is assumed that not only the argument of
the logarithm but the logarithm itself is large. To this accuracy, it is sufficient to
take as the upper limit of integration k±mM — m, the upper limit of the inequality
(99.12). Integration then gives for the frequency distribution of equivalent photons
442
§99
Interaction of Electrons with Photons
(in ordinary units)
log^-—.
n(a>) dio=-Za
IT
flù)
(99.16)
ù)
The approximation used here signifies that the numerical coefficient in the
argument of the logarithm remains indeterminate. The inclusion of such a
coefficient would mean the addition of a relatively small quantity (~1) to the large
logarithm and would be superfluous having regard to the accuracy of the method.
PROBLEMS
PROBLEM 1. From the photon-electron scattering cross-section, find the bremsstrahlung crosssection in a collision between a fast electron and a nucleus.
SOLUTION. In the frame of reference K\ in which the electron is at rest before the collision, the
process may be regarded as the scattering by the electron of the equivalent photons of the field of the
nucleus.t According to (86.10) the cross-section for scattering of a photon by an electron in the frame
Kx is
A /
*\
2 mdœ[ [coi (üi (m
d<7sc(a>i,û>i) = Trre
2— — + — + ( —
cüi
Lan
<*)\
\(o\
m\2
(\
) -2m( —
o)\/
\o)\
1 \]
M,
(1)
cüj/J
where w\ and <o\ are the initial and final energies of the photon in this frame. The bremsstrahlung
cross-section in the frame K\ is
Tbr((0\) == I I dcji • n(u)]) d<7Sc(û>i, <*>!),
dabr((o{)
(2)
where n(o>i) is the function (99.16). Since the cross-section is invariant, the change to a frame K in
which the nucleus is at rest involves only a change in the frequency o)J. The frequencies a>i and o>' in the
frames K\ and K are related by the Doppler formula
OJ' = 70)1(1 - v cos 00,
7 = 1/V(1 - v \
(3)
where 6\ is the scattering angle in the frame K\. The same angle relates o>i and o>i according to (86.8):
(4)
A - — = —(1-cosfli).
(o\
d)\ m
From (3) and (4) we have
toi = ù)\e'le,
(5)
where e (-my) and e' are the initial and final energies of the electron in the frame K ( e - e ' = <i>').
Substituting (5) in (1), we find
.
2
d(T»c = TTTt
m du> (e , e , m eu
£(ÜI
I — + — + ,1 7
\e
e
E o)i
2m<o \
o>i£
7 Y
/
This expression is to be substituted in (2) and the integration over do>i carried out with o>' (i.e. e') fixed,
the range being from o)i,min = mo//2e' to û)i,max = lew'lm; these values are given by (3) and (4) with
0i = O and 8] = IT. Because the integral converges rapidly for large o>i, the main contribution to it comes
t The scattering of virtual photons by the nucleus (in the rest frame of the nucleus) is excluded by
the large mass of the latter: the scattering cross-section tends to zero with increasing mass of the
scattering particle.
§99
The Method of Equivalent Photons
443
from the range of coi near the lower limit, i.e. we may put coi.max-»00. Calculating the integral with
logarithmic accuracyt, we have
, 2 do)' er (e e' 2\. EE'
.
do-br = 4reaZ7 — 7 — I -7 +
T log ;.
w e \e
e 3 / m<o
For this result to be valid, besides the condition e > m (ultra-relativistic electron), the condition
(99.11) must also be satisfied: the frequencies Q>1 ' CO l, niin
important in the integration must be <E. Hence
E - s' = io' < EE'lm. Under these conditions the result agrees (to logarithmic accuracy) with (93.17), as it
should.
PROBLEM 2. The same as Problem 1, but for electron-electron bremsstrahlung.
SOLUTION. In this case, the virtual photon can be scattered either by the fast electron or by the
recoil electron; the photons equivalent to the field of either electron are scattered by the other. The
scattering of virtual photons by the fast electron gives the cross-section dcrtV, which is equal to the
cross-section for an electron and a nucleus with Z = 1.
The scattering of virtual photons by the recoil electron gives a cross-section
diTbr* =
I dit) * n(ù)) dcTsc(u), Co'),
with do-sc(o>, w') given by (1) with the appropriate change of notation for the frequencies. The range of
values of co for given co' is (cf. (4))
Ü/^CÜ^00
foro/>2m,
co' ^ 0) ^ (i)'l{m - 2(x)') for co' < im.
When co' < Im, integration with respect to co gives
. (2> 16 2 da)' I
w' o>' \ , e
dab/ = Tar* —7- 1
+ —2 log —,
co \
m m I a)
in agreement with (97.4). But when to' > \m we must distinguish the cases co' ~ m and to' ~ e > m. In the
former case,
. (2) 2 2 m dco'/
d o i r = 30tre
co
7T-
4
V
m
m \.
; + 7-^7
co
4co /
E
log — ,
m
in agreement with (97.3); in the argument of the logarithm we have, with sufficient accuracy, replaced
e/co' by Elm. In the case co' — e, the method of equivalent photons is not valid for calculating derb?. The
frequency co of the virtual photons takes values beginning with co', and the condition (99.11) is therefore
not satisfied when <D == w' — e.
PROBLEM 3. Determine the total pair production cross-section in a photon-nucleus collision from
the pair production cross-section in a collision between two photons.
SOLUTION. The energy of the photon in the rest frame K of the nucleus is co ^> m. If we change to a
frame Ko in which the nucleus moves to meet the photon at a speed t;o such that 1/V(1 - vl) = 5co/m, then
in this frame the photon energy is
coo = co ^
1 — l>0
n
1 -,//i
2^
_ 2\ Ä 2C0 V ( l - vo) = m.
The required cross-section a is calculated in the frame Ko as the pair production cross-section in
collisions between an incident photon co0 and the equivalent photons of the nucleus, whose energy we
denote by co':
& — I tfyyrt(<*>') d(o\
t This means that, by one integration by parts, the term containing the large logarithm is separated
and the remaining terms then neglected. This operation is equivalent to taking the logarithm log(e/coi)
outside the integral, with COl — COl, min>
444
§100
Interaction of Electrons with Photons
where cryy is the cross-section for pair production by two photons and is given by §§88, Problem,
formula (1), with
v = V ( l - m2/u>o<*>') = V ( l - m/o>').
Changing to the variable t; instead of a/, we have
i
a = 2r\aZ I v log[m(l - u 2 )/m]{(3 - vA) l o g y ^ - 2v(2 - v2)] dv.
o
Because of the convergence at the upper limit, the integral may be taken over the whole range from
the reaction threshold <ü'=m (v = 0) to Û>' = <» (V = 1) and with logarithmic accuracy (replacing
log[cu(l - v2)lm] by its value f or t; = 0 and taking it outside the integral). The result is
<r = $aZ 2 rîlog(a>/m),
in agreement with (94.6); this formula is valid when log((j/m)> 1.
§ 100. Pair production in collisions between particles
Electron-positron pair production in a collision between two charged particles
is described by diagrams of two types:
(100.1)
p,
-p*
p-
-P+
(a)
(b)
The two upper continuous lines in each diagram correspond to the colliding
particles, and the lowest line to the pair formed.
Let us consider a collision of two heavy particles (nuclei) in the ultra-relativistic
case. The change of the motion of the particles themselves in such a collision may
be neglected, i.e. they may be regarded as external-field sources.t This corresponds
to two diagrams of the first type:
|q(,)
t
|q(2)
|q
i
i
P-
i
P
(100.2)
-P«.
where q(1), q(2) are the "momenta" of the Fourier components of the fields of the
two particles.
The potential AM = (A0, A) due to a classical particle moving with a uniform
t The collision of two light particles (electrons), where the change in the motion cannot be
neglected, is a considerably more complicated case; see the book by Bafer et ai cited in §93.
445
Pair Production in Collisions
§100
velocity v satisfies the equations
DA 0 = - A-rrZebix - vf - r0),
DA = - 47rZev8(r - \t - r0).
Its Fourier components are
Ao(cu, k) = -^-^2
e~ikr° 8(<o - k • v),
and similarly for A(tu, k). In four-dimensional form,
A»{q) =
-^-^-e^WôiUq),
where U is the 4-velocity of the particle, and the 4-vector x0 = (0, r0). If nucleus 1 is
at rest at the origin (r^ = 0), then p ss r^2) is the impact parameter vector (in a plane
perpendicular to the direction of motion of nucleus 2). This expression for A^iq) is
to be used in writing analytically the diagrams (100.2).
There is, however, no need to use this method for the actual calculations in the
present case. The pair production cross-section may be determined by the method
of equivalent photons, using the already known photon-nucleus pair production
cross-section. The replacement of the field of one particle (the first, say) by a
spectrum of equivalent photons implies that in the diagrams (100.2) the lines qil) are
regarded as real-photon lines. The two diagrams then become identical with the
diagrams corresponding to pair production by a photon at nucleus 2. When e+,
e- > m, the cross-section for the latter process is given by (94.5). Multiplying this
cross-section by the spectrum (99.16) of equivalent photons of the first nucleus, we
obtain (with logarithmic accuracy) for the differential cross-section for pair
production in a collision between particles
da = -r2€(Z{Z2ay
77
(£++£_)
4
(e% + e 1 + je+s-) log
,c^loge
m ( e + + E-)
/e »
£+ + € -
(100.3)
where y = 1/V(1 - v2)> 1.
Here it is assumed that
m<§e+,e-<^my;
(10&4)
the right-hand inequality is the condition for the method of equivalent photons to
be applicable. The range defined by the inequalities (100.4) is the same as the
electron and positron energy range which is important in the integration of (100.3).
On integration over de+ or de- for a given sum e = £+ + e_ (>m), the important
range is the one near the upper limit; omitting terms which do not contain the large
§100
Interaction of Electrons with Photons
446
logarithm, we find
A
56 2 / 7 x2i
e i wy de
der = — re(Z7 {Z2ay log — log —*- —.
VÎT
m
e s
The integral with respect to e over the range (100.4) diverges as the cube of the
logarithm, but only as the square of the logarithm at the boundaries of the range. In
the logarithmic approximation (log y > 1), therefore, the range (100.4) is in fact the
most important one, and the integral can be taken over the range from m to my.
Since
y
J log £(log y - log O y = 6 log3 y,
i
the total pair production cross-section is
a = 2 ^ r2e(Z,Z2a)2 log3 ^
_ ^
(100.5)
(L. D. Landau and E. M. Lifshitz, 1934).
Let us now consider the case of non-relativistic velocities of the colliding
nuclei. The change in their motion due to their interaction then becomes important,
and the main contribution to the pair production cross-section comes from
diagrams of the second type in (100.1). There are four such diagrams: two of them
are
■
P', — — I
!
P,
Pi
"Pi
T — — P2
(100.6)
'k
p_
-
1
r
p^
and the other two are similar except that the virtual photon Ic (which produces the
pair) is emitted by the first nucleus and not by the second.t
We shall suppose that the energy of the pair is small compared with the kinetic
energy of the relative motion of the nuclei in their centre-of-mass system:
e++z-<\Mv\
(100.7)
where v is the initial relative velocity and M = M\M2I(MX + M2) is the reduced mass
of the nuclei. Then the reciprocal effect of pair production on the motion of the
nuclei can be neglected. If the electron-positron line in the diagrams (100.6) is
omitted, the remainder will represent the emission by the colliding particles of a
t Altogether 36 diagrams correspond to pair production in a collision between two electrons:
2! x 3! = 12 diagrams of type a, differing by interchanges of the two initial and three final electrons, and
2 x 2! x 3! = 24 diagrams of type b, obtained in a similar way from the two diagrams (100.6).
Pair Production in Collisions
§100
447
low-frequency virtual photon ((o = e+ + e-). Thus we return to the situation discussed in §98 for the emission of a real soft photon, and can use the formula (98.13)
derived there for the non-relativistic case (except that the amplitude V(47r)e* of
the real photon will be replaced by the virtual photon propagator).! Thus the
amplitude of the whole process of pair production becomes
Mfi = Mf ^ ( f ^ - | ^ ) q A D A , ( k ) [ - i e ( û - 7 ^ . ) ] ,
(100.8)
where q = (0, q), q = M (v' - v).
As usual, the photon propagator in the non-relativistic case is to be taken in the
gauge (76.14). From the amplitude (100.8) we find the cross-section for the process:
vdnP\fi" 1 ^
da = datl • W # " # V Ô
\M\
M2J
2e+ - 2E-{2TT) œ (<o - k
(4ir)2|ö-Y • Q"4\
)
(100.9)
where
o) = 8+ + e-,
k = p+ + p-,
Q= q
CO
j k(q • k);
dac\ is the cross-section for elastic scattering of one nucleus by the other, in their
centre-of-mass system, and is given by Rutherford's formulait
dcTei = 4(Z,Z2e2)2M2 do/q4
~A{ZxZ2e2)ldq\di}l\
v q
(100.10)
the last equation assumes that the deviation of the nuclei from their original
direction of motion (the x-axis) is small. Substituting this expression in (100.9) and
summing over polarizations of the pair in the usual manner, we obtain
da = (z,z,ey£(!r:|)2*
x U(( 7P . + mXT • Q)(7P. - mXï • ^US^ÄJ-^
<1001,)
The remaining calculation is made in the approximation in which all the
t In the non-relativistic case, the photon momentum is small in comparison with the change in
momentum of the radiating particles (|Sp| — a>lv), and can therefore be neglected, in comparison with Sp,
even when the photon energy is not neglected. This applies a fortiori here to the virtual photon, for
which the four-dimensional square k2 = (p+ + p-f > 0, so that |k| < tu. Under these conditions there is no
difference between real and virtual photons, and the use of formula (98.13) is thereby justified.
t The diagrams (100.6) are shown on the assumption of the Born approximation for scattering of
nuclei. But, since Rutherford's formula is exact (for Coulomb interaction), the validity of the results
obtained does not in fact depend on the fulfilment of the condition for the Born approximation to be
valid.
448
§100
Interaction of Electrons with Photons
logarithms occurring in the integration are assumed large. We shall see that, to this
accuracy, pair energies e+, e > m and angles 6 between p+ and p_ such that
(100.12)
mle<6<\
are the most important. With the appropriate approximations, the calculation of the
trace in (100.11) gives
tr{...} = 4[(e + e- - p+ • p_)(q2 - ^ ^ ) +
(q • k)2 - ^
+ 2(p+ • q)(p- • q) + ^
(e+q
. p_ + e _ q .
p+)j,
where we can also put |p+| = e+, |p-| = e_. In the denominator,
,.2
L-2-_„
-
n2 , „ 2 ( g +
+
g-)
Integration over the directions of p+ and p_, for a given angle between them, gives
da = 3^(Z,Z 2 e 2 ) 2 £ ( ^ - Ä ) V - + el) de, de. x
[e2+m\e+
+ e-yieUlY
q2
The form of the dependence on 0 confirms the hypothesis (100.12), and
integration with respect to 0 gives log[e+e-/m(e+ + e-)]. Integration of the last
factor in (100.13) is from qy = qz = 0 to V(qJ + q]) — \/R, where Ä is a quantity of
the order of the radius of the nuclei (corresponding to the smallest impact
parameters; see below). This integration gives
[IT log(q2 + q) + qï)]$:£#* ~ 2ir log ^ The total energy of the pair, equal to the change in the energy of the nuclei, is
e s ( e + + e-) = iM(v' 2 -v 2 )« Mv(v'x- vx) = vqx,
whence qx = e/v. Thus we find
**=^(Z'Z* > -7- (JE " w ~r~Iog silog -sr de * d£ -
§101
Emission of a Photon by an Electron
449
and, after integration over de+ or de- with a given sum e,
*-g<*^¥(&-i$**-s;*»i;T-
<10014>
The energy e may be correlated with the impact parameter p ~ vie ; the pair
energy is of the order of the frequency which corresponds to the collision time.
Hence the logarithmic divergence on integration over de in (100.14) implies a
similar divergence with respect to impact parameters. This means that large values
of p are important (and this, incidentally, justifies the use of the cross-section
(100.10) for scattering in the purely Coulomb field of the nucleus). Accordingly, the
important range of energy is given by m<e<vIR. Integration of (100.14)gives the
total pair production cross-section; the final result is (in ordinary units)
(E. M. Lifshitz, 1935).t
§101. Emission of a photon by an electron in the field of a
strong electromagnetic wave
The application of perturbation theory to processes of interaction between an
electron and a radiation field requires not only that the interaction constant a
should be small but also that the field should be sufficiently weak. If a is the
amplitude of the classical 4-potential of an electromagnetic wave field, the characteristic quantity in this respect is the dimensionless invariant ratio
£ = <?V(-a2)/m.
(101.1)
In this section we shall consider emission processes occurring in the interaction
of an electron with a field of a strong electromagnetic wave, for which £ can have
any value. The method used is based on an exact treatment of this interaction; the
interaction of the electron with the newly emitted photons can, as before, be
regarded as a small perturbation (A. I. Nikishov and V. I. Ritus, 1964).
Let us consider a monochromatic plane wave, say a circularly polarized one. Its
4-potential may be written in the form
A = a\ cos <f> + a2 sin <f>, d> = kx,
(101.2)
where kM = (w, k) is the wave 4-vector (k2 = 0), and the 4-amplitudes a, and a2 are
equal in magnitude and orthogonal:
a] = al = a2,
aia2 = 0.
t A numerical error was corrected by L. 6. Okun' (1953).
450
§101
Interaction of Electrons with Photons
We shall assume that the Lorentz gauge condition is applied to the potential, so
that a\k = a2k = 0.
The exact wave function for an electron in the field of an arbitrary plane
electromagnetic wave has been derived in §40, and is given by formulae (40.7) and
(40.8). We shall, however, change the normalization by making i//p correspond to
unit mean spatial number density of particles, in the same way as the wave
functions of free particles are normalized J:o "one particle in unit volume". Since
the mean density for the function (40.7) is j 0 = WPo, in order to obtain the required
normalization this function must be multiplied by V(p0/qo), i.e. the factor 1/V(2p0)
in (40.7) must be replaced by 1/V(2q0). For a wave with the 4-potential (101.2), we
find
</<P = [l + 2 ^ ) {(7k)(?fli) cos <f> + (yk)(ya2) sin </>}] ^ ^
x
x expj - ie ^ ^ sin <f> + ie ^ ^ cos <f> - iqxV
(101.3)
where
(101 4)
^-"'-''mk'-
-
According to (40.14), the 4-vector q is the mean 4-momentum of the electron; we
shall call it the quasi-momentum.
The S-matrix element for a transition of the electron from the state \}/p to the
state i//p with emission of a photon having 4-momentum k* = (o/,k') and polarization 4-vector e' is
Sfl = -iej
Mye'*WP y^Fj
d4x.
(101.5)
The integrand in (101.5) is a linear combination of the quantities
0-i<*\ sin <£+ia2cos 4>
sin4> • e - i a ' s i n * + i a 2 c o s *,
where
(a\p
axp'\
(a2p
a2p'\
/iri.
,.
These quantities, together with the factor exp[i(Jc' + p ' - p ) x ] , give the whole
dependence of the integrand on x. We expand them in Fourier series, denoting the
§101
Emission of a Photon by an Electron
451
expansion coefficients by Bs, BXsy B2s respectively; for example,
p~iax
sin<J> + ia 2 cos4>
=
^ - i V f a f + a J ) sin(<f>-<*>0)
= 2 B,«""*.
These coefficients can be expressed in terms of Bessel functions by the formulae
Bs=Js(z)eis\
Bls= J[/,+i(z) ei(s+1)*° + /.-.(z) e««"1^],
(101.7)
B2i = ^ [J,+,(z) e'c*+,*> - Js-{(z) e««"1^],
where
z = V(aï+a2),
cos</>o= ai/z,
sin <£o = a2/z.
The functions Bs, Bis, B2s are related by
a,B„ + a2B2, = sB„
(101.8)
which follows from the familiar relation
/J_1(2) + / 1+ ,(z) = 2s/,(z)/z
between the Bessel functions.
The matrix element (101.5) then becomes
" = (2„- ■ ll ■ 2 W ? "H*2»)4»^* + « ~ "' - k'>;
S
(1019)
we shall not give here the fairly complicated expressions for the amplitudes M \\\
Thus Sfi is an infinite sum of terms, each corresponding to a conservation law
sk + q = q' + k'.
(101.10)
Since
q2
= q'2 = m\\ + £2) = m J
(101.11)
(cf. (40.15)), and k2 = k'2 = 0, the equation (101.10) can be satisfied only if s ^ 1. The
5th term of the sum describes the emission of a photon k' by the absorption from
the wave of s photons with 4-momenta k. The form (101.10) shows that all the
kinematic relationships which occur for the Compton effect will apply to the processes considered here if the electron momenta are replaced by the quasi-momenta q
and the incident photon momentum by fhe 4-vector sk. In particular, the frequency
452
Interaction of Electrons with Photons
§101
of the emitted photon in the frame of reference where the electron is at rest on
average (q = 0, q0 = rn *) is
<o' = TT7—7-^71
SÏ»
(101.12)
1 + (s<i)lm *)(1'~ cos 0)
where 0 is the angle between k and k'; cf. (86.8). We may say that the frequencies
(o' are harmonics of a>.
In the notation previously used (§64), the amplitude of the process of emission
of the sth harmonic is M)f, and the expression
dw
(2,r)4s<<i<s £+ q k,) <i01 i3)
- -w? ^ . y t ^
' « -'-
-
gives the corresponding differential probability per unit volume and unit time.t
The amplitudes M ^ have a structure similar to that of the scattering amplitudes
with plane waves, ü(p')... u(p); the operations of summation over polarizations of
the particles are therefore carried out in the usual manner. After summation over
the polarizations of the final electrons and the photon and averaging over the
polarizations of the initial electron, we have
em
dk'dq'8i4)
4TT
q0<ïoû>
dW
x { - 2J](z) + e(\ + 2( fcp) ( tp>)) (J]+i
+J
'-' " 2J])}'
(10114)
In order to integrate this expression, we note that, owing to the axial symmetry
of the field of a circularly polarized wave, the differential probability is independent
of the azimuthal angle <f> around the direction of k. This fact, together with the
presence of the delta function, enables us to integrate over all variables except one,
which we take to be the invariant u = (kk')l(kp'). Then, after integration over
d3kd<t>d(qi) + ù)'), we find
ö KSK + q
q
K)
qia,
-+(7^)3-
For, in the centre-of-mass system (in which sk + q = q + k' = 0), this integration
gives 2ir|q'|d cos $IES, where Es — sco + q0 = <•>' + (jo and 6 is the angle between k
and q'; cf. the transformation (64.12). In the same system, moreover,
u
Es
,
f
- ~—nf
qi-|q'|cosea~ lIf
, c o s _=
Esdu
w •
w
|q'|(l + ii)2-
-« v
t It should be noted that the normalization of the functions i/fp to unit density corresponds to
normalization by the delta function "on the q/2ir scale"; cf. (40.17), where the factor q0/po on the right
will now be absent. It is for this reason that the number of final states of the electron must be measured
by the element d3q7(27r)3.
§101
Emission of a Photon by an Electron
453
The range -1 «* cos 8 «s 1 corresponds to
0^u^us
=
E]lml-\
= 2s(kp)/m*;
in making the transformations it must be remembered that kp = kq.
Thus the total probability of emission from unit volume in unit time is
0
(101.15)
wheret
u=(kk')l(kp'),
us=2s(kp)lml
/r
/
M
1
(101.16)
When £ <^ 1 (the condition for perturbation theory to be valid), the integrand in
(101.15) can be expanded in powers of £ For example, the first term in the
expansion of W\ is
w.-sja!^ f f2+ "f__4JL(i_JL)]4l4p0
J L
0
1+w
ui V
Mi/J
with Mi *» 2(kp)/m2. This result agrees, as it should, with the Klein-Nishina formula
for the scattering of a photon by an electron: putting in (101.17) - a 2 = 47r/co,
£2 = 47re2lm2œ9 and dividing by the incident flux density (64.14), we return to (86.16)
(the integrated scattering cross-section is independent of the initial polarization of
the photon).^
The expression for the probability of emission of the second harmonic (the first
term in the expansion of W2 for £ < 1) is
t To calculate z, we first note that
z2 = (ûiQ)2 + (a2Q)2 = a2Q2,
where Q = qKkq)-q'l(kq'). This is easily shown by choosing a frame of reference in which (aOo =
(ai)o = 0 and the vectors ai, 82, k are along the axes x\ x2, x\ and noting that Qo = Q3 because kQ = 0.
t This value of a2 corresponds to normalization of the 4~potential to "one particle in unit volume".
To determine it, o> must be equated to the energy of a classical field with the (real) 4-potential (101.2).
§101
Interaction of Electrons with Photons
454
PO
J (1 + W ) 2 U 2 \
U 2 /L
0
^e2m2?\\
1
Po L2 3II,
4
2
M?
M?
1+ M
U 2 /J
U2\
1
2(1 +2M,)
- ( ^ - Ä " A - - ^ ) l o g ( l + 2ii,)l
\2ui 2ui w, ut/
J
(101.18)
The leading term in Ws for fairly small s is proportional to £2s.
Let us now consider the opposite case (Ç> 1). The parameter £ can be made
large, for instance, by decreasing the frequency o> with a fixed field strength;
evidently £ = eF/mco, where F is the amplitude of the field strength. It is therefore
clear that the case £ > 1 essentially refers to processes in a constant and uniform
field where E and H are orthogonal and equal in magnitude; this will be called a
crossed field. The probability of emission in this field can be found by taking the
limit f -> », but it is simpler to assume a constant field in the calculations, taking the
4-potential in the form
A^ = aM</>,
<>
/ = kx,
ak = 0
(101.19)
(so that F^v = k^av - kva^ = constant). The exact wave function of the electron in
this field is obtained
i au it; u by
uy substituting
duuoiiiuiiiig (101.19)
\ 1 u 1 . 1 7 ; in
111 (40.7),
y+xi. 1 )y(40.8):
y+\j.vj,
</>P = [ l + e (yk)(ya) 1 u(p) e x pf i.e (ap)
I S P T *J V(2^)
r m)
2
.
2
a2
3
.
1
* ' 6â^ * " T
IPJ
nft1 2 m
(10, 20)
-
The result given by using this function is exact for emission by an electron with
any energy in a crossed field. However, in the ultra-relativistic case this result
(when put in the appropriate form; see below) applies to emission by an electron
not only in a crossed field but in any constant and uniform electromagnetic field,
including a constant magnetic field as discussed in §90.
To formulate this assertion we note that the state of a particle in any constant
and uniform field is defined by as many quantum numbers as the state of a free
particle, and these may always be so chosen as to become, when the field is
removed, those of a free particle, i.e. its 4-momentum p* (p2 = m2). Thus the state
of a particle in a constant field is described by a constant 4-vector p.
The total intensity of emission, being an invariant, depends only on the
invariants which can be constructed from the constant 4-tensor FM„ and the
constant 4-vector p*\ Since FM„ can appear in the intensity only in combination with
the charge e, we obtain three dimensionless invariants:
X2 = -^(F^Pn2
f=
e\F^)2lm\
=
-^a2(kp)\
(101.21)
§101
Emission of a Photon by an Electron
455
In a crossed field / = g s 0, whereas in general all three invariants are non-zero. If
the electron is ultra-relativistic (po>m), however, and the vector p makes angles
6 >mlpo with the fields E and H, then # 2 > / , g (that is, for an ultra-relativistic
particle any constant field appears to be a crossed field for almost all directions p).
If also the fields |E|, \H\<m2le (= mV/eft), then [f|, | g | « l . t Under these conditions the intensity calculated for a crossed field and expressed in terms of the
invariant \ will apply also to the emission in any constant field.
The invariant \ *s given in terms of the fields E and H by
X2 = ^ { ( p x H + poE)2-(p-E)2}.
For a constant magnetic field, \ ls equal to the quantity (90.3), and the above
arguments are therefore another means of deriving the results in §90.t
t And p in \ may be regarded, with the same accuracy, as being the ordinary 4-momentum of the
particle.
t A detailed account of the theory of various processes in strong fields is given in the review papers
by A. I. Nikishov and V. I. Ritus in Proceedings (Trudy) of the P. N. Lebedev Physics Institute, Vol.
Ill, pp. 5 and 152.
CHAPTER XI
EXACT PROPAGATORS AND VERTEX PARTS
§ 102. Field operators in the Heisenberg representation
HITHERTO, in considering various specific processes in electrodynamics, we have
used only the first non-vanishing approximation of perturbation theory. We shall
now go on to discuss the effects which occur in higher approximations. These are
called radiative corrections.
A better understanding of the structure of the higher approximations can be
obtained by first examining some general properties of exact scattering amplitudes
(i.e. those which have not been expanded in powers of e2). We have seen in §72
that the successive terms of the series in perturbation theory can be expressed
in terms of the field operators in the interaction representation, whose time dependence is determined by the Hamiltonian H 0 of a system of free particles. The exact
scattering amplitudes, however, are more conveniently expressed in terms of the
field operators in the Heisenberg representation, where the time dependence is
determined by the exact Hamiltonian H = H0+ V of a system of interacting
particles.
The general rule for constructing the Heisenberg operators gives
4f(x) s i(t9 r) = eiririMr)<Tiiîr,
(102.1)
and similarly for <j>(x) and A(x), i//(r), etc., being time-independent (Schrödinger)
operators.t It may be noted immediately that the Heisenberg operators for a given
time obey the same commutation rules as the operators in the Schrödinger
representation or the interaction representation: for example,
{*(*, r)$k(U r% = eiA%(rl$k(rf)U-^
= y% S ( r - r');
(102.2)
cf. (75.6). Similarly, the operators (//(t, r) and A(f, r') commute:
rtar),A(t,r')}.
= 0,
but this does not hold good for operators pertaining to different times.
The "equation of motion" satisfied by the Heisenberg ^-operator can be
derived from the general formula QM, (13.7):
. iM*)
=
Hi(x) - i(x)H.
(102.3)
t In this chapter, operators with a time argument belong to the Heisenberg representation; those in
the interaction representation will be given the suffix int.
456
§102
457
Field Operators in the Heisenberg Representation
The Schrödinger and Heisenberg representations are the same as regards the
Hamiltonian, which is expressed in the same way in terms of the field operators.
Here, to calculate the right-hand side of (102.3), we may omit from the Hamiltonian
the part which depends only on the operator A(x) (the Hamiltonian of the free
electromagnetic field), since this part commutes with <£(*). According to (21.13) and
(43.3),
H = ( <£*(t, r)(a • p + ßm)4f(t> r ) d'x + e j $(t, r)(yÂ(U r))i£(t, r) d3x
= | $(t9 r){yp + m + e(yÂ(t, r))}${t, r) d3x.
(102.4)
When the commutator {H, i£(f, r)}_ is calculated from (102.2) and the delta function is eliminated by integration over d3x, we get
(yp - eyk - m)i£(t, r) = 0.
(102.5)
As we should expect, the operator t//(t, r) satisfies an equation which is formally the
same as Dirac's equation.
The equation for the electromagnetic field operator A(f, r) is obvious from the
correlation with the classical case. When that case applies, i.e. when the occupation
numbers are large (cf. §5), the operator equation must become the classical
Maxwell's equation for the potentials, Fields (30.2), after averaging over the state
of the field. It is therefore clear that the equation for the operator is simply the
same as Maxwell's equation, so that we have (for an arbitrary gauge)
a ^ Â ^ U ) - 3|iaMÂ"(x) = -4irej"(x),
(102.6)
where j"(x) = t//(x) yu tp(x) is the current operator, satisfying identically the equation of continuity
flj"(x) = 0.
(102.7)
It is important to note that the equations (102.6) are linear in AM and j*\ and the
question of the sequence of these operators therefore does not arise.
Like the similar equations for wave functions, the operator equations (102,6)
and (102.7) are invai.ant under the gauge transformation
A M (x)^A M (x)-a^(x),
tKx)^«Mx)e'*,
(102.8)
u
${x)->e- *$(x),
where ^(x) is any Hermitian operator which commutes (at a particular time) with
t This refers specifically to the Heisenberg (//-operators. In the interaction representation, the gauge
transformation of the electromagnetic potentials does not affect the (//-operators.
458
Exact Propagators and Vertex Parts
§102
Let us now ascertain the relationship between the operators in the Heisenberg
representation and those in the interaction representation. To simplify the discussion, it is convenient to make the formal assumption (which will not affect the
final result) that the interaction V(0 is adiabatically "switched on" from t - - » to
finite times. Then the Heisenberg and interaction representations are the same for
f-»-oo, and the wave functions of the system, <ï> and <î>in,, are the same:
<*>int(* = - « ) = <*>•
(102.9)
But the wave function in the Heisenberg representation is independent of time
(since the whole of the time dependence is in the operators); in the interaction
representation, the time dependence of the wave function is given by (72.7):
«MO = S«, — y M - « ) ,
(102.10)
Ê(h, r,) = T exp{-i | V(t') «ft'},
(102.11)
where
and the following properties of È are obvious:
S(r, r,)S(t,, to) = S(t, to), |
Ê~\t, U) = $(*,, t). J
(102.12)
Comparison of (102.10) and (102.9) gives
<*U0 = $(f, "«>)<*>
(102.13)
as the relation between the wave functions in the two representations. The operator
transformation formula is similarly
«Kt, r) = $-'(!, -»)**(*, r)Ê(t, -oo)
= S(-«,r)«Mr,r)S(r,-°o),
(102.14)
and likewise for «Ä and À.
One further general remark may be added. It has already been mentioned more
than once that, in relativistic quantum theory, the physical significance of the field
operators is very limited because the zero-point fluctuations are infinite. This is
even more true of operators in the Heisenberg representation, which contain also
divergences due to the interaction. In this chapter, §§102-109 deal with the formal
theory, which ignores the question of eliminating these singularities and which
treats all quantities as if they were finite. The results thus obtained have mainly
heuristic value: they lead to a fuller understanding of the significance of the
expansions given by perturbation theory, and they may also remain valid in some
form in a future theory which is free from the present difficulties.
§103
The Exact Photon Propagator
459
§103. The exact photon propagator
The concepts of exact propagators play a central role in the formalism of the
exact theory (i.e. without expansion in powers of e2).t
The exact photon propagator (denoted by the script letter 2)) is defined by
2>M„(X -
x9) = i<0|TAM(x)A,(x/)|0),
(103.1)
where AM(x) are Heisenberg operators, in contrast to the definition (76.1):
D^(x - x') - i<0|TAi;t(x)Alnt(x')|0),
(103.2)
in which the operators in the interaction representation were used. The function
(103.2) may be called the free (or bare)-photon propagator to distinguish it from
the exact propagator (103.1).
Since the mean value in (103.1) cannot be exactly calculated, it is impossible to
obtain an exact analytical expression for 3)^ although the definition does lead to
some general properties of this function, as will be discussed in §111; here we shall
consider the calculation of Q)^v by perturbation theory, using the diagram technique. For this purpose, we must express 2)^ in terms of the operators in the
interaction representation.
in the interaction representation.
First, let t > t'. Using the relationship between A(x) and Âint(x) (cf. 102.14)), we
can write
%Âx - x') = f<0|AM(x)A„(x')|0>
= i<0|S(-oo, t)Ai\x)S(U -oo)S(-oo, t') x
xAtnt(x')S(t',~oo)|0>.
According to (102.12) we can make the substitutions
S(r,-oo)^(-oo,o = ^(t,r),
S(-oo,o-S(-co,+oo)SKt).
Then
%Ax - x') - WIST^SK t)AlMnt(x)S(t, OAtnt(x')S(t', -°o)]|0>,
(103.3)
S^S(+oo,-oo).
(103.4)
with
Since, according to the definition (102.11), S(t2y t\) includes only operators for times
t These concepts were introduced by F. J. Dyson (1949), who also developed essentially the whole
of the treatment given in this chapter.
460
Exact Propagators and Vertex Parts
§103
between tx and t2 arranged in chronological sequence, it is evident that all the
operator factors in the brackets in (103.3) are in order of decreasing time from left
to right. If the time-ordering symbol T is placed before the bracket, we can
rearrange the factors in any manner, since the operator T will automatically put
them in the necessary order. Then we write the bracket as
[•••] = T[A,;t(x)Alnt(x')S(oc, t)S(U t')S(t\ -co)]
= T[Â'T(x)ÂT(xf)Sl
Thus
9)*Âx ~ x') = i(0\S-lT[A?\x)A™\x')S]\0).
(103.5)
It is easily shown by a similar argument that this formula is also valid if t < t'.
We shall now prove that the factor S_1 can be taken outside the averaging over
the vacuum to form a phase factor. To do so, we recall that the Heisenberg vacuum
wave function $ is the same as the value 4>int(-°°) of the wave function of the same
state in the interaction representation (see (102.9)). From (72.8),
S<Dint(-oo) s S(+oc, -oo)<l>int(-oc) = <Dint(+oo).
The vacuum is a strictly stationary state, in which no spontaneous processes of
particle generation can occur. In other words, in the course of time the vacuum
remains the vacuum; this means that 4)int(+00) can differ from 4>int(-oc) only by a
phase factor eia. Hence
S4>im(-oo) = eIÄ4>int(-oo) = (0|S|0)<ï>int(-oc),
(103.6)
or, taking the complex conjugate and using the unitarity of the operator S,
^U-^)S-l = (o\s\oyl^u^l
Hence it is clear that (103.5) can be written
^(*-*') =.<0'T<^(',)S'°>
(.03.7,
Substituting in the numerator and the denominator the expansion (72.10) for S
and averaging by means of Wick's theorem (§77), we get an expansion of Q)^ in
powers of e2.
In the numerator of (103.7), the quantities to be averaged differ from the matrix
elements of the type (77.1) only in that the "external" photon creation and
annihilation operators are replaced by ™\x) and ÂT(x'). Since all the factors in the
products to be averaged are preceded by the time-ordering symbol, the pairwise
contractions of these operators with the "internal" operators A,nt(xj), Xmt(jc2),... will
give the photon propagators D^v. Thus the results of the averaging are expressed by
sets of diagrams with two free ends, constructed in accordance with the rules in §77,
§103
The Exact Photon Propagator
461
except that propagators DMV, not the amplitudes e of real photons, correspond to
external (and internal) photon lines. In the zero-order approximation, with S = 1, the
numerator of (103.7) is simply D ^ ( x - x ' ) . The next non-zero terms will be
of the order of e2. They are represented by a set of diagrams having two free ends and
two vertices:
C7>-
+
(103.8)
The second of these diagrams consists of two disconnected parts: a broken line
(corresponding to -iD^) and a closed loop. The separation of the parts of the
diagram signifies that the corresponding analytical expression separates into two
independent factors. On adding to the diagrams (103.8) the zero-order approximation diagram (a single broken line) and "taking it outside the brackets", we
find that the numerator in (103.7) is, as far as second-order terms,
—{.-o}-
—o-
The expression <0|S|0> in the denominator of (103.7) is the amplitude of the
"transition" from the vacuum to the vacuum. Its expansion therefore contains only
diagrams without free ends. In the zero-order approximation, (0|S|0) = 1, and as far
as second-order terms we have
I'-OI
When the numerator is divided by the denominator we get, to the same order, the
expression
Thus the diagram with the detached loop does not occur in the result. This is a
general theorem. Having regard to the way in which the diagrams are constructed
which correspond to the numerator and denominator in (103.7), we can easily see
that the role of the denominator (0|S|0) is simply to ensure that in all orders of
perturbation theory the exact propagator 2^,, will be represented only by diagrams
which do not contain separated parts.
The diagrams with no free ends, forming closed loops, have no physical
significance and need not be taken into account, quite apart from the fact that they
disappear when the propagator 3) is formed. Such loops represent radiative
corrections to the diagonal element of the S-matrix for a vacuum-vacuum transition; but, according to (103.6), the sum of all these loops, together with the unity
§103
Exact Propagators and Vertex Parts
462
given by the zero-order approximation, gives only an unimportant phase factor,
which cannot affect any physical results.
The change from the coordinate representation to the momentum representation
is made in the usual way. For example, in the second-order approximation of
perturbation theory, the propagator -i'2)MI,(k), which will be shown by a thick
broken line, is the sum
p+k
-*_ ~~
k
,_ + — r < ^ > — , —
k
k
V ^ X
P
(1019)
k
in which all the diagrams are calculated by the general rules given in §77 except
that factors -iD^ik) are assigned to the external as well as the internal photon lines.
In analytical form, we therefore havet
2W(k) - D^(k) + ie2D.k(k) j tr yK G(p .+ k)ypG(p) ^
Dpv(k);
(103. 10)
the bispinor indices of the matrices y and G are, as usual, omitted.
The terms in subsequent approximations are constructed in a similar manner,
and are represented by sets of diagrams having two external photon lines and the
appropriate number of vertices. For example, the terms in e4 correspond to the
following four-vertex diagrams:
-o-o-
(103.11)
The diagram
0
also has four vertices; its upper part is a loop formed Joy a single "self-closed"
electron line. Such a loop corresponds to the contraction $(x)y<fr(x), i.e. to the value
of the current averaged over the vacuum: (0|j(x)|0). But, by the definition of the
vacuum, this quantity must be zero identically, and the identity cannot of course be
altered by any further radiative corrections to such a loop.t Thus no diagrams
having "self-closed" electron lines need be considered in any approximation.
t The factor -1 from the closed electron loop must be taken into account when deriving the signs.
t Although a direct calculation from the diagrams would lead to divergent integrals.
§103
The Exact Photon Propagator
463
The part of a diagram which lies between two (external or internal) photon lines
is called a photon self-energy part. In the general case, it can itself be divided into
parts joined in pairs by a single photon line, i.e. it has a structure of the form
0"-0~- .... -rO
where the circles denote parts which cannot be further subdivided in the same
manner; such parts are said to be compact or proper. For example, the first three
of the four fourth-order self-energy parts (103.11) are compact.
Let i&pJATT denote the sum of the infinity of compact self-energy parts. The
function ^ ( k ) is called the polarization operator. When the diagrams are classified
by the number of compact parts which they contain, the exact propagator 2)M„ can
be put in the form of a series
where i&^JAir corresponds to each shaded circle. The analytical form of this series is
op
aj>
g>
a=D+D£D+D£D_D+...
= D{1+£[D
+
D£D
+
...]},
(103.12)
where the indices are omitted, for brevity. The series in the brackets is again 2i.
Hence
^„(k) = DM„(k) + D ^ ( k ) £ ^ 9,,,(k).
(103.13)
Multiplying this equation on the left by the inverse tensor (D~l)Tfl and on the right
by (2>_1)w, and renaming the indices, we get the equivalent form
2'\v
= D~\v-j^^v.
(103.14)
It must be emphasized that writing 3) in the form (103.12) assumes that the
diagrams can be broken down into simpler parts calculated by the general rules of
the diagram technique, and that the combination of such parts gives the correct
expressions for the entire diagrams. The admissibility of this breakdown of the
diagrams is an important and by no means trivial feature of the diagram technique,
which arises from the fact that the overall numerical factor in the diagram does not
depend on the order of the diagram.
464
Exact Propagators and Vertex Parts
§103
The same property enables us to use the function 2> (assumed known) to
simplify the calculations of the radiative corrections to the amplitudes of various
scattering processes: instead of treating afresh each time the diagrams with
different corrections to the internal photon lines, we can simply make these lines
thick, i.e. assign to them the propagators 9) (instead of D) in the appropriate
approximation.
If the photon line corresponds to a real and not a virtual photon, i.e. if it is a free
end of the whole diagram, the application to it of all the self-energy corrections
gives what is called an effective external line. It corresponds to the expression
obtained from (103.13) by replacing the factor D by the polarization amplitude of the
real photon:
M
9pX(k)
+ a^OO—r^ex.
4TT
(103.15)
For an external-field line, e» in this expression is to be replaced by Ajf*.
The discussion in §76 of the tensor structure and the gauge non-uniqueness of
the approximate propagator D^„ applies to the exact function äV„ also. Considering
only the relativistically invariant representations of this function, we can write it in
the general form
%Âk) = 2>(k2) (&, -*jj£) + 2 > ( ' W ) ^ ;
(103.16)
the first term corresponds to the Landau gauge, and in the second term 2>(0 is a
gauge-arbitrary function. The corresponding form of the approximate propagator! is
D^fc) = Dik2)^
- ^ ) + D ( 'W) ^ .
(103.17)
The longitudinal part 2>(0 of the propagator is related to the longitudinal part of
the potential 4-vector, whieh has no physical significance. It is therefore not concerned in the interaction and is unaffected by the latter, so that
2>(,)(k2) = D(,)(k2).
(103.18)
The inverse tensors must, by definition, satisfy the equations
Q}-1^"
=
fij,
D~\JDX* = «J.
When the original tensors have the form (103.16) or (103.17), the inverse tensors
are, from (103.18),
(103.19)
D ^ = D\g^-'tr)
t In this formula D <0 is not the same as in (76.3).
+
Wkr-
§104
465
The Self-energy Function of the Photon
From these, it follows that the polarization operator 0V„ is a transverse tensor:
^
= W2)(g^-^),
(103.20)
where 9 = k2-4-n72), or
Air
®(k)~k2[l-W>OT
(103.21)
Thus the polarization operator, unlike the photon propagator itself, is a gaugeinvariant quantity.
§ 104. The self-energy function of the photon
In order to examine further the analytical properties of the photon propagator,
it is useful to define, as well as the polarization operator, another auxiliary function
^„(Jc), called the self-energy function of the photon: iH^JAir is defined as the sum
of all self-energy photon parts (not only the compact ones). If this sum is
represented in the diagram by a square, we can write the exact propagator as the
sum
k
+
k
k
i.e.
'\LV ^
*-VA
(104.1)
4ir
Hence, expressing U^p as
and substituting (103.16) and (103.19) followed by (103.21), we get
IV = n(k2) fa, - ^ ) ,
n = j^ip.
(104.2)
Thus n ^ like 9^ is a gauge-invariant tensor.
The usefulness of U^ arises from the expression for it in the coordinate
representation. This is easily found by noting that the equation
4ir
n^(k) = D-\xD-lpv{®Kp(k) ~ Dk>{k)},
466
§104
Exact Propagators and Vertex Parts
in which the tensor 2)Kp-DKp
coordinate representation
is transverse by (103.18), can be written in the
n^(x -x') = j - (d^K - g^dnWp
- g„pW){2)Ap(x - x') - DKp(x - x')}.
In order to carry out the differentiation, we must substitute
25Ap(x - x') - DAp(x - x') = i<0|TAA(x)Ap(x') - TAAnt(x)Afnt(x')|0>.
(104.3)
In §75 we have seen that the differentiation of a T product generally demands
caution, because the product has discontinuities. But the difference that is to be
averaged in (104.3) is continuous, and so are its first derivatives, since the commutation rules are the same for the components of the operators AA(x) and AAnt(x)
for a given time, and the corresponding discontinuities cancel out (cf. §75). The
difference in (104.3) may therefore be differentiated under the symbol T. According
to (102.6) and the corresponding equation with zero on the right for the free
electromagnetic field operators A£t(x), the result is
nM,(x - x') = 47rf62<0|TjV(x)jV(x')|0).
(104.4)
This shows explicitly the gauge-invariance of nM„, since the current operators are
gauge-invariant.
From (104.4) we can derive an important integral form of this function.
According to (104.2), it is sufficient to consider the scalar function II = \ll*. In the
coordinate representation,
n(x - x') = ^f
4TT
(62<0|TJV(X)J-(X')|0)
.
2
2 <0U»(x>l»)<«U"(*')10> for ( > « ' ,
n
2<0|U*')|n><n|r(x)|0>
for t < t',
(104.5)
where n labels the states of the system electromagnetic field + electron-positron
field.t Since the current operator j(x) depends on x*4 = (r, r), its matrix elements
also depend on x. The relationship can be found explicitly by taking as the states
\n) states which have definite values of the total 4-momentum.
The time dependence of the current matrix elements, like that of any Heisenberg operator, is given by
<n|r(r,r)|m) = <n|r(r)|m)e - i ( E
m-£B)l
where E„ and Em are the energies of the states \n) and |m>, and /(r) is the
Schrödinger operator.
t The current operator conserves charge; hence the states |n) in (104.5) can contain only the same
numbers of electrons and positrons.
§104
The Self-energy Function of the Photon
467
To determine the coordinate dependence of the matrix elements, we consider
the operator /(r) as being the result of transforming the operator j(0) by a parallel
translation over the distance r. The operator of this translation is exp(ir • P), where
P is the total momentum operator of the system (see QM, (15.15)). Using the
general rule for the transformation of the matrix elements (see QM, (12.7)), we
therefore have
<n|r(r)|m> = <n|e- i r -V(0)e i r >>
= <n|j'1(0)|m)ei(P"-,,-)r.
Together with the previous formula, this gives finally
(n\j»(t, r)|m) = (n\j»(0)\m)e-iiP"-p")x.
(104.6)
The matrix (n|jM(0)|m) is Hermitian, like the matrix (104.6) of the entire operator
/•*(!, r), and according to the equation of continuity (102.7) it satisfies the transversality condition
(Pn-Pmr<n|]V(0)|m> = 0.
(104.7)
Let us now calculate the function Il(x-x'). Substitution of (104.6) in (104.5)
gives
n(O = ^It<0\U0)\n)(n\n0)\0)e^i
for T ^ 0 ,
(104.8)
<0|j,(0)|n><0|r(0)|n>*5(4)(k - P„).
(104.9)
where x - x' = Ç = (T, £). We use the notation
p(k2) = - ^
(2TT)3 2
The sum is taken over all systems of real electron-positron pairs and photons that
can be generated by a virtual photon having 4-momentum k = (cu, k) (o> > 0), and for
each such system there is summation over the internal variables (the polarizations
and momenta of the particles in the centre-of-mass system).t After this summation,
the function p can depend only on k, and since it is a scalar it can depend only on
k2. In particular, it does not depend on the direction of k. Using these properties of
p, we can rewrite (104.8) as
œ
n<0 = -i $ do, | ( 0 p ( k 2 ) « i M " M T |
0
0
0
t This definition of the states \n) is evidently identical with their definition as states for which the
matrix elements <0|j|n) of a charge-odd operator are non-zero.
468
§105
Exact Propagators and Vertex Parts
The momentum representation is obtained by substituting
e-^
= 2ico ( €-**-&
ko-o)
J
Vr^T^
(104.10)
+ i0 27T
(see §76); the result is
U(k2) = j difji2) j d(<o2)8(fi2 + k2 - co2)
0
k
i ^ \
i 0
,
0
or, finally,!
0
The coefficient p in this integral form is called the spectral density of the
function n(k 2 ), and has the properties
p(k2) = 0 for
k2<0,|
p(k2)>0
k2>0, J
for
(104.12)
since the 4-momentum k of a virtual photon which can generate a system of real
particles must necessarily be time-like; k2 is equal to the square of the total energy
of the particles in their centre-of-mass system. The transversality condition (104.7)
gives
PS<0|j M (0)|n>-0.
The 4-vector <0|j|n> is orthogonal to the time-like 4-vector P„ and must be
space-like:
<0|jV(0)|n><0|j-(0)|n>* < 0 ;
thus, from the definition (104.9), p > 0 .
§ 105. The exact electron propagator
The exact electron propagator, similarly to that of the photon, is defined by
%k(x-xf)^-i(0\T^(x)Mxf)\0)
(105.1)
t The formal calculations analogous to those given above require caution, on account of the
presence of the divergences previously mentioned. These give rise, in particular, to thé occurrence on the
right of (104.11) of further divergent terms which do not have an explicitly relativistically invariant form,
called Schwinger terms. They will not be written out here, since they in any case disappear on
renormalization (§110) and do not affect the subsequent results.
§105
The Exact Electron Propagator
469
where i and k are bispinor indices, which differs from the definition (75.1) of the
free-particle propagator
Gik{x - x') = -i<0|T^(jt)*jr(x')|0>
(105.2)
in that the ^-operators in the interaction representation are replaced by Heisenberg
operators.
The same arguments as were used to derive (103.7) lead to
*(x-x,--iW^g*m.
(105 .3)
The expansion of this expression in powers of e2 puts the ^ function in the form of
a set of diagrams with two external electron lines and various numbers of
vertices. The denominator in (105.3) again has the function of retaining only the
diagrams which do not have detached "vacuum loops". For example, as far as the
terms in e4, the graphical representation of the propagator <S (denoted by a thick
continuous line) ist
The thick continuous line corresponds to the function fflip) in the momentum
representation, and the sets of continuous and broken lines in the diagrams on the
right of the equation correspond to the free-particle propagators iG and -iD
respectively.
The section between two electron lines is called an electron self-energy part. As
with the photon, it is said to be compact if it cannot be further subdivided into two
self-energy parts by cutting a single electron line. The sum of all possible compact
parts will be denoted by -iMik; the function Mik(p) is called the mass operator. For
example, as far as the terms in e4,
t It has already been shown in §103 that there is also no need to take account of diagrams which
contain "self-closed" lines; these would here appear in the second order:
9
470
Exact Propagators and Vertex Parts
§105
By a summation exactly similar to the derivation of (103.13), we find
«(p) = G(p) + G(p)M(p)(S(p)
(105.6)
(omitting the bispinor indices) or, for the inverse matrices,
<ê-\p) =
G-\p)-M{p)
(105.7)
= yp-m-M(p).
It has already been noted in §102 that the Heisenberg ^-operators (unlike those
in the interaction representation) are altered by a gauge transformation of the
electromagnetic potentials. The exact electron propagator ^ is therefore also not
gauge-invariant. Its gauge transformation behaviour may be derived as follows (L.
D. Landau and I. M. Khalatnikov, 1952).
The change in ^ under the gauge transformation must evidently be expressed in
terms of the same quantity D(,) as is added to the photon propagator by this
transformation. This is clear, since in the calculation of <S by the perturbationtheory diagrams each term of the series is expressed in terms of the functions D,
and no other electromagnetic quantities are involved. The analysis can therefore be
simplified: any special assumptions can be made regarding the properties of the
arbitrary operator x in the transfromation (102.8), provided that the result is
expressed in terms of D (0 .
The transformation (102.8) brings the propagators 2> (103.1) and ^ (105.1) into
the following forms:
2V„-i<0|T[AM(x)- dMx)][Av(x')-
d'vX(x')]\0h l
(105.8)
We shall now suppose that the operators x a r e averaged independently of all
the remaining operators in the T product. This is a reasonable assumption, since
the "field" x takes no part in the interaction, because of the gauge invariance. We
also assume that the mean value, over the vacuum, of the operator x is zero:
(0|x|0) = 0. Then the terms in x m (105.8) can be separated, and the result is
%, -» %v + i{0\TdßX(x) • a^(x')|0>,
3* -♦ "SMTe^^e-^^lO).
(105.9)
(105.10)
The rest of the derivation will be given for the case of an infinitesimal
transformation, and we shall emphasize this by writing 8% in place of xThe transformation (105.9) may be writtent (independently of the smallness of
t Formula (105.11) can be derived from (105.9) if the function d(l) and its derivative with respect to t
are continuous at t = t'\ if they are discontinuous, the right-hand sides of these expressions differ by
delta-function terms (cf. the derivation of (75.2)). In the momentum representation, this condition is
equivalent to assuming that d('\q) decreases more rapidly than llq1 as \q\-*°°-
§105
The Exact Electron Propagator
471
%v -* %v + 5 ^ , , ô9M„ = d^d[dil\x - x'),
(105.11)
d(0(x - x') = i<0|TÔ*(x)ôx(x')|0>.
(105.12)
Ô*)as
where
Hence it is clear that d(,) determines the change caused by the gauge transformation
in the longitudinal part 2)(,) of the photon propagator. The assumption that d(,)
depends only on x - x ' implies, of course, a certain limitation on the properties of
the operator Ox ; in the general case of a completely arbitrary gauge transformation,
the propagator may cease to be homogeneous in space and time.
In the transformation (105.10), we expand the exponential factors in powers of
8\ as far as the quadratic terms:
(0|Te^(at)e"^(x)|0)
~ -ie2<0|S*2(x) + 5X2(x') - 2T8*(x)Sx(x')|0>.
Using the definition (105.12), we thus find the following transformation rule for the
electron propagator:
<S - « + 8% m = ie2(S(x - x')[d(0(0) - d(,)(x - x')].
(105.13)
In the momentum representation,! we have
ô<S(p) = ie 2 1 d{\q)mP)
- <S(p ~ <?)] 0f>
(105.14)
d°\q) is related to the change in the function 2>(0 by
8Q>l,Kq) = q2dl,)(q).
t If the function f(x) = f\{x)ft(x), its Fourier components are
f(p) = jf(x)ei'xd*x
= / / / fx&Jp
*«>-«-« >/■(«■)/:(*)
-// ^r8<4,(p "qi " *>'«<«•>«*>
In deriving (105.14) from (105.13), we also use the result
/(x = 0 , . | / ( , ) ^ , .
(105.15)
472
Exact Propagators and Vertex Parts
§106
An integral expression analogous to (104.11) could be derived for the electron
propagator, using the expressions
^m(x) = ^m(0)e- |(P «- p - ) *
(105.16)
for the matrix elements of the (//-operator, similarly to the expressions (104.6) for
the current matrix elements. Unlike the current, however, the i/z-operators are not
gauge-invariant. The coordinate dependence (105.16) is therefore not general, but
applies only to some particular gauge. The same is true as regards the integral
representation based on (105.16). The deeper physical reason for this situation is
that the zero photon mass leads to the infra-red catastrophe (§98). In consequence,
the electron emits an infinite number of soft quanta during the interaction, and this
means that the "single-particle" propagator (105.1) loses much of its direct
significance.
§ 106. Vertex parts
In complicated diagrams it is possible to distinguish both self-energy parts and
sections of another type which are not equivalent to them. An important class of
such sections is found by considering the function
Krk(xi,X2,x3) = <0|TA^(x,)^(x2)Ä(x3)|0>
(106.1)
which has one 4-vector index and two bispinor indices; since space-time is
homogeneous, this function depends only on the differences of the arguments xu
xi, x3. When expressed in terms of the operators in the interaction representation,
the function K has the form
Ktoux^-WMMtwnxtsw.
(1062)
The momentum representation is obtained by using the formula
(27T) 4 Ô ( 4 ) ( P I
+ k- p2)KMP2, P.; k) = HI
KUxi, x2, x3)e-fa-+<Prt-*»>*3 d% d4x2 d%.
(106.3)
In the diagram technique, the functions Kfc correspond to three-ended (one
photon and two electron) sections of the form
I
§106
473
Vertex Parts
where the momenta are related by the conservation law
Pi + k = p2.
(106.5)
The zero-order term in the expansion of this function is zero; the first-order
term is
K^Od, x2, x3) = e J G(x2 - x)yvG{x - x3) • Dm(x] - x) d4x
in the coordinate representation, and
KHP2, PI; k) = eG(p2)y„G(p,) • D^(k)
(106.6)
in the momentum representation (omitting the bispinor indices); the corresponding
diagram is
I
(106.7)
X
P2
P,
In the subsequent approximations, the diagrams are complicated by the addition
of new vertices, but not all such diagrams provide essentially new information. For
instance, in the third order we have the diagrams
0
!
«
1
(106.8)
/ \
The first three of these can be cut (across one photon or electron line) into a simple
vertex (106.7) and a second-order self-energy part; the fourth diagram cannot be
thus treated. This is a general situation. The corrections of the first kind simply
replace the factors G and D in (106.6) by the exact propagators <S and 2>. The
remaining terms in the expansion give a new quantity to replace the factor y* in
(106.6). Denoting this quantity by P \ we thus have by definition
K*(p2, P.; k) = {i«(p2)[-ier.(p2, pr, k)]i«(p,)}[-i3^(k)].
(106.9)
A section joined to other parts of the diagram by one photon line and two
electron lines is called a vertex part if it cannot be divided into parts joined by only
one (electron or photon) line. The quantity T'1 is the sum of an infinity of vertex
parts, including the simple vertex 7M, and is called a vertex operator or vertex
junction.
474
Exact Propagators and Vertex Parts
§106
The following are all the vertex-operator diagrams as far as the fifth-order
quantities:
(106.10)
the black dot denoting the exact vertex operator -ieT.
The operator T (like the operator 7 of the simple vertex) has two matrix
(bispinor) indices and one 4-vector index; it is a function of two electron momenta
(Pi, Pi) and one photon momentum (k). The three momenta cannot all relate to real
particles simultaneously: the diagram (106.4) in itself (not as part of a larger
diagram) would correspond to the absorption of a photon by a free electron, but
this process is incompatible with the conservation of the 4-momentum of real
particles. Hence at least one of the three free ends of the diagram must pertain to a
virtual particle (or to an external field).
The vertex parts may also be classified as reducible and irreducible. The
irreducible ones are those which do not contain self-energy corrections to internal
lines and in which it is not possible to separate parts which constitute (lower-order)
corrections to internal vertices. For example, of the diagrams in (106.10), the only
irreducible ones are (b) and (d) (apart from the simple vertex (a)). Diagrams (g), (h)
and (i) contain self-energy parts; in diagram (c) the upper broken horizontal line
may be regarded as a correction to the upper vertex, and in diagrams (e) and (f) the
lateral broken lines may be regarded as corrections to the lateral vertices.
When the internal lines in irreducible diagrams are replaced by corresponding
thick lines, and the vertices by black dots, i.e. when the approximate propagators D
and G are replaced by the exact propagators 2 and % and the approximate vertex
operators y by the exact ones I\t we evidently obtain the set of all vertex parts.
Thus the expansion of the vertex operator may be written
ÀvVÀ*À
(106.11)
This equation is an integral equation for I\ with an infinity of terms on the right.
From the above discussion we can easily derive the general principle of
construction of the exact expressions for sections having any number of ends.
t The resulting diagrams are called skeleton diagrams.
§106
475
Vertex Parts
They are obtained as vacuum mean values of T products of Heisenberg operators,
with one operator <//(x) for each initial electron, one i//(x) for each final electron,
and one A(x) for each photon.
A further example is given by diagrams of the form
(106.12)
.
^
with four external electron lines. These are obtained from the function
Klk,,m(x,, x2; xj, x4) = <0|T^Ui)^(x2)^(x3)«Âm(x4)|0),
(106.13)
which, of course, depends only on the differences of the four arguments. Its
Fourier components may be written
d4x, d4x2 d4x3 d4x4
| KlMm(x„ x2; X3, xt)e^x^-p^-^
= (27T)4ô(4)(p, + p2-Pi-
p4)JCik,lm(p3, p 4 ; Pi, p2),
(106.14)
with
Kik,lm(Pl, P4ÎPl,P2)
= (27T)4ô(4)(p, " P3)^l(P.)^m(P2) ~ (27T)4Ô(4)(p2 - P 3 )^(Pl)^l(p 2 ) +
+ ^in(Pi)%r(P4)[-irnr,APi,
PA\ P., P2)]^.(Pl)^m(P2).
(106.15)
In the latter expression, the first two terms exclude from the definition of the
function r(p 3 , p 4 ;pi,p 2 ) diagrams which fall into two disconnected parts, each
having two free ends:
PJ——(
J
•
Pi
P<
*
P}
* (
(
J
"
Pi
or
P<
' f
J
-
Pi
)""* P?
In the third term the ^ factors exclude from the definition of T those parts of
diagrams which are corrections to external electron lines.
From the properties of the T product of the Fermi ^-operators, it follows that
the functions T(p3, p4; p,, p2) are antisymmetric:
TiMmtes, P<» Pi» P2) = - r k i , i m ( p 4 , P3', Pi» P2)
= -r,k,mi(p3, p4; Pa, Pi).
(106.16)
If the momenta pu Pi, P3» p4 correspond to real particles, the non-separating (i.e.
476
Exact Propagators and Vertex Parts
§107
connected) diagrams (106.12) represent the scattering of two electrons. The scattering amplitude is found by assigning the wave amplitudes of the particles (instead
of the propagators <ê) to the free ends of the diagram:t
iMfi = üi(pi)ük(p4)[-ierikjm(p), p4\ pu p2)]ul(px)um(p2).
(106.17)
According to (106.16) this amplitude must have the appropriate antisymmetry with
respect to interchanges of electrons.
§ 107. Dyson's equations
The exact propagators and the vertex part satisfy certain integral relations, the
origin of which is particularly clear if the diagram technique is used.
The concepts of reducibility and irreducibility defined in §106 can be applied not
only to vertex parts but also to any other diagrams or parts thereof. Let us consider
from this aspect the compact self-energy electron diagrams.
It is easily seen that only one diagram out of this infinity is irreducible, namely
the second-order diagram
Any complication of this diagram can be regarded as the application of further
corrections to its internal (electron or photon) lines or to one of its vertices. Here it
is important to note that, owing to the obvious symmetry of the diagram, any
vertex correction need by assigned only to either one or the other vertex.*
Since, therefore, only one of the compact self-energy electron parts is irreducible, the ensemble of all such parts (i.e. the mass operator M) is represented by only
one skeleton diagram:
(107.1)
t It will be seen later (§110) that the self-energy parts in the free ends can be ignored in deriving the
amplitudes of real processes.
X For clarity, it should be emphasized that, although all the required diagrams are found by applying
corrections to only one vertex, for any particular diagram the structure of the correction section in
general depends on the vertex to which it is assigned, for example
where, in identical diagrams, the squares enclose sections which form the vertex part when it is assigned
to theright-handand left-hand vertices respectively.
§107
Dyson's Equations
All
In analytical form, this graphical equation becomest
^(p) = G- 1 (p)-^- 1 (p)
= -ie2 $ y'<S(p + k)P-(p + k,p;k)- %Ak)0p.
(107.2)
A similar expression can be derived for the polarization operator 0*. Again only
one of the compact self-energy photon parts is irreducible, and 0* is therefore
represented by a single skeleton diagram:
~(g>~ = -T<f>V
(107.3)
The corresponding analytical equation is
^ ^ = D"V,(k)-3"U(k)
= ie2 tr j y^(p + k)Fv(p + k,p; k)<S(p) ^ ;
(107.4)
the bispinor indices are omitted from (107.2) and (107.4).
The relations (107.2) and (107.4) are called Dyson's equations; they can also be
obtained by direct calculation. For example, to derive (107.2) we consider the
quantity
(yp - mh<Sik(x - x') = -i(yp - m)li(0|T(///(x)^(x')|0),
where p = id is the operator of differentiation with respect to x, which is found
from (102.5) in exactly the same way as was done when deriving (75.7) for the
free-particle propagator. The result is
(yp - m)n<Sik(x - x')
= -iey^0\TAAx)^(x)^k(x')\0) + 8ik8{4)(x - x');
the delta-function term on the right is the same as in (75.7), since the commutation
properties at t = t' are the same for «//-operators in the Heisenberg and interaction
representations. The first term is -ieyvKik(x9x,x')9 and we can thus write (again
omitting the bispinor indices)
(yp - m)cS(x - x') = -iey^K^x, x, x') + 5(4)(JC - x').
(107.5)
To obtain the Fourier components we note that, if the definition (106.3) is
t If the exact vertex part is assigned to the left-hand vertex in (107.1), the factors y and T are
interchanged in (107.2). The two forms of the equation are, of course, essentially equivalent.
478
§108
Exact Propagators and Vertex Parts
4
4
integrated over d kd p2l(27T)*, the result is
j K»(p + k, p ; k) 0f
= | K"(0,0, xl)e-{^ d4x3
= ! KM(x, x, x')eip{x~x,] d\x - x'),
(107.6)
from which it is seen that the integral on the left is the Fourier component of
KM(x, x, x'). Thus, by taking the Fourier components of both sides of (107.5), and
using the definition (106.9) and the formula yp - m = G~\p), we find
G-\PmP)
= 1 - ie2 $ y"<S(p + k)T-(p +fc,p ; k)<§ip)D»¥{k) 0p.
Finally, multiplying this on the right by ^ ( p ) , we obtain (107.2).
§ 108. Ward's identity
Another relationship between the photon propagator and the vertex part,
simpler than Dyson's equation, follows from gauge invariance. To derive, it, we
apply the gauge transformation (102.8), assuming that x(x) s 8x(x) is an infinitesimal non-operator function of the 4-coordinates x. Then the change in the electron
propagator is
fi«(x, x') - ie<S(x - x')[8*(x) - 8x(x')l
(108.1)
Note that this gauge transformation violates the homogeneity of space-time, and
the function d^ depends on the arguments x and xf separately, not only on the
difference x - x \ Its Fourier expansion must therefore be made in the variables x
and x' separately. Thus, in the momentum representation 5<S is a function of two
4-momenta:
ô^(P2, p,) = f f 5^(x, * VP2X"iPlJC dAx d4xl
Substituting (108,1) and integrating over d4xd4£ or d4£d4x' (£ = x-x')> we get
8«(P + q, p) = ieôx(q)mp) - «(p + q)].
(108.2)
With the same gauge transformation, the operator A„(x) is augmented by the
function
SA(;)(x) = - ^ r 5 ^ ,
(108.3)
which may be regarded as an infinitesimal external field. In the momentum
§108
479
Ward's Identity
representation,
OA{;\q)=iqßoX(q).
(108.4)
The quantity S^ can also be calculated as the change in the propagator under the
action of this field. As far as quantities of the first order in 8\, this change can
evidently be represented by a single skeleton diagram:
p+q
L
p
The thick broken line is the effective external-field line, corresponding to the factor
(see (103.15))
SAi\q) + ÔA[e\q) ~^-
2»^(q).
The 4-vector 8A('\q) is longitudinal (with respect to q) and the tensor ^A" is
transverse. The second term is therefore zero, leaving
I
p+q
P
where the thin broken line corresponds, in the usual manner, to the field 8A{e)
simply. In the analytical form,
S<3 = e%p + q)F*(p + q, p ; q)<S(p) • SA(;\
(108.6)
Substituting (108.4) and comparing with (108.2), we get
«(p + q) - «(p) = -»(P + Q)^(P + q, P ; q)«(p) * <fe
or, in terms of the inverse matrices,
«-,<P + q) - <ê'\p) = <fep(p + q, p ; q)
(108.7)
(H. S. Green, 1953).
Taking the limit of this equation as q-»0 and equating coefficients when qM is
infinitesimal, we get
T—S-'fpHPCp.pîO).
(108.8)
dpM
This is Ward's identity (J. C. Ward, 1950). We see that the momentum derivative of
Exact Propagators and Vertex Parts
480
§108
^_1(p) is equal to the vertex operator with zero momentum transfert The derivative
of the function <3(p) itself is
dp»
mP) = i«(p)[-iT*(p, P ; o)]i«(p).
(108.9)
The higher derivatives could be found similarly by continuing the calculations
to higher orders in 8\, but we shall not need these expressions.
Let us now consider the derivative d&(k)ldkp, of the polarization operator.
Unlike S(p), &(k) is gauge-invariant and is unchanged by the application of the
fictitious external field (108.4). Its derivative therefore cannot be calculated in the
same way, but a diagram expression can be obtained for this derivative too. To do
so, we consider the first diagram in the definition of <3>\ the second-order diagram
ÎÉ.
An
P+ k
h—*—O—
(108.10)
The continuous lines correspond to the factors iG(p) and iG(p + k). Differentiation
with respect to k replaces the second factor by idG(p + k)ldk, and according to the
identity (108.9) this change is equivalent to adding a further vertex on the electron
line:
|k'«0
471 Î T
(108.11)
We see that, in the first non-vanishing order, the required derivative has been
expressed in terms of a diagram having three photon ends. It must be stressed
immediately that this diagram does not itself give the amplitude for the transformation of one photon into two. The amplitude of this process is the sum of
(108.11) and a similar diagram in which the loop is traversed in the other direction,
and the sum is zero by Furry's theorem. The diagram (108.11) is not itself zero.
In a similar manner, we can differentiate more complicated diagrams by
successively adding vertices with k' = 0 on all the electron lines which depend on k.
There are, however, diagrams in which the dependence on k occurs in the internal
photon lines also, for instance the diagram on the left in the next equation:
k-q,-q 2
dk"
t In the zero-order approximation, i.e. for the free-particle propagator, this identity is obvious:
G~\p) = yp - m, and therefore dG'xldp^. - y*.
§109
Electron Propagators in an External Field
481
The derivative of the diagram in the braces is shown here in diagram form by
means of a new graphical symbolism, a fictitious three-particle photon vertex, i.e. a
point where three broken lines meet, corresponding to the quantity
4 7 r i ^ r = 2ikM = v
(108.12)
We can now differentiate any diagram by adding, to the lines depending on k,
vertices tv or y^ and continuing in accordance with the general rules. Summation of
these higher-order corrections gives
1 d&
4ir odk^x
'*"
= r
( m i 3 )
where ief^ is the sum of the internal parts of all the diagrams with three photon
ends thus obtained.
We shall also need the second derivative of the polarization operator. Differentiating the equation (108.13) once more in a similar manner, we have
1 d2& „
î|,<r
4ir dk9dk°
™°v
ïstipov
' *sujopvt
'""""
^lv0.1*fj
where ie19> is the sum of the internal parts of all the diagrams with four photon
ends such as
S0
\v
ie j y
6
/
,0
(108.15)
including, of course, those containing the fictitious three-particle vertices (108.12).
§ 109. Electron propagators in an external field
If a system is in a given external field A(<)(x), the exact electron propagator is
expressed by the same formula (105.1), but in the Hamiltonian Ê = #<>+ V which
converts to the Heisenberg representation of operators we have also the interaction
between the electrons and the external field:
V = e i ÂJ» d*x + e ( A^J* d*x.
( 109.1)
Since the external field makes space and time no longer homogeneous, the
propagator <#(*, x') will now depend on the two arguments x and x' separately and
not only on the difference x-x'.
482
Exact Propagators and Vertex Parts
§109
If we proceed in the usual manner to the interaction representation, the
ordinary diagram technique is obtained, with external-field lines as well as virtual
photon lines. This technique is, however, unsuitable when the external field cannot
be regarded as a small perturbation, in particular when the particles may be in
bound states in the field. Now the electron propagator in an external field is in fact
required principally for the analysis of the properties of bound states, and in
particular for determining the energy levels with allowance for radiative corrections. In order to derive such a propagator, we have to start from a representation of operators where the external field is exactly taken into account even in the
"zero-order" approximation with respect to the electron-photon interaction (W. H.
Furry, 1951).
We shall henceforward assume the external field to be independent of time. The
desired representation of the (//-operators is given by the formulae (32.9) for second
quantization in an external field:
n
(109.2)
fr'XU r) = 2 {âW;\r)e^)l
+
bn^n~\r)e-^H}}
where i/zlr^r) and e{n] are the wave functions and energy levels of the electron and
the positron respectively, which are solutions of the "single-particle" problem, i.e.
of Dirac's equation for a particle in a field. It is easily seen that the operators
(109.2) are (//-operators in a certain representation (the Furry representation) which
is, as it were, intermediate between the Heisenberg and interaction representations.
They may be written
4>{^r)
=
ei**to(r)e-i**itA
iJj{e)(Ur) =
elH^(r)e-lH^\
(109.3)
where
Hx = H0+ef
A{;Xx)]*(x)d}:
The electromagnetic-field operator AM of course commutes with the second term in
Hi, and so the Furry representation is the same as the interaction representation
for this operator.
The electron propagator in the zero-order approximation, in the new representation, is defined as
G{^x, xf) = -i(0\T4fnx)mxf)\0).
(109.4)
The operator \p{e)(t, r) satisfies Dirac's equation in the external field:
[yp - eyA{e)(x) - mW'Xu r) = 0,
(109.5)
§109
483
Electron Propagators in an External Field
(e)
and the function G correspondingly satisfies the equation
[yp - eyA(e)(x) - m]G(e\x, x') = Ô(4)(x - x');
(109.6)
cf. the derivation of (107.5).
The diagram technique, which expresses the exact propagator ^ as a series in
powers of e2, is obtained by changing from the Heisenberg to the Furry representation, in exactly the same way as the earlier change to the interaction representation. The resulting diagrams are of the same form, with the continuous lines now
corresponding to factors iG{e) instead of iG.
One slight difference in the rules for writing the analytical expressions for the
diagrams arises because in the coordinate representation G(e) is not a function of
the difference x-x' only. In a constant external field, however, the homogeneity of
time is preserved, and so the times t and t' will again appear only as the difference
t-t'
= Tl
G{e) = Gu\r, r, r').
The momentum representation is obtained by a Fourier expansion with respect to
each of the arguments of the function:
G<«(r, r, r') = / / / «*> - ->G(., p2, p,) £ 0
$$.
(109.7)
Each line corresponding to the factor iGle)(e, p2, pi) must now be assigned one value
of the virtual energy e and two values of the momentum, the initial value pi and the
final value p2:
iG(e)(e,p2,pl) = p x e^ i .
(109.8)
This leads to the rule for writing the analytical expressions, in which the integration
over de/lv is normal, but those over d^pJilrr)3 and d3p2/(2ir)3 are independent, the
conservation of momentum at each vertex being taken into account. For example,
c-to
= e2 Iff
Gie\e, p2, p*)y*Gl'Xe - o>, p" - k, p' - k) x
x y'G«<«, p\ Pl)D„(o>, k) ^ 0 4 0f
g£.
(109.9)
It is important to note that in this technique one must also take account of
diagrams with "self-closed" electron lines, which in the ordinary technique are
rejected as being associated with a "vacuum current". When an external field is
484
§109
Exact Propagators and Vertex Parts
present, this current need not be zero, because of the "vacuum polarization"
caused by the field. For instance, in the diagram
(109.10)
9j
e
r
r
e
the loop at the top corresponds to the factor
Here, however, we must still specify the meaning of the integral over d<o. This is
because the integration of the Fourier component of the function G W (T) with
respect to w amounts to taking the value of that function at T = 0, and G U ) (T) is
discontinuous at T = 0; we must therefore indicate which of its two limiting values
is to be taken. To resolve this question, we need only note that the integral (109.11)
arises from the contraction of «//-operators in the same current operator:
J"sKr)YV(,,(tfr),
where «p* is to the left of \\t(t)- According to the definition of the propagator (109.4),
this order of factors for t = V is obtained if r' is taken as f +0, i.e. if the limiting
value of the function G(t\t - t') as t - 1 ' - > - 0 is taken. In other words, the integral
over avilir in (109.11) is to be taken as
/
e_lWdw
for
T
^_0
(109.12)
2TT
The mass operator in the external field is defined as in §105: -iM is the sum of
all the compact self-energy parts. It is now a function of the energy e and the
momenta pi and p2 at the ends of the external lines where they respectively enter
and leave the part in question:
(109.13)
- iXcp,,!,)
Proceeding exactly as in the derivation of (105.6), we get the equation
«(e,P2,Pi)-G<e)(e,p2,p,)
= \ \ &'\e, p2, p")^(e, p", p W , p\ p.) 0f
jjj£.
(109.14)
§109
485
Electron Propagators in an External Field
This can be put in a more natural form by returning to the coordinate
representation in terms of the spatial variables, using the function
<S(e, r, r') = jj <S(e, p2, p,)* i ( *-» ' ^ p f f i 2 ,
(10915)
and similarly for the other quantities. Taking the inverse Fourier transform of
(109.14), we obtain
<0(e,r,r')-G(e)(e,r,r')
= jj GM(e, r, r2)M(e, r2, r,)<3(e, r„ r') d3x, d3x2.
Next we apply to both sides the operator
where e is a number, and p = -iV is the operator of differentiation with respect to
the coordinates r. Here it must be noted that, by (109.6),
[y°e - y • p - eyAu\x)]G<e)(e, r, r') = ô(r - r').
(109.16)
The resulting equation is
r, r') - j M(e, r, r,)^(e, r,, r') d3x, = 5(r - r%
[y°e - y • p - eyA{t\xW{ey
(109.17)
The function <S(e,r, r') has the especially valuable property that its poles
determine the energy levels of the electron in the external field. We shall prove this
first for the approximate function G(<)(e, r, r'). Substituting the operators (109.2) in
the definition of the propagator (109.4), we obtain, in exact analogy to formulae
(75.12) for the free-particle propagator,
i 2 ^(rWftV) exp{-ie<+)(* - t%
G\l\t-t\r,r')
=
n
i 2 tf ( I Ä V ) e x p i i e z ~ t')},
n
t> t',
t < t',
(109.18)
and the Fourier time component is
We see that G(e)(e, r, r'), as an analytic function of e, has poles on the positive real
axis which coincide with the electron energy levels, and poles on the negative real
axis which coincide with the positron energy levels. The values e(ir)>m form a
§109
Exact Propagators and Vertex Parts
486
continuous spectrum,! and the corresponding poles form two cuts in the e-plane,
from -oo to - m and from m to oo. The segment \e\ < m contains poles which give
the discrete energy levels.
For the exact propagator ^(e, r, r') we can obtain a similar expansion by
expressing it in terms of the matrix elements of Schrödinger operators; the matrix
elements of Heisenberg (//-operators are related to these by
<m|iKt, r)|n> = (m\^(r)\n)e'li^E^.
(109.20)
Here the En are the exact energy levels (i.e. with all radiative corrections) of the
system in the external field. The operator i// increases the charge of the system by 1
(i.e. by +|e|), and t// decreases it by one. This means that in the matrix elements
<n|i//|0) and (0|<//|n) the states \n) must correspond to a charge of the system of +1,
i.e. they can contain, besides a single positron, only a certain number of electronpositron pairs and a certain number of photons; the energies of these states will be
denoted by E{„~}. Similarly, in the matrix elements (0|i//|n) and <n|</>|0) the states \n)
contain one electron and some pairs and photons (energy E(n+)). Instead of (109.18)
we now have
h{t-t',r9r>)
-i 2 <0|^(r)|n>(n|^(r')|0) exrf-iEiftf - f')},
t > t\
i 2 <0|«Mr')|n)<n|tMr)|0> exrfiEir'O - t%
t < t\
n
=
n
(109.21)
and hence
Let e be close to one of the discrete energy levels E(„+) (or -E(„_)). Then only the
corresponding pole term need be retained in the sum (109.22). Substitution in
(109.17) shows that the factors which depend on the second argument r' (when
r ?* r') do not appear in the equation. The result is a homogeneous integrodifferential equation for the function <0|«/r(r)|n> (or <n|^(r)|0», which we denote for
brevity by ^ n (r).t Omitting the subscript n, we have
[y°e + »y • V - eyA«>(r)]ik¥k(r) - j M*(e, r, r , m ( r , ) d3x, = 0
(109.23)
(J. Schwinger, 1951). The discrete energy levels En now appear as the eigenvalues
of this equation. Thus (109.23) becomes the regular basis for determining these
levels.
t We assume that the external field is zero at infinity.
I When radiative corrections are
neglected, the ^»(r) are the same (for states with one electron or
positron) as the wave functions t/»«+> or i/»î,-) which are solutions of Dirac's equation.
§110
Physical Conditions for Renormalization
487
For example, it can be used to determine the correction, in the first order with
respect to My to the discrete electron energy level e„ given by solving Dirac's
equation:
[7°£n + iy • V - eyA(t\r)]^n(r) = 0;
(109.24)
let the wave function i//„(r) be normalized by the condition
J»KiM3x = l.
(109.25)
The eigenfunction of equation (109.23) may be written
^ n (r) = ^(r) + ^n,)(r),
(109.26)
where tf/„n is the correction to «//„(r). Substituting (109.26) in (109.23), multiplying on
the left by ijt„(r) and integratingt over d3x, we get the required expression
En - en « [ f $ni(r)Mik(ent r, r,)«Mr,) d'x d3x,.
(109.27)
§ 110. Physical conditions for renormalization
The theory discussed so far in this chapter has been largely formal. We have
treated all quantities as if they were finite, and have deliberately passed over any
infinities which occur in the theory. In the practical calculation of the functions 3>,
^ and T by perturbation theory, however, divergent integrals arise, which cannot
be assigned any definite values without further consideration. These divergences
are a manifestation of the logical incompleteness of the existing quantum electrodynamics. It will be seen below, nevertheless, that in this theory it is possible to
establish certain rules which allow an unambiguous "subtraction of infinities",
and thus to obtain finite values for all quantities which have a direct physical
meaning. These rules are based on obvious physical requirements that the
photon mass is zero and the electron charge and mass are equal to their observed
values.
Let us first ascertain the conditions to be imposed on the photon propagator,
and consider a scattering process which can occur through one-particle intermediate states having one virtual photon. The amplitude of such a process must
have a pole when the square of the total 4-momentum P of the initial particles is
equal to the squared mass of the real photon, i.e. when P2 = 0; we have seen in §79
that this requirement follows from the general condition of unitarity. The pole term
t In the integration we use the fact that the differential operator in (109.24) is self-con jugate, and
thus transfer its action from »/»S," to <£,.
488
Exact Propagators and Vertex Parts
§110
in the amplitude arises from a diagram of the form (79.1):
.
k-P
.
.
(U0.1)
and when radiative corrections are taken into account the two parts of the diagram
must be joined by a thick broken line (the exact photon propagator). This means
that the function 2>(k2) must have a pole at k2 = 0, i.e. must be such that
2>->4irZ/k2 when k 2 - 0 ,
(110.2)
Z being a constant. Hence, for the polarization operator S?(k2), (103.21) gives
0>(O) = 0.
(110.3)
The coefficient in (110.2) is given by
1_ T
l
nk2)]
z-[ —jr\
k2-0
Further restrictions on the function 0*(k2) can be derived from an analysis of
the physical definition of the particle's electric charge: two classical (i.e. infinitely
heavy) particles at rest at a large distance apart must interact in accordance with
Coulomb's law, U = e2/r. (These are distances much greater than 1/m, where m is
the electron mass.) This interaction can also be represented by the diagram
;a)
jk
(dK
(HO.4)
Nb)
in which the upper and lower lines correspond to classical particles. The photon
self-energy corrections are taken into account in the virtual photon line. All other
corrections, affecting the heavy-particle lines, would make the diagram equal to
zero: the addition of any further internal lines in the diagram (110.4), for example a
photon line joining the lines a and c or a and b, would produce lines of virtual
heavy particles, with corresponding propagators. But the propagator of a particle
has its mass M in the denominator, and tends to zero asM-*».
The form of the diagram (110.4) makes it clear (cf. §83) that the factor e22(k2)
in it must be (apart from the sign) the Fourier transform of the particle interaction
potential. Since the interaction is steady, the virtual-photon frequency a» = 0, and
large distances correspond to small wave vectors k. The Fourier transform of the
§ 110
Physical Conditions for Renormalization
489
Coulomb potential is 47re2/k2. Since 2) depends only on k2 = a> 2 -k 2 , we finally
arrive at the condition
S-*4<7r/k2
when
k 2 ->0,
(110.5)
i.e. the coefficient in (110.2) must be Z = 1; the sign is obvious, since 2>(k2) tends to
the free-photon propagator D(k2). The polarization operator 0>(k2) must therefore
satisfy
0>(k2)/k2->O
when
k2^0.
(110.6)
This leads not only to the condition (110.3) given previously but also to the result
0>'(O) = O.
(110.7)
It has been noted in §103 that an effective external real-photon line corresponds
to the factor (103.15) or, using (103.16) and (103.20),
We now see from (110.5) and (110.6) that the correction term is zero. Thus we have
the important result that radiative corrections need not be considered in external
photon lines.
The natural physical requirements therefore lead to the establishment of
definite values (namely zero) for the quantities ^(0) and 0*'(O). The calculation of
these quantities from the perturbation-theory diagrams would lead to divergent
integrals, and we see that the way to eliminate such infinities is to assign fixed
values a priori to the divergent expressions, these values being determined by
physical requirements. This procedure is called renormalization of the quantities
concerned.t
The procedure can also be formulated in a somewhat different manner. For
instance, in renormalizing the particle charge one can define a non-physical
intrinsic (bare or unrenormalized) charge ec as a parameter which appears in the
expression for the original electromagnetic interaction operator in formal perturbation theory. The renormalization condition then becomes e2c3)(k2)->47re2lk2
(when k 2 ->0), e being the actual physical charge. Hence we have the relation
e2cZ = e2, which is used to eliminate the non-physical quantity ec from formulae
which concern observable effects. By putting immediately Z = l , the renormalization is effected "en route", and there is no need to use fictitious quant'ties
even in the intermediate steps.
Let us now investigate the conditions for renormalization of the electron propagator. To do so, we now consider a scattering process which can take place
t The idea of this approach was first put forward by H. A. Kramers (1947); the systematic
application of the renormalization method in quantum electrodynamics is due to Dyson, Tomonaga,
Feynman, and Schwinger.
490
Exact Propagators
and Vertex
§110
Parts
through a one-particle intermediate state with one virtual electron. The amplitude
of such a process must have a pole when the square of the total 4-momentum P, of
the initial particles is equal to the squared mass of the real electron, i.e. when
P]-m2. The pole term in the amplitude arises from a diagram of the form
>Pi
(110.8)
and when radiative corrections are taken into account the thick line is the exact
electron propagator. This means that the function <S(p) must have a pole at
p2 = m2, i.e. its limiting form there must be
^(P)^z»pilP^^0
+
g(P)
when
P2^m\
(110.9)
Z\ being a scalar constant and g(p) remaining finite as p2-+m2. The matrix
structure of the pole term in (110.9) (proportional to yp + m) is a consequence of
the same unitarity condition that causes the existence of the pole. We shall prove
this statement and at the same time elucidate the important question of the
renormalization conditions for the external electron lines.
If <S(p) has the limiting form (110.9), the inverse matrix is
^" 1 (p) Ä y"(7P - m ) - ( Y P -m)g(7p - m )
when p2-+m2.
(110.10)
The mass operator is
M = G ~ , - « ~ , ~ ( l - ^ - V y p - m ) + (-yp-m)g(yp-m)
when
p2->m2.
(110.11)
The effective external (say incoming) electron line corresponds (cf. (103.15)) to a
factor
a
U(p) = u(p) + <S(p)M(p)u(p),
(110.12)
where u(p) is the ordinary amplitude of the electron wave function, which satisfies
Dirac's equation ( y p - m ) u = 0 . Because of the requirements of relativistic invariance {% like u, is a bispinor), the limiting value of <ft(p) for p2-+m2 can differ
from that of u(p) only by a constant scalar factor:
<U(p) = Z'u(p).
(110.13)
This factor Z ' is related in a definite manner to the factor Zu but the relation
cannot be determined simply by substituting (110.10) and (110.11) in (110.12),
§110
Physical Conditions for Renormalization
491
because there is an indeterminacy; the result depends on the order in which the
limits of the various factors in (110.12) are taken.
It is, however, possible to avoid the problem of the correct method of taking the
limit, by using instead the unitarity condition for the reaction shown by the diagram
(110.8). The unitarity relation generally applies to amplitudes of processes as a
whole, not to individual diagrams. But when p2-*m2 the pole diagram (110.8) gives
the main contribution to the corresponding amplitude M/„ so that the other
diagrams which pertain to the same reaction can be ignored.
As has been shown in §79, the unitary conditions require that a one-particle
intermediate state should produce in the reaction amplitude an imaginary part with
a delta function:
iirô(p 2 -m 2 ) Y MfnM*,
(110.14)
polar.
where the subscript n refers to a state having one real electron, and the summation
is over the latter's polarizations; to avoid additional complications we assume, as in
§79, that both sides of the unitarity relation are symmetrized with respect to the
helicities of the initial and final particles, so that Mfi = Mif. The amplitude Mfn
corresponds to a process represented by the diagram
ft{
and is
M,n = (M'fn°U) = Z'(M'fnU),
whereM/n is a factor with one free bispinor index.t Similarly, the structure of the
amplitude M * is
M*=(^M;„*)=Z'(ÛM;„*).
Summation over the polarizations of the electron replaces the product (M'fnu)x
(UM'*) by M'fn(yp + m)Min, and so the term (110.14) in the amplitude M/, becomes
Z'2brS(p2- m2){M'fn(yp + m)M'*}.
Using this term in the imaginary part, we can reconstruct the entire pole term in the
scattering amplitude; from (79.5),
_
Z'2{M[n(yp + m)M':}
,
,
t One point should be clarified here. The electron, a stable particle, cannot really be transformed
into an assembly of real particles, but we may formally take as the latter certainfictitiousparticles
whose masses are such as to allow this transformation. The resulting relationship is then to be taken as
an analytical continuation to real masses.
492
§110
Exact Propagators and Vertex Parts
Calculation of the same amplitude directly from the diagram (110.8) gives
iM/,-= iM;n • i<3(p) • iJVC.
A comparison of the two formulae confirms the limiting expression written above
for ^(p) (the first term in (110.9)), and shows that
(H0.15)
Z' = y/zu
We shall now show that, when the limiting form of the electron propagator is
known, there is no need to establish any further conditions for the vertex operator.
Let us consider the diagram
(110.16)
X.
which represents the scattering of an electron in an external field A{e\k), in the first
order with respect to the field, and taking account of all radiative corrections. In
the limit k-*0, P2~»Pi = P, the self-energy corrections to the external-field line are
zero (since they vanish for any k2 = 0). Then the diagram corresponds to the
amplitude
M/i = -e^(p)r(p,p;0)^(p)- A(e) (k-+0),
(110.17)
i.e. the product of the potential A{e) and the electron transition current 'ÛY'U. But
when k-»0 the potential A(t\x) reduces to a constant independent of coordinates
and time. No physical field corresponds to this potential (a particular case of gauge
invariance), which therefore can cause no change in the electron current. Thus, in
the limit considered, the transition current ^VU must be simply the free current
üyu:
<fc(p)P<(p,p;0)<fc(p) = ZMpWuip)
(110.18)
= ü(p)y»u(p).
This is essentially also a definition of the physical charge on the electron. It is
easily seen to be necessarily satisfied, whatever the value of Z\\ substituting ^"'(p)
from (110.10) in Ward's identity (108.8), we find
r*(p, p; 0) = YX y" - 7Mg(P)(7P -m)-(yp-
m)g(p)y\
and (110.18) is satisfied, since (yp - m)u = 0, Q(yp - m) = 0.
We see that, when the amplitude of the physical process is calculated, the
"renormalization constant" Z\ disappears. Moreover, by using the indeterminacy
§111
Analytical Properties of Photon Propagators
493
that arises from divergences in the calculation of I\ we can simply require that
w(p)r,i(p,p;0)M(p) = ö(p)7' t u(p)
when p2 = m\
(110.19)
i.e. put Z\ = 1.
The convenience of this definition lies in the fact that there is no need to apply
corrections to the external electron lines: we have simply
°U(p)=u(p).
This can also be deduced directly by noticing that for Zx - 1 the mass operator (110.11)
is
M=(yp-m)g(yp-m)
(110.20)
and the second term in (110.12) obviously vanishes. Thus there is no need to
"renormalize" the external lines of any real particles, either photons or electrons.!
§111. Analytical properties of photon propagators
It is convenient to begin the study of the analytical properties of the photon
propagator with the function Il(k2). The reason is that the direct use of the
definition (103.1) for this purpose is difficult because the operators Â'1(x) are
gauge-ambiguous and their properties are therefore indeterminate.
The integral representation of the function n(k 2) (104.11) was derived from the
expression for the photon self-energy function in terms of the matrix elements of
the gauge-invariant current operator. Denoting the variable k2 byt t, let us consider
the properties of the function 11(0 in the complex f-plane.
From the integral representation
n
«> = / ^ F n ö
0
< 11U)
we see that 11(0 is real on the negative real axis, and elsewhere in the plane
satisfies the symmetry relation
n(t*) = II*(0-
(Hl-2)
The function U(t) can have singularities only at singular points of p(0- These
<are at values of t = k2 which are threshold values for the creation of various groups
t In renormalizing the photon propagator, the condition Z = 1 arose as a necessary physical
condition, after which the corrections to the external photon lines disappear automatically. Formally,
however, the situation is similar for both photon and electron external lines: when Z* 1, the wave
amplitude e» of a real photon, with corrections, would be multiplied by VZ.
t Not to be confused with the symbol for the time.
494
Exact Propagators and Vertex Parts
§111
of real particles by a virtual photon. At these values, new types of intermediate
states "come into play" in the sum (104.9). Their contribution is zero below the
threshold, but not zero above the threshold, and this causes the singularity at the
threshold itself. These threshold values are, of course, real and non-negative.t The
singularities of n(0 therefore also lie on the positive real t-axis. If a cut is made
along this axis, the function 11(0 is analytic throughout the cut plane.
The term +i0 in the denominator of the integrand in (111.1) shows that we must
pass below the pole V —t. This means that the value of 11(0 for real t must be
taken as its value on the upper edge of the cut. Using the rule (75.18):
—TH = P-:?M(x)t
x ± iO
(111.3)
x
we find that for real t
(111.4)
im 11(0 = im U(t + iO) = - irp(0On the lower edge of the cut, im II has the opposite sign; re II is the same on both
edges. Thus the discontinuity of 11(0 at the cut is
IK* + iO)-n(t - iO) = -2irip(0-
(111.5)
The integral representation (111.1) itself can in this way be regarded simply as
Cauchy's formula for the analytic function 11(0: applying this formula
c
to the contour
(111-7)
which passes around the cut, and assuming that 11(0 decreases sufficiently rapidly
at infinity, wefindthat the integral along the large circle is zero, and those along the
edges of the cut give the following dispersion relation between 11(f) and its
t For example, the point k2 = 0 is a threshold for the production of three (or a higher odd number of) real
photons; k1 ■ 4m2 is a threshold for electron-positron pair production, and so on.
§111
495
Analytical Properties of Photon Propagators
imaginary part:
7T J
t ~~ f
0
= 1 f imn . dt'.
0
irj
t'-t
(Hl.8)
Substitution of (IH.4) then gives (lll.l).t
The analytical properties of the functions 0>(O and 2>(0 are the same as those
of 11(0» in terms of which they are expressed by the simple formulae (104.2) and
(103.21). For 2(t) we have
W)=4jL(1+ni0)
(1119)
On the positive real f-axis, we must take t as t + iO, as shown above. The imaginary
part of 3)(t) can then be calculated from (111.3) and (111.4), bearing in mind that
ll(0/r->0 when f-*0, by (110.6). We thus find
im 2>(0 = -4TT 2 5(0 + (4ir/f2) im 11(0
= -4ir 2 8(0 - (4ir2lt2)p(t).
(111.10)
Now, applying to 2>(0 a dispersion relation of the form (111.8), we obtain the
integral representation
oo
4
0 (K)
O = : _ JL_+4 7 r fp(Q
t + iO
J t^t-t'
o
dt
'
+W
n)
unii;
(lll
called the Källen-Lehmann expansion (G. Kallén, 1952; H. Lehmann, 1954).
There is a close relationship between the position of the cut for the function
2)(0 (and therefore its imaginary part on the cut) and the unitarity condition for the
amplitude of the process a + b^c + d represented by the diagram (H0.4); this
reaction is, of course, a purely imagined one, but it does not violate the conservation laws, and the unitarity condition is formally valid for it.
In the initial state i of this process there are two "classical" particles a and b,
and in the final state there are two other such particles c and d. The unitary
condition is (7l.2)t
T„ - Tff = i(2«7T)4 2 TfnTUi4)(Pf - Pd;
(H1.12)
t Dispersion relations were first used in quantum field theory by M. Gell-Mann, M. L. Goldberger
and W. E. Thirring (1954).
t The amplitudes T/< differ from the M/< only by the factors shown in (64.10).
496
§112
Exact Propagators and Vertex Parts
the summation on the right is taken over all the physical "intermediate" states n. In
the present base these states are evidently those of the systems of real pairs and
photons which can be created by the virtual photon k, i.e. the states which occur in
the matrix elements in the definition (104,9) of the function p(k2). The amplitudes
Mfi and M * include the factors 2)(k2) and 2>*(k2) respectively, and their difference
contains im 2>(k2). We see, therefore, that the relation given by (111.4) between an
imaginary part of 2> and the existence of these intermediate states is a consequence
of the necessary requirements of unitarity.
We shall see later that it is convenient, in practical calculations of Q)(t) (or,
equivalently, 0>(O) by perturbation theory, to begin by finding the imaginary part of
0>, which does not involve divergent expressions. But if &(t) is then calculated
from a dispersion relation of the type (111.8), the integral diverges and further
subtraction operations are necessary in order to satisfy the conditions 0>(O) = 0 and
^'(O) = 0. This subtraction can, however, be effected without the explicit use of
divergent integrals. To do so, we need only apply the dispersion relation (111.8) not
to &(t) itself but to &(t)lt2. Then we have
(,1113)
»w-üf.V-Tio*'
0
This integral is convergent, and the function 0>(O thus obtained must necessarily
satisfy the required conditions. A relationship such as (111.13) is called a "doublesubtraction" dispersion relation. The significance of the change to 9>(t)lt2 becomes
especially clear if (111.13) is written in the form
rrjt-t-iO
0
rr J
0
t
ir J
0
tu
If the first ("non-regularized") integral is denoted by #(f)» the right-hand side is
#(0-£(0)-t#'(0).
§ 112. Regularization of Feynman integrals
The physical conditions of renormalization discussed in §110 enable us, in
principle, to derive a unique finite value for the amplitude of any electrodynamic
process in any approximation of perturbation theory.
Let us first of all ascertain the nature of the divergences that occur in the
integrals derived directly from Feynman diagrams. This is considerably facilitated
by counting the powers of the virtual 4-momenta which appear in the integrands.
Let us consider a diagram of order n (i.e. one containing n vertices), with Ne
external electron lines and Ny external photon lines; Ne is even, and the electron
lines form {Nt continuous sequences, each beginning and ending at a free end. The
number of internal electron lines in each such sequence is one less than the number.
of vertices in it; the total number of internal electron lines in the diagram is
§112
Regularization of Feynman Integrals
497
therefore n - {Ne. One photon line comes to each vertex: at Ny vertices the photon
line is external, and at the other n - NY it is internal. Since each internal photon line
joins two vertices, the total number of such lines is \(n - Ny).
Each internal photon line is associated with a factor D(k), which contains k to
the power - 2 . Each internal electron line is associated with a factor G(p), which
contains p to the power - 1 (when p2>m2). Thus the total power of the 4-momenta
in the denominator of the diagram is In - iNe - Ny.
The number of integrations (over d4p or d4k) in the diagram is equal to the
number of internal lines minus the number (n - 1) of additional conditions imposed
on the virtual momenta (of the n conservation laws at the vertices, one relates the
momenta at the free ends of the diagram). Multiplying by 4, we obtain the number
of integrations over all the 4-momentum components, 2(n -Ne-Ny + 2).
Lastly, the difference r between the number of integrations and the power of
the momenta in the denominator of the integrand is
r = 4-lNe-Ny,
(112.1)
and is independent of the order n of the diagram.
The condition r < 0 for the diagram as a whole is not in general sufficient for
convergence of the integral: the corresponding numbers r' for the internal sections
which can be taken from the diagram must also be negative. The existence of
sections having r' > 0 would make them divergent although the other integrals in
the diagram would converge "with something to spare". The condition r < 0 is,
however, sufficient for the convergence of the simplest diagrams, in which n =
Ne + Ny and there is only one integration over d4p.
If r s* 0, the integral always diverges. The order of divergence is at least r if r is
even, and at least r - 1 if r is odd; the decrease by one in the latter case is due to
the vanishing of the integrals over all 4-space of products of an odd number of
4-vectors. The order of divergence may be higher if there are internal sections with
r'>0.
Since Ne and Ny are positive integers, we see from (112.1) that there exist only
a few pairs of values of Ne and Ny for which r > 0. The simplest diagrams of each
such type may be enumerated, omitting the cases N« = Ny = 0 (vacuum loops) and
Ne = 0, Ny = 1 (mean value of the vacuum current), since they have no physical
meaning and the corresponding diagrams must be rejected, as already shown in
§103. The remaining cases are:
498
Exact Propagators and Vertex Parts
§112
In (a) the divergence is quadratic; in all the others (r = 0 and r = 1) it is logarithmic.
The diagram (112.2d) is the first correction to the vertex operator. It must
satisfy the condition (110.19), which we shall here write as
u(p)A'i(p,p;0)u(p) = 0 when p2 = m\
(112.3)
A* = r t - y ' \
(112.4)
with
Let A'i(p2, Pù k) be the Feynman integral as derived directly from the diagram.
This integral is logarithmically divergent, and does not itself satisfy the condition
(112.3), but we can obtain a quantity which does satisfy this condition by taking the
difference
AM(P2,P.;k) = Â't(p2,p1;k)-[A'1(p1,p1;0)]PÎ=mJ.
(112.5)
The leading divergent term in ÂM(p2, pû k) is obtained by taking the virtual photon
4-momentum / in the integrand to be arbitrarily large. It ist
- 4 ^ f ^ ( ^ : ( ^ > d4fA
and is independent of the 4-momenta of the external lines. In the difference (112.5)
the divergence therefore cancels, leaving a finite quantity. This operation of
removing the divergence by subtraction is called regularization of the integral.
It must be emphasized that the integral ÄM(p2, pù k) can be regularized by one
subtraction because here the divergence is only logarithmic, i.e. as weak as it can
be. If the integral involved divergences of various orders, a single subtraction with
k = 0 might be insufficient to eliminate all the divergent terms.
When the first correction in V1 (i.e. the first term in the expansion of A*1) has
been determined, the first correction in the electron propagator (the diagram
(112.2a)) can be calculated from Ward's identity (108.8), which may also be written
in the form
3^(p)/apM=A"(p,p;0),
(112.6)
with the mass operator M in place of % and A*1 in place of P*. This equation is to
be integrated with the boundary condition
u(p)M(p)u(p) = 0 for p2 = m2,
(112.7)
which follows from (110.20).
Finally, to calculate the first term in the expansion of the polarization operator,
we use the identity (108.14), which after contraction with respect to two pairs of
t The complete expression for the integral is given in (117.2).
§112
Regularization of Feynman Integrals
499
indices gives
±
&
4TT dKdk*
-VP
*'
a relation between the scalar functions 0> = \&* and y = <f%. Both these functions
depend only on the scalar variable k2, and so we have
2k2&"(k2) + 9>\k2) = ^f !f(k\
(112.8)
the primes denoting differentiation with respect to k2. With the condition 0>'(O) = 0,
this equation shows that
^(0) = 0.
(112.9)
In the first approximation of perturbation theory, 5^(k2) is given by the diagram
(112.2e), with the free ends having 4-momenta k, k, 0, 0. The corresponding Feynman integral 9{k2) diverges logarithmically, and can be regularized by a single
subtraction, using the condition (112.9):
^(k 2 ) = ^ ( k 2 ) - ^ ( 0 ) .
Then 0>(k2) is found by solving equation (112.8) with the boundary conditions
0>(O) = O, 0>'(O) = O.
In the next approximation of perturbation theory, the correction to the vertex
operator A(M2) is determined by the diagrams (106.10, (cMO)- The irreducible
diagrams (d)-(f) are calculated by a similar regularization of the integrals, using a
single subtraction as in (112.5), in the same way as in calculating the firstapproximation correction A(M1}. In the reducible diagrams, the internal self-energy
and vertex parts of lower order are immediately replaced by the already known
(regularized)first-approximationquantities (0>(1), M°\ A^), after which the integrals
obtained are again regularized in accordance with (112.5).t The corrections 0*(2) and
Mm can then be calculated from (112.6) and (112.8).
This systematic procedure will in principle enable us to derive finite values for
0>, M and A^ in any approximation of perturbation theory. It thus becomes possible
to calculate the amplitudes of physical scattering processes that are described by
diagrams containing 0>, Ji and AM as constituent sections.
The physical conditions derived in §111 are therefore sufficient for an unambiguous regularization of all the Feynman diagrams occurring in the theory. This
renormalizability is a far from trivial property of quantum electrodynamics.t
In a practical calculation of radiative corrections, the above procedure may,
however, not be the simplest and most rational. In Chapter XII we shall see, in
t In diagrams for still higher approximations, it may be necessary to replace the four-ended sections
& also by already regularized values.
t A different approach to renormalization theory in quantum electrodynamics is given by N. N.
Bogoliubov and D. V. Shirkov, Introduction to the Theory of Quantized Fields, Wiley, New York, 1980.
500
Exact Propagators and Vertex Parts
§112
particular, that a convenient treatment may start by calculating the imaginary parts
of the corresponding quantities; these are given by integrals which do not involve
divergences. The quantity itself is then found by analytical continuation, using the
dispersion relations. It thus becomes possible to avoid the lengthy calculations
which are needed in a direct regularization by subtractions.
CHAPTER XII
RADIATIVE CORRECTIONS
§113. Calculation of the polarization operator
LET us now go on to the actual calculation of the radiative corrections, and begin
with that of the polarization operator (J. Schwinger, 1949; R. P. Feynman, 1949). In
the first approximation of perturbation theory, this is given by the loop in the
diagram
(,i3i)
- - < ! > - -
As already mentioned, the problem becomes easier if we first calculate the
imaginary part of the required function. This in turn is most conveniently done by
using the unitarity relation. The virtual-photon lines are regarded as corresponding
to a fictitious "real" particle, a vector boson with mass M2 = k2 which interacts
with an electron in the same way as a photon does. Then (113.1) becomes the
diagram of a "real" process, and the unitarity condition can be justifiably applied to
it.
Thus we regard (113.1) as a diagram giving the amplitude of the transition of the
boson into itself (ä diagonal element of the S-matrix) via a decay into an electronpositron pair. The crosses in the diagram show where it is to be cut into two parts
so as to show the intermediate state that figures in the application of the unitarity
relation. This state contains an electron with 4-momentum p~^ p and a positron
with p+ = -(p - k).
The unitarity relation with a two-particle intermediate state (71.4), for coincident initial and final states, gives
Here the amplitude M«, from (113.1), is
iMu = V(4ir)e* • V(4ir)e„
l
-j^,
(l 13.3)
where eM is the boson polarization 4-vector, which according to (14.13) satisfies the
equation
eX = 0.
501
502
§ 113
Radiative Corrections
The amplitude Mni corresponds to the diagram for the decay of the boson into
an electron-positron pair:
-jv
p-
The corresponding expression is
j * = ü(p-)y»u(-p+).
Mm = -eV(4TT)e^,
(113.4)
Substitution of (113.3) and (113.4) in (113.2) gives
2
2e*evim^^ = -f-^
*T7T
S
fr*f^,<*°.
(113.5)
Dolar J
Here p = p- = - p+ and e = e+ + e_ = 2e+ are the momenta and total energy of the pair
in its centre-of-mass system; the integration is over the directions of p, and the
summation is over the polarizations of the two particles.
Let us now average both sides of (113.5) over the polarizations of the boson.
This is done by means of the formula (14.15):
e ^ev — — ~z {guv
K2
y
Since the tensor 0 ^ and the vector j M are transverse (3>^kv = 0, jMkM = 0), we have
as the result
2im^=T^-l^X
l^TT
£
f(jj*)d°,
polar J
(113.6)
where 9 =\&».
The summation over the polarizations is effected in the usual manner, the
integration over do reduces to a multiplication by 4TT, and so
2 im 0> = e2 ^ tr y»(yp- + m)y»(yp+ - m)
= - e 2 * ( p + p - + 2m2).
5e
In terms of the variable
= 2(m2 + p+p_),
(113.7)
§113
503
Calculation of the Polarization Operator
we have
e — t, p — 4* ~~ wi ,
and hence finally
im^(t) = - | a ^ f ^ f 4 m
(t + 2m2), f^4m 2 .
(113.8)
The value t =4m2 is the threshold value for the production of one electronpositron pair by a virtual photon (cf. the second footnote to §111); in the
approximation considered (~~e2), the state with a single pair is the only one that
can appear as an intermediate state in the unitarity condition (113.2). In this
approximation, therefore, the right-hand side of (113.2) is zero for t<4m 2 , and
hence
im0>(f) = O, t<4m2.
(113.9)
For the same reason, in this approximation the cut in the complex t-plane for
the function &(t) extends only from t = 4m2 on the real axis, and this point must be
the lower limit in the dispersion integral (111.13). Thus
X
_
a .2 (
l(t'-4m2\t'
at'
+ 2m2
no -~5^ r J T^rnöVr-7—)—F*--
(11310)
It is convenient for the expression of the result to replace t by a variable £, defined
as follows:
t/m2 = - ( l - £ ) 2 / f
(113.11)
This transformation maps the upper half-plane of t on a semicircle of unit radius in
the upper half-plane of £, as shown in Fig. 19, where corresponding line segments
in the two planes are indicated by similar markings. The semicircle £ = e*, 0 ^ <f> ^ ^
0
4m=
-I
FIG.
/
0
\ ©
\
19.
corresponds to the non-physical region 0«£ r7m2=£4. The right-hand and left-hand
radii on the real axis correspond to the physical regions t < 0 and t/m 2 >4.
The integral in (113.10) is most simply calculated by means of the substitution
V
1
504
§114
Radiative Corrections
and we first take the case t < 0 (so that the denominator does not vanish in the
range of integration and the imaginary term JO may be omitted). The result of the
integration, in terms of the variable £, is
»<e-£{-7+i(«+{)+(f+r4)î^to4
(11312)
The analytical continuation of this formula gives the function &(t) in the range
t > 4 m 2 also; this is done by putting £ = |£|e'ff (and the logarithm gives a contribution to the imaginary part: log £ = log |£| + iir).t For the non-physical region,
we must put £ = e'*, and then
nt) =
2am'
3TT
2
{- ¥ sin 2 ^ - 2 + (1 + 2 sin21
\<t>)4> cot &},
(113.13)
t/4m2 = sin2 2$.
In the limit of small \t\ (i.e. £-> 1), these formulae become
nt) = -£-A,
|fN4m2.
(113.14)
In the opposite case of large |r| (i.e. £-»0), we have
nt) = -£z\t\\ogK
(
f log
-t>4m\
i,r
* °-£ ( £- )-
t>4
m
(113.15)
In accordance with the significance of perturbation theory, these formulae are
valid if 0747T <D~] = tl4v. The condition for (113.15) to be applicable is thus
DIT
m
(113.16)
The radiative corrections which involve alog(|t|/m) are called logarithmic corrections.
§114. Radiative corrections to Coulomb's law
Let us apply the formulae derived above to the problem of the radiative
corrections to Coulomb's law. These corrections may be intuitively described as
resulting from the polarization of the vacuum around a point charge.
t The analytical continuation thus obtained is, as it should be, a continuation on the upper edge of
the cut, since the semicircle in the £-p!ane corresponds to the upper half-plane of t.
§114
Radiative Corrections to Coulomb's Law
505
If corrections are neglected, the field of a fixed centre (with charge e\) is given
by the Coulomb scalar potential 4> = Aie) = ejr. The components of its threedimensional Fourier expansion are
<D(k)^A&e)(k) = 47T€,/k2.
When the radiative corrections are included, this field is replaced by the "effective
field"
si? = AV + 20p f^- A[e) = AP + j - &Q)Aie);
(114.1)
cf. (103.15). The second term gives the required change in the scalar potential. In
the first approximation of perturbation theory for 0>(k2), we must take the expression derived in §113 and replace 2>(k2) by the zero-order approximation:
2)(k2)~D(k2) = -47r/k2.
Thus the radiative correction to the field potential is
o$(k) = - ^ 0 > ( - k 2 ) .
(114.2)
To determine the form of this correction in the coordinate representation, we
must take the inverse Fourier transform:
8<Hr) = f eik rô<D(k) d3k/(2Tr)\
(114.3)
Since 6<I>(k) is a function of t = -k 2 only, integration over angles gives
0
= 4 ^ : im J W - y V y d y ;
in the last transformation, we use the fact that the integrand is an even function of
y = V - 1 . The contour of integration can now be moved into the upper half-plane
of y, and made to coincide with the cut of the function 0>(-y2) (Fig. 20). This cut
1
©
2im
FIG.
20.
506
Radiative Corrections
§114
extends upwards along the imaginary axis from the point 2im, the physical sheet
corresponding to the left side of the cut. Replacing y by a new variable, y = ix, we
get
ô<D(r) = 2 ^ ; [ im Ô<D(x2) e'xxdx.
2m
Finally, on returning to the integration over t = x2, we have
[ im ô<ï>(t) e~rV' dt.
8<P(r) = ^ :
(114.4)
4m 2
The imaginary part
im54>(0 = - ^ p £ i m ^ ( 0
is taken from (113.8), and after an obvious change of variable we have
4.(r) = ^ + S*(r) = ^ { l + ^ J e - ^ ( l + ^ ) ^ ! = - ü d f }
(114.5)
1
(E. A. Uehling and R. Serber, 1935).
The integral can be evaluated in two limiting cases. Let us first take that of
small r (mr <^ 1), and divide the integral of the first term in the parenthesis into two
parts:
je-^^j)d(
I=
with £i chosen so that Mmr>(,\> 1. Consequently, we can take r = 0 in the first
integral, so that
1
-log 2£,-l.
§114
507
Radiative Corrections to Coulomb's Law
In Ii, on the other hand, unity may be omitted from £ 2 - 1:
h
-Ci
= - log £, • <T2mrfl + 2mr f <T2mr{ log £ d£.
In the exponential and the lower limit, it is permissible to put Ci = 0. Then, with the
change of variable 2mr£ = x, we have
h = - log 2£, + log — + J e~x log x dx
= -log 2£, + log — - C ,
where C = 0.577 . . . is Euler's constant.
In the integral of the second term in (114.5) we can immediately put r = 0:
l fV(£ 2 -l)
l
When the three integrals are added, £1 disappears, leaving
*<r> = 7 [ 1 + £( l o g ^- c -s)]-
r<<I,m
-
(1146)
When mr > 1, the range £ - 1 ~ 1/mr <^ 1 is important in the integral. The change
of variable £ = 1 + £ and appropriate transformations reduce it to
e-2mr j e~2m'( \ V(2£) d | = ^ 5 7 5 VTT e"2"'.
0
In this case, therefore,t
*<'>-r( l+ 4£?<ïS">
r>1/m
'
(1R7)
We see that the polarization of the vacuum alters the Coulomb field of a point
t The orig? . of the factor e~2mr in S4>(r) is evident from the form of the initial integral (114.4): when
r is large, the important values of ( are those near the lower limit. Thus the exponent is determined by
the position of the first singularity of the function S4>(t)-
508
Radiative Corrections
§115
charge in a region r ~ l/m( = ft/mc), where m is the electron mass. Outside this
region, the change in the field decreases exponentially.
One further general comment may be made. Hitherto we have implicitly
assumed that the radiative corrections arise from the interaction between the
photon field and the electron-positron field. Thus, by associating the internal closed
loops in the photon self-energy diagrams with the electrons, we have taken into
account the interaction of the photon with the "electron vacuum". But the photon
also interacts with the fields of other particles; the interaction with the "vacua" of
these fields is described by similar self-energy diagrams, in which the internal loops
are associated with the appropriate particles. The contributions of these
diagrams differ in order of magnitude from those of the electron diagrams by several
factors of mjm, where m is the mass of the particle concerned and me the electron
mass.
The particles whose mass is closest to that of the electron are the muons and
the pions. The numerical ratios mjmß and mjm^ are close to a. The radiative
corrections from these particles would therefore have to be taken into account
together with the electron corrections of higher orders. For muons, the radiative
corrections can in principle be calculated by means of the existing theory, but for
pions (which are strongly interacting particles) they cannot.
This places a fundamental limitation on exact calculations of specific effects in
present-day quantum electrodynamics. The use of arbitrarily high-order corrections
from the photon-electron interaction alone would be an invalid exaggeration of the
attainable accuracy.
The radiative corrections to Coulomb's law discussed in this section are, as we
have seen, valid even at distances r*£ l/me. We can now add that the formulae
obtained cease to be valid at distances r < 1/m^ (or l/m ïï ), where polarization of the
vacua of other particles becomes significant.
§ 115. Calculation of the imaginary part of the polarization
operator from the Feynman integral
In a direct calculation from the diagram (the loop in (113.1)), the polarization
operator in the first approximation of perturbation theory would be given by the
integral
l
^^~
e2 j tr y»G(p)y"G(p - k)0p.
(115.1)
This integral, however, taken over all four-dimensional p-space, is quadratically
divergent, and in order to obtain a finite result it would be necessary to regularize
the integral by the procedure described in §112.
Here, we shall not give the complete derivation, but show how one can use the
integral (115.1) to calculate the imaginary part of the polarization operator (which
has been determined in §113 by means of the unitarity condition); this derivation
includes a number of instructive points.
The imaginary part of the integral (115.1) does not diverge and therefore does
§115
Imaginary Part of the Polarization
509
Operator
not need regularization. For the scalar function im 9 = J im 0>£ we have
i m <a>-; m J ; i £ £ l f try{yp + m)yß(yp + yk; + m)
.4 1
After calculating the trace, we get
im 0>(k2) = im I
i<t>(p)d4p
(p - m + i0)[(p - k)2 - m2 + iOV
2
le1
<t>(p) = ^i(2m2
2
+
(115.2)
2
pk-p ).
Let k2 > 0. We shall use a frame of reference in which k = (k 0 ,0), and
( p - / c ) 2 = (Po-k 0 ) 2 -p 2 .
Using also the notation e = V(p + m2) (this is not the "energy" of the virtual
electron p0), we can write (115.2) in the form
im 0>(k2) = im f d3p f dp, T-i
J
J
^ J £ ( P o > P ,\;
« .ft1,
( p o - e +iO)[(po-k 0 ) - e 2 + i 0 ] '
(115.3)
9 2
<MPo, p) = Y~j (m 2 r e 2 + p0ko - po).
The integrand has poles at four values of p0:
(a) po = e - iO,
(b) po = k 0 - e + iO,
(a') p 0 = - e + iO,
(b') p0 = k0+ e - »0.
Figure 21 shows the configuration of these poles; we shall take the specific case
k0 > 0, but the final answer depends only on ko, and not on the sign of k0. We can
©
FIG.
21.
calculate the discontinuity of the function &(t) at the cut in the complex plane of
r = k2 = ko or, equivalents, on the real axis in the k0-plane. The real part of &(t) is
continuous at the cut, and the discontinuity is therefore
A W ) = 2iim W ) .
(115.4)
§115
Radiative Corrections
510
We shall first show how the position of the cut may be established from the
form of the integral. Let J(p, k0) denote the inner integral (over dp0) in (115.3). So
long as the upper and lower poles in Fig. 21 are atfinitedistances apart, the path of
integration over p0 can be taken far from the poles, as shown by the broken line. It
is therefore evident that in this case the integral I(p, k0) is unaltered by an
infinitesimal upward or downward movement of the poles b and bl respectively
away from the real axis, i.e. by the change k0-+ k0± îô, Ô -»0. Thus, where k0 tends
to its real value from above or from below, the value of J(p, k0) is the same, and
therefore makes no contribution to the discontinuity A0\ The situation is different
only if two poles (which may be a and b when k0 > 0) are exactly one beneath the
other, so that the contour of integration is "trapped" between them and cannot be
moved to a great distance. Thus the discontinuity aSP/^0 only if the condition
k 0 -£ = e, i.e. k0 = 2e =2V(p 2 +m 2 ), can be satisfied somewhere in the region of
integration over d3p« F° r this to be so, we must evidently have k 0 ^2m, i.e.
t ^4m 2 .t
The integral I(p, k0) can be written in the form
<ll5 5)
^•^-Jcpi-.W-W-e'r
'
c
with the terms I'O omitted from the denominator and the integration contour C
correspondingly modified, as shown in Fig. 22. We see that the discontinuity A^(0
is due to the impossibility of bringing the contour away from the pole a when it is
trapped between a and b. The contour C is therefore replaced by C\ which passes
©
C—•—yij
^
O*
fZy ■*—
a—•—x*s
^
y0J
^TN—^_
FIG.
22.
beneath a, and the integral around a small circle C" centred at a is added. Then C" can
always be moved away from the poles without any difficulty, and the integration along
it therefore contributes only to the regular part of &(t). To determine the required
discontinuity, we need only consider the integral along the circle C", which is done by
calculating the residue at the pole a. This calculation can be made by substituting in the
integrand
-^r—i^-2m8(pl--e2)\
Po~ s
(115.6)
t It can be shown similarly that there is no cut when t = k 2 < 0 . Taking in this case a frame of
reference where k = (0, k), we find that the poles of the integrand are at
po = ± ( e - i O ) ,
p o = ± ( V [ ( p - k ) 2 + m 2 ]-iO).
The two lower poles are always in the right half-plane of po, and the two upper ones in the left
half-plane, so that no pair can be vertically aligned.
Imaginary Part of the Polarization
§115
Operator
511
the minus sign is used because the circle around the pole is traversed in the
negative direction. In the argument of the delta function, only the zero p 0 = + e is
to be used (the pole a, not a'); this is automatically done if we agree to integrate
over only half of momentum 4-space (p0 > 0).
After the substitution (115.6), the discontinuity of the integral I(p, k0) can be
calculated immediately:
AI = {I(p, k0 + iô) ~ Kp, ko - i8)h^+o
= - 2m J d(pl - e2)i<Mpo, P ) x
o
X
L(ko-po) 2 -e 2 +iÔ " ( k o - p o ) 2 - e 2 - i ô J
dp
°'
With the equation
71
^
T-rrz = P 77
^
5+i7r8[(ko-po) 2 -e 2 ]
2
(ko-po) - e ± i ô
(ko-po)-e
(see (111.3)), we have
AI = i(27TÎ)2J ô(p§-£ 2 )ô[(k 0 -po) 2 -e 2 ]</>(Po,p)dpo.
0
The arguments of the delta functions can be rewritten in an invariant form by
adding and subtracting p2:
pl-e2
= p2-m2,
(ko-po)2-e2 = ( k - p ) 2 - m 2 .
We then obtain finally
A0>(k2) =
Î(2TT0 2
I d4p • <f>(p)ô(p2 - m2)ô[(p - k)2 - m2].
Po>o
(115.7)
Because of the delta functions, the integration is actually taken only over the
region of intersection of the hypersurfaces
p2 = m2,
(p-k)2
= m2.
(115.8)
Since in this region all the 4-vectors p are time-like, the condition of integration
over p o > 0 is invariant (the upper interior region of the cone p 2 = m2).
Let us compare (115.7) with the original formula (115.2). We see that the
discontinuity of the function &(i) at the cut in the f-plane can be found by applying
512
§115
Radiative Corrections
in the original Feynman integral the substitution
-3— l-r—rïr-*-27riô(p2-m2)
r
p -m +i0
(115.9)
'
v
in the propagators which correspond to the loop lines intersected in the diagram
(113.1) (S. Mandelstam, 1958; R. E. Cutkosky, 1960).
The conditions (115.8) select the region of momentum space in which the lines
of virtual particles in the diagram correspond to real particles (the 4-momenta p
and p - k are then said to lie on the mass surface). Here we see clearly the
relationship to the unitarity-relation method, where these lines are replaced by lines
of intermediate-state real particles.
We also observe the mathematical reason for the absence of divergence in the
imaginary part of the diagram: the integration is over a finite region of the mass
surface, not over the whole of infinite momentum 4-space as in the original
Feynman integral.
In order to derive from (115.7) the formula obtained in §113, we return to the
frame of reference in which k = 0 and integrate over
d4p - |p|e de dpo do.
The integration amounts to removing the delta functions, with
5(p 2 - m2) dpo = 5(po- e2) dp0->^- Hpo~e) dp0,
IE
and then
Ô[(p - k)2 - m2] de = ô[(po - ko)2 - e2] de
=
8(-2ek0+kî)de
->2k~S(e-2ko)<*e.
The result is
A3>(t) = - \ iV I yji^Y^y^
P) do,
where t = k2 = ko; the value of <j> is taken for
Po=e=5k 0 ,
p2 = e 2 - m 2 = Jko-m 2 ,
i.e. it is
4>(e,p) = ^ ( 2 m 2 + 0
(115.10)
§116
Electromagnetic Form Factors of the Electron
513
and is independent of the angle. The integration over do reduces to multiplication
by 4ir, and we come back to (H3.8).
The only vital point in the foregoing derivation is that the diagram is divided
into two parts by cutting no more than two lines. The rule stated therefore remains
valid for diagrams comprising any two sections joined by two (electron or photon)
lines. The integral calculated by the substitution (H5.9) then gives the contribution
to the imaginary part of the diagram that arises (in the unitarity-relation method)
from the corresponding two-particle intermediate state.
§ 116. Electromagnetic form factors of the electron
Let us consider the vertex operator P 1 = Ttl(p2, p\\ k) in the case where the two
electron lines are external lines and the photon line is internal. The external
electron lines correspond to factors MI = M(PI) and Ü2 = M(P2), and T therefore
appears in the expression for the diagram as the product
if^ü.T^.
(116.1)
As already noted in §111, this is the electron transition current, including radiative
corrections. The conditions of relativistic and gauge invariance enable us to
establish the general matrix structure of this current.
The electromagnetic interaction operator V = e(fÂ) is a true scalar (not a
pseudoscalar), in accordance with the conservation of spatial parity in these
interactions. The transition current j/, is therefore a true 4-vector (not a pseudovector), and hence can be expressed only in terms of other true 4-vectors formed
from the two available 4-vectors px and p 2 (k = p 2 - p\ is a third) and the bispinors
U\ and u2. There are three independent 4-vectors of this kind bilinear in ü2 and u,:
M2Y"l,(Ml)pl,(«2Wl)P2,
or, equivalently,
ü2yuu(ü2Ui)PAÜ2U\)k,
(116.2)
where P = pi + p2. The condition of gauge invariance requires that the transition
current should be transverse to the photon 4-momentum k:
j/ik=0.
(116.3)
This is satisfied by the first two 4-vectors (116.2), respectively because of Dirac's
equations
(YPi-m)tt, = 0, ö 2 (yP2-m) = 0
(116.4)
and because Pk = 0. The current jfi is given by a linear combination of these two
514
Radiative
Corrections
§116
4-vectors:
J}5 = / i ( ö 2 M , ) P ^ +/ 2 (ö 2 -y | £ Mi),
where f\ and f2 are invariant functions, called the electromagnetic form factors of
the electron.
Since the 4-momenta px and p2 relate to the free electron, p} = p2 = m2, and
from the three 4-vectors pu p2 and k (which are related by k= p2-p\)
we can
construct only one independent scalar variable, which we take as k2. Then the form
factors are functions of k2.
The expression for the current can also be put in other forms with different
choices of the two independent terms. Using the equations (116.4) and the commutation rules for the matrices y, we can easily show that
{ü2d^ux)kv = - 2m(û2y'4ii,) + (ö2u,)PM,
(116.5)
where a^ = 2(yfly1' - y'y*1)* The coefficient of this term will later be seen to have
an important physical significance, and we therefore write
P =y W
2
) ~ g ( k V a ,
Lm
(116.6)
where / and g are two other form factors; the reason for writing the factor 112m
separately will be explained below.t For brevity, we shall always use the vertex
operator instead of the current, the ü2 and U\ on either side being understood.
In order to determine the properties of the form factors, let us consider the
diagram (110.16) for the interaction of an electron with an external field. The
corresponding scattering amplitude is
Mfi = -eJtistïXk)9
(116.7)
where d^ is the effective external field (taking account of the polarization of the
vacuum).
The amplitude (116.7) describes two reaction channels. In the scattering channel, the invariant t is such that
t = k2 = ( p 2 - P l ) 2 ^ 0 .
Putting p- for p2 and - p + for pu we change to the annihilation channel, which
corresponds to pair production with 4-momenta p_ and p+. In this channel,
t = (p_ + p+) 2 ^4m 2 .
The range 0 < t < 4 m 2 is non-physical.
t To avoid misunderstanding, it may be mentioned that in the definition (116.6) k is assumed to be
the 4-momentum of the photon line coming to the vertex; for the outgoing line, the sign of the second
term would be reversed.
§116
Electromagnetic Form Factors of the Electron
515
Let us now consider the unitarity condition (111.12). In the scattering channel
(r<0) there are no physical intermediate states in this case: one free electron
cannot change its momentum or give rise to any other particles. There are also, of
course, no intermediate states in the non-physical region. Hence, when t <4m 2 , the
right-hand side of (111.12) is zero, so that the matrix Tfi (or, equivalently, M/() is
Hermitian:
Mfl = M«?/.
The interchange of the initial and final states corresponds to the interchange of p2
and pi, and therefore to a reversal of the sign of k. Putting Mfi in the form (116.7),
we therefore have
}f,s4{t)(k) =
}^s4le)*(-k).
Since sd(e)(- k) = si(e)*(k), it follows that the transition-current matrix is also
Hermitian:
jfi = j1f when t<4m2.
(116.8)
Using the properties of the matrices y (21.7), we can easily prove that
(Ü27>1M.) = ( Ö 1 7 ^ 2 ) * ,
^(T^u,) = -(".0-^2)*.
Thus jf/ differs from jfi only in that the functions /(f) and g(f) are replaced by their
complex conjugates, and it then follows from (116.8) that these functions are real.
Thus
im/(f) = img(f) = 0 when f<4m 2 .
(116.9)
In the annihilation channel (f>4m 2 ), the / state is a pair which can be
transformed into another pair with different momenta (elastic scattering) or into a
more complex system. The right-hand side of the unitarity condition is therefore
not zero, the matrix Mfi (and hence jfi) is not Hermitian, and so the form factors are
complex.
The analytical properties of the functions /(f) and g(f) are exactly similar to
those of the function 0>(f) discussed in §111, although it is difficult to prove them so
directly. These functions are analytic in the complex t-plane cut along the positive real
axis t >4m 2 , with
/*(*) =/0*),
g*(t) = g(t*).
The renormalization condition (110.19) applied to the vertex operator (116.6)
leads to the requirement
/(0)=1.
(116.10)
516
Radiative Corrections
§116
In order to include this condition automatically (when calculating f(t) from its
imaginary part), we must apply a dispersion relation of the form (111.8) not to the
function f(t) itself but to (/ - l)/f. Then we get the dispersion relation "with one
subtraction":
4m 2
No values for the form factor g(t) are prescribed by physical conditions. Its
dispersion relation will therefore be written "without subtractions":
«<«>-£ f 7^Todt'4m 2
<"6-,2>
The value of g(0) has an important physical significance, as it specifies the
correction to the magnetic moment of the electron. In order to see this, let us
consider the scattering of a non-relativistic electron in a constant magnetic field
which is almost uniform in space.
The term in the scattering amplitude (116.7) which depends on the form factor
g(fc2) is
SMfl,.=j^g{k*Kü2cr'u'ul)k¥Al:Kk).
(116.13)
For a purely magnetic field, A(')M = (0, A); since the field is constant in time, the
4-vector k** = (0, k), and since it varies only slowly in space, k is small. With a view
to subsequently taking the limit k-*0, we have already replaced the effective ,s#(€)
by A w in (116.13). Expanding (116.13) and expressing it in terms of threedimensional quantities, we find
SM/i = 2 ^ g( - k2)(ü2Xu,)ik x Ak,
where 2 is the matrix (21.21). The product ik x Ak is replaced by the magnetic field
Hk, and we can then take the limit k->0. Finally, with the non-relativistic spinor
amplitudes w,, w2 given by (23.12):
ü2 = V(2m)(M>!0),
u, = V(2m)(o'),
we have
ÔM /I =^-g(0)H k -2m(wîo-w l ).
(116.14)
§117
Calculation of Electron Form Factors
517
This expression may be compared with the scattering amplitude in a constant
electric field having the scalar potential 4>k:
Mfi = - e(û27°Ui)4)k — - e4>k • 2m(w$Wi).
We see that an electron in a magnetic field can be regarded as having an additional
potential energy
This means that the electron has an ''anomalous" magnetic moment (in ordinary
units)
(116.15)
I*.'= (ehHmc)g(O)
in addition to its "normal" Dirac magnetic moment ehUmc.
§ 117. Calculation of electron form factors
Let us now go on to the actual calculation of the electron form factors (J.
Schwinger, 1949). In the zero-order approximation of perturbation theory, the
vertex operator P 1 = y*\ i.e. the electron form factors are / = 1, g = 0. The first
radiative correction to the form factors is given by the vertex dia~ r am
(117.1)
with two real external electron lines and one virtual external photon line. We shall
first calculate the imaginary parts of the form factors. As has been shown in §116,
these differ from zero only in the annihilation channel (Jc 2 >4m 2 ); accordingly, the
4-momenta of the external electron lines in the diagram (117.1) correspond to the
production of an electron and a positron, and are denoted by p - and - p + . The
analytical expression of the diagram (117.1) is
- ieü(p_)Pii( - p*) = ( - ie?ü(p-)y*i j G(p)y»G(p
- k)y>D.M) ^
II( - p+)
(117.2)
518
§117
Radiative Corrections
or, in expanded form,
y7(fc 2 )-^-g(fcV^,= f, i
''
2m
ô
l(
\y)d[%
J (p - n r ) [ ( p -k)
rv
-m ]
(117.3)
with the notation
</>^(p) =
- g ^^P
+m
)y(^-Jfc-fm)^
47T ( P - ~ p )
(117.4)
and omitting for brevity the factors ü(p_), w(-p+); it is understood that both sides of
the equations below are placed between these factors.
The horizontal dotted line divides the diagram (117.1) into two parts, in such a
way as to indicate the intermediate state which would figure in a calculation of the
imaginary part of the form factor by means of the unitarity condition, namely the
state of an electron-positron pair with momenta differing from p., p + . The
intersection also shows where the pole factors are to be replaced in the integral
(117.2) in order to use the rule (115.9) in the calculation (in (117.3), these factors
have been separated in the integrand).
The integral in (117.3) has the same form as that in (115.2), and we can therefore
immediately write the result of the transformation as in (115.10):
2 7 M i m / ( 0 - ^ c r * % im gU
("7.5)
where t = k2, the integration is over the directions of the vector p, and the 4-vectors
p~ = p and p+ = k-p in the definition of the function <f>^(p) (117.4) become the
4-momenta of real (not virtual) particles. The expression (117.5) relates to a frame
of reference in which k = 0, i.e. the centre-of-mass system çf the pair p_, p+ (and
hence of the "intermediate" pair p~, pi). In this frame, therefore, k = (k 0 ,0),
P- = (2*0, P-), P+ = (2^0, - p - ) , p = (2JC0, P)> a n c i it is easy to verify that
f2 = (P~ P)2 = - 2p2(l - cos 0) = - i(f - 4m2)(l - cos 0),
(117.6)
where 0 is the angle between p and p- (and p2 = p 2 ). Not substituting (117.4) in
(117.5) and eliminating the matrices y \ .. yp in the integrand by means of (22.6), we
have
y» im / < 0 - 2 ^ **% im g(f)
=
- SvffirhïP» / ïïo^ë) y " i y p
=
' W [ , ( / - W ) ] / 2,(1 -"cos 0) ' - 2 m V
+ 2(yp+-yf)y»(yp-
+ yf)l
+ m)y iyp
" ~ yk + m)y'
+4
"" P " + 2 ^ ) +
(117.7)
§117
Calculation of Electron Form Factors
519
with the 4-vectors
/ = P - P - = (0,I), P = p - - P * = (0,2p-).
(117.8)
The integration now amounts to the calculation of the three integrals with
different numerators
J 1 - C O S 6 Lit
The integral I is logarithmically divergent when 0 -♦ 0. If it is written as
t-4m2
-(l-4ro2)
2
1= |
2
d(f )/f =
J
d(/ 2 )// 2 ,
we see that the divergence corresponds to small "masses" of the virtual photon, i.e.
it is an "infra-red" divergence, which will be further discussed in §122; here we
shall merely note that the divergence is fictitious, in the sense that, when all
physical effects are correctly taken into account, the divergences of this kind
cancel out. We can therefore cut the integral off at any lower limit, and then, in the
subsequent calculation of real physical phenomena, make this limit tend to zero.
Here it will be simplest to apply the cut-off in a relativistically invariant manner.
To do so, we assign to the virtual photon/ a small butfinitemass A (*^m), i.e. make the
change
/2-/2-A2
(117.10)
in the photon propagator D(/2) in (117.2). Then
r
/=
-<J-4m*)
f
J
o
<*(/2) . t - 4 m 2
r2_A2 = log—p—.
M
n n i n
(117.11)
The integral Jf\ in which / is a space-like 4-vector, must be expressed in terms
of the 4-vector P", which (unlike k", the only other available 4-vector) is space-like
for all p+ and p_. Hence J*1 = AP». If this equation is multiplied by P^ and the
integral PJ* is calculated in the centre-of-mass system of the pair (with the
components of the 4-vectors / and P given by (117.8)), the result is
i rrj»,dcos $
2?) I i - COS0
-'J
1
d cos (F= - 1.
520
Radiative Corrections
§117
Thus
I" = - ? * .
(117.12)
We can similarly calculate the integral
V = \P2(g^ -^^-)
+ \P^Pv;
(117.13)
to determine the coefficients in this expression, we have only to evaluate the
integrals 1% and I»VP»PV.
The calculation continues as follows: (117.11)—(117.13) are substituted in (117.7)
to obtain a series of terms between ü(P-) and u(-p+). In each term we use the
commutation rules for the matrices y* to "propel" the factor yp+ to the right and
yp- to the left; then we can make the replacements yp-^m, ?p+->- m, since
ü(p-)yp- = mü(p-), yp+u(-P+) = - mu(-p+).
In the resulting sum
- 4(p+p_)Iy'1 + 2mP* - 3P V ,
we can then replace P* by 2myM + crM%, which is equivalent to it when between
the ü and u factors; cf. (116.5). Finally, all quantities are expressed in terms of the
invariant t = k2 (2p+p_ = t -2m 2 , P 2 = 4m 2 -f) and the two sides of (117.7) are
compared; this yields the following expressions for the imaginary parts of the form
factors:
""^'Vud-W)]'
im
/<*> = 4V[t(r- 4m2)] [ ~ 3' + 8m2 + 2(f - 2m2) log ^ ^ ] .
(11714)
(117.15)
The infra-red divergence occurs only in im/(t).
The functions f(t) and g(t) themselves are obtained from their imaginary parts
by means of (116.11) and (116.12), in which the integrations are effected by the
same substitutions as were used in §113 to calculate 0*(O- The form factors are
given in terms of the variable £ (113.11) by
+j ^
I ^ M log21 - 2F(|) +2 log £ log (1+ £)]},
where F(£) is Spence's function (131.19).
(117.17)
§118
521
Anomalous Magnetic Moment of the Electron
In the non-physical region (0< r/m 2 <4), we must put £ = ei4>. The expressions
for the form factors can then be written
IT l\
tan<£/
A
2 sin <>
/
tan <>
/ J
o
J
(117.18)
(H7.19)
SW-TZZ&A-
Z7T s m <p
Finally, the following are the limiting expressions: for small |t|,
/(0
~1
=
3^( 1 O 8 T""I)' l'H4"1*' *(0 = a/2irf
(117.20)
and for large |t|,
+
+ 1,B wx
/(.)-'-èo^ ^ï^) {
>4m 2 ,
>4m 2 ,
(117.21)
ï M:
Formula (117.21) is valid (as regards re/) with what is called double-logarithmic
accuracy, i.e. as far as the squares of the large logarithms.!
§ 118. Anomalous magnetic moment of the electron
As has been shown in §116, the value of g(0) determines the radiative correction
to the magnetic moment of the electron. If we seek to calculate only this quantity,
there is of course no need to find the whole function g(t). From (117.14) and
(116.12),
nl(\\-
x
f
im
g(*') if -
a
f
dx
-
a
man
With this correction, the magnetic moment of the electron is
"-&('►£)•
<»«*■
a formula first derived by Schwinger (1949).
t The expression for thé vertex operator in the case of one virtual and one real external electron line
and a real external photon line is given by A. I. Akhiezer and V. B. Berestetskii, Quantum Electrodynamics, Interscience, New York, 1965, §47.5.
522
§118
Radiative Corrections
In the next approximation (with terms in a2), the radiative corrections in the
form factors are represented by the seven diagrams (106.10, (c)-(i))- Even to find
the value of g(0) in this approximation demands very lengthy calculations. Details
of these may be found in the original papers; here we shall give only the final
valuet for the correction in the second approximation:
= -0.328 a2/ir2.
(H8.3)
The magnetic moment of the electron is therefore
"=2^( l + £-°- 3 2 8 £)
(ll84
>
(C. M. Sommerfield, 1957; A. Petermann, 1957).
Particular consideration may be given to the contribution of the vacuum
polarization to the correction g(2)(0), namely the diagram
(118.5)
which contains a photon self-energy part. It differs from the first-approximation
diagram (117.1) only by having instead of the photon propagator D(/2) = 47r//2 the
product
D</>)^W> = ^ ,
where 0>(/2) is the polarization operator in the first approximation (terms in a),
calculated in § 113. Repeating, with this difference, some of the calculations in § 117, we
find as the "polarization part" of the correction
(2) „ ,
am2
( 9>(j2) 1 + 3cos 6 JCOS
im gghrXO = V[f(f-4m 2 )] J
<f
2
n
'
(
„ , 60) ^
-i
with
/2 = - i ( t - 4 m 2 ) ( l - c o s 0 ) ;
(118.7)
t The calculation by the unitarity method is given by M. V. Terent'ev, Soviel Physics JETP 16,444,
1963.
§118
Anomalous Magnetic Moment of the Electron
523
see (117.6). When this integral and then
ggur.(0) = -
IT
f img£Uo¥
J
4m2
I
(118.8)
are calculated, the result is
g(X(0) = ^ ( ^ - | T T 2 ) = 0.016^,
(118.9)
which is about 5% of the total quantity (118.3).
It has already been noted at the end of § 114 that vacuum polarization for other
particles may also make a contribution to the radiative corrections. The contribution of the muon vacuum to the anomalous magnetic moment of the electron is
obtained from the same formulae (118.6MH8.8), in which m is again the electron
mass me (this applies also to the definition of the variable /*) but the parameter m
in the expression for 0>(/2) must be the muon mass mM. The function ^(f 2 )// 2
depends only on the ratio f\m\. In the integral (118.8) the important values of t
(and therefore of f2) are those comparable with m2; hence the ratio / 2 / m 2 ~
(mjmll)2<^ 1, and in evaluating the integrals we can use the limiting formula
(113.14), according to which
0>(/2) =
f
2
o_ f2
\5<rrml'
From this we see that the contribution to g2(0) from the muon vacuum polarization
contains the extra small factor {mjm^f.
The opposite situation, however, occurs for the corrections to the magnetic
moment of the muon. Since the particle mass does not appear in (118.3), this value
of g2(0) is valid for the muon also, and it takes into account the contribution of the
muon vacuum polarization. But the contribution from the vacuum polarization of
other particles, namely the electrons, is here considerably greater. It can be
calculated from formulae (118.6HU8.8), with mM for m and the electron polarization operator for &(t). Unlike the previous case, the important range of values is
and ^(/ 2 ) must be taken as the limiting
now given by f2jm2e~{mjmef>\,
expression (113.15):
j
5TT
me
The calculation of the integrals gives
[g2(0)W
—= (f) 2 (i l08 ^-i)
= 1.09 a2h2
(H. Suura and E. H. Wichmann, 1957; A. Petermann, 1957).
(118.10)
524
§ 119
Radiative Corrections
Adding (118.10) and (118.3), we find as the magnetic moment of the muon
eh
- ( l + ^ + OJÔ^À
2m
2TT
IT1)
Lc \
(118.11)
The contribution of the muon vacuum polarization (118.9) is here about 2% of the
total value of g(2)(0). The pion vacuum polarization would give a contribution of the
same order (because of the similarity of the masses), but this cannot be calculated
exactly, and so there would be no point in finding the corrections —a3 in the
magnetic moment of the muon.
§ 119. Calculation of the mass operator
The method of direct regularization of the Feynman integrals may be demonstrated by a calculation of the mass operator. In the first non-vanishing approximation, the mass operator is represented by the loop in the diagram
p-k
(119.1)
k
corresponding to the integral
- iM(p) = ( - ie)2 $ y»G(p - k)y'D^(k)
^ ;
on substituting the propagators and combining the factors y*...
formulae (22.6), we get
yM by means of
(the bar over M denotes the non-regularized value of the integral). The fictitious
"photon mass" A is used in the photon propagator in orde'r to eliminate the
infra-red divergence (as in §117).
The integral may be transformed by means of formula (131.4), with a, and a 2
the two factors in the denominator of (119.2). A simple rearrangement of the terms
in the denominator of the new integral gives
0
with
a2 =
m
2
x 2 - (p 2 - m 2 )x(l - x) + A2(l - x).
(119.4)
§119
525
Calculation of the Mass Operator
The change of variable k-^k + px brings the integrand in (119.3) to a form in
which its denominator depends only on k2; according to (131.17), (131.18) a
constant is then added to the integral:
A(p) =
i
e
d k dx 2m
- 5Ü7 H ' J
0
<kiy-% X)~ï iir'w }•
(1195)
(The term in yk in the numerator is here omitted, since it gives zero on integration
over the directions of the 4-vector k; cf. (131.8).)
The regularization of this integral involves subtractions such as to reduce it to
the form (110.20). The latter expression gives zero on multiplication by the wave
amplitude u(p), if p is the 4-momentum of a real electron. Without bringing in u(p)
explicitly, we can formulate this condition as stating that M(p) vanishes when we
substitute
yp^m9
p2^m\
(119.6)
The form of the integral (119.5) is convenient in that the 4-vector p appears in it
only as yp and p2, but not as kp.
Subtracting from (119.5) a similar expression after the substitution (119.6), we
get
i
~ IfMJ W d*[2m ~w(i ~ * 4 < F ^ ~ ( F ^ F ] 0
1
-Jd4k|dx(Jj^(7p-m)-ii7r2(7P-m)},
(119.7)
0
with a £ = m V + A 2 (l-x).
To complete the regularization, however, a further subtraction is needed:
according to (110.20), the substitution (119.6) should reduce to zero not only M(p)
itself but also M{p) without the factor yp -m. A corresponding subtraction
removes entirely the second and third terms in the braces in (119.7).t The first
integral is transformed by incorporating a further integration (using (131.5)), taking
n = 2, a = k 2 - a2, and b = k2-a\. Then (119.7) becomes
t
x
167TÎ
(2TT)4
i
2
i
f ,4i f J f A (7P + m)[2m - YP(1-X)]X(1-JC)
J
J
J
[k - o o + (p -rrr)x(l-x)zr
0
0
t Thus, in the process of renormalization "en route" (§110) we omit corrections to the renormalization constant Zi. The corresponding integrals are logarithmically divergent. If we use a "cut-off
parameter" A 2 >m 2 , p2, and limit the region of integration over d k by the condition k2*sA\ the
correction can be explicitly calculated; the result is
z,. 1 + 2 n
z r =-i[i l o g ^ + ,„ g i; + ?].
„„.T.,
526
§ 119
Radiative Corrections
where we have also used the identity p2- m2 = (yp - m)(yp + m). The integration
over d4k is immediate: assuming that p 2 - m 2 < 0 and using (131.14), we have
WK
27T J
0
m x + Az(l - x ) + (m -pO*(l - x ) z
J
0
We now need to subtract a similar integral with the substitution (119.6), omitting
temporarily the factor yp - m; after some simple calculations, we get
i
2
2
M(p) =
(yp-m) ^jdxfdz
x
mO-x_») - (W_ + m)(l -Xf[l
L
*
T vA/wj J
~J$$g
MIOft*
m X + (W ~~ P )(1 ~~ *)Z
(the term in A2 is omitted from the common denominator, since this causes no
divergence in the present case; elsewhere, A 2 (l-x) is replaced by A2, since the
infra-red divergence will correspond to the divergence as x -» 0).
The integration in (119.8) (first over dz and then over dx) is fairly lengthy but
elementary, and the result is
M(p)
- ÛH(w - m*[w=7) (' _ T%log ") ~
with
p = (m2-p2)lm2
(R. Karplus and N. M. Kroll, 1950). The integral has been calculated on the
assumption that p > 0 and p > Kim. In accordance with the rule for passing round
poles, in the analytical continuation of (119.9) into the region p < 0 the phase of the
logarithm is found by making m-nn- iO; then p^*p- i0, and log p must be taken
f or p < 0 as
logp = log|p|-iir,
p<0.
Let us now consider how the mass operator behaves when p2>m2.
- p ** p2lm2 > l, and with logarithmic accuracy we have
M{p) =
(H9.10)
Then
-[<S-\p)-G-\p)\
~^(7p)log^.
(Il9.ll)
§119
Calculation of the Mass Operator
527
As in the case of the photon propagator (cf. formulae (113.15) and (113.16) for the
polarization operator), the correction to G 1 is small only when the energy is so
small that
47T
m
In the present case, however, the logarithmic increase is in a certain sense
fictitious, and can be eliminated by a suitable choice of gauge, i.e. of the function
D (n in the photon propagator (L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov, 1954). To achieve this, we put (in the notation of §103)
D (0 = 0,
(119.12)
whereas formula (119.9) has been derived with the gauge
D{i) = D.
(119.13)
This property of the gauge (119.12) makes it especially suitable for investigating the
theory when p2> m2, as we shall do in §132.
To prove the result stated, we note that, if only terms in e2 are concerned, the
transformation from the gauge (119.13) to (119.12) can be regarded as infinitesimal.
Accordingly, we can apply immediately the formula (105.14), with
d(0(q) = -D/q 2 =-47r/(q 2 ) 2 ,
and also replace ^ in the integrand by G with the necessary accuracy. In the
integral over d4q, the important range is q > p, for which G{p - q) in the integrand
is much less than G(p) and can be neglected. Then
SV = =
G^p^ip)
-ie2G-l{p)fdm{q)d4ql{2<n)4.
Finally, with the transformation (131.11) and (131.12), we have
«r'(p)~£o-'(p)Jfi^P
where A is an upper limit, at which the divergence is removed by renormalization;
the renormalization consists in subtracting the same expression with p2s*m2,
giving finally
This just cancels the difference 9~' - G"1 in (119.11).
528
Radiative Corrections
§119
Lastly, let us consider why it is necessary to use a finite "photon mass" A in
regularizing the integral (119.2), which is closely related to its behaviour when
p2-*m2. Firstly, this integral itself is finite when p2 = m2 and A = 0; to exclude the
divergence for large k, which is here unimportant, we assume that the integral is
taken over a large but finite region of k-space. The need to use A arises in the
subtraction of the re normalization integral, which would otherwise diverge at
p2 = m2. Let us therefore ascertain how the non-regularized mass operator would
behave as p2-> m2. Since this behaviour depends essentially on the choice of gauge,
we shall consider the general case of an arbitrary gauge, whereas the integral
(119.2) has been written for the specific gauge (119.13).
We again apply the transformation (105.14). Writing
dlt)(q)=8D{l)lq2
= ^8a(q2),
(119.14)
we assume that 8a is the variation of a function a(q2) which changes appreciably
only over intervals q2~m2 and is finite when q2 ** m2. In the integrand on the right
of (105.14), the two terms in the difference ^(p) - ^(p - q) are almost equal when q
is small, and the integral converges. For small q,
and so *S(p - q) may be neglected in comparison with <3(p) when q
The integral
>(p2-m2)lm.
ô<3(p) = i e 2 < S ( p ) J d % i ) ^
diverges logarithmically in the range
(p 2 -m 2 ) 2 ^ .
m
2
We therefore have, with logarithmic accuracy,
_ = -_ôa(m2)log?rr^.
This can be integrated as follows. When a = e 2 -»0, the exact propagator ^ must
be the same as the free-particle propagator G, and therefore
1
/
m2
\*<c--oW»
§120
Emission of Soft Photons with Non-zero Mass
529
where ao = a{m2) and C is a constant. To determine C, we compare the expression
^ p ) = (yp-m)[l+^(C-a0)logp],
(119.16)
which is obtained from (119.15) in the first approximation with respect to a, and the
corresponding expression given by the integral (119.2) when A = 0:t
<r l (p) = ( 7 P - " 0 [ l + f logp].
(119.17)
According to the definition (119.14), the function a{q2) is equal to the ratio D{,)ID.
The gauge (119.13) to which (119.17) belongs therefore corresponds to a = ao=\.
Equating (119.16) and (119.17) for this value of a0, we find C = 3.
Thus we have finally as the limiting (infra-red asymptotic) expression for the
unrenormalized electron propagator with p2->m2
*O>=Ï^(F^P)
(,1918)
(A. A. Abrikosov, 1955). The validity of this formula depends only on the
inequalities a <^ 1, | \ogp\> 1, whereas the formulae of perturbation theory would
require also that a| logp|/27r <? 1. The sign of the difference p 2 - m 2 is also unimportant here, since the imaginary part of (119.18) is in any case beyond the limits of
its accuracy.
The renormalized propagator must have a simple pole when p2 = m2. We see
that (119.18) satisfies this condition only in the gauge where
D (0 = 3D
(119.19)
(so that a0 = 3). Then the regularization of the Feynman integral (in order to
prevent its divergence at the upper limits) will not require the use of a finite
"photon mass". In other gauges, the zero mass of the photon produces a branch
point instead of a simple pole at p 2 = m2, and the finite parameter A is needed in
order to remove this "defect".
§ 120. Emission of soft photons with non-zero mass
In calculating the electron form factors in §117, we encountered a divergence of
the integrals when the frequencies of the virtual photons are small. This divergence
is closely related to the infra-red catastrophe discussed in §98, where it was pointed
out that the cross-section for any process involving charged particles (including the
t It is not necessary to repeat the calculations in order to derive (119.17). The term in logp in (119.9)
is obtained on the assumption that p > A, which allows the limit À ->0 to be taken. The term in log (A/m)
arises from the subtraction of the renormalization integral, and is not present in the original integral
(119.2). The subtraction is easily seen to have no effect on the logp terms.
530
§120
Radiative Corrections
scattering of electrons by an external field, represented by a diagram such as
(117.1)) has no significance in itself, but only when the simultaneous emission of
any number of soft photons is taken into account. It will be shown in §122 that all
the divergences cancel in the total cross-section, which includes the emission of
soft quanta. Here, of course, in order to obtain the correct result it is necessary for
the initial "cut-off" of the divergent integrals to be taken in the same manner in all
the cross-sections in the sum.
In §117, this cut-off was applied by means of a fictitious finite mass A of the
virtual photon. We must therefore now modify the formulae of §98 in such a way
that they describe the emission of soft "photons" with non-zero mass.
Formally, such a photon is a "vector" particle with spin 1, whose free field has
been discussed in §14. Such particles are described by a 4-vector (//-operator
fa = V(4TT) g ^ ^
(cka e{? e- f a + cL «<;>► «*"),
a = 1,2, 3;
(120.1)
the notation and normalization differ from that in (14.16), in order to bring them
into line with the photon case.
The interaction of the "photons" (120.1) with electrons is to be described by a
Lagrangian of the same form as for true photons:
(120.2)
-eFfa,
the potential A^ being replaced by fa. Then the amplitudes for the processes of
emission of photons with finite mass will be given by the usual rules of the diagram
technique, the only differences being that
k2 = A2
(120.3)
and that the summation over the polarizations of the emitted photon must be taken
over three independent polarizations (two transverse and one longitudinal) instead
of two as for the ordinary photon.This is equivalent to averaging with respect to
the density matrix of unpolarized particles
P^ = - i ( g „ , - ^ )
(120.4)
(cf. (14.15)), followed by multiplication by 3.
The propagator for "photons with non-zero mass" is
n
4TT
/
kuK\
(cf. (76.18)), but in the case of gauge invariance the amplitudes of real scattering
processes do not depend on the longitudinal part of the photon propagator, and this
property is not the result of the specific form of its transverse part. The second
term in the parentheses can therefore be omitted, leaving an expression of the same
§120
Emission of Soft Photons with Non-zero Mass
531
type as for ordinary photons:
*V = ki*kî8i™
(120.5)
as has been used in §§117 and 119.
Let us now consider the emission of soft photons (in the sense explained in
§98). The derivation of (98.5) and (98.6) can be applied to the present case, the only
difference being that the term k2 = A2 is added in expanding the squares (p ± k)2 in
the denominators of the electron propagators. We thus have, instead of (98.6),
,
.
Pe
I2 d3k
z
pk-\ l2\ 4TT1Ù)
Pe
\p'k + ku
2|
where dcrt\ is the cross-section for the same process without emission of a soft
quantum, which we shall conventionally call an "elastic process". In the integrations over d3Jc, the important range is |k| — A. Then p'k ~~ pk> A2, so that the
terms in A2 in the denominators may be neglected. The summation over polarizations of the photon is carried out by means of (120.4), as already described. When
the approximation stated is used, the second term in (120.4) makes no contribution
to the cross-section, and there remains!
* , -*"" , (Ä)-(fe)TO
(i2o 6)
-
We thus recover formula (98.7), but <o must now be taken as
û> = V(k2 + A2).
(120.7)
Formula (120.6) is completely general: it is applicable to both elastic and
inelastic scattering and even when the type of particle changes. The result of the
further integration over d3k depends on the 4-vectors p and p', i.e. on the nature of
the basic scattering process.
Let us take the case of elastic scattering, for which
IPIHP'I.
* = *'>
and determine the total probability of photon emission with a frequency less than
some o)max, with the assumption that
A<3ow
(120.8)
and that o w is subject to an upper limit governed by the conditions (98.9) and
(98.10) for the theory of soft-photon emission to be valid. We first calculate the
t There may be some doubt at first as to the validity of neglecting A2 before averaging, since it
occurs in the denominator of the second term in (120.4), but we can easily show directly that this term
gives, on averaging, a contribution ~ A4 • 1/A2, which is negligible.
532
§120
Radiative Corrections
integral over dik in the non-relativistic limit. For |p| = |p'| <^ m,
(_VL
\(p'k)
P V-,(q;fc) 2
q2,
~mT^r
mW'
(pk))
where q = p' - p. Integration over the directions of k gives
/ k2
W
A
Then, from (120.6),
da = dore,
'3 /o [1~3(k^Ai)](k^P
or, after integration with the assumption that a>max/A > 1,
dcr = d<xel | f ^ ( l o g ^ - f ) ,
q 2 «m 2 .
(120.9)
In the general relativistic case, the integral is calculated by means of (131.4).
The angle integral is then
i
1 =
J (pk)(p'k) = J dx J [(pk)x + (p'k)(l - x)]2
0
or, expanding the scalar products with p = (e, p), p' = (e, p'),
i
7 =
J dX J {e<o-k-[px + p'(l-x)]} 2 '
o
The inner integral is now easily calculated in spherical coordinates with the polar
axis along the vector px + p'(l - x), giving
i
J(ea>)2-[px + p'(l-x)] 2 k 2
0
1
- f
Airdx
" J [m2 + q 2 x(l-x)]k 2 + s2A2'
o
The other two integrals, with (pk)2 and (p'k)2 in the denominators, are derived from
§120
533
Emission of Soft Photons with Non-zero Mass
this by putting q = 0. Using also the formula
pp' = e 2 - p - p ' = m2 + iq2,
we get
. _ 2e2 f . f
k2d|k| f
m2 + W
m2
}
dx
2
2
2
2
2
2 2
2 2
"V)
J V(k +A )l[m + q x(l-x)]k +e A "m k +e 2 A 2 jo
o
d(T
(120.10)
The integration over d|k| calls for the calculation of integrals having the form
w
max
r
fc
y\k\
J (ak2 + A2)V(k2 + A2)
«max
o
„
a J V(k2 + A2)
«"max
a J (ak2 + A2)V(k2 + A2)
0
0
1 iA2fr)max 1 f
dz
a J (az 2 +l)V(z 2 +l)'
a g A
o
In the second integral we have put Az for |k| and replaced the upper limit a>max/A by
infinity; this is permissible, since the integral converges.
The integrals over dx which then occur in (120.10) cannot be expressed entirely
in terms of elementary functions. The result may be written in the form
da = a [ F ( M ) log ^+Fi]d<rtu
(120.11)
wheret
lQ
g <* + V<£2 + *»" l]>
F( j) = \ [^fjlx)
(12012>
_,
2e , e +1P1
^=-7-7 log—-uy7T|p|
m
i
2mz.2 +. ~q22 f
dx
— ^ J ^Vâ^)
l0g
0
l + V(l-q)
Va
'
m f t l
„
(12013)
a = A l m 2 + q 2 x(l-x)].
t The function F(£) has already occurred in §98, Problems. This is not surprising, since (120.11) can
be derived with logarithmic accuracy by integrating the cross-section (98.8) for the emission of
zero-mass photons over du> from A to o>mM.
If £ is replaced by a variable 0 such that £ = sinh 50, then
F(0) = (2M(0 coth 0 - 1).
(120.12a)
534
§121
Radiative Corrections
An asymptotic expression for the cross-section in the ultra-relativistic case can
be obtained, assuming not only that e > m but also that |q| > m, i.e. that the
scattering angle is not very small. Then the important range of values of x in the
integral in (120.13) is that for which a < 1; the appropriate approximations give
LTTE }
a
o
Ä
f log ( q V ) + log x -Hog (1 - x)
J_
dx
The integral is to be cut off at a ~ 1, i.e. at x~m 2 /q 2 at the lower limit and
1 - x ~ m2/q2 at the upper limit. Then
F, - ±- [2 log ^ log -^ - log2 -^1
2TT
I
e m
m}
= ^ r i o g 2 - ^ - 4 1 oeg ^ l o g i > l .
2ir L m
m
m]
This formula is valid as far as the squares of the logarithms (with doublelogarithmic accuracy). To the same accuracy, it is sufficient to put in the first term
in (120.11)
F(0~(4/7r)log£
(Ç>\).
The final result is
da = ^ [log - ^ log ^ p - log ^ log
^ + \ log2 - ^ 1 d<r<uM q2>m\
IT 1 * m
\
m em
4 ° mj
(120.14)
§ 121. Electron scattering in an external field in the
second Born approximation
In the first two approximations with respect to the external field, the scattering
of an electron is represented by the diagrams
|q
1
i f p- 1 1
[
/
P'
\
I
fq,=f-p
(121.1)
r
The first of these corresponds to the amplitude M (,) ~ Ze2 considered in §80. The
amplitude of the second approximation is M<2) ~ (Ze2)2.
The Second Born Approximation
§121
535
It is easily seen that terms of the same order as this arise from the radiative
corrections. In the third order of perturbation theory, the radiative corrections to
the scattering amplitude are represented by the diagrams
(121.2)
Here M(3) ~ Ze2 • e\ and M(3) ~ M(2) (if Z ~ 1).
According to (64.26), the scattering cross-section is
da = |M}!> + Mff+ Mf\2 do'/16ir2.
(121.3)
In the squared amplitude we can retain not only |M}Pj2 but also the interference
terms between M}P and M\f and between M}!* and Mft. Thus the cross-section is
given, as far as the terms in e\ by the sum
do- = d<riU + do-i2) + d<Tr*i,
(121.4)
where dcr{l) is the cross-section in the first Born approximation (§80), and the
corrections are
da(2) = 2 re M}J> Mff* do'/16ir 2 ,|
From §80,
d<rrad = 2 re M}!> M g* do7167r2. J
Miï = \e\(û'y°umq),
(12L5)
(121.6)
where <ï>(q) is a Fourier component of the scalar potential of the constant external
field (<I> = Ao}), and we have used the fact that the electron charge e = -\e\.
The two expressions (121.5) can evidently be calculated independently. The first
will now be discussed, and the second in §122.
The second-approximation amplitude is given by the diagram (121.1) as the
intégrait
Mf = - e2 j { f l l p V ^ ^ j o A(P)}*(P' - WH ~ P) 0p.
The "4-momenta" <?i = / - p and q2 = p'-f
(121.7)
of the constant external field have no
t Here it is necessary to apply the diagram-technique rule concerning a constant external field; see
rule 8 in §77.
§121
Radiative Corrections
536
time components. Hence
/ o = e = e\
(121.8)
where e and e' are the initial and final electron energies, which in elastic scattering
are the same.
In the purely Coulomb field of a stationary charge Z\e\,
4>(q) = 47rZH/q 2 .
For this potential the integral (121.7) is logarithmically divergent (when f — p and
f~p'). This divergence is specific to the Coulomb field, and arises from the
slowness with which the field decreases at large distances. Its origin is most easily
shown for the non-relativistic case. According to QM (135.8), the coefficient of the
spherical wave el|p|r/r in the asymptotic expression for the wave function of an
electron in a Coulomb field is
/(Ö) exp ( - î -H3p log |p| r j .
This coefficient is also the electron scattering amplitude in the field, and we see that
its phase includes a term which diverges as r -> ». When the scattering amplitude is
expanded in powers of Zay this term causes divergence of all the terms in the
expansion from the second term onwards (since the function /(Ö) is proportional to
Za). In the relativistic case there is, of course, a similar situation.
These arguments also show that the divergent terms must cancel when we
calculate the scattering cross-section, in which the phase of the amplitude is
unimportant. The simplest procedure for a correct calculation is to consider first
the scattering in a screened Coulomb field, putting
cD(q) = 47rZ|e|/(q2 + S2)
(121.9)
with a small screening constant 8 <^ |p|. This eliminates the divergence in the
scattering amplitude, and we can then put 8 = 0 in the final formula for the
cross-section.
Substituting (121.9) in (121.7), we have
Mf = - - Z2a2ü(p')[(y°e
+ m)J, + y • J] ii(p),
•JT
with the notation
J[(p'-f) 2 + « 2 ][<f-p) 2 +8 2 ][p 2 -f 2 +«or
J
!
!
!
5
!
- J w - f) + s K(i -'!?+ s ][p - f + iO] "
(121.10)
kp + p,) h
-
The Second Born Approximation
§121
537
p2 = e2 - m2 = p'2, and the integral J is symmetrical in p and p'; from considerations
of vector symmetry, it is immediately obvious that the vector J must be parallel to
p + p\ Now, eliminating the matrices 7 by means of the equations
y • pw = (y°e - m)u,
ü'y
p' = ö'(y°e - m ) ,
we obtain
Mf = - - Z2a2ü(p')[y°e(J{
+ J2) + m(Jx - J2)} u(p).
(121.11)
In order to continue the calculations, we change (as in §80) from the bispinor
amplitudes u and u' to the three-dimensional spinors w and w' which correspond to
them in accordance with (23.9) and (23.11). A direct multiplication gives
ü'u = W'*{(E + m) - (e - m) cos 6 + i v • a(e
- m) sin 0} w,
ü'y°u = w'*{(e + m) + (e - m) cos 0 - i v • <r(e - m) sin 0} w,
where v = nxn'/sin0, n = p/|p|, n'= p'/|p'|, cos 0 = n • n\ Then the amplitude
(121.11) may be writtent
2
U) a. D<
r A (2)
M)V = 4TTW*/ *(A
+ B U)> 1/ • a)
A(2) = - i
w,
Z2a2{[(e + m) + (e - m) cos 6] e(J] + J2) +
(121.12)
4- [(e + m) - (e - m) cos 0] m(Ji - J2)},
B(2) = ^
Z 2 a 2 (e - m) sin 0[e(J, + J2) - m(J, - J2)}.
The first-approximation scattering amplitude is, in corresponding notation,
M}}> = 4TTW'*(A(1) + B (,)
va)w,
A(1) = ^ [ ( £ + m) + ( £ - m) cos 0], [
(121.13)
B(,) = - i ^ ( e - m ) s i n 0 ,
q
where q = p' - p.
The scattering cross-section and the polarization effects are expressed in terms
of the quantities A = A(1) + A(2) and B = B(1) + B(2) by the formulae derived in QM,
t The definition of A and B is as in §37 and in QM, §140, and differs by a factor from that in §80.
538
Radiative Corrections
§121
§140.
For example, the scattering cross-section for unpolarized electrons is
do- = (|A| 2 +|B| 2 )do'
« da0) + 2(A m re A(2> - iB0) im B ,2) ) do'.
Substitution of (121.12) and (121.13) and straightforward calculation gives
daa)
= - do'
?ia.\\a
7T p sin 2"
[(1 - v2 sin2 i0) re (J, + J2) + (m 2 /e 2 ) re (J, - J2)],
(121.14)
where v = p/e is the electron velocity and 6 the scattering angle. The electrons are
polarized by scattering, and the polarization vector of the final electrons is
,,_2re(AB*)
^ " IAMBI2"
, 2(A(I) re B ( 2 ) -iB ( 1 ) im A(2))
|A ( , f+|B ( , ) | 2
"
or, substituting (121.12) and (121.13),
4Zamp 4 sin 3 J0cosj0 .
„,
t1
T.
,n.
1(>
Let us now calculate the integrals J\ and J2. This is more easily done by using
the parametrization method (131.2). The integral J\ then becomes
d 23 /dg,2 dfr dfc 8(12- g, - fe- 2fe) 2
r _ _ 9 ((((
Jl
^ J J J J {[(p'-1) +s ]i.+[(p-o +«%+[f -p - m t r
0 0 0
The integration over d£3 eliminates the delta function, and reduction of the
denominator then gives
r = _ ? f V*2 f
1
d'fdtxdk
J J J {Ô2(£, + &) + p2(2£, + 2& - 1) - 21 • (£,p' + &p) + «2 - i0}3'
0
0
Using in place of f a new variable k = f- £ip' - | 2 p , we can reduce the integration
over d 3 / to the form
f
d[k
) (k2-a2-iW
so that
j
=
1
_..
21T
2(
i
\-(2
f
_
l
IT2
4<?'
d& d 6
„
J J {p2(£2 + & ~ Ki - 2fc + 1) + 2 * 6 P • p' - ô2(£, +fe)- iO}3'2'
§121
539
The Second Born Approximation
Instead of £1 and £2 we use the symmetrical combinations x = £i + £2> y = £1 - &•
The integration over dy from 0 to x is elementary, and gives
i
j _
l
"
Î7T2 f
Xdx
r___^_x
2[^p J [bx2 - 2x + 1 - (S2/p2)* ~ <0][(1 - x) 2 - (S2/p2)* ~ *0]1/2'
o
where
b = ( p 2 + p-p')/2p 2
To calculate the integral over dx as 5->0, we divide the range of integration
into two parts:
i
î-ô,
f -dx= f
0
i
•dx+
0
[ ■ -dx, ls>S,s>S/|p|.
1-6,
In the first integral we can put 0 = 0 ; thent
1-5,
1-8,
dx =
/
loe
■
0
2(r=T)[ bx>-2~xll-io] 0
In the second integral we can put x = 1 everywhere except in the term (1 - x) 2 and
8 = 0 in the first bracket in the denominator. Then*
J
-«,
'dx-
i-bj
0
[x'*-(«V)-iO] w
«I
=
'yh[j
+i
S/IPI
[x-»-(»W>+
/ [(«V>-*'=]'"]
0
t The term iO arises from the rule for avoiding the singularity, which gives the change in the argument of
the logarithm between 0 and 1 7- 5i, namely from 0 to - IT as we pass below the branch point.
X Here again the singularity avoidance rule gives the sign of the square root as we go from positive to
negative values of the radicand.
540
§122
Radiative Corrections
On adding the two integrals, ô\ disappears, as it should, leaving
Ji =
OT,0g(Tsin4
<12116>
2|p|
The integral J2 is calculated similarly:
j
7T3(l~sin2Ö)
T
J; = J
Î7T2
J
.
'-4|pl cosM»sinjfl-2|ppcos'J9'
.
06S
1,,
i9
"' -
, _ ,
_
(12U7)
We now have only to substitute these expressions in (121.14) and (121.15),
obtaining as the final results
*
2Zam\r\ sin'|6 log sin je
e
(1 - IT sin2 50) cos 50
(W. A. McKinley and H. Feshbach, 1948; R. H. Dalitz, 1950).
In the first Born approximation, the electron and positron scattering crosssections are the same (in the same external field); in the second approximation, this
symmetry does not occur. In the scattering of a positron (charge + |e|) the
amplitude of the first approximation (121.6) has the opposite sign, but the sign of
M f is unchanged. The cross-section dcr(2), which is the interference term between
M}P and A#J?\ therefore changes sign. The same occurs for the expression (121.19)
for the polarization vector. The formulae for electron scattering are all converted
to those for positron scattering by the formal change Z-+-Z.
§ 122. Radiative corrections to electron scattering in an
external field
Let us now calculate the radiative corrections to electron scattering in an
external field (J. Schwinger, 1949). The corresponding part of the scattering
amplitude is represented by the two diagrams (121.2). The contribution from the
first of these to the amplitude is
-(ö>0u)^T^D(-q2)e4>(q),
where 0*(-q2) is the polarization operator corresponding to the loop in the
diagram. The contribution from the second diagram is
-(fi'A°ii)«*(q),
where A0 is the correction term in the vertex operator ( P = yß + AM); according to
§122
Radiative Corrections to Electron Scattering
541
(116.6),
A° = 7°[/( -&-^-^x°*V
**( - &•
Adding the two contributions, we havet
J _ ~/ _ ~2x
2
Qrad(q) = / ( - qJ\2 )_- 1l -_j J_
^ (Ob(
- q _2 )~2x+ ,2^g<-"q
)q-y.
(122.1)
Let us first consider the infra-red divergence in the form factor / ( - q 2 ) and
therefore in the scattering amplitude (122.1). It has already been mentioned in §98
that the exact value of the purely elastic scattering amplitude is zero, i.e. it has no
meaning. The only physically significant thing is the amplitude of scattering defined
as a process in which any number of soft photons can be emitted, each having an
energy less than a specified value û>max satisfying the conditions for soft-photon
emission theory to be valid. That is, only the sum
w
max
<*>max
da = dore) + dcrei I dww + dat\ • ^
0
w
mix
I dwW| I dw^ + • • •
0
(122.2)
0
is meaningful, where dac\ is the cross-section for scattering without emission of
photons, and dw^ the differential probability for the emission by the electron of a
photon with frequency co. Here it is assumed that dat\ itself is calculated as a
perturbation-theory series, i.e. as an expansion in powers of a.t Then, on bringing
together the terms of each order in a in (122.2), we obtain da as an expansion in
powers of a in which each term is finite.
In the first Born approximation, dae\~~a2. This term has, of course, an independent significance. If, however, the next correction (~~a3) to dat\ is to be
taken into account, we must also include the second term in the sum (122.2): since
dw» ~~ a, multiplication by dae\ — a2 likewise gives a quantity ~ a \ We shall show
that the infra-red divergence disappears when these two quantities are added.
The divergent term in the form factor / (117.17) is§
-JaF(|q|/2m)log(m/A).
The corresponding term in the amplitude (122.1) is
\<xF log (ml\)(û'y°u)
1
e<ï>(q),
0i
t Note that q„ = (0, - q) if q* = (0, q), and therefore a q, = - y°q • y.
t The need to take account of radiative corrections in the probability dww is governed by the value
of ü)max; the limit <o -+0 corresponds to the classical case where the radiative corrections are zero, and so
the latter can always be made small by taking a sufficiently small o>max.
§ This expression is easily verified by using the relation
|q|/m=(l-0/V£
between |q| and the variable £ in terms of which (117.17) is written.
542
§122
Radiative Corrections
and in the cross-section (121.5)
dainfra = -aF
log (m/A)|fi'7°u|2|e*(q)|2 do'/167r2.
Comparing this with the Born cross-section
da{]) = \ü'y«u\2\e<P(q)\2 do'l\6Tr\
we find that
dainfra = - aF log (m/À) da (,) .
(122.3)
The second term in (122.2), with / dw» from (120.11), gives
w
max
do-ei f d w ^ a F log (2o>max/A)da(,).
o
(122.4)
Finally, adding (122.3) and (122.4), we obtain
- dor(,)aF(|q|/2m) log (m/2aw).
(122.5)
We see that the divergent contribution from soft (|k| ~~ A) virtual photons does
in fact cancel with that from the emission of real photons of the same kind. A
similar result occurs in any other scattering process.
There is also a dependence of the scattering cross-section on a)max, resulting
from the fact that a>max appears in the definition of scattering as a process in which
any number of soft photons can be emitted. The cross-section for such a process
will of course decrease with the upper frequency limit comax for photons whose
emission we regard as belonging to the scattering process in question.
Let us now determine the complete radiative correction to the scattering
cross-section. Proceeding in accordance with the standard rules (see (65.7)), we find
as the cross-section averaged over the polarizations of the initial electron and
summed over the polarizations of the final electron
da = dcr(1) + darad
= |e<D(q)|2 tr{(yp'+ m)(y° + Y°Qrad)(yp + m)(y°+ /Q r a d )} do'/327r2.
According to (122.1),
Qrad = a + b y • q,
Qrad = y°QÏ*ôy° = a - b y • q,
a=/(-q2)-l-^(-q2),
^=J^S(-^
As far as the terms linear in a and b, the trace in (122.6) is given by
i t r { - } = 2(e 2 ~iq 2 )(l + 2a)-2bmq 2 .
(122.6)
§122
Relative Corrections to Electron Scattering
543
Hence
darad = 2 [ / A ( - q 2 ) - l - ^
(122.7)
where da{U is the Born cross-section (80.5) for the scattering of unpolarized
electrons, and a subscript A is added to the form factor / in order to show explicitly
that it is cut off at photon mass A.
We now have only to add to (122.7) the cross-section for the emission of soft
photons. If we write /A in the form
/A( - q2) = 1 - \aF(\q\l2m) log (m/A) + aF2,
(122.8)
then from (120.11) this addition simply means replacing /A in (122.7) by
/aw = 1 - 5aF(|q|/2m) log (m/2aw) + {ccF, + aF2.
(122.9)
With this change, (122.7) gives the final answer.
In the non-relativistic limit we havet
The particular form of the external field appears in the radiative correction to
the cross-section only through d<r(1); the factor in the braces in (122.7) is universal.
In the non-relativistic approximation,
"^-"^■Mi^ïtyiy
"2«m2'
<i22">
which includes contributions from all the terms in (122.7). In the opposite (ultrarelativistic) limit, the main contribution comes only from the term in / Wmax - 1:
darad = - da{[) • — log - ^ log - £ - ,
TT
m
comax
q2 > m\
(122.12)
Finally, it may be noted that the radiative corrections considered here do not
cause any additional polarization effects that are not present in the first Born
approximation (unlike the corrections of the second Born approximation, discussed
in §121). The reason is that the particular features of the first Born approximation
are ultimately due to the fact that the S-matrix is Hermitian. This property is
maintained even when the radiative corrections described above are taken into
t This differs from the non-relativistic formula (117.20) by the change
log A -» log 2(umax - 6.
544
Radiative Corrections
§123
account, since in this approximation there are no real intermediate states in the
scattering channel (and so the right-hand side of the unitarity relation is zero).t
§123. Radiative shift of atomic levels
The radiative corrections cause a shift of the energy levels of bound states of an
electron in an external field, called the Lamb shift. The most interesting case of this
kind is that of a hydrogen atom (or hydrogen-like ion).$
A consistent method of finding the energy-level corrections is based on the use
of the exact electron propagator in an external field (§109). But, if
Za«U,
(123.1)
it is possible to use a simpler procedure in which the external field is regarded as a
perturbation.
In the first approximation with respect to the external field, the radiative
correction in the interaction between an electron and a constant electric field is
described by the two diagrams (121.2) already used in connection with the problem
of electron scattering in such a field; the change from one problem to the other
needs no more than a simple reformulation (see below).
However, it is easily seen that this treatment can give only the part of the level
shift that is due to the interaction with virtual photons of sufficiently high
frequency. Let us consider, for example, the next radiative correction (as regards
order with respect to the external field) to the electron scattering amplitude:
«
i
•
M
,
i
i
i
(123.2)
p
(unlike (121.2b), this diagram contains two external-field vertices). In the range of
integration over d4k where k0 is sufficiently large, this correction involves an extra
power of Za, and is therefore unimportant. But the addition of a second externalfield vertex to the diagram also brings in a further electron propagator G(/). When
t The calculation of the radiative corrections for processes which appear only in the second
approximation of perturbation theory is considerably more laborious, and will not be given here. We shall
simply list some references: L. M. Brown and R; P. Feynman, Physical Review 85, 231, 1952 (radiative
corrections to photon scattering by an electron); I. Harris and L. M. Brown, ibid, 105, 1656, 1957 (r.c. to a
two-photon pair-annihilation); M. L. G. Redhead, Proceedings of the Royal Society A220,219, 1953, and R.
V. Polovin, Soviet Physics JETP 4, 385, 1957 (r.c. to electron scattering by an electron or a positron); P. I.
Fomin, ibid. 8, 491, 1959 (r.c. to bremsstrahlung).
t The shift of the hydrogen levels was first calculated by H. A. Bethe (1947) with logarithmic accuracy,
using a non-relativistic treatment; this work provided the initial stimulus for the whole subsequent
development of quantum electrodynamics. The difference between the 2sm and 2pi/2 levels (in the first
non-vanishing approximation of perturbation theory) was exactly calculated by N. M. Kroll and W. E.
Lamb (1949); the complete formula for the level shift is due to V. F. Weisskopf and J. B. French (1949).
§123
Radiative Shift of Atomic Levels
545
k is small, and the free ends p and p' are non-relativistic, the important values of
the virtual-electron momenta / are those close to the pole of the propagator G(/).
The small denominator which thus occurs cancels the extra small factor Za. The
same evidently applies to the corrections of all orders with respect to the external
field. Thus, at low frequencies of the virtual photons, the external field must be
taken into account exactly.
We can divide the required level shiftt 8ES into two parts:
8ES = 8E{? + 8E™,
(123.3)
which originate from the interaction with virtual photons having frequencies in the
ranges (I) k0> K and (II) k0 < * ; * is chosen so that
{Zafm <K<m,
(123.4)
where Z2a2m is of the same order as the binding energy of the electron in the
atom. Then, in region I, it is sufficient to take account of the nuclear field in the first
approximation. In region II, the nuclear field must be treated exactly, but on the
other hand, since K<€m, we can solve the problem in the non-relativistic approximation—not only as regards the electron itself, but for all the intermediate
states*. With the condition (123.4), the ranges of validity of the two methods of
calculation overlap, and it is therefore possible to make an exact "joining" of the
two parts of the level correction.
T H E H I G H - F R E Q U E N C Y PART O F T H E SHIFT
Let us first consider region I. Here it is possible to use the correction (122.1) to
the scattering amplitude, after removing the contribution of the virtual photons
which pertain to region II. These make only a small contribution to the form factor
g, which therefore can be left unaltered. The low-frequency virtual photons make a
large contribution to /, however, because of the infra-red divergence. Thus / in
(122.1) must be taken as a function fK from which the region k0<K has been
excluded.
This could be done directly by subtracting from / the integral over the region
k0 < K, but the required result can be obtained without fresh calculations by using
the results of §122. To do so, we note that the exclusion of frequencies k0< K can
be regarded as one possible type of infra-red cut-off. The result for the correction
to the scattering cross-section must, of course, be independent of the cut-off used,
provided that the real soft photon emission probability is cut off in the same way,
i.e. the concept of "elastic" scattering includes the emission only of photons with
frequencies from K to the specified û>max. If we take a>max = K, there is no need to
take explicit account of the photon emission. Hence we see that /„ is obtained from
the /„,„,„ determined in §122 by simply replacing <omax by K. In particular, in the
t In this section, £, denotes the energy of an electron in an atom, not including its rest energy. The
suffix s stands for all the quantum numbers which define the state of the atom.
546
§123
Radiative Corrections
non-relativistic case
/
- -
|
- - 3 ^ (
t o 8
S
+
24)-
(123 5)
-
Let us now transform the correction (122.1) to the scattering amplitude by
representing it as the result of a corresponding correction to the effective potential
energy of the electron in the field. Comparing the amplitude (122.1)
-e(u'*Qrad<I>u)
with the Born scattering amplitude (121.6)
- e(u'*$>u),
we see that the correction is given (in the momentum representation) by the
function
eö(D(q) = 6Qrad(q)0(q).
(123.6)
In the non-relativistic case, taking 9 and g from (113.14) and (117.20), and
substituting fK from (123.5) for /, we get
^■(■ia(''Mi-l) t Ê'''K
(1237)
The corresponding function S4>(r) in the coordinate representation ist
^«^(^Mi-il^-'Ä^^
«23-8>
The level shift 8E(5l) is found by averaging eÔ<ï>(r) over the wave function of the
unperturbed state of the electron in the atom, i.e. as the corresponding diagonal
matrix element:^
8E[l) =
3irm
( l0 s£ + iH)< s i A *i s >-
-'J^<s|yV*|s>.
(123.9)
In the first term, the non-relativistic electron function suffices for the averaging.
t Note that this correction to the potential is not the same as the one discussed in §114, which
included only the effect of the vacuum polarization (diagram (121.2a)) on the Coulomb field as such. The
correction (123.8) relates to the interaction of the field with the electron, and includes also the effect of a
change in the motion of the electron (diagram (121.2b)).
t Strictly speaking, the form factors determined in §117 related to the vertex operator with two
external electron lines (p2 = p'2 = m2). For an electron in an atom, the energy Es is a level which is
unrelated to p. The distinction may, however, be neglected in region I.
Radiative Shift of Atomic Levels
§123
547
In the second term, this approximation is insufficient: the zero-order approximation
with respect to the non-relativistic functions is zero on account of the absence of
diagonal elements in the matrices y. Here, therefore, we must use the approximate
relativistic function
*
-(!)
derived in §33, retaining the components \ which are small (in the standard
representation). We have
and, substituting from (33.4)
we get, using the identity (33.5) and integrating by parts,
(s\y • V0>|s) = - ^ j - f {4>*(<r • V$)(<r ■ V<j>) + (V<f>* • <r)(a • V<t>)<f>} d'x
= Y~\ {<!>*£<& ' <t> - 2i<r • 4>*[V4> x V<f>]} d3x.
Since <î> = <ï>(r),
_.
rd<I>
r dr
and hence
-icr-[V4>xV] = - ^ « r - î ,
r dr
where 1 = - ir x V is the orbital angular momentum operator. Finally, bringing
together the expressions obtained and substituting in (123.9), we have
SEf = T ^ ( l o g ^ L + ^V$|A4>|s) + r ^ ( s | < r - l - ^ | 5 \ ,
A-nm \ \
r dr \ /
3irm \
2K 30/ ' '
(123.10)
in which the averaging is over the non-relativistic wave function in both terms.
THE LOW-FREQUENCY PART OF THE SHIFT
In order to calculate the second part of the level shift, we use a technique based
ultimately on the unitarity condition.
Since a photon can be emitted, the excited state of the atom is not strictly
548
§123
Radiative Corrections
stationary, but only quasi-stationary. A complex energy value can be assigned to
such a state, its imaginary part being - 2 w if w is the decay probability of the state,
in this case the total photon emission probability (see QM, §134). In the nonrelativistic approximation there is dipole radiation, and from (45.7)
im 8ES = - 1 ws = - i 2 K\2(Es ~ E,)\
s'
where the summation is over all the lower levels (Es>< E5), or, equivalently,
im 8ES = -I$ da • Ç |d„.|2(E, - EM.?6{EM - Es - a>).
o
(123.11)
In order to find the real part of 6E5, we must regard Es as a complex variable
and use analytical continuation. This may be done by treating the delta functions as
originating from poles. The rule for the avoidance of poles is, as usual, specified by
adding a negative imaginary part to the masses of the virtual particles—in this case,
to the masses ms> of the electron in the intermediate states of the atom. These are
m5 = m + Es, and so we must put
Es ->E5 -iO,
whence
8(E i -E I .-ai) = - ^ i m p
F
*
/
. m;
(123.12)
cf. (111.3).
Thus, substituting (123.12) in (123.11), we find
(E
• 2' | d' „E,f p- E
^~ Es)\ aim SE, = im 3ir
^ - Jf da> 7*
s - w + lO
o
The required analytical continuation is now obtained by simply omitting the symbol
im, but we must take from 8ES only the part due to the contribution from
frequencies in region II (o> < K). TO do so, we need only replace the upper limit of
integration by K. The result of the integration is
*S?D = 3^~ ? K?(Es' - Es)3 l o g £s,_Es + iO;
(l23l3)
because of the inequality (123.4), the difference Es - E5> is neglected in comparison
with K at the upper limit. We shall henceforward be concerned only with the real
part of the level, which is obtained by using KI\ES■- Es\ as the argument of the
logarithm in (123.13).
§123
Radiative
Shift of Atomic Levels
549
In the expression (123.13), the term in log K can be transformed by replacing the
matrix elements of the dipole moment d = er by those of the momentum p = mv
and its derivative p:
2 |dss|2(ßs- - Esy = - A 2 WftE,. - E.)
s'
"*
s'
Now replacing p in accordance with the operator equation of motion of
the electron, p = - eV3>, we get
2J |d„.| 3 (E,. - E s ) 3 = - ^
2 {(V<P)SS- • p , , - p ss • (V*),.,}
= ^<*|p-V<î>-V<ï>.p|s>
= 2^5<s|A<P|5).
(123.14)
We can therefore write instead of (123.13)
5TTYY\
m
+ 3 7 S kn'ftfi- - g^)3 Jog 2 | E , " E,.|THE
(12315)
T O T A L SHIFT
Finally, adding the two parts, w e have the following formula for the level shift:
8E
>
=
37?
,
+
lr
" | 2 ( E j '"
Ej)3l0g
2|E,-E,j+
e3 1 9 . | A . , ,
<?3 / I , 1 d4> I \
<s|A
s) =
3^'3Ô
^
4^\T',r^|S/;
„ - , ,.,
'
(123 16)
as was to be expected, the auxiliary quantity K does not appear.!
All the matrix elements in (123.16) are taken with respect to the non-relativistic
wave functions of the electron in the atom. For a hydrogen atom or a hydrogen-like
ion, these functions depend only on three quantum numbers: the principal quantum
number n, the orbital angular momentum I and its component m, but not on the
t The determination of the next-order corrections in the level shift involves very complicated
calculations. The most complete tabulation and a systematic derivation of the corrections, together with
further references, are given by G. W. Erickson and D. R. Yennie, Annals of Physics 35, 271, 447, 1965.
550
§123
Radiative Corrections
total angular momentum j ; the corresponding energy levels depend on n only. We
shall use the notation^
L„, = 2mfZe^ Em, \<nTm'\r\nlm)\\En. - Enf • log 2 | E ^ % | -
(123.17)
The energy levels are proportional to (Ze2)2, and the characteristic dimension of the
atom is proportional to Ze2, so that the Lnt defined by (123.17) are independent of
Z. They can be calculated numerically.
We shall take separately the cases / = 0 and 1^0. When I = 0, the last term in
(123.16^ is zero. In the second term, we use the equation
eAQ = 4TTZe2Ô(r),
which is satisfied by the potential of the Coulomb field of the nucleus. Hence
<n/m|A4>|nlm> = 47rZe2|^,m(0)|2
_ r4m3(Ze2)V3
(1=0)
~U
(1^0)
(cf. (34.3)). In the first term, with the notation (123.17) and again using (123.14),
£
Kn'l'm'|r|n00>|2(E„. - E„)3 = ^<n00|A*|n00>
= 2m(Ze2)*ln\
This gives the following expression for the shift of the s terms (in ordinary units):
The numerical values of some of the Ln0 are:
n
1
L„o -2.984
2
-2.812
3
-2.768
4
-2.750
oo
-2.721
The unperturbed levels are En = - mc2(Za)2/2n2, and so the relative magnitude of
the radiative shift is
\8EJEn0\ ~ Z V log (1/Za).
(123.19)
When 17*0, the second term in (123.16) is zero. The third term can be calculated
t The matrix elements of r are diagonal in j and independent of j; the summation over s in (123.16)
therefore reduces to summation over n, 1 and m. Because of the isotropy of space, the sum (123.17) is also, of
course, independent of m.
§124
Radiative Shift of Mesic-atom Levels
551
by means of the formulae in §34, and leads to a dependence of the level shift on the
number j also. The result is
3jq+l)-<(l + l ) - n
. .0
_4mc2ZVr
J
Ln
~
3im
L ' 8 1(I + 1)(2I + 1) J' ' ^ ° -
8En,i
,m2m
°23-20)
Thus the radiative shift removes the last degeneracy which remains after the
spin-orbit interaction has been taken into account, namely the degeneracy of levels
having the same n and j but different I = j ± i For example, the numerical value of
L2\ is +0.030, and formulae (123.18)—( 123.20) give as the difference of the 2sm and
2pi/2 levels of the hydrogen atom
Eaxi/2) " E2K1/2) = 0.41mcV,
corresponding to a frequency of 1050 MHz.
§ 124. Radiative shift of mesic-atom levels
At the end of §118 the electron vacuum polarization has been shown to play an
important role in the (second-approximation) radiative correction to the magnetic
moment of the muon. This still more true (and even in the first approximation) as
regards the radiative level shift in /x-mesic hydrogen, a hydrogen-like system
consisting of a proton and a muon (A. D. Galanin and I. Ya. Pomeranchuk, 1952).
In calculating the level shift for an ordinary atom in §123, we took account, in
particular, of the electron vacuum polarization effect (the electron loop in the
diagram (121.2a)). If the muon vacuum polarization effect is similarly treated in the
mesic atom, the entire calculation can be applied to this case, simply replacing the
electron mass m = me by the muon mass mM. Since the relative shift (123.19) of the
levels does not depend on the electron mass, the same result is obtained for mesic
hydrogen.
It is easily seen that the electron vacuum polarization has a much stronger
effect on the level shift in the mesic atom, because the replacement of the muon
loop in the diagram by an electron loop implies the replacement of the muon
polarization operator by the electron polarization operator; and the polarization
operator &(q2) is inversely proportional to the square of the particle mass for
non-relativistic values of q2. Hence the change mentioned must increase the effect
by a factor (mjme)2, and it is this contribution which determines the order of
magnitude of the level shift:
8EI\E\~~a\mJme)\
or four orders of magnitude greater than in ordinary hydrogen.t The origin of this
effect can be more clearly seen by noting that the distortion of the Coulomb
t For a similar reason, the contribution of the muon vacuum polarization to the level shift in the
ordinary hydrogen atom is, conversely, negligible.
552
Radiative Corrections
§125
potential by the electron vacuum polarization extends to distances — \jme (§114).
In the ordinary hydrogen atom the electron is at distances from the nucleus that are
of the order of llm€a, i.e. outside the main region of field distortion, but in mesic
hydrogen the muon is at distances — l/mMa which are in this region.
To calculate precisely the level shift in the mesic atom, however, it is not
possible to use the approximate non-relativistic expression for the polarization
operator, as was done when using (123.7) to find the level shift in the ordinary
atom. The reason is that the characteristic momenta of the muon in the mesic
hydrogen atom are | p j ~ otmß. For the muon these momenta are non-relativistic,
but for the electron they are relativistic.
We must therefore use the full relativistic formula (114.5) for the effective
potential of the nuclear field as modified by the electron vacuum polarization. The
level shift is found by averaging over the wave function of the muon in the atom:
ôEnl =
-\e\j\il,nl\2ô<ï>(r)d'x
= - |e| f Rîi(r)8<ï>(r)r2 dry
o
(124.1)
where JR„/ is the radial part of the (non-relativistic) Coulomb wave function. For a
hydrogen-like ion with nuclear charge Z\e\, the functions Rni(r) depend on r only
through the dimensionless combination p = Z a m / (the distance in Coulomb units).
Using this fact and substituting ô<ï>(r) from (114.5) (with the charge Z\e\ in place of
ei), we can bring the integral (124.1) to the form
2
8Enl = -x—Za3m^Q„/(m€/ZamM),
(124.2)
where
00
»
R2nl(p)e-2*«(\ + 2^) V ( *V" ° dt.
Qnl(x) = jpdpj
0
1
The first few levels of mesic hydrogen are shown by numerical evaluation to have
the following relative shifts:
SE10/|E,o| = -6.4xl(r 3 ,
SE2o/|E2o| = -2.8xl(r 4 ,
ôE2i/|E2i| = -2.0xl(r 5 ,
§125. The relativistic equation for bound states
The method used in the preceding sections to calculate the radiative shift of the
atomic levels is not valid for solving a problem such as that of determining the
corrections to the levels of positronium, a system consisting of two particles of
§125
The Relativistic Equation for Bound States
553
equal rank, neither of which can be regarded as the source of the external field
acting on the other.
The systematic procedure for solving this problem is based on the fact that the
energy levels of the bound states are poles of the exact amplitude of mutual
scattering of these two particles, as a function of their total energy in the
centre-of-mass system. In any of its discrete states, positronium may be regarded
as an "intermediate particle" having a definite mass, which can be formed as a
stage in the electron-positron scattering process, and a pole of the scattering
amplitude corresponds to each "one-particle" intermediate state; these poles of
course lie in the non-physical region of 4-momenta of the particles undergoing
scattering.
According to (106.17), the exact scattering amplitude comprises the exact
four-ended vertex part Tik^m and the polarization amplitudes u of the particles. The
latter are clearly unconnected with the pole singularities, and it is therefore more
convenient to ignore them, referring instead to the poles of the vertex part itself,
i.e. of the function
r,k,im(P-, - P+; P-, - pi),
(125.1)
where the notation for the 4-momenta of the external lines of the diagram (106.12)
corresponds to the scattering of a positron by an electron.
It should be stressed that the assertion that poles are present refers to the exact
scattering amplitude or the exact vertex part; there is no pole in any separate term
of the perturbation-theory series, as can be seen from the fact that the Feynman
diagrams in each approximation include only electron (and photon) lines, not lines
belonging to the composite particle positronium as a whole. Hence it follows in
turn that the calculation of the scattering amplitude near its poles involves
summation of an infinite series of diagrams. The diagrams concerned can be
determined as follows.
In the first non-vanishing approximation of perturbation theory (the first approximation with respect to a), the vertex part (125.1) corresponds to two secondorder diagrams:
(125.2)
or, in analytical form,
Tium = - e27f[7LDM,(p_ -pL) + e2y?myvk{D^(P- + p.).
(125.3)
In the next approximation (the second with respect to a) there are ten
fourth-order diagrams:
§125
Radiative Corrections
554
q-pii
I
fp_q
/A,
y q-p+-p- V
I
(125.4)
and a further five diagrams obtained from (125.4) by interchanging p_ and -p,+. All
these include an extra power of e2 = a in comparison with (125.2), but we shall
show that in diagram (a) this extra order of smallness is cancelled by a denominator
which is also small when the electron and positron momenta are small.
All quantities will be taken in the centre-of-mass system, but, since the
4-momenta of the external lines in the diagrams are not assumed to be physical (i.e.
p2 j£ m2), e+ /^ e- in this system, although p+ = - p_. Thus these 4-momenta are
p_ = (e-,p), p+ = (£ + ,-p),
P:
= ( e :,p') p; = ( e ; , - P ' ) ,
(125.5)
The binding energy of the electron and the positron in positronium is ~ ma2. Thus,
in the neighbourhood of the scattering-amplitude poles, with which we are concerned,
IPI ~ IP'I ~ma<m,
| e - - m|~ |e+-m|~p 2 /m ~ ma2,.
(125.6)
The contribution to the vertex part from the diagram (125.4a) is
rj&, = -ie4j
(yKG(q)y'iHy''G(q - p_ - p+)yp)km
x DKp(q -p'-)D»v{p- ~ q) d4qK2ir)\
(125.7)
The important range of values of q" = (qo.q) in the integral (125.7) is that which is
close to poles of both functions G simultaneously. In this range, |q| and |<j0— "i| are
§125
The Relativistic Equation for Bound States
555
small, and the electron propagators are
.0
G(q)
y qo - y • q + m
« lufi
~*<7°+l)
q0-m
(q0 + m)(q0- m) - q2 + iO
G(q-p--p + )-ky°-D
1
-(q 2 /2m) + i0'
1
q o _ £ - - e + + m + (q / 2 m ) - i O '
(125.8)
The poles of these two expressions are on opposite sides of the real axis of the
complex variable q0; closing the path of integration along this axis by a contour in the
upper half-plane (say), we can calculate the integral over dq0 from the residue at the
corresponding pole.t The result is
'(4«)
~,j (q - pl) (p. - q)\2md \
2
- e- - E + + q 2 /m)'
and so, using (125.6), we have in order of magnitude
F(4a)^a2
(ma) 3 _ 1
(ma) ma
ma'
The contribution to V from the second-order diagram (125.2a) (the first term in
(125.3)) is of the same order, and this proves the statement made above about the
order of smallness of the diagram (125.4a). A similar situation occurs in all higher
approximations of perturbation theory.
Thus the calculation of the relevant vertex part near its poles calls for the
summation of an infinite succession of 4t anomalously large" diagrams with intermediate states resembling the internal lines of (125.4a). A typical property of these
diagrams is that they can be cut between the ends p_, - p + and pL,-p + into parts
joined only by two electron lines.$ The set of all diagrams which do not satisfy this
condition will be called a "compact" vertex part and denoted by f ikM; since it does
not include the anomalously large diagrams, such quantities can be calculated by
ordinary perturbation theory. For example, in the first approximation f is given by
the two second-order diagrams (125.2), and in the second approximation by the
eight fourth-order diagrams in (125.4), excluding diagrams (a) and (b).
If the non-compact vertex parts are classified according to the number of
"double bonds", we can represent the total T as an infinite series:
(125.9)
where the continuous thick internal lines are exact propagators ^ ; this is often
t For the diagram (125.4c), which differs from (125.4a) only as regards the relative direction of the
electron lines, both poles would be on the same side of the real axis, and so the integral would be zero in
the approximation considered.
t This definition includes all the anomalously large diagrams and also some "normal" diagrams such
as (125.4b).
§125
Radiative Corrections
556
called a "ladder" series. To sum the series, we "multiply" it on the left by a further
f:t
Comparison with the original series (125.9) shows that
p
-\
q
V
P-
pl
v
vp~
yp~
P-v
(125.10)
-p/
q-p;-pi
\p+
-pf
Vp +
-v/
^-P,
This graphical equation corresponds to the integral equation
iT lMm (p:, - p+; p_, - pi) = if ik.im(p -, ~ p+; p-, - p ;) +
+ J f ir,5m(p-, q - pi - P-; q, - P+)^(q)^nr(q - pi ~ P-) x
x rlWll(q, - p+; p-, q - pi - pL) d4ql(27r)4.
(125.11)
The functions f and <S are calculated by perturbation theory, and equation (125.11)
then allows, in principle, the determination of V with any desired accuracy.
To find the energy levels, we need to know only the positions of the poles of I\
Near the poles, T^>f, and so the first term on the right of (125.11) (the second
diagram from the right in (125.10)) may be neglected, the equation then becoming
homogeneous in I\ The variables p+,p~ and the indices k and I become parameters,
the dependence on which is arbitrary and is not defined by the equation itself.
Omitting these parameters and also the primes in the remaining variables pl> p-, we
have
ir,,m(p_; - p+) = J f ir,jm(p-, q - p+ - p-; q, - p+)%Aq) x
x^nr(q-p.-p-)rM(q;q-p+-p.)d4q/(27r)4
(125.12)
(E. E. Salpeter and H. A. Bethe, 1951).
Equation (125.12), written in the centre-of-mass system (p+ + p_ = 0), has solutions only for certain values of e+ + e-, and these give the positronium energy
levels. The function Tim plays only an auxiliary role. Another function is more
convenient in practice:
XAPU
Pi) = %t(Pi)rtAPù Pi)%riPi).
(125.13)
t That is, we multiply each term in the series by f and two % and integrate appropriately over the
4-momenta of the new internal bonds.
§125
557
The Relativistic Equation for Bound States
Then equation (125.12) becomes
= J f jr . jm (p-,q-p + -p_;q,-p + )Xsr(q, q-p+-p-)d4ql(2ir)\
(125.14)
where f appears as the kernel of an integral operator. As already mentioned, f may
be calculated by perturbation theory, and the same is of course true of <S~X.
We shall show that, in the first approximation of perturbation theory (with
respect to a), (125.14) reduces, as we should expect, to the non-relativistic
Schrodinger's equation for positronium.
In the first non-relativistic approximation, f is determined by the diagram
(125.2a) alone; the annihilation-type diagram (I25.2b) is zero in this approximation.! For a similar reason to that in §83, it is convenient to take the photon
propagator in the Coulomb gauge (76.12), (76.13), and only Doo need be retained in
it. Then
f irtJm(P-, q - P+ - P-; q, - P+) = - e2Y® y™Ax>(<J - P-)
= -U(q-p-)7?î7l,
where
l/(q) = -4ire2/q2
is the Fourier component of the potential energy of the Coulomb interaction
between the positron and the electron. Equation (125.14) becomes
= [G<P-)7° J U(q - p-)x(q, q - p+ - p_) 0^
• y°G( - p.)]^,
(125.15)
where we have also replaced the exact propagators <ê by the free-electron propagators G. The latter are given by the approximate expression (cf. (125.8))
G{p-) ~ \(\ + 7°)g(p_), G( - p+) « |(1 - y°)g(P+),
where the matrix factors have been separated, and g(p) is the scalar function
«(">-.-,-A2m)+ «r
(,2516)
In substituting these expressions in (125.15), we note that all the non-zero matrix
t The particle velocities in positronium are such that vie — a. In this sense the expansions in powers
of a and of 1/c are interrelated.
558
§125
Radiative Corrections
elements
Öd + yVx7° • Id - 7°)]- = û(y°+ Dx ' 1(7° - DL«
are equal to the elements - Xim- The matrix equation (125.15) is therefore equivalent
to one for the scalar function
î*(P-, - P+) = - g(P-)g(p + ) J U(q - p-)*(q, <ï ~ P+ " P-) d4ql(2rr)4.
(125.17)
We now replace p+ and p_ by the variables
P = (e, p) = i(p- - p+),
P =P- + p+\
these are the 4-momentum of the relative motion of the particles and that of the
positronium as a whole. In the centre-of-mass system, P = ( E + 2m,0), where
E+2m is the total energy, and E therefore the energy level relative to the rest
mass. In terms of these variables, (125.17) becomes
iX(p, P) = -g(p +\P)g(-p+{P)j
^-gip+lPM-p+ïP)!
U(q - p - ) x ( q -{P 9 P) dV(2ir) 4
U(q' - p)X(q\ P) d4q'l(27r)4.
In this equation, P occurs only as a parameter, and x figures on the right-hand side
only as the integral
*(q)= /
x(q,P)dqo.
Integrating both sides of the equation over de, we get an equation for $ in a closed
form:
^
(p)=
oc
~àl g ( p + i P ) g ( " p + i P ) d e ! U ( < , " p H ( q ) â'
where
g(±P+iP)=±E+,E_jp,/2m)+.o.
If the path of integration over de is closed by a contour in the upper half-plane
(say) of the complex variable e, we can evaluate the integral from the residue at the
corresponding pole, obtaining
( ^ - E ) * ( p ) + J l / ( p - q ) ^ ( q ) ^ , = 0.
(125.18)
This is Schrödinger's equation for positronium in the momentum representation;
see QM, (130.4).
§126
The Double Dispersion Relation
559
If only the diagrams (125.2) were used in f, but including in them (and in <$) the
next terms in the expansion in powers of 1/c, we should arrive at Breit's equation
(§83). The inclusion of the diagrams (125.4) (together with the further terms in the
expansion in 1/c) gives the radiative corrections to the positronium levels, but the
calculations become very complicated.
The following is the difference between the ground levels of ortho- and
para-positronium, including the above-mentioned corrections:t
ECSÔ - E('So) = a2 ^{l~(j
+ log 2) f - l t e } ;
(125.19)
the first term in the braces is the fine splitting (see §84, Problem 2). The second term
is the radiative correction to the difference between the levels. The imaginary part
of the difference arises from the parapositronium annihilation probability (see
(89.4)), i.e. from the fact that the level ^o is complex; for parapositronium, the
level width is found to be of the same order as the radiative correction to the real
part of the level.
§ 126. The double dispersion relation
After the vertex part with three external lines, the next in order of complexity is
a section with four external lines. In quantum electrodynamics, three basic
diagrams of this type are possible:
n
(126.1)
The first describes the scattering of a photon by a photon; the others are individual
terms in the radiative corrections to the scattering of (b) a photon and (c) an
electron, by an electron.
This §126 deals with some general properties of such diagrams, but to be simple
and specific we shall refer only to (126.1a).
The momenta of the lines in such a diagram will be denoted as follows:
krVkrk
\
\
<J-V '
J
/
k3
/
J \<\~\l
«H*** 's
\
t R. Karplus and A. Klein, Physical Review 87, 848, 1952.
(126.2)
Radiative Corrections
560
§126
The 4-momenta ku k2, k}, k4 correspond to real photons, and their squares are
therefore zero.
If the dependence on the photon polarizations is written separately, the amplitude Mfi which corresponds to the diagram (126.2) can be expressed in terms öf
various scalar functions of the photon 4-momenta. These are the invariant amplitudes discussed in §70; they will be derived in §127 for the specific case of
photon-photon scattering. Being scalars, they depend only on scalar variables,
which may be taken as, for example, any two of the quantities
s = (ki + k2)\
t = (k, - fc3)2,
M = (ki - k4)\
s + t + u = 0;
(126.3)
in what follows we shall take s and t as independent variables.
Each of the invariant amplitudes, which will be denoted here by the same letter
M, can be written as an integral:
"I
„-.
'Äd^R
W - m2][(q - k4)2 - m2][(q - k, - k2)2 - m2)[(q - k2)2 - m2]9
2
2
m -*m -
(126.4)
Î0,
where B is some function of all the 4-momenta; the factors in the denominator
arise from the propagators of the four virtual electrons.
When s and t are sufficiently small, the amplitudes M are real (more precisely,
they can be made real by a Suitable choice of the phase factor), since if s is small
the photons cannot generate real particles (an electron-positron pair) in the s
channel, and if t is small the same applies to the t channel.! Thus neither channel
has real intermediate states which could, according to the unitarity condition, lead
to an imaginary part of the amplitude.
Now let s increase while t remains at a fixed small value. When s ^4m 2 , the
amplitude M has an imaginary part due to the possibility of pair production by two
photons in the s channel. We can therefore write for M a dispersion relation in the
variable 5:
7T J
4m2
S — S — II)
where A ls (s, t) denotes the imaginary part of M (s, 0As in any diagram having the form
t The directions of the external lines as shown in the diagram (126.2) correspond to the s channel. In
the t channel, lines 1 and 3 are incoming, and so the 4-momenta of the initial photons are k\ and- ky.
The physical regions for photon-photon scattering in the variables s, f, u are the shaded sectors in Fig. 8
(§67). For example, the s channel corresponds to the region s >0, t <0 f u <0.
§126
561
The Double Dispersion Relation
A|S(s, 0 is calculated by the rule (115.9), replacing the pole factors in the integral
(126.4) by delta functions:
I\A
(, f\-n^;\2
f iB8(q2 - m2)8[(q - k, - k2)2 - m2]
2iA lf (M)-<2iri) J
[(q
.k4)J.mi][(q_k2)J.m4]
4
dp,
n u
. ,
(126.6)
the integration is taken over the half of q-space in which q°>0.
An important further step can be taken by noting that the integral (126.6) has a
structure (of pole factors) similar to that of the amplitude for a reaction represented by a diagram having the form
q-k 4
x
q-k 2
k,
The analytical properties of Ai,(s, t) as a function of t are therefore similar to the
analytical properties of this amplitude. In particular, the function A|,(M) can
acquire an imaginary part (as t increases) only if both factors in the denominator
become zero simultaneously. This will not, however, occur as soon as t reaches the
value 4m2 which is the threshold for pair production in the t channel. The reason is
that the presence of the delta functions in the integrand restricts the region of
integration in q-space, which may be incompatible with the value t =4m 2 . The
extent of the region of integration depends on s (the arguments of the delta
functions contain k\ and k2), and therefore so does the limiting value t = tc(s)
beyond which A\s(s, t) becomes complex.
In the same way as M(5, t) is expressed in terms of its imaginary part A\5(s, t)
by (126.5), the function AXs(s, t) is in turn expressed in terms of A2(s, 0 = im
Au(s, 0 by a dispersion relation in the variable t:
30
Au(s,t) = l
j j^j^dt'.
(126.7)
tc(s)
If we now substitute (126.7) in (126.5), we get the double dispersion relation or
Mandelstam representation for the amplitude Af(s, ()•
00
4m2
00
tt(s)
(S. Mandelstam, 1958).
The function A2(s, t) is called the double spectral density of M(5, t). It can be
obtained from the integral (126.6) by twice applying the substitution rule (115.9).
Putting for brevity
Ji = q, h = q-k4,
h = q-k2,
l4 = q-k]-k2,
(126.9)
562
§126
Radiative Corrections
we have
(2i)2A2(s, 0 =
(2TT04
j iB8(l]- m2)8(l22- m2)8(l]-
m2)8(H- m2) d4q,
(126.10)
the integration being taken over the region q ° > 0 .
It should be noted, however, that formula (126.10) is purely symbolic, since the
region s > 0, t > 0 is non-physical, and accordingly lu J 2 ,... are in general complex
in this region when q is real; and the delta function is not fully defined for a
complex argument. It would be more accurate to refer immediately to the taking of
residues at the corresponding poles of the original integral (126.4). In our case this
is, however, of no importance. The condition for the four expressions in the
denominator in (126.4), or the four arguments of the delta functions, to be zero,
entirely determines the components of the 4-vector q. On changing to integration
with respect to l], / 2 , . . . (see below) and formally applying the usual rules to
(126.10), we obtain (apart from the sign) the expression for A2.
To continue the calculations, we use the centre-of-mass system (in the s
channel). Then
k, = (co, k),
k2 = (o>, - k),
s = 4<o2,
k3 = (<o, k'),
k4 = (<o, - k'),
* = - (k - k')2 = - 4o>2 sin2 iO,
u = - ( k + k ; ) 2 =-4a) 2 cos 2 i0,
(126.11)
(126.12)
where 9 is the angle between k and k' (the scattering angle). The x-axis of spatial
Cartesian coordinates is taken along the vector k + k', and the y-axis along k - k ' . t
We shall now transform the integral (126.10) by taking J2, ! ! , . . . as new variables
of integration in place of the four components of q. Then
a(i?)/d<r = 2 i i M > . . . f
and the Jacobian of the transformation is therefore
-7—75
r — lDU,
where D is the determinant formed by the sixteen components of the four
4-vectors l|, I 2 ,.... The integration in (126.10) amounts simply to replacing the
functions B and D in the integrand by their valuest when
|2=|2
=
|2^|2==m2
(126.13)
t When f >0, (k-k') 2 <0, i.e. the vector k-k' is imaginary. This difficulty is, however, easily
circumvented by expanding all vector expressions with t < 0 and using analytical continuation to t > 0.
t This method of integration automatically takes account of only one zero of each argument of the
delta functions.
§ 126
The Double Dispersion Relation
563
2
Prom the conditions l] = l\ = m we have, as in §115,
q°=o>,
q 2 =w 2 -m 2 .
(126.14)
The other two conditions give
(q - k4)2 - m2 = - 2qk4 = - 2<o2 - 2q ► k' = 0,
( q - k 2 ) 2 - m 2 = -2<o 2 -2q-k = 0,
and hence
q k = q k ' = -ls,
or, in components,
q° = a), qx = - s/2(s + i), qy = 0,
q2 = ± V ( w 2 - m 2 - q 2 )
^ + rst-4m 2 (5 4-t)]' /2
"L
4(5 + 0
J "
(126.15)
Thus the integral (126.10) is
(126.16)
where the summation is over the two values of q given by (126.15).
The determinant D can be written in terms of the antisymmetric unit tensor:
D = e^itnm
= - w ( q - W(k* - *i)"(*2 ~ WK*
where the antisymmetry of e{kvva has been used. Since only ki among the four
factors has a time component, we deduce that
D = - «q • (k + k') * (k - k').
Expanding this expression with t < 0 and then continuing to t > 0, we find
D = - (oqzV(s + tW(-t)^±ii{st[st-4m\s
+ t)]}m.
(126.17)
The choice of sign needed here can be made as follows. For simplicity, let
8 = 1. Then A\s(s, t)<0 in the physical region (s >0, t <0), since the two factors
in the denominator in (126.6) have the same (negative) sign:
(q - k4)2 - m2 = - 2o>2 - 2q • k' < - 2o)(w - |q|) < 0,
(q - k2)2 - m2 = - la)2 - 2q • k < - 2a>((o - |q|) < 0
Radiative Corrections
564
§126
(here we use the results (126.14) which follow from the presence of the two delta
functions in the numerator, and which show that |q|<o>).t From (126.7) it is then
seen that A2(s, t) also must be negative when s > 0 and t > 0 (since, as is evident
from (126.16), A2(s, t) does not change sign). This means that the upper sign must
be taken in (126.17), giving finally
A,=
-7T4
SB
{ s f [ s f - 4 m 2 ( s + 0]} ,/2 "
(126.18)
Since, from its significance, A2(s, t) must be real, there is a further condition: as
well as s and t, the expression in brackets in the denominator must be positive:
+ 0^0,]
st-4m\s
s>0,
f>0.
(126.19)
These inequalities define the region (shaded in Fig. 23) over which the integration is
to be taken in the double dispersion integral (126.8). The region is bounded by the
curve
st - 4m2(s + r) = 0,
with asymptotes s = 4m 2 and t = 4m2.
The dispersion relations in the form (126.5) and (126.8) do not yet take account
of the renormalization conditions; if they were applied as they stand, the integrals
would be divergent and would need to be regularized. The renormalization condition for the amplitudes M (s, 0 is
M(0,0) = 0:
(126.20)
the photon-photon scattering amplitude must be zero when k\ = k2 = ki = k4 = 0
(and therefore s = t = 0), since k = 0 implies a potential constant in time and space,
corresponding to no physical field; this condition will be further discussed in §127.
4m'
4m2
FIG.
23.
t This is, of course, not fortuitous: Ai, is negative, in fact, because of the unitarity condition, as is
especially clear when t = 0 and Ai, determines the total cross-section.
§126
565
The Double Dispersion Relation
To include this condition automatically, we must write the dispersion relation
"with subtraction" (as in deriving (111.13) from (111.8)). The required relation is
obtained in a natural manner by first using an identical transformation:
!
.
(s'-s)(t'-t)
ä
(s'-s)(t'-t)s't'
s
+
-
+
(s'-s)s't'
*-
+-L
(t'-t)s't'
s't"
Substitution of this in the integrand (126.8) gives
w,
„
st f f A2(s\t')ds'dt'
s f f(s')ds'
t f g(t') dt'
where
/(s) =
If>MMO d( ,
TT J
I
g(() =
lfM£lOds,
77 J
S
These equations would, however, be meaningful only if all the integrals converged. If not, the functions /(s), g ( 0 and the constant C must be assigned
specified values in accordance with the renormalization condition, putting
C=0,
/(s) = A l 5 (s,0),
g(t) = A u (0,t),
where A ïf is the imaginary part of M(s, 0 which appears as t increases for a given
small s (just as A\s is the imaginary part which appears as s increases for a given
small 0- The first of these equations is obvious: C = M(0,0) = 0. The second (and
similarly the third) follows on comparing the equation
IT J {S
-
S)S
with the single dispersion relation (126.5) written "with subtraction" according to
(126.20):
Thus the double dispersion relation "with subtraction" is finally
If s and t are themselves within the region of integration, the integrals (126.21),
566
§127
Radiative Corrections
(126.22) must as usual be taken in the sense
s ->s + iO, f -> t + iO.
(126.23)
§ 127. Photon-photon scattering
The scattering of light by light (in a vacuum) is a specifically quantumelectrodynamic process; in classical electrodynamics it does not occur, owing to
the fact that Maxwell's equations are linear.t
In quantum electrodynamics, photon-photon scattering is described as the
result of the production of a virtual electron-positron pair by the two initial
photons, followed by the annihilation of the pair into the final photons. The
amplitude of this process (in the first non-vanishing approximation) is represented
by six "square" diagrams with every possible relative position of the four external
ends. These include the diagrams
l
q-M
/
/
i \
/k<
k
N
Iq-kî
q-krk:\
\k,
(127.1)
*>'
\k,
k
4
/
\k,
and another three which differ from these only in that the internal electron loop is
traversed in the opposite direction. The contribution of these three diagrams is the
same as that of the diagrams (127.1), and the total scattering amplitude is therefore
Mfi = 2(MW + Mw + Mic%
(127.2)
where M(a), M(b) and M(c) are the contributions of diagrams (a), (b) and (c).
According to (64.19) the scattering cross-section is
, do'
1
IM«! (2^p.
da = 6 4 7T1W\
(127.3)
where do' is the solid-angle element for the direction k' in the centre-of-mass
system. The scattering angle in that system is denoted by 6.
INVARIANT AMPLITUDES
Writing separately the polarization factors of the four photons, we have M/, in
the form
(127.4)
Mfi = e M * 3 M * M w ;
t In the limit of low frequencies, this process was first discussed by H. Euler (1936), and in the
ultra-relativistic case by A. I. Akhiezer ( 1937). The complete solution is due to R. Karplus and M. Neumann
(1951).
§127
Photon-Photon
Scattering
567
the 4-tensor MAMi/p (called the photon-photon scattering tensor) is a function of the
4-momenta of all the photons. If the arguments of functions are written with the
signs which correspond to like directions of the external lines in the diagram, it is
evident from the symmetry of the group of diagrams (127.1) that
MK^p(kuk2,-k},-k4)
is symmetrical with respect to any interchange of the four arguments together with
a simultaneous corresponding interchange of the four suffixes. Because of the
gauge invariance, the amplitude (127.4) is unchanged when e is replaced by
e + constant • k. Thus we must have
k\Mk^
= ktMk9HM,= --- = 0.
(127.5)
It is easily deduced from this that, in particular, the expansion of the scattering
tensor in powers of the 4-momenta k\, k2,... must begin with terms containing
quaternary products of the components, and certainly
M w (0,0,0,0) = 0.
(127.6)
To determine the actual invariant amplitudes, however, it is desirable to take
from the start a particular gauge of the polarization 4-vectors e, in which
er = (0,ei), e? = (0,e 2 ),....
(127.7)
Mfi = MMmeuelke^ie*Am,
(127.8)
Then
where Miklm is a three-dimensional tensor.
We take as the two independent polarizations for each photon the circular
polarizations with opposite directions of rotation, i.e. two helical states with
helicities A = ± 1.
The tensor Mik\m can then be written
M iklm =
S
À,À 2 À 3 À4
MA|A2A,A4ettï)*eiV)*e^>eSis>;
(127.9)
the sixteen quantities AfÀ|A2À3À4 are functions of s, t and uy and act as invariant
amplitudes, but they are not all independent.
The quantities MAjA2A3A4 are three-dimensional scalars. Spatial inversion changes
the sign of the helicities, while the invariant quantities s, t and u remain unaltered.
The condition of P invariance therefore gives the relations
MAlA2A3À4(s, r, u) = Af-A|.-5A2.-Aj,-A4(^ t, u).
(127.10)
Time reversal interchanges the initial and final photons without affecting their
Radiative Corrections
568
§127
helicities; s, t and u again remain unaltered. The condition of T invariance
therefore gives the equation
AfAlx2A3A4(s, t, u) =
M A 3 M I A 2 (S,
t, u).
(127.11)
Lastly, one further relation follows from the invariance of the amplitude Mfi under
the interchange of the two initial or the two final photons. If both interchanges are
made (k\ <-> k2, ki<r* k4), the variables s, t, u are unaltered, and the interchange in
the polarization indices leads to
M A l A 2 À 3 A 4 (s, r, u) = M A 2 A l A 4 A 3 (s, t, u).
(127.12)
It is easy to see that, because of the symmetry properties (127.10M127.12), the
number of independent invariant amplitudes is only five, which may be chosen, for
example, as
M++++,
M++—, M+-+-,
M+—+, M+++-
(the suffixes + and - denoting, for brevity, helicity values + 1 and - 1).
If one of the amplitudes MXlk2x3\4 is substituted for Mfi in (127.3), the result is the
cross-section for scattering with specified polarizations of the initial and final
photons. The cross-section summed over the final polarizations and averaged over
the initial polarizations is obtained by the substitution
\Mfi\2 -> }{2| M++++I2 + 2|M++__|2 + 2|M+-+_|2 +
+ 2|M,-„| 2 + 8|M, + .-| 2 }.
(127.13)
The symmetry relations (127.10>-(127.12) connect different invariant amplitudes
as functions of the same variables. Further functional relations are obtained from
crossing invariance (§78), since the amplitude Mfi describes the same reaction
(photon-photon scattering) in every channel, and therefore must be the same for
every channel.
The s channel (corresponding to the arrow directions as shown in the diagrams
(127.1)) is converted to the t channel by interchanging the 4-momenta k2 a n d - k 3
(i.e. by changing the variables s ++1) and interchanging the helicity suffixes
À2 <->- A3. Similarly, it is converted to the u channel by interchanging k2 and - k4
(s *+ u) and A2 <-» - A4. This leads to the relations
M+-+-(s, r, u) = Af++++(w, t, s),
M+-+(s, U u) = M++++(t, s, u), \
(127.14)
M + + + + (s, r, u) = M++++(s, M, 0 ;
M++— and M+++_ are completely symmetrical in 5, t and w.t It is therefore sufficient
t Here we have also used the symmetry with respect to the two final photons. Since the three
variables s, f and u are not independent» it would be sufficient to write two arguments (for example, the
first two), but we have retained all three, simply in order to clarify the symmetry of the interchanges.
Photon-Photon
§127
Scattering
569
to calculate only three of the sixteen amplitudes, for instance M++++> M++— and
M
The relations ( 127.10M127.12) and (127.14) apply to the total amplitudes, i.e. the
sums of the contributions of all three diagrams (127.1). But these contributions
themselves are related in a manner which is obvious on comparing the diagrams.
For example, diagram (b) is obtained from diagram (a) by the substitutions
k2 <-» - k4, £:<-*£*, and so their contributions to the invariant amplitudes are
obtained from each other by interchanging the variables s <-♦ u and the suffixes
A2«->-A4; similarly, the contribution of diagram (c) is obtained from that of
diagram (a) by the changes t «-» w, A3 <-> - A4.
CALCULATION OF THE AMPLITUDES
The integral M\V corresponding to the diagram (127.1a) has the form (126.4),
with
B(a) = — tr {ye^yq -yk2 + m)(ye2) x
x (yq + m)(yet)(yq
- yk4 + m){ye%){yq - yk{ - yk2 + m)}.
(127.15)
The integrals (126.4) are logarithmically divergent. In accordance with the
condition (127.6), they are regularized by subtracting the value when k\ = k2 = * • • =
O.t The calculation of the regularized integrals is, however, exceedingly laborious.
The most straightforward way to calculate the photon-photon scattering amplitudes is based on the use of the double dispersion relation (B. De Tollis, 1964).
This method makes the most complete allowance for the symmetry of the
diagrams, and almost entirely eliminates the difficulties of the integrations.
The function A\as\s, t) (and similarly Aft}) for any given set of helicities Ai, A2,
A3, A4 is calculated in accordance with (126.6); owing to the presence of two delta
functions in the integrand, the value of B{a) is needed only for
l] = q2 = m\
15 s (q - fc, - k2)2 = m2.
(127.16)
These equations can be utilized in calculating the trace (127.15). For substitution in
t In the summation of the contributions from all the diagrams, the divergent parts of the integrals cancel,
as is easily seen by noting the asymptotic form of the integral as q-*»:
M^ P «
jtT{y,{yq)yAyq)yP(yq)yÄyq)}d4ql{q2)4.
After averaging over the directions of q (cf. (131.10)), the trace is easily calculated, giving
M{KU « (g^g,P + gA.g^p - 2gApgM„) I d4ql(q2)2.
The summation over the diagrams is equivalent to symmetrizing this expression with respect to the suffixes
A, ji, v and p, the result of which is zero. However, this is in a certain sense fortuitous, and does not remove
the need for regularization, even thpugh the latter amounts only to the subtraction of a finite quantity.
570
§127
Radiative Corrections
(126.22) we need the value of AW only for t = 0. This implies that k = k' and k2 = fc4.
Then the integral (126.6) becomes
ms,0)
= .^^nlj[(q_B^lw
(12717)
cf. the derivation of (115.10). With the angle d between q and k, we have
(q - k2)2 - m2 = - 2cu(l - |q| cos d)
= - Vs[l - |V(s - 4m2) cos #].
Thé integrals (127.17) can in fact be expressed in terms of elementary functions.
The calculation of A^a)(s, t) from its definition (126.18) involves no integration; here
the expression for B(a) is to be taken for the values of q given by (126.15), which
satisfy not only (127.16) but also the conditions (q - k2)2 = m2, (q - k4f = m2.
When the functions Ais, Au and A2 have been calculated, the dispersion relation
(126.22) gives the amplitude directly as single and double definite integrals. We
shall give thefinalresult for the three invariant amplitudes which, according to the
preceding discussion, are sufficient to determine all the other amplitudes:!
ji,*#„„ = - ! - ( 2 + ^ ) B ( 0 - ( 2 + ^ ) B ( « ) -
-[?^-f][T(0 + T(»)] + f(l-2)/(..r) +
.I
f i J k ^L
^ sJ - l s- it- iuktu]
» , , (,27.18)
u \ s/
5^M+++-=l+4(-
+ 7 + f)[T(s)+T(D+T(ii)]\s t u/
8a
-4(ÏÏ+ÏÏK'>-4(Ï+SK">-4(Î+^K")>
S?**„.-.i-l/(,.o-i/(,..)-^/(«.«).
Here, B(s), T(s) and I(s, t) denote the functions
B(s) = V ( l - 4 / s ) s i n h _ 4 V - s - l ,
_,
2
T(5) = (sinh |V-s) ,
s<0,
s<0,
i
J(s,t) = i J y ( 1 _ y ) f y ( 5
+ 0 / s f {log[l-iO-5 y (l-y)]
+ log[l-iO-ty(l-y)]},
0
(127.19)
t Some further details of the transformations of the integrals, various representations of the
transcendental functions B, T and I, and some limiting forms are given by B. De Tollis, Nuovo Cimento [10]
32, 757, 1964; 35, 1182, 1965; V. Costantini, B. De Tollis and G. Pistoni, ibid. [11] 2A, 733, 1971.
Photon-Photon Scattering
§127
571
and the expressions in the ranges 0 < s < 4 and s>4 are obtained from (127.19) by
analytical continuation with the rule s -> s + iO, i.e. through the upper half-plane of
these variables. To simplify the notation, s and t denote slm2 and tlm2, in (127.18)
and (127.19) only.
SCATTERING CROSS-SECTION
The limiting case of low frequencies (to < m) corresponds to small values of the
variables s, t and u. Thefirstterms in the expansion of the invariant amplitudes in
powers of these variables are
M + + + + - lleV/45m 4 ,
M+_+_ ~ lleV/45m 4 ,
M+__+~ lleV/45m 4 ,
M + + - « - e4(s2 + t2 + u2)/15m4,
(127.20)
M + + + -~0.
Substituting these values in (127.3), we find the cross-sections for the scattering of
polarized photons. The differential scattering cross-section for unpolarized photons, calculated from (127.13), is (in ordinary units)
d(J =
4 ^ W a V <(^) 6 ( 3 + C0S2 0) d°'
02721)
and the total cross-section ist
973
2
er =10125- a r
2/ftw Y
5
\mc2/
= 0.031a 2 r 2 (^p) , h<o <§ mc2.
IT
(127.22)
In the opposite (ultra-relativistic) case, the total scattering cross-section for
unpolarized photons ist
<r = 4.7a4(c/<o)2, hot > mc2.
(127.23)
Finally, the differential cross-section for small-angle scattering in the ultrarelativistic case is
der = - ^ 1 log4 ^ do, mc2lho><e<\.
7T 0)
V
(127.24)
This expression is valid with logarithmic accuracy (the next term in the expansion
contains a power of the large logarithm lower by one unit). In the limit 0 = 0
t In going from der to o\ a factor i has to be included to take account of the identity of the two final
photons.
X The origin of this dependence of or on <o will be further discussed at the end of §134.
572
Radiative Corrections
§127
10°
io- 3
«
©
i<r 6
l<Tl
_____
I02
IO 3
IO1
10°
FIG.
24.
(forward scattering), (127.24) is invalid, and is replaced by
da = ^2
log4 - ^ do,
6 « mc2/ft„.
(127.25)
This expression is easily derived from the general formulae (127.18), putting t = 0
and noting that, for 5 > 1, the highest power (the square) of the large logarithm is
present only in the function
T(slm2) ~ i log2 (slm2) - log2 (o>/m).
To this accuracy, the only non-zero amplitudes are
M++++ = M — . = M+-+- = - 16e4 log2 {(olrn).
In particular, therefore, the photon polarization is in this case unchanged on
scattering.
Figure 24 shows the total scattering cross-section as a function of the
frequency, plotted on a double logarithmic scale. The cross-section decreases
towards both low and high frequencies, reaching a maximum when hit) — l.Srhc2.
The break in the curve at too = mc2 corresponds to the change in the nature of the
process when the production of a real electron-positron pair becomes possible.
LOW FREQUENCIES
For low frequencies (o> <m), the photon-photon scattering amplitude can also
be derived by a totally different method, based on the correction terms in the
Lagrangian of a weak electromagnetic field (§129).
The small correction V' to the interaction Hamiltonian differs only in sign from
that to the Lagrangian. From (129.21),
V' = -
45x
j ^ m w | {(Ê2 - H2)2.+ 7(Ê • H)2} d3x.
(127.26)
§128
Coherent Scattering of a Photon in the Field of a Nucleus
573
Since this operator is of the fourth order in the field, it has matrix elements for the
relevant transition, even in the first approximation.
For the calculation, we substitute in (127.26)
Ê = -3Â/3f,
H = curl A,
A = V(4ir) 2 (ckA ekA e'ikx + c k \ e{A e f a ),
(127.27)
where A numbers the polarization; the S-matrix element is then given by
Sfi =
-i(f\jv'dt\i)
= - i<0|c M3 c M 4 j V at cilkt ct2h\0)
(127.28)
(cf. §§72 and 77). When A is normalized as in (127.27), the scattering amplitude Mfi
is found immediately from Sfi:
Sfi = i(27r)4ô(4)(/c3 + k4 - fci - ki)Mn
(127.29)
(cf. §64). The mean value in (127.28) is calculated by means of Wick's theorem,
using (77.3), with contraction of only the "external" operators ckA, ckA with the
internal operators A.
§ 128. Coherent scattering of a photon in the field of a
nucleus
Other effects which are non-linear, like photon-photon scattering, and are
described by square diagrams of the form (127.1), are the disintegration of one
photon into two in an external field (and the reverse process of combination of two
photons into one), and photon scattering in an external field. The former corresponds to diagrams in which one of the four external photon lines is replaced by
an external field line; the latter process corresponds to diagrams with two external
lines of real photons and two of virtual photons.
This class includes, in particular, coherent (elastic) scattering of a photon in the
constant electric field of a stationary nucleus. In general, the calculations lead to
very lengthy formulae involving multiple quadratures.t Here, only some estimates
will be given.
Because of the requirements of gauge invariance, the scattering amplitude as
tu->0 must contain products of the components of the 4-momenta of the initial
photon (k) and the final photon (Jc'), just as the expansion of the photon-photon
scattering amplitude begins with the quaternary products of the components of the
4-momenta of all the photons. Thus the scattering amplitude for a low-frequency
photon is proportional to <o2. Since also this amplitude involves the external field
t See V. Costantini, B. De Toüis and G. Pistoni, Nuovo Cimento [II] 2A, 733, 1971 ; B. De Tollis, M.
Lusignoli and G. Pistoni, ibid. 32A, 227, 1976.
574
§128
Radiative Corrections
(the field of the nucleus with charge Ze) in the second order, we conclude that the
scattering cross-section is
da - Z4a4r2e(a>lm)4 do
(co < m).
(128.1)
The frequency dependence is, of course, in agreement with the general results of
§59.
The coefficient in (128.1) cannot be calculated from the Lagrangian for a
uniform electromagnetic field (as was done for photon-photon scattering). The
reason is that, in the process here considered, distances from the nucleus r— 1/m
at which its field cannot be regarded as uniform are important.
The result of the exact calculation is
d(7++ = d ( j - = 1.004 x 10 ~3(Za)4r2(co/m)4 cos 4 \e do,
da+- = dcr-+ = 3.81 x \0-\Za)4r2e((olm)A sin4i2d do.
(128.2)
Here, as in §127, the suffixes + and - denote the helicities + 1 and - 1 of the final
and initial photons; 6 is the scattering angle in the rest frame of the nucleus (V.
Costantini, B. De Tollis and G. Pistoni, 1971).
To estimate the cross-section at high frequencies, we use the optical theorem
(§71). The intermediate state which appears on the right-hand side of the unitarity
relation is here a state of the electron-positron pair (corresponding to the division
of the diagrams at two internal electron lines between external photon lines). The
optical theorem therefore relates the amplitude for elastic scattering of a photon
through an angle of zero and the total cross-section ap^r for photon pair production
in the field of the nucleus. If the amplitude /(o>, 0) for scattering through an angle 6
is so defined that the scattering cross-section is da = |/| 2 do (cf. (71.5)), we have
i m /(<*), 0 ) = 0>CTpair/47r.
The cross-section <rpair is, of course, zero unless œ > 2m. In the ultra-relativistic
case, taking crpair from (94.6), we get
f"W - im /(ai, 0) = ^(Za)2re^
[log ~ ™ ]
(a>»m).
(128.3)
The real part of the scattering amplitude is determined by the imaginary part,
through the dispersion relation. The latter must here be written "with one subtraction", i.e. for the function fit (where t = o>2), since as a>->0 the amplitude / <* o>2;
compare the dispersion relation "with two subtractions" (111.13). Separating the
real part of the dispersion integral (for which it is sufficient to take the integral as a
principal value), and changing from integration with respect to V = o>'2 to that with
respect to <o', we have
/'<•) • re /(a,, 0) = ^
7T
P f j f f l t v
J O) \0)
1m
(O )
UM-4>
§129
Radiative Corrections to the Electromagnetic Field Equations
575
When a) > m, the important values in the integral are o)1 — co> my so that we can
use the expression (128.3) for /"(<*/); the lower limit of integration may then be
replaced by zero. The principal value of the integral can be represented as half the
sum of the integrals along paths on the upper and lower edges of the positive real
axis in the complex cu'-plane; these paths may then in turn be rotated in the
CÜ'-plane to lie along the positive and negative imaginary axes respectively. Then
f
u
/(w)
-
Q>2n"(iÇ) +
TTJ
o
= — (Za)
97T
m2+o>i)
— a)
m
f"(-iQd,
di
^ * j
J £ +ar
and the final result is
re /(to, 0) = ^ (Za)2 reœlm.
(128.5)
Note that the real part of the amplitude, unlike the imaginary part, does not contain
a large logarithm.
The sum of the squares of (128.3) and (128.5) gives the cross-section for
scattering through an angle of zero as
dcr.=0 = g g , (Za)' ^ ) 2 { l o g ' ( 4 ^ ) + V } do
(128.6)
(F. Rohrlich and R. L. Gluckstern, 1952).
The result (128.6) derived for scattering exactly forwards is valid also over a
certain range of small angles. The condition for its validity can be shown to be
0<^(m/(u)2. This range, however, makes only a small contribution to the total
scattering cross-section. The main contribution to the latter comes from angles
6 *£ m/cu, as is easily seen from the general (not only for angle zero) unitarity
relation between the amplitudes for photon-photon scattering and photon pair
production. In that range, however, the logarithmic term is absent, and the total
scattering cross-section is thus
a - (Za)4 r](a>lm)262 - (Za)4 r\
(128.7)
(H. A. Bethe and F. Rohrlich, 1952). For large w, therefore, the coherent scattering
cross-section tends to a constant limit.
§ 129. Radiative corrections to the electromagnetic field
equations
In the quantization of the electron-positron field (§25) it has been shown that
the expression for the vacuum energy contains an infinite constant, which may be
576
Radiative Corrections
§129
writtent
£o=-24-
(129.1)
P,C7
where - e{pJ are the negative frequencies of the solutions of Dirac's equation. This
constant itself has no physical meaning, since the vacuum energy is, by definition,
zero. When an electromagnetic field is present, however, the energy levels e{pJ will
change. The changes are finite and are physically significant. They describe the field
dependence of the properties of space, and alter the equations of the electromagnetic field in a vacuum.
The changes in the field equations correspond to the change in the field
Lagrangian. The density L of the Lagrangian is a relativistic invariant, and
therefore can depend only on the invariants E2 - H2 and E • H. The usual expression
L0 = (E2-H2)/8TT
(129.2)
is the first term in the expansion of the general expression in powers of the
invariants.
Let us derive the Lagrangian for the case where the fields E and H vary so
slowly in space and time that they can be regarded as uniform and constant. Then
L may be assumed not to involve the derivatives of the fields. The necessary
conditions for this will be discussed at the end of the section.
However, if the problem stated is to be meaningful, we must also assume the
electric field to be sufficiently weak. The reason is that a uniform electric field can
generate pairs from the vacuum. The field itself can be treated as a closed system
only if the pair production probability is sufficiently small:
\E\<èm2l\e\
( = m2c3l\e\h),
(129.3)
i.e. the change in the energy of a charge e over a distance hlmc must be small in
comparison with mc2. We shall see below (cf. also Problem 2) that the pair
production probability is then exponentially small.
If there is a magnetic field as well as an electric field, it is in general possible to
choose a frame of reference in which E and H are parallel. Then the magnetic field
does not influence the motion of the charge in the direction of E. The condition
(129.3) is to be satisfied in this frame, which will be the one used in the subsequent
calculations.
The calculation of the Lagrangian begins with that of the change W in the
vacuum energy. This is given by the change in the "zero energy" (129.1) due to the
field. From this, however, we must subtract the mean values of the potential energy
of the electrons in the "states" of negative energy. The subtraction simply makes
the total charge of the vacuum zero by definition.
The zero energy in the presence of the field is
*o = - 2 4 - = 2 I *£* • « Yf *V d\
P,CT
p,<T J
Q+
t Here we shall write # in place of E, to avoid confusion with the electric field.
(129.4)
§129
Radiative Corrections to the Electromagnetic Field Equations
577
where i//p7r are the negative-frequency solutions of Dirac's equation in the field
concerned. We shall assume that the integration is over a unit volume, and that the
wave functions are normalized to unity in that volume; then %§ is the energy per
unit volume. According to the preceding discussion, we have to subtract from g 0
the quantity
P,(T J
where <>/ = - E • r is the potential of the uniform field. According to the theorem on
the differentiation of an operator with respect to a parameter (see QM, (11.16)),
= -E-2de ( p ;7dE
= E • dg0/dE.
Thus the total change in the vacuum energy density is
W = («o - E • d%oldE) - («o ~ E • dg0/dE)E==H=o.
(129.5)
We can relate W to the change L' in the Lagrangian density (L = L 0 + L') by
using the general formula
W = 2 q dLldq - L,
where q represents the "generalized coordinates" of the field (see Fields, §32). For
an electromagnetic field, the quantities q are the potentials A and </>. Since
E = -À-V<fc
H = curl A,
(129.6)
À is the only "velocity" q which appears in L, and the differentiation with respect
to À is equivalent to one with respect to E; hence
W = E-dL7dE-L\
(129.7)
Comparison of (129.5) and (129.7) gives
L' = -[*O-(*O)E.H«O].
(129.8)
Thus L' can be calculated by means of the sum (129.1).
Let us first take the case where there is only a magnetic field. The "negative"
energy levels of the electron (charge e = - \e\) in a constant uniform field Hz = H
578
Radiative Corrections
§129
- 4 _ ) = - V[m2 + \e\H(2n - 1 + cr) + pll
(129.9)
are
n = 0, 1,2,... ;a = ± 1
(see §32, Problem). To find the sum, we note that the number of states in the
interval dpz is
\e\H dpz
2TT 2TT
(see QM, §112); the first factor is the number of states with various values of px,
which do not affect the energy. Moreover, all the levels except n = 0, a = - 1 are
doubly degenerate, the levels n, a = + 1 and n + 1, a = - 1 coinciding. Hence
~^0 = (f^? J {V(m2 + P') + 2nf1 V(m2 + 2|e|Hn+p^)}dpz. (129. 10)
The divergence of the integrals in (129.10) is eliminated in the calculation of L'
(129.8) by subtracting the value of the sum when H = 0. To carry out this
"renormalization", it is convenient to calculate first the convergent expression
(PEE-
dl%
«
0
__l£lïï.f
"
1
8iP"UP
, ?ZY
.
2
!
1
Ä m + 2|e|HnJ-
The sum in the braces can be reduced to that of a geometrical progression, as
follows:
0
o
= - l f ^ f e-m2T,coth(|e|HTj)dî).
OTT
(129.11)
J
0
To find L\ we must now integrate 4> twice with respect to m2 and then subtract the
i
§129
Radiative Corrections to the Electromagnetic Field Equations
579
value of the resulting quantity when H = 0. This gives
V =- i
f ^ - [i]\e\H coth (rj|e|H) - 1} dtj + c, + c2m2,
(129.12)
0
where C\ and c2 depend on H but not on m2.
From considerations of dimensions and of parity with respect to H, it is evident
that L' as a function of H and m must have the form
L' = m4f(H2lm4).
Hence there can be no terms in U which are odd in m2, and so c2 = 0. The
coefficient C\ is given by the condition that the expansion of U in powers of H2
begins with a term in H4: a term in H2 would simply alter the coefficient in the
original Lagrangian L 0 = - H2I$TT, and this would essentially signify a changed
definition of the field and therefore of the charge. The elimination of the H2 terms
thus corresponds to a renormalization of charge. It is easily verified that this is
achieved by putting
o
Finally, making the change of variable m2i) ->T) in (129.12), we have
L\H ; E = 0) = ^
07T J
f {- i]b coth bi\ + 1 + \b2r)2} e'11 ^
0
ÎJ
(129.13)
where b = \e\Hjm2.
Let us now go to the general case where there is not only a magnetic field but
also an electric field E parallel to it, satisfying the condition (129.3).
To find V in this case it is not necessary to determine afresh the energy levels
e{p} of the electron in the field; we need only note that, if the wave function (the
solution of the second-order equation (32.7)) is sought as a product
^ = ^E(z)e^«r(y),
where *v(y) is the wave function in the magnetic field when E = 0 and pz = 0, then
the mass m and the field H appear in the equation for tMz) only in the
combination
m2 + \e\H(2n + \ + a).
If now the factor \e\HI2ir is again taken from the summation over px (the energy
580
Radiative Corrections
§129
levels being independent of px), the dimensional argument shows that
<D(H,E) = d2L'/(dm2)2
F([m2 + \e\H(2n + 1 + cr)]/|e|H)
m2+\e\H(2n + \ + <r)
\e\H A y
Sp'èoaiix
* ir(X\ i ?YFil
8 ? r U ^ À
a = \e\E/m2;
+
2bnla)
]
l + 2bn J'
(129.14)
each term in the sum is - d2e(p_)/(dm2)2 summed over all the quantum numbers
except n. Here F is a function as yet unknown, which will be derived from
considerations of relativistic invariance.
<ï> must be a function of the scalars H2 - £ 2 and (EH)2 = (E • H)2:
<D(H, JB) = f(H2 - E2, (EH)2).
Hence
<D(0,E) = / ( - E 2 , 0 ) = <D(iE,0).
The function <î>(iE, 0) is obtained from (129.11) by putting H ~+ iE; after a change of
notation for the variable of integration, this gives
4>(»E,0) = i
OTT
f e-^cottjdr,.
J
0
(129.15)
The function F can be found by comparing this expression with the limit 4>(H ->
0, E) given by (129.14). The passage to the limit H -+0 in (129.14) can be effected
by replacing the summation over n by integration over dn = dxjlb:
•"»-K'J'frOr?x
= - j T i I ^>dy.
(129.16)
Ma
Equating the expressions (129.15) and (129.16) and differentiating with respect to
1/a = z, we find
F(z)lz = - I e~^z TJ cot T) drj.
o
The summation in (129.14) then reduces again to the summation of a geometric
§129
Radiative Corrections to the Electromagnetic Field Equations
581
progression, and the subsequent calculations are similar to those given previously:
we express <f> in terms of m2, E and H, integrate twice with respect to m2, subtract
the value for E = H = 0, and determine the constants of integration as in the
derivation of (129.13). The final result ist
m4 f e ^
L' = g^p J —r { - (W cot i\a){i\b coth r)b) + 1 - \i)\a2 - b2)} drj,
o
a = \e\Elm2 ( = |e|fiE/m2c3),
(129.17)
b = |e|tf/m 2 (= |e|ftH/m2c3).
The parameters a and b may be written in the invariant form
a = -
i e
-1{(& +
icg)m-(&-i(ß)m},
(129.18)
b =
\&{{UJ
+ i<ß)m
+ (ä?
~
mm)
>
where .> and ^denote the invariants
^ = KB2 - E2),
« = E • H,
9 ± Vê = i(H ± ÎE)2.
(129.19)
When (129.17) is expressed in terms of the invariants ^ and ^, it becomes
applicable in any frame of reference (not only in that where E||H).
The formula (129.17) is written in a somewhat arbitrary manner. It is valid only
if the electric field is small: a <\ (129.3); this condition is not shown explicitly in
(129.17), but can be seen from the fact that the integrand in (129.17) has poles at
7) = mrja (n = 1,2,...), and the integral as written above has, strictly speaking, no
meaning. Hence (129.17) can essentially be used only to derive the terms of the
asymptotic series (see below) in powers of a by a formal expansion of cot a.
The integral (129.17) can be given a mathematical meaning by passing round the
poles in the complex rj-plane. Then L', and therefore the energy density W\ have
an imaginary part. Since the energy is complex, there are quasi-stationary states.t
In the present case, the stationarity condition is violated by pair production, and
- 2 im W is the probability w of pair production per unit volume and time; since
the small increments of W and L differ only in sign, the probability w, expressed in
terms of E and H, is simply
w = 2 im V.
(129.20)
This is clearly proportional to e~^a (see (129.22) below). Because im W is
exponentially small when a <è 1, an asymptotic series in powers of a, retaining any
finite number of terms, is meaningful.
t This was first derived by W. Heisenberg and H. Euler (1935). The analysis given above makes use
also of the principles of a proof suggested by V. F. Weisskopf (1936).
t The direction of passage round the poles must be chosen so that im W < 0 ; this corresponds to
the usual rule m2-> m2 - iO (i.e. here a-*a + I'O).
§129
Radiative Corrections
582
Let us consider the limiting cases of formula (129.17). In weak fields (a<\
b < 1), the leading terms of the expansion are
In particular, when b = 0 the relative correction is
L7L0 = aa2/457T.
The imaginary part of U for a <è 1 is obtained from the integral (129.17) by
taking half the residue at the pole of the cotangent nearest to the origin, i.e. at
rja = 7T - iO. From (129.20), this gives the probability of pair production by a weak
electric field:
or, in ordinary units,
w
1 (eEh\2mc2
(mc\>
- 4? UP?) T nr)
(
exp
7rmV\
( - i^isr)-
M1fl...
(129 22)
-
In a strong magnetic field (a =0, b > 1), we start from (129.13), written (with
brj->Tj) as
-^r-^iï-^^h,4^2 r
»-nib
When b > 1, the important range in this integral is 1 <^ r\ < b, in which e"""' ~ 1 and
we can neglect the second term in the brackets, terminating the range of integration
(with logarithmic accuracy) at TJ « 1 and r\ « b. Then
V = (m V/24ir2) log b;
(129.23)
in a more exact result, log b becomes log b - 2.29. The ratio L'IL0 is here
L7Lo-(a/3ir)logb,
from which we see that the radiative corrections to the field equations may become
of relative order unity only in exponentially strong fields:
H~(m2l\e\)e3lTla.
(129.24)
The corrections calculated above are, nevertheless, meaningful: they remove
the linearity of Maxwell's equations, and thus lead to effects which are in principle
observable (e.g. scattering of light by light or in an external field).
§129
Radiative Corrections to the Electromagnetic Field Equations
583
The relation between the fields E and H and the potentials A and <j> remains, by
definition, as before (129.6), and there is therefore also no change in thefirstpair of
Maxwell's equations:
divH = 0, curlE = - ^ .
(129.25)
The second pair of equations are obtained by varying the action
S=l(L0+L')d4x
with respect to A and <f>, and can be written
curl (H - 4irM) = | - (E + 4irP),
(129.26)
at
div(E + 47rP) = 0,
(129.27)
V = dL'ldE, M = dL'ldH.
(129.28)
with the notation
Equations (129.25M 129.27) agree in form with the macroscopic Maxwell's equations for a field in matter.t Hence we see that P and M signify the electric and
magnetic polarization vectors of the vacuum.
Note that P and M are zero for the field of a plane wave, where both invariants
E2 - H2 and E • H are zero. For a plane wave, therefore, the non-linear corrections
are zero in a vacuum.
Lastly, let us consider the conditions for the above formulae to be valid. If the
fields are to be regarded as constant, their relative changes over distances or times
of the order of 1/m must be small; this ensures that the corrections to L0 arising
"rom the derivatives are small in comparison with L0 itself. For instance, if the field
is only time-dependent, this gives the obvious condition
<o<m.
(129.29)
For a weak field, however, there is also a more stringent condition. This occurs
because the fourth-order term (129.21) must be much larger than the correction to
L0 quadratic in the derivatives; otherwise, the fourth-order term would have no
meaning. For example, in an electric field depending only on the time, this leads to
the condition
o><m\e\Elm\
(129.30)
which is more stringent than (129.29).
t In making the comparison, it must be remembered that in macroscopic electrodynamics the mean
value of the magnetic field is denoted by B, not by H as here.
584
Radiative Corrections
§129
The condition (129.30) does not arise, however, in the problem of photonphoton scattering considered in the last part of §128. There, we are concerned from
the start with a four-photon process only, described by the fourth-order terms in
the Lagrangian, and the relative magnitude of the other terms in L'.is irrelevant. It
is therefore sufficient if the condition (129.29) is satisfied.
PROBLEMS
PROBLEM 1. Determine the correction to the field of a small stationary charge e\ due to the non-linearity
of Maxwell's equations.
SOLUTION. For H = 0, (129.21) gives
P = dL73E
= (a2/907T2m4)E2E.
(1)
In the case of central symmetry, (129.27) gives
( £ + 47rP)r2 = constant = eu
(2)
the value of the constant being obtained from the condition that as r->» the field is the Coulomb field of
the charge e\. An approximate solution of (2) is
E = (e,/r2)(l - 2a2eî/45îrm V ) ,
or
$> = (€,/r)(l - 2a2e]l225Trm4r4).
<3)
The correction in (3) that is non-linear in ^i is to be distinguished from the linear correction in
(114.6), due ultimately to the non-uniformity of the Coulomb field. The correction (3) is of a higher order
in a, but decreases more slowly with increasing distance and increases more rapidly with è\.
PROBLEM 2. Estimate directly the probability of pair production in a weak uniform constant electric
field in the quasi-classical approximation, to exponential accuracy (F. Sauter, 1931).
SOLUTION. The motion is quasi-classical in a weak field E (which has a slowly varying potential
<f> = - E • r = - Ez). Since the reaction amplitude contains the wave function of the final positron as the
initial "negative-frequency" function, pair production may be regarded as a transition of an electron
from a "negative-frequency" to a "positive-frequency" state. In the former state, with the field present,
the quasi-classical momentum is determined by the equation
e = - V [ p 2 ( z ) + m2] + |e|Ez,
0)
= + V[p 2 (z) + m2] + |e|Ez.
(2)
and in the latter state by
e
The change from the first to the second state implies a passage through a potential barrier (the region of
imaginary p(z)), which separates the regions where the functions (1) and (2) apply with real p(z) for a
given c. The boundaries z\ and Zi of this barrier occur at p(z) = 0, i.e.
e = - m + \e\ Ezu
e = + m + |e| Ez2.
The probability of passage through a quasi-classical barrier is
w <xexp^-2 I |p(z)|dzj
.ex P (-4^/V(>-S*)4
0
Photon Splitting in a Magnetic Field
§130
whence
585
w « e x p ( - irm2l\e\ E),
in agreement with (129.22).
§ 130. Photon splitting in a magnetic field
The non-linear corrections in the electromagnetic field equations give rise to a
number of specific effects in photon propagation in external fields.
In order to put these equations in a more familiar form (cf. the last footnote),
we shall denote the electric and magnetic fields in this section by E and B; D and H
will denote the quantities
D = E + 4TTP,
H = B-4ITM,
P=dL'ldE,
M=dL'ldB.
Equations (129.25H 129.27) then become
divB = 0, curlE = -5B/ar,
div D = 0, curl H = dDldt.
(130.1)
Let us consider photon propagation in a constant uniform magnetic field Bo.
Denoting by a prime the quantities which relate to the weak field of the electromagnetic wave, we have for these the equations
k x H ' = -o>D', k x E ' = wB\
k • B' = 0, k • D' = 0,
(130.2)
with
D; = elkEi, B; = filkHi;
(130.3)
the vacuum permittivity and permeability tensors are functions of the external field
B0. Assuming this field to be so weak that \e\B0lm2<è 1, we find from the Lagrangian
(129.21)
/%
4
(130.4)
H<ik = Sik + TF774 Bl(8ik +
2bibk),
where b = B0/Bo.
The photon frequency is assumed so small that w « m (129.29). However, the
structure of the tensors e* and i±ik does not depend on this assumption; it follows
from the invariance of quantum electrodynamics under spatial inversion and charge
conjugation. The first of these prevents the occurrence in D' of terms having the
form constant x B' or constant x B0(B0 • B') (since inversion changes the sign of E
586
Radiative Corrections
§130
and D but leaves H and B unchanged); the second prevents the occurrence in e*
and fin, of terms antisymmetric and odd in B0, of the form eaciBoi (since charge
conjugation changes the sign of all fields).
Since the problem under consideration has a distinctive plane, namely the
kb-plane, it is reasonable to take the linear polarizations in and normal to this plane
as the two independent polarizations of the photon. The subscripts 1 and || wi
denote polarizations in which the vector B' is respectively perpendicular to the
kb-plane and in that plane.
For perpendicular polarization, the vector H' is, like B', at right angles to the
kb-plane:
B
-(
i +
^
B
»)
H
'
The vectors E' and D' are in that plane. Then, from equations (130.2), we obtain the
photon dispersion relation k = n^, with the "refractive index*' (in ordinary units
n^l+^pBlsin'e,
(130.5)
where 6 is the angle between k and Bo.t
In the second case, B' and H' are in the kb-plane, E' and D' perpendicular to it.
The refractive index is found to be
»1=1 + AI€ ? i Bo sin2 0.
(130.6,
■
45m c
Note that nx ^ n\\. The equality occurs when 0 = 0, n± = nj = 1.
The most interesting manifestation of the non-linearity of Maxwell's equations
with radiative corrections is the splitting of a photon into two in an externr
magnetic field (S. L. Adler, J. N. Bahcall, C. G. Calian and M. N. Rosenblut
1970).
In a constant uniform field, this process occurs with conservation of energy and
momentum.$ In the decay of a photon k into photons k, and k2, we have
û>(k) = co(kt) + co(k2),
k, + k2 = k.
(130.7)
For photons in a vacuum, in the absence of external fields, o> = k and the equations
(130.7) can be satisfied only for three photons moving in the same direction. In that
case, however, the decay is rigorously forbidden by the invariance under charge
conjugation: Furry's theorem (§79) shows that the sum of diagrams with three
photon free ends is zero.
t Expressing B' in terms of H' in the second equation (130.2), we substitute H' from there in the first
equation, and then take the projection of the latter on the direction of b. The product k • E' is expressed
in terms of b • E' by means of the equation k • D' = 0.
t The conservation of momentum is due to the spatial uniformity of the field, but of course occurs
only for processes involving uncharged particles. The Lagrangian for charged particles contains not only
the fields but also the field potentials, which depend on the coordinates even in a uniform field.
§130
Photon Splitting in a Magnetic Field
587
The presence of an external field makes the decay of the photon possible; this
decay is represented by diagrams with three photon, ends and one or more
external-field lines. The possibility is, however, dependent on the nature of the
photon polarization. The dependence may be deduced from the conservation laws
(130.7) and the change in the photon dispersion relation in a magnetic field.
The dispersion relation may be written
<o =fc+ ß(k),
(130.8)
where ß(k) is an increment that is small (in a weak field). Its presence makes
possible, in principle, the fulfilment of equations (130.7) for momenta ki and k2 lying
in a certain narrow cone near the direction of k. Since the directions* of all three
vectors k, ki, k2 are close together, they can all be regarded as parallel to k in the
small terms ß(k), and we can take k\ + k2 = k. The law of conservation of energy
then becomes
ß(Kk) - ßiOckO - ß2(tck - KkO = ki + |k - ki| - k
(where K = k/k); since the dispersion relation depends on the polarization of the
photon, the functions ß, ßi, ßi may be different. Since
[k - k,l - [(k - k,)2 + 2fck,(l - cos ö))m ~k-kx
kk
+ 2 ( k , k | ) *2.
where d is the small angle between k and kt, we have
ß(Kk) - ßi(Kk,) - ß2(Kk,) = kM2/2(k - k,) > 0.
(130.9)
This inequality specifies the properties of the dispersion relation that are necessary
for decay.
For frequencies co <m, the dispersion relation is given by (130.5) and (130.6), so
that ß(k)« - k[n(K)- 1], where the function n(K) depends on the direction of the
vector k but not on its magnitude. Then we must have
k,n,(K) + (k - kx)n2{k) - M K ) > 0.
(130.10)
Since n± > % this condition immediately excludes the decays
Y±-*7|+Y|>
Y±->7»+7±>
where y denotes a photon, and 1 and || correspond to the two polarizations defined
above.t
For the decays
7 i - * 7 ± + 7±f
7|| -* 711+7||>
t Numerical calculations show that n± > rig is true not only when o> < m and (130.5) and (130.6) are
valid, but for all o> < 2m, the threshold for pair production by the photon
588
Radiative
Corrections
§130
the left-hand side of (130.10) is zero, since the functions n, n]y n2 are the same. To
solve the problem in this case, we have to take into account the dependence of the
refractive index on k which appears as a> increases. The required inequality is
fc,n(K, k{) + (k- ki)n(K, /c - Âc)- /CH(K, k) > 0 .
It can be shown by general arguments that M(K, k) is an increasing function of k,
and so this inequality cannot be satisfied, so that the above decays also are
impossible: replacing n(k-k\) and n(ki) by n(k) will certainly increase the sum,
and the result of these changes is a sum equal to zero. This conclusion applies to
any transparent media, and follows from Kramers and Kronig's formula for the
refractive index (see ECM, §64). In the present case, the external field is a
"transparent medium" for photons of all frequencies œ <2m, up to the pair
production threshold, i.e. the photon absorption threshold.
Thus the only decay processes allowed are
yn->7i + Yi>-
(130.11)
Y|-*Y| + 7i.
(130.12)
It has already been noted that the momenta ki and k2 are at small angles d to
the initial photon momentum k. If these angles are neglected, i.e. if the momenta of
all the photons are assumed to be parallel (the collinear approximation), then the
decay (130.12) is impossible, as may be shown in the following way.
Similarly to (127.4), we express the decay amplitude as
Mfi = Mk9lveAe<t*el*,
where e, eu ti are the photon polarization 4-vectors, defined as usual by their
4-potentials A. With the three-dimensional potential gauge, e = (Q, e), we can
rewrite this as
Mfi = Mjk,e,efke!,.
Two independent polarizations are defined by the unit vectorst
eullkxb,
ejkx(kxb).
(130.13)
It is easy to see that, in the expansion
A,A1,À2
where A, Ai, A2 take the values 1 and || (cf. (127.9)), the vectors ex must occur an
t The suffixes || and 1 refer to the polarizations defined above. It must be remembered that the unit
vectors e determine the direction of the vector potential A (and therefore of the field E') and are
perpendicular to B'.
§130
Photon Splitting in a Magnetic Field
589
even number of times (0 or 2) in each term. The amplitude Mfi is invariant under
the CP transformation, and since the potentials A (and therefore e) are CPinvariant, so must be the tensor Mikl. Under the CP transformation e|->e|, e i - > - e i ;
charge conjugation changes the sign of b, inversion changes that of k while leaving
the axial vector b unaltered. Hence, if the vector e± occurs once in any term of the
expansion, the corresponding scalar MAA,A2 must be CP -odd. But it is impossible to
construct a CP -odd scalar from only two vectors (in the collinear approximation)
k = ki = k2 and b, both of which change sign under the CP transformation. This
proves the above statement.
The decay (130.12) is therefore forbidden in the collinear approximation. A
more detailed analysis shows that the ratio of the amplitude of this process to that
of the decay (130.11) allowed in the collinear approximation,
^~fl2^a(B0/Bcr)2,
(130.14)
where
Bcr = m2l\e\
( = m2c3l\e\h = 4.4 x 10,3G);
the angles # are estimated from (130.9) as ü2~ n ± - nj.
The fact that the only possible decay (in the principal approximation) is
7l|-*7i + 7± implies that 1 polarization is eventually established in an unpolarized
photon beam propagating in a magnetic field.
Let us now calculate the decay amplitude Mfi = M n j by perturbation theory,
assuming that B 0 ^ BCT.
The first non-vanishing Feynman diagrams (with respect to a, and with respect
to the external field) are of the form
(130.15)
with all possible permutations of the ends, three ends corresponding to photons and
one to the external field. In the collinear approximation, however, the amplitude
corresponding to these diagrams is zero. For, as a result of gauge invariance, the
external field can appear in the amplitude of the process only as a 4-tensor of its
field strengths FM„ and the photon polarization 4-vectors only in the antisymmetric
combinations
with the wave 4-vectors. The final expression for the amplitude is constructed from
the external field tensor FM„, the tensors f^, /iM„ and f2tlv of the three photons and
their wave 4-vectors JcM, k^, k2il\ it must be linear in each of the tensors /M„, and for
the diagrams (130.15) it must be linear in FH„ also. In the collinear approximation
the 4-vectors k{ and k2 reduce to k: k\ = kcui/cu, k2 = k<ù2lo>. Under these conditions,
590
Radiative Corrections
§130
any scalar product formed as stated above is identically zero: we can easily show
that such a product will contain at least one zero factor k2 or ke.
In the collinear approximation, therefore, the first non-zero contribution to the
decay amplitude comes from hexagonal diagrams of the form
<13(U6>
K~~\_)—
with three external-field lines.t The amplitude corresponding to such diagrams
involves three factors FM„. Scalar products of this kind need not be zero, but all
non-zero products contain the photon wave vectors only through the tensors f^: it
is easy to see that the addition of further factors k will lead to the presence of zero
factors k2 or ke in the products. The components of the tensor / ^ are the same as
those of the photon fields E' and B'. This means that, if the decay amplitude
corresponding to the diagrams (130.16) is represented as the matrix element of an
operator, then that operator, expressed in terms of the photon field operators, is
independent of the photon frequencies. Hence it follows in turn that the calculation
of the scattering amplitude corresponding to the diagram (130.16), by means of the
Lagrangian (129.17), gives the correct answer without restriction to the case co < m.
It has been shown at the end of §127 how the interaction Hamiltonian is
obtained from the Lagrangian L found in §129. We now have a process involving
three photons, and the corresponding interaction operator is found from the terms
in the expansion of L which contain products of three photon fields E' and B\ Here
we need consider only the term
(B' • B0)(E' • Bo)2,
(130.17)
in which each of the vectors B' and E' appears as a scalar product with B0: the
products E'2, B'2 and E' • B' arise, in the four-dimensional notation, from scalars of
the form fßJß% which in the collinear approximation are identically zero. The
selection of the term containing one factor B' and two factors E' is made because
the process under consideration involves one || photon and two 1 photons; for the
former, the field B' has a component along B0, and for the latter the field E' has such
a component.
The Lagrangian L is expressed in terms of the invariants ^ = 2(B 2 -E 2 ) and
^ = E • B. The required term in the expansion comes from a term proportional to
5F£2. A calculation using (129.17) gives for this term
13 6
*
630rrW
m 2.
Putting B = B0 + B\ E = E\ and taking the term B0 • B' from 9 and Bo • E' from %
we get the required expansion term having the form (130.17). Thus the three-photon
t The corrections arising from the inclusion of non-collinearity in the diagrams (130.15) would give a
contribution to the amplitude that is of the next higher order in a relative to that from (130.16).
§130
Photon Splitting in a Magnetic Field
591
interaction operator for the decay y^-* y\± + 721 is
V(3) =
* \ * {(Bo • ÊiXBo •feKBo• B') d \
(130.18)
where
B' = iV(4ir)kxc|e l *" r " ttf) c k| ,
Ê; = -iV(47T)a> 1 e 1 6 i(kl — *l)eilX9
and similarly for Ê$; cf. (127.26), (127.27).t
According to the rules given in §64, the decay amplitude Mfi is calculated from
the definition
Sfi = (/| J V(3) df |î) = - i(2ir)4«(4)(k - k! - k^Af,,,
and is
M
/< = " ' ?i* i - 8 (47r)wü)coi(ü2ßo sin3 0
0 being the angle between k and B0. The decay probability per unit time is (see
(64.11))
dw = (27r)45(k - ki - k2)S(oi - a), - a>2)\Mfi\2 „
f fc,d'fc2 „ ,6;
' 2 • 2<o • 2o>i • 2a>2 * (2ÎT)
the extra factor | takes account of the decrease in the phase volume due to the
identity of the two final photons. The first delta function is eliminated by integration over d3k2. To eliminate the second we note that, if dispersion is neglected,
a) - o)i - coi = k - k\ - |k - ki|
Irlr.
k-k>
(l-COS#i),
and therefore^
( I COCUIO)2S(Û> — co\ — (02) d cos #i • 2TTÙ)\ du>\
0 0
Ü)
— 2ix I o>f(a> - o>i)2dcüi = 7TÜ)5/15.
0
t The coefficient in (130.18) is doubled because E( and E2 can be taken from either of the two factors
E' in L.
% Here it is assumed that, with dispersion taken into account, the argument of the delta function in
fact has a zero at some cos #1 < 1. Thus a dispersion of (his type is necessary for decay to be possible,
but the decay probability itself does not depend on the amount of the small dispersion.
592
§131
Radiative Corrections
We finally have for the total photon decay probability per unit time (in ordinary
units)
w
a 3 / 13 \ 2 mc 2 /fad \ 5 /B 0 sinfl\ 6
T5^\315J h KinP) \ Bcr /
= 018a6r^/Msin^y/A)7^)
1
5
.
(130.19)
n \ Sirmc J \mc J \mcl)
As already mentioned, the condition co < m is not necessary for this formula to
be valid. Its validity is restricted only by the condition that the terms corresponding
to eighth-order diagrams be small. To obtain an estimate, we note that the
eighth-order matrix element may contain, for example, a term differing from the
sixth-order terms by a dimensionless invariant factor of the form (eF^kJm3)2. The
condition for this to be small is a very weak one:
o)<m(m2l\e\B0).
§131. Calculation of integrals over four-dimensional regions
We shall now give some rules and formulae that are useful in the calculation of
integrals arising in the theory of radiative corrections.
The typical form of integral corresponding to a Feynman diagram is
HW
*
\d2
,
(13...)
where au <*i, •. • are second-degree polynomials in the 4-vector k, /(k) is a
polynomial of some degree n', and the integration is over the whole of fourdimensional fc-space.
A convenient method for calculating such integrals, due to Feynman (1949), is
based on an initial transformation or parametrization of the integrand by the use of
further integrations with respect to auxiliary variables £i, £2>.... :
s
—
! — = ( , - Di
f ,K, ...J \ it..(ai£i
f:+ a*.;+•
■+j- - » .
a a ...a
' )
&+ • • • + a ^ )
x 2
n
o
o
2
n n
(m.«
This transformation replaces the n different quadratics in the denominator by the
nth power of a single quadratic polynomial.
Eliminating the delta function by integrating over d£n and using new variables
defined by
§131
593
Calculation of Integrals over Four-dimensional Regions
we get as an equivalent form of (131.2)
1
*«-2
X,
= ( n - l ) ! \dxx [ dx2... f dx„_ I X
J0
J
J
1
[a,xn_,+ a 2 (x n - 2 - x„-i) + • • + a B (l-x,)]"'
a\a2...an
(131.3)
When n = 2, this formula becomes
^
=
J[a,x + a 2 (l-x)] 2 '
(l3L4)
0
as can be easily verified. For any value of n, the formula can be proved by
induction: on carrying out the integration over dxn~\ in (131.3), we get on the
right-hand side the difference of two (n - 2)-fold integrals of the same form. If the
formula is assumed valid for these, we have
_L_r
!
«
l
a\ - ai La2a3.. ► a* aia 3 ... any
which is equal to the left-hand side of (131.3).
By differentiating (131.3) with respect to au a2, e t c > w e c a n derive similar
formulae which can be used for the parametrization of integrals whose denominators contain any of the polynomials in powers above the first.
The divergent integrals are regularized by subtracting from them other integrals
of similar form. To determine the difference, it may be convenient first to
transform the difference of the integrands (each of which is already transformed by
means of (131.2)) as follows:
i
J
an
LÄ_ f
n(a-b)dz
U3I
bn
J [(a-b)z + b]n+l°'
o
After the application of (131.3), the four-dimensional integration in (131.1) becomes
f
/(*)d 4 k
J [(k-lt-a*r
(131 6)
'
where I is a 4-vector and a 2 is a scalar, both of these depending on the parameters
X|,..., xn-t; the scalar a2 will be assumed positive.
If the integral (131.6) converges, we can make the change of variables k -1 -+k
(a shift of the origin), which gives (with a different function f(k))
f f(k)d4k
J (k*-«V»
(1317)
§131
Radiative Corrections
594
the denominator now involving only the square k2. In the numerator we need only
consider scalar functions / = F(k2): for integrals having numerators of any other
form,
k»F{k2)d4k
2—r~5vr- = Au,
WF(k2)d4k_
/
(k ?—zi^r
-a )
2
2
1
g' /
(131.8)
k2F(k2)fk
(k2-a2)"
f *!^ ^ ^ ^ ( k O d
'
(131.9)
k
= ^ (g'1"«'" + «"g" + g"Vp) /
Kvn,2)d4k
y^jpr?
(131.10)
and so on, as is evident from symmetry on integration over all directions of k.
In the original integral (131.1), each of the factors ait a 2 , . . . in the denominator
has (as a function of kp) two zeros, which are avoided in thé integration over dk0
according to the general rule (§75). After the transformation to the form (131.7), the
In simple poles of the integrand are replaced by two poles of order n, which are
avoided according to the same rule (path C in Fig. 25). By moving the contour of
integration as shown by the arrows, it can be converted to the imaginary axis in the
ko plane (path C"). Thus the variable ko is replaced by ikô with ko real. Then, writing
also k' for k, we have
k2 = k^-k 2 -*-(k 0 2 + k'2) = -k' 2 ,
(131.11)
where k' is a 4-vector in the Euclidean metric; and
d4k-*id4k'
= ik'2d(ïk'2)dCï,
where dfl is an element of four-dimensional solid angle. The integration over dCi
gives 2TT2 (see Fields, §111), and so
d4k-*iTT2k'2d(k'2).
ci
V
■M/-
FIG.
25.
(131.12)
§131
595
Calculation of Integrals over Four-dimensional Regions
Putting k'2 = z, we have finally
(F(kl)d*k_
(
J (F^Y ~ ~
In particular,
f
d4k
_.
/(F(-z)zdz
1} l7r
o
(l3113)
J (z+ <*>)" *
(-l)"iV
,,
J {k'-a'Y ~ ««-»(n-lXn-2)'
(13L14)
/[(t-ff-aV
(,3U5)
The logarithmically divergent part of the integrals (131.7) can be separated as
It is easily seen that in an integral of this type we can replace k by k + l: the
difference is
and is a convergent integral, so that the change k -» k + / is certainly permissible in
this integral. On doing so and also changing the sign of k, we get the same quantity
with the opposite sign, and it must therefore be zero.
A linearly divergent integral would have to have the form
k»d4k
/ [(k-l)2-a2r
(131.16)
but in fact such an integral is only logarithmically divergent: the integrand tends
asymptotically to k*l(k2)2 as k ->o° and gives zero on averaging over directions. The
change of origin, however, adds a constant to the integral (131.16). This can be
shown for the infinitesimal change k -> k + SJ, by calculating the difference
A
'-JU-«|L.¥-&rjÇ.}' l < f c
(»LIT)
As far as the first order in Of,
A(1
[\4k»(k8l)
81"
}
J4| ,
In the first term, averaging over directions replaces the numerator by k25/'1 (cf.
(131.9)), and hencet
^ = a28l» j(kfkal)i
= -l2iTT28l*.
t The corresponding result for finite I can be obtained by more laborious calculations.
(131.18)
596
Radiative Corrections
§131
In the final expressions for the radiative corrections there is frequently a
transcendental function defined by the integral
FU) =
J!°&ii±l)dl,
03U9)
0
sometimes called Spence's function. Some of its properties (given here for
reference) are
F(è) + F(\lÇ) = ÏTT2 + l2log2Ç,
F(-E)+F(-l + £)=-67r2+log£log(l-a
F(1) = ÎW 2 ,
F(-l) = -U2.
(131.20)
(131.21)
(131.22)
The expansion for small £ is
F(£) = £ - k 2 + k 3 -f6É 4 +---.
I
(131.23)
CHAPTER XIII
ASYMPTOTIC FORMULAE OF QUANTUM
ELECTRODYNAMICS
§ 132. Asymptotic form of the photon propagator for large momenta
THE first-order term (with respect to a) in the expansion of the polarization
operator 9>(k2) has been calculated in §113, and it was found that for \k2\ > m2 this
term is, with logarithmic accuracy,
^(k 2 ) = ^ - k 2 l o g ^ l .
(132.1)
It was also mentioned that the derivation of this formula (as a first-approximation
correction to the propagator 4TTD~X = k2) assumed the condition
£logl£!«U,
(132.2)
which placed an upper limit on the permissible values of \k2\. We shall now show
that the expression (132.1) is in fact valid also with the much weaker condition
£ log |Jh(*l.
(132.3)
The proof is as follows.t We first note that, although in principle the condition
(132.3) allows contributions to &(k2) from the terms of all orders (in a) in the
perturbation-theory series, in every order n only the terms - a " log"(|k|2/m2) need
be considered, which contain the large logarithm in the same power as a ; the terms
containing lower powers of the logarithm are certainly small, since a <^ 1.
The analysis of the perturbation-theory series for 0> can be reduced to that of
the series <3 and P \ using Dyson's equation
&(k2) = i ~
tr | y&ip + k)P(p + Je, p ; k)<S(p) 0p\
(132.4)
see (107.4). Since the function &(k2) is gauge-invariant, it can be calculated with
any gauge for <3 and T. The most convenient here is the Landau gauge, in which the
t This formulation and conclusions are due to L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov
(1954).
597
598
Asymptotic Formulae of Quantum Electrodynamics
§ 132
free-photon propagator has the form (76.11):
D,Ak) = ^(gllv-^)
(132.5)
(D (0 = 0 in (103.17)). It is found that in this gauge the perturbation-theory series for
<ê and r** do not contain any terms with zero power of the logarithms. In (132.4) it
is therefore sufficient to substitute the zero-order approximations % = G, P1 = yM,
obtaining
0>(k2) = i • ^ p tr | yMG(p + k)y*G(p) ^ .
(132.6)
This is the Feynman integral corresponding to the diagram (113.1) of the first
approximation (with respect to a), and leads (after appropriate renormalization) to
(132.1).
In order to prove the foregoing statements, let us first examine the origin of the
logarithm in the integral (132.6). It is easily seen that the logarithmic term comes
from the range of integration
p2>\k2\
when \k2\>m2:
(132.7)
formally expanding G in powers of Hyp, we, have
G(p)~\lyp
= yplp2,
G(p-k)~l/(yp-yk)
1- — yk — yk —
« — + — -yk
yp yp
yp yp
yp
yp
.- YP . (7P)(7fcXyP) i
p7
(p2)2
! (7P)(7fc)(7P)(7fc)(7P)
On substitution in (132.6) the first term, which is independent of k, is removed by
regularization (in accordance with the condition 9>lk2-*Q when k2-*0); the second
term gives zero on integration over the directions of p; the third integral is
logarithmically divergent with respect to p2, and on taking it from p 2 —
• |lc2| (the
2
lower limit of the range (132.7)) to some "cut-off parameter" A we get
-£***$
(132 8)
-
In regularizing, we have to subtract from 91k2 its value for k2 = 0. But, since
logarithmic accuracy presupposes the condition |k2| > m2, in calculation to this
accuracy the regularization is achieved by subtracting the value for |k2| ~ m2, and
A2 in the argument of the logarithm is replaced by m2, giving (132.1).
§132
Asymptotic Form of the Photon Propagator for Large Momenta
599
Since the desired corrections to ^ and P are logarithmic, their inclusion makes
these quantities differ from G and y* by slowly varying logarithmic factors. In the
exact integral (132.4), therefore, the important range will be the same one (132.7) as
in the approximate integral (132.6). However, we cannot simply put Jc=0 in
TM(p + k, p\k): since the integral is quadratically divergent, its regularization involves also the next two terms in the expansion of P*(p + Jc, p; k) in powers of k.
Here we shall consider only the corrections to TM(p, p;0), which show sufficiently
clearly the importance of the choice of gauge and the difference in the nature of the
integrals arising from diagrams of different types. It may also be noted that a
similar analysis of ^ is not needed, since the corrections fo T and <$ are related by
Ward's identity (108.8).
The first-order correction (with respect to a) to T(p, p;0) corresponds to the
diagram
and hence to the intégrait
r*
(,)
= -ia | yKG(Pl)y»G(pdyvDXv(p
-
PI)<*4PI/(2TT)4.
(132.9)
In the ordinary gauge,
DUp
- Pi) = gx* • 4TT/(P -
pt)2,
and the important range in the integral is p]>p2,
divergent. On calculating the integral
f 7À(YPI)7M(7PI)7A
P M O ) « -A-„i
F
47ral
(P?)3
l
where it is logarithmically
d4px
(2TT)4
(13210)
and regularizing the logarithm, we get
r *.>__«
M o g j£.
In the Landau gauge, (132.10) is replaced by
p « > « -4irai | (7A(7P.)7'1(YP.)7x -P?7"}d 4 p./(P?) 3 (2^) 4 .
t To avoid misunderstandings in a comparison with the results of §117, we may note that in §117 the
two electron ends of the diagram were assumed to be physical, whereas here we assume that
p2>\k2\> m2, and these two lines are therefore certainly non-physical.
600
Asymptotic Formulae of Quantum Electrodynamics
§132
Averaging over the directions of px and reducing the matrices y, we find that this
integral is zero and the logarithmic term in TM(1) disappears.t
In the second-order corrections (with respect to a) we take the diagram
The corresponding integral is
r*(2) = -a2 j ykG(p2)y>G(pl)y*G(pl)ypG(p2)y<' x
- p2)dApxd4p2l{2irf.
x Dvp(p2-Pl)DXa(p
In the ordinary gauge of the D functions, this integral includes a term in which the
logarithm is squared, arising from the range of integration
p]>p\>p2\
(132.11)
when p2 is neglected in the argument of Dvp(p2- p0, the integration over d4pi
becomes the same as in (132.9) and gives logpL and the subsequent integration
over d4p2 is likewise logarithmic, giving \og\pllm2). When the Landau gauge is
used for the D functions, however, the logarithmic terms disappear from both
integrations.
A similar situation occurs for all the other diagrams in the skeleton diagram
1
1
A
(132.12)
Diagrams of other types, with intersecting photon lines, for example those in the
skeleton diagram
1
1
A
(132.13)
t The corrections to G ' in both gauges, as derived from the correction r0* by means of the identity
(108.8), are of course in agreement with the results of §119.
§133
The Relation Between Unrenormalized and Actual Charges
601
(cf. (106.11)), do not in any gauge contain terms involving the necessary power of
the logarithm; there is no range of the variables in which the integral reduces to
several successive logarithmic integrations.
These arguments, and similar ones for the subsequent terms in the expansion of
T in powers of k, confirm that in the Landau gauge there are no corrections to ^
and T involving the necessary powers of the logarithm; thus the expression (132.1)
is in fact valid even if the condition (132.3) applies.
The function 3)(k2) which corresponds to the polarization operator (132.1) is
S(k 2 ) = -7Tl—/ ii M—TTTWi—^(132.14)
kM-(a/37r)log(|k 2 |/m 2 )
Because of the condition (132.3), there is no need to expand this expression in
powers of a.
§133. The relation between unrenormalized and actual charges
The applicability of (132.14) is, however, limited on the side of large |k2|,
because the denominator decreases. The derivation of that formula was based on
neglecting the diagram (132.13), and others with even more intersecting thick photon
lines, in comparison with (132.12). But each such added line brings into the diagram a
factor e22>, with 2> the exact propagator. The small parameter is then not a = e2 but
l~(a/37r)log(|k 2 |/m 2 )
<U.
(133.1)
When \k2\ increases and this quantity becomes comparable with unity, the small
parameter essentially disappears from the theory.
The resulting situation can be more clearly understood if the renormalization in
the derivation of (132.14) is made not "en route" but through the use of an
"intrinsic" electron charge ec, which will here be chosen so as to give the correct
value of the observed physical charge e (§110). If the integral is "cut off", as was
done previously, at some upper limit A2, the intrinsic charge is a function ec(A2) and
the limit A -♦ » must finally be taken.
With this treatment, the polarization operator is
«kl>-£***$
(the expression (132.8) with ec in place of e), and therefore
a(fc:) =
F l + (e;/3.)'l0g(AV|)-
Now determining the physical charge e from the condition
e2c2(k2)^4Tre2lk2
when
k2^>~m2,
(133 2)
-
602
Asymptotic Formulae of Quantum Electrodynamics
§133
we have
e2=
l + (62f/37r)iog(A2/m2)'
(1333)
l-(e J /37r)log(A 2 /m 2 )-
°33'4)
or
e?=
If we formally take the limit A^^c in (133.3), then e2-*0 whatever the form of
the function ^(A). This "zeroizing" of the charge means, of course, that no
rigorous renormalization is possible. The limit cannot be taken, however, without
violating the assumptions made in deriving (133.3). It is seen from (133.4) that, as A
increases (for a given value of e2), e\ also increases; and the formulae cease to be
valid even when e2c — 1, since their derivation is based on the assumption that
g<\
(133.5)
as the condition for perturbation theory to be applicable to the "intrinsic"
interaction. The failure of the inequality (133.5) to be satisfied as A increases is of
fundamental significance. It shows that quantum electrodynamics is logically incomplete as a theory that is based on weak interaction. This essentially implies that the
existing theory as a whole is logically incomplete, since its entire formalism is based on
the possibility of treating the electromagnetic interaction as a weak perturbation. All
the quantities calculated by the theory are found as series in powers of el, which are in
fact asymptotic series. In order for them to have a definite meaning when e2c is not
small, further arguments would be needed which do not follow from the general
principles of the existing theory.
It must also be emphasized, however, that in quantum electrodynamics these
difficulties cannot be more than theoretical ones. They arise at enormous energies
that are of no practical significance.t We may expect that in reality the electromagnetic interactions will very much sooner be "merged" with the weak and
strong interactions, so that pure electrodynamics is no longer meaningful.t
To conclude this section, we shall show how formulae (133.3) and (133.4) can be
derived by simple arguments based on the significance of renormalization and on
dimensional reasoning (M. Gell-Mann and F. E. Low, 1954).
Let us consider the square of the unrenormalized charge as a function of the
cut-off parameter, e?(A2), and define a function d as the ratio of the values of e2c for
two different arguments: eliA]) = e2c(A2) d. When A?, fi\>m, the function d is
independent of m and, being dimensionless, can depend only on the likewise
t For example, the equation (CX/TT) log(e2/m2) = 1 is satisfied when e ~ 1093m.
t The opposite situation occurs in theories where the interaction between particles is mediated not
by the electromagnetic field but by Yang-Mills fields. The relation between the renormalized and
unrenormalized charges in such theories is given by an expression of the type (133.4), byt with the
opposite sign in the denominator, so that, for a given value of e2> the unrenormalized charge el
decreases with increasing A. This is called asymptotic freedom of the theory. Such a theory is, of
course, fundamentally different from the theory with zeroized charge.
§134
Asymptotic Form of the Scattering Amplitudes
603
dimensionless quantities e2c(A]) and A2/A2:
e2c(Al) = e2c(A}) d[e2M), Ml Ml
(133.6)
From this functional relation, we can derive a differential equation, by writing it
for infinitesimally close values of Ai and Ai Denoting A, by £ and putting
A2 = £ + d£, we obtain for ac(£) = e2c(A2) the differential equation
dac = <t>(ac)dÇI&
(133.7)
with
<M«c) = ac[dd(ac, x)/ax]x=1;
(133.8)
we have used the fact that d(ac, 1)= 1, from the definition (133.6). Integration of
(133.7) from £ = A? to £ = A\ gives
e?(Aj)
log(A^/A?)= I
dal<t>(a).
(133.9)
2
e c(\})
Throughout the range of integration, e\ is small. We can therefore use for <f>(a)
the expression corresponding to the first approximation of perturbation theory. The
correction to the unrenormalized charge e\ is e2ck2&(k2). Taking the first approximation (132.1) for the polarization operator, we find
d(ac, MIA]) = 1 + (ac/37r) log(A22/A?), <M«c) = a?/3ir,
and the integration in (133.9) then gives
3^,0*fi = ^j-iélj-
(l33l0)
As A?-*~m 2 , the unrenormalized charge ec(A}) tends to the actual charge e, and
(133.10) then agrees with (133.3) and (I33.4).t
§ 134. Asymptotic form of the scattering amplitudes at
high energies
Let us consider the asymptotic form (at high energies) of the amplitudes and
cross-sections for two-particle scattering processes (l + 2-»3 + 4). For the basic
electrodynamic processes in the first non-vanishing approximation (with respect to
a), this problem can be solved by means of the specific formulae derived in the
preceding chapters, which are valid at all energies. Here, however, we shall discuss
t The renormalization group method based on the functional properties of propagators and vertex
parts is systematically developed in the book by Bogoliubov and Shirkov cited at the end of § 112.
604
Asymptotic Formulae of Quantum Electrodynamics
§134
the question from a more general standpoint, which enables such asymptotic fornrs
to be derived directly.
As in §66, we use the invariants
s=(P\ + Pi)\ t = ( P i - Pi)2, u = (p, - p4)2,
(134.1)
with p\ + p2 = P3 + P4; the notation corresponds to reactions in the 5 channel, which
will be considered here. In the ultra-relativistic case, when the energies are much
greater than the particle masses, the energies of the two particles in the centre-ofmass system are approximately equal. We denote by e the sum of the energies of
the colliding particles, and then have, in the centre-of-mass system, Pi = (ie, Pi),
Pi = (k, -pi), P3 = (ie9 p3), PA = (ie, -p 3 ), p? = p2 = 4e2, with
s = e\ t = - i s ( l - c o s 0),M = - j s ( l + cos0),
(134.2)
6 being the angle between pi and p3.
Let us first consider the asymptotic form of the reaction cross-section for a
fixed value of the scattering angle 0. Then all three variables 5, t and u are
proportional, and tend to infinity together. In the ultra-relativistic case, the particle
masses cannot appear in the result, and the only quantity having the dimensions of
length is 1/e (=hc/e). Hence it follows from dimensional arguments that the
differential cross-section for two-particle reactions decreases with increasing
energy in the asymptotic form
dal do a 1/5
as
s,|t|,|u|->«>.
(134.3)
If the cross-section is related not to the solid-angle element do but to the
differential dt, we have, since do oc dtjs>
daldt a Us2.
(134.4)
The cross-section is expressed in terms of the scattering amplitude (in the ultrarelativistic case) as da/do a \Mfi\2ls\ see (64.22), (64.23). The law (134.3) therefore
means that in the asymptotic limit the scattering amplitude is independent of s :
Mfi = constant.
(134.5)
It is clear from the manner of the derivation that these results apply not only to
the first non-vanishing approximation of perturbation theory, but also to higher
approximations (those taking account of radiative corrections), if logarithmic
factors (of the form log s/m 2 ). are ignored; the dependence on dimensionless
logarithms cannot, of course, be determined from dimensional arguments.!
A different situation arises if s increases with t fixed, i.e. with a fixed square of
the momentum transfer. This is scattering through small angles, which decrease as
t The summation of series containing logarithmic corrections may lead to an exponential dependence on the logarithms, which changes the exponent in the power law. This change is, however, small if
a is small.
§134
Asymptotic Form of the Scattering Amplitudes
605
the energy increases:
5 ^oo, |t| ~ so 2 = constant, d - (\t\ls)\
(134.6)
In such a case, dimensional arguments allow us to establish only that the combined
power of 1/s and 1/f in dcrldt is 2, and in the amplitude Mfi zero.t Hence, to find
the part of the cross-section that decreases least rapidly with increasing s, we have to
separate the factor that has the greatest power of 1/f. Such factors arise only if the
Feynman diagram can be divided into two parts between the ends 1,3 and 2,4 by
cutting the lines of virtual particles. The total 4-momentum of such lines is px - p 3 ,
and this leads to the factor that depends on t = (pi - p3)2. Thus the asymptotic form
of the diagram in the range (134.6) depends on the nature of the possible cuttings of
diagrams in the t channel.
Similarly, the asymptotic behaviour in the range
s ->oo, |u| ~ S(TT - 6)2 = constant,
|TT — 0| ~ (\u\ls)\
(134.7)
corresponding to scattering through angles close to IT, is governed by the nature of
the possible cuttings of diagrams in the u channel, i.e. between the ends 1,4 and 2, 3.
The simplest example is electron-electron scattering, described by the two
diagrams (73.13) and (73.14). The first of these allows cutting in the t channel at the
virtual photon line, and it determines the asymptotic form of the scattering amplitude
in the range (134.6). The virtual photon line corresponds to a D function that is
proportional to lit. The asymptotic forms of the amplitude and the differential
cross-section are
Mfi oc sit,
da a dtlt2.
(134.8)
In the limit (134.7), close to the backward direction, the asymptotic form is determined by the "exchange" diagram (73.14); in that limit,
Mfi oc s/u,
da <* dulu2.
For mutual scattering of different particles (electron and muon), there is no
exchange diagram, and so the cross-section for scattering through angles 6 Ä IT
decreases in accordance with (134.3) and (134.4).t
We shall show that these results for the asymptotic behaviour of electronelectron scattering are unaffected by the inclusion of radiative corrections. To do
so, let us consider the corrections of various orders to the diagram (73.13).
It has already been shown that the diagrams which are corrections to the
internal D function (see (113.11)) or to the vertex parts (see (117.1)) lead only to
logarithmic corrections in the amplitude; they do not alter the power law (134.8).
We shall show that the same is true of the diagram allowing cutting at two (not one)
t These arguments assume a constant |f | > m2. The results thus obtained remain valid, as regards the
dependence on s (i.e. on the energy), even when \t\ — m2.
t All these statements are, of course, in accordance with the results of §81; see (81.11) and Problem
6.
606
Asymptotic Formulae of Quantum Electrodynamics
§134
internal photon lines:
P3
+ q
P,
p3-pi-q
—*
*
'—«
P
P
+q
P - q
4
<
2
(134.9)
P
2
The scattering amplitude corresponding to this diagram differs from that corresponding to (73.13), in that the factor lit is replaced by
(y(P\ + q))(y(P2-q))
dA
(Pi + q) 2 (P2-q) 2 q 2 (P3-Pi-q) 2 q '
followed by integration over d4q. The important range of integration is the one
which gives rise to the lowest power of 1/s. For this, q must always be small in
comparison with p{ and p2. Rejecting the terms which are then small (and also the
terms p] = p2 = m2), we can rewrite this expression as
(ypôiypi)
dA
(P\q)(P2q)q2(Pi- P\~ q)2
The denominator does not contain s if q0 and qx (with the x-axis along pi = -P2) are
oc 1/Vs; qy and qz may be * V|t|; then the range of integration <* Us. The order of
magnitude of the numerator is p\Pi<* s. Thus the replacement of one internal
photon line in the diagram by two does not affect the dependence of the diagram on
s (for a given t)t That is, the contribution of the diagram (134.9) to the scattering
amplitude has the same asymptotic behaviour (134.8) as that of the principal
diagram. The position is unaffected by adding other parallel internal photon lines in
the diagram, and also by including corrections to the internal electron lines.
This is a general result: any diagram which can be cut in the t or u channel into
two parts across any number of internal photon lines corresponds to an amplitude
contribution which has the asymptotic form Mfi <* sit with t constant or slu with u
constant (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967; H.
Cheng and T. T. Wu, 1969).
As a second example, let us consider Compton scattering described by the two
diagrams (74.12). These do not allow cutting in the t channel, but the second
diagram can be cut in the u channel at an internal electron line. In the notation of
the present section, it is
|prp4
P3
*+- -
«
(134.10)
p^
This means that the scattering is largely concentrated near the backward direction, as
t Let us mention again that only power-law asymptotic forms are under consideration, and so we
need not take account of logarithmic divergences in the integrations. Diagrams of the form (134.9) will
be further studied in §137.
1
§134
Asymptotic Form of the Scattering Amplitudes
607
already noted at the end of §86; see (86.20). To find the asymptotic behaviour in
this range, we note that the factor G corresponding to the internal line in (134.10) is
in order of magnitude l/y(pi-p4) a 1/V|«|. Hence the scattering amplitude Mfi <*
a(sl\u\y; this includes a factor a because the diagram (134.10) is of the second
order. The differential cross-section is therefore dal du « a2l\u\s. The integral of this
expression with respect to \u\ is governed by the range |M|<^S. The total crosssection then decreases with increasing energy as a <* a 2 /s, or more exactly a <*
(a 2 /s)log(s/m 2 );cf. (86.20).t
For this process, however, radiative corrections alter the asymptotic behaviour.
The change is due to sixth-order diagrams of the type
!
I
P4
_^J
!
1
(134.11)
U*-
P2
In the t channel, these allow a cut across two internal photon lines, and therefore
contribute to the amplitude, with asymptotic form Mfi * a*slt; the factor a3
corresponds to a sixth-order diagram. When 5 is sufficiently large, this part of the
amplitude becomes the principal one, and the differential cross-section is
then
daldt oc
a
6 2
lt .
The integral of this expression with respect to t is governed by the range of small
|t| ~ m2, i.e. by the range of scattering angles 0 - m V s ; note that the scattering is
now mainly forward, not backward. The total cross-section then no longer
decreases with increasing energy:
a*a6lm2
= a4rl
(134.12)
The decreasing part of the cross-section becomes comparable with this constant
part when e = V s « m/a 2 .
A similar situation occurs for photon-photon scattering. In the first nonvanishing approximation, this is described by the "square" diagrams (127.1), which
can be cut across two internal electron lines. Integration is carried out with respect
to the 4-momentum of these lines in the diagram; momenta ~ V s are important,
and small values of t or u are not especially significant. The asymptotic form of
these diagrams for any constant t or u is given by (134.5): Mfi = constant « a 2 . The
total cross-section decreases with increasing energy: a <* a4ls (cf. (127.23)); angles
close to zero or TT have no special significance here. In the eighth order, however,
there are diagrams which can be cut (in the t or u channel) across two internal
t The exact form of the dependence of the cross-section on \u\ or |f| when these are ^ m2 cannot, of
course, be ascertained from the arguments given here. It is assumed that the integral with respect to \u\
or \t\ converges at values — m2. This is in fact so for all processes except elastic scattering of charged
particles.
608
Asymptotic Formulae of Quantum Electrodynamics
§135
photon lines, for example
I
i
(134.13)
These diagrams give an asymptotically constant cross-section: a <* a 8 /m 2 when
Vs§>m/a2.t
The asymptotic constancy of the total cross-section is a characteristic property
of scattering processes whose diagrams can be cut (in the t or u channel) across
internal photon lines. It occurs even when more than two particles are present in
the final state of the reaction.
§135. Separation of the double-logarithmic terms in the
vertex operator
The corrections having the form (aL) n (where L is the large logarithm) can
become important only at enormous energies, as already mentioned at the end of
§133, and therefore are of purely theoretical significance. But there are also much
larger corrections, of the form (aL2)n, in the amplitudes of actual scattering
processes. Such terms, containing the square of the logarithm to the same power as a,
are called double-logarithmic terms.
The characteristic expansion parameter in the double-logarithmic corrections is
(a/7r)logV/m 2 ),
(135.1)
where e denotes the energies occurring in the problem (for example, the total
energy of the colliding particles in the centre-of-mass system). The condition for
perturbation theory to be valid is that this quantity be small; it ceases to be satisfied
at energies
e-m
exp[iV(îr/a)] - 3 x 104m.
(135.2)
Let us now try to avoid this limitation and derive formulae valid when
(a/7r)logV/m2):£l.
(135.3)
This clearly requires the summation of an infinite series of corrections in all powers
(aL2)n.
The double-logarithmic corrections occur in cases of two kinds. One includes scattering through a fixed finite angle; as has been shown in §134, the
t The cross-section for coherent scattering of a photon in the field of a nucleus is asymptotically
constant even in the first non-vanishing approximation, described by "square" diagrams in which two
ends are external-field lines; see (128.7). In reality, however, these diagrams would have to be
represented in the form (134.11), the upper continuous line being the nucleus line. The external-field
lines then become internal lines in the diagram, and the reason for the asymptotically constant form
becomes evident.
Separation of the Double-logarithmic Terms
§135
609
cross-sections always decrease in the asymptotic high-energy range. In such cases,
the double-logarithmic corrections are closely associated with the infra-red divergence. These cases include, in particular, elastic scattering of electrons in an
external Coulomb field; thefirstdouble-logarithmic correction to the cross-section
has been found in §122. The present section and §136 deal with the complete
determination of these corrections, under the condition (135.3).
The other class of cases includes reaction cross-sections which decrease with
increasing energy for a given square of the momentum transfer, i.e. for scattering
angles which asymptotically approach zero or TT; as shown in §134, this occurs for
processes whose diagrams cannot be cut across internal photon lines in the t or u
channel. Here, the double-logarithmic corrections are not connected with the
infra-red divergence. As an example of this, we shall discuss in §137 the problem of
backward (u = constant) electron-muon scattering.
First of all, with the condition (135.3) the single-logarithmic corrections are
~(a/'n-)log(Ê2/m2)sV(a/7r)^ 1 and can therefore be omitted. Since the doublelogarithmic corrections do not appear in <ê and 2>, the latter functions can now be
taken as simply equal to their unperturbed values G and D.
The calculation of the vertex operator F involves the summation of the
double-logarithmic terms which arise from an infinite sequence of diagrams. This
problem will be analysed in §136, butfirsta method will be described for separating
the double-logarithmic terms in the various Feynman integrals before actually
performing the integrations over all the variables in them (V. V. Sudakov, 1956).
Let us consider the first-order correction (with respect to a) to the vertex
operator, represented by the diagram (117.1), which is here conveniently taken
(with a renaming of the variables) in the form
►o
Pi
Pi
~V V
_ f
(135.4)
\ ,
or, analytically,
7y(7Pz-ff + ™)7M(yP. ~ Y/ + m)yvdAf
V^\n2 n,a)= - - ^ J
VP2 Puq)
'
4 ^ J [ ( p 2 - / ) ^ - m i + iO][(p,-/) 2 -m 2 + «0][/2 + iOr
(135.5)
We shall assume that
\q2\ > Pi plm2,
(135.6)
and that the ends p,, p2 may be either physical or virtual. From (135.6) it follows
that
\PxPi\~W>Puplm\
(135.7)
610
Asymptotic Formulae of Quantum Electrodynamics
§135
i.e. the 4-vectors p{ and p2 have large components but small squares; this is
possible because the four-dimensional metric is pseudo-Euclidean. The doublelogarithmic terms in fact occur when the conditions (135.6) are satisfied.
We shall see below that relatively small values of / are important in the
integration over d4f. We can therefore neglect / in the numerator of the integrand,
and r{1) becomes
r*l> Ä
_ * y{ypi
+m)y»(ypl
+ m)
yju
(135.8)
where
h
~i
[(P2-/) 2 -m 2 +iO][( P l -/) 2 -m 2 +iO][/ z +iO]-
(1359)
The matrix factor in (135.8) can be simplified by using the fact that when T
appears in diagrams it is always, in effect, multiplied by the matrices (7p2 + m) and
(YPi + m):
( 7 p 2 +m)r(7p, + m).
(135.10)
For, if the lines p\ and p2 are virtual, these factors come from G(pi) and G{pi)\ if
the lines correspond to real electrons, Y is multiplied by M2 and U\, and Dirac's
equations show that
uz-u2
2m
,
«,-
2m
u,.
Interchanging the order of the matrix factors and neglecting at each stage, in
accordance with (135.7), the squares p], pi and m2 in comparison with p\ p2, we get
ie2
(ypi + nOP^ypi + m) « - - j (Pip2)(7P2 + m)y,l(ypi + m)/,.
We can therefore put r(,) in the final form
P (1) = (ie2/2ir3) 7*tf„
(135.11)
f = q 2 ~-2(p,p 2 ).
(135.12)
where
The integral Jt converges when / is large, and therefore need not be regularized.
The main point of the subsequent calculations is the introduction of new and
more convenient variables of integration. Let / be resolved into components
§135
Separation of the Double-logarithmic terms
611
tangential and normal to the pip2 plane:
f = upi + vp2 + f±
s
/ | + /i,
/xP. = /xP2 = 0.
(135.13)
(135.14)
As new variables, we take the coefficients u and v and
P= -/i.
(135.15)
It is evident from the conditions (135.7) that the metric in the pip2 plane is
pseudo-Euclidean. The time axis can therefore be taken in this plane, so that f± is a
space-like 4-vector and p > 0.
Let the indices 0 and x temporarily denote the components of 4-vectors in the
Pip2 plane; y and z those in the normal plane. To transform the 4-volume element
d4/ = d2/xd2/i
to the new variables, we write
d 2 /i = |IJ <*|fj d<f>
= 2 dpd<f> -* TT dp
(since the integrand in (135.9) is independent of the angle <f>). Also
d 2 /»=|#^U"dt>
= |PioP2*-P2oPi*|dudt>
~W\dudv,
since pi is small, so that p\x «* plo and
(PioP2x ~ P20P1*)2 * (P10P20-P2xPu)2
= (p.p2)2 = (k 2 ) 2 .
Thus
d4/ = 12|r|dMdüd2/i
-+\-n\t\dudvdp.
(135.16)
The calculations now depend on the relation between p], pi and m2; two cases
will be considered.
612
Asymptotic Formulae of Quantum Electrodynamics
§135
VIRTUAL ELECTRON LINES
Let the momenta p\ and p2 correspond to virtual electrons, with
(135.17)
\pl\pl\>m\
We shall see that the most important range of integration, which leads to a
double-logarithmic expression, is in this case given by the inequalities
o<p«*|ftt|,M,
\p\lt\<\v\<\,\pllt\<\u\<\.
(135.18)
Accordingly, in the denominator of the integrand in (135.9) we can neglect m2, p2,
p\ and p in comparison with (pi/) and (pif):
d4f
-J 2(pz/) • 2(p,/)(/ +7Ö)-
(135.19)
T
The quantities prf, prf and f2 are given by
f2 = (up\ + vp2)2 - p « - tuv - p,
2(Pi/) = 2p,(up, + vp2) « - tvy
2(Plf)~-tu.
Then
d£_
du dv
2\t\) p + tuv-iO u v
2U|Jp
(135.20)
In accordance with the conditions (135.18), the integration over dp is taken from
0 to the smaller of |h;| and \tu\; the result is
min(|fi<|,|ft>|)
Î7T when tuv < 0,
= logmin
{RRi} + { 0 when tuv > 0.
P+ ^ - i O
/
(135.21)
The logarithmic integration over dv is taken from -1 to -|p?/f| an(* ^ r o m I P M t 0 1
(and similarly over du). When (135.21) is substituted in (135.20), the integral of the
first term over dudv is zero, because the integrand is an odd function. The
integration of the second term is carried over ranges of u and v having the same
sign when t < 0 and opposite signs when t > 0. In either case the ranges v > 0 and
v < 0 give the same contribution after integration over du, and the result is
r
iir2 ~ f du f dv in2.
''-IT2 } T J V = T l o g
|pf/»l
IPÎ/'I
the sign being the same as that of t.
log
(135.22)
§135
Separation of the Double-logarithmic Terms
613
Finally, substituting in (135.11), we get
r^\p2,Pûq)= ~£-y» log Kl log KL
2ir
PÎ
P5I
2
l« IHP?|.|PEI*m
(135.23)
PHYSICAL EXTERNAL ELECTRON LINES
Now let the momenta p\ and p2 correspond to real electrons, so that
(135.24)
p} = pl = m\
Then the important range of integration is
o<p«H,M,
(135.25)
0<|»|,|II|«1.
Since p\ - m2 = p\- m2 = 0, we can neglect p\ and p 2 in comparison with p\f and
P2/, and again bring the integral (135.9) to the form (135.19). To eliminate the
infra-red divergence which then occurs, however, we must apply a finite photon
mass A. < m in 'he photon propagator (cf. §117):
n v
J 2(p,/)-2( P 2 /)(/ri2 -At 2^+,i0T
(135.26)
In this case
f2 « - tuv - p,
2pi/~ -fu + 2m2H,
2p2/« -rw + 2m2ü,
and hence
r _
i,_
^ f
dp
du dv
2|f|Jp + fi« + A 2 - i 0 i i - T » » - T i r
n „ T O
(133.27)
where T = 2m2lt < 1.
After the integration over dp, similarly to (135.21), we obtain
r = -ill
1
f f du
2|f|J J
dv
U-TVV-TU'
the integration being subject to the condition tuv + X2 < 0. The ranges v > 0 and
614
Asymptotic
Formulae of Quantum Electrodynamics
§136
v < 0 again make equal contributions, and the result of the integration over du is
i
m2
h-'-fjdv
0
i
J
(u _ Tl ,)" v _ TM)
81 v
1
iir 2 f.
T 8 - U2
= — logrr
K7
t J
ö
dv
r—,
(Ô - T I T ) ( T - v) v
/ i a c 00.
(135.28)
where S = A2t, |ô| <^ |T|, and w e have used the inequality |T| <^ 1.
In the integral (135.28), three ranges of v lead t o double-logarithmic
(II) V ( 6 / T ) < < V <\r\9 (III) V ( T S ) « v < V ( 8 / T ) . ( W e
expressions: (I) \r\<v<U
take the specific case V ( S / T ) < ^ | T | ; the result does not depend on this assumption.)
With the appropriate approximations in each range, we find
ii=
£( i o 8 ^ + 4 i o g ^ i o g f)-
(,35 29)
-
Finally, substituting in (135.11), we have
r">2,p,;q)= - ^ ( l o g
\q2\>P*
2
^ +41og^log^),
(135.30)
= pl = m\
in agreement with (117.21).
§ 136. Double-logarithmic asymptotic form of the vertex
operator
When the corrections r(1) calculated in §135 become of the order of nity, the
vertex operator has to be found by summation of the infinite sequence of rioublelogarithmic terms of all orders in a. This problem can be solved because sucn terms
arise only from diagrams of a particular type, and the contributions from diagrams
of different orders are related in a simple manner. As we shall see below, the
double-logarithmic terms arise from all the diagrams that have the form
(136.1)
§136
615
Double-logarithmic Asymptotic Form of the Vertex Operator
etc., in which each photon line joins the two electron lines, and the photon lines
themselves may intersect in any manner.
Let the photon momenta / i , / 2 , . . . be numbered in the sequence of, say, the
right-hand ends of their lines. Then the various diagrams of a given order will differ
in the sequence of the left-hand ends of the photon lines. In each Feynman integral,
we neglect terms in the numerator and denominator as in (135.5), and then treat the
numerator in the same way as in the derivation of (135.11). Then the sum of all the
diagrams having n photon lines, giving the term * an in T, is
r*"> = y'datfr^yin,
'-£/ 2(pi/i) • 2(
P l /,
dAU... d%
(136.2)
r^—^r
+ p,/ 2 ) / . . 2(p,/, + • • • + p,/„) • 2 ( P 2 / , ) . . . 2( P î /, + • ■ ■ + P2fn)W...
/5'
(136.3)
where the sum is taken over all interchanges (permutations) of the subscripts k in
the products p-Jk\ the terms iO and A2 in the denominators are omitted for brevity.
It is clear that, if the subscripts k in the products pjk in the sum (136.3) are
interchanged in any manner, this is simply equivalent to renaming the momenta and
so does not affect the value of I„. We can therefore extend the summation in (136.3)
to all interchanges of the factors fk in both p-Jk and pjk, and divide the result by n !.
We now make use of the important formula
l
1 1 1
^
tf ai(a1 + a 2 )...(ai + a 2 +- • • + a„) ax a2
a„'
(1364)
where the sum is taken over interchanges of the subscripts 1, 2 , . . . , n.t When this
formula is twice applied to the sum of integrals, we obtain a product of n identical
integrals of the form (135.19) (or (135.26)), so that
In=-ri/î.
(136.5)
Substitution in (136.2) and summation of PH) over all n = 0, 1, 2 , . . . gives finally
r*(p2,pi; q) = y* txpiiehhll^).
(136.6)
In particular, substitution of U from (135.22) gives the double-logarithmic
asymptotic form of the vertex operator with virtual external electron lines:
r*(p2, Pi; q) = Y* exp { - £ log | ^ | log | ^ | } ,
\q2\>\pl\,\pl\>m<.2
(V. V. Sudakov, 1956).
t The formula is obviously true for n = 2, and can easily be proved by induction.
(136.7)
616
Asymptotic Formulae of Quantum Electrodynamics
§137
Substitution of I{ from (135.29) gives the asymptotic form of the vertex operator
for real external electron lines:
r-(p 2 ,P,;q) = 7 " e x p [ - ^ ( l o g 2 ^ + 4 1 o g ^ l o g y ) } ,
(136.8)
\q2\>p2i = pl=m2.
The factor which distinguishes this F* from its unperturbed value 7M defines also
the difference between the amplitude for electron scattering in the external field
and its Born value. The scattering cross-section is therefore
da = d a f î e x p [ - ^ ( l o g 2 ^ + 4 1 o g ^ l o g ^ ) } .
(136.9)
To eliminate the infra-red divergence, we still have to multiply this expression
by the sum of the probabilities for the emission of various numbers of soft photons
with energy not exceeding some small value comax, i.e. by the quantity (see (122.2))
w
w
max
mix
1+ f dwu + jj
0
U
IMX
! dwUl j
0
«"max
dw^+'^expf
0
f dw»}.
(136.10)
0
The integral in the exponential is given by (120.14) (as the expression which there
multiplies dcre\), and the final result is the following asymptotic formula for the
cross-section for scattering of an electron with energy e at a high momentum
transfer:
da = d<TBexp\- — log ^-j l o g - M ,
|q 2 |>m 2 ,
(136.11)
(a/27T)log2(e/m)~l
(A. A. Abrikosov, 1956). The first-order term (with respect to a) in the expansion of
this expression is, of course, (122.12).
Note that, if we put w max ~e, one of the logarithms in (136.11) becomes of the
order of unity; that is, the double-logarithmic corrections cancel in the crosssection for simultaneous emission of photons of any energy.t In the approximation
used, the exponential factor in (136.11) then becomes unity, and the cross-section
has its Born value, in agreement with the general statement at the end of §98.
§ 137. Double-logarithmic asymptotic form of the
electron-muon scattering amplitude
As an example of the second kind, let us consider the scattering of an electron
by a negative muon, taking only the case of scattering exactly backwards through an
t For scattering through a finite angle, the condition formulated in §98 for the photon to be soft
requires only that <om« < e, and so the formulae derived here can be used with logarithmic accuracy
even when &>m.« — e.
§137
617
The Electron-Muon Scattering Amplitude
angle 0 = IT (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967).
This is a simple process in two respects. Firstly, exchange diagrams do not appear,
because the two particles are not identical. Secondly, in back-scattering there is
very little emission of soft photons, and therefore no infra-red divergence: according to (98.8) the soft-photon emission cross-section is
L\l-v;-n
l-v;-n
l-v,n
1-v^-n/
J
4ir <o
(137.1)
where \t, vM and \'e, v'„. are the particle velocities before and after the collision; in
the ultra-relativistic case equality of momenta implies equality of velocities, and to
this accuracy, in the centre-of-mass system, for back-scattering, v, = - v^ = - v«■ =
v£, so that (137.1) is zero.
If the scattering process considered corresponds to the s channel of the reaction,
in the t channel it becomes the conversion of an electron-positron pair into a /m+/uT
pair. In this channel the condition 6 - IT signifies that the directions of motion of e~
and /x~ (and of e+ and /j,+) coincide. The elimination of the bremsstrahlung in this
channel is particularly clear, since the direction of motion of the charge of each
sign is unchanged.
The cancelling of the leading terms in the emission cross-section has the result
that its asymptotic form does not contain double-logarithmic corrections. Correspondingly, there is (with the same double-logarithmic accuracy) no infra-red
divergence even on integration over the momenta of the virtual photons in the
scattering amplitude.
If the process is described by means of the invariants
s = (P< + P»)\
t = (pe- p'e)2,
u = (pe - p;)2,
the values corresponding to back-scattering in the ultra-relativistic case are
s = -t>ml,
u = 0.
(137.2)
In the first approximation (with respect to a) of perturbation theory, electronmuon scattering is represented by the diagram
}Pe-p.
(137.3)
The corresponding amplitude is
Mji» = ^
(üitL)'yvuM)(uie)'yvu{e)).
(137.4)
The value of this expression in the limit (137.2) is obtained by replacing the matrix
618
§137
Asymptotic Formulae of Quantum Electrodynamics
4-vector y" by its "projection" y{ on a plane perpendicular to the pep'e plane (or,
equivalently, the p^pl plane, since in ultra-relativistic bapk-scattering Pt^pl and
p'e ** P»Y- the components parallel to the pep'e plane are the matrices
v i ( 7 P < ~ 7 P «*
^77 (yp*+ yp «)'
(the first being equal to y° and the second to n, • y, with n, a unit vector along pe),
and on using Dirac's equations for the bispinors u(<) and u(M), we have
(ü(*)'7,V*))(ü(')'y,,|U(<))~l/s,
so that these terms may be omitted.
In the next approximation we add the diagram
f
1
<-*
1
1
Pc
1
1
•
1
tPc-f
1
(137.5)
_l—,
■fv
pe+pM-f
and a diagram with the photon lines "crossed", which is conveniently taken in a
form which differs from (137.5) only as regards the direction of one of the
continuous lines:
p
«
"
\
'
H*l
,
■
P«
fiv*
I
1
p^-Pe+r
(137.6)
■
p;
An analysis of the corresponding integrals shows that in both diagrams we have
double-logarithmic contributions from the regions of "soft" virtual photons: | ( / pt)1\<rn\ or |(/-pi) 2 | <m\. These contributions arise from the infra-red divergences of the integrals and, according to the foregoing discussion, must certainly
cancel here. In the diagram (137.6), however, there is a double-logarithmic contribution also from large momenta: |/ 2 |>mJ, and this contribution must be calculated.
The diagram (137.6) corresponds to the integral
*,»_
Û*2 f [fi ( 'V(y/ + me)YxU(e)][û^Yx(Y/ + mu)yvu^
4
where we we have already used the fact that Pt^p'r We again put
f = upt + vp'e + f1
(137.8)
(cf. (135.13)). The double-logarithmic contribution comes from the range defined by
§137
The Electron-Muon Scattering Amplitude
619
the inequalities
|su|,|st>|>p>m£, |
(137.9)
mlls<\u\,\v\<\t]
where p = -/*. The 4-vector f± is defined so that fxpe = fxp'e = 0; in the present case
of back-scattering, it follows that fl = 0 in the centre-of-mass system, and so p = tl.
In the numerator in (137.7) we can neglect mt and mß, as well as all terms
containing u o r v ; the factors u and v there would cancel the corresponding poles
in the denominator (see below), and so the required squares of logarithms would
not occur. Since
(p<-f)2sstu~
-su,
(pt-f)2«
-sv,
f^suv-p,
we can transform the element of integration d4f in accordance with (135.16) and
rewrite (137.7) as
M}?,.
1
_ £ I" ^ V W ^ f l t f i ' - Y C ; ^ ' ] , du dv d>/i.
lit1)
su sv(suv - p + lOy
The numerator in the integrand is further transformed by averaging over the
directions of fx and replacing y" and yK by 71 and 71 (on the same principle as in
(137.4)). A simple calculation gives
Mf = M$>Jmt
p du dv dp
uv (suv - p + iO)
r(»)
(137.10)
Finally, using in the numerator the identity p = (p- suv) + suv, we can omit the
second term, which would cancel simple poles and therefore give no doublelogarithmic contribution. Thus
j(.) = _ ^ (äudvdß,
4TT
J
Mt? (p - suv - iO)
37 n )
This integral has the same form as (135.20), and the integration over dp can
therefore be performed in the same manner, but since now p>ml we have the
condition suv > mj instead of suv >0. The result is
r(D
a fdudv
2TTJ
(13712)
UV '
the range of integration being defined by the inequalities
mils <u,v <\,
suv>ml;
in the calculation with logarithmic accuracy, the strong inequalities > are replaced
620
§137
Asymptotic Formulae of Quantum Electrodynamics
by > simply. A straightforward calculation gives
J(,)=
< 13713 >
Ä1O82^4ir
m^
In the higher approximations of perturbation theory, the desired terms
-^an log2" s arise from "ladder" diagrams similar to (137.6) but with a larger number
of "rungs". The complete double-logarithmic asymptotic form of the scattering
amplitude is therefore given by the infinite sum
iMm
K- ^m-r~^m- -P«
IV-
■^—
i
i
—•-
+
m
1
- L . <m—-PÜ
1
1
-«—JL - ^ . J .
1
i
m*
i ■
i
i
i ■
i '
i
i
i■
+~
(137.14)
To determine the general form of the terms in this sum, let us consider the
diagram for the third approximation (the third term in the series (137.14)). The
corresponding integral may be written
j(2) _ ( a \2 f
dui dui du2 dv2
[
~ \2ir) J Mifi(ui + u2)(t), + v2Y J
(137.15)
with the range of integration
w»*s < u u , Ui,2< 1,
su\V\, su2v2> m*.
The double-logarithmic term in this integral can be separated by applying to the
variables of integration the further conditions
V2> V\,
(137.16)
u2> U\.
Then
r<2> = I a V f du\dv\du2dv2
\2TT)
= \^J
J
U\U2V\V2
j dÇ\ drj, d& dr)2,
where £ = \og(suJml), TJ, = -log vit and the range of integration is defined by the
inequalities
£i>f)i,
&>T?2,
o->&«T}2>0,
o- = log(s/m*).
Similarly, the nth term of the series can be written as M}"* = M}} ) / (n) , where
J ( n V) = (g^y | d^.dr,,... dÇndVny
(137.17)
§137
The Electron-Muon Scattering Amplitude
621
with the range of integration
$><n, (i = l , 2 , . . . , n ) ,
<r>Cn,Vn>0-
(137.18)
The total scattering amplitude is
M„ = M}!> [l + £ J % ) ] .
(137.19)
To calculate this sum, we use auxiliary functions A("} (£, 17), given by the same
integrals (137.17) but with ranges of integration
6>Tfc(i = l,2,...,rt),
£>&>(),
ti>Tj„>0
(137.20)
(i.e. with different limits of integration with respect to £„ and Tjn, instead of uniform
limits as in (137.18)). It is evident that Mfi = M$ A(<r, <r), where
Ai^J^A^t,),
A ( 0 ) =l.
(137.21)
The definition of the functions A(B) (£, TJ) shows that they satisfy the recurrence
relations
A(B)(£, V) = ^ I «fEidiiiA"-"«,, T,,),
and summation of this equation with respect to n from 1 to « gives an integral equation
for the function A(|, TJ):
A(fc T?) = 1 + ^ I A({„ r,,) d|,dt)„
t\>Vi>
£>£i>0,
(137.22)
T}>TI,>0.
For the subsequent analysis it will be sufficient to consider A(£, 17) in the range
$ > T). Then equation (137.22) can be written
n i
MlT?) = 1 + Yi\
\
A
<*"ii>d&d^
< 13723 >
0 1)t
Differentiating this with respect to 17, we have
d ai
t
* § -S?/A(ft.'>>*.
<13724>
622
§137
Asymptotic Formulae of Quantum Electrodynamics
and a further differentiation with respect to £ gives the differential equation
ßi-uA=o-
(i3725)
This has to be solved with the boundary conditions
A(£,0)=1, [M/dT,]«=„=0,
(137.26)
which follow immediately from (137.23) and (137.24).
The solution can be found by means of a Laplace transformation with respect
to*:
A(£ TJ) = Y^ï j e'tQip, T,) dp,
c
(137.27)
where the contour C in the complex p -plane is a closed curve around the point
p = 0. Substituting (137.27) in (137.25) and equating the integrand to zero, we have
with <f)(p) an arbitrary function. The first boundary condition (137.26) now gives
<MP) = ^ + <KP),
with \fß(p) an analytic function, having no singularity within C. The second
condition (137.26) can be satisfied by putting tfß(p) = -lirpla: then
C
Combination of these expressions with £ = TJ = a gives
c
Finally, integrating by parts and using the familiar formula
c
(where h(z) = -U\(iz) is the Bessel function with imaginary argument), we find the
§137
The Electron-Muon Scattering Amplitude
623
scattering amplitude
The cross-section for scattering through 6 = ir is correspondingly
d<T = d<TW . fr. i,lUJ—
2
a log (5/m^)
v V ir
l0g-*A
° m£/
(137.29)
da(,) = (27raV)df
being the cross-section in the Born approximation in the ultra-relativistic case (see
§81, Problem 6).t
t References to further literature on double-logarithmic asymptotic forms are given in the review
article by V. G. Gorshkov, Soviet Physics Uspekhi 16, 322. 1973.
C H A P T E R XIV
ELECTRODYNAMICS OF HADRONS
§ Î38. Electromagnetic form factors of hadrons
So FAR, we have been discussing in this book the quantum electrodynamics of
particles not capable of strong interactions (electrons, positrons and muons). There
are also many particles known as hadrons,t which take part in strong interactions.
These include, for example, protons and neutrons with spin 2, pions with spin zero,
and other particles. Atomic nuclei, which consist of protons and neutrons, are of
course hadrons also.
Present-day theory does not enable us to derive a complete electrodynamics of
hadrons. It is clearly impossible to set up equations which determine the electromagnetic interactions of hadrons without taking account of the considerably
more powerful strong interactions. In particular, the latter must be included in
order to obtain the explicit form of the hadron current, in order to describe the
interactions in quantum electrodynamics. The hadron current will therefore be
introduced as a phenomenological quantity whose structure is determined only by
the general kinematic requirements independent of any assumption about the
dynamics of the interactions^
The electromagnetic interaction operator will again have the form
(138.1)
e{JA\
where the current is now denoted by the capital letter J to distinguish it from the
electron current j . Since the order of magnitude of this interaction is specified by
the same elementary charge e, we can again use the methods of perturbation
theory.§
Let us first establish the form of the transition current between two states of a
hadron in free motion (without any transformation of the hadron itself). This
current occurs in the three-ended diagram
1
tq
/ \
JRz
(138.2)
P
t From the Greek hadros, "large, massive".
t Topics of hadron electrodynamics which involve the quark model will not be discussed in this
book.
§ In this chapter, e (>0) denotes the unit charge.
624
§138
Electromagnetic Form Factors of Hadrons
625
which itself may be part of a more complex diagram (for example, that for elastic
electron scattering by a hadron). The broken line in the diagram (138.2) represents a
virtual photon; it cannot correspond to a real photon, since a free particle cannot
absorb (or emit) such a photon; and
q2 = (P2"P.) 2 <0.
If we take first a hadron with spin zero, let U\ and u2 be the wave amplitudes of
the initial and final states of the hadron, in which its 4-momenta are pt and pi\ for a
spin-zero particle these amplitudes are scalars (or pseudoscalars).t The hadron
transition current Jfi between these two states must be bilinear in u( and u$. We
can write it
Jft = utTuu
(138.3)
where the 4-vector T is an unknown vertex operator (the circle in the diagram
(138.2)). If we put u, = u2 = 1, then Jfi = V.
The conservation of current is a universal property in electrodynamics, due to
the gauge invariance of the theory. In the momentum representation, it is expressed
by the orthogonality of the transition current and the photon 4-momentum q =
P2-P1:
qjft = 0.
(138.4)
Here, this means that V must have the form
r = PF(q\
(138.5)
where P = p\ + p2 and F(q2) is a scalar function of q2, which is the only invariant
independent variable. Since the type of hadron is unchanged by the transition,
p\ = pl = M2, where M is the mass of the hadron, and hence Pq = 0.
The matrix elements (138.3) with T given by (138.5), and therefore the operator
J, are true 4-vectors. The interaction operator (138.1) is consequently a true scalar.
Thus the electromagnetic interaction of spin-zero hadrons is necessarily P-invariant. It is also T-invariant. Time reversal interchanges the initial and final
4-momenta, leaving the sum P =pi + p2 unaltered, and changes the sign of the
space components of the 4-momenta but not the time components. This is also the
way in which the components of the 4-potential A are transformed, so that the
product JÂ is unaltered.
The invariant function F(q2) is called the electromagnetic form factor of the
hadron. The explicit form of this quantity cannot, of course, be established in a
phenomenological theory, but it is real (in the region q 2 < 0 at present under
consideration), as follows from the same arguments as were used in §116 for the
t The plane wave is written in the form tfi = [u/\/(2e)]e~'''x. The normalization to one particle per
unit volume corresponds (for spin-zero particles) to the normalization of the scalar by u*u = 1, and we
can take simply u = 1 (§!0). In the following we define the transition current with respect to the
amplitudes ui, u2, in accordance with the notation used in §64.
626
Electrodynamics of Hadrons
§138
electron form factors: when q 2 <0, there are no intermediate states which could
appear on the right-hand side of the unitarity relation, and so the matrix Mfi (and
therefore Jfi) is Hermitian.
If q = 0, the initial and final states are the same, and Jfi becomes a diagonal
matrix element. In particular, e{J%J2ei = eF(0) is the charge density, which is
equal to the total charge Ze of the particle because of the normalization to one
particle per unit volume.
For an electrically neutral particle, F(0) = 0, but it must be emphasized that this
does not imply a strictly neutral particle. If a particle is strictly neutral and has a
definite charge parity, F(q2) = 0 for all q2; since the current operator is charge-odd
(§13), its matrix elements between two states of the same hadron are zero.t
Let us now go on to hadrons with spin \. In this case the wave amplitudes ut, u2
are bispinors, and the hadron current is
JSi = ü2Yux.
(138.6)
From the bilinear combinations of ü2 and U\ and the 4-vectors p\ and p2, both true
4-vectors and pseudovectors (satisfying the condition (138.4)) can be constructed.
Hence the condition of P invariance of the interaction is not necessarily satisfied,
and must be imposed separately.t As has been shown in §116, under this condition
the vertex operator contains two independent and (if q 2 <0) real form factors. We
shall now write it as
P-2M(Fe-Fm)^+Fm7'A
= (4M2Fe - q2Fm) p + y f (Fe - Fm)a-q„
(138.7)
where Fe(q2) and Fm(q2) are invariant form factors (M being the mass of the
hadron). It is easily seen from the equation P 2 + q2 = 4M2 and (116.5) that the three
expressions in (138.7) are equivalent^
t This does not, of course, mean that such a hadron has no interaction with an electromagnetic field.
The product of two current operations, j(x)/(x'), is charge-even, and its matrix elements are non-zero
for transitions between states having the same charge parity. Thus a strictly neutral hadron can scatter a
photon or simultaneously emit two photons, i.e. can take part in processes of a higher order in a.
t We shall not consider possible violations of parity conservation in electromagnetic interactions in
consequence of virtual weak interactions.
§ The convenience of defining the form factors as in (138.7) (R. Sachs, 1962) will be shown below. In
the literature, use is also made of form factors F\ and F2 defined similarly to / and g in (116.6):
I*-",*»-£«"«These are related to F, and Fm by
Ft = Ft+ F2q2l4M\
F„ = Fi + F2.
§138
Electromagnetic Form Factors of Hadrons
627
The electromagnetic form factors are among the invariant amplitudes defined in
§70. They may be regarded as the amplitudes of a "reaction" which is (in its
annihilation channel) the decay of a virtual photon into a hadron and an antihadron.
The virtual photon is a "particle" with spin 1. The fact that its decay into two
particles with spin 2 must be described by two independent amplitudes is easily
seen by calculating the corresponding helicity amplitudes (AbAc|SJ|Afl) (see §69): the
P invariance means that the four non-zero elements of the S-matrix must be equal
in pairs:
(iJls'ID^H-lls1!-!),
U-\\Sl\0) =
(-{\\Sl\0).
The requirement of T invariance (or C invariance in the annihilation channel)
imposes no further relations between these elements. This is connected with the
fact that the interaction described by the vertex operator (138.7) is necessarily
T-invariant also (but this does not apply for particles with higher spins).
When q ->0> the terms of zero and first order (in q) in (138.7) are
T* = F€(0)y* - 2ÄT [ F m ( 0 ) " F < ( 0 ) ] o r ^ .
(138.8)
Hence it follows (see §116) that Fe(0) = Z is the electric charge of the particle (in
units of e), and Fm(0) - Fe(0) is its anomalous magnetic moment (in units of e/2M).t
So far we have used only form factors in momentum space. This is, of course,
sufficient to describe the observed phenomena. Purely as an illustration, however,
we shall give a somewhat more intuitive interpretation of the form factors,
regarding them as the Fourier transforms of certain functions of the coordinates.
To do this, it is convenient to take a frame of reference in which P = pi + P2 = 0
(called the Breit frame); this is always possible, since P 2 > 4 M 2 > 0 . In this frame,
ei = e 2 = £ , so that P° = 2e, and the components of the 4-vector q are q 0 = = 0,
q = 2p 2 = - 2 p i .
For a spin-zero hadron, the transition current in the Breit frame has a particularly
simple form:
fjJ2e = F(-q 2 ),
J = 0.
From this we see that F(-q 2 ) may be interpreted as the Fourier transform of a
static distribution of charges with density
ep(r) = e ^ 3 / F(-q 2 ) e*
r
d'q.
(138.9)
In this sense, the particle is said to have a spatial electromagnetic structure: when
F = constant = Z, we have p(r) = Z8(r), and the dependence of the form factor on q
t For example, the proton has F,(0) = 1, F m (0)- F,(0)= 1.79; the neutron has F,(0) = 0, Fm(0) = -1.91
(the magnetic moment being entirely "anomalous").
628
Electrodynamics of Hadrons
§138
is interpreted as the difference between the charge distribution and a point charge.
It must be stressed, however, that this interpretation is not to be taken literally. The
function p(r) does not relate to any particular frame of reference, since there is a
different frame for each value of q.
The Breit frame is the same as the rest frame of the particle, and independent of
q, only in the non-relativistic limit of small q2 « M 2 , when the change in the particle
energy in scattering is negligible. The initial and final states of the particle are the
same in this approximation, and so the transition current becomes a diagonal
matrix element, with p(r) the actual spatial distribution of charges. For the
elementary particles, however, the values of |q| for which the form factors vary
considerably are only slightly less than M. In the non-relativistic limit for these
particles, we can therefore replace F(-q 2 ) by F(0), i.e. regard the particle as a
point. The situation is different for nuclei. The mass M of a nucleus is proportional
to the number A of nucléons in it, and the typical value of |q|—l/J?, i.e. is
proportional to A'113 (R being the radius of the nucleus). Hence, for sufficiently
heavy nuclei, the typical q2 < M 2 , and so the non-relativistic treatment is permissible throughout the significant range. Thus the concept of the electromagnetic
structure of the nucleus becomes a quite definite one.
For a spin-5 particle, (138.7) gives in the Breit frame
J?i = ( F e « F m ) y ( f i 2 i i i ) + Flll(fi270iii)
(138.10)
= Fe(ü2y°ul),
J/i=^Fmiqx(ö22wt),
(138.11)
where S is the three-dimensional spin operator (matrix) (21.21), and in (138.10) the
equation
e(ü2y°ul) = M(ü2ul)
has been used; this is easily verified by means of Dirac's equations for U\ and u2
with pi = - p 2 .
The time component of the transition current (138.10) differs from the
expression for a "point particle" (an electron) by a factor F e (-q 2 ). We can therefore
say that the form, factor Fe (called the electric form factor) describes the "spatial
distribution of charge" in accordance with (138.9).
Similarly, the three-dimensional vector (138.11) can be correlated with-the
"spatial distribution" of the current density ej(r) = curl jt(r), where
^t) =
^X^Fm{^)e^d\
is the "magnetic moment density". Thus the magnetic form factor Fm may be
interpreted as the magnetic moment spatial distribution density, of course with the
same reservations as were expressed above with regard to the charge distribution.
§139
Electron-Hadron Scattering
629
Fm includes both the "normal" Dirac magnetic moment and the "anomalous"
magnetic moment specific to the hadron, the "density" of the latter corresponding
to the difference Fm - F€.
It is reasonable to suppose that the singular points of the hadron electromagnetic form factors, like those of the electron form factors, occur at real positive
values of the argument t = q2 = - q2. From this we can derive certain conclusions
as to the asymptotic behaviour of the distribution p(r) (and ji(r)) as r-»». The same
transformation of the integral (138.9) as was applied in §114 to derive (114.4) from
(414.3) gives the following result for large r:
P(r) oc e-v
where KI is the abscissa of the first singularity of the form factor F(q2); cf. the
footnote to §114. If the nearest singularity is given by the threshold for the
production of a pair of hadrons (each with mass M0) by the virtual photon, then
KO = 2M0.
§ 139. Electron-hadron scattering
The formulae derived in §138 can be applied to the elastic scattering of an
electron by a hadron. Let the initial and final 4-momenta of the hadron be ph and
PH, and those of the electron pe and pi; then
Pe + Pk=Pe+Pk.
(139.1)
The process is represented by the diagram
-p.
(139.2)
Ph
The emission of a virtual photon by the electron corresponds to the ordinary vertex
operator 7, and its absorption by the hadron corresponds to the operator T.
Let us take the most interesting case, that of a hadron with spin ] (for example,
the scattering of an electron by a proton or neutron). The diagram (139.2) corresponds to the scattering amplitude
Mfi
= -47re 2 4(ö;7^e)(öir^ h );
H
(139.3)
in this chapter, the electron charge is -e. The calculation of the cross-section from
this amplitude is essentially the same as the calculations in §81; the operator T is
conveniently taken in the form of the first expression (138.7).
630
Electrodynamics of Hadrons
§ 139
For the scattering of unpolarized particles, the result is
à
=
™2dt
[s - (M + m) ][s - (M - m)2]f2(l - r/4M2)
2
x {F 2 [(S - u)2 + (4M2 - Of] ~ ^
Fi[(s - u)2 - (4M2 - t)(4m2 +I)]}, (139.4)
where M is the hadron mass and m the electron mass,
t = q2 = (pe -p'ef,
S = (Pe + Phf,
2
U = (pe ~ p'hf,
2
s + t + u = 2m + 2M .
The following are some limiting cases.
For the scattering of an electron by a heavy nucleus, an important case is that in *
which the momentum transfer |q| from the electron to the nucleus is small
compared with the mass of the nucleus, but not small compared with 1/R (where R
is the radius of the nucleus), so that the nucleus cannot be regarded as a point. In
this case, the centre-of-mass system approximately coincides with the rest frame of
the nucleus, the recoil of the nucleus may be neglected, and the electron energy is
unchanged. Then
ir|dt| = p2do;,
-t = q2<M2,
s-M1-M1-u-
2Mee,
and formula (139.4) becomes
da = ^
(4e2 - q2)F2(-q2).
(139.5)
In this approximation, the cross-section has only the term containing the electric
form factor, and (139.5) corresponds to formula (80.5), which applies to the
scattering of an electron by a static charge distribution.
In the scattering of an electron by a neutron at rest, in the same limiting case
ee < M (where M is the mass of the neutron), the form factors can be replaced by
their values for q = 0, since, as already mentioned, for a single nucléon the
characteristic "radius",of the charge distribution is comparable with 1/M.t Since
the neutron is electrically neutral, F«,(0) = 0, and the cross-section becomes
2
.
r 4 ( e -jm
da = afi 2
2
) ,+ «1
J ,
1 \do
e
= api2(cosecM# + 1) do'»
(139.6)
where fx - (e/2M)Fm(0) is the magnetic moment of the neutron and d is the
t The empirical value of the r.m.s. "radius" of the nucléon is about 3.51 M * Mim* (where m„ is the
pion mass).
631
Electron-Hadron Scattering
§139
scattering angle. This formula corresponds to the scattering of an electron by a
point magnetic moment at rest.
Finally, we shall give the cross-section for the scattering of an ultra-relativistic
electron by a nucléon, with |q| > m. As before, q2 denotes the square of the
momentum transfer in the centre-of-mass system, and hence the invariant t = -q 2 .
In the rest frame of the original nucléon (the laboratory system), we have
- t ^2(pePe)
= 2e^é(l ~ COS d ) ,
where e€ and e'e are the initial and final energies of the electron, and d is the
scattering angle in this system. In the ultra-relativistic case, e'e is related to # by the
same formula as in the scattering of a photon (cf. (86.8)):
1
l
I;-^
l
=
M
n
(1
-
^
COS,9)
-
Hence
~ f - l + (2e e /M)sin i ^'
(139 7)
*M-[i+Mm*ti*?
(139 8)
'
-
where do't = 2tr sin #dd. In formula (139.4), we can everywhere omit the electron
mass m; expressing all quantities in terms of t and s - M2 = 2Mee, we have
or, using (139.7) and (139.8),
A
d(r
- ■» • a 2
cos2
2#
1
v
" d 0 e 4 T 1 s i n 4 U l + (2 ee /M)sin 2 id
X
i
\-tl4M2
2KPFmtan^\
(139 10)
-
(M. N. Rosenbluth, 1950).
Note that the form factors Fe and Fm contribute independently to the crosssection, and there are no interference terms between them. This shows that the
form factors have been appropriately chosen.
632
Electrodynamics of Hadrons
§ 140
PROBLEM
Find the cross-section for scattering of an electron by a hadron with' spin zero.
SOLUTION. Using (138.5), we have instead of (139.3)
M/,=
Aire1
~-^r{ü'AyPH)ut)F(q\
The cross-section is found to be
,
d(J
Tra2dt[(s-u)2 + (4M2-t)t]
' [s-(M + m)2)[s-(M-m)2)t21
„2,^
(0
'
with the same notation as in (139.4). When |f| > m\
.
. , a2 cos 2 !«
F2(t)
der — doe-—? . i i • -———rr-r-—. i \ 4e; sin 2# 1 + (2e€IM) sm^ id
with the same notation as in (139.10).
§ 140. The low-energy theorem for bremsstrahlung
In §98 we have investigated the emission of a photon in a collision of particles,
in the limit of zero photon frequency, and found that the amplitude of the process
is inversely proportional to co and can be simply expressed in terms of the
amplitude for the same collision without emission of a soft photon; the latter will
again be conventionally referred to below as the amplitude for "elastic" scattering
and denoted by M\f\ In the next approximation with respect to o>,
Mfi = M{f~]) + Mf,
(140.1)
where a correction term independent of <o ( « o>°) has been added to the principal
term ( « aT1). We shall see that this correction term also, like the principal term,
can be expressed in terms of M}?0, and that this is true whatever the detailed
electromagnetic structure of the hadron. This result is called the low-energy
theorem for bremsstrahlung (F. E. Low, 1958).
We have seen in §98 that the main contribution to the amplitude for emission of
a soft photon (corresponding to the first term in (140.1)) arises from diagrams in
which the photon is emitted by the initial or final particle. These are diagrams of
the form
(140.2)
§ 140
The Low-energy Theorem for Bremsstrahlung
633
in contrast to those of the form
(140.3)
where the photon line comes from the internal parts of the diagram. A characteristic feature of the diagrams (140.2) is that they can be divided into two parts by
cutting only one (initial or final) virtual-hadron line. Thus they illustrate an
important property: there exists a one-particle intermediate state with one hadron.
We have seen in §79 that, because of the unitarity conditions, this property
necessarily causes a pole singularity of the amplitude.
Let us assume for simplicity that only one (denoted by the subscript 1) of two
colliding hadrons has an electric charge and therefore can radiate, and that neither
hadron has any spin. The wave amplitudes u of such hadrons are scalars, which
will be taken as unity. Then the contribution of the pole part of the diagram
(140.2a) to the amplitude is
iMf = V(47r)e*(2pï - kn eF {p]_k\i_Mi
iT.
(140.4)
The first factor corresponds to the photon k (eß being its polarization 4-vector). The
second factor corresponds to the electromagnetic hadron vertex (the black dot in
the diagram), and is written in the form (138.5), with F the hadron form factor. The
third factor is the propagator of the virtual hadron pi — k (M being its mass).
Finally, the factor iT denotes the whole remaining section. This differs from the
amplitude of the,elastic process
(140.5)
in that the real hadron pi is replaced by the virtual hadron p\-k.
The first few terms in the expansion of (140.4) in powers of co include (1) terms
inversely proportional to o>, (2) terms independent of o> but depending on the
direction of k, (3) terms independent of both to and k. The terms of the third kind
only are given also by non-singular diagrams of the type (140.3), which do not have
a pole singularity, and by the non-pole parts of the diagrams (140.2). We shall see
that all such terms of this sort are jointly and unambiguously given by the terms of
the first and second kinds when the condition of gauge invariance is applied, so that
no separate calculation of these terms is necessary.
The amplitude of the elastic process (140.5) depends only on two invariants:
s = ( P i + p2)2 = (pî + pD2,
* = (P2-p2)2.
(140.6)
Electrodynamics of Hadrons
634
§140
The replacement of pi by pi - k not only changes s into (pi - k + p2)2 but also brings
in a dependence on a further variable,
( P l - k ) 2 - M 2 = -2p,k,
which represents the "non-physicalness" of the momentum p\-k. But the first
term in the expansion in powers of this new variable (a small quantity) already
eliminates the singularity in the amplitude (140.4), and therefore can yield in this
amplitude only terms independent of k, which according to the foregoing discussion
are not yet relevant. Thus we reach the important conclusion that X in (140.4) can
be replaced by the physical amplitude MftXs, t) with the change
s -+ (Pi + Pi - k2) = s - 2k(px + p2).
(140.7)
The first terms in its expansion are given by
T-> M }f(s, t) - 2(kp, +
kp2)(dMflds)t.
For a similar reason, it is unimportant that the electromagnetic form factor F
here relates to a vertex at which only one of the two external hadron lines (pi and
Pi — k) is physical. The form factor can therefore be replaced by the one described
in §138, for a vertex with two physical external lines; since the photon k is then a
real photon, we have F(k2) = F(0) = Z u where eZ{ is the hadron charge.
Thus (140.4) gives
M }?> = Zxe V(4TT) ^
^
- Z,e V(4TT) 2(e*p,) z ^ ) KP2V ^ff
+ • • -,
(140.8)
where the dots represent terms independent of k (whereas the second term in
(140.8) depends on the direction of k). Similarly, we find that the contribution to Mfi
from the diagram (140.2b) differs from (140.8) in that pu p2 and k are replaced by
pî, pi and -k. The leading term in the expansion is the already familiar expression
(cf. (98.5))
Mfr1} = Z,eV(4TT) ( ^
- *£)
Mf.
(140.9)
The terms independent of k can be found from the condition for the amplitude
as a whole to be gauge-invariant: it must be unaffected by the change e*-*e* +
constant x k, i.e. must have the form M/, = eJJ*\ with k^J" = 0. It is easy to see
that, for this to be so, we must add to (140.8) the term independent of k
-2Z,eV(47r)(p 2 e*),
and similarly for the diagram (140.2b). The final result is
My = 2Z1eV(4ir)«*[pï^-pÇ +P r | ^ - p ^ ] ^ .
(140.10)
§ 141
The Low-energy Theorem for Photon-Hadron Scattering
635
The problem can be solved by means of this formula, which may be more
compactly written by using the identity
and similarly for 5/dpi, and the differential operators
AX
!
*
=
Ptr* kv~—
(Pifc)
dp\
(140.11)
—
dpt
and similarly for d\^. Then
Mf = Z,eV(47r) e*{ât + an
Mf.
(140.12)
The cross-section is given by \Mfi\2; to the appropriate accuracy,
\Mfi\2 = |M}rn|2 + 2 re (M^Mf*).
(140.13)
The second term gives the required correction to the emission cross-section.
Summation over the polarizations of the photon gives as the value of this correction
-4ir(Z,e) 2 ( ^ - j^f
(â\ + lUMf\\
(140.14)
Thus the correction to the emission cross-section is expressed in terms of the
cross-section for the elastic process and its derivative with respect to s.
If the charged hadron has spin i the calculations are unchanged in principle;
only the specific form of the vertices and propagators is altered. It is found that
formula (140.14) rémains valid after averaging over the polarizations of the hadrons
and the photon (T. H. Burnett and N. M. Kroll, 1968).
§141. The low-energy theorem for photon-hadron scattering
In the limit of low frequencies, the cross-section for the scattering of a photon
by any charged particle at rest tends to its classical value given by Thomson's
formula. This limit corresponds to an amplitude independent of the photon
frequency o>, which we denote by M f. It is found, however, that not only the first
term in the expansion of the amplitude in powers of a>,
Mfi = Mf + M<H).
(141.1)
but also the next term (M(I) ~ o>), are independent of the specific electromagnetic
structure of the hadron for photon scattering, as well as for the bremsstrahlung
discussed in §140 (F. E. Low, 1954; M. Gell-Mann and M. L. Goldberger, 1954).
636
§ 141
Electrodynamics of Hadrons
This process is represented by diagrams of three types:
•
,k
,k*
kx
kN
(141.2)
of which the first two again have a one-particle intermediate state, and therefore a
pole singularity.
The analysis and the principle of the calculations are the same as in §140. In
practice, we need only determine the contribution from the pole parts of the
diagrams (141.2a) and (141.2b), expressing their electromagnetic vertices in terms
of the static form factors (the charge Ze and the anomalous magnetic moment /i an ),
as in (140.15).
Unlike the bremsstrahlung case, however, the corrections to the Comptoneffect cross-section are important only for particles that have spin. This is because,
for bremsstrahlung, as well as the spin-dependent corrections, there are corrections
arising from the energy dependence of the amplitude of the "elastic" process. In
photon scattering, this amplitude is replaced by form factors which, for "physical
external lines", are constants independent of the energy, and therefore the correactions arise only from the magnetic moment, which is zero for spinless particles.
We shall discuss the scattering of a photon by a spin4 hadron.
If Mft denotes the contribution of the pole diagrams to the scattering amplitude,
then (cf. (86.3), (86.4))
Mfi = -47r(Ze)2e^ev(üQ^u),
(141.3)
where
Q*" = (y* + S'^yP+^M
s=(p
(yv - S") + <y" - S") yP
2
+ k) = (p' + k')\
u
2
= (p-k')
u
^
M
( 7 * + S"), 1
(141.4)
J
= (p'-k)\
and for brevity we have put
Mancr^kA = ZeS\
/x an o^kl = ZeS'».
(141.5)
By interchanging the operators yp + M and using the equations ö ' ( 7 P ' - M ) =
(yp - M)u = 0, we can transform (141.4) to
Q-=[<*•+s"> ( 7 l % k + 2 p *+ 7 > ( y" <v + s*>] _ \y'(yk')
[
+ 2p"
2p'lc'
' [S" y" +£k+
*
*
L
c..y< (yk')-2p'"\
2pk'
J
" "*' TP ~2pk'+ M S"\
M S
(141 6
'>
§ 141
The Low-energy Theorem for Photon-Hadron Scattering
637
This form, and the corresponding one with k and k' interchanged, clearly show that
(141.3) is gauge-invariant; the relevant condition is
K(ü' Q^u) = (ü(Q^u)K = 0,
(141.7)
in verifying which it must be remembered that (yk)(yk) = 0, kS = k'S' = 0.
Since the pole part of the scattering amplitude is thus gauge-invariant by itself,
so must be the regular part of the amplitude (which includes the contribution of the
diagram (141.2c)). Hence in turn it follows that the expansion of this part in powers
of k and k' must begin with quadratic terms; cf. the similar comment relating to the
condition (127.5). That is, the regular part of the amplitude includes only terms
starting with those proportional to ODCU' — CO2, and makes no contribution to the
terms concerned here, which are proportional to a>° and cu1. These are therefore
included in (141.3).
To calculate the terms in question, we use the laboratory system, in which the
initial hadron is at rest. For the photons, we take a three-dimensionally transverse
gauge, in which e0=eo = 0. Then pe = 0, p'e f * ~ |p'| ~ o>, and from (141.6) it is
obvious"that the leading terms in the expansion of Mfi will be proportional to co°,
and that the terms in /xan will contribute only to the cu1 terms.
The wave amplitudes of the initial and final hadrons in the laboratory system
are, with the necessary accuracy,
M=V(2M)(JJ),
ö' = V(2M)[w'*,-^(k-k')a],
where w and w' are three-dimensional spinors.
A straightforward calculation gives the result
M }?> = -8ir(Z«)2(e'* -e)(w#*iv),
(141.8)
M},!) = - 167riM/uianfc)(w'* crw) • (n' x e'*) x (n x e) - 47riZ£fzanco(w'* <JW) • {n(n x e • e'*) +
+ (n x e)n • e'* - n'(n' x e'* • e) - (nf x e'*)n • e - 2e'* x e},
(141.9)
where n = k/cu, n' = k'/cu'.
The scattering cross-section is
to-skwrfEv*0"-
(14110)
see (64.19). For scattering by a charged particle, both M/P and Mf? are non-zero.
The accuracy used allows us to retain in \Mfi\2 the terms \Mf\2 and re(M f Mfî*).
The first of these gives the Thomson scattering. The second becomes zero on
averaging over the polarizations of the photons and hadrons. In scattering by a
638
Electrodynamics of Hadrons
§142
charged hadron, therefore, the corrections under consideration occur only in the
polarization effects.
For scattering by an electrically neutral hadron, M f = 0 and the cross-section is
determined by IM}!*)2. After averaging over the polarizations of the final particles
and summing over those of the initial particles, it is (in ordinary units)
da = ^ 5 - (2+ sin2 #) do\
(141.11)
where # is the photon scattering angle and the anomalous magnetic moment is
equal to the total moment /x. The angle dependence of this cross-section is the
same as for antisymmetric scattering (see §60, Problem 2).
§ 142. Multipole moments of hadrons
Let us now consider the transition current corresponding to a diagram of the
same kind as (138.2):
Ik
i
4
«^S
(142.1)
but with the lines p\ and p 2 pertaining to different particles (masses Mi and M 2 ); the
photon line k = p\-p2 will be more conveniently represented here as leaving the
vertex. The photon may be either virtual or real, the only necessary condition being
k2 < (Mi - M2)2, so that the value k2 = 0 is permissible. Thus the applications of this
diagram include, in particular, processes of photon emission in transformations of
nuclei as well as other particles (for nuclei, the initial and final particles are the
same nucleus in different states).
The most interesting case here is that in which the wavelength of the photon is
large compared with the characteristic "dimensions" of the particle (i.e. those
which appear in its form factors, equal to the "radius" in the case of a nucleus).
Then the transition current can be expanded in powers of k.t
Note first of all that we must have
J/, = 0 w h e n k = 0 ,
(142.2)
since the limit k -+Q corresponds to a potential constant in space and time, but such
a potential has no physical significance and cannot give rise to real processes. The
same conclusion can be reached by a more formal argument: the currents discussed
in §138 were non-zero for k = 0 on account of the terms proportional to the
4-vector P = p i + p2» but when M)^M2 the product P k ^ O , and such terms are
therefore forbidden by the condition for the current to be transverse.
t The following treatment is due to V. B. Berestetskiî (1948).
639
Multipole Moments of Hadrons
§142
This condition for the current Jfi = (p/t> J/,) is, in three-dimensional form,
k • J/,-= cop*-,
(142.3)
and can be satisfied in two ways:
Jfi = tov(k, eu),
or
J/i = k x a ( k , o ) ) ,
pfi;= k • v(k, a>)
(142.4)
P/i=0.
(142.5)
Here v is a polar vector and a an axial vector. The current is said to be of the
electric and magnetic type respectively. According to (142.2), v and a are finite or
zero when k, to ->0.
Let the photon energy to <^ Mi. Then the recoil may be neglected, and the final
particle M2 also may be regarded as being at rest (in the rest frame of Mi);
a) = Mi - M2 is given quantity. The states of the particles Mi and M2 at rest are
specified by three-dimensional spinors wx and w2 of ranks 2s\ and 2s 2 , where s\ and
52 are the spins of the particles. The transition current must be a bilinear combination of w\ and w?. From the products of the components of these spinors, we
can form irreducible tensors with ranks / = s\ + s 2 , . . . , \s\ - s2|; for a given /, they
are true tensors or pseudotensors according to the internal parities of the particles
Mi and M2. Apart from these tensors, we have available only the vector k. In order
to obtain the first term in the expansion of the current in powers of k, we must
form from these quantities a vector of the lowest possible power of k. This is done
by taking the tensor of lowest rank and contracting it / — 1 times with the vector k.
This will give the polar vector v or the axial vector a.
Let Qim be the spherical components of the tensor formed from the wave
amplitudes of the particles. The spherical components of the tensor of rank / - 1
formed from the components of k are |k|/_1 Yi_iffn(n), where n = k/to. From the general
rule for the addition of spherical tensors (see QM, (107.3)), the spherical components
of the vector v may be written
V(4ir)
111 + 1 . ,i-, v / ! - l
1
l
\n
Y
fnï
where A takes the values 0 and ± 1 ; the choice of the common factor is explained
below. Using formulae (7.16), we can express v in terms of spherical harmonic
vectors:
V=
'" (2/ -V\)V$U(2l
+ 1)] ? ( - ^ ' " " Q ' - ^ V d + l)Yfä(n) + VlYfä(B)].
(142.6)
Substitution in (142.4) gives the El transition current:
J/i =
'" (21 -\ffî[l(2l
p
"
=
+ 1)] ? ( ~ 1 ) ' ' m Q ^ ' ^ V ( / + l)Yfä(n) + V l Y ) » ] ,
'" ( 2 t - ^ ! V ( 2 < - f l ) ? <-1>,~mQt-«Y««<nfc
(142.7)
<142-8>
§142
Electrodynamics of Hadrons
640
|k| and co are distinguished in each formula, with a view to possible applications to
real photons and also virtual photons, for which the two quantities are not equal.
In (142.7) and (142.8) it is assumed that the spherical tensor Qlm (here denoted
by Q\m) is a true tensor. If it is a pseudotensor (denoted by Q\%})9 then (142.6) defines
the pseudovector a, and substitution in (142.5) gives the Ml transition current:
J
" = '" raVï(5ÎTT) M ? <-'>'-Q'ÄY!:»(n), I
P/i=0.
(142 9)
'
The qyantities Q\em} and Q\™] are the hadron electric and magnetic multipole
transition moments. Their role in hadron electrodynamics is exactly analogous to that
of the corresponding quantities in electron electrodynamics. However, for electron
systems these moments can in principle be calculated from the wave functions (as the
matrix elements of the corresponding operators), whereas in hadron electrodynamics
they occur as phenomenological quantities whose values are determined from
experiment.
The normalization of these quantities in (142.7)—(142.9) is chosen so as to agree
with their definition in §46. This can be verified by regarding the currents (142.7)(142.9) as Fourier components of the transition current in the coordinate representation. For example, expanding the factor e~lk r in the integral
PrAk) = fpfl(r)e-ik-rd3x
(142.10)
by means of (46.3), we get
fffi(k) = Am1 2 Ytmin) ( P/i(r)Ytm(r/r)gl(|k|r) d3x.
im
J
Retaining the term with the smallest value of I such that the integral is non-zero,
and replacing gi (|k| r) for |k|r < 1 by the first term in its expansion (46.5), we return
to (142.9), with
Qfö=s
yj^fxjrlpfi(r)Ylm(rlr)d3x,
(142.11)
in agreement with the definition (46.7).
It can also be shown that, when applied to the emission of a real photon,-the
formulae derived above give results already known. The amplitude of a transition
with emission of a photon having momentum k = wn and polarization e = (0, e) is
Mfi = - eV(4tr)e* • J,,-.
(142.12)
If the nucleus has definite values of the angular-momentum component Mt and Mf
in the initial and final states, only one term remains in each sum over m in
(142.7M142.9), namely that with m^M{-Mf.
Since, from (16.23), the products
§ 142
641
Multipole Moments of Hadrons
(X)
)
(A)
(A)
Yfö • e * and Y}^ • e * (where A = ± 1 is the helicity of the photon, and e ±n) are
proportional to D[m, we obtain again the formulae given in §48.
The differential emission probability ist
dw = 27rS[o> - (E, - E/)]|M/i|2d3k/2û>(2ir)3,
(142.13)
where E, and Ef are the initial and final energies of the nucleus. The total
probability is found by summation over the polarizations and integration over d3fc.
Substituting (142.7) or (142.9) in (142.12) and thence in (142.13), and performing the
operations just mentioned, we again obtain (46.9) or (47.2).
Formulae (142.7M 142.9) include all possible cases of the emission of a real
photon. For virtual photons, there is another possible case which they do not
include (R. H. Fowler, 1930).
If the spins and parities of the initial and final states of the nucleus are the
same, we can obtain from their wave amplitudes a scalar Q0, and from this a
transition current of the form
P/i
= Qok2,
Jfi = Qo<ok.
(142.14)
Qo is called the monopole (E0) transition moment. The corresponding transition
amplitude for the emission of a real photon is zero, since e* • k = 0. The monopole
current, however, may give rise to transitions involving the emission of a virtual
photon. It is, moreover, the only such source when S\ = s2 = 0 and all the multipole
moments are zero.
The monopole current (142.14) is analogous to the electric quadrupole
current as regards its dependence on a> and k. Accordingly, the moment Q0 also is a
quantity of the same order as the quadrupole moment. The same conclusion can be
reached by regarding (142,14) as the Fourier components of the current in the
coordinate representation. Using in (142.10) the expansion of e~ikT in powers of
k • r and assuming that pfi(r) is spherically symmetrical, we get
P/i(k)
= ^k 2 J P/l (r)r 2 d 3 x.
Comparison with (142.14) shows that
Qo= ~{j pfi(r)r2d3x.
(142.15)
The similarity to the quadrupole moment is obvious.
PROBLEMS
PROBLEM 1. Find the probability of ionization of an atom from the K shell because of the
excitation energy <o of the nucleus (called internal conversion of 7-rays), in an Ml nuclear transition,
t The factor 2TT8 in this formula, replacing (27r)4S(4) in (64.11), arises because momentum is not
conserved when the recoil of the nucleus is neglected, and so only the conservation of energy remains.
642
§ 142
Electrodynamics of Hadrons
neglecting the binding energy of the electron in the atom and the influence of the nuclear field on the
wave functions of the nucleus.t
SOLUTION. The process is described by the diagram
(1)
where p\ and pi pertain to the nucleus at rest in different states, and p = (m, 0) and p' = (m + a>, p') are
the 4-momenta of the initial and final electrons. This diagram corresponds to the amplitude
-e2-^ü(p')(yJfi)u(pX
Mfi =
where Jfi is the transition current of the nucleus. After summation over the final polarizations of the
electron, and averaging over its initial polarizations, we get
1 2 |M/1|2 = e 4 ^ { q U ^ 0 + 4(J/,p)(J)fip)},
polar.
\({
)
using the fact that Jfiq = 0 and therefore iyxp = J/jp'. The conversion probability is calculated from
dwconv = 2|iM0)|2 (1^ da)
/p-^)
\m
,
where da is the cross-section for the scattering process represented by the diagram (1) with p = (e, p),
and \\ti is the wave function of the atomic electron; for a K electron |»M0)|2 = (Zam) I IT. The factor 2
takes account of the two electrons in the K shell of the atom. The cross-section da is
da = 2*8(s + to - « ' ) | M / , | 2 2 ( p | . 2 e , ( 2 i r ) j ;
cf. the last footnote.
For Ml transitions, the current Jfï must be taken from (142.9). The integration of dwCOnv over de'
removes the delta function, and the integration over do' replaces jYlJU^2 by unity. The conversion
probability is thus expressed in terms of |Qi.^!n|2. But, according to (46.9), the probability wy for the
spontaneous emission of a photon in the same nuclear transition can be expressed in terms of the same
quantity. The final result is
^
Wy
= 2a(Za) 5 ^(l+^Y + ',
CO \
to
)
this ratio being called the conversion coefficient.
PROBLEM 2. The same as Problem l, but for an El nuclear transition.
SOLUTION. By the same method, with the transition current given by (142.7) and (142.8), we obtain
I m2\/4
2m'
"-™^+Th5)(>+W
'
t This approximation implies that the nuclear charge is small and the excitation energies o> are
sufficiently large (but l/o> is assumed large compared with the dimensions of the nucleus). In practice,
the approximation is somewhat unsatisfactory, and a more precise calculation has to take into account
the Coulomb field of the nucleus.
§ 143
Inelastic Electron-Hadron
Scattering
643
PROBLEM 3. The same as Problem 1, but for a monopole nuclear transition.
SOLUTION. With the transition current given by (142.14), the result is
wCOnv= 1 6 a 2 ( Z a ) 3 m V ( l + ^ )
|Qo|2.
Since monopole emission of a photon is impossible, |Qo|2 cannot be eliminated.
§ 143. Inelastic electron-hadron scattering
Elastic electron-hadron scattering has been discussed in §139. The problem of
inelastic scattering may be formulated similarly. The only difference is that the final
hadron state now corresponds to another hadron or several hadrons. The law of
conservation of momentum (139.1) remains valid if p'h denotes the 4-momentum of
the final hadron or the total 4-momentum of the group of hadrons formed in the
scattering process. We thus have now p'h2 ^ pl = M 2 , where M is the mass of the
initial hadron.
With this difference, the inelastic scattering process is described by the same
diagram (139.2). The lower vertex of the diagram is denoted by //, as in §138.
However, in contrast to (138.3) or (138.6), we shall not express the transition
current in terms of the vertex operator and the amplitudes of the states, in order
not to specify in advance the nature of the final hadron state.
We can now write the scattering amplitude in a form analogous to (139.3):
\Pe
P e)
a similar amplitude has already been used in §142, Problem 1, where energy
transfer to an electron was considered, and the amplitude has a similar structure in
the problem of the excitation of nuclei by electrons.
We shall assume that the initial electron energy is so high that many hadrons
can be formed in the final state, and consider the "inclusive" cross-section, for
which only the electron momentum in the final state is fixed, with summation over
all the hadron states. This differential cross-section is written as follows, in
accordance with the formulae of §64:
dor = 4 f ( 2 ^ 2 g ; 2 (27r)A8{4\ph + p'h-pe
- p3|Af„-|2.
(143.2)
The inclusive cross-section can depend only on three kinematic invariants,
which may be determined by measurements on electrons only. The three invariants
are
t = q2s (p€ - p'e)\
s=(pe + ph)\
(143.3)
and PH2. The need to include the third invariant arises because, in contrast to the
Electrodynamics of Hadrons
644
§143
case of elastic scattering, p£2, the "mass" of the final hadron state, is now
unspecified. Instead of it, however, another invariant is more convenient, namely
v = qph.
(143.4)
The relation between v and pk2 follows from Ph = ph + q:
p'h2 = M2+t+2v.
(143.5)
If the initial hadron is stable (for instance, a proton), the rest energy of the final
state exceeds M, i.e. p'h2^M2, and from (143.5), since t < 0 , we have
(143.6)
t\,
the equality occurring for elastic scattering.
The kinematic invariants can be expressed in terms of the electron energies ee
and Be in the initial and final states and the scattering angle 0. We shall assume the
electron to be ultra-relativistic (ee > m, e'e> m), and neglect its mass. Then, in the
rest frame of the initial hadron (the laboratory system),
t = - 4ete't sin21 fl, v = M (ee - e'J,
s - M2 = 2MB.
(143.7)
Substituting (143.1) in (143.2) and summing as usual over the electron polarizations, we obtain the scattering cross-section for unpolarized electrons, which we
write as
d
°
=
7^5
n
jYir
, HWW",
(q ) (2tr) • fSMBcBe
(143.8)
or
43 9)
<*•<&£&*-•""
°-
>V = 4ptlLp'ev - 2(ptllqv + p€vq») + q2g»„
(143.10)
W»v = Ç (27r) 4 5 (4) ( P ;-p h - q)JWr*
(143.11)
where
The tensor WM" of course depends essentially on the properties of the hadron
currents, and in general we can only pose the problem of its phenomenological
structure, similarly to the problem of hadron form factors. We first use the fact that
the tensor structure of W*1" must be determined only by the 4-vectors relating to
the lower vertex of the diagram (139.2), i.e. p* and q. From these (and the metric
tensor gMI/) five independent tensors can be constructed. The requirement of
invariance under time reversal means that the tensor must be symmetrical, and
§ 144
Hadron Formation from an Electron-Positron
Pair
645
there are four such. Lastly, the condition of current conservation, i.e.
W^qv = 0,
W " ^ = 0,
reduces the number of independent tensors to two. These may be taken as
TÏÏ = ^ - g ^
TJ2 = (PH. ~ vqjt)(phv
- vqjt),
(143.12)
and WßV be written as
VV^ = 47rMVVlT^ + (47r/M)W 2 Tg.
(143.13)
Substituting (143.10) and (143.13) in (143.8), we put the cross-section in the form
da = (W2 + 2 Wx tan2 \d) de'e datU
(143.14)
where
a 2 cos 2 20 . ,
.
is the cross-section for scattering of an ultra-relativistic electron in a Coulomb
field; cf. (80.7).
We see that the cross-section is determined by two structure functions which
depend on the two invariants t and v. If the physics of hadrons at high energies
does not contain characteristic quantities having the dimensions of mass (the
hypothesis of scale invariance), we may expect that the structure functions
depend, at high energies, on the only dimensionless parameter, tlv. Then the
functions Wx, W2 must be functions of one variable:
Wx = (Mlv)Fx(tlv),
W2 = (Mlv)F2(tlp);
(143.15)
the ratio M\v is independent of M.
§ 144. Hadron formation from an electron-positron pair
Let us now consider the transformation of an electron-positron pair into
hadrons. We denote the 4-momenta of the electron and positron by p_ and p+, and
the (total) 4-momentum of the hadrons formed by ph, with p- + p+ = ph. The process
is represented by the diagram
-p
P
;
i
(144.1)
646
Electrodynamics of Hadrons
§144
The lower vertex here corresponds to the transition current from the vacuum to a
hadron state |n), which we denote by <n|J|0) as in §104.
The diagram (144.1) corresponds to the scattering amplitude
M„ = - (4Tralq2)ü(-p+)yMP-)n(n\J*\0).
(144.2)
We shall consider the total cross-section <7h for annihilation into hadrons, i.e. sum
over all final states |n). Then, in accordance with (64.18),
<r» = 4 y 2 |Mn|2(27r)46(4)(ph-q),
(144.3)
where q = p- + p+. The mass of the electron will henceforward be neglected; then
q2 = 2(p_p+),I = ^ 2 .
As in §143, we write the cross-section in the form
<rh = (4TT)2W^WJ2I\
(144.4)
where
w»v = a{ptqv + p *q* - 2p?p: - iq V ) ,
(144.5)
W^ = a^ (27r)4S(4)(p* - qKO^InXnl/JO)
(144.6)
n
and t = q 2 >0.
Note that t is the only kinematic invariant in this problem, with the three-ended
diagram (144.1), and q is the only 4-vector on which W^ can depend. Hence, by
the requirement of current conservation, the tensor W^v may be written
V^ = W 0 ( ^ - g , , ) ,
(144.7)
where ph(t) is the only invariant function, depending on the properties of the
hadron current and determining the annihilation cross-section. Substitution of
(144.5)-(144.7) in (144.4) gives
*h = (47T2a/t2)p,(0.
(144.8)
Note that the function ph(t)= -lW» is exactly the same as p(t) defined in
(104.9) if the currents in the latter equation are taken to be hadron currents.
Moreover, p(t) is the spectral density of the self-energy function 11(0:
im 11(0 = - MOIn the lowest approximation with respect to a, which is being considered here, the
function II is the same as the polarization operator 9. In this approximation,
§ 144
Hadron Formation from an Electron-Positron Pair
647
therefore, ph(t) is also the spectral density of the hadron contribution to the
polarization operator:
i m ^ ( t ) = -7rph(0.
(144.9)
Using the dispersion relation (111.13) and expressing ph in terms of ah by (144.8),
we get
0
which expresses the hadron contribution to the polarization of the vacuum in terms
of the measured cross-section for annihilation into hadrons.
Note that we could, in exactly the same way, solve the problem of electronpositron pair annihilation to form a muon pair (in the first approximation with
respect to a, only one such pair is formed). Corresponding to the formula (144.8),
the result is
orM = (47r2a/t V ( 0 ,
(144.11)
where pß(t) is the spectral density of the muon polarization of the vacuum. It
differs from the electron polarization only in that the electron mass m is replaced
by the muon mass /x, and, from (113.8), it is
p»(t) = <a/3irXl + 2fx2)V(l - 4ft2/f).
Substitution in (144.11) brings us back to the result already derived in §81, Problem
8.
This page intentionally left blank
INDEX
Absorption of radiation 161-4
Annihilation
diagram 291
operator 11,38-40
of pair 368-76
three-photon 373
Antiparticles 38, 93, 99-101
Anti-Stokes case 221
Asymptotic form
of electron-muon scattering amplitude 616-23
of electron propagator 529
of photon propagator 597-601
of scattering amplitudes 603-8
of soft-photon emission cross-section 534
of vertex operator 614-16
Asymptotic freedom 602 n.
Atomic levels, radiative shift of 544-52
Atoms
interaction at large distances 347-53
radiation from 181-97
scattering by 231-7
Bare charge see Intrinsic charge
Bare photon propagator 459
Bethe-Heitler formula 410
Bethe-Salpeter equation 556
Bhabha scattering 324
Bilinear forms 101-6
Bispinor 70
Bosons 14, 33-61
Bound states, relativistic equation for
Breit
equation 341
frame 627
Bremsstrahlung 389-409, 419-44
Centrally symmetric field
electron in 128-40, 389^09
scattering in 140-8
Channels, reaction 257
Charge
conjugation 45-6
of spinors 94-9, 103
of field 41
parity 46
Chronological operator 284
Coherence length 408
Compact part of diagram 463, 469
Compton effect 292-5, 354-68
Contraction of operators 288
Correspondence principle 166
552-9
Coulomb
field, electron in 133-40, 389-409
scattering in 141-2, 144-8
gauge 303
law, radiative corrections to 504-8
CPT theorem 46
Creation
operators 11, 38-40
of pair 371, 386-8, 410-19, 443-9
Cross-channels of reaction 257
Crossing
invariance 311
symmetry 311
Deuteron photodisintegration 216-20
Diatomic molecules, radiation from 197-204
Dipole radiation 164-6
Dirac
conjugate function 77
equation 74-80, 118-26
matrices 76-84
Dispersion relation 494, 516
double 561
double-subtraction 496
with one subtraction 516
without subtraction 516
Dotted indices 63
Double emission 228
Double-logarithmic
accuracy 521
terms 608-14
Dyson's equations 477
Effective external lines 464
Einstein coefficients 164
Electric multipole radiation 166-71, 181-6
Electromagnetic
field equations, radiative corrections 575-85
field quantization 5-32
interaction 159-61
Electron
interaction 159, 317-455
propagation function (propagator) 294-300
scattering of 286-92, 317^3, 534-44
scattering of photon by 292-5, 354-68
Emission of radiation 161-220, 389-413, 419-44,
449-55
angular distribution 176-9
double 228
induced 162
stimulated 162
649
650
Energy levels, radiative shift of 544-52
Energy loss cross-section 332
Equivalent photons method 438-44
External
field 118-58,317-43
lines 290
effective 464
Fermions 62-117
Feynman
diagrams 289, 304-16
gauge 302
rules 298
Fine structure in hydrogen 126-8
Form factors
charge 628
electric 628
electromagnetic 514, 625
of electron 514-21
magnetic 628
Franck and Condon's principle 202
Free ends 290
Furry
representation 482
theorem 315
Gauge
Coulomb 303
Feynman 302
invariance 14
Landau 302
three-dimensionally transverse 13
transformation 12, 28, 43, 470
global 119n.
local U9n.
Hadrons 624
electrodynamics of 624-47
formation from electron-positron pair 645-7
multipole moments of 640
Heisenberg representation 10, 284 n., 456-8
Helicity 25,55
scattering amplitudes 265-78
states 55
spherical 56
Hydrogen atom 126-8, 192-7, 208-11
Infra-red
catastrophe 436
divergence 519, 541
Interaction representation 283, 456-8
Intrinsic electron charge 489, 601-3
Invariant
amplitudes 274-8
kinematic 256-8
perturbation theory 283-316
Inversion
combined 47
Index
four-dimensional 39
of spinors 68-74, 104
three-dimensional 17,44-5
Ionization losses of fast particles 330-6
Irreducible vertex part 474
Källen-Lehmann expansion 495
Kinematic invariants 256-8
Klein-Nishina formula 356
"Ladder" diagram 556, 620
Lagrangian operator 36 n., 94
Lamb shift 128, 544
Landau gauge 302
Light cone 286 n.
Line width 240-4
Logarithmic
approximation 405
corrections 504
Lorentz group 39
extended 44 n.
proper 44 n.
Low-energy theorem
for bremsstrahlung 632
for photon-hadron scattering 635
Magnetic moment
anomalous 152, 157
of electron 517, 521-4
of muon 523-4
Magnetic multipole radiation 171-3, 186-8
Majorana scattering 80, 99
Mandelstam
plane 260
representation 561
Mass
operator 496, 484, 524-9
surface 512
Measurement in the relativistic case 1-4
Mesic-atom levels, radiative shift of 551-2
Metric tensor xiii
Molecules
radiation from 197-204
scattering by 231-40
Miller scattering 323
Monopole transition moment 641
Multipole
moments of hadrons 640
radiation 166-73
Natural width of spectral lines 240-4
Neutrino 113-15
Neutron scattering 157-8
Normal product 10 n., 305
Notation xiii-xv
Nuclei, radiation from 205-7
Index
Observables in the relativistic case
Occupation numbers 9
Optical theorem 280
Oscillator expansion of field 6
Pair
annihilation 368-76
production 371, 386-8, 410-9, 443-9
Parametrization method 538, 592
Parity
charge 46
internal 44, 99-101
orbital 44
of photons 17-18
of spinors 68-70
of term 198
total 44
Partial polarization 26-9, 106-11
Particles
and antiparticles 38, 93, 99-101
in an external field 118-58
polarization 252-6, 326-8, 359-68
in the relativistic case 33
scalar 34
strictly neutral 42
vector 50, 304, 530
Pauli
equation 123
matrices 66, 74
Perturbation theory 283-316
Photodisintegration of the deuteron 216-20
Photoelectric effect 207-16
Photons 11-32, 159-220, 354-455; see also Particles
angular momentum 17-18
electric (Ej) 18
equivalent 438-44
finite mass 519, 524, 530
magnetic (Mj) 18
parity 17-18
polarization 24-9,173-81,255-6, 359-68,370-1,
405-6
propagation function (propagator) 287, 300-4
scattering 221-82, 292-5, 354-68, 566-75
soft 529-34
splitting 585-92
wave functions 19-24
Physical regions 261
Plane waves 84-7, 148-51
Polarizability tensor 227
Polarization
density matrix 26-9, 52, 106, 114
moments 178
operator 463, 501-4, 508-13
partial 26-9, 106-11
of particles 252-6, 326-8, 359-68
of photons 24-9, 173-81, 255-6, 359-68, 370-1,
405-6
vacuum 484
Pole diagrams 313, 490, 636
651
Positron, see Antiparticles; Electron; Pair
Positronium 101, 343-7, 371-6, 552-9
Propagation functions (propagators) 287, 294-304
bare photon 459
exact electron 468-72, 529
in external field 481-7
renormalization of 488-93
exact photon 459-65, 597-601
analytical properties of 493-6
renormalization of 487-8
free electron 469
free photon 459
of scalar field 299
of vector particles 304
Proper part of diagram 463
Quantization of electromagnetic field 5-32
Quasi-momentum 450
Radiation, see Photons
length 408
Radiative
corrections 456, 501-96
shift of atomic levels 544-52
Raman scattering 221
induced 229
Rarita-Schwinger equation 116
Rayleigh scattering 221
Reaction channels 257
Reducible vertex part 474
Regularization of Feynman integral 498, 524
Relativistically conjugate function 77
Renormalizability 499
Renormalization 489-500
group method 603 n.
Representation
Furry 482
Heisenberg 10, 284 n., 456-8
interaction 283,456-8
Majorana 80, 99, 100 n.
Mandelstam 561
Schrödinger 456
standard 79
Resonance fluorescence 244-6
Retardation, effective 332
Scattering
amplitude 247, 265
helicity 265-78
coherent 225
diagram 291
of electrons 286-92
in external field 317-43, 534-44
electron-hadron 629-32, 643-5
electron-muon 616-23
induced 229
matrix, see S-matrix
of neutrons 157-8
652
Index
Transition
current 160
line strength 182
moments 168, 172, 640, 641
Transversality condition 5
four-dimensional 13
Two-photon system 29-32, 228-30, 368-76
Scattering—Continued
of particles 247-82
in centrally symmetric field 140-8
of photons 221-82, 292-5, 354-68
photon-hadron 635-8
photon-nucleus 573-5
photon-photon 566-73
scalar 232
tensor 223
Schrödinger representation 456
Schwinger terms 468 n.
Screening of nucleus 406-7
Self-energy function of photon 465-8
Self-energy part
electron 469
photon 463
Sign of level 198
Skeleton diagrams 474 n.
S-matrix 3, 247
unitarity 278
Soft photons 529-34
Spectral density
of amplitude, double 561
of photon self-energy 468
Spence's function 407, 520, 596
Spherical
harmonic spinors 88
harmonic vectors 19-21
helicity states 56
unit vectors 23
waves 87-91
Spin and statistics 91-4
Spin in external field 151-7
Spinors 62-117
Spin-zero particles 34-50
Standard representation 79
Stark effect 191,195
Stokes
case 221
parameters 26
Strictly neutral particles 42
Synchrotron radiation 376-86
Ward's identity 479
Wave equation
for higher-integral-spin particle 53-5
for photon 12
for spin-0 particle 35
for spin-2 particle 74
for spin-1 particle 50-3
for spin-2 particle 115-17
see also Dirac's equation; Pauli's equation
Weizsäcker-Williams method 439
Wick's theorem 305
Thomas half 126 n., 156 n.
Time reversal 27, 46-9
of spinors 96-9, 103-4
Zeeman effect 189, 344
"Zeroizing" of charge 602
Uncertainty principle in the relativistic case
Unitarity 278
Unrenormalized charge see Intrinsic charge
Vacuum
polarization 484
state 10
Vector particle 50, 304, 530
Vertex 290
function 473
operator 473
double-logarithmic asymptotic form
part 473-81
Virtual
particle 312
photon 290
state 312
1-4
614-16