Vol 4 Landau Lifshitz Quantum Electrodynamics

Sergio Manuel

1979

669 pages

COURSE OF THEORETICAL PHYSICS
Volume 4, Second Edition

QUANTUM ELECTRODYNAMICS

by V. B. BERESTETSKII, E. M. LIFSHITZ and L. P. PITAEVSKII
Institute of Physical Problems, U.S.S.R. Academy of Sciences

Translated from the Russian by J. B. SYKES and J. S. BELL

Other titles in the COURSE OF THEORETICAL PHYSICS by LANDAU and LIFSHITZ
Volume 1 Mechanics, 3rd Edition
Volume 2 The Classical Theory of Fields, 4th Edition
Volume 3 Quantum Mechanics (Non-relativistic Theory), 3rd Edition
Volume 5 Statistical Physics, Part 1, 3rd Edition
Volume 6 Fluid Mechanics, 2nd Edition
Volume 7 Theory of Elasticity, 3rd Edition
Volume 8 Electrodynamics of Continuous Media, 2nd Edition
Volume 9 Statistical Physics, Part 2
Volume 10 Physical Kinetics

Butterworth-Heinemann is an imprint of Elsevier
Linacre House, Jordan Hill, Oxford OX2 8DP, UK
30 Corporate Drive, Suite 400, Burlington, MA 01803, USA

First edition 1971
Reprinted 1974
Second edition 1982
Reprinted 1989, 1994, 1996, 1997, 1999, 2002, 2004, 2006, 2007, 2008

Copyright © 1982, Elsevier Ltd. All rights reserved

No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior written permission of the publisher.

Permissions may be sought directly from Elsevier's Science & Technology Rights Department in Oxford, UK: phone (+44) (0) 1865 843830; fax (+44) (0) 1865 853333; email: [email protected].
Alternatively you can submit your request online by visiting the Elsevier web site at http://elsevier.com/locate/permissions, and selecting "Obtaining permission to use Elsevier material".

Notice
No responsibility is assumed by the publisher for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions or ideas contained in the material herein. Because of rapid advances in the medical sciences, in particular, independent verification of diagnoses and drug dosages should be made.

British Library Cataloguing in Publication Data
Berestetskii, V B.
Quantum electrodynamics - 2nd ed. (Course of theoretical physics; V4)
1. Quantum field theory
I. Title  II. Lifshitz, E. M.  III. Pitaevskii, L. P.  IV. Berestetskii, V B. Relativistic quantum theory  V. Series

ISBN: 978-0-7506-3371-0

For information on all Butterworth-Heinemann publications visit our website at books.elsevier.com

Printed in the United States of America
Transferred to Digital Printing, 2010

Working together to grow libraries in developing countries
www.elsevier.com | www.bookaid.org | www.sabre.org

PREFACE TO THE SECOND EDITION

THE first edition of this volume of the Course of Theoretical Physics was published in two parts (1971 and 1974) under the title "Relativistic Quantum Theory". It contained not only the basic material on quantum electrodynamics but also chapters on weak interactions and certain topics in the theory of strong interactions.

The inclusion of those chapters now seems to us inopportune. The theory of strong and weak interactions is undergoing a vigorous development founded on new physical ideas, and the situation in this field is changing very rapidly, so that the time for a consistent exposition of the theory has not yet arrived.
In the present edition, therefore, we have retained only quantum electrodynamics, and accordingly changed the title of the volume.

As well as a considerable number of corrections and minor changes, we have made in this edition several more significant additions, including the operator method of calculating the bremsstrahlung cross-section, the calculation of the probabilities of photon-induced pair production and photon decay in a magnetic field, the asymptotic form of the scattering amplitudes at high energies, inelastic scattering of electrons by hadrons, and the transformation of electron-positron pairs into hadrons.

A word regarding notation. We have reverted to the use of circumflexed letters for operators, in line with the other volumes in the Course. No special notation is used for the product of a 4-vector and a matrix vector $\gamma^\mu$, previously denoted by a circumflexed letter; such products are now shown explicitly.

We have, alas, had to prepare this edition without the aid of Vladimir Berestetskiï, who died in 1977; but some of the added material mentioned above had been put together previously, by the three authors jointly.

Our sincere thanks are offered to all readers who have given us their comments on the first edition of the book, and in particular to J. S. Bell, V. P. Kraïnov, L. B. Okun', V. I. Ritus, M. I. Ryazanov and I. S. Shapiro.

July 1979
E. M. LIFSHITZ, L. P. PITAEVSKIÏ

FROM THE PREFACE TO THE FIRST EDITION

IN ACCORDANCE with the general plan of this Course of Theoretical Physics, the present volume deals with relativistic quantum theory in the broad sense: the theory of all phenomena which depend upon the finite velocity of light, including the whole of the theory of radiation.

This branch of theoretical physics is still far from completion, even as regards its basic physical principles, and this is particularly true of the theory of strong and weak interactions.
But even quantum electrodynamics, despite the remarkable achievements of the last twenty years, still lacks a satisfactory logical structure.

In the choice of material for this book we have considered only results which appear to be reasonably firmly established. In consequence, of course, the greater part of the book is devoted to quantum electrodynamics. We have tried to give a realistic exposition, with emphasis on the physical hypotheses used in the theory, but without going into details of justifications, which in the present state of the theory are in any case purely formal.

In the discussion of specific applications of the theory, our aim has been not to include the whole vast range of effects but to select only the most fundamental of them, adding some references to original papers which contain more detailed studies. We have often omitted some of the intermediate steps in the calculations, which in this subject are usually very lengthy, but we have always sought to indicate any non-trivial point of technique.

The discussion in this book demands a higher degree of previous knowledge on the part of the reader than do the other volumes in the Course. Our assumption has been that a reader whose study of theoretical physics has extended as far as the quantum theory of fields has no further need of predigested material.

This book has been written without the direct assistance of our teacher, L. D. Landau. Yet we have striven to be guided by the spirit and the approach to theoretical physics which characterized his teaching of us and which he embodied in the other volumes. We have often asked ourselves what would be the attitude of Dau to this or that topic, and sought the answer prompted by our many years' association with him.

Our thanks are due to V. N. Baïer, who gave great help in compiling §§90 and 97, and to V. I. Ritus for great help in writing §101. We are grateful to B. É. Meïerovich for assistance with calculations, and also to A. S.
Kompaneets, who made available his notes of L. D. Landau's lectures on quantum electrodynamics, given at Moscow State University in the academic year 1959-60.

June 1967
V. B. BERESTETSKII, E. M. LIFSHITZ, L. P. PITAEVSKIÏ

CONTENTS

NOTATION

INTRODUCTION
§1. The uncertainty principle in the relativistic case

I. PHOTONS
§2. Quantization of the free electromagnetic field
§3. Photons
§4. Gauge invariance
§5. The electromagnetic field in quantum theory
§6. The angular momentum and parity of the photon
§7. Spherical waves of photons
§8. The polarization of the photon
§9. A two-photon system

II. BOSONS
§10. The wave equation for particles with spin zero
§11. Particles and antiparticles
§12. Strictly neutral particles
§13. The transformations C, P and T
§14. The wave equation for a particle with spin one
§15. The wave equation for particles with higher integral spins
§16. Helicity states of a particle

III. FERMIONS
§17. Four-dimensional spinors
§18. The relation between spinors and 4-vectors
§19. Inversion of spinors
§20. Dirac's equation in the spinor representation
§21. The symmetrical form of Dirac's equation
§22. Algebra of Dirac matrices
§23. Plane waves
§24. Spherical waves
§25. The relation between the spin and the statistics
§26. Charge conjugation and time reversal of spinors
§27. Internal symmetry of particles and antiparticles
§28. Bilinear forms
§29. The polarization density matrix
§30. Neutrinos
§31. The wave equation for a particle with spin 3/2

IV. PARTICLES IN AN EXTERNAL FIELD
§32. Dirac's equation for an electron in an external field
§33. Expansion in powers of 1/c
§34. Fine structure of levels of the hydrogen atom
§35. Motion in a centrally symmetric field
§36. Motion in a Coulomb field
§37. Scattering in a centrally symmetric field
§38. Scattering in the ultra-relativistic case
§39. The continuous-spectrum wave functions for scattering in a Coulomb field
§40. An electron in the field of an electromagnetic plane wave
§41. Motion of spin in an external field
§42. Neutron scattering in an electric field

V. RADIATION
§43. The electromagnetic interaction operator
§44. Emission and absorption
§45. Dipole radiation
§46. Electric multipole radiation
§47. Magnetic multipole radiation
§48. Angular distribution and polarization of the radiation
§49. Radiation from atoms: the electric type
§50. Radiation from atoms: the magnetic type
§51. Radiation from atoms: the Zeeman and Stark effects
§52. Radiation from atoms: the hydrogen atom
§53. Radiation from diatomic molecules: electronic spectra
§54. Radiation from diatomic molecules: vibrational and rotational spectra
§55. Radiation from nuclei
§56. The photoelectric effect: non-relativistic case
§57. The photoelectric effect: relativistic case
§58. Photodisintegration of the deuteron

VI. SCATTERING OF RADIATION
§59. The scattering tensor
§60. Scattering by freely oriented systems
§61. Scattering by molecules
§62. Natural width of spectral lines
§63. Resonance fluorescence

VII. THE SCATTERING MATRIX
§64. The scattering amplitude
§65. Reactions involving polarized particles
§66. Kinematic invariants
§67. Physical regions
§68. Expansion in partial amplitudes
§69. Symmetry of helicity scattering amplitudes
§70. Invariant amplitudes
§71. The unitarity condition

VIII. INVARIANT PERTURBATION THEORY
§72. The chronological product
§73. Feynman diagrams for electron scattering
§74. Feynman diagrams for photon scattering
§75. The electron propagator
§76. The photon propagator
§77. General rules of the diagram technique
§78. Crossing invariance
§79. Virtual particles

IX. INTERACTION OF ELECTRONS
§80. Scattering of an electron in an external field
§81. Scattering of electrons and positrons by an electron
§82. Ionization losses of fast particles
§83. Breit's equation
§84. Positronium
§85. The interaction of atoms at large distances

X. INTERACTION OF ELECTRONS WITH PHOTONS
§86. Scattering of a photon by an electron
§87. Scattering of a photon by an electron. Polarization effects
§88. Two-photon annihilation of an electron pair
§89. Annihilation of positronium
§90. Synchrotron radiation
§91. Pair production by a photon in a magnetic field
§92. Electron-nucleus bremsstrahlung. The non-relativistic case
§93. Electron-nucleus bremsstrahlung. The relativistic case
§94. Pair production by a photon in the field of a nucleus
§95. Exact theory of pair production in the ultra-relativistic case
§96. Exact theory of bremsstrahlung in the ultra-relativistic case
§97. Electron-electron bremsstrahlung in the ultra-relativistic case
§98. Emission of soft photons in collisions
§99. The method of equivalent photons
§100. Pair production in collisions between particles
§101. Emission of a photon by an electron in the field of a strong electromagnetic wave

XI. EXACT PROPAGATORS AND VERTEX PARTS
§102. Field operators in the Heisenberg representation
§103. The exact photon propagator
§104. The self-energy function of the photon
§105. The exact electron propagator
§106. Vertex parts
§107. Dyson's equations
§108. Ward's identity
§109. Electron propagators in an external field
§110. Physical conditions for renormalization
§111. Analytical properties of photon propagators
§112. Regularization of Feynman integrals

XII. RADIATIVE CORRECTIONS
§113. Calculation of the polarization operator
§114. Radiative corrections to Coulomb's law
§115. Calculation of the imaginary part of the polarization operator from the Feynman integral
§116. Electromagnetic form factors of the electron
§117. Calculation of electron form factors
§118. Anomalous magnetic moment of the electron
§119. Calculation of the mass operator
§120. Emission of soft photons with non-zero mass
§121. Electron scattering in an external field in the second Born approximation
§122. Radiative corrections to electron scattering in an external field
§123. Radiative shift of atomic levels
§124. Radiative shift of mesic-atom levels
§125. The relativistic equation for bound states
§126. The double dispersion relation
§127. Photon-photon scattering
§128. Coherent scattering of a photon in the field of a nucleus
§129. Radiative corrections to the electromagnetic field equations
§130. Photon splitting in a magnetic field
§131. Calculation of integrals over four-dimensional regions

XIII. ASYMPTOTIC FORMULAE OF QUANTUM ELECTRODYNAMICS
§132. Asymptotic form of the photon propagator for large momenta
§133. The relation between unrenormalized and actual charges
§134. Asymptotic form of the scattering amplitudes at high energies
§135. Separation of the double-logarithmic terms in the vertex operator
§136. Double-logarithmic asymptotic form of the vertex operator
§137. Double-logarithmic asymptotic form of the electron-muon scattering amplitude

XIV. ELECTRODYNAMICS OF HADRONS
§138. Electromagnetic form factors of hadrons
§139. Electron-hadron scattering
§140. The low-energy theorem for bremsstrahlung
§141. The low-energy theorem for photon-hadron scattering
§142. Multipole moments of hadrons
§143. Inelastic electron-hadron scattering
§144. Hadron formation from an electron-positron pair

INDEX

NOTATION

Four-dimensional
Four-dimensional tensor indices are denoted by Greek letters $\lambda, \mu, \nu, \ldots$, taking the values 0, 1, 2, 3.
A 4-metric with signature $(+\,-\,-\,-)$ is used. The metric tensor is $g_{\mu\nu}$ ($g_{00} = 1$, $g_{11} = g_{22} = g_{33} = -1$).
Components of a 4-vector are stated in the form $a^\mu = (a^0, \mathbf{a})$.
To simplify the formulae, the index is often omitted in writing the components of a 4-vector.†
The scalar products of 4-vectors are written simply as $(ab)$ or $ab$; $ab \equiv a_\mu b^\mu = a_0 b_0 - \mathbf{a}\cdot\mathbf{b}$.
The 4-position-vector is $x^\mu = (t, \mathbf{r})$. The 4-volume element is $d^4x$.
The operator of differentiation with respect to the 4-coordinates is $\partial_\mu = \partial/\partial x^\mu$.
The antisymmetric unit 4-tensor is $e^{\lambda\mu\nu\rho}$, with $e^{0123} = -e_{0123} = +1$.
The four-dimensional delta function is $\delta^{(4)}(a) = \delta(a_0)\,\delta(\mathbf{a})$.

Three-dimensional
Three-dimensional tensor indices are denoted by Latin letters $i, k, l, \ldots$, taking the values $x, y, z$.
Three-dimensional vectors are denoted by letters in bold type.
The three-dimensional volume element is $d^3x$.

Operators
Operators are denoted by italic letters with circumflex.‡
Commutators or anticommutators of two operators are written $\{\hat f, \hat g\}_\pm = \hat f\hat g \pm \hat g\hat f$.
The transposed operator is $\tilde f$.
The Hermitian conjugate operator is $\hat f^{+}$.

† This way of writing the components is often used in recent literature. It is a compromise between the limited resources of the alphabet and the demands of physics, and means, of course, that the reader must be particularly attentive.
‡ However, to simplify the formulae, the circumflex is not written over spin matrices, and it is also omitted when operators are shown in matrix elements.

Matrix elements
The matrix element of the operator $F$ for a transition from initial state $i$ to final state $f$ is $F_{fi}$ or $\langle f|F|i\rangle$.
The notation $|i\rangle$ is used as an abstract symbol for a state independently of any specific representation in which its wave function may be expressed. The notation $\langle f|$ denotes a final ("complex conjugate") state.†
Correspondingly, $\langle s|r\rangle$ denotes the coefficients in the expression of a set of states with quantum numbers $r$ as superpositions of states with quantum numbers $s$: $|r\rangle = \sum_s |s\rangle\langle s|r\rangle$.
The reduced matrix elements of spherical tensors are $\langle f\|F\|i\rangle$.

† This notation is due to Dirac.

Dirac's equation
The Dirac matrices are $\gamma^\mu$, with $(\gamma^0)^2 = 1$, $(\gamma^1)^2 = (\gamma^2)^2 = (\gamma^3)^2 = -1$. The matrices $\boldsymbol\alpha = \gamma^0\boldsymbol\gamma$, $\beta = \gamma^0$. The expressions in the spinor and standard representations are (21.3), (21.16) and (21.20).
$\gamma^5 = -i\gamma^0\gamma^1\gamma^2\gamma^3$, $(\gamma^5)^2 = 1$; see (22.18).
$\sigma^{\mu\nu} = \tfrac{1}{2}(\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu)$; see (28.2).
Dirac conjugation is expressed by $\bar\psi = \psi^*\gamma^0$.
The Pauli matrices are $\boldsymbol\sigma = (\sigma_x, \sigma_y, \sigma_z)$, defined in §20.
The 4-spinor indices are $\alpha, \beta, \ldots$ and $\dot\alpha, \dot\beta, \ldots$, taking the values 1, 2 and $\dot 1, \dot 2$.
The bispinor indices are $i, k, l, \ldots$, taking the values 1, 2, 3, 4.

Fourier expansion
Three-dimensional:
$$f(\mathbf{r}) = \int f(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}}\, \frac{d^3k}{(2\pi)^3}, \qquad f(\mathbf{k}) = \int f(\mathbf{r})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3x,$$
and similarly for the four-dimensional expansion.

Units
Except where otherwise specified, relativistic units are used, with $\hbar = 1$, $c = 1$. In these units, the square of the unit charge is $e^2 = 1/137$.
Atomic units have $e = 1$, $\hbar = 1$, $m = 1$. In these units, $c = 137$.
The atomic units of length, time and energy are $\hbar^2/me^2$, $\hbar^3/me^4$ and $me^4/\hbar^2$; the quantity $\mathrm{Ry} = me^4/2\hbar^2$ is called a rydberg.
Ordinary units are given in the absolute (Gaussian) system.

Constants
Velocity of light $c = 2.998 \times 10^{10}$ cm/sec.
Unit charge‡ $|e| = 4.803 \times 10^{-10}$ CGS electrostatic units.
Electron mass $m = 9.11 \times 10^{-28}$ g.
Planck's constant $\hbar = 1.055 \times 10^{-27}$ erg sec.
Fine-structure constant $\alpha = e^2/\hbar c$; $1/\alpha = 137.04$.
Bohr radius $\hbar^2/me^2 = 5.292 \times 10^{-9}$ cm.
Classical electron radius $r_e = e^2/mc^2 = 2.818 \times 10^{-13}$ cm.
Compton wavelength of the electron $\hbar/mc = 3.862 \times 10^{-11}$ cm.
Electron rest energy $mc^2 = 0.511 \times 10^{6}$ eV.
Atomic energy unit $me^4/\hbar^2 = 4.360 \times 10^{-11}$ erg $= 27.21$ eV.
Bohr magneton $|e|\hbar/2mc = 9.274 \times 10^{-21}$ erg/G.
Proton mass $m_p = 1.673 \times 10^{-24}$ g.
Compton wavelength of the proton $\hbar/m_p c = 2.103 \times 10^{-14}$ cm.
Nuclear magneton $|e|\hbar/2m_p c = 5.051 \times 10^{-24}$ erg/G.
Mass ratio of muon and electron $m_\mu/m = 2.068 \times 10^{2}$.

‡ Throughout the book (except in Chapter XIV), $e$ denotes the charge with the appropriate sign, so that $e = -|e|$ for an electron.

References to volumes in the Course of Theoretical Physics:
Mechanics = Vol. 1 (Mechanics, third English edition, 1976).
Fields = Vol. 2 (The Classical Theory of Fields, fourth English edition, 1975).
QM or Quantum Mechanics = Vol. 3 (Quantum Mechanics, third English edition, 1977).
ECM = Vol. 8 (Electrodynamics of Continuous Media, English edition, 1960).
PK = Vol. 10 (Physical Kinetics, English edition, 1981).
All are published by Pergamon Press.

INTRODUCTION

§ 1. The uncertainty principle in the relativistic case

THE quantum theory described in Volume 3 (Quantum Mechanics) is essentially non-relativistic throughout, and is not applicable to phenomena involving motion at velocities comparable with that of light.
At first sight, one might expect that the change to a relativistic theory is possible by a fairly direct generalization of the formalism of non-relativistic quantum mechanics. But further consideration shows that a logically complete relativistic theory cannot be constructed without invoking new physical principles.

Let us recall some of the physical concepts forming the basis of non-relativistic quantum mechanics (QM, §1). We saw that one fundamental concept is that of measurement, by which is meant the process of interaction between a quantum system and a classical object or apparatus, causing the quantum system to acquire definite values of some particular dynamical variables (coordinates, velocities, etc.). We saw also that quantum mechanics greatly restricts the possibility that an electron† simultaneously possesses values of different dynamical variables. For example, the uncertainties $\Delta q$ and $\Delta p$ in simultaneously existing values of the coordinate and the momentum are related by the expression‡ $\Delta q\,\Delta p \sim \hbar$; the greater the accuracy with which one of these quantities is measured, the less the accuracy with which the other can be measured at the same time.

† As in QM, §1, we shall, for brevity, speak of an "electron", meaning any quantum system.
‡ In this section, ordinary units are used.

It is important to note, however, that any of the dynamical variables of the electron can individually be measured with arbitrarily high accuracy, and in an arbitrarily short period of time. This fact is of fundamental importance throughout non-relativistic quantum mechanics. It is the only justification for using the concept of the wave function, which is a basic part of the formalism. The physical significance of the wave function $\psi(q)$ is that the square of its modulus gives the probability of finding a particular value of the electron coordinate as the result of a measurement made at a given instant.
The concept of such a probability clearly requires that the coordinate can in principle be measured with any specified accuracy and rapidity, since otherwise this concept would be purposeless and devoid of physical significance.

The existence of a limiting velocity (the velocity of light, denoted by $c$) leads to new fundamental limitations on the possible measurements of various physical quantities (L. D. Landau and R. E. Peierls, 1930).

In QM, §44, the following relationship has been derived:
$$(v' - v)\,\Delta p\,\Delta t \sim \hbar, \tag{1.1}$$
relating the uncertainty $\Delta p$ in the measurement of the electron momentum and the duration $\Delta t$ of the measurement process itself; $v$ and $v'$ are the velocities of the electron before and after the measurement. From this relationship it follows that a momentum measurement of high accuracy made during a short time (i.e. with $\Delta p$ and $\Delta t$ both small) can occur only if there is a large change in the velocity as a result of the measurement process itself. In the non-relativistic theory, this showed that the measurement of momentum cannot be repeated at short intervals of time, but it did not at all diminish the possibility, in principle, of making a single measurement of the momentum with arbitrarily high accuracy, since the difference $v' - v$ could take any value, no matter how large.

The existence of a limiting velocity, however, radically alters the situation. The difference $v' - v$, like the velocities themselves, cannot now exceed $c$ (or rather $2c$). Replacing $v' - v$ in (1.1) by $c$, we obtain
$$\Delta p\,\Delta t \sim \hbar/c, \tag{1.2}$$
which determines the highest accuracy theoretically attainable when the momentum is measured by a process occupying a given time $\Delta t$. In the relativistic theory, therefore, it is in principle impossible to make an arbitrarily accurate and rapid measurement of the momentum.
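As a rough numerical illustration of (1.2), the sketch below evaluates the order-of-magnitude bound Δp ~ ħ/(cΔt) for a few measurement durations, using the Gaussian-unit constants listed under Notation. The function names and the sample durations are illustrative assumptions and do not come from the text; the momentum mc is used only as a convenient yardstick. Since (1.2) is an order-of-magnitude relation, the printed figures should be read with no more precision than that.

```python
# Constants in Gaussian (CGS) units, as quoted in the Notation section.
HBAR = 1.055e-27   # Planck's constant, erg s
C = 2.998e10       # velocity of light, cm/s
M_E = 9.11e-28     # electron mass, g


def min_momentum_uncertainty(dt):
    """Order-of-magnitude estimate from (1.2): dp ~ hbar / (c * dt)."""
    return HBAR / (C * dt)


def duration_for_uncertainty(dp):
    """Inverse relation: measurement time needed to reach a given dp."""
    return HBAR / (C * dp)


if __name__ == "__main__":
    mc = M_E * C  # yardstick momentum (an illustrative choice)
    for dt in (1e-15, 1e-18, 1e-21):  # hypothetical durations in seconds
        dp = min_momentum_uncertainty(dt)
        print(f"dt = {dt:.0e} s  ->  dp ~ {dp:.3e} g cm/s  ({dp / mc:.3e} mc)")
    # Duration at which the estimated dp reaches mc, i.e. hbar / (m c^2).
    print(f"dt for dp ~ mc: {duration_for_uncertainty(mc):.3e} s")
```

The two functions are the same relation solved for different unknowns, so the shorter the assumed duration, the larger the estimated momentum uncertainty.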
An exact measurement ($\Delta p \to 0$) is possible only in the limit as the duration of the measurement tends to infinity.

There is reason to suppose that the concept of measurability of the electron coordinate itself must also undergo modification. In the mathematical formalism of the theory, this situation is shown by the fact that an accurate measurement of the coordinate is incompatible with the assertion that the energy of a free particle is positive. It will be seen later that the complete set of eigenfunctions of the relativistic wave equation of a free particle includes, as well as solutions having the "correct" time dependence, also solutions having a "negative frequency". These functions will in general appear in the expansion of the wave packet corresponding to an electron localized in a small region of space.

It will be shown that the wave functions having a "negative frequency" correspond to the existence of antiparticles (positrons). The appearance of these functions in the expansion of the wave packet expresses the (in general) inevitable production of electron-positron pairs in the process of measuring the coordinates of an electron. This formation of new particles in a way which cannot be detected by the process itself renders meaningless the measurement of the electron coordinates.

In the rest frame of the electron, the least possible error in the measurement of its coordinates is
$$\Delta q \sim \hbar/mc. \tag{1.3}$$
This value (which purely dimensional arguments show to be the only possible one) corresponds to a momentum uncertainty $\Delta p \sim mc$, which in turn corresponds to the threshold energy for pair production.

In a frame of reference in which the electron is moving with energy $\varepsilon$, (1.3) becomes
$$\Delta q \sim c\hbar/\varepsilon. \tag{1.4}$$
In particular, in the limiting ultra-relativistic case the energy is related to the momentum by $\varepsilon \approx cp$, and
$$\Delta q \sim \hbar/p, \tag{1.5}$$
i.e. the error $\Delta q$ is the same as the de Broglie wavelength of the particle.†

† The measurements in question are those for which any experimental result yields a conclusion about the state of the electron; that is, we are not considering coordinate measurements by means of collisions, when the result does not occur with probability unity during the time of observation. Although the deflection of a measuring-particle in such cases may indicate the position of an electron, the absence of a deflection tells us nothing.

For photons, the ultra-relativistic case always applies, and the expression (1.5) is therefore valid. This means that the coordinates of a photon are meaningful only in cases where the characteristic dimensions of the problem are large in comparison with the wavelength. This is just the "classical" limit, corresponding to geometrical optics, in which radiation can be said to be propagated along definite paths or rays. In the quantum case, however, where the wavelength cannot be regarded as small, the concept of coordinates of the photon has no meaning. We shall see later (§4) that, in the mathematical formalism of the theory, the fact that the photon coordinates cannot be measured is evident because the photon wave function cannot be used to construct a quantity which might serve as a probability density satisfying the necessary conditions of relativistic invariance.

The foregoing discussion suggests that the theory will not consider the time dependence of particle interaction processes. It will show that in these processes there are no characteristics precisely definable (even within the usual limitations of quantum mechanics); the description of such a process as occurring in the course of time is therefore just as unreal as the classical paths are in non-relativistic quantum mechanics. The only observable quantities are the properties (momenta, polarizations) of free particles: the initial particles which come into interaction, and the final particles which result from the process (L. D. Landau and R. E. Peierls, 1930).

A typical problem as formulated in relativistic quantum theory is to determine the probability amplitudes of transitions between specified initial and final states ($t \to \mp\infty$) of a system of particles. The set of such amplitudes between all possible states constitutes the scattering matrix or S-matrix.
This matrix will embody all the information about particle interaction processes that has an observable physical meaning (W. Heisenberg, 1938).

There is as yet no logically consistent and complete relativistic quantum theory. We shall see that the existing theory introduces new physical features into the nature of the description of particle states, which acquires some of the features of field theory (see §10). The theory is, however, largely constructed on the pattern of ordinary quantum mechanics. This structure of the theory has yielded good results in quantum electrodynamics. The lack of complete logical consistency in this theory is shown by the occurrence of divergent expressions when the mathematical formalism is directly applied, although there are quite well-defined ways of eliminating these divergences. Nevertheless, such methods remain, to a considerable extent, semiempirical rules, and our confidence in the correctness of the results is ultimately based only on their excellent agreement with experiment, not on the internal consistency or logical ordering of the fundamental principles of the theory.

CHAPTER I

PHOTONS

§ 2. Quantization of the free electromagnetic field

WITH the purpose of treating the electromagnetic field as a quantum object, it is convenient to begin from a classical description of the field in which it is represented by an infinite but discrete set of variables. This description permits the immediate application of the customary formalism of quantum mechanics.
The representation of the field by means of potentials specified at every point in space is essentially a description by means of a continuous set of variables.

Let $\mathbf{A}(\mathbf{r}, t)$ be the vector potential of the free electromagnetic field, which satisfies the "transversality condition"

$\operatorname{div}\mathbf{A} = 0.$ (2.1)

The scalar potential $\Phi = 0$, and the fields $\mathbf{E}$ and $\mathbf{H}$ are

$\mathbf{E} = -\dot{\mathbf{A}}, \quad \mathbf{H} = \operatorname{curl}\mathbf{A}.$ (2.2)

Maxwell's equations reduce to the wave equation for $\mathbf{A}$:

$\Delta\mathbf{A} - \partial^2\mathbf{A}/\partial t^2 = 0.$ (2.3)

In classical electrodynamics (see Fields, §52) the change to the description by means of a discrete set of variables is brought about by considering the field in a large but finite volume $V$.† The following is a brief résumé of the argument.

† We shall take $V = 1$, in order to reduce the number of factors in the formulae.

The field in a finite volume can be expanded in terms of travelling plane waves, and its potential is then represented by a series

$\mathbf{A} = \sum_{\mathbf{k}} \left(\mathbf{a}_{\mathbf{k}}\, e^{i\mathbf{k}\cdot\mathbf{r}} + \mathbf{a}^{*}_{\mathbf{k}}\, e^{-i\mathbf{k}\cdot\mathbf{r}}\right),$ (2.4)

where the coefficients $\mathbf{a}_{\mathbf{k}}$ are functions of the time such that

$\mathbf{a}_{\mathbf{k}} \sim e^{-i\omega t}, \quad \omega = |\mathbf{k}|.$ (2.5)

The condition (2.1) shows that the complex vectors $\mathbf{a}_{\mathbf{k}}$ are orthogonal to the corresponding wave vectors: $\mathbf{a}_{\mathbf{k}}\cdot\mathbf{k} = 0$.

The summation in (2.4) is taken over an infinite discrete set of values of the wave vector (i.e. of its components $k_x, k_y, k_z$). The change to an integral over a continuous distribution may be made by means of the expression $d^3k/(2\pi)^3$ for the number of possible values of $\mathbf{k}$ belonging to the volume element $d^3k = dk_x\,dk_y\,dk_z$ in $\mathbf{k}$-space.

If the vectors $\mathbf{a}_{\mathbf{k}}$ are specified, the field in the volume considered is completely determined. Thus these quantities may be regarded as a discrete set of classical "field variables". In order to explain the transition to the quantum theory, however, a further transformation of these variables is needed, whereby the field equations take a form analogous to the canonical equations (Hamilton's equations) of classical mechanics.
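The mode-counting expression $d^3k/(2\pi)^3$ quoted above can be checked numerically. The following is a minimal sketch, assuming periodic boundary conditions in a unit cube so that the allowed wave vectors are $\mathbf{k} = 2\pi\mathbf{n}$ with integer $\mathbf{n}$; that assumption and the function names `count_modes` and `continuum_estimate` are illustrative and not part of the text.

```python
import itertools
import math


def count_modes(n_max: int) -> int:
    """Count wave vectors k = 2*pi*n (n an integer triple) with |k| <= 2*pi*n_max.

    Assumes periodic boundary conditions in a cube of volume V = 1.
    """
    count = 0
    values = range(-n_max, n_max + 1)
    for nx, ny, nz in itertools.product(values, repeat=3):
        if nx * nx + ny * ny + nz * nz <= n_max * n_max:
            count += 1
    return count


def continuum_estimate(n_max: int) -> float:
    """Volume of the sphere |k| <= 2*pi*n_max multiplied by the density 1/(2*pi)^3."""
    k_max = 2.0 * math.pi * n_max
    return (4.0 / 3.0) * math.pi * k_max**3 / (2.0 * math.pi) ** 3


if __name__ == "__main__":
    for n in (5, 10, 20):
        exact = count_modes(n)
        approx = continuum_estimate(n)
        print(n, exact, round(approx, 1), round(exact / approx, 4))
```

The ratio of the discrete count to the continuum estimate should approach unity as the cut-off grows, which is the sense in which the sum over $\mathbf{k}$ may be replaced by an integral.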
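The canonical field variables are defined by

$\mathbf{Q}_{\mathbf{k}} = \frac{1}{\sqrt{4\pi}}\left(\mathbf{a}_{\mathbf{k}} + \mathbf{a}^{*}_{\mathbf{k}}\right), \quad \mathbf{P}_{\mathbf{k}} = -\frac{i\omega}{\sqrt{4\pi}}\left(\mathbf{a}_{\mathbf{k}} - \mathbf{a}^{*}_{\mathbf{k}}\right) = \dot{\mathbf{Q}}_{\mathbf{k}},$ (2.6)

and are evidently real. The vector potential is expressed in terms of the canonical variables by

$\mathbf{A} = \sqrt{4\pi}\,\sum_{\mathbf{k}} \left(\mathbf{Q}_{\mathbf{k}}\cos\mathbf{k}\cdot\mathbf{r} - \frac{1}{\omega}\,\mathbf{P}_{\mathbf{k}}\sin\mathbf{k}\cdot\mathbf{r}\right).$ (2.7)

To find the Hamiltonian $H$, we must calculate the total energy of the field,

$\frac{1}{8\pi}\int\left(\mathbf{E}^2 + \mathbf{H}^2\right)d^3x,$

and express it in terms of the $\mathbf{Q}_{\mathbf{k}}$ and $\mathbf{P}_{\mathbf{k}}$. When $\mathbf{A}$ is written as the expansion (2.7), and $\mathbf{E}$ and $\mathbf{H}$ are found from (2.2), the result of the integration is

$H = \tfrac{1}{2}\sum_{\mathbf{k}}\left(\mathbf{P}_{\mathbf{k}}^2 + \omega^2\mathbf{Q}_{\mathbf{k}}^2\right).$

Each of the vectors $\mathbf{P}_{\mathbf{k}}$ and $\mathbf{Q}_{\mathbf{k}}$ is perpendicular to the wave vector $\mathbf{k}$, and therefore has two independent components. The direction of these vectors determines the direction of polarization of the corresponding wave. Denoting the two components of the vectors $\mathbf{Q}_{\mathbf{k}}$ and $\mathbf{P}_{\mathbf{k}}$ (in the plane perpendicular to $\mathbf{k}$) by $Q_{\mathbf{k}\alpha}$, $P_{\mathbf{k}\alpha}$ ($\alpha = 1, 2$), we can write the Hamiltonian as

$H = \sum_{\mathbf{k},\alpha}\tfrac{1}{2}\left(P_{\mathbf{k}\alpha}^2 + \omega^2 Q_{\mathbf{k}\alpha}^2\right).$ (2.8)

Thus the Hamiltonian is the sum of independent terms, each of which contains only one pair of quantities $Q_{\mathbf{k}\alpha}$, $P_{\mathbf{k}\alpha}$. Each such term corresponds to a travelling wave with a definite wave vector and polarization, and has the form of the Hamiltonian for a one-dimensional harmonic oscillator. This expansion is therefore often referred to as an oscillator expansion of the field.

Let us now consider the quantization of the free electromagnetic field. The classical description of the field given above makes the manner of transition to the quantum theory obvious. We have now to use canonical variables (generalized coordinates $Q_{\mathbf{k}\alpha}$ and generalized momenta $P_{\mathbf{k}\alpha}$) as operators, with the commutation rule

$\hat{P}_{\mathbf{k}\alpha}\hat{Q}_{\mathbf{k}\alpha} - \hat{Q}_{\mathbf{k}\alpha}\hat{P}_{\mathbf{k}\alpha} = -i;$ (2.9)

operators with different values of $\mathbf{k}$ and $\alpha$ always commute. The potential $\mathbf{A}$ and, according to (2.2), the fields $\mathbf{E}$ and $\mathbf{H}$ likewise become (Hermitian) operators.

The consistent determination of the Hamiltonian requires the calculation of the integral

$\hat{H} = \frac{1}{8\pi}\int\left(\hat{\mathbf{E}}^2 + \hat{\mathbf{H}}^2\right)d^3x,$ (2.10)

in which $\hat{\mathbf{E}}$ and $\hat{\mathbf{H}}$ are expressed in terms of $\hat{P}_{\mathbf{k}\alpha}$ and $\hat{Q}_{\mathbf{k}\alpha}$.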
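However, the fact that the latter do not commute is actually unimportant, since the products $\hat{Q}_{\mathbf{k}\alpha}\hat{P}_{\mathbf{k}\alpha}$ appear multiplied by $\cos\mathbf{k}\cdot\mathbf{r}\,\sin\mathbf{k}\cdot\mathbf{r}$, which becomes zero on integration over the whole volume. The resulting expression for the Hamiltonian is therefore

$\hat{H} = \sum_{\mathbf{k},\alpha}\tfrac{1}{2}\left(\hat{P}_{\mathbf{k}\alpha}^2 + \omega^2\hat{Q}_{\mathbf{k}\alpha}^2\right),$ (2.11)

which is, as we might have expected, exactly the same in form as the classical Hamiltonian. The determination of the eigenvalues of this Hamiltonian involves no further calculation, since it is equivalent to the familiar problem of the energy levels of linear oscillators (QM, §23). We can therefore immediately write down the field energy levels:

$E = \sum_{\mathbf{k},\alpha}\left(N_{\mathbf{k}\alpha} + \tfrac{1}{2}\right)\omega,$ (2.12)

where the $N_{\mathbf{k}\alpha}$ are integers.

The further discussion of this formula will be left until §3; here we shall write out the matrix elements of the quantities $Q_{\mathbf{k}\alpha}$, which can be done at once by means of the known formulae for the matrix elements of the coordinates of an oscillator (see QM, §23). The non-zero matrix elements are

$\langle N_{\mathbf{k}\alpha}|Q_{\mathbf{k}\alpha}|N_{\mathbf{k}\alpha} - 1\rangle = \langle N_{\mathbf{k}\alpha} - 1|Q_{\mathbf{k}\alpha}|N_{\mathbf{k}\alpha}\rangle = \sqrt{N_{\mathbf{k}\alpha}/2\omega}.$ (2.13)

The matrix elements of the quantities $P_{\mathbf{k}\alpha} = \dot{Q}_{\mathbf{k}\alpha}$ differ from those of $Q_{\mathbf{k}\alpha}$ only by a factor $\pm i\omega$.

In subsequent calculations, however, it will be more convenient to replace the quantities $Q_{\mathbf{k}\alpha}$ and $P_{\mathbf{k}\alpha}$ by the linear combinations $\omega Q_{\mathbf{k}\alpha} \pm iP_{\mathbf{k}\alpha}$, which have non-zero matrix elements only for transitions $N_{\mathbf{k}\alpha} \to N_{\mathbf{k}\alpha} \pm 1$. We therefore define the operators

$\hat{c}_{\mathbf{k}\alpha} = \frac{1}{\sqrt{2\omega}}\left(\omega\hat{Q}_{\mathbf{k}\alpha} + i\hat{P}_{\mathbf{k}\alpha}\right), \quad \hat{c}^{+}_{\mathbf{k}\alpha} = \frac{1}{\sqrt{2\omega}}\left(\omega\hat{Q}_{\mathbf{k}\alpha} - i\hat{P}_{\mathbf{k}\alpha}\right);$ (2.14)

the classical quantities $c_{\mathbf{k}\alpha}$, $c^{*}_{\mathbf{k}\alpha}$ are the same, apart from a factor $\sqrt{2\pi/\omega}$, as the coefficients $\mathbf{a}_{\mathbf{k}\alpha}$, $\mathbf{a}^{*}_{\mathbf{k}\alpha}$ in the expansion (2.4). The matrix elements of these operators are

$\langle N_{\mathbf{k}\alpha} - 1|\hat{c}_{\mathbf{k}\alpha}|N_{\mathbf{k}\alpha}\rangle = \langle N_{\mathbf{k}\alpha}|\hat{c}^{+}_{\mathbf{k}\alpha}|N_{\mathbf{k}\alpha} - 1\rangle = \sqrt{N_{\mathbf{k}\alpha}}.$ (2.15)

The commutation rule for $\hat{c}_{\mathbf{k}\alpha}$ and $\hat{c}^{+}_{\mathbf{k}\alpha}$ is obtained by using the definitions (2.14) and the rule (2.9):

$\hat{c}_{\mathbf{k}\alpha}\hat{c}^{+}_{\mathbf{k}\alpha} - \hat{c}^{+}_{\mathbf{k}\alpha}\hat{c}_{\mathbf{k}\alpha} = 1.$ (2.16)

For the vector potential, we return to an expansion of the type (2.4), but with operator coefficients, writing it in the form

$\hat{\mathbf{A}} = \sum_{\mathbf{k},\alpha}\left(\hat{c}_{\mathbf{k}\alpha}\mathbf{A}_{\mathbf{k}\alpha} + \hat{c}^{+}_{\mathbf{k}\alpha}\mathbf{A}^{*}_{\mathbf{k}\alpha}\right),$ (2.17)

where

$\mathbf{A}_{\mathbf{k}\alpha} = \sqrt{4\pi}\,\frac{\mathbf{e}^{(\alpha)}}{\sqrt{2\omega}}\,e^{i\mathbf{k}\cdot\mathbf{r}}.$ (2.18)

The symbol $\mathbf{e}^{(\alpha)}$ denotes the unit vectors in the direction of polarization of the oscillators; these vectors are perpendicular to the wave vector $\mathbf{k}$, and for every $\mathbf{k}$ there are two independent polarizations.

Similarly, for the operators $\hat{\mathbf{E}}$ and $\hat{\mathbf{H}}$ we write

$\hat{\mathbf{E}} = \sum_{\mathbf{k},\alpha}\left(\hat{c}_{\mathbf{k}\alpha}\mathbf{E}_{\mathbf{k}\alpha} + \hat{c}^{+}_{\mathbf{k}\alpha}\mathbf{E}^{*}_{\mathbf{k}\alpha}\right), \quad \hat{\mathbf{H}} = \sum_{\mathbf{k},\alpha}\left(\hat{c}_{\mathbf{k}\alpha}\mathbf{H}_{\mathbf{k}\alpha} + \hat{c}^{+}_{\mathbf{k}\alpha}\mathbf{H}^{*}_{\mathbf{k}\alpha}\right),$ (2.19)

with

$\mathbf{E}_{\mathbf{k}\alpha} = i\omega\mathbf{A}_{\mathbf{k}\alpha}, \quad \mathbf{H}_{\mathbf{k}\alpha} = \mathbf{n}\times\mathbf{E}_{\mathbf{k}\alpha}, \quad \mathbf{n} = \mathbf{k}/\omega.$ (2.20)

The vectors $\mathbf{A}_{\mathbf{k}\alpha}$ are mutually orthogonal, in the sense that

$\int\mathbf{A}_{\mathbf{k}\alpha}\cdot\mathbf{A}^{*}_{\mathbf{k}'\alpha'}\,d^3x = \frac{2\pi}{\omega}\,\delta_{\alpha\alpha'}\,\delta_{\mathbf{k}\mathbf{k}'}.$ (2.21)

For, if $\mathbf{A}_{\mathbf{k}\alpha}$ and $\mathbf{A}^{*}_{\mathbf{k}'\alpha'}$ belong to different wave vectors, then their product contains a factor $e^{i(\mathbf{k}-\mathbf{k}')\cdot\mathbf{r}}$, which gives zero on integration over the volume; if they differ only in polarization, $\mathbf{e}^{(\alpha)}\cdot\mathbf{e}^{(\alpha')*} = 0$, since the two independent directions of polarization are mutually orthogonal. Similar arguments apply to the vectors $\mathbf{E}_{\mathbf{k}\alpha}$ and $\mathbf{H}_{\mathbf{k}\alpha}$. They are conveniently normalized by imposing the condition

$\frac{1}{4\pi}\int\left(\mathbf{E}_{\mathbf{k}\alpha}\cdot\mathbf{E}^{*}_{\mathbf{k}'\alpha'} + \mathbf{H}_{\mathbf{k}\alpha}\cdot\mathbf{H}^{*}_{\mathbf{k}'\alpha'}\right)d^3x = \omega\,\delta_{\mathbf{k}\mathbf{k}'}\,\delta_{\alpha\alpha'}.$ (2.22)

Substituting the operators (2.19) in (2.10), and carrying out the integration by means of (2.22), we obtain the field Hamiltonian expressed in terms of the operators $\hat{c}$, $\hat{c}^{+}$:

$\hat{H} = \sum_{\mathbf{k},\alpha}\tfrac{1}{2}\,\omega\left(\hat{c}_{\mathbf{k}\alpha}\hat{c}^{+}_{\mathbf{k}\alpha} + \hat{c}^{+}_{\mathbf{k}\alpha}\hat{c}_{\mathbf{k}\alpha}\right).$ (2.23)

This operator is diagonal in the representation considered (the matrix elements of the operators $\hat{c}$ and $\hat{c}^{+}$ being given by (2.15)), and its eigenvalues are of course (2.12).

In the classical theory, the field momentum is defined as the integral

$\mathbf{P} = \frac{1}{4\pi}\int\mathbf{E}\times\mathbf{H}\,d^3x.$

In changing to the quantum theory, we replace $\mathbf{E}$ and $\mathbf{H}$ by the operators (2.19), and thus easily find

$\hat{\mathbf{P}} = \sum_{\mathbf{k},\alpha}\tfrac{1}{2}\left(\hat{P}_{\mathbf{k}\alpha}^2 + \omega^2\hat{Q}_{\mathbf{k}\alpha}^2\right)\mathbf{n},$ (2.24)

in agreement with the familiar classical relationship between the energy and momentum of plane waves. The eigenvalues of this operator are

$\mathbf{P} = \sum_{\mathbf{k},\alpha}\mathbf{k}\left(N_{\mathbf{k}\alpha} + \tfrac{1}{2}\right).$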
(2.25)

The representation of operators by means of the matrix elements (2.15) is the "occupation number representation", corresponding to the description of the state of a system (the field) by specifying the quantum numbers $N_{\mathbf{k}\alpha}$ (the occupation numbers). In this representation the field operators (2.19), and therefore the Hamiltonian (2.11), act on the wave function of the system, expressed in terms of the numbers $N_{\mathbf{k}\alpha}$; let this be $\Phi(N_{\mathbf{k}\alpha}, t)$. The field operators (2.19) are not explicit functions of the time. This corresponds to the customary Schrödinger representation of operators in non-relativistic quantum mechanics. The state of the system, $\Phi(N_{\mathbf{k}\alpha}, t)$, does depend on the time, and this dependence is governed by Schrödinger's equation, $i\,\partial\Phi/\partial t = \hat{H}\Phi$.

This description of the field is, by its nature, relativistically invariant, since it is based on the invariant Maxwell's equations. But this invariance is not explicitly shown, primarily because the space coordinates and the time appear in the description in a highly asymmetric manner.

In relativistic theory, it is convenient to put the description in a form which is more obviously invariant. To do so, we must use what is called the Heisenberg representation, in which the explicit time dependence is transferred to the operators themselves (see QM, §13). Then the time and the coordinates will appear on an equal footing in the expressions for the field operators, and the state of the system, $\Phi$, will depend only on the occupation numbers.

For the operator $\hat{\mathbf{A}}$, the change to the Heisenberg representation amounts to replacing the factor $e^{i\mathbf{k}\cdot\mathbf{r}}$ in each term of the sum (2.17) by $e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}$, i.e. to regarding the $\mathbf{A}_{\mathbf{k}\alpha}$ as the time-dependent functions

$\mathbf{A}_{\mathbf{k}\alpha} = \sqrt{4\pi}\,\frac{\mathbf{e}^{(\alpha)}}{\sqrt{2\omega}}\,e^{-i(\omega t - \mathbf{k}\cdot\mathbf{r})}.$
(2.26)

This is easily proved by noticing that the matrix element of the Heisenberg operator for the transition $i \to f$ must include a factor $\exp\{-i(E_i - E_f)t\}$, where $E_i$ and $E_f$ are the energies of the initial and final states (see QM, §13). For a transition in which $N_{\mathbf{k}}$ decreases or increases by 1, this factor becomes $e^{-i\omega t}$ or $e^{i\omega t}$ respectively, a condition which is satisfied by effecting the change mentioned above.

Henceforward, in discussing both the electromagnetic field and particle fields, we shall always assume that the Heisenberg representation of operators is used.

§3. Photons

We shall now further analyse the field quantization formulae obtained in §2. First of all, formula (2.12) for the field energy raises the following difficulty. The lowest energy level of the field corresponds to the case where the quantum numbers $N_{\mathbf{k}\alpha}$ of all the oscillators are zero; this is called the electromagnetic field vacuum state. But, even in that state, each oscillator has a non-zero "zero-point energy" equal to $\tfrac{1}{2}\omega$. Summation over an infinite number of oscillators then gives an infinite result. Thus we meet with one of the "divergences" which are due to the fact that the present theory is not logically complete and consistent.

So long as only the field energy eigenvalues are under discussion, we can remove this difficulty by simply striking out the zero-point oscillation energy, i.e. by writing the field energy and momentum as†

$E = \sum_{\mathbf{k},\alpha} N_{\mathbf{k}\alpha}\,\omega, \quad \mathbf{P} = \sum_{\mathbf{k},\alpha} N_{\mathbf{k}\alpha}\,\mathbf{k}.$ (3.1)

† This procedure can be formally carried out without contradiction if we agree to regard the products of operators in (2.10) as "normal" products, that is, as products in which the operators $\hat{c}^{+}$ are always placed to the left of the operators $\hat{c}$.
Then formula (2.23) becomes $\hat{H} = \sum_{\mathbf{k},\alpha}\omega\,\hat{c}^{+}_{\mathbf{k}\alpha}\hat{c}_{\mathbf{k}\alpha}$.

These formulae enable us to introduce the concept of radiation quanta or photons, which is fundamental throughout quantum electrodynamics.† We may regard the free electromagnetic field as an ensemble of particles each with energy $\omega$ ($= \hbar\omega$) and momentum $\mathbf{k}$ ($= \mathbf{n}\hbar\omega/c$). The relationship between the photon energy and momentum is as it should be in relativistic mechanics for particles having zero rest-mass and moving with the velocity of light. The occupation numbers $N_{\mathbf{k}\alpha}$ now represent the numbers of photons having given momentum $\mathbf{k}$ and polarization $\mathbf{e}^{(\alpha)}$. The polarization of the photon is analogous to the spin of other particles; the exact properties of the photon in this respect will be discussed in §6 below.

It is easily seen that the whole of the mathematical formalism developed in §2 is fully in accordance with the representation of the electromagnetic field as an ensemble of photons; it is just the second quantization formalism, applied to the system of photons.‡ In this treatment (see QM, §64), the independent variables are the occupation numbers of the states, and the operators act on functions of these numbers. The particle "annihilation" and "creation" operators are of basic importance; they respectively decrease and increase by one the occupation numbers. The $\hat{c}_{\mathbf{k}\alpha}$ and $\hat{c}^{+}_{\mathbf{k}\alpha}$ are operators of this kind: $\hat{c}_{\mathbf{k}\alpha}$ annihilates a photon in the state $\mathbf{k}, \alpha$, and $\hat{c}^{+}_{\mathbf{k}\alpha}$ creates a photon in that state.

The commutation rule (2.16) corresponds to particles which obey Bose statistics. Photons, therefore, are bosons, as was to be expected, since the number of photons that can be in any one state must be unrestricted. The significance of this will be further discussed in §5.

The plane waves $\mathbf{A}_{\mathbf{k}\alpha}$ (2.26) which appear in the operator $\hat{\mathbf{A}}$ (2.17) as coefficients of the photon annihilation operators may be treated as the wave functions of photons having given momenta $\mathbf{k}$ and polarizations $\mathbf{e}^{(\alpha)}$.
This corresponds to an expansion of the $\psi$-operator in terms of the wave functions of stationary states of a particle in the non-relativistic second quantization formalism; however, unlike the latter, the expansion (2.17) includes both particle annihilation and particle creation operators. The meaning of this difference is explained in §12.

The wave function (2.26) is normalized by the condition

$\frac{1}{4\pi}\int\left(|\mathbf{E}_{\mathbf{k}\alpha}|^2 + |\mathbf{H}_{\mathbf{k}\alpha}|^2\right)d^3x = \omega.$ (3.2)

This is the normalization to "one photon in the volume $V = 1$": the integral on the left is the quantum-mechanical mean value of the photon energy in the state having the given wave function.§ The right-hand side of (3.2) is just the energy of a single photon.

The "Schrödinger's equation" for the photon is represented by Maxwell's equations. In the present case (when the potential $\mathbf{A}(\mathbf{r}, t)$ satisfies the condition (2.1)), this leads to the wave equation:

$\partial^2\mathbf{A}/\partial t^2 - \Delta\mathbf{A} = 0.$

† This concept is originally due to A. Einstein (1905).

‡ The application of the second quantization method to the theory of radiation was first worked out by P. A. M. Dirac (1927).

§ It should be noted that the factor $1/4\pi$ in the integral (3.2) is twice the usual factor $1/8\pi$ (2.10). This is ultimately due to the fact that the vectors $\mathbf{E}_{\mathbf{k}\alpha}$, $\mathbf{H}_{\mathbf{k}\alpha}$ are complex, whereas the field operators $\hat{\mathbf{E}}$, $\hat{\mathbf{H}}$ are Hermitian.

The "wave functions" of the photon, in the general case of arbitrary stationary states, are complex solutions of this equation, whose time dependence is given by the factor $e^{-i\omega t}$.

In referring to the photon wave function, we must again emphasize that this cannot be regarded as the probability amplitude of the spatial localization of the photon, in contrast to the fundamental significance of the wave function in non-relativistic quantum mechanics. This is because, as has been shown in §1, the concept of the coordinates of the photon has no physical meaning.
The mathematical aspect of this situation will be further discussed at the end of §4.

The components of the Fourier expansion of the function $\mathbf{A}(\mathbf{r}, t)$ with respect to the coordinates form the wave function of the photon in the momentum representation; we denote this by $\mathbf{A}(\mathbf{k}, t) = \mathbf{A}(\mathbf{k})\,e^{-i\omega t}$. For example, in a state with a given momentum $\mathbf{k}$ and polarization $\mathbf{e}^{(\alpha)}$, the wave function in the momentum representation is given simply by the coefficient of the exponential factor in (2.26):

$\mathbf{A}_{\mathbf{k}\alpha}(\mathbf{k}', \alpha') = \sqrt{4\pi}\,\frac{\mathbf{e}^{(\alpha)}}{\sqrt{2\omega}}\,\delta_{\mathbf{k}'\mathbf{k}}\,\delta_{\alpha'\alpha}.$ (3.3)

Since the momentum of a free particle is measurable, the wave function in the momentum representation has a more profound physical significance than that in the coordinate representation: it enables us to calculate the probabilities $w_{\mathbf{k}\alpha}$ of various values of the momentum and polarization of a photon in a specified state. According to the general rules of quantum mechanics, $w_{\mathbf{k}\alpha}$ is given by the square of the modulus of the corresponding coefficient in the expansion of the function $\mathbf{A}(\mathbf{k}')$ in terms of the wave functions of states with given $\mathbf{k}$ and $\mathbf{e}^{(\alpha)}$:

$w_{\mathbf{k}\alpha} \propto \Bigl|\sum_{\mathbf{k}',\alpha'}\mathbf{A}^{*}_{\mathbf{k}\alpha}(\mathbf{k}', \alpha')\cdot\mathbf{A}(\mathbf{k}')\Bigr|^2,$

the proportionality coefficient depending on the way in which the functions are normalized. Substitution of (3.3) gives

$w_{\mathbf{k}\alpha} \propto |\mathbf{e}^{(\alpha)*}\cdot\mathbf{A}(\mathbf{k})|^2.$ (3.4)

Summation over the two polarizations gives the probability that the photon momentum is $\mathbf{k}$:

$w_{\mathbf{k}} \propto |\mathbf{A}(\mathbf{k})|^2.$ (3.5)

§4. Gauge invariance

The field potential in classical electrodynamics is well known to be subject to an arbitrary choice: the components of the 4-potential $A_\mu$ can undergo any gauge transformation of the form

$A_\mu \to A_\mu + \partial_\mu\chi,$ (4.1)

where $\chi$ is any function of coordinates and time (see Fields, §18).
For a plane wave, if we consider only transformations which do not change the form of the potential (proportional to $\exp(-ik_\mu x^\mu)$), the freedom of choice reduces to the possibility of adding to the wave amplitude any 4-vector proportional to $k^\mu$.

This arbitrariness in the potential persists in the quantum theory, of course, where it relates to the field operators or to the wave functions of photons. In order not to prejudice the choice of the potentials, we must replace (2.17) by the corresponding expansion for the operator 4-potential,

$\hat{A}^\mu = \sum_{\mathbf{k},\alpha}\left(\hat{c}_{\mathbf{k}\alpha}A^\mu_{\mathbf{k}\alpha} + \hat{c}^{+}_{\mathbf{k}\alpha}A^{\mu*}_{\mathbf{k}\alpha}\right),$ (4.2)

where the wave functions $A^\mu_{\mathbf{k}\alpha}$ are 4-vectors of the form

$A^\mu_{\mathbf{k}} = \sqrt{4\pi}\,\frac{e^\mu}{\sqrt{2\omega}}\,e^{-ik_\nu x^\nu}, \quad e_\mu e^{\mu*} = -1,$

or more concisely, omitting the four-dimensional vector indices,

$A_{\mathbf{k}} = \sqrt{4\pi}\,\frac{e}{\sqrt{2\omega}}\,e^{-ikx}, \quad ee^{*} = -1.$ (4.3)

Here the 4-momentum $k^\mu = (\omega, \mathbf{k})$ (and so $kx = \omega t - \mathbf{k}\cdot\mathbf{r}$), and $e$ is the unit polarization 4-vector.†

† The expression (4.3) is not in a fully relativistic-covariant (4-vector) form; this is because the normalization to a finite volume $V = 1$, used here, is not invariant. This is, however, of no fundamental importance, and is entirely compensated by the advantages of the normalization used. We shall see later that it allows a simple and straightforward deduction of actual physical quantities in the necessary invariant form.

If we consider only gauge transformations which do not alter the dependence of the function (4.3) on the coordinates and the time, the transformation must be

$e_\mu \to e_\mu + \chi k_\mu,$ (4.4)

where $\chi = \chi(k^\mu)$ is an arbitrary function. Since the polarization is transverse, it is always possible to choose a gauge such that the 4-vector $e$ is

$e^\mu = (0, \mathbf{e}), \quad \mathbf{e}\cdot\mathbf{k} = 0;$ (4.5)

this will be called the three-dimensionally transverse gauge. In invariant four-dimensional form, this condition becomes the condition of four-dimensional transversality

$ek = 0.$ (4.6)

It should be noticed that this condition (like the normalization condition $ee^{*} = -1$) is preserved by the transformation (4.4), since $k^2 = 0$.
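The statement that the transformation (4.4) preserves both (4.6) and the normalization $ee^{*} = -1$ can be illustrated numerically. The sketch below assumes the metric signature $(+,-,-,-)$ and an arbitrarily chosen circularly polarized photon moving along the z-axis; the helper names `dot4`, `photon_vectors` and `gauge_shift` are hypothetical and introduced only for this illustration.

```python
import numpy as np

# Metric with signature (+, -, -, -); an assumption consistent with kx = wt - k.r
METRIC = np.diag([1.0, -1.0, -1.0, -1.0])


def dot4(a, b):
    """Four-dimensional product a_mu b^mu (no complex conjugation is applied)."""
    return a @ METRIC @ b


def photon_vectors(k3, e3):
    """Build k^mu = (|k|, k) and e^mu = (0, e) in the three-dimensionally transverse gauge."""
    k3 = np.asarray(k3, dtype=float)
    e3 = np.asarray(e3, dtype=complex)
    k4 = np.concatenate(([np.linalg.norm(k3)], k3)).astype(complex)
    e4 = np.concatenate(([0.0], e3))
    return k4, e4


def gauge_shift(e4, k4, chi):
    """Apply e_mu -> e_mu + chi * k_mu, as in (4.4)."""
    return e4 + chi * k4


if __name__ == "__main__":
    k4, e4 = photon_vectors([0.0, 0.0, 2.0], [1 / np.sqrt(2), 1j / np.sqrt(2), 0.0])
    e_new = gauge_shift(e4, k4, 0.7 - 0.3j)
    print("k^2      =", dot4(k4, k4))
    print("e'k      =", dot4(e_new, k4))
    print("e'e'*    =", dot4(e_new, np.conj(e_new)))
```

Both invariants survive only because the extra terms are proportional to $ek$ or to $k^2$; a vector with $k^2 \neq 0$ would spoil them.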
If the square of the 4-momentum of a particle is zero, its mass must also be zero. This demonstrates the relationship between gauge invariance and the zero mass of the photon. Other aspects of the relationship will be discussed in §14.

There can be no change in any measurable physical quantities under a gauge transformation of the wave functions of photons concerned in a process. In quantum electrodynamics this requirement of gauge invariance is of even greater importance than in the classical theory. We shall see many examples of the fact that gauge invariance is here, like relativistic invariance, a valuable heuristic principle.

Gauge invariance is, in turn, closely related to the law of conservation of electric charge. This aspect will be discussed in §43.

It has already been mentioned in §3 that the coordinate wave function of the photon cannot be interpreted as the probability amplitude of its spatial localization. Mathematically, this is shown by the impossibility of constructing from the wave function any quantity which has even the formal properties of a probability density. Such a quantity would have to be expressed as a positive-definite bilinear combination of the wave function $A_\mu$ and its complex conjugate. Moreover, it would have to satisfy certain conditions of relativistic covariance by being the time component of a 4-vector. This is because the continuity equation, which expresses the conservation of the number of particles, is given in four-dimensional form by the vanishing of the divergence of the current 4-vector. The time component of the current is here the particle localization probability density; see Fields, §29. On the other hand, by the condition of gauge invariance, the 4-vector $A_\mu$ could appear in the current only as the antisymmetric tensor $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu = -i(k_\mu A_\nu - k_\nu A_\mu)$. Thus the current 4-vector would have to be a bilinear combination of $F_{\mu\nu}$ and $F^{*}_{\mu\nu}$ (and the components of the 4-vector $k_\mu$).
But such a 4-vector cannot be formed, since every expression (such as $k_\lambda F^{*}_{\mu\nu}F^{\nu\lambda}$) which satisfies the conditions stated is zero by the transversality condition ($k_\lambda F^{\nu\lambda} = 0$), and in any case could not be positive-definite, since it contains odd powers of the components $k_\mu$.

§5. The electromagnetic field in quantum theory

The description of the field as an ensemble of photons is the only description that fully accords with the physical significance of the electromagnetic field in quantum theory. It replaces the classical description in terms of field strengths. These appear in the mathematical formalism of the photon picture as second quantization operators.

The properties of a quantum system are known to be similar to the classical properties when the quantum numbers defining the stationary states of the system are large. For a free electromagnetic field (in a given volume) this means that the oscillator quantum numbers, i.e. the photon numbers $N_{\mathbf{k}\alpha}$, must be large. In this respect the fact that photons obey Bose statistics is of great importance. In the mathematical formalism of the theory, the relationship of the Bose statistics to the properties of the classical field is shown by the commutation rules for the operators $\hat{c}_{\mathbf{k}\alpha}$, $\hat{c}^{+}_{\mathbf{k}\alpha}$. When the $N_{\mathbf{k}\alpha}$ are large, and the matrix elements of these operators are therefore large also, we may neglect unity on the right-hand side of the commutation rule (2.16), obtaining

$\hat{c}_{\mathbf{k}\alpha}\hat{c}^{+}_{\mathbf{k}\alpha} \approx \hat{c}^{+}_{\mathbf{k}\alpha}\hat{c}_{\mathbf{k}\alpha};$

these operators thus become the commuting classical quantities $c_{\mathbf{k}\alpha}$ and $c^{*}_{\mathbf{k}\alpha}$, which determine the classical field strengths.

The condition for the field to be quasi-classical needs to be made more precise, however, since, if all the numbers $N_{\mathbf{k}\alpha}$ are large, the energy of the field is certainly infinite on summation over all the states $\mathbf{k}, \alpha$, and the condition then becomes meaningless.
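The way in which unity becomes negligible in the commutation rule (2.16) for large occupation numbers can be made concrete with truncated matrices. The following is a minimal sketch for a single oscillator, assuming a basis cut off at a finite number of photons (so the last diagonal entry of the commutator is a truncation artifact); the function names `annihilation` and `ordering_ratio` are illustrative only.

```python
import numpy as np


def annihilation(dim: int) -> np.ndarray:
    """Matrix of c in the basis |0>, ..., |dim-1>, with <N-1|c|N> = sqrt(N) as in (2.15)."""
    return np.diag(np.sqrt(np.arange(1, dim)), k=1)


def ordering_ratio(dim: int, n: int) -> float:
    """Ratio <n|c c+|n> / <n|c+ c|n> for 1 <= n < dim - 1; equals (n + 1) / n."""
    c = annihilation(dim)
    c_dag = c.T  # the matrix is real, so the Hermitian conjugate is the transpose
    return (c @ c_dag)[n, n] / (c_dag @ c)[n, n]


if __name__ == "__main__":
    dim = 2001
    c = annihilation(dim)
    commutator = c @ c.T - c.T @ c
    # All diagonal entries except the last (a truncation artifact) equal 1, as in (2.16).
    print("commutator diagonal (first five):", np.diag(commutator)[:5])
    for n in (1, 10, 1000):
        print(n, ordering_ratio(dim, n))
```

For occupation numbers of order unity the two orderings differ by a factor of order unity, whereas for $N \sim 1000$ they differ by about one part in a thousand, which is the sense in which the operators may be replaced by commuting classical amplitudes.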
A physically meaningful statement of the problem as to the conditions for a quasi-classical field can be based on a consideration of values of the field averaged over some short time interval $\Delta t$. If the classical electric field $\mathbf{E}$ (or magnetic field $\mathbf{H}$) is represented as a Fourier integral expansion with respect to the time, then, when it is averaged over the time interval $\Delta t$, only those Fourier components whose frequencies are such that $\omega\Delta t \lesssim 1$ will make a significant contribution to the mean value $\bar{\mathbf{E}}$, since otherwise the oscillating factor $e^{-i\omega t}$ almost vanishes on averaging. Thus, in determining the condition for the averaged field to be quasi-classical, we need consider only those quantum oscillators whose frequency $\omega \lesssim 1/\Delta t$. It is sufficient that the quantum numbers of these oscillators should be large.

The number of oscillators having frequencies between zero and $\omega \sim 1/\Delta t$ (for a volume $V = 1$) is, in order of magnitude,†

$(\omega/c)^3 \sim 1/(c\,\Delta t)^3.$ (5.1)

The total field energy per unit volume is proportional to $\bar{\mathbf{E}}^2$. Dividing this by the number of oscillators and by some mean value of the energy of a single photon ($\sim\hbar\omega$), we find as the order of magnitude of the numbers of photons

$N_{\mathbf{k}} \sim \bar{\mathbf{E}}^2 c^3/\hbar\omega^4.$

With the condition that this number should be large, we obtain the inequality

$|\bar{\mathbf{E}}| \gg \sqrt{\hbar c}\,/(c\,\Delta t)^2.$ (5.2)

This is the required condition, which allows the field averaged over time intervals $\Delta t$ to be treated as classical. We see that the field must reach a certain strength, which increases as the averaging time $\Delta t$ decreases. For variable fields, this time must not, of course, exceed the time during which the field changes appreciably. Thus variable fields, if sufficiently weak, can never be quasi-classical. Only for static (time-independent) fields can we make $\Delta t \to \infty$, so that the right-hand side of the inequality (5.2) tends to zero. Thus a static field is always classical.

† In this section, ordinary units are used.
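As a rough numerical illustration of the condition (5.2), the right-hand side can be evaluated in Gaussian units for a chosen averaging time. The sketch below is an order-of-magnitude estimate only, in keeping with the derivation; the function names `field_threshold` and `photon_number` and the sample values of $\Delta t$ are assumptions made for the example.

```python
import math

HBAR = 1.054571817e-27  # Planck's constant / 2 pi, in erg s (CGS)
C = 2.99792458e10       # velocity of light, in cm / s


def field_threshold(dt: float) -> float:
    """Right-hand side of (5.2), sqrt(hbar c) / (c dt)^2, in statvolt/cm for dt in seconds."""
    return math.sqrt(HBAR * C) / (C * dt) ** 2


def photon_number(e_field: float, dt: float) -> float:
    """Order-of-magnitude estimate N ~ E^2 c^3 / (hbar w^4), taking w ~ 1/dt."""
    omega = 1.0 / dt
    return e_field**2 * C**3 / (HBAR * omega**4)


if __name__ == "__main__":
    for dt in (1e-15, 1e-9, 1e-3):
        print(f"dt = {dt:.0e} s  threshold field ~ {field_threshold(dt):.3e} statvolt/cm")
```

By construction a field equal to the threshold corresponds to $N \sim 1$, and halving $\Delta t$ raises the required field strength by a factor of four.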
It has already been mentioned that the classical expressions for the electromagnetic field as a superposition of plane waves must be regarded in quantum theory as operator expressions. These operators, however, have only a very limited physical meaning. A physically meaningful field operator would have to give zero field values in the photon vacuum state, whereas the mean value of the squared field operator $\hat{\mathbf{E}}^2$ in the ground state, which is the same as the zero-point energy of the field apart from a factor, is infinite; by the "mean value" is meant the quantum-mechanical mean value, i.e. the corresponding diagonal matrix element of the operator. This infinity cannot be avoided even by any formal cancelling operation (as was done for the field energy), since here this would have to be carried out by means of some appropriate modification of the operators $\hat{\mathbf{E}}$ and $\hat{\mathbf{H}}$ themselves (not their squares), which is impossible.

§6. The angular momentum and parity of the photon

The photon, like any other particle, can possess a certain angular momentum. In order to determine the properties of this quantity for the photon, let us first recall the relationship between the properties of the wave function of a particle and the angular momentum of the particle, in the mathematical formalism of quantum mechanics.

The angular momentum $\mathbf{j}$ of a particle consists of its orbital angular momentum $\mathbf{l}$ and its intrinsic angular momentum or spin $\mathbf{s}$. The wave function of a particle having spin $s$ is a symmetrical spinor of rank $2s$, i.e. is a set of $2s + 1$ components which are transformed into definite combinations of one another when the coordinate axes are rotated. The orbital angular momentum is related to the way in which the wave functions depend on the coordinates: states with orbital angular momentum $l$ correspond to wave functions whose components are linear combinations of the spherical harmonic functions of order $l$.
The consistent distinguishability of the spin and the orbital angular momentum therefore requires that the "spin" and "coordinate" properties of the wave functions should be independent of each other: the dependence of the spinor components on the coordinates (at a given instant) must not be subject to any additional restrictions.

In the momentum representation of the wave functions, their dependence on the coordinates is replaced by their dependence on the momentum $\mathbf{k}$. The photon wave function (in the three-dimensionally transverse gauge) is the vector $\mathbf{A}(\mathbf{k})$. A vector is equivalent to a spinor of rank 2, and in this sense the photon might be said to have spin 1. But this vector wave function satisfies the transversality condition, $\mathbf{k}\cdot\mathbf{A}(\mathbf{k}) = 0$, which is a further condition imposed on the function $\mathbf{A}(\mathbf{k})$. Consequently, this function cannot be arbitrarily specified as regards every component of the vector at the same time, and therefore the orbital angular momentum and the spin cannot be strictly distinguished.

The definition of the spin as the angular momentum of a particle at rest is also inapplicable to the photon, because there is no rest frame for a photon, which moves with the velocity of light.

Thus only the total angular momentum of the photon has a meaning. It is, moreover, obvious that this total angular momentum must be integral, since the quantities describing the photon do not include any spinors of odd rank.

The state of a photon, like that of any particle, is also described by its parity, which refers to the behaviour of the wave function under inversion of the coordinates (see QM, §30). In the momentum representation, the change of sign of the coordinates is replaced by the change of sign of all the components of $\mathbf{k}$. The effect of the inversion operator $P$ on a scalar function $\phi(\mathbf{k})$ is simply to produce this change of sign: $P\phi(\mathbf{k}) = \phi(-\mathbf{k})$.
When it is applied to a vector function $\mathbf{A}(\mathbf{k})$, we must also take into account the fact that the reversal of the directions of the axes changes the sign of all the components of the vector; hence†

$P\mathbf{A}(\mathbf{k}) = -\mathbf{A}(-\mathbf{k}).$ (6.1)

Although the separation of the angular momentum of the photon into the orbital angular momentum and the spin has no physical meaning, it is nevertheless convenient to define a "spin" $s$ and an "orbital angular momentum" $l$ as formal auxiliary quantities which express the transformation properties of the wave function under rotations: the value $s = 1$ corresponds to the fact that the wave function is a vector, and the value of $l$ is the order of the spherical harmonics which occur in the wave function. Here we are considering the wave functions of states in which the photon angular momentum has a definite value; for a free particle, these are spherical waves. The number $l$, in particular, defines the parity of the photon state, which is

$P = (-1)^{l+1}.$ (6.2)

In the same way, the angular momentum operator $\hat{\mathbf{j}}$ may be represented as the sum $\hat{\mathbf{s}} + \hat{\mathbf{l}}$. The operator $\hat{\mathbf{j}}$ is related to the operator of an infinitesimal rotation of the coordinates, or, in the present case, to the action of this operator on a vector field. In the sum $\hat{\mathbf{s}} + \hat{\mathbf{l}}$, the operator $\hat{\mathbf{s}}$ acts on the vector index, transforming the components of the vector into combinations of one another. The operator $\hat{\mathbf{l}}$ acts on these components as functions of the momentum (or of the coordinates).

We may count the number of states (with a given energy) which are possible for a given value $j$ of the photon angular momentum, ignoring the trivial $(2j + 1)$-fold degeneracy with respect to the directions of the angular momentum.

When $\mathbf{l}$ and $\mathbf{s}$ are independent, this calculation is made by simply counting the number of ways in which the angular momenta $\mathbf{l}$ and $\mathbf{s}$ can be added, according to the rules of the vector model, so as to obtain the required value of $j$.
For a particle with spin $s = 1$, and a given non-zero value of $j$, this would give three states, with the following values of $l$ and the parity $P$:

$l = j, \quad P = (-1)^{l+1} = (-1)^{j+1};$
$l = j \pm 1, \quad P = (-1)^{l+1} = (-1)^{j}.$

If $j = 0$, however, only one state is obtained, with $l = 1$ and parity $P = +1$.

† We shall choose to define the parity of a state according to the effect of the inversion operator on a polar vector, such as $\mathbf{A}$ (or the corresponding electric vector $\mathbf{E} = i\omega\mathbf{A}$). This differs in sign from the effect on the axial vector $\mathbf{H} = i\mathbf{k}\times\mathbf{A}$, since the direction of such a vector is unaltered by inversion: $P\mathbf{H}(\mathbf{k}) = \mathbf{H}(-\mathbf{k})$.

In this calculation the condition that the vector $\mathbf{A}$ is transverse has not been taken into account; all its three components have been assumed to be independent. We must therefore subtract, from the numbers of states found above, the numbers of states which correspond to a longitudinal vector. This vector may be written in the form $\mathbf{k}\phi(\mathbf{k})$, whence we see that its three components are equivalent, as regards their transformation properties (under rotations), to a single scalar $\phi$.† We can therefore say that the extra state which is incompatible with the transversality condition would correspond to the state of a particle having a scalar wave function (spinor of rank 0), i.e. having "spin zero".‡ The angular momentum $j$ of this state is therefore equal to the order of the spherical harmonics which occur in $\phi$. The parity of the state as a state of the photon is determined by the action of the inversion operator on the vector function $\mathbf{k}\phi$:

$P(\mathbf{k}\phi) = -(-\mathbf{k})\phi(-\mathbf{k}) = (-1)^{j}\,\mathbf{k}\phi(\mathbf{k}),$

and is therefore $(-1)^{j}$. Thus we must subtract one from the number of states found above which have the parity $(-1)^{j}$, i.e. two for $j \neq 0$ and one for $j = 0$.

The conclusion is, then, that when the photon angular momentum $j$ is non-zero there is one even state and one odd state. When $j = 0$, no states exist.
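The counting argument above can be written out as a short enumeration. The sketch below assumes only the rules stated in the text (vector addition of $l$ and $s = 1$, parity $(-1)^{l+1}$, and removal of one longitudinal state of parity $(-1)^{j}$); the function names `vector_states` and `photon_parities` are illustrative.

```python
def vector_states(j: int):
    """List (l, parity) for a vector (s = 1) function with total angular momentum j.

    All three vector components are treated as independent; parity is (-1)**(l + 1).
    """
    states = []
    for l in (j - 1, j, j + 1):
        # l must be non-negative and satisfy the triangle rule |l - 1| <= j <= l + 1
        if l >= 0 and abs(l - 1) <= j <= l + 1:
            states.append((l, (-1) ** (l + 1)))
    return states


def photon_parities(j: int):
    """Parities of the transverse (photon) states left after removing the longitudinal one."""
    parities = [parity for _, parity in vector_states(j)]
    parities.remove((-1) ** j)  # the longitudinal state k*phi(k) has parity (-1)**j
    return sorted(parities)


if __name__ == "__main__":
    for j in range(4):
        print(j, vector_states(j), photon_parities(j))
```

Only the parities are returned for the photon states, since the surviving state of parity $(-1)^{j}$ is a combination of $l = j \pm 1$ and cannot be assigned a single $l$. The enumeration leaves one state of each parity for every $j \geq 1$ and nothing for $j = 0$.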
This means that a photon cannot have zero angular momentum; $j$ therefore takes only the values 1, 2, 3, .... The impossibility of $j = 0$ is evident a priori, since the wave function of a state with zero angular momentum must be spherically symmetrical, and this cannot be true for a transverse wave.

The following terminology is customary to denote the various states of the photon. A photon with angular momentum $j$ and parity $(-1)^{j}$ is called an electric $2^{j}$-pole (or $Ej$) photon; one with parity $(-1)^{j+1}$ is called a magnetic $2^{j}$-pole (or $Mj$) photon. For example, an odd state with $j = 1$ corresponds to an electric dipole photon, an even state with $j = 2$ to an electric quadrupole photon, and an even state with $j = 1$ to a magnetic dipole photon.§

† This is because the transformation of a quantity under rotation is a transformation at a given point, i.e. for a given value of $\mathbf{k}$. Under such a transformation, $\mathbf{k}\phi(\mathbf{k})$ is unchanged, i.e. it behaves as a scalar.

‡ It should be again emphasized that this does not refer to a state of an actual particle. The calculation given here is a formal one, and amounts mathematically to a classification of the set of quantities which are transformed into combinations of one another, in terms of the irreducible representations of the rotation group.

§ This nomenclature corresponds to the terminology of classical radiation theory; we shall see later (§§46, 47) that the emission of electric and magnetic photons is governed by the electric and magnetic moments of a system of charges.

§ 7. Spherical waves of photons

Having ascertained the possible values of the photon angular momentum, we must now determine the corresponding wave functions.†

Let us first consider the formal problem of determining vector functions which are eigenfunctions of the operators $\hat{\mathbf{j}}^2$ and $\hat j_z$, without deciding as yet which of these functions will appear in the desired photon wave functions, and without taking account of the transversality condition.
We shall look for the functions in the momentum representation. In this representation, the coordinate operator is $\hat{\mathbf{r}} = i\,\partial/\partial\mathbf{k}$ (see QM, (15.12)). The orbital angular momentum operator is

$$\hat{\mathbf{l}} = \hat{\mathbf{r}}\times\mathbf{k} = -i\,\mathbf{k}\times\partial/\partial\mathbf{k},$$

and therefore differs from the angular momentum operator in the coordinate representation only in that $\mathbf{r}$ is replaced by $\mathbf{k}$. The solution of the problem is thus formally identical in the two representations.

Let the required eigenfunctions be denoted by $\mathbf{Y}_{jm}$ and referred to as spherical harmonic vectors. They must satisfy the conditions

$$\hat{\mathbf{j}}^2\,\mathbf{Y}_{jm} = j(j+1)\,\mathbf{Y}_{jm},\qquad \hat j_z\,\mathbf{Y}_{jm} = m\,\mathbf{Y}_{jm}, \tag{7.1}$$

the $z$-axis being in a specified direction in space.

We shall show that these conditions are satisfied by any function of the form $\mathbf{a}Y_{jm}$, where $\mathbf{a}$ is any vector formed from the unit vector $\mathbf{n} = \mathbf{k}/\omega$, and $Y_{jm}$ are the ordinary (scalar) spherical harmonic functions. The latter will everywhere be defined as in QM, §28:

$$Y_{lm}(\mathbf{n}) = (-1)^{(m+|m|)/2}\, i^{\,l}\, \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-|m|)!}{(l+|m|)!}}\; P_l^{|m|}(\cos\theta)\, e^{im\phi}, \tag{7.2}$$

where $\theta$ and $\phi$ are the spherical polar angles of the direction $\mathbf{n}$.‡

The proof is based on the commutation rule

$$\{\hat l_i, a_k\}_- = i\,e_{ikl}\,a_l$$

(QM, (29.4)). The right-hand side may be written as $-\hat s_i a_k$, where $\hat{\mathbf{s}}$ is the operator of spin 1; the effect of this operator on a vector function is in fact given by $\hat s_i a_k = -i\,e_{ikl}\,a_l$ (see QM, §57, Problem 2). Hence

$$\hat l_i a_k - a_k \hat l_i = -\hat s_i a_k,$$

and therefore

$$\hat j_i a_k = (\hat l_i + \hat s_i)\,a_k = a_k \hat l_i.$$

Consequently

$$\hat{\mathbf{j}}^2(\mathbf{a}Y_{jm}) = \mathbf{a}\,\hat{\mathbf{l}}^2 Y_{jm},\qquad \hat j_z(\mathbf{a}Y_{jm}) = \mathbf{a}\,\hat l_z Y_{jm}.$$

Since the spherical harmonic $Y_{jm}$ is the eigenfunction of the operators $\hat{\mathbf{l}}^2$ and $\hat l_z$ which corresponds to the respective eigenvalues $j(j + 1)$ and $m$, we arrive at equations (7.1).

† This problem was first discussed by W. Heitler (1936). The solution given here is due to V. B. Berestetskii (1947).

‡ For future reference, the value of the function when $\theta = 0$ ($\mathbf{n}$ is along the $z$-axis) is

$$Y_{lm}(\mathbf{n}_z) = i^{\,l}\sqrt{\frac{2l+1}{4\pi}}\;\delta_{m0}. \tag{7.2a}$$
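Because the phase convention of (7.2) differs from the one used by many numerical libraries, a small self-contained check may be useful. The sketch below is an illustration only: the function names are hypothetical, the associated Legendre function is assumed to be defined without the Condon-Shortley sign, as $P_l^m(x) = (1-x^2)^{m/2}\,d^m P_l/dx^m$, and orthonormality is verified by a crude midpoint quadrature.

```python
import cmath
import math


def assoc_legendre(l, m, x):
    """P_l^m(x) for 0 <= m <= l, assumed here to carry no Condon-Shortley sign."""
    s = math.sqrt(max(0.0, 1.0 - x * x))
    pmm = 1.0
    for k in range(1, m + 1):          # P_m^m = (2m-1)!! (1-x^2)^(m/2)
        pmm *= (2 * k - 1) * s
    if l == m:
        return pmm
    pm1 = x * (2 * m + 1) * pmm        # P_{m+1}^m
    for ll in range(m + 2, l + 1):     # upward recurrence in l
        pmm, pm1 = pm1, ((2 * ll - 1) * x * pm1 - (ll + m - 1) * pmm) / (ll - m)
    return pm1


def Y(l, m, theta, phi):
    """Scalar spherical harmonic in the convention of (7.2)."""
    am = abs(m)
    norm = math.sqrt((2 * l + 1) / (4 * math.pi)
                     * math.factorial(l - am) / math.factorial(l + am))
    sign = (-1) ** ((m + am) // 2)
    return (sign * (1j ** l) * norm
            * assoc_legendre(l, am, math.cos(theta)) * cmath.exp(1j * m * phi))


def overlap(l1, m1, l2, m2, n_theta=200, n_phi=16):
    """Midpoint-rule estimate of the integral of Y_{l1 m1} Y*_{l2 m2} over the sphere."""
    dth, dph = math.pi / n_theta, 2 * math.pi / n_phi
    total = 0j
    for i in range(n_theta):
        th = (i + 0.5) * dth
        for k in range(n_phi):
            ph = k * dph
            total += Y(l1, m1, th, ph) * Y(l2, m2, th, ph).conjugate() * math.sin(th)
    return total * dth * dph


if __name__ == "__main__":
    print(Y(1, 0, 0.0, 0.0))       # value along the z-axis, to compare with (7.2a)
    print(overlap(2, 1, 2, 1))     # close to 1
    print(overlap(2, 1, 1, 1))     # close to 0
```

With this convention the value on the $z$-axis reproduces (7.2a), which is a quick way of confirming that the factor $i^{\,l}$ has been included.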
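The three essentially different types of spherical harmonic vectors are obtained by taking as the vector $\mathbf{a}$ the three following vectors:†

$$\frac{\nabla_{\mathbf n}}{\sqrt{j(j+1)}},\qquad \frac{\mathbf{n}\times\nabla_{\mathbf n}}{\sqrt{j(j+1)}},\qquad \mathbf{n}. \tag{7.3}$$

The spherical harmonic vectors are thus defined as

$$\begin{aligned}
\mathbf{Y}^{(e)}_{jm} &= \frac{1}{\sqrt{j(j+1)}}\,\nabla_{\mathbf n} Y_{jm}, & P &= (-1)^{j};\\
\mathbf{Y}^{(m)}_{jm} &= \mathbf{n}\times\mathbf{Y}^{(e)}_{jm}, & P &= (-1)^{j+1};\\
\mathbf{Y}^{(l)}_{jm} &= \mathbf{n}\,Y_{jm}, & P &= (-1)^{j}.
\end{aligned} \tag{7.4}$$

The parity $P$ is also shown for each vector. The three vectors are orthogonal, $\mathbf{Y}^{(l)}_{jm}$ being longitudinal and $\mathbf{Y}^{(e)}_{jm}$ and $\mathbf{Y}^{(m)}_{jm}$ transverse with respect to $\mathbf{n}$.

† The operator $\nabla_{\mathbf n} = |\mathbf{k}|\nabla_{\mathbf k}$, and acts on functions which depend only on the direction of $\mathbf{n}$. In spherical polar coordinates its two components are

$$\nabla_{\mathbf n} = \left(\frac{\partial}{\partial\theta},\ \frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right).$$

The operator denoted below by $\Delta_{\mathbf n}$ is the angular part of the Laplacian operator:

$$\Delta_{\mathbf n} = \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}.$$

The spherical harmonic vectors can be expressed in terms of the scalar spherical harmonics: $\mathbf{Y}^{(m)}_{jm}$ in terms of spherical harmonics of the order $l = j$ only, and $\mathbf{Y}^{(e)}_{jm}$ and $\mathbf{Y}^{(l)}_{jm}$ in terms of those of order $l = j \pm 1$. This is immediately evident on comparing the parities shown in (7.4) with the parity $(-1)^{l+1}$ of a vector field in terms of the order $l$ of the spherical harmonics concerned.

The spherical harmonic vectors of any one type are orthonormal:

$$\int \mathbf{Y}_{jm}\cdot\mathbf{Y}^{*}_{j'm'}\,do = \delta_{jj'}\,\delta_{mm'}. \tag{7.5}$$

For the vectors $\mathbf{Y}^{(l)}_{jm}$ this is obvious from the normalization condition for the spherical harmonics $Y_{jm}$. For the vectors $\mathbf{Y}^{(e)}_{jm}$ the normalization integral is

$$\frac{1}{j(j+1)}\int \nabla_{\mathbf n}Y^{*}_{j'm'}\cdot\nabla_{\mathbf n}Y_{jm}\,do = -\frac{1}{j(j+1)}\int Y^{*}_{j'm'}\,\Delta_{\mathbf n}Y_{jm}\,do,$$

and, since $\Delta_{\mathbf n}Y_{jm} = -j(j+1)Y_{jm}$, equation (7.5) follows. The normalization for the vectors $\mathbf{Y}^{(m)}_{jm}$ leads to a similar integral.

The spherical harmonic vectors (7.4) could also be derived without the direct verification of equations (7.1) that has been carried out above, using only general arguments concerning the transformational properties of functions.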
In §6, these arguments were employed to show that a vector function $\mathbf{n}\phi$ corresponds to an angular momentum $j$ which is the same as the order of the spherical harmonics occurring in $\phi$. If we put simply $\phi = Y_{jm}$, the function $\mathbf{n}\phi$ will also correspond to a definite value $m$ of the angular-momentum component. Thus we derive at once the spherical harmonic vectors $\mathbf{Y}^{(l)}_{jm}$. But the discussion of transformational properties in §6 is unaffected if the factor $\mathbf{n}$ in the product $\mathbf{n}\phi$ is replaced by the vector $\nabla_{\mathbf n}$ or by $\mathbf{n}\times\nabla_{\mathbf n}$. This leads to the other two types of spherical harmonic vectors.

Let us now consider the photon wave functions. For an electric photon of type $Ej$, the parity of the vector $\mathbf{A}(\mathbf{k})$ is $(-1)^{j}$. The spherical harmonic vectors $\mathbf{Y}^{(e)}_{jm}$ and $\mathbf{Y}^{(l)}_{jm}$ possess this parity, but only the former satisfies the transversality condition. For a magnetic photon of type $Mj$, the parity of the vector $\mathbf{A}(\mathbf{k})$ is $(-1)^{j+1}$; only $\mathbf{Y}^{(m)}_{jm}$ has this parity. The wave functions of a photon having a given angular momentum $j$, component thereof $m$, and energy $\omega$, are therefore

$$\mathbf{A}_{\omega jm}(\mathbf{k}) = \frac{4\pi^2}{\omega^{3/2}}\,\delta(|\mathbf{k}| - \omega)\,\mathbf{Y}_{jm}(\mathbf{n}), \tag{7.6}$$

where $\mathbf{Y}_{jm}$ must be taken as $\mathbf{Y}^{(e)}_{jm}$ and $\mathbf{Y}^{(m)}_{jm}$ for electric and magnetic photons respectively. The given value of the energy is taken into account by the factor $\delta(|\mathbf{k}| - \omega)$.

The functions (7.6) are normalized by the condition

$$\frac{1}{(2\pi)^4}\int \omega\omega'\,\mathbf{A}^{*}_{\omega'j'm'}(\mathbf{k})\cdot\mathbf{A}_{\omega jm}(\mathbf{k})\,d^3k = \omega\,\delta(\omega' - \omega)\,\delta_{jj'}\,\delta_{mm'}. \tag{7.7}$$

For wave functions of the coordinate representation, the condition (7.7) is equivalent to the condition†

$$\frac{1}{4\pi}\int \left\{\mathbf{E}^{*}_{\omega'j'm'}(\mathbf{r})\cdot\mathbf{E}_{\omega jm}(\mathbf{r}) + \mathbf{H}^{*}_{\omega'j'm'}(\mathbf{r})\cdot\mathbf{H}_{\omega jm}(\mathbf{r})\right\} d^3x = \omega\,\delta(\omega' - \omega)\,\delta_{jj'}\,\delta_{mm'}. \tag{7.8}$$

† This condition is of the same type as (2.22). The factor $\delta(\omega' - \omega)$ on the right-hand side appears because we are now considering a field (spherical wave) throughout infinite space instead of in the finite volume $V = 1$.

The integral on the left of (7.8), when written in terms of the potentials, is

$$\frac{1}{2\pi}\int \mathbf{A}^{*}_{\omega'j'm'}(\mathbf{r})\cdot\mathbf{A}_{\omega jm}(\mathbf{r})\,\omega'\omega\,d^3x,$$
and with

$$\mathbf{A}_{\omega jm}(\mathbf{r}) = \int \mathbf{A}_{\omega jm}(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{r}}\,\frac{d^3k}{(2\pi)^3}, \tag{7.9}$$

the integral over $d^3x$ gives the delta function $(2\pi)^3\delta(\mathbf{k}' - \mathbf{k})$. This is eliminated by integrating over $d^3k$, and the integral reduces to (7.7).

So far, we have assumed that the potentials are in the transverse gauge, for which the scalar potential $\Phi = 0$. In certain applications, however, other gauges of the spherical wave may be more convenient. The gauge transformation of the potentials, written in the momentum representation, is

$$\mathbf{A} \to \mathbf{A} + \mathbf{n}f(\mathbf{k}),\qquad \Phi \to \Phi + f(\mathbf{k}),$$

where $f(\mathbf{k})$ is an arbitrary function. In the present case we shall choose it so that the new potentials are expressed in terms of the same spherical harmonics and again have a definite parity. For an electric photon, these conditions limit the choice of potentials to the following:

$$\begin{aligned}
\mathbf{A}^{(e)}_{\omega jm}(\mathbf{k}) &= \frac{4\pi^2}{\omega^{3/2}}\,\delta(|\mathbf{k}| - \omega)\left(\mathbf{Y}^{(e)}_{jm} + C\,\mathbf{n}Y_{jm}\right),\\
\Phi^{(e)}_{\omega jm}(\mathbf{k}) &= \frac{4\pi^2}{\omega^{3/2}}\,\delta(|\mathbf{k}| - \omega)\,C\,Y_{jm},
\end{aligned} \tag{7.10}$$

where $C$ is an arbitrary constant. For a magnetic photon, this addition to $\mathbf{A}^{(m)}(\mathbf{k})$ would leave it without a definite parity, and (7.6) is therefore the only possible choice under these conditions.

The probability that a photon having a definite angular momentum and parity will be recorded as moving in a direction $\mathbf{n}$ which lies in the solid-angle element $do$ is, according to (3.5) and (7.6),

$$w(\mathbf{n})\,do = \left|\mathbf{Y}^{(e)}_{jm}(\mathbf{n})\right|^2 do. \tag{7.11}$$

This is the expression for an $E$ photon, but, since $|\mathbf{Y}^{(m)}_{jm}|^2 = |\mathbf{Y}^{(e)}_{jm}|^2$, the probability distribution $w(\mathbf{n})$ is the same for both types of photon.

The squared modulus $|\mathbf{Y}^{(e)}_{jm}|^2$ is independent of the azimuthal angle $\phi$, since the factors $e^{\pm im\phi}$ in the spherical harmonic functions cancel. The probability distribution $w(\mathbf{n})$ is therefore symmetrical about the $z$-axis. Moreover, since each of the spherical harmonic vectors has a definite parity, their squared moduli are unaffected by inversion, i.e. by the change of polar angle $\theta \to \pi - \theta$; this means that the expansion of the function $w(\theta)$ in Legendre polynomials will contain only those of even order.
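These statements can be tried numerically in the simplest cases. The sketch below is an illustration, not the method of the text: it assumes that the Legendre expansion of the distribution has the form $w(\theta) = (-1)^{m+1}\,\frac{2j+1}{4\pi}\sum_{n=0}^{j}(4n+1)\begin{pmatrix} j & j & 2n\\ 1 & -1 & 0\end{pmatrix}\begin{pmatrix} j & j & 2n\\ m & -m & 0\end{pmatrix}P_{2n}(\cos\theta)$, and compares it with closed forms of $|\mathbf{Y}^{(e)}_{jm}|^2$ worked out by hand from (7.2) and (7.4) for $j = 1$ and for $j = 2$, $m = 0$. The 3j-symbol is evaluated with Racah's formula for integer arguments; all function names are hypothetical.

```python
import math
from math import factorial as fact


def wigner_3j(j1, j2, j3, m1, m2, m3):
    """Wigner 3j-symbol for integer arguments, from Racah's formula."""
    if m1 + m2 + m3 != 0 or not abs(j1 - j2) <= j3 <= j1 + j2:
        return 0.0
    if abs(m1) > j1 or abs(m2) > j2 or abs(m3) > j3:
        return 0.0
    delta = (fact(j1 + j2 - j3) * fact(j1 - j2 + j3) * fact(-j1 + j2 + j3)
             / fact(j1 + j2 + j3 + 1))
    pref = math.sqrt(delta * fact(j1 + m1) * fact(j1 - m1) * fact(j2 + m2)
                     * fact(j2 - m2) * fact(j3 + m3) * fact(j3 - m3))
    total = 0.0
    for k in range(j1 + j2 - j3 + 1):
        args = (k, j1 + j2 - j3 - k, j1 - m1 - k, j2 + m2 - k,
                j3 - j2 + m1 + k, j3 - j1 - m2 + k)
        if min(args) < 0:
            continue                      # factorial of a negative number: term absent
        term = 1.0
        for a in args:
            term /= fact(a)
        total += (-1) ** k * term
    return (-1) ** (j1 - j2 - m3) * pref * total


def legendre(n, x):
    """Legendre polynomial P_n(x) by upward recurrence."""
    p0, p1 = 1.0, x
    if n == 0:
        return p0
    for k in range(2, n + 1):
        p0, p1 = p1, ((2 * k - 1) * x * p1 - (k - 1) * p0) / k
    return p1


def w_series(j, m, theta):
    """Assumed Legendre expansion of the angular distribution (even orders only)."""
    x = math.cos(theta)
    s = sum((4 * n + 1) * wigner_3j(j, j, 2 * n, 1, -1, 0)
            * wigner_3j(j, j, 2 * n, m, -m, 0) * legendre(2 * n, x)
            for n in range(j + 1))
    return (-1) ** (m + 1) * (2 * j + 1) / (4 * math.pi) * s


def w_closed(j, m, theta):
    """Hand-derived |Y^(e)_jm|^2 for a few low multipoles."""
    s2, c2 = math.sin(theta) ** 2, math.cos(theta) ** 2
    if j == 1 and m == 0:
        return 3 / (8 * math.pi) * s2
    if j == 1 and abs(m) == 1:
        return 3 / (16 * math.pi) * (1 + c2)
    if j == 2 and m == 0:
        return 15 / (8 * math.pi) * s2 * c2
    raise ValueError("no closed form tabulated for this (j, m)")


if __name__ == "__main__":
    for j, m in [(1, 0), (1, 1), (2, 0)]:
        print(j, m, w_series(j, m, 0.7), w_closed(j, m, 0.7))
```

Only even-order polynomials $P_{2n}$ enter the sum, so the symmetry under $\theta \to \pi - \theta$ is built in; the comparison with the closed forms checks the coefficients themselves.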
The determination of the expansion coefficients is equivalent to a calculation of the integrals of products of three spherical harmonic functions, followed by summation over components. These processes are effected by means of the formulae derived in QM, §§107, 108, and the result is

$$w(\theta) = (-1)^{m+1}\,\frac{2j+1}{4\pi}\sum_{n=0}^{j}(4n+1)\begin{pmatrix} j & j & 2n\\ 1 & -1 & 0\end{pmatrix}\begin{pmatrix} j & j & 2n\\ m & -m & 0\end{pmatrix}P_{2n}(\cos\theta). \tag{7.12}$$

Finally, we shall give the expressions for the components of the spherical harmonic vectors as expansions in terms of spherical harmonic functions. To do so, we shall use the "spherical components" of a vector, defined as in QM, §107. These components $f_\lambda$ of a vector $\mathbf{f}$ are

$$f_0 = i f_z,\qquad f_{+1} = -\frac{i}{\sqrt2}\,(f_x + i f_y),\qquad f_{-1} = \frac{i}{\sqrt2}\,(f_x - i f_y). \tag{7.13}$$

In terms of the "spherical unit vectors",

$$\mathbf{e}^{(0)} = i\,\mathbf{e}^{(z)},\qquad \mathbf{e}^{(+1)} = -\frac{i}{\sqrt2}\left(\mathbf{e}^{(x)} + i\,\mathbf{e}^{(y)}\right),\qquad \mathbf{e}^{(-1)} = \frac{i}{\sqrt2}\left(\mathbf{e}^{(x)} - i\,\mathbf{e}^{(y)}\right), \tag{7.14}$$

where $\mathbf{e}^{(x,y,z)}$ are unit vectors in the direction of $x$, $y$ and $z$, we have

$$\mathbf{f} = \sum_\lambda (-1)^{1-\lambda} f_{-\lambda}\,\mathbf{e}^{(\lambda)},\qquad f_\lambda = (-1)^{1-\lambda}\,\mathbf{f}\cdot\mathbf{e}^{(-\lambda)*} = \mathbf{f}\cdot\mathbf{e}^{(\lambda)}. \tag{7.15}$$

The spherical components of the spherical harmonic vectors are expressed in terms of 3j-symbols and spherical harmonic functions as follows:

$$\begin{aligned}
(-1)^{j+m+\lambda+1}\left(\mathbf{Y}^{(e)}_{jm}\right)_\lambda &= -\sqrt{j}\begin{pmatrix} j+1 & 1 & j\\ m+\lambda & -\lambda & -m\end{pmatrix}Y_{j+1,\,m+\lambda} + \sqrt{j+1}\begin{pmatrix} j-1 & 1 & j\\ m+\lambda & -\lambda & -m\end{pmatrix}Y_{j-1,\,m+\lambda},\\
(-1)^{j+m+\lambda+1}\left(\mathbf{Y}^{(m)}_{jm}\right)_\lambda &= -\sqrt{2j+1}\begin{pmatrix} j & 1 & j\\ m+\lambda & -\lambda & -m\end{pmatrix}Y_{j,\,m+\lambda},\\
(-1)^{j+m+\lambda+1}\left(\mathbf{Y}^{(l)}_{jm}\right)_\lambda &= \sqrt{j+1}\begin{pmatrix} j+1 & 1 & j\\ m+\lambda & -\lambda & -m\end{pmatrix}Y_{j+1,\,m+\lambda} + \sqrt{j}\begin{pmatrix} j-1 & 1 & j\\ m+\lambda & -\lambda & -m\end{pmatrix}Y_{j-1,\,m+\lambda}.
\end{aligned} \tag{7.16}$$

These formulae are derived in the following way. Each of the three spherical harmonic vectors is of the form $\mathbf{Y}_{jm} = \mathbf{a}Y_{jm}$, where $\mathbf{a}$ is one of the three vectors (7.3). Hence

$$\mathbf{Y}_{jm} = \sum_{l,m'}\langle lm'|\mathbf{a}|jm\rangle\,Y_{lm'},$$

and the problem is equivalent to that of finding the matrix elements of the vector $\mathbf{a}$ with respect to the eigenfunctions of the orbital angular momentum. According to QM (107.6), we have

$$\langle lm'|a_\lambda|jm\rangle = i\,(-1)^{j_{\max}-m'}\begin{pmatrix} l & 1 & j\\ -m' & \lambda & m\end{pmatrix}\langle l\|a\|j\rangle,$$

where $j_{\max}$ is the larger of $l$ and $j$. It is therefore sufficient to know the non-zero reduced matrix elements $\langle l\|a\|j\rangle$. These are given by the formulae

$$\begin{aligned}
\langle l-1\|n\|l\rangle &= \langle l\|n\|l-1\rangle^{*} = i\sqrt{l},\\
\langle l\|\nabla_{\mathbf n}\|l-1\rangle &= i(l-1)\sqrt{l},\qquad \langle l-1\|\nabla_{\mathbf n}\|l\rangle = i(l+1)\sqrt{l},\\
\langle l\|\mathbf{n}\times\nabla_{\mathbf n}\|l\rangle &= i\sqrt{l(l+1)(2l+1)}.
\end{aligned} \tag{7.17}$$

§ 8. The polarization of the photon

The polarization vector $\mathbf{e}$ acts for the photon as the "spin part" of the wave function (with the limitations stated in §6 in connection with the concept of photon spin).

The various cases which can occur with regard to the polarization of the photon are identical with the possible types of polarization of a classical electromagnetic wave (see Fields, §48).

Any polarization $\mathbf{e}$ can be represented as a superposition of two mutually orthogonal polarizations $\mathbf{e}^{(1)}$ and $\mathbf{e}^{(2)}$ ($\mathbf{e}^{(1)}\cdot\mathbf{e}^{(2)*} = 0$), chosen in some specified manner. In the resolution

$$\mathbf{e} = e_1\mathbf{e}^{(1)} + e_2\mathbf{e}^{(2)} \tag{8.1}$$

the squares of the moduli of the coefficients $e_1$ and $e_2$ determine the probabilities that the photon has polarization $\mathbf{e}^{(1)}$ and $\mathbf{e}^{(2)}$ respectively.

These polarizations may be taken to be two mutually perpendicular linear polarizations. We can also resolve any polarization into two circular polarizations having opposite directions of rotation. The vectors of the right-hand and left-hand circular polarizations will be denoted by $\mathbf{e}^{(+1)}$ and $\mathbf{e}^{(-1)}$ respectively; in coordinates $\xi$, $\eta$, $\zeta$, with the $\zeta$-axis in the direction of motion of the photon, $\mathbf{n} = \mathbf{k}/\omega$,

$$\mathbf{e}^{(+1)} = -\frac{i}{\sqrt2}\left(\mathbf{e}^{(\xi)} + i\,\mathbf{e}^{(\eta)}\right),\qquad \mathbf{e}^{(-1)} = \frac{i}{\sqrt2}\left(\mathbf{e}^{(\xi)} - i\,\mathbf{e}^{(\eta)}\right). \tag{8.2}$$

The possibility that the photon has two different polarizations (for a given momentum) is equivalent to the statement that each eigenvalue of the momentum is doubly degenerate. This property is closely related to the fact that the mass of the photon is zero.

A freely moving particle with non-zero mass always has a rest frame. The intrinsic symmetry properties of the particle, as such, will evidently appear in this particular frame of reference. Symmetry with respect to all possible rotations about the centre (i.e. with respect to the entire spherical symmetry group) must be considered.
The property which describes the symmetry of the particle with respect to this group is its spin $s$; this determines the degree of degeneracy, the number of different wave functions which are transformed into linear combinations of one another being $2s + 1$. In particular, a particle having a vector (three-component) wave function has spin 1.

If the mass of the particle is zero, however, there is no rest frame, since it moves with the velocity of light in every frame of reference. For such a particle, there is always a distinctive direction in space, the direction of the momentum vector $\mathbf{k}$ (the $\zeta$-axis). In such a case there is clearly no symmetry with respect to the whole group of rotations in three dimensions, but only axial symmetry about the preferred axis.

When there is axial symmetry, only the helicity of the particle is conserved, i.e. the component of its angular momentum along the $\zeta$-axis, which we denote by $\lambda$.† If we also impose the condition of symmetry under reflections in planes passing through the $\zeta$-axis, the states differing in the sign of $\lambda$ will be mutually degenerate, and when $\lambda \neq 0$ there is therefore twofold degeneracy.‡ The state of a photon having a definite momentum in fact corresponds to one type of these doubly degenerate states. It is described by a "spin" wave function which is a vector $\mathbf{e}$ in the $\xi\eta$-plane; the two components of this vector are transformed into combinations of each other by any rotation about the $\zeta$-axis and by any reflection in a plane passing through that axis.

The various cases of the polarization of the photon are in a certain relationship to the possible values of its helicity. The relationship can be deduced from the formulae in QM (57.9), which connect the components of a vector wave function with those of the equivalent spinor of rank two.§ Vectors $\mathbf{e}$ with only the component $e_\xi - ie_\eta$ or $e_\xi + ie_\eta$ non-zero correspond to the components $\lambda = +1$ or $-1$ respectively; these are $\mathbf{e} = \mathbf{e}^{(+1)}$ and $\mathbf{e} = \mathbf{e}^{(-1)}$.
In other words, the values $\lambda = +1$ and $-1$ correspond to right-hand and left-hand circular polarization of the photon. In §16 the same result will be derived by direct calculation of the eigenfunctions of the spin component operator.

Thus the component of the photon angular momentum along the direction of its motion can have only the two values $\pm 1$; the value zero is not possible.

A state of the photon having a definite momentum and polarization is a pure state, in the sense defined in QM, §14; it is described by a wave function, and corresponds to a complete quantum-mechanical description of the state of the particle (the photon). "Mixed" photon states are also possible, which correspond to a less complete description by a density matrix only, not a wave function.

† This is to be distinguished from $m$, the component of the angular momentum in a specified direction in space (the $z$-axis), which was used in §7.

‡ This is the method of classifying the electron terms of the diatomic molecule (QM, §78).

§ It is the contravariant spinor components that correspond to the components of the wave function as the probability amplitudes of various values of the angular momentum of the particle (which are here considered).

Let us consider a state of the photon which is mixed as regards its polarization, but corresponds to a definite value of the momentum $\mathbf{k}$. In such a state (called a state of partial polarization), a "coordinate" wave function exists.

The polarization density matrix of the photon is a tensor $\rho_{\alpha\beta}$ of rank two, in a plane perpendicular to the vector $\mathbf{n}$ (the $\xi\eta$-plane; the suffixes $\alpha$, $\beta$ take only two values). This tensor is Hermitian:

$$\rho_{\alpha\beta} = \rho^{*}_{\beta\alpha}, \tag{8.3}$$

and is normalized by the condition

$$\rho_{\alpha\alpha} = \rho_{11} + \rho_{22} = 1. \tag{8.4}$$

From (8.3), the diagonal components $\rho_{11}$ and $\rho_{22}$ are real, and either is given in terms of the other by (8.4). The component $\rho_{12}$ is complex, and $\rho_{21} = \rho^{*}_{12}$. The density matrix therefore involves three real parameters.
If the polarization density matrix is known, we can find the probability that the photon has any given polarization $\mathbf{e}$. This probability is determined by the "projection" of the tensor $\rho_{\alpha\beta}$ on the direction of the vector $\mathbf{e}$, i.e. by the quantity

$$\rho_{\alpha\beta}\,e^{*}_\alpha e_\beta. \tag{8.5}$$

For example, the components $\rho_{11}$ and $\rho_{22}$ are the probabilities of linear polarizations along the $\xi$ and $\eta$ axes. The probability of the two circular polarizations is given by taking the projections along the vectors (8.2):

$$\tfrac12\left[1 \pm i(\rho_{12} - \rho_{21})\right]. \tag{8.6}$$

The properties of the tensor $\rho_{\alpha\beta}$ are essentially the same as those of the tensor $J_{\alpha\beta}$ which describes partially polarized light in the classical theory (see Fields, §50). Some of these properties are the following.

For a pure state with a definite polarization $\mathbf{e}$, the tensor $\rho_{\alpha\beta}$ reduces to products of components of the vector $\mathbf{e}$:

$$\rho_{\alpha\beta} = e_\alpha e^{*}_\beta, \tag{8.7}$$

and the determinant $|\rho_{\alpha\beta}| = 0$. In the opposite case of an unpolarized photon, all directions of polarization are equally probable, i.e.

$$\rho_{\alpha\beta} = \tfrac12\,\delta_{\alpha\beta}, \tag{8.8}$$

and $|\rho_{\alpha\beta}| = \tfrac14$.

In the general case, it is convenient to describe the partial polarization by means of three real Stokes parameters $\xi_1$, $\xi_2$, $\xi_3$,† in terms of which the density matrix can be written

$$\rho_{\alpha\beta} = \frac12\begin{pmatrix} 1+\xi_3 & \xi_1 - i\xi_2\\ \xi_1 + i\xi_2 & 1-\xi_3\end{pmatrix}. \tag{8.9}$$

All three parameters take values between $-1$ and $+1$. In the unpolarized state, $\xi_1 = \xi_2 = \xi_3 = 0$; for a completely polarized photon, $\xi_1^2 + \xi_2^2 + \xi_3^2 = 1$.

† These are not to be confused with the $\xi$-axis.

The parameter $\xi_3$ describes the linear polarization along the $\xi$ or $\eta$ axis; the probability that the photon is linearly polarized along these axes is respectively $\tfrac12(1 + \xi_3)$ and $\tfrac12(1 - \xi_3)$. The values $\xi_3 = +1$ and $-1$ therefore correspond to complete polarization in these directions.

The parameter $\xi_1$ describes the linear polarization along directions at angles $\phi = \pm\tfrac14\pi$ to the $\xi$-axis.
The probability that the photon is linearly polarized along these directions is respectively $\tfrac12(1 + \xi_1)$ and $\tfrac12(1 - \xi_1)$. This is easily shown by projecting the tensor $\rho_{\alpha\beta}$ on the directions $\mathbf{e} = (1, \pm 1)/\sqrt2$.

Finally, the parameter $\xi_2$ represents the degree of circular polarization: according to (8.6), the probability that the photon has right-hand or left-hand circular polarization is respectively $\tfrac12(1 + \xi_2)$ and $\tfrac12(1 - \xi_2)$. Since these two polarizations correspond to helicities $\lambda = \pm 1$, it is clear that $\xi_2$ is the mean value of the helicity of the photon. Moreover, for a pure state with polarization $\mathbf{e}$,

$$\xi_2 = i\,\mathbf{e}\times\mathbf{e}^{*}\cdot\mathbf{n}. \tag{8.10}$$

The quantities $\xi_2$ and $\sqrt{\xi_1^2 + \xi_3^2}$ are invariant under Lorentz transformations (see Fields, §50).

We shall later encounter the problem of the behaviour of the Stokes parameters under the operation of time reversal. It is easily seen that they are invariant. This property is evidently independent of the state of polarization, and therefore need be proved only for a pure state. In quantum mechanics, time reversal corresponds to replacing the wave function by its complex conjugate (QM, §18). For a plane-polarized wave, this implies the changes†

$$\mathbf{k} \to -\mathbf{k},\qquad \mathbf{e} \to -\mathbf{e}^{*}. \tag{8.11}$$

Under this transformation, the symmetrical part $\tfrac12(e_\alpha e^{*}_\beta + e_\beta e^{*}_\alpha)$ of the density matrix is unchanged, and therefore so are $\xi_1$ and $\xi_3$. The fact that $\xi_2$ is unchanged by this transformation is seen from (8.10), and is also evident from the fact that $\xi_2$ is the mean value of the helicity: the helicity is the component of the angular momentum $\mathbf{j}$ in the direction of $\mathbf{n}$, i.e. the product $\mathbf{j}\cdot\mathbf{n}$, and both these vectors change sign under time reversal.

† The change in the sign of $\mathbf{e}$ is necessary because time reversal changes the sign of the vector potential of the electromagnetic field. The scalar potential, however, does not change sign, and the effect of time reversal on the 4-vector $e$ is therefore as follows: $(e_0, \mathbf{e}) \to (e_0^{*}, -\mathbf{e}^{*})$.
(8.11a)

In later calculations, we shall need the photon density matrix written in four-dimensional form, i.e. as a certain 4-tensor $\rho_{\mu\nu}$. For a polarized photon described by the 4-vector $e_\mu$ this tensor can naturally be defined as

$$\rho_{\mu\nu} = e_\mu e^{*}_\nu. \tag{8.12}$$

In the three-dimensionally transverse gauge, $e = (0, \mathbf{e})$, and if one of the spatial coordinate axes is taken to be along $\mathbf{n}$ the non-zero components of this 4-tensor are the same as (8.7).

For an unpolarized photon the three-dimensionally transverse gauge corresponds to a tensor $\rho_{\mu\nu}$ having components

$$\rho_{ik} = \tfrac12(\delta_{ik} - n_i n_k),\qquad \rho_{0i} = \rho_{i0} = \rho_{00} = 0; \tag{8.13}$$

if one of the axes is in the direction of $\mathbf{n}$, the result is again (8.8). It would, however, be inconvenient to use the tensor $\rho_{\mu\nu}$ in this three-dimensional form. But a gauge transformation can be applied, which for the density matrix is

$$\rho_{\mu\nu} \to \rho_{\mu\nu} + \chi_\mu k_\nu + \chi_\nu k_\mu, \tag{8.14}$$

where the $\chi_\mu$ are arbitrary functions. Putting $\chi_0 = -1/4\omega$, $\chi_i = k_i/4\omega^2$, we obtain instead of (8.13) the simple four-dimensional expression

$$\rho_{\mu\nu} = -\tfrac12\,g_{\mu\nu}. \tag{8.15}$$

The four-dimensional form of the density matrix for a partially polarized photon is easily found by first writing the two-dimensional tensor (8.9) in three-dimensional form:

$$\rho_{ik} = \tfrac12\left(e^{(1)}_i e^{(1)}_k + e^{(2)}_i e^{(2)}_k\right) + \tfrac12\xi_1\left(e^{(1)}_i e^{(2)}_k + e^{(2)}_i e^{(1)}_k\right) - \tfrac12 i\xi_2\left(e^{(1)}_i e^{(2)}_k - e^{(2)}_i e^{(1)}_k\right) + \tfrac12\xi_3\left(e^{(1)}_i e^{(1)}_k - e^{(2)}_i e^{(2)}_k\right),$$

where $\mathbf{e}^{(1)}$ and $\mathbf{e}^{(2)}$ are unit vectors along the $\xi$ and $\eta$ axes. The required generalization is obtained on replacing these 3-vectors by real space-like unit 4-vectors $e^{(1)}$, $e^{(2)}$ which are orthogonal to each other and to the photon 4-momentum $k$:

$$e^{(1)2} = e^{(2)2} = -1,\qquad e^{(1)}e^{(2)} = 0,\qquad e^{(1)}k = e^{(2)}k = 0. \tag{8.16}$$

In one particular frame of reference, $e^{(1)} = (0, \mathbf{e}^{(1)})$ and $e^{(2)} = (0, \mathbf{e}^{(2)})$. Thus the four-dimensional density matrix of the photon is

$$\rho_{\mu\nu} = \tfrac12\left(e^{(1)}_\mu e^{(1)}_\nu + e^{(2)}_\mu e^{(2)}_\nu\right) + \tfrac12\xi_1\left(e^{(1)}_\mu e^{(2)}_\nu + e^{(2)}_\mu e^{(1)}_\nu\right) - \tfrac12 i\xi_2\left(e^{(1)}_\mu e^{(2)}_\nu - e^{(2)}_\mu e^{(1)}_\nu\right) + \tfrac12\xi_3\left(e^{(1)}_\mu e^{(1)}_\nu - e^{(2)}_\mu e^{(2)}_\nu\right). \tag{8.17}$$

The convenience of any specific choice of the 4-vectors $e^{(1)}$, $e^{(2)}$ depends on the conditions of the problem concerned.
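The relations of this section between the Stokes parameters, the probabilities (8.5) and the four-dimensional forms (8.15) and (8.17) can be verified numerically. The following sketch is an illustration under stated assumptions: all components are taken as contravariant, the metric is $g = \mathrm{diag}(1, -1, -1, -1)$, the photon moves along the third axis in the frame used for (8.17), and the function names and sample values are hypothetical.

```python
import math

G = [1.0, -1.0, -1.0, -1.0]   # diagonal of the metric tensor


def rho2(xi1, xi2, xi3):
    """Two-dimensional density matrix (8.9) built from the Stokes parameters."""
    return [[0.5 * (1 + xi3), 0.5 * (xi1 - 1j * xi2)],
            [0.5 * (xi1 + 1j * xi2), 0.5 * (1 - xi3)]]


def probability(rho, e):
    """Projection (8.5), rho_ab e_a^* e_b, for a unit polarization vector e."""
    return sum(rho[a][b] * complex(e[a]).conjugate() * e[b]
               for a in range(2) for b in range(2)).real


def rho4(xi, e1, e2):
    """Four-dimensional density matrix (8.17) from two real unit 4-vectors."""
    xi1, xi2, xi3 = xi
    return [[0.5 * (e1[m] * e1[v] + e2[m] * e2[v])
             + 0.5 * xi1 * (e1[m] * e2[v] + e2[m] * e1[v])
             - 0.5j * xi2 * (e1[m] * e2[v] - e2[m] * e1[v])
             + 0.5 * xi3 * (e1[m] * e1[v] - e2[m] * e2[v])
             for v in range(4)] for m in range(4)]


def unpolarized_after_gauge(omega, n):
    """Apply (8.14) with chi^0 = -1/(4 omega), chi^i = k^i/(4 omega^2) to (8.13)."""
    k = [omega] + [omega * c for c in n]
    chi = [-1 / (4 * omega)] + [k[i] / (4 * omega ** 2) for i in (1, 2, 3)]
    rho = [[0.0] * 4 for _ in range(4)]
    for i in (1, 2, 3):
        for j in (1, 2, 3):
            rho[i][j] = 0.5 * ((i == j) - n[i - 1] * n[j - 1])
    return [[rho[m][v] + chi[m] * k[v] + chi[v] * k[m] for v in range(4)]
            for m in range(4)]


if __name__ == "__main__":
    xi = (0.3, -0.5, 0.4)                      # sample Stokes parameters
    s = 1 / math.sqrt(2)
    r = rho2(*xi)
    print(probability(r, [-1j * s, s]))        # right-hand circular, (1 + xi2)/2
    print(probability(r, [s, s]))              # linear at +45 degrees, (1 + xi1)/2
    print(unpolarized_after_gauge(2.0, (0.6, 0.0, 0.8))[0][0])   # -1/2
```

The same functions show that the tensor (8.17) is orthogonal to $k$ and has trace $\rho^\mu{}_\mu = -1$, which follows directly from the conditions (8.16).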
It must be noted that the conditions (8.16) do not uniquely define the choice of $e^{(1)}$ and $e^{(2)}$. If a 4-vector $e_\mu$ satisfies these conditions, then so does any 4-vector $e_\mu + \chi k_\mu$, since $k^2 = 0$. This non-uniqueness occurs because the density matrix is not invariant under gauge transformations.

The first term in (8.17) corresponds to the unpolarized state. According to (8.15), it can therefore be replaced by $-\tfrac12 g_{\mu\nu}$. This change is again equivalent to a certain gauge transformation.

The following formal device is useful in calculations with 4-tensors of the form (8.17) expressed in terms of two independent 4-vectors. We write the tensor (8.17) in the form

$$\rho_{\mu\nu} = \sum_{a,b=1}^{2}\rho^{(ab)}\,e^{(a)}_\mu e^{(b)}_\nu,$$

and the coefficients $\rho^{(ab)}$ as a two-rowed matrix:

$$\rho = \begin{pmatrix}\rho^{(11)} & \rho^{(12)}\\ \rho^{(21)} & \rho^{(22)}\end{pmatrix}.$$

This, like any two-rowed Hermitian matrix, can be written in terms of four independent two-rowed matrices: the Pauli matrices $\sigma_x$, $\sigma_y$, $\sigma_z$ and the unit matrix 1. The result is

$$\rho = \tfrac12(1 + \boldsymbol{\xi}\cdot\boldsymbol{\sigma}),\qquad \boldsymbol{\xi} = (\xi_1, \xi_2, \xi_3), \tag{8.18}$$

as is easily seen by direct comparison with (8.17), using the expressions (18.5) for the Pauli matrices. The combination of the three quantities $\xi_1$, $\xi_2$, $\xi_3$ into a "vector" $\boldsymbol{\xi}$ is, of course, purely formal and is done only for convenience of notation.

PROBLEM

Write the photon density matrix for the case where the coordinate "axes" are the circular unit vectors (8.2).

SOLUTION. The components $\rho'_{\alpha\beta}$ of the tensor relative to the new axes ($\alpha, \beta = \pm 1$) are obtained by projecting the tensor (8.9) on the unit vectors (8.2):

$$\rho'_{11} = \rho_{\alpha\beta}\,e^{(+1)*}_\alpha e^{(+1)}_\beta,\qquad \rho'_{1\,-1} = \rho_{\alpha\beta}\,e^{(+1)*}_\alpha e^{(-1)}_\beta,\ \ldots;\qquad \rho' = \frac12\begin{pmatrix} 1+\xi_2 & -\xi_3 + i\xi_1\\ -\xi_3 - i\xi_1 & 1-\xi_2\end{pmatrix}.$$

§ 9. A two-photon system

By arguments similar to those in §6, we can calculate the number of possible states in a more complicated case, that of a system of two photons (L. Landau, 1948).
We shall consider the photons in their centre-of-mass system; their momenta are $\mathbf{k}_1 = -\mathbf{k}_2 = \mathbf{k}$.† The wave function of the two-photon system (in the momentum representation) can be written as a three-dimensional tensor of rank two $A_{ik}(\mathbf{n})$, formed by a bilinear combination of the vector wave functions of the two photons; each of the suffixes of this tensor corresponds to one of the photons ($\mathbf{n}$ being a unit vector in the direction of $\mathbf{k}$). The transversality of each photon is expressed by the orthogonality of the tensor $A_{ik}$ to the vector $\mathbf{n}$:

$$A_{il}\,n_l = 0,\qquad A_{lk}\,n_l = 0. \tag{9.1}$$

An interchange of the photons corresponds to an interchange of the suffixes of the tensor $A_{ik}$ and a simultaneous change in the sign of $\mathbf{n}$. Since photons obey Bose statistics, we have

$$A_{ik}(-\mathbf{n}) = A_{ki}(\mathbf{n}). \tag{9.2}$$

The tensor $A_{ik}$ is not in general symmetrical with respect to its suffixes. It can be resolved into symmetric and antisymmetric parts: $A_{ik} = s_{ik} + a_{ik}$. The equation (9.2), and the orthogonality conditions (9.1), must evidently apply to each part separately. Hence we have

$$s_{ik}(-\mathbf{n}) = s_{ik}(\mathbf{n}), \tag{9.3}$$

$$a_{ik}(-\mathbf{n}) = -a_{ik}(\mathbf{n}). \tag{9.4}$$

Inversion of the coordinates does not affect the sign of the components of a tensor of rank two, but changes the sign of $\mathbf{n}$. From (9.3), therefore, the wave function $s_{ik}$ is symmetrical under inversion, i.e. it corresponds to even states of the two-photon system, while the wave function $a_{ik}$ corresponds to odd states.

An antisymmetric tensor of rank two is equivalent (dual) to a certain axial vector $\mathbf{a}$, whose components are given in terms of those of the tensor by $a_i = \tfrac12 e_{ikl}\,a_{kl}$, $e_{ikl}$ being the antisymmetric unit tensor; see Fields, §6. The orthogonality of the tensor $a_{ik}$ and the vector $\mathbf{n}$ implies that the vectors $\mathbf{a}$ and $\mathbf{n}$ are parallel.‡ We can therefore write $\mathbf{a} = \mathbf{n}\phi(\mathbf{n})$, where $\phi$ is a scalar; according to (9.4), we must have $\mathbf{a}(-\mathbf{n}) = -\mathbf{a}(\mathbf{n})$, and therefore $\phi(-\mathbf{n}) = \phi(\mathbf{n})$.
This equation signifies that the scalar $\phi$ can be formed linearly only from spherical harmonic functions of even order $L$ (including order zero).

† This frame of reference always exists except in the case of two photons moving in the same direction. The total momentum $\mathbf{k}_1 + \mathbf{k}_2$ and the total energy $\omega_1 + \omega_2$ of such photons are related in the same way as those of a single photon, and there is therefore no frame of reference in which $\mathbf{k}_1 + \mathbf{k}_2 = 0$.

‡ For $a_{ik} = e_{ikl}\,a_l$, and the orthogonality condition gives $a_{ik}n_k = e_{ikl}\,a_l n_k = (\mathbf{n}\times\mathbf{a})_i = 0$.

We see that the transformation properties of the antisymmetric tensor $a_{ik}$ under rotations are equivalent to those of a single scalar (cf. the second footnote to §6). When the latter is assigned a "spin" zero, the angular momentum of the state is found to be $J = L$. Thus the tensor $a_{ik}$ corresponds to odd states of a photon system with even angular momentum $J$.

Let us now consider the symmetric tensor $s_{ik}$. Since this is unaltered when $\mathbf{n}$ changes sign, it corresponds to even states of the photon system. Hence all the components $s_{ik}$ can be expressed in terms of spherical harmonic functions of even order $L$ (including zero).

It is well known that any symmetric tensor $s_{ik}$ of rank two can be expressed as the sum of a scalar $s_{ii}$ and a symmetric tensor $s'_{ik}$ with zero trace ($s'_{ii} = 0$).

The scalar $s_{ii}$ can be assigned a "spin" zero, and the angular momentum of the corresponding states is therefore $J = L$, i.e. is even.

The tensor $s'_{ik}$ has "spin" two (see QM, §57). Adding this "spin" to the even "orbital angular momentum" $L$ by the law of addition of angular momenta, we find that for a given even $J \neq 0$ three states are possible (with $L = J \pm 2,\ J$), and for odd $J \neq 1$ two states (with $L = J \pm 1$). The exceptions are $J = 0$ with one state ($L = 2$) and $J = 1$ with one state ($L = 2$).

In these calculations, however, we have not yet included the condition that the tensor $s_{ik}$ is orthogonal to the vector $\mathbf{n}$.
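The bookkeeping of this counting, together with the subtraction of the states "parallel" to $\mathbf{n}$ that the text goes on to describe (a vector of "spin" one built from harmonics of odd order $L$), can be sketched as follows. This is an illustration only; the function names are hypothetical, and the only rule used is the triangle rule of the vector model with a restriction on the parity of $L$.

```python
def n_states(J, spin, parity_of_L):
    """Ways of reaching total J from a 'spin' and an 'orbital' L of given parity (0 even, 1 odd)."""
    return sum(1 for L in range(abs(J - spin), J + spin + 1) if L % 2 == parity_of_L)


def two_photon_states(J):
    """Return (even, odd) numbers of two-photon states with total angular momentum J."""
    odd = n_states(J, 0, 0)            # a_ik: equivalent to a scalar, even L
    even = (n_states(J, 0, 0)          # trace s_ii: 'spin' 0, even L
            + n_states(J, 2, 0)        # traceless s'_ik: 'spin' 2, even L
            - n_states(J, 1, 1))       # tensor parallel to n: 'spin' 1, odd L
    return even, odd


if __name__ == "__main__":
    for J in range(6):
        print(J, two_photon_states(J))
```

The output gives one even and one odd state for $J = 0$, none for $J = 1$, two even and one odd for every other even $J$, and one even state for every odd $J \geq 3$.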
We must therefore subtract, from the numbers of states found above, the numbers of states corresponding to a symmetric tensor of rank 2 "parallel" to the vector $\mathbf{n}$. Such a tensor, which we denote by $s''_{ik}$, can be written as

$$s''_{ik} = n_i b_k + n_k b_i,$$

where $\mathbf{b}$ is a certain vector. According to (9.3), this vector must be such that $\mathbf{b}(-\mathbf{n}) = -\mathbf{b}(\mathbf{n})$. Thus the tensor $s''_{ik}$ which gives the "unwanted" states is equivalent to an odd vector. The latter must be expressible in terms only of spherical harmonics of odd order $L$. Moreover, the vector has a "spin" one, and therefore, for any even angular momentum $J \neq 0$, two states are possible (with $L = J \pm 1$), and for any odd $J$ one state (with $L = J$); an exception is $J = 0$ with one state ($L = 1$).

Summarizing the results obtained, we obtain the following table giving the numbers of possible even and odd states of a two-photon system (with zero total momentum) for various values of the total angular momentum $J$:

$$\begin{array}{c|cccc}
J & 0 & 1 & 2k & 2k+1\\ \hline
\text{even} & 1 & 0 & 2 & 1\\
\text{odd} & 1 & 0 & 1 & 0
\end{array} \tag{9.5}$$

where $k$ is any positive integer (not zero). We see that for odd $J$ there are no odd states, and the value $J = 1$ cannot occur.†

† Another way of deriving these results is given in §70, Problem 1.

The wave function $A_{ik}$ of the two-photon system determines the correlation between the polarizations of the photons. The probability that both photons simultaneously have definite polarizations $\mathbf{e}_1$ and $\mathbf{e}_2$ is proportional to $A_{ik}\,e^{*}_{1i}e^{*}_{2k}$. Thus, if the polarization $\mathbf{e}_1$ of one photon is given, the polarization $\mathbf{e}_2$ of the other is

$$e_{2k} \propto A_{ik}\,e^{*}_{1i}. \tag{9.6}$$

In odd states of the system, $A_{ik}$ is equal to the antisymmetric tensor $a_{ik}$, and

$$\mathbf{e}_2\cdot\mathbf{e}^{*}_1 \propto a_{ik}\,e^{*}_{1i}e^{*}_{1k} = 0,$$

so that the polarizations of the two photons are orthogonal. For linear polarization this means that the directions of polarization are perpendicular; for circular polarization, that the directions of rotation are opposite.

An even state with $J = 0$ corresponds to a symmetric tensor which reduces to a scalar, $s_{ik} = \text{constant}\times(\delta_{ik} - n_i n_k)$.
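The rule (9.6) can be tried directly on the two simplest tensors: the antisymmetric tensor $a_{ik} = e_{ikl}n_l$ (an odd state with constant $\phi$) and the scalar tensor $\delta_{ik} - n_i n_k$ just written down. The sketch below is an illustration with hypothetical function names, with $\mathbf{n}$ taken along the third axis and the proportionality constant in (9.6) set equal to one.

```python
import math


def levi_civita(i, j, k):
    """Antisymmetric unit tensor for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2


def odd_tensor(n):
    """a_ik = e_ikl n_l, antisymmetric and orthogonal to n."""
    return [[sum(levi_civita(i, k, l) * n[l] for l in range(3)) for k in range(3)]
            for i in range(3)]


def even_scalar_tensor(n):
    """s_ik = delta_ik - n_i n_k, the even J = 0 tensor."""
    return [[(i == k) - n[i] * n[k] for k in range(3)] for i in range(3)]


def partner_polarization(A, e1):
    """Rule (9.6): e2_k = A_ik e1_i^* (normalization left aside)."""
    return [sum(A[i][k] * complex(e1[i]).conjugate() for i in range(3))
            for k in range(3)]


def dot_with_conjugate(u, v):
    """Product u . v^* of two three-component complex vectors."""
    return sum(x * complex(y).conjugate() for x, y in zip(u, v))


if __name__ == "__main__":
    n = (0, 0, 1)
    r = 1 / math.sqrt(2)
    circular = [-1j * r, r, 0]          # e^(+1) of (8.2) in the xi, eta, zeta axes
    print(dot_with_conjugate(partner_polarization(odd_tensor(n), circular), circular))
    print(partner_polarization(even_scalar_tensor(n), circular))
```

In the odd state the product $\mathbf{e}_2\cdot\mathbf{e}^{*}_1$ vanishes for any transverse $\mathbf{e}_1$, while in the even $J = 0$ state the components of $\mathbf{e}_2$ come out as the complex conjugates of those of $\mathbf{e}_1$.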
From (9.6), therefore, we have $\mathbf{e}_2 = \mathbf{e}^{*}_1$. For linear polarization this means that the directions of polarization are parallel; for circular polarization, that the directions of rotation are again opposite. The latter result is obvious, since when $J = 0$ the sum of the components of the photon angular momenta in the same direction $\mathbf{k}$ must always be zero, so that the components in the opposite directions $\mathbf{k}_1$ and $\mathbf{k}_2$, i.e. the helicities, are equal.

CHAPTER II

BOSONS

§ 10. The wave equation for particles with spin zero

It has been shown in Chapter I how a quantum description of the free electromagnetic field can be constructed on the basis of the known properties of the field in the classical limit and the concepts of ordinary quantum mechanics. The resulting scheme for describing the field as a system of photons contains many features which occur also in the relativistic quantum theory description of particles.

The electromagnetic field is a system having an infinite number of degrees of freedom. For this system there is no law of conservation of number of particles (photons), and its possible states include states with an arbitrary number of particles.† In the relativistic theory, systems composed of any particles must in general share this property. The conservation of number of particles in the non-relativistic theory depends on the law of conservation of mass: the sum of the (rest) masses of the particles is unaffected by their interactions, and the constancy of the total mass in a system of electrons, say, implies that the number of electrons is also unchanged. In relativistic mechanics, however, there is no law of conservation of mass; only the total energy of the system is conserved, which includes the rest energy of the particles. The number of particles therefore need not be conserved, and consequently every relativistic theory of particles must be a theory of systems having an infinite number of degrees of freedom.
That is to say, any such theory of particles must be a field theory.

The second quantization formalism (QM, §§64, 65) is a satisfactory means of describing systems with a variable number of particles. In the quantum description of the electromagnetic field, the second quantization operator is the 4-potential $\hat A$. This is expressed in terms of the (coordinate) wave functions of the individual particles (photons) and their creation and annihilation operators. The quantized wave function operator has a similar role in the description of a system of particles. To derive this operator, we must first know the form of the wave function of a single free particle and the equation satisfied by this function.

The concept of a field of free particles is, it must be emphasized, only an aid to the theory. Actual particles interact, and the task of the theory is to consider these interactions. But any interaction is equivalent to a collision, before and after which the system may be regarded as an ensemble of free particles. It has been remarked in §1 that the only measurable objects are of this kind. We therefore use the fields of free particles as a means of describing the initial and final states.

† In reality, of course, the number of photons changes only as a result of various interaction processes.

Let us first consider the relativistic description of free particles having spin zero. This case is mathematically simple, and illustrates most clearly the basic ideas and typical features of the description.

The state of a free particle (with spin zero) can be completely defined by specifying its momentum $\mathbf p$ only. The energy $\varepsilon$ of the particle† is given by $\varepsilon^2 = \mathbf p^2 + m^2$ (where $m$ is the mass of the particle) or, in four-dimensional form,

$$p^2 = m^2. \qquad (10.1)$$

The laws of conservation of momentum and energy are well known to be related to the homogeneity of space and time, i.e. to the symmetry with respect to any parallel displacement of the 4-coordinate system.
In the quantum description, this requirement of symmetry means that, under such a transformation of the 4-coordinates, the wave function of a particle having a given 4-momentum must be multiplied by a phase factor (of unit modulus). This can be true only for an exponential function, with the exponent linear in the 4-coordinates. Thus the wave function of the state of a free particle with a given 4-momentum $p^\mu = (\varepsilon, \mathbf p)$ must be a plane wave:

$$\text{constant} \times e^{-ipx}, \qquad px = \varepsilon t - \mathbf p \cdot \mathbf r; \qquad (10.2)$$

the choice of sign of the exponent in the relativistic theory itself is arbitrary, and is here made in accordance with the non-relativistic case.

The wave equation must have the functions (10.2) as particular solutions for any 4-vector $p$ which satisfies the condition (10.1). It must be linear, on account of the principle of superposition: any linear combination of the functions (10.2) also describes a possible state of the particle, and must therefore also be a solution. Finally, the equation must be of the lowest possible order; any higher order would bring in redundant solutions.

The spin is the angular momentum of the particle in a frame of reference in which the particle is at rest. If the spin of the particle is $s$, its wave function in the rest frame is a three-dimensional spinor of rank $2s$. To describe the particle in an arbitrary frame of reference, its wave function must be expressed in terms of four-dimensional quantities.

A particle with spin zero is described in the rest frame by a three-dimensional scalar. This scalar, however, can have more than one four-dimensional "origin": either as a four-dimensional scalar $\psi$, or as the fourth component of a (time-like) 4-vector $\psi_\mu$ of which only the component $\psi_0$ is non-zero in the rest frame.‡

For a free particle, the only operator that can appear in the wave equation is the 4-momentum operator $p$. Its components are the operators of differentiation with respect to coordinates and time:

$$p^\mu = i\partial^\mu = \left( i\,\frac{\partial}{\partial t},\; -i\nabla \right).$$
(10.3)

† We denote the energy of a single particle by $\varepsilon$, to distinguish it from the energy $E$ of a system of particles.

‡ Or, similarly, as the time component of a 4-tensor of higher order; but this would lead to higher-order equations.

The wave equation must be a differential relationship between the quantities $\psi$ and $\psi_\mu$ through the operator $p$. This relationship must, of course, be given by relativistically invariant expressions. Such expressions are

$$m\psi_\mu = p_\mu \psi, \qquad p^\mu \psi_\mu = m\psi, \qquad (10.4)$$

where $m$ is a dimensional constant characteristic of the particle.†

Substituting $\psi_\mu$ from the first equation (10.4) in the second equation, we obtain

$$(p^2 - m^2)\psi = 0 \qquad (10.5)$$

(O. Klein, and V. A. Fock, 1926; W. Gordon, 1927). The explicit form of this equation is

$$-\partial_\mu \partial^\mu \psi \equiv \left( -\frac{\partial^2}{\partial t^2} + \Delta \right)\psi = m^2 \psi. \qquad (10.6)$$

Substitution of $\psi$ as the plane wave (10.2) gives $p^2 = m^2$, from which it is evident that $m$ is the mass of the particle. We may note that the form of equation (10.5) is in any case obvious a priori, since $p^2$ is the only scalar operator which can be derived from $p$ (and, for the same reason, a similar equation is satisfied by every component of the wave function of a particle having any spin value, as will be seen on several occasions below).

Thus a particle with spin zero is essentially described by a single (four-dimensional) scalar $\psi$, which satisfies the second-order equation (10.5). In the first-order equations (10.4), the wave function is represented by the set of quantities $\psi$ and $\psi_\mu$, the 4-vector $\psi_\mu$ being the 4-gradient of the scalar $\psi$. In the rest frame, the wave function of the particle is independent of the (space) coordinates, and the space components of the 4-vector $\psi_\mu$ are therefore zero, as they should be.
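The statement that the plane wave (10.2) solves (10.6) whenever $\varepsilon^2 = \mathbf p^2 + m^2$ can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes one space dimension and units with $\hbar = c = 1$, and the function names and numerical values are purely illustrative.

```python
import cmath
import math


def plane_wave(t, x, p, m):
    """Plane wave exp(-i(eps*t - p*x)) of (10.2), restricted to one space dimension."""
    eps = math.sqrt(p * p + m * m)  # positive root of eps^2 = p^2 + m^2
    return cmath.exp(-1j * (eps * t - p * x))


def second_derivative(f, s, h=1e-3):
    """Central finite-difference estimate of f''(s)."""
    return (f(s + h) - 2.0 * f(s) + f(s - h)) / (h * h)


def klein_gordon_residual(t, x, p, m):
    """Return (-d2/dt2 + d2/dx2 - m^2) psi at (t, x); close to zero for a solution of (10.6)."""
    psi = plane_wave(t, x, p, m)
    d2t = second_derivative(lambda s: plane_wave(s, x, p, m), t)
    d2x = second_derivative(lambda s: plane_wave(t, s, p, m), x)
    return -d2t + d2x - m * m * psi


# Illustrative values only; any real p and m > 0 should behave the same way.
residual = klein_gordon_residual(t=0.3, x=-1.2, p=0.7, m=1.5)
print(abs(residual))
```

The residual is limited only by the finite-difference step. If the energy in `plane_wave` were replaced by a value off the mass shell, the residual would instead be of order $(\varepsilon^2 - p^2 - m^2)\,|\psi|$.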
In order to continue with the second quantization procedure, it is useful to express the energy and momentum of the particle as the space integrals of certain combinations bilinear in $\psi$ and $\psi^*$, which represent a kind of space density of these quantities. We thus have to find an energy-momentum tensor $T_{\mu\nu}$ which corresponds to equation (10.5). In terms of this tensor the law of conservation of energy and momentum is expressed by the equation

$$\partial_\mu T^\mu_{\ \nu} = 0. \qquad (10.7)$$

Following the general procedure of field theory (see Fields, §32), we write down a variational principle which would lead to equation (10.5). This principle must be that the "action integral"

$$S = \int L\, d^4x \qquad (10.8)$$

of some real 4-scalar $L$, the Lagrangian density of the field,† should take a minimum value.

† The constants $m$ are shown in (10.4) so that $\psi_\mu$ and $\psi$ shall have the same dimensions. There would be no point in using different constants $m_1$ and $m_2$ in the two equations, since they could always be made the same by redefining $\psi$ or $\psi_\mu$.

Using the scalar $\psi$ (and the operator $\partial^\mu$), we can construct a real bilinear scalar expression of the form

$$L = \partial_\mu \psi^* \cdot \partial^\mu \psi - m^2 \psi^* \psi, \qquad (10.9)$$

where $m$ is a dimensional constant. Regarding $\psi$ and $\psi^*$ as independent variables describing the field ("generalized field coordinates" $q$), we easily see that Lagrange's equation

$$\frac{\partial}{\partial x^\mu} \frac{\partial L}{\partial q_{,\mu}} = \frac{\partial L}{\partial q} \qquad (10.10)$$

(where $q_{,\mu} \equiv \partial_\mu q$) is in fact the same as the equation (10.5) for $\psi$ and $\psi^*$, $m$ being the mass of the particle. The sign of the expression (10.9) has been taken such that the square of the time derivative, $|\partial\psi/\partial t|^2$, appears in $L$ with a positive sign; otherwise, the action could not take a minimum value (cf. Fields, §27). The choice of the numerical factor in $L$ is arbitrary (and affects only the normalization factor in $\psi$).

The energy-momentum tensor can now be calculated from the formula

$$T_\mu^{\ \nu} = \sum q_{,\mu} \frac{\partial L}{\partial q_{,\nu}} - L\,\delta_\mu^\nu, \qquad (10.11)$$

the summation being over all $q$.
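As a check on the Lagrangian (10.9), Lagrange's equation (10.10) can be worked out explicitly for the choice $q = \psi^*$. The steps below are a sketch that uses only (10.3), (10.9) and (10.10), with $\psi$ and $\psi^*$ treated as independent variables:

```latex
\begin{align*}
\frac{\partial L}{\partial(\partial_\mu \psi^*)} &= \partial^\mu \psi, &
\frac{\partial L}{\partial \psi^*} &= -m^2 \psi, \\
\partial_\mu \partial^\mu \psi &= -m^2 \psi
\quad\Longleftrightarrow\quad
(-\partial_\mu \partial^\mu - m^2)\psi = (p^2 - m^2)\psi = 0 .
\end{align*}
```

The choice $q = \psi$ gives, in the same way, the complex-conjugate equation for $\psi^*$.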
Substitution of (10.9) gives

$$T_{\mu\nu} = \partial_\mu \psi^* \cdot \partial_\nu \psi + \partial_\nu \psi^* \cdot \partial_\mu \psi - L g_{\mu\nu}; \qquad (10.12)$$

these quantities are real (as they should be), since $L$ is real. In particular,

$$T_{00} = \left| \frac{\partial\psi}{\partial t} \right|^2 + |\nabla\psi|^2 + m^2 |\psi|^2, \qquad (10.13)$$

$$T_{i0} = \frac{\partial\psi^*}{\partial t} \frac{\partial\psi}{\partial x^i} + \frac{\partial\psi}{\partial t} \frac{\partial\psi^*}{\partial x^i}. \qquad (10.14)$$

The 4-momentum of the field is given by the integral

$$P_\mu = \int T_{\mu 0}\, d^3x, \qquad (10.15)$$

i.e. $T_{00}$ and $T_{i0}$ act as the energy and momentum densities. The quantity $T_{00}$ is essentially positive.

† The corresponding second-quantized operator $\hat L$ is called the Lagrangian of the field. To simplify the terminology, we shall use this term for either the "quantized" or the "non-quantized" Lagrangian density, as convenient.

Formula (10.13) can be used for the normalization of the wave function. A plane wave, normalized to "one particle in the volume $V = 1$", is

$$\psi_p = \frac{1}{\sqrt{2\varepsilon}}\, e^{-ipx}, \qquad (10.16)$$

since for this function $T_{00} = \varepsilon$, and the total energy in the volume $V = 1$ is therefore equal to the energy of a single particle.

The angular momentum, whose conservation is due to the isotropy of space, can also be expressed as a space integral, but we shall not need this representation.

There is one further conservation law allowed by equations (10.4) in addition to those arising directly from space-time symmetry. It is easily seen that these equations and those for $\psi^*$ lead to the equation

$$\partial_\mu j^\mu = 0, \qquad (10.17)$$

where

$$j_\mu = m(\psi^* \psi_\mu + \psi \psi_\mu^*) = i[\psi^* \partial_\mu \psi - (\partial_\mu \psi^*)\psi]. \qquad (10.18)$$

Thus $j^\mu$ acts as a current density 4-vector, and (10.17) is the equation of continuity expressing the law of conservation of the quantity

$$Q = \int j_0\, d^3x, \qquad (10.19)$$

where

$$j_0 = i\left( \psi^* \frac{\partial\psi}{\partial t} - \psi \frac{\partial\psi^*}{\partial t} \right). \qquad (10.20)$$

It should be noted that $j_0$ need not be positive. This shows that it cannot in general be interpreted as the probability density of spatial localization of the particle. The significance of the conservation law expressed by equation (10.17) will be shown in §11.

§ 11.
Particles and antiparticles

In accordance with the general procedure of the second quantization method, we have to consider the expansion of an arbitrary wave function in terms of the eigenfunctions of a complete set of possible states of a free particle, for instance in plane waves $\psi_p$:

$$\psi = \sum_{\mathbf p} a_{\mathbf p} \psi_p, \qquad \psi^* = \sum_{\mathbf p} a^*_{\mathbf p} \psi^*_p.$$

The coefficients $a_{\mathbf p}$, $a^*_{\mathbf p}$ are then to be regarded as the annihilation and creation operators $\hat a_{\mathbf p}$, $\hat a^+_{\mathbf p}$ of particles in the corresponding states.†

Here, however, we immediately encounter a difference of principle as compared with the non-relativistic theory. In a plane wave which is a solution of equation (10.5), the energy $\varepsilon$ need satisfy (for a given momentum $\mathbf p$) only the condition $\varepsilon^2 = \mathbf p^2 + m^2$, i.e. it can have two values, $\pm\sqrt{\mathbf p^2 + m^2}$. Only positive values of $\varepsilon$ can have the physical significance of the energy of a free particle. But the negative values cannot be simply omitted: the general solution of the wave equation can be obtained only by superposing all its independent particular solutions. This shows that the interpretation of the expansion coefficients for $\psi$ and $\psi^*$ in the second quantization method must be somewhat different.

We may write the expansion in the form

$$\psi = \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}}\, a^{(+)}_{\mathbf p}\, e^{i(\mathbf p \cdot \mathbf r - \varepsilon t)} + \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}}\, a^{(-)}_{\mathbf p}\, e^{i(\mathbf p \cdot \mathbf r + \varepsilon t)}, \qquad (11.1)$$

where the first sum contains plane waves with positive "frequency", normalized according to (10.16), and the second sum contains those with negative "frequency", $\varepsilon$ always denoting the positive quantity $+\sqrt{\mathbf p^2 + m^2}$. In the second quantization, the coefficients $a^{(+)}_{\mathbf p}$ in the first sum are replaced as usual by the particle annihilation operators $\hat a_{\mathbf p}$. In the second sum, we note that, in the subsequent derivation of the matrix elements, the time dependence of the terms will correspond to particle creation, not annihilation: the factor $e^{i\varepsilon t} = (e^{-i\varepsilon t})^*$ corresponds to one extra particle with energy $\varepsilon$ in the final state (cf. the end of §2).
Accordingly, the coefficients $a^{(-)}_{\mathbf p}$ are replaced by creation operators $\hat b^+_{\mathbf p}$ relating to other particles. If the summation variable $\mathbf p$ in the second sum in (11.1) is replaced by $-\mathbf p$ in order to put the exponential factor in the form $e^{-i(\mathbf p \cdot \mathbf r - \varepsilon t)}$, the $\psi$-operators are obtained as

$$\hat\psi = \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}} \left( \hat a_{\mathbf p}\, e^{-ipx} + \hat b^+_{\mathbf p}\, e^{ipx} \right), \qquad \hat\psi^+ = \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}} \left( \hat a^+_{\mathbf p}\, e^{ipx} + \hat b_{\mathbf p}\, e^{-ipx} \right). \qquad (11.2)$$

Thus all the operators $\hat a_{\mathbf p}$, $\hat b_{\mathbf p}$ are multiplied by functions with the "correct" time dependence ($\sim e^{-i\varepsilon t}$), while the operators $\hat a^+_{\mathbf p}$, $\hat b^+_{\mathbf p}$ are multiplied by the complex conjugate functions. This makes it possible to interpret the former operators, in accordance with the general rules, as annihilation operators for particles with momentum $\mathbf p$ and energy $\varepsilon$, and the latter as creation operators for these particles.

In this way we arrive at the concept of particles of two types which occur simultaneously and on an equal footing. These are called particles and antiparticles; the significance of the names will be shown later. One type corresponds to the operators $\hat a_{\mathbf p}$, $\hat a^+_{\mathbf p}$ in the second quantization formalism, and the other type to $\hat b_{\mathbf p}$, $\hat b^+_{\mathbf p}$. The two types of particle have the same mass, since their operators appear in the same $\psi$-operator.

† The $\psi$ function is given the 4-momentum $p$ as suffix, since we intend to denote the functions with "negative frequency" by $\psi_{-p}$. The operators $\hat a$ and $\hat a^+$ are given the three-dimensional momentum $\mathbf p$ as suffix, since this entirely defines the state of an actual particle.

The reason for these results can also be examined from the point of view of the requirements of relativistic invariance.

The Lorentz transformations are, mathematically, rotations of the four-dimensional coordinate system which change the direction of the time axis; together with the purely spatial rotations which do not affect the time axis, they form the Lorentz group of transformations.†
All the Lorentz transformations have the property that they leave the $t$ axis within the corresponding light cone, and this expresses the physical principle that there exists a maximum possible velocity of propagation of signals.

In a purely mathematical sense, the simultaneous change of sign of all four coordinates (four-dimensional inversion) is also a rotation, since the determinant of this transformation is $+1$, like that of any rotational transformation. The time axis is thereby carried from one light cone to the other. Although this means that such a transformation is physically impossible (as a transformation of the frame of reference), the only difference mathematically is that, because the metric is pseudo-Euclidean, such a rotation cannot be effected continuously without allowing also a complex transformation of the coordinates. It is reasonable to suppose that this difference is unimportant in relation to four-dimensional invariance. Then any expression which is invariant under the Lorentz transformations must be invariant under 4-inversion also. A precise statement of this condition as applied to the scalar $\psi$-operator will be given in §13, but here it may be noted that the condition will certainly make necessary the simultaneous presence in the $\psi$-operators of terms having both signs of $\varepsilon$ in the exponents, since this sign is changed by the substitution $t \to -t$.

Let us return now to equations (11.2) and derive the commutation relations between the operators $\hat a_{\mathbf p}$, $\hat a^+_{\mathbf p}$ (and $\hat b_{\mathbf p}$, $\hat b^+_{\mathbf p}$). For photons (the operators $\hat c_{\mathbf p}$, $\hat c^+_{\mathbf p}$), this was done on the basis of the analogy with oscillators, that is, essentially from the properties of the electromagnetic field in the classical limit. Here there is no such analogy. In deriving the (Bose or Fermi) commutation rules between the operators, we can be guided only by the form of the Hamiltonian constructed from these operators.
This Hamiltonian is obtained (see QM, §64) by substituting $\hat\psi$ and $\hat\psi^+$ in place of $\psi$ and $\psi^*$ in the integral $\int T_{00}\, d^3x$.‡ We then find

$$\hat H = \sum_{\mathbf p} \varepsilon \left( \hat a^+_{\mathbf p} \hat a_{\mathbf p} + \hat b_{\mathbf p} \hat b^+_{\mathbf p} \right). \qquad (11.3)$$

It is easily seen that a reasonable result is obtained for the eigenvalues of this Hamiltonian only if the operators satisfy the Bose commutation rules:

$$\{\hat a_{\mathbf p}, \hat a^+_{\mathbf p}\}_- = \{\hat b_{\mathbf p}, \hat b^+_{\mathbf p}\}_- = 1 \qquad (11.4)$$

(all other pairs of operators commute, including each particle operator $\hat a_{\mathbf p}$, $\hat a^+_{\mathbf p}$ with each antiparticle operator $\hat b_{\mathbf p}$, $\hat b^+_{\mathbf p}$). For, in this case,

$$\hat H = \sum_{\mathbf p} \varepsilon \left( \hat a^+_{\mathbf p} \hat a_{\mathbf p} + \hat b^+_{\mathbf p} \hat b_{\mathbf p} + 1 \right).$$

† The set of all three-dimensional (spatial) rotations is itself a group, which constitutes a subgroup of the Lorentz group. The set of the Lorentz transformations is not itself a group, since the result of successive Lorentz transformations may be a purely spatial rotation.

‡ In the non-relativistic theory, the conjugate operator $\hat\psi^+$ is by convention written to the left of $\hat\psi$. Here, the order is of no importance, since the interchange of $\hat\psi$ and $\hat\psi^+$ would cause only the interchange of the equivalent operators $\hat a_{\mathbf p}$ and $\hat b_{\mathbf p}$. However, once a particular order has been selected, the same order must be used throughout.

The eigenvalues of the products $\hat a^+_{\mathbf p} \hat a_{\mathbf p}$ and $\hat b^+_{\mathbf p} \hat b_{\mathbf p}$ are non-negative integers $N_{\mathbf p}$ and $\bar N_{\mathbf p}$, the numbers of particles and antiparticles. The infinite additive constant $\sum \varepsilon$ (the "energy of the vacuum") may again be simply omitted:

$$E = \sum_{\mathbf p} \varepsilon \left( N_{\mathbf p} + \bar N_{\mathbf p} \right); \qquad (11.5)$$

cf. formula (3.1) and the footnote to it. This expression is essentially positive, and corresponds to the idea of two types of actual particles. Similarly, we have for the total momentum of the system

$$\mathbf P = \sum_{\mathbf p} \mathbf p \left( N_{\mathbf p} + \bar N_{\mathbf p} \right). \qquad (11.6)$$

If, instead of (11.4), we used the Fermi commutation rules (anticommutators instead of commutators), we should obtain

$$\hat H = \sum_{\mathbf p} \varepsilon \left( \hat a^+_{\mathbf p} \hat a_{\mathbf p} - \hat b^+_{\mathbf p} \hat b_{\mathbf p} + 1 \right),$$

and instead of (11.5) the physically meaningless expression $\sum \varepsilon (N_{\mathbf p} - \bar N_{\mathbf p})$, which is not positive-definite and hence cannot represent the energy of a system of free particles. Particles with spin zero are therefore bosons.

Next, let us consider the integral $Q$ (10.19).
Replacing the functions $\psi$ and $\psi^*$ in $j^0$ by the operators $\hat\psi$ and $\hat\psi^+$, and carrying out the integration, we obtain

$$\hat Q = \sum_{\mathbf p} \left( \hat a^+_{\mathbf p} \hat a_{\mathbf p} - \hat b_{\mathbf p} \hat b^+_{\mathbf p} \right) = \sum_{\mathbf p} \left( \hat a^+_{\mathbf p} \hat a_{\mathbf p} - \hat b^+_{\mathbf p} \hat b_{\mathbf p} - 1 \right). \qquad (11.7)$$

The eigenvalues of this operator are (omitting the unimportant additive constant $\sum 1$)

$$Q = \sum_{\mathbf p} \left( N_{\mathbf p} - \bar N_{\mathbf p} \right), \qquad (11.8)$$

and are therefore equal to the differences between the total numbers of particles and antiparticles.

So long as we are discussing free particles and ignoring any interaction between them, the law of conservation of the quantity $Q$ is, of course, largely conventional (like those of total energy (11.5) and total momentum (11.6)): what is actually conserved is not only the sum $Q$ but the numbers $N_{\mathbf p}$, $\bar N_{\mathbf p}$ individually. The nature of the interaction decides whether the quantity $Q$ is conserved. If $Q$ is conserved (i.e. if the operator $\hat Q$ commutes with the Hamiltonian of the interaction), the formula (11.8) shows the limitation imposed by the conservation law on the possible variation of the number of particles: only "particle-antiparticle" pairs can be formed or disappear.

If a particle is electrically charged, its antiparticle must have a charge of the opposite sign: if both had charges of the same sign, the creation or annihilation of the particle-antiparticle pair would contravene a rigorous law of nature, the conservation of total electric charge. We shall see later (§32) how the theory automatically leads to this oppositeness of the charges (for interactions of particles with an electromagnetic field).

The quantity $Q$ is sometimes called the charge of the field of the particles concerned. For electrically charged particles $Q$ gives, in particular, the total electric charge of the system in terms of the unit charge $e$. But particles and antiparticles may also be electrically neutral.
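The bookkeeping in (11.5), (11.6) and (11.8) can be illustrated with a short calculation. The sketch below is an illustration only: the dictionary layout, the function names and the numerical momenta are assumptions made for the example, and units with $c = 1$ are used. It evaluates $E$, $\mathbf P$ and $Q$ from given occupation numbers and shows that adding one particle-antiparticle pair changes $E$ but leaves $Q$ unchanged.

```python
import math


def energy(p, m):
    """Positive single-particle energy eps = +sqrt(p^2 + m^2) for a 3-momentum tuple p."""
    return math.sqrt(sum(c * c for c in p) + m * m)


def totals(occupation, m):
    """Evaluate (11.5), (11.6) and (11.8) for a dict mapping p -> (N_p, Nbar_p)."""
    E = 0.0
    P = [0.0, 0.0, 0.0]
    Q = 0
    for p, (n, nbar) in occupation.items():
        E += energy(p, m) * (n + nbar)                      # (11.5), vacuum constant dropped
        P = [P[i] + p[i] * (n + nbar) for i in range(3)]    # (11.6)
        Q += n - nbar                                       # (11.8)
    return E, P, Q


# Hypothetical state: two particles moving along +z, one antiparticle along -z.
state = {(0.0, 0.0, 1.0): (2, 0), (0.0, 0.0, -1.0): (0, 1)}

# The same state with one extra particle-antiparticle pair of zero total momentum.
pair_added = dict(state)
pair_added[(1.0, 0.0, 0.0)] = (1, 0)
pair_added[(-1.0, 0.0, 0.0)] = (0, 1)

E0, P0, Q0 = totals(state, m=1.0)
E1, P1, Q1 = totals(pair_added, m=1.0)
print(Q0, Q1, E1 - E0)
```

Creating a single particle without its antiparticle would instead change $Q$ by one unit, which is what the conservation law forbids.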
Thus we see that the nature of the relativistic relation between the energy and the momentum (the twofold root of the equation $\varepsilon^2 = \mathbf p^2 + m^2$), together with the requirements of relativistic invariance, leads in the quantum theory to a new principle of classification of particles: there can exist pairs of different particles (particle and antiparticle) which are interrelated in the way described above. This remarkable prediction was first made (for particles with spin $\tfrac12$) by Dirac in 1930, before the discovery of the first antiparticle, the positron.†

§ 12. Strictly neutral particles

In the second quantization of the $\psi$-function (11.1), the coefficients $a^{(+)}_{\mathbf p}$ and $a^{(-)}_{\mathbf p}$ were treated as operators relating to different particles. This is not necessary, however: as a particular case, the annihilation and creation operators in $\hat\psi$ may relate to the same particles, as for photons (cf. (2.17)). Then, denoting these operators by $\hat c_{\mathbf p}$ and $\hat c^+_{\mathbf p}$, we write the $\psi$-operator as

$$\hat\psi = \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}} \left( \hat c_{\mathbf p}\, e^{-ipx} + \hat c^+_{\mathbf p}\, e^{ipx} \right). \qquad (12.1)$$

The field described by this operator corresponds to a system of particles of one kind only, which may be said to be their own antiparticles.

The operator (12.1) is Hermitian ($\hat\psi^+ = \hat\psi$), and in this sense such a field has only half as many "degrees of freedom" as a complex field for which the operators $\hat\psi$ and $\hat\psi^+$ are not the same. In consequence, the field Lagrangian, expressed in terms of the Hermitian operator $\hat\psi$, must contain a further factor $\tfrac12$ in comparison with (10.9):‡

$$L = \tfrac12 \left( \partial_\mu \hat\psi \cdot \partial^\mu \hat\psi - m^2 \hat\psi^2 \right). \qquad (12.2)$$

† The antiparticle concept was extended to bosons by V. Weisskopf and W. Pauli (1934).

‡ This resembles the extra factor $\tfrac12$ in the operator (2.10) of the electromagnetic field energy density (when the field is expressed in terms of the Hermitian operators $\hat{\mathbf E}$ and $\hat{\mathbf H}$), in comparison with the photon energy density (3.2) expressed in terms of the complex wave function; cf. the last footnote to §3.
The corresponding energy-momentum tensor is

$$\hat T_{\mu\nu} = \partial_\mu \hat\psi \cdot \partial_\nu \hat\psi - L g_{\mu\nu}, \qquad (12.3)$$

and hence the energy density operator is

$$\hat T_{00} = (\partial\hat\psi/\partial t)^2 - L = \tfrac12 \left[ (\partial\hat\psi/\partial t)^2 + (\nabla\hat\psi)^2 + m^2 \hat\psi^2 \right]. \qquad (12.4)$$

Substituting (12.1) in the integral $\int T_{00}\, d^3x$, we obtain the field Hamiltonian:

$$\hat H = \tfrac12 \sum_{\mathbf p} \varepsilon \left( \hat c^+_{\mathbf p} \hat c_{\mathbf p} + \hat c_{\mathbf p} \hat c^+_{\mathbf p} \right). \qquad (12.5)$$

This again shows that Bose quantization is necessary:

$$\{\hat c_{\mathbf p}, \hat c^+_{\mathbf p}\}_- = 1, \qquad (12.6)$$

and the energy eigenvalues (again without the additive constant) are

$$E = \sum_{\mathbf p} \varepsilon N_{\mathbf p}. \qquad (12.7)$$

Fermi quantization would lead to the absurd result that $E$ is independent of $N_{\mathbf p}$.

The "charge" $Q$ of this field is zero, as is evident from the fact that $Q$ must change sign when particles are replaced by antiparticles, whereas in the present case there is no difference between the two. The current density 4-vector therefore does not exist, since the expression

$$\hat j_\mu = i\left[ \hat\psi^+ \partial_\mu \hat\psi - (\partial_\mu \hat\psi^+) \hat\psi \right] \qquad (12.8)$$

for the operator $\hat j$ of the conserved 4-vector is zero when $\hat\psi = \hat\psi^+$ (the vector $\hat\psi\, \partial_\mu \hat\psi$ is not itself conserved). This, in turn, means that there is no special conservation law restricting the possible changes in the number of particles. Such particles must clearly be electrically neutral.

Particles of this kind are said to be strictly neutral, as opposed to electrically neutral particles which are not their own antiparticles. Whereas the latter can be annihilated (transformed into photons) only as pairs, strictly neutral particles can be annihilated singly.

The structure of the $\psi$-operator (12.1) is similar to that of the electromagnetic field operators (2.17)-(2.20). In this sense we may say that photons are themselves strictly neutral particles. For the electromagnetic field, the operators are Hermitian because the fields are measurable physical quantities (in the classical limit) and are therefore real. For the $\psi$-operators of particles there is no such relation, since they do not correspond to any quantities that are directly measurable.
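The passage from (12.5) to (12.7), and the remark that Fermi quantization would make $E$ independent of $N_{\mathbf p}$, can each be checked in one line. The following is a sketch; the subscript $+$ for the anticommutator is an assumed extension of the notation $\{\ ,\ \}_-$ used in (12.6).

```latex
\begin{align*}
\text{Bose, } \{\hat c_{\mathbf p}, \hat c^+_{\mathbf p}\}_- = 1:\quad
 & \hat c_{\mathbf p}\hat c^+_{\mathbf p} = \hat c^+_{\mathbf p}\hat c_{\mathbf p} + 1
 \;\Rightarrow\;
 \hat H = \sum_{\mathbf p} \varepsilon \left( \hat c^+_{\mathbf p}\hat c_{\mathbf p} + \tfrac12 \right),
 \quad E = \sum_{\mathbf p} \varepsilon N_{\mathbf p} + \text{const}; \\
\text{Fermi, } \{\hat c_{\mathbf p}, \hat c^+_{\mathbf p}\}_+ = 1:\quad
 & \hat c_{\mathbf p}\hat c^+_{\mathbf p} = 1 - \hat c^+_{\mathbf p}\hat c_{\mathbf p}
 \;\Rightarrow\;
 \hat H = \tfrac12 \sum_{\mathbf p} \varepsilon .
\end{align*}
```

In the Fermi case the operator parts cancel identically, so that the Hamiltonian reduces to a constant that does not depend on the occupation numbers.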
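The absence of a conserved current 4-vector is a general property of strictly neutral particles, and does not require the spin to be zero; for instance, it occurs for photons also. Physically, it expresses the absence of the corresponding prohibitions on a change in the number of particles. There is a direct formal relation between the absence of a conserved current and the fact that the field is real (the operator $\hat\psi$ is Hermitian).

The Lagrangian of a complex field,

$$L = \partial_\mu \hat\psi^+ \cdot \partial^\mu \hat\psi - m^2 \hat\psi^+ \hat\psi, \qquad (12.9)$$

is invariant under multiplication of the $\psi$-operator by any phase factor, i.e. under the gauge transformations

$$\hat\psi \to e^{i\alpha} \hat\psi, \qquad \hat\psi^+ \to e^{-i\alpha} \hat\psi^+. \qquad (12.10)$$

In particular, the Lagrangian is unchanged under the infinitesimal transformation

$$\hat\psi \to \hat\psi + i\,\delta\alpha \cdot \hat\psi, \qquad \hat\psi^+ \to \hat\psi^+ - i\,\delta\alpha \cdot \hat\psi^+. \qquad (12.11)$$

When the "generalized coordinates" $q$ undergo an infinitesimal change, the change in the Lagrangian is

$$\delta L = \sum \left( \frac{\partial L}{\partial q}\, \delta q + \frac{\partial L}{\partial q_{,\mu}}\, \delta q_{,\mu} \right) = \sum \delta q \left( \frac{\partial L}{\partial q} - \frac{\partial}{\partial x^\mu} \frac{\partial L}{\partial q_{,\mu}} \right) + \sum \frac{\partial}{\partial x^\mu} \left( \frac{\partial L}{\partial q_{,\mu}}\, \delta q \right)$$

(with summation over all $q$). The first term is zero, from the "equations of motion" (Lagrange's equations). If the "coordinates" $q$ are taken to be the operators $\hat\psi$ and $\hat\psi^+$, and with $\delta\hat\psi = i\,\delta\alpha \cdot \hat\psi$, $\delta\hat\psi^+ = -i\,\delta\alpha \cdot \hat\psi^+$, we obtain

$$\delta L = i\,\delta\alpha\, \frac{\partial}{\partial x^\mu} \left( \hat\psi\, \frac{\partial L}{\partial \hat\psi_{,\mu}} - \hat\psi^+ \frac{\partial L}{\partial \hat\psi^+_{,\mu}} \right).$$

Hence we see that the condition for the Lagrangian to be invariant ($\delta L = 0$) is equivalent to the equation of continuity ($\partial_\mu j^\mu = 0$) for the 4-vector

$$j^\mu = i\left[ \hat\psi^+ \frac{\partial L}{\partial \hat\psi^+_{,\mu}} - \hat\psi\, \frac{\partial L}{\partial \hat\psi_{,\mu}} \right]. \qquad (12.12)$$

It is easily shown that, with the Lagrangian (12.9), this formula yields the current (12.8). Thus, in the mathematical formalism of the theory, the existence of a conserved current is related to the invariance of the Lagrangian under the gauge transformations (W. Pauli, 1941). The Lagrangian (12.2) of the strictly neutral field does not possess this symmetry.

§ 13. The transformations C, P and T

Unlike 4-inversion, three-dimensional (spatial) inversion is not reducible to any rotations of the 4-coordinate system; its determinant is $-1$, not $+1$.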
The symmetry properties of particles with respect to inversion (the P transformation) are therefore not determined already by considerations of relativistic invariance.†

The inversion operation, as applied to a scalar wave function, is the transformation

$$P\psi(t, \mathbf r) = \pm\psi(t, -\mathbf r), \qquad (13.1)$$

where the plus and minus signs on the right correspond to true scalars and pseudoscalars respectively.

Hence we see that two features of the behaviour of the wave function under inversion must be distinguished. One of these relates to the coordinate dependence of the wave function. In non-relativistic quantum mechanics, only this aspect was considered; it leads to the concept of the parity of the state (which we shall here call the orbital parity), describing the symmetry properties of the motion of the particle. If the state has a definite orbital parity ($+1$ or $-1$), this means that

$$\psi(t, -\mathbf r) = \pm\psi(t, \mathbf r).$$

The other feature is the behaviour of the wave function at a given point (which may conveniently be taken as the origin) under inversion of the coordinate axes. This leads to the concept of the internal parity of the particle. The two signs in (13.1) correspond to internal parity $+1$ and $-1$ (for a particle with spin zero). The total parity of a system of particles is given by the product of their internal parities and the orbital parity of their relative motion.

The "internal" symmetry properties of various particles appear, of course, only in their mutual transformation processes. In non-relativistic quantum mechanics, the analogue of the internal parity is the parity of a bound state of a composite system, such as a nucleus. In the relativistic theory, which makes no essential distinction between composite and elementary particles, this internal parity is no different from the internal parity of those particles which are regarded as elementary in the non-relativistic theory.
In the non-relativistic case, where these particles are regarded as unalterable, their internal symmetry properties are not observable, and a discussion of these would therefore be devoid of physical significance.

In the second quantization formalism, the internal parity is expressed by the behaviour of the $\psi$-operators under inversion. Scalar and pseudoscalar fields correspond to the transformation laws

$$P:\ \hat\psi(t, \mathbf r) \to \pm\hat\psi(t, -\mathbf r). \qquad (13.2)$$

The actual significance of the action of inversion on the $\psi$-operator must be formulated as a particular transformation of the particle annihilation and creation operators, such as to lead to the result (13.2). It is easily seen that such transformations are

$$P:\ \hat a_{\mathbf p} \to \pm\hat a_{-\mathbf p}, \qquad \hat b_{\mathbf p} \to \pm\hat b_{-\mathbf p} \qquad (13.3)$$

(and the same for the conjugate operators). For, on making these changes in the operator

$$\hat\psi(t, \mathbf r) = \sum_{\mathbf p} \frac{1}{\sqrt{2\varepsilon}} \left( \hat a_{\mathbf p}\, e^{-i\varepsilon t + i\mathbf p \cdot \mathbf r} + \hat b^+_{\mathbf p}\, e^{i\varepsilon t - i\mathbf p \cdot \mathbf r} \right) \qquad (13.4)$$

and then changing the notation for the summation variable ($\mathbf p \to -\mathbf p$), we can bring it to the form $\pm\hat\psi(t, -\mathbf r)$. Thus, if $\hat\psi^P(t, \mathbf r)$ denotes the operator after the substitutions (13.3), we have

$$\hat\psi^P(t, \mathbf r) = \pm\hat\psi(t, -\mathbf r). \qquad (13.5)$$

† The Lorentz group together with spatial inversion is called the extended Lorentz group (in contrast to the original group without P, which in this connection is called the proper Lorentz group). The extended group includes all transformations which leave the $t$ axis within the corresponding light cone.

The transformation (13.3) is entirely reasonable, since inversion changes the sign of the polar vector $\mathbf p$, and particles with momentum $\mathbf p$ are therefore replaced by particles with momentum $-\mathbf p$.

In (13.3) the operators $\hat a_{\mathbf p}$ and $\hat b_{\mathbf p}$ are transformed either both with the upper sign or both with the lower sign. In the second quantization formalism, this expresses the fact that particles and antiparticles (with spin zero) have the same internal parity, a result which is evident because they are described by the same (scalar or pseudoscalar) wave functions.
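The relabelling argument that leads from (13.3) to (13.5) can be checked numerically. The sketch below is an illustration under stated assumptions: it works in one space dimension, uses a small finite set of momenta closed under $p \to -p$, and replaces the operators $\hat a_{\mathbf p}$, $\hat b^+_{\mathbf p}$ by random complex numbers, which is enough to test the substitution and the change of summation variable but says nothing about commutation relations. All names and values are hypothetical.

```python
import cmath
import math
import random

M = 1.0                              # particle mass (illustrative value)
MOMENTA = [-2.0, -1.0, 1.0, 2.0]     # 1D momenta, closed under p -> -p


def psi(a, b_dag, t, x):
    """Mode sum (13.4) in one dimension, with c-number stand-ins for the operators."""
    total = 0j
    for p in MOMENTA:
        eps = math.sqrt(p * p + M * M)
        norm = 1.0 / math.sqrt(2.0 * eps)
        total += norm * (a[p] * cmath.exp(-1j * (eps * t - p * x))
                         + b_dag[p] * cmath.exp(1j * (eps * t - p * x)))
    return total


def parity(coeffs, sign):
    """Substitution (13.3): the coefficient at p becomes sign times the coefficient at -p."""
    return {p: sign * coeffs[-p] for p in coeffs}


rng = random.Random(0)
a = {p: complex(rng.gauss(0, 1), rng.gauss(0, 1)) for p in MOMENTA}
b_dag = {p: complex(rng.gauss(0, 1), rng.gauss(0, 1)) for p in MOMENTA}

t, x = 0.4, 1.3
errors = []
for sign in (+1, -1):                                        # scalar and pseudoscalar cases
    lhs = psi(parity(a, sign), parity(b_dag, sign), t, x)    # psi^P(t, x)
    rhs = sign * psi(a, b_dag, t, -x)                        # +/- psi(t, -x), as in (13.5)
    errors.append(abs(lhs - rhs))
max_error = max(errors)
print(max_error)
```

The two sides agree to rounding error for both signs, because the energy $\varepsilon$ and the normalization factor depend only on $|p|$ and are unaffected by the relabelling.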
The $\psi$-operator (13.4) is also symmetrical under a transformation which has no analogue in the non-relativistic theory, that of charge conjugation (the C transformation). If all the operators $\hat a_{\mathbf p}$ and $\hat b_{\mathbf p}$ are respectively interchanged:

$$C:\ \hat a_{\mathbf p} \to \hat b_{\mathbf p}, \qquad \hat b_{\mathbf p} \to \hat a_{\mathbf p} \qquad (13.6)$$

(i.e. if particles and antiparticles are interchanged), then $\hat\psi$ becomes the charge-conjugate operator $\hat\psi^C$, where

$$\hat\psi^C(t, \mathbf r) = \hat\psi^+(t, \mathbf r). \qquad (13.7)$$

This equation expresses the symmetry of the concepts of particles and antiparticles in the theory.

There is an unimportant formal arbitrariness in the definition of the charge-conjugation transformation. The significance of the transformation is unchanged if an arbitrary phase factor is included in the definition (13.6):

$$\hat a_{\mathbf p} \to e^{i\alpha}\, \hat b_{\mathbf p}, \qquad \hat b_{\mathbf p} \to e^{-i\alpha}\, \hat a_{\mathbf p}.$$

This would lead to

$$\hat\psi \to e^{i\alpha}\, \hat\psi^+, \qquad \hat\psi^+ \to e^{-i\alpha}\, \hat\psi,$$

and a twofold repetition of the transformation would again yield an identity ($\hat\psi \to \hat\psi$). All such definitions are equivalent, however. Since the properties of the $\psi$-operators are unchanged on multiplication by a phase factor (cf. the end of §12), we can simply write $\hat\psi e^{i\alpha/2}$ in place of $\hat\psi$, and thus again obtain the definition of charge conjugation (13.6), (13.7).

Since charge conjugation replaces a particle by its antiparticle, which is not identical with it, no new properties of a particle or a system of particles, as such, will in general arise.

An exception is formed by systems comprising equal numbers of particles and antiparticles. The operator $\hat C$ transforms such a system into itself, and so in this case the operator has eigenstates, corresponding to the eigenvalues $C = \pm 1$ (since $\hat C^2 = 1$). To describe the charge symmetry, we may regard the particle and the antiparticle as two different "charge states" of the same particle, differing in the value of the charge quantum number $Q = \pm 1$.
The wave function of the system is the product of an orbital function and a "charge" function, and must be symmetrical with respect to simultaneous interchange of all the variables (coordinate and charge) of any pair of particles. The symmetry of the "charge" function determines the charge parity of the system (see the Problem at the end of this section).†

The concept of charge parity, which arises in a natural manner for "strictly neutral" systems, must apply also to strictly neutral "elementary" particles. In the second quantization formalism, this concept is represented by the equation

$$\hat\psi^C = \pm\hat\psi, \qquad (13.8)$$

where the plus and minus signs correspond to charge-even and charge-odd particles respectively.

Relativistic invariance implies invariance under 4-inversion (see §11). For a scalar field operator (in the sense of 4-rotations) this means that 4-inversion must give

$$\hat\psi(t, \mathbf r) \to \hat\psi(-t, -\mathbf r),$$

with the right-hand side always taken with the plus sign. In terms of transformations of the operators $\hat a_{\mathbf p}$, $\hat b_{\mathbf p}$, the transformation of $\hat\psi(t, \mathbf r)$ into $\hat\psi(-t, -\mathbf r)$ is obtained by interchanging the coefficients of $e^{-ipx}$ and $e^{ipx}$ in (13.4), i.e. by making

$$\hat a_{\mathbf p} \to \hat b^+_{\mathbf p}, \qquad \hat b_{\mathbf p} \to \hat a^+_{\mathbf p}. \qquad (13.9)$$

Since $a$-operators are replaced by $b$-operators, this involves interchange of particles and antiparticles. We see that, in the relativistic theory, there is a natural requirement of invariance under a transformation in which spatial inversion (P) and time reversal (T) are accompanied by charge conjugation (C); this is called the CPT theorem.‡

† In this discussion we are considering a particle with spin zero. The treatment given here can be immediately generalized to other spin values; see, for instance, §27, Problem.

‡ This theorem was enunciated by G. Lüders (1954) and W. Pauli (1955).
Here, however, it must be emphasized that, although the arguments given in §§11 and 12 and the present section are a natural development of the ideas of ordinary quantum mechanics and classical relativity theory, the results thus obtained go beyond these both in form (ψ-operators including both particle creation and particle annihilation operators at the same time) and in content (particles and antiparticles). They cannot therefore be regarded as logically necessary, but embrace new physical principles whose correctness can be tested only by experiment.

If the operator (13.4) transformed by (13.9) is denoted by $\hat\psi^{CPT}(t,\mathbf r)$, we can write

$\hat\psi^{CPT}(t,\mathbf r)=\hat\psi(-t,-\mathbf r).$    (13.10)

Thus, if 4-inversion is formulated as the transformation (13.9), we thereby establish also the formulation of the time-reversal transformation of the ψ-operator: together with the combined inversion transformation CP, it must give (13.9). Using the definitions (13.3) and (13.6), we therefore find

$T:\quad \hat a_{\mathbf p}\to\pm\hat a^+_{-\mathbf p},\qquad \hat b_{\mathbf p}\to\pm\hat b^+_{-\mathbf p},$    (13.11)

where the signs ± correspond to those in (13.3). The significance of this transformation is obvious: time reversal not only changes motion with momentum p into motion with momentum −p, but also interchanges initial and final states in the matrix elements. The annihilation operators for particles with momentum p are therefore replaced by creation operators for particles with momentum −p. Making the substitutions (13.11) in (13.4) and changing the notation for the summation variable (p → −p), we obtain†

$\hat\psi^T(t,\mathbf r)=\pm\hat\psi^+(-t,\mathbf r).$    (13.12)

This is similar to the general rule for time reversal in quantum mechanics: if a certain state is described by the wave function $\psi(t,\mathbf r)$, then the "time-reversed" state is described by the function $\psi^*(-t,\mathbf r)$. The change to the complex conjugate function is necessary because the "correct" time dependence must be restored, after being lost through the change in the sign of t (E. P.
Wigner, 1932).

† If the operation T is defined without regard to the other transformations, there is the same arbitrariness in the choice of the phase factor as occurs for the operation C. The requirement of CPT symmetry implies that the phase factor can be chosen arbitrarily for only one of the transformations C and T.

Since the transformation T (and therefore CPT) interchanges the initial and final states, there are no eigenstates and eigenvalues, and therefore no new properties of particles as such. The consequences as regards scattering processes will be discussed in §§69 and 71.

Let us see how the current 4-vector operator $\hat j^\mu$ (12.8) is affected by the transformations C, P and T. The transformation (13.2), together with $(\partial_0,\partial_i)\to(\partial_0,-\partial_i)$, gives

$P:\quad (\hat j^0,\hat{\mathbf j})_{t,\mathbf r}\to(\hat j^0,-\hat{\mathbf j})_{t,-\mathbf r},$    (13.13)

as we should expect for a true 4-vector. The transformation (13.7) would give simply

$C:\quad (\hat j^0,\hat{\mathbf j})_{t,\mathbf r}\to(-\hat j^0,-\hat{\mathbf j})_{t,\mathbf r}$    (13.14)

if the operators $\hat\psi$ and $\hat\psi^+$ commuted. However, the non-commutativity of these operators is due only to that of the operators $\hat a_{\mathbf p}$ and $\hat a^+_{\mathbf p}$ (or $\hat b_{\mathbf p}$ and $\hat b^+_{\mathbf p}$) with the same p, and from the commutation rules (11.4) the interchange of these operators produces only terms independent of the occupation numbers, i.e. independent of the state of the field. Omitting these terms as unimportant, as in (11.5), (11.6), we return to (13.14), whose significance is evident: charge conjugation replaces particles by antiparticles and thus changes the sign of every component of the 4-current.

Since the operation of time reversal involves transposing the initial and final states, it changes the order of the factors in a product of operators. For example,

$(\hat\psi^+\,\partial_\mu\hat\psi)^T=(\partial_\mu\hat\psi)^T(\hat\psi^+)^T.$

Here, however, this is not important: since the ψ-operators commute (in the sense explained above), the result is unaffected by returning to the original order of factors. Since also $(\partial_0,\partial_i)\to(-\partial_0,\partial_i)$ under time reversal, the current transformation rule is

$T:\quad (\hat j^0,\hat{\mathbf j})_{t,\mathbf r}\to(\hat j^0,-\hat{\mathbf j})_{-t,\mathbf r}.$
(13.15) The three-dimensional vector j changes sign, in accordance with its classical significance. Finally, for the CPT transformation,

$CPT:\quad (\hat j^0,\hat{\mathbf j})_{t,\mathbf r}\to(-\hat j^0,-\hat{\mathbf j})_{-t,-\mathbf r},$    (13.16)

in accordance with the significance of this operation as 4-inversion. Here it must be emphasized that, since 4-inversion is a rotation of the 4-coordinate system, with respect to it there is no distinction between two types (true and pseudo) of 4-tensors of any rank.

So far, we have assumed that the particles are free; but parity quantum numbers acquire real significance only when interacting particles are considered and definite selection rules are imposed which allow or forbid specified processes. Only conserved properties, however, can have this significance; that is, the eigenvalues of operators which commute with the Hamiltonian of the interacting particles.

Because of relativistic invariance, the CPT transformation operator always commutes with the Hamiltonian. For the C and P (and therefore T) transformations separately, experiment shows that the electromagnetic and strong interactions are invariant, and the corresponding parity quantum numbers are therefore conserved in these interactions. In a weak interaction, these conservation laws do not hold.†

† The idea that parity might not be conserved in weak interactions was first put forward by T. D. Lee and C. N. Yang (1956). The general notion that the laws of physics might not have P and T invariance had previously been suggested by Dirac (1949).

Anticipating a little, we may mention that the operator of the interaction between charged particles and the electromagnetic field is given by the product of the operator 4-vectors $\hat A$ and $\hat j$. Since charge conjugation changes the sign of $\hat j$, the invariance of the electromagnetic interaction under this transformation means that the sign of $\hat A$ must also be changed. Thus photons are charge-odd particles.
This behaviour of the operators $\hat A$ is in accordance with the properties of the 4-potential in the classical theory: from the transformations

$C:\quad (\hat A_0,\hat{\mathbf A})\to(-\hat A_0,-\hat{\mathbf A})_{t,\mathbf r},$
$P:\quad (\hat A_0,\hat{\mathbf A})\to(\hat A_0,-\hat{\mathbf A})_{t,-\mathbf r},$
$CPT:\quad (\hat A_0,\hat{\mathbf A})\to(-\hat A_0,-\hat{\mathbf A})_{-t,-\mathbf r},$

it follows that

$T:\quad (\hat A_0,\hat{\mathbf A})\to(\hat A_0,-\hat{\mathbf A})_{-t,\mathbf r},$

in agreement with the classical rule for the transformation of the electromagnetic field potentials under time reversal.

The requirement of CPT invariance does not impose any limitations on the properties of the particles themselves, but it implies certain relations between those of particles and antiparticles. Firstly, their masses must be equal, as is evident from the relation described in §11 between 4-inversion and the basis of the concept of particles and antiparticles. Next, it follows from CPT invariance that the coefficients of proportionality between the electric and magnetic moment vectors and the spin vector differ only in sign for a particle and its antiparticle. The magnetic moment changes sign under the C and T transformations but (being an axial vector) is not affected by the P transformation. Hence the CPT transformation, which converts a particle into an antiparticle, does not change the sign of the magnetic moment; the spin vector does change sign. The same applies to the electric moment, which is unchanged by time reversal but changes sign under the C transformation and (being a polar vector) under spatial inversion.

The requirements of P and T invariance (if complied with) restrict the properties of each particle, prohibiting the existence of an electric dipole moment: the only vector that can be constructed from the ψ-operators of an elementary particle at rest is its spin operator vector, which is P-even and T-odd, and can therefore give rise to a magnetic moment but not an electric moment. We must emphasize that either P invariance or T invariance alone is sufficient to impose this prohibition.
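The sign rules of this section can be collected in a few lines of Python. What follows is only a bookkeeping sketch: the dictionary layout and function names are illustrative, the changes of the arguments (t, r) are not tracked, and the entry "spin is unchanged by C" is an assumption added here (the text states only the P and T behaviour of the spin and the result for CPT).

```python
from math import prod

# Signs picked up by (time component, space components) under C, P, T,
# as stated for the 4-current and the 4-potential in this section.
FOUR_VECTORS = {
    "current":   {"C": (-1, -1), "P": (+1, -1), "T": (+1, -1)},
    "potential": {"C": (-1, -1), "P": (+1, -1), "T": (+1, -1)},
}

# Signs picked up by three-dimensional vectors of a particle at rest.
# The C entry for the spin is an assumption (C acts only on the charge variables).
VECTORS = {
    "magnetic moment": {"C": -1, "P": +1, "T": -1},
    "electric moment": {"C": -1, "P": -1, "T": +1},
    "spin":            {"C": +1, "P": +1, "T": -1},
}

def cpt_four_vector(name):
    """Combined CPT sign of (time component, space components)."""
    rule = FOUR_VECTORS[name]
    return tuple(prod(rule[op][i] for op in "CPT") for i in (0, 1))

def cpt_vector(name):
    """Combined CPT sign of a three-dimensional vector."""
    return prod(VECTORS[name][op] for op in "CPT")

def moment_allowed(moment, symmetry):
    """A moment proportional to the spin is allowed by a symmetry (P or T)
    only if it transforms under that symmetry exactly as the spin does."""
    return VECTORS[moment][symmetry] == VECTORS["spin"][symmetry]

if __name__ == "__main__":
    print("CPT on current:", cpt_four_vector("current"))
    for name in VECTORS:
        print(f"CPT on {name}: {cpt_vector(name):+d}")
    for sym in "PT":
        print(sym, "allows electric moment:", moment_allowed("electric moment", sym))
```

In this bookkeeping the magnetic and electric moments are CPT-even while the spin is CPT-odd, which is the change of sign of the proportionality coefficient between particle and antiparticle; and the electric moment fails the comparison with the spin under P and under T separately.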
PROBLEM

Determine the charge and spatial parities of a system of two particles with spin zero (particle and antiparticle) and orbital angular momentum of relative motion l.

SOLUTION. Interchanging the coordinates of the particles is equivalent to inversion (about their centre of mass), and therefore multiplies the orbital function by $(-1)^l$; interchanging the charge variables is equivalent to charge conjugation, and multiplies the "charge" factor in the wave function by the required parity C. The condition $C(-1)^l=1$ gives $C=(-1)^l$.

The spatial parity P of the system is the product of the orbital parity and the internal parities of the two particles. Since the particle and the antiparticle have the same internal parity, in this case P is equal to the orbital parity: $P=(-1)^l$.

§ 14. The wave equation for a particle with spin one

A particle with spin one is described in its rest frame by a three-component wave function, a three-dimensional vector; such a particle is often called a vector particle. The four-dimensional origin of this vector may be as the three spatial components of the space-like 4-vector $\psi^\mu$ or the mixed components of the antisymmetric 4-tensor $\psi^{\mu\nu}$ of rank two; the time component $\psi^0$ and the space components $\psi^{ik}$ are zero in the rest frame.†

The wave equation is a differential relation between the quantities $\psi^\mu$ and $\psi^{\mu\nu}$, and will be written as the equations

$i\psi_{\mu\nu}=p_\mu\psi_\nu-p_\nu\psi_\mu,$    (14.1)
$im^2\psi_\mu=p^\nu\psi_{\mu\nu},$    (14.2)

with $p=i\partial$ (A. Proca, 1936). Applying the operator $p^\mu$ to both sides of equation (14.2), we have

$p^\mu\psi_\mu=0,$    (14.3)

since $\psi_{\mu\nu}$ is antisymmetric. By substituting (14.1) in (14.2) to eliminate $\psi_{\mu\nu}$ and using (14.3), we obtain

$(p^2-m^2)\psi_\mu=0,$    (14.4)

whence it is again evident (cf. §10) that m is the mass of the particle.
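The chain (14.1)-(14.4) can be checked numerically for a plane wave, on which the operator $p_\mu=i\partial_\mu$ reduces to multiplication by the 4-momentum. The sketch below is only an illustration: it assumes the metric signature (+, −, −, −), drops the common factor $e^{-ipx}$, and uses arbitrary numerical values for the mass, momentum and polarization.

```python
import numpy as np

g = np.diag([1.0, -1.0, -1.0, -1.0])       # metric tensor, signature (+,-,-,-) assumed

m = 1.3                                     # illustrative mass
p3 = np.array([0.4, -0.7, 0.2])             # illustrative 3-momentum
energy = np.sqrt(m**2 + p3 @ p3)            # on-shell energy, so that p^2 = m^2
p_up = np.array([energy, *p3])              # contravariant p^mu
p_lo = g @ p_up                             # covariant p_mu

# Arbitrary complex spatial polarization; the time component is fixed
# by the transversality condition (14.3), p^mu u_mu = 0.
u3 = np.array([0.3 + 0.1j, -0.2j, 0.5])
u_up = np.array([(p3 @ u3) / energy, *u3])
u_lo = g @ u_up

# (14.1): i psi_{mu nu} = p_mu u_nu - p_nu u_mu
psi_lo = -1j * (np.outer(p_lo, u_lo) - np.outer(u_lo, p_lo))

# (14.2): i m^2 u_mu = p^nu psi_{mu nu}
lhs = 1j * m**2 * u_lo
rhs = psi_lo @ p_up

transversality = u_lo @ p_up                # should vanish, (14.3)
mass_shell = p_lo @ p_up - m**2             # should vanish, (14.4)

if __name__ == "__main__":
    print("max |lhs - rhs| of (14.2):", np.max(np.abs(lhs - rhs)))
    print("|p.u|:", abs(transversality), " p^2 - m^2:", mass_shell)
```

If the energy is taken off the mass shell, or the time component of the amplitude is chosen without regard to (14.3), the two sides of (14.2) no longer agree.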
Thus a free particle with spin one can be described by a single 4-vector $\psi^\mu$, whose components satisfy the second-order equation (14.4), and also the further condition (14.3), which eliminates from $\psi^\mu$ the part pertaining to spin zero. In the rest frame, where $\psi_\mu$ is independent of the spatial coordinates, we find that $p^0\psi_0=0$. Since also $p^0\psi_0=m\psi_0$, it is seen that in the rest frame $\psi_0=0$, as it should be, and the $\psi_{ik}$ are likewise zero.

A particle with spin one can have different internal parities, according as $\psi^\mu$ is a true vector or a pseudovector. In the former case

$P\psi^\mu=(\psi^0,-\boldsymbol\psi),$

and in the latter case

$P\psi^\mu=(-\psi^0,\boldsymbol\psi).$

† Anticipating, we may mention that the ensemble of the 4-vector $\psi_\mu$ and the 4-tensor $\psi_{\mu\nu}$ corresponds to that of the 4-dimensional spinors of rank two $\zeta^{\alpha\dot\beta}$, $\eta_{\dot\alpha\dot\beta}$, $\xi^{\alpha\beta}$, where $\xi^{\alpha\beta}$ and $\eta_{\dot\alpha\dot\beta}$ are symmetrical spinors changed into each other on inversion (§19).

Equations (14.1), (14.2) can be derived from the variational principle, using the Lagrangian

$L=\tfrac12\psi_{\mu\nu}\psi^{\mu\nu*}-\tfrac12\psi^{\mu\nu*}(\partial_\mu\psi_\nu-\partial_\nu\psi_\mu)-\tfrac12\psi^{\mu\nu}(\partial_\mu\psi^*_\nu-\partial_\nu\psi^*_\mu)+m^2\psi_\mu\psi^{\mu*}.$    (14.5)

The independent generalized coordinates are here represented by $\psi_\mu$, $\psi^*_\mu$, $\psi_{\mu\nu}$, $\psi^*_{\mu\nu}$.†

To find the energy-momentum tensor, formula (10.11) is not entirely suitable here, since it would lead to an unsymmetrical tensor requiring further symmetrization. Instead, we can use the formula

$\tfrac12\sqrt{-g}\,T_{\mu\nu}=\dfrac{\partial(\sqrt{-g}\,L)}{\partial g^{\mu\nu}}-\dfrac{\partial}{\partial x^\lambda}\dfrac{\partial(\sqrt{-g}\,L)}{\partial(\partial g^{\mu\nu}/\partial x^\lambda)},$    (14.6)

in which L is assumed to be expressed in a form appropriate to any curvilinear coordinates (see Fields, §94). If L contains only the components of the metric tensor $g_{\mu\nu}$ and not their derivatives with respect to the coordinates, the formula becomes simply

$T_{\mu\nu}=\dfrac{2}{\sqrt{-g}}\dfrac{\partial(\sqrt{-g}\,L)}{\partial g^{\mu\nu}}=2\dfrac{\partial L}{\partial g^{\mu\nu}}-g_{\mu\nu}L$

(since $d\log g=-g_{\mu\nu}\,dg^{\mu\nu}$).
Since the differentiation in formula (14.6) is not with respect to the quantities $\psi_\mu$, $\psi_{\mu\nu}$, these quantities need not be regarded as independent when applying the formula; we may immediately make use of the relationship (14.1) to rewrite the Lagrangian (14.5) as

$L=-\tfrac12\psi_{\mu\nu}\psi^*_{\lambda\rho}g^{\mu\lambda}g^{\nu\rho}+m^2\psi_\mu\psi^*_\nu g^{\mu\nu}.$    (14.7)

Then

$T_{\mu\nu}=-\psi_{\mu\lambda}\psi^{*}{}_{\nu}{}^{\lambda}-\psi^*_{\mu\lambda}\psi_{\nu}{}^{\lambda}+m^2(\psi_\mu\psi^*_\nu+\psi^*_\mu\psi_\nu)+g_{\mu\nu}\left(\tfrac12\psi_{\lambda\rho}\psi^{\lambda\rho*}-m^2\psi_\lambda\psi^{\lambda*}\right).$    (14.8)

In particular, the energy density is given by the essentially positive expression

$T_{00}=\tfrac12\psi_{ik}\psi^*_{ik}+\psi_{0i}\psi^*_{0i}+m^2(\psi_0\psi^*_0+\psi_i\psi^*_i).$    (14.9)

The conserved current density 4-vector is given by

$j^\mu=i(\psi^{\mu\nu*}\psi_\nu-\psi^{\mu\nu}\psi^*_\nu).$    (14.10)

This can be obtained, in accordance with (12.12), by differentiating the Lagrangian (14.5) with respect to the derivative $\partial_\mu\psi_\nu$. In particular

$j^0=i(\psi^{0k*}\psi_k-\psi^{0k}\psi^*_k)$    (14.11)

and is not an essentially positive quantity.

† If the variation were made with respect to $\psi_\mu$ only (assuming $\psi_{\mu\nu}$ already expressed in terms of $\psi_\mu$ by (14.1)), equation (14.3) would have to be imposed as an additional condition unrelated to the variational principle.

A plane wave normalized to one particle in the volume V = 1 is

$\psi_\mu=\dfrac{1}{\sqrt{2\varepsilon}}\,u_\mu e^{-ipx},\qquad u_\mu u^{\mu*}=-1,$    (14.12)

where $u_\mu$ is the unit polarization 4-vector, which, by (14.3), satisfies the condition of four-dimensional transversality,

$u_\mu p^\mu=0.$    (14.13)

For, on substituting the function (14.12) in (14.9) and (14.11), we obtain

$T_{00}=-2\varepsilon^2\psi_\mu\psi^{\mu*}=\varepsilon,\qquad j^0=1.$

Unlike the photon, a vector particle with non-zero mass has three independent directions of polarization. The corresponding amplitudes are given in (16.21).

The density matrix for partially polarized vector particles is defined so that in a pure state it reduces to the product $\rho_{\mu\nu}=u_\mu u^*_\nu$ (similarly to (8.7) for photons). According to (14.12) and (14.13), it satisfies the conditions

$p^\mu\rho_{\mu\nu}=0,\qquad \rho^\mu{}_\mu=-1.$    (14.14)

For unpolarized particles, $\rho_{\mu\nu}$ must have the form $ag_{\mu\nu}+bp_\mu p_\nu$. When the coefficients a and b are found from (14.14), the result is

$\rho_{\mu\nu}=-\tfrac13\left(g_{\mu\nu}-p_\mu p_\nu/m^2\right).$
(14.15) The quantization of the vector particle field is entirely analogous to the scalar case, and there is no need to repeat the arguments. The ψ-operators of the vector field are

$\hat\psi_\mu=\sum_{\mathbf p\alpha}\dfrac{1}{\sqrt{2\varepsilon}}\left(\hat a_{\mathbf p\alpha}u^{(\alpha)}_\mu e^{-ipx}+\hat b^+_{\mathbf p\alpha}u^{(\alpha)*}_\mu e^{ipx}\right),$    (14.16)

where the suffix α labels the three independent polarizations.

As in the scalar case, Bose quantization is necessary because the expression (14.9) for $T_{00}$ is positive definite and the expression (14.11) for $j^0$ is not.

There is a close connection between the properties of strictly neutral vector and electromagnetic fields. The neutral vector field is described by an Hermitian ψ-operator:

$\hat\psi_\mu=\sum_{\mathbf p\alpha}\dfrac{1}{\sqrt{2\varepsilon}}\left(\hat c_{\mathbf p\alpha}u^{(\alpha)}_\mu e^{-ipx}+\hat c^+_{\mathbf p\alpha}u^{(\alpha)*}_\mu e^{ipx}\right).$    (14.17)

The Lagrangian of this field is

$L=\tfrac14\psi_{\mu\nu}\psi^{\mu\nu}-\tfrac12\psi^{\mu\nu}(\partial_\mu\psi_\nu-\partial_\nu\psi_\mu)+\tfrac12m^2\psi_\mu\psi^\mu.$    (14.18)

The electromagnetic field corresponds to m = 0. The 4-vector $\psi^\mu$ then becomes the 4-potential $A^\mu$, and the 4-tensor $\psi^{\mu\nu}$ becomes the field tensor $F^{\mu\nu}$, which is related to the potential by the definition (14.1). Equation (14.2) becomes $\partial_\nu\psi^{\mu\nu}=0$, corresponding to the second pair of Maxwell's equations. This does not imply the condition (14.3), which therefore is no longer obligatory. Since the extra condition has disappeared, there is no need to regard $\psi_\mu$ and $\psi_{\mu\nu}$ as independent "coordinates" in the Lagrangian, and (14.18) becomes

$L=-\tfrac14\psi_{\mu\nu}\psi^{\mu\nu},$    (14.19)

in agreement with the familiar classical expression for the Lagrangian of the electromagnetic field. This Lagrangian, like the tensor $\psi_{\mu\nu}$, is invariant under any gauge transformation of the "potentials" $\psi_\mu$. There is an evident connection between this property and the zero mass: the Lagrangian (14.18) does not possess the property, because of the term $m^2\psi_\mu\psi^\mu$.

§ 15.
The wave equation for particles with higher integral spins

Since the wave equations (14.3), (14.4) follow immediately when the particle mass and spin are given, the practical utilization of the Lagrangian involves not so much the derivation of these equations as the establishment of expressions for the field energy, momentum and charge. To do so we can, as already mentioned, use in place of (14.5) the expression (14.7), and the latter can be further transformed as follows. From (14.1), it can be rewritten as

$L=-(\partial_\mu\psi^*_\nu)(\partial^\mu\psi^\nu)+m^2\psi_\mu\psi^{\mu*}+\partial_\mu(\psi_\nu\,\partial^\nu\psi^{\mu*})-\psi_\nu\,\partial^\nu\partial_\mu\psi^{\mu*}.$

The last term is zero, by (14.3), and the one preceding it is a total derivative. Omitting this, we obtain the Lagrangian

$L'=-(\partial_\mu\psi^*_\nu)(\partial^\mu\psi^\nu)+m^2\psi^*_\mu\psi^\mu.$    (15.1)

This has the same form as the Lagrangian (10.9) for a particle with spin zero, the only difference being that the scalar ψ is replaced by the 4-vector $\psi_\mu$ and the sign is changed. The change of sign occurs because $\psi_\mu$ is a space-like vector, so that $\psi_\mu\psi^{\mu*}<0$, whereas for a scalar particle $\psi\psi^*>0$.

On constructing the energy-momentum 4-tensor and the current 4-vector from the Lagrangian (15.1), we obtain expressions of the same form as (10.12) and (10.18) for the scalar field:

$T'_{\mu\nu}=-\partial_\mu\psi^{\lambda*}\cdot\partial_\nu\psi_\lambda-\partial_\nu\psi^{\lambda*}\cdot\partial_\mu\psi_\lambda-L'g_{\mu\nu},$    (15.2)
$j'_\mu=-i\left[\psi^{\lambda*}\partial_\mu\psi_\lambda-(\partial_\mu\psi^{\lambda*})\psi_\lambda\right].$    (15.3)

The difference between these and (14.8), (14.10) is again a total derivative. But it has already been stressed that the local values of these quantities have no profound physical significance. Only the volume integrals $P^\mu$ (10.15) and Q (10.19) are important, and these will be the same for either choice of $T_{\mu\nu}$ and $j_\mu$.

This method of description can be immediately generalized to particles with any (integral) spin. The wave function of a particle with spin s is an irreducible 4-tensor of rank s, i.e. a tensor symmetrical in all its indices and vanishing on contraction with respect to any pair of indices:

$\psi_{\ldots\mu\ldots\nu\ldots}=\psi_{\ldots\nu\ldots\mu\ldots},\qquad \psi^{\mu}{}_{\ldots\mu\ldots}=0.$
(15.4) This tensor must satisfy the additional condition of 4-transversality:

$p^\mu\psi_{\ldots\mu\ldots}=0,$    (15.5)

and each of its components must satisfy the second-order equation

$(p^2-m^2)\psi_{\ldots}=0.$    (15.6)

In the rest frame, the condition (15.5) means that every component of the 4-tensor whose indices include a zero must vanish. Thus the wave function in the rest frame (i.e. in the non-relativistic limit) is equivalent, as it should be, to an irreducible 3-tensor of rank s, the number of independent components of which is 2s + 1.

The Lagrangian, the energy-momentum tensor and the current vector for a field of particles with spin s differ from (15.1)-(15.3) only in that $\psi_\lambda$ is replaced by $\psi_{\lambda\mu\ldots}$. The normalized plane wave is

$\psi^{\mu\nu\ldots}=\dfrac{1}{\sqrt{2\varepsilon}}\,u^{\mu\nu\ldots}e^{-ipx},\qquad u^*_{\mu\nu\ldots}u^{\mu\nu\ldots}=-1,$    (15.7)

the wave amplitude satisfying the conditions

$u^{\ldots\mu\ldots}p_\mu=0.$    (15.8)

There are 2s + 1 independent polarization states.

The quantization of the field is effected by an obvious generalization from the cases of spin zero and one.

The procedure given above is entirely sufficient for the stated purpose: to describe a field of free particles. The situation is different if it is proposed to describe the interaction of the particles with an electromagnetic field. This interaction would have to be included in the Lagrangian in order to yield all the equations without the need to impose additional conditions. In practice, however, this description of the interaction is found to be applicable only for electrons, i.e. particles with spin ½ (see §32). For other spin values, therefore, the problem is only of methodological interest.

For any spin s > 1 (integral or half-integral), it proves impossible to formulate a variational principle by means of a single (tensor or spinor) function whose rank corresponds to the given spin. It is necessary to use additional tensor or spinor quantities of lower rank.
The Lagrangian is then so chosen that these auxiliary quantities must be zero on account of the free-particle field equations which follow from the variational principle.†

§ 16. Helicity states of a particle‡

In the relativistic theory the orbital angular momentum l and the spin s of a moving particle are not separately conserved. Only the total angular momentum j = l + s is conserved. The component of the spin in any fixed direction (taken as the z-axis) is therefore also not conserved, and cannot be used to enumerate the polarization (spin) states of the moving particle.

The component of the spin in the direction of the momentum is conserved, however: since l = r × p, the product s · n is equal to the conserved product j · n (n = p/|p|). This quantity is called the helicity; it has already been mentioned in §8 in relation to the photon. Its eigenvalues will be denoted by λ (λ = −s, ..., +s), and states of a particle having definite values of λ will be called helicity states.

Let $\psi_{\mathbf p\lambda}$ be the wave function (plane wave) describing the state of a particle with definite values of p and λ, and $u^{(\lambda)}(\mathbf p)$ its amplitude; to simplify the notation, we shall omit the indices for the components of this function (4-tensor indices for a particle with integral spin).

It has been shown in earlier sections that a wave function with more than 2s + 1 components is needed in order to give a relativistic description of particles with non-zero (integral) spin. But the number of independent components remains equal to 2s + 1; the "extra" components are eliminated by imposing additional conditions which cause these components to vanish in the rest frame. In Chapter III this will be shown for half-integral s also.

According to the formulae for transformation of the angular momentum (see Fields, §14), the helicity is invariant under those Lorentz transformations which do not alter the direction of p along which the angular momentum component is taken.
The number λ therefore remains a good quantum number under such transformations, and the symmetry properties of helicity states can be studied by means of a frame of reference in which the momentum |p| ≪ m (in the limit, the rest frame). Then $\psi_{\mathbf p\lambda}$ reduces to a non-relativistic wave function with 2s + 1 components. Let its amplitude be denoted by $w^{(\lambda)}(\mathbf n)$, the argument being the direction n = p/|p| along which the angular momentum is quantized. The amplitude $w^{(\lambda)}$ is an eigenfunction of the operator n · s:

$(\mathbf n\cdot\hat{\mathbf s})\,w^{(\lambda)}(\mathbf n)=\lambda w^{(\lambda)}(\mathbf n).$    (16.1)

† See M. Fierz and W. Pauli, Proceedings of the Royal Society A173, 211, 1939. The procedure indicated above is carried out in this paper for particles with spin 3/2 and 2.

‡ The discussion in this section relates to particles with any spin (integral or half-integral).

In the spinor representation, $w^{(\lambda)}$ is a contravariant symmetrical spinor of rank 2s; according to the correspondence formulae (QM, (57.2)), its components can also be enumerated by the corresponding values of the spin component σ along a fixed z-axis.†

In the momentum representation, the wave functions of the states considered are essentially the same as the amplitudes $u^{(\lambda)}(\mathbf p)$:

$\psi_{\mathbf p\lambda}(\mathbf k)=u^{(\lambda)}(\mathbf k)\,\delta^{(2)}(\boldsymbol\nu-\mathbf n)=u^{(\lambda)}(\mathbf p)\,\delta^{(2)}(\boldsymbol\nu-\mathbf n),$    (16.2)

where the momentum as an independent variable is denoted by k, as contrasted with its eigenvalue p, and ν = k/|k|, as against n = p/|p|.‡ In the non-relativistic limit,

$\psi_{\mathbf n\lambda}(\boldsymbol\nu)=w^{(\lambda)}(\boldsymbol\nu)\,\delta^{(2)}(\boldsymbol\nu-\mathbf n)=w^{(\lambda)}(\mathbf n)\,\delta^{(2)}(\boldsymbol\nu-\mathbf n).$    (16.3)

This expression should be written in the more explicit form

$\psi_{\mathbf n\lambda}(\boldsymbol\nu,\sigma)=w^{(\lambda)}_\sigma(\boldsymbol\nu)\,\delta^{(2)}(\boldsymbol\nu-\mathbf n),$

showing the discrete independent variable σ.

The helicity operator s · n commutes with the operators $j_z$ and $\mathbf j^2$, since the angular momentum operator is related to an infinitesimal rotation of the coordinates, and the scalar product of two vectors is invariant under any rotation.
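The eigenvalue equation (16.1) is easy to illustrate numerically for spin one. The sketch below assumes the standard vector representation of the spin-one operator, $(s_i)_{kl}=-ie_{ikl}$, acting on three-dimensional vectors; the direction n is an arbitrary illustrative choice and the helper names are hypothetical.

```python
import numpy as np

def levi_civita():
    """Totally antisymmetric unit tensor e_{ikl} in three dimensions."""
    e = np.zeros((3, 3, 3))
    e[0, 1, 2] = e[1, 2, 0] = e[2, 0, 1] = 1.0
    e[0, 2, 1] = e[2, 1, 0] = e[1, 0, 2] = -1.0
    return e

# Spin-one matrices in the vector representation (assumed): (s_i)_{kl} = -i e_{ikl}
S = -1j * levi_civita()

def helicity_matrix(n):
    """Matrix of the operator n.s for a unit vector n."""
    return np.einsum("i,ikl->kl", n, S)

n = np.array([1.0, 2.0, -2.0]) / 3.0        # arbitrary unit vector
H = helicity_matrix(n)

# n.s is Hermitian, so eigh applies; eigenvalues come out in ascending order.
helicities, amplitudes = np.linalg.eigh(H)

if __name__ == "__main__":
    print("helicities:", np.round(helicities, 12))
    e0 = amplitudes[:, 1]                   # amplitude with helicity 0
    print("|e0 . n| =", abs(np.vdot(e0, n)))
```

The three eigenvalues are the helicities −1, 0, +1; the amplitude with helicity zero is directed along n, while the other two are transverse to it. In this representation the action of n · s on a vector e is the same as forming i n × e.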
There exist, therefore, stationary states in which the particle simultaneously has definite values of the angular momentum j, its component $j_z=m$, and the helicity λ. Such states will be called spherical helicity states.

† These arguments, like the possible values shown for λ, apply to particles with non-zero mass. For massless particles there is no rest frame, and the helicity can take only the two values λ = ±s. This is because of the fact already mentioned in §8, that the states of such a particle are classified by their behaviour with respect to the axial-symmetry group, which allows only twofold degeneracy of levels (as regards the properties of the wave equation, this means that in the limit as m → 0 the set of equations for a particle with spin s separates into independent equations corresponding to massless particles with spins s, s − 1, ...). For example, the photon has λ = ±1, and the corresponding $w^{(\lambda)}$ are the three-dimensional vectors $\mathbf e^{(\pm1)}$ (8.2).

‡ The delta function $\delta^{(2)}$ is defined so that $\int\delta^{(2)}(\boldsymbol\nu-\mathbf n)\,do_{\boldsymbol\nu}=1$. The delta function which imposes a fixed value of the energy is omitted in (16.2), and similarly in (16.4) below.

Let us determine the wave functions of these states in the momentum representation. This may be done by direct analogy with the formulae derived in QM, §103 for the wave functions of a symmetrical top. They were obtained there on the basis of the formulae for the transformation of wave functions under finite rotations (QM, §58). These in turn were based solely on the symmetry properties with respect to rotation, and are therefore applicable to functions in the momentum representation just as much as to coordinate functions.

In addition to the coordinates x, y, z fixed in space (with respect to which the functions $\psi_{jm\lambda}$ are written), we shall also use "moving" coordinates ξ, η, ζ, with the ζ-axis in the direction of ν. Without repeating the argument (cf.
the derivation of QM, (103.8)), we can write

$\psi_{jm\lambda}(\mathbf k)=\psi^{(0)}_{j\lambda}D^{(j)}_{\lambda m}(\boldsymbol\nu),$

where $\psi^{(0)}_{j\lambda}$ is the wave function in the moving coordinates, describing the state of a particle with a definite value of the ζ-component of the angular momentum, $j_\zeta=\lambda$; in the momentum representation, of course, this function is the same as the amplitude $u^{(\lambda)}$. The normalized wave function (see below) is

$\psi_{jm\lambda}(\mathbf k)=\sqrt{\dfrac{2j+1}{4\pi}}\,D^{(j)}_{\lambda m}(\boldsymbol\nu)\,u^{(\lambda)}(\mathbf k).$    (16.4)

Here, however, there is a question of the choice of phases, because of the following non-uniqueness: a rotation of the coordinates ξ, η, ζ relative to x, y, z is defined by three Eulerian angles α, β, γ, whereas the direction of ν, on which the particle wave function can alone depend, is defined by the two spherical angles α ≡ φ and β ≡ θ. It is thus necessary to agree on some definite choice of the angle γ. We shall take γ = 0, defining $D^{(j)}_{\lambda m}(\boldsymbol\nu)$ as

$D^{(j)}_{\lambda m}(\boldsymbol\nu)\equiv D^{(j)}_{\lambda m}(\phi,\theta,0)=e^{im\phi}d^{(j)}_{\lambda m}(\theta).$    (16.5)

From QM, (58.21), the functions (16.5) are seen to satisfy the orthonormality conditions:

$\displaystyle\int D^{(j_1)*}_{\lambda m_1}(\boldsymbol\nu)\,D^{(j_2)}_{\lambda m_2}(\boldsymbol\nu)\,\dfrac{do_{\boldsymbol\nu}}{4\pi}=\dfrac{1}{2j_1+1}\,\delta_{j_1j_2}\delta_{m_1m_2},$    (16.6)

where $do_{\boldsymbol\nu}=\sin\theta\,d\theta\,d\phi$. The orthogonality of the functions $\psi_{jm\lambda}$ with respect to the suffix λ is ensured by the factor $u^{(\lambda)}$. Thus the functions $\psi_{jm\lambda}$ are orthogonal in all three suffixes, as they should be, and with the coefficient chosen in (16.4) they are normalized by the condition

$\displaystyle\int|\psi_{jm\lambda}|^2\,do_{\boldsymbol\nu}=1.$    (16.7)

Here we assume that the amplitudes $u^{(\lambda)}$ are normalized to unity: $u^{(\lambda)}u^{(\lambda)*}=1$.

Let us now consider the behaviour of the wave functions of helicity states under inversion of the coordinates. The product of the polar vector ν and the axial vector j is a pseudoscalar. It is therefore obvious that inversion will change a state with helicity λ into one with helicity −λ; all that is necessary is to determine the phase factors in these transformations.

Under inversion, ν → −ν.
The vector ν is defined by the two angles φ and θ, and the transformation ν → −ν is brought about by the changes φ → φ + π, θ → π − θ. This determines the ζ-axis but leaves indefinite the position of the ξ and η axes, which depends also on the third Eulerian angle γ; the transformation of θ and φ alone does not distinguish, in this sense, between reflection of the coordinates and rotation of the ζ-axis. Expressed in terms of all three Eulerian angles, inversion is the transformation

$\alpha\equiv\phi\to\phi+\pi,\qquad \beta\equiv\theta\to\pi-\theta,\qquad \gamma\to\pi-\gamma.$    (16.8)

Hence, if $D^{(j)}_{\lambda m}(\boldsymbol\nu)$ is defined as in (16.5) (i.e. with γ = 0), and the transformation ν → −ν is regarded as being the result of inversion, then

$D^{(j)}_{\lambda m}(-\boldsymbol\nu)=D^{(j)}_{\lambda m}(\phi+\pi,\pi-\theta,\pi).$    (16.9)

From formulae QM (58.9), (58.16) and (58.18) we hence find

$D^{(j)}_{\lambda m}(-\boldsymbol\nu)=e^{i\pi\lambda}\,d^{(j)}_{\lambda m}(\pi-\theta)\,e^{im(\phi+\pi)}=(-1)^{j-\lambda}e^{im\phi}\,d^{(j)}_{-\lambda,m}(\theta)=(-1)^{j-\lambda}D^{(j)}_{-\lambda,m}(\phi,\theta,0),$

or

$D^{(j)}_{\lambda m}(-\boldsymbol\nu)=(-1)^{j-\lambda}D^{(j)}_{-\lambda,m}(\boldsymbol\nu),$    (16.10)

where j − λ is an integer.

A similar formula for the spinor $w^{(\lambda)}$ can be obtained by noticing that its components $w^{(\lambda)}_\sigma$ are the same, apart from a factor, as the functions

$w^{(\lambda)}_\sigma(\boldsymbol\nu)\propto D^{(s)}_{\lambda\sigma}(\boldsymbol\nu)^*.$    (16.11)

For, by applying the transformation formulae QM (58.7) to the spin eigenfunctions and taking the ζ-component of the spin to have a definite value λ (i.e. replacing $\psi_{jm'}$ by $\delta_{m'\lambda}$ on the right-hand side of QM (58.7)), we find that $D^{(s)}_{\lambda\sigma}(\boldsymbol\nu)$ are the spin wave functions corresponding to definite values of the z and ζ components (σ and λ) of the spin. The set of these functions with σ = −s, ..., +s forms, according to the correspondence formulae (QM (57.6)), a covariant spinor of rank 2s. The components of the contravariant spinor, which according to the formulae QM (57.2) correspond to the components $w^{(\lambda)}_\sigma$, are transformed as the complex conjugates of the components of the covariant spinor of the same rank.

From (16.10) and (16.11), we have

$w^{(\lambda)}(-\boldsymbol\nu)=(-1)^{s-\lambda}w^{(-\lambda)}(\boldsymbol\nu),$    (16.12)

where s − λ is an integer.
The inversion operation applied to $w^{(\lambda)}$, however, not only changes ν into −ν but also multiplies $w^{(\lambda)}$ by a common phase factor (the "internal parity" of the particle), which we shall denote by η:

$P\,w^{(\lambda)}(\boldsymbol\nu)=\eta\,w^{(\lambda)}(-\boldsymbol\nu)=\eta(-1)^{s-\lambda}w^{(-\lambda)}(\boldsymbol\nu).$    (16.13)

For the relativistic amplitude $u^{(\lambda)}(\mathbf k)$, this transformation becomes

$P\,u^{(\lambda)}(\mathbf k)=\eta\beta\,u^{(\lambda)}(-\mathbf k)=\eta(-1)^{s-\lambda}u^{(-\lambda)}(\mathbf k),$    (16.14)

where β is a certain matrix which is a unit matrix with respect to the components of $u^{(\lambda)}$ which remain in the limit |p| → 0. It is important to note that this matrix does not depend on the quantum numbers of the state, and in this sense the difference between (16.13) and (16.14) is unimportant.†

On applying (16.14) to (16.2), we obtain the law of transformation of the wave functions of the states |nλ⟩:

$P\,\psi_{\mathbf n\lambda}(\boldsymbol\nu)=\eta(-1)^{s-\lambda}\psi_{-\mathbf n,-\lambda}(\boldsymbol\nu).$    (16.15)

For spherical helicity states, using (16.10) and (16.12), we obtain the transformation law

$P\,\psi_{jm\lambda}(\boldsymbol\nu)=\eta(-1)^{j-s}\psi_{jm,-\lambda}(\boldsymbol\nu).$    (16.16)

The states $\psi_{jm0}$ are transformed into themselves, according to (16.16), i.e. they have a definite parity. If λ ≠ 0, however, only superpositions of states with opposite helicities have a definite parity:

$\psi^{(\pm)}_{jm|\lambda|}=\dfrac{1}{\sqrt2}\left(\psi_{jm\lambda}\pm\psi_{jm,-\lambda}\right).$    (16.17)

On inversion, these are transformed into themselves:

$P\,\psi^{(\pm)}_{jm|\lambda|}(\boldsymbol\nu)=\pm\eta(-1)^{j-s}\psi^{(\pm)}_{jm|\lambda|}(\boldsymbol\nu).$    (16.18)

It should be noted that in this section we have arrived at a classification of states of a free particle with a given angular momentum, using only conserved quantities and without invoking the concept of the orbital angular momentum (which was employed, for instance, in §§6 and 7 for classifying photon states).

As an example, let us consider the case of spin one. In the rest frame the amplitudes $u^{(\lambda)}$ (4-vectors) become the three-dimensional vectors $\mathbf e^{(\lambda)}$, which here take the place of the amplitudes $w^{(\lambda)}$.
The action of the operator of spin one on the vector function e is given by the formula

$(\hat s_i\mathbf e)_k=-ie_{ikl}e_l;$    (16.19)

see QM, §57, Problem 2. Thus equation (16.1) becomes

$i\,\mathbf n\times\mathbf e^{(\lambda)}=\lambda\mathbf e^{(\lambda)}.$    (16.20)

† For example, when s = 1 the amplitudes u are the 4-vectors (16.22); β is then entirely a unit matrix with respect to the 4-vector indices, $\beta_{\mu\nu}=\delta_{\mu\nu}$. When s = ½, as we shall see in Chapter III, $u^{(\lambda)}$ is a bispinor, the phase factor η = i, and β is the Dirac matrix $\gamma^0$ (see (21.10)).

The solutions of this equation (in ξηζ coordinates with the ζ-axis in the direction of n) are the same as the spherical unit vectors (7.14):‡

$\mathbf e^{(0)}=i(0,0,1),\qquad \mathbf e^{(\pm1)}=\mp\dfrac{i}{\sqrt2}(1,\pm i,0).$    (16.21)

In a frame of reference in which the particle has momentum p, the helicity state amplitudes are the 4-vectors

$u^{(0)\mu}=\left(\dfrac{i|\mathbf p|}{m},\ \dfrac{\varepsilon}{m}\mathbf e^{(0)}\right),\qquad u^{(\pm1)\mu}=(0,\mathbf e^{(\pm1)}).$    (16.22)

If e is a polar vector, then η = −1, and the functions (16.17), which are three-dimensional vectors when s = 1, have the following parities:

$\psi^{(+)}_{jm|\lambda|}:\ P=(-1)^j,\qquad \psi^{(-)}_{jm|\lambda|}:\ P=(-1)^{j+1},\qquad \psi_{jm0}:\ P=(-1)^j.$

On comparing with the definition of the spherical harmonic vectors (7.4), we see that these functions are identical (apart from phase factors) with $\mathbf Y^{(e)}_{jm}$, $\mathbf Y^{(m)}_{jm}$, $\mathbf Y^{(l)}_{jm}$ respectively. After ascertaining the phase factors (by comparing values for θ = 0, say), we obtain the equations (16.23), where j is an integer; the vectors $\mathbf n\times\mathbf e^{(\lambda)}$ are spherical unit vectors along axes ξ′, η′, ζ which are obtained from ξ, η, ζ by a rotation of 90° about the ζ-axis.

The last formula (16.23) is equivalent to the expression QM (58.23) for $d^{(j)}_{0m}(\theta)$. The first or second formula (16.23) leads to a simple expression for the functions $d^{(j)}_{\pm1,m}$.

‡ The choice of phase factors is determined by the condition that the spin operator matrix elements calculated with the eigenfunctions (16.21) must be in accordance with the general definitions in QM, §§27 and 107.
We have

$$\mathbf Y^{(e)}_{jm}\cdot\mathbf e^{(\pm1)*} = \frac{1}{\sqrt{j(j+1)}}\ \mathbf e^{(\pm1)*}\cdot\nabla_{\mathbf n} Y_{jm},$$

where the left-hand side is expressed in terms of $d^{(j)}_{\pm1,m}$ by (16.23). The scalar product on the right can be written explicitly in the coordinates $\xi, \eta, \zeta$, with

$$\left(\frac{\partial}{\partial\xi},\ \frac{\partial}{\partial\eta}\right) = \left(\frac{\partial}{\partial\theta},\ \frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right).$$

With the definitions (7.2) of $Y_{jm}$ and (16.5), the result is the expression (16.24) for $d^{(j)}_{\pm1,m}(\theta)$ with $m \geq 0$.

CHAPTER III

FERMIONS

§17. Four-dimensional spinors

In the non-relativistic theory, a particle with arbitrary spin $s$ is described by a quantity with $2s + 1$ components, a symmetrical spinor of rank $2s$. These quantities are, mathematically, realizations of the irreducible representations of the spatial rotation group.

In the relativistic theory, this group is only a subgroup of the wider group of four-dimensional rotations, the Lorentz group. It is therefore necessary to develop the theory of four-dimensional spinors (4-spinors), as quantities which are realizations of the irreducible representations of the Lorentz group. This theory will be given in §§17-19. In §§17 and 18 we shall consider only the proper Lorentz group, which excludes spatial inversion; the latter will be dealt with in §19.

The theory of 4-spinors is analogous in structure to that of three-dimensional spinors (B. L. van der Waerden, 1929; G. E. Uhlenbeck and O. Laporte, 1931).

A spinor $\xi^\alpha$ is a quantity having two components ($\alpha = 1, 2$); as components of the wave function of a particle with spin $\tfrac12$, $\xi^1$ and $\xi^2$ correspond to the respective eigenvalues $+\tfrac12$ and $-\tfrac12$ of the $z$-component of the spin. Under any transformation belonging to the (proper) Lorentz group, the two quantities $\xi^1$ and $\xi^2$ are transformed into linear combinations of themselves:

$$\xi^{1\prime} = \alpha\xi^1 + \beta\xi^2, \qquad \xi^{2\prime} = \gamma\xi^1 + \delta\xi^2. \tag{17.1}$$

The coefficients $\alpha, \beta, \gamma, \delta$ are definite functions of the angles of rotation of the 4-coordinate system, and must satisfy the condition

$$\alpha\delta - \beta\gamma = 1; \tag{17.2}$$

that is, the determinant of the binary transformation (17.1) is equal to unity, as are the determinants of the coordinate transformations in the Lorentz group.

Because of the condition (17.2), the bilinear form $\xi^1\Xi^2 - \xi^2\Xi^1$ (where $\xi^\alpha$ and $\Xi^\alpha$ are two spinors) is invariant under the transformation (17.1), and corresponds to a particle with spin zero which "consists" of two particles with spin $\tfrac12$. In order to write such invariant expressions in a natural way, the "covariant" components $\xi_\alpha$ are used as well as the "contravariant" components $\xi^\alpha$ of the spinor. Their relationship is governed by the "metric spinor" $g_{\alpha\beta}$:†

$$\xi_\alpha = g_{\alpha\beta}\xi^\beta, \tag{17.3}$$

where

$$g_{\alpha\beta} = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, \tag{17.4}$$

so that

$$\xi_1 = \xi^2, \qquad \xi_2 = -\xi^1. \tag{17.5}$$

Then the invariant $\xi^1\Xi^2 - \xi^2\Xi^1$ becomes the scalar product $\xi^\alpha\Xi_\alpha$, and $\xi^\alpha\Xi_\alpha = -\xi_\alpha\Xi^\alpha$.

† The spinor indices will be denoted by the letters at the beginning of the Greek alphabet: $\alpha, \beta, \ldots$

The properties so far stated are formally the same as those of three-dimensional spinors. A difference arises, however, when complex-conjugate spinors are considered.

In the non-relativistic theory, the sum

$$\psi^1\psi^{1*} + \psi^2\psi^{2*}, \tag{17.6}$$

which determines the probability density for the localization of the particle in space, must be a scalar, and the components $\psi^{\alpha*}$ must therefore be transformed as the covariant components of a spinor; the transformation (17.1) must therefore be unitary ($\alpha = \delta^*$, $\beta = -\gamma^*$). In the relativistic theory, however, the particle density is not a scalar, but is the time component of a 4-vector. The above-mentioned condition therefore no longer applies, and the transformation coefficients need satisfy no condition other than (17.2).
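The invariance of the bilinear form under (17.1) can be checked numerically. The following is a minimal sketch, not part of the original text: it draws a random complex matrix with unit determinant (the helper name `random_unimodular` is an illustrative assumption) and compares $\xi^\alpha\Xi_\alpha$ before and after the transformation. It also evaluates the sum (17.6), which is in general not preserved because such a matrix need not be unitary.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_unimodular():
    """Random complex matrix [[alpha, beta], [gamma, delta]] with determinant 1 (17.2)."""
    m = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
    # For a 2x2 matrix, dividing by sqrt(det) rescales the determinant to 1.
    return m / np.sqrt(np.linalg.det(m))

# Metric spinor g_{alpha beta} of (17.4): xi_1 = xi^2, xi_2 = -xi^1.
g = np.array([[0, 1], [-1, 0]], dtype=complex)

def scalar_product(xi, Xi):
    """xi^alpha Xi_alpha = xi^1 Xi^2 - xi^2 Xi^1, with Xi_alpha = g_{alpha beta} Xi^beta."""
    return xi @ (g @ Xi)

B = random_unimodular()
xi = rng.normal(size=2) + 1j * rng.normal(size=2)
Xi = rng.normal(size=2) + 1j * rng.normal(size=2)

before = scalar_product(xi, Xi)
after = scalar_product(B @ xi, B @ Xi)      # both spinors transformed by (17.1)

# The sum (17.6) is a scalar only for unitary B, so it generally changes here.
norm_before = np.vdot(xi, xi).real
norm_after = np.vdot(B @ xi, B @ xi).real

print("xi^a Xi_a before/after:", before, after)
print("sum (17.6) before/after:", norm_before, norm_after)
```

The check rests on the identity $\tilde B g B = (\det B)\, g$, valid for any two-rowed matrix $B$, which is why the single condition (17.2) is sufficient.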
The four complex quantities $\alpha, \beta, \gamma, \delta$ under the condition (17.2) alone are equivalent to $8 - 2 = 6$ real parameters, in accordance with the number of angles which define a rotation of the 4-coordinate system (rotations in six coordinate planes).

Thus complex-conjugate binary transformations are quite different, and in the relativistic theory there exist two types of spinors. A special notation is customary, in order to distinguish these two types: the indices of spinors which are transformed by the complex conjugate formulae to (17.1) are written with dots over them and are called dotted indices. Thus, by definition,

$$\eta^{\dot\alpha} \sim \xi^{\alpha*}, \tag{17.7}$$

where the sign $\sim$ denotes "is transformed as". The transformation formulae for a "dotted" spinor are therefore

$$\eta^{\dot1\prime} = \alpha^*\eta^{\dot1} + \beta^*\eta^{\dot2}, \qquad \eta^{\dot2\prime} = \gamma^*\eta^{\dot1} + \delta^*\eta^{\dot2}. \tag{17.8}$$

The operations of raising and lowering the dotted indices are carried out in the same way as for the undotted indices:

$$\eta_{\dot1} = \eta^{\dot2}, \qquad \eta_{\dot2} = -\eta^{\dot1}. \tag{17.9}$$

The behaviour of 4-spinors as regards spatial rotation is the same as that of 3-spinors, for which, as we know, $\psi^{\alpha*} \sim \psi_\alpha$. According to the definition (17.7), the 4-spinor $\eta_{\dot\alpha}$ therefore behaves under rotations in the same way as the contravariant 3-spinor $\psi^\alpha$. The covariant components $\eta_{\dot1}$ and $\eta_{\dot2}$ therefore correspond, as the components of the wave function of a particle with spin $\tfrac12$, to the eigenvalues $\tfrac12$ and $-\tfrac12$ of the spin component.

Spinors of higher rank are defined as sets of quantities which are transformed as products of the components of a number of spinors of rank one. The indices of these spinors of higher rank may be partly dotted and partly undotted. For example, there exist three types of spinors of rank two:

$$\xi^{\alpha\beta} \sim \xi^\alpha\Xi^\beta, \qquad \zeta^{\alpha\dot\beta} \sim \xi^\alpha\eta^{\dot\beta}, \qquad \eta^{\dot\alpha\dot\beta} \sim \eta^{\dot\alpha}\mathrm H^{\dot\beta}.$$

In this respect, the statement of just the total rank of a spinor does not uniquely define it; we shall therefore, where necessary, indicate the rank as a pair of numbers $(k, l)$, the numbers of undotted and dotted indices respectively.
Since the transformations (17.1) and (17.8) are algebraically independent, it is not necessary to specify the sequence of dotted and undotted indices; in this sense the spinors $\zeta^{\alpha\dot\beta}$ and $\zeta^{\dot\beta\alpha}$, for example, are the same.

In order to be invariant, every spinor equation must have on each side the same numbers of undotted and dotted indices, since otherwise the equation could not remain valid when the frame of reference was changed. Here we must remember that taking the complex conjugate implies interchanging dotted and undotted indices. The relationship $\eta^{\dot\alpha\dot\beta} = (\xi^{\alpha\beta})^*$ between two spinors is therefore invariant.

Spinors or their products can be contracted only with respect to pairs of indices of the same kind (dotted or undotted); summation with respect to two indices of different kinds is not an invariant operation. Hence, from the spinor

$$\zeta^{\alpha_1\alpha_2\ldots\alpha_k\,\dot\beta_1\dot\beta_2\ldots\dot\beta_l}, \tag{17.10}$$

which is symmetrical in all $k$ undotted indices and in all $l$ dotted indices, we can obtain no spinor of lower rank (since contraction with respect to a pair of indices in which the spinor is symmetrical gives zero). Thus we cannot construct from the quantities (17.10) a smaller number of linear combinations of them which in turn are transformed into linear combinations of themselves by every transformation in the group. That is, the symmetrical 4-spinors are realizations of the irreducible representations of the proper Lorentz group. Each irreducible representation is specified by the pair of numbers $(k, l)$.

Each spinor index takes two values, and there are therefore $k + 1$ essentially different sets of numbers $\alpha_1, \alpha_2, \ldots, \alpha_k$ in (17.10) (containing $0, 1, 2, \ldots, k$ ones and $k, k - 1, \ldots, 0$ twos) and $l + 1$ sets of numbers $\beta_1, \beta_2, \ldots, \beta_l$. The symmetrical spinor of rank $(k, l)$ thus has a total of $(k + 1)(l + 1)$ independent components, and this is also the dimension of the corresponding irreducible representation.

§18.
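The relation between spinors and 4-vectors

The spinor $\zeta^{\alpha\dot\beta}$, with one undotted and one dotted index, has $2 \times 2 = 4$ independent components, the same number as a 4-vector. It is therefore clear that both are realizations of the same irreducible representation of the proper Lorentz group, and that there must consequently be a certain relation between their components.

In order to ascertain this relation, let us first consider the corresponding relation in the three-dimensional case, using the fact that 3-spinors and 4-spinors must behave in the same manner with respect to purely spatial rotations.

For the three-dimensional spinor $\psi^{\alpha}{}_{\beta}$, the correspondence formulae are as shown in QM, §57; they will here be written as

$$a_x = \tfrac12\left(\psi^{1}{}_{2} + \psi^{2}{}_{1}\right), \qquad a_y = \tfrac12 i\left(\psi^{1}{}_{2} - \psi^{2}{}_{1}\right), \qquad a_z = \tfrac12\left(\psi^{1}{}_{1} - \psi^{2}{}_{2}\right),$$

where $a_x, a_y, a_z$ are the components of a three-dimensional vector $\mathbf a$. For the four-dimensional case, the components $\psi^{\alpha}{}_{\beta}$ must be replaced by $\zeta^{\alpha\dot\beta}$, and $a_x, a_y, a_z$ must be taken to be the contravariant components $a^1, a^2, a^3$ of a 4-vector.

The form of the expression for the fourth component $a^0$ is evident from the fact, noted in §17, that the quantity (17.6) must transform as $a^0$. Hence $a^0 \sim \zeta^{1\dot1} + \zeta^{2\dot2}$, the coefficient of proportionality being determined so that the scalar $\zeta_{\alpha\dot\beta}\zeta^{\alpha\dot\beta}$ is the same as the scalar $2a_\mu a^\mu \equiv 2a^2$. Thus we obtain the correspondence formulae

$$a^1 = \tfrac12\left(\zeta^{1\dot2} + \zeta^{2\dot1}\right), \quad a^2 = \tfrac12 i\left(\zeta^{1\dot2} - \zeta^{2\dot1}\right), \quad a^3 = \tfrac12\left(\zeta^{1\dot1} - \zeta^{2\dot2}\right), \quad a^0 = \tfrac12\left(\zeta^{1\dot1} + \zeta^{2\dot2}\right). \tag{18.1}$$

The inverse formulae are

$$\begin{aligned}
\zeta^{1\dot1} &= \zeta_{2\dot2} = a^3 + a^0, & \zeta^{2\dot2} &= \zeta_{1\dot1} = a^0 - a^3,\\
\zeta^{1\dot2} &= -\zeta_{2\dot1} = a^1 - i a^2, & \zeta^{2\dot1} &= -\zeta_{1\dot2} = a^1 + i a^2,
\end{aligned} \tag{18.2}$$

with

$$\zeta_{\alpha\dot\beta}\zeta^{\alpha\dot\beta} = 2a^2. \tag{18.3}$$

Moreover,

$$\zeta_{\alpha\dot\beta}\zeta^{\gamma\dot\beta} = a^2\,\delta_\alpha^\gamma, \tag{18.4}$$

as is seen from the fact that the spinor $\zeta_{\alpha\dot\beta}\zeta_{\gamma}{}^{\dot\beta}$, of rank two, is antisymmetric in the indices $\alpha, \gamma$, and is therefore proportional to the metric spinor.

The correspondence between the spinor $\zeta^{\alpha\dot\beta}$ and the 4-vector is a particular case of a general rule: any symmetrical spinor of rank $(k, k)$ is equivalent to a symmetrical 4-tensor of rank $k$ which is irreducible (i.e. which gives zero on contraction with respect to any pair of indices).

The relation between the spinor and the 4-vector may be written in a compact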
form by means of the two-rowed Pauli matrices†

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{18.5}$$

If the matrix of the quantities $\zeta^{\alpha\dot\beta}$ (with the indices raised and the first undotted) is symbolized by $\zeta$, then formulae (18.2) become

$$\zeta = \mathbf a\cdot\boldsymbol\sigma + a^0, \tag{18.6}$$

the second term denoting of course the product of $a^0$ and a unit matrix. The inverse formulae are

$$\mathbf a = \tfrac12\operatorname{tr}(\zeta\boldsymbol\sigma), \qquad a^0 = \tfrac12\operatorname{tr}\zeta. \tag{18.7}$$

Using formulae (18.6), (18.7), we can determine the relation between the laws of transformation of the 4-vector and the spinor, and thus express the law of transformation of the spinor in terms of the parameters of rotations of the 4-coordinates.

We write the transformation of the spinor $\xi^\alpha$ in the form

$$\xi^{\alpha\prime} = (B\xi)^\alpha, \qquad B = \begin{pmatrix} \alpha & \beta \\ \gamma & \delta \end{pmatrix}, \tag{18.8}$$

where $B$ is a two-rowed matrix formed from the coefficients of the binary transformation. Then the transformation of the dotted spinor is

$$\eta^{\dot\beta\prime} = (B^*\eta)^{\dot\beta} = (\eta B^+)^{\dot\beta}, \tag{18.9}$$

and the transformation of the spinor $\zeta^{\alpha\dot\beta} \sim \xi^\alpha\eta^{\dot\beta}$, of rank two, may be symbolized as‡

$$\zeta' = B\zeta B^+.$$

For the infinitesimal transformation $B = 1 + \lambda$, where $\lambda$ is a small matrix, we have as far as first-order quantities

$$\zeta' = \zeta + (\lambda\zeta + \zeta\lambda^+). \tag{18.10}$$

Let us first consider the Lorentz transformation to a frame of reference moving with an infinitesimal velocity $\delta\mathbf V$ (without change in direction of the space coordinate axes). Then the 4-vector $a^\mu = (a^0, \mathbf a)$ is transformed as follows:

$$\mathbf a' = \mathbf a - a^0\,\delta\mathbf V, \qquad a^{0\prime} = a^0 - \mathbf a\cdot\delta\mathbf V. \tag{18.11}$$

† To simplify the notation, matrix operators acting on spin variables are written without circumflexes.

‡ For the covariant components we have

$$\xi'_\alpha = (\tilde B^{-1}\xi)_\alpha = (\xi B^{-1})_\alpha, \qquad \eta'_{\dot\alpha} = (\eta B^{*-1})_{\dot\alpha}, \tag{18.8a}$$

so that the product $\xi^\alpha\Xi_\alpha$ of two spinors remains invariant.

We now make use of formulae (18.7).
The transformation of $a^0$ may be represented, firstly, as

$$a^{0\prime} = a^0 - \mathbf a\cdot\delta\mathbf V = a^0 - \tfrac12\operatorname{tr}(\zeta\,\boldsymbol\sigma\cdot\delta\mathbf V);$$

secondly, as

$$a^{0\prime} = \tfrac12\operatorname{tr}\zeta' = a^0 + \tfrac12\operatorname{tr}(\lambda\zeta + \zeta\lambda^+) = a^0 + \tfrac12\operatorname{tr}\zeta(\lambda + \lambda^+).$$

These two expressions must be identically equal (i.e. equal for all values of $\zeta$). Hence

$$\lambda + \lambda^+ = -\boldsymbol\sigma\cdot\delta\mathbf V.$$

Treating the transformation of $\mathbf a$ in the same way, we find

$$\boldsymbol\sigma\lambda + \lambda^+\boldsymbol\sigma = -\delta\mathbf V.$$

These equations, as equations for $\lambda$, have the solution

$$\lambda = \lambda^+ = -\tfrac12\boldsymbol\sigma\cdot\delta\mathbf V.$$

Thus an infinitesimal Lorentz transformation of the spinor $\xi^\alpha$ has the matrix

$$B = 1 - \tfrac12\boldsymbol\sigma\cdot\mathbf n\,\delta V, \tag{18.12}$$

where $\mathbf n$ is a unit vector in the direction of the velocity $\delta\mathbf V$. From this we can easily find the transformation for a finite velocity $V$. To do so, we recall that a Lorentz transformation signifies (geometrically) a rotation of the 4-coordinates in the plane of $t$ and $\mathbf n$ through an angle $\phi$ which is related to the velocity $V$ by† $\tanh\phi = V$. An angle $\delta\phi = \delta V$ corresponds to an infinitesimal transformation, and a rotation through a finite angle $\phi$ is carried out by a $(\phi/\delta\phi)$-fold repetition of a rotation through $\delta\phi$. Raising the operator (18.12) to the power $\phi/\delta\phi$ and taking the limit $\delta\phi \to 0$, we obtain

$$B = e^{-\frac12\phi\,\mathbf n\cdot\boldsymbol\sigma}. \tag{18.13}$$

The mathematical significance of this operator is seen by noticing that, from the properties of the Pauli matrices, all even powers of $\mathbf n\cdot\boldsymbol\sigma$ are equal to 1, and all odd powers are equal to $\mathbf n\cdot\boldsymbol\sigma$. Since the expansions of the hyperbolic sine and cosine contain respectively odd and even powers of the argument, we have finally

$$B = \cosh\tfrac12\phi - \mathbf n\cdot\boldsymbol\sigma\,\sinh\tfrac12\phi, \qquad \tanh\phi = V. \tag{18.14}$$

† The metric is pseudo-Euclidean in planes containing the time axis.

The matrices $B$ of the Lorentz transformations are Hermitian: $B = B^+$.

Let us now consider an infinitesimal rotation of the space coordinates. The three-dimensional vector $\mathbf a$ is transformed as follows:

$$\mathbf a' = \mathbf a - \delta\boldsymbol\theta\times\mathbf a, \tag{18.15}$$

where $\delta\boldsymbol\theta$ is the vector of the infinitesimal angle of rotation. The corresponding transformation of a spinor may be found similarly.
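The finite boost matrix (18.14) can be tested against the ordinary Lorentz transformation of a 4-vector. The sketch below is an illustration under stated assumptions, not the authors' procedure: the helper names (`spinor_from_vector`, `vector_from_spinor`, `boost_matrix`, `boost_vector`) are hypothetical, units with $c = 1$ are used, and the limit leading to (18.13) is approximated by a large but finite number of repetitions of (18.12).

```python
import numpy as np

# Pauli matrices (18.5), stacked as sigma[0], sigma[1], sigma[2] = x, y, z.
sigma = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def spinor_from_vector(a0, a):
    """zeta = a.sigma + a0, formula (18.6)."""
    return np.einsum("i,ijk->jk", a, sigma) + a0 * np.eye(2)

def vector_from_spinor(zeta):
    """a0 = tr(zeta)/2 and a = tr(zeta sigma)/2, formulae (18.7)."""
    a = np.array([0.5 * np.trace(zeta @ s) for s in sigma])
    return 0.5 * np.trace(zeta), a

def boost_matrix(n, V):
    """B = cosh(phi/2) - n.sigma sinh(phi/2) with tanh(phi) = V, formula (18.14)."""
    phi = np.arctanh(V)
    n_sigma = np.einsum("i,ijk->jk", n, sigma)
    return np.cosh(phi / 2) * np.eye(2) - np.sinh(phi / 2) * n_sigma

def boost_vector(a0, a, n, V):
    """Usual Lorentz transformation to a frame moving with velocity V along n."""
    gamma = 1.0 / np.sqrt(1.0 - V**2)
    a_par = np.dot(a, n)
    a0_new = gamma * (a0 - V * a_par)
    a_new = a + ((gamma - 1.0) * a_par - gamma * V * a0) * n
    return a0_new, a_new

n = np.array([1.0, 2.0, 2.0]) / 3.0          # unit vector along the velocity
V = 0.6
a0, a = 1.5, np.array([0.3, -0.7, 0.2])

B = boost_matrix(n, V)
zeta_new = B @ spinor_from_vector(a0, a) @ B.conj().T   # zeta' = B zeta B^+
a0_s, a_s = vector_from_spinor(zeta_new)
a0_v, a_v = boost_vector(a0, a, n, V)

# Repeating the infinitesimal transformation (18.12) N times approximates (18.13).
N = 2**20
phi = np.arctanh(V)
step = np.eye(2) - 0.5 * (phi / N) * np.einsum("i,ijk->jk", n, sigma)
B_limit = np.linalg.matrix_power(step, N)

print("spinor route:", a0_s.real, a_s.real)
print("vector route:", a0_v, a_v)
```

For small $V$ the function `boost_vector` reduces to (18.11), so agreement of the two routes confirms that (18.14) is the finite form of the infinitesimal law.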
There is no need to derive the spinor transformation for rotations in this way, however, since the behaviour of 4-spinors under spatial rotations is the same as that of 3-spinors, and the transformation of the latter is known from the general relationship between the spin operator and the operator of an infinitesimal rotation:

$$B = 1 + \tfrac12 i\,\boldsymbol\sigma\cdot\delta\boldsymbol\theta. \tag{18.16}$$

The change to a rotation through a finite angle $\theta$ is made in the same way as that from (18.12) to (18.14):

$$B = \exp\left(\tfrac12 i\theta\,\mathbf n\cdot\boldsymbol\sigma\right) = \cos\tfrac12\theta + i\,\mathbf n\cdot\boldsymbol\sigma\,\sin\tfrac12\theta, \tag{18.17}$$

where $\mathbf n$ is a unit vector along the axis of rotation. This matrix is unitary ($B^+ = B^{-1}$), as it should be for a spatial rotation.

§19. Inversion of spinors

The discussion (in QM) of the three-dimensional theory of spinors did not consider their behaviour under the operation of spatial inversion, since in the non-relativistic theory this would not have led to any new physical results. Here we shall examine the point, however, in order to make clearer the subsequent analysis of the inversion properties of 4-spinors.

The operation of inversion does not alter the sign of the spin vector, or of any axial vector, and the spin component $s_z$ is therefore also unchanged in value. Hence it follows that inversion can change each component of the spinor $\psi^\alpha$ only into a multiple of itself:

$$\psi^\alpha \to P\psi^\alpha, \tag{19.1}$$

where $P$ is a constant factor. On repeating the inversion, we return to the original coordinates. For a spinor, however, a return to the original position can be regarded in two different ways, as a rotation through 0° or 360°. These two definitions are not equivalent with respect to spinors, since $\psi^\alpha$ changes sign on rotation through 360°. Thus two alternative views of inversion are possible: one where

$$P^2 = 1, \qquad P = \pm1, \tag{19.2}$$

and one where

$$P^2 = -1, \qquad P = \pm i. \tag{19.3}$$

Here it is important to note that the concept of inversion must be defined in the same way for all spinors. It is not permissible for different spinors to behave differently under inversion (i.e.
in accordance with both (19.2) and (19.3)), since in that case it would not be possible to construct a scalar (or a pseudoscalar) from every pair of spinors: if the spinor $\psi^\alpha$ were transformed according to (19.2), and $\phi^\alpha$ according to (19.3), then the quantity $\psi^\alpha\phi_\alpha$ would be multiplied by $\pm i$ under inversion, instead of remaining constant (or simply changing sign).

It should be emphasized that (whatever the definition of inversion) the assignment of a particular parity $P$ to a spinor has no absolute significance, since spinors change sign on rotation through $2\pi$, and this can always be carried out simultaneously with inversion. The "relative parity" of two spinors, defined as the parity of the scalar $\psi^\alpha\phi_\alpha$ formed from them, has absolute significance, however; on rotation through $2\pi$, both spinors change sign, and the indeterminacy therefore does not influence the parity of this scalar.

Let us now go on to discuss four-dimensional spinors, first noting that inversion changes the sign of only three coordinates $x, y, z$ out of four $x, y, z, t$; it therefore commutes with spatial rotations but not with transformations which rotate the $t$-axis. If $L$ is the Lorentz transformation to a frame of reference moving with velocity $\mathbf V$, then $PL = L'P$, where $L'$ is the transformation to a frame moving with velocity $-\mathbf V$.

Hence it follows that the components of the 4-spinor $\xi^\alpha$ cannot be transformed into multiples of themselves under inversion. If the inversion of the spinor $\xi^\alpha$ were given by the transformation (19.1) as before (i.e. if it were represented by a matrix proportional to the unit matrix), it would commute with every Lorentz transformation, and this certainly cannot be true, since the operations $L$ and $L'$ are not the same when applied to $\xi^\alpha$.

Thus inversion must transform the components of the spinor $\xi^\alpha$ into expressions involving other quantities. The latter can only be the components of some other spinor $\eta_{\dot\alpha}$ whose transformation properties are not the same as those of $\xi^\alpha$.
Since inversion does not affect the $z$-component of the spin (as mentioned above), the components $\xi^1$ and $\xi^2$ can only become $\eta_{\dot1}$ and $\eta_{\dot2}$ on inversion, these corresponding to the same values $s_z = \tfrac12$ and $s_z = -\tfrac12$. If inversion is taken to be an operation which gives identity when carried out twice, its effect may be expressed by the formulae

$$\xi^\alpha \to \eta_{\dot\alpha}, \qquad \eta_{\dot\alpha} \to \xi^\alpha. \tag{19.4}$$

For the covariant components $\xi_\alpha$ and contravariant components $\eta^{\dot\alpha}$, these transformations change sign:

$$\xi_\alpha \to -\eta^{\dot\alpha}, \qquad \eta^{\dot\alpha} \to -\xi_\alpha, \tag{19.4a}$$

since the lowering and raising of the same index lead to opposite signs (cf. (17.5) and (17.9)).† If, however, inversion is taken in the sense such that $P^2 = -1$, its effect is given by

$$\xi^\alpha \to i\eta_{\dot\alpha}, \qquad \eta_{\dot\alpha} \to i\xi^\alpha \tag{19.5}$$

or, equivalently,

$$\xi_\alpha \to -i\eta^{\dot\alpha}, \qquad \eta^{\dot\alpha} \to -i\xi_\alpha. \tag{19.5a}$$

There is a certain difference between the two definitions of inversion in that with the second definition complex-conjugate spinors are transformed in the same manner: if $\Xi_\alpha = \eta_{\dot\alpha}^{\,*}$, $\mathrm H^{\dot\alpha} = \xi^{\alpha*}$, then by (19.5)

$$\Xi_\alpha \to -i\mathrm H^{\dot\alpha}, \qquad \mathrm H^{\dot\alpha} \to -i\Xi_\alpha,$$

i.e. the rule is the same as for $\xi_\alpha$, $\eta^{\dot\alpha}$. According to the definition (19.4), however, we should obtain $\Xi_\alpha \to \mathrm H^{\dot\alpha}$, $\mathrm H^{\dot\alpha} \to \Xi_\alpha$, which is opposite in sign to the transformation of the spinors $\xi_\alpha$, $\eta^{\dot\alpha}$. We shall return in §27 to some possible physical consequences of this difference. In the following, the definition (19.5) will be used.

The spinors $\xi^\alpha$ and $\eta_{\dot\alpha}$ are, as we know, transformed in the same way by the rotation subgroup. On taking the combinations

$$\xi^\alpha \pm \eta_{\dot\alpha} \tag{19.6}$$

we obtain quantities which are transformed under inversion according to (19.1) with $P = \pm i$. These combinations, however, do not behave as spinors under all the transformations of the Lorentz group.

Thus the inclusion of inversion in the symmetry group makes necessary the simultaneous treatment of a pair of spinors $(\xi^\alpha, \eta_{\dot\alpha})$; this is called a bispinor (of rank one). The four components of a bispinor form a realization of one of the irreducible representations of the extended Lorentz group.
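The scalar product of two bispinors $(\xi^\alpha, \eta_{\dot\alpha})$ and $(\Xi^\alpha, \mathrm H_{\dot\alpha})$ can be formed in two ways. The quantity

$$\xi^\alpha\Xi_\alpha + \eta_{\dot\alpha}\mathrm H^{\dot\alpha} \tag{19.7}$$

is unchanged by inversion, i.e. it is a true scalar. The quantity

$$\xi^\alpha\Xi_\alpha - \eta_{\dot\alpha}\mathrm H^{\dot\alpha} \tag{19.8}$$

is also invariant under rotations of the 4-coordinates, but changes sign under inversion, i.e. it is a pseudoscalar.

† The definition (19.4) is, of course, to some extent arbitrary, since the quantities $\xi^\alpha$ and $\eta_{\dot\alpha}$ are independent. For instance, if $\eta_{\dot\alpha}$ is replaced by a new spinor $\eta'_{\dot\alpha} = e^{i\delta}\eta_{\dot\alpha}$, (19.4) is replaced by the equivalent definition $\xi^\alpha \to e^{-i\delta}\eta'_{\dot\alpha}$, $\eta'_{\dot\alpha} \to e^{i\delta}\xi^\alpha$.

A spinor of rank two, $\zeta^{\alpha\dot\beta}$, may also be defined in two ways. If it is defined by the transformation rule

$$\zeta^{\alpha\dot\beta} \sim \xi^\alpha\mathrm H^{\dot\beta} + \Xi^\alpha\eta^{\dot\beta} \tag{19.9}$$

we obtain quantities which are transformed under inversion as follows:

$$\zeta^{\alpha\dot\beta} \to \zeta_{\dot\alpha\beta}. \tag{19.10}$$

The 4-vector $a^\mu$ to which such a spinor is equivalent is transformed, according to (18.1), by $(a^0, \mathbf a) \to (a^0, -\mathbf a)$, i.e. it is a true 4-vector, and the three-dimensional vector $\mathbf a$ is a polar vector. It is also possible, however, to define $\zeta^{\alpha\dot\beta}$ thus:

$$\zeta^{\alpha\dot\beta} \sim \xi^\alpha\mathrm H^{\dot\beta} - \Xi^\alpha\eta^{\dot\beta}. \tag{19.11}$$

Then†

$$\zeta^{\alpha\dot\beta} \to -\zeta_{\dot\alpha\beta}. \tag{19.12}$$

Such a spinor corresponds to a 4-vector such that under inversion $(a^0, \mathbf a) \to (-a^0, \mathbf a)$, i.e. a 4-pseudovector (the three-dimensional vector $\mathbf a$ being an axial vector).

Symmetrical spinors of rank two, with indices of the same type, are defined by

$$\xi^{\alpha\beta} \sim \xi^\alpha\Xi^\beta + \xi^\beta\Xi^\alpha, \qquad \eta_{\dot\alpha\dot\beta} \sim \eta_{\dot\alpha}\mathrm H_{\dot\beta} + \eta_{\dot\beta}\mathrm H_{\dot\alpha}. \tag{19.13}$$

On inversion they are transformed into each other:

$$\xi^{\alpha\beta} \to -\eta_{\dot\alpha\dot\beta}, \qquad \eta_{\dot\alpha\dot\beta} \to -\xi^{\alpha\beta}. \tag{19.14}$$

The pair $(\xi^{\alpha\beta}, \eta_{\dot\alpha\dot\beta})$ forms a bispinor of rank two. It has $3 + 3 = 6$ independent components. The antisymmetric 4-tensor of rank two $a^{\mu\nu}$ also has this number of independent components. There must therefore be a certain correspondence between the bispinor and the tensor; both are realizations of equivalent irreducible representations of the extended Lorentz group.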
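As a check on the statement following (19.10), the inversion can be followed component by component. The short worked version below is a sketch that uses only (18.2) and (19.10); in $\zeta_{\dot\alpha\beta}$ the undotted index is $\beta$, so that, for example, $\zeta^{1\dot2}$ goes over into the covariant component with undotted index 2 and dotted index 1.

```latex
\begin{aligned}
\zeta^{1\dot1} = a^0 + a^3 \;&\to\; \zeta_{1\dot1} = \zeta^{2\dot2} = a^0 - a^3, \\
\zeta^{2\dot2} = a^0 - a^3 \;&\to\; \zeta_{2\dot2} = \zeta^{1\dot1} = a^0 + a^3, \\
\zeta^{1\dot2} = a^1 - i a^2 \;&\to\; \zeta_{2\dot1} = -\zeta^{1\dot2} = -(a^1 - i a^2), \\
\zeta^{2\dot1} = a^1 + i a^2 \;&\to\; \zeta_{1\dot2} = -\zeta^{2\dot1} = -(a^1 + i a^2).
\end{aligned}
```

Comparing the two sides gives $a^0 \to a^0$ and $a^i \to -a^i$, i.e. $(a^0, \mathbf a) \to (a^0, -\mathbf a)$. With the extra minus sign of (19.12) every line changes sign, which gives $(a^0, \mathbf a) \to (-a^0, \mathbf a)$.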
Since the spinors $\xi^{\alpha\beta}$ and $\eta_{\dot\alpha\dot\beta}$ are transformed independently by the proper Lorentz group, we can construct from the components of the 4-tensor $a^{\mu\nu}$ two groups of quantities which are transformed only into combinations of one another under any rotation of the 4-coordinates. This division is achieved as follows.

We define a three-dimensional polar vector $\mathbf p$ and a three-dimensional axial vector $\mathbf a$ related to the components of the 4-tensor $a^{\mu\nu}$ by

$$a^{\mu\nu} = \begin{pmatrix} 0 & p_x & p_y & p_z \\ -p_x & 0 & -a_z & a_y \\ -p_y & a_z & 0 & -a_x \\ -p_z & -a_y & a_x & 0 \end{pmatrix} \equiv (\mathbf p, \mathbf a), \tag{19.15}$$

where $(\mathbf p, \mathbf a)$ is a concise notation which we shall use in order to specify the components of such a tensor. Then $a_{\mu\nu} = (-\mathbf p, \mathbf a)$, and, of the two quantities

$$\mathbf a^2 - \mathbf p^2 = \tfrac12 a_{\mu\nu}a^{\mu\nu}, \qquad \mathbf a\cdot\mathbf p = \tfrac18 e_{\lambda\mu\nu\rho}a^{\lambda\mu}a^{\nu\rho},$$

the first is a scalar and the second a pseudoscalar; both are invariant under the proper Lorentz group. The squares of the three-dimensional vectors $\mathbf f^\pm = \mathbf p \pm i\mathbf a$ are therefore also invariant. Thus any rotation in 4-space is equivalent, as regards the vectors $\mathbf f^\pm$, to a "rotation" in 3-space, through angles which are in general complex; the six angles of rotation in 4-space correspond to three complex "angles of rotation" of the three-dimensional coordinates. The operation of spatial inversion changes the sign of $\mathbf p$ but not that of $\mathbf a$, and converts the vectors $\mathbf f^+$ and $-\mathbf f^-$ into each other. The components of these vectors are the required two groups of quantities formed from the components of the tensor $a^{\mu\nu}$.

† It must be emphasized that the transformation rules (19.10) and (19.12), which differ in the sign on the right, are not equivalent, since components of the same spinor appear on both sides (cf. the last footnote).

This also makes evident the correspondence between the components of the 4-tensor $a^{\mu\nu}$ and the spinors $\xi^{\alpha\beta}$, $\eta_{\dot\alpha\dot\beta}$.
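The invariance of $\mathbf a^2 - \mathbf p^2$, $\mathbf a\cdot\mathbf p$ and of the squares of $\mathbf f^\pm$ can be verified numerically. The following sketch is illustrative only: the function names are hypothetical, the tensor is laid out as in (19.15), and the proper Lorentz transformation is taken, as an assumption for the example, to be a boost combined with a rotation about the $z$-axis.

```python
import numpy as np

def tensor_from_vectors(p, a):
    """Antisymmetric a^{mu nu} = (p, a) with the component layout of (19.15)."""
    px, py, pz = p
    ax, ay, az = a
    return np.array([
        [0.0,  px,  py,  pz],
        [-px, 0.0, -az,  ay],
        [-py,  az, 0.0, -ax],
        [-pz, -ay,  ax, 0.0],
    ])

def vectors_from_tensor(t):
    """Read the polar vector p and the axial vector a back from a^{mu nu}."""
    p = np.array([t[0, 1], t[0, 2], t[0, 3]])
    a = np.array([t[3, 2], t[1, 3], t[2, 1]])
    return p, a

def boost(n, V):
    """4x4 matrix of the transformation to a frame moving with velocity V along n."""
    gamma = 1.0 / np.sqrt(1.0 - V**2)
    L = np.eye(4)
    L[0, 0] = gamma
    L[0, 1:] = L[1:, 0] = -gamma * V * n
    L[1:, 1:] += (gamma - 1.0) * np.outer(n, n)
    return L

def rotation_z(angle):
    """4x4 matrix of a rotation of the space axes about z."""
    c, s = np.cos(angle), np.sin(angle)
    R = np.eye(4)
    R[1:3, 1:3] = [[c, s], [-s, c]]
    return R

p = np.array([0.4, -1.1, 0.7])
a = np.array([0.9, 0.2, -0.5])
t = tensor_from_vectors(p, a)

n = np.array([2.0, -1.0, 2.0]) / 3.0
Lam = boost(n, 0.8) @ rotation_z(0.3)
p2, a2 = vectors_from_tensor(Lam @ t @ Lam.T)    # contravariant tensor law

f_plus, f_plus2 = p + 1j * a, p2 + 1j * a2
f_minus = p - 1j * a

# Spatial inversion: x, y, z change sign, t does not.
P = np.diag([1.0, -1.0, -1.0, -1.0])
p_inv, a_inv = vectors_from_tensor(P @ t @ P.T)

print("f+^2 before/after:", f_plus @ f_plus, f_plus2 @ f_plus2)
```

The square of $\mathbf f^+$ is formed without complex conjugation, $\mathbf f^{+2} = \mathbf p^2 - \mathbf a^2 + 2i\,\mathbf a\cdot\mathbf p$, so its invariance expresses the invariance of both quantities at once.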
Since the Lorentz group contains as a subgroup the spatial rotations, the relations between the components of the spinor and those of the three-dimensional vector must be the same as for three-dimensional spinors; for the components of $\mathbf f^+$ and $\mathbf f^-$ these relations are the formulae (19.16).

PROBLEM

Derive the general correspondence between spinors of even rank and 4-tensors.

SOLUTION. All spinors for which $k + l$ is even are realizations of single-valued irreducible representations of the extended Lorentz group, and are therefore equivalent to the 4-tensors which are realizations of similar representations.†

A spinor of rank $(k, k)$ can be defined so that it is transformed under inversion by

$$\zeta^{\alpha\beta\ldots\,\dot\gamma\dot\delta\ldots} \to \pm\,\zeta_{\dot\alpha\dot\beta\ldots\,\gamma\delta\ldots}. \tag{1}$$

Such a spinor is equivalent to a symmetrical irreducible 4-tensor of rank $k$, which is a true tensor or a pseudotensor according to the sign in (1).

Spinors of ranks $(k, l)$ and $(l, k)$, forming a bispinor, are transformed into each other under inversion. When $l = k + 2$, the bispinor is equivalent to an irreducible 4-tensor $a_{[\mu\nu]\rho\sigma\ldots}$ of rank $k + 2$, antisymmetric in the indices $[\mu\nu]$ and symmetric in all the other indices. The irreducibility of this tensor signifies that it gives zero on contraction with respect to any pair of indices and on dualization with respect to any three indices (i.e. $e^{\lambda\mu\nu\rho}a_{[\mu\nu]\rho\sigma\ldots} = 0$); the latter condition implies that the result is zero on taking the cyclic sum over three indices, $\mu\nu$ and any one other.

When $l = k + 4$, the bispinor is equivalent to an irreducible 4-tensor $a_{[\lambda\mu][\nu\rho]\sigma\ldots}$ of rank $k + 4$, having the following properties: it is antisymmetric in the pairs of indices $[\lambda\mu]$ and $[\nu\rho]$, symmetric in all others, symmetric for interchange of $[\lambda\mu]$ with $[\nu\rho]$, and gives zero on contraction with respect to any pair of indices and on dualization with respect to any three indices.

Generally, when $l = k + 2n$, the bispinor is equivalent to an irreducible 4-tensor of rank $k + 2n$, antisymmetric in $n$ pairs of indices and symmetric in the other $k$ indices. 4-tensors antisymmetric in
4-tensors antisymmetric in t Spinors of odd rank are realizations of fwo-valued representations of the group: a spatial rotation through 360° changes the sign of spinors, so that two matrices of opposite sign correspond to each element of the group. §20 Dime's Equation in the Spinor Representation 73 larger numbers (threes, fours, etc.) of indices do not appear in this classification, for the obvious reason that an antisymmetric tensor of rank 3 is equivalent (dual) to a pseudovector, and an antisymmetric tensor of rank 4 reduces to a scalar (is proportional to the unit pseudotensor ex^p)\ antisymmetry in a still greater number of indices is not possible in 4-space. § 20. Dirac's equation in the spinor representation A particle with spin 2 is described, in its rest frame, by a two-component wave function, i.e. a three-dimensional spinor. The "four-dimensional origin" of this may be either an undotted or a dotted 4-spinor. Both these 4-spinors appear in the description of the particle in an arbitrary frame of reference; we shall denote them by r and rj a .t For a free particle, the only operator which can appear in the wave equation is (as shown in §10) the 4-momentum operator pM = îdM. In the spinor notation, this 4-vector corresponds to the operator spinor paßy with P,, = P22 = Pz+Po, Mi p l = -P2i = p* -iPy. P 22 = Pii = Po-Pz, 1 21 P = - P i i = P* + iPyJ (20.1) The wave equation is a linear differential relation between the components of spinors, expressed by the operator paß. The requirement of relativistic invariance leads to the equations A aß ^ . —»£<* p p r)ß - mÇ , (20.2) PßaCa = wi?ß, where m is a dimensional constant. There would be no meaning in using different constants m\ and m2 here, or in changing the sign of m, since the equations could still be reduced to the above form by an appropriate transformation of £ a or r)a. 
By substituting $\eta_{\dot\beta}$ from the second equation (20.2) in the first, we can eliminate one of the two spinors:

$$p^{\alpha\dot\beta}p_{\dot\beta\gamma}\,\xi^\gamma = m^2\xi^\alpha.$$

From (18.4), $p^{\alpha\dot\beta}p_{\dot\beta\gamma} = p^2\delta^\alpha_\gamma$, and thus we obtain

$$(p^2 - m^2)\,\xi^\alpha = 0, \tag{20.3}$$

whence it is evident that $m$ is the mass of the particle.

It should be noticed that the need to use the mass in the wave equation implies the simultaneous consideration of two spinors ($\xi^\alpha$ and $\eta_{\dot\alpha}$): with only one of these, it would not be possible to construct a relativistically invariant equation containing a dimensional parameter. The wave equation is necessarily invariant under spatial inversion if the transformation of the wave function is defined by

$$P:\quad \xi^\alpha \to i\eta_{\dot\alpha}, \qquad \eta_{\dot\alpha} \to i\xi^\alpha. \tag{20.4}$$

It is easily seen that the two equations (20.2) are interchanged by this substitution (together with $p^{\alpha\dot\beta} \to p_{\dot\alpha\beta}$, which is evident from (20.1)). Two spinors which are interchanged by inversion form a four-component quantity, a bispinor.

† A three-dimensional spinor of rank one may also "originate" from 4-spinors of higher odd rank which, in the rest frame, become antisymmetric in one or more pairs of indices. These would, however, lead to higher-order equations (cf. the third footnote to §10).

The relativistic wave equation given by (20.2) is called Dirac's equation, having been first derived by Dirac in 1928. In order to analyse and apply this equation further, let us consider various ways in which it may be written.

Using (18.6), we can rewrite equations (20.2) as

$$(p_0 + \mathbf p\cdot\boldsymbol\sigma)\,\eta = m\xi, \qquad (p_0 - \mathbf p\cdot\boldsymbol\sigma)\,\xi = m\eta. \tag{20.5}$$

Here the symbols $\xi$ and $\eta$ denote two-component quantities, the spinors

$$\xi = \begin{pmatrix} \xi^1 \\ \xi^2 \end{pmatrix}, \qquad \eta = \begin{pmatrix} \eta_{\dot1} \\ \eta_{\dot2} \end{pmatrix} \tag{20.6}$$

(the first with upper and the second with lower indices). Here and below, multiplication of the matrices $\boldsymbol\sigma$ by any two-component quantity $f$ means multiplication by the usual matrix rule:

$$(\boldsymbol\sigma f)_\alpha = \boldsymbol\sigma_{\alpha\beta}f_\beta. \tag{20.7}$$

The vertical column notation for $f$ is in accordance with the multiplication of each row of $\boldsymbol\sigma$ by the column $f$. For subsequent reference, the Pauli matrices may be written once more:

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{20.8}$$

Their fundamental properties are

$$\sigma_i\sigma_k + \sigma_k\sigma_i = 2\delta_{ik}, \qquad \sigma_i\sigma_k = i e_{ikl}\sigma_l + \delta_{ik}; \tag{20.9}$$

see QM, §55.

We shall also give the wave equation satisfied by the complex-conjugate wave function formed from the spinors

$$\xi^* = (\xi^{1*}, \xi^{2*}), \qquad \eta^* = (\eta_{\dot1}^{\,*}, \eta_{\dot2}^{\,*}). \tag{20.10}$$

Since all the operators $p_\mu$ contain the factor $i$, $p_\mu^* = -p_\mu$. In taking the complex conjugate of both sides of equation (20.5), we must also use the fact that, since the matrices $\boldsymbol\sigma$ are Hermitian ($\boldsymbol\sigma^+ = \boldsymbol\sigma$),

$$(\boldsymbol\sigma f)^*_\alpha = \boldsymbol\sigma^*_{\alpha\beta}f^*_\beta = f^*_\beta\boldsymbol\sigma_{\beta\alpha} = (f^*\boldsymbol\sigma)_\alpha;$$

the resulting equations are

$$\eta^*(p_0 + \mathbf p\cdot\boldsymbol\sigma) = -m\xi^*, \qquad \xi^*(p_0 - \mathbf p\cdot\boldsymbol\sigma) = -m\eta^*. \tag{20.11}$$

Here it is conventionally implied that the operators $p_\mu$ act on the function to the left of them. The writing of $\xi^*$ and $\eta^*$ as horizontal rows is in accordance with the matrix multiplication in these equations: the row $f^*$ is multiplied by the columns of the matrix $\boldsymbol\sigma$,

$$(f^*\boldsymbol\sigma)_\alpha = f^*_\beta\boldsymbol\sigma_{\beta\alpha}. \tag{20.12}$$

The inversion transformation for $\xi^*$, $\eta^*$ is defined as the complex conjugate of the transformation (20.4):

$$P:\quad \xi^* \to -i\eta^*, \qquad \eta^* \to -i\xi^*. \tag{20.13}$$

§21. The symmetrical form of Dirac's equation

The spinor form of Dirac's equation is the most natural one, in the sense that its relativistic invariance is immediately apparent. In applications of the equation, however, other forms of the wave equation may be more convenient, which are obtained by a different choice of the four independent components of the wave function.

We shall denote the four-component wave function by the symbol $\psi$, with components $\psi_i$ ($i = 1, 2, 3, 4$). In the spinor representation, it is a bispinor:

$$\psi = \begin{pmatrix} \xi \\ \eta \end{pmatrix}. \tag{21.1}$$

But the independent components of $\psi$ can equally well be taken as any linearly independent combinations of components of the spinors $\xi$ and $\eta$.† We shall arbitrarily limit the acceptable linear transformations by the one condition of unitarity; such transformations leave unchanged the bilinear forms constructed from $\psi$ and $\psi^*$ (§28).
In the general case of an arbitrary choice of the components of $\psi$, Dirac's equation can be put in the form

$$p_\mu\gamma^\mu_{ik}\psi_k = m\psi_i,$$

where $\gamma^\mu$ ($\mu = 0, 1, 2, 3$) are certain four-rowed matrices (Dirac matrices). We shall usually write this equation in a symbolic form, omitting the matrix indices:

$$(\gamma p - m)\psi = 0, \tag{21.2}$$

where

$$\gamma p \equiv \gamma^\mu p_\mu = p_0\gamma^0 - \mathbf p\cdot\boldsymbol\gamma, \qquad \boldsymbol\gamma = (\gamma^1, \gamma^2, \gamma^3).$$

† For brevity, the four-component quantity $\psi$ will be referred to as a bispinor even in non-spinor representations.

For example, the spinor form of the equation with the components of $\psi$ as in (21.1) corresponds to the matrices‡

$$\gamma^0 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \boldsymbol\gamma = \begin{pmatrix} 0 & -\boldsymbol\sigma \\ \boldsymbol\sigma & 0 \end{pmatrix}, \tag{21.3}$$

as is easily seen by writing the equations (20.5) as

$$\begin{pmatrix} 0 & p_0 + \mathbf p\cdot\boldsymbol\sigma \\ p_0 - \mathbf p\cdot\boldsymbol\sigma & 0 \end{pmatrix}\begin{pmatrix} \xi \\ \eta \end{pmatrix} = m\begin{pmatrix} \xi \\ \eta \end{pmatrix}$$

and comparing with (21.2).

‡ Here and below, we use a compact two-rowed notation for four-rowed matrices. Each symbol in (21.3) represents a two-rowed matrix.

In the general case, the matrices $\gamma$ need satisfy only conditions ensuring that $p^2 = m^2$. To find these conditions, we multiply equation (21.2) on the left by $\gamma p$:

$$(\gamma^\mu p_\mu)(\gamma^\nu p_\nu)\psi = m(p_\mu\gamma^\mu)\psi = m^2\psi.$$

Since $p_\mu p_\nu$ is a symmetrical tensor (all the operators $p_\mu$ commute), this equation may be rewritten

$$\tfrac12 p_\mu p_\nu(\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu)\psi = m^2\psi,$$

and we must therefore have

$$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}. \tag{21.4}$$

Thus all the pairs of different matrices $\gamma^\mu$ anticommute, and their squares are

$$(\gamma^1)^2 = (\gamma^2)^2 = (\gamma^3)^2 = -1, \qquad (\gamma^0)^2 = 1. \tag{21.5}$$

Under an arbitrary unitary transformation of the components $\psi$: $\psi' = U\psi$, where $U$ is a unitary four-rowed matrix, the matrices $\gamma$ are transformed as follows:

$$\gamma' = U\gamma U^{-1} = U\gamma U^+, \tag{21.6}$$

so that the equation $(\gamma p - m)\psi = 0$ becomes $(\gamma' p - m)\psi' = 0$. The commutation relations (21.4) remain unchanged, of course.

The matrix $\gamma^0$ (21.3) is Hermitian, and the matrices $\boldsymbol\gamma$ are anti-Hermitian. These properties are preserved under any unitary transformation (21.6), and we therefore always have†

$$\boldsymbol\gamma^+ = -\boldsymbol\gamma, \qquad \gamma^{0+} = \gamma^0. \tag{21.7}$$

The equation for the complex-conjugate function $\psi^*$ may also be given.
Taking the complex conjugate of equation (21.2) and using the properties (21.7), we obtain

$$(-p_0\tilde\gamma^0 - \mathbf{p}\cdot\tilde{\boldsymbol\gamma} - m)\psi^* = 0.$$

We move $\psi^*$ to the left by means of $\tilde\gamma^\mu\psi^* = \psi^*\gamma^\mu$, and then multiply the whole equation on the right by $\gamma^0$; since $\boldsymbol\gamma\gamma^0 = -\gamma^0\boldsymbol\gamma$, we have in terms of a new bispinor

$$\bar\psi = \psi^*\gamma^0, \qquad \psi^* = \bar\psi\gamma^0 \tag{21.8}$$

the result

$$\bar\psi(\gamma p + m) = 0. \tag{21.9}$$

As in (20.11), the operator $p$ is here taken to act on the function to its left. The function $\bar\psi$ is called the Dirac conjugate (or relativistically conjugate) function to $\psi$. The factor $\gamma^0$ in its definition signifies that (in the spinor representation) it interchanges the spinors $\xi^*$ and $\eta^*$; thus, in $\bar\psi = (\eta^*, \xi^*)$ the first spinor is undotted (as in $\psi$) and the second is dotted. For this reason $\bar\psi$ is a more natural "partner" of $\psi$ than $\psi^*$ is; they appear together, for instance, in various bilinear combinations (see §28).

The inversion transformation for the wave function may be written as

$$P:\quad \psi \to i\gamma^0\psi, \qquad \bar\psi \to -i\bar\psi\gamma^0. \tag{21.10}$$

In the spinor representation of $\psi$, the matrix $\gamma^0$ interchanges the components $\xi$ and $\eta$, as should happen on inversion. The invariance of Dirac's equation under the transformation (21.10) in the general case is immediately obvious: changing $\mathbf{p}$ into $-\mathbf{p}$ and $\psi$ into $i\gamma^0\psi$ in equation (21.2), we have

$$(p_0\gamma^0 + \mathbf{p}\cdot\boldsymbol\gamma - m)\gamma^0\psi = 0.$$

Multiplying this equation on the left by $\gamma^0$ and taking into account the fact that $\gamma^0$ and $\boldsymbol\gamma$ anticommute, we return to the original equation.

† These equations may be written jointly in the form $\gamma^{\mu+} = \gamma^0\gamma^\mu\gamma^0$.

Multiplying the equation $(\gamma p - m)\psi = 0$ on the left by $\bar\psi$, and the equation $\bar\psi(\gamma p + m) = 0$ on the right by $\psi$, and adding, we obtain

$$\bar\psi\gamma^\mu(p_\mu\psi) + (\bar\psi p_\mu)\gamma^\mu\psi = p_\mu(\bar\psi\gamma^\mu\psi) = 0,$$

where the parentheses indicate the function on which the operator $p$ acts. This equation is in the form of an equation of continuity, $\partial_\mu j^\mu = 0$, so that

$$j^\mu = \bar\psi\gamma^\mu\psi = (\psi^*\psi,\ \psi^*\gamma^0\boldsymbol\gamma\psi) \tag{21.11}$$

is the particle current density 4-vector. Its time component $j^0 = \psi^*\psi$ is positive-definite.
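The algebraic relations (21.4), (21.5) and (21.7) can be checked numerically for the spinor-representation matrices (21.3). The following is only an illustrative sketch in plain Python; the helper names (`mul`, `block`, `satisfies_clifford`, and so on) are our own and are not part of the text.

```python
# Numerical check of (21.4), (21.5), (21.7) for the matrices (21.3).
# All helper names are illustrative assumptions, not notation from the text.

SIGMA = [
    [[0, 1], [1, 0]],       # sigma_x
    [[0, -1j], [1j, 0]],    # sigma_y
    [[1, 0], [0, -1]],      # sigma_z
]
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
METRIC = [1, -1, -1, -1]    # diagonal of g^{mu nu}


def mul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def dagger(a):
    """Hermitian conjugate."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


I4 = block(I2, Z2, Z2, I2)
# gamma^0 = [[0, 1], [1, 0]],  gamma^i = [[0, -sigma_i], [sigma_i, 0]]
GAMMA = [block(Z2, I2, I2, Z2)] + [block(Z2, scale(-1, s), s, Z2) for s in SIGMA]


def satisfies_clifford(gamma):
    """True if gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu}."""
    for mu in range(4):
        for nu in range(4):
            anti = add(mul(gamma[mu], gamma[nu]), mul(gamma[nu], gamma[mu]))
            g = METRIC[mu] if mu == nu else 0
            if not close(anti, scale(2 * g, I4)):
                return False
    return True


def satisfies_hermiticity(gamma):
    """True if (gamma^mu)^+ = gamma^0 gamma^mu gamma^0, equivalent to (21.7)."""
    return all(close(dagger(g), mul(mul(gamma[0], g), gamma[0])) for g in gamma)


print(satisfies_clifford(GAMMA), satisfies_hermiticity(GAMMA))
```

Since a unitary transformation (21.6) preserves both properties, the same two functions could be applied to the matrices of any other representation.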
Dirac's equation may be put in the form of an expression for the time derivative:

$$i\,\partial\psi/\partial t = H\psi, \tag{21.12}$$

where $H$ is the Hamiltonian of the particle.† To obtain this form, we need only multiply equation (21.2) on the left by $\gamma^0$. The resulting expression for the Hamiltonian is

$$H = \boldsymbol\alpha\cdot\mathbf{p} + \beta m, \tag{21.13}$$

where

$$\boldsymbol\alpha = \gamma^0\boldsymbol\gamma, \qquad \beta = \gamma^0 \tag{21.14}$$

is the customary notation for the matrices concerned. It may be noted that

$$\alpha_i\alpha_k + \alpha_k\alpha_i = 2\delta_{ik}, \qquad \beta\boldsymbol\alpha + \boldsymbol\alpha\beta = 0, \qquad \beta^2 = 1, \tag{21.15}$$

i.e. all the matrices $\boldsymbol\alpha$, $\beta$ anticommute and their squares are unity; they are all Hermitian. In the spinor representation,

$$\boldsymbol\alpha = \begin{pmatrix} \boldsymbol\sigma & 0 \\ 0 & -\boldsymbol\sigma \end{pmatrix}, \qquad \beta = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \tag{21.16}$$

† For a particle with spin zero, the wave equation was not capable of being written in this form: the equation (10.5) for the scalar $\psi$ is of the second order in the time, while the first-order equations (10.4) for the five-component quantity $(\psi, \psi_\mu)$ contain the time derivatives of only some of the components.

In the limit of small velocities the particle must be described, as in the non-relativistic theory, by a single two-component spinor: on taking the limit $\mathbf{p}\to 0$, $\varepsilon\to m$ in equations (20.5), we find $\xi = \eta$, so that the two spinors which form the bispinor are equal. This, however, reveals a defect of the spinor form of Dirac's equation: in the limit, all four components of $\psi$ are non-zero, although only two of them are really independent. A more convenient representation of the wave function $\psi$ would be one in which two of its components were zero in the limit.

Accordingly, we replace $\xi$ and $\eta$ by linear combinations $\varphi$ and $\chi$:

$$\psi = \begin{pmatrix} \varphi \\ \chi \end{pmatrix}, \qquad \varphi = \frac{1}{\sqrt2}(\xi + \eta), \qquad \chi = \frac{1}{\sqrt2}(\xi - \eta). \tag{21.17}$$

Then $\chi = 0$ for a particle at rest. This will be called the standard representation of $\psi$.
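As a cross-check of (21.13) to (21.16), one can verify numerically that the spinor-representation matrices $\boldsymbol\alpha$, $\beta$ satisfy (21.15) and that $H^2 = \mathbf{p}^2 + m^2$, as required by $p^2 = m^2$. This is a minimal sketch with $\mathbf{p}$ treated as a numerical vector; the function names and the sample values of $\mathbf{p}$ and $m$ are arbitrary assumptions.

```python
# Check of (21.15) and of H^2 = p^2 + m^2 for H = alpha.p + beta m (21.13),
# using the spinor-representation matrices (21.16). Names are illustrative.

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]


def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def block(a, b, c, d):
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


I4 = block(I2, Z2, Z2, I2)
ALPHA = [block(s, Z2, Z2, scale(-1, s)) for s in SIGMA]   # alpha = diag(sigma, -sigma)
BETA = block(Z2, I2, I2, Z2)                              # beta = gamma^0


def anticommutation_ok():
    """True if alpha_i, beta obey (21.15)."""
    for i in range(3):
        if not close(add(mul(BETA, ALPHA[i]), mul(ALPHA[i], BETA)), scale(0, I4)):
            return False
        for k in range(3):
            anti = add(mul(ALPHA[i], ALPHA[k]), mul(ALPHA[k], ALPHA[i]))
            if not close(anti, scale(2 if i == k else 0, I4)):
                return False
    return close(mul(BETA, BETA), I4)


def hamiltonian(p, m):
    """H = alpha . p + beta m for a numerical momentum vector p."""
    h = scale(m, BETA)
    for p_i, a_i in zip(p, ALPHA):
        h = add(h, scale(p_i, a_i))
    return h


def squares_to_energy(p, m):
    """True if H^2 equals (p^2 + m^2) times the unit matrix."""
    h = hamiltonian(p, m)
    return close(mul(h, h), scale(sum(x * x for x in p) + m * m, I4))


print(anticommutation_ok(), squares_to_energy((0.3, -0.4, 1.2), 1.0))
```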
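On inversion, $\varphi$ and $\chi$ are transformed as follows:

$$P:\quad \varphi \to i\varphi, \qquad \chi \to -i\chi. \tag{21.18}$$

The equations for $\varphi$ and $\chi$ are obtained by adding and subtracting equations (20.5):

$$p_0\varphi - \mathbf{p}\cdot\boldsymbol\sigma\chi = m\varphi, \qquad -p_0\chi + \mathbf{p}\cdot\boldsymbol\sigma\varphi = m\chi. \tag{21.19}$$

Hence we see that the standard representation corresponds to the matrices

$$\gamma^0 \equiv \beta = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \boldsymbol\gamma = \begin{pmatrix} 0 & \boldsymbol\sigma \\ -\boldsymbol\sigma & 0 \end{pmatrix}, \qquad \boldsymbol\alpha = \begin{pmatrix} 0 & \boldsymbol\sigma \\ \boldsymbol\sigma & 0 \end{pmatrix}. \tag{21.20}$$

Since the first and second components of $\xi$ and $\eta$ are added separately in (21.17), the components $\psi_1$ and $\psi_3$ correspond to the spin component eigenvalue $+\tfrac12$ in both the standard and the spinor representation, and $\psi_2$ and $\psi_4$ to $-\tfrac12$. In both representations, therefore, the matrix $\tfrac12\boldsymbol\Sigma$, where

$$\boldsymbol\Sigma = \begin{pmatrix} \boldsymbol\sigma & 0 \\ 0 & \boldsymbol\sigma \end{pmatrix}, \tag{21.21}$$

is a three-dimensional spin operator: when $\tfrac12\Sigma_z$ acts on a bispinor containing only the components $\psi_1$, $\psi_3$ or $\psi_2$, $\psi_4$, this bispinor is multiplied by $+\tfrac12$ or $-\tfrac12$. In an arbitrary representation, (21.21) may be written in the form

$$\boldsymbol\Sigma = -\boldsymbol\alpha\gamma^5 = -\tfrac12 i\,\boldsymbol\alpha\times\boldsymbol\alpha; \tag{21.22}$$

the definition of $\gamma^5$ is given in (22.14) below.

PROBLEMS

PROBLEM 1. Find the formulae giving the transformations of the wave function under an infinitesimal Lorentz transformation and an infinitesimal three-dimensional rotation.

SOLUTION. In the spinor representation of $\psi$, an infinitesimal Lorentz transformation gives

$$\xi' = (1 - \tfrac12\boldsymbol\sigma\cdot\delta\mathbf{V})\xi, \qquad \eta' = (1 + \tfrac12\boldsymbol\sigma\cdot\delta\mathbf{V})\eta;$$

see (18.8), (18.8a), (18.10). These formulae may be combined as

$$\psi' = (1 - \tfrac12\boldsymbol\alpha\cdot\delta\mathbf{V})\psi. \tag{1}$$

Similarly, the transformation under an infinitesimal rotation is

$$\psi' = (1 + \tfrac12 i\boldsymbol\Sigma\cdot\delta\boldsymbol\theta)\psi. \tag{2}$$

In this form the results are valid for any representation of $\psi$ if $\boldsymbol\alpha$ and $\boldsymbol\Sigma$ are matrices in that representation.

It is easily verified that the matrices $\boldsymbol\alpha$ and $\boldsymbol\Sigma$ are the components of an antisymmetric "matrix 4-tensor",

$$\sigma^{\mu\nu} = \tfrac12(\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu) = (\boldsymbol\alpha, i\boldsymbol\Sigma);$$

the components are arranged as shown in (19.15). Using also the infinitesimal antisymmetric tensor $\delta\varepsilon^{\mu\nu} = (\delta\mathbf{V}, \delta\boldsymbol\theta)$, we have

$$\sigma^{\mu\nu}\delta\varepsilon_{\mu\nu} = 2i\boldsymbol\Sigma\cdot\delta\boldsymbol\theta - 2\boldsymbol\alpha\cdot\delta\mathbf{V},$$

and formulae (1) and (2) above may be combined as

$$\psi' = (1 + \tfrac14\sigma^{\mu\nu}\delta\varepsilon_{\mu\nu})\psi. \tag{3}$$

PROBLEM 2. Write Dirac's equation in a representation such that it contains no imaginary coefficients (E. Majorana, 1937).

SOLUTION.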
In the standard representation, the only imaginary quantities in the equation

$$\left(\frac{\partial}{\partial t} + \boldsymbol\alpha\cdot\nabla + im\beta\right)\psi = 0$$

are the matrices $\alpha_y$ and $i\beta$. These may be eliminated by a transformation $\psi' = U\psi$ such that the imaginary matrix $\alpha_y$ and the real matrix $\beta$ are interchanged. This is achieved by putting

$$U = \frac{1}{\sqrt2}(\alpha_y + \beta) = U^{-1};$$

then

$$\alpha_x' = U\alpha_x U = -\alpha_x, \qquad \alpha_y' = \beta, \qquad \alpha_z' = -\alpha_z, \qquad \beta' = \alpha_y,$$

and Dirac's equation becomes

$$\left(\frac{\partial}{\partial t} - \alpha_x\frac{\partial}{\partial x} + \beta\frac{\partial}{\partial y} - \alpha_z\frac{\partial}{\partial z} + im\alpha_y\right)\psi' = 0,$$

in which all the coefficients are real.

§22. Algebra of Dirac matrices

In calculations using Dirac's equation, the matrices $\gamma$ occur repeatedly without reference to their specific form in any particular representation. The rules of operation with these matrices are entirely given by the commutation relations

$$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu} \qquad (\mu, \nu = 0, 1, 2, 3), \tag{22.1}$$

which determine all their general properties. In this section we shall give various formulae and rules of the algebra of these matrices which are useful in such calculations.

The "scalar product" of the matrices $\gamma$ with themselves is $g_{\mu\nu}\gamma^\mu\gamma^\nu = 4$. For brevity we use the notation $\gamma_\mu = g_{\mu\nu}\gamma^\nu$ by analogy with the covariant components of 4-vectors. Then

$$\gamma_\mu\gamma^\mu = 4. \tag{22.2}$$

If the matrices $\gamma_\mu$ and $\gamma^\mu$ are separated by one or more factors $\gamma$, then they can be brought to adjoining positions by one or more interchanges using the rule (22.1), and the summation over $\mu$ is then carried out by means of (22.2). This yields the formulae

$$\begin{aligned}
\gamma_\mu\gamma^\nu\gamma^\mu &= -2\gamma^\nu,\\
\gamma_\mu\gamma^\lambda\gamma^\nu\gamma^\mu &= 4g^{\lambda\nu},\\
\gamma_\mu\gamma^\lambda\gamma^\nu\gamma^\rho\gamma^\mu &= -2\gamma^\rho\gamma^\nu\gamma^\lambda,\\
\gamma_\mu\gamma^\lambda\gamma^\nu\gamma^\rho\gamma^\sigma\gamma^\mu &= 2(\gamma^\sigma\gamma^\lambda\gamma^\nu\gamma^\rho + \gamma^\rho\gamma^\nu\gamma^\lambda\gamma^\sigma).
\end{aligned} \tag{22.3}$$

The factors $\gamma^\mu$, etc., usually appear in combination with various 4-vectors as "scalar products" with the latter,†

$$\gamma a \equiv \gamma^\mu a_\mu. \tag{22.4}$$

For such products, formulae (22.1) become

$$(a\gamma)(b\gamma) + (b\gamma)(a\gamma) = 2(ab), \qquad (a\gamma)(a\gamma) = a^2, \tag{22.5}$$

and formulae (22.3) become

$$\begin{aligned}
\gamma_\mu(a\gamma)\gamma^\mu &= -2(a\gamma),\\
\gamma_\mu(a\gamma)(b\gamma)\gamma^\mu &= 4(ab),\\
\gamma_\mu(a\gamma)(b\gamma)(c\gamma)\gamma^\mu &= -2(c\gamma)(b\gamma)(a\gamma),\\
\gamma_\mu(a\gamma)(b\gamma)(c\gamma)(d\gamma)\gamma^\mu &= 2[(d\gamma)(a\gamma)(b\gamma)(c\gamma) + (c\gamma)(b\gamma)(a\gamma)(d\gamma)].
\end{aligned} \tag{22.6}$$

† In this edition, no special notation is used for such products. Letters with circumflexes or with strokes through them are often found with this meaning in the literature.

A frequent operation is taking the trace of the product of a number of matrices $\gamma$. Let us consider the quantities

$$T^{\mu_1\mu_2\ldots\mu_n} = \tfrac14\,\mathrm{tr}(\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_n}). \tag{22.7}$$

On account of a familiar property of the trace of a product of matrices, this tensor is symmetrical with respect to cyclic permutations of the indices $\mu_1, \mu_2, \ldots, \mu_n$.

Since the matrices $\gamma$ have the same form in any frame of reference, the quantities $T$ are also independent of this frame, and they therefore form a tensor which can be expressed entirely in terms of the metric tensor $g_{\mu\nu}$, which has this property. From the tensor $g_{\mu\nu}$ of rank two, however, only tensors of even rank can be constructed. Hence it follows immediately that the trace of the product of any odd number of factors $\gamma$ is zero. In particular, the trace of each $\gamma$ is zero:†

$$\mathrm{tr}\,\gamma^\mu = 0. \tag{22.8}$$

The trace of a unit four-rowed matrix (which is implied on the right-hand side of the commutation rule (22.1)) is 4. Thus, if we take the trace of both sides of (22.1), we find

$$T^{\mu\nu} = g^{\mu\nu}. \tag{22.9}$$

The trace of the four-matrix product is

$$T^{\lambda\mu\nu\rho} = g^{\lambda\mu}g^{\nu\rho} - g^{\lambda\nu}g^{\mu\rho} + g^{\lambda\rho}g^{\mu\nu}. \tag{22.10}$$

This formula may be derived, for instance, by "pulling" the factor $\gamma^\lambda$ in $\mathrm{tr}\,\gamma^\lambda\gamma^\mu\gamma^\nu\gamma^\rho$ to the right by means of the relation (22.1); after each interchange one of the terms in (22.10) appears:

$$T^{\lambda\mu\nu\rho} = 2g^{\lambda\mu}T^{\nu\rho} - T^{\mu\lambda\nu\rho} = 2g^{\lambda\mu}g^{\nu\rho} - T^{\mu\lambda\nu\rho},$$

and so on. After all the interchanges there remains on the right $-T^{\mu\nu\rho\lambda} = -T^{\lambda\mu\nu\rho}$, which we take to the left-hand side. The trace of a product of six $\gamma$ can similarly be reduced to the traces of four-factor products, and so on. For instance,

$$T^{\lambda\mu\nu\rho\sigma\tau} = g^{\lambda\mu}T^{\nu\rho\sigma\tau} - g^{\lambda\nu}T^{\mu\rho\sigma\tau} + g^{\lambda\rho}T^{\mu\nu\sigma\tau} - g^{\lambda\sigma}T^{\mu\nu\rho\tau} + g^{\lambda\tau}T^{\mu\nu\rho\sigma}. \tag{22.11}$$

All the traces $T^{\lambda\mu\ldots}$ are real, and they are non-zero only if each of the matrices $\gamma^0, \gamma^1, \ldots$ appears in the product an even number of times; both these results are obvious from the above formulae.
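The trace rules (22.8) to (22.10) lend themselves to a brute-force check over all index combinations. The sketch below does this for the matrices (21.3); since the traces are invariant under $\gamma' = U\gamma U^{-1}$, any representation would serve. The function `T` mirrors the definition (22.7); all other names are illustrative assumptions.

```python
# Brute-force check of the trace formulae (22.9), (22.10) and of the
# vanishing of traces of three gamma matrices. Names are illustrative.
from itertools import product

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
METRIC = [1, -1, -1, -1]


def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def block(a, b, c, d):
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


I4 = block(I2, Z2, Z2, I2)
GAMMA = [block(Z2, I2, I2, Z2)] + [block(Z2, scale(-1, s), s, Z2) for s in SIGMA]


def T(*indices):
    """T^{mu1 mu2 ...} = (1/4) tr(gamma^{mu1} gamma^{mu2} ...), as in (22.7)."""
    m = I4
    for mu in indices:
        m = mul(m, GAMMA[mu])
    return sum(m[k][k] for k in range(4)) / 4


def g(mu, nu):
    return METRIC[mu] if mu == nu else 0


def check_traces(tol=1e-12):
    """Return (ok for (22.9), ok for three-gamma traces, ok for (22.10))."""
    idx = range(4)
    ok2 = all(abs(T(m, n) - g(m, n)) < tol for m, n in product(idx, repeat=2))
    ok3 = all(abs(T(l, m, n)) < tol for l, m, n in product(idx, repeat=3))
    ok4 = all(
        abs(T(l, m, n, r)
            - (g(l, m) * g(n, r) - g(l, n) * g(m, r) + g(l, r) * g(m, n))) < tol
        for l, m, n, r in product(idx, repeat=4)
    )
    return ok2, ok3, ok4


print(check_traces())
```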
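It follows from these trace formulae that the trace is unchanged when the order of the factors is reversed:

$$T^{\lambda\mu\ldots\rho\sigma} = T^{\sigma\rho\ldots\mu\lambda}. \tag{22.12}$$

As already mentioned, the factors $\gamma$ usually appear as "scalar" products with various 4-vectors. In such cases, formulae (22.9) and (22.10), for example, become

$$\begin{aligned}
\tfrac14\,\mathrm{tr}\,(a\gamma)(b\gamma) &= ab,\\
\tfrac14\,\mathrm{tr}\,(a\gamma)(b\gamma)(c\gamma)(d\gamma) &= (ab)(cd) - (ac)(bd) + (ad)(bc).
\end{aligned} \tag{22.13}$$

† The trace of a matrix is invariant under the transformations $\gamma' = U\gamma U^{-1}$. Thus the result (22.8) is also evident from the expressions (21.3) for the matrices.

The product $\gamma^0\gamma^1\gamma^2\gamma^3$ is of particular importance. There is a special notation for it which is customarily used:

$$\gamma^5 = -i\gamma^0\gamma^1\gamma^2\gamma^3. \tag{22.14}$$

It is easily seen that

$$(\gamma^5)^2 = 1, \qquad \gamma^5\gamma^\mu + \gamma^\mu\gamma^5 = 0, \tag{22.15}$$

i.e. the matrix $\gamma^5$ anticommutes with all the $\gamma^\mu$. For the matrices $\boldsymbol\alpha$ and $\beta$, the rules are

$$\boldsymbol\alpha\gamma^5 - \gamma^5\boldsymbol\alpha = 0, \qquad \beta\gamma^5 + \gamma^5\beta = 0; \tag{22.16}$$

the commutability with $\boldsymbol\alpha$ follows because $\boldsymbol\alpha = \gamma^0\boldsymbol\gamma$ is a product of two matrices $\gamma^\mu$.

The matrix $\gamma^5$ is Hermitian, since

$$\gamma^{5+} = i\gamma^{3+}\gamma^{2+}\gamma^{1+}\gamma^{0+} = -i\gamma^3\gamma^2\gamma^1\gamma^0,$$

and hence

$$\gamma^{5+} = \gamma^5, \tag{22.17}$$

because the sequence 3210 is changed to 0123 by an even number of transpositions.

The form of this matrix in two particular representations is:

$$\text{spinor:}\quad \gamma^5 = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}; \qquad \text{standard:}\quad \gamma^5 = \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}. \tag{22.18}$$

The trace of the matrix $\gamma^5$ is zero:

$$\mathrm{tr}\,\gamma^5 = 0, \tag{22.19}$$

as can be seen directly from (22.18). The traces of the products $\gamma^5\gamma^\mu\gamma^\nu$ are also zero. For the products of $\gamma^5$ with four factors $\gamma^\mu$ we have

$$\tfrac14\,\mathrm{tr}\,\gamma^5\gamma^\lambda\gamma^\mu\gamma^\nu\gamma^\rho = i e^{\lambda\mu\nu\rho}. \tag{22.20}$$

Another formula is

$$\gamma N = i\gamma^5(\gamma a)(\gamma b)(\gamma c), \qquad N^\lambda = e^{\lambda\mu\nu\rho}a_\mu b_\nu c_\rho, \tag{22.21}$$

which is valid for mutually normal 4-vectors $a$, $b$, $c$: $ab = ac = bc = 0$.

In some cases (for problems involving non-relativistic particles), it may be necessary to calculate the traces of products which involve $\gamma^0$ and the three-dimensional "vector" $\boldsymbol\gamma$ separately. The only non-zero traces are those of products containing even numbers of factors $\gamma^0$ and $\boldsymbol\gamma$.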
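All the factors $\gamma^0$ become unity, and the traces of products with two and four factors $\boldsymbol\gamma$ are respectively

$$\begin{aligned}
\tfrac14\,\mathrm{tr}\,(\mathbf{a}\cdot\boldsymbol\gamma)(\mathbf{b}\cdot\boldsymbol\gamma) &= -\mathbf{a}\cdot\mathbf{b},\\
\tfrac14\,\mathrm{tr}\,(\mathbf{a}\cdot\boldsymbol\gamma)(\mathbf{b}\cdot\boldsymbol\gamma)(\mathbf{c}\cdot\boldsymbol\gamma)(\mathbf{d}\cdot\boldsymbol\gamma) &= (\mathbf{a}\cdot\mathbf{b})(\mathbf{c}\cdot\mathbf{d}) - (\mathbf{a}\cdot\mathbf{c})(\mathbf{b}\cdot\mathbf{d}) + (\mathbf{a}\cdot\mathbf{d})(\mathbf{b}\cdot\mathbf{c}).
\end{aligned} \tag{22.22}$$

§23. Plane waves

The state of a free particle having definite values of the momentum and energy is described by a plane wave which may be written in the form

$$\psi_p = \frac{1}{\sqrt{2\varepsilon}}\,u_p e^{-ipx}. \tag{23.1}$$

The suffix $p$ indicates the value of the 4-momentum; the wave amplitude $u_p$ is a suitably normalized bispinor.

In proceeding with second quantization we need not only the wave functions (23.1) but also functions with a "negative frequency", which arise in the relativistic theory because of the two-valuedness of the square root $\pm\sqrt{\mathbf{p}^2 + m^2}$, as shown in §11. As in §11, we shall always take $\varepsilon$ to be the positive quantity $\varepsilon = +\sqrt{\mathbf{p}^2 + m^2}$, so that the "negative frequency" is $-\varepsilon$; on changing also the sign of $\mathbf{p}$, we obtain a function which may naturally be called $\psi_{-p}$:

$$\psi_{-p} = \frac{1}{\sqrt{2\varepsilon}}\,u_{-p} e^{ipx}. \tag{23.2}$$

The significance of these functions will be explained in §26; here we shall write parallel formulae for $\psi_p$ and $\psi_{-p}$.

The components of the bispinor amplitudes $u_p$ and $u_{-p}$ satisfy the algebraic equations

$$(\gamma p - m)u_p = 0, \qquad (\gamma p + m)u_{-p} = 0, \tag{23.3}$$

which are obtained by substituting (23.1), (23.2) in Dirac's equation (this is equivalent to replacing the operator $p$ in that equation by $\pm p$).† The relation $p^2 = m^2$ is then the condition for each such pair of equations to be compatible. We shall always normalize the bispinor amplitudes by the invariant conditions

$$\bar u_p u_p = 2m, \qquad \bar u_{-p}u_{-p} = -2m, \tag{23.4}$$

where the bar over a letter denotes, as usual, Dirac conjugation: $\bar u = u^*\gamma^0$. Multiplying equations (23.3) on the left by $\bar u_{\pm p}$, we obtain $(\bar u_{\pm p}\gamma u_{\pm p})p = 2m^2 = 2p^2$, whence

$$\bar u_p\gamma^\mu u_p = \bar u_{-p}\gamma^\mu u_{-p} = 2p^\mu. \tag{23.5}$$

† There are also similar equations obtained from Dirac's equation (21.9) for the complex-conjugate function:

$$\bar u_p(\gamma p - m) = 0, \qquad \bar u_{-p}(\gamma p + m) = 0. \tag{23.3a}$$

It may be noted that the change from the formulae for $u_p$ to those for $u_{-p}$ is made by changing the sign of $m$.

The current density 4-vector is

$$j^\mu = \bar\psi_{\pm p}\gamma^\mu\psi_{\pm p} = \frac{1}{2\varepsilon}\,\bar u_{\pm p}\gamma^\mu u_{\pm p} = \frac{p^\mu}{\varepsilon}, \tag{23.6}$$

i.e. $j^\mu = (1, \mathbf{v})$, where $\mathbf{v} = \mathbf{p}/\varepsilon$ is the velocity of the particle. Hence we see that the functions $\psi_{\pm p}$ are "normalized to one particle in the volume $V = 1$".

Equations (23.3) show that the components of the wave amplitude are related, but the actual form of the relations depends, of course, on the specific representation of $\psi$. For the standard representation they are found as follows.

From equations (21.19) we have, for a plane wave,

$$(\varepsilon - m)\varphi - \mathbf{p}\cdot\boldsymbol\sigma\chi = 0, \qquad (\varepsilon + m)\chi - \mathbf{p}\cdot\boldsymbol\sigma\varphi = 0. \tag{23.7}$$

From these we find the relation between $\varphi$ and $\chi$ in two equivalent forms:

$$\varphi = \frac{\mathbf{p}\cdot\boldsymbol\sigma}{\varepsilon - m}\chi, \qquad \chi = \frac{\mathbf{p}\cdot\boldsymbol\sigma}{\varepsilon + m}\varphi; \tag{23.8}$$

their equivalence is evident on multiplying the first form on the left by $\mathbf{p}\cdot\boldsymbol\sigma/(\varepsilon + m)$ and using the results $(\mathbf{p}\cdot\boldsymbol\sigma)^2 = \mathbf{p}^2$ and $\varepsilon^2 - m^2 = \mathbf{p}^2$, which gives the second form. The common factor in $\varphi$ and $\chi$ is chosen to satisfy the normalization condition (23.4). Thus we obtain for $u_p$, and correspondingly for $u_{-p}$, the expressions

$$u_p = \begin{pmatrix} \sqrt{\varepsilon + m}\,w \\ \sqrt{\varepsilon - m}\,(\mathbf{n}\cdot\boldsymbol\sigma)w \end{pmatrix}, \qquad u_{-p} = \begin{pmatrix} \sqrt{\varepsilon - m}\,(\mathbf{n}\cdot\boldsymbol\sigma)w' \\ \sqrt{\varepsilon + m}\,w' \end{pmatrix}; \tag{23.9}$$

the second formula is obtained from the first by changing the sign of $m$ and replacing $w$ by $(\mathbf{n}\cdot\boldsymbol\sigma)w'$. Here $\mathbf{n}$ is a unit vector in the direction of $\mathbf{p}$, and $w$ is an arbitrary two-component quantity subject only to the normalization condition

$$w^*w = 1. \tag{23.10}$$

For $\bar u = u^*\gamma^0$ (with $\gamma^0$ from (21.20)) we have

$$\bar u_p = \left(\sqrt{\varepsilon + m}\,w^*,\ -\sqrt{\varepsilon - m}\,w^*(\mathbf{n}\cdot\boldsymbol\sigma)\right), \qquad \bar u_{-p} = \left(\sqrt{\varepsilon - m}\,w'^*(\mathbf{n}\cdot\boldsymbol\sigma),\ -\sqrt{\varepsilon + m}\,w'^*\right), \tag{23.11}$$

and multiplication shows that in fact $\bar u_{\pm p}u_{\pm p} = \pm 2m$.

In the rest frame ($\varepsilon = m$), we have

$$u_p = \sqrt{2m}\begin{pmatrix} w \\ 0 \end{pmatrix}, \qquad u_{-p} = \sqrt{2m}\begin{pmatrix} 0 \\ w' \end{pmatrix}, \tag{23.12}$$

i.e. $w$ is the three-dimensional spinor to which the amplitude of each wave reduces in the non-relativistic limit. In the bispinor $u_{-p}$, the first two components, not the second two, vanish in the rest frame.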
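The amplitudes (23.9) can be tested directly against (23.3) to (23.5) in the standard representation (21.20). The sketch below uses the arbitrary sample values $\mathbf{p} = (1, 2, 2)$, $m = 4$ (so that $\varepsilon = 5$) and the spinors $w = (1, 0)$, $w' = (0, 1)$; these values and all function names are assumptions made for illustration only.

```python
# Check of the plane-wave amplitudes (23.9) against (23.3)-(23.5) in the
# standard representation (21.20). Names and sample values are illustrative.
import math

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def block(a, b, c, d):
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


def matvec(a, v):
    return [sum(a[i][k] * v[k] for k in range(len(v))) for i in range(len(a))]


def linear_combination(coeffs, mats):
    total = [[0] * len(mats[0]) for _ in mats[0]]
    for c, m in zip(coeffs, mats):
        total = add(total, scale(c, m))
    return total


# gamma^0 = diag(1, -1),  gamma^i = [[0, sigma_i], [-sigma_i, 0]]
GAMMA_STD = [block(I2, Z2, Z2, scale(-1, I2))] + \
            [block(Z2, s, scale(-1, s), Z2) for s in SIGMA]


def amplitudes(p, m, w, w_prime):
    """Return (eps, u_p, u_{-p}) built according to (23.9)."""
    p_abs = math.sqrt(sum(x * x for x in p))
    eps = math.sqrt(p_abs ** 2 + m ** 2)
    n_sigma = linear_combination([x / p_abs for x in p], SIGMA)
    big, small = math.sqrt(eps + m), math.sqrt(eps - m)
    u_plus = [big * x for x in w] + [small * x for x in matvec(n_sigma, w)]
    u_minus = [small * x for x in matvec(n_sigma, w_prime)] + [big * x for x in w_prime]
    return eps, u_plus, u_minus


def bilinear(u, mat, v):
    """u-bar mat v, with the Dirac conjugate u-bar = u^* gamma^0."""
    gv = matvec(GAMMA_STD[0], matvec(mat, v))
    return sum(complex(x).conjugate() * y for x, y in zip(u, gv))


def check_plane_waves(p, m, w=(1, 0), w_prime=(0, 1)):
    eps, u_plus, u_minus = amplitudes(p, m, list(w), list(w_prime))
    gamma_p = linear_combination([eps] + [-x for x in p], GAMMA_STD)
    identity = block(I2, Z2, Z2, I2)
    return {
        # largest component of (gamma p - m) u_p and of (gamma p + m) u_{-p}
        "residual_plus": max(abs(x - m * y)
                             for x, y in zip(matvec(gamma_p, u_plus), u_plus)),
        "residual_minus": max(abs(x + m * y)
                              for x, y in zip(matvec(gamma_p, u_minus), u_minus)),
        "norm_plus": bilinear(u_plus, identity, u_plus),
        "norm_minus": bilinear(u_minus, identity, u_minus),
        "current": [bilinear(u_plus, g, u_plus) for g in GAMMA_STD],
    }


result = check_plane_waves((1.0, 2.0, 2.0), 4.0)
print(result)
```

For these values the two norms should come out close to $\pm 2m = \pm 8$ and the "current" entries close to $2p^\mu = (10, 2, 4, 4)$, up to rounding error.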
That the first two components of $u_{-p}$ vanish in the rest frame is a general property of solutions of Dirac's equation having "negative frequencies": by putting $\mathbf{p} = 0$ and replacing $\varepsilon$ by $-m$ in (23.7), we find $\varphi = 0$.†

The amplitude of the plane wave contains one arbitrary two-component quantity. Thus, for a given momentum, there are two different independent states, corresponding to the two possible values of the spin component. But the spin component along an arbitrary $z$-axis cannot have a definite value. This is evident because the Hamiltonian of a particle with definite $\mathbf{p}$ (i.e. the matrix $H = \boldsymbol\alpha\cdot\mathbf{p} + \beta m$) does not commute with the matrix $\Sigma_z = -i\alpha_x\alpha_y$. In accordance with the general conclusions of §16, however, the helicity $\lambda$ (the component of the spin in the direction of $\mathbf{p}$) is conserved: the Hamiltonian commutes with the matrix $\mathbf{n}\cdot\boldsymbol\Sigma$.

Helicity states correspond to plane waves in which the three-dimensional spinor $w = w^{(\lambda)}(\mathbf{n})$ is an eigenfunction of the operator $\mathbf{n}\cdot\boldsymbol\sigma$:

$$\tfrac12(\mathbf{n}\cdot\boldsymbol\sigma)w^{(\lambda)} = \lambda w^{(\lambda)}. \tag{23.13}$$

The explicit form of these spinors is

$$w^{(\lambda = 1/2)} = \begin{pmatrix} e^{-i\phi/2}\cos\tfrac12\theta \\ e^{i\phi/2}\sin\tfrac12\theta \end{pmatrix}, \qquad w^{(\lambda = -1/2)} = \begin{pmatrix} -e^{-i\phi/2}\sin\tfrac12\theta \\ e^{i\phi/2}\cos\tfrac12\theta \end{pmatrix}, \tag{23.14}$$

where $\theta$ and $\phi$ are the polar angle and the azimuth of the direction of $\mathbf{n}$ relative to fixed axes $xyz$.‡

† In the spinor representation $\xi = -\eta$, instead of $\xi = \eta$ as in the rest frame for solutions having "positive frequencies".

‡ The solution of equation (23.13) can be multiplied by any phase factor, because of the possibility of an arbitrary rotation about the direction of $\mathbf{n}$.

Another possible choice of the two independent states of a free particle with given $\mathbf{p}$, which is simpler but less clear, corresponds to the two values of the $z$-component of the spin in the rest frame, which we denote by $\sigma$. The spinors are

$$w^{(\sigma = 1/2)} = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad w^{(\sigma = -1/2)} = \begin{pmatrix} 0 \\ 1 \end{pmatrix}. \tag{23.15}$$

As the two linearly independent solutions with "negative frequency" we take plane waves in which the three-dimensional spinors are

$$w^{(\sigma)\prime} = -\sigma_y w^{(-\sigma)} = 2\sigma i\,w^{(\sigma)}; \tag{23.16}$$

the significance of this choice will be shown in §26.
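The eigenvalue equation (23.13) can be confirmed for helicity spinors of the form (23.14) at arbitrary angles. The sketch below evaluates the residual $\tfrac12(\mathbf{n}\cdot\boldsymbol\sigma)w^{(\lambda)} - \lambda w^{(\lambda)}$; the function names are illustrative, and the overall phase of each spinor is a convention (the eigenvalue equation does not fix it).

```python
# Residual of (23.13) for the helicity spinors (23.14). Names are illustrative.
import cmath
import math


def helicity_spinor(lam, theta, phi):
    """Two-component spinor with helicity lam = +1/2 or -1/2 along n(theta, phi)."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    e_minus, e_plus = cmath.exp(-0.5j * phi), cmath.exp(0.5j * phi)
    if lam > 0:
        return [e_minus * c, e_plus * s]
    return [-e_minus * s, e_plus * c]


def n_dot_sigma(theta, phi):
    """The 2x2 matrix n . sigma for the unit vector with polar angles theta, phi."""
    return [
        [math.cos(theta), math.sin(theta) * cmath.exp(-1j * phi)],
        [math.sin(theta) * cmath.exp(1j * phi), -math.cos(theta)],
    ]


def eigen_residual(lam, theta, phi):
    """Largest component of (1/2)(n . sigma) w - lam w; zero for an eigenspinor."""
    w = helicity_spinor(lam, theta, phi)
    m = n_dot_sigma(theta, phi)
    lhs = [0.5 * sum(m[i][k] * w[k] for k in range(2)) for i in range(2)]
    return max(abs(x - lam * y) for x, y in zip(lhs, w))


def norm_squared(lam, theta, phi):
    """w^* w, which should equal 1 by (23.10)."""
    return sum(abs(x) ** 2 for x in helicity_spinor(lam, theta, phi))


for lam in (0.5, -0.5):
    print(lam, eigen_residual(lam, 0.7, 2.1), norm_squared(lam, 0.7, 2.1))
```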
We can also find a representation of a plane wave such that in any frame of reference (not only in the rest frame) the wave has only two components corresponding to definite values of the same physical property, the spin component in the rest frame (L. Foldy and S. A. Wouthuysen, 1950).

Starting from the amplitude $u_p$ (23.9) in the standard representation, we seek a unitary transformation to such a representation in the form

$$u_p' = Uu_p, \qquad U = e^{W\boldsymbol\gamma\cdot\mathbf{n}},$$

where $W$ is real; since $\boldsymbol\gamma^+ = -\boldsymbol\gamma$, it follows that $U^+ = U^{-1}$. Expanding in series and noting that $(\boldsymbol\gamma\cdot\mathbf{n})^2 = -1$, we put

$$U = \cos W + \boldsymbol\gamma\cdot\mathbf{n}\,\sin W;$$

cf. the derivation of (18.14) from (18.13). The condition that the second pair of components in the transformed amplitude $u_p'$ should be zero gives $\tan W = |\mathbf{p}|/(m + \varepsilon)$, so that

$$U = \frac{m + \varepsilon + (\boldsymbol\gamma\cdot\mathbf{n})|\mathbf{p}|}{\sqrt{2\varepsilon(\varepsilon + m)}}.$$

In the new representation,

$$u_p' = \sqrt{2\varepsilon}\begin{pmatrix} w \\ 0 \end{pmatrix}. \tag{23.17}$$

The Hamiltonian of the particle in this representation is

$$H' = U(\boldsymbol\alpha\cdot\mathbf{p} + \beta m)U^{-1} = \beta\varepsilon, \tag{23.18}$$

where all the matrices $\beta$, $\boldsymbol\alpha$ and $\boldsymbol\gamma$ belong to the standard representation. This Hamiltonian commutes with the matrix

$$\boldsymbol\Sigma = \begin{pmatrix} \boldsymbol\sigma & 0 \\ 0 & \boldsymbol\sigma \end{pmatrix},$$

which is, in the new representation, the operator of a conserved quantity, the spin in the rest frame.

§24. Spherical waves

The wave functions of states of a free particle (with spin $\tfrac12$) having definite values $j$ of the angular momentum are spinor spherical waves. To determine their form, let us first state the corresponding formulae of the non-relativistic theory.

The non-relativistic wave function is a three-dimensional spinor $\psi = \begin{pmatrix} \psi^1 \\ \psi^2 \end{pmatrix}$. For a state having definite values of the energy $\varepsilon$ (and therefore of the momentum† $p$), the orbital angular momentum $l$, the total angular momentum $j$ and its component $m$, the wave function is

$$\psi = R_{pl}(r)\,\Omega_{jlm}(\theta, \phi). \tag{24.1}$$

The angular functions $\Omega_{jlm}$ are three-dimensional spinors whose components (for the two values $j = l \pm \tfrac12$ which are possible for a given $l$) are

$$\Omega_{l+\frac12,\,l,\,m} = \begin{pmatrix} \sqrt{\dfrac{j + m}{2j}}\,Y_{l,\,m-\frac12} \\[2mm] \sqrt{\dfrac{j - m}{2j}}\,Y_{l,\,m+\frac12} \end{pmatrix}, \qquad \Omega_{l-\frac12,\,l,\,m} = \begin{pmatrix} -\sqrt{\dfrac{j - m + 1}{2j + 2}}\,Y_{l,\,m-\frac12} \\[2mm] \sqrt{\dfrac{j + m + 1}{2j + 2}}\,Y_{l,\,m+\frac12} \end{pmatrix} \tag{24.2}$$

(see QM, §106, Problem). We shall call the $\Omega_{jlm}$ spherical harmonic spinors.
They are normalized by the condition

$$\int \Omega^*_{jlm}\Omega_{j'l'm'}\,do = \delta_{jj'}\delta_{ll'}\delta_{mm'}. \tag{24.3}$$

The radial functions $R_{pl}$ are the common factor in the two components of the spinor $\psi$, and are given by

$$R_{pl} = \sqrt{\frac{2\pi p}{r}}\,J_{l+\frac12}(pr) \tag{24.4}$$

(QM, (33.10)). They are normalized by the condition

$$\int_0^\infty r^2 R_{p'l}R_{pl}\,dr = 2\pi\delta(p' - p). \tag{24.5}$$

† In this section, $p$ denotes $|\mathbf{p}|$.

Returning now to the relativistic case, let us note first of all that separate laws of conservation of spin and orbital angular momentum do not exist for a moving particle: the operators $\mathbf{s}$ and $\mathbf{l}$ do not separately commute with the Hamiltonian. But the parity of the state is still conserved (for a free particle). The quantum number $l$ therefore no longer refers to a definite value of the orbital angular momentum, but it defines the parity of the state (see below).

Let us consider the required wave function (bispinor) in the standard representation:

$$\psi = \begin{pmatrix} \varphi \\ \chi \end{pmatrix}.$$

Under rotations, $\varphi$ and $\chi$ behave like three-dimensional spinors. Their angular dependence is therefore given by the same spherical harmonic spinors $\Omega_{jlm}$. Let $\varphi \propto \Omega_{jlm}$, where $l$ is a certain one of the two values $j + \tfrac12$ and $j - \tfrac12$. Under inversion $\varphi(\mathbf{r}) \to i\varphi(-\mathbf{r})$ (see (21.18)), and $\Omega_{jlm}(-\mathbf{n}) = (-1)^l\Omega_{jlm}(\mathbf{n})$, so that

$$\varphi(\mathbf{r}) \to i(-1)^l\varphi(\mathbf{r}).$$

The components $\chi(\mathbf{r})$, under inversion, become $-i\chi(-\mathbf{r})$. In order that the state should have a definite parity (i.e. that all the components should be multiplied by the same factor on inversion), it is therefore necessary that the angular dependence in $\chi$ should be given by the spherical harmonic spinor $\Omega_{jl'm}$ with the other of the two possible values of $l$; since these two values differ by 1, $(-1)^{l'} = -(-1)^l$.

The radial dependence of $\varphi$ and $\chi$ will be given by the same functions $R_{pl}$ and $R_{pl'}$ (with the values of $l$ and $l'$ which give the order of the spherical harmonics in $\Omega_{jlm}$).
This is clear because each component of $\psi$ satisfies the second-order equation $(p^2 - m^2)\psi = 0$, which for a given value of $|\mathbf{p}|$ becomes

$$(\Delta + \mathbf{p}^2)\psi = 0,$$

and this is formally identical with Schrödinger's non-relativistic equation for a free particle. Thus

$$\varphi = AR_{pl}\Omega_{jlm}, \qquad \chi = BR_{pl'}\Omega_{jl'm}, \tag{24.6}$$

and it remains to determine the constant coefficients $A$ and $B$. To do so, we consider a distant region, where the spherical wave may be regarded as a plane wave. According to the asymptotic formula (QM, (33.12)),

$$R_{pl} \approx \frac{1}{ir}\left\{e^{i(pr - \frac12\pi l)} - e^{-i(pr - \frac12\pi l)}\right\}, \tag{24.7}$$

so that $\varphi$ is the difference of two plane waves propagated in the directions $\pm\mathbf{n}$ ($\mathbf{n} = \mathbf{r}/r$). For each plane wave, by (23.8),

$$\chi = \frac{p}{\varepsilon + m}(\pm\mathbf{n}\cdot\boldsymbol\sigma)\varphi.$$

From the previous results (formulae (24.6)) it is obvious that

$$(\mathbf{n}\cdot\boldsymbol\sigma)\Omega_{jlm} = a\,\Omega_{jl'm},$$

where $a$ is a constant. This constant is easily found by comparing the values of the two sides of the equation when $m = \tfrac12$ and $\mathbf{n}$ is along the $z$-axis. Using (7.2a), we find

$$(\mathbf{n}\cdot\boldsymbol\sigma)\Omega_{jlm} = i^{l'-l}\,\Omega_{jl'm}. \tag{24.8}$$

These formulae, on comparison with (24.6), show that

$$B = -\frac{p}{\varepsilon + m}A = -\sqrt{\frac{\varepsilon - m}{\varepsilon + m}}\,A.$$

Finally, the coefficient $A$ is determined by the normalization of $\psi$. If this is specified by

$$\int \psi^*_{p'j'l'm'}\psi_{pjlm}\,d^3x = 2\pi\delta_{jj'}\delta_{ll'}\delta_{mm'}\delta(p - p'), \tag{24.9}$$

we have

$$\psi_{pjlm} = \frac{1}{\sqrt{2\varepsilon}}\begin{pmatrix} \sqrt{\varepsilon + m}\,R_{pl}\Omega_{jlm} \\ -\sqrt{\varepsilon - m}\,R_{pl'}\Omega_{jl'm} \end{pmatrix}, \qquad l' = 2j - l. \tag{24.10}$$

Thus, for given values of $j$ and $m$ (and of the energy $\varepsilon$) there exist two states differing in parity. The parity is uniquely defined by the number $l$, which takes the values $j \pm \tfrac12$: on inversion, the bispinor (24.10) is multiplied by $i(-1)^l$. The components of this bispinor, however, contain spherical harmonics of both orders $l$ and $l'$, showing that the orbital angular momentum has no definite value.

When $r \to \infty$, the spherical waves (24.7) may be regarded as plane waves in any small region of space, with momentum $\mathbf{p} = \pm p\mathbf{n}$. It is therefore clear that the wave functions in the momentum representation differ from (24.10) essentially only in that the radial factors are absent and $\mathbf{n}$ denotes the direction of the momentum.
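In order to make a direct change to the momentum representation, we must carry out a Fourier transformation:

$$\psi(\mathbf{p}') = \int \psi(\mathbf{r})e^{-i\mathbf{p}'\cdot\mathbf{r}}\,d^3x. \tag{24.11}$$

The integral is calculated by means of the expansion of a plane wave in spherical waves:

$$e^{i\mathbf{p}\cdot\mathbf{r}} = \frac{2\pi}{p}\sum_{l=0}^{\infty}\sum_{m=-l}^{l} i^l R_{pl}(r)\,Y^*_{lm}\!\left(\frac{\mathbf{p}}{p}\right)Y_{lm}\!\left(\frac{\mathbf{r}}{r}\right). \tag{24.12}$$

With an expansion of this kind for $e^{-i\mathbf{p}'\cdot\mathbf{r}}$ in (24.11) and using (24.5), we find that the Fourier components of the function

$$\psi(\mathbf{r}) = R_{pl}(r)\,\Omega_{jlm}(\mathbf{r}/r)$$

are

$$\psi(\mathbf{p}') = \frac{(2\pi)^2}{p}\,\delta(p' - p)\,i^{-l}\sum_{m'} Y_{lm'}\!\left(\frac{\mathbf{p}'}{p'}\right)\int \Omega_{jlm}\!\left(\frac{\mathbf{r}}{r}\right)Y^*_{lm'}\!\left(\frac{\mathbf{r}}{r}\right)do.$$

The integral is equal to the coefficient of the spherical harmonic function in the definition (24.2) of the spherical harmonic spinors, and together with the factor $Y_{lm'}(\mathbf{p}'/p')$ it yields the same spherical harmonic spinor, but with argument $\mathbf{p}'/p'$:

$$\psi(\mathbf{p}') = \frac{(2\pi)^2}{p}\,\delta(p' - p)\,i^{-l}\,\Omega_{jlm}\!\left(\frac{\mathbf{p}'}{p'}\right).$$

Applying this result to the bispinor wave function (24.10), we obtain the momentum representation

$$\psi_{pjlm}(\mathbf{p}') = \frac{(2\pi)^2}{p\sqrt{2\varepsilon}}\,\delta(|\mathbf{p}'| - p)\begin{pmatrix} \sqrt{\varepsilon + m}\,i^{-l}\,\Omega_{jlm}(\mathbf{p}'/p') \\ -\sqrt{\varepsilon - m}\,i^{-l'}\,\Omega_{jl'm}(\mathbf{p}'/p') \end{pmatrix}. \tag{24.13}$$

The states $|pjlm\rangle$ are the same as the states $|pjm|\lambda|\rangle$ (with $|\lambda| = \tfrac12$) discussed in §16: both have definite values of $p$, $j$, $m$ and the parity. The spherical harmonic spinors $\Omega_{jlm}$ are therefore related in a certain way to the functions $D^{(j)}_{\lambda m}$ (both with argument $\mathbf{p}/p$). When $p \to 0$, the wave functions (24.13) reduce to the three-dimensional spinors $\Omega_{jlm}$, the parity of which is $P = \eta(-1)^l$ (where $\eta = i$ is the "internal parity" of the spinor). A comparison with the results of §16 gives the formula

$$\Omega_{jlm} = i^l\sqrt{\frac{2j + 1}{8\pi}}\left(w^{(1/2)}D^{(j)}_{1/2,\,m} \pm w^{(-1/2)}D^{(j)}_{-1/2,\,m}\right), \tag{24.14}$$

where $l = j \mp \tfrac12$, and the $w^{(\lambda)}$ are the three-dimensional spinors (23.14).

§25. The relation between the spin and the statistics

The second quantization of a field of particles with spin $\tfrac12$ (a spinor field) is carried out in a similar way to that of a scalar field in §11.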
Without repeating the arguments, we shall immediately write down expressions for the field operators, which are exactly analogous to (11.2):

$$\begin{aligned}
\hat\psi &= \sum_{\mathbf{p}\sigma}\frac{1}{\sqrt{2\varepsilon}}\left(\hat a_{\mathbf{p}\sigma}u_{p\sigma}e^{-ipx} + \hat b^+_{\mathbf{p}\sigma}u_{-p,-\sigma}e^{ipx}\right),\\
\hat{\bar\psi} &= \sum_{\mathbf{p}\sigma}\frac{1}{\sqrt{2\varepsilon}}\left(\hat a^+_{\mathbf{p}\sigma}\bar u_{p\sigma}e^{ipx} + \hat b_{\mathbf{p}\sigma}\bar u_{-p,-\sigma}e^{-ipx}\right);
\end{aligned} \tag{25.1}$$

the summation is over all values of the momentum $\mathbf{p}$ and over $\sigma = \pm\tfrac12$. The antiparticle annihilation operators $\hat b_{\mathbf{p}\sigma}$ (like the particle annihilation operators $\hat a_{\mathbf{p}\sigma}$) appear as the coefficients of functions whose coordinate dependence ($e^{i\mathbf{p}\cdot\mathbf{r}}$) corresponds to a state having momentum $\mathbf{p}$.†

† The two functions also correspond to the same value $\sigma$ of the spin component in the rest frame; for the functions $\psi_{-p,-\sigma}$, this will be proved in §26 (see (26.10)).

To calculate the Hamiltonian of the spinor field, it is not necessary to determine the energy-momentum tensor (as we did for the scalar field), since in this case there exists a particle Hamiltonian which can be used to derive the wave equation (Dirac's equation) (21.12). The mean energy of the particle in a state with wave function $\psi$ is the integral

$$\int \psi^*H\psi\,d^3x = i\int \psi^*\frac{\partial\psi}{\partial t}\,d^3x = i\int \bar\psi\gamma^0\frac{\partial\psi}{\partial t}\,d^3x. \tag{25.2}$$

It should be noticed that the "energy density" (the integrand) is here not a positive-definite quantity.

Replacing the functions $\psi$ and $\bar\psi$ in (25.2) by $\psi$-operators, using the orthogonality of the wave functions with different $\mathbf{p}$ or $\sigma$, and also using the relation $\bar u_{\pm p\sigma}\gamma^0u_{\pm p\sigma} = 2\varepsilon$ for the wave amplitudes, we obtain the field Hamiltonian in the form

$$\hat H = \sum_{\mathbf{p}\sigma}\varepsilon\left(\hat a^+_{\mathbf{p}\sigma}\hat a_{\mathbf{p}\sigma} - \hat b_{\mathbf{p}\sigma}\hat b^+_{\mathbf{p}\sigma}\right). \tag{25.3}$$

Hence it is seen that in this case Fermi quantization must be used:

$$\{\hat a_{\mathbf{p}\sigma}, \hat a^+_{\mathbf{p}\sigma}\}_+ = 1, \qquad \{\hat b_{\mathbf{p}\sigma}, \hat b^+_{\mathbf{p}\sigma}\}_+ = 1, \tag{25.4}$$

and all other pairs of operators $\hat a$, $\hat a^+$, $\hat b$, $\hat b^+$ anticommute (see QM, §65), since then the Hamiltonian (25.3) may be written

$$\hat H = \sum_{\mathbf{p}\sigma}\varepsilon\left(\hat a^+_{\mathbf{p}\sigma}\hat a_{\mathbf{p}\sigma} + \hat b^+_{\mathbf{p}\sigma}\hat b_{\mathbf{p}\sigma} - 1\right),$$

and the energy eigenvalues are (with the usual omission of an infinite additive constant)

$$E = \sum_{\mathbf{p}\sigma}\varepsilon\left(N_{\mathbf{p}\sigma} + \bar N_{\mathbf{p}\sigma}\right), \tag{25.5}$$

and are positive-definite, as they should be. With Bose quantization, we should obtain from (25.3) the eigenvalues $\sum\varepsilon(N_{\mathbf{p}\sigma} - \bar N_{\mathbf{p}\sigma})$, which are not positive-definite and have no meaning.

An expression analogous to (25.5) is obtained for the momentum of the system, i.e. the eigenvalues of the operator $\int\hat\psi^*\hat{\mathbf{p}}\hat\psi\,d^3x$:

$$\mathbf{P} = \sum_{\mathbf{p}\sigma}\mathbf{p}\left(N_{\mathbf{p}\sigma} + \bar N_{\mathbf{p}\sigma}\right). \tag{25.6}$$

The 4-current operator is

$$\hat j^\mu = \hat{\bar\psi}\gamma^\mu\hat\psi, \tag{25.7}$$

and the "charge" operator of the field is found to be

$$\hat Q = \int \hat{\bar\psi}\gamma^0\hat\psi\,d^3x = \sum_{\mathbf{p}\sigma}\left(\hat a^+_{\mathbf{p}\sigma}\hat a_{\mathbf{p}\sigma} + \hat b_{\mathbf{p}\sigma}\hat b^+_{\mathbf{p}\sigma}\right) = \sum_{\mathbf{p}\sigma}\left(\hat a^+_{\mathbf{p}\sigma}\hat a_{\mathbf{p}\sigma} - \hat b^+_{\mathbf{p}\sigma}\hat b_{\mathbf{p}\sigma} + 1\right); \tag{25.8}$$

its eigenvalues are

$$Q = \sum_{\mathbf{p}\sigma}\left(N_{\mathbf{p}\sigma} - \bar N_{\mathbf{p}\sigma}\right). \tag{25.9}$$

Thus we again arrive at the concept of particles and antiparticles, and the whole of the discussion of these in §11 is applicable. But particles with spin $\tfrac12$ are fermions, whereas those with spin zero are bosons. An examination of the formal origin of this difference shows that it is due to the different nature of the expressions for the "energy density" in the scalar and spinor fields. In the scalar field the expression is positive-definite, and the terms $\hat a^+\hat a$ and $\hat b\hat b^+$ therefore both have a positive sign in the Hamiltonian (11.3). If the energy eigenvalues are positive, the replacement of $\hat b\hat b^+$ by $\hat b^+\hat b$ must occur without change of sign, i.e. in accordance with the Bose commutation rule. For the spinor field, however, the "energy density" is not a positive-definite quantity, and hence the term $\hat b\hat b^+$ appears with the minus sign in the Hamiltonian (25.3); to obtain positive eigenvalues, the replacement of $\hat b\hat b^+$ by $\hat b^+\hat b$ must be accompanied by a change of sign, i.e. must occur in accordance with the Fermi commutation rule.

The form of the energy density is directly related to the transformation properties of the wave function and to the requirements of relativistic invariance. In this sense we may say that the relation between the spin and the statistics obeyed by the particles is likewise a direct consequence of these requirements.
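The step from (25.3) to (25.5) can be illustrated on a toy model with a single particle mode and a single antiparticle mode. The sketch below is an illustration only and is not part of the text's argument: the operators are represented as 4x4 real matrices by a Jordan-Wigner construction, which is our own assumption, as are all the names. It checks the Fermi rules (25.4) and confirms that $\varepsilon(\hat a^+\hat a - \hat b\hat b^+)$ coincides with $\varepsilon(\hat a^+\hat a + \hat b^+\hat b - 1)$.

```python
# Toy check of Fermi quantization for one particle mode (a) and one
# antiparticle mode (b). The matrix construction (Jordan-Wigner) and all
# names are illustrative assumptions, not taken from the text.

I2 = [[1, 0], [0, 1]]
SZ = [[1, 0], [0, -1]]
LOWER = [[0, 1], [0, 0]]   # single-mode annihilator in the basis (empty, occupied)


def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def kron(a, b):
    """Kronecker product of two 2x2 matrices."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]


A = kron(LOWER, I2)    # particle annihilation operator
B = kron(SZ, LOWER)    # antiparticle annihilation operator (with sign string)
I4 = kron(I2, I2)
ZERO4 = scale(0, I4)


def anticommutator(x, y):
    return add(mul(x, y), mul(y, x))


def hamiltonian(eps):
    """eps (a^+ a - b b^+), the form (25.3) for a single pair of modes."""
    return scale(eps, add(mul(transpose(A), A), scale(-1, mul(B, transpose(B)))))


def reordered(eps):
    """eps (a^+ a + b^+ b - 1), obtained with the Fermi rule (25.4)."""
    n_a, n_b = mul(transpose(A), A), mul(transpose(B), B)
    return scale(eps, add(add(n_a, n_b), scale(-1, I4)))


def energies(eps):
    """Diagonal of H plus the dropped constant eps: eps (N + N-bar) per state."""
    h = hamiltonian(eps)
    return [h[k][k] + eps for k in range(4)]


print(anticommutator(A, transpose(A)) == I4, anticommutator(A, B) == ZERO4)
print(energies(2.0))
```

In this representation the matrices are real, so Hermitian conjugation reduces to transposition; the shifted energies are non-negative for every occupation pattern, as (25.5) requires.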
The fact that particles with spin $\tfrac12$ are fermions also leads to a general conclusion: all particles with half-integral spin are fermions, and those with integral spin are bosons (including those with spin zero, as demonstrated in §11).†

† The origin of the relation between the spin of a particle and the statistics which it obeys was first elucidated by W. Pauli (1940).

This is evident because a particle with spin $s$ may be regarded as "composed" of $2s$ particles with spin $\tfrac12$. When $s$ is half-integral, $2s$ is odd; when $s$ is integral, $2s$ is even. A "composite" particle containing an even number of fermions is a boson, and one containing an odd number of fermions is a fermion.‡

If a system consists of particles of various kinds, then creation and annihilation operators must be defined separately for each kind of particle. The operators pertaining to different bosons, or to bosons and fermions, commute. Operators pertaining to different fermions may be regarded, in the non-relativistic theory, as either commuting or anticommuting (QM, §65). In the relativistic theory, which allows transformations of particles into one another, the creation and annihilation operators of different fermions must be regarded as anticommuting, like those which pertain to different states of the same fermions.

PROBLEM

Find the Lagrangian of the spinor field.

SOLUTION. The Lagrangian corresponding to Dirac's equation is given by the real scalar expression

$$L = \tfrac12 i\left(\bar\psi\gamma^\mu\partial_\mu\psi - \partial_\mu\bar\psi\cdot\gamma^\mu\psi\right) - m\bar\psi\psi. \tag{1}$$

Taking the components of $\psi$ and $\bar\psi$ as the "generalized coordinates" $q$, we easily see that the corresponding Lagrange's equations (10.10) are the same as Dirac's equations for $\bar\psi$ and $\psi$.

The overall sign of the Lagrangian (like the common factor in it) is arbitrary. Since $L$ involves the derivatives of $\psi$ and $\bar\psi$ linearly, the action $S = \int L\,d^4x$ can in any case have no minimum or maximum. The condition $\delta S = 0$ here gives only a stationary point of the integral, not an extremum.
The Lagrangian of the spinor field is obtained by replacing $\psi$ in (1) by the operator $\hat\psi$. Applying formula (12.12) to this Lagrangian, we find the current operator (25.7).

‡ In this argument it is assumed that all particles with the same spin obey the same statistics (whatever the way in which they are "compounded"). The truth of this assumption is seen by analogous arguments. For example, if there existed fermions with spin zero, then a fermion with spin zero and one with spin $\tfrac12$ would yield a particle with spin $\tfrac12$ which would be a boson, in contradiction with the general result demonstrated for spin $\tfrac12$.

§26. Charge conjugation and time reversal of spinors

The coefficients $\psi_{p\sigma} = u_{p\sigma}e^{-ipx}$ which appear with the operators $\hat a_{\mathbf{p}\sigma}$ in (25.1) are the wave functions of free particles (electrons, say) having momenta $\mathbf{p}$ and polarizations $\sigma$:

$$\psi^{(e)}_{p\sigma} = \psi_{p\sigma}.$$

The coefficients $\bar\psi_{-p,-\sigma}$ of the operators $\hat b_{\mathbf{p}\sigma}$ are to be regarded as the wave functions of positrons having the same $\mathbf{p}$ and $\sigma$. It is found, however, that the electron and positron functions are expressed in different bispinor representations. This is evident from the fact that $\psi$ and $\bar\psi$ differ in their transformation properties and their components satisfy different sets of equations. To eliminate this defect, it is necessary to carry out a certain unitary transformation of the components $\bar\psi_{-p,-\sigma}$, such that the new four-component function satisfies the same equation as $\psi_{p\sigma}$.† This will be referred to as the wave function of the positron (with momentum $\mathbf{p}$ and polarization $\sigma$). Denoting the matrix of the required unitary transformation by $U_C$, we may write

$$\psi^{(p)}_{p\sigma} = U_C\bar\psi_{-p,-\sigma}. \tag{26.1}$$

The operation $C$ whereby this function is obtained from $\psi_{-p,-\sigma}$ is called charge conjugation of the wave function (H. A. Kramers, 1937).
This concept is, of course, not restricted to its application to plane waves: for any function $\psi$, there exists a "charge-conjugate" function

$$C\psi(t,\mathbf r)=U_C\bar\psi(t,\mathbf r), \tag{26.2}$$

which has the same transformation properties as $\psi$ and satisfies the same equation.

† For particles with spin zero, this problem did not arise, since the scalar functions $\psi$ and $\psi^*$ satisfy the same equation, and $\psi^*_{-p}$ is identical with $\psi_p$.

The properties of the matrix $U_C$ follow from this definition. If $\psi$ is a solution of Dirac's equation $(\gamma p-m)\psi=0$, then $\bar\psi$ satisfies the equation $\bar\psi(\gamma p+m)=0$, or $(\tilde\gamma p+m)\bar\psi=0$, where the tilde denotes the transposed matrix. Multiplying this equation on the left by $U_C$:

$$U_C\tilde\gamma p\,\bar\psi+mU_C\bar\psi=0,$$

we apply the condition that the function $U_C\bar\psi$ satisfies the same equation as $\psi$:

$$(\gamma p-m)U_C\bar\psi=0.$$

A comparison of the two equations gives the following "commutation relation" between $U_C$ and the matrices $\gamma^\mu$:‡

$$U_C\tilde\gamma^\mu=-\gamma^\mu U_C. \tag{26.3}$$

‡ From this there follows also the equation

$$U_C\tilde\gamma^5=\gamma^5U_C. \tag{26.3a}$$

We shall further suppose that the wave functions are stated in the spinor or standard representation; the general case of any representation will be considered only at the end of this section. In these representations, we have

$$\tilde\gamma^{0,2}=\gamma^{0,2},\qquad \tilde\gamma^{1,3}=-\gamma^{1,3}. \tag{26.4}$$

Then the condition (26.3) is satisfied by the matrix $U_C=\eta_C\gamma^2\gamma^0$, the constant $\eta_C$ being arbitrary. The condition $C^2=1$ shows that $|\eta_C|^2=1$, and the matrix $U_C$ is therefore determined apart from a phase factor. We shall take $\eta_C=1$; thus

$$U_C=\gamma^2\gamma^0=-\alpha_y. \tag{26.5}$$

Noting also that $\bar\psi=\psi^*\gamma^0=\tilde\gamma^0\psi^*=\gamma^0\psi^*$, we may write the effect of the operator $C$ as

$$C\psi=\gamma^2\gamma^0\bar\psi=\gamma^2\psi^*. \tag{26.6}$$

The explicit form of the transformation (26.6) for the spinor representation is

$$C:\quad \xi^\alpha\to-i\eta^{\dot\alpha*},\qquad \eta_{\dot\alpha}\to-i\xi_\alpha^{*}, \tag{26.7a}$$

or, equivalently,

$$C:\quad \xi_\alpha\to-i\eta_{\dot\alpha}^{*},\qquad \eta^{\dot\alpha}\to-i\xi^{\alpha*}. \tag{26.7b}$$

The charge-conjugation transformation for the plane waves $\psi_{\pm\mathbf p\sigma}$ is easily carried out by using the explicit expressions (23.9) for the plane waves and the matrix $U_C$ in the standard representation:

$$U_C=-\alpha_y=\begin{pmatrix}0&-\sigma_y\\-\sigma_y&0\end{pmatrix}.$$
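(26.8)

Since $\sigma_y\tilde{\boldsymbol\sigma}=-\boldsymbol\sigma\sigma_y$, we have, if $w^{(\sigma)\prime}$ is defined as in (23.16),

$$U_C\bar u_{-\mathbf p-\sigma}=u_{\mathbf p\sigma},\qquad U_Cu_{-\mathbf p-\sigma}=\bar u_{\mathbf p\sigma}. \tag{26.9}$$

Thus

$$C\psi_{-\mathbf p-\sigma}=\psi_{\mathbf p\sigma}, \tag{26.10}$$

so that the functions $\psi_{-\mathbf p-\sigma}$ which appear with the operators $\hat b^{+}_{\mathbf p\sigma}$ in the $\psi$-operators (25.1) do in fact correspond to states of a particle having momentum $\mathbf p$ and polarization $\sigma$. We see also that the electron and positron states are described by the same functions:

$$\psi^{(e)}_{\mathbf p\sigma}=\psi^{(p)}_{\mathbf p\sigma}=\psi_{\mathbf p\sigma}.$$

This is to be expected, since the functions $\psi_{\mathbf p\sigma}$ embody information only as to the momentum and polarization of the particle.

The operation of time reversal may be treated similarly. When the sign of the time is changed, the wave function must change to the complex conjugate. In order to obtain the "time-reversed" wave function $(T\psi)$ in the same representation as the original $\psi$, we must also perform some unitary transformation on the components of $\psi^*$ (or $\bar\psi$). Thus the action of the operator $T$ on $\psi$ can be written, similarly to (26.2), as

$$T\psi(t,\mathbf r)=U_T\bar\psi(-t,\mathbf r), \tag{26.11}$$

where $U_T$ is a unitary matrix.

Dirac's equation satisfied by $\psi$ is

$$\left(i\gamma^0\frac{\partial}{\partial t}+i\boldsymbol\gamma\cdot\nabla-m\right)\psi(t,\mathbf r)=0,$$

and the equation for $\bar\psi$ is

$$\left(i\tilde\gamma^0\frac{\partial}{\partial t}+i\tilde{\boldsymbol\gamma}\cdot\nabla+m\right)\bar\psi(t,\mathbf r)=0.$$

In the latter equation we change $t$ into $-t$ and multiply on the left by $-U_T$:

$$\left(iU_T\tilde\gamma^0\frac{\partial}{\partial t}-iU_T\tilde{\boldsymbol\gamma}\cdot\nabla\right)\bar\psi(-t,\mathbf r)-mU_T\bar\psi(-t,\mathbf r)=0.$$

We want the function $U_T\bar\psi(-t,\mathbf r)$ to satisfy the same equation as $\psi(t,\mathbf r)$:

$$\left(i\gamma^0\frac{\partial}{\partial t}+i\boldsymbol\gamma\cdot\nabla\right)U_T\bar\psi(-t,\mathbf r)-mU_T\bar\psi(-t,\mathbf r)=0.$$

Comparing the two equations, we find that the matrix $U_T$ must satisfy the conditions

$$U_T\tilde\gamma^0=\gamma^0U_T,\qquad U_T\tilde{\boldsymbol\gamma}=-\boldsymbol\gamma U_T. \tag{26.12}$$

In the spinor and standard representations, these conditions are satisfied by the matrix†

$$U_T=i\gamma^3\gamma^1\gamma^0. \tag{26.13}$$

Thus the effect of the operator $T$ is given by

$$T\psi(t,\mathbf r)=i\gamma^3\gamma^1\gamma^0\bar\psi(-t,\mathbf r)=i\gamma^3\gamma^1\psi^*(-t,\mathbf r). \tag{26.14}$$

The explicit form of this transformation for the spinor representation is

$$T:\quad \xi^\alpha\to-i\xi_\alpha^{*},\qquad \eta_{\dot\alpha}\to i\eta^{\dot\alpha*}.$$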
(26.15a)

† The choice of the phase factor in (26.13) depends on that in (26.5); see the second footnote to §27.

or

$$T:\quad \xi_\alpha\to i\xi^{\alpha*},\qquad \eta^{\dot\alpha}\to-i\eta_{\dot\alpha}^{*}. \tag{26.15b}$$

In the standard representation,

$$U_T=\begin{pmatrix}\sigma_y&0\\0&-\sigma_y\end{pmatrix}. \tag{26.16}$$

To find the effect on $\psi$ of all three operations $P$, $T$ and $C$, we write successively

$$T\psi(t,\mathbf r)=-i\gamma^1\gamma^3\psi^*(-t,\mathbf r),$$
$$PT\psi(t,\mathbf r)=i\gamma^0(T\psi)(t,-\mathbf r)=\gamma^0\gamma^1\gamma^3\psi^*(-t,-\mathbf r),$$
$$CPT\psi(t,\mathbf r)=\gamma^2(\gamma^0\gamma^1\gamma^3\psi^*)^*=\gamma^2\gamma^0\gamma^1\gamma^3\psi(-t,-\mathbf r),$$

or

$$CPT\psi(t,\mathbf r)=i\gamma^5\psi(-t,-\mathbf r). \tag{26.17}$$

In the spinor representation,

$$CPT:\quad \xi^\alpha\to-i\xi^\alpha,\qquad \eta_{\dot\alpha}\to i\eta_{\dot\alpha}, \tag{26.18}$$

as is also easily seen directly from the transformation rules (20.4), (26.7) and (26.15).‡

‡ The notation $CPT$ implies that the operators act in the sequence from right to left. The sign of (26.17) and (26.18) depends on this sequence, since $T$ does not commute with $C$ and $P$ (as regards their action on a bispinor).

The expressions given above for the matrices $U_C$ and $U_T$ assume the spinor or standard representation of $\psi$. Let us finally see which properties of these expressions are retained for any representation of $\psi$.

If $\psi$ is subjected to a unitary transformation:

$$\psi'=U\psi,\qquad \gamma'=U\gamma U^{-1},\qquad \bar\psi'=\psi'^{*}\gamma'^{0}=\bar\psi U^{-1}=\tilde U^{-1}\bar\psi, \tag{26.19}$$

then in the new representation we have

$$(C\psi)'=U(C\psi)=UU_C\bar\psi=UU_C(\bar\psi'U)=UU_C\tilde U\bar\psi'.$$

A comparison with the definition of the matrix $U_C$ in the new representation, $(C\psi)'=U_C'\bar\psi'$, shows that

$$U_C'=UU_C\tilde U. \tag{26.20}$$

The transformation (26.20) is the same as that of the matrices $\gamma$ only if $U$ is real. The expression (26.5) also is therefore valid only in representations which are a real transformation of the spinor or standard representation.

The matrix (26.5) is unitary, and changes sign when transposed:

$$U_CU_C^{+}=1,\qquad \tilde U_C=-U_C. \tag{26.21}$$

These properties are unaffected by the transformation (26.20), and are therefore retained in any representation. The matrix (26.5) is also Hermitian ($U_C^{+}=U_C$), but this property is in general not preserved by the transformation (26.20).
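The matrix relations of this section lend themselves to a direct numerical check. The following is only a sketch, assuming the standard representation with $\gamma^0=\mathrm{diag}(1,1,-1,-1)$, $\boldsymbol\gamma=\begin{pmatrix}0&\boldsymbol\sigma\\-\boldsymbol\sigma&0\end{pmatrix}$ and the sign convention $\gamma^5=-i\gamma^0\gamma^1\gamma^2\gamma^3$; the helper names (`standard_gammas`, `satisfies_26_3`, and so on) are illustrative and not part of the text.

```python
import numpy as np

# Pauli matrices and 2x2 helper blocks
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def standard_gammas():
    """Return [gamma^0, ..., gamma^3] in the (assumed) standard representation."""
    g0 = np.block([[I2, Z2], [Z2, -I2]])
    return [g0] + [np.block([[Z2, s], [-s, Z2]]) for s in SIGMA]


G = standard_gammas()
U_C = G[2] @ G[0]                          # matrix (26.5)
U_T = 1j * G[3] @ G[1] @ G[0]              # matrix (26.13)
GAMMA5 = -1j * G[0] @ G[1] @ G[2] @ G[3]   # assumed sign convention for gamma^5


def satisfies_26_3(U):
    """Condition (26.3): U gamma~^mu = -gamma^mu U for all four matrices."""
    return all(np.allclose(U @ g.T, -g @ U) for g in G)


def satisfies_26_12(U):
    """Conditions (26.12): U gamma~^0 = gamma^0 U and U gamma~^k = -gamma^k U."""
    time_ok = np.allclose(U @ G[0].T, G[0] @ U)
    space_ok = all(np.allclose(U @ g.T, -g @ U) for g in G[1:])
    return time_ok and space_ok


def unitary_and_antisymmetric(U):
    """Properties (26.21): U U^+ = 1 and U~ = -U."""
    return np.allclose(U @ U.conj().T, np.eye(4)) and np.allclose(U.T, -U)


print(satisfies_26_3(U_C), satisfies_26_12(U_T))
print(unitary_and_antisymmetric(U_C), unitary_and_antisymmetric(U_T))
# Matrix product in the CPT chain leading to (26.17)
print(np.allclose(G[2] @ G[0] @ G[1] @ G[3], 1j * GAMMA5))
```

A check of this kind only confirms the algebra in one representation; the representation-independent statements still rest on the transformation law (26.20).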
The above discussion and formulae (26.21) apply likewise to the properties of the matrix $U_T$.

In the second quantization formalism, the transformations $C$, $P$, $T$ for $\psi$-operators must be formulated as transformation rules for the particle creation and annihilation operators. These rules can be established (as in §13 for particles with spin zero) from the condition that the transformed $\psi$-operators may be written

$$\hat\psi^{C}(t,\mathbf r)=U_C\hat{\bar\psi}(t,\mathbf r),\qquad \hat\psi^{P}(t,\mathbf r)=i\gamma^0\hat\psi(t,-\mathbf r),\qquad \hat\psi^{T}(t,\mathbf r)=U_T\hat{\bar\psi}(-t,\mathbf r). \tag{26.22}$$

PROBLEM

Find the charge-conjugation operator in the Majorana representation (§21, Problem 2).

SOLUTION. The matrix $U_C'$ in the Majorana representation is obtained from the matrix $U_C=-\alpha_y$ in the standard representation by the transformation (26.20) with $U=(\alpha_y+\beta)/\sqrt2$, and is $U_C'=-\alpha_y$ (where $\alpha_y$ and $\beta$ denote matrices in the standard representation). If primes denote quantities in the Majorana representation, we have $C\psi'=U_C'\bar\psi'=U_C'(\psi'^{*}\beta')$, and since $\beta'=\alpha_y$,

$$C\psi'=-\alpha_y(\psi'^{*}\alpha_y)=\alpha_y\alpha_y\psi'^{*}=\psi'^{*},$$

i.e. charge conjugation is the same as complex conjugation.

§ 27. Internal symmetry of particles and antiparticles

The wave function of a particle with spin $\tfrac12$, in its rest frame, is a single three-dimensional spinor, which will be denoted by $\Phi^\alpha$. The behaviour of this spinor under inversion is related to the concept of the internal parity of the particle. However, as mentioned in §19, although the two possible laws of transformation of three-dimensional spinors ($\Phi^\alpha\to\pm i\Phi^\alpha$) are not equivalent, there is no absolute significance in assigning a particular parity to a spinor. We therefore cannot speak of the internal parity of any one particle with spin $\tfrac12$, but we can refer to the relative internal parity of two such particles.

From two (three-dimensional) spinors $\Phi^{(1)}$ and $\Phi^{(2)}$, a scalar $\Phi^{(1)\alpha}\Phi^{(2)}_\alpha$ can be formed. If this is a true scalar, the particles described by the spinors are said to have the same internal parity; if it is a pseudoscalar, they are said to have opposite internal parities.
We shall show that particles and antiparticles (with spin $\tfrac12$) have opposite parities (V. B. Berestetskiĭ, 1948).

Firstly, if the operation $C$ (26.7) is applied to both sides of the $P$ transformation (19.5) (in the spinor representation)

$$P:\quad \xi^\alpha\to i\eta_{\dot\alpha},\qquad \eta_{\dot\alpha}\to i\xi^\alpha, \tag{27.1}$$

we obtain

$$\eta^{c\dot\alpha*}\to i\xi^{c*}_{\alpha},\qquad \xi^{c*}_{\alpha}\to i\eta^{c\dot\alpha*},$$

where the index $c$ marks the components of the bispinor $\psi^c=\begin{pmatrix}\xi^c\\\eta^c\end{pmatrix}$ charge-conjugate to $\psi=\begin{pmatrix}\xi\\\eta\end{pmatrix}$. Taking the complex conjugate and interchanging the positions of the indices, we find

$$P:\quad \xi^{c\alpha}\to i\eta^{c}_{\dot\alpha},\qquad \eta^{c}_{\dot\alpha}\to i\xi^{c\alpha}. \tag{27.2}$$

Thus charge-conjugate bispinors are transformed in the same manner by inversion.

Let $\psi^{(e)}$ be the wave function of a particle (say an electron) and $\psi^{(p)}$ that of the antiparticle (a positron). The latter is a bispinor which is the charge conjugate of a "negative-frequency" solution of Dirac's equation. In the rest frame, each function becomes a three-dimensional spinor:

$$\xi^{(e)\alpha}=\eta^{(e)}_{\dot\alpha}\equiv\Phi^{(e)\alpha},\qquad \xi^{(p)\alpha}=\eta^{(p)}_{\dot\alpha}\equiv\Phi^{(p)\alpha}.$$

According to (27.1), (27.2), these spinors are transformed as follows by inversion:

$$\Phi^\alpha\to i\Phi^\alpha, \tag{27.3}$$

the same for both $\Phi^{(e)}$ and $\Phi^{(p)}$. The product $\Phi^{(e)}\Phi^{(p)}$, however, changes sign, and this proves the result stated above.

A strictly neutral particle is one which coincides with its antiparticle (§12). The $\psi$-operator of a field of such particles satisfies the condition $\hat\psi(t,\mathbf r)=\hat\psi^{C}(t,\mathbf r)$. For particles with spin $\tfrac12$, this implies the conditions (in the spinor representation)†

$$\hat\xi^\alpha=-i\hat\eta^{\dot\alpha+},\qquad \hat\eta_{\dot\alpha}=-i\hat\xi_\alpha^{+}. \tag{27.4}$$

Like any relation which expresses a physical property, these conditions are invariant under the transformation $CPT$.‡

† In the Majorana representation, strict neutrality implies simply that the operator $\hat\psi'$ is Hermitian; see §26, Problem.

‡ More precisely, the transformation $CPT$ must here be defined so as to leave relations such as (27.4) unchanged. This is achieved by an appropriate choice of the phase factor in the definition of the matrix $U_T$; see the third footnote to §26.

It is easily verified that they are in fact
invariant not only with respect to $CPT$ but also with respect to each of the three transformations separately.

In §19, inversion of spinors was defined to be a transformation for which $P^2=-1$, and this definition has been used so far. The result derived above concerning the relative parity of particles and antiparticles is easily seen to be, as it should be, independent of the way in which inversion is defined. If inversion is defined by the condition $P^2=1$, then (27.1) becomes

$$P:\quad \xi^\alpha\to\eta_{\dot\alpha},\qquad \eta_{\dot\alpha}\to\xi^\alpha. \tag{27.5}$$

The charge-conjugate function is then transformed according to

$$\xi^{c\alpha}\to-\eta^{c}_{\dot\alpha},\qquad \eta^{c}_{\dot\alpha}\to-\xi^{c\alpha},$$

which differs in sign from (27.5). Accordingly, the three-dimensional spinors $\Phi$ will be transformed thus:

$$\Phi^{(e)\alpha}\to\Phi^{(e)\alpha},\qquad \Phi^{(p)\alpha}\to-\Phi^{(p)\alpha},$$

and the product $\Phi^{(e)}\Phi^{(p)}$ will again be a pseudoscalar.

The only possible difference in the physical consequences of the two views of inversion is that with the definition (27.5) the condition for a strictly neutral field would not be invariant under this transformation (or the transformation $CP$), which would alter the relative sign of the sides of the equations (27.4). Actually, no strictly neutral particle with spin $\tfrac12$ is known, and we cannot yet say whether this difference between the two definitions of inversion has any real physical meaning.†

PROBLEM

Find the charge parity of positronium (a hydrogen-like system consisting of an electron and a positron).

SOLUTION. The wave function of two fermions must be antisymmetric with respect to simultaneous interchange of coordinates, spins and charge variables of the particles (cf. §13, Problem). The interchange of the coordinates multiplies the function by $(-1)^l$, that of the spins by $(-1)^{1+S}$, where $S$ (= 0 or 1) is the total spin of the system, and that of the charge variables by the required parity $C$. The condition $(-1)^l(-1)^{1+S}C=-1$ gives $C=(-1)^{l+S}$.
Since the internal parities of the electron and the positron are opposite, the spatial parity of the system is $P=(-1)^{l+1}$. The combined parity is $CP=(-1)^{S+1}$.

† The incomplete equivalence of the two definitions of inversion was first noted by G. Racah (1937).

§ 28. Bilinear forms

Let us consider the transformation properties of various bilinear forms which can be constructed from components of the functions $\psi$ and $\psi^*$. Such forms are of great importance in quantum mechanics. They include the current density 4-vector (21.11).

Since $\psi$ and $\psi^*$ have four components each, a total of $4\times4=16$ independent bilinear combinations can be formed from them. The classification of the transformation properties of these is evident from the ways listed in §19 of multiplying any two bispinors (in this case $\psi$ and $\psi^*$). We can form a scalar (denoted by $S$), a pseudoscalar ($P$), a mixed spinor of rank two equivalent to a true 4-vector $V^\mu$ (four independent quantities), a mixed spinor of rank two equivalent to a 4-pseudovector $A^\mu$ (four quantities), and a bispinor of rank two equivalent to an antisymmetric 4-tensor $T^{\mu\nu}$ (six quantities).

In a symmetrical form (for any representation of $\psi$), these combinations may be written

$$S=\bar\psi\psi,\qquad P=i\bar\psi\gamma^5\psi,\qquad V^\mu=\bar\psi\gamma^\mu\psi,\qquad A^\mu=\bar\psi\gamma^\mu\gamma^5\psi,\qquad T^{\mu\nu}=i\bar\psi\sigma^{\mu\nu}\psi, \tag{28.1}$$

where

$$\sigma^{\mu\nu}=\tfrac12(\gamma^\mu\gamma^\nu-\gamma^\nu\gamma^\mu)=(\boldsymbol\alpha,\,i\boldsymbol\Sigma); \tag{28.2}$$

the components in (28.2) are stated as in (19.15).† All the expressions given above are real.

The fact that $S$ is a scalar and $P$ a pseudoscalar is evident from their spinor representations:

$$S=\xi^*\eta+\eta^*\xi,\qquad P=i(\xi^*\eta-\eta^*\xi),$$

which agree with (19.7) and (19.8). The fact that the $V^\mu$ form a vector is then evident from Dirac's equation: multiplying the equation $p_\mu\gamma^\mu\psi=m\psi$ on the left by $\bar\psi$ we obtain

$$(\bar\psi\gamma^\mu\psi)\,p_\mu=m\bar\psi\psi.$$

Since the right-hand side is a scalar, so is the left-hand side.
The rule whereby the quantities (28.1) are obtained is obvious: they are constructed as if the matrices $\gamma^\mu$ formed a 4-vector, $\gamma^5$ were a pseudoscalar, and the $\bar\psi$ and $\psi$ on either side together formed a scalar.‡ The non-existence of bilinear forms which are symmetrical 4-tensors is seen from the spinor representation and also from this rule: since the symmetrical combination of matrices is $\gamma^\mu\gamma^\nu+\gamma^\nu\gamma^\mu=2g^{\mu\nu}$, any such form would reduce to a scalar.

† For a unitary transformation of $\psi$ (change of representation), we have $\psi\to U\psi$, $\bar\psi\to\bar\psi U^{-1}$, $\gamma\to U\gamma U^{-1}$, and the invariance of the bilinear forms under such transformations is obvious.

‡ The "pseudoscalar" nature of $\gamma^5$ is itself in accordance with these rules, since $\gamma^5=\tfrac{i}{24}e_{\lambda\mu\nu\rho}\gamma^\lambda\gamma^\mu\gamma^\nu\gamma^\rho$.

The second-quantized bilinear forms are obtained by replacing the $\psi$-functions in (28.1) by $\psi$-operators. For greater generality, we shall assume that the two $\psi$-operators relate to fields of different particles, denoted by suffixes $a$ and $b$. Let us see how such operator forms are transformed under charge conjugation. We have†

$$\hat\psi^{C}=U_C\hat{\bar\psi},\qquad \hat{\bar\psi}{}^{C}=U_C^{+}\hat\psi, \tag{28.3}$$

and therefore, using (26.3) and (26.21),

$$\hat{\bar\psi}{}^{C}_a\hat\psi^{C}_b=-\hat\psi_aU_C^{+}U_C\hat{\bar\psi}_b=-\hat\psi_a\hat{\bar\psi}_b,$$
$$\hat{\bar\psi}{}^{C}_a\gamma^\mu\hat\psi^{C}_b=-\hat\psi_aU_C^{+}\gamma^\mu U_C\hat{\bar\psi}_b=\hat\psi_a\tilde\gamma^\mu\hat{\bar\psi}_b.$$

When the operators are restored to their original order ($\hat{\bar\psi}$ to the left of $\hat\psi$), the Fermi commutation rules (25.4) show that the sign of the product is changed (and moreover terms appear which are independent of the state of the field; we omit these, as in the corresponding treatment in §13). Thus we have

$$\hat{\bar\psi}{}^{C}_a\hat\psi^{C}_b=\hat{\bar\psi}_b\hat\psi_a,\qquad \hat{\bar\psi}{}^{C}_a\gamma^\mu\hat\psi^{C}_b=-\hat{\bar\psi}_b\gamma^\mu\hat\psi_a.$$

Proceeding similarly with the other forms, we find the results for charge conjugation‡ $C$:
$$S_{ab}\to S_{ba},\qquad P_{ab}\to P_{ba},\qquad V^\mu_{ab}\to-V^\mu_{ba},\qquad A^\mu_{ab}\to A^\mu_{ba},\qquad T^{\mu\nu}_{ab}\to-T^{\mu\nu}_{ba}. \tag{28.4}$$

† To derive the second of the equations (28.3) from the first, we write

$$\hat{\bar\psi}{}^{C}=\tilde\gamma^0(U_C\hat{\bar\psi})^{*}=\tilde\gamma^0U_C^{*}\gamma^0\hat\psi=-\tilde\gamma^0U_C^{+}\gamma^0\hat\psi=U_C^{+}\gamma^0\gamma^0\hat\psi=U_C^{+}\hat\psi,$$

using (26.3), (26.21) and the fact that $\gamma^0$ is Hermitian.

‡ It should be noticed that, for bilinear forms constructed from $\psi$-functions (not $\psi$-operators), the transformations (28.4) would have the opposite signs, since the return to the original order of the factors $\bar\psi$ and $\psi$ would not involve a change of sign.

The behaviour of these forms under time reversal may be ascertained similarly, remembering (see §13) that this operation brings about a change in the order of the operators, so that, for example,

$$(\hat{\bar\psi}_a\hat\psi_b)^{T}=\hat\psi^{T}_b\hat{\bar\psi}{}^{T}_a.$$

Substituting here

$$\hat\psi^{T}=U_T\hat{\bar\psi},\qquad \hat{\bar\psi}{}^{T}=-U_T^{+}\hat\psi, \tag{28.5}$$

we obtain

$$(\hat{\bar\psi}_a\hat\psi_b)^{T}=-\hat{\bar\psi}_b\tilde U_TU_T^{+}\hat\psi_a=\hat{\bar\psi}_b\hat\psi_a.$$

Treating the other forms in the same way, we obtain

$$T:\quad S_{ab}\to S_{ba},\quad P_{ab}\to-P_{ba},\quad (V^0,\mathbf V)_{ab}\to(V^0,-\mathbf V)_{ba},\quad (A^0,\mathbf A)_{ab}\to(A^0,-\mathbf A)_{ba},\quad T^{\mu\nu}_{ab}=(\mathbf p,\mathbf a)_{ab}\to(\mathbf p,-\mathbf a)_{ba}, \tag{28.6}$$

where $\mathbf p$ and $\mathbf a$ are three-dimensional vectors equivalent to the components of $T^{\mu\nu}$ as shown in (19.15).

Under spatial inversion we have, in accordance with the tensor properties,†

$$P:\quad S_{ab}\to S_{ab},\quad P_{ab}\to-P_{ab},\quad (V^0,\mathbf V)_{ab}\to(V^0,-\mathbf V)_{ab},\quad (A^0,\mathbf A)_{ab}\to(-A^0,\mathbf A)_{ab},\quad T^{\mu\nu}_{ab}=(\mathbf p,\mathbf a)_{ab}\to(-\mathbf p,\mathbf a)_{ab}. \tag{28.7}$$

Finally, the simultaneous application of all three operations leaves all the $S_{ab}$, $P_{ab}$ and $T^{\mu\nu}_{ab}$ unchanged, and changes the sign of all the $V^\mu_{ab}$ and $A^\mu_{ab}$, in agreement with the fact that this transformation is a 4-inversion: since 4-inversion is equivalent to a rotation of the 4-coordinates, it creates no difference between true tensors and pseudotensors of any rank.

Let us now consider products of pairs of bilinear forms constructed from four different functions $\psi^a$, $\psi^b$, $\psi^c$, $\psi^d$. The result depends on which pairs of functions are multiplied together. It is possible, however, to reduce any such product to products of bilinear forms with specified pairs of factors (W. Pauli and M. Fierz, 1936).
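The reduction rests on the completeness of the sixteen matrices (28.8), and both the orthogonality relation in (28.9) and the resulting identity (28.12) can be tried out numerically. The sketch below assumes the standard representation and the convention $\gamma^5=-i\gamma^0\gamma^1\gamma^2\gamma^3$, takes the six tensor matrices as $i\sigma^{\mu\nu}$ with $\mu<\nu$, and uses random c-number bispinors; the function names are hypothetical.

```python
import numpy as np
from itertools import combinations

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
# Standard-representation gamma matrices (assumed convention)
G = [np.block([[I2, Z2], [Z2, -I2]])] + [np.block([[Z2, s], [-s, Z2]]) for s in SIGMA]
G5 = -1j * G[0] @ G[1] @ G[2] @ G[3]
METRIC = [1, -1, -1, -1]  # diagonal of g_{mu nu}


def gamma_basis():
    """Return the 16 matrices of (28.8) with upper and with lowered tensor indices."""
    upper = [np.eye(4, dtype=complex), G5]
    lower = [np.eye(4, dtype=complex), G5]
    for m in range(4):
        upper.append(G[m])
        lower.append(METRIC[m] * G[m])
    for m in range(4):
        upper.append(1j * G[m] @ G5)
        lower.append(METRIC[m] * 1j * G[m] @ G5)
    for m, n in combinations(range(4), 2):
        sigma = 0.5 * (G[m] @ G[n] - G[n] @ G[m])
        upper.append(1j * sigma)
        lower.append(METRIC[m] * METRIC[n] * 1j * sigma)
    return upper, lower


def trace_gram(upper, lower):
    """Matrix of (1/4) tr(gamma^A gamma_B); (28.9) says it is the unit matrix."""
    return np.array([[np.trace(a @ b) / 4 for b in lower] for a in upper])


def fierz_sides(abar, b, cbar, d, upper, lower):
    """Left- and right-hand sides of (28.12) for c-number bispinors."""
    lhs = (abar @ d) * (cbar @ b)
    rhs = 0.25 * sum((abar @ u @ b) * (cbar @ l @ d) for u, l in zip(upper, lower))
    return lhs, rhs


rng = np.random.default_rng(0)
a, b, c, d = (rng.normal(size=4) + 1j * rng.normal(size=4) for _ in range(4))
abar, cbar = a.conj() @ G[0], c.conj() @ G[0]   # Dirac conjugates as row vectors
upper, lower = gamma_basis()
print(np.allclose(trace_gram(upper, lower), np.eye(16)))
print(np.allclose(*fierz_sides(abar, b, cbar, d, upper, lower)))
```

Because the identity is purely algebraic, it holds for arbitrary row and column vectors; the Dirac conjugates are used here only to mirror the notation of the text.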
We shall derive the relationship on which this reduction is based. If we take the set of four-rowed matrices

$$1,\quad \gamma^5,\quad \gamma^\mu,\quad i\gamma^\mu\gamma^5,\quad i\sigma^{\mu\nu}, \tag{28.8}$$

where 1 is the unit matrix, arrange these 16 (= 1 + 1 + 4 + 4 + 6) matrices in any definite order and denote them by $\gamma^A$ ($A=1,\ldots,16$), and also denote the same matrices with lowered 4-tensor indices ($\mu$, $\nu$) by $\gamma_A$, then they will have the following properties:

$$\operatorname{tr}\gamma^A=0\ \ (\gamma^A\neq1),\qquad \gamma^A\gamma_A=1\ \ \text{(no summation)},\qquad \tfrac14\operatorname{tr}\gamma^A\gamma_B=\delta^A_B. \tag{28.9}$$

† To avoid any misunderstanding, it should be mentioned that the transformations $T$ and $P$ also involve a change in the arguments of the functions; the right-hand (transformed) sides in (28.6) and (28.7) are respectively functions of $x_T=(-t,\mathbf r)$, $x_P=(t,-\mathbf r)$, when the left-hand sides are functions of $x=(t,\mathbf r)$.

The last of these shows that the matrices $\gamma^A$ are linearly independent. Since their number is equal to the number ($4\times4$) of elements of a four-rowed matrix, the matrices $\gamma^A$ form a complete set in terms of which an arbitrary four-rowed matrix $\Gamma$ may be expressed:

$$\Gamma=\sum_Ac_A\gamma^A,\qquad c_A=\tfrac14\operatorname{tr}\gamma_A\Gamma, \tag{28.10}$$

or, in explicit form, with matrix suffixes $i,k=1,2,3,4$,

$$\Gamma_{ik}=\tfrac14\sum_A\Gamma_{lm}\gamma_{A\,ml}\gamma^A_{ik}.$$

Assuming, in particular, that in the matrix $\Gamma$ only the element $\Gamma_{lm}$ is non-zero, we obtain the required relation (the "completeness condition"):

$$\delta_{il}\delta_{km}=\tfrac14\sum_A\gamma^A_{ik}\gamma_{A\,ml}. \tag{28.11}$$

Multiplying both sides of this equation by $\bar\psi^a_i\psi^b_k\bar\psi^c_m\psi^d_l$, we have

$$(\bar\psi^a\psi^d)(\bar\psi^c\psi^b)=\tfrac14\sum_A(\bar\psi^a\gamma^A\psi^b)(\bar\psi^c\gamma_A\psi^d). \tag{28.12}$$

This is one equation of the type mentioned above, reducing the product of two scalar bilinear forms to products of forms involving other pairs of factors.† Other equations of the same type may be obtained from (28.12) by the changes

$$\psi^b\to\gamma^B\psi^b,\qquad \psi^d\to\gamma^C\psi^d,$$

using the expansion

$$\gamma^A\gamma^B=\sum_Rc_R\gamma^R,\qquad c_R=\tfrac14\operatorname{tr}\gamma^A\gamma^B\gamma_R$$

(see the Problem).

Here we may also give, for future reference, the relation for two-rowed matrices which corresponds to (28.11). The complete set of linearly independent two-rowed matrices $\sigma^A$ ($A=1,2,3,4$) is

$$1,\quad \sigma_x,\quad \sigma_y,\quad \sigma_z.$$
(28.13)

These have the properties

$$\operatorname{tr}\sigma^A=0\ \ (\sigma^A\neq1),\qquad \tfrac12\operatorname{tr}\sigma^A\sigma^B=\delta_{AB}. \tag{28.14}$$

† It should be mentioned, to avoid any misunderstanding, that we are referring here to forms constructed from $\psi$-functions. The sign of the transformation would be opposite for forms constructed from the anticommuting $\psi$-operators.

The completeness condition is

$$\delta_{\alpha\gamma}\delta_{\beta\delta}=\tfrac12\sum_A\sigma^A_{\alpha\beta}\sigma^A_{\delta\gamma}=\tfrac12\boldsymbol\sigma_{\alpha\beta}\cdot\boldsymbol\sigma_{\delta\gamma}+\tfrac12\delta_{\alpha\beta}\delta_{\delta\gamma} \tag{28.15}$$

($\alpha,\beta,\gamma,\delta=1,2$), or

$$\boldsymbol\sigma_{\alpha\beta}\cdot\boldsymbol\sigma_{\delta\gamma}=-\tfrac12\boldsymbol\sigma_{\alpha\gamma}\cdot\boldsymbol\sigma_{\delta\beta}+\tfrac32\delta_{\alpha\gamma}\delta_{\delta\beta}. \tag{28.16}$$

PROBLEM

Derive formulae analogous to (28.12) for products of pairs of bilinear forms $P$, $V$, $A$, $T$.

SOLUTION. We use the notation

$$J_S=(\bar\psi^a\psi^b)(\bar\psi^c\psi^d),\qquad J_P=(\bar\psi^a\gamma^5\psi^b)(\bar\psi^c\gamma^5\psi^d),\qquad J_V=(\bar\psi^a\gamma^\mu\psi^b)(\bar\psi^c\gamma_\mu\psi^d),$$
$$J_A=(\bar\psi^ai\gamma^\mu\gamma^5\psi^b)(\bar\psi^ci\gamma_\mu\gamma^5\psi^d),\qquad J_T=(\bar\psi^ai\sigma^{\mu\nu}\psi^b)(\bar\psi^ci\sigma_{\mu\nu}\psi^d),$$

and the same symbols with primes to denote the products with $\psi^b$ and $\psi^d$ interchanged. The method used above gives

$$4J_S'=J_S+J_V+J_T+J_A+J_P,$$
$$4J_V'=4J_S-2J_V+2J_A-4J_P,$$
$$4J_T'=6J_S-2J_T+6J_P,$$
$$4J_A'=4J_S+2J_V-2J_A-4J_P,$$
$$4J_P'=J_S-J_V+J_T-J_A+J_P,$$

the first of these equations being the same as (28.12).

§ 29. The polarization density matrix

The dependence on the coordinates of the wave function $\psi$ which describes free motion with momentum $\mathbf p$ (a plane wave) reduces to a common factor $e^{i\mathbf p\cdot\mathbf r}$, and the amplitude $u_p$ acts as a spin wave function. In such a state (a pure state), the particle is completely polarized (see QM, §59). In the non-relativistic theory this means that the particle spin has a definite direction in space (more precisely, there exists a direction in which the spin component has the definite value $+\tfrac12$). In the relativistic theory, this description of a state in an arbitrary frame of reference is not possible, because, as already mentioned in §23, the spin vector is not conserved. The term "pure state" signifies only that the spin has a definite direction in the rest frame of the particle.

In a state of partial polarization, there is no definite amplitude, but only a polarization density matrix $\rho_{ik}$ ($i,k=1,2,3,4$ being bispinor indices).
We define this matrix in such a way that in a pure state it reduces to products:

$$\rho_{ik}=u_{pi}\bar u_{pk}. \tag{29.1}$$

Accordingly the matrix $\rho$ is normalized by the condition

$$\operatorname{tr}\rho=2m; \tag{29.2}$$

cf. (23.4).

In a pure state, the mean value of the spin is given by the quantity

$$\bar{\mathbf s}=\tfrac12\int\psi^*\boldsymbol\Sigma\psi\,d^3x=\frac{1}{4\varepsilon}u_p^*\boldsymbol\Sigma u_p=\frac{1}{4\varepsilon}\bar u_p\gamma^0\boldsymbol\Sigma u_p. \tag{29.3}$$

The corresponding expression for a state of partial polarization is

$$\bar{\mathbf s}=\frac{1}{4\varepsilon}\operatorname{tr}(\rho\gamma^0\boldsymbol\Sigma)=\frac{1}{4\varepsilon}\operatorname{tr}(\rho\gamma^5\boldsymbol\gamma). \tag{29.4}$$

The amplitudes $u_p$ and $\bar u_p$ satisfy the algebraic equations

$$(\gamma p-m)u_p=0,\qquad \bar u_p(\gamma p-m)=0.$$

The matrix (29.1) therefore satisfies the equations

$$(\gamma p-m)\rho=\rho(\gamma p-m)=0. \tag{29.5}$$

The density matrix must satisfy similar linear equations in the general case of a state which is mixed (with respect to the spin); cf. the analogous argument in QM, §14.

If we consider a free particle in its rest frame, the non-relativistic theory is applicable. In that theory the state of partial polarization is completely defined by three parameters, the components of the mean spin vector $\bar{\mathbf s}$ (see QM, §59). It is therefore clear that the same parameters will define the polarization state after any Lorentz transformation, i.e. for a moving particle.

Let twice the mean spin vector in the rest frame be denoted by $\boldsymbol\zeta$; in a pure state $|\boldsymbol\zeta|=1$, in a mixed state $|\boldsymbol\zeta|<1$. For a four-dimensional description of the polarization state it is convenient to define a 4-vector $a^\mu$ which in the rest frame is the same as the three-dimensional vector $\boldsymbol\zeta$; since $\boldsymbol\zeta$ is an axial vector, $a^\mu$ is a 4-pseudovector. This 4-vector is orthogonal to the 4-momentum in the rest frame (in which $a^\mu=(0,\boldsymbol\zeta)$, $p^\mu=(m,0)$); in any frame, we therefore have

$$a^\mu p_\mu=0. \tag{29.6}$$

In any frame, moreover,

$$a_\mu a^\mu=-\boldsymbol\zeta^2. \tag{29.7}$$

The components of the 4-vector $a^\mu$ in a frame in which the particle is moving with velocity $\mathbf v=\mathbf p/\varepsilon$ are found by a Lorentz transformation from the rest frame, and are

$$a^0=\frac{|\mathbf p|}{m}\zeta_\parallel,\qquad \mathbf a_\perp=\boldsymbol\zeta_\perp,\qquad a_\parallel=\frac{\varepsilon}{m}\zeta_\parallel,$$
(29.8)

where the suffixes $\parallel$ and $\perp$ denote the components of the vectors $\boldsymbol\zeta$ and $\mathbf a$ parallel and perpendicular to the direction of $\mathbf p$.† These formulae may be expressed in vector form:

$$\mathbf a=\boldsymbol\zeta+\frac{\mathbf p(\boldsymbol\zeta\cdot\mathbf p)}{m(\varepsilon+m)},\qquad a^0=\frac{\mathbf a\cdot\mathbf p}{\varepsilon}=\frac{\mathbf p\cdot\boldsymbol\zeta}{m},\qquad \mathbf a^2=\boldsymbol\zeta^2+\frac{(\mathbf p\cdot\boldsymbol\zeta)^2}{m^2}. \tag{29.9}$$

† As regards their transformation properties, the components of the mean spin vector $\bar{\mathbf s}$ (like those of any angular momentum) are, in relativistic mechanics, the space components of an antisymmetric tensor $S^{\lambda\mu}$. The 4-vector $a^\lambda$ is related to this tensor by the equations

$$a^\lambda=-\frac1m e^{\lambda\mu\nu\rho}S_{\mu\nu}p_\rho,\qquad S^{\lambda\mu}=\frac{1}{2m}e^{\lambda\mu\nu\rho}a_\nu p_\rho.$$

It must be emphasized that, in an arbitrary frame of reference, the spatial part $\mathbf a$ of the 4-vector $a^\lambda$ is not the same as the vector $2\bar{\mathbf s}$: we easily see that

$$2\bar s_\parallel=\frac1m\left(a_\parallel\varepsilon-a^0|\mathbf p|\right)=\zeta_\parallel.$$

Let us first consider the unpolarized state ($\boldsymbol\zeta=0$). The density matrix in this case can contain as parameters only the 4-momentum $p$. The only form for such a matrix which satisfies the equations (29.5) is

$$\rho=\tfrac12(\gamma p+m) \tag{29.10}$$

(I. E. Tamm, 1930; H. B. G. Casimir, 1933). The constant coefficient is chosen in accordance with the normalization condition (29.2).

In the general case of partial polarization ($\boldsymbol\zeta\neq0$), we seek the density matrix in the form

$$\rho=\frac{1}{4m}(\gamma p+m)\rho'(\gamma p+m), \tag{29.11}$$

which necessarily satisfies the equations (29.5). When $\boldsymbol\zeta=0$, the auxiliary matrix $\rho'$ must become a unit matrix; since $(\gamma p+m)^2=2m(\gamma p+m)$, (29.11) is then the same as (29.10). The matrix $\rho'$ must also contain the 4-vector $a$ linearly as a parameter, i.e. must be of the form

$$\rho'=1-A\gamma^5(\gamma a); \tag{29.12}$$

the second term includes the "scalar" product of the pseudovector $a$ and the "matrix 4-pseudovector" $\gamma^5\gamma$. To determine the coefficient $A$, we write the density matrix in the rest frame:

$$\rho=\tfrac14m(1+\gamma^0)(1+A\gamma^5\boldsymbol\gamma\cdot\boldsymbol\zeta)(1+\gamma^0)=\tfrac12m(1+\gamma^0)(1+A\gamma^5\boldsymbol\gamma\cdot\boldsymbol\zeta)$$

and calculate the mean spin by (29.4).
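Before the coefficient is fixed analytically, the ansatz (29.11), (29.12) can be tried numerically for a moving particle. The sketch below is illustrative only: it assumes the standard representation, the convention $\gamma^5=-i\gamma^0\gamma^1\gamma^2\gamma^3$, and the value $A=1$; the sample momentum, mass and polarization are arbitrary, and the function names are hypothetical. It checks the normalization (29.2), the equations (29.5), the relations (29.6), (29.7), and recovers $a^\mu$ from the trace $\tfrac{1}{2m}\operatorname{tr}(\rho\gamma^5\gamma^\mu)$.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
# Standard-representation gamma matrices (assumed convention)
G = [np.block([[I2, Z2], [Z2, -I2]])] + [np.block([[Z2, s], [-s, Z2]]) for s in SIGMA]
G5 = -1j * G[0] @ G[1] @ G[2] @ G[3]


def slash(v):
    """gamma^mu v_mu for contravariant components v = (v^0, v^1, v^2, v^3)."""
    return v[0] * G[0] - sum(v[k] * G[k] for k in (1, 2, 3))


def four_momentum(p, m):
    """On-shell 4-momentum (eps, p) for 3-momentum p and mass m."""
    p = np.asarray(p, dtype=float)
    return np.concatenate(([np.sqrt(m**2 + p @ p)], p))


def polarization_four_vector(p, zeta, m):
    """4-vector a^mu of (29.9) built from the rest-frame vector zeta."""
    p, zeta = np.asarray(p, dtype=float), np.asarray(zeta, dtype=float)
    eps = np.sqrt(m**2 + p @ p)
    a_space = zeta + p * (zeta @ p) / (m * (eps + m))
    return np.concatenate(([p @ zeta / m], a_space))


def density_matrix(p, zeta, m, A=1.0):
    """Ansatz (29.11) with rho' = 1 - A gamma^5 (gamma a) from (29.12)."""
    P = slash(four_momentum(p, m)) + m * np.eye(4)
    rho_prime = np.eye(4) - A * G5 @ slash(polarization_four_vector(p, zeta, m))
    return P @ rho_prime @ P / (4 * m)


def minkowski(u, v):
    """Scalar product u^mu v_mu with signature (+, -, -, -)."""
    return u[0] * v[0] - u[1:] @ v[1:]


m, p, zeta = 1.0, [0.3, -1.2, 0.5], [0.2, 0.1, -0.4]
rho = density_matrix(p, zeta, m)
a = polarization_four_vector(p, zeta, m)
a_from_trace = np.array([np.trace(rho @ G5 @ g).real / (2 * m) for g in G])

print(np.isclose(np.trace(rho), 2 * m))                              # (29.2)
print(np.allclose((slash(four_momentum(p, m)) - m * np.eye(4)) @ rho, 0))  # (29.5)
print(np.isclose(minkowski(a, four_momentum(p, m)), 0))              # (29.6)
print(np.isclose(minkowski(a, a), -np.dot(zeta, zeta)))              # (29.7)
print(np.allclose(a_from_trace, a))
```

Changing the argument `A` away from 1 leaves (29.2) and (29.5) intact but rescales the recovered 4-vector, which is why the coefficient has to be fixed by the rest-frame spin.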
Using the rules given in §22, we easily find that the only non-zero term in the trace is

$$2\bar{\mathbf s}=\frac{1}{2m}\operatorname{tr}(\rho\gamma^5\boldsymbol\gamma)=-\tfrac14A\operatorname{tr}[(\boldsymbol\gamma\cdot\boldsymbol\zeta)\boldsymbol\gamma]=A\boldsymbol\zeta.$$

Equating this to $\boldsymbol\zeta$, we have $A=1$.

The final expression for $\rho$ is obtained by substituting (29.12) in (29.11) and interchanging the factors $\rho'$ and $\gamma p+m$; since $a$ and $p$ are orthogonal, the product $\gamma p$ anticommutes with $\gamma a$:

$$(\gamma a)(\gamma p)=2ap-(\gamma p)(\gamma a)=-(\gamma p)(\gamma a),$$

and therefore commutes with $\gamma^5(\gamma a)$. Thus the density matrix of a partially polarized electron is given by the expression

$$\rho=\tfrac12(\gamma p+m)[1-\gamma^5(\gamma a)] \tag{29.13}$$

(L. Michel and A. S. Wightman, 1955).

If the matrix $\rho$ is known, the 4-vector $a$ which describes the state can be found from

$$a^\mu=\frac{1}{2m}\operatorname{tr}(\rho\gamma^5\gamma^\mu), \tag{29.14}$$

and the vector $\boldsymbol\zeta$ is therefore also known.

The formulae for the density matrix of the positron are entirely analogous to those of the electron. If the positron (with 4-momentum $p$) were described by a positron amplitude $u_p^{(\mathrm{pos})}$ and by a density matrix $\rho^{(\mathrm{pos})}$ defined on the basis of this amplitude, then there would be no difference from the case of the electron, and the matrix $\rho^{(\mathrm{pos})}$ would be given by the same formula (29.13). But, in actual calculations of cross-sections of scattering processes involving positrons, it is necessary (as we shall see below) to deal not with $u_p^{(\mathrm{pos})}$ but with the "negative frequency" amplitudes $u_{-p}$. Accordingly, the polarization density matrix (which we denote by $\rho^{(-)}$) must be defined so as to reduce to $u_{-p,i}\bar u_{-p,k}$ for a pure state.

According to (26.1), the positron amplitude is $u_p^{(\mathrm{pos})}=U_C\bar u_{-p}$. Conversely,

$$u_{-p}=U_C\bar u_p^{(\mathrm{pos})},\qquad \bar u_{-p}=U_C^{+}u_p^{(\mathrm{pos})}=u_p^{(\mathrm{pos})}U_C^{*};$$

cf. (28.3). If

$$\rho^{(-)}_{ik}=u_{-p,i}\bar u_{-p,k},\qquad \rho^{(\mathrm{pos})}_{ik}=u^{(\mathrm{pos})}_{p,i}\bar u^{(\mathrm{pos})}_{p,k},$$

then these formulae give

$$\rho^{(-)}=U_C\tilde\rho^{(\mathrm{pos})}U_C^{*}. \tag{29.15}$$

Substituting for $\rho^{(\mathrm{pos})}$ the expression (29.13), we obtain, after some simple rearrangements using (26.3) and (26.21),

$$\rho^{(-)}=\tfrac12(\gamma p-m)[1-\gamma^5(\gamma a)]. \tag{29.16}$$

In particular, for an unpolarized state

$$\rho^{(-)}=\tfrac12(\gamma p-m).$$
(29.17)

In referring to positron density matrices below, we shall intend the matrices $\rho^{(-)}$, and the index $(-)$ will be omitted; the matrices $\rho^{(\mathrm{pos})}$ will not be needed in practice.

In various calculations it will often be necessary to average over spin states an expression such as $\bar uFu$ ($=\bar u_iF_{ik}u_k$), where $F$ is a (four-rowed) matrix and $u$ is the bispinor amplitude of a state having a definite 4-momentum $p$. This averaging is equivalent to replacing the products $u_k\bar u_i$ by the density matrix $\rho_{ki}$ of a partially polarized state. In particular, complete averaging over two independent spin states is equivalent to changing to an unpolarized state, and, by (29.10), we have

$$\tfrac12\sum_{\text{polar.}}\bar u_pFu_p=\tfrac12\operatorname{tr}(\gamma p+m)F. \tag{29.18}$$

Similarly, for wave functions with negative frequency,

$$\tfrac12\sum_{\text{polar.}}\bar u_{-p}Fu_{-p}=\tfrac12\operatorname{tr}(\gamma p-m)F. \tag{29.19}$$

If summation over spin states replaces averaging, the result is doubled.

Let us see how the density matrix (29.13) tends to its non-relativistic limit. To do so, we use the rest frame of the electron. In the standard representation of the wave functions, the amplitudes $u_p$ in this frame have two components in the limit, and the density matrix must accordingly have two rows. For, in the rest frame, where $a=(0,\boldsymbol\zeta)$,

$$\rho=\tfrac12m(\gamma^0+1)[1-\gamma^5(\gamma a)],$$

and we find from the expressions (21.20) and (22.18) for the matrices $\gamma$

$$\rho=\begin{pmatrix}\rho_{\text{non-r}}&0\\0&0\end{pmatrix},\qquad \rho_{\text{non-r}}=m(1+\boldsymbol\sigma\cdot\boldsymbol\zeta), \tag{29.20}$$

the zeros denoting two-rowed null matrices. If we use the normalization of the density matrix to unity ($\operatorname{tr}\rho_{\text{non-r}}=1$), as is customary in the non-relativistic theory, instead of the normalization to $2m$, then the above expression must be divided by $2m$, giving $\tfrac12(1+\boldsymbol\sigma\cdot\boldsymbol\zeta)$, in agreement with QM, (59.4), (59.5).

Similarly, the non-relativistic limit for the positron density matrix is

$$\rho=\begin{pmatrix}0&0\\0&\rho_{\text{non-r}}\end{pmatrix},\qquad \rho_{\text{non-r}}=-m(1-\boldsymbol\sigma\cdot\boldsymbol\zeta).$$

Finally, there is a simpler expression for the density matrix in the ultra-relativistic case.
Putting in (29.8) $|\mathbf p|\approx\varepsilon$ (and thereby neglecting small quantities of order $(m/\varepsilon)^2$), substituting these expressions in (29.13) or (29.16), and taking the direction of $\mathbf p$ as the $x$-axis, we can write

$$\rho=\tfrac12[\varepsilon(\gamma^0-\gamma^1)\pm m]\left[1-\gamma^5\left(\frac{\varepsilon}{m}(\gamma^0-\gamma^1)\zeta_\parallel-\boldsymbol\zeta_\perp\cdot\boldsymbol\gamma_\perp\right)\right],$$

where the upper sign refers to the electron and the lower sign to the positron, and the last term in the inner brackets is the three-dimensional scalar product. When the product is expanded, the leading terms cancel (since $(\gamma^0-\gamma^1)^2=0$), and those of the next order give

$$\rho=\tfrac12\varepsilon(\gamma^0-\gamma^1)[1+\gamma^5(\pm\zeta_\parallel+\boldsymbol\zeta_\perp\cdot\boldsymbol\gamma_\perp)]$$

or, again writing $\varepsilon(\gamma^0-\gamma^1)$ in the form $\gamma p$,

$$\rho=\tfrac12(\gamma p)[1+\gamma^5(\pm\zeta_\parallel+\boldsymbol\zeta_\perp\cdot\boldsymbol\gamma_\perp)]. \tag{29.21}$$

This is the required expression for the density matrix in the ultra-relativistic case. It should be noticed that all the components of the polarization vector $\boldsymbol\zeta$ appear in this expression equivalently, as terms of the same order of magnitude. It will be recalled that $\zeta_\parallel$ is the component of this vector parallel (if $\zeta_\parallel>0$) or antiparallel (if $\zeta_\parallel<0$) to the particle momentum. In particular, for a helicity state of the particle, $\zeta_\parallel=2\lambda=\pm1$ and $\boldsymbol\zeta_\perp=0$; the density matrix then has an especially simple form,

$$\rho=\tfrac12(\gamma p)(1\pm2\lambda\gamma^5), \tag{29.22}$$

which is, as it should be, the same as the neutrino or antineutrino density matrix, these being particles with zero mass and definite helicity (see §30).

§ 30. Neutrinos

We have seen in §20 that the necessity of two spinors ($\xi$ and $\eta$) to describe a particle with spin $\tfrac12$ is due to the mass of the particle. This necessity disappears if the mass is zero. The wave equation which describes such a particle can be derived from a single spinor, say the dotted spinor $\eta$:

$$p^{\alpha\dot\beta}\eta_{\dot\beta}=0, \tag{30.1}$$

or, equivalently,

$$(p_0+\mathbf p\cdot\boldsymbol\sigma)\eta=0. \tag{30.2}$$

It has also been noted in §20 that the wave equation containing the mass $m$ is necessarily symmetrical with respect to inversion (the transformation (20.4)). When the particle is described by a single spinor, this symmetry is lost, but it is not needed, since symmetry with respect to inversion is not a universal property of Nature.
The energy and momentum of a particle with $m=0$ are related by $\varepsilon=|\mathbf p|$. For a plane wave ($\eta_p\sim e^{-ipx}$), equation (30.2) therefore gives

$$(\mathbf n\cdot\boldsymbol\sigma)\eta_p=-\eta_p, \tag{30.3}$$

where $\mathbf n$ is a unit vector in the direction of $\mathbf p$. A similar equation,

$$(\mathbf n\cdot\boldsymbol\sigma)\eta_{-p}=-\eta_{-p}, \tag{30.4}$$

applies to a wave with "negative frequency" ($\eta_{-p}\sim e^{ipx}$).

The second-quantized $\psi$-operator is

$$\hat\eta=\sum_{\mathbf p}\left(\eta_p\hat a_{\mathbf p}+\eta_{-p}\hat b^{+}_{\mathbf p}\right),\qquad \hat\eta^{+}=\sum_{\mathbf p}\left(\eta_p^{*}\hat a^{+}_{\mathbf p}+\eta_{-p}^{*}\hat b_{\mathbf p}\right). \tag{30.5}$$

Hence it follows, as usual, that $\eta^*_{-p}$ are the wave functions of the antiparticle.

The definition (20.1) of the operators $p^{\alpha\dot\beta}$ shows that $p^{\alpha\dot\beta*}=-p^{\beta\dot\alpha}$. The complex conjugate spinor $\eta^*$ therefore satisfies the equations $p^{\beta\dot\alpha}\eta^{*}_{\dot\beta}=0$, or, equivalently, $p_{\dot\alpha\beta}\eta^{\dot\beta*}=0$. We write $\eta^{\dot\beta*}=\xi^\beta$, thus expressing the fact that complex conjugation changes the dotted to the undotted spinor. The wave function of the antiparticle then satisfies the equation

$$p_{\dot\alpha\beta}\xi^\beta=0, \tag{30.6}$$

or

$$(p_0-\mathbf p\cdot\boldsymbol\sigma)\xi=0. \tag{30.7}$$

Hence, for a plane wave,

$$(\mathbf n\cdot\boldsymbol\sigma)\xi_p=\xi_p. \tag{30.8}$$

But $\tfrac12\mathbf n\cdot\boldsymbol\sigma$ is the operator which projects the spin on the direction of motion. Equations (30.3) and (30.8) therefore signify that states of the particle with a definite momentum are necessarily helicity states, for which the spin component in the direction of motion has a definite value. If the particle spin is opposite to the momentum (helicity $-\tfrac12$), then the antiparticle spin is along the momentum (helicity $+\tfrac12$).

The neutrinos which occur in Nature appear to be particles possessing these properties. The particle with helicity $-\tfrac12$ is conventionally called the neutrino, and that with helicity $+\tfrac12$ the antineutrino.†

In connection with the fact that the neutrino states are not degenerate with respect to spin directions, reference should be made to the comment in §8 that a massless particle has only axial symmetry about the direction of the momentum. For a strictly neutral particle (the photon), this symmetry includes both rotations about the axis and reflections in planes passing through the axis.
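The helicity statements (30.3) and (30.8) can be illustrated with two-component spinors. The following is a small sketch, not part of the original derivation: for an arbitrarily chosen momentum it takes the two eigenvectors of $\mathbf n\cdot\boldsymbol\sigma$ and checks that the one with eigenvalue $-1$ solves (30.2) and the one with eigenvalue $+1$ solves (30.7). The function names are hypothetical.

```python
import numpy as np

SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def sigma_dot(v):
    """Return the 2x2 matrix v . sigma for a real 3-vector v."""
    return sum(c * s for c, s in zip(v, SIGMA))


def helicity_spinors(p):
    """Eigenvectors of n.sigma (n = p/|p|) with eigenvalues -1 and +1, in that order."""
    n = np.asarray(p, dtype=float) / np.linalg.norm(p)
    _, vecs = np.linalg.eigh(sigma_dot(n))  # eigh sorts eigenvalues in ascending order
    return vecs[:, 0], vecs[:, 1]


def helicity(spinor, p):
    """Mean value of (1/2) n.sigma in the given spinor."""
    n = np.asarray(p, dtype=float) / np.linalg.norm(p)
    num = np.real(spinor.conj() @ sigma_dot(n) @ spinor)
    return 0.5 * num / np.real(spinor.conj() @ spinor)


p = np.array([0.4, -0.7, 1.1])
eps = np.linalg.norm(p)            # massless particle: energy equals |p|
eta, xi = helicity_spinors(p)

print(np.allclose((eps * np.eye(2) + sigma_dot(p)) @ eta, 0))  # equation (30.2)
print(np.allclose((eps * np.eye(2) - sigma_dot(p)) @ xi, 0))   # equation (30.7)
print(helicity(eta, p), helicity(xi, p))
```

The two spinors are orthogonal, which reflects the absence of any spin degeneracy for a given momentum: each equation admits exactly one of them.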
For the neutrino, there is no reflection symmetry, leaving only the group of rotations about the axis, which conserve the angular momentum component along the axis and do not change its sign. The symmetry with respect to reflections exists only if the particle is at the same time replaced by the antiparticle.

It should also be noted that the necessarily longitudinal polarization signifies that the spin of the neutrino cannot be distinguished from its orbital angular momentum, just as with the necessarily transverse photon fields (§6).

From one spinor $\eta$ (or $\xi$), only four bilinear combinations can be constructed, which together form the 4-vector

$$j^\mu = (\eta^*\eta,\ \eta^*\boldsymbol{\sigma}\eta). \tag{30.9}$$

It is easily verified that the equations

$$(p_0 + \mathbf{p}\cdot\boldsymbol{\sigma})\eta = 0, \qquad \eta^*(p_0 - \mathbf{p}\cdot\boldsymbol{\sigma}) = 0$$

imply the continuity equation $\partial_\mu j^\mu = 0$, so that $j^\mu$ acts as a particle current density 4-vector.

It is convenient to normalize neutrino plane waves in a manner similar to that used in §23 for particles possessing mass:

$$\eta_p = \frac{1}{\sqrt{2\varepsilon}}\,u_p e^{-ipx}, \qquad \eta_{-p} = \frac{1}{\sqrt{2\varepsilon}}\,u_{-p} e^{ipx}, \tag{30.10}$$

the spinor amplitudes being normalized by the invariant condition

$$u^*_{\pm p}(1, \boldsymbol{\sigma})u_{\pm p} = 2(\varepsilon, \mathbf{p}). \tag{30.11}$$

The particle density and particle current density are then $j^0 = 1$, $\mathbf{j} = \mathbf{p}/\varepsilon = \mathbf{n}$.

† The existence of neutrinos was theoretically predicted by W. Pauli (1931) in order to explain the properties of β-decay. Equation (30.1) was first discussed by H. Weyl (1929). The neutrino theory based on these equations was evolved by L. D. Landau, T. D. Lee and C. N. Yang, and A. Salam (1957).

Since a free neutrino with a given momentum is always completely polarized, there are no states which are mixed (with respect to the spin). It may nevertheless be convenient to define a two-rowed polarization "density matrix" simply as the spinor of rank two

$$\rho_{\alpha\beta} = u_\alpha u^*_\beta, \tag{30.12}$$

for which $\operatorname{tr}\rho = 2\varepsilon$. An expression for this matrix can be obtained by noting that it must satisfy the equations $(\varepsilon + \mathbf{p}\cdot\boldsymbol{\sigma})\rho = \rho(\varepsilon + \mathbf{p}\cdot\boldsymbol{\sigma}) = 0$. Hence we have

$$\rho = \varepsilon - \mathbf{p}\cdot\boldsymbol{\sigma}.$$
(30.13)

In the consideration of various interaction processes, neutrinos may appear together with other particles having spin 1/2 and possessing mass, which are therefore described by four-component wave functions. In such cases it is convenient to retain uniformity of notation by formally defining for the neutrino also a "bispinor" wave function having two components zero:

$$\psi = \begin{pmatrix} 0 \\ \eta \end{pmatrix}.$$

But this form of $\psi$ is in general not preserved in another (non-spinor) representation. This difficulty can be overcome by noting that in the spinor representation we have identically

$$\tfrac{1}{2}(1+\gamma^5)\begin{pmatrix} \xi \\ \eta \end{pmatrix} = \begin{pmatrix} 0 \\ \eta \end{pmatrix},$$

where $\xi$ is an arbitrary "makeweight" spinor which does not appear in the final result; the matrix $\gamma^5$ is given by (22.18). The condition for a true "two-component" neutrino will therefore be maintained when it is described by a four-component $\psi$ in any representation, if $\psi$ is taken to be the solution of Dirac's equation with $m = 0$:

$$(\gamma p)\psi = 0, \tag{30.14}$$

subject to the additional condition

$$\tfrac{1}{2}(1+\gamma^5)\psi = \psi, \quad\text{or}\quad \gamma^5\psi = \psi. \tag{30.15}$$

This condition may be taken into account by replacing $\psi$ and $\bar\psi$, wherever they would occur, by the following expressions:

$$\psi \to \tfrac{1}{2}(1+\gamma^5)\psi, \qquad \bar\psi \to \tfrac{1}{2}\bar\psi(1-\gamma^5). \tag{30.16}$$

For example, using these expressions in $\bar\psi\gamma^\mu\psi$, the current density 4-vector may be written in the form

$$j^\mu = \tfrac{1}{4}\bar\psi(1-\gamma^5)\gamma^\mu(1+\gamma^5)\psi = \tfrac{1}{2}\bar\psi\gamma^\mu(1+\gamma^5)\psi. \tag{30.17}$$

In the same way, the four-rowed neutrino density matrix becomes

$$\rho = \tfrac{1}{4}(1+\gamma^5)(\gamma p)(1-\gamma^5) = \tfrac{1}{2}(1+\gamma^5)(\gamma p). \tag{30.18}$$

In the spinor representation it reduces, as it should, to the two-rowed matrix (30.13),

$$\rho = \begin{pmatrix} 0 & 0 \\ \varepsilon - \boldsymbol{\sigma}\cdot\mathbf{p} & 0 \end{pmatrix}.$$

The corresponding formulae for the antineutrino differ from those given above by a change of the sign of $\gamma^5$.

The neutrino is an electrically neutral particle, but, with the properties described above, it is not a strictly neutral particle.
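The reduction of (30.18) to the two-rowed matrix (30.13) can be checked numerically. The sketch below is only an illustration: it assumes the spinor representation $\gamma^0 = \begin{pmatrix}0&1\\1&0\end{pmatrix}$, $\boldsymbol\gamma = \begin{pmatrix}0&-\boldsymbol\sigma\\ \boldsymbol\sigma&0\end{pmatrix}$ with $\gamma^5 = -i\gamma^0\gamma^1\gamma^2\gamma^3$, and the momentum components and the names `gamma_dot`, `rho4`, `rho2` are arbitrary choices made here, not notation from the text.

```python
import numpy as np

# Pauli matrices and 2x2 helpers
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]

# Gamma matrices in the spinor representation (assumed convention, see lead-in)
g0 = np.block([[Z2, I2], [I2, Z2]])
g = [np.block([[Z2, -s], [s, Z2]]) for s in sigma]
g5 = -1j * g0 @ g[0] @ g[1] @ g[2]
I4 = np.eye(4, dtype=complex)


def gamma_dot(p0, p):
    """Return gamma p = gamma^0 p0 - gamma . p for a 4-momentum (p0, p)."""
    return p0 * g0 - sum(pi * gi for pi, gi in zip(p, g))


# Arbitrary massless momentum: epsilon = |p|
p = np.array([0.3, -1.2, 0.5])
eps = np.linalg.norm(p)
p_sigma = sum(pi * si for pi, si in zip(p, sigma))

# Four-rowed density matrix (30.18), in both of its forms
rho4 = 0.5 * (I4 + g5) @ gamma_dot(eps, p)
rho4_sym = 0.25 * (I4 + g5) @ gamma_dot(eps, p) @ (I4 - g5)

# Two-rowed density matrix (30.13)
rho2 = eps * I2 - p_sigma

# Projector identity behind the current (30.17), checked for each gamma^mu
current_ok = all(
    np.allclose(0.25 * (I4 - g5) @ gm @ (I4 + g5), 0.5 * gm @ (I4 + g5))
    for gm in [g0] + g
)
```

With these conventions only the lower-left 2×2 block of `rho4` is non-zero, and it coincides with `rho2`.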
Here it must be noted that the "neutrino field" described by a two-component spinor is equivalent, as regards the number of possible particle states (but not, of course, as regards its other physical properties), to a strictly neutral field described by a four-component bispinor. Instead of states of particles and antiparticles with definite helicity, we should here have the same number of states of one particle with two possible values of the helicity, and the property of symmetry with respect to inversion would automatically occur. However, the zero mass of the "four-component" neutrino would be, so to speak, "accidental", since it would be unrelated to the symmetry properties of the wave equation describing the neutrino (which allows also a non-zero mass). The various interactions of such a particle would therefore necessarily imply the existence of a small but not zero rest mass.

§31. The wave equation for a particle with spin 3/2

A particle with spin 3/2 is described, in its rest frame, by a three-dimensional symmetrical spinor of rank three (having $2s + 1 = 4$ independent components). Correspondingly, in an arbitrary frame of reference, its description may involve the 4-spinors $\xi^{\alpha\dot\beta\dot\gamma}$, $\eta^{\dot\alpha\beta\gamma}$ and $\zeta^{\alpha\beta\gamma}$, $\chi^{\dot\alpha\dot\beta\dot\gamma}$, each of which is symmetrical in all the indices of one kind (dotted or undotted). The spinors in each pair are interchanged by inversion.

In order that the 4-spinors $\xi^{\alpha\dot\beta\dot\gamma}$ and $\eta^{\dot\alpha\beta\gamma}$ should become in the rest frame 3-spinors symmetrical in all three indices, they must satisfy the conditions

$$p_{\alpha\dot\beta}\,\xi^{\alpha\dot\beta\dot\gamma} = 0, \qquad p_{\dot\alpha\beta}\,\eta^{\dot\alpha\beta\gamma} = 0: \tag{31.1}$$

in the rest frame we have

$$p_{\alpha\dot\beta} \to p_0\,\delta^\beta_\alpha = m\,\delta^\beta_\alpha,$$

as is seen from (20.1), and the conditions (31.1) therefore imply the equations

$$\delta^\beta_\alpha\,\eta'^{\alpha}{}_{\beta}{}^{\gamma} = 0, \qquad \delta^\beta_\alpha\,\xi'^{\alpha}{}_{\beta}{}^{\gamma} = 0,$$

where the primed letters denote the corresponding three-dimensional spinors. Thus these spinors give zero on contraction with respect to the indices $\alpha$, $\beta$, which means that they are symmetrical in these two indices, and hence in all three.
The differential relations between the spinors $\xi$ and $\eta$ form the equations (31.2). The conditions (31.1) ensure that the left-hand sides of (31.2) vanish on contraction with respect to any other pair of indices, and hence that these quantities are symmetrical in the remaining pair of indices of one kind. In the rest frame, the three-dimensional spinors $\xi'$ and $\eta'$ are the same according to (31.2).

On eliminating $\eta$ or $\xi$ from (31.2), we find that each component of the spinors $\xi$ and $\eta$ satisfies the second-order equation

$$(p^2 - m^2)\,\xi^{\alpha\dot\beta\dot\gamma} = 0. \tag{31.3}$$

The equations (31.1), (31.2) form a complete set of wave equations for a particle with spin 3/2.† No further results would be obtained by using the spinors $\zeta$ and $\chi$. Their structure is as follows:

$$m\,\zeta^{\alpha\beta\gamma} = p^{\alpha\dot\delta}\,\eta_{\dot\delta}{}^{\beta\gamma}, \qquad m\,\chi^{\dot\alpha\dot\beta\dot\gamma} = p^{\dot\alpha\delta}\,\xi_{\delta}{}^{\dot\beta\dot\gamma}.$$

† See the paper by Fierz and Pauli, cited in §15, concerning the Lagrangian formulation of these equations.

The equations of particles with spin 3/2 can also be put in a different form, using the vectorial properties of spinors (W. Rarita and J. Schwinger, 1941; A. S. Davydov and I. E. Tamm, 1942). One four-dimensional vector index $\mu$ is assigned to a pair of spinor indices $\alpha\dot\beta$. Thus the components $\xi^{\alpha\dot\beta\dot\gamma}$ of the spinor of rank three can be put into correspondence with the components of the "mixed" quantities $\psi_\mu^{\dot\gamma}$, which have one vector and one spinor index. Similarly, the spinor $\eta^{\dot\alpha\beta\gamma}$ is correlated with $\psi_\mu^{\gamma}$, and the two spinors together correspond to a "vector" bispinor $\psi_\mu$ (omitting the bispinor index). The wave equation then becomes a "Dirac equation" for each of the vector components $\psi_\mu$:

$$(\gamma p - m)\psi_\mu = 0, \tag{31.4}$$

with the added condition

$$\gamma^\mu\psi_\mu = 0. \tag{31.5}$$

Using the expressions for the matrices $\gamma^\mu$ in the spinor representation and the relations (18.6), (18.7) between the spinor and vector components, we can easily verify that equation (31.4) implies equations (31.2), and that the condition (31.5) is equivalent to the condition for the spinors $\xi^{\alpha\dot\beta\dot\gamma}$ and $\eta^{\dot\alpha\beta\gamma}$ to be symmetrical in the indices $\dot\beta\dot\gamma$ or $\beta\gamma$.
Multiplying (31.4) by $\gamma^\mu$ and using (31.5), we find

$$\gamma^\mu\gamma^\nu p_\nu\psi_\mu = 0,$$

or, with the commutation rules for the matrices $\gamma^\mu$,

$$2g^{\mu\nu}p_\nu\psi_\mu - \gamma^\nu p_\nu\gamma^\mu\psi_\mu = 0. \tag{31.6}$$

The second term in turn is zero by (31.5), and the first term gives

$$p^\mu\psi_\mu = 0. \tag{31.7}$$

This condition, which is implied by (31.4), (31.5), is easily seen to be equivalent to the conditions (31.1).

Finally, yet another way of expressing the wave equation is to use quantities $\psi_{ikl}$ ($i, k, l = 1, 2, 3, 4$) with three bispinor indices, in which they are symmetrical (V. Bargmann and E. P. Wigner, 1948). The set of these quantities is equivalent to the components of all four spinors $\xi$, $\eta$, $\zeta$, $\chi$. The wave equation becomes a set of "Dirac equations"

$$p_\mu\gamma^\mu_{im}\psi_{mkl} = m\psi_{ikl}. \tag{31.8}$$

We easily see that these equations yield the necessary number (four) of independent components $\psi_{ikl}$, and there is therefore no need to impose any further conditions. In the rest frame, (31.8) becomes

$$\gamma^0_{im}\psi_{mkl} = \psi_{ikl},$$

according to which all the components with $i, k, l = 3, 4$ are zero (in the standard representation), i.e. the $\psi_{ikl}$ reduce to the components of a three-dimensional spinor of rank three.

The results given above have an obvious generalization for particles with any half-integral spin $s$. In the description by equations of the form (31.4), (31.5), the wave function is a symmetrical 4-tensor of rank $\tfrac{1}{2}(2s - 1)$ with one bispinor index. In the description by equations of the form (31.8), the wave function has $2s$ bispinor indices and is symmetrical in these.

CHAPTER IV

PARTICLES IN AN EXTERNAL FIELD

§32. Dirac's equation for an electron in an external field

The wave equations of free particles express essentially only those properties which depend on the general requirements of space-time symmetry. Physical processes involving the particles, however, depend on their interaction properties.
The description of the electromagnetic interactions of particles in relativistic quantum theory can be effected by a generalization of the method used in classical theory and in non-relativistic quantum theory. This method, however, is applicable to the description of electromagnetic interactions only of particles that are not capable of strong interactions. These include electrons (and positrons), and the very wide domain of electron quantum electrodynamics is therefore accessible to the existing theory. There are also unstable particles, the muons, which are not capable of strong interactions; they are described by the same quantum electrodynamics as regards phenomena occurring in times short in comparison with their lifetime (with respect to weak interactions).

In this chapter we shall discuss problems of quantum electrodynamics which fall within the scope of single-particle theory. These are problems in which the number of particles is unchanged, and the interaction can be represented in terms of an external electromagnetic field. Besides the conditions which ensure that the external field may be regarded as given, there are conditions arising from "radiative corrections" which also impose limits on the validity of such a theory.

The wave equation of an electron in a given external field can be derived in the same way as in the non-relativistic theory (QM, §111). Let $A^\mu = (\Phi, \mathbf{A})$ be the 4-potential of the external electromagnetic field ($\mathbf{A}$ being the vector potential and $\Phi$ the scalar potential). We obtain the desired equation on replacing the 4-momentum operator $p$ in Dirac's equation by $p - eA$, where $e$ is the charge on the particle:†

$$[\gamma(p - eA) - m]\psi = 0. \tag{32.1}$$

The corresponding Hamiltonian is found by making the same change in (21.13):

$$H = \boldsymbol{\alpha}\cdot(\mathbf{p} - e\mathbf{A}) + \beta m + e\Phi.$$
(32.2)

The invariance of Dirac's equation under gauge transformations of the electromagnetic field potential is shown by the fact that it is unchanged in form if the transformation $A_\mu \to A_\mu + ip_\mu\chi$ (where $\chi$ is an arbitrary function) is accompanied by the following transformation of the wave function:‡

$$\psi \to \psi e^{ie\chi}; \tag{32.3}$$

cf. the corresponding transformation for Schrödinger's equation in QM, §111.

† The charge together with its sign is meant, so that for the electron $e = -|e|$.

The current density is expressed in terms of the wave function by the same formula (21.11), $j = \bar\psi\gamma\psi$, as when there is no external field. It is easily seen that, when the calculations used in deriving (21.11) are repeated with the equation (32.1) (and the equation (32.4) below), the external field disappears, and the continuity equation contains the same expression for the current as previously.

Let us apply the operation of charge conjugation to equation (32.1). To do so, we write the equation

$$\bar\psi[\gamma(p + eA) + m] = 0, \tag{32.4}$$

which is obtained as the complex conjugate of (32.1), in the same way as equation (21.9) was derived in Chapter III, and using the fact that the 4-vector $A$ is real. Putting this equation in the form

$$[\tilde\gamma(p + eA) + m]\bar\psi = 0,$$

multiplying it on the left by the matrix $U_C$ and using the relations (26.3), we find

$$[\gamma(p + eA) - m](C\psi) = 0. \tag{32.5}$$

Thus the charge-conjugate wave function satisfies an equation which differs from the original equation by a change in the sign of the charge. The operation of charge conjugation, however, corresponds to a change from particles to antiparticles. We see that, if the particles possess an electric charge, the signs of the electron and positron charges are necessarily opposite.

The first-order equations (32.1) can be transformed into second-order equations by applying to them the operator $\gamma(p - eA) + m$:

$$[\gamma^\mu\gamma^\nu(p_\mu - eA_\mu)(p_\nu - eA_\nu) - m^2]\psi = 0.$$
The product $\gamma^\mu\gamma^\nu$ may be written as follows:

$$\gamma^\mu\gamma^\nu = \tfrac{1}{2}(\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu) + \tfrac{1}{2}(\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu) = g^{\mu\nu} + \sigma^{\mu\nu},$$

where $\sigma^{\mu\nu}$ is the antisymmetric "matrix 4-tensor" (28.2). On multiplying by $\sigma^{\mu\nu}$ we can antisymmetrize by the substitution

$$(p_\mu - eA_\mu)(p_\nu - eA_\nu) \to \tfrac{1}{2}\{(p_\mu - eA_\mu)(p_\nu - eA_\nu)\}_- = \tfrac{1}{2}e(-A_\mu p_\nu + p_\nu A_\mu - p_\mu A_\nu + A_\nu p_\mu) = \tfrac{1}{2}ie(\partial_\nu A_\mu - \partial_\mu A_\nu) = -\tfrac{1}{2}ieF_{\mu\nu}$$

(with $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ the electromagnetic field tensor). The result is the second-order equation

$$[(p - eA)^2 - m^2 - \tfrac{1}{2}ieF_{\mu\nu}\sigma^{\mu\nu}]\psi = 0. \tag{32.6}$$

‡ The transformation (32.3) with a function $\chi(t, \mathbf{r})$ is sometimes called a local gauge transformation, in contrast to the global gauge transformation (12.10) with a constant phase $\alpha$.

The product $F_{\mu\nu}\sigma^{\mu\nu}$ may be written in three-dimensional form in terms of the components $\sigma^{\mu\nu} = (\boldsymbol{\alpha}, i\boldsymbol{\Sigma})$, $F^{\mu\nu} = (-\mathbf{E}, \mathbf{H})$. Then

$$[(p - eA)^2 - m^2 + e\boldsymbol{\Sigma}\cdot\mathbf{H} - ie\boldsymbol{\alpha}\cdot\mathbf{E}]\psi = 0, \tag{32.7}$$

or, in ordinary units,

$$\left[\frac{1}{c^2}\left(i\hbar\frac{\partial}{\partial t} - e\Phi\right)^2 - \left(i\hbar\nabla + \frac{e}{c}\mathbf{A}\right)^2 - m^2c^2 + \frac{e\hbar}{c}\boldsymbol{\Sigma}\cdot\mathbf{H} - i\frac{e\hbar}{c}\boldsymbol{\alpha}\cdot\mathbf{E}\right]\psi = 0. \tag{32.7a}$$

The occurrence in these equations of terms in the fields $\mathbf{E}$ and $\mathbf{H}$ is due to the spin of the particle, and will be further discussed in §33.

The solutions of the second-order equation include, of course, "redundant" solutions which do not satisfy the original first-order equation (32.1), being solutions of that equation with the opposite sign of $m$. The choice of the appropriate solutions in particular cases is usually obvious and causes no difficulty. The customary procedure is that, if $\varphi$ is any solution of the second-order equation, then a solution of the correct first-order equation is

$$\psi = [\gamma(p - eA) + m]\varphi. \tag{32.8}$$

For, on multiplying this equation by $\gamma(p - eA) - m$, we see that the right-hand side vanishes if $\varphi$ satisfies (32.6).

It should be emphasized that the introduction of the external field into the relativistic wave equation by replacing $p$ by $p - eA$ is not self-evident. In doing so, we have essentially made use of a further principle: this substitution must be applied to first-order equations.
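The step from (32.6) to (32.7) rests on the contraction $F_{\mu\nu}\sigma^{\mu\nu} = 2(\boldsymbol\alpha\cdot\mathbf E + i\boldsymbol\Sigma\cdot\mathbf H)$, which can be confirmed numerically. The sketch below is a check under stated assumptions: the standard representation of the $\gamma$ matrices, $\sigma^{\mu\nu} = \tfrac12(\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu)$, and covariant components $F_{0i} = E_i$, $F_{ij} = -\epsilon_{ijk}H_k$ (the usual reading of $F^{\mu\nu} = (-\mathbf E, \mathbf H)$). The helper names and the field values are illustrative only.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]

# Standard representation (assumed): gam[0] = beta, gam[1..3] spatial
g0 = np.block([[I2, Z2], [Z2, -I2]])
gam = [g0] + [np.block([[Z2, s], [-s, Z2]]) for s in sigma]
metric = np.diag([1.0, -1.0, -1.0, -1.0])

alpha = [g0 @ gam[i] for i in (1, 2, 3)]          # alpha = gamma^0 gamma
Sigma = [np.block([[s, Z2], [Z2, s]]) for s in sigma]


def sigma_mn(mu, nu):
    """Antisymmetric matrix tensor sigma^{mu nu} = (1/2)[gamma^mu, gamma^nu]."""
    return 0.5 * (gam[mu] @ gam[nu] - gam[nu] @ gam[mu])


def field_tensor_lower(E, H):
    """Covariant F_{mu nu} with F_{0i} = E_i and F_{ij} = -eps_{ijk} H_k."""
    F = np.zeros((4, 4))
    for i in range(3):
        F[0, i + 1] = E[i]
        F[i + 1, 0] = -E[i]
    F[1, 2], F[2, 1] = -H[2], H[2]
    F[2, 3], F[3, 2] = -H[0], H[0]
    F[3, 1], F[1, 3] = -H[1], H[1]
    return F


E = np.array([0.7, -0.2, 1.1])
H = np.array([-0.4, 0.9, 0.3])
F = field_tensor_lower(E, H)

# Left-hand side: full contraction F_{mu nu} sigma^{mu nu}
contraction = sum(F[mu, nu] * sigma_mn(mu, nu)
                  for mu in range(4) for nu in range(4))

# Right-hand side: 2 (alpha . E + i Sigma . H)
expected = 2 * (sum(E[i] * alpha[i] for i in range(3))
                + 1j * sum(H[i] * Sigma[i] for i in range(3)))
```

Multiplying `contraction` by $-\tfrac12 ie$ then reproduces the spin terms $+e\boldsymbol\Sigma\cdot\mathbf H - ie\boldsymbol\alpha\cdot\mathbf E$ of (32.7).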
For this reason equation (32.6) contains additional terms which would not appear if the substitution were made directly in the second-order equation.

The stationary-state solutions of Dirac's equation in an external field may include states of both the continuous spectrum and the discrete spectrum. As in the non-relativistic theory, states of the continuous spectrum correspond to infinite motion, in which the particle can be at infinity; it may there be regarded as a free particle. Since the eigenvalues of the Hamiltonian of a free particle are $\pm\sqrt{\mathbf{p}^2 + m^2}$, it is clear that the continuous spectrum of energy eigenvalues is in the ranges $\varepsilon \ge m$ and $\varepsilon \le -m$. If $-m < \varepsilon < m$, the particle cannot be at infinity, and the motion is therefore finite and the state belongs to the discrete spectrum.

As in the case of free particles, the wave functions with "positive frequency" ($\varepsilon > 0$) and with "negative frequency" ($\varepsilon < 0$) appear in a definite manner in the second quantization procedure. For particles in an external field, there is a natural generalization of this procedure, the plane waves in formulae (25.1) being replaced by appropriately normalized eigenfunctions $\psi_n^{(+)}$ and $\psi_n^{(-)}$ of Dirac's equation, corresponding to positive and negative frequencies ($\varepsilon_n^{(+)}$ and $-\varepsilon_n^{(-)}$):

$$\hat\psi = \sum_n\left\{\hat a_n\psi_n^{(+)}\exp(-i\varepsilon_n^{(+)}t) + \hat b_n^{+}\psi_n^{(-)}\exp(i\varepsilon_n^{(-)}t)\right\}. \tag{32.9}$$

However, as the potential well becomes deeper, the energy levels may cross the boundary $\varepsilon = 0$, so that positive levels become negative (or vice versa when the potential has the opposite sign). Nevertheless, for the sake of continuity we must still regard these as electron (not positron) levels. That is, the electron states are to be regarded as all those which approach the positive limit of the continuous spectrum ($\varepsilon = m$) when the field is removed with infinite slowness.
Although Dirac's equation for an electron in an external field does, as already mentioned, yield solutions for a large class of problems in quantum electrodynamics, we must at the same time emphasize that the applicability of the concept of an external field in the one-particle problem of relativistic theory is nevertheless restricted, because of the spontaneous formation of electron-positron pairs in sufficiently strong fields (see §§35 and 36).

We shall not deal in this book with the inclusion of an external field in the wave equations for particles with spin other than 1/2, since the topic has no immediate physical significance: actual particles having such spins are hadrons, and their electromagnetic interactions cannot be described by wave equations.

PROBLEM

Determine the electron energy levels in a constant magnetic field.

SOLUTION. The vector potential is $A_x = A_z = 0$, $A_y = Hx$ (the field $\mathbf{H}$ being along the z-axis). The components $p_y$, $p_z$ of the generalized momentum (as well as the energy) are conserved. We use the second-order equation for the auxiliary function $\varphi$ (see (32.8)), and assume that $\varphi$ is an eigenfunction of the operator $\Sigma_z$ (with eigenvalue $\sigma = \pm 1$) and of the operators $p_y$, $p_z$. The equation for $\varphi$ is

$$\left[-\frac{d^2}{dx^2} + (eHx - p_y)^2 - eH\sigma\right]\varphi = (\varepsilon^2 - m^2 - p_z^2)\varphi.$$

This equation is the same in form as Schrödinger's equation for a linear oscillator. The eigenvalues of $\varepsilon$ are given by

$$\varepsilon^2 - m^2 - p_z^2 = |e|H(2n + 1) - eH\sigma, \qquad n = 0, 1, 2, \ldots;$$

cf. QM, §112. The wave function $\psi$, which is to be determined from $\varphi$ according to (32.8), is not an eigenfunction of the operator $\Sigma_z$, in accordance with the fact that the spin is not conserved for a particle in motion.

§33. Expansion in powers of 1/c†

We have seen in §21 that, in the non-relativistic limit ($v \to 0$), two components ($\chi$) of the bispinor $\psi = \begin{pmatrix}\varphi \\ \chi\end{pmatrix}$ vanish. Hence $\chi \ll \varphi$ when the electron velocity is small.
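This leads to an approximate equation involving only the two-component quantity $\varphi$, obtained by a formal expansion of the wave function in powers of $1/c$.

Dirac's equation for an electron in an external field may be written

$$i\hbar\frac{\partial\psi}{\partial t} = \left\{c\boldsymbol{\alpha}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right) + \beta mc^2 + e\Phi\right\}\psi. \tag{33.1}$$

The relativistic energy of the particle includes also its rest energy $mc^2$. This must be excluded in arriving at the non-relativistic approximation, and we therefore replace $\psi$ by a function $\psi'$ defined as follows:

$$\psi = \psi' e^{-imc^2t/\hbar}.$$

Then

$$\left(i\hbar\frac{\partial}{\partial t} + mc^2\right)\psi' = \left\{c\boldsymbol{\alpha}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right) + \beta mc^2 + e\Phi\right\}\psi'.$$

Substituting $\psi' = \begin{pmatrix}\varphi' \\ \chi'\end{pmatrix}$, we obtain the equations

$$\left(i\hbar\frac{\partial}{\partial t} - e\Phi\right)\varphi' = c\boldsymbol{\sigma}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\chi', \tag{33.2}$$

$$\left(i\hbar\frac{\partial}{\partial t} - e\Phi + 2mc^2\right)\chi' = c\boldsymbol{\sigma}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\varphi'. \tag{33.3}$$

The primes to $\varphi$ and $\chi$ will henceforth be omitted; this will cause no misunderstanding, since only the transformed function $\psi'$ is used in the present section.

† In this section, ordinary units are used.

In the first approximation, only the term $2mc^2\chi$ is retained on the left-hand side of (33.3), which gives

$$\chi = \frac{1}{2mc}\boldsymbol{\sigma}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\varphi \tag{33.4}$$

(thus $\chi \sim \varphi/c$). Substitution of this in (33.2) gives

$$\left(i\hbar\frac{\partial}{\partial t} - e\Phi\right)\varphi = \frac{1}{2m}\left(\boldsymbol{\sigma}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\right)^2\varphi.$$

For the Pauli matrices we have the relation

$$(\boldsymbol{\sigma}\cdot\mathbf{a})(\boldsymbol{\sigma}\cdot\mathbf{b}) = \mathbf{a}\cdot\mathbf{b} + i\boldsymbol{\sigma}\cdot\mathbf{a}\times\mathbf{b}, \tag{33.5}$$

where $\mathbf{a}$ and $\mathbf{b}$ are arbitrary vectors (see (20.9)). In the present case, $\mathbf{a} = \mathbf{b} = \mathbf{p} - e\mathbf{A}/c$, but the vector product $\mathbf{a}\times\mathbf{b}$ is not zero, since $\mathbf{p}$ and $\mathbf{A}$ do not commute:

$$\left[\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\times\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\right]\varphi = \frac{ie\hbar}{c}\{\mathbf{A}\times\nabla + \nabla\times\mathbf{A}\}\varphi = \frac{ie\hbar}{c}\,\mathrm{curl}\,\mathbf{A}\cdot\varphi.$$

Thus

$$\left(\boldsymbol{\sigma}\cdot\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)\right)^2 = \left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)^2 - \frac{e\hbar}{c}\boldsymbol{\sigma}\cdot\mathbf{H}, \tag{33.6}$$

where $\mathbf{H} = \mathrm{curl}\,\mathbf{A}$ is the magnetic field, and for $\varphi$ we obtain the equation

$$i\hbar\frac{\partial\varphi}{\partial t} = H\varphi = \left[\frac{1}{2m}\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)^2 + e\Phi - \frac{e\hbar}{2mc}\boldsymbol{\sigma}\cdot\mathbf{H}\right]\varphi. \tag{33.7}$$

This is Pauli's equation. It differs from the non-relativistic Schrödinger's equation by the last term in the Hamiltonian, which has the form of the potential energy of a magnetic dipole in the external field (cf. QM, §111).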
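The Pauli-matrix identity (33.5), on which the reduction to (33.7) depends, is easy to confirm numerically for ordinary (commuting) vectors. The sketch below does so for two random real vectors; the names `sdot`, `a`, `b` and the fixed random seed are illustrative choices made here. It does not reproduce the operator case $\mathbf a = \mathbf b = \mathbf p - e\mathbf A/c$, where the non-vanishing of $\mathbf a\times\mathbf b$ comes from the non-commutativity of $\mathbf p$ and $\mathbf A$.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)


def sdot(v):
    """Return the 2x2 matrix sigma . v for a real 3-vector v."""
    return v[0] * sx + v[1] * sy + v[2] * sz


rng = np.random.default_rng(0)
a = rng.normal(size=3)
b = rng.normal(size=3)

# (sigma.a)(sigma.b) versus a.b + i sigma.(a x b), eq. (33.5)
lhs = sdot(a) @ sdot(b)
rhs = np.dot(a, b) * I2 + 1j * sdot(np.cross(a, b))

# For a uniform field H the spin term sigma.H has eigenvalues -|H| and +|H|,
# so the last term of (33.7) takes the two values +/-(e hbar / 2mc)|H|.
H_field = np.array([0.0, 0.6, 0.8])
spin_eigs = np.linalg.eigvalsh(sdot(H_field))
```

For commuting components the cross term vanishes when `a` and `b` are parallel, which is why the magnetic-moment term appears only through the operator ordering discussed above.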
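Thus, in the first approximation (with respect to $1/c$), the electron behaves as a particle having both a charge and a magnetic moment

$$\boldsymbol{\mu} = \frac{e}{mc}\hbar\mathbf{s} = \frac{e\hbar}{2mc}\boldsymbol{\sigma}. \tag{33.8}$$

The gyromagnetic ratio $e/mc$ is twice its value for a magnetic moment due to orbital motion.†

† This remarkable result was first derived by P. A. M. Dirac in 1928. The two-component wave function satisfying equation (33.7) was introduced by W. Pauli (1927), before Dirac's discovery of his equation.

The density is $\rho = \psi^*\psi = \varphi^*\varphi + \chi^*\chi$. The second term must be omitted in the first approximation, so that $\rho = |\varphi|^2$, as would be expected for Schrödinger's equation. The current density is

$$\mathbf{j} = c\psi^*\boldsymbol{\alpha}\psi = c(\varphi^*\boldsymbol{\sigma}\chi + \chi^*\boldsymbol{\sigma}\varphi).$$

We substitute here, from (33.4),

$$\chi = \frac{1}{2mc}\left(-i\hbar\boldsymbol{\sigma}\cdot\nabla - \frac{e}{c}\boldsymbol{\sigma}\cdot\mathbf{A}\right)\varphi, \qquad \chi^* = \frac{1}{2mc}\left(i\hbar\nabla\varphi^*\cdot\boldsymbol{\sigma} - \frac{e}{c}\varphi^*\boldsymbol{\sigma}\cdot\mathbf{A}\right),$$

and transform the products containing two factors $\boldsymbol{\sigma}$ by means of (33.5) in the form

$$(\boldsymbol{\sigma}\cdot\mathbf{a})\boldsymbol{\sigma} = \mathbf{a} + i\boldsymbol{\sigma}\times\mathbf{a}, \qquad \boldsymbol{\sigma}(\boldsymbol{\sigma}\cdot\mathbf{a}) = \mathbf{a} + i\mathbf{a}\times\boldsymbol{\sigma}. \tag{33.9}$$

The result is

$$\mathbf{j} = \frac{i\hbar}{2m}(\varphi\nabla\varphi^* - \varphi^*\nabla\varphi) - \frac{e}{mc}\mathbf{A}\varphi^*\varphi + \frac{\hbar}{2m}\,\mathrm{curl}\,(\varphi^*\boldsymbol{\sigma}\varphi), \tag{33.10}$$

in agreement with the expression in the non-relativistic theory, QM (115.4).

Let us now derive‡ the second approximation, continuing the expansion as far as terms of order $1/c^2$, and assuming that there is only an electric external field ($\mathbf{A} = 0$).

‡ The method used here is due to V. B. Berestetskii and L. D. Landau (1949).

First, we note that, when terms $\sim 1/c^2$ are included, the density is

$$\rho = |\varphi|^2 + |\chi|^2 = |\varphi|^2 + \frac{\hbar^2}{4m^2c^2}|\boldsymbol{\sigma}\cdot\nabla\varphi|^2.$$

This differs from the Schrödinger expression. In order to find (in the second approximation) the wave equation corresponding to Schrödinger's equation, we must replace $\varphi$ by another (two-component) function $\varphi_{\rm Sch}$, for which the time-independent integral would be of the form $\int|\varphi_{\rm Sch}|^2\,d^3x$, as it should be for Schrödinger's equation.

To obtain the required transformation, we write the condition

$$\int\varphi^*_{\rm Sch}\varphi_{\rm Sch}\,d^3x = \int\left\{\varphi^*\varphi + \frac{\hbar^2}{4m^2c^2}(\nabla\varphi^*\cdot\boldsymbol{\sigma})(\boldsymbol{\sigma}\cdot\nabla\varphi)\right\}d^3x$$

and integrate by parts:

$$\int(\nabla\varphi^*\cdot\boldsymbol{\sigma})(\boldsymbol{\sigma}\cdot\nabla\varphi)\,d^3x = -\int\varphi^*(\boldsymbol{\sigma}\cdot\nabla)(\boldsymbol{\sigma}\cdot\nabla)\varphi\,d^3x = -\int\varphi^*\Delta\varphi\,d^3x$$

(or the same with $\varphi$ and $\varphi^*$ interchanged).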
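Thus

$$\int\varphi^*_{\rm Sch}\varphi_{\rm Sch}\,d^3x = \int\left\{\varphi^*\varphi - \frac{\hbar^2}{8m^2c^2}(\varphi^*\Delta\varphi + \varphi\Delta\varphi^*)\right\}d^3x,$$

whence it is evident that

$$\varphi_{\rm Sch} = \left(1 + \frac{\mathbf{p}^2}{8m^2c^2}\right)\varphi, \qquad \varphi = \left(1 - \frac{\mathbf{p}^2}{8m^2c^2}\right)\varphi_{\rm Sch}. \tag{33.11}$$

To simplify the notation we shall assume a stationary state, replacing the operator $i\hbar\,\partial/\partial t$ by the energy $\varepsilon$ (with the rest energy subtracted). In the next approximation after (33.4) we have from (33.3)

$$\chi = \frac{1}{2mc}\left(1 - \frac{\varepsilon - e\Phi}{2mc^2}\right)(\boldsymbol{\sigma}\cdot\mathbf{p})\varphi.$$

This is to be substituted in (33.2) and $\varphi$ then replaced by $\varphi_{\rm Sch}$ according to (33.11), omitting all terms of higher order than $1/c^2$. A simple calculation leads to an equation for $\varphi_{\rm Sch}$ in the form $\varepsilon\varphi_{\rm Sch} = H\varphi_{\rm Sch}$, where the Hamiltonian is

$$H = \frac{\mathbf{p}^2}{2m} + e\Phi - \frac{\mathbf{p}^4}{8m^3c^2} + \frac{e}{4m^2c^2}\left\{(\boldsymbol{\sigma}\cdot\mathbf{p})\Phi(\boldsymbol{\sigma}\cdot\mathbf{p}) - \tfrac{1}{2}(\mathbf{p}^2\Phi + \Phi\mathbf{p}^2)\right\}.$$

The expression in the braces is transformed by means of the formulae

$$(\boldsymbol{\sigma}\cdot\mathbf{p})\Phi(\boldsymbol{\sigma}\cdot\mathbf{p}) = \Phi\mathbf{p}^2 + (\boldsymbol{\sigma}\cdot\mathbf{p}\Phi)(\boldsymbol{\sigma}\cdot\mathbf{p}) = \Phi\mathbf{p}^2 + i\hbar(\boldsymbol{\sigma}\cdot\mathbf{E})(\boldsymbol{\sigma}\cdot\mathbf{p}),$$

$$\mathbf{p}^2\Phi - \Phi\mathbf{p}^2 = -\hbar^2\Delta\Phi + 2i\hbar\mathbf{E}\cdot\mathbf{p},$$

where $\mathbf{E} = -\nabla\Phi$ is the electric field. The final expression for the Hamiltonian is

$$H = \frac{\mathbf{p}^2}{2m} + e\Phi - \frac{\mathbf{p}^4}{8m^3c^2} - \frac{e\hbar}{4m^2c^2}\boldsymbol{\sigma}\cdot\mathbf{E}\times\mathbf{p} - \frac{e\hbar^2}{8m^2c^2}\,\mathrm{div}\,\mathbf{E}. \tag{33.12}$$

The last three terms are the required corrections of order $1/c^2$. The first of these three terms is due to the relativistic dependence of the kinetic energy on the momentum (the expansion of the difference $c\sqrt{\mathbf{p}^2 + m^2c^2} - mc^2$). The second, which may be called the spin-orbit interaction energy, is the energy of the interaction of the moving magnetic moment with the electric field.† The last is zero except at points where there are charges creating the external field (for instance, in the Coulomb field of a point charge $Ze$, $\Delta\Phi = -4\pi Ze\,\delta(\mathbf{r})$) (C. G. Darwin, 1928).

If the electric field is centrally symmetric, then

$$\mathbf{E} = -\frac{\mathbf{r}}{r}\frac{d\Phi}{dr},$$

and the spin-orbit interaction operator can be put in the form

$$V_{sl} = \frac{e\hbar}{4m^2c^2r}\frac{d\Phi}{dr}\,\boldsymbol{\sigma}\cdot\mathbf{r}\times\mathbf{p} = \frac{\hbar^2}{2m^2c^2r}\frac{dU}{dr}\,\mathbf{l}\cdot\mathbf{s}, \tag{33.13}$$

where $\mathbf{l}$ is the orbital angular momentum operator, $\mathbf{s} = \tfrac{1}{2}\boldsymbol{\sigma}$ is the electron spin operator, and $U = e\Phi$ is the potential energy of the electron in the field.

§34.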
Fine structure of levels of the hydrogen atom

Let us determine the relativistic corrections to the energy levels of the hydrogen atom, that is, of an electron in the Coulomb field of a fixed nucleus.‡ The velocity of the electron in the hydrogen atom is $v/c \sim \alpha \ll 1$. The required corrections can therefore be calculated by the use of perturbation theory, averaging over the unperturbed state (i.e. over the non-relativistic wave function) the relativistic terms in the approximate Hamiltonian (33.12). For somewhat greater generality we shall take the charge of the nucleus as $Ze$, assuming, however, that $Z\alpha \ll 1$.

The field of the nucleus is $\mathbf{E} = Ze\mathbf{r}/r^3$, and its potential satisfies the equation $\Delta\Phi = -4\pi Ze\,\delta(\mathbf{r})$. Substituting this in the last three terms in (33.12) and using the fact that the electron charge is negative, we obtain the perturbation operator

$$V = -\frac{\mathbf{p}^4}{8m^3} + \frac{Z\alpha}{2r^3m^2}\,\mathbf{l}\cdot\mathbf{s} + \frac{\pi Z\alpha}{2m^2}\,\delta(\mathbf{r}). \tag{34.1}$$

† With the magnetic moment (33.8) and the velocity $\mathbf{v} = \mathbf{p}/m$, this energy becomes $-\boldsymbol{\mu}\cdot\mathbf{E}\times\mathbf{v}/2c$. At first sight this result may appear unlikely, because on changing to a frame of reference fixed to the particle there arises a magnetic field $\mathbf{H} = \mathbf{E}\times\mathbf{v}/c$, in which the energy of the magnetic moment should be $-\boldsymbol{\mu}\cdot\mathbf{H}$. The occurrence of the factor 1/2 (the "Thomas half"; L. H. Thomas, 1926) is due to the general requirements of relativistic invariance together with the particular properties of the electron as a "spinor" particle with the corresponding value of the gyromagnetic ratio (see §41).

‡ The effect of the motion of the nucleus on these corrections is a quantity of a higher order of smallness, which will not be considered here.

Since, according to the non-relativistic Schrödinger's equation,

$$\mathbf{p}^2\psi = 2m\left(\varepsilon_0 + \frac{Z\alpha}{r}\right)\psi$$

(where $\varepsilon_0 = -mZ^2\alpha^2/2n^2$ is the unperturbed level and $n$ the principal quantum number), the mean value is

$$\overline{\mathbf{p}^4} = 4m^2\,\overline{\left(\varepsilon_0 + \frac{Z\alpha}{r}\right)^2}.$$
This quantity, like the mean value of the second term in (34.1), is calculated by means of the formulae (see QM, §36)

$$\overline{r^{-1}} = \frac{m\alpha Z}{n^2}, \qquad \overline{r^{-2}} = \frac{(m\alpha Z)^2}{n^3(l+\tfrac12)}, \qquad \overline{r^{-3}} = \frac{(m\alpha Z)^3}{n^3l(l+\tfrac12)(l+1)}, \tag{34.2}$$

the last of which is valid if $l \ne 0$; the eigenvalue is

$$\mathbf{l}\cdot\mathbf{s} = \tfrac{1}{2}\left[j(j+1) - l(l+1) - \tfrac{3}{4}\right] \text{ if } l \ne 0, \qquad \mathbf{l}\cdot\mathbf{s} = 0 \text{ if } l = 0.$$

Finally, the third term is averaged by means of the formulae

$$|\psi(0)|^2 = \frac{(m\alpha Z)^3}{\pi n^3} \text{ for } l = 0, \qquad \psi(0) = 0 \text{ for } l \ne 0. \tag{34.3}$$

The result of a simple calculation using the above formulae may be written for all cases (for all $j$ and $l$) as

$$\Delta\varepsilon = -\frac{m(Z\alpha)^4}{2n^3}\left(\frac{1}{j+\tfrac12} - \frac{3}{4n}\right). \tag{34.4}$$

Formula (34.4) gives the required relativistic correction to the energy of the hydrogen levels; that is, it gives the fine-structure energy.† In the non-relativistic theory, there is both degeneracy with respect to spin direction and Coulomb degeneracy with respect to $l$. The fine structure (spin-orbit interaction) removes this degeneracy, but not completely: there remain levels with mutual double degeneracy, having the same $n$ and $j$ but different $l = j \pm \tfrac12$. The levels with the maximum possible value of $j$ for a given $n$, $j_{\max} = l_{\max} + \tfrac12 = n - \tfrac12$, are then not degenerate. Thus the sequence of hydrogen levels, with allowance for the fine structure, is

$$1s_{1/2};\quad (2s_{1/2},\ 2p_{1/2}),\ 2p_{3/2};\quad (3s_{1/2},\ 3p_{1/2}),\ (3p_{3/2},\ 3d_{3/2}),\ 3d_{5/2};\ \ldots,$$

the levels in parentheses being mutually degenerate. The level with principal quantum number $n$ is split into $n$ fine-structure components.

† This formula, and the more exact formula (36.10), were first derived by A. Sommerfeld from the old Bohr theory before the development of quantum mechanics.

In non-relativistic mechanics, the "accidental" degeneracy of the energy levels in a Coulomb field is due to the existence of a conservation law peculiar to this field, relating to a quantity $\mathbf{A}$ whose operator is

$$\mathbf{A} = \frac{\mathbf{r}}{r} - \frac{1}{2m\alpha}(\mathbf{p}\times\mathbf{l} - \mathbf{l}\times\mathbf{p});$$

see QM, (36.30).
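The "simple calculation" leading to (34.4), and the residual degeneracy of levels with equal $n$ and $j$, can be illustrated by averaging the three terms of (34.1) with the mean values (34.2), (34.3) and comparing with the closed formula. The sketch below works in units $\hbar = c = 1$ with $m = 1$; the function names and the approximate numerical value used for the fine-structure constant are assumptions made for the example.

```python
import math

ALPHA = 1 / 137.036  # approximate fine-structure constant (assumed value)


def fine_structure_shift(n, j, Z=1, alpha=ALPHA, m=1.0):
    """Closed formula (34.4) for the fine-structure correction."""
    za = Z * alpha
    return -m * za**4 / (2 * n**3) * (1 / (j + 0.5) - 3 / (4 * n))


def shift_from_terms(n, l, j, Z=1, alpha=ALPHA, m=1.0):
    """Sum of the mean values of the three terms of (34.1)."""
    za = Z * alpha
    e0 = -m * za**2 / (2 * n**2)                  # unperturbed level
    r1 = m * za / n**2                            # mean of 1/r
    r2 = (m * za)**2 / (n**3 * (l + 0.5))         # mean of 1/r^2
    # -<p^4>/8m^3 with <p^4> = 4 m^2 <(e0 + Z alpha / r)^2>
    kinetic = -(e0**2 + 2 * e0 * za * r1 + za**2 * r2) / (2 * m)
    if l == 0:
        spin_orbit = 0.0
        psi0_sq = (m * za)**3 / (math.pi * n**3)  # |psi(0)|^2, eq. (34.3)
        darwin = math.pi * za / (2 * m**2) * psi0_sq
    else:
        r3 = (m * za)**3 / (n**3 * l * (l + 0.5) * (l + 1))
        ls = 0.5 * (j * (j + 1) - l * (l + 1) - 0.75)
        spin_orbit = za / (2 * m**2) * r3 * ls
        darwin = 0.0
    return kinetic + spin_orbit + darwin


# Compare both expressions for every level with n <= 4
levels = [(n, l, j)
          for n in range(1, 5)
          for l in range(n)
          for j in ([0.5] if l == 0 else [l - 0.5, l + 0.5])]
max_rel_error = max(
    abs(shift_from_terms(n, l, j) / fine_structure_shift(n, j) - 1)
    for n, l, j in levels
)

# Splitting between 2p3/2 and 2p1/2, which should equal m (Z alpha)^4 / 32
splitting_2p = fine_structure_shift(2, 1.5) - fine_structure_shift(2, 0.5)
```

The agreement between `shift_from_terms(2, 0, 0.5)` and `shift_from_terms(2, 1, 0.5)` is the twofold degeneracy of $2s_{1/2}$ and $2p_{1/2}$: it arises from different terms of (34.1) in the two cases but gives the same total.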
This specific conservation law is also responsible for the twofold degeneracy which remains in the relativistic case: the Hamiltonian $H = \boldsymbol{\alpha}\cdot\mathbf{p} + \beta m - e^2/r$ of Dirac's equation commutes with the operator

$$I = \frac{\mathbf{r}}{r}\cdot\boldsymbol{\Sigma} + \frac{i}{m\alpha}\,\beta(\boldsymbol{\Sigma}\cdot\mathbf{l} + 1)\gamma^5(H - m\beta)$$

(M. H. Johnson and B. A. Lippmann, 1950). In the non-relativistic limit, $I \to \boldsymbol{\Sigma}\cdot\mathbf{A}$.

We shall see later (§123) that this remaining degeneracy is removed by "radiative corrections" (the Lamb shift), which are neglected in Dirac's equation for the single-electron problem. To anticipate, it may be mentioned here that the order of magnitude of these corrections is $mZ^4\alpha^5\log(1/\alpha)$. The second-order spin-orbit interaction correction would be $\sim m(Z\alpha)^6$, and the ratio of this to the radiative corrections is therefore $\sim Z^2\alpha/\log(1/\alpha)$. For hydrogen ($Z = 1$), this ratio is certainly small, and the exact solution of Dirac's equation is therefore of no significance in that case, but it may be significant as regards the electron energy levels in the field of a nucleus with large $Z$ (§36).

§35. Motion in a centrally symmetric field

Let us now consider the motion of an electron in a centrally symmetric electric field.

Since the angular momentum and the parity (relative to the centre of the field, which is taken as the origin) are conserved in a central field, the discussion in §24 regarding spherical waves of free particles is entirely applicable to the angular dependence of the wave functions of such a motion; only the radial functions vary. Accordingly, we shall seek the wave function of the stationary states (in the standard representation) in the form

$$\psi = \begin{pmatrix}\varphi \\ \chi\end{pmatrix} = \begin{pmatrix} f(r)\,\Omega_{jlm} \\ (-1)^{(1+l-l')/2}\,g(r)\,\Omega_{jl'm}\end{pmatrix}, \tag{35.1}$$

where $l = j \pm \tfrac12$, $l' = 2j - l$, and the power of $-1$ is chosen for subsequent convenience.
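Dirac's equation in the standard representation yields the following equations for $\varphi$ and $\chi$:

$$(\varepsilon - m - U)\varphi = \boldsymbol{\sigma}\cdot\mathbf{p}\,\chi, \qquad (\varepsilon + m - U)\chi = \boldsymbol{\sigma}\cdot\mathbf{p}\,\varphi, \tag{35.2}$$

where $U(r) = e\Phi(r)$ is the potential energy of the electron in the field. The result of substituting the expressions (35.1) is calculated by evaluating the right-hand sides of these equations.

Expressing the spherical harmonic spinor $\Omega_{jl'm}$ in terms of $\Omega_{jlm}$ (see (24.8)), we can write

$$(\boldsymbol{\sigma}\cdot\mathbf{p})\chi = -i(\boldsymbol{\sigma}\cdot\mathbf{p})(\boldsymbol{\sigma}\cdot\mathbf{r})\,\frac{g}{r}\,\Omega_{jlm}.$$

Now transforming the product $(\boldsymbol{\sigma}\cdot\mathbf{p})(\boldsymbol{\sigma}\cdot\mathbf{r})$ by means of the formula (33.5), and expanding the vector operators, we have

$$(\boldsymbol{\sigma}\cdot\mathbf{p})\chi = -i\{\mathbf{p}\cdot\mathbf{r} + i\boldsymbol{\sigma}\cdot\mathbf{p}\times\mathbf{r}\}\frac{g}{r}\Omega_{jlm} = \{-\mathrm{div}\,\mathbf{r} - (\mathbf{r}\cdot\nabla) - \boldsymbol{\sigma}\cdot\mathbf{r}\times\mathbf{p}\}\frac{g}{r}\Omega_{jlm} = -\left\{g' + \frac{2}{r}g + \frac{g}{r}\,\boldsymbol{\sigma}\cdot\mathbf{l}\right\}\Omega_{jlm},$$

where $\mathbf{l} = \mathbf{r}\times\mathbf{p}$ is the orbital angular momentum operator; the prime denotes differentiation with respect to $r$. The eigenvalues of the product $\boldsymbol{\sigma}\cdot\mathbf{l} = 2\mathbf{l}\cdot\mathbf{s}$ are

$$2\mathbf{l}\cdot\mathbf{s} = \mathbf{j}^2 - \mathbf{l}^2 - \mathbf{s}^2 = j(j+1) - l(l+1) - \tfrac{3}{4} = \begin{cases} j - \tfrac12 & \text{if } l = j - \tfrac12, \\ -j - \tfrac32 & \text{if } l = j + \tfrac12. \end{cases}$$

In order to be able to use the same formulae for both cases ($l = j \pm \tfrac12$), it is convenient to write

$$\kappa = \begin{cases} -(j + \tfrac12) = -(l+1) & \text{for } j = l + \tfrac12, \\ +(j + \tfrac12) = l & \text{for } j = l - \tfrac12. \end{cases} \tag{35.3}$$

The number $\kappa$ takes all integral values except zero, the positive numbers corresponding to the case $j = l - \tfrac12$, and the negative numbers to the case $j = l + \tfrac12$. Then $\mathbf{l}\cdot\boldsymbol{\sigma} = -(1 + \kappa)$, and

$$(\boldsymbol{\sigma}\cdot\mathbf{p})\chi = -\left\{g' + \frac{1-\kappa}{r}g\right\}\Omega_{jlm}.$$

When this expression is substituted in the first equation (35.2), the spherical harmonic spinor $\Omega_{jlm}$ cancels from the two sides. Proceeding similarly with the second equation, we finally obtain the following equations for the radial functions:

$$f' + \frac{1+\kappa}{r}f - (\varepsilon + m - U)g = 0, \qquad g' + \frac{1-\kappa}{r}g + (\varepsilon - m - U)f = 0, \tag{35.4}$$

or

$$(fr)' + \frac{\kappa}{r}(fr) - (\varepsilon + m - U)gr = 0, \qquad (gr)' - \frac{\kappa}{r}(gr) + (\varepsilon - m - U)fr = 0. \tag{35.5}$$

Let us examine the behaviour of $f$ and $g$ at small distances, assuming that the field $U(r)$ increases more rapidly than $1/r$ as $r \to 0$. Then, for small $r$, equations (35.4) become

$$f' + Ug = 0, \qquad g' - Uf = 0.$$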
These have real solutions, of the form

$$ f = \text{constant}\times\sin\left(\int U\,dr + \delta\right), \qquad g = \text{constant}\times\cos\left(\int U\,dr + \delta\right), \tag{35.6} $$

where $\delta$ is an arbitrary constant. These functions oscillate as $r \to 0$, but tend to no limit. It is easy to see that this situation corresponds, in the non-relativistic theory, to the "fall" of a particle to the centre. First of all, we note that the smallness of $r$ here places no restrictions on the choice of solution, since there is no condition at $r = 0$ for the oscillatory function, and so the choice of $\delta$ remains arbitrary (the correct behaviour of the wave function for large $r$ can be ensured by an appropriate choice of $\delta$, for any value of $\varepsilon$). This indeterminacy can be eliminated by regarding the potential with a singularity (at $r = 0$) as the limit, when $r_0 \to 0$, of a potential cut off at some $r_0$, i.e. equal to $U(r)$ for $r > r_0$ and $U(r_0)$ for $r < r_0$. With $r_0$ finite, we of course obtain a definite set of energy levels, but the energy of the ground state tends to $-\infty$ as $r_0 \to 0$.

In the non-relativistic theory, this signifies a "fall" to the centre, since a particle at a deep level is localized in a small region near $r = 0$. In the relativistic theory, such a situation is impossible, since it would imply that the system was unstable with respect to the spontaneous generation of electron-positron pairs. For, whereas an energy exceeding $2m$ is needed to create such a pair in a vacuum, a smaller energy is sufficient in a field. In the presence of an electron bound state with energy $\varepsilon < m$, a pair can be formed by expending an energy $\varepsilon + m < 2m$, the result being a free positron and an electron in a bound state. If, on the other hand, the bound state energy is $\varepsilon < -m$, such a field can create positrons (with energy $-\varepsilon > m$) spontaneously, without taking energy from an external source. In the field under consideration, as $r_0 \to 0$, there is an infinity of such "anomalous" levels with $\varepsilon < -m$.
Thus fields whose potential $\Phi(r)$ increases more rapidly than $1/r$ as $r \to 0$ cannot be dealt with by means of Dirac's theory. This applies to potentials of either sign. Although a "fall" can occur, of course, only with attraction, the sign of $U = e\Phi$ depends also on that of the charge, so that electron levels behave anomalously in one case and positron levels in the other; in the latter case, the field generates free electrons.

Let us next consider the behaviour of the wave functions at large distances. If the field $U(r)$ decreases sufficiently rapidly as $r \to \infty$, it may be entirely neglected in the equations when determining the asymptotic form of the wave functions at large distances. When $\varepsilon > m$, i.e. in the continuous spectrum, we then return to the equation of free motion, so that the asymptotic form of the wave functions (spherical waves) differs from that for a free particle only by the presence of additional "phase shifts", whose values are determined by the form of the field at short distances.† These shifts depend on the values of $j$ and $l$; that is, on the number $\kappa$ defined above (and also, of course, on the energy $\varepsilon$). Denoting the phase shifts by $\delta_\kappa$ and using the expression (24.7) for a free spherical wave, we can therefore immediately write down the required asymptotic formula:

$$ \psi \approx \frac{2}{r}\,\frac{1}{\sqrt{2\varepsilon}}\begin{pmatrix} \sqrt{\varepsilon+m}\;\Omega_{jlm}\sin(pr - \tfrac12 l\pi + \delta_\kappa)\\ -\sqrt{\varepsilon-m}\;\Omega_{jl'm}\sin(pr - \tfrac12 l'\pi + \delta_\kappa)\end{pmatrix}, \tag{35.7} $$

or, with the definition (35.1),

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = \frac{\sqrt2}{r}\sqrt{\frac{\varepsilon\pm m}{\varepsilon}}\begin{Bmatrix}\sin\\ \cos\end{Bmatrix}(pr - \tfrac12 l\pi + \delta_\kappa), \tag{35.8} $$

where $p = \sqrt{\varepsilon^2 - m^2}$. The common coefficient here corresponds to the normalization of the radial functions by (24.5).

† Cf. QM, §33. As in the non-relativistic theory, $U(r)$ must decrease more rapidly than $1/r$. The case $U \sim 1/r$ will be discussed separately in §36.

The wave functions of the discrete spectrum ($\varepsilon < m$) decrease exponentially as $r \to \infty$:

$$ f = -\sqrt{\frac{m+\varepsilon}{m-\varepsilon}}\;g = A_0\,\frac{1}{r}\exp\left[-r\sqrt{m^2 - \varepsilon^2}\right], \tag{35.9} $$

where $A_0$ is a constant.
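As a quick numerical cross-check of the asymptotic form (35.8), the following minimal sketch verifies by finite differences that the functions $rf$ and $rg$ satisfy equations (35.5) once the terms in $U$ and $\kappa/r$ are dropped. The function names, the step size `h` and the sample parameter values are illustrative assumptions, not part of the text.

```python
import math

def asymptotic_rf_rg(r, eps, m, l, delta):
    """Return (r*f, r*g) from the asymptotic form (35.8)."""
    p = math.sqrt(eps**2 - m**2)
    phase = p * r - 0.5 * math.pi * l + delta
    rf = math.sqrt(2.0 * (eps + m) / eps) * math.sin(phase)
    rg = math.sqrt(2.0 * (eps - m) / eps) * math.cos(phase)
    return rf, rg

def free_residuals(r, eps, m, l, delta, h=1e-5):
    """Residuals of (35.5) with the U and kappa/r terms dropped.

    (fr)' - (eps + m) gr  and  (gr)' + (eps - m) fr  should both vanish.
    The step h is an assumed finite-difference parameter.
    """
    rf_plus, rg_plus = asymptotic_rf_rg(r + h, eps, m, l, delta)
    rf_minus, rg_minus = asymptotic_rf_rg(r - h, eps, m, l, delta)
    rf, rg = asymptotic_rf_rg(r, eps, m, l, delta)
    d_rf = (rf_plus - rf_minus) / (2.0 * h)  # central difference for (fr)'
    d_rg = (rg_plus - rg_minus) / (2.0 * h)  # central difference for (gr)'
    return d_rf - (eps + m) * rg, d_rg + (eps - m) * rf

if __name__ == "__main__":
    print(free_residuals(r=50.0, eps=1.5, m=1.0, l=1, delta=0.3))
```

Both residuals come out at the level of the finite-difference error for any choice of $l$ and $\delta_\kappa$, which reflects the fact that the phase shift enters only through the boundary conditions at short distances.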
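As in the non-relativistic theory, the phase shifts $\delta_\kappa$ (more precisely, the quantities $e^{2i\delta_\kappa} - 1$) determine the scattering amplitudes in a given field, as will be further discussed in §37. We shall not pause to investigate here the analytical properties of these amplitudes (cf. QM, §128), but merely note that $e^{2i\delta_\kappa}$ again has, as a function of energy, poles at the points corresponding to the levels for bound states of the particle. The residue of $e^{2i\delta_\kappa}$ at such a pole is related in a certain manner to the coefficient in the asymptotic expression for the corresponding wave function of the discrete spectrum. This relation may be found by a generalization of the non-relativistic formula, QM (128.17).

The necessary calculations are entirely similar to those in QM, §128. Differentiation of equations (35.5) with respect to the energy gives

$$ \left(r\frac{\partial f}{\partial\varepsilon}\right)' + \frac{\kappa}{r}\,r\frac{\partial f}{\partial\varepsilon} - (\varepsilon + m - U)\,r\frac{\partial g}{\partial\varepsilon} = rg, \qquad \left(r\frac{\partial g}{\partial\varepsilon}\right)' - \frac{\kappa}{r}\,r\frac{\partial g}{\partial\varepsilon} + (\varepsilon - m - U)\,r\frac{\partial f}{\partial\varepsilon} = -rf. $$

We multiply these two equations by $rg$ and $-rf$ respectively, and the two equations (35.5) by $-r\,\partial g/\partial\varepsilon$ and $r\,\partial f/\partial\varepsilon$, and add all four term-by-term. After simplification, we have

$$ \frac{\partial}{\partial r}\left[r^2\left(g\frac{\partial f}{\partial\varepsilon} - f\frac{\partial g}{\partial\varepsilon}\right)\right] = r^2(f^2 + g^2). $$

We now integrate with respect to $r$:

$$ r^2\left(g\frac{\partial f}{\partial\varepsilon} - f\frac{\partial g}{\partial\varepsilon}\right) = \int_0^r r^2(f^2 + g^2)\,dr, $$

and take the limit as $r \to \infty$. The integral on the right tends to unity, by the normalization condition. On the left-hand side, we use the fact that in the asymptotic region the functions $f$ and $g$ are related by

$$ g = \frac{(rf)'}{r(\varepsilon + m)}, $$

which is derived from (35.5) by neglecting terms in $U$ and in $1/r$. The result is

$$ -\frac{(rf)^2}{\varepsilon + m}\,\frac{\partial}{\partial\varepsilon}\left[\frac{(rf)'}{rf}\right] = 1. \tag{35.10} $$

This formula differs only in the coefficient ($\varepsilon + m$ replacing $2m$) from the corresponding non-relativistic formula (for the function $\chi$). There is therefore no need to repeat the subsequent calculations; the final formula, valid near the point $\varepsilon = \varepsilon_0$ (where $\varepsilon_0$ is the energy level), is

$$ e^{2i\delta_\kappa} \approx (-1)^l\,\frac{2\sqrt{m^2 - \varepsilon_0^2}}{m + \varepsilon_0}\,\frac{A_0^2}{\varepsilon - \varepsilon_0}, \tag{35.11} $$

where $A_0$ is the coefficient in the asymptotic expression (35.9).

PROBLEM

Find the limiting form of the wave function for small $r$ in a field $U \sim r^{-s}$, $s < 1$.

SOLUTION.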
For a free particle we have, when $r$ is small, $f \sim r^{l}$, $g \sim r^{l'}$, so that $f \gg g$ if $l < l'$, and $f \ll g$ if $l > l'$. We make the assumption (which is confirmed by the result) that this relationship exists also in the field considered here. If $l < l'$ (i.e. $\kappa = -(l+1)$), the term in $g$ may be neglected in the first equation (35.4), which gives $f \sim r^{l}$; the second equation, in which $(\varepsilon - m - U)f \approx -Uf$, then gives $g \sim r^{l+1-s}$. The case $l > l'$ is treated similarly. The result is

for $l < l'$: $f \sim r^{l}$, $g \sim r^{l+1-s}$;
for $l > l'$: $f \sim r^{l'+1-s}$, $g \sim r^{l'}$.

§36. Motion in a Coulomb field

We shall begin the study of the properties of the motion in the very important case of a Coulomb field by considering the behaviour of the wave functions at short distances, taking the particular case of an attractive field: $U = -Z\alpha/r$.† For small $r$, the terms in $\varepsilon \pm m$ may be omitted in equations (35.5), leaving

$$ (fr)' + \frac{\kappa}{r}\,fr - \frac{Z\alpha}{r}\,gr = 0, \qquad (gr)' - \frac{\kappa}{r}\,gr + \frac{Z\alpha}{r}\,fr = 0. $$

The two functions $fr$ and $gr$ appear in an equivalent manner in these equations, and we therefore seek them as equal powers of $r$: $fr = ar^\gamma$, $gr = br^\gamma$. Substitution gives

$$ a(\gamma + \kappa) - bZ\alpha = 0, \qquad aZ\alpha + b(\gamma - \kappa) = 0, $$

whence

$$ \gamma^2 = \kappa^2 - (Z\alpha)^2. \tag{36.1} $$

† In ordinary units, $U = -Ze^2/r$. In relativistic units, $e^2$ is replaced by the dimensionless quantity $\alpha$.

Let $(Z\alpha)^2 < \kappa^2$. Then $\gamma$ is real, and the positive value must be taken: the corresponding solution either does not diverge at $r = 0$ or does so less rapidly than the other. The choice may be justified by considering a potential which is "cut off" (see §35) at a certain small $r_0$ and then taking the limit $r_0 \to 0$; cf. the analogous discussion in QM, §35. Thus

$$ f = \frac{Z\alpha}{\gamma + \kappa}\,g = \text{constant}\times r^{\gamma - 1}, \qquad \gamma = \sqrt{\kappa^2 - Z^2\alpha^2} = \sqrt{(j + \tfrac12)^2 - Z^2\alpha^2}. \tag{36.2} $$

Although the wave function may become infinite at $r = 0$ (if $\gamma < 1$), the integral of $|\psi|^2$ remains finite, of course.

If $(Z\alpha)^2 > \kappa^2$, both values of $\gamma$ given by (36.2) are imaginary. The corresponding solutions oscillate as $r^{-1}\cos(|\gamma|\log r)$ when $r \to 0$, and this again corresponds to a situation which is inadmissible in the relativistic theory, as already shown above.
Since $\kappa^2 \ge 1$, this means that a purely Coulomb field can be discussed in Dirac's theory only if $Z\alpha < 1$, i.e. $Z < 137$.

Let us now give a qualitative description of the situation which arises when $Z > 137$. In order to avoid an indeterminacy in the boundary condition at $r = 0$, we must again consider a potential cut off at a distance $r_0$ (I. Ya. Pomeranchuk and Ya. A. Smorodinskiĭ, 1945). This has a direct physical significance as well as a formal one. The charge $Z > 137$ can in practice be concentrated only into a "superheavy" nucleus of finite radius. Let us therefore see how the configuration of levels varies as $Z$ increases for a given $r_0$.

In the Coulomb field when not cut off, the energy $\varepsilon_1$ of the lowest level tends to zero when $Z\alpha = 1$, and the $\varepsilon_1(Z)$ curve terminates, the level $\varepsilon_1$ becoming imaginary for $Z\alpha > 1$; see (36.10) below. In the cut-off field, for a given $r_0 \ne 0$, the level $\varepsilon_1$ passes through zero only for some $Z\alpha > 1$. The value $\varepsilon_1 = 0$ has no physical distinctiveness, and when $r_0 \ne 0$ it also has no formal distinctiveness, the $\varepsilon_1(Z)$ curve not terminating there. When $Z$ increases further, the levels continue to descend, and at a certain "critical" value $Z = Z_c(r_0)$ the energy $\varepsilon_1$ reaches the boundary ($-m$) of the lower continuum of levels. As explained in §35, this means that zero energy is required for the creation of a free positron. The critical value $Z_c$ is therefore the maximum charge that the "bare" nucleus can have for a given $r_0$. When $Z > Z_c$, the level $\varepsilon_1 < -m$, and the formation of two electron-positron pairs becomes energetically favourable. The positrons go to infinity, carrying kinetic energy $2(|\varepsilon_1| - m)$, and the two electrons occupy the level $\varepsilon_1$. This gives an "ion" with an occupied K shell and a charge $Z_{\text{eff}} = Z - 2$ (S. S. Gershteĭn and Ya. B. Zel'dovich, 1969).
The system is stable for $Z > Z_c$, up to values of $Z$ for which the limit $-m$ reaches the next level.†

Lastly, it may be noted that, even for a point charge, the form of the potential at short distances is affected by radiative corrections, but the resulting corrections to $Z_c\alpha$ are only of the order of $\alpha$.

† For example, if the nuclear charge is uniformly distributed in a sphere with radius $r_0 = 1.2 \times 10^{-12}$ cm, the critical value $Z_c = 170$, and the next level reaches the limit $-m$ when $Z = 185$ (V. S. Popov, 1970). A detailed account of the quantitative theory is in the review article by Ya. B. Zel'dovich and V. S. Popov, Soviet Physics Uspekhi 14, 673, 1972.

Let us now turn to the exact solution of the wave equation (C. G. Darwin, 1928; W. Gordon, 1928).

(a) Discrete spectrum ($\varepsilon < m$). We seek the functions $f$ and $g$ in the form

$$ f = \sqrt{m+\varepsilon}\;e^{-\rho/2}\rho^{\gamma-1}(Q_1 + Q_2), \qquad g = -\sqrt{m-\varepsilon}\;e^{-\rho/2}\rho^{\gamma-1}(Q_1 - Q_2), \tag{36.3} $$

with the notation

$$ \rho = 2\lambda r, \qquad \lambda = \sqrt{m^2 - \varepsilon^2}, \qquad \gamma = \sqrt{\kappa^2 - Z^2\alpha^2}. \tag{36.4} $$

This is reasonable, since we already know the behaviour of the functions as $\rho \to 0$ (36.2) and their exponential decrease ($\sim e^{-\rho/2}$) as $\rho \to \infty$. Since, as $\rho \to \infty$, the functions $f$ and $g$ must have the same asymptotic behaviour, we must expect that $Q_1 \gg Q_2$ as $\rho \to \infty$.

Substitution of (36.3) in (35.4) yields the equations

$$ \rho(Q_1 + Q_2)' + (\gamma + \kappa)(Q_1 + Q_2) - \rho Q_2 + Z\alpha\sqrt{\frac{m-\varepsilon}{m+\varepsilon}}\,(Q_1 - Q_2) = 0, $$

$$ \rho(Q_1 - Q_2)' + (\gamma - \kappa)(Q_1 - Q_2) + \rho Q_2 - Z\alpha\sqrt{\frac{m+\varepsilon}{m-\varepsilon}}\,(Q_1 + Q_2) = 0, $$

where the prime denotes differentiation with respect to $\rho$. The sum and difference of these give

$$ \rho Q_1' + \left(\gamma - \frac{Z\alpha\varepsilon}{\lambda}\right)Q_1 + \left(\kappa - \frac{Z\alpha m}{\lambda}\right)Q_2 = 0, \qquad \rho Q_2' + \left(\gamma + \frac{Z\alpha\varepsilon}{\lambda} - \rho\right)Q_2 + \left(\kappa + \frac{Z\alpha m}{\lambda}\right)Q_1 = 0, \tag{36.5} $$

or, eliminating either $Q_1$ or $Q_2$,

$$ \rho Q_1'' + (2\gamma + 1 - \rho)Q_1' - \left(\gamma - \frac{Z\alpha\varepsilon}{\lambda}\right)Q_1 = 0, \qquad \rho Q_2'' + (2\gamma + 1 - \rho)Q_2' - \left(\gamma + 1 - \frac{Z\alpha\varepsilon}{\lambda}\right)Q_2 = 0, $$

where we have used the fact that $\gamma^2 - (Z\alpha\varepsilon/\lambda)^2 = \kappa^2 - (Z\alpha m/\lambda)^2$.
The solution of these equations which is finite when $\rho = 0$ is

$$ Q_1 = AF\left(\gamma - \frac{Z\alpha\varepsilon}{\lambda},\,2\gamma + 1,\,\rho\right), \qquad Q_2 = BF\left(\gamma + 1 - \frac{Z\alpha\varepsilon}{\lambda},\,2\gamma + 1,\,\rho\right), \tag{36.6} $$

where $F(\alpha, \beta, z)$ is the confluent hypergeometric function. Putting $\rho = 0$ in either of the equations (36.5), we obtain the relation between the constants $A$ and $B$:

$$ B = -\frac{\gamma - Z\alpha\varepsilon/\lambda}{\kappa - Z\alpha m/\lambda}\,A. \tag{36.7} $$

The two hypergeometric functions in (36.6) must reduce to polynomials, since otherwise they would increase as $e^{\rho}$ when $\rho \to \infty$, and the wave function itself would increase as $e^{\rho/2}$. The function $F(\alpha, \beta, z)$ reduces to a polynomial if $\alpha$ is a negative integer or zero. We write

$$ \gamma - Z\alpha\varepsilon/\lambda = -n_r. \tag{36.8} $$

If $n_r = 1, 2, \ldots$, the two hypergeometric functions reduce to polynomials. If $n_r = 0$, only one of them does so. In that case, $\gamma = Z\alpha\varepsilon/\lambda$, and $Z\alpha m/\lambda = |\kappa|$, as is easily verified. If $\kappa < 0$, the coefficient $B$ (36.7) is zero, so that $Q_2 = 0$ and the necessary condition is satisfied. If $\kappa > 0$, however, then $B = -A$, and $Q_2$ remains divergent when $n_r = 0$. Thus the following values are possible for the quantum number $n_r$:

$$ n_r = \begin{cases} 0, 1, 2, \ldots & \text{for } \kappa < 0,\\ 1, 2, 3, \ldots & \text{for } \kappa > 0. \end{cases} \tag{36.9} $$

The definition (36.8) then yields the following expression for the discrete energy levels:

$$ \frac{\varepsilon}{m} = \left[1 + \frac{(Z\alpha)^2}{\left(\sqrt{\kappa^2 - (Z\alpha)^2} + n_r\right)^2}\right]^{-1/2}. \tag{36.10} $$

In particular, the energy of the $1s_{1/2}$ ground level ($|\kappa| = 1$, $n_r = 0$) is $\varepsilon_1 = m\sqrt{1 - (Z\alpha)^2}$.

For $Z\alpha \ll 1$, the leading terms in the expansion of (36.10) are

$$ \frac{\varepsilon}{m} - 1 = -\frac{(Z\alpha)^2}{2(|\kappa| + n_r)^2}\left\{1 + \frac{(Z\alpha)^2}{|\kappa| + n_r}\left[\frac{1}{|\kappa|} - \frac{3}{4(|\kappa| + n_r)}\right]\right\}. $$

On writing $n_r + |\kappa| = n$ ($= 1, 2, \ldots$) and noting that $|\kappa| = j + \tfrac12$, we return to formula (34.4), which was previously derived by means of perturbation theory. As already mentioned at the end of §34, the further terms in this expansion have no significance, since they are certainly exceeded by the radiative corrections. Formula (36.10) as it stands, however, is meaningful when $Z\alpha \sim 1$.
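The level formula (36.10) and its small-$Z\alpha$ expansion lend themselves to a direct numerical comparison. The sketch below is only illustrative: the function names are hypothetical, energies are measured in units of $m$ by default, and the numerical value of the fine-structure constant is an assumption supplied for the example.

```python
import math

ALPHA = 1.0 / 137.035999  # assumed value of the fine-structure constant

def dirac_energy(Z, kappa, n_r, m=1.0, alpha=ALPHA):
    """Exact discrete level from (36.10), subject to the rule (36.9)."""
    if kappa == 0:
        raise ValueError("kappa takes all integral values except zero")
    if n_r < 0 or (kappa > 0 and n_r == 0):
        raise ValueError("n_r = 0 is allowed only for kappa < 0, see (36.9)")
    za = Z * alpha
    if za >= abs(kappa):
        raise ValueError("gamma is imaginary: (Z alpha)^2 must be below kappa^2")
    gamma = math.sqrt(kappa**2 - za**2)
    return m / math.sqrt(1.0 + (za / (gamma + n_r))**2)

def expanded_energy(Z, kappa, n_r, m=1.0, alpha=ALPHA):
    """Leading terms of the expansion of (36.10) for Z*alpha << 1."""
    za = Z * alpha
    n = abs(kappa) + n_r  # principal quantum number
    correction = 1.0 + (za**2 / n) * (1.0 / abs(kappa) - 3.0 / (4.0 * n))
    return m * (1.0 - za**2 / (2.0 * n**2) * correction)

if __name__ == "__main__":
    # Hydrogen ground state: the exact and expanded values differ at order (Z alpha)^6.
    print(dirac_energy(1, -1, 0) - expanded_energy(1, -1, 0))
    # Levels with the same |kappa| and n_r (e.g. 2s_1/2 and 2p_1/2) coincide.
    print(dirac_energy(1, -1, 1) == dirac_energy(1, 1, 1))
```

Because only $\kappa^2$ enters the exact formula, the two states with the same $j$ and different $l$ give identical energies in this sketch, which is the twofold degeneracy noted in the text.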
The double degeneracy of the levels shown by the approximate formula (34.4) exists also in the exact formula, which involves only $|\kappa|$, so that levels with the same $j$ and different $l$ again coincide.

We have still to determine the common normalization factor $A$ in the wave function. The wave function of the discrete spectrum must, as usual, be normalized by the condition $\int|\psi|^2\,d^3x = 1$; the corresponding condition on the functions $f$ and $g$ is

$$ \int_0^\infty (f^2 + g^2)\,r^2\,dr = 1. $$

The value of $A$ is most simply determined from the asymptotic form of the functions as $r \to \infty$. Using the asymptotic formula

$$ F(-n_r, 2\gamma + 1, \rho) \approx \frac{\Gamma(2\gamma + 1)}{\Gamma(n_r + 2\gamma + 1)}\,(-\rho)^{n_r} $$

(see QM, (d.14)), we find

$$ f \approx (-1)^{n_r}A\sqrt{m+\varepsilon}\;\frac{\Gamma(2\gamma + 1)}{\Gamma(n_r + 2\gamma + 1)}\,e^{-\lambda r}(2\lambda r)^{\gamma + n_r - 1}. $$

Comparing this with (36.22) derived below, we can find $A$. Collecting together the formulae, we can write out in full the final expressions for the normalized wave functions:

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = \frac{\pm(2\lambda)^{3/2}}{\Gamma(2\gamma + 1)}\left[\frac{(m \pm \varepsilon)\,\Gamma(2\gamma + n_r + 1)}{4m\,(Z\alpha m/\lambda)(Z\alpha m/\lambda - \kappa)\,n_r!}\right]^{1/2}(2\lambda r)^{\gamma - 1}e^{-\lambda r} \times \left\{\left(\frac{Z\alpha m}{\lambda} - \kappa\right)F(-n_r, 2\gamma + 1, 2\lambda r) \mp n_r F(1 - n_r, 2\gamma + 1, 2\lambda r)\right\}, \tag{36.11} $$

where the upper signs refer to $f$ and the lower signs to $g$.

(b) Continuous spectrum ($\varepsilon > m$). There is no need to solve the wave equation afresh for the states of the continuous spectrum. The wave functions for this case are obtained from those of the discrete spectrum by the substitutions†

$$ \sqrt{m - \varepsilon} \to -i\sqrt{\varepsilon - m}, \qquad \lambda \to -ip, \qquad -n_r \to \gamma - iZ\alpha\varepsilon/p; \tag{36.12} $$

see QM, §128 concerning the choice of sign in the analytical continuation of the square root $\sqrt{m - \varepsilon}$. The normalization of the functions must, however, be done again.

† In the rest of this section, $p$ denotes $|\mathbf p| = \sqrt{\varepsilon^2 - m^2}$.

Making these substitutions in (36.11), we may write

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = \begin{Bmatrix}\sqrt{\varepsilon + m}\\ i\sqrt{\varepsilon - m}\end{Bmatrix} A' e^{ipr}(2pr)^{\gamma - 1} \times \left[e^{i\xi}F(\gamma - i\nu, 2\gamma + 1, -2ipr) \mp e^{-i\xi}F(\gamma + 1 - i\nu, 2\gamma + 1, -2ipr)\right], $$

where $A'$ is another normalization constant,

$$ \nu = Z\alpha\varepsilon/p, \qquad e^{-2i\xi} = \frac{\gamma - i\nu}{\kappa - i\nu m/\varepsilon}; \tag{36.13} $$

the value of $\xi$ is real, since $\gamma^2 + (Z\alpha\varepsilon/p)^2 = \kappa^2 + (Z\alpha m/p)^2$.
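The statement that $\xi$ in (36.13) is real amounts to the ratio $(\gamma - i\nu)/(\kappa - i\nu m/\varepsilon)$ having unit modulus. A minimal sketch of that check is given below; the function name and the sample values of $Z\alpha$, $\kappa$ and $\varepsilon$ are illustrative assumptions.

```python
import cmath
import math

def coulomb_xi(z_alpha, kappa, eps, m=1.0):
    """Return (exp(-2 i xi), xi) as defined in (36.13) for eps > m."""
    p = math.sqrt(eps**2 - m**2)
    nu = z_alpha * eps / p
    gamma = math.sqrt(kappa**2 - z_alpha**2)
    ratio = (gamma - 1j * nu) / (kappa - 1j * nu * m / eps)  # this is exp(-2 i xi)
    xi = -0.5 * cmath.phase(ratio)  # real phase, defined modulo pi
    return ratio, xi

if __name__ == "__main__":
    ratio, xi = coulomb_xi(z_alpha=0.6, kappa=-2, eps=1.7)
    print(abs(ratio), xi)
```

The modulus differs from unity only by rounding error, in agreement with the identity $\gamma^2 + \nu^2 = \kappa^2 + (\nu m/\varepsilon)^2$.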
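According to the formula $F(\alpha, \beta, z) = e^{z}F(\beta - \alpha, \beta, -z)$ (see QM, (d.10)), we have

$$ F(\gamma + 1 - i\nu, 2\gamma + 1, -2ipr) = e^{-2ipr}F(\gamma + i\nu, 2\gamma + 1, 2ipr) = e^{-2ipr}F^*(\gamma - i\nu, 2\gamma + 1, -2ipr). $$

Hence

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = 2iA'\sqrt{\varepsilon \pm m}\;(2pr)^{\gamma - 1}\begin{Bmatrix}\operatorname{Im}\\ \operatorname{Re}\end{Bmatrix}\left\{e^{i(pr + \xi)}F(\gamma - i\nu, 2\gamma + 1, -2ipr)\right\}. \tag{36.14} $$

The normalization coefficient $A'$ is found by comparing the asymptotic expression for this function with the general formula (35.7) for a normalized spherical wave. The resulting expression for the wave functions of the continuous spectrum (which we shall afterwards verify) is†

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = \sqrt2\,\sqrt{\frac{\varepsilon \pm m}{\varepsilon}}\;e^{\pi\nu/2}\,\frac{|\Gamma(\gamma + 1 + i\nu)|}{\Gamma(2\gamma + 1)}\,\frac{(2pr)^{\gamma}}{r} \times \begin{Bmatrix}\operatorname{Im}\\ \operatorname{Re}\end{Bmatrix}\left\{e^{i(pr + \xi)}F(\gamma - i\nu, 2\gamma + 1, -2ipr)\right\}. \tag{36.15} $$

† The wave functions for a repulsive field are obtained by changing the sign of $Z\alpha$, i.e. that of $\nu$.

The asymptotic expression for this function is derived by means of QM (d.14), where only the first term is now significant, the second decreasing as a higher power of $1/r$:

$$ \begin{Bmatrix} f\\ g\end{Bmatrix} = \frac{\sqrt2}{r}\sqrt{\frac{\varepsilon \pm m}{\varepsilon}}\begin{Bmatrix}\sin\\ \cos\end{Bmatrix}(pr + \delta_\kappa + \nu\log 2pr - \tfrac12 l\pi), \tag{36.16} $$

where

$$ \delta_\kappa = \xi - \arg\Gamma(\gamma + 1 + i\nu) - \tfrac12\pi\gamma + \tfrac12 l\pi, \tag{36.17} $$

or

$$ e^{2i\delta_\kappa} = \frac{\kappa - i\nu m/\varepsilon}{\gamma - i\nu}\,\frac{\Gamma(\gamma + 1 - i\nu)}{\Gamma(\gamma + 1 + i\nu)}\,e^{i\pi(l - \gamma)}. \tag{36.18} $$

For future reference, we shall give an expression for the phase in the ultra-relativistic case ($\varepsilon \gg m$, $\nu \approx Z\alpha$):

$$ e^{2i\delta_\kappa} = \frac{\kappa}{\gamma - iZ\alpha}\,\frac{\Gamma(\gamma + 1 - iZ\alpha)}{\Gamma(\gamma + 1 + iZ\alpha)}\,e^{i\pi(l - \gamma)}. \tag{36.19} $$

The expression (36.16) differs from (35.8) only by the logarithmic term in the argument of the trigonometric function. As in Schrödinger's equation, the slowness of the decrease of the Coulomb potential affects the phase of the wave, which becomes a slowly varying function of $r$.

Analytical continuation into the region $\varepsilon < m$ gives, in place of (36.18),

$$ e^{2i\delta_\kappa} = \frac{\kappa - Z\alpha m/\lambda}{\gamma - Z\alpha\varepsilon/\lambda}\,\frac{\Gamma(\gamma + 1 - Z\alpha\varepsilon/\lambda)}{\Gamma(\gamma + 1 + Z\alpha\varepsilon/\lambda)}\,e^{i\pi(l - \gamma)}. \tag{36.20} $$

This expression has poles at the points where $\gamma + 1 - Z\alpha\varepsilon/\lambda = 1 - n_r$, $n_r = 1, 2, \ldots$ (poles of the gamma function in the numerator), and also at the point $\gamma - Z\alpha\varepsilon/\lambda = -n_r = 0$ (if also $\kappa < 0$); these points coincide with the discrete energy levels, as they should.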
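That the poles of (36.20) sit exactly at the discrete levels can be checked numerically: inserting an energy computed from (36.10) into the argument of the gamma function in the numerator should return $1 - n_r$. The sketch below uses hypothetical helper names and arbitrary sample values of $Z\alpha$ and $\kappa$.

```python
import math

def coulomb_level(z_alpha, kappa, n_r, m=1.0):
    """Discrete level from (36.10)."""
    gamma = math.sqrt(kappa**2 - z_alpha**2)
    return m / math.sqrt(1.0 + (z_alpha / (gamma + n_r))**2)

def numerator_gamma_argument(z_alpha, kappa, eps, m=1.0):
    """Argument gamma + 1 - Z*alpha*eps/lambda of the numerator gamma function in (36.20)."""
    lam = math.sqrt(m**2 - eps**2)
    gamma = math.sqrt(kappa**2 - z_alpha**2)
    return gamma + 1.0 - z_alpha * eps / lam

if __name__ == "__main__":
    for n_r in (1, 2, 3):
        eps = coulomb_level(0.5, -1, n_r)
        # Each printed argument should be the non-positive integer 1 - n_r.
        print(n_r, numerator_gamma_argument(0.5, -1, eps))
```

For $n_r = 0$ the same computation gives $\gamma - Z\alpha\varepsilon/\lambda = 0$, which is the additional pole mentioned in the text for $\kappa < 0$.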
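Near any of the poles with $n_r \ne 0$, we have

$$ e^{2i\delta_\kappa} \approx \frac{\kappa - Z\alpha m/\lambda}{-n_r}\,\frac{e^{i\pi(l - \gamma)}}{\Gamma(2\gamma + 1 + n_r)}\,\Gamma(\gamma + 1 - Z\alpha\varepsilon/\lambda). $$

The form of the gamma function near its pole is found by means of the familiar formula $\Gamma(z)\Gamma(1 - z) = \pi/\sin\pi z$:

$$ \Gamma\left(\gamma + 1 - \frac{Z\alpha\varepsilon}{\lambda}\right) \approx \frac{\pi}{\Gamma(n_r)\,\sin\pi(\gamma + 1 - Z\alpha\varepsilon/\lambda)}, $$

$$ \sin\pi(\gamma + 1 - Z\alpha\varepsilon/\lambda) \approx \pi\cos\pi n_r\cdot\frac{d}{d\varepsilon}\left(\frac{Z\alpha\varepsilon}{\lambda}\right)(\varepsilon - \varepsilon_0) = (-1)^{n_r}\,\frac{\pi Z\alpha m^2}{\lambda^3}\,(\varepsilon - \varepsilon_0), $$

where $\varepsilon_0$ is the energy level. Thus we have†

$$ e^{2i\delta_\kappa} \approx (-1)^{n_r}\,e^{i\pi(l - \gamma)}\,\frac{Z\alpha m/\lambda - \kappa}{n_r!\,\Gamma(2\gamma + 1 + n_r)}\,\frac{\lambda^3}{Z\alpha m^2}\,\frac{1}{\varepsilon - \varepsilon_0}. \tag{36.21} $$

† This formula is easily seen to be valid even if $n_r = 0$.

At the end of §35 a formula (35.11) was derived which relates the residue of the function $e^{2i\delta_\kappa}$ at its pole to the coefficient in the asymptotic expression for the wave function of the corresponding bound state. For a Coulomb field, however, the formula (35.10) must be slightly modified, because the constant phase shift $\delta_\kappa$ in (35.7) is replaced in (36.16) by the sum $\delta_\kappa + \nu\log 2pr$. We must therefore replace $e^{2i\delta_\kappa}$ on the left-hand side of (35.11) by

$$ \exp(2i\delta_\kappa + 2i\nu\log 2pr) \to e^{2i\delta_\kappa}(2i\lambda r)^{2(n_r + \gamma)}. $$

Using (36.21) and determining from (35.11) the coefficient $A_0$ (which will now be a power function of $r$), we find the asymptotic form of the normalized wave function of the discrete spectrum:

$$ f \approx (-1)^{n_r}\,\lambda\left[\frac{(m + \varepsilon)(Z\alpha m/\lambda - \kappa)}{2Z\alpha m^2\,n_r!\,\Gamma(2\gamma + 1 + n_r)}\right]^{1/2}\frac{(2\lambda r)^{\gamma + n_r}}{r}\,e^{-\lambda r}. \tag{36.22} $$

This has already been used to determine the coefficient in (36.11).

§37. Scattering in a centrally symmetric field

The asymptotic expression for the wave function which describes the scattering of particles in the field of a fixed centre of force may be written†

$$ \psi = u_{\varepsilon\mathbf p}\,e^{ipz} + u'_{\varepsilon\mathbf p'}\,\frac{e^{ipr}}{r}. \tag{37.1} $$

Here $u_{\varepsilon\mathbf p}$ is the bispinor amplitude of the incident plane wave.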
The bispinor $u'_{\varepsilon\mathbf p'}$ is a function of the direction of scattering $\mathbf n'$, and for any given value of $\mathbf n'$ its form (but not, of course, its normalization) is the same as that of the bispinor amplitude of the plane wave propagated in the direction $\mathbf n'$.

† In §§37 and 38 $p$ denotes $|\mathbf p|$, and $\varepsilon$ and $\mathbf p$ will be written separately as suffixes to the amplitude.

We have seen in §24 that the bispinor amplitude of the plane wave is entirely determined by specifying a two-component quantity, the three-dimensional spinor $w$, which is the non-relativistic wave function in the rest frame of the particle. The flux density is expressed in terms of the same spinor, and is proportional to $w^*w$, with a proportionality coefficient which depends only on the energy $\varepsilon$ and is therefore the same for the incident and scattered particles. The scattering cross-section is $d\sigma = (w'^*w'/w^*w)\,do$ or, if as in §24 the incident wave is normalized by the condition $w^*w = 1$, $d\sigma = w'^*w'\,do$. We define the scattering operator $\hat f$ by

$$ w' = \hat f w. \tag{37.2} $$

Since the quantities $w$, $w'$ have two components, the operator thus defined is exactly analogous to the operator scattering amplitude which appears in the non-relativistic scattering theory taking account of spin (QM, §140). We can therefore apply immediately the formulae derived there which express the operator in terms of the phase shifts of the wave functions in the scattering field. It is only necessary to transform these phase shifts by expressing $\delta_l^+$ and $\delta_l^-$ from QM, §140, in terms of the phase shift $\delta_\kappa$ which appears in the relativistic formula (35.7).

The phases $\delta_l^+$ and $\delta_l^-$ referred to states with orbital angular momentum $l$ and total angular momentum $j = l + \tfrac12$ and $j = l - \tfrac12$. According to the definition (35.3), $\kappa = -l - 1$ for $j = l + \tfrac12$ and $\kappa = l$ for $j = l - \tfrac12$. We must therefore make the changes $\delta_l^+ \to \delta_{-(l+1)}$, $\delta_l^- \to \delta_l$ and remember that the suffix to $\delta$ now represents the value of $\kappa$.
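Thus we find

$$ \hat f = A + B\,\boldsymbol\nu\cdot\boldsymbol\sigma, \tag{37.3} $$

$$ A = \frac{1}{2ip}\sum_{l=0}^{\infty}\left[(l + 1)(e^{2i\delta_{-l-1}} - 1) + l(e^{2i\delta_l} - 1)\right]P_l(\cos\theta), \tag{37.4} $$

$$ B = \frac{1}{2p}\sum_{l=1}^{\infty}\left(e^{2i\delta_{-l-1}} - e^{2i\delta_l}\right)P_l^1(\cos\theta), \tag{37.5} $$

where $\boldsymbol\nu$ is a unit vector in the direction of $\mathbf n\times\mathbf n'$. Since $w$ is the spinor wave function in the rest frame, the polarization properties of the scattering are given in terms of $\hat f$ by the same formulae as in QM, §140.

For a Coulomb field, it is possible to express both functions $A(\theta)$ and $B(\theta)$ in terms of one function. The calculation is briefly as follows.† In a Coulomb field, the phases $\delta_\kappa$ are given by (36.18), which we write in the form

$$ e^{2i\delta_\kappa} = -\frac{\kappa}{|\kappa|}\left(\kappa - iZ\alpha\frac{m}{p}\right)C_{|\kappa|}, \qquad C_{|\kappa|} = -\frac{\Gamma(\gamma - i\nu)}{\Gamma(\gamma + 1 + i\nu)}\,e^{i\pi(|\kappa| - \gamma)} \tag{37.6} $$

($e^{i\pi l} = e^{i\pi\kappa}$ when $\kappa > 0$ and $e^{i\pi l} = -e^{i\pi|\kappa|}$ when $\kappa < 0$). Using the quantities thus defined we can put the series (37.4), (37.5) in the form

$$ A(\theta) = \frac{1}{p}\,G(\theta) - i\,\frac{Z\alpha m}{p^2}\,F(\theta), \qquad B(\theta) = -\frac{i}{p}\tan\tfrac12\theta\cdot G(\theta) + \frac{Z\alpha m}{p^2}\cot\tfrac12\theta\cdot F(\theta), \tag{37.7} $$

where

$$ G(\theta) = \tfrac12 i\sum_{l=1}^{\infty} l^2 C_l\,(P_l + P_{l-1}), \qquad F(\theta) = \tfrac12 i\sum_{l=1}^{\infty} l\,C_l\,(P_l - P_{l-1}). \tag{37.8} $$

In transforming the series $B(\theta)$, we have used the following recurrence relations between Legendre polynomials:

$$ P_l^1 + P_{l-1}^1 = -\cot\tfrac12\theta\cdot l\,(P_l - P_{l-1}), \tag{37.9} $$

$$ P_l^1 - P_{l-1}^1 = \tan\tfrac12\theta\cdot l\,(P_l + P_{l-1}). \tag{37.10} $$

† R. L. Gluckstern and S. R. Lin, Journal of Mathematical Physics 5, 1594, 1964.

According to the identity

$$ (1 + \cos\theta)\,\frac{d}{d\cos\theta}\left[P_l(\cos\theta) - P_{l-1}(\cos\theta)\right] = l\left[P_l(\cos\theta) + P_{l-1}(\cos\theta)\right], \tag{37.11} $$

the functions $F(\theta)$ and $G(\theta)$ are related by

$$ G = (1 + \cos\theta)\,\frac{dF}{d\cos\theta} = -\cot\tfrac12\theta\;\frac{dF}{d\theta}. \tag{37.12} $$

Thus $A(\theta)$ and $B(\theta)$ are expressed in terms of the single function $F(\theta)$.†

§38. Scattering in the ultra-relativistic case

We shall now discuss separately the scattering in the ultra-relativistic case ($\varepsilon \gg m$). In the first approximation, we neglect altogether the mass $m$ in the wave equation. It is convenient to use for $\psi$ the spinor representation $\psi = \binom{\xi}{\eta}$, since the equations for $\xi$ and $\eta$ are separable when $m = 0$:

$$ -i\boldsymbol\sigma\cdot\nabla\xi = (\varepsilon - U)\xi, \qquad -i\boldsymbol\sigma\cdot\nabla\eta = -(\varepsilon - U)\eta \tag{38.1} $$

(the "neutrino" form, §30).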
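The recurrence relations (37.9) and (37.10) and the identity (37.11) used in §37 can be verified numerically for the first few values of $l$. The sketch below is a plain illustration under one stated assumption: the associated Legendre function is taken as $P_l^1(\cos\theta) = \sin\theta\,dP_l/d\cos\theta$, without an extra sign factor. The function names are hypothetical.

```python
import math

def legendre(l, x):
    """Return (P_l(x), dP_l/dx) for |x| < 1 using the standard recurrences."""
    if l == 0:
        return 1.0, 0.0
    p_prev, p = 1.0, x
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    dp = l * (x * p - p_prev) / (x * x - 1.0)  # (x^2 - 1) P_l' = l (x P_l - P_{l-1})
    return p, dp

def recurrence_residuals(l, theta):
    """Left side minus right side of (37.9), (37.10) and (37.11) for l >= 1."""
    x = math.cos(theta)
    p_l, dp_l = legendre(l, x)
    p_lm1, dp_lm1 = legendre(l - 1, x)
    # Assumed convention: P_l^1(cos theta) = sin(theta) * dP_l/d(cos theta)
    p1_l = math.sin(theta) * dp_l
    p1_lm1 = math.sin(theta) * dp_lm1
    r_37_9 = (p1_l + p1_lm1) + l * (p_l - p_lm1) / math.tan(theta / 2.0)
    r_37_10 = (p1_l - p1_lm1) - math.tan(theta / 2.0) * l * (p_l + p_lm1)
    r_37_11 = (1.0 + x) * (dp_l - dp_lm1) - l * (p_l + p_lm1)
    return r_37_9, r_37_10, r_37_11

if __name__ == "__main__":
    for l in range(1, 5):
        print(l, recurrence_residuals(l, 1.1))
```

All three residuals vanish to rounding accuracy, which also confirms the factor $1 + \cos\theta$ in (37.11) and hence the form of the relation (37.12) between $F$ and $G$.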
A helicity state of an electron polarized in the direction of $\mathbf p$ corresponds to a wave function $\psi = \binom{\xi}{0}$, and for polarization opposite to $\mathbf p$ we have $\psi = \binom{0}{\eta}$. Since the equations for $\xi$ and $\eta$ are separable, it is evident that this property is unaffected by scattering. Thus helicity is conserved in the scattering of ultra-relativistic electrons.

From considerations of symmetry (longitudinal polarization) it is obvious that there is no azimuthal asymmetry in the scattering of helical (longitudinally polarized) particles. We can also say that the scattering cross-section of helical electrons is independent of the sign of the helicity; this follows because a central field is invariant under inversion, while the sign of the helicity is reversed.

In the ultra-relativistic case, formulae (37.3)-(37.5) may be considerably simplified (D. R. Yennie, D. G. Ravenhall and R. N. Wilson, 1954).

† The function $F(\theta)$ cannot be expressed in a closed form in terms of the elementary functions, but it can be written as a certain double integral; see the paper cited in the last footnote.

Let the incident electron be polarized, say in the direction of motion $\mathbf n$. For a plane wave with a definite value of $\mathbf n\cdot\boldsymbol\sigma$, the spinor $\xi$ ($= (\phi + \chi)/\sqrt2$) is proportional to the same three-dimensional spinor $w$ as appeared in the standard representation of the wave. The relation between the spinor amplitudes of the incident and scattered waves in the new representation is therefore given by the same operator $\hat f$. As a result of the scattering, the polarization is rotated with the momentum to the direction $\mathbf n'$. The effect of the operator $\hat f$ on the spin wave function of the electron therefore reduces to a rotation of the spin through the angle $\theta$ (between $\mathbf n$ and $\mathbf n'$) about the axis $\boldsymbol\nu$. This rotation is itself equivalent to a rotation of the coordinates about that axis but in the opposite direction, i.e. through an angle $-\theta$.
Hence it follows that the operator $\hat f$ must be the same (apart from a factor) as the operator which transforms the wave function when the coordinates are changed in the way described, i.e. the operator (18.17) with $-\theta$ instead of $\theta$. A comparison of (37.3) with (18.17) shows that

$$ B/A = -i\tan\tfrac12\theta. \tag{38.2} $$

Thus, in the ultra-relativistic limit,

$$ \hat f = A(\theta)\left[1 - i\tan\tfrac12\theta\;\boldsymbol\nu\cdot\boldsymbol\sigma\right]. \tag{38.3} $$

The expression (37.4) for $A(\theta)$ can also be simplified if a relation between $\delta_\kappa$ and $\delta_{-\kappa}$ which exists in the ultra-relativistic limit is used. To derive this relation, we note that, when the terms in $m$ are omitted, the equations (35.4) for the functions $f$ and $g$ become invariant with respect to the changes $f \to g$, $\kappa \to -\kappa$, $g \to -f$, which do not affect the parameters of the particle or field itself. We must therefore have $f_\kappa/g_\kappa = -g_{-\kappa}/f_{-\kappa}$, and substitution of the asymptotic expressions gives

$$ \tan(pr - \tfrac12 l\pi + \delta_\kappa) = -\cot(pr - \tfrac12 l'\pi + \delta_{-\kappa}), $$

whence

$$ e^{2i\delta_\kappa} = e^{2i\delta_{-\kappa}}. \tag{38.4} $$

From this relation, and replacing the summation variable $l$ by $l - 1$ in the first term of the sum in (37.4), we find

$$ A(\theta) = \frac{1}{2ip}\sum_{l=1}^{\infty} l\,(e^{2i\delta_l} - 1)\left[P_l(\cos\theta) + P_{l-1}(\cos\theta)\right]. \tag{38.5} $$

From (38.2) it follows that $\operatorname{Re}(AB^*) = 0$. Hence, in the approximation considered, the cross-section is independent of the initial polarization of the particles, and an unpolarized beam remains unpolarized after scattering (see QM, (140.8)-(140.10)).

We may also note that, when $\theta \to \pi$, the expression (38.5) for $A(\theta)$ tends to zero as $(\pi - \theta)^2$ (since $P_l(-1) = (-1)^l$). The cross-section

$$ \frac{d\sigma}{do} = |A|^2 + |B|^2 = \frac{|A(\theta)|^2}{\cos^2\tfrac12\theta} \tag{38.6} $$

therefore tends to zero also. These properties do not occur, of course, in higher approximations with respect to the small quantity $m/\varepsilon$. In particular, analysis shows that as $\theta \to \pi$ the cross-section tends to a limit proportional to $(m/\varepsilon)^2$.
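The statement that the cross-section (38.6) is independent of the initial polarization follows from the operator (38.3) being a rotation up to the factor $A/\cos\tfrac12\theta$. A short sketch makes this concrete by applying the $2\times2$ matrix of $\hat f$ to arbitrary spinors. The amplitude `A`, the axis `nu` and the spinors used here are arbitrary illustrative inputs, and the function names are hypothetical.

```python
import math

def scattering_operator(A, theta, nu):
    """2x2 matrix of f = A [1 - i tan(theta/2) nu.sigma], eq. (38.3).

    nu is a unit 3-vector (nx, ny, nz); the Pauli matrices are in the
    usual representation, so nu.sigma = [[nz, nx - i ny], [nx + i ny, -nz]].
    """
    nx, ny, nz = nu
    t = math.tan(theta / 2.0)
    return [
        [A * (1.0 - 1j * t * nz), A * (-1j * t * (nx - 1j * ny))],
        [A * (-1j * t * (nx + 1j * ny)), A * (1.0 + 1j * t * nz)],
    ]

def cross_section(A, theta, nu, w):
    """d(sigma)/do = w'* w' / w* w with w' = f w, for a two-component spinor w."""
    f = scattering_operator(A, theta, nu)
    w_out = [f[0][0] * w[0] + f[0][1] * w[1], f[1][0] * w[0] + f[1][1] * w[1]]
    norm_in = abs(w[0])**2 + abs(w[1])**2
    return (abs(w_out[0])**2 + abs(w_out[1])**2) / norm_in

if __name__ == "__main__":
    A, theta, nu = 0.3 - 0.2j, 1.0, (0.6, 0.0, 0.8)
    print(cross_section(A, theta, nu, (1.0, 0.0)))
    print(cross_section(A, theta, nu, (0.3 + 0.4j, -0.5j)))
    print(abs(A)**2 / math.cos(theta / 2.0)**2)
```

The three printed numbers agree, since $(1 + i\tan\tfrac12\theta\,\boldsymbol\nu\cdot\boldsymbol\sigma)(1 - i\tan\tfrac12\theta\,\boldsymbol\nu\cdot\boldsymbol\sigma) = 1 + \tan^2\tfrac12\theta$ for any unit vector $\boldsymbol\nu$.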
For a Coulomb field in the ultra-relativistic case, the phases $\delta_\kappa$ are independent of the energy, as is seen from (36.19).† Hence, in a purely Coulomb field, the scattering cross-section for $\varepsilon \gg m$ has the form

$$ d\sigma = \tau(\theta)\,\frac{do}{\varepsilon^2}, \tag{38.7} $$

where $\tau$ is a function of the angle only.

† This is also evident directly from equations (38.1), since for a Coulomb field the energy $\varepsilon$ may be eliminated from the equations by the substitution $\mathbf r \to \mathbf r'/\varepsilon$.

§39. The continuous-spectrum wave functions for scattering in a Coulomb field

In later sections (§§95, 96) we shall consider various inelastic processes which occur when ultra-relativistic electrons are scattered in the field of a heavy nucleus ($Z\alpha \sim 1$). To calculate the relevant matrix elements, we need wave functions whose asymptotic form (as $r \to \infty$) is the sum of a plane wave and a spherical wave.

We shall see that, in the ultra-relativistic case (electron energy $\varepsilon \gg m$), the most significant values of the momentum transfer from electron to nucleus in scattering are $q = |\mathbf p' - \mathbf p| \sim m$. These values of $q$ correspond to impact parameters $\rho \sim 1/q \sim 1/m$, the electron being deflected through angles‡

$$ \theta \sim q/p \sim m/\varepsilon. \tag{39.1} $$

In terms of the coordinates $r$ (distance from the centre) and $z = r\cos\theta$, this represents the region

$$ \rho = r\sin\theta \sim 1/m, \qquad p(r - z) = pr(1 - \cos\theta) \sim 1, \tag{39.2} $$

and $r \sim \varepsilon/m^2$, so that the distances concerned are large.

‡ In this section, $p$ denotes $|\mathbf p|$.

We write Dirac's equation in the form

$$ (\varepsilon - U - m\beta + i\boldsymbol\alpha\cdot\nabla)\psi = 0, \qquad U = -Z\alpha/r, \tag{39.3} $$

and transform it into a second-order equation by applying the operator $\varepsilon - U + m\beta - i\boldsymbol\alpha\cdot\nabla$:

$$ (\Delta + p^2 - 2\varepsilon U)\psi = (-i\boldsymbol\alpha\cdot\nabla U - U^2)\psi. \tag{39.4} $$

Since $r \gg Z\alpha/\varepsilon$ in the region considered, $U \ll \varepsilon$. As a first approximation, the right-hand side of (39.4) may be neglected.
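The remaining equation,

$$ (\Delta + p^2 + 2\varepsilon Z\alpha/r)\psi = 0, \tag{39.5} $$

is of the same form as the non-relativistic Schrödinger's equation in a Coulomb field:

$$ \left(\frac{1}{2m}\Delta + \frac{Z\alpha}{r} + \frac{p^2}{2m}\right)\psi = 0, \tag{39.5a} $$

differing only in an obvious change in the notation for the parameters (the "potential energy" containing an extra factor $\varepsilon/m$). We can therefore write down immediately the solution which has the required asymptotic form (see QM, §136). For example, the wave function which asymptotically comprises a plane wave ($\propto e^{i\mathbf p\cdot\mathbf r}$) and an outgoing spherical wave is

$$ \psi^{(+)}_{\varepsilon\mathbf p} = \frac{C}{\sqrt{2\varepsilon}}\,u_{\varepsilon\mathbf p}\,e^{i\mathbf p\cdot\mathbf r}F\left(\frac{iZ\alpha\varepsilon}{p},\,1,\,i(pr - \mathbf p\cdot\mathbf r)\right), \qquad C = e^{\pi Z\alpha\varepsilon/2p}\,\Gamma\left(1 - \frac{iZ\alpha\varepsilon}{p}\right), \tag{39.6} $$

where $F$ is the confluent hypergeometric function and $u_{\varepsilon\mathbf p}$ the constant bispinor amplitude of the plane wave, normalized by the condition stated earlier (23.4):

$$ \bar u_{\varepsilon\mathbf p}u_{\varepsilon\mathbf p} = 2m. \tag{39.7} $$

The wave function (39.6) is normalized in such a way that the plane wave in its asymptotic limit has the usual form,

$$ \psi = \frac{1}{\sqrt{2\varepsilon}}\,u_{\varepsilon\mathbf p}\,e^{i\mathbf p\cdot\mathbf r}, $$

corresponding to "one particle in unit volume". Since $p \approx \varepsilon$ in the ultra-relativistic case, we can write $Z\alpha\varepsilon/p \approx Z\alpha$ in (39.6):

$$ \psi^{(+)}_{\varepsilon\mathbf p} = \frac{C}{\sqrt{2\varepsilon}}\,u_{\varepsilon\mathbf p}\,e^{i\mathbf p\cdot\mathbf r}F(iZ\alpha,\,1,\,i(pr - \mathbf p\cdot\mathbf r)), \qquad C = e^{\pi Z\alpha/2}\,\Gamma(1 - iZ\alpha). \tag{39.8} $$

It should be noted that, although we are considering distances so large that $pr \gg 1$, the hypergeometric function in (39.8) cannot be replaced by its asymptotic form: the argument of $F$ is not $pr$ but $pr(1 - \cos\theta)$, which is not assumed large.†

† In QM, §135, we were concerned with arbitrarily large $r$, and this approximation was therefore allowable for all values of $\theta$.

In applications, the next approximation for $\psi$ is also needed, which has a spinor structure different from (39.8) (the latter reducing to the factor $u_{\varepsilon\mathbf p}$). To calculate this approximation, we write $\psi$ in the form

$$ \psi = \frac{C}{\sqrt{2\varepsilon}}\,e^{i\mathbf p\cdot\mathbf r}(u_{\varepsilon\mathbf p}F + \phi). $$

On the right-hand side of (39.4) we now retain the term linear in $U$, obtaining for $\phi$ the equation

$$ (\Delta + 2i\mathbf p\cdot\nabla - 2\varepsilon U)\phi = -i(\boldsymbol\alpha\cdot\nabla U)\,u_{\varepsilon\mathbf p}F. $$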
(39.9)

The solution of this may be found by noticing that the function $F$ satisfies the equation

$$ (\Delta + 2i\mathbf p\cdot\nabla - 2\varepsilon U)F = 0, $$

as may be seen by substituting (39.6) in (39.5). Applying the operator $\nabla$ to this equation, we obtain

$$ (\Delta + 2i\mathbf p\cdot\nabla - 2\varepsilon U)\nabla F = 2\varepsilon F\,\nabla U. $$

A comparison with (39.9) shows that

$$ \phi = -\frac{i}{2\varepsilon}(\boldsymbol\alpha\cdot\nabla)\,u_{\varepsilon\mathbf p}F. $$

The final expressions for $\psi^{(+)}$ and for a similar function $\psi^{(-)}$ whose asymptotic form contains an ingoing spherical wave are

$$ \psi^{(+)}_{\varepsilon\mathbf p} = \frac{C}{\sqrt{2\varepsilon}}\,e^{i\mathbf p\cdot\mathbf r}\left(1 - \frac{i}{2\varepsilon}\boldsymbol\alpha\cdot\nabla\right)F(iZ\alpha,\,1,\,i(pr - \mathbf p\cdot\mathbf r))\,u_{\varepsilon\mathbf p}, $$

$$ \psi^{(-)}_{\varepsilon\mathbf p} = \frac{C^*}{\sqrt{2\varepsilon}}\,e^{i\mathbf p\cdot\mathbf r}\left(1 - \frac{i}{2\varepsilon}\boldsymbol\alpha\cdot\nabla\right)F(-iZ\alpha,\,1,\,-i(pr + \mathbf p\cdot\mathbf r))\,u_{\varepsilon\mathbf p}, \tag{39.10} $$

$$ C = e^{\pi Z\alpha/2}\,\Gamma(1 - iZ\alpha) $$

(W. H. Furry, 1934).

We shall also write out the corresponding functions $\psi^{(\pm)}_{-\varepsilon,-\mathbf p}$ with "negative frequency", which are needed when dealing with processes which involve positrons. These can be derived from the functions $\psi^{(\pm)}_{\varepsilon\mathbf p}$ by the substitutions $\mathbf p \to -\mathbf p$, $\varepsilon \to -\varepsilon$, with $p = |\mathbf p|$ unchanged; the parameter $iZ\alpha$ of the hypergeometric function therefore changes sign, as will be seen from the original expression (39.6), where this parameter occurs in the form $iZ\alpha\varepsilon/p$. Thus we have

$$ \psi^{(\pm)}_{-\varepsilon,-\mathbf p} = \frac{C}{\sqrt{2\varepsilon}}\,e^{-i\mathbf p\cdot\mathbf r}\left(1 + \frac{i}{2\varepsilon}\boldsymbol\alpha\cdot\nabla\right)F(\mp iZ\alpha,\,1,\,\pm i(pr \pm \mathbf p\cdot\mathbf r))\,u_{-\varepsilon,-\mathbf p}, \qquad C = e^{-\pi Z\alpha/2}\,\Gamma(1 \pm iZ\alpha). \tag{39.11} $$

The following comment is necessary regarding the above calculations. Our asymptotic condition is not in itself sufficient to provide a unique choice of the solution of the wave equation; this is clear, since we can always add to $\psi$ any outgoing Coulomb spherical wave without violating the condition. By writing the solution of equation (39.5) in the form (39.6), we have tacitly presupposed the choice of a solution finite at $r = 0$. This requirement was necessary in QM, §§135, 136, where we were considering solutions, valid in all space, of the exact Schrödinger's equation.† In the present case, however, equation (39.5) applies only to large distances, and therefore the choice of solution demands further justification.
This is provided by the fact that large impact parameters $\rho = r\sin\theta$ correspond to large orbital angular momenta $l$ and small scattering angles $\theta$: when $\rho \sim 1/m$, we have $l \sim \rho p \sim \rho\varepsilon \sim \varepsilon/m \gg 1$, and the angle $\theta$ may be estimated by a quasi-classical procedure:

$$\theta \sim \frac{\Delta p}{p} \sim \frac{1}{p}\int\frac{\partial U}{\partial\rho}\,dt \sim \frac{Z\alpha}{\rho p} \sim \frac{Z\alpha m}{\varepsilon}.$$

Thus, in the expansion of $\psi$ in terms of spherical waves the main contribution (in this range of $r$ and $\theta$) will come from waves with these large values of $l$. But a spherical wave with large $l$ will certainly decrease to small values at distances from the origin $r \ll l/\varepsilon$ which are "classically inaccessible" (because of the centrifugal barrier). Hence, if we "join" the solution of equation (39.5) to that of the exact equation (39.4) at short distances $r \sim r_1$, where $l/\varepsilon \gg r_1 \gg Z\alpha/\varepsilon$, then the boundary condition for the solution of equation (39.5) will be that it is small, and this justifies our choice.

‡ In the method of solution given in QM, §135, the condition of finiteness at $r = 0$ was satisfied by taking the particular integral in the form (135.1) instead of as a general sum of integrals with different values of $\beta_1$ and $\beta_2$.

PROBLEM

For an attractive Coulomb field with $Z\alpha \ll 1$, find the correction (of relative order $Z\alpha$) to the non-relativistic wave function of the discrete spectrum.

SOLUTION. The electron velocity in a bound state is $v \sim Z\alpha$, and therefore, for $Z\alpha \ll 1$, the wave function is non-relativistic in the zero-order approximation, i.e. $\psi = u\psi_{\text{non-r}}$, where $\psi_{\text{non-r}}$ is the Schrödinger function and $u$ a bispinor of the form

$$u = \begin{pmatrix} w \\ 0 \end{pmatrix},$$

with $w$ a spinor describing the polarization state of the electron. In the next approximation, we write $\psi = u\psi_{\text{non-r}} + \psi^{(1)}$ and, substituting this in (39.4), obtain for $\psi^{(1)}$ the equation

$$\left(\frac{1}{2m}\Delta - |\varepsilon_n| + \frac{Z\alpha}{r}\right)\psi^{(1)} = \frac{iZ\alpha}{2m}\left(\boldsymbol{\alpha}\cdot\nabla\frac{1}{r}\right)u\,\psi_{\text{non-r}},$$

where $\varepsilon_n = -|\varepsilon_n|$ is the non-relativistic discrete energy level. Here we have omitted terms of relative order $\sim (Z\alpha)^2$; in the non-relativistic case, the important distances are of the order of the Bohr radius, $r \sim 1/mZ\alpha$.
The solution of this equation is $\psi^{(1)} = -(i/2m)(\boldsymbol{\alpha}\cdot\nabla\psi_{\text{non-r}})\,u$, and therefore

$$\psi = \left(1 - \frac{i}{2m}\boldsymbol{\alpha}\cdot\nabla\right)u\,\psi_{\text{non-r}}.$$

§40. An electron in the field of an electromagnetic plane wave

Dirac's equation can be solved exactly for an electron moving in the field of an electromagnetic plane wave (D. M. Volkov, 1937).

The field of a plane wave with wave 4-vector $k$ ($k^2 = 0$) depends on the 4-coordinates only in the combination $\varphi = kx$, so that the 4-potential is

$$A^\mu = A^\mu(\varphi), \tag{40.1}$$

and satisfies the Lorentz gauge condition $\partial_\mu A^\mu = k_\mu A^{\mu\prime} = 0$, the prime denoting differentiation with respect to $\varphi$. Since the constant term in $A$ is unimportant, we can omit the prime, writing the condition as

$$kA = 0. \tag{40.2}$$

We start from the second-order equation (32.6), in which the field tensor is

$$F_{\mu\nu} = k_\mu A'_\nu - k_\nu A'_\mu. \tag{40.3}$$

When expanding the square $(i\partial - eA)^2$ it must be remembered that, from (40.2), $\partial_\mu(A^\mu\psi) = A^\mu\partial_\mu\psi$. The result is

$$\left[-\partial^2 - 2ie(A\partial) + e^2A^2 - m^2 - ie(\gamma k)(\gamma A')\right]\psi = 0, \tag{40.4}$$

where $\partial^2 = \partial_\mu\partial^\mu$.

We seek a solution of this equation in the form

$$\psi = e^{-ipx}F(\varphi), \tag{40.5}$$

where $p$ is a constant 4-vector. This form of the function $\psi$ is unaltered by adding to $p$ any constant multiple of the vector $k$, if the function $F(\varphi)$ is appropriately redefined. We can therefore, without loss of generality, impose one further condition on $p$. Let

$$p^2 = m^2. \tag{40.6}$$

Then, when the field is removed, the quantum numbers $p^\mu$ become the components of the free particle 4-momentum. The significance of the components of the 4-vector $p$, when the field is present, is more clearly seen in a particular frame of reference chosen so that $A^0 = 0$. Let the vector $\mathbf{A}$ in this frame be along the $x^1$-axis and $\mathbf{k}$ along the $x^3$-axis; the electric field of the wave is then along $x^1$, the magnetic field along $x^2$, and the wave itself is propagated along $x^3$.
Then (40.5) will be an eigenfunction of the operators

$$\hat{p}^1 = i\frac{\partial}{\partial x_1},\qquad \hat{p}^2 = i\frac{\partial}{\partial x_2},\qquad \hat{p}^0 - \hat{p}^3 = i\left(\frac{\partial}{\partial x_0} - \frac{\partial}{\partial x_3}\right),$$

with eigenvalues $p^1$, $p^2$, $p^0 - p^3$; the operators themselves are easily seen to commute with the Hamiltonian of Dirac's equation. Thus, in this frame of reference, $p^1$ and $p^2$ are the components of the generalized momentum along the $x^1$ and $x^2$ axes; $p^0 - p^3$ is the difference between the total energy and the $x^3$-component of the generalized momentum.

In substituting (40.5) in (40.4), we note that

$$\partial^\mu F = k^\mu F',\qquad \partial_\mu\partial^\mu F = k^2F'' = 0,$$

and obtain for $F(\varphi)$ the equation

$$2i(kp)F' + \left[-2e(pA) + e^2A^2 - ie(\gamma k)(\gamma A')\right]F = 0.$$

The integral of this equation is

$$F = \exp\left\{-i\int_0^{kx}\left[\frac{e(pA)}{(kp)} - \frac{e^2A^2}{2(kp)}\right]d\varphi + \frac{e(\gamma k)(\gamma A)}{2(kp)}\right\}\frac{u}{\sqrt{2p_0}},$$

where $u/\sqrt{2p_0}$ is an arbitrary constant bispinor; the reason for writing it in this form will be shown below.

All powers of $(\gamma k)(\gamma A)$ above the first are zero, since

$$(\gamma k)(\gamma A)(\gamma k)(\gamma A) = -(\gamma k)(\gamma k)(\gamma A)(\gamma A) + 2(kA)(\gamma k)(\gamma A) = -k^2A^2 = 0.$$

We can therefore write

$$\exp\left\{\frac{e(\gamma k)(\gamma A)}{2(kp)}\right\} = 1 + \frac{e(\gamma k)(\gamma A)}{2(kp)},$$

so that $\psi$ becomes

$$\psi_p = \left[1 + \frac{e}{2(kp)}(\gamma k)(\gamma A)\right]\frac{u}{\sqrt{2p_0}}\,e^{iS}, \tag{40.7}$$

where†

$$S = -px - \int_0^{kx}\left[\frac{e(pA)}{(kp)} - \frac{e^2A^2}{2(kp)}\right]d\varphi. \tag{40.8}$$

To determine the conditions to be imposed on the constant bispinor $u$, we must suppose that the wave is "switched on" with infinite slowness, starting from $t = -\infty$. Then $A \to 0$ when $kx \to -\infty$, and $\psi$ must become the solution of the free Dirac's equation. Consequently, $u = u(p)$ must satisfy

$$(\gamma p - m)u = 0. \tag{40.9}$$

This condition rejects the "redundant" solutions of the second-order equation. Since $u$ is independent of time, the condition remains valid for finite $kx$. Thus $u(p)$ is the same as the bispinor amplitude of the free plane wave; we shall take it to be normalized by the same condition (23.4): $\bar{u}u = 2m$.

The foregoing arguments also show immediately the normalization of the wave functions (40.7). The infinitely slow application of the field does not alter the normalization integral. Hence it follows that the functions (40.7) satisfy the same normalization condition,

$$\int\psi^*_{p'}\psi_p\,d^3x = \int\bar{\psi}_{p'}\gamma^0\psi_p\,d^3x = (2\pi)^3\delta(\mathbf{p}' - \mathbf{p}), \tag{40.10}$$

as the free plane waves.
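The truncation of the exponential of $(\gamma k)(\gamma A)$ used in obtaining (40.7) can be checked numerically. The following is only a minimal sketch: it assumes the standard representation of the Dirac matrices and the metric $(+,-,-,-)$, and the particular values of $k$, $A$, $p$ and $e$ are arbitrary illustrative choices (with $m = 1$), not taken from the text.

```python
import numpy as np

# Dirac matrices in the standard representation (an assumption of this sketch).
I2, Z2 = np.eye(2), np.zeros((2, 2))
pauli = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
gamma = [np.block([[I2, Z2], [Z2, -I2]]).astype(complex)]
gamma += [np.block([[Z2, s], [-s, Z2]]) for s in pauli]
metric = np.diag([1.0, -1.0, -1.0, -1.0])

def slash(a):
    """Return (gamma a) = gamma^mu a_mu for a contravariant 4-vector a."""
    a_low = metric @ a
    return sum(gamma[mu] * a_low[mu] for mu in range(4))

# Illustrative values: wave along x3, potential along x1, so k^2 = 0 and kA = 0.
k = np.array([1.3, 0.0, 0.0, 1.3])
A = np.array([0.0, 0.7, 0.0, 0.0])
p = np.array([np.sqrt(1.0 + 0.2**2 + 0.5**2), 0.2, 0.0, 0.5])  # p^2 = m^2, m = 1
e = 0.3

M = slash(k) @ slash(A)          # the matrix (gamma k)(gamma A)
kp = p @ metric @ k              # the scalar product (kp)
X = e * M / (2 * kp)

# Sum the power series of exp(X) to high order and compare with 1 + X.
exp_X = np.eye(4, dtype=complex)
term = np.eye(4, dtype=complex)
for n in range(1, 20):
    term = term @ X / n
    exp_X = exp_X + term

nilpotent = np.allclose(M @ M, 0)                 # ((gamma k)(gamma A))^2 = 0
truncates = np.allclose(exp_X, np.eye(4) + X)     # exp(X) = 1 + X
```

Both flags come out true for any choice with $k^2 = 0$ and $kA = 0$; dropping either condition in the sample vectors makes the square of the matrix non-zero.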
Let us calculate the current density corresponding to the functions (40.7), first noting that

$$\bar{\psi}_p = \frac{\bar{u}}{\sqrt{2p_0}}\left[1 + \frac{e}{2(kp)}(\gamma A)(\gamma k)\right]e^{-iS},$$

and hence obtaining by direct multiplication

$$j^\mu = \bar{\psi}_p\gamma^\mu\psi_p = \frac{1}{p_0}\left\{p^\mu - eA^\mu + k^\mu\left(\frac{e(pA)}{(kp)} - \frac{e^2A^2}{2(kp)}\right)\right\}. \tag{40.11}$$

If the $A^\mu(\varphi)$ are periodic functions, and their time-average value is zero, the mean value of the current density is

$$\bar{j}^\mu = \frac{1}{p_0}\left(p^\mu - \frac{e^2\overline{A^2}}{2(kp)}k^\mu\right). \tag{40.12}$$

† The function $S$ in (40.8) is the same as the classical action for a particle moving in the field of a wave; cf. Fields, §47, Problem 2.

We can also find the kinetic momentum density in the state $\psi_p$. The kinetic momentum operator is the difference $\hat{p} - eA = i\partial - eA$. A direct calculation gives

$$\psi^*_p(\hat{p}^\mu - eA^\mu)\psi_p = \bar{\psi}_p\gamma^0(\hat{p}^\mu - eA^\mu)\psi_p = p^\mu - eA^\mu + k^\mu\left(\frac{e(pA)}{(kp)} - \frac{e^2A^2}{2(kp)}\right) + k^\mu\frac{ie}{8(kp)p_0}F_{\lambda\nu}(u^*\sigma^{\lambda\nu}u). \tag{40.13}$$

The time-average value of this 4-vector, denoted by $q^\mu$, is

$$q^\mu = p^\mu - \frac{e^2\overline{A^2}}{2(kp)}k^\mu. \tag{40.14}$$

Its square is

$$q^2 = m_*^2,\qquad m_* = m\sqrt{1 - \frac{e^2\overline{A^2}}{m^2}}, \tag{40.15}$$

where $m_*$ acts as an "effective mass" of the electron in the field. A comparison of (40.14) with (40.12) shows that

$$\bar{j}^\mu = q^\mu/p_0. \tag{40.16}$$

The normalization condition (40.10), expressed in terms of the vector $q$, is

$$\int\psi^*_{p'}\psi_p\,d^3x = (2\pi)^3\frac{q_0}{p_0}\,\delta(\mathbf{q}' - \mathbf{q}); \tag{40.17}$$

this is most simply proved in the particular frame of reference mentioned above.

§41. Motion of spin in an external field

The quasi-classical approximation in Dirac's equation is reached in the same way as in the non-relativistic theory. In the second-order equation (32.7a) we substitute†

$$\psi = u\,e^{iS/\hbar},$$

where $S$ is a scalar and $u$ a slowly varying bispinor. The usual condition of the quasi-classical case is assumed to be satisfied: the momentum of the particle must vary only slightly over distances of the order of the wavelength $\hbar/|\mathbf{p}|$.

In the zero-order approximation with respect to $\hbar$, we obtain the usual classical relativistic Hamilton-Jacobi equation for the action $S$. All the terms which contain the spin (and are proportional to $\hbar$) are absent from the equations of motion. The spin would appear only in the next approximation with respect to $\hbar$.

† Ordinary units will be used at first.
Thus the influence of the magnetic moment of the electron on its motion is always of the same order of magnitude as the quantum corrections. This is to be expected, since the spin is a purely quantum property and its magnitude is proportional to $\hbar$.

We can therefore reasonably formulate the question of how the electron spin will behave when the electron is executing a given quasi-classical motion in an external field. The answer to this question is contained in the next approximation with respect to $\hbar$ in Dirac's equation. We shall, however, use another method whose significance is more obvious and which does not directly involve Dirac's equation. It has the advantage of allowing a treatment of the motion of any particle, including a particle which has an "anomalous" gyromagnetic ratio not describable by Dirac's equation.

The objective is to derive an "equation of motion" for the spin when the particle moves in any (given) manner. Let us first take the non-relativistic case.

The non-relativistic Hamiltonian of a particle in an external field is

$$\hat{H} = \hat{H}' - \mu\boldsymbol{\sigma}\cdot\mathbf{H}, \tag{41.1}$$

where $\hat{H}'$ includes all terms independent of the spin (see QM, §111), and $\mu$ is the magnetic moment of the particle. This form of the Hamiltonian relates to any kind of particle. For electrons, $\mu = e\hbar/2mc$ (the electron charge being $e = -|e|$), and for nucleons $\mu$ also contains the "anomalous" part†

$$\mu' = \mu - e\hbar/2mc. \tag{41.2}$$

† When radiative corrections are taken into account, the magnetic moment of the electron also contains a very small "anomalous" part.

According to the general rules of quantum mechanics, the operator equation of motion for the spin is obtained from the formula

$$\dot{\hat{\mathbf{s}}} = \frac{i}{\hbar}(\hat{H}\hat{\mathbf{s}} - \hat{\mathbf{s}}\hat{H}) = \frac{i}{2\hbar}(\hat{H}\boldsymbol{\sigma} - \boldsymbol{\sigma}\hat{H}). \tag{41.3}$$

Substitution of (41.1) gives

$$\dot{\boldsymbol{\sigma}} = \frac{2\mu}{\hbar}\boldsymbol{\sigma}\times\mathbf{H},$$

or

$$\dot{\hat{\mathbf{s}}} = \frac{2\mu}{\hbar}\hat{\mathbf{s}}\times\mathbf{H}. \tag{41.4}$$

We average this operator equation over the state of the quasi-classical wave packet moving in a given path. This is equivalent to replacing the spin operator by its mean value $\bar{\mathbf{s}}$, and the vector $\mathbf{H}$ by the function $\mathbf{H}(t)$, which represents the change in the magnetic field at the position of the particle (or wave packet) as the latter moves along its path. In the non-relativistic approximation (i.e. in terms of Pauli's equation), $\hat{\mathbf{s}} = \tfrac{1}{2}\boldsymbol{\sigma}$ is the spin operator of the particle in its rest frame, whose mean value was denoted in §29 by $\tfrac{1}{2}\boldsymbol{\zeta}$. Thus we obtain the equation

$$\frac{d\boldsymbol{\zeta}}{dt} = \frac{2\mu}{\hbar}\boldsymbol{\zeta}\times\mathbf{H}(t). \tag{41.5}$$

This form of the equation is, in essence, purely classical. It signifies that the magnetic moment vector precesses about the direction of the field with angular velocity $-2\mu\mathbf{H}/\hbar$, remaining constant in magnitude.†

† The classical equation (41.5) can be derived directly from the equation $d\mathbf{M}/dt = \boldsymbol{\mu}\times\mathbf{H}$, where $\mathbf{M}$ is the angular momentum and $\boldsymbol{\mu}$ the magnetic moment of the system; $\boldsymbol{\mu}\times\mathbf{H}$ is the torque acting on the system. Putting $\mathbf{M} = \tfrac{1}{2}\hbar\boldsymbol{\zeta}$, $\boldsymbol{\mu} = \mu\boldsymbol{\zeta}$, we have (41.5).

Again in the non-relativistic case, the velocity $\mathbf{v}$ of the particle varies in accordance with the equation $d\mathbf{v}/dt = e\mathbf{v}\times\mathbf{H}/mc$, i.e. the vector $\mathbf{v}$ rotates about the direction of $\mathbf{H}$ with angular velocity $-e\mathbf{H}/mc$. If $\mu' = 0$, then $\mu = e\hbar/2mc$, and this angular velocity is the same as the angular velocity $-2\mu\mathbf{H}/\hbar$ with which the vector $\boldsymbol{\zeta}$ rotates; thus the polarization vector is at a constant angle to the direction of motion. We shall see below that this result remains valid in the relativistic case.

Let us now proceed to the relativistic generalization of equation (41.5). For a covariant description of the polarization it is necessary to use the 4-vector $a$ defined in §29, and the equation of motion of the spin will determine its derivative with respect to the proper time $\tau$.‡

‡ From here onwards we again take $c = 1$, $\hbar = 1$.

The form of this equation is given by considerations of relativistic invariance: its right-hand side must be linear and homogeneous in the electromagnetic field tensor $F^{\mu\nu}$ and in the 4-vector $a^\mu$, and apart from these can include only the 4-velocity $u^\mu = p^\mu/m$. The only form of equation satisfying these conditions is

$$\frac{da^\mu}{d\tau} = \alpha F^{\mu\nu}a_\nu + \beta u^\mu F^{\nu\lambda}u_\nu a_\lambda, \tag{41.6}$$

where $\alpha$ and $\beta$ are constant coefficients. It is easily seen, from the condition $a_\mu u^\mu = 0$ and the antisymmetry of the tensor $F^{\mu\nu}$ (whence $F^{\mu\nu}u_\mu u_\nu = 0$), that no other expressions with the required properties can be constructed.

As $\mathbf{v} \to 0$, this equation must become the same as (41.5). Putting $a^\mu = (0, \boldsymbol{\zeta})$, $u^\mu = (1, 0)$, $\tau = t$, we have

$$d\boldsymbol{\zeta}/dt = \alpha\,\boldsymbol{\zeta}\times\mathbf{H}.$$

A comparison with (41.5) shows that $\alpha = 2\mu$.

To determine $\beta$, we use the fact that $a_\mu u^\mu = 0$. Differentiating this with respect to $\tau$ and using the classical equation of motion of a charge in a field,

$$m\,du^\mu/d\tau = eF^{\mu\nu}u_\nu$$

(see Fields, §23), we obtain

$$u_\mu\frac{da^\mu}{d\tau} = -a_\mu\frac{du^\mu}{d\tau} = -\frac{e}{m}F^{\mu\nu}u_\nu a_\mu = \frac{e}{m}F^{\mu\nu}u_\mu a_\nu.$$

Hence, on multiplying both sides of equation (41.6) by $u_\mu$, using the equation $u_\mu u^\mu = 1$ and cancelling the common factor $F^{\mu\nu}u_\mu a_\nu$, we have

$$\beta = -2\left(\mu - \frac{e}{2m}\right) = -2\mu'.$$

Thus the final relativistic equation of motion for the spin is

$$\frac{da^\mu}{d\tau} = 2\mu F^{\mu\nu}a_\nu - 2\mu' u^\mu F^{\nu\lambda}u_\nu a_\lambda \tag{41.7}$$

(V. Bargmann, L. Michel and V. L. Telegdi, 1959).†

We can change from the 4-vector $a$ to the quantity $\boldsymbol{\zeta}$ which directly represents the polarization of the particle in its "instantaneous" rest frame. The relation between $a$ and $\boldsymbol{\zeta}$ is given by formulae (29.7)-(29.9). First of all, from (41.7) we necessarily have $a_\mu\,da^\mu/d\tau = 0$, and therefore $a_\mu a^\mu = \text{constant}$. Since $a_\mu a^\mu = -\boldsymbol{\zeta}^2$, this is equivalent to the obvious result that the polarization $\boldsymbol{\zeta}$ of the particle remains unchanged in magnitude during its motion.

The equation which shows the change in direction of the polarization is obtained by using three-dimensional notation in (41.7).
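The space components of this equation are, in explicit form,

$$\frac{d\mathbf{a}}{dt} = \frac{2\mu m}{\varepsilon}\mathbf{a}\times\mathbf{H} + \frac{2\mu m}{\varepsilon}(\mathbf{a}\cdot\mathbf{v})\mathbf{E} - \frac{2\mu'\varepsilon}{m}\mathbf{v}(\mathbf{a}\cdot\mathbf{E}) + \frac{2\mu'\varepsilon}{m}\mathbf{v}(\mathbf{v}\cdot\mathbf{a}\times\mathbf{H}) + \frac{2\mu'\varepsilon}{m}\mathbf{v}(\mathbf{a}\cdot\mathbf{v})(\mathbf{v}\cdot\mathbf{E}).$$

Here we must substitute (29.9), using in the differentiation the equations $\mathbf{p} = \varepsilon\mathbf{v}$, $\varepsilon^2 = p^2 + m^2$, and the equations of motion

$$d\mathbf{p}/dt = e\mathbf{E} + e\mathbf{v}\times\mathbf{H},\qquad d\varepsilon/dt = e\mathbf{v}\cdot\mathbf{E}. \tag{41.8}$$

A lengthy but elementary calculation leads to the result‡

$$\frac{d\boldsymbol{\zeta}}{dt} = \frac{2\mu m + 2\mu'(\varepsilon - m)}{\varepsilon}\,\boldsymbol{\zeta}\times\mathbf{H} + \frac{2\mu'\varepsilon}{\varepsilon + m}(\mathbf{v}\cdot\mathbf{H})\,\mathbf{v}\times\boldsymbol{\zeta} + \frac{2\mu m + 2\mu'\varepsilon}{\varepsilon + m}\,\boldsymbol{\zeta}\times(\mathbf{E}\times\mathbf{v}). \tag{41.9}$$

† Equation (41.7) was first derived, in another form, by Ya. I. Frenkel' (1926).

‡ If the gyromagnetic ratio (Landé factor) $g$ is used (as is often done) for charged particles, with $\mu = g(e/2m)\cdot\tfrac{1}{2}$ ($= g(e\hbar/2mc)\cdot\tfrac{1}{2}$ in ordinary units), this equation becomes

$$\frac{d\boldsymbol{\zeta}}{dt} = \frac{e}{2m}\left(g - 2 + \frac{2}{\gamma}\right)\boldsymbol{\zeta}\times\mathbf{H} + \frac{e}{2m}(g - 2)\frac{\gamma}{\gamma + 1}(\mathbf{v}\cdot\mathbf{H})\,\mathbf{v}\times\boldsymbol{\zeta} + \frac{e}{2m}\left(g - \frac{2\gamma}{\gamma + 1}\right)\boldsymbol{\zeta}\times(\mathbf{E}\times\mathbf{v}),\qquad \gamma = \frac{\varepsilon}{m}.$$

The variation of the direction of polarization relative to the direction of motion is of more interest than the variation of its absolute position in space. We write

$$\boldsymbol{\zeta} = \mathbf{n}\zeta_\parallel + \boldsymbol{\zeta}_\perp, \tag{41.10}$$

where $\mathbf{n} = \mathbf{v}/v$, and derive the equation for the component $\zeta_\parallel$ of the polarization in the direction of motion. A calculation using (41.8), (41.9) leads to the result†

$$\frac{d\zeta_\parallel}{dt} = 2\mu'\,\boldsymbol{\zeta}_\perp\cdot\mathbf{H}\times\mathbf{n} + \frac{2}{v}\left(\frac{\mu m^2}{\varepsilon^2} - \mu'\right)\boldsymbol{\zeta}_\perp\cdot\mathbf{E}. \tag{41.11}$$

The problems at the end of this section include a number of examples of the application of the above formulae. Here it may be noted that, in motion in a purely magnetic field, the polarization of a particle having no anomalous magnetic moment is at a constant angle to the velocity ($\zeta_\parallel = \text{constant}$). Thus this result, already mentioned previously for the non-relativistic case, is in fact a general one.

The conditions for the above formulae to be applicable can be stated more precisely. The requirement specified initially, that the momentum of the particle should vary sufficiently slowly, is equivalent to a certain condition that the fields $\mathbf{E}$ and $\mathbf{H}$ should be small; in particular, the Larmor radius in the magnetic field ($\sim p/eH$) must be large compared with the wavelength of the particle.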
There is also, however, another condition which must, strictly speaking, be fulfilled: the fields must not vary too rapidly in space, and must vary only slightly within the dimensions of the quasi-classical wave packet. That is, the field must vary only slightly over distances of the order of the particle wavelength ($1/p$) and of the Compton wavelength ($1/m$).‡ In practical problems of motion in macroscopic fields, however, the condition of slow variation is certainly satisfied, and only the condition of smallness remains.

† Equation (41.11) can be obtained a little more directly by writing explicitly the time component of equation (41.7).

‡ The latter requirement arises from the condition that the spread of velocities in the wave packet, in its rest frame, must be small compared with $c$, since otherwise the non-relativistic formulae could not be applied in this frame. If the field varies too rapidly, the equations may contain significant additional terms in the derivatives of the field with respect to the coordinates.

In §33 we have derived the first relativistic corrections for the Hamiltonian of an electron moving in an external field. For an electron in an electric field the approximate Hamiltonian is (see (33.12))

$$\hat{H} = \hat{H}' - \frac{e}{4m}\boldsymbol{\sigma}\cdot\mathbf{E}\times\hat{\mathbf{p}}/m,\qquad \hat{\mathbf{p}} = -i\nabla, \tag{41.12}$$

where $\hat{H}'$ includes the terms which do not contain the spin. In our case, since the field varies slowly, we neglect the term in $\hat{H}'$ which involves derivatives of $\mathbf{E}$ (i.e. the term in $\operatorname{div}\mathbf{E}$); the small term in $\hat{\mathbf{p}}^4$ may also be omitted, since it is unrelated to the field effects in question here. Thus $\hat{H}'$, in the absence of a magnetic field, reduces to the non-relativistic Hamiltonian $\hat{H}' = \hat{\mathbf{p}}^2/2m + e\Phi$.

Formula (41.12) can also be derived from (41.9) without making direct use of Dirac's equation. This method will generalize it (in the quasi-classical case) to particles with anomalous magnetic moment.
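The coefficients of the three terms in (41.9) can be cross-checked against their form in terms of the gyromagnetic ratio $g$, with $\mu = g(e/2m)\cdot\tfrac{1}{2}$ and $\mu' = \mu - e/2m$. The sketch below does this numerically in units $c = \hbar = 1$; the function names and the sample values of $e$, $m$, $\varepsilon$ and $g$ are illustrative assumptions, not quantities from the text.

```python
def spin_coefficients(mu, mu_anom, eps, m):
    """Coefficients of zeta x H, (v.H)(v x zeta) and zeta x (E x v) in (41.9)."""
    c_h = (2 * mu * m + 2 * mu_anom * (eps - m)) / eps
    c_vh = 2 * mu_anom * eps / (eps + m)
    c_e = (2 * mu * m + 2 * mu_anom * eps) / (eps + m)
    return c_h, c_vh, c_e

def spin_coefficients_g(g, e, eps, m):
    """The same three coefficients written with the g-factor and gamma = eps/m."""
    gam = eps / m
    pref = e / (2 * m)
    return (pref * (g - 2 + 2 / gam),
            pref * (g - 2) * gam / (gam + 1),
            pref * (g - 2 * gam / (gam + 1)))

# Illustrative values (assumptions): unit charge and mass, gamma = 3.7, g = 2.4.
e, m, eps, g = 1.0, 1.0, 3.7, 2.4
mu = g * e / (4 * m)            # mu = g (e/2m) (1/2)
mu_anom = mu - e / (2 * m)      # anomalous part, as in (41.2) with hbar = c = 1

from_mu = spin_coefficients(mu, mu_anom, eps, m)
from_g = spin_coefficients_g(g, e, eps, m)
max_difference = max(abs(a - b) for a, b in zip(from_mu, from_g))

# Spin precession rate minus the rotation rate e/eps of the velocity in a
# magnetic field perpendicular to v: only the anomalous moment survives.
relative_rate = from_mu[0] - e / eps
```

With these definitions `relative_rate` reduces to $2\mu'$, which is the statement that a particle with no anomalous moment keeps its polarization at a constant angle to the velocity in a purely magnetic field.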
The equation of motion of the spin in an electric field, as far as first-order terms in the velocity $\mathbf{v}$, is obtained from (41.9) as

$$\frac{d\boldsymbol{\zeta}}{dt} = (\mu + \mu')\,\boldsymbol{\zeta}\times(\mathbf{E}\times\mathbf{v}) = \left(\frac{e}{2m} + 2\mu'\right)\boldsymbol{\zeta}\times(\mathbf{E}\times\mathbf{v}).$$

If we impose the condition that this equation should be derived quantum-mechanically by commuting the spin operator with the Hamiltonian (as in (41.3)), then it is easily seen that we must put

$$\hat{H} = \hat{H}' - \left(\mu' + \frac{e}{4m}\right)\boldsymbol{\sigma}\cdot\mathbf{E}\times\hat{\mathbf{p}}/m. \tag{41.13}$$

This is the required expression. If $\mu' = 0$, we again obtain (41.12). It should be noted that the "normal" magnetic moment $e/2m$ is multiplied by an extra factor $\tfrac{1}{2}$ in comparison with the anomalous moment $\mu'$.†

† This is the "Thomas half" mentioned in the last footnote to §33. Its origin is clearly shown by the derivation given here.

PROBLEMS

PROBLEM 1. Determine the change of the direction of polarization of a particle when it moves in a plane perpendicular to a uniform magnetic field ($\mathbf{v}\perp\mathbf{H}$).

SOLUTION. The right-hand side of equation (41.9) is reduced to its first term, and the vector $\boldsymbol{\zeta}$ therefore precesses about the direction of $\mathbf{H}$ (the $z$-axis) with angular velocity

$$-\frac{2\mu m + 2\mu'(\varepsilon - m)}{\varepsilon}\mathbf{H} = -\left(\frac{e}{\varepsilon} + 2\mu'\right)\mathbf{H}.$$

The projection of $\boldsymbol{\zeta}$ on the $xy$-plane (denoted by $\boldsymbol{\zeta}_1$) rotates in that plane with the same angular velocity. The vector $\mathbf{v}$ rotates in that plane with angular velocity $-e\mathbf{H}/\varepsilon$, as can be seen from the equation of motion $\dot{\mathbf{p}} = \varepsilon\dot{\mathbf{v}} = e\mathbf{v}\times\mathbf{H}$. Hence $\boldsymbol{\zeta}_1$ rotates with angular velocity $-2\mu'\mathbf{H}$ relative to the direction of $\mathbf{v}$.

PROBLEM 2. The same as Problem 1, but for motion parallel to the magnetic field.

SOLUTION. When $\mathbf{v}$ and $\mathbf{H}$ are in the same direction, equation (41.9) reduces to

$$\frac{d\boldsymbol{\zeta}}{dt} = \frac{2\mu m}{\varepsilon}\,\boldsymbol{\zeta}\times\mathbf{H},$$

so that $\boldsymbol{\zeta}$ precesses about the common direction of $\mathbf{v}$ and $\mathbf{H}$ with angular velocity $-2\mu m\mathbf{H}/\varepsilon$.

PROBLEM 3. The same as Problem 1, but for motion in a uniform electric field.

SOLUTION. Let the field $\mathbf{E}$ be along the $x$-axis, and let the motion be in the $xy$-plane (with $p_y = \text{constant}$). According to (41.9), the vector $\boldsymbol{\zeta}$ precesses about the $z$-axis with instantaneous angular velocity

$$-\frac{2\mu m + 2\mu'\varepsilon}{\varepsilon + m}Ev_y = -\left(\frac{e}{\varepsilon + m} + 2\mu'\right)Ev_y.$$

We again resolve $\boldsymbol{\zeta}$ into components $\boldsymbol{\zeta}_1$ (in the $xy$-plane) and $\zeta_z$. Then $\zeta_\parallel = \zeta_1\cos\varphi$, $\boldsymbol{\zeta}_\perp\cdot\mathbf{E} = -\zeta_1\sin\varphi\cdot Ev_y/v$, where $\varphi$ is the angle between $\boldsymbol{\zeta}_1$ and $\mathbf{v}$. From (41.11), $\boldsymbol{\zeta}_1$ rotates relative to the direction of $\mathbf{v}$ with instantaneous angular velocity

$$\dot{\varphi} = \frac{2Ev_y}{v^2}\left(\frac{\mu m^2}{\varepsilon^2} - \mu'\right) = \left(\frac{em}{p^2} - 2\mu'\right)Ev_y.$$

§42. Neutron scattering in an electric field

In collisions between neutrons and nuclei, the scattering through large angles is determined by the main interaction, the nuclear forces. In small-angle scattering, however, it can be shown that the interaction of the magnetic moment of the neutron with the electric field of the nucleus becomes important (J. Schwinger, 1948).

We shall assume that the neutron is non-relativistic, so that the interaction in question is described by the approximate Hamiltonian (41.13). The magnetic moment of an electrically neutral particle is wholly "anomalous" and the operator $\hat{H}'$ reduces in this case to the kinetic-energy operator:†

$$\hat{H} = -\frac{\hbar^2}{2m}\Delta + i\frac{\mu\hbar}{mc}\boldsymbol{\sigma}\cdot\mathbf{E}\times\nabla. \tag{42.1}$$

† In this section, ordinary units are used, and $m$ denotes the mass of the neutron.

Since the electromagnetic interaction of the neutron is small, the corresponding scattering amplitude $f_{\text{em}}$ may be calculated by the Born approximation:

$$f_{\text{em}} = -\frac{m}{2\pi\hbar^2}\int e^{-i\mathbf{p}'\cdot\mathbf{r}/\hbar}\left(i\frac{\mu\hbar}{mc}\boldsymbol{\sigma}\cdot\mathbf{E}\times\nabla\right)e^{i\mathbf{p}\cdot\mathbf{r}/\hbar}\,d^3x$$

(see QM, §126), or

$$f_{\text{em}} = \frac{\mu}{2\pi\hbar^2c}\,\boldsymbol{\sigma}\cdot\mathbf{E}_{\mathbf{q}}\times\mathbf{p},\qquad \mathbf{E}_{\mathbf{q}} = \int\mathbf{E}(\mathbf{r})\,e^{-i\mathbf{q}\cdot\mathbf{r}}\,d^3x, \tag{42.2}$$

where $\mathbf{p}$ and $\mathbf{p}'$ are the neutron momenta before and after scattering, and $\hbar\mathbf{q} = \mathbf{p}' - \mathbf{p}$. In this form, the amplitude $f_{\text{em}}$ is an operator with respect to the spin variable.

Before continuing the calculation, we should note the following point. Formula (42.1) has been derived in §41 for slowly varying fields (which in practice meant neglecting terms in the Hamiltonian containing coordinate derivatives of the field). As applied to the Coulomb field of the nucleus, this means that the wavelength $\hbar/p$ must be small compared with the distances $r \sim 1/q$ which are important in the integral $\mathbf{E}_{\mathbf{q}}$. Hence $\hbar q \ll p$, so that the scattering angle $\theta \sim \hbar q/p \ll 1$.
Thus the required condition is in fact satisfied for small-angle scattering.

For a Coulomb field with potential $\Phi = Ze/r$, the Fourier component of the field is

$$\mathbf{E}_{\mathbf{q}} = -i\mathbf{q}\Phi_{\mathbf{q}} = -i\mathbf{q}\frac{4\pi Ze}{q^2};$$

see Fields, (51.5). Substitution in (42.2) gives

$$f_{\text{em}} = i\frac{2Ze\mu}{\hbar^3cq^2}\,\boldsymbol{\sigma}\cdot\mathbf{p}\times\mathbf{p}'.$$

For small scattering angles, $\hbar q \approx p\theta$ and $\mathbf{p}\times\mathbf{p}' \approx p^2\theta\boldsymbol{\nu}$, where $\boldsymbol{\nu}$ is a unit vector in the direction of $\mathbf{p}\times\mathbf{p}'$. Thus

$$f_{\text{em}} = i\frac{2Ze\mu}{\hbar c\,\theta}\,\boldsymbol{\sigma}\cdot\boldsymbol{\nu}.$$

The nuclear scattering amplitude must be added to this expression. Owing to the rapid decrease of the nuclear forces with increasing distance, this amplitude tends for small angles to a finite (energy-dependent) complex limit, which we denote by $a$. The total scattering amplitude is therefore

$$f = a + i(b/\theta)\,\boldsymbol{\sigma}\cdot\boldsymbol{\nu},\qquad b = 2Ze\mu/c\hbar = 2Z\alpha\mu/e. \tag{42.3}$$

We see that the electromagnetic scattering is indeed predominant at sufficiently small angles.

The expression (42.3) is the same in form as that discussed in QM, §140. We can therefore make direct use of the formulae derived there. The scattering cross-section summed over all possible final polarization states is

$$\frac{d\sigma}{do} = |a|^2 + \frac{b^2}{\theta^2} + \frac{2b}{\theta}\,\operatorname{im}a\;\boldsymbol{\nu}\cdot\boldsymbol{\zeta}, \tag{42.4}$$

where $\boldsymbol{\zeta}$ is the initial polarization of the neutron beam (called $\mathbf{P}$ in QM, §140). If the initial state is unpolarized ($\boldsymbol{\zeta} = 0$), then the polarization after scattering is

$$\boldsymbol{\zeta}' = \frac{2b\theta\,\operatorname{im}a}{|a|^2\theta^2 + b^2}\,\boldsymbol{\nu}. \tag{42.5}$$

This is a maximum when $\theta = b/|a|$, and $\zeta'_{\max} = \operatorname{im}a/|a|$.

CHAPTER V

RADIATION

§43. The electromagnetic interaction operator

The interaction of electrons with an electromagnetic field can, as a rule, be treated by means of perturbation theory. This is because the electromagnetic interaction is comparatively weak, as is shown by the smallness of the corresponding dimensionless "coupling constant", viz. the fine-structure constant $\alpha = e^2/\hbar c = 1/137$. The smallness of this number is of fundamental importance in quantum electrodynamics.
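As a small arithmetic illustration of the value quoted for the fine-structure constant, $\alpha = e^2/\hbar c$ can be evaluated in Gaussian units. The numerical constants below are standard reference values converted to CGS units; they are supplied here as an assumption and are not part of the original text.

```python
# Reference values in Gaussian (CGS) units, supplied for illustration only.
e_esu = 4.80320471e-10     # elementary charge, statcoulomb
hbar = 1.054571817e-27     # reduced Planck constant, erg s
c = 2.99792458e10          # speed of light, cm/s

alpha = e_esu**2 / (hbar * c)   # dimensionless coupling constant e^2 / (hbar c)
inverse_alpha = 1.0 / alpha
```

The reciprocal comes out close to 137, so that each additional order of perturbation theory in the interaction is suppressed by roughly two orders of magnitude.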
In classical electrodynamics (see Fields, §28), the electromagnetic interaction is described by the term

$$-ej^\mu A_\mu \tag{43.1}$$

in the Lagrangian density of the "field + charge" system ($A$ being the 4-potential of the field and $j$ the particle current density 4-vector). The current density satisfies the equation of continuity,

$$\partial_\mu j^\mu = 0, \tag{43.2}$$

which expresses the law of conservation of charge. According to Fields, §29, the gauge invariance of the theory is closely related to this law: when $A_\mu$ is replaced by $A_\mu + \partial_\mu\chi$ (4.1), a term $-ej^\mu\partial_\mu\chi$ is added to the Lagrangian density (43.1), and this, by (43.2), may be written as the 4-divergence $-e\partial_\mu(\chi j^\mu)$; it therefore disappears on integration over $d^4x$ in the action $S = \int L\,d^4x$.

In quantum electrodynamics, the 4-vectors $j$ and $A$ are replaced by the corresponding second-quantized operators. The current operator is expressed in terms of the $\psi$-operators by $\hat{j} = \hat{\bar{\psi}}\gamma\hat{\psi}$. The generalized "coordinates" $q$ in the Lagrangian

$$\int L_{\text{int}}\,d^3x = -e\int(\hat{j}\hat{A})\,d^3x$$

are represented by the values of $\hat{\psi}$, $\hat{\bar{\psi}}$ and $\hat{A}$ at each point in space. Since the Lagrangian density is found to depend only on the "coordinates" $q$ themselves (and not on their derivatives with respect to $x$), the change to the Hamiltonian density by formula (10.11) amounts simply to a change in the sign of the Lagrangian density.† Thus the electromagnetic interaction operator (the space integral of the interaction Hamiltonian density) has the form

$$\hat{V} = e\int(\hat{j}\hat{A})\,d^3x. \tag{43.3}$$

† Independently of these arguments, it may be noted that, when only the first-order small correction is considered, any small correction in the Lagrangian appears in the Hamiltonian with just a change of sign (see Mechanics, §40).

The free electromagnetic field operator is the sum

$$\hat{A} = \sum_n\left[\hat{c}_nA_n(x) + \hat{c}_n^+A_n^*(x)\right], \tag{43.4}$$

which contains the operators of photon creation and annihilation in various states labelled by the suffix $n$.
Each operator has matrix elements only for an increase or decrease of the corresponding occupation number $N_n$ by 1 (the other occupation numbers remaining unchanged). The operator $\hat{A}$ therefore also has matrix elements only for transitions in which the number of photons changes by 1. That is, only processes of the emission or absorption of a single photon occur in the first approximation of perturbation theory.

According to (2.15), the matrix elements are

$$\langle N_n - 1|c_n|N_n\rangle = \langle N_n|c_n^+|N_n - 1\rangle = \sqrt{N_n}. \tag{43.5}$$

If there are no photons (of type $n$) in the initial state of the field, then $\langle 1|c_n^+|0\rangle = 1$. The matrix element of the operator (43.3) for photon emission is

$$V_{fi}(t) = e\int(j_{fi}A_n^*)\,d^3x, \tag{43.6}$$

where $A_n(x)$ is the wave function of the emitted photon and $j_{fi}$ the matrix element of the operator $\hat{j}$ for a transition of the emitter from the initial state $i$ to the final state $f$.† The 4-vector $j_{fi} = (\rho_{fi}, \mathbf{j}_{fi})$ is called the transition current.

† The notation in (43.6) is slightly inconsistent. The suffixes in $V_{fi}$ refer to states of the whole system "emitter + field", those in $j_{fi}$ to states of the emitter only.

Similarly, we obtain the matrix element for photon absorption:

$$V_{fi}(t) = e\int(j_{fi}A_n)\,d^3x. \tag{43.7}$$

This differs from (43.6) only by having $A_n(x)$ in place of $A_n^*(x)$.

The argument $t$ of $V_{fi}$ is shown in order to emphasize that the matrix element is time-dependent. By separating the time factors in the wave functions, we can change in the usual way to time-independent matrix elements:

$$V_{fi}(t) = V_{fi}\,e^{-i(E_i - E_f \mp \omega)t}, \tag{43.8}$$

where $E_i$, $E_f$ are the initial and final energies of the emitting system, and the signs $\mp$ are for emission and absorption respectively of a photon $\omega$.

The wave function of a photon with a definite momentum $\mathbf{k}$ and a definite polarization is

$$A^\mu = \sqrt{4\pi}\,\frac{e^\mu}{\sqrt{2\omega}}\,e^{i\mathbf{k}\cdot\mathbf{r}} \tag{43.9}$$

(see (4.3); the time factor is omitted).
Substituting in (43.6), we find the matrix element for the emission of such a photon:

$$V_{fi} = e\sqrt{4\pi}\,\frac{1}{\sqrt{2\omega}}\,e^*_\mu j^\mu_{fi}(\mathbf{k}), \tag{43.10}$$

where $j_{fi}(\mathbf{k})$ is the transition current in the momentum representation, i.e. the Fourier component

$$j_{fi}(\mathbf{k}) = \int j_{fi}(\mathbf{r})\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,d^3x. \tag{43.11}$$

The corresponding formula for photon absorption is

$$V_{fi} = e\sqrt{4\pi}\,\frac{1}{\sqrt{2\omega}}\,e_\mu j^\mu_{fi}(-\mathbf{k}). \tag{43.12}$$

The equation of conservation of current in the momentum representation is the condition of 4-transversality of the transition currents:

$$k_\mu j^\mu_{fi} = \omega\rho_{fi}(\mathbf{k}) - \mathbf{k}\cdot\mathbf{j}_{fi}(\mathbf{k}) = 0. \tag{43.13}$$

The formulae given in this section do not assume any particular form of the current operator, and are generally valid for electromagnetic processes involving any charged particles. The existing theory allows the form of the current operator to be determined (and hence, in principle, its matrix elements to be calculated) only for electrons. For applications to systems of strongly interacting particles, including nuclei, a semi-phenomenological theory will be used, in which the transition currents appear as empirically determined quantities subject only to the conditions of space-time symmetry and to the equation of continuity.

§44. Emission and absorption

The transition probability under the action of a perturbation $V$ is given, in the first approximation, by the well-known formulae of perturbation theory (QM, §42). Let the initial and final states of the emitting system belong to the discrete spectrum.† Then the probability (per unit time) of the transition $i \to f$ with emission of a photon is

$$dw = 2\pi|V_{fi}|^2\,\delta(E_i - E_f - \omega)\,d\nu, \tag{44.1}$$

where $d\nu$ arbitrarily denotes the ensemble of quantities describing the state of the photon and taking a continuous sequence of values; the photon wave function is assumed normalized by the delta function "on the $\nu$ scale".

† This certainly implies that recoil is neglected, the emitter as a whole remaining at rest.

If a photon having a definite angular momentum is emitted, the only continuous variable is the frequency $\omega$.
Integration of (44.1) with respect to $d\nu = d\omega$ eliminates the delta function, $\omega$ being replaced by $E_i - E_f$, and the transition probability is

$$w = 2\pi|V_{fi}|^2. \tag{44.2}$$

If, however, we consider the emission of a photon having a given momentum $\mathbf{k}$, then $d\nu = d^3k/(2\pi)^3 = \omega^2\,d\omega\,do/(2\pi)^3$. Here it is presupposed that the photon wave function (plane wave) is "normalized to one photon in the volume $V = 1$", as always in this book; $d\nu$ is the number of states in the phase volume $V\,d^3k$. Thus the probability of emission of a photon with a given momentum is

$$dw = 2\pi|V_{fi}|^2\,\delta(E_i - E_f - \omega)\,d^3k/(2\pi)^3, \tag{44.3}$$

or, after integration over $d\omega$,

$$dw = \frac{1}{4\pi^2}|V_{fi}|^2\,\omega^2\,do. \tag{44.4}$$

In this we must substitute the matrix element $V_{fi}$ from (43.10).

In subsequent sections we shall use these formulae to calculate the probability of emission in various specific cases. Here we shall consider certain general relations between radiative processes of various kinds.

If in the initial state of the field there is already a non-zero number $N_n$ of the photons in question, the matrix element for the transition is multiplied by

$$\langle N_n + 1|c_n^+|N_n\rangle = \sqrt{N_n + 1}, \tag{44.5}$$

i.e. the transition probability is multiplied by $N_n + 1$. The 1 in this factor corresponds to the spontaneous emission which occurs even if $N_n = 0$. The term $N_n$ represents the stimulated or induced emission: we see that the presence of photons in the initial state of the field stimulates the further emission of photons of the same kind.

The matrix element $V_{if}$ for the transition with the opposite change of state of the system ($f \to i$) differs from $V_{fi}$ in that (44.5) is replaced by

$$\langle N_n - 1|c_n|N_n\rangle = \sqrt{N_n}$$

(and the other quantities are replaced by their complex conjugates). This opposite transition is a transition of the system from the level $E_f$ to the level $E_i$ with absorption of a photon.
Thus the photon emission and absorption probabilities for a given pair of states $i$, $f$ are related by†

$$w_e/w_a = (N_n + 1)/N_n, \tag{44.6}$$

an expression first derived by A. Einstein (1916).

† In the rest of this section, ordinary units are used.

The number of photons can be related to the intensity of the external radiation incident on the system. Let

$$I_{\mathbf{k}e}\,d\omega\,do \tag{44.7}$$

be the radiation energy incident on unit area per unit time and having polarization $\mathbf{e}$, frequency in the range $d\omega$ and wave-vector direction in the solid-angle element $do$. These ranges correspond to $k^2\,dk\,do/(2\pi)^3$ field oscillators, each having $N_{\mathbf{k}e}$ photons of the specified polarization. Hence the same energy (44.7) is given by the product

$$c\,\frac{k^2\,dk\,do}{(2\pi)^3}\,N_{\mathbf{k}e}\hbar\omega = \frac{\hbar\omega^3}{8\pi^3c^2}\,N_{\mathbf{k}e}\,d\omega\,do.$$

From this we find the required relation:

$$N_{\mathbf{k}e} = \frac{8\pi^3c^2}{\hbar\omega^3}\,I_{\mathbf{k}e}. \tag{44.8}$$

Let $dw^{(\text{sp})}_{\mathbf{k}e}$ be the probability of spontaneous emission of a photon with polarization $\mathbf{e}$ into the solid angle $do$, and let the indices (in) and (a) denote the corresponding probabilities for induced emission and for absorption. According to (44.6) and (44.8), these probabilities are related as follows:

$$dw^{(\text{a})}_{\mathbf{k}e} = dw^{(\text{in})}_{\mathbf{k}e} = dw^{(\text{sp})}_{\mathbf{k}e}\cdot\frac{8\pi^3c^2}{\hbar\omega^3}\,I_{\mathbf{k}e}. \tag{44.9}$$

If the incident radiation is isotropic and unpolarized ($I_{\mathbf{k}e}$ independent of the directions of $\mathbf{k}$ and $\mathbf{e}$), then the integration of (44.9) with respect to $do$ and summation with respect to $\mathbf{e}$ gives similar relations between the total probabilities of radiative transitions (between given states $i$ and $f$ of the system):

$$w^{(\text{a})} = w^{(\text{in})} = w^{(\text{sp})}\,\frac{\pi^2c^2}{\hbar\omega^3}\,I, \tag{44.10}$$

where $I = 2\times 4\pi I_{\mathbf{k}e}$ is the total spectral intensity of the incident radiation.

If the states $i$ and $f$ of the emitting (or absorbing) system are degenerate, the total probability of emission (or absorption) of the photons concerned is found by summation over all mutually degenerate final states and averaging over all possible initial states. Let the degrees of degeneracy (statistical weights) of states $i$ and $f$ be $g_i$ and $g_f$.
For processes of spontaneous or induced emission, the states $i$ are the initial states, and for absorption the states $f$. Assuming in each case that all $g_i$ or $g_f$ initial states are equally probable, we obviously have instead of (44.10) the relations

$$g_f w^{(a)} = g_i w^{(in)} = g_i w^{(sp)}\,\frac{\pi^2c^2}{\hbar\omega^3}\,I. \tag{44.11}$$

In the literature one frequently meets the Einstein coefficients, defined as

$$A_{if} = w^{(sp)}, \quad B_{if} = w^{(in)}c/I, \quad B_{fi} = w^{(a)}c/I, \tag{44.12}$$

where $I/c$ is the spatial spectral density of radiation energy. They are related by the equations

$$g_f B_{fi} = g_i B_{if} = g_i A_{if}\,\pi^2c^3/\hbar\omega^3. \tag{44.13}$$

§45. Dipole radiation

Let us apply the formulae derived above to the emission of a photon by an electron (in general, a relativistic electron) moving in a given external field.

In this case the transition current is the matrix element of the operator $\hat{j} = \hat{\bar\psi}\gamma\hat\psi$, in which the $\psi$-operators are assumed expanded in terms of the wave functions of stationary states of the electron in a given field (§32). The matrix element $\langle 0_i 1_f|j|1_i 0_f\rangle$ corresponds to a transition of the electron from state $i$ to state $f$. This change in the occupation numbers is brought about by the operator $\hat a_f^+\hat a_i$, and the transition current is

$$j^\mu_{fi} = \bar\psi_f\gamma^\mu\psi_i = (\psi_f^*\psi_i,\ \psi_f^*\boldsymbol{\alpha}\psi_i), \tag{45.1}$$

where $\psi_i$ and $\psi_f$ are the wave functions of the initial and final states of the electron.

Let the wave function of the photon be chosen in the three-dimensionally transverse gauge (the polarization 4-vector $e = (0, \mathbf{e})$). Then the product $j_{fi}e^* = -\mathbf{j}_{fi}\cdot\mathbf{e}^*$ in (43.10). Substituting $V_{fi}$ in (44.4), we obtain the following expression for the probability (per unit time) of emission of a photon with polarization $\mathbf{e}$ into the solid-angle element $do$:

$$dw_{e\mathbf{n}} = e^2(\omega/2\pi)\,|\mathbf{e}^*\cdot\mathbf{j}_{fi}(\mathbf{k})|^2\,do, \tag{45.2}$$

where

$$\mathbf{j}_{fi}(\mathbf{k}) = \int \psi_f^*\boldsymbol{\alpha}\psi_i\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,d^3x. \tag{45.3}$$

Summation with respect to the polarization of the photon is effected by averaging over the directions of $\mathbf{e}$ (in a plane perpendicular to the given direction $\mathbf{n} = \mathbf{k}/\omega$), and the result is then doubled because of the two independent possible transverse polarizations of the photon.† Thus the result is

$$dw_{\mathbf{n}} = e^2(\omega/2\pi)\,|\mathbf{n}\times\mathbf{j}_{fi}(\mathbf{k})|^2\,do. \tag{45.4}$$

† In the averaging, we use the formula

$$\overline{e_ie_k^*} = \tfrac{1}{2}(\delta_{ik} - n_in_k) \tag{45.4a}$$

or

$$\overline{(\mathbf{a}\cdot\mathbf{e})(\mathbf{b}\cdot\mathbf{e}^*)} = \tfrac{1}{2}\{\mathbf{a}\cdot\mathbf{b} - (\mathbf{a}\cdot\mathbf{n})(\mathbf{b}\cdot\mathbf{n})\} = \tfrac{1}{2}(\mathbf{a}\times\mathbf{n})\cdot(\mathbf{b}\times\mathbf{n}), \tag{45.4b}$$

where $\mathbf{a}$ and $\mathbf{b}$ are constant vectors.

A very important case is that where the photon wavelength $\lambda$ is large compared with the dimensions $a$ of the radiating system. This usually means that the velocity of the particles is small compared with that of light. In the first approximation in $a/\lambda$ (corresponding to dipole radiation; cf. Fields, §67), the factor $e^{-i\mathbf{k}\cdot\mathbf{r}}$ varies only slightly in the region where $\psi_i$ or $\psi_f$ is appreciably different from zero, and it can be replaced by unity in the transition current (45.3). This implies that the photon momentum is neglected in comparison with the momenta of the particles in the system.

In the same approximation, the integral $\mathbf{j}_{fi}(0)$ may be replaced by its non-relativistic value, which is simply the matrix element $\mathbf{v}_{fi}$ of the electron velocity with respect to the Schrödinger wave functions. In turn, this element $\mathbf{v}_{fi} = -i\omega\mathbf{r}_{fi}$, and $e\mathbf{r}_{fi} = \mathbf{d}_{fi}$, where $\mathbf{d}$ is the dipole moment of the electron (in its orbital motion). Thus we have the following formula for the probability of dipole radiation:

$$dw_{e\mathbf{n}} = (\omega^3/2\pi)\,|\mathbf{e}^*\cdot\mathbf{d}_{fi}|^2\,do. \tag{45.5}$$

(Here the direction of $\mathbf{n}$ occurs implicitly: the vector $\mathbf{e}$ must be perpendicular to $\mathbf{n}$.) Summation with respect to the polarizations gives

$$dw_{\mathbf{n}} = (\omega^3/2\pi)\,|\mathbf{n}\times\mathbf{d}_{fi}|^2\,do. \tag{45.6}$$

Since these formulae are non-relativistic (as regards the electron), they can be immediately generalized to any electron system by taking $\mathbf{d}_{fi}$ as the matrix element of the total dipole moment of the system.

Integrating (45.6) over all directions, we have the total probability of radiation:

$$w = (4\omega^3/3)\,|\mathbf{d}_{fi}|^2, \tag{45.7}$$

or, in ordinary units,

$$w = (4\omega^3/3\hbar c^3)\,|\mathbf{d}_{fi}|^2. \tag{45.7a}$$

The intensity $I$ is found by multiplying the probability by $\hbar\omega$:

$$I = (4\omega^4/3c^3)\,|\mathbf{d}_{fi}|^2. \tag{45.8}$$

This is directly analogous to the classical formula (see Fields, (67.11)) for the intensity of dipole radiation from a system of periodically moving particles: the intensity of radiation at frequency $\omega_s = s\omega$ (where $\omega$ is the frequency of the particle motion and $s$ an integer) is

$$I_s = (4\omega_s^4/3c^3)\,|\mathbf{d}_s|^2, \tag{45.9}$$

where $\mathbf{d}_s$ are the Fourier components of the dipole moment, i.e. the coefficients in the expansion

$$\mathbf{d}(t) = \sum_{s=-\infty}^{\infty}\mathbf{d}_s e^{-is\omega t}. \tag{45.10}$$

The quantum formula (45.8) is got from (45.9) by replacing these Fourier components by the matrix elements of the corresponding transitions. This rule (which is an expression of Bohr's correspondence principle) is a particular case of a general relation between the Fourier components of classical quantities and the quantum matrix elements in the quasi-classical case (see QM, §48). The radiation is quasi-classical for transitions between states having large quantum numbers; the transition energy $\hbar\omega = E_i - E_f$ is then small in comparison with the energies $E_i$ and $E_f$ of the radiator. This, however, would not lead to any change in the form of (45.8), which is valid for all transitions. This explains the fact (which is something of an accident) that the correspondence principle for the radiation intensity is valid not only in the quasi-classical but in the general quantum case.

§46. Electric multipole radiation

Instead of considering the emission of a photon in a given direction (i.e. with a given momentum), let us now consider the emission of a photon with definite values of the angular momentum $j$ and its component $m$ in some chosen direction $z$. We have seen in §6 that such photons can be of two kinds, electric and magnetic.
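As a numerical illustration of the dipole formula (45.7a) of §45, the following sketch evaluates the total emission probability for the $2p \to 1s$ transition of hydrogen. Everything specific to hydrogen is an assumption brought in from outside this text: the rounded CGS constants, the transition energy $\hbar\omega = 3e^2/8a_0$, and the standard matrix element $|z_{fi}| = (128\sqrt2/243)\,a_0$ for the $m = 0$ sublevel. The function names are illustrative.

```python
import math

# Rounded CGS-Gaussian constants (assumed values, for illustration only)
E_CHARGE = 4.80320e-10     # electron charge, esu
HBAR = 1.054572e-27        # erg s
C_LIGHT = 2.997925e10      # cm/s
BOHR_RADIUS = 5.29177e-9   # cm

def dipole_rate(omega, d_squared):
    """Total dipole emission probability per unit time, w = 4 w^3 |d|^2 / (3 hbar c^3)."""
    return 4.0 * omega**3 * d_squared / (3.0 * HBAR * C_LIGHT**3)

def lyman_alpha_rate():
    """Rate for hydrogen 2p -> 1s under the assumptions stated above."""
    # hbar * omega = (3/4) Ry = 3 e^2 / (8 a0)
    omega = 3.0 * E_CHARGE**2 / (8.0 * BOHR_RADIUS * HBAR)
    # Assumed textbook value of |<1s| z |2p, m=0>|
    z_fi = 128.0 * math.sqrt(2.0) / 243.0 * BOHR_RADIUS
    d_squared = (E_CHARGE * z_fi) ** 2
    return dipole_rate(omega, d_squared)

if __name__ == "__main__":
    print(f"w(2p -> 1s) = {lyman_alpha_rate():.3e} s^-1")
```

Under these assumptions the rate comes out at a few times $10^8$ sec$^{-1}$; multiplying it by $\hbar\omega$ gives the intensity (45.8).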
Let us take first the emission of electric photons, and again assume that the dimensions of the radiating system are small in comparison with the wavelength. The calculations are conveniently carried out by means of the photon wave functions in the momentum representation, i.e. by expressing the 4-vector $A^\mu(\mathbf{r})$ as a Fourier integral. Then the matrix element is

$$V_{fi} = e\int j^\mu_{fi}(\mathbf{r})A^*_\mu(\mathbf{r})\,d^3x = e\int d^3x\,j^\mu_{fi}(\mathbf{r})\int\frac{d^3k}{(2\pi)^3}A^*_\mu(\mathbf{k})\,e^{-i\mathbf{k}\cdot\mathbf{r}}; \tag{46.1}$$

for simplicity, we omit the suffixes $\omega jm$ to the photon wave functions.

For an $Ej$ photon we take the wave function from (7.10), with the arbitrary constant $C$ having the value

$$C = -\sqrt{(j+1)/j}.$$

The reason for this choice is to ensure that, in the spatial components of the wave function ($\mathbf{A}$), the terms containing spherical harmonics of order $j - 1$ cancel (as is seen from formulae (7.16)). Then $\mathbf{A}$ will include only spherical harmonics of order $j + 1$, and therefore the corresponding contribution to $V_{fi}$ is (as will be clear from the subsequent calculation) of a higher order of smallness (in $a/\lambda$) than the contribution from the component $A^0 \equiv \Phi$, which includes spherical harmonics of the lower order $j$.

Thus we put

$$A^\mu = (\Phi, 0), \quad \Phi = -\sqrt{\frac{j+1}{j}}\,\frac{4\pi^2}{\omega^{3/2}}\,\delta(|\mathbf{k}| - \omega)\,Y_{jm}(\mathbf{n}) \quad (\mathbf{n} = \mathbf{k}/\omega).$$

Substituting this expression in (46.1) and carrying out the integration over $d|\mathbf{k}|$, we obtain

$$V_{fi} = -e\sqrt{\frac{j+1}{j}}\,\frac{\sqrt\omega}{2\pi}\int d^3x\,\rho_{fi}(\mathbf{r})\int do_{\mathbf{n}}\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,Y^*_{jm}(\mathbf{n}). \tag{46.2}$$

To calculate the inner integral, we use the expansion (24.12), written in the form

$$e^{i\mathbf{k}\cdot\mathbf{r}} = 4\pi\sum_{l=0}^{\infty}\sum_{m=-l}^{l} i^l g_l(kr)\,Y^*_{lm}(\mathbf{k}/k)\,Y_{lm}(\mathbf{r}/r), \tag{46.3}$$

where

$$g_l(kr) = \sqrt{\frac{\pi}{2kr}}\,J_{l+1/2}(kr); \tag{46.4}$$

see QM, (34.3).† Substitution of this expansion in (46.2) gives

$$\int e^{-i\mathbf{k}\cdot\mathbf{r}}\,Y^*_{jm}(\mathbf{n})\,do_{\mathbf{n}} = 4\pi i^{-j}g_j(kr)\,Y^*_{jm}(\mathbf{r}/r);$$

the remaining terms are zero because of the orthogonality of the spherical harmonics.

† The normalization of the functions $g_l$ is such that their asymptotic form as $kr \to \infty$ is

$$g_l(kr) \approx (1/kr)\sin(kr - \tfrac{1}{2}l\pi). \tag{46.4a}$$

On account of the condition $a/\lambda \ll 1$, only distances such that $kr \ll 1$ will be important in the integral with respect to $d^3x$. We can therefore replace the functions $g_j(kr)$ by the first terms of their expansions in powers of $kr$:†

$$g_j(kr) \approx (kr)^j/(2j+1)!!. \tag{46.5}$$

† The power of $kr$ is equal to the order of the function $Y_{jm}$ by which $g_j$ is multiplied. This justifies the neglect of the terms in $\mathbf{A}$ which contain higher-order spherical harmonics.

The result is

$$V_{fi} = (-1)^{m+1}\,i^j\sqrt{\frac{(2j+1)(j+1)}{\pi j}}\,\frac{\omega^{j+1/2}}{(2j+1)!!}\,e\,(Q^{(e)}_{j,-m})_{fi}, \tag{46.6}$$

with the notation

$$(Q^{(e)}_{jm})_{fi} = \sqrt{\frac{4\pi}{2j+1}}\int\rho_{fi}(\mathbf{r})\,r^j\,Y_{jm}(\mathbf{r}/r)\,d^3x \tag{46.7}$$

($Y_{j,-m} = (-1)^{j-m}Y^*_{jm}$). The quantities (46.7) are called the $2^j$-pole electric transition moments of the system, by analogy with the corresponding classical quantities (Fields, §41).‡

‡ The multipole moments are defined without the factor $e$, since in this book the currents also are defined without the charge factor.

For an electron in an external field, $\rho_{fi} = \psi_f^*\psi_i$, and the quantities (46.7) are then calculated as the matrix elements of the classical quantity

$$Q^{(e)}_{jm} = \sqrt{\frac{4\pi}{2j+1}}\,r^j\,Y_{jm}.$$

In the non-relativistic case (as regards the particle velocities), the transition moment can in principle be calculated similarly for any system of $N$ interacting particles. The transition density is expressed in terms of the wave functions of the system by

$$\rho_{fi}(\mathbf{r}) = \int\psi_f^*(\mathbf{r}_1,\ldots,\mathbf{r}_N)\,\psi_i(\mathbf{r}_1,\ldots,\mathbf{r}_N)\sum_{n=1}^{N}\delta(\mathbf{r} - \mathbf{r}_n)\,d^3x_1\cdots d^3x_N, \tag{46.8}$$

where the integral is taken over the whole of configuration space.§

§ A situation can occur where the transition probability vanishes according to the approximate selection rules, valid only when the spin-orbit interaction of the electrons is neglected. Then, to obtain a non-zero result, we must use the wave functions with the relativistic correction which takes account of this interaction.

The photon wave function used here corresponds (in the coordinate representation) to normalization by the delta function on the $\omega$ scale, as assumed in formula (44.2). Substituting (46.6), we find the probability of $Ej$ radiation:‖

$$w^{(e)}_{jm} = \frac{2(2j+1)(j+1)}{j[(2j+1)!!]^2}\,\omega^{2j+1}\,e^2\,|(Q^{(e)}_{j,-m})_{fi}|^2. \tag{46.9}$$

‖ It might appear at first sight that, owing to the isotropy of space, the total probability of photon emission ought not to depend on the value of $m$. The incorrectness of this conclusion is easily seen if we notice that different final states of the system (for a given initial state) correspond to the emission of photons with different values of $m$; cf. the rule (46.16) below.

In particular, for $j = 1$ we have

$$w^{(e)}_{1m} = \frac{4\omega^3}{3}\,e^2\,|(Q^{(e)}_{1,-m})_{fi}|^2. \tag{46.10}$$

The quantities $Q^{(e)}_{1m}$ are related to the components of the electric dipole moment vector by

$$eQ^{(e)}_{10} = id_z, \quad eQ^{(e)}_{1,\pm1} = \mp\frac{i}{\sqrt2}(d_x \pm id_y). \tag{46.11}$$

Summing (46.10) with respect to $m$, we naturally obtain the earlier formula (45.7) for the total probability of dipole radiation.

The angular distribution of multipole radiation is given by formula (7.11). When this is normalized to the total emission probability $w_{jm}$, we have

$$dw_{jm} = |\mathbf{Y}^{(e)}_{jm}(\mathbf{n})|^2\,w_{jm}\,do = w_{jm}\,\frac{1}{j(j+1)}\,|\nabla_{\mathbf{n}}Y_{jm}|^2\,do. \tag{46.12}$$

In particular, for $j = 1$,

$$Y_{10} = i\sqrt{\frac{3}{4\pi}}\cos\theta, \quad Y_{1,\pm1} = \mp i\sqrt{\frac{3}{8\pi}}\sin\theta\cdot e^{\pm i\phi},$$

where $\theta$ and $\phi$ are the polar angle and azimuth of the direction $\mathbf{n}$ relative to the $z$-axis. On calculating the gradient, we find that the angular distribution of dipole radiation with a definite value of $m$ is given by

$$dw_{10} = w_{10}\,\frac{3}{8\pi}\sin^2\theta\,do, \quad dw_{1,\pm1} = w_{1,\pm1}\,\frac{3}{8\pi}\,\frac{1+\cos^2\theta}{2}\,do. \tag{46.13}$$

These expressions could also, of course, be obtained from formula (45.6) by putting firstly (for $m = 0$) $d_x = d_y = 0$, $d_z = d$, secondly (for $m = \pm1$) $d_y = \mp id_x = d/\sqrt2$, $d_z = 0$.

If the order of magnitude of the dimensions of the system (atom or nucleus) is $a$, then that of the electric multipole moments is, in general, $Q^{(e)}_{jm} \sim a^j$. The probability of multipole radiation is

$$w^{(e)}_{jm} \sim \alpha k(ka)^{2j}. \tag{46.14}$$

When the multipole order increases by one, the probability decreases by a factor $\sim(ka)^2$.
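Two of the statements above lend themselves to a quick numerical check; the sketch below is only that, with illustrative function names. It assumes that the numerical factor in (46.9) is $2(2j+1)(j+1)/j[(2j+1)!!]^2$ and that the dipole angular distributions (46.13) are $(3/8\pi)\sin^2\theta$ and $(3/8\pi)\cdot\frac{1}{2}(1+\cos^2\theta)$, then verifies that the factor reduces to $4/3$ for $j = 1$ and that both distributions integrate to unity over the sphere.

```python
import math

def double_factorial(n):
    """n!! for a positive integer n."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

def ej_coefficient(j):
    """Assumed numerical factor multiplying omega^(2j+1) e^2 |Q|^2 in (46.9)."""
    return 2.0 * (2 * j + 1) * (j + 1) / (j * double_factorial(2 * j + 1) ** 2)

def dipole_angular_integral(weight, steps=20000):
    """Integrate (3/8pi) * weight(theta) over the full solid angle (midpoint rule)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        total += weight(theta) * math.sin(theta) * h
    # The azimuthal integration contributes a factor 2*pi
    return 3.0 / (8.0 * math.pi) * 2.0 * math.pi * total

def m0_weight(theta):
    return math.sin(theta) ** 2

def m1_weight(theta):
    return 0.5 * (1.0 + math.cos(theta) ** 2)
```

The ratio `ej_coefficient(j + 1) / ej_coefficient(j)` multiplied by $(ka)^2$ gives a rough measure of how quickly successive multipoles are suppressed, in line with the estimate (46.14).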
The laws of conservation of angular momentum and parity imply certain selection rules which restrict the possible changes in the state of the radiating system. If the initial angular momentum of the system is $J_i$, then, after emission of a photon with angular momentum $j$, the angular momentum of the system can have only those values $J_f$ which are in accordance with the angular momentum addition rule ($\mathbf{J}_i - \mathbf{J}_f = \mathbf{j}$):

$$|J_i - J_f| \le j \le J_i + J_f. \tag{46.15}$$

For given values of $J_i$ and $J_f$, the same rule (46.15) specifies the possible values of the photon angular momentum $j$. But, since the probability of emission decreases rapidly with increasing $j$, the emission occurs principally with the lowest possible multipole order.

The components $M_i$ and $M_f$ of the angular momenta $J_i$ and $J_f$, and $m$ of the photon angular momentum, satisfy the relation

$$M_i - M_f = m, \tag{46.16}$$

which is obvious from the same law of addition of angular momenta.

The parities $P_i$ and $P_f$ of the initial and final states of the radiating system must be such that $P_fP_{ph} = P_i$, where $P_{ph}$ is the parity of the emitted photon. Since the parities can have only the values $\pm1$, this condition may also be written

$$P_iP_f = P_{ph}. \tag{46.17}$$

For an electric photon $P_{ph} = (-1)^j$, and the parity selection rule for electric multipole radiation is therefore

$$P_iP_f = (-1)^j. \tag{46.18}$$

The selection rules for total angular momentum and for parity are entirely rigorous and must be satisfied in emission by any systems. There may also be other rules which are more restrictive and which arise from certain properties of the structure of particular radiating systems. These latter rules must of necessity be approximate to some extent; they will be discussed in later sections of this chapter.

The dependence of the emission probability on the quantum numbers $m$, $M_i$ and $M_f$ is entirely determined by the tensor character of the multipole moments. The quantities $Q_{jm}$ with a given $j$ form a spherical tensor of rank $j$.
The dependence of its matrix elements on these quantum numbers is given by the formula

$$|\langle n_fJ_fM_f|Q_{j,-m}|n_iJ_iM_i\rangle|^2 = \begin{pmatrix} J_f & j & J_i \\ -M_f & -m & M_i \end{pmatrix}^2 |\langle n_fJ_f\|Q_j\|n_iJ_i\rangle|^2 \tag{46.19}$$

(see QM, (107.6)), where $n$ conventionally denotes all the quantum numbers specifying the state of the system, other than $J$ and $M$. The reduced matrix elements on the right of (46.19) do not depend on $m$, $M_i$, $M_f$. On substituting this formula in (46.9), we obtain the required dependence, which is proportional to

$$\begin{pmatrix} J_f & j & J_i \\ M_f & m & -M_i \end{pmatrix}^2;$$

here it is, of course, assumed that the emitter is not in an external field, and that the transition frequency $\omega$ is thus independent of $M_i$ and $M_f$.

Summing the probability over all values of $M_f$ (for a given $M_i$), we have the total probability of emission of a photon of a given frequency from the initial level $n_i$, $J_i$ of the system. It is obvious from the isotropy of space that this quantity must also be independent of the initial value $M_i$. The summation is carried out by means of the formula

$$\sum_{M_f}|\langle n_fJ_fM_f|Q_{j,-m}|n_iJ_iM_i\rangle|^2 = \frac{1}{2J_i+1}\,|\langle n_fJ_f\|Q_j\|n_iJ_i\rangle|^2 \tag{46.20}$$

(see QM, (107.11)).

§47. Magnetic multipole radiation

The wave function of a magnetic photon is $A^\mu = (0, \mathbf{A})$, where $\mathbf{A}$ is given by (7.6). Substitution in (46.1) gives for the transition matrix element

$$V_{fi} = -e\,\frac{\sqrt\omega}{2\pi}\int d^3x\,\mathbf{j}_{fi}(\mathbf{r})\cdot\int do_{\mathbf{n}}\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,\mathbf{Y}^{(m)*}_{jm}(\mathbf{n}). \tag{47.1}$$

The components of the vector $\mathbf{Y}^{(m)}_{jm}$ can be expressed in terms of the spherical harmonics of order $j$, as shown in (7.16). Again using the expansion (46.3), we obtain for the inner integral

$$\int e^{-i\mathbf{k}\cdot\mathbf{r}}\,\mathbf{Y}^{(m)*}_{jm}(\mathbf{n})\,do_{\mathbf{n}} = 4\pi i^{-j}g_j(kr)\,\mathbf{Y}^{(m)*}_{jm}(\mathbf{r}/r),$$

and, on substituting $g_j$ from (46.5),†

$$V_{fi} = -e\,i^{-j}\,\frac{2\omega^{j+1/2}}{(2j+1)!!}\int\mathbf{j}_{fi}(\mathbf{r})\cdot r^j\,\mathbf{Y}^{(m)*}_{jm}(\mathbf{r}/r)\,d^3x.$$

† The current $\mathbf{j}$ must not be confused with the angular momentum $j$.

Here we must substitute, in accordance with the definition (7.4),

$$\mathbf{Y}^{(m)}_{jm}(\mathbf{r}/r) = \frac{1}{\sqrt{j(j+1)}}\,\mathbf{r}\times\nabla Y_{jm};$$

we then transform the integrand by means of the formula

$$r^j\,\mathbf{j}_{fi}\cdot\mathbf{r}\times\nabla Y^*_{jm} = -\mathbf{r}\times\mathbf{j}_{fi}\cdot\nabla(r^jY^*_{jm}),$$

obtaining

$$V_{fi} = (-1)^m\,i^j\sqrt{\frac{(2j+1)(j+1)}{\pi j}}\,\frac{\omega^{j+1/2}}{(2j+1)!!}\,e\,(Q^{(m)}_{j,-m})_{fi}, \tag{47.2}$$

with the notation

$$(Q^{(m)}_{jm})_{fi} = \frac{1}{j+1}\sqrt{\frac{4\pi}{2j+1}}\int\mathbf{r}\times\mathbf{j}_{fi}\cdot\nabla(r^jY_{jm})\,d^3x. \tag{47.3}$$

These are called the $2^j$-pole magnetic transition moments.

Because of the analogy between the expressions (47.2) and (46.6), for the emission probability we obtain a formula which differs from (46.10) only in that the electric moments are replaced by magnetic moments. Formula (46.12) for the angular distribution also remains valid (as has already been mentioned in connection with (7.11)).

Let us analyse the form of (47.3) when $j = 1$. In this case, the functions are

$$\sqrt{\frac{4\pi}{3}}\,rY_{10} = iz, \quad \sqrt{\frac{4\pi}{3}}\,rY_{1,\pm1} = \mp\frac{i}{\sqrt2}(x \pm iy),$$

and their gradients are simply the spherical unit vectors $\mathbf{e}^{(0)}$, $\mathbf{e}^{(\pm1)}$ (7.14). The quantities $e(Q^{(m)}_{1m})_{fi}$ are therefore the spherical components of the vector

$$\boldsymbol{\mu}_{fi} = \tfrac{1}{2}e\int\mathbf{r}\times\mathbf{j}_{fi}\,d^3x, \tag{47.4}$$

which is similar in form to the classical magnetic moment (see Fields, §44). The total probability of $M1$ radiation is given in terms of this quantity by the formula (in ordinary units)

$$w = (4\omega^3/3\hbar c^3)\,|\boldsymbol{\mu}_{fi}|^2. \tag{47.5}$$

We shall show how formula (47.4) is related to the usual non-relativistic quantum expression for the magnetic moment operator. The expression for the transition current is (see QM, §115)

$$\mathbf{j}_{fi} = -\frac{i}{2m}\{\psi_f^*\nabla\psi_i - \psi_i\nabla\psi_f^*\} + \frac{\mu}{es}\,\mathrm{curl}\,(\psi_f^*\hat{\mathbf{s}}\psi_i), \tag{47.6}$$

where $\mu$ is the magnetic moment of the particle and $s$ its spin. Hence

$$\boldsymbol{\mu}_{fi} = -\frac{ie}{4m}\int\psi_f^*(\mathbf{r}\times\nabla)\psi_i\,d^3x + \frac{ie}{4m}\int\psi_i(\mathbf{r}\times\nabla)\psi_f^*\,d^3x + \frac{\mu}{2s}\int\mathbf{r}\times\mathrm{curl}\,(\psi_f^*\hat{\mathbf{s}}\psi_i)\,d^3x. \tag{47.7}$$

In the second term, we write

$$\int\psi_i(\mathbf{r}\times\nabla)\psi_f^*\,d^3x = -\int\psi_f^*(\mathbf{r}\times\nabla)\psi_i\,d^3x - \int\mathrm{curl}\,(\mathbf{r}\psi_f^*\psi_i)\,d^3x.$$

The last integral can be transformed into one over an infinitely distant surface, and is zero. Thus the first two terms in (47.7) are equal. In the third term, we transform the integral as follows (temporarily writing $\mathbf{F} = \psi_f^*\hat{\mathbf{s}}\psi_i$):

$$\int\mathbf{r}\times(\nabla\times\mathbf{F})\,d^3x = \oint\mathbf{r}\times(d\mathbf{f}\times\mathbf{F}) - \int(\mathbf{F}\times\nabla)\times\mathbf{r}\,d^3x.$$

The surface integral is zero, and in the last term $(\mathbf{F}\times\nabla)\times\mathbf{r} = -\mathbf{F}\,\mathrm{div}\,\mathbf{r} + \mathbf{F} = -2\mathbf{F}$. Thus,

$$\int\mathbf{r}\times\mathrm{curl}\,\mathbf{F}\,d^3x = 2\int\mathbf{F}\,d^3x.$$
The expression for $\boldsymbol{\mu}_{fi}$ therefore becomes

$$\boldsymbol{\mu}_{fi} = \int\psi_f^*\left(\frac{e}{2m}\hat{\mathbf{L}} + \frac{\mu}{s}\hat{\mathbf{s}}\right)\psi_i\,d^3x, \tag{47.8}$$

where $\hat{\mathbf{L}} = -i\mathbf{r}\times\nabla$ is the particle orbital angular momentum operator. This is, as it should be, the matrix element of the operator

$$\hat{\boldsymbol{\mu}} = \frac{e}{2m}\hat{\mathbf{L}} + \frac{\mu}{s}\hat{\mathbf{s}}, \tag{47.9}$$

which contains the operators of the orbital and intrinsic magnetic moments of the particle.

The selection rules for magnetic multipole radiation are analogous to those for the electric case: the rules (46.15), (46.16) again apply to the total angular momentum, and the parity rule is

$$P_iP_f = (-1)^{j+1}, \tag{47.10}$$

which is obtained by substituting in (46.17) the parity of the $Mj$ photon, $P_{ph} = (-1)^{j+1}$.

§48. Angular distribution and polarization of the radiation

The formulae derived in §§46 and 47 relate to the emission of a photon with definite values of the angular momentum $j$ and component thereof $m$. It was accordingly assumed that the radiating system (a nucleus, say) has not only definite values of the angular momentum $J$ but also definite polarizations, i.e. values of $M$, both before and after the emission.

Let us now consider the more general case of emission by a partially polarized nucleus (whose dimensions are again assumed small in comparison with the wavelength). The emitted photon again has a definite angular momentum $j$, but may be partially polarized. Let us find the emission probability as a function of the direction $\mathbf{n}$ of the photon. This probability must be expressed in terms of density matrices which describe the polarization states of the nucleus and the photon.

For this purpose, we shall first write down the emission probability as a function of the direction $\mathbf{n}$ and helicity $\lambda$ of the photon ($\lambda = \pm1$), for the case where the initial and final nuclei have definite values $J_i$, $M_i$; $J_f$, $M_f$.

The matrix element for emission of a photon with definite values $j$, $m$ is proportional to the matrix element of the (electric or magnetic) $2^j$-pole moment of the nucleus:

$$\langle J_fM_f; jm|V|J_iM_i\rangle \propto (-1)^m\langle J_fM_f|Q_{j,-m}|J_iM_i\rangle. \tag{48.1}$$

The wave function of the emitted photon (in the momentum representation) is proportional to $\mathbf{Y}^{(e)}_{jm}(\mathbf{n})$ or $\mathbf{Y}^{(m)}_{jm}(\mathbf{n})$. The wave function of a photon whose momentum is in the direction $\mathbf{n}$ and whose helicity is $\lambda$ is proportional to the polarization vector $\mathbf{e}^{(\lambda)}$. The matrix element for emission of a photon $\mathbf{n}$, $\lambda$ is found by multiplying (48.1) by the projection of the wave function of the state $|jm\rangle$ on that of the state $|\mathbf{n}\lambda\rangle$:

$$\langle J_fM_f; \mathbf{n}\lambda|V|J_iM_i\rangle \propto (-1)^m\langle J_fM_f|Q_{j,-m}|J_iM_i\rangle\,\mathbf{e}^{(\lambda)*}\cdot\mathbf{Y}_{jm}.$$

According to (16.23), for photons of either type

$$\mathbf{e}^{(\lambda)*}\cdot\mathbf{Y}_{jm}(\mathbf{n}) \propto D^{(j)}_{\lambda m}(\mathbf{n}). \tag{48.2}$$

The matrix element of the multipole moment can be expressed in the usual way in terms of the reduced element. Thus we find the transition probability amplitude in the form

$$\langle J_fM_f; \mathbf{n}\lambda|V|J_iM_i\rangle \propto (-1)^{J_f-M_f+m}\begin{pmatrix} J_f & j & J_i \\ -M_f & -m & M_i \end{pmatrix} Q\,D^{(j)}_{\lambda m}(\mathbf{n}), \tag{48.3}$$

where $Q$ denotes $\langle J_f\|Q_j\|J_i\rangle$.

We can now proceed to the general case of mixed polarization states. According to the general rules of quantum mechanics, the transition probability is proportional to the expression†

$$\sum_{(m)}\langle J_fM_f; \mathbf{n}\lambda|V|J_iM_i\rangle\,\langle J_fM_f'; \mathbf{n}\lambda'|V|J_iM_i'\rangle^*\,\langle M_i|\rho^{(i)}|M_i'\rangle\,\langle M_f'|\rho^{(f)}|M_f\rangle\,\langle\lambda'|\rho^{(\gamma)}|\lambda\rangle, \tag{48.4}$$

where $\rho^{(i)}$, $\rho^{(f)}$, $\rho^{(\gamma)}$ are the density matrices of the initial nucleus, the final nucleus and the emitted photon; the symbol $(m)$ beneath the summation sign indicates that the sum is taken over all the $m$-type quantities which occur twice ($M_i$, $M_i'$, $M_f$, $M_f'$, $\lambda$, $\lambda'$). Then (48.3) is to be substituted in (48.4).

† If the initial and final states of the system are described by the superpositions

$$|i\rangle = \sum_n a_n|n\rangle, \quad |f\rangle = \sum_m b_m|m\rangle,$$

then the matrix element is

$$\langle f|V|i\rangle = \sum_{n,m}a_nb_m^*V_{mn},$$

and its square is

$$|\langle f|V|i\rangle|^2 = \sum_{n,n',m,m'}V_{mn}V^*_{m'n'}\,a_na^*_{n'}\,b_m^*b_{m'}.$$

The case of mixed states is obtained by making the changes

$$a_na^*_{n'} \to \langle n|\rho^{(i)}|n'\rangle, \quad b_{m'}b_m^* \to \langle m'|\rho^{(f)}|m\rangle,$$

so that

$$|\langle f|V|i\rangle|^2 \to \sum_{n,n',m,m'}V_{mn}V^*_{m'n'}\,\langle n|\rho^{(i)}|n'\rangle\,\langle m'|\rho^{(f)}|m\rangle.$$

Let $w(\mathbf{n})\,do$ denote the probability of emission of a photon into the solid angle $do$. The total probability of emission, in any direction and with any polarizations of the photon and the final nucleus, is evidently independent of the initial polarization state of the nucleus, is given by formulae already known, and is of no interest here. We shall therefore arbitrarily normalize the probability $w(\mathbf{n})$ to unity. The result is†

$$w(\mathbf{n}) = \frac{(2j+1)(2J_i+1)}{8\pi}\sum_{(m)}(-1)^{2J_f-M_f-M_f'+m+m'}\,D^{(j)}_{\lambda m}D^{(j)*}_{\lambda'm'}\begin{pmatrix} J_f & j & J_i \\ -M_f & -m & M_i \end{pmatrix}\begin{pmatrix} J_f & j & J_i \\ -M_f' & -m' & M_i' \end{pmatrix}\langle M_i|\rho^{(i)}|M_i'\rangle\,\langle M_f'|\rho^{(f)}|M_f\rangle\,\langle\lambda'|\rho^{(\gamma)}|\lambda\rangle;$$

it will be seen below that the normalization is correct.

† In transforming the sign factor note that the numbers $2J_i$, $2J_f$, $2M_i$, $2M_f$ have the same parity; $j$ and $m$ are integers, and $\lambda = \pm1$.

This formula can be transformed by using the series expansion QM (110.2) for the product of the two $D$ functions:

$$D^{(j)}_{\lambda m}D^{(j)*}_{\lambda'm'} = (-1)^{\lambda'+m'}D^{(j)}_{\lambda m}D^{(j)}_{-\lambda',-m'} = (-1)^{\lambda'+m'}\sum_L(2L+1)\begin{pmatrix} j & j & L \\ \lambda & -\lambda' & -\Lambda \end{pmatrix}\begin{pmatrix} j & j & L \\ m & -m' & -\mu \end{pmatrix}D^{(L)}_{\Lambda\mu},$$

where $\Lambda = \lambda - \lambda'$, $\mu = m - m'$ and $L$ takes integral values from 0 to $2j$. Thus we have finally

$$w(\mathbf{n}) = \frac{(2j+1)(2J_i+1)}{8\pi}\sum_L\sum_{(m)}(-1)^{2J_f-M_f-M_f'+m+\lambda'}(2L+1)\begin{pmatrix} j & j & L \\ \lambda & -\lambda' & -\Lambda \end{pmatrix}\begin{pmatrix} j & j & L \\ m & -m' & -\mu \end{pmatrix}\begin{pmatrix} J_f & j & J_i \\ -M_f & -m & M_i \end{pmatrix}\begin{pmatrix} J_f & j & J_i \\ -M_f' & -m' & M_i' \end{pmatrix}D^{(L)}_{\Lambda\mu}(\mathbf{n})\,\langle M_i|\rho^{(i)}|M_i'\rangle\,\langle M_f'|\rho^{(f)}|M_f\rangle\,\langle\lambda'|\rho^{(\gamma)}|\lambda\rangle. \tag{48.5}$$

As previously, $\sum_{(m)}$ denotes summation over all $m$-type quantities which occur twice. Here it must be noted that $\lambda$ and $\lambda'$ differ from the other quantities, since they have only two values, $\lambda, \lambda' = \pm1$, corresponding to the two polarizations of the photon, and not $2j + 1$ values for any given $j$.

Formula (48.5) embodies all the necessary information about the angular distribution and polarization of the emitted photons, and also about the polarization of the secondary nuclei (i.e. those which have emitted a photon). It is assumed that the initial density matrix is given.

ANGULAR DISTRIBUTION

The angular distribution of the photons is obtained by summation over all polarizations of the photon and the secondary nucleus.
The averaging with respect to polarizations is done by substituting the density matrices of the unpolarized states: <A|pw|A'> = J«AAS (M,\pW\M',) = Y^T\ b (48 6) - »M after which the summation amounts to a multiplication by 2 for the photon or by 2Jf + 1 for the nucleus. Thus the summation is effected by simply making the changes <A|pw|A')->SAA., (48.7) (MI\pW\hrf)-+8MfM-P and the angular distribution is w(n) = flJ + lX2Ji + l) y 2 (-ir' + I (2L + l)Dfe>(n) x 87r T (m) (j j L$j j L \( Jf \A -A OAm - m ' -pIK-Mf j -m J , \ / Jf MjK-M, j J-t\ - m' M$ x<M,|p(l)|Mi>. This formula can be considerably simplified by carrying out the summation over m-type quantities. First of all, we note that a \ O H - ^ A ; o)- «M> and therefore =0 for odd L. In the sum over L, therefore, only the terms with even L remain, and it involves §48 177 Angular Distribution and Polarization of the Radiation only even-order spherical harmonics D ^ . This result is obvious a priori, since, by the conservation of parity, the probability must be unchanged by inversion, i.e. by putting n -* - n. Thus we have Ä ( p ) .ßl±i22i^ ? ( 2 L + 1)(J £/ u Ji L)DBWX -m M , / \ - M , - m' M'J \m -m' -IL)\-M, (i) x<Mi|p |M'i). The normalization here is easily verified: with the formula f D\tf(n) dol4ir = ÔL0 8Mo, integration over all directions leaves only the term with L = 0, ix = 0, and the formulae Ii \m j^-l-M/ * °\ = K(-1V" } -m 0) 1 V(2j +1)' - m AfJ =2JT+T' tr >0)=1 ' then show that the integral is equal to unity. The further summation with respect to m, m', Mf in the inner sum in w(n) is effected by means of QM, (108.4). The final expression for the photon angular distribution is *(n) = (-l)'W 2 > + l M 2 J ' + 1)X \5 en ( - i)LV(2L+i) (i -i o){J; Î 3?*B"«H (48.9) where (48.10) w ) ^ i * = (-l) 0l - M . The inner sum in (48.9) is taken over all |/i| ^ L, and the outer sum over all even L such that L*s2j, Lss2J f . 
(48.11) 178 §48 Radiation (These conditions result from the triangle rule which has to be satisfied by the quantities in the 3j-symbols that appear in (48.9), (48.10).) The number of terms in the sum is therefore usually small. For instance, when J, = 0 or |, only the term with L = 0 remains, and the radiation is isotropic; this term is easily seen to equal 4, as it should by the normalization condition. When J, = 1 or 3/2, or j = 1, the two terms with L = 0 or 2 remain in the sum over L. If the density matrix p0) is diagonal (Mi = Mi), then /x = 0, and the distribution function (48.9) becomes an expansion in Legendre polynomials; according to (16.5) and QM, (58.23), the functions D&* are the functions PL (cos 0). Finally, if i.e. if the initial nucleus is unpolarized, then all the 0*^ are zero except 0*$ = l.t The quantities ^LM are convenient characteristics of the polarization state of the nucleus, and will be called polarization moments. Formula (48.10) defines them in terms of the density matrix pM\i- The inverse formula expressing the density matrix in terms of the polarization moments is easily verified: '"»"£ VllTTrL<-1»"M'(-M' \ M K < 48 ' 12 > Let /L^ be a spherical tensor depending on the polarization state of the nucleus. According to the general rules (see QM, (14.8)), its mean value in a state having the density matrix pMM is h. = 2 PMM</M'|/ L J/M). (48.13) Expressing the matrix elements of the fL$L in terms of the reduced element </||/L||/) by means of the formula = iH-i)}-M(_JM, W\UM) L i)<J|/L|U>, t Using the result that / J \-M' l .1 ° J \ - ( nJ-» 0 MJ-(_l) V(2J + 1)5MM' we have &-«-*(-'*■ Ï i)«--^ + \2(_' M . I L)(-Jv I i) = V(2J + 1) 8Lo S^O, and the conclusion stated then follows from the definition (48.10). §48 179 Angular Distribution and Polarization of the Radiation and using the polarization moments as defined by (48.10), we obtain j . = . 
<J\MJ) UlL V[(2L+1)(2J + 1)] « ^ (4814) K } PHOTON POLARIZATION When the matrices p and pu\ as well as p (,) , are specified, formula (48.5) determines the probability of a transition in which a photon is emitted, and the nucleus left, in definite polarization states. Such states are essentially characteristic not of the emission process as such, but of the detectors which record the photon and the recoil nucleus and distinguish definite polarizations of these. There is another and more natural formulation of the problem, in which the final state of the "nucleus + photon" system is not specified from the start, and the polarization density matrix of this state is to be determined, with only the direction of the photon emission fixed. The answer to this problem is given by the same formula (48.5). If this is written as iy) w = w(n) S <M,; nA|p|M; ; nA'><Alp(y)|AXAf aPtf)|M/>f (m) (48.15) then the expression (Mf; nA|p|M}; nA') is the required density matrix, since according to the general rules of quantum mechanics the probability w of a transition to a specified state is given by its "projection" on the given piy\ pu\ The factor w(n) is written in (48.15) so that this matrix shall be normalized by the usual condition, 2 (M/;nA|p|M/;nA)=l. If we want the polarization of the photon alone, a summation over M/ = M} is necessary: <nA|p|nA'> = 2 (Mf; nk\p\Mf; nA'). Using a derivation exactly similar to that of (48.9), we obtain x ? (-0VUL + 1)(| _'A, ffij J ! J } 2 ni'DkM. (48.16) where A = A - A', and the summation is over all integral values of L which satisfy the conditions (48.11). In particular, circular polarization is determined by the Stokes parameter 6 = <nl|p|nl)-<n,-l|p|n,-l); §48 Radiation 180 see §8, Problem. Because of the relation (48.8), all terms with even L in this difference are zero, and the resulting formula for £2 differs from (48.9) only in that the summation is over odd instead of even values of L. 
SECONDARY NUCLEUS POLARIZATION Finally, if we are interested only in the final polarization of the nuclei, we must put p{y)-+ 8. If the integration with respect to directions of the photon is also carried out, the density matrix of the secondary nucleus is (Mf\p\M'f) = f w(n)(M,n\p\M'fn) do = (2Ji + l) S (-I)2*-*'-"'' x * U -I M,)(-i -l AJ;K"ÏM»- The polarization moments calculated by means of this matrix are 9% = (M)J'+i'+L+'V[(2J, + \){2J, + 1)]{'' J J L }^&. (48.17) If the initial nucleus is unpolarized, so is the final nucleus, but there exists a correlation polarization, i.e. a polarization of the nucleus after emission in a specified direction. Putting p(,)->S/(2J, + 1) (and correspondingly w(n) = l/47r) and calculating as in the derivation of (48.9), we obtain for the density matrix describing this polarization (M/;n|p|M;;n) = (2J- + 1)(-1,^«^L2n(2L + 1)('1 _', £)(_'<,, J J ) x x{j J > £}D&>(n). (48.18) The corresponding polarization moments are 9% = iL(-l)l+Ji+J'(2j + 1)V[(2L + 1)(2J, + 1)] x *(i -i o){/1 *H>>- (48i9) Only even-order moments occur (which is also a consequence of the conservation of parity already mentioned). If the secondary nucleus in turn emits a photon, it will generate an anisotropic distribution, being polarized. Since the polarization moments (48.19) depend on the direction n of the photon emitted in the first decay, there is a certain correlation between the directions of successively emitted photons (with an unpolarized §49 Radiation from Atoms: The Electric Type 181 primary nucleus). Other correlation effects (of polarization, etc), in cascade emission can be treated similarly.! PROBLEM Find the relation between the polarization moments 0 ^ , 0 V and the mean values of the angular momentum vector J and the quadrupole moment tensor Q». SOLUTION. The reduced elements of the vector J and the tensor Qik are determined from J1 = (J\\J\\J)2l(2J + \), QZ = (J||Q||J)2/(2J + 1); cf. QM, (107.10), (107.11). 
The operator $Q_{ik}$ is expressed in terms of the angular momentum operators as in QM (75.2):

$$\hat Q_{ik} = \frac{3Q}{2J(2J-1)}\{\hat J_i\hat J_k + \hat J_k\hat J_i - \tfrac{2}{3}J(J+1)\delta_{ik}\}.$$

Hence we find the mean value

$$\overline{Q_{ik}^2} = \left[\frac{3Q}{2J(2J-1)}\right]^2\tfrac{2}{3}\mathbf{J}^2(4\mathbf{J}^2 - 3) = \frac{3Q^2(J+1)(2J+3)}{2J(2J-1)},$$

with $\mathbf{J}^2 = J(J+1)$. The reduced matrix elements are

$$\langle J\|J\|J\rangle = \sqrt{J(J+1)(2J+1)}, \quad \langle J\|Q\|J\rangle = Q\sqrt{\frac{3(J+1)(2J+1)(2J+3)}{2J(2J-1)}}.$$

From (48.14) we now see that the polarization moments $\mathcal{P}_{1\mu}$ are equal to the spherical components of the vector

$$\sqrt{\frac{3}{J(J+1)}}\,\overline{\mathbf{J}},$$

and the moments $\mathcal{P}_{2\mu}$ are equal to the spherical components of the tensor

$$\frac{1}{Q}\sqrt{\frac{10J(2J-1)}{3(J+1)(2J+3)}}\,\overline{Q_{ik}}.$$

§49. Radiation from atoms: the electric type†

The energies of the outer electrons of an atom (which take part in optical radiative transitions) have, as a rough estimate, the order of magnitude $E \sim me^4/\hbar^2$, so that the radiated wavelengths $\lambda \sim \hbar c/E \sim \hbar^2/\alpha me^2$. The dimension of the atom is $a \sim \hbar^2/me^2$. Thus, in the optical spectra of atoms, we generally have the inequality $a/\lambda \sim \alpha \ll 1$. The ratio $v/c \sim \alpha$, where $v$ is the velocity of the optical electrons, has a similar order of magnitude.

Thus, in the optical spectra of atoms, a condition is satisfied which means that the probability of electric dipole radiation (if this is allowed by the selection rules) considerably exceeds the probabilities of multipole transitions.† For this reason it is electric dipole transitions which are the most important in atomic spectroscopy.

† A detailed account of these problems is given in the paper by A. Z. Dolginov, in: L. A. Sliv, ed., Gamma-luchi (Gamma Rays), USSR Academy of Sciences, Moscow 1961, pp. 523-681.

† In §§49-51 and 53-55, ordinary units are used.

As has already been mentioned, such transitions are subject to strict selection rules as regards the total angular momentum $J$ of the atom and the parity $P$:‡

$$|J' - J| \le 1 \le J + J', \tag{49.1}$$

$$PP' = -1. \tag{49.2}$$

The inequality $|J' - J| \le 1$ signifies that the angular momentum $J$ can change only by 0 or $\pm1$; also, the transition $0 \to 0$ is forbidden by the inequality $J + J' \ge 1$.
The parities of the initial and final states must be opposite.§

The probability of emission by the transition $nJM \to n'J'M'$ is determined by the corresponding matrix element of the dipole moment of the atom:

$$w(nJM \to n'J'M') = \frac{4\omega^3}{3\hbar c^3}\,|\langle n'J'M'|d_{-m}|nJM\rangle|^2, \qquad \omega = \omega(nJ \to n'J'). \tag{49.3}$$

On summing (49.3) over all values of $M' = M - m$ (with $M$ given), we obtain the total probability of emission with a given frequency from the atomic level $n, J$. The summation is carried out by means of (46.20), and the result is

$$w(nJ \to n'J') = \frac{4\omega^3}{3\hbar c^3}\,\frac{1}{2J+1}\,|\langle n'J'\|d\|nJ\rangle|^2. \tag{49.4}$$

The squared modulus of the reduced matrix element is sometimes called the transition line strength; it is symmetrical as between the initial and final states.

The observed radiation intensity is found by multiplying $w$ by $\hbar\omega$ and by the number $N_{nJ}$ of atoms in the source which are at the excitation level concerned. For example, in a gas at temperature $T$ this number is $N_{nJ} \propto (2J+1)\exp(-E_{nJ}/T)$; the factor $2J+1$ is the statistical weight of the level with angular momentum $J$.

Further deductions regarding transition probabilities in atomic spectra can be obtained only for specific kinds of atomic states. We shall not here discuss methods of calculating matrix elements where the degree of approximation has no clear theoretical significance, but simply derive some relations valid for a fairly large class of states (especially in light atoms) of the LS coupling type (see QM, §72). Such states are described not only by the total angular momentum but also by definite values of the orbital angular momentum $L$ and the spin $S$, which in this case are conserved.

† Typical values of the dipole transition probability in the optical region of the spectra of atoms are of the order of $10^8\ \text{sec}^{-1}$.

‡ We shall now denote the quantum numbers of the initial and final states by unprimed and primed letters respectively.
The letters $n$, $n'$ will denote all the quantum numbers which define the state of the system, other than those shown explicitly.

§ The parity selection rule was first established by O. Laporte (1924).

Since the dipole moment is a purely orbital quantity, its operator commutes with the spin operator, i.e. its matrix is diagonal with respect to the number $S$. For the number $L$, the dipole moment is subject to the same selection rules as any orbital vector (see QM, §29). Thus transitions between LS-type states are subject to the following selection rules (in addition to (49.1), (49.2)):

$$S' - S = 0, \tag{49.5}$$

$$|L' - L| \le 1 \le L + L'. \tag{49.6}$$

It should again be stressed that these rules are approximate, and no longer apply when the spin-orbit interaction is taken into account.

The rule (49.5), which forbids transitions between terms of different multiplicity, is valid not only for electric dipole transitions but for all electric transitions: the electric multipole moments of all orders are orbital tensors, and therefore their matrices are diagonal with respect to spin. For instance, for electric quadrupole transitions, in addition to the general rules

$$|J' - J| \le 2 \le J + J', \qquad PP' = 1, \tag{49.7}$$

in the case of LS coupling we have the further rules

$$S' - S = 0, \qquad |L' - L| \le 2 \le L + L'. \tag{49.8}$$

The emission probability can be written in explicit form as a function of the numbers $S$, $J$, $J'$. This is done immediately by means of the matrix elements of spherical tensors in the addition of angular momenta. According to QM, (109.3), we have†

$$|\langle n'L'SJ'\|d\|nLSJ\rangle|^2 = (2J+1)(2J'+1)\begin{Bmatrix} L' & J' & S \\ J & L & 1 \end{Bmatrix}^2 |\langle n'L'\|d\|nL\rangle|^2. \tag{49.9}$$

Substitution of this in (49.4) gives

$$w(nLSJ \to n'L'SJ') = \frac{4\omega^3}{3\hbar c^3}\,(2J'+1)\begin{Bmatrix} L' & J' & S \\ J & L & 1 \end{Bmatrix}^2 |\langle n'L'\|d\|nL\rangle|^2, \tag{49.10}$$

with $\omega = \omega(nLS \to n'L'S)$.‡

A sum rule can be derived for these probabilities.
The squares of the 6j-symbols satisfy the summation formula (see QM, (108.7))

$$\sum_{J'}(2J'+1)\begin{Bmatrix} L' & J' & S \\ J & L & 1 \end{Bmatrix}^2 = \frac{1}{2L+1}. \tag{49.11}$$

† The "angular momenta of sub-systems 1 and 2" in the formulae in QM, §109, are here to be taken as the orbital angular momentum and spin of the atom, whose interaction is neglected; the quantities $f_{kq}$ are represented by the orbital vector $d_q$.

‡ In neglecting the spin-orbit interaction in the calculation of the matrix elements, we also neglect the dependence of the frequencies on $J$ and $J'$, i.e. the fine structure of the initial and final levels of the atom.

Using this, we obtain from (49.10)

$$\sum_{J'} w(nLSJ \to n'L'SJ') = \frac{4\omega^3}{3\hbar c^3}\,\frac{1}{2L+1}\,|\langle n'L'\|d\|nL\rangle|^2. \tag{49.12}$$

This quantity is thus found to be independent of the initial value of $J$.

For radiation from a gas whose temperature is much greater than the fine-structure intervals in the atomic term $nSL$, the states with different $J$ are uniformly occupied, i.e. each of the $(2L+1)(2S+1)$ states of the term is equally probable. The probability that the atom is at a level with some definite value of $J$ is then

$$\frac{2J+1}{(2L+1)(2S+1)}, \tag{49.13}$$

i.e. is equal to the ratio of the statistical weight of the level to the total statistical weight of the term $nSL$. Averaging the expressions (49.10) or their sums (49.12) with respect to these probabilities is equivalent to multiplying by the factor (49.13). This averaging will be denoted by a bar over the letter $w$.

The total probability of emission of all the lines in a spectral multiplet (formed by all possible transitions between the fine-structure components of the two terms $nSL$ and $n'SL'$) is the sum

$$\bar{w}(nLS \to n'L'S) = \sum_{J}\sum_{J'} \bar{w}(nLSJ \to n'L'SJ'). \tag{49.14}$$

Since, of course, $\sum_J (2J+1) = (2S+1)(2L+1)$, the result obtained for the total probability agrees with (49.12).
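The sum rule (49.11) and the averaging by (49.13) can be checked with exact rational arithmetic. The following is a minimal sketch, not part of the original text: it assumes the 6j-symbol in (49.10) is arranged as {L' J' S; J L 1}, evaluates its square by Racah's formula, and uses illustrative helper names (`sixj_squared`, `j_values`, `averaged_weights`). The example term pair (a doublet with L = 2 going to L' = 1) is chosen only for illustration.

```python
from fractions import Fraction as Fr
from math import factorial


def _fact(x):
    """Factorial of a non-negative integer given as an int or Fraction."""
    x = Fr(x)
    if x.denominator != 1 or x < 0:
        raise ValueError("factorial argument must be a non-negative integer")
    return factorial(int(x))


def _triangle(a, b, c):
    """Triangle rule plus integer perimeter for the triad (a, b, c)."""
    return abs(a - b) <= c <= a + b and (a + b + c).denominator == 1


def sixj_squared(j1, j2, j3, J1, J2, J3):
    """Exact square of the 6j-symbol {j1 j2 j3; J1 J2 J3} (Racah's formula)."""
    j1, j2, j3, J1, J2, J3 = (Fr(x) for x in (j1, j2, j3, J1, J2, J3))
    triads = [(j1, j2, j3), (j1, J2, J3), (J1, j2, J3), (J1, J2, j3)]
    if not all(_triangle(*t) for t in triads):
        return Fr(0)
    alphas = [sum(t) for t in triads]
    betas = [j1 + j2 + J1 + J2, j2 + j3 + J2 + J3, j3 + j1 + J3 + J1]
    total = Fr(0)
    t = max(alphas)
    while t <= min(betas):
        den = 1
        for a in alphas:
            den *= _fact(t - a)
        for b in betas:
            den *= _fact(b - t)
        total += Fr((-1) ** int(t) * _fact(t + 1), den)
        t += 1
    prefactor = Fr(1)
    for a, b, c in triads:  # product of the squared triangle coefficients
        prefactor *= Fr(_fact(a + b - c) * _fact(a - b + c) * _fact(b + c - a),
                        _fact(a + b + c + 1))
    return prefactor * total * total


def j_values(L, S):
    """All values of J from |L - S| to L + S."""
    L, S = Fr(L), Fr(S)
    J, out = abs(L - S), []
    while J <= L + S:
        out.append(J)
        J += 1
    return out


def averaged_weights(L, S, Lp):
    """Angular part of (49.10) multiplied by the occupation factor (49.13),
    in units of (4 omega^3 / 3 hbar c^3) |<n'L'||d||nL>|^2."""
    L, S, Lp = Fr(L), Fr(S), Fr(Lp)
    out = {}
    for J in j_values(L, S):
        occupation = (2 * J + 1) / ((2 * L + 1) * (2 * S + 1))
        for Jp in j_values(Lp, S):
            w = (2 * Jp + 1) * sixj_squared(Lp, Jp, S, J, L, 1)
            if w:
                out[(J, Jp)] = occupation * w
    return out


if __name__ == "__main__":
    L, S, Lp = 2, Fr(1, 2), 1  # illustrative doublet: L = 2 -> L' = 1
    for J in j_values(L, S):
        lhs = sum((2 * Jp + 1) * sixj_squared(Lp, Jp, S, J, L, 1)
                  for Jp in j_values(Lp, S))
        print("J =", J, " sum over J' =", lhs)  # same value for every J
    weights = averaged_weights(L, S, Lp)
    for key, value in sorted(weights.items()):
        print(key, value)
    print("total =", sum(weights.values()))
```

Working with squares of the 6j-symbols keeps every quantity rational, so the comparison with 1/(2L+1) is exact rather than a floating-point approximation.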
Thus the relative probability (which is the same thing as the relative intensity) of a single line is

$$\frac{\bar{w}(nLSJ \to n'L'SJ')}{\bar{w}(nLS \to n'L'S)} = \frac{(2J+1)(2J'+1)}{2S+1}\begin{Bmatrix} L' & J' & S \\ J & L & 1 \end{Bmatrix}^2. \tag{49.15}$$

The analysis of the numerical values given by this formula shows that the strongest lines in the multiplet are those for which $\Delta J = \Delta L$ (called main lines, while the remaining components of the multiplet are called satellites). The intensity of the main lines increases with the initial value of $J$.

Summation of the quantities (49.15) with respect to $J'$ and $J$ gives respectively

$$\sum_{J'}\frac{\bar{w}(nLSJ \to n'L'SJ')}{\bar{w}(nLS \to n'L'S)} = \frac{2J+1}{(2L+1)(2S+1)}, \qquad \sum_{J}\frac{\bar{w}(nLSJ \to n'L'SJ')}{\bar{w}(nLS \to n'L'S)} = \frac{2J'+1}{(2L'+1)(2S+1)}. \tag{49.16}$$

Thus the total intensity of all the lines in a spectral multiplet having a common initial or final level is proportional to the statistical weight of that common level.

We may also consider the hyperfine structure of atomic spectral lines. The hyperfine splitting of atomic levels is due to the interaction of the electrons with the spin of the nucleus if the latter is non-zero (see QM, §122). The total angular momentum $F$ of the atom (including the nucleus) consists of the total electron angular momentum $J$ and the angular momentum $I$ of the nucleus. Each component of the hyperfine structure of the level $n, J$ has a different value of the quantum number $F$.

The rigorous law of conservation of angular momentum now leads to a rigorous selection rule for the total angular momentum $F$: for electric dipole radiation,

$$|F' - F| \le 1 \le F + F'. \tag{49.17}$$

But, in view of the extreme weakness of the interaction of the electrons with the spin of the nucleus, this interaction may be neglected in calculating the matrix elements of the electric (and magnetic) moments of the electron shell of the atom. Thus the previous selection rules regarding the electron angular momentum $J$ and the electron parity remain valid also.
In particular, the latter selection rule prohibits electric dipole transitions between hyperfine structure components of the same term: all these levels have the same parity, whereas such transitions can occur only between states of different parity.

Since the dipole moment operator commutes with the nuclear spin, the dependence of the matrix elements on the numbers $I$ and $F$ can be found explicitly, the calculations differing only by an obvious change of notation from those given above for LS coupling. The probability of emission, summed over the final values of the component of the total angular momentum $F$, is

$$w(nJIF \to n'J'IF') = \frac{4\omega^3}{3\hbar c^3}\,\frac{1}{2F+1}\,|\langle n'J'IF'\|d\|nJIF\rangle|^2, \qquad \omega = \omega(nJ \to n'J'), \tag{49.18}$$

and the square of the reduced matrix element is

$$|\langle n'J'IF'\|d\|nJIF\rangle|^2 = (2F+1)(2F'+1)\begin{Bmatrix} J' & F' & I \\ F & J & 1 \end{Bmatrix}^2 |\langle n'J'\|d\|nJ\rangle|^2. \tag{49.19}$$

PROBLEM

The majority of the lines in the spectra of the alkali metals can be described as resulting from transitions of a single outer (optical) electron in the self-consistent field of the rest of the atom, which forms a configuration of closed shells; the state of the atom is governed by LS coupling. Under these conditions, determine the relative intensities of the fine structure components of the spectral lines.

SOLUTION. The total angular momenta $L$ and $S = \tfrac12$ of the atom are equal to the orbital angular momentum and spin of the optical electron. The parity of the state is therefore $(-1)^L$ (the parity of the closed configuration of the rest of the atom being positive). The parity selection rules therefore forbid the dipole transition with $L' = L$, and so only transitions with $L' - L = \pm 1$ are possible. The transitions between components of the doublet levels $n, L$ and $n', L-1$ give only three lines, because of the selection rule for $J$ (Fig. 1).

[Fig. 1. Level scheme: the doublet $n, L$ with $J = L + \tfrac12$ and $J = L - \tfrac12$, and the doublet $n', L-1$ with $J = L - \tfrac12$ and $J = L - \tfrac32$.]

Their relative intensities (denoted by $a$, $b$, $c$) are most simply determined from the rules (49.16), instead of using (49.15) directly.
The ratios of total intensities of lines having each initial (or final) level give two equations

$$\frac{b+c}{a} = \frac{2L}{2L+2}, \qquad \frac{a+b}{c} = \frac{2L}{2L-2},$$

whence

$$a : b : c = (L+1)(2L-1) : 1 : (L-1)(2L+1).$$

Here $a$ is the line $J = L + \tfrac12 \to J' = L - \tfrac12$, $b$ the line $L - \tfrac12 \to L - \tfrac12$, and $c$ the line $L - \tfrac12 \to L - \tfrac32$, as the two equations require. If $L = 1$, the lower level is unsplit, line $c$ does not appear and $a/b = 2$.

§50. Radiation from atoms: the magnetic type

The magnetic moment of an atom is equal, in order of magnitude, to the Bohr magneton: $\mu \sim e\hbar/mc$. This differs by a factor $\alpha$ from the order of magnitude of the electric dipole moment, $d \sim ea \sim \hbar^2/me$ (since $v/c \sim \alpha$, we have $\mu \sim dv/c$, as is to be expected). Hence it follows that the probability of magnetic dipole (M1) radiation from the atom is about $\alpha^2$ times less than that of electric dipole radiation at the same frequency. The magnetic radiation is therefore important in practice only for transitions forbidden by the selection rules for the electric case.

The ratio of the probability of electric quadrupole (E2) radiation to that of M1 radiation is, in order of magnitude,

$$\frac{E2}{M1} \sim \frac{(ea^2)^2\omega^2/c^2}{\mu^2} \sim \frac{a^4m^2\omega^2}{\hbar^2} \sim \left(\frac{\Delta E}{E}\right)^2; \tag{50.1}$$

the quadrupole moment is $\sim ea^2$, $E \sim \hbar^2/ma^2$ is the energy of the atom, and $\Delta E$ the change in energy in the transition. We see that, for medium atomic frequencies (i.e. when $\Delta E \sim E$), the probabilities of E2 and M1 radiation are of the same order of magnitude (assuming, of course, that both are allowed by the selection rules). If, however, $\Delta E \ll E$ (as for transitions between fine structure components of the same term), then M1 radiation is more probable than E2.

The magnetic dipole transitions are subject to the rigorous selection rules

$$|J' - J| \le 1 \le J + J', \tag{50.2}$$

$$PP' = 1. \tag{50.3}$$

For LS coupling, there are additional selection rules, which are even more restrictive than in the electric case.
This is because of a particular property of the magnetic moment of the atom, which arises from the fact that all the particles (electrons) in the system are identical: the magnetic moment operator of the atom can be expressed in terms of the total orbital and spin angular momentum operators:

$$\boldsymbol{\mu} = -\mu_0(\mathbf{L} + 2\mathbf{S}) = -\mu_0(\mathbf{J} + \mathbf{S}), \tag{50.4}$$

where $\mu_0 = |e|\hbar/2mc$ is the Bohr magneton (see QM, §113). Owing to the conservation of total angular momentum, the operator $\mathbf{J}$ has only matrix elements which are diagonal with respect to the energy, and in considering radiative transitions it is therefore sufficient to put $\boldsymbol{\mu} = -\mu_0\mathbf{S}$.†

The angular momenta $L$ and $S$ are separately conserved when the spin-orbit interaction is neglected. The spin operator is therefore diagonal in all the quantum numbers $n$, $S$, $L$ which belong to the unsplit term. In order for a transition to occur at all, the number $J$ must change. The selection rules are consequently

$$n' = n, \qquad S' = S, \qquad L' = L, \qquad J' - J = \pm 1, \tag{50.5}$$

i.e. transitions are possible only between fine structure components of a single term.

The emission probability can be calculated exactly in this case. By an appropriate change of notation in formula (49.10), we obtain

$$w(nLSJ \to nLSJ') = \frac{4\omega^3}{3\hbar c^3}\,\mu_0^2\,(2J'+1)\begin{Bmatrix} S & J' & L \\ J & S & 1 \end{Bmatrix}^2 |\langle S\|S\|S\rangle|^2.$$

The reduced spin matrix element with respect to the spin eigenfunctions is

$$\langle S\|S\|S\rangle = \sqrt{S(S+1)(2S+1)}; \tag{50.6}$$

see QM, (29.13). The 6j-symbol is

$$\begin{Bmatrix} S & J-1 & L \\ J & S & 1 \end{Bmatrix}^2 = \frac{(L+S+J+1)(L+S-J+1)(L-S+J)(S-L+J)}{S(2S+1)(2S+2)(2J-1)\,2J\,(2J+1)}; \tag{50.7}$$

see QM, §108, Table 10. The result is then

$$w(nLSJ \to nLS, J-1) = \frac{2J-1}{2J+1}\,w(nLS, J-1 \to nLSJ) = \frac{\omega^3\mu_0^2}{3\hbar c^3}\,\frac{(L+S+J+1)(L+S-J+1)(J+S-L)(J+L-S)}{J(2J+1)}. \tag{50.8}$$

† An exception occurs in cases where the electronic angular momentum $J$ of the atom is not conserved: when the hyperfine structure is taken into account, when an external field is present, and so on (see the Problems).
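As a numerical illustration of (50.8), the sketch below evaluates the angular factor exactly and then the rate in CGS units. It is an assumption-laden example rather than anything from the text: (50.8) is taken in the form $w = (\omega^3\mu_0^2/3\hbar c^3)\,(L+S+J+1)(L+S-J+1)(J+S-L)(J+L-S)/J(2J+1)$, the constants are standard CGS values, the function names are illustrative, and the frequency in the demo is a made-up number.

```python
from fractions import Fraction as Fr

HBAR = 1.054571817e-27   # erg s
C = 2.99792458e10        # cm / s
MU_0 = 9.2740100783e-21  # Bohr magneton, erg / G


def m1_angular_factor(L, S, J):
    """Exact factor (L+S+J+1)(L+S-J+1)(J+S-L)(J+L-S) / [J(2J+1)] in (50.8)
    for the transition J -> J - 1 within one LS term."""
    L, S, J = Fr(L), Fr(S), Fr(J)
    if J > L + S or J - 1 < abs(L - S):
        raise ValueError("J and J - 1 must both belong to the term (L, S)")
    numerator = (L + S + J + 1) * (L + S - J + 1) * (J + S - L) * (J + L - S)
    return numerator / (J * (2 * J + 1))


def m1_rate(omega, L, S, J):
    """Probability per second of the M1 transition J -> J - 1, assuming the
    form of (50.8) quoted above; omega is the angular frequency in rad/s."""
    return omega ** 3 * MU_0 ** 2 / (3 * HBAR * C ** 3) * float(m1_angular_factor(L, S, J))


def m1_rate_upward(omega, L, S, J):
    """Probability for J - 1 -> J, from the ratio (2J - 1)/(2J + 1) in (50.8)."""
    J = Fr(J)
    return m1_rate(omega, L, S, J) * float((2 * J + 1) / (2 * J - 1))


if __name__ == "__main__":
    # A doublet P term: L = 1, S = 1/2, transition J = 3/2 -> 1/2.
    print("angular factor:", m1_angular_factor(1, Fr(1, 2), Fr(3, 2)))
    omega = 1.0e13  # hypothetical fine-structure frequency, rad/s
    print("rate (1/s):", m1_rate(omega, 1, Fr(1, 2), Fr(3, 2)))
```

The cubic dependence on the frequency is why such transitions, which have small intervals, are slow compared with optical electric dipole lines.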
Transitions between hyperfine structure components of one level (whose frequencies are in the radio wave range) cannot occur as electric dipole transitions, since all the components have the same parity. E2 and M1 transitions involve no change of parity. But, owing to the relatively very small intervals in the hyperfine structure, E2 radiation has a low probability compared with M1 (cf. (50.1)), so that these transitions occur as magnetic dipole transitions.

PROBLEMS

PROBLEM 1. Find the probability of an M1 transition between hyperfine structure components of a single level.

SOLUTION. The transition probability is given by formulae (49.18), (49.19), in which the diagonal reduced matrix element $\langle nJ\|\mu\|nJ\rangle$ of the magnetic moment will now appear. Its value can be written down immediately by noting that the total (not reduced) matrix element $\langle nJM|\mu_z|nJM\rangle$ determines the splitting of the relevant level by the Zeeman effect (see QM, §113), and is $-\mu_0 gM$, where $g$ is the Landé factor. The reduced matrix element is (see QM, (29.7))

$$\langle nJ\|\mu\|nJ\rangle = \frac{1}{M}\sqrt{J(J+1)(2J+1)}\;\langle nJM|\mu_z|nJM\rangle = -\mu_0 g\sqrt{J(J+1)(2J+1)}.$$

The required probability is thus found to be†

$$w(nJIF \to nJI, F-1) = \frac{2F-1}{2F+1}\,w(nJI, F-1 \to nJIF) = \frac{\omega^3\mu_0^2g^2}{3\hbar c^3}\,\frac{(J+I+F+1)(J+I-F+1)(F+J-I)(F-J+I)}{F(2F+1)}.$$

This expression differs from (50.8) only by an obvious change of notation and the extra factor $g^2$.

PROBLEM 2. Find the probability of an M1 transition between Zeeman components of a single atomic level.

SOLUTION. This is a transition $M \to M-1$ with the values of $n$ and $J$ unchanged; the transition frequency is (see (51.3) below) $\hbar\omega = \mu_0 gH$, where $g$ is the Landé factor. The matrix element of the spherical component $\mu_{-1}$ of the vector $\boldsymbol{\mu}$ is

$$|\langle nJ, M-1|\mu_{-1}|nJM\rangle| = \sqrt{\frac{(J-M+1)(J+M)}{2J(J+1)(2J+1)}}\;|\langle nJ\|\mu\|nJ\rangle| = \mu_0 g\sqrt{\tfrac12(J-M+1)(J+M)}$$

(see QM, (27.12), and Problem 1).
The transition probability is

$$w = \frac{4\omega^3}{3\hbar c^3}\,|\langle nJ, M-1|\mu_{-1}|nJM\rangle|^2 = \frac{2\mu_0^5g^5H^3}{3\hbar^4c^3}\,(J-M+1)(J+M).$$

† An interesting example is the transition between the hyperfine structure components of the ground level ($1s_{1/2}$) of the hydrogen atom, where both E1 and E2 transitions are strictly forbidden, the latter by the rule which prohibits a quadrupole transition with $J + J' = 1$. This transition has a frequency $\omega = 2\pi \times 1.42 \times 10^9\ \text{sec}^{-1}$ (wavelength $\lambda = 21$ cm). Putting $g = 2$, $I = \tfrac12$, $J = \tfrac12$, $F = 1$, $F' = 0$, we obtain $w = 4\omega^3\mu_0^2/3\hbar c^3 = 2.85 \times 10^{-15}\ \text{sec}^{-1}$.

§51. Radiation from atoms: the Zeeman and Stark effects

In an external magnetic field $H$ (assumed weak), each atomic level with total angular momentum $J$ is split into $2J+1$ levels,

$$E_M = E^{(0)} + \mu_0 gMH, \tag{51.1}$$

where $E^{(0)}$ is the unperturbed level, $\mu_0$ the Bohr magneton, $g$ the Landé factor, and $M$ the component of $J$ in the direction of the field (see QM, §113). Thus the degeneracy with respect to directions of the angular momentum is entirely removed.

Spectral lines resulting from transitions between two split levels are correspondingly split. The number of components of the line is determined by the selection rule for the number $M$, according to which, for dipole radiation, we must have

$$m = M - M' = 0, \pm 1. \tag{51.2}$$

In addition to this rule, the transitions with $M = M' = 0$ are forbidden if also $J' = J$. This is seen immediately from the general expressions (QM, (29.7)) for the matrix elements of an arbitrary vector.

The components resulting from transitions with $m = 0$ and $m = \pm 1$ are called $\pi$ and $\sigma$ components respectively. Their frequencies are

$$\hbar\omega_\pi = \hbar\omega^{(0)} + \mu_0H(g - g')M, \qquad \hbar\omega_\sigma = \hbar\omega^{(0)} + \mu_0H[gM - g'(M \mp 1)]. \tag{51.3}$$

In the particular case where $g = g'$, we have

$$\hbar\omega_\pi = \hbar\omega^{(0)}, \qquad \hbar\omega_\sigma = \hbar\omega^{(0)} \pm \mu_0 gH, \tag{51.4}$$

whatever the value of $M$; thus, in this case, the line is split into a triplet with an undisplaced $\pi$ component and two $\sigma$ components lying symmetrically on either side of it (called the "normal" Zeeman effect).
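The counting of Zeeman components implied by (51.1)-(51.4) can be sketched as follows. This is an illustration under stated assumptions, not the text's own procedure: the shift of each component is computed as $gM - g'M'$ in units of $\mu_0H$ (which is (51.1) applied to both levels), the function names are hypothetical, and the Landé factors in the demo (4/3 and 2/3 for the two levels of a doublet P term, 2 for a doublet S level) are the standard LS-coupling values, used only as an example.

```python
from fractions import Fraction as Fr


def m_values(J):
    """The 2J + 1 values of M from -J to J."""
    J = Fr(J)
    return [-J + k for k in range(int(2 * J) + 1)]


def zeeman_components(J, g, Jp, gp):
    """List (M, M', kind, shift) for the dipole line J -> J' in a weak field.

    shift = (hbar*omega - hbar*omega0) / (mu_0 H) = g M - g' M', from (51.1).
    kind is 'pi' for m = M - M' = 0 and 'sigma' for m = +1 or -1, see (51.2).
    """
    J, g, Jp, gp = Fr(J), Fr(g), Fr(Jp), Fr(gp)
    if not (abs(Jp - J) <= 1 <= J + Jp):
        raise ValueError("J -> J' is not an allowed dipole transition")
    lines = []
    for M in m_values(J):
        for Mp in m_values(Jp):
            m = M - Mp
            if abs(m) > 1:
                continue  # selection rule (51.2)
            if M == 0 and Mp == 0 and J == Jp:
                continue  # M = M' = 0 is forbidden when J' = J
            kind = "pi" if m == 0 else "sigma"
            lines.append((M, Mp, kind, g * M - gp * Mp))
    return lines


def distinct_shifts(J, g, Jp, gp):
    """Sorted list of distinct (kind, shift) pairs of the split line."""
    return sorted({(kind, shift) for _, _, kind, shift in zeeman_components(J, g, Jp, gp)},
                  key=lambda pair: pair[1])


if __name__ == "__main__":
    # g = g': the "normal" triplet of (51.4), whatever the values of J and J'.
    print(distinct_shifts(2, 1, 1, 1))
    # g != g': a J = 3/2 -> 1/2 line with illustrative Lande factors 4/3 and 2.
    for kind, shift in distinct_shifts(Fr(3, 2), Fr(4, 3), Fr(1, 2), 2):
        print(kind, shift)
```

With equal Landé factors every transition collapses onto one of three frequencies, which is the content of (51.4); with unequal factors the components separate.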
The total probability (for all directions) of emission of radiation is proportional to the squared modulus $|\langle n'J'M'|d_{-m}|nJM\rangle|^2$. Hence, using formula (46.19) with $j = 1$, we see that the relative probability of emission of each of the Zeeman components of the spectral line is

$$\begin{pmatrix} J' & 1 & J \\ M' & m & -M \end{pmatrix}^2. \tag{51.5}$$

In the particular case of the "normal" Zeeman effect there are only three components, each arising from transitions with all initial values of $M$ for given $m$. Since

$$\sum_M \begin{pmatrix} J' & 1 & J \\ M' & m & -M \end{pmatrix}^2 = \frac13 \tag{51.6}$$

(see QM, (106.12)), the emission of all three components is equally probable in this case.

The relative intensity of the Zeeman components when observed in a particular direction (relative to the direction of the magnetic field applied to the source) is of greater interest, however. According to (45.5), the probability of emission (and therefore the line strength) in a given direction $\mathbf{n}$ is proportional to $\sum|\mathbf{e}^*\cdot\mathbf{d}_{fi}|^2$, where the summation is over the two independent polarizations $\mathbf{e}$ which are possible for a given $\mathbf{n}$.

For observation along the field (along the $z$-axis), this sum is $|(d_x)_{fi}|^2 + |(d_y)_{fi}|^2$, or, in spherical components, $|(d_1)_{fi}|^2 + |(d_{-1})_{fi}|^2$. This means that only the two $\sigma$ components ($m = \pm 1$) are observed in the longitudinal direction (along the field). Their intensities are proportional to

$$\begin{pmatrix} J' & 1 & J \\ M' & \pm 1 & -M \end{pmatrix}^2. \tag{51.7}$$

These lines have definite values of the component $m$ of the angular momentum in the direction of propagation, and either right-hand ($m = 1$) or left-hand ($m = -1$) circular polarization (see §8).

For observation in a direction perpendicular to the field (along the $x$-axis, say), the intensity is proportional to the sum

$$|(d_z)_{fi}|^2 + |(d_y)_{fi}|^2 = |(d_0)_{fi}|^2 + \tfrac12\{|(d_1)_{fi}|^2 + |(d_{-1})_{fi}|^2\}.$$

Thus two $\sigma$ components and a $\pi$ component are observed in the transverse direction, with respective intensities proportional to

$$\frac12\begin{pmatrix} J' & 1 & J \\ M' & \pm 1 & -M \end{pmatrix}^2 \quad\text{and}\quad \begin{pmatrix} J' & 1 & J \\ M' & 0 & -M \end{pmatrix}^2; \tag{51.8}$$

the intensities of the $\sigma$ components are half as great as in the case of longitudinal observation.
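The relative intensities (51.5)-(51.8) can be tabulated with exact squares of 3j-symbols. The sketch below is illustrative only: it assumes the 3j-symbol in (51.5) is arranged as (J' 1 J; M' m -M), evaluates its square from Racah's formula, and uses hypothetical function names (`threej_squared`, `zeeman_intensities`).

```python
from fractions import Fraction as Fr
from math import factorial


def _fact(x):
    """Factorial of a non-negative integer given as an int or Fraction."""
    x = Fr(x)
    if x.denominator != 1 or x < 0:
        raise ValueError("factorial argument must be a non-negative integer")
    return factorial(int(x))


def threej_squared(j1, j2, j3, m1, m2, m3):
    """Exact square of the 3j-symbol (j1 j2 j3; m1 m2 m3), Racah's formula."""
    j1, j2, j3, m1, m2, m3 = (Fr(x) for x in (j1, j2, j3, m1, m2, m3))
    if m1 + m2 + m3 != 0:
        return Fr(0)
    if not (abs(j1 - j2) <= j3 <= j1 + j2) or (j1 + j2 + j3).denominator != 1:
        return Fr(0)
    for j, m in ((j1, m1), (j2, m2), (j3, m3)):
        if abs(m) > j or (j - m).denominator != 1:
            return Fr(0)
    prefactor = Fr(_fact(j1 + j2 - j3) * _fact(j1 - j2 + j3) * _fact(j2 + j3 - j1),
                   _fact(j1 + j2 + j3 + 1))
    for j, m in ((j1, m1), (j2, m2), (j3, m3)):
        prefactor *= _fact(j + m) * _fact(j - m)
    t = max(Fr(0), j2 - j3 - m1, j1 - j3 + m2)
    t_max = min(j1 + j2 - j3, j1 - m1, j2 + m2)
    total = Fr(0)
    while t <= t_max:
        den = (_fact(t) * _fact(j3 - j2 + t + m1) * _fact(j3 - j1 + t - m2)
               * _fact(j1 + j2 - j3 - t) * _fact(j1 - t - m1) * _fact(j2 - t + m2))
        total += Fr((-1) ** int(t), den)
        t += 1
    return prefactor * total * total


def zeeman_intensities(J, Jp, view):
    """Relative intensities {(M, M'): value} of the components of J -> J'.

    view = 'longitudinal': only sigma components, weights (51.7);
    view = 'transverse':   pi components with full weight and sigma
                           components with weight 1/2, as in (51.8).
    """
    if view not in ("longitudinal", "transverse"):
        raise ValueError("view must be 'longitudinal' or 'transverse'")
    J, Jp = Fr(J), Fr(Jp)
    out = {}
    M = -J
    while M <= J:
        for m in (-1, 0, 1):
            Mp = M - m
            w = threej_squared(Jp, 1, J, Mp, m, -M)
            if m == 0:
                w = w if view == "transverse" else Fr(0)
            elif view == "transverse":
                w = w / 2
            if w:
                out[(M, Mp)] = w
        M += 1
    return out


if __name__ == "__main__":
    for view in ("longitudinal", "transverse"):
        print(view)
        for key, value in sorted(zeeman_intensities(Fr(3, 2), Fr(1, 2), view).items()):
            print("  M, M' =", key, " intensity =", value)
```

Only the squares of the 3j-symbols enter the intensities, so the phase convention of the symbols plays no part in the result.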
The $\pi$ component is linearly polarized along the $z$-axis; the $\sigma$ components, as observed in this direction, are linearly polarized along the $y$-axis.

The relative intensities of the Zeeman components are seen to be entirely determined by the initial and final values of $J$ and $M$, and not by any other properties of the levels.

The selection rules forbid electric dipole transitions between Zeeman components of the same level, since all these have the same parity. Such transitions occur as magnetic dipole transitions, for the same reason as was mentioned at the end of §50 in respect of transitions between hyperfine structure components of a level. Because of the selection rule for the number $M$, the transitions occur only between adjacent components ($M' - M = \pm 1$).†

The splitting of atomic levels in a weak electric field (the Stark effect), unlike that in a magnetic field, does not cause complete removal of the degeneracy with respect to directions of the angular momentum. All the levels except those with $M = 0$ remain doubly degenerate, each corresponding to two states with angular momentum components $M$ and $-M$.

The calculation of the relative intensities of the Stark components of a spectral line is exactly similar to that given above for the Zeeman effect.‡ It must be remembered that the intensity of the $\pi$ components includes contributions from the transitions $M \to M$ and $-M \to -M$ (when $M \ne 0$), and that of the $\sigma$ components includes contributions from the transitions $M \to M \pm 1$ and $-M \to -(M \pm 1)$. Hence, for example, in transverse observation the intensities of the $\pi$ components are proportional to

$$2\begin{pmatrix} J' & 1 & J \\ M & 0 & -M \end{pmatrix}^2,$$

and those of the $\sigma$ components are proportional to the sums

$$\frac12\begin{pmatrix} J' & 1 & J \\ M \pm 1 & \mp 1 & -M \end{pmatrix}^2 + \frac12\begin{pmatrix} J' & 1 & J \\ -M \mp 1 & \pm 1 & M \end{pmatrix}^2 = \begin{pmatrix} J' & 1 & J \\ M \pm 1 & \mp 1 & -M \end{pmatrix}^2$$

(when all the numbers in the second row change sign, the 3j-symbols can at most change sign, so that their squares are unaltered).
In an external field, even if it is weak, the total angular momentum $J$ is no longer strictly conserved; in a uniform field, only the angular momentum component $M$ is exactly conserved. Thus, in radiative transitions in a weak field, the conservation of angular momentum need not be rigorously maintained, and the atomic spectra may contain lines which are forbidden by the usual selection rules.

The calculation of the intensities of these lines is equivalent to the calculation of the corrections in the dipole moment matrix, which in turn requires the determination of corrections to the wave functions of stationary states. In the first approximation of perturbation theory (with respect to the weak external field), the wave function includes "admixtures" of states which are connected to the initial state by non-zero matrix elements of the perturbation ($-\mathbf{E}\cdot\mathbf{d}$ in the electric field): the admixture of a state $\psi_2$ in a state $\psi_1$ is

$$-\frac{\mathbf{E}\cdot\mathbf{d}_{21}}{E_1 - E_2}.$$

Thus the matrix element of the "forbidden" transition contains a term

$$-\frac{(\mathbf{E}\cdot\mathbf{d}_{21})\,\mathbf{d}_{32}}{E_1 - E_2},$$

which is not zero if transitions from the "intermediate" state 2 to the initial state 1 and final state 3 are allowed.

† The frequencies of these transitions are usually in the range corresponding to centimetre wavelengths, and are observed in absorption and induced emission (electron paramagnetic resonance): the absorbing atoms are in a strong constant magnetic field (which causes the Zeeman splitting) and a weak radio-frequency field of the resonance frequency.

‡ This refers to the quadratic Stark effect, which occurs in all atoms except hydrogen (see QM, §76). The field is assumed so weak that the level splitting which it causes is small even in comparison with the fine structure intervals.

§52. Radiation from atoms: the hydrogen atom

The hydrogen atom is the only one for which the transition matrix elements can be completely calculated in an analytical form (W.
Gordon, 1929).

The parity of a state of the hydrogen atom is $(-1)^l$, i.e. is uniquely determined by the orbital angular momentum of the electron (the number $l$, which defines the parity of the state, retains its significance for the exact relativistic wave functions, i.e. when the spin-orbit interaction is taken into account). The parity selection rule therefore strictly forbids electric dipole transitions without change of $l$; only transitions with $l \to l \pm 1$ are possible. There is, however, no restriction on the change in the principal quantum number $n$.

The dipole moment of the hydrogen atom is equivalent to the position vector of the electron: $\mathbf{d} = e\mathbf{r}$. Since the electron wave function in the hydrogen atom is the product of an angular part and the radial function $R_{nl}$, the reduced matrix elements of the position vector can also be written as the product

$$\langle n', l-1\|r\|nl\rangle = \langle l-1\|\nu\|l\rangle\int_0^\infty R_{n',l-1}\,r\,R_{nl}\,r^2\,dr,$$

where $\langle l-1\|\nu\|l\rangle$ are the reduced matrix elements of the unit vector $\boldsymbol{\nu}$ in the direction of $\mathbf{r}$. These are

$$\langle l-1\|\nu\|l\rangle = \langle l\|\nu\|l-1\rangle^* = i\sqrt{l};$$

see QM, (29.14). Thus

$$\langle n', l-1\|r\|nl\rangle = -\langle nl\|r\|n', l-1\rangle = i\sqrt{l}\int_0^\infty R_{n',l-1}R_{nl}\,r^3\,dr. \tag{52.1}$$

The non-relativistic radial functions of the discrete spectrum of the hydrogen atom are given in QM, (36.13):†

$$R_{nl} = \frac{2}{n^{l+2}(2l+1)!}\sqrt{\frac{(n+l)!}{(n-l-1)!}}\,(2r)^l e^{-r/n}F(-n+l+1,\,2l+2,\,2r/n). \tag{52.2}$$

† In the present section, atomic units are used. In ordinary units, the expressions given below for the matrix elements of the coordinate are multiplied by $\hbar^2/me^2$ (or by $\hbar^2/mZe^2$ for a hydrogen-like ion with atomic number $Z$).

The integral in (52.1), containing the product of two confluent hypergeometric functions, is calculated by means of the formulae in QM, §f.† The result is

$$\langle n', l-1\|r\|nl\rangle = i\sqrt{l}\,\frac{(-1)^{n'-l}}{4(2l-1)!}\sqrt{\frac{(n+l)!\,(n'+l-1)!}{(n-l-1)!\,(n'-l)!}}\;\frac{(4nn')^{l+1}(n-n')^{n+n'-2l-2}}{(n+n')^{n+n'}}\times$$

$$\times\left\{F\!\left(-n+l+1,\,-n'+l,\,2l,\,-\frac{4nn'}{(n-n')^2}\right) - \left(\frac{n-n'}{n+n'}\right)^2 F\!\left(-n+l-1,\,-n'+l,\,2l,\,-\frac{4nn'}{(n-n')^2}\right)\right\}, \tag{52.3}$$

where the $F(\alpha, \beta, \gamma, z)$ are hypergeometric functions. Since the parameters $\alpha$ and $\beta$ in this case are negative integers or zero, these functions reduce to polynomials.‡

For reference, the following are the expressions obtained from (52.3) in some particular cases (the values of $l$ being indicated by the spectroscopic symbols $s, p, d, \ldots$):

$$|\langle 1s\|r\|np\rangle|^2 = \frac{2^8n^7(n-1)^{2n-5}}{(n+1)^{2n+5}}, \qquad |\langle 2s\|r\|np\rangle|^2 = \frac{2^{17}n^7(n^2-1)(n-2)^{2n-6}}{(n+2)^{2n+6}},$$

$$|\langle 2p\|r\|nd\rangle|^2 = \frac{2^{20}n^9(n^2-1)(n-2)^{2n-7}}{3(n+2)^{2n+7}}. \tag{52.4}$$

Formula (52.3) is not applicable to transitions with no change in the principal quantum number $n$ (transitions between the fine structure components of a level). In this case ($n = n'$), the integration is carried out by expressing the radial functions in terms of generalized Laguerre polynomials:

$$R_{nl} = -\frac{2}{n^2}\sqrt{\frac{(n-l-1)!}{[(n+l)!]^3}}\;e^{-r/n}\left(\frac{2r}{n}\right)^l L^{2l+1}_{n+l}\!\left(\frac{2r}{n}\right). \tag{52.5}$$

In the integral

$$\int_0^\infty R_{n,l-1}R_{nl}\,r^3\,dr \propto \int_0^\infty e^{-\rho}\rho^{2l+2}L^{2l+1}_{n+l}(\rho)\,L^{2l-1}_{n+l-1}(\rho)\,d\rho,$$

we replace one of the polynomials by its expression in terms of a generating function (see QM, §d):

$$L^{2l+1}_{n+l}(\rho) = -\frac{(n+l)!}{(n-l-1)!}\,e^{\rho}\rho^{-(2l+1)}\left(\frac{d}{d\rho}\right)^{n-l-1}\!\left(e^{-\rho}\rho^{n+l}\right).$$

† In the notation used there, we have to calculate the integral $J^{12}_{2l+2}(-n+l+1,\,-n'+l)$. This is done by means of formulae (f.12)-(f.16).

‡ Numerical tabulations of the matrix elements and transition probabilities for hydrogen are given by H. A. Bethe and E. E. Salpeter, Handbuch der Physik 35, 88-436, Springer, Berlin, 1957.

After $n - l - 1$ integrations by parts, we obtain an integral of the form

$$\int_0^\infty e^{-\rho}\rho^{n+l}\left(\frac{d}{d\rho}\right)^{n-l-1}\!\left(\rho L^{2l-1}_{n+l-1}(\rho)\right)d\rho,$$

in which we replace the Laguerre polynomial by its explicit form:

$$L^m_n(\rho) = (-1)^m n!\sum_{k=0}^{n-m}\binom{n}{m+k}\frac{(-\rho)^k}{k!}.$$

After the differentiation in the sum, only three terms remain, and the integration is then elementary. The result is simply

$$\langle n, l-1\|r\|nl\rangle = i\sqrt{l}\cdot\tfrac32 n\sqrt{n^2 - l^2}. \tag{52.6}$$
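The closed form (52.6) and the first of the particular cases (52.4) can be checked by doing the radial integrals exactly. The sketch below is a verification aid under stated assumptions, not the method of the text: it builds the polynomial part of $R_{nl}$ from the confluent hypergeometric series $F(-n+l+1, 2l+2, 2r/n)$ as in (52.2), avoids the square-root normalization constants by working with the squared integral, and takes (52.6) and the $1s$-$np$ case of (52.4) in the forms $\tfrac94 n^2(n^2-l^2)$ and $2^8n^7(n-1)^{2n-5}/(n+1)^{2n+5}$ for the squared radial integral. Function names are illustrative.

```python
from fractions import Fraction as Fr
from math import factorial


def radial_poly(n, l):
    """Coefficients {power: c} of r**l * F(-n+l+1, 2l+2, 2r/n).

    This is R_nl of (52.2) without exp(-r/n) and without normalization."""
    coeffs = {}
    term = Fr(1)
    a, b = -n + l + 1, 2 * l + 2
    for k in range(n - l):  # the series terminates after n - l terms
        coeffs[l + k] = term
        term = term * Fr(a + k, (b + k) * (k + 1)) * Fr(2, n)
    return coeffs


def overlap(n1, l1, n2, l2, power):
    """Exact integral of r**power * u1 * u2 over r from 0 to infinity,
    where u = radial_poly * exp(-r/n) is the unnormalized radial function."""
    beta = Fr(1, n1) + Fr(1, n2)
    total = Fr(0)
    for k1, c1 in radial_poly(n1, l1).items():
        for k2, c2 in radial_poly(n2, l2).items():
            p = k1 + k2 + power
            total += c1 * c2 * factorial(p) / beta ** (p + 1)
    return total


def radial_integral_squared(n1, l1, n2, l2):
    """Square of the integral of R_{n1 l1} R_{n2 l2} r^3 dr for normalized R."""
    cross = overlap(n1, l1, n2, l2, 3)
    norm1 = overlap(n1, l1, n1, l1, 2)
    norm2 = overlap(n2, l2, n2, l2, 2)
    return cross * cross / (norm1 * norm2)


if __name__ == "__main__":
    # Same principal quantum number: compare with (9/4) n^2 (n^2 - l^2).
    for n in range(2, 5):
        for l in range(1, n):
            exact = radial_integral_squared(n, l - 1, n, l)
            print(n, l, exact, Fr(9, 4) * n * n * (n * n - l * l))
    # 1s -> np: compare with 2^8 n^7 (n-1)^(2n-5) / (n+1)^(2n+5).
    for n in range(2, 5):
        closed = Fr(2 ** 8 * n ** 7) * Fr(n - 1) ** (2 * n - 5) / Fr(n + 1) ** (2 * n + 5)
        print(n, radial_integral_squared(1, 0, n, 1), closed)
```

Multiplying the squared radial integral by $l$ gives the squared modulus of the reduced matrix element, by (52.1); the check is on squares, so it does not depend on the phase convention chosen for the radial functions.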
The integral

$$\int_0^\infty R_{n',l-1}R_{nl}\,r^3\,dr = \int_0^\infty \chi_{n',l-1}\,(r\chi_{nl})\,dr,$$

where $\chi_{nl} = rR_{nl}$, is the coefficient in the expansion of the function $r\chi_{nl}$ in terms of the orthogonal functions $\chi_{n',l-1}$ ($n' = 1, 2, \ldots$). The sum of the squared moduli of these coefficients is equal to the integral of the square of the expanded function.† Hence

$$\sum_{n'}|\langle n', l-1\|r\|nl\rangle|^2 = l\int_0^\infty r^2\chi_{nl}^2\,dr. \tag{52.7}$$

Using the known expression for the mean square of $r$ in the state $nl$ (see QM, (36.16)), we obtain the sum rule

$$\sum_{n'}|\langle n', l-1\|r\|nl\rangle|^2 = \tfrac12 ln^2[5n^2 + 1 - 3l(l+1)]. \tag{52.8}$$

For given $n$, $l$ and large $n'$, the matrix element of the transition $nl \to n'l'$ decreases according to

$$|\langle n'l'\|r\|nl\rangle|^2 \propto 1/n'^3, \tag{52.9}$$

as can be seen both from the particular expressions (52.4) and from the general formula (52.3). This result is to be expected: the Coulomb levels of energy $E' = -1/2n'^2$ have an almost continuous distribution when $n'$ is large, and the probability of a transition to a single level is proportional to the spacing $dE'/dn'$ of these levels, which in turn is proportional to $n'^{-3}$.

† The summation is over states of both the discrete spectrum and the continuous spectrum.

The Stark effect in hydrogen has the peculiarity that the splitting is proportional to the first power of the electric field (QM, §77). Here the field is assumed to be both weak enough for perturbation theory to be applicable and such that the level splitting is large compared with the fine structure of the levels. Under these conditions, the magnitude of the angular momentum is not conserved, and the levels have to be classified by the parabolic quantum numbers $n_1$, $n_2$, $m$. The last of these, the magnetic quantum number $m$, again determines the component of the orbital angular momentum along the $z$-axis (the direction of the field), which is conserved under the conditions stated (neglecting the spin-orbit interaction). It is therefore governed by the ordinary selection rule

$$m' - m = 0, \pm 1.$$
(52.10) There is no restriction on the changes in nx and n2. The matrix elements of the dipole moment in parabolic coordinates can also be calculated analytically. The resulting formulae, however, are very lengthy, and will not be given here.t PROBLEMS PROBLEM 1. Find the Stark splitting of hydrogen levels when it is small compared with the fine structure intervals (but large compared with the Lamb shift). SOLUTION. Under the conditions stated, there remains a twofold degeneracy of the unperturbed levels with / = j ± 5, and the Stark splitting is therefore again linear in the field. The splitting A is determined from the secular equation I -A - E(dz)\i\ A A ^ -.„ . . , the suffixes 1 and 2 correspond to states with I = j ± 2 and a given magnetic quantum number m ; the perturbation V = - Edz is diagonal in m and has no elements diagonal in J. The matrix element of the orbital quantity dz is calculated by means of formulae (29.7) and (109.3) in QM: <J. 1 - 1 . m|A|Jlm> = V [ j 0 . + 1X2J ^ 0 ] <j. I - 1H <jj-iiidiijj> = - ( 2 j + i ) { ' : 1 j ] } lsj of the hydrogen atom (G. Breit and E. Teller, 1940). SOLUTION. The process is strictly forbidden by parity for an El transition, and by the rule (46.15) for an £ 2 transition. We have therefore to calculate the probability of an M1 transition, given by (47.5). t These formulae and the corresponding numerical tabulations are to be found in the work by Bethe and Salpeter cited above. 1% §52 Radiation In the present case (I = 0), however, the magnetic moment is a purely spin quantity, and its matrix element is zero if the spin-orbit interaction is neglected, because of the mutual orthogonality of the orbital wave functions with different principal quantum numbers. This means that, to obtain a result other than zero, the PaulFs equation approximation is insufficient, and we must start from the complete Dirac's equation.. 
In the standard representation of the wave functions, the transition current ist jfi = ilßjaipi = <f>Jaxi + x1<*4>iAccording to (35.1), (24.2) and (24.8), the wave functions of the states with I = 0, j = 1 are * 4irl-ig(rX*-n)w(m)/f \x) where n = r/r, and w(m) is a real three-dimensional unit spinor corresponding to the spin projection value m. Thus jfi = ^JT {//g.H>/cr(a • n)wi - g//,vv/(cr • n)awi}. Substituting this in (47.4) and carrying out the integration over the directions of n, we find fifi = - (el6i)wfKT x awj = - \ewfOwj (from the Pauli matrix commutation rules, a x a = 2ia); here I = J</*.'+/®)r 3 «fr. o (1) The photon emission probability (47.5), summed over the values of m/, is w = (4e V/27)Wi<rW 2 = 4 e V / 2 / 9 . (2) From (35.4) we have, with K = - 1. 5 e + m + alr 2m \ r )Aml in the second term, the exact function / is replaced by the non-relativistic radial function R. With the approximation g = R'/2m, the integral 1 = 2 ^ / {RfRi)'r' dT = " 2^ / RfRif2 dT = °' o o since Rf and R< are orthogonal. In the next approximation, using (3), we find ' Ä 2ÏT / (M»1 dr + 4 ^ / { R , f R i ( E i " €f) ~ 7 <*'*>'>r3 d r o Since, from the orthogonality of the exact functions </* and ^/, when K, = K/, J(/i// + g*/)r 2 dr = 0, t In this Problem, relativistic units are used. (3) (4) §53 Radiation from Diatomic Molecules: Electronic Spectra 197 the first term in (4) may be written, after integration by parts, as 0 0 0 A calculation of the integral, with the functions Rf = 2(ma)V m o \ Rt = (1/V2)(ma)3(l - \mar)eJlmar (see QM, §36) and the energy difference w = ei - Ef = 2ma2(l - 1/22) = gma2, gives J = 23/2a2/9m. Hence the transition probability is (in ordinary units) M = ' -3^W^ = 2:^737ft = 5 •6X,0 SCC ' The corresponding lifetime of the 2s\ state is very long, and in practice it is much more likely that de-excitation will occur by the simultaneous emission of two photons; see the next-to-last footnote to §59. §53. 
Radiation from diatomic molecules: electronic spectra

The specific features of molecular spectra are mainly due to the partition of the energy of the molecule into electronic, vibrational and rotational parts, each part being small compared with the one before it. The level structure of diatomic molecules has been described in detail in QM, Chapter XI. Here we shall consider the resulting pattern of the spectrum and the calculation of line strengths.†

Let us take first the general case, in which the electronic state of the molecule (and therefore also, in general, the vibrational and rotational states) changes in the transition. The frequencies of such transitions lie in the visible and ultra-violet regions of the spectrum. They are spoken of collectively as the electronic spectrum of the molecule. We shall always be considering electric dipole transitions; those of other types are of little importance in molecular spectroscopy.

As with dipole transitions in any system, the following selection rule applies to the total angular momentum J of the molecule:

$$|J' - J| \le 1 \le J + J'. \qquad (53.1)$$

In the present case, the strict selection rule regarding the parity of the system corresponds to a selection rule regarding the sign of the level. (In the customary terminology of molecular spectroscopy, states having wave functions which do or do not change sign on inversion, i.e. when the coordinates of the electrons and the nuclei change sign, are called negative and positive states respectively.) Thus we have the rigorous rule

$$+ \to -, \qquad - \to +. \qquad (53.2)$$

† The discussion below is based on QM, §§78 and 82-88. For brevity, we shall not make constant reference to those sections.
If the molecule consists of identical atoms (with nuclei of the same isotope), the levels can be classified with respect to interchange of the coordinates of the nuclei: symmetric (s) levels, with wave functions which do not change sign under this transformation, and antisymmetric (a) levels, with functions which do change sign. Since the electron dipole moment operator is unaffected by this transformation, its matrix elements are non-zero only for transitions without change of this symmetry:†

$$s \to s, \qquad a \to a. \qquad (53.3)$$

This rule is not absolutely rigorous, however, since the existence of a given symmetry property of a level depends on a certain definite value of the total spin I of the nuclei in the molecule. Owing to the extreme weakness of the interaction between the nuclear spins and the electrons, the spin I is very nearly conserved, but not exactly. When this interaction is taken into account, I does not have a definite value, the symmetry property (s or a) is not conserved, and the selection rule (53.3) no longer applies.

The electron terms of a molecule consisting of identical atoms are also described by their parity (g or u), i.e. the behaviour of the wave functions when the electron coordinates (measured from the centre of the molecule) change sign while the coordinates of the nuclei remain unchanged. This property is closely related to the nuclear symmetry and the sign of the rotational levels belonging to this term. The levels which belong to an even (g) electron term can be s+ or a-, and those belonging to an odd (u) term can be s- or a+. The rules (53.2) and (53.3) therefore give the further rule

$$g \to u, \qquad u \to g. \qquad (53.4)$$

The rule (53.4) remains approximately valid for molecules consisting of different isotopes of the same element.
Since the nuclear charges are equal, we can consider the electron term with fixed nuclei, and thus have a system of electrons in an electric field which possesses a centre of symmetry at the midpoint of the line joining the nuclei. The symmetry of the electron wave function with respect to inversion in this point determines the parity of the term, and since the electric dipole moment vector changes sign under this transformation, we arrive at (53.4). The rule as derived in this way is only approximate, because the nuclei have been regarded as fixed, and it therefore ceases to be valid when the interaction between the electron state and the rotation of the molecule is taken into account.

Further selection rules depend on specific assumptions concerning the relative magnitude of the different interactions in the molecule (i.e. the type of coupling), and therefore can only be approximate.

The majority of the electron terms in diatomic molecules belong to coupling type a or b. Both these have the property that the coupling of the orbital angular momentum with the axis (the electric interaction between the two atoms in the molecule) is large compared with all other interactions. The quantum numbers $\Lambda$ and S therefore exist, these being respectively the component of the orbital angular momentum of the electrons along the axis of the molecule and the total spin of the electrons. The operator of an orbital quantity, the electron orbital angular momentum, commutes with the spin operator, so that

$$S' - S = 0 \quad \text{(cases a and b)}. \qquad (53.5)$$

The change in $\Lambda$ must satisfy the selection rule

$$\Lambda' - \Lambda = 0, \pm 1 \quad \text{(cases a and b)}, \qquad (53.6)$$

and for transitions between states with $\Lambda = 0$ ($\Sigma$ terms) there is a further rule

$$\Sigma^+ \to \Sigma^+, \qquad \Sigma^- \to \Sigma^- \quad \text{(cases a and b)}. \qquad (53.7)$$

(The states $\Sigma^+$ and $\Sigma^-$ differ as regards behaviour under reflection in a plane through the axis of the molecule.) The rules (53.6), (53.7) are obtained by considering the molecule in a system of coordinates fixed to the nuclei (see QM, §87); the rule (53.6) is analogous to the selection rule for the magnetic quantum number in atoms.

† The rule (53.3) is clearly valid for transitions of any multipole order.

The coupling types a and b differ as regards the relation between the spin-axis interaction energy and the rotation energy (the intervals between rotational levels). In case a the former is greater, in case b it is much smaller. We shall now examine these cases separately.

Case a. Here the quantum number $\Sigma$ exists, which is the component of the total spin along the axis of the molecule (and therefore so does the number $\Omega = \Sigma + \Lambda$, the component of the total angular momentum). If both the initial state and the final state belong to case a, then we have the rule

$$\Sigma' - \Sigma = 0 \quad \text{(case a)}, \qquad (53.8)$$

which follows from the fact, already mentioned, that the dipole moment commutes with the spin. From (53.6) and (53.8) it follows that†

$$\Omega' - \Omega = 0, \pm 1. \qquad (53.9)$$

If $\Omega = \Omega' = 0$, then in addition to the general rule (53.1) the transitions with J' = J are forbidden:‡

$$J' - J = \pm 1 \quad \text{when } \Omega = \Omega' = 0 \text{ (case a)}. \qquad (53.10)$$

† The rule (53.9) remains valid in case c also (where the coupling of the orbital angular momentum with the axis is small compared with the spin-orbit coupling), although the numbers $\Lambda$ and $\Sigma$ do not separately exist.

‡ The rule (53.10) is analogous to the prohibition of atomic transitions with J = J' when M = M' = 0 (see §51), but that rule was of possible interest only in the presence of an external field. Here the rule follows immediately from formula (53.12) below; the 3j-symbol in it, with $\Omega = \Omega' = 0$, is zero if J' = J, since the sum J' + J + 1 is then odd.

Let us consider transitions between any two specified vibrational levels belonging to two different electron terms (of type a).
When the fine structure of the electron term is taken into account, each of these levels splits into several components, the number of which, 2S + 1, must be the same for both, according to the rule (53.5). According to the rule (53.8), each component of one level combines with only one component of the other level, having the same value of $\Sigma$. Let us next take a pair of levels with the same $\Sigma$; the values of $\Omega$ and $\Omega'$ can differ (like $\Lambda$ and $\Lambda'$) by 0 or ±1.

When rotation is taken into account, each level splits into a series of levels with different values of J and J' in the ranges $J \ge |\Omega|$, $J' \ge |\Omega'|$. The dependence of the transition probabilities on these numbers can be derived in a general form (H. Hönl and F. London, 1925).

The matrix element of the transition $n\Lambda\Omega J M_J \to n'\Lambda'\Omega' J' M'_J$ (where n denotes the characteristics of the electron term other than $\Omega$ and $\Lambda$) is

$$|\langle n'\Lambda'\Omega'J'M'_J|d_q|n\Lambda\Omega J M_J\rangle| = \sqrt{(2J+1)(2J'+1)}\,\left|\begin{pmatrix} J' & 1 & J \\ -\Omega' & q' & \Omega \end{pmatrix}\begin{pmatrix} J' & 1 & J \\ -M'_J & q & M_J \end{pmatrix}\langle n'\Lambda'|\bar d_{q'}|n\Lambda\rangle\right|, \qquad (53.11)$$

where $d_q$ and $\bar d_{q'}$ are respectively the spherical components of the dipole moment vector in the fixed coordinate system xyz and in the "moving" system $\xi\eta\zeta$ with the $\zeta$-axis along the axis of the molecule. This formula is derived by means of QM, (110.6). The matrix elements $\langle n'\Lambda'|\bar d_{q'}|n\Lambda\rangle$ are independent of the rotational quantum numbers J, J', and depend only on the characteristics of the electron terms (and in this case are also independent† of the number $\Sigma$; the numbers $\Omega' = \Lambda' + \Sigma$ and $\Omega = \Lambda + \Sigma$ are therefore omitted in the notation for the matrix element).

The probability of the transition $n\Lambda\Omega J \to n'\Lambda'\Omega'J'$ is proportional to the square of the matrix element (53.11) after summation over $M'_J$.
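The summation over the angular-momentum projections just mentioned can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes the standard Racah formula for the 3j-symbol, and the helper names `wigner_3j` and `m_sum` are hypothetical. It sums the square of the second 3j-symbol in (53.11) over q and $M'_J$ for a fixed $M_J$; the result does not depend on $M_J$.

```python
from math import factorial, sqrt


def wigner_3j(j1, j2, j3, m1, m2, m3):
    """Wigner 3j-symbol from the Racah formula (half-integer arguments allowed)."""
    # Doubled quantum numbers are integers, which avoids rounding problems.
    a, b, c, al, be, ga = (int(round(2 * x)) for x in (j1, j2, j3, m1, m2, m3))
    if al + be + ga != 0:
        return 0.0                      # projections must add up to zero
    if (a + b + c) % 2 or c < abs(a - b) or c > a + b:
        return 0.0                      # triangle rule violated
    for j, m in ((a, al), (b, be), (c, ga)):
        if abs(m) > j or (j + m) % 2:
            return 0.0                  # |m| <= j and j + m must be an integer

    def f(doubled):
        # factorial of an integer given in doubled form
        return factorial(doubled // 2)

    delta = f(a + b - c) * f(a - b + c) * f(-a + b + c) / f(a + b + c + 2)
    norm = sqrt(delta * f(a + al) * f(a - al) * f(b + be) * f(b - be)
                * f(c + ga) * f(c - ga))
    k_min = max(0, (b - c - al) // 2, (a - c + be) // 2)
    k_max = min((a + b - c) // 2, (a - al) // 2, (b + be) // 2)
    total = 0.0
    for k in range(k_min, k_max + 1):
        den = (factorial(k) * factorial((a + b - c) // 2 - k)
               * factorial((a - al) // 2 - k) * factorial((b + be) // 2 - k)
               * factorial((c - b + al) // 2 + k)
               * factorial((c - a - be) // 2 + k))
        total += (-1) ** k / den
    return (-1) ** ((a - b - ga) // 2) * norm * total


def m_sum(j_final, j_initial, m_initial):
    """Sum of the squared 3j-symbol (J' 1 J; -M' q M) over q and M' = M + q."""
    return sum(
        wigner_3j(j_final, 1, j_initial, -(m_initial + q), q, m_initial) ** 2
        for q in (-1, 0, 1)
    )
```

For example, `m_sum(2, 1, 0)` and `m_sum(2, 1, 1)` agree to rounding error, which is the property that allows the $M_J$-dependence to drop out of the transition probability.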
Using the formula QM (106.12),

$$\sum_{q,\,M'_J}\begin{pmatrix} J' & 1 & J \\ -M'_J & q & M_J \end{pmatrix}^2 = \frac{1}{2J+1},$$

we obtain

$$w(n\Lambda\Omega J \to n'\Lambda'\Omega'J') = (2J'+1)\begin{pmatrix} J' & 1 & J \\ -\Omega' & \Omega'-\Omega & \Omega \end{pmatrix}^2 B(n', n; \Lambda', \Lambda), \qquad (53.12)$$

where the coefficients B are independent of J and J' (we are, of course, neglecting the relatively very small difference in the frequencies of transitions with different J and J').‡

† The independence of $\Sigma$ can be shown in the same way as was done for the scalar f in QM, beginning of §29. In the present case the operator of the vector quantity d commutes with that of the vector S, which is conserved (in the zero-order approximation), and $\Sigma$ is the component of S along the $\zeta$-axis in the rotating coordinate system in which the condition of commutability of d and S has to be considered.

‡ Each of the rotational levels J considered splits into two when $\Lambda$-doubling is taken into account; one of the two is positive and the other negative. Thus, instead of one transition J → J', we have, using the selection rule (53.2), two transitions: from the positive and negative components of the level J to the negative and positive components, respectively, of the level J'. The probabilities of these transitions are equal.

If we sum (53.12) with respect to J', then (because of the orthogonality of the 3j-symbols, QM (106.13)) the result is simply $B(n', n; \Lambda', \Lambda)$. Thus the total probability of transitions from a rotational level J of the state $\Omega$ to all levels J' of the state $\Omega'$ is independent of J.

Case b. Here the quantum number K exists, which is the angular momentum of the molecule without regard to its spin, as well as the total angular momentum J. The selection rules for K are the same as the general selection rules for any orbital vector quantity (such as the electric dipole moment):

$$|K' - K| \le 1 \le K + K' \quad \text{(case b)}, \qquad (53.13)$$

together with the prohibition of a transition with K = K' when $\Lambda = \Lambda' = 0$ (corresponding to (53.10)):

$$K' - K = \pm 1 \quad \text{when } \Lambda = \Lambda' = 0. \qquad (53.14)$$

Let us consider transitions between rotational components of specified vibrational levels of two electron states belonging to type b. The probabilities of such transitions are given by the same formula (53.12), with K and $\Lambda$ instead of J and $\Omega$. When the fine structure is taken into account (for S ≠ 0), each rotational level K splits into 2S + 1 components with J = |K - S|, ..., K + S, and so a multiplet appears in place of the single line J → J'. Since in this case we have addition of the angular momenta K and S, which are free (i.e. not coupled to the axis of the molecule), the formulae for the relative transition probabilities for the various lines in the multiplet are the same as the corresponding formulae (49.15) for the fine structure components of atomic spectra, where the corresponding angular momenta (in the case of LS coupling) are L and S.

Thus we have examined the selection rules governing the possible spectral lines in all the fundamental cases that can occur in diatomic molecules. The group of lines arising from transitions between rotational components of two given electronic-vibrational levels forms what is called in spectroscopy a band; the lines in a band are very close together, because the rotational intervals are small. The frequencies of these lines are given by the differences

$$\hbar\omega_{JJ'} = \text{constant} + BJ(J+1) - B'J'(J'+1), \qquad (53.15)$$

where B and B' are the rotational constants in the two electronic states; in order to avoid unnecessary complications, the electron terms are assumed to be singlets. For J' = J, J ± 1, formula (53.15) is represented graphically (Fig. 2) by three parabolic branches, whose points for integral J give the values of the frequencies. (The arrangement of the branches in Fig. 2 corresponds to the case B' < B. If B' > B, their open ends are towards small values of $\omega$, and the branch with J' = J - 1 is the highest.)† The existence of a branch which passes through an apex is seen from the diagram to cause the lines to become increasingly dense towards a certain limiting position (the head of the band).

† The series of lines corresponding to transitions with J' = J + 1, J, J - 1 are called the P, Q and R branches respectively.

FIG. 2.

In connection with line strengths, mention should be made of the curious effect of alternating intensities in certain bands of the electronic spectra of molecules consisting of atoms of the same isotope (W. Heisenberg and F. Hund, 1927). The symmetry conditions pertaining to the nuclear spins have the result that, in the electron $\Sigma$ terms, the rotational components with even and odd K have opposite symmetry with respect to the nuclei, and therefore different nuclear statistical weights $g_s$ and $g_a$ (see QM, §86). According to the rule (53.14), only J' = J ± 1 is allowed in transitions between the two different $\Sigma$ terms, and according to the rule (53.4), one of the $\Sigma$ terms must be even and the other odd. The result is that, for a given value of J' - J, transitions with successive values of J take place alternately between pairs of symmetric levels and pairs of antisymmetric levels, as shown in Fig. 3 for the example of the states $\Sigma_g$ and $\Sigma_u$. The observed line strength is proportional to the number of molecules in the initial state concerned, and therefore to its statistical weight. Thus the intensities of successive lines (J = 0, 1, 2, ...) will be alternately greater and less, being alternately proportional to $g_s$ and $g_a$ (this behaviour being superimposed on the monotonic variation given by formula (53.12)).†

There are no exact selection rules concerning the change in the vibrational quantum number in transitions between two different electron terms.
There is, however, a rule (Franck and Condon's principle) whereby the most probable change in the vibrational state may be predicted. It is based on the fact that the motion of the nuclei is quasi-classical, because of their large mass (cf. the discussion of pre-dissociation in QM, §90).† In the integral which determines the matrix element of the transition between vibrational states E and E' of electron terms U(r) and U'(r), the most important range is the neighbourhood of the point $r = r_0$ where

$$U(r_0) - U'(r_0) = E - E', \qquad (53.16)$$

i.e. the momenta of the relative motion of the nuclei in the two states are the same, p = p'. For a given value of E, the transition probability as a function of the final energy E' increases as each of the differences E - U and E' - U' decreases, and is a maximum when

$$E - U(r_0) = E' - U'(r_0) = 0, \qquad (53.17)$$

i.e. when the "transition point" $r_0$ (the root of equation (53.16)) coincides with the classical turning point of the nuclei. (Fig. 4 illustrates graphically this relationship between E and the most probable E'.) This can be intuitively expressed by saying that the transition is most probable near the turning point of the nuclei, where they spend a relatively large amount of time.

FIG. 4.

† In the discussion of alternating intensities we assume that all the states having different values of the total nuclear spin are uniformly occupied.

§54. Radiation from diatomic molecules: vibrational and rotational spectra

The selection rules and formulae for transition probabilities given in §53 remain valid for transitions in which the electronic state of the molecule is unchanged.† Here we shall discuss only some particular features of these transitions.

First of all, the selection rule (53.4) prohibits all (dipole) transitions without change of electronic state in molecules consisting of like atoms, since in such a transition the parity of the electron term would remain unaltered.
It follows from the discussion in §53 that such transitions can be allowed only when the interaction between the nuclear spins and the electrons is considered or, for molecules of different isotopes of the same element, because of the effect of the rotation on the electronic state.

† For the Franck-Condon principle it is, strictly speaking, also necessary that the vibrational quantum number should be sufficiently large.

† Transitions in which the vibrational (and therefore the rotational) state changes form what is called the vibrational spectrum of the molecule; this lies in the near infra-red (wavelengths < 20 μm). Transitions in which only the rotational state changes form the rotational spectrum in the far infra-red (wavelengths > 20 μm).

The calculation of the dipole moment matrix elements is reduced (by the formulae in QM, §87) to a calculation in a coordinate system rotating with the molecule. The wave function of the molecule in these coordinates is the product of the wave function of the electrons for a given distance r between the nuclei and the wave function of the vibrational motion of the nuclei in the effective field U(r) of the electrons and the nuclei. When the influence of the motion of the nuclei on the electronic state is entirely neglected, the initial and final electron wave functions for the transitions in question are the same. The integration over the electron coordinates therefore gives, in the matrix element, simply the mean dipole moment $\bar d$ of the molecule (which is obviously along its axis) as a function of the distance r. Owing to the smallness of the vibrations, the function $\bar d(r)$ can be expanded in powers of the vibrational coordinate $q = r - r_0$. For transitions which involve a change in the vibrational state, the zero-order term in the expansion does not occur in the matrix element, because the wave functions are orthogonal for vibrational motion in the same field U(q), and this leaves the term which is proportional to q.
If the vibrations are regarded as harmonic, it follows from the known properties of the linear oscillator (QM, §23) that the matrix elements are zero except for transitions between adjacent vibrational states; thus, for the vibrational quantum number v we have the selection rule

$$v' - v = \pm 1. \qquad (54.1)$$

This rule is not valid, however, when the vibrations are not harmonic and the subsequent terms in the expansion of the function $\bar d(q)$ are taken into account.

For a purely rotational transition (with no change in the vibrational state), the matrix element of the dipole moment component along the moving $\zeta$-axis can simply be equated to the mean dipole moment of the molecule, $\bar d \approx \bar d(0)$.† The probability of the transition J → J - 1 is then

$$w(J\Omega \to J-1, \Omega) = \frac{4\omega^3}{3\hbar c^3}\,\bar d^{\,2}\,\frac{J^2 - \Omega^2}{J(2J+1)}, \qquad (54.2)$$

and from this formula we can calculate not only the relative probabilities (as in (53.12)) but also the absolute probabilities. Formula (54.2) is for case a; in case b, J and $\Omega$ are to be replaced by K and $\Lambda$.

The frequencies of the purely rotational transitions are given by the differences of the rotational energies BJ(J + 1):

$$\hbar\omega_{J,J-1} = 2BJ. \qquad (54.3)$$

Successive lines are at equal distances 2B.

† It is obvious from symmetry that $\bar d = 0$ in a molecule consisting of like atoms.

§55. Radiation from nuclei

For $\gamma$ radiation from nuclei it is usually true that the dimensions of the system (the radius R of the nucleus) are small compared with the wavelength of the photon. But the intervals between the nuclear levels, and therefore the energy of the $\gamma$ quantum, are generally small compared with the energy per nucleon in the nucleus. The quantity $R/\lambda$ is thus not directly related to the velocity v/c of the nucleons in the nucleus, and is in general considerably less than v/c. Accordingly, the probability of $Ml$ radiation is usually greater than that of $E,\,l+1$ radiation (cf. the beginning of §50).
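To give a feeling for how small the nuclear radius is compared with the photon wavelength, the ratio can be estimated numerically. The sketch below is an illustration only: the empirical radius law $R \approx r_0 A^{1/3}$ with $r_0 = 1.2$ fm and the sample photon energy are assumptions that do not come from the text, and the function name `k_times_R` is hypothetical.

```python
HBAR_C_MEV_FM = 197.327  # hbar * c in MeV fm


def k_times_R(photon_energy_mev, mass_number, r0_fm=1.2):
    """Dimensionless ratio k R = (photon energy) * R / (hbar c).

    The radius law R = r0 * A**(1/3) with r0 = 1.2 fm is an assumed
    empirical estimate, not a formula taken from the text.
    """
    radius_fm = r0_fm * mass_number ** (1.0 / 3.0)
    return photon_energy_mev * radius_fm / HBAR_C_MEV_FM


# A 1 MeV photon and a nucleus with A = 125 (R about 6 fm under the assumption)
ratio = k_times_R(1.0, 125)
```

Under these assumptions the ratio is a few hundredths, so each additional multipole order costs a large factor in the transition probability.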
The general selection rules for the total angular momentum (the "spin") of the nucleus and for the parity are the same as for radiation from any system. The characteristic feature of nuclear radiation is that transitions of high multipole order commonly occur. Unlike atoms, whose radiation is usually of the electric dipole type, nuclei undergo such transitions comparatively rarely at low energies, on account of the selection rules.

If a radiative transition of a nucleus can be regarded as a single-particle transition (a change of state of one nucleon while the state of the rest of the nucleus remains unchanged), then there are additional selection rules regarding the angular momentum of that nucleon, but in practice such "single-particle" selection rules are found to be only approximately obeyed.

The selection rules for the isotopic spin are peculiar to nuclei. The component $T_3$ of the isotopic spin is determined by the atomic weight and atomic number of the nucleus: $T_3 = \tfrac{1}{2}(Z - N) = Z - \tfrac{1}{2}A$. When the value of $T_3$ is specified, the absolute magnitude of the isotopic spin can take any value $T \ge |T_3|$. The selection rule for the number T in radiative transitions arises because the electric and magnetic moment operators of the nucleus, expressed in terms of the isotopic spin operators of the nucleons, are the sums of a scalar and the third component of a vector in isotopic space; see QM, §116. Their matrix elements are therefore zero unless

$$T' - T = 0, \pm 1. \qquad (55.1)$$

This rule itself, however, imposes no special restrictions on transitions in light nuclei (the only ones for which the isotopic spin can be said with reasonable accuracy to be conserved); the low levels of these nuclei in fact include none with T > 1.

For E1 transitions, however, there is a further rule which arises because there is no isotopic-scalar part for the electric dipole moment, and the operator of this moment is simply the third component of the isotopic vector (see QM, §116).
Hence, if $T_3 = 0$, transitions with $\Delta T = 0$ are also forbidden, and so, in nuclei having equal numbers of neutrons and protons (N = Z, A = 2Z), E1 transitions are possible only if

$$T' - T = \pm 1 \quad (T_3 = 0). \qquad (55.2)$$

The accuracy with which this rule is obeyed depends, of course, on the exactness of conservation of the isotopic spin of the nucleus.

The probability of E1 transitions in the nucleus is influenced also by the recoil of the rest of the nucleus when a particular nucleon moves. The result is that the protons contribute to the dipole moment with an effective charge e(1 - Z/A) instead of e, and the neutrons with a charge -eZ/A instead of zero (see QM, §118). The decreased effective charge of the proton causes some reduction in the probability of E1 transitions.

The energy levels of non-spherical nuclei have a rotational structure, and therefore such nuclei show a characteristic rotational structure of the $\gamma$-ray spectrum.

The symmetry of the field in which the nucleons move in a "fixed" non-spherical (axial) nucleus is the same as the symmetry of the field in which electrons move in a "fixed" diatomic molecule consisting of like atoms (the point group $C_{\infty h}$). The symmetry properties of the levels of a non-spherical nucleus (and hence the selection rules for the matrix elements) are therefore analogous to those of the diatomic molecule (see QM, §119). In particular, electric dipole transitions within a single rotation band (i.e. without change in the internal state of the nucleus) are forbidden, as in a diatomic molecule of like atoms; cf. §54. Such transitions therefore occur as E2 or M1 transitions. In the former the total angular momentum J of the nucleus can change by 2 or 1, in the latter by 1.
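Returning for a moment to the recoil correction for E1 transitions mentioned above, the effective charges e(1 - Z/A) and -eZ/A are easy to tabulate. The following minimal sketch (the function names are hypothetical, and charges are measured in units of e) also checks a simple consequence of those two expressions: summed over all Z protons and A - Z neutrons, the effective charges cancel exactly.

```python
from fractions import Fraction


def effective_e1_charges(Z, A):
    """Effective E1 charges (in units of e) of a proton and of a neutron."""
    z_over_a = Fraction(Z, A)
    return 1 - z_over_a, -z_over_a


def total_effective_charge(Z, A):
    """Sum of the effective charges over Z protons and A - Z neutrons."""
    q_proton, q_neutron = effective_e1_charges(Z, A)
    return Z * q_proton + (A - Z) * q_neutron


# For a nucleus with N = Z the two charges are +1/2 and -1/2.
q_p, q_n = effective_e1_charges(8, 16)
```

Exact rational arithmetic is used so that the cancellation appears as an exact zero rather than as a rounding-level residue.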
According to (46.9), the probability of a quadrupole transition, summed over values of the component M' of the total angular momentum of the nucleus in the final state, is

$$w_{E2} = \frac{\omega^5}{15}\sum_{M'}\left|\langle J'\Omega M'|Q^{(e)}_{2,-m}|J\Omega M\rangle\right|^2,$$

where J is the total angular momentum of the nucleus, $\Omega$ its component along the axis of the nucleus, and m = M - M'. By means of QM, (110.8) this sum can be expressed in terms of the squares of given quantities, the diagonal (with respect to the internal state of the nucleus) quadrupole transition moments $Q_{2\lambda}$, defined relative to coordinates $\xi\eta\zeta$ moving with the nucleus. Here $\lambda = \Omega - \Omega'$, so that in the case considered ($\Omega' = \Omega$) only the component $Q_{20}$ appears. The quantity

$$eQ_0 = e\int\rho\,(2\zeta^2 - \xi^2 - \eta^2)\,d\xi\,d\eta\,d\zeta = -2e\,Q_{20}$$

is called simply the quadrupole moment of the nucleus. Hence

$$w_{E2}(\Omega J \to \Omega J') = \frac{\omega^5}{60}\,e^2Q_0^2\,(2J'+1)\begin{pmatrix} J' & 2 & J \\ -\Omega & 0 & \Omega \end{pmatrix}^2.$$

Explicitly, we have

$$w_{E2}(\Omega J \to \Omega, J-1) = \frac{\omega^5}{20}\,e^2Q_0^2\,\frac{\Omega^2(J^2-\Omega^2)}{(J-1)J(J+1)(2J+1)},$$

$$w_{E2}(\Omega J \to \Omega, J-2) = \frac{\omega^5}{40}\,e^2Q_0^2\,\frac{(J^2-\Omega^2)[(J-1)^2-\Omega^2]}{(J-1)J(2J-1)(2J+1)}. \qquad (55.3)$$

The following remark is necessary concerning these formulae. They include matrix elements calculated with wave functions of the form

$$\psi_{J\Omega M} = \text{constant} \times \chi_\Omega\,D^{(J)}_{\Omega M}(\mathbf n),$$

where $\chi_\Omega$ is the wave function of the internal state of the nucleus. These functions correspond to values of the $\zeta$-component of the angular momentum which are definite in both magnitude and sign. In nuclei, however, states have only a definite parity and a definite magnitude of the angular-momentum component (usually taken as $\Omega$). Hence, if $\Omega \ne 0$, the initial and final wave functions would have to be taken as combinations of the form

$$\frac{1}{\sqrt 2}\left(\psi_{J\Omega M} \pm \psi_{J,-\Omega,M}\right).$$

The products of the first terms and of the second terms will give the same value as above for the quadrupole moment matrix elements, but the cross-products will lead to non-vanishing integrals if $2\Omega \le 2$.† Hence formula (55.3) is not strictly valid if $\Omega = \tfrac{1}{2}$ or 1.
In these cases the transition probability contains an additional term which cannot be expressed in terms of the mean value of the quadrupole moment.‡

In a similar manner to the derivation of (55.3), we obtain for the M1 transition probability
where $\mu$ is the magnetic moment of the nucleus. This formula is not valid if $\Omega = \tfrac{1}{2}$.

† For the matrix elements of $2^l$-pole moments, the integrands of the cross-products will include products of three D functions. The angle integral will not be zero if $q' = -2\Omega$, and the range of values of q' is only from -l to +l; thus we must have $2\Omega \le l$.

‡ This term in fact gives a significant correction only in the case $\Omega = \tfrac{1}{2}$, when the coupling between the rotation and the internal state of the nucleus is especially large (see QM, §119).

§56. The photoelectric effect: non-relativistic case

In §§49-52 we have discussed radiative transitions (with emission or absorption of a photon) between atomic levels of the discrete spectrum. The photoelectric effect differs from such a photon absorption process only in that the final state belongs to the continuous spectrum.

The cross-section for the photoelectric effect can be calculated in an exact analytical form for the hydrogen atom and for a hydrogen-like ion (with atomic number $Z \ll 137$).

In the initial state, the electron is at a discrete level $\varepsilon_i = -I$ (where I is the ionization potential of the atom) and the photon has a definite momentum k. In the final state, the electron has momentum p (and energy $\varepsilon_f = \varepsilon$). Since p takes a continuous series of values, the cross-section for the photoelectric effect is

$$d\sigma = 2\pi|V_{fi}|^2\,\delta(-I + \omega - \varepsilon)\,\frac{d^3p}{(2\pi)^3} \qquad (56.1)$$

(cf. (44.3)); the wave function of the final state of the electron is normalized to "one particle per unit volume".
The wave function of the photon is, as before, normalized in the same way; in order to obtain the cross-section $d\sigma$, the probability dw then has to be divided by the photon flux density (which is c/V = c when V = 1), but when relativistic units are used this does not affect the form of (56.1).

As in (45.2), we choose the three-dimensionally transverse gauge for the photon. Then

$$V_{fi} = -e\mathbf A\cdot\mathbf j_{fi} = -e\sqrt{4\pi}\,\frac{1}{\sqrt{2\omega}}\,M_{fi},$$

where

$$M_{fi} = \int\psi'^*(\boldsymbol\alpha\cdot\mathbf e)\,e^{i\mathbf k\cdot\mathbf r}\,\psi\,d^3x, \qquad (56.2)$$

with $\psi \equiv \psi_i$ and $\psi' \equiv \psi_f$ the initial and final wave functions of the electron. Putting in (56.1) $d^3p \to p^2\,d|\mathbf p|\,do = \varepsilon|\mathbf p|\,d\varepsilon\,do$ and integrating to remove the delta function of $\varepsilon$, we can write this formula as

$$d\sigma = e^2\,\frac{\varepsilon|\mathbf p|}{2\pi\omega}\,|M_{fi}|^2\,do. \qquad (56.3)$$

The calculations will be given in two cases, which differ as regards the magnitude of the photon energy: $\omega \gg I$ and $\omega \ll m$. Since $I \sim me^4Z^2 \ll m$, these two ranges partly overlap (when $I \ll \omega \ll m$), and so an examination of the two cases gives an essentially complete description of the photoelectric effect.

We shall take first the case

$$\omega \ll m. \qquad (56.4)$$

The electron velocity is then small in both the initial and the final state, and the problem is therefore entirely non-relativistic as regards the electron. Accordingly, we replace $\boldsymbol\alpha$ in (56.2) by the non-relativistic velocity operator $\mathbf v = -i\nabla/m$ (cf. §45). We can also use the dipole approximation, putting $e^{i\mathbf k\cdot\mathbf r} \approx 1$, i.e. neglecting the momentum of the photon in comparison with that of the electron. Then, with $\varepsilon$ in (56.3) replaced by m,

$$d\sigma = e^2\,\frac{m|\mathbf p|}{2\pi\omega}\,|\mathbf e\cdot\mathbf v_{fi}|^2\,do. \qquad (56.5)$$

We shall consider the photoelectric effect from the ground level of the hydrogen atom (or of a hydrogen-like ion). Then

$$\psi = \frac{(Ze^2m)^{3/2}}{\sqrt\pi}\,e^{-Ze^2mr}; \qquad (56.6)$$

in ordinary units, $me^2$ becomes $1/a_0$, where $a_0 = \hbar^2/me^2$ is the Bohr radius.

The wave function $\psi'$ must be taken such that its asymptotic form comprises a plane wave ($e^{i\mathbf p\cdot\mathbf r}$) together with an ingoing spherical wave; cf.
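QM, §136, where this function was denoted by $\psi^{(-)}_{\mathbf p}$. On account of the selection rule for l, a transition from an s state can only be to a p state (in the dipole case). Thus, in the expansion†

$$\psi^{(-)}_{\mathbf p} = \frac{1}{2p}\sum_{l=0}^{\infty} i^l(2l+1)\,e^{-i\delta_l}\,R_{pl}(r)\,P_l(\mathbf n\cdot\mathbf n_1), \qquad (56.7)$$

where $\mathbf n = \mathbf p/p$, $\mathbf n_1 = \mathbf r/r$, it is sufficient to retain the term with l = 1. Omitting unimportant phase factors, we therefore have

$$\psi' = \frac{3}{2p}\,(\mathbf n\cdot\mathbf n_1)\,R_{p1}(r). \qquad (56.8)$$

† In the rest of this section, p denotes $|\mathbf p|$.

With the functions $\psi$ and $\psi'$ given by (56.6) and (56.8), we have (again omitting phase factors)

$$\mathbf e\cdot\mathbf v_{fi} = \frac{3(Ze^2m)^{5/2}}{2\sqrt\pi\,mp}\int(\mathbf n\cdot\mathbf n_1)(\mathbf n_1\cdot\mathbf e)\,e^{-Ze^2mr}R_{p1}(r)\,d^3x = \frac{2\sqrt\pi\,(Ze^2m)^{5/2}}{pm}\,(\mathbf n\cdot\mathbf e)\int_0^\infty r^2e^{-Ze^2mr}R_{p1}(r)\,dr.$$

According to QM (36.18), (36.24), the radial function is (in the units employed here)

$$R_{p1} = \frac{\sqrt{8\pi}\,Ze^2m}{3}\sqrt{\frac{1+\nu^2}{\nu(1-e^{-2\pi\nu})}}\;pr\,e^{-ipr}F(2+i\nu, 4, 2ipr), \qquad (56.9)$$

with $\nu = Ze^2m/p$ ($= Ze^2/\hbar v$).

The integral can be calculated by means of the formula

$$\int_0^\infty e^{-\lambda z}z^{\gamma-1}F(\alpha, \gamma, kz)\,dz = \Gamma(\gamma)\,\lambda^{\alpha-\gamma}(\lambda-k)^{-\alpha};$$

cf. QM, (f.3). Noticing also that

$$\left(\frac{\nu+i}{\nu-i}\right)^{i\nu} = e^{-2\nu\cot^{-1}\nu},$$

we obtain

$$\mathbf e\cdot\mathbf v_{fi} = \frac{2^{7/2}\pi\nu^3\,(\mathbf n\cdot\mathbf e)}{m\sqrt p\,(1+\nu^2)^{3/2}}\;\frac{e^{-2\nu\cot^{-1}\nu}}{\sqrt{1-e^{-2\pi\nu}}}.$$

The energy of ionization from the ground level of the hydrogen atom (or a hydrogen-like ion) is $I = Z^2e^4m/2$. Hence

$$\omega = \frac{p^2}{2m} + I = \frac{p^2}{2m}(1+\nu^2). \qquad (56.10)$$

Using this relation, we obtain as the final expression for the cross-section for the photoelectric effect with emission of an electron into the solid-angle element do

$$d\sigma = 2^7\pi\alpha a^2\left(\frac{I}{\hbar\omega}\right)^4\frac{e^{-4\nu\cot^{-1}\nu}}{1-e^{-2\pi\nu}}\,(\mathbf n\cdot\mathbf e)^2\,do, \qquad (56.11)$$

where $a = \hbar^2/mZe^2 = a_0/Z$ (ordinary units are used here and below). The angular distribution of the emitted electrons is governed by the factor $(\mathbf n\cdot\mathbf e)^2$. This has maxima in the directions parallel to the direction of polarization of the incident photons, and is zero in directions perpendicular to e, including the direction of incidence. For unpolarized photons, formula (56.11) must be averaged over the directions of e, which is equivalent to substituting $(\mathbf n\cdot\mathbf e)^2 \to \tfrac{1}{2}(\mathbf n_0\times\mathbf n)^2$ with $\mathbf n_0 = \mathbf k/k$; see (45.4b).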
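The averaging over polarization directions just described can be verified numerically. The sketch below is an illustration under simple assumptions: the photon direction $\mathbf n_0$ is taken along the z-axis, the electron direction n lies in the xz-plane at polar angle theta, and the function names are hypothetical. It averages $(\mathbf n\cdot\mathbf e)^2$ over polarization vectors e perpendicular to $\mathbf n_0$ and compares the result with $\tfrac{1}{2}(\mathbf n_0\times\mathbf n)^2 = \tfrac{1}{2}\sin^2\theta$.

```python
import math


def polarization_average(theta, samples=360):
    """Average of (n.e)^2 over unit vectors e perpendicular to n0 = z-axis."""
    n = (math.sin(theta), 0.0, math.cos(theta))  # electron direction
    total = 0.0
    for i in range(samples):
        phi = 2.0 * math.pi * i / samples
        e = (math.cos(phi), math.sin(phi), 0.0)  # polarization, e . n0 = 0
        total += (n[0] * e[0] + n[1] * e[1] + n[2] * e[2]) ** 2
    return total / samples


def half_cross_product_squared(theta):
    """(1/2) |n0 x n|^2 for n0 along z and n at polar angle theta."""
    return 0.5 * math.sin(theta) ** 2
```

The two functions agree for every theta; in particular both vanish at theta = 0, so that no electrons are emitted along the direction of incidence.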
Integration of formula (56.11) over all angles gives the total cross-section for the photoelectric effect:

$$\sigma = \frac{2^9\pi^2}{3}\,\alpha a^2\left(\frac{I}{\hbar\omega}\right)^4\frac{e^{-4\nu\cot^{-1}\nu}}{1-e^{-2\pi\nu}} \qquad (56.12)$$

(M. Stobbe, 1930). The limiting value of $\sigma$ as $\hbar\omega \to I$ (i.e. as $\nu \to \infty$) is

$$\sigma = \frac{2^9\pi^2}{3e^4}\,\alpha a^2 = \frac{2^9\pi^2}{3e^4}\,\frac{\alpha a_0^2}{Z^2} = 0.23\,a_0^2/Z^2, \qquad (56.13)$$

where e in the denominator is the base of natural logarithms. The cross-section for the photoelectric effect tends to a constant limit near the threshold, as it must for a reaction forming charged particles (see QM, §147).

The case in which $\hbar\omega \gg I$ (still with $\hbar\omega \ll mc^2$) corresponds to the Born approximation ($\nu = Ze^2/\hbar v \ll 1$). Formula (56.12) becomes

$$\sigma = \frac{2^8\pi}{3}\,\alpha a_0^2 Z^5\left(\frac{I_0}{\hbar\omega}\right)^{7/2}, \qquad (56.14)$$

where $I_0 = e^4m/2\hbar^2$ is the ionization energy of the hydrogen atom.

The process inverse to the photoelectric effect is the radiative recombination of an electron with an ion at rest. The cross-section $\sigma_{\rm rec}$ for this process can be found from that for the photoelectric effect ($\sigma_{\rm ph}$) by means of the principle of detailed balancing (QM, §144). This principle states that the cross-sections for the processes i → f and f → i (with two particles in each of the states i and f) are related by

$$g_i\,p_i^2\,\sigma_{i\to f} = g_f\,p_f^2\,\sigma_{f\to i},$$

where $p_i$, $p_f$ are the momenta of the relative motion of the particles, and $g_i$, $g_f$ the spin statistical weights of the states i and f. Since g = 2 for the photon (which has two possible directions of polarization), we find for the hydrogen atom ground state

$$\sigma_{\rm rec} = \sigma_{\rm ph}\cdot\frac{2k^2}{p^2}, \qquad (56.15)$$

where p = mv is the momentum of the incident electron and k that of the emitted photon.

PROBLEMS

PROBLEM 1. Derive formula (56.14) by direct use of the Born approximation in the non-relativistic case.

SOLUTION. In the Born approximation, $\psi'$ in (56.5) is simply the plane wave $\psi' = e^{i\mathbf p\cdot\mathbf r}$, and $\psi$ is again the function (56.6). Then

$$\mathbf v_{fi} = \mathbf v_{if}^* = \left(\frac{1}{m}\int\psi^*\hat{\mathbf p}\,\psi'\,d^3x\right)^* = \frac{\mathbf p}{m}\,\frac{(Ze^2m)^{3/2}}{\sqrt\pi}\,\left(e^{-Ze^2mr}\right)_{\mathbf p}.$$

The Fourier component is given by (57.6b), and so

$$\mathbf v_{fi} = 8\sqrt\pi\,p^{-3}m^{3/2}(Ze^2)^{5/2}\,\mathbf n.$$
Substitution in (56.5) and integration over $do$ leads to (56.14) (here, with sufficient accuracy, $p^2/2m \approx \omega$).

PROBLEM 2. Determine the total cross-section for radiative recombination of a fast but non-relativistic electron ($I \ll mv^2 \ll mc^2$) with a nucleus having charge $Z \ll 137$.

SOLUTION. The cross-section for capture to the K shell (principal quantum number $n = 1$) is obtained by substituting (56.14) in (56.15):

$$\sigma^{\rm rec}_1 = \frac{2^{7}\pi}{3}\,Z^{5}\alpha^{3}a_0^{2}\left(\frac{I_0}{\varepsilon}\right)^{5/2},$$

where $\varepsilon = \tfrac12 mv^2$ is the energy of the incident electron, and $\hbar\omega \approx \varepsilon$. Among the other states of the resulting atom, only $s$ states are important: in the calculation of the matrix element in the Born approximation, the important values are those of the wave function of the bound state when $r$ is small (as will be seen in §57), and when $l > 0$ these values are small compared with those for $l = 0$. It is sufficient to take the first two terms in the expansion of $\psi$ in powers of $r$. For states with $l = 0$ and any $n$, these terms are

$$\psi = \left(1 - \frac{r}{a}\right)\frac{1}{\sqrt{\pi a^{3}n^{3}}},$$

i.e. they contain $n$ only as a common factor $n^{-3/2}$; this expression is obtained by expansion of QM, (36.13). The total recombination cross-section is therefore

$$\sigma^{\rm rec} = \sum_{n=1}^{\infty}\sigma^{\rm rec}_n = \sigma^{\rm rec}_1\sum_{n=1}^{\infty}n^{-3} = \zeta(3)\,\sigma^{\rm rec}_1.$$

The value of the zeta function is $\zeta(3) = 1.202$.

§57. The photoelectric effect: relativistic case

Let us now consider the case

$$\omega \gg I. \tag{57.1}$$

Here we have also $\varepsilon - m = \omega - I \gg I$, and the influence of the Coulomb field of the nucleus on the wave function $\psi'$ of the emitted electron can be taken into account by means of perturbation theory. We shall write

$$\psi' = \frac{1}{\sqrt{2\varepsilon}}\left(u'e^{i\mathbf p\cdot\mathbf r} + \psi^{(1)}\right). \tag{57.2}$$

The electron may be relativistic, and therefore the unperturbed function in (57.2) is written as a relativistic plane wave (23.1). Although the electron is non-relativistic in the initial state, its wave function $\psi$ must nevertheless, for reasons to be explained below, include the relativistic correction ($\sim Ze^2$). This function is (cf.
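§39, Problem)

$$\psi = \left(1 - \frac{i}{2m}\gamma^{0}\boldsymbol\gamma\cdot\nabla\right)\frac{u}{\sqrt{2m}}\,\psi_{\text{non-r}}, \tag{57.3}$$

where $\psi_{\text{non-r}}$ is the non-relativistic bound-state function (56.6), and $u$ is the bispinor amplitude of the electron at rest, normalized by the usual condition $\bar u u = 2m$. We substitute the functions (57.2), (57.3) in the matrix element (56.2):†

$$M_{fi} = \frac{1}{2\sqrt{m\varepsilon}}\int\left\{\bar u'(\boldsymbol\gamma\cdot\mathbf e)\left[\left(1 - \frac{i}{2m}\gamma^{0}\boldsymbol\gamma\cdot\nabla\right)u\,\psi_{\text{non-r}}\right]e^{-i(\mathbf p-\mathbf k)\cdot\mathbf r} + \bar\psi^{(1)}(\boldsymbol\gamma\cdot\mathbf e)\,u\,e^{i\mathbf k\cdot\mathbf r}\,\psi_{\text{non-r}}\right\}d^3x. \tag{57.4}$$

† The function (57.3) has been derived for distances $r \sim 1/mZe^2$, at which the relative order of magnitude of the correction term is $Ze^2$. But for the ground state (and for all $s$ states) formula (57.3) is valid for any $r$, since the derivative of the purely exponential function (56.6), and therefore the correction term in (57.3), are always proportional to $Ze^2$. This enables us to use formula (57.3) in the present problem, where (as we shall see below) it is the small values of $r$ that are important.

In order to derive the first term of the expansion of this quantity in powers of $Ze^2$, we can replace $\psi_{\text{non-r}}$ in the second term in the braces by the constant $(Ze^2m)^{3/2}/\sqrt{\pi}$ simply. The first term would vanish if treated in this way when $\mathbf p - \mathbf k \neq 0$, and it is for this reason that the first relativistic correction, proportional to $Ze^2$, has to be included in $\psi$. When $v \sim 1$, this correction gives a contribution to the cross-section that is of the same order as the contribution of the next term in the expansion of $\psi_{\text{non-r}}$ in powers of $Ze^2$.

In the first term in (57.4) we integrate by parts, transferring the action of the operator $\nabla$ from $\psi_{\text{non-r}}$ to the exponential factor. The result is

$$M_{fi} = \frac{1}{2\sqrt{m\varepsilon}}\left\{\bar u'(\boldsymbol\gamma\cdot\mathbf e)\left[1 + \frac{1}{2m}\gamma^{0}\boldsymbol\gamma\cdot(\mathbf p-\mathbf k)\right]u\,(\psi_{\text{non-r}})_{\mathbf p-\mathbf k} + \frac{(Ze^2m)^{3/2}}{\sqrt{\pi}}\,\bar\psi^{(1)}_{-\mathbf k}(\boldsymbol\gamma\cdot\mathbf e)\,u\right\}, \tag{57.5}$$

where the vector suffix denotes the spatial Fourier component. As far as the $Ze^2$ term we have†

$$(\psi_{\text{non-r}})_{\mathbf p-\mathbf k} = \frac{(Ze^2m)^{3/2}}{\sqrt{\pi}}\left(e^{-Ze^2mr}\right)_{\mathbf p-\mathbf k} = \frac{(Ze^2m)^{3/2}}{\sqrt{\pi}}\,\frac{8\pi Ze^2m}{(\mathbf p-\mathbf k)^{4}}. \tag{57.6}$$

To calculate the Fourier components $\psi^{(1)}_{\mathbf k}$, we write down the equation satisfied by the function $\psi^{(1)}$:

$$(\gamma^{0}\varepsilon + i\boldsymbol\gamma\cdot\nabla - m)\,\psi^{(1)} = e(\gamma^{\mu}A_{\mu})\,u'e^{i\mathbf p\cdot\mathbf r} = -\frac{Ze^2}{r}\,\gamma^{0}u'e^{i\mathbf p\cdot\mathbf r},$$

obtained by substituting (57.2) in (32.1).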
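Applying the operator $\gamma^{0}\varepsilon + i\boldsymbol\gamma\cdot\nabla + m$ to both sides, we find

$$(\Delta + \mathbf p^{2})\,\psi^{(1)} = -Ze^2(\gamma^{0}\varepsilon + i\boldsymbol\gamma\cdot\nabla + m)(\gamma^{0}u')\,\frac{e^{i\mathbf p\cdot\mathbf r}}{r}.$$

Multiplying this equation by $e^{-i\mathbf k\cdot\mathbf r}$ and integrating with respect to $d^3x$, with the usual integration by parts in the terms containing $\Delta$ and $\nabla$, gives

$$(\mathbf p^{2} - \mathbf k^{2})\,\psi^{(1)}_{\mathbf k} = -Ze^2(\gamma^{0}\varepsilon - \boldsymbol\gamma\cdot\mathbf k + m)(\gamma^{0}u')\,\frac{4\pi}{(\mathbf k-\mathbf p)^{2}} = -Ze^2\big(2\varepsilon\gamma^{0} - \boldsymbol\gamma\cdot(\mathbf k-\mathbf p)\big)(\gamma^{0}u')\,\frac{4\pi}{(\mathbf k-\mathbf p)^{2}}.$$

† Taking the Fourier component of each side of the equation

$$(\Delta - \lambda^{2})\,\frac{e^{-\lambda r}}{r} = -4\pi\delta(\mathbf r),$$

we obtain

$$\left(\frac{e^{-\lambda r}}{r}\right)_{\mathbf k} = \frac{4\pi}{\mathbf k^{2}+\lambda^{2}}, \tag{57.6a}$$

and differentiation with respect to $\lambda$ gives

$$\left(e^{-\lambda r}\right)_{\mathbf k} = \frac{8\pi\lambda}{(\mathbf k^{2}+\lambda^{2})^{2}}. \tag{57.6b}$$

In the last line of the equation for $\psi^{(1)}_{\mathbf k}$ we have used the fact that the amplitude $u'$ satisfies the equation $(\varepsilon\gamma^{0} - \mathbf p\cdot\boldsymbol\gamma - m)u' = 0$, or $(\varepsilon\gamma^{0} + \mathbf p\cdot\boldsymbol\gamma - m)\gamma^{0}u' = 0$. Hence

$$\bar\psi^{(1)}_{-\mathbf k} = \psi^{(1)*}_{\mathbf k}\gamma^{0} = 4\pi Ze^2\,\bar u'\,\frac{2\varepsilon\gamma^{0} + \boldsymbol\gamma\cdot(\mathbf k-\mathbf p)}{(\mathbf k-\mathbf p)^{2}(\mathbf k^{2}-\mathbf p^{2})}\,\gamma^{0}. \tag{57.7}$$

Substituting (57.6) and (57.7) in the matrix element (57.5), we can write it as

$$M_{fi} = \frac{4\pi^{1/2}(Ze^2m)^{5/2}}{(\varepsilon m)^{1/2}(\mathbf k-\mathbf p)^{2}}\,\bar u'Au,$$

where

$$A = a(\boldsymbol\gamma\cdot\mathbf e) + (\boldsymbol\gamma\cdot\mathbf e)\gamma^{0}(\boldsymbol\gamma\cdot\mathbf b) + (\boldsymbol\gamma\cdot\mathbf c)\gamma^{0}(\boldsymbol\gamma\cdot\mathbf e),$$

$$a = \frac{1}{(\mathbf k-\mathbf p)^{2}} + \frac{\varepsilon}{m}\,\frac{1}{\mathbf k^{2}-\mathbf p^{2}},\qquad \mathbf b = -\frac{\mathbf k-\mathbf p}{2m(\mathbf k-\mathbf p)^{2}},\qquad \mathbf c = \frac{\mathbf k-\mathbf p}{2m(\mathbf k^{2}-\mathbf p^{2})}.$$

The cross-section is

$$d\sigma = \frac{8e^{2}(Ze^2m)^{5}|\mathbf p|}{m\omega(\mathbf k-\mathbf p)^{4}}\,(\bar u'Au)(\bar u\bar Au')\,do,$$

where $\bar A = \gamma^{0}A^{+}\gamma^{0}$; see §65. This expression has to be summed over final directions and averaged over initial directions of the electron spin, using the rules given in §65 below and the polarization density matrices of the initial and final states:

$$\rho = \tfrac12 m(\gamma^{0}+1),\qquad \rho' = \tfrac12(\gamma^{0}\varepsilon - \boldsymbol\gamma\cdot\mathbf p + m);$$

in the initial state, $\mathbf p = 0$ and $\varepsilon = m$. The resulting expression is

$$d\sigma = \frac{16e^{2}(Ze^2m)^{5}|\mathbf p|}{m\omega(\mathbf k-\mathbf p)^{4}}\,\mathrm{tr}\,(\rho'A\rho\bar A)\,do.$$

The calculation of the trace by means of the formulae (22.22) is purely algebraic, and the result is

$$\mathrm{tr}\,(\rho'A\rho\bar A) = \frac{m}{\varepsilon+m}\,\big[a\mathbf p - (\mathbf b-\mathbf c)(\varepsilon+m)\big]^{2} + 4m(\mathbf b\cdot\mathbf e)\big[(\varepsilon+m)(\mathbf c\cdot\mathbf e) + a(\mathbf p\cdot\mathbf e)\big];$$

the vector $\mathbf e$ is assumed real, i.e. the photon is assumed to be linearly polarized.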
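The trace above can be checked numerically. The sketch below is an illustration added here, not the book's method. It builds the Dirac matrices in the standard representation and forms $A$, $\bar A = \gamma^0A^+\gamma^0$, $\rho$ and $\rho'$ as defined in the text. It then compares $\mathrm{tr}(\rho'A\rho\bar A)$ with the closed form $\frac{m}{\varepsilon+m}[a\mathbf p-(\mathbf b-\mathbf c)(\varepsilon+m)]^2 + 4m(\mathbf b\cdot\mathbf e)[(\varepsilon+m)(\mathbf c\cdot\mathbf e)+a(\mathbf p\cdot\mathbf e)]$. The prefactor $m/(\varepsilon+m)$ is taken here as an assumption to be tested. The helper names are hypothetical, and $a$, $\mathbf b$, $\mathbf c$, $\mathbf p$ are treated as arbitrary real inputs with $\mathbf e$ a real unit vector.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
SIGMA = (
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
)
# Dirac matrices in the standard representation.
G0 = np.block([[I2, Z2], [Z2, -I2]])
GAMMA = tuple(np.block([[Z2, s], [-s, Z2]]) for s in SIGMA)


def gdot(v):
    """Three-dimensional product gamma . v."""
    return sum(vi * gi for vi, gi in zip(v, GAMMA))


def trace_direct(a, b, c, e, p, m):
    """tr(rho' A rho Abar) evaluated with explicit 4x4 matrices."""
    eps = np.sqrt(m * m + p @ p)
    A = a * gdot(e) + gdot(e) @ G0 @ gdot(b) + gdot(c) @ G0 @ gdot(e)
    A_bar = G0 @ A.conj().T @ G0
    rho = 0.5 * m * (G0 + np.eye(4))
    rho_prime = 0.5 * (eps * G0 - gdot(p) + m * np.eye(4))
    return np.trace(rho_prime @ A @ rho @ A_bar).real


def trace_closed(a, b, c, e, p, m):
    """Closed-form expression for the same trace (real e, |e| = 1)."""
    eps = np.sqrt(m * m + p @ p)
    v = a * p - (b - c) * (eps + m)
    first = m / (eps + m) * (v @ v)
    second = 4.0 * m * (b @ e) * ((eps + m) * (c @ e) + a * (p @ e))
    return first + second


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    e = rng.normal(size=3)
    e /= np.linalg.norm(e)
    args = (rng.normal(), rng.normal(size=3), rng.normal(size=3), e,
            rng.normal(size=3), 1.0)
    print(trace_direct(*args), trace_closed(*args))
```

Agreement for random inputs supports the algebra only; it says nothing about the kinematic substitutions for $a$, $\mathbf b$, $\mathbf c$ made afterwards.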
The formula for the photoelectric effect cross-section will be put in its final form by using the polar angle $\theta$ and the azimuth $\phi$ of the direction of $\mathbf p$ when the direction of $\mathbf k$ is the $z$-axis and the plane of $\mathbf k$ and $\mathbf e$ is the $xz$ plane (so that $\mathbf p\cdot\mathbf e = |\mathbf p|\cos\phi\sin\theta$). When $\omega \gg I$, the conservation of energy may be written in the form $\varepsilon - m = \omega$ instead of $\varepsilon - m = \omega - I$. We then easily see that

$$\mathbf k^{2} - \mathbf p^{2} = -2m(\varepsilon - m),\qquad (\mathbf k - \mathbf p)^{2} = 2\varepsilon(\varepsilon - m)(1 - v\cos\theta),$$

where $v = p/\varepsilon$ is the velocity of the photoelectron. A simple calculation gives finally

$$d\sigma = Z^{5}\alpha^{4}r_e^{2}\,\frac{v^{3}(1-v^{2})^{3}\sin^{2}\theta}{\big[1-\sqrt{1-v^{2}}\big]^{5}(1-v\cos\theta)^{4}}\left\{\frac{\big[1-\sqrt{1-v^{2}}\big]^{2}}{2(1-v^{2})^{3/2}}\,(1-v\cos\theta) + \left[2 - \frac{\big[1-\sqrt{1-v^{2}}\big](1-v\cos\theta)}{1-v^{2}}\right]\cos^{2}\phi\right\}do, \tag{57.8}$$

where $r_e = e^2/m$.

In the ultra-relativistic case ($\varepsilon \gg m$), the photoelectric effect cross-section has a sharp peak at small angles $\theta \sim \sqrt{1-v^{2}}$, i.e. the electrons are emitted predominantly in the direction of incidence of the photon. Near the maximum $1 - v\cos\theta \approx \tfrac12[(1-v^{2}) + \theta^{2}]$, and the leading terms in (57.8) give

$$d\sigma \approx 4Z^{5}\alpha^{4}r_e^{2}\,\frac{(1-v^{2})^{3/2}\,\theta^{3}}{(1-v^{2}+\theta^{2})^{3}}\,d\theta\,d\phi. \tag{57.9}$$

The integration of (57.8) over angles is elementary but lengthy, and leads to the following expression for the total cross-section (F. Sauter, 1931):

$$\sigma = 2\pi Z^{5}\alpha^{4}r_e^{2}\,\frac{(\gamma^{2}-1)^{3/2}}{(\gamma-1)^{5}}\left\{\frac43 + \frac{\gamma(\gamma-2)}{\gamma+1}\left[1 - \frac{1}{2\gamma\sqrt{\gamma^{2}-1}}\log\frac{\gamma+\sqrt{\gamma^{2}-1}}{\gamma-\sqrt{\gamma^{2}-1}}\right]\right\}, \tag{57.10}$$

where the "Lorentz factor" $\gamma$ is used for brevity:

$$\gamma = \frac{1}{\sqrt{1-v^{2}}} = \frac{\varepsilon}{m} \approx \frac{m+\omega}{m}. \tag{57.11}$$

In the ultra-relativistic case, this formula reduces to the simple expression

$$\sigma = 2\pi Z^{5}\alpha^{4}r_e^{2}/\gamma; \tag{57.12}$$

in the case $I \ll \omega \ll m$, the limit of small $\gamma - 1$ in (57.10) yields the already known result (56.14).

§58. Photodisintegration of the deuteron

A distinctive property of the deuteron is that its binding energy is small in comparison with the depth of the potential well. This enables reactions involving the deuteron to be described without a detailed knowledge of the behaviour of nuclear forces, using only the binding energy (see QM, §133).
Here it is assumed that the wavelengths of the colliding particles are large compared with the range $a$ of the action of nuclear forces. This applies to the disintegration of the deuteron by $\gamma$ quanta having $ka \ll 1$. It will also be assumed that $pa \ll 1$, where $p$ is the momentum of relative motion of the neutron and proton released; this is a stronger condition than $ka \ll 1$.†

† The photon energy for which $pa \approx 1$ (with $a = 1.5 \times 10^{-13}$ cm) is 15 MeV.

We start from the non-relativistic formula (56.5) for the photoelectric effect cross-section, integrating over all directions:

$$\sigma = \frac{e^{2}pM}{4\pi\omega}\int|\mathbf e\cdot\mathbf v_{fi}|^{2}\,do,$$

where $p$ is the momentum of relative motion of the proton and neutron,‡ and $m$ in (56.5) has been replaced by their reduced mass $M/2$ (where $M$ is the nucleon mass). The matrix element is that of the proton velocity $\mathbf v_p$, since only the proton interacts with the photon. Expressing $\mathbf v_p$ in terms of the momentum $\mathbf p$ ($\mathbf v_p = \tfrac12\mathbf v = \mathbf p/M$), we have

$$\sigma^{(e)} = \frac{e^{2}p}{3M\omega}\,|\mathbf p_{fi}|^{2}. \tag{58.1}$$

‡ In this section, $p$ denotes $|\mathbf p|$.

The superscript (e) denotes that this formula corresponds to electric dipole transitions: $e\mathbf p/M = e\mathbf v_p = \dot{\mathbf d}$, so that $e\mathbf p_{fi}/M = i\omega\,\mathbf d_{fi}$.

The normalized wave function of the initial (ground) state of the deuteron is

$$\psi = \sqrt{\frac{\kappa}{2\pi}}\,\frac{e^{-\kappa r}}{r},\qquad \kappa = \sqrt{MI}, \tag{58.2}$$

where $I = 2.23$ MeV is the binding energy (see QM, §133).† The wave function of the final state can be taken to be that of free motion, i.e. the plane wave

$$\psi' = e^{i\mathbf p\cdot\mathbf r}. \tag{58.3}$$

The reason is that, in the theory under consideration, the "size of the deuteron" $1/\kappa$ is assumed large in comparison with the effective interaction radius $a$. The interaction between the proton and the neutron therefore has to be taken into account only in $S$ states, and can be neglected in states with $l \neq 0$, whose wave functions are small at small distances.
According to the selection rules, electric dipole transitions between two $S$ states (the ground state and an $S$ state of the continuous spectrum) are forbidden, and it is therefore possible in this case to neglect the nucleon interaction in the final state.

Integration by parts gives the matrix element

$$\mathbf p_{fi} = \mathbf p\int e^{-i\mathbf p\cdot\mathbf r}\psi\,d^3x = \mathbf p\,\sqrt{\frac{\kappa}{2\pi}}\,\frac{4\pi}{p^{2}+\kappa^{2}};$$

see the second footnote to §57. Using also the equation

$$\omega - I = p^{2}/M,$$

which expresses the conservation of energy, we finally obtain the photodisintegration cross-section (in ordinary units) as

$$\sigma^{(e)} = \frac{8\pi}{3}\,\alpha\,\frac{\hbar^{2}}{M}\,\frac{\sqrt{I}\,(\hbar\omega - I)^{3/2}}{(\hbar\omega)^{3}} \tag{58.4}$$

(H. A. Bethe and R. Peierls, 1935). It has a maximum at $\hbar\omega = 2I$, and tends to zero as $\hbar\omega \to I$ or $\hbar\omega \to \infty$.

† This function can be made more accurate by including a correction due to the finiteness of $a$, the normalization coefficient in (58.2) being replaced by $\sqrt{\kappa/2\pi(1 - a\kappa)}$; see QM, (133.13). A factor $1/(1 - a\kappa)$ accordingly appears in the cross-section formulae. This correction is in fact quite large: for the ground state of the deuteron, $a\kappa \approx 0.4$. The deuteron ground state is $^3S_1$, with a small "admixture" of $^3D_1$ due to the action of the tensor nuclear forces (see QM, §117). This admixture will be neglected, and therefore so will the tensor forces.

The electric dipole absorption of the photon, described by formula (58.4), does not, however, give the main contribution to the cross-section near the photoelectric effect threshold ($\hbar\omega$ close to $I$). This is because, in this range, the principal effect must come from transitions to an $S$ state, and these do not occur in electric dipole absorption. Nor do they occur in electric quadrupole absorption, since, although they do not then violate the parity selection rule, they are forbidden by the selection rule for orbital angular momentum (the tensor forces are here neglected, and $\mathbf L$ and $\mathbf S$ are therefore separately conserved).
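As a rough numerical illustration of the Bethe-Peierls formula (58.4), the sketch below evaluates $\sigma^{(e)} = \frac{8\pi}{3}\alpha\frac{\hbar^2}{M}\frac{\sqrt I(\hbar\omega-I)^{3/2}}{(\hbar\omega)^3}$ and locates its maximum on a grid. The constants $\hbar c$, $Mc^2$ and $\alpha$ are rounded standard values supplied here as assumptions, not taken from the text. The function names are illustrative only, and the finite-range correction mentioned in the footnote is ignored.

```python
import math

ALPHA = 1.0 / 137.036   # fine-structure constant (rounded)
HBAR_C = 197.327        # MeV * fm (rounded)
M_C2 = 938.92           # mean nucleon rest energy in MeV (assumed value)
I_D = 2.23              # deuteron binding energy in MeV, as quoted in the text


def sigma_electric(hw: float) -> float:
    """Electric dipole photodisintegration cross-section (58.4) in fm^2."""
    if hw <= I_D:
        return 0.0
    hbar2_over_m = HBAR_C ** 2 / M_C2   # hbar^2 / M in MeV * fm^2
    shape = math.sqrt(I_D) * (hw - I_D) ** 1.5 / hw ** 3
    return (8.0 * math.pi / 3.0) * ALPHA * hbar2_over_m * shape


def peak_energy(step: float = 0.001, hw_max: float = 20.0) -> float:
    """Photon energy (MeV) at which sigma_electric is largest on a grid."""
    n = int((hw_max - I_D) / step)
    grid = [I_D + step * j for j in range(1, n + 1)]
    return max(grid, key=sigma_electric)


if __name__ == "__main__":
    hw_peak = peak_energy()
    print(f"peak at {hw_peak:.3f} MeV, sigma = {sigma_electric(hw_peak):.4f} fm^2")
```

The grid search should return a peak close to $\hbar\omega = 2I \approx 4.46$ MeV, in line with the statement following (58.4).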
To calculate the photodisintegration cross-section near the threshold, we have therefore to consider magnetic dipole absorption, for which the selection rules allow transitions between $S$ states (E. Fermi, 1935).

Replacing the electric moment in (58.1) by the magnetic moment, we have

$$\sigma^{(m)} = \tfrac13\,\omega Mp\,|\boldsymbol\mu_{fi}|^{2}. \tag{58.5}$$

The magnetic moment of the orbital motion makes no contribution to $\boldsymbol\mu_{fi}$, since the orbital angular momentum $\mathbf L$ has no matrix elements for transitions between $S$ states. The spin magnetic moment is

$$\boldsymbol\mu = 2\mu_p\mathbf s_p + 2\mu_n\mathbf s_n = 2(\mu_p - \mu_n)\mathbf s_p + 2\mu_n\mathbf S,$$

where $\mathbf S = \mathbf s_p + \mathbf s_n$, and $\mu_p$, $\mu_n$ are the magnetic moments of the proton and the neutron. When the tensor nuclear forces are neglected, the total spin is conserved, and its operator therefore yields no transitions. Hence

$$\boldsymbol\mu_{fi} = 2(\mathbf s_p)_{fi}(\mu_p - \mu_n).$$

In the same approximation (neglecting the tensor forces), the spin and coordinate variables are separable. The matrix element, like the wave functions, becomes a product of a spin part and a coordinate part:

$$\boldsymbol\mu_{fi} = 2(\mu_p - \mu_n)\,\langle s_pS'M'|\mathbf s_p|s_pSM\rangle\int\psi'^{*}(\mathbf r)\,\psi(\mathbf r)\,d^3x.$$

But the presence of spin-spin nuclear forces has the result that the wave equation for the coordinate functions $\psi(\mathbf r)$ includes the spin value $S$ as a parameter. If $S' = S$, then $\psi'(\mathbf r)$ and $\psi(\mathbf r)$ are eigenfunctions of the same operator, and are therefore orthogonal. Thus a photodisintegration from an initial $^3S$ state can occur only to a $^1S$ state of the continuous spectrum.

The square $|\boldsymbol\mu_{fi}|^{2}$ in (58.5) must, of course, be averaged over components $M$ of the spin $S$ in the initial state. Thus the problem is to calculate the quantity

$$\frac{1}{2S+1}\sum_{M,M'}\big|\langle s_pS'M'|\mathbf s_p|s_pSM\rangle\big|^{2},$$

where $s_p = s_n = \tfrac12$, $S = 1$, $S' = 0$. The general rules for matrix elements in the addition of angular momenta give

$$\frac{1}{2S+1}\big|\langle s_pS'\|s_p\|s_pS\rangle\big|^{2} = (2S'+1)\begin{Bmatrix} s_p & S' & s_n\\ S & s_p & 1\end{Bmatrix}^{2}\big|\langle s_p\|s_p\|s_p\rangle\big|^{2} = \tfrac16\big|\langle s_p\|s_p\|s_p\rangle\big|^{2}$$

(see QM, (107.11), (109.3)). The reduced matrix element is

$$\langle s_p\|s_p\|s_p\rangle = \sqrt{s_p(s_p+1)(2s_p+1)} = \sqrt{3/2}.$$
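The spin factor can also be obtained by brute force, without 6j symbols. The sketch below is an added illustration with hypothetical helper names. It builds the two-spin-$\tfrac12$ space explicitly and sums $|\langle 0\,0|\mathbf s_p|1\,M\rangle|^2$ over the three Cartesian components of $\mathbf s_p$ and over $M$, then divides by $2S+1 = 3$. If the reduction above is right, the result should be $\tfrac16\cdot\tfrac32 = \tfrac14$.

```python
import numpy as np

PAULI = (
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
)


def proton_spin_operators():
    """Cartesian components of s_p acting on the (proton x neutron) spin space."""
    return [0.5 * np.kron(s, np.eye(2)) for s in PAULI]


def triplet_and_singlet():
    """Return ([|1,1>, |1,0>, |1,-1>], |0,0>) as vectors in the product basis."""
    up = np.array([1, 0], dtype=complex)
    dn = np.array([0, 1], dtype=complex)
    uu, ud = np.kron(up, up), np.kron(up, dn)
    du, dd = np.kron(dn, up), np.kron(dn, dn)
    triplet = [uu, (ud + du) / np.sqrt(2.0), dd]
    singlet = (ud - du) / np.sqrt(2.0)
    return triplet, singlet


def averaged_spin_factor() -> float:
    """(1/(2S+1)) * sum over M and components of |<00| s_p |1M>|^2 with S = 1."""
    triplet, singlet = triplet_and_singlet()
    total = 0.0
    for state in triplet:
        for op in proton_spin_operators():
            total += abs(np.vdot(singlet, op @ state)) ** 2
    return total / len(triplet)


if __name__ == "__main__":
    print(averaged_spin_factor())
```

Summing over Cartesian components is equivalent to summing over spherical components, which is why no explicit sum over $M'$ or $q$ appears in the code.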
Formula (58.5) then becomes

$$\sigma^{(m)} = \tfrac13\,\omega Mp\,(\mu_p - \mu_n)^{2}\left|\int\psi'^{*}\psi\,d^3x\right|^{2}. \tag{58.6}$$

The initial function $\psi$ is given by (58.2); the final function is

$$\psi' = \frac{1}{2p}\,R_{p0}(r).$$

This is the first term ($l = 0$) in the expansion (56.7) of a function whose asymptotic form comprises a plane wave and an ingoing spherical wave; an unimportant phase factor has been omitted. Since the integration is taken over the region outside the range of action of the nuclear forces, the radial function is

$$R_{p0}(r) = \frac{2}{r}\sin(pr + \delta).$$

The phase $\delta$ is related to the value of the virtual level ($I_1 = 0.067$ MeV) of the proton + neutron system when $S = 0$:

$$\cot\delta = \kappa_1/p,\qquad \kappa_1 = \sqrt{MI_1};$$

see QM, §133. Then

$$\int\psi'^{*}\psi\,d^3x = (2\pi)^{1/2}\,\frac{2\sqrt{\kappa}}{p}\,\mathrm{Im}\int_0^\infty e^{-\kappa r+ipr}e^{i\delta}\,dr = (2\pi)^{1/2}\,\frac{2\sqrt{\kappa}}{p}\,\mathrm{Im}\,\frac{e^{i\delta}}{\kappa - ip}.$$

After a simple algebraic reduction, we obtain the following expression for the photodisintegration cross-section (in ordinary units):

$$\sigma^{(m)} = \frac{8\pi}{3\hbar c}\,(\mu_p - \mu_n)^{2}\,\frac{\sqrt{I(\hbar\omega - I)}\,\big(\sqrt{I} + \sqrt{I_1}\big)^{2}}{\hbar\omega\,(\hbar\omega - I + I_1)}. \tag{58.7}$$

When $\hbar\omega \to I$, the cross-section tends to zero as $\sqrt{\hbar\omega - I}$, in accordance with the general properties of cross-sections near the reaction threshold (QM, §147).

The inverse process to photodisintegration is radiative capture of a proton by a neutron. The capture cross-section ($\sigma_c$) is obtained from the photodisintegration cross-section ($\sigma_{\rm ph}$) by means of the principle of detailed balancing; cf. the derivation of (56.15). The spin statistical weights of the neutron and the proton are $2 \times 2 = 4$; those of the deuteron (in a state with $S = 1$) and the photon are $3 \times 2 = 6$. Hence

$$\frac{\sigma_c}{\sigma_{\rm ph}} = \frac{3(\hbar\omega)^{2}}{2c^{2}p^{2}} = \frac{3(\hbar\omega)^{2}}{2Mc^{2}(\hbar\omega - I)}. \tag{58.8}$$

CHAPTER VI

SCATTERING OF RADIATION

§59. The scattering tensor

The scattering of a photon by a system of electrons (which will be referred to below as an atom) consists of the absorption of the initial photon $\mathbf k$ and the simultaneous emission of another photon $\mathbf k'$. The atom may be left either at its initial energy level or at some other discrete energy level.
In the former case the photon frequency is unchanged (Rayleigh scattering); in the latter case the frequency changes by

$$\omega' - \omega = E_1 - E_2, \tag{59.1}$$

where $E_1$ and $E_2$ are the initial and final energies of the atom (Raman scattering).† If the initial state of the atom is the ground state, then $E_2 > E_1$ in Raman scattering, and so $\omega' < \omega$: the frequency is decreased by the scattering (the Stokes case). In scattering by an excited atom, either the Stokes case or the anti-Stokes case ($\omega' > \omega$) may occur.

† In this chapter, the suffixes 1 and 2 will denote quantities pertaining respectively to the initial and final states of a scattering system.

Since the electromagnetic perturbation operator has no matrix elements for transitions in which two photon occupation numbers simultaneously change, the scattering effect appears only in the second approximation of perturbation theory. It must be regarded as taking place via certain intermediate states, which may be of one of two types:

(I) The photon $\mathbf k$ is absorbed and the atom enters one of its possible states $E_n$; in the subsequent transition to the final state, the photon $\mathbf k'$ is emitted;

(II) The photon $\mathbf k'$ is emitted and the atom enters the state $E_n$; in the transition to the final state, the photon $\mathbf k$ is absorbed.

In this process, the matrix element is represented by the sum

$$V_{21} = \sum_n{}'\left(\frac{V'_{2n}V_{n1}}{\mathscr E_1 - \mathscr E_n^{\rm I}} + \frac{V_{2n}V'_{n1}}{\mathscr E_1 - \mathscr E_n^{\rm II}}\right) \tag{59.2}$$

(see QM (43.7)), where the initial energy of the atom + photons system is $\mathscr E_1 = E_1 + \omega$, and the energies of the intermediate states are

$$\mathscr E_n^{\rm I} = E_n,\qquad \mathscr E_n^{\rm II} = E_n + \omega + \omega'.$$

The $V_{n1}$, $V_{2n}$ are the matrix elements for the absorption of the photon $\mathbf k$, and the $V'_{n1}$, $V'_{2n}$ are those for the emission of the photon $\mathbf k'$; the initial state is excluded from the summation over $n$, this being indicated by the prime to the summation sign. The scattering cross-section is

$$d\sigma = 2\pi|V_{21}|^{2}\,\frac{\omega'^{2}\,do'}{(2\pi)^{3}}, \tag{59.3}$$

where $do'$ is a solid-angle element for the directions $\mathbf k'$.
The radiation energy $dI'$ scattered into the solid angle $do'$ per unit time is expressed in terms of the intensity (energy flux density) $I$ of the incident radiation by $dI' = I(\omega'/\omega)\,d\sigma$.

We shall assume that the wavelengths of the initial and final photons are large compared with the dimensions $a$ of the scattering system. All transitions will therefore be considered in the dipole approximation. If the photon states are described by plane waves, this approximation is equivalent to replacing the factors $e^{i\mathbf k\cdot\mathbf r}$ by unity. Then the wave functions of the photons are (in the three-dimensionally transverse gauge)

$$\mathbf A_{\mathbf e\omega} = \sqrt{4\pi}\,\frac{\mathbf e}{\sqrt{2\omega}}\,e^{-i\omega t},\qquad \mathbf A_{\mathbf e'\omega'} = \sqrt{4\pi}\,\frac{\mathbf e'}{\sqrt{2\omega'}}\,e^{-i\omega' t}.$$

Under the conditions considered, the electromagnetic interaction operator may be written as

$$\hat V = -\hat{\mathbf d}\cdot\hat{\mathbf E}, \tag{59.4}$$

where $\hat{\mathbf E} = -\dot{\hat{\mathbf A}}$ is the field strength operator and $\hat{\mathbf d}$ the dipole moment operator of the atom (similarly to the classical expression for the energy of a small system in an electric field; Fields, §42). The matrix elements are

$$V_{n1} = -i\sqrt{2\pi\omega}\,(\mathbf e\cdot\mathbf d_{n1}),\qquad V'_{2n} = i\sqrt{2\pi\omega'}\,(\mathbf e'^{*}\cdot\mathbf d_{2n}).$$

Substituting these expressions in (59.2), (59.3), we find as the scattering cross-section (written in ordinary units)†

$$d\sigma = \left|\sum_n\left\{\frac{(\mathbf d_{2n}\cdot\mathbf e'^{*})(\mathbf d_{n1}\cdot\mathbf e)}{\omega_{n1} - \omega - i0} + \frac{(\mathbf d_{2n}\cdot\mathbf e)(\mathbf d_{n1}\cdot\mathbf e'^{*})}{\omega_{n1} + \omega' - i0}\right\}\right|^{2}\frac{\omega\omega'^{3}}{\hbar^{2}c^{4}}\,do', \tag{59.5}$$

$$\hbar\omega_{n1} = E_n - E_1,\qquad \omega' - \omega = \omega_{12}.$$

† This formula was first derived by H. A. Kramers and W. Heisenberg (1925), before the development of quantum mechanics.

The summation is over all possible states of the atom, including those of the continuous spectrum (states 1 and 2 cannot appear in the sum, since the diagonal matrix elements $\mathbf d_{11}$ and $\mathbf d_{22}$ are zero). The infinitesimal imaginary increments in the denominators correspond to the usual rule for pole avoidance in perturbation theory (see QM, §43): an infinitesimal negative imaginary part is added to the energies $E_n$ of the intermediate states over which the summation is carried out.
The avoidance rule is important when the poles of (59.5) with respect to the variable $E_n$ are in the region of the continuous spectrum; for example, if state 1 is the ground state of the atom, this would occur for $\hbar\omega$ exceeding the ionization threshold of the atom.†

We shall use the notation‡ (in ordinary units)

$$(c_{ik})_{21} = \frac{1}{\hbar}\sum_n\left[\frac{(d_i)_{2n}(d_k)_{n1}}{\omega_{n1} - \omega - i0} + \frac{(d_k)_{2n}(d_i)_{n1}}{\omega_{n2} + \omega - i0}\right], \tag{59.6}$$

where $i, k = x, y, z$ are three-dimensional vector indices. Then formula (59.5) can be written as

$$d\sigma = \omega(\omega + \omega_{12})^{3}\,\big|(c_{ik})_{21}\,e'^{*}_{i}e_{k}\big|^{2}\,do'/c^{4}. \tag{59.7}$$

† In a molecule, the threshold for dissociation into atoms here takes the place of the ionization threshold.

‡ Most of the results derived in §§59-61 below are due to G. Placzek (1931-1933).

The notation (59.6) is justifiable in that this sum can in fact be represented as the matrix element of a certain tensor. This is most easily seen by defining a vector quantity $\mathbf b$ whose operator satisfies the equation

$$i\dot{\hat{\mathbf b}} + \omega\hat{\mathbf b} = \hat{\mathbf d}.$$

Its matrix elements are

$$\mathbf b_{n1} = \frac{\mathbf d_{n1}}{\omega - \omega_{n1}},\qquad \mathbf b_{2n} = \frac{\mathbf d_{2n}}{\omega + \omega_{n2}},$$

so that

$$(c_{ik})_{21} = (b_kd_i - d_ib_k)_{21}. \tag{59.8}$$

The matrix elements $(c_{ik})_{21}$ will be called the radiation scattering tensor.

It follows from the above that the selection rules for scattering are the same as the selection rules for the matrix elements of an arbitrary tensor of rank two. We can see immediately that, if the system has a centre of symmetry (so that its states can be classified by parity), transitions are possible only between states of the same parity (including transitions without change of state). This rule is the opposite of the parity selection rule for (electric dipole) emission, and so there is an alternate prohibition: transitions allowed in emission are forbidden in scattering, and vice versa.

We can resolve the tensor $c_{ik}$ into irreducible parts:

$$c_{ik} = c^{0}\delta_{ik} + c^{s}_{ik} + c^{a}_{ik}, \tag{59.9}$$
where

$$c^{0} = \tfrac13c_{ii},\qquad c^{s}_{ik} = \tfrac12(c_{ik} + c_{ki}) - c^{0}\delta_{ik},\qquad c^{a}_{ik} = \tfrac12(c_{ik} - c_{ki}) \tag{59.10}$$

are respectively a scalar, a symmetric tensor (with zero trace) and an antisymmetric tensor. Their matrix elements are

$$(c^{0})_{21} = \frac{1}{3\hbar}\sum_n\frac{\omega_{n1} + \omega_{n2}}{(\omega_{n1} - \omega)(\omega_{n2} + \omega)}\,(d_i)_{2n}(d_i)_{n1}, \tag{59.11}$$

$$(c^{s}_{ik})_{21} = \frac{1}{2\hbar}\sum_n\frac{\omega_{n1} + \omega_{n2}}{(\omega_{n1} - \omega)(\omega_{n2} + \omega)}\,\big[(d_i)_{2n}(d_k)_{n1} + (d_k)_{2n}(d_i)_{n1}\big] - (c^{0})_{21}\delta_{ik}, \tag{59.12}$$

$$(c^{a}_{ik})_{21} = \frac{2\omega + \omega_{12}}{2\hbar}\sum_n\frac{(d_i)_{2n}(d_k)_{n1} - (d_k)_{2n}(d_i)_{n1}}{(\omega_{n1} - \omega)(\omega_{n2} + \omega)}; \tag{59.13}$$

the symbols indicating pole avoidance are omitted, for brevity.

Let us consider some properties of the scattering tensor in the limiting cases of low and high photon frequencies.† For Rayleigh scattering ($\omega_{12} = 0$), the antisymmetric part of the tensor vanishes as $\omega \to 0$, because of the factor $\omega$ in front of the sum in (59.13). The scalar and symmetric parts of the scattering tensor, however, tend to finite limits as $\omega \to 0$. The cross-section is therefore proportional to $\omega^{4}$ when $\omega$ is small.

† The case of resonance (when $\omega$ is close to one of the frequencies $\omega_{n1}$ and $\omega_{n2}$) will be discussed in §63.

In the opposite case, when the frequency $\omega$ is large compared with all the frequencies $\omega_{n1}$, $\omega_{n2}$ which are important in (59.6) (but of course the wavelength is still much greater than $a$), we must arrive at the formulae of the classical theory. The first term in the expansion of the scattering tensor in powers of $1/\omega$ is

$$\frac{1}{\hbar\omega}\sum_n\big[(d_k)_{2n}(d_i)_{n1} - (d_i)_{2n}(d_k)_{n1}\big] = \frac{1}{\hbar\omega}\,(d_kd_i - d_id_k)_{21}$$

and is zero, since the operators $d_i$ and $d_k$ commute. The next term in the expansion is

$$(c_{ik})_{21} = \frac{1}{\hbar\omega^{2}}\sum_n\big[\omega_{2n}(d_k)_{2n}(d_i)_{n1} - (d_i)_{2n}\,\omega_{n1}(d_k)_{n1}\big] = \frac{1}{i\hbar\omega^{2}}\,(\dot d_kd_i - d_i\dot d_k)_{21}.$$

Using the definition $\mathbf d = \sum e\mathbf r$ (with the summation over all the electrons in the atom) and the commutation rules for momenta and coordinates, we obtain

$$(c_{ik})_{11} = -\frac{Ze^{2}}{m\omega^{2}}\,\delta_{ik},\qquad (c_{ik})_{21} = 0, \tag{59.14}$$

where $Z$ is the total number of electrons in the system, and $m$ the electron mass.
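The high-frequency limit can be illustrated on a toy model. The sketch below is an assumption-laden example, not anything from the source. It takes a single electron in a one-dimensional anharmonic well, $H = p^2/2 + x^2/2 + \lambda x^4$, in units with $\hbar = m = e = 1$. The Hamiltonian is diagonalized in a truncated oscillator basis, and the diagonal tensor is formed as $\alpha(\omega) = \sum_n 2\omega_{n0}|x_{n0}|^2/(\omega_{n0}^2 - \omega^2)$. The code then checks that $\sum_n 2\omega_{n0}|x_{n0}|^2 \approx 1$ and that $\omega^2\alpha(\omega) \to -1$, the analogue of $-Ze^2/m\omega^2$ with $Z = 1$. The basis size, the value of $\lambda$ and all function names are choices made for this illustration.

```python
import numpy as np


def anharmonic_transitions(n_basis: int = 60, lam: float = 0.1):
    """Return (omega_n0, x_n0) for transitions from the ground state.

    Assumed model: H = p^2/2 + x^2/2 + lam*x^4 with hbar = m = e = 1,
    represented in a truncated harmonic-oscillator basis.
    """
    a = np.diag(np.sqrt(np.arange(1, n_basis)), k=1)   # annihilation operator
    x = (a + a.T) / np.sqrt(2.0)
    p = 1j * (a.T - a) / np.sqrt(2.0)
    h = (p @ p).real / 2.0 + x @ x / 2.0 + lam * np.linalg.matrix_power(x, 4)
    energies, vecs = np.linalg.eigh(h)
    x_eig = vecs.T @ x @ vecs                           # x in the energy basis
    return energies[1:] - energies[0], x_eig[1:, 0]


def polarizability(omega: float, omega_n0, x_n0) -> float:
    """Sum over states 2*w_n0*|x_n0|^2/(w_n0^2 - w^2), below or far above resonances."""
    return float(np.sum(2.0 * omega_n0 * x_n0 ** 2 / (omega_n0 ** 2 - omega ** 2)))


def sum_rule(omega_n0, x_n0) -> float:
    """Sum of 2*w_n0*|x_n0|^2, which should equal 1/m = 1 for one electron."""
    return float(np.sum(2.0 * omega_n0 * x_n0 ** 2))


if __name__ == "__main__":
    w, d = anharmonic_transitions()
    print("sum rule:", sum_rule(w, d))
    print("static polarizability:", polarizability(0.0, w, d))
    print("omega^2 * alpha at omega = 100:", 100.0 ** 2 * polarizability(100.0, w, d))
```

The truncation only distorts the highest basis states, which carry negligible weight in the ground-state sums, so the sum rule is expected to hold to high accuracy in this model.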
Thus, in the limit of high frequencies, there remains in the scattering tensor only the scalar part, and scattering takes place without change in the state of the system, i.e. the scattering is entirely coherent (see below). The scattering cross-section in this case is

$$d\sigma = r_e^{2}Z^{2}\,|\mathbf e'^{*}\cdot\mathbf e|^{2}\,do', \tag{59.15}$$

where $r_e = e^2/m$. After summing over polarizations of the final photon, we have

$$d\sigma = r_e^{2}Z^{2}\{1 - (\mathbf e\cdot\mathbf n')^{2}\}\,do' = r_e^{2}Z^{2}\sin^{2}\theta\,do', \tag{59.16}$$

which is in fact the same as the classical Thomson's formula (Fields, (80.7)); $\theta$ is the angle between the direction of scattering and the polarization vector of the incident photon.

Let us consider the scattering of radiation by an assembly of $N$ identical atoms situated in a region small compared with the wavelength. The corresponding scattering tensor is equal to the sum of the tensors for scattering by each atom. It must, however, be remembered that the wave functions (which are used to calculate the dipole moment matrix elements) for several identical atoms taken together are not simply equal functions. The wave functions are essentially defined only to within an arbitrary phase factor, which is different for each atom. The scattering cross-section has to be averaged over the phase factor of each atom separately.

The scattering tensor $(c_{ik})_{21}$ of each atom includes a factor $e^{i(\phi_1 - \phi_2)}$, where $\phi_1$ and $\phi_2$ are the phases of the wave functions of the initial and final states. For Raman scattering, the states 1 and 2 are different, and this factor is not equal to unity. In the squared modulus

$$\Big|e'^{*}_{i}e_{k}\sum(c_{ik})_{21}\Big|^{2},$$

where the sum is over all $N$ atoms, the products of terms pertaining to different atoms will include phase factors which vanish on independent averaging over the phases of the atoms, and only the squared modulus of each term remains. This means that the total cross-section for scattering by $N$ atoms is found by taking $N$ times the cross-section for scattering by one atom; the scattering is incoherent.
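The phase-averaging argument can be mimicked with a toy Monte Carlo, given here as an illustration and not as part of the original derivation. Each of $N$ atoms contributes a unit amplitude with a phase factor. For independent, uniformly distributed random phases the mean of $|\sum e^{i\phi}|^2$ should come out close to $N$. For equal phases, the case in which the initial and final states coincide, it is exactly $N^2$. The function name, the number of trials and the seed are arbitrary choices.

```python
import cmath
import math
import random


def mean_intensity(n_atoms: int, trials: int, coherent: bool, seed: int = 0) -> float:
    """Average of |sum_a exp(i*phi_a)|^2 over random draws of the phases.

    coherent=True uses the same phase (zero) for every atom;
    coherent=False draws each phase independently and uniformly on [0, 2*pi).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        amplitude = 0j
        for _ in range(n_atoms):
            phase = 0.0 if coherent else rng.uniform(0.0, 2.0 * math.pi)
            amplitude += cmath.exp(1j * phase)
        total += abs(amplitude) ** 2
    return total / trials


if __name__ == "__main__":
    n = 20
    print("incoherent:", mean_intensity(n, 4000, coherent=False))
    print("coherent:  ", mean_intensity(n, 10, coherent=True))
```

The cross terms between different atoms average to zero only in the mean, so the incoherent estimate fluctuates around $N$ with a statistical error that shrinks as the number of trials grows.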
If, however, the initial and final states of the atom are the same, then the factors $e^{i(\phi_1 - \phi_2)} = 1$. The amplitude for scattering by the assembly of atoms is $N$ times that for scattering by one atom, and the scattering cross-section consequently differs by a factor $N^{2}$; the scattering is coherent.† If the atomic energy level is not degenerate, Rayleigh scattering is therefore entirely coherent. But if the energy level is degenerate, there will also be incoherent Rayleigh scattering arising from the transitions of the atom between various mutually degenerate states. This is a purely quantum effect; in the classical theory, any scattering without change of frequency is coherent.

† The factor $Z^{2}$ in formulae (59.15) and (59.16) has the same origin: the cross-section for scattering by $Z$ electrons in one atom is $Z^{2}$ times that for scattering by one electron.

The coherent scattering tensor is given by the diagonal matrix element $(c_{ik})_{11}$ and will be denoted by $\alpha_{ik}$, omitting for brevity the index which shows the state of the atom. According to (59.6) (with $\hbar = 1$),

$$\alpha_{ik}(\omega) = (c_{ik})_{11} = \sum_n\left[\frac{(d_i)_{1n}(d_k)_{n1}}{\omega_{n1} - \omega - i0} + \frac{(d_k)_{1n}(d_i)_{n1}}{\omega_{n1} + \omega - i0}\right]. \tag{59.17}$$

This expression may also be written as

$$\alpha_{ik}(\omega) = -\frac{e^{2}}{m\omega^{2}}\left\{Z\delta_{ik} - \frac{1}{m}\sum_n\left[\frac{(p_i)_{1n}(p_k)_{n1}}{\omega_{n1} - \omega - i0} + \frac{(p_k)_{1n}(p_i)_{n1}}{\omega_{n1} + \omega - i0}\right]\right\}, \tag{59.18}$$

using the limiting form (59.14). Here $\mathbf p$ is the total momentum of the electrons in the atom. The equivalence of the two forms is easily seen by noting that the matrix elements of the momentum and the dipole moment are connected by $e\mathbf p_{1n}/m = i\omega_{1n}\mathbf d_{1n}$ and using the same relationships as in the derivation of (59.14).

If the sum and difference $E_1 \pm \omega$ are not equal to any of the energy levels $E_n$ of the atom (including the continuous spectrum), the terms $i0$ in the denominators may be omitted. Since $\mathbf p_{1n} = \mathbf p_{n1}^{*}$, the tensor $\alpha_{ik}$ is then seen to be Hermitian†:

$$\alpha_{ik} = \alpha^{*}_{ki}. \tag{59.19}$$

This means that its scalar and symmetric parts are real, and its antisymmetric part is imaginary.
The latter is certainly zero if the atom is in a non-degenerate state; the wave function of such a state is real, and therefore the diagonal matrix elements are also real.

† This result depends on the neglect of the natural line width, and therefore of the possible absorption of the incident radiation; see §62.

The tensor $\alpha_{ik}$ is related to the polarizability of the atom in an external electric field. To show this relation, let us calculate the correction to the mean value of the dipole moment of the system when the latter is placed in an external electric field

$$\tfrac12\left(\mathbf E\,e^{-i\omega t} + \mathbf E^{*}e^{i\omega t}\right). \tag{59.20}$$

This can be done by using a well-known formula of perturbation theory (QM, §40). If the system is subjected to a perturbation

$$\hat V = \hat F e^{-i\omega t} + \hat F^{+}e^{i\omega t},$$

then the first-order correction to the diagonal matrix elements of a quantity $f$ is

$$f^{(1)}_{11}(t) = -\sum_n{}'\left\{\left[\frac{f_{1n}F_{n1}}{\omega_{n1} - \omega - i0} + \frac{f_{n1}F_{1n}}{\omega_{n1} + \omega + i0}\right]e^{-i\omega t} + \left[\frac{f_{1n}F^{*}_{1n}}{\omega_{n1} + \omega - i0} + \frac{f_{n1}F^{*}_{n1}}{\omega_{n1} - \omega + i0}\right]e^{i\omega t}\right\}.$$

The perturbation $\hat V$ must be regarded as being applied with infinite slowness from $t = -\infty$, so that in the first term $\omega$ is to be interpreted as $\omega + i0$, and in the second term as $\omega - i0$; the imaginary increments in the denominators have been written accordingly. In the present case $\hat F = -\tfrac12\hat{\mathbf d}\cdot\mathbf E$, and the correction to the diagonal matrix element of the dipole moment is found to be

$$\mathbf d^{(1)}_{11} = \tfrac12\left(\mathbf d\,e^{-i\omega t} + \mathbf d^{*}e^{i\omega t}\right), \tag{59.21}$$

where $\mathbf d$ is a vector whose components are

$$d_i = \alpha^{(p)}_{ik}E_k. \tag{59.22}$$

The expression for the tensor $\alpha^{(p)}_{ik}(\omega)$ differs from (59.17) for $\alpha_{ik}$ by a change in the sign of the imaginary part in the denominator of the second term. By definition, $\alpha^{(p)}_{ik}(\omega)$ is the polarizability tensor of the atom in a field of frequency $\omega$. For frequencies such that the imaginary parts in the denominators can be omitted and the tensor $\alpha_{ik}$ is Hermitian, $\alpha_{ik}$ and $\alpha^{(p)}_{ik}$ are identical. In particular, when $\omega = 0$ the formula (59.22) becomes QM, (76.4), and the expression QM, (76.5) for the static polarizability tensor is the same as $\alpha_{ik}(0)$ from (59.17).
Note also that, if state 1 is the ground state,† all $\omega_{n1} > 0$ and the avoidance rule in the first and second terms in (59.17) is important only when $\omega > 0$ and $\omega < 0$ respectively. In that case,

$$\alpha_{ik}(\omega) = \alpha^{(p)}_{ik}(|\omega|). \tag{59.23}$$

The formulae of scattering theory implicitly have $\omega > 0$; the tensor $\alpha_{ik}$ is then the same as the polarizability tensor.

† Only this case (which will be assumed henceforward) allows a completely rigorous treatment, because of the finite lifetime of the excited states; see §62 below.

We shall need not only the cross-section but also the photon scattering amplitude $f$. As usual in perturbation theory, this is equal, apart from a normalization factor, to minus the matrix element (59.2). Choosing this factor so as to express the cross-section (59.7) in the form $d\sigma = |f|^{2}\,do'$, we have as the elastic scattering amplitude

$$f = \omega^{2}\alpha_{ik}\,e'^{*}_{i}e_{k}. \tag{59.24}$$

According to the optical theorem (see (71.10) below), the imaginary part of the forward scattering amplitude (without change in momentum and polarization) determines the total cross-section $\sigma_t$ for all possible elastic and inelastic processes for a given initial state of the photon:

$$\sigma_t = \frac{4\pi}{\omega}\,\mathrm{Im}\,(\omega^{2}\alpha_{ik}e^{*}_{i}e_{k}) = 4\pi\omega\,(\alpha_{ik} - \alpha^{*}_{ki})\,e^{*}_{i}e_{k}/2i. \tag{59.25}$$

Thus the total cross-section is determined by the anti-Hermitian part of the scattering tensor.

The formula (59.25) has a simple classical significance. The electric field $\mathbf E$ does work $\sum e\mathbf v\cdot\mathbf E = \mathbf E\cdot\dot{\mathbf d}$ on the system of charges per unit time. Expressing the field in the form (59.20), and the dipole moment in the form (59.21), (59.22), and averaging this work with respect to time, we find

$$\tfrac12\omega|E|^{2}\,e^{*}_{i}e_{k}\,(\alpha_{ik} - \alpha^{*}_{ki})/2i,$$

with $\mathbf E = \mathbf eE$. On the other hand, if $\mathbf E$ is the incident radiation field, the mean energy flux density in it is $|E|^{2}/8\pi$, and the energy absorbed by the atom is $|E|^{2}\sigma_t/8\pi$. Equating the two expressions, we find (59.25).

If the angular momentum $J$ of the ground state of the atom is zero, then by spherical symmetry $\alpha_{ik} = \alpha\delta_{ik}$, and

$$\sigma_t = 4\pi\omega\,\mathrm{Im}\,\alpha.$$
(59.26)

For a system having angular momentum, a similar relation holds for quantities averaged over the spatial directions of the angular momentum (see §60).

For photon energies above the ionization threshold of the atom, the principal contribution to the total cross-section $\sigma_t$ comes from the ionization process (the absorption of a photon in the photoelectric effect). The scattering cross-section is a quantity of higher order in $e^2$; compare, for instance, (56.13) and (59.16). If, however, the photon energy is below the ionization threshold (but not too close to resonance, i.e. to any of the discrete excitation frequencies of the atom), then the cross-section (which in this case reduces to the scattering cross-section), and therefore the imaginary part of the amplitude, are of a higher order of smallness than the real part of the amplitude. Neglecting the former, we again obtain (59.19). The situation is different in the neighbourhood of resonance, where the cross-section increases; this case will be discussed in §62.

As well as scattering, the two-photon processes which occur in second-order perturbation theory also include double emission, i.e. the simultaneous emission of two quanta by an atom. The expression for the probability of this process differs from (59.5) only by the changes $\omega \to -\omega$, $\mathbf e \to \mathbf e^{*}$ (emission of a photon $\omega$, instead of absorption) and by the extra factor

$$d^3k/(2\pi)^{3} = \omega^{2}\,d\omega\,do/(2\pi)^{3},$$

the number of quantum states of the emitted photon in given ranges of the frequency $\omega$ and the directions of $\mathbf k$; the frequency of the second photon is determined from $\omega$ by the equation $\omega + \omega' = \omega_{12}$. The emission probability per unit time is therefore†

$$dw = \big|(b_{ik})_{21}\,e'^{*}_{i}e^{*}_{k}\big|^{2}\,\frac{\omega^{3}\omega'^{3}}{(2\pi)^{3}c^{6}}\,do\,do'\,d\omega', \tag{59.27}$$

where

$$(b_{ik})_{21} = \frac{1}{\hbar}\sum_n\left[\frac{(d_i)_{2n}(d_k)_{n1}}{\omega_{n1} + \omega - i0} + \frac{(d_k)_{2n}(d_i)_{n1}}{\omega_{n1} + \omega' - i0}\right]$$

differs from $(c_{ik})_{21}$ in (59.6) only by a change in the sign of $\omega$.
Summing this expression over the polarizations of the photons and integrating over their directions of emission,‡ we obtain

$$dw = \frac{8\omega^3\omega'^3}{9\pi\hbar^2c^6}\,|(b_{ik})_{21}|^2\,d\omega. \tag{59.28}$$

The probability of the emission of two photons $\omega$ and $\omega'$ is usually very small in comparison with that of the emission of a single photon with frequency $\omega + \omega'$. An exception occurs in cases where the selection rules forbid the latter process but allow the former, such as transitions between two states with $J = 0$, where all processes of single-photon emission are strictly forbidden. Another example is the transition from the first excited state ($2s_{1/2}$) of the hydrogen atom to the ground state ($1s_{1/2}$), which is forbidden for both E1 and M1 radiation; see §52, Problem 2.§

If the atom is in the field of an incident flux of photons $\omega$, $\mathbf{k}$, there is not only spontaneous double emission, with the probability (59.27), but also induced double emission, in which the field causes emission of a similar photon and a photon $\omega'$, $\mathbf{k}'$. The probability of this process differs from that of spontaneous emission by a factor $N_{\mathbf{k}e}$, the number density of incident photons with given $\mathbf{k}$ and $\mathbf{e}$. The incident photon flux density is

$$dI = cN_{\mathbf{k}e}\,d^3k/(2\pi)^3 = N_{\mathbf{k}e}(\omega^2/8\pi^3c^2)\,d\omega\,do.$$

Expressing $N_{\mathbf{k}e}$ in terms of $dI$ and dividing the probability of the process by $dI$, we obtain the cross-section

$$d\sigma = \frac{\omega\omega'^3}{\hbar^2c^4}\,|(b_{ik})_{21}e_i'^*e_k^*|^2\,do'. \tag{59.29}$$

Similarly, if the atom is in a field of photons $\omega'$, $\mathbf{k}'$, the incidence of a photon $\omega$, $\mathbf{k}$ causes induced Raman scattering, whose cross-section is proportional to the density of photons $\omega'$, $\mathbf{k}'$.

† In the rest of this section, ordinary units are used.
‡ This operation amounts to complete averaging over the directions of $\mathbf{e}$ by $\overline{e_ie_k^*} = \tfrac13\delta_{ik}$, followed by multiplication by $2\times2\times4\pi\times4\pi$.
§ The lifetime of the $2s_{1/2}$ level for double emission is 0.15 sec.
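The step from (59.27) to (59.28) can be written out explicitly. The following is a sketch of the arithmetic only; it assumes that $|(b_{ik})_{21}|^2$ in (59.28) stands for the sum of the squared moduli of all nine components, and that the two photons are averaged independently.

```latex
\begin{aligned}
\overline{|(b_{ik})_{21}\,e_i'^*e_k^*|^2}
  &= (b_{ik})_{21}(b_{lm})_{21}^*\;\overline{e_i'^*e_l'}\;\overline{e_k^*e_m}
   = (b_{ik})_{21}(b_{lm})_{21}^*\cdot\tfrac13\delta_{il}\cdot\tfrac13\delta_{km}
   = \tfrac19\,|(b_{ik})_{21}|^2, \\[4pt]
dw &= \tfrac19\,|(b_{ik})_{21}|^2\cdot
      \underbrace{2\cdot2}_{\text{polarizations}}\cdot
      \underbrace{4\pi\cdot4\pi}_{\text{directions}}\cdot
      \frac{\omega^3\omega'^3}{(2\pi)^3\hbar^2c^6}\,d\omega
    = \frac{64\pi^2}{9\cdot8\pi^3}\,
      \frac{\omega^3\omega'^3}{\hbar^2c^6}\,|(b_{ik})_{21}|^2\,d\omega
    = \frac{8\,\omega^3\omega'^3}{9\pi\hbar^2c^6}\,|(b_{ik})_{21}|^2\,d\omega .
\end{aligned}
```

The same bookkeeping, with only one photon summed over, gives the factor $\omega\omega'^3/\hbar^2c^4$ in (59.29) once the probability is divided by the flux $dI$.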
The calculation of the tensors $(c_{ik})_{21}$ or $(b_{ik})_{21}$ for specific atoms requires that of sums of the form

$$(M_{ik})_{21} = \sum_n\frac{(d_i)_{2n}(d_k)_{n1}}{E_n - E - i0}, \tag{59.30}$$

with $E$ taking the values $E_1 \pm \hbar\omega$ or $E_1 \pm \hbar\omega'$. To simplify the notation, let us discuss a hydrogen atom. We write the sum (59.30) as the integral

$$(M_{ik})_{21} = \iint\psi_2^*(\mathbf{r})\,d_i\,G(\mathbf{r},\mathbf{r}';E)\,d_k'\,\psi_1(\mathbf{r}')\,d^3x\,d^3x', \tag{59.31}$$

where

$$G(\mathbf{r},\mathbf{r}';E) = \sum_n\frac{\psi_n(\mathbf{r})\psi_n^*(\mathbf{r}')}{E_n - E - i0}. \tag{59.32}$$

Let the operator $\hat H - E$, where $\hat H$ is the Hamiltonian of the atom, act on the function $G$. Since $\hat H\psi_n = E_n\psi_n$, we obtain

$$(\hat H - E)G = \sum_n\psi_n(\mathbf{r})\psi_n^*(\mathbf{r}').$$

The sum is the delta function $\delta(\mathbf{r}-\mathbf{r}')$, since the set of functions $\psi_n$ is complete. Thus the function $G$ satisfies the equation

$$(\hat H - E)G(\mathbf{r},\mathbf{r}';E) = \delta(\mathbf{r}-\mathbf{r}'), \tag{59.33}$$

i.e. it is the Green's function of Schrödinger's equation; the avoidance rule in (59.32) decides which solution of this equation is to be taken. Thus the problem of calculating the sum (59.30) reduces to finding the Green's function of the atom. An exact solution of equation (59.33) is, however, possible only if we know the exact solutions of the homogeneous Schrödinger's equation, i.e. in practice only for the hydrogen atom.†

† See L. Hostler, Journal of Mathematical Physics 5, 591, 1964. The application of this Green's function to calculate the scattering amplitude for the hydrogen atom is given by Ya. I. Granovskiĭ, Soviet Physics JETP 29, 333, 1969.

PROBLEM

Calculate the probability of elastic scattering of a (non-relativistic) electron by an almost monochromatic standing light wave (P. L. Kapitza and P. A. M. Dirac, 1933).

SOLUTION. The standing wave may be regarded as a combination of photons with momenta $\mathbf{k}$ and $-\mathbf{k}$ (and equal polarizations). The scattering of the electron may be regarded as the absorption of a photon $\mathbf{k}$ and induced emission of a photon $-\mathbf{k}$, so that the electron momentum $\mathbf{p}$ is changed by $2\hbar\mathbf{k}$ and rotated (without change of magnitude) through an angle $\theta$ such that $|\mathbf{p}|\sin\tfrac12\theta = \hbar\omega/c$. The probability of this process can be obtained from the Thomson scattering cross-section (59.15),

$$d\sigma = r_e^2|\mathbf{e}'^*\cdot\mathbf{e}|^2\,do' = r_e^2\,do',$$

by multiplying by the flux density of photons with momentum $\mathbf{k}$ and the number of photons with momentum $-\mathbf{k}$. The flux density of photons having frequencies in the range $d\omega$ is $cU_\omega\,d\omega/2\hbar\omega$, where $U_\omega\,d\omega$ is the energy density in the standing wave in the spectrum interval $d\omega$; the factor 2 appears because the energy of the wave is equally divided between the photons moving in opposite directions. The momenta $\mathbf{k}$ of all the photons forming the standing wave are parallel to a certain direction $\mathbf{n}$ (the "direction" of the standing wave). In other words, the energy density as a function of the frequency and direction of the photons $\mathbf{n}'$ is $U_{\omega\mathbf{n}'} = U_\omega\delta^{(2)}(\mathbf{n}'-\mathbf{n})$. Accordingly, the number of $-\mathbf{k}$ photons is

$$N_{-\mathbf{k}} = \frac{8\pi^3c^3}{\hbar\omega^3}\cdot\tfrac12U_\omega\,\delta^{(2)}(\mathbf{n}'-\mathbf{n});$$

cf. (44.8). The electron scattering probability per unit time is then found to be

$$w = \frac{2\pi^3c^4}{\hbar^2\omega^4}\,r_e^2\int U_\omega^2\,d\omega.$$

The factor $\omega^{-4}$ is taken outside the integral, since the non-monochromaticity $\Delta\omega$ is assumed small. The value of the integral is inversely proportional to $\Delta\omega$ (for a given total intensity).

§ 60. Scattering by freely oriented systems

If an atomic energy level is not degenerate, the polarizability and intensity of coherent scattering are determined by the same tensor $\alpha_{ik} = (c_{ik})_{11}$. If the level is degenerate, however, the observed values of these quantities are averaged over all states belonging to the level in question. The polarizability must be defined as the mean value $\alpha_{ik} = \overline{(c_{ik})_{11}}$. The observed scattering intensity is determined by the mean values $\overline{(c_{ik})_{11}(c_{lm})_{11}^*}$. The relation between the polarizability and the scattering is therefore more indirect.
Although each of the quantities $(c_{ik})_{11}$ may be complex, their mean values (in the absence of absorption, with $\alpha_{ik}$ an Hermitian tensor) are real, since on averaging we can choose arbitrarily the set of independent wave functions (corresponding to a given degenerate level), and we can always ensure that all the functions are real.

For free atoms or molecules (not in an external field), the degeneracy of levels is usually due to an angular momentum which is freely oriented in space. Let the initial state in scattering have angular momentum $J_1$, and the final state $J_2$. As usual, the scattering cross-section must be averaged over all values of the component $M_1$, and summed over the values of $M_2$. After the averaging, the cross-section is independent of $M_2$, and the summation is therefore equivalent to multiplying by $2J_2+1$. Thus the averaged scattering cross-section is

$$d\sigma = \omega\omega'^3\,c^{(21)}_{iklm}\,e_i'^*e_k\,e_l'e_m^*\,do', \tag{60.1}$$

where

$$c^{(21)}_{iklm} = \frac{1}{2J_1+1}\sum_{M_1,M_2}(c_{ik})_{21}(c_{lm})_{21}^* = (2J_2+1)\,\overline{(c_{ik})_{21}(c_{lm})_{21}^*}^{\,1}; \tag{60.2}$$

the bar with index 1 signifies averaging over $M_1$.

For Rayleigh scattering, states 1 and 2 belong to the same energy level ($\omega_{12} = 0$). If only coherent scattering is considered, then states 1 and 2 must coincide completely, so that $M_1 = M_2$. In that case the summation over $M_2$, and hence the factor $2J_2+1$ in (60.2), no longer appear:

$$c^{\mathrm{coh}}_{iklm} = \overline{(c_{ik})_{11}(c_{lm})_{11}^*}^{\,1}. \tag{60.3}$$

The result of the averaging can be written down without further calculation by using the fact that averaging over $M_1$ is equivalent to averaging over all orientations of the system, after which the mean value can only be expressed in terms of the unit tensor $\delta_{ik}$, and the only non-zero mean values are those of products of components of either the scalar, the symmetric or the antisymmetric part of the scattering tensor; it is clear that the unit tensor cannot yield expressions with the symmetry properties of cross-products.
Thus

$$c^{(21)}_{iklm} = G^0_{21}\delta_{ik}\delta_{lm} + c^{(21)s}_{iklm} + c^{(21)a}_{iklm}, \tag{60.4}$$

where

$$G^0_{21} = (2J_2+1)\,\overline{|(c^0)_{21}|^2}^{\,1},\qquad c^{(21)s}_{iklm} = (2J_2+1)\,\overline{(c^s_{ik})_{21}(c^s_{lm})_{21}^*}^{\,1},\qquad c^{(21)a}_{iklm} = (2J_2+1)\,\overline{(c^a_{ik})_{21}(c^a_{lm})_{21}^*}^{\,1}. \tag{60.5}$$

The scattering cross-section (and therefore the scattering intensity) for a freely oriented system is therefore a sum of three independent parts, which will be referred to as scalar, symmetric and antisymmetric scattering.

Each of the three terms in (60.4) can be expressed in terms of one independent quantity: the scalar scattering is expressed in terms of $G^0_{21}$, and for the symmetric and antisymmetric scattering we have

$$\begin{aligned}
c^{(21)s}_{iklm} &= \tfrac{1}{10}G^s_{21}\left(\delta_{il}\delta_{km} + \delta_{im}\delta_{kl} - \tfrac23\delta_{ik}\delta_{lm}\right), & G^s_{21} &= (2J_2+1)\,\overline{(c^s_{ik})_{21}(c^s_{ik})_{21}^*}^{\,1};\\
c^{(21)a}_{iklm} &= \tfrac16G^a_{21}\left(\delta_{il}\delta_{km} - \delta_{im}\delta_{kl}\right), & G^a_{21} &= (2J_2+1)\,\overline{(c^a_{ik})_{21}(c^a_{ik})_{21}^*}^{\,1};
\end{aligned} \tag{60.6}$$

the combinations of unit tensors are derived from the symmetry properties, and the common factor is then found by contracting with respect to the pairs of indices $i$, $l$ and $k$, $m$.

On substituting (60.4)-(60.6) in (60.1), we obtain for the scattering cross-section

$$d\sigma = \omega\omega'^3\left\{G^0_{21}|\mathbf{e}'^*\cdot\mathbf{e}|^2 + \tfrac{1}{10}G^s_{21}\left(1 + |\mathbf{e}'\cdot\mathbf{e}|^2 - \tfrac23|\mathbf{e}'^*\cdot\mathbf{e}|^2\right) + \tfrac16G^a_{21}\left(1 - |\mathbf{e}'\cdot\mathbf{e}|^2\right)\right\}do'. \tag{60.7}$$

This formula shows explicitly the angular dependences and polarization properties of the scattering.

The total cross-section for scattering in any direction, summed over the polarization of the final photon and averaged over the polarization and direction of incidence of the initial photon, is easily obtained directly from (60.1) by noting that $\overline{e_i^*e_k} = \tfrac13\delta_{ik}$ if the averaging is over both the polarization and the direction of propagation of the photon; summation over these would give a corresponding result larger by a factor $2\times4\pi$. The result is

$$\sigma = \frac{8\pi}{9}\,\omega\omega'^3\,c^{(21)}_{ikik} = \frac{8\pi}{9}\,\omega\omega'^3\left(3G^0_{21} + G^s_{21} + G^a_{21}\right). \tag{60.8}$$

It has already been mentioned that the selection rules for scattering are the same as those for the matrix elements of an arbitrary tensor of rank two.
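The three polarization factors in the braces of (60.7) are easy to evaluate numerically for any pair of polarization vectors. The sketch below is only an illustration: the function name `polarization_factors` and the chosen geometry are assumptions made here, and the quantities $G^0_{21}$, $G^s_{21}$, $G^a_{21}$ are left out, so only the geometric factors multiplying them are returned.

```python
import math


def dot(a, b):
    """Plain (unconjugated) scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def conj(v):
    """Complex conjugate of a 3-component vector."""
    return tuple(complex(x).conjugate() for x in v)


def polarization_factors(e_in, e_out):
    """Geometric factors multiplying G0, Gs and Ga in (60.7).

    e_in  : polarization vector e of the incident photon
    e_out : polarization vector e' of the scattered photon
    """
    a = abs(dot(conj(e_out), e_in)) ** 2   # |e'* . e|^2
    b = abs(dot(e_out, e_in)) ** 2         # |e' . e|^2
    return {
        "scalar": a,
        "symmetric": (1 + b - 2 * a / 3) / 10,
        "antisymmetric": (1 - b) / 6,
    }


# Illustrative geometry (an assumption of this sketch): incident light
# linearly polarized along z, scattering direction n' in the xz-plane
# at angle theta to e.
theta = 0.7
e = (0.0, 0.0, 1.0)
e_in_plane = (math.cos(theta), 0.0, -math.sin(theta))  # e' in the (n', e) plane
e_perp = (0.0, 1.0, 0.0)                               # e' perpendicular to it

in_plane = polarization_factors(e, e_in_plane)
perp = polarization_factors(e, e_perp)
```

For this geometry the perpendicular component vanishes in scalar scattering, while the ratio of the perpendicular to the in-plane factor is $3/(3+\sin^2\theta)$ for symmetric and $1/\cos^2\theta$ for antisymmetric scattering, as follows directly from (60.7).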
Because of the separation of the scattering intensity into three independent parts, it is convenient to state the rules for each part separately.

The selection rules for symmetric scattering are the same as those for electric quadrupole radiation, since the latter is likewise determined by an irreducible symmetric tensor (the quadrupole moment tensor). For antisymmetric scattering, the selection rules are the same as those for magnetic dipole radiation, since both are determined by an axial vector (an antisymmetric tensor is equivalent, or dual, to an axial vector).† There is a difference here, however, in that the diagonal matrix elements, which in the case of emission give the mean values of the electric or magnetic moments (and do not correspond to radiative transitions), are important in the case of scattering, since they relate to coherent scattering.

† This refers, of course, to the selection rules based on symmetry, and not due to the specific form of the axial vector in the case of emission; the magnetic moment vector includes a spin part, whereas in scattering we have the matrix elements of orbital (coordinate) quantities.

For scalar scattering the selection rules are the same as those for the matrix elements of a scalar. This means that only transitions between states of the same symmetry are possible. In particular, the values of the total angular momentum $J$ and its component $M$ must be the same, and the matrix elements diagonal in $M$ are independent of $M$; see QM, (29.3). For Rayleigh scattering, therefore, states 1 and 2 must coincide completely (as regards $M$ as well as energy), and so scalar Rayleigh scattering is entirely coherent. Conversely, since in scalar scattering all states always combine with themselves, it follows that in coherent scattering there is always a scalar part.
For a system freely oriented in space, the polarizability tensor also must be averaged over the directions of the angular momentum $J_1$, in the same way as the scattering cross-section has been averaged above. The averaging is very simply carried out: we evidently have

$$\alpha_{ik} \equiv \overline{(c_{ik})_{11}} = \overline{(c^0)_{11}}\,\delta_{ik}.$$

The symmetric and antisymmetric parts of the scattering tensor vanish on averaging, since $\delta_{ik}$ is the only isotropic tensor of rank two. It has been mentioned that the diagonal matrix elements of a scalar are independent of $M_1$. The mark of averaging of $(c^0)_{11}$ may therefore be omitted, and this quantity calculated for any $M_1$, so that the polarizability is

$$\alpha_{ik} = (c^0)_{11}\delta_{ik}. \tag{60.9}$$

For the same reason, the averaging sign may be omitted in the quantity $G^0_{\mathrm{coh}}$, which determines the scalar part of the coherent scattering:

$$G^0_{\mathrm{coh}} = |(c^0)_{11}|^2 = \tfrac19|(c_{ii})_{11}|^2; \tag{60.10}$$

the factor $2J_2+1$ is omitted in accordance with (60.3). Thus there is a simple relation between the mean polarizability and the scalar part of the coherent scattering: both are determined by the quantity

$$(c^0)_{11} = \frac23\sum_n\frac{\omega_{n1}|\mathbf{d}_{n1}|^2}{\omega_{n1}^2 - \omega^2}. \tag{60.11}$$

PROBLEMS

PROBLEM 1. Find the angular distribution and the degree of depolarization in the scattering of linearly polarized radiation.

SOLUTION. Let $\theta$ be the angle between the direction of scattering $\mathbf{n}'$ and the direction of polarization $\mathbf{e}$ of the incident radiation. The scattered radiation has two independent components, polarized one in the plane of $\mathbf{n}'$ and $\mathbf{e}$ (intensity $I_1$) and one perpendicularly to this plane (intensity $I_2$); the degree of depolarization is $I_2/I_1$. The intensities $I_1$ and $I_2$ are given by (60.7) with the appropriate directions of $\mathbf{e}'$.

In scalar scattering, the radiation remains completely polarized in the same plane ($I_2 = 0$), and the angular distribution of intensity is $I = \tfrac32\sin^2\theta$. Here and below, the expressions for $I = I_1 + I_2$ are normalized so as to give unity on averaging over directions.

In symmetric scattering

$$I = \tfrac{3}{20}(6 + \sin^2\theta),\qquad I_2/I_1 = 3/(3 + \sin^2\theta).$$
In antisymmetric scattering

$$I = \tfrac34(1 + \cos^2\theta),\qquad I_2/I_1 = 1/\cos^2\theta.$$

PROBLEM 2. The same as Problem 1, but for the scattering of natural light.

SOLUTION. Formula (60.7) can be applied to natural (unpolarized) incident light by the substitution

$$e_ie_k^* \to \tfrac12(\delta_{ik} - n_in_k),$$

which corresponds to averaging over the direction of polarization $\mathbf{e}$ with a given direction of incidence $\mathbf{n}$. The scattered light will be partly polarized, and from considerations of symmetry it is evident that its two independent components will be linearly polarized in the scattering plane of $\mathbf{n}$ and $\mathbf{n}'$ (intensity $I_\parallel$) and perpendicularly to this plane (intensity $I_\perp$). The scattering angle between $\mathbf{n}$ and $\mathbf{n}'$ will be denoted by $\vartheta$.

For scalar scattering

$$I = I_\perp + I_\parallel = \tfrac34(1 + \cos^2\vartheta),\qquad I_\parallel/I_\perp = \cos^2\vartheta.$$

For symmetric scattering

$$I = \tfrac{3}{40}(13 + \cos^2\vartheta),\qquad I_\parallel/I_\perp = (6 + \cos^2\vartheta)/7.$$

For antisymmetric scattering

$$I = \tfrac38(2 + \sin^2\vartheta),\qquad I_\parallel/I_\perp = 1 + \sin^2\vartheta.$$

PROBLEM 3. For scattering of circularly polarized radiation, determine the reversal factor (the ratio of the intensity of the component circularly polarized in the "reverse" direction to that of the component polarized in the original direction).

SOLUTION. For circularly polarized incident radiation, the angular distribution and the degree of depolarization ($I_\parallel/I_\perp$) are the same as in the scattering of natural light. Let the vector $\mathbf{e}$ of the incident radiation have components $(1/\sqrt2)(1, i, 0)$ in coordinates such that the $xz$-plane is the scattering plane and the $z$-axis is along $\mathbf{n}$. Then the polarization vectors for the reverse and original circularly polarized components of the scattered radiation are

$$\mathbf{e}' = \frac{1}{\sqrt2}(\cos\vartheta, -i, -\sin\vartheta)\quad\text{and}\quad\mathbf{e}' = \frac{1}{\sqrt2}(\cos\vartheta, i, -\sin\vartheta).$$

Calculation of the intensity by means of (60.7) gives the reversal factors $P$ for the three types of scattering:

$$P^0 = \tan^4\tfrac12\vartheta,\qquad P^s = \frac{13 + \cos^2\vartheta + 10\cos\vartheta}{13 + \cos^2\vartheta - 10\cos\vartheta},\qquad P^a = \frac{1 - \cos^4\tfrac12\vartheta}{1 - \sin^4\tfrac12\vartheta},$$

where $\vartheta$ is the scattering angle.

PROBLEM 4.
Calculate the cross-section for scattering of a low-frequency photon by a hydrogen atom in the ground state.

SOLUTION. A low-frequency photon can undergo only elastic scattering. Since the orbital angular momentum $L$ of the hydrogen atom in the ground state is zero, the selection rules (neglecting the spin-orbit interaction) allow only scalar scattering. The static polarizability of the atom is (in ordinary units) $\alpha = (9/2)(\hbar^2/me^2)^3$; see QM, §76, Problem 4. Substitution in (60.8) gives the required cross-section:

$$\sigma = 54\pi(\omega/c)^4(\hbar^2/me^2)^6.$$

PROBLEM 5. Calculate the cross-section for elastic scattering of $\gamma$-rays by a deuteron (H. A. Bethe and R. E. Peierls, 1935).

SOLUTION. The wave functions of the deuteron ground state and of its continuous-spectrum states (the dissociated deuteron) are

$$\psi_0 = \sqrt{\frac{\kappa}{2\pi}}\,\frac{e^{-\kappa r}}{r},\qquad\psi_{\mathbf{p}} = e^{i\mathbf{p}\cdot\mathbf{r}};$$

see (58.2), (58.3). The matrix element of the dipole moment is $\mathbf{d}_{\mathbf{p}0} = -ie\mathbf{p}_{\mathbf{p}0}/M\omega_{\mathbf{p}0}$ and has been calculated in §58:

$$\mathbf{d}_{\mathbf{p}0} = -\sqrt{\frac{\kappa}{2\pi}}\,\frac{4\pi ie\mathbf{p}}{(p^2+\kappa^2)^2},$$

with the frequencies $\omega_{\mathbf{p}0} = (p^2+\kappa^2)/M$. The polarizability tensor is

$$\alpha_{ik} = \frac23\delta_{ik}\int\frac{\omega_{\mathbf{p}0}|\mathbf{d}_{0\mathbf{p}}|^2}{\omega_{\mathbf{p}0}^2-\omega^2}\,\frac{d^3p}{(2\pi)^3} - \frac{e^2}{2M\omega^2}\delta_{ik}.$$

The first term is due to the virtual excitation of the internal degrees of freedom of the deuteron, and is written in the form (60.11). The second term is due to the action of the wave field on the translational motion of the deuteron as a whole. Since this motion is quasi-classical, the corresponding part of the scattering tensor is given by (59.14), with $m$ replaced by the deuteron mass $2M$.

The calculation of $\alpha_{ik}$ depends on that of the integral

$$J = \int_0^\infty\frac{z^4\,dz}{(z^2+1)^3[(z^2+1)^2-\gamma^2]},\qquad z = p/\kappa,\quad\gamma = M\omega/\kappa^2 = \omega/I.$$

We have

$$J = \frac18\left[\frac1\lambda\frac{d}{d\lambda}\,\frac1\lambda\frac{dJ_0}{d\lambda}\right]_{\lambda=1},\qquad\text{where}\qquad J_0 = \frac12\int_{-\infty}^{\infty}\frac{z^4\,dz}{(z^2+\lambda^2)[(z^2+1)^2-\gamma^2]}.$$

When $\gamma < 1$, the integrand has poles at the points $i\lambda$, $i\sqrt{1+\gamma}$, $i\sqrt{1-\gamma}$ in the upper half-plane of the complex variable $z$; the integral $J_0$ can be calculated from the residues at these poles. The result is

$$J_0 = \frac\pi2\left\{\frac{\lambda^3}{(1-\lambda^2)^2-\gamma^2} + \frac{1}{2\gamma}\left[\frac{(1+\gamma)^{3/2}}{1+\gamma-\lambda^2} - \frac{(1-\gamma)^{3/2}}{1-\gamma-\lambda^2}\right]\right\}.$$

The total scattering cross-section is expressed in terms of $\alpha_{ik}$ by (60.8), and is (in ordinary units)

$$\sigma = \frac{8\pi}{3}\left(\frac{e^2}{Mc^2}\right)^2\left|-1 - \frac{4}{3\gamma^2} + \frac{2}{3\gamma^2}\left[(1+\gamma)^{3/2} + (1-\gamma)^{3/2}\right]\right|^2$$

for $\gamma = \hbar\omega/I < 1$. For $\gamma > 1$ the scattering amplitude (above the deuteron dissociation threshold) is found from that for $\gamma < 1$ by analytical continuation; it has an imaginary part, which must be positive (in accordance with the avoidance rule in (59.17)):

$$\sigma = \frac{8\pi}{3}\left(\frac{e^2}{Mc^2}\right)^2\left\{\left[-1 - \frac{4}{3\gamma^2} + \frac{2}{3\gamma^2}(1+\gamma)^{3/2}\right]^2 + \frac{4}{9\gamma^4}(\gamma-1)^3\right\}$$

for $\gamma > 1$. When $\gamma \gg 1$ we have $\sigma = (8\pi/3)(e^2/Mc^2)^2$, which agrees, as it should, with (non-relativistic) scattering by a free proton. The angular distribution of radiation is

$$d\sigma = \sigma\cdot\tfrac34(1+\cos^2\theta)\,do/4\pi,$$

where $\theta$ is the scattering angle.

If the scattering amplitude is defined by (59.24), we have

$$\operatorname{im}f(0) = \frac{2e^2}{3Mc^2}\,\frac{(\gamma-1)^{3/2}}{\gamma^2}.$$

According to the optical theorem (59.26), this quantity must equal $\omega\sigma_t/4\pi c$, where $\sigma_t$ is the total cross-section for photodissociation (58.4). The elastic scattering cross-section is of a higher order ($\sim e^4$) than the dissociation cross-section ($\sim e^2$; see (58.4)), and therefore $\sigma_t$ is equal to the dissociation cross-section. For the same reason, in the approximation considered, the scattering amplitude was found to be real for $\gamma < 1$ (i.e. below the dissociation threshold).

§ 61. Scattering by molecules

The specific properties of molecular scattering are due to the same properties of molecules as form the basis of the theory of molecular spectra, namely the possibility of treating separately the state of the electrons with the nuclei fixed and the motion of the nuclei in a given effective field of the electrons.

Let the frequency $\omega$ of the incident radiation be less than the energy $\omega_e$ of the first electron excitation. Then the electron terms will not be excited in the scattering process. The scattering will be either Rayleigh scattering, or Raman scattering due to the excitation of rotational or vibrational levels.
Let us further assume that the electron ground term of the molecule is not degenerate (and has no fine structure). That is, we assume that the total spin of the electrons and the component of their total orbital angular momentum along the axis of the molecule (for molecules of the symmetrical-top type) are both zero. For diatomic molecules this means that the electron ground term must be ${}^1\Sigma$. These conditions are known to be satisfied for the ground states of most molecules.†

† The results given below are, however, valid (to a certain approximation) also for cases where degeneracy of the electron ground term is due to a non-zero spin, the spin-orbit interaction being small (so that the resulting fine structure may be neglected). In this approximation, states with different spin directions do not combine, and in this sense they behave as if they were not degenerate. The molecule O$_2$, with ground term ${}^3\Sigma$, is of this type.

Finally, we shall assume the frequency $\omega$ large compared with the intervals in the nuclear (rotational and vibrational) structure of the ground term, and the difference $\omega_e - \omega$ to be in a similar relation to the nuclear structure of the excited term. Thus the frequency of the incident radiation must be sufficiently far from resonances.

These conditions make it possible, in calculating the scattering tensor, to ignore at first the motion of the nuclei and to discuss the problem with a given configuration of the nuclei. In such a problem, the scattering tensor is the same as the polarizability tensor, $\alpha_{ik} = (c_{ik})_{11}$, and can in principle be calculated from the general formula (59.17), in which the summation is over all excited electron terms. The quantities $\alpha_{ik}$ thus obtained will be functions of the coordinates $q$ of the nuclear configuration (the energies and wave functions of the electron terms depend on these coordinates as parameters). Since the state is not degenerate, the tensor $\alpha_{ik}(q)$ is real, and therefore symmetrical. The tensor $\alpha_{ik}(q)$ is the electronic polarizability of a given nuclear configuration in the molecule.

To solve an actual problem of scattering, we have also to take into account the motion of the nuclei in the initial and final states. Let $\psi_{s_1}(q)$ and $\psi_{s_2}(q)$ be the nuclear wave functions of these states, $s_1$ and $s_2$ being the sets of vibrational and rotational quantum numbers. The required scattering tensor is the matrix element of the tensor $\alpha_{ik}(q)$ with respect to these functions:

$$\langle s_2|\alpha_{ik}|s_1\rangle = \int\psi_{s_2}^*(q)\,\alpha_{ik}(q)\,\psi_{s_1}(q)\,dq. \tag{61.1}$$

Because the tensor $\alpha_{ik}(q)$ is symmetrical, so is the tensor (61.1) (whether $s_1$ and $s_2$ are the same or not). Thus we conclude that, under the conditions stated, there will be no antisymmetric part in either Rayleigh or Raman scattering. The scattering will include only scalar and symmetric parts.

The scalar part $\alpha^0(q)$ of the polarizability is independent of the orientation of the molecule, and depends only on the internal configuration of the atoms within it. Let $v$ denote the set of vibrational quantum numbers of the molecule, and $r$ the set of rotational numbers other than the magnetic number $m$. Then the matrix elements are

$$\langle v_2r_2m_2|\alpha^0|v_1r_1m_1\rangle = \langle v_2|\alpha^0|v_1\rangle\,\delta_{r_1r_2}\delta_{m_1m_2}. \tag{61.2}$$

The diagonality with respect to the numbers $r$ and $m$ is true of any scalar. The particular property of (61.2) is that here the elements do not depend on these numbers at all. Thus the scalar scattering occurs only for purely vibrational transitions and does not depend on the rotational state.

The symmetric scattering is determined by the matrix elements of the tensor $\alpha^s_{ik}$. Its components in a fixed coordinate system $xyz$ are expressed in terms of the components $\bar\alpha^s_{i'k'}$ in a system $\xi\eta\zeta$ moving with the molecule by

$$\alpha^s_{ik} = \sum_{i',k'}\bar\alpha^s_{i'k'}D_{i'i}D_{k'k}, \tag{61.3}$$

where the $D_{i'i}$ are the direction cosines of the new axes relative to the old.
The quantities $\bar\alpha^s_{i'k'}$ do not depend on the orientation of the molecule, and the $D_{i'i}$ do not depend on the internal coordinates. Hence

$$\langle v_2r_2m_2|\alpha^s_{ik}|v_1r_1m_1\rangle = \sum_{i',k'}\langle v_2|\bar\alpha^s_{i'k'}|v_1\rangle\langle r_2m_2|D_{i'i}D_{k'k}|r_1m_1\rangle.$$

The sum of the squared moduli of these quantities over $r_2$, $m_2$, $i$, $k$ is easily seen to be†

$$\sum_{r_2,m_2}\sum_{i,k}|\langle v_2r_2m_2|\alpha^s_{ik}|v_1r_1m_1\rangle|^2 = \sum_{i',k'}|\langle v_2|\bar\alpha^s_{i'k'}|v_1\rangle|^2. \tag{61.4}$$

† In transforming the sum we use the equation

$$\sum_{r_2,m_2}\sum_{i,k}\langle r_1m_1|D_{i'i}D_{k'k}|r_2m_2\rangle\langle r_2m_2|D_{j'i}D_{l'k}|r_1m_1\rangle = \Big\langle r_1m_1\Big|\sum_{i,k}D_{i'i}D_{k'k}D_{j'i}D_{l'k}\Big|r_1m_1\Big\rangle = \langle r_1m_1|\delta_{i'j'}\delta_{k'l'}|r_1m_1\rangle = \delta_{i'j'}\delta_{k'l'},$$

which expresses the unitarity of the matrix $D_{ik}$.

This means that the total intensity of scattering with transitions from a given vibrational-rotational level $v_1$, $r_1$ to all rotational levels of the vibrational state $v_2$ is independent of $r_1$.

For molecules of the symmetrical-top type, we can go further and derive a relation between the scattering intensity and the rotational quantum numbers for every transition $v_1r_1 \to v_2r_2$. In this case the numbers $r$ are the angular momentum $J$ and its component $k$ along the axis of the molecule. We replace the Cartesian components of $\alpha^s_{ik}$ by the corresponding spherical tensor of rank two, denoting its components by $\alpha_\lambda$ ($\lambda = 0, \pm1, \pm2$). According to QM, (110.7), the squared moduli of its matrix elements are

$$|\langle v_2J_2k_2m_2|\alpha_\lambda|v_1J_1k_1m_1\rangle|^2 = (2J_1+1)(2J_2+1)\begin{pmatrix}J_2 & 2 & J_1\\ -k_2 & \lambda' & k_1\end{pmatrix}^2\begin{pmatrix}J_2 & 2 & J_1\\ -m_2 & \lambda & m_1\end{pmatrix}^2|\langle v_2|\bar\alpha_{\lambda'}|v_1\rangle|^2,$$

where $\bar\alpha_\lambda(q)$ is the spherical polarizability tensor relative to axes fixed in the molecule, and $\lambda' = k_2 - k_1$. Summing over $m_2$ and $\lambda = m_2 - m_1$ (with $m_1$ fixed), we obtain (cf. QM, (110.8))

$$\sum_{m_2,\lambda}|\langle v_2J_2k_2m_2|\alpha_\lambda|v_1J_1k_1m_1\rangle|^2 = (2J_2+1)\begin{pmatrix}J_2 & 2 & J_1\\ -k_2 & \lambda' & k_1\end{pmatrix}^2|\langle v_2|\bar\alpha_{\lambda'}|v_1\rangle|^2. \tag{61.5}$$

This quantity determines the intensity of scattering with the vibrational-rotational transition $v_1J_1k_1 \to v_2J_2k_2$. Since the matrix elements $\langle v_2|\bar\alpha_{\lambda'}|v_1\rangle$ do not depend on the rotation of the molecule, this also defines the dependence of the intensity on $J_1$, $J_2$ and on $k_1$, $k_2$. The right-hand side of (61.5), it may be noted, involves only one spherical component of the polarizability tensor.
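The rotational factor multiplying $|\langle v_2|\bar\alpha_{\lambda'}|v_1\rangle|^2$ in (61.5) can be tabulated numerically. The following is a minimal sketch, not a library routine: `wigner_3j` is a hand-written implementation of Racah's formula restricted to integer arguments, and `rotational_factor` is a name introduced here for the coefficient $(2J_2+1)(\dots)^2$.

```python
from math import factorial, sqrt


def wigner_3j(j1, j2, j3, m1, m2, m3):
    """3j-symbol for integer arguments, evaluated with Racah's formula."""
    if m1 + m2 + m3 != 0:
        return 0.0
    if abs(m1) > j1 or abs(m2) > j2 or abs(m3) > j3:
        return 0.0
    if j3 < abs(j1 - j2) or j3 > j1 + j2:
        return 0.0
    f = factorial
    delta = (f(j1 + j2 - j3) * f(j1 - j2 + j3) * f(-j1 + j2 + j3)
             / f(j1 + j2 + j3 + 1))
    norm = sqrt(delta * f(j1 + m1) * f(j1 - m1) * f(j2 + m2)
                * f(j2 - m2) * f(j3 + m3) * f(j3 - m3))
    # Summation limits keep every factorial argument non-negative.
    t_min = max(0, j2 - j3 - m1, j1 - j3 + m2)
    t_max = min(j1 + j2 - j3, j1 - m1, j2 + m2)
    total = 0.0
    for t in range(t_min, t_max + 1):
        denom = (f(t) * f(j3 - j2 + t + m1) * f(j3 - j1 + t - m2)
                 * f(j1 + j2 - j3 - t) * f(j1 - t - m1) * f(j2 - t + m2))
        total += (-1) ** t / denom
    return (-1) ** (j1 - j2 - m3) * norm * total


def rotational_factor(J1, k1, J2, k2):
    """Coefficient of |<v2|alpha_bar_lambda'|v1>|^2 in (61.5), lambda' = k2 - k1."""
    return (2 * J2 + 1) * wigner_3j(J2, 2, J1, -k2, k2 - k1, k1) ** 2


# Example: transitions from J1 = 3, k1 = 1 to k2 = 2 for all allowed J2.
factors = {J2: rotational_factor(3, 1, J2, 2) for J2 in range(0, 6)}
```

By the orthogonality of the 3j-symbols the factors for fixed $k_1$ and $k_2$ add up to unity when summed over $J_2$, so each entry of `factors` can be read as the fraction of the intensity going into the corresponding rotational branch.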
Summation of (61.5) over $J_2$ and $k_2$ gives†

$$\sum_\lambda\sum_{J_2,k_2,m_2}|\langle v_2J_2k_2m_2|\alpha_\lambda|v_1J_1k_1m_1\rangle|^2 = \sum_{\lambda'}|\langle v_2|\bar\alpha_{\lambda'}|v_1\rangle|^2,$$

and we return to the sum rule (61.4).

† In the summation over $J_2$ with given $k_1$, $\lambda'$ (and $k_2 = k_1 + \lambda'$), we have

$$\sum_{J_2}(2J_2+1)\begin{pmatrix}J_2 & 2 & J_1\\ -k_2 & \lambda' & k_1\end{pmatrix}^2 = 1$$

according to QM, (106.13). The summation over $k_2$ (or, equivalently, over $\lambda' = k_2 - k_1$) is then effected for given $k_1$.

A special case of the symmetrical top is the rotator, a linear molecule (or, as a particular instance, a diatomic molecule). The angular momentum component along the axis of such a molecule is zero (in a non-degenerate electronic state with zero electronic orbital angular momentum).‡ In this case, therefore, we must put $k_1 = k_2 = 0$ in (61.5).

‡ Here we do not include effects due to the interaction between the vibrations and the rotation of the molecule (see QM, §104).

Finally, let us consider the question of the selection rules in vibrational Raman scattering, together with the cognate question of vibrational emission (or absorption) spectra of molecules.† For scattering, the problem is simply to find the conditions under which there are non-zero matrix elements of the tensor $\alpha_{ik}(q)$ with respect to the vibrational wave functions $\psi_v(q)$; the scalar $\alpha^0$ (for scalar scattering) and the irreducible symmetrical tensor $\alpha^s_{ik}$ (for symmetric scattering) have to be considered separately. A corresponding role in emission (or absorption) is played by the matrix elements of the vector $\mathbf{d}(q)$, the dipole moment of the molecule averaged over the electronic state with a given position of the nuclei. This has already been stated in §54 for diatomic molecules.

The vibrations of a polyatomic molecule are classified according to types of symmetry, the irreducible representations $D_\alpha$ of the corresponding point group, where $\alpha$ numbers the representation (see QM, §100). These representations also define the symmetry of wave functions of vibrational states of the molecule (see QM, §101).
The symmetry of the wave functions of the first vibrational state (quantum number $v_\alpha = 1$) is the same as the symmetry $D_\alpha$ of the vibration type; the symmetry of the higher states ($v_\alpha > 1$) is given by the representations $[D_\alpha^{v_\alpha}]$, which are symmetric products of $v_\alpha$ representations $D_\alpha$. Finally, the symmetry of states in which different vibrations $\alpha$ and $\beta$ are simultaneously excited is given by the direct product $[D_\alpha^{v_\alpha}]\times[D_\beta^{v_\beta}]$.‡

† These spectra lie in the infra-red, and are usually observed as absorption spectra.
‡ The symmetry properties of the vibrational wave functions are, of course, independent of the specific form of the vibrational potential energy, and in particular are independent of the assumption made in QM, §101, that the vibrations are harmonic.

The selection rules for the various quantities (scalar, vector, tensor) with respect to types of symmetry are found as described in QM, §97.

The selection rules resulting from the symmetry properties of the molecule are rigorous. There are also approximate rules based on the assumption that the vibrations are harmonic and that the functions $\alpha_{ik}(q)$ or $\mathbf{d}(q)$ can be expanded in powers of the vibrational coordinates $q$. These are a consequence of the known selection rule for a harmonic oscillator, according to which the matrix elements of the oscillator coordinate $q$ are zero except for transitions in which the change in the vibrational quantum number $\Delta v = \pm1$.

§ 62. Natural width of spectral lines

So far, in the study of emission and scattering of radiation, we have regarded all the levels of the system (an atom, say) as being strictly discrete. But in fact excited levels have a certain probability of decay by emission, and therefore a finite lifetime. According to the general principles of quantum mechanics, this has the result that the levels become quasi-discrete, with a certain small but finite width (see QM, §134); they can be written in the form $E - \tfrac12i\Gamma$, where $\Gamma$ ($= \Gamma/\hbar$) is the total probability (per unit time) of all possible processes of "decay" of the state concerned.

Let us consider how this situation affects the process of emission (V. Weisskopf and E. Wigner, 1930). It is evident that, because of the finite width of the levels, the emitted radiation will not be strictly monochromatic: its frequencies will be spread over a range $\Delta\omega \sim \Gamma$ ($= \Gamma/\hbar$). But, in order to measure the frequency distribution of the photons with this accuracy, the time needed is $T \gg 1/\Delta\omega \sim 1/\Gamma$. During this time the level will almost certainly decay by emission. We therefore have to deal with the determination of the total probability of emission of a photon of a given frequency, not with the probability per unit time.

We shall calculate this total probability, first of all, for a transition of an atom from some excited level $E_1 - \tfrac12i\Gamma_1$ to the ground level $E_2$, which has an infinite lifetime and is therefore strictly discrete.

Let $\Psi$ be the wave function of the atom and the photon field, and $\hat H = \hat H^{(0)} + \hat V$ the Hamiltonian of the system, where $\hat V$ is the atom-field interaction operator. We shall seek a solution of Schrödinger's equation

$$i\frac{\partial\Psi}{\partial t} = (\hat H^{(0)} + \hat V)\Psi \tag{62.1}$$

in the form of an expansion in terms of the wave functions of the unperturbed states of the system:

$$\Psi = \sum_\nu a_\nu(t)\Psi_\nu^{(0)} = \sum_\nu a_\nu(t)e^{-i\mathscr{E}_\nu t}\psi_\nu^{(0)}. \tag{62.2}$$

For the coefficients $a_\nu(t)$ we obtain the equations

$$i\frac{da_\nu}{dt} = \sum_{\nu'}\langle\nu|V|\nu'\rangle a_{\nu'}\exp\{i(\mathscr{E}_\nu - \mathscr{E}_{\nu'})t\}. \tag{62.3}$$

Let $|\nu\rangle$ be a state with energy $\mathscr{E}_\nu = E_2 + \omega$, in which the atom is at the ground level $E_2$ and there is one quantum with a definite frequency $\omega$; this state will be symbolized by $|\omega2\rangle$. At the initial instant, the system is in the state $|1\rangle$, the atom being excited to the level $E_1$, with no photons present. Thus, for $t = 0$ we must have

$$a_1 = 1,\qquad a_\nu = 0\ \text{ for }\ |\nu\rangle \ne |1\rangle.$$
(62.4)

The solution of equation (62.3) with this initial condition will give (with the appropriate normalization of the wave functions) the probability that at time $t$ there has been a transition $1 \to 2$ of the atom with emission of a photon in the frequency range $d\omega$: it is $|a_{\omega2}(t)|^2\,d\omega$. We are interested in the ultimate probability as $t \to \infty$:

$$dw = |a_{\omega2}(\infty)|^2\,d\omega. \tag{62.5}$$

In order to clarify the problem, it may be recalled that, in finding the ordinary emission probability (per unit time) with a transition $1 \to 2$ (neglecting the level width), equation (62.3) has to be solved with all the $a_{\nu'}(t)$ on the right-hand side replaced, to a first approximation, by the values (62.4). The solution thus obtained is then examined for large $t$; cf. QM, §42. We can now describe this procedure more precisely; it relates to times short in comparison with the lifetime of the excited level, and the large values of $t$ concerned are large compared with $1/(E_1 - E_2)$ but small compared with $1/\Gamma_1$.

In our present case, where times comparable with $1/\Gamma_1$ are considered, the function $a_1(t)$ decreases in time according to

$$a_1(t) = e^{-\Gamma_1t/2}. \tag{62.6}$$

The functions $a_\nu(t)$ for states $|\nu\rangle$ which can result from emission by the atom increase with time, however. If the transition from a given level $E_1$ can occur to various atomic levels (as well as to $E_2$), there will be many increasing functions $a_\nu(t)$, each corresponding to a state in which the atom is at a certain level and there is one photon with the appropriate energy. Nevertheless, there still remains on the right of (62.3) only the term with $|\nu'\rangle = |1\rangle$: since the matrix elements are zero except for transitions in which the number of photons with some one energy changes by 1, they are certainly zero for transitions between states containing one photon each, with different energies. Thus we have for $a_{\omega2}$ the equation

$$i\frac{da_{\omega2}}{dt} = \langle\omega2|V|1\rangle e^{i(\mathscr{E}_{\omega2}-E_1)t}a_1 = \langle\omega2|V|1\rangle\exp\{i(\omega-\omega_{12})t - \tfrac12\Gamma_1t\}, \tag{62.7}$$

where $\omega_{12} = E_1 - E_2$.
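Equation (62.7) is a first-order equation with a known right-hand side, so it can be integrated numerically as a sanity check on the limiting amplitude. The sketch below is illustrative only: the function name, the numerical values of the matrix element, detuning and width, and the use of Simpson's rule are all choices made here, in units with $\hbar = 1$.

```python
import cmath


def emitted_amplitude(v, detuning, gamma, t_end=40.0, steps=8000):
    """Integrate i da/dt = v * exp{(i*detuning - gamma/2) t} from a(0) = 0.

    v        : matrix element <omega 2|V|1> (taken constant here)
    detuning : omega - omega_12
    gamma    : total width Gamma_1 of the initial level
    """
    def rhs(t):
        # da/dt obtained by dividing (62.7) by i
        return -1j * v * cmath.exp((1j * detuning - 0.5 * gamma) * t)

    dt = t_end / steps
    a = 0j
    for n in range(steps):
        t = n * dt
        # Simpson's rule on each step (equivalent to RK4 for this equation)
        a += dt * (rhs(t) + 4 * rhs(t + 0.5 * dt) + rhs(t + dt)) / 6
    return a


v, detuning, gamma = 0.05, 0.7, 1.0          # illustrative values
a_inf = emitted_amplitude(v, detuning, gamma)
probability_density = abs(a_inf) ** 2        # |a(infinity)|^2
```

For $t_{\rm end} \gg 1/\Gamma_1$ the result settles to $\langle\omega2|V|1\rangle/(\omega - \omega_{12} + \tfrac12i\Gamma_1)$, so that `probability_density` follows the dispersion form $|V|^2/[(\omega-\omega_{12})^2 + \tfrac14\Gamma_1^2]$.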
Integration, with the condition $a_{\omega 2}(0) = 0$, gives

$$ a_{\omega 2} = \langle\omega 2|V|1\rangle\, \frac{1 - \exp\{i(\omega - \omega_{12})t - \tfrac{1}{2}\Gamma_1 t\}}{\omega - \omega_{12} + \tfrac{1}{2}i\Gamma_1}. $$ (62.8)

Hence the probability $dw$ (62.5) is

$$ dw = |\langle\omega 2|V|1\rangle|^2\, \frac{d\omega}{(\omega - \omega_{12})^2 + \tfrac{1}{4}\Gamma_1^2}. $$

Since the width $\Gamma_1 \ll \omega_{12}$, we can put $\omega = \omega_{12}$ in the factor $|\langle\omega 2|V|1\rangle|^2$. Then the quantity $2\pi|\langle\omega 2|V|1\rangle|^2$ is the ordinary probability (per unit time) for the emission of a photon with frequency $\omega_{12}$ and other properties besides the frequency, such as the direction of motion and the polarization, whose existence has so far been ignored in order to simplify the notation. The dependence of the probability on these characteristics is entirely determined by the factor $|\langle\omega 2|V|1\rangle|^2$. Thus the allowance for the level width does not affect the polarization properties or the angular distribution of the radiation.

The sum

$$ \Gamma_{1\to 2} = 2\pi \sum |\langle\omega 2|V|1\rangle|^2, $$ (62.9)

taken over the polarizations and directions of motion of the photon, is the usual total probability of emission. It is also the part of the width of the level $E_1$ (the partial width) which is due to the transition $1 \to 2$, as distinct from the total width $\Gamma_1$, which is made up of contributions from all possible modes of "decay" of the quasi-stationary state considered.† By a similar summation of the probability $dw$, we obtain the following final formula for the frequency distribution of the emitted radiation:

$$ dw = w_t\, \frac{\Gamma_1}{2\pi}\, \frac{d\omega}{(\omega - \omega_{12})^2 + \tfrac{1}{4}\Gamma_1^2}, $$ (62.10)

where $w_t = \Gamma_{1\to 2}/\Gamma_1$ is the total relative probability of the transition $1 \to 2$. This is a dispersion-type distribution. The shape of the spectral line that is given by formula (62.10) is that which occurs for an isolated atom at rest, and is called the natural shape.‡

Now let the level $E_2$ of the atom be also an excited level with a finite width $\Gamma_2$. We shall assume for simplicity that this width is due to transitions of the atom to the ground state $E_0$ with the emission of one photon; the final result (62.12) will not depend on this assumption.
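As a rough numerical illustration of the case of a stable final level treated in (62.7)-(62.10), the following sketch integrates (62.7) by the trapezoidal rule and compares $|a_{\omega 2}(\infty)|^2$ with the closed form that follows from (62.8). The function names, the constant matrix element `v`, the finite upper limit standing in for $t \to \infty$, and the step count are all assumptions made for illustration; they are not part of the text.

```python
import cmath
import math

def amplitude_numeric(delta, gamma, v=1.0, n_steps=40000):
    """Integrate i da/dt = v * exp(i*delta*t - gamma*t/2) with a(0) = 0.

    delta = omega - omega_12 is the detuning and gamma the width Gamma_1.
    The upper limit 60/gamma stands in for t -> infinity (assumption:
    the neglected tail is of relative order exp(-30)).
    """
    t_max = 60.0 / gamma
    h = t_max / n_steps
    total = 0j
    for k in range(n_steps + 1):
        weight = 0.5 if k in (0, n_steps) else 1.0
        total += weight * cmath.exp((1j * delta - 0.5 * gamma) * k * h)
    return -1j * v * total * h

def probability_closed(delta, gamma, v=1.0):
    """|a(infinity)|^2 from (62.8): |v|^2 / (delta^2 + gamma^2 / 4)."""
    return abs(v) ** 2 / (delta ** 2 + 0.25 * gamma ** 2)

def natural_line(delta, gamma):
    """Normalized dispersion curve: the factor multiplying w_t in (62.10)."""
    return (gamma / (2.0 * math.pi)) / (delta ** 2 + 0.25 * gamma ** 2)
```

In units with $\Gamma_1 = 1$, the numerical and closed-form probabilities should agree closely for any detuning, and `natural_line` falls to half its peak value at a detuning of $\Gamma_1/2$, so that the full width at half maximum is $\Gamma_1$.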
The decay of state 1 can then be regarded as an emission of two photons, discussed in §59. The matrix element for this process (not yet taking account of the finite lifetime of state 2) is

$$ \langle\omega\omega' 0|V^{(2)}|1\rangle = \frac{\langle\omega\omega' 0|V|\omega 2\rangle\langle\omega 2|V|1\rangle}{E_0 - E_2 + \omega'}; $$ (62.11)

the state 2 in (59.2) becomes state 0, and in the sum over $n$ the only remaining term corresponding to the atom in state 2 is the one which is large by resonance when $\omega'$ is close to $E_2 - E_0$. If we now take account of the finite lifetime of state 2, this simply changes $E_2$ into $E_2 - \tfrac{1}{2}i\Gamma_2$ in (62.11), giving

$$ \langle\omega\omega' 0|V^{(2)}|1\rangle = \frac{\langle\omega\omega' 0|V|\omega 2\rangle\langle\omega 2|V|1\rangle}{E_0 - E_2 + \omega' + \tfrac{1}{2}i\Gamma_2}. $$

Substituting this value of the matrix element in the equation for $a_{\omega\omega' 0}$ (which differs from (62.7) only as regards notation), we obtain by a derivation exactly similar to that of (62.8)

$$ a_{\omega\omega' 0}(\infty) = \frac{\langle\omega\omega' 0|V|\omega 2\rangle\langle\omega 2|V|1\rangle}{(\omega' - \omega_{20} + \tfrac{1}{2}i\Gamma_2)(\omega + \omega' - \omega_{10} + \tfrac{1}{2}i\Gamma_1)}. $$

The probability of emission of the photons $\omega$ and $\omega'$ is

$$ dw = |a_{\omega\omega' 0}(\infty)|^2\,d\omega\,d\omega' = \frac{\Gamma_{1\to 2}}{2\pi}\,\frac{\Gamma_{2\to 0}}{2\pi}\, \frac{d\omega\,d\omega'}{[(\omega' - \omega_{20})^2 + \tfrac{1}{4}\Gamma_2^2]\,[(\omega + \omega' - \omega_{10})^2 + \tfrac{1}{4}\Gamma_1^2]}. $$ (62.12)

† Formulae (62.6) and (62.9) can, of course, also be obtained by solving the equation for $a_1(t)$ analogous to (62.7). We may note that transitions to states of the continuous spectrum, causing a finite level width, do not necessarily involve the emission of photons. Highly excited (X-ray) levels can decay with emission of an electron and formation of a positive ion in the ground state (the Auger effect).

‡ As distinct from the broadening caused by the interaction of the atom with other atoms (collision broadening) or by the presence of atoms in the source which move with various velocities (Doppler broadening).

This expression has sharp peaks at $\omega' \approx \omega_{20}$ and at $\omega \approx \omega_{12}$, as it should. The shape of the spectral line corresponding to the transition $1 \to 2$ is obtained by integrating (62.12) with respect to $\omega'$; the range of integration can extend from $-\infty$ to $+\infty$.
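The integration over $\omega'$ can also be checked numerically. The sketch below writes the two resonance factors of (62.12) as normalized dispersion curves, integrates their product over $x = \omega' - \omega_{20}$ by the trapezoidal rule, and compares the result with a dispersion curve of width $\Gamma_1 + \Gamma_2$, as in (62.13). The function names, the cut-off and the grid are illustrative assumptions, not part of the text.

```python
import math

def lorentz(x, gamma):
    """Normalized dispersion curve (gamma / 2pi) / (x^2 + gamma^2 / 4)."""
    return (gamma / (2.0 * math.pi)) / (x * x + 0.25 * gamma * gamma)

def cascade_line(delta, gamma1, gamma2, cutoff=200.0, n_steps=200000):
    """Integrate over x = omega' - omega_20 the product of the two
    resonance factors of (62.12), each written as a normalized curve.

    delta = omega - omega_12, so that omega + omega' - omega_10 = delta + x.
    The finite cutoff replaces the infinite range (assumption: the
    integrand falls off as 1/x^4, so the neglected tails are tiny).
    """
    h = 2.0 * cutoff / n_steps
    total = 0.0
    for k in range(n_steps + 1):
        x = -cutoff + k * h
        weight = 0.5 if k in (0, n_steps) else 1.0
        total += weight * lorentz(x, gamma2) * lorentz(delta + x, gamma1)
    return total * h
```

With these normalized factors, (62.12) reads $dw = (\Gamma_{1\to 2}\Gamma_{2\to 0}/\Gamma_1\Gamma_2)\,L_{\Gamma_2}(x)\,L_{\Gamma_1}(\delta + x)\,d\omega\,d\omega'$ (the symbol $L_\Gamma$ for the normalized curve is notation introduced here), so `cascade_line` should reproduce the line shape up to the overall factor $\Gamma_{1\to 2}\Gamma_{2\to 0}/\Gamma_1\Gamma_2$.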
The integral is most simply calculated by closing the contour of integration with an infinite semicircle in the upper half of the complex $\omega'$-plane, and is given by the sum of the residues of the integrand at the poles

$$ \omega' = \omega_{10} - \omega + \tfrac{1}{2}i\Gamma_1, \qquad \omega' = \omega_{20} + \tfrac{1}{2}i\Gamma_2. $$

The result is

$$ dw = w_t\, \frac{\Gamma_1 + \Gamma_2}{2\pi}\, \frac{d\omega}{(\omega - \omega_{12})^2 + \tfrac{1}{4}(\Gamma_1 + \Gamma_2)^2}, $$ (62.13)

where $w_t = \Gamma_{1\to 2}\Gamma_{2\to 0}/\Gamma_1\Gamma_2$ is the total probability of the double transition $1 \to 2 \to 0$.† The line shape (62.13) differs from (62.10) only in that $\Gamma_1$ is replaced by $\Gamma_1 + \Gamma_2$: the line width is equal to the sum of the widths of the initial and final states. The line width is not, in general, equal to the probability $\Gamma_{1\to 2}$ of the transition $1 \to 2$ itself, i.e. is not proportional to the line intensity as in the classical theory. Since $\Gamma_1 + \Gamma_2 > \Gamma_{1\to 2}$, the line can have a large width with a relatively small intensity.

§63. Resonance fluorescence

The allowance for the finite width of the levels in problems of radiation scattering is important when the frequency $\omega$ of the incident radiation is close to one of the "intermediate" frequencies $\omega_{n1}$ or $\omega_{2n}$; this is called resonance fluorescence (V. Weisskopf, 1931).

Let us consider Rayleigh scattering by a system (an atom, say) in the ground state, so that the initial and final levels are the same and are strictly discrete. Let the frequency of the radiation be close to a certain frequency $\omega_{n1}$, where the level $n$ is an excited level and is therefore quasi-discrete.

This problem could be solved by the method shown in §62, but there is no need to do so, since it is exactly analogous to the problem of non-relativistic resonance scattering at a quasi-discrete level (QM, §134). According to the results derived there, the scattering amplitude must contain a pole factor

$$ \frac{1}{\omega - (E_n - \tfrac{1}{2}i\Gamma_n - E_1)}. $$

When $|\omega - \omega_{n1}| \gg \Gamma_n$, on the other hand, the result must tend to the non-resonance formula (59.5).
It is therefore clear that the required scattering cross-section is obtained by simply replacing $E_n$ by $E_n - \tfrac{1}{2}i\Gamma_n$ in (59.5); the sum over $n$ can be restricted to the resonance terms:

$$ d\sigma = \left| \sum_{M_n} \frac{(\mathbf{d}_{2n}\cdot\mathbf{e}'^*)(\mathbf{d}_{n1}\cdot\mathbf{e})}{\omega_{n1} - \omega - \tfrac{1}{2}i\Gamma_n} \right|^2 \omega^4\,do'. $$ (63.1)

† In more complex cases, $w_t$ is the total probability of all cascades which begin with the transition $1 \to 2$ and finish at the level 0.

The summation is over all states (having different angular-momentum components $M_n$) corresponding to the resonance level $E_n$; the states 1 and 2 belong to the same level (the ground level), but may differ in the values $M_1$ and $M_2$.

The cross-section (63.1) has its maximum value when $\omega = \omega_{n1}$, and this value is, in order of magnitude, $\sigma_{\max} \sim \omega^4 d^4/\Gamma^2$. Since the probability of the spontaneous transition $n \to 1$, and hence the width $\Gamma_n$, is $\sim \omega^3 d^2$, this value is

$$ \sigma_{\max} \sim 1/\omega^2 \sim \lambda^2, $$ (63.2)

of the order of the square of the wavelength and independent of the fine structure constant, as compared with typical values $\sim r_e^2$ outside the resonance region.

It must be emphasized that, since the atom is at a strictly discrete level (the ground level) before and after the scattering, the frequencies of the primary and secondary photons are exactly the same. If the incident radiation is monochromatic, the scattered line will therefore be monochromatic also. If the incident radiation has a spectral intensity distribution $I(\omega)$ which varies only slightly over the width $\Gamma_n$, the intensity of scattered radiation will be proportional to

$$ \frac{I(\omega_{n1})\,d\omega}{(\omega - \omega_{n1})^2 + \tfrac{1}{4}\Gamma_n^2}. $$ (63.3)

Thus the shape of the scattered line will be the same as the natural shape for spontaneous emission from the level $E_n$.

The cross-section (63.1) corresponds to the scattering tensor

$$ (c_{ik})_{21} = \sum_{M_n} \frac{(d_i)_{2n}(d_k)_{n1}}{\omega_{n1} - \omega - \tfrac{1}{2}i\Gamma_n}. $$ (63.4)

In particular, the polarizability tensor is

$$ \alpha_{ik} = (c_{ik})_{11} = \sum_{M_n} \frac{(d_i)_{1n}(d_k)_{n1}}{\omega_{n1} - \omega - \tfrac{1}{2}i\Gamma_n}. $$
(63.5)

It can be seen immediately that the addition of an imaginary part to the energy levels of the intermediate excited states makes the polarizability tensor no longer Hermitian, even at frequencies below the ionization threshold. It contains an imaginary part which is directly related to the absorption of radiation. After absorbing a photon, the atom will sooner or later return to the ground state, emitting one or more photons. The absorption cross-section, viewed in this way, is just the total cross-section $\sigma_t$ for all possible scattering processes.† On the other hand, according to the optical theorem (59.25), the cross-section can be expressed in terms of the anti-Hermitian part of the polarizability tensor. Substituting in (59.25) the tensor $\alpha_{ik}$ from (63.5), we find the following formula for the cross-section for absorption of a photon with frequency $\omega$ close to $\omega_{n1}$:

$$ \sigma_a = 4\pi^2\omega \sum_{M_n} |\mathbf{d}_{n1}\cdot\mathbf{e}|^2\, \frac{\Gamma_n}{2\pi}\, \frac{1}{(\omega - \omega_{n1})^2 + \tfrac{1}{4}\Gamma_n^2}. $$ (63.6)

In the limit as $\Gamma_n \to 0$, the last factor (together with $\Gamma_n/2\pi$) tends to the delta function $\delta(\omega - \omega_{n1})$, in accordance with the fact that in this case only a photon having one particular frequency can be absorbed.

Let radiation with a spectral and angular energy flux density $I_{\mathbf{k}\mathbf{e}}$ (cf. (44.7)) be incident on the atom. Then the flux density of number of photons is $(I_{\mathbf{k}\mathbf{e}}/\omega)\,d\omega\,do$, and the probability of absorption is

$$ dw_a = \sigma_a\,(I_{\mathbf{k}\mathbf{e}}/\omega)\,d\omega\,do. $$ (63.7)

If the function $I_{\mathbf{k}\mathbf{e}}(\omega)$ varies only slightly over the width $\Gamma_n$, then we have after the integration over frequencies

$$ dw_a = 4\pi^2 \sum_{M_n} |\mathbf{d}_{n1}\cdot\mathbf{e}|^2\, I_{\mathbf{k}\mathbf{e}}(\omega_{n1})\,do. $$

According to (45.5),

$$ dw_{sp} = \frac{\omega_{n1}^3}{2\pi}\,|\mathbf{d}_{1n}\cdot\mathbf{e}^*|^2\,do = \frac{\omega_{n1}^3}{2\pi}\,|\mathbf{d}_{n1}\cdot\mathbf{e}|^2\,do $$

is the probability of spontaneous emission of a photon having the frequency $\omega_{n1}$; thus we return to formula (44.9).

† This discussion, it must be emphasized, refers to absorption by a system in its stable ground state. The problem would have to be stated differently for an excited state, because of the finite duration of the experiment.
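The step from (63.7) to the frequency-integrated absorption probability uses only the fact that the resonance factor in (63.6) acts like $\delta(\omega - \omega_{n1})$ when the incident spectrum is smooth over the width $\Gamma_n$. A minimal numerical sketch of this step follows; the Gaussian test spectrum, the numerical values and the function names are assumptions chosen purely for illustration.

```python
import math

def resonance_factor(omega, omega_n1, gamma):
    """(Gamma_n / 2pi) / ((omega - omega_n1)^2 + Gamma_n^2 / 4), cf. (63.6)."""
    return (gamma / (2.0 * math.pi)) / ((omega - omega_n1) ** 2 + 0.25 * gamma ** 2)

def smooth_spectrum(omega, centre=10.0, spread=2.0):
    """Hypothetical slowly varying incident spectrum I(omega)."""
    return math.exp(-((omega - centre) / spread) ** 2)

def weighted_spectrum(omega_n1, gamma, half_range=50.0, n_steps=200000):
    """Trapezoidal estimate of the integral of I(omega) times the
    resonance factor over a range centred on omega_n1."""
    h = 2.0 * half_range / n_steps
    total = 0.0
    for k in range(n_steps + 1):
        omega = omega_n1 - half_range + k * h
        weight = 0.5 if k in (0, n_steps) else 1.0
        total += weight * smooth_spectrum(omega) * resonance_factor(omega, omega_n1, gamma)
    return total * h
```

As the width is reduced, `weighted_spectrum(omega_n1, gamma)` should approach `smooth_spectrum(omega_n1)`, with a deviation that shrinks roughly in proportion to the width.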
CHAPTER VII

THE SCATTERING MATRIX

§64. The scattering amplitude

The general problem concerning collisions is to find, for a given initial state of the system (an assembly of free particles), the probabilities of various possible final states (other assemblies of free particles). If $|i\rangle$ denotes the initial state, the result of the collision can be represented as the superposition

$$ \sum_f |f\rangle\langle f|S|i\rangle, $$ (64.1)

in which the summation is taken over the various possible final states $|f\rangle$. The coefficients in this expansion $\langle f|S|i\rangle$ (or, more concisely, $S_{fi}$), form the scattering matrix or S-matrix. The squares $|S_{fi}|^2$ give the probabilities of transitions to particular states $|f\rangle$.

If there were no interaction between the particles, the state of the system would be unchanged, corresponding to a unit S-matrix (absence of scattering). It is convenient to separate this unit matrix in all cases, writing the scattering matrix in the form

$$ S_{fi} = \delta_{fi} + i(2\pi)^4\,\delta^{(4)}(P_f - P_i)\,T_{fi}, $$ (64.2)

where $T_{fi}$ is another matrix. In the second term we have written separately the four-dimensional delta function which expresses the law of conservation of the 4-momentum ($P_i$ and $P_f$ being the sums of the 4-momenta of all the particles in the initial and final states); the other factors are included for subsequent convenience. In the non-diagonal matrix elements, the first term in (64.2) does not appear, and so, for the transition $i \to f$, the elements of the matrices $S$ and $T$ are related by

$$ S_{fi} = i(2\pi)^4\,\delta^{(4)}(P_f - P_i)\,T_{fi}. $$ (64.3)

The matrix elements $T_{fi}$ which remain after separation of the delta function will be called the scattering amplitudes.

When the moduli $|S_{fi}|$ are squared, the square of the delta function appears, and is to be interpreted as follows. The delta function comes from the integral

$$ \delta^{(4)}(P_f - P_i) = \frac{1}{(2\pi)^4} \int e^{i(P_f - P_i)x}\,d^4x. $$
(64.4)

If another such integral is calculated with $P_f = P_i$ (since one delta function is already present), and if the integration is taken over some large but finite volume $V$ and a time interval $t$, the result is $Vt/(2\pi)^4$.† Thus we can write

$$ |S_{fi}|^2 = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|T_{fi}|^2\,Vt. $$

Dividing by $t$, we obtain the transition probability per unit time:

$$ w_{i\to f} = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|T_{fi}|^2\,V. $$ (64.5)

Each of the free particles, initial and final, is described by its own wave function: a plane wave having some amplitude $u$ (a bispinor for an electron, a 4-vector for a photon, and so on). The structure of the scattering amplitude $T_{fi}$ is of the form

$$ T_{fi} = u_1'^{*} u_2'^{*} \ldots Q\, u_1 u_2 \ldots, $$ (64.6)

where on the left we have the amplitudes of wave functions of final particles, and on the right those of initial particles; $Q$ is some matrix relating to the indices of the wave amplitude components of all the particles.

The most important cases are those where the initial state comprises only one or two particles. Then we have respectively the decay of one particle or the collision of two particles.

Let us first consider the decay of a particle into any number of other particles having momenta $\mathbf{p}'_a$ in an element $\prod_a d^3p'_a$ of momentum space; the suffix $a$ labels the particles in the final state, so that $\sum_a p'_a = P_f$. The number of states in this element and in the normalization volume‡ $V$ is

$$ \prod_a \frac{V\,d^3p'_a}{(2\pi)^3}. $$

The expression (64.5) must be multiplied by this quantity:

$$ dw = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|T_{fi}|^2\,V \prod_a \frac{V\,d^3p'_a}{(2\pi)^3}. $$ (64.7)

The wave functions used in calculating the matrix element must be normalized to "one particle in the volume $V$". For an electron, e.g., the wave function is the plane wave (23.1); for a particle with spin one it is (14.12); for a photon it is (4.3). All these functions include the factor $1/\sqrt{2\varepsilon V}$, where $\varepsilon$ is the energy of the particle.
Henceforth, however, it will be convenient to omit such factors in the wave functions, and include them in the expression for the probability. Thus the electron plane wave will be

$$ \psi = u\,e^{-ipx}, \qquad \bar{u}u = 2m, $$ (64.8)

and the photon wave

$$ A = \sqrt{4\pi}\,e\,e^{-ikx}, \qquad ee^* = -1, \quad ek = 0. $$ (64.9)

† This can be shown in a different way by first calculating the integral over each coordinate in (64.4) for a finite range and then making the limits tend to infinity by means of QM, (42.4):

$$ \lim_{\xi\to\infty} \frac{\sin^2 \alpha\xi}{\xi\alpha^2} = \pi\,\delta(\alpha). $$

‡ For greater clarity, in the calculations in this section, we shall not take $V$ to be unity.

The scattering amplitude calculated with these functions will be denoted by $M_{fi}$ to distinguish it from $T_{fi}$. Evidently

$$ T_{fi} = \frac{M_{fi}}{(2\varepsilon_1 V \ldots 2\varepsilon'_1 V \ldots)^{1/2}}; $$ (64.10)

the denominator contains one factor $\sqrt{2\varepsilon V}$ for each initial or final particle. In particular, the decay probability is, instead of (64.7),

$$ dw = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|M_{fi}|^2\, \frac{1}{2\varepsilon} \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}, $$ (64.11)

where $\varepsilon$ is the energy of the decaying particle; as we should expect, the normalization volume does not appear in this formula.†

† If the final particles include $N$ which are identical, a factor $1/N!$ must be inserted when integrating over their momenta to obtain the total probability; this factor takes into account the identity of states which differ only by an interchange of the particles.

Formula (64.11) can be given a more definite form by eliminating the delta functions, if the decay produces two particles (with momenta $\mathbf{p}'_1$, $\mathbf{p}'_2$ and energies $\varepsilon'_1$, $\varepsilon'_2$). In the rest frame of the decaying particle $\mathbf{p}'_1 = -\mathbf{p}'_2 \equiv \mathbf{p}'$, $\varepsilon'_1 + \varepsilon'_2 = m$. We have

$$ dw = \frac{1}{(2\pi)^2}\,|M_{fi}|^2\, \frac{1}{2m}\, \frac{1}{4\varepsilon'_1\varepsilon'_2}\, \delta(\mathbf{p}'_1 + \mathbf{p}'_2)\,\delta(\varepsilon'_1 + \varepsilon'_2 - m)\,d^3p'_1\,d^3p'_2. $$

The first delta function is eliminated by integration over $d^3p'_2$; the differential $d^3p'_1$ is written as

$$ d^3p' = p'^2\,d|\mathbf{p}'|\,do' = |\mathbf{p}'|\,do'\, \frac{\varepsilon'_1\varepsilon'_2\,d(\varepsilon'_1 + \varepsilon'_2)}{\varepsilon'_1 + \varepsilon'_2}. $$ (64.12)

The validity of this is easily seen by noting that $\varepsilon_1'^2 - m_1'^2 = \varepsilon_2'^2 - m_2'^2 = p'^2$. The integration over $\varepsilon'_1 + \varepsilon'_2$ eliminates the second delta function, and the result is

$$ dw = \frac{1}{32\pi^2 m^2}\,|M_{fi}|^2\,|\mathbf{p}'|\,do'. $$ (64.13)

Let us now consider a collision of two particles (having momenta $\mathbf{p}_1$ and $\mathbf{p}_2$ and energies $\varepsilon_1$ and $\varepsilon_2$), in which they are transformed into any number of particles having momenta $\mathbf{p}'_a$. Instead of (64.11) we now have

$$ dw = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|M_{fi}|^2\, \frac{1}{4\varepsilon_1\varepsilon_2 V} \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}. $$

The quantity that is of interest in this case is, however, not the probability but the cross-section $d\sigma$. The cross-section invariant under the Lorentz transformations is obtained from $dw$ on dividing by

$$ j = I/V\varepsilon_1\varepsilon_2, $$ (64.14)

where $I$ denotes the 4-scalar

$$ I = \sqrt{(p_1p_2)^2 - m_1^2 m_2^2}; $$ (64.15)

see Fields, §12.† In the centre-of-mass system ($\mathbf{p}_1 = -\mathbf{p}_2 \equiv \mathbf{p}$)

$$ I = |\mathbf{p}|(\varepsilon_1 + \varepsilon_2), $$ (64.16)

so that

$$ j = \frac{|\mathbf{p}|}{V}\left(\frac{1}{\varepsilon_1} + \frac{1}{\varepsilon_2}\right) = \frac{v_1 + v_2}{V}, $$ (64.17)

which is the same as the usual definition of the flux density of colliding particles, $v_1$ and $v_2$ being their velocities.‡ Thus the cross-section is

$$ d\sigma = (2\pi)^4\,\delta^{(4)}(P_f - P_i)\,|M_{fi}|^2\, \frac{1}{4I} \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}. $$ (64.18)

† For future reference, another form of $I$ is

$$ I^2 = \tfrac{1}{4}[s - (m_1 + m_2)^2][s - (m_1 - m_2)^2], $$ (64.15a)

where $s = (p_1 + p_2)^2$.

‡ In an arbitrary frame of reference,

$$ j = \frac{1}{V}\sqrt{(\mathbf{v}_1 - \mathbf{v}_2)^2 - (\mathbf{v}_1 \times \mathbf{v}_2)^2}. $$

This expression is the same as the ordinary flux density whenever $\mathbf{v}_1$ is parallel to $\mathbf{v}_2$; then $j = |\mathbf{v}_1 - \mathbf{v}_2|/V$.

This formula can be put into its final form by eliminating the delta function for the case where in the final state also there are only two particles. Let us consider the process in the centre-of-mass system, and let $\varepsilon = \varepsilon_1 + \varepsilon_2 = \varepsilon'_1 + \varepsilon'_2$ be the total energy; $\mathbf{p}_1 = -\mathbf{p}_2 \equiv \mathbf{p}$ and $\mathbf{p}'_1 = -\mathbf{p}'_2 \equiv \mathbf{p}'$ be the initial and final momenta. The delta function is eliminated in the same way as in the derivation of (64.13), and the result is

$$ d\sigma = \frac{1}{64\pi^2}\,|M_{fi}|^2\, \frac{|\mathbf{p}'|}{|\mathbf{p}|\,\varepsilon^2}\,do'; $$ (64.19)

in the particular case of elastic scattering, where the nature of the particles is unchanged in the collision, $|\mathbf{p}'| = |\mathbf{p}|$.
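The relations (64.15), (64.15a) and (64.16) for the invariant $I$ are easy to check numerically. The sketch below builds two on-shell 4-momenta in the centre-of-mass system and evaluates $I$ in the three ways, and once more after a boost along the $z$ axis. The helper names and the numerical values used to exercise them are arbitrary illustrative assumptions, not taken from the text.

```python
import math

def minkowski_dot(a, b):
    """4-product with metric (+, -, -, -); a and b are (E, px, py, pz)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def four_momentum(mass, p):
    """On-shell 4-momentum for the 3-momentum p = (px, py, pz)."""
    energy = math.sqrt(mass ** 2 + sum(c * c for c in p))
    return (energy, p[0], p[1], p[2])

def flux_invariant(p1, p2, m1, m2):
    """I = sqrt((p1 p2)^2 - m1^2 m2^2), eq. (64.15)."""
    return math.sqrt(minkowski_dot(p1, p2) ** 2 - (m1 * m2) ** 2)

def flux_invariant_from_s(s, m1, m2):
    """I from (64.15a): I^2 = [s - (m1 + m2)^2][s - (m1 - m2)^2] / 4."""
    return 0.5 * math.sqrt((s - (m1 + m2) ** 2) * (s - (m1 - m2) ** 2))

def boost_z(p, beta):
    """Lorentz boost of a 4-vector along z with velocity beta (|beta| < 1)."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    return (g * (p[0] + beta * p[3]), p[1], p[2], g * (p[3] + beta * p[0]))
```

For momenta $\mathbf{p}$ and $-\mathbf{p}$, `flux_invariant` should coincide with $|\mathbf{p}|(\varepsilon_1 + \varepsilon_2)$ and with `flux_invariant_from_s`, and should be unchanged when both 4-momenta are passed through `boost_z`.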
This formula can be written in yet another form by using the invariant quantity

$$ t \equiv (p_1 - p'_1)^2 = m_1^2 + m_1'^2 - 2(p_1p'_1) = m_1^2 + m_1'^2 - 2\varepsilon_1\varepsilon'_1 + 2|\mathbf{p}_1||\mathbf{p}'_1|\cos\theta, $$ (64.20)

where $\theta$ is the angle between $\mathbf{p}_1$ and $\mathbf{p}'_1$. In the centre-of-mass system the momenta $|\mathbf{p}_1| \equiv |\mathbf{p}|$ and $|\mathbf{p}'_1| \equiv |\mathbf{p}'|$ are determined only by the total energy $\varepsilon$, and when $\varepsilon$ is given we have

$$ dt = 2|\mathbf{p}||\mathbf{p}'|\,d\cos\theta. $$ (64.21)

Hence, in (64.19),

$$ do' = -d\varphi\,d\cos\theta = \frac{d\varphi\,d(-t)}{2|\mathbf{p}||\mathbf{p}'|}, $$

where $\varphi$ is the azimuth of $\mathbf{p}'_1$ relative to $\mathbf{p}_1$.† Thus

$$ d\sigma = \frac{1}{64\pi}\,|M_{fi}|^2\, \frac{dt}{I^2}\, \frac{d\varphi}{2\pi}, $$ (64.22)

where $I$ is again the invariant (64.16). The azimuth $\varphi$, and therefore the cross-section in the form (64.22), are invariant under those Lorentz transformations which do not change the direction of relative motion of the particles. If the cross-section is independent of the azimuth, formula (64.22) takes the particularly simple form

$$ d\sigma = \frac{1}{64\pi}\,|M_{fi}|^2\, \frac{dt}{I^2}. $$ (64.23)

† Since the correct sign of the differential in such cases is obvious, we shall henceforward write simply $dt$ for $d(-t)$, and so on.

If one of the colliding particles is sufficiently heavy (and its state is unaltered by the collision), it acts only as a fixed source of a constant field in which the other particle is scattered. Since the energy (though not the momentum) of the system is conserved in a constant field, in this treatment of the collision process we can write the S-matrix elements in the form

$$ S_{fi} = i\cdot 2\pi\,\delta(E_f - E_i)\,T_{fi}. $$ (64.24)

In the expression $|S_{fi}|^2$, the square of the one-dimensional delta function must be interpreted as

$$ [\delta(E_f - E_i)]^2 \to \frac{1}{2\pi}\,\delta(E_f - E_i)\,t. $$

Now, as in the derivation of (64.11), we change to the amplitude $M_{fi}$ instead of $T_{fi}$, and obtain the following expression for the probability of a process in which one particle is scattered in a constant field and produces in the final state a certain number of other particles:

$$ dw = 2\pi\,\delta(E_f - E_i)\,|M_{fi}|^2\, \frac{1}{2\varepsilon V} \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}. $$

Here $\varepsilon$ ($= E_i$) is again the energy of the initial particle, $\mathbf{p}'_a$ and $\varepsilon'_a$ the momenta and energies of the final particles.
The scattering cross-section is found by dividing $dw$ by the flux density $j = v/V$, where $v = |\mathbf{p}|/\varepsilon$ is the velocity of the particle that undergoes scattering. The normalization volume again disappears, and the result is

$$ d\sigma = 2\pi\,\delta(E_f - E_i)\,|M_{fi}|^2\, \frac{1}{2|\mathbf{p}|} \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}. $$ (64.25)

In the particular case of elastic scattering, there is only one particle in the final state, with the same energy and the same momentum (in absolute value). Writing

$$ d^3p' = p'^2\,d|\mathbf{p}'|\,do' = |\mathbf{p}'|\,\varepsilon'\,d\varepsilon'\,do' $$

and eliminating $\delta(\varepsilon' - \varepsilon)$ by integrating with respect to $\varepsilon'$, we find the cross-section in the form

$$ d\sigma = \frac{1}{16\pi^2}\,|M_{fi}|^2\,do'. $$ (64.26)

Finally, if the external field is time-dependent, such as the field of a system of particles executing a given motion, the S-matrix also lacks the delta function of energy. Then $S_{fi} = iT_{fi}$ and, after the change from $T_{fi}$ to $M_{fi}$ by (64.10), the probability of (e.g.) a process in which the field creates a given set of particles is

$$ dw = |M_{fi}|^2 \prod_a \frac{d^3p'_a}{(2\pi)^3\,2\varepsilon'_a}. $$ (64.27)

§65. Reactions involving polarized particles

In this section we shall show by means of simple examples how the state of polarization of the particles concerned in the reaction is taken into account when calculating the scattering cross-section.

Let there be one electron in the initial state and one in the final state. Then the form of the scattering amplitude is

$$ M_{fi} = \bar{u}'Au \quad (\equiv \bar{u}'_i A_{ik} u_k), $$ (65.1)

where $u$ and $u'$ are the bispinor amplitudes of the initial and final electrons, and $A$ is some matrix, which depends on the momenta and polarizations of the other particles (if any) which take part in the reaction. The scattering cross-section is proportional to $|M_{fi}|^2$, and

$$ (\bar{u}'Au)^* = u'\gamma^{0*}A^*u^* = u^*A^+\gamma^{0+}u' = \bar{u}\bar{A}u', $$ (65.2)

where† $\bar{A} = \gamma^0 A^+ \gamma^0$. Thus

$$ |M_{fi}|^2 = (\bar{u}'Au)(\bar{u}\bar{A}u') \equiv u'_i\bar{u}'_k\,A_{kl}\,u_l\bar{u}_m\,\bar{A}_{mi}. $$
(65.3)

† Since the matrix $\bar{A}$ has to be constructed, we shall note here, for future reference, the following easily verified equations:

$$ \bar{\gamma}^\mu = \gamma^\mu, \qquad \overline{\gamma^\mu\gamma^\nu\ldots\gamma^\rho} = \gamma^\rho\ldots\gamma^\nu\gamma^\mu, \qquad \bar{\gamma}^5 = -\gamma^5, \qquad \overline{\gamma^\mu\gamma^5} = \gamma^\mu\gamma^5. $$ (65.2a)

If the initial electron is in a mixed (partially polarized) state with density matrix $\rho$, and if we wish to find the cross-section for a process in which the final electron is in a specified polarization state $\rho'$, the products of the bispinor amplitude components must be changed as follows:

$$ u'_i\bar{u}'_k \to \rho'_{ik}, \qquad u_l\bar{u}_m \to \rho_{lm}. $$

Then

$$ |M_{fi}|^2 = \mathrm{tr}\,(\rho' A \rho \bar{A}). $$ (65.4)

The density matrices are given by formula (29.13):

$$ \rho = \tfrac{1}{2}(\gamma p + m)[1 - \gamma^5(\gamma a)] $$ (65.5)

and similarly for $\rho'$. If the initial electron is unpolarized, then

$$ \rho = \tfrac{1}{2}(\gamma p + m). $$ (65.6)

Substituting this expression is equivalent to averaging over the polarizations of the electron. If it is desired to determine the cross-section for scattering with any polarization of the final electron, we must also put $\rho' = \tfrac{1}{2}(\gamma p' + m)$, and double the result; this operation is equivalent to summation over the polarizations of the electron. Thus we have

$$ \tfrac{1}{2} \sum_{\text{polar}} |M_{fi}|^2 = \tfrac{1}{2}\,\mathrm{tr}\,\{(\gamma p' + m)A(\gamma p + m)\bar{A}\}, $$ (65.7)

where the sum is taken over initial and final polarizations, and the factor $\tfrac{1}{2}$ converts one summation into an averaging.

The density matrix $\rho'$ in (65.4) is a secondary quantity which essentially represents the properties of the detector as selecting one or the other polarization of the final electron, not the properties of the scattering process as such. There is the question of the polarization state of the electron resulting from the scattering process itself. If $\rho^{(f)}$ is the density matrix of this state, then the probability of detecting an electron in the state $\rho'$ is obtained by projecting $\rho^{(f)}$ on $\rho'$, i.e. by taking the trace $\mathrm{tr}\,(\rho^{(f)}\rho')$. This will be proportional to the corresponding cross-section, i.e. to $|M_{fi}|^2$. A comparison with (65.4) shows that

$$ \rho^{(f)} \sim A\rho\bar{A}. $$
(65.8)

Since we know that $\rho^{(f)}$ must have the form (65.5) with some 4-vector $a^{(f)}$, we need only determine the latter. This could be done by means of formula (29.14), but it is even simpler to proceed as follows.

We have seen in §29 that the components of the 4-vector $a$ can be expressed in terms of those of the 3-vector $\boldsymbol{\zeta}$ which is (twice) the mean value of the electron spin in its rest frame. The polarization states of the electrons are entirely determined by these vectors, and it is convenient to express the scattering cross-section also in terms of them. The square $|M_{fi}|^2$ will clearly be linear in each of the vectors $\boldsymbol{\zeta}$ and $\boldsymbol{\zeta}'$ which relate to the initial and final electrons, and its form as a function of $\boldsymbol{\zeta}'$ will be

$$ |M_{fi}|^2 = \alpha + \boldsymbol{\beta}\cdot\boldsymbol{\zeta}', $$ (65.9)

where $\alpha$ and $\boldsymbol{\beta}$ are themselves linear functions of $\boldsymbol{\zeta}$. The vector $\boldsymbol{\zeta}'$ in (65.9) is the particular polarization of the final electron that is selected by the detector. The vector $\boldsymbol{\zeta}^{(f)}$, corresponding to the density matrix $\rho^{(f)}$, is easily found as follows. According to the above argument,

$$ |M_{fi}|^2 \sim \mathrm{tr}\,(\rho'\rho^{(f)}). $$

Since this quantity is relativistically invariant, it may be calculated in any frame of reference. In the rest frame of the final electron we have, by (29.20),

$$ \rho'\rho^{(f)} \sim (1 + \boldsymbol{\sigma}\cdot\boldsymbol{\zeta}')(1 + \boldsymbol{\sigma}\cdot\boldsymbol{\zeta}^{(f)}). $$

Hence

$$ |M_{fi}|^2 \sim 1 + \boldsymbol{\zeta}'\cdot\boldsymbol{\zeta}^{(f)}, $$

and from a comparison with (65.9)

$$ \boldsymbol{\zeta}^{(f)} = \boldsymbol{\beta}/\alpha. $$ (65.10)

Thus the calculation of the cross-section as a function of the parameter $\boldsymbol{\zeta}'$ also gives the polarization $\boldsymbol{\zeta}^{(f)}$.

In more complex cases, when there is more than one initial or final electron, the calculations are similar to the foregoing. For instance, if there are two electrons both initially and finally, the form of the scattering amplitude is

$$ M_{fi} = (\bar{u}'_1 A u_1)(\bar{u}'_2 B u_2) + (\bar{u}'_2 C u_1)(\bar{u}'_1 D u_2), $$

where $u_1$, $u_2$ are the bispinor amplitudes of the initial electrons, and $u'_1$, $u'_2$ those of the final electrons. The square $|M_{fi}|^2$ includes terms of the forms

$$ |\bar{u}'_1 A u_1|^2\,|\bar{u}'_2 B u_2|^2 \quad \text{and} \quad (\bar{u}'_1 A u_1)(\bar{u}'_2 B u_2)(\bar{u}'_2 C u_1)^*(\bar{u}'_1 D u_2)^*. $$
The former reduce to products of two traces like (65.4); the latter reduce to traces having the form $\mathrm{tr}\,(\rho'_1 A \rho_1 \bar{C} \rho'_2 B \rho_2 \bar{D})$.

Positrons are described by amplitudes with "negative frequency" $u(-p)$. For reactions involving positrons, the only difference from the preceding analysis is that the expressions to be used for the density matrices differ from (65.5), (65.6) as regards the sign of $m$; cf. (29.16), (29.17).

Let us now consider the polarization states of photons participating in the reaction. The polarization of each initial photon appears linearly in the scattering amplitude in the form of a 4-vector $e$, and that of each final photon as $e^*$. In each case the 4-tensor $e_\mu e^*_\nu$ occurs in the cross-section (i.e. in the square $|M_{fi}|^2$). To obtain an arbitrary partially polarized state, this tensor must be replaced by the four-dimensional density matrix, the 4-tensor $\rho_{\mu\nu}$:

$$ e_\mu e^*_\nu \to \rho_{\mu\nu}. $$ (65.11)

In particular, for an unpolarized photon, according to (8.15),

$$ \rho_{\mu\nu} = -\tfrac{1}{2}g_{\mu\nu}. $$ (65.12)

Thus averaging over polarizations of the photon is equivalent to contracting in $|M_{fi}|^2$ with respect to the corresponding two tensor indices $\mu$, $\nu$.†

† The expression (65.12) as it were reduces the averaging over the two actually possible polarizations of the photon to one over the four independent directions of the four-vector $e$.

If summation over the photon polarizations is desired, not averaging, then we must replace $e_\mu e^*_\nu$ by a quantity twice as large:

$$ e_\mu e^*_\nu \to -g_{\mu\nu}. $$ (65.13)

The density matrix of the polarized photon is given by formula (8.17). The choice of the 4-vectors $e^{(1)}$, $e^{(2)}$ which appear in this expression is usually governed by the particular conditions of the problem. In some cases they may be related to certain spatial directions in a given frame of reference; in other cases, it is more convenient to relate them to the 4-vectors which characterize the problem, namely the 4-momenta of the particles.
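The replacement (65.13) can be illustrated in a simple special case. Assume, for this sketch only, a photon moving along the $z$ axis, $k = (\omega, 0, 0, \omega)$, and an amplitude of the form $M = e^*_\mu J^\mu$, where $J$ is a hypothetical 4-vector standing for the rest of the amplitude and satisfying the transversality condition $k_\mu J^\mu = 0$. That condition is not stated in the present section and is introduced here as an assumption. Summing $|M|^2$ over the two transverse linear polarizations then gives the same number as the contraction $-g_{\mu\nu}J^\mu J^{\nu *}$.

```python
def minkowski_dot(a, b):
    """a_mu b^mu with metric (+, -, -, -); no complex conjugation is applied."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def sum_over_transverse(J):
    """Sum of |e . J|^2 over the real polarization vectors e along x and y
    (for real e, e* = e), appropriate to a photon moving along z."""
    e_x = (0.0, 1.0, 0.0, 0.0)
    e_y = (0.0, 0.0, 1.0, 0.0)
    return sum(abs(minkowski_dot(e, J)) ** 2 for e in (e_x, e_y))

def covariant_sum(J):
    """-g_{mu nu} J^mu J^nu*, i.e. the result of the replacement (65.13)."""
    J_conj = tuple(complex(c).conjugate() for c in J)
    return -minkowski_dot(J, J_conj).real
```

For a vector $J$ with $J^0 = J^3$ (which ensures $k_\mu J^\mu = 0$ for the photon above), the time and longitudinal contributions to `covariant_sum` cancel, leaving only the two transverse terms.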
In (8.17) the polarization of the photon is described by the Stokes parameters, which form the "vector" $\boldsymbol{\xi} = (\xi_1, \xi_2, \xi_3)$. As with the electron, it is necessary to distinguish the polarization $\boldsymbol{\xi}^{(f)}$ of the final photon as such from the polarization $\boldsymbol{\xi}'$ that is selected by the detector. If the square of the scattering amplitude is known as a function of the parameter $\boldsymbol{\xi}'$:

$$ |M_{fi}|^2 = \alpha + \boldsymbol{\beta}\cdot\boldsymbol{\xi}', $$

then the polarization $\boldsymbol{\xi}^{(f)} = \boldsymbol{\beta}/\alpha$, exactly as in (65.10).

§66. Kinematic invariants

Let us consider some kinematic relations for scattering processes in which there are only two particles, both in the initial state and in the final state. The relations in question are deduced from the general conservation laws alone, and are therefore valid for all particles and all laws of interaction.

The law of conservation of 4-momentum, in a general form that does not specify which are the initial and which the final particles, is

$$ q_1 + q_2 + q_3 + q_4 = 0. $$ (66.1)

Here $\pm q_a$ are the momentum 4-vectors; two of them pertain to the incident particles and two to the scattered particles, the momenta for the latter being $-q_a$. Thus for two of the $q_a$ the time component $q_a^0 > 0$, and for two $q_a^0 < 0$.

The law of charge conservation must be satisfied as well as that of 4-momentum conservation. Here the charge may be interpreted not only as the electric charge but as any other conserved quantity whose sign is opposite for particles and antiparticles.

For given types of particles concerned in the process, the squares of the 4-vectors $q_a$ are the squares of the particle masses, which are fixed ($q_a^2 = m_a^2$). Three different reactions occur, according to the values taken by the time components $q_a^0$ and the values of the charges. These reactions may be written

$$ \text{(I)}\ \ 1 + 2 \to 3 + 4, \qquad \text{(II)}\ \ 1 + \bar{3} \to \bar{2} + 4, \qquad \text{(III)}\ \ 1 + \bar{4} \to \bar{2} + 3. $$ (66.2)

Here the numbers refer to the particles, and the bar over a number denotes the corresponding antiparticle. The change from one reaction to another, i.e.
the transfer of a particle to the opposite side of the formula, corresponds to a change in sign of the corresponding time component $q_a^0$ and in the sign of the charge (i.e. a replacement of the particle by its antiparticle). The reactions inverse to (66.2) are also possible, of course.

The three processes (66.2) are referred to as three cross-channels of a single general reaction.

The following are some examples. If particles 1 and 3 are electrons, and 2 and 4 are photons, then channel I represents the scattering of a photon by an electron; channel III is the same as channel I, since the photon is strictly neutral. Channel II is the conversion of an electron-positron pair into two photons. If all four particles are electrons, then channel I is the scattering of an electron by an electron, and channels II and III the scattering of a positron by an electron. If particles 1 and 3 are electrons, and 2 and 4 are muons, then channel I is the scattering of $e$ by $\mu$, channel III the scattering of $e$ by $\bar{\mu}$, and channel II the conversion of a pair $e\bar{e}$ into a pair $\mu\bar{\mu}$.

In the discussion of scattering processes, the invariant quantities which can be constructed from the 4-momenta are particularly important. The invariant scattering amplitudes are functions of these quantities (§70).

Two independent invariants can be constructed from four 4-momenta, since, according to (66.1), only three of the 4-vectors $q_a$ are independent. Let these be $q_1$, $q_2$, $q_3$. From them, six invariants can be constructed: the three squares $q_1^2$, $q_2^2$, $q_3^2$ and the three products $q_1q_2$, $q_1q_3$, $q_2q_3$. But the first three are the given squares of the masses, and the second three satisfy one relation which follows from the equation†

$$ (q_1 + q_2 + q_3)^2 = q_4^2 = m_4^2. $$

In order to increase the symmetry it is, however, convenient to consider not two but three invariants, which may be taken as

$$ s = (q_1 + q_2)^2 = (q_3 + q_4)^2, \qquad t = (q_1 + q_3)^2 = (q_2 + q_4)^2, \qquad u = (q_1 + q_4)^2 = (q_2 + q_3)^2. $$ (66.3)
These are easily seen to be related by

$$ s + t + u = h, $$ (66.4)

where

$$ h = m_1^2 + m_2^2 + m_3^2 + m_4^2. $$ (66.5)

† In the general case of a reaction involving $n$ ($\geq 4$) particles, the number of functionally independent invariant quantities is $3n - 10$. There are altogether $4n$ quantities, the components of the $n$ 4-momenta $q_a$, between which there are $n$ functional relations $q_a^2 = m_a^2$ and four given by the conservation law $\sum q_a = 0$. Arbitrary values can be assigned to six quantities, in accordance with the number of parameters which define the general Lorentz transformation (a general four-dimensional rotation). The number of independent invariants is therefore $4n - n - 4 - 6 = 3n - 10$.

In the principal channel (I), the invariant $s$ has a simple physical significance. It is the square of the total energy of the colliding particles (1 and 2) in their centre-of-mass system (for $\mathbf{p}_1 + \mathbf{p}_2 = 0$, $s = (\varepsilon_1 + \varepsilon_2)^2$). In channel II, the invariant $t$ has a similar significance, and in channel III the invariant $u$. The three channels are therefore often called $s$, $t$ and $u$ channels.

It is easy to express each of the invariants $s$, $t$ and $u$ in terms of the energies and momenta of the colliding particles in each channel. Let us consider the $s$ channel. In the centre-of-mass system of particles 1 and 2, the time and space components of the 4-vectors $q_a$ are

$$ q_1 = p_1 = (\varepsilon_1, \mathbf{p}_s), \qquad q_2 = p_2 = (\varepsilon_2, -\mathbf{p}_s), \qquad q_3 = -p_3 = (-\varepsilon_3, -\mathbf{p}'_s), \qquad q_4 = -p_4 = (-\varepsilon_4, \mathbf{p}'_s); $$ (66.6)

the suffix $s$ in $\mathbf{p}_s$ and $\mathbf{p}'_s$ indicates that these momenta refer to the reaction in the $s$ channel. Then

$$ s = \varepsilon_s^2, \qquad \varepsilon_s = \varepsilon_1 + \varepsilon_2 = \varepsilon_3 + \varepsilon_4; $$ (66.7)

$$ 4s\,\mathbf{p}_s^2 = [s - (m_1 + m_2)^2][s - (m_1 - m_2)^2], \qquad 4s\,\mathbf{p}_s'^2 = [s - (m_3 + m_4)^2][s - (m_3 - m_4)^2]; $$ (66.8)

$$ 2t = h - s + 4\mathbf{p}_s\cdot\mathbf{p}'_s - \frac{1}{s}(m_1^2 - m_2^2)(m_3^2 - m_4^2), \qquad 2u = h - s - 4\mathbf{p}_s\cdot\mathbf{p}'_s + \frac{1}{s}(m_1^2 - m_2^2)(m_3^2 - m_4^2). $$ (66.9)

For elastic scattering ($m_1 = m_3$, $m_2 = m_4$), we have $|\mathbf{p}_s| = |\mathbf{p}'_s|$, and hence $\varepsilon_1 = \varepsilon_3$, $\varepsilon_2 = \varepsilon_4$.
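As a numerical cross-check of (66.4) and (66.7)-(66.9), the sketch below constructs an $s$-channel configuration in the centre-of-mass system from given masses, total energy and scattering angle, and evaluates $s$, $t$, $u$ directly from the 4-vectors (66.6). The helper names, and the masses, energy and angle supplied by the caller, are arbitrary assumptions made for illustration.

```python
import math

def minkowski_square(a):
    """Square of a 4-vector (E, px, py, pz) with metric (+, -, -, -)."""
    return a[0] ** 2 - a[1] ** 2 - a[2] ** 2 - a[3] ** 2

def add(a, b):
    """Component-wise sum of two 4-vectors."""
    return tuple(x + y for x, y in zip(a, b))

def cm_momentum_squared(s, ma, mb):
    """p^2 from (66.8): 4 s p^2 = [s - (ma + mb)^2][s - (ma - mb)^2]."""
    return (s - (ma + mb) ** 2) * (s - (ma - mb) ** 2) / (4.0 * s)

def s_channel_invariants(masses, total_energy, theta):
    """Return (s, t, u, p_s . p'_s) for the configuration (66.6).

    total_energy must exceed both m1 + m2 and m3 + m4.
    """
    m1, m2, m3, m4 = masses
    s = total_energy ** 2
    p = math.sqrt(cm_momentum_squared(s, m1, m2))
    pp = math.sqrt(cm_momentum_squared(s, m3, m4))
    ps = (0.0, 0.0, p)                                        # p_s along z
    pps = (pp * math.sin(theta), 0.0, pp * math.cos(theta))   # p'_s

    def q_vector(m, v, sign):
        # 4-vector with spatial part v and time component sign * energy
        e = math.sqrt(m * m + sum(c * c for c in v))
        return (sign * e, v[0], v[1], v[2])

    q1 = q_vector(m1, ps, +1)
    q2 = q_vector(m2, tuple(-c for c in ps), +1)
    q3 = q_vector(m3, tuple(-c for c in pps), -1)
    q4 = q_vector(m4, pps, -1)
    return (minkowski_square(add(q1, q2)),
            minkowski_square(add(q1, q3)),
            minkowski_square(add(q1, q4)),
            p * pp * math.cos(theta))
```

The returned values can be compared with $h = m_1^2 + m_2^2 + m_3^2 + m_4^2$ through $s + t + u = h$, and with the right-hand sides of (66.9).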
Instead of (66.9), the simpler formulae

$$t = -(\mathbf p_s - \mathbf p'_s)^2 = -2\mathbf p_s^2(1 - \cos\theta_s), \qquad u = -2\mathbf p_s^2(1 + \cos\theta_s) + (\varepsilon_1 - \varepsilon_2)^2 \tag{66.10}$$

are then obtained, where $\theta_s$ is the angle between $\mathbf p_s$ and $\mathbf p'_s$. The invariant $-t$ is here the square of the (three-dimensional) momentum transfer in the collision.

Similar formulae for the other channels are found by a straightforward change of notation. For the $t$ channel we must interchange $s$ and $t$, and 2 and 3, in (66.6)-(66.10); for the $u$ channel, we interchange $s$ and $u$, and 2 and 4.

§67. Physical regions

When considering the scattering amplitudes as functions of the independent variables $s$, $t$, $u$ (which are related only by $s + t + u = h$), we encounter the need to distinguish regions in which their values are physically permissible from those in which they are not. Values which can correspond to a physical process of scattering must satisfy certain conditions which follow from the law of conservation of 4-momentum and the fact that the square of each of the 4-vectors $q_a$ is a given quantity $m_a^2$.

The product of two 4-momenta satisfies

$$p_a p_b \ge m_a m_b. \tag{67.1}$$

Hence

$$(q_a + q_b)^2 = (p_a + p_b)^2 \ge (m_a + m_b)^2,$$

if $q_a = p_a$, $q_b = p_b$ (or $q_a = -p_a$, $q_b = -p_b$); or

$$(q_a + q_b)^2 = (p_a - p_b)^2 \le (m_a - m_b)^2,$$

if $q_a = p_a$, $q_b = -p_b$. Hence, for a reaction in the $s$ channel,

$$s \ge (m_1+m_2)^2, \quad s \ge (m_3+m_4)^2; \qquad t \le (m_1-m_3)^2, \quad t \le (m_2-m_4)^2; \qquad u \le (m_1-m_4)^2, \quad u \le (m_2-m_3)^2, \tag{67.2}$$

and similarly in the $t$ and $u$ channels.

To determine the remaining conditions, we form a 4-vector $L$ which is dual to the product of any three of the 4-vectors $q_a$, say

$$L_\lambda = e_{\lambda\mu\nu\rho}\, q_1^\mu q_2^\nu q_3^\rho. \tag{67.3}$$

In the rest frame of particle 1, say, we have $q_1 = (q_1^0, 0)$. Then $L$ has only the spatial components $L_i = e_{i0kl}\, q_1^0 q_2^k q_3^l$. Thus $L$ is a space-like vector, and $L^2 \le 0$ in every frame of reference. Expanding $L^2$, we obtain the condition

$$\begin{vmatrix} q_1^2 & q_1q_2 & q_1q_3 \\ q_2q_1 & q_2^2 & q_2q_3 \\ q_3q_1 & q_3q_2 & q_3^2 \end{vmatrix} \ge 0.$$
(67.4)

This can be expressed in terms of the invariants $s$, $t$, $u$ in a form which is the same for all channels:

$$stu \ge as + bt + cu, \tag{67.5}$$

where

$$ah = (m_1^2m_2^2 - m_3^2m_4^2)(m_1^2 + m_2^2 - m_3^2 - m_4^2),$$
$$bh = (m_1^2m_3^2 - m_2^2m_4^2)(m_1^2 + m_3^2 - m_2^2 - m_4^2), \tag{67.6}$$
$$ch = (m_1^2m_4^2 - m_2^2m_3^2)(m_1^2 + m_4^2 - m_2^2 - m_3^2)$$

(T. W. B. Kibble, 1960).

For a graphical representation of the regions of variation of $s$, $t$ and $u$, it is convenient to use triangular coordinates in a plane, called the Mandelstam plane (S. Mandelstam, 1958). The coordinate axes are three straight lines which intersect to form an equilateral triangle. The coordinates $s$, $t$, $u$ are measured along directions perpendicular to these three lines; the directions towards the interior of the triangle are reckoned positive, as shown by the arrows in Fig. 5. Thus each point in the plane has corresponding values of $s$, $t$ and $u$ which are represented (with the appropriate signs) by the lengths of the perpendiculars to the three axes. The condition $s + t + u = h$ is satisfied on account of a known theorem of geometry, $h$ being equal to the altitude of the triangle.†

[Fig. 5: triangular coordinates in the Mandelstam plane, with the axes s = 0, t = 0, u = 0.]

† For example, if the point P in Fig. 5 is joined to the three vertices A, B, C of the triangle, the latter is divided into three triangles with altitudes $s$, $t$ and $u$; equating the sum of their areas to that of the triangle ABC, we obtain the required relation. The proof is similar when P lies outside the triangle ABC.

Let us consider the important case where the principal channel ($s$) corresponds to elastic scattering. Then the masses of the particles are equal in pairs:

$$m_1 = m_3 \equiv m, \qquad m_2 = m_4 \equiv \mu. \tag{67.7}$$

Let $m > \mu$. The condition (67.5) has $h = 2(m^2 + \mu^2)$, $a = c = 0$, $b = (m^2 - \mu^2)^2$, so that

$$sut \ge (m^2 - \mu^2)^2\, t. \tag{67.8}$$

The boundary of the region defined by this inequality comprises the straight line $t = 0$ and the hyperbola

$$su = (m^2 - \mu^2)^2, \tag{67.9}$$

whose two branches lie in the sectors $u < 0$, $s < 0$ and $s > 0$, $u > 0$; the axes $s = 0$
and $u = 0$ are the asymptotes of the hyperbola. Instead of (67.8) we can write

$$t > 0, \quad su > (m^2 - \mu^2)^2 \qquad \text{or} \qquad t < 0, \quad su < (m^2 - \mu^2)^2.$$

Moreover, according to the conditions (67.2), we must apply the inequality $s > (m + \mu)^2$ in the $s$ channel and $u > (m + \mu)^2$ in the $u$ channel; the remaining inequalities are then necessarily satisfied. We thus find that channels I, II, III ($s$, $t$, $u$) correspond to the shaded regions in Fig. 6, which are called physical regions.

If $\mu = 0$ (particles 2 and 4 are photons), the lower branch of the hyperbola touches the axis $t = 0$, and the physical regions are as shown in Fig. 7. If $m = \mu$, the boundaries of the region (67.8) degenerate to the coordinate axes, and the physical regions are the three sectors shown in Fig. 8.

In the general case of four different masses, the equation

$$stu = as + bt + cu \tag{67.10}$$

defines a third-order curve whose branches are the boundaries of the physical regions of the three channels, as shown in Fig. 9.

[Figs. 6-9: physical regions (shaded) in the Mandelstam plane. Fig. 6: elastic scattering with m > μ, showing the lines s = (m ± μ)² and u = (m ± μ)²; Fig. 7: μ = 0; Fig. 8: m = μ; Fig. 9 (a), (b): four different masses, for the two signs of c.]

Let $m_1 \ge m_2 \ge m_3 \ge m_4$. Then $a \ge b \ge c$, $a > 0$, $b > 0$. The curve (67.10) meets the coordinate axes at points on the line $as + bt + cu = 0$ (see the broken lines in Fig. 9). This line is as shown in Fig. 9a and 9b, depending on the sign of $c$. If $c < 0$, the physical region of the $u$ channel includes part of the area of the coordinate triangle. In this case, therefore, the quantities $s$, $t$ and $u$ may all be positive at the same time. All three branches of the boundary curve have the appropriate coordinate axes as asymptotes; this may be seen by eliminating one of the variables from (67.10) by means of the relation $s + t + u = h$, and then making one of the other variables tend to infinity.

In general, the conditions (67.2) yield nothing in addition to the limits defined by equation (67.10).
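The boundary condition (67.5) with the coefficients (67.6) is easy to evaluate numerically. The sketch below is only an illustration: the function names are hypothetical, at least one mass is assumed non-zero (so that $h > 0$), and the sample point is an elastic configuration built from (66.10). Note that (67.5) alone does not say which of the three channels a point belongs to; the threshold conditions (67.2) are needed for that.

```python
import math

def kibble_coefficients(m1, m2, m3, m4):
    """Coefficients a, b, c of (67.6) and h of (66.5); assumes h > 0."""
    a2, b2, c2, d2 = m1**2, m2**2, m3**2, m4**2
    h = a2 + b2 + c2 + d2
    a = (a2 * b2 - c2 * d2) * (a2 + b2 - c2 - d2) / h
    b = (a2 * c2 - b2 * d2) * (a2 + c2 - b2 - d2) / h
    c = (a2 * d2 - b2 * c2) * (a2 + d2 - b2 - c2) / h
    return a, b, c, h

def satisfies_kibble(s, t, u, masses, tol=1e-12):
    """True if stu >= as + bt + cu, i.e. condition (67.5) holds."""
    a, b, c, _ = kibble_coefficients(*masses)
    return s * t * u >= a * s + b * t + c * u - tol

# Elastic example (67.7): m1 = m3 = m, m2 = m4 = mu, illustrative values.
m, mu = 1.0, 0.5
masses = (m, mu, m, mu)
a, b, c, h = kibble_coefficients(*masses)

# A physical s-channel point: cms momentum p = 2, scattering angle 0.7 rad.
p, theta = 2.0, 0.7
s_phys = (math.hypot(m, p) + math.hypot(mu, p)) ** 2
t_phys = -2 * p**2 * (1 - math.cos(theta))
u_phys = h - s_phys - t_phys

# An unphysical point: s below the threshold (m + mu)^2 with t < 0.
s_bad, t_bad = 1.0, -1.0
u_bad = h - s_bad - t_bad
```

For the elastic masses the coefficients reduce to $a = c = 0$, $b = (m^2 - \mu^2)^2$, as stated after (67.7).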
The straight lines which correspond to the equality signs in (67.2) do not intersect the physical regions shown by the shaded areas in Fig. 9; some of them touch the boundaries of these regions, corresponding to extreme values of the variable $s$, $t$ or $u$ in the corresponding channel.

When the mass of one of the particles exceeds the sum of the masses of the other three ($m_1 > m_2 + m_3 + m_4$), a fourth reaction channel is possible, corresponding to the disintegration

$$\text{(IV)} \quad 1 \to \bar 2 + 3 + 4. \tag{67.11}$$

For this channel, in the rest frame of the disintegrating particle,

$$q_1 = (m_1, 0), \quad q_2 = (-\varepsilon_2, -\mathbf p_2), \quad q_3 = (-\varepsilon_3, -\mathbf p_3), \quad q_4 = (-\varepsilon_4, -\mathbf p_4),$$
$$\varepsilon_2 + \varepsilon_3 + \varepsilon_4 = m_1, \qquad \mathbf p_2 + \mathbf p_3 + \mathbf p_4 = 0.$$

The invariants are

$$s = m_1^2 + m_2^2 - 2m_1\varepsilon_2, \qquad t = m_1^2 + m_3^2 - 2m_1\varepsilon_3, \qquad u = m_1^2 + m_4^2 - 2m_1\varepsilon_4. \tag{67.12}$$

We then have from (67.1)

$$(m_3 + m_4)^2 \le s \le (m_1 - m_2)^2, \qquad (m_2 + m_4)^2 \le t \le (m_1 - m_3)^2, \qquad (m_2 + m_3)^2 \le u \le (m_1 - m_4)^2. \tag{67.13}$$

Thus all three invariants are positive, and the physical region of the disintegration channel is within the coordinate triangle.

PROBLEMS

PROBLEM 1. Find the physical regions for the case of three equal masses: $m_1 \equiv m$, $m_2 = m_3 = m_4 \equiv \mu$ (for example, the reaction $K + \pi \to \pi + \pi$).

SOLUTION. Equation (67.10) becomes

$$stu = \mu^2(m^2 - \mu^2)^2, \tag{1}$$

with $s + t + u = 3\mu^2 + m^2$. Regions I, II and III are bounded by curves of the same shape, with $s > 0$, $t < 0$, $u < 0$ for region I, and so on. If $m > 3\mu$, equation (1) also has a branch in the form of a closed curve with $s > 0$, $t > 0$, $u > 0$, which bounds the region of channel IV (Fig. 10).

PROBLEM 2. The same as Problem 1, but for the case $m_1 \equiv m$, $m_2 \equiv \mu$, $m_3 = m_4 = 0$, $m > \mu$ (for example, the reaction $\mu + \nu \to e + \nu$).

SOLUTION. The condition (67.5) becomes

$$stu \ge m^2\mu^2 s,$$

with $s + t + u = m^2 + \mu^2$. The physical regions are bounded by the axis $s = 0$ and the two branches of the hyperbola $tu = m^2\mu^2$ (Fig. 11).

PROBLEM 3.
The same as Problem 1, but for the case $m_1 = m_3 \equiv m$, $m_2 = 0$, $m_4 \equiv \mu$, with $m > 2\mu$ (for example, the reaction $p + \gamma \to p + \pi^0$).

SOLUTION. The boundary equation (67.10) becomes

$$stu = a(s + u) + bt, \qquad ah = m^2\mu^4, \quad bh = m^4(2m^2 - \mu^2), \quad h = 2m^2 + \mu^2.$$

Elimination of $u$ by means of $u = h - s - t$ gives

$$s\,t^2 + \bigl[s^2 - hs + m^2(m^2 - \mu^2)\bigr]\,t + m^2\mu^4 = 0.$$

For a given value of $s$, this is a quadratic in $t$. If $s > (m + \mu)^2$ (the region of the $s$ channel), there are two negative values of $t$ for each value of $s$. If $s = (m + \mu)^2$, these two roots of the quadratic coincide at $t = -m\mu^2/(m + \mu)$. The boundary of the $s$ channel region is then as shown in Fig. 12. The lower branch of the boundary tends asymptotically to the axis $u = 0$, and the upper branch crosses this axis at the point $t = \mu^4/(\mu^2 - m^2)$. The $u$ channel region is symmetrical with the $s$ channel region; the $t$ channel region is situated as shown in Fig. 12.

[Fig. 12: physical regions for Problem 3.]

§68. Expansion in partial amplitudes

An important step in the analysis of a reaction of the form

$$a + b \to c + d \tag{68.1}$$

is the expansion of the scattering amplitude in partial amplitudes, each of which corresponds (for a given total energy $\varepsilon$) to a definite value of the total angular momentum $J$ of the particles in their centre-of-mass system.† These partial amplitudes are, therefore, elements of the S-matrix in the angular momentum representation: $\langle \varepsilon J'M'|S|\varepsilon JM\rangle$. Since the angular momentum $J$ and its component $M$ along a specified $z$-axis are conserved, the S-matrix is diagonal with respect to these numbers (and also with respect to the energy $\varepsilon$). Because of the isotropy of space, the diagonal elements are independent of the value of $M$.

For given $J$, $M$ and $\varepsilon$, the scattering matrix is still a matrix with respect to the spin quantum numbers; the elements of this matrix will be written in a more concise form:

$$\langle \varepsilon JM\lambda'|S|\varepsilon JM\lambda\rangle \equiv \langle \lambda'|S^J(\varepsilon)|\lambda\rangle, \tag{68.2}$$

where $\lambda$ and $\lambda'$ are the sets of spin quantum numbers. These can most naturally be taken to be the helicities of the particles.
The helicity, unlike the spin component along an arbitrary axis in space, is conserved for a free particle, and it commutes with both the momentum and the angular momentum of the particle (§16). The helicities may therefore be used in both the momentum and the angular momentum representation of the scattering matrix. The elements of the S-matrix with respect to the helicity indices will be called the helicity scattering amplitudes, and $\lambda$, $\lambda'$ will be taken to include the helicities of the initial and final particles respectively: $\lambda = (\lambda_a, \lambda_b)$, $\lambda' = (\lambda_c, \lambda_d)$.

† Most of the results in §§68 and 69 are due to M. Jacob and G. C. Wick (1959).

In the momentum representation, the scattering matrix elements are defined with respect to the states $|\varepsilon\mathbf n\lambda\rangle$ (where $\mathbf n = \mathbf p/|\mathbf p|$ is the direction of the momentum of relative motion in the centre-of-mass system); in the angular momentum representation, they are defined with respect to the states $|\varepsilon JM\lambda\rangle$. They can be related by means of the expansions

$$|JM\lambda\rangle = \int |\mathbf n\lambda\rangle\langle \mathbf n\lambda|JM\lambda\rangle\, do_{\mathbf n}, \tag{68.3}$$

where the integration is over the directions $\mathbf n$; the energy $\varepsilon$ is, for brevity, omitted from the state symbols. Since this transformation is unitary (see QM, §12), the coefficients of the inverse transformation are

$$\langle JM\lambda|\mathbf n\lambda\rangle = \langle \mathbf n\lambda|JM\lambda\rangle^*. \tag{68.4}$$

By the general rule of matrix transformation, the same coefficients give the relation between the S-matrix elements in the two representations:

$$\langle \mathbf n'\lambda'|S|\mathbf n\lambda\rangle = \sum_{JM} \langle \mathbf n'\lambda'|JM\lambda'\rangle\langle JM\lambda'|S|JM\lambda\rangle\langle JM\lambda|\mathbf n\lambda\rangle. \tag{68.5}$$

The coefficients in the expansion (68.3) are easily found by means of the results of §16. Let the wave functions of all states be expressed in the momentum representation, i.e. as functions of the direction of the momentum (for a given energy); this direction, as an independent variable, will be denoted by $\boldsymbol\nu$ to distinguish it from the direction $\mathbf n$ as a quantum number of the state. In this representation, the wave function has the form (16.2):

$$\psi_{\mathbf n\lambda}(\boldsymbol\nu) = u^{(\lambda)}\,\delta^{(2)}(\boldsymbol\nu - \mathbf n).$$
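(68.6)

When (68.6) is substituted in the expansion (68.3), the latter reduces to a single term:

$$\psi_{JM\lambda}(\boldsymbol\nu) = \langle \boldsymbol\nu\lambda|JM\lambda\rangle\, u^{(\lambda)}. \tag{68.7}$$

The helicities $\lambda_a$ and $\lambda_b$ of the two particles are defined as the components of their spins in the directions of their respective momenta. If the momenta are $\mathbf p_a \equiv \mathbf p$, $\mathbf p_b = -\mathbf p$, then these directions are $\mathbf n$ for the first particle and $-\mathbf n$ for the second particle. If now the system is regarded as a single particle with helicity $\Lambda$ in the direction $\mathbf n$, then $\Lambda = \lambda_a - \lambda_b$. Its wave function (in the momentum representation) can be written, according to (16.4), in the form

$$\psi_{JM\lambda}(\boldsymbol\nu) = u^{(\lambda)}\, D^{(J)}_{\Lambda M}(\boldsymbol\nu)\, \sqrt{\frac{2J+1}{4\pi}}. \tag{68.8}$$

Comparison of (68.7) and (68.8), with the variable $\boldsymbol\nu$ replaced by $\mathbf n$, gives the required coefficients:

$$\langle \mathbf n\lambda|JM\lambda\rangle = \sqrt{\frac{2J+1}{4\pi}}\, D^{(J)}_{\Lambda M}(\mathbf n). \tag{68.9}$$

Substituting these coefficients in (68.5), we have

$$\langle \mathbf n'\lambda'|S|\mathbf n\lambda\rangle = \sum_{JM} \frac{2J+1}{4\pi}\, D^{(J)}_{\Lambda' M}(\mathbf n')\, D^{(J)*}_{\Lambda M}(\mathbf n)\, \langle \lambda'|S^J|\lambda\rangle, \tag{68.10}$$
$$\Lambda = \lambda_a - \lambda_b, \qquad \Lambda' = \lambda_c - \lambda_d,$$

with the abbreviated notation (68.2). If the direction $\mathbf n$ is taken as that of the $z$-axis, then $D^{(J)}_{\Lambda M}(\mathbf n) = \delta_{\Lambda M}$, and (68.10) becomes

$$\langle \mathbf n'\lambda'|S|\mathbf n\lambda\rangle = \sum_{J} \frac{2J+1}{4\pi}\, D^{(J)}_{\Lambda'\Lambda}(\mathbf n')\, \langle \lambda'|S^J|\lambda\rangle. \tag{68.11}$$

We see that the expansion in partial amplitudes has the functions $D^{(J)}_{\Lambda'\Lambda}$ as coefficients.

For a reaction of the form (68.1), it is convenient to define the scattering amplitude $f$ in such a way that the cross-section (in the centre-of-mass system) is

$$d\sigma = |\langle \mathbf n'\lambda'|f|\mathbf n\lambda\rangle|^2\, do'; \tag{68.12}$$

by comparison with (64.19), we can relate this amplitude to the matrix element $M_{fi}$. The expansion of the amplitude in partial amplitudes may be written

$$\langle \mathbf n'\lambda'|f|\mathbf n\lambda\rangle = \sum_{JM} (2J+1)\, D^{(J)}_{\Lambda' M}(\mathbf n')\, D^{(J)*}_{\Lambda M}(\mathbf n)\, \langle \lambda'|f^J|\lambda\rangle, \tag{68.13}$$

or, taking the $z$-axis in the direction $\mathbf n$,

$$\langle \mathbf n'\lambda'|f|\mathbf n\lambda\rangle = \sum_{J} (2J+1)\, D^{(J)}_{\Lambda'\Lambda}(\mathbf n')\, \langle \lambda'|f^J|\lambda\rangle. \tag{68.14}$$

This is a generalization of the usual expansion in partial amplitudes for the scattering of spinless particles; see QM, (123.14). Since $D^{(L)}_{00} = P_L(\cos\theta)$, (68.14) reduces in the case of zero spins to an expansion in Legendre polynomials:

$$f(\theta) = \sum_L (2L+1)\, f_L\, P_L(\cos\theta).$$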
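The spinless limit of (68.14) can be illustrated by a short numerical sketch. The code below sums the Legendre series for a given list of partial amplitudes $f_L$; the function names and the use of the standard three-term recurrence for $P_L$ are choices made for this illustration only, and the sample values of $f_L$ are arbitrary.

```python
import math

def legendre(L, x):
    """Legendre polynomial P_L(x) by the recurrence
    (n + 1) P_{n+1} = (2n + 1) x P_n - n P_{n-1}."""
    if L == 0:
        return 1.0
    p_prev, p = 1.0, x
    for n in range(1, L):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    return p

def amplitude(partial_amplitudes, theta):
    """f(theta) = sum over L of (2L + 1) f_L P_L(cos theta).

    partial_amplitudes[L] holds f_L (real or complex); the series is
    truncated at the length of the list.
    """
    x = math.cos(theta)
    return sum((2 * L + 1) * f_L * legendre(L, x)
               for L, f_L in enumerate(partial_amplitudes))

# Illustrative (made-up) partial amplitudes for L = 0, 1, 2.
f_L = [0.5 + 0.1j, 0.2j, 0.05]
f_forward = amplitude(f_L, 0.0)           # P_L(1) = 1 for every L
dsigma = abs(amplitude(f_L, 1.0)) ** 2    # |f|^2 at theta = 1 rad, cf. (68.12)
```

In the forward direction every $P_L(1) = 1$, so the sum reduces to $\sum_L (2L+1) f_L$, which gives a quick sanity check on any implementation.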
The cross-section (68.12) is valid when all the particles have definite helicities. If they are in mixed polarization states, the cross-section is found by averaging the product

$$\langle \lambda_c\lambda_d|f|\lambda_a\lambda_b\rangle\, \langle \lambda'_c\lambda'_d|f|\lambda'_a\lambda'_b\rangle^*$$

over the polarization density matrices of the particles,

$$\langle \lambda_a|\rho^{(a)}|\lambda'_a\rangle\, \langle \lambda_b|\rho^{(b)}|\lambda'_b\rangle\, \langle \lambda'_c|\rho^{(c)}|\lambda_c\rangle\, \langle \lambda'_d|\rho^{(d)}|\lambda_d\rangle;$$

see the first footnote to §48. For example, in a reaction between unpolarized particles $a$, $b$ to form unpolarized particles $c$, $d$, we have

$$\frac{d\sigma}{do'} = \frac{1}{(2s_a+1)(2s_b+1)} \sum_{\lambda} \sum_{JJ'} (2J+1)(2J'+1)\, \langle \lambda_c\lambda_d|f^J|\lambda_a\lambda_b\rangle\, \langle \lambda_c\lambda_d|f^{J'}|\lambda_a\lambda_b\rangle^*\, D^{(J)}_{\Lambda'\Lambda}(\mathbf n')\, D^{(J')*}_{\Lambda'\Lambda}(\mathbf n'); \tag{68.15}$$

the $z$-axis is along $\mathbf n$, and the first summation is over $\lambda_a$, $\lambda_b$, $\lambda_c$, $\lambda_d$. Using QM, (58.19) for the function $D^{*}_{\Lambda'\Lambda}$ and then the expansion QM, (110.2), we have finally

$$\frac{d\sigma}{do'} = \frac{1}{(2s_a+1)(2s_b+1)} \sum_{\lambda} \sum_{JJ'} (-1)^{\Lambda - \Lambda'} (2J+1)(2J'+1)\, \langle \lambda_c\lambda_d|f^J|\lambda_a\lambda_b\rangle\, \langle \lambda_c\lambda_d|f^{J'}|\lambda_a\lambda_b\rangle^* \sum_L (2L+1) \begin{pmatrix} J & J' & L \\ \Lambda & -\Lambda & 0 \end{pmatrix} \begin{pmatrix} J & J' & L \\ \Lambda' & -\Lambda' & 0 \end{pmatrix} P_L(\cos\theta), \tag{68.16}$$

where $\theta$ is the angle between $\mathbf n'$ and the $z$-axis; the summation with respect to $L$ is over all integers which can occur when $J$ and $J'$ are added vectorially.

The expansion of the scattering amplitude in partial amplitudes gives a full expression of all properties of the angular distribution of scattering that are due to the symmetry with respect to spatial rotations. But it does not explicitly reveal the properties that are due to the symmetry with respect to spatial inversion. The P invariance (if possessed by the interaction) leads to certain relations between the various helicity amplitudes (see §69).

§69. Symmetry of helicity scattering amplitudes

The conditions imposed by the symmetry with respect to the transformations P, T, C (if, of course, the particle interaction process in question in fact possesses such symmetry) lead to certain relations between the helicity scattering amplitudes, and therefore reduce the number of independent amplitudes.† To establish these relations, we shall first determine the symmetry properties of the helicity states of a system of two particles.
Let us consider the particles in their centre-of-mass system. One particle has momentum $\mathbf p_1 \equiv \mathbf p$ and helicity $\lambda_1$ with respect to the direction of $\mathbf p$; the other has momentum $\mathbf p_2 = -\mathbf p$ and helicity $\lambda_2$ with respect to the direction of $-\mathbf p$. If the helicity is defined with respect to the same direction, that of $\mathbf p$, its values are $\lambda_1$ and $-\lambda_2$, and the particles will thus be described by plane waves with amplitudes $u_{\mathbf p}^{(\lambda_1)}$ and $u_{\mathbf p}^{(-\lambda_2)}$. The two-particle system is described by a (multi-component) function $u_{\mathbf p}^{(\lambda_1\lambda_2)}$ formed from the products of the amplitudes $u_{\mathbf p}^{(\lambda_1)}$ and $u_{\mathbf p}^{(-\lambda_2)}$.

† This number does not, of course, depend on the specific representation of the matrix $S$, and is the same for any choice of the spin variables.

Let us next regard the system as a single particle with helicity $\Lambda = \lambda_1 - \lambda_2$ in the direction $\mathbf n = \mathbf p/|\mathbf p|$; we can then write the wave function (in the momentum representation, i.e. as a function of $\mathbf n$) for a state with definite values of $J$, $M$, $\lambda_1$, $\lambda_2$ (and of the total energy $\varepsilon$):

$$\psi_{JM\lambda_1\lambda_2} = u^{(\lambda_1\lambda_2)}\, D^{(J)}_{\Lambda M}(\mathbf n)\, \sqrt{\frac{2J+1}{4\pi}}, \qquad \Lambda = \lambda_1 - \lambda_2; \tag{69.1}$$

cf. (68.8). Since $\Lambda$ is the component of the total angular momentum in the direction of $\mathbf p$, we must have

$$|\Lambda| \le J. \tag{69.2}$$

According to (16.14), under inversion

$$P u^{(\lambda_1\lambda_2)}(\mathbf n) = \eta_1\eta_2\, u^{(\lambda_1\lambda_2)}(-\mathbf n) = \eta_1\eta_2\, (-1)^{s_1+s_2-\lambda_1+\lambda_2}\, u^{(-\lambda_1,-\lambda_2)}(\mathbf n), \tag{69.3}$$

where $\eta_1$ and $\eta_2$ are the internal parities of the particles. Using also (16.10), we find the transformation law for the functions (69.1):

$$P\psi_{JM\lambda_1\lambda_2} = \eta_1\eta_2\, (-1)^{J-s_1-s_2}\, \psi_{JM,-\lambda_1,-\lambda_2}. \tag{69.4}$$

If the two particles are identical, the question arises of the symmetry with respect to their interchange. This interchange implies interchanging their momenta and their spins. To show the significance of this operation as applied to the function (69.1), we note that its definition contains an asymmetry, in that the angular momenta of the two particles are projected on the direction of the same vector $\mathbf p_1 = \mathbf p$, the momentum of the first particle.
After the interchange, this vector is replaced by $\mathbf p_2 = -\mathbf p$, and the components of the angular momenta $\mathbf j_1$ and $\mathbf j_2$ along this vector are $-\lambda_1$ and $\lambda_2$ (instead of $\lambda_1$ and $-\lambda_2$ along $\mathbf p$). The result of applying the particle interchange operator $P_{12}$ to the function (69.1) may therefore be written

$$P_{12}\psi_{JM\lambda_1\lambda_2} = u^{(-\lambda_2,-\lambda_1)}(-\mathbf n)\, D^{(J)}_{\Lambda M}(-\mathbf n)\, \sqrt{\frac{2J+1}{4\pi}},$$

where again $\Lambda = \lambda_1 - \lambda_2$. Then, using (69.3) and (16.10), we find that

$$P_{12}\psi_{JM\lambda_1\lambda_2} = (-1)^{2s-J}\, \psi_{JM\lambda_2\lambda_1}, \tag{69.5}$$

where $s_1 = s_2 \equiv s$.

For identical particles, the permissible states must be either symmetric (for bosons) or antisymmetric (for fermions) with respect to interchange. Since the former case occurs when the particle spin $s$ is integral and the latter case when it is half-integral, in either case the permissible helicity states of the two-particle system can be written as linear combinations

$$[1 + (-1)^{2s} P_{12}]\, \psi_{JM\lambda_1\lambda_2},$$

or, according to (69.5),

$$\psi_{JM\lambda_1\lambda_2} + (-1)^J\, \psi_{JM\lambda_2\lambda_1}. \tag{69.6}$$

It is noteworthy that this combination is the same for both bosons and fermions.

For a particle-antiparticle system, the result of the interchange is expressed by the same formula (69.5), but, unlike the case of identical particles, states of either symmetry under interchange are here permissible, i.e. both combinations

$$\psi^{\pm} = \psi_{JM\lambda_1\lambda_2} \pm (-1)^J\, \psi_{JM\lambda_2\lambda_1} \tag{69.7}$$

can occur. These states have certain charge parities $C$. The operation of charge conjugation may be regarded as the result of a total interchange of all variables (spin and charge) of the two particles, followed by reverse interchange of the spin variables (helicities). The result of the first operation must be the same as that of interchange in a system of two identical particles. Hence it is clear that, with the upper sign in (69.7) (which is the same as the sign in the state (69.6) permissible for identical particles), the system will be charge-even, and with the lower sign charge-odd: $C\psi^{\pm} = \pm\psi^{\pm}$.

Finally, let us consider the operation of time reversal.
The wave function of a particle at rest with spin $s$ and component thereof $\sigma$ is transformed according to

$$T\psi_{s\sigma} = (-1)^{s-\sigma}\, \psi_{s,-\sigma};$$

see QM, (60.2). The wave function of two particles in their centre-of-mass system may also be regarded (in respect of its transformation properties) as that of a "particle" at rest, with angular momentum $J$ and component thereof $M$. The helicities $\lambda_1$, $\lambda_2$ are unchanged: time reversal changes the sign of the momentum and angular momentum vectors, and the products $\mathbf j\cdot\mathbf p$ are therefore unaffected. Hence

$$T\psi_{JM\lambda_1\lambda_2} = (-1)^{J-M}\, \psi_{J,-M,\lambda_1\lambda_2}. \tag{69.8}$$

We can now write down immediately the symmetry relations for the helicity amplitudes. If the interaction is P-invariant, then for the reaction $a + b \to c + d$ the amplitudes of the transitions $|\lambda_a\lambda_b\rangle \to |\lambda_c\lambda_d\rangle$ and $P|\lambda_a\lambda_b\rangle \to P|\lambda_c\lambda_d\rangle$ must be the same (for given $J$ and $\varepsilon$). Hence, using (69.4), we find

$$\langle \lambda_c\lambda_d|S^J|\lambda_a\lambda_b\rangle = \frac{\eta_c\eta_d}{\eta_a\eta_b}\, (-1)^{s_c+s_d-s_a-s_b}\, \langle -\lambda_c, -\lambda_d|S^J|-\lambda_a, -\lambda_b\rangle. \tag{69.9}$$

If states with definite parities, i.e. the combinations

$$\frac{1}{\sqrt 2}\bigl(\psi_{JM\lambda_1\lambda_2} \pm \psi_{JM,-\lambda_1,-\lambda_2}\bigr),$$

where $\lambda_1, \lambda_2 = \lambda_a, \lambda_b$ or $\lambda_c, \lambda_d$, are chosen instead of those with definite helicities, then the amplitudes of transitions in which parity is not conserved are zero.

Time reversal transforms each state in accordance with (69.8), and also interchanges initial and final states. Thus T invariance leads to the relations

$$\langle \lambda_c\lambda_d|S^J(\varepsilon)|\lambda_a\lambda_b\rangle = \langle \lambda_a\lambda_b|S^J(\varepsilon)|\lambda_c\lambda_d\rangle. \tag{69.10}$$

These two amplitudes, however, pertain to different processes, the direct and reverse reactions. These two processes are essentially equivalent only in the case of elastic scattering, and (69.10) is then a relation between helicity amplitudes for the same reaction.

In elastic scattering of two identical particles, the number of different amplitudes is further reduced because of the symmetry with respect to interchange. We have seen that, for a given $J$, the states which occur are either all symmetric or all antisymmetric in $\lambda_1$, $\lambda_2$.
The conservation of angular momentum therefore implies that of the symmetry with respect to interchange of helicities.

A similar situation occurs in the elastic scattering of a particle by its antiparticle, or the conversion of one particle-antiparticle pair into another, i.e. a reaction $a + \bar a \to b + \bar b$. For given $J$, there are both symmetric and antisymmetric states with regard to $\lambda_1$, $\lambda_2$, but they correspond to different values of the charge parity of the system. Hence it follows that, if the interaction of the particles is C-invariant, so that the charge parity is conserved, transitions between states of different symmetry with regard to $\lambda_1$, $\lambda_2$ are forbidden.† It must be emphasized, however, that there is a difference from the case of identical particles, in which states of one symmetry are entirely absent for any given $J$. In the "particle-antiparticle" case, only transitions between states of different symmetry are forbidden; the states themselves exist for every $J$.

Because of the universal CPT invariance, the existence of T invariance implies that of CP invariance. The latter brings about the equality of amplitudes for two reactions, one obtained from the other by replacing all particles by antiparticles (and changing the sign of the helicities):

$$\langle \lambda_c\lambda_d|S^J|\lambda_a\lambda_b\rangle = \langle \lambda_{\bar c}\lambda_{\bar d}|S^J|\lambda_{\bar a}\lambda_{\bar b}\rangle, \tag{69.11}$$

where $\lambda_{\bar a} = -\lambda_a$ and so on.‡

† A similar prohibition can also arise from isotopic invariance of the interaction of non-identical particles. For instance, transitions between states of different symmetry with regard to $\lambda_1$, $\lambda_2$ are forbidden, to the extent that this invariance holds, in the scattering of a neutron by a proton.

‡ Since these two amplitudes relate to different reactions, interference between which is not possible, the phase factor in (69.11) would have no significance, and can be taken as unity. Only the equality of cross-sections which follows from (69.11) is actually meaningful.

The number of independent amplitudes is the same for all the cross-channels of
one generalized reaction, and therefore this number can be determined from any channel. For example, the elastic scattering $a + b \to a + b$ and the annihilation $a + \bar a \to b + \bar b$ are described by the same number of independent amplitudes. The restrictions imposed by T invariance in the first case are equivalent to those imposed by C invariance in the second case.

Let us also consider a reaction in which one particle disintegrates into two: $a \to b + c$. In the centre-of-mass system (the rest frame for particle $a$), we have $\mathbf p_b = -\mathbf p_c$. Scalar multiplication of the equation $\mathbf j_a = \mathbf j_b + \mathbf j_c$ by $\mathbf p_b$ gives

$$\lambda_a = \lambda_b - \lambda_c \tag{69.12}$$

(the helicity $\lambda_a$ of particle $a$ is defined as the component of its spin in the direction of the momentum of one of the secondary particles). This relation can be regarded as a consequence of the additional symmetry present in the process considered, namely the axial symmetry about the directions of $\mathbf p_b$ and $\mathbf p_c$. If the spin $s_a$ of particle $a$ is less than $s_b + s_c$, the relation (69.12) reduces the number of possible sets of values of $\lambda_a$, $\lambda_b$, $\lambda_c$ and therefore the number of independent helicity amplitudes of the disintegration. The total angular momentum $J$ is then equal to the spin $s_a$ of the primary particle, and is consequently fixed.

The P invariance in the disintegration is expressed by the relation

$$\langle \lambda_b\lambda_c|S^J|\lambda_a\rangle = \frac{\eta_b\eta_c}{\eta_a}\, (-1)^{s_a-s_b-s_c}\, \langle -\lambda_b, -\lambda_c|S^J|-\lambda_a\rangle, \tag{69.13}$$

where we have used (69.4) and also the transformation (16.16) for the wave function of a single particle.

If the primary particle is strictly neutral, further limitations arise if C parity is conserved. Three cases are to be distinguished here. If the disintegration products are also strictly neutral, we must have $C_a = C_bC_c$; this condition either prohibits the disintegration altogether, or is satisfied and causes no further restriction.
If the particles $b$ and $c$ are different, then C invariance implies a relation between the amplitudes of the different processes $a \to b + c$ and $a \to \bar b + \bar c$. Finally, for the disintegration $a \to b + \bar b$, there is a restriction because, for a given charge parity $C$ and a given total angular momentum $J = s_a$, the system may be in states either symmetric or antisymmetric with respect to the helicities, depending on the parity of the number $J$ and on the sign of $C$.

CP invariance implies the equality of amplitudes for the disintegrations $a \to b + c$ and $\bar a \to \bar b + \bar c$:

$$\langle \lambda_b\lambda_c|S^J|\lambda_a\rangle = \langle \lambda_{\bar b}\lambda_{\bar c}|S^J|\lambda_{\bar a}\rangle, \tag{69.14}$$

where $\lambda_{\bar a} = -\lambda_a$ and so on; i.e. it implies equal probabilities of disintegration for the particle and the antiparticle. If the particle can disintegrate in more than one way (through various channels), this equality applies to each channel. This conclusion, it must be emphasized, is based on the existence of CP invariance, which is not a universal property of Nature. Only CPT invariance is universal, and this by itself would lead only to the equation

$$\langle \lambda_b\lambda_c|S^J|\lambda_a\rangle = \langle \lambda_{\bar a}|S^J|\lambda_{\bar b}\lambda_{\bar c}\rangle,$$

in which the right-hand side refers to the process inverse to disintegration. We shall see later (§71) that the condition of CPT invariance, together with unitarity requirements, does lead to a relation, although a more restricted one, between the disintegration probabilities for the particle and the antiparticle.

PROBLEMS

PROBLEM 1. Using (69.6), obtain a classification of the possible states of a two-photon system.

SOLUTION. In this case $\lambda_1, \lambda_2 = \pm 1$. For even $J$ ($> 0$), according to (69.6), three states symmetric in $\lambda_1$, $\lambda_2$ are allowed:

(a) $\psi_{JM,1,1}$, (b) $\psi_{JM,-1,-1}$, (c) $\psi_{JM,1,-1} + \psi_{JM,-1,1}$.

For odd $J$ ($> 1$), one antisymmetric state is allowed:

(d) $\psi_{JM,1,-1} - \psi_{JM,-1,1}$.
States (c) and (d) also have a definite parity ($+1$): according to (69.4),

$$P(\psi_{JM,1,-1} \pm \psi_{JM,-1,1}) = \pm(-1)^J\, (\psi_{JM,1,-1} \pm \psi_{JM,-1,1});$$

the factor $\pm(-1)^J = 1$, since the upper sign refers to even values of $J$ and the lower sign to odd values. States (a) and (b) themselves have no definite parity, but even and odd states are obtained by taking the combinations

(a') $\psi_{JM,1,1} + \psi_{JM,-1,-1}$, (b') $\psi_{JM,1,1} - \psi_{JM,-1,-1}$.

When $J = 0$, only $\lambda_1 = \lambda_2$ is allowed by the condition $|\lambda_1 - \lambda_2| \le J$, so that state (c) does not occur, leaving one even and one odd state, (a') and (b'). Finally, if $J = 1$, state (d), which is the only possible state for odd $J$, is forbidden because it has $\Lambda = 2 > J$. Thus we arrive at the table (9.5) for the permissible states.

PROBLEM 2. In the non-relativistic approximation, the total angular momentum $J$ of the system is found by adding the spin $S$ and the orbital angular momentum $L$. For a system of two particles, find the relation between the states $|JLSM\rangle$ and $|JM\lambda_1\lambda_2\rangle$.

SOLUTION. According to the rule for constructing wave functions when adding angular momenta, we have

$$\psi_{JLSM} = \sum \bigl\{\psi^{(1)}_{s_1\sigma_1}\psi^{(2)}_{s_2\sigma_2}\, \langle \sigma_1\sigma_2|SM_S\rangle\bigr\}\, \psi_{LM_L}\, \langle M_LM_S|JM\rangle, \tag{1}$$

where $\psi_{s\sigma}$ are the eigenfunctions of the spin $s$ with component $\sigma$ along a fixed $z$-axis, $\psi_{LM_L}$ those of the orbital angular momentum $L$ with component $M_L$; the expression in the braces corresponds to the addition of $s_1$ and $s_2$ to give $S$, after which $S$ is added to $L$ to give $J$; the summation is over all $m$-type indices. Let all functions be expressed in the momentum representation, as functions of the direction $\mathbf n$ of the momentum $\mathbf p = \mathbf p_1$, and let the functions $\psi_{s\sigma}$ be expressed in terms of the functions $\psi_{\mathbf n\lambda}$ of the helicity states by means of QM, (58.7):

$$\psi^{(1)}_{s_1\sigma_1} = \sum_{\lambda_1} D^{(s_1)}_{\lambda_1\sigma_1}(\mathbf n)\, \psi^{(1)}_{\mathbf n\lambda_1}, \qquad \psi^{(2)}_{s_2\sigma_2} = \sum_{\lambda_2} D^{(s_2)}_{-\lambda_2\sigma_2}(\mathbf n)\, \psi^{(2)}_{\mathbf n,-\lambda_2}.$$

For the function $\psi_{LM_L}$ we have

$$\psi_{LM_L} = Y_{LM_L}(\mathbf n) = i^L \sqrt{\frac{2L+1}{4\pi}}\, D^{(L)}_{0M_L}(\mathbf n),$$

using QM (58.25) and the definition (16.5).
Substituting these functions in equation (1), and twice using the expansion QM, (110.1), together with the orthogonality of the Clebsch-Gordan coefficients (QM, (106.13)), we obtain the expansion

$$\psi_{JLSM} = \sum_{\lambda_1\lambda_2} \psi_{JM\lambda_1\lambda_2}\, \langle JM\lambda_1\lambda_2|JLSM\rangle, \tag{2}$$

where

$$\psi_{JM\lambda_1\lambda_2} = \psi^{(1)}_{\mathbf n\lambda_1}\psi^{(2)}_{\mathbf n,-\lambda_2}\, D^{(J)}_{\Lambda M}(\mathbf n)\, \sqrt{\frac{2J+1}{4\pi}}, \qquad \Lambda = \lambda_1 - \lambda_2,$$

and the coefficients are

$$\langle JM\lambda_1\lambda_2|JLSM\rangle = (-i)^L (-1)^{s_1-s_2+S} \sqrt{(2L+1)(2S+1)} \begin{pmatrix} s_1 & s_2 & S \\ \lambda_1 & -\lambda_2 & -\Lambda \end{pmatrix} \begin{pmatrix} L & S & J \\ 0 & \Lambda & -\Lambda \end{pmatrix}. \tag{3}$$

Since the transformation (2) is unitary, we have

$$\langle JLSM|JM\lambda_1\lambda_2\rangle = \langle JM\lambda_1\lambda_2|JLSM\rangle^*.$$

§70. Invariant amplitudes

In the helicity amplitudes, a particular frame of reference is used, namely the centre-of-mass system. But, in order to calculate the scattering amplitudes by means of invariant perturbation theory (and also to examine their general analytical properties), it is convenient to write them in an explicitly invariant form.

If the particles concerned in the reaction have no spin, the scattering amplitude depends only on the invariant products of the 4-momenta of the particles. For a reaction of the form

$$a + b \to c + d, \tag{70.1}$$

these invariants may be taken as any two of the quantities $s$, $t$, $u$ defined in §66. Then the scattering amplitude reduces to a single function $M_{fi} = f(s, t)$.

If the particles have spins, then, besides the kinematic invariants $s$, $t$, $u$, there are also invariants which can be constructed from the wave amplitudes of the particles (bispinors, 4-tensors, etc.). The scattering amplitudes must then have the form

$$M_{fi} = \sum_n f_n(s, t)\, F_n, \tag{70.2}$$

where the $F_n$ are invariants which depend linearly on the wave amplitudes of all the particles concerned (and also on their 4-momenta). The coefficients $f_n(s, t)$ are called invariant amplitudes.

By choosing the wave amplitudes in such a way as to correspond to particles with definite helicities, we obtain definite values of the invariants $F_n = F_n(\lambda_i, \lambda_f)$. Then the helicity scattering amplitudes are linear homogeneous combinations of the invariant amplitudes $f_n$.
Hence we see that the number of independent functions $f_n(s, t)$ is equal to the number of independent helicity amplitudes. Since the latter number is easily determined, as shown in §69, this makes easier the construction of the invariants $F_n$, their number being known in advance.

Let us consider some examples, assuming in every case that the interaction is T-invariant and P-invariant. The latter property implies that the invariants $F_n$ must be true scalars, not pseudoscalars.

SCATTERING OF PARTICLES WITH SPIN 0 AND ½

To find the number of invariants (that is, the number of independent helicity amplitudes), we note that the total number of elements of the matrix $S^J$ (i.e. of different sets $\lambda_1$, $\lambda_2$, $\lambda'_1$, $\lambda'_2$) is in this case four: $\lambda_1 = \lambda'_1 = 0$; $\lambda_2, \lambda'_2 = \pm\tfrac12$. When the P invariance is taken into account, the number of independent elements is reduced to two, and this is unchanged by the inclusion of T invariance.

The two independent invariants may be taken as

$$F_1 = \bar u' u, \qquad F_2 = \bar u'(\gamma K)u, \tag{70.3}$$

where $u = u(p)$, $u' = u(p')$ are the bispinor amplitudes of the initial and final fermions; $K = k + k'$, where $k$ and $k'$ are the 4-momenta of the initial and final bosons.†

The T invariance of the quantities (70.3) is evident if we note that under time reversal the products $\bar u'u$ and $\bar u'\gamma^\mu u$ are transformed according to the same rule (28.6) as the operators $\bar\psi\psi$ and $\bar\psi\gamma^\mu\psi$, whose matrix elements they are: $\bar u'u$ is invariant, and the 4-vector $\bar u'\gamma u$ is transformed according to $\bar u'\gamma^0u \to \bar u'\gamma^0u$, $\bar u'\boldsymbol\gamma u \to -\bar u'\boldsymbol\gamma u$. The 4-momenta are transformed similarly: $(K^0, \mathbf K) \to (K^0, -\mathbf K)$, and the scalar product $F_2 = K_\mu(\bar u'\gamma^\mu u)$ is therefore invariant.

ELASTIC SCATTERING OF TWO IDENTICAL PARTICLES WITH SPIN ½

To find the number of independent helicity amplitudes, it is convenient to start from linear combinations of the helicity states:

$$\psi_{1g} = \psi_{++} + \psi_{--}, \qquad \psi_{3g} = \psi_{+-} + \psi_{-+},$$
$$\psi_{2g} = \psi_{++} - \psi_{--}, \qquad \psi_{u} = \psi_{+-} - \psi_{-+},$$

where the suffixes $\pm$ denote the values $\pm\tfrac12$ of the helicities of the two particles.
The states $1g$, $2g$, $3g$ are even, and $u$ is odd, with respect to interchange of the particles. The transitions $g \leftrightarrow u$ are forbidden, so that there remain $16 - 6 = 10$ matrix elements when the interchange symmetry is taken into account. The functions $\psi_{1g}$ and $\psi_{3g}$ on the one hand, and $\psi_{2g}$ on the other, have opposite parities with respect to the inversion P; the prohibition of transitions between them reduces the number of independent amplitudes to six. Lastly, the T invariance equalizes the amplitudes of the transitions $1g \to 3g$ and $3g \to 1g$, leaving only five independent amplitudes. The five independent invariants may be taken as

$$\begin{aligned}
F_1 &= (\bar u_1'u_1)(\bar u_2'u_2), & F_2 &= (\bar u_1'\gamma^5u_1)(\bar u_2'\gamma^5u_2),\\
F_3 &= (\bar u_1'\gamma^\mu u_1)(\bar u_2'\gamma_\mu u_2), & F_4 &= (\bar u_1'\gamma^\mu\gamma^5u_1)(\bar u_2'\gamma_\mu\gamma^5u_2),\\
F_5 &= (\bar u_1'\sigma^{\mu\nu}u_1)(\bar u_2'\sigma_{\mu\nu}u_2),
\end{aligned} \tag{70.4}$$

where $u_1$, $u_2$ are the bispinor amplitudes of the initial particles and $u_1'$, $u_2'$ those of the final particles. Interchange of the initial (or of the final) particles gives no new invariants: the invariants obtained can be expressed in terms of the previous ones (§28, Problem). But the expression (70.2), with the $F_n$ given by (70.4), does not explicitly take account of the requirement that interchange of two identical fermions must change the sign of the scattering amplitude. An expression which satisfies this condition may be written

$$M_{fi} = [(\bar u_1'u_1)(\bar u_2'u_2)f_1(t,u) - (\bar u_2'u_1)(\bar u_1'u_2)f_1(u,t)] + \ldots \tag{70.5}$$

When $p_1$ and $p_2$ (or $p_1'$ and $p_2'$) are interchanged, the kinematic invariants transform as $s \to s$, $t \to u$, $u \to t$, so that the condition is necessarily satisfied.

† (Footnote to (70.3).) At first sight, there might appear to be another invariant of the form $\bar u'\sigma_{\mu\nu}p^\mu K^\nu u$ (with the matrices $\sigma_{\mu\nu}$ defined by (28.2)), but this is easily seen to reduce to the invariants (70.3) by means of the conservation law $k' = p + k - p'$ and the equations $(\gamma p)u = mu$, $\bar u'(\gamma p') = m\bar u'$ satisfied by the bispinor amplitudes.
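The counting of independent helicity amplitudes used in these two examples can be sketched as a small enumeration. The sketch below rests on an assumption that is not part of the text: each symmetry is treated as merely identifying helicity configurations with one another (up to a phase), so that the number of independent amplitudes equals the number of classes of configurations. The function names are illustrative only. Under that assumption the enumeration reproduces the numbers quoted above; it is not a substitute for the parity argument with the states $\psi_{1g}, \psi_{2g}, \psi_{3g}, \psi_u$.

```python
from itertools import product


def count_classes(configs, symmetries):
    """Number of classes of helicity configurations linked by the given maps."""
    remaining = set(configs)
    classes = 0
    while remaining:
        stack = [remaining.pop()]
        classes += 1
        while stack:
            c = stack.pop()
            for g in symmetries:
                image = g(c)
                if image in remaining:
                    remaining.remove(image)
                    stack.append(image)
    return classes


# Helicities are written in units of 1/2, so every entry is +1 or -1.
def parity(c):
    # P reverses every helicity
    return tuple(-x for x in c)


# Two spin-1/2 particles: c = (l1, l2, l1', l2')
def time_reversal(c):
    # T interchanges the initial and final states
    return (c[2], c[3], c[0], c[1])


def interchange(c):
    # interchange of the two identical particles
    return (c[1], c[0], c[3], c[2])


two_fermions = list(product((1, -1), repeat=4))
after_interchange = count_classes(two_fermions, [interchange])
after_parity = count_classes(two_fermions, [interchange, parity])
after_time = count_classes(two_fermions, [interchange, parity, time_reversal])


# Spin 0 on spin 1/2: only the fermion helicities (l2, l2') vary.
def time_reversal_single(c):
    return (c[1], c[0])


boson_fermion = list(product((1, -1), repeat=2))
spin0_counts = (
    len(boson_fermion),
    count_classes(boson_fermion, [parity]),
    count_classes(boson_fermion, [parity, time_reversal_single]),
)

print(after_interchange, after_parity, after_time, spin0_counts)
```

The three numbers for identical fermions follow the order of the argument in the text (interchange symmetry, then P, then T), and the tuple gives the successive counts for the spin-0 on spin-1/2 case.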
ELASTIC SCATTERING OF A PHOTON BY PARTICLES WITH SPIN 0 OR 1/2

The amplitude of this process is conveniently expressed by means of the space-like unit 4-vectors $e^{(1)}$, $e^{(2)}$ which satisfy the conditions

$$e^{(1)2} = e^{(2)2} = -1, \quad e^{(1)}e^{(2)} = 0, \quad e^{(1)}k = e^{(2)}k = 0, \quad e^{(1)}k' = e^{(2)}k' = 0; \tag{70.6}$$

for each of the two photons, these 4-vectors can be the unit 4-vectors by means of which an invariant description of their polarization properties is obtained (§8). Let $k$ and $k'$ be the initial and final 4-momenta of the photon; $p$ and $p'$ those of the scattering particle. The 4-vectors

$$P^\lambda = p^\lambda + p'^\lambda - K^\lambda\,\frac{pK + p'K}{K^2}, \qquad N^\lambda = e^{\lambda\mu\nu\rho}P_\mu q_\nu K_\rho, \tag{70.7}$$

where $K = k + k'$, $q = p - p' = k' - k$, are evidently orthogonal to one another and also to the 4-vectors $K$ and $q$, and therefore to $k$ and $k'$. Being orthogonal to the time-like 4-vector $K$ ($K^2 = 2kk' > 0$), they must themselves be space-like: in a frame of reference for which $\mathbf K = 0$, it follows from $KP = 0$ that $P_0 = 0$ and hence $P^2 < 0$. Normalizing $P$ and $N$ by putting

$$e^{(1)\lambda} = \frac{N^\lambda}{\sqrt{-N^2}}, \qquad e^{(2)\lambda} = \frac{P^\lambda}{\sqrt{-P^2}}, \tag{70.8}$$

we obtain a pair of 4-vectors which have all the required properties. It may be noted that $e^{(2)}$ is a true vector and $e^{(1)}$ a pseudovector.

The photon scattering amplitude may be written

$$M_{fi} = F_{\lambda\mu}\,e'^{\lambda*}e^\mu \tag{70.9}$$

in terms of the polarization 4-vectors $e$ and $e'$ of the initial and final photons.

The photon helicity has only two values, $\pm1$. Hence, for the scattering of a photon by a particle with spin zero, the number of independent helicity amplitudes is the same as for the mutual scattering of particles with spin 0 and 1/2, namely two. The tensor $F_{\lambda\mu}$ in (70.9) has to be constructed from the particle 4-momenta only. It can be written

$$F_{\lambda\mu} = f_1\,e^{(1)}_\lambda e^{(1)}_\mu + f_2\,e^{(2)}_\lambda e^{(2)}_\mu, \tag{70.10}$$

where $f_1$ and $f_2$ are invariant amplitudes. It should be noted that no term containing a product $e^{(1)}_\lambda e^{(2)}_\mu$ can appear in $F_{\lambda\mu}$, since this product is a pseudotensor and would give a pseudoscalar on substitution in (70.9).
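The construction (70.7), (70.8) can be checked numerically. The following is a minimal sketch, assuming a metric of signature $(+,-,-,-)$, a sign convention $e^{0123} = +1$ for the antisymmetric unit tensor (the overall sign of $N$ does not affect the checks), and arbitrarily chosen centre-of-mass kinematics; the function names and numerical values are illustrative, not taken from the text.

```python
import math
from itertools import permutations

METRIC = (1.0, -1.0, -1.0, -1.0)


def dot(a, b):
    """Minkowski product of two contravariant 4-vectors."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))


def lower(a):
    """Covariant components of a contravariant 4-vector."""
    return [g * x for g, x in zip(METRIC, a)]


def permutation_sign(perm):
    """Sign of a permutation of (0, 1, 2, 3), found by sorting with swaps."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign


def epsilon_contract(a, b, c):
    """N^lam = e^{lam mu nu rho} a_mu b_nu c_rho (assumed sign: e^{0123} = +1)."""
    al, bl, cl = lower(a), lower(b), lower(c)
    out = [0.0] * 4
    for perm in permutations(range(4)):
        lam, mu, nu, rho = perm
        out[lam] += permutation_sign(perm) * al[mu] * bl[nu] * cl[rho]
    return out


def polarization_basis(k, kp, p, pp):
    """Unit 4-vectors e1, e2 of (70.8) built from P and N of (70.7)."""
    K = [x + y for x, y in zip(k, kp)]
    q = [y - x for x, y in zip(k, kp)]  # q = k' - k
    c = (dot(p, K) + dot(pp, K)) / dot(K, K)
    P = [a + b - c * Kc for a, b, Kc in zip(p, pp, K)]
    N = epsilon_contract(P, q, K)
    e1 = [x / math.sqrt(-dot(N, N)) for x in N]
    e2 = [x / math.sqrt(-dot(P, P)) for x in P]
    return e1, e2


# Illustrative centre-of-mass kinematics: photon energy w, particle mass m.
m, w, theta = 1.0, 2.0, 0.7
E = math.sqrt(m * m + w * w)
k = [w, 0.0, 0.0, w]
p = [E, 0.0, 0.0, -w]
kp = [w, w * math.sin(theta), 0.0, w * math.cos(theta)]
pp = [E, -w * math.sin(theta), 0.0, -w * math.cos(theta)]

e1, e2 = polarization_basis(k, kp, p, pp)
checks = {
    "e1^2 + 1": dot(e1, e1) + 1,
    "e2^2 + 1": dot(e2, e2) + 1,
    "e1.e2": dot(e1, e2),
    "e1.k": dot(e1, k),
    "e1.k'": dot(e1, kp),
    "e2.k": dot(e2, k),
    "e2.k'": dot(e2, kp),
}
for name, value in checks.items():
    print(f"{name:10s} {value:+.2e}")
```

All printed quantities should vanish to rounding accuracy, which is the content of the conditions (70.6). The scattering angle must be non-zero, since for forward scattering $q = 0$ and the vector $N$ vanishes.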
Lastly, let us consider the scattering of a photon by a particle with spin 1/2. To find the number of independent helicity amplitudes, we note that the total number of elements of the matrix $S^J$ in this case is sixteen; the helicity of each of two initial and two final particles has two values. The condition of P invariance reduces this number to eight, and that of T invariance brings it down to six. Here, we write the tensor $F_{\lambda\mu}$ in the form

$$\begin{aligned}
F_{\lambda\mu} ={}& G_0\,(e^{(1)}_\lambda e^{(1)}_\mu + e^{(2)}_\lambda e^{(2)}_\mu) + G_1\,(e^{(1)}_\lambda e^{(2)}_\mu + e^{(2)}_\lambda e^{(1)}_\mu)\\
&+ G_2\,(e^{(1)}_\lambda e^{(2)}_\mu - e^{(2)}_\lambda e^{(1)}_\mu) + G_3\,(e^{(1)}_\lambda e^{(1)}_\mu - e^{(2)}_\lambda e^{(2)}_\mu),
\end{aligned} \tag{70.11}$$

where $G_0$ and $G_3$ are true scalars, $G_1$ and $G_2$ are pseudoscalars, and all four are bilinear in the bispinor fermion amplitudes $\bar u(p')$ and $u(p)$, being of the form

$$G_n = \bar u(p')\,Q_n\,u(p). \tag{70.12}$$

The general form of the matrices $Q_n$ (with respect to the bispinor indices) is

$$Q_0 = f_1 + f_2(\gamma K), \quad Q_1 = \gamma^5(f_3 + f_4(\gamma K)), \quad Q_2 = \gamma^5(f_5 + f_6(\gamma K)), \quad Q_3 = f_7 + f_8(\gamma K), \tag{70.13}$$

where $K = k + k'$. The coefficients $f_1, \ldots, f_8$ are invariant amplitudes, in this case eight in number (instead of the correct value of six), because the condition of T invariance has not yet been imposed.

Time reversal interchanges the initial and final 4-momenta of the particles, and also changes the sign of their space components:

$$(k_0, \mathbf k) \leftrightarrow (k_0', -\mathbf k'), \qquad (p_0, \mathbf p) \leftrightarrow (p_0', -\mathbf p'). \tag{70.14}$$

The photon polarization 4-vectors are transformed according to

$$(e_0, \mathbf e) \leftrightarrow (e_0'^*, -\mathbf e'^*) \tag{70.15}$$

(cf. (8.11a)); hence

$$(e_0'^*e_0,\; e_i'^*e_0,\; e_i'^*e_k) \to (e_0'^*e_0,\; -e_0'^*e_i,\; e_k'^*e_i).$$

By virtue of the last transformation, the condition of invariance of the scattering amplitude (70.9) is equivalent to

$$(F_{00}, F_{i0}, F_{ik}) \to (F_{00}, -F_{0i}, F_{ki}).$$

On the other hand, the changes (70.14) imply

$$(K_0, \mathbf K) \to (K_0, -\mathbf K), \quad (q_0, \mathbf q) \to (-q_0, \mathbf q), \quad (P_0, \mathbf P) \to (P_0, -\mathbf P), \quad (N_0, \mathbf N) \to (N_0, -\mathbf N),$$

so that

$$(e^{(1,2)}_0, \mathbf e^{(1,2)}) \to (e^{(1,2)}_0, -\mathbf e^{(1,2)}). \tag{70.16}$$

Hence, from (70.11), we must have

$$G_{0,1,3} \to G_{0,1,3}, \qquad G_2 \to -G_2.$$
Under time reversal,

$$\bar u'\gamma^5u \to -\bar u'\gamma^5u, \qquad \bar u'\gamma^5(\gamma K)u \to \bar u'\gamma^5(\gamma K)u,$$

as is evident from the transformation laws for pseudoscalar and pseudovector bilinear forms (28.6). From (70.12), (70.13) it is now evident that, because of the T invariance of the scattering amplitude,

$$f_3 = f_6 = 0. \tag{70.17}$$

§71. The unitarity condition

The scattering matrix must be unitary: $SS^+ = 1$, or in terms of matrix elements,

$$\sum_n S_{fn}S^*_{in} = \delta_{fi}, \tag{71.1}$$

where the suffix $n$ labels the possible intermediate states.† This is the most general property of the S-matrix, which ensures that the orthonormality of the states is preserved in the reaction; cf. QM, §§125 and 144. In particular, the diagonal elements of equation (71.1) simply express the fact that the sum of the transition probabilities from a given initial state to all final states is unity:

$$\sum_n |S_{ni}|^2 = 1.$$

† The actual meaning of $\delta_{fi}$ in (71.1) depends, of course, on the specific choice of quantum numbers and on the normalization of the wave functions of the system. It must be defined so that $\sum_f \delta_{fi} = 1$.

Substituting in (71.1) the matrix elements in the form (64.2), we obtain

$$T_{fi} - T^*_{if} = i(2\pi)^4\sum_n \delta^{(4)}(P_f - P_n)\,T_{fn}T^*_{in} = i(2\pi)^4\sum_n \delta^{(4)}(P_f - P_n)\,T^*_{nf}T_{ni}. \tag{71.2}$$

The two equivalent forms on the right are obtained by writing the unitarity condition respectively as $SS^+ = 1$ and $S^+S = 1$, with opposite orders of the factors $S$ and $S^+$.

It should be noticed that the left-hand side of the equation is linear in the matrix elements of $T$, but the right-hand side is quadratic. If the interaction contains a small parameter (e.g. the electromagnetic interaction), the left-hand side is therefore of the first order of smallness and the right-hand side is of the second order. The latter may consequently be neglected in a first approximation; then

$$T_{fi} = T^*_{if}, \tag{71.3}$$

i.e. the matrix $T$ is Hermitian.

In order to make the unitarity condition (71.2) more specific, we must understand precisely what is meant by the summation over $n$.
Let us do this for a two-particle collision, assuming that the conservation laws allow only elastic scattering. Then all the intermediate states in (71.2) are likewise "two-particle" states. Summation over these signifies integration over the intermediate momenta $\mathbf p_1''$, $\mathbf p_2''$, and summation over the spin quantum numbers (for example, the helicities) of the two particles, which we denote by $\lambda''$:

$$\sum_n \to \sum_{\lambda''}\int\frac{V^2\,d^3p_1''\,d^3p_2''}{(2\pi)^6}.$$

Eliminating the delta functions in the same way as in §64, we obtain the "two-particle" unitarity condition in the form

$$T_{fi} - T^*_{if} = \frac{iV^2}{(2\pi)^2}\,\frac{|\mathbf p|}{\varepsilon}\sum_{\lambda''}\int T_{fn}T^*_{in}\,\varepsilon_1''\varepsilon_2''\,do'',$$

where $|\mathbf p|$ is the momentum and $\varepsilon$ the total energy in the centre-of-mass system. The normalization volume does not appear after changing from the amplitudes $T_{fi}$ to $M_{fi}$ in accordance with (64.10):

$$M_{fi} - M^*_{if} = \frac{i|\mathbf p|}{(4\pi)^2\varepsilon}\sum_{\lambda''}\int M_{fn}M^*_{in}\,do''. \tag{71.4}$$

Let the elastic scattering amplitude be defined so that

$$d\sigma = |\langle\mathbf n'\lambda'|f|\mathbf n\lambda\rangle|^2\,do', \tag{71.5}$$

where $\mathbf n$ and $\mathbf n'$ are the directions of the initial and final momenta, $\lambda$ and $\lambda'$ the initial and final spin quantum numbers. Comparison with (64.19) shows that

$$\langle\mathbf n'\lambda'|f|\mathbf n\lambda\rangle = \frac{1}{8\pi\varepsilon}\,M_{fi}, \tag{71.6}$$

and the unitarity condition (71.4) becomes

$$\langle\mathbf n'\lambda'|f|\mathbf n\lambda\rangle - \langle\mathbf n\lambda|f|\mathbf n'\lambda'\rangle^* = \frac{i|\mathbf p|}{2\pi}\sum_{\lambda''}\int\langle\mathbf n'\lambda'|f|\mathbf n''\lambda''\rangle\langle\mathbf n\lambda|f|\mathbf n''\lambda''\rangle^*\,do'', \tag{71.7}$$

which generalizes the familiar formula of the non-relativistic theory, QM (125.8).

The "amplitude of zero-angle elastic scattering" is the diagonal matrix element $T_{ii}$, in which the final states of the particles are the same as their initial states.† For this amplitude the unitarity condition (71.2) becomes

$$2\,\mathrm{im}\,T_{ii} = (2\pi)^4\sum_n |T_{in}|^2\,\delta^{(4)}(P_i - P_n). \tag{71.8}$$

The right-hand side of this equation differs only by a factor from the total cross-section for all possible processes of scattering from the given initial state $i$. For let this cross-section be denoted by $\sigma_t$; then summation of the probability (64.5) over states $f$ and division by the flux density $j$ gives

$$\sigma_t = \frac{(2\pi)^4V}{j}\sum_n |T_{ni}|^2\,\delta^{(4)}(P_i - P_n),$$

whence $(2V/j)\,\mathrm{im}\,T_{ii} = \sigma_t$.
The normalization volume is eliminated by putting $T_{ii} = M_{ii}/(2\varepsilon_1V\cdot2\varepsilon_2V)$ (where $\varepsilon_1$ and $\varepsilon_2$ are the energies of the particles in the centre-of-mass system) and substituting $j$ from (64.17):

$$\mathrm{im}\,M_{ii} = 2|\mathbf p|\varepsilon\,\sigma_t. \tag{71.9}$$

This formula expresses the optical theorem. If the elastic scattering amplitude (71.6) is used, the theorem takes the customary form

$$\mathrm{im}\,\langle\mathbf n\lambda|f|\mathbf n\lambda\rangle = |\mathbf p|\,\sigma_t/4\pi; \tag{71.10}$$

cf. QM, (142.10).

† (Footnote to the definition of $T_{ii}$.) It must be stressed that the matrix elements of $T$ are concerned, not those of $S$; that is, the diagonal element is taken after subtracting the unit matrix from $S$.

If the S-matrix is given in the angular-momentum representation (partial amplitudes), it is diagonal with respect to $J$, and the unitarity condition can therefore be written separately for each value of $J$. For example, if only elastic scattering is possible, the unitarity condition is

$$\sum_{\lambda''}\langle\lambda'|S^J|\lambda''\rangle\langle\lambda|S^J|\lambda''\rangle^* = \delta_{\lambda\lambda'}. \tag{71.11}$$

Because of the T invariance, the elastic scattering matrix is symmetric (cf. (69.10)), and hence can be reduced to diagonal form. The unitarity condition then requires that the diagonal elements should be of unit modulus, and they are customarily written in the form

$$S^J_n = \exp(2i\delta_{Jn}), \tag{71.12}$$

where the $\delta_{Jn}$ are real constants, depending on the energy (the suffix $n$ labelling the diagonal elements for a given $J$). In the general case, when the number $N$ of independent amplitudes exceeds the order of the (square) matrix $S^J$, the coefficients of the transformation which diagonalizes $S^J$ depend on $J$ and $E$ (these coefficients then comprise not only the principal values of the matrix but also independent quantities equivalent to the original $N$ quantities). If, however, the number $N$ is equal to the order of the matrix $S^J$ (and therefore to the number of its principal values), the diagonalization coefficients are universal constants, and the diagonalizing states have definite parities (but not, of course, definite helicities).
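The statement that a symmetric unitary matrix is fixed by real phases and the coefficients of a real diagonalizing transformation can be illustrated for a $2\times2$ matrix. This is a minimal sketch under stated assumptions: the matrix is built as $S = O\,\mathrm{diag}(e^{2i\delta_1}, e^{2i\delta_2})\,\tilde O$ with a real rotation $O$; the mixing angle and the numerical phase values are hypothetical and serve only to exercise (71.11) and (71.12).

```python
import cmath
import math


def s_matrix(delta1, delta2, mixing):
    """Symmetric 2x2 matrix S = O diag(e^{2i d1}, e^{2i d2}) O^T, O a real rotation."""
    c, s = math.cos(mixing), math.sin(mixing)
    d1, d2 = cmath.exp(2j * delta1), cmath.exp(2j * delta2)
    return [
        [c * c * d1 + s * s * d2, c * s * (d1 - d2)],
        [c * s * (d1 - d2), s * s * d1 + c * c * d2],
    ]


def unitarity_defect(S):
    """Largest deviation of sum_n S_{fn} S*_{in} from delta_{fi}, cf. (71.11)."""
    worst = 0.0
    for f in range(2):
        for i in range(2):
            total = sum(S[f][n] * S[i][n].conjugate() for n in range(2))
            worst = max(worst, abs(total - (1.0 if f == i else 0.0)))
    return worst


# Hypothetical phases and mixing angle, chosen only for illustration.
S = s_matrix(delta1=0.3, delta2=1.1, mixing=0.4)
print("symmetric:", S[0][1] == S[1][0])
print("unitarity defect:", unitarity_defect(S))
```

Three real numbers (two phases and one angle) parametrize the matrix, which matches the count of independent real quantities in a symmetric unitary $2\times2$ matrix.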
The condition (71.11), expressed in terms of the partial amplitudes $\langle\lambda'|f^J|\lambda\rangle$, is

$$\langle\lambda'|f^J|\lambda\rangle - \langle\lambda|f^J|\lambda'\rangle^* = 2i|\mathbf p|\sum_{\lambda''}\langle\lambda'|f^J|\lambda''\rangle\langle\lambda|f^J|\lambda''\rangle^*, \tag{71.13}$$

as is easily seen by substituting the expansion (68.13) in (71.7) and using the orthonormality of the $D$ functions. If there is T invariance, the matrix $\langle\lambda'|f^J|\lambda\rangle$ is symmetric, and (71.13) becomes

$$\mathrm{im}\,\langle\lambda'|f^J|\lambda\rangle = |\mathbf p|\,\langle\lambda'|f^Jf^{J+}|\lambda\rangle. \tag{71.14}$$

If the matrix is diagonalized, the diagonal elements are

$$f^J_n = \frac{1}{2i|\mathbf p|}\left(e^{2i\delta_{Jn}} - 1\right) = \frac{1}{|\mathbf p|}\,e^{i\delta_{Jn}}\sin\delta_{Jn}. \tag{71.15}$$

Finally, we may mention some consequences which follow from the unitarity condition together with the requirement of CPT invariance. The latter shows that

$$T_{fi} = T_{\bar i\bar f}, \tag{71.16}$$

where $\bar i$ and $\bar f$ are states which differ from $i$ and $f$ in that all the particles are replaced by antiparticles (and helicities are reversed, and also angular momentum components if spherical waves are used). In particular, for the diagonal elements, $T_{ii} = T_{\bar i\bar i}$. It therefore follows from (71.8) and (71.9) that the total cross-section for all possible processes (with a given initial state) is the same for reactions of particles and of antiparticles. In particular, the total disintegration probabilities (i.e. the lifetimes) of the particle and the antiparticle are equal. These results, together with the equality of particle and antiparticle masses (§11), are most important consequences of the CPT invariance of the interactions. A similar statement for each possible disintegration channel separately would require CP invariance also (see the end of §69).

PROBLEM

From the unitarity condition, find the relation between the phases of the partial amplitudes for photoproduction of pions from nucleons ($\gamma + N \to \pi + N$) and elastic scattering of pions by nucleons ($\pi + N \to \pi + N$), using the fact that $\pi N$ scattering depends on strong interactions but photoproduction and $\gamma N$ scattering depend on an electromagnetic interaction.

SOLUTION.
Let the partial amplitudes be denoted by

$$\langle\pi N|S|\gamma N\rangle = S_{\pi\gamma}, \quad \langle\gamma N|S|\gamma N\rangle = S_{\gamma\gamma}, \quad \langle\pi N|S|\pi N\rangle = S_{\pi\pi},$$

the suffix $J$ and the helicity suffixes being omitted. Photoproduction is a first-order process with respect to the charge $e$, and $\gamma N$ scattering a second-order process; hence $S_{\pi\gamma} \sim e$, $S_{\gamma\gamma} - 1 \sim e^2$. The amplitude $S_{\pi\pi}$ is not small. The conditions (71.1) give, as far as terms in $e$,

$$S_{\pi\gamma} + S_{\pi\pi}S^+_{\gamma\pi} = 0, \tag{1}$$

$$S_{\pi\pi}S^+_{\pi\pi} = 1; \tag{2}$$

on the right-hand side of (2), 1 denotes a unit matrix in the spin variables. Because of T invariance the matrix $S_{\pi\pi}$ is symmetric, and $S_{\gamma\pi} = \tilde S_{\pi\gamma}$ (the tilde denoting the transposed matrix). Let us take the matrix $S_{\pi\pi}$ in diagonal form, i.e. with respect to pion states having definite parities; then it follows from (2) that the diagonal elements have the form $e^{2i\delta_{\pi\pi}}$ with various constants $\delta_{\pi\pi}$. Then (1) gives for each element of the matrix $S_{\pi\gamma}$

$$S_{\pi\gamma}/S^*_{\pi\gamma} = -e^{2i\delta_{\pi\pi}},$$

whence

$$S_{\pi\gamma} = \pm i\,|S_{\pi\gamma}|\,e^{i\delta_{\pi\pi}}.$$

Thus the phase of the partial amplitude for photoproduction (in a state having a definite parity) is determined by the phase of elastic $\pi N$ scattering.

CHAPTER VIII

INVARIANT PERTURBATION THEORY

§72. The chronological product

The probabilities of various processes in collisions between particles whose interaction may be regarded as small are calculated by means of perturbation theory. In its ordinary form (in non-relativistic quantum mechanics), however, the formalism of this theory has the defect of not exhibiting explicitly the conditions of relativistic invariance. Although, when this formalism is applied to relativistic problems, the final result will satisfy these conditions, the calculations are considerably complicated by the non-invariant form of the intermediate expressions. The present chapter will deal with the development of a consistent relativistic perturbation theory free from this defect, first established by R. P. Feynman (1948-1949).

With a view to a second-quantization description of the system, let $\Phi$ denote its wave function in the "space" of occupation numbers for the various states of free particles.
The Hamiltonian of the system is $H = H_0 + V$, where $V$ is the interaction operator. Let $\Phi_n$ be the eigenfunctions of the unperturbed Hamiltonian, each corresponding to certain definite values of all the occupation numbers. Any function $\Phi$ can be expanded as $\Phi = \sum_n C_n\Phi_n$. Then the exact wave equation

$$i\,\partial\Phi/\partial t = (H_0 + V)\Phi \tag{72.1}$$

becomes a set of equations for the coefficients $C_n$:

$$i\dot C_n = \sum_m V_{nm}\,e^{i(E_n - E_m)t}\,C_m, \tag{72.2}$$

where $V_{nm}$ are the time-independent matrix elements of the operator $V$, and $E_n$ the energy levels of the unperturbed system (cf. QM, §40). By definition, the operator $V$ does not depend explicitly on the time. The quantities

$$V_{nm}(t) = V_{nm}\,e^{i(E_n - E_m)t}, \tag{72.3}$$

on the other hand, may be regarded as matrix elements of the time-dependent operator

$$V(t) = e^{iH_0t}\,V\,e^{-iH_0t}. \tag{72.4}$$

This is said to be an operator in the interaction representation, as opposed to the original time-independent Schrödinger operator $V$.† Now denoting the wave function in this new representation by the same letter $\Phi$, we can write equations (72.2) symbolically as

$$i\dot\Phi = V(t)\Phi. \tag{72.5}$$

The change in the wave function in this representation is due entirely to the action of the perturbation, i.e. it corresponds to processes which result from the interaction of the particles.

If $\Phi(t)$ and $\Phi(t + \delta t)$ are the values of $\Phi$ at two successive instants, (72.5) shows that, to the first order in $\delta t$,

$$\Phi(t + \delta t) = [1 - i\,\delta t\cdot V(t)]\,\Phi(t) = \exp\{-i\,\delta t\cdot V(t)\}\,\Phi(t).$$

Accordingly the value of $\Phi$ at any instant $t_f$ can be expressed in terms of its value at some initial instant $t_i$ ($< t_f$) by

$$\Phi(t_f) = \Big(\prod_{t_i}^{t_f}\exp\{-i\,\delta t_\alpha\cdot V(t_\alpha)\}\Big)\Phi(t_i), \tag{72.6}$$

where the product $\prod$ is the limit of the product over all the infinitesimal intervals $\delta t_\alpha$ between $t_i$ and $t_f$. If $V(t)$ were an ordinary function, this limit would reduce simply to

$$\exp\Big(-i\int_{t_i}^{t_f}V(t)\,dt\Big),$$

but this result depends on the commutativity of the factors pertaining to different instants, which is assumed in changing from the product in (72.6) to the summation in the exponent.
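The role of commutativity in passing from the product (72.6) to a single exponential can be seen in a small numerical sketch. The example is an assumption made purely for illustration and is not taken from the text: a two-level "interaction" $V(t) = \sigma_x\cos t + \sigma_z\sin t$, which is Hermitian, satisfies $V(t)^2 = 1$, and does not commute with itself at different times. The ordered product of the factors $\exp\{-i\,\delta t\,V(t_\alpha)\}$ is compared with the exponential of the ordinary integral.

```python
import math


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def v(t):
    """Illustrative Hermitian V(t) = sigma_x cos t + sigma_z sin t, with V(t)^2 = 1."""
    return [[math.sin(t), math.cos(t)], [math.cos(t), -math.sin(t)]]


def exp_minus_i(h, scale):
    """exp(-i * scale * h) for a 2x2 matrix h that squares to the unit matrix."""
    c, s = math.cos(scale), math.sin(scale)
    return [[c * (i == j) - 1j * s * h[i][j] for j in range(2)] for i in range(2)]


def ordered_product(t_i, t_f, steps):
    """Product of exp(-i dt V(t_a)) with later instants standing to the left."""
    dt = (t_f - t_i) / steps
    u = [[1, 0], [0, 1]]
    for n in range(steps):
        t = t_i + (n + 0.5) * dt
        u = matmul(exp_minus_i(v(t), dt), u)
    return u


def naive_exponential(t_i, t_f):
    """exp(-i * integral of V dt), as if V(t) were an ordinary function."""
    ax = math.sin(t_f) - math.sin(t_i)   # coefficient of sigma_x in the integral
    az = math.cos(t_i) - math.cos(t_f)   # coefficient of sigma_z in the integral
    r = math.hypot(ax, az)
    unit = [[az / r, ax / r], [ax / r, -az / r]]
    return exp_minus_i(unit, r)


def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


def unitarity_defect(u):
    """Largest deviation of u u^+ from the unit matrix."""
    worst = 0.0
    for i in range(2):
        for j in range(2):
            total = sum(u[i][k] * u[j][k].conjugate() for k in range(2))
            worst = max(worst, abs(total - (1.0 if i == j else 0.0)))
    return worst


U = ordered_product(0.0, 2.0, 2000)
print("unitarity defect of ordered product:", unitarity_defect(U))
print("difference from naive exponential:  ", max_abs_diff(U, naive_exponential(0.0, 2.0)))
```

The ordered product is unitary to rounding accuracy, while its difference from the naive exponential is of order unity for this choice of $V(t)$ and time interval; for commuting factors the two would coincide.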
For the operator $V(t)$ there is no such commutativity, and the reduction to an ordinary integral is not possible. We can write (72.6) in the symbolic form

$$\Phi(t_f) = T\exp\Big\{-i\int_{t_i}^{t_f}V(t)\,dt\Big\}\,\Phi(t_i), \tag{72.7}$$

where $T$ is the chronological operator, implying a certain "chronological" sequence of time instants in the successive factors of the product (72.6). In particular, putting $t_i \to -\infty$, $t_f \to +\infty$, we have

$$\Phi(+\infty) = S\,\Phi(-\infty), \tag{72.8}$$

where

$$S = T\exp\Big\{-i\int_{-\infty}^{\infty}V(t)\,dt\Big\}. \tag{72.9}$$

† (Footnote to (72.4).) It must be emphasized that the definition (72.4) makes use of the unperturbed Hamiltonian $H_0$. In this it differs from the Heisenberg representation of operators, where $V_H(t) = e^{iHt}\,V\,e^{-iHt}$; see QM, §13.

The significance of writing the formally exact solution of the wave equation in the form (72.7)-(72.9) is that it easily leads to the series in powers of the perturbation

$$S = \sum_{k=0}^{\infty}\frac{(-i)^k}{k!}\int_{-\infty}^{\infty}dt_1\int_{-\infty}^{\infty}dt_2\ldots\int_{-\infty}^{\infty}dt_k\cdot T\{V(t_1)V(t_2)\ldots V(t_k)\}. \tag{72.10}$$

Here, in each term, the $k$th power of the integral is written as a $k$-fold integral, and the symbol $T$ signifies that in each range of values of the variables $t_1, t_2, \ldots, t_k$ the corresponding operators must be put in chronological order, with the value of $t$ increasing from right to left.†

It is evident from the definition (72.8) that, if the system was in a state $\Phi_i$ (an assembly of free particles) before the collision, the probability amplitude for a transition to a state $\Phi_f$ (another assembly of free particles) is the matrix element $S_{fi}$. Thus these elements form the S-matrix.

The electromagnetic interaction operator has already been given in §43:

$$V = e\int(\hat j\hat A)\,d^3x. \tag{72.11}$$

Substitution of this in (72.9) gives

$$S = T\exp\Big\{-ie\int(\hat j\hat A)\,d^4x\Big\}. \tag{72.12}$$

It is important to note that the operator (72.12) is relativistically invariant. This is seen from the facts that the integrand is a scalar, the integration over $d^4x$ is invariant, and the time-ordering operation is invariant. The last point, however, needs further explanation.
† (Footnote to (72.10).) The derivation of the rules of relativistic perturbation theory by means of the expansion (72.10) is due to F. J. Dyson (1949).

The order of two time instants $t_1$ and $t_2$, i.e. the sign of $t_2 - t_1$, is independent of the frame of reference chosen if these instants relate to world points $x_1$ and $x_2$ separated by a time-like interval: $(x_2 - x_1)^2 > 0$. In such a case the invariance of time-ordering necessarily follows. But if $(x_2 - x_1)^2 < 0$ (a space-like interval), we may have both $t_2 > t_1$ and $t_2 < t_1$ in different frames of reference.† Now two such points correspond to events between which there can be no causal connection. It is therefore evident that the operators of two physical quantities relating to such points must commute, since the non-commutativity of operators signifies, physically, that the corresponding quantities cannot be measured simultaneously, and this presupposes a physical connection between the two measurements. Thus the time-ordering of the product remains invariant in this case also: though a Lorentz transformation may reverse the sequence of time instants, the factors commute and can therefore be restored to their chronological order.‡

It is easy to see that the definition of the S-matrix given in this section necessarily satisfies the unitarity condition. Writing $S$ as the chronological product in (72.6) and using the fact that $V$ is Hermitian, we find that $S^+$ is given by the product of similar factors, $\exp[i\,\delta t_\alpha\cdot V(t_\alpha)]$ (with the opposite sign of the exponent), in the reverse of the chronological order. Thus all the factors cancel in pairs when $S$ is multiplied by $S^+$.

It should be noted that the unitarity of the operator $S$ is ensured in this case because the Hamiltonian is Hermitian. The unitarity condition is actually more general than the assumptions on which the theory given here is based.
It must be satisfied even in a quantum-mechanical description which makes no use of the concepts of the Hamiltonian and the wave functions.

† Instead of using the terms "time-like" and "space-like", we often refer briefly to regions respectively inside and outside the light cone: all points $x$ separated from a point $x'$ by an interval such that $(x - x')^2 > 0$ lie within a double cone having its vertex at $x'$; points for which $(x - x')^2 < 0$ lie outside this cone.

‡ This statement needs refinement to avoid misunderstanding in its application to the product $V(t_1)V(t_2)\ldots$. Since the operator $V$ itself is not gauge-invariant (it varies with $\hat A$), the factors $V(t_1), V(t_2), \ldots$, though commuting in one gauge of the potential, may be non-commutative in some other gauge. The statements made above must therefore be formulated as asserting the possibility of choosing a gauge for the potential in which $V(t_1)$ and $V(t_2)$ commute outside the light cone. This reservation clearly has no effect on the invariance of the S-matrix: the scattering amplitudes, which are actual physical quantities, cannot depend on the gauge of the potential, a result which formally follows from the gauge invariance of the action integral (§43).

§73. Feynman diagrams for electron scattering

We shall show by means of specific examples how the scattering matrix elements are calculated. These examples will facilitate the subsequent formulation of the general rules of invariant perturbation theory.

The current operator $\hat j$ contains the product of two electron $\psi$-operators. Hence processes might occur in the first order of perturbation theory which involve (in the initial and final states) only three particles: two electrons (the operator $\hat j$) and one photon (the operator $\hat A$). It is easily seen, however, that such processes cannot occur between free particles, being forbidden by the laws of conservation of energy and momentum. If $p_1$ and $p_2$ are the 4-momenta of the electrons, and $k$ that of the photon, the conservation of 4-momentum would be represented by $k = p_2 - p_1$ or $k = p_2 + p_1$. But such equations are impossible, since for a photon $k^2 = 0$, whereas the square $(p_2 \pm p_1)^2$ is certainly not zero: if we calculate this invariant in the rest frame of one of the electrons, we have

$$(p_2 \pm p_1)^2 = 2(m^2 \pm p_1p_2) = 2(m^2 \pm \varepsilon_1\varepsilon_2 \mp \mathbf p_1\cdot\mathbf p_2) = 2m(m \pm \varepsilon_2),$$

and, since $\varepsilon_2 > m$, it follows that

$$(p_2 + p_1)^2 > 0, \qquad (p_2 - p_1)^2 < 0. \tag{73.1}$$

Thus the first non-vanishing (non-diagonal) elements of the S-matrix can appear only in the second order of perturbation theory. All the relevant processes are comprised in the second-order operator obtained by expanding the expression (72.12):

$$S^{(2)} = -\frac{e^2}{2!}\iint d^4x\,d^4x'\cdot T\big(\hat j^\mu(x)\hat A_\mu(x)\,\hat j^\nu(x')\hat A_\nu(x')\big).$$

Since the electron and photon operators commute, the T product can be resolved into two:

$$S^{(2)} = -\frac{e^2}{2}\iint d^4x\,d^4x'\;T\big(\hat j^\mu(x)\hat j^\nu(x')\big)\,T\big(\hat A_\mu(x)\hat A_\nu(x')\big). \tag{73.2}$$

As a first example, let us consider elastic scattering of two electrons. In the initial state there are two electrons with 4-momenta $p_1$ and $p_2$, in the final state two electrons with other 4-momenta $p_3$ and $p_4$. It is also assumed that all the electrons are in definite spin states; the spin variable indices will be everywhere omitted, for brevity.

Since there are no photons in either state, the required matrix element of the T product of the photon operators is the diagonal element $\langle0|\ldots|0\rangle$, where $|0\rangle$ denotes the photon vacuum state. This value of the T product averaged over the vacuum is (for each pair of indices $\mu$, $\nu$) a definite function of the coordinates of the two points $x$ and $x'$. Since 4-space is homogeneous, the coordinates can appear only as the difference $x - x'$. The tensor

$$D_{\mu\nu}(x - x') = i\langle0|T\hat A_\mu(x)\hat A_\nu(x')|0\rangle \tag{73.3}$$

is called the photon propagation function or photon propagator. It will be calculated in §76.
For the T product of the electron operators, we have to calculate the matrix element

$$\langle34|T\hat j^\mu(x)\hat j^\nu(x')|12\rangle, \tag{73.4}$$

where the symbols $|12\rangle$, $|34\rangle$ denote states in which pairs of electrons have the corresponding momenta. This element also can be represented as a vacuum expectation value, by using the obvious relation

$$\langle2|F|1\rangle = \langle0|a_2Fa_1^+|0\rangle,$$

where $F$ is any operator, $a_1^+$ the creation operator for the first electron and $a_2$ the annihilation operator for the second electron. Hence, instead of (73.4), we can calculate the quantity

$$\langle0|a_3a_4\,T\big(\hat j^\mu(x)\hat j^\nu(x')\big)\,a_2^+a_1^+|0\rangle, \tag{73.5}$$

the indices 1, 2, ... being abbreviations for $p_1, p_2, \ldots$.

Each of the two current operators is a product, $\hat j = \hat{\bar\psi}\gamma\hat\psi$, and each of the $\psi$-operators is a sum:

$$\hat\psi = \sum_p(a_p\psi_p + b_p^+\psi_{-p}), \qquad \hat{\bar\psi} = \sum_p(a_p^+\bar\psi_p + b_p\bar\psi_{-p}); \tag{73.6}$$

the second term in each expression contains the positron operators, which in the present case "do not act". Hence the product $\hat j^\mu(x)\hat j^\nu(x')$ is a sum of terms, each containing the product of two operators $a_p$ and two $a_p^+$. These operators must annihilate electrons 1 and 2, and create electrons 3 and 4. They must therefore be the operators $a_1$, $a_2$, $a_3^+$, $a_4^+$, which are said to contract with the "external" operators $a_1^+$, $a_2^+$, $a_3$, $a_4$ in (73.5) and cancel according to the equations

$$\langle0|a_pa_p^+|0\rangle = 1. \tag{73.7}$$

Four terms result, according to the $\psi$-operators from which $a_1$, $a_2$, $a_3^+$, $a_4^+$ are taken. Each has the form

$$a_3a_4\,(\bar\psi\gamma^\mu\psi)(\bar\psi'\gamma^\nu\psi')\,a_2^+a_1^+, \tag{73.8}$$

where $\psi = \psi(x)$, $\psi' = \psi(x')$, and the four terms differ in the contractions, i.e. in the choice of the operators from which a pair $a$, $a^+$ is taken for the cancellation according to (73.7):

(1) $\bar\psi$ with $a_4$, $\psi$ with $a_2^+$, $\bar\psi'$ with $a_3$, $\psi'$ with $a_1^+$;
(2) $\bar\psi$ with $a_3$, $\psi$ with $a_1^+$, $\bar\psi'$ with $a_4$, $\psi'$ with $a_2^+$;
(3) $\bar\psi$ with $a_4$, $\psi$ with $a_1^+$, $\bar\psi'$ with $a_3$, $\psi'$ with $a_2^+$;
(4) $\bar\psi$ with $a_3$, $\psi$ with $a_2^+$, $\bar\psi'$ with $a_4$, $\psi'$ with $a_1^+$.

In each term we can bring the conjugate operators together in pairs ($a_1a_1^+$, etc.) by successive interchanges of $a_1, a_2, \ldots$, and the mean value of their product is then equal to the product of the mean values (73.7).
Since all these operators anticommute (1, 2, 3, 4 being different states),† we find that the matrix element (73.4) is

$$\begin{aligned}
\langle34|T\hat j^\mu(x)\hat j^\nu(x')|12\rangle ={}& (\bar\psi_4\gamma^\mu\psi_2)(\bar\psi_3'\gamma^\nu\psi_1') + (\bar\psi_3\gamma^\mu\psi_1)(\bar\psi_4'\gamma^\nu\psi_2')\\
&- (\bar\psi_3\gamma^\mu\psi_2)(\bar\psi_4'\gamma^\nu\psi_1') - (\bar\psi_4\gamma^\mu\psi_1)(\bar\psi_3'\gamma^\nu\psi_2').
\end{aligned} \tag{73.9}$$

† Because of this anticommutativity, the operators $\hat j(x)$ and $\hat j(x')$ may here be considered to commute (in the calculation of the matrix element), and the T product symbol may therefore be omitted.

The sign of the entire sum is arbitrary, and depends on the order of the "external" electron operators in (73.5). This is in accordance with the fact that the sign of the matrix element for scattering of identical fermions is itself arbitrary. The relative sign of the various terms in (73.9), of course, does not depend on the order of the external operators.

The two terms in each line of (73.9) differ only by a simultaneous interchange of the indices $\mu$, $\nu$ and the arguments $x$, $x'$. This interchange clearly does not affect the matrix element (73.3), in which the order of factors is still established by the symbol $T$. Hence when (73.3) is multiplied by (73.9) and integrated over $d^4x\,d^4x'$, the four terms in (73.9) give two pairs of equal results, and the matrix element is therefore

$$S_{fi} = ie^2\iint d^4x\,d^4x'\,D_{\mu\nu}(x - x')\big\{(\bar\psi_4\gamma^\mu\psi_2)(\bar\psi_3'\gamma^\nu\psi_1') - (\bar\psi_4\gamma^\mu\psi_1)(\bar\psi_3'\gamma^\nu\psi_2')\big\}; \tag{73.10}$$

the factor $\tfrac12$ has now disappeared.

The electron wave functions are the plane waves (64.8). The expression in the braces is therefore

$$\begin{aligned}
\{\ldots\} ={}& (\bar u_4\gamma^\mu u_2)(\bar u_3\gamma^\nu u_1)\,e^{-i(p_2-p_4)x - i(p_1-p_3)x'} - (\bar u_4\gamma^\mu u_1)(\bar u_3\gamma^\nu u_2)\,e^{-i(p_1-p_4)x - i(p_2-p_3)x'}\\
={}& \big\{(\bar u_4\gamma^\mu u_2)(\bar u_3\gamma^\nu u_1)\,e^{-i[(p_2-p_4)+(p_3-p_1)]\xi/2} - (\bar u_4\gamma^\mu u_1)(\bar u_3\gamma^\nu u_2)\,e^{-i[(p_1-p_4)+(p_3-p_2)]\xi/2}\big\}\,e^{-i(p_1+p_2-p_3-p_4)X},
\end{aligned}$$

where $X = \tfrac12(x + x')$, $\xi = x - x'$. The integration over $d^4x\,d^4x'$ is replaced by one over $d^4\xi\,d^4X$. The integral over $d^4X$ gives a delta function, so that $p_1 + p_2 = p_3 + p_4$. Then, changing from the matrix $S$ to the matrix $M$ (§64), we have finally for the scattering amplitude

$$M_{fi} = e^2\big\{(\bar u_4\gamma^\mu u_2)D_{\mu\nu}(p_4 - p_2)(\bar u_3\gamma^\nu u_1) - (\bar u_4\gamma^\mu u_1)D_{\mu\nu}(p_4 - p_1)(\bar u_3\gamma^\nu u_2)\big\}. \tag{73.11}$$

Here we have used the photon propagation function in the momentum representation:

$$D_{\mu\nu}(k) = \int D_{\mu\nu}(\xi)\,e^{ik\xi}\,d^4\xi. \tag{73.12}$$

Each of the two terms in the amplitude (73.11) can be symbolically represented by means of a Feynman diagram: the first term by

$$e^2(\bar u_4\gamma^\mu u_2)D_{\mu\nu}(k)(\bar u_3\gamma^\nu u_1) = \text{[diagram]} \tag{73.13}$$

[Diagram (73.13): continuous electron lines $p_2 \to p_4$ and $p_1 \to p_3$, whose vertices are joined by a broken photon line $k$.]

Each point of intersection of lines (a vertex of the diagram) has a corresponding factor $\gamma$. The "incoming" continuous lines towards a vertex represent the initial electrons, which are associated with the factors $u$, the bispinor amplitudes of the corresponding electron states. The "outgoing" continuous lines leaving a vertex are the final electrons, and correspond to the factors $\bar u$. When the diagram is "read", these factors are written from left to right in the order of movement along the continuous lines against the direction of the arrows. The two vertices are joined by a broken line which represents a virtual (intermediate) photon "emitted" at one vertex and "absorbed" at the other, and corresponds to the factor $-iD_{\mu\nu}(k)$. The 4-momentum of the virtual photon $k$ is determined by the "conservation of 4-momentum" at the vertex: the total momenta of the incoming and outgoing lines are equal. In this case $k = p_1 - p_3 = p_4 - p_2$. As well as the factors mentioned, the whole diagram is also assigned a factor $(-ie)^2$ (the exponent being the number of vertices in the diagram), and then represents a term in $iM_{fi}$.

Similarly, the second term in (73.11) is represented by the diagram

$$-e^2(\bar u_4\gamma^\mu u_1)D_{\mu\nu}(k')(\bar u_3\gamma^\nu u_2) = \text{[diagram]} \tag{73.14}$$

[Diagram (73.14): continuous electron lines $p_1 \to p_4$ and $p_2 \to p_3$, whose vertices are joined by a broken photon line $k'$.]

with $k' = p_1 - p_4 = p_3 - p_2$.

It does not matter whether the diagram is read from the end of $p_3$ or from that of $p_4$. The resulting expressions are equal, because the tensor $D_{\mu\nu}$ is symmetrical. The choice of direction for the virtual photon line is also immaterial: a change in its direction simply reverses the sign of $k$, which does not matter, since the functions $D_{\mu\nu}(k)$ are even (see §76).
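The relative signs of the four terms in (73.9) follow purely from the anticommutation of the electron operators, and can be checked mechanically. The sketch below is an illustration under stated assumptions, not the method of the text: it represents the operators $a_p$, $a_p^+$ on occupation-number states with the usual sign factor $(-1)^{\text{number of occupied states preceding } p}$, and evaluates the vacuum expectation value (73.5) with the external operators in the order $a_3a_4\ldots a_2^+a_1^+$. The function name and the encoding of the terms are hypothetical.

```python
def vacuum_expectation(ops):
    """<0| ops[0] ops[1] ... ops[-1] |0> for fermion operators.

    Each operator is a pair (kind, mode) with kind '+' (creation) or
    '-' (annihilation). Operators act on the vacuum from right to left.
    """
    coeff = 1
    occupied = frozenset()
    for kind, mode in reversed(ops):
        # sign from moving the operator past the occupied states preceding `mode`
        sign = (-1) ** sum(1 for m in occupied if m < mode)
        if kind == '+':
            if mode in occupied:
                return 0
            occupied = occupied | {mode}
        else:
            if mode not in occupied:
                return 0
            occupied = occupied - {mode}
        coeff *= sign
    return coeff if not occupied else 0


def term_sign(i, j, k, l):
    """Sign of the term (psibar_i gamma psi_j)(psibar'_k gamma psi'_l) in (73.9)."""
    ops = [('-', 3), ('-', 4),       # external final-state operators a_3 a_4
           ('+', i), ('-', j),       # a_i^+ from psibar(x), a_j from psi(x)
           ('+', k), ('-', l),       # a_k^+ from psibar(x'), a_l from psi(x')
           ('+', 2), ('+', 1)]       # external initial-state operators a_2^+ a_1^+
    return vacuum_expectation(ops)


terms = [(4, 2, 3, 1), (3, 1, 4, 2), (3, 2, 4, 1), (4, 1, 3, 2)]
signs = [term_sign(*t) for t in terms]
print(signs)
```

The terms are listed in the order in which they appear in (73.9). Reversing the order of the two external creation operators changes every sign at once, in line with the remark that only the relative signs are meaningful.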
The lines corresponding to the initial and final particles are called the external lines or free ends of the diagram. The diagrams (73.13) and (73.14) differ by the interchange of two electron free ends (p3 and p4). This interchange of two fermions reverses the sign of the diagram, in accordance with the fact that the two terms appear with opposite signs in the amplitude (73.11). We shall everywhere use Feynman diagrams in this momentum representation, but they can also be associated with the terms in the scattering amplitude in the original coordinate representation (the integrals (73.10)). Here the electron amplitudes are replaced by the corresponding coordinate wave functions, and the propagators are in the coordinate representation. Each vertex corresponds to one of the variables of integration (x or x' in (73.10)); the factors assigned to the lines that meet at a vertex are taken as functions of the corresponding variable. Let us now consider the mutual scattering of an electron and a positron; their initial momenta will be denoted by p . and p+ respectively, and their final momenta by p . and p.+. The positron creation and annihilation operators appear in the (//-operators (73.6) together with the electron annihilation and creation operators respectively. Whereas in the previous case the operator tp annihilated the two initial particles and (// created the two final particles, here these operators act oppositely with §73 291 Feynman Diagrams for Electron Scattering regard to electrons and positrons- The conjugate function i/>(-p+) will therefore now describe the initial positron, and i/>(-p+) the final positron, both being functions of the 4-momentum with reversed sign. Taking account of this difference, we obtain the scattering amplitudet Mfi = -e?(fi(pl)y^(pO)D^(p--pL)(fi(-p + )7^(-p;))^ + e 2 (ö(-p + )7^(p-))D^(p- + p+)(ö(p'-)7^(-p'+)). 
(73.15)

The two terms in this expression are represented by the following diagrams:

[Feynman diagrams: a "scattering" type diagram with photon momentum $k = p_- - p'_-$ and an "annihilation" type diagram with photon momentum $k' = p_- + p_+$] (73.16)

The rules for constructing the diagrams are altered only as regards the positrons. The incoming and outgoing continuous lines are again associated with factors $u$ and $\bar u$ respectively. Now, however, the incoming lines correspond to final positrons and the outgoing lines to initial positrons, the momenta of all the positrons being taken with reversed sign.

The difference between the two diagrams (73.16) should be noted. In the first diagram, lines of the initial and final electrons meet at one vertex, and those of the two positrons at the other. In the second diagram, initial electron and positron lines meet at one vertex, and final lines at the other. The upper vertex represents annihilation of a pair with emission of a virtual photon; the lower vertex represents the creation of a pair from this photon. This difference affects the properties of the virtual photons in the two diagrams. In the first diagram ("scattering" type), the 4-momentum of the virtual photon is the difference between those of the two electrons (or positrons); hence $k^2 < 0$ (cf. (73.1)). In the second diagram ("annihilation" type), $k' = p_- + p_+$, and hence $k'^2 > 0$. Here it should be noted that for a virtual photon we always have $k^2 \neq 0$, unlike a real photon, for which $k^2 = 0$.

If the colliding particles are not identical and also not a particle and its antiparticle (for instance, an electron and a muon), then the scattering amplitude is represented by a single diagram:

[Feynman diagram: an electron line $p^{(e)}$ and a muon line $p^{(\mu)}$ joined by a virtual photon line] (73.17)

† The sign of the whole amplitude is definite in the scattering of non-identical particles, being determined by the fact that in (73.5) the "external" operators must be arranged so that the two electron operators are both at the ends, $\langle 0|\hat a\,\hat b \ldots \hat b^+\hat a^+|0\rangle$ (or both in the middle); this condition ensures the "same sign" of the initial and final vacuum states.
The sign of the amplitude can also be verified from the non-relativistic limit: we shall see later (§81) that, in this limit, the second term in (73.15) tends to zero, and the first term tends to the Born amplitude of Rutherford scattering.

There can be no annihilation or exchange type diagram in this case. The same result can be obtained analytically by writing the current operator as the sum of electron and muon currents,

$$\hat j = \hat j^{(e)} + \hat j^{(\mu)},$$

and taking, in the product $\hat j^\mu(x)\hat j^\nu(x')$, the matrix elements of terms which give the required annihilations and creations of particles.

Let us now consider first-order processes, which, as mentioned at the beginning of this section, are forbidden by the conservation of 4-momentum. The matrix elements of the operator

$$\hat S^{(1)} = -ie\int \hat j(x)\hat A(x)\,d^4x \qquad (73.18)$$

for such transitions correspond to the creation or annihilation of three real particles (two electrons and one photon) at "the same point $x$". They occur by the contraction of the operators $\hat\psi(x)$ and $\hat{\bar\psi}(x)$ at the same point, and are expressed, for example in the case of photon emission, by integrals of the form

$$S_{fi} = -ie\int \bar\psi_2(x)\big(\gamma A^*(x)\big)\psi_1(x)\,d^4x,$$

which vanish because the integrand includes the factor $\exp[-i(p_1 - p_2 - k)x]$ with a non-zero exponent. In the language of Feynman diagrams, this means that diagrams with three free ends such as

[Feynman diagram: a single vertex with two electron free ends and one photon free end] (73.19)

are zero.

For the same reason, second-order processes involving six particles in the initial state (or in the final state) are impossible. In the matrix element $S_{fi}$ for such a transition, the integral over $d^4x\,d^4x'$ would separate into a product of two vanishing integrals over $d^4x$ and $d^4x'$ of products of three wave functions taken at the same point. In other words, the corresponding diagram would separate into two independent diagrams of the type (73.19).

§ 74. Feynman diagrams for photon scattering

Let us now consider another second-order effect: the scattering of a photon by an electron (the Compton effect).
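In the initial state let the photon and the electron have 4-momenta $k_1$ and $p_1$, and in the final state $k_2$ and $p_2$ (and also definite polarizations, which will be omitted for brevity). The photon matrix element is

$$\langle 2|T\hat A_\mu(x)\hat A_\nu(x')|1\rangle = \langle 0|\hat c_2\,T\hat A_\mu(x)\hat A_\nu(x')\,\hat c_1^+|0\rangle, \qquad (74.1)$$

where

$$\hat A = \sum_{\mathbf k}\big(\hat c_{\mathbf k}A_{\mathbf k} + \hat c^+_{\mathbf k}A^*_{\mathbf k}\big).$$

Contraction of the external operators with the internal ones ($\hat c_2$ with $\hat A_\mu$ and $\hat A'_\nu$ with $\hat c_1^+$ in one term, $\hat c_2$ with $\hat A'_\nu$ and $\hat A_\mu$ with $\hat c_1^+$ in the other) gives

$$(74.1) = A^*_{2\mu}A'_{1\nu} + A_{1\mu}A'^*_{2\nu}, \qquad (74.2)$$

where a prime denotes the argument $x'$ and we have used the commutativity of the operators $\hat c_1$ and $\hat c_2$; for the same reason, the symbol $T$ can here be omitted.

The electron matrix element is

$$\langle 2|T\hat j^\mu(x)\hat j^\nu(x')|1\rangle = \langle 0|\hat a_2\,T(\hat{\bar\psi}\gamma^\mu\hat\psi)(\hat{\bar\psi}'\gamma^\nu\hat\psi')\,\hat a_1^+|0\rangle. \qquad (74.3)$$

This involves four $\psi$-operators. Only two are concerned with the annihilation of electron 1 and the creation of electron 2, and will be contracted with the operators $\hat a_1^+$ and $\hat a_2$. These may be $\hat{\bar\psi}$, $\hat\psi'$ or $\hat{\bar\psi}'$, $\hat\psi$ (but not $\hat{\bar\psi}$, $\hat\psi$ or $\hat{\bar\psi}'$, $\hat\psi'$: the creation and annihilation of two real electrons and one real photon at the same point $x$ or $x'$ would give an expression equal to zero). By carrying out the two possible ways of contraction, we obtain two terms in the matrix element (74.3). These will first be written on the assumption that $t > t'$:

$$(74.3) = \hat a_2(\hat{\bar\psi}\gamma^\mu\hat\psi)(\hat{\bar\psi}'\gamma^\nu\hat\psi')\hat a_1^+ + \hat a_2(\hat{\bar\psi}\gamma^\mu\hat\psi)(\hat{\bar\psi}'\gamma^\nu\hat\psi')\hat a_1^+, \qquad (74.4)$$

where in the first term $\hat a_2$ is contracted with $\hat{\bar\psi}$ and $\hat\psi'$ with $\hat a_1^+$, and in the second term $\hat a_2$ is contracted with $\hat{\bar\psi}'$ and $\hat\psi$ with $\hat a_1^+$. In the first term the contracted operators are

$$\hat a_2\hat{\bar\psi} \to \hat a_2\hat a_2^+\bar\psi_2, \qquad \hat\psi'\hat a_1^+ \to \hat a_1\hat a_1^+\psi'_1.$$

Since the operators $\hat a_2\hat a_2^+$ and $\hat a_1\hat a_1^+$ are diagonal and appear at the end of the products, they can be replaced by the vacuum expectation value, i.e. unity. To make a similar transformation in the second term of (74.4), the operator $\hat a_2^+$ must first be "pulled" to the left, and $\hat a_1$ to the right. This is done by means of the commutation rules

$$\{\hat a_p, \hat{\bar\psi}\}_+ = \bar\psi_p, \qquad \{\hat a_p^+, \hat\psi\}_+ = \psi_p. \qquad (74.5)$$

Then (74.4) becomes

$$\langle 0|(\bar\psi_2\gamma^\mu\hat\psi)(\hat{\bar\psi}'\gamma^\nu\psi'_1) - (\hat{\bar\psi}\gamma^\mu\psi_1)(\bar\psi'_2\gamma^\nu\hat\psi')|0\rangle, \qquad t > t'; \qquad (74.6)$$

only the operator factors are averaged, of course.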
Similarly, for $t < t'$, we obtain an expression differing by the interchange of $\mu$ and $\nu$ and of the primed and unprimed symbols:

$$\langle 0| -(\hat{\bar\psi}'\gamma^\nu\psi'_1)(\bar\psi_2\gamma^\mu\hat\psi) + (\bar\psi'_2\gamma^\nu\hat\psi')(\hat{\bar\psi}\gamma^\mu\psi_1)|0\rangle, \qquad t < t'. \qquad (74.7)$$

The two expressions (74.6) and (74.7) can be written as one by using the chronological product of the $\psi$-operators:

$$T\hat\psi_i(x)\hat{\bar\psi}_k(x') = \begin{cases}\hat\psi_i(x)\hat{\bar\psi}_k(x'), & t' < t,\\ -\hat{\bar\psi}_k(x')\hat\psi_i(x), & t' > t,\end{cases} \qquad (74.8)$$

where $i$ and $k$ are bispinor indices. Then the first and second terms in (74.6), (74.7) can be combined in the form

$$\bar\psi_2\gamma^\mu\langle 0|T\hat\psi\cdot\hat{\bar\psi}'|0\rangle\gamma^\nu\psi'_1 + \bar\psi'_2\gamma^\nu\langle 0|T\hat\psi'\cdot\hat{\bar\psi}|0\rangle\gamma^\mu\psi_1, \qquad (74.9)$$

where $\hat\psi\cdot\hat{\bar\psi}'$ denotes the matrix $\hat\psi_i\hat{\bar\psi}'_k$.

It should be noted that, in the definition (74.8), the operator products are taken with opposite signs for $t < t'$ and $t > t'$. In this respect it differs from the definition of the $T$ product which has been used for the operators $\hat A$ and $\hat j$. This difference arises because the fermion operators $\hat\psi$ and $\hat{\bar\psi}$ anticommute outside the light cone, unlike the commuting boson operators $\hat A$ and the bilinear operators $\hat j = \hat{\bar\psi}\gamma\hat\psi$.† This procedure ensures the relativistic invariance of the definition (74.8). A formal proof of the commutation rules for the $\psi$-operators will be given in §75.‡

We shall define the electron propagation function or electron propagator, a bispinor of rank two, as

$$G_{ik}(x - x') = -i\langle 0|T\hat\psi_i(x)\hat{\bar\psi}_k(x')|0\rangle. \qquad (74.10)$$

Then the electron matrix element becomes

$$\langle 2|T\hat j^\mu(x)\hat j^\nu(x')|1\rangle = i\bar\psi_2\gamma^\mu G\gamma^\nu\psi'_1 + i\bar\psi'_2\gamma^\nu G\gamma^\mu\psi_1. \qquad (74.11)$$

On multiplication by the photon matrix element (74.1) and integration over $d^4x\,d^4x'$, the two terms in (74.11) give the same result, and so we have

$$S_{fi} = -ie^2\int\!\!\int d^4x\,d^4x'\,\bar\psi_2(x)\gamma^\mu G(x - x')\gamma^\nu\psi_1(x')\,\big\{A^*_{2\mu}(x)A_{1\nu}(x') + A^*_{2\nu}(x')A_{1\mu}(x)\big\}. \qquad (74.12)$$

† The $\psi$-operators themselves, it will be recalled, correspond to no measurable physical quantities, and therefore need not commute outside the light cone.

‡ The $T$ product of any number of $\psi$-operators may be defined similarly.
It is equal to the product of all the operators arranged in order of increasing time from right to left, the sign being determined by the parity of the interchange needed to obtain this order from the order shown under the $T$ product symbol. Accordingly, this sign changes when any two $\psi$-operators are interchanged; for example, $T\hat\psi_i(x)\hat{\bar\psi}_k(x') = -T\hat{\bar\psi}_k(x')\hat\psi_i(x)$.

Substituting the plane waves (64.8), (64.9) for the electron and photon wave functions and separating the delta function as in (73.10), we obtain finally the scattering amplitude

$$M_{fi} = -4\pi e^2\,\bar u_2\big\{(\gamma e_2^*)G(p_1 + k_1)(\gamma e_1) + (\gamma e_1)G(p_1 - k_2)(\gamma e_2^*)\big\}u_1, \qquad (74.13)$$

where $e_1$, $e_2$ are the photon polarization 4-vectors and $G(p)$ the electron propagator in the momentum representation.

The two terms in this expression are represented by the following Feynman diagrams:

$$-4\pi e^2\,\bar u_2(\gamma e_2^*)G(f)(\gamma e_1)u_1, \quad f = p_1 + k_1; \qquad -4\pi e^2\,\bar u_2(\gamma e_1)G(f')(\gamma e_2^*)u_1, \quad f' = p_1 - k_2. \qquad (74.14)$$

[Feynman diagrams: electron line $p_1 \to p_2$ with photon free ends $k_1$, $k_2$ and a virtual electron line $f$ or $f'$]

The broken-line free ends of the diagrams correspond to real photons; the incoming lines (initial photon) are associated with a factor $\sqrt{4\pi}\,e$, and the outgoing lines (final photon) with a factor $\sqrt{4\pi}\,e^*$, where $e$ is the polarization 4-vector. In the first diagram, the initial photon is absorbed together with the initial electron, and the final photon is emitted together with the final electron. In the second diagram, the final photon is emitted together with the annihilation of the initial electron, and the initial photon is absorbed together with the creation of the final electron.

The continuous internal line joining the two vertices represents a virtual electron whose 4-momentum is determined by the conservation of the 4-momentum at the vertices. This line is associated with a factor $iG(f)$. Unlike the 4-momentum of a real particle, that of the virtual electron has a square which is not equal to $m^2$. If the invariant $f^2$ is considered, for example, in the rest frame of the electron, we easily find that

$$f^2 = (p_1 + k_1)^2 > m^2, \qquad f'^2 = (p_1 - k_2)^2 < m^2. \qquad (74.15)$$

§75.
The electron propagator

The propagation functions or propagators defined in §§73 and 74 are of fundamental importance in the formalism of quantum electrodynamics. The photon propagator $D_{\mu\nu}$ is a basic characteristic of the interaction of two electrons, as is shown by its position in the electron scattering amplitude, in which it is multiplied by the transition currents of the two particles. The electron propagator plays a similar part in the electron-photon interaction.

Let us now calculate the actual values of the propagators, taking first the electron propagator. Let the operator $\gamma p - m$, where $p_\mu = i\partial_\mu$, act on the function

$$G_{ik}(x - x') = -i\langle 0|T\hat\psi_i(x)\hat{\bar\psi}_k(x')|0\rangle, \qquad (75.1)$$

$i$ and $k$ being bispinor indices. Since $\hat\psi(x)$ satisfies Dirac's equation $(\gamma p - m)\hat\psi(x) = 0$, we find that the result is zero at all points $x$, except those for which $t = t'$. The reason is that $G(x - x')$ tends to different limits as $t \to t' + 0$ and $t \to t' - 0$: according to the definition (74.8) these limits are respectively

$$-i\langle 0|\hat\psi_i(\mathbf r, t)\hat{\bar\psi}_k(\mathbf r', t)|0\rangle \quad\text{and}\quad +i\langle 0|\hat{\bar\psi}_k(\mathbf r', t)\hat\psi_i(\mathbf r, t)|0\rangle,$$

and, as we shall see, they are not the same on the light cone. This causes an additional delta-function term to appear in the derivative $\partial G/\partial t$:

$$\frac{\partial G}{\partial t} = -i\Big\langle 0\Big|T\frac{\partial\hat\psi(x)}{\partial t}\hat{\bar\psi}(x')\Big|0\Big\rangle + \delta(t - t')\big(G|_{t\to t'+0} - G|_{t\to t'-0}\big). \qquad (75.2)$$

Since the derivative with respect to $t$ appears in the operator $\gamma p - m$ in the form $i\gamma^0\,\partial/\partial t$, we therefore have

$$(\gamma p - m)_{ik}G_{kl}(x - x') = \delta(t - t')\,\gamma^0_{ik}\,\langle 0|\{\hat\psi_k(\mathbf r, t), \hat{\bar\psi}_l(\mathbf r', t)\}_+|0\rangle. \qquad (75.3)$$

The anticommutator is calculated as follows. On multiplying the operators $\hat\psi(\mathbf r, t)$ and $\hat{\bar\psi}(\mathbf r', t)$ (see (73.6)) and using the rules for interchange of the fermion operators $\hat a_{\mathbf p}$, $\hat b_{\mathbf p}$, we find

$$\{\hat\psi_i(\mathbf r, t), \hat{\bar\psi}_k(\mathbf r', t)\}_+ = \sum_{\mathbf p}\big[\psi_{\mathbf p i}(\mathbf r)\bar\psi_{\mathbf p k}(\mathbf r') + \psi_{-\mathbf p i}(\mathbf r)\bar\psi_{-\mathbf p k}(\mathbf r')\big], \qquad (75.4)$$

where $\psi_{\pm\mathbf p}(\mathbf r)$ are wave functions without the time factor; as in §§73 and 74, the polarization indices are omitted for brevity.
The set of all functions $\psi_{\pm\mathbf p}(\mathbf r)$, which are eigenfunctions of the electron Hamiltonian, forms a complete set of normalized functions, and according to the general properties of these (cf. QM, (5.12)) we have

$$\sum_{\mathbf p}\big[\psi_{\mathbf p i}(\mathbf r)\psi^*_{\mathbf p k}(\mathbf r') + \psi_{-\mathbf p i}(\mathbf r)\psi^*_{-\mathbf p k}(\mathbf r')\big] = \delta_{ik}\,\delta(\mathbf r - \mathbf r'). \qquad (75.5)$$

The sum on the right-hand side of (75.4) differs from that in (75.5) in that $\psi^*_k$ is replaced by $\bar\psi_k = (\psi^*\gamma^0)_k$, and its value is $\gamma^0_{ik}\,\delta(\mathbf r - \mathbf r')$. Thus

$$\{\hat\psi_i(\mathbf r, t), \hat{\bar\psi}_k(\mathbf r', t)\}_+ = \delta(\mathbf r - \mathbf r')\,\gamma^0_{ik}. \qquad (75.6)$$

From this formula it follows, in particular, that the operators $\hat\psi$ and $\hat{\bar\psi}$ anticommute outside the light cone, as stated in §74. When $(x - x')^2 < 0$ there is always a frame of reference in which $t = t'$; if then $\mathbf r \neq \mathbf r'$, the anticommutator (75.6) is in fact zero.

Substituting (75.6) in (75.3) (and omitting the bispinor indices), we have finally†

$$(\gamma p - m)G(x - x') = \delta^{(4)}(x - x'). \qquad (75.7)$$

Thus the electron propagator satisfies Dirac's equation with a delta function on the right-hand side. Mathematically speaking, therefore, it is the Green's function for Dirac's equation.

We shall later be concerned not with the function $G(\xi)$ itself ($\xi = x - x'$), but with its Fourier components:

$$G(p) = \int G(\xi)\,e^{ip\xi}\,d^4\xi \qquad (75.8)$$

(the propagator in the momentum representation). Taking the Fourier component of each side of (75.7), we find that $G(p)$ satisfies the algebraic equations

$$(\gamma p - m)G(p) = 1, \qquad (75.9)$$

the solution of which is

$$G(p) = \frac{\gamma p + m}{p^2 - m^2}. \qquad (75.10)$$

The four components of the 4-vector $p$ in $G(p)$ are independent variables, not related by $p^2 \equiv p_0^2 - \mathbf p^2 = m^2$. Writing the denominator in (75.10) as $p_0^2 - (\mathbf p^2 + m^2)$, we see that $G(p)$, as a function of $p_0$ for given $\mathbf p^2$, has two poles at $p_0 = \pm\varepsilon$, where $\varepsilon = \sqrt{\mathbf p^2 + m^2}$. Thus, in the integration with respect to $p_0$ in the integral

$$G(\xi) = \frac{1}{(2\pi)^4}\int e^{-ip\xi}G(p)\,d^4p = \frac{1}{(2\pi)^4}\int d^3p\; e^{i\mathbf p\cdot(\mathbf r - \mathbf r')}\int dp_0\; e^{-ip_0\tau}G(p) \qquad (75.11)$$

(where $\tau = t - t'$), the question of avoiding the poles arises; until this is decided, the expression (75.10) remains essentially indeterminate.
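The algebraic statements (75.9) and (75.10) can be verified directly with explicit matrices. The sketch below assumes the standard (Dirac) representation of the $\gamma$ matrices and an arbitrarily chosen off-shell 4-momentum; the function names `slash` and `propagator` are illustrative only.

```python
import numpy as np

# Dirac matrices in the standard representation (an assumption of this sketch).
I2, Z2 = np.eye(2), np.zeros((2, 2))
sigma = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
gamma0 = np.block([[I2, Z2], [Z2, -I2]]).astype(complex)
gammas = [np.block([[Z2, s], [-s, Z2]]) for s in sigma]

def slash(p):
    """gamma p = gamma^0 p_0 - gamma . p for p = (p0, px, py, pz)."""
    return gamma0 * p[0] - sum(g * pi for g, pi in zip(gammas, p[1:]))

def propagator(p, m):
    """G(p) = (gamma p + m) / (p^2 - m^2), formula (75.10), poles not yet displaced."""
    p_sq = p[0] ** 2 - p[1:] @ p[1:]
    return (slash(p) + m * np.eye(4)) / (p_sq - m * m)

m = 1.0
p_off_shell = np.array([0.3, 0.1, -0.4, 0.25])   # arbitrary, p^2 != m^2

# (gamma p - m) G(p) should be the unit 4x4 matrix, formula (75.9).
product = (slash(p_off_shell) - m * np.eye(4)) @ propagator(p_off_shell, m)
print(np.allclose(product, np.eye(4)))

# On the mass shell p0 = +eps or -eps the matrix (gamma p - m) is singular,
# which is the origin of the two poles of G(p).
eps = np.sqrt(p_off_shell[1:] @ p_off_shell[1:] + m * m)
for p0 in (eps, -eps):
    on_shell = np.concatenate(([p0], p_off_shell[1:]))
    print(abs(np.linalg.det(slash(on_shell) - m * np.eye(4))))
```

The check relies only on the identity $(\gamma p)^2 = p^2$, so any other representation of the $\gamma$ matrices would serve equally well.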
To settle this question, we go back to the original definition (75.1), and substitute in it the $\psi$-operators as the sums (73.6), noting that the only non-zero vacuum expectation values are those of the following products of creation and annihilation operators:

$$\langle 0|\hat a_{\mathbf p}\hat a^+_{\mathbf p}|0\rangle = 1, \qquad \langle 0|\hat b_{\mathbf p}\hat b^+_{\mathbf p}|0\rangle = 1.$$

(Since in the vacuum state there are no particles, a particle has to be "created" by the operator $\hat a^+_{\mathbf p}$ or $\hat b^+_{\mathbf p}$ before it can be "annihilated" by $\hat a_{\mathbf p}$ or $\hat b_{\mathbf p}$.) The result is

$$G_{ik}(x - x') = -i\sum_{\mathbf p}\psi_{\mathbf p i}(\mathbf r, t)\bar\psi_{\mathbf p k}(\mathbf r', t') = -i\sum_{\mathbf p}e^{-i\varepsilon(t - t')}\psi_{\mathbf p i}(\mathbf r)\bar\psi_{\mathbf p k}(\mathbf r') \quad\text{for } t - t' > 0;$$

$$G_{ik}(x - x') = i\sum_{\mathbf p}\bar\psi_{-\mathbf p k}(\mathbf r', t')\psi_{-\mathbf p i}(\mathbf r, t) = i\sum_{\mathbf p}e^{i\varepsilon(t - t')}\psi_{-\mathbf p i}(\mathbf r)\bar\psi_{-\mathbf p k}(\mathbf r') \quad\text{for } t - t' < 0. \qquad (75.12)$$

For $t > t'$ only the electron terms contribute to $G$, and for $t < t'$ only the positron terms.

If the summation over $\mathbf p$ is replaced by an integration over $d^3p$, a comparison of (75.12) and (75.11) shows that the integral

$$\int e^{-ip_0\tau}G(p)\,dp_0 \qquad (75.13)$$

must have a phase factor $e^{-i\varepsilon\tau}$ for $\tau > 0$ and $e^{i\varepsilon\tau}$ for $\tau < 0$. This can be achieved by passing above the pole $p_0 = \varepsilon$ and below $p_0 = -\varepsilon$ in the plane of the complex variable $p_0$:

[Figure: contour of integration in the complex $p_0$ plane, passing below the pole $-\varepsilon$ and above the pole $+\varepsilon$] (75.14)

For, when $\tau > 0$, the path of integration is closed by an infinite semicircle in the lower half-plane, so that the value of the integral (75.13) is given by the residue at the pole $p_0 = +\varepsilon$; when $\tau < 0$, the path is closed in the upper half-plane, and the integral is given by the residue at the pole $p_0 = -\varepsilon$. The desired result is thus obtained in each case.

This rule for avoiding the poles (Feynman's rule) can be differently stated as follows: the integration is everywhere along the real axis, but the mass $m$ of the particle is given an infinitesimal negative imaginary part:

$$m \to m - i0. \qquad (75.15)$$

We then have

$$\varepsilon \to \sqrt{\mathbf p^2 + (m - i0)^2} = \sqrt{\mathbf p^2 + m^2 - i0} = \varepsilon - i0.$$

The poles $p_0 = \pm\varepsilon$ are therefore moved off the real axis:

[Figure: displaced poles at $p_0 = -\varepsilon + i0$ and $p_0 = +\varepsilon - i0$ in the complex $p_0$ plane] (75.16)

and the integration along this axis is equivalent to integration along the path (75.14).† Using the rule (75.15), we can write the propagator (75.10) in the form

$$G(p) = \frac{\gamma p + m}{p^2 - m^2 + i0}. \qquad (75.17)$$

The rule of integration with displaced poles can be proved by means of the relation

$$\frac{1}{x + i0} = P\frac{1}{x} - i\pi\delta(x), \qquad (75.18)$$

which is to be taken in the sense that multiplication by any function $f(x)$ and integration gives

$$\int_{-\infty}^{\infty}\frac{f(x)}{x + i0}\,dx = P\!\!\int_{-\infty}^{\infty}\frac{f(x)}{x}\,dx - i\pi f(0) \qquad (75.19)$$

(the symbol $P$ denoting the principal value).

The Green's function (75.10) is the product of the bispinor factor $\gamma p + m$ and a scalar,

$$G^{(0)}(p) = \frac{1}{p^2 - m^2}. \qquad (75.20)$$

The corresponding coordinate function $G^{(0)}(\xi)$ is evidently a solution of the equation

$$(p^2 - m^2)G^{(0)}(x - x') = \delta^{(4)}(x - x'), \qquad (75.21)$$

i.e. it is the Green's function of the equation $(p^2 - m^2)\psi = 0$. In this sense we can say that $G^{(0)}(x - x')$ is the scalar-particle propagator. It is easily seen by calculation, in the same manner as above, that the scalar field propagation function can be expressed in terms of the $\psi$-operators (11.2) by

$$G^{(0)}(x - x') = -i\langle 0|T\hat\psi(x)\hat\psi^+(x')|0\rangle, \qquad (75.22)$$

which is analogous to the definition (75.1). The chronological product is defined (as for all boson operators) by

$$T\hat\psi(x)\hat\psi^+(x') = \begin{cases}\hat\psi(x)\hat\psi^+(x'), & t' < t,\\ \hat\psi^+(x')\hat\psi(x), & t' > t,\end{cases} \qquad (75.23)$$

with the same sign for both $t > t'$ and $t < t'$.

† The explicit form of (75.7), including the bispinor indices, is

$$(\gamma p - m)_{il}G_{lk}(x - x') = \delta^{(4)}(x - x')\,\delta_{ik}. \qquad (75.7a)$$

† It is useful to note that the rule for moving the poles corresponds to an infinitesimal damping of $G(x - x')$ with respect to $|\tau| = |t - t'|$: if the value of $p_0$ at the displaced poles is written as $-(\varepsilon - i\delta)$ and $+(\varepsilon - i\delta)$, with $\delta \to +0$, then the time factor in the integral (75.13) becomes $\exp(-i\varepsilon|\tau| - \delta|\tau|)$.

§ 76. The photon propagator

Hitherto we have been concerned (in §§43 and 74) with the explicit form of the electromagnetic field operator $\hat A$ only in finding the matrix elements with respect to a change in the number of real photons.
For this purpose it was sufficient to use the representation (§2) of the free field potentials in terms of transverse plane waves. This representation, however, does not give a complete description of every field, as is clear from the fact that the scattering diagrams (73.13), (73.14) must also take account of the Coulomb interaction of the electrons. The latter is described by the scalar potential $\Phi$ and certainly cannot be reduced to an exchange between transverse virtual photons (describable by a vector potential such that $\operatorname{div}\mathbf A = 0$).† Thus we have as yet essentially no complete definition of the operators $\hat A$, and without this it is impossible to carry out a direct calculation of the photon propagator by means of the formula

$$D_{\mu\nu}(x - x') = i\langle 0|T\hat A_\mu(x)\hat A_\nu(x')|0\rangle. \qquad (76.1)$$

On the other hand, the fact that the potentials are not gauge-invariant deprives of much of their physical meaning the operators which would be needed for a complete quantization of the electromagnetic field.

These difficulties, however, are purely formal, not physical, and can be avoided by using certain general properties of the propagator, which are evident from the requirements of relativistic invariance and gauge invariance.

The most general 4-tensor of rank two which depends only on the 4-vector $\xi = x - x'$ is

$$D_{\mu\nu}(\xi) = g_{\mu\nu}D(\xi^2) - \partial_\mu\partial_\nu D^{(l)}(\xi^2), \qquad (76.2)$$

where $D$ and $D^{(l)}$ are scalar functions of the invariant $\xi^2$.‡ This tensor is necessarily symmetrical.

† With the condition $\operatorname{div}\mathbf A = 0$, Maxwell's equations lead to the following equations for $\mathbf A$ and $\Phi$:

$$\Box\mathbf A = -4\pi\mathbf j + \nabla\frac{\partial\Phi}{\partial t}, \qquad \Delta\Phi = -4\pi\rho.$$

In this gauge, the potential $\Phi$ satisfies the static Poisson's equation; cf. (76.13) below with $D_{00}$ in the same gauge.

‡ These functions are different in the three ranges of values of the argument which are not mutually interchanged by Lorentz transformations: the regions outside the light cone ($\xi^2 < 0$), and within its two parts ($\xi^2 > 0$; $\xi_0 > 0$, $\xi_0 < 0$).
In the momentum representation, we correspondingly have

$$D_{\mu\nu}(k) = D(k^2)\,g_{\mu\nu} + k_\mu k_\nu D^{(l)}(k^2), \qquad (76.3)$$

where $D(k^2)$, $D^{(l)}(k^2)$ are the Fourier components of the functions $D(\xi^2)$, $D^{(l)}(\xi^2)$.

The photon propagation function, in physical quantities (scattering amplitudes), is multiplied by the transition currents of two electrons, i.e. it appears in combinations of the form $j^\mu_{21}D_{\mu\nu}j^\nu_{43}$; see, for instance, (73.13). But, because of the conservation of current ($\partial_\mu j^\mu = 0$), the matrix elements $j_{21} = \bar\psi_2\gamma\psi_1$ satisfy the condition of 4-transversality,

$$k_\mu j^\mu_{21} = 0, \qquad (76.4)$$

where $k = p_2 - p_1$; cf. (43.13). It is therefore clear that all physical results are unchanged by the substitution

$$D_{\mu\nu} \to D_{\mu\nu} + \chi_\mu k_\nu + \chi_\nu k_\mu, \qquad (76.5)$$

where the $\chi_\mu$ are any functions of $\mathbf k$ and $k_0$. This arbitrariness in the choice of $D_{\mu\nu}$ corresponds to that in the field potential gauge.

The arbitrary gauge transformation (76.5) can violate the relativistically invariant form of $D_{\mu\nu}$ assumed in (76.3) if the quantities $\chi_\mu$ do not make up a 4-vector. But, even considering only relativistically invariant forms of the propagator, we see that the choice of the function $D^{(l)}(k^2)$ in (76.3) is entirely arbitrary; it does not affect any physical results, and can be made in any convenient manner (L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov, 1954).

Thus the determination of the propagation function amounts to that of a single gauge-invariant function $D(k^2)$. If we take a given value of $k^2$, and the $z$-axis in the direction of $\mathbf k$, the transformations (76.5) will not affect the components $D_{xx} = D_{yy} = -D(k^2)$. It is therefore sufficient to calculate the component $D_{xx}$, using any gauge for the potentials.

We shall use a gauge in which $\operatorname{div}\mathbf A = 0$ and the operator $\hat{\mathbf A}$ is given by the expansion (2.17), (2.18):

$$\hat{\mathbf A} = \sum_{\mathbf k\alpha}\sqrt{\frac{4\pi}{2\omega}}\,\big(\hat c_{\mathbf k\alpha}\mathbf e^{(\alpha)}e^{-ikx} + \hat c^+_{\mathbf k\alpha}\mathbf e^{(\alpha)*}e^{ikx}\big), \qquad \omega = |\mathbf k|; \qquad (76.6)$$

the index $\alpha = 1, 2$ labels the polarizations. The only non-zero vacuum expectation values of products of the operators $\hat c$, $\hat c^+$ are $\langle 0|\hat c_{\mathbf k\alpha}\hat c^+_{\mathbf k\alpha}|0\rangle = 1$.
Then, by the definition (76.1), we have

$$D_{ik}(\xi) = \int\frac{d^3k}{(2\pi)^3}\,\frac{2\pi i}{\omega}\Big(\sum_\alpha e_i^{(\alpha)}e_k^{(\alpha)*}\Big)e^{-i\omega|\tau| + i\mathbf k\cdot\boldsymbol\xi}, \qquad (76.7)$$

where $i$, $k$ are three-dimensional vector indices; the summation over $\mathbf k$ has been replaced by an integration over $d^3k/(2\pi)^3$. The absolute value of $\tau = t - t'$ appears in the exponent, because the operator product in (76.1) is chronological.

It is evident from (76.7) that the integrand without the factor $e^{i\mathbf k\cdot\boldsymbol\xi}$ is the component of the three-dimensional Fourier expansion of the function $D_{ik}(\mathbf r, t)$. For $D_{xx} = -D$, it is

$$\frac{2\pi i}{\omega}\,e^{-i\omega|\tau|}\sum_\alpha|e_x^{(\alpha)}|^2 = \frac{2\pi i}{\omega}\,e^{-i\omega|\tau|}.$$

To find $D_{xx}(k^2)$ we now have to represent this function as a Fourier integral over time. The appropriate formula is

$$\frac{2\pi i}{\omega}\,e^{-i\omega|\tau|} = -\frac{1}{2\pi}\int_{-\infty}^{\infty}\frac{4\pi}{k_0^2 - \mathbf k^2 + i0}\,e^{-ik_0\tau}\,dk_0.$$

As explained in §75, this integration is understood to be taken along a contour passing above the pole $k_0 = |\mathbf k| = \omega$ and below the pole $k_0 = -|\mathbf k| = -\omega$; for $\tau > 0$ the value of the integral is determined by the residue at the pole $k_0 = +\omega$, and for $\tau < 0$ by that at $k_0 = -\omega$. Thus we have finally

$$D(k^2) = \frac{4\pi}{k^2 + i0}. \qquad (76.8)$$

The term $+i0$ in the denominator which results from this proof is in accordance with the rule (75.15), $i0$ being subtracted from the (zero) mass of the photon.

It is evident from (76.8) that the corresponding coordinate function $D(\xi^2)$ satisfies the equation

$$-\partial_\mu\partial^\mu D(x - x') = 4\pi\delta^{(4)}(x - x'), \qquad (76.9)$$

i.e. it is the Green's function of the wave equation.

We shall generally take $D^{(l)} = 0$, i.e. use the propagation function

$$D_{\mu\nu} = g_{\mu\nu}D(k^2) = \frac{4\pi}{k^2 + i0}\,g_{\mu\nu} \qquad (76.10)$$

(the Feynman gauge).

There are also other gauges which may be advantageous in certain applications. Putting $D^{(l)} = -D/k^2$, we obtain the propagator in the form

$$D_{\mu\nu} = \frac{4\pi}{k^2}\Big(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\Big) \qquad (76.11)$$

(the Landau gauge), with $D_{\mu\nu}k^\nu = 0$. This choice is similar to the Lorentz gauge for potentials ($A_\mu k^\mu = 0$).

The propagator gauge conditions $D_{il}k^l = 0$, $D_{0l}k^l = 0$ are analogous to the three-dimensional gauge condition $\operatorname{div}\mathbf A = 0$ for the potentials.
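The statement that the longitudinal part of the propagator has no effect on physical quantities can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the two "transition currents" are arbitrary real 4-vectors made 4-transverse by projection (stand-ins for the true matrix elements, which are complex), and the value of $k$ is chosen arbitrarily off the light cone; the name `make_transverse` is hypothetical.

```python
import numpy as np

g = np.diag([1.0, -1.0, -1.0, -1.0])       # metric tensor g_{mu nu}
k_up = np.array([0.8, 0.3, -0.5, 1.1])     # arbitrary k^mu with k^2 != 0
k_dn = g @ k_up                            # covariant components k_mu
k_sq = k_up @ k_dn                         # invariant k^2

# Feynman gauge (D^(l) = 0) and Landau gauge (D^(l) = -D/k^2).
D_feynman = 4 * np.pi / k_sq * g
D_landau = 4 * np.pi / k_sq * (g - np.outer(k_dn, k_dn) / k_sq)

def make_transverse(j_up):
    """Project a 4-vector j^mu so that k_mu j^mu = 0 (illustrative helper)."""
    return j_up - k_up * (k_dn @ j_up) / k_sq

rng = np.random.default_rng(0)
j_21 = make_transverse(rng.normal(size=4))
j_43 = make_transverse(rng.normal(size=4))

# The contraction j^mu D_{mu nu} j^nu computed in the two gauges.
amp_feynman = j_21 @ D_feynman @ j_43
amp_landau = j_21 @ D_landau @ j_43

print("k.j vanishes:        ", np.isclose(k_dn @ j_21, 0.0), np.isclose(k_dn @ j_43, 0.0))
print("Landau D_{mu nu}k^nu:", D_landau @ k_up)
print("same amplitude:      ", np.isclose(amp_feynman, amp_landau))
```

The same comparison would hold for any other choice of $D^{(l)}$, since the added term is always proportional to $k_\mu k_\nu$ and is annihilated by the transverse currents.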
Together with $D_{xx} = -D = -4\pi/k^2$, the conditions $D_{il}k^l = 0$, $D_{0l}k^l = 0$ give

$$D_{il} = -\frac{4\pi}{\omega^2 - \mathbf k^2}\Big(\delta_{il} - \frac{k_ik_l}{\mathbf k^2}\Big). \qquad (76.12)$$

In order to obtain this $D_{il}$, we must apply to the propagator (76.10) the transformation (76.5), putting

$$\chi_0 = -\frac{4\pi\omega}{2(\omega^2 - \mathbf k^2)\mathbf k^2}, \qquad \chi_i = \frac{4\pi k_i}{2(\omega^2 - \mathbf k^2)\mathbf k^2}.$$

The remaining components $D_{\mu\nu}$ are then found to be

$$D_{00} = -4\pi/\mathbf k^2, \qquad D_{0i} = 0. \qquad (76.13)$$

This is called the Coulomb gauge (E. E. Salpeter, 1952). $D_{00}$ is here the Fourier component of the Coulomb potential.

Finally, the propagator gauge in which

$$D_{il} = -\frac{4\pi}{\omega^2 - \mathbf k^2}\Big(\delta_{il} - \frac{k_ik_l}{\omega^2}\Big), \qquad D_{0i} = D_{00} = 0, \qquad (76.14)$$

is analogous to the potential gauge condition $\Phi = 0$. This is a convenient form for use in non-relativistic problems (I. E. Dzyaloshinskii and L. P. Pitaevskii, 1959).

All the above expressions relate to the momentum representation of the propagator. In some cases, it is convenient to use the mixed frequency-coordinate representation, i.e. the function

$$D_{\mu\nu}(\omega, \mathbf r) = \int D_{\mu\nu}(\omega, \mathbf k)\,e^{i\mathbf k\cdot\mathbf r}\,\frac{d^3k}{(2\pi)^3}.$$

In the Feynman gauge (76.10), $D_{\mu\nu}(\omega, \mathbf r) = g_{\mu\nu}D(\omega, r)$, where

$$D(\omega, r) = 4\pi\int\frac{e^{i\mathbf k\cdot\mathbf r}}{\omega^2 - \mathbf k^2 + i0}\,\frac{d^3k}{(2\pi)^3} = \frac{1}{i\pi r}\int_0^\infty\frac{e^{ikr} - e^{-ikr}}{\omega^2 - k^2 + i0}\,k\,dk,$$

or, changing $k$ to $-k$ in the second term in the integrand,

$$D(\omega, r) = \frac{1}{i\pi r}\int_{-\infty}^{\infty}\frac{e^{ikr}\,k\,dk}{\omega^2 - k^2 + i0}. \qquad (76.15)$$

The integration here is carried out by closing the contour of integration with an infinite semicircle in the upper half-plane of the complex variable $k$, and amounts to taking the residue at the pole $k = |\omega| + i0$. The final result is

$$D(\omega, r) = -\frac{1}{r}\,e^{i|\omega|r}. \qquad (76.16)$$

The following comment may be made regarding this expression. The process described by the diagrams (73.13) and (73.14) may be intuitively regarded as the scattering of electron 2 in the field due to electron 1 (or vice versa). The function (76.16) corresponds to the usual "retarded" potential $\propto e^{i\omega r}$ (see Fields, (64.1), (64.2)) only when $\omega > 0$. The sign of $\omega$, however, depends on the arbitrary choice of the direction of the arrow of $k$ in the diagram.
The above-mentioned property of $D(\omega, r)$ signifies that in quantum electrodynamics the source of the field is to be regarded as the particle which loses energy, i.e. emits a virtual photon.

To conclude this section, let us also consider the problem of the propagator for particles with spin 1 and non-zero mass. There is then no arbitrariness of the gauge, and the choice of the propagator is unambiguous. Substituting the $\psi$-operators (14.16) in the definition

$$G_{\mu\nu}(x - x') = -i\langle 0|T\hat\psi_\mu(x)\hat\psi^+_\nu(x')|0\rangle, \qquad (76.17)$$

we obtain an expression that differs from (76.7) only in that the sum over polarizations in the integrand is replaced by the corresponding sum of products of the 4-vector polarization amplitudes. Summation over polarizations is equivalent to averaging and multiplication by 3, the number of independent polarizations. Averaging gives the density matrix of unpolarized particles (14.15). Thus we find for the propagator of vector particles

$$G_{\mu\nu}(p) = -\frac{1}{p^2 - m^2 + i0}\Big(g_{\mu\nu} - \frac{p_\mu p_\nu}{m^2}\Big). \qquad (76.18)$$

The propagators (75.17) and (76.18) have similar structures: the denominator contains the difference $p^2 - m^2$, and the numerator is (apart from a factor) the density matrix of unpolarized particles with a given spin.

§77. General rules of the diagram technique

The calculation of the scattering matrix elements that has been given for some simple cases in §§73 and 74 contains all the fundamental features of the general method. There is no particular difficulty in deriving the corresponding general rules for calculating the matrix elements in any order of perturbation theory.

As has already been mentioned, the matrix element of the scattering operator $\hat S$ for the transition between any initial and final states is equal to the vacuum expectation value of the operator obtained by multiplying $\hat S$ on the right by the creation operators of all the initial particles and on the left by the annihilation operators of all the final particles. This treatment puts the S-matrix element in the following form in the $n$th order of perturbation theory:

$$\langle f|S^{(n)}|i\rangle = \frac{1}{n!}\langle 0|\ldots\hat b_{1f}\ldots\hat a_{1f}\ldots\hat c_{1f}\int d^4x_1\ldots d^4x_n\,T\big\{(\hat{\bar\psi}_1(-ie\gamma\hat A_1)\hat\psi_1)\ldots(\hat{\bar\psi}_n(-ie\gamma\hat A_n)\hat\psi_n)\big\}\,\hat c^+_{1i}\ldots\hat a^+_{1i}\ldots\hat b^+_{1i}\ldots|0\rangle; \qquad (77.1)$$

the suffixes $1i, 2i, \ldots$ label the initial particles (positrons, electrons and photons separately), and the suffixes $1f, 2f, \ldots$ label the final particles. The suffixes $1, 2, \ldots$ to the operators $\hat\psi$ and $\hat A$ signify that $\hat\psi_1 = \hat\psi(x_1)$ and so on.

The operators $\hat\psi$ and $\hat A$ which appear here are linear combinations of the creation and annihilation operators of the corresponding particles in various states. Thus we obtain expressions for the matrix elements which are the vacuum expectation values of the products of the particle creation and annihilation operators and of their linear combinations. The calculation of such expectation values is effected by means of the following results, which constitute Wick's theorem (G. C. Wick, 1950).

(1) The vacuum expectation value of the product of any number of boson operators $\hat c^+$ and $\hat c$ is equal to the sum of the products of all possible expectation values of these operators taken in pairs (contraction). In each pair, the factors must be placed in the same order as in the original product.

(2) For the fermion operators $\hat a^+$, $\hat a$, $\hat b^+$, $\hat b$ (of the same or different particles), the rule is the same except that each term appears in the sum with positive or negative sign according to the parity of the number of interchanges of fermion operators needed to bring together all the operators that are averaged in pairs.

The expectation value must obviously be zero unless the product contains a factor $\hat a^+$, $\hat b^+$, $\hat c^+$ for each factor $\hat a$, $\hat b$, $\hat c$. Then only pairs of operators $(\hat a, \hat a^+), \ldots$, pertaining to the same states are to be contracted, and moreover only those pairs in which $\hat a^+$, etc., is to the right of $\hat a$, etc.: the particle is first created and then annihilated (whereas $\langle 0|\hat a^+\hat a|0\rangle = 0$, etc.). If each pair $(\hat a, \hat a^+)$, etc.
appears only once in the product, Wick's theorem is obviously true, the expectation value then reducing to a single product of pairwise expectation values. Its validity is also evident when all the annihilation operators in the product are to the right of the creation operators; this is called a normal product. The expectation value is then zero.

Wick's theorem is now easily proved by induction for the general case where one pair of operators appears $k$ times in the product, as follows. Let us consider the expectation value $\langle 0|\ldots\hat c\hat c^+\ldots|0\rangle$, in which the pair of boson operators appears $k$ times; the argument is entirely similar for fermion operators. If we interchange the factors $\hat c$ and $\hat c^+$ in one pair, the commutation rules give

$$\langle 0|\ldots\hat c\hat c^+\ldots|0\rangle = \langle 0|\ldots\hat c^+\hat c\ldots|0\rangle + \langle 0|\ldots 1\ldots|0\rangle. \qquad (77.2)$$

The expectation value $\langle 0|\ldots 1\ldots|0\rangle$ contains $k - 1$ pairs, and Wick's theorem is assumed to be valid for it. If the expectation value $\langle 0|\ldots\hat c\hat c^+\ldots|0\rangle$ is expanded by Wick's theorem, it differs from $\langle 0|\ldots\hat c^+\hat c\ldots|0\rangle$ by just the term

$$\langle 0|\ldots 1\ldots|0\rangle\langle 0|\hat c\hat c^+|0\rangle = \langle 0|\ldots 1\ldots|0\rangle;$$

in the expansion of $\langle 0|\ldots\hat c^+\hat c\ldots|0\rangle$, the corresponding term $\langle 0|\ldots 1\ldots|0\rangle\langle 0|\hat c^+\hat c|0\rangle$ is zero. Hence it follows from (77.2) that, if Wick's theorem is valid for a matrix element $\langle 0|\ldots\hat c^+\hat c\ldots|0\rangle$, it is still valid when $\hat c$ and $\hat c^+$ are interchanged. Since the theorem is known to be valid for one particular order of factors (the normal order), it is therefore true in every case.

Since Wick's theorem is valid for products of operators $\hat a, \hat b, \ldots$, it is also true for all products which contain the linear combinations $\hat\psi$, $\hat{\bar\psi}$, $\hat A$ of $\hat a, \hat b, \ldots$, as well as the latter operators themselves. On applying this theorem to the matrix element (77.1), we bring it to the form of a sum of terms, each term being the product of a number of pairwise expectation values.
The latter will include contractions of the operators $\hat\psi$, $\hat{\bar\psi}$, $\hat A$ with "external" operators, i.e. those which create the initial particles or annihilate the final particles. These contractions are expressed in terms of the wave functions of the initial and final particles by the formulae

$$\langle 0|\hat A\hat c^+_p|0\rangle = A_p, \qquad \langle 0|\hat c_p\hat A|0\rangle = A^*_p,$$
$$\langle 0|\hat\psi\hat a^+_p|0\rangle = \psi_p, \qquad \langle 0|\hat a_p\hat{\bar\psi}|0\rangle = \bar\psi_p, \qquad (77.3)$$
$$\langle 0|\hat b_p\hat\psi|0\rangle = \psi_{-p}, \qquad \langle 0|\hat{\bar\psi}\hat b^+_p|0\rangle = \bar\psi_{-p},$$

where $A_p$ and $\psi_p$ are the photon and electron wave functions with momentum $p$ (the polarization indices are omitted for brevity, as in §§73 and 74).

Contractions of the "internal" operators in the $T$ product will also occur. Since the sequence of factors in each contracted pair is preserved when applying Wick's theorem, the chronological sequence of operators is preserved in these contractions, and they are therefore replaced by the corresponding propagators.†

Each term of the sum obtained from the matrix element by applying Wick's theorem is represented by a particular Feynman diagram. In the $n$th-order diagram there are $n$ vertices, each corresponding to one of the variables of integration (the 4-vectors $x_1, x_2, \ldots$). Three lines meet at each vertex, two being continuous (electron lines) and one broken (photon line); these correspond to the electron operators $\hat{\bar\psi}$ and $\hat\psi$ and the photon operator $\hat A$ as functions of the same variable $x$. The operator $\hat\psi$ corresponds to the incoming line and $\hat{\bar\psi}$ to the outgoing line.

By way of illustration, we shall give some examples of the correlation between the terms of the matrix element in the third approximation and the diagrams.

† The following comment must be made regarding this last statement. In proving Wick's theorem, we have made use of the commutation rules for the operators $\hat c$ and $\hat c^+$, which are meaningful only for real ("transverse") photons.
The "external" operators $c_i^{+}$, $c_f$ do in fact, of course, correspond to such (initial and final) photons, but the operators $A$ (which appear within the T product) describe, as shown in §76, not only transverse photons. The situation here is similar to that in the calculation of $D_{\mu\nu}$ (§76). Owing to the relativistic and gauge invariance, it is sufficient to prove the theorem for those products (i.e. components of the tensors $\langle 0|\mathrm{T}A_\mu A_\nu\ldots|0\rangle$) which are determined by the transverse parts of the potentials. The theorem is then valid for all products.

Omitting the integral sign and the symbol T, and also the operator symbols, the factors $-ie\gamma$, and the arguments of the operators, we can symbolically write these terms as

$$(\bar\psi A\psi)(\bar\psi A\psi)(\bar\psi A\psi) \qquad (77.4)$$

[Terms (a) to (d) of (77.4): this product with four different sets of contractions marked, each shown beside its diagram; the contraction lines and diagrams are not reproduced.]

For clarity, the electron and photon contractions are shown by continuous and broken lines as in the diagrams. The direction of the arrows from $\bar\psi$ to $\psi$ for the electron contractions is the same as in the diagrams. For the internal photon contractions the direction is immaterial (the photon propagator is an even function of x - x').

The terms thus obtained include equivalent terms which differ only in that the vertices are renumbered, i.e. that the correlation between the vertices and the variables is changed, or simply that the variables of integration are renamed. The number of such interchanges is n!; this cancels the factor 1/n! in (77.1), and there is then no need to consider diagrams differing only by interchange of vertices. This has already been noted in §§73 and 74. For example, there are two equivalent diagrams in the second approximation:

$$(\bar\psi A\psi)(\bar\psi A\psi) \qquad (77.5)$$

[Two equivalent contraction patterns of (77.5) and their common diagram; not reproduced.]

In (77.4) and (77.5) only internal contractions which correspond to internal diagram lines are shown (virtual electrons and photons).
The operators still free are contracted with external operators, and this establishes a correlation between the free ends of the diagrams and certain initial and final particles. Then $\bar\psi$ (contracting with operators $a_f$ or $b_i^{+}$) gives the final electron line or the initial positron line, and $\psi$ (contracting with $a_i^{+}$ or $b_f$) gives the initial electron line or the final positron line. The free operator $A$ (contracting with $c_i^{+}$ or $c_f$) can correspond to either an initial or a final photon. Thus we obtain sets of several topologically identical diagrams (i.e. diagrams having the same number of lines arranged in the same way), differing only by interchanges of initial and final particles between incoming and outgoing free ends. Each such interchange is clearly equivalent to a certain interchange of the external operators $a, b, \ldots$ in (77.1). It is therefore evident that, if the initial particles or the final particles include identical fermions, diagrams which differ by an odd number of interchanges of free ends must have opposite signs.

An uninterrupted sequence of continuous lines in the diagrams constitutes an electron line along which the arrows maintain a constant direction. Such a line may have two free ends or form a closed loop. For example, the diagram

$$(\bar\psi A\psi)(\bar\psi A\psi)$$

[with both electron contractions marked; loop diagram not reproduced]

has a loop with two vertices. The maintenance of direction along the electron line is the graphical expression of the conservation of charge: the "incoming" charge at each vertex is equal to the "outgoing" charge.

The arrangement of the bispinor indices along the continuous electron line corresponds to writing the matrices from left to right in motion contrary to the arrows. The bispinor indices of different electron lines can never become confused.
Along an open line, the sequence of indices terminates at the free ends with electron (or positron) wave functions; in a closed loop, the sequence of indices is itself closed, and the loop corresponds to the trace of the product of the matrices found on it. This trace must be taken with negative sign, as is easily seen. A loop with k vertices corresponds to a set of k contractions:

$$(\bar\psi A\psi)(\bar\psi A\psi)\ldots(\bar\psi A\psi)$$

(or to another which is equivalent, differing only in an interchange of the vertices). In k - 1 of these contractions the operators $\psi$ and $\bar\psi$ are together in the order ($\bar\psi$ to the right of $\psi$) in which they must appear in the electron propagator. The operators at the ends are brought together by an even number of interchanges with other $\psi$-operators, and are then in the order $\bar\psi\psi$. Since $\langle 0|\mathrm{T}\bar\psi\psi|0\rangle = -\langle 0|\mathrm{T}\psi\bar\psi|0\rangle$ (see the second footnote to §74), the replacement of this contraction by the corresponding propagator means a change in the sign of the whole expression.

In general, the change to the momentum representation is made in an exactly similar manner to that in §§73 and 74. As well as the general law of conservation of 4-momentum, "conservation laws" must also be satisfied at each vertex. But all these laws may not suffice to determine uniquely the momenta of all the internal lines in the diagram. In such cases, there remain integrations over $d^4p/(2\pi)^4$ for all the undetermined internal momenta; these integrations extend throughout p-space, including $p_0$ from $-\infty$ to $+\infty$.

In the above discussion it has been assumed that the perturbation is represented by the interaction between those particles which are "actively" concerned in the reaction (i.e. between particles whose state is altered as a result of the process). A similar treatment can be given for the case where there is an external electromagnetic field, i.e. a field generated by "passive" particles, whose state is not altered in the process.
Let $A^{(e)}(x)$ be the 4-potential of the external field. It appears in the Lagrangian of the interaction together with the photon operator $\hat A$, as the sum $\hat A + A^{(e)}$ (which is multiplied by the current operator j). Since $A^{(e)}$ does not involve any operators, it cannot contract with other operators. Thus only external lines in Feynman diagrams can correspond to an external field.

If $A^{(e)}$ is expressed as a Fourier integral:

$$A^{(e)}(x) = \int A^{(e)}(q)\,e^{-iqx}\,\frac{d^4q}{(2\pi)^4}, \qquad A^{(e)}(q) = \int A^{(e)}(x)\,e^{iqx}\,d^4x, \qquad (77.6)$$

the expressions for the matrix elements in the momentum representation will contain the 4-vector q together with the 4-momenta of other external lines corresponding to real particles. Each such external-field line can be correlated with a factor $A^{(e)}(q)$, and the line is to be regarded as "incoming" (in accordance with the sign of the exponent in the factor $e^{-iqx}$ which accompanies $A^{(e)}(q)$ in the Fourier integral; an "outgoing line" would be correlated with a factor $A^{(e)*}(q)$). If the 4-momenta of all the external-field lines are not uniquely defined (for given 4-momenta of all the real particles) by the law of conservation of 4-momentum, then there remain integrations over $d^4q/(2\pi)^4$ for all the "free" q and over all the other undetermined 4-momenta of the diagram lines.

If the external field is independent of time, then

$$A^{(e)}(q) = 2\pi\,\delta(q^0)\,A^{(e)}(\mathbf{q}), \qquad (77.7)$$

where $A^{(e)}(\mathbf{q})$ is the three-dimensional Fourier component:

$$A^{(e)}(\mathbf{q}) = \int A^{(e)}(\mathbf{r})\,e^{-i\mathbf{q}\cdot\mathbf{r}}\,d^3x. \qquad (77.8)$$

In this case the external line is correlated with $A^{(e)}(\mathbf{q})$ and assigned a 4-momentum $q^\mu = (0, \mathbf{q})$; the energies of the electron lines which (together with the field line) meet at a vertex will be equal by virtue of the conservation law. Integration over $d^3p/(2\pi)^3$ is necessary for the other "free" three-dimensional momenta p of the internal lines. The amplitude $M_{fi}$ thus calculated determines, for example, the scattering cross-section by (64.25).
We may give a list of final rules for the diagram technique whereby an expression may be obtained for the scattering amplitude (or rather for $iM_{fi}$) in the momentum representation.

(1) The nth approximation of perturbation theory corresponds to diagrams with n vertices, each of which is the meeting point of one incoming and one outgoing electron line (continuous) and one photon line (broken). The amplitude of the scattering process involves all the diagrams having free ends (external lines) equal in number to the initial and final particles.

(2) Each incoming continuous external line is associated with the amplitude $u(p)$ of an initial electron or $u(-p)$ of a final positron (where p is the 4-momentum of the particle). Each outgoing continuous line is associated with the amplitude $\bar u(p)$ of a final electron or $\bar u(-p)$ of an initial positron.

(3) Each vertex is associated with a 4-vector $-ie\gamma^\mu$.

(4) Each incoming broken external line is associated with the amplitude $\sqrt{4\pi}\,e_\mu$ of an initial photon, and each such outgoing line with the amplitude $\sqrt{4\pi}\,e_\mu^{*}$ of a final photon, where e is the polarization 4-vector. The vector index $\mu$ is the same as the index of the matrix $\gamma^\mu$ at the corresponding vertex, so that the scalar product $\gamma e$ or $\gamma e^{*}$ is obtained.

(5) Each continuous internal line is associated with a factor $iG(p)$, and each broken internal line with a factor $-iD_{\mu\nu}(p)$. The tensor indices $\mu$, $\nu$ are the same as the indices of the matrices $\gamma^\mu$, $\gamma^\nu$ at the vertices joined by the broken line.

(6) The arrows have a constant direction along any continuous sequence of electron lines, and the arrangement of the bispinor indices along them corresponds to writing the matrices from left to right in motion contrary to the arrows. A closed electron loop corresponds to the trace of the product of the matrices found on it.

(7) At each vertex, the 4-momenta of the lines which meet there satisfy a conservation law, i.e.
the sum of the momenta of the incoming lines is equal to the sum of the momenta of the outgoing lines. The momenta of the free ends are given quantities (subject to the general conservation law), with momentum -p assigned to the positron line. Integration over $d^4p/(2\pi)^4$ is carried out for the momenta of internal lines which remain undetermined after application of the conservation laws at every vertex.

(8) An incoming free end corresponding to an external field is associated with a factor $A^{(e)}(q)$; the 4-vector q is related to the 4-momenta of the other lines by the conservation law at the vertex. If the field is constant, the line is associated with a factor $A^{(e)}(\mathbf{q})$, and integration over $d^3p/(2\pi)^3$ is carried out for the three-dimensional momenta of internal lines which remain undetermined.

(9) An additional factor -1 is included in $iM_{fi}$ for each closed electron loop in the diagram and for each pair of positron external lines if these are the beginning and end of a single sequence of continuous lines. If the initial particles or the final particles include more than one electron or positron, the diagrams differing by an odd number of interchanges of identical particles (i.e. of the corresponding external lines) must have opposite signs.

To clarify the last rule, it may be added that diagrams having the same continuous lines, i.e. diagrams which would be identical after removal of all photon lines, must always have the same sign. When identical fermions are present, the sign of the amplitude as a whole is arbitrary.

§78. Crossing invariance

The representation of the scattering amplitudes $M_{fi}$ by Feynman integrals reveals the following noteworthy symmetry property of these amplitudes.

Any of the incoming external lines in a Feynman diagram may be regarded (without changing the direction of its arrow) as either an initial particle or a final antiparticle, and any outgoing line as either a final particle or an initial antiparticle.
When the change is made from particle to antiparticle, the significance of the 4-momentum p assigned to the line also changes: $p = p_e$ for the electron (say), and $p = -p_p$ for the positron. The polarization assigned to the particle is also changed. Since an incoming external line must correspond to an amplitude $u$ and an outgoing one to $u^{*}$, we have $u = u_e$ for the electron and $u = u_p^{*}$ for the positron; and the change from $u$ to $u^{*}$ signifies a change in the sign of the spin component (or the helicity) of the particle. For the photon, a strictly neutral particle, this is simply a change from emission to absorption or vice versa: an external photon line with momentum k corresponds either to the absorption of a photon with momentum $k_a = k$, or to the emission of a photon with momentum $k_e = -k$ and the opposite helicity.

This change in the significance of the external lines is equivalent to a change from one cross-channel of the reaction to others. Hence it follows that the same amplitude, as a function of the momenta of the free ends of the diagrams, describes every channel of the reaction.† Only the meaning of the arguments of the function varies with the channel: the change from particle to antiparticle implies $p_i \to -p_f$, where $p_i$ is the 4-momentum of the initial particle (in one channel) and $p_f$ the 4-momentum of the final particle (in the other channel). This property of the scattering amplitude is called crossing symmetry or crossing invariance.

In terms of the invariant amplitudes defined in §70 as functions of the kinematic invariants, we can say that these functions will be the same for all channels, but for each channel their arguments will take values in the corresponding physical region. Thus the Feynman integrals determine the invariant amplitudes as analytic functions; their values in the various physical regions are the analytical continuation of a function specified in one region.
Since the integrands in the Feynman integrals have singularities, so do the invariant amplitudes, and their singularities can be determined from the expressions for the integrals, using the rule of pole avoidance. If the invariant amplitudes are calculated for any one channel from the Feynman integrals, their analytical continuation to the other channels will necessarily take account of these singularities.

† If a particular channel is forbidden by the conservation of 4-momentum, the transition probability is necessarily zero because of the delta function which appears as a factor in (64.5).

It should be emphasized that crossing invariance goes beyond the properties of the scattering matrix which follow from the general requirements of space-time symmetry. The latter imply the equality of amplitudes for processes which differ by the interchange of initial and final states and the replacement of all particles by antiparticles (with the momenta p of all particles unchanged and the signs of their angular momentum components reversed). This is the condition of CPT invariance.‡ Crossing invariance, however, allows this transformation not only for all the particles at once but also for any one particle.

§79. Virtual particles

The internal lines in the Feynman diagrams play a role in invariant perturbation theory analogous to that of the intermediate states in the "ordinary" theory, but the nature of these states is different in the two theories. In the ordinary theory the (three-dimensional) momentum is conserved in the intermediate states, but the energy is not, and for this reason they are said to be virtual states. In the invariant theory, the momentum and the energy appear on an equal footing: in the intermediate states, the whole 4-momentum is conserved (this results from the fact that the integration in the S-matrix elements is over both coordinates and time, thus ensuring the invariance of the theory).
But the relation between energy and momentum which holds for real particles and is expressed by the equation $p^2 = m^2$ is no longer satisfied in the intermediate states, which are therefore spoken of as intermediate virtual particles. The relation between the momentum and energy of a virtual particle may be anything required by the conservation of 4-momentum at the vertices.

Let us consider a diagram consisting of two parts I and II, joined by a single line. Ignoring the internal structure of these parts, we can represent the diagram in the schematic form

[Diagram (79.1): parts I and II, each with its external lines, joined by a single internal line p; figure not reproduced.]

(the lines shown may be either continuous or broken lines). By the general conservation law, the sums of the 4-momenta of the external lines for parts I and II are equal. Because of the conservation at each vertex, they are also equal to the 4-momentum p of the internal line joining parts I and II. Thus this momentum is uniquely defined, and there is therefore no integration with respect to it in the matrix element. The quantity $p^2$ may be either positive or negative, depending on the reaction channel. There is always a channel in which $p^2 > 0$.† Then the virtual particle is entirely analogous, as regards its formal properties, to a real particle with real mass $M = \sqrt{p^2}$. Its rest frame can be defined, its spin determined, and so on.

The tensor structure of the photon propagator (76.11) is the same as that of the density matrix of an unpolarized particle with spin 1 and non-zero mass:

$$\rho_{\mu\nu} = -\tfrac{1}{3}\left(g_{\mu\nu} - \frac{p_\mu p_\nu}{m^2}\right)$$

‡ The formal description of the change from one of these reactions to the other by reversing the signs of all the 4-momenta in the Feynman diagrams corresponds to the significance of the operation CPT as 4-inversion.

† For example, the channel (if it is allowable on energy grounds) in which all the free ends of part I correspond to initial particles and those of part II to final particles.
Then $p = P_i$ (the sum of the 4-momenta of all the initial particles), and in the centre-of-mass system $p = (P_i^0, 0)$, so that $p^2 > 0$.

(see (14.15)). For a virtual particle the propagator (a quantity obtained from a quadratic combination of the field operators) plays a role analogous to that of the density matrix for a real particle. Thus a virtual photon, like a real photon, must be assigned spin 1. But, unlike the two independent polarizations of the real photon, all three polarizations are possible for the virtual photon, which is a "particle" with finite mass.

The electron propagation function is $G \propto \gamma p + m$, where m is the mass of the real electron, the "mass" of the virtual particle being $M = \sqrt{p^2}$. Putting

$$\gamma p + m = \frac{M + m}{2M}(\gamma p + M) + \frac{M - m}{2M}(\gamma p - M), \qquad (79.2)$$

we see that the first term corresponds to the density matrix of a particle with mass M and spin ½, and the second term to that of a similar "antiparticle"; cf. (29.10) and (29.17). Since the particle and the antiparticle have different internal parities (§27), we conclude that the same spin ½ must be assigned to the virtual electron, but that no definite parity can be assigned to it.

A characteristic feature of the diagram (79.1) is that it can be cut into two unconnected parts by dividing only one internal line.† This line corresponds, in such cases, to a one-particle intermediate state, i.e. a state having only one virtual particle. The scattering amplitude corresponding to such a diagram contains the characteristic factor (which does not undergo integration)

$$\frac{1}{p^2 - m^2 + i0},$$

arising from the internal line p; m is the electron mass for an electron line and zero for a photon line. Thus the scattering amplitude has poles at the values of p for which the virtual particle would become a physical one ($p^2 = m^2$).
This situation is similar to the one in non-relativistic quantum mechanics, where the scattering amplitude has poles for energy values corresponding to bound states of the system of colliding particles (QM, §128).

Let us consider the diagram (79.1) for the reaction channel in which all the free ends on the right correspond to initial particles, and all those on the left to final particles; then $p^2 > 0$. Then we can say that, in the intermediate state, all the initial particles are converted into one virtual particle. This is possible only if such a conversion would not contradict the necessary conservation laws (not including the conservation of 4-momentum), namely the conservation of angular momentum, charge, charge parity, etc. This is the necessary condition for the occurrence of what are called pole diagrams. If these exist for one reaction channel, they exist also for the remaining channels, because of crossing invariance.

† This property occurs for the diagrams of almost all processes in the first non-vanishing approximation.

For example, the conservation laws mentioned do not preclude the formation of a virtual electron by $e + \gamma \to e$. This corresponds to a pole of the Compton effect amplitude (and therefore a pole of the other channel of this reaction, namely two-photon annihilation of an electron-positron pair). The formation of a virtual photon by $e^{-} + e^{+} \to \gamma$ corresponds to a pole of the amplitude for the scattering of an electron by a positron, and therefore that of an electron by an electron. Two photons can give neither a virtual electron nor a virtual photon: the conversion $\gamma + \gamma \to e$ is forbidden by the conservation of charge or angular momentum, and $\gamma + \gamma \to \gamma$ by that of charge parity. Accordingly, the photon-photon scattering amplitude cannot involve pole diagrams.
The origin of the pole singularities of the scattering amplitudes, which has been discussed above on the basis of Feynman integrals, is really more general and is not dependent on perturbation theory. We shall show that such singularities arise simply as a consequence of the unitarity condition (71.2).

Let us assume that the intermediate states n which appear in (71.2) include a one-particle state. The contribution of this state is

$$(T_{fi} - T_{if}^{*})^{(\text{one-p})} = i(2\pi)^4 \sum_\lambda \int \delta^{(4)}(P_f - p)\,T_{fn}T_{in}^{*}\,\frac{V\,d^3p}{(2\pi)^3},$$

where p and $\lambda$ are the 4-momentum and helicity of the intermediate particle. The integration over $d^3p$ is replaced by one over $d^4p$ (in the range $p_0 = \varepsilon > 0$):

$$d^3p \to 2\varepsilon\,\delta(p^2 - M^2)\,d^4p,$$

where M is the mass of the intermediate particle. The integration eliminates the delta function $\delta^{(4)}(P_f - p)$; we then change from the amplitudes $T_{fi}$ to $M_{fi}$ by (64.10), obtaining

$$(M_{fi} - M_{if}^{*})^{(\text{one-p})} = 2\pi i\,\delta(p^2 - M^2) \sum_\lambda M_{fn}M_{in}^{*}. \qquad (79.3)$$

Assuming T and P invariance, we have (apart from a phase factor) $M_{if} = M_{f'i'}$, where the states i', f' differ from i, f only in the sign of the particle helicities (with the same momenta). Taking the sum of equation (79.3) and the corresponding equation for $M_{f'i'} - M_{i'f'}^{*}$, we have

$$\operatorname{im} \tilde M_{fi}^{(\text{one-p})} = -\pi\,\delta(p^2 - M^2)\,R, \qquad (79.4)$$

where

$$\tilde M_{fi} = M_{fi} + M_{f'i'}, \qquad R = -\sum_\lambda \left(M_{fn}M_{in}^{*} + M_{f'n}M_{i'n}^{*}\right).$$

Hence it follows that $\tilde M_{fi}$, as an analytic function of $p^2 = P_i^2 = P_f^2$, has a pole at $p^2 = M^2$. According to (75.18) the pole part is

$$\tilde M_{fi} = \frac{R}{p^2 - M^2 + i0}. \qquad (79.5)$$

Real transitions to a one-particle state are possible only for one value of $P_i^2 = P_f^2$, namely $M^2$. Thus we in fact obtain the scattering amplitude structure corresponding to a diagram of the form (79.1).

Finally, let us consider an important property of diagrams containing closed electron loops.
This property is easily derived by applying the concept of charge parity to a virtual photon: a virtual photon, like a real photon, must be assigned a definite (negative) charge parity.†

If a diagram contains a closed loop (with number of vertices N > 2), the amplitude for the process concerned must include not only that diagram but also another which differs only in the direction of traversal of the loop (if N = 2, there is evidently no distinguishable "direction of traversal"). If these loops are "cut out" along the broken lines which come to them, we obtain two loops, $\Pi_{\mathrm{I}}$ and $\Pi_{\mathrm{II}}$:

[Diagrams (79.6): the two loops $\Pi_{\mathrm{I}}$ and $\Pi_{\mathrm{II}}$, differing only in the direction of the arrows; figure not reproduced.]

which may be regarded as diagrams determining the amplitude for the process of conversion of one set of photons (real or virtual) into another; the number N is the sum of the numbers of initial and final photons. But the conservation of charge parity forbids the conversion of an even number of photons into an odd number. When N is odd, therefore, the sum of the expressions corresponding to the loops (79.6) must be zero. The total contribution to the scattering amplitude from two diagrams containing these loops as constituent parts is consequently zero also, a result known as Furry's theorem (W. H. Furry, 1937). Thus, in constructing the amplitude for a given process, we can ignore diagrams containing loops with an odd number of vertices.

This cancellation of diagrams occurs for the following reason. A closed electron loop corresponds to an expression (with given momenta $k_1, k_2, \ldots, k_N$ of the photon lines)

$$\int d^4p\ \operatorname{tr}\left[(\gamma e_1)G(p)(\gamma e_2)G(p + k_1)\ldots\right], \qquad (79.7)$$

where $p, p + k_1, \ldots$ are the momenta of the electron lines (which are not completely determined after the conservation laws have been applied at the vertices).
Let the operation of charge conjugation be applied to all matrices $\gamma^\mu$ and $G$, replacing them by $U_C^{-1}\gamma^\mu U_C$ and $U_C^{-1}GU_C$. The expression (79.7) is then unchanged, since the trace of a product of matrices is unaffected by such a transformation. According to (26.3),

$$U_C^{-1}\gamma^\mu U_C = -\tilde\gamma^\mu, \qquad (79.8)$$

and hence

$$U_C^{-1}G(p)U_C = \frac{-\tilde\gamma p + m}{p^2 - m^2} = \tilde G(-p). \qquad (79.9)$$

But the replacement of G(p) by the transposed matrix with the sign of p changed is clearly equivalent to a change in the direction of traversal of the loop, all the arrows being reversed. Thus this transformation changes one loop into the other, and there is a factor $(-1)^N$ from the change (79.8) at each vertex. Hence

$$\Pi_{\mathrm{I}} = (-1)^N\,\Pi_{\mathrm{II}}, \qquad (79.10)$$

i.e. the contributions from the two loops are the same when the number of vertices is even, but equal and opposite when this number is odd.

† This follows from the same arguments as were given at the end of §13 for a real photon, concerning the electromagnetic interaction operator acting at each vertex.

CHAPTER IX

INTERACTION OF ELECTRONS

§80. Scattering of an electron in an external field

Elastic scattering of an electron in a constant external field is a simple process which occurs even in the first approximation of perturbation theory (the first Born approximation). It corresponds to a diagram with one vertex:

[Diagram (80.1): an electron line p entering and p' leaving a single vertex, with an external-field line q attached; figure not reproduced.]

where p and p' are the initial and final 4-momenta of the electron, and q = p' - p. Since the electron energy is conserved in scattering in a constant field ($\varepsilon = \varepsilon'$), we have $q = (0, \mathbf{q})$.† The corresponding scattering amplitude is

$$M_{fi} = -e\,\bar u(p')\left[\gamma A^{(e)}(\mathbf{q})\right]u(p), \qquad (80.2)$$

where $A^{(e)}(\mathbf{q})$ is the component of the spatial Fourier resolution of the external field. The scattering cross-section is, according to (64.26),

$$d\sigma = \frac{1}{16\pi^2}\,|M_{fi}|^2\,do'. \qquad (80.3)$$

For an electrostatic field, $A^{(e)} = (A_0^{(e)}, 0)$, and hence

$$M_{fi} = -e\,\bar u(p')\gamma^0u(p)\,A_0^{(e)}(\mathbf{q}) = -e\,u^{*}(p')u(p)\,A_0^{(e)}(\mathbf{q}).$$
(80.4)

In the non-relativistic case, the bispinor amplitudes u(p) of the plane waves reduce to the non-relativistic (two-component) amplitudes. For scattering without change of polarization, $u' = u$, and $u^{*}u = 2m$ by the normalization condition chosen. Thus

$$d\sigma = \frac{m^2}{4\pi^2}\,|U(\mathbf{q})|^2\,do',$$

where $U(\mathbf{q}) = eA_0^{(e)}(\mathbf{q})$ is the Fourier component of the potential energy of the electron in the field; this expression is the same as the familiar Born's formula (QM, (126.4)).

† When there is an external field, such a diagram is, of course, not forbidden by the law of conservation of 4-momentum, as the diagram (73.19) with a real photon was: $q^2$, unlike the square of the 4-momentum of a real photon, need not be zero, and the component with the necessary q is automatically taken from the Fourier integral which represents the external field.

In the general relativistic case, the cross-section for scattering of unpolarized electrons is obtained by averaging $|M_{fi}|^2$ over initial polarizations and summing over final polarizations, i.e. by taking the quantity

$$\tfrac{1}{2}\sum_{\text{polar.}} |M_{fi}|^2,$$

where the summation is over the spin directions of the initial and final electrons; the factor ½ changes one of these summations into an averaging. According to the rules given in §65, we obtain

$$\tfrac{1}{2}\sum_{\text{polar.}} |M_{fi}|^2 = 2e^2 \operatorname{tr} \rho'(\gamma A^{(e)})\rho(\gamma A^{(e)*}) = \tfrac{1}{2}e^2\,|A_0^{(e)}(\mathbf{q})|^2 \operatorname{tr}\,(m + \gamma p')\gamma^0(m + \gamma p)\gamma^0.$$

To calculate the trace, we note that $\gamma^0(\gamma p)\gamma^0 = (\gamma\tilde p)$, where $\tilde p = (\varepsilon, -\mathbf{p})$, and therefore

$$\tfrac{1}{4}\operatorname{tr}\,(m + \gamma p')\gamma^0(m + \gamma p)\gamma^0 = \tfrac{1}{4}\operatorname{tr}\,(m + \gamma p')(m + \gamma\tilde p) = m^2 + p'\tilde p = \varepsilon^2 + m^2 + \mathbf{p}\cdot\mathbf{p}' = 2\varepsilon^2 - \tfrac{1}{2}\mathbf{q}^2.$$

Hence the cross-section is

$$d\sigma = \frac{e^2\,|A_0^{(e)}(\mathbf{q})|^2}{4\pi^2}\,\varepsilon^2\left(1 - \frac{\mathbf{q}^2}{4\varepsilon^2}\right)do'. \qquad (80.5)$$

For a field due to a static distribution of charge with density $\rho(\mathbf{r})$, we have

$$A_0^{(e)}(\mathbf{q}) = 4\pi\rho(\mathbf{q})/\mathbf{q}^2, \qquad (80.6)$$

where $\rho(\mathbf{q})$ is the Fourier transform of the distribution $\rho(\mathbf{r})$ (the form factor). In particular, for the Coulomb field of a point charge Ze we have $\rho(\mathbf{q}) = Ze$. The cross-section is then

$$d\sigma = do'\,\frac{4Z^2e^4\varepsilon^2}{\mathbf{q}^4}\left(1 - \frac{\mathbf{q}^2}{4\varepsilon^2}\right) \qquad (80.7)$$

(N. F. Mott, 1929).
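Formula (80.7) is straightforward to evaluate numerically. The sketch below is only an illustration under stated assumptions: units with ħ = c = 1, the numerical value e² ≈ 1/137.036, and the elastic-scattering relation q² = 4p² sin²(θ/2) between momentum transfer and scattering angle. The function names `coulomb_prefactor` and `mott` are hypothetical.

```python
from math import sin, pi

E2 = 1.0 / 137.036  # assumed value of e^2 in units with hbar = c = 1

def momentum_transfer_sq(eps, m, theta):
    """q^2 = 4 p^2 sin^2(theta/2) for elastic scattering, with p^2 = eps^2 - m^2."""
    p2 = eps * eps - m * m
    return 4.0 * p2 * sin(theta / 2.0) ** 2

def coulomb_prefactor(Z, eps, m, theta):
    """Factor 4 Z^2 e^4 eps^2 / q^4 standing in front of the parenthesis in (80.7)."""
    q2 = momentum_transfer_sq(eps, m, theta)
    return 4.0 * (Z * E2) ** 2 * eps * eps / (q2 * q2)

def mott(Z, eps, m, theta):
    """d(sigma)/do' from (80.7), in units of (1/energy)^2."""
    q2 = momentum_transfer_sq(eps, m, theta)
    return coulomb_prefactor(Z, eps, m, theta) * (1.0 - q2 / (4.0 * eps * eps))

if __name__ == "__main__":
    m, eps = 0.511, 5.0  # electron mass and total energy in MeV (example values)
    for theta in (0.1, pi / 2, pi):
        ratio = mott(1, eps, m, theta) / coulomb_prefactor(1, eps, m, theta)
        print(f"theta = {theta:.3f}   spin factor = {ratio:.6f}")
```

At θ = π the spin factor reduces to 1 - p²/ε² = m²/ε², which shows how strongly backward scattering is suppressed for a fast electron.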
The quantity $\mathbf{q}^2 = 4\mathbf{p}^2\sin^2\tfrac{1}{2}\theta$, where $\theta$ is the scattering angle. The angular dependence of the quantity preceding the parenthesis is therefore that of a Rutherford cross-section:

$$d\sigma_{\mathrm{Ru}} = do'\,\frac{4(Ze^2)^2\varepsilon^2}{\mathbf{q}^4} = do'\,\frac{(Ze^2)^2\varepsilon^2}{4\mathbf{p}^4\sin^4\tfrac{1}{2}\theta}; \qquad (80.8)$$

in the non-relativistic limit, $\varepsilon^2/\mathbf{p}^4 \to 1/m^2v^4$. Thus†

$$d\sigma = d\sigma_{\mathrm{Ru}}\left(1 - v^2\sin^2\tfrac{1}{2}\theta\right). \qquad (80.9)$$

In the ultra-relativistic case, the angular distribution differs from the non-relativistic case in that there is much less backward scattering: as $\theta \to \pi$, $d\sigma/d\sigma_{\mathrm{Ru}} \to m^2/\varepsilon^2$.

In the ultra-relativistic case, formula (80.7) gives for small-angle scattering

$$d\sigma = \frac{4Z^2e^4}{\varepsilon^2\theta^4}\,do'. \qquad (80.10)$$

Although this formula has been derived in the Born approximation (i.e. on the assumption that $Ze^2 \ll 1$), it remains valid (for angles $\theta \lesssim m/\varepsilon$) even if $Ze^2 \sim 1$. This can be seen by using the ultra-relativistic wave function $\psi$ (39.10), which is exact as regards $Ze^2$. This solution, which is valid in the range (39.2), of course remains valid in the asymptotic range $r \to \infty$. Here

$$F \sim 1 + \text{constant} \times e^{i(pr - \mathbf{p}\cdot\mathbf{r})}, \qquad \frac{\boldsymbol{\alpha}\cdot\nabla F}{\varepsilon} \sim 1 - \cos\theta \sim \theta^2 \ll 1,$$

so that the correction term remains small, as it should. The wave function of the form $e^{i\mathbf{p}\cdot\mathbf{r}}F$, which has the same form as the non-relativistic function (with an obvious change of parameters), has the same asymptotic expression, and therefore the cross-section is given by the Rutherford formula.

To calculate the scattering cross-section for electrons with any polarization, we could use the density matrix (29.13), following the general procedure. In this case, however, the result can be more readily obtained by expressing the bispinor amplitudes u(p') and u(p) in the form (23.9).
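† The difference between $d\sigma$ and $d\sigma_{\mathrm{Ru}}$ shown by formula (80.9) is specific to particles with spin ½. In the scattering of particles with spin 0, if their motion in the electromagnetic field is described by the wave equation, the result is $d\sigma = d\sigma_{\mathrm{Ru}}$. At first sight it might appear puzzling that the factor expressing this purely quantum effect does not contain ħ. However, it must be remembered that the condition for the Born approximation to be valid ($e^2/\hbar v \ll 1$) is contrary to the condition for quasi-classical motion in a Coulomb field, and therefore formula (80.9) cannot be taken to the classical limit.

Multiplication gives

$$u^{*}(p')u(p) = w'^{*}\{\varepsilon + m + (\varepsilon - m)(\mathbf{n}'\cdot\boldsymbol{\sigma})(\mathbf{n}\cdot\boldsymbol{\sigma})\}w,$$

or, using (33.5),

$$u^{*}(p')u(p) = w'^{*}fw, \qquad (80.11)$$

where†

$$f = A + B\,\boldsymbol{\nu}\cdot\boldsymbol{\sigma}, \qquad A = (\varepsilon + m) + (\varepsilon - m)\cos\theta, \qquad B = -i(\varepsilon - m)\sin\theta, \qquad \boldsymbol{\nu} = \mathbf{n}\times\mathbf{n}'/\sin\theta. \qquad (80.12)$$

The two-component quantity (three-dimensional spinor) w is the non-relativistic spin wave function of the electron. The change to the partially polarized states is therefore made by replacing the products $w_\alpha w_\beta^{*}$ (where $\alpha$, $\beta$ are spinor indices) by the non-relativistic two-rowed density matrix $\rho_{\alpha\beta}$. Thus we must put

$$|M_{fi}|^2 \to e^2\,|A_0^{(e)}(\mathbf{q})|^2 \operatorname{tr} \rho\,(A - B\,\boldsymbol{\nu}\cdot\boldsymbol{\sigma})\,\rho'\,(A + B\,\boldsymbol{\nu}\cdot\boldsymbol{\sigma}),$$

where

$$\rho = \tfrac{1}{2}(1 + \boldsymbol{\sigma}\cdot\boldsymbol{\zeta}), \qquad \rho' = \tfrac{1}{2}(1 + \boldsymbol{\sigma}\cdot\boldsymbol{\zeta}'),$$

and $\boldsymbol{\zeta}$, $\boldsymbol{\zeta}'$ are the vectors of the initial polarization and the final polarization selected by the detector. The result of calculating the trace is

$$d\sigma = \tfrac{1}{2}\,d\sigma_0\left\{1 + \frac{(A^2 - |B|^2)\,\boldsymbol{\zeta}\cdot\boldsymbol{\zeta}' + 2|B|^2(\boldsymbol{\nu}\cdot\boldsymbol{\zeta})(\boldsymbol{\nu}\cdot\boldsymbol{\zeta}') + 2A|B|\,\boldsymbol{\nu}\cdot\boldsymbol{\zeta}\times\boldsymbol{\zeta}'}{A^2 + |B|^2}\right\}, \qquad (80.13)$$

where $d\sigma_0$ is the scattering cross-section for unpolarized electrons. Expressing the quantity in the braces in (80.13) in the form $\{1 + \boldsymbol{\zeta}^{(f)}\cdot\boldsymbol{\zeta}'\}$, we find the polarization $\boldsymbol{\zeta}^{(f)}$ of the final electron itself, as opposed to the detected polarization $\boldsymbol{\zeta}'$ (see §65):‡

$$\boldsymbol{\zeta}^{(f)} = \frac{(A^2 - |B|^2)\,\boldsymbol{\zeta} + 2|B|^2(\boldsymbol{\nu}\cdot\boldsymbol{\zeta})\,\boldsymbol{\nu} + 2A|B|\,\boldsymbol{\nu}\times\boldsymbol{\zeta}}{A^2 + |B|^2}. \qquad (80.14)$$

We see that the scattered electrons are polarized only if the incident electrons are polarized. This is a general property of the first Born approximation; cf. QM, §140.

In the non-relativistic case ($\varepsilon \to m$), (80.14) gives $\boldsymbol{\zeta}^{(f)} = \boldsymbol{\zeta}$, i.e.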
the electron retains its polarization on scattering, a natural consequence of the neglect of spin-orbit interaction. In the opposite (ultra-relativistic) case, we have

$$A = \varepsilon(1 + \cos\theta), \qquad B = -i\varepsilon\sin\theta,$$

in accordance with the general formula (38.2).

† The definition of f used here differs by a factor from that in §§37 and 38.

‡ Formula (80.14) corresponds to that derived in QM, §140, Problem 1, and is obtained from it by taking A real and B imaginary.

If the incident electron has a definite helicity ($\boldsymbol{\zeta} = 2\lambda\mathbf{n}$, $\lambda = \pm\tfrac{1}{2}$), (80.14) gives after a simple calculation $\boldsymbol{\zeta}^{(f)} = 2\lambda\mathbf{n}'$. Thus the electron remains helical after scattering, with the same value ($\lambda$) of the helicity.

This property occurs because, as already mentioned in §38, when the mass is neglected Dirac's equation in the spinor representation separates into two independent equations for the functions $\xi$ and $\eta$. The result has also a more general significance, since the current

$$j = (\xi^{*}\xi + \eta^{*}\eta,\ \xi^{*}\boldsymbol{\sigma}\xi - \eta^{*}\boldsymbol{\sigma}\eta),$$

and therefore the electromagnetic perturbation operator $V = e\,jA$, do not contain mixed terms in $\xi$ and $\eta$, and thus have no matrix elements for transitions between $\xi$ states and $\eta$ states. Hence it follows that, if an ultra-relativistic electron has a definite helicity (i.e. if only one of $\xi$ and $\eta$ is non-zero), this helicity is conserved in interaction processes in an approximation corresponding to completely neglecting the electron mass.

§81. Scattering of electrons and positrons by an electron

Let us consider the scattering of an electron by an electron, in which two electrons with 4-momenta $p_1, p_2$ collide and emerge with 4-momenta $p_1', p_2'$. The conservation of 4-momentum is expressed by

$$p_1 + p_2 = p_1' + p_2'. \qquad (81.1)$$

We shall use the kinematic invariants of §66, defined by

$$s = (p_1 + p_2)^2 = 2(m^2 + p_1p_2),$$
$$t = (p_1 - p_1')^2 = 2(m^2 - p_1p_1'),$$
$$u = (p_1 - p_2')^2 = 2(m^2 - p_1p_2'),$$
$$s + t + u = 4m^2.$$
[fi 1.1) The process in question is represented by the two Feynman diagrams (73.13), (73.14), and its amplitude ist M,, = 4 i r e 2 { y ( f i f r * ^ (81.3) t This form of Mfi is in accordance with the general expression (70.5). In the first non-vanishing approximation of perturbation theory, only one of the five invariant amplitudes is non-zero: / 3 (f,u) = 4<rre2lL §81 Interaction of Electrons 322 According to the rules given in §65 for the states of initial and final particles described by polarization density matrices pu p i , . . . , we make the change |M/,| 2 ^ 16TTVlptr(p27>27") tr (piyMp,7»<) + jptr (p\y*p2yv)tr(P27MPI7.)- - ^ tr (p27MP27"pîyiiPi7J ~ JH tr (PI7* 1 P27 , 'P27MPI7,)]- (81-4) For the scattering of unpolarized electrons (without regard to their polarization after scattering), we must put for all the density matrices p = i ( 7 p + m), and multiply the result by 2 x 2 = 4 (averaging over the polarizations of the two initial electrons, and summation over the polarizations of the two final electrons). The scattering cross-section is given by formula (64.23), in which, by (64.15a), 72 = \s(s - 4m2). It may be written da = dt Aire* s(s - 4 m T, ) {fit, u) + git, u) + fiu, t) + g(u, r)}, fit, u) = j^p tr [(7P2 + m)7"(7P2 + m)7"] tr [(7p{ + m)yiliypx + m)yv], (81.5) git, u) = - j ^ tr [iyp'2 + m)7M(7p2 + m)7"(7p{ + m)y^(ypl + m)7„]. In f(t, u) the traces are first calculated (using (22.9), (22.10)), followed by summation over /x and i>;t in g(t, u) the summation over p, and v is taken first, using formulae (22.6). The result is fit, u) = p [(p,P2)2 + iPxPif + 2m2(m2 ~piP[)], git, u) = — ( p , p 2 - 2m2)(p,p2), or, in terms of the invariants (81.2), /(r,u) = p [ i ( s 2 + u 2 ) + 4 m 2 ( f - m 2 ) ] , (81.6) git, u) = giu, t) = ^ ils - m2)i\s - 3m2). t The following formula is given for future reference: 4 tr (yp\ + m)y*(yp2 + m)yy = g*v(m2- p\p2) + ptpi + pïpÇ. 
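The two forms of $f$ and $g$ quoted in (81.6) can be compared numerically. The following is only a sketch: the function names are hypothetical helpers introduced here, the centre-of-mass kinematics and the numerical values of mass, momentum and angle are arbitrary illustrative choices, and the metric signature is assumed to be $(+,-,-,-)$.

```python
import math

def dot(a, b):
    """Minkowski product with signature (+, -, -, -)."""
    return a[0]*b[0] - a[1]*b[1] - a[2]*b[2] - a[3]*b[3]

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def cm_momenta(m, p, theta):
    """Elastic ee -> ee kinematics in the centre-of-mass frame (illustrative)."""
    e = math.sqrt(m*m + p*p)
    p1 = (e, 0.0, 0.0, p)
    p2 = (e, 0.0, 0.0, -p)
    p1f = (e, p*math.sin(theta), 0.0, p*math.cos(theta))
    p2f = (e, -p*math.sin(theta), 0.0, -p*math.cos(theta))
    return p1, p2, p1f, p2f

def invariants(p1, p2, p1f, p2f):
    """s, t, u as defined in (81.2)."""
    s = dot(add(p1, p2), add(p1, p2))
    t = dot(sub(p1, p1f), sub(p1, p1f))
    u = dot(sub(p1, p2f), sub(p1, p2f))
    return s, t, u

def f_from_products(m, p1, p2, p1f, p2f, t):
    """f(t, u) written through the scalar products of the momenta."""
    return 2.0/t**2 * (dot(p1, p2)**2 + dot(p1, p2f)**2
                       + 2*m*m*(m*m - dot(p1, p1f)))

def f_from_invariants(m, s, t, u):
    """f(t, u) written through s, t, u, as in (81.6)."""
    return (0.5*(s*s + u*u) + 4*m*m*(t - m*m)) / t**2

def g_from_products(m, p1, p2, t, u):
    return 2.0/(t*u) * (dot(p1, p2) - 2*m*m) * dot(p1, p2)

def g_from_invariants(m, s, t, u):
    return 2.0/(t*u) * (0.5*s - m*m) * (0.5*s - 3*m*m)

if __name__ == "__main__":
    m = 1.0
    p1, p2, p1f, p2f = cm_momenta(m, 0.7, 1.1)
    s, t, u = invariants(p1, p2, p1f, p2f)
    print("s + t + u - 4m^2 =", s + t + u - 4*m*m)
    print("f:", f_from_products(m, p1, p2, p1f, p2f, t), f_from_invariants(m, s, t, u))
    print("g:", g_from_products(m, p1, p2, t, u), g_from_invariants(m, s, t, u))
```

The two evaluations of each function should agree to rounding error for any physical choice of the parameters, since the relations $p_1p_2 = \tfrac12 s - m^2$, $p_1p_2' = m^2 - \tfrac12 u$ and $m^2 - p_1p_1' = \tfrac12 t$ follow directly from (81.2).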
Thus the cross-section is

$$ d\sigma = r_e^2\,\frac{4\pi m^2\,dt}{s(s - 4m^2)}\left\{\frac{1}{t^2}[\tfrac12(s^2 + u^2) + 4m^2(t - m^2)] + \frac{1}{u^2}[\tfrac12(s^2 + t^2) + 4m^2(u - m^2)] + \frac{4}{tu}(\tfrac12 s - m^2)(\tfrac12 s - 3m^2)\right\}, \tag{81.7} $$

where $r_e = e^2/m$.

In the centre-of-mass system, we have

$$ s = 4\varepsilon^2, \qquad t = -4\mathbf{p}^2\sin^2\tfrac12\theta, \qquad u = -4\mathbf{p}^2\cos^2\tfrac12\theta, \qquad -dt = -2\mathbf{p}^2\,d\cos\theta = (\mathbf{p}^2/\pi)\,do, \tag{81.8} $$

where $|\mathbf{p}|$ and $\varepsilon$ are the magnitude of the momentum and the energy of the electrons, which are unchanged in the scattering, and $\theta$ is the scattering angle. In the non-relativistic case ($\varepsilon \approx m$),† we obtain for the cross-section

$$ d\sigma = r_e^2\,\frac{\pi m^4\,dt}{\mathbf{p}^2}\left(\frac{1}{t^2} + \frac{1}{u^2} - \frac{1}{tu}\right) = \left(\frac{e^2}{mv^2}\right)^2\left(\frac{1}{\sin^4\tfrac12\theta} + \frac{1}{\cos^4\tfrac12\theta} - \frac{1}{\sin^2\tfrac12\theta\,\cos^2\tfrac12\theta}\right)do = \left(\frac{e^2}{mv^2}\right)^2\frac{4(1 + 3\cos^2\theta)}{\sin^4\theta}\,do \quad \text{(non-relativistic)}, \tag{81.9} $$

where $v = 2p/m$ is the relative velocity of the electrons, in accordance with the non-relativistic theory (see QM, §137).

† The velocity $v$ is assumed small ($v \ll 1$) but such that the condition for perturbation theory to be applicable is still satisfied: $e^2/v\ (= e^2/\hbar v) \ll 1$.

In the general case of arbitrary velocities, formula (81.7) with the substitution (81.8) can easily be brought to the form

$$ d\sigma = r_e^2\,\frac{m^2(\varepsilon^2 + \mathbf{p}^2)^2}{4\mathbf{p}^4\varepsilon^2}\left[\frac{4}{\sin^4\theta} - \frac{3}{\sin^2\theta} + \left(\frac{\mathbf{p}^2}{\varepsilon^2 + \mathbf{p}^2}\right)^2\left(1 + \frac{4}{\sin^2\theta}\right)\right]do \tag{81.10} $$

(C. Møller, 1932). In the ultra-relativistic case ($\mathbf{p}^2 \approx \varepsilon^2$),

$$ d\sigma = r_e^2\,\frac{m^2}{\varepsilon^2}\,\frac{(3 + \cos^2\theta)^2}{4\sin^4\theta}\,do \quad \text{(ultra-relativistic)}. \tag{81.11} $$

In the laboratory system, where one of the electrons (say electron 2) is at rest before the collision, the cross-section can be expressed in terms of the quantity

$$ \Delta = \frac{\varepsilon_1 - \varepsilon_1'}{m} = \frac{\varepsilon_2' - m}{m}, \tag{81.12} $$

the energy (in units of $m$) transferred by the incident electron (electron 1) to electron 2.† The invariants are

$$ s = 2m(m + \varepsilon_1), \qquad t = -2m^2\Delta, \qquad u = -2m(\varepsilon_1 - m - m\Delta). \tag{81.13} $$

† The kinematic relations for elastic collisions in various frames of reference are given in Fields, §13.

Substitution of these expressions in (81.7) gives the following formula for the energy distribution of the secondary electrons (called $\delta$ electrons) formed in the scattering of fast primary electrons:

$$ d\sigma = 2\pi r_e^2\,\frac{d\Delta}{\gamma^2 - 1}\left\{\frac{(\gamma - 1)^2\gamma^2}{\Delta^2(\gamma - 1 - \Delta)^2} - \frac{2\gamma^2 + 2\gamma - 1}{\Delta(\gamma - 1 - \Delta)} + 1\right\}, \tag{81.14} $$

where $\gamma = \varepsilon_1/m$. The quantities $m\Delta$ and $m(\gamma - 1 - \Delta)$ are the kinetic energies of the two electrons after the collision; the identity of the two particles is shown here by the symmetry of the formula with respect to these quantities. If the term "recoil electron" is arbitrarily applied to the electron with the smaller energy, $\Delta$ takes values from 0 to $\tfrac12(\gamma - 1)$. When $\Delta$ is small, formula (81.14) becomes

$$ d\sigma = 2\pi r_e^2\,\frac{\gamma^2}{\gamma^2 - 1}\,\frac{d\Delta}{\Delta^2} = \frac{2\pi r_e^2}{v_1^2}\,\frac{d\Delta}{\Delta^2}, \qquad \Delta \ll \gamma - 1. \tag{81.15} $$

This formula, if expressed in terms of the velocity of the incident electron ($v_1 = |\mathbf{p}_1|/\varepsilon_1$), retains the same form in the non-relativistic case. Its form is naturally, therefore, the same as that of the result given by the non-relativistic theory (cf. QM, (148.17)).

Let us now consider the scattering of a positron by an electron (H. J. Bhabha, 1936). This is another cross-channel of the same general reaction as the electron-electron scattering. If $p_-$, $p_+$ are the initial momenta of the electron and positron, and $p_-'$, $p_+'$ their final momenta, the change from one case to the other is made by the substitutions

$$ p_1 \to -p_+', \qquad p_2 \to p_-, \qquad p_1' \to -p_+, \qquad p_2' \to p_-'. $$

The kinematic invariants (81.2) become

$$ s = (p_- - p_+')^2, \qquad t = (p_+ - p_+')^2, \qquad u = (p_- + p_+)^2. \tag{81.16} $$

If $ee$ scattering is the $s$ channel, $\bar ee$ scattering is the $u$ channel of the reaction.

The square of the scattering amplitude, expressed in terms of $s$, $t$ and $u$, remains as before; in the denominator of (81.5), $s$ must be replaced by $u$. Thus the cross-section for scattering of a positron by an electron is, instead of (81.7),

$$ d\sigma = r_e^2\,\frac{4\pi m^2\,dt}{u(u - 4m^2)}\left\{\frac{1}{t^2}[\tfrac12(s^2 + u^2) + 4m^2(t - m^2)] + \frac{1}{u^2}[\tfrac12(s^2 + t^2) + 4m^2(u - m^2)] + \frac{4}{tu}(\tfrac12 s - m^2)(\tfrac12 s - 3m^2)\right\}. \tag{81.17} $$

In the centre-of-mass system, the values of the invariants $s$, $t$, $u$ differ from (81.8) by the interchange of $s$ and $u$:

$$ s = -4\mathbf{p}^2\cos^2\tfrac12\theta, \qquad t = -4\mathbf{p}^2\sin^2\tfrac12\theta, \qquad u = 4\varepsilon^2. \tag{81.18} $$

In the non-relativistic limit, formula (81.17) reduces to Rutherford's formula:

$$ d\sigma = \left(\frac{e^2}{mv^2}\right)^2\frac{do}{\sin^4\tfrac12\theta} \quad \text{(non-relativistic)}, \tag{81.19} $$

where $v = 2p/m$. This comes from the first term in the braces in (81.17), which originates from the "scattering"-type diagram (see §73). The contributions from the "annihilation" diagram (the second term in (81.17)) and from its interference with the scattering diagram (the third term) vanish in the non-relativistic limit.†

In the general case of arbitrary velocities, the contributions of all three terms in (81.17) are of the same order of magnitude; the first term predominates only at small angles, because of the factor $t^{-2} \propto \sin^{-4}\tfrac12\theta$. Combining like terms, we can write the cross-section for scattering of a positron by an electron (in the centre-of-mass system) in the form

$$ d\sigma = do\,\frac{r_e^2m^2}{16\varepsilon^2}\left\{\frac{(\varepsilon^2 + \mathbf{p}^2)^2}{\mathbf{p}^4\sin^4\tfrac12\theta} - \frac{8\varepsilon^4 - m^4}{\mathbf{p}^2\varepsilon^2\sin^2\tfrac12\theta} + \frac{12\varepsilon^4 + m^4}{\varepsilon^4} - \frac{4\mathbf{p}^2(\varepsilon^2 + \mathbf{p}^2)}{\varepsilon^4}\sin^2\tfrac12\theta + \frac{4\mathbf{p}^4}{\varepsilon^4}\sin^4\tfrac12\theta\right\}. \tag{81.20} $$

The symmetry with respect to $\theta$ and $\pi - \theta$ which is typical of scattering involving identical particles does not, of course, occur when a positron is scattered by an electron. In the ultra-relativistic limit, the expression (81.20) differs from the electron-electron cross-section only by the factor $\cos^4\tfrac12\theta$:

$$ d\sigma_{\bar ee} = \cos^4\tfrac12\theta\ d\sigma_{ee} \quad \text{(ultra-relativistic)}. \tag{81.21} $$

In the laboratory system, where one of the particles (say the electron) is at rest before the collision, we again define

$$ \Delta = \frac{\varepsilon_+ - \varepsilon_+'}{m} = \frac{\varepsilon_-' - m}{m}, \tag{81.22} $$

i.e. the energy transferred by the positron to the electron. As in (81.13), we now have

$$ s = -2m(\varepsilon_+ - m - m\Delta), \qquad t = -2m^2\Delta, \qquad u = 2m(m + \varepsilon_+). $$
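The reductions of the invariant formulae (81.7) and (81.17) to the frame-specific forms (81.10), (81.14) and (81.20) are purely algebraic, so they can be spot-checked numerically. The sketch below does this in units $r_e = 1$; all function names are hypothetical helpers introduced here, the test values are arbitrary, and $|dt| = (\mathbf{p}^2/\pi)\,do$ (centre-of-mass) or $|dt| = 2m^2\,d\Delta$ (laboratory) is assumed when converting $dt$.

```python
import math

def braces(m, s, t, u):
    """Expression in braces common to (81.7) and (81.17)."""
    term_t = (0.5*(s*s + u*u) + 4*m*m*(t - m*m)) / t**2
    term_u = (0.5*(s*s + t*t) + 4*m*m*(u - m*m)) / u**2
    term_tu = 4.0/(t*u) * (0.5*s - m*m) * (0.5*s - 3*m*m)
    return term_t + term_u + term_tu

def moller_from_invariants(m, p, theta):
    """d(sigma)/do from (81.7) with the substitution (81.8)."""
    e2 = m*m + p*p
    s = 4*e2
    t = -4*p*p*math.sin(theta/2)**2
    u = -4*p*p*math.cos(theta/2)**2
    return 4*math.pi*m*m/(s*(s - 4*m*m)) * braces(m, s, t, u) * p*p/math.pi

def moller_cm(m, p, theta):
    """d(sigma)/do from the closed form (81.10)."""
    e2 = m*m + p*p
    st2 = math.sin(theta)**2
    pref = m*m*(e2 + p*p)**2 / (4*p**4*e2)
    return pref*(4/st2**2 - 3/st2 + (p*p/(e2 + p*p))**2*(1 + 4/st2))

def bhabha_from_invariants(m, p, theta):
    """d(sigma)/do from (81.17) with the invariants (81.18)."""
    e2 = m*m + p*p
    s = -4*p*p*math.cos(theta/2)**2
    t = -4*p*p*math.sin(theta/2)**2
    u = 4*e2
    return 4*math.pi*m*m/(u*(u - 4*m*m)) * braces(m, s, t, u) * p*p/math.pi

def bhabha_cm(m, p, theta):
    """d(sigma)/do from the closed form (81.20)."""
    e2 = m*m + p*p
    a = math.sin(theta/2)**2
    return m*m/(16*e2)*((e2 + p*p)**2/(p**4*a*a)
                        - (8*e2*e2 - m**4)/(p*p*e2*a)
                        + (12*e2*e2 + m**4)/e2**2
                        - 4*p*p*(e2 + p*p)/e2**2*a
                        + 4*p**4/e2**2*a*a)

def delta_from_invariants(gamma, delta):
    """d(sigma)/d(Delta) / (2 pi r_e^2) from (81.7) with (81.13), m = 1."""
    s = 2*(1 + gamma)
    t = -2*delta
    u = -2*(gamma - 1 - delta)
    return 4.0/(s*(s - 4.0)) * braces(1.0, s, t, u)

def delta_lab(gamma, delta):
    """d(sigma)/d(Delta) / (2 pi r_e^2) from the closed form (81.14)."""
    k = gamma - 1 - delta
    return ((gamma - 1)**2*gamma**2/(delta**2*k**2)
            - (2*gamma**2 + 2*gamma - 1)/(delta*k) + 1)/(gamma**2 - 1)

if __name__ == "__main__":
    m, p, th = 1.0, 0.8, 0.9
    print(moller_from_invariants(m, p, th), moller_cm(m, p, th))
    print(bhabha_from_invariants(m, p, th), bhabha_cm(m, p, th))
    print(delta_from_invariants(3.0, 0.4), delta_lab(3.0, 0.4))
```

At large momentum the ratio `bhabha_cm / moller_cm` should approach $\cos^4\tfrac12\theta$, which is the content of (81.21).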
Substitution of these expressions in (81.17) easily gives the following formula for the energy distribution of the secondary electrons:

$$ d\sigma = 2\pi r_e^2\,\frac{d\Delta}{\gamma^2 - 1}\left\{\frac{\gamma^2}{\Delta^2} - \frac{2\gamma^2 + 4\gamma + 1}{(\gamma + 1)\Delta} + \frac{3\gamma^2 + 6\gamma + 4}{(\gamma + 1)^2} - \frac{2\gamma}{(\gamma + 1)^2}\,\Delta + \frac{1}{(\gamma + 1)^2}\,\Delta^2\right\}, \tag{81.23} $$

where $\gamma = \varepsilon_+/m$; $\Delta$ varies from 0 to $\gamma - 1$. When $\Delta \ll \gamma - 1$, (81.23) leads to the same formula (81.15) as for electron scattering.

† See (83.4) and (83.20) for the passage to the non-relativistic limit in the scattering and annihilation terms in the scattering amplitude. The latter term contains a factor $1/c^2$, and therefore tends to zero.

The polarization effects in the scattering of electrons or positrons are calculated by the general rules given in §65. In all but special cases, the resulting formulae are lengthy. Here we shall give only some comments.†

In the approximation considered (the first non-vanishing approximation of perturbation theory), the cross-section contains no terms linear in the polarization vectors of the initial or final particles. As in the non-relativistic theory (QM, §140), such terms are forbidden in consequence of the requirement for the scattering matrix to be Hermitian. The scattering cross-section is therefore unchanged if only one of the colliding particles is polarized; and unpolarized particles do not become polarized as a result of scattering. The same conditions prohibit correlation terms in the cross-section which contain the products of the polarizations of three of the particles (initial and final) concerned in the process.

The cross-section does, however, contain double and quadruple correlation terms. In the scattering of unlike particles (electron and positron, electron and muon), these terms vanish in the non-relativistic limit, since there is no spin-orbit interaction. In collisions of like particles, however, there are correlation terms even in the non-relativistic case, because of exchange effects.

PROBLEMS

PROBLEM 1.
Determine the scattering cross-section for polarized electrons in the non-relativistic case.

SOLUTION. In the non-relativistic case, the bispinor amplitudes in the standard representation have two components, and the density matrices are the two-rowed matrices (29.20). In the scattering amplitude (81.3), the only non-zero terms are those with $\mu = \nu = 0$, which contain matrices $\gamma^0$ that are diagonal (in the standard representation). Instead of (81.4) we have

$$ \sum_{\rm polar}|M_{fi}|^2 = 16\pi^2e^4\cdot4m^4\cdot4\left[\frac{1}{t^2} + \frac{1}{u^2} - \frac{1}{tu}(1 + \boldsymbol{\zeta}_1\cdot\boldsymbol{\zeta}_2)\right], $$

the summation being over the polarizations of the final electrons. Hence the scattering cross-section is

$$ d\sigma = d\sigma_0\left(1 - \frac{\sin^2\theta}{1 + 3\cos^2\theta}\,\boldsymbol{\zeta}_1\cdot\boldsymbol{\zeta}_2\right), $$

where $\theta$ is the scattering angle in the centre-of-mass system and $d\sigma_0$ the scattering cross-section (81.9) for unpolarized particles. For completely polarized electrons, this formula is the same as the result in QM, §137, Problem, with $|\boldsymbol{\zeta}_1| = |\boldsymbol{\zeta}_2| = 1$, $\boldsymbol{\zeta}_1\cdot\boldsymbol{\zeta}_2 = \cos\alpha$, where $\alpha$ is the angle between the directions of polarization of the electrons.

For the scattering of positrons by electrons, there is no dependence on the polarization in this approximation ($d\sigma = d\sigma_0$); this is easily seen by noticing that, in the non-relativistic limit, different pairs of components are non-zero in the electron and positron amplitudes $u_p$ and $u_{-p}$.

† For further details see the paper by W. H. McMaster, Reviews of Modern Physics 33, 8, 1961.

PROBLEM 2. In the non-relativistic case, determine the polarization of scattered electrons in the scattering of an unpolarized beam by a polarized target.

SOLUTION. We can calculate the scattering cross-section for given initial polarization $\boldsymbol{\zeta}_2$ and detected final polarization $\boldsymbol{\zeta}_1'$; only the polarization of one final electron is detected. By the same method as in Problem 1, we find

$$ d\sigma = \tfrac12\,d\sigma_0\left[1 - \boldsymbol{\zeta}_2\cdot\boldsymbol{\zeta}_1'\,\frac{2\cos\theta\,(1 - \cos\theta)}{1 + 3\cos^2\theta}\right]. $$

The polarization vector of the scattered electron is therefore

$$ \boldsymbol{\zeta}_1'^{(f)} = -\boldsymbol{\zeta}_2\,\frac{2\cos\theta\,(1 - \cos\theta)}{1 + 3\cos^2\theta}. $$

PROBLEM 3.
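In the non-relativistic case, determine the probability of spin reversal of a completely polarized electron scattered by an unpolarized electron.

SOLUTION. We similarly find the cross-section for given polarizations $\boldsymbol{\zeta}_1$ and $\boldsymbol{\zeta}_1'$:

$$ d\sigma = \tfrac12\,d\sigma_0\left[1 + \boldsymbol{\zeta}_1\cdot\boldsymbol{\zeta}_1'\,\frac{2\cos\theta\,(1 + \cos\theta)}{1 + 3\cos^2\theta}\right]. $$

Putting $\boldsymbol{\zeta}_1\cdot\boldsymbol{\zeta}_1' = -1$, we then find the probability of reversal of the spin direction:

$$ \frac{d\sigma}{d\sigma_0} = \frac{(1 - \cos\theta)^2}{2(1 + 3\cos^2\theta)}. $$

PROBLEM 4. Determine the ratio of the scattering cross-sections for helical electrons with parallel and antiparallel spins, in the ultra-relativistic case.

SOLUTION. In (81.4) we must put, according to (29.22),

$$ \rho_1 = \tfrac12(\gamma p_1)(1 - 2\lambda_1\gamma^5), \qquad \rho_2 = \tfrac12(\gamma p_2)(1 - 2\lambda_2\gamma^5), \qquad \rho_1' = \tfrac12\gamma p_1', \qquad \rho_2' = \tfrac12\gamma p_2', $$

where $\lambda_1, \lambda_2 = \pm\tfrac12$. The traces are calculated by the formulae given in §22; in particular,

$$ \tfrac14\,\mathrm{tr}\,(\gamma^\mu(\gamma a)\gamma^\nu(\gamma b)\gamma^5)\cdot\tfrac14\,\mathrm{tr}\,(\gamma_\mu(\gamma c)\gamma_\nu(\gamma d)\gamma^5) = (ie^{\mu\lambda\nu\rho}a_\lambda b_\rho)(ie_{\mu\sigma\nu\tau}c^\sigma d^\tau) = 2(\delta^\lambda_\sigma\delta^\rho_\tau - \delta^\lambda_\tau\delta^\rho_\sigma)\,a_\lambda b_\rho c^\sigma d^\tau = 2(ac)(bd) - 2(ad)(bc). $$

The result is

$$ d\sigma = r_e^2\,\frac{2\pi m^2\,dt}{s^2}\left\{\frac{s^2 + u^2}{t^2} + \frac{s^2 + t^2}{u^2} + \frac{2s^2}{tu} + 4\lambda_1\lambda_2\left[\frac{s^2 - u^2}{t^2} + \frac{s^2 - t^2}{u^2} + \frac{2s^2}{tu}\right]\right\}. $$

Since the momenta of the colliding electrons (in the centre-of-mass system) are opposite, antiparallel spins correspond to like helicities ($\lambda_1 = \lambda_2$), and parallel spins to unlike helicities ($\lambda_1 = -\lambda_2$). Substituting $s$, $t$, $u$ from (81.8) (with $\mathbf{p}^2 \approx \varepsilon^2$), we find the required ratio:

$$ \frac{d\sigma_{\uparrow\uparrow}}{d\sigma_{\uparrow\downarrow}} = \tfrac18(1 + 6\cos^2\theta + \cos^4\theta). \tag{1} $$

This has its least value, $\tfrac18$, when $\theta = \tfrac12\pi$.

PROBLEM 5. The same as Problem 4, but for the scattering of positrons by electrons.

SOLUTION. In this case we have to calculate, instead of (81.4),

$$ |M_{fi}|^2 \to 16\pi^2e^4\left\{\frac{1}{t^2}\,\mathrm{tr}(\rho_-'\gamma^\mu\rho_-\gamma^\nu)\,\mathrm{tr}(\rho_+\gamma_\mu\rho_+'\gamma_\nu) - \frac{1}{tu}\,\mathrm{tr}(\rho_-'\gamma^\mu\rho_-\gamma^\nu\rho_+\gamma_\mu\rho_+'\gamma_\nu) + \ldots\right\}; $$

the remaining terms are obtained by interchanging $p_+$ and $p_-'$. The density matrices are

$$ \rho_- = \tfrac12(\gamma p_-)(1 - 2\lambda_-\gamma^5), \qquad \rho_+ = \tfrac12(\gamma p_+)(1 + 2\lambda_+\gamma^5), \qquad \rho_-' = \tfrac12\gamma p_-', \qquad \rho_+' = \tfrac12\gamma p_+', $$

where $\lambda_+, \lambda_- = \pm\tfrac12$ (and for the positron, as for the electron, $\lambda_+ = \tfrac12$ denotes that the spin is parallel to the momentum). The result of the calculation is

$$ d\sigma = r_e^2\,\frac{2\pi m^2\,dt}{u^2}\left\{\frac{s^2 + u^2}{t^2} + \frac{s^2 + t^2}{u^2} + \frac{2s^2}{tu} - 4\lambda_+\lambda_-\left[\frac{s^2 - u^2}{t^2} + \frac{s^2 + t^2}{u^2} + \frac{2s^2}{tu}\right]\right\}. $$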
Hence we find for the ratio of cross-sections the same value as formula (1), Problem 4.

PROBLEM 6. Determine the cross-section for scattering of muons by electrons.

SOLUTION. The process is described by the one diagram (73.17). Instead of (81.5) we have

$$ d\sigma = \frac{4\pi e^4\,dt}{[s - (m + \mu)^2][s - (m - \mu)^2]}\,f(t, u), \tag{1} $$

$$ f(t, u) = \frac{1}{16t^2}\,\mathrm{tr}[(\gamma p_\mu' + \mu)\gamma^\lambda(\gamma p_\mu + \mu)\gamma^\nu]\ \mathrm{tr}[(\gamma p_e' + m)\gamma_\lambda(\gamma p_e + m)\gamma_\nu]; $$

$p_e$, $p_\mu$ and $p_e'$, $p_\mu'$ are the initial and final 4-momenta of the electron and the muon, and $m$, $\mu$ their masses. The invariants are

$$ s = (p_e + p_\mu)^2 = m^2 + \mu^2 + 2p_ep_\mu, \quad t = (p_e - p_e')^2 = 2(m^2 - p_ep_e') = 2(\mu^2 - p_\mu p_\mu'), \quad u = (p_e - p_\mu')^2 = m^2 + \mu^2 - 2p_ep_\mu', \quad s + t + u = 2(m^2 + \mu^2). $$

The result of the calculation is

$$ f = \frac{2}{t^2}\{(p_ep_\mu)^2 + (p_ep_\mu')^2 + \tfrac12(m^2 + \mu^2)t\} = \frac{1}{t^2}\{\tfrac12(s^2 + u^2) + (m^2 + \mu^2)(2t - m^2 - \mu^2)\}. \tag{2} $$

Formulae (1) and (2) give the solution of the problem. In the centre-of-mass system,

$$ d\sigma = \frac{e^4\,do}{8(\varepsilon_e + \varepsilon_\mu)^2\,\mathbf{p}^4\sin^4\tfrac12\theta}\left\{(\varepsilon_e\varepsilon_\mu + \mathbf{p}^2)^2 + (\varepsilon_e\varepsilon_\mu + \mathbf{p}^2\cos\theta)^2 - 2(m^2 + \mu^2)\,\mathbf{p}^2\sin^2\tfrac12\theta\right\}, $$

where $do = 2\pi\sin\theta\,d\theta$; $\varepsilon_e$, $\varepsilon_\mu$ are the energies of the electron and the muon; $\mathbf{p}^2 = \varepsilon_e^2 - m^2 = \varepsilon_\mu^2 - \mu^2$. If $\mathbf{p}^2 \ll \mu^2$, we return to formula (80.9) for scattering by a fixed centre of Coulomb force. In the ultra-relativistic case ($\mathbf{p}^2 \gg \mu^2$),

$$ d\sigma = \frac{e^4}{8\mathbf{p}^2}\,\frac{1 + \cos^4\tfrac12\theta}{\sin^4\tfrac12\theta}\,do. $$

In the laboratory system (where the electron is at rest before the collision),

$$ d\sigma = 2\pi\left(\frac{e^2}{m}\right)^2\frac{d\Delta}{v_\mu^2\Delta^2}\left(1 - v_\mu^2\,\frac{\Delta}{\Delta_{\max}} + \frac{m^2\Delta^2}{2\varepsilon_\mu^2}\right), $$

where $\varepsilon_\mu$ is the energy of the incident muon, and $v_\mu = |\mathbf{p}_\mu|/\varepsilon_\mu$ is its velocity; $m\Delta = \varepsilon_e' - m = \varepsilon_\mu - \varepsilon_\mu'$ is the energy of the recoil electron; and

$$ \Delta_{\max} = \frac{2\mathbf{p}_\mu^2}{m^2 + \mu^2 + 2m\varepsilon_\mu} $$

is the maximum value of $\Delta$.

PROBLEM 7. Determine the ratio of cross-sections for the mutual scattering of helical electrons and muons with parallel and with antiparallel spins, in the ultra-relativistic case ($\varepsilon_\mu \gg \mu$, $\varepsilon_e \gg m$).
SOLUTION.† As in Problem 4, we find

$$ \frac{d\sigma_{\uparrow\uparrow}}{d\sigma_{\uparrow\downarrow}} = \cos^4\tfrac12\theta, $$

where $\theta$ is the scattering angle in the centre-of-mass system.

† Another method of solving this problem is given at the end of §144.

PROBLEM 8. Determine the cross-section for the conversion of an electron pair into a muon pair (V. B. Berestetskiĭ and I. Ya. Pomeranchuk, 1955).

SOLUTION. This is another cross-channel of the reaction to which $\mu e$ scattering belongs. In this channel,

$$ t = (p_e + \bar p_e)^2, \qquad s = (p_e - \bar p_\mu)^2, \qquad u = (p_e - p_\mu)^2, $$

where $p_e$, $\bar p_e$ are the 4-momenta of the electron and the positron, and $p_\mu$, $\bar p_\mu$ those of the muon and the antimuon. The reaction threshold corresponds to an energy $2\mu$ of the electron pair (in the centre-of-mass system), so that we must have $t > 4\mu^2$. In the laboratory system, where the electron is at rest before the collision and the positron has energy $\varepsilon_+$, $t = 2m(\varepsilon_+ + m) \approx 2m\varepsilon_+$, so that we must have $\varepsilon_+ > \varepsilon_t$, where the threshold energy $\varepsilon_t = 2\mu^2/m$; here and below, all approximations allowed by the inequality $\mu \gg m$ are made.

The differential cross-section is (instead of formulae (1) and (2), Problem 6)

$$ d\sigma = \frac{4\pi e^4\,ds}{t^4}\,[\tfrac12(s^2 + u^2) + 2\mu^2t - \mu^4]. $$

For given $t$, the quantity $s$ takes values between the limits determined by the equations $su = \mu^4$, $s + t + u = 2\mu^2$, i.e.

$$ \mu^2 - \tfrac12t - \tfrac12\sqrt{t(t - 4\mu^2)} \le s \le \mu^2 - \tfrac12t + \tfrac12\sqrt{t(t - 4\mu^2)}. $$

An elementary integration gives

$$ \sigma = \frac{4\pi}{3}\,\frac{e^4}{t}\left(1 + \frac{2\mu^2}{t}\right)\sqrt{1 - \frac{4\mu^2}{t}}; \tag{1} $$

in the laboratory system, $t = 2m\varepsilon_+$. This formula is not valid in the immediate neighbourhood of the threshold: when $\varepsilon_+ - \varepsilon_t \sim \mu e^4$, the muons formed cannot be regarded as free particles; when the Coulomb interaction between them is taken into account, the cross-section tends not to zero but to a constant value as $\varepsilon_+ \to \varepsilon_t$ (see QM, §147).

The cross-section (1) has its maximum value when $\varepsilon_+ = 1.7\,\varepsilon_t$. Its maximum value is about 20 times less than the cross-section for two-photon annihilation at the same energy.

§82. Ionization losses of fast particles

Let us consider a collision between a fast relativistic particle and an atom, accompanied by excitation or ionization of the atom. Such inelastic collisions for the non-relativistic case have been discussed in QM, §§148-150; here we shall derive the relativistic generalization of the formulae obtained in QM (H. A. Bethe, 1933).

The velocity of the particle incident on the atom is assumed large in comparison with those of the atomic electrons; thus we always assume that $Z\alpha \ll 1$, i.e. that the atomic number is fairly small. This condition ensures the applicability of the Born approximation to the process under consideration.

The solution of the problem depends to some extent on whether the fast particle is light (electron or positron) or heavy (meson, proton, $\alpha$-particle, etc.). The second case is the simpler, and will be taken here.

Let $p = (\varepsilon, \mathbf{p})$ and $p' = (\varepsilon', \mathbf{p}')$ be the initial and final 4-momenta of the fast particle in the laboratory system, where the atom is at rest before the collision; the difference $q = p' - p$ gives the energy and momentum transferred to the atom by the particle.

The range of possible momentum transfers can be divided into two parts:

$$ \text{(I)}\ \ \mathbf{q}^2/m \ll m, \qquad \text{(II)}\ \ \mathbf{q}^2/m \gg I, \tag{82.1} $$

where $m$ is the electron mass and $I$ is a mean energy of the atom, its ionization potential. The two parts overlap with $I \ll \mathbf{q}^2/m \ll m$; this allows an exact joining of the results for each part separately. The momentum transfer will be said to be respectively small and large in the two parts of the range.

SMALL MOMENTUM TRANSFER

In this range, the atomic electrons may be regarded as non-relativistic in both the initial and the final state of the atom.
The amplitude of the process is

$$ M_{fi}^{(n)} = e^2\,J^\mu_{n0}(-q)\,J^\nu_{p'p}(q)\,D_{\mu\nu}(q), \tag{82.2} $$

where $J_{n0}$ is the transition 4-current of the atom from the initial state (0) to the final state ($n$), and $J_{p'p}$ is the transition 4-current of the fast particle; these currents here replace the expressions $\bar u'\gamma u$ which would occur, for example, in the scattering amplitude of two "elementary" particles such as the electron and muon in (73.17); cf. also (139.3). The transition currents are taken in the momentum representation (see (43.11)).

The cross-section of the process in the laboratory system is

$$ d\sigma_n = 2\pi\,\delta(\varepsilon - \varepsilon' - \omega_{n0})\,|M_{fi}^{(n)}|^2\,\frac{d^3p'}{2|\mathbf{p}|\cdot2\varepsilon'\,(2\pi)^3}, \tag{82.3} $$

where $\omega_{n0} = E_n - E_0$ is the transition frequency between the states of the atom. The final state may belong to either the discrete or the continuous spectrum, corresponding to excitation and ionization of the atom respectively. In the law of conservation of energy (represented by the delta function in (82.3)), the recoil energy of the atom is neglected; this is certainly permissible for a small momentum transfer.

The photon propagator is here conveniently taken in the gauge (76.14), in which only its space components are non-zero:

$$ D_{ik}(q) = -\frac{4\pi}{\omega^2 - \mathbf{q}^2}\left(\delta_{ik} - \frac{q_iq_k}{\omega^2}\right), \qquad D_{0i} = D_{00} = 0 \tag{82.4} $$

(here $\omega = -\omega_{n0}$ is the time component of $q$). Then only the space components of the transition currents in (82.2) are likewise needed.

The atomic transition current $\mathbf{J}_{n0}(\mathbf{q})$ is here the Fourier component of the usual non-relativistic expression:

$$ \mathbf{J}_{n0}(\mathbf{q}) = \frac{i}{2m}\int e^{-i\mathbf{q}\cdot\mathbf{r}}\,(\psi_0\nabla\psi_n^* - \psi_n^*\nabla\psi_0)\,d^3x, \tag{82.5} $$

where $\psi_0$ and $\psi_n$ are the atomic wave functions; for simplicity, the sign of summation over the electrons in the atom will be omitted henceforward, i.e. the formulae will be written as if there were only one electron in the atom. Integrating by parts in the first term, we can rewrite this expression as a matrix element:

$$ \mathbf{J}_{n0}(\mathbf{q}) = \tfrac12\,(\mathbf{v}\,e^{-i\mathbf{q}\cdot\mathbf{r}} + e^{-i\mathbf{q}\cdot\mathbf{r}}\,\mathbf{v})_{n0}, \tag{82.6} $$

where $\mathbf{v} = -(i/m)\nabla$ is the electron velocity operator.

Since the momentum lost by the scattered particle is relatively small ($|\mathbf{q}| \ll |\mathbf{p}|$), the transition current for this particle can be replaced simply by the diagonal element

$$ J_{pp}(0) = 2pz, \tag{82.7} $$

corresponding to classical motion in a straight line (cf. (99.5)); a factor $z$ is included to take account of a possible difference between the particle charge $ze$ and the electron charge $e$. Since $\mathbf{q}$ is small, so is the angle of deviation $\vartheta$ of the particle. The longitudinal and transverse components of $\mathbf{q}$ (relative to $\mathbf{p}$) are

$$ -q_\parallel \approx (dp/d\varepsilon)\,\omega_{n0} = \omega_{n0}/v, \qquad q_\perp \approx |\mathbf{p}|\vartheta, \tag{82.8} $$

and hence $\mathbf{q}\cdot\mathbf{p} \approx -\varepsilon\omega_{n0}$.

Substitution of (82.4)-(82.8) in (82.2) gives

$$ M_{fi}^{(n)} = -\frac{4\pi ze^2}{\omega_{n0}^2 - \mathbf{q}^2}\left\langle n\left|\frac{\varepsilon}{\omega_{n0}}\,(\mathbf{q}\cdot\mathbf{v}\,e^{-i\mathbf{q}\cdot\mathbf{r}} + e^{-i\mathbf{q}\cdot\mathbf{r}}\,\mathbf{q}\cdot\mathbf{v}) + (\mathbf{p}\cdot\mathbf{v}\,e^{-i\mathbf{q}\cdot\mathbf{r}} + e^{-i\mathbf{q}\cdot\mathbf{r}}\,\mathbf{p}\cdot\mathbf{v})\right|0\right\rangle. $$

In the first term, since

$$ \mathbf{q}\cdot\mathbf{v}\,f + f\,\mathbf{q}\cdot\mathbf{v} = 2i\dot f, $$

where $f = e^{-i\mathbf{q}\cdot\mathbf{r}}$ (see QM, §149), the matrix element of this operator is $2i(\dot f)_{n0} = -2\omega_{n0}f_{n0}$. In the second term, $e^{-i\mathbf{q}\cdot\mathbf{r}}$ can be taken as unity, since $\mathbf{q}$ is small, and $\mathbf{v}_{n0} = i\omega_{n0}\mathbf{r}_{n0}$. Then

$$ M_{fi}^{(n)} = \frac{8\pi ze^2}{\omega_{n0}^2 - \mathbf{q}^2}\,\{\varepsilon\,(e^{-i\mathbf{q}\cdot\mathbf{r}})_{n0} - i\,\mathbf{p}\cdot\mathbf{r}_{n0}\,\omega_{n0}\}. $$

The squared modulus is

$$ |M_{fi}^{(n)}|^2 = \frac{64\pi^2(ze^2)^2}{(\omega_{n0}^2 - \mathbf{q}^2)^2}\,\{\varepsilon^2\,|(e^{-i\mathbf{q}\cdot\mathbf{r}})_{n0}|^2 + 2(\mathbf{q}\cdot\mathbf{r}_{n0})(\mathbf{p}\cdot\mathbf{r}_{n0})\,\varepsilon\omega_{n0} + (\mathbf{p}\cdot\mathbf{r}_{n0})^2\,\omega_{n0}^2\}; \tag{82.9} $$

here, in the second term, we have put $e^{-i\mathbf{q}\cdot\mathbf{r}} \approx 1 - i\mathbf{q}\cdot\mathbf{r}$, but this cannot be done in the first term, for a reason explained in the next footnote but two.

The energy lost by the fast particle in its inelastic collisions with atoms† is given by

$$ \kappa = \sum_n\int\omega_{n0}\,d\sigma_n = \frac{1}{16\pi^2}\sum_n\int\omega_{n0}\,|M_{fi}^{(n)}|^2\,do', \tag{82.10} $$

where the summation is over all possible final states of the atom, and the integration is over the directions of the scattered particle; this quantity will be called the effective retardation ($\kappa/\varepsilon$ is known as the energy loss cross-section).

The integration in (82.10) can be carried out in two stages, by averaging over the azimuth of the direction of $\mathbf{p}'$ relative to $\mathbf{p}$ and then integrating over $do' \approx 2\pi\vartheta\,d\vartheta$, where $\vartheta$ is the small scattering angle.
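The numerical factor $1/16\pi^2$ in (82.10) follows from (82.3) in one step. As a sketch of that step (assuming, as in the text, that the energy loss is small so that $|\mathbf{p}'| \approx |\mathbf{p}|$), write $d^3p' = |\mathbf{p}'|^2\,d|\mathbf{p}'|\,do' = |\mathbf{p}'|\,\varepsilon'\,d\varepsilon'\,do'$ and remove the delta function by the integration over $\varepsilon'$:

$$ d\sigma_n = \frac{2\pi}{2|\mathbf{p}|\cdot2\varepsilon'\,(2\pi)^3}\,|M_{fi}^{(n)}|^2\,|\mathbf{p}'|\,\varepsilon'\,do' = \frac{1}{16\pi^2}\,\frac{|\mathbf{p}'|}{|\mathbf{p}|}\,|M_{fi}^{(n)}|^2\,do' \approx \frac{1}{16\pi^2}\,|M_{fi}^{(n)}|^2\,do'. $$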
The first stage makes the change

$$ \mathbf{q}\cdot\mathbf{r}_{n0} \to q_\parallel x_{n0} = -(\omega_{n0}/v)\,x_{n0}, $$

where $x_{n0}$ is the matrix element of one of the Cartesian coordinates of the atomic electrons.‡ The integration over $d\vartheta$ can be replaced by integration with respect to $q^2$, since

$$ -q^2 = -\omega_{n0}^2 + \mathbf{q}^2 \approx -\omega_{n0}^2 + \frac{\omega_{n0}^2}{v^2} + \mathbf{p}^2\vartheta^2 = \frac{\omega_{n0}^2M^2}{\mathbf{p}^2} + \mathbf{p}^2\vartheta^2 \tag{82.11} $$

and therefore $2\vartheta\,d\vartheta = d|q^2|/\mathbf{p}^2$, $M$ being the mass of the fast particle. The result is

$$ \kappa = 4\pi(ze^2)^2\sum_n\int\left\{\frac{1}{v^2}\,|(e^{-i\mathbf{q}\cdot\mathbf{r}})_{n0}|^2\,\omega_{n0} - |x_{n0}|^2\,\omega_{n0}^3\left(\frac{M^2}{\mathbf{p}^2} + \frac{1}{v^2}\right)\right\}\frac{d|q^2|}{(q^2)^2}. \tag{82.12} $$

The lower limit of the integration with respect to $q^2$ is

$$ |q^2|_{\min} = (M^2/\mathbf{p}^2)\,\omega_{n0}^2. \tag{82.13} $$

† These are often called ionization losses, although they are due to excitation as well as ionization of atoms.

‡ It does not matter which coordinate: after the summation over the directions of the angular momentum of the atom in the final state, which is implied below, the matrix element does not depend on the direction of the $x$-axis.

As the upper limit, we take a value $|q^2|_1$ such that

$$ I \ll |q^2|_1/m \ll m, \tag{82.14} $$

which thus lies in the overlap of ranges I and II (82.1).

The integration and summation in (82.12) are carried out in the same way as was done for the non-relativistic case in QM, §149. The entire range of integration is divided into two parts: (a) from $|q^2|_{\min}$ to $|q^2|_0$ and (b) from $|q^2|_0$ to $|q^2|_1$, where $|q^2|_0$ is such that

$$ IM/|\mathbf{p}| \ll \sqrt{|q^2|_0} \ll m\alpha; \tag{82.15} $$

the quantity $m\alpha$ on the right is of the order of the momenta of the atomic electrons. In part (a) we can put $e^{-i\mathbf{q}\cdot\mathbf{r}} \approx 1 - i\mathbf{q}\cdot\mathbf{r}$, and the contribution of this part to $\kappa$ is

$$ 4\pi(ze^2)^2\sum_n\int_{|q^2|_{\min}}^{|q^2|_0}\left\{\frac{1}{v^2}\,\omega_{n0}|x_{n0}|^2\,\frac{1}{|q^2|} - \frac{M^2}{\mathbf{p}^2}\,\omega_{n0}^3|x_{n0}|^2\,\frac{1}{(q^2)^2}\right\}d|q^2| = 4\pi(ze^2)^2\sum_n\omega_{n0}|x_{n0}|^2\left[\frac{1}{v^2}\log\frac{|q^2|_0\,\mathbf{p}^2}{M^2\omega_{n0}^2} - 1\right]. $$

In the second term, the integration can be extended to infinity. The summation is carried out by means of the formula

$$ \sum_n\omega_{n0}|x_{n0}|^2 = Z/2m, \tag{82.16} $$

where $Z$ is the number of electrons in the atom; see QM, (149.10). The result can be written

$$ \frac{2\pi(ze^2)^2Z}{mv^2}\left[\log\frac{|q^2|_0\,\mathbf{p}^2}{M^2I^2} - v^2\right], \tag{82.17} $$

where $I$ is an average energy of the atom, defined by

$$ \log I = \frac{\sum_n\omega_{n0}|x_{n0}|^2\log\omega_{n0}}{\sum_n\omega_{n0}|x_{n0}|^2} = \frac{2m}{Z}\sum_n\omega_{n0}|x_{n0}|^2\log\omega_{n0}. \tag{82.18} $$

In part (b), (82.11) shows that $|q^2| \approx \mathbf{p}^2\vartheta^2$, i.e. $|q^2|$ is independent of the particular final state $n$ of the atom, and the limits of integration are also independent of $n$. The summation over $n$ can therefore be taken inside the integral in (82.12). In the first term, the summation is carried out by means of the formula

$$ \sum_n|(e^{-i\mathbf{q}\cdot\mathbf{r}})_{n0}|^2\,\omega_{n0} = (Z/2m)\,\mathbf{q}^2 \tag{82.19} $$

(see QM, (149.5)), and the integral is†

$$ \frac{2\pi(ze^2)^2Z}{mv^2}\log\frac{|q^2|_1}{|q^2|_0}. $$

The integral of the second term in (82.12) over this part of the range makes a negligible contribution to $\kappa$.

† The logarithmic divergence of the integral at the upper limit is the reason why $e^{-i\mathbf{q}\cdot\mathbf{r}}$ cannot be expanded in powers of $\mathbf{q}$ in the first term in (82.12).

Adding the last formula to (82.17), we find as the contribution to $\kappa$ from the whole range of small momentum transfers

$$ \kappa_{\rm I} = \frac{2\pi(ze^2)^2Z}{mv^2}\left[\log\frac{|q^2|_1\,\mathbf{p}^2}{M^2I^2} - v^2\right]. \tag{82.20} $$

LARGE MOMENTUM TRANSFER

Let us now consider collisions with a momentum transfer which is large compared with the momentum of the atomic electrons ($\mathbf{q}^2 \gg mI$). Here we can evidently neglect the binding of the electrons in the atom, regarding them as free. Accordingly, the collision between the fast particle and the atom may be taken as an elastic collision with each of the $Z$ atomic electrons. Because of the high speed of the particle, the atomic electrons may be assumed to be originally at rest.

Let $m\Delta$ denote the energy transferred from the fast particle to an atomic electron, and let $d\sigma_\Delta$ be the cross-section for elastic scattering with this energy transfer. The differential effective retardation by the whole atom is then

$$ d\kappa = Zm\Delta\,d\sigma_\Delta. \tag{82.21} $$

The maximum energy which can be transferred to an electron at rest by the impact of a particle with mass $M \gg m$ is

$$ m\Delta_{\max} = \frac{2m\mathbf{p}^2}{m^2 + M^2 + 2m\varepsilon} \approx \frac{2m\mathbf{p}^2}{M^2 + 2m\varepsilon}, $$

where $\varepsilon$ and $\mathbf{p}$ are the energy and momentum of the incident particle; see Fields, (13.13). We shall also suppose that the energy $\varepsilon$, though ultra-relativistic ($\varepsilon \gg M$), is nevertheless such that

$$ \varepsilon \ll M^2/m. \tag{82.22} $$

Then even the maximum energy transfer

$$ m\Delta_{\max} \approx 2m\mathbf{p}^2/M^2 = 2mv^2\gamma^2 \qquad (\gamma = \varepsilon/M = 1/\sqrt{1 - v^2}) \tag{82.23} $$

is small in comparison with the initial kinetic energy of the incident particle ($m\Delta_{\max} \ll \varepsilon - M$). Correspondingly, the momentum transfer $\mathbf{q}$ is always small in comparison with the initial momentum $\mathbf{p}$ of the particle.

This enables us to regard the motion of the particle as being unaltered by the collision, i.e. the particle itself as having infinite mass. Then the scattering cross-section is found by simply transforming the cross-section (80.7) for electron scattering by a fixed centre to the laboratory system, in which the electron is initially at rest. This is easily done by noting that, in the approximation used, $-q^2 \approx \mathbf{q}^2 = 4\mathbf{p}^2\sin^2\tfrac12\vartheta$, $do' = \pi\,d|q^2|/\mathbf{p}^2$, and the relative velocity is $v$ in both systems. Formula (80.7) becomes

$$ d\sigma = \frac{4\pi(ze^2)^2}{v^2}\left(1 - \frac{|q^2|}{4m^2\gamma^2}\right)\frac{d|q^2|}{(q^2)^2}. $$

The energy transfer $\Delta$ is expressed in terms of the same invariant $q^2$: $-q^2 = 2m^2\Delta$, and therefore†

$$ d\sigma_\Delta = \frac{2\pi(ze^2)^2}{m^2v^2}\left(1 - v^2\,\frac{\Delta}{\Delta_{\max}}\right)\frac{d\Delta}{\Delta^2}. \tag{82.24} $$

The contribution to the effective retardation from this range of momentum transfers is found by integrating (82.21) from the limit $|q^2|_1$ defined above to $|q^2|_{\max} = 2m^2\Delta_{\max}$. The result is

$$ \kappa_{\rm II} = \frac{2\pi(ze^2)^2Z}{mv^2}\left[\log\frac{2\Delta_{\max}m^2}{|q^2|_1} - v^2\right]. \tag{82.25} $$

Finally, adding the contributions (82.20) and (82.25), we have for the total ionization losses by the fast heavy particle (in ordinary units)

$$ \kappa = \frac{4\pi Z(ze^2)^2}{mv^2}\left[\log\frac{2mv^2}{I(1 - v^2/c^2)} - \frac{v^2}{c^2}\right]. \tag{82.26} $$

In the non-relativistic case, this reduces to QM, (150.10):

$$ \kappa = \frac{4\pi Z(ze^2)^2}{mv^2}\log\frac{2mv^2}{I}, \tag{82.27} $$

† In this expression, of course, no account is taken of the specific effects of strong interactions when the heavy particle is a hadron. Such effects (corresponding to the hadron form factor) are, however, important only when $|q^2| \gtrsim M^2$, and such momentum transfers are excluded by the condition (82.22).
and in the ultra-relativistic case

$$ \kappa = \frac{4\pi Z(ze^2)^2}{mc^2}\left[\log\frac{2mc^2}{I(1 - v^2/c^2)} - 1\right]. \tag{82.28} $$

The retardation depends only on the velocity of the fast particle, and not on its mass. The decrease of the retardation with increasing velocity (82.27) changes to a slow (logarithmic) increase in the ultra-relativistic range.

PROBLEMS

PROBLEM 1. Determine the effective retardation of a relativistic electron.

SOLUTION. The contribution of the range of small momentum transfers is again given by (82.20). For large momentum transfers, (82.24) must be replaced by (81.14), which includes exchange effects. Integrating $m\Delta\,d\sigma_\Delta$ over $d\Delta$ from $|q^2|_1/2m^2$ to $\tfrac12(\gamma - 1)$ and adding to (82.20), we get

$$ \kappa = \frac{2\pi Ze^4}{mv^2}\left[\log\frac{m^2(\gamma^2 - 1)(\gamma - 1)c^4}{2I^2} - \left(\frac{2}{\gamma} - \frac{1}{\gamma^2}\right)\log2 + \frac{1}{\gamma^2} + \frac{(\gamma - 1)^2}{8\gamma^2}\right], $$

where $\gamma = (1 - v^2/c^2)^{-1/2}$. In the non-relativistic case, we get the formula given in QM, §149, Problem, and in the ultra-relativistic case ($\gamma \gg 1$)

$$ \kappa = \frac{2\pi Ze^4}{mc^2}\left(\log\frac{m^2c^4\gamma^3}{2I^2} + \frac18\right). $$

PROBLEM 2. The same as Problem 1, but for a positron.

SOLUTION. For $d\sigma_\Delta$ in the range of large momentum transfers, the expression (81.23) must be used, the upper limit for $\Delta$ being $\gamma - 1$. In the ultra-relativistic case, the result is

$$ \kappa = \frac{2\pi Ze^4}{mc^2}\left(\log\frac{2m^2c^4\gamma^3}{I^2} - \frac{23}{12}\right). $$

§83. Breit's equation

In classical electrodynamics, a system of interacting particles can be described by means of a Lagrangian function depending only on the coordinates and velocities of the particles, and correct as far as terms $\sim 1/c^2$ (Fields, §65). This is because radiation appears only as an effect of order $1/c^3$.

In the quantum theory, this corresponds to the possibility of describing the system by Schrödinger's equation including second-order terms. For an electron moving in an external electromagnetic field such an equation has been derived in §33. We shall now derive a similar equation describing a system of interacting particles.

We start from the relativistic expression for the scattering amplitude for two particles.
In the non-relativistic approximation, this becomes the usual Born amplitude, proportional to the Fourier component of the potential of electrostatic interaction of two charges. By calculating the amplitude as far as second-order terms, we can establish the form of the corresponding potential, taking account of terms $\sim 1/c^2$.

Let us first assume that the two particles are different, with masses $m_1$ and $m_2$ (say an electron and muon). Then the scattering process is represented by a single diagram,

[Feynman diagram: exchange of a single photon between particle 1 ($p_1 \to p_1'$) and particle 2 ($p_2 \to p_2'$)]

The corresponding amplitude is

$$ M_{fi} = e^2\,(\bar u_1'\gamma^\mu u_1)\,D_{\mu\nu}(q)\,(\bar u_2'\gamma^\nu u_2), \qquad q = p_1' - p_1 = p_2 - p_2'; \tag{83.1} $$

here it is assumed that the charges have the same sign. If the signs are different, $e^2$ becomes $-e^2$.

The subsequent calculations are considerably simplified if the photon propagator $D_{\mu\nu}$ is chosen not in the ordinary gauge but in the Coulomb gauge (76.12), (76.13):†

$$ D_{00} = -\frac{4\pi}{\mathbf{q}^2}, \qquad D_{0i} = 0, \qquad D_{ik} = \frac{4\pi}{\mathbf{q}^2 - \omega^2/c^2}\left(\delta_{ik} - \frac{q_iq_k}{\mathbf{q}^2}\right). \tag{83.2} $$

† In this section, factors of $c$ will be written in all formulae, and factors of $\hbar$ in the final formulae.

Then the scattering amplitude is

$$ M_{fi} = e^2\{(\bar u_1'\gamma^0u_1)(\bar u_2'\gamma^0u_2)\,D_{00} + (\bar u_1'\gamma^iu_1)(\bar u_2'\gamma^ku_2)\,D_{ik}\}. \tag{83.3} $$

If all terms in $1/c$ are neglected, the second term in the braces vanishes, and the first term gives

$$ M_{fi} = -2m_1\cdot2m_2\,(w_1^{(0)\prime*}w_1^{(0)})(w_2^{(0)\prime*}w_2^{(0)})\,U(\mathbf{q}), \tag{83.4} $$

where

$$ U(\mathbf{q}) = 4\pi e^2/\mathbf{q}^2, \tag{83.5} $$

and $w_1^{(0)}, w_2^{(0)}, \ldots$ denote the spinor (two-component) amplitudes of the non-relativistic plane waves, as defined in §23. The function $U(\mathbf{q})$ is the Fourier component of the Coulomb interaction potential energy, $U(r) = e^2/r$.

In the next approximation (with respect to $1/c$), the "Schrödinger" wave function of the free particle $\varphi_{\rm Sch}$ (normalized by the integral $\int|\varphi_{\rm Sch}|^2\,d^3x$) satisfies the equation

$$ \hat H^{(0)}\varphi_{\rm Sch} = (\varepsilon - mc^2)\,\varphi_{\rm Sch}, \qquad \hat H^{(0)} = \frac{\hat{\mathbf{p}}^2}{2m} - \frac{\hat{\mathbf{p}}^4}{8m^3c^2}, \tag{83.6} $$

which includes the next term in the expansion of the relativistic expression for the kinetic energy. The (spinor) amplitude of this plane wave will be denoted by $w$, which tends to $w^{(0)}$ as $1/c \to 0$.
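The free-particle Hamiltonian in (83.6) is simply the expansion of the relativistic kinetic energy $\sqrt{p^2c^2 + m^2c^4} - mc^2$ to second order in $1/c^2$. A quick numerical illustration, as a sketch only (the function names and the sample values $m = c = 1$, $p = 0.1$ are illustrative choices, not taken from the text):

```python
import math

def kinetic_exact(p, m, c):
    """Relativistic kinetic energy sqrt(p^2 c^2 + m^2 c^4) - m c^2."""
    return math.sqrt(p*p*c*c + m*m*c**4) - m*c*c

def kinetic_second_order(p, m, c):
    """Eigenvalue of H^(0) in (83.6): p^2/2m - p^4/(8 m^3 c^2)."""
    return p*p/(2*m) - p**4/(8*m**3*c*c)

if __name__ == "__main__":
    p, m, c = 0.1, 1.0, 1.0
    exact = kinetic_exact(p, m, c)
    print("exact             :", exact)
    print("p^2/2m            :", p*p/(2*m))
    print("with p^4 term     :", kinetic_second_order(p, m, c))
    # The leftover error is of order p^6/(16 m^5 c^4), the next term of the series.
    print("remaining error   :", exact - kinetic_second_order(p, m, c))
```

For small $p/mc$ the $p^4$ term removes most of the discrepancy between $p^2/2m$ and the exact value, which is the level of accuracy kept throughout this section.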
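The required scattering amplitude must be expressed in terms of these amplitudes, in order to determine from its form the "Schrödinger" interaction potential of the particles in the approximation considered.

In accordance with formula (33.11), the bispinor amplitude $u$ of the free particle can be expressed in terms of the "Schrödinger" amplitude $w$, with sufficient accuracy, by

$$u = \sqrt{2m}\begin{pmatrix}(1 - \mathbf p^2/8m^2c^2)\,w\\ (\boldsymbol\sigma\cdot\mathbf p/2mc)\,w\end{pmatrix}. \tag{83.7}$$

This formula gives

$$\bar u_1'\gamma^0u_1 = u_1'^{*}u_1 = 2m_1\,w_1'^{*}\left\{1 - \frac{\mathbf q^2}{8m_1^2c^2} + \frac{i\boldsymbol\sigma\cdot\mathbf q\times\mathbf p_1}{4m_1^2c^2}\right\}w_1,$$

$$\bar u_1'\boldsymbol\gamma u_1 = u_1'^{*}\boldsymbol\alpha u_1 = \frac{1}{c}\,w_1'^{*}\{\boldsymbol\sigma(\boldsymbol\sigma\cdot\mathbf p_1) + (\boldsymbol\sigma\cdot\mathbf p_1')\boldsymbol\sigma\}w_1 = \frac{1}{c}\,w_1'^{*}\{i\boldsymbol\sigma\times\mathbf q + 2\mathbf p_1 + \mathbf q\}w_1,$$

where $\mathbf q = \mathbf p_1' - \mathbf p_1 = \mathbf p_2 - \mathbf p_2'$. The corresponding expressions for $(\bar u_2'\gamma^0u_2)$ and $(\bar u_2'\boldsymbol\gamma u_2)$ differ in that the suffix 1 is replaced by 2 and $\mathbf q$ by $-\mathbf q$.

We now substitute these expressions in (83.3). Since the product $(\bar u_1'\boldsymbol\gamma u_1)(\bar u_2'\boldsymbol\gamma u_2)$ already contains the factor $1/c^2$, the term $\omega^2/c^2$ in the denominator of $D_{ik}$ may be neglected. The scattering amplitude is then

$$M_{fi} = -2m_1\cdot 2m_2\,(w_1'^{*}w_2'^{*}\,U(\mathbf p_1, \mathbf p_2, \mathbf q)\,w_1w_2),$$

where

$$U(\mathbf p_1, \mathbf p_2, \mathbf q) = 4\pi e^2\left\{\frac{1}{\mathbf q^2} - \frac{1}{8m_1^2c^2} - \frac{1}{8m_2^2c^2} + \frac{(\mathbf q\cdot\mathbf p_1)(\mathbf q\cdot\mathbf p_2)}{m_1m_2c^2\mathbf q^4} - \frac{\mathbf p_1\cdot\mathbf p_2}{m_1m_2c^2\mathbf q^2} + \frac{i\boldsymbol\sigma_1\cdot\mathbf q\times\mathbf p_1}{4m_1^2c^2\mathbf q^2} - \frac{i\boldsymbol\sigma_1\cdot\mathbf q\times\mathbf p_2}{2m_1m_2c^2\mathbf q^2} - \frac{i\boldsymbol\sigma_2\cdot\mathbf q\times\mathbf p_2}{4m_2^2c^2\mathbf q^2} + \frac{i\boldsymbol\sigma_2\cdot\mathbf q\times\mathbf p_1}{2m_1m_2c^2\mathbf q^2} + \frac{(\boldsymbol\sigma_1\cdot\mathbf q)(\boldsymbol\sigma_2\cdot\mathbf q)}{4m_1m_2c^2\mathbf q^2} - \frac{\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2}{4m_1m_2c^2}\right\}; \tag{83.8}$$

the suffixes 1 and 2 to the Pauli matrices indicate the spinor indices on which they act, $\boldsymbol\sigma_1$ acting on $w_1$ and $\boldsymbol\sigma_2$ on $w_2$.

The function $U(\mathbf p_1, \mathbf p_2, \mathbf q)$ is the particle interaction operator in the momentum representation. It is then related to the operator $\hat U(\hat{\mathbf p}_1, \hat{\mathbf p}_2, \mathbf r)$ in the coordinate representation by

$$\int e^{-i(\mathbf p_1'\cdot\mathbf r_1 + \mathbf p_2'\cdot\mathbf r_2)}\,\hat U(\hat{\mathbf p}_1, \hat{\mathbf p}_2, \mathbf r)\,e^{i(\mathbf p_1\cdot\mathbf r_1 + \mathbf p_2\cdot\mathbf r_2)}\,d^3x_1\,d^3x_2 = (2\pi)^3\delta(\mathbf p_1 + \mathbf p_2 - \mathbf p_1' - \mathbf p_2')\,U(\mathbf p_1, \mathbf p_2, \mathbf q). \tag{83.10}$$

If the operator $\hat U$ is simply a function $U(\mathbf r)$ ($\mathbf r = \mathbf r_1 - \mathbf r_2$), then $U(\mathbf p_1, \mathbf p_2, \mathbf q)$ is independent of $\mathbf p_1$ and $\mathbf p_2$, and formula (83.10) reduces to the usual definition of the Fourier component:

$$\int e^{-i\mathbf q\cdot\mathbf r}\,U(\mathbf r)\,d^3x = U(\mathbf q).$$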
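The spin algebra used above in reducing $\bar u_1'\boldsymbol\gamma u_1$, namely $\boldsymbol\sigma(\boldsymbol\sigma\cdot\mathbf p) + (\boldsymbol\sigma\cdot\mathbf p')\boldsymbol\sigma = i\boldsymbol\sigma\times\mathbf q + 2\mathbf p + \mathbf q$ with $\mathbf q = \mathbf p' - \mathbf p$, can be verified with explicit Pauli matrices. The sketch below is an illustration only; the helper names (`combine`, `sigma_cross`, `max_deviation`) are assumptions introduced here, not notation from the text.

```python
SIGMA = (
    ((0, 1), (1, 0)),      # sigma_x
    ((0, -1j), (1j, 0)),   # sigma_y
    ((1, 0), (0, -1)),     # sigma_z
)
IDENTITY = ((1, 0), (0, 1))


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def combine(terms):
    """Linear combination sum_n c_n M_n, given as (c_n, M_n) pairs."""
    return tuple(tuple(sum(c * m[i][j] for c, m in terms)
                       for j in range(2)) for i in range(2))


def sigma_dot(v):
    """The matrix sigma . v for a 3-vector v."""
    return combine(list(zip(v, SIGMA)))


def sigma_cross(v):
    """The three matrices (sigma x v)_i, i = x, y, z."""
    sx, sy, sz = SIGMA
    return (
        combine([(v[2], sy), (-v[1], sz)]),
        combine([(v[0], sz), (-v[2], sx)]),
        combine([(v[1], sx), (-v[0], sy)]),
    )


def max_deviation(p, p_prime):
    """Largest element of LHS - RHS of the identity, over all components."""
    q = tuple(b - a for a, b in zip(p, p_prime))
    sxq = sigma_cross(q)
    worst = 0.0
    for i in range(3):
        lhs = combine([(1, matmul(SIGMA[i], sigma_dot(p))),
                       (1, matmul(sigma_dot(p_prime), SIGMA[i]))])
        rhs = combine([(1j, sxq[i]), (2 * p[i] + q[i], IDENTITY)])
        worst = max(worst, max(abs(lhs[r][c] - rhs[r][c])
                               for r in range(2) for c in range(2)))
    return worst


if __name__ == "__main__":
    print(max_deviation((0.3, -1.2, 0.7), (1.1, 0.4, -0.5)))
```

The deviation should vanish to rounding accuracy for any pair of momenta, since the identity follows from $\sigma_i\sigma_k = \delta_{ik} + ie_{ikl}\sigma_l$ alone.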
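Hence it is clear that, to find $\hat U(\hat{\mathbf p}_1, \hat{\mathbf p}_2, \mathbf r)$, we must calculate the integral

$$\int e^{i\mathbf q\cdot\mathbf r}\,U(\mathbf p_1, \mathbf p_2, \mathbf q)\,d^3q/(2\pi)^3,$$

and then replace $\mathbf p_1$ and $\mathbf p_2$ by the operators $\hat{\mathbf p}_1 = -i\nabla_1$, $\hat{\mathbf p}_2 = -i\nabla_2$, writing these to the right of all the other factors.

The required integrals are found by differentiation of the formula

$$\int e^{i\mathbf q\cdot\mathbf r}\,\frac{4\pi}{\mathbf q^2}\,\frac{d^3q}{(2\pi)^3} = \frac{1}{r}. \tag{83.11}$$

For example, taking the gradient gives

$$\int \frac{4\pi\mathbf q}{\mathbf q^2}\,e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3} = -i\nabla\frac{1}{r} = \frac{i\mathbf r}{r^3}. \tag{83.12}$$

Next, with $\mathbf a$ and $\mathbf b$ constant vectors, we have

$$\int \frac{4\pi(\mathbf a\cdot\mathbf q)(\mathbf b\cdot\mathbf q)}{\mathbf q^4}\,e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3} = -\frac12\int 4\pi(\mathbf b\cdot\mathbf q)\left(\mathbf a\cdot\frac{\partial}{\partial\mathbf q}\,\frac{1}{\mathbf q^2}\right)e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3};$$

the resulting integral, after integration by parts, reduces to (83.11) and (83.12), so that

$$\int \frac{4\pi(\mathbf a\cdot\mathbf q)(\mathbf b\cdot\mathbf q)}{\mathbf q^4}\,e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3} = \frac{1}{2r}\left\{\mathbf a\cdot\mathbf b - \frac{(\mathbf a\cdot\mathbf r)(\mathbf b\cdot\mathbf r)}{r^2}\right\}. \tag{83.13}$$

Finally,

$$\int \frac{4\pi(\mathbf a\cdot\mathbf q)(\mathbf b\cdot\mathbf q)}{\mathbf q^2}\,e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3} = -(\mathbf a\cdot\nabla)(\mathbf b\cdot\nabla)\frac{1}{r}.$$

In expanding the derivatives, it must be remembered that these expressions include the delta function $\delta(\mathbf r)$. To separate this, we note that, after averaging over the directions of $\mathbf r$,

$$-\overline{(\mathbf a\cdot\nabla)(\mathbf b\cdot\nabla)\frac{1}{r}} = -\frac13(\mathbf a\cdot\mathbf b)\,\Delta\frac{1}{r} = \frac{4\pi}{3}(\mathbf a\cdot\mathbf b)\,\delta(\mathbf r).$$

Now expanding the derivatives in the usual manner, we find

$$\int \frac{4\pi(\mathbf a\cdot\mathbf q)(\mathbf b\cdot\mathbf q)}{\mathbf q^2}\,e^{i\mathbf q\cdot\mathbf r}\,\frac{d^3q}{(2\pi)^3} = \frac{1}{r^3}\left\{\mathbf a\cdot\mathbf b - \frac{3(\mathbf a\cdot\mathbf r)(\mathbf b\cdot\mathbf r)}{r^2}\right\} + \frac{4\pi}{3}\,\mathbf a\cdot\mathbf b\,\delta(\mathbf r); \tag{83.14}$$

on averaging over the directions of $\mathbf r$, the first term vanishes and only the delta-function term remains.

Using these formulae, we obtain the following final expression for the particle interaction operator:

$$\hat U(\hat{\mathbf p}_1, \hat{\mathbf p}_2, \mathbf r) = \frac{e^2}{r} - \frac{\pi e^2\hbar^2}{2c^2}\left(\frac{1}{m_1^2} + \frac{1}{m_2^2}\right)\delta(\mathbf r) - \frac{e^2}{2m_1m_2c^2r}\left[\hat{\mathbf p}_1\cdot\hat{\mathbf p}_2 + \frac{\mathbf r\cdot(\mathbf r\cdot\hat{\mathbf p}_1)\hat{\mathbf p}_2}{r^2}\right] - \frac{e^2\hbar}{4m_1^2c^2r^3}\,\mathbf r\times\hat{\mathbf p}_1\cdot\boldsymbol\sigma_1 + \frac{e^2\hbar}{4m_2^2c^2r^3}\,\mathbf r\times\hat{\mathbf p}_2\cdot\boldsymbol\sigma_2 - \frac{e^2\hbar}{2m_1m_2c^2r^3}\{\mathbf r\times\hat{\mathbf p}_1\cdot\boldsymbol\sigma_2 - \mathbf r\times\hat{\mathbf p}_2\cdot\boldsymbol\sigma_1\} + \frac{e^2\hbar^2}{4m_1m_2c^2}\left\{\frac{\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2}{r^3} - \frac{3(\boldsymbol\sigma_1\cdot\mathbf r)(\boldsymbol\sigma_2\cdot\mathbf r)}{r^5} - \frac{8\pi}{3}\,\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2\,\delta(\mathbf r)\right\}. \tag{83.15}$$

The total Hamiltonian of the two-particle system in this approximation is

$$\hat H = \hat H_1^{(0)} + \hat H_2^{(0)} + \hat U, \tag{83.16}$$

where $\hat H^{(0)}$ is the free-particle Hamiltonian (83.6).

TWO ELECTRONS

If the two particles are identical (two electrons), then the scattering amplitude includes a second term which is represented by the "exchange" diagram

[Diagram: exchange diagram, with the final electrons interchanged.]

There is, however, no need to calculate the contribution of this term to the interaction operator.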
The reason is that the description of a system of identical particles by means of Schrödinger's equation can be achieved with an interaction operator similar to that for non-identical particles, if the solutions of the equation are appropriately symmetrized. In particular, for particle scattering this symmetrization will automatically take account of the contributions to the amplitude which correspond to the two Feynman diagrams. Thus the Hamiltonian of the two-electron system is obtained from formulae (83.15), (83.16) by simply putting $m_1 = m_2$:†

$$\hat H = \frac{1}{2m}(\hat{\mathbf p}_1^2 + \hat{\mathbf p}_2^2) - \frac{1}{8m^3c^2}(\hat{\mathbf p}_1^4 + \hat{\mathbf p}_2^4) + \hat U(\hat{\mathbf p}_1, \hat{\mathbf p}_2, \mathbf r),$$

$$\hat U = \frac{e^2}{r} - \pi\left(\frac{e\hbar}{mc}\right)^2\delta(\mathbf r) - \frac{e^2}{2m^2c^2r}\left(\hat{\mathbf p}_1\cdot\hat{\mathbf p}_2 + \frac{\mathbf r\cdot(\mathbf r\cdot\hat{\mathbf p}_1)\hat{\mathbf p}_2}{r^2}\right) + \frac{e^2\hbar}{4m^2c^2r^3}\{-(\boldsymbol\sigma_1 + 2\boldsymbol\sigma_2)\cdot\mathbf r\times\hat{\mathbf p}_1 + (\boldsymbol\sigma_2 + 2\boldsymbol\sigma_1)\cdot\mathbf r\times\hat{\mathbf p}_2\} + \frac14\left(\frac{e\hbar}{mc}\right)^2\left\{\frac{\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2}{r^3} - \frac{3(\boldsymbol\sigma_1\cdot\mathbf r)(\boldsymbol\sigma_2\cdot\mathbf r)}{r^5} - \frac{8\pi}{3}\,\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2\,\delta(\mathbf r)\right\}. \tag{83.17}$$

† The wave equation with the Hamiltonian (83.17) was first derived by G. Breit (1929); a consistent quantum-mechanical derivation was given by L. D. Landau (1932).

The presence of terms in $\delta(\mathbf r)$ does not, of course, imply that there is a particularly strong interaction. The value of all the correction terms after integration is of the same order, and according to the sense of the expansion used they are all to be regarded as small compared with the first term (the Coulomb interaction).

The different groups of terms in the interaction operator (83.17) are of different types. The first three terms have a purely orbital origin. The next term is linear in the spin operators of the particles, and corresponds to the spin-orbit interaction. The last term, which is quadratic in the spin operators, describes the spin-spin interaction.‡

‡ This interaction has been mentioned in QM, §72, in connection with the fine structure of the atomic levels, and the spin-spin interaction between the electrons and the nucleus is considered in QM, §121, in connection with the hyperfine structure of levels. In particular, the formula QM (121.9) corresponds to the delta-function term in the spin-spin interaction operator.

ELECTRON AND POSITRON

The electron-positron system needs special consideration. The scattering amplitude in this case consists of two terms:

$$M_{fi} = -e^2[\bar u(p_-')\gamma^\mu u(p_-)]\,D_{\mu\nu}(p_- - p_-')\,[\bar u(-p_+)\gamma^\nu u(-p_+')] + e^2[\bar u(-p_+)\gamma^\mu u(p_-)]\,D_{\mu\nu}(p_- + p_+)\,[\bar u(p_-')\gamma^\nu u(-p_+')]; \tag{83.18}$$

the first term corresponds to the scattering diagram and the second to the annihilation diagram. Since the wave function of the "electron + positron" system need not be antisymmetric, the two terms make independent contributions to the interaction operator. The first term (which has the same structure as the amplitude (83.1)) leads, of course, to an operator differing only in sign from (83.17).

Let us now consider the transformation of the second term. Here we use the photon propagator in the ordinary gauge:

$$D_{\mu\nu} = \frac{4\pi}{k^2}\,g_{\mu\nu} = \frac{4\pi}{\omega^2/c^2 - \mathbf k^2}\,g_{\mu\nu}.$$

In the present case $k = p_+ + p_-$, and since the particles are "almost non-relativistic", we have

$$\omega^2/c^2 = (\varepsilon_+ + \varepsilon_-)^2/c^2 \approx 4m^2c^2 \gg (\mathbf p_+ + \mathbf p_-)^2 \equiv \mathbf k^2. \tag{83.19}$$

For the photon propagator it is therefore sufficient to write $D_{\mu\nu} = (\pi/m^2c^2)\,g_{\mu\nu}$. This already contains a factor $1/c^2$. It is therefore sufficient to take the amplitudes $u(p)$ in the zero-order approximation:

$$u(p_-) = \sqrt{2m}\begin{pmatrix}w_-^{(0)}\\ 0\end{pmatrix}, \qquad u(-p_+) = \sqrt{2m}\begin{pmatrix}0\\ w^{(0)}\end{pmatrix},$$

where $w_-^{(0)}$, $w^{(0)}$ are the three-dimensional spinors which appear in (23.12); the index (0) will henceforward be omitted. With these amplitudes we have

$$\bar u(-p_+)\gamma^0u(p_-) = u^*(-p_+)u(p_-) = 0, \qquad \bar u(-p_+)\boldsymbol\gamma u(p_-) = u^*(-p_+)\boldsymbol\alpha u(p_-) = 2m(w^*\boldsymbol\sigma w_-).$$

On substitution of these expressions, the "annihilation" term in the scattering amplitude becomes

$$M_{fi}^{(\rm ann)} = -e^2\,\frac{\pi}{m^2c^2}\,(2m)^2\,(w^*\boldsymbol\sigma w_-)\cdot(w_-'^{*}\boldsymbol\sigma w'). \tag{83.20}$$

It is not yet possible, however, to draw from this any immediate conclusions as to the form of the interaction operator. Firstly, the spinors $w$ in terms of which the amplitudes $u(-p_+)$ are expressed are not yet literally positron spinors.
The positron amplitudes are got from $u(-p_+)$ by charge conjugation, and according to (26.6) the corresponding spinors (which we denote by $w_+$) are related to $w$ by $w_+ = \sigma_yw^*$, whence

$$w^* = \sigma_yw_+ = -w_+\sigma_y, \qquad w = -\sigma_yw_+^*. \tag{83.21}$$

Secondly, the scattering amplitude must be brought to a form in which the electron spinors ($w_-$ and $w_-'$) are contracted, and likewise the positron spinors ($w_+$ and $w_+'$). This is achieved by means of the formula

$$(w^*\boldsymbol\sigma w_-)\cdot(w_-'^{*}\boldsymbol\sigma w') = \tfrac32(w_-'^{*}w_-)(w^*w') - \tfrac12(w_-'^{*}\boldsymbol\sigma w_-)\cdot(w^*\boldsymbol\sigma w'), \tag{83.22}$$

which follows from (28.17). Finally, expressing $w$ and $w'$ in terms of $w_+$ and $w_+'$ by (83.21), we easily find

$$(w^*w') = (w_+'^{*}w_+), \qquad (w^*\boldsymbol\sigma w') = -(w_+'^{*}\boldsymbol\sigma w_+). \tag{83.23}$$

Substituting (83.23) in (83.22) and then in (83.20), we obtain the final expression for the annihilation part of the scattering amplitude:

$$M_{fi}^{(\rm ann)} = -4m^2\left(w_-'^{*}w_+'^{*}\left[\frac{\pi e^2}{2m^2c^2}(3 + \boldsymbol\sigma_+\cdot\boldsymbol\sigma_-)\right]w_-w_+\right),$$

the matrices $\boldsymbol\sigma_-$ and $\boldsymbol\sigma_+$ acting on $w_-$ and $w_+$ respectively. The expression in the square brackets is the interaction operator in the momentum representation. The corresponding coordinate operator is

$$\hat U^{(\rm ann)}(\mathbf r) = \frac{\pi e^2\hbar^2}{2m^2c^2}(3 + \boldsymbol\sigma_+\cdot\boldsymbol\sigma_-)\,\delta(\mathbf r), \qquad \mathbf r = \mathbf r_- - \mathbf r_+ \tag{83.24}$$

(J. Pirenne, 1947; V. B. Berestetskiĭ and L. D. Landau, 1949). The total electron-positron interaction operator is $-\hat U + \hat U^{(\rm ann)}$, with $\hat U$ given by (83.17).

§84. Positronium

The results obtained in §83 can be applied to positronium, a hydrogen-like system consisting of an electron and a positron.

In the centre-of-mass system, the electron and positron momentum operators in positronium are $\hat{\mathbf p}_- = -\hat{\mathbf p}_+ = \hat{\mathbf p}$, where $\hat{\mathbf p} = -i\hbar\nabla$ is the operator of the momentum of relative motion corresponding to the relative position vector $\mathbf r = \mathbf r_- - \mathbf r_+$. The total Hamiltonian for positronium is†

$$\hat H = \frac{\hat{\mathbf p}^2}{m} - \frac{e^2}{r} + \hat V_1 + \hat V_2 + \hat V_3,$$

$$\hat V_1 = -\frac{\hat{\mathbf p}^4}{4m^3c^2} + 4\pi\mu_0^2\,\delta(\mathbf r) - \frac{e^2}{2m^2c^2r}\left\{\hat{\mathbf p}^2 + \frac{\mathbf r\cdot(\mathbf r\cdot\hat{\mathbf p})\hat{\mathbf p}}{r^2}\right\},$$

$$\hat V_2 = 6\mu_0^2\,\frac{1}{r^3}\,\hat{\mathbf l}\cdot\hat{\mathbf S},$$

$$\hat V_3 = 6\mu_0^2\,\frac{1}{r^3}\left\{\frac{(\hat{\mathbf S}\cdot\mathbf r)(\hat{\mathbf S}\cdot\mathbf r)}{r^2} - \tfrac13\hat{\mathbf S}^2\right\} + 4\pi\mu_0^2\left(\tfrac73\hat{\mathbf S}^2 - 2\right)\delta(\mathbf r). \tag{84.1}$$

† In ordinary units.

Here $\mu_0 = e\hbar/2mc$ is the Bohr magneton, $\hbar\hat{\mathbf l} = \mathbf r\times\hat{\mathbf p}$ is the orbital angular momentum operator, $\hat{\mathbf S} = \tfrac12(\boldsymbol\sigma_+ + \boldsymbol\sigma_-)$ the total spin operator of the system, whose square $\hat{\mathbf S}^2 = \tfrac12(3 + \boldsymbol\sigma_+\cdot\boldsymbol\sigma_-)$. $\hat V_1$ includes all the purely orbital correction terms, $\hat V_2$ the spin-orbit interaction, and $\hat V_3$ the spin-spin and "annihilation" interactions.

The "unperturbed" Hamiltonian $\hat H = \hat{\mathbf p}^2/m - e^2/r$ naturally differs from the Hamiltonian of the hydrogen atom only in that the electron mass is replaced by the reduced mass $\tfrac12m$. The energy levels of positronium therefore have absolute values which are half those of hydrogen:

$$E = -me^4/4\hbar^2n^2, \tag{84.2}$$

where $n$ is the principal quantum number.

The remaining terms in (84.1) cause a splitting of the levels (84.2), i.e. the appearance of a fine structure. The resulting levels are classified primarily by the values of the total angular momentum $j$. We also see that the particle spin operators appear in the Hamiltonian (84.1) only through the sum $\hat{\mathbf S}$. This means that the Hamiltonian commutes with the squared total spin operator $\hat{\mathbf S}^2$, i.e. the value of the total spin continues to be conserved in the approximation considered (the second approximation with respect to $1/c$). The energy levels of positronium can therefore be classified by the total spin, which takes values $S = 0$ and $S = 1$. The levels with spin 0 are called parapositronium levels, and those with spin 1 orthopositronium levels.

It must be emphasized that the conservation of the total spin in positronium is actually exact, and does not depend on any particular approximation with respect to $1/c$; it follows from the CP invariance of electromagnetic interactions. Positronium is a strictly neutral system, and its states therefore have definite charge parity and combined parity. The latter is equal to $(-1)^{S+1}$ (see §27, Problem); since $S$ can take only two values, 0 and 1, the conservation of combined parity is equivalent to that of total spin.
When $S = 0$ the total angular momentum $j$ is equal to the orbital angular momentum $l$, but when $S = 1$ and $j$ is given, the number $l$ can take the values $j$, $j \pm 1$, so that in general each level $(n, j)$ of orthopositronium is split into three. Since the values $l = j$ and $l = j \pm 1$ correspond to opposite parities, the Hamiltonian has no matrix elements between these states. But the perturbation operator (the first term in $\hat V_3$) in general has non-diagonal elements between states with $l = j + 1$ and $l = j - 1$; the number $l$ then, of course, no longer has the strict significance of an orbital angular momentum.

The Zeeman effect in positronium has some unusual features (V. B. Berestetskiĭ and I. Ya. Pomeranchuk, 1949). The orbital magnetic moment of positronium is always zero: since in positronium $\mathbf r_+\times\mathbf p_+ = \mathbf r_-\times\mathbf p_-$, we have the operator

$$\hat{\boldsymbol\mu}_l = \mu_0(\mathbf r_+\times\hat{\mathbf p}_+ - \mathbf r_-\times\hat{\mathbf p}_-) = 0.$$

The spin magnetic moment operator is

$$\hat{\boldsymbol\mu}_s = \mu_0(\boldsymbol\sigma_+ - \boldsymbol\sigma_-); \tag{84.3}$$

it is not proportional to the total spin operator $\hat{\mathbf S} = \tfrac12(\boldsymbol\sigma_+ + \boldsymbol\sigma_-)$, and the operators $\hat{\mathbf S}^2$ and $\hat\mu_z$ do not commute. The states with definite values of the total spin $S$ and its component $S_z$ are therefore not, in general, eigenstates for the magnetic moment.

States with given $S$ and $S_z$ are described by spin functions $\chi_{SS_z}$ having the form

$$\chi_{11} = \alpha_+\alpha_-, \qquad \chi_{1,-1} = \beta_+\beta_-, \qquad \chi_{10} = \frac{1}{\sqrt2}(\alpha_+\beta_- + \alpha_-\beta_+), \qquad \chi_{00} = \frac{1}{\sqrt2}(\alpha_+\beta_- - \alpha_-\beta_+), \tag{84.4}$$

where $\alpha$ and $\beta$ are the spin functions of one particle corresponding to spin projections $+\tfrac12$ and $-\tfrac12$; the suffixes $+$ and $-$ indicate that the function belongs to the positron and the electron respectively. The first two spin functions, $\chi_{11}$ and $\chi_{1,-1}$, are also eigenfunctions of the operator $\hat\mu_z$, corresponding to the eigenvalue zero.
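The statements about $\hat{\mathbf S}^2$, $\hat\mu_z$ and the spin functions (84.4) can be made concrete with $4\times4$ matrices. The sketch below is an illustration under assumed conventions that are not in the text: the first tensor factor is taken to be the positron and the second the electron, $\alpha = (1, 0)$, $\beta = (0, 1)$, and $\hat\mu_z$ is measured in units of $\mu_0$.

```python
import math

PAULI = (
    ((0, 1), (1, 0)),      # sigma_x
    ((0, -1j), (1j, 0)),   # sigma_y
    ((1, 0), (0, -1)),     # sigma_z
)
ONE = ((1, 0), (0, 1))


def kron(a, b):
    """Kronecker product of 2x2 matrices: a acts on the positron, b on the electron."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def apply(a, v):
    return [sum(a[i][j] * v[j] for j in range(len(v))) for i in range(len(a))]


def total_spin_squared():
    """S^2 = (3 + sigma_+ . sigma_-)/2."""
    s2 = [[1.5 if i == j else 0.0 for j in range(4)] for i in range(4)]
    for s in PAULI:
        ss = kron(s, s)
        for i in range(4):
            for j in range(4):
                s2[i][j] += 0.5 * ss[i][j]
    return s2


def mu_z():
    """mu_z / mu_0 = sigma_{+z} - sigma_{-z}, from (84.3)."""
    a, b = kron(PAULI[2], ONE), kron(ONE, PAULI[2])
    return [[a[i][j] - b[i][j] for j in range(4)] for i in range(4)]


def commutator_norm(a, b):
    """Largest element of ab - ba in absolute value."""
    ab, ba = matmul(a, b), matmul(b, a)
    return max(abs(ab[i][j] - ba[i][j]) for i in range(4) for j in range(4))


R = 1 / math.sqrt(2)
CHI = {                      # spin functions (84.4), keyed by (S, S_z)
    (1, 1): [1, 0, 0, 0],
    (1, -1): [0, 0, 0, 1],
    (1, 0): [0, R, R, 0],
    (0, 0): [0, R, -R, 0],
}

if __name__ == "__main__":
    print(commutator_norm(total_spin_squared(), mu_z()))  # non-zero
    print(apply(mu_z(), CHI[(1, 1)]))                     # zero vector
    print(apply(mu_z(), CHI[(1, 0)]))                     # proportional to chi_00
```

In this representation $\hat\mu_z$ annihilates $\chi_{11}$ and $\chi_{1,-1}$ and maps $\chi_{10}$ onto a multiple of $\chi_{00}$, which is why it has no diagonal elements between states of definite total spin.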
The functions $\chi_{10}$ and $\chi_{00}$ are not eigenfunctions of $\hat\mu_z$, but the following combinations are eigenfunctions:

$$\frac{1}{\sqrt2}(\chi_{10} + \chi_{00}) = \alpha_+\beta_-, \qquad \frac{1}{\sqrt2}(\chi_{10} - \chi_{00}) = \alpha_-\beta_+. \tag{84.5}$$

It is easy to see that the only non-zero matrix elements $\langle S'S_z|\mu_z|SS_z\rangle$ calculated from the functions (84.4) are

$$\langle 00|\mu_z|10\rangle = \langle 10|\mu_z|00\rangle = 2\mu_0. \tag{84.6}$$

In weak magnetic fields (when $\mu_0H \ll \Delta$, where $\Delta$ is the difference between the level energies with $S = 0$ and $S = 1$) the initial approximation for the calculation of the Zeeman splitting is formed by states with definite values of the total spin. In the first approximation, this splitting is given by the mean value of the perturbation energy operator $\hat V_H = -\hat\mu_zH$. But all the diagonal matrix elements of the operator $\hat\mu_z$, and therefore of $\hat V_H$, as calculated from the functions (84.4), are zero. Thus, in weak fields, there is no linear Zeeman effect in positronium.

In the opposite limiting case of strong fields ($\mu_0H \gg \Delta$), we can neglect the spin interaction which brings about definite values of $S$. The components of the split level will then correspond to states with definite values of $\mu_z = \pm2\mu_0$ (described by the functions (84.5)), and the displacement of these components will be $\pm2\mu_0H$.

PROBLEMS

PROBLEM 1. Determine the fine structure of the levels of parapositronium (V. B. Berestetskiĭ, 1949).†

SOLUTION. The required level splitting energy is given by the mean values of the correction terms in the Hamiltonian (84.1), calculated by means of the wave functions of the unperturbed states with different values of $j = l$ ($= 0, 1, \ldots, n - 1$). When $S = 0$, the only non-zero contributions come from $\hat V_1$ and the second term in $\hat V_3$.

The unperturbed wave functions, which we denote by $\psi$, satisfy Schrödinger's equation‡

$$\hat{\mathbf p}^2\psi = -\Delta\psi = \left(E + \frac1r\right)\psi, \qquad E = -1/4n^2.$$

Hence

$$\hat{\mathbf p}^4\psi = \hat{\mathbf p}^2\left(E + \frac1r\right)\psi = \left(E + \frac1r\right)^2\psi - \psi\,\Delta\frac1r - 2\left(\nabla\frac1r\cdot\nabla\psi\right) = \left(E + \frac1r\right)^2\psi + 4\pi\delta(\mathbf r)\psi + \frac{2}{r^2}\frac{\partial\psi}{\partial r}.$$

The mean value is

$$\overline{\mathbf p^4} = \overline{\left(E + \frac1r\right)^2} + 4\pi|\psi(0)|^2 + \int\!\!\int\frac{\partial|\psi|^2}{\partial r}\,dr\,do.$$
The integral is equal to $-\int|\psi(0)|^2\,do$; since $\psi(0) = 0$ except when $l = 0$, and the wave functions of S states are spherically symmetric, the integral is $-4\pi|\psi(0)|^2$ and cancels with the second term.

Using the orbital angular momentum operator $\hat{\mathbf l} = \mathbf r\times\hat{\mathbf p}$, we can write

$$\frac1r\left(\hat{\mathbf p}^2 + \frac{\mathbf r\cdot(\mathbf r\cdot\hat{\mathbf p})\hat{\mathbf p}}{r^2}\right) = \frac1r\left(2\hat{\mathbf p}^2 - \frac{\hat{\mathbf l}^2}{r^2} + \frac2r\frac{\partial}{\partial r}\right).$$

The other required mean value is therefore

$$\int\psi^*\,\frac1r\left(\hat{\mathbf p}^2 + \frac{\mathbf r\cdot(\mathbf r\cdot\hat{\mathbf p})\hat{\mathbf p}}{r^2}\right)\psi\,d^3x = 2\,\overline{\frac1r\left(E + \frac1r\right)} - 4\pi|\psi(0)|^2 - l(l + 1)\,\overline{r^{-3}};$$

if $l = 0$, the last term does not appear.

According to the familiar formulae in the theory of the hydrogen atom (QM (36.14), (36.16)), with the electron mass $m$ replaced by $\tfrac12m$, we have

$$\overline{r^{-1}} = \frac{1}{2n^2}, \qquad \overline{r^{-2}} = \frac{1}{2n^3(2l + 1)}, \qquad |\psi(0)|^2 = \frac{1}{8\pi n^3}, \qquad \overline{r^{-3}} = \frac{1}{4n^3l(l + 1)(2l + 1)} \quad (l \ne 0).$$

From these formulae, we find the required energy levels of parapositronium:

$$E = -\frac{me^4}{4\hbar^2n^2} - \frac{\alpha^2me^4}{2\hbar^2n^3}\left(\frac{1}{2l + 1} - \frac{11}{32n}\right).$$

† The fine structure of orthopositronium has been discussed by A. A. Sokolov and V. N. Tsytovich, Zhurnal éksperimental'noĭ i teoreticheskoĭ fiziki 24, 253, 1953.

‡ In the calculation it is convenient to use atomic units.

PROBLEM 2. Determine the difference between the energies of the ground states ($n = 1$, $l = 0$) of orthopositronium and parapositronium.

SOLUTION. The dependence of the energy on the total spin $S$ when $l = 0$ arises only from the mean value of the second term in $\hat V_3$; the first term gives zero on averaging over angles in the spherically symmetric S state.† The ground level of orthopositronium ($^3S_1$) lies above that of parapositronium ($^1S_0$) by an amount

$$E(^3S_1) - E(^1S_0) = \frac{7}{12}\,\alpha^2\,\frac{me^4}{\hbar^2} = 8.2\times10^{-4}\ \text{eV}.$$

§85. The interaction of atoms at large distances

Attractive forces act between two neutral atoms at a distance $r$ apart which is large compared with the dimensions of the atoms themselves. The usual quantum-mechanical calculation of these forces (see QM, §89) is, however, inapplicable at very large distances, because this calculation considered only the electrostatic interaction, i.e. retardation effects are ignored.
Such a treatment is valid only if the distance $r$ is small in comparison with the characteristic wavelengths $\lambda_0$ of the interacting atoms. In this section we shall give a calculation not subject to that limitation.

The procedure is much the same as in §83: the amplitude of elastic scattering (i.e. scattering without change of internal state) for two different atoms is calculated in the first non-vanishing approximation. The resulting expression is compared with the amplitude which would result if the interaction between the atoms were described by the potential energy $U(r)$. In the latter case, the first non-vanishing S-matrix element describing the process in question would be the first-approximation element

$$S_{fi} = -i\int\!\!\int\psi_1'^{*}(\mathbf r_1)\psi_2'^{*}(\mathbf r_2)\,U(r)\,\psi_1(\mathbf r_1)\psi_2(\mathbf r_2)\,d^3x_1\,d^3x_2 \times \int\exp\{-i(\varepsilon_1 + \varepsilon_2 - \varepsilon_1' - \varepsilon_2')t\}\,dt. \tag{85.1}$$

Here $\psi_1, \psi_2$ and $\psi_1', \psi_2'$ are the time-independent parts of the wave functions (plane waves), describing the translational motion of the two atoms with initial and final momenta; $\varepsilon_1, \varepsilon_2$ and $\varepsilon_1', \varepsilon_2'$ are the kinetic energies of this motion; the coordinates $\mathbf r_1$ and $\mathbf r_2$ of the atoms as a whole can be regarded as the coordinates of their nuclei, and the distance $r = |\mathbf r_1 - \mathbf r_2|$. The time integral in (85.1) gives, as usual, the delta function which expresses the law of conservation of energy. For convenience in the subsequent comparison, however, it is better to consider formally the limiting case of atoms of infinite mass; for given momenta, this limit corresponds to zero energies $\varepsilon$. Or we can say that the times considered are small in comparison with the periods $1/\varepsilon$.

† The averaging over angles must precede the integration over $r$, as is evident from the manner of calculation of the integral (83.14) which leads to the first term in $\hat V_3$.

Then (85.1) becomes

$$S_{fi} = -it\int\!\!\int\psi_1'^{*}\psi_2'^{*}\,U(r)\,\psi_1\psi_2\,d^3x_1\,d^3x_2, \tag{85.2}$$

where $t$ is the time integration range.
The actual calculation of the elastic-scattering amplitude, under these assumptions, can be divided into two stages. We first average the S-operator over the wave functions of the unchanged (ground) states of the two atoms (for given coordinates $\mathbf r_1$ and $\mathbf r_2$ of their nuclei) and over the photon vacuum: no photons are present at the beginning and end of the process. We then obtain a quantity which is a function of the distance between the nuclei, and which we denote by $\langle S(r)\rangle$.† In order to find the required transition matrix element, we have then to calculate the integral

$$S_{fi} = \int\!\!\int\psi_1'^{*}\psi_2'^{*}\,\langle S(r)\rangle\,\psi_1\psi_2\,d^3x_1\,d^3x_2. \tag{85.3}$$

† In place of the more lengthy notation for a diagonal matrix element, indicating the states of the atom and of the photon field.

Comparison with (85.2) shows that, if $\langle S(r)\rangle$ is obtained in the form $\langle S(r)\rangle = -itU(r)$, then the function $U(r)$ is the required energy of interaction of the atoms.

Since we are here concerned with a collision not of elementary particles but of more complicated systems, namely atoms, which may be excited in the intermediate states, the usual formal rules of the diagram technique are not directly applicable, and we shall begin from the expression of the S-operator as the expansion (72.10).

In the interaction of atoms, the important field components are those whose frequencies are of the order of atomic frequencies or less. The corresponding wavelengths are large compared with atomic dimensions. The electromagnetic interaction operator can therefore be taken in the form

$$\hat V = -\hat{\mathbf E}(\mathbf r_1)\cdot\hat{\mathbf d}_1 - \hat{\mathbf E}(\mathbf r_2)\cdot\hat{\mathbf d}_2, \tag{85.4}$$

where $\hat{\mathbf d}_1$, $\hat{\mathbf d}_2$ are the dipole moment operators of the atoms (i.e. the time-dependent or Heisenberg operators) and $\hat{\mathbf E}(\mathbf r)$ is the electric field operator at the positions of the corresponding atoms.

The mean values of the dipole moment of the atom in its stationary states are zero (QM, §75). Hence it follows that a non-zero amplitude occurs only in the fourth approximation of perturbation theory, i.e. as the matrix element of the operator

$$\hat S^{(4)} = \frac{(-i)^4}{4!}\int dt_1\ldots\int dt_4\;T\{\hat V(t_1)\hat V(t_2)\hat V(t_3)\hat V(t_4)\}: \tag{85.5}$$

in lower orders, every term in the product of operators $\hat V$ will contain at least one of the operators $\hat{\mathbf d}_1$ and $\hat{\mathbf d}_2$ in the first degree, and on averaging over the state of the corresponding atom the result is zero.

Let us now average the operator (85.5) over the photon vacuum. According to Wick's theorem, the expectation value of the product of four field operators $\hat{\mathbf E}$ is the sum of products of pairwise expectation values (contractions). The division into pairs can be made in three ways, which may be represented by the diagrams

[Diagrams (85.6): the three ways of joining the four points 1, 2, 3, 4 in pairs by broken lines.] (85.6)

where the broken lines represent contractions and the numbers correspond to the arguments $t_1, t_2, t_3, t_4$. Moreover, spatial coordinates $\mathbf r_1$ or $\mathbf r_2$ may correspond to each point, with two points having $\mathbf r_1$ and two $\mathbf r_2$, since otherwise, in the relevant term of the sum, one of the operators $\hat{\mathbf d}_1$ and $\hat{\mathbf d}_2$ will appear in the first degree, giving zero on averaging with respect to the state of the atom. It is clear that there must be one $\mathbf r_1$ and one $\mathbf r_2$ at the ends of each line, since otherwise the diagram (i.e. the corresponding term in the matrix element) will reduce to a product of independent functions of $\mathbf r_1$ and of $\mathbf r_2$ instead of being a function of the difference $\mathbf r_1 - \mathbf r_2$; such terms do not pertain to scattering.†

† They give corrections, of no interest here, to the energy eigenvalues of each atom.

In accordance with these conditions, the arguments $\mathbf r_1$ and $\mathbf r_2$ can be assigned to the four points in the diagram in four ways. Using also the commutativity of the operators $\hat{\mathbf d}_1$ and $\hat{\mathbf d}_2$ and averaging over the states of each atom, we find that all the $3\times4 = 12$ terms thus obtained are equal, differing only in the naming of the variables of integration. The result is

$$\langle S(r)\rangle = \tfrac12\int dt_1\ldots\int dt_4\;\langle T(E_i(\mathbf r_1, t_1)E_k(\mathbf r_2, t_2))\rangle\,\langle T(E_l(\mathbf r_2, t_3)E_m(\mathbf r_1, t_4))\rangle \times \langle T(d_{1i}(t_1)d_{1m}(t_4))\rangle\,\langle T(d_{2k}(t_2)d_{2l}(t_3))\rangle, \tag{85.7}$$

where $i, k, l, m$ are three-dimensional vector indices.

To calculate the quantities

$$D^E_{ik}(x_1 - x_2) = \langle T(E_i(x_1)E_k(x_2))\rangle \tag{85.8}$$

we use the gauge in which the scalar potential $\Phi = 0$. Then $\mathbf E = -\partial\mathbf A/\partial t$, and we have

$$D^E_{ik}(x_1 - x_2) = \frac{\partial^2}{\partial t_1\,\partial t_2}\langle T(A_i(x_1)A_k(x_2))\rangle = i\,\frac{\partial^2}{\partial t^2}D_{ik}(x),$$

where $x = x_1 - x_2$ and $D_{ik}(x)$ is the photon propagator in this gauge.‡

‡ The first derivative $\partial D_{ik}(t)/\partial t$ has a finite discontinuity at $t = 0$. The second derivative, i.e. the function $D^E_{ik}(t)$, therefore includes a delta-function term $\sim\delta^{(4)}(x_2 - x_1)$. This term, however, is zero for all $\mathbf r_1 \ne \mathbf r_2$ and is of no interest here.

We shall find it more convenient to use the propagator $D_{ik}(\omega, \mathbf r)$ in the mixed $\omega$-$\mathbf r$ representation, related to $D_{ik}(t, \mathbf r)$ by

$$D_{ik}(t, \mathbf r) = \int D_{ik}(\omega, \mathbf r)\,e^{-i\omega t}\,d\omega/2\pi,$$

with

$$D^E_{ik}(t, \mathbf r) = -i\int\omega^2D_{ik}(\omega, \mathbf r)\,e^{-i\omega t}\,d\omega/2\pi. \tag{85.9}$$

The quantities

$$\alpha_{ik}(t_1 - t_2) = i\langle T(d_i(t_1)d_k(t_2))\rangle \tag{85.10}$$

can be expressed as a Fourier integral

$$\alpha_{ik}(t) = \int_{-\infty}^{\infty}e^{-i\omega t}\,\alpha_{ik}(\omega)\,d\omega/2\pi.$$

Putting for convenience $t_2 = 0$, $t_1 = t$, and using the definition of the T product, we can write

$$\alpha_{ik}(\omega) = \int_{-\infty}^{\infty}e^{i\omega t}\alpha_{ik}(t)\,dt = i\int_{-\infty}^{0}e^{i\omega t}\langle d_k(0)d_i(t)\rangle\,dt + i\int_{0}^{\infty}e^{i\omega t}\langle d_i(t)d_k(0)\rangle\,dt. \tag{85.11}$$

The mean values (with respect to the ground state of the atom) which appear here can be expressed in terms of the matrix elements of the dipole moment:

$$\langle d_k(0)d_i(t)\rangle = \sum_n(d_k)_{0n}(d_i)_{n0}\,e^{i\omega_{n0}t}, \qquad \langle d_i(t)d_k(0)\rangle = \sum_n(d_i)_{0n}(d_k)_{n0}\,e^{-i\omega_{n0}t}.$$

For convergence of the integrals in (85.11) it is necessary to take $\omega$ in the first integral as $\omega - i0$, and in the second as $\omega + i0$. Carrying out the integrations, we obtain

$$\alpha_{ik}(\omega) = \sum_n\left\{\frac{(d_i)_{0n}(d_k)_{n0}}{\omega_{n0} - \omega - i0} + \frac{(d_k)_{0n}(d_i)_{n0}}{\omega_{n0} + \omega - i0}\right\}. \tag{85.12}$$

If the ground state is an S state, this tensor is simply a scalar, $\alpha_{ik}(\omega) = \alpha\delta_{ik}$, where

$$\alpha(\omega) = \frac13\sum_n|\mathbf d_{0n}|^2\left(\frac{1}{\omega_{n0} - \omega - i0} + \frac{1}{\omega_{n0} + \omega - i0}\right). \tag{85.13}$$

If, however, the atom has an angular momentum, the same result is obtained on averaging over the directions of this angular momentum, and it will be assumed that this has been done; we are, of course, interested in the interaction of atoms averaged over their mutual orientations.

Comparison of (85.12) with (59.17) shows that $\alpha_{ik}(\omega)$ is the same as the tensor for coherent scattering of a photon of frequency $\omega$ by an atom. According to (59.23), $\alpha(\omega)$ for $\omega > 0$ is the polarizability of the atom. Its values for $\omega < 0$ are expressed in terms of those for $\omega > 0$ by means of the relation $\alpha(-\omega) = \alpha(\omega)$, which is obvious from (85.13).

Substitution of these expressions in (85.7) gives

$$\langle S(r)\rangle = \tfrac12\int dt_1\ldots\int dt_4\int\frac{d\omega_1}{2\pi}\,\frac{d\omega_2}{2\pi}\,\frac{d\Omega_1}{2\pi}\,\frac{d\Omega_2}{2\pi} \times \alpha_1(\Omega_1)\alpha_2(\Omega_2)\,\omega_1^2D_{ik}(\omega_1, \mathbf r)\,\omega_2^2D_{ik}(\omega_2, \mathbf r) \times \exp\{-i\omega_1(t_1 - t_2) - i\omega_2(t_3 - t_4) - i\Omega_1(t_1 - t_4) - i\Omega_2(t_2 - t_3)\},$$

where $\mathbf r = \mathbf r_1 - \mathbf r_2$, and we have used the fact that $D_{ik}(\omega, \mathbf r)$ is an even function of $\mathbf r$. The integration over three times gives delta functions (whereby $-\Omega_1 = \Omega_2 = \omega_2 = \omega_1$), and that over the fourth time gives a factor $t$: $\langle S(r)\rangle = -itU(r)$, where

$$U(r) = \tfrac12i\int_{-\infty}^{\infty}\omega^4\alpha_1(\omega)\alpha_2(\omega)\,[D_{ik}(\omega, \mathbf r)]^2\,d\omega/2\pi. \tag{85.14}$$

This formula gives the energy of interaction of two atoms at any distance large compared with the atomic dimensions $a$.

We have now to find and insert an explicit expression for $D_{ik}(\omega, \mathbf r)$. Comparison of the expressions (76.14) and (76.8) shows that

$$D_{ik}(\omega, \mathbf k) = -\left(\delta_{ik} - \frac{k_ik_k}{\omega^2}\right)D(\omega, \mathbf k),$$

where $D(\omega, \mathbf k)$ is given by (76.8).
In the $\omega$-$\mathbf r$ representation, the relationship is correspondingly

$$D_{ik}(\omega, \mathbf r) = -\left(\delta_{ik} + \frac{1}{\omega^2}\frac{\partial^2}{\partial x_i\,\partial x_k}\right)D(\omega, r). \tag{85.15}$$

Substitution of $D(\omega, r)$ from (76.16), and carrying out the differentiations, gives

$$D_{ik}(\omega, \mathbf r) = \left[\delta_{ik}\left(1 + \frac{i}{|\omega|r} - \frac{1}{\omega^2r^2}\right) + \frac{x_ix_k}{r^2}\left(\frac{3}{\omega^2r^2} - \frac{3i}{|\omega|r} - 1\right)\right]\frac{e^{i|\omega|r}}{r}. \tag{85.16}$$

Then, substituting this expression in (85.14), we find by a simple calculation, using the fact that $\alpha(\omega)$ is even, the final expression for the interaction energy of the atoms:

$$U(r) = \frac{i}{\pi r^2}\int_0^\infty\omega^4\alpha_1(\omega)\alpha_2(\omega)\,e^{2i\omega r}\left[1 + \frac{2i}{\omega r} - \frac{5}{(\omega r)^2} - \frac{6i}{(\omega r)^3} + \frac{3}{(\omega r)^4}\right]d\omega. \tag{85.17}$$

This general result can be simplified in the limiting cases of "small" distances ($a \ll r \ll \lambda_0$) and "large" distances ($r \gg \lambda_0$).

When $r \ll \lambda_0$, the important values in the integral are (see below) $\omega \sim \omega_0$, where $\omega_0 \sim c/\lambda_0$ are the atomic frequencies, and therefore $\omega r \ll 1$. Then only the last term in the bracket need be retained, and the exponential may be replaced by unity. Writing the integral as one from $-\infty$ to $\infty$ (with a view to the subsequent calculations), we find

$$U(r) = \frac{3i}{2\pi r^6}\int_{-\infty}^{\infty}\alpha_1(\omega)\alpha_2(\omega)\,d\omega. \tag{85.18}$$

The interaction law at these distances proves to be $1/r^6$, as it should. The integral in (85.18) is easily calculated, after substitution of $\alpha(\omega)$ from (85.13), by closing the contour of integration with an infinite semicircle in the lower half of the complex $\omega$-plane; the integral is determined from the residues of the integrand at the poles $\omega = \omega_{n0} \sim \omega_0$. Assuming (to simplify the result) that the two atoms are identical, we find (in ordinary units)

$$U(r) = -\frac{2}{3r^6}\sum_{n,n'}\frac{|\mathbf d_{0n}|^2|\mathbf d_{0n'}|^2}{\hbar(\omega_{n0} + \omega_{n'0})}, \tag{85.19}$$

the same as the familiar London's formula (see QM, §89, Problem).

In the limit of large distances ($r \gg \lambda_0$), the important values in the integral are $\omega \lesssim c/r \ll \omega_0$; when $\omega \sim \omega_0$, the integral is made small by the rapidly oscillating factor $\exp(2i\omega r)$. We can therefore replace the polarizabilities $\alpha_1(\omega)$ and $\alpha_2(\omega)$ by their static values $\alpha_1(0)$ and $\alpha_2(0)$. The integration is then elementary.
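The elementary integration can be followed with exact rational arithmetic. The sketch below is illustrative only and rests on two stated assumptions: that the bracket in (85.17) has the coefficients $1,\ 2i,\ -5,\ -6i,\ 3$ for the successive powers of $1/\omega r$, and that each moment is regularized by $r \to r + i0$, so that $\int_0^\infty\omega^ne^{2i\omega r}\,d\omega = n!\,(i/2r)^{n+1}$. Complex numbers are represented as pairs of `Fraction`s, a device chosen here to keep the result exact; lengths are measured in units of $r$.

```python
from fractions import Fraction
from math import factorial


def cmul(a, b):
    """Product of two complex numbers stored as (real, imag) pairs."""
    return (a[0] * b[0] - a[1] * b[1], a[0] * b[1] + a[1] * b[0])


def moment(n):
    """Integral of w^n exp(2iw) over (0, inf) for r = 1: n! (i/2)^(n+1)."""
    half_i = (Fraction(0), Fraction(1, 2))
    value = (Fraction(1), Fraction(0))
    for _ in range(n + 1):
        value = cmul(value, half_i)
    return (factorial(n) * value[0], factorial(n) * value[1])


# Assumed bracket of (85.17): coefficient of (w r)^(-k), k = 0..4.
BRACKET = [(1, 0), (0, 2), (-5, 0), (0, -6), (3, 0)]


def large_distance_coefficient():
    """Coefficient C in U(r) = C * alpha1(0) alpha2(0) / (pi r^7)."""
    total = (Fraction(0), Fraction(0))
    for k, (re, im) in enumerate(BRACKET):
        term = cmul((Fraction(re), Fraction(im)), moment(4 - k))
        total = (total[0] + term[0], total[1] + term[1])
    # Overall factor i from the prefactor i/(pi r^2) in (85.17).
    return cmul((Fraction(0), Fraction(1)), total)


if __name__ == "__main__":
    print(large_distance_coefficient())
```

Under these assumptions the five moments add to a purely imaginary number, so that the coefficient is real and negative (an attraction), and the dependence on distance is $r^{-7}$ instead of the $r^{-6}$ law of (85.18).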
(To ensure convergence, $r$ in the exponential is to be replaced by $r + i0$.) The final result is (in ordinary units)

$$U(r) = -\frac{23\hbar c}{4\pi r^7}\,\alpha_1(0)\alpha_2(0)$$

(H. B. G. Casimir and D. Polder, 1948).†

† The derivation given here is due to I. E. Dzyaloshinskiĭ.

CHAPTER X

INTERACTION OF ELECTRONS WITH PHOTONS

§86. Scattering of a photon by an electron

The conservation of 4-momentum in the scattering of a photon by a free electron (the Compton effect) is expressed by the equation

$$p + k = p' + k', \tag{86.1}$$

where $p$ and $k$ are the 4-momenta of the electron and the photon before the collision, and $p'$ and $k'$ their 4-momenta after the collision. The kinematic invariants defined in §66 are

$$s = (p + k)^2 = (p' + k')^2 = m^2 + 2pk = m^2 + 2p'k',$$
$$t = (p - p')^2 = (k' - k)^2 = 2(m^2 - pp') = -2kk',$$
$$u = (p - k')^2 = (p' - k)^2 = m^2 - 2pk' = m^2 - 2p'k,$$
$$s + t + u = 2m^2. \tag{86.2}$$

The process in question is represented by the two Feynman diagrams (74.14), and its amplitude is

$$M_{fi} = -4\pi e^2\,e_\mu'^{*}e_\nu\,(\bar u'Q^{\mu\nu}u), \tag{86.3}$$

where

$$Q^{\mu\nu} = \frac{1}{s - m^2}\,\gamma^\mu(\gamma p + \gamma k + m)\gamma^\nu + \frac{1}{u - m^2}\,\gamma^\nu(\gamma p - \gamma k' + m)\gamma^\mu. \tag{86.4}$$

Here $e$, $e'$ are the polarization 4-vectors of the initial and final photons; $u$, $u'$ the bispinor amplitudes of the initial and final electrons.

According to the rules given in §65, for arbitrary polarization states of the particles $|M_{fi}|^2$ is replaced by

$$|M_{fi}|^2 \to 16\pi^2e^4\,{\rm tr}\,\{\rho^{(e)\prime}\rho^{(\gamma)\prime}_{\lambda\mu}\,Q^{\mu\nu}\,\rho^{(e)}\rho^{(\gamma)}_{\nu\sigma}\,\bar Q^{\lambda\sigma}\}, \tag{86.5}$$

where $\rho^{(e)}$, $\rho^{(e)\prime}$ are the density matrices of the initial and final electrons, $\rho^{(\gamma)}$, $\rho^{(\gamma)\prime}$ those of the photons. The photon (tensor) indices are written explicitly, but the electron (bispinor) indices are not. The trace symbol refers to the latter indices, as does the superscript plus in the definition $\bar Q^{\mu\nu} = \gamma^0Q^{\mu\nu+}\gamma^0$.

Let us consider the scattering of an unpolarized photon by an unpolarized electron, without regard to their polarizations after the scattering.
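The kinematic relations (86.1) and (86.2) can be checked on explicit 4-vectors. The following is a minimal sketch, assuming a frame in which the electron is initially at rest and using the standard Compton relation between the photon energies $\omega$, $\omega'$ and the scattering angle; the function names and the choice of axes are illustrative assumptions.

```python
import math


def dot4(a, b):
    """Minkowski product with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def add4(a, b):
    return tuple(x + y for x, y in zip(a, b))


def sub4(a, b):
    return tuple(x - y for x, y in zip(a, b))


def rest_frame_momenta(omega, theta, m=1.0):
    """4-momenta p, k, p', k' for an electron initially at rest.

    The incident photon moves along z; the scattered photon lies in the
    x-z plane at angle theta to it.
    """
    omega_prime = omega / (1.0 + (omega / m) * (1.0 - math.cos(theta)))
    p = (m, 0.0, 0.0, 0.0)
    k = (omega, 0.0, 0.0, omega)
    k_prime = (omega_prime, omega_prime * math.sin(theta), 0.0,
               omega_prime * math.cos(theta))
    p_prime = sub4(add4(p, k), k_prime)   # conservation law (86.1)
    return p, k, p_prime, k_prime


def invariants(p, k, p_prime, k_prime):
    """The invariants s, t, u of (86.2)."""
    s = dot4(add4(p, k), add4(p, k))
    t = dot4(sub4(p, p_prime), sub4(p, p_prime))
    u = dot4(sub4(p, k_prime), sub4(p, k_prime))
    return s, t, u


if __name__ == "__main__":
    m, omega, theta = 1.0, 2.5, 1.1
    p, k, pp, kp = rest_frame_momenta(omega, theta, m)
    s, t, u = invariants(p, k, pp, kp)
    print(dot4(pp, pp) - m * m)      # final electron on the mass shell
    print(s + t + u - 2 * m * m)     # last line of (86.2)
```

Both printed quantities should vanish to rounding accuracy; the first one is a test of the Compton relation itself, since only that value of $\omega'$ puts the recoil electron on its mass shell.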
The averaging with respect to the polarizations of all particles is given by the density matrices:

$$\rho^{(\gamma)}_{\mu\nu} = \rho^{(\gamma)\prime}_{\mu\nu} = -\tfrac12g_{\mu\nu}, \qquad \rho^{(e)} = \tfrac12(\gamma p + m), \qquad \rho^{(e)\prime} = \tfrac12(\gamma p' + m);$$

the change to summation over the polarizations of the final particles involves a further multiplication by $2\times2 = 4$. From formula (64.23), in which we must now put $I^2 = \tfrac14(s - m^2)^2$ (see (64.15a)), we find the cross-section

$$d\sigma = \frac{\pi e^4\,dt}{4(s - m^2)^2}\,{\rm tr}\,\{(\gamma p' + m)\,Q^{\mu\nu}\,(\gamma p + m)\,\bar Q_{\mu\nu}\}.$$

From (65.2a), $\bar Q_{\mu\nu} = Q_{\nu\mu}$. Separating the terms which differ only by the changes $k \leftrightarrow -k'$ (and accordingly $s \leftrightarrow u$), we can put the cross-section in the form

$$d\sigma = dt\,\frac{\pi e^4}{(s - m^2)^2}\,[f(s, u) + g(s, u) + f(u, s) + g(u, s)],$$

with the notation

$$f(s, u) = \frac{1}{4(s - m^2)^2}\,{\rm tr}\,\{(\gamma p' + m)\gamma^\mu(\gamma p + \gamma k + m)\gamma^\nu(\gamma p + m)\gamma_\nu(\gamma p + \gamma k + m)\gamma_\mu\},$$

$$g(s, u) = \frac{1}{4(s - m^2)(u - m^2)}\,{\rm tr}\,\{(\gamma p' + m)\gamma^\mu(\gamma p + \gamma k + m)\gamma^\nu(\gamma p + m)\gamma_\mu(\gamma p - \gamma k' + m)\gamma_\nu\};$$

this notation takes account of the fact that the result will depend only on the invariant quantities.

The summation over $\mu$ and $\nu$ is effected by means of formulae (22.6); then, omitting terms which contain an odd number of factors $\gamma$, we obtain

$$f(s, u) = \frac{1}{(s - m^2)^2}\,{\rm tr}\,\{(\gamma p')(\gamma p + \gamma k)(\gamma p)(\gamma p + \gamma k) + 4m^2(\gamma p + \gamma k)(\gamma k - \gamma p') + m^2(\gamma p)(\gamma p') + 4m^4\}.$$

The trace is calculated by means of formulae (22.13); expressing all quantities in terms of the invariants $s$ and $u$, we easily obtain

$$f(s, u) = \frac{2}{(s - m^2)^2}\,\{4m^4 - (s - m^2)(u - m^2) + 2m^2(s - m^2)\}.$$

Similarly,

$$g(s, u) = \frac{2m^2}{(s - m^2)(u - m^2)}\,\{4m^2 + (s - m^2) + (u - m^2)\}.$$

The cross-section is thus

$$d\sigma = 8\pi r_e^2\,\frac{m^2\,dt}{(s - m^2)^2}\left\{\left(\frac{m^2}{s - m^2} + \frac{m^2}{u - m^2}\right)^2 + \left(\frac{m^2}{s - m^2} + \frac{m^2}{u - m^2}\right) - \frac14\left(\frac{s - m^2}{u - m^2} + \frac{u - m^2}{s - m^2}\right)\right\}, \tag{86.6}$$

where $r_e = e^2/m$.

This formula expresses the cross-section in terms of invariant quantities, and can easily be used to express it in terms of the collision parameters in any specified frame of reference. Let us do this for the laboratory system, in which the electron is at rest before the collision: $p = (m, 0)$. Here $s - m^2 = 2m\omega$, $u - m^2 = -2m\omega'$.
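Formula (86.6) lends itself to a direct numerical test. The sketch below is not from the original text: it integrates (86.6) over $t$ at fixed $s$ by Simpson's rule and compares the result with two references, the classical Thomson value $8\pi r_e^2/3$ at low energy, and a closed form obtained here by carrying out the integration analytically (stated as an assumption of this sketch). The variables $x = (s - m^2)/m^2$ and $y = (m^2 - u)/m^2$, the integration range $x/(1 + x) \le y \le x$ (the standard kinematic limits for fixed $s$), and all function names are choices made for the illustration; cross-sections are in units of $r_e^2$.

```python
import math


def bracket(x, y):
    """Curly bracket of (86.6) with m^2/(s-m^2) = 1/x and m^2/(u-m^2) = -1/y."""
    a = 1.0 / x - 1.0 / y
    return a * a + a + 0.25 * (x / y + y / x)


def sigma_numeric(x, n=20000):
    """Total cross-section / r_e^2 by Simpson integration of (86.6); n even."""
    lo, hi = x / (1.0 + x), x
    h = (hi - lo) / n
    total = bracket(x, lo) + bracket(x, hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * bracket(x, lo + i * h)
    # m^2 dt / (s - m^2)^2 = dy / x^2
    return 8.0 * math.pi / x ** 2 * total * h / 3.0


def sigma_closed(x):
    """Assumed closed form of the same integral, in units of r_e^2."""
    log_term = (1.0 - 4.0 / x - 8.0 / x ** 2) * math.log1p(x)
    return 2.0 * math.pi / x * (log_term + 0.5 + 8.0 / x
                                - 0.5 / (1.0 + x) ** 2)


THOMSON = 8.0 * math.pi / 3.0

if __name__ == "__main__":
    for x in (0.01, 0.1, 1.0, 10.0):
        print(x, sigma_numeric(x) / THOMSON, sigma_closed(x) / THOMSON)
```

Both columns should agree and should tend to unity as $x \to 0$; the fall-off of the ratio at large $x$ illustrates the decrease of the cross-section with increasing energy.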
(86.7) Squaring the equation of conservation of 4-momentum in the form p + k- k' = p \ we have pfc-pk'-kk' = 0, whence (in the laboratory system) m(co - co') - ÜHD'O - cos # ) = 0, where fl is the angle of scattering of the photon. This equation gives the relation between the photon energy change and the scattering angle: oj <i) m The invariant t is t = - 2kk' = - 2WÛ)'(1 - cos #). For a given energy <o we find, using (86.8), at = 2<u'2 d cos # = (l/tr)a)'2 do' (do' = 2TT sin # dd). Substitution of these expressions in (86.6) gives the following formula for the scattering cross-section in the laboratory system: da=jr2(«:y(i2.+«I. \<û I \<t) b) sin * A I do> (86.9) (O. Klein and Y. Nishina, 1929; I. E. Tamm, 1930). Since the angle § is unambiguously related to o>' by (86.8), the cross-section can §86 Scattering of a Photon by an Electron 357 be expressed in terms of the energy co' of the scattered photon: * , . „ ; * * q i + -:+(«_ UV _ 2„ ( J. _ I) j, (86.10) with ay' varying in the range w 1 + 2ü>lm -w'^tu. (86.11) When a» < m, we can put a>' ~ a> in (86.9), and the result is, as it should be, the classical non-relativistic Thomson's formula der = ir 2 (l + cos 2 d) do'; (86.12) see Fields, (78.7). To calculate the total cross-section, we return to formula (86.6). The invariants s, t, u there take values satisfying the inequalities s^m2, fssO, us^m\ (86.13) These have already been derived in §67; the corresponding physical region is I in Fig. 7 (§67). They are also easily obtained directly from the expressions for the invariants in the centre-of-mass system. Here p + k = 0, and the energies e of the electron and <o of the photon are related by e = V(co2 + m2). The invariants are s =(e + (x>Y = m2 + 2O)(Û> + e), 1 u = m2 - 2o)(e + a) cos 0), (86.14) 2 t = -2û> (l-cos0), where 6 is the scattering angle (the angle between p and p' or between k and k'). The three inequalities (86.13) then result from the conditions w ^ O and -1*£ cos 6 «s 1. For a given s (i.e. 
a given energy of the particles), the integration with respect to $t$ can be replaced by one with respect to $u = 2m^2 - s - t$ over the range $m^4/s \ge u \ge 2m^2 - s$. Using instead of $s$ and $u$ the quantities
$$x = (s-m^2)/m^2, \qquad y = (m^2-u)/m^2, \tag{86.15}$$
we obtain
$$\sigma = \frac{8\pi r_e^2}{x^2}\int_{x/(x+1)}^{x}\left[\left(\frac1x-\frac1y\right)^2 + \left(\frac1x-\frac1y\right) + \frac14\left(\frac xy+\frac yx\right)\right]dy$$
and after the elementary integration
$$\sigma = 2\pi r_e^2\,\frac1x\left\{\left(1-\frac4x-\frac8{x^2}\right)\log(1+x) + \frac12 + \frac8x - \frac{1}{2(1+x)^2}\right\}. \tag{86.16}$$
The leading terms in the expansion for $x \ll 1$ (the non-relativistic case) are
$$\sigma = \frac{8\pi}{3}\,r_e^2\,(1-x). \tag{86.17}$$
In the ultra-relativistic case ($x \gg 1$), the expansion of (86.16) gives
$$\sigma = 2\pi r_e^2\,\frac1x\left(\log x + \tfrac12\right). \tag{86.18}$$
In the laboratory system,
$$x = 2\omega/m, \tag{86.19}$$
so that formulae (86.16) to (86.18) give immediately the photon energy dependence of the cross-section for scattering by an electron at rest. Figure 13 shows $\sigma$ as a function of $\omega/m$.

FIG. 13. Total cross-section $\sigma$ as a function of $\omega/m$.

In the ultra-relativistic case, the cross-section decreases with increasing energy both in the laboratory system ($\sigma \propto \omega^{-1}\log\omega$) and in the centre-of-mass system ($x \approx 4\omega^2/m^2$, $\sigma \propto \omega^{-2}\log\omega$). But the angular distribution in the ultra-relativistic case has quite different forms in these two frames of reference.

In the laboratory system, the differential cross-section has a sharp peak in the forward direction. In a narrow cone $\vartheta \lesssim \sqrt{m/\omega}$ we have $\omega' \sim \omega$ and the cross-section $d\sigma/do' \sim r_e^2$, reaching the value $r_e^2$ as $\vartheta \to 0$. Outside this cone, the cross-section decreases, and in the range $\vartheta^2 \gg m/\omega$ (where $\omega' \approx m/(1-\cos\vartheta)$) we have
$$\frac{d\sigma}{do'} = \frac{r_e^2\,m}{2\omega(1-\cos\vartheta)},$$
i.e. the cross-section is reduced by a factor $\sim\omega/m$.

In the centre-of-mass system, on the other hand, the differential cross-section has a peak in the backward direction. For $\pi - \theta \ll 1$ we have from (86.14)
$$\frac{s-m^2}{m^2} \approx \frac{4\omega^2}{m^2}, \qquad \frac{m^2-u}{m^2} \approx 1 + (\pi-\theta)^2\frac{\omega^2}{m^2}.$$
The largest term in the cross-section (86.6) is
$$d\sigma \approx 8\pi r_e^2\,\frac{m^2\,dt}{4(s-m^2)(m^2-u)},$$
whence
$$\frac{d\sigma}{do'} = \frac{r_e^2}{2[1+(\pi-\theta)^2\omega^2/m^2]}.$$
(86.20)

The cross-section $d\sigma/do' \sim r_e^2$ in a narrow cone $\pi - \theta \lesssim m/\omega$; outside this cone it is reduced by a factor of the order of $\omega^2/m^2$.

§87. Scattering of a photon by an electron. Polarization effects

We shall now go back to the original formulae of §86 and show how the calculations must be made in order to take account of the polarization of the initial and final photons and electrons.

The density matrix of the photon can be expressed, according to (8.17), by means of a pair of unit 4-vectors $e^{(1)}$, $e^{(2)}$ which satisfy the conditions (8.16). In the present case, these vectors can be taken to be, for both photons, the 4-vectors defined in §70†
$$e^{(1)} = N/\sqrt{-N^2}, \qquad e^{(2)} = P/\sqrt{-P^2}, \tag{87.1}$$
$$P^\lambda = (p^\lambda + p'^\lambda) - K^\lambda\,\frac{pK + p'K}{K^2}, \qquad N^\lambda = e^{\lambda\mu\nu\rho}P_\mu q_\nu K_\rho, \tag{87.2}$$
where
$$K^\lambda = k^\lambda + k'^\lambda, \qquad q^\lambda = k'^\lambda - k^\lambda = p^\lambda - p'^\lambda.$$

† An alternative procedure is to consider from the start a specified frame of reference (say the laboratory system) and take for each photon as $e^{(1)}$, $e^{(2)}$ purely spatial unit vectors $e = (0, \mathbf e)$ which are orthogonal to the photon momenta and to each other. In that case, however, the calculations will be entirely in three-dimensional form, and the result will not be invariant.

The quantities $Q^{\mu\nu}$ in (86.5) are given by (86.4). They may be regarded as components of a 4-tensor (in the sense that they form a 4-tensor after being contracted with spinors as the quantities $\bar u'Q^{\mu\nu}u$). All the components of a 4-tensor can be obtained by projecting it on four mutually orthogonal 4-vectors, for instance on $P$, $N$, $q$ and $K$ defined above. Since the tensors $\rho^{(\gamma)}_{\mu\nu}$, $\rho^{(\gamma)\prime}_{\mu\nu}$ contain only components along $P$ and $N$, we need in fact only the components of $Q^{\mu\nu}$ along these 4-vectors. In other words, it is sufficient to find in $Q_{\mu\nu}$ the terms of the form
$$Q_{\mu\nu} = Q_0\,(e^{(1)}_\mu e^{(1)}_\nu + e^{(2)}_\mu e^{(2)}_\nu) + Q_1\,(e^{(1)}_\mu e^{(2)}_\nu + e^{(2)}_\mu e^{(1)}_\nu) - iQ_2\,(e^{(1)}_\mu e^{(2)}_\nu - e^{(2)}_\mu e^{(1)}_\nu) + Q_3\,(e^{(1)}_\mu e^{(1)}_\nu - e^{(2)}_\mu e^{(2)}_\nu); \tag{87.3}$$
the remaining terms would disappear on substitution in (86.5).
The quantities $Q_0$ and $Q_3$ are scalars in the same sense that $Q_{\mu\nu}$ is a 4-tensor; they therefore contain the matrices $\gamma$ only in the "invariant" combinations $\gamma K$, etc. In the same sense, $Q_1$ and $Q_2$ are pseudoscalars ($N$ is a pseudovector), and hence must contain the matrix $\gamma^5$.

The quantities $Q_0, \ldots, Q_3$ are found by direct projection of the tensor $Q_{\mu\nu}$ on the 4-vectors $e^{(1)}$, $e^{(2)}$. In the calculation it is convenient first to express $Q_{\mu\nu}$ in terms of the mutually orthogonal 4-vectors $P$, $N$, $q$, $K$. There then remain some purely algebraic calculations using the formulae given in §22. It is also possible to make changes in $Q_{\mu\nu}$ which do not affect the result after the subsequent construction of the product $\bar u'Q_{\mu\nu}u$. For instance, since
$$\bar u'(\gamma p + \gamma p')u = 2m\,\bar u'u, \qquad \bar u'\gamma^5(\gamma q)u = \bar u'(\gamma^5(\gamma p) + (\gamma p')\gamma^5)u = 2m\,\bar u'\gamma^5 u,$$
we can make in $Q_{\mu\nu}$ the changes
$$\gamma p + \gamma p' \to 2m, \qquad \gamma^5(\gamma q) \to 2m\gamma^5. \tag{87.4}$$
The detailed calculations are omitted here; the final result is†
$$Q_0 = -ma_+, \qquad Q_1 = \tfrac12 i a_+\gamma^5(\gamma K), \qquad Q_2 = -ma_+\gamma^5, \qquad Q_3 = ma_+ + \tfrac12 a_-(\gamma K), \tag{87.5}$$
where
$$a_\pm = \frac{1}{s-m^2} \pm \frac{1}{u-m^2}.$$

† The expression (87.3) with the values (87.5) corresponds to the formulae (70.11) to (70.13) derived in §70 from general considerations. Besides the two equations which follow from T invariance, another invariant amplitude is here zero also. This is a property of the approximation of perturbation theory used here, and would not occur in higher approximations.

In the subsequent calculations, it is convenient to apply to $Q_{\mu\nu}$ the same formal treatment as has been described in §8 for the photon density matrix: the four components of the tensor (87.3) in the directions $e^{(1)}$, $e^{(2)}$ are combined to form a two-rowed matrix $Q$ which is then expanded in terms of Pauli matrices. Similarly to (8.18), we obtain
$$Q = Q_0 + \mathbf Q\cdot\boldsymbol\sigma, \qquad \mathbf Q = (Q_1, Q_2, Q_3). \tag{87.6}$$
The components of the tensor $\bar Q_{\mu\nu} = \gamma^0 Q^\dagger_{\mu\nu}\gamma^0$ in (86.5) are easily seen from (87.3), (87.5) and the rules (65.2a) to be obtained from those of $Q_{\mu\nu}$ on replacing $Q_0, Q_1, \ldots$ by $\bar Q_0, \bar Q_1,$
$\ldots$, where
$$\bar Q_0 = Q_0, \qquad \bar Q_1 = -Q_1, \qquad \bar Q_2 = -Q_2, \qquad \bar Q_3 = Q_3, \tag{87.7}$$
and simultaneously interchanging the indices $\mu$, $\nu$.† In matrix form,
$$\bar Q = \bar Q_0 + \bar{\mathbf Q}\cdot\boldsymbol\sigma. \tag{87.8}$$

† For the matrix $Q_{\mu\nu}$ in the original form (86.4) we should have simply $\bar Q_{\mu\nu} = Q_{\nu\mu}$. This property, however, is lost as a result of transformations such as (87.4).

Let us now define more precisely the sense of the 4-vectors $e^{(1)}$, $e^{(2)}$ in relation to the polarization of the photons. For each photon, the independent directions of polarization will be determined by the components of the 3-vectors $\mathbf e^{(1)}$, $\mathbf e^{(2)}$ transverse to the photon momentum $\mathbf k$.‡ It is easily seen that, in both the centre-of-mass system and the laboratory system (in which the initial electron is at rest), the vector $\mathbf P$ is in the plane of $\mathbf k$ and $\mathbf k'$, and $\mathbf N$ perpendicular to that plane. The direction $e^{(1)}$ is therefore that of the polarization perpendicular to the plane of scattering, and $e^{(2)}$ is that of the polarization in the plane of scattering.

‡ The longitudinal components of $\mathbf e$, like the time components of the 4-vectors $e$, can here be simply ignored; this is permissible, owing to gauge invariance.

It must also be noted that the Stokes parameters $\xi_1$, $\xi_2$, $\xi_3$ are defined with respect to the axes $xyz$, which form a right-handed set with the $z$-axis in the direction of $\mathbf k$. It is easily seen that for the initial photon the vectors $\mathbf N$, $\mathbf P_\perp$, $\mathbf k$ form such a set, and for the final photon the vectors $\mathbf N$, $-\mathbf P'_\perp$, $\mathbf k'$ (where $\mathbf P_\perp$ and $\mathbf P'_\perp$ are the components of $\mathbf P$ perpendicular to $\mathbf k$ and $\mathbf k'$ respectively). A change of sign of $e^{(2)}$ in the photon density matrix (8.17) is equivalent to a change of sign of $\xi_1$ and $\xi_2$. The density matrices of the initial and final photons, referred to the unit 4-vectors $e^{(1)}$ and $e^{(2)}$, are therefore as given in (87.9).

The tensor trace is now calculated as the trace of the product of the matrices (87.6) to (87.9), using (33.5).
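The result is
$$|M_{fi}|^2 = 8\pi^2e^4\,\mathrm{tr}\{(\rho^{(e)\prime}Q_0\rho^{(e)}\bar Q_0 + \rho^{(e)\prime}\mathbf Q\cdot\rho^{(e)}\bar{\mathbf Q}) + (\boldsymbol\xi+\boldsymbol\xi')\cdot(\rho^{(e)\prime}Q_0\rho^{(e)}\bar{\mathbf Q} + \rho^{(e)\prime}\mathbf Q\rho^{(e)}\bar Q_0) - i(\boldsymbol\xi-\boldsymbol\xi')\cdot\rho^{(e)\prime}\mathbf Q\times\rho^{(e)}\bar{\mathbf Q}$$
$$\qquad + (\boldsymbol\xi\cdot\boldsymbol\xi')(\rho^{(e)\prime}Q_0\rho^{(e)}\bar Q_0 - \rho^{(e)\prime}\mathbf Q\cdot\rho^{(e)}\bar{\mathbf Q}) + \rho^{(e)\prime}(\boldsymbol\xi\cdot\mathbf Q)\rho^{(e)}(\boldsymbol\xi'\cdot\bar{\mathbf Q}) + \rho^{(e)\prime}(\boldsymbol\xi'\cdot\mathbf Q)\rho^{(e)}(\boldsymbol\xi\cdot\bar{\mathbf Q})$$
$$\qquad - i\,\boldsymbol\xi\times\boldsymbol\xi'\cdot(\rho^{(e)\prime}Q_0\rho^{(e)}\bar{\mathbf Q} - \rho^{(e)\prime}\mathbf Q\rho^{(e)}\bar Q_0)\}. \tag{87.10}$$

SCATTERING BY UNPOLARIZED ELECTRONS

We shall complete the calculation of the cross-section for the scattering of polarized photons by an unpolarized electron, summed over polarizations of the final electron. To do so, we must put in (87.10)
$$\rho^{(e)} = \tfrac12(\gamma p + m), \qquad \rho^{(e)\prime} = \tfrac12(\gamma p' + m),$$
double the result, and substitute it in place of $|M_{fi}|^2$ in the formula (64.22) for the cross-section:
$$d\sigma = \frac{1}{32\pi^2}\,|M_{fi}|^2\,\frac{dt\,d\varphi}{(s-m^2)^2},$$
where $\varphi$ is the azimuth in the centre-of-mass or laboratory system. Some of the terms in (87.10) are identically zero; the calculation of the other terms gives the final result (with the notation (86.15))
$$d\sigma = \tfrac12\,d\bar\sigma + 2r_e^2\,\frac{dy\,d\varphi}{x^2}\Big\{(\xi_3+\xi_3')\Big[-\Big(\frac1x-\frac1y\Big)^2 - \Big(\frac1x-\frac1y\Big)\Big] + \xi_1\xi_1'\Big(\frac1x-\frac1y+\frac12\Big) + \frac14\,\xi_2\xi_2'\Big(\frac xy+\frac yx\Big)\Big(1+\frac2x-\frac2y\Big) + \xi_3\xi_3'\Big[\Big(\frac1x-\frac1y\Big)^2 + \Big(\frac1x-\frac1y\Big) + \frac12\Big]\Big\}. \tag{87.11}$$
Here $d\bar\sigma$ is the scattering cross-section for unpolarized photons given by (86.9); the factor $\tfrac12$ appears because there is no summation over the polarizations of the final photon in (87.11).

In the laboratory system, formula (87.11) becomes
$$d\sigma = \tfrac14 r_e^2\left(\frac{\omega'}{\omega}\right)^2 do'\,\{F_0 + F_3(\xi_3+\xi_3') + F_{11}\xi_1\xi_1' + F_{22}\xi_2\xi_2' + F_{33}\xi_3\xi_3'\}, \qquad do' = \sin\vartheta\,d\vartheta\,d\varphi, \tag{87.12}$$
where
$$F_0 = \frac{\omega}{\omega'}+\frac{\omega'}{\omega}-\sin^2\vartheta, \qquad F_3 = \sin^2\vartheta, \qquad F_{11} = 2\cos\vartheta, \qquad F_{22} = \left(\frac{\omega}{\omega'}+\frac{\omega'}{\omega}\right)\cos\vartheta, \qquad F_{33} = 1+\cos^2\vartheta \tag{87.13}$$
(U. Fano, 1949).

Although (87.12) shows no explicit dependence on the azimuth $\varphi$ of the scattering plane, there is an implicit dependence, since the parameters $\xi_1$, $\xi_2$, $\xi_3$ are defined with respect to the axes $xyz$, which are fixed to the scattering plane. The $x$-axis is the same for both photons and perpendicular to the scattering plane: $x \parallel \mathbf k\times\mathbf k'$, and the $y$-axes are in that plane: $y \parallel \mathbf k\times(\mathbf k\times\mathbf k')$, $y' \parallel \mathbf k'\times(\mathbf k\times\mathbf k')$.

Taking the sum of cross-sections differing in the sign of $\boldsymbol\xi'$ (i.e.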
putting $\boldsymbol\xi' = 0$ and doubling the result), we obtain the total cross-section (summed over polarizations of the final photon) for scattering of a polarized photon by an unpolarized electron. Denoting this cross-section by $d\sigma(\xi)$, we have
$$d\sigma(\xi) = \tfrac12 r_e^2\,(\omega'/\omega)^2\,F\,do', \tag{87.14}$$
where
$$F = F_0 + \xi_3F_3 = \frac{\omega}{\omega'}+\frac{\omega'}{\omega}-(1-\xi_3)\sin^2\vartheta. \tag{87.15}$$
We see that the scattering cross-section for photons polarized perpendicular to the scattering plane ($\xi_3 = 1$) is greater than that for photons polarized in the scattering plane ($\xi_3 = -1$). The cross-section is independent of circular polarization and of the parameter $\xi_1$. The scattering cross-section is therefore equal to that for unpolarized photons if there is no linear polarization relative to the $x$ and $y$ axes ($\xi_3 = 0$) or even if there is polarization relative to directions at 45° to these axes.

The cross-section for scattering of unpolarized photons with detection of a polarized photon has similar properties. This cross-section, which we denote by $d\sigma(\xi')$, is obtained from (87.12) by putting $\boldsymbol\xi = 0$:
$$d\sigma(\xi') = \tfrac14 r_e^2\,(\omega'/\omega)^2\,F'\,do', \qquad F' = F_0 + \xi_3'F_3. \tag{87.16}$$
From formula (87.12) it is also possible to deduce the polarization of the secondary photon itself; we shall denote the parameters of this polarization by $\xi_i^{(f)}$ to distinguish them from the detected polarization $\xi_i'$. According to the rules given in §65, the quantities $\xi_i^{(f)}$ are equal to the ratios of the coefficients of the $\xi_i'$ to the term independent of $\boldsymbol\xi'$:
$$\xi_1^{(f)} = (F_{11}/F)\,\xi_1, \qquad \xi_2^{(f)} = (F_{22}/F)\,\xi_2, \qquad \xi_3^{(f)} = (F_3 + F_{33}\xi_3)/F. \tag{87.17}$$
In particular, for the scattering of an unpolarized photon
$$\xi_1^{(f)} = \xi_2^{(f)} = 0, \qquad \xi_3^{(f)} = \frac{\sin^2\vartheta}{\omega/\omega' + \omega'/\omega - \sin^2\vartheta}. \tag{87.18}$$
Here $\xi_3^{(f)} > 0$, i.e. the secondary photon is polarized perpendicular to the scattering plane.

Circular polarization of the secondary photon occurs only if the primary photon is circularly polarized: $\xi_2^{(f)} \ne 0$ only if $\xi_2 \ne 0$.
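As a numerical illustration of the laboratory-frame formulae (86.8), (87.14), (87.15) and (87.18), the following sketch evaluates the polarization-dependent factor $F$ and checks that the unpolarized cross-section ($\xi_3 = 0$), integrated over solid angle, reproduces the closed-form total cross-section written in terms of $x = 2\omega/m$ (the standard Klein-Nishina total cross-section, which the text refers to as (86.16)). The function names, the choice of units $m = r_e = 1$ and the midpoint integration rule are assumptions made for this example only.

```python
import math

def scattered_energy(omega, theta, m=1.0):
    """Energy of the scattered photon in the laboratory system, from (86.8)."""
    return omega / (1.0 + (omega / m) * (1.0 - math.cos(theta)))

def F_factor(omega, theta, xi3=0.0, m=1.0):
    """F = F0 + xi3*F3 of (87.15)."""
    w1 = scattered_energy(omega, theta, m)
    return omega / w1 + w1 / omega - (1.0 - xi3) * math.sin(theta) ** 2

def dsigma_domega(omega, theta, xi3=0.0, m=1.0, r_e=1.0):
    """Cross-section (87.14) per unit solid angle, summed over final polarizations."""
    w1 = scattered_energy(omega, theta, m)
    return 0.5 * r_e**2 * (w1 / omega) ** 2 * F_factor(omega, theta, xi3, m)

def total_cross_section_numeric(omega, m=1.0, r_e=1.0, n=20000):
    """Midpoint-rule integral of the unpolarized cross-section over the solid angle."""
    h = math.pi / n
    total = 0.0
    for i in range(n):
        th = (i + 0.5) * h
        total += dsigma_domega(omega, th, 0.0, m, r_e) * 2.0 * math.pi * math.sin(th) * h
    return total

def total_cross_section_closed(omega, m=1.0, r_e=1.0):
    """Closed-form total cross-section in terms of x = 2*omega/m."""
    x = 2.0 * omega / m
    return 2.0 * math.pi * r_e**2 / x * (
        (1.0 - 4.0 / x - 8.0 / x**2) * math.log1p(x)
        + 0.5 + 8.0 / x - 1.0 / (2.0 * (1.0 + x) ** 2)
    )

def secondary_polarization(omega, theta, m=1.0):
    """xi3^(f) of (87.18) for an unpolarized incident photon."""
    return math.sin(theta) ** 2 / F_factor(omega, theta, 0.0, m)
```

For example, `F_factor(1.0, math.pi / 2, xi3=1.0)` exceeds `F_factor(1.0, math.pi / 2, xi3=-1.0)`, in line with the statement that photons polarized perpendicular to the scattering plane scatter more strongly.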
Let us consider the case of complete linear polarization of the incident photon ($\xi_2 = 0$, $\xi_1^2 + \xi_3^2 = 1$), and find the cross-section for scattering with detection of a linearly polarized secondary photon. Expressing the parameters $\xi_i$ and $\xi_i'$ in terms of the components of the photon polarization vectors $\mathbf e$ and $\mathbf e'$, we obtain the following expression for the scattering cross-section:
$$d\sigma = \tfrac14 r_e^2\left(\frac{\omega'}{\omega}\right)^2\left(\frac{\omega}{\omega'}+\frac{\omega'}{\omega}-2+4\cos^2\Theta\right)do', \tag{87.19}$$
where $\Theta$ is the angle between the directions of polarization of the incident and scattered photons.†

† Formula (87.19) itself could be more simply derived by writing from the start $e = (0, \mathbf e)$, $e' = (0, \mathbf e')$ in the scattering amplitude (86.3) and continuing the calculation of the squared amplitude in three-dimensional form (i.e. separating the time and space components of the 4-vectors). On averaging $\cos^2\Theta = (\mathbf e\cdot\mathbf e')^2$ over the directions of $\mathbf e$ and $\mathbf e'$ (using (45.4a)), and doubling the cross-section (to sum over $\mathbf e'$), we of course return to (86.9).

According to this formula, the cross-section behaves quite differently when the polarizations $\mathbf e$ and $\mathbf e'$ are perpendicular and when they are parallel. Distinguishing these two cases by the suffixes $\perp$ and $\parallel$, we have in the non-relativistic limit ($\omega \ll m$, $\omega' \approx \omega$)
$$d\sigma_\perp = 0, \qquad d\sigma_\parallel = r_e^2\cos^2\Theta\,do', \tag{87.20}$$
in agreement with the classical formulae. In the opposite, ultra-relativistic, case we have $\omega \gg m$, $\omega' \approx m/(1-\cos\vartheta)$. Here the two ranges of large and small angles (large and small $\omega/\omega'$) must be distinguished:
$$d\sigma_\perp = d\sigma_\parallel = \tfrac14 r_e^2\,\frac{\omega'}{\omega}\,do' = \tfrac14 r_e^2\,\frac{m}{\omega(1-\cos\vartheta)}\,do' \quad\text{for } \vartheta^2 \gg m/\omega;$$
$$d\sigma_\perp = 0, \qquad d\sigma_\parallel = r_e^2\cos^2\Theta\,do' \quad\text{for } \vartheta^2 \ll m/\omega. \tag{87.21}$$
We see that the scattering cross-section has its classical value at very small angles.
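The remark that averaging (87.19) over the directions of $\mathbf e$ and $\mathbf e'$ and doubling the result leads back to (86.9) can be checked numerically. The sketch below is an illustration only: it assumes $\mathbf k$ along the $z$-axis and $\mathbf k'$ in the $xz$-plane, parametrizes the real transverse polarization vectors by angles on a uniform grid, and uses units $m = r_e = 1$; none of these names or choices comes from the text.

```python
import math

def scattered_energy(omega, theta, m=1.0):
    """Energy of the scattered photon in the laboratory system, from (86.8)."""
    return omega / (1.0 + (omega / m) * (1.0 - math.cos(theta)))

def dsigma_linear(omega, theta, cos_Theta, m=1.0, r_e=1.0):
    """Cross-section (87.19) per unit solid angle for given linear polarizations."""
    w1 = scattered_energy(omega, theta, m)
    return 0.25 * r_e**2 * (w1 / omega) ** 2 * (
        omega / w1 + w1 / omega - 2.0 + 4.0 * cos_Theta**2
    )

def klein_nishina(omega, theta, m=1.0, r_e=1.0):
    """Unpolarized cross-section (86.9) per unit solid angle."""
    w1 = scattered_energy(omega, theta, m)
    return 0.5 * r_e**2 * (w1 / omega) ** 2 * (
        omega / w1 + w1 / omega - math.sin(theta) ** 2
    )

def averaged_and_summed(omega, theta, n=16, m=1.0, r_e=1.0):
    """Average (87.19) over the directions of e and e', then double (sum over e')."""
    total = 0.0
    for i in range(n):
        a = 2.0 * math.pi * i / n
        # e is transverse to k = (0, 0, 1)
        e = (math.cos(a), math.sin(a), 0.0)
        for j in range(n):
            b = 2.0 * math.pi * j / n
            # e' is transverse to k' = (sin(theta), 0, cos(theta))
            e1 = (math.cos(b) * math.cos(theta), math.sin(b), -math.cos(b) * math.sin(theta))
            cos_Theta = sum(p * q for p, q in zip(e, e1))
            total += dsigma_linear(omega, theta, cos_Theta, m, r_e)
    return 2.0 * total / (n * n)
```

The average of $(\mathbf e\cdot\mathbf e')^2$ on this grid equals $\tfrac14(1+\cos^2\vartheta)$, which is why the doubled average coincides with `klein_nishina` up to rounding.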
The approximate equality of $d\sigma_\perp$ and $d\sigma_\parallel$ at angles which are not very small signifies that in this range, in the ultra-relativistic case, the scattered radiation is unpolarized; but it must be emphasized that this conclusion applies specifically to a linearly polarized incident photon. From (87.17) it is evident that, for a circularly polarized photon in the ultra-relativistic case, $\xi_2^{(f)} \approx \xi_2\cos\vartheta$.

SCATTERING BY POLARIZED ELECTRONS

For polarized electrons, the calculation of the traces in formula (87.10) becomes very laborious, though not difficult in principle. Here we shall give only some of the final results of the calculation.†

In general, the cross-section depends both on the polarization parameters $\boldsymbol\xi$ and $\boldsymbol\xi'$ of the initial and final photons, and on the polarizations of the initial and final electrons, described by vectors $\boldsymbol\zeta$ and $\boldsymbol\zeta'$. The dependence on each of these parameters is linear. The cross-section has the form
$$d\sigma = \tfrac12\,d\sigma(\xi,\xi') + \tfrac18 r_e^2\,(\omega'/\omega)^2\,do'\,\{\mathbf f\cdot\boldsymbol\zeta\,\xi_2 + \mathbf f'\cdot\boldsymbol\zeta\,\xi_2' + \mathbf g\cdot\boldsymbol\zeta'\,\xi_2 + \mathbf g'\cdot\boldsymbol\zeta'\,\xi_2' + G_{ik}\zeta_i\zeta_k' + \ldots\}, \tag{87.22}$$
where $d\sigma(\xi,\xi')$ is the cross-section (87.12). All the terms which contain products of two polarization parameters have been written out in (87.22). Terms containing products of three or four parameters have been omitted; they are unimportant as regards correlations between the polarizations of only two particles, and disappear when the polarization parameters of the other two particles are equated to zero.

The following are the values of some of the coefficients in the laboratory system:
$$\mathbf f = -\frac1m(1-\cos\vartheta)(\mathbf k\cos\vartheta + \mathbf k'), \qquad \mathbf f' = -\frac1m(1-\cos\vartheta)(\mathbf k + \mathbf k'\cos\vartheta),$$
$$\mathbf g = -\frac1m(1-\cos\vartheta)\left[(\mathbf k\cos\vartheta + \mathbf k') - (1+\cos\vartheta)\,\frac{\omega+\omega'}{\omega-\omega'+2m}\,(\mathbf k-\mathbf k')\right],$$
$$\mathbf g' = -\frac1m(1-\cos\vartheta)\left[(\mathbf k + \mathbf k'\cos\vartheta) - (1+\cos\vartheta)\,\frac{\omega+\omega'}{\omega-\omega'+2m}\,(\mathbf k-\mathbf k')\right]. \tag{87.23}$$
The cross-section (87.22) contains no term of the form $\mathbf G\cdot\boldsymbol\zeta$. This signifies that the polarization of the electron does not affect the total cross-section (summed over $\boldsymbol\xi'$ and $\boldsymbol\zeta'$) for the scattering of unpolarized photons.
There is also no term of the form $\mathbf G'\cdot\boldsymbol\zeta'$. This signifies that, in the scattering of unpolarized photons, the recoil electron is unpolarized.

† Further details may be found in the review articles by H. A. Tolhoek, Reviews of Modern Physics 28, 277, 1956; W. H. McMaster, ibid. 33, 8, 1961.

We see also that the terms bilinear in the polarizations of the electron and photon contain only the parameters $\xi_2$ and $\xi_2'$ which correspond to circular polarization of the photon. The polarization vectors $\boldsymbol\zeta$ and $\boldsymbol\zeta'$ of the electrons appear in the form of scalar products $\mathbf f\cdot\boldsymbol\zeta$, etc., which contain only the projections of these vectors on the scattering plane. Hence, for example, the cross-section for scattering of a polarized photon by a polarized electron,
$$d\sigma(\xi,\zeta) = d\sigma(\xi) + \tfrac12 r_e^2\,(\omega'/\omega)^2\,\xi_2\,\mathbf f\cdot\boldsymbol\zeta\,do', \tag{87.24}$$
differs from $d\sigma(\xi)$ only if the photon is circularly polarized and the electrons have a non-zero projection of the mean spin on the scattering plane. For the same reason, the recoil electron is polarized only if the photon is circularly polarized; the resulting electron polarization vector is then in the scattering plane:
$$\boldsymbol\zeta^{(f)} = \xi_2\,\mathbf g/F. \tag{87.25}$$

SYMMETRY RELATIONS

Finally, we shall show that the qualitative properties of the polarization effects in the scattering of photons by electrons follow from the general requirements of symmetry.

The parameter $\xi_2$ of circular polarization is a pseudoscalar (see §8). Hence, from the requirement of P invariance, terms $\propto\xi_2$ (or $\propto\xi_2'$) in the scattering cross-section could occur only as the product of $\xi_2$ with some pseudoscalar formed from the available vectors $\mathbf k$ and $\mathbf k'$.† But a pseudoscalar cannot be formed from two polar vectors. It therefore follows that no such terms can appear in the cross-section.
† We are considering the process in the laboratory system, where $\mathbf p = 0$, $\mathbf p' = \mathbf k - \mathbf k'$. It is evident that the relevant consequences of the symmetry requirements (the presence or absence of particular terms in the cross-section) will not depend on the choice of the frame of reference.

The parameters $\xi_1$ and $\xi_3$ of linear polarization are related to the components of the two-dimensional (in a plane perpendicular to $\mathbf k$) symmetric tensor
$$S_{\alpha\beta} = \tfrac12(\rho_{\alpha\beta} + \rho_{\beta\alpha}).$$
In the present case, one of the polarization axes is taken to lie along the vector $\boldsymbol\nu = \mathbf k\times\mathbf k'$, and the other lies in the plane of $\mathbf k$ and $\mathbf k'$ (along $\mathbf k\times\boldsymbol\nu$ for one photon and along $\mathbf k'\times\boldsymbol\nu$ for the other). Terms $\propto\xi_1$ could occur in the cross-section only as products $S_{\alpha\beta}\nu_\alpha(\mathbf k'\times\boldsymbol\nu)_\beta$ (or, equivalently, $S_{\alpha\beta}\nu_\alpha k'_\beta$), etc. But, since $\boldsymbol\nu$ is an axial vector, $\mathbf k$ a polar vector, and $S_{\alpha\beta}$ a true tensor, such products are not invariant with respect to inversion. There are therefore also no terms $\propto\xi_1$ (or $\propto\xi_1'$) in the cross-section. Terms $\propto\xi_3$ (or $\propto\xi_3'$), however, occur as products $S_{\alpha\beta}\nu_\alpha\nu_\beta$, etc., and are not forbidden by considerations of symmetry.

Terms in the cross-section that are proportional to the electron polarization $\boldsymbol\zeta$ are not forbidden by parity: such terms could arise from the products $\boldsymbol\zeta\cdot\boldsymbol\nu$ of two axial vectors. They must, however, be absent in the first non-vanishing approximation of perturbation theory, here considered, because the scattering matrix is Hermitian in that approximation (§71). Owing to this property, the square of the scattering amplitude (and therefore the cross-section) is unchanged when the initial and final states are interchanged. At the same time the cross-section must be invariant under time reversal, i.e. interchange of the initial and final states together with a change of sign of the momentum and angular momentum vectors of all the particles; the Stokes parameters $\xi_1$, $\xi_2$, $\xi_3$ are then unaltered (see §8).
On combining these two requirements, we find that in the approximation considered the cross-section must be unchanged by a change of sign of all the momenta and angular momenta without interchange of the initial and final states, i.e. by the transformation
$$\mathbf k \to -\mathbf k, \qquad \mathbf k' \to -\mathbf k', \qquad \boldsymbol\zeta \to -\boldsymbol\zeta, \qquad \boldsymbol\zeta' \to -\boldsymbol\zeta' \tag{87.26}$$
with $\boldsymbol\xi$ and $\boldsymbol\xi'$ unaltered. The transformation (87.26) changes the sign of the product $\boldsymbol\zeta\cdot\boldsymbol\nu$, and such terms therefore cannot appear in the cross-section. It must be emphasized, however, that this prohibition is not a consequence of strict requirements of symmetry, and may therefore no longer apply in higher approximations of perturbation theory.

Among the terms of the binary correlation between the polarizations of the photons, only those of the form $\xi_1\xi_3'$ and $\xi_2\xi_3'$ are forbidden by parity, and none of those of the photon-electron correlation are forbidden. But all terms of the form $\xi_1\xi_2'$, $\xi_1\boldsymbol\zeta$, $\xi_3\boldsymbol\zeta$ are forbidden in the first approximation by the requirement of invariance under the transformation (87.26). For instance, terms of the form $\xi_1\xi_2'$ and $\xi_1\boldsymbol\zeta$ could be formed (so far as parity is concerned) as scalars such as $\xi_2'S_{\alpha\beta}k'_\alpha\nu_\beta$ and $S_{\alpha\beta}k'_\alpha\nu_\beta\,\boldsymbol\zeta\cdot\mathbf k$, but such combinations change sign under the transformation (87.26).

The allowed correlation terms of the form $\xi_2\boldsymbol\zeta$ can be formed as products of the type $\xi_2\,\boldsymbol\zeta\cdot\mathbf k$. The electron polarization vectors appear in them only as projections on the scattering plane.

Finally, a number of relations between the coefficients in the allowed terms result from the requirements of crossing symmetry. Reaction channels which differ by an interchange of initial and final photons correspond to the same process, the scattering of a photon by an electron. The squared modulus of the amplitude, and therefore the scattering cross-section, must consequently be invariant under a transformation which expresses the change from one of these channels to the other: $k \leftrightarrow -k'$, $e \leftrightarrow e'^*$, with the electron momenta and polarizations unchanged.
In three-dimensional form, this transformation is
$$\omega \leftrightarrow -\omega', \qquad \mathbf k \leftrightarrow -\mathbf k', \qquad \xi_1 \leftrightarrow \xi_1', \qquad \xi_2 \leftrightarrow -\xi_2', \qquad \xi_3 \leftrightarrow \xi_3'. \tag{87.27}$$
The change in the sign of $\xi_2$ is evident from the expression $\xi_2 = i\,\mathbf e\times\mathbf e^*\cdot\mathbf n$, in which the vector $\mathbf e\times\mathbf e^*$ changes sign when $\mathbf e$ and $\mathbf e^*$ are interchanged, while the vector $\mathbf n = \mathbf k/\omega$ is unchanged when $\mathbf k \to -\mathbf k$, $\omega \to -\omega$. The transformation (87.27) does not affect the electron momenta and therefore leaves the laboratory system unaltered. Hence the cross-section (87.22) cannot change its form under this transformation, and in fact the formulae (87.12), (87.22), (87.23) comply with this requirement.

§88. Two-photon annihilation of an electron pair

The annihilation of an electron and a positron (with 4-momenta $p_-$ and $p_+$) to form two photons ($k_1$ and $k_2$) corresponds to two diagrams

[Two Feynman diagrams: electron line $p_- \to -p_+$ emitting photons $k_1$, $k_2$, in both orders] (88.1)

These differ from the diagrams for scattering of a photon by an electron as follows:
$$p \to p_-, \qquad p' \to -p_+, \qquad k \to -k_1, \qquad k' \to k_2. \tag{88.2}$$
The two processes are two cross-channels of the same (generalized) reaction. After the changes (88.2), the kinematic invariants (86.2) become
$$s = (p_- - k_1)^2, \qquad t = (p_- + p_+)^2 = (k_1 + k_2)^2, \qquad u = (p_- - k_2)^2. \tag{88.3}$$
If the photon scattering is the $s$ channel, then the annihilation is the $t$ channel.

The quantity $|M_{fi}|^2$ for annihilation (averaged over polarizations of the electrons and summed over those of the photons), when expressed in terms of the invariants $s$ and $u$, is obtained from the corresponding quantity for scattering, only the meaning of the invariants being changed.† In the formula (64.23) for the cross-section, the change $s \leftrightarrow t$ is needed in the coefficients of $|M_{fi}|^2$, and $I^2$ is now, according to (64.15a), equal to $\tfrac14t(t-4m^2)$. Making the appropriate alterations in formula (86.6) (the expression in the braces has the opposite sign to that in (86.6), so that it is positive in the physical region of annihilation), we find the annihilation cross-section
$$d\sigma = 8\pi r_e^2\,\frac{m^2\,ds}{t(t-4m^2)}\left\{-\left(\frac{m^2}{s-m^2}+\frac{m^2}{u-m^2}\right)^2 - \left(\frac{m^2}{s-m^2}+\frac{m^2}{u-m^2}\right) + \frac14\left(\frac{s-m^2}{u-m^2}+\frac{u-m^2}{s-m^2}\right)\right\}. \tag{88.4}$$
The physical region of the annihilation channel is region II in Fig. 7 (§67). For given $t$ (given energy in the centre-of-mass system), the range of variation of $s$ is
For given t (given energy in the centre-of-mass system), the range of variation of s is t This takes account of the fact that the photons and the electrons have the same number of independent polarizations (two), and it is therefore immaterial which correspond to the averaging of |M/(| and which to the summation. §88 Two-photon Annihilation of an Electron Pair 369 determined by the equation of the boundary su = m4. Together with the relation s + t + u = 2m2, this gives -l2t-W[t(t-4m2)]^s-m2^-\t + W[t(t-4m2)l (88.5) The integration of (88.4) is elementary; the result must be divided by two to take account of the identity of the two final particles (the photons). Thus we have g = 2 ^ 1 ) ((T2 + T~b log VT - V(r - i) "(T + 1 ) V [ T ( T " 1)3 }> (886) where T = Ulm2 (P. A. M. Dirac, 1930). In the non-relativistic limit (r -*• 1), this gives <r = My/(T-\). (88.7) In the ultra-relativistic case (T -> <»), <r = (Wr2JT)(\og 4T - 1). (88.8) In the laboratory system, in which one particle (say the electron) is at rest before the collision, the invariant T is T = kl + 7), 7 = ejm. (88.9) Formulae (88.6)-(88.8) give as the dependence of the total cross-section on the energy of the incident positron - ^ P ^ l ° 8 [ ? + V<= , -l)]-v^}. (88.10) In particular, in the non-relativistic limitt cr = irr2elv+ (non-relativistic), (88.11) where v+ is the velocity of the positron. In the centre-of-mass system the electron, the positron and the two photons have equal energies, e = <o. The invariants are m 2 - s =2e(£ - | p | cos 0), m2-u =2e(e + |p|cos Ö), t=4e 2 , (88.12) where 6 is the angle between the momentum of the electron and that of one of the t This formula becomes inapplicable, however, when v+^a and the Coulomb interaction of the components of the pair cannot be neglected; cf. the end of §94. 370 Interaction of Electrons with Photons §88 photons. 
Substituting in (88.4), we find the angular distribution of the annihilation photons:
$$d\sigma = \frac{r_e^2m^2}{4\varepsilon|\mathbf p|}\left[\frac{\varepsilon^2+p^2(1+\sin^2\theta)}{\varepsilon^2-p^2\cos^2\theta} - \frac{2p^4\sin^4\theta}{(\varepsilon^2-p^2\cos^2\theta)^2}\right]do. \tag{88.13}$$
In the ultra-relativistic case this has symmetrical maxima in the directions $\theta = 0$ and $\theta = \pi$. Near $\theta = 0$, we have
$$d\sigma \approx \frac{r_e^2m^2}{2\varepsilon^2}\,\frac{do}{\theta^2+m^2/\varepsilon^2} \quad\text{(ultra-relativistic)}. \tag{88.14}$$
The total cross-section is obtained from (88.6):
$$\sigma = \frac{\pi r_e^2(1-v^2)}{4v}\left[\frac{3-v^4}{v}\log\frac{1+v}{1-v} - 2(2-v^2)\right], \tag{88.15}$$
where $v = |\mathbf p|/\varepsilon = \sqrt{\varepsilon^2-m^2}/\varepsilon$ is the velocity of the colliding particles.

We shall not discuss here the details of the polarization effects in annihilation,† but merely consider certain qualitative features of these effects in the limiting cases where the velocity $v$ of the colliding particles is large or small. The process will be considered in the centre-of-mass system.

In the limit $v \to 0$, only the state with orbital angular momentum of relative motion $l = 0$ gives a non-zero contribution to the cross-section. But the S state of the electron + positron system has negative parity (§27, Problem). In odd states of a two-photon system, their polarizations are orthogonal (§9). The same must therefore be true of the annihilation photons in the non-relativistic case.

If the electron and positron are polarized, their annihilation is possible (again in the non-relativistic case) only if their spins are antiparallel: since the annihilation occurs in the S state, the total angular momentum of the system is equal to the total spin of the particles, which is 1 when the spins are parallel. The two-photon system, however, has no state with total angular momentum 1 (see §9).
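The consistency of the annihilation formulae above can be checked numerically. The following sketch, in units $m = r_e = 1$, evaluates the total cross-section in the form (88.6) (with $\tau = t/4m^2 = 1/(1-v^2)$ in the centre-of-mass system) and in the form (88.15), and compares both with the angular distribution (88.13) integrated over the full solid angle and halved because the two photons are identical. It also includes the pair-production cross-section $2v^2\sigma_{\rm ann}$ derived in the Problem of this section. The function names and the midpoint integration are choices made for this illustration only.

```python
import math

def sigma_annihilation_tau(tau, r_e=1.0):
    """Total annihilation cross-section (88.6), with tau = t / (4 m^2) > 1."""
    root = math.sqrt(tau * (tau - 1.0))
    log_term = math.log((math.sqrt(tau) + math.sqrt(tau - 1.0))
                        / (math.sqrt(tau) - math.sqrt(tau - 1.0)))
    return math.pi * r_e**2 / (2.0 * tau**2 * (tau - 1.0)) * (
        (tau**2 + tau - 0.5) * log_term - (tau + 1.0) * root
    )

def sigma_annihilation_cm(v, r_e=1.0):
    """Total annihilation cross-section (88.15); v is the centre-of-mass velocity."""
    return math.pi * r_e**2 * (1.0 - v * v) / (4.0 * v) * (
        (3.0 - v**4) / v * math.log((1.0 + v) / (1.0 - v)) - 2.0 * (2.0 - v * v)
    )

def dsigma_do(v, theta, m=1.0, r_e=1.0):
    """Angular distribution (88.13) per unit solid angle."""
    eps = m / math.sqrt(1.0 - v * v)
    p = v * eps
    d = eps**2 - (p * math.cos(theta)) ** 2
    s2 = math.sin(theta) ** 2
    return r_e**2 * m**2 / (4.0 * eps * p) * (
        (eps**2 + p**2 * (1.0 + s2)) / d - 2.0 * p**4 * s2**2 / d**2
    )

def sigma_from_angular_distribution(v, n=20000):
    """Integrate (88.13) over all directions and halve (identical photons)."""
    h = math.pi / n
    total = 0.0
    for i in range(n):
        th = (i + 0.5) * h
        total += dsigma_do(v, th) * 2.0 * math.pi * math.sin(th) * h
    return 0.5 * total

def sigma_pair_production(v, r_e=1.0):
    """Two-photon pair production in the centre-of-mass system: 2 v^2 sigma_ann."""
    return 2.0 * v * v * sigma_annihilation_cm(v, r_e)
```

For small `v` the product `sigma_annihilation_cm(v) * 2 * v` tends to $\pi r_e^2$, which is the non-relativistic result (88.11) expressed through the relative velocity $v_+ = 2v$.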
In the ultra-relativistic limit ($v \to 1$), the annihilation of a longitudinally polarized (helical) electron and positron is possible only when their helicities have opposite signs.‡ In this limit, helical particles behave as neutrinos (see the end of §80), and the electron and positron undergoing annihilation must be analogous to a neutrino and an antineutrino, whence the result stated follows.

The annihilation of an electron and a positron with the same helicity occurs, in the ultra-relativistic case, only when terms containing $m$ are taken into account. The amplitude of this process differs, in order of magnitude, by a factor $m/\varepsilon$ from that of the annihilation of a pair with parallel spins; the cross-section accordingly differs by a factor $(m/\varepsilon)^2$.

† See W. H. McMaster, Reviews of Modern Physics 33, 8, 1961.

‡ Since the directions of the particle momenta are also opposite (in the centre-of-mass system), helicities of opposite sign correspond to parallel spins.

PROBLEM

Find the cross-section for the formation of an electron pair in the collision of two photons (G. Breit and J. A. Wheeler, 1934).

SOLUTION. This is the process inverse to the two-photon annihilation of an electron pair. The squared amplitudes are the same for the two processes, and their relationship to the cross-section differs only in that here $I^2 = (k_1k_2)^2 = \tfrac14t^2$. Hence
$$d\sigma_{\rm form} = d\sigma_{\rm ann}\,\frac{t-4m^2}{t}.$$
In the centre-of-mass system ($t = 4\varepsilon^2 = 4\omega^2$),
$$d\sigma_{\rm form} = v^2\,d\sigma_{\rm ann},$$
where $v$ is the velocity of the components of the pair. In integrating to obtain the total cross-section, the result is not to be divided by 2 (as in the case of annihilation), because the two final particles (electron and positron) are not identical. Hence, in the centre-of-mass system,
$$\sigma_{\rm form} = 2v^2\sigma_{\rm ann} = \tfrac12\pi r_e^2(1-v^2)\left\{(3-v^4)\log\frac{1+v}{1-v} - 2v(2-v^2)\right\}.$$
(1)

In an arbitrary frame of reference $K$, in which the two photons $k_1$ and $k_2$ are moving in opposite directions, we have (from the invariance of $k_1k_2$)
$$\omega_1\omega_2 = \omega^2,$$
where $\omega$ is the energy of the photons in the centre-of-mass system. Since this energy is equal to that of the pair components, we have $\omega = \varepsilon = m/\sqrt{1-v^2}$. To change to the frame $K$, we must therefore put in (1)
$$v = \sqrt{1 - m^2/\omega_1\omega_2}.$$

§89. Annihilation of positronium

Owing to the conservation of momentum, the annihilation of the electron and positron in positronium must be accompanied by the emission of at least two photons. Such a decay is possible (in the ground state), however, only for parapositronium. In §9 we have shown that the total angular momentum of a two-photon system cannot be 1. Hence orthopositronium in the $^3S_1$ state cannot decay into two photons. Moreover, since positronium in the $^3S_1$ state is a charge-odd system (see §27, Problem), Furry's theorem (§79) shows that it cannot decay into any even number of photons. In the $^1S_0$ state, on the other hand, positronium is charge-even, and the decay of parapositronium into any odd number of photons is therefore forbidden. The main process which determines the lifetime of positronium is therefore two-photon annihilation for parapositronium and three-photon annihilation for orthopositronium (I. Ya. Pomeranchuk, 1948).

The decay probability can be related to the cross-section for annihilation of a free pair. The electron and positron momenta in positronium are $\sim me^2/\hbar$, i.e. small compared with $mc$. Hence, in calculating the probability of annihilation, we can take the limit of two particles at rest at the origin. Let $\bar\sigma_{2\gamma}$ be the cross-section for two-photon annihilation of a free pair, averaged over the spin directions of both particles. In the non-relativistic limit, according to (88.11),†
$$\bar\sigma_{2\gamma} = \pi(e^2/mc^2)^2\,c/v, \tag{89.1}$$
where $v$ is the relative velocity of the particles.
† Formulae (89.1) to (89.7) are written in ordinary units.

The annihilation probability $\bar w_{2\gamma}$ is obtained on multiplying $\bar\sigma_{2\gamma}$ by the flux density $v|\psi(0)|^2$. Here $\psi(r)$ is the wave function, normalized to unity, of the positronium ground state:
$$\psi(r) = \frac{1}{\sqrt{\pi a^3}}\,e^{-r/a}, \qquad a = 2\hbar^2/me^2; \tag{89.2}$$
the Bohr radius $a$ for positronium is twice that for the hydrogen atom, because its reduced mass is half as great. This probability, however, corresponds to the initial state averaged over spins, whereas in positronium, of the four possible spin states of a two-particle system, only one (with total spin 0) can undergo two-photon annihilation. Hence the mean decay probability $\bar w_{2\gamma}$ is related to the parapositronium decay probability $w_0$ by $\bar w_{2\gamma} = \tfrac14w_0$, and so
$$w_0 = 4|\psi(0)|^2\,(v\bar\sigma_{2\gamma})_{v\to0}. \tag{89.3}$$
Substituting the values from (89.1), (89.2), we obtain for the lifetime of parapositronium
$$\tau_0 = \frac{1}{w_0} = \frac{2\hbar}{mc^2\alpha^5} = 1.23\times10^{-10}\ \text{sec}. \tag{89.4}$$
It should be noticed that the level width $\Gamma_0 = \hbar/\tau_0$ is small compared with the level energy $|E_{\rm gr}| = me^4/4\hbar^2 = mc^2\alpha^2/4$. For this reason positronium may be regarded as a system in a quasi-stationary state.

Similarly we find that the decay probability for orthopositronium is related to the spin-averaged cross-section for three-photon annihilation of a free pair by
$$w_1 = \tfrac43|\psi(0)|^2\,(v\bar\sigma_{3\gamma})_{v\to0}, \tag{89.5}$$
the statistical weight of a state with spin 1 being $\tfrac34$. Anticipating, we may mention that
$$\bar\sigma_{3\gamma} = \frac{4(\pi^2-9)}{3}\,\frac{c}{v}\,\alpha\left(\frac{e^2}{mc^2}\right)^2. \tag{89.6}$$
The lifetime of orthopositronium is therefore
$$\tau_1 = \frac{9\pi}{2(\pi^2-9)}\,\frac{\hbar}{mc^2\alpha^6} = 1.4\times10^{-7}\ \text{sec}. \tag{89.7}$$
The inequality $\Gamma_1 \ll |E_{\rm gr}|$ is here, of course, satisfied even more markedly than for parapositronium.

Let us now calculate the cross-section for three-photon annihilation of a free pair (A. Ore and J. L. Powell, 1949).
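The lifetime estimates (89.4) and (89.7) quoted above are easy to reproduce numerically. The sketch below is only an illustration: it evaluates both closed forms with present-day values of the constants (the numerical constants and all function names are assumptions of this example, not taken from the text), and repeats the derivation of $w_0$ step by step from (89.1) to (89.3), using $\hbar^2/me^2 = \hbar/(mc\alpha)$ and $e^2/mc^2 = \alpha\hbar/(mc)$.

```python
import math

# Constants in SI units (assumed present-day values, used only for this example).
HBAR = 1.054571817e-34     # J s
ME_C2 = 8.1871057769e-14   # electron rest energy, J
C = 299792458.0            # m / s
ALPHA = 7.2973525693e-3    # fine-structure constant

def tau_para():
    """Parapositronium lifetime, closed form (89.4)."""
    return 2.0 * HBAR / (ME_C2 * ALPHA**5)

def tau_ortho():
    """Orthopositronium lifetime, closed form (89.7)."""
    return 9.0 * math.pi * HBAR / (2.0 * (math.pi**2 - 9.0) * ME_C2 * ALPHA**6)

def tau_para_from_wavefunction():
    """Repeat the steps (89.1)-(89.3): w0 = 4 |psi(0)|^2 (v sigma_2gamma)."""
    compton = HBAR * C / ME_C2           # hbar / (m c), in metres
    a = 2.0 * compton / ALPHA            # positronium Bohr radius, (89.2)
    r_e = ALPHA * compton                # e^2 / (m c^2)
    psi0_sq = 1.0 / (math.pi * a**3)     # |psi(0)|^2 from (89.2)
    v_sigma = math.pi * r_e**2 * C       # v * sigma_2gamma from (89.1)
    w0 = 4.0 * psi0_sq * v_sigma         # (89.3)
    return 1.0 / w0

if __name__ == "__main__":
    print("tau_0 =", tau_para(), "s")
    print("tau_1 =", tau_ortho(), "s")
```

With these constants the two lifetimes come out at roughly $1.2\times10^{-10}$ s and $1.4\times10^{-7}$ s, and their ratio is $9\pi/[4(\pi^2-9)\alpha]$, of order $10^3$.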
According to (64.18), the cross-section in the centre-of-mass system is expressed in terms of the squared amplitude by
$$d\bar\sigma_{3\gamma} = (2\pi)^4\,\frac14\sum_{\rm polar.}|M_{fi}|^2\,\frac{1}{4I}\,\delta(\mathbf k_1+\mathbf k_2+\mathbf k_3)\,\delta(\omega_1+\omega_2+\omega_3-2m)\,\frac{d^3k_1\,d^3k_2\,d^3k_3}{(2\pi)^9\,2\omega_1\cdot2\omega_2\cdot2\omega_3}, \tag{89.8}$$
where, according to (64.16), $I = 2m\cdot\tfrac12mv = m^2v$, $v$ being the relative velocity (assumed small) of the positron and the electron; $\mathbf k_1$, $\mathbf k_2$, $\mathbf k_3$ and $\omega_1$, $\omega_2$, $\omega_3$ are the wave vectors and frequencies of the photons formed; the delta functions express the laws of conservation of energy and momentum. Because of these laws, the three frequencies $\omega_1$, $\omega_2$, $\omega_3$ must be represented by the lengths of the sides of a triangle with perimeter $2m$. Thus the magnitudes of the momenta $\mathbf k_1$, $\mathbf k_2$, $\mathbf k_3$ and the angles between them are entirely determined by specifying two frequencies.

The three-photon annihilation corresponds to the diagram

[Feynman diagram: electron line $p_- \to -p_+$ emitting the three photons $k_1$, $k_2$, $k_3$]

and a further five diagrams obtained from it by interchanging the photons $k_1$, $k_2$, $k_3$. The amplitude may be written
$$M_{fi} = (4\pi)^{3/2}e^3\,e^{(3)*}_\lambda e^{(2)*}_\mu e^{(1)*}_\nu\,\bar u(-p_+)\,Q^{\lambda\mu\nu}u(p_-), \tag{89.9}$$
where
$$Q^{\lambda\mu\nu} = \sum_{\rm int.}\gamma^\lambda G(k_3-p_+)\gamma^\mu G(p_--k_1)\gamma^\nu, \tag{89.10}$$
the sum being taken over all interchanges of the photon numbers 1, 2, 3 together with corresponding simultaneous interchanges of the tensor indices $\lambda$, $\mu$, $\nu$.

The squared modulus of the amplitude, averaged over the polarizations of the electron and the positron and summed over those of the photons, is
$$\frac14\sum_{\rm polar.}|M_{fi}|^2 = (4\pi)^3e^6\,\mathrm{tr}\{\rho_+Q^{\lambda\mu\nu}\rho_-\bar Q_{\lambda\mu\nu}\}, \tag{89.11}$$
where
$$\rho_- = \tfrac12(\gamma p_- + m), \qquad \rho_+ = \tfrac12(\gamma p_+ - m).$$
The matrices $\bar Q^{\lambda\mu\nu}$ differ from the matrices $Q^{\lambda\mu\nu}$ in that the order of the factors is reversed in each term of the sum. In the limiting case considered, where the electron and positron velocities are small, their 3-momenta $\mathbf p_-$ and $\mathbf p_+$ may be taken as zero, putting $p_- = p_+ = (m, 0)$. Then the electron Green's functions are
$$G(p_--k_1) = \frac{\gamma p_- - \gamma k_1 + m}{(p_--k_1)^2-m^2} = -\frac{\gamma(p_--k_1)+m}{2m\omega_1}, \quad\text{etc.},$$
and the density matrices reduce to
$$\rho_\mp = \tfrac12m(\gamma^0\pm1).$$
A large number of terms arise on carrying out the multiplication in (89.11), but the number that need to be calculated can be greatly reduced by making full use of the symmetry with respect to interchanges of photons. For example, it is sufficient to multiply out the six terms in $Q^{\lambda\mu\nu}$ (89.10) each with only one term in $\bar Q_{\lambda\mu\nu}$. In the six traces then remaining, we can again select certain parts which are transformed into one another by various interchanges of photons.

The products of the 4-vectors $p, k_1, k_2, k_3$ which occur when the traces are expanded can all be expressed in terms of the frequencies $\omega_1, \omega_2, \omega_3$. Since $p = (m, 0)$, we have $pk_1 = m\omega_1$, .... The products $k_1k_2$, ... are determined from the equation of conservation of 4-momentum: $2p = k_1 + k_2 + k_3$; for example, writing this equation in the form $2p - k_3 = k_1 + k_2$ and squaring, we have

$$k_1k_2 = 2m(m-\omega_3),\ \ldots. \tag{89.12}$$

The result of the calculation, which is still fairly lengthy, is

$$\tfrac14\sum_{\rm polar.}|M_{fi}|^2 = (4\pi)^3e^6\cdot16\left[\left(\frac{m-\omega_1}{\omega_2\omega_3}\right)^2+\left(\frac{m-\omega_2}{\omega_1\omega_3}\right)^2+\left(\frac{m-\omega_3}{\omega_1\omega_2}\right)^2\right].$$

Substituting this expression in (89.8), we obtain the differential cross-section for three-photon annihilation:

$$d\bar\sigma_{3\gamma} = \frac{e^6}{\pi^2m^2v}\left[\left(\frac{m-\omega_1}{\omega_2\omega_3}\right)^2+\left(\frac{m-\omega_2}{\omega_1\omega_3}\right)^2+\left(\frac{m-\omega_3}{\omega_1\omega_2}\right)^2\right]\delta(\mathbf k_1+\mathbf k_2+\mathbf k_3)\,\delta(\omega_1+\omega_2+\omega_3-2m)\,\frac{d^3k_1\,d^3k_2\,d^3k_3}{\omega_1\omega_2\omega_3}. \tag{89.13}$$

The delta functions have still to be eliminated. The first is removed by integrating over $d^3k_3$, and we then write

$$d^3k_1\,d^3k_2 \to 4\pi\omega_1^2\,d\omega_1\cdot2\pi\omega_2^2\,d(\cos\theta_{12})\,d\omega_2,$$

where $\theta_{12}$ is the angle between $\mathbf k_1$ and $\mathbf k_2$; it is assumed that the integration has already been performed over the directions of $\mathbf k_1$ and the azimuth of $\mathbf k_2$ relative to $\mathbf k_1$. Differentiating the equation

$$\omega_3 = \sqrt{\omega_1^2+\omega_2^2+2\omega_1\omega_2\cos\theta_{12}},$$

we find $d\cos\theta_{12} = (\omega_3/\omega_1\omega_2)\,d\omega_3$. The second delta function is removed by integrating over $d\omega_3$.
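Once the delta functions are removed, only integrals over the frequencies remain, and two standard results for them can be checked numerically: the integral of the single-photon spectral function $F(\omega_1)$ over $0 < \omega_1 < m$, and the double integral that enters the total cross-section, quoted in this section as $(\pi^2-9)/3$. The sketch below works in units $m = 1$; the midpoint rule, the function names, and the closed form used for the inner integral are assumptions introduced here for illustration.

```python
import math


def spectral_function(x):
    """Three-photon spectral function F with x = omega_1/m, 0 < x < 1."""
    log_term = math.log1p(-x)  # log((m - omega_1)/m)
    return (x * (1.0 - x) / (2.0 - x) ** 2
            + (2.0 - x) / x
            + (2.0 * (1.0 - x) / x ** 2
               - 2.0 * (1.0 - x) ** 2 / (2.0 - x) ** 3) * log_term)


def inner_integral(w1):
    """Integral over w2 from 1-w1 to 1 of (w1+w2-1)^2/(w1*w2)^2 (closed form)."""
    c = 1.0 - w1
    return (1.0 - c * c + 2.0 * c * math.log(c)) / (w1 * w1)


def midpoint(f, a, b, n=20000):
    """Composite midpoint rule; it never evaluates f at the end points."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


if __name__ == "__main__":
    print(midpoint(spectral_function, 0.0, 1.0), (math.pi**2 - 9.0) / 2.0)
    print(midpoint(inner_integral, 0.0, 1.0), (math.pi**2 - 9.0) / 3.0)
```

The two numbers differ by the factor 3/2, as expected if the three terms of the squared amplitude contribute equally to the total cross-section.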
The resulting cross-section for annihilation with formation of photons having specified energies is

$$d\bar\sigma_{3\gamma} = \frac16\,\frac{8e^6}{vm^2}\left\{\left(\frac{m-\omega_1}{\omega_2\omega_3}\right)^2+\left(\frac{m-\omega_2}{\omega_1\omega_3}\right)^2+\left(\frac{m-\omega_3}{\omega_1\omega_2}\right)^2\right\}d\omega_1\,d\omega_2; \tag{89.14}$$

the factor 1/6 has been included in order to take account of the identity of the photons in the subsequent integration over frequencies (cf. the third footnote to §64). Each of the frequencies $\omega_1, \omega_2, \omega_3$ can take values between 0 and $m$; the latter can be reached by two frequencies when the third is zero.

For given $\omega_1$, the frequency $\omega_2$ varies between $m-\omega_1$ and $m$. Integrating (89.14) over $d\omega_2$ between these limits, we obtain the spectral distribution of decay photons:

$$d\bar\sigma_{3\gamma} = (8e^6/3vm^3)\,F(\omega_1)\,d\omega_1,$$

$$F(\omega_1) = \frac{\omega_1(m-\omega_1)}{(2m-\omega_1)^2}+\frac{2m-\omega_1}{\omega_1}+\left[\frac{2m(m-\omega_1)}{\omega_1^2}-\frac{2m(m-\omega_1)^2}{(2m-\omega_1)^3}\right]\log\frac{m-\omega_1}{m}.$$

The function $F(\omega_1)$ increases monotonically from zero when $\omega_1 = 0$ to unity when $\omega_1 = m$, and is shown graphically in Fig. 14.

FIG. 14. The function $F(\omega_1)$.

The total annihilation cross-section is obtained by integrating (89.14) over both frequencies:

$$\bar\sigma_{3\gamma} = \frac{4e^6}{vm^2}\int_0^m\!\int_{m-\omega_1}^m\frac{(\omega_1+\omega_2-m)^2}{\omega_1^2\omega_2^2}\,d\omega_1\,d\omega_2.$$

The value of the integral is $(\pi^2-9)/3$, and we thus return to formula (89.6).

§90. Synchrotron radiation

According to the classical theory (Fields, §74), an ultra-relativistic electron moving in a constant magnetic field $H$ emits a quasi-continuous spectrum with a maximum at the frequency

$$\omega \sim \omega_0(\varepsilon/m)^3, \tag{90.1}$$

where

$$\omega_0 = v|e|H/|\mathbf p| \approx |e|H/\varepsilon \tag{90.2}$$

is the frequency of revolution of an electron having energy $\varepsilon$ in a circular orbit (in a plane perpendicular to the field).† We shall assume that the longitudinal velocity of the electron (parallel to $\mathbf H$) is zero, as can always be achieved by a suitable choice of the frame of reference.

Quantum effects in synchrotron radiation originate in two ways: from the quantization of the motion of the electron, and from the quantum recoil when a photon is emitted.
The latter is determined by the ratio $\hbar\omega/\varepsilon$, and this must be small if the classical theory is applicable. It is therefore convenient to use the parameter

$$\chi = \frac{H}{H_0}\frac{|\mathbf p|}{m} \approx \frac{H}{H_0}\frac{\varepsilon}{m}, \tag{90.3}$$

where $H_0 = m^2/|e|\hbar\ (= m^2c^3/|e|\hbar) = 4.4\times10^{13}$ G. In the classical case, $\hbar\omega/\varepsilon \sim \chi \ll 1$. In the opposite limit ($\chi \gg 1$), the energy of the emitted photon $\hbar\omega \sim \varepsilon$, and (as we shall see below) the significant region of the spectrum extends to frequencies at which the electron energy after the emission is

$$\varepsilon' \sim mH_0/H. \tag{90.4}$$

If the electron remains ultra-relativistic, the field must satisfy the condition

$$H/H_0 \ll 1. \tag{90.5}$$

The quantization of the electron motion itself is expressed by the ratio $\hbar\omega_0/\varepsilon$; $\hbar\omega_0$ is the interval between adjacent energy levels for motion in a magnetic field. Since $\hbar\omega_0/\varepsilon = (H/H_0)(m/\varepsilon)^2$, it follows from (90.5) that $\hbar\omega_0 \ll \varepsilon$, i.e. the motion of the electron is quasi-classical for all values of $\chi$. That is, the non-commutativity between the operators of dynamical variables of the electron (quantities of order $\hbar\omega_0/\varepsilon$) may be neglected, while the non-commutativity of these operators with those of the photon field (quantities of order $\hbar\omega/\varepsilon$) is not neglected.†

† In this section we shall put $c = 1$ but retain factors of $\hbar$.

The quasi-classical wave functions of stationary states of an electron in an external field can be put in the symbolic form

$$\psi = \frac{1}{\sqrt{2\hat H}}\,u(\hat{\mathbf p})\,e^{-i\hat Ht/\hbar}\,\phi(\mathbf r), \tag{90.6}$$

where $\phi(\mathbf r) \sim \exp(iS/\hbar)$ are the quasi-classical wave functions of a spinless particle ($S(\mathbf r)$ being its classical action); $u(\hat{\mathbf p})$ is the operator bispinor

$$u(\hat{\mathbf p}) = \begin{pmatrix}\sqrt{\hat H+m}\;w\\[2pt] (\hat H+m)^{-1/2}\,(\boldsymbol\sigma\cdot\hat{\mathbf p})\,w\end{pmatrix},$$

obtained from the bispinor plane-wave amplitude $u(p)$ (23.9) on replacing $\mathbf p$ and $\varepsilon$ by the operators‡

$$\hat{\mathbf p} = \hat{\mathbf P} - e\mathbf A = -i\hbar\nabla - e\mathbf A,\qquad \hat H = \sqrt{\hat{\mathbf p}^2+m^2},$$

where $\hat{\mathbf P}$ is the generalized momentum of the particle in a field with vector potential $\mathbf A(\mathbf r)$. The order of the operator factors in $\psi$ is immaterial, since their non-commutativity is neglected, and the spin state of the electron is determined by the three-dimensional spinor $w$.
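The orders of magnitude in this discussion can be made concrete with a short numerical sketch. It evaluates the critical field $H_0 = m^2c^3/|e|\hbar$ in Gaussian units and then the two small parameters, taking $\chi \approx (H/H_0)(\varepsilon/m)$ for an ultra-relativistic electron, the form consistent with $\hbar\omega_0/\varepsilon = (H/H_0)(m/\varepsilon)^2$. The constant values, the function names, and the sample field and energy are assumptions chosen for illustration only.

```python
# Gaussian (CGS) constants, entered here as assumptions.
M_E = 9.1093837e-28      # electron mass, g
C = 2.99792458e10        # speed of light, cm/s
E_CHARGE = 4.80320471e-10  # elementary charge, esu
HBAR = 1.054571817e-27   # reduced Planck constant, erg*s

# Critical field H_0 = m^2 c^3 / (|e| hbar), in gauss.
H0 = M_E**2 * C**3 / (E_CHARGE * HBAR)


def chi(field_gauss, gamma):
    """Quantum recoil parameter chi ~ (H/H0)*(eps/m), with gamma = eps/(m c^2)."""
    return field_gauss / H0 * gamma


def level_spacing_ratio(field_gauss, gamma):
    """hbar*omega_0/eps = (H/H0)*(m/eps)^2, the quantization parameter."""
    return field_gauss / H0 / gamma**2


if __name__ == "__main__":
    print("H0 [G]:", H0)
    # Hypothetical example: a 10 kG magnet and an electron with gamma = 1e4.
    print("chi:", chi(1.0e4, 1.0e4))
    print("hbar*omega_0/eps:", level_spacing_ratio(1.0e4, 1.0e4))
```

The ratio of the two parameters is $(\varepsilon/m)^3$, so the motion itself stays quasi-classical long after the recoil parameter has ceased to be small.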
In order to calculate the probability of photon emission in the quasi-classical case, it is more convenient to start not from the final formula (44.3) of perturbation theory but from a formula in which the integration with respect to time has not yet been carried out. For the total (over all time) differential probability we have§

$$dw = \sum_f|a_{fi}|^2\,\frac{d^3k}{(2\pi)^3},\qquad a_{fi} = \int_{-\infty}^{\infty}V_{fi}(t)\,dt \tag{90.7}$$

(cf. QM, (41.2)); the summation is over final states of the electron.

† The full solution of the quantum problem of synchrotron radiation was first given by N. P. Klepikov (1954), and the first quantum correction to the classical formula by A. A. Sokolov, N. P. Klepikov and I. M. Ternov (1952). The derivation given here, which explicitly makes use of the fact that the motion is quasi-classical, is due to V. N. Baĭer and V. M. Katkov (1967). A similar method had been used earlier by J. Schwinger (1954) to derive the first quantum correction in the radiation intensity.

‡ In this section, unlike Chapter IV, the generalized momentum is denoted by the capital letter $\mathbf P$, while $\mathbf p$ denotes the ordinary (kinetic) momentum.

§ Putting $V_{fi}(t) = V_{fi}\exp(i\omega_{fi}t)$, we find $a_{fi} = 2\pi V_{fi}\,\delta(\omega_{fi})$, and, since the squared delta function is to be taken as $[\delta(\omega)]^2 \to (t/2\pi)\delta(\omega)$, where $t$ is the total observation time (cf. the derivation of (64.5)), we obtain from (90.7) the formula (44.3) for the probability per unit time.

Using (90.6), we can write the matrix element $V_{fi}(t)$ for emission of a photon $\omega, \mathbf k$ in the operator form

$$V_{fi}(t) = e\sqrt{\frac{2\pi}{\hbar\omega}}\int\phi_f^*\left[\frac{u_f^\dagger(\hat{\mathbf p})}{\sqrt{2\hat H}}\,e^{i\hat Ht/\hbar}\right](\boldsymbol\alpha\cdot\mathbf e^*)\,e^{-i\mathbf k\cdot\mathbf r}\,e^{-i\hat Ht/\hbar}\,\frac{u_i(\hat{\mathbf p})}{\sqrt{2\hat H}}\,\phi_i\,d^3x\cdot e^{i\omega t},$$

where the operators in the square brackets act to the left; the photon field is taken in the three-dimensionally transverse gauge. The factors $\exp(\pm i\hat Ht/\hbar)$ convert the Schrödinger operators between them into explicitly time-dependent operators of the Heisenberg representation. We can write $V_{fi}(t)$ in the form

$$V_{fi}(t) = e\sqrt{\frac{2\pi}{\hbar\omega}}\,\langle f|Q(t)|i\rangle\,e^{i\omega t},$$

where $Q(t)$ denotes the Heisenberg operator

$$Q(t) = \frac{u_f^\dagger(\hat{\mathbf p})}{\sqrt{2\hat H}}\,(\boldsymbol\alpha\cdot\mathbf e^*)\,e^{-i\mathbf k\cdot\hat{\mathbf r}(t)}\,\frac{u_i(\hat{\mathbf p})}{\sqrt{2\hat H}}, \tag{90.8}$$

and the matrix element is taken with respect to the functions $\phi_f$, $\phi_i$.

The summation in (90.7) is taken over all final wave functions $\phi_f$, and is effected by means of the equation

$$\sum_f\phi_f^*(\mathbf r')\,\phi_f(\mathbf r) = \delta(\mathbf r'-\mathbf r),$$

which expresses the completeness of the set of functions $\phi_f$. The result is

$$dw = \frac{e^2}{\hbar\omega}\,\frac{d^3k}{4\pi^2}\int dt_1\int dt_2\;e^{i\omega(t_1-t_2)}\,\langle i|Q^\dagger(t_2)\,Q(t_1)|i\rangle. \tag{90.9}$$

If the integration is over a sufficiently long time interval, $t_1$ and $t_2$ can be replaced by new variables $\tau = t_2-t_1$, $t = \tfrac12(t_1+t_2)$, and in the integral over $t$ the integrand may be regarded as the probability of emission per unit time. Multiplying by $\hbar\omega$, we obtain the intensity

$$dI = \frac{e^2}{4\pi^2}\,d^3k\int e^{-i\omega\tau}\,\langle i|Q^\dagger(t+\tfrac12\tau)\,Q(t-\tfrac12\tau)|i\rangle\,d\tau. \tag{90.10}$$

An ultra-relativistic electron radiates into a narrow cone at angles $\theta \sim m/\varepsilon$ relative to its velocity $\mathbf v$. The emission in a given direction $\mathbf n = \mathbf k/\omega$ therefore occurs over a section of the path in which $\mathbf v$ turns through an angle $\sim m/\varepsilon$. This section is traversed in a time $\tau$ such that $\tau|\dot{\mathbf v}| \sim \tau\omega_0 \sim m/\varepsilon \ll 1$. This region gives the principal contribution to the integral over $\tau$. In the subsequent calculations, we shall therefore expand all quantities in powers of $\omega_0\tau$. It may, however, be necessary to retain more than just the leading term in the expansion, because of cancellations which occur since $1 - \mathbf n\cdot\mathbf v \sim \theta^2 \sim (m/\varepsilon)^2$.

If the operator $Q^\dagger Q$ is reduced to a product of operators which commute (to the necessary degree of accuracy), the taking of the diagonal matrix element $\langle i|\ldots|i\rangle$ is equivalent to replacing these operators by the classical (time-dependent) values of the corresponding quantities. This is achieved in the following way.

According to the foregoing discussion, in the expression for $Q(t)$ only the non-commutativity of the electron operators with the photon field operator $\exp(-i\mathbf k\cdot\hat{\mathbf r}(t))$ need be taken into account. We have

$$\hat{\mathbf p}\,e^{-i\mathbf k\cdot\mathbf r} = e^{-i\mathbf k\cdot\mathbf r}(\hat{\mathbf p}-\hbar\mathbf k),\qquad H(\hat{\mathbf p})\,e^{-i\mathbf k\cdot\mathbf r} = e^{-i\mathbf k\cdot\mathbf r}\,H(\hat{\mathbf p}-\hbar\mathbf k). \tag{90.11}$$

These formulae follow because $e^{-i\mathbf k\cdot\mathbf r}$ is the displacement operator in momentum space. Using (90.11), we can take the operator $e^{-i\mathbf k\cdot\hat{\mathbf r}(t)}$ out on the left in (90.8), and write $Q(t)$ in the form

$$Q(t) = e^{-i\mathbf k\cdot\hat{\mathbf r}(t)}\,R(t),\qquad R(t) = \frac{u_f^\dagger(\hat{\mathbf p}')}{\sqrt{2\hat H'}}\,(\boldsymbol\alpha\cdot\mathbf e^*)\,\frac{u_i(\hat{\mathbf p})}{\sqrt{2\hat H}}, \tag{90.12}$$

where $\hat H' = \hat H-\hbar\omega$, $\hat{\mathbf p}' = \hat{\mathbf p}-\hbar\mathbf k$. Then

$$Q_2^\dagger Q_1 = R_2^\dagger\,e^{i\mathbf k\cdot\hat{\mathbf r}_2}\,e^{-i\mathbf k\cdot\hat{\mathbf r}_1}\,R_1; \tag{90.13}$$

here and henceforward, the suffixes 1 and 2 denote the values of quantities at the times $t_1 = t-\tfrac12\tau$ and $t_2 = t+\tfrac12\tau$.

It remains to calculate the product of the two non-commuting operators $e^{i\mathbf k\cdot\hat{\mathbf r}_2}$ and $e^{-i\mathbf k\cdot\hat{\mathbf r}_1}$. This product itself may be regarded as commuting with the remaining factors. We write

$$L(\tau) = e^{-i\omega\tau}\,e^{i\mathbf k\cdot\hat{\mathbf r}_2}\,e^{-i\mathbf k\cdot\hat{\mathbf r}_1}, \tag{90.14}$$

this being the combination of operators which appears in (90.10). The operator $e^{i\hat H\tau/\hbar}$ is a time-shift operator, and so

$$e^{i\mathbf k\cdot\hat{\mathbf r}_2} = e^{i\hat H\tau/\hbar}\,e^{i\mathbf k\cdot\hat{\mathbf r}_1}\,e^{-i\hat H\tau/\hbar}.$$

Substituting this in (90.14) and noting that $e^{i\mathbf k\cdot\hat{\mathbf r}_1}$ is a displacement operator in momentum space, we find

$$L(\tau) = \exp\{i(\hat H-\hbar\omega)\tau/\hbar\}\,\exp\{-iH(\hat{\mathbf p}_1-\hbar\mathbf k)\tau/\hbar\}. \tag{90.15}$$

Differentiating (90.15) with respect to $\tau$ and again using the properties of the time-shift operator, we have†

$$\frac{dL}{d\tau} = \frac{i}{\hbar}\exp\{i(\hat H-\hbar\omega)\tau/\hbar\}\,[\hat H-\hbar\omega-H(\hat{\mathbf p}_1-\hbar\mathbf k)]\,\exp\{-iH(\hat{\mathbf p}_1-\hbar\mathbf k)\tau/\hbar\} = \frac{i}{\hbar}\,[\hat H-\hbar\omega-H(\hat{\mathbf p}_2-\hbar\mathbf k)]\,L(\tau). \tag{90.16}$$

Having thus made use of the non-commutativity of the operators, we can replace all the operators by the corresponding classical quantities (the Hamiltonian $\hat H$ by the electron energy $\varepsilon$). We have identically

$$\varepsilon(\mathbf p_2-\hbar\mathbf k) = [(\mathbf p_2-\hbar\mathbf k)^2+m^2]^{1/2} = [(\varepsilon-\hbar\omega)^2+2\hbar(\omega\varepsilon-\mathbf k\cdot\mathbf p_2)]^{1/2}.$$

The difference $\omega\varepsilon-\mathbf k\cdot\mathbf p_2 = \omega\varepsilon(1-\mathbf n\cdot\mathbf v_2)$ is small, since from the above analysis $1-\mathbf v\cdot\mathbf n \sim (m/\varepsilon)^2$. As far as the first order in this difference,

$$\varepsilon(\mathbf p_2-\hbar\mathbf k) \approx \varepsilon'+(\varepsilon/\varepsilon')\,\hbar(\omega-\mathbf k\cdot\mathbf v_2),$$

where $\varepsilon' = \varepsilon-\hbar\omega$. From (90.16), we now find the differential equation for $L(\tau)$:

$$i\hbar\,\frac{dL}{d\tau} = (\varepsilon/\varepsilon')\,\hbar(\omega-\mathbf k\cdot\mathbf v_2)\,L. \tag{90.17}$$

This equation is to be solved with the obvious initial condition $L(0) = 1$.
Since T o v2dT = r 2 -ri, we have L(T) = exp{i(e/e')(k • r 2 -k • r, - WT)}. (90.18) So far, no use has been made of the specific form of the electron trajectory. Now expressing r2-ri in (90.18) in terms of pi by means of the equation of motion of the electron in the plane perpendicular to the field H (see Fields, §21): eHT\ Pi . eHr , pi x H /, r ; -r, = JLs.n — + ^ ( l - c o s — ) , and expanding in powers of T gives k • ( r j - r , ) - Û>T « «r|(n • v , - 1) + T £HJJL£Ü- T 2 ^}, (90.19) where in the last term we have put n • vi *» 1. t Because of the conservation of energy, the Heisenberg operators H(pi) and Ufa) are the same; we therefore omit the arguments of H in such cases. However, H (pi - hk) is of course not the same as Ä(fr-Äk). Synchrotron Radiation §90 381 We next transform the remaining factors in (90.13). A direct expansion of the product in R(t), using the matrix a from (21.20), leads to' R(t) = w*fe* • (A + iB x <r)Wi, . - , / l 1\ e + e' \e + m e +m/ (90.20) le where p'(0 = p(0~^k; terms of higher order in m/e are omitted. Thus we have finally (90.21) RfJRj = tr |(1 + Ç, • <r)(A2 - ÎB2 x a) • e • |(1 + & • a)(A, + iB, x a) • e*. The factors Kl + Ç' or) are two-rowed polarization density matrices of the initial and final electron. Let us consider the radiation intensity summed over the polarizations of the photon and of the final electron, and averaged over the polarizations of the initial electron. These operations give, after a simple calculation,! U««-^(—>*^),(7)' With sufficient accuracy we can put Vi • V2 = V — JTV + JTV • V 1 = 1 m 1 2 2 f-2<0oT . Substitution of these expressions in (90.21) and thence in (90.10) gives e2 (11 = — -A—5 o) 2 day don x 00 x J (^ + £ ^ < "« T J ) e "p{- ! T 1 ('- n - v + ^»)} d T - —00 O0-22' t This calculation makes use also of the following result. In the summation over e, 2 (vi ' *)(V2 • e*) = vi • V2 - (vi • n)(v2 • n). 
c On substituting (90.21) in (90.10), we can integrate by parts, noting that (v1.n)exp(-ii7k.r1) = ^ ^ e x p ( - , f 7 k . r 1 ) , and similarly for V2 • n. Consequently, in the remaining integration vi • n and v2 • n can be replaced by unity. §90 Interaction of Electrons with Photons 382 This formula shows the frequency and angular distribution of the radiation intensity. To find the frequency distribution, we integrate over don. If the direction of v is taken as the polar axis, with an angle # between n and v, then n • v = v cos #, don = sin d dd d<f>, and f ficure ) , lire' /ia)Ts\ f / io>Te\) fexp(—) - exp(—-)j. J e x p ( — n • T ) do. = ^ When this is substituted in (90.22), only the first term need be retained, since the second term yields a faster varying exponential (with a factor 1 + v « 2 instead of the small 1 - v = m2/2e2). Hence dl ie2<o f (m1 , e2+e'2 f icrre /, \ , T2 2\1 . According to the integral representation of the Airy function 4> (see QM, §b), the first term reduces to the integral of the Airy function, and the second term to its derivative. The final result is X x = ( W e » 2 ' 3 = (m 2 /e 2 )(ecu/eW 3 (90.24) (A. I. Nikishov and V. I. Ritus, 1967). The frequency distribution has a maximum when x ~ l ; for ^ < 1 we find (90.1), and for x > l (90.4). In the classical limit, ho) ^ e' = e, x = ((olü)Q?(mle)2; the second term in the round brackets is small, and (90.23) becomes the classical formula (Fields, (74.13)). Figure 15 shows diagrams of the frequency distribution for various values of xThe quantity 1 dl 3/cl/2d(ü>/o)c) is plotted against ct>/a>c, where The quantity Jci is the classical value of the total radiation intensity; cf. Fields, (74.2). §90 383 Synchrotron Radiation To calculate the total radiation intensity, (90.23) must be integrated with respect to to from 0 to e. We change to integration with respect to x, noting that -'('"ï+M' ha) and x therefore varies from 0 to °°. 
With two integrations by parts in the first term in (90.23), we find 1 - ~ ww J ( l + W 0 *(x)x dx- (90 25) - Figure 16 shows a graph of the function I(x)lh\When x<^l, the important region in the integral is x — 1. Expanding the integrand in powers of x a n d integrating by means of the formula j x'V(x) dx = - ^ o 3{Av'mT(\v + l)T(\v +1), we obtain 55V3 / =/ „ ( l - ^ * + 48*'-...). (90.26) When x ^ 1» the important region is that in which *x3'2 ~ 1, i.e., x «^ 1. In the first approximation, we can therefore replace $'(*) by 4>'(0) = - 3l/6r(j)/2Vir, and the §90 Interaction of Electrons with Photons 384 1U 09 08 07 06 I o« 04 03 02 01 05 15 10 3X FIG. 16. integration then leads to the result 1 m%eW 243ft2 V X) m (90.27) Synchrotron emission causes the occurrence of a polarization of electrons moving in the field (A. A. Sokolov and I. M. Ternov, 1963). To discuss this, we have to find the probability of a radiative transition with spin reversal. Putting in (90.21) Ç, = -£/ ■ Ç, |£| = 1, we have RÎK, = B, • B 2 - (e* • B,)(e • B2) - (e* • B, x Ç)(e • B2 x Q - i(C ■ e*)(e • B, x B2). Summation over polarizations of the photon gives, after a simple calculation, 2 i?!R, = (B, • B2)(l - ({ • n)2) + ({ • n)(n • B.XÇ • B2) + c + (£ • n)(n • B2)(£ • B,) - i(£- n(n • Q) • B, x B2. (90.28) We shall assume that \ ^ 1 anc* s e e ^ ° n ly the principal term in the expansion of the probability in powers of ft. Since the expression (90.28) (with B given by (90.20)) already contains ft2, all the remaining quantities e\ including those in the exponent in (90.18), can be replaced by e. §90 385 Synchrotron Radiation With the expansions Bl ,£(.-„ +!T * + „a). B2 = 27l n - v - i T V + v 7> r2 - ri = TV + 24 v, and substituting (90.28) in (90.21) and thence in (90.10), we find the differential transition probability per unit time (dw = dIlho>). 
The integration over d3k is carried out by means of the formula J d\ „.„ , 4ir (90.29) where in this case Xo = T, x = r2 - r,, X -Xo X -T ^ + l2 j The result of the calculation is W= «^[m) Wo f(l + zVl2)4?"Ï2?+(? + Ï2?) ( ^ v) -?^o^ V><V J' where z = rcooWm and the contour of integration passes below the real axis and is closed in the lower half-plane. After this integration we finally obtain for the total probability of a radiative transition with spin reversal w = 5V3<* h2 /e\5 ,/. lt0, 8V3 ifyfey-K-W-^HO' where fn = ^ • v, C± = i' H/H. This formula is valid for both electrons (e < 0) and positrons (e > 0). The probability (90.30) is independent of the sign of the longitudinal polarization £ll but depends on that of £x. The polarization resulting from the emission is therefore transverse.! For electrons, the probability of a transition from a state with the spin parallel to the field (£x = 1) to a state with the spin antiparallel to the field is greater than that of the opposite transition. The radiative polarization of the electrons is therefore antiparallel to the field, and the degree of polarization in a t This is also evident from the fact that the axial vector of the resultant polarization must be along H, which is the only axial vector occurring in the problem. 386 Interaction of Electrons with Photons §91 stationary state is (when fj = 0) w(£i = - l ) - w ( £ i = 0 wtf± = - l ) + w ( £ ± = l ) = 8V3 15 U ^' Positrons are polarized, to the same degree, parallel to the field. §91. Pair production by a photon in a magnetic field The production of an electron-positron pair by a photon in a magnetic field, and synchrotron radiation, are two cross-channels of the same reaction. The amplitude Mfi of the pair production process is therefore found from the synchrotron radiation amplitude by simply making the changes e , p - > - e+, - p + ; e',p'-» e_,p_; <u, k - * - co,-k. 
(91.1) Here e-, p_ and e+, p+ are the electron and positron energies and momenta; e, p and E\ p' the initial and final energies and momenta of the electron in synchrotron radiation. In terms of angles and magnitudes, the momenta are transformed according to IPMP+I> IP'MP-I, 0->7T-0+, » # ->e-, *->*-*, (91.2) where d± are the angles between p ± and k, <f> the angle between the kp+ and kp_ planes. For synchrotron radiation, the cross-section is given in terms of the amplitude byt dff = M l ''| ! 8J^ S(e -*'-"> 1 ^ ; < 9U) see (64.25). The delta function is eliminated by integrating with respect to e'. Since, in the present case, p' and k are independent variables, and d3p' = | p V de' do\ d'k = <o2 d<odok, we have simply to substitute 8(e - e' - a)) d*p' d*k -> co2|p'N' dok do1 dio. Then da = |M/f|2 gQ^jpi àok do da>. t In this section we again put h = 1 as well as c = 1. I (91.4) §91 Pair Production by a Photon in a Magnetic Field 387 For pair production by a photon, the cross-section is given in terms of the amplitude by d da = \Mfi\2 r - ^ — S(o) - 6 + - g _) d'P(; >- or, after elimination of the delta function, dv = IM/,12 J^%1O\ZTT) do+ do_ de+. (91.5) it) Comparison with (91.4) shows that, to obtain the pair production cross-section from the synchrotron radiation cross-section, we have to make in (91.4) the changes (91.1), multiply by (p2+/û>2) dejda), (91.6) and replace do' dok by do+ do-. In the ultra-relativistic case a> > m,t this can be done in the formulae derived in §90. Here it is assumed that both particles in the pair are ultra-relativistic; it is easily verified that all the approximations used in §90 then remain valid. In particular, the probability of pair production by an unpolarized photon, summed over the electron and positron spin projections and integrated over the directions of emergence of the electron, is found on making the changes (91.1) in (90.22) (or rather in the expression for dJ/dw), with d*k = o>2 dw don replaced by dW- . 
00 e2 d3p+ f ( m2 el + el 2 2\ v x e x p { ^ ( l - n . v + +^wl)}dr. (91.7) where a>o+ = |e|H/e+, and n is a unit vector parallel to the photon momentum, which lies in the plane perpendicular to the magnetic field. The integration is carried out in the same way as in §90, and (since (91.7) depends only on the angle between n and v+) it does not matter whether we integrate over do+ or over don. The result can therefore be obtained directly by analogy with (90.23): 00 dw = v T ^ {/ ^ d* + (f " K Vjc)*'<*>}» (918) t More precisely, we must have eu sin d > m, where d is the angle between k and H; when d = 0, no pairs are formed. In what follows, we shall take d = fa. 388 §91 Interaction of Electrons with Photons where now x = (m3ù>l\e\He+e-)\ } \ K = \e\H(olm3 (= ft2|e|Hû)/mV). J (91.9) The total pair production probability per unit time is found by integrating (91.8) with respect to e+; in view of the obvious symmetry in e+ and e- = <o - e+, it is sufficient to take twice the integral from 0 to \u. Changing variables from e+ to x and integrating by parts in the first term in (91.8), we obtain ?H V^ f J f(x3/2-4/K)1/2 3(X3/2-2/K)<J>'(X)1 73 <Pix) ül ili I P -x \x -4lK)ili\dx (9U0) (A. I. Nikishov and V. I. Ritus, 1967). In the limit of weak fields (K <^ 1), values of x near the lower limit are important in the integral (91.10). Since these values are large, we can use the asymptotic expression for the Airy function, $(*)~2P*expHx 3 / 2 ); see QM, §b. With the variable of integration V = X 3 / 2 - 4 / K , and putting y = 0 wherever possible, we find by calculation W= ^re_8/3K' (9U1) K<L The exponential decrease of the probability as K -* 0 corresponds to the impossibility of pair production in the classical limit. In the opposite limit of strong fields (K > 1), only the second term in (91.10) is important, and it is governed by the range of x for which 'XW~HK<\. 
In that range, $\Phi'(x)$ may be replaced by $\Phi'(0) = -3^{1/6}\Gamma(\tfrac23)/2\sqrt\pi$. With the value of the integral

$$\int_1^\infty y^{-\nu}(y-1)^{\mu-1}\,dy = \Gamma(\nu-\mu)\,\Gamma(\mu)/\Gamma(\nu),$$

we find

$$w = \frac{3^{1/6}\cdot5\,\Gamma^2(\tfrac23)}{2^{4/3}\cdot7\,\pi^{1/2}\,\Gamma(\tfrac76)}\,\frac{|e|^3H}{m\kappa^{1/3}} = 0.38\,|e|^3H/m\kappa^{1/3},\qquad \kappa \gg 1. \tag{91.12}$$

The function $mw(\kappa)/|e|^3H$ has a maximum value of 0.11 at $\kappa \approx 11$.

§92. Electron-nucleus bremsstrahlung. The non-relativistic case

This section and those following are concerned with the important phenomenon of bremsstrahlung, the radiation emitted in a collision between particles.

We shall first consider a non-relativistic collision between an electron and a nucleus, assuming that the nucleus remains at rest; that is, we consider radiation from the scattering of an electron in the Coulomb field of a fixed centre (A. Sommerfeld, 1931).

We begin from formula (45.5) for the probability of dipole radiation:

$$dw = (\omega^3/2\pi)\,|\mathbf e^*\cdot\mathbf d_{fi}|^2\,do_{\mathbf k}. \tag{92.1}$$

In the present case, the initial and final states of the electron belong to the continuous spectrum, and the photon frequency

$$\omega = (1/2m)(p^2-p'^2), \tag{92.2}$$

where $\mathbf p = m\mathbf v$ and $\mathbf p' = m\mathbf v'$ are the initial and final momenta of the electron. If the initial and final wave functions of the electron are normalized to "one particle per unit volume" ($V = 1$), the expression (92.1), on multiplication by $d^3p'/(2\pi)^3$ and division by the incident flux density $v/V = v$, will give the cross-section $d\sigma_{\mathbf k\mathbf p'}$ for emission of a photon $\mathbf k$ into the solid angle $do_{\mathbf k}$ with scattering of the electron into the range of states $d^3p'$. Replacing the matrix element of the dipole moment $\mathbf d = e\mathbf r$ by that of the momentum:

$$\mathbf d_{fi} = e\mathbf r_{fi} = (ie/m\omega)\,\mathbf p_{fi},$$

we can write the expression for the cross-section in the form†

$$d\sigma_{\mathbf k\mathbf p'} = \frac{e^2\omega}{(2\pi)^4mp}\,|\mathbf e^*\cdot\mathbf p_{fi}|^2\,do_{\mathbf k}\,d^3p', \tag{92.3}$$

where

$$\mathbf p_{fi} = \int\psi_f^*\,\hat{\mathbf p}\,\psi_i\,d^3x = -i\int\psi_f^*\,\nabla\psi_i\,d^3x.$$

For $\psi_i$ and $\psi_f$ we must use the exact wave functions in an attractive Coulomb field, whose asymptotic form consists of a plane wave and a spherical wave. The spherical wave must be ingoing in $\psi_f$ and outgoing in $\psi_i$ (see QM, §136).
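These functions are

$$\psi_i = A_i\,e^{i\mathbf p\cdot\mathbf r}\,F(i\nu, 1, i(pr-\mathbf p\cdot\mathbf r)),\qquad \psi_f = A_f\,e^{i\mathbf p'\cdot\mathbf r}\,F(-i\nu', 1, -i(p'r+\mathbf p'\cdot\mathbf r)),$$
$$\nu = Ze^2m/p,\qquad \nu' = Ze^2m/p', \tag{92.4}$$

with the normalization factors

$$A_i = e^{\pi\nu/2}\,\Gamma(1-i\nu),\qquad A_f = e^{\pi\nu'/2}\,\Gamma(1+i\nu'). \tag{92.5}$$

† In this section, $p$ and $p'$ denote $|\mathbf p|$ and $|\mathbf p'|$ respectively.

Since

$$\nabla F(i\nu, 1, i(pr-\mathbf p\cdot\mathbf r)) = i\left(p\,\frac{\mathbf r}{r}-\mathbf p\right)F' = -\frac{p}{r}\left(\frac{\partial F}{\partial\mathbf p}\right)_\nu,$$

we can write the gradient $\nabla\psi_i$ as

$$\nabla\psi_i = i\mathbf p\,\psi_i - A_i\,e^{i\mathbf p\cdot\mathbf r}\,\frac{p}{r}\left(\frac{\partial F}{\partial\mathbf p}\right)_\nu.$$

On multiplication by $\psi_f^*$ and integration, the first term vanishes, because $\psi_i$ and $\psi_f$ are orthogonal. The matrix element $\mathbf p_{fi}$ is therefore

$$\mathbf p_{fi} = iA_iA_f^*\,p\,\partial J/\partial\mathbf p, \tag{92.6}$$

where $J$ denotes the integral

$$J = \int\frac{e^{-i\mathbf q\cdot\mathbf r}}{r}\,F(i\nu', 1, i(p'r+\mathbf p'\cdot\mathbf r))\,F(i\nu, 1, i(pr-\mathbf p\cdot\mathbf r))\,d^3x, \tag{92.7}$$

$\mathbf q = \mathbf p'-\mathbf p$. The symbol $\partial/\partial\mathbf p$ has been taken outside the integral, with the understanding that, in the differentiation of $J$, the quantities $\nu$, $\nu'$, $\mathbf q$ are to be regarded as independent parameters, $\nu$ and $\mathbf q$ being expressed in terms of $\mathbf p$ only after the differentiation.

The integral is calculated by replacing the confluent hypergeometric functions by their expressions as contour integrals. Here we shall give only the result:†

$$J = B\,F(i\nu', i\nu, 1, z),$$
$$B = 4\pi e^{-\pi\nu}\,(-q^2-2\mathbf q\cdot\mathbf p)^{-i\nu}\,(q^2-2\mathbf q\cdot\mathbf p')^{-i\nu'}\,(q^2)^{i\nu+i\nu'-1}, \tag{92.8}$$
$$z = 2\,\frac{q^2(pp'+\mathbf p\cdot\mathbf p')-2(\mathbf q\cdot\mathbf p)(\mathbf q\cdot\mathbf p')}{(q^2-2\mathbf q\cdot\mathbf p')(q^2+2\mathbf q\cdot\mathbf p)}.$$

Here $F(i\nu', i\nu, 1, z)$ is the complete hypergeometric function. After differentiating in (92.6), we can put $\mathbf q = \mathbf p'-\mathbf p$; then

$$z = -2\,\frac{pp'-\mathbf p\cdot\mathbf p'}{(p-p')^2} \tag{92.9}$$

($z < 0$). Also, $-q^2-2\mathbf q\cdot\mathbf p = q^2-2\mathbf q\cdot\mathbf p' = p^2-p'^2 > 0$.

† The calculations are given by A. Nordsieck, Physical Review 93, 785, 1954.

The matrix element is thus finally found to be

$$\mathbf p_{fi} = A_iA_f^*\,\frac{8\pi i\,e^{-\pi\nu}}{(p+p')(p-p')^3}\left(\frac{p+p'}{p-p'}\right)^{-i(\nu+\nu')}(1-z)^{i(\nu+\nu')-1}\,[\,i\nu p\,\mathbf q\,F(z)+(1-z)F'(z)(p'\mathbf p-p\mathbf p')\,], \tag{92.10}$$

where we have put for brevity

$$F(z) = F(i\nu', i\nu, 1, z). \tag{92.11}$$

The cross-section is obtained by substituting (92.10) in (92.3), but the general formula is very lengthy and obscure.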
We shall therefore go on immediately to calculate the spectral distribution of the radiation, i.e. to integrate the cross-section over the directions of the photon and the final electron.

The integration over $do_{\mathbf k}$ and the summation over the polarizations of the photon are equivalent to averaging over all directions $\mathbf e$ and multiplication by $2\times4\pi$, i.e. to the substitution $e_ie_k^*\,do_{\mathbf k} \to (8\pi/3)\delta_{ik}$. The cross-section is then

$$d\sigma_{\omega\mathbf p'} = \frac{e^2\omega p'}{6\pi^3p}\,|\mathbf p_{fi}|^2\,d\omega\,do_{\mathbf p'}. \tag{92.12}$$

The value of $|\mathbf p_{fi}|^2$ is calculated by using (92.9)-(92.11) and the formula $|\Gamma(1-i\nu)|^2 = \pi\nu/\sinh\pi\nu$. The result is

$$|\mathbf p_{fi}|^2 = \frac{2^8\pi^4(Ze^2)^2m^2}{(p+p')^2(p-p')^4(1-e^{-2\pi\nu'})(e^{2\pi\nu}-1)}\times\left\{\frac{\nu\nu'}{1-z}|F|^2-z|F'|^2+\tfrac12i(\nu+\nu')\frac{z}{1-z}(FF'^*-F^*F')\right\}. \tag{92.13}$$

To integrate the cross-section (92.12) over $do_{\mathbf p'} = 2\pi\sin\theta\,d\theta$, we change from the variable $\theta$ (the scattering angle) to

$$z = -\frac{2pp'(1-\cos\theta)}{(p-p')^2},\qquad do_{\mathbf p'} = \frac{\pi(p-p')^2}{pp'}\,|dz|.$$

In order to integrate with respect to $z$, we transform the expression in the braces in (92.13) as follows. According to the differential equation of the hypergeometric function (see QM, (e.2)), we have

$$z(1-z)F''+[1-(1+i\nu+i\nu')z]F'+\nu\nu'F = 0,$$
$$z(1-z)F''^*+[1-(1-i\nu-i\nu')z]F'^*+\nu\nu'F^* = 0.$$

Multiplying these equations by $F^*$ and $F$ respectively and adding, we obtain

$$(1-z)\left[\frac{d}{dz}\,z(F'F^*+F'^*F)-2z|F'|^2+i(\nu+\nu')\frac{z}{1-z}(F'^*F-F'F^*)+\frac{2\nu\nu'}{1-z}|F|^2\right] = 0.$$

Hence the expression in the braces in (92.13) is seen to be

$$\{\ldots\} = -\frac12\,\frac{d}{dz}\,z(F'F^*+FF'^*), \tag{92.14}$$

and the integration is immediate.

Collecting the above formulae, we find as the final expression for the bremsstrahlung emission cross-section in the frequency range $d\omega$†

$$d\sigma_\omega = \frac{64\pi^2}{3}\,Z^2\alpha r_e^2\,\frac{m^2c^2}{(p-p')^2}\,\frac{p'}{p}\,\frac{1}{(1-e^{-2\pi\nu'})(e^{2\pi\nu}-1)}\left(-\frac{d}{d\xi}|F(\xi)|^2\right)\frac{d\omega}{\omega}, \tag{92.15}$$

where

$$\nu = Z\alpha mc/p = Ze^2/\hbar v,\qquad \nu' = Ze^2/\hbar v',\qquad F(\xi) = F(i\nu', i\nu, 1, \xi),$$
$$p' = \sqrt{p^2-2m\hbar\omega},\qquad \xi = -4pp'/(p-p')^2.$$
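The step that makes the angular integration immediate, namely the identity between the braces in (92.13) and the total derivative (92.14), can be checked numerically. The sketch below sums the hypergeometric series for $F(i\nu', i\nu, 1, z)$ directly, which restricts it to $|z| < 1$; the function names and the sample parameter values are assumptions introduced for illustration, and the check is not a substitute for the analytic derivation.

```python
def hypergeometric_with_derivatives(a, b, z, terms=400):
    """Return (F, F', F'') for F(a, b; 1; z) from its power series, |z| < 1."""
    coeff = 1.0 + 0.0j          # series coefficient c_n, starting with c_0 = 1
    f = fp = fpp = 0.0j
    for n in range(terms):
        f += coeff * z**n
        if n >= 1:
            fp += n * coeff * z**(n - 1)
        if n >= 2:
            fpp += n * (n - 1) * coeff * z**(n - 2)
        # c_{n+1} = c_n (a+n)(b+n) / (n+1)^2 for the third parameter equal to 1
        coeff *= (a + n) * (b + n) / ((n + 1) ** 2)
    return f, fp, fpp


def braces(nu, nu_prime, z):
    """Expression in the braces of (92.13) for real z."""
    f, fp, _ = hypergeometric_with_derivatives(1j * nu_prime, 1j * nu, z)
    value = (nu * nu_prime / (1.0 - z) * abs(f) ** 2
             - z * abs(fp) ** 2
             + 0.5j * (nu + nu_prime) * z / (1.0 - z)
             * (f * fp.conjugate() - f.conjugate() * fp))
    return value.real


def total_derivative_form(nu, nu_prime, z):
    """Right-hand side of (92.14): -(1/2) d/dz [ z (F'F* + FF'*) ]."""
    f, fp, fpp = hypergeometric_with_derivatives(1j * nu_prime, 1j * nu, z)
    re_fp_f = (fp * f.conjugate()).real
    re_fpp_f = (fpp * f.conjugate()).real
    return -(re_fp_f + z * (re_fpp_f + abs(fp) ** 2))


if __name__ == "__main__":
    for nu, nu_prime, z in [(0.7, 1.1, -0.3), (2.0, 0.5, 0.4)]:
        print(braces(nu, nu_prime, z), total_derivative_form(nu, nu_prime, z))
```

The two columns agree to rounding error. The physical region $z < 0$ can extend beyond $|z| = 1$, where an analytic continuation of the series would be needed; that is outside the scope of this check.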
Let us consider the limiting case where both velocities $v$ and $v'$ are so large that $\nu \ll 1$, $\nu' \ll 1$ (but, of course, still with $v \ll 1$, so that $Z\alpha \ll v \ll 1$; this is possible only if $Z$ is small). To calculate the derivative $F'(\xi)$ in this case, we use the formula

$$\frac{d}{dz}F(\alpha, \beta, \gamma, z) = \frac{\alpha\beta}{\gamma}\,F(\alpha+1, \beta+1, \gamma+1, z),$$

which is easily obtained by simple differentiation of the hypergeometric series. Then

$$F'(\xi) \approx -\nu\nu'\,F(1, 1, 2, \xi) = (\nu\nu'/\xi)\log(1-\xi);$$

the last equation is evident from a direct comparison of the corresponding series.

† Formulae (92.15)-(92.25) are given in ordinary units.

For the function $F(\xi)$ itself, we have simply $F(\xi) \approx F(0, 0, 1, \xi) = 1$. Then, from (92.15),

$$d\sigma_\omega = \frac{16}{3}\,Z^2\alpha r_e^2\,\frac{c^2}{v^2}\,\log\frac{v+v'}{v-v'}\,\frac{d\omega}{\omega},\qquad Ze^2/\hbar v \ll 1,\quad Ze^2/\hbar v' \ll 1. \tag{92.16}$$

The smallness of $\nu$ and $\nu'$ is just the condition for the Born approximation to be valid in the case of Coulomb interaction. Formula (92.16) itself can therefore be more simply obtained directly by means of perturbation theory (see Problem 1).

Now let a fast electron ($\nu \ll 1$) lose a considerable fraction of its energy by radiation, so that $v' \ll v$ and $\nu'$ may not be small. Then

$$-\xi \approx 4p'/p = 4\nu/\nu' \ll 1,\qquad F(\xi) \approx F(i\nu', 0, 1, \xi) = 1,\qquad F'(\xi) = -\nu\nu'\,F(1+i\nu', 1, 2, \xi) \approx -\nu\nu',$$

and the cross-section is

$$d\sigma_\omega = \frac{64\pi}{3}\,Z^3\alpha^2r_e^2\left(\frac{c}{v}\right)^3\frac{1}{1-\exp(-2\pi Ze^2/\hbar v')}\,\frac{d\omega}{\omega},\qquad Ze^2/\hbar v \ll 1,\quad v' \ll v. \tag{92.17}$$

When $\nu' \ll 1$, this formula yields the same limiting expression,

$$d\sigma_\omega = \frac{32}{3}\,Z^2\alpha r_e^2\,\frac{c^2v'}{v^3}\,\frac{d\omega}{\omega},$$

as (92.16) does when $v' \ll v$. Hence formulae (92.16) and (92.17) jointly cover the whole range of $v'$ (when $\nu \ll 1$).

When $\omega \to \omega_0$, where $\hbar\omega_0 = \tfrac12mv^2$, the velocity $v' \to 0$ and $\nu' \to \infty$. In this limiting case, (92.17) gives

$$d\sigma_\omega = \frac{128\pi}{3}\,Z^3\alpha^2r_e^2\left(\frac{c}{v}\right)^3\frac{\hbar\,d\omega}{mv^2}. \tag{92.18}$$

Thus $d\sigma_\omega/d\omega$ tends to a finite limit as $\omega \to \omega_0$. This can be explained in a general manner by arguments similar to those given in QM, §147. The physical reason is that the frequency $\omega_0$ is the limit only of the continuous bremsstrahlung spectrum. The electron can also go to a bound state with emission of a frequency $\omega > \omega_0$.
But highly excited bound states in a Coulomb field have properties almost the same as those of the free states near their limit. Hence the boundary between the continuous and the discrete spectrum is not essentially a physically distinctive point.

Let us now consider the case where both parameters $\nu, \nu' \gg 1$. The motion of both the initial and the final electrons is then quasi-classical. If the condition $\hbar\omega \ll p^2/2m$ is also satisfied, the matrix element too is quasi-classical. Then the formula of quantum mechanics must become the result given by the classical theory (see Fields, §70). We shall, however, suppose that $p^2/2m \sim \hbar\omega$, so that we need an asymptotic expression for the function $F(\xi)$ when $\nu, \nu' \to \infty$ and $\xi \sim 1$; a more exact condition will be stated below, (92.24).

To derive this expression, we start from the integral representation of the hypergeometric function, QM, (e.3), writing it as

$$F(i\rho\nu', i\nu', 1, \xi) = -\frac{1}{2\pi i}\oint_C(-t)^{i\rho\nu'-1}(1-t)^{-i\rho\nu'}(1-t\xi)^{-i\nu'}\,dt, \tag{92.19}$$

where $\rho = \nu/\nu'$, $0 < \rho < 1$, so that

$$\xi = -4\rho/(1-\rho)^2. \tag{92.20}$$

The contour of integration is taken as shown in Fig. 17, passing along part of the real axis and avoiding the points $t = 0$ and $t = 1$.† When $\nu, \nu' \gg 1$, the value of the integrand is small on the lower part of this contour, and may be neglected. On passing downwards round the point $t = 0$, the integrand is multiplied by the small factor $\exp(-2\pi\rho\nu')$, and on passing upwards round $t = 1$ it is multiplied by $\exp(2\pi\rho\nu')$. The integral

$$F = \frac{e^{-\pi\rho\nu'}}{2\pi i}\int e^{\nu'f(t)}\,\frac{dt}{t},\qquad f(t) = i\rho\log\frac{t}{1-t}-i\log(1-\xi t), \tag{92.21}$$

may be calculated by the saddle-point method. The saddle point $t_0$ is given by the condition $f'(t_0) = 0$, whence $t_0 = \tfrac12(1-\rho)$. At this point, however, the derivative $f''(t_0)$ is also zero, so that we must write

$$f(t) \approx f(t_0)+\tfrac13ia\tau^3,\qquad \tau = t-t_0,$$

where

$$f(t_0) = 2\pi\rho+i(1+\rho)\log\frac{1-\rho}{1+\rho},\qquad a = -\tfrac12if'''(t_0) = \frac{16\rho}{(1-\rho^2)^2}.$$

FIG. 17. The contour of integration $C$ in the $t$-plane, passing round the points $t = 0$ and $t = 1$.

† For the hypergeometric function $F(\alpha, \beta, \gamma, \xi)$, the contour is to be chosen so that the function $V(t) = t^\alpha(1-t)^{\gamma-\alpha}(1-t\xi)^{-\beta-1}$ returns to its initial value on passing round the contour. When $\gamma$ is integral (here, $\gamma = 1$), the contour chosen satisfies this condition.

The coefficient $1/t$ of the exponential in the integrand may be written $1/t \approx 1/t_0-\tau/t_0^2$; we cannot here take simply the term $1/t_0$, as this would reduce to zero the derivative $d|F(\xi)|^2/d\xi$ in (92.15). Thus we have, after an obvious substitution in the integrals,

$$F \approx \frac{1}{2\pi i\,t_0(a\nu')^{1/3}}\exp\{-\pi\rho\nu'+\nu'f(t_0)\}\times\left\{\int_{-\infty}^{\infty}e^{ix^3/3}\,dx-\frac{1}{t_0(a\nu')^{1/3}}\int_{-\infty}^{\infty}x\,e^{ix^3/3}\,dx\right\}. \tag{92.22}$$

The two integrals here are, respectively,

$$2\int_0^\infty\cos\tfrac13x^3\,dx = 3^{-1/6}\,\Gamma(\tfrac13),\qquad 2i\int_0^\infty x\sin\tfrac13x^3\,dx = 3^{1/6}\,\Gamma(\tfrac23)\,i.$$

The derivative $F'(\xi)$ is calculated similarly; according to (92.19), it is given by an integral which differs from (92.21) only in that the coefficient $1/t$ of the exponential is replaced by $i\nu'/(1-\xi t)$. A simple calculation then leads to the result

$$-\frac{d}{d\xi}|F(\xi)|^2 = \frac{(1-\rho)^2}{4\sqrt3\,\pi\rho}\,e^{2\pi\rho\nu'}.$$

Finally, substitution of this in (92.15) gives, with the necessary accuracy, the simple expression

$$d\sigma_\omega = \frac{16\pi}{3^{3/2}}\,Z^2\alpha r_e^2\,\frac{m^2c^2}{p^2}\,\frac{d\omega}{\omega}. \tag{92.23}$$

The condition for this to be valid, i.e. for the asymptotic formula (92.22) to be valid, is that the second term in (92.22) should be much less than the first: $(1-\rho)\nu \gg 1$, or, expressing the parameters of the hypergeometric function in terms of physical quantities,

$$\hbar\omega \gg (\hbar v/Ze^2)\cdot\tfrac12mv^2. \tag{92.24}$$

This inequality is compatible with the quasi-classicality condition $\hbar\omega \ll \tfrac12mv^2$, i.e. $1-\rho \ll 1$.
When the latter condition is satisfied, the result must be the same as the corresponding formula in the classical theory, and indeed on multiplication by $\hbar\omega$ the expression (92.23) becomes the classical formula (Fields, (70.22)) for the "effective retardation" in the high-frequency limit.† In order to go to the classical formulae throughout the range $(1-\rho)\nu \sim 1$, $\nu \gg 1$, we should have to find the asymptotic form of the hypergeometric function when the saddle point is close to the singularity $t = 0$; we shall not deal with this here, since the final result is obvious.

All the formulae given above refer to an attractive Coulomb field. The cross-section for emission in a repulsive field is obtained from (92.15) by changing the signs of $\nu$ and $\nu'$. Then, in particular, the limiting Born formula (92.16) remains unaltered, but in the limit $\nu \ll 1$, $\nu' \to \infty$ we have instead of (92.18)

$$d\sigma_\omega = \frac{128\pi}{3}Z^3\alpha^2 r_e^2\left(\frac{c}{v}\right)^3\exp\left(-\frac{\sqrt{2mc^2}\,\pi Z\alpha}{\sqrt{\hbar(\omega_0-\omega)}}\right)\frac{\hbar\,d\omega}{mv^2}, \tag{92.25}$$

i.e. the differential cross-section tends exponentially to zero as $\omega \to \omega_0$. This result also is reasonable: in a repulsive field, there are no bound states, and the frequency $\omega_0$ is the true boundary of the radiation spectrum.

PROBLEMS

PROBLEM 1. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic collision of two particles having different values of the ratio $e/m$.

SOLUTION. The dipole moment of two particles with charges $e_1$, $e_2$ and masses $m_1$, $m_2$, in their centre-of-mass system, is

$$\mathbf d = e_1\mathbf r_1 + e_2\mathbf r_2 = \mu\left(\frac{e_1}{m_1}-\frac{e_2}{m_2}\right)\mathbf r,$$

where $\mu = m_1 m_2/(m_1+m_2)$, $\mathbf r = \mathbf r_1 - \mathbf r_2$. Hence

$$\ddot{\mathbf d} = \mu\left(\frac{e_1}{m_1}-\frac{e_2}{m_2}\right)\ddot{\mathbf r} = e_1 e_2\left(\frac{e_1}{m_1}-\frac{e_2}{m_2}\right)\frac{\mathbf r}{r^3}.$$

The matrix element is

$$\mathbf d_{\mathbf p'\mathbf p} = -\frac{1}{\omega^2}(\ddot{\mathbf d})_{\mathbf p'\mathbf p},\qquad \omega = \frac{p^2-p'^2}{2\mu},$$

where $\mathbf p = \mu\mathbf v$, $\mathbf p' = \mu\mathbf v'$ are the momenta of relative motion, and it is calculated from the plane waves‡ by means of the formula

$$\int \frac{\mathbf r}{r^3}\,e^{-i\mathbf q\cdot\mathbf r}\,d^3x = -\frac{4\pi i\,\mathbf q}{q^2},\qquad \mathbf q = \mathbf p'-\mathbf p.$$

† The agreement of the formula (92.23) for $\hbar\omega\,d\sigma_\omega$ with the classical formula, when the one condition (92.24) is satisfied, is to some extent accidental.
In the classical formula, the difference between v and v' is beyond the accuracy postulated, and there is no reason to identify vo in Fields, (70.22) with v specifically; if vo is identified with v\ there is no longer agreement with (92.23). t The replacement of two particles by a single particle having the reduced mass is, of course, permissible only in the non-relativistic case. §92 Electron-Nucleus Bremsstrahlung 397 The result is . e]el ( e\ e2\2 v' /x2 day . . m dorkp = — r — - — I 7 7 7 (e • q)(c* • q ) — <V dok. After summation over polarizations, the angular distribution of the radiation is given by a factor sin2 0 , where 0 is the angle between the direction of the photon k and the vector q, which lies in the scattering plane (see (45.4a)).. After integration over the directions of the photon we have ■ i6 2 it e\ \mi e2\2 v'da) sin Odd mi) v W v .+ v -2vv cos 0 where 0 is the scattering angle. Finally, integration with respect to 0 gives , 16 2 li*\ da» = T6i€2l \m\ * 2 \ 2 1 1 V + V' dto - 7 log -,—. mi) v v - v to For radiation in the field of a fixed centre of Coulomb force, this formula is equivalent to (92.16). PROBLEM 2. In the Born approximation, find the bremsstrahlung cross-section for a non-relativistic collision of two electrons.t SOLUTION. In this case there is no dipole radiation, and we must therefore consider quadrupole radiation. In the classical theory, the spectral distribution of the total intensity of quadrupole radiation is given by where D* = I eOxiXk - r28ik) is the quadrupole moment tensor of a system of charges.* For two electrons we have, in their centre-of-mass system, Dik = {eOxiXk - r25ik), r = n - r2. In the quantum theory, the Fourier components must be replaced by the matrix elements (cf. 
the discussion of dipole radiation in §45), and, with appropriate normalization of the wave functions (plane waves) and division by the photon energy o>, we obtain the cross-section for emission of radiation with scattering of the electrons into the range of states d*p': where v = 2p/m is the initial velocity of relative motion; the emitted frequency o> = ( p 2 - p,2)lm. The operator D,* is calculated by threefold commutation of the operator Dik with the Hamiltonian m r and is§ t The collision velocity v satisfies the conditions a<e2lhv<\. The classical case (e2lhv > 1) is discussed in Fields, §71, Problem. t This formula is obtained from Fields (71.5) in the same way as (67.11) in that book is derived from (67.8). § This expression is analogous to the classical form l * 1 fi - 4 * 3 1"* Xi « J. * X k „ oXiXk Dik = ^ 7 16ppk + 6 p p i - 9 - p ~ p • r - p & k p • rl, which would be obtained on differentiating Dik and using the classical equation of motion: 5mr=e2r/r\ §92 Interaction of Electrons with Photons 398 Since the two particles (electrons) are identical, the matrix elements are calculated from the wave functions ^ = V2<eiP'r±e Pr) ^=V2(eiP ' r±e i9 ~ '">' where the signs -I- and - correspond to total spins 0 and 1 of the electrons (interchange of the electrons corresponds to changing the sign of r). The lengthy calculations lead to the following formula for the spectral distribution of the radiation: « A 2I17 3x2 , 12(2-x)4-7(2-x)V-3x4 ._, 1 1 V ( l - x ) J where x = ü>le and e = p2lm is the initial energy of relative motion of the electrons; the cross-section is averaged over values of the total spin of the electrons. The cross-section for energy loss by radiation is \i (oder* = 8.1 are (B. K. Fedyushin, 1952). PROBLEM 3. Determine the energy of the radiation resulting from the emission by a nucleus of a non-relativistic electron in the 5 state. SOLUTION. 
The wave function of the emitted electron is an outgoing spherical s wave normalized to unit total flux: 1 eipr V(4TTV) r ' see QAf, (33.14). As the wave function of the final state of the electron (after emission of the photon), we choose the plane wave The transition matrix element is P/.=(Pi/)*=(J*?P<M3x) = , P' f ,-ip'-r+lpf <^X V(4irv) J -V 47T r p' v p' - p V 7^ IT y' the integral is calculated by means of (57.6a). The radiation energy is given by (45.8) on multiplying by d3p7(27r)3 and integrating over the directions of p' (which is equivalent to multiplying by 4ir). The spectral distribution of the emitted energy is then dE„ = (2e V3/37Tt;) dco. When u) -♦(), thefinalvelocity v' of the electron tends to t\ and the formula agrees, as it should, with the non-relativistic limit of the classical result; see Fields, §69, Problem. The total emitted energy is (in ordinary units) *-£-(D'2 where e = \mv is the initial energy of the electron. §92 Electron-Nucleus Brems Strahlung 399 PROBLEM 4. Determine the energy of the radiation resulting from the reflection of a non-relativistic electron from an infinitely high potential barrier. SOLUTION. Let the electron be moving perpendicularly to the barrier. Although the photon may be emitted in any direction, in the non-relativistic case the photon momentum is small compared with the electron momentum, and we may therefore suppose that the reflected electron also is moving perpendicularly to the plane of the barrier. Let the barrier be at x = 0, and the electron be moving on the side where x > 0. The wave functions of the stationary states of one-dimensional motion, normalized by S(p/2ir) (p = px), have the form of stationary waves (see Qhf, §21): \\t\-2 sin px, \\ff = 2 sin p'jc. The matrix element of the operator p = px is Pfi = - 4i I sin P*~r sin px dx P -P integrals of this form are to be understood as the limit, as S -♦ + 0, of the values obtained by including a factor e-ÄX in the integrand. 
The energy radiated in a single reflection of the electron is found from (45.8) by multiplying by $dp' = d\omega/v'$ and dividing by $v/2\pi$ (the flux density of the wave approaching the barrier in the initial function $\psi_i$):

$$dE_\omega = \frac{2e^2\omega^2}{3\pi m^2 vv'}\,|p_{fi}|^2\,d\omega = \frac{8}{3\pi}\,e^2 vv'\,d\omega. \tag{1}$$

At low frequencies ($\omega \ll \epsilon = \tfrac12 mv^2$) we have $v' \approx v$, and formula (1) becomes the classical formula (Fields (69.5)), which has to be integrated over angles, using the fact that $v = \tfrac12\Delta v$, where $\Delta v$ is the change in velocity of the electron on reflection; this is as it should be, since the condition for the collision time to be small (Fields, (69.1)) is always satisfied in reflection from a barrier. The quantum formula (1), however, also gives the total emitted energy (in ordinary units):

$$E = \int_0^{\epsilon}\frac{dE_\omega}{d\omega}\,d\omega = \frac{16}{9\pi}\,\alpha\epsilon\,\frac{v^2}{c^2}.$$

PROBLEM 5. Determine the bremsstrahlung energy in the scattering of a slow electron by an atom.

SOLUTION. With the condition $pa \ll 1$ (where $a$ denotes the atomic dimensions), the scattering by the atom is isotropic and does not depend on the electron energy; see QM, §132. The wave functions of the initial and final states of the electron may be written

$$\psi_i = e^{i\mathbf p\cdot\mathbf r} + \frac{f}{r}\,e^{ipr},\qquad \psi_f = e^{i\mathbf p'\cdot\mathbf r} + \frac{f}{r}\,e^{-ip'r},$$

where $f$ is the constant real scattering amplitude. These expressions pertain to the asymptotic range of distances $r \gg a$, which in the present case is the important range: $r \sim 1/p \gg a$. The matrix element calculated from these functions is

$$\mathbf p_{fi} = \frac{2\pi f}{\omega}\,(\mathbf v-\mathbf v');$$

the integrals are calculated as in Problem 3. Substituting this expression in (92.12), we obtain the cross-section for radiation with scattering of the electron in the direction $\mathbf p'$, in ordinary units,

$$d\sigma_{\omega\mathbf p'} = \frac{2\alpha}{3\pi c^2}\,\frac{p'}{p}\,(\mathbf v-\mathbf v')^2\,d\sigma_{\rm el}\,\frac{d\omega}{\omega}, \tag{1}$$

where $d\sigma_{\rm el} = f^2\,do'$ is the differential cross-section for elastic scattering.
When $\hbar\omega \ll p^2/2m$, we can take $p \approx p'$, and this formula then becomes, as it should, the non-relativistic expression for the emission of soft photons; see §98.† Integrating (1) over the directions of $\mathbf p'$, we obtain

$$d\sigma_\omega = \frac{2\alpha}{3\pi c^2}\,\frac{p'}{p}\,(v^2+v'^2)\,\sigma_{\rm el}\,\frac{d\omega}{\omega}, \tag{2}$$

where $\sigma_{\rm el} = 4\pi f^2$ is the total elastic scattering cross-section. Lastly, multiplying by $\hbar\omega$ and integrating with respect to $\omega$ from 0 to $p^2/2m = \epsilon$, we obtain the "effective retardation"

$$\int \hbar\omega\,d\sigma_\omega = \frac{32}{45\pi}\,\alpha\,\sigma_{\rm el}\,\epsilon\left(\frac{v}{c}\right)^2. \tag{3}$$

† The fact that "factorization" of the cross-section (the separation of the factor $\sigma_{\rm el}$) occurs here for any $\omega$ is to some extent accidental, arising because the scattering amplitude is independent of the energy.

§93. Electron-nucleus bremsstrahlung. The relativistic case

Let us consider the electron-nucleus bremsstrahlung for the case of relativistic electron velocities.‡ We shall assume that the condition for the Born approximation to be valid is satisfied for both the initial ($v$) and the final ($v'$) velocity of the electron: $Ze^2/\hbar v \ll 1$, $Ze^2/\hbar v' \ll 1$. The charge on the nucleus must be such that $Z\alpha \ll 1$. As in §92, we shall neglect the recoil of the nucleus, so that the latter acts only as the source of an external field; the justifiability of this treatment is discussed in §97.

According to (91.4), the cross-section for bremsstrahlung is given in terms of the amplitude by

$$d\sigma = \frac{1}{256\pi^5}\,|M_{fi}|^2\,\frac{|\mathbf p'|\,\omega}{|\mathbf p|}\,do_k\,do'\,d\omega. \tag{93.1}$$

In the first non-vanishing approximation, the matrix element $M_{fi}$ corresponds to two diagrams:

[Diagrams (93.2): the two diagrams, with the photon line $k$ and the external-field line $q$ attached to the electron line in either order.]

The free end $q$ corresponds to the external field, so that $q = p' - p + k$ is the 4-vector of momentum transfer to the nucleus. Since the recoil is neglected, the time component $q^0 = 0$. According to the diagrams (93.2),

$$M_{fi} = -e^2 A_0^{(e)}(\mathbf q)\,\sqrt{4\pi}\,e^*_\mu\,\bar u'\left(\gamma^\mu\,\frac{\gamma f'+m}{f'^2-m^2}\,\gamma^0 + \gamma^0\,\frac{\gamma f+m}{f^2-m^2}\,\gamma^\mu\right)u. \tag{93.3}$$

‡ The majority of the results given below were first derived by H. A. Bethe and W. Heitler (1934) and independently by F. Sauter (1934).
§93 Electron-Nucleus Bremsstrahlung 401 The intermediate 4-momenta are / = p - k, /' = p' + k. We shall use the notation / 2 - m 2 = -2kp=-2Kû>, / ' 2 - m 2 = 2kp's2K'o). (93.4) A{>e) is the scalar potential of the external field; for a purely Coulomb field, AMq) = 4irZe/q2. (93.5) Substitution in (93.1) gives for the cross-section da = j £ t e ^„(Ö'Q'MXWQ V ) dok do' do, (93.6) where V 7 7 2WK' 2O)K' 7 7 2o)K 7 ' 7 7 2o)K 7 ' Disregarding polarization effects, we average the cross-section over the directions of spin of the initial electron and sum over the polarizations of the final electron and photon. This is equivalent to the substitution e*ev(û'Q,iu)(ûQvu')-+-12tr Q^yp + m)Q»(yp' + m). The trace is calculated by means of the standard formulae (§22). The calculations are somewhat simplified by using the equation y\yp)y° = yp, where p = (e, -p) if p = (e, p). Moreover, the number of terms to be calculated can be reduced by using the symmetry with respect to the change p +*p', k-^-k, q^>-q; this simply interchanges cyclically the factors in the product of matrices, leaving the trace unaltered. The result is the following expression for the differential cross-section for bremsstrahlung in which a photon of a given frequency is emitted in a given direction and the secondary electron travels in a given direction:! t Here and in the rest of §93, p, p' and q denote the magnitudes of the three-dimensional vectors: P = IPI» P ' = IP'I, <ï = |q|. 402 §93 Interaction of Electrons with Photons A Z2(xr ^ [KK + P'™4 dù) A A ' v m \K ^.n^l.)^(lL m \K K) ml \K + K ! \K K i (93.7) JL)\ K )) where K = e - n • p, K' = e' — n • p \ n = k/co, q = p' + k - p. 
By means of simple transformations, this expression can be put in a form somewhat more convenient for analysis:

$$d\sigma = \frac{Z^2\alpha r_e^2}{2\pi}\,\frac{d\omega}{\omega}\,\frac{p'm^2}{pq^4}\,\sin\theta\,d\theta\,\sin\theta'\,d\theta'\,d\phi\;\times$$
$$\times\left\{\frac{p'^2}{\kappa'^2}(4\epsilon^2-q^2)\sin^2\theta' + \frac{p^2}{\kappa^2}(4\epsilon'^2-q^2)\sin^2\theta + \frac{2\omega^2}{\kappa\kappa'}(p^2\sin^2\theta+p'^2\sin^2\theta') - \frac{2pp'}{\kappa\kappa'}(2\epsilon^2+2\epsilon'^2-q^2)\sin\theta\sin\theta'\cos\phi\right\}, \tag{93.8}$$

where

$$\kappa = \epsilon - p\cos\theta,\qquad \kappa' = \epsilon' - p'\cos\theta',$$
$$q^2 = p^2+p'^2+\omega^2 - 2p\omega\cos\theta + 2p'\omega\cos\theta' - 2pp'(\cos\theta\cos\theta' + \sin\theta\sin\theta'\cos\phi);$$

$\theta$, $\theta'$ are the angles between $\mathbf k$ and $\mathbf p$, $\mathbf p'$ respectively; $\phi$ is the angle between the plane of $\mathbf k$ and $\mathbf p$ and that of $\mathbf k$ and $\mathbf p'$.

The integration of (93.8) over the directions of the photon and the secondary electron is fairly lengthy. It leads to the following formula for the spectral distribution of the radiation:†

$$d\sigma_\omega = Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{p'}{p}\left\{\frac43 - 2\epsilon\epsilon'\,\frac{p^2+p'^2}{p^2p'^2} + m^2\left(\frac{l\epsilon'}{p^3}+\frac{l'\epsilon}{p'^3}-\frac{ll'}{pp'}\right) + \right.$$
$$\left. + L\left[\frac{8\epsilon\epsilon'}{3pp'} + \frac{\omega^2}{p^3p'^3}\left(\epsilon^2\epsilon'^2+p^2p'^2+m^2\epsilon\epsilon'\right) + \frac{m^2\omega}{2pp'}\left(l\,\frac{\epsilon\epsilon'+p^2}{p^3} - l'\,\frac{\epsilon\epsilon'+p'^2}{p'^3}\right)\right]\right\}, \tag{93.9}$$

where

$$L = \log\frac{\epsilon\epsilon'+pp'-m^2}{\epsilon\epsilon'-pp'-m^2},\qquad l = \log\frac{\epsilon+p}{\epsilon-p},\qquad l' = \log\frac{\epsilon'+p'}{\epsilon'-p'}.$$

† The integration over the directions of the secondary electron only can also be completed in an analytical form; see R. L. Gluckstern and M. H. Hull, Jr., Physical Review 90, 1030, 1953. Reference may also be made to the review paper by H. W. Koch and J. W. Motz, Reviews of Modern Physics 31, 920, 1959, in which the bremsstrahlung formulae are represented graphically.

The permissible values of the frequency in these formulae are limited only by the condition imposed on the final velocity of the electron ($Ze^2/v' \ll 1$): the electron must not lose almost all its energy. As $\omega \to 0$, the emission cross-section diverges as $d\omega/\omega$; this illustrates a general rule which will be discussed in §98.
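In the non-relativistic limit ($p \ll m$), the photon momentum is small compared with the electron momentum, since $\omega = (p^2-p'^2)/2m \ll p$. Hence $\mathbf q^2 \approx (\mathbf p'-\mathbf p)^2$. Putting in (93.8) $\epsilon = \epsilon' = m$ and neglecting $p$, $p'$ and $\omega$ in comparison with $m$, we find

$$d\sigma = \frac{2}{\pi}Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{p'm^2}{pq^4}\,\sin\theta\,d\theta\,\sin\theta'\,d\theta'\,d\phi\,\left(p^2\sin^2\theta + p'^2\sin^2\theta' - 2pp'\sin\theta\sin\theta'\cos\phi\right),$$

or

$$d\sigma = Z^2\alpha r_e^2\,\frac{m^2}{\pi^2}\,\frac{p'}{p}\,\frac{(\mathbf n\times\mathbf q)^2}{q^4}\,\frac{d\omega}{\omega}\,do_k\,do', \tag{93.10}$$

in accordance with the Born-approximation formula derived in §92, Problem 1. Correspondingly, the spectral distribution of the radiation is given by the formula (92.16) already derived.†

† The derivation of this formula by taking the limit in (93.9) is somewhat laborious, however, because of the cancellation of various terms.

In the ultra-relativistic case, when both the initial and the final energies of the electron are large ($\epsilon, \epsilon' \gg m$), the angular distribution of photons and secondary electrons is very unusual. For small angles $\theta$, $\theta'$, the quantities $\kappa$, $\kappa'$ which appear in the denominators of formula (93.8) are

$$\kappa \approx \tfrac12\epsilon\left(\frac{m^2}{\epsilon^2}+\theta^2\right),\qquad \kappa' \approx \tfrac12\epsilon'\left(\frac{m^2}{\epsilon'^2}+\theta'^2\right), \tag{93.11}$$

and become very small in the range $\theta, \theta' \lesssim m/\epsilon$. In this range the magnitude of the vector $\mathbf q$ is also small ($q \sim m$). Thus, in the ultra-relativistic case, the photon and the secondary electron move forwards in a narrow cone with aperture angle $\sim m/\epsilon$.

A quantitative formula for the angular distribution in the ultra-relativistic case is easily obtained from (93.8) by substituting for $\kappa$, $\kappa'$ from (93.11), replacing $p$, $p'$ in all other places by $\epsilon$, $\epsilon'$, and neglecting $q^2$ in comparison with $\epsilon^2$. With the convenient notation

$$\delta = \epsilon\theta/m,\qquad \delta' = \epsilon'\theta'/m, \tag{93.12}$$

we can put (93.8) in the form

$$d\sigma = \frac{8}{\pi}Z^2\alpha r_e^2\,\frac{\epsilon'm^4}{\epsilon q^4}\,\frac{d\omega}{\omega}\,\delta\,d\delta\,\delta'\,d\delta'\,d\phi\;\times$$
$$\times\left\{\frac{\delta^2}{(1+\delta^2)^2} + \frac{\delta'^2}{(1+\delta'^2)^2} + \frac{\omega^2}{2\epsilon\epsilon'}\,\frac{\delta^2+\delta'^2}{(1+\delta^2)(1+\delta'^2)} - \left(\frac{\epsilon'}{\epsilon}+\frac{\epsilon}{\epsilon'}\right)\frac{\delta\delta'\cos\phi}{(1+\delta^2)(1+\delta'^2)}\right\}. \tag{93.13}$$

Putting $q^2 = (\mathbf n\times\mathbf q)^2 + (\mathbf n\cdot\mathbf q)^2$ ($\mathbf n = \mathbf k/\omega$), we can easily find that for small angles

$$\frac{q^2}{m^2} = \left(\delta^2+\delta'^2-2\delta\delta'\cos\phi\right) + m^2\left(\frac{1+\delta^2}{2\epsilon} - \frac{1+\delta'^2}{2\epsilon'}\right)^2. \tag{93.14}$$

When $\delta \sim \delta' \sim 1$, the second term in (93.14) is small compared with the first.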
The two terms become comparable at even smaller angles, where $\delta \sim m/\epsilon$. Although $q$ here becomes particularly small ($q \sim m^2/\epsilon \ll m$), the integrated contribution from this region to the cross-section is still small compared with that from the whole region $\delta \lesssim 1$ (the ratio of the contributions is easily seen to be $m^2/\epsilon^2$). But $q$ can also reach values $q \sim m^2/\epsilon$ when $\delta \sim \delta' \sim 1$ if

$$|\delta-\delta'| \lesssim m/\epsilon,\qquad \phi \lesssim m/\epsilon. \tag{93.15}$$

The contribution from this region is of the same order as the whole integral cross-section, or may even be the principal term in it (see below).

The integration of (93.13) with respect to $\phi$ and $\delta'$ gives the angular distribution of photons (with given frequency), regardless of the direction of the secondary electron:†

$$d\sigma = 8Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{\epsilon'}{\epsilon}\,\frac{\delta\,d\delta}{(1+\delta^2)^2}\left\{\left[\frac{\epsilon}{\epsilon'}+\frac{\epsilon'}{\epsilon}-\frac{4\delta^2}{(1+\delta^2)^2}\right]\log\frac{2\epsilon\epsilon'}{m\omega} - \frac12\left[\frac{\epsilon}{\epsilon'}+\frac{\epsilon'}{\epsilon}+2-\frac{16\delta^2}{(1+\delta^2)^2}\right]\right\}. \tag{93.16}$$

Integrating with respect to $\delta$, we find the spectral distribution of the radiation in the ultra-relativistic case:

$$d\sigma_\omega = 4Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{\epsilon'}{\epsilon}\left(\frac{\epsilon}{\epsilon'}+\frac{\epsilon'}{\epsilon}-\frac23\right)\left(\log\frac{2\epsilon\epsilon'}{m\omega}-\frac12\right); \tag{93.17}$$

this formula can also, of course, be obtained directly from (93.9).

† The integration over $\phi$ from 0 to $2\pi$ is taken first. That over $\delta'$ is conveniently replaced by integration over the difference $|\Delta| = |\delta'-\delta|$, dividing the range into parts, from 0 to some $\Delta_0$ and from $\Delta_0$ to $\infty$, where $\Delta_0$ satisfies the inequalities $m/\epsilon \ll \Delta_0 \ll 1$. In each region, appropriate approximations are possible in the integrand.

The presence of the logarithm of a large quantity (the ratio $2\epsilon\epsilon'/m\omega \sim \epsilon'/m \gg 1$ even if $\omega \sim \epsilon$) should be noted. If this quantity is so large that its logarithm is also large, the logarithmic terms become the principal ones in these formulae. The logarithm arises from integration in the range (93.15).† Thus, in the logarithmic approximation, i.e.
when the terms not containing a large logarithm are neglected, the secondary electron moves at an angle $\sim (m/\epsilon)^2$ to the direction of incidence.

Finally, we shall give the limiting formula for the region near the hard end of the spectrum, when the ultra-relativistic electron radiates almost all its energy: $\omega \approx \epsilon \gg \epsilon'$. From (93.9) we easily find

$$d\sigma_\omega = 2Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{\epsilon'}{p'}\left\{\frac{2\epsilon'}{p'}\log\frac{\epsilon'+p'}{m} - \frac{m^2}{p'^2}\log^2\frac{\epsilon'+p'}{m} - 1\right\}. \tag{93.18}$$

Formulae (93.17) and (93.18) together cover the whole range of values of $\omega$ for an ultra-relativistic initial electron, and agree for $\omega \approx \epsilon \gg \epsilon' \gg m$. If the secondary electron is non-relativistic ($p' \ll m$), then

$$d\sigma_\omega = 2Z^2\alpha r_e^2\,\frac{\sqrt{2m(\epsilon-\omega-m)}}{m}\,\frac{d\omega}{\omega}, \tag{93.19}$$

where $\epsilon-\omega-m$ is the kinetic energy of the secondary electron.

POLARIZATION EFFECTS

Polarization effects in bremsstrahlung can be studied by the general method described in §65. The choice of the 4-vectors $e^{(1)}$, $e^{(2)}$ is here particularly simple. Since there is only one frame of reference (the rest frame of the nucleus) which is of practical importance, it is sufficient to put $e^{(1)} = (0, \mathbf e^{(1)})$, $e^{(2)} = (0, \mathbf e^{(2)})$, where $\mathbf e^{(1)}$, $\mathbf e^{(2)}$ are unit vectors perpendicular to $\mathbf k$, one lying in the plane of $\mathbf k$ and $\mathbf p$ and the other perpendicular to that plane.

We shall not give here either the fairly lengthy calculations or their quantitative results, but merely note some qualitative properties of the polarization effects.‡ These properties can be derived by means of various symmetry relations, as was done for the Compton effect in §87.

The theory under consideration corresponds to the first non-vanishing approximation of perturbation theory. In this approximation the cross-section cannot contain a term proportional only to the polarization vector $\boldsymbol\zeta$ of the initial electron or $\boldsymbol\zeta'$ of the final electron. The absence of a term $\propto\boldsymbol\zeta$ means that the total emission cross-section (summed over the polarizations of the photon and the secondary electron) is independent of the polarization of the incident electron.
† This is easily seen by considering the range of integration in which $\phi$ and $\Delta = \delta'-\delta$ satisfy the conditions $m/\epsilon \ll \Delta, \phi \ll 1$. In this range, $q^2/m^2 \approx \Delta^2 + \phi^2\delta^2$, and the terms in the braces in (93.13) are proportional to $\phi^2$ or $\Delta^2$ (becoming zero when $\phi = 0$ and $\Delta = 0$). Integrals of the form

$$\int\frac{\phi^2\,d\phi\,d\Delta}{(\Delta^2+\delta^2\phi^2)^2}\qquad\text{or}\qquad\int\frac{\Delta^2\,d\phi\,d\Delta}{(\Delta^2+\delta^2\phi^2)^2}$$

diverge logarithmically; they are "cut off" at the limits of the above-mentioned range of the variables.

‡ For a fuller discussion of these effects, see W. H. McMaster, Reviews of Modern Physics 33, 8, 1961, and V. N. Baĭer, V. M. Katkov, and V. S. Fadin, Radiation from Relativistic Electrons (Izluchenie relyativistskikh élektronov), Atomizdat, Moscow, 1973.

Of the terms proportional to only the photon polarization parameters $\xi_1$, $\xi_2$, $\xi_3$, the term $\propto\xi_2$ is absent. Thus a photon radiated by an unpolarized electron is not circularly polarized. Here, however, there is a difference from the corresponding result for the Compton effect: in the latter case such terms were forbidden by spatial parity because of the impossibility of constructing a pseudoscalar from the only two available independent vectors, $\mathbf k$ and $\mathbf k'$. For bremsstrahlung, there are three independent momenta $\mathbf p$, $\mathbf p'$ and $\mathbf k$, and these suffice to construct the pseudoscalar $\mathbf k\cdot\mathbf p\times\mathbf p'$. A term of the form $\xi_2\,\mathbf k\cdot\mathbf p\times\mathbf p'$ does not violate spatial parity, and therefore, strictly speaking, need not be zero; but it is not invariant under a change of sign of all the momenta (cf. (87.26)), and is consequently absent in the first Born approximation.

The existence of the pseudoscalar $\mathbf k\cdot\mathbf p\times\mathbf p'$ also has the result that, as well as the term proportional to $\xi_3$, a term proportional to $\xi_1$ is also allowed in the cross-section, unlike the case of the Compton effect. This term arises as a product of the form $S_{\alpha\beta}\,\nu_\alpha(\mathbf k\times\boldsymbol\nu)_\beta\;\mathbf k\cdot\mathbf p\times\mathbf p'$ (where $\boldsymbol\nu = \mathbf k\times\mathbf p$), which is invariant both under spatial inversion and under a change of sign of all the momenta.
Thus the emitted photon has linear polarization of both kinds (both along the axes $\mathbf e^{(1)}$ and $\mathbf e^{(2)}$, and in the "diagonal" directions at 45° to these axes). This refers, however, only to conditions where the direction of motion of the secondary electron is also recorded. On integration over all directions of $\mathbf p'$, the term $\propto\xi_1$ in the cross-section vanishes. This is evident from symmetry, since after the integration the two non-coincident "diagonal" directions become equivalent, and there can therefore be no preferential polarization along one of them, such as occurs when $\xi_1 \neq 0$.

The degree of linear polarization is independent of the state of polarization of the incident electron: the correlation terms of the form $\xi_1\boldsymbol\zeta$ and $\xi_3\boldsymbol\zeta$ in the cross-section are forbidden in the first Born approximation. The term $\xi_2\boldsymbol\zeta$, however, is allowed, so that the photon radiated by a polarized electron is circularly polarized (Ya. B. Zel'dovich, 1952).

SCREENING

The formulae derived above are for a purely Coulomb field. If radiation in a collision not with a "bare" nucleus but with an atom is considered, allowance must be made for the screening of the nuclear field by the electrons, which reduces the cross-section. For this purpose we must include the atomic form factor $F(q)$ in the potential $A_0^{(e)}(\mathbf q)$ of the external field; see QM, §139. According to QM (139.2), this is done by writing $Z - F(q)$ instead of $Z$.

We shall show under what conditions screening is important. A given value of $q$ in the form factor corresponds to distances $r \sim 1/q$ in the spatial distribution of the electron charges in the atom. The form factor becomes almost equal to $Z$ (total screening) when $q \ll 1/a$, where $a$ is the dimension of the atom.

In the ultra-relativistic case, as we have seen, an important contribution to the emission cross-section comes from the range of values of $q$ near its minimum possible value for given initial and final energies of the electron.
In the ultra-relativistic case,

$$q_{\min} = p - p' - \omega = \sqrt{\epsilon^2-m^2} - \sqrt{\epsilon'^2-m^2} - (\epsilon-\epsilon') \approx \frac{m^2\omega}{2\epsilon\epsilon'}. \tag{93.20}$$

Screening is important if $q_{\min} \lesssim 1/a$, or

$$\epsilon\epsilon'/m\omega \gtrsim am. \tag{93.21}$$

This condition is always satisfied for sufficiently large energies of the incident electron.

If $q_{\min} \ll 1/a$ ("total screening") we can immediately write down, with logarithmic accuracy, the spectral distribution of the radiation. The argument of the logarithm in (93.17) is just the left-hand side of the inequality $\epsilon\epsilon'/m\omega \gg am$. When the inequality is satisfied, the integral over $q$ which leads to this logarithm is cut off at a quantity of the order of the right-hand side of the inequality. According to the Thomas-Fermi model $a \sim a_0 Z^{-1/3}$, where $a_0 = 1/me^2$ is the Bohr radius (see QM, §70); then $am \sim 1/\alpha Z^{1/3}$. Thus, when there is total screening, the logarithm in (93.17) should be replaced by $\log(1/\alpha Z^{1/3})$.

ENERGY LOSS

The energy lost as radiation by the electron is expressed by the "effective retardation"

$$\kappa_{\rm rad} = \int_0^{\epsilon-m}\omega\,d\sigma_\omega. \tag{93.22}$$

The calculation of the integral, with $d\sigma_\omega$ from (93.17), gives†

$$\kappa_{\rm rad} = Z^2\alpha r_e^2\,\epsilon\left\{\frac{12\epsilon^2+4m^2}{3\epsilon p}\log\frac{\epsilon+p}{m} - \frac{(8\epsilon+6p)m^2}{3\epsilon p^2}\log^2\frac{\epsilon+p}{m} - \frac43 + \frac{2m^2}{\epsilon p}\,F\!\left(\frac{2p(\epsilon+p)}{m^2}\right)\right\}, \tag{93.23}$$

where the function $F(\xi)$ is Spence's function (131.19). In the non-relativistic case, (93.23) becomes

$$\kappa_{\rm rad} = \tfrac{16}{3}Z^2\alpha r_e^2\,m, \tag{93.24}$$

since $F(\xi) \approx \xi$ when $\xi \ll 1$; see (131.23). This formula can, of course, be obtained by direct integration of the non-relativistic Born formula (92.16). In the ultra-relativistic case,

$$\kappa_{\rm rad} = 4Z^2\alpha r_e^2\,\epsilon\left(\log\frac{2\epsilon}{m}-\frac13\right); \tag{93.25}$$

when $\xi \gg 1$, $F(\xi) \approx \tfrac12\log^2\xi$; see (131.20). The two $\log^2$ terms in (93.23) can then be omitted.

† Although formula (93.17) is inapplicable near the upper limit, this fact is unimportant, since the integral converges.

The ratio $\kappa_{\rm rad}/\epsilon$ is also called the cross-section for energy loss by radiation. It increases logarithmically when $\epsilon$ is large.
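The step from the spectrum (93.17) to the energy loss (93.25) can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes the ultra-relativistic limit in which the upper limit $\epsilon - m$ of (93.22) is replaced by $\epsilon$, works with the hypothetical dimensionless variable $x = \omega/\epsilon$ (so that $\epsilon'/\epsilon = 1-x$), and returns $\kappa_{\rm rad}$ in units of $4Z^2\alpha r_e^2\epsilon$. The function names are illustrative only.

```python
import math

def reduced_integrand(x, eps_over_m):
    """omega * d(sigma_omega)/d(omega) from (93.17), in units of 4 Z^2 alpha r_e^2,
    expressed through x = omega/eps, so that eps'/eps = 1 - x."""
    # (eps'/eps) * (eps/eps' + eps'/eps - 2/3) written without division by (1 - x)
    shape = 1.0 + (1.0 - x) ** 2 - (2.0 / 3.0) * (1.0 - x)
    # log(2 eps eps' / (m omega)) - 1/2
    log_term = math.log(2.0 * eps_over_m * (1.0 - x) / x) - 0.5
    return shape * log_term

def kappa_rad_numeric(eps_over_m, n=200_000):
    """Midpoint-rule estimate of kappa_rad / (4 Z^2 alpha r_e^2 eps).
    The midpoint rule never samples x = 0 or x = 1, where the integrand
    has integrable logarithmic singularities."""
    h = 1.0 / n
    return h * sum(reduced_integrand((i + 0.5) * h, eps_over_m) for i in range(n))

def kappa_rad_closed(eps_over_m):
    """Closed form (93.25) in the same units: log(2 eps/m) - 1/3."""
    return math.log(2.0 * eps_over_m) - 1.0 / 3.0

if __name__ == "__main__":
    for ratio in (1e2, 1e3, 1e4):
        print(ratio, kappa_rad_numeric(ratio), kappa_rad_closed(ratio))
```

The two columns printed by the script should agree to within the discretization error of the midpoint rule, which reflects the fact that the weight $1 + (1-x)^2 - \tfrac23(1-x)$ integrates to unity and the remaining terms contribute the constant $-\tfrac13$.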
This increase no longer occurs, however, when screening is taken into account. For total screening, $\kappa_{\rm rad}/\epsilon$ tends to a constant limit $\approx 4Z^2\alpha r_e^2\log(1/\alpha Z^{1/3})$.

For a collision with an atom, it must also be remembered that some radiation originates from the electrons, as well as that from the nucleus. We shall see later (§97) that, in the ultra-relativistic case, the electron-electron emission cross-section differs from the electron-nucleus cross-section only in that the factor $Z^2$ is absent. Hence the presence of $Z$ atomic electrons can be approximately allowed for by replacing $Z^2$ by $Z(Z+1)$.

On passing through a medium containing $N$ atoms per unit volume, a fast electron loses its energy, on average, over a distance

$$l_{\rm rad} \sim \epsilon/N\kappa_{\rm rad} \sim \left[Z^2\alpha N r_e^2\log(1/\alpha Z^{1/3})\right]^{-1}, \tag{93.26}$$

called the radiation length.

COHERENCE LENGTH

Formula (93.20) may be given a different and more general interpretation. For the expressions derived above to be valid, it is necessary that the external field in which the electron moves should vary only slightly (in the direction of motion) over distances

$$l_{\rm coh} \sim 1/q_{\min} \sim \epsilon\epsilon'/m^2\omega\quad(=\epsilon\epsilon'/c^3m^2\omega); \tag{93.27}$$

this is called the coherence length.† The value (93.27) obtained in the Born approximation is actually quite generally valid for ultra-relativistic particles: it is easily derived in the opposite case of quasi-classical motion also, since from (90.22)‡ we see at once that the important times for radiation at small angles to the direction of motion are $\tau \sim \epsilon'/\epsilon\omega(1-v) \sim \epsilon'\epsilon/m^2\omega$, corresponding to a section of the trajectory with length $c\tau \sim l_{\rm coh}$.

† The discussion here is due to M. L. Ter-Mikaélyan (1953).

‡ The derivation of (90.22) was based only on the smallness of the curvature of the trajectory, and in that sense does not depend on the fact that a magnetic field was specifically under consideration in §90.

For a given frequency $\omega$, the coherence length increases with the electron energy.
However, the formulae obtained for bremsstrahlung at an isolated atom can be valid also for passage through a medium only if there is no secondary photon emission or electron scattering over a distance equal to the coherence length. The condition for no secondary photon emission is $l_{\rm coh} \ll l_{\rm rad}$, but the condition for no electron scattering is violated much sooner: there is repeated scattering of the electron by the atomic nuclei in the medium over a distance $\sim l_{\rm rad}$.

To arrive at a quantitative condition, we return to formula (90.22) before the integration with respect to time in the exponent and write this as

$$-i\omega\frac{\epsilon}{\epsilon'}\int_{t_1}^{t_1+\tau}(1-\mathbf n\cdot\mathbf v)\,dt \approx -i\frac{\omega\epsilon}{\epsilon'}\left\{(1-v)\tau + \frac12\int_{t_1}^{t_1+\tau}\theta^2\,dt\right\}, \tag{93.28}$$

where $\theta$ is the small angle between $\mathbf v$ and $\mathbf n$ which results from scattering by nuclei. For Coulomb scattering, $\theta$ changes by small jumps, and its time variation is therefore a slow "diffusion in angle". The mean square deflection of the electron over $t - t_1$ is, in order of magnitude,

$$\overline{\theta^2} \sim (t-t_1)/l_{\rm coul},$$

where $l_{\rm coul}$ is the mean free path for Coulomb collisions, given by

$$1/l_{\rm coul} \sim (NZ^2e^4/\epsilon^2)\log(\chi_{\max}/\chi_{\min}),$$

where $\chi_{\max}$ and $\chi_{\min}$ are the maximum and minimum angles of scattering in one collision for which the process may still be regarded as Rutherford scattering (cf. PK, §41).† The value of $\chi_{\min}$ is determined by the atomic dimensions $a$, over which the field of the nucleus is screened: $\chi_{\min} \sim 1/pa$. Large scattering angles are limited (for an ultra-relativistic electron) by distances of the order of the nuclear radius $R$: $\chi_{\max} \sim 1/pR$. If we put $R \approx 1.5\times10^{-13}Z^{1/3}$ cm $\sim r_e Z^{1/3}$, we find

$$l_{\rm coul} \sim \frac{\epsilon^2}{NZ^2e^4\log(1/\alpha Z^{1/3})} \sim \frac{\alpha\epsilon^2}{m^2}\,l_{\rm rad}. \tag{93.29}$$

The second term in (93.28), which covers a time $\tau \sim l_{\rm coh}$, is now estimated as

$$(\omega\epsilon/\epsilon')\,\overline{\theta^2}\,\tau \sim (\omega\epsilon/\epsilon')\,l_{\rm coh}^2/l_{\rm coul} \sim l_{\rm coh}/\alpha l_{\rm rad}.$$

For the bremsstrahlung formulae to be valid that were derived without allowance for multiple scattering, this term must be much less than unity. Hence we find the condition

$$l_{\rm coh} \ll \alpha l_{\rm rad}, \tag{93.30}$$

which is stronger than $l_{\rm coh} \ll l_{\rm rad}$ (L. D. Landau and I. Ya. Pomeranchuk, 1953).
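To get a feeling for the orders of magnitude in (93.27) and (93.30), the estimate can be evaluated in ordinary units. This is a rough sketch under stated assumptions: the coherence length is taken as exactly $\hbar c\,\epsilon\epsilon'/(mc^2)^2\hbar\omega$ with the numerical factor of order unity dropped, the constants are rounded standard values, the function names are hypothetical, and the radiation length must be supplied by the user (the value used in the demonstration, about 5.6 mm, is only an illustrative figure of the right order for a heavy element such as lead).

```python
HBAR_C_MEV_FM = 197.3269804      # hbar*c in MeV*fm (rounded standard value)
M_E_MEV = 0.51099895             # electron rest energy in MeV
ALPHA = 1.0 / 137.035999         # fine-structure constant

def coherence_length_m(eps_mev, photon_mev):
    """Order-of-magnitude coherence length (93.27) in metres.

    eps_mev    -- total energy of the incident electron, MeV
    photon_mev -- energy of the emitted photon, MeV
    """
    eps_prime = eps_mev - photon_mev          # energy of the secondary electron
    if photon_mev <= 0.0 or eps_prime <= 0.0:
        raise ValueError("need 0 < photon energy < electron energy")
    l_fm = HBAR_C_MEV_FM * eps_mev * eps_prime / (M_E_MEV ** 2 * photon_mev)
    return l_fm * 1e-15                       # fm -> m

def multiple_scattering_matters(eps_mev, photon_mev, l_rad_m):
    """True when the condition (93.30), l_coh << alpha * l_rad, is violated
    in the crude sense l_coh >= alpha * l_rad."""
    return coherence_length_m(eps_mev, photon_mev) >= ALPHA * l_rad_m

if __name__ == "__main__":
    l_rad = 5.6e-3  # metres; illustrative value supplied by the user
    for eps in (1e2, 1e3, 1e4):
        print(eps, coherence_length_m(eps, 1.0),
              multiple_scattering_matters(eps, 1.0, l_rad))
```

Because $l_{\rm coh}$ grows as $\epsilon^2$ for fixed $\omega$ while $\alpha l_{\rm rad}$ is fixed by the medium, the soft end of the spectrum is the first to be affected as the electron energy is raised.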
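† The mean free path is determined by the transport cross-section $\sigma_{\rm tr} = \int(1-\cos\chi)\,d\sigma(\chi)$. For the scattering of ultra-relativistic electrons by a Coulomb centre, the cross-section $d\sigma(\chi)$ is given by (80.10).

§94. Pair production by a photon in the field of a nucleus

The formation of an electron-positron pair in a collision between a photon and a nucleus ($Z + \gamma \to Z + e^- + e^+$) and electron-nucleus bremsstrahlung ($Z + e^- \to Z + e^- + \gamma$) are two cross-channels of the same reaction. Rules have already been formulated in §91 for the transformation of formulae from the latter case to the former. Applying these rules to (93.8), we find the following expression for the differential cross-section for pair production by an unpolarized photon, averaged over the polarizations of the components of the pair:†

$$d\sigma = -\frac{Z^2\alpha r_e^2}{2\pi}\,\frac{m^2p_+p_-\,d\epsilon_+}{\omega^3q^4}\,\sin\theta_+\,d\theta_+\,\sin\theta_-\,d\theta_-\,d\phi\;\times$$
$$\times\left\{\frac{p_+^2}{\kappa_+^2}(4\epsilon_-^2-q^2)\sin^2\theta_+ + \frac{p_-^2}{\kappa_-^2}(4\epsilon_+^2-q^2)\sin^2\theta_- - \frac{2\omega^2}{\kappa_+\kappa_-}(p_+^2\sin^2\theta_+ + p_-^2\sin^2\theta_-) - \frac{2p_+p_-}{\kappa_+\kappa_-}(2\epsilon_+^2+2\epsilon_-^2-q^2)\sin\theta_+\sin\theta_-\cos\phi\right\}, \tag{94.1}$$

$$\kappa_\pm = \epsilon_\pm - p_\pm\cos\theta_\pm,\qquad q^2 = (\mathbf p_+ + \mathbf p_- - \mathbf k)^2,\qquad \epsilon_+ + \epsilon_- = \omega$$

(H. A. Bethe and W. Heitler, 1934). A similar transformation derives from (93.9) the energy distribution of the components of the pair:

$$d\sigma_{\epsilon_+} = Z^2\alpha r_e^2\,\frac{p_+p_-}{\omega^3}\,d\epsilon_+\left\{-\frac43 - 2\epsilon_+\epsilon_-\,\frac{p_+^2+p_-^2}{p_+^2p_-^2} + m^2\left(\frac{l_-\epsilon_+}{p_-^3}+\frac{l_+\epsilon_-}{p_+^3}-\frac{l_+l_-}{p_+p_-}\right) + \right.$$
$$\left. + L\left[-\frac{8\epsilon_+\epsilon_-}{3p_+p_-} + \frac{\omega^2}{p_+^3p_-^3}\left(\epsilon_+^2\epsilon_-^2+p_+^2p_-^2-m^2\epsilon_+\epsilon_-\right) - \frac{m^2\omega}{2p_+p_-}\left(l_+\,\frac{\epsilon_+\epsilon_- - p_+^2}{p_+^3} + l_-\,\frac{\epsilon_+\epsilon_- - p_-^2}{p_-^3}\right)\right]\right\}, \tag{94.2}$$

$$L = \log\frac{\epsilon_+\epsilon_- + p_+p_- + m^2}{\epsilon_+\epsilon_- - p_+p_- + m^2},\qquad l_\pm = \log\frac{\epsilon_\pm+p_\pm}{\epsilon_\pm-p_\pm}.$$

Since the above formulae are based on the Born approximation, they are valid under the conditions $Ze^2/v_\pm \ll 1$. The symmetry of (94.1) and (94.2) with respect to the electron and the positron is itself a consequence of the Born approximation, and would not occur in higher approximations.

In the ultra-relativistic case ($\epsilon_\pm \gg m$), the electron and the positron are emitted at angles $\theta_\pm \sim m/\epsilon_\pm$ relative to the direction of the incident photon. The angular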
† Polarization effects in pair production by a photon are discussed in the papers already quoted in §93 in connection with bremsstrahlung.

The angular distribution is given by a formula similar to (93.13):

$$\begin{aligned}
d\sigma={}&\frac{8}{\pi}Z^2\alpha r_e^2\,\frac{m^4\varepsilon_+\varepsilon_-}{\omega^3q^4}\,d\varepsilon_+\,\delta_+d\delta_+\,\delta_-d\delta_-\,d\phi\;\Big\{-\frac{\delta_+^2}{(1+\delta_+^2)^2}-\frac{\delta_-^2}{(1+\delta_-^2)^2}+\\
&+\frac{\omega^2}{2\varepsilon_+\varepsilon_-}\,\frac{\delta_+^2+\delta_-^2}{(1+\delta_+^2)(1+\delta_-^2)}+\Big(\frac{\varepsilon_+}{\varepsilon_-}+\frac{\varepsilon_-}{\varepsilon_+}\Big)\frac{\delta_+\delta_-\cos\phi}{(1+\delta_+^2)(1+\delta_-^2)}\Big\},
\end{aligned}\tag{94.3}$$

with

$$\frac{q^2}{m^2}=\delta_+^2+\delta_-^2+2\delta_+\delta_-\cos\phi+m^2\Big(\frac{1+\delta_+^2}{2\varepsilon_+}+\frac{1+\delta_-^2}{2\varepsilon_-}\Big)^2.\tag{94.4}$$

The energy distribution in this case is

$$d\sigma=4Z^2\alpha r_e^2\,\frac{d\varepsilon_+}{\omega^3}\Big(\varepsilon_+^2+\varepsilon_-^2+\tfrac23\varepsilon_+\varepsilon_-\Big)\Big(\log\frac{2\varepsilon_+\varepsilon_-}{m\omega}-\frac12\Big)\quad\text{(ultra-relativistic case)}.\tag{94.5}$$

Integration of (94.5) over $\varepsilon_+$ from $m$ to $\omega$ gives the total cross-section for pair production by a photon having a given energy:‡

$$\sigma=\frac{28}{9}Z^2\alpha r_e^2\Big(\log\frac{2\omega}{m}-\frac{109}{42}\Big),\qquad\omega\gg m.\tag{94.6}$$

‡ Since the integral converges at both limits, the inapplicability of formula (94.5) for small values of $\varepsilon_\pm-m$ is not important.

As with bremsstrahlung, the logarithmic term in the ultra-relativistic cross-section arises from the range of values $q\sim m^2/\varepsilon$. This now corresponds to angles for which $|\delta_+-\delta_-|\lesssim m/\varepsilon$, $|\pi-\phi|\lesssim m/\varepsilon$, instead of $\phi\lesssim m/\varepsilon$ as in (93.15). Thus, in the logarithmic approximation, the directions of the electron and the positron are at small angles to the direction of the photon and are almost coplanar with the direction of the photon but on opposite sides of it.

Near the reaction threshold ($\omega\to2m$), the Born approximation is invalid. The derivation of a quantitative formula in this case would require an exact calculation of the Coulomb interaction of the three charged particles (the nucleus and the pair) in the final state. The symmetry with respect to the electron (which is attracted to the nucleus) and the positron (which is repelled from the nucleus) is then, of course, lost. If

$$Z\alpha\ll\sqrt{\frac{\omega-2m}{\omega}}\ll1,\tag{94.7}$$

the Born approximation is still valid. At non-relativistic energies of the pair, $\omega\approx2m\gg p_\pm$, and therefore $q\approx\omega$.
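The step from the energy distribution (94.5) to the total cross-section (94.6) can be checked numerically. The sketch below is an illustration only: it works in units of $Z^2\alpha r_e^2$, uses the variable $x=\varepsilon_+/\omega$, and integrates over the whole range $0<x<1$, relying on the remark that the integral converges at both limits. The function names are hypothetical.

```python
import math

def bh_energy_distribution(x, omega_over_m):
    """Integrand of (94.5) per unit x = eps_plus/omega, in units of Z^2 alpha r_e^2."""
    poly = x**2 + (1.0 - x)**2 + (2.0 / 3.0) * x * (1.0 - x)
    log_term = math.log(2.0 * x * (1.0 - x) * omega_over_m) - 0.5
    return 4.0 * poly * log_term

def bh_total_numeric(omega_over_m, n=100000):
    """Midpoint-rule integral of (94.5) over 0 < x < 1.

    The logarithmic endpoint singularities are integrable, so a plain
    midpoint rule is adequate for an illustration.
    """
    h = 1.0 / n
    return h * sum(bh_energy_distribution((i + 0.5) * h, omega_over_m)
                   for i in range(n))

def bh_total_closed(omega_over_m):
    """Closed form (94.6), in units of Z^2 alpha r_e^2."""
    return (28.0 / 9.0) * (math.log(2.0 * omega_over_m) - 109.0 / 42.0)

if __name__ == "__main__":
    w = 1000.0  # photon energy in units of m (assumed example value)
    print(bh_total_numeric(w), bh_total_closed(w))
```

The two numbers agree because $\int_0^1(x^2+(1-x)^2+\tfrac23x(1-x))\,dx=\tfrac79$ and $\int_0^1(\ldots)\log[x(1-x)]\,dx=-\tfrac{44}{27}$, which together reproduce the constants $28/9$ and $109/42$.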
In (94.1) we can everywhere put $\varepsilon_\pm=\kappa_\pm=m$, $\omega=2m$, and this formula then reduces to

$$d\sigma=\frac{Z^2\alpha r_e^2}{(4\pi)^2}\,\frac{p_+p_-}{4m^5}\,(p_+^2\sin^2\theta_++p_-^2\sin^2\theta_-)\,do_+\,do_-\,d\varepsilon_+.\tag{94.8}$$

After integration over angles,

$$d\sigma=\frac{2Z^2\alpha r_e^2}{3m^3}\,(\omega-2m)\sqrt{(\varepsilon_+-m)(\varepsilon_--m)}\,d\varepsilon_+.\tag{94.9}$$

Finally, integration over $\varepsilon_+$ from $m$ to $\omega-m$ gives the total cross-section

$$\sigma=\frac{\pi}{12}Z^2\alpha r_e^2\Big(\frac{\omega-2m}{m}\Big)^3.\tag{94.10}$$

If the relative velocity ($v_0$) of the components of the pair formed is small, their Coulomb interaction must be taken into account (A. D. Sakharov, 1948). This interaction becomes important when $v_0$ is of the order of (or less than) the velocities of the particles in the bound state of the electron and positron (positronium):

$$v_0\lesssim\alpha.\tag{94.11}$$

Let us consider the process in the centre-of-mass system of the pair. Virtual momenta $\sim m$ are important in the diagrams which represent the process in this system; that is, distances $\sim1/m$ between the electron and the positron are important. The wave function $\psi(\mathbf r)$ of their relative motion changes appreciably only over distances $r\sim1/mv_0\sim1/m\alpha$, which are large compared with $1/m$. The allowance for the interaction of the particles therefore amounts to the inclusion of a factor $\psi^*(0)$ in the transition matrix element. The differential cross-section is accordingly multiplied by $|\psi(0)|^2$, i.e. by

$$|\psi(0)|^2=\frac{2\pi\alpha/v_0}{1-e^{-2\pi\alpha/v_0}};\tag{94.12}$$

see QM, (136.11). The relative velocity of the two particles is the velocity of one particle in the rest frame of the other. Comparing the values of the invariant $p_+p_-$ (the product of the 4-momenta) here and in the laboratory system (the rest frame of the nucleus), we find

$$\frac{m^2}{\sqrt{1-v_0^2}}=\varepsilon_+\varepsilon_--\mathbf p_+\cdot\mathbf p_-,$$

whence $v_0$ may be found. If $\mathbf p_+$ and $\mathbf p_-$ are similar in magnitude and direction, $v_0$ is given by the approximate formula

$$v_0^2=\frac{p^2\vartheta^2}{m^2}+\frac{(p_+-p_-)^2}{\varepsilon^2},\tag{94.13}$$

valid for $v_0\ll1$; here $p=\tfrac12(p_++p_-)$, $\varepsilon=\tfrac12(\varepsilon_++\varepsilon_-)$, and $\vartheta$ is the angle between $\mathbf p_+$ and $\mathbf p_-$.
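The correction factor (94.12) and the relative velocity entering it are easy to evaluate. The following is a minimal sketch, not part of the original derivation: it computes $v_0$ exactly from the invariant $\varepsilon_+\varepsilon_--\mathbf p_+\cdot\mathbf p_-=m^2/\sqrt{1-v_0^2}$, compares it with the approximate formula (94.13), and evaluates the factor $|\psi(0)|^2$. Units with $m=1$ by default, the numerical value of $\alpha$, and all function names are assumptions made for the illustration.

```python
import math

ALPHA = 1.0 / 137.035999  # fine-structure constant (assumed numerical value)

def relative_velocity_exact(p_plus, p_minus, angle, m=1.0):
    """v0 from the invariant eps+ eps- - p+.p- = m^2 / sqrt(1 - v0^2)."""
    e_plus = math.hypot(p_plus, m)
    e_minus = math.hypot(p_minus, m)
    inv = (e_plus * e_minus - p_plus * p_minus * math.cos(angle)) / (m * m)
    return math.sqrt(1.0 - 1.0 / (inv * inv))

def relative_velocity_approx(p_plus, p_minus, angle, m=1.0):
    """Approximate formula (94.13), valid for v0 << 1."""
    p = 0.5 * (p_plus + p_minus)
    eps = 0.5 * (math.hypot(p_plus, m) + math.hypot(p_minus, m))
    return math.sqrt((p * angle / m) ** 2 + ((p_plus - p_minus) / eps) ** 2)

def sakharov_factor(v0, alpha=ALPHA):
    """|psi(0)|^2 of (94.12): x / (1 - exp(-x)) with x = 2 pi alpha / v0."""
    x = 2.0 * math.pi * alpha / v0
    return x / (-math.expm1(-x))

if __name__ == "__main__":
    v0 = relative_velocity_exact(3.00, 3.02, 0.01)
    print(v0, relative_velocity_approx(3.00, 3.02, 0.01), sakharov_factor(v0))
```

For $v_0\gg\alpha$ the factor tends to unity, and for $v_0\ll\alpha$ it grows as $2\pi\alpha/v_0$, which is the origin of the narrow maximum at nearly equal momenta.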
The correction to the cross-section according to (94.12) and (94.13) causes an anomaly in the correlation between the momenta of the electron and positron formed: it has a narrow maximum at $\mathbf p_+\approx\mathbf p_-$.

§95. Exact theory of pair production in the ultra-relativistic case

In §§93 and 94 we have discussed bremsstrahlung and pair production by a photon in the relativistic case, using the Born approximation, for which the condition $Z\alpha\ll1$ must always be satisfied. In §§95 and 96 we shall describe a theory of these processes which is not subject to the limitation just mentioned, i.e. is valid even if $Z\alpha\sim1$ (H. A. Bethe and L. C. Maximon, 1954). We shall assume that both the particles (the initial and final electrons, or the constituents of the pair) are ultra-relativistic, with energy $\varepsilon\gg m$.

We have seen that in the ultra-relativistic case both particles move at small angles ($\theta$, $\theta'$ or $\theta_+$, $\theta_-$) to the direction of the photon: $\theta\lesssim m/\varepsilon$. This property is preserved in the exact (with respect to $Z\alpha$) theory, and we shall therefore consider just this range of angles. The momentum transfer to the nucleus in this range is $q\sim m$. This means that in the wave functions the important values of the impact parameter are $\rho\sim1/q\sim1/m$, i.e. "large" distances. At such distances the wave function derived in §39 can be used.

The calculations for pair production are as follows. The pair production cross-section is similar in form to the photoelectric effect cross-section (cf. (56.1), (56.2)):

$$d\sigma=2\pi\Big|e\sqrt{4\pi}\,\frac{1}{\sqrt{2\omega}}\,M_{fi}\Big|^2\,\delta(\omega-\varepsilon_+-\varepsilon_-)\,\frac{d^3p_+\,d^3p_-}{(2\pi)^6},\tag{95.1}$$

where

$$M_{fi}=\int\psi^{(-)*}_{\varepsilon_-\mathbf p_-}\,(\boldsymbol\alpha\cdot\mathbf e)\,e^{i\mathbf k\cdot\mathbf r}\,\psi^{(+)}_{-\varepsilon_+,-\mathbf p_+}\,d^3x.\tag{95.2}$$

Here $\psi_{\varepsilon_-\mathbf p_-}$ is the wave function of the electron, and $\psi_{-\varepsilon_+,-\mathbf p_+}$ the wave function with negative energy $-\varepsilon_+$ and momentum $-\mathbf p_+$. The function $\psi^{(-)}_{\varepsilon_-\mathbf p_-}$, which pertains to a particle in the final state, must have an asymptotic form which includes (besides the plane wave) an ingoing spherical wave; this is indicated by the superscript $(-)$.
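According to (39.10), this wave function is†

$$\psi^{(-)}_{\varepsilon_-\mathbf p_-}=\frac{C^{(-)}}{\sqrt{2\varepsilon_-}}\,e^{i\mathbf p_-\cdot\mathbf r}\Big(1-\frac{i\boldsymbol\alpha\cdot\nabla}{2\varepsilon_-}\Big)F(-i\nu,1,-i(p_-r+\mathbf p_-\cdot\mathbf r))\,u(p_-),\tag{95.3}$$

$$C^{(-)}=e^{\pi\nu/2}\,\Gamma(1+i\nu),\qquad\nu=Z\alpha.$$

† In this section, $p_\pm=|\mathbf p_\pm|$, $q=|\mathbf q|$.

The function $\psi^{(+)}_{-\varepsilon_+,-\mathbf p_+}$ must have an asymptotic form which includes an outgoing spherical wave (indicated by the superscript $(+)$), since it denotes the wave function of an "initial state with negative energy". The asymptotic form of the wave function of the positron, obtained from $\psi^{(+)}_{-\varepsilon_+,-\mathbf p_+}$, then has an ingoing wave, as is correct for a final particle. According to (39.11), this function is

$$\psi^{(+)}_{-\varepsilon_+,-\mathbf p_+}=\frac{C^{(+)}}{\sqrt{2\varepsilon_+}}\,e^{-i\mathbf p_+\cdot\mathbf r}\Big(1+\frac{i\boldsymbol\alpha\cdot\nabla}{2\varepsilon_+}\Big)F(-i\nu,1,i(p_+r+\mathbf p_+\cdot\mathbf r))\,u(-p_+),\tag{95.4}$$

$$C^{(+)}=e^{-\pi\nu/2}\,\Gamma(1+i\nu).$$

The terms $\sim1/\varepsilon$ in (95.3) and (95.4) have to be included because of the matrix structure of $M_{fi}$ (95.2). The matrix element $(\boldsymbol\alpha)_{fi}$ is a vector whose direction is close to that of $\mathbf k$. The leading term in $(\boldsymbol\alpha\cdot\mathbf e)_{fi}$ is therefore small, and the correction terms are of the same order of magnitude as that term.

Substituting (95.3) and (95.4) in (95.2) and neglecting terms $\sim1/\varepsilon_+\varepsilon_-$, we find

$$M_{fi}=\frac{N}{2\sqrt{\varepsilon_+\varepsilon_-}}\,u^*(p_-)\{(\boldsymbol\alpha\cdot\mathbf e)I+(\boldsymbol\alpha\cdot\mathbf e)(\boldsymbol\alpha\cdot\mathbf I_+)+(\boldsymbol\alpha\cdot\mathbf I_-)(\boldsymbol\alpha\cdot\mathbf e)\}u(-p_+),\tag{95.5}$$

where

$$N=C^{(-)*}C^{(+)}=\frac{\pi\nu}{\sinh\pi\nu},\tag{95.6}$$

$$I=\int e^{-i\mathbf q\cdot\mathbf r}F_-^*F_+\,d^3x,\qquad\mathbf I_+=\frac{i}{2\varepsilon_+}\int e^{-i\mathbf q\cdot\mathbf r}F_-^*\nabla F_+\,d^3x,\qquad\mathbf I_-=\frac{i}{2\varepsilon_-}\int e^{-i\mathbf q\cdot\mathbf r}(\nabla F_-)^*F_+\,d^3x,\tag{95.7}$$

$$\mathbf q=\mathbf p_++\mathbf p_--\mathbf k;$$

$F_-$ and $F_+$ are used for brevity to denote the hypergeometric functions which appear in (95.3) and (95.4). The integrals $I$, $\mathbf I_+$, $\mathbf I_-$ satisfy one identical relation: from

$$\int\nabla\big(e^{-i\mathbf q\cdot\mathbf r}F_-^*F_+\big)\,d^3x=0,$$

we have

$$\mathbf qI+2\varepsilon_+\mathbf I_++2\varepsilon_-\mathbf I_-=0.\tag{95.8}$$

We average $|M_{fi}|^2$ over polarizations of the incident photon, and sum over directions of the electron and positron spins.‡ This is done by the tensor substitution

$$e_ie_k^*\to\tfrac12(\delta_{ik}-n_in_k),\qquad\mathbf n=\mathbf k/\omega,$$

and changing the bispinor products according to

$$u_\pm\bar u_\pm\to2\rho_\pm=\varepsilon_\pm\gamma^0-\mathbf p_\pm\cdot\boldsymbol\gamma\mp m.$$

Putting also $\boldsymbol\alpha=\gamma^0\boldsymbol\gamma$, we find

$$|M_{fi}|^2\to\frac{N^2}{2\varepsilon_+\varepsilon_-}\{\mathrm{tr}\,\rho_-\mathbf Q\cdot\rho_+\bar{\mathbf Q}-\mathrm{tr}\,\rho_-(\mathbf n\cdot\mathbf Q)\rho_+(\mathbf n\cdot\bar{\mathbf Q})\},$$

$$\mathbf Q=\boldsymbol\gamma I-\gamma^0\boldsymbol\gamma(\boldsymbol\gamma\cdot\mathbf I_+)-\gamma^0(\boldsymbol\gamma\cdot\mathbf I_-)\boldsymbol\gamma,\qquad\bar{\mathbf Q}=\boldsymbol\gamma I^*-\gamma^0(\boldsymbol\gamma\cdot\mathbf I_+^*)\boldsymbol\gamma-\gamma^0\boldsymbol\gamma(\boldsymbol\gamma\cdot\mathbf I_-^*).$$

‡ Calculations with allowance for the polarizations of all the particles are given by H. Olsen and L. C. Maximon, Physical Review 114, 887, 1959, and in the book by Baier et al. cited in §93.

The final result, obtained after making the appropriate approximations, for the ultra-relativistic case at small angles

$$\theta_\pm\sim m/\varepsilon\ll1,\tag{95.9}$$

will be given here. We define the auxiliary vectors

$$\boldsymbol\delta_\pm=\frac1m(\mathbf p_\pm)_\perp,\tag{95.10}$$

where the suffix $\perp$ denotes the component perpendicular to the direction of $\mathbf k$. Then

$$|M_{fi}|^2\to\tfrac12N^2\Big\{\frac{m^2\omega^2}{\varepsilon_+^2\varepsilon_-^2}\,|I|^2+\frac{4(\varepsilon_+^2+\varepsilon_-^2)}{\varepsilon_-^2}\,\Big|\frac{m}{2\varepsilon_+}\boldsymbol\delta_+I+\mathbf I_+\Big|^2\Big\},\tag{95.11}$$

where we have used the fact that $I\sim\varepsilon I_\pm/q\sim\varepsilon I_\pm/m$ (as is seen from (95.8)), and terms of higher order in $m/\varepsilon$ are omitted. The integrals $\mathbf I_\pm$ may be expressed as

$$\mathbf I_\pm=\frac{ip_\pm}{2\varepsilon_\pm}\,\frac{\partial J}{\partial\mathbf p_\pm},\qquad J=\int\frac{e^{-i\mathbf q\cdot\mathbf r}}{r}\,F(-i\nu,1,i(p_+r+\mathbf p_+\cdot\mathbf r))\,F(i\nu,1,i(p_-r+\mathbf p_-\cdot\mathbf r))\,d^3x.\tag{95.12}$$

The integral $J$ can be written in terms of the complete hypergeometric function:§

$$J=\frac{4\pi}{q^2}\Big(\frac{q^2-2\mathbf p_+\cdot\mathbf q}{q^2-2\mathbf p_-\cdot\mathbf q}\Big)^{i\nu}F(-i\nu,i\nu,1,z),\tag{95.13}$$

$$z=2\,\frac{q^2(p_+p_--\mathbf p_+\cdot\mathbf p_-)+2(\mathbf p_+\cdot\mathbf q)(\mathbf p_-\cdot\mathbf q)}{(q^2-2\mathbf p_+\cdot\mathbf q)(q^2-2\mathbf p_-\cdot\mathbf q)}.$$

The differentiation with respect to $\mathbf p_\pm$ must be carried out with $\mathbf q$ fixed, only thereafter putting $\mathbf q=\mathbf p_++\mathbf p_--\mathbf k$. The result, after making the approximations corresponding to the ultra-relativistic case and the conditions (95.9), is (for the components perpendicular to $\mathbf k$)

$$\mathbf I_\pm=\frac{4\pi\varepsilon_\mp}{\omega mq^2}\Big(\frac{\varepsilon_+\xi_+}{\varepsilon_-\xi_-}\Big)^{i\nu}\Big\{\pm\nu\xi_\mp F(z)\,(\boldsymbol\delta_++\boldsymbol\delta_-)-i(1-z)F'(z)\big[\xi_\mp(\boldsymbol\delta_++\boldsymbol\delta_-)-\boldsymbol\delta_\pm\big]\Big\},\tag{95.14}$$

with, for brevity, the notation

$$\xi_\pm=\frac{1}{1+\delta_\pm^2},\qquad F(z)=F(-i\nu,i\nu,1,z),\tag{95.15}$$

$F(z)$ being a real function. The integral $I$ is then found immediately from (95.8).

Substituting the values of the integrals in (95.11) and thence in (95.1), we find the required cross-section:

$$\begin{aligned}
d\sigma={}&\frac4\pi\Big(\frac{\pi\nu}{\sinh\pi\nu}\Big)^2Z^2\alpha r_e^2\,\frac{m^4}{\omega^3q^4}\,\delta_+d\delta_+\,\delta_-d\delta_-\,d\phi\,d\varepsilon_+\;\times\\
&\times\Big\{F^2(z)\big[-2\varepsilon_+\varepsilon_-(\delta_+^2\xi_+^2+\delta_-^2\xi_-^2)+\omega^2(\delta_+^2+\delta_-^2)\xi_+\xi_-+2(\varepsilon_+^2+\varepsilon_-^2)\delta_+\delta_-\xi_+\xi_-\cos\phi\big]+\\
&+\frac{(1-z)^2}{\nu^2}F'^2(z)\big[-2\varepsilon_+\varepsilon_-(\delta_+^2\xi_+^2+\delta_-^2\xi_-^2)+\omega^2(1+\delta_+^2\delta_-^2)\xi_+\xi_--2(\varepsilon_+^2+\varepsilon_-^2)\delta_+\delta_-\xi_+\xi_-\cos\phi\big]\Big\}.
\end{aligned}\tag{95.16}$$

When $\nu\to0$,

$$\frac{\pi\nu}{\sinh\pi\nu}\to1,\qquad F(z)\to1,\qquad F'(z)\propto\nu^2\to0.$$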
The expression (95.16) then reduces, as it should, to Bethe and Heitler's formula (94.3), which corresponds to the Born approximation. It also reduces to this formula for any $\nu$ if the angles of emission of the pair satisfy the conditions

$$|\delta_+-\delta_-|\ll1,\qquad|\pi-\phi|\ll1.$$

§ The calculations are given in Nordsieck's paper quoted in §92.

For then $q\ll m$, so that the second term in the braces in (95.16) can be omitted because of the extra factor $(q/m)^4$ as compared with the first term, and in the first term we have (since $1-z\sim q^2/m^2\ll1$)†

$$F(z)\to F(1)\equiv F(-i\nu,i\nu,1,1)=\frac{1}{\Gamma(1-i\nu)\Gamma(1+i\nu)}=\frac{\sinh\pi\nu}{\pi\nu},\tag{95.17}$$

so that the similar factor in front of the braces is cancelled.

† This value of the function can be obtained from QM, (e.7), which relates hypergeometric functions with arguments $z$ and $1-z$.

Let us now consider the integration of the cross-section over the directions of emission of the pair. The integration over angles is divided into two regions I and II, in which we have respectively

$$\text{(I)}\quad1-z>1-z_1,\qquad\text{(II)}\quad1-z<1-z_1,$$

where $z_1$ is a certain value such that $1\gg1-z_1\gg(m/\varepsilon)^2$. Since in region II $1-z\ll1$, $q^2\ll m^2$, it follows from the above discussion that in this region $d\sigma\approx d\sigma_B\equiv d\sigma_{\nu\to0}$, where $d\sigma_B$ is the cross-section in the Born approximation. The integral over angles is therefore

$$d\sigma_{\varepsilon_+}=\int d\sigma=\int_{\rm I}d\sigma+\int_{\rm II}d\sigma_{\nu\to0}=(d\sigma_{\varepsilon_+})_B+\int_{\rm I}(d\sigma-d\sigma_{\nu\to0}),\tag{95.18}$$

where $(d\sigma_{\varepsilon_+})_B$ is the Born cross-section (94.5) integrated over angles.

In region I we have

$$q^2/m^2\approx\delta_+^2+\delta_-^2+2\delta_+\delta_-\cos\phi.$$

We shall change from the variables $\delta_+$, $\delta_-$, $\phi$ to $\xi_+$, $\xi_-$, $z$. A direct calculation of the Jacobian for this transformation gives

$$\delta_+d\delta_+\,\delta_-d\delta_-\,d\phi=\frac{d\xi_+\,d\xi_-\,dz}{8\delta_+\delta_-(\xi_+\xi_-)^3\sin\phi},$$

where

$$1-z=(q^2/m^2)\xi_+\xi_-=\xi_++\xi_--2\xi_+\xi_-+2\sqrt{\xi_+\xi_-(1-\xi_+)(1-\xi_-)}\,\cos\phi.$$
Expressing $\sin\phi$ and $\cos\phi$ in terms of the other quantities by means of this equation and substituting in (95.16), we obtain after some simple algebra

$$\begin{aligned}
d\sigma={}&A\,d\varepsilon_+\,\frac{2\,d\xi_+\,d\xi_-\,dz}{\sqrt{z(1-z)-(1-z)(\xi_++\xi_--1)^2-z(\xi_+-\xi_-)^2}}\;\times\\
&\times\Big\{\frac{F^2(z)}{(1-z)^2}\big[(\varepsilon_+^2+\varepsilon_-^2)(1-z)+2\varepsilon_+\varepsilon_-(\xi_+-\xi_-)^2\big]+\frac{F'^2(z)}{\nu^2}\big[(\varepsilon_+^2+\varepsilon_-^2)z+2\varepsilon_+\varepsilon_-(\xi_++\xi_--1)^2\big]\Big\},
\end{aligned}$$

$$A=\Big(\frac{\pi\nu}{\sinh\pi\nu}\Big)^2\frac{Z^2\alpha r_e^2}{2\pi\omega^3}.$$

Finally, we replace $\xi_+$ and $\xi_-$ by new "spherical" variables $\chi$ and $\psi$:

$$\xi_++\xi_--1=\sqrt z\,\sin\chi\cos\psi,\qquad\xi_+-\xi_-=\sqrt{1-z}\,\sin\chi\sin\psi,$$

$$0\le\chi\le\tfrac12\pi,\qquad0\le\psi\le2\pi,\qquad2\,d\xi_+\,d\xi_-\to\sqrt{z(1-z)}\,\sin\chi\cos\chi\,d\chi\,d\psi.$$

These ranges of variation of $\chi$ and $\psi$ correspond to the range 0 to 1 for $\xi_+$ and $\xi_-$, i.e. to the range 0 to $\infty$ for $\delta_+$ and $\delta_-$ (or, equivalently, $\theta_+$ and $\theta_-$); the rapid convergence of the integral allows the range of variation of the angles to be extended in this way. After the transformation, the root in the denominator becomes $\sqrt{z(1-z)}\cos\chi$; the integration over $\chi$ and $\psi$ is elementary, and the result is

$$d\sigma=2A\cdot2\pi\,dz\,\big(\varepsilon_+^2+\varepsilon_-^2+\tfrac23\varepsilon_+\varepsilon_-\big)\Big[\frac{F^2(z)}{1-z}+\frac{z}{\nu^2}F'^2(z)\Big]d\varepsilon_+.$$

[FIG. 18. Plot against $\nu$ (horizontal scale 0.1 to 0.4; vertical scale 1.06 to 1.20).]

An extra factor 2 has been included because the integration over $z$ is to be taken from 0 to $z_1$, whereas, when the azimuth $\phi$ varies from 0 to $\pi$ and from $\pi$ to $2\pi$, each value of $z$ occurs twice.

The integration over $z$ is effected by means of formula (92.14), which, for $\nu'=-\nu$ (and $F(z)$ accordingly real), becomes

$$\frac{F^2}{1-z}+\frac{z}{\nu^2}F'^2=\frac{1}{\nu^2}\frac{d}{dz}(zFF').$$

The integral of this expression is $z_1F(z_1)F'(z_1)/\nu^2$. The value of $z_1F(z_1)\approx F(1)$ is taken from (95.17), and the limit of $F'(z_1\to1)$ is given by†

$$\frac{1}{\nu^2}F'(z)=F(1-i\nu,1+i\nu,2,z)\approx-[\log(1-z)+2f(\nu)]\,\frac{\sinh\pi\nu}{\pi\nu},$$

where

$$f(\nu)=\tfrac12[\Psi(1+i\nu)+\Psi(1-i\nu)-2\Psi(1)]=\nu^2\sum_{n=1}^{\infty}\frac{1}{n(n^2+\nu^2)},\tag{95.19}$$

$$\Psi(z)=\Gamma'(z)/\Gamma(z).$$

† The derivation of this formula is in the Appendix to the paper by H. Davies, H. A. Bethe and L. C. Maximon, Physical Review 93, 788, 1954.

Substituting the above expressions in (95.18), we obtain as the final formula

$$d\sigma_{\varepsilon_+}=4Z^2\alpha r_e^2\big(\varepsilon_+^2+\varepsilon_-^2+\tfrac23\varepsilon_+\varepsilon_-\big)\Big[\log\frac{2\varepsilon_+\varepsilon_-}{m\omega}-\frac12-f(\alpha Z)\Big]\frac{d\varepsilon_+}{\omega^3}.\tag{95.20}$$

The total cross-section for pair production by a photon with energy $\omega$ is

$$\sigma=\frac{28}{9}Z^2\alpha r_e^2\Big[\log\frac{2\omega}{m}-\frac{109}{42}-f(\alpha Z)\Big].\tag{95.21}$$

We see that the only change in these formulae is that a universal function $f(\alpha Z)$ of the atomic number is subtracted from the logarithm. Figure 18 shows a graph of this function. For $\nu\ll1$, $f(\nu)\approx1.2\nu^2$.

§96. Exact theory of bremsstrahlung in the ultra-relativistic case

The matrix element for the bremsstrahlung process is

$$M_{fi}=\int\psi^{(-)*}_{\varepsilon'\mathbf p'}\,(\boldsymbol\alpha\cdot\mathbf e^*)\,e^{-i\mathbf k\cdot\mathbf r}\,\psi^{(+)}_{\varepsilon\mathbf p}\,d^3x;\tag{96.1}$$

the wave functions of the initial electron ($\varepsilon$, $\mathbf p$) and the final electron ($\varepsilon'$, $\mathbf p'$) include respectively outgoing and ingoing spherical waves in their asymptotic forms. The calculation of this integral is similar to that of the matrix element (95.2), but will not be given here. Instead, we shall describe another way of calculating the bremsstrahlung cross-section, based on the fact that the process is quasi-classical, and not using the explicit form of the wave functions of the electron in the field of the nucleus (and in this sense independent of the precise form of the field potential) (V. N. Baier and V. M. Katkov, 1968).

In the bremsstrahlung process, the nucleus transfers to the electron and the photon a momentum $\mathbf q=\mathbf p'+\mathbf k-\mathbf p$. As in the pair production problem, we must distinguish two ranges of values of the transfer $q_\perp$ which is transverse relative to $\mathbf p$:

$$\text{(I)}\quad m\gtrsim q_\perp\gg\omega m^2/\varepsilon^2,\qquad\text{(II)}\quad q_\perp\sim\omega m^2/\varepsilon^2\ll m.\tag{96.2}$$

It is evident that in region I the emission cross-section is equal to the Born value: for these values of $q_\perp$, the recoil momentum of the nucleus is unimportant, as will be shown in §98 (see the derivation of the condition (98.10)).
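The Coulomb correction function $f(\nu)$ of (95.19) is straightforward to evaluate from its series. The sketch below is an illustration under stated assumptions: it sums a finite number of terms (the neglected tail is of order $\nu^2/2N^2$), uses an assumed numerical value of $\alpha$, and expresses the total cross-section (95.21) in units of $Z^2\alpha r_e^2$. The function names are hypothetical.

```python
import math

ALPHA = 1.0 / 137.035999  # fine-structure constant (assumed numerical value)

def f_coulomb(nu, terms=20000):
    """Series form of (95.19): f(nu) = nu^2 * sum_{n>=1} 1 / (n (n^2 + nu^2))."""
    return nu * nu * sum(1.0 / (n * (n * n + nu * nu))
                         for n in range(1, terms + 1))

def pair_total(omega_over_m, Z):
    """Total cross-section (95.21) in units of Z^2 alpha r_e^2 (needs omega >> m)."""
    nu = ALPHA * Z
    return (28.0 / 9.0) * (math.log(2.0 * omega_over_m)
                           - 109.0 / 42.0 - f_coulomb(nu))

if __name__ == "__main__":
    for nu in (0.1, 0.2, 0.3, 0.4):
        # The ratio f/nu^2 starts near 1.20 at small nu and decreases slowly.
        print(nu, f_coulomb(nu), f_coulomb(nu) / nu**2)
```

For small $\nu$ the ratio $f(\nu)/\nu^2$ tends to $\sum n^{-3}\approx1.202$, in line with the statement $f(\nu)\approx1.2\nu^2$; since $f>0$, the exact cross-section lies below the Born value.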
In region I, the cross-section for the process is therefore the product of the exact cross-section for electron scattering in the field of the nucleus at rest and an emission probability which is independent of the form of the field. But since, according to (80.10), the exact cross-section for scattering at small angles in a Coulomb field is equal to the Born value, so is the cross-section for the whole process in region I.

Thus only region II need be considered. Small momentum transfers correspond to the passage of the electron at large distances from the nucleus: $\rho\sim1/q_\perp\sim\varepsilon/m^2$. But, at these distances, the motion of the electron is certainly quasi-classical, as is easily seen by direct application of the usual quasi-classical condition, QM (46.7), to the ultra-relativistic equation (39.5).

Since the motion is quasi-classical, we can use the method already applied in §90 for synchrotron radiation. The expression (90.7) is in this case the probability of emission when the electron passes the nucleus once. Formula (90.18) remains valid for the function $L$ used in §90; the only difference is in the form of the quasi-classical electron path $\mathbf r=\mathbf r(t)$, which is used to calculate the difference $\mathbf r_2-\mathbf r_1$.

At large impact parameters, the field of the nucleus may be regarded as weak. In the zero-order approximation, the path is a straight line passing at a distance $\rho$ from the centre. In the next approximation, we have as the equation of motion (cf. Mechanics, §20)

$$\frac{d\mathbf p}{dt}=-\frac{\boldsymbol\rho}{r}\,\frac{dU}{dr},$$

where $\boldsymbol\rho$ is a vector lying in the $xy$-plane and perpendicular to the initial momentum of the electron, and $r$ on the right-hand side is to be taken as the zero-order function: $r\approx\sqrt{\rho^2+v^2t^2}\approx\sqrt{\rho^2+t^2}$. Hence

$$\mathbf p(t)=\mathbf p_1-\boldsymbol\rho\int_{t_1}^{t}\frac{dU}{dr}\,\frac{dt}{r}.\tag{96.3}$$

In the velocity $\mathbf v(t)=\mathbf p(t)/\varepsilon$, the energy $\varepsilon$, which depends on the magnitude but not the direction of $\mathbf p$, may be regarded as constant with sufficient accuracy.
A further integration then gives

$$\mathbf r(t)-\mathbf r_1=\mathbf v_1(t-t_1)+\frac1\varepsilon\int_{t_1}^{t}[\mathbf p(t')-\mathbf p_1]\,dt'.\tag{96.4}$$

We shall take $t_1=-\infty$, so that the quantities $\mathbf p_1=\mathbf p(-\infty)\equiv\mathbf p$ and $\mathbf v=\mathbf p/\varepsilon$ are the initial momentum and velocity of the electron.

We can put the probability (90.7) in the form

$$dw=|a(\boldsymbol\rho)|^2\,d^3k/(2\pi)^3,\tag{96.5}$$

where

$$a(\boldsymbol\rho)=e\sqrt{\frac{2\pi}{\omega}}\int_{-\infty}^{\infty}R(t)\exp\Big\{i\frac{\varepsilon}{\varepsilon'}\big(\omega t-\mathbf k\cdot\mathbf r(t)\big)\Big\}\,dt,\qquad R(t)=\frac{u_f^*(p')}{\sqrt{2\varepsilon'}}\,(\boldsymbol\alpha\cdot\mathbf e^*)\,\frac{u_i(p)}{\sqrt{2\varepsilon}},\tag{96.6}$$

and $\varepsilon'=\varepsilon-\omega$, $\mathbf p'(t)=\mathbf p(t)-\mathbf k$.

The classical function $\mathbf p(t)$ is given by (96.3). If $\mathbf p$ denotes the initial momentum of the particle, we have for a Coulomb field ($U=-\nu/r$, $\nu=Z\alpha$)

$$\mathbf p(t)=\mathbf p-\frac{\nu\boldsymbol\rho}{\rho^2}\Big[\frac{t}{\sqrt{t^2+\rho^2}}+1\Big]$$

and

$$\mathbf r(t)=\frac{\mathbf p}{\varepsilon}\,t-\frac{\nu\boldsymbol\rho}{\varepsilon\rho^2}\Big[\sqrt{t^2+\rho^2}+t\Big].$$

In terms of the change of momentum in classical scattering,

$$\boldsymbol\Delta=\mathbf p(\infty)-\mathbf p(-\infty)=-2\nu\boldsymbol\rho/\rho^2,\tag{96.7}$$

we can rewrite these formulae as

$$\mathbf p(t)=\mathbf p+\tfrac12\boldsymbol\Delta\Big[1+\frac{t}{\sqrt{t^2+\rho^2}}\Big],\qquad\mathbf r(t)=\big(\mathbf p+\tfrac12\boldsymbol\Delta\big)\frac{t}{\varepsilon}+\frac{\boldsymbol\Delta}{2\varepsilon}\sqrt{t^2+\rho^2}.\tag{96.8}$$

Now using formula (90.20) for $R(t)$ and the expressions (96.8) for $\mathbf p(t)$ and $\mathbf r(t)$, we can calculate the integral with respect to time in (96.6). The integration is carried out by replacing the variable $t$ by

$$\xi=\frac{\varepsilon}{\varepsilon'}\big(\omega t-\mathbf k\cdot\mathbf r(t)\big)$$

and using an integral representation of the Macdonald function $K_1$. There is, however, no need to complete these calculations, since we want the expression $a(\boldsymbol\rho)$ only for small values of $\Delta$ ($\Delta\ll m$), an independent parameter. Then we obtain

$$a(\boldsymbol\rho)=w_f^*\,D\,w_i\,\Delta\,xK_1(x),\tag{96.9}$$

where

$$x=\rho\,\frac{\varepsilon}{\varepsilon'}\,\omega(1-\mathbf n\cdot\mathbf v),\qquad\mathbf n=\mathbf k/\omega,$$

and $D$ is some function of $\mathbf p$, $\mathbf e$ and $\mathbf k$ (but not of $\boldsymbol\rho$), whose precise form is unimportant.† Since, in the ultra-relativistic case, the photon is emitted at a small angle $\theta$ to the direction of the electron velocity, we have

$$x\approx\rho\,\frac{\varepsilon}{\varepsilon'}\,\omega\big(1-v+\tfrac12\theta^2\big)$$

or

$$x=\rho\,\frac{m^2\omega}{2\varepsilon\varepsilon'}\,(1+\delta^2),\qquad\delta=\theta\varepsilon/m.\tag{96.10}$$

† The spinors $w_i$ and $w_f$ may be taken as constant in the integration, i.e. the change in the electron's polarization in its classical ultra-relativistic motion may be neglected. This can be seen from the equations derived in §41.

It has already been mentioned that (96.5) is the probability of photon emission in a single passage of an electron past a nucleus at impact parameter $\rho$. The cross-section for the emission of a photon with given frequency and direction is obtained by multiplying by $d\rho_x\,d\rho_y\equiv d^2\rho$ and integrating with respect to the impact parameter:

$$d\sigma_{\mathbf k}=\frac{d^3k}{(2\pi)^3}\int|a(\boldsymbol\rho)|^2\,d^2\rho.\tag{96.11}$$

However, it should not be thought that this formula without the integration over $d^2\rho$ would also give the directional distribution of the final electrons. The deviation of the electron in motion in a classical path, which is uniquely determined by the external field, is certainly not the same as the indeterminate quantum-mechanical deviation (and the limit $\mathbf p'(\infty)$ of the classical function $\mathbf p'(t)$ is therefore also different from the actual final momentum of the electron). Consequently, in order to obtain the angular distribution of the electrons, we must re-expand their wave function in plane waves.

It is seen from (96.11) that $a(\boldsymbol\rho)$ is the amplitude of photon emission in a passage at impact parameter $\rho$. The expressions (96.5) and (96.6), however, define this amplitude only to within a phase factor, which is easily seen to be $e^{-i\mathbf k\cdot\boldsymbol\rho}$: on account of the time-independent term $\mathbf r_\perp(\infty)=\boldsymbol\rho$ in $\mathbf r(t)$, this constant factor must be present in $V_{fi}(t)$, and may be taken outside the time integral. Since it is not an operator, it is not affected by the process of commutation, and the amplitude for the emission process is thus

$$e^{-i\mathbf k\cdot\boldsymbol\rho}\,a(\boldsymbol\rho),\tag{96.12}$$

where $a(\boldsymbol\rho)$ is given by (96.9).

Now let the electron be described, as $z\to-\infty$, by a plane wave with momentum $\mathbf p$ along the $z$-axis. This means that the wave function of the electron as $z\to-\infty$, as a function of $x$ and $y$, reduces to a constant, which can be taken as unity. Then the wave function of the electron which has passed through the field is, for $z\to\infty$,‡

$$\psi(\infty)\equiv S(\boldsymbol\rho)=\exp\Big\{-i\int_{-\infty}^{\infty}U(x,y,z)\,dz\Big\}.\tag{96.13}$$

‡ See QM (131.4); we have in mind the analogy between equation (39.5) (in which we put $p^2\approx\varepsilon^2$) and the non-relativistic Schrödinger's equation (39.5a). Bearing in mind the difference in the significance of the coefficients in these equations, it is easily seen that in our case the conditions QM (131.1) for the formula QM (131.4) to be valid are in fact satisfied. The fact that this formula is not valid for arbitrarily large $z$ is unimportant, for the same reasons as in QM §131.

According to the significance of the transition amplitude (96.12), the wave function of an electron which has passed through the field and emitted a photon is

$$e^{-i\mathbf k\cdot\boldsymbol\rho}\,a(\boldsymbol\rho)\,S(\boldsymbol\rho).\tag{96.14}$$

The amplitude for emission in which the electron goes to a state with definite momentum $\mathbf p'$ is given by the corresponding Fourier component of (96.14), i.e. by

$$a(\mathbf q_\perp)=\int e^{-i\mathbf p'\cdot\boldsymbol\rho}\,e^{-i\mathbf k\cdot\boldsymbol\rho}\,a(\boldsymbol\rho)S(\boldsymbol\rho)\,d^2\rho=\int e^{-i\mathbf q\cdot\boldsymbol\rho}\,a(\boldsymbol\rho)S(\boldsymbol\rho)\,d^2\rho,\tag{96.15}$$

where $\mathbf q_\perp$ is the transverse component of the momentum transfer to the nucleus; cf. QM (131.7). The cross-section for scattering with a given transfer $\mathbf q_\perp$ is

$$d\sigma=|a(\mathbf q_\perp)|^2\,\frac{d^2q_\perp}{(2\pi)^2}\,\frac{d^3k}{(2\pi)^3}.\tag{96.16}$$

Let us now calculate $S(\boldsymbol\rho)$. In the case of a Coulomb field considered here, the integral in the exponent diverges, in accordance with the phase divergence in Coulomb scattering. The integral must therefore be first calculated between finite limits:

$$\int_{-R}^{R}U\,dz=-\nu\int_{-R}^{R}\frac{dz}{\sqrt{z^2+\rho^2}}=-2\nu\big[\log\big(R+\sqrt{R^2+\rho^2}\big)-\log\rho\big]\approx-2\nu\log2R+2\nu\log\rho\qquad(R\gg\rho).$$

The first term, which is a constant, is unimportant, and therefore

$$S(\boldsymbol\rho)=e^{-2i\nu\log\rho}=\rho^{-2i\nu}.\tag{96.17}$$

Substituting (96.9) and (96.17) in (96.15) and integrating over the directions of the vector $\boldsymbol\rho$ in the $xy$-plane, we find

$$a(\mathbf q_\perp)\propto\nu\int_0^\infty\rho^{-2i\nu}K_1(x)J_1(q_\perp\rho)\,\rho\,d\rho,\tag{96.18}$$

where $x\propto\rho$ is given by (96.10) and $J_1$ is the Bessel function. The factors not involving $\nu=Z\alpha$ have not been written here. We see that the dependence of the amplitude $a(\mathbf q_\perp)$ (and therefore of the cross-section (96.16)) on $\nu$ is contained in a separate factor.
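The two Coulomb integrals used above, the classical momentum change (96.7) and the eikonal phase entering (96.17), can be checked by direct numerical quadrature. This is a rough sketch, not part of the original calculation: it assumes $v=1$, a straight-line path, finite cut-offs in place of the infinite limits, and a hand-written Simpson rule; all names are hypothetical.

```python
import math

def simpson(func, a, b, n=40000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def momentum_kick(nu, rho, cutoff=200.0):
    """|Delta| = nu * rho * integral dt / (rho^2 + t^2)^(3/2), cf. (96.3), (96.7)."""
    t_max = cutoff * rho
    return simpson(lambda t: nu * rho / (rho * rho + t * t) ** 1.5, -t_max, t_max)

def phase_integral_numeric(nu, rho, R):
    """Integral of U = -nu/r along the straight path from z = -R to z = R."""
    return simpson(lambda z: -nu / math.sqrt(z * z + rho * rho), -R, R)

def phase_integral_closed(nu, rho, R):
    """Closed form quoted before (96.17)."""
    return -2.0 * nu * (math.log(R + math.sqrt(R * R + rho * rho)) - math.log(rho))

if __name__ == "__main__":
    nu, rho, R = 0.6, 1.0, 50.0  # assumed example values
    print(momentum_kick(nu, rho), 2.0 * nu / rho)
    print(phase_integral_numeric(nu, rho, R), phase_integral_closed(nu, rho, R))
```

The $\rho$-dependent part of the phase integral is $2\nu\log\rho$, which is what gives $S(\boldsymbol\rho)=\rho^{-2i\nu}$ once the constant $-2\nu\log2R$ is dropped.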
On the other hand, when $\nu\to0$, the cross-section must tend to its value in the Born approximation. It is therefore immediately clear that the cross-section will differ from the Born value only by a factor which is independent of the electron polarization and hence does not influence the polarization effects.

The integral (96.18) can be expressed in terms of the hypergeometric function by means of the formula

$$\int_0^\infty x^{-\lambda}K_1(ax)J_1(bx)\,x\,dx=\frac{b\,\Gamma(2-\tfrac12\lambda)\,\Gamma(1-\tfrac12\lambda)}{2^{\lambda}a^{3-\lambda}}\Big(1+\frac{b^2}{a^2}\Big)^{-1+\lambda/2}F\Big(\frac\lambda2,\,1-\frac\lambda2,\,2,\,\frac{b^2}{a^2+b^2}\Big).$$

This gives

$$a(\mathbf q_\perp)\propto\nu(1-i\nu)\big(\tfrac12q\big)^{2i\nu}\,\Gamma^2(1-i\nu)\,F(i\nu,1-i\nu,2,z),\tag{96.19}$$

where

$$z=1-\frac{m^4\omega^2}{4\varepsilon^2\varepsilon'^2q^2}(1+\delta^2)^2,\qquad\delta=\varepsilon\theta/m;\tag{96.20}$$

here we have used the fact that, in region II (see (96.2)), the component of the vector $\mathbf q$ parallel to $\mathbf p$ is

$$q_\parallel^2=q^2-q_\perp^2\approx\frac{m^4\omega^2}{4\varepsilon^2\varepsilon'^2}(1+\delta^2)^2.\tag{96.21}$$

This is easily proved, since in that region the angles between the momenta $\mathbf p$, $\mathbf p'$ and $\mathbf k$ satisfy the conditions (93.15).

The hypergeometric function in (96.19) can be reduced to the function $F(z)$ in (95.15) by means of the formula

$$F(a,b+1,c+1,z)=\frac{c}{c-a}\,F(a,b,c,z)-\frac{c(1-z)}{b(c-a)}\,F'(a,b,c,z).$$

The final result is then

$$d\sigma=d\sigma_B\Big(\frac{\pi\nu}{\sinh\pi\nu}\Big)^2\Big[F^2(z)+\frac{(1-z)^2}{\nu^2}F'^2(z)\Big],\tag{96.22}$$

where $d\sigma_B$ is the Born cross-section (93.13) (H. A. Bethe and L. C. Maximon, 1954).

When $q\gg m^2/\varepsilon$, we have $z\approx1$, and the whole coefficient of $d\sigma_B$ tends to unity; in this sense formula (96.22), which has been derived for region II, is automatically satisfied for all $q\lesssim m$. When $q\lesssim m^2/\varepsilon$ and the correction factor in (96.22) is different from unity, the vectors $\mathbf p$, $\mathbf p'$ and $\mathbf k$ are almost coplanar, and the quantities $\delta$ and $\delta'$ are almost equal; this has already been taken into account in (96.22). Thus $q^2$ in the expression (96.20) for $z$ can be written as

$$\frac{q^2}{m^2}=\delta^2+\delta'^2-2\delta\delta'\cos\phi+\frac{m^2\omega^2}{4\varepsilon^2\varepsilon'^2}(1+\delta^2)^2,\tag{96.23}$$

i.e. we can put $\delta=\delta'$ in the second term in (93.14), but not in the first term, which does not contain a small coefficient ($\sim m^2/\varepsilon^2$).
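The reduction of $F(i\nu,1-i\nu,2,z)$ to $F(z)$ and $F'(z)$, and the resulting correction factor in (96.22), can be verified numerically from the defining power series. The following is a sketch under stated assumptions: the hypergeometric function is summed directly (so it is only usable for $|z|<1$, and slowly convergent near $z=1$), the derivative is taken from the standard identity $F'(z)=\nu^2F(1-i\nu,1+i\nu,2,z)$, and all function names are hypothetical.

```python
import math

def hyp2f1(a, b, c, z, terms=400):
    """Power series of the Gauss hypergeometric function for |z| < 1 (complex a, b allowed)."""
    term = 1.0 + 0.0j
    total = term
    for n in range(terms):
        term *= (a + n) * (b + n) / ((c + n) * (n + 1)) * z
        total += term
    return total

def F_and_derivative(nu, z, terms=400):
    """F(z) = F(-i nu, i nu, 1, z) and F'(z) = nu^2 F(1 - i nu, 1 + i nu, 2, z)."""
    F = hyp2f1(-1j * nu, 1j * nu, 1.0, z, terms)
    Fp = nu * nu * hyp2f1(1.0 - 1j * nu, 1.0 + 1j * nu, 2.0, z, terms)
    return F.real, Fp.real

def coulomb_correction_factor(nu, z, terms=400):
    """Coefficient of the Born cross-section in (96.22)."""
    F, Fp = F_and_derivative(nu, z, terms)
    pref = (math.pi * nu / math.sinh(math.pi * nu)) ** 2
    return pref * (F * F + ((1.0 - z) * Fp / nu) ** 2)

if __name__ == "__main__":
    for z in (0.0, 0.3, 0.7, 0.9):
        print(z, coulomb_correction_factor(0.6, z))
```

With $a=i\nu$, $b=-i\nu$, $c=1$ the contiguous relation gives $F(i\nu,1-i\nu,2,z)=[F(z)-i(1-z)F'(z)/\nu]/(1-i\nu)$; its squared modulus, multiplied by $|\nu(1-i\nu)\Gamma^2(1-i\nu)|^2$, reproduces the factor in (96.22).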
To find the cross-section integrated over angles, there is no need to repeat the integration: we can proceed as follows (H. Olsen, 1955). Various directions of $\mathbf p'$ (for a given energy $\varepsilon'$) correspond to degeneracy of the final state of the electron. It is evident that the result of summing over states which belong to one degenerate level is independent of how the complete set of these states is chosen. We can therefore use, in summing over directions of $\mathbf p'$, the set of functions $\psi^{(+)}_{\varepsilon'\mathbf p'}$ instead of the $\psi^{(-)}_{\varepsilon'\mathbf p'}$ which are needed in calculating the differential cross-section, i.e. we can define the bremsstrahlung matrix element as

$$M'_{fi}=\int\psi^{(+)*}_{\varepsilon'\mathbf p'}\,(\boldsymbol\alpha\cdot\mathbf e^*)\,e^{-i\mathbf k\cdot\mathbf r}\,\psi^{(+)}_{\varepsilon\mathbf p}\,d^3x.$$

This integral is easily seen to be the same as $(M_{fi})^*$ if the parameters of the wave functions in the latter are changed as follows:

$$\mathbf p_+,\ p_+,\ \varepsilon_+\to-\mathbf p,\ -p,\ -\varepsilon;\qquad\mathbf p_-,\ p_-,\ \varepsilon_-\to\mathbf p',\ p',\ \varepsilon';\qquad\mathbf k\to-\mathbf k,$$

and the sign of the integration variables is reversed: $\mathbf r\to-\mathbf r$. Hence it is clear that the bremsstrahlung cross-section integrated over angles can be obtained from the integral pair production cross-section (95.20), on multiplication by $\omega^2\,d\omega/p_+^2\,d\varepsilon_+$ (cf. (91.6)) and replacement of $\varepsilon_+$ by $-\varepsilon$, $\varepsilon_-$ by $\varepsilon'$. Thus we have

$$d\sigma=4Z^2\alpha r_e^2\,\frac{\varepsilon'}{\varepsilon}\Big(\frac{\varepsilon}{\varepsilon'}+\frac{\varepsilon'}{\varepsilon}-\frac23\Big)\Big[\log\frac{2\varepsilon\varepsilon'}{m\omega}-\frac12-f(\alpha Z)\Big]\frac{d\omega}{\omega}.\tag{96.24}$$

We see that the corrections to the Born formulae for the integral bremsstrahlung and pair production cross-sections are given by the same function $f(\alpha Z)$.

Formula (96.24), which does not depend on any limitations on the value of $Z\alpha$, allows a passage to the classical limit ($\hbar\to0$, $Z\alpha\to\infty$). In this limit, we must also put $\varepsilon\approx\varepsilon'$. Bearing in mind the asymptotic expression $\Psi(z)\approx\log z$ as $|z|\to\infty$ and the value $\Psi(1)=-C$ (where $C$ is Euler's constant), we find the effective retardation

$$\omega\,d\sigma=\frac{16}{3}Z^2\alpha r_e^2\Big[\log\frac{2\varepsilon^2}{m\omega Z\alpha}-\frac12-C\Big]d\omega.\tag{96.25}$$

This expression, which does not contain $\hbar$, is the classical frequency distribution of the bremsstrahlung intensity.

§97. Electron-electron bremsstrahlung in the ultra-relativistic case

Electron-electron bremsstrahlung is represented by eight Feynman diagrams: four diagrams

[Diagrams (97.1a) and (97.1b)]

and four "exchange" diagrams obtained from those shown by interchanging $p_1'$ and $p_2'$. Here we shall give the results of the calculations for the ultra-relativistic case (G. Altarelli and F. Buccella, 1964; V. N. Baier, V. S. Fadin and V. A. Khoze, 1966).†

In the laboratory system (the rest frame of one of the initial electrons, say the second), the emission cross-section integrated over the directions of the photon can be written as a sum $d\sigma=d\sigma^{(1)}+d\sigma^{(2)}$, where

$$d\sigma^{(1)}=4\alpha r_e^2\,\frac{d\omega}{\omega}\,\frac{\varepsilon-\omega}{\varepsilon}\Big(\frac{\varepsilon}{\varepsilon-\omega}+\frac{\varepsilon-\omega}{\varepsilon}-\frac23\Big)\Big(\log\frac{2\varepsilon(\varepsilon-\omega)}{m\omega}-\frac12\Big);\tag{97.2}$$

$$d\sigma^{(2)}=\frac23\alpha r_e^2\,\frac{m\,d\omega}{\omega^2}\Big[\Big(4-\frac m\omega+\frac{m^2}{4\omega^2}\Big)\log\frac{2\varepsilon}{m}-2+\frac{2m}{\omega}-\frac{5m^2}{8\omega^2}\Big]\quad\text{for }\omega\ge\tfrac12m,\tag{97.3}$$

$$d\sigma^{(2)}=\ldots\quad\text{for }\omega\le\tfrac12m\tag{97.4}$$

($\varepsilon$ being the initial energy of the first electron). These formulae are accurate as far as terms of relative order $m/\varepsilon$. To this accuracy, it is found that the contributions to the cross-section from different diagrams do not interfere, and in this sense $d\sigma^{(1)}$ and $d\sigma^{(2)}$ correspond to emission by each of the two electrons: the fast electron and the recoil electron respectively, diagrams (97.1a) and (97.1b). The "exchange" diagrams give the same contribution to the cross-section as do the "direct" diagrams. Since the electrons are identical, the total contribution from the direct and exchange diagrams has to be halved, and we may therefore consider only the contribution of the direct diagrams and ignore the identity of the particles.
For electron-positron collisions the exchange diagrams are replaced by annihilation diagrams, but their relative contribution is of order $m/\varepsilon$ and therefore negligible. Hence the bremsstrahlung cross-sections are the same, to the accuracy indicated, in electron-electron and electron-positron collisions.

† The calculations are given in the book by Baier et al. cited in §93.

For $\omega\gg m$, the ratio

$$\frac{d\sigma^{(2)}}{d\sigma^{(1)}}\sim\frac m\omega\ll1,$$

i.e. the emission from the recoil electron is small compared with that from the fast electron; when this ratio becomes of the order of $m/\varepsilon$, formula (97.3) is of course no longer meaningful. When $\omega\ll m$, on the other hand, the two parts of the cross-section are almost comparable:

$$d\sigma^{(1)}=\frac{16}{3}\alpha r_e^2\,\frac{d\omega}{\omega}\log\frac{2\varepsilon^2}{m\omega},\qquad d\sigma^{(2)}=\frac{16}{3}\alpha r_e^2\,\frac{d\omega}{\omega}\log\frac{\varepsilon}{\omega},\qquad\omega\ll m.\tag{97.5}$$

For formulae (97.2)-(97.5) to be valid, it is necessary that at least one of the electrons should remain ultra-relativistic after emission of radiation, i.e. the photon frequency must be sufficiently far from the hard boundary of the spectrum (the maximum frequency $\omega_{\max}$ that can be emitted). The final energy of the electrons is least, and the photon energy greatest, when both electrons move, after emission, in the direction of the photon at equal speeds. The conservation laws then give

$$\varepsilon+m=\omega_{\max}+2\varepsilon',\qquad|\mathbf p|=\omega_{\max}+2|\mathbf p'|.$$

Hence, eliminating $\varepsilon'$ and $\mathbf p'$, we have

$$(\varepsilon+m-\omega_{\max})^2-(|\mathbf p|-\omega_{\max})^2=4m^2$$

and

$$\omega_{\max}=\frac{m(\varepsilon-m)}{m+\varepsilon-|\mathbf p|}.\tag{97.6}$$

When $\varepsilon\gg m$, $\omega_{\max}\approx\varepsilon$. Thus formulae (97.2)-(97.4) are valid if

$$\omega_{\max}-\omega\approx\varepsilon-\omega\gg m.\tag{97.7}$$

The cross-section (97.2) for emission by the fast electron is exactly equal to that for electron-nucleus bremsstrahlung when the nucleus has $Z=1$ (formula (93.17)). This agreement is not fortuitous, and can be explained by considering the significance of recoil in the emission process. In deriving (93.17) we neglected the recoil of the fixed particle (the nucleus), replacing it by a constant external field.
This was equivalent to neglecting the time component of the momentum transfer 4-vector $q=p'-p+k$ (the recoil energy). We shall show that, in the ultra-relativistic case, this treatment is permissible for electron-electron as well as electron-nucleus bremsstrahlung.

We write

$$-q^2=-(\varepsilon'+\omega-\varepsilon)^2+(p'_\parallel+\omega-p_\parallel)^2+(\mathbf p'_\perp-\mathbf p_\perp)^2,\tag{97.8}$$

where the subscripts indicate the components of the vectors $\mathbf p'$ and $\mathbf p$ (the final and initial electron momenta) parallel and perpendicular to the direction of the photon $\mathbf k$. In the ultra-relativistic case the angles $\theta$, $\theta'$ between $\mathbf k$ and $\mathbf p$, $\mathbf p'$ respectively are small: $\theta\lesssim m/\varepsilon$, $\theta'\lesssim m/\varepsilon'$. Hence

$$|\mathbf p_\perp|\approx|\mathbf p|\theta\sim m,\qquad p_\parallel\approx|\mathbf p|\big(1-\tfrac12\theta^2\big)\approx\varepsilon-\frac{m^2}{2\varepsilon}-\frac{\varepsilon\theta^2}{2},\tag{97.9}$$

and similarly for $\mathbf p'_\perp$ and $p'_\parallel$. Neglecting recoil, we have $\varepsilon'+\omega-\varepsilon=0$; the term $p'_\parallel+\omega-p_\parallel\sim m^2/\varepsilon$, and so

$$-q^2\approx(\mathbf p'_\perp-\mathbf p_\perp)^2\sim m^2.\tag{97.10}$$

The energy of (electron-electron) recoil is

$$q_0=\varepsilon'+\omega-\varepsilon=q^2/2m,\qquad|q_0|\sim m.\tag{97.11}$$

The change in $\mathbf p'_\perp$ due to the change in $\varepsilon'$ is negligible. The change in $q^2$ with allowance for recoil, which we denote by $\Delta q^2$, is therefore given by the first two terms in (97.8). Using (97.9), we have

$$\Delta q^2\approx(\varepsilon'+\omega-\varepsilon)\Big(-\frac{m^2}{\varepsilon}-\varepsilon\theta^2+\frac{m^2}{\varepsilon'}+\varepsilon'\theta'^2\Big)\sim m^2\cdot m/\varepsilon.$$

Comparison with (97.10) shows that $\Delta q^2\ll|q^2|$, and the neglect of the recoil is therefore justified.†

The fact that the fast particle emits into a narrow cone (with aperture angle $\sim m/\varepsilon$) in the direction of its motion enables us to deduce the cross-section in the centre-of-mass system by a simple conversion of the cross-section (97.2) from the laboratory system.‡ In the centre-of-mass system the two electrons emit in the same manner, each in the direction of its motion. (It may be noted that this gives an intuitive explanation of the absence of interference between the radiation from the two particles.) The energy $E$ of the ultra-relativistic electron in this system is related to its energy $\varepsilon$ in the laboratory system by $2E^2=m\varepsilon$; the respective photon frequencies $\Omega$ and $\omega$ are related by $\omega/\varepsilon=\Omega/E$.
These equations are easily obtained by comparing the values of the invariants $(p_1 p_2)$ and $(p_1 k)$ in the two systems. The cross-section for emission by each electron in the centre-of-mass system is therefore

$$d\sigma^{(1)} = d\sigma^{(2)} = 4\alpha r_e^2\,\frac{d\Omega}{\Omega}\,\frac{E - \Omega}{E}\left(\frac{E}{E - \Omega} + \frac{E - \Omega}{E} - \frac{2}{3}\right)\left(\log\frac{4E^2(E - \Omega)}{m^2\Omega} - \frac{1}{2}\right). \tag{97.12}$$

† This conclusion is, of course, valid a fortiori in the case of electron-nucleus bremsstrahlung, for which the recoil energy $q_0 \sim \mathbf{q}^2/2M \sim m^2/M$, where $M$ is the mass of the nucleus.

‡ In general such a conversion is not possible, because the contribution to the spectrum in a given frequency range $d\omega$ comes from photons emitted in quite different directions.

For (97.12) to be valid it is also necessary that the photon frequency should not be close to the boundary of the spectrum. For an ultra-relativistic particle, the above-mentioned transformation gives immediately, when $\omega_{\max} \approx \varepsilon$,

$$\Omega_{\max} \approx \omega_{\max} E/\varepsilon \approx E. \tag{97.13}$$

Thus, in the centre-of-mass system, the electrons can emit only half of their total energy $2E$. A direct calculation of $\Omega_{\max}$ is easily performed by noting that, after the emission of such a photon, the electrons will move (in that system) at equal speeds in the direction opposite to that of the photon. We have

$$2E = 2E' + \Omega_{\max}, \qquad 2|\mathbf{p}'| = \Omega_{\max},$$

whence

$$\Omega_{\max} = \mathbf{p}^2/E = E - m^2/E, \tag{97.14}$$

and in the ultra-relativistic case again (97.13). Thus formula (97.12) is applicable under the condition

$$\Omega_{\max} - \Omega \sim E - \Omega \gg m. \tag{97.15}$$

We shall now give some formulae for emission in the centre-of-mass system in the opposite limiting case, near the boundary of the spectrum, when†

$$\Omega_{\max} - \Omega \ll m. \tag{97.16}$$

Since in this case the recoil is very important, the results differ from those for scattering by a fixed centre and are also different for electron-electron and electron-positron scattering (V. N. Baier, V. S. Fadin and V. A. Khoze, 1967).
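The two expressions for the hard end of the spectrum, (97.6) in the laboratory system and (97.14) in the centre-of-mass system, can be cross-checked numerically. The following is only a sketch: the function names and the sample values $m = 1$, $\varepsilon = 50$ are our own choices, and the photon is assumed to move exactly along the fast electron, so that it transforms with the factor $\gamma(1 - \beta)$ of the boost to the centre-of-mass system.

```python
import math

def omega_max_lab(eps, m=1.0):
    """Hard end of the spectrum in the laboratory system, formula (97.6)."""
    p = math.sqrt(eps * eps - m * m)
    return m * (eps - m) / (m + eps - p)

def omega_max_cm(E, m=1.0):
    """Hard end of the spectrum in the centre-of-mass system, formula (97.14)."""
    return E - m * m / E

def lab_to_cm_photon(omega, eps, m=1.0):
    """Frequency in the CM system of a photon moving along the fast electron."""
    p = math.sqrt(eps * eps - m * m)
    sqrt_s = math.sqrt(2.0 * m * m + 2.0 * m * eps)  # total energy in the CM system
    # gamma * (1 - beta), with beta = p / (eps + m) and gamma = (eps + m) / sqrt_s
    return omega * (eps + m - p) / sqrt_s

m, eps = 1.0, 50.0  # illustrative values only
p = math.sqrt(eps * eps - m * m)
w = omega_max_lab(eps, m)

# After emission both electrons share the remaining energy and momentum equally.
eps_final = 0.5 * (eps + m - w)
p_final = 0.5 * (p - w)
mass_shell_residual = eps_final ** 2 - p_final ** 2 - m * m

# Exact CM energy of each electron; reduces to 2E^2 = m*eps when eps >> m.
E_cm = 0.5 * math.sqrt(2.0 * m * m + 2.0 * m * eps)
Omega_from_lab = lab_to_cm_photon(w, eps, m)

print(mass_shell_residual, Omega_from_lab, omega_max_cm(E_cm, m))
```

The first printed number should vanish to rounding accuracy (the final electrons are on the mass shell), and the last two should coincide, since the boundary of the spectrum is the same physical configuration seen from the two systems.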
In electron-electron scattering, besides the squares of the diagrams (97.1), there is also a contribution to the emission cross-section near the boundary of the spectrum from products (interference terms) of the direct and exchange diagrams, in which a given initial particle emits, for example, the product of the second diagram (97.1a) and the diagram

[Feynman diagram]

This is because, near the boundary, the final particles have similar momenta and there is no reason for the exchange terms to be small. The final result for the cross-section is

v n , 2 [E(n max -n)p da = 2ar? Tdil T^. (97.17) Umax M

† The result obtained in the Born approximation is, of course, as usual valid only if the relative velocity of the final electrons is large in comparison with $\alpha$. If not, the interaction of the particles in the final state has to be taken into account.

In electron-positron scattering, a logarithmically large contribution to the emission cross-section comes from the squares of annihilation diagrams, in which there is emission by the initial particles:

[Feynman diagrams] (97.18)

The squares of other diagrams are significant when the accuracy is not logarithmic, but the interference terms are small. The final result is

da = 2arAm^zM(logm+l\^L, m \ m / ilmax (97.19)

Thus the emission in electron-positron scattering is logarithmically large in comparison with that in electron-electron scattering.

§98. Emission of soft photons in collisions

Let $d\sigma_0$ be the cross-section for a given process of scattering of charged particles, which may be accompanied by the emission of a certain number of photons. Together with this process, we shall consider another which differs from it only in that one extra photon is emitted.
If the frequency $\omega$ of this photon is sufficiently small (the necessary conditions will be formulated below), the cross-section $d\sigma$ for the second process is related in a simple manner to $d\sigma_0$. When $\omega$ is small, we can neglect the influence of the emission of this quantum on the scattering process. The cross-section $d\sigma$ can therefore be represented as a product of two independent factors, the cross-section $d\sigma_0$ and the probability $dw$ of emission of a single photon in the collision. The emission of a soft photon is a quasi-classical process; the probability is therefore the same as the classically calculated number of quanta emitted in the collision, i.e. the same as the classical intensity (total energy) of emission $dI$, divided by $\omega$ ($= \hbar\omega$). Thus

$$d\sigma = d\sigma_0\, dI/\omega. \tag{98.1}$$

We shall show how this formula can be derived from the general rules of the diagram technique (J. M. Jauch and F. Rohrlich, 1954).

The diagrams for the process involving an additional photon are obtained from those for the original process by adding an external photon line which "branches off" from some (external or internal) electron line, i.e. by replacing

[Diagram: an electron line $p$ is replaced by a line $p$ that emits the photon $k$ and continues as $p - k$] (98.2)

It is easily seen that the most important diagrams will be those in which this change is made in external electron lines. For, if $p$ is the momentum of the external line ($p^2 = m^2$), then for small $k$ we have also $(p - k)^2 \approx m^2$, i.e. the factor $G(p - k)$ added to the diagram is near its pole.

For an initial electron line $p$ the change (98.2) amounts to the following change in the reaction amplitude:

$$u(p) \to e\sqrt{4\pi}\,G(p - k)(\gamma e^*)u(p) \approx -e\sqrt{4\pi}\,\frac{(\gamma p) + m}{2(pk)}\,(\gamma e^*)u(p).$$

Since $(\gamma p)(\gamma e^*) = 2(pe^*) - (\gamma e^*)(\gamma p)$ and $(\gamma p)u(p) = mu(p)$, we obtain the following rule:

$$u(p) \to -e\sqrt{4\pi}\,\frac{(pe^*)}{(pk)}\,u(p). \tag{98.3}$$

Similarly, for a final electron line $p'$, the replacement of the line $p'$ by a line $p' + k$ that emits the photon $k$ and continues as $p'$ implies the change

$$\bar{u}(p') \to e\sqrt{4\pi}\,\bar{u}(p')\,\frac{(p'e^*)}{(p'k)} \tag{98.4}$$

in the amplitude.
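The algebraic step behind the rule (98.3), namely $[(\gamma p) + m](\gamma e^*)u(p) = 2(pe^*)u(p)$, can be verified with explicit matrices. The sketch below assumes the standard (Dirac) representation of the $\gamma$-matrices and an unnormalized bispinor $u = ((E + m)\chi,\ (\boldsymbol{\sigma}\cdot\mathbf{p})\chi)$; the numerical values of $\mathbf{p}$, $\chi$ and $e$ are arbitrary illustrative choices, and all helper names are ours.

```python
import math

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]

def mat_vec(a, v):
    return [sum(a[i][k] * v[k] for k in range(4)) for i in range(4)]

# Pauli matrices and gamma^0 in the standard (Dirac) representation.
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
GAMMA0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def gamma_space(s):
    """gamma^i = [[0, sigma_i], [-sigma_i, 0]] as a 4x4 matrix."""
    g = [[0] * 4 for _ in range(4)]
    for i in range(2):
        for j in range(2):
            g[i][j + 2] = s[i][j]
            g[i + 2][j] = -s[i][j]
    return g

GAMMAS = [GAMMA0] + [gamma_space(s) for s in SIGMA]

def slash(a):
    """(gamma a) = gamma^0 a_0 - gamma . a for a 4-vector a."""
    signs = [1, -1, -1, -1]
    return [[sum(signs[mu] * a[mu] * GAMMAS[mu][i][j] for mu in range(4))
             for j in range(4)] for i in range(4)]

def dot4(a, b):
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

m = 1.0
p3 = (0.3, -1.2, 2.0)
energy = math.sqrt(m * m + sum(x * x for x in p3))
p = (energy,) + p3

# Unnormalized positive-energy solution of (gamma p - m) u = 0.
chi = [1.0, 0.5j]
sigma_p = [[sum(p3[a] * SIGMA[a][i][j] for a in range(3)) for j in range(2)] for i in range(2)]
lower = [sum(sigma_p[i][j] * chi[j] for j in range(2)) for i in range(2)]
u = [(energy + m) * chi[0], (energy + m) * chi[1], lower[0], lower[1]]

e = (0.0, 0.7, -0.2, 0.4)  # arbitrary polarization 4-vector

p_slash = slash(p)
p_slash_plus_m = [[p_slash[i][j] + (m if i == j else 0) for j in range(4)] for i in range(4)]

lhs = mat_vec(mat_mul(p_slash_plus_m, slash(e)), u)
rhs = [2 * dot4(p, e) * x for x in u]
max_deviation = max(abs(l - r) for l, r in zip(lhs, rhs))
dirac_residual = max(abs(x - m * y) for x, y in zip(mat_vec(p_slash, u), u))
print(max_deviation, dirac_residual)
```

Both printed numbers should be zero to rounding accuracy: the second confirms that $u$ satisfies the Dirac equation, the first that the matrix factor collapses to the scalar $2(pe)$ when acting on it.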
In the rest of the diagram we can everywhere neglect the changes in the momenta of the lines as a result of the emission of the photon $k$. Here it is assumed that the photon energy $\omega$ is always small in comparison with the energies of all the particles participating in the reaction (and in comparison with those of the hard photons, if any, that are emitted).

Let the cross-section $d\sigma_0$ refer, say, to the scattering of an electron by a fixed nucleus (with possible emission of hard photons). The amplitude of this process, which will be conventionally called "elastic", is

$$M_{fi}^{(el)} = \bar{u}(p')Mu(p).$$

Making the successive substitutions (98.3) and (98.4) and adding the results, we obtain the bremsstrahlung amplitude for emission of the same hard photons together with a soft photon $k$:†

$$M_{fi} = M_{fi}^{(el)}\,e\sqrt{4\pi}\left(\frac{(p'e^*)}{(p'k)} - \frac{(pe^*)}{(pk)}\right). \tag{98.5}$$

Accordingly, the cross-section is

$$d\sigma = d\sigma_{el} \cdot 4\pi e^2\left|\frac{(p'e)}{(p'k)} - \frac{(pe)}{(pk)}\right|^2\frac{d^3k}{(2\pi)^3\,2\omega}. \tag{98.6}$$

Summation over polarizations of the photon $k$ gives

$$d\sigma = -\alpha\left[\frac{p'}{(p'k)} - \frac{p}{(pk)}\right]^2\frac{d^3k}{4\pi^2\omega}\,d\sigma_{el}. \tag{98.7}$$

In terms of three-dimensional quantities, this formula becomes‡

$$d\sigma = \alpha\left(\frac{\mathbf{v}' \times \mathbf{n}}{1 - \mathbf{v}'\cdot\mathbf{n}} - \frac{\mathbf{v} \times \mathbf{n}}{1 - \mathbf{v}\cdot\mathbf{n}}\right)^2\frac{d\omega\,do_{\mathbf{k}}}{4\pi^2\omega}\,d\sigma_{el}, \tag{98.8}$$

where $\mathbf{n} = \mathbf{k}/\omega$, and $\mathbf{v}$ and $\mathbf{v}'$ are the initial and final velocities of the electron. We see that the coefficient of $d\sigma_{el}$ is in fact the same as the classical intensity of emission (cf. Fields (69.4)), divided by $\omega$, as already asserted in formula (98.1).

The condition for the above formulae to be applicable is that not only is $\omega$ small compared with $\varepsilon$ but also the momentum transfer $\mathbf{q}$ to the nucleus is large compared with the change $\delta\mathbf{q}$ in this quantity due to the emission of the soft photon. We have

$$\delta\mathbf{q} = (\mathbf{p}' - \mathbf{p} - \mathbf{k}) - (\mathbf{p}' - \mathbf{p})_{\omega=0} = \delta\mathbf{p}' - \mathbf{k},$$

where $|\delta\mathbf{p}'| \sim \omega\,d|\mathbf{p}'|/d\varepsilon' \sim \omega/v$ and $|\mathbf{k}| = \omega$. In the non-relativistic case ($v \ll 1$), we therefore obtain the condition

$$\omega/|\mathbf{q}|v \ll 1. \tag{98.9}$$

For scattering by a Coulomb potential (or by any potential that decreases slowly with increasing distance) $|\mathbf{q}| \sim 1/\rho$ (where $\rho$ is the impact parameter), and so this condition can also be written as $\omega\tau \ll 1$, where $\tau \sim \rho/v$ is the characteristic time of the collision.

† It should be noted that the difference term in this formula arises naturally from gauge invariance: the reaction amplitude must be unchanged when the polarization 4-vector $e$ is replaced by $e$ + constant × $k$.

‡ To derive (98.8) it is convenient to return to (98.6), putting $p = (\varepsilon, \varepsilon\mathbf{v})$, $pk = \varepsilon\omega(1 - \mathbf{v}\cdot\mathbf{n})$, ..., $e = (0, \mathbf{e})$, and then summing over polarizations by means of (45.4a).

In the ultra-relativistic case, the photons are emitted chiefly in directions near $\mathbf{v}$ and $\mathbf{v}'$, as is seen from the denominators in (98.8). If the electron scattering angle $\theta$ is small, the directions of all three vectors $\mathbf{p}$, $\mathbf{p}'$, $\mathbf{n}$ are close together. Then $|\delta\mathbf{q}| = |\delta\mathbf{p}'| - |\mathbf{k}|$ and, since $|\mathbf{q}| \sim \varepsilon\theta$, we obtain the condition

$$\theta \gg \frac{m^2}{\varepsilon^2}\,\frac{\omega}{\varepsilon}. \tag{98.10}$$

Because the formulae (98.5)-(98.8) are quasi-classical, they are valid for emission by any charged particles, not necessarily electrons as assumed in the derivation. In general, when several such particles take part in the reaction, formula (98.5) must be put in the form

$$M_{fi} = M_{fi}^{(el)}\,e\sqrt{4\pi}\sum Z\left(\frac{(p'e^*)}{(p'k)} - \frac{(pe^*)}{(pk)}\right), \tag{98.11}$$

where the summation is over all the particles (with charges $Ze$); formulae (98.6)-(98.8) are changed similarly. In particular, in the non-relativistic case

$$M_{fi} = M_{fi}^{(el)}\,e\sqrt{4\pi}\,\frac{1}{\omega}\sum Z(\mathbf{v}' - \mathbf{v})\cdot\mathbf{e}^*. \tag{98.12}$$

For two particles, this formula becomes

$$M_{fi} = M_{fi}^{(el)}\,e\sqrt{4\pi}\,\frac{1}{\omega}\left(\frac{Z_1}{m_1} - \frac{Z_2}{m_2}\right)\mathbf{q}\cdot\mathbf{e}^*, \tag{98.13}$$

$$\mathbf{q} = m(\mathbf{v}' - \mathbf{v}), \qquad m = m_1 m_2/(m_1 + m_2),$$

where $\mathbf{v}$ and $\mathbf{v}'$ are the relative velocities of the particles before and after the collision.
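The passage from the four-dimensional expression (98.7) to the three-dimensional one (98.8) rests on the identity $-\left[p'/(p'k) - p/(pk)\right]^2 = \omega^{-2}\left(\frac{\mathbf{v}'\times\mathbf{n}}{1 - \mathbf{v}'\cdot\mathbf{n}} - \frac{\mathbf{v}\times\mathbf{n}}{1 - \mathbf{v}\cdot\mathbf{n}}\right)^2$. A minimal numerical sketch of this identity follows; the velocities, photon direction and energies are arbitrary illustrative values, and the function names are ours.

```python
import math

def dot3(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def covariant_factor(v, v_prime, n, omega, eps=2.0, eps_prime=1.5):
    """-(p'/(p'k) - p/(pk))^2 with p = (eps, eps*v) and k = (omega, omega*n)."""
    def ratio(energy, vel):
        pk = energy * omega * (1.0 - dot3(vel, n))
        return [energy / pk] + [energy * c / pk for c in vel]
    a = [x - y for x, y in zip(ratio(eps_prime, v_prime), ratio(eps, v))]
    return -(a[0] ** 2 - dot3(a[1:], a[1:]))

def three_dimensional_factor(v, v_prime, n, omega):
    """Square of the bracket in (98.8), divided by omega^2."""
    def term(vel):
        d = 1.0 - dot3(vel, n)
        return [c / d for c in cross(vel, n)]
    t = [x - y for x, y in zip(term(v_prime), term(v))]
    return dot3(t, t) / omega ** 2

theta, phi = 0.4, 1.1
n = (math.sin(theta) * math.cos(phi), math.sin(theta) * math.sin(phi), math.cos(theta))
v = (0.0, 0.0, 0.9)
v_prime = (0.5, -0.3, 0.6)
omega = 0.01

lhs = covariant_factor(v, v_prime, n, omega)
rhs = three_dimensional_factor(v, v_prime, n, omega)
print(lhs, rhs)
```

The energies drop out because $p/(pk)$ depends only on the velocity; the two printed values should agree to rounding accuracy.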
From this, on integrating $|M_{fi}|^2$ over the directions of emission of the photon and summing over the directions of polarization of the photon, we find the non-relativistic frequency distribution of the radiation:

$$d\sigma_\omega = \frac{2\alpha}{3\pi}\left(\frac{Z_1}{m_1} - \frac{Z_2}{m_2}\right)^2\mathbf{q}^2\,\frac{d\omega}{\omega}\,d\sigma_{el}.$$

The above results can be generalized to the case of simultaneous emission of several soft photons. For each photon there is an additional factor in $M_{fi}$, similar to the coefficient of $M_{fi}^{(el)}$ in (98.5). This is easily seen directly for the example of two photons, say. The lines of the two emitted photons have to be added on external electron lines, and in two different orders, so that a diagram with external line $p$ is replaced by two diagrams with the lines

[Diagrams: the photons $k_1$ and $k_2$ branch off the line $p$ in the two possible orders, the intermediate line carrying $p - k_1$ or $p - k_2$ and the remaining line $p - k_1 - k_2$]

respectively. They contain the factors

$$\frac{1}{2(pk_1 + pk_2)}\,\frac{1}{2pk_1}, \qquad \frac{1}{2(pk_1 + pk_2)}\,\frac{1}{2pk_2}$$

(the denominators of the electron propagators) respectively, and their sum is

$$\frac{1}{2pk_1}\,\frac{1}{2pk_2},$$

i.e. it is the product of two independent factors relating to the first and the second photon. Then, in the sum of all the diagrams, the terms combine (because of gauge invariance) to give the product of differences

$$\left(\frac{(p'e_1^*)}{(p'k_1)} - \frac{(pe_1^*)}{(pk_1)}\right)\left(\frac{(p'e_2^*)}{(p'k_2)} - \frac{(pe_2^*)}{(pk_2)}\right).$$

The cross-section for the process separates into factors in accordance with the factorization of the amplitude. The soft photons are therefore emitted independently. The cross-section for emission of $n$ soft photons can be written

$$d\sigma = d\sigma_{el}\,dw_1 \ldots dw_n, \tag{98.14}$$

where $dw_1$, $dw_2$, ... are the probabilities of individual emission of the photons $k_1$, $k_2$, .... When this formula is integrated over a finite range of values of the variables (frequencies and directions), the same for all quanta, a factor $1/n!$ must be included in order to take account of the identity of the photons.

If the emission cross-section (98.1) is integrated over frequencies in some finite range from $\omega_1$ to $\omega_2$, the resulting expression is

$$d\sigma \sim \alpha\log(\omega_2/\omega_1)\,d\sigma_{el}; \tag{98.15}$$

cf. (98.8).
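The factorization of the propagator denominators shown above for two photons holds for any number of soft photons: summing the products of denominators over all orders of emission gives the product of independent factors. A short sketch in exact rational arithmetic is given below; the numbers standing for $2(pk_i)$ are arbitrary, and the function names are ours.

```python
from fractions import Fraction
from itertools import permutations

def ordered_emission_sum(a):
    """Sum over all emission orders of 1 / [a_1 (a_1 + a_2) (a_1 + a_2 + a_3) ...],
    where a_i stands for 2(p k_i) and each partial sum is one propagator denominator."""
    total = Fraction(0)
    for order in permutations(a):
        term = Fraction(1)
        partial = Fraction(0)
        for x in order:
            partial += x
            term /= partial
        total += term
    return total

def independent_product(a):
    """Product of the independent factors 1 / a_i."""
    result = Fraction(1)
    for x in a:
        result /= x
    return result

two = [Fraction(3), Fraction(5)]
three = [Fraction(3), Fraction(5), Fraction(7)]
print(ordered_emission_sum(two), independent_product(two))
print(ordered_emission_sum(three), independent_product(three))
```

For two photons this reproduces the sum written out in the text; for three photons the six orderings combine in the same way.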
Here it is assumed that both frequencies are soft, and the possible values of $\omega_2$ are therefore limited by the condition for the method to be applicable. With logarithmic accuracy, however, we can put $\omega_2 \sim \varepsilon$, where $\varepsilon$ is the initial energy of the emitting particle. The values of $\omega_1$ have no lower limit, but on letting $\omega_1 \to 0$ we see that the cross-section for emission of all possible soft quanta is infinite. Let us investigate the significance of this "infra-red catastrophe" (F. Bloch and A. Nordsieck, 1937).

When

$$\alpha\log(\varepsilon/\omega_1) \gtrsim 1 \tag{98.16}$$

we have $d\sigma \gtrsim d\sigma_{el}$. This, however, means that perturbation theory is inapplicable, and $d\sigma$ cannot be calculated as a quantity of a higher order of smallness than $d\sigma_{el}$. Thus in this case the small parameter must be taken as $\alpha\log(\varepsilon/\omega_1)$, not $\alpha$. The derivation of formulae (98.5) and (98.6) from perturbation theory is therefore invalid at sufficiently low frequencies.

The classical formula for the intensity $dI$ (Fields (69.4)), on the other hand, becomes more nearly correct as $\omega$ decreases. Hence formula (98.1) remains valid if its meaning is made somewhat more classical. In this formula it has been assumed that one photon is emitted. Then the energy lost by the particle as radiation is equal to $\omega$ and the "relative energy loss cross-section" is

$$\omega\,d\sigma/\varepsilon \quad\text{or}\quad d\sigma_{el}\,dI/\varepsilon. \tag{98.17}$$

In reality, for sufficiently small $\omega$, the emission probability is not small, and the probability of emission of two or more photons is greater, not less, than the probability for one photon. Under these conditions, the expression (98.17) remains valid but the classical intensity $dI$ determines, instead of the probability of emission of one photon, the mean number of emitted photons

$$d\bar{n} = dI/\omega, \tag{98.18}$$

or, in a finite range of frequencies,

$$\bar{n} = \int dI/\omega. \tag{98.19}$$

Since the soft photons are emitted in a statistically independent manner (this being true in every approximation of perturbation theory), Poisson's formula can be applied to the process of multiple emission: the probability $w(n)$ that $n$ photons are emitted is given in terms of the mean number $\bar{n}$ by

$$w(n) = \bar{n}^n e^{-\bar{n}}/n!. \tag{98.20}$$

The cross-section for a process of scattering with emission of photons may be written

$$d\sigma = d\sigma_{el}\,w(n). \tag{98.21}$$

Since $\sum w(n) = 1$, $d\sigma_{el}$ is the total cross-section for scattering accompanied by the emission of any soft radiation. This is evident from a classical treatment; according to perturbation theory, however, $d\sigma_{el}$ is the purely elastic scattering cross-section. But perturbation theory is inapplicable here. Thus we find that $d\sigma_{el}$ calculated by perturbation theory as the elastic scattering cross-section actually includes the emission of any soft photons. The true value of the purely elastic scattering cross-section is zero: as $\omega_1 \to 0$, the mean number $\bar{n} \to \infty$, and according to (98.20) the probability of emission of any finite number of photons vanishes.†

PROBLEMS‡

PROBLEM 1. Find the spectral distribution of soft quanta emitted in ultra-relativistic electron-nucleus bremsstrahlung.

SOLUTION. Integration of (98.8) over $do_{\mathbf{k}}$ gives

$$d\sigma = \alpha F(\xi)\,(d\omega/\omega)\,d\sigma_{el}, \tag{1}$$

where

$$F(\xi) = \frac{2}{\pi}\left[\frac{2\xi^2 + 1}{\xi\sqrt{\xi^2 + 1}}\log\left(\xi + \sqrt{\xi^2 + 1}\right) - 1\right], \qquad \xi = \frac{|\mathbf{p}|}{m}\sin\frac{\theta}{2}, \tag{2}$$

$\mathbf{p}$ being the electron momentum and $\theta$ the scattering angle. In the ultra-relativistic case, the most important range of angles is

$$m^2\omega/\varepsilon^3 \ll \theta \ll m/\varepsilon; \tag{3}$$

the lower limit is given by the condition (98.10), and the upper limit is discussed below. Here $\xi \approx \varepsilon\theta/2m \ll 1$, so that $F(\xi) \approx (8/3\pi)\xi^2$, and the electron-nucleus elastic scattering cross-section is (see (80.10))

$$d\sigma_{el} \approx 4Z^2 r_e^2\,\frac{m^2}{\varepsilon^2}\,\frac{do}{\theta^4}. \tag{4}$$

The integral over angles, $\int d\theta/\theta$, diverges logarithmically; it is cut off below at angles $\theta \sim m^2\omega/\varepsilon^3$ and above at $\xi \sim 1$, i.e. at angles $\theta \sim m/\varepsilon$. When $\xi \to \infty$, $F \approx (4/\pi)\log\xi$ and the integral converges.
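The two limiting forms of $F(\xi)$ quoted above, and the normalization of the Poisson distribution (98.20), are easy to confirm numerically. The sketch below assumes $F(\xi)$ in the form $\frac{2}{\pi}\left[\frac{2\xi^2+1}{\xi\sqrt{\xi^2+1}}\log(\xi+\sqrt{\xi^2+1}) - 1\right]$; the function names and sample arguments are ours.

```python
import math

def soft_photon_F(xi):
    """Angular factor F(xi) for soft-photon emission, assumed in the form (2)."""
    root = math.sqrt(xi * xi + 1.0)
    return (2.0 / math.pi) * ((2.0 * xi * xi + 1.0) / (xi * root) * math.log(xi + root) - 1.0)

def poisson(n, mean):
    """Probability (98.20) of emitting n soft photons when the mean number is `mean`."""
    return mean ** n * math.exp(-mean) / math.factorial(n)

# Small-xi limit: F ~ (8 / 3 pi) xi^2.
small = 0.01
ratio_small = soft_photon_F(small) / (8.0 * small ** 2 / (3.0 * math.pi))

# Large-xi limit: F ~ (4 / pi) log(xi), up to terms without the large logarithm.
large = 1.0e6
ratio_large = soft_photon_F(large) / (4.0 / math.pi * math.log(large))

# Normalization of the Poisson distribution and suppression of the n = 0 term.
total_probability = sum(poisson(n, 2.5) for n in range(60))
no_photon_probability = poisson(0, 30.0)

print(ratio_small, ratio_large, total_probability, no_photon_probability)
```

The large-$\xi$ ratio approaches unity only logarithmically, which is the sense in which $(4/\pi)\log\xi$ holds with logarithmic accuracy; the last number illustrates how the purely elastic probability dies out as $\bar{n}$ grows.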
Thus we have, with logarithmic accuracy,

$$d\sigma = \frac{16}{3}\,Z^2\alpha r_e^2\,\frac{d\omega}{\omega}\,\log\frac{\varepsilon^2}{m\omega}, \tag{5}$$

which agrees with the logarithmic part of formula (93.17) (where we must put $\varepsilon \approx \varepsilon'$). Non-logarithmic accuracy can be achieved only by going beyond the quasi-classical range.

† We shall return to a more detailed discussion of this in §130, in connection with radiative corrections.

‡ The following applications of formula (98.7) are due to V. N. Baier and V. M. Galitskii (1964).

PROBLEM 2. For a collision between two ultra-relativistic electrons, determine (in the centre-of-mass system) the cross-section for simultaneous emission of two soft photons in opposite directions at small angles to the electron momenta.

SOLUTION. Photons moving in opposite directions are emitted by different electrons, each in the direction of its motion. The cross-section for simultaneous emission is

$$d\sigma = d\sigma_{el}\cdot\alpha F(\xi)\,\frac{d\omega_1}{\omega_1}\cdot\alpha F(\xi)\,\frac{d\omega_2}{\omega_2}, \qquad \xi = (\varepsilon/m)\sin\tfrac{1}{2}\theta, \tag{6}$$

where $\varepsilon$ is the energy of each electron, $\theta$ the scattering angle in the centre-of-mass system; $\theta$ is the same for each electron. No factor $\tfrac{1}{2}$ is needed in the cross-section, since the photons are certainly emitted in different directions. The cross-section for elastic scattering of the electrons through small angles in the centre-of-mass system, in the ultra-relativistic case, is the same as (4); cf. (81.11).

Unlike (1), the cross-section (6) behaves as $\theta\,d\theta$ when $\theta \to 0$, and the integral therefore converges. On the one hand, this enables us to extend the integration to $\theta = 0$, without any difficulty that the method might cease to be applicable. On the other hand, the main contribution to the integrated cross-section now comes from the region $\theta \sim m/\varepsilon$, not $\theta \ll m/\varepsilon$, and so the exact expression (2) has to be used. The result of integrating the cross-section over scattering angles is

$$d\sigma_{\omega_1\omega_2} = \frac{2}{\pi}\left[5 + \tfrac{7}{2}\zeta(3)\right]r_e^2\alpha^2\,\frac{d\omega_1\,d\omega_2}{\omega_1\omega_2} = 5.9\,r_e^2\alpha^2\,\frac{d\omega_1\,d\omega_2}{\omega_1\omega_2},$$

the value of the Riemann zeta function being $\zeta(3) = 1.202$.

§99. The method of equivalent photons

Let us compare two processes described by the diagrams

[Diagrams (a) and (b)] (99.1)

where the circles represent the whole of the internal parts of the diagrams. Diagram (a) represents a collision between a photon $k$ ($k^2 = 0$) and a particle having 4-momentum $q$ (and mass $m$; $q^2 = m^2$). The system resulting from the collision is a particle or group of particles having total 4-momentum $Q$. Diagram (b) represents a collision between the same particle $q$ and another particle having 4-momentum $p$ and mass $M$ ($p^2 = M^2$). After the collision, the latter particle has 4-momentum $p'$, and the same system $Q$ is formed.

The second process may be regarded as a collision between the particle $q$ and a virtual photon emitted by the particle $p$ and having momentum $k = p - p'$ ($k^2 < 0$). If $|k^2|$ is small, the virtual photon is not greatly different from a real photon. Such a situation is evidently possible in collisions of very fast particles: the electromagnetic field of a charged particle moving with $v \approx 1$ is almost transverse, and therefore has properties similar to those of the field of a light wave. Under these conditions, the cross-section for process (b) can be expressed in terms of that for process (a).†

We shall thus suppose that the particle $M$ is ultra-relativistic, with energy (in the rest frame of the particle $m$) $\varepsilon \gg M$. If the masses of the colliding particles $m$ and $M$ are different, we shall take the case where $m < M$.

The amplitude of process (a), which involves a real photon, can be written

$$M_{fi}^{(r)} = -e\sqrt{4\pi}\,(e_\mu J^\mu), \tag{99.2}$$

where $e_\mu$ is the photon polarization 4-vector and $J^\mu$ the transition current corresponding to the vertex (the circle) in the diagram. The amplitude of process (b) is

$$M_{fi} = Ze^2\,\frac{4\pi}{k^2}\,(j_\mu J^\mu), \tag{99.3}$$

where $j^\mu$ is the transition current of the particle $M$ (the lower vertex in the diagram), and $Ze$ is the charge on this particle.
The current $J$ is a function of $k = Q - q$, and is therefore not the same in the two cases, since $k^2 = 0$ in (99.2) and $k^2 \neq 0$ in (99.3). But if, in the second case,

$$|k^2| \ll m^2, \tag{99.4}$$

we can here also take $J$ for $k^2 = 0$.

The change in the momentum of the particle $M$ when a virtual photon is emitted, $\mathbf{p} - \mathbf{p}' = \mathbf{k}$, is small in comparison with its initial momentum $|\mathbf{p}| \approx \varepsilon$; we can therefore put $p = p'$ in the transition current $j$. That is, the motion of the particle $M$ may be regarded as uniform motion in a straight line. Since such a motion is quasi-classical, the corresponding current is independent of the spin of the particle:‡

$$j^\mu = 2p^\mu. \tag{99.5}$$

The condition for the current to be transverse ($jk = 0$) now gives $\varepsilon\omega - p_x k_x = 0$, the $x$-axis being taken in the direction of $\mathbf{p}$. Hence

$$\omega = vk_x, \tag{99.6}$$

where $v = p_x/\varepsilon$ is the velocity of the particle $M$. Since

$$-k^2 = -\omega^2 + k_x^2 + \mathbf{k}_\perp^2 \approx \omega^2(1 - v^2) + \mathbf{k}_\perp^2, \tag{99.7}$$

where $\mathbf{k}_\perp$ is the component of the vector $\mathbf{k}$ transverse to the $x$-axis, the condition (99.4) is equivalent to the inequality $|\mathbf{k}_\perp| \ll m$ and to a considerably weaker inequality for $\omega$: $\omega \ll m/\sqrt{1 - v^2}$.

† The method given below is due to C. F. von Weizsäcker and E. J. Williams (1934); the basic idea had been stated earlier by E. Fermi (1924).

‡ When the wave functions are normalized to one particle in unit volume, the current $j^\mu = (1, \mathbf{v})$, where $\mathbf{v}$ is the velocity. We have, however, decided (§64) to omit the normalization factor $1/\sqrt{2\varepsilon}$ in the wave functions. Accordingly, $j^\mu$ must include a further factor $2\varepsilon$, and this gives the expression (99.5).

From the condition for the current $J$ to be transverse ($Jk = 0$) we have, using (99.6),

$$J_0 = \frac{J_x}{v} + \frac{\mathbf{k}_\perp\cdot\mathbf{J}_\perp}{\omega}.$$

We therefore obtain for the scalar product

$$(jJ) = 2(J_0\varepsilon - J_x p_x) \approx \frac{2\varepsilon}{\omega}\left(\mathbf{k}_\perp\cdot\mathbf{J}_\perp + \frac{\omega M^2}{\varepsilon^2}\,J_x\right). \tag{99.8}$$

The product $Je$ in (99.2) can be expanded by taking the polarization 4-vector of the real photon in the three-dimensionally transverse gauge: $ek = -\mathbf{e}\cdot\mathbf{k} = 0$, whence $e_x \approx -\mathbf{e}_\perp\cdot\mathbf{k}_\perp/\omega$.
Then

$$Je = -\mathbf{e}_\perp\cdot\left(\mathbf{J}_\perp - \frac{\mathbf{k}_\perp}{\omega}\,J_x\right). \tag{99.9}$$

The expressions (99.8) and (99.9) are proportional if the second terms in the parentheses are negligible. Since the current $J$ pertains to the upper vertex of the diagram (99.1b), it does not depend on the direction of $\mathbf{p}$; hence $J_x$ and $\mathbf{J}_\perp$ must be taken to be quantities of the same order. For the terms in question to be negligible, therefore, we must have $|\mathbf{k}_\perp| \ll \omega$ and $\omega \ll \varepsilon^2|\mathbf{k}_\perp|/M^2$; these conditions are compatible with the previous ones on $\mathbf{k}_\perp$ and $\omega$.

Assuming that in (99.9) the photon is polarized in the plane of $x$ and $\mathbf{k}$ (so that $\mathbf{e}_\perp \parallel \mathbf{k}_\perp$) and noting that the conditions stated imply that $\mathbf{e}_\perp^2 \approx \mathbf{e}^2 = 1$, we now have

$$M_{fi} = M_{fi}^{(r)}\,\frac{Ze\sqrt{4\pi}}{k^2}\,\frac{2\varepsilon|\mathbf{k}_\perp|}{\omega}. \tag{99.10}$$

In accordance with the previous discussion, the following conditions are here assumed satisfied:

$$|\mathbf{k}_\perp| \ll \omega \ll m\gamma, \tag{99.11}$$

$$\omega/\gamma^2 \ll |\mathbf{k}_\perp| \ll m, \tag{99.12}$$

with the notation $\gamma = \varepsilon/M = 1/\sqrt{1 - v^2}$.

From this we can find the relation between the corresponding cross-sections. According to the general formula (64.18) we have (in the rest frame of the particle $m$)

$$d\sigma_r = |M_{fi}^{(r)}|^2\,(2\pi)^4\delta^{(4)}(P_f - P_i)\,\frac{1}{4m\omega}\,d\rho_Q,$$

where $d\rho_Q$ represents the statistical weights of the particles $Q$. The corresponding expression for process (b) has the flux factor $1/4m\varepsilon$ and the additional statistical weight $d^3p'/(2\pi)^3 2\varepsilon$ of the particle $M$. Using (99.10) and (99.7), we find

$$d\sigma = d\sigma_r\,n(\mathbf{k})\,d^3p', \tag{99.13}$$

where

$$n(\mathbf{k}) = \frac{Z^2e^2}{\pi^2}\,\frac{\mathbf{k}_\perp^2}{\omega\left(\mathbf{k}_\perp^2 + \omega^2/\gamma^2\right)^2}. \tag{99.14}$$

Here $d\sigma_r$ is the cross-section for process (a), resulting from a collision between a real photon and a particle at rest, in which a system of particles $Q$ is formed which have momenta in certain ranges; $d\sigma$ refers to the process (b) of formation of the same system $Q$ when a fast particle (of mass $M$) collides with the same particle at rest, loses momentum $\mathbf{p} - \mathbf{p}' = \mathbf{k}$, and remains in the range $d^3p'$ of values of $\mathbf{p}'$. The factor $n(\mathbf{k})$ in (99.13) may be interpreted as the number density (in $\mathbf{k}$-space) of the photons equivalent to the electromagnetic field of the fast particle.

The integration over $d^3p'$ is equivalent to one over $d^3k = d\omega\,d^2k_\perp$.
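The integration over $d^2k_\perp$ of a density of the form $n(\mathbf{k}) \propto \mathbf{k}_\perp^2/(\mathbf{k}_\perp^2 + \omega^2/\gamma^2)^2$ leads to the integral $\int k_\perp^3\,dk_\perp/(k_\perp^2 + a^2)^2$ with $a = \omega/\gamma$, which grows only logarithmically with the upper limit. The sketch below compares the closed form of this integral with a direct numerical evaluation and with the leading logarithm $\log(K/a)$; the cut-off $K$, the value of $a$ and the function names are illustrative assumptions of ours.

```python
import math

def kperp_integral_exact(K, a):
    """Closed form of the integral of k^3 / (k^2 + a^2)^2 from 0 to K."""
    x2 = (K / a) ** 2
    return 0.5 * (math.log1p(x2) - x2 / (1.0 + x2))

def kperp_integral_numeric(K, a, steps=20000):
    """Simpson's rule in t = log(k); the region k < 1e-4 * a contributes negligibly."""
    t0, t1 = math.log(a * 1.0e-4), math.log(K)
    h = (t1 - t0) / steps  # steps must be even

    def f(t):
        k = math.exp(t)
        return k ** 4 / (k * k + a * a) ** 2  # includes the Jacobian dk = k dt

    total = f(t0) + f(t1)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(t0 + i * h)
    return total * h / 3.0

K, a = 1.0, 0.01  # upper limit ~ m and a = omega / gamma, in units of m
exact = kperp_integral_exact(K, a)
numeric = kperp_integral_numeric(K, a)
leading_log = math.log(K / a)
print(exact, numeric, leading_log)
```

The exact value differs from $\log(K/a)$ by a constant close to $-\tfrac{1}{2}$, i.e. by a quantity of order unity next to the large logarithm, which is the accuracy to which a result of this kind is defined.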
On integrating over $d^2k_\perp$, we obtain the cross-section for a process in which the total energy $E$ of the system of particles $Q$ lies in a given range $dE = d\omega$ ($E - m = \varepsilon - \varepsilon' = \omega$, where $\varepsilon$ and $\varepsilon'$ are the initial and final energies of the particle $M$). Integration over the directions of $\mathbf{k}_\perp$ signifies averaging over the directions of polarization of the incident photon (and multiplying by $2\pi$). The result is

$$d\sigma = n(\omega)\,d\sigma_r\,d\omega, \tag{99.15}$$

where

$$n(\omega) = \int n(\mathbf{k})\,2\pi k_\perp\,dk_\perp = \frac{2Z^2e^2}{\pi\omega}\int\frac{k_\perp^3\,dk_\perp}{\left(k_\perp^2 + \omega^2/\gamma^2\right)^2}.$$

The integral over $dk_\perp$ diverges when $k_\perp$ is large, but the divergence is only logarithmic. This enables us (within the range of validity of the method) to obtain a result in the logarithmic approximation: it is assumed that not only the argument of the logarithm but the logarithm itself is large. To this accuracy, it is sufficient to take as the upper limit of integration $k_{\perp\max} \sim m$, the upper limit of the inequality (99.12). Integration then gives for the frequency distribution of equivalent photons (in ordinary units)

$$n(\omega)\,d\omega = \frac{2}{\pi}\,Z^2\alpha\,\log\frac{\gamma mc^2}{\hbar\omega}\,\frac{d\omega}{\omega}. \tag{99.16}$$

The approximation used here signifies that the numerical coefficient in the argument of the logarithm remains indeterminate. The inclusion of such a coefficient would mean the addition of a relatively small quantity ($\sim 1$) to the large logarithm and would be superfluous having regard to the accuracy of the method.

PROBLEMS

PROBLEM 1. From the photon-electron scattering cross-section, find the bremsstrahlung cross-section in a collision between a fast electron and a nucleus.

SOLUTION.
In the frame of reference $K_1$ in which the electron is at rest before the collision, the process may be regarded as the scattering by the electron of the equivalent photons of the field of the nucleus.† According to (86.10) the cross-section for scattering of a photon by an electron in the frame $K_1$ is

$$d\sigma_{sc}(\omega_1, \omega_1') = \pi r_e^2\,\frac{m\,d\omega_1'}{\omega_1^2}\left[\frac{\omega_1}{\omega_1'} + \frac{\omega_1'}{\omega_1} + \left(\frac{m}{\omega_1'} - \frac{m}{\omega_1}\right)^2 - 2m\left(\frac{1}{\omega_1'} - \frac{1}{\omega_1}\right)\right], \tag{1}$$

where $\omega_1$ and $\omega_1'$ are the initial and final energies of the photon in this frame. The bremsstrahlung cross-section in the frame $K_1$ is

$$d\sigma_{br}(\omega_1') = \int d\omega_1\,n(\omega_1)\,d\sigma_{sc}(\omega_1, \omega_1'), \tag{2}$$

where $n(\omega_1)$ is the function (99.16). Since the cross-section is invariant, the change to a frame $K$ in which the nucleus is at rest involves only a change in the frequency $\omega_1'$. The frequencies $\omega_1'$ and $\omega'$ in the frames $K_1$ and $K$ are related by the Doppler formula

$$\omega' = \gamma\omega_1'(1 - v\cos\theta_1), \qquad \gamma = 1/\sqrt{1 - v^2}, \tag{3}$$

where $\theta_1$ is the scattering angle in the frame $K_1$. The same angle relates $\omega_1$ and $\omega_1'$ according to (86.8):

$$\frac{1}{\omega_1'} - \frac{1}{\omega_1} = \frac{1}{m}(1 - \cos\theta_1). \tag{4}$$

From (3) and (4) we have

$$\omega_1' = \omega_1\varepsilon'/\varepsilon, \tag{5}$$

where $\varepsilon$ ($= m\gamma$) and $\varepsilon'$ are the initial and final energies of the electron in the frame $K$ ($\varepsilon - \varepsilon' = \omega'$). Substituting (5) in (1), we find

$$d\sigma_{sc} = \pi r_e^2\,\frac{m\,d\omega'}{\varepsilon\omega_1}\left(\frac{\varepsilon}{\varepsilon'} + \frac{\varepsilon'}{\varepsilon} + \frac{m^2\omega'^2}{\varepsilon'^2\omega_1^2} - \frac{2m\omega'}{\omega_1\varepsilon'}\right).$$

This expression is to be substituted in (2) and the integration over $d\omega_1$ carried out with $\omega'$ (i.e. $\varepsilon'$) fixed, the range being from $\omega_{1,\min} = m\omega'/2\varepsilon'$ to $\omega_{1,\max} = 2\varepsilon\omega'/m$; these values are given by (3) and (4) with $\theta_1 = \pi$ and $\theta_1 = 0$ respectively. Because the integral converges rapidly for large $\omega_1$, the main contribution to it comes from the range of $\omega_1$ near the lower limit, i.e. we may put $\omega_{1,\max} \to \infty$.

† The scattering of virtual photons by the nucleus (in the rest frame of the nucleus) is excluded by the large mass of the latter: the scattering cross-section tends to zero with increasing mass of the scattering particle.
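The integration over $\omega_1$ described above can be sketched numerically. With the logarithm of (99.16) taken outside the integral at $\omega_1 \sim \omega_{1,\min}$ (an assumption of the logarithmic approximation), what remains is $\int_{\omega_{1,\min}}^{\infty} d\omega_1\,\omega_1^{-2}\left[A + B/\omega_1^2 - C/\omega_1\right]$ with $A = \varepsilon/\varepsilon' + \varepsilon'/\varepsilon$, $B = m^2\omega'^2/\varepsilon'^2$, $C = 2m\omega'/\varepsilon'$. The code below evaluates it with the substitution $x = \omega_{1,\min}/\omega_1$ and compares with the expression $(2\varepsilon'/m\omega')(\varepsilon/\varepsilon' + \varepsilon'/\varepsilon - \tfrac{2}{3})$; the function names and sample energies are ours.

```python
def bracket_integral(eps, eps_prime, m=1.0, steps=2000):
    """Integral of [A + B/w^2 - C/w] / w^2 over w from w_min to infinity (Simpson in x = w_min/w)."""
    omega_prime = eps - eps_prime
    w_min = m * omega_prime / (2.0 * eps_prime)
    A = eps / eps_prime + eps_prime / eps
    B = (m * omega_prime / eps_prime) ** 2
    C = 2.0 * m * omega_prime / eps_prime

    def f(x):
        # 1/w = x / w_min, and dw / w^2 = dx / w_min
        return A + B * (x / w_min) ** 2 - C * (x / w_min)

    h = 1.0 / steps  # steps must be even
    total = f(0.0) + f(1.0)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0 / w_min

def closed_form(eps, eps_prime, m=1.0):
    """(2 eps' / (m omega')) * (eps/eps' + eps'/eps - 2/3)."""
    omega_prime = eps - eps_prime
    return 2.0 * eps_prime / (m * omega_prime) * (eps / eps_prime + eps_prime / eps - 2.0 / 3.0)

eps, eps_prime = 1000.0, 700.0  # illustrative energies in units of m
numeric_value = bracket_integral(eps, eps_prime)
analytic_value = closed_form(eps, eps_prime)
print(numeric_value, analytic_value)
```

Multiplying the result by $2Z^2\alpha r_e^2\,(m\,d\omega'/\varepsilon)$ and by the logarithm then gives a cross-section of the form $4Z^2\alpha r_e^2\,(d\omega'/\omega')(\varepsilon'/\varepsilon)(\varepsilon/\varepsilon' + \varepsilon'/\varepsilon - \tfrac{2}{3})\log(\ldots)$.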
Calculating the integral with logarithmic accuracy,† we have

$$d\sigma_{br} = 4r_e^2\alpha Z^2\,\frac{d\omega'}{\omega'}\,\frac{\varepsilon'}{\varepsilon}\left(\frac{\varepsilon}{\varepsilon'} + \frac{\varepsilon'}{\varepsilon} - \frac{2}{3}\right)\log\frac{\varepsilon\varepsilon'}{m\omega'}.$$

For this result to be valid, besides the condition $\varepsilon \gg m$ (ultra-relativistic electron), the condition (99.11) must also be satisfied: the frequencies $\omega_1 \sim \omega_{1,\min}$ important in the integration must be $\ll \varepsilon$. Hence $\varepsilon - \varepsilon' = \omega' \ll \varepsilon\varepsilon'/m$. Under these conditions the result agrees (to logarithmic accuracy) with (93.17), as it should.

† This means that, by one integration by parts, the term containing the large logarithm is separated and the remaining terms then neglected. This operation is equivalent to taking the logarithm $\log(\varepsilon/\omega_1)$ outside the integral, with $\omega_1 \sim \omega_{1,\min}$.

PROBLEM 2. The same as Problem 1, but for electron-electron bremsstrahlung.

SOLUTION. In this case, the virtual photon can be scattered either by the fast electron or by the recoil electron; the photons equivalent to the field of either electron are scattered by the other. The scattering of virtual photons by the fast electron gives the cross-section $d\sigma_{br}^{(1)}$, which is equal to the cross-section for an electron and a nucleus with $Z = 1$. The scattering of virtual photons by the recoil electron gives a cross-section

$$d\sigma_{br}^{(2)} = \int d\omega\,n(\omega)\,d\sigma_{sc}(\omega, \omega'),$$

with $d\sigma_{sc}(\omega, \omega')$ given by (1) with the appropriate change of notation for the frequencies. The range of values of $\omega$ for given $\omega'$ is (cf. (4))

$$\omega' \le \omega \le \infty \quad\text{for}\quad \omega' > \tfrac{1}{2}m, \qquad \omega' \le \omega \le \frac{m\omega'}{m - 2\omega'} \quad\text{for}\quad \omega' < \tfrac{1}{2}m.$$

When $\omega' < \tfrac{1}{2}m$, integration with respect to $\omega$ gives

$$d\sigma_{br}^{(2)} = \frac{16}{3}\,\alpha r_e^2\,\frac{d\omega'}{\omega'}\left(1 - \frac{\omega'}{m} + \frac{\omega'^2}{m^2}\right)\log\frac{\varepsilon}{\omega'},$$

in agreement with (97.4). But when $\omega' > \tfrac{1}{2}m$ we must distinguish the cases $\omega' \sim m$ and $\omega' \sim \varepsilon \gg m$. In the former case,

$$d\sigma_{br}^{(2)} = \frac{2}{3}\,\alpha r_e^2\,\frac{m\,d\omega'}{\omega'^2}\left(4 - \frac{m}{\omega'} + \frac{m^2}{4\omega'^2}\right)\log\frac{\varepsilon}{m},$$

in agreement with (97.3); in the argument of the logarithm we have, with sufficient accuracy, replaced $\varepsilon/\omega'$ by $\varepsilon/m$. In the case $\omega' \sim \varepsilon$, the method of equivalent photons is not valid for calculating $d\sigma_{br}^{(2)}$. The frequency $\omega$ of the virtual photons takes values beginning with $\omega'$, and the condition (99.11) is therefore not satisfied when $\omega \sim \omega' \sim \varepsilon$.

PROBLEM 3. Determine the total pair production cross-section in a photon-nucleus collision from the pair production cross-section in a collision between two photons.

SOLUTION. The energy of the photon in the rest frame $K$ of the nucleus is $\omega \gg m$. If we change to a frame $K_0$ in which the nucleus moves to meet the photon at a speed $v_0$ such that $1/\sqrt{1 - v_0^2} = \omega/2m$, then in this frame the photon energy is

$$\omega_0 = \omega\,\frac{1 - v_0}{\sqrt{1 - v_0^2}} \approx \tfrac{1}{2}\omega\sqrt{1 - v_0^2} = m.$$

The required cross-section $\sigma$ is calculated in the frame $K_0$ as the pair production cross-section in collisions between an incident photon $\omega_0$ and the equivalent photons of the nucleus, whose energy we denote by $\omega'$:

$$\sigma = \int\sigma_{\gamma\gamma}\,n(\omega')\,d\omega',$$

where $\sigma_{\gamma\gamma}$ is the cross-section for pair production by two photons and is given by §88, Problem, formula (1), with $v = \sqrt{1 - m^2/\omega_0\omega'} = \sqrt{1 - m/\omega'}$. Changing to the variable $v$ instead of $\omega'$, we have

$$\sigma = 2r_e^2\alpha Z^2\int_0^1 v\log\left[\frac{\omega(1 - v^2)}{m}\right]\left\{(3 - v^4)\log\frac{1 + v}{1 - v} - 2v(2 - v^2)\right\}dv.$$

Because of the convergence at the upper limit, the integral may be taken over the whole range from the reaction threshold $\omega' = m$ ($v = 0$) to $\omega' = \infty$ ($v = 1$) and with logarithmic accuracy (replacing $\log[\omega(1 - v^2)/m]$ by its value for $v = 0$ and taking it outside the integral). The result is

$$\sigma = \frac{28}{9}\,\alpha Z^2 r_e^2\log(\omega/m),$$

in agreement with (94.6); this formula is valid when $\log(\omega/m) \gg 1$.

§100. Pair production in collisions between particles

Electron-positron pair production in a collision between two charged particles is described by diagrams of two types:

[Diagrams (a) and (b)] (100.1)

The two upper continuous lines in each diagram correspond to the colliding particles, and the lowest line to the pair formed.
Let us consider a collision of two heavy particles (nuclei) in the ultra-relativistic case. The change of the motion of the particles themselves in such a collision may be neglected, i.e. they may be regarded as external-field sources.† This corresponds to two diagrams of the first type:

[Diagrams] (100.2)

where $q^{(1)}$, $q^{(2)}$ are the "momenta" of the Fourier components of the fields of the two particles.

† The collision of two light particles (electrons), where the change in the motion cannot be neglected, is a considerably more complicated case; see the book by Baier et al. cited in §93.

The potential $A^\mu = (A_0, \mathbf{A})$ due to a classical particle moving with a uniform velocity $\mathbf{v}$ satisfies the equations

$$\Box A_0 = -4\pi Ze\,\delta(\mathbf{r} - \mathbf{v}t - \mathbf{r}_0), \qquad \Box\mathbf{A} = -4\pi Ze\mathbf{v}\,\delta(\mathbf{r} - \mathbf{v}t - \mathbf{r}_0).$$

Its Fourier components are

$$A_0(\omega, \mathbf{k}) = -\frac{8\pi^2 Ze}{\omega^2 - \mathbf{k}^2}\,e^{-i\mathbf{k}\cdot\mathbf{r}_0}\,\delta(\omega - \mathbf{k}\cdot\mathbf{v}),$$

and similarly for $\mathbf{A}(\omega, \mathbf{k})$. In four-dimensional form,

$$A^\mu(q) = -\frac{8\pi^2 Ze}{q^2}\,U^\mu e^{iqx_0}\,\delta(Uq),$$

where $U$ is the 4-velocity of the particle, and the 4-vector $x_0 = (0, \mathbf{r}_0)$. If nucleus 1 is at rest at the origin ($\mathbf{r}_0^{(1)} = 0$), then $\boldsymbol{\rho} \equiv \mathbf{r}_0^{(2)}$ is the impact parameter vector (in a plane perpendicular to the direction of motion of nucleus 2). This expression for $A^\mu(q)$ is to be used in writing analytically the diagrams (100.2).

There is, however, no need to use this method for the actual calculations in the present case. The pair production cross-section may be determined by the method of equivalent photons, using the already known photon-nucleus pair production cross-section.

The replacement of the field of one particle (the first, say) by a spectrum of equivalent photons implies that in the diagrams (100.2) the lines $q^{(1)}$ are regarded as real-photon lines. The two diagrams then become identical with the diagrams corresponding to pair production by a photon at nucleus 2. When $\varepsilon_+, \varepsilon_- \gg m$, the cross-section for the latter process is given by (94.5).
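The folding of a photon-nucleus cross-section with the spectrum (99.16) can be sketched as a bookkeeping of numerical factors. The code assumes, as a hypothesis not derived in this section, that (94.5) has the leading-logarithm form $4Z_2^2\alpha r_e^2\,(d\varepsilon_+/\omega^3)(\varepsilon_+^2 + \varepsilon_-^2 + \tfrac{2}{3}\varepsilon_+\varepsilon_-)\log(\ldots)$ with $\omega = \varepsilon_+ + \varepsilon_-$, and that the spectrum is $(2/\pi)Z_1^2\alpha\log(m\gamma/\omega)\,d\omega/\omega$; all names are ours.

```python
import math
from fractions import Fraction

def energy_sharing_factor():
    """Integral over x = eps_plus / eps from 0 to 1 of x^2 + (1-x)^2 + (2/3) x (1-x)."""
    # The integrals of x^2 and (1-x)^2 are 1/3 each; the integral of x(1-x) is 1/6.
    return Fraction(1, 3) + Fraction(1, 3) + Fraction(2, 3) * Fraction(1, 6)

def double_log_integral(gamma, steps=1000):
    """Integral of log(xi) * (log(gamma) - log(xi)) d(xi)/xi from 1 to gamma, via t = log(xi)."""
    L = math.log(gamma)
    h = L / steps  # steps must be even

    def f(t):
        return t * (L - t)

    total = f(0.0) + f(L)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0

# Prefactor 4 * (2/pi) = 8/pi from the product of the two assumed leading-log forms.
sharing = energy_sharing_factor()
# Coefficient of log^3(gamma) / pi after both energy integrations.
cubic_log_coefficient = Fraction(8) * sharing * Fraction(1, 6)

gamma = 1.0e3  # illustrative Lorentz factor
numeric_logs = double_log_integral(gamma)
print(sharing, cubic_log_coefficient, numeric_logs, math.log(gamma) ** 3 / 6.0)
```

Under these assumptions the energy-sharing integral contributes $7/9$, the double-logarithmic integral contributes $\tfrac{1}{6}\log^3\gamma$, and the total cross-section acquires the coefficient $28/27\pi$ in front of $r_e^2(Z_1Z_2\alpha)^2\log^3\gamma$.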
Multiplying this cross-section by the spectrum (99.16) of equivalent photons of the first nucleus, we obtain (with logarithmic accuracy) for the differential cross-section for pair production in a collision between particles

$$d\sigma = \frac{8}{\pi}\, r_e^2 (Z_1 Z_2 \alpha)^2\, \frac{d\varepsilon_+\, d\varepsilon_-}{(\varepsilon_+ + \varepsilon_-)^4} \left(\varepsilon_+^2 + \varepsilon_-^2 + \tfrac{2}{3}\varepsilon_+\varepsilon_-\right) \log\frac{\varepsilon_+\varepsilon_-}{m(\varepsilon_+ + \varepsilon_-)}\, \log\frac{m\gamma}{\varepsilon_+ + \varepsilon_-}, \tag{100.3}$$

where $\gamma = 1/\sqrt{1 - v^2} \gg 1$. Here it is assumed that

$$m \ll \varepsilon_+, \varepsilon_- \ll m\gamma; \tag{100.4}$$

the right-hand inequality is the condition for the method of equivalent photons to be applicable.

The range defined by the inequalities (100.4) is the same as the electron and positron energy range which is important in the integration of (100.3). On integration over $d\varepsilon_+$ or $d\varepsilon_-$ for a given sum $\varepsilon = \varepsilon_+ + \varepsilon_-$ ($\gg m$), the important range is the one near the upper limit; omitting terms which do not contain the large logarithm, we find

$$d\sigma = \frac{56}{9\pi}\, r_e^2 (Z_1 Z_2 \alpha)^2 \log\frac{\varepsilon}{m}\, \log\frac{m\gamma}{\varepsilon}\, \frac{d\varepsilon}{\varepsilon}.$$

The integral with respect to $\varepsilon$ over the range (100.4) diverges as the cube of the logarithm, but only as the square of the logarithm at the boundaries of the range. In the logarithmic approximation ($\log\gamma \gg 1$), therefore, the range (100.4) is in fact the most important one, and the integral can be taken over the range from $m$ to $m\gamma$. Since

$$\int_1^\gamma \log\xi\,(\log\gamma - \log\xi)\,\frac{d\xi}{\xi} = \tfrac{1}{6}\log^3\gamma,$$

the total pair production cross-section is

$$\sigma = \frac{28}{27\pi}\, r_e^2 (Z_1 Z_2 \alpha)^2 \log^3\gamma \tag{100.5}$$

(L. D. Landau and E. M. Lifshitz, 1934).

Let us now consider the case of non-relativistic velocities of the colliding nuclei. The change in their motion due to their interaction then becomes important, and the main contribution to the pair production cross-section comes from diagrams of the second type in (100.1). There are four such diagrams: two of them are

[Diagrams (100.6): nucleus lines $p_1 \to p_1'$ and $p_2 \to p_2'$; the virtual photon $k$, emitted by the second nucleus, produces the pair $p_-$, $-p_+$]

and the other two are similar except that the virtual photon $k$ (which produces the pair) is emitted by the first nucleus and not by the second.†

We shall suppose that the energy of the pair is small compared with the kinetic energy of the relative motion of the nuclei in their centre-of-mass system:

$$\varepsilon_+ + \varepsilon_- \ll \tfrac{1}{2}Mv^2, \tag{100.7}$$

where $v$ is the initial relative velocity and $M = M_1 M_2/(M_1 + M_2)$ is the reduced mass of the nuclei. Then the reciprocal effect of pair production on the motion of the nuclei can be neglected. If the electron-positron line in the diagrams (100.6) is omitted, the remainder will represent the emission by the colliding particles of a low-frequency virtual photon ($\omega = \varepsilon_+ + \varepsilon_-$). Thus we return to the situation discussed in §98 for the emission of a real soft photon, and can use the formula (98.13) derived there for the non-relativistic case (except that the amplitude $\sqrt{4\pi}\,e^*$ of the real photon will be replaced by the virtual photon propagator).‡ Thus the amplitude of the whole process of pair production becomes

$$M_{fi} = M_{fi}^{(\mathrm{el})}\left(\frac{Z_1 e}{M_1} - \frac{Z_2 e}{M_2}\right)\frac{1}{\omega}\, q^\lambda D_{\lambda\mu}(k)\,\bigl[-ie\,\bar{u}(p_-)\gamma^\mu u(-p_+)\bigr], \tag{100.8}$$

where $q = (0, \mathbf{q})$, $\mathbf{q} = M(\mathbf{v}' - \mathbf{v})$. As usual, the photon propagator in the non-relativistic case is to be taken in the gauge (76.14).

† Altogether 36 diagrams correspond to pair production in a collision between two electrons: $2! \times 3! = 12$ diagrams of type (a), differing by interchanges of the two initial and three final electrons, and $2 \times 2! \times 3! = 24$ diagrams of type (b), obtained in a similar way from the two diagrams (100.6).
From the amplitude (100.8) we find the cross-section for the process:

$$d\sigma = d\sigma_{\mathrm{el}} \cdot e^2\left(\frac{Z_1 e}{M_1} - \frac{Z_2 e}{M_2}\right)^2 \frac{(4\pi)^2\,\bigl|\bar{u}(p_-)(\boldsymbol{\gamma}\cdot\mathbf{Q})\,u(-p_+)\bigr|^2}{\omega^2(\omega^2 - \mathbf{k}^2)^2}\, \frac{d^3p_+\, d^3p_-}{2\varepsilon_+\cdot 2\varepsilon_-\,(2\pi)^6}, \tag{100.9}$$

where

$$\omega = \varepsilon_+ + \varepsilon_-, \qquad \mathbf{k} = \mathbf{p}_+ + \mathbf{p}_-, \qquad \mathbf{Q} = \mathbf{q} - \frac{1}{\omega^2}\,\mathbf{k}\,(\mathbf{q}\cdot\mathbf{k});$$

$d\sigma_{\mathrm{el}}$ is the cross-section for elastic scattering of one nucleus by the other, in their centre-of-mass system, and is given by Rutherford's formula:‡

$$d\sigma_{\mathrm{el}} = 4(Z_1 Z_2 e^2)^2 M^2\, \frac{do'}{q^4} \approx 4(Z_1 Z_2 e^2)^2\, \frac{dq_y\, dq_z}{v^2 q^4}; \tag{100.10}$$

the last equation assumes that the deviation of the nuclei from their original direction of motion (the $x$-axis) is small. Substituting this expression in (100.9) and summing over polarizations of the pair in the usual manner, we obtain

$$d\sigma = (Z_1 Z_2 e^2)^2\, \frac{e^2}{4\pi^4 v^2}\left(\frac{Z_1 e}{M_1} - \frac{Z_2 e}{M_2}\right)^2 \operatorname{tr}\{(\gamma p_- + m)(\boldsymbol{\gamma}\cdot\mathbf{Q})(\gamma p_+ - m)(\boldsymbol{\gamma}\cdot\mathbf{Q})\}\, \frac{d^3p_+\, d^3p_-}{\varepsilon_+\varepsilon_-\,\omega^2(\omega^2 - \mathbf{k}^2)^2}\, \frac{dq_y\, dq_z}{q^4}. \tag{100.11}$$

The remaining calculation is made in the approximation in which all the logarithms occurring in the integration are assumed large. We shall see that, to this accuracy, pair energies $\varepsilon_+, \varepsilon_- \gg m$ and angles $\theta$ between $\mathbf{p}_+$ and $\mathbf{p}_-$ such that

$$m/\varepsilon \ll \theta \ll 1 \tag{100.12}$$

are the most important.

† In the non-relativistic case, the photon momentum is small in comparison with the change in momentum of the radiating particles ($|\delta\mathbf{p}| \sim \omega/v$), and can therefore be neglected, in comparison with $\delta\mathbf{p}$, even when the photon energy is not neglected. This applies a fortiori here to the virtual photon, for which the four-dimensional square $k^2 = (p_+ + p_-)^2 > 0$, so that $|\mathbf{k}| < \omega$. Under these conditions there is no difference between real and virtual photons, and the use of formula (98.13) is thereby justified.

‡ The diagrams (100.6) are shown on the assumption of the Born approximation for scattering of nuclei. But, since Rutherford's formula is exact (for Coulomb interaction), the validity of the results obtained does not in fact depend on the fulfilment of the condition for the Born approximation to be valid.
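With the appropriate approximations, the calculation of the trace in (100.11) gives

$$\operatorname{tr}\{\ldots\} = 4\Bigl[(\varepsilon_+\varepsilon_- - \mathbf{p}_+\cdot\mathbf{p}_-)\Bigl(\mathbf{q}^2 - \frac{(\mathbf{q}\cdot\mathbf{k})^2}{\omega^2}\Bigr) + (\mathbf{q}\cdot\mathbf{k})^2\,\frac{2\varepsilon_+\varepsilon_-}{\omega^2} + 2(\mathbf{p}_+\cdot\mathbf{q})(\mathbf{p}_-\cdot\mathbf{q}) - \frac{2(\mathbf{q}\cdot\mathbf{k})}{\omega}\,(\varepsilon_+\,\mathbf{q}\cdot\mathbf{p}_- + \varepsilon_-\,\mathbf{q}\cdot\mathbf{p}_+)\Bigr],$$

where we can also put $|\mathbf{p}_+| = \varepsilon_+$, $|\mathbf{p}_-| = \varepsilon_-$. In the denominator,

$$\omega^2 - \mathbf{k}^2 \approx \varepsilon_+\varepsilon_-\theta^2 + \frac{m^2(\varepsilon_+ + \varepsilon_-)^2}{\varepsilon_+\varepsilon_-}.$$

Integration over the directions of $\mathbf{p}_+$ and $\mathbf{p}_-$, for a given angle between them, gives

[equation (100.13): illegible in the source; it contains the factors $(Z_1Z_2e^2)^2$, $(Z_1e/M_1 - Z_2e/M_2)^2$, $(\varepsilon_+^2 + \varepsilon_-^2)\,d\varepsilon_+\,d\varepsilon_-$ and $dq_y\,dq_z/q^2$]

The form of the dependence on $\theta$ confirms the hypothesis (100.12), and integration with respect to $\theta$ gives $\log[\varepsilon_+\varepsilon_-/m(\varepsilon_+ + \varepsilon_-)]$. Integration of the last factor in (100.13) is from $q_y = q_z = 0$ to $\sqrt{q_y^2 + q_z^2} \sim 1/R$, where $R$ is a quantity of the order of the radius of the nuclei (corresponding to the smallest impact parameters; see below). This integration gives

$$\Bigl[\pi \log(q_x^2 + q_y^2 + q_z^2)\Bigr]_{q_y = q_z = 0}^{\sqrt{q_y^2 + q_z^2} \approx 1/R} \approx 2\pi \log\frac{1}{q_x R}.$$

The total energy of the pair, equal to the change in the energy of the nuclei, is

$$\varepsilon \equiv \varepsilon_+ + \varepsilon_- = \tfrac{1}{2}M(v'^2 - v^2) \approx Mv(v_x' - v_x) = vq_x,$$

whence $q_x = \varepsilon/v$. Thus we find

[equation: illegible in the source; it contains the factors $\log[\varepsilon_+\varepsilon_-/m(\varepsilon_+ + \varepsilon_-)]$, $\log(v/R\varepsilon)$ and $d\varepsilon_+\,d\varepsilon_-$]

and, after integration over $d\varepsilon_+$ or $d\varepsilon_-$ with a given sum $\varepsilon$,

[equation (100.14): illegible in the source; it contains the factors $\log(\varepsilon/m)$, $\log(v/R\varepsilon)$ and $d\varepsilon/\varepsilon$]

The energy $\varepsilon$ may be correlated with the impact parameter $\rho \sim v/\varepsilon$; the pair energy is of the order of the frequency which corresponds to the collision time. Hence the logarithmic divergence on integration over $d\varepsilon$ in (100.14) implies a similar divergence with respect to impact parameters. This means that large values of $\rho$ are important (and this, incidentally, justifies the use of the cross-section (100.10) for scattering in the purely Coulomb field of the nucleus). Accordingly, the important range of energy is given by $m \ll \varepsilon \ll v/R$. Integration of (100.14) gives the total pair production cross-section; the final result is (in ordinary units)

[equation (100.15): missing in the source]

(E. M. Lifshitz, 1935).†

§ 101.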
Emission of a photon by an electron in the field of a strong electromagnetic wave

The application of perturbation theory to processes of interaction between an electron and a radiation field requires not only that the interaction constant $\alpha$ should be small but also that the field should be sufficiently weak. If $a$ is the amplitude of the classical 4-potential of an electromagnetic wave field, the characteristic quantity in this respect is the dimensionless invariant ratio

$$\xi = e\sqrt{-a^2}/m. \tag{101.1}$$

In this section we shall consider emission processes occurring in the interaction of an electron with a field of a strong electromagnetic wave, for which $\xi$ can have any value. The method used is based on an exact treatment of this interaction; the interaction of the electron with the newly emitted photons can, as before, be regarded as a small perturbation (A. I. Nikishov and V. I. Ritus, 1964).

Let us consider a monochromatic plane wave, say a circularly polarized one. Its 4-potential may be written in the form

$$A = a_1\cos\phi + a_2\sin\phi, \qquad \phi = kx, \tag{101.2}$$

where $k^\mu = (\omega, \mathbf{k})$ is the wave 4-vector ($k^2 = 0$), and the 4-amplitudes $a_1$ and $a_2$ are equal in magnitude and orthogonal:

$$a_1^2 = a_2^2 \equiv a^2, \qquad a_1 a_2 = 0.$$

We shall assume that the Lorentz gauge condition is applied to the potential, so that $a_1 k = a_2 k = 0$.

The exact wave function for an electron in the field of an arbitrary plane electromagnetic wave has been derived in §40, and is given by formulae (40.7) and (40.8). We shall, however, change the normalization by making $\psi_p$ correspond to unit mean spatial number density of particles, in the same way as the wave functions of free particles are normalized to "one particle in unit volume". Since the mean density for the function (40.7) is $\bar{j}_0 = q_0/p_0$, in order to obtain the required normalization this function must be multiplied by $\sqrt{p_0/q_0}$, i.e. the factor $1/\sqrt{2p_0}$ in (40.7) must be replaced by $1/\sqrt{2q_0}$. For a wave with the 4-potential (101.2), we find

$$\psi_p = \left\{1 + \frac{e}{2(kp)}\bigl[(\gamma k)(\gamma a_1)\cos\phi + (\gamma k)(\gamma a_2)\sin\phi\bigr]\right\}\frac{u(p)}{\sqrt{2q_0}} \times \exp\left\{-ie\,\frac{(a_1 p)}{(kp)}\sin\phi + ie\,\frac{(a_2 p)}{(kp)}\cos\phi - iqx\right\}, \tag{101.3}$$

where

$$q^\mu = p^\mu - e^2\,\frac{a^2}{2(kp)}\,k^\mu. \tag{101.4}$$

According to (40.14), the 4-vector $q$ is the mean 4-momentum of the electron; we shall call it the quasi-momentum.

The S-matrix element for a transition of the electron from the state $\psi_p$ to the state $\psi_{p'}$ with emission of a photon having 4-momentum $k'^\mu = (\omega', \mathbf{k}')$ and polarization 4-vector $e'$ is

$$S_{fi} = -ie\int \bar\psi_{p'}(\gamma e'^*)\,\psi_p\,\frac{e^{ik'x}}{\sqrt{2\omega'}}\,d^4x. \tag{101.5}$$

The integrand in (101.5) is a linear combination of the quantities

$$e^{-i\alpha_1\sin\phi + i\alpha_2\cos\phi}, \qquad \cos\phi\cdot e^{-i\alpha_1\sin\phi + i\alpha_2\cos\phi}, \qquad \sin\phi\cdot e^{-i\alpha_1\sin\phi + i\alpha_2\cos\phi},$$

where

$$\alpha_1 = e\left(\frac{a_1 p}{kp} - \frac{a_1 p'}{kp'}\right), \qquad \alpha_2 = e\left(\frac{a_2 p}{kp} - \frac{a_2 p'}{kp'}\right). \tag{101.6}$$

These quantities, together with the factor $\exp[i(k' + q' - q)x]$, give the whole dependence of the integrand on $x$. We expand them in Fourier series, denoting the expansion coefficients by $B_s$, $B_{1s}$, $B_{2s}$ respectively; for example,

$$e^{-i\alpha_1\sin\phi + i\alpha_2\cos\phi} = e^{-i\sqrt{\alpha_1^2 + \alpha_2^2}\,\sin(\phi - \phi_0)} = \sum_{s=-\infty}^{\infty} B_s\, e^{-is\phi}.$$

These coefficients can be expressed in terms of Bessel functions by the formulae

$$B_s = J_s(z)\,e^{is\phi_0},$$
$$B_{1s} = \tfrac{1}{2}\bigl[J_{s+1}(z)\,e^{i(s+1)\phi_0} + J_{s-1}(z)\,e^{i(s-1)\phi_0}\bigr], \tag{101.7}$$
$$B_{2s} = \frac{1}{2i}\bigl[J_{s+1}(z)\,e^{i(s+1)\phi_0} - J_{s-1}(z)\,e^{i(s-1)\phi_0}\bigr],$$

where $z = \sqrt{\alpha_1^2 + \alpha_2^2}$, $\cos\phi_0 = \alpha_1/z$, $\sin\phi_0 = \alpha_2/z$. The functions $B_s$, $B_{1s}$, $B_{2s}$ are related by

$$\alpha_1 B_{1s} + \alpha_2 B_{2s} = sB_s, \tag{101.8}$$

which follows from the familiar relation $J_{s-1}(z) + J_{s+1}(z) = 2sJ_s(z)/z$ between the Bessel functions.

The matrix element (101.5) then becomes

$$S_{fi} = \frac{1}{\sqrt{2\omega'\cdot 2q_0\cdot 2q_0'}}\sum_s M_{fi}^{(s)}\,(2\pi)^4\, i\,\delta^{(4)}(sk + q - q' - k'); \tag{101.9}$$

we shall not give here the fairly complicated expressions for the amplitudes $M_{fi}^{(s)}$. Thus $S_{fi}$ is an infinite sum of terms, each corresponding to a conservation law

$$sk + q = q' + k'. \tag{101.10}$$

Since

$$q^2 = q'^2 = m^2(1 + \xi^2) \equiv m_*^2 \tag{101.11}$$

(cf. (40.15)), and $k^2 = k'^2 = 0$, the equation (101.10) can be satisfied only if $s \ge 1$.

† A numerical error was corrected by L. B. Okun' (1953).
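The Fourier coefficients (101.7) and the relation (101.8) can be checked numerically. The following is a minimal sketch using only the Python standard library: the Bessel functions are evaluated from Bessel's integral $J_n(z) = \pi^{-1}\int_0^\pi \cos(n\tau - z\sin\tau)\,d\tau$, and the coefficients are compared with a direct numerical Fourier transform. The function names and the sample values of $\alpha_1$, $\alpha_2$ are illustrative assumptions, not taken from the text.

```python
import cmath
import math

def bessel_j(n, z, steps=400):
    # Bessel's integral for integer order n, evaluated with the midpoint rule
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        tau = (i + 0.5) * h
        total += math.cos(n * tau - z * math.sin(tau))
    return total * h / math.pi

def b_coefficients(s, alpha1, alpha2):
    # B_s, B_1s, B_2s as given by (101.7)
    z = math.hypot(alpha1, alpha2)
    phi0 = math.atan2(alpha2, alpha1)  # cos(phi0) = alpha1/z, sin(phi0) = alpha2/z
    up = bessel_j(s + 1, z) * cmath.exp(1j * (s + 1) * phi0)
    down = bessel_j(s - 1, z) * cmath.exp(1j * (s - 1) * phi0)
    b = bessel_j(s, z) * cmath.exp(1j * s * phi0)
    return b, 0.5 * (up + down), (up - down) / 2j

def fourier_coefficient(weight, s, alpha1, alpha2, steps=400):
    # Coefficient of exp(-i s phi) in weight(phi) * exp(-i a1 sin(phi) + i a2 cos(phi)),
    # computed as (1/2pi) * integral over one period of f(phi) * exp(+i s phi)
    h = 2.0 * math.pi / steps
    total = 0j
    for i in range(steps):
        phi = i * h
        f = weight(phi) * cmath.exp(-1j * alpha1 * math.sin(phi)
                                    + 1j * alpha2 * math.cos(phi))
        total += f * cmath.exp(1j * s * phi)
    return total / steps

alpha1, alpha2 = 0.7, -1.3  # arbitrary sample values
for s in range(-2, 4):
    b, b1, b2 = b_coefficients(s, alpha1, alpha2)
    residual = abs(alpha1 * b1 + alpha2 * b2 - s * b)  # relation (101.8)
    print(s, abs(b - fourier_coefficient(lambda p: 1.0, s, alpha1, alpha2)), residual)
```

Both printed columns should be at the level of rounding error; the second one tests (101.8), which is the recurrence $J_{s-1} + J_{s+1} = 2sJ_s/z$ in disguise.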
The $s$th term of the sum describes the emission of a photon $k'$ by the absorption from the wave of $s$ photons with 4-momenta $k$. The form (101.10) shows that all the kinematic relationships which occur for the Compton effect will apply to the processes considered here if the electron momenta are replaced by the quasi-momenta $q$ and the incident photon momentum by the 4-vector $sk$. In particular, the frequency of the emitted photon in the frame of reference where the electron is at rest on average ($\mathbf{q} = 0$, $q_0 = m_*$) is

$$\omega' = \frac{s\omega}{1 + (s\omega/m_*)(1 - \cos\theta)}, \tag{101.12}$$

where $\theta$ is the angle between $\mathbf{k}$ and $\mathbf{k}'$; cf. (86.8). We may say that the frequencies $\omega'$ are harmonics of $\omega$.

In the notation previously used (§64), the amplitude of the process of emission of the $s$th harmonic is $M_{fi}^{(s)}$, and the expression

$$dW_s = |M_{fi}^{(s)}|^2\, \frac{d^3k'\, d^3q'}{(2\pi)^6\, 2\omega'\cdot 2q_0\cdot 2q_0'}\,(2\pi)^4\,\delta^{(4)}(sk + q - q' - k') \tag{101.13}$$

gives the corresponding differential probability per unit volume and unit time.† The amplitudes $M_{fi}^{(s)}$ have a structure similar to that of the scattering amplitudes with plane waves, $\bar{u}(p')\ldots u(p)$; the operations of summation over polarizations of the particles are therefore carried out in the usual manner. After summation over the polarizations of the final electrons and the photon and averaging over the polarizations of the initial electron, we have

$$dW_s = \frac{e^2 m^2}{4\pi}\, \frac{d^3k'\, d^3q'}{q_0 q_0'\omega'}\,\delta^{(4)}(sk + q - q' - k') \times \left\{-2J_s^2(z) + \xi^2\left(1 + \frac{(kk')^2}{2(kp)(kp')}\right)\bigl(J_{s+1}^2 + J_{s-1}^2 - 2J_s^2\bigr)\right\}. \tag{101.14}$$

In order to integrate this expression, we note that, owing to the axial symmetry of the field of a circularly polarized wave, the differential probability is independent of the azimuthal angle $\phi$ around the direction of $\mathbf{k}$. This fact, together with the presence of the delta function, enables us to integrate over all variables except one, which we take to be the invariant $u = (kk')/(kp')$.
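The step from the conservation law (101.10) to the frequency formula (101.12) above is the same as for the Compton effect and can be written out briefly. The following is a sketch of that step, using only $q^2 = q'^2 = m_*^2$ and $k^2 = k'^2 = 0$:

```latex
\begin{align*}
q' &= sk + q - k' , \\
q'^2 &= m_*^2 + 2s(kq) - 2s(kk') - 2(qk')
  \quad\Longrightarrow\quad s(kq) = (qk') + s(kk') , \\
\intertext{and in the frame where $\mathbf{q}=0$, $q_0 = m_*$:}
(kq) &= \omega m_* , \qquad (qk') = \omega' m_* , \qquad
(kk') = \omega\omega'(1-\cos\theta) , \\
s\omega m_* &= \omega' m_* + s\omega\omega'(1-\cos\theta)
  \quad\Longrightarrow\quad
  \omega' = \frac{s\omega}{1 + (s\omega/m_*)(1-\cos\theta)} .
\end{align*}
```

For $s = 1$ and $\xi \to 0$ (so that $m_* \to m$) this reduces to the ordinary Compton formula.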
Then, after integration over $d^3k'\,d\phi\,d(q_0' + \omega')$, we find

$$\delta^{(4)}(sk + q - q' - k')\,\frac{d^3k'\, d^3q'}{q_0'\omega'} \to \frac{2\pi\, du}{(1 + u)^2}.$$

For, in the centre-of-mass system (in which $s\mathbf{k} + \mathbf{q} = \mathbf{q}' + \mathbf{k}' = 0$), this integration gives $2\pi|\mathbf{q}'|\,d\cos\theta/E_s$, where $E_s = s\omega + q_0 = \omega' + q_0'$ and $\theta$ is the angle between $\mathbf{k}$ and $\mathbf{q}'$; cf. the transformation (64.12). In the same system, moreover,

$$u = \frac{E_s}{q_0' - |\mathbf{q}'|\cos\theta} - 1, \qquad d\cos\theta = \frac{E_s\, du}{|\mathbf{q}'|(1 + u)^2}.$$

The range $-1 \le \cos\theta \le 1$ corresponds to

$$0 \le u \le u_s \equiv E_s^2/m_*^2 - 1 = 2s(kp)/m_*^2;$$

in making the transformations it must be remembered that $kp = kq$.

Thus the total probability of emission from unit volume in unit time is

$$W = \sum_{s=1}^{\infty} W_s, \qquad W_s = \frac{e^2m^2}{4q_0}\int_0^{u_s}\frac{du}{(1 + u)^2}\left\{-4J_s^2(z) + \xi^2\left(2 + \frac{u^2}{1 + u}\right)\bigl(J_{s+1}^2(z) + J_{s-1}^2(z) - 2J_s^2(z)\bigr)\right\}, \tag{101.15}$$

where‡

$$u = \frac{(kk')}{(kp')}, \qquad u_s = \frac{2s(kp)}{m_*^2}, \qquad z = \frac{2s\xi}{\sqrt{1 + \xi^2}}\sqrt{\frac{u}{u_s}\left(1 - \frac{u}{u_s}\right)}. \tag{101.16}$$

When $\xi \ll 1$ (the condition for perturbation theory to be valid), the integrand in (101.15) can be expanded in powers of $\xi$. For example, the first term in the expansion of $W_1$ is

$$W_1 \approx \frac{e^2m^2\xi^2}{4p_0}\int_0^{u_1}\left[2 + \frac{u^2}{1 + u} - 4\,\frac{u}{u_1}\left(1 - \frac{u}{u_1}\right)\right]\frac{du}{(1 + u)^2}, \tag{101.17}$$

with $u_1 \approx 2(kp)/m^2$. This result agrees, as it should, with the Klein-Nishina formula for the scattering of a photon by an electron: putting in (101.17) $-a^2 = 4\pi/\omega$, $\xi^2 = 4\pi e^2/m^2\omega$, and dividing by the incident flux density (64.14), we return to (86.16) (the integrated scattering cross-section is independent of the initial polarization of the photon).§

The expression for the probability of emission of the second harmonic (the first term in the expansion of $W_2$ for $\xi \ll 1$) is

$$W_2 \approx \frac{e^2m^2\xi^4}{p_0}\int_0^{u_2}\frac{du}{(1 + u)^2}\,\frac{u}{u_2}\left(1 - \frac{u}{u_2}\right)\left[2 + \frac{u^2}{1 + u} - 4\,\frac{u}{u_2}\left(1 - \frac{u}{u_2}\right)\right] \tag{101.18}$$

[the closed form of (101.18), a combination of inverse powers of $u_1$ and $\log(1 + 2u_1)$, is illegible in the source]

The leading term in $W_s$ for fairly small $s$ is proportional to $\xi^{2s}$.

Let us now consider the opposite case ($\xi \gg 1$). The parameter $\xi$ can be made large, for instance, by decreasing the frequency $\omega$ with a fixed field strength; evidently $\xi \sim eF/m\omega$, where $F$ is the amplitude of the field strength. It is therefore clear that the case $\xi \gg 1$ essentially refers to processes in a constant and uniform field where $\mathbf{E}$ and $\mathbf{H}$ are orthogonal and equal in magnitude; this will be called a crossed field.

The probability of emission in this field can be found by taking the limit $\xi \to \infty$, but it is simpler to assume a constant field in the calculations, taking the 4-potential in the form

$$A^\mu = a^\mu\phi, \qquad \phi = kx, \qquad ak = 0 \tag{101.19}$$

(so that $F_{\mu\nu} = k_\mu a_\nu - k_\nu a_\mu = \text{constant}$). The exact wave function of the electron in this field is obtained by substituting (101.19) in (40.7), (40.8):

$$\psi_p = \left[1 + e\,\frac{(\gamma k)(\gamma a)}{2(kp)}\,\phi\right]\frac{u(p)}{\sqrt{2p_0}}\exp\left\{-ie\,\frac{(ap)}{2(kp)}\,\phi^2 + ie^2\,\frac{a^2}{6(kp)}\,\phi^3 - ipx\right\}. \tag{101.20}$$

The result given by using this function is exact for emission by an electron with any energy in a crossed field.

† It should be noted that the normalization of the functions $\psi_p$ to unit density corresponds to normalization by the delta function "on the $\mathbf{q}/2\pi$ scale"; cf. (40.17), where the factor $q_0/p_0$ on the right will now be absent. It is for this reason that the number of final states of the electron must be measured by the element $d^3q'/(2\pi)^3$.

‡ To calculate $z$, we first note that $z^2 = e^2[(a_1Q)^2 + (a_2Q)^2] = e^2a^2Q^2$, where $Q = q/(kq) - q'/(kq')$. This is easily shown by choosing a frame of reference in which $(a_1)_0 = (a_2)_0 = 0$ and the vectors $\mathbf{a}_1$, $\mathbf{a}_2$, $\mathbf{k}$ are along the axes $x^1$, $x^2$, $x^3$, and noting that $Q_0 = Q_3$ because $kQ = 0$.

§ This value of $a^2$ corresponds to normalization of the 4-potential to "one particle in unit volume". To determine it, $\omega$ must be equated to the energy of a classical field with the (real) 4-potential (101.2).
However, in the ultra-relativistic case this result (when put in the appropriate form; see below) applies to emission by an electron not only in a crossed field but in any constant and uniform electromagnetic field, including a constant magnetic field as discussed in §90.

To formulate this assertion we note that the state of a particle in any constant and uniform field is defined by as many quantum numbers as the state of a free particle, and these may always be so chosen as to become, when the field is removed, those of a free particle, i.e. its 4-momentum $p^\mu$ ($p^2 = m^2$). Thus the state of a particle in a constant field is described by a constant 4-vector $p$. The total intensity of emission, being an invariant, depends only on the invariants which can be constructed from the constant 4-tensor $F_{\mu\nu}$ and the constant 4-vector $p^\mu$. Since $F_{\mu\nu}$ can appear in the intensity only in combination with the charge $e$, we obtain three dimensionless invariants:

$$\chi^2 = -\frac{e^2}{m^6}(F_{\mu\nu}p^\nu)^2, \qquad f = \frac{e^2}{m^4}(F_{\mu\nu})^2, \qquad g = \frac{e^2}{m^4}\,e_{\lambda\mu\nu\rho}F^{\lambda\mu}F^{\nu\rho}; \tag{101.21}$$

for the field (101.19), $\chi^2 = -(e^2/m^6)\,a^2(kp)^2$.

In a crossed field $f = g = 0$, whereas in general all three invariants are non-zero. If the electron is ultra-relativistic ($p_0 \gg m$), however, and the vector $\mathbf{p}$ makes angles $\theta \gg m/p_0$ with the fields $\mathbf{E}$ and $\mathbf{H}$, then $\chi^2 \gg f, g$ (that is, for an ultra-relativistic particle any constant field appears to be a crossed field for almost all directions $\mathbf{p}$). If also the fields $|\mathbf{E}|, |\mathbf{H}| \ll m^2/e$ ($= m^2c^3/e\hbar$), then $|f|, |g| \ll 1$.† Under these conditions the intensity calculated for a crossed field and expressed in terms of the invariant $\chi$ will apply also to the emission in any constant field.

The invariant $\chi$ is given in terms of the fields $\mathbf{E}$ and $\mathbf{H}$ by

$$\chi^2 = \frac{e^2}{m^6}\bigl\{(\mathbf{p}\times\mathbf{H} + p_0\mathbf{E})^2 - (\mathbf{p}\cdot\mathbf{E})^2\bigr\}.$$

For a constant magnetic field, $\chi$ is equal to the quantity (90.3), and the above arguments are therefore another means of deriving the results in §90.‡

† And $p$ in $\chi$ may be regarded, with the same accuracy, as being the ordinary 4-momentum of the particle.
‡ A detailed account of the theory of various processes in strong fields is given in the review papers by A. I. Nikishov and V. I. Ritus in Proceedings (Trudy) of the P. N. Lebedev Physics Institute, Vol. 111, pp. 5 and 152.

CHAPTER XI

EXACT PROPAGATORS AND VERTEX PARTS

§ 102. Field operators in the Heisenberg representation

Hitherto, in considering various specific processes in electrodynamics, we have used only the first non-vanishing approximation of perturbation theory. We shall now go on to discuss the effects which occur in higher approximations. These are called radiative corrections.

A better understanding of the structure of the higher approximations can be obtained by first examining some general properties of exact scattering amplitudes (i.e. those which have not been expanded in powers of $e^2$). We have seen in §72 that the successive terms of the series in perturbation theory can be expressed in terms of the field operators in the interaction representation, whose time dependence is determined by the Hamiltonian $\hat{H}_0$ of a system of free particles. The exact scattering amplitudes, however, are more conveniently expressed in terms of the field operators in the Heisenberg representation, where the time dependence is determined by the exact Hamiltonian $\hat{H} = \hat{H}_0 + \hat{V}$ of a system of interacting particles.

The general rule for constructing the Heisenberg operators gives

$$\hat\psi(x) \equiv \hat\psi(t, \mathbf{r}) = e^{i\hat{H}t}\,\hat\psi(\mathbf{r})\,e^{-i\hat{H}t}, \tag{102.1}$$

and similarly for $\hat{\bar\psi}(x)$ and $\hat{A}(x)$, $\hat\psi(\mathbf{r})$, etc., being time-independent (Schrödinger) operators.† It may be noted immediately that the Heisenberg operators for a given time obey the same commutation rules as the operators in the Schrödinger representation or the interaction representation: for example,

$$\{\hat\psi_i(t, \mathbf{r}), \hat{\bar\psi}_k(t, \mathbf{r}')\}_+ = e^{i\hat{H}t}\{\hat\psi_i(\mathbf{r}), \hat{\bar\psi}_k(\mathbf{r}')\}_+\,e^{-i\hat{H}t} = \gamma^0_{ik}\,\delta(\mathbf{r} - \mathbf{r}'); \tag{102.2}$$

cf. (75.6). Similarly, the operators $\hat\psi(t, \mathbf{r})$ and $\hat{A}(t, \mathbf{r}')$ commute:

$$\{\hat\psi(t, \mathbf{r}), \hat{A}(t, \mathbf{r}')\}_- = 0,$$

but this does not hold good for operators pertaining to different times.
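The reason why the equal-time rules survive the change of representation, and why the argument fails for unequal times, can be made explicit. The following short sketch writes out the step behind (102.2), inserting $e^{-i\hat{H}t}e^{i\hat{H}t} = 1$ between the two operators:

```latex
\begin{align*}
\hat\psi_i(t,\mathbf r)\,\hat{\bar\psi}_k(t,\mathbf r')
  &= e^{i\hat Ht}\,\hat\psi_i(\mathbf r)\,
     \underbrace{e^{-i\hat Ht}e^{i\hat Ht}}_{=\,1}\,
     \hat{\bar\psi}_k(\mathbf r')\,e^{-i\hat Ht}
   = e^{i\hat Ht}\,\hat\psi_i(\mathbf r)\hat{\bar\psi}_k(\mathbf r')\,e^{-i\hat Ht}, \\
\hat\psi_i(t,\mathbf r)\,\hat{\bar\psi}_k(t',\mathbf r')
  &= e^{i\hat Ht}\,\hat\psi_i(\mathbf r)\,
     e^{-i\hat H(t-t')}\,
     \hat{\bar\psi}_k(\mathbf r')\,e^{-i\hat Ht'} .
\end{align*}
```

In the first line the product of Heisenberg operators is a unitary transform of the product of Schrödinger operators, so the c-number anticommutator $\gamma^0_{ik}\delta(\mathbf{r}-\mathbf{r}')$ is unchanged. In the second line the factor $e^{-i\hat{H}(t-t')}$ remains between the operators and involves the full interacting Hamiltonian, so no simple rule follows.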
The "equation of motion" satisfied by the Heisenberg $\psi$-operator can be derived from the general formula QM, (13.7):

$$-i\,\frac{\partial\hat\psi(x)}{\partial t} = \hat{H}\hat\psi(x) - \hat\psi(x)\hat{H}. \tag{102.3}$$

The Schrödinger and Heisenberg representations are the same as regards the Hamiltonian, which is expressed in the same way in terms of the field operators. Here, to calculate the right-hand side of (102.3), we may omit from the Hamiltonian the part which depends only on the operator $\hat{A}(x)$ (the Hamiltonian of the free electromagnetic field), since this part commutes with $\hat\psi(x)$. According to (21.13) and (43.3),

$$\hat{H} = \int\hat\psi^*(t, \mathbf{r})(\boldsymbol{\alpha}\cdot\hat{\mathbf{p}} + \beta m)\hat\psi(t, \mathbf{r})\,d^3x + e\int\hat{\bar\psi}(t, \mathbf{r})\bigl(\gamma\hat{A}(t, \mathbf{r})\bigr)\hat\psi(t, \mathbf{r})\,d^3x = \int\hat{\bar\psi}(t, \mathbf{r})\bigl\{\boldsymbol{\gamma}\cdot\hat{\mathbf{p}} + m + e\bigl(\gamma\hat{A}(t, \mathbf{r})\bigr)\bigr\}\hat\psi(t, \mathbf{r})\,d^3x. \tag{102.4}$$

When the commutator $\{\hat{H}, \hat\psi(t, \mathbf{r})\}_-$ is calculated from (102.2) and the delta function is eliminated by integration over $d^3x$, we get

$$(\gamma\hat{p} - e\gamma\hat{A} - m)\hat\psi(t, \mathbf{r}) = 0. \tag{102.5}$$

As we should expect, the operator $\hat\psi(t, \mathbf{r})$ satisfies an equation which is formally the same as Dirac's equation.

The equation for the electromagnetic field operator $\hat{A}(t, \mathbf{r})$ is obvious from the correlation with the classical case. When that case applies, i.e. when the occupation numbers are large (cf. §5), the operator equation must become the classical Maxwell's equation for the potentials, Fields (30.2), after averaging over the state of the field. It is therefore clear that the equation for the operator is simply the same as Maxwell's equation, so that we have (for an arbitrary gauge)

$$\partial^\nu\partial_\mu\hat{A}^\mu(x) - \partial_\mu\partial^\mu\hat{A}^\nu(x) = -4\pi e\,\hat{j}^\nu(x), \tag{102.6}$$

where $\hat{j}^\nu(x) = \hat{\bar\psi}(x)\gamma^\nu\hat\psi(x)$ is the current operator, satisfying identically the equation of continuity

$$\partial_\nu\hat{j}^\nu(x) = 0. \tag{102.7}$$

It is important to note that the equations (102.6) are linear in $\hat{A}^\mu$ and $\hat{j}^\nu$, and the question of the sequence of these operators therefore does not arise.

Like the similar equations for wave functions, the operator equations (102.6) and (102.7) are invariant under the gauge transformation

$$\hat{A}_\mu(x) \to \hat{A}_\mu(x) - \partial_\mu\hat\chi(x), \qquad \hat\psi(x) \to \hat\psi(x)\,e^{ie\hat\chi}, \qquad \hat{\bar\psi}(x) \to e^{-ie\hat\chi}\,\hat{\bar\psi}(x), \tag{102.8}$$

where $\hat\chi(x)$ is any Hermitian operator which commutes (at a particular time) with $\hat\psi$.‡

Let us now ascertain the relationship between the operators in the Heisenberg representation and those in the interaction representation. To simplify the discussion, it is convenient to make the formal assumption (which will not affect the final result) that the interaction $\hat{V}(t)$ is adiabatically "switched on" from $t = -\infty$ to finite times. Then the Heisenberg and interaction representations are the same for $t \to -\infty$, and the wave functions of the system, $\Phi$ and $\Phi_{\mathrm{int}}$, are the same:

$$\Phi_{\mathrm{int}}(t = -\infty) = \Phi. \tag{102.9}$$

But the wave function in the Heisenberg representation is independent of time (since the whole of the time dependence is in the operators); in the interaction representation, the time dependence of the wave function is given by (72.7):

$$\Phi_{\mathrm{int}}(t) = \hat{S}(t, -\infty)\,\Phi_{\mathrm{int}}(-\infty), \tag{102.10}$$

where

$$\hat{S}(t_2, t_1) = \mathrm{T}\exp\Bigl\{-i\int_{t_1}^{t_2}\hat{V}(t')\,dt'\Bigr\}, \tag{102.11}$$

and the following properties of $\hat{S}$ are obvious:

$$\hat{S}(t, t_1)\hat{S}(t_1, t_0) = \hat{S}(t, t_0), \qquad \hat{S}^{-1}(t, t_1) = \hat{S}(t_1, t). \tag{102.12}$$

Comparison of (102.10) and (102.9) gives

$$\Phi_{\mathrm{int}}(t) = \hat{S}(t, -\infty)\,\Phi \tag{102.13}$$

as the relation between the wave functions in the two representations. The operator transformation formula is similarly

$$\hat\psi(t, \mathbf{r}) = \hat{S}^{-1}(t, -\infty)\,\hat\psi_{\mathrm{int}}(t, \mathbf{r})\,\hat{S}(t, -\infty) = \hat{S}(-\infty, t)\,\hat\psi_{\mathrm{int}}(t, \mathbf{r})\,\hat{S}(t, -\infty), \tag{102.14}$$

and likewise for $\hat{\bar\psi}$ and $\hat{A}$.

One further general remark may be added.

† In this chapter, operators with a time argument belong to the Heisenberg representation; those in the interaction representation will be given the suffix int.

‡ This refers specifically to the Heisenberg $\psi$-operators. In the interaction representation, the gauge transformation of the electromagnetic potentials does not affect the $\psi$-operators.
It has already been mentioned more than once that, in relativistic quantum theory, the physical significance of the field operators is very limited because the zero-point fluctuations are infinite. This is even more true of operators in the Heisenberg representation, which contain also divergences due to the interaction. In this chapter, §§102-109 deal with the formal theory, which ignores the question of eliminating these singularities and which treats all quantities as if they were finite. The results thus obtained have mainly heuristic value: they lead to a fuller understanding of the significance of the expansions given by perturbation theory, and they may also remain valid in some form in a future theory which is free from the present difficulties.

§ 103. The exact photon propagator

The concepts of exact propagators play a central role in the formalism of the exact theory (i.e. without expansion in powers of $e^2$).†

The exact photon propagator (denoted by the script letter $\mathcal{D}$) is defined by

$$\mathcal{D}_{\mu\nu}(x - x') = i\langle 0|\mathrm{T}\hat{A}_\mu(x)\hat{A}_\nu(x')|0\rangle, \tag{103.1}$$

where $\hat{A}_\mu(x)$ are Heisenberg operators, in contrast to the definition (76.1):

$$D_{\mu\nu}(x - x') = i\langle 0|\mathrm{T}\hat{A}^{\mathrm{int}}_\mu(x)\hat{A}^{\mathrm{int}}_\nu(x')|0\rangle, \tag{103.2}$$

in which the operators in the interaction representation were used. The function (103.2) may be called the free (or bare)-photon propagator to distinguish it from the exact propagator (103.1).

Since the mean value in (103.1) cannot be exactly calculated, it is impossible to obtain an exact analytical expression for $\mathcal{D}_{\mu\nu}$, although the definition does lead to some general properties of this function, as will be discussed in §111; here we shall consider the calculation of $\mathcal{D}_{\mu\nu}$ by perturbation theory, using the diagram technique. For this purpose, we must express $\mathcal{D}_{\mu\nu}$ in terms of the operators in the interaction representation.

First, let $t > t'$. Using the relationship between $\hat{A}(x)$ and $\hat{A}_{\mathrm{int}}(x)$ (cf. (102.14)), we can write

$$\mathcal{D}_{\mu\nu}(x - x') = i\langle 0|\hat{A}_\mu(x)\hat{A}_\nu(x')|0\rangle = i\langle 0|\hat{S}(-\infty, t)\hat{A}^{\mathrm{int}}_\mu(x)\hat{S}(t, -\infty)\hat{S}(-\infty, t')\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}(t', -\infty)|0\rangle.$$

According to (102.12) we can make the substitutions

$$\hat{S}(t, -\infty)\hat{S}(-\infty, t') = \hat{S}(t, t'), \qquad \hat{S}(-\infty, t) = \hat{S}(-\infty, +\infty)\hat{S}(\infty, t).$$

Then

$$\mathcal{D}_{\mu\nu}(x - x') = i\langle 0|\hat{S}^{-1}\bigl[\hat{S}(\infty, t)\hat{A}^{\mathrm{int}}_\mu(x)\hat{S}(t, t')\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}(t', -\infty)\bigr]|0\rangle, \tag{103.3}$$

with

$$\hat{S} \equiv \hat{S}(+\infty, -\infty). \tag{103.4}$$

Since, according to the definition (102.11), $\hat{S}(t_2, t_1)$ includes only operators for times between $t_1$ and $t_2$ arranged in chronological sequence, it is evident that all the operator factors in the brackets in (103.3) are in order of decreasing time from left to right. If the time-ordering symbol T is placed before the bracket, we can rearrange the factors in any manner, since the operator T will automatically put them in the necessary order. Then we write the bracket as

$$[\ldots] = \mathrm{T}\bigl[\hat{A}^{\mathrm{int}}_\mu(x)\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}(\infty, t)\hat{S}(t, t')\hat{S}(t', -\infty)\bigr] = \mathrm{T}\bigl[\hat{A}^{\mathrm{int}}_\mu(x)\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}\bigr].$$

Thus

$$\mathcal{D}_{\mu\nu}(x - x') = i\langle 0|\hat{S}^{-1}\mathrm{T}\bigl[\hat{A}^{\mathrm{int}}_\mu(x)\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}\bigr]|0\rangle. \tag{103.5}$$

It is easily shown by a similar argument that this formula is also valid if $t < t'$.

We shall now prove that the factor $\hat{S}^{-1}$ can be taken outside the averaging over the vacuum to form a phase factor. To do so, we recall that the Heisenberg vacuum wave function $\Phi$ is the same as the value $\Phi_{\mathrm{int}}(-\infty)$ of the wave function of the same state in the interaction representation (see (102.9)). From (72.8),

$$\hat{S}\,\Phi_{\mathrm{int}}(-\infty) \equiv \hat{S}(+\infty, -\infty)\,\Phi_{\mathrm{int}}(-\infty) = \Phi_{\mathrm{int}}(+\infty).$$

The vacuum is a strictly stationary state, in which no spontaneous processes of particle generation can occur. In other words, in the course of time the vacuum remains the vacuum; this means that $\Phi_{\mathrm{int}}(+\infty)$ can differ from $\Phi_{\mathrm{int}}(-\infty)$ only by a phase factor $e^{i\alpha}$.

† These concepts were introduced by F. J. Dyson (1949), who also developed essentially the whole of the treatment given in this chapter.
Hence

$$\hat{S}\,\Phi_{\mathrm{int}}(-\infty) = e^{i\alpha}\,\Phi_{\mathrm{int}}(-\infty) = \langle 0|\hat{S}|0\rangle\,\Phi_{\mathrm{int}}(-\infty), \tag{103.6}$$

or, taking the complex conjugate and using the unitarity of the operator $\hat{S}$,

$$\Phi^*_{\mathrm{int}}(-\infty)\,\hat{S}^{-1} = \langle 0|\hat{S}|0\rangle^{-1}\,\Phi^*_{\mathrm{int}}(-\infty).$$

Hence it is clear that (103.5) can be written

$$\mathcal{D}_{\mu\nu}(x - x') = i\,\frac{\langle 0|\mathrm{T}\hat{A}^{\mathrm{int}}_\mu(x)\hat{A}^{\mathrm{int}}_\nu(x')\hat{S}|0\rangle}{\langle 0|\hat{S}|0\rangle}. \tag{103.7}$$

Substituting in the numerator and the denominator the expansion (72.10) for $\hat{S}$ and averaging by means of Wick's theorem (§77), we get an expansion of $\mathcal{D}_{\mu\nu}$ in powers of $e^2$.

In the numerator of (103.7), the quantities to be averaged differ from the matrix elements of the type (77.1) only in that the "external" photon creation and annihilation operators are replaced by $\hat{A}^{\mathrm{int}}_\mu(x)$ and $\hat{A}^{\mathrm{int}}_\nu(x')$. Since all the factors in the products to be averaged are preceded by the time-ordering symbol, the pairwise contractions of these operators with the "internal" operators $\hat{A}^{\mathrm{int}}(x_1)$, $\hat{A}^{\mathrm{int}}(x_2)$, ... will give the photon propagators $D_{\mu\nu}$. Thus the results of the averaging are expressed by sets of diagrams with two free ends, constructed in accordance with the rules in §77, except that propagators $D_{\mu\nu}$, not the amplitudes $e$ of real photons, correspond to external (and internal) photon lines. In the zero-order approximation, with $\hat{S} = 1$, the numerator of (103.7) is simply $D_{\mu\nu}(x - x')$. The next non-zero terms will be of the order of $e^2$. They are represented by a set of diagrams having two free ends and two vertices:

[Diagrams (103.8): a photon line with an inserted electron loop, and a photon line accompanied by a detached closed loop]

The second of these diagrams consists of two disconnected parts: a broken line (corresponding to $-iD_{\mu\nu}$) and a closed loop. The separation of the parts of the diagram signifies that the corresponding analytical expression separates into two independent factors. On adding to the diagrams (103.8) the zero-order approximation diagram (a single broken line) and "taking it outside the brackets", we find that the numerator in (103.7) is, as far as second-order terms,

[Diagrammatic expression: (single broken line) × {1 + closed loop} + (broken line with inserted electron loop)]

The expression $\langle 0|\hat{S}|0\rangle$ in the denominator of (103.7) is the amplitude of the "transition" from the vacuum to the vacuum.
Its expansion therefore contains only diagrams without free ends. In the zero-order approximation, $\langle 0|\hat{S}|0\rangle = 1$, and as far as second-order terms we have

[Diagrammatic expression: 1 + closed loop]

When the numerator is divided by the denominator we get, to the same order, the expression

[Diagrammatic expression: (single broken line) + (broken line with inserted electron loop)]

Thus the diagram with the detached loop does not occur in the result. This is a general theorem. Having regard to the way in which the diagrams are constructed which correspond to the numerator and denominator in (103.7), we can easily see that the role of the denominator $\langle 0|\hat{S}|0\rangle$ is simply to ensure that in all orders of perturbation theory the exact propagator $\mathcal{D}_{\mu\nu}$ will be represented only by diagrams which do not contain separated parts.

The diagrams with no free ends, forming closed loops, have no physical significance and need not be taken into account, quite apart from the fact that they disappear when the propagator $\mathcal{D}$ is formed. Such loops represent radiative corrections to the diagonal element of the S-matrix for a vacuum-vacuum transition; but, according to (103.6), the sum of all these loops, together with the unity given by the zero-order approximation, gives only an unimportant phase factor, which cannot affect any physical results.

The change from the coordinate representation to the momentum representation is made in the usual way. For example, in the second-order approximation of perturbation theory, the propagator $-i\mathcal{D}_{\mu\nu}(k)$, which will be shown by a thick broken line, is the sum

[Diagrams (103.9): thick broken line $k$ = thin broken line $k$ + thin broken line with an inserted electron loop carrying momenta $p + k$ and $p$]

in which all the diagrams are calculated by the general rules given in §77 except that factors $-iD_{\mu\nu}(k)$ are assigned to the external as well as the internal photon lines. In analytical form, we therefore have†

$$\mathcal{D}_{\mu\nu}(k) \approx D_{\mu\nu}(k) + ie^2 D_{\mu\lambda}(k)\int\operatorname{tr}\gamma^\lambda G(p + k)\gamma^\rho G(p)\,\frac{d^4p}{(2\pi)^4}\;D_{\rho\nu}(k); \tag{103.10}$$

the bispinor indices of the matrices $\gamma$ and $G$ are, as usual, omitted.
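The cancellation of the detached loop in the ratio (103.7) can be written schematically. The shorthand below is purely illustrative and is not the book's notation: $D$ stands for the single broken line, $e^2L$ for the second-order closed loop without free ends, and $e^2C$ for the connected second-order term, i.e. the second term in (103.10).

```latex
\begin{align*}
\text{numerator} &= D\,(1 + e^2 L) + e^2 C + O(e^4), \\
\text{denominator} &= \langle 0|\hat S|0\rangle = 1 + e^2 L + O(e^4), \\
\frac{\text{numerator}}{\text{denominator}}
  &= \bigl[D\,(1 + e^2 L) + e^2 C\bigr]\,(1 - e^2 L) + O(e^4)
   = D + e^2 C + O(e^4).
\end{align*}
```

The term $e^2DL$ from the disconnected diagram is removed by the expansion of the denominator, leaving only connected diagrams to this order.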
The terms in subsequent approximations are constructed in a similar manner, and are represented by sets of diagrams having two external photon lines and the appropriate number of vertices. For example, the terms in $e^4$ correspond to the following four-vertex diagrams:

[Diagrams (103.11): four fourth-order photon self-energy diagrams, the last being a chain of two electron loops joined by a photon line]

The diagram

[Diagram: photon line with an electron loop to which a self-closed electron line is attached by a photon line]

also has four vertices; its upper part is a loop formed by a single "self-closed" electron line. Such a loop corresponds to the contraction $\hat{\bar\psi}(x)\gamma\hat\psi(x)$, i.e. to the value of the current averaged over the vacuum: $\langle 0|\hat{j}(x)|0\rangle$. But, by the definition of the vacuum, this quantity must be zero identically, and the identity cannot of course be altered by any further radiative corrections to such a loop.‡ Thus no diagrams having "self-closed" electron lines need be considered in any approximation.

The part of a diagram which lies between two (external or internal) photon lines is called a photon self-energy part. In the general case, it can itself be divided into parts joined in pairs by a single photon line, i.e. it has a structure of the form

[Diagram: chain of circles joined by single photon lines]

where the circles denote parts which cannot be further subdivided in the same manner; such parts are said to be compact or proper. For example, the first three of the four fourth-order self-energy parts (103.11) are compact.

Let $i\mathcal{P}_{\mu\nu}/4\pi$ denote the sum of the infinity of compact self-energy parts. The function $\mathcal{P}_{\mu\nu}(k)$ is called the polarization operator. When the diagrams are classified by the number of compact parts which they contain, the exact propagator $\mathcal{D}_{\mu\nu}$ can be put in the form of a series

[Diagram: thick broken line = thin line + thin line with one shaded circle + thin line with two shaded circles + ...]

where $i\mathcal{P}_{\mu\nu}/4\pi$ corresponds to each shaded circle. The analytical form of this series is

$$\mathcal{D} = D + D\frac{\mathcal{P}}{4\pi}D + D\frac{\mathcal{P}}{4\pi}D\frac{\mathcal{P}}{4\pi}D + \ldots = D\Bigl\{1 + \frac{\mathcal{P}}{4\pi}\Bigl[D + D\frac{\mathcal{P}}{4\pi}D + \ldots\Bigr]\Bigr\}, \tag{103.12}$$

where the indices are omitted, for brevity.

† The factor $-1$ from the closed electron loop must be taken into account when deriving the signs.

‡ Although a direct calculation from the diagrams would lead to divergent integrals.
The series in the brackets in (103.12) is again $\mathcal{D}$. Hence

$$\mathcal{D}_{\mu\nu}(k) = D_{\mu\nu}(k) + D_{\mu\lambda}(k)\,\frac{\mathcal{P}^{\lambda\rho}(k)}{4\pi}\,\mathcal{D}_{\rho\nu}(k). \tag{103.13}$$

Multiplying this equation on the left by the inverse tensor $(D^{-1})^{\tau\mu}$ and on the right by $(\mathcal{D}^{-1})^{\nu\sigma}$, and renaming the indices, we get the equivalent form

$$\mathcal{D}^{-1}_{\mu\nu} = D^{-1}_{\mu\nu} - \frac{1}{4\pi}\mathcal{P}_{\mu\nu}. \tag{103.14}$$

It must be emphasized that writing $\mathcal{D}$ in the form (103.12) assumes that the diagrams can be broken down into simpler parts calculated by the general rules of the diagram technique, and that the combination of such parts gives the correct expressions for the entire diagrams. The admissibility of this breakdown of the diagrams is an important and by no means trivial feature of the diagram technique, which arises from the fact that the overall numerical factor in the diagram does not depend on the order of the diagram.

The same property enables us to use the function $\mathcal{D}$ (assumed known) to simplify the calculations of the radiative corrections to the amplitudes of various scattering processes: instead of treating afresh each time the diagrams with different corrections to the internal photon lines, we can simply make these lines thick, i.e. assign to them the propagators $\mathcal{D}$ (instead of $D$) in the appropriate approximation.

If the photon line corresponds to a real and not a virtual photon, i.e. if it is a free end of the whole diagram, the application to it of all the self-energy corrections gives what is called an effective external line. It corresponds to the expression obtained from (103.13) by replacing the factor $D$ by the polarization amplitude of the real photon:

$$e_\mu + \mathcal{D}_{\mu\rho}(k)\,\frac{\mathcal{P}^{\rho\lambda}(k)}{4\pi}\,e_\lambda. \tag{103.15}$$

For an external-field line, $e_\mu$ in this expression is to be replaced by $A^{(e)}_\mu$.

The discussion in §76 of the tensor structure and the gauge non-uniqueness of the approximate propagator $D_{\mu\nu}$ applies to the exact function $\mathcal{D}_{\mu\nu}$ also.
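The resummation that leads from (103.12) to (103.13) and (103.14) can be illustrated numerically. The following is a minimal sketch, assuming (purely for illustration) that $D$ and $\mathcal{P}/4\pi$ are ordinary numbers rather than tensors; the function names and the sample values are hypothetical and are not taken from the text.

```python
def dyson_partial_sum(d_free, pol, n_terms):
    """Partial sum of the series D + D*pol*D + D*pol*D*pol*D + ... (cf. 103.12)."""
    total = 0.0
    term = d_free
    for _ in range(n_terms):
        total += term
        term *= pol * d_free  # each further compact part adds a factor pol*D
    return total


def dyson_closed_form(d_free, pol):
    """Closed form implied by 1/D_exact = 1/D - pol (cf. 103.14)."""
    return 1.0 / (1.0 / d_free - pol)


# Illustrative numbers only; |pol * d_free| < 1 so that the series converges.
d_free = 0.5   # stands for D
pol = 0.3      # stands for P / 4 pi

series = dyson_partial_sum(d_free, pol, 60)
exact = dyson_closed_form(d_free, pol)

print(series, exact)
```

In this scalar toy model the summed series, the closed form, and the self-consistent relation $\mathcal{D} = D + D(\mathcal{P}/4\pi)\mathcal{D}$ all agree to rounding accuracy.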
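Considering only the relativistically invariant representations of the exact function $\mathcal{D}_{\mu\nu}$, we can write it in the general form

$$\mathcal{D}_{\mu\nu}(k) = \mathcal{D}(k^2)\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right) + \mathcal{D}^{(l)}(k^2)\,\frac{k_\mu k_\nu}{k^2}; \tag{103.16}$$

the first term corresponds to the Landau gauge, and in the second term $\mathcal{D}^{(l)}$ is a gauge-arbitrary function. The corresponding form of the approximate propagator† is

$$D_{\mu\nu}(k) = D(k^2)\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right) + D^{(l)}(k^2)\,\frac{k_\mu k_\nu}{k^2}. \tag{103.17}$$

† In this formula $D^{(l)}$ is not the same as in (76.3).

The longitudinal part $\mathcal{D}^{(l)}$ of the propagator is related to the longitudinal part of the potential 4-vector, which has no physical significance. It is therefore not concerned in the interaction and is unaffected by the latter, so that

$$\mathcal{D}^{(l)}(k^2) = D^{(l)}(k^2). \tag{103.18}$$

The inverse tensors must, by definition, satisfy the equations

$$\mathcal{D}^{-1}_{\mu\lambda}\mathcal{D}^{\lambda\nu} = \delta_\mu^\nu, \qquad D^{-1}_{\mu\lambda}D^{\lambda\nu} = \delta_\mu^\nu.$$

When the original tensors have the form (103.16) or (103.17), the inverse tensors are, from (103.18),

$$\mathcal{D}^{-1}_{\mu\nu} = \frac{1}{\mathcal{D}}\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right) + \frac{1}{D^{(l)}}\frac{k_\mu k_\nu}{k^2}, \qquad D^{-1}_{\mu\nu} = \frac{1}{D}\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right) + \frac{1}{D^{(l)}}\frac{k_\mu k_\nu}{k^2}. \tag{103.19}$$

From these, it follows that the polarization operator $\mathcal{P}_{\mu\nu}$ is a transverse tensor:

$$\mathcal{P}_{\mu\nu} = \mathcal{P}(k^2)\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right), \tag{103.20}$$

where $\mathcal{P} = k^2 - 4\pi/\mathcal{D}$, or

$$\mathcal{D}(k^2) = \frac{4\pi}{k^2\,[1 - \mathcal{P}(k^2)/k^2]}. \tag{103.21}$$

Thus the polarization operator, unlike the photon propagator itself, is a gauge-invariant quantity.

§ 104. The self-energy function of the photon

In order to examine further the analytical properties of the photon propagator, it is useful to define, as well as the polarization operator, another auxiliary function $\Pi_{\mu\nu}(k)$, called the self-energy function of the photon: $i\Pi_{\mu\nu}/4\pi$ is defined as the sum of all self-energy photon parts (not only the compact ones). If this sum is represented in the diagram by a square, we can write the exact propagator as the sum

[diagram: thick broken line = thin broken line + two thin broken lines joined by a square]

i.e.

$$\mathcal{D}_{\mu\nu} = D_{\mu\nu} + D_{\mu\lambda}\,\frac{\Pi^{\lambda\rho}}{4\pi}\,D_{\rho\nu}. \tag{104.1}$$

Hence, expressing $\Pi_{\mu\nu}$ as

$$\frac{1}{4\pi}\Pi_{\mu\nu} = D^{-1}_{\mu\lambda}\mathcal{D}^{\lambda\rho}D^{-1}_{\rho\nu} - D^{-1}_{\mu\nu}$$

and substituting (103.16) and (103.19) followed by (103.21), we get

$$\Pi_{\mu\nu} = \Pi(k^2)\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right), \qquad \Pi = \frac{\mathcal{P}}{1 - \mathcal{P}/k^2}. \tag{104.2}$$

Thus $\Pi_{\mu\nu}$, like $\mathcal{P}_{\mu\nu}$, is a gauge-invariant tensor.

The usefulness of $\Pi_{\mu\nu}$ arises from the expression for it in the coordinate representation.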
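The tensor algebra behind (103.19) and (103.20) can be checked numerically. The sketch below assumes the metric signature $(+,-,-,-)$ and uses arbitrary sample values for $k^\mu$ and for the scalar functions; all names (`build_tensor`, `d_exact`, and so on) are hypothetical helpers introduced only for this illustration.

```python
# Metric signature (+, -, -, -); only the diagonal of g is stored.
METRIC = [1.0, -1.0, -1.0, -1.0]


def build_tensor(transverse, longitudinal, k_up):
    """Covariant tensor a*(g_mn - k_m k_n / k^2) + b*k_m k_n / k^2, cf. (103.16)."""
    k_dn = [METRIC[m] * k_up[m] for m in range(4)]
    k2 = sum(k_up[m] * k_dn[m] for m in range(4))
    tensor = []
    for m in range(4):
        row = []
        for n in range(4):
            g_mn = METRIC[m] if m == n else 0.0
            kk = k_dn[m] * k_dn[n] / k2
            row.append(transverse * (g_mn - kk) + longitudinal * kk)
        tensor.append(row)
    return tensor


def raise_indices(t_dn):
    """Raise both indices with the diagonal metric."""
    return [[METRIC[m] * t_dn[m][n] * METRIC[n] for n in range(4)] for m in range(4)]


def contract(a_dn, b_up):
    """Mixed tensor a_{m l} b^{l n}."""
    return [[sum(a_dn[m][l] * b_up[l][n] for l in range(4)) for n in range(4)]
            for m in range(4)]


# Arbitrary illustrative values (k is time-like here, k^2 = 2.45).
k = [2.0, 0.3, -0.5, 1.1]
d_free, d_exact, d_long = 1.7, 2.9, 0.4

# (103.19): the proposed inverse contracted with the propagator gives delta_m^n.
exact_up = raise_indices(build_tensor(d_exact, d_long, k))
exact_inv = build_tensor(1.0 / d_exact, 1.0 / d_long, k)
identity_check = contract(exact_inv, exact_up)

# (103.14) with equal longitudinal parts (103.18): P/4pi = D^-1 - Dexact^-1 is transverse.
free_inv = build_tensor(1.0 / d_free, 1.0 / d_long, k)
pol_over_4pi = [[free_inv[m][n] - exact_inv[m][n] for n in range(4)] for m in range(4)]
k_dot_pol = [sum(k[m] * pol_over_4pi[m][n] for m in range(4)) for n in range(4)]

print(identity_check)
print(k_dot_pol)
```

The longitudinal parts cancel in the difference of the inverse tensors, which is why the contraction $k^\mu\mathcal{P}_{\mu\nu}$ vanishes whatever value is chosen for the gauge-arbitrary function.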
The coordinate-space expression for $\Pi_{\mu\nu}$ is easily found by noting that the equation

$$\frac{1}{4\pi}\Pi_{\mu\nu}(k) = D^{-1}_{\mu\lambda}D^{-1}_{\rho\nu}\{\mathcal{D}^{\lambda\rho}(k) - D^{\lambda\rho}(k)\},$$

in which the tensor $\mathcal{D}^{\lambda\rho} - D^{\lambda\rho}$ is transverse by (103.18), can be written in the coordinate representation as

$$\Pi_{\mu\nu}(x - x') = \frac{1}{4\pi}\,(\partial_\mu\partial_\lambda - g_{\mu\lambda}\partial_\sigma\partial^\sigma)(\partial'_\nu\partial'_\rho - g_{\nu\rho}\partial'_\sigma\partial'^\sigma)\{\mathcal{D}^{\lambda\rho}(x - x') - D^{\lambda\rho}(x - x')\}.$$

In order to carry out the differentiation, we must substitute

$$\mathcal{D}^{\lambda\rho}(x - x') - D^{\lambda\rho}(x - x') = i\langle 0|TA^\lambda(x)A^\rho(x') - TA^\lambda_{\rm int}(x)A^\rho_{\rm int}(x')|0\rangle. \tag{104.3}$$

In §75 we have seen that the differentiation of a T product generally demands caution, because the product has discontinuities. But the difference that is to be averaged in (104.3) is continuous, and so are its first derivatives, since the commutation rules are the same for the components of the operators $A^\lambda(x)$ and $A^\lambda_{\rm int}(x)$ for a given time, and the corresponding discontinuities cancel out (cf. §75). The difference in (104.3) may therefore be differentiated under the symbol T. According to (102.6) and the corresponding equation with zero on the right for the free electromagnetic field operators $A^\lambda_{\rm int}(x)$, the result is

$$\Pi_{\mu\nu}(x - x') = 4\pi i e^2\langle 0|Tj_\mu(x)j_\nu(x')|0\rangle. \tag{104.4}$$

This shows explicitly the gauge-invariance of $\Pi_{\mu\nu}$, since the current operators are gauge-invariant.

From (104.4) we can derive an important integral form of this function. According to (104.2), it is sufficient to consider the scalar function $\Pi = \tfrac{1}{3}\Pi^\mu_\mu$. In the coordinate representation,

$$\Pi(x - x') = \frac{4\pi}{3}\,ie^2\langle 0|Tj_\mu(x)j^\mu(x')|0\rangle = \frac{4\pi}{3}\,ie^2 \times \begin{cases}\displaystyle\sum_n\langle 0|j_\mu(x)|n\rangle\langle n|j^\mu(x')|0\rangle & \text{for } t > t',\\[2mm] \displaystyle\sum_n\langle 0|j^\mu(x')|n\rangle\langle n|j_\mu(x)|0\rangle & \text{for } t < t',\end{cases} \tag{104.5}$$

where $n$ labels the states of the system electromagnetic field + electron-positron field.†

Since the current operator $j(x)$ depends on $x^\mu = (t, \mathbf{r})$, its matrix elements also depend on $x$. The relationship can be found explicitly by taking as the states $|n\rangle$ states which have definite values of the total 4-momentum.
The time dependence of the current matrix elements, like that of any Heisenberg operator, is given by

$$\langle n|j^\mu(t, \mathbf{r})|m\rangle = \langle n|j^\mu(\mathbf{r})|m\rangle\,e^{-i(E_m - E_n)t},$$

where $E_n$ and $E_m$ are the energies of the states $|n\rangle$ and $|m\rangle$, and $j(\mathbf{r})$ is the Schrödinger operator.

† The current operator conserves charge; hence the states $|n\rangle$ in (104.5) can contain only the same numbers of electrons and positrons.

To determine the coordinate dependence of the matrix elements, we consider the operator $j(\mathbf{r})$ as being the result of transforming the operator $j(0)$ by a parallel translation over the distance $\mathbf{r}$. The operator of this translation is $\exp(i\mathbf{r}\cdot\mathbf{P})$, where $\mathbf{P}$ is the total momentum operator of the system (see QM, (15.15)). Using the general rule for the transformation of the matrix elements (see QM, (12.7)), we therefore have

$$\langle n|j^\mu(\mathbf{r})|m\rangle = \langle n|e^{-i\mathbf{r}\cdot\mathbf{P}}j^\mu(0)e^{i\mathbf{r}\cdot\mathbf{P}}|m\rangle = \langle n|j^\mu(0)|m\rangle\,e^{i(\mathbf{P}_m - \mathbf{P}_n)\cdot\mathbf{r}}.$$

Together with the previous formula, this gives finally

$$\langle n|j^\mu(t, \mathbf{r})|m\rangle = \langle n|j^\mu(0)|m\rangle\,e^{-i(P_m - P_n)x}. \tag{104.6}$$

The matrix $\langle n|j^\mu(0)|m\rangle$ is Hermitian, like the matrix (104.6) of the entire operator $j^\mu(t, \mathbf{r})$, and according to the equation of continuity (102.7) it satisfies the transversality condition

$$(P_n - P_m)_\mu\langle n|j^\mu(0)|m\rangle = 0. \tag{104.7}$$

Let us now calculate the function $\Pi(x - x')$. Substitution of (104.6) in (104.5) gives

$$\Pi(\xi) = \frac{4\pi}{3}\,ie^2\sum_n\langle 0|j_\mu(0)|n\rangle\langle n|j^\mu(0)|0\rangle\,e^{\mp iP_n\xi} \quad\text{for } \tau \gtrless 0, \tag{104.8}$$

where $x - x' = \xi = (\tau, \boldsymbol{\xi})$. We use the notation

$$\rho(k^2) = -\frac{4\pi}{3}(2\pi)^3 e^2\sum_n\langle 0|j_\mu(0)|n\rangle\langle 0|j^\mu(0)|n\rangle^*\,\delta^{(4)}(k - P_n). \tag{104.9}$$

The sum is taken over all systems of real electron-positron pairs and photons that can be generated by a virtual photon having 4-momentum $k = (\omega, \mathbf{k})$ ($\omega > 0$), and for each such system there is summation over the internal variables (the polarizations and momenta of the particles in the centre-of-mass system).† After this summation, the function $\rho$ can depend only on $k$, and since it is a scalar it can depend only on $k^2$. In particular, it does not depend on the direction of $\mathbf{k}$.
Using these properties of $\rho$, we can rewrite (104.8) as

$$\Pi(\xi) = -i\int\frac{d^3k}{(2\pi)^3}\int_0^\infty d\omega\,\rho(k^2)\,e^{i(\mathbf{k}\cdot\boldsymbol{\xi} - \omega|\tau|)}.$$

† This definition of the states $|n\rangle$ is evidently identical with their definition as states for which the matrix elements $\langle 0|j|n\rangle$ of a charge-odd operator are non-zero.

The momentum representation is obtained by substituting

$$e^{-i\omega|\tau|} = 2i\omega\int_{-\infty}^{\infty}\frac{e^{-ik_0\tau}}{k_0^2 - \omega^2 + i0}\,\frac{dk_0}{2\pi} \tag{104.10}$$

(see §76); the result is

$$\Pi(k^2) = \int_0^\infty d\mu^2\,\rho(\mu^2)\int_0^\infty d(\omega^2)\,\frac{\delta(\mu^2 + \mathbf{k}^2 - \omega^2)}{k_0^2 - \omega^2 + i0},$$

or, finally,‡

$$\Pi(k^2) = \int_0^\infty\frac{\rho(\mu^2)\,d\mu^2}{k^2 - \mu^2 + i0}. \tag{104.11}$$

The coefficient $\rho$ in this integral form is called the spectral density of the function $\Pi(k^2)$, and has the properties

$$\rho(k^2) = 0 \ \text{ for } k^2 < 0, \qquad \rho(k^2) > 0 \ \text{ for } k^2 > 0, \tag{104.12}$$

since the 4-momentum $k$ of a virtual photon which can generate a system of real particles must necessarily be time-like; $k^2$ is equal to the square of the total energy of the particles in their centre-of-mass system. The transversality condition (104.7) gives $P_n^\mu\langle 0|j_\mu(0)|n\rangle = 0$. The 4-vector $\langle 0|j|n\rangle$ is orthogonal to the time-like 4-vector $P_n$ and must be space-like: $\langle 0|j_\mu(0)|n\rangle\langle 0|j^\mu(0)|n\rangle^* < 0$; thus, from the definition (104.9), $\rho > 0$.

‡ The formal calculations analogous to those given above require caution, on account of the presence of the divergences previously mentioned. These give rise, in particular, to the occurrence on the right of (104.11) of further divergent terms which do not have an explicitly relativistically invariant form, called Schwinger terms. They will not be written out here, since they in any case disappear on renormalization (§110) and do not affect the subsequent results.

§ 105. The exact electron propagator

The exact electron propagator, similarly to that of the photon, is defined by

$$\mathcal{G}_{ik}(x - x') = -i\langle 0|T\psi_i(x)\bar\psi_k(x')|0\rangle, \tag{105.1}$$

where $i$ and $k$ are bispinor indices, which differs from the definition (75.1) of the free-particle propagator

$$G_{ik}(x - x') = -i\langle 0|T\psi^{\rm int}_i(x)\bar\psi^{\rm int}_k(x')|0\rangle \tag{105.2}$$

in that the $\psi$-operators in the interaction representation are replaced by Heisenberg operators. The same arguments as were used to derive (103.7) lead to

$$\mathcal{G}_{ik}(x - x') = -i\,\frac{\langle 0|T\psi^{\rm int}_i(x)\bar\psi^{\rm int}_k(x')S|0\rangle}{\langle 0|S|0\rangle}. \tag{105.3}$$

The expansion of this expression in powers of $e^2$ puts the function $\mathcal{G}$ in the form of a set of diagrams with two external electron lines and various numbers of vertices. The denominator in (105.3) again has the function of retaining only the diagrams which do not have detached "vacuum loops". For example, as far as the terms in $e^4$, the graphical representation of the propagator $\mathcal{G}$ (denoted by a thick continuous line) is†

[diagram: thick continuous line = free electron line + second-order and fourth-order self-energy corrections]

The thick continuous line corresponds to the function $i\mathcal{G}(p)$ in the momentum representation, and the sets of continuous and broken lines in the diagrams on the right of the equation correspond to the free-particle propagators $iG$ and $-iD$ respectively.

† It has already been shown in §103 that there is also no need to take account of diagrams which contain "self-closed" lines; these would here appear in the second order:

[diagram: electron line joined by a photon line to a "self-closed" electron loop]

The section between two electron lines is called an electron self-energy part. As with the photon, it is said to be compact if it cannot be further subdivided into two self-energy parts by cutting a single electron line. The sum of all possible compact parts will be denoted by $-iM_{ik}$; the function $M_{ik}(p)$ is called the mass operator. For example, as far as the terms in $e^4$,

[diagram: the compact electron self-energy parts of second and fourth order]

By a summation exactly similar to the derivation of (103.13), we find

$$\mathcal{G}(p) = G(p) + G(p)M(p)\mathcal{G}(p) \tag{105.6}$$

(omitting the bispinor indices) or, for the inverse matrices,

$$\mathcal{G}^{-1}(p) = G^{-1}(p) - M(p) = \gamma p - m - M(p). \tag{105.7}$$
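The summation leading to (105.6) and (105.7) can be written out explicitly. The following is a sketch in matrix notation with the bispinor indices suppressed; it uses only the factors $iG$ for each free electron line and $-iM$ for each compact part, so that every additional compact part contributes the product $(-iM)(iG) = MG$.

```latex
\begin{align*}
\mathcal{G} &= G + GMG + GMGMG + \dots \\
            &= G + GM\,(G + GMG + \dots) = G + GM\mathcal{G}, \\
G^{-1}\mathcal{G} &= 1 + M\mathcal{G}
\quad\Longrightarrow\quad
\mathcal{G}^{-1} = G^{-1} - M .
\end{align*}
```

The last step follows on multiplying from the right by $\mathcal{G}^{-1}$; the order of the matrix factors has been kept throughout, since $G$ and $M$ need not commute.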
It has already been noted in §102 that the Heisenberg $\psi$-operators (unlike those in the interaction representation) are altered by a gauge transformation of the electromagnetic potentials. The exact electron propagator $\mathcal{G}$ is therefore also not gauge-invariant. Its gauge transformation behaviour may be derived as follows (L. D. Landau and I. M. Khalatnikov, 1952).

The change in $\mathcal{G}$ under the gauge transformation must evidently be expressed in terms of the same quantity $D^{(l)}$ as is added to the photon propagator by this transformation. This is clear, since in the calculation of $\mathcal{G}$ by the perturbation-theory diagrams each term of the series is expressed in terms of the functions $D$, and no other electromagnetic quantities are involved. The analysis can therefore be simplified: any special assumptions can be made regarding the properties of the arbitrary operator $\chi$ in the transformation (102.8), provided that the result is expressed in terms of $D^{(l)}$.

The transformation (102.8) brings the propagators $\mathcal{D}$ (103.1) and $\mathcal{G}$ (105.1) into the following forms:

$$\mathcal{D}_{\mu\nu} \to i\langle 0|T[A_\mu(x) - \partial_\mu\chi(x)][A_\nu(x') - \partial'_\nu\chi(x')]|0\rangle, \qquad \mathcal{G} \to -i\langle 0|T\psi(x)e^{ie\chi(x)}e^{-ie\chi(x')}\bar\psi(x')|0\rangle. \tag{105.8}$$

We shall now suppose that the operators $\chi$ are averaged independently of all the remaining operators in the T product. This is a reasonable assumption, since the "field" $\chi$ takes no part in the interaction, because of the gauge invariance. We also assume that the mean value, over the vacuum, of the operator $\chi$ is zero: $\langle 0|\chi|0\rangle = 0$. Then the terms in $\chi$ in (105.8) can be separated, and the result is

$$\mathcal{D}_{\mu\nu} \to \mathcal{D}_{\mu\nu} + i\langle 0|T\partial_\mu\chi(x)\cdot\partial'_\nu\chi(x')|0\rangle, \tag{105.9}$$

$$\mathcal{G} \to \mathcal{G}\,\langle 0|Te^{ie\chi(x)}e^{-ie\chi(x')}|0\rangle. \tag{105.10}$$

The rest of the derivation will be given for the case of an infinitesimal transformation, and we shall emphasize this by writing $\delta\chi$ in place of $\chi$. The transformation (105.9) may be written† (independently of the smallness of $\delta\chi$) as

$$\mathcal{D}_{\mu\nu} \to \mathcal{D}_{\mu\nu} + \delta\mathcal{D}_{\mu\nu}, \qquad \delta\mathcal{D}_{\mu\nu} = \partial_\mu\partial'_\nu\,d^{(l)}(x - x'), \tag{105.11}$$

where

$$d^{(l)}(x - x') = i\langle 0|T\delta\chi(x)\delta\chi(x')|0\rangle. \tag{105.12}$$

† Formula (105.11) can be derived from (105.9) if the function $d^{(l)}$ and its derivative with respect to $t$ are continuous at $t = t'$; if they are discontinuous, the right-hand sides of these expressions differ by delta-function terms (cf. the derivation of (75.2)). In the momentum representation, this condition is equivalent to assuming that $d^{(l)}(q)$ decreases more rapidly than $1/q^2$ as $|q| \to \infty$.

Hence it is clear that $d^{(l)}$ determines the change caused by the gauge transformation in the longitudinal part $\mathcal{D}^{(l)}$ of the photon propagator. The assumption that $d^{(l)}$ depends only on $x - x'$ implies, of course, a certain limitation on the properties of the operator $\delta\chi$; in the general case of a completely arbitrary gauge transformation, the propagator may cease to be homogeneous in space and time.

In the transformation (105.10), we expand the exponential factors in powers of $\delta\chi$ as far as the quadratic terms:

$$\langle 0|Te^{ie\delta\chi(x)}e^{-ie\delta\chi(x')}|0\rangle \approx 1 - \tfrac{1}{2}e^2\langle 0|\delta\chi^2(x) + \delta\chi^2(x') - 2T\delta\chi(x)\delta\chi(x')|0\rangle.$$

Using the definition (105.12), we thus find the following transformation rule for the electron propagator:

$$\mathcal{G} \to \mathcal{G} + \delta\mathcal{G}, \qquad \delta\mathcal{G} = ie^2\,\mathcal{G}(x - x')\,[d^{(l)}(0) - d^{(l)}(x - x')]. \tag{105.13}$$

In the momentum representation,‡ we have

$$\delta\mathcal{G}(p) = ie^2\int d^{(l)}(q)\,[\mathcal{G}(p) - \mathcal{G}(p - q)]\,\frac{d^4q}{(2\pi)^4}; \tag{105.14}$$

$d^{(l)}(q)$ is related to the change in the function $\mathcal{D}^{(l)}$ by

$$\delta\mathcal{D}^{(l)}(q) = q^2 d^{(l)}(q). \tag{105.15}$$

‡ If the function $f(x) = f_1(x)f_2(x)$, its Fourier components are

$$f(p) = \int f(x)e^{ipx}\,d^4x = \iiint f_1(q_1)f_2(q_2)\,e^{i(p - q_1 - q_2)x}\,d^4x\,\frac{d^4q_1\,d^4q_2}{(2\pi)^8} = \iint\delta^{(4)}(p - q_1 - q_2)\,f_1(q_1)f_2(q_2)\,\frac{d^4q_1\,d^4q_2}{(2\pi)^4}.$$

In deriving (105.14) from (105.13), we also use the result

$$f(x = 0) = \int f(q)\,\frac{d^4q}{(2\pi)^4}.$$

An integral expression analogous to (104.11) could be derived for the electron propagator, using the expressions

$$\langle n|\psi(x)|m\rangle = \langle n|\psi(0)|m\rangle\,e^{-i(P_m - P_n)x} \tag{105.16}$$

for the matrix elements of the $\psi$-operator, similarly to the expressions (104.6) for the current matrix elements. Unlike the current, however, the $\psi$-operators are not gauge-invariant. The coordinate dependence (105.16) is therefore not general, but applies only to some particular gauge. The same is true as regards the integral representation based on (105.16). The deeper physical reason for this situation is that the zero photon mass leads to the infra-red catastrophe (§98). In consequence, the electron emits an infinite number of soft quanta during the interaction, and this means that the "single-particle" propagator (105.1) loses much of its direct significance.

§ 106. Vertex parts

In complicated diagrams it is possible to distinguish both self-energy parts and sections of another type which are not equivalent to them. An important class of such sections is found by considering the function

$$K^\mu_{ik}(x_1, x_2, x_3) = \langle 0|TA^\mu(x_1)\psi_i(x_2)\bar\psi_k(x_3)|0\rangle, \tag{106.1}$$

which has one 4-vector index and two bispinor indices; since space-time is homogeneous, this function depends only on the differences of the arguments $x_1$, $x_2$, $x_3$. When expressed in terms of the operators in the interaction representation, the function $K$ has the form

$$K^\mu_{ik}(x_1, x_2, x_3) = \frac{\langle 0|TA^\mu_{\rm int}(x_1)\psi^{\rm int}_i(x_2)\bar\psi^{\rm int}_k(x_3)S|0\rangle}{\langle 0|S|0\rangle}. \tag{106.2}$$

The momentum representation is obtained by using the formula

$$(2\pi)^4\delta^{(4)}(p_1 + k - p_2)\,K^\mu_{ik}(p_2, p_1; k) = \iiint K^\mu_{ik}(x_1, x_2, x_3)\,e^{-i(kx_1 - p_2x_2 + p_1x_3)}\,d^4x_1\,d^4x_2\,d^4x_3. \tag{106.3}$$

In the diagram technique, the functions $K^\mu_{ik}$ correspond to three-ended (one photon and two electron) sections of the form

[diagram (106.4): a circle with one photon line $k$ and two electron lines $p_1$ (incoming) and $p_2$ (outgoing)]

where the momenta are related by the conservation law

$$p_1 + k = p_2. \tag{106.5}$$

The zero-order term in the expansion of this function is zero; the first-order term is

$$K^\mu(x_1, x_2, x_3) = e\int G(x_2 - x)\gamma_\nu G(x - x_3)\cdot D^{\nu\mu}(x_1 - x)\,d^4x$$

in the coordinate representation, and

$$K^\mu(p_2, p_1; k) = e\,G(p_2)\gamma_\nu G(p_1)\cdot D^{\nu\mu}(k) \tag{106.6}$$

in the momentum representation (omitting the bispinor indices); the corresponding diagram is

[diagram (106.7): the simple vertex with photon line $k$ and electron lines $p_1$, $p_2$]

In the subsequent approximations, the diagrams are complicated by the addition of new vertices, but not all such diagrams provide essentially new information. For instance, in the third order we have the diagrams

[diagrams (106.8): the four third-order diagrams with one photon end and two electron ends]

The first three of these can be cut (across one photon or electron line) into a simple vertex (106.7) and a second-order self-energy part; the fourth diagram cannot be thus treated. This is a general situation. The corrections of the first kind simply replace the factors $G$ and $D$ in (106.6) by the exact propagators $\mathcal{G}$ and $\mathcal{D}$. The remaining terms in the expansion give a new quantity to replace the factor $\gamma^\mu$ in (106.6). Denoting this quantity by $\Gamma^\mu$, we thus have by definition

$$K^\mu(p_2, p_1; k) = \{i\mathcal{G}(p_2)[-ie\Gamma_\nu(p_2, p_1; k)]\,i\mathcal{G}(p_1)\}\,[-i\mathcal{D}^{\nu\mu}(k)]. \tag{106.9}$$

A section joined to other parts of the diagram by one photon line and two electron lines is called a vertex part if it cannot be divided into parts joined by only one (electron or photon) line. The quantity $\Gamma^\mu$ is the sum of an infinity of vertex parts, including the simple vertex $\gamma^\mu$, and is called a vertex operator or vertex function.

The following are all the vertex-operator diagrams as far as the fifth-order quantities:

[diagrams (106.10): black dot = sum of the vertex parts (a) to (i)]

the black dot denoting the exact vertex operator $-ie\Gamma$.

The operator $\Gamma$ (like the operator $\gamma$ of the simple vertex) has two matrix (bispinor) indices and one 4-vector index; it is a function of two electron momenta ($p_1$, $p_2$) and one photon momentum ($k$).
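As a consistency check on the numerical factors in the definition (106.9), one can verify that it reduces to the first-order expression (106.6) when the exact quantities are replaced by their lowest-order values. This is a sketch under the assumption that the replacements are $\mathcal{G} \to G$, $\mathcal{D} \to D$ and $\Gamma_\nu \to \gamma_\nu$:

```latex
\begin{align*}
\{\,iG(p_2)\,[-ie\gamma_\nu]\,iG(p_1)\}\,[-iD^{\nu\mu}(k)]
  &= (i)(-i)(i)(-i)\; e\,G(p_2)\gamma_\nu G(p_1)\,D^{\nu\mu}(k) \\
  &= e\,G(p_2)\gamma_\nu G(p_1)\,D^{\nu\mu}(k).
\end{align*}
```

The four factors of $\pm i$ multiply to unity, so the factors $i$, $-ie$ and $-i$ assigned to electron lines, vertices and photon lines are exactly those which reproduce the coefficient $e$ of the first-order term.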
The three momenta of the vertex operator cannot all relate to real particles simultaneously: the diagram (106.4) in itself (not as part of a larger diagram) would correspond to the absorption of a photon by a free electron, but this process is incompatible with the conservation of the 4-momentum of real particles. Hence at least one of the three free ends of the diagram must pertain to a virtual particle (or to an external field).

The vertex parts may also be classified as reducible and irreducible. The irreducible ones are those which do not contain self-energy corrections to internal lines and in which it is not possible to separate parts which constitute (lower-order) corrections to internal vertices. For example, of the diagrams in (106.10), the only irreducible ones are (b) and (d) (apart from the simple vertex (a)). Diagrams (g), (h) and (i) contain self-energy parts; in diagram (c) the upper broken horizontal line may be regarded as a correction to the upper vertex, and in diagrams (e) and (f) the lateral broken lines may be regarded as corrections to the lateral vertices.

When the internal lines in irreducible diagrams are replaced by corresponding thick lines, and the vertices by black dots, i.e. when the approximate propagators $D$ and $G$ are replaced by the exact propagators $\mathcal{D}$ and $\mathcal{G}$ and the approximate vertex operators $\gamma$ by the exact ones $\Gamma$,† we evidently obtain the set of all vertex parts. Thus the expansion of the vertex operator may be written

[diagram (106.11): black dot = simple vertex + skeleton diagrams built from thick lines and black dots + ...]

This equation is an integral equation for $\Gamma$, with an infinity of terms on the right.

† The resulting diagrams are called skeleton diagrams.

From the above discussion we can easily derive the general principle of construction of the exact expressions for sections having any number of ends. They are obtained as vacuum mean values of T products of Heisenberg operators, with one operator $\psi(x)$ for each initial electron, one $\bar\psi(x)$ for each final electron, and one $A(x)$ for each photon.
A further example is given by diagrams of the form

[diagram (106.12): a circle with four external electron lines $p_1$, $p_2$ (incoming) and $p_3$, $p_4$ (outgoing)]

with four external electron lines. These are obtained from the function

$$K_{ik,lm}(x_1, x_2; x_3, x_4) = \langle 0|T\psi_i(x_1)\psi_k(x_2)\bar\psi_l(x_3)\bar\psi_m(x_4)|0\rangle, \tag{106.13}$$

which, of course, depends only on the differences of the four arguments. Its Fourier components may be written

$$\int K_{ik,lm}(x_1, x_2; x_3, x_4)\,e^{i(p_3x_1 + p_4x_2 - p_1x_3 - p_2x_4)}\,d^4x_1\,d^4x_2\,d^4x_3\,d^4x_4 = (2\pi)^4\delta^{(4)}(p_1 + p_2 - p_3 - p_4)\,K_{ik,lm}(p_3, p_4; p_1, p_2), \tag{106.14}$$

with

$$K_{ik,lm}(p_3, p_4; p_1, p_2) = (2\pi)^4\delta^{(4)}(p_1 - p_3)\,\mathcal{G}_{il}(p_1)\mathcal{G}_{km}(p_2) - (2\pi)^4\delta^{(4)}(p_2 - p_3)\,\mathcal{G}_{kl}(p_1)\mathcal{G}_{im}(p_2) + \mathcal{G}_{in}(p_3)\mathcal{G}_{kr}(p_4)\,[-i\Gamma_{nr,st}(p_3, p_4; p_1, p_2)]\,\mathcal{G}_{sl}(p_1)\mathcal{G}_{tm}(p_2). \tag{106.15}$$

In the latter expression, the first two terms exclude from the definition of the function $\Gamma(p_3, p_4; p_1, p_2)$ diagrams which fall into two disconnected parts, each having two free ends:

[diagrams: two disconnected thick electron lines, joining $p_1$ to $p_3$ and $p_2$ to $p_4$, or $p_1$ to $p_4$ and $p_2$ to $p_3$]

In the third term the factors $\mathcal{G}$ exclude from the definition of $\Gamma$ those parts of diagrams which are corrections to external electron lines.

From the properties of the T product of the Fermi $\psi$-operators, it follows that the functions $\Gamma(p_3, p_4; p_1, p_2)$ are antisymmetric:

$$\Gamma_{ik,lm}(p_3, p_4; p_1, p_2) = -\Gamma_{ki,lm}(p_4, p_3; p_1, p_2) = -\Gamma_{ik,ml}(p_3, p_4; p_2, p_1). \tag{106.16}$$

If the momenta $p_1$, $p_2$, $p_3$, $p_4$ correspond to real particles, the non-separating (i.e. connected) diagrams (106.12) represent the scattering of two electrons. The scattering amplitude is found by assigning the wave amplitudes of the particles (instead of the propagators $\mathcal{G}$) to the free ends of the diagram:†

$$iM_{fi} = \bar u_i(p_3)\bar u_k(p_4)\,[-ie\Gamma_{ik,lm}(p_3, p_4; p_1, p_2)]\,u_l(p_1)u_m(p_2). \tag{106.17}$$

According to (106.16) this amplitude must have the appropriate antisymmetry with respect to interchanges of electrons.

§ 107. Dyson's equations

The exact propagators and the vertex part satisfy certain integral relations, the origin of which is particularly clear if the diagram technique is used.
The concepts of reducibility and irreducibility defined in §106 can be applied not only to vertex parts but also to any other diagrams or parts thereof. Let us consider from this aspect the compact self-energy electron diagrams. It is easily seen that only one diagram out of this infinity is irreducible, namely the second-order diagram

[diagram: second-order electron self-energy part]

Any complication of this diagram can be regarded as the application of further corrections to its internal (electron or photon) lines or to one of its vertices. Here it is important to note that, owing to the obvious symmetry of the diagram, any vertex correction need be assigned only to either one or the other vertex.‡ Since, therefore, only one of the compact self-energy electron parts is irreducible, the ensemble of all such parts (i.e. the mass operator $M$) is represented by only one skeleton diagram:

[diagram (107.1): $-iM$ = second-order self-energy diagram with thick internal lines and one black-dot vertex]

† It will be seen later (§110) that the self-energy parts in the free ends can be ignored in deriving the amplitudes of real processes.

‡ For clarity, it should be emphasized that, although all the required diagrams are found by applying corrections to only one vertex, for any particular diagram the structure of the correction section in general depends on the vertex to which it is assigned, for example

[diagrams: the same fourth-order diagram drawn twice, with a square enclosing a different section in each]

where, in identical diagrams, the squares enclose sections which form the vertex part when it is assigned to the right-hand and left-hand vertices respectively.

In analytical form, this graphical equation becomes†

$$M(p) = G^{-1}(p) - \mathcal{G}^{-1}(p) = -ie^2\int\gamma^\nu\mathcal{G}(p + k)\Gamma^\mu(p + k, p; k)\cdot\mathcal{D}_{\mu\nu}(k)\,\frac{d^4k}{(2\pi)^4}. \tag{107.2}$$

A similar expression can be derived for the polarization operator $\mathcal{P}_{\mu\nu}$. Again only one of the compact self-energy photon parts is irreducible, and $\mathcal{P}_{\mu\nu}$ is therefore represented by a single skeleton diagram:

[diagram (107.3): $i\mathcal{P}/4\pi$ = electron loop with thick lines and one black-dot vertex]

The corresponding analytical equation is

$$\frac{\mathcal{P}_{\mu\nu}(k)}{4\pi} = D^{-1}_{\mu\nu}(k) - \mathcal{D}^{-1}_{\mu\nu}(k) = ie^2\operatorname{tr}\int\gamma_\mu\mathcal{G}(p + k)\Gamma_\nu(p + k, p; k)\mathcal{G}(p)\,\frac{d^4p}{(2\pi)^4}; \tag{107.4}$$

the bispinor indices are omitted from (107.2) and (107.4).
The relations (107.2) and (107.4) are called Dyson's equations; they can also be obtained by direct calculation. For example, to derive (107.2) we consider the quantity

$$(\gamma p - m)_{il}\,\mathcal{G}_{lk}(x - x') = -i(\gamma p - m)_{il}\langle 0|T\psi_l(x)\bar\psi_k(x')|0\rangle,$$

where $p = i\partial$ is the operator of differentiation with respect to $x$, which is found from (102.5) in exactly the same way as was done when deriving (75.7) for the free-particle propagator. The result is

$$(\gamma p - m)_{il}\,\mathcal{G}_{lk}(x - x') = -ie\gamma^\nu_{il}\langle 0|TA_\nu(x)\psi_l(x)\bar\psi_k(x')|0\rangle + \delta_{ik}\delta^{(4)}(x - x');$$

the delta-function term on the right is the same as in (75.7), since the commutation properties at $t = t'$ are the same for $\psi$-operators in the Heisenberg and interaction representations. The first term is $-ie(\gamma_\nu)_{il}K^\nu_{lk}(x, x, x')$, and we can thus write (again omitting the bispinor indices)

$$(\gamma p - m)\,\mathcal{G}(x - x') = -ie\gamma_\nu K^\nu(x, x, x') + \delta^{(4)}(x - x'). \tag{107.5}$$

To obtain the Fourier components we note that, if the definition (106.3) is integrated over $d^4k\,d^4p_2/(2\pi)^8$, the result is

$$\int K^\mu(p + k, p; k)\,\frac{d^4k}{(2\pi)^4} = \int K^\mu(0, 0, x_3)\,e^{-ipx_3}\,d^4x_3 = \int K^\mu(x, x, x')\,e^{ip(x - x')}\,d^4(x - x'), \tag{107.6}$$

from which it is seen that the integral on the left is the Fourier component of $K^\mu(x, x, x')$. Thus, by taking the Fourier components of both sides of (107.5), and using the definition (106.9) and the formula $\gamma p - m = G^{-1}(p)$, we find

$$G^{-1}(p)\,\mathcal{G}(p) = 1 - ie^2\int\gamma^\nu\mathcal{G}(p + k)\Gamma^\mu(p + k, p; k)\mathcal{G}(p)\mathcal{D}_{\mu\nu}(k)\,\frac{d^4k}{(2\pi)^4}.$$

Finally, multiplying this on the right by $\mathcal{G}^{-1}(p)$, we obtain (107.2).

† If the exact vertex part is assigned to the left-hand vertex in (107.1), the factors $\gamma$ and $\Gamma$ are interchanged in (107.2). The two forms of the equation are, of course, essentially equivalent.

§ 108. Ward's identity

Another relationship between the electron propagator and the vertex part, simpler than Dyson's equation, follows from gauge invariance. To derive it, we apply the gauge transformation (102.8), assuming that $\chi(x) \equiv \delta\chi(x)$ is an infinitesimal non-operator function of the 4-coordinates $x$.
Then the change in the electron propagator is

$$\delta\mathcal{G}(x, x') = ie\,\mathcal{G}(x - x')\,[\delta\chi(x) - \delta\chi(x')]. \tag{108.1}$$

Note that this gauge transformation violates the homogeneity of space-time, and the function $\delta\mathcal{G}$ depends on the arguments $x$ and $x'$ separately, not only on the difference $x - x'$. Its Fourier expansion must therefore be made in the variables $x$ and $x'$ separately. Thus, in the momentum representation $\delta\mathcal{G}$ is a function of two 4-momenta:

$$\delta\mathcal{G}(p_2, p_1) = \iint\delta\mathcal{G}(x, x')\,e^{ip_2x - ip_1x'}\,d^4x\,d^4x'.$$

Substituting (108.1) and integrating over $d^4x\,d^4\xi$ or $d^4\xi\,d^4x'$ ($\xi = x - x'$), we get

$$\delta\mathcal{G}(p + q, p) = ie\,\delta\chi(q)\,[\mathcal{G}(p) - \mathcal{G}(p + q)]. \tag{108.2}$$

With the same gauge transformation, the operator $A_\mu(x)$ is augmented by the function

$$\delta A^{(e)}_\mu(x) = -\frac{\partial}{\partial x^\mu}\,\delta\chi, \tag{108.3}$$

which may be regarded as an infinitesimal external field. In the momentum representation,

$$\delta A^{(e)}_\mu(q) = iq_\mu\,\delta\chi(q). \tag{108.4}$$

The quantity $\delta\mathcal{G}$ can also be calculated as the change in the propagator under the action of this field. As far as quantities of the first order in $\delta\chi$, this change can evidently be represented by a single skeleton diagram:

[diagram: thick electron lines $p$ and $p+q$ joined at a black-dot vertex to a thick broken external-field line]

The thick broken line is the effective external-field line, corresponding to the factor (see (103.15))

$$\delta A^{(e)}_\mu(q) + \delta A^{(e)}_\lambda(q)\,\frac{\mathcal{P}^{\lambda\nu}(q)}{4\pi}\,\mathcal{D}_{\nu\mu}(q).$$

The 4-vector $\delta A^{(e)}_\lambda(q)$ is longitudinal (with respect to $q$) and the tensor $\mathcal{P}^{\lambda\nu}$ is transverse. The second term is therefore zero, leaving

[diagram: the same diagram with a thin broken external-field line]

where the thin broken line corresponds, in the usual manner, to the field $\delta A^{(e)}$ simply. In the analytical form,

$$\delta\mathcal{G} = e\,\mathcal{G}(p + q)\Gamma^\mu(p + q, p; q)\mathcal{G}(p)\cdot\delta A^{(e)}_\mu(q). \tag{108.6}$$

Substituting (108.4) and comparing with (108.2), we get

$$\mathcal{G}(p + q) - \mathcal{G}(p) = -\mathcal{G}(p + q)\,q_\mu\Gamma^\mu(p + q, p; q)\,\mathcal{G}(p)$$

or, in terms of the inverse matrices,

$$\mathcal{G}^{-1}(p + q) - \mathcal{G}^{-1}(p) = q_\mu\Gamma^\mu(p + q, p; q) \tag{108.7}$$

(H. S. Green, 1953). Taking the limit of this equation as $q \to 0$ and equating coefficients when $q_\mu$ is infinitesimal, we get

$$\frac{\partial}{\partial p_\mu}\,\mathcal{G}^{-1}(p) = \Gamma^\mu(p, p; 0). \tag{108.8}$$

This is Ward's identity (J. C. Ward, 1950).
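The relation preceding (108.7) can be checked numerically in the zero-order approximation, where $\mathcal{G} \to G(p) = (\gamma p + m)/(p^2 - m^2)$ and $\Gamma^\mu \to \gamma^\mu$. The sketch below assumes the standard (Dirac) representation of the $\gamma$-matrices and the metric $(+,-,-,-)$; the helper names and the sample momenta are hypothetical and chosen only so that both propagators are off the mass shell.

```python
# Dirac-representation gamma matrices, gamma^0 .. gamma^3 (assumed convention).
I = 1j
GAMMA = [
    [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]],
    [[0, 0, 0, 1], [0, 0, 1, 0], [0, -1, 0, 0], [-1, 0, 0, 0]],
    [[0, 0, 0, -I], [0, 0, I, 0], [0, I, 0, 0], [-I, 0, 0, 0]],
    [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]],
]
METRIC = [1, -1, -1, -1]  # signature (+, -, -, -)


def slash(p):
    """gamma^mu p_mu for contravariant components p = (p0, p1, p2, p3)."""
    return [[sum(METRIC[mu] * p[mu] * GAMMA[mu][r][c] for mu in range(4))
             for c in range(4)] for r in range(4)]


def matmul(a, b):
    return [[sum(a[r][l] * b[l][c] for l in range(4)) for c in range(4)]
            for r in range(4)]


def free_propagator(p, m):
    """G(p) = (gamma.p + m) / (p^2 - m^2), the inverse of gamma.p - m."""
    p2 = sum(METRIC[mu] * p[mu] * p[mu] for mu in range(4))
    s = slash(p)
    return [[(s[r][c] + (m if r == c else 0)) / (p2 - m * m)
             for c in range(4)] for r in range(4)]


# Illustrative off-shell momenta and mass.
m = 1.0
p = [2.0, 0.2, -0.4, 0.7]
q = [0.5, -0.1, 0.3, 0.2]
p_plus_q = [p[mu] + q[mu] for mu in range(4)]

g_p = free_propagator(p, m)
g_pq = free_propagator(p_plus_q, m)

# Left side: G(p+q) - G(p); right side: -G(p+q) (gamma.q) G(p).
lhs = [[g_pq[r][c] - g_p[r][c] for c in range(4)] for r in range(4)]
prod = matmul(matmul(g_pq, slash(q)), g_p)
rhs = [[-prod[r][c] for c in range(4)] for r in range(4)]

max_diff = max(abs(lhs[r][c] - rhs[r][c]) for r in range(4) for c in range(4))
print(max_diff)
```

For the free propagator the identity holds for finite $q$, not only in the limit $q \to 0$, because $G^{-1}(p) = \gamma p - m$ is linear in $p$.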
Ward's identity (108.8) shows that the momentum derivative of $\mathcal{G}^{-1}(p)$ is equal to the vertex operator with zero momentum transfer.† The derivative of the function $\mathcal{G}(p)$ itself is

$$-\frac{\partial}{\partial p_\mu}\,i\mathcal{G}(p) = i\mathcal{G}(p)\,[-i\Gamma^\mu(p, p; 0)]\,i\mathcal{G}(p). \tag{108.9}$$

The higher derivatives could be found similarly by continuing the calculations to higher orders in $\delta\chi$, but we shall not need these expressions.

† In the zero-order approximation, i.e. for the free-particle propagator, this identity is obvious: $G^{-1}(p) = \gamma p - m$, and therefore $\partial G^{-1}/\partial p_\mu = \gamma^\mu$.

Let us now consider the derivative $\partial\mathcal{P}(k)/\partial k_\mu$ of the polarization operator. Unlike $\mathcal{G}(p)$, $\mathcal{P}(k)$ is gauge-invariant and is unchanged by the application of the fictitious external field (108.4). Its derivative therefore cannot be calculated in the same way, but a diagram expression can be obtained for this derivative too.

To do so, we consider the first diagram in the definition of $\mathcal{P}$, the second-order diagram

[diagram (108.10): second-order contribution to $i\mathcal{P}/4\pi$, an electron loop with lines $p$ and $p+k$]

The continuous lines correspond to the factors $iG(p)$ and $iG(p + k)$. Differentiation with respect to $k$ replaces the second factor by $i\,\partial G(p + k)/\partial k$, and according to the identity (108.9) this change is equivalent to adding a further vertex on the electron line:

[diagram (108.11): the same loop with an additional vertex carrying $k' = 0$]

We see that, in the first non-vanishing order, the required derivative has been expressed in terms of a diagram having three photon ends. It must be stressed immediately that this diagram does not itself give the amplitude for the transformation of one photon into two. The amplitude of this process is the sum of (108.11) and a similar diagram in which the loop is traversed in the other direction, and the sum is zero by Furry's theorem. The diagram (108.11) is not itself zero.

In a similar manner, we can differentiate more complicated diagrams by successively adding vertices with $k' = 0$ on all the electron lines which depend on $k$. There are, however, diagrams in which the dependence on $k$ occurs in the internal photon lines also, for instance the diagram on the left in the next equation:

[diagram equation: derivative with respect to $k$ of a diagram whose internal photon line depends on $k$]

The derivative of the diagram in the braces is shown here in diagram form by means of a new graphical symbolism, a fictitious three-particle photon vertex, i.e. a point where three broken lines meet, corresponding to the quantity

$$4\pi i\,\frac{\partial D^{-1}}{\partial k^\mu} = 2ik_\mu. \tag{108.12}$$

We can now differentiate any diagram by adding, to the lines depending on $k$, vertices of the type (108.12) or $\gamma^\mu$ and continuing in accordance with the general rules. Summation of these higher-order corrections gives

[equation (108.13): not legible in the source]

where the function appearing on the right (multiplied by $ie$) is the sum of the internal parts of all the diagrams with three photon ends thus obtained.

We shall also need the second derivative of the polarization operator. Differentiating the equation (108.13) once more in a similar manner, we have

[equation (108.14): not legible in the source]

where the new function appearing on the right (multiplied by $ie$) is the sum of the internal parts of all the diagrams with four photon ends such as

[diagram (108.15): an electron loop with four photon ends]

including, of course, those containing the fictitious three-particle vertices (108.12).

§ 109. Electron propagators in an external field

If a system is in a given external field $A^{(e)}(x)$, the exact electron propagator is expressed by the same formula (105.1), but in the Hamiltonian $H = H_0 + V$ which converts to the Heisenberg representation of operators we have also the interaction between the electrons and the external field:

$$V = e\int A_\mu j^\mu\,d^3x + e\int A^{(e)}_\mu j^\mu\,d^3x. \tag{109.1}$$

Since the external field makes space and time no longer homogeneous, the propagator $\mathcal{G}(x, x')$ will now depend on the two arguments $x$ and $x'$ separately and not only on the difference $x - x'$.

If we proceed in the usual manner to the interaction representation, the ordinary diagram technique is obtained, with external-field lines as well as virtual photon lines.
This technique is, however, unsuitable when the external field cannot be regarded as a small perturbation, in particular when the particles may be in bound states in the field. Now the electron propagator in an external field is in fact required principally for the analysis of the properties of bound states, and in particular for determining the energy levels with allowance for radiative corrections.

In order to derive such a propagator, we have to start from a representation of operators where the external field is exactly taken into account even in the "zero-order" approximation with respect to the electron-photon interaction (W. H. Furry, 1951).

We shall henceforward assume the external field to be independent of time. The desired representation of the $\psi$-operators is given by the formulae (32.9) for second quantization in an external field:

$$\hat\psi^{(e)}(t,\mathbf{r}) = \sum_n \left\{\hat a_n\,\psi_n^{(+)}(\mathbf{r})\,e^{-i\varepsilon_n^{(+)}t} + \hat b_n^{+}\,\psi_n^{(-)}(\mathbf{r})\,e^{i\varepsilon_n^{(-)}t}\right\},$$ (109.2)

where $\psi_n^{(\pm)}(\mathbf{r})$ and $\varepsilon_n^{(\pm)}$ are the wave functions and energy levels of the electron and the positron respectively, which are solutions of the "single-particle" problem, i.e. of Dirac's equation for a particle in a field.

It is easily seen that the operators (109.2) are $\psi$-operators in a certain representation (the Furry representation) which is, as it were, intermediate between the Heisenberg and interaction representations. They may be written

$$\hat\psi^{(e)}(t,\mathbf{r}) = e^{i\hat H_1 t}\,\hat\psi(\mathbf{r})\,e^{-i\hat H_1 t},\qquad \hat{\bar\psi}^{(e)}(t,\mathbf{r}) = e^{i\hat H_1 t}\,\hat{\bar\psi}(\mathbf{r})\,e^{-i\hat H_1 t},$$ (109.3)

where

$$\hat H_1 = \hat H_0 + e\int A^{(e)}_\mu(x)\,\hat j^\mu(x)\,d^3x.$$

The electromagnetic-field operator $\hat A_\mu$ of course commutes with the second term in $\hat H_1$, and so the Furry representation is the same as the interaction representation for this operator.

The electron propagator in the zero-order approximation, in the new representation, is defined as

$$G^{(e)}(x,x') = -i\,\langle 0|\,T\,\hat\psi^{(e)}(x)\,\hat{\bar\psi}^{(e)}(x')\,|0\rangle.$$
(109.4)

The operator $\hat\psi^{(e)}(t,\mathbf{r})$ satisfies Dirac's equation in the external field:

$$[\gamma\hat p - e\gamma A^{(e)}(x) - m]\,\hat\psi^{(e)}(t,\mathbf{r}) = 0,$$ (109.5)

and the function $G^{(e)}$ correspondingly satisfies the equation

$$[\gamma\hat p - e\gamma A^{(e)}(x) - m]\,G^{(e)}(x,x') = \delta^{(4)}(x-x');$$ (109.6)

cf. the derivation of (107.5).

The diagram technique, which expresses the exact propagator $\mathcal{G}$ as a series in powers of $e^2$, is obtained by changing from the Heisenberg to the Furry representation, in exactly the same way as the earlier change to the interaction representation. The resulting diagrams are of the same form, with the continuous lines now corresponding to factors $iG^{(e)}$ instead of $iG$.

One slight difference in the rules for writing the analytical expressions for the diagrams arises because in the coordinate representation $G^{(e)}$ is not a function of the difference $x - x'$ only. In a constant external field, however, the homogeneity of time is preserved, and so the times $t$ and $t'$ will again appear only as the difference $t - t' = \tau$: $G^{(e)} = G^{(e)}(\tau, \mathbf{r}, \mathbf{r}')$. The momentum representation is obtained by a Fourier expansion with respect to each of the arguments of the function:

$$G^{(e)}(\tau,\mathbf{r},\mathbf{r}') = \iiint e^{i(\mathbf{p}_2\cdot\mathbf{r} - \mathbf{p}_1\cdot\mathbf{r}' - \varepsilon\tau)}\,G^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1)\,\frac{d\varepsilon}{2\pi}\,\frac{d^3p_1}{(2\pi)^3}\,\frac{d^3p_2}{(2\pi)^3}.$$ (109.7)

Each line corresponding to the factor $iG^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1)$ must now be assigned one value of the virtual energy $\varepsilon$ and two values of the momentum, the initial value $\mathbf{p}_1$ and the final value $\mathbf{p}_2$:

$iG^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1)$ = [diagram: continuous line carrying energy $\varepsilon$ from $\mathbf{p}_1$ to $\mathbf{p}_2$] (109.8)

This leads to the rule for writing the analytical expressions, in which the integration over $d\varepsilon/2\pi$ is normal, but those over $d^3p_1/(2\pi)^3$ and $d^3p_2/(2\pi)^3$ are independent, the conservation of momentum at each vertex being taken into account. For example,

$$[\text{diagram}] = e^2\iiint G^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}'')\,\gamma^\mu\,G^{(e)}(\varepsilon-\omega,\mathbf{p}''-\mathbf{k},\mathbf{p}'-\mathbf{k})\,\gamma^\nu\,G^{(e)}(\varepsilon,\mathbf{p}',\mathbf{p}_1)\,D_{\mu\nu}(\omega,\mathbf{k})\,\frac{d^4k}{(2\pi)^4}\,\frac{d^3p'}{(2\pi)^3}\,\frac{d^3p''}{(2\pi)^3}.$$
(109.9)

It is important to note that in this technique one must also take account of diagrams with "self-closed" electron lines, which in the ordinary technique are rejected as being associated with a "vacuum current". When an external field is present, this current need not be zero, because of the "vacuum polarization" caused by the field. For instance, in the diagram

[diagram with a self-closed electron loop at the top] (109.10)

the loop at the top corresponds to the factor

[equation (109.11) is missing from the source text]

Here, however, we must still specify the meaning of the integral over $d\omega$. This is because the integration of the Fourier component of the function $G^{(e)}(\tau)$ with respect to $\omega$ amounts to taking the value of that function at $\tau = 0$, and $G^{(e)}(\tau)$ is discontinuous at $\tau = 0$; we must therefore indicate which of its two limiting values is to be taken. To resolve this question, we need only note that the integral (109.11) arises from the contraction of $\psi$-operators in the same current operator:

$$\hat j^\mu = \hat{\bar\psi}^{(e)}(t,\mathbf{r})\,\gamma^\mu\,\hat\psi^{(e)}(t,\mathbf{r}),$$

where $\hat{\bar\psi}$ is to the left of $\hat\psi$. According to the definition of the propagator (109.4), this order of factors for $t = t'$ is obtained if $t'$ is taken as $t + 0$, i.e. if the limiting value of the function $G^{(e)}(t - t')$ as $t - t' \to -0$ is taken. In other words, the integral over $d\omega/2\pi$ in (109.11) is to be taken as

$$\int \ldots\, e^{-i\omega\tau}\,\frac{d\omega}{2\pi}\quad\text{for } \tau\to -0.$$ (109.12)

The mass operator in the external field is defined as in §105: $-i\mathcal{M}$ is the sum of all the compact self-energy parts. It is now a function of the energy $\varepsilon$ and the momenta $\mathbf{p}_1$ and $\mathbf{p}_2$ at the ends of the external lines where they respectively enter and leave the part in question:

[diagram] $= -i\mathcal{M}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1)$ (109.13)

Proceeding exactly as in the derivation of (105.6), we get the equation

$$\mathcal{G}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1) - G^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1) = \iint G^{(e)}(\varepsilon,\mathbf{p}_2,\mathbf{p}'')\,\mathcal{M}(\varepsilon,\mathbf{p}'',\mathbf{p}')\,\mathcal{G}(\varepsilon,\mathbf{p}',\mathbf{p}_1)\,\frac{d^3p'}{(2\pi)^3}\,\frac{d^3p''}{(2\pi)^3}.$$
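(109.14)

This can be put in a more natural form by returning to the coordinate representation in terms of the spatial variables, using the function

$$\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}') = \iint \mathcal{G}(\varepsilon,\mathbf{p}_2,\mathbf{p}_1)\,e^{i(\mathbf{p}_2\cdot\mathbf{r} - \mathbf{p}_1\cdot\mathbf{r}')}\,\frac{d^3p_1\,d^3p_2}{(2\pi)^6},$$ (109.15)

and similarly for the other quantities. Taking the inverse Fourier transform of (109.14), we obtain

$$\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}') - G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}') = \iint G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}_2)\,\mathcal{M}(\varepsilon,\mathbf{r}_2,\mathbf{r}_1)\,\mathcal{G}(\varepsilon,\mathbf{r}_1,\mathbf{r}')\,d^3x_1\,d^3x_2.$$

Next we apply to both sides the operator $\gamma^0\varepsilon - \boldsymbol\gamma\cdot\hat{\mathbf{p}} - e\gamma A^{(e)}(\mathbf{r})$, where $\varepsilon$ is a number, and $\hat{\mathbf{p}} = -i\nabla$ is the operator of differentiation with respect to the coordinates $\mathbf{r}$. Here it must be noted that, by (109.6),

$$[\gamma^0\varepsilon - \boldsymbol\gamma\cdot\hat{\mathbf{p}} - e\gamma A^{(e)}(\mathbf{r})]\,G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}') = \delta(\mathbf{r}-\mathbf{r}').$$ (109.16)

The resulting equation is

$$[\gamma^0\varepsilon - \boldsymbol\gamma\cdot\hat{\mathbf{p}} - e\gamma A^{(e)}(\mathbf{r})]\,\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}') - \int \mathcal{M}(\varepsilon,\mathbf{r},\mathbf{r}_1)\,\mathcal{G}(\varepsilon,\mathbf{r}_1,\mathbf{r}')\,d^3x_1 = \delta(\mathbf{r}-\mathbf{r}').$$ (109.17)

The function $\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}')$ has the especially valuable property that its poles determine the energy levels of the electron in the external field. We shall prove this first for the approximate function $G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}')$.

Substituting the operators (109.2) in the definition of the propagator (109.4), we obtain, in exact analogy to formulae (75.12) for the free-particle propagator,

$$G^{(e)}(t-t',\mathbf{r},\mathbf{r}') = \begin{cases} -i\sum_n \psi_n^{(+)}(\mathbf{r})\,\bar\psi_n^{(+)}(\mathbf{r}')\exp\{-i\varepsilon_n^{(+)}(t-t')\}, & t>t',\\[4pt] \phantom{-}i\sum_n \psi_n^{(-)}(\mathbf{r})\,\bar\psi_n^{(-)}(\mathbf{r}')\exp\{i\varepsilon_n^{(-)}(t-t')\}, & t<t', \end{cases}$$ (109.18)

and the Fourier time component is

$$G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}') = \sum_n\left\{\frac{\psi_n^{(+)}(\mathbf{r})\,\bar\psi_n^{(+)}(\mathbf{r}')}{\varepsilon - \varepsilon_n^{(+)} + i0} + \frac{\psi_n^{(-)}(\mathbf{r})\,\bar\psi_n^{(-)}(\mathbf{r}')}{\varepsilon + \varepsilon_n^{(-)} - i0}\right\}.$$ (109.19)

We see that $G^{(e)}(\varepsilon,\mathbf{r},\mathbf{r}')$, as an analytic function of $\varepsilon$, has poles on the positive real axis which coincide with the electron energy levels, and poles on the negative real axis which coincide with the positron energy levels. The values $\varepsilon_n^{(\pm)} > m$ form a continuous spectrum,† and the corresponding poles form two cuts in the $\varepsilon$-plane, from $-\infty$ to $-m$ and from $m$ to $\infty$. The segment $|\varepsilon| < m$ contains poles which give the discrete energy levels.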
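The statement that the poles of the propagator sit at the energy levels can be illustrated with a finite-dimensional toy model. The sketch below is not a Dirac calculation: it assumes a 5×5 real symmetric matrix `H` with hand-picked eigenvalues as a stand-in for the single-particle Hamiltonian, uses $\psi^\dagger$ in place of $\bar\psi$, and contains no positron states. The names `spectral_green`, `resolvent`, `chosen_levels` and the small width `eta` (playing the role of $+i0$) are illustrative assumptions. It checks that the sum over states, of the same form as the electron term of the expansion, coincides with the inverse of $\varepsilon + i\eta - H$, and that this function becomes large only near an eigenvalue.

```python
import numpy as np

def spectral_green(eps, levels, states, eta):
    """Sum over n of psi_n psi_n^dagger / (eps - eps_n + i*eta)."""
    dim = states.shape[0]
    g = np.zeros((dim, dim), dtype=complex)
    for e_n, psi in zip(levels, states.T):
        g += np.outer(psi, psi.conj()) / (eps - e_n + 1j * eta)
    return g

def resolvent(eps, h, eta):
    """Direct inverse of (eps + i*eta - H)."""
    return np.linalg.inv((eps + 1j * eta) * np.eye(h.shape[0]) - h)

# Toy "Hamiltonian" with known discrete levels (hypothetical numbers).
rng = np.random.default_rng(0)
q, _ = np.linalg.qr(rng.normal(size=(5, 5)))      # random orthogonal basis
chosen_levels = np.array([-0.8, -0.3, 0.1, 0.5, 0.9])
H = q @ np.diag(chosen_levels) @ q.T
levels, states = np.linalg.eigh(H)

eta = 1e-3
# Away from the levels the two constructions agree.
max_diff = np.abs(spectral_green(0.3, levels, states, eta)
                  - resolvent(0.3, H, eta)).max()
# The norm is of order 1/eta at a level and of order 1/(level spacing) between levels.
near = np.linalg.norm(resolvent(levels[2], H, eta))
far = np.linalg.norm(resolvent(0.3, H, eta))
print(max_diff, near / far)
```

In this finite model the poles are isolated; the cuts described in the text arise only when the spectrum has a continuous part.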
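For the exact propagator $\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}')$ we can obtain a similar expansion by expressing it in terms of the matrix elements of Schrödinger operators; the matrix elements of Heisenberg $\psi$-operators are related to these by

$$\langle m|\hat\psi(t,\mathbf{r})|n\rangle = \langle m|\hat\psi(\mathbf{r})|n\rangle\,e^{-i(E_n - E_m)t}.$$ (109.20)

Here the $E_n$ are the exact energy levels (i.e. with all radiative corrections) of the system in the external field. The operator $\hat\psi$ increases the charge of the system by 1 (i.e. by $+|e|$), and $\hat{\bar\psi}$ decreases it by one. This means that in the matrix elements $\langle n|\hat\psi|0\rangle$ and $\langle 0|\hat{\bar\psi}|n\rangle$ the states $|n\rangle$ must correspond to a charge of the system of $+1$, i.e. they can contain, besides a single positron, only a certain number of electron-positron pairs and a certain number of photons; the energies of these states will be denoted by $E_n^{(-)}$. Similarly, in the matrix elements $\langle 0|\hat\psi|n\rangle$ and $\langle n|\hat{\bar\psi}|0\rangle$ the states $|n\rangle$ contain one electron and some pairs and photons (energy $E_n^{(+)}$). Instead of (109.18) we now have

$$\mathcal{G}(t-t',\mathbf{r},\mathbf{r}') = \begin{cases} -i\sum_n \langle 0|\hat\psi(\mathbf{r})|n\rangle\langle n|\hat{\bar\psi}(\mathbf{r}')|0\rangle \exp\{-iE_n^{(+)}(t-t')\}, & t>t',\\[4pt] \phantom{-}i\sum_n \langle 0|\hat{\bar\psi}(\mathbf{r}')|n\rangle\langle n|\hat\psi(\mathbf{r})|0\rangle \exp\{iE_n^{(-)}(t-t')\}, & t<t', \end{cases}$$ (109.21)

and hence

$$\mathcal{G}(\varepsilon,\mathbf{r},\mathbf{r}') = \sum_n\left\{\frac{\langle 0|\hat\psi(\mathbf{r})|n\rangle\langle n|\hat{\bar\psi}(\mathbf{r}')|0\rangle}{\varepsilon - E_n^{(+)} + i0} + \frac{\langle 0|\hat{\bar\psi}(\mathbf{r}')|n\rangle\langle n|\hat\psi(\mathbf{r})|0\rangle}{\varepsilon + E_n^{(-)} - i0}\right\}.$$ (109.22)

Let $\varepsilon$ be close to one of the discrete energy levels $E_n^{(+)}$ (or $-E_n^{(-)}$). Then only the corresponding pole term need be retained in the sum (109.22). Substitution in (109.17) shows that the factors which depend on the second argument $\mathbf{r}'$ (when $\mathbf{r} \ne \mathbf{r}'$) do not appear in the equation. The result is a homogeneous integro-differential equation for the function $\langle 0|\hat\psi(\mathbf{r})|n\rangle$ (or $\langle n|\hat\psi(\mathbf{r})|0\rangle$), which we denote for brevity by $\Psi_n(\mathbf{r})$.‡ Omitting the subscript $n$, we have

$$[\gamma^0\varepsilon + i\boldsymbol\gamma\cdot\nabla - e\gamma A^{(e)}(\mathbf{r})]_{ik}\,\Psi_k(\mathbf{r}) - \int \mathcal{M}_{ik}(\varepsilon,\mathbf{r},\mathbf{r}_1)\,\Psi_k(\mathbf{r}_1)\,d^3x_1 = 0$$ (109.23)

(J. Schwinger, 1951). The discrete energy levels $E_n$ now appear as the eigenvalues of this equation. Thus (109.23) becomes the regular basis for determining these levels.

† We assume that the external field is zero at infinity.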
‡ When radiative corrections are neglected, the $\Psi_n(\mathbf{r})$ are the same (for states with one electron or positron) as the wave functions $\psi_n^{(+)}$ or $\psi_n^{(-)}$ which are solutions of Dirac's equation.

For example, it can be used to determine the correction, in the first order with respect to $\mathcal{M}$, to the discrete electron energy level $\varepsilon_n$ given by solving Dirac's equation:

$$[\gamma^0\varepsilon_n + i\boldsymbol\gamma\cdot\nabla - e\gamma A^{(e)}(\mathbf{r})]\,\psi_n(\mathbf{r}) = 0;$$ (109.24)

let the wave function $\psi_n(\mathbf{r})$ be normalized by the condition

$$\int \psi_n^{*}\psi_n\,d^3x = 1.$$ (109.25)

The eigenfunction of equation (109.23) may be written

$$\Psi_n(\mathbf{r}) = \psi_n(\mathbf{r}) + \psi_n^{(1)}(\mathbf{r}),$$ (109.26)

where $\psi_n^{(1)}$ is the correction to $\psi_n(\mathbf{r})$. Substituting (109.26) in (109.23), multiplying on the left by $\bar\psi_n(\mathbf{r})$ and integrating† over $d^3x$, we get the required expression

$$E_n - \varepsilon_n \approx \iint \bar\psi_{ni}(\mathbf{r})\,\mathcal{M}_{ik}(\varepsilon_n,\mathbf{r},\mathbf{r}_1)\,\psi_{nk}(\mathbf{r}_1)\,d^3x\,d^3x_1.$$ (109.27)

§ 110. Physical conditions for renormalization

The theory discussed so far in this chapter has been largely formal. We have treated all quantities as if they were finite, and have deliberately passed over any infinities which occur in the theory. In the practical calculation of the functions $\mathcal{D}$, $\mathcal{G}$ and $\Gamma$ by perturbation theory, however, divergent integrals arise, which cannot be assigned any definite values without further consideration. These divergences are a manifestation of the logical incompleteness of the existing quantum electrodynamics. It will be seen below, nevertheless, that in this theory it is possible to establish certain rules which allow an unambiguous "subtraction of infinities", and thus to obtain finite values for all quantities which have a direct physical meaning. These rules are based on obvious physical requirements that the photon mass is zero and the electron charge and mass are equal to their observed values.
Let us first ascertain the conditions to be imposed on the photon propagator, and consider a scattering process which can occur through one-particle intermediate states having one virtual photon. The amplitude of such a process must have a pole when the square of the total 4-momentum $P$ of the initial particles is equal to the squared mass of the real photon, i.e. when $P^2 = 0$; we have seen in §79 that this requirement follows from the general condition of unitarity. The pole term in the amplitude arises from a diagram of the form (79.1):

[diagram: two parts joined by a single photon line with $k = P$] (110.1)

and when radiative corrections are taken into account the two parts of the diagram must be joined by a thick broken line (the exact photon propagator). This means that the function $\mathcal{D}(k^2)$ must have a pole at $k^2 = 0$, i.e. must be such that

$$\mathcal{D} \to 4\pi Z/k^2\quad\text{when } k^2\to 0,$$ (110.2)

$Z$ being a constant. Hence, for the polarization operator $\mathcal{P}(k^2)$, (103.21) gives

$$\mathcal{P}(0) = 0.$$ (110.3)

The coefficient in (110.2) is given by

$$\frac{1}{Z} = 1 - \left[\frac{\mathcal{P}(k^2)}{k^2}\right]_{k^2\to 0}.$$

† In the integration we use the fact that the differential operator in (109.24) is self-conjugate, and thus transfer its action from $\psi_n^{(1)}$ to $\bar\psi_n$.

Further restrictions on the function $\mathcal{P}(k^2)$ can be derived from an analysis of the physical definition of the particle's electric charge: two classical (i.e. infinitely heavy) particles at rest at a large distance apart must interact in accordance with Coulomb's law, $U = e^2/r$. (These are distances much greater than $1/m$, where $m$ is the electron mass.) This interaction can also be represented by the diagram

[diagram: exchange of a virtual photon $k$ between two heavy-particle lines, with external ends labelled a, b, c, d] (110.4)

in which the upper and lower lines correspond to classical particles. The photon self-energy corrections are taken into account in the virtual photon line.
All other corrections, affecting the heavy-particle lines, would make the diagram equal to zero: the addition of any further internal lines in the diagram (110.4), for example a photon line joining the lines a and c or a and b, would produce lines of virtual heavy particles, with corresponding propagators. But the propagator of a particle has its mass $M$ in the denominator, and tends to zero as $M \to \infty$.

The form of the diagram (110.4) makes it clear (cf. §83) that the factor $e^2\mathcal{D}(k^2)$ in it must be (apart from the sign) the Fourier transform of the particle interaction potential. Since the interaction is steady, the virtual-photon frequency $\omega = 0$, and large distances correspond to small wave vectors $\mathbf{k}$. The Fourier transform of the Coulomb potential is $4\pi e^2/\mathbf{k}^2$. Since $\mathcal{D}$ depends only on $k^2 = \omega^2 - \mathbf{k}^2$, we finally arrive at the condition

$$\mathcal{D} \to 4\pi/k^2\quad\text{when } k^2\to 0,$$ (110.5)

i.e. the coefficient in (110.2) must be $Z = 1$; the sign is obvious, since $\mathcal{D}(k^2)$ tends to the free-photon propagator $D(k^2)$. The polarization operator $\mathcal{P}(k^2)$ must therefore satisfy

$$\mathcal{P}(k^2)/k^2 \to 0\quad\text{when } k^2\to 0.$$ (110.6)

This leads not only to the condition (110.3) given previously but also to the result

$$\mathcal{P}'(0) = 0.$$ (110.7)

It has been noted in §103 that an effective external real-photon line corresponds to the factor (103.15) or, using (103.16) and (103.20),

[equation missing from the source text]

We now see from (110.5) and (110.6) that the correction term is zero. Thus we have the important result that radiative corrections need not be considered in external photon lines.

The natural physical requirements therefore lead to the establishment of definite values (namely zero) for the quantities $\mathcal{P}(0)$ and $\mathcal{P}'(0)$. The calculation of these quantities from the perturbation-theory diagrams would lead to divergent integrals, and we see that the way to eliminate such infinities is to assign fixed values a priori to the divergent expressions, these values being determined by physical requirements.
This procedure is called renormalization of the quantities concerned.†

The procedure can also be formulated in a somewhat different manner. For instance, in renormalizing the particle charge one can define a non-physical intrinsic (bare or unrenormalized) charge $e_c$ as a parameter which appears in the expression for the original electromagnetic interaction operator in formal perturbation theory. The renormalization condition then becomes $e_c^2\mathcal{D}(k^2) \to 4\pi e^2/k^2$ (when $k^2 \to 0$), $e$ being the actual physical charge. Hence we have the relation $e_c^2 Z = e^2$, which is used to eliminate the non-physical quantity $e_c$ from formulae which concern observable effects. By putting immediately $Z = 1$, the renormalization is effected "en route", and there is no need to use fictitious quantities even in the intermediate steps.

† The idea of this approach was first put forward by H. A. Kramers (1947); the systematic application of the renormalization method in quantum electrodynamics is due to Dyson, Tomonaga, Feynman, and Schwinger.

Let us now investigate the conditions for renormalization of the electron propagator. To do so, we now consider a scattering process which can take place through a one-particle intermediate state with one virtual electron. The amplitude of such a process must have a pole when the square of the total 4-momentum $P_1$ of the initial particles is equal to the squared mass of the real electron, i.e. when $P_1^2 = m^2$. The pole term in the amplitude arises from a diagram of the form

[diagram: two parts joined by a single electron line $p = P_1$] (110.8)

and when radiative corrections are taken into account the thick line is the exact electron propagator. This means that the function $\mathcal{G}(p)$ must have a pole at $p^2 = m^2$, i.e. its limiting form there must be

$$\mathcal{G}(p) \approx Z_1\,\frac{\gamma p + m}{p^2 - m^2 + i0} + g(p)\quad\text{when } p^2\to m^2,$$ (110.9)

$Z_1$ being a scalar constant and $g(p)$ remaining finite as $p^2 \to m^2$.
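The pole term in (110.9) is $Z_1/(\gamma p - m)$ written with a scalar denominator, because $(\gamma p - m)(\gamma p + m) = p^2 - m^2$. A minimal numerical check of this identity is sketched below. It assumes the standard (Dirac) representation of the $\gamma$-matrices and the metric signature $(+,-,-,-)$; the helper name `slash`, the off-shell test momentum `p` and the value of `Z1` are arbitrary illustrative choices, and the finite part $g(p)$ is left out.

```python
import numpy as np

# Pauli matrices and Dirac matrices in the standard representation (assumed here).
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
g0 = np.block([[I2, Z2], [Z2, -I2]])
gs = [np.block([[Z2, s], [-s, Z2]]) for s in (sx, sy, sz)]

def slash(p):
    """gamma p = gamma^0 p_0 - gamma . p for p = (p0, px, py, pz)."""
    return g0 * p[0] - sum(g * pi for g, pi in zip(gs, p[1:]))

m = 1.0
p = np.array([1.7, 0.3, -0.4, 0.5])        # arbitrary off-shell 4-momentum
p2 = p[0] ** 2 - p[1:] @ p[1:]             # p^2 with signature (+,-,-,-)
one = np.eye(4)

# (gamma p - m)(gamma p + m) = (p^2 - m^2) * unit matrix
identity_ok = np.allclose((slash(p) - m * one) @ (slash(p) + m * one),
                          (p2 - m ** 2) * one)

# Hence the inverse of the pole term alone is (gamma p - m)/Z1.
Z1 = 0.9                                    # hypothetical value
pole_term = Z1 * (slash(p) + m * one) / (p2 - m ** 2)
inverse_ok = np.allclose(np.linalg.inv(pole_term), (slash(p) - m * one) / Z1)
print(identity_ok, inverse_ok)
```

The second check shows why the leading term of the inverse propagator near the pole is proportional to $(\gamma p - m)/Z_1$.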
The matrix structure of the pole term in (110.9) (proportional to $\gamma p + m$) is a consequence of the same unitarity condition that causes the existence of the pole. We shall prove this statement and at the same time elucidate the important question of the renormalization conditions for the external electron lines.

If $\mathcal{G}(p)$ has the limiting form (110.9), the inverse matrix is

$$\mathcal{G}^{-1}(p) \approx \frac{1}{Z_1}(\gamma p - m) - (\gamma p - m)\,g\,(\gamma p - m)\quad\text{when } p^2\to m^2.$$ (110.10)

The mass operator is

$$\mathcal{M} = G^{-1} - \mathcal{G}^{-1} \approx \left(1 - \frac{1}{Z_1}\right)(\gamma p - m) + (\gamma p - m)\,g\,(\gamma p - m)\quad\text{when } p^2\to m^2.$$ (110.11)

The effective external (say incoming) electron line corresponds (cf. (103.15)) to a factor

$$\mathcal{U}(p) = u(p) + \mathcal{G}(p)\,\mathcal{M}(p)\,u(p),$$ (110.12)

where $u(p)$ is the ordinary amplitude of the electron wave function, which satisfies Dirac's equation $(\gamma p - m)u = 0$. Because of the requirements of relativistic invariance ($\mathcal{U}$, like $u$, is a bispinor), the limiting value of $\mathcal{U}(p)$ for $p^2 \to m^2$ can differ from that of $u(p)$ only by a constant scalar factor:

$$\mathcal{U}(p) = Z'u(p).$$ (110.13)

This factor $Z'$ is related in a definite manner to the factor $Z_1$, but the relation cannot be determined simply by substituting (110.10) and (110.11) in (110.12), because there is an indeterminacy; the result depends on the order in which the limits of the various factors in (110.12) are taken.

It is, however, possible to avoid the problem of the correct method of taking the limit, by using instead the unitarity condition for the reaction shown by the diagram (110.8). The unitarity relation generally applies to amplitudes of processes as a whole, not to individual diagrams. But when $p^2 \to m^2$ the pole diagram (110.8) gives the main contribution to the corresponding amplitude $M_{fi}$, so that the other diagrams which pertain to the same reaction can be ignored.
As has been shown in §79, the unitarity conditions require that a one-particle intermediate state should produce in the reaction amplitude an imaginary part with a delta function:

$$i\pi\,\delta(p^2 - m^2)\sum_{\text{polar.}} M_{fn}M_{in}^{*},$$ (110.14)

where the subscript $n$ refers to a state having one real electron, and the summation is over the latter's polarizations; to avoid additional complications we assume, as in §79, that both sides of the unitarity relation are symmetrized with respect to the helicities of the initial and final particles, so that $M_{fi} = M_{if}$. The amplitude $M_{fn}$ corresponds to a process represented by the diagram

[diagram]

and is

$$M_{fn} = (M'_{fn}\,\mathcal{U}) = Z'(M'_{fn}\,u),$$

where $M'_{fn}$ is a factor with one free bispinor index.† Similarly, the structure of the amplitude $M_{in}^{*}$ is

$$M_{in}^{*} = (\bar{\mathcal{U}}\,M'^{*}_{in}) = Z'(\bar u\,M'^{*}_{in}).$$

Summation over the polarizations of the electron replaces the product $(M'_{fn}u)(\bar u M'^{*}_{in})$ by $M'_{fn}(\gamma p + m)M'^{*}_{in}$, and so the term (110.14) in the amplitude $M_{fi}$ becomes

$$Z'^2\,i\pi\,\delta(p^2 - m^2)\,\{M'_{fn}(\gamma p + m)M'^{*}_{in}\}.$$

Using this term in the imaginary part, we can reconstruct the entire pole term in the scattering amplitude; from (79.5),

$$M_{fi} = -\frac{Z'^2\,\{M'_{fn}(\gamma p + m)M'^{*}_{in}\}}{p^2 - m^2 + i0}.$$

† One point should be clarified here. The electron, a stable particle, cannot really be transformed into an assembly of real particles, but we may formally take as the latter certain fictitious particles whose masses are such as to allow this transformation. The resulting relationship is then to be taken as an analytical continuation to real masses.

Calculation of the same amplitude directly from the diagram (110.8) gives

$$iM_{fi} = iM'_{fn}\cdot i\mathcal{G}(p)\cdot iM'^{*}_{in}.$$

A comparison of the two formulae confirms the limiting expression written above for $\mathcal{G}(p)$ (the first term in (110.9)), and shows that

$$Z' = \sqrt{Z_1}.$$ (110.15)

We shall now show that, when the limiting form of the electron propagator is known, there is no need to establish any further conditions for the vertex operator. Let us consider the diagram

[diagram] (110.16)
which represents the scattering of an electron in an external field $A^{(e)}(k)$, in the first order with respect to the field, and taking account of all radiative corrections. In the limit $k \to 0$, $p_2 \to p_1 \equiv p$, the self-energy corrections to the external-field line are zero (since they vanish for any $k^2 = 0$). Then the diagram corresponds to the amplitude

$$M_{fi} = -e\,\bar{\mathcal{U}}(p)\,\Gamma(p,p;0)\,\mathcal{U}(p)\cdot A^{(e)}(k\to 0),$$ (110.17)

i.e. the product of the potential $A^{(e)}$ and the electron transition current $\bar{\mathcal{U}}\Gamma\mathcal{U}$. But when $k \to 0$ the potential $A^{(e)}(x)$ reduces to a constant independent of coordinates and time. No physical field corresponds to this potential (a particular case of gauge invariance), which therefore can cause no change in the electron current. Thus, in the limit considered, the transition current $\bar{\mathcal{U}}\Gamma^\mu\mathcal{U}$ must be simply the free current $\bar u\gamma^\mu u$:

$$\bar{\mathcal{U}}(p)\,\Gamma^\mu(p,p;0)\,\mathcal{U}(p) = Z_1\,\bar u(p)\,\Gamma^\mu(p,p;0)\,u(p) = \bar u(p)\,\gamma^\mu u(p).$$ (110.18)

This is essentially also a definition of the physical charge on the electron. It is easily seen to be necessarily satisfied, whatever the value of $Z_1$: substituting $\mathcal{G}^{-1}(p)$ from (110.10) in Ward's identity (108.8), we find

$$\Gamma^\mu(p,p;0) = \frac{1}{Z_1}\gamma^\mu - \gamma^\mu g(p)(\gamma p - m) - (\gamma p - m)g(p)\gamma^\mu,$$

and (110.18) is satisfied, since $(\gamma p - m)u = 0$, $\bar u(\gamma p - m) = 0$.

We see that, when the amplitude of the physical process is calculated, the "renormalization constant" $Z_1$ disappears. Moreover, by using the indeterminacy that arises from divergences in the calculation of $\Gamma$, we can simply require that

$$\bar u(p)\,\Gamma^\mu(p,p;0)\,u(p) = \bar u(p)\,\gamma^\mu u(p)\quad\text{when } p^2 = m^2,$$ (110.19)

i.e. put $Z_1 = 1$. The convenience of this definition lies in the fact that there is no need to apply corrections to the external electron lines: we have simply $\mathcal{U}(p) = u(p)$. This can also be deduced directly by noticing that for $Z_1 = 1$ the mass operator (110.11) is

$$\mathcal{M} = (\gamma p - m)\,g\,(\gamma p - m)$$ (110.20)

and the second term in (110.12) obviously vanishes.
Thus there is no need to "renormalize" the external lines of any real particles, either photons or electrons.†

† In renormalizing the photon propagator, the condition $Z = 1$ arose as a necessary physical condition, after which the corrections to the external photon lines disappear automatically. Formally, however, the situation is similar for both photon and electron external lines: when $Z \ne 1$, the wave amplitude $e_\mu$ of a real photon, with corrections, would be multiplied by $\sqrt{Z}$.

§ 111. Analytical properties of photon propagators

It is convenient to begin the study of the analytical properties of the photon propagator with the function $\Pi(k^2)$. The reason is that the direct use of the definition (103.1) for this purpose is difficult because the operators $\hat A_\mu(x)$ are gauge-ambiguous and their properties are therefore indeterminate. The integral representation of the function $\Pi(k^2)$ (104.11) was derived from the expression for the photon self-energy function in terms of the matrix elements of the gauge-invariant current operator.

Denoting the variable $k^2$ by $t$,‡ let us consider the properties of the function $\Pi(t)$ in the complex $t$-plane. From the integral representation

$$\Pi(t) = \int_0^\infty \frac{\rho(t')\,dt'}{t - t' + i0}$$ (111.1)

we see that $\Pi(t)$ is real on the negative real axis, and elsewhere in the plane satisfies the symmetry relation

$$\Pi(t^{*}) = \Pi^{*}(t).$$ (111.2)

‡ Not to be confused with the symbol for the time.

The function $\Pi(t)$ can have singularities only at singular points of $\rho(t)$. These are at values of $t = k^2$ which are threshold values for the creation of various groups of real particles by a virtual photon. At these values, new types of intermediate states "come into play" in the sum (104.9). Their contribution is zero below the threshold, but not zero above the threshold, and this causes the singularity at the threshold itself. These threshold values are, of course, real and non-negative.† The singularities of $\Pi(t)$ therefore also lie on the positive real $t$-axis.
If a cut is made along this axis, the function $\Pi(t)$ is analytic throughout the cut plane.

The term $+i0$ in the denominator of the integrand in (111.1) shows that we must pass below the pole $t' = t$. This means that the value of $\Pi(t)$ for real $t$ must be taken as its value on the upper edge of the cut. Using the rule (75.18):

$$\frac{1}{x \pm i0} = P\frac{1}{x} \mp i\pi\,\delta(x),$$ (111.3)

we find that for real $t$

$$\operatorname{im}\Pi(t) = \operatorname{im}\Pi(t + i0) = -\pi\rho(t).$$ (111.4)

On the lower edge of the cut, $\operatorname{im}\Pi$ has the opposite sign; $\operatorname{re}\Pi$ is the same on both edges. Thus the discontinuity of $\Pi(t)$ at the cut is

$$\Pi(t + i0) - \Pi(t - i0) = -2\pi i\,\rho(t).$$ (111.5)

The integral representation (111.1) itself can in this way be regarded simply as Cauchy's formula for the analytic function $\Pi(t)$: applying this formula

$$\Pi(t) = \frac{1}{2\pi i}\oint_C \frac{\Pi(t')\,dt'}{t' - t}$$ (111.6)

to the contour

[diagram: contour $C$ consisting of a large circle and the two edges of the cut along the positive real axis] (111.7)

which passes around the cut, and assuming that $\Pi(t)$ decreases sufficiently rapidly at infinity, we find that the integral along the large circle is zero, and those along the edges of the cut give the following dispersion relation between $\Pi(t)$ and its imaginary part:

$$\Pi(t) = \frac{1}{\pi}\int_0^\infty \frac{\operatorname{im}\Pi(t' + i0)}{t' - t}\,dt' = \frac{1}{\pi}\int_0^\infty \frac{\operatorname{im}\Pi(t')}{t' - t}\,dt'.$$ (111.8)

Substitution of (111.4) then gives (111.1).†

† For example, the point $k^2 = 0$ is a threshold for the production of three (or a higher odd number of) real photons; $k^2 = 4m^2$ is a threshold for electron-positron pair production, and so on.

The analytical properties of the functions $\mathcal{P}(t)$ and $\mathcal{D}(t)$ are the same as those of $\Pi(t)$, in terms of which they are expressed by the simple formulae (104.2) and (103.21). For $\mathcal{D}(t)$ we have

$$\mathcal{D}(t) = \frac{4\pi}{t}\left(1 + \frac{\Pi(t)}{t}\right).$$ (111.9)

On the positive real $t$-axis, we must take $t$ as $t + i0$, as shown above. The imaginary part of $\mathcal{D}(t)$ can then be calculated from (111.3) and (111.4), bearing in mind that $\Pi(t)/t \to 0$ when $t \to 0$, by (110.6). We thus find

$$\operatorname{im}\mathcal{D}(t) = -4\pi^2\delta(t) + \frac{4\pi}{t^2}\operatorname{im}\Pi(t) = -4\pi^2\delta(t) - \frac{4\pi^2}{t^2}\rho(t).$$
(111.10)

Now, applying to $\mathcal{D}(t)$ a dispersion relation of the form (111.8), we obtain the integral representation

$$\mathcal{D}(t) = \frac{4\pi}{t + i0} + 4\pi\int_0^\infty \frac{\rho(t')\,dt'}{t'^2\,(t - t' + i0)},$$ (111.11)

called the Källén-Lehmann expansion (G. Källén, 1952; H. Lehmann, 1954).

† Dispersion relations were first used in quantum field theory by M. Gell-Mann, M. L. Goldberger and W. E. Thirring (1954).

There is a close relationship between the position of the cut for the function $\mathcal{D}(t)$ (and therefore its imaginary part on the cut) and the unitarity condition for the amplitude of the process $a + b \to c + d$ represented by the diagram (110.4); this reaction is, of course, a purely imagined one, but it does not violate the conservation laws, and the unitarity condition is formally valid for it. In the initial state $i$ of this process there are two "classical" particles $a$ and $b$, and in the final state there are two other such particles $c$ and $d$.

The unitarity condition is (71.2)‡

$$T_{fi} - T_{if}^{*} = i(2\pi)^4\sum_n T_{fn}T_{in}^{*}\,\delta^{(4)}(P_f - P_i);$$ (111.12)

the summation on the right is taken over all the physical "intermediate" states $n$. In the present case these states are evidently those of the systems of real pairs and photons which can be created by the virtual photon $k$, i.e. the states which occur in the matrix elements in the definition (104.9) of the function $\rho(k^2)$. The amplitudes $M_{fi}$ and $M_{if}^{*}$ include the factors $\mathcal{D}(k^2)$ and $\mathcal{D}^{*}(k^2)$ respectively, and their difference contains $\operatorname{im}\mathcal{D}(k^2)$. We see, therefore, that the relation given by (111.4) between an imaginary part of $\mathcal{D}$ and the existence of these intermediate states is a consequence of the necessary requirements of unitarity.

‡ The amplitudes $T_{fi}$ differ from the $M_{fi}$ only by the factors shown in (64.10).

We shall see later that it is convenient, in practical calculations of $\mathcal{D}(t)$ (or, equivalently, $\mathcal{P}(t)$) by perturbation theory, to begin by finding the imaginary part of $\mathcal{P}$, which does not involve divergent expressions.
But if $\mathcal{P}(t)$ is then calculated from a dispersion relation of the type (111.8), the integral diverges and further subtraction operations are necessary in order to satisfy the conditions $\mathcal{P}(0) = 0$ and $\mathcal{P}'(0) = 0$. This subtraction can, however, be effected without the explicit use of divergent integrals. To do so, we need only apply the dispersion relation (111.8) not to $\mathcal{P}(t)$ itself but to $\mathcal{P}(t)/t^2$. Then we have

$$\mathcal{P}(t) = \frac{t^2}{\pi}\int_0^\infty \frac{\operatorname{im}\mathcal{P}(t')}{t'^2\,(t' - t - i0)}\,dt'.$$ (111.13)

This integral is convergent, and the function $\mathcal{P}(t)$ thus obtained must necessarily satisfy the required conditions. A relationship such as (111.13) is called a "double-subtraction" dispersion relation.

The significance of the change to $\mathcal{P}(t)/t^2$ becomes especially clear if (111.13) is written in the form

$$\mathcal{P}(t) = \frac{1}{\pi}\int_0^\infty \frac{\operatorname{im}\mathcal{P}(t')}{t' - t - i0}\,dt' - \frac{1}{\pi}\int_0^\infty \frac{\operatorname{im}\mathcal{P}(t')}{t'}\,dt' - \frac{t}{\pi}\int_0^\infty \frac{\operatorname{im}\mathcal{P}(t')}{t'^2}\,dt'.$$

If the first ("non-regularized") integral is denoted by $\tilde{\mathcal{P}}(t)$, the right-hand side is $\tilde{\mathcal{P}}(t) - \tilde{\mathcal{P}}(0) - t\tilde{\mathcal{P}}'(0)$.

§ 112. Regularization of Feynman integrals

The physical conditions of renormalization discussed in §110 enable us, in principle, to derive a unique finite value for the amplitude of any electrodynamic process in any approximation of perturbation theory.

Let us first of all ascertain the nature of the divergences that occur in the integrals derived directly from Feynman diagrams. This is considerably facilitated by counting the powers of the virtual 4-momenta which appear in the integrands.

Let us consider a diagram of order $n$ (i.e. one containing $n$ vertices), with $N_e$ external electron lines and $N_\gamma$ external photon lines; $N_e$ is even, and the electron lines form $\tfrac12 N_e$ continuous sequences, each beginning and ending at a free end. The number of internal electron lines in each such sequence is one less than the number of vertices in it; the total number of internal electron lines in the diagram is therefore $n - \tfrac12 N_e$. One photon line comes to each vertex: at $N_\gamma$ vertices the photon line is external, and at the other $n - N_\gamma$ it is internal.
Since each internal photon line joins two vertices, the total number of such lines is $\tfrac12(n - N_\gamma)$.

Each internal photon line is associated with a factor $D(k)$, which contains $k$ to the power $-2$. Each internal electron line is associated with a factor $G(p)$, which contains $p$ to the power $-1$ (when $p^2 \gg m^2$). Thus the total power of the 4-momenta in the denominator of the diagram is $2n - \tfrac12 N_e - N_\gamma$.

The number of integrations (over $d^4p$ or $d^4k$) in the diagram is equal to the number of internal lines minus the number $(n - 1)$ of additional conditions imposed on the virtual momenta (of the $n$ conservation laws at the vertices, one relates the momenta at the free ends of the diagram). Multiplying by 4, we obtain the number of integrations over all the 4-momentum components, $2(n - N_e - N_\gamma + 2)$.

Lastly, the difference $r$ between the number of integrations and the power of the momenta in the denominator of the integrand is

$$r = 4 - \tfrac32 N_e - N_\gamma,$$ (112.1)

and is independent of the order $n$ of the diagram.

The condition $r < 0$ for the diagram as a whole is not in general sufficient for convergence of the integral: the corresponding numbers $r'$ for the internal sections which can be taken from the diagram must also be negative. The existence of sections having $r' \ge 0$ would make them divergent although the other integrals in the diagram would converge "with something to spare". The condition $r < 0$ is, however, sufficient for the convergence of the simplest diagrams, in which $n = N_e + N_\gamma$ and there is only one integration over $d^4p$.

If $r \ge 0$, the integral always diverges. The order of divergence is at least $r$ if $r$ is even, and at least $r - 1$ if $r$ is odd; the decrease by one in the latter case is due to the vanishing of the integrals over all 4-space of products of an odd number of 4-vectors. The order of divergence may be higher if there are internal sections with $r' > 0$.

Since $N_e$ and $N_\gamma$ are positive integers, we see from (112.1) that there exist only a few pairs of values of $N_e$ and $N_\gamma$ for which $r \ge 0$.
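The counting that leads to (112.1) can be reproduced mechanically. The sketch below assumes nothing beyond the rules stated above; the function names `superficial_degree` and `degree_from_counting` and the search ranges are illustrative choices. The first function evaluates (112.1) directly, the second repeats the line-by-line count (internal lines, integrations, powers in the denominator), and the list `divergent` collects every pair $(N_e, N_\gamma)$ with $r \ge 0$, including the two vacuum cases.

```python
from fractions import Fraction

def superficial_degree(n_e, n_gamma):
    """r = 4 - (3/2) N_e - N_gamma, eq. (112.1)."""
    return 4 - Fraction(3, 2) * n_e - n_gamma

def degree_from_counting(n, n_e, n_gamma):
    """Same quantity obtained by counting lines in a diagram with n vertices."""
    internal_electron = n - Fraction(n_e, 2)        # n - N_e/2 internal electron lines
    internal_photon = Fraction(n - n_gamma, 2)      # (n - N_gamma)/2 internal photon lines
    # each independent 4-momentum contributes 4 integrations
    integrations = 4 * (internal_electron + internal_photon - (n - 1))
    # G(p) ~ p^-1 and D(k) ~ k^-2 at large momenta
    denominator_power = internal_electron + 2 * internal_photon
    return integrations - denominator_power

# Enumerate pairs (N_e, N_gamma) with r >= 0; N_e must be even.
divergent = [(n_e, n_g, superficial_degree(n_e, n_g))
             for n_e in range(0, 8, 2)
             for n_g in range(0, 6)
             if superficial_degree(n_e, n_g) >= 0]

for n_e, n_g, r in divergent:
    print(f"N_e = {n_e}, N_gamma = {n_g}: r = {r}")
```

The explicit count agrees with (112.1) for every order $n$ tried, which is the statement that $r$ does not depend on $n$.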
The simplest diagrams of each such type may be enumerated, omitting the cases $N_e = N_\gamma = 0$ (vacuum loops) and $N_e = 0$, $N_\gamma = 1$ (mean value of the vacuum current), since they have no physical meaning and the corresponding diagrams must be rejected, as already shown in §103. The remaining cases are:

[diagrams (a)-(e)] (112.2)

In (a) the divergence is quadratic; in all the others ($r = 0$ and $r = 1$) it is logarithmic.

The diagram (112.2d) is the first correction to the vertex operator. It must satisfy the condition (110.19), which we shall here write as

$$\bar u(p)\,\Lambda^\mu(p,p;0)\,u(p) = 0\quad\text{when } p^2 = m^2,$$ (112.3)

with

$$\Lambda^\mu = \Gamma^\mu - \gamma^\mu.$$ (112.4)

Let $\bar\Lambda^\mu(p_2,p_1;k)$ be the Feynman integral as derived directly from the diagram. This integral is logarithmically divergent, and does not itself satisfy the condition (112.3), but we can obtain a quantity which does satisfy this condition by taking the difference

$$\Lambda^\mu(p_2,p_1;k) = \bar\Lambda^\mu(p_2,p_1;k) - [\bar\Lambda^\mu(p_1,p_1;0)]_{p_1^2 = m^2}.$$ (112.5)

The leading divergent term in $\bar\Lambda^\mu(p_2,p_1;k)$ is obtained by taking the virtual photon 4-momentum $f$ in the integrand to be arbitrarily large. It is†

$$-4\pi i e^2\int \frac{\gamma^\nu(\gamma f)\gamma^\mu(\gamma f)\gamma_\nu}{f^2\,f^2\,f^2}\,\frac{d^4f}{(2\pi)^4},$$

and is independent of the 4-momenta of the external lines. In the difference (112.5) the divergence therefore cancels, leaving a finite quantity. This operation of removing the divergence by subtraction is called regularization of the integral.

It must be emphasized that the integral $\bar\Lambda^\mu(p_2,p_1;k)$ can be regularized by one subtraction because here the divergence is only logarithmic, i.e. as weak as it can be. If the integral involved divergences of various orders, a single subtraction with $k = 0$ might be insufficient to eliminate all the divergent terms.

When the first correction in $\Gamma^\mu$ (i.e.
the first term in the expansion of $\Lambda^\mu$) has been determined, the first correction in the electron propagator (the diagram (112.2a)) can be calculated from Ward's identity (108.8), which may also be written in the form

$$ \partial\mathcal M(p)/\partial p_\mu = \Lambda^\mu(p, p; 0), \tag{112.6} $$

with the mass operator $\mathcal M$ in place of $\mathcal G^{-1}$ and $\Lambda^\mu$ in place of $\Gamma^\mu$. This equation is to be integrated with the boundary condition

$$ \bar u(p)\,\mathcal M(p)\,u(p) = 0 \quad\text{for } p^2 = m^2, \tag{112.7} $$

which follows from (110.20).

Finally, to calculate the first term in the expansion of the polarization operator, we use the identity (108.14), which after contraction with respect to two pairs of indices gives a relation […] between the scalar functions $\mathcal P = \tfrac13\mathcal P^\mu{}_\mu$ and $\mathcal S$, the correspondingly contracted four-ended part. Both these functions depend only on the scalar variable $k^2$, and so the relation becomes a differential equation (112.8) for $\mathcal P(k^2)$, with $\mathcal P''(k^2)$ and $\mathcal P'(k^2)$ on the left-hand side and $\mathcal S(k^2)$ on the right, the primes denoting differentiation with respect to $k^2$. With the condition $\mathcal P'(0) = 0$, this equation shows that

$$ \mathcal S(0) = 0. \tag{112.9} $$

In the first approximation of perturbation theory, $\mathcal S(k^2)$ is given by the diagram (112.2e), with the free ends having 4-momenta $k$, $k$, $0$, $0$. The corresponding Feynman integral $\bar{\mathcal S}(k^2)$ diverges logarithmically, and can be regularized by a single subtraction, using the condition (112.9):

$$ \mathcal S(k^2) = \bar{\mathcal S}(k^2) - \bar{\mathcal S}(0). $$

Then $\mathcal P(k^2)$ is found by solving equation (112.8) with the boundary conditions $\mathcal P(0) = 0$, $\mathcal P'(0) = 0$.

In the next approximation of perturbation theory, the correction to the vertex operator $\Lambda^{\mu(2)}$ is determined by the diagrams (106.10), (c)-(i). The irreducible diagrams (d)-(f) are calculated by a similar regularization of the integrals, using a single subtraction as in (112.5), in the same way as in calculating the first-approximation correction $\Lambda^{\mu(1)}$.

† The complete expression for the integral is given in (117.2).
In the reducible diagrams, the internal self-energy and vertex parts of lower order are immediately replaced by the already known (regularized) first-approximation quantities ($\mathcal P^{(1)}$, $\mathcal M^{(1)}$, $\Lambda^{\mu(1)}$), after which the integrals obtained are again regularized in accordance with (112.5).† The corrections $\mathcal P^{(2)}$ and $\mathcal M^{(2)}$ can then be calculated from (112.6) and (112.8).

This systematic procedure will in principle enable us to derive finite values for $\mathcal P$, $\mathcal M$ and $\Lambda^\mu$ in any approximation of perturbation theory. It thus becomes possible to calculate the amplitudes of physical scattering processes that are described by diagrams containing $\mathcal P$, $\mathcal M$ and $\Lambda^\mu$ as constituent sections. The physical conditions derived in §111 are therefore sufficient for an unambiguous regularization of all the Feynman diagrams occurring in the theory. This renormalizability is a far from trivial property of quantum electrodynamics.‡

In a practical calculation of radiative corrections, the above procedure may, however, not be the simplest and most rational. In Chapter XII we shall see, in particular, that a convenient treatment may start by calculating the imaginary parts of the corresponding quantities; these are given by integrals which do not involve divergences. The quantity itself is then found by analytical continuation, using the dispersion relations. It thus becomes possible to avoid the lengthy calculations which are needed in a direct regularization by subtractions.

† In diagrams for still higher approximations, it may be necessary to replace the four-ended sections $\mathcal S$ also by already regularized values.

‡ A different approach to renormalization theory in quantum electrodynamics is given by N. N. Bogoliubov and D. V. Shirkov, Introduction to the Theory of Quantized Fields, Wiley, New York, 1980.

CHAPTER XII

RADIATIVE CORRECTIONS

§113.
Calculation of the polarization operator

Let us now go on to the actual calculation of the radiative corrections, and begin with that of the polarization operator (J. Schwinger, 1949; R. P. Feynman, 1949). In the first approximation of perturbation theory, this is given by the loop in the diagram

[Diagram (113.1): a photon line containing a single electron loop, with crosses marking the two loop lines to be cut.]

As already mentioned, the problem becomes easier if we first calculate the imaginary part of the required function. This in turn is most conveniently done by using the unitarity relation. The virtual-photon lines are regarded as corresponding to a fictitious "real" particle, a vector boson with mass $M^2 = k^2$ which interacts with an electron in the same way as a photon does. Then (113.1) becomes the diagram of a "real" process, and the unitarity condition can be justifiably applied to it.

Thus we regard (113.1) as a diagram giving the amplitude of the transition of the boson into itself (a diagonal element of the S-matrix) via a decay into an electron-positron pair. The crosses in the diagram show where it is to be cut into two parts so as to show the intermediate state that figures in the application of the unitarity relation. This state contains an electron with 4-momentum $p_- \equiv p$ and a positron with $p_+ = -(p - k)$. The unitarity relation with a two-particle intermediate state (71.4), for coincident initial and final states, gives

$$ 2\operatorname{im} M_{ii} = \frac{|\mathbf p|}{(4\pi)^2\varepsilon}\sum_{\text{polar}}\int |M_{ni}|^2\,do. \tag{113.2} $$

Here the amplitude $M_{ii}$, from (113.1), is

$$ iM_{ii} = \sqrt{4\pi}\,e^*_\mu\cdot\sqrt{4\pi}\,e_\nu\,\frac{i\mathcal P^{\mu\nu}}{4\pi}, \tag{113.3} $$

where $e_\mu$ is the boson polarization 4-vector, which according to (14.13) satisfies the equation $e_\mu k^\mu = 0$. The amplitude $M_{ni}$ corresponds to the diagram for the decay of the boson into an electron-positron pair:

[Diagram: boson line with 4-momentum $k$ ending in outgoing lines $p_-$ and $p_+$.]

The corresponding expression is

$$ M_{ni} = -e\sqrt{4\pi}\,e_\mu j^\mu, \qquad j^\mu = \bar u(p_-)\gamma^\mu u(-p_+). \tag{113.4} $$

Substitution of (113.3) and (113.4) in (113.2) gives

$$ 2e^*_\mu e_\nu\operatorname{im}\mathcal P^{\mu\nu} = \frac{e^2}{4\pi}\frac{|\mathbf p|}{\varepsilon}\sum_{\text{polar}}\int e^*_\mu e_\nu\,j^{\mu*}j^\nu\,do. \tag{113.5} $$

Here $\mathbf p \equiv \mathbf p_- = -\mathbf p_+$ and $\varepsilon = \varepsilon_+ + \varepsilon_- = 2\varepsilon_+$ are the momenta and total energy of the pair in its centre-of-mass system; the integration is over the directions of $\mathbf p$, and the summation is over the polarizations of the two particles.

Let us now average both sides of (113.5) over the polarizations of the boson. This is done by means of the formula (14.15):

$$ \overline{e_\mu e^*_\nu} = -\frac13\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{k^2}\right). $$

Since the tensor $\mathcal P^{\mu\nu}$ and the vector $j^\mu$ are transverse ($\mathcal P^{\mu\nu}k_\nu = 0$, $j^\mu k_\mu = 0$), we have as the result

$$ 2\operatorname{im}\mathcal P = \frac{e^2}{4\pi}\frac{|\mathbf p|}{3\varepsilon}\sum_{\text{polar}}\int (jj^*)\,do, \tag{113.6} $$

where $\mathcal P = \tfrac13\mathcal P^\mu{}_\mu$. The summation over the polarizations is effected in the usual manner, the integration over $do$ reduces to a multiplication by $4\pi$, and so

$$ 2\operatorname{im}\mathcal P = e^2\frac{|\mathbf p|}{3\varepsilon}\operatorname{tr}\gamma^\mu(\gamma p_- + m)\gamma_\mu(\gamma p_+ - m) = -e^2\,\frac{8|\mathbf p|}{3\varepsilon}\,(p_+p_- + 2m^2). $$

In terms of the variable

$$ t = k^2 = (p_+ + p_-)^2 = 2(m^2 + p_+p_-), \tag{113.7} $$

we have $\varepsilon^2 = t$, $\mathbf p^2 = \tfrac14 t - m^2$, and hence finally

$$ \operatorname{im}\mathcal P(t) = -\frac{\alpha}{3}\sqrt{\frac{t - 4m^2}{t}}\,(t + 2m^2), \qquad t \ge 4m^2. \tag{113.8} $$

The value $t = 4m^2$ is the threshold value for the production of one electron-positron pair by a virtual photon (cf. the second footnote to §111); in the approximation considered ($\sim e^2$), the state with a single pair is the only one that can appear as an intermediate state in the unitarity condition (113.2). In this approximation, therefore, the right-hand side of (113.2) is zero for $t < 4m^2$, and hence

$$ \operatorname{im}\mathcal P(t) = 0, \qquad t < 4m^2. \tag{113.9} $$

For the same reason, in this approximation the cut in the complex $t$-plane for the function $\mathcal P(t)$ extends only from $t = 4m^2$ on the real axis, and this point must be the lower limit in the dispersion integral (111.13). Thus

$$ \mathcal P(t) = -\frac{\alpha t^2}{3\pi}\int_{4m^2}^{\infty}\sqrt{\frac{t' - 4m^2}{t'}}\;\frac{t' + 2m^2}{t'^2\,(t' - t - i0)}\,dt'. \tag{113.10} $$

It is convenient for the expression of the result to replace $t$ by a variable $\xi$, defined as follows:

$$ t/m^2 = -(1 - \xi)^2/\xi. \tag{113.11} $$

This transformation maps the upper half-plane of $t$ on a semicircle of unit radius in the upper half-plane of $\xi$, as shown in Fig.
19, where corresponding line segments in the two planes are indicated by similar markings. The semicircle $\xi = e^{i\phi}$, $0 \le \phi \le \pi$, corresponds to the non-physical region $0 \le t/m^2 \le 4$. The right-hand and left-hand radii on the real axis correspond to the physical regions $t < 0$ and $t/m^2 > 4$.

[FIG. 19: the complex $t$-plane with the branch point at $4m^2$, and its image, the unit semicircle in the upper half-plane of $\xi$.]

The integral in (113.10) is most simply calculated by means of the substitution

$$ t' = \frac{4m^2}{1 - \eta^2}, $$

and we first take the case $t < 0$ (so that the denominator does not vanish in the range of integration and the imaginary term $i0$ may be omitted). The result of the integration, in terms of the variable $\xi$, is

$$ \mathcal P(t) = \frac{\alpha m^2}{3\pi}\left\{-\frac{22}{3} + \frac53\left(\xi + \frac1\xi\right) + \left(\xi + \frac1\xi - 4\right)\frac{1 + \xi}{1 - \xi}\log\xi\right\}. \tag{113.12} $$

The analytical continuation of this formula gives the function $\mathcal P(t)$ in the range $t > 4m^2$ also; this is done by putting $\xi = |\xi|e^{i\pi}$ (and the logarithm gives a contribution to the imaginary part: $\log\xi = \log|\xi| + i\pi$).† For the non-physical region, we must put $\xi = e^{i\phi}$, and then

$$ \mathcal P(t) = \frac{2\alpha m^2}{3\pi}\left\{-\frac{10}{3}\sin^2\tfrac12\phi - 2 + \left(1 + 2\sin^2\tfrac12\phi\right)\phi\cot\tfrac12\phi\right\}, \qquad t/4m^2 = \sin^2\tfrac12\phi. \tag{113.13} $$

In the limit of small $|t|$ (i.e. $\xi \to 1$), these formulae become

$$ \mathcal P(t) = -\frac{\alpha}{15\pi}\frac{t^2}{m^2}, \qquad |t| \ll 4m^2. \tag{113.14} $$

In the opposite case of large $|t|$ (i.e. $\xi \to 0$), we have

$$ \mathcal P(t) = -\frac{\alpha}{3\pi}|t|\log\frac{|t|}{m^2}, \quad -t \gg 4m^2; \qquad \mathcal P(t) = \frac{\alpha}{3\pi}\,t\left(\log\frac{t}{m^2} - i\pi\right), \quad t \gg 4m^2. \tag{113.15} $$

In accordance with the significance of perturbation theory, these formulae are valid if $\mathcal P/4\pi \ll D^{-1} = t/4\pi$. The condition for (113.15) to be applicable is thus

$$ \frac{\alpha}{3\pi}\log\frac{|t|}{m^2} \ll 1. \tag{113.16} $$

The radiative corrections which involve $\alpha\log(|t|/m^2)$ are called logarithmic corrections.

† The analytical continuation thus obtained is, as it should be, a continuation on the upper edge of the cut, since the semicircle in the $\xi$-plane corresponds to the upper half-plane of $t$.

§114. Radiative corrections to Coulomb's law

Let us apply the formulae derived above to the problem of the radiative corrections to Coulomb's law. These corrections may be intuitively described as resulting from the polarization of the vacuum around a point charge.
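As a numerical cross-check on the §113 formulae that are about to be applied, the dispersion integral (113.10) can be compared with the closed form (113.12) for $t < 0$. The sketch below is illustrative only: the function names are hypothetical, units with $m = 1$ and $\alpha = 1$ are assumed by default, and the substitution $t' = 4m^2/(1 - w^2)$ is chosen here simply because it makes the integrand smooth for Simpson's rule.

```python
import math


def simpson(f, a, b, n):
    """Composite Simpson rule with n (made even) sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def closed_form(t, m=1.0, alpha=1.0):
    """P(t) from (113.12) for real t < 0, where 0 < xi < 1."""
    tau = -t / m**2
    # Root of t/m^2 = -(1 - xi)^2 / xi that lies inside the unit circle.
    xi = ((2 + tau) - math.sqrt((2 + tau) ** 2 - 4)) / 2
    s = xi + 1 / xi
    braces = -22 / 3 + 5 / 3 * s + (s - 4) * (1 + xi) / (1 - xi) * math.log(xi)
    return alpha * m**2 / (3 * math.pi) * braces


def dispersion_integral(t, m=1.0, alpha=1.0, steps=2000):
    """P(t) from (113.10) for t < 0, after t' = 4 m^2 / (1 - w^2), 0 <= w <= 1."""
    a = t / (4 * m**2)

    def integrand(w):
        return 2 * w**2 * (3 - w**2) / (1 - a * (1 - w**2))

    return -alpha * t**2 / (24 * math.pi * m**2) * simpson(integrand, 0.0, 1.0, steps)


if __name__ == "__main__":
    for t in (-0.5, -3.0, -50.0):
        print(t, closed_form(t), dispersion_integral(t))
```

For small $|t|$ both functions approach the limiting form (113.14), which gives a further independent check.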
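If corrections are neglected, the field of a fixed centre (with charge $e_1$) is given by the Coulomb scalar potential $\Phi \equiv A^{(e)}_0 = e_1/r$. The components of its three-dimensional Fourier expansion are $\Phi(\mathbf k) \equiv A^{(e)}_0(\mathbf k) = 4\pi e_1/\mathbf k^2$. When the radiative corrections are included, this field is replaced by the "effective field"

$$ \mathcal A^{(e)}_\mu = A^{(e)}_\mu + \mathcal D_{\mu\nu}\frac{\mathcal P^{\nu\lambda}}{4\pi}A^{(e)}_\lambda = A^{(e)}_\mu + \frac{1}{4\pi}\mathcal P(k^2)\,\mathcal D(k^2)\,A^{(e)}_\mu; \tag{114.1} $$

cf. (103.15). The second term gives the required change in the scalar potential. In the first approximation of perturbation theory for $\mathcal P(k^2)$, we must take the expression derived in §113 and replace $\mathcal D(k^2)$ by the zero-order approximation: $\mathcal D(k^2) \approx D(k^2) = -4\pi/\mathbf k^2$. Thus the radiative correction to the field potential is

$$ \delta\Phi(\mathbf k) = -\frac{4\pi e_1}{(\mathbf k^2)^2}\,\mathcal P(-\mathbf k^2). \tag{114.2} $$

To determine the form of this correction in the coordinate representation, we must take the inverse Fourier transform:

$$ \delta\Phi(r) = \int e^{i\mathbf k\cdot\mathbf r}\,\delta\Phi(\mathbf k)\,\frac{d^3k}{(2\pi)^3}. \tag{114.3} $$

Since $\delta\Phi(\mathbf k)$ is a function of $t = -\mathbf k^2$ only, integration over angles gives

$$ \delta\Phi(r) = \frac{1}{2\pi^2 r}\int_0^\infty \delta\Phi(-y^2)\sin(yr)\,y\,dy = \frac{1}{4\pi^2 r}\operatorname{im}\int_{-\infty}^{\infty}\delta\Phi(-y^2)\,e^{iyr}\,y\,dy; $$

in the last transformation, we use the fact that the integrand is an even function of $y = \sqrt{-t}$. The contour of integration can now be moved into the upper half-plane of $y$, and made to coincide with the cut of the function $\mathcal P(-y^2)$ (Fig. 20). This cut extends upwards along the imaginary axis from the point $2im$, the physical sheet corresponding to the left side of the cut.

[FIG. 20: the complex $y$-plane, with the cut running up the imaginary axis from $2im$ and the contour folded around it.]

Replacing $y$ by a new variable, $y = ix$, we get

$$ \delta\Phi(r) = \frac{1}{2\pi^2 r}\int_{2m}^{\infty}\operatorname{im}\delta\Phi(x^2)\,e^{-xr}\,x\,dx. $$

Finally, on returning to the integration over $t = x^2$, we have

$$ \delta\Phi(r) = \frac{1}{4\pi^2 r}\int_{4m^2}^{\infty}\operatorname{im}\delta\Phi(t)\,e^{-r\sqrt t}\,dt. \tag{114.4} $$

The imaginary part

$$ \operatorname{im}\delta\Phi(t) = -\frac{4\pi e_1}{t^2}\operatorname{im}\mathcal P(t) $$

is taken from (113.8), and after an obvious change of variable ($t = 4m^2\zeta^2$) we have

$$ \Phi(r) = \frac{e_1}{r} + \delta\Phi(r) = \frac{e_1}{r}\left\{1 + \frac{2\alpha}{3\pi}\int_1^\infty e^{-2mr\zeta}\left(1 + \frac{1}{2\zeta^2}\right)\frac{\sqrt{\zeta^2 - 1}}{\zeta^2}\,d\zeta\right\} \tag{114.5} $$

(E. A. Uehling and R. Serber, 1935).

The integral can be evaluated in two limiting cases.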
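Before the two limits are worked out analytically, the integral in (114.5) can simply be evaluated numerically and compared with the limiting forms (114.6) and (114.7) obtained in the text. This is a rough sketch under stated assumptions: the names are hypothetical, $x = mr$ is the only parameter, the substitution $\zeta = \cosh u$ is a choice made here to remove the square-root endpoint behaviour, and the factor $e^{-2x}$ is pulled out to avoid underflow at large $x$.

```python
import math

EULER_C = 0.5772156649015329  # Euler's constant C


def simpson(f, a, b, n):
    """Composite Simpson rule with n (made even) sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def scaled_uehling_integral(x, steps=20000):
    """exp(2x) times the zeta-integral in (114.5), with x = m r.

    With zeta = cosh(u) the integrand becomes
    exp(-2x(cosh u - 1)) * (1 + 1/(2 cosh^2 u)) * tanh^2 u.
    """
    def integrand(u):
        ch = math.cosh(u)
        return math.exp(-2 * x * (ch - 1)) * (1 + 0.5 / ch**2) * math.tanh(u) ** 2

    # Beyond this point the exponential factor is below exp(-50).
    u_max = math.acosh(1 + 25 / x)
    return simpson(integrand, 0.0, u_max, steps)


def small_r_limit(x):
    """Bracket of (114.6): log(1/mr) - C - 5/6."""
    return math.log(1 / x) - EULER_C - 5 / 6


def large_r_limit_scaled(x):
    """exp(2x) times the large-r integral: 3 sqrt(pi) / (8 x^(3/2))."""
    return 3 * math.sqrt(math.pi) / (8 * x**1.5)


if __name__ == "__main__":
    x = 1e-3
    print("small r:", math.exp(-2 * x) * scaled_uehling_integral(x), small_r_limit(x))
    x = 50.0
    print("large r:", scaled_uehling_integral(x), large_r_limit_scaled(x))
```

The agreement at large $x$ is only asymptotic: the leading form in (114.7) is approached with relative corrections of order $1/mr$.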
Let us first take that of small $r$ ($mr \ll 1$), and divide the integral of the first term in the parenthesis into two parts:

$$ I = \int_1^\infty e^{-2mr\zeta}\,\frac{\sqrt{\zeta^2 - 1}}{\zeta^2}\,d\zeta = \int_1^{\zeta_1}\ldots + \int_{\zeta_1}^{\infty}\ldots \equiv I_1 + I_2, $$

with $\zeta_1$ chosen so that $1/mr \gg \zeta_1 \gg 1$. Consequently, we can take $r = 0$ in the first integral, so that

$$ I_1 \approx \int_1^{\zeta_1}\frac{\sqrt{\zeta^2 - 1}}{\zeta^2}\,d\zeta \approx \log 2\zeta_1 - 1. $$

In $I_2$, on the other hand, unity may be omitted from $\zeta^2 - 1$:

$$ I_2 \approx \int_{\zeta_1}^{\infty} e^{-2mr\zeta}\,\frac{d\zeta}{\zeta} = -\log\zeta_1\cdot e^{-2mr\zeta_1} + 2mr\int_{\zeta_1}^{\infty} e^{-2mr\zeta}\log\zeta\,d\zeta. $$

In the exponential and the lower limit, it is permissible to put $\zeta_1 = 0$. Then, with the change of variable $2mr\zeta = x$, we have

$$ I_2 = -\log 2\zeta_1 + \log\frac{1}{mr} + \int_0^\infty e^{-x}\log x\,dx = -\log 2\zeta_1 + \log\frac{1}{mr} - C, $$

where $C = 0.577\ldots$ is Euler's constant. In the integral of the second term in (114.5) we can immediately put $r = 0$:

$$ \frac12\int_1^\infty\frac{\sqrt{\zeta^2 - 1}}{\zeta^4}\,d\zeta = \frac16. $$

When the three integrals are added, $\zeta_1$ disappears, leaving

$$ \Phi(r) = \frac{e_1}{r}\left[1 + \frac{2\alpha}{3\pi}\left(\log\frac{1}{mr} - C - \frac56\right)\right], \qquad r \ll 1/m. \tag{114.6} $$

When $mr \gg 1$, the range $\zeta - 1 \sim 1/mr \ll 1$ is important in the integral. The change of variable $\zeta = 1 + \xi$ and appropriate transformations reduce it to

$$ e^{-2mr}\int_0^\infty e^{-2mr\xi}\,\tfrac32\sqrt{2\xi}\,d\xi = \frac{3\sqrt\pi}{8(mr)^{3/2}}\,e^{-2mr}. $$

In this case, therefore,†

$$ \Phi(r) = \frac{e_1}{r}\left(1 + \frac{\alpha}{4\sqrt\pi}\,\frac{e^{-2mr}}{(mr)^{3/2}}\right), \qquad r \gg 1/m. \tag{114.7} $$

We see that the polarization of the vacuum alters the Coulomb field of a point charge in a region $r \sim 1/m$ ($= \hbar/mc$), where $m$ is the electron mass. Outside this region, the change in the field decreases exponentially.

† The origin of the factor $e^{-2mr}$ in $\delta\Phi(r)$ is evident from the form of the initial integral (114.4): when $r$ is large, the important values of $t$ are those near the lower limit. Thus the exponent is determined by the position of the first singularity of the function $\delta\Phi(t)$.

One further general comment may be made. Hitherto we have implicitly assumed that the radiative corrections arise from the interaction between the photon field and the electron-positron field.
Thus, by associating the internal closed loops in the photon self-energy diagrams with the electrons, we have taken into account the interaction of the photon with the "electron vacuum". But the photon also interacts with the fields of other particles; the interaction with the "vacua" of these fields is described by similar self-energy diagrams, in which the internal loops are associated with the appropriate particles. The contributions of these diagrams differ in order of magnitude from those of the electron diagrams by several factors of $m_e/m$, where $m$ is the mass of the particle concerned and $m_e$ the electron mass.

The particles whose mass is closest to that of the electron are the muons and the pions. The numerical ratios $m_e/m_\mu$ and $m_e/m_\pi$ are close to $\alpha$. The radiative corrections from these particles would therefore have to be taken into account together with the electron corrections of higher orders. For muons, the radiative corrections can in principle be calculated by means of the existing theory, but for pions (which are strongly interacting particles) they cannot. This places a fundamental limitation on exact calculations of specific effects in present-day quantum electrodynamics. The use of arbitrarily high-order corrections from the photon-electron interaction alone would be an invalid exaggeration of the attainable accuracy.

The radiative corrections to Coulomb's law discussed in this section are, as we have seen, valid even at distances $r \ll 1/m_e$. We can now add that the formulae obtained cease to be valid at distances $r \lesssim 1/m_\mu$ (or $1/m_\pi$), where polarization of the vacua of other particles becomes significant.

§115. Calculation of the imaginary part of the polarization operator from the Feynman integral

In a direct calculation from the diagram (the loop in (113.1)), the polarization operator in the first approximation of perturbation theory would be given by the integral

$$ \frac{i\mathcal P^{\mu\nu}}{4\pi} = -e^2\int\operatorname{tr}\gamma^\mu G(p)\gamma^\nu G(p - k)\,\frac{d^4p}{(2\pi)^4}. \tag{115.1} $$

This integral, however, taken over all four-dimensional $p$-space, is quadratically divergent, and in order to obtain a finite result it would be necessary to regularize the integral by the procedure described in §112. Here, we shall not give the complete derivation, but show how one can use the integral (115.1) to calculate the imaginary part of the polarization operator (which has been determined in §113 by means of the unitarity condition); this derivation includes a number of instructive points.

The imaginary part of the integral (115.1) does not diverge and therefore does not need regularization. For the scalar function $\operatorname{im}\mathcal P = \tfrac13\operatorname{im}\mathcal P^\mu{}_\mu$ we have

$$ \operatorname{im}\mathcal P = \operatorname{im}\frac{4\pi i e^2}{3}\int\frac{\operatorname{tr}\gamma^\mu(\gamma p + m)\gamma_\mu(\gamma p - \gamma k + m)}{(p^2 - m^2 + i0)[(p - k)^2 - m^2 + i0]}\,\frac{d^4p}{(2\pi)^4}. $$

After calculating the trace, we get

$$ \operatorname{im}\mathcal P(k^2) = \operatorname{im}\int\frac{i\phi(p)\,d^4p}{(p^2 - m^2 + i0)[(p - k)^2 - m^2 + i0]}, \qquad \phi(p) = \frac{2e^2}{3\pi^3}\,(2m^2 + pk - p^2). \tag{115.2} $$

Let $k^2 > 0$. We shall use a frame of reference in which $k = (k_0, 0)$, and $(p - k)^2 = (p_0 - k_0)^2 - \mathbf p^2$. Using also the notation $\varepsilon = \sqrt{\mathbf p^2 + m^2}$ (this is not the "energy" of the virtual electron $p_0$), we can write (115.2) in the form

$$ \operatorname{im}\mathcal P(k^2) = \operatorname{im}\int d^3p\int dp_0\,\frac{i\phi(p_0, \mathbf p)}{(p_0^2 - \varepsilon^2 + i0)[(p_0 - k_0)^2 - \varepsilon^2 + i0]}, \qquad \phi(p_0, \mathbf p) = \frac{2e^2}{3\pi^3}\,(m^2 + \varepsilon^2 + p_0k_0 - p_0^2). \tag{115.3} $$

The integrand has poles at four values of $p_0$:

(a) $p_0 = \varepsilon - i0$, (b) $p_0 = k_0 - \varepsilon + i0$,
(a') $p_0 = -\varepsilon + i0$, (b') $p_0 = k_0 + \varepsilon - i0$.

Figure 21 shows the configuration of these poles; we shall take the specific case $k_0 > 0$, but the final answer depends only on $k_0^2$, and not on the sign of $k_0$.

[FIG. 21: the complex $p_0$-plane with the four poles a, b, a', b' and the path of integration.]

We can calculate the discontinuity of the function $\mathcal P(t)$ at the cut in the complex plane of $t = k^2 = k_0^2$ or, equivalently, on the real axis in the $k_0$-plane. The real part of $\mathcal P(t)$ is continuous at the cut, and the discontinuity is therefore

$$ \Delta\mathcal P(t) = 2i\operatorname{im}\mathcal P(t). \tag{115.4} $$

We shall first show how the position of the cut may be established from the form of the integral.
Let $I(\mathbf p, k_0)$ denote the inner integral (over $dp_0$) in (115.3). So long as the upper and lower poles in Fig. 21 are at finite distances apart, the path of integration over $p_0$ can be taken far from the poles, as shown by the broken line. It is therefore evident that in this case the integral $I(\mathbf p, k_0)$ is unaltered by an infinitesimal upward or downward movement of the poles b and b' respectively away from the real axis, i.e. by the change $k_0 \to k_0 \pm i\delta$, $\delta \to 0$. Thus, where $k_0$ tends to its real value from above or from below, the value of $I(\mathbf p, k_0)$ is the same, and therefore makes no contribution to the discontinuity $\Delta\mathcal P$.

The situation is different only if two poles (which may be a and b when $k_0 > 0$) are exactly one beneath the other, so that the contour of integration is "trapped" between them and cannot be moved to a great distance. Thus the discontinuity $\Delta\mathcal P \ne 0$ only if the condition $k_0 - \varepsilon = \varepsilon$, i.e. $k_0 = 2\varepsilon = 2\sqrt{\mathbf p^2 + m^2}$, can be satisfied somewhere in the region of integration over $d^3p$. For this to be so, we must evidently have $k_0 \ge 2m$, i.e. $t \ge 4m^2$.†

The integral $I(\mathbf p, k_0)$ can be written in the form

$$ I(\mathbf p, k_0) = \int_C\frac{i\phi(p_0, \mathbf p)\,dp_0}{(p_0^2 - \varepsilon^2)[(p_0 - k_0)^2 - \varepsilon^2]}, \tag{115.5} $$

with the terms $i0$ omitted from the denominator and the integration contour $C$ correspondingly modified, as shown in Fig. 22. We see that the discontinuity $\Delta\mathcal P(t)$ is due to the impossibility of bringing the contour away from the pole a when it is trapped between a and b. The contour $C$ is therefore replaced by $C'$, which passes beneath a, and the integral around a small circle $C''$ centred at a is added. Then $C'$ can always be moved away from the poles without any difficulty, and the integration along it therefore contributes only to the regular part of $\mathcal P(t)$.

[FIG. 22: the contour $C$ in the $p_0$-plane, its replacement $C'$ passing beneath the pole a, and the small circle $C''$ around a.]

To determine the required discontinuity, we need only consider the integral along the circle $C''$, which is done by calculating the residue at the pole a.
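The trapping condition just described is easy to make concrete. The short sketch below (hypothetical helper names, units with $\hbar = c = 1$) lists the real parts of the four poles of the $p_0$ integrand for given $k_0$ and $\varepsilon$, and returns the value of $|\mathbf p|$ at which the poles a and b sit one above the other; no such value exists below the threshold $k_0 = 2m$.

```python
import math


def pole_real_parts(k0, eps):
    """Real parts of the four poles of the p0 integrand in (115.3)."""
    return {
        "a": eps,              # lies just below the real axis
        "b": k0 - eps,         # lies just above the real axis
        "a_prime": -eps,       # lies just above the real axis
        "b_prime": k0 + eps,   # lies just below the real axis
    }


def pinch_momentum(k0, m):
    """|p| at which poles a and b are vertically aligned (k0 = 2 eps), else None."""
    if k0 < 2 * m:
        return None  # below threshold: the contour is never trapped
    return math.sqrt(k0 * k0 / 4 - m * m)


if __name__ == "__main__":
    m = 1.0
    for k0 in (1.5, 2.0, 3.0):
        p = pinch_momentum(k0, m)
        if p is None:
            print(f"k0={k0}: no pinch, t={k0 * k0} < 4 m^2")
        else:
            eps = math.sqrt(p * p + m * m)
            print(f"k0={k0}: pinch at |p|={p:.4f}", pole_real_parts(k0, eps))
```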
This calculation can be made by substituting in the integrand

$$ \frac{1}{p_0^2 - \varepsilon^2} \to -2\pi i\,\delta(p_0^2 - \varepsilon^2); \tag{115.6} $$

the minus sign is used because the circle around the pole is traversed in the negative direction. In the argument of the delta function, only the zero $p_0 = +\varepsilon$ is to be used (the pole a, not a'); this is automatically done if we agree to integrate over only half of momentum 4-space ($p_0 > 0$).

After the substitution (115.6), the discontinuity of the integral $I(\mathbf p, k_0)$ can be calculated immediately:

$$ \Delta I = \{I(\mathbf p, k_0 + i\delta) - I(\mathbf p, k_0 - i\delta)\}_{\delta\to+0} = -2\pi i\int_0^\infty\delta(p_0^2 - \varepsilon^2)\,i\phi(p_0, \mathbf p)\left[\frac{1}{(k_0 - p_0)^2 - \varepsilon^2 + i\delta} - \frac{1}{(k_0 - p_0)^2 - \varepsilon^2 - i\delta}\right]dp_0. $$

With the equation

$$ \frac{1}{(k_0 - p_0)^2 - \varepsilon^2 \pm i\delta} = P\,\frac{1}{(k_0 - p_0)^2 - \varepsilon^2} \mp i\pi\,\delta[(k_0 - p_0)^2 - \varepsilon^2] $$

(see (111.3)), we have

$$ \Delta I = i(2\pi i)^2\int_0^\infty\delta(p_0^2 - \varepsilon^2)\,\delta[(k_0 - p_0)^2 - \varepsilon^2]\,\phi(p_0, \mathbf p)\,dp_0. $$

The arguments of the delta functions can be rewritten in an invariant form by adding and subtracting $\mathbf p^2$:

$$ p_0^2 - \varepsilon^2 = p^2 - m^2, \qquad (k_0 - p_0)^2 - \varepsilon^2 = (k - p)^2 - m^2. $$

We then obtain finally

$$ \Delta\mathcal P(k^2) = i(2\pi i)^2\int_{p_0 > 0} d^4p\;\phi(p)\,\delta(p^2 - m^2)\,\delta[(p - k)^2 - m^2]. \tag{115.7} $$

Because of the delta functions, the integration is actually taken only over the region of intersection of the hypersurfaces

$$ p^2 = m^2, \qquad (p - k)^2 = m^2. \tag{115.8} $$

Since in this region all the 4-vectors $p$ are time-like, the condition of integration over $p_0 > 0$ is invariant (the upper interior region of the cone $p^2 = m^2$).

† It can be shown similarly that there is no cut when $t = k^2 < 0$. Taking in this case a frame of reference where $k = (0, \mathbf k)$, we find that the poles of the integrand are at $p_0 = \pm(\varepsilon - i0)$, $p_0 = \pm(\sqrt{(\mathbf p - \mathbf k)^2 + m^2} - i0)$. The two lower poles are always in the right half-plane of $p_0$, and the two upper ones in the left half-plane, so that no pair can be vertically aligned.

Let us compare (115.7) with the original formula (115.2).
We see that the discontinuity of the function $\mathcal P(t)$ at the cut in the $t$-plane can be found by applying in the original Feynman integral the substitution

$$ \frac{1}{p^2 - m^2 + i0} \to -2\pi i\,\delta(p^2 - m^2) \tag{115.9} $$

in the propagators which correspond to the loop lines intersected in the diagram (113.1) (S. Mandelstam, 1958; R. E. Cutkosky, 1960).

The conditions (115.8) select the region of momentum space in which the lines of virtual particles in the diagram correspond to real particles (the 4-momenta $p$ and $p - k$ are then said to lie on the mass surface). Here we see clearly the relationship to the unitarity-relation method, where these lines are replaced by lines of intermediate-state real particles. We also observe the mathematical reason for the absence of divergence in the imaginary part of the diagram: the integration is over a finite region of the mass surface, not over the whole of infinite momentum 4-space as in the original Feynman integral.

In order to derive from (115.7) the formula obtained in §113, we return to the frame of reference in which $\mathbf k = 0$ and integrate over $d^4p \to |\mathbf p|\,\varepsilon\,d\varepsilon\,dp_0\,do$. The integration amounts to removing the delta functions, with

$$ \delta(p^2 - m^2)\,dp_0 = \delta(p_0^2 - \varepsilon^2)\,dp_0 \to \frac{1}{2\varepsilon}\,\delta(p_0 - \varepsilon)\,dp_0, $$

and then

$$ \delta[(p - k)^2 - m^2]\,d\varepsilon = \delta[(p_0 - k_0)^2 - \varepsilon^2]\,d\varepsilon = \delta(-2\varepsilon k_0 + k_0^2)\,d\varepsilon \to \frac{1}{2k_0}\,\delta(\varepsilon - \tfrac12 k_0)\,d\varepsilon. $$

The result is

$$ \Delta\mathcal P(t) = -\tfrac12 i\pi^2\sqrt{\frac{t - 4m^2}{t}}\int\phi(\varepsilon, \mathbf p)\,do, \tag{115.10} $$

where $t = k^2 = k_0^2$; the value of $\phi$ is taken for $p_0 = \varepsilon = \tfrac12 k_0$, $\mathbf p^2 = \varepsilon^2 - m^2 = \tfrac14 k_0^2 - m^2$, i.e. it is

$$ \phi(\varepsilon, \mathbf p) = \frac{e^2}{3\pi^3}\,(2m^2 + t) $$

and is independent of the angle. The integration over $do$ reduces to multiplication by $4\pi$, and we come back to (113.8).

The only vital point in the foregoing derivation is that the diagram is divided into two parts by cutting no more than two lines. The rule stated therefore remains valid for diagrams comprising any two sections joined by two (electron or photon) lines.
The integral calculated by the substitution (115.9) then gives the contribution to the imaginary part of the diagram that arises (in the unitarity-relation method) from the corresponding two-particle intermediate state.

§116. Electromagnetic form factors of the electron

Let us consider the vertex operator $\Gamma^\mu = \Gamma^\mu(p_2, p_1; k)$ in the case where the two electron lines are external lines and the photon line is internal. The external electron lines correspond to factors $u_1 = u(p_1)$ and $\bar u_2 = \bar u(p_2)$, and $\Gamma$ therefore appears in the expression for the diagram as the product

$$ j^\mu_{fi} = \bar u_2\Gamma^\mu u_1. \tag{116.1} $$

As already noted in §111, this is the electron transition current, including radiative corrections. The conditions of relativistic and gauge invariance enable us to establish the general matrix structure of this current.

The electromagnetic interaction operator $\hat V = e(\hat j\hat A)$ is a true scalar (not a pseudoscalar), in accordance with the conservation of spatial parity in these interactions. The transition current $j_{fi}$ is therefore a true 4-vector (not a pseudovector), and hence can be expressed only in terms of other true 4-vectors formed from the two available 4-vectors $p_1$ and $p_2$ ($k = p_2 - p_1$ is a third) and the bispinors $u_1$ and $\bar u_2$. There are three independent 4-vectors of this kind bilinear in $\bar u_2$ and $u_1$:

$$ \bar u_2\gamma^\mu u_1, \quad (\bar u_2u_1)p_1^\mu, \quad (\bar u_2u_1)p_2^\mu, $$

or, equivalently,

$$ \bar u_2\gamma^\mu u_1, \quad (\bar u_2u_1)P^\mu, \quad (\bar u_2u_1)k^\mu, \tag{116.2} $$

where $P = p_1 + p_2$. The condition of gauge invariance requires that the transition current should be transverse to the photon 4-momentum $k$:

$$ j_{fi}k = 0. \tag{116.3} $$

This is satisfied by the first two 4-vectors (116.2), respectively because of Dirac's equations

$$ (\gamma p_1 - m)u_1 = 0, \qquad \bar u_2(\gamma p_2 - m) = 0 \tag{116.4} $$

and because $Pk = 0$. The current $j_{fi}$ is given by a linear combination of these two 4-vectors:

$$ j^\mu_{fi} = f_1(\bar u_2u_1)P^\mu + f_2(\bar u_2\gamma^\mu u_1), $$

where $f_1$ and $f_2$ are invariant functions, called the electromagnetic form factors of the electron.
Since the 4-momenta $p_1$ and $p_2$ relate to the free electron, $p_1^2 = p_2^2 = m^2$, and from the three 4-vectors $p_1$, $p_2$ and $k$ (which are related by $k = p_2 - p_1$) we can construct only one independent scalar variable, which we take as $k^2$. Then the form factors are functions of $k^2$.

The expression for the current can also be put in other forms with different choices of the two independent terms. Using the equations (116.4) and the commutation rules for the matrices $\gamma$, we can easily show that

$$ (\bar u_2\sigma^{\mu\nu}u_1)k_\nu = -2m(\bar u_2\gamma^\mu u_1) + (\bar u_2u_1)P^\mu, \tag{116.5} $$

where $\sigma^{\mu\nu} = \tfrac12(\gamma^\mu\gamma^\nu - \gamma^\nu\gamma^\mu)$. The coefficient of this term will later be seen to have an important physical significance, and we therefore write

$$ \Gamma^\mu = \gamma^\mu f(k^2) - \frac{1}{2m}\,g(k^2)\,\sigma^{\mu\nu}k_\nu, \tag{116.6} $$

where $f$ and $g$ are two other form factors; the reason for writing the factor $1/2m$ separately will be explained below.† For brevity, we shall always use the vertex operator instead of the current, the $\bar u_2$ and $u_1$ on either side being understood.

In order to determine the properties of the form factors, let us consider the diagram (110.16) for the interaction of an electron with an external field. The corresponding scattering amplitude is

$$ M_{fi} = -e\,j^\mu_{fi}\,\mathcal A^{(e)}_\mu(k), \tag{116.7} $$

where $\mathcal A^{(e)}$ is the effective external field (taking account of the polarization of the vacuum).

The amplitude (116.7) describes two reaction channels. In the scattering channel, the invariant $t$ is such that $t = k^2 = (p_2 - p_1)^2 \le 0$. Putting $p_-$ for $p_2$ and $-p_+$ for $p_1$, we change to the annihilation channel, which corresponds to pair production with 4-momenta $p_-$ and $p_+$. In this channel, $t = (p_- + p_+)^2 \ge 4m^2$. The range $0 < t < 4m^2$ is non-physical.

† To avoid misunderstanding, it may be mentioned that in the definition (116.6) $k$ is assumed to be the 4-momentum of the photon line coming to the vertex; for the outgoing line, the sign of the second term would be reversed.

Let us now consider the unitarity condition (111.12).
In the scattering channel ($t < 0$) there are no physical intermediate states in this case: one free electron cannot change its momentum or give rise to any other particles. There are also, of course, no intermediate states in the non-physical region. Hence, when $t < 4m^2$, the right-hand side of (111.12) is zero, so that the matrix $T_{fi}$ (or, equivalently, $M_{fi}$) is Hermitian:

$$ M_{fi} = M^*_{if}. $$

The interchange of the initial and final states corresponds to the interchange of $p_2$ and $p_1$, and therefore to a reversal of the sign of $k$. Putting $M_{fi}$ in the form (116.7), we therefore have

$$ j^\mu_{fi}\,\mathcal A^{(e)}_\mu(k) = j^{\mu*}_{if}\,\mathcal A^{(e)*}_\mu(-k). $$

Since $\mathcal A^{(e)}(-k) = \mathcal A^{(e)*}(k)$, it follows that the transition-current matrix is also Hermitian:

$$ j_{fi} = j^*_{if} \quad\text{when } t < 4m^2. \tag{116.8} $$

Using the properties of the matrices $\gamma$ (21.7), we can easily prove that

$$ (\bar u_2\gamma^\mu u_1) = (\bar u_1\gamma^\mu u_2)^*, \qquad (\bar u_2\sigma^{\mu\nu}u_1) = -(\bar u_1\sigma^{\mu\nu}u_2)^*. $$

Thus $j^*_{if}$ differs from $j_{fi}$ only in that the functions $f(t)$ and $g(t)$ are replaced by their complex conjugates, and it then follows from (116.8) that these functions are real. Thus

$$ \operatorname{im} f(t) = \operatorname{im} g(t) = 0 \quad\text{when } t < 4m^2. \tag{116.9} $$

In the annihilation channel ($t > 4m^2$), the $f$ state is a pair which can be transformed into another pair with different momenta (elastic scattering) or into a more complex system. The right-hand side of the unitarity condition is therefore not zero, the matrix $M_{fi}$ (and hence $j_{fi}$) is not Hermitian, and so the form factors are complex.

The analytical properties of the functions $f(t)$ and $g(t)$ are exactly similar to those of the function $\mathcal P(t)$ discussed in §111, although it is difficult to prove them so directly. These functions are analytic in the complex $t$-plane cut along the positive real axis $t \ge 4m^2$, with

$$ f^*(t) = f(t^*), \qquad g^*(t) = g(t^*). $$

The renormalization condition (110.19) applied to the vertex operator (116.6) leads to the requirement

$$ f(0) = 1. \tag{116.10} $$

In order to include this condition automatically (when calculating $f(t)$ from its imaginary part), we must apply a dispersion relation of the form (111.8) not to the function $f(t)$ itself but to $(f - 1)/t$. Then we get the dispersion relation "with one subtraction":

$$ f(t) - 1 = \frac{t}{\pi}\int_{4m^2}^{\infty}\frac{\operatorname{im} f(t')}{t'(t' - t - i0)}\,dt'. \tag{116.11} $$

No values for the form factor $g(t)$ are prescribed by physical conditions. Its dispersion relation will therefore be written "without subtractions":

$$ g(t) = \frac{1}{\pi}\int_{4m^2}^{\infty}\frac{\operatorname{im} g(t')}{t' - t - i0}\,dt'. \tag{116.12} $$

The value of $g(0)$ has an important physical significance, as it specifies the correction to the magnetic moment of the electron. In order to see this, let us consider the scattering of a non-relativistic electron in a constant magnetic field which is almost uniform in space.

The term in the scattering amplitude (116.7) which depends on the form factor $g(k^2)$ is

$$ \delta M_{fi} = \frac{e}{2m}\,g(k^2)\,(\bar u_2\sigma^{\mu\nu}u_1)\,k_\nu A^{(e)}_\mu(k). \tag{116.13} $$

For a purely magnetic field, $A^{(e)\mu} = (0, \mathbf A)$; since the field is constant in time, the 4-vector $k^\mu = (0, \mathbf k)$, and since it varies only slowly in space, $\mathbf k$ is small. With a view to subsequently taking the limit $\mathbf k \to 0$, we have already replaced the effective $\mathcal A^{(e)}$ by $A^{(e)}$ in (116.13). Expanding (116.13) and expressing it in terms of three-dimensional quantities, we find

$$ \delta M_{fi} = \frac{e}{2m}\,g(-\mathbf k^2)\,(\bar u_2\boldsymbol\Sigma u_1)\cdot i\mathbf k\times\mathbf A_{\mathbf k}, $$

where $\boldsymbol\Sigma$ is the matrix (21.21). The product $i\mathbf k\times\mathbf A_{\mathbf k}$ is replaced by the magnetic field $\mathbf H_{\mathbf k}$, and we can then take the limit $\mathbf k \to 0$. Finally, with the non-relativistic spinor amplitudes $w_1$, $w_2$ given by (23.12):

$$ \bar u_2 = \sqrt{2m}\,(w_2^*\;\;0), \qquad u_1 = \sqrt{2m}\begin{pmatrix} w_1 \\ 0 \end{pmatrix}, $$

we have

$$ \delta M_{fi} = \frac{e}{2m}\,g(0)\,\mathbf H_{\mathbf k}\cdot 2m\,(w_2^*\boldsymbol\sigma w_1). \tag{116.14} $$

This expression may be compared with the scattering amplitude in a constant electric field having the scalar potential $\Phi_{\mathbf k}$:

$$ M_{fi} = -e(\bar u_2\gamma^0u_1)\Phi_{\mathbf k} \approx -e\Phi_{\mathbf k}\cdot 2m\,(w_2^*w_1). $$
We see that an electron in a magnetic field can be regarded as having an additional potential energy

$$ -\frac{e}{2m}\,g(0)\,\boldsymbol\sigma\cdot\mathbf H. $$

This means that the electron has an "anomalous" magnetic moment (in ordinary units)

$$ \mu' = \frac{e\hbar}{2mc}\,g(0) \tag{116.15} $$

in addition to its "normal" Dirac magnetic moment $e\hbar/2mc$.

§117. Calculation of electron form factors

Let us now go on to the actual calculation of the electron form factors (J. Schwinger, 1949).

In the zero-order approximation of perturbation theory, the vertex operator $\Gamma^\mu = \gamma^\mu$, i.e. the electron form factors are $f = 1$, $g = 0$. The first radiative correction to the form factors is given by the vertex diagram

[Diagram (117.1): external photon line $k$ attached to an electron line with external ends $p_-$ and $-p_+$, internal electron lines $p$ and $p - k$, and an internal photon line $f$; a horizontal dotted line cuts the two internal electron lines.]

with two real external electron lines and one virtual external photon line. We shall first calculate the imaginary parts of the form factors. As has been shown in §116, these differ from zero only in the annihilation channel ($k^2 > 4m^2$); accordingly, the 4-momenta of the external electron lines in the diagram (117.1) correspond to the production of an electron and a positron, and are denoted by $p_-$ and $-p_+$.

The analytical expression of the diagram (117.1) is

$$ -ie\,\bar u(p_-)\Gamma^\mu u(-p_+) = (-ie)^3\,\bar u(p_-)\,\gamma^\nu\,i\!\int G(p)\gamma^\mu G(p - k)\gamma^\lambda D_{\lambda\nu}(f)\,\frac{d^4p}{(2\pi)^4}\;u(-p_+) \tag{117.2} $$

or, in expanded form,

$$ \gamma^\mu f(k^2) - \frac{1}{2m}\,g(k^2)\,\sigma^{\mu\nu}k_\nu = \int\frac{i\phi^\mu(p)\,d^4p}{(p^2 - m^2)[(p - k)^2 - m^2]} \tag{117.3} $$

with the notation

$$ \phi^\mu(p) = -\frac{e^2}{4\pi^3}\,\frac{\gamma^\nu(\gamma p + m)\gamma^\mu(\gamma p - \gamma k + m)\gamma_\nu}{(p_- - p)^2} \tag{117.4} $$

and omitting for brevity the factors $\bar u(p_-)$, $u(-p_+)$; it is understood that both sides of the equations below are placed between these factors.

The horizontal dotted line divides the diagram (117.1) into two parts, in such a way as to indicate the intermediate state which would figure in a calculation of the imaginary part of the form factor by means of the unitarity condition, namely the state of an electron-positron pair with momenta differing from $p_-$, $p_+$.
The intersection also shows where the pole factors are to be replaced in the integral (117.2) in order to use the rule (115.9) in the calculation (in (117.3), these factors have been separated in the integrand). The integral in (117.3) has the same form as that in (115.2), and we can therefore immediately write the result of the transformation as in (115.10):

$$2\gamma^\mu\operatorname{im} f(t) - \frac{1}{m}\,\sigma^{\mu\nu}k_\nu\operatorname{im} g(t) = \frac{i\pi^2}{2}\sqrt{\frac{t-4m^2}{t}}\int\varphi^\mu(p)\,do_{\mathbf p}, \tag{117.5}$$

where $t = k^2$, the integration is over the directions of the vector $\mathbf p$, and the 4-vectors $p_-' = p$ and $p_+' = k-p$ in the definition of the function $\varphi^\mu(p)$ (117.4) become the 4-momenta of real (not virtual) particles. The expression (117.5) relates to a frame of reference in which $\mathbf k = 0$, i.e. the centre-of-mass system of the pair $p_-$, $p_+$ (and hence of the "intermediate" pair $p_-'$, $p_+'$). In this frame, therefore,

$$k = (k_0, 0),\quad p_- = (\tfrac12 k_0, \mathbf p_-),\quad p_+ = (\tfrac12 k_0, -\mathbf p_-),\quad p = (\tfrac12 k_0, \mathbf p),$$

and it is easy to verify that

$$f^2 = (p_- - p)^2 = -2\mathbf p^2(1-\cos\theta) = -\tfrac12(t-4m^2)(1-\cos\theta), \tag{117.6}$$

where $\theta$ is the angle between $\mathbf p$ and $\mathbf p_-$ (and $\mathbf p^2 = \mathbf p_-^2$).

Now substituting (117.4) in (117.5) and eliminating the matrices $\gamma^\nu\ldots\gamma_\nu$ in the integrand by means of (22.6), we have

$$\gamma^\mu\operatorname{im} f(t) - \frac{1}{2m}\,\sigma^{\mu\nu}k_\nu\operatorname{im} g(t) = -\frac{\alpha}{8\pi\sqrt{t(t-4m^2)}}\int\frac{do_{\mathbf p}}{1-\cos\theta}\,\gamma^\nu(\gamma p+m)\gamma^\mu(\gamma p-\gamma k+m)\gamma_\nu$$
$$= -\frac{\alpha}{4\sqrt{t(t-4m^2)}}\int\frac{do_{\mathbf p}}{2\pi(1-\cos\theta)}\left[-2m^2\gamma^\mu + 2(\gamma p_+-\gamma f)\gamma^\mu(\gamma p_-+\gamma f) + 4m(P^\mu+2f^\mu)\right] \tag{117.7}$$

with the 4-vectors

$$f = p - p_- = (0, \mathbf f),\qquad P = p_- - p_+ = (0, 2\mathbf p_-). \tag{117.8}$$

The integration now amounts to the calculation of the three integrals with different numerators

$$(I,\ I^\mu,\ I^{\mu\nu}) = \int\frac{(1,\ f^\mu,\ f^\mu f^\nu)}{1-\cos\theta}\,\frac{do_{\mathbf p}}{2\pi}. \tag{117.9}$$

The integral $I$ is logarithmically divergent when $\theta\to 0$. If it is written as

$$I = \int_0^{t-4m^2}\frac{d|f^2|}{|f^2|} = -\int_{-(t-4m^2)}^{0}\frac{d(f^2)}{f^2},$$

we see that the divergence corresponds to small "masses" of the virtual photon, i.e.
it is an "infra-red" divergence, which will be further discussed in §122; here we shall merely note that the divergence is fictitious, in the sense that, when all physical effects are correctly taken into account, the divergences of this kind cancel out. We can therefore cut the integral off at any lower limit, and then, in the subsequent calculation of real physical phenomena, make this limit tend to zero.

Here it will be simplest to apply the cut-off in a relativistically invariant manner. To do so, we assign to the virtual photon $f$ a small but finite mass $\lambda$ ($\lambda\ll m$), i.e. make the change

$$f^2 \to f^2 - \lambda^2 \tag{117.10}$$

in the photon propagator $D(f^2)$ in (117.2). Then

$$I = -\int_{-(t-4m^2)}^{0}\frac{d(f^2)}{f^2-\lambda^2} = \log\frac{t-4m^2}{\lambda^2}. \tag{117.11}$$

The integral $I^\mu$, in which $f$ is a space-like 4-vector, must be expressed in terms of the 4-vector $P^\mu$, which (unlike $k^\mu$, the only other available 4-vector) is space-like for all $p_+$ and $p_-$. Hence $I^\mu = AP^\mu$. If this equation is multiplied by $P_\mu$ and the integral $P_\mu I^\mu$ is calculated in the centre-of-mass system of the pair (with the components of the 4-vectors $f$ and $P$ given by (117.8)), the result is

$$A = \frac{1}{P^2}\int\frac{(Pf)\,d\cos\theta}{1-\cos\theta} = -\frac12\int_{-1}^{1}d\cos\theta = -1.$$

Thus

$$I^\mu = -P^\mu. \tag{117.12}$$

We can similarly calculate the integral

$$I^{\mu\nu} = \tfrac14 P^2\left(g^{\mu\nu} - \frac{k^\mu k^\nu}{k^2}\right) + \tfrac14 P^\mu P^\nu; \tag{117.13}$$

to determine the coefficients in this expression, we have only to evaluate the integrals $I^\mu{}_\mu$ and $I^{\mu\nu}P_\mu P_\nu$.

The calculation continues as follows: (117.11)-(117.13) are substituted in (117.7) to obtain a series of terms between $\bar u(p_-)$ and $u(-p_+)$. In each term we use the commutation rules for the matrices $\gamma^\mu$ to "propel" the factor $\gamma p_+$ to the right and $\gamma p_-$ to the left; then we can make the replacements $\gamma p_-\to m$, $\gamma p_+\to -m$, since

$$\bar u(p_-)\,\gamma p_- = m\,\bar u(p_-),\qquad \gamma p_+\,u(-p_+) = -m\,u(-p_+).$$

In the resulting sum

$$-4(p_+p_-)\,I\,\gamma^\mu + 2mP^\mu - 3P^2\gamma^\mu,$$

we can then replace $P^\mu$ by $2m\gamma^\mu + \sigma^{\mu\nu}k_\nu$, which is equivalent to it when between the $\bar u$ and $u$ factors; cf. (116.5).
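The coefficients in (117.12) and (117.13) can be checked numerically. The following is only an illustrative sketch, not part of the original derivation: it assumes the centre-of-mass kinematics of (117.6) and (117.8), takes $\mathbf p_-$ along the z axis, uses the measure $do_{\mathbf p}/2\pi(1-\cos\theta)$ of (117.9), and integrates with a plain midpoint rule. The function name `angular_integrals` and the sample values of `t` and `m` are arbitrary choices made for the example.

```python
import math

def angular_integrals(t, m=1.0, n_theta=200, n_phi=32):
    """Return (A, B, C) with I^mu = A P^mu and
    I^{mu nu} = B P^2 (g^{mu nu} - k^mu k^nu / k^2) + C P^mu P^nu."""
    p_abs = math.sqrt(t / 4.0 - m * m)        # |p| = |p_-| in the centre-of-mass frame
    p_minus = (0.0, 0.0, p_abs)               # p_- chosen along the z axis
    P = tuple(2.0 * c for c in p_minus)       # spatial part of P = (0, 2 p_-)
    P2 = -sum(c * c for c in P)               # P^2 with metric (+, -, -, -)

    PI = trace = IPP = 0.0                    # P_mu I^mu, I^mu_mu, I^{mu nu} P_mu P_nu
    dc = 2.0 / n_theta
    dphi = 2.0 * math.pi / n_phi
    for i in range(n_theta):
        c = -1.0 + (i + 0.5) * dc             # cos(theta) at the cell midpoint
        s = math.sqrt(1.0 - c * c)
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            p = (p_abs * s * math.cos(phi), p_abs * s * math.sin(phi), p_abs * c)
            f = tuple(a - b for a, b in zip(p, p_minus))   # f = p - p_-, time component zero
            f2 = -sum(a * a for a in f)                    # f^2 (space-like, negative)
            Pf = -sum(a * b for a, b in zip(P, f))         # 4-product P.f
            w = dc * dphi / (2.0 * math.pi * (1.0 - c))    # do / (2 pi (1 - cos theta))
            PI += w * Pf
            trace += w * f2
            IPP += w * Pf * Pf

    A = PI / P2
    trace_coeff = trace / P2                  # equals 3B + C, since g^mu_mu - 1 = 3
    pp_coeff = IPP / (P2 * P2)                # equals B + C, since P.k = 0
    B = 0.5 * (trace_coeff - pp_coeff)
    C = pp_coeff - B
    return A, B, C

A, B, C = angular_integrals(t=10.0)
print(A, B, C)   # close to -1, 0.25, 0.25
```

The two contractions used here are exactly the ones named in the text, $I^\mu{}_\mu$ and $I^{\mu\nu}P_\mu P_\nu$; any $t > 4m^2$ should give the same coefficients, since they do not depend on $t$.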
Finally, all quantities are expressed in terms of the invariant t = k2 (2p+p_ = t -2m 2 , P 2 = 4m 2 -f) and the two sides of (117.7) are compared; this yields the following expressions for the imaginary parts of the form factors: ""^'Vud-W)]' im /<*> = 4V[t(r- 4m2)] [ ~ 3' + 8m2 + 2(f - 2m2) log ^ ^ ] . (11714) (117.15) The infra-red divergence occurs only in im/(t). The functions f(t) and g(t) themselves are obtained from their imaginary parts by means of (116.11) and (116.12), in which the integrations are effected by the same substitutions as were used in §113 to calculate 0*(O- The form factors are given in terms of the variable £ (113.11) by +j ^ I ^ M log21 - 2F(|) +2 log £ log (1+ £)]}, where F(£) is Spence's function (131.19). (117.17) §118 521 Anomalous Magnetic Moment of the Electron In the non-physical region (0< r/m 2 <4), we must put £ = ei4>. The expressions for the form factors can then be written IT l\ tan<£/ A 2 sin <> / tan <> / J o J (117.18) (H7.19) SW-TZZ&A- Z7T s m 4m 2 , >4m 2 , (117.21) ï M: Formula (117.21) is valid (as regards re/) with what is called double-logarithmic accuracy, i.e. as far as the squares of the large logarithms.! § 118. Anomalous magnetic moment of the electron As has been shown in §116, the value of g(0) determines the radiative correction to the magnetic moment of the electron. If we seek to calculate only this quantity, there is of course no need to find the whole function g(t). From (117.14) and (116.12), nl(\\- x f im g(*') if - a f dx - a man With this correction, the magnetic moment of the electron is "-&('►£)• <»«*■ a formula first derived by Schwinger (1949). t The expression for thé vertex operator in the case of one virtual and one real external electron line and a real external photon line is given by A. I. Akhiezer and V. B. Berestetskii, Quantum Electrodynamics, Interscience, New York, 1965, §47.5. 
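As a numerical cross-check of the first-order result, the dispersion integral (116.12) at $t = 0$ can be evaluated with $\operatorname{im} g(t) = \alpha m^2/\sqrt{t(t-4m^2)}$ from (117.14). The sketch below is illustrative only: the numerical value of $\alpha$, the function names, and the substitution $t' = 4m^2/(1-s^2)$ (used to map the infinite range to $0 < s < 1$ and remove the threshold singularity) are choices made for this example, not taken from the text.

```python
import math

ALPHA = 1 / 137.036  # fine-structure constant (assumed numerical value)

def im_g(t, m=1.0, alpha=ALPHA):
    """Imaginary part of g(t) for t > 4 m^2, as in (117.14)."""
    return alpha * m * m / math.sqrt(t * (t - 4.0 * m * m))

def g_at_zero(m=1.0, alpha=ALPHA, n=2000):
    """g(0) = (1/pi) * integral over t' from 4 m^2 to infinity of im g(t') / t'."""
    total = 0.0
    ds = 1.0 / n
    for i in range(n):
        s = (i + 0.5) * ds                       # midpoint rule in s
        t = 4.0 * m * m / (1.0 - s * s)          # t' = 4 m^2 / (1 - s^2)
        dt_ds = 8.0 * m * m * s / (1.0 - s * s) ** 2
        total += im_g(t, m, alpha) / t * dt_ds * ds
    return total / math.pi

g0 = g_at_zero()
print(g0, ALPHA / (2 * math.pi))   # the two numbers should agree closely
print(1 + g0)                      # magnetic moment in units of e*hbar/(2 m c)
```

With this substitution the integrand becomes the constant $\alpha/2$, which is why even a crude quadrature reproduces $g(0) = \alpha/2\pi$; the result is independent of the mass `m`.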
In the next approximation (with terms in $\alpha^2$), the radiative corrections in the form factors are represented by the seven diagrams (106.10, (c)-(i)). Even to find the value of $g(0)$ in this approximation demands very lengthy calculations. Details of these may be found in the original papers; here we shall give only the final value† for the correction in the second approximation:

$$g^{(2)}(0) = \left(\frac{197}{144} + \frac{\pi^2}{12} + \frac34\zeta(3) - \frac{\pi^2}{2}\log 2\right)\frac{\alpha^2}{\pi^2} = -0.328\,\frac{\alpha^2}{\pi^2}. \tag{118.3}$$

The magnetic moment of the electron is therefore

$$\mu = \frac{e\hbar}{2mc}\left(1 + \frac{\alpha}{2\pi} - 0.328\,\frac{\alpha^2}{\pi^2}\right) \tag{118.4}$$

(C. M. Sommerfield, 1957; A. Petermann, 1957).

† The calculation by the unitarity method is given by M. V. Terent'ev, Soviet Physics JETP 16, 444, 1963.

Particular consideration may be given to the contribution of the vacuum polarization to the correction $g^{(2)}(0)$, namely the diagram (118.5), which contains a photon self-energy part. It differs from the first-approximation diagram (117.1) only by having instead of the photon propagator $D(f^2) = 4\pi/f^2$ the product

$$D(f^2)\,\frac{\mathcal P(f^2)}{4\pi}\,D(f^2) = \frac{4\pi\,\mathcal P(f^2)}{(f^2)^2},$$

where $\mathcal P(f^2)$ is the polarization operator in the first approximation (terms in $\alpha$), calculated in §113. Repeating, with this difference, some of the calculations in §117, we find as the "polarization part" of the correction

$$\operatorname{im} g^{(2)}_{\rm polar}(t) = \frac{\alpha m^2}{\sqrt{t(t-4m^2)}}\int_{-1}^{1}\frac{\mathcal P(f^2)}{f^2}\,\frac{1+3\cos\theta}{2}\,d\cos\theta \tag{118.6}$$

with

$$f^2 = -\tfrac12(t-4m^2)(1-\cos\theta); \tag{118.7}$$

see (117.6). When this integral and then

$$g^{(2)}_{\rm polar}(0) = \frac{1}{\pi}\int_{4m^2}^{\infty}\operatorname{im} g^{(2)}_{\rm polar}(t')\,\frac{dt'}{t'} \tag{118.8}$$

are calculated, the result is

$$g^{(2)}_{\rm polar}(0) = \frac{\alpha^2}{\pi^2}\left(\frac{119}{36} - \frac{\pi^2}{3}\right) = 0.016\,\frac{\alpha^2}{\pi^2}, \tag{118.9}$$

which is about 5% of the total quantity (118.3).

It has already been noted at the end of §114 that vacuum polarization for other particles may also make a contribution to the radiative corrections.
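The numbers quoted in (118.3), (118.4) and (118.9) can be reproduced by direct arithmetic. The sketch below is a hedged illustration: the closed form used for the second-order coefficient is the one commonly quoted for the Sommerfield-Petermann result and is an assumption of this example, as are the numerical values of $\alpha$ and $\zeta(3)$ and all variable names.

```python
import math

ZETA3 = 1.2020569031595943   # Riemann zeta(3) (assumed numerical value)
ALPHA = 1 / 137.036          # fine-structure constant (assumed numerical value)

# Coefficient of alpha^2/pi^2 in (118.3), from the commonly quoted closed form (assumption).
c2_total = 197 / 144 + math.pi ** 2 / 12 + 0.75 * ZETA3 - 0.5 * math.pi ** 2 * math.log(2)

# Vacuum-polarization part: the bracket in (118.9).
c2_polar = 119 / 36 - math.pi ** 2 / 3

a_over_pi = ALPHA / math.pi
# Magnetic moment in units of e*hbar/(2 m c), as in (118.4).
moment_ratio = 1 + 0.5 * a_over_pi + c2_total * a_over_pi ** 2

print(round(c2_total, 3))                   # second-order coefficient
print(round(c2_polar, 3))                   # polarization part
print(round(abs(c2_polar / c2_total), 3))   # fraction of the total, roughly 5%
print(moment_ratio)
```

The ratio printed on the third line is the "about 5%" mentioned after (118.9); the last line shows how small the second-order term is compared with $\alpha/2\pi$.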
The contribution of the muon vacuum to the anomalous magnetic moment of the electron is obtained from the same formulae (118.6)-(118.8), in which $m$ is again the electron mass $m_e$ (this applies also to the definition of the variable $f^2$) but the parameter $m$ in the expression for $\mathcal P(f^2)$ must be the muon mass $m_\mu$. The function $\mathcal P(f^2)/f^2$ depends only on the ratio $f^2/m_\mu^2$. In the integral (118.8) the important values of $t$ (and therefore of $f^2$) are those comparable with $m_e^2$; hence the ratio $|f^2|/m_\mu^2\sim(m_e/m_\mu)^2\ll 1$, and in evaluating the integrals we can use the limiting formula (113.14), according to which

$$\mathcal P(f^2) = -\frac{\alpha}{15\pi}\,\frac{(f^2)^2}{m_\mu^2}.$$

From this we see that the contribution to $g^{(2)}(0)$ from the muon vacuum polarization contains the extra small factor $(m_e/m_\mu)^2$.

The opposite situation, however, occurs for the corrections to the magnetic moment of the muon. Since the particle mass does not appear in (118.3), this value of $g^{(2)}(0)$ is valid for the muon also, and it takes into account the contribution of the muon vacuum polarization. But the contribution from the vacuum polarization of other particles, namely the electrons, is here considerably greater. It can be calculated from formulae (118.6)-(118.8), with $m_\mu$ for $m$ and the electron polarization operator for $\mathcal P(f^2)$. Unlike the previous case, the important range of values is now given by $|f^2|/m_e^2\sim(m_\mu/m_e)^2\gg 1$, and $\mathcal P(f^2)$ must be taken as the limiting expression (113.15):

$$\mathcal P(f^2) = \frac{\alpha}{3\pi}\,f^2\log\frac{|f^2|}{m_e^2}.$$

The calculation of the integrals gives

$$\left[g^{(2)}(0)\right]_{\rm el.\,polar} = \left(\frac{\alpha}{\pi}\right)^2\left(\frac13\log\frac{m_\mu}{m_e} - \frac{25}{36}\right) = 1.09\,\frac{\alpha^2}{\pi^2} \tag{118.10}$$

(H. Suura and E. H. Wichmann, 1957; A. Petermann, 1957).

Adding (118.10) and (118.3), we find as the magnetic moment of the muon

$$\mu_{\rm muon} = \frac{e\hbar}{2m_\mu c}\left(1 + \frac{\alpha}{2\pi} + 0.76\,\frac{\alpha^2}{\pi^2}\right). \tag{118.11}$$

The contribution of the muon vacuum polarization (118.9) is here about 2% of the total value of $g^{(2)}(0)$.
The pion vacuum polarization would give a contribution of the same order (because of the similarity of the masses), but this cannot be calculated exactly, and so there would be no point in finding the corrections $\sim\alpha^3$ in the magnetic moment of the muon.

§ 119. Calculation of the mass operator

The method of direct regularization of the Feynman integrals may be demonstrated by a calculation of the mass operator. In the first non-vanishing approximation, the mass operator is represented by the loop in the diagram (119.1), corresponding to the integral

$$-i\bar{\mathcal M}(p) = (-ie)^2\int\gamma^\mu G(p-k)\gamma^\nu D_{\mu\nu}(k)\,\frac{d^4k}{(2\pi)^4};$$

on substituting the propagators and combining the factors $\gamma^\mu\ldots\gamma_\mu$ by means of formulae (22.6), we get

$$\bar{\mathcal M}(p) = -\frac{8\pi ie^2}{(2\pi)^4}\int\frac{(2m-\gamma p+\gamma k)\,d^4k}{[(p-k)^2-m^2]\,(k^2-\lambda^2)} \tag{119.2}$$

(the bar over $\mathcal M$ denotes the non-regularized value of the integral). The fictitious "photon mass" $\lambda$ is used in the photon propagator in order to eliminate the infra-red divergence (as in §117).

The integral may be transformed by means of formula (131.4), with $a_1$ and $a_2$ the two factors in the denominator of (119.2). A simple rearrangement of the terms in the denominator of the new integral gives

$$\bar{\mathcal M}(p) = -\frac{8\pi ie^2}{(2\pi)^4}\int d^4k\int_0^1 dx\,\frac{2m-\gamma p+\gamma k}{[(k-px)^2-a^2]^2} \tag{119.3}$$

with

$$a^2 = m^2x^2 - (p^2-m^2)\,x(1-x) + \lambda^2(1-x). \tag{119.4}$$

The change of variable $k\to k+px$ brings the integrand in (119.3) to a form in which its denominator depends only on $k^2$; according to (131.17), (131.18) a constant is then added to the integral:

$$\bar{\mathcal M}(p) = -\frac{8\pi ie^2}{(2\pi)^4}\left\{\int d^4k\int_0^1 dx\,\frac{2m-\gamma p\,(1-x)}{(k^2-a^2)^2} - \tfrac14 i\pi^2\,\gamma p\right\}. \tag{119.5}$$

(The term in $\gamma k$ in the numerator is here omitted, since it gives zero on integration over the directions of the 4-vector $k$; cf. (131.8).)

The regularization of this integral involves subtractions such as to reduce it to the form (110.20). The latter expression gives zero on multiplication by the wave amplitude $u(p)$, if $p$ is the 4-momentum of a real electron.
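The algebra behind (119.3) and (119.4) can be checked numerically. The sketch below is an illustration under stated assumptions: it takes (131.4) to be the usual parametrization $1/(a_1a_2) = \int_0^1 dx\,[a_1x + a_2(1-x)]^{-2}$, uses the metric $(+,-,-,-)$, and the helper names (`mdot`, `combined_denominator`, `completed_square`, `feynman_parameter_integral`) are invented for this example.

```python
import random

def mdot(a, b):
    """Minkowski product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def combined_denominator(k, p, x, m, lam):
    """x * [(p - k)^2 - m^2] + (1 - x) * [k^2 - lam^2]."""
    pk = [pi - ki for pi, ki in zip(p, k)]
    return x * (mdot(pk, pk) - m * m) + (1 - x) * (mdot(k, k) - lam * lam)

def completed_square(k, p, x, m, lam):
    """(k - p x)^2 - a^2, with a^2 taken from (119.4)."""
    shifted = [ki - x * pi for ki, pi in zip(k, p)]
    p2 = mdot(p, p)
    a2 = m * m * x * x - (p2 - m * m) * x * (1 - x) + lam * lam * (1 - x)
    return mdot(shifted, shifted) - a2

def feynman_parameter_integral(a1, a2, n=2000):
    """Midpoint estimate of the integral over x from 0 to 1 of [a1 x + a2 (1 - x)]^(-2)."""
    h = 1.0 / n
    return sum(h / (a1 * (i + 0.5) * h + a2 * (1 - (i + 0.5) * h)) ** 2 for i in range(n))

random.seed(1)
k = [random.uniform(-3, 3) for _ in range(4)]
p = [random.uniform(-3, 3) for _ in range(4)]
x = random.random()
print(combined_denominator(k, p, x, 1.0, 0.1) - completed_square(k, p, x, 1.0, 0.1))  # ~ 0
print(feynman_parameter_integral(2.0, 5.0), 1 / (2.0 * 5.0))                            # nearly equal
```

The first printed number confirms that the combined denominator is $(k-px)^2 - a^2$ for arbitrary (here random, off-shell) momenta, which is what makes the shift $k \to k + px$ useful.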
Without bringing in $u(p)$ explicitly, we can formulate this condition as stating that $\mathcal M(p)$ vanishes when we substitute

$$\gamma p\to m,\qquad p^2\to m^2. \tag{119.6}$$

The form of the integral (119.5) is convenient in that the 4-vector $p$ appears in it only as $\gamma p$ and $p^2$, but not as $kp$. Subtracting from (119.5) a similar expression after the substitution (119.6), we get

$$-\frac{8\pi ie^2}{(2\pi)^4}\left\{\int d^4k\int_0^1 dx\,[2m-\gamma p\,(1-x)]\left[\frac{1}{(k^2-a^2)^2} - \frac{1}{(k^2-a_0^2)^2}\right] - \int d^4k\int_0^1 dx\,\frac{1-x}{(k^2-a_0^2)^2}\,(\gamma p-m) - \tfrac14 i\pi^2(\gamma p-m)\right\}, \tag{119.7}$$

with $a_0^2 = m^2x^2 + \lambda^2(1-x)$.

To complete the regularization, however, a further subtraction is needed: according to (110.20), the substitution (119.6) should reduce to zero not only $\mathcal M(p)$ itself but also $\mathcal M(p)$ without the factor $\gamma p-m$. A corresponding subtraction removes entirely the second and third terms in the braces in (119.7).†

The first integral is transformed by incorporating a further integration (using (131.5)), taking $n = 2$, $a = k^2-a^2$, and $b = k^2-a_0^2$. Then (119.7) becomes

$$(\gamma p-m)\,\frac{16\pi ie^2}{(2\pi)^4}\int d^4k\int_0^1 dx\int_0^1 dz\,\frac{(\gamma p+m)\,[2m-\gamma p\,(1-x)]\,x(1-x)}{[k^2-a_0^2+(p^2-m^2)\,x(1-x)\,z]^3},$$

where we have also used the identity $p^2-m^2 = (\gamma p-m)(\gamma p+m)$.

† Thus, in the process of renormalization "en route" (§110) we omit corrections to the renormalization constant $Z_1$. The corresponding integrals are logarithmically divergent. If we use a "cut-off parameter" $\Lambda^2\gg m^2, p^2$, and limit the region of integration over $d^4k$ by the condition $k^2\le\Lambda^2$, the correction can be explicitly calculated; the result is

$$Z_1 = 1 - \frac{\alpha}{2\pi}\left[\frac12\log\frac{\Lambda^2}{m^2} + \log\frac{\lambda^2}{m^2} + \frac94\right].$$
The integration over d4k is immediate: assuming that p 2 - m 2 < 0 and using (131.14), we have WK 27T J 0 m x + Az(l - x ) + (m -pO*(l - x ) z J 0 We now need to subtract a similar integral with the substitution (119.6), omitting temporarily the factor yp - m; after some simple calculations, we get i 2 2 M(p) = (yp-m) ^jdxfdz x mO-x_») - (W_ + m)(l -Xf[l L * T vA/wj J ~J$$g MIOft* m X + (W ~~ P )(1 ~~ *)Z (the term in A2 is omitted from the common denominator, since this causes no divergence in the present case; elsewhere, A 2 (l-x) is replaced by A2, since the infra-red divergence will correspond to the divergence as x -» 0). The integration in (119.8) (first over dz and then over dx) is fairly lengthy but elementary, and the result is M(p) - ÛH(w - m*[w=7) (' _ T%log ") ~ with p = (m2-p2)lm2 (R. Karplus and N. M. Kroll, 1950). The integral has been calculated on the assumption that p > 0 and p > Kim. In accordance with the rule for passing round poles, in the analytical continuation of (119.9) into the region p < 0 the phase of the logarithm is found by making m-nn- iO; then p^*p- i0, and log p must be taken f or p < 0 as logp = log|p|-iir, p<0. Let us now consider how the mass operator behaves when p2>m2. - p ** p2lm2 > l, and with logarithmic accuracy we have M{p) = (H9.10) Then -[<S-\p)-G-\p)\ ~^(7p)log^. (Il9.ll) §119 Calculation of the Mass Operator 527 As in the case of the photon propagator (cf. formulae (113.15) and (113.16) for the polarization operator), the correction to G 1 is small only when the energy is so small that 47T m In the present case, however, the logarithmic increase is in a certain sense fictitious, and can be eliminated by a suitable choice of gauge, i.e. of the function D (n in the photon propagator (L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov, 1954). To achieve this, we put (in the notation of §103) D (0 = 0, (119.12) whereas formula (119.9) has been derived with the gauge D{i) = D. 
(119.13)

This property of the gauge (119.12) makes it especially suitable for investigating the theory when $p^2\gg m^2$, as we shall do in §132.

To prove the result stated, we note that, if only terms in $e^2$ are concerned, the transformation from the gauge (119.13) to (119.12) can be regarded as infinitesimal. Accordingly, we can apply immediately the formula (105.14), with $d^{(l)}(q) = -D/q^2 = -4\pi/(q^2)^2$, and also replace $\mathcal G$ in the integrand by $G$ with the necessary accuracy. In the integral over $d^4q$, the important range is $q\gg p$, for which $G(p-q)$ in the integrand is much less than $G(p)$ and can be neglected. Then

$$\delta\mathcal G^{-1} = -G^{-1}(p)\,\delta\mathcal G(p)\,G^{-1}(p) = -ie^2\,G^{-1}(p)\int d^{(l)}(q)\,\frac{d^4q}{(2\pi)^4}.$$

Finally, with the transformation (131.11) and (131.12), we have

$$\delta\mathcal G^{-1}(p) = -\frac{\alpha}{4\pi}\,G^{-1}(p)\int_{|p^2|}^{\Lambda^2}\frac{d|q^2|}{|q^2|},$$

where $\Lambda$ is an upper limit, at which the divergence is removed by renormalization; the renormalization consists in subtracting the same expression with $p^2\approx m^2$, giving finally

$$\delta\mathcal G^{-1}(p) = \frac{\alpha}{4\pi}\,(\gamma p)\log\frac{p^2}{m^2}.$$

This just cancels the difference $\mathcal G^{-1} - G^{-1}$ in (119.11).

Lastly, let us consider why it is necessary to use a finite "photon mass" $\lambda$ in regularizing the integral (119.2), which is closely related to its behaviour when $p^2\to m^2$.

Firstly, this integral itself is finite when $p^2 = m^2$ and $\lambda = 0$; to exclude the divergence for large $k$, which is here unimportant, we assume that the integral is taken over a large but finite region of $k$-space. The need to use $\lambda$ arises in the subtraction of the renormalization integral, which would otherwise diverge at $p^2 = m^2$. Let us therefore ascertain how the non-regularized mass operator would behave as $p^2\to m^2$. Since this behaviour depends essentially on the choice of gauge, we shall consider the general case of an arbitrary gauge, whereas the integral (119.2) has been written for the specific gauge (119.13).

We again apply the transformation (105.14).
Writing

$$d^{(l)}(q) = \frac{\delta D^{(l)}}{q^2} = \frac{4\pi}{(q^2)^2}\,\delta a(q^2), \tag{119.14}$$

we assume that $\delta a$ is the variation of a function $a(q^2)$ which changes appreciably only over intervals $q^2\sim m^2$ and is finite when $q^2\approx m^2$. In the integrand on the right of (105.14), the two terms in the difference $\mathcal G(p) - \mathcal G(p-q)$ are almost equal when $q$ is small, and the integral converges. For small $q$, the denominator of $\mathcal G(p-q)$ is $(p-q)^2-m^2\approx p^2-m^2-2pq$, and so $\mathcal G(p-q)$ may be neglected in comparison with $\mathcal G(p)$ when $q\gg(p^2-m^2)/m$. The integral

$$\delta\mathcal G(p) = ie^2\,\mathcal G(p)\int d^{(l)}(q)\,\frac{d^4q}{(2\pi)^4}$$

diverges logarithmically in the range

$$\frac{(p^2-m^2)^2}{m^2}\ll q^2\ll m^2.$$

We therefore have, with logarithmic accuracy,

$$\frac{\delta\mathcal G}{\mathcal G} = -\frac{\alpha}{2\pi}\,\delta a(m^2)\log\frac{m^2}{p^2-m^2}.$$

This can be integrated as follows. When $\alpha = e^2\to 0$, the exact propagator $\mathcal G$ must be the same as the free-particle propagator $G$, and therefore

$$\mathcal G(p) = \frac{1}{\gamma p-m}\left(\frac{m^2}{p^2-m^2}\right)^{(\alpha/2\pi)(C-a_0)}, \tag{119.15}$$

where $a_0 = a(m^2)$ and $C$ is a constant. To determine $C$, we compare the expression

$$\mathcal G^{-1}(p) = (\gamma p-m)\left[1 + \frac{\alpha}{2\pi}(C-a_0)\log\rho\right], \tag{119.16}$$

which is obtained from (119.15) in the first approximation with respect to $\alpha$, and the corresponding expression given by the integral (119.2) when $\lambda = 0$:†

$$\mathcal G^{-1}(p) = (\gamma p-m)\left[1 + \frac{\alpha}{\pi}\log\rho\right]. \tag{119.17}$$

According to the definition (119.14), the function $a(q^2)$ is equal to the ratio $D^{(l)}/D$. The gauge (119.13) to which (119.17) belongs therefore corresponds to $a = a_0 = 1$. Equating (119.16) and (119.17) for this value of $a_0$, we find $C = 3$.

Thus we have finally as the limiting (infra-red asymptotic) expression for the unrenormalized electron propagator with $p^2\to m^2$

$$\mathcal G(p) = \frac{1}{\gamma p-m}\left(\frac{m^2}{p^2-m^2}\right)^{(\alpha/2\pi)(3-a_0)} \tag{119.18}$$

(A. A. Abrikosov, 1955). The validity of this formula depends only on the inequalities $\alpha\ll 1$, $|\log\rho|\gg 1$, whereas the formulae of perturbation theory would require also that $\alpha|\log\rho|/2\pi\ll 1$. The sign of the difference $p^2-m^2$ is also unimportant here, since the imaginary part of (119.18) is in any case beyond the limits of its accuracy.

The renormalized propagator must have a simple pole when $p^2 = m^2$.
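The difference between the power law (119.18) and its first-order expansion (119.16) can be made concrete with a few numbers. This is only an illustrative sketch: it compares the scalar factor $|\rho|^{(\alpha/2\pi)(C-a_0)}$ that multiplies $\gamma p - m$ in the inverse of (119.18) with $1 + (\alpha/2\pi)(C-a_0)\log\rho$, ignoring the phase of $\rho$ (which the text notes is beyond the accuracy of the formula). The numerical value of $\alpha$ and the function names are assumptions of the example.

```python
import math

ALPHA = 1 / 137.036  # fine-structure constant (assumed numerical value)

def exact_factor(rho, a0, C=3.0, alpha=ALPHA):
    """Factor multiplying (gamma p - m) in the inverse of (119.18)."""
    return rho ** (alpha * (C - a0) / (2.0 * math.pi))

def first_order_factor(rho, a0, C=3.0, alpha=ALPHA):
    """Same factor to first order in alpha, as in (119.16)."""
    return 1.0 + alpha * (C - a0) / (2.0 * math.pi) * math.log(rho)

for rho in (1e-3, 1e-30, 1e-300):
    # Gauge (119.13), a0 = 1: the two agree only while alpha*|log rho|/(2 pi) is small.
    print(rho, exact_factor(rho, 1.0), first_order_factor(rho, 1.0))

# For a0 = 3 the exponent vanishes and the factor is 1 for every rho.
print(exact_factor(1e-300, 3.0))
```

For moderate $\rho$ the two factors nearly coincide, while for extremely small $\rho$ the first-order expression becomes negative and is clearly unusable, whereas the power law stays between 0 and 1.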
We see that (119.18) satisfies this condition only in the gauge where

$$D^{(l)} = 3D \tag{119.19}$$

(so that $a_0 = 3$). Then the regularization of the Feynman integral (in order to prevent its divergence at the upper limits) will not require the use of a finite "photon mass". In other gauges, the zero mass of the photon produces a branch point instead of a simple pole at $p^2 = m^2$, and the finite parameter $\lambda$ is needed in order to remove this "defect".

† It is not necessary to repeat the calculations in order to derive (119.17). The term in $\log\rho$ in (119.9) is obtained on the assumption that $\rho\gg\lambda/m$, which allows the limit $\lambda\to 0$ to be taken. The term in $\log(\lambda/m)$ arises from the subtraction of the renormalization integral, and is not present in the original integral (119.2). The subtraction is easily seen to have no effect on the $\log\rho$ terms.

§ 120. Emission of soft photons with non-zero mass

In calculating the electron form factors in §117, we encountered a divergence of the integrals when the frequencies of the virtual photons are small. This divergence is closely related to the infra-red catastrophe discussed in §98, where it was pointed out that the cross-section for any process involving charged particles (including the scattering of electrons by an external field, represented by a diagram such as (117.1)) has no significance in itself, but only when the simultaneous emission of any number of soft photons is taken into account. It will be shown in §122 that all the divergences cancel in the total cross-section, which includes the emission of soft quanta. Here, of course, in order to obtain the correct result it is necessary for the initial "cut-off" of the divergent integrals to be taken in the same manner in all the cross-sections in the sum. In §117, this cut-off was applied by means of a fictitious finite mass $\lambda$ of the virtual photon.
We must therefore now modify the formulae of §98 in such a way that they describe the emission of soft "photons" with non-zero mass.

Formally, such a photon is a "vector" particle with spin 1, whose free field has been discussed in §14. Such particles are described by a 4-vector $\psi$-operator

$$\psi_\mu = \sqrt{4\pi}\sum_{\mathbf k\alpha}\frac{1}{\sqrt{2\omega}}\left(c_{\mathbf k\alpha}\,e^{(\alpha)}_\mu\,e^{-ikx} + c^+_{\mathbf k\alpha}\,e^{(\alpha)*}_\mu\,e^{ikx}\right),\qquad \alpha = 1, 2, 3; \tag{120.1}$$

the notation and normalization differ from that in (14.16), in order to bring them into line with the photon case. The interaction of the "photons" (120.1) with electrons is to be described by a Lagrangian of the same form as for true photons:

$$-e\,j^\mu\psi_\mu, \tag{120.2}$$

the potential $A_\mu$ being replaced by $\psi_\mu$. Then the amplitudes for the processes of emission of photons with finite mass will be given by the usual rules of the diagram technique, the only differences being that

$$k^2 = \lambda^2 \tag{120.3}$$

and that the summation over the polarizations of the emitted photon must be taken over three independent polarizations (two transverse and one longitudinal) instead of two as for the ordinary photon. This is equivalent to averaging with respect to the density matrix of unpolarized particles

$$\rho_{\mu\nu} = -\frac13\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{\lambda^2}\right) \tag{120.4}$$

(cf. (14.15)), followed by multiplication by 3.

The propagator for "photons with non-zero mass" is

$$D_{\mu\nu} = \frac{4\pi}{k^2-\lambda^2}\left(g_{\mu\nu} - \frac{k_\mu k_\nu}{\lambda^2}\right)$$

(cf. (76.18)), but in the case of gauge invariance the amplitudes of real scattering processes do not depend on the longitudinal part of the photon propagator, and this property is not the result of the specific form of its transverse part. The second term in the parentheses can therefore be omitted, leaving an expression of the same type as for ordinary photons:

$$D_{\mu\nu} = \frac{4\pi}{k^2-\lambda^2}\,g_{\mu\nu}, \tag{120.5}$$

as has been used in §§117 and 119.

Let us now consider the emission of soft photons (in the sense explained in §98).
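The statement that the sum over three polarizations equals three times the density matrix (120.4) can be verified with explicit polarization vectors. The sketch below is illustrative and rests on assumptions made for the example: the "photon" moves along the z axis, contravariant components are used with metric $(+,-,-,-)$, the polarization vectors are real and normalized to $e^2 = -1$ with $ek = 0$, and the function names are invented here.

```python
import math

METRIC = [[1.0, 0.0, 0.0, 0.0],
          [0.0, -1.0, 0.0, 0.0],
          [0.0, 0.0, -1.0, 0.0],
          [0.0, 0.0, 0.0, -1.0]]

def polarization_vectors(k_abs, lam):
    """Two transverse and one longitudinal polarization 4-vectors for momentum along z."""
    omega = math.sqrt(k_abs ** 2 + lam ** 2)
    return [
        (0.0, 1.0, 0.0, 0.0),                   # transverse, along x
        (0.0, 0.0, 1.0, 0.0),                   # transverse, along y
        (k_abs / lam, 0.0, 0.0, omega / lam),   # longitudinal
    ]

def polarization_sum(k_abs, lam):
    """Sum over polarizations of e^mu e^nu."""
    vecs = polarization_vectors(k_abs, lam)
    return [[sum(e[mu] * e[nu] for e in vecs) for nu in range(4)] for mu in range(4)]

def minus_projector(k_abs, lam):
    """-(g^{mu nu} - k^mu k^nu / lam^2), i.e. three times (120.4) with raised indices."""
    omega = math.sqrt(k_abs ** 2 + lam ** 2)
    k = (omega, 0.0, 0.0, k_abs)
    return [[-(METRIC[mu][nu] - k[mu] * k[nu] / lam ** 2) for nu in range(4)] for mu in range(4)]

lhs = polarization_sum(3.0, 0.5)
rhs = minus_projector(3.0, 0.5)
print(max(abs(lhs[mu][nu] - rhs[mu][nu]) for mu in range(4) for nu in range(4)))  # ~ 0
```

The longitudinal vector has components of order $|\mathbf k|/\lambda$, which is the origin of the $k_\mu k_\nu/\lambda^2$ term; for soft-photon emission this term drops out of the cross-section, as discussed in the text.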
The derivation of (98.5) and (98.6) can be applied to the present case, the only difference being that the term $k^2 = \lambda^2$ is added in expanding the squares $(p\pm k)^2$ in the denominators of the electron propagators. We thus have, instead of (98.6),

$$d\sigma = d\sigma_{\rm el}\cdot\alpha\left|\frac{p'e}{p'k+\frac12\lambda^2} - \frac{pe}{pk-\frac12\lambda^2}\right|^2\frac{d^3k}{4\pi^2\omega},$$

where $d\sigma_{\rm el}$ is the cross-section for the same process without emission of a soft quantum, which we shall conventionally call an "elastic process". In the integrations over $d^3k$, the important range is $|\mathbf k|\gg\lambda$. Then $p'k\sim pk\gg\lambda^2$, so that the terms in $\lambda^2$ in the denominators may be neglected. The summation over polarizations of the photon is carried out by means of (120.4), as already described. When the approximation stated is used, the second term in (120.4) makes no contribution to the cross-section, and there remains†

$$d\sigma = -\alpha\left(\frac{p'}{(p'k)} - \frac{p}{(pk)}\right)^2\frac{d^3k}{4\pi^2\omega}\,d\sigma_{\rm el}. \tag{120.6}$$

We thus recover formula (98.7), but $\omega$ must now be taken as

$$\omega = \sqrt{\mathbf k^2+\lambda^2}. \tag{120.7}$$

† There may be some doubt at first as to the validity of neglecting $\lambda^2$ before averaging, since it occurs in the denominator of the second term in (120.4), but we can easily show directly that this term gives, on averaging, a contribution $\sim\lambda^4\cdot 1/\lambda^2$, which is negligible.

Formula (120.6) is completely general: it is applicable to both elastic and inelastic scattering and even when the type of particle changes. The result of the further integration over $d^3k$ depends on the 4-vectors $p$ and $p'$, i.e. on the nature of the basic scattering process.

Let us take the case of elastic scattering, for which

$$|\mathbf p| = |\mathbf p'|,\qquad \varepsilon = \varepsilon',$$

and determine the total probability of photon emission with a frequency less than some $\omega_{\max}$, with the assumption that

$$\lambda\ll\omega_{\max} \tag{120.8}$$

and that $\omega_{\max}$ is subject to an upper limit governed by the conditions (98.9) and (98.10) for the theory of soft-photon emission to be valid.

We first calculate the integral over $d^3k$ in the non-relativistic limit.
For |p| = |p'| <^ m, (_VL \(p'k) P V-,(q;fc) 2 q2, ~mT^r mW' (pk)) where q = p' - p. Integration over the directions of k gives / k2 W A Then, from (120.6), da = dore, '3 /o [1~3(k^Ai)](k^P or, after integration with the assumption that a>max/A > 1, dcr = d<xel | f ^ ( l o g ^ - f ) , q 2 «m 2 . (120.9) In the general relativistic case, the integral is calculated by means of (131.4). The angle integral is then i 1 = J (pk)(p'k) = J dx J [(pk)x + (p'k)(l - x)]2 0 or, expanding the scalar products with p = (e, p), p' = (e, p'), i 7 = J dX J {e<o-k-[px + p'(l-x)]} 2 ' o The inner integral is now easily calculated in spherical coordinates with the polar axis along the vector px + p'(l - x), giving i J(ea>)2-[px + p'(l-x)] 2 k 2 0 1 - f Airdx " J [m2 + q 2 x(l-x)]k 2 + s2A2' o The other two integrals, with (pk)2 and (p'k)2 in the denominators, are derived from §120 533 Emission of Soft Photons with Non-zero Mass this by putting q = 0. Using also the formula pp' = e 2 - p - p ' = m2 + iq2, we get . _ 2e2 f . f k2d|k| f m2 + W m2 } dx 2 2 2 2 2 2 2 2 2 "V) J V(k +A )l[m + q x(l-x)]k +e A "m k +e 2 A 2 jo o d(T (120.10) The integration over d|k| calls for the calculation of integrals having the form w max r fc y\k\ J (ak2 + A2)V(k2 + A2) «max o „ a J V(k2 + A2) «"max a J (ak2 + A2)V(k2 + A2) 0 0 1 iA2fr)max 1 f dz a J (az 2 +l)V(z 2 +l)' a g A o In the second integral we have put Az for |k| and replaced the upper limit a>max/A by infinity; this is permissible, since the integral converges. The integrals over dx which then occur in (120.10) cannot be expressed entirely in terms of elementary functions. The result may be written in the form da = a [ F ( M ) log ^+Fi]d<rtu (120.11) wheret lQ g <* + V<£2 + *»" l]> F( j) = \ [^fjlx) (12012> _, 2e , e +1P1 ^=-7-7 log—-uy7T|p| m i 2mz.2 +. ~q22 f dx — ^ J ^Vâ^) l0g 0 l + V(l-q) Va ' m f t l „ (12013) a = A l m 2 + q 2 x(l-x)]. t The function F(£) has already occurred in §98, Problems. 
This is not surprising, since (120.11) can be derived with logarithmic accuracy by integrating the cross-section (98.8) for the emission of zero-mass photons over du> from A to o>mM. If £ is replaced by a variable 0 such that £ = sinh 50, then F(0) = (2M(0 coth 0 - 1). (120.12a) 534 §121 Radiative Corrections An asymptotic expression for the cross-section in the ultra-relativistic case can be obtained, assuming not only that e > m but also that |q| > m, i.e. that the scattering angle is not very small. Then the important range of values of x in the integral in (120.13) is that for which a < 1; the appropriate approximations give LTTE } a o Ä f log ( q V ) + log x -Hog (1 - x) J_ dx The integral is to be cut off at a ~ 1, i.e. at x~m 2 /q 2 at the lower limit and 1 - x ~ m2/q2 at the upper limit. Then F, - ±- [2 log ^ log -^ - log2 -^1 2TT I e m m} = ^ r i o g 2 - ^ - 4 1 oeg ^ l o g i > l . 2ir L m m m] This formula is valid as far as the squares of the logarithms (with doublelogarithmic accuracy). To the same accuracy, it is sufficient to put in the first term in (120.11) F(0~(4/7r)log£ (Ç>\). The final result is da = ^ [log - ^ log ^ p - log ^ log ^ + \ log2 - ^ 1 d<r<uM q2>m\ IT 1 * m \ m em 4 ° mj (120.14) § 121. Electron scattering in an external field in the second Born approximation In the first two approximations with respect to the external field, the scattering of an electron is represented by the diagrams |q 1 i f p- 1 1 [ / P' \ I fq,=f-p (121.1) r The first of these corresponds to the amplitude M (,) ~ Ze2 considered in §80. The amplitude of the second approximation is M<2) ~ (Ze2)2. The Second Born Approximation §121 535 It is easily seen that terms of the same order as this arise from the radiative corrections. In the third order of perturbation theory, the radiative corrections to the scattering amplitude are represented by the diagrams (121.2) Here M(3) ~ Ze2 • e\ and M(3) ~ M(2) (if Z ~ 1). 
According to (64.26), the scattering cross-section is da = |M}!> + Mff+ Mf\2 do'/16ir2. (121.3) In the squared amplitude we can retain not only |M}Pj2 but also the interference terms between M}P and M\f and between M}!* and Mft. Thus the cross-section is given, as far as the terms in e\ by the sum do- = d<riU + do-i2) + d<Tr*i, (121.4) where dcr{l) is the cross-section in the first Born approximation (§80), and the corrections are da(2) = 2 re M}J> Mff* do'/16ir 2 ,| From §80, d<rrad = 2 re M}!> M g* do7167r2. J Miï = \e\(û'y°umq), (12L5) (121.6) where <ï>(q) is a Fourier component of the scalar potential of the constant external field ( = Ao}), and we have used the fact that the electron charge e = -\e\. The two expressions (121.5) can evidently be calculated independently. The first will now be discussed, and the second in §122. The second-approximation amplitude is given by the diagram (121.1) as the intégrait Mf = - e2 j { f l l p V ^ ^ j o A(P)}*(P' - WH ~ P) 0p. The "4-momenta" <?i = / - p and q2 = p'-f (121.7) of the constant external field have no t Here it is necessary to apply the diagram-technique rule concerning a constant external field; see rule 8 in §77. §121 Radiative Corrections 536 time components. Hence / o = e = e\ (121.8) where e and e' are the initial and final electron energies, which in elastic scattering are the same. In the purely Coulomb field of a stationary charge Z\e\, 4>(q) = 47rZH/q 2 . For this potential the integral (121.7) is logarithmically divergent (when f — p and f~p'). This divergence is specific to the Coulomb field, and arises from the slowness with which the field decreases at large distances. Its origin is most easily shown for the non-relativistic case. According to QM (135.8), the coefficient of the spherical wave el|p|r/r in the asymptotic expression for the wave function of an electron in a Coulomb field is /(Ö) exp ( - î -H3p log |p| r j . 
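This coefficient is also the electron scattering amplitude in the field, and we see that its phase includes a term which diverges as $r \to \infty$. When the scattering amplitude is expanded in powers of $Z\alpha$, this term causes divergence of all the terms in the expansion from the second term onwards (since the function $f(\theta)$ is proportional to $Z\alpha$). In the relativistic case there is, of course, a similar situation.

These arguments also show that the divergent terms must cancel when we calculate the scattering cross-section, in which the phase of the amplitude is unimportant. The simplest procedure for a correct calculation is to consider first the scattering in a screened Coulomb field, putting

$$\Phi(\mathbf q) = \frac{4\pi Z|e|}{\mathbf q^2+\delta^2} \tag{121.9}$$

with a small screening constant $\delta \ll |\mathbf p|$. This eliminates the divergence in the scattering amplitude, and we can then put $\delta = 0$ in the final formula for the cross-section.

Substituting (121.9) in (121.7), we have

$$M^{(2)}_{fi} = -\frac{2}{\pi}Z^2\alpha^2\,\bar u(p')\left[(\gamma^0\varepsilon+m)J_1+\boldsymbol\gamma\cdot\mathbf J\right]u(p),$$

with the notation

$$J_1 = \int\frac{d^3f}{[(\mathbf p'-\mathbf f)^2+\delta^2][(\mathbf f-\mathbf p)^2+\delta^2][\mathbf p^2-\mathbf f^2+i0]},$$
$$\mathbf J = \int\frac{\mathbf f\,d^3f}{[(\mathbf p'-\mathbf f)^2+\delta^2][(\mathbf f-\mathbf p)^2+\delta^2][\mathbf p^2-\mathbf f^2+i0]} = \tfrac12(\mathbf p+\mathbf p')J_2; \tag{121.10}$$

here $\mathbf p^2 = \varepsilon^2-m^2 = \mathbf p'^2$. The integral is symmetrical in $\mathbf p$ and $\mathbf p'$; from considerations of vector symmetry, it is immediately obvious that the vector $\mathbf J$ must be parallel to $\mathbf p+\mathbf p'$. Now, eliminating the matrices $\boldsymbol\gamma$ by means of the equations

$$\boldsymbol\gamma\cdot\mathbf p\,u = (\gamma^0\varepsilon-m)u,\qquad \bar u'\,\boldsymbol\gamma\cdot\mathbf p' = \bar u'(\gamma^0\varepsilon-m),$$

we obtain

$$M^{(2)}_{fi} = -\frac{2}{\pi}Z^2\alpha^2\,\bar u(p')\left[\gamma^0\varepsilon(J_1+J_2)+m(J_1-J_2)\right]u(p). \tag{121.11}$$

In order to continue the calculations, we change (as in §80) from the bispinor amplitudes $u$ and $u'$ to the three-dimensional spinors $w$ and $w'$ which correspond to them in accordance with (23.9) and (23.11). A direct multiplication gives

$$\bar u'u = w'^*\{(\varepsilon+m)-(\varepsilon-m)\cos\theta+i\boldsymbol\nu\cdot\boldsymbol\sigma(\varepsilon-m)\sin\theta\}w,$$
$$\bar u'\gamma^0u = w'^*\{(\varepsilon+m)+(\varepsilon-m)\cos\theta-i\boldsymbol\nu\cdot\boldsymbol\sigma(\varepsilon-m)\sin\theta\}w,$$

where

$$\boldsymbol\nu = \mathbf n\times\mathbf n'/\sin\theta,\qquad \mathbf n = \mathbf p/|\mathbf p|,\qquad \mathbf n' = \mathbf p'/|\mathbf p'|,\qquad \cos\theta = \mathbf n\cdot\mathbf n'.$$

Then the amplitude (121.11) may be written†

$$M^{(2)}_{fi} = 4\pi w'^*(A^{(2)}+B^{(2)}\boldsymbol\nu\cdot\boldsymbol\sigma)w,$$
$$A^{(2)} = -\frac{1}{2\pi^2}Z^2\alpha^2\{[(\varepsilon+m)+(\varepsilon-m)\cos\theta]\,\varepsilon(J_1+J_2)+[(\varepsilon+m)-(\varepsilon-m)\cos\theta]\,m(J_1-J_2)\},$$
$$B^{(2)} = \frac{i}{2\pi^2}Z^2\alpha^2(\varepsilon-m)\sin\theta\,[\varepsilon(J_1+J_2)-m(J_1-J_2)]. \tag{121.12}$$

The first-approximation scattering amplitude is, in corresponding notation,

$$M^{(1)}_{fi} = 4\pi w'^*(A^{(1)}+B^{(1)}\boldsymbol\nu\cdot\boldsymbol\sigma)w,$$
$$A^{(1)} = \frac{Z\alpha}{\mathbf q^2}[(\varepsilon+m)+(\varepsilon-m)\cos\theta],\qquad B^{(1)} = -i\frac{Z\alpha}{\mathbf q^2}(\varepsilon-m)\sin\theta, \tag{121.13}$$

where $\mathbf q = \mathbf p'-\mathbf p$.

† The definition of $A$ and $B$ is as in §37 and in QM, §140, and differs by a factor from that in §80.

The scattering cross-section and the polarization effects are expressed in terms of the quantities $A = A^{(1)}+A^{(2)}$ and $B = B^{(1)}+B^{(2)}$ by the formulae derived in QM, §140. For example, the scattering cross-section for unpolarized electrons is

$$d\sigma = (|A|^2+|B|^2)\,do' \approx d\sigma^{(1)}+2\left(A^{(1)}\operatorname{re}A^{(2)}-iB^{(1)}\operatorname{im}B^{(2)}\right)do'.$$

Substitution of (121.12) and (121.13) and straightforward calculation gives

$$d\sigma^{(2)} = -do'\,\frac{Z^3\alpha^3\varepsilon^3}{\pi^2\mathbf p^2\sin^2\tfrac12\theta}\left[(1-v^2\sin^2\tfrac12\theta)\operatorname{re}(J_1+J_2)+\frac{m^2}{\varepsilon^2}\operatorname{re}(J_1-J_2)\right], \tag{121.14}$$

where $v = |\mathbf p|/\varepsilon$ is the electron velocity and $\theta$ the scattering angle.

The electrons are polarized by scattering, and the polarization vector of the final electrons is

$$\boldsymbol\zeta' = \frac{2\operatorname{re}(AB^*)}{|A|^2+|B|^2}\,\boldsymbol\nu \approx \frac{2\left(A^{(1)}\operatorname{re}B^{(2)}-iB^{(1)}\operatorname{im}A^{(2)}\right)}{|A^{(1)}|^2+|B^{(1)}|^2}\,\boldsymbol\nu,$$

or, substituting (121.12) and (121.13),

$$\boldsymbol\zeta' = \frac{4Z\alpha m|\mathbf p|^4}{\pi^2\varepsilon^2}\,\frac{\sin^3\tfrac12\theta\cos\tfrac12\theta}{1-v^2\sin^2\tfrac12\theta}\,\operatorname{im}(J_1-J_2)\,\boldsymbol\nu. \tag{121.15}$$

Let us now calculate the integrals $J_1$ and $J_2$. This is more easily done by using the parametrization method (131.2). The integral $J_1$ then becomes

$$J_1 = -2\int d^3f\int_0^1\!\!\int_0^1\!\!\int_0^1\frac{d\xi_1\,d\xi_2\,d\xi_3\;\delta(1-\xi_1-\xi_2-\xi_3)}{\{[(\mathbf p'-\mathbf f)^2+\delta^2]\xi_1+[(\mathbf p-\mathbf f)^2+\delta^2]\xi_2+[\mathbf f^2-\mathbf p^2-i0]\xi_3\}^3}.$$

The integration over $d\xi_3$ eliminates the delta function, and reduction of the denominator then gives

$$J_1 = -2\int d^3f\int_0^1 d\xi_1\int_0^{1-\xi_1}\frac{d\xi_2}{\{\mathbf f^2+\delta^2(\xi_1+\xi_2)+\mathbf p^2(2\xi_1+2\xi_2-1)-2\mathbf f\cdot(\xi_1\mathbf p'+\xi_2\mathbf p)-i0\}^3}.$$

Using in place of $\mathbf f$ a new variable $\mathbf k = \mathbf f-\xi_1\mathbf p'-\xi_2\mathbf p$, we can reduce the integration over $d^3f$ to the form

$$\int\frac{d^3k}{(\mathbf k^2-a^2-i0)^3} = -\frac{i\pi^2}{4a^3},$$

so that

$$J_1 = \frac{i\pi^2}{2}\int_0^1 d\xi_1\int_0^{1-\xi_1}\frac{d\xi_2}{\{\mathbf p^2(\xi_1^2+\xi_2^2-2\xi_1-2\xi_2+1)+2\xi_1\xi_2\,\mathbf p\cdot\mathbf p'-\delta^2(\xi_1+\xi_2)-i0\}^{3/2}}.$$

Instead of $\xi_1$ and $\xi_2$ we use the symmetrical combinations $x = \xi_1+\xi_2$, $y = \xi_1-\xi_2$. The integration over $dy$ from 0 to $x$ is elementary, and gives

$$J_1 = \frac{i\pi^2}{2|\mathbf p|^3}\int_0^1\frac{x\,dx}{[bx^2-2x+1-(\delta^2/\mathbf p^2)x-i0]\,[(1-x)^2-(\delta^2/\mathbf p^2)x-i0]^{1/2}},$$

where $b = (\mathbf p^2+\mathbf p\cdot\mathbf p')/2\mathbf p^2 = \cos^2\tfrac12\theta$.

To calculate the integral over $dx$ as $\delta\to0$, we divide the range of integration into two parts:

$$\int_0^1\ldots dx = \int_0^{1-\delta_1}\ldots dx+\int_{1-\delta_1}^1\ldots dx,\qquad 1\gg\delta_1\gg\delta/|\mathbf p|.$$

In the first integral we can put $\delta = 0$; then†

$$\int_0^{1-\delta_1}\ldots dx = -\frac{1}{2(1-b)}\left[\log\frac{bx^2-2x+1-i0}{(1-x)^2}\right]_0^{1-\delta_1} = -\frac{1}{2(1-b)}\left[\log\frac{1-b}{\delta_1^2}-i\pi\right].$$

In the second integral we can put $x = 1$ everywhere except in the term $(1-x)^2$, and $\delta = 0$ in the first bracket in the denominator. Then‡

$$\int_{1-\delta_1}^1\ldots dx = -\frac{1}{1-b}\int_0^{\delta_1}\frac{dx}{[x^2-(\delta^2/\mathbf p^2)-i0]^{1/2}} = -\frac{1}{1-b}\left\{\int_{\delta/|\mathbf p|}^{\delta_1}\frac{dx}{[x^2-(\delta^2/\mathbf p^2)]^{1/2}}+i\int_0^{\delta/|\mathbf p|}\frac{dx}{[(\delta^2/\mathbf p^2)-x^2]^{1/2}}\right\}.$$

† The term $i0$ arises from the rule for avoiding the singularity, which gives the change in the argument of the logarithm between 0 and $1-\delta_1$, namely from 0 to $-\pi$ as we pass below the branch point.

‡ Here again the singularity avoidance rule gives the sign of the square root as we go from positive to negative values of the radicand.

On adding the two integrals, $\delta_1$ disappears, as it should, leaving

$$J_1 = -\frac{i\pi^2}{2|\mathbf p|^3\sin^2\tfrac12\theta}\log\left(\frac{2|\mathbf p|}{\delta}\sin\tfrac12\theta\right). \tag{121.16}$$

The integral $J_2$ is calculated similarly:

$$J_2 = J_1-\frac{\pi^3(1-\sin\tfrac12\theta)}{4|\mathbf p|^3\cos^2\tfrac12\theta\,\sin\tfrac12\theta}-\frac{i\pi^2}{2|\mathbf p|^3\cos^2\tfrac12\theta}\log\sin\tfrac12\theta. \tag{121.17}$$

We now have only to substitute these expressions in (121.14) and (121.15), obtaining as the final results

$$d\sigma^{(2)} = \frac{\pi(Z\alpha)^3\varepsilon}{4|\mathbf p|^3\sin^3\tfrac12\theta}\,(1-\sin\tfrac12\theta)\,do', \tag{121.18}$$

$$\boldsymbol\zeta' = \frac{2Z\alpha m|\mathbf p|}{\varepsilon^2}\,\frac{\sin^3\tfrac12\theta\,\log\sin\tfrac12\theta}{(1-v^2\sin^2\tfrac12\theta)\cos\tfrac12\theta}\,\boldsymbol\nu \tag{121.19}$$

(W. A. McKinley and H. Feshbach, 1948; R. H. Dalitz, 1950).

In the first Born approximation, the electron and positron scattering cross-sections are the same (in the same external field); in the second approximation, this symmetry does not occur.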
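The size of the second Born effects can be illustrated numerically. The following is a minimal sketch, not part of the original text: it assumes that the ratio of the second-order term to the first Born (Mott) cross-section takes the McKinley-Feshbach form $\pi Z\alpha v\sin\tfrac12\theta(1-\sin\tfrac12\theta)/(1-v^2\sin^2\tfrac12\theta)$, and it evaluates the polarization (121.19) using $m|\mathbf p|/\varepsilon^2 = v\sqrt{1-v^2}$. The function names and the numerical value of $\alpha$ are illustrative choices.

```python
import math

ALPHA = 1 / 137.035999  # fine-structure constant (assumed numerical value)

def second_born_ratio(Z, v, theta):
    """Ratio d(sigma^(2)) / d(sigma^(1)) for an electron.

    v is the velocity in units of c, theta the scattering angle in radians.
    Assumes the McKinley-Feshbach form of the second-order correction.
    """
    s = math.sin(theta / 2)
    return math.pi * Z * ALPHA * v * s * (1 - s) / (1 - v**2 * s**2)

def polarization(Z, v, theta):
    """Component of the final polarization along nu = n x n'/sin(theta), as in (121.19)."""
    s = math.sin(theta / 2)
    c = math.cos(theta / 2)
    kinematic = v * math.sqrt(1 - v**2)  # equals m|p|/eps^2 for c = 1
    return 2 * Z * ALPHA * kinematic * s**3 * math.log(s) / ((1 - v**2 * s**2) * c)

if __name__ == "__main__":
    for Z in (+13, -13):  # Z -> -Z converts the electron formulae to positron ones
        print(Z, second_born_ratio(Z, 0.8, math.radians(90)),
              polarization(Z, 0.8, math.radians(90)))
```

The sign flip under $Z\to-Z$ in both quantities shows directly how the electron-positron symmetry of the first Born approximation is lost at the next order.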
In the scattering of a positron (charge $+|e|$) the amplitude of the first approximation (121.6) has the opposite sign, but the sign of $M^{(2)}_{fi}$ is unchanged. The cross-section $d\sigma^{(2)}$, which is the interference term between $M^{(1)}_{fi}$ and $M^{(2)}_{fi}$, therefore changes sign. The same occurs for the expression (121.19) for the polarization vector. The formulae for electron scattering are all converted to those for positron scattering by the formal change $Z\to-Z$.

§ 122. Radiative corrections to electron scattering in an external field

Let us now calculate the radiative corrections to electron scattering in an external field (J. Schwinger, 1949). The corresponding part of the scattering amplitude is represented by the two diagrams (121.2). The contribution from the first of these to the amplitude is

$$-(\bar u'\gamma^0u)\,\frac{1}{4\pi}\,\mathscr P(-\mathbf q^2)\,D(-\mathbf q^2)\,e\Phi(\mathbf q),$$

where $\mathscr P(-\mathbf q^2)$ is the polarization operator corresponding to the loop in the diagram. The contribution from the second diagram is

$$-(\bar u'\Lambda^0u)\,e\Phi(\mathbf q),$$

where $\Lambda^0$ is the correction term in the vertex operator ($\Gamma^\mu = \gamma^\mu+\Lambda^\mu$); according to (116.6),

$$\Lambda^0 = \gamma^0[f(-\mathbf q^2)-1]-\frac{1}{2m}\,g(-\mathbf q^2)\,\sigma^{0\nu}q_\nu.$$

Adding the two contributions, we obtain the amplitude in the form $-e(u'^*Q_{\rm rad}u)\Phi(\mathbf q)$, with†

$$Q_{\rm rad}(\mathbf q) = f(-\mathbf q^2)-1-\frac{1}{\mathbf q^2}\mathscr P(-\mathbf q^2)+\frac{1}{2m}\,g(-\mathbf q^2)\,\mathbf q\cdot\boldsymbol\gamma. \tag{122.1}$$

Let us first consider the infra-red divergence in the form factor $f(-\mathbf q^2)$ and therefore in the scattering amplitude (122.1). It has already been mentioned in §98 that the exact value of the purely elastic scattering amplitude is zero, i.e. it has no meaning. The only physically significant thing is the amplitude of scattering defined as a process in which any number of soft photons can be emitted, each having an energy less than a specified value $\omega_{\max}$ satisfying the conditions for soft-photon emission theory to be valid.
That is, only the sum

$$d\sigma = d\sigma_{el}+d\sigma_{el}\int_0^{\omega_{\max}}dw_\omega+d\sigma_{el}\cdot\frac{1}{2!}\int_0^{\omega_{\max}}dw_{\omega_1}\int_0^{\omega_{\max}}dw_{\omega_2}+\ldots \tag{122.2}$$

is meaningful, where $d\sigma_{el}$ is the cross-section for scattering without emission of photons, and $dw_\omega$ the differential probability for the emission by the electron of a photon with frequency $\omega$. Here it is assumed that $d\sigma_{el}$ itself is calculated as a perturbation-theory series, i.e. as an expansion in powers of $\alpha$.‡ Then, on bringing together the terms of each order in $\alpha$ in (122.2), we obtain $d\sigma$ as an expansion in powers of $\alpha$ in which each term is finite.

In the first Born approximation, $d\sigma_{el}\sim\alpha^2$. This term has, of course, an independent significance. If, however, the next correction ($\sim\alpha^3$) to $d\sigma_{el}$ is to be taken into account, we must also include the second term in the sum (122.2): since $dw_\omega\sim\alpha$, multiplication by $d\sigma_{el}\sim\alpha^2$ likewise gives a quantity $\sim\alpha^3$. We shall show that the infra-red divergence disappears when these two quantities are added.

The divergent term in the form factor $f$ (117.17) is§

$$-\tfrac12\alpha F(|\mathbf q|/2m)\log(m/\lambda).$$

The corresponding term in the amplitude (122.1) is

$$\tfrac12\alpha F\log(m/\lambda)\,(\bar u'\gamma^0u)\,e\Phi(\mathbf q),$$

and in the cross-section (121.5)

$$d\sigma_{\rm infra} = -\alpha F\log(m/\lambda)\,|\bar u'\gamma^0u|^2\,|e\Phi(\mathbf q)|^2\,do'/16\pi^2.$$

† Note that $q_\nu = (0,-\mathbf q)$ if $q^\nu = (0,\mathbf q)$, and therefore $\sigma^{0\nu}q_\nu = -\gamma^0\,\mathbf q\cdot\boldsymbol\gamma$.

‡ The need to take account of radiative corrections in the probability $dw_\omega$ is governed by the value of $\omega_{\max}$; the limit $\omega\to0$ corresponds to the classical case where the radiative corrections are zero, and so the latter can always be made small by taking a sufficiently small $\omega_{\max}$.

§ This expression is easily verified by using the relation $|\mathbf q|/m = (1-\xi)/\sqrt\xi$ between $|\mathbf q|$ and the variable $\xi$ in terms of which (117.17) is written.

Comparing this with the Born cross-section

$$d\sigma^{(1)} = |\bar u'\gamma^0u|^2\,|e\Phi(\mathbf q)|^2\,do'/16\pi^2,$$

we find that

$$d\sigma_{\rm infra} = -\alpha F\log(m/\lambda)\,d\sigma^{(1)}. \tag{122.3}$$

The second term in (122.2), with $\int dw_\omega$ from (120.11), gives

$$d\sigma_{el}\int_0^{\omega_{\max}}dw_\omega \approx \alpha F\log(2\omega_{\max}/\lambda)\,d\sigma^{(1)}. \tag{122.4}$$

Finally, adding (122.3) and (122.4), we obtain

$$-d\sigma^{(1)}\,\alpha F(|\mathbf q|/2m)\log(m/2\omega_{\max}). \tag{122.5}$$

We see that the divergent contribution from soft ($|\mathbf k|\sim\lambda$) virtual photons does in fact cancel with that from the emission of real photons of the same kind. A similar result occurs in any other scattering process.

There is also a dependence of the scattering cross-section on $\omega_{\max}$, resulting from the fact that $\omega_{\max}$ appears in the definition of scattering as a process in which any number of soft photons can be emitted. The cross-section for such a process will of course decrease with the upper frequency limit $\omega_{\max}$ for photons whose emission we regard as belonging to the scattering process in question.

Let us now determine the complete radiative correction to the scattering cross-section. Proceeding in accordance with the standard rules (see (65.7)), we find as the cross-section averaged over the polarizations of the initial electron and summed over the polarizations of the final electron

$$d\sigma = d\sigma^{(1)}+d\sigma_{\rm rad} = |e\Phi(\mathbf q)|^2\operatorname{tr}\{(\gamma p'+m)(\gamma^0+\gamma^0Q_{\rm rad})(\gamma p+m)(\gamma^0+\bar Q_{\rm rad}\gamma^0)\}\,do'/32\pi^2. \tag{122.6}$$

According to (122.1),

$$Q_{\rm rad} = a+b\,\boldsymbol\gamma\cdot\mathbf q,\qquad \bar Q_{\rm rad} = \gamma^0Q_{\rm rad}^{\dagger}\gamma^0 = a-b\,\boldsymbol\gamma\cdot\mathbf q,$$
$$a = f(-\mathbf q^2)-1-\frac{1}{\mathbf q^2}\mathscr P(-\mathbf q^2),\qquad b = \frac{1}{2m}\,g(-\mathbf q^2).$$

As far as the terms linear in $a$ and $b$, the trace in (122.6) is given by

$$\tfrac14\operatorname{tr}\{\ldots\} = 2(\varepsilon^2-\tfrac14\mathbf q^2)(1+2a)-2bm\mathbf q^2.$$

Hence

$$d\sigma_{\rm rad} = 2\left\{f_\lambda(-\mathbf q^2)-1-\frac{1}{\mathbf q^2}\mathscr P(-\mathbf q^2)-\frac{\mathbf q^2}{4\varepsilon^2-\mathbf q^2}\,g(-\mathbf q^2)\right\}d\sigma^{(1)}, \tag{122.7}$$

where $d\sigma^{(1)}$ is the Born cross-section (80.5) for the scattering of unpolarized electrons, and a subscript $\lambda$ is added to the form factor $f$ in order to show explicitly that it is cut off at photon mass $\lambda$.

We now have only to add to (122.7) the cross-section for the emission of soft photons. If we write $f_\lambda$ in the form

$$f_\lambda(-\mathbf q^2) = 1-\tfrac12\alpha F(|\mathbf q|/2m)\log(m/\lambda)+\alpha F_2, \tag{122.8}$$

then from (120.11) this addition simply means replacing $f_\lambda$ in (122.7) by

$$f_{\omega_{\max}} = 1-\tfrac12\alpha F(|\mathbf q|/2m)\log(m/2\omega_{\max})+\tfrac12\alpha F_1+\alpha F_2. \tag{122.9}$$

With this change, (122.7) gives the final answer.
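The cancellation of the fictitious photon mass between (122.3) and (122.4) is easy to check numerically. The sketch below is an illustration only: the value used for the product $\alpha F$ and the function names are hypothetical, and only the logarithmic terms quoted above are kept.

```python
import math

def virtual_part(alpha_F, m, lam):
    """Coefficient of d(sigma^(1)) in (122.3): soft virtual photons."""
    return -alpha_F * math.log(m / lam)

def real_part(alpha_F, omega_max, lam):
    """Coefficient of d(sigma^(1)) in (122.4): emitted soft photons (log term only)."""
    return alpha_F * math.log(2.0 * omega_max / lam)

def soft_sum(alpha_F, m, omega_max, lam):
    """Sum of the two contributions; should not depend on lam."""
    return virtual_part(alpha_F, m, lam) + real_part(alpha_F, omega_max, lam)

alpha_F = 0.01     # hypothetical value of alpha * F(|q|/2m)
m = 1.0            # electron mass sets the unit of energy
omega_max = 0.05   # assumed resolution, omega_max << m

values = [soft_sum(alpha_F, m, omega_max, lam) for lam in (1e-3, 1e-6, 1e-9)]
expected = -alpha_F * math.log(m / (2.0 * omega_max))   # coefficient in (122.5)

if __name__ == "__main__":
    print(values, expected)
```

Each separate term grows without bound as `lam` decreases, while their sum stays fixed at the coefficient appearing in (122.5).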
In the non-relativistic limit we have†

$$f_{\omega_{\max}}-1 = -\frac{\alpha\mathbf q^2}{3\pi m^2}\left(\log\frac{m}{2\omega_{\max}}+\frac{11}{24}\right). \tag{122.10}$$

† This differs from the non-relativistic formula (117.20) by the change $\log\lambda\to\log2\omega_{\max}-\tfrac56$.

The particular form of the external field appears in the radiative correction to the cross-section only through $d\sigma^{(1)}$; the factor in the braces in (122.7) is universal. In the non-relativistic approximation,

$$d\sigma_{\rm rad} = -d\sigma^{(1)}\,\frac{2\alpha\mathbf q^2}{3\pi m^2}\left(\log\frac{m}{2\omega_{\max}}+\frac{19}{30}\right),\qquad \mathbf q^2\ll m^2, \tag{122.11}$$

which includes contributions from all the terms in (122.7). In the opposite (ultra-relativistic) limit, the main contribution comes only from the term in $f_{\omega_{\max}}-1$:

$$d\sigma_{\rm rad} = -d\sigma^{(1)}\,\frac{2\alpha}{\pi}\log\frac{\mathbf q^2}{m^2}\,\log\frac{\varepsilon}{\omega_{\max}},\qquad \mathbf q^2\gg m^2. \tag{122.12}$$

Finally, it may be noted that the radiative corrections considered here do not cause any additional polarization effects that are not present in the first Born approximation (unlike the corrections of the second Born approximation, discussed in §121). The reason is that the particular features of the first Born approximation are ultimately due to the fact that the S-matrix is Hermitian. This property is maintained even when the radiative corrections described above are taken into account, since in this approximation there are no real intermediate states in the scattering channel (and so the right-hand side of the unitarity relation is zero).†

§ 123. Radiative shift of atomic levels

The radiative corrections cause a shift of the energy levels of bound states of an electron in an external field, called the Lamb shift. The most interesting case of this kind is that of a hydrogen atom (or hydrogen-like ion).‡

A consistent method of finding the energy-level corrections is based on the use of the exact electron propagator in an external field (§109). But, if

$$Z\alpha\ll1, \tag{123.1}$$

it is possible to use a simpler procedure in which the external field is regarded as a perturbation.
In the first approximation with respect to the external field, the radiative correction in the interaction between an electron and a constant electric field is described by the two diagrams (121.2) already used in connection with the problem of electron scattering in such a field; the change from one problem to the other needs no more than a simple reformulation (see below).

However, it is easily seen that this treatment can give only the part of the level shift that is due to the interaction with virtual photons of sufficiently high frequency. Let us consider, for example, the next radiative correction (as regards order with respect to the external field) to the electron scattering amplitude:

[Diagram (123.2): vertex correction with two external-field vertices] (123.2)

(unlike (121.2b), this diagram contains two external-field vertices). In the range of integration over $d^4k$ where $k_0$ is sufficiently large, this correction involves an extra power of $Z\alpha$, and is therefore unimportant. But the addition of a second external-field vertex to the diagram also brings in a further electron propagator $G(f)$. When $\mathbf k$ is small, and the free ends $p$ and $p'$ are non-relativistic, the important values of the virtual-electron momenta $f$ are those close to the pole of the propagator $G(f)$. The small denominator which thus occurs cancels the extra small factor $Z\alpha$. The same evidently applies to the corrections of all orders with respect to the external field. Thus, at low frequencies of the virtual photons, the external field must be taken into account exactly.

† The calculation of the radiative corrections for processes which appear only in the second approximation of perturbation theory is considerably more laborious, and will not be given here. We shall simply list some references: L. M. Brown and R. P. Feynman, Physical Review 85, 231, 1952 (radiative corrections to photon scattering by an electron); I. Harris and L. M. Brown, ibid. 105, 1656, 1957 (r.c. to two-photon pair annihilation); M. L. G. Redhead, Proceedings of the Royal Society A220, 219, 1953, and R. V. Polovin, Soviet Physics JETP 4, 385, 1957 (r.c. to electron scattering by an electron or a positron); P. I. Fomin, ibid. 8, 491, 1959 (r.c. to bremsstrahlung).

‡ The shift of the hydrogen levels was first calculated by H. A. Bethe (1947) with logarithmic accuracy, using a non-relativistic treatment; this work provided the initial stimulus for the whole subsequent development of quantum electrodynamics. The difference between the $2s_{1/2}$ and $2p_{1/2}$ levels (in the first non-vanishing approximation of perturbation theory) was exactly calculated by N. M. Kroll and W. E. Lamb (1949); the complete formula for the level shift is due to V. F. Weisskopf and J. B. French (1949).

We can divide the required level shift† $\delta E_s$ into two parts:

$$\delta E_s = \delta E_s^{(\rm I)}+\delta E_s^{(\rm II)}, \tag{123.3}$$

which originate from the interaction with virtual photons having frequencies in the ranges (I) $k_0>\kappa$ and (II) $k_0<\kappa$; $\kappa$ is chosen so that

$$(Z\alpha)^2m\ll\kappa\ll m, \tag{123.4}$$

where $(Z\alpha)^2m$ is of the same order as the binding energy of the electron in the atom. Then, in region I, it is sufficient to take account of the nuclear field in the first approximation. In region II, the nuclear field must be treated exactly, but on the other hand, since $\kappa\ll m$, we can solve the problem in the non-relativistic approximation, not only as regards the electron itself but for all the intermediate states. With the condition (123.4), the ranges of validity of the two methods of calculation overlap, and it is therefore possible to make an exact "joining" of the two parts of the level correction.

THE HIGH-FREQUENCY PART OF THE SHIFT

Let us first consider region I. Here it is possible to use the correction (122.1) to the scattering amplitude, after removing the contribution of the virtual photons which pertain to region II. These make only a small contribution to the form factor $g$, which therefore can be left unaltered.
The low-frequency virtual photons make a large contribution to $f$, however, because of the infra-red divergence. Thus $f$ in (122.1) must be taken as a function $f_\kappa$ from which the region $k_0<\kappa$ has been excluded. This could be done directly by subtracting from $f$ the integral over the region $k_0<\kappa$, but the required result can be obtained without fresh calculations by using the results of §122.

To do so, we note that the exclusion of frequencies $k_0<\kappa$ can be regarded as one possible type of infra-red cut-off. The result for the correction to the scattering cross-section must, of course, be independent of the cut-off used, provided that the real soft photon emission probability is cut off in the same way, i.e. the concept of "elastic" scattering includes the emission only of photons with frequencies from $\kappa$ to the specified $\omega_{\max}$. If we take $\omega_{\max} = \kappa$, there is no need to take explicit account of the photon emission. Hence we see that $f_\kappa$ is obtained from the $f_{\omega_{\max}}$ determined in §122 by simply replacing $\omega_{\max}$ by $\kappa$. In particular, in the non-relativistic case

$$f_\kappa-1 = -\frac{\alpha\mathbf q^2}{3\pi m^2}\left(\log\frac{m}{2\kappa}+\frac{11}{24}\right). \tag{123.5}$$

† In this section, $E_s$ denotes the energy of an electron in an atom, not including its rest energy. The suffix $s$ stands for all the quantum numbers which define the state of the atom.

Let us now transform the correction (122.1) to the scattering amplitude by representing it as the result of a corresponding correction to the effective potential energy of the electron in the field. Comparing the amplitude (122.1), $-e(u'^*Q_{\rm rad}u)\Phi$, with the Born scattering amplitude (121.6), $-e(u'^*u)\Phi$, we see that the correction is given (in the momentum representation) by the function

$$e\,\delta\Phi(\mathbf q) = eQ_{\rm rad}(\mathbf q)\Phi(\mathbf q). \tag{123.6}$$

In the non-relativistic case, taking $\mathscr P$ and $g$ from (113.14) and (117.20), and substituting $f_\kappa$ from (123.5) for $f$, we get

$$\delta\Phi(\mathbf q) = \left[-\frac{\alpha\mathbf q^2}{3\pi m^2}\left(\log\frac{m}{2\kappa}+\frac{11}{24}-\frac15\right)+\frac{\alpha}{4\pi m}\,\mathbf q\cdot\boldsymbol\gamma\right]\Phi(\mathbf q). \tag{123.7}$$

The corresponding function $\delta\Phi(\mathbf r)$ in the coordinate representation is†

$$\delta\Phi(\mathbf r) = \left[\frac{\alpha}{3\pi m^2}\left(\log\frac{m}{2\kappa}+\frac{11}{24}-\frac15\right)\Delta-i\frac{\alpha}{4\pi m}\,\boldsymbol\gamma\cdot\nabla\right]\Phi(\mathbf r). \tag{123.8}$$

The level shift $\delta E_s^{(\rm I)}$ is found by averaging $e\,\delta\Phi(\mathbf r)$ over the wave function of the unperturbed state of the electron in the atom, i.e. as the corresponding diagonal matrix element:‡

$$\delta E_s^{(\rm I)} = \frac{e^3}{3\pi m^2}\left(\log\frac{m}{2\kappa}+\frac{11}{24}-\frac15\right)\langle s|\Delta\Phi|s\rangle-i\frac{e^3}{4\pi m}\langle s|\boldsymbol\gamma\cdot\nabla\Phi|s\rangle. \tag{123.9}$$

† Note that this correction to the potential is not the same as the one discussed in §114, which included only the effect of the vacuum polarization (diagram (121.2a)) on the Coulomb field as such. The correction (123.8) relates to the interaction of the field with the electron, and includes also the effect of a change in the motion of the electron (diagram (121.2b)).

‡ Strictly speaking, the form factors determined in §117 related to the vertex operator with two external electron lines ($p^2 = p'^2 = m^2$). For an electron in an atom, the energy $E_s$ is a level which is unrelated to $\mathbf p$. The distinction may, however, be neglected in region I.

In the first term, the non-relativistic electron function suffices for the averaging. In the second term, this approximation is insufficient: the zero-order approximation with respect to the non-relativistic functions is zero on account of the absence of diagonal elements in the matrices $\boldsymbol\gamma$. Here, therefore, we must use the approximate relativistic function

$$\psi = \begin{pmatrix}\phi\\ \chi\end{pmatrix}$$

derived in §33, retaining the components $\chi$ which are small (in the standard representation). We have

$$\langle s|\boldsymbol\gamma\cdot\nabla\Phi|s\rangle = \int\psi^*\,\boldsymbol\gamma\cdot\nabla\Phi\,\psi\,d^3x = \int\{\phi^*(\boldsymbol\sigma\cdot\nabla\Phi)\chi-\chi^*(\boldsymbol\sigma\cdot\nabla\Phi)\phi\}\,d^3x$$

and, substituting from (33.4)

$$\chi = \frac{1}{2m}\,\boldsymbol\sigma\cdot\mathbf p\,\phi = -\frac{i}{2m}\,\boldsymbol\sigma\cdot\nabla\phi,$$

we get, using the identity (33.5) and integrating by parts,

$$\langle s|\boldsymbol\gamma\cdot\nabla\Phi|s\rangle = -\frac{i}{2m}\int\{\phi^*(\boldsymbol\sigma\cdot\nabla\Phi)(\boldsymbol\sigma\cdot\nabla\phi)+(\nabla\phi^*\cdot\boldsymbol\sigma)(\boldsymbol\sigma\cdot\nabla\Phi)\phi\}\,d^3x = \frac{i}{2m}\int\{\phi^*\,\Delta\Phi\,\phi-2i\,\phi^*\boldsymbol\sigma\cdot[\nabla\Phi\times\nabla\phi]\}\,d^3x.$$

Since $\Phi = \Phi(r)$,

$$\nabla\Phi = \frac{\mathbf r}{r}\frac{d\Phi}{dr},$$

and hence

$$-i\boldsymbol\sigma\cdot[\nabla\Phi\times\nabla] = \frac1r\frac{d\Phi}{dr}\,\boldsymbol\sigma\cdot\mathbf l,$$

where $\mathbf l = -i\,\mathbf r\times\nabla$ is the orbital angular momentum operator. Finally, bringing together the expressions obtained and substituting in (123.9), we have

$$\delta E_s^{(\rm I)} = \frac{e^3}{3\pi m^2}\left(\log\frac{m}{2\kappa}+\frac{19}{30}\right)\langle s|\Delta\Phi|s\rangle+\frac{e^3}{4\pi m^2}\left\langle s\left|\boldsymbol\sigma\cdot\mathbf l\,\frac1r\frac{d\Phi}{dr}\right|s\right\rangle, \tag{123.10}$$

in which the averaging is over the non-relativistic wave function in both terms.

THE LOW-FREQUENCY PART OF THE SHIFT

In order to calculate the second part of the level shift, we use a technique based ultimately on the unitarity condition.

Since a photon can be emitted, the excited state of the atom is not strictly stationary, but only quasi-stationary. A complex energy value can be assigned to such a state, its imaginary part being $-\tfrac12w$ if $w$ is the decay probability of the state, in this case the total photon emission probability (see QM, §134). In the non-relativistic approximation there is dipole radiation, and from (45.7)

$$\operatorname{im}\delta E_s = -\tfrac12w_s = -\tfrac23\sum_{s'}|\mathbf d_{ss'}|^2(E_s-E_{s'})^3,$$

where the summation is over all the lower levels ($E_{s'}<E_s$), or, equivalently,

$$\operatorname{im}\delta E_s = -\tfrac23\int_0^\infty d\omega\sum_{s'}|\mathbf d_{ss'}|^2(E_s-E_{s'})^3\,\delta(E_s-E_{s'}-\omega). \tag{123.11}$$

In order to find the real part of $\delta E_s$, we must regard $E_s$ as a complex variable and use analytical continuation. This may be done by treating the delta functions as originating from poles. The rule for the avoidance of poles is, as usual, specified by adding a negative imaginary part to the masses of the virtual particles; in this case, to the masses $m_{s'}$ of the electron in the intermediate states of the atom. These are $m_{s'} = m+E_{s'}$, and so we must put $E_{s'}\to E_{s'}-i0$, whence

$$\delta(E_s-E_{s'}-\omega) = -\frac1\pi\operatorname{im}\frac{1}{E_s-E_{s'}-\omega+i0}; \tag{123.12}$$

cf. (111.3).
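The representation of the delta function by the imaginary part of a pole term, as in (123.12), can be checked numerically by replacing $i0$ with a small finite $i\eta$. The sketch below is illustrative only: the width `eta`, the integration range and the test function are arbitrary choices, and a plain midpoint rule is used for the integrals.

```python
import math

def lorentzian_delta(x, eta):
    """-(1/pi) * Im[1/(x + i*eta)]; tends to delta(x) as eta -> 0+."""
    return -(1.0 / complex(x, eta)).imag / math.pi

def midpoint_integral(func, a, b, n=200_000):
    """Midpoint-rule integral of func over [a, b] with n equal steps."""
    h = (b - a) / n
    return h * sum(func(a + (k + 0.5) * h) for k in range(n))

eta = 0.01   # small positive width standing in for +i0
x0 = 0.3     # position of the pole

# Normalisation: the integral of the approximate delta function is close to 1.
norm = midpoint_integral(lambda x: lorentzian_delta(x, eta), -50.0, 50.0)

# Sifting property: integrating against cos(x) picks out cos(x0).
sifted = midpoint_integral(
    lambda x: math.cos(x) * lorentzian_delta(x - x0, eta), -50.0, 50.0)

if __name__ == "__main__":
    print(norm, sifted, math.cos(x0))
```

With the opposite sign of the imaginary part the same expression would approximate $-\delta(x)$, which is why the direction of the displacement $E_{s'}\to E_{s'}-i0$ fixes the sign of the width.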
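Thus, substituting (123.12) in (123.11), we find

$$\operatorname{im}\delta E_s = \operatorname{im}\frac{2}{3\pi}\int_0^\infty d\omega\sum_{s'}\frac{|\mathbf d_{ss'}|^2(E_s-E_{s'})^3}{E_s-E_{s'}-\omega+i0}.$$

The required analytical continuation is now obtained by simply omitting the symbol im, but we must take from $\delta E_s$ only the part due to the contribution from frequencies in region II ($\omega<\kappa$). To do so, we need only replace the upper limit of integration by $\kappa$. The result of the integration is

$$\delta E_s^{(\rm II)} = \frac{2}{3\pi}\sum_{s'}|\mathbf d_{ss'}|^2(E_{s'}-E_s)^3\log\frac{\kappa}{E_{s'}-E_s-i0}; \tag{123.13}$$

because of the inequality (123.4), the difference $E_s-E_{s'}$ is neglected in comparison with $\kappa$ at the upper limit. We shall henceforward be concerned only with the real part of the level, which is obtained by using $\kappa/|E_{s'}-E_s|$ as the argument of the logarithm in (123.13).

In the expression (123.13), the term in $\log\kappa$ can be transformed by replacing the matrix elements of the dipole moment $\mathbf d = e\mathbf r$ by those of the momentum $\mathbf p = m\mathbf v$ and its derivative $\dot{\mathbf p}$:

$$\sum_{s'}|\mathbf d_{ss'}|^2(E_{s'}-E_s)^3 = \frac{e^2}{m^2}\sum_{s'}|\mathbf p_{ss'}|^2(E_{s'}-E_s) = \frac{ie^2}{2m^2}\sum_{s'}\{\dot{\mathbf p}_{ss'}\cdot\mathbf p_{s's}-\mathbf p_{ss'}\cdot\dot{\mathbf p}_{s's}\}.$$

Now replacing $\dot{\mathbf p}$ in accordance with the operator equation of motion of the electron, $\dot{\mathbf p} = -e\nabla\Phi$, we get

$$\sum_{s'}|\mathbf d_{ss'}|^2(E_{s'}-E_s)^3 = -\frac{ie^3}{2m^2}\sum_{s'}\{(\nabla\Phi)_{ss'}\cdot\mathbf p_{s's}-\mathbf p_{ss'}\cdot(\nabla\Phi)_{s's}\} = \frac{ie^3}{2m^2}\langle s|\mathbf p\cdot\nabla\Phi-\nabla\Phi\cdot\mathbf p|s\rangle = \frac{e^3}{2m^2}\langle s|\Delta\Phi|s\rangle. \tag{123.14}$$

We can therefore write instead of (123.13)

$$\delta E_s^{(\rm II)} = \frac{e^3}{3\pi m^2}\log\frac{2\kappa}{m}\,\langle s|\Delta\Phi|s\rangle+\frac{2}{3\pi}\sum_{s'}|\mathbf d_{ss'}|^2(E_{s'}-E_s)^3\log\frac{m}{2|E_s-E_{s'}|}. \tag{123.15}$$

THE TOTAL SHIFT

Finally, adding the two parts, we have the following formula for the level shift:

$$\delta E_s = \frac{2e^2}{3\pi}\sum_{s'}|\mathbf r_{ss'}|^2(E_{s'}-E_s)^3\log\frac{m}{2|E_s-E_{s'}|}+\frac{e^3}{3\pi m^2}\cdot\frac{19}{30}\,\langle s|\Delta\Phi|s\rangle+\frac{e^3}{4\pi m^2}\left\langle s\left|\boldsymbol\sigma\cdot\mathbf l\,\frac1r\frac{d\Phi}{dr}\right|s\right\rangle; \tag{123.16}$$

as was to be expected, the auxiliary quantity $\kappa$ does not appear.† All the matrix elements in (123.16) are taken with respect to the non-relativistic wave functions of the electron in the atom.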
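The joining of the two regions can be checked with a few lines of arithmetic. The sketch below is illustrative and rests on two stated assumptions: the constants multiplying $\langle s|\Delta\Phi|s\rangle$ in region I are taken as $11/24$ (form factor $f_\kappa$), $-1/5$ (vacuum polarization) and $3/8$ (the term in $g$), and the region II coefficient is taken as $\log(2\kappa/m)$; all coefficients are in units of $e^3/3\pi m^2$. The variable and function names are hypothetical.

```python
from fractions import Fraction
import math

# Constant terms multiplying <s|Delta Phi|s>, in units of e^3/(3 pi m^2).
FROM_F_KAPPA = Fraction(11, 24)        # vertex form factor f_kappa
FROM_VACUUM_POL = Fraction(-1, 5)      # polarization operator
FROM_G_TERM = Fraction(3, 8)           # anomalous-moment (g) term after using chi
HIGH_FREQ_CONSTANT = FROM_F_KAPPA + FROM_VACUUM_POL + FROM_G_TERM

def delta_phi_coefficient(m, kappa):
    """Total coefficient of <s|Delta Phi|s>: region I plus region II."""
    high = math.log(m / (2.0 * kappa)) + float(HIGH_FREQ_CONSTANT)  # region I
    low = math.log(2.0 * kappa / m)                                 # region II
    return high + low

if __name__ == "__main__":
    print(HIGH_FREQ_CONSTANT)
    for kappa in (1e-4, 1e-3, 1e-2):   # any kappa in the window (Z alpha)^2 m << kappa << m
        print(kappa, delta_phi_coefficient(1.0, kappa))
```

The sum of the two logarithms vanishes identically, so the total coefficient is the same constant for every choice of the separation parameter.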
For a hydrogen atom or a hydrogen-like ion, these functions depend only on three quantum numbers: the principal quantum number $n$, the orbital angular momentum $l$ and its component $m$, but not on the total angular momentum $j$; the corresponding energy levels depend on $n$ only. We shall use the notation‡

$$L_{nl} = \frac{n^3}{2m(Ze^2)^4}\sum_{n'l'm'}|\langle n'l'm'|\mathbf r|nlm\rangle|^2(E_{n'}-E_n)^3\log\frac{m(Ze^2)^2}{2|E_{n'}-E_n|}. \tag{123.17}$$

† The determination of the next-order corrections in the level shift involves very complicated calculations. The most complete tabulation and a systematic derivation of the corrections, together with further references, are given by G. W. Erickson and D. R. Yennie, Annals of Physics 35, 271, 447, 1965.

‡ The matrix elements of $\mathbf r$ are diagonal in $j$ and independent of $j$; the summation over $s'$ in (123.16) therefore reduces to summation over $n'$, $l'$ and $m'$. Because of the isotropy of space, the sum (123.17) is also, of course, independent of $m$.

The energy levels are proportional to $(Ze^2)^2$, and the characteristic dimension of the atom is inversely proportional to $Ze^2$, so that the $L_{nl}$ defined by (123.17) are independent of $Z$. They can be calculated numerically.

We shall take separately the cases $l = 0$ and $l\ne0$.

When $l = 0$, the last term in (123.16) is zero. In the second term, we use the equation

$$e\Delta\Phi = 4\pi Ze^2\delta(\mathbf r),$$

which is satisfied by the potential of the Coulomb field of the nucleus. Hence

$$e\langle nlm|\Delta\Phi|nlm\rangle = 4\pi Ze^2|\psi_{nlm}(0)|^2 = \begin{cases}4m^3(Ze^2)^4/n^3 & (l = 0),\\ 0 & (l\ne0)\end{cases}$$

(cf. (34.3)). In the first term, with the notation (123.17) and again using (123.14),

$$\sum_{n'l'm'}|\langle n'l'm'|\mathbf r|n00\rangle|^2(E_{n'}-E_n)^3 = \frac{e}{2m^2}\langle n00|\Delta\Phi|n00\rangle = 2m(Ze^2)^4/n^3.$$

This gives the following expression for the shift of the $s$ terms (in ordinary units):

$$\delta E_{n0} = \frac{4mc^2Z^4\alpha^5}{3\pi n^3}\left[\log\frac{1}{(Z\alpha)^2}+L_{n0}+\frac{19}{30}\right]. \tag{123.18}$$

The numerical values of some of the $L_{n0}$ are:

n        1        2        3        4        ∞
L_{n0}   -2.984   -2.812   -2.768   -2.750   -2.721

The unperturbed levels are $E_n = -mc^2(Z\alpha)^2/2n^2$, and so the relative magnitude of the radiative shift is

$$|\delta E_{n0}/E_n|\sim Z^2\alpha^3\log(1/Z\alpha). \tag{123.19}$$

When $l\ne0$, the second term in (123.16) is zero. The third term can be calculated by means of the formulae in §34, and leads to a dependence of the level shift on the number $j$ also. The result is

$$\delta E_{nlj} = \frac{4mc^2Z^4\alpha^5}{3\pi n^3}\left[L_{nl}+\frac38\,\frac{j(j+1)-l(l+1)-\tfrac34}{l(l+1)(2l+1)}\right],\qquad l\ne0. \tag{123.20}$$

Thus the radiative shift removes the last degeneracy which remains after the spin-orbit interaction has been taken into account, namely the degeneracy of levels having the same $n$ and $j$ but different $l = j\pm\tfrac12$. For example, the numerical value of $L_{21}$ is $+0.030$, and formulae (123.18)-(123.20) give as the difference of the $2s_{1/2}$ and $2p_{1/2}$ levels of the hydrogen atom

$$E_{20(1/2)}-E_{21(1/2)} = 0.41\,mc^2\alpha^5,$$

corresponding to a frequency of 1050 MHz.

§ 124. Radiative shift of mesic-atom levels

At the end of §118 the electron vacuum polarization has been shown to play an important role in the (second-approximation) radiative correction to the magnetic moment of the muon. This is still more true (and even in the first approximation) as regards the radiative level shift in μ-mesic hydrogen, a hydrogen-like system consisting of a proton and a muon (A. D. Galanin and I. Ya. Pomeranchuk, 1952).

In calculating the level shift for an ordinary atom in §123, we took account, in particular, of the electron vacuum polarization effect (the electron loop in the diagram (121.2a)). If the muon vacuum polarization effect is similarly treated in the mesic atom, the entire calculation can be applied to this case, simply replacing the electron mass $m = m_e$ by the muon mass $m_\mu$. Since the relative shift (123.19) of the levels does not depend on the electron mass, the same result is obtained for mesic hydrogen.
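The numbers quoted at the end of §123, and the statement that the relative shift does not involve the particle mass, can be reproduced with a short calculation. The sketch below is illustrative: it assumes the s-level shift in the form $(4mc^2Z^4\alpha^5/3\pi n^3)[\log(1/(Z\alpha)^2)+L_{n0}+19/30]$ and the $l\ne0$ shift with the bracket $L_{nl}+\tfrac38[j(j+1)-l(l+1)-\tfrac34]/[l(l+1)(2l+1)]$; the numerical constants ($\alpha$, $mc^2$, $h$) are assumed standard values, and the function names are hypothetical.

```python
import math

ALPHA = 1 / 137.035999        # fine-structure constant (assumed value)
MC2_EV = 0.51099895e6         # electron rest energy in eV (assumed value)
H_EV_S = 4.135667696e-15      # Planck constant in eV*s (assumed value)

# Values of L_nl quoted in the text.
L = {(1, 0): -2.984, (2, 0): -2.812, (3, 0): -2.768, (4, 0): -2.750, (2, 1): 0.030}

def prefactor(n, Z=1):
    """4 Z^4 / (3 pi n^3), i.e. the shift in units of m c^2 alpha^5."""
    return 4 * Z**4 / (3 * math.pi * n**3)

def bracket_s(n, Z=1):
    """Square bracket for s levels (l = 0)."""
    return math.log(1 / (Z * ALPHA) ** 2) + L[(n, 0)] + 19 / 30

def bracket_l(n, l, j):
    """Square bracket for levels with l != 0."""
    spin_orbit = (j * (j + 1) - l * (l + 1) - 0.75) / (l * (l + 1) * (2 * l + 1))
    return L[(n, l)] + 0.375 * spin_orbit

def relative_shift_s(n, Z=1):
    """delta E_n0 / |E_n| with |E_n| = m c^2 (Z alpha)^2 / 2 n^2; the mass cancels."""
    return prefactor(n, Z) * bracket_s(n, Z) * ALPHA**5 / ((Z * ALPHA) ** 2 / (2 * n**2))

# 2s(1/2) - 2p(1/2) splitting in units of m c^2 alpha^5, and as a frequency.
split = prefactor(2) * (bracket_s(2) - bracket_l(2, 1, 0.5))
freq_mhz = split * MC2_EV * ALPHA**5 / H_EV_S / 1e6

if __name__ == "__main__":
    print(split, freq_mhz, relative_shift_s(1))
```

Because `relative_shift_s` contains no mass at all, replacing the electron by a muon (with a muon loop) leaves it unchanged, which is the point made in the preceding paragraph.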
It is easily seen that the electron vacuum polarization has a much stronger effect on the level shift in the mesic atom, because the replacement of the muon loop in the diagram by an electron loop implies the replacement of the muon polarization operator by the electron polarization operator; and the polarization operator $\mathscr P(q^2)$ is inversely proportional to the square of the particle mass for non-relativistic values of $q^2$. Hence the change mentioned must increase the effect by a factor $(m_\mu/m_e)^2$, and it is this contribution which determines the order of magnitude of the level shift:

$$\delta E/|E|\sim\alpha^3(m_\mu/m_e)^2,$$

or four orders of magnitude greater than in ordinary hydrogen.†

† For a similar reason, the contribution of the muon vacuum polarization to the level shift in the ordinary hydrogen atom is, conversely, negligible.

The origin of this effect can be more clearly seen by noting that the distortion of the Coulomb potential by the electron vacuum polarization extends to distances $\sim1/m_e$ (§114). In the ordinary hydrogen atom the electron is at distances from the nucleus that are of the order of $1/m_e\alpha$, i.e. outside the main region of field distortion, but in mesic hydrogen the muon is at distances $\sim1/m_\mu\alpha$ which are in this region.

To calculate precisely the level shift in the mesic atom, however, it is not possible to use the approximate non-relativistic expression for the polarization operator, as was done when using (123.7) to find the level shift in the ordinary atom. The reason is that the characteristic momenta of the muon in the mesic hydrogen atom are $|\mathbf p_\mu|\sim\alpha m_\mu$. For the muon these momenta are non-relativistic, but for the electron they are relativistic. We must therefore use the full relativistic formula (114.5) for the effective potential of the nuclear field as modified by the electron vacuum polarization.
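The order-of-magnitude statements above are simple to put into numbers. The sketch below is illustrative only: it assumes the estimate $\alpha^3(m_\mu/m_e)^2$ for mesic hydrogen, compares it with the ordinary-hydrogen estimate $\alpha^3\log(1/\alpha)$ from (123.19) with $Z = 1$, and expresses the relevant distances in units of $1/m_e$. The mass ratio and $\alpha$ are assumed standard values, and the variable names are hypothetical.

```python
import math

ALPHA = 1 / 137.035999     # fine-structure constant (assumed value)
MASS_RATIO = 206.768       # m_mu / m_e (assumed value)

# Relative level shifts, order of magnitude only.
mesic_estimate = ALPHA**3 * MASS_RATIO**2
ordinary_estimate = ALPHA**3 * math.log(1 / ALPHA)
enhancement = mesic_estimate / ordinary_estimate

# Characteristic distances in units of 1/m_e.
r_distortion = 1.0                      # range of the vacuum-polarization distortion
r_electron = 1 / ALPHA                  # electron orbit in ordinary hydrogen
r_muon = 1 / (MASS_RATIO * ALPHA)       # muon orbit in mesic hydrogen

# Muon momentum ~ alpha * m_mu compared with the electron mass.
p_over_me = ALPHA * MASS_RATIO

if __name__ == "__main__":
    print(f"enhancement ~ 10^{math.log10(enhancement):.1f}")
    print(r_muon, r_distortion, r_electron, p_over_me)
```

The muon orbit lies inside the distorted region while the electron orbit lies far outside it, and $\alpha m_\mu$ comes out comparable with $m_e$, which is why the non-relativistic form of the electron polarization operator cannot be used here.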
The level shift is found by averaging over the wave function of the muon in the atom:

$$\delta E_{nl} = -|e|\int|\psi_{nl}|^2\,\delta\Phi(r)\,d^3x = -|e|\int_0^\infty R_{nl}^2(r)\,\delta\Phi(r)\,r^2\,dr, \tag{124.1}$$

where $R_{nl}$ is the radial part of the (non-relativistic) Coulomb wave function. For a hydrogen-like ion with nuclear charge $Z|e|$, the functions $R_{nl}(r)$ depend on $r$ only through the dimensionless combination $\rho = Z\alpha m_\mu r$ (the distance in Coulomb units). Using this fact and substituting $\delta\Phi(r)$ from (114.5) (with the charge $Z|e|$ in place of $e_1$), we can bring the integral (124.1) to the form

$$\delta E_{nl} = -\frac{2}{3\pi}Z^2\alpha^3m_\mu\,Q_{nl}(m_e/Z\alpha m_\mu), \tag{124.2}$$

where

$$Q_{nl}(x) = \int_0^\infty\rho\,d\rho\int_1^\infty R_{nl}^2(\rho)\,e^{-2x\rho\zeta}\left(1+\frac{1}{2\zeta^2}\right)\frac{\sqrt{\zeta^2-1}}{\zeta^2}\,d\zeta.$$

The first few levels of mesic hydrogen are shown by numerical evaluation to have the following relative shifts:

$$\delta E_{10}/|E_{10}| = -6.4\times10^{-3},\qquad \delta E_{20}/|E_{20}| = -2.8\times10^{-4},\qquad \delta E_{21}/|E_{21}| = -2.0\times10^{-5}.$$

§ 125. The relativistic equation for bound states

The method used in the preceding sections to calculate the radiative shift of the atomic levels is not valid for solving a problem such as that of determining the corrections to the levels of positronium, a system consisting of two particles of equal rank, neither of which can be regarded as the source of the external field acting on the other.

The systematic procedure for solving this problem is based on the fact that the energy levels of the bound states are poles of the exact amplitude of mutual scattering of these two particles, as a function of their total energy in the centre-of-mass system. In any of its discrete states, positronium may be regarded as an "intermediate particle" having a definite mass, which can be formed as a stage in the electron-positron scattering process, and a pole of the scattering amplitude corresponds to each "one-particle" intermediate state; these poles of course lie in the non-physical region of 4-momenta of the particles undergoing scattering.
According to (106.17), the exact scattering amplitude comprises the exact four-ended vertex part $\Gamma_{ik,lm}$ and the polarization amplitudes $u$ of the particles. The latter are clearly unconnected with the pole singularities, and it is therefore more convenient to ignore them, referring instead to the poles of the vertex part itself, i.e. of the function

$$\Gamma_{ik,lm}(p'_-,-p_+;\,p_-,-p'_+), \tag{125.1}$$

where the notation for the 4-momenta of the external lines of the diagram (106.12) corresponds to the scattering of a positron by an electron.

It should be stressed that the assertion that poles are present refers to the exact scattering amplitude or the exact vertex part; there is no pole in any separate term of the perturbation-theory series, as can be seen from the fact that the Feynman diagrams in each approximation include only electron (and photon) lines, not lines belonging to the composite particle positronium as a whole. Hence it follows in turn that the calculation of the scattering amplitude near its poles involves summation of an infinite series of diagrams. The diagrams concerned can be determined as follows.

In the first non-vanishing approximation of perturbation theory (the first approximation with respect to $\alpha$), the vertex part (125.1) corresponds to two second-order diagrams:

[Diagrams (125.2): (a) scattering-type diagram, (b) annihilation-type diagram] (125.2)

or, in analytical form,

$$\Gamma_{ik,lm} = -e^2\gamma^\mu_{il}\gamma^\nu_{km}D_{\mu\nu}(p_--p'_-)+e^2\gamma^\mu_{im}\gamma^\nu_{kl}D_{\mu\nu}(p_-+p_+). \tag{125.3}$$

In the next approximation (the second with respect to $\alpha$) there are ten fourth-order diagrams:

[Diagrams (125.4): five fourth-order diagrams (a) to (e)] (125.4)

and a further five diagrams obtained from (125.4) by interchanging $p_-$ and $-p'_+$. All these include an extra power of $e^2 = \alpha$ in comparison with (125.2), but we shall show that in diagram (a) this extra order of smallness is cancelled by a denominator which is also small when the electron and positron momenta are small.
All quantities will be taken in the centre-of-mass system, but, since the 4-momenta of the external lines in the diagrams are not assumed to be physical (i.e. $p^2\ne m^2$), $\varepsilon_+\ne\varepsilon_-$ in this system, although $\mathbf p_+ = -\mathbf p_-$. Thus these 4-momenta are

$$p_- = (\varepsilon_-,\mathbf p),\qquad p_+ = (\varepsilon_+,-\mathbf p),\qquad p'_- = (\varepsilon'_-,\mathbf p'),\qquad p'_+ = (\varepsilon'_+,-\mathbf p'). \tag{125.5}$$

The binding energy of the electron and the positron in positronium is $\sim m\alpha^2$. Thus, in the neighbourhood of the scattering-amplitude poles, with which we are concerned,

$$|\mathbf p|\sim|\mathbf p'|\sim m\alpha\ll m,\qquad |\varepsilon_--m|\sim|\varepsilon_+-m|\sim\mathbf p^2/m\sim m\alpha^2. \tag{125.6}$$

The contribution to the vertex part from the diagram (125.4a) is

$$\Gamma^{(4a)}_{ik,lm} = -ie^4\int(\gamma^\lambda G(q)\gamma^\mu)_{il}\,(\gamma^\nu G(q-p_--p_+)\gamma^\rho)_{km}\,D_{\lambda\rho}(q-p'_-)\,D_{\mu\nu}(p_--q)\,\frac{d^4q}{(2\pi)^4}. \tag{125.7}$$

The important range of values of $q^\mu = (q_0,\mathbf q)$ in the integral (125.7) is that which is close to poles of both functions $G$ simultaneously. In this range, $|\mathbf q|$ and $|q_0-m|$ are small, and the electron propagators are

$$G(q) = \frac{\gamma^0q_0-\boldsymbol\gamma\cdot\mathbf q+m}{(q_0+m)(q_0-m)-\mathbf q^2+i0}\approx\tfrac12(\gamma^0+1)\,\frac{1}{q_0-m-(\mathbf q^2/2m)+i0},$$
$$G(q-p_--p_+)\approx\tfrac12(\gamma^0-1)\,\frac{1}{q_0-\varepsilon_--\varepsilon_++m+(\mathbf q^2/2m)-i0}. \tag{125.8}$$

The poles of these two expressions are on opposite sides of the real axis of the complex variable $q_0$; closing the path of integration along this axis by a contour in the upper half-plane (say), we can calculate the integral over $dq_0$ from the residue at the corresponding pole.† The result is

$$\Gamma^{(4a)}\sim e^4\int\frac{d^3q}{(\mathbf q-\mathbf p')^2(\mathbf p-\mathbf q)^2(2m-\varepsilon_--\varepsilon_++\mathbf q^2/m)},$$

and so, using (125.6), we have in order of magnitude

$$\Gamma^{(4a)}\sim\alpha^2\,\frac{(m\alpha)^3}{(m\alpha)^4\,m\alpha^2} = \frac{1}{m\cdot m\alpha}.$$

The contribution to $\Gamma$ from the second-order diagram (125.2a) (the first term in (125.3)) is of the same order, and this proves the statement made above about the order of smallness of the diagram (125.4a). A similar situation occurs in all higher approximations of perturbation theory.
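The power counting used here can be spelled out in a few lines. The sketch below is illustrative and assumes only the estimates (125.6): momenta of order $m\alpha$, energy denominators of order $m\alpha^2$, and a second-order term of order $e^2/\mathbf q^2\sim\alpha/(m\alpha)^2$. The function names are hypothetical.

```python
import math

def second_order_estimate(alpha, m=1.0):
    """Order of magnitude of the one-photon exchange diagram (125.2a): e^2 / q^2."""
    p = m * alpha
    return alpha / p**2

def ladder_fourth_order_estimate(alpha, m=1.0):
    """Order of magnitude of diagram (125.4a) after the q0 integration.

    e^4 * d^3q / [ (q - p')^2 (p - q)^2 (2m - eps_- - eps_+ + q^2/m) ]
    with |q| ~ m*alpha and an energy denominator ~ m*alpha^2.
    """
    p = m * alpha
    binding = m * alpha**2
    return alpha**2 * p**3 / (p**2 * p**2 * binding)

if __name__ == "__main__":
    for alpha in (1 / 137.036, 0.01, 0.1):
        ratio = ladder_fourth_order_estimate(alpha) / second_order_estimate(alpha)
        print(alpha, ratio)
```

The ratio is independent of $\alpha$: the extra factor $e^2$ of the fourth-order diagram is exactly compensated by the small energy denominator, which is what makes diagram (a) "anomalously large".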
Thus the calculation of the relevant vertex part near its poles calls for the summation of an infinite succession of "anomalously large" diagrams with intermediate states resembling the internal lines of (125.4a). A typical property of these diagrams is that they can be cut between the ends $p_-$, $-p_+$ and $p'_-$, $-p'_+$ into parts joined only by two electron lines.‡ The set of all diagrams which do not satisfy this condition will be called a "compact" vertex part and denoted by $\tilde\Gamma_{ik,lm}$; since it does not include the anomalously large diagrams, such quantities can be calculated by ordinary perturbation theory. For example, in the first approximation $\tilde\Gamma$ is given by the two second-order diagrams (125.2), and in the second approximation by the eight fourth-order diagrams in (125.4), excluding diagrams (a) and (b).

† For the diagram (125.4c), which differs from (125.4a) only as regards the relative direction of the electron lines, both poles would be on the same side of the real axis, and so the integral would be zero in the approximation considered.

‡ This definition includes all the anomalously large diagrams and also some "normal" diagrams such as (125.4b).

If the non-compact vertex parts are classified according to the number of "double bonds", we can represent the total $\Gamma$ as an infinite series:

[Diagram series (125.9): $\Gamma$ as a sum of compact parts $\tilde\Gamma$ joined by pairs of thick electron lines] (125.9)

where the continuous thick internal lines are exact propagators $\mathscr G$; this is often called a "ladder" series. To sum the series, we "multiply" it on the left by a further $\tilde\Gamma$.† Comparison with the original series (125.9) shows that

[Diagram equation (125.10): $\Gamma$ equals $\tilde\Gamma$ plus $\tilde\Gamma$ joined to $\Gamma$ by two thick electron lines] (125.10)

This graphical equation corresponds to the integral equation

$$i\Gamma_{ik,lm}(p'_-,-p_+;\,p_-,-p'_+) = i\tilde\Gamma_{ik,lm}(p'_-,-p_+;\,p_-,-p'_+)+\int\tilde\Gamma_{ir,sm}(p'_-,\,q-p'_+-p'_-;\,q,-p'_+)\,\mathscr G_{st}(q)\,\mathscr G_{nr}(q-p'_+-p'_-)\,\Gamma_{tk,ln}(q,-p_+;\,p_-,\,q-p'_+-p'_-)\,\frac{d^4q}{(2\pi)^4}. \tag{125.11}$$

The functions $\tilde\Gamma$ and $\mathscr G$ are calculated by perturbation theory, and equation (125.11) then allows, in principle, the determination of $\Gamma$ with any desired accuracy.

To find the energy levels, we need to know only the positions of the poles of $\Gamma$. Near the poles, $\Gamma\gg\tilde\Gamma$, and so the first term on the right of (125.11) (the second diagram from the right in (125.10)) may be neglected, the equation then becoming homogeneous in $\Gamma$. The variables $p_+$, $p_-$ and the indices $k$ and $l$ become parameters, the dependence on which is arbitrary and is not defined by the equation itself. Omitting these parameters and also the primes in the remaining variables $p'_+$, $p'_-$, we have

$$i\Gamma_{i,m}(p_-;-p_+) = \int\tilde\Gamma_{ir,sm}(p_-,\,q-p_+-p_-;\,q,-p_+)\,\mathscr G_{st}(q)\,\mathscr G_{nr}(q-p_+-p_-)\,\Gamma_{t,n}(q;\,q-p_+-p_-)\,\frac{d^4q}{(2\pi)^4} \tag{125.12}$$

(E. E. Salpeter and H. A. Bethe, 1951).

Equation (125.12), written in the centre-of-mass system ($\mathbf p_++\mathbf p_- = 0$), has solutions only for certain values of $\varepsilon_++\varepsilon_-$, and these give the positronium energy levels. The function $\Gamma_{i,m}$ plays only an auxiliary role. Another function is more convenient in practice:

$$\chi_{sr}(p_1;p_2) = \mathscr G_{st}(p_1)\,\Gamma_{t,n}(p_1;p_2)\,\mathscr G_{nr}(p_2). \tag{125.13}$$

† That is, we multiply each term in the series by $\tilde\Gamma$ and two $\mathscr G$ and integrate appropriately over the 4-momenta of the new internal bonds.

Then equation (125.12) becomes

$$i[\mathscr G^{-1}(p_-)\,\chi(p_-;-p_+)\,\mathscr G^{-1}(-p_+)]_{im} = \int\tilde\Gamma_{ir,sm}(p_-,\,q-p_+-p_-;\,q,-p_+)\,\chi_{sr}(q;\,q-p_+-p_-)\,\frac{d^4q}{(2\pi)^4}, \tag{125.14}$$

where $\tilde\Gamma$ appears as the kernel of an integral operator. As already mentioned, $\tilde\Gamma$ may be calculated by perturbation theory, and the same is of course true of $\mathscr G^{-1}$.

We shall show that, in the first approximation of perturbation theory (with respect to $\alpha$), (125.14) reduces, as we should expect, to the non-relativistic Schrödinger's equation for positronium. In the first non-relativistic approximation, $\tilde\Gamma$ is determined by the diagram (125.2a) alone; the annihilation-type diagram (125.2b) is zero in this approximation.†
For a similar reason to that in §83, it is convenient to take the photon propagator in the Coulomb gauge (76.12), (76.13), and only $D_{00}$ need be retained in it. Then

$$\tilde\Gamma_{ir,sm}(p_-, q - p_+ - p_-; q, -p_+) = -e^2 \gamma^0_{is}\gamma^0_{rm} D_{00}(q - p_-) = -U(\mathbf{q} - \mathbf{p}_-)\, \gamma^0_{is}\gamma^0_{rm},$$

where $U(\mathbf{q}) = -4\pi e^2/\mathbf{q}^2$ is the Fourier component of the potential energy of the Coulomb interaction between the positron and the electron. Equation (125.14) becomes

$$i\chi(p_-, -p_+) = G(p_-)\gamma^0 \int U(\mathbf{q} - \mathbf{p}_-)\, \chi(q, q - p_+ - p_-)\, \frac{d^4q}{(2\pi)^4} \cdot \gamma^0 G(-p_+), \tag{125.15}$$

where we have also replaced the exact propagators $\mathcal{G}$ by the free-electron propagators $G$. The latter are given by the approximate expression (cf. (125.8))

$$G(p_-) \approx \tfrac12(1 + \gamma^0)\, g(p_-), \qquad G(-p_+) \approx \tfrac12(1 - \gamma^0)\, g(p_+),$$

where the matrix factors have been separated, and $g(p)$ is the scalar function

$$g(p) = \frac{1}{\varepsilon - m - (\mathbf{p}^2/2m) + i0}. \tag{125.16}$$

† The particle velocities in positronium are such that $v/c \sim \alpha$. In this sense the expansions in powers of $\alpha$ and of $1/c$ are interrelated.

In substituting these expressions in (125.15), we note that all the non-zero matrix elements

$$\left[\tfrac12(1 + \gamma^0)\gamma^0 \chi \gamma^0 \cdot \tfrac12(1 - \gamma^0)\right]_{im} = \tfrac14\left[(\gamma^0 + 1)\chi(\gamma^0 - 1)\right]_{im}$$

are equal to the elements $-\chi_{im}$. The matrix equation (125.15) is therefore equivalent to one for the scalar function

$$i\chi(p_-, -p_+) = -g(p_-)\, g(p_+) \int U(\mathbf{q} - \mathbf{p}_-)\, \chi(q, q - p_+ - p_-)\, \frac{d^4q}{(2\pi)^4}. \tag{125.17}$$

We now replace $p_+$ and $p_-$ by the variables

$$p = (\varepsilon, \mathbf{p}) = \tfrac12(p_- - p_+), \qquad P = p_- + p_+;$$

these are the 4-momentum of the relative motion of the particles and that of the positronium as a whole. In the centre-of-mass system, $P = (E + 2m, 0)$, where $E + 2m$ is the total energy, and $E$ therefore the energy level relative to the rest mass. In terms of these variables, (125.17) becomes

$$i\chi(p, P) = -g(p + \tfrac12 P)\, g(-p + \tfrac12 P) \int U(\mathbf{q} - \mathbf{p}_-)\, \chi(q - \tfrac12 P, P)\, \frac{d^4q}{(2\pi)^4} = -g(p + \tfrac12 P)\, g(-p + \tfrac12 P) \int U(\mathbf{q}' - \mathbf{p})\, \chi(q', P)\, \frac{d^4q'}{(2\pi)^4}.$$

In this equation, $P$ occurs only as a parameter, and $\chi$ figures on the right-hand side only as the integral

$$\psi(\mathbf{q}) = \int \chi(q, P)\, dq_0.$$
Integrating both sides of the equation over $d\varepsilon$, we get an equation for $\psi$ in a closed form:

$$\psi(\mathbf{p}) = \frac{i}{2\pi} \int_{-\infty}^{\infty} g(p + \tfrac12 P)\, g(-p + \tfrac12 P)\, d\varepsilon \int U(\mathbf{q} - \mathbf{p})\, \psi(\mathbf{q})\, \frac{d^3q}{(2\pi)^3},$$

where

$$g(\pm p + \tfrac12 P) = \frac{1}{\pm\varepsilon + \tfrac12 E - (\mathbf{p}^2/2m) + i0}.$$

If the path of integration over $d\varepsilon$ is closed by a contour in the upper half-plane (say) of the complex variable $\varepsilon$, we can evaluate the integral from the residue at the corresponding pole, obtaining

$$\left(\frac{\mathbf{p}^2}{m} - E\right)\psi(\mathbf{p}) + \int U(\mathbf{p} - \mathbf{q})\, \psi(\mathbf{q})\, \frac{d^3q}{(2\pi)^3} = 0. \tag{125.18}$$

This is Schrödinger's equation for positronium in the momentum representation; see QM, (130.4).

If only the diagrams (125.2) were used in $\tilde\Gamma$, but including in them (and in $\mathcal{G}$) the next terms in the expansion in powers of $1/c$, we should arrive at Breit's equation (§83). The inclusion of the diagrams (125.4) (together with the further terms in the expansion in $1/c$) gives the radiative corrections to the positronium levels, but the calculations become very complicated. The following is the difference between the ground levels of ortho- and para-positronium, including the above-mentioned corrections:†

$$E(^3S_1) - E(^1S_0) = \alpha^2\, \frac{me^4}{2\hbar^2} \left\{ \frac{7}{6} - \left(\frac{16}{9} + \log 2\right)\frac{\alpha}{\pi} - \tfrac12 i\alpha \right\}; \tag{125.19}$$

the first term in the braces is the fine splitting (see §84, Problem 2). The second term is the radiative correction to the difference between the levels. The imaginary part of the difference arises from the parapositronium annihilation probability (see (89.4)), i.e. from the fact that the level $^1S_0$ is complex; for parapositronium, the level width is found to be of the same order as the radiative correction to the real part of the level.

§ 126. The double dispersion relation

After the vertex part with three external lines, the next in order of complexity is a section with four external lines.
In quantum electrodynamics, three basic diagrams of this type are possible:

[diagrams (126.1a), (126.1b), (126.1c)]

The first describes the scattering of a photon by a photon; the others are individual terms in the radiative corrections to the scattering of (b) a photon and (c) an electron, by an electron. This §126 deals with some general properties of such diagrams, but to be simple and specific we shall refer only to (126.1a).

The momenta of the lines in such a diagram will be denoted as follows:

[diagram (126.2): square diagram with external photon lines $k_1, k_2, k_3, k_4$ and internal electron lines $q$, $q - k_4$, $q - k_1 - k_2$, $q - k_2$]

† R. Karplus and A. Klein, Physical Review 87, 848, 1952.

The 4-momenta $k_1, k_2, k_3, k_4$ correspond to real photons, and their squares are therefore zero. If the dependence on the photon polarizations is written separately, the amplitude $M_{fi}$ which corresponds to the diagram (126.2) can be expressed in terms of various scalar functions of the photon 4-momenta. These are the invariant amplitudes discussed in §70; they will be derived in §127 for the specific case of photon-photon scattering. Being scalars, they depend only on scalar variables, which may be taken as, for example, any two of the quantities

$$s = (k_1 + k_2)^2, \quad t = (k_1 - k_3)^2, \quad u = (k_1 - k_4)^2, \qquad s + t + u = 0; \tag{126.3}$$

in what follows we shall take $s$ and $t$ as independent variables.

Each of the invariant amplitudes, which will be denoted here by the same letter $M$, can be written as an integral:

$$M = \int \frac{iB\, d^4q}{[q^2 - m^2][(q - k_4)^2 - m^2][(q - k_1 - k_2)^2 - m^2][(q - k_2)^2 - m^2]}, \qquad m^2 \to m^2 - i0, \tag{126.4}$$

where $B$ is some function of all the 4-momenta; the factors in the denominator arise from the propagators of the four virtual electrons.

When $s$ and $t$ are sufficiently small, the amplitudes $M$ are real (more precisely, they can be made real by a suitable choice of the phase factor), since if $s$ is small the photons cannot generate real particles (an electron-positron pair) in the $s$ channel, and if $t$ is small the same applies to the $t$ channel.†
Thus neither channel has real intermediate states which could, according to the unitarity condition, lead to an imaginary part of the amplitude.

Now let $s$ increase while $t$ remains at a fixed small value. When $s \geq 4m^2$, the amplitude $M$ has an imaginary part due to the possibility of pair production by two photons in the $s$ channel. We can therefore write for $M$ a dispersion relation in the variable $s$:

$$M(s, t) = \frac{1}{\pi} \int_{4m^2}^{\infty} \frac{A_{1s}(s', t)}{s' - s - i0}\, ds', \tag{126.5}$$

where $A_{1s}(s, t)$ denotes the imaginary part of $M(s, t)$. As in any diagram having the form

[diagram]

$A_{1s}(s, t)$ is calculated by the rule (115.9), replacing the pole factors in the integral (126.4) by delta functions:

$$2iA_{1s}(s, t) = (2\pi i)^2 \int \frac{iB\, \delta(q^2 - m^2)\, \delta[(q - k_1 - k_2)^2 - m^2]}{[(q - k_4)^2 - m^2][(q - k_2)^2 - m^2]}\, d^4q; \tag{126.6}$$

the integration is taken over the half of $q$-space in which $q^0 > 0$.

† The directions of the external lines as shown in the diagram (126.2) correspond to the $s$ channel. In the $t$ channel, lines 1 and 3 are incoming, and so the 4-momenta of the initial photons are $k_1$ and $-k_3$. The physical regions for photon-photon scattering in the variables $s, t, u$ are the shaded sectors in Fig. 8 (§67). For example, the $s$ channel corresponds to the region $s > 0$, $t < 0$, $u < 0$.

An important further step can be taken by noting that the integral (126.6) has a structure (of pole factors) similar to that of the amplitude for a reaction represented by a diagram having the form

[diagram with internal lines $q - k_4$ and $q - k_2$]

The analytical properties of $A_{1s}(s, t)$ as a function of $t$ are therefore similar to the analytical properties of this amplitude. In particular, the function $A_{1s}(s, t)$ can acquire an imaginary part (as $t$ increases) only if both factors in the denominator become zero simultaneously. This will not, however, occur as soon as $t$ reaches the value $4m^2$ which is the threshold for pair production in the $t$ channel.
The reason is that the presence of the delta functions in the integrand restricts the region of integration in $q$-space, which may be incompatible with the value $t = 4m^2$. The extent of the region of integration depends on $s$ (the arguments of the delta functions contain $k_1$ and $k_2$), and therefore so does the limiting value $t = t_c(s)$ beyond which $A_{1s}(s, t)$ becomes complex.

In the same way as $M(s, t)$ is expressed in terms of its imaginary part $A_{1s}(s, t)$ by (126.5), the function $A_{1s}(s, t)$ is in turn expressed in terms of $A_2(s, t) = \mathrm{im}\, A_{1s}(s, t)$ by a dispersion relation in the variable $t$:

$$A_{1s}(s, t) = \frac{1}{\pi} \int_{t_c(s)}^{\infty} \frac{A_2(s, t')}{t' - t - i0}\, dt'. \tag{126.7}$$

If we now substitute (126.7) in (126.5), we get the double dispersion relation or Mandelstam representation for the amplitude $M(s, t)$:

$$M(s, t) = \frac{1}{\pi^2} \int_{4m^2}^{\infty} \int_{t_c(s')}^{\infty} \frac{A_2(s', t')}{(s' - s - i0)(t' - t - i0)}\, dt'\, ds' \tag{126.8}$$

(S. Mandelstam, 1958). The function $A_2(s, t)$ is called the double spectral density of $M(s, t)$. It can be obtained from the integral (126.6) by twice applying the substitution rule (115.9). Putting for brevity

$$l_1 = q, \quad l_2 = q - k_4, \quad l_3 = q - k_2, \quad l_4 = q - k_1 - k_2, \tag{126.9}$$

we have

$$(2i)^2 A_2(s, t) = (2\pi i)^4 \int iB\, \delta(l_1^2 - m^2)\, \delta(l_2^2 - m^2)\, \delta(l_3^2 - m^2)\, \delta(l_4^2 - m^2)\, d^4q, \tag{126.10}$$

the integration being taken over the region $q^0 > 0$.

It should be noted, however, that formula (126.10) is purely symbolic, since the region $s > 0$, $t > 0$ is non-physical, and accordingly $l_1, l_2, \ldots$ are in general complex in this region when $q$ is real; and the delta function is not fully defined for a complex argument. It would be more accurate to refer immediately to the taking of residues at the corresponding poles of the original integral (126.4). In our case this is, however, of no importance. The condition for the four expressions in the denominator in (126.4), or the four arguments of the delta functions, to be zero, entirely determines the components of the 4-vector $q$. On changing to integration with respect to $l_1^2, l_2^2, \ldots$
(see below) and formally applying the usual rules to (126.10), we obtain (apart from the sign) the expression for $A_2$.

To continue the calculations, we use the centre-of-mass system (in the $s$ channel). Then

$$k_1 = (\omega, \mathbf{k}), \quad k_2 = (\omega, -\mathbf{k}), \quad k_3 = (\omega, \mathbf{k}'), \quad k_4 = (\omega, -\mathbf{k}'), \tag{126.11}$$

$$s = 4\omega^2, \quad t = -(\mathbf{k} - \mathbf{k}')^2 = -4\omega^2 \sin^2 \tfrac12\theta, \quad u = -(\mathbf{k} + \mathbf{k}')^2 = -4\omega^2 \cos^2 \tfrac12\theta, \tag{126.12}$$

where $\theta$ is the angle between $\mathbf{k}$ and $\mathbf{k}'$ (the scattering angle). The $x$-axis of spatial Cartesian coordinates is taken along the vector $\mathbf{k} + \mathbf{k}'$, and the $y$-axis along $\mathbf{k} - \mathbf{k}'$.†

We shall now transform the integral (126.10) by taking $l_1^2, l_2^2, \ldots$ as new variables of integration in place of the four components of $q$. Then $\partial(l_1^2)/\partial q^\mu = 2l_{1\mu}, \ldots$, and the Jacobian of the transformation is therefore

$$\frac{\partial(l_1^2, l_2^2, l_3^2, l_4^2)}{\partial(q^0, q^1, q^2, q^3)} = 16D,$$

where $D$ is the determinant formed by the sixteen components of the four 4-vectors $l_1, l_2, \ldots$. The integration in (126.10) amounts simply to replacing the functions $B$ and $D$ in the integrand by their values‡ when

$$l_1^2 = l_2^2 = l_3^2 = l_4^2 = m^2. \tag{126.13}$$

† When $t > 0$, $(\mathbf{k} - \mathbf{k}')^2 < 0$, i.e. the vector $\mathbf{k} - \mathbf{k}'$ is imaginary. This difficulty is, however, easily circumvented by expanding all vector expressions with $t < 0$ and using analytical continuation to $t > 0$.

‡ This method of integration automatically takes account of only one zero of each argument of the delta functions.

From the conditions $l_1^2 = l_4^2 = m^2$ we have, as in §115,

$$q^0 = \omega, \qquad \mathbf{q}^2 = \omega^2 - m^2. \tag{126.14}$$

The other two conditions give

$$(q - k_4)^2 - m^2 = -2qk_4 = -2\omega^2 - 2\mathbf{q}\cdot\mathbf{k}' = 0,$$

$$(q - k_2)^2 - m^2 = -2\omega^2 - 2\mathbf{q}\cdot\mathbf{k} = 0,$$

and hence $\mathbf{q}\cdot\mathbf{k} = \mathbf{q}\cdot\mathbf{k}' = -\tfrac14 s$, or, in components,

$$q^0 = \omega, \quad q_x = -\frac{s}{2\sqrt{s + t}}, \quad q_y = 0, \quad q_z = \pm\sqrt{\omega^2 - m^2 - q_x^2} = \pm\left[\frac{st - 4m^2(s + t)}{4(s + t)}\right]^{1/2}. \tag{126.15}$$

Thus the integral (126.10) is

$$A_2(s, t) = -\frac{\pi^4}{4} \sum \frac{iB}{D}, \tag{126.16}$$

where the summation is over the two values of $q$ given by (126.15).
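The statement that the four on-shell conditions fix $q$ completely can be checked numerically. The sketch below assumes the kinematics (126.11), (126.12) with the $x$-axis along $\mathbf{k} + \mathbf{k}'$ and the $y$-axis along $\mathbf{k} - \mathbf{k}'$, reads (126.15) as $q_x = -s/2\sqrt{s+t}$, $q_z = \pm\{[st - 4m^2(s+t)]/4(s+t)\}^{1/2}$, and picks illustrative values of $m$, $\omega$ and $\theta$ in the physical region $t < 0$, where $q_z$ is imaginary. Squares are taken without complex conjugation, in the spirit of the analytic continuation described in the footnote; the helper names are hypothetical.

```python
import cmath
import math

def minkowski_sq(v):
    """Square of a 4-vector (v0, vx, vy, vz), metric (+,-,-,-), no complex conjugation."""
    return v[0] ** 2 - v[1] ** 2 - v[2] ** 2 - v[3] ** 2

def sub(a, b):
    """Component-wise difference of two 4-vectors."""
    return tuple(x - y for x, y in zip(a, b))

m, omega, theta = 1.0, 2.0, 1.0            # illustrative values only
s = 4 * omega ** 2
t = -4 * omega ** 2 * math.sin(theta / 2) ** 2

c, sn = math.cos(theta / 2), math.sin(theta / 2)
k_vec = (omega * c, omega * sn, 0.0)       # k : k + k' lies along x, k - k' along y
kp_vec = (omega * c, -omega * sn, 0.0)     # k'
k1 = (omega,) + k_vec
k2 = (omega,) + tuple(-x for x in k_vec)
k4 = (omega,) + tuple(-x for x in kp_vec)

# Components of q according to (126.15); q_z is imaginary for t < 0
qx = -s / (2 * math.sqrt(s + t))
qz = cmath.sqrt((s * t - 4 * m ** 2 * (s + t)) / (4 * (s + t)))
q = (omega, qx, 0.0, qz)

l1, l2, l3, l4 = q, sub(q, k4), sub(q, k2), sub(sub(q, k1), k2)
residuals = [abs(minkowski_sq(l) - m ** 2) for l in (l1, l2, l3, l4)]
print(residuals)
```

All four residuals vanish to rounding accuracy, so the choice (126.15) puts all four internal lines on the mass shell at once.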
The determinant $D$ can be written in terms of the antisymmetric unit tensor:

$$D = e_{\lambda\mu\nu\rho}\, l_1^\lambda l_2^\mu l_3^\nu l_4^\rho = -\tfrac12 e_{\lambda\mu\nu\rho}\, (q - k_1)^\lambda (k_4 - k_3)^\mu (k_2 - k_1)^\nu k_1^\rho,$$

where the antisymmetry of $e_{\lambda\mu\nu\rho}$ has been used. Since only $k_1$ among the four factors has a time component, we deduce that

$$D = -\omega\, \mathbf{q}\cdot(\mathbf{k} + \mathbf{k}') \times (\mathbf{k} - \mathbf{k}').$$

Expanding this expression with $t < 0$ and then continuing to $t > 0$, we find

$$D = -\omega q_z \sqrt{(s + t)}\, \sqrt{(-t)} \to \pm\tfrac14 i \left\{st[st - 4m^2(s + t)]\right\}^{1/2}. \tag{126.17}$$

The choice of sign needed here can be made as follows. For simplicity, let $B = 1$. Then $A_{1s}(s, t) < 0$ in the physical region ($s > 0$, $t < 0$), since the two factors in the denominator in (126.6) have the same (negative) sign:

$$(q - k_4)^2 - m^2 = -2\omega^2 - 2\mathbf{q}\cdot\mathbf{k}' < -2\omega(\omega - |\mathbf{q}|) < 0,$$

$$(q - k_2)^2 - m^2 = -2\omega^2 - 2\mathbf{q}\cdot\mathbf{k} < -2\omega(\omega - |\mathbf{q}|) < 0$$

(here we use the results (126.14) which follow from the presence of the two delta functions in the numerator, and which show that $|\mathbf{q}| < \omega$).† From (126.7) it is then seen that $A_2(s, t)$ also must be negative when $s > 0$ and $t > 0$ (since, as is evident from (126.16), $A_2(s, t)$ does not change sign). This means that the upper sign must be taken in (126.17), giving finally

$$A_2 = -\pi^4\, \frac{\sum B}{\left\{st[st - 4m^2(s + t)]\right\}^{1/2}}. \tag{126.18}$$

Since, from its significance, $A_2(s, t)$ must be real, there is a further condition: as well as $s$ and $t$, the expression in brackets in the denominator must be positive:

$$st - 4m^2(s + t) \geq 0, \qquad s > 0, \quad t > 0. \tag{126.19}$$

These inequalities define the region (shaded in Fig. 23) over which the integration is to be taken in the double dispersion integral (126.8). The region is bounded by the curve $st - 4m^2(s + t) = 0$, with asymptotes $s = 4m^2$ and $t = 4m^2$.

The dispersion relations in the form (126.5) and (126.8) do not yet take account of the renormalization conditions; if they were applied as they stand, the integrals would be divergent and would need to be regularized.
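As a small illustration of the region (126.19), the boundary curve $st - 4m^2(s + t) = 0$ can be solved for $t$ as a function of $s$. The following sketch uses hypothetical helper names (`t_boundary`, `in_spectral_region`) and the unit $m = 1$ by default; it only restates the inequality in executable form.

```python
def t_boundary(s, m=1.0):
    """Boundary value of t for given s, from s*t - 4*m^2*(s + t) = 0 (needs s > 4*m^2)."""
    if s <= 4 * m ** 2:
        raise ValueError("the boundary curve exists only for s > 4 m^2")
    return 4 * m ** 2 * s / (s - 4 * m ** 2)

def in_spectral_region(s, t, m=1.0):
    """True if (s, t) satisfies the inequalities (126.19)."""
    return s > 0 and t > 0 and s * t - 4 * m ** 2 * (s + t) >= 0

print(t_boundary(8.0))        # symmetric point of the curve, s = t = 8 m^2
print(t_boundary(1.0e6))      # approaches the asymptote t = 4 m^2 for large s
print(in_spectral_region(5.0, 5.0), in_spectral_region(100.0, 4.5))
```

The symmetric point $s = t = 8m^2$ is the point of the boundary closest to the origin along the diagonal, and for large $s$ the boundary tends to $t = 4m^2$, in line with the asymptotes quoted above.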
The renormalization condition for the amplitudes $M(s, t)$ is

$$M(0, 0) = 0: \tag{126.20}$$

the photon-photon scattering amplitude must be zero when $k_1 = k_2 = k_3 = k_4 = 0$ (and therefore $s = t = 0$), since $k = 0$ implies a potential constant in time and space, corresponding to no physical field; this condition will be further discussed in §127.

[FIG. 23: region of integration in the $(s, t)$ plane, with the value $4m^2$ marked on both axes]

† This is, of course, not fortuitous: $A_{1s}$ is negative, in fact, because of the unitarity condition, as is especially clear when $t = 0$ and $A_{1s}$ determines the total cross-section.

To include this condition automatically, we must write the dispersion relation "with subtraction" (as in deriving (111.13) from (111.8)). The required relation is obtained in a natural manner by first using an identical transformation (verified by multiplying both sides by $(s' - s)(t' - t)s't'$, which gives $s't'$ on each side):

$$\frac{1}{(s' - s)(t' - t)} = \frac{st}{(s' - s)(t' - t)s't'} + \frac{s}{(s' - s)s't'} + \frac{t}{(t' - t)s't'} + \frac{1}{s't'}.$$

Substitution of this in the integrand (126.8) gives

$$M(s, t) = \frac{st}{\pi^2} \iint \frac{A_2(s', t')\, ds'\, dt'}{(s' - s)(t' - t)s't'} + \frac{s}{\pi} \int \frac{f(s')\, ds'}{(s' - s)s'} + \frac{t}{\pi} \int \frac{g(t')\, dt'}{(t' - t)t'} + C,$$

where $C$ is a constant and

$$f(s) = \frac{1}{\pi} \int \frac{A_2(s, t')}{t'}\, dt', \qquad g(t) = \frac{1}{\pi} \int \frac{A_2(s', t)}{s'}\, ds'.$$

These equations would, however, be meaningful only if all the integrals converged. If not, the functions $f(s)$, $g(t)$ and the constant $C$ must be assigned specified values in accordance with the renormalization condition, putting

$$C = 0, \qquad f(s) = A_{1s}(s, 0), \qquad g(t) = A_{1t}(0, t),$$

where $A_{1t}$ is the imaginary part of $M(s, t)$ which appears as $t$ increases for a given small $s$ (just as $A_{1s}$ is the imaginary part which appears as $s$ increases for a given small $t$). The first of these equations is obvious: $C = M(0, 0) = 0$. The second (and similarly the third) follows on comparing the equation

$$M(s, 0) = \frac{s}{\pi} \int \frac{f(s')\, ds'}{(s' - s)s'}$$

with the single dispersion relation (126.5) written "with subtraction" according to (126.20):

$$M(s, 0) = \frac{s}{\pi} \int_{4m^2}^{\infty} \frac{A_{1s}(s', 0)}{(s' - s)s'}\, ds'. \tag{126.21}$$

Thus the double dispersion relation "with subtraction" is finally

$$M(s, t) = \frac{st}{\pi^2} \iint \frac{A_2(s', t')\, ds'\, dt'}{(s' - s)(t' - t)s't'} + \frac{s}{\pi} \int_{4m^2}^{\infty} \frac{A_{1s}(s', 0)}{(s' - s)s'}\, ds' + \frac{t}{\pi} \int_{4m^2}^{\infty} \frac{A_{1t}(0, t')}{(t' - t)t'}\, dt'. \tag{126.22}$$

If $s$ and $t$ are themselves within the region of integration, the integrals (126.21), (126.22) must as usual be taken in the sense

$$s \to s + i0, \qquad t \to t + i0. \tag{126.23}$$

§ 127.
Photon-photon scattering

The scattering of light by light (in a vacuum) is a specifically quantum-electrodynamic process; in classical electrodynamics it does not occur, owing to the fact that Maxwell's equations are linear.†

In quantum electrodynamics, photon-photon scattering is described as the result of the production of a virtual electron-positron pair by the two initial photons, followed by the annihilation of the pair into the final photons. The amplitude of this process (in the first non-vanishing approximation) is represented by six "square" diagrams with every possible relative position of the four external ends. These include the diagrams

[diagrams (127.1a), (127.1b), (127.1c)]

and another three which differ from these only in that the internal electron loop is traversed in the opposite direction. The contribution of these three diagrams is the same as that of the diagrams (127.1), and the total scattering amplitude is therefore

$$M_{fi} = 2\left(M^{(a)} + M^{(b)} + M^{(c)}\right), \tag{127.2}$$

where $M^{(a)}$, $M^{(b)}$ and $M^{(c)}$ are the contributions of diagrams (a), (b) and (c).

According to (64.19) the scattering cross-section is

$$d\sigma = \frac{1}{64\pi^2}\, |M_{fi}|^2\, \frac{do'}{(2\omega)^2}, \tag{127.3}$$

where $do'$ is the solid-angle element for the direction $\mathbf{k}'$ in the centre-of-mass system. The scattering angle in that system is denoted by $\theta$.

INVARIANT AMPLITUDES

Writing separately the polarization factors of the four photons, we have $M_{fi}$ in the form

$$M_{fi} = e_1^\lambda e_2^\mu e_3^{\nu *} e_4^{\rho *} M_{\lambda\mu\nu\rho}; \tag{127.4}$$

† In the limit of low frequencies, this process was first discussed by H. Euler (1936), and in the ultra-relativistic case by A. I. Akhiezer (1937). The complete solution is due to R. Karplus and M. Neumann (1951).

the 4-tensor $M_{\lambda\mu\nu\rho}$ (called the photon-photon scattering tensor) is a function of the 4-momenta of all the photons.
If the arguments of functions are written with the signs which correspond to like directions of the external lines in the diagram, it is evident from the symmetry of the group of diagrams (127.1) that $M_{\lambda\mu\nu\rho}(k_1, k_2, -k_3, -k_4)$ is symmetrical with respect to any interchange of the four arguments together with a simultaneous corresponding interchange of the four suffixes.

Because of the gauge invariance, the amplitude (127.4) is unchanged when $e$ is replaced by $e + \text{constant} \times k$. Thus we must have

$$k_1^\lambda M_{\lambda\mu\nu\rho} = k_2^\mu M_{\lambda\mu\nu\rho} = \ldots = 0. \tag{127.5}$$

It is easily deduced from this that, in particular, the expansion of the scattering tensor in powers of the 4-momenta $k_1, k_2, \ldots$ must begin with terms containing quaternary products of the components, and certainly

$$M_{\lambda\mu\nu\rho}(0, 0, 0, 0) = 0. \tag{127.6}$$

To determine the actual invariant amplitudes, however, it is desirable to take from the start a particular gauge of the polarization 4-vectors $e$, in which

$$e_1^\mu = (0, \mathbf{e}_1), \quad e_2^\mu = (0, \mathbf{e}_2), \ldots. \tag{127.7}$$

Then

$$M_{fi} = M_{iklm}\, e_{1i}\, e_{2k}\, e^*_{3l}\, e^*_{4m}, \tag{127.8}$$

where $M_{iklm}$ is a three-dimensional tensor. We take as the two independent polarizations for each photon the circular polarizations with opposite directions of rotation, i.e. two helical states with helicities $\lambda = \pm 1$. The tensor $M_{iklm}$ can then be written

$$M_{iklm} = \sum_{\lambda_1\lambda_2\lambda_3\lambda_4} M_{\lambda_1\lambda_2\lambda_3\lambda_4}\, e_{1i}^{(\lambda_1)*}\, e_{2k}^{(\lambda_2)*}\, e_{3l}^{(\lambda_3)}\, e_{4m}^{(\lambda_4)}; \tag{127.9}$$

the sixteen quantities $M_{\lambda_1\lambda_2\lambda_3\lambda_4}$ are functions of $s$, $t$ and $u$, and act as invariant amplitudes, but they are not all independent.

The quantities $M_{\lambda_1\lambda_2\lambda_3\lambda_4}$ are three-dimensional scalars. Spatial inversion changes the sign of the helicities, while the invariant quantities $s$, $t$ and $u$ remain unaltered. The condition of P invariance therefore gives the relations

$$M_{\lambda_1\lambda_2\lambda_3\lambda_4}(s, t, u) = M_{-\lambda_1, -\lambda_2, -\lambda_3, -\lambda_4}(s, t, u). \tag{127.10}$$

Time reversal interchanges the initial and final photons without affecting their helicities; $s$, $t$ and $u$ again remain unaltered. The condition of T invariance therefore gives the equation

$$M_{\lambda_1\lambda_2\lambda_3\lambda_4}(s, t, u) = M_{\lambda_3\lambda_4\lambda_1\lambda_2}(s, t, u). \tag{127.11}$$

Lastly, one further relation follows from the invariance of the amplitude $M_{fi}$ under the interchange of the two initial or the two final photons. If both interchanges are made ($k_1 \leftrightarrow k_2$, $k_3 \leftrightarrow k_4$), the variables $s$, $t$, $u$ are unaltered, and the interchange in the polarization indices leads to

$$M_{\lambda_1\lambda_2\lambda_3\lambda_4}(s, t, u) = M_{\lambda_2\lambda_1\lambda_4\lambda_3}(s, t, u). \tag{127.12}$$

It is easy to see that, because of the symmetry properties (127.10)-(127.12), the number of independent invariant amplitudes is only five, which may be chosen, for example, as

$$M_{++++}, \quad M_{++--}, \quad M_{+-+-}, \quad M_{+--+}, \quad M_{+++-}$$

(the suffixes $+$ and $-$ denoting, for brevity, helicity values $+1$ and $-1$).

If one of the amplitudes $M_{\lambda_1\lambda_2\lambda_3\lambda_4}$ is substituted for $M_{fi}$ in (127.3), the result is the cross-section for scattering with specified polarizations of the initial and final photons. The cross-section summed over the final polarizations and averaged over the initial polarizations is obtained by the substitution

$$|M_{fi}|^2 \to \tfrac14\left\{2|M_{++++}|^2 + 2|M_{++--}|^2 + 2|M_{+-+-}|^2 + 2|M_{+--+}|^2 + 8|M_{+++-}|^2\right\}. \tag{127.13}$$

The symmetry relations (127.10)-(127.12) connect different invariant amplitudes as functions of the same variables. Further functional relations are obtained from crossing invariance (§78), since the amplitude $M_{fi}$ describes the same reaction (photon-photon scattering) in every channel, and therefore must be the same for every channel. The $s$ channel (corresponding to the arrow directions as shown in the diagrams (127.1)) is converted to the $t$ channel by interchanging the 4-momenta $k_2$ and $-k_3$ (i.e. by changing the variables $s \leftrightarrow t$) and interchanging the helicity suffixes $\lambda_2 \leftrightarrow -\lambda_3$. Similarly, it is converted to the $u$ channel by interchanging $k_2$ and $-k_4$ ($s \leftrightarrow u$) and $\lambda_2 \leftrightarrow -\lambda_4$.
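The counting of independent helicity amplitudes can be made explicit by enumeration. The sketch below is an illustration, not part of the original derivation: it treats the three relations (127.10), (127.11), (127.12) as maps on helicity labels $(\lambda_1, \lambda_2, \lambda_3, \lambda_4)$ and groups the sixteen labels into classes connected by these maps. The function name `orbit` is hypothetical.

```python
from itertools import product

def orbit(h):
    """All helicity labels related to h by P (127.10), T (127.11) and the double
    interchange of photons (127.12), found by repeated application of the three maps."""
    seen, stack = set(), [h]
    while stack:
        a = stack.pop()
        if a in seen:
            continue
        seen.add(a)
        l1, l2, l3, l4 = a
        stack.append((-l1, -l2, -l3, -l4))   # P invariance
        stack.append((l3, l4, l1, l2))       # T invariance
        stack.append((l2, l1, l4, l3))       # k1 <-> k2 together with k3 <-> k4
    return frozenset(seen)

orbits = {orbit(h) for h in product((+1, -1), repeat=4)}
sizes = sorted(len(o) for o in orbits)
print(len(orbits), sizes)
```

The enumeration gives five classes, of sizes 2, 2, 2, 2 and 8, which are the weights of the five terms in the averaged sum (127.13).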
Crossing invariance thus leads to the relations

$$M_{+-+-}(s, t, u) = M_{++++}(u, t, s), \quad M_{+--+}(s, t, u) = M_{++++}(t, s, u), \quad M_{++++}(s, t, u) = M_{++++}(s, u, t); \tag{127.14}$$

$M_{++--}$ and $M_{+++-}$ are completely symmetrical in $s$, $t$ and $u$.† It is therefore sufficient to calculate only three of the sixteen amplitudes, for instance $M_{++++}$, $M_{++--}$ and $M_{+++-}$.

† Here we have also used the symmetry with respect to the two final photons. Since the three variables $s$, $t$ and $u$ are not independent, it would be sufficient to write two arguments (for example, the first two), but we have retained all three, simply in order to clarify the symmetry of the interchanges.

The relations (127.10)-(127.12) and (127.14) apply to the total amplitudes, i.e. the sums of the contributions of all three diagrams (127.1). But these contributions themselves are related in a manner which is obvious on comparing the diagrams. For example, diagram (b) is obtained from diagram (a) by the substitutions $k_2 \leftrightarrow -k_4$, $e_2 \leftrightarrow e_4^*$, and so their contributions to the invariant amplitudes are obtained from each other by interchanging the variables $s \leftrightarrow u$ and the suffixes $\lambda_2 \leftrightarrow -\lambda_4$; similarly, the contribution of diagram (c) is obtained from that of diagram (a) by the changes $t \leftrightarrow u$, $\lambda_3 \leftrightarrow -\lambda_4$.

CALCULATION OF THE AMPLITUDES

The integral $M^{(a)}_{fi}$ corresponding to the diagram (127.1a) has the form (126.4), with

$$B^{(a)} = -\mathrm{tr}\left\{(\gamma e_1)(\gamma q - \gamma k_2 + m)(\gamma e_2)(\gamma q + m)(\gamma e_4^*)(\gamma q - \gamma k_4 + m)(\gamma e_3^*)(\gamma q - \gamma k_1 - \gamma k_2 + m)\right\}. \tag{127.15}$$

The integrals (126.4) are logarithmically divergent. In accordance with the condition (127.6), they are regularized by subtracting the value when $k_1 = k_2 = \ldots = 0$.† The calculation of the regularized integrals is, however, exceedingly laborious.

The most straightforward way to calculate the photon-photon scattering amplitudes is based on the use of the double dispersion relation (B. De Tollis, 1964).
This method makes the most complete allowance for the symmetry of the diagrams, and almost entirely eliminates the difficulties of the integrations.

The function $A^{(a)}_{1s}(s, t)$ (and similarly $A^{(a)}_{1t}$) for any given set of helicities $\lambda_1, \lambda_2, \lambda_3, \lambda_4$ is calculated in accordance with (126.6); owing to the presence of two delta functions in the integrand, the value of $B^{(a)}$ is needed only for

$$l_1^2 \equiv q^2 = m^2, \qquad l_4^2 \equiv (q - k_1 - k_2)^2 = m^2. \tag{127.16}$$

These equations can be utilized in calculating the trace (127.15). For substitution in (126.22) we need the value of $A^{(a)}_{1s}$ only for $t = 0$. This implies that $\mathbf{k} = \mathbf{k}'$ and $k_2 = k_4$. Then the integral (126.6) becomes

$$A^{(a)}_{1s}(s, 0) = -\frac{\pi^2}{4} \sqrt{\frac{s - 4m^2}{s}} \int \frac{B^{(a)}\, do_{\mathbf{q}}}{[(q - k_2)^2 - m^2]^2}; \tag{127.17}$$

cf. the derivation of (115.10). With the angle $\vartheta$ between $\mathbf{q}$ and $\mathbf{k}$, we have

$$(q - k_2)^2 - m^2 = -2\omega(\omega - |\mathbf{q}| \cos\vartheta) = -\tfrac12 \sqrt{s}\left[\sqrt{s} - \sqrt{(s - 4m^2)} \cos\vartheta\right].$$

The integrals (127.17) can in fact be expressed in terms of elementary functions.

† In the summation of the contributions from all the diagrams, the divergent parts of the integrals cancel, as is easily seen by noting the asymptotic form of the integral as $q \to \infty$:

$$M^{(a)}_{\lambda\mu\nu\rho} \propto \int \mathrm{tr}\left\{\gamma_\lambda (\gamma q)\gamma_\mu (\gamma q)\gamma_\rho (\gamma q)\gamma_\nu (\gamma q)\right\} \frac{d^4q}{(q^2)^4}.$$

After averaging over the directions of $q$ (cf. (131.10)), the trace is easily calculated, giving

$$M^{(a)}_{\lambda\mu\nu\rho} \propto \left(g_{\lambda\mu}g_{\nu\rho} + g_{\lambda\nu}g_{\mu\rho} - 2g_{\lambda\rho}g_{\mu\nu}\right) \int \frac{d^4q}{(q^2)^2}.$$

The summation over the diagrams is equivalent to symmetrizing this expression with respect to the suffixes $\lambda$, $\mu$, $\nu$ and $\rho$, the result of which is zero. However, this is in a certain sense fortuitous, and does not remove the need for regularization, even though the latter amounts only to the subtraction of a finite quantity.

The calculation of $A^{(a)}_2(s, t)$ from its definition (126.18) involves no integration; here the expression for $B^{(a)}$ is to be taken for the values of $q$ given by (126.15), which satisfy not only (127.16) but also the conditions

$$(q - k_2)^2 = m^2, \qquad (q - k_4)^2 = m^2.$$
When the functions $A_{1s}$, $A_{1t}$ and $A_2$ have been calculated, the dispersion relation (126.22) gives the amplitude directly as single and double definite integrals.

We shall give the final result for the three invariant amplitudes which, according to the preceding discussion, are sufficient to determine all the other amplitudes:†

$$\frac{1}{8\alpha^2} M_{++++} = -1 - \left(2 + \frac{4t}{s}\right)B(t) - \left(2 + \frac{4u}{s}\right)B(u) - \left[\frac{2(t^2 + u^2)}{s^2} - \frac{8}{s}\right]\left[T(t) + T(u)\right] + \frac{4}{t}\left(1 - \frac{2}{s}\right)I(s, t) + \frac{4}{u}\left(1 - \frac{2}{s}\right)I(s, u) + \left[\frac{2(t^2 + u^2)}{s^2} - \frac{16}{s} - \frac{4}{t} - \frac{4}{u} - \frac{8}{tu}\right]I(t, u),$$

$$\frac{1}{8\alpha^2} M_{+++-} = 1 + 4\left(\frac{1}{s} + \frac{1}{t} + \frac{1}{u}\right)\left[T(s) + T(t) + T(u)\right] - 4\left(\frac{1}{u} + \frac{2}{st}\right)I(s, t) - 4\left(\frac{1}{t} + \frac{2}{su}\right)I(s, u) - 4\left(\frac{1}{s} + \frac{2}{tu}\right)I(t, u), \tag{127.18}$$

$$\frac{1}{8\alpha^2} M_{++--} = 1 - \frac{8}{st}I(s, t) - \frac{8}{su}I(s, u) - \frac{8}{tu}I(t, u).$$

Here, $B(s)$, $T(s)$ and $I(s, t)$ denote the functions

$$B(s) = \sqrt{\left(1 - \frac{4}{s}\right)} \sinh^{-1} \tfrac12\sqrt{(-s)} - 1, \quad s < 0,$$

$$T(s) = \left(\sinh^{-1} \tfrac12\sqrt{(-s)}\right)^2, \quad s < 0, \tag{127.19}$$

$$I(s, t) = \frac14 \int_0^1 \frac{dy}{y(1 - y) - (s + t)/st} \left\{\log[1 - i0 - sy(1 - y)] + \log[1 - i0 - ty(1 - y)]\right\},$$

and the expressions in the ranges $0 < s < 4$ and $s > 4$ are obtained from (127.19) by analytical continuation with the rule $s \to s + i0$, i.e. through the upper half-plane of these variables. To simplify the notation, $s$ and $t$ denote $s/m^2$ and $t/m^2$, in (127.18) and (127.19) only.

† Some further details of the transformations of the integrals, various representations of the transcendental functions $B$, $T$ and $I$, and some limiting forms are given by B. De Tollis, Nuovo Cimento [10] 32, 757, 1964; 35, 1182, 1965; V. Costantini, B. De Tollis and G. Pistoni, ibid. [11] 2A, 733, 1971.

SCATTERING CROSS-SECTION

The limiting case of low frequencies ($\omega \ll m$) corresponds to small values of the variables $s$, $t$ and $u$. The first terms in the expansion of the invariant amplitudes in powers of these variables are

$$M_{++++} \approx \frac{11e^4 s^2}{45m^4}, \quad M_{+-+-} \approx \frac{11e^4 u^2}{45m^4}, \quad M_{+--+} \approx \frac{11e^4 t^2}{45m^4}, \quad M_{++--} \approx -\frac{e^4(s^2 + t^2 + u^2)}{15m^4}, \quad M_{+++-} \approx 0. \tag{127.20}$$

Substituting these values in (127.3), we find the cross-sections for the scattering of polarized photons.
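The low-frequency limit of $M_{++--}$ can be checked numerically against the integral representation. The sketch below assumes the reading $\frac{1}{8\alpha^2}M_{++--} = 1 - \frac{8}{st}I(s,t) - \frac{8}{su}I(s,u) - \frac{8}{tu}I(t,u)$ of (127.18), the form of $I(s, t)$ in (127.19) with $s$, $t$ in units of $m^2$, and arguments far below threshold, where every logarithm is real and the $i0$ terms can be dropped. The Simpson-rule helper and the sample values of $s$ and $t$ are illustrative choices.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def I(s, t):
    """I(s, t) of (127.19) for small arguments (no singularities on 0 <= y <= 1)."""
    c = (s + t) / (s * t)
    def integrand(y):
        w = y * (1 - y)
        return (math.log1p(-s * w) + math.log1p(-t * w)) / (w - c)
    return 0.25 * simpson(integrand, 0.0, 1.0)

def m_ppmm_over_8alpha2(s, t):
    """M_{++--} / (8 alpha^2), with u fixed by s + t + u = 0."""
    u = -s - t
    return 1 - 8 / (s * t) * I(s, t) - 8 / (s * u) * I(s, u) - 8 / (t * u) * I(t, u)

s, t = 0.02, -0.005          # units of m^2, physical region s > 0, t < 0, u < 0
u = -s - t
exact = m_ppmm_over_8alpha2(s, t)
approx = -(s ** 2 + t ** 2 + u ** 2) / 120   # (127.20) divided by 8 alpha^2, with e^2 = alpha
print(exact, approx)
```

The two numbers agree to within the expected higher-order corrections, which are of relative order $s/m^2$. The constant term and the terms linear in $s$, $t$, $u$ cancel between the three integrals, so the result is a small difference of quantities of order unity; this is why the integrand uses `log1p`.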
The differential scattering cross-section for unpolarized photons, calculated from (127.13), is (in ordinary units)

$$d\sigma = \frac{1}{4\pi^2}\, \frac{139}{90^2}\, \alpha^2 r_e^2 \left(\frac{\hbar\omega}{mc^2}\right)^6 (3 + \cos^2\theta)^2\, do', \tag{127.21}$$

and the total cross-section is†

$$\sigma = \frac{973}{10125\pi}\, \alpha^2 r_e^2 \left(\frac{\hbar\omega}{mc^2}\right)^6 = 0.031\, \alpha^2 r_e^2 \left(\frac{\hbar\omega}{mc^2}\right)^6, \qquad \hbar\omega \ll mc^2. \tag{127.22}$$

In the opposite (ultra-relativistic) case, the total scattering cross-section for unpolarized photons is‡

$$\sigma = 4.7\, \alpha^4 (c/\omega)^2, \qquad \hbar\omega \gg mc^2. \tag{127.23}$$

Finally, the differential cross-section for small-angle scattering in the ultra-relativistic case is

$$d\sigma = \frac{\alpha^4}{\pi^2}\, \frac{c^2}{\omega^2} \log^4 \frac{1}{\theta}\, do, \qquad mc^2/\hbar\omega \ll \theta \ll 1. \tag{127.24}$$

This expression is valid with logarithmic accuracy (the next term in the expansion contains a power of the large logarithm lower by one unit). In the limit $\theta \to 0$ (forward scattering), (127.24) is invalid, and is replaced by

$$d\sigma = \frac{\alpha^4}{\pi^2}\, \frac{c^2}{\omega^2} \log^4 \frac{\hbar\omega}{mc^2}\, do, \qquad \theta \ll mc^2/\hbar\omega. \tag{127.25}$$

† In going from $d\sigma$ to $\sigma$, a factor $\tfrac12$ has to be included to take account of the identity of the two final photons.

‡ The origin of this dependence of $\sigma$ on $\omega$ will be further discussed at the end of §134.

[FIG. 24: total photon-photon scattering cross-section as a function of frequency, double logarithmic scale]

The expression (127.25) is easily derived from the general formulae (127.18), putting $t = 0$ and noting that, for $s \gg 1$, the highest power (the square) of the large logarithm is present only in the function

$$T(s/m^2) \approx \tfrac14 \log^2(s/m^2) \approx \log^2(\omega/m).$$

To this accuracy, the only non-zero amplitudes are

$$M_{++++} = M_{----} = M_{+-+-} = -16e^4 \log^2(\omega/m).$$

In particular, therefore, the photon polarization is in this case unchanged on scattering.

Figure 24 shows the total scattering cross-section as a function of the frequency, plotted on a double logarithmic scale. The cross-section decreases towards both low and high frequencies, reaching a maximum when $\hbar\omega \approx 1.5mc^2$. The break in the curve at $\hbar\omega = mc^2$ corresponds to the change in the nature of the process when the production of a real electron-positron pair becomes possible.
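The step from the low-frequency differential cross-section (127.21) to the total cross-section (127.22) is a single angular integration, which can be done in exact rational arithmetic. The sketch below assumes the reading $d\sigma = \frac{1}{4\pi^2}\frac{139}{90^2}\alpha^2 r_e^2(\hbar\omega/mc^2)^6(3 + \cos^2\theta)^2\, do'$ and includes the factor $\tfrac12$ for the two identical final photons; the variable names are illustrative.

```python
import math
from fractions import Fraction

# Integral of (3 + c^2)^2 = 9 + 6 c^2 + c^4 over c = cos(theta) from -1 to 1
angular = 2 * (Fraction(9) + Fraction(6, 3) + Fraction(1, 5))      # = 112/5

# sigma = [139 / (4 pi^2 90^2)] * (2 pi * angular) * (1/2) * alpha^2 r_e^2 (hbar omega / m c^2)^6.
# One power of pi cancels; 'coeff' is the rational number multiplying 1/pi.
coeff = Fraction(139, 4 * 90 ** 2) * 2 * angular * Fraction(1, 2)

print(coeff, float(coeff) / math.pi)
```

The rational coefficient reduces to $973/10125$, and dividing by $\pi$ gives approximately $0.031$, as quoted in (127.22).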
LOW FREQUENCIES

For low frequencies ($\omega \ll m$), the photon-photon scattering amplitude can also be derived by a totally different method, based on the correction terms in the Lagrangian of a weak electromagnetic field (§129). The small correction $\hat V'$ to the interaction Hamiltonian differs only in sign from that to the Lagrangian. From (129.21),

$$\hat V' = -\frac{e^4}{45 \cdot 8\pi^2 m^4} \int \left\{(\hat{\mathbf{E}}^2 - \hat{\mathbf{H}}^2)^2 + 7(\hat{\mathbf{E}}\cdot\hat{\mathbf{H}})^2\right\} d^3x. \tag{127.26}$$

Since this operator is of the fourth order in the field, it has matrix elements for the relevant transition, even in the first approximation. For the calculation, we substitute in (127.26)

$$\hat{\mathbf{E}} = -\partial\hat{\mathbf{A}}/\partial t, \quad \hat{\mathbf{H}} = \mathrm{curl}\, \hat{\mathbf{A}}, \quad \hat{\mathbf{A}} = \sqrt{(4\pi)} \sum_{\mathbf{k}\lambda} \left(\hat c_{\mathbf{k}\lambda}\, \mathbf{e}^{(\lambda)} e^{-ikx} + \hat c^{+}_{\mathbf{k}\lambda}\, \mathbf{e}^{(\lambda)*} e^{ikx}\right), \tag{127.27}$$

where $\lambda$ numbers the polarization; the S-matrix element is then given by

$$S_{fi} = -i\left\langle f \left| \int \hat V'\, dt \right| i \right\rangle = -i\left\langle 0 \left| \hat c_{\mathbf{k}_3\lambda_3} \hat c_{\mathbf{k}_4\lambda_4} \int \hat V'\, dt\; \hat c^{+}_{\mathbf{k}_1\lambda_1} \hat c^{+}_{\mathbf{k}_2\lambda_2} \right| 0 \right\rangle \tag{127.28}$$

(cf. §§72 and 77). When $\hat{\mathbf{A}}$ is normalized as in (127.27), the scattering amplitude $M_{fi}$ is found immediately from $S_{fi}$:

$$S_{fi} = i(2\pi)^4 \delta^{(4)}(k_3 + k_4 - k_1 - k_2)\, M_{fi} \tag{127.29}$$

(cf. §64). The mean value in (127.28) is calculated by means of Wick's theorem, using (77.3), with contraction of only the "external" operators $\hat c_{\mathbf{k}\lambda}$, $\hat c^{+}_{\mathbf{k}\lambda}$ with the internal operators $\hat{\mathbf{A}}$.

§ 128. Coherent scattering of a photon in the field of a nucleus

Other effects which are non-linear, like photon-photon scattering, and are described by square diagrams of the form (127.1), are the disintegration of one photon into two in an external field (and the reverse process of combination of two photons into one), and photon scattering in an external field. The former corresponds to diagrams in which one of the four external photon lines is replaced by an external field line; the latter process corresponds to diagrams with two external lines of real photons and two of virtual photons. This class includes, in particular, coherent (elastic) scattering of a photon in the constant electric field of a stationary nucleus.
In general, the calculations lead to very lengthy formulae involving multiple quadratures.† Here, only some estimates will be given.

† See V. Costantini, B. De Tollis and G. Pistoni, Nuovo Cimento [II] 2A, 733, 1971; B. De Tollis, M. Lusignoli and G. Pistoni, ibid. 32A, 227, 1976.

Because of the requirements of gauge invariance, the scattering amplitude as $\omega\to 0$ must contain products of the components of the 4-momenta of the initial photon ($k$) and the final photon ($k'$), just as the expansion of the photon-photon scattering amplitude begins with the quaternary products of the components of the 4-momenta of all the photons. Thus the scattering amplitude for a low-frequency photon is proportional to $\omega^2$. Since also this amplitude involves the external field (the field of the nucleus with charge $Ze$) in the second order, we conclude that the scattering cross-section is

$$d\sigma\sim Z^4\alpha^4 r_e^2(\omega/m)^4\,do\qquad(\omega\ll m). \tag{128.1}$$

The frequency dependence is, of course, in agreement with the general results of §59. The coefficient in (128.1) cannot be calculated from the Lagrangian for a uniform electromagnetic field (as was done for photon-photon scattering). The reason is that, in the process here considered, distances from the nucleus $r\sim 1/m$ at which its field cannot be regarded as uniform are important. The result of the exact calculation is

$$d\sigma_{++} = d\sigma_{--} = 1.004\times10^{-3}\,(Z\alpha)^4 r_e^2(\omega/m)^4\cos^4\tfrac12\theta\,do,$$
$$d\sigma_{+-} = d\sigma_{-+} = 3.81\times10^{-4}\,(Z\alpha)^4 r_e^2(\omega/m)^4\sin^4\tfrac12\theta\,do. \tag{128.2}$$

Here, as in §127, the suffixes + and - denote the helicities +1 and -1 of the final and initial photons; $\theta$ is the scattering angle in the rest frame of the nucleus (V. Costantini, B. De Tollis and G. Pistoni, 1971).

To estimate the cross-section at high frequencies, we use the optical theorem (§71).
The intermediate state which appears on the right-hand side of the unitarity relation is here a state of the electron-positron pair (corresponding to the division of the diagrams at two internal electron lines between external photon lines). The optical theorem therefore relates the amplitude for elastic scattering of a photon through an angle of zero and the total cross-section $\sigma_{\rm pair}$ for photon pair production in the field of the nucleus. If the amplitude $f(\omega,\theta)$ for scattering through an angle $\theta$ is so defined that the scattering cross-section is $d\sigma = |f|^2\,do$ (cf. (71.5)), we have

$$\operatorname{im} f(\omega,0) = \omega\sigma_{\rm pair}/4\pi.$$

The cross-section $\sigma_{\rm pair}$ is, of course, zero unless $\omega > 2m$. In the ultra-relativistic case, taking $\sigma_{\rm pair}$ from (94.6), we get

$$f''(\omega)\equiv\operatorname{im} f(\omega,0) = \frac{7}{9\pi}(Z\alpha)^2 r_e\frac{\omega}{m}\left[\log\frac{2\omega}{m}-\frac{109}{42}\right]\qquad(\omega\gg m). \tag{128.3}$$

The real part of the scattering amplitude is determined by the imaginary part, through the dispersion relation. The latter must here be written "with one subtraction", i.e. for the function $f/t$ (where $t = \omega^2$), since as $\omega\to0$ the amplitude $f\propto\omega^2$; compare the dispersion relation "with two subtractions" (111.13). Separating the real part of the dispersion integral (for which it is sufficient to take the integral as a principal value), and changing from integration with respect to $t' = \omega'^2$ to that with respect to $\omega'$, we have

$$f'(\omega)\equiv\operatorname{re} f(\omega,0) = \frac{2\omega^2}{\pi}\,P\!\int_{2m}^{\infty}\frac{f''(\omega')\,d\omega'}{\omega'(\omega'^2-\omega^2)}. \tag{128.4}$$

When $\omega\gg m$, the important values in the integral are $\omega'\sim\omega\gg m$, so that we can use the expression (128.3) for $f''(\omega')$; the lower limit of integration may then be replaced by zero. The principal value of the integral can be represented as half the sum of the integrals along paths on the upper and lower edges of the positive real axis in the complex $\omega'$-plane; these paths may then in turn be rotated in the $\omega'$-plane to lie along the positive and negative imaginary axes respectively.
Then

$$f'(\omega) = -\frac{\omega^2}{\pi}\int_0^\infty\frac{f''(i\xi)+f''(-i\xi)}{\xi(\xi^2+\omega^2)}\,d\xi = \frac{7}{9}(Z\alpha)^2\frac{r_e\,\omega^2}{m}\int_0^\infty\frac{d\xi}{\xi^2+\omega^2},$$

and the final result is

$$\operatorname{re} f(\omega,0) = \frac{7}{18}(Z\alpha)^2 r_e\,\omega/m. \tag{128.5}$$

Note that the real part of the amplitude, unlike the imaginary part, does not contain a large logarithm.

The sum of the squares of (128.3) and (128.5) gives the cross-section for scattering through an angle of zero as

$$d\sigma\big|_{\theta=0} = \frac{49}{81\pi^2}(Z\alpha)^4 r_e^2\left(\frac{\omega}{m}\right)^2\left\{\left[\log\frac{2\omega}{m}-\frac{109}{42}\right]^2+\frac{\pi^2}{4}\right\}do \tag{128.6}$$

(F. Rohrlich and R. L. Gluckstern, 1952).

The result (128.6) derived for scattering exactly forwards is valid also over a certain range of small angles. The condition for its validity can be shown to be $\theta\ll(m/\omega)^2$. This range, however, makes only a small contribution to the total scattering cross-section. The main contribution to the latter comes from angles $\theta\lesssim m/\omega$, as is easily seen from the general (not only for angle zero) unitarity relation between the amplitudes for photon-photon scattering and photon pair production. In that range, however, the logarithmic term is absent, and the total scattering cross-section is thus

$$\sigma\sim(Z\alpha)^4 r_e^2(\omega/m)^2\theta^2\sim(Z\alpha)^4 r_e^2 \tag{128.7}$$

(H. A. Bethe and F. Rohrlich, 1952). For large $\omega$, therefore, the coherent scattering cross-section tends to a constant limit.

§ 129. Radiative corrections to the electromagnetic field equations

In the quantization of the electron-positron field (§25) it has been shown that the expression for the vacuum energy contains an infinite constant, which may be written†

$$\mathscr{E}_0 = -\sum_{p\sigma}\varepsilon^{(-)}_{p\sigma}, \tag{129.1}$$

where $-\varepsilon^{(-)}_{p\sigma}$ are the negative frequencies of the solutions of Dirac's equation. This constant itself has no physical meaning, since the vacuum energy is, by definition, zero. When an electromagnetic field is present, however, the energy levels $\varepsilon^{(-)}_{p\sigma}$ will change. The changes are finite and are physically significant. They describe the field dependence of the properties of space, and alter the equations of the electromagnetic field in a vacuum.
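The step from the dispersion integral to (128.5) can be checked numerically on the real axis, without rotating the contour. The following is a sketch under stated assumptions: the imaginary part is taken in the ultra-relativistic form $f''(\omega') = A\,\omega'[\log(2\omega'/m) - c]$ with $A = (7/9\pi)(Z\alpha)^2 r_e/m$, the lower limit is replaced by zero, and the substitution $\omega' = \omega x$ is made. The constant part of the bracket then drops out, because $P\int_0^\infty dx/(x^2-1) = 0$, and only $P\int_0^\infty \log x\,dx/(x^2-1)$ remains. The function name `pv_log_integral` is hypothetical.

```python
import math

def pv_log_integral(n=200_000):
    """Midpoint-rule estimate of the integral of ln(x)/(x^2 - 1) over (0, inf).

    The map x -> 1/x carries (1, inf) onto (0, 1) with the same integrand,
    so the full integral is twice the integral over (0, 1).  The integrand
    has a finite limit (1/2) at x = 1, so no principal-value prescription
    is needed for this term."""
    h = 1.0 / n
    s = 0.0
    for i in range(n):
        x = (i + 0.5) * h          # midpoints never hit x = 0 or x = 1
        s += math.log(x) / (x * x - 1.0)
    return 2.0 * s * h

integral = pv_log_integral()

# re f = (2*omega/pi) * A * integral, with A = (7/(9*pi)) (Z alpha)^2 r_e / m;
# the number below is the coefficient of (Z alpha)^2 r_e omega/m in re f.
re_coeff = (2.0 / math.pi) * (7.0 / (9.0 * math.pi)) * integral

print("integral      :", integral, " pi^2/4 =", math.pi**2 / 4)
print("re f coeff.   :", re_coeff, " 7/18 =", 7 / 18)
```

The absence of a logarithm in the real part is visible in this form: the only term that depends on $\log(2\omega/m)$ multiplies a vanishing principal-value integral.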
The changes in the field equations correspond to the change in the field Lagrangian. The density $L$ of the Lagrangian is a relativistic invariant, and therefore can depend only on the invariants $\mathbf E^2-\mathbf H^2$ and $\mathbf E\cdot\mathbf H$. The usual expression

$$L_0 = (\mathbf E^2-\mathbf H^2)/8\pi \tag{129.2}$$

is the first term in the expansion of the general expression in powers of the invariants.

Let us derive the Lagrangian for the case where the fields $\mathbf E$ and $\mathbf H$ vary so slowly in space and time that they can be regarded as uniform and constant. Then $L$ may be assumed not to involve the derivatives of the fields. The necessary conditions for this will be discussed at the end of the section.

However, if the problem stated is to be meaningful, we must also assume the electric field to be sufficiently weak. The reason is that a uniform electric field can generate pairs from the vacuum. The field itself can be treated as a closed system only if the pair production probability is sufficiently small:

$$|\mathbf E|\ll m^2/|e|\qquad(= m^2c^3/|e|\hbar), \tag{129.3}$$

i.e. the change in the energy of a charge $e$ over a distance $\hbar/mc$ must be small in comparison with $mc^2$. We shall see below (cf. also Problem 2) that the pair production probability is then exponentially small.

If there is a magnetic field as well as an electric field, it is in general possible to choose a frame of reference in which $\mathbf E$ and $\mathbf H$ are parallel. Then the magnetic field does not influence the motion of the charge in the direction of $\mathbf E$. The condition (129.3) is to be satisfied in this frame, which will be the one used in the subsequent calculations.

The calculation of the Lagrangian begins with that of the change $W$ in the vacuum energy. This is given by the change in the "zero energy" (129.1) due to the field. From this, however, we must subtract the mean values of the potential energy of the electrons in the "states" of negative energy. The subtraction simply makes the total charge of the vacuum zero by definition.
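To give a feeling for the scale set by (129.3), the characteristic field $m^2c^3/|e|\hbar$ can be evaluated numerically. This is a minimal sketch using SI values of the constants (an assumption of this example; the text itself works in Gaussian units), with the conversion $1\ \mathrm{T} = 10^4\ \mathrm{G}$ for the magnetic counterpart of the same combination.

```python
# CODATA values in SI units
m_e = 9.1093837015e-31    # electron mass, kg
c = 2.99792458e8          # speed of light, m/s
e = 1.602176634e-19       # elementary charge, C
hbar = 1.054571817e-34    # reduced Planck constant, J s

# Characteristic electric field: e * E_cr * (hbar / m c) = m c^2
E_cr = m_e**2 * c**3 / (e * hbar)      # V/m

# The same combination read as a magnetic field (SI: B = E / c)
B_cr_tesla = E_cr / c
B_cr_gauss = B_cr_tesla * 1.0e4

# Energy gained by a charge e over one Compton length, in units of m c^2
energy_ratio = e * E_cr * (hbar / (m_e * c)) / (m_e * c**2)

print(f"E_cr = {E_cr:.3e} V/m")
print(f"B_cr = {B_cr_gauss:.3e} G")
print(f"energy gain over hbar/mc at E_cr, in units of mc^2: {energy_ratio:.6f}")
```

The condition (129.3) thus restricts the treatment to fields far below roughly $10^{18}$ V/m, which is satisfied by a wide margin for laboratory fields.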
The zero energy in the presence of the field is

$$\mathscr{E}_0 = -\sum_{p\sigma}\varepsilon^{(-)}_{p\sigma} = \sum_{p\sigma}\int\psi^{(-)*}_{p\sigma}\,i\frac{\partial}{\partial t}\,\psi^{(-)}_{p\sigma}\,d^3x, \tag{129.4}$$

where $\psi^{(-)}_{p\sigma}$ are the negative-frequency solutions of Dirac's equation in the field concerned. We shall assume that the integration is over a unit volume, and that the wave functions are normalized to unity in that volume; then $\mathscr{E}_0$ is the energy per unit volume.

† Here we shall write $\mathscr{E}$ in place of $E$, to avoid confusion with the electric field.

According to the preceding discussion, we have to subtract from $\mathscr{E}_0$ the quantity

$$\sum_{p\sigma}\int\psi^{(-)*}_{p\sigma}\,e\varphi\,\psi^{(-)}_{p\sigma}\,d^3x,$$

where $\varphi = -\mathbf E\cdot\mathbf r$ is the potential of the uniform field. According to the theorem on the differentiation of an operator with respect to a parameter (see QM, (11.16)), this quantity is

$$-\mathbf E\cdot\sum_{p\sigma}\partial\varepsilon^{(-)}_{p\sigma}/\partial\mathbf E = \mathbf E\cdot\partial\mathscr{E}_0/\partial\mathbf E.$$

Thus the total change in the vacuum energy density is

$$W = (\mathscr{E}_0-\mathbf E\cdot\partial\mathscr{E}_0/\partial\mathbf E) - (\mathscr{E}_0-\mathbf E\cdot\partial\mathscr{E}_0/\partial\mathbf E)_{\mathbf E=\mathbf H=0}. \tag{129.5}$$

We can relate $W$ to the change $L'$ in the Lagrangian density ($L = L_0+L'$) by using the general formula

$$W = \sum\dot q\,\partial L/\partial\dot q - L,$$

where $q$ represents the "generalized coordinates" of the field (see Fields, §32). For an electromagnetic field, the quantities $q$ are the potentials $\mathbf A$ and $\varphi$. Since

$$\mathbf E = -\dot{\mathbf A}-\nabla\varphi,\qquad \mathbf H = \operatorname{curl}\mathbf A, \tag{129.6}$$

$\dot{\mathbf A}$ is the only "velocity" $\dot q$ which appears in $L$, and the differentiation with respect to $\dot{\mathbf A}$ is equivalent to one with respect to $\mathbf E$; hence

$$W = \mathbf E\cdot\partial L'/\partial\mathbf E - L'. \tag{129.7}$$

Comparison of (129.5) and (129.7) gives

$$L' = -[\mathscr{E}_0-(\mathscr{E}_0)_{\mathbf E=\mathbf H=0}]. \tag{129.8}$$

Thus $L'$ can be calculated by means of the sum (129.1).

Let us first take the case where there is only a magnetic field. The "negative" energy levels of the electron (charge $e = -|e|$) in a constant uniform field $H_z = H$ are

$$-\varepsilon^{(-)}_{p\sigma} = -\sqrt{m^2+|e|H(2n+1+\sigma)+p_z^2},\qquad n = 0, 1, 2, \ldots;\ \sigma = \pm1 \tag{129.9}$$

(see §32, Problem).
To find the sum, we note that the number of states in the interval $dp_z$ is

$$\frac{|e|H}{2\pi}\,\frac{dp_z}{2\pi}$$

(see QM, §112); the first factor is the number of states with various values of $p_x$, which do not affect the energy. Moreover, all the levels except $n = 0$, $\sigma = -1$ are doubly degenerate, the levels $n$, $\sigma = +1$ and $n+1$, $\sigma = -1$ coinciding. Hence

$$-\mathscr{E}_0 = \frac{|e|H}{(2\pi)^2}\int_{-\infty}^{\infty}\left\{\sqrt{m^2+p_z^2}+2\sum_{n=1}^{\infty}\sqrt{m^2+2|e|Hn+p_z^2}\right\}dp_z. \tag{129.10}$$

The divergence of the integrals in (129.10) is eliminated in the calculation of $L'$ (129.8) by subtracting the value of the sum when $H = 0$. To carry out this "renormalization", it is convenient to calculate first the convergent expression

$$\Phi\equiv-\frac{\partial^2\mathscr{E}_0}{(\partial m^2)^2} = -\frac{|e|H}{8\pi^2}\left\{\frac{1}{m^2}+2\sum_{n=1}^{\infty}\frac{1}{m^2+2|e|Hn}\right\}.$$

The sum in the braces can be reduced to that of a geometrical progression, as follows:

$$\Phi = -\frac{|e|H}{8\pi^2}\int_0^\infty e^{-m^2\eta}\coth(|e|H\eta)\,d\eta. \tag{129.11}$$

To find $L'$, we must now integrate $\Phi$ twice with respect to $m^2$ and then subtract the value of the resulting quantity when $H = 0$. This gives

$$L' = -\frac{1}{8\pi^2}\int_0^\infty\frac{e^{-m^2\eta}}{\eta^3}\{\eta|e|H\coth(\eta|e|H)-1\}\,d\eta + c_1 + c_2m^2, \tag{129.12}$$

where $c_1$ and $c_2$ depend on $H$ but not on $m^2$. From considerations of dimensions and of parity with respect to $H$, it is evident that $L'$ as a function of $H$ and $m$ must have the form $L' = m^4f(H^2/m^4)$. Hence there can be no terms in $L'$ which are odd in $m^2$, and so $c_2 = 0$. The coefficient $c_1$ is given by the condition that the expansion of $L'$ in powers of $H^2$ begins with a term in $H^4$: a term in $H^2$ would simply alter the coefficient in the original Lagrangian $L_0 = -H^2/8\pi$, and this would essentially signify a changed definition of the field and therefore of the charge. The elimination of the $H^2$ terms thus corresponds to a renormalization of charge. It is easily verified that this is achieved by putting

$$c_1 = \frac{e^2H^2}{24\pi^2}\int_0^\infty\frac{e^{-m^2\eta}}{\eta}\,d\eta.$$

Finally, making the change of variable $m^2\eta\to\eta$ in (129.12), we have

$$L'(H;\,E = 0) = \frac{m^4}{8\pi^2}\int_0^\infty\frac{e^{-\eta}}{\eta^3}\{-\eta b\coth b\eta+1+\tfrac13b^2\eta^2\}\,d\eta, \tag{129.13}$$

where $b = |e|H/m^2$.
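The reduction of the level sum to a geometrical progression rests on writing each term as $1/A = \int_0^\infty e^{-A\eta}\,d\eta$, after which the weights 1, 2, 2, ... of the levels give $1 + 2\sum_{n\ge1}e^{-2xn}$ with $x = |e|H\eta$. The following is a small numerical sketch of that identity; the function name `level_sum` and the truncation `n_max` are choices made here for illustration only.

```python
import math

def level_sum(x, n_max=2000):
    """Truncated sum 1 + 2 * sum_{n>=1} exp(-2 x n).

    The single term comes from the non-degenerate level n = 0, sigma = -1;
    the factor 2 from the doubly degenerate levels with n >= 1."""
    return 1.0 + 2.0 * sum(math.exp(-2.0 * x * n) for n in range(1, n_max + 1))

def coth(x):
    return math.cosh(x) / math.sinh(x)

for x in (0.05, 0.3, 1.0, 2.5):
    print(f"x = {x:4.2f}   sum = {level_sum(x):.12f}   coth x = {coth(x):.12f}")
```

For the smallest value of `x` used here the neglected tail is of order $e^{-200}$, so the truncation does not affect the comparison.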
Let us now go to the general case where there is not only a magnetic field but also an electric field $\mathbf E$ parallel to it, satisfying the condition (129.3). To find $L'$ in this case it is not necessary to determine afresh the energy levels $\varepsilon^{(-)}_{p\sigma}$ of the electron in the field; we need only note that, if the wave function (the solution of the second-order equation (32.7)) is sought as a product

$$\psi = \psi_E(z)\,\psi_{n\sigma}(y),$$

where $\psi_{n\sigma}(y)$ is the wave function in the magnetic field when $E = 0$ and $p_z = 0$, then the mass $m$ and the field $H$ appear in the equation for $\psi_E(z)$ only in the combination $m^2+|e|H(2n+1+\sigma)$. If now the factor $|e|H/2\pi$ is again taken from the summation over $p_x$ (the energy levels being independent of $p_x$), the dimensional argument shows that

$$\Phi(H,E) = \frac{\partial^2L'}{(\partial m^2)^2} = \frac{|e|H}{8\pi^2}\sum_{n,\sigma}\frac{F\big([m^2+|e|H(2n+1+\sigma)]/|e|E\big)}{m^2+|e|H(2n+1+\sigma)} = \frac{b}{8\pi^2}\left[F\!\left(\frac1a\right)+2\sum_{n=1}^{\infty}\frac{F\big((1+2bn)/a\big)}{1+2bn}\right],\qquad a = |e|E/m^2; \tag{129.14}$$

each term in the sum is $-\partial^2\varepsilon^{(-)}/(\partial m^2)^2$ summed over all the quantum numbers except $n$. Here $F$ is a function as yet unknown, which will be derived from considerations of relativistic invariance.

$\Phi$ must be a function of the scalars $H^2-E^2$ and $(EH)^2 = (\mathbf E\cdot\mathbf H)^2$:

$$\Phi(H,E) = f(H^2-E^2,\,(EH)^2).$$

Hence

$$\Phi(0,E) = f(-E^2,0) = \Phi(iE,0).$$

The function $\Phi(iE,0)$ is obtained from (129.11) by putting $H\to iE$; after a change of notation for the variable of integration, this gives

$$\Phi(iE,0) = -\frac{1}{8\pi^2}\int_0^\infty e^{-\eta/a}\cot\eta\,d\eta. \tag{129.15}$$

The function $F$ can be found by comparing this expression with the limit $\Phi(H\to0,E)$ given by (129.14). The passage to the limit $H\to0$ in (129.14) can be effected by replacing the summation over $n$ by integration over $dn = dx/2b$:

$$\Phi(0,E) = \frac{1}{8\pi^2}\int_1^\infty F\!\left(\frac xa\right)\frac{dx}{x} = \frac{1}{8\pi^2}\int_{1/a}^\infty\frac{F(y)}{y}\,dy. \tag{129.16}$$

Equating the expressions (129.15) and (129.16) and differentiating with respect to $1/a\equiv z$, we find

$$F(z)/z = -\int_0^\infty e^{-\eta z}\,\eta\cot\eta\,d\eta.$$
The summation in (129.14) then reduces again to the summation of a geometric progression, and the subsequent calculations are similar to those given previously: we express $\Phi$ in terms of $m^2$, $E$ and $H$, integrate twice with respect to $m^2$, subtract the value for $E = H = 0$, and determine the constants of integration as in the derivation of (129.13). The final result is†

$$L' = \frac{m^4}{8\pi^2}\int_0^\infty\frac{e^{-\eta}}{\eta^3}\{-(\eta a\cot\eta a)(\eta b\coth\eta b)+1-\tfrac13\eta^2(a^2-b^2)\}\,d\eta, \tag{129.17}$$

$$a = |e|E/m^2\ (= |e|\hbar E/m^2c^3),\qquad b = |e|H/m^2\ (= |e|\hbar H/m^2c^3).$$

The parameters $a$ and $b$ may be written in the invariant form

$$a = -\frac{i|e|}{\sqrt2\,m^2}\{(\mathscr F+i\mathscr G)^{1/2}-(\mathscr F-i\mathscr G)^{1/2}\},\qquad b = \frac{|e|}{\sqrt2\,m^2}\{(\mathscr F+i\mathscr G)^{1/2}+(\mathscr F-i\mathscr G)^{1/2}\}, \tag{129.18}$$

where $\mathscr F$ and $\mathscr G$ denote the invariants

$$\mathscr F = \tfrac12(\mathbf H^2-\mathbf E^2),\qquad \mathscr G = \mathbf E\cdot\mathbf H,\qquad \mathscr F\pm i\mathscr G = \tfrac12(\mathbf H\pm i\mathbf E)^2. \tag{129.19}$$

When (129.17) is expressed in terms of the invariants $\mathscr F$ and $\mathscr G$, it becomes applicable in any frame of reference (not only in that where $\mathbf E\parallel\mathbf H$).

The formula (129.17) is written in a somewhat arbitrary manner. It is valid only if the electric field is small: $a\ll1$ (129.3); this condition is not shown explicitly in (129.17), but can be seen from the fact that the integrand in (129.17) has poles at $\eta = n\pi/a$ ($n = 1, 2, \ldots$), and the integral as written above has, strictly speaking, no meaning. Hence (129.17) can essentially be used only to derive the terms of the asymptotic series (see below) in powers of $a$ by a formal expansion of $\cot\eta a$.

The integral (129.17) can be given a mathematical meaning by passing round the poles in the complex $\eta$-plane. Then $L'$, and therefore the energy density $W$, have an imaginary part. Since the energy is complex, there are quasi-stationary states.‡ In the present case, the stationarity condition is violated by pair production, and $-2\operatorname{im}W$ is the probability $w$ of pair production per unit volume and time; since the small increments of $W$ and $L$ differ only in sign, the probability $w$, expressed in terms of $E$ and $H$, is simply

$$w = 2\operatorname{im}L'.$$
(129.20)

This is clearly proportional to $e^{-\pi/a}$ (see (129.22) below). Because $\operatorname{im}W$ is exponentially small when $a\ll1$, an asymptotic series in powers of $a$, retaining any finite number of terms, is meaningful.

† This was first derived by W. Heisenberg and H. Euler (1935). The analysis given above makes use also of the principles of a proof suggested by V. F. Weisskopf (1936).
‡ The direction of passage round the poles must be chosen so that $\operatorname{im}W<0$; this corresponds to the usual rule $m^2\to m^2-i0$ (i.e. here $a\to a+i0$).

Let us consider the limiting cases of formula (129.17). In weak fields ($a\ll1$, $b\ll1$), the leading terms of the expansion are

$$L' = \frac{e^4}{45\cdot8\pi^2m^4}\{(\mathbf E^2-\mathbf H^2)^2+7(\mathbf E\cdot\mathbf H)^2\}. \tag{129.21}$$

In particular, when $b = 0$ the relative correction is $L'/L_0 = \alpha a^2/45\pi$.

The imaginary part of $L'$ for $a\ll1$ is obtained from the integral (129.17) by taking half the residue at the pole of the cotangent nearest to the origin, i.e. at $\eta a = \pi-i0$. From (129.20), this gives the probability of pair production by a weak electric field:

$$w = \frac{m^4}{4\pi^3}\,a^2e^{-\pi/a},$$

or, in ordinary units,

$$w = \frac{1}{4\pi^3}\left(\frac{eE\hbar}{m^2c^3}\right)^2\frac{mc^2}{\hbar}\left(\frac{mc}{\hbar}\right)^3\exp\left(-\frac{\pi m^2c^3}{|e|\hbar E}\right). \tag{129.22}$$

In a strong magnetic field ($a = 0$, $b\gg1$), we start from (129.13), written (with $b\eta\to\eta$) as

$$L' = \frac{m^4b^2}{8\pi^2}\int_0^\infty\{-\eta\coth\eta+1+\tfrac13\eta^2\}\,e^{-\eta/b}\,\frac{d\eta}{\eta^3}.$$

When $b\gg1$, the important range in this integral is $1\ll\eta\ll b$, in which $e^{-\eta/b}\approx1$ and only the term $\tfrac13\eta^2$ in the braces need be retained, the range of integration being terminated (with logarithmic accuracy) at $\eta\sim1$ and $\eta\sim b$. Then

$$L' = (m^4b^2/24\pi^2)\log b; \tag{129.23}$$

in a more exact result, $\log b$ becomes $\log b-2.29$. The ratio $L'/L_0$ is here

$$L'/L_0 = -(\alpha/3\pi)\log b,$$

from which we see that the radiative corrections to the field equations may become of relative order unity only in exponentially strong fields:

$$H\sim(m^2/|e|)\,e^{3\pi/\alpha}. \tag{129.24}$$

The corrections calculated above are, nevertheless, meaningful: they remove the linearity of Maxwell's equations, and thus lead to effects which are in principle observable (e.g. scattering of light by light or in an external field).
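The structure of the weak-field limit, in particular the coefficient 7, can be recovered by expanding the braces of (129.17) in powers of $\eta$. The sketch below does this with exact rational arithmetic. It assumes the standard Taylor series of $x\cot x$ and $x\coth x$ (typed in by hand as `X_COT` and `X_COTH`) and the braces in the form $\{-(\eta a\cot\eta a)(\eta b\coth\eta b) + 1 - \tfrac13\eta^2(a^2-b^2)\}$; the function name `brace_coefficients` is hypothetical.

```python
from fractions import Fraction

# Taylor coefficients in powers of x^2 (orders x^0, x^2, x^4, x^6):
#   x*cot(x)  = 1 - x^2/3 - x^4/45 - 2 x^6/945 - ...
#   x*coth(x) = 1 + x^2/3 - x^4/45 + 2 x^6/945 - ...
X_COT = [Fraction(1), Fraction(-1, 3), Fraction(-1, 45), Fraction(-2, 945)]
X_COTH = [Fraction(1), Fraction(1, 3), Fraction(-1, 45), Fraction(2, 945)]

def brace_coefficients(order):
    """Coefficients of a^(2i) b^(2j) eta^(2(i+j)), with i + j == order (<= 3), in
    { -(eta a cot eta a)(eta b coth eta b) + 1 - eta^2 (a^2 - b^2)/3 }."""
    coeffs = {}
    for i in range(order + 1):
        j = order - i
        coeffs[(i, j)] = -X_COT[i] * X_COTH[j]
    if order == 0:
        coeffs[(0, 0)] += 1                 # the "+1" in the braces
    if order == 1:
        coeffs[(1, 0)] -= Fraction(1, 3)    # -eta^2 a^2 / 3
        coeffs[(0, 1)] += Fraction(1, 3)    # +eta^2 b^2 / 3
    return coeffs

# Orders eta^0 and eta^2 must cancel (this is the renormalization).
print("order 0:", brace_coefficients(0))
print("order 1:", brace_coefficients(1))

# Order eta^4: compare with [(a^2 - b^2)^2 + 7 a^2 b^2] / 45.
fourth = brace_coefficients(2)
target = {(2, 0): Fraction(1, 45), (1, 1): Fraction(-2 + 7, 45), (0, 2): Fraction(1, 45)}
print("order 2:", fourth, "matches target:", fourth == target)
```

Since $\int_0^\infty e^{-\eta}\eta\,d\eta = 1$, the $\eta^4$ term of the braces gives $L' = (m^4/8\pi^2)\,[(a^2-b^2)^2+7a^2b^2]/45$, which is the quartic form quoted for weak fields once $a$ and $b$ are expressed through the fields.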
The relation between the fields $\mathbf E$ and $\mathbf H$ and the potentials $\mathbf A$ and $\varphi$ remains, by definition, as before (129.6), and there is therefore also no change in the first pair of Maxwell's equations:

$$\operatorname{div}\mathbf H = 0,\qquad \operatorname{curl}\mathbf E = -\frac{\partial\mathbf H}{\partial t}. \tag{129.25}$$

The second pair of equations are obtained by varying the action

$$S = \int(L_0+L')\,d^4x$$

with respect to $\mathbf A$ and $\varphi$, and can be written

$$\operatorname{curl}(\mathbf H-4\pi\mathbf M) = \frac{\partial}{\partial t}(\mathbf E+4\pi\mathbf P), \tag{129.26}$$

$$\operatorname{div}(\mathbf E+4\pi\mathbf P) = 0, \tag{129.27}$$

with the notation

$$\mathbf P = \partial L'/\partial\mathbf E,\qquad \mathbf M = \partial L'/\partial\mathbf H. \tag{129.28}$$

Equations (129.25)-(129.27) agree in form with the macroscopic Maxwell's equations for a field in matter.† Hence we see that $\mathbf P$ and $\mathbf M$ signify the electric and magnetic polarization vectors of the vacuum. Note that $\mathbf P$ and $\mathbf M$ are zero for the field of a plane wave, where both invariants $\mathbf E^2-\mathbf H^2$ and $\mathbf E\cdot\mathbf H$ are zero. For a plane wave, therefore, the non-linear corrections are zero in a vacuum.

Lastly, let us consider the conditions for the above formulae to be valid. If the fields are to be regarded as constant, their relative changes over distances or times of the order of $1/m$ must be small; this ensures that the corrections to $L_0$ arising from the derivatives are small in comparison with $L_0$ itself. For instance, if the field is only time-dependent, this gives the obvious condition

$$\omega\ll m. \tag{129.29}$$

For a weak field, however, there is also a more stringent condition. This occurs because the fourth-order term (129.21) must be much larger than the correction to $L_0$ quadratic in the derivatives; otherwise, the fourth-order term would have no meaning. For example, in an electric field depending only on the time, this leads to the condition

$$\omega\ll m\,|e|E/m^2, \tag{129.30}$$

which is more stringent than (129.29).

† In making the comparison, it must be remembered that in macroscopic electrodynamics the mean value of the magnetic field is denoted by $\mathbf B$, not by $\mathbf H$ as here.
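The definitions (129.28) can be made concrete for the weak-field correction. The following sketch assumes $L' = \kappa\{(\mathbf E^2-\mathbf H^2)^2 + 7(\mathbf E\cdot\mathbf H)^2\}$, the quartic form that also appears in (127.26), with the constant $\kappa$ set to 1 for illustration. The analytic derivatives are compared with finite differences, and the vanishing of $\mathbf P$ and $\mathbf M$ for a plane wave is checked. All function names are introduced here and are not from the text.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def lagrangian_correction(E, H, kappa=1.0):
    """Assumed weak-field form: kappa * [(E^2 - H^2)^2 + 7 (E.H)^2]."""
    F = dot(E, E) - dot(H, H)
    G = dot(E, H)
    return kappa * (F * F + 7.0 * G * G)

def polarization(E, H, kappa=1.0):
    """P = dL'/dE and M = dL'/dH for the assumed L'."""
    F = dot(E, E) - dot(H, H)
    G = dot(E, H)
    P = tuple(kappa * (4.0 * F * e + 14.0 * G * h) for e, h in zip(E, H))
    M = tuple(kappa * (-4.0 * F * h + 14.0 * G * e) for e, h in zip(E, H))
    return P, M

def numerical_gradient(func, vec, step=1e-6):
    """Central-difference gradient of func with respect to a 3-vector."""
    grad = []
    for i in range(3):
        up = list(vec); up[i] += step
        dn = list(vec); dn[i] -= step
        grad.append((func(up) - func(dn)) / (2.0 * step))
    return tuple(grad)

E = (0.3, -0.2, 0.5)
H = (0.1, 0.4, -0.7)
P, M = polarization(E, H)
P_num = numerical_gradient(lambda e: lagrangian_correction(e, H), E)
M_num = numerical_gradient(lambda h: lagrangian_correction(E, h), H)
print("P analytic :", P, " numeric:", P_num)
print("M analytic :", M, " numeric:", M_num)

# Plane wave: E perpendicular to H with equal magnitudes, so both invariants vanish.
P_wave, M_wave = polarization((1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
print("plane wave :", P_wave, M_wave)
```

Because both $\mathbf P$ and $\mathbf M$ are proportional to one of the two invariants, a single plane wave propagates in a vacuum without any non-linear correction, as stated above; the effects appear only when waves cross or an external field is present.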
The condition (129.30) does not arise, however, in the problem of photon-photon scattering considered in the last part of §127. There, we are concerned from the start with a four-photon process only, described by the fourth-order terms in the Lagrangian, and the relative magnitude of the other terms in $L'$ is irrelevant. It is therefore sufficient if the condition (129.29) is satisfied.

PROBLEMS

PROBLEM 1. Determine the correction to the field of a small stationary charge $e_1$ due to the non-linearity of Maxwell's equations.

SOLUTION. For $\mathbf H = 0$, (129.21) gives

$$\mathbf P = \partial L'/\partial\mathbf E = (\alpha^2/90\pi^2m^4)\,E^2\mathbf E. \tag{1}$$

In the case of central symmetry, (129.27) gives

$$(E+4\pi P)\,r^2 = \text{constant} = e_1, \tag{2}$$

the value of the constant being obtained from the condition that as $r\to\infty$ the field is the Coulomb field of the charge $e_1$. An approximate solution of (2) is

$$E = (e_1/r^2)(1-2\alpha^2e_1^2/45\pi m^4r^4),$$

or

$$\varphi = (e_1/r)(1-2\alpha^2e_1^2/225\pi m^4r^4). \tag{3}$$

The correction in (3) that is non-linear in $e_1$ is to be distinguished from the linear correction in (114.6), due ultimately to the non-uniformity of the Coulomb field. The correction (3) is of a higher order in $\alpha$, but decreases more slowly with increasing distance and increases more rapidly with $e_1$.

PROBLEM 2. Estimate directly the probability of pair production in a weak uniform constant electric field in the quasi-classical approximation, to exponential accuracy (F. Sauter, 1931).

SOLUTION. The motion is quasi-classical in a weak field $\mathbf E$ (which has a slowly varying potential $\varphi = -\mathbf E\cdot\mathbf r = -Ez$). Since the reaction amplitude contains the wave function of the final positron as the initial "negative-frequency" function, pair production may be regarded as a transition of an electron from a "negative-frequency" to a "positive-frequency" state. In the former state, with the field present, the quasi-classical momentum is determined by the equation

$$\varepsilon = -\sqrt{p^2(z)+m^2}+|e|Ez, \tag{1}$$

and in the latter state by

$$\varepsilon = +\sqrt{p^2(z)+m^2}+|e|Ez. \tag{2}$$
The change from the first to the second state implies a passage through a potential barrier (the region of imaginary $p(z)$), which separates the regions where the functions (1) and (2) apply with real $p(z)$ for a given $\varepsilon$. The boundaries $z_1$ and $z_2$ of this barrier occur at $p(z) = 0$, i.e. $\varepsilon = -m+|e|Ez_1$, $\varepsilon = +m+|e|Ez_2$. The probability of passage through a quasi-classical barrier is

$$w\propto\exp\left(-2\int_{z_2}^{z_1}|p(z)|\,dz\right) = \exp\left(-\frac{4m^2}{|e|E}\int_0^1\sqrt{1-\xi^2}\,d\xi\right),$$

whence

$$w\propto\exp(-\pi m^2/|e|E),$$

in agreement with (129.22).

§ 130. Photon splitting in a magnetic field

The non-linear corrections in the electromagnetic field equations give rise to a number of specific effects in photon propagation in external fields.

In order to put these equations in a more familiar form (cf. the last footnote), we shall denote the electric and magnetic fields in this section by $\mathbf E$ and $\mathbf B$; $\mathbf D$ and $\mathbf H$ will denote the quantities

$$\mathbf D = \mathbf E+4\pi\mathbf P,\qquad \mathbf H = \mathbf B-4\pi\mathbf M,\qquad \mathbf P = \partial L'/\partial\mathbf E,\qquad \mathbf M = \partial L'/\partial\mathbf B.$$

Equations (129.25)-(129.27) then become

$$\operatorname{div}\mathbf B = 0,\qquad \operatorname{curl}\mathbf E = -\partial\mathbf B/\partial t,\qquad \operatorname{div}\mathbf D = 0,\qquad \operatorname{curl}\mathbf H = \partial\mathbf D/\partial t. \tag{130.1}$$

Let us consider photon propagation in a constant uniform magnetic field $\mathbf B_0$. Denoting by a prime the quantities which relate to the weak field of the electromagnetic wave, we have for these the equations

$$\mathbf k\times\mathbf H' = -\omega\mathbf D',\qquad \mathbf k\times\mathbf E' = \omega\mathbf B',\qquad \mathbf k\cdot\mathbf B' = 0,\qquad \mathbf k\cdot\mathbf D' = 0, \tag{130.2}$$

with

$$D'_i = \varepsilon_{ik}E'_k,\qquad B'_i = \mu_{ik}H'_k; \tag{130.3}$$

the vacuum permittivity and permeability tensors are functions of the external field $\mathbf B_0$. Assuming this field to be so weak that $|e|B_0/m^2\ll1$, we find from the Lagrangian (129.21)

$$\varepsilon_{ik} = \delta_{ik}+\frac{2e^4}{45\pi m^4}B_0^2\left(-\delta_{ik}+\tfrac72b_ib_k\right),\qquad \mu_{ik} = \delta_{ik}+\frac{2e^4}{45\pi m^4}B_0^2\,(\delta_{ik}+2b_ib_k), \tag{130.4}$$

where $\mathbf b = \mathbf B_0/B_0$. The photon frequency is assumed so small that $\omega\ll m$ (129.29). However, the structure of the tensors $\varepsilon_{ik}$ and $\mu_{ik}$ does not depend on this assumption; it follows from the invariance of quantum electrodynamics under spatial inversion and charge conjugation.
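The barrier integral of Problem 2 is simple enough to evaluate numerically. The sketch below assumes the quasi-classical relation $p^2(z) = (\varepsilon-|e|Ez)^2-m^2$, so that inside the barrier $|p(z)| = \sqrt{m^2-(\varepsilon-|e|Ez)^2}$; the units ($m = 1$), the field strength and the function name `barrier_exponent` are choices made for this illustration.

```python
import math

def barrier_exponent(m=1.0, eE=0.1, eps=0.0, n=200_000):
    """Midpoint-rule value of 2 * integral of |p(z)| dz across the barrier,
    where |p(z)| = sqrt(m^2 - (eps - eE*z)^2) between the turning points."""
    z_lo = (eps - m) / eE          # turning point of the positive-frequency branch
    z_hi = (eps + m) / eE          # turning point of the negative-frequency branch
    h = (z_hi - z_lo) / n
    s = 0.0
    for i in range(n):
        z = z_lo + (i + 0.5) * h
        # max(..., 0) guards against tiny negative values from rounding
        s += math.sqrt(max(m * m - (eps - eE * z) ** 2, 0.0))
    return 2.0 * s * h

for eps in (0.0, 0.7, -3.0):
    print(f"eps = {eps:5.2f}   exponent = {barrier_exponent(eps=eps):.6f}"
          f"   pi m^2/|e|E = {math.pi / 0.1:.6f}")
```

The exponent does not depend on $\varepsilon$, as it should for a uniform field: a change of $\varepsilon$ merely shifts the barrier along $z$.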
The first of these prevents the occurrence in $\mathbf D'$ of terms having the form constant $\times\,\mathbf B'$ or constant $\times\,\mathbf B_0(\mathbf B_0\cdot\mathbf B')$ (since inversion changes the sign of $\mathbf E$ and $\mathbf D$ but leaves $\mathbf H$ and $\mathbf B$ unchanged); the second prevents the occurrence in $\varepsilon_{ik}$ and $\mu_{ik}$ of terms antisymmetric and odd in $\mathbf B_0$, of the form $e_{ikl}B_{0l}$ (since charge conjugation changes the sign of all fields).

Since the problem under consideration has a distinctive plane, namely the $\mathbf{kb}$-plane, it is reasonable to take the linear polarizations in and normal to this plane as the two independent polarizations of the photon. The subscripts $\perp$ and $\parallel$ will denote polarizations in which the vector $\mathbf B'$ is respectively perpendicular to the $\mathbf{kb}$-plane and in that plane.

For perpendicular polarization, the vector $\mathbf H'$ is, like $\mathbf B'$, at right angles to the $\mathbf{kb}$-plane:

$$\mathbf B' = \left(1+\frac{2e^4}{45\pi m^4}B_0^2\right)\mathbf H'.$$

The vectors $\mathbf E'$ and $\mathbf D'$ are in that plane. Then, from equations (130.2), we obtain the photon dispersion relation $k = n_\perp\omega$, with the "refractive index" (in ordinary units)

$$n_\perp = 1+\frac{7e^4\hbar}{90\pi m^4c^7}B_0^2\sin^2\theta, \tag{130.5}$$

where $\theta$ is the angle between $\mathbf k$ and $\mathbf B_0$.†

In the second case, $\mathbf B'$ and $\mathbf H'$ are in the $\mathbf{kb}$-plane, $\mathbf E'$ and $\mathbf D'$ perpendicular to it. The refractive index is found to be

$$n_\parallel = 1+\frac{2e^4\hbar}{45\pi m^4c^7}B_0^2\sin^2\theta. \tag{130.6}$$

Note that $n_\perp\ge n_\parallel$. The equality occurs when $\theta = 0$, $n_\perp = n_\parallel = 1$.

The most interesting manifestation of the non-linearity of Maxwell's equations with radiative corrections is the splitting of a photon into two in an external magnetic field (S. L. Adler, J. N. Bahcall, C. G. Callan and M. N. Rosenbluth, 1970).

In a constant uniform field, this process occurs with conservation of energy and momentum.‡ In the decay of a photon $\mathbf k$ into photons $\mathbf k_1$ and $\mathbf k_2$, we have

$$\omega(\mathbf k) = \omega_1(\mathbf k_1)+\omega_2(\mathbf k_2),\qquad \mathbf k_1+\mathbf k_2 = \mathbf k. \tag{130.7}$$

For photons in a vacuum, in the absence of external fields, $\omega = k$ and the equations (130.7) can be satisfied only for three photons moving in the same direction.
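To see how small the vacuum birefringence is, the two refractive indices can be evaluated for a given field. The sketch below rewrites the common factor of (130.5) and (130.6) as $e^4\hbar B_0^2/\pi m^4c^7 = (\alpha/\pi)(B_0/B_{cr})^2$ with $B_{cr} = m^2c^3/|e|\hbar$, and assumes the numerical coefficients 7/90 and 2/45 for the $\perp$ and $\parallel$ polarizations; the constants and the function name `index_deviations` are introduced here for illustration. The function returns $n-1$ rather than $n$, since the deviations are far below double-precision resolution of numbers near 1 for laboratory fields.

```python
import math

ALPHA = 1.0 / 137.035999       # fine-structure constant
B_CR_GAUSS = 4.414e13          # m^2 c^3 / |e| hbar, in gauss

def index_deviations(b0_gauss, theta):
    """Return (n_perp - 1, n_par - 1) for a weak field B0 at angle theta to k.
    Assumed weak-field coefficients: 7/90 (perp) and 2/45 (par)."""
    x = (ALPHA / math.pi) * (b0_gauss / B_CR_GAUSS) ** 2 * math.sin(theta) ** 2
    return 7.0 / 90.0 * x, 2.0 / 45.0 * x

for b0 in (1.0e5, 1.0e9, 1.0e12):
    d_perp, d_par = index_deviations(b0, math.pi / 2)
    print(f"B0 = {b0:.1e} G   n_perp-1 = {d_perp:.3e}   n_par-1 = {d_par:.3e}")
```

The ratio of the two deviations is independent of the field strength and of the angle, and both vanish for propagation along the field, in line with the remark following (130.6).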
In that case, however, the decay is rigorously forbidden by the invariance under charge conjugation: Furry's theorem (§79) shows that the sum of diagrams with three photon free ends is zero.

† Expressing $\mathbf B'$ in terms of $\mathbf H'$ in the second equation (130.2), we substitute $\mathbf H'$ from there in the first equation, and then take the projection of the latter on the direction of $\mathbf b$. The product $\mathbf k\cdot\mathbf E'$ is expressed in terms of $\mathbf b\cdot\mathbf E'$ by means of the equation $\mathbf k\cdot\mathbf D' = 0$.
‡ The conservation of momentum is due to the spatial uniformity of the field, but of course occurs only for processes involving uncharged particles. The Lagrangian for charged particles contains not only the fields but also the field potentials, which depend on the coordinates even in a uniform field.

The presence of an external field makes the decay of the photon possible; this decay is represented by diagrams with three photon ends and one or more external-field lines. The possibility is, however, dependent on the nature of the photon polarization. The dependence may be deduced from the conservation laws (130.7) and the change in the photon dispersion relation in a magnetic field.

The dispersion relation may be written

$$\omega = k+\beta(\mathbf k), \tag{130.8}$$

where $\beta(\mathbf k)$ is an increment that is small (in a weak field). Its presence makes possible, in principle, the fulfilment of equations (130.7) for momenta $\mathbf k_1$ and $\mathbf k_2$ lying in a certain narrow cone near the direction of $\mathbf k$. Since the directions of all three vectors $\mathbf k$, $\mathbf k_1$, $\mathbf k_2$ are close together, they can all be regarded as parallel to $\mathbf k$ in the small terms $\beta(\mathbf k)$, and we can take $k_1+k_2 = k$. The law of conservation of energy then becomes

$$\beta(\boldsymbol\kappa k)-\beta_1(\boldsymbol\kappa k_1)-\beta_2(\boldsymbol\kappa k-\boldsymbol\kappa k_1) = k_1+|\mathbf k-\mathbf k_1|-k$$

(where $\boldsymbol\kappa = \mathbf k/k$); since the dispersion relation depends on the polarization of the photon, the functions $\beta$, $\beta_1$, $\beta_2$ may be different. Since

$$|\mathbf k-\mathbf k_1| = [(k-k_1)^2+2kk_1(1-\cos\vartheta)]^{1/2}\approx k-k_1+\frac{kk_1}{2(k-k_1)}\vartheta^2,$$
where $\vartheta$ is the small angle between $\mathbf k$ and $\mathbf k_1$, we have

$$\beta(\boldsymbol\kappa k)-\beta_1(\boldsymbol\kappa k_1)-\beta_2(\boldsymbol\kappa(k-k_1)) = kk_1\vartheta^2/2(k-k_1)>0. \tag{130.9}$$

This inequality specifies the properties of the dispersion relation that are necessary for decay.

For frequencies $\omega\ll m$, the dispersion relation is given by (130.5) and (130.6), so that $\beta(\mathbf k)\approx-k[n(\boldsymbol\kappa)-1]$, where the function $n(\boldsymbol\kappa)$ depends on the direction of the vector $\mathbf k$ but not on its magnitude. Then we must have

$$k_1n_1(\boldsymbol\kappa)+(k-k_1)n_2(\boldsymbol\kappa)-kn(\boldsymbol\kappa)>0. \tag{130.10}$$

Since $n_\perp>n_\parallel$, this condition immediately excludes the decays

$$\gamma_\perp\to\gamma_\parallel+\gamma_\parallel,\qquad \gamma_\perp\to\gamma_\parallel+\gamma_\perp,$$

where $\gamma$ denotes a photon, and $\perp$ and $\parallel$ correspond to the two polarizations defined above.†

† Numerical calculations show that $n_\perp>n_\parallel$ is true not only when $\omega\ll m$ and (130.5) and (130.6) are valid, but for all $\omega<2m$, the threshold for pair production by the photon.

For the decays

$$\gamma_\perp\to\gamma_\perp+\gamma_\perp,\qquad \gamma_\parallel\to\gamma_\parallel+\gamma_\parallel,$$

the left-hand side of (130.10) is zero, since the functions $n$, $n_1$, $n_2$ are the same. To solve the problem in this case, we have to take into account the dependence of the refractive index on $k$ which appears as $\omega$ increases. The required inequality is

$$k_1n(\boldsymbol\kappa,k_1)+(k-k_1)n(\boldsymbol\kappa,k-k_1)-kn(\boldsymbol\kappa,k)>0.$$

It can be shown by general arguments that $n(\boldsymbol\kappa,k)$ is an increasing function of $k$, and so this inequality cannot be satisfied, so that the above decays also are impossible: replacing $n(k-k_1)$ and $n(k_1)$ by $n(k)$ will certainly increase the sum, and the result of these changes is a sum equal to zero. This conclusion applies to any transparent media, and follows from Kramers and Kronig's formula for the refractive index (see ECM, §64). In the present case, the external field is a "transparent medium" for photons of all frequencies $\omega<2m$, up to the pair production threshold, i.e. the photon absorption threshold.

Thus the only decay processes allowed are

$$\gamma_\parallel\to\gamma_\perp+\gamma_\perp, \tag{130.11}$$

$$\gamma_\parallel\to\gamma_\parallel+\gamma_\perp.$$
(130.12)

It has already been noted that the momenta $\mathbf k_1$ and $\mathbf k_2$ are at small angles $\vartheta$ to the initial photon momentum $\mathbf k$. If these angles are neglected, i.e. if the momenta of all the photons are assumed to be parallel (the collinear approximation), then the decay (130.12) is impossible, as may be shown in the following way.

Similarly to (127.4), we express the decay amplitude as

$$M_{fi} = M_{\lambda\mu\nu}\,e^\lambda e_1^{\mu*}e_2^{\nu*},$$

where $e$, $e_1$, $e_2$ are the photon polarization 4-vectors, defined as usual by their 4-potentials $A$. With the three-dimensional potential gauge, $e = (0,\mathbf e)$, we can rewrite this as

$$M_{fi} = M_{ikl}\,e_ie^*_{1k}e^*_{2l}.$$

Two independent polarizations are defined by the unit vectors†

$$\mathbf e_\parallel\parallel\mathbf k\times\mathbf b,\qquad \mathbf e_\perp\parallel\mathbf k\times(\mathbf k\times\mathbf b). \tag{130.13}$$

† The suffixes $\parallel$ and $\perp$ refer to the polarizations defined above. It must be remembered that the unit vectors $\mathbf e$ determine the direction of the vector potential $\mathbf A$ (and therefore of the field $\mathbf E'$) and are perpendicular to $\mathbf B'$.

It is easy to see that, in the expansion

$$M_{ikl} = \sum_{\lambda,\lambda_1,\lambda_2}M_{\lambda\lambda_1\lambda_2}\,e_{\lambda i}\,e_{\lambda_1k}\,e_{\lambda_2l},$$

where $\lambda$, $\lambda_1$, $\lambda_2$ take the values $\perp$ and $\parallel$ (cf. (127.9)), the vectors $\mathbf e_\perp$ must occur an even number of times (0 or 2) in each term. The amplitude $M_{fi}$ is invariant under the CP transformation, and since the potentials $\mathbf A$ (and therefore $\mathbf e$) are CP-invariant, so must be the tensor $M_{ikl}$. Under the CP transformation $\mathbf e_\parallel\to\mathbf e_\parallel$, $\mathbf e_\perp\to-\mathbf e_\perp$; charge conjugation changes the sign of $\mathbf b$, inversion changes that of $\mathbf k$ while leaving the axial vector $\mathbf b$ unaltered. Hence, if the vector $\mathbf e_\perp$ occurs once in any term of the expansion, the corresponding scalar $M_{\lambda\lambda_1\lambda_2}$ must be CP-odd. But it is impossible to construct a CP-odd scalar from only two vectors, $\mathbf k$ (with $\mathbf k_1$ and $\mathbf k_2$ parallel to it in the collinear approximation) and $\mathbf b$, both of which change sign under the CP transformation. This proves the above statement.

The decay (130.12) is therefore forbidden in the collinear approximation.
A more detailed analysis shows that the ratio of the amplitude of this process to that of the decay (130.11) allowed in the collinear approximation is

$$\sim\vartheta^2\sim\alpha(B_0/B_{cr})^2, \tag{130.14}$$

where $B_{cr} = m^2/|e|$ ($= m^2c^3/|e|\hbar = 4.4\times10^{13}$ G); the angles $\vartheta$ are estimated from (130.9) as $\vartheta^2\sim n_\perp-n_\parallel$.

The fact that the only possible decay (in the principal approximation) is $\gamma_\parallel\to\gamma_\perp+\gamma_\perp$ implies that $\perp$ polarization is eventually established in an unpolarized photon beam propagating in a magnetic field.

Let us now calculate the decay amplitude $M_{fi} = M_{\parallel\perp\perp}$ by perturbation theory, assuming that $B_0\ll B_{cr}$. The first non-vanishing Feynman diagrams (with respect to $\alpha$, and with respect to the external field) are of the form (130.15) with all possible permutations of the ends, three ends corresponding to photons and one to the external field. In the collinear approximation, however, the amplitude corresponding to these diagrams is zero. For, as a result of gauge invariance, the external field can appear in the amplitude of the process only as a 4-tensor of its field strengths $F_{\mu\nu}$, and the photon polarization 4-vectors only in the antisymmetric combinations with the wave 4-vectors. The final expression for the amplitude is constructed from the external field tensor $F_{\mu\nu}$, the tensors $f_{\mu\nu}$, $f_{1\mu\nu}$ and $f_{2\mu\nu}$ of the three photons and their wave 4-vectors $k_\mu$, $k_{1\mu}$, $k_{2\mu}$; it must be linear in each of the tensors $f_{\mu\nu}$, and for the diagrams (130.15) it must be linear in $F_{\mu\nu}$ also. In the collinear approximation the 4-vectors $k_1$ and $k_2$ reduce to $k$: $k_1 = k\omega_1/\omega$, $k_2 = k\omega_2/\omega$. Under these conditions, any scalar product formed as stated above is identically zero: we can easily show that such a product will contain at least one zero factor $k^2$ or $ke$.
In the collinear approximation, therefore, the first non-zero contribution to the decay amplitude comes from hexagonal diagrams of the form

[diagram] (130.16)

with three external-field lines.† The amplitude corresponding to such diagrams involves three factors $F_{\mu\nu}$. Scalar products of this kind need not be zero, but all non-zero products contain the photon wave vectors only through the tensors $f_{\mu\nu}$: it is easy to see that the addition of further factors $k$ will lead to the presence of zero factors $k^2$ or $ke$ in the products.

The components of the tensor $f_{\mu\nu}$ are the same as those of the photon fields $\mathbf E'$ and $\mathbf B'$. This means that, if the decay amplitude corresponding to the diagrams (130.16) is represented as the matrix element of an operator, then that operator, expressed in terms of the photon field operators, is independent of the photon frequencies. Hence it follows in turn that the calculation of the scattering amplitude corresponding to the diagram (130.16), by means of the Lagrangian (129.17), gives the correct answer without restriction to the case $\omega\ll m$.

It has been shown at the end of §127 how the interaction Hamiltonian is obtained from the Lagrangian $L$ found in §129. We now have a process involving three photons, and the corresponding interaction operator is found from the terms in the expansion of $L$ which contain products of three photon fields $\mathbf E'$ and $\mathbf B'$. Here we need consider only the term

$$(\mathbf B'\cdot\mathbf B_0)(\mathbf E'\cdot\mathbf B_0)^2, \tag{130.17}$$

in which each of the vectors $\mathbf B'$ and $\mathbf E'$ appears as a scalar product with $\mathbf B_0$: the products $\mathbf E'^2$, $\mathbf B'^2$ and $\mathbf E'\cdot\mathbf B'$ arise, in the four-dimensional notation, from scalars of the form $f_{\mu\nu}f^{\mu\nu}$, which in the collinear approximation are identically zero. The selection of the term containing one factor $\mathbf B'$ and two factors $\mathbf E'$ is made because the process under consideration involves one $\parallel$ photon and two $\perp$ photons; for the former, the field $\mathbf B'$ has a component along $\mathbf B_0$, and for the latter the field $\mathbf E'$ has such a component.
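† The corrections arising from the inclusion of non-collinearity in the diagrams (130.15) would give a contribution to the amplitude that is of the next higher order in $\alpha$ relative to that from (130.16).

The Lagrangian $L$ is expressed in terms of the invariants $\mathscr F=\tfrac12(\mathbf B^2-\mathbf E^2)$ and $\mathscr G=\mathbf E\cdot\mathbf B$. The required term in the expansion comes from a term proportional to $\mathscr F\mathscr G^2$. A calculation using (129.17) gives for this term

$$-\frac{13e^6}{630\pi^2m^8}\,\mathscr F\mathscr G^2.$$

Putting $\mathbf B=\mathbf B_0+\mathbf B'$, $\mathbf E=\mathbf E'$, and taking the term $\mathbf B_0\cdot\mathbf B'$ from $\mathscr F$ and $\mathbf B_0\cdot\mathbf E'$ from $\mathscr G$, we get the required expansion term having the form (130.17). Thus the three-photon interaction operator for the decay $\gamma_\parallel\to\gamma_{1\perp}+\gamma_{2\perp}$ is

$$V^{(3)}=\frac{2\cdot13\,e^6}{630\pi^2m^8}\int(\mathbf B_0\cdot\hat{\mathbf E}_1^{+})(\mathbf B_0\cdot\hat{\mathbf E}_2^{+})(\mathbf B_0\cdot\hat{\mathbf B}')\,d^3x, \tag{130.18}$$

where

$$\hat{\mathbf B}'=i\sqrt{4\pi}\,\mathbf k\times\mathbf e_\parallel\,\hat c_{\mathbf k\parallel}\,e^{i(\mathbf k\cdot\mathbf r-\omega t)},\qquad \hat{\mathbf E}_1^{+}=-i\sqrt{4\pi}\,\omega_1\mathbf e_{1\perp}\,\hat c^{+}_{\mathbf k_1\perp}\,e^{-i(\mathbf k_1\cdot\mathbf r-\omega_1t)},$$

and similarly for $\hat{\mathbf E}_2^{+}$; cf. (127.26), (127.27).††

According to the rules given in §64, the decay amplitude $M_{fi}$ is calculated from the definition

$$S_{fi}=-i\langle f|\int V^{(3)}\,dt\,|i\rangle=i(2\pi)^4\delta^{(4)}(k-k_1-k_2)M_{fi},$$

and is

$$M_{fi}=-i\,\frac{2\cdot13\,e^6}{630\pi^2m^8}(4\pi)^{3/2}\,\omega\omega_1\omega_2B_0^3\sin^3\theta,$$

$\theta$ being the angle between $\mathbf k$ and $\mathbf B_0$. The decay probability per unit time is (see (64.11))

$$dw=(2\pi)^4\delta(\mathbf k-\mathbf k_1-\mathbf k_2)\,\delta(\omega-\omega_1-\omega_2)\,|M_{fi}|^2\cdot\frac12\cdot\frac{d^3k_1\,d^3k_2}{2\omega\cdot2\omega_1\cdot2\omega_2\,(2\pi)^6};$$

the extra factor $\tfrac12$ takes account of the decrease in the phase volume due to the identity of the two final photons. The first delta function is eliminated by integration over $d^3k_2$. To eliminate the second we note that, if dispersion is neglected,

$$\omega-\omega_1-\omega_2=k-k_1-|\mathbf k-\mathbf k_1|\approx-\frac{kk_1}{k-k_1}(1-\cos\vartheta_1),$$

and therefore‡

$$\int\!\!\int\omega\omega_1\omega_2\,\delta(\omega-\omega_1-\omega_2)\,d\cos\vartheta_1\cdot2\pi\omega_1^2\,d\omega_1=2\pi\int_0^\omega\omega_1^2(\omega-\omega_1)^2\,d\omega_1=\pi\omega^5/15.$$

†† The coefficient in (130.18) is doubled because $\mathbf E_1$ and $\mathbf E_2$ can be taken from either of the two factors $\mathbf E'$ in $L$.

‡ Here it is assumed that, with dispersion taken into account, the argument of the delta function in fact has a zero at some $\cos\vartheta_1<1$.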
Thus a dispersion of this type is necessary for decay to be possible, but the decay probability itself does not depend on the amount of the small dispersion.

We finally have for the total photon decay probability per unit time (in ordinary units)

$$w=\frac{\alpha^3}{15\pi^2}\left(\frac{13}{315}\right)^2\frac{mc^2}{\hbar}\left(\frac{\hbar\omega}{mc^2}\right)^5\left(\frac{B_0\sin\theta}{B_{cr}}\right)^6=0.18\,\alpha^6\,\frac{mc^2}{\hbar}\left(\frac{B_0^2\sin^2\theta}{8\pi mc^2}\right)^3\left(\frac{\hbar}{mc}\right)^9\left(\frac{\hbar\omega}{mc^2}\right)^5. \tag{130.19}$$

As already mentioned, the condition $\omega\ll m$ is not necessary for this formula to be valid. Its validity is restricted only by the condition that the terms corresponding to eighth-order diagrams be small. To obtain an estimate, we note that the eighth-order matrix element may contain, for example, a term differing from the sixth-order terms by a dimensionless invariant factor of the form $(eF_{\mu\nu}k^\nu/m^3)^2$. The condition for this to be small is a very weak one: $\omega\ll m(m^2/|e|B_0)$.

§131. Calculation of integrals over four-dimensional regions

We shall now give some rules and formulae that are useful in the calculation of integrals arising in the theory of radiative corrections.

The typical form of integral corresponding to a Feynman diagram is

$$\int\frac{f(k)\,d^4k}{a_1a_2\ldots a_n}, \tag{131.1}$$

where $a_1$, $a_2$, ... are second-degree polynomials in the 4-vector $k$, $f(k)$ is a polynomial of some degree $n'$, and the integration is over the whole of four-dimensional $k$-space.

A convenient method for calculating such integrals, due to Feynman (1949), is based on an initial transformation or parametrization of the integrand by the use of further integrations with respect to auxiliary variables $\xi_1$, $\xi_2$, ...:

$$\frac{1}{a_1a_2\ldots a_n}=(n-1)!\int_0^\infty d\xi_1\ldots\int_0^\infty d\xi_n\,\frac{\delta(\xi_1+\xi_2+\ldots+\xi_n-1)}{(a_1\xi_1+a_2\xi_2+\ldots+a_n\xi_n)^n}. \tag{131.2}$$

This transformation replaces the $n$ different quadratics in the denominator by the $n$th power of a single quadratic polynomial.
Eliminating the delta function by integrating over $d\xi_n$ and using new variables defined by

$$\xi_1=x_{n-1},\quad\xi_2=x_{n-2}-x_{n-1},\quad\ldots,\quad\xi_{n-1}=x_1-x_2,$$

we get as an equivalent form of (131.2)

$$\frac{1}{a_1a_2\ldots a_n}=(n-1)!\int_0^1dx_1\int_0^{x_1}dx_2\ldots\int_0^{x_{n-2}}dx_{n-1}\,\frac{1}{[a_1x_{n-1}+a_2(x_{n-2}-x_{n-1})+\ldots+a_n(1-x_1)]^n}. \tag{131.3}$$

When $n=2$, this formula becomes

$$\frac{1}{a_1a_2}=\int_0^1\frac{dx}{[a_1x+a_2(1-x)]^2}, \tag{131.4}$$

as can be easily verified. For any value of $n$, the formula can be proved by induction: on carrying out the integration over $dx_{n-1}$ in (131.3), we get on the right-hand side the difference of two $(n-2)$-fold integrals of the same form. If the formula is assumed valid for these, we have

$$\frac{1}{a_1-a_2}\left[\frac{1}{a_2a_3\ldots a_n}-\frac{1}{a_1a_3\ldots a_n}\right],$$

which is equal to the left-hand side of (131.3).

By differentiating (131.3) with respect to $a_1$, $a_2$, etc., we can derive similar formulae which can be used for the parametrization of integrals whose denominators contain any of the polynomials in powers above the first.

The divergent integrals are regularized by subtracting from them other integrals of similar form. To determine the difference, it may be convenient first to transform the difference of the integrands (each of which is already transformed by means of (131.2)) as follows:

$$\frac{1}{a^n}-\frac{1}{b^n}=-\int_0^1\frac{n(a-b)\,dz}{[(a-b)z+b]^{n+1}}. \tag{131.5}$$

After the application of (131.3), the four-dimensional integration in (131.1) becomes

$$\int\frac{f(k)\,d^4k}{[(k-l)^2-\alpha^2]^n}, \tag{131.6}$$

where $l$ is a 4-vector and $\alpha^2$ is a scalar, both of these depending on the parameters $x_1,\ldots,x_{n-1}$; the scalar $\alpha^2$ will be assumed positive. If the integral (131.6) converges, we can make the change of variables $k-l\to k$ (a shift of the origin), which gives (with a different function $f(k)$)

$$\int\frac{f(k)\,d^4k}{(k^2-\alpha^2)^n}, \tag{131.7}$$

the denominator now involving only the square $k^2$.
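The parametrization formulae (131.3) and (131.4) can be checked numerically for real positive $a_i$. The following is a minimal sketch, not part of the original text: the function names `feynman_n2` and `feynman_n3` and the use of a simple midpoint rule are illustrative assumptions. It compares the parametric integrals for $n=2$ and $n=3$ with the products $1/(a_1a_2)$ and $1/(a_1a_2a_3)$.

```python
def feynman_n2(a1, a2, steps=20000):
    """Midpoint-rule estimate of the right-hand side of (131.4)."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += 1.0 / (a1 * x + a2 * (1.0 - x)) ** 2
    return total * h


def feynman_n3(a1, a2, a3, steps=400):
    """Midpoint-rule estimate of the right-hand side of (131.3) for n = 3:
    2! * int_0^1 dx1 int_0^{x1} dx2 [a1*x2 + a2*(x1 - x2) + a3*(1 - x1)]^(-3).
    """
    h_outer = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x1 = (i + 0.5) * h_outer
        h_inner = x1 / steps  # the inner range [0, x1] is split into `steps` cells
        inner = 0.0
        for j in range(steps):
            x2 = (j + 0.5) * h_inner
            inner += 1.0 / (a1 * x2 + a2 * (x1 - x2) + a3 * (1.0 - x1)) ** 3
        total += inner * h_inner
    return 2.0 * total * h_outer  # factor (n - 1)! = 2


if __name__ == "__main__":
    print(feynman_n2(1.0, 2.0), 1.0 / (1.0 * 2.0))
    print(feynman_n3(1.0, 2.0, 3.0), 1.0 / (1.0 * 2.0 * 3.0))
```

The sample values $a_1=1$, $a_2=2$, $a_3=3$ are arbitrary; in an actual Feynman integral the $a_i$ are quadratic polynomials in $k$, for which the same identity is applied pointwise.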
In the numerator we need only consider scalar functions $f=F(k^2)$: for integrals having numerators of any other form, we have

$$\int\frac{k^\mu F(k^2)\,d^4k}{(k^2-\alpha^2)^n}=0, \tag{131.8}$$

$$\int\frac{k^\mu k^\nu F(k^2)\,d^4k}{(k^2-\alpha^2)^n}=\tfrac14g^{\mu\nu}\int\frac{k^2F(k^2)\,d^4k}{(k^2-\alpha^2)^n}, \tag{131.9}$$

$$\int\frac{k^\mu k^\nu k^\rho k^\sigma F(k^2)\,d^4k}{(k^2-\alpha^2)^n}=\tfrac1{24}(g^{\mu\nu}g^{\rho\sigma}+g^{\mu\rho}g^{\nu\sigma}+g^{\mu\sigma}g^{\nu\rho})\int\frac{(k^2)^2F(k^2)\,d^4k}{(k^2-\alpha^2)^n}, \tag{131.10}$$

and so on, as is evident from symmetry on integration over all directions of $k$.

In the original integral (131.1), each of the factors $a_1$, $a_2$, ... in the denominator has (as a function of $k_0$) two zeros, which are avoided in the integration over $dk_0$ according to the general rule (§75). After the transformation to the form (131.7), the $2n$ simple poles of the integrand are replaced by two poles of order $n$, which are avoided according to the same rule (path $C$ in Fig. 25). By moving the contour of integration as shown by the arrows, it can be converted to the imaginary axis in the $k_0$ plane (path $C'$). Thus the variable $k_0$ is replaced by $ik_0'$ with $k_0'$ real. Then, writing also $\mathbf k'$ for $\mathbf k$, we have

$$k^2=k_0^2-\mathbf k^2\to-(k_0'^2+\mathbf k'^2)=-k'^2, \tag{131.11}$$

where $k'$ is a 4-vector in the Euclidean metric; and

$$d^4k\to i\,d^4k'=ik'^2\,d(\tfrac12k'^2)\,d\Omega,$$

where $d\Omega$ is an element of four-dimensional solid angle. The integration over $d\Omega$ gives $2\pi^2$ (see Fields, §111), and so

$$d^4k\to i\pi^2k'^2\,d(k'^2). \tag{131.12}$$

[Fig. 25: integration contours $C$ and $C'$ in the complex $k_0$ plane]

Putting $k'^2=z$, we have finally

$$\int\frac{F(k^2)\,d^4k}{(k^2-\alpha^2)^n}=(-1)^ni\pi^2\int_0^\infty\frac{F(-z)\,z\,dz}{(z+\alpha^2)^n}. \tag{131.13}$$

In particular,

$$\int\frac{d^4k}{(k^2-\alpha^2)^n}=\frac{(-1)^ni\pi^2}{\alpha^{2(n-2)}(n-1)(n-2)}. \tag{131.14}$$

The logarithmically divergent part of the integrals (131.7) can be separated as

$$\int\frac{d^4k}{[(k-l)^2-\alpha^2]^2}. \tag{131.15}$$

It is easily seen that in an integral of this type we can replace $k$ by $k+l$: the difference is

$$\int\left\{\frac{1}{[(k-l)^2-\alpha^2]^2}-\frac{1}{(k^2-\alpha^2)^2}\right\}d^4k$$

and is a convergent integral, so that the change $k\to k+l$ is certainly permissible in this integral. On doing so and also changing the sign of $k$, we get the same quantity with the opposite sign, and it must therefore be zero.
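The closed form (131.14) follows from (131.13) with $F=1$, and the remaining one-dimensional integral over $z$ is easily checked numerically. The sketch below is illustrative only: the function names and the substitution $z=\alpha^2t/(1-t)$, which maps the range $0\le z<\infty$ onto $0\le t<1$, are assumptions made for the purpose of the check and are valid for $n\ge3$.

```python
import math


def radial_integral(n, alpha2, steps=20000):
    """Midpoint estimate of int_0^inf z dz / (z + alpha2)^n for n >= 3,
    using the substitution z = alpha2 * t / (1 - t), 0 <= t < 1."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        z = alpha2 * t / (1.0 - t)
        dz_dt = alpha2 / (1.0 - t) ** 2
        total += z / (z + alpha2) ** n * dz_dt
    return total * h


def closed_form_radial(n, alpha2):
    """The value 1 / [alpha^(2(n-2)) (n-1)(n-2)] implied by (131.14)."""
    return 1.0 / (alpha2 ** (n - 2) * (n - 1) * (n - 2))


def rotated_integral(n, alpha2):
    """Right-hand side of (131.13) with F = 1: (-1)^n * i * pi^2 * radial part."""
    return (-1) ** n * 1j * math.pi ** 2 * radial_integral(n, alpha2)


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(n, radial_integral(n, 2.0), closed_form_radial(n, 2.0))
```

For $n=3$ and $\alpha^2=1$ the function `rotated_integral` gives approximately $-i\pi^2/2$, the value used in evaluating integrals of the type $\int d^4k/(k^2-\alpha^2)^3$.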
A linearly divergent integral would have to have the form

$$\int\frac{k^\mu\,d^4k}{[(k-l)^2-\alpha^2]^2}, \tag{131.16}$$

but in fact such an integral is only logarithmically divergent: the integrand tends asymptotically to $k^\mu/(k^2)^2$ as $k\to\infty$ and gives zero on averaging over directions. The change of origin, however, adds a constant to the integral (131.16). This can be shown for the infinitesimal change $k\to k+\delta l$, by calculating the difference

$$\Delta^\mu=\int\left\{\frac{(k+\delta l)^\mu}{[(k+\delta l)^2-\alpha^2]^2}-\frac{k^\mu}{(k^2-\alpha^2)^2}\right\}d^4k. \tag{131.17}$$

As far as the first order in $\delta l$,

$$\Delta^\mu=\int\left\{-\frac{4k^\mu(k\,\delta l)}{(k^2-\alpha^2)^3}+\frac{\delta l^\mu}{(k^2-\alpha^2)^2}\right\}d^4k.$$

In the first term, averaging over directions replaces the numerator by $k^2\delta l^\mu$ (cf. (131.9)), and hence†

$$\Delta^\mu=-\alpha^2\delta l^\mu\int\frac{d^4k}{(k^2-\alpha^2)^3}=\tfrac12i\pi^2\delta l^\mu. \tag{131.18}$$

† The corresponding result for finite $l$ can be obtained by more laborious calculations.

In the final expressions for the radiative corrections there is frequently a transcendental function defined by the integral

$$F(\xi)=\int_0^\xi\frac{\log(1+x)}{x}\,dx, \tag{131.19}$$

sometimes called Spence's function. Some of its properties (given here for reference) are

$$F(\xi)+F(1/\xi)=\tfrac16\pi^2+\tfrac12\log^2\xi, \tag{131.20}$$

$$F(-\xi)+F(-1+\xi)=-\tfrac16\pi^2+\log\xi\,\log(1-\xi), \tag{131.21}$$

$$F(1)=\tfrac1{12}\pi^2,\qquad F(-1)=-\tfrac16\pi^2. \tag{131.22}$$

The expansion for small $\xi$ is

$$F(\xi)=\xi-\tfrac14\xi^2+\tfrac19\xi^3-\tfrac1{16}\xi^4+\ldots \tag{131.23}$$

CHAPTER XIII

ASYMPTOTIC FORMULAE OF QUANTUM ELECTRODYNAMICS

§132. Asymptotic form of the photon propagator for large momenta

The first-order term (with respect to $\alpha$) in the expansion of the polarization operator $\mathscr P(k^2)$ has been calculated in §113, and it was found that for $|k^2|\gg m^2$ this term is, with logarithmic accuracy,

$$\mathscr P(k^2)=\frac{\alpha}{3\pi}k^2\log\frac{|k^2|}{m^2}. \tag{132.1}$$

It was also mentioned that the derivation of this formula (as a first-approximation correction to the propagator $4\pi D^{-1}=k^2$) assumed the condition

$$\frac{\alpha}{3\pi}\log\frac{|k^2|}{m^2}\ll1, \tag{132.2}$$

which placed an upper limit on the permissible values of $|k^2|$. We shall now show that the expression (132.1) is in fact valid also with the much weaker condition

$$\frac{\alpha}{3\pi}\log\frac{|k^2|}{m^2}\lesssim1.$$
(132.3)

The proof is as follows.† We first note that, although in principle the condition (132.3) allows contributions to $\mathscr P(k^2)$ from the terms of all orders (in $\alpha$) in the perturbation-theory series, in every order $n$ only the terms $\sim\alpha^n\log^n(|k^2|/m^2)$ need be considered, which contain the large logarithm in the same power as $\alpha$; the terms containing lower powers of the logarithm are certainly small, since $\alpha\ll1$.

† This formulation and conclusions are due to L. D. Landau, A. A. Abrikosov and I. M. Khalatnikov (1954).

The analysis of the perturbation-theory series for $\mathscr P$ can be reduced to that of the series for $\mathscr G$ and $\Gamma$, using Dyson's equation

$$\mathscr P(k^2)=i\,\frac{4\pi\alpha}{3}\,\mathrm{tr}\int\gamma_\mu\mathscr G(p+k)\Gamma^\mu(p+k,p;k)\mathscr G(p)\,\frac{d^4p}{(2\pi)^4}; \tag{132.4}$$

see (107.4). Since the function $\mathscr P(k^2)$ is gauge-invariant, it can be calculated with any gauge for $\mathscr G$ and $\Gamma$. The most convenient here is the Landau gauge, in which the free-photon propagator has the form (76.11):

$$D_{\mu\nu}(k)=\frac{4\pi}{k^2}\left(g_{\mu\nu}-\frac{k_\mu k_\nu}{k^2}\right) \tag{132.5}$$

($D^{(l)}=0$ in (103.17)). It is found that in this gauge the perturbation-theory series for $\mathscr G$ and $\Gamma^\mu$ do not contain any terms with the necessary power of the logarithms. In (132.4) it is therefore sufficient to substitute the zero-order approximations $\mathscr G=G$, $\Gamma^\mu=\gamma^\mu$, obtaining

$$\mathscr P(k^2)=i\,\frac{4\pi\alpha}{3}\,\mathrm{tr}\int\gamma_\mu G(p+k)\gamma^\mu G(p)\,\frac{d^4p}{(2\pi)^4}. \tag{132.6}$$

This is the Feynman integral corresponding to the diagram (113.1) of the first approximation (with respect to $\alpha$), and leads (after appropriate renormalization) to (132.1).

In order to prove the foregoing statements, let us first examine the origin of the logarithm in the integral (132.6). It is easily seen that the logarithmic term comes from the range of integration

$$p^2\gg|k^2| \tag{132.7}$$

when $|k^2|\gg m^2$: formally expanding $G$ in powers of $1/\gamma p$, we have

$$G(p)\approx\frac{1}{\gamma p}=\frac{\gamma p}{p^2},$$

$$G(p-k)\approx\frac{1}{\gamma p-\gamma k}=\frac{1}{\gamma p}+\frac{1}{\gamma p}\gamma k\frac{1}{\gamma p}+\frac{1}{\gamma p}\gamma k\frac{1}{\gamma p}\gamma k\frac{1}{\gamma p}=\frac{\gamma p}{p^2}+\frac{(\gamma p)(\gamma k)(\gamma p)}{(p^2)^2}+
$(\gamma p)(\gamma k)(\gamma p)(\gamma k)(\gamma p)/(p^2)^3$.

On substitution in (132.6) the first term, which is independent of $k$, is removed by regularization (in accordance with the condition $\mathscr P/k^2\to0$ when $k^2\to0$); the second term gives zero on integration over the directions of $p$; the third integral is logarithmically divergent with respect to $p^2$, and on taking it from $p^2\sim|k^2|$ (the lower limit of the range (132.7)) to some "cut-off parameter" $\Lambda^2$ we get

$$\mathscr P(k^2)=-\frac{\alpha}{3\pi}k^2\log\frac{\Lambda^2}{|k^2|}. \tag{132.8}$$

In regularizing, we have to subtract from $\mathscr P/k^2$ its value for $k^2=0$. But, since logarithmic accuracy presupposes the condition $|k^2|\gg m^2$, in calculation to this accuracy the regularization is achieved by subtracting the value for $|k^2|\sim m^2$, and $\Lambda^2$ in the argument of the logarithm is replaced by $m^2$, giving (132.1).

Since the desired corrections to $\mathscr G$ and $\Gamma$ are logarithmic, their inclusion makes these quantities differ from $G$ and $\gamma^\mu$ by slowly varying logarithmic factors. In the exact integral (132.4), therefore, the important range will be the same one (132.7) as in the approximate integral (132.6). However, we cannot simply put $k=0$ in $\Gamma^\mu(p+k,p;k)$: since the integral is quadratically divergent, its regularization involves also the next two terms in the expansion of $\Gamma^\mu(p+k,p;k)$ in powers of $k$. Here we shall consider only the corrections to $\Gamma^\mu(p,p;0)$, which show sufficiently clearly the importance of the choice of gauge and the difference in the nature of the integrals arising from diagrams of different types. It may also be noted that a similar analysis of $\mathscr G$ is not needed, since the corrections to $\Gamma$ and $\mathscr G$ are related by Ward's identity (108.8).

The first-order correction (with respect to $\alpha$) to $\Gamma(p,p;0)$ corresponds to the diagram

[diagram]

and hence to the integral†

$$\Gamma^{\mu(1)}=-i\alpha\int\gamma^\lambda G(p_1)\gamma^\mu G(p_1)\gamma^\nu D_{\lambda\nu}(p-p_1)\,\frac{d^4p_1}{(2\pi)^4}. \tag{132.9}$$

In the ordinary gauge, $D_{\lambda\nu}(p-p_1)=g_{\lambda\nu}\cdot4\pi/(p-p_1)^2$, and the important range in the integral is $p_1^2\gg p^2$, where it is logarithmically divergent.
On calculating the integral

$$\Gamma^{\mu(1)}\approx-4\pi\alpha i\int\frac{\gamma^\lambda(\gamma p_1)\gamma^\mu(\gamma p_1)\gamma_\lambda}{(p_1^2)^3}\,\frac{d^4p_1}{(2\pi)^4} \tag{132.10}$$

and regularizing the logarithm, we get

$$\Gamma^{\mu(1)}=-\frac{\alpha}{4\pi}\gamma^\mu\log\frac{p^2}{m^2}.$$

In the Landau gauge, (132.10) is replaced by

$$\Gamma^{\mu(1)}\approx-4\pi\alpha i\int\{\gamma^\lambda(\gamma p_1)\gamma^\mu(\gamma p_1)\gamma_\lambda-p_1^2\gamma^\mu\}\,\frac{d^4p_1}{(p_1^2)^3(2\pi)^4}.$$

Averaging over the directions of $p_1$ and reducing the matrices $\gamma$, we find that this integral is zero and the logarithmic term in $\Gamma^{\mu(1)}$ disappears.†

† (to (132.9)) To avoid misunderstandings in a comparison with the results of §117, we may note that in §117 the two electron ends of the diagram were assumed to be physical, whereas here we assume that $p^2\gg|k^2|\gg m^2$, and these two lines are therefore certainly non-physical.

† The corrections to $\mathscr G^{-1}$ in both gauges, as derived from the correction $\Gamma^{(1)}$ by means of the identity (108.8), are of course in agreement with the results of §119.

In the second-order corrections (with respect to $\alpha$) we take the diagram

[diagram]

The corresponding integral is

$$\Gamma^{\mu(2)}=-\alpha^2\int\gamma^\lambda G(p_2)\gamma^\nu G(p_1)\gamma^\mu G(p_1)\gamma^\rho G(p_2)\gamma^\sigma\,D_{\nu\rho}(p_2-p_1)D_{\lambda\sigma}(p-p_2)\,\frac{d^4p_1\,d^4p_2}{(2\pi)^8}.$$

In the ordinary gauge of the $D$ functions, this integral includes a term in which the logarithm is squared, arising from the range of integration

$$p_1^2\gg p_2^2\gg p^2; \tag{132.11}$$

when $p_2$ is neglected in the argument of $D_{\nu\rho}(p_2-p_1)$, the integration over $d^4p_1$ becomes the same as in (132.9) and gives $\log p_2^2$, and the subsequent integration over $d^4p_2$ is likewise logarithmic, giving $\log^2(p^2/m^2)$. When the Landau gauge is used for the $D$ functions, however, the logarithmic terms disappear from both integrations.

A similar situation occurs for all the other diagrams in the skeleton diagram

[diagram] (132.12)

Diagrams of other types, with intersecting photon lines, for example those in the skeleton diagram

[diagram] (132.13)

(cf.
(106.11)), do not in any gauge contain terms involving the necessary power of the logarithm; there is no range of the variables in which the integral reduces to several successive logarithmic integrations.

These arguments, and similar ones for the subsequent terms in the expansion of $\Gamma$ in powers of $k$, confirm that in the Landau gauge there are no corrections to $\mathscr G$ and $\Gamma$ involving the necessary powers of the logarithm; thus the expression (132.1) is in fact valid even if the condition (132.3) applies.

The function $\mathscr D(k^2)$ which corresponds to the polarization operator (132.1) is

$$\mathscr D(k^2)=\frac{4\pi}{k^2}\,\frac{1}{1-(\alpha/3\pi)\log(|k^2|/m^2)}. \tag{132.14}$$

Because of the condition (132.3), there is no need to expand this expression in powers of $\alpha$.

§133. The relation between unrenormalized and actual charges

The applicability of (132.14) is, however, limited on the side of large $|k^2|$, because the denominator decreases. The derivation of that formula was based on neglecting the diagram (132.13), and others with even more intersecting thick photon lines, in comparison with (132.12). But each such added line brings into the diagram a factor $e^2\mathscr D$, with $\mathscr D$ the exact propagator. The small parameter is then not $\alpha=e^2$ but

$$\frac{\alpha}{1-(\alpha/3\pi)\log(|k^2|/m^2)}. \tag{133.1}$$

When $|k^2|$ increases and this quantity becomes comparable with unity, the small parameter essentially disappears from the theory.

The resulting situation can be more clearly understood if the renormalization in the derivation of (132.14) is made not "en route" but through the use of an "intrinsic" electron charge $e_c$, which will here be chosen so as to give the correct value of the observed physical charge $e$ (§110). If the integral is "cut off", as was done previously, at some upper limit $\Lambda^2$, the intrinsic charge is a function $e_c(\Lambda^2)$ and the limit $\Lambda\to\infty$ must finally be taken.
With this treatment, the polarization operator is

$$\mathscr P(k^2)=-\frac{e_c^2}{3\pi}k^2\log\frac{\Lambda^2}{|k^2|}$$

(the expression (132.8) with $e_c$ in place of $e$), and therefore

$$\mathscr D(k^2)=\frac{4\pi}{k^2}\,\frac{1}{1+(e_c^2/3\pi)\log(\Lambda^2/|k^2|)}. \tag{133.2}$$

Now determining the physical charge $e$ from the condition $e_c^2\mathscr D(k^2)\to4\pi e^2/k^2$ when $|k^2|\sim m^2$, we have

$$e^2=\frac{e_c^2}{1+(e_c^2/3\pi)\log(\Lambda^2/m^2)}, \tag{133.3}$$

or

$$e_c^2=\frac{e^2}{1-(e^2/3\pi)\log(\Lambda^2/m^2)}. \tag{133.4}$$

If we formally take the limit $\Lambda\to\infty$ in (133.3), then $e^2\to0$ whatever the form of the function $e_c^2(\Lambda)$. This "zeroizing" of the charge means, of course, that no rigorous renormalization is possible. The limit cannot be taken, however, without violating the assumptions made in deriving (133.3). It is seen from (133.4) that, as $\Lambda$ increases (for a given value of $e^2$), $e_c^2$ also increases; and the formulae cease to be valid even when $e_c^2\sim1$, since their derivation is based on the assumption that

$$e_c^2\ll1 \tag{133.5}$$

as the condition for perturbation theory to be applicable to the "intrinsic" interaction.

The failure of the inequality (133.5) to be satisfied as $\Lambda$ increases is of fundamental significance. It shows that quantum electrodynamics is logically incomplete as a theory that is based on weak interaction. This essentially implies that the existing theory as a whole is logically incomplete, since its entire formalism is based on the possibility of treating the electromagnetic interaction as a weak perturbation. All the quantities calculated by the theory are found as series in powers of $e_c^2$, which are in fact asymptotic series. In order for them to have a definite meaning when $e_c^2$ is not small, further arguments would be needed which do not follow from the general principles of the existing theory.

It must also be emphasized, however, that in quantum electrodynamics these difficulties cannot be more than theoretical ones.
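The relations (133.3) and (133.4) are inverse to each other, and the growth of $e_c^2$ with $\Lambda$ can be made concrete by a short numerical sketch. The code below is illustrative only: the function names, and the value $1/137.036$ taken for the physical $e^2$ (in units $\hbar=c=1$), are assumptions. The last function evaluates the scale at which the denominator of (133.4) would formally vanish; as stated above, the formula itself ceases to be valid before that point, so the number serves only as an order-of-magnitude indication.

```python
import math

ALPHA = 1.0 / 137.036  # assumed value of the physical e^2 (hbar = c = 1)


def bare_from_physical(e2, log_ratio):
    """e_c^2 from (133.4); log_ratio stands for log(Lambda^2 / m^2)."""
    return e2 / (1.0 - e2 / (3.0 * math.pi) * log_ratio)


def physical_from_bare(ec2, log_ratio):
    """e^2 from (133.3); log_ratio stands for log(Lambda^2 / m^2)."""
    return ec2 / (1.0 + ec2 / (3.0 * math.pi) * log_ratio)


def log10_formal_pole(e2):
    """log10(Lambda / m) at which the denominator of (133.4) formally vanishes,
    i.e. log(Lambda^2 / m^2) = 3 * pi / e^2."""
    return 3.0 * math.pi / (2.0 * e2) / math.log(10.0)


if __name__ == "__main__":
    for log_ratio in (10.0, 100.0, 1000.0):
        ec2 = bare_from_physical(ALPHA, log_ratio)
        print(log_ratio, ec2, physical_from_bare(ec2, log_ratio))
    print("log10(Lambda/m) at the formal pole:", log10_formal_pole(ALPHA))
```

With the assumed value of $e^2$, the formal pole lies at $\Lambda/m$ of order $10^{280}$, and $e_c^2$ differs appreciably from $e^2$ only when $\log(\Lambda^2/m^2)$ reaches several hundred.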
They arise at enormous energies that are of no practical significance.† We may expect that in reality the electromagnetic interactions will very much sooner be "merged" with the weak and strong interactions, so that pure electrodynamics is no longer meaningful.‡

† For example, the equation $(\alpha/\pi)\log(\varepsilon^2/m^2)=1$ is satisfied when $\varepsilon\sim10^{93}m$.

‡ The opposite situation occurs in theories where the interaction between particles is mediated not by the electromagnetic field but by Yang-Mills fields. The relation between the renormalized and unrenormalized charges in such theories is given by an expression of the type (133.4), but with the opposite sign in the denominator, so that, for a given value of $e^2$, the unrenormalized charge $e_c^2$ decreases with increasing $\Lambda$. This is called asymptotic freedom of the theory. Such a theory is, of course, fundamentally different from the theory with zeroized charge.

To conclude this section, we shall show how formulae (133.3) and (133.4) can be derived by simple arguments based on the significance of renormalization and on dimensional reasoning (M. Gell-Mann and F. E. Low, 1954).

Let us consider the square of the unrenormalized charge as a function of the cut-off parameter, $e_c^2(\Lambda^2)$, and define a function $d$ as the ratio of the values of $e_c^2$ for two different arguments: $e_c^2(\Lambda_2^2)=e_c^2(\Lambda_1^2)\,d$. When $\Lambda_1^2,\Lambda_2^2\gg m^2$, the function $d$ is independent of $m$ and, being dimensionless, can depend only on the likewise dimensionless quantities $e_c^2(\Lambda_1^2)$ and $\Lambda_2^2/\Lambda_1^2$:

$$e_c^2(\Lambda_2^2)=e_c^2(\Lambda_1^2)\,d\!\left(e_c^2(\Lambda_1^2),\frac{\Lambda_2^2}{\Lambda_1^2}\right). \tag{133.6}$$

From this functional relation, we can derive a differential equation, by writing it for infinitesimally close values of $\Lambda_1$ and $\Lambda_2$. Denoting $\Lambda_1^2$ by $\xi$ and putting $\Lambda_2^2=\xi+d\xi$, we obtain for $\alpha_c(\xi)\equiv e_c^2(\Lambda^2)$ the differential equation

$$d\alpha_c=\phi(\alpha_c)\,d\xi/\xi \tag{133.7}$$

with

$$\phi(\alpha_c)=\alpha_c[\partial d(\alpha_c,x)/\partial x]_{x=1}; \tag{133.8}$$

we have used the fact that $d(\alpha_c,1)=1$, from the definition (133.6). Integration of (133.7) from $\xi=\Lambda_1^2$
to $\xi=\Lambda_2^2$ gives

$$\log(\Lambda_2^2/\Lambda_1^2)=\int_{e_c^2(\Lambda_1^2)}^{e_c^2(\Lambda_2^2)}\frac{d\alpha}{\phi(\alpha)}. \tag{133.9}$$

Throughout the range of integration, $e_c^2$ is small. We can therefore use for $\phi(\alpha)$ the expression corresponding to the first approximation of perturbation theory. The correction to the unrenormalized charge $e_c^2$ is $e_c^2k^{-2}\mathscr P(k^2)$. Taking the first approximation (132.1) for the polarization operator, we find

$$d(\alpha_c,\Lambda_2^2/\Lambda_1^2)=1+(\alpha_c/3\pi)\log(\Lambda_2^2/\Lambda_1^2),\qquad\phi(\alpha_c)=\alpha_c^2/3\pi,$$

and the integration in (133.9) then gives

$$\frac{1}{3\pi}\log\frac{\Lambda_2^2}{\Lambda_1^2}=\frac{1}{e_c^2(\Lambda_1^2)}-\frac{1}{e_c^2(\Lambda_2^2)}. \tag{133.10}$$

As $\Lambda_1^2\to\sim m^2$, the unrenormalized charge $e_c(\Lambda_1^2)$ tends to the actual charge $e$, and (133.10) then agrees with (133.3) and (133.4).†

† The renormalization group method based on the functional properties of propagators and vertex parts is systematically developed in the book by Bogoliubov and Shirkov cited at the end of §112.

§134. Asymptotic form of the scattering amplitudes at high energies

Let us consider the asymptotic form (at high energies) of the amplitudes and cross-sections for two-particle scattering processes ($1+2\to3+4$). For the basic electrodynamic processes in the first non-vanishing approximation (with respect to $\alpha$), this problem can be solved by means of the specific formulae derived in the preceding chapters, which are valid at all energies. Here, however, we shall discuss the question from a more general standpoint, which enables such asymptotic forms to be derived directly.

As in §66, we use the invariants

$$s=(p_1+p_2)^2,\qquad t=(p_1-p_3)^2,\qquad u=(p_1-p_4)^2, \tag{134.1}$$

with $p_1+p_2=p_3+p_4$; the notation corresponds to reactions in the $s$ channel, which will be considered here. In the ultra-relativistic case, when the energies are much greater than the particle masses, the energies of the two particles in the centre-of-mass system are approximately equal.
We denote by $\varepsilon$ the sum of the energies of the colliding particles, and then have, in the centre-of-mass system,

$$p_1=(\tfrac12\varepsilon,\mathbf p_1),\quad p_2=(\tfrac12\varepsilon,-\mathbf p_1),\quad p_3=(\tfrac12\varepsilon,\mathbf p_3),\quad p_4=(\tfrac12\varepsilon,-\mathbf p_3),\quad\mathbf p_1^2=\mathbf p_3^2=\tfrac14\varepsilon^2,$$

with

$$s=\varepsilon^2,\qquad t=-\tfrac12s(1-\cos\theta),\qquad u=-\tfrac12s(1+\cos\theta), \tag{134.2}$$

$\theta$ being the angle between $\mathbf p_1$ and $\mathbf p_3$.

Let us first consider the asymptotic form of the reaction cross-section for a fixed value of the scattering angle $\theta$. Then all three variables $s$, $t$ and $u$ are proportional, and tend to infinity together. In the ultra-relativistic case, the particle masses cannot appear in the result, and the only quantity having the dimensions of length is $1/\varepsilon$ ($=\hbar c/\varepsilon$). Hence it follows from dimensional arguments that the differential cross-section for two-particle reactions decreases with increasing energy in the asymptotic form

$$d\sigma/do\propto1/s\quad\text{as}\quad s,|t|,|u|\to\infty. \tag{134.3}$$

If the cross-section is related not to the solid-angle element $do$ but to the differential $dt$, we have, since $do\propto dt/s$,

$$d\sigma/dt\propto1/s^2. \tag{134.4}$$

The cross-section is expressed in terms of the scattering amplitude (in the ultra-relativistic case) as $d\sigma/do\propto|M_{fi}|^2/s$; see (64.22), (64.23). The law (134.3) therefore means that in the asymptotic limit the scattering amplitude is independent of $s$:

$$M_{fi}=\text{constant}. \tag{134.5}$$

It is clear from the manner of the derivation that these results apply not only to the first non-vanishing approximation of perturbation theory, but also to higher approximations (those taking account of radiative corrections), if logarithmic factors (of the form $\log(s/m^2)$) are ignored; the dependence on dimensionless logarithms cannot, of course, be determined from dimensional arguments.†

† The summation of series containing logarithmic corrections may lead to an exponential dependence on the logarithms, which changes the exponent in the power law. This change is, however, small if $\alpha$ is small.

A different situation arises if $s$ increases with $t$ fixed, i.e. with a fixed square of the momentum transfer. This is scattering through small angles, which decrease as the energy increases:

$$s\to\infty,\qquad|t|\sim s\theta^2=\text{constant},\qquad\theta\sim(|t|/s)^{1/2}. \tag{134.6}$$

In such a case, dimensional arguments allow us to establish only that the combined power of $1/s$ and $1/t$ in $d\sigma/dt$ is 2, and in the amplitude $M_{fi}$ zero.† Hence, to find the part of the cross-section that decreases least rapidly with increasing $s$, we have to separate the factor that has the greatest power of $1/t$. Such factors arise only if the Feynman diagram can be divided into two parts between the ends 1, 3 and 2, 4 by cutting the lines of virtual particles. The total 4-momentum of such lines is $p_1-p_3$, and this leads to the factor that depends on $t=(p_1-p_3)^2$. Thus the asymptotic form of the diagram in the range (134.6) depends on the nature of the possible cuttings of diagrams in the $t$ channel.

Similarly, the asymptotic behaviour in the range

$$s\to\infty,\qquad|u|\sim s(\pi-\theta)^2=\text{constant},\qquad|\pi-\theta|\sim(|u|/s)^{1/2}, \tag{134.7}$$

corresponding to scattering through angles close to $\pi$, is governed by the nature of the possible cuttings of diagrams in the $u$ channel, i.e. between the ends 1, 4 and 2, 3.

The simplest example is electron-electron scattering, described by the two diagrams (73.13) and (73.14). The first of these allows cutting in the $t$ channel at the virtual photon line, and it determines the asymptotic form of the scattering amplitude in the range (134.6). The virtual photon line corresponds to a $D$ function that is proportional to $1/t$. The asymptotic forms of the amplitude and the differential cross-section are

$$M_{fi}\propto s/t,\qquad d\sigma\propto dt/t^2. \tag{134.8}$$

In the limit (134.7), close to the backward direction, the asymptotic form is determined by the "exchange" diagram (73.14); in that limit, $M_{fi}\propto s/u$, $d\sigma\propto du/u^2$.
For mutual scattering of different particles (electron and muon), there is no exchange diagram, and so the cross-section for scattering through angles $\theta\approx\pi$ decreases in accordance with (134.3) and (134.4).‡

† These arguments assume a constant $|t|\gg m^2$. The results thus obtained remain valid, as regards the dependence on $s$ (i.e. on the energy), even when $|t|\sim m^2$.

‡ All these statements are, of course, in accordance with the results of §81; see (81.11) and Problem 6.

We shall show that these results for the asymptotic behaviour of electron-electron scattering are unaffected by the inclusion of radiative corrections. To do so, let us consider the corrections of various orders to the diagram (73.13). It has already been shown that the diagrams which are corrections to the internal $D$ function (see (113.11)) or to the vertex parts (see (117.1)) lead only to logarithmic corrections in the amplitude; they do not alter the power law (134.8). We shall show that the same is true of the diagram allowing cutting at two (not one) internal photon lines:

[diagram] (134.9)

The scattering amplitude corresponding to this diagram differs from that corresponding to (73.13), in that the factor $1/t$ is replaced by

$$\frac{(\gamma(p_1+q))(\gamma(p_2-q))}{(p_1+q)^2(p_2-q)^2q^2(p_3-p_1-q)^2}\,d^4q,$$

followed by integration over $d^4q$. The important range of integration is the one which gives rise to the lowest power of $1/s$. For this, $q$ must always be small in comparison with $p_1$ and $p_2$. Rejecting the terms which are then small (and also the terms $p_1^2=p_2^2=m^2$), we can rewrite this expression as

$$\frac{(\gamma p_1)(\gamma p_2)}{(p_1q)(p_2q)\,q^2(p_3-p_1-q)^2}\,d^4q.$$

The denominator does not contain $s$ if $q_0$ and $q_x$ (with the $x$-axis along $\mathbf p_1=-\mathbf p_2$) are $\propto1/\sqrt s$; $q_y$ and $q_z$ may be $\sim\sqrt{|t|}$; then the range of integration $\propto1/s$. The order of magnitude of the numerator is $p_1p_2\propto s$.
Thus the replacement of one internal photon line in the diagram by two does not affect the dependence of the diagram on $s$ (for a given $t$).† That is, the contribution of the diagram (134.9) to the scattering amplitude has the same asymptotic behaviour (134.8) as that of the principal diagram.

† Let us mention again that only power-law asymptotic forms are under consideration, and so we need not take account of logarithmic divergences in the integrations. Diagrams of the form (134.9) will be further studied in §137.

The position is unaffected by adding other parallel internal photon lines in the diagram, and also by including corrections to the internal electron lines. This is a general result: any diagram which can be cut in the $t$ or $u$ channel into two parts across any number of internal photon lines corresponds to an amplitude contribution which has the asymptotic form $M_{fi}\propto s/t$ with $t$ constant or $s/u$ with $u$ constant (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967; H. Cheng and T. T. Wu, 1969).

As a second example, let us consider Compton scattering described by the two diagrams (74.12). These do not allow cutting in the $t$ channel, but the second diagram can be cut in the $u$ channel at an internal electron line. In the notation of the present section, it is

[diagram] (134.10)

This means that the scattering is largely concentrated near the backward direction, as already noted at the end of §86; see (86.20). To find the asymptotic behaviour in this range, we note that the factor $G$ corresponding to the internal line in (134.10) is in order of magnitude $1/\gamma(p_1-p_4)\propto1/\sqrt{|u|}$. Hence the scattering amplitude $M_{fi}\propto\alpha(s/|u|)^{1/2}$; this includes a factor $\alpha$ because the diagram (134.10) is of the second order. The differential cross-section is therefore $d\sigma/du\propto\alpha^2/|u|s$. The integral of this expression with respect to $|u|$ is governed by the range $|u|\ll s$.
The total cross-section then decreases with increasing energy as $\sigma \propto \alpha^2/s$, or more exactly $\sigma \propto (\alpha^2/s)\log(s/m^2)$; cf. (86.20).†

† The exact form of the dependence of the cross-section on $|u|$ or $|t|$ when these are $\lesssim m^2$ cannot, of course, be ascertained from the arguments given here. It is assumed that the integral with respect to $|u|$ or $|t|$ converges at values $\sim m^2$. This is in fact so for all processes except elastic scattering of charged particles.

For this process, however, radiative corrections alter the asymptotic behaviour. The change is due to sixth-order diagrams of the type

[Diagram (134.11): sixth-order Compton-scattering diagram which can be cut in the $t$ channel across two internal photon lines.]

In the $t$ channel, these allow a cut across two internal photon lines, and therefore contribute to the amplitude, with asymptotic form $M_{fi} \propto \alpha^3 s/t$; the factor $\alpha^3$ corresponds to a sixth-order diagram. When $s$ is sufficiently large, this part of the amplitude becomes the principal one, and the differential cross-section is then $d\sigma/dt \propto \alpha^6/t^2$. The integral of this expression with respect to $t$ is governed by the range of small $|t| \sim m^2$, i.e. by the range of scattering angles $\theta \sim m/\sqrt{s}$; note that the scattering is now mainly forward, not backward. The total cross-section then no longer decreases with increasing energy:

$$\sigma \propto \alpha^6/m^2 = \alpha^4 r_e^2. \tag{134.12}$$

The decreasing part of the cross-section becomes comparable with this constant part when $\varepsilon = \sqrt{s} \sim m/\alpha^2$.

A similar situation occurs for photon-photon scattering. In the first non-vanishing approximation, this is described by the "square" diagrams (127.1), which can be cut across two internal electron lines. Integration is carried out with respect to the 4-momentum of these lines in the diagram; momenta $\sim \sqrt{s}$ are important, and small values of $t$ or $u$ are not especially significant. The asymptotic form of these diagrams for any constant $t$ or $u$ is given by (134.5): $M_{fi} = \text{constant} \propto \alpha^2$. The total cross-section decreases with increasing energy: $\sigma \propto \alpha^4/s$ (cf. (127.23)); angles close to zero or $\pi$ have no special significance here. In the eighth order, however, there are diagrams which can be cut (in the $t$ or $u$ channel) across two internal photon lines, for example

[Diagram (134.13): eighth-order photon-photon scattering diagram which can be cut across two internal photon lines.]

These diagrams give an asymptotically constant cross-section: $\sigma \propto \alpha^8/m^2$ when $\sqrt{s} \gg m/\alpha^2$.†

The asymptotic constancy of the total cross-section is a characteristic property of scattering processes whose diagrams can be cut (in the $t$ or $u$ channel) across internal photon lines. It occurs even when more than two particles are present in the final state of the reaction.

§135. Separation of the double-logarithmic terms in the vertex operator

The corrections having the form $(\alpha L)^n$ (where $L$ is the large logarithm) can become important only at enormous energies, as already mentioned at the end of §133, and therefore are of purely theoretical significance. But there are also much larger corrections, of the form $(\alpha L^2)^n$, in the amplitudes of actual scattering processes. Such terms, containing the square of the logarithm to the same power as $\alpha$, are called double-logarithmic terms.

The characteristic expansion parameter in the double-logarithmic corrections is

$$(\alpha/\pi)\log^2(\varepsilon^2/m^2), \tag{135.1}$$

where $\varepsilon$ denotes the energies occurring in the problem (for example, the total energy of the colliding particles in the centre-of-mass system). The condition for perturbation theory to be valid is that this quantity be small; it ceases to be satisfied at energies

$$\varepsilon \sim m \exp[\tfrac{1}{2}\sqrt{\pi/\alpha}] \sim 3 \times 10^4\, m. \tag{135.2}$$

Let us now try to avoid this limitation and derive formulae valid when

$$(\alpha/\pi)\log^2(\varepsilon^2/m^2) \lesssim 1. \tag{135.3}$$

This clearly requires the summation of an infinite series of corrections in all powers $(\alpha L^2)^n$.

The double-logarithmic corrections occur in cases of two kinds.
One includes scattering through a fixed finite angle; as has been shown in §134, the cross-sections always decrease in the asymptotic high-energy range. In such cases, the double-logarithmic corrections are closely associated with the infra-red divergence. These cases include, in particular, elastic scattering of electrons in an external Coulomb field; the first double-logarithmic correction to the cross-section has been found in §122. The present section and §136 deal with the complete determination of these corrections, under the condition (135.3).

† The cross-section for coherent scattering of a photon in the field of a nucleus is asymptotically constant even in the first non-vanishing approximation, described by "square" diagrams in which two ends are external-field lines; see (128.7). In reality, however, these diagrams would have to be represented in the form (134.11), the upper continuous line being the nucleus line. The external-field lines then become internal lines in the diagram, and the reason for the asymptotically constant form becomes evident.

The other class of cases includes reaction cross-sections which decrease with increasing energy for a given square of the momentum transfer, i.e. for scattering angles which asymptotically approach zero or $\pi$; as shown in §134, this occurs for processes whose diagrams cannot be cut across internal photon lines in the $t$ or $u$ channel. Here, the double-logarithmic corrections are not connected with the infra-red divergence. As an example of this, we shall discuss in §137 the problem of backward ($u$ = constant) electron-muon scattering.

First of all, with the condition (135.3) the single-logarithmic corrections are $\sim (\alpha/\pi)\log(\varepsilon^2/m^2) \lesssim \sqrt{\alpha/\pi} \ll 1$ and can therefore be omitted. Since the double-logarithmic corrections do not appear in $\mathcal{G}$ and $\mathcal{D}$, the latter functions can now be taken as simply equal to their unperturbed values $G$ and $D$.
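The numerical estimates (135.2) and $\sqrt{\alpha/\pi} \ll 1$ quoted above can be checked directly. The following is a minimal sketch, assuming the approximate value $\alpha \approx 1/137.036$; the function names are hypothetical and are introduced only for this illustration.

```python
import math

# Fine-structure constant; the numerical value is an assumption of this sketch.
ALPHA = 1.0 / 137.036


def double_log_parameter(energy_over_mass: float) -> float:
    """Expansion parameter (135.1): (alpha/pi) * log^2(eps^2/m^2)."""
    return (ALPHA / math.pi) * math.log(energy_over_mass ** 2) ** 2


def single_log_parameter(energy_over_mass: float) -> float:
    """Single-logarithmic parameter: (alpha/pi) * log(eps^2/m^2)."""
    return (ALPHA / math.pi) * math.log(energy_over_mass ** 2)


def breakdown_energy_over_mass() -> float:
    """Energy (in units of m) at which (135.1) reaches unity, cf. (135.2)."""
    return math.exp(0.5 * math.sqrt(math.pi / ALPHA))


if __name__ == "__main__":
    eps = breakdown_energy_over_mass()
    print(f"eps/m at breakdown     : {eps:.3e}")
    print(f"double-log parameter   : {double_log_parameter(eps):.6f}")
    print(f"single-log parameter   : {single_log_parameter(eps):.6f}")
    print(f"sqrt(alpha/pi)         : {math.sqrt(ALPHA / math.pi):.6f}")
```

At the energy where the double-logarithmic parameter reaches unity, the single-logarithmic parameter equals $\sqrt{\alpha/\pi}$ exactly, which is why it may be dropped under the condition (135.3).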
The calculation of the vertex operator $\Gamma$ involves the summation of the double-logarithmic terms which arise from an infinite sequence of diagrams. This problem will be analysed in §136, but first a method will be described for separating the double-logarithmic terms in the various Feynman integrals before actually performing the integrations over all the variables in them (V. V. Sudakov, 1956).

Let us consider the first-order correction (with respect to $\alpha$) to the vertex operator, represented by the diagram (117.1), which is here conveniently taken (with a renaming of the variables) in the form

[Diagram (135.4): vertex correction with external electron lines $p_1$, $p_2$, internal electron lines $p_1 - f$, $p_2 - f$, and a virtual photon line $f$.]

or, analytically,

$$\Gamma^{(1)\mu}(p_2, p_1; q) = -\frac{ie^2}{4\pi^3}\int \frac{\gamma^\nu(\gamma p_2 - \gamma f + m)\gamma^\mu(\gamma p_1 - \gamma f + m)\gamma_\nu\, d^4f}{[(p_2-f)^2 - m^2 + i0][(p_1-f)^2 - m^2 + i0][f^2 + i0]}. \tag{135.5}$$

We shall assume that

$$|q^2| \gg |p_1^2|,\ |p_2^2|,\ m^2, \tag{135.6}$$

and that the ends $p_1$, $p_2$ may be either physical or virtual. From (135.6) it follows that

$$|p_1p_2| \approx \tfrac{1}{2}|q^2| \gg |p_1^2|,\ |p_2^2|,\ m^2, \tag{135.7}$$

i.e. the 4-vectors $p_1$ and $p_2$ have large components but small squares; this is possible because the four-dimensional metric is pseudo-Euclidean. The double-logarithmic terms in fact occur when the conditions (135.6) are satisfied.

We shall see below that relatively small values of $f$ are important in the integration over $d^4f$. We can therefore neglect $f$ in the numerator of the integrand, and $\Gamma^{(1)}$ becomes

$$\Gamma^{(1)\mu} \approx -\frac{ie^2}{4\pi^3}\,\gamma^\nu(\gamma p_2 + m)\gamma^\mu(\gamma p_1 + m)\gamma_\nu\, I_1, \tag{135.8}$$

where

$$I_1 = \int \frac{d^4f}{[(p_2-f)^2 - m^2 + i0][(p_1-f)^2 - m^2 + i0][f^2 + i0]}. \tag{135.9}$$

The matrix factor in (135.8) can be simplified by using the fact that when $\Gamma$ appears in diagrams it is always, in effect, multiplied by the matrices $(\gamma p_2 + m)$ and $(\gamma p_1 + m)$:

$$(\gamma p_2 + m)\,\Gamma\,(\gamma p_1 + m). \tag{135.10}$$

For, if the lines $p_1$ and $p_2$ are virtual, these factors come from $G(p_1)$ and $G(p_2)$; if the lines correspond to real electrons, $\Gamma$ is multiplied by $\bar{u}_2$ and $u_1$, and Dirac's equations show that

$$\bar{u}_2 = \bar{u}_2\,\frac{\gamma p_2 + m}{2m}, \qquad u_1 = \frac{\gamma p_1 + m}{2m}\,u_1.$$
Interchanging the order of the matrix factors and neglecting at each stage, in accordance with (135.7), the squares $p_1^2$, $p_2^2$ and $m^2$ in comparison with $p_1p_2$, we get

$$(\gamma p_2 + m)\,\Gamma^{(1)\mu}\,(\gamma p_1 + m) \approx -\frac{ie^2}{\pi^3}\,(p_1p_2)\,(\gamma p_2 + m)\gamma^\mu(\gamma p_1 + m)\,I_1.$$

We can therefore put $\Gamma^{(1)}$ in the final form

$$\Gamma^{(1)\mu} = (ie^2/2\pi^3)\,\gamma^\mu\, t\, I_1, \tag{135.11}$$

where

$$t = q^2 \approx -2(p_1p_2). \tag{135.12}$$

The integral $I_1$ converges when $f$ is large, and therefore need not be regularized.

The main point of the subsequent calculations is the introduction of new and more convenient variables of integration. Let $f$ be resolved into components tangential and normal to the $p_1p_2$ plane:

$$f = up_1 + vp_2 + f_\perp \equiv f_\parallel + f_\perp, \tag{135.13}$$

$$f_\perp p_1 = f_\perp p_2 = 0. \tag{135.14}$$

As new variables, we take the coefficients $u$ and $v$ and

$$\rho = -f_\perp^2. \tag{135.15}$$

It is evident from the conditions (135.7) that the metric in the $p_1p_2$ plane is pseudo-Euclidean. The time axis can therefore be taken in this plane, so that $f_\perp$ is a space-like 4-vector and $\rho > 0$.

Let the indices 0 and $x$ temporarily denote the components of 4-vectors in the $p_1p_2$ plane; $y$ and $z$ those in the normal plane. To transform the 4-volume element $d^4f = d^2f_\parallel\, d^2f_\perp$ to the new variables, we write

$$d^2f_\perp = |\mathbf{f}_\perp|\, d|\mathbf{f}_\perp|\, d\phi = \tfrac{1}{2}\, d\rho\, d\phi \to \pi\, d\rho$$

(since the integrand in (135.9) is independent of the angle $\phi$). Also

$$d^2f_\parallel = \left|\frac{\partial(f_0, f_x)}{\partial(u, v)}\right| du\, dv = |p_{10}p_{2x} - p_{20}p_{1x}|\, du\, dv \approx \tfrac{1}{2}|t|\, du\, dv,$$

since $p_1^2$ is small, so that $p_{1x}^2 \approx p_{10}^2$ and

$$(p_{10}p_{2x} - p_{20}p_{1x})^2 \approx (p_{10}p_{20} - p_{2x}p_{1x})^2 = (p_1p_2)^2 = (\tfrac{1}{2}t)^2.$$

Thus

$$d^4f = \tfrac{1}{2}|t|\, du\, dv\, d^2f_\perp \to \tfrac{1}{2}\pi|t|\, du\, dv\, d\rho. \tag{135.16}$$

The calculations now depend on the relation between $p_1^2$, $p_2^2$ and $m^2$; two cases will be considered.

VIRTUAL ELECTRON LINES

Let the momenta $p_1$ and $p_2$ correspond to virtual electrons, with

$$|p_1^2|,\ |p_2^2| \gg m^2. \tag{135.17}$$

We shall see that the most important range of integration, which leads to a double-logarithmic expression, is in this case given by the inequalities

$$0 < \rho \ll |tu|,\ |tv|; \qquad |p_1^2/t| \ll |v| \ll 1, \quad |p_2^2/t| \ll |u| \ll 1. \tag{135.18}$$

Accordingly, in the denominator of the integrand in (135.9) we can neglect $m^2$, $p_1^2$, $p_2^2$ and $\rho$ in comparison with $(p_1f)$ and $(p_2f)$:

$$I_1 = \int \frac{d^4f}{2(p_2f)\cdot 2(p_1f)\,(f^2 + i0)}. \tag{135.19}$$

The quantities $p_1f$, $p_2f$ and $f^2$ are given by

$$f^2 = (up_1 + vp_2)^2 - \rho \approx -tuv - \rho, \qquad 2(p_1f) = 2p_1(up_1 + vp_2) \approx -tv, \qquad 2(p_2f) \approx -tu.$$

Then

$$I_1 = -\frac{\pi}{2|t|}\int \frac{d\rho}{\rho + tuv - i0}\,\frac{du}{u}\,\frac{dv}{v}. \tag{135.20}$$

In accordance with the conditions (135.18), the integration over $d\rho$ is taken from 0 to the smaller of $|tv|$ and $|tu|$; the result is

$$\int_0^{\min(|tu|,|tv|)} \frac{d\rho}{\rho + tuv - i0} = \log\min\left\{\frac{1}{|u|}, \frac{1}{|v|}\right\} + \begin{cases} i\pi & \text{when } tuv < 0, \\ 0 & \text{when } tuv > 0. \end{cases} \tag{135.21}$$

The logarithmic integration over $dv$ is taken from $-1$ to $-|p_1^2/t|$ and from $|p_1^2/t|$ to 1 (and similarly over $du$). When (135.21) is substituted in (135.20), the integral of the first term over $du\,dv$ is zero, because the integrand is an odd function. The integration of the second term is carried over ranges of $u$ and $v$ having the same sign when $t < 0$ and opposite signs when $t > 0$. In either case the ranges $v > 0$ and $v < 0$ give the same contribution after integration over $du$, and the result is

$$I_1 = \pm\frac{i\pi^2}{|t|}\int_{|p_2^2/t|}^{1}\frac{du}{u}\int_{|p_1^2/t|}^{1}\frac{dv}{v} = \pm\frac{i\pi^2}{|t|}\,\log\left|\frac{p_1^2}{t}\right|\,\log\left|\frac{p_2^2}{t}\right|, \tag{135.22}$$

the sign being the same as that of $t$.

Finally, substituting in (135.11), we get

$$\Gamma^{(1)\mu}(p_2, p_1; q) = -\frac{\alpha}{2\pi}\,\gamma^\mu\,\log\left|\frac{q^2}{p_1^2}\right|\,\log\left|\frac{q^2}{p_2^2}\right|, \qquad |q^2| \gg |p_1^2|,\ |p_2^2| \gg m^2. \tag{135.23}$$

PHYSICAL EXTERNAL ELECTRON LINES

Now let the momenta $p_1$ and $p_2$ correspond to real electrons, so that

$$p_1^2 = p_2^2 = m^2. \tag{135.24}$$

Then the important range of integration is

$$0 < \rho \ll |tu|,\ |tv|; \qquad 0 < |v|,\ |u| \ll 1. \tag{135.25}$$

Since $p_1^2 - m^2 = p_2^2 - m^2 = 0$, we can neglect $f^2$ in comparison with $p_1f$ and $p_2f$, and again bring the integral (135.9) to the form (135.19). To eliminate the infra-red divergence which then occurs, however, we must apply a finite photon mass $\lambda \ll m$ in the photon propagator (cf.
§117):

$$I_1 = \int \frac{d^4f}{2(p_1f)\cdot 2(p_2f)\,(f^2 - \lambda^2 + i0)}. \tag{135.26}$$

In this case

$$f^2 \approx -tuv - \rho, \qquad 2p_1f \approx -tv + 2m^2u, \qquad 2p_2f \approx -tu + 2m^2v,$$

and hence

$$I_1 = -\frac{\pi}{2|t|}\int \frac{d\rho\, du\, dv}{(\rho + tuv + \lambda^2 - i0)(u - \tau v)(v - \tau u)}, \tag{135.27}$$

where $\tau = 2m^2/t$, $|\tau| \ll 1$. After the integration over $d\rho$, similarly to (135.21), we obtain

$$I_1 = -\frac{i\pi^2}{2|t|}\int\!\!\int \frac{du\, dv}{(u - \tau v)(v - \tau u)},$$

the integration being subject to the condition $tuv + \lambda^2 < 0$. The ranges $v > 0$ and $v < 0$ again make equal contributions, and the result of the integration over $du$ is

$$I_1 = \frac{i\pi^2}{t}\int_0^1 dv \int \frac{du}{(u - \tau v)(v - \tau u)} = \frac{i\pi^2}{t}\int_\delta^1 \log\frac{\tau\delta - v^2}{(\delta - \tau v^2)(\tau - v)}\,\frac{dv}{v}, \tag{135.28}$$

where $\delta = \lambda^2/t$, $|\delta| \ll |\tau|$, and we have used the inequality $|\tau| \ll 1$. In the integral (135.28), three ranges of $v$ lead to double-logarithmic expressions:

(I) $|\tau| \ll v \ll 1$, (II) $\sqrt{\delta/\tau} \ll v \ll |\tau|$, (III) $\sqrt{\tau\delta} \ll v \ll \sqrt{\delta/\tau}$.

(We take the specific case $\sqrt{\delta/\tau} \ll |\tau|$; the result does not depend on this assumption.) With the appropriate approximations in each range, we find

$$I_1 = \frac{i\pi^2}{2t}\left(\log^2\frac{|t|}{m^2} + 4\log\frac{|t|}{m^2}\log\frac{m}{\lambda}\right). \tag{135.29}$$

Finally, substituting in (135.11), we have

$$\Gamma^{(1)\mu}(p_2, p_1; q) = -\frac{\alpha}{4\pi}\,\gamma^\mu\left(\log^2\frac{|q^2|}{m^2} + 4\log\frac{|q^2|}{m^2}\log\frac{m}{\lambda}\right), \qquad |q^2| \gg p_1^2 = p_2^2 = m^2, \tag{135.30}$$

in agreement with (117.21).

§136. Double-logarithmic asymptotic form of the vertex operator

When the corrections $\Gamma^{(1)}$ calculated in §135 become of the order of unity, the vertex operator has to be found by summation of the infinite sequence of double-logarithmic terms of all orders in $\alpha$. This problem can be solved because such terms arise only from diagrams of a particular type, and the contributions from diagrams of different orders are related in a simple manner.
As we shall see below, the double-logarithmic terms arise from all the diagrams that have the form

[Diagrams (136.1): vertex diagrams with one, two, three, ... photon lines joining the two electron lines.]

etc., in which each photon line joins the two electron lines, and the photon lines themselves may intersect in any manner. Let the photon momenta $f_1, f_2, \ldots$ be numbered in the sequence of, say, the right-hand ends of their lines. Then the various diagrams of a given order will differ in the sequence of the left-hand ends of the photon lines.

In each Feynman integral, we neglect terms in the numerator and denominator as in (135.5), and then treat the numerator in the same way as in the derivation of (135.11). Then the sum of all the diagrams having $n$ photon lines, giving the term $\propto \alpha^n$ in $\Gamma$, is

$$\Gamma^{(n)\mu} = \gamma^\mu\left(\frac{ie^2t}{2\pi^3}\right)^n I_n, \tag{136.2}$$

$$I_n = \sum_{\text{perm}}\int \frac{d^4f_1 \cdots d^4f_n}{2(p_1f_1)\cdot 2(p_1f_1 + p_1f_2)\cdots 2(p_1f_1 + \cdots + p_1f_n)\cdot 2(p_2f_1)\cdots 2(p_2f_1 + \cdots + p_2f_n)\, f_1^2 \cdots f_n^2}, \tag{136.3}$$

where the sum is taken over all interchanges (permutations) of the subscripts $k$ in the products $p_2f_k$; the terms $i0$ and $\lambda^2$ in the denominators are omitted for brevity.

It is clear that, if the subscripts $k$ in the products $p_1f_k$ in the sum (136.3) are interchanged in any manner, this is simply equivalent to renaming the momenta and so does not affect the value of $I_n$. We can therefore extend the summation in (136.3) to all interchanges of the factors $f_k$ in both $p_1f_k$ and $p_2f_k$, and divide the result by $n!$. We now make use of the important formula

$$\sum_{\text{perm}} \frac{1}{a_1(a_1 + a_2)\cdots(a_1 + a_2 + \cdots + a_n)} = \frac{1}{a_1}\,\frac{1}{a_2}\cdots\frac{1}{a_n}, \tag{136.4}$$

where the sum is taken over interchanges of the subscripts $1, 2, \ldots, n$.† When this formula is twice applied to the sum of integrals, we obtain a product of $n$ identical integrals of the form (135.19) (or (135.26)), so that

$$I_n = \frac{1}{n!}\,I_1^n. \tag{136.5}$$

† The formula is obviously true for $n = 2$, and can easily be proved by induction.

Substitution in (136.2) and summation of $\Gamma^{(n)}$ over all $n = 0, 1, 2, \ldots$ gives finally

$$\Gamma^\mu(p_2, p_1; q) = \gamma^\mu \exp(ie^2 t I_1/2\pi^3). \tag{136.6}$$

In particular, substitution of $I_1$ from (135.22) gives the double-logarithmic asymptotic form of the vertex operator with virtual external electron lines:

$$\Gamma^\mu(p_2, p_1; q) = \gamma^\mu \exp\left\{-\frac{\alpha}{2\pi}\log\left|\frac{q^2}{p_1^2}\right|\log\left|\frac{q^2}{p_2^2}\right|\right\}, \qquad |q^2| \gg |p_1^2|,\ |p_2^2| \gg m^2 \tag{136.7}$$

(V. V. Sudakov, 1956).

Substitution of $I_1$ from (135.29) gives the asymptotic form of the vertex operator for real external electron lines:

$$\Gamma^\mu(p_2, p_1; q) = \gamma^\mu \exp\left\{-\frac{\alpha}{4\pi}\left(\log^2\frac{|q^2|}{m^2} + 4\log\frac{|q^2|}{m^2}\log\frac{m}{\lambda}\right)\right\}, \qquad |q^2| \gg p_1^2 = p_2^2 = m^2. \tag{136.8}$$

The factor which distinguishes this $\Gamma^\mu$ from its unperturbed value $\gamma^\mu$ defines also the difference between the amplitude for electron scattering in the external field and its Born value. The scattering cross-section is therefore

$$d\sigma = d\sigma_B \exp\left\{-\frac{\alpha}{2\pi}\left(\log^2\frac{|q^2|}{m^2} + 4\log\frac{|q^2|}{m^2}\log\frac{m}{\lambda}\right)\right\}. \tag{136.9}$$

To eliminate the infra-red divergence, we still have to multiply this expression by the sum of the probabilities for the emission of various numbers of soft photons with energy not exceeding some small value $\omega_{\max}$, i.e. by the quantity (see (122.2))

$$1 + \int_0^{\omega_{\max}} dw_\omega + \frac{1}{2!}\int_0^{\omega_{\max}} dw_{\omega_1}\int_0^{\omega_{\max}} dw_{\omega_2} + \cdots = \exp\left\{\int_0^{\omega_{\max}} dw_\omega\right\}. \tag{136.10}$$

The integral in the exponential is given by (120.14) (as the expression which there multiplies $d\sigma_{\text{el}}$), and the final result is the following asymptotic formula for the cross-section for scattering of an electron with energy $\varepsilon$ at a high momentum transfer:

$$d\sigma = d\sigma_B \exp\left\{-\frac{2\alpha}{\pi}\log\frac{|q^2|}{m^2}\log\frac{\varepsilon}{\omega_{\max}}\right\}, \qquad |q^2| \gg m^2, \quad (\alpha/2\pi)\log^2(\varepsilon/m) \sim 1 \tag{136.11}$$

(A. A. Abrikosov, 1956). The first-order term (with respect to $\alpha$) in the expansion of this expression is, of course, (122.12).
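The exponentiation (136.5) to (136.6) rests on the combinatorial identity (136.4). As an illustration, that identity can be checked with exact rational arithmetic for a few sample values. This is only a spot check for small $n$, not a proof; the function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations


def nested_sum(values):
    """Left-hand side of (136.4): sum over all permutations of
    1 / [a1 (a1 + a2) ... (a1 + ... + an)]."""
    total = Fraction(0)
    for perm in permutations(values):
        partial = Fraction(0)  # running sum a1 + ... + ak
        term = Fraction(1)
        for a in perm:
            partial += a
            term /= partial
        total += term
    return total


def product_of_inverses(values):
    """Right-hand side of (136.4): 1 / (a1 a2 ... an)."""
    result = Fraction(1)
    for a in values:
        result /= a
    return result


if __name__ == "__main__":
    sample = [Fraction(2), Fraction(3), Fraction(5), Fraction(7)]
    print(nested_sum(sample), product_of_inverses(sample))
```

Because the comparison uses `Fraction`, agreement is exact rather than approximate; the sample values are arbitrary positive numbers chosen so that no partial sum vanishes.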
Note that, if we put $\omega_{\max} \sim \varepsilon$, one of the logarithms in (136.11) becomes of the order of unity; that is, the double-logarithmic corrections cancel in the cross-section for simultaneous emission of photons of any energy.† In the approximation used, the exponential factor in (136.11) then becomes unity, and the cross-section has its Born value, in agreement with the general statement at the end of §98.

† For scattering through a finite angle, the condition formulated in §98 for the photon to be soft requires only that $\omega_{\max} \ll \varepsilon$, and so the formulae derived here can be used with logarithmic accuracy even when $\omega_{\max} \sim \varepsilon$.

§137. Double-logarithmic asymptotic form of the electron-muon scattering amplitude

As an example of the second kind, let us consider the scattering of an electron by a negative muon, taking only the case of scattering exactly backwards through an angle $\theta = \pi$ (V. G. Gorshkov, V. N. Gribov, L. N. Lipatov and G. V. Frolov, 1967).

This is a simple process in two respects. Firstly, exchange diagrams do not appear, because the two particles are not identical. Secondly, in back-scattering there is very little emission of soft photons, and therefore no infra-red divergence: according to (98.8) the soft-photon emission cross-section is

$$d\sigma = \alpha\, d\sigma_{\text{el}}\left[\frac{\mathbf{v}'_e \times \mathbf{n}}{1 - \mathbf{v}'_e\cdot\mathbf{n}} + \frac{\mathbf{v}'_\mu \times \mathbf{n}}{1 - \mathbf{v}'_\mu\cdot\mathbf{n}} - \frac{\mathbf{v}_e \times \mathbf{n}}{1 - \mathbf{v}_e\cdot\mathbf{n}} - \frac{\mathbf{v}_\mu \times \mathbf{n}}{1 - \mathbf{v}_\mu\cdot\mathbf{n}}\right]^2 \frac{d^3k}{4\pi^2\omega^3}, \tag{137.1}$$

where $\mathbf{v}_e$, $\mathbf{v}_\mu$ and $\mathbf{v}'_e$, $\mathbf{v}'_\mu$ are the particle velocities before and after the collision; in the ultra-relativistic case equality of momenta implies equality of velocities, and to this accuracy, in the centre-of-mass system, for back-scattering, $\mathbf{v}_e = -\mathbf{v}_\mu = -\mathbf{v}'_e = \mathbf{v}'_\mu$, so that (137.1) is zero.

If the scattering process considered corresponds to the $s$ channel of the reaction, in the $t$ channel it becomes the conversion of an electron-positron pair into a $\mu^+\mu^-$ pair. In this channel the condition $\theta = \pi$ signifies that the directions of motion of $e^-$ and $\mu^-$ (and of $e^+$ and $\mu^+$) coincide.
The elimination of the bremsstrahlung in this channel is particularly clear, since the direction of motion of the charge of each sign is unchanged.

The cancelling of the leading terms in the emission cross-section has the result that its asymptotic form does not contain double-logarithmic corrections. Correspondingly, there is (with the same double-logarithmic accuracy) no infra-red divergence even on integration over the momenta of the virtual photons in the scattering amplitude.

If the process is described by means of the invariants

$$s = (p_e + p_\mu)^2, \qquad t = (p_e - p'_e)^2, \qquad u = (p_e - p'_\mu)^2,$$

the values corresponding to back-scattering in the ultra-relativistic case are

$$s = -t \gg m_\mu^2, \qquad u = 0. \tag{137.2}$$

In the first approximation (with respect to $\alpha$) of perturbation theory, electron-muon scattering is represented by the diagram

[Diagram (137.3): one-photon exchange between the electron and muon lines; the photon carries momentum $p_e - p'_e$.]

The corresponding amplitude is

$$M_{fi}^{(1)} = \frac{4\pi\alpha}{t}\,(\bar{u}^{(\mu)\prime}\gamma^\nu u^{(\mu)})(\bar{u}^{(e)\prime}\gamma_\nu u^{(e)}). \tag{137.4}$$

The value of this expression in the limit (137.2) is obtained by replacing the matrix 4-vector $\gamma^\nu$ by its "projection" $\gamma_\perp^\nu$ on a plane perpendicular to the $p_ep'_e$ plane (or, equivalently, the $p_\mu p'_\mu$ plane, since in ultra-relativistic back-scattering $p_e \approx p'_\mu$ and $p'_e \approx p_\mu$): the components parallel to the $p_ep'_e$ plane are the matrices

$$\frac{1}{\sqrt{s}}(\gamma p_e + \gamma p'_e), \qquad \frac{1}{\sqrt{s}}(\gamma p'_e - \gamma p_e)$$

(the first being equal to $\gamma^0$ and the second to $\mathbf{n}_e\cdot\boldsymbol{\gamma}$, with $\mathbf{n}_e$ a unit vector along $\mathbf{p}_e$), and on using Dirac's equations for the bispinors $u^{(e)}$ and $u^{(\mu)}$, we have

$$(\bar{u}^{(\mu)\prime}\gamma_\parallel^\nu u^{(\mu)})(\bar{u}^{(e)\prime}\gamma_{\parallel\nu} u^{(e)}) \sim 1/s,$$

so that these terms may be omitted.
In the next approximation we add the diagram

[Diagram (137.5): two-photon exchange ("box") diagram for electron-muon scattering, with internal line momenta $f$, $p_e - f$ and $p_e + p_\mu - f$.]

and a diagram with the photon lines "crossed", which is conveniently taken in a form which differs from (137.5) only as regards the direction of one of the continuous lines:

[Diagram (137.6): the same diagram with the direction of one of the continuous lines reversed.]

An analysis of the corresponding integrals shows that in both diagrams we have double-logarithmic contributions from the regions of "soft" virtual photons: $|(f - p_e)^2| \ll m_e^2$ or $|(f - p_\mu)^2| \ll m_\mu^2$. These contributions arise from the infra-red divergences of the integrals and, according to the foregoing discussion, must certainly cancel here. In the diagram (137.6), however, there is a double-logarithmic contribution also from large momenta: $|f^2| \gg m_\mu^2$, and this contribution must be calculated.

The diagram (137.6) corresponds to the integral

$$M_{fi}^{(2)} = -\frac{i\alpha^2}{\pi^2}\int \frac{[\bar{u}^{(e)\prime}\gamma^\nu(\gamma f + m_e)\gamma^\lambda u^{(e)}][\bar{u}^{(\mu)\prime}\gamma_\lambda(\gamma f + m_\mu)\gamma_\nu u^{(\mu)}]}{(f^2 - m_e^2)(f^2 - m_\mu^2)(p_e - f)^2(p'_e - f)^2}\, d^4f, \tag{137.7}$$

where we have already used the fact that $p_e \approx p'_\mu$. We again put

$$f = up_e + vp'_e + f_\perp \tag{137.8}$$

(cf. (135.13)). The double-logarithmic contribution comes from the range defined by the inequalities

$$|su|,\ |sv| \gg \rho \gg m_\mu^2, \qquad m_\mu^2/s \ll |u|,\ |v| \ll 1, \tag{137.9}$$

where $\rho = -f_\perp^2$. The 4-vector $f_\perp$ is defined so that $f_\perp p_e = f_\perp p'_e = 0$; in the present case of back-scattering, it follows that $f_\perp^0 = 0$ in the centre-of-mass system, and so $\rho = \mathbf{f}_\perp^2$.

In the numerator in (137.7) we can neglect $m_e$ and $m_\mu$, as well as all terms containing $u$ or $v$; the factors $u$ and $v$ there would cancel the corresponding poles in the denominator (see below), and so the required squares of logarithms would not occur. Since

$$(p'_e - f)^2 \approx tu \approx -su, \qquad (p_e - f)^2 \approx -sv, \qquad f^2 \approx suv - \rho,$$

we can transform the element of integration $d^4f$ in accordance with (135.16) and rewrite (137.7) as

$$M_{fi}^{(2)} = -\frac{i\alpha^2 s}{2\pi^2}\int \frac{[\bar{u}^{(e)\prime}\gamma^\nu(\gamma f_\perp)\gamma^\lambda u^{(e)}][\bar{u}^{(\mu)\prime}\gamma_\lambda(\gamma f_\perp)\gamma_\nu u^{(\mu)}]\, du\, dv\, d^2f_\perp}{su\cdot sv\,(suv - \rho + i0)^2}.$$

The numerator in the integrand is further transformed by averaging over the directions of $\mathbf{f}_\perp$ and replacing $\gamma^\nu$ and $\gamma^\lambda$ by $\gamma_\perp^\nu$ and $\gamma_\perp^\lambda$ (on the same principle as in (137.4)). A simple calculation gives

$$M_{fi}^{(2)} = M_{fi}^{(1)} J^{(1)}, \qquad J^{(1)} = -\frac{i\alpha}{4\pi^2}\int \frac{\rho\, du\, dv\, d\rho}{uv\,(suv - \rho + i0)^2}. \tag{137.10}$$

Finally, using in the numerator the identity $\rho = (\rho - suv) + suv$, we can omit the second term, which would cancel simple poles and therefore give no double-logarithmic contribution. Thus

$$J^{(1)} = -\frac{i\alpha}{4\pi^2}\int \frac{du\, dv\, d\rho}{uv\,(\rho - suv - i0)}. \tag{137.11}$$

This integral has the same form as (135.20), and the integration over $d\rho$ can therefore be performed in the same manner, but since now $\rho \gg m_\mu^2$ we have the condition $suv > m_\mu^2$ instead of $suv > 0$. The result is

$$J^{(1)} = \frac{\alpha}{2\pi}\int\!\!\int \frac{du\, dv}{uv}, \tag{137.12}$$

the range of integration being defined by the inequalities

$$m_\mu^2/s < u,\ v < 1, \qquad suv > m_\mu^2;$$

in the calculation with logarithmic accuracy, the strong inequalities $\gg$ are replaced by $>$ simply. A straightforward calculation gives

$$J^{(1)} = \frac{\alpha}{4\pi}\log^2\frac{s}{m_\mu^2}. \tag{137.13}$$

In the higher approximations of perturbation theory, the desired terms $\sim \alpha^n\log^{2n}s$ arise from "ladder" diagrams similar to (137.6) but with a larger number of "rungs". The complete double-logarithmic asymptotic form of the scattering amplitude is therefore given by the infinite sum

[Diagrams (137.14): the sum of the one-photon exchange diagram and the ladder diagrams with two, three, ... rungs.]

To determine the general form of the terms in this sum, let us consider the diagram for the third approximation (the third term in the series (137.14)). The corresponding integral may be written

$$J^{(2)} = \left(\frac{\alpha}{2\pi}\right)^2\int \frac{du_1\, dv_1\, du_2\, dv_2}{u_1v_1(u_1 + u_2)(v_1 + v_2)} \tag{137.15}$$

with the range of integration $su_iv_i \gg m_\mu^2$. The double-logarithmic term in this integral can be separated by applying to the variables of integration the further conditions

$$v_2 \gg v_1, \qquad u_2 \gg u_1. \tag{137.16}$$
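Then

$$J^{(2)} = \left(\frac{\alpha}{2\pi}\right)^2\int \frac{du_1\, dv_1\, du_2\, dv_2}{u_1u_2v_1v_2} = \left(\frac{\alpha}{2\pi}\right)^2\int d\xi_1\, d\eta_1\, d\xi_2\, d\eta_2,$$

where $\xi_i = \log(su_i/m_\mu^2)$, $\eta_i = -\log v_i$, and the range of integration is defined by the inequalities

$$\xi_1 > \eta_1, \quad \xi_2 > \eta_2, \quad \sigma > \xi_2,\ \eta_2 > 0, \qquad \sigma = \log(s/m_\mu^2).$$

Similarly, the $n$th term of the series can be written as $M_{fi}^{(n+1)} = M_{fi}^{(1)}J^{(n)}$, where

$$J^{(n)}(\sigma) = \left(\frac{\alpha}{2\pi}\right)^n\int d\xi_1\, d\eta_1 \cdots d\xi_n\, d\eta_n, \tag{137.17}$$

with the range of integration

$$\xi_i > \eta_i \quad (i = 1, 2, \ldots, n), \qquad \sigma > \xi_n,\ \eta_n > 0. \tag{137.18}$$

The total scattering amplitude is

$$M_{fi} = M_{fi}^{(1)}\left[1 + \sum_{n=1}^{\infty} J^{(n)}(\sigma)\right]. \tag{137.19}$$

To calculate this sum, we use auxiliary functions $A^{(n)}(\xi, \eta)$, given by the same integrals (137.17) but with ranges of integration

$$\xi_i > \eta_i \quad (i = 1, 2, \ldots, n), \qquad \xi > \xi_n > 0, \quad \eta > \eta_n > 0 \tag{137.20}$$

(i.e. with different limits of integration with respect to $\xi_n$ and $\eta_n$, instead of uniform limits as in (137.18)). It is evident that $M_{fi} = M_{fi}^{(1)}A(\sigma, \sigma)$, where

$$A(\xi, \eta) = \sum_{n=0}^{\infty} A^{(n)}(\xi, \eta), \qquad A^{(0)} = 1. \tag{137.21}$$

The definition of the functions $A^{(n)}(\xi, \eta)$ shows that they satisfy the recurrence relations

$$A^{(n)}(\xi, \eta) = \frac{\alpha}{2\pi}\int\!\!\int d\xi_1\, d\eta_1\, A^{(n-1)}(\xi_1, \eta_1),$$

and summation of this equation with respect to $n$ from 1 to $\infty$ gives an integral equation for the function $A(\xi, \eta)$:

$$A(\xi, \eta) = 1 + \frac{\alpha}{2\pi}\int\!\!\int A(\xi_1, \eta_1)\, d\xi_1\, d\eta_1, \qquad \xi_1 > \eta_1, \quad \xi > \xi_1 > 0, \quad \eta > \eta_1 > 0. \tag{137.22}$$

For the subsequent analysis it will be sufficient to consider $A(\xi, \eta)$ in the range $\xi > \eta$. Then equation (137.22) can be written

$$A(\xi, \eta) = 1 + \frac{\alpha}{2\pi}\int_0^\eta d\eta_1\int_{\eta_1}^{\xi} A(\xi_1, \eta_1)\, d\xi_1. \tag{137.23}$$

Differentiating this with respect to $\eta$, we have

$$\frac{\partial A}{\partial \eta} = \frac{\alpha}{2\pi}\int_\eta^\xi A(\xi_1, \eta)\, d\xi_1, \tag{137.24}$$

and a further differentiation with respect to $\xi$ gives the differential equation

$$\frac{\partial^2 A}{\partial\xi\,\partial\eta} - \frac{\alpha}{2\pi}A = 0. \tag{137.25}$$

This has to be solved with the boundary conditions

$$A(\xi, 0) = 1, \qquad [\partial A/\partial\eta]_{\xi=\eta} = 0, \tag{137.26}$$

which follow immediately from (137.23) and (137.24).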
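The recurrence in the form (137.23) can be iterated exactly, since each $A^{(n)}(\xi, \eta)$ is a polynomial in $\xi$ and $\eta$ for $\xi \ge \eta$. The sketch below is an illustrative check under that assumption; the helper names are hypothetical, and the factor $\alpha/2\pi$ is stripped off, so that the $n$th coefficient multiplies $(\alpha\sigma^2/2\pi)^n$ in $A(\sigma, \sigma)$.

```python
from fractions import Fraction
from math import factorial


def apply_kernel(poly):
    """Apply the operator  int_0^eta d(eta1) int_{eta1}^{xi} d(xi1)  to a
    polynomial stored as {(i, j): c}, meaning the sum of c * xi^i * eta^j."""
    out = {}
    for (i, j), c in poly.items():
        # Inner integral: int xi1^i d(xi1) = (xi^(i+1) - eta1^(i+1)) / (i+1).
        # Outer integral of the xi^(i+1) * eta1^j piece:
        key = (i + 1, j + 1)
        out[key] = out.get(key, 0) + c / ((i + 1) * (j + 1))
        # Outer integral of the -eta1^(i+j+1) piece:
        key = (0, i + j + 2)
        out[key] = out.get(key, 0) - c / ((i + 1) * (i + j + 2))
    return out


def ladder_coefficients(n_max):
    """Values of A^(n)(1, 1) with alpha/(2 pi) set to 1, for n = 0 .. n_max."""
    poly = {(0, 0): Fraction(1)}  # A^(0) = 1
    coefficients = [Fraction(1)]
    for _ in range(n_max):
        poly = apply_kernel(poly)
        coefficients.append(sum(poly.values(), Fraction(0)))
    return coefficients


if __name__ == "__main__":
    for n, c in enumerate(ladder_coefficients(5)):
        print(n, c, Fraction(1, factorial(n) * factorial(n + 1)))
```

The coefficients come out as $1/[n!\,(n+1)!]$, the first being the $\tfrac{1}{2}$ that reproduces (137.13). These are the coefficients of the power series of $(2/x)I_1(x)$ in $(x/2)^2$, consistent with the Bessel-function form of the final result of this section.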
The solution can be found by means of a Laplace transformation with respect to $\xi$:

$$A(\xi, \eta) = \frac{1}{2\pi i}\oint_C e^{p\xi}\phi(p, \eta)\, dp, \tag{137.27}$$

where the contour $C$ in the complex $p$-plane is a closed curve around the point $p = 0$. Substituting (137.27) in (137.25) and equating the integrand to zero, we have

$$\phi(p, \eta) = \phi(p)\exp(\alpha\eta/2\pi p),$$

with $\phi(p)$ an arbitrary function. The first boundary condition (137.26) now gives

$$\phi(p) = \frac{1}{p} + \psi(p),$$

with $\psi(p)$ an analytic function, having no singularity within $C$. The second condition (137.26) can be satisfied by putting $\psi(p) = -2\pi p/\alpha$: then

$$A(\xi, \eta) = \frac{1}{2\pi i}\oint_C\left(\frac{1}{p} - \frac{2\pi p}{\alpha}\right)\exp\left(p\xi + \frac{\alpha\eta}{2\pi p}\right)dp.$$

Combination of these expressions with $\xi = \eta = \sigma$ gives

$$A(\sigma, \sigma) = \frac{1}{2\pi i}\oint_C\left(\frac{1}{p} - \frac{2\pi p}{\alpha}\right)\exp\left[\sigma\left(p + \frac{\alpha}{2\pi p}\right)\right]dp.$$

Finally, integrating by parts and using the familiar formula

$$\frac{1}{2\pi i}\oint_C \exp\left[\tfrac{1}{2}x\left(z + \frac{1}{z}\right)\right]dz = I_1(x)$$

(where $I_1(z) = -iJ_1(iz)$ is the Bessel function with imaginary argument), we find the scattering amplitude

$$M_{fi} = M_{fi}^{(1)}\sqrt{\frac{2\pi}{\alpha}}\,\frac{1}{\log(s/m_\mu^2)}\,I_1\!\left(\sqrt{\frac{2\alpha}{\pi}}\log\frac{s}{m_\mu^2}\right). \tag{137.28}$$

The cross-section for scattering through $\theta = \pi$ is correspondingly

$$d\sigma = d\sigma^{(1)}\,\frac{2\pi}{\alpha\log^2(s/m_\mu^2)}\,I_1^2\!\left(\sqrt{\frac{2\alpha}{\pi}}\log\frac{s}{m_\mu^2}\right), \tag{137.29}$$

$d\sigma^{(1)} = (2\pi\alpha^2/s^2)\,dt$ being the cross-section in the Born approximation in the ultra-relativistic case (see §81, Problem 6).†

† References to further literature on double-logarithmic asymptotic forms are given in the review article by V. G. Gorshkov, Soviet Physics Uspekhi 16, 322, 1973.

CHAPTER XIV

ELECTRODYNAMICS OF HADRONS

§138. Electromagnetic form factors of hadrons

So far, we have been discussing in this book the quantum electrodynamics of particles not capable of strong interactions (electrons, positrons and muons). There are also many particles known as hadrons,† which take part in strong interactions. These include, for example, protons and neutrons with spin ½, pions with spin zero, and other particles. Atomic nuclei, which consist of protons and neutrons, are of course hadrons also.

Present-day theory does not enable us to derive a complete electrodynamics of hadrons.
It is clearly impossible to set up equations which determine the electromagnetic interactions of hadrons without taking account of the considerably more powerful strong interactions. In particular, the latter must be included in order to obtain the explicit form of the hadron current, in order to describe the interactions in quantum electrodynamics. The hadron current will therefore be introduced as a phenomenological quantity whose structure is determined only by the general kinematic requirements independent of any assumption about the dynamics of the interactions.‡ The electromagnetic interaction operator will again have the form

$$e(JA), \tag{138.1}$$

where the current is now denoted by the capital letter $J$ to distinguish it from the electron current $j$. Since the order of magnitude of this interaction is specified by the same elementary charge $e$, we can again use the methods of perturbation theory.§

Let us first establish the form of the transition current between two states of a hadron in free motion (without any transformation of the hadron itself). This current occurs in the three-ended diagram

[Diagram (138.2): hadron line $p_1 \to p_2$ with a vertex (shown as a circle) from which a broken photon line $q$ emerges.]

which itself may be part of a more complex diagram (for example, that for elastic electron scattering by a hadron). The broken line in the diagram (138.2) represents a virtual photon; it cannot correspond to a real photon, since a free particle cannot absorb (or emit) such a photon; and $q^2 = (p_2 - p_1)^2 < 0$.

† From the Greek hadros, "large, massive".

‡ Topics of hadron electrodynamics which involve the quark model will not be discussed in this book.

§ In this chapter, $e$ ($> 0$) denotes the unit charge.
If we take first a hadron with spin zero, let $u_1$ and $u_2$ be the wave amplitudes of the initial and final states of the hadron, in which its 4-momenta are $p_1$ and $p_2$; for a spin-zero particle these amplitudes are scalars (or pseudoscalars).† The hadron transition current $J_{fi}$ between these two states must be bilinear in $u_1$ and $u_2^*$. We can write it

$$J_{fi} = u_2^*\,\Gamma\, u_1, \tag{138.3}$$

where the 4-vector $\Gamma$ is an unknown vertex operator (the circle in the diagram (138.2)). If we put $u_1 = u_2 = 1$, then $J_{fi} = \Gamma$.

The conservation of current is a universal property in electrodynamics, due to the gauge invariance of the theory. In the momentum representation, it is expressed by the orthogonality of the transition current and the photon 4-momentum $q = p_2 - p_1$:

$$qJ_{fi} = 0. \tag{138.4}$$

Here, this means that $\Gamma$ must have the form

$$\Gamma = PF(q^2), \tag{138.5}$$

where $P = p_1 + p_2$ and $F(q^2)$ is a scalar function of $q^2$, which is the only invariant independent variable. Since the type of hadron is unchanged by the transition, $p_1^2 = p_2^2 = M^2$, where $M$ is the mass of the hadron, and hence $Pq = 0$.

The matrix elements (138.3) with $\Gamma$ given by (138.5), and therefore the operator $\hat{J}$, are true 4-vectors. The interaction operator (138.1) is consequently a true scalar. Thus the electromagnetic interaction of spin-zero hadrons is necessarily P-invariant. It is also T-invariant. Time reversal interchanges the initial and final 4-momenta, leaving the sum $P = p_1 + p_2$ unaltered, and changes the sign of the space components of the 4-momenta but not the time components. This is also the way in which the components of the 4-potential $A$ are transformed, so that the product $\hat{J}\hat{A}$ is unaltered.

The invariant function $F(q^2)$ is called the electromagnetic form factor of the hadron.
The explicit form of this quantity cannot, of course, be established in a phenomenological theory, but it is real (in the region $q^2 < 0$ at present under consideration), as follows from the same arguments as were used in §116 for the electron form factors: when $q^2 < 0$, there are no intermediate states which could appear on the right-hand side of the unitarity relation, and so the matrix $M_{fi}$ (and therefore $J_{fi}$) is Hermitian.

† The plane wave is written in the form $\psi = [u/\sqrt{2\varepsilon}]\,e^{-ipx}$. The normalization to one particle per unit volume corresponds (for spin-zero particles) to the normalization of the scalar by $u^*u = 1$, and we can take simply $u = 1$ (§10). In the following we define the transition current with respect to the amplitudes $u_1$, $u_2$, in accordance with the notation used in §64.

If $q = 0$, the initial and final states are the same, and $J_{fi}$ becomes a diagonal matrix element. In particular, $eJ^0_{ii}/2\varepsilon = eF(0)$ is the charge density, which is equal to the total charge $Ze$ of the particle because of the normalization to one particle per unit volume. For an electrically neutral particle, $F(0) = 0$, but it must be emphasized that this does not imply a strictly neutral particle. If a particle is strictly neutral and has a definite charge parity, $F(q^2) = 0$ for all $q^2$; since the current operator is charge-odd (§13), its matrix elements between two states of the same hadron are zero.†

Let us now go on to hadrons with spin ½. In this case the wave amplitudes $u_1$, $u_2$ are bispinors, and the hadron current is

$$J_{fi} = \bar{u}_2\,\Gamma\, u_1. \tag{138.6}$$

From the bilinear combinations of $\bar{u}_2$ and $u_1$ and the 4-vectors $p_1$ and $p_2$, both true 4-vectors and pseudovectors (satisfying the condition (138.4)) can be constructed. Hence the condition of P invariance of the interaction is not necessarily satisfied, and must be imposed separately.‡ As has been shown in §116, under this condition the vertex operator contains two independent and (if $q^2 < 0$) real form factors.
We shall now write it as

$$\Gamma^\mu = 2M(F_e - F_m)\frac{P^\mu}{P^2} + F_m\gamma^\mu
= \frac{1}{2MP^2}\,(4M^2F_e - q^2F_m)\,P^\mu - \frac{1}{2M}F_m\,\sigma^{\mu\nu}q_\nu
= \frac{1}{P^2}\,(4M^2F_e - q^2F_m)\,\gamma^\mu + \frac{2M}{P^2}\,(F_e - F_m)\,\sigma^{\mu\nu}q_\nu, \tag{138.7}$$

where $F_e(q^2)$ and $F_m(q^2)$ are invariant form factors ($M$ being the mass of the hadron). It is easily seen from the equation $P^2 + q^2 = 4M^2$ and (116.5) that the three expressions in (138.7) are equivalent.§

† This does not, of course, mean that such a hadron has no interaction with an electromagnetic field. The product of two current operators, $j(x)j(x')$, is charge-even, and its matrix elements are non-zero for transitions between states having the same charge parity. Thus a strictly neutral hadron can scatter a photon or simultaneously emit two photons, i.e. can take part in processes of a higher order in $\alpha$.

‡ We shall not consider possible violations of parity conservation in electromagnetic interactions in consequence of virtual weak interactions.

§ The convenience of defining the form factors as in (138.7) (R. Sachs, 1962) will be shown below. In the literature, use is also made of form factors $F_1$ and $F_2$ defined similarly to $f$ and $g$ in (116.6):

$$\Gamma^\mu = F_1\gamma^\mu - \frac{1}{2M}F_2\,\sigma^{\mu\nu}q_\nu.$$

These are related to $F_e$ and $F_m$ by $F_e = F_1 + F_2\,q^2/4M^2$, $F_m = F_1 + F_2$.

The electromagnetic form factors are among the invariant amplitudes defined in §70. They may be regarded as the amplitudes of a "reaction" which is (in its annihilation channel) the decay of a virtual photon into a hadron and an antihadron. The virtual photon is a "particle" with spin 1. The fact that its decay into two particles with spin ½ must be described by two independent amplitudes is easily seen by calculating the corresponding helicity amplitudes $\langle\lambda_b\lambda_c|S^J|\lambda_a\rangle$ (see §69): the P invariance means that the four non-zero elements of the S-matrix must be equal in pairs:

$$\langle\tfrac12, -\tfrac12|S^1|1\rangle = \langle-\tfrac12, \tfrac12|S^1|-1\rangle, \qquad \langle\tfrac12, \tfrac12|S^1|0\rangle = \langle-\tfrac12, -\tfrac12|S^1|0\rangle.$$

The requirement of T invariance (or C invariance in the annihilation channel) imposes no further relations between these elements.
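The two parametrizations mentioned in the footnote can be converted into one another by simple algebra. The following is a minimal sketch, not part of the original text: it solves the stated relations $F_e = F_1 + F_2 q^2/4M^2$, $F_m = F_1 + F_2$ for $F_1$ and $F_2$, using $P^2 = 4M^2 - q^2$. The function names and the numerical inputs are illustrative assumptions.

```python
def sachs_from_f1_f2(f1, f2, q2, mass):
    """Relations quoted in the text: F_e = F_1 + F_2 q^2/(4 M^2), F_m = F_1 + F_2."""
    fe = f1 + f2 * q2 / (4.0 * mass ** 2)
    fm = f1 + f2
    return fe, fm


def f1_f2_from_sachs(fe, fm, q2, mass):
    """Inverse of the map above, obtained by solving the two linear relations.

    P^2 = 4 M^2 - q^2 appears as the common denominator.
    """
    p2 = 4.0 * mass ** 2 - q2
    f1 = (4.0 * mass ** 2 * fe - q2 * fm) / p2
    f2 = 4.0 * mass ** 2 * (fm - fe) / p2
    return f1, f2


if __name__ == "__main__":
    mass = 1.0          # hadron mass in arbitrary units (assumption)
    q2 = -0.3           # space-like momentum transfer squared (assumption)
    fe, fm = 0.6, 1.7   # made-up values, not measured form factors
    f1, f2 = f1_f2_from_sachs(fe, fm, q2, mass)
    print(f1, f2, sachs_from_f1_f2(f1, f2, q2, mass))
```

At $q^2 = 0$ the inverse map reduces to $F_1 = F_e$ and $F_2 = F_m - F_e$, i.e. the charge and the anomalous magnetic moment in the units used in the text.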
This is connected with the fact that the interaction described by the vertex operator (138.7) is necessarily T-invariant also (but this does not apply for particles with higher spins).

When $q \to 0$, the terms of zero and first order (in $q$) in (138.7) are

$$\Gamma^\mu = F_e(0)\,\gamma^\mu - \frac{1}{2M}\,[F_m(0) - F_e(0)]\,\sigma^{\mu\nu}q_\nu. \tag{138.8}$$

Hence it follows (see §116) that $F_e(0) = Z$ is the electric charge of the particle (in units of $e$), and $F_m(0) - F_e(0)$ is its anomalous magnetic moment (in units of $e/2M$).†

† For example, the proton has $F_e(0) = 1$, $F_m(0) - F_e(0) = 1.79$; the neutron has $F_e(0) = 0$, $F_m(0) = -1.91$ (the magnetic moment being entirely "anomalous").

So far we have used only form factors in momentum space. This is, of course, sufficient to describe the observed phenomena. Purely as an illustration, however, we shall give a somewhat more intuitive interpretation of the form factors, regarding them as the Fourier transforms of certain functions of the coordinates.

To do this, it is convenient to take a frame of reference in which $\mathbf{P} = \mathbf{p}_1 + \mathbf{p}_2 = 0$ (called the Breit frame); this is always possible, since $P^2 > 4M^2 > 0$. In this frame, $\varepsilon_1 = \varepsilon_2 \equiv \varepsilon$, so that $P^0 = 2\varepsilon$, and the components of the 4-vector $q$ are $q^0 = \varepsilon_2 - \varepsilon_1 = 0$, $\mathbf{q} = 2\mathbf{p}_2 = -2\mathbf{p}_1$.

For a spin-zero hadron, the transition current in the Breit frame has a particularly simple form:

$$J^0_{fi}/2\varepsilon = F(-\mathbf{q}^2), \qquad \mathbf{J}_{fi} = 0.$$

From this we see that $F(-\mathbf{q}^2)$ may be interpreted as the Fourier transform of a static distribution of charges with density

$$e\rho(\mathbf{r}) = e\int F(-\mathbf{q}^2)\,e^{i\mathbf{q}\cdot\mathbf{r}}\,\frac{d^3q}{(2\pi)^3}. \tag{138.9}$$

In this sense, the particle is said to have a spatial electromagnetic structure: when $F = \text{constant} = Z$, we have $\rho(\mathbf{r}) = Z\delta(\mathbf{r})$, and the dependence of the form factor on $\mathbf{q}$ is interpreted as the difference between the charge distribution and a point charge. It must be stressed, however, that this interpretation is not to be taken literally.
The function $\rho(\mathbf{r})$ does not relate to any particular frame of reference, since there is a different frame for each value of $\mathbf{q}$. The Breit frame is the same as the rest frame of the particle, and independent of $\mathbf{q}$, only in the non-relativistic limit of small $\mathbf{q}^2 \ll M^2$, when the change in the particle energy in scattering is negligible. The initial and final states of the particle are the same in this approximation, and so the transition current becomes a diagonal matrix element, with $\rho(\mathbf{r})$ the actual spatial distribution of charges. For the elementary particles, however, the values of $|\mathbf{q}|$ for which the form factors vary considerably are only slightly less than $M$. In the non-relativistic limit for these particles, we can therefore replace $F(-\mathbf{q}^2)$ by $F(0)$, i.e. regard the particle as a point.

The situation is different for nuclei. The mass $M$ of a nucleus is proportional to the number $A$ of nucleons in it, and the typical value of $|\mathbf{q}| \sim 1/R$, i.e. is proportional to $A^{-1/3}$ ($R$ being the radius of the nucleus). Hence, for sufficiently heavy nuclei, the typical $\mathbf{q}^2 \ll M^2$, and so the non-relativistic treatment is permissible throughout the significant range. Thus the concept of the electromagnetic structure of the nucleus becomes a quite definite one.

For a spin-½ particle, (138.7) gives in the Breit frame

$$J^0_{fi} = (F_e - F_m)\frac{M}{\varepsilon}(\bar{u}_2u_1) + F_m(\bar{u}_2\gamma^0u_1) = F_e(\bar{u}_2\gamma^0u_1), \tag{138.10}$$

$$\mathbf{J}_{fi} = \frac{1}{2M}F_m\,i\mathbf{q}\times(\bar{u}_2\boldsymbol{\Sigma}u_1), \tag{138.11}$$

where $\boldsymbol{\Sigma}$ is the three-dimensional spin operator (matrix) (21.21), and in (138.10) the equation $\varepsilon(\bar{u}_2\gamma^0u_1) = M(\bar{u}_2u_1)$ has been used; this is easily verified by means of Dirac's equations for $u_1$ and $u_2$ with $\mathbf{p}_1 = -\mathbf{p}_2$.

The time component of the transition current (138.10) differs from the expression for a "point particle" (an electron) by a factor $F_e(-\mathbf{q}^2)$. We can therefore say that the form factor $F_e$ (called the electric form factor) describes the "spatial distribution of charge" in accordance with (138.9).
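As an illustration of the correspondence (138.9) in the static (non-relativistic) sense discussed above, the following sketch checks numerically that an exponential charge distribution corresponds to a "dipole" form factor. The particular density $\rho(r) = (\Lambda^3/8\pi)\,e^{-\Lambda r}$, the parameter name `lam` and the integration settings are assumptions chosen for the example; they are not taken from the text.

```python
import math


def exponential_density(r, lam):
    """Normalized density rho(r) = lam^3/(8 pi) * exp(-lam * r); its volume integral is 1."""
    return lam ** 3 / (8.0 * math.pi) * math.exp(-lam * r)


def dipole_form_factor(q, lam):
    """Closed-form Fourier transform of the exponential density: (1 + q^2/lam^2)^(-2)."""
    return 1.0 / (1.0 + (q / lam) ** 2) ** 2


def form_factor_from_density(q, lam, n_steps=20000, r_max_in_units=60.0):
    """F(q) = 4 pi * int_0^inf rho(r) * sin(qr)/(qr) * r^2 dr, by Simpson's rule.

    The upper limit is cut off at r_max = r_max_in_units / lam, where the
    exponential tail is negligible. n_steps must be even.
    """
    r_max = r_max_in_units / lam
    h = r_max / n_steps

    def integrand(r):
        x = q * r
        sinc = 1.0 if abs(x) < 1e-8 else math.sin(x) / x
        return 4.0 * math.pi * r * r * exponential_density(r, lam) * sinc

    total = integrand(0.0) + integrand(r_max)
    for i in range(1, n_steps):
        weight = 4.0 if i % 2 else 2.0
        total += weight * integrand(i * h)
    return total * h / 3.0


if __name__ == "__main__":
    lam = 1.0
    for q in (0.0, 0.5, 1.0, 2.0):
        print(q, form_factor_from_density(q, lam), dipole_form_factor(q, lam))
```

With this normalization $F(0) = 1$, corresponding to $Z = 1$; the fall-off of $F$ with increasing $|\mathbf{q}|$ reflects the finite extent of the distribution.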
Similarly, the three-dimensional vector (138.11) can be correlated with the "spatial distribution" of the current density $e\mathbf{j}(\mathbf{r}) = \operatorname{curl}\boldsymbol{\mu}(\mathbf{r})$, where

$$\boldsymbol{\mu}(\mathbf{r}) = \frac{e}{2M}\,\boldsymbol{\Sigma}\int F_m(-\mathbf{q}^2)\,e^{i\mathbf{q}\cdot\mathbf{r}}\,\frac{d^3q}{(2\pi)^3}$$

is the "magnetic moment density". Thus the magnetic form factor $F_m$ may be interpreted as the magnetic moment spatial distribution density, of course with the same reservations as were expressed above with regard to the charge distribution. $F_m$ includes both the "normal" Dirac magnetic moment and the "anomalous" magnetic moment specific to the hadron, the "density" of the latter corresponding to the difference $F_m - F_e$.

It is reasonable to suppose that the singular points of the hadron electromagnetic form factors, like those of the electron form factors, occur at real positive values of the argument $t = q^2 = -\mathbf{q}^2$. From this we can derive certain conclusions as to the asymptotic behaviour of the distribution $\rho(\mathbf{r})$ (and $\boldsymbol{\mu}(\mathbf{r})$) as $r \to \infty$. The same transformation of the integral (138.9) as was applied in §114 to derive (114.4) from (114.3) gives the following result for large $r$:

$$\rho(r) \propto e^{-\kappa_0 r},$$

where $\kappa_0^2$ is the abscissa of the first singularity of the form factor $F(q^2)$; cf. the footnote to §114. If the nearest singularity is given by the threshold for the production of a pair of hadrons (each with mass $M_0$) by the virtual photon, then $\kappa_0 = 2M_0$.

§ 139. Electron-hadron scattering

The formulae derived in §138 can be applied to the elastic scattering of an electron by a hadron. Let the initial and final 4-momenta of the hadron be $p_h$ and $p'_h$, and those of the electron $p_e$ and $p'_e$; then

$$p_e + p_h = p'_e + p'_h. \tag{139.1}$$

The process is represented by the diagram

[diagram (139.2): electron line $p_e \to p'_e$ joined by a virtual-photon line to the hadron line $p_h \to p'_h$]

The emission of a virtual photon by the electron corresponds to the ordinary vertex operator $\gamma$, and its absorption by the hadron corresponds to the operator $\Gamma$.

Let us take the most interesting case, that of a hadron with spin ½ (for example, the scattering of an electron by a proton or neutron).
The diagram (139.2) corresponds to the scattering amplitude

$$M_{fi} = -4\pi e^2\,\frac{1}{q^2}\,(\bar{u}'_e\gamma^\mu u_e)(\bar{u}'_h\Gamma_\mu u_h); \tag{139.3}$$

in this chapter, the electron charge is $-e$. The calculation of the cross-section from this amplitude is essentially the same as the calculations in §81; the operator $\Gamma$ is conveniently taken in the form of the first expression (138.7).

For the scattering of unpolarized particles, the result is

$$d\sigma = \frac{\pi\alpha^2\,dt}{[s - (M+m)^2][s - (M-m)^2]\,t^2\,(1 - t/4M^2)}
\Big\{F_e^2\,[(s-u)^2 + (4M^2 - t)t] - \frac{t}{4M^2}\,F_m^2\,[(s-u)^2 - (4M^2 - t)(4m^2 + t)]\Big\}, \tag{139.4}$$

where $M$ is the hadron mass and $m$ the electron mass,

$$t = q^2 = (p_e - p'_e)^2, \qquad s = (p_e + p_h)^2, \qquad u = (p_e - p'_h)^2, \qquad s + t + u = 2m^2 + 2M^2.$$

The following are some limiting cases.

For the scattering of an electron by a heavy nucleus, an important case is that in which the momentum transfer $|\mathbf{q}|$ from the electron to the nucleus is small compared with the mass of the nucleus, but not small compared with $1/R$ (where $R$ is the radius of the nucleus), so that the nucleus cannot be regarded as a point. In this case, the centre-of-mass system approximately coincides with the rest frame of the nucleus, the recoil of the nucleus may be neglected, and the electron energy is unchanged. Then

$$\pi|dt| = \mathbf{p}_e^2\,do'_e, \qquad -t = \mathbf{q}^2 \ll M^2, \qquad s - M^2 \approx M^2 - u \approx 2M\varepsilon_e,$$

and formula (139.4) becomes

$$d\sigma = \frac{\alpha^2\,do'_e}{\mathbf{q}^4}\,(4\varepsilon_e^2 - \mathbf{q}^2)\,F_e^2(-\mathbf{q}^2). \tag{139.5}$$

In this approximation, the cross-section has only the term containing the electric form factor, and (139.5) corresponds to formula (80.5), which applies to the scattering of an electron by a static charge distribution.

In the scattering of an electron by a neutron at rest, in the same limiting case $\varepsilon_e \ll M$ (where $M$ is the mass of the neutron), the form factors can be replaced by their values for $q = 0$, since, as already mentioned, for a single nucleon the characteristic "radius" of the charge distribution is comparable with $1/M$.† Since the neutron is electrically neutral, $F_e(0) = 0$, and the cross-section becomes
$$d\sigma = \alpha\mu^2\left[\frac{4(\varepsilon_e^2 - m^2)}{\mathbf{q}^2} + 1\right]do'_e = \alpha\mu^2\left(\operatorname{cosec}^2\tfrac12\theta + 1\right)do'_e, \tag{139.6}$$

where $\mu = (e/2M)F_m(0)$ is the magnetic moment of the neutron and $\theta$ is the scattering angle. This formula corresponds to the scattering of an electron by a point magnetic moment at rest.

† The empirical value of the r.m.s. "radius" of the nucleon is about $3.5/M \approx 0.5/m_\pi$ (where $m_\pi$ is the pion mass).

Finally, we shall give the cross-section for the scattering of an ultra-relativistic electron by a nucleon, with $|\mathbf{q}| \gg m$. As before, $\mathbf{q}^2$ denotes the square of the momentum transfer in the centre-of-mass system, and hence the invariant $t = -\mathbf{q}^2$. In the rest frame of the original nucleon (the laboratory system), we have

$$-t \approx 2(p_ep'_e) = 2\varepsilon_e\varepsilon'_e(1 - \cos\theta),$$

where $\varepsilon_e$ and $\varepsilon'_e$ are the initial and final energies of the electron, and $\theta$ is the scattering angle in this system. In the ultra-relativistic case, $\varepsilon'_e$ is related to $\theta$ by the same formula as in the scattering of a photon (cf. (86.8)):

$$\frac{1}{\varepsilon'_e} - \frac{1}{\varepsilon_e} = \frac{1}{M}(1 - \cos\theta).$$

Hence

$$-t = \frac{4\varepsilon_e^2\sin^2\tfrac12\theta}{1 + (2\varepsilon_e/M)\sin^2\tfrac12\theta}, \tag{139.7}$$

$$|dt| = \frac{\varepsilon_e^2\,do'_e}{\pi\,[1 + (2\varepsilon_e/M)\sin^2\tfrac12\theta]^2}, \tag{139.8}$$

where $do'_e = 2\pi\sin\theta\,d\theta$. In formula (139.4), we can everywhere omit the electron mass $m$; expressing all quantities in terms of $t$ and $s - M^2 = 2M\varepsilon_e$, we have

$$d\sigma = \frac{4\pi\alpha^2\,dt}{t^2\,(s - M^2)^2}\left\{[(s - M^2)^2 + st]\,\frac{F_e^2 - (t/4M^2)F_m^2}{1 - t/4M^2} + \tfrac12 t^2F_m^2\right\}, \tag{139.9}$$

or, using (139.7) and (139.8),

$$d\sigma = do'_e\,\frac{\alpha^2\cos^2\tfrac12\theta}{4\varepsilon_e^2\sin^4\tfrac12\theta}\,\frac{1}{1 + (2\varepsilon_e/M)\sin^2\tfrac12\theta}
\left\{\frac{F_e^2 - (t/4M^2)F_m^2}{1 - t/4M^2} - \frac{t}{2M^2}\,F_m^2\tan^2\tfrac12\theta\right\} \tag{139.10}$$

(M. N. Rosenbluth, 1950).

Note that the form factors $F_e$ and $F_m$ contribute independently to the cross-section, and there are no interference terms between them. This shows that the form factors have been appropriately chosen.

PROBLEM

Find the cross-section for scattering of an electron by a hadron with spin zero.

SOLUTION. Using (138.5), we have instead of (139.3)

$$M_{fi} = -4\pi e^2\,\frac{1}{q^2}\,(\bar{u}'_e(\gamma P)u_e)\,F(q^2), \qquad P = p_h + p'_h.$$

The cross-section is found to be

$$d\sigma = \frac{\pi\alpha^2\,dt\,[(s-u)^2 + (4M^2 - t)t]}{[s - (M+m)^2][s - (M-m)^2]\,t^2}\,F^2(t),$$

with the same notation as in (139.4). When $|t| \gg m^2$,
$$d\sigma = do'_e\,\frac{\alpha^2\cos^2\tfrac12\theta}{4\varepsilon_e^2\sin^4\tfrac12\theta}\,\frac{F^2(t)}{1 + (2\varepsilon_e/M)\sin^2\tfrac12\theta},$$

with the same notation as in (139.10).

§ 140. The low-energy theorem for bremsstrahlung

In §98 we have investigated the emission of a photon in a collision of particles, in the limit of zero photon frequency, and found that the amplitude of the process is inversely proportional to $\omega$ and can be simply expressed in terms of the amplitude for the same collision without emission of a soft photon; the latter will again be conventionally referred to below as the amplitude for "elastic" scattering and denoted by $M^{(\mathrm{el})}_{fi}$. In the next approximation with respect to $\omega$,

$$M_{fi} = M^{(-1)}_{fi} + M^{(0)}_{fi}, \tag{140.1}$$

where a correction term independent of $\omega$ ($\propto \omega^0$) has been added to the principal term ($\propto \omega^{-1}$). We shall see that this correction term also, like the principal term, can be expressed in terms of $M^{(\mathrm{el})}_{fi}$, and that this is true whatever the detailed electromagnetic structure of the hadron. This result is called the low-energy theorem for bremsstrahlung (F. E. Low, 1958).

We have seen in §98 that the main contribution to the amplitude for emission of a soft photon (corresponding to the first term in (140.1)) arises from diagrams in which the photon is emitted by the initial or final particle. These are diagrams of the form (140.2), in contrast to those of the form (140.3), where the photon line comes from the internal parts of the diagram. A characteristic feature of the diagrams (140.2) is that they can be divided into two parts by cutting only one (initial or final) virtual-hadron line. Thus they illustrate an important property: there exists a one-particle intermediate state with one hadron. We have seen in §79 that, because of the unitarity conditions, this property necessarily causes a pole singularity of the amplitude.
Let us assume for simplicity that only one (denoted by the subscript 1) of two colliding hadrons has an electric charge and therefore can radiate, and that neither hadron has any spin. The wave amplitudes $u$ of such hadrons are scalars, which will be taken as unity. Then the contribution of the pole part of the diagram (140.2a) to the amplitude is

$$iM^{(a)}_{fi} = \sqrt{4\pi}\,e^*_\mu\cdot(2p_1^\mu - k^\mu)\,eF\cdot\frac{1}{(p_1 - k)^2 - M^2}\cdot i\Gamma. \tag{140.4}$$

The first factor corresponds to the photon $k$ ($e_\mu$ being its polarization 4-vector). The second factor corresponds to the electromagnetic hadron vertex (the black dot in the diagram), and is written in the form (138.5), with $F$ the hadron form factor. The third factor is the propagator of the virtual hadron $p_1 - k$ ($M$ being its mass). Finally, the factor $i\Gamma$ denotes the whole remaining section. This differs from the amplitude of the elastic process (140.5) in that the real hadron $p_1$ is replaced by the virtual hadron $p_1 - k$.

The first few terms in the expansion of (140.4) in powers of $\omega$ include (1) terms inversely proportional to $\omega$, (2) terms independent of $\omega$ but depending on the direction of $\mathbf{k}$, (3) terms independent of both $\omega$ and $\mathbf{k}$. Terms of the third kind, and only these, are given also by non-singular diagrams of the type (140.3), which do not have a pole singularity, and by the non-pole parts of the diagrams (140.2). We shall see that all terms of this sort are jointly and unambiguously given by the terms of the first and second kinds when the condition of gauge invariance is applied, so that no separate calculation of these terms is necessary.

The amplitude of the elastic process (140.5) depends only on two invariants:

$$s = (p_1 + p_2)^2 = (p'_1 + p'_2)^2, \qquad t = (p_2 - p'_2)^2. \tag{140.6}$$

The replacement of $p_1$ by $p_1 - k$ not only changes $s$ into $(p_1 - k + p_2)^2$ but also brings in a dependence on a further variable,

$$(p_1 - k)^2 - M^2 = -2p_1k,$$

which represents the "non-physicalness" of the momentum $p_1 - k$.
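The kinematic statements just made are easy to confirm with explicit numbers. The sketch below (an illustration only; the helper names and the sample momenta are assumptions) builds an on-shell hadron momentum and a real-photon momentum, then evaluates $(p_1 - k)^2 - M^2$ and the shifted invariant $(p_1 + p_2 - k)^2$. Since $k^2 = 0$, the latter differs from $s$ by $-2k(p_1 + p_2)$.

```python
import math


def minkowski_dot(a, b):
    """Scalar product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def add(a, b):
    return tuple(x + y for x, y in zip(a, b))


def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def on_shell(mass, px, py, pz):
    """4-momentum of a real particle with the given mass and 3-momentum."""
    energy = math.sqrt(mass ** 2 + px ** 2 + py ** 2 + pz ** 2)
    return (energy, px, py, pz)


def real_photon(omega, nx, ny, nz):
    """Light-like 4-momentum with frequency omega along the direction (nx, ny, nz)."""
    norm = math.sqrt(nx ** 2 + ny ** 2 + nz ** 2)
    return (omega, omega * nx / norm, omega * ny / norm, omega * nz / norm)


def off_shellness(p1, k, mass):
    """(p1 - k)^2 - M^2 for the virtual hadron line."""
    virtual = sub(p1, k)
    return minkowski_dot(virtual, virtual) - mass ** 2


if __name__ == "__main__":
    mass = 1.0
    p1 = on_shell(mass, 0.3, -0.2, 0.5)
    p2 = on_shell(mass, -0.3, 0.2, -0.5)
    k = real_photon(0.01, 1.0, 2.0, 2.0)
    total = add(p1, p2)
    s = minkowski_dot(total, total)
    shifted = sub(total, k)
    print(off_shellness(p1, k, mass), -2.0 * minkowski_dot(p1, k))
    print(minkowski_dot(shifted, shifted), s - 2.0 * minkowski_dot(k, total))
```

Both printed pairs agree to rounding error, and both shifts are of first order in $\omega$, which is why they can be treated as small expansion parameters.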
But the first term in the expansion in powers of this new variable (a small quantity) already eliminates the singularity in the amplitude (140.4), and therefore can yield in this amplitude only terms independent of $k$, which according to the foregoing discussion are not yet relevant. Thus we reach the important conclusion that $\Gamma$ in (140.4) can be replaced by the physical amplitude $M^{(\mathrm{el})}_{fi}(s, t)$ with the change

$$s \to (p_1 + p_2 - k)^2 = s - 2k(p_1 + p_2). \tag{140.7}$$

The first terms in its expansion are given by

$$\Gamma \to M^{(\mathrm{el})}_{fi}(s, t) - 2(kp_1 + kp_2)\,(\partial M^{(\mathrm{el})}_{fi}/\partial s)_t.$$

For a similar reason, it is unimportant that the electromagnetic form factor $F$ here relates to a vertex at which only one of the two external hadron lines ($p_1$ and $p_1 - k$) is physical. The form factor can therefore be replaced by the one described in §138, for a vertex with two physical external lines; since the photon $k$ is then a real photon, we have $F(k^2) = F(0) = Z_1$, where $eZ_1$ is the hadron charge. Thus (140.4) gives

$$M^{(a)}_{fi} = -Z_1e\sqrt{4\pi}\,\frac{(e^*p_1)}{(kp_1)}\,M^{(\mathrm{el})}_{fi} + Z_1e\sqrt{4\pi}\,\frac{2(e^*p_1)(kp_2)}{(kp_1)}\,\frac{\partial M^{(\mathrm{el})}_{fi}}{\partial s} + \ldots, \tag{140.8}$$

where the dots represent terms independent of $k$ (whereas the second term in (140.8) depends on the direction of $\mathbf{k}$). Similarly, we find that the contribution to $M_{fi}$ from the diagram (140.2b) differs from (140.8) in that $p_1$, $p_2$ and $k$ are replaced by $p'_1$, $p'_2$ and $-k$.

The leading term in the expansion is the already familiar expression (cf. (98.5))

$$M^{(-1)}_{fi} = Z_1e\sqrt{4\pi}\left(\frac{(p'_1e^*)}{(p'_1k)} - \frac{(p_1e^*)}{(p_1k)}\right)M^{(\mathrm{el})}_{fi}. \tag{140.9}$$

The terms independent of $k$ can be found from the condition for the amplitude as a whole to be gauge-invariant: it must be unaffected by the change $e^* \to e^* + \text{constant}\times k$, i.e. must have the form $M_{fi} = e^*_\mu J^\mu$ with $k_\mu J^\mu = 0$. It is easy to see that, for this to be so, we must add to (140.8) the term independent of $k$

$$-2Z_1e\sqrt{4\pi}\,(p_2e^*)\,\frac{\partial M^{(\mathrm{el})}_{fi}}{\partial s},$$

and similarly for the diagram (140.2b). The final result is

$$M^{(0)}_{fi} = 2Z_1e\sqrt{4\pi}\,e^*_\mu\left[p_1^\mu\frac{(kp_2)}{(kp_1)} - p_2^\mu + p_1'^\mu\frac{(kp'_2)}{(kp'_1)} - p_2'^\mu\right]\frac{\partial M^{(\mathrm{el})}_{fi}}{\partial s}. \tag{140.10}$$

The problem can be solved by means of this formula, which may be more compactly written by using the identity

$$\frac{\partial M^{(\mathrm{el})}_{fi}}{\partial p_1^\mu} = 2(p_1 + p_2)_\mu\,\frac{\partial M^{(\mathrm{el})}_{fi}}{\partial s},$$

and similarly for $\partial/\partial p'_1$, and the differential operators

$$\hat{d}_1^\mu = \frac{p_1^\mu}{(p_1k)}\,k^\nu\frac{\partial}{\partial p_1^\nu} - \frac{\partial}{\partial p_{1\mu}}, \tag{140.11}$$

and similarly for $\hat{d}_1'^\mu$. Then

$$M^{(0)}_{fi} = Z_1e\sqrt{4\pi}\,e^*_\mu\,(\hat{d}_1^\mu + \hat{d}_1'^\mu)\,M^{(\mathrm{el})}_{fi}. \tag{140.12}$$

The cross-section is given by $|M_{fi}|^2$; to the appropriate accuracy,

$$|M_{fi}|^2 = |M^{(-1)}_{fi}|^2 + 2\,\mathrm{re}\,(M^{(-1)}_{fi}M^{(0)*}_{fi}). \tag{140.13}$$

The second term gives the required correction to the emission cross-section. Summation over the polarizations of the photon gives as the value of this correction

$$-4\pi(Z_1e)^2\left(\frac{p'_1}{(p'_1k)} - \frac{p_1}{(p_1k)}\right)_\mu(\hat{d}_1^\mu + \hat{d}_1'^\mu)\,|M^{(\mathrm{el})}_{fi}|^2. \tag{140.14}$$

Thus the correction to the emission cross-section is expressed in terms of the cross-section for the elastic process and its derivative with respect to $s$.

If the charged hadron has spin ½ the calculations are unchanged in principle; only the specific form of the vertices and propagators is altered. It is found that formula (140.14) remains valid after averaging over the polarizations of the hadrons and the photon (T. H. Burnett and N. M. Kroll, 1968).

§ 141. The low-energy theorem for photon-hadron scattering

In the limit of low frequencies, the cross-section for the scattering of a photon by any charged particle at rest tends to its classical value given by Thomson's formula. This limit corresponds to an amplitude independent of the photon frequency $\omega$, which we denote by $M^{(0)}_{fi}$. It is found, however, that not only the first term in the expansion of the amplitude in powers of $\omega$,

$$M_{fi} = M^{(0)}_{fi} + M^{(1)}_{fi}, \tag{141.1}$$

but also the next term ($M^{(1)}_{fi} \propto \omega$), are independent of the specific electromagnetic structure of the hadron for photon scattering, as well as for the bremsstrahlung discussed in §140 (F. E. Low, 1954; M. Gell-Mann and M. L. Goldberger, 1954).
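The gauge-invariance requirement used in §140 to fix the $k$-independent terms can be checked with explicit numbers. The sketch below is an illustration under stated assumptions: the two 4-vectors are taken to be the combinations multiplying $e^*_\mu$ in (140.9) and (140.10) as read here, namely $p_1'^\mu/(p'_1k) - p_1^\mu/(p_1k)$ and $p_1^\mu(kp_2)/(kp_1) - p_2^\mu + p_1'^\mu(kp'_2)/(kp'_1) - p_2'^\mu$; the helper names and sample momenta are hypothetical. Replacing $e^*$ by $k$ must give zero for each.

```python
import math


def minkowski_dot(a, b):
    """Scalar product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def scale(c, a):
    return tuple(c * x for x in a)


def add(a, b):
    return tuple(x + y for x, y in zip(a, b))


def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def on_shell(mass, px, py, pz):
    """4-momentum of a real particle with the given mass and 3-momentum."""
    return (math.sqrt(mass ** 2 + px ** 2 + py ** 2 + pz ** 2), px, py, pz)


def leading_vector(p1, p1_out, k):
    """Vector contracted with e* in the 1/omega term: p1'/(p1' k) - p1/(p1 k)."""
    return sub(scale(1.0 / minkowski_dot(p1_out, k), p1_out),
               scale(1.0 / minkowski_dot(p1, k), p1))


def correction_vector(p1, p2, p1_out, p2_out, k):
    """Vector contracted with e* in the omega^0 term (square bracket of (140.10))."""
    incoming = sub(scale(minkowski_dot(k, p2) / minkowski_dot(k, p1), p1), p2)
    outgoing = sub(scale(minkowski_dot(k, p2_out) / minkowski_dot(k, p1_out), p1_out), p2_out)
    return add(incoming, outgoing)


if __name__ == "__main__":
    p1 = on_shell(1.0, 0.3, -0.2, 0.5)
    p2 = on_shell(1.2, -0.3, 0.2, -0.5)
    p1_out = on_shell(1.0, -0.1, 0.4, 0.3)
    p2_out = on_shell(1.2, 0.1, -0.4, -0.3)
    k = (0.01, 0.0, 0.0, 0.01)  # soft real photon along z
    print(minkowski_dot(k, leading_vector(p1, p1_out, k)))
    print(minkowski_dot(k, correction_vector(p1, p2, p1_out, p2_out, k)))
```

Both contractions vanish identically (up to rounding), independently of the particular momenta chosen, which is the content of the condition $k_\mu J^\mu = 0$ for these terms.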
This process is represented by diagrams of three types:

[diagrams (141.2a), (141.2b), (141.2c), with photon lines $k$ and $k'$]

of which the first two again have a one-particle intermediate state, and therefore a pole singularity. The analysis and the principle of the calculations are the same as in §140. In practice, we need only determine the contribution from the pole parts of the diagrams (141.2a) and (141.2b), expressing their electromagnetic vertices in terms of the static form factors (the charge $Ze$ and the anomalous magnetic moment $\mu_{an}$), as in (140.15).

Unlike the bremsstrahlung case, however, the corrections to the Compton-effect cross-section are important only for particles that have spin. This is because, for bremsstrahlung, as well as the spin-dependent corrections, there are corrections arising from the energy dependence of the amplitude of the "elastic" process. In photon scattering, this amplitude is replaced by form factors which, for "physical external lines", are constants independent of the energy, and therefore the corrections arise only from the magnetic moment, which is zero for spinless particles.

We shall discuss the scattering of a photon by a spin-½ hadron. If $M_{fi}$ denotes the contribution of the pole diagrams to the scattering amplitude, then (cf. (86.3), (86.4))

$$M_{fi} = -4\pi(Ze)^2\,e'^*_\mu e_\nu\,(\bar{u}'Q^{\mu\nu}u), \tag{141.3}$$

where

$$Q^{\mu\nu} = (\gamma^\mu + S'^\mu)\,\frac{\gamma p + \gamma k + M}{s - M^2}\,(\gamma^\nu - S^\nu) + (\gamma^\nu - S^\nu)\,\frac{\gamma p - \gamma k' + M}{u - M^2}\,(\gamma^\mu + S'^\mu), \tag{141.4}$$

$$s = (p + k)^2 = (p' + k')^2, \qquad u = (p - k')^2 = (p' - k)^2,$$

and for brevity we have put

$$\mu_{an}\sigma^{\mu\nu}k_\nu = ZeS^\mu, \qquad \mu_{an}\sigma^{\mu\nu}k'_\nu = ZeS'^\mu.$$
(141.5) By interchanging the operators yp + M and using the equations ö ' ( 7 P ' - M ) = (yp - M)u = 0, we can transform (141.4) to Q-=[<*•+s"> ( 7 l % k + 2 p *+ 7 > ( y" <v + s*>] _ \y'(yk') [ + 2p" 2p'lc' ' [S" y" +£k+ * * L c..y< (yk')-2p'"\ 2pk' J " "*' TP ~2pk'+ M S"\ M S (141 6 '> § 141 The Low-energy Theorem for Photon-Hadron Scattering 637 This form, and the corresponding one with k and k' interchanged, clearly show that (141.3) is gauge-invariant; the relevant condition is K(ü' Q^u) = (ü(Q^u)K = 0, (141.7) in verifying which it must be remembered that (yk)(yk) = 0, kS = k'S' = 0. Since the pole part of the scattering amplitude is thus gauge-invariant by itself, so must be the regular part of the amplitude (which includes the contribution of the diagram (141.2c)). Hence in turn it follows that the expansion of this part in powers of k and k' must begin with quadratic terms; cf. the similar comment relating to the condition (127.5). That is, the regular part of the amplitude includes only terms starting with those proportional to ODCU' — CO2, and makes no contribution to the terms concerned here, which are proportional to a>° and cu1. These are therefore included in (141.3). To calculate the terms in question, we use the laboratory system, in which the initial hadron is at rest. For the photons, we take a three-dimensionally transverse gauge, in which e0=eo = 0. Then pe = 0, p'e f * ~ |p'| ~ o>, and from (141.6) it is obvious"that the leading terms in the expansion of Mfi will be proportional to co°, and that the terms in /xan will contribute only to the cu1 terms. The wave amplitudes of the initial and final hadrons in the laboratory system are, with the necessary accuracy, M=V(2M)(JJ), ö' = V(2M)[w'*,-^(k-k')a], where w and w' are three-dimensional spinors. A straightforward calculation gives the result M }?> = -8ir(Z«)2(e'* -e)(w#*iv), (141.8) M},!) 
$$= -16\pi iM\mu_{an}^2\,\omega\,(w'^*\boldsymbol{\sigma}w)\cdot(\mathbf{n}'\times\mathbf{e}'^*)\times(\mathbf{n}\times\mathbf{e})
- 4\pi iZe\mu_{an}\,\omega\,(w'^*\boldsymbol{\sigma}w)\cdot\{\mathbf{n}(\mathbf{n}\times\mathbf{e}\cdot\mathbf{e}'^*) + (\mathbf{n}\times\mathbf{e})(\mathbf{n}\cdot\mathbf{e}'^*) - \mathbf{n}'(\mathbf{n}'\times\mathbf{e}'^*\cdot\mathbf{e}) - (\mathbf{n}'\times\mathbf{e}'^*)(\mathbf{n}'\cdot\mathbf{e}) - 2\mathbf{e}'^*\times\mathbf{e}\}, \tag{141.9}$$

where $\mathbf{n} = \mathbf{k}/\omega$, $\mathbf{n}' = \mathbf{k}'/\omega'$. The scattering cross-section is

$$d\sigma = \frac{1}{64\pi^2M^2}\,|M_{fi}|^2\,do'; \tag{141.10}$$

see (64.19).

For scattering by a charged particle, both $M^{(0)}_{fi}$ and $M^{(1)}_{fi}$ are non-zero. The accuracy used allows us to retain in $|M_{fi}|^2$ the terms $|M^{(0)}_{fi}|^2$ and $\mathrm{re}\,(M^{(0)}_{fi}M^{(1)*}_{fi})$. The first of these gives the Thomson scattering. The second becomes zero on averaging over the polarizations of the photons and hadrons. In scattering by a charged hadron, therefore, the corrections under consideration occur only in the polarization effects.

For scattering by an electrically neutral hadron, $M^{(0)}_{fi} = 0$ and the cross-section is determined by $|M^{(1)}_{fi}|^2$. After summing over the polarizations of the final particles and averaging over those of the initial particles, it is (in ordinary units)

$$d\sigma = \frac{2\mu^4\omega^2}{\hbar^2c^4}\,(2 + \sin^2\theta)\,do', \tag{141.11}$$

where $\theta$ is the photon scattering angle and the anomalous magnetic moment is equal to the total moment $\mu$. The angle dependence of this cross-section is the same as for antisymmetric scattering (see §60, Problem 2).

§ 142. Multipole moments of hadrons

Let us now consider the transition current corresponding to a diagram of the same kind as (138.2):

[diagram (142.1): hadron lines $p_1$ and $p_2$ with a photon line $k$ leaving the vertex]

but with the lines $p_1$ and $p_2$ pertaining to different particles (masses $M_1$ and $M_2$); the photon line $k = p_1 - p_2$ will be more conveniently represented here as leaving the vertex. The photon may be either virtual or real, the only necessary condition being $k^2 < (M_1 - M_2)^2$, so that the value $k^2 = 0$ is permissible. Thus the applications of this diagram include, in particular, processes of photon emission in transformations of nuclei as well as other particles (for nuclei, the initial and final particles are the same nucleus in different states).
The most interesting case here is that in which the wavelength of the photon is large compared with the characteristic "dimensions" of the particle (i.e. those which appear in its form factors, equal to the "radius" in the case of a nucleus). Then the transition current can be expanded in powers of $k$.†

† The following treatment is due to V. B. Berestetskiĭ (1948).

Note first of all that we must have

$$J_{fi} = 0 \quad\text{when}\quad k = 0, \tag{142.2}$$

since the limit $k \to 0$ corresponds to a potential constant in space and time, but such a potential has no physical significance and cannot give rise to real processes. The same conclusion can be reached by a more formal argument: the currents discussed in §138 were non-zero for $k = 0$ on account of the terms proportional to the 4-vector $P = p_1 + p_2$, but when $M_1 \neq M_2$ the product $Pk \neq 0$, and such terms are therefore forbidden by the condition for the current to be transverse.

This condition for the current $J_{fi} = (\rho_{fi}, \mathbf{J}_{fi})$ is, in three-dimensional form,

$$\mathbf{k}\cdot\mathbf{J}_{fi} = \omega\rho_{fi}, \tag{142.3}$$

and can be satisfied in two ways:

$$\mathbf{J}_{fi} = \omega\,\mathbf{v}(\mathbf{k}, \omega), \qquad \rho_{fi} = \mathbf{k}\cdot\mathbf{v}(\mathbf{k}, \omega) \tag{142.4}$$

or

$$\mathbf{J}_{fi} = \mathbf{k}\times\mathbf{a}(\mathbf{k}, \omega), \qquad \rho_{fi} = 0. \tag{142.5}$$

Here $\mathbf{v}$ is a polar vector and $\mathbf{a}$ an axial vector. The current is said to be of the electric and magnetic type respectively. According to (142.2), $\mathbf{v}$ and $\mathbf{a}$ are finite or zero when $\mathbf{k}, \omega \to 0$.

Let the photon energy $\omega \ll M_1$. Then the recoil may be neglected, and the final particle $M_2$ also may be regarded as being at rest (in the rest frame of $M_1$); $\omega = M_1 - M_2$ is a given quantity. The states of the particles $M_1$ and $M_2$ at rest are specified by three-dimensional spinors $w_1$ and $w_2$ of ranks $2s_1$ and $2s_2$, where $s_1$ and $s_2$ are the spins of the particles. The transition current must be a bilinear combination of $w_1$ and $w_2^*$. From the products of the components of these spinors, we can form irreducible tensors with ranks $l = s_1 + s_2, \ldots$
, \s\ - s2|; for a given /, they are true tensors or pseudotensors according to the internal parities of the particles Mi and M2. Apart from these tensors, we have available only the vector k. In order to obtain the first term in the expansion of the current in powers of k, we must form from these quantities a vector of the lowest possible power of k. This is done by taking the tensor of lowest rank and contracting it / — 1 times with the vector k. This will give the polar vector v or the axial vector a. Let Qim be the spherical components of the tensor formed from the wave amplitudes of the particles. The spherical components of the tensor of rank / - 1 formed from the components of k are |k|/_1 Yi_iffn(n), where n = k/to. From the general rule for the addition of spherical tensors (see QM, (107.3)), the spherical components of the vector v may be written V(4ir) 111 + 1 . ,i-, v / ! - l 1 l \n Y fnï where A takes the values 0 and ± 1 ; the choice of the common factor is explained below. Using formulae (7.16), we can express v in terms of spherical harmonic vectors: V= '" (2/ -V\)V$U(2l + 1)] ? ( - ^ ' " " Q ' - ^ V d + l)Yfä(n) + VlYfä(B)]. (142.6) Substitution in (142.4) gives the El transition current: J/i = '" (21 -\ffî[l(2l p " = + 1)] ? ( ~ 1 ) ' ' m Q ^ ' ^ V ( / + l)Yfä(n) + V l Y ) » ] , '" ( 2 t - ^ ! V ( 2 < - f l ) ? <-1>,~mQt-«Y««<nfc (142.7) <142-8> §142 Electrodynamics of Hadrons 640 |k| and co are distinguished in each formula, with a view to possible applications to real photons and also virtual photons, for which the two quantities are not equal. In (142.7) and (142.8) it is assumed that the spherical tensor Qlm (here denoted by Q\m) is a true tensor. If it is a pseudotensor (denoted by Q\%})9 then (142.6) defines the pseudovector a, and substitution in (142.5) gives the Ml transition current: J " = '" raVï(5ÎTT) M ? <-'>'-Q'ÄY!:»(n), I P/i=0. 
(142 9) ' The qyantities Q\em} and Q\™] are the hadron electric and magnetic multipole transition moments. Their role in hadron electrodynamics is exactly analogous to that of the corresponding quantities in electron electrodynamics. However, for electron systems these moments can in principle be calculated from the wave functions (as the matrix elements of the corresponding operators), whereas in hadron electrodynamics they occur as phenomenological quantities whose values are determined from experiment. The normalization of these quantities in (142.7)—(142.9) is chosen so as to agree with their definition in §46. This can be verified by regarding the currents (142.7)(142.9) as Fourier components of the transition current in the coordinate representation. For example, expanding the factor e~lk r in the integral PrAk) = fpfl(r)e-ik-rd3x (142.10) by means of (46.3), we get fffi(k) = Am1 2 Ytmin) ( P/i(r)Ytm(r/r)gl(|k|r) d3x. im J Retaining the term with the smallest value of I such that the integral is non-zero, and replacing gi (|k| r) for |k|r < 1 by the first term in its expansion (46.5), we return to (142.9), with Qfö=s yj^fxjrlpfi(r)Ylm(rlr)d3x, (142.11) in agreement with the definition (46.7). It can also be shown that, when applied to the emission of a real photon,-the formulae derived above give results already known. The amplitude of a transition with emission of a photon having momentum k = wn and polarization e = (0, e) is Mfi = - eV(4tr)e* • J,,-. (142.12) If the nucleus has definite values of the angular-momentum component Mt and Mf in the initial and final states, only one term remains in each sum over m in (142.7M142.9), namely that with m^M{-Mf. Since, from (16.23), the products § 142 641 Multipole Moments of Hadrons (X) ) (A) (A) Yfö • e * and Y}^ • e * (where A = ± 1 is the helicity of the photon, and e ±n) are proportional to D[m, we obtain again the formulae given in §48. 
The differential emission probability ist dw = 27rS[o> - (E, - E/)]|M/i|2d3k/2û>(2ir)3, (142.13) where E, and Ef are the initial and final energies of the nucleus. The total probability is found by summation over the polarizations and integration over d3fc. Substituting (142.7) or (142.9) in (142.12) and thence in (142.13), and performing the operations just mentioned, we again obtain (46.9) or (47.2). Formulae (142.7M 142.9) include all possible cases of the emission of a real photon. For virtual photons, there is another possible case which they do not include (R. H. Fowler, 1930). If the spins and parities of the initial and final states of the nucleus are the same, we can obtain from their wave amplitudes a scalar Q0, and from this a transition current of the form P/i = Qok2, Jfi = Qo<ok. (142.14) Qo is called the monopole (E0) transition moment. The corresponding transition amplitude for the emission of a real photon is zero, since e* • k = 0. The monopole current, however, may give rise to transitions involving the emission of a virtual photon. It is, moreover, the only such source when S\ = s2 = 0 and all the multipole moments are zero. The monopole current (142.14) is analogous to the electric quadrupole current as regards its dependence on a> and k. Accordingly, the moment Q0 also is a quantity of the same order as the quadrupole moment. The same conclusion can be reached by regarding (142,14) as the Fourier components of the current in the coordinate representation. Using in (142.10) the expansion of e~ikT in powers of k • r and assuming that pfi(r) is spherically symmetrical, we get P/i(k) = ^k 2 J P/l (r)r 2 d 3 x. Comparison with (142.14) shows that Qo= ~{j pfi(r)r2d3x. (142.15) The similarity to the quadrupole moment is obvious. PROBLEMS PROBLEM 1. 
Find the probability of ionization of an atom from the K shell because of the excitation energy $\omega$ of the nucleus (called internal conversion of $\gamma$-rays), in an M1 nuclear transition, neglecting the binding energy of the electron in the atom and the influence of the nuclear field on the wave functions of the electron.‡

† The factor $2\pi\delta$ in (142.13), replacing $(2\pi)^4\delta^{(4)}$ in (64.11), arises because momentum is not conserved when the recoil of the nucleus is neglected, and so only the conservation of energy remains.

SOLUTION. The process is described by the diagram (1), where $p_1$ and $p_2$ pertain to the nucleus at rest in different states, and $p = (m, 0)$ and $p' = (m + \omega, \mathbf{p}')$ are the 4-momenta of the initial and final electrons. This diagram corresponds to the amplitude

$$M_{fi} = -e^2\,\frac{4\pi}{q^2}\,\bar{u}(p')(\gamma J_{fi})u(p),$$

where $J_{fi}$ is the transition current of the nucleus. After summation over the final polarizations of the electron, and averaging over its initial polarizations, we get

$$\tfrac{1}{2}\sum_{\text{polar.}} |M_{fi}|^2 = e^4\,\frac{16\pi^2}{(q^2)^2}\,\{q^2(J_{fi}J^*_{fi}) + 4(J_{fi}p)(J^*_{fi}p)\},$$

using the fact that $J_{fi}q = 0$ and therefore $J_{fi}p = J_{fi}p'$. The conversion probability is calculated from

$$dw_{\text{conv}} = 2|\psi(0)|^2 \left(\frac{|\mathbf{p}|}{m}\,d\sigma\right)_{\mathbf{p}\to 0},$$

where $d\sigma$ is the cross-section for the scattering process represented by the diagram (1) with $p = (\varepsilon, \mathbf{p})$, and $\psi$ is the wave function of the atomic electron; for a K electron $|\psi(0)|^2 = (Z\alpha m)^3/\pi$. The factor 2 takes account of the two electrons in the K shell of the atom. The cross-section $d\sigma$ is

$$d\sigma = 2\pi\,\delta(\varepsilon + \omega - \varepsilon')\,|M_{fi}|^2\,\frac{1}{2|\mathbf{p}|}\,\frac{d^3p'}{2\varepsilon'(2\pi)^3};$$

cf. the footnote to (142.13). For M1 transitions, the current $J_{fi}$ must be taken from (142.9). The integration of $dw_{\text{conv}}$ over $d\varepsilon'$ removes the delta function, and the integration over $do'$ replaces $|\mathbf{Y}^{(m)}_{1m}|^2$ by unity. The conversion probability is thus expressed in terms of $|Q^{(m)}_{1m}|^2$.
But, according to (46.9), the probability $w_\gamma$ for the spontaneous emission of a photon in the same nuclear transition can be expressed in terms of the same quantity. The final result is

$$\frac{w_{\text{conv}}}{w_\gamma} = 2\alpha(Z\alpha)^3\,\frac{m}{\omega}\left(1 + \frac{2m}{\omega}\right)^{3/2},$$

this ratio being called the conversion coefficient.

‡ This approximation implies that the nuclear charge is small and the excitation energies $\omega$ are sufficiently large (but $1/\omega$ is assumed large compared with the dimensions of the nucleus). In practice, the approximation is somewhat unsatisfactory, and a more precise calculation has to take into account the Coulomb field of the nucleus.

PROBLEM 2. The same as Problem 1, but for an E1 nuclear transition.

SOLUTION. By the same method, with the transition current given by (142.7) and (142.8), we obtain

$$\frac{w_{\text{conv}}}{w_\gamma} = 2\alpha(Z\alpha)^3\,\frac{m}{\omega}\left(1 + \frac{2m^2}{\omega^2}\right)\sqrt{1 + \frac{2m}{\omega}}.$$

PROBLEM 3. The same as Problem 1, but for a monopole nuclear transition.

SOLUTION. With the transition current given by (142.14), the result is

$$w_{\text{conv}} = 16\alpha^2(Z\alpha)^3\,m^3\omega^2\left(1 + \frac{2m}{\omega}\right)^{3/2}|Q_0|^2.$$

Since monopole emission of a photon is impossible, $|Q_0|^2$ cannot be eliminated.

§ 143. Inelastic electron-hadron scattering

Elastic electron-hadron scattering has been discussed in §139. The problem of inelastic scattering may be formulated similarly. The only difference is that the final hadron state now corresponds to another hadron or several hadrons. The law of conservation of momentum (139.1) remains valid if $p'_h$ denotes the 4-momentum of the final hadron or the total 4-momentum of the group of hadrons formed in the scattering process. We thus have now $p'^2_h \neq p_h^2 = M^2$, where $M$ is the mass of the initial hadron.

With this difference, the inelastic scattering process is described by the same diagram (139.2). The lower vertex of the diagram is denoted by $J$, as in §138.
However, in contrast to (138.3) or (138.6), we shall not express the transition current in terms of the vertex operator and the amplitudes of the states, in order not to specify in advance the nature of the final hadron state. We can now write the scattering amplitude in a form analogous to (139.3):

$$M_{fi} = -\frac{4\pi\alpha}{q^2}\,\big(\bar{u}(p'_e)\gamma_\mu u(p_e)\big)\,J^\mu_{fi}; \qquad (143.1)$$

a similar amplitude has already been used in §142, Problem 1, where energy transfer to an electron was considered, and the amplitude has a similar structure in the problem of the excitation of nuclei by electrons.

We shall assume that the initial electron energy is so high that many hadrons can be formed in the final state, and consider the "inclusive" cross-section, for which only the electron momentum in the final state is fixed, with summation over all the hadron states. This differential cross-section is written as follows, in accordance with the formulae of §64:

$$d\sigma = \frac{1}{4I}\,\frac{d^3p'_e}{(2\pi)^3\,2\varepsilon'_e}\sum (2\pi)^4\delta^{(4)}(p'_e + p'_h - p_e - p_h)\,|M_{fi}|^2. \qquad (143.2)$$

The inclusive cross-section can depend only on three kinematic invariants, which may be determined by measurements on electrons only. The three invariants are

$$t \equiv q^2 = (p_e - p'_e)^2, \qquad s = (p_e + p_h)^2, \qquad (143.3)$$

and $p'^2_h$. The need to include the third invariant arises because, in contrast to the case of elastic scattering, $p'^2_h$, the "mass" of the final hadron state, is now unspecified. Instead of it, however, another invariant is more convenient, namely

$$\nu = q p_h. \qquad (143.4)$$

The relation between $\nu$ and $p'^2_h$ follows from $p'_h = p_h + q$:

$$p'^2_h = M^2 + t + 2\nu. \qquad (143.5)$$

If the initial hadron is stable (for instance, a proton), the rest energy of the final state exceeds $M$, i.e. $p'^2_h \geq M^2$, and from (143.5), since $t < 0$, we have

$$2\nu \geq |t|, \qquad (143.6)$$

the equality occurring for elastic scattering.

The kinematic invariants can be expressed in terms of the electron energies $\varepsilon_e$ and $\varepsilon'_e$ in the initial and final states and the scattering angle $\theta$. We shall assume the electron to be ultra-relativistic ($\varepsilon_e \gg m$, $\varepsilon'_e \gg m$), and neglect its mass.
Then, in the rest frame of the initial hadron (the laboratory system),

$$t = -4\varepsilon_e\varepsilon'_e\sin^2\tfrac{1}{2}\theta, \qquad \nu = M(\varepsilon_e - \varepsilon'_e), \qquad s - M^2 = 2M\varepsilon_e. \qquad (143.7)$$

Substituting (143.1) in (143.2) and summing as usual over the electron polarizations, we obtain the scattering cross-section for unpolarized electrons, which we write as

$$d\sigma = \frac{(4\pi\alpha)^2}{(q^2)^2}\,\frac{d^3p'_e}{(2\pi)^3\cdot 8M\varepsilon_e\varepsilon'_e}\,w_{\mu\nu}W^{\mu\nu}, \qquad (143.8)$$

or

$$d\sigma = \frac{\alpha^2}{4\pi M(q^2)^2}\,\frac{\varepsilon'_e}{\varepsilon_e}\,w_{\mu\nu}W^{\mu\nu}\,d\varepsilon'_e\,do'_e, \qquad (143.9)$$

where

$$w_{\mu\nu} = 4p_{e\mu}p_{e\nu} - 2(p_{e\mu}q_\nu + p_{e\nu}q_\mu) + q^2 g_{\mu\nu}, \qquad (143.10)$$

$$W^{\mu\nu} = \sum (2\pi)^4\delta^{(4)}(p'_h - p_h - q)\,J^\mu_{fi}J^{\nu*}_{fi}. \qquad (143.11)$$

The tensor $W^{\mu\nu}$ of course depends essentially on the properties of the hadron currents, and in general we can only pose the problem of its phenomenological structure, similarly to the problem of hadron form factors.

We first use the fact that the tensor structure of $W^{\mu\nu}$ must be determined only by the 4-vectors relating to the lower vertex of the diagram (139.2), i.e. $p_h$ and $q$. From these (and the metric tensor $g^{\mu\nu}$) five independent tensors can be constructed. The requirement of invariance under time reversal means that the tensor must be symmetrical, and there are four such. Lastly, the condition of current conservation, i.e. $W^{\mu\nu}q_\nu = 0$, $W^{\mu\nu}q_\mu = 0$, reduces the number of independent tensors to two. These may be taken as

$$T_1^{\mu\nu} = \frac{q^\mu q^\nu}{t} - g^{\mu\nu}, \qquad T_2^{\mu\nu} = \Big(p_h^\mu - \frac{\nu}{t}\,q^\mu\Big)\Big(p_h^\nu - \frac{\nu}{t}\,q^\nu\Big), \qquad (143.12)$$

and $W^{\mu\nu}$ be written as

$$W^{\mu\nu} = 4\pi M\,W_1 T_1^{\mu\nu} + (4\pi/M)\,W_2 T_2^{\mu\nu}. \qquad (143.13)$$

Substituting (143.10) and (143.13) in (143.8), we put the cross-section in the form

$$d\sigma = (W_2 + 2W_1\tan^2\tfrac{1}{2}\theta)\,d\varepsilon'_e\,d\sigma_{\mathrm{R}}, \qquad (143.14)$$

where

$$d\sigma_{\mathrm{R}} = \frac{\alpha^2\cos^2\tfrac{1}{2}\theta}{4\varepsilon_e^2\sin^4\tfrac{1}{2}\theta}\,do'_e$$

is the cross-section for scattering of an ultra-relativistic electron in a Coulomb field; cf. (80.7). We see that the cross-section is determined by two structure functions which depend on the two invariants $t$ and $\nu$.
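The laboratory-frame kinematics and the contraction of the electron tensor with the two hadron tensors can be checked numerically. The following is a minimal sketch, assuming a massless electron moving along the z axis, the metric signature (+, -, -, -), and the tensors in the form $w_{\mu\nu} = 4p_{e\mu}p_{e\nu} - 2(p_{e\mu}q_\nu + p_{e\nu}q_\mu) + q^2g_{\mu\nu}$, $T_1 = qq/t - g$, $T_2 = (p_h - \nu q/t)(p_h - \nu q/t)$. The helper names (`lab_vectors`, `w_contract`, `w_trace`, `contractions`) and any numerical inputs are illustrative, not taken from the text.

```python
import math

METRIC = (1.0, -1.0, -1.0, -1.0)  # assumed signature (+, -, -, -)


def dot(a, b):
    """Minkowski product of two 4-vectors given as (E, px, py, pz)."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))


def lab_vectors(M, e_in, e_out, theta):
    """4-momenta in the rest frame of the initial hadron (electron mass neglected)."""
    p_e = (e_in, 0.0, 0.0, e_in)
    p_e_out = (e_out, e_out * math.sin(theta), 0.0, e_out * math.cos(theta))
    p_h = (M, 0.0, 0.0, 0.0)
    q = tuple(a - b for a, b in zip(p_e, p_e_out))  # q = p_e - p_e'
    return p_e, p_h, q


def w_contract(p_e, q, a, b):
    """Contraction w_{mu nu} a^mu b^nu of the electron tensor with two vectors."""
    return (4.0 * dot(p_e, a) * dot(p_e, b)
            - 2.0 * (dot(p_e, a) * dot(q, b) + dot(p_e, b) * dot(q, a))
            + dot(q, q) * dot(a, b))


def w_trace(p_e, q):
    """Contraction w_{mu nu} g^{mu nu}, using g_{mu nu} g^{mu nu} = 4."""
    return 4.0 * dot(p_e, p_e) - 4.0 * dot(p_e, q) + 4.0 * dot(q, q)


def contractions(M, e_in, e_out, theta):
    """Return t, nu and the scalars w.T1 and w.T2 for the given lab kinematics."""
    p_e, p_h, q = lab_vectors(M, e_in, e_out, theta)
    t = dot(q, q)
    nu = dot(q, p_h)
    w_t1 = w_contract(p_e, q, q, q) / t - w_trace(p_e, q)
    shifted = tuple(ph - nu / t * qc for ph, qc in zip(p_h, q))
    w_t2 = w_contract(p_e, q, shifted, shifted)
    return t, nu, w_t1, w_t2
```

With these definitions the two contractions come out as $8\varepsilon_e\varepsilon'_e\sin^2\tfrac12\theta$ and $4M^2\varepsilon_e\varepsilon'_e\cos^2\tfrac12\theta$, so that $w_{\mu\nu}W^{\mu\nu} = 16\pi M\varepsilon_e\varepsilon'_e\cos^2\tfrac12\theta\,(W_2 + 2W_1\tan^2\tfrac12\theta)$, which is the angular structure of (143.14).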
If the physics of hadrons at high energies does not contain characteristic quantities having the dimensions of mass (the hypothesis of scale invariance), we may expect that the structure functions depend, at high energies, on the only dimensionless parameter, $t/\nu$. Then the functions $W_1$, $W_2$ must be functions of one variable:

$$W_1 = (M/\nu)F_1(t/\nu), \qquad W_2 = (M/\nu)F_2(t/\nu); \qquad (143.15)$$

the ratio $M/\nu$ is independent of $M$.

§ 144. Hadron formation from an electron-positron pair

Let us now consider the transformation of an electron-positron pair into hadrons. We denote the 4-momenta of the electron and positron by $p_-$ and $p_+$, and the (total) 4-momentum of the hadrons formed by $p_h$, with $p_- + p_+ = p_h$. The process is represented by the diagram

[diagram] (144.1)

The lower vertex here corresponds to the transition current from the vacuum to a hadron state $|n\rangle$, which we denote by $\langle n|J|0\rangle$ as in §104. The diagram (144.1) corresponds to the scattering amplitude

$$M_n = -(4\pi\alpha/q^2)\,\bar{u}(-p_+)\gamma_\mu u(p_-)\,\langle n|J^\mu|0\rangle. \qquad (144.2)$$

We shall consider the total cross-section $\sigma_h$ for annihilation into hadrons, i.e. sum over all final states $|n\rangle$. Then, in accordance with (64.18),

$$\sigma_h = \frac{1}{4I}\sum_n |M_n|^2(2\pi)^4\delta^{(4)}(p_h - q), \qquad (144.3)$$

where $q = p_- + p_+$. The mass of the electron will henceforward be neglected; then $q^2 = 2(p_-p_+)$, $I = \tfrac{1}{2}q^2$. As in §143, we write the cross-section in the form

$$\sigma_h = (4\pi)^2\,w_{\mu\nu}W^{\mu\nu}/2t^3, \qquad (144.4)$$

where

$$w^{\mu\nu} = \alpha\,(p_-^\mu q^\nu + p_-^\nu q^\mu - 2p_-^\mu p_-^\nu - \tfrac{1}{2}q^2 g^{\mu\nu}), \qquad (144.5)$$

$$W^{\mu\nu} = \alpha\sum_n (2\pi)^4\delta^{(4)}(p_h - q)\,\langle 0|J^\mu|n\rangle\langle n|J^\nu|0\rangle \qquad (144.6)$$

and $t = q^2 > 0$. Note that $t$ is the only kinematic invariant in this problem, with the three-ended diagram (144.1), and $q$ is the only 4-vector on which $W^{\mu\nu}$ can depend. Hence, by the requirement of current conservation, the tensor $W_{\mu\nu}$ may be written

$$W_{\mu\nu} = \tfrac{1}{2}\rho_h(t)\Big(\frac{q_\mu q_\nu}{t} - g_{\mu\nu}\Big), \qquad (144.7)$$

where $\rho_h(t)$ is the only invariant function, depending on the properties of the hadron current and determining the annihilation cross-section. Substitution of (144.5)-(144.7) in (144.4) gives

$$\sigma_h = (4\pi^2\alpha/t^2)\,\rho_h(t).$$
(144.8)

Note that the function $\rho_h(t) = -\tfrac{2}{3}W^\mu_{\ \mu}$ is exactly the same as $\rho(t)$ defined in (104.9) if the currents in the latter equation are taken to be hadron currents. Moreover, $\rho(t)$ is the spectral density of the self-energy function $\Pi(t)$:

$$\operatorname{im}\Pi(t) = -\pi\rho(t).$$

In the lowest approximation with respect to $\alpha$, which is being considered here, the function $\Pi$ is the same as the polarization operator $\mathcal{P}$. In this approximation, therefore, $\rho_h(t)$ is also the spectral density of the hadron contribution to the polarization operator:

$$\operatorname{im}\mathcal{P}_h(t) = -\pi\rho_h(t). \qquad (144.9)$$

Using the dispersion relation (111.13) and expressing $\rho_h$ in terms of $\sigma_h$ by (144.8), we get a formula, (144.10), which expresses the hadron contribution to the polarization of the vacuum in terms of the measured cross-section for annihilation into hadrons.

Note that we could, in exactly the same way, solve the problem of electron-positron pair annihilation to form a muon pair (in the first approximation with respect to $\alpha$, only one such pair is formed). Corresponding to the formula (144.8), the result is

$$\sigma_\mu = (4\pi^2\alpha/t^2)\,\rho_\mu(t), \qquad (144.11)$$

where $\rho_\mu(t)$ is the spectral density of the muon polarization of the vacuum. It differs from the electron polarization only in that the electron mass $m$ is replaced by the muon mass $\mu$, and, from (113.8), it is

$$\rho_\mu(t) = (\alpha/3\pi)(t + 2\mu^2)\sqrt{1 - 4\mu^2/t}.$$

Substitution in (144.11) brings us back to the result already derived in §81, Problem 8.
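As a numerical illustration of formula (144.11) of §144, the sketch below evaluates the cross-section for annihilation into a muon pair from the spectral density, taking $\sigma_\mu = (4\pi^2\alpha/t^2)\rho_\mu(t)$ with $\rho_\mu(t) = (\alpha/3\pi)(t + 2\mu^2)\sqrt{1 - 4\mu^2/t}$ and $\rho_\mu = 0$ below the threshold $t = 4\mu^2$. The numerical constants (fine-structure constant, muon mass, conversion factor from GeV$^{-2}$ to nanobarns) and the function names `rho_mu` and `sigma_mu` are assumptions introduced for the example, not values quoted in the text.

```python
import math

ALPHA = 1.0 / 137.035999   # fine-structure constant (assumed numerical value)
MUON_MASS = 0.1056584      # muon mass in GeV (assumed numerical value)
GEV2_TO_NB = 0.3893794e6   # (hbar c)^2 in nb * GeV^2 (assumed conversion factor)


def rho_mu(t, mass=MUON_MASS, alpha=ALPHA):
    """Spectral density of the muon vacuum polarization; t in GeV^2."""
    if t <= 4.0 * mass * mass:
        return 0.0  # below the pair-production threshold
    return (alpha / (3.0 * math.pi)) * (t + 2.0 * mass * mass) * math.sqrt(
        1.0 - 4.0 * mass * mass / t
    )


def sigma_mu(t, mass=MUON_MASS, alpha=ALPHA):
    """Cross-section in GeV^-2 for annihilation into a muon pair at q^2 = t."""
    return 4.0 * math.pi ** 2 * alpha / t ** 2 * rho_mu(t, mass, alpha)


if __name__ == "__main__":
    for sqrt_t in (0.25, 1.0, 10.0):  # centre-of-mass energies in GeV
        t = sqrt_t ** 2
        print(f"sqrt(t) = {sqrt_t:5.2f} GeV   sigma = {sigma_mu(t) * GEV2_TO_NB:.4g} nb")
```

Far above threshold the expression reduces to $4\pi\alpha^2/3t$, which gives a quick check on the normalization used in the sketch.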
