An effective theory of initial conditions in inflation

Richard Holman

An effective theory of initial conditions in inflation

Richard Holman

visibility

…

description

21 pages

link

1 file

We examine the renormalization of an effective theory description of a general initial state set in an isotropically expanding space-time, which is done to understand how to include the effects of new physics in the calculation of the cosmic microwave background power spectrum. The divergences that arise in a perturbative treatment of the theory are of two forms: those associated with the properties of a field propagating through the bulk of space-time, which are unaffected by the choice of the initial state, and those that result from summing over the short-distance structure of the initial state. We show that the former have the same renormalization and produce the same subsequent scale dependence as for the standard vacuum state, while the latter correspond to divergences that are localized at precisely the initial time hypersurface on which the state is defined. This class of divergences is therefore renormalized by adding initial-boundary counterterms, which render all of the p...

An effective theory of initial conditions in inflation Hael Collins∗ Department of Physics, University of Massachusetts, Amherst MA 01003 R. Holman† arXiv:hep-th/0507081v1 7 Jul 2005 Department of Physics, Carnegie Mellon University, Pittsburgh PA 15213 (Dated: February 1, 2008) We examine the renormalization of an effective theory description of a general initial state set in an isotropically expanding space-time, which is done to understand how to include the effects of new physics in the calculation of the cosmic microwave background power spectrum. The divergences that arise in a perturbative treatment of the theory are of two forms: those associated with the properties of a field propagating through the bulk of space-time, which are unaffected by the choice of the initial state, and those that result from summing over the short-distance structure of the initial state. We show that the former have the same renormalization and produce the same subsequent scale dependence as for the standard vacuum state, while the latter correspond to divergences that are localized at precisely the initial time hypersurface on which the state is defined. This class of divergences is therefore renormalized by adding initial-boundary counterterms, which render all of the perturbative corrections small and finite. Initial states that approach the standard vacuum at short distances require, at worst, relevant or marginal boundary counterterms. States that differ from the vacuum at distances below that at which any new, potentially trans-Planckian, physics becomes important are renormalized with irrelevant boundary counterterms. PACS numbers: 11.10.Gh,11.10.Hi,11.15.Bt,98.80.Cq I. INTRODUCTION This article is the second in a series that develops a renormalizable effective theory description of the initial state for inflation. The first article [1] described the construction of an effective initial state in Minkowski space while this one generalizes the setting to an isotropically expanding space-time. The main idea is to establish a perturbative description of the signals of new physics as it affects the short-distance structure of the state, showing explicitly how any divergences associated with this short-distance structure are renormalized. The accelerated expansion of the universe during inflation [2] produces an extremely rapid growth in the size of a causally connected region while at the same time the Hubble size remains essentially unchanged. The exact amount of expansion depends upon the details of the inflationary model, but with 60–70 e-folds of inflation, the entire universe seen today could have grown from a single, suitably small, causally connected region. This mechanism can explain the extreme uniformity of the universe observed on large scales or at early times, but even more importantly, inflation also predicts a tiny departure from perfect homogeneity caused by quantum fluctuations which are similarly stretched to vast scales. The spectrum of these fluctuations is exactly the form of the synchronized acoustic oscillations which have been measured precisely by the Wilkinson Microwave Anisotropy Probe (WMAP) [3]. Most models for inflation have no difficulty in producing the required amount of inflation. Typically they produce sub- ∗ Electronic † Electronic address: [email protected] address: [email protected] stantially more and as a consequence the structures on the largest cosmological scales today would have had their origin in quantum fluctuations which occurred at sub-Planckian lengths during inflation. This peculiar feature of inflation, its “trans-Planckian problem” [4], suggests the possibility that extremely short-distance physics, well beyond that currently accessible in accelerator experiments, could be imprinted on the very largest of observable scales. Although the term suggests physics above the Planck scale, here we shall generally refer to any sort of new physics sufficiently above the Hubble scale during inflation as “trans-Planckian.” With this opportunity and with the prospect of significantly better measurements of the cosmic microwave background and the large scale structure, it is increasingly important to have an accurate estimate of the generic trans-Planckian signal. One approach for determining this signal is to choose a specific model for what happens above the Planck scale and then to calculate its corrections to the primordial fluctuation spectrum [5, 6, 7, 8, 9]. Such models have been very useful in providing an estimate of the typical size for the transPlanckian signal. However, the details of a particular model may have relatively little motivation from lower energy phenomenology or might simply not correspond to what actually occurs in nature. A second approach does not attempt to form a complete picture of the physics above the Planck scale, but rather seeks to develop an effective theory description of its signal [1, 10, 11, 12, 13, 14]. Any new physics near the Planck scale first appears as a generic set of effects during inflation which are specified by a small number of parameters. While they assume specific values in any particular model for the new physics, to a low energy observer they simply appear as free parameters to be fixed experimentally. The effective approach is based on a perturbative expansion that uses the smallness 2 of the ratio of the two natural scales—H, the Hubble scale during inflation, and M, the scale associated with the possible new physics. If the signal is suppressed only by H/M, it could well be observable either by future cosmic microwave satellites such as Planck [15] or, still more likely, by future surveys [16] of the large scale structure, which traces the same primordial fluctuations. From the perspective of the effective theory principle, new physics can appear in either the time evolution of the inflaton and its fluctuations or in their “initial” states. The first of these—how the system evolves—is more familiar since the evolution of the quantum fluctuations of the inflaton is determined by its interaction Hamiltonian. The form of the general set of possible corrections that we can add to this Hamiltonian, encoding the effects of the unknown physics, is rather constrained by the space-time symmetries. Given these constraints, the size of the corrections from the unknown physics relative to the leading prediction for an inflationary model is usually suppressed by a factor of (H/M)2 [10]. The other ingredient—the state of the inflaton—is more directly related to the trans-Planckian problem because it is the details of the initial state we have chosen which are being stretched to vast scales. The leading correction from these effects is typically much less suppressed, scaling instead as H/M [5, 6, 7, 8, 9]. The standard view is that the correct state to choose is the vacuum state. In an expanding background, this vacuum corresponds to the maximally symmetric state that matches with the Minkowski space vacuum at very short distances, where the curvature of the space-time should be negligible. This requirement is made so that when a quantum field theory is placed in a curved background, it inherits the same renormalizability it had in Minkowski space, although its behavior can be quite different on large scales. Despite the apparent simplicity of this view, it contains an inherent ambiguity. Even in Minkowski space, there is no reason to trust that the vacuum, defined in terms of the eigenstates of a low energy effective theory, has the correct structure when extrapolated to arbitrarily short distances. The true vacuum, determined by the eigenstates of the complete theory, most likely departs from a perfect agreement with the extrapolated low energy vacuum on sufficiently small scales. An effective theory description of an initial state then provides a method for characterizing these departures such that they only have a small effect on long-distance measurements and moreover do not lead to any uncontrolled divergences which would prevent a perturbative calculation [1]. The effective theory principle [17] provides a powerful prescription for understanding some phenomena over a limited range of length or energy scales, while at the same time parameterizing the possible leading signals of unknown physics that lies just beyond those scales. For a quantum field theory propagating through the bulk of space-time, this idea is implemented by first identifying all of the observed fields and symmetries of a physical system and then constructing all possible operators out of these fields that are invariant under all of the symmetries. A finite, and usually small, subset of these operators describes the theory well at very large distances, or equivalently at low energies. The remaining infinite set of op- erators represents the generic signal of the hidden degrees of freedom whose dynamics are associated with a much smaller characteristic length, 1/M. What makes the theory calculable at low energies, E ≪ M, is that all of the operators in this second set are naturally suppressed by some nth power of E/M, with only a finite number appearing at each order n. For an experiment conducted at an energy E and which measures an observable to an accuracy δ, we need only include the set of operators whose order n satisfies n E ≥ δ. (1.1) M When an experiment finally probes energies of the order E ∼ M, where the effective theory becomes nonpredictive, it uncovers the hidden degrees of freedom and their symmetries and the original effective theory is replaced by a new effective theory including these new fields which is applicable up to some still higher energy. The point of an effective description of an initial state is similarly to divide the aspects of a state into components which are important either at long or at short distances. A state constructed only from the former agrees with the standard vacuum at arbitrarily short distances. Yet even for such states, if the difference between the actual state and the standard vacuum diminishes sufficiently slowly, some new divergences appear in the perturbative corrections. These divergences only occur at the initial time, precisely where the state was defined, and so must be cancelled by adding new counterterms, localized at the initial boundary, which are marginal or relevant according to a boundary action. The short-distance components, which contain the effects of trans-Planckian physics, describe states which diverge from the standard vacuum at distances below 1/M. This behavior is completely consistent as an effective theory and only produces initial-time divergences which are cancelled by adding irrelevant counterterms to the boundary action. As with the standard setting for an effective theory, for a measurement at low energies, which for inflation corresponds to the Hubble scale H, the effects of the trans-Planckian components of the state are suppressed by powers of H/M leading to a completely predictive theory. An effective theory of any form is always inherently applicable only up to a particular energy scale. In an expanding background, this property also sets a limit to the earliest possible time at which it can be applied while still remaining perturbative [12, 13]. While this limit implies that we should choose our “initial” state no earlier than that time at which redshifting would undo the suppression of the ratio H/M, it also implies that we should not use a formalism, such as the S-matrix, which would require integrating over any times earlier than this initial time. What underlies the success of an overall space-time view such as the S-matrix in Minkowski space is that scales do not alter over time. If we define some departure between the standard vacuum and the true vacuum at some very small scale in an asymptotic past, that scale remains unchanged during the period when the parts of the system interact and further on into some asymptotic future. In contrast, with the continuous 3 stretching of scales during inflation, it is not possible to apply a such an overall space-time view since the asymptotic states would need to be defined in a regime where all the features responsible for the shape of the microwave background would have been infinitesimally smaller than the Planck scale. Instead, the correct approach is to use the Hamiltonian to evolve the entire system continuously forward, starting from a state defined at an appropriate initial time [18, 19, 20, 21, 22]. The next section begins by describing how to set the initial state in a Robertston-Walker space-time by using a boundary condition specified along a spacelike surface. To renormalize the theory, it is important to extract the exact behavior at asymptotically short distances, which is accomplished here by applying an adiabatic expansion for the state. For a full timedependent description of the effects of a general initial state, the theory must describe how the information in this initial state propagates forward, so in Sec. III we construct a propagator which is also consistent with the initial boundary condition. Our ultimate interest is to calculate the generic transPlanckian signal in the microwave background [23] which, although a tree-level calculation, implicitly assumes that perturbative corrections are small and finite. It is therefore necessary to establish the renormalizability of the the theory for a general initial condition. This calculation begins in Sec. IV with a statement of an appropriate renormalization condition for this setting and its implications for the renormalization and running of the bulk parameters of the theory are presented in Sec. V. The renormalization and running of the initial condition are examined in Sec. VI, which shows how the renormalizable and nonrenormalizable classes of initial conditions are associated with relevant or irrelevant boundary counterterms respectively. We also show how the standard CallanSymanzik equation applies also to the running of the initial conditions.1 Section VII concludes with a brief outline of how initial state effects must be treated to address correctly the question of back-reaction as well as to calculate the expected trans-Planckian signal in the primordial power spectrum. II. Z √ d x −g 12 gµν ∇µ ϕ∇ν ϕ − 21 ξRϕ2 − 12 m2 ϕ2 4 where g = det(gµν ) and R is the scalar On very large scales or at early times, the universe appears highly ho- 2 An unrelated but quite interesting appearance of a Callan-Symanzik equation in an inflationary setting occurs for a flow within the space of inflationary models [24]. Our convention for the signature of the Riemann curvature tensor is defined by Rλµνρ = ∂ρ Γλµν − ∂ν Γλµρ + Γλρσ Γσµν − Γλνσ Γσµρ and the scalar curvature corresponds to R = gµν Rλµλν . (2.2) although the setting can be readily generalized to less symmetric backgrounds as well. This general Robertson-Walker metric can also be expressed in a conformally flat form, ds2 = gµν dxµ dxν = a2 (η) dη2 − d~x · d~x , (2.3) by defining a conformal time with η(t) = Z t dt ′ 0 a(t ′ ) (2.4) and setting a(η) = a(η(t)). We shall work in this conformal coordinate system since the conformal time has a useful physical interpretation. In units where c = 1, the conformal time corresponds to the distance traveled by a massless particle since the earliest of times. Thus, simultaneous points separated by a spatial distance greater than η were never in causal contact. The conformal time is moreover used in the standard inflationary calculations of the primordial power spectrum. The simplest curved background of this Robertson-Walker form is de Sitter space, a(η) → 1 , Hη ds2 = dη2 − d~x · d~x . H 2 η2 (2.5) The curvature of de Sitter space is everywhere constant, R = 12H 2, so that in this case the mass term and the curvature term in the action are redundant and the curvature term can be absorbed by a suitable rescaling of the mass. de Sitter space is sometimes used as an idealization of the conditions expected to occur during inflation, where the background curvature, while not constant, varies only slowly over time. We shall consider here a completely general isotropically expanding background. In such a background, the rate of expansion is characterized by the Hubble parameter, H(η) = a′ 1 ∂a ≡ , a a ∂η (2.6) in terms of which the scalar curvature is R(η) ≡ (2.1) curvature.2 1 ds2 = dt 2 − a2 (t) d~x · d~x, BOUNDARY CONDITIONS IN AN EXPANDING SPACE-TIME We begin with the action for a free scalar field propagating in classical curved background, S= mogeneous and isotropic so we shall consider backgrounds where the metric only depends on time, 6 a′′ 6 ′ H + H2 = 2 . 2 a a a (2.7) Varying the action with respect to the field yields the KleinGordon equation, ∇2 ϕ + ξRϕ + m2ϕ = 0. (2.8) Since the spatial part of the metric is flat, the spatial eigenmodes are plane waves so that the expansion of the field, as a linear sum of creation and annihilation operators, can be expressed as ϕ(η,~x) = Z i d 3~k h i~k·~x ∗ −i~k·~x † U (η)e a + U (η)e a . (2.9) k ~ k ~ k k (2π)3 4 In Minkowski space, the time-dependent eigenmodes, Uk (η), are also simple exponential functions but in an expanding background these mode functions are instead determined by (2.10) Uk′′ + 2HUk′ + k2 + ξa2R + a2m2 Uk = 0 where k ≡ |~k|. The solutions to this Klein-Gordon equation are completely determined once we have specified two constants of integration; actually, one of these is already fixed by the equal time commutation relation. If the creation and annihilation operators are normalized to satisfy (2.11) a~k , a~†′ = (2π)3 δ3 (~k −~k′ ), k and the standard equal time commutation relation holds between the field ϕ(x) and its conjugate momentum π(x), π(η,~x), ϕ(η,~y) = −iδ3 (~x −~y), π = a2 ϕ′ , (2.12) then the mode functions must satisfy the following Wronskian condition, (2.13) a2 Uk ∂ηUk∗ − Uk∗ ∂ηUk = i. The second constant of integration is then fixed by some assumption about the state that is appropriate for the physical setting being examined. A. Uk (η) = ck In the usual calculation of the primordial spectrum of perturbations, the state chosen during inflation is assumed to be the vacuum. In Minkowski space this statement is completely unambiguous and is little affected by the evolution of the state over time. For example, we could fix the state at some initial time to be the lowest energy eigenstate for the free Hamiltonian. This condition can be expressed more explicitly either by simply stating that the modes Ukflat (t) are the positive energy eigenmodes, (2.16) If we define our state again at some initial time, η = η0 , rescaling coordinates so that a(η0 ) = 1, (2.17) then over short intervals—such that (η − η0 )H(η0 ) ≪ 1— away from this spacelike surface we can also neglect the redshifting of the scales. In this limit, the change in conformal time is essentially the same as the change in cosmic time, 1 ȧ(t0 ) t − t0 (t − t0 ) + · · · ≈ t − t0 , (2.18) 1− η − η0 = a(t0 ) 2 a(t0 ) (2.14) Uk (η ≈ η0 ) = ck eik(η−η0 ) e−ik(η−η0 ) √ + dk √ + ···. 2k 2k (2.19) We then obtain the usual vacuum structure at intervals where the background curvature is not noticeable by setting dk = 0. This prescription defines the Bunch-Davies [25] vacuum state, |Ei.3 We shall write the modes associated with this state as UkE (η). Just as in the case of the Minkowski vacuum above, we can specify the state either by simply stating that its modes are asymptotically proportional to the Minkowski vacuum modes, UkE (η) → up to an arbitrary phase, or equivalently by establishing an initial condition on the modes of the form ∂ flat U (t) ∂t k e−ik(η−η0 ) eik(η−η0 ) √ + dk √ + ···. a(η) 2k a(η) 2k so that the mode functions become the positive and negative energy modes of Minkowski space, Initial states e−ikt Ukflat (t) ∝ √ , 2k at scales much lower than M, the details of the state above M should be unimportant, since in Minkowski space there is no evolution of scales—what we mean by a small momentum compared to M, once specified, remains fixed at all times. The absence of these two principles which held in flat space—the existence of a conserved Hamiltonian and the time-independence of scales—affects how we choose an appropriate state for inflation. At very short intervals and over brief changes in time, the curvature of the background is not apparent and so we can choose our modes so that they match with the Minkowski modes in this regime. More specifically, the leading behavior of the solution to √the Klein-Gordon equation (2.10) at short distances, k ≫ a R, am, is described by e−ik(η−η0 ) √ a(η) 2k as k → ∞, (2.20) or by a differential initial condition on the modes, t=t0 = −ikUkflat (t0 ). (2.15) Here we have neglected the effects of the mass since in inflation it is generally assumed to be small. If the free theory begins to break down at some scale M, so that the free Klein-Gordon equation receives nonnegligible corrections at this scale, the modes used in Eq. (2.14) might not be the correct eigenmodes of the full Klein-Gordon equation for k > M. In terms of the differential form of the initial condition in Eq. (2.15), the simple constant of proportionality −ik could receive more general corrections such as those which scale as k/M to some power. As long as we only evaluate processes ∇nUkE (η0 ) = −iωk [η0 ]UkE (η0 ). (2.21) The differential operator here is the covariant derivative in the direction normal to the initial surface, ∇n ≡ nµ ∇µ . 3 (2.22) Our notation is adopted from that of de Sitter space where this state is also frequently called the Euclidean vacuum, since it is the unique invariant state that is regular when analytically continued to lower half of the Euclidean sphere. 5 For a simple spacelike surface such as η = η0 , the unit normal to the surface is nµ = a(η),~0 . We shall frequently abbreviate ωk [η0 ] by ωk , but it is important to remember that it can depend on the initial time, ωk [η0 ] = i∂ηUkE (η0 ) . a(η0 )UkE (η0 ) (2.23) When considering more general initial states it will be useful to choose a particular phase convention for the Bunch-Davies modes, so we hereafter shall let UkE (η0 ) = UkE∗ (η0 ). (2.24) Although we have chosen the state by considering the asymptotic behavior of the modes at large momenta, k ≫ √ a R, as we continue to yet shorter distances we encounter the same possibility mentioned before for flat space—that the free Klein-Gordon equation could receive substantial corrections for k ∼ M and above. For example, M could correspond to the scale at which some new dynamics becomes strongly interacting with the inflaton or it could represent the scale at which the classical description of gravity breaks down. To include such effects, let us consider a more general initial state determined by the boundary condition, ∇nUk (η0 ) = −iϖk [η0 ]Uk (η0 ). (2.25) From a low energy perspective, where we assume that the Bunch-Davies modes describe the “vacuum” extrapolated to arbitrarily short distances, it is convenient to express the modes for this general state as a transformation of the BunchDavies modes, (2.26) Uk (η) = Nk UkE (η) + eαk UkE∗ (η) , where the initial state structure function eαk is eα k = ωk − ϖk . ω∗k + ϖk (2.27) Both the general modes and the Bunch-Davies modes obey the Wronskian condition given earlier in Eq. (2.13), so the normalization is completely determined by eαk , up to an arbitrary phase, 1 Nk = p . ∗ 1 − eαk +αk (2.28) Note that although we have chosen a particular phase for the Bunch-Davies modes with Eq. (2.24), we can always choose an arbitrary relative phase between the two terms in a general mode by suitably choosing the phase of eαk . The structure function eαk describes how the state differs from the assumed vacuum at different scales. At very large distances, if we are considering an excited state the structure function need not vanish; but the signals of new physics should not be very apparent since the approximation that the theory is that of a nearly free scalar field is good far below M. In this regime, it is natural for the effects of new physics to be suppressed by powers of k/M. Our goal here is to implement an effective theory description of the initial state. From this perspective, the state in Eq. (2.26) is only meant to be appropriate for observables measured at scales well below M, and not that for a complete theory which is applicable to measurements made at any scale. As a consequence, the effective states can contain structures which are the analogues of the nonrenormalizable operators used in an effective field theory Lagrangian. In both cases, the theory remains predictive at long distances since there is a natural small parameter given by the ratio of the energy or momentum of the process being studied to the scale of new physics M. In both cases too, renormalization of the theory can introduce further higher order corrections so that an infinite number of constants is often needed to make a prediction to arbitrary accuracy; but to any finite accuracy, only a small number are needed since the rest are suppressed by high powers of the small ratio of scales. At scales near M, the effective Lagrangian description breaks down but at these energies we should be able to observe the dynamics which produced the nonrenormalizable operators in the low energy effective theory. Similarly, once we probe short distances directly, we should see corrections to the Klein-Gordon equation and the modes Uk (η) given in Eq. (2.26) should be replaced with the correct short-distance eigenmodes. The remaining significant departure from Minkowski space is the constant redshifting of scales inherent to an expanding background, which is responsible for the trans-Planckian problem. The effective theory description of the initial state relies upon the smallness of the measured scale, kexp , compared with the scale of new physics, M, but this ratio is also influenced by the expansion, a(ηnow ) kexp ≪ 1, a(η0 ) M (2.29) and the earliest time for which perturbative calculation works is one which does quite saturate this bound, a(ηearliest ) kexp 0 ∼ . a(ηnow ) M (2.30) Although this time dependence of scales limits the applicability of the effective theory, it should not be seen as anything mysterious or that η0 must be chosen either at this bound or at a time when some nontrivial dynamics is occurring. To study the inflationary prediction for the cosmic microwave background power spectrum, for example, it is sufficient to choose an “initial time” when all of the features of the currently observed power spectrum are just within the horizon during inflation and which still satisfies the condition in Eq. (2.29) for a well behaved perturbation theory. What the effective theory approach accomplishes is not a complete description of the theory to an arbitrarily early time, but rather it provides a completely generic parameterization of the effects of these earlier epochs or of higher scale physics once the state has entered a regime where they can be treated perturbatively. 6 where we have introduced an effective, time-dependent mass defined by B. Adiabatic modes Solving for the analytic form of the eigenmode functions in a completely general Robertson-Walker background is frequently not possible, so instead we must find a consistent method for approximating the modes. A standard approach for approximating the Bunch-Davies modes is provided by the adiabatic modes. The adiabaticity here refers to assuming that the time derivatives are small compared to the scales we are examining. For our purpose of renormalizing the theory, we usually need to know the detailed form for the modes for large momenta, while the exact dependence at longer wavelengths, which is important for detailed finite effects, does not affect the divergences or the accompanying running of the parameters of the theory. We begin by writing the Bunch-Davies mode functions in a form that superficially resembles that of the flat space modes, UkE (η) = e−i Rη ′ ′ η0 dη Ωk (η ) p . a(η) 2Ωk (η) (2.31) The generalized frequency function Ωk (η) is determined by the differential equation, h 1 Ω′′k 3 Ω′2 1i k + Ω2k = k2 + a2 m2 + ξ − a2 R − , (2.32) 6 2 Ωk 4 Ω2k which is derived by substituting the modes in Eq. (2.31) into the Klein-Gordon equation, Eq. (2.10). We shall see by the end of this section that these modes indeed satisfy the BunchDavies condition. In the adiabatic approximation, time derivatives, whether of the scale factor a(η) or of the generalized frequency Ωk (η), are small. This assumption allows Eq. (2.32) to be solved through a series of successively better approximations, (0) (2) Ω2k (η) = [Ωk (η)]2 + [Ωk (η)]2 + · · · , starting at zeroth order with q (0) Ωk (η) = k2 + a2 (η)m2 , (2.33) (2.34) and then proceeding iteratively, using a lower order solution to fix the next order, (0)′ 2 (0)′′ 1 Ωk 3 Ωk 1 (2) 2 + . (2.35) [Ωk ] = ξ − R − 6 2 Ω(0) 4 Ω(0) k k In particular, inserting the zeroth order expression into this equation yields h 1 (H ′ + 2H 2)a2 m2 1i (2) [Ωk ]2 = ξ − R − 6 2 k 2 + a 2 m2 2 4 4 5 H a m + . (2.36) 4 (k2 + a2m2 )2 The form of the generalized frequency becomes a little simpler if we retain only the leading terms in the limit, k ≫ am; to second order in the adiabatic solution we have Ω2k (η) = k2 + a2(η)M 2 (η) + · · · , (2.37) h M 2 (η) ≡ m2 + ξ − 1i R(η). 6 (2.38) Note that in the extreme ultraviolet limit, the leading behavior simplifies to Ωk (η) ≈ k (2.39) reproducing the correct asymptotic behavior of a BunchDavies mode, as in Eq. (2.20). III. PROPAGATION In a pure Bunch-Davies state, the propagator straightforwardly generalizes the Feynman propagator for the Minkowski vacuum, − iGEF (x, x′ ) = Θ(η − η′ ) hE|ϕ(x)ϕ(x′ )|Ei + Θ(η′ − η) hE|ϕ(x′ )ϕ(x)|Ei. (3.1) From the perspective of the low energy theory, this initial state is empty of all information about any higher scale physics, so this Green’s function only propagates the information associated with sources moving through the bulk of space-time and no separate information propagates from the initial surface, δ4 (x − x′ ) ∇2x + m2 GEF (x, x′ ) = p . −g(x) (3.2) For a more general initial state, the Green’s function will need also to include consistently the propagation of this initial state information. One way to express this consistency is to impose the same boundary condition on the propagator as that which determined the mode functions. For the Bunch-Davies propagator, written in its spatial momentum representation, GEF (x, x′ ) = Z d 3~k i~k·(~x−~x′ ) E e Gk (η, η′ ) (2π)3 (3.3) with − iGEk (η, η′ ) = Θ(η − η′ )UkE (η)UkE∗ (η′ ) + Θ(η′ − η)UkE∗ (η)UkE (η′ ), (3.4) we find that in the physical region—those times subsequent to the initial time—this propagator is consistent with a boundary condition for each of its arguments, nµ ∇µ GEk (η, η′ ) nµ ∇′µ GEk (η, η′ ) η=η0 η′ >η0 η′ =η0 η>η0 = iω∗k GEk (η0 , η′ ) = iω∗k GEk (η, η0 ) η′ >η0 η>η0 . (3.5) Note that the right side is the complex conjugate of the coefficient of the boundary condition defined for the modes. 7 The reason is that for η = η0 and η′ > η0 , for example, the non-vanishing Θ-function is that accompanying the UkE∗ (η)UkE (η′ ) factor and not its conjugate, so the timederivative acts upon UkE∗ (η). The nontrivial new element is that the normal derivatives also act on the Θ-functions associated with the time-ordering. In the Bunch-Davies propagator, the opposite ordering of the arguments in the two Θ-functions causes the resulting δfunction terms to cancel between its forward and backward propagating terms. In fact, the boundary condition in Eq. (3.5) is not quite sufficient to determine completely the propagator, but choosing the propagator to be locally time-translationally invariant over infinitesimally short intervals fixes the remaining ambiguity. To understand how the propagator is determined in more detail, we shall make a short excursion into Minkowski space. There we can write a propagator very generally as ′ ′ < ′ G̃k (t,t ′ ) = Θ(t − t ′ ) G̃> k (t,t ) + Θ(t − t) G̃k (t,t ) (3.6) where we shall use tildes to denote the propagator in flat space described by the coordinates (t,~x). The Wightman functions ′ G̃>,< k (t,t ) are partially fixed by three conditions: the propagator should be continuous at t = t ′ , < G̃> k (t,t) = G̃k (t,t), and away from the point-source, it should satisfy the KleinGordon equation, h d2 i i >,< ′ ′ 2 2 (t,t ) = 0 = + k + k G̃ G̃>,< k k (t,t ). dt 2 dt ′2 If we further apply a boundary condition at t0 analogous to the Bunch-Davies condition above, t=t0 , t ′ >t0 ∂t ′ G̃Ek (t,t ′ ) t ′ =t , t>t 0 0 = ikG̃Ek (t0 ,t ′ ) = t ′ >t0 E ikG̃k (t,t0 ) t>t , 0 ∂t ′ G̃k (t,t ) ′ t=t0 , t ′ >t0 t ′ =t0 , t>t0 = iκG̃k (t0 ,t ′ ) = iκG̃k (t,t0 ) t ′ >t0 t>t0 ; (3.13) applying this condition yields instead h i i ′ ′ ′ G̃> + bk e−ik(t−t ) + eα̃k eik(2t0 −t−t ) k (t,t ) = 2k ′ ′ +bk eik(t−t ) + e−α̃k e−ik(2t0 −t−t ) h i i ′ ′ ′ G̃< (t,t ) = + b eik(t−t ) + eα̃k eik(2t0 −t−t ) k k 2k ′ ′ +bk e−ik(t−t ) + e−α̃k e−ik(2t0 −t−t ) (3.14) where we have defined the Minkowski space initial state structure function by eα̃k ≡ k−κ . k+κ (3.15) Sending κ → k sends eα̃k → 0 so we should recover the vacuum propagator in this limit. This requirement fixes the remaining ambiguity in a propagator consistent with a general initial state by setting bk = 0,4 G̃k (t,t ′ ) = Θ(t − t ′ ) (3.9) To avoid any possible confusion with the ωk defined earlier, we have set the mass to zero for this example. Applying all three conditions gives h i i ′ ′ −ik(t−t ′ ) G̃> (t,t ) = + bk eik(t−t ) + a k e k 2k ′ ′ +ck e−ik(t+t ) + dk eik(t+t ) h i i ′ ′ −ik(t−t ′ ) G̃< + + bk eik(t−t ) k (t,t ) = ak e 2k ′ ′ +ck e−ik(t+t ) + dk eik(t+t ) . (3.10) ∂t G̃Ek (t,t ′ ) ∂t G̃k (t,t ′ ) (3.7) it should satisfy the correct discontinuity in its first derivative to produce a correctly weighted point-source, > ′ ′ (3.8) ∂t G̃k (t,t ) − ∂t G̃< k (t,t ) t ′ =t = 1, h d2 The remaining condition we impose is that the propagator should be time-translationally invariant so that it should only depend on t − t ′ and not t + t ′ since the boundary condition in Eq. (3.11) does not itself break time translation invariance. This last condition requires dk = 0 so that the standard Minkowski space propagator is obtained. Now consider a boundary condition in Minkowski space which explicitly breaks the time-translation invariance, + i −ik(t−t ′ ) i ′ e + Θ(t ′ − t) eik(t−t ) 2k 2k i α̃k ik(2t0 −t−t ′ ) e e . 2k (3.16) Sometimes it is convenient to write this propagator in a more suggestive form by defining an image time, tI ≡ 2t0 − t, ′ i −ik(t−t ′ ) i e + Θ(t ′ − t) eik(t−t ) 2k 2k ′ i +Θ(tI − t ′ ) eα̃k e−ik(tI −t ) 2k i ′ +Θ(t ′ − tI ) eα̃k eik(tI −t ) . (3.17) 2k G̃k (t,t ′ ) = Θ(t − t ′ ) Since the theory is only applicable to the region subsequent to the initial surface, t,t ′ > t0 this propagator always agrees with Eq. (3.16) in this region and is moreover still consistent with the boundary condition in Eq. (3.13). In this form, the initial state propagator contains two sources, one for the physics (3.11) we very nearly obtain the correct Feynman propagator, ′ i −ik(t−t ′ ) e + dk eik(t+t ) 2k i ik(t−t ′ ) ′ E,< ′ G̃k (t,t ) = e + dk eik(t+t ) . 2k 4 ′ G̃E,> k (t,t ) = (3.12) Actually, we could have a coefficient bk that only diminishes faster than eα̃k , but we exclude this possibility by demanding that a perturbation theory based on our propagator should be free of uncontrolled divergences, such as those [8, 26, 27, 28, 29] occurring in the α-vacua of de Sitter space [30], which would occur for nonzero values of bk . 8 point-source and one for a fictitious image source which encodes the initial state information. Returning to a general expanding background, if we impose the analogous conditions on the propagator—continuity, an appropriate jump in its first derivative for a point source, consistency with the Klein-Gordon equation—as well as a general initial condition, nµ ∇µ Gαk (η, η′ ) nµ ∇′µ Gαk (η, η′ ) η=η0 η′ >η0 η′ =η0 η>η0 = iϖ∗k Gαk (η0 , η′ ) = iϖ∗k Gαk (η, η0 ) η′ >η0 η>η0 , (3.18) then we obtain a unique propagator structure, − iGαk (η, η′ ) = Θ(η − η′ )UkE (η)UkE∗ (η′ ) + Θ(η′ − η)UkE∗ (η)UkE (η′ ) + eαk UkE (η)UkE (η′ ), ∗ (3.19) which matches with the Bunch-Davies propagator in the limit where the initial state structure function vanishes. The origin of this propagator has an elegant interpretation as the generalization of the time-ordering in the usual vacuum propagator. Recall that in Minkowski space the time-ordering produces the forward propagation of positive frequencies and the backward propagation of negative frequencies. If we start with a different initial state, we must also account for propagation of the initial state information as well. This propagation from the boundary should contain only the forward propagation of one set of modes since the backward propagation would be into the region before η0 . Thus we have only the UkE (η)UkE (η′ ) term in the propagator and not its complex conjugate, which would represent the backwards propagating negative frequency modes associated with the initial state. This interpretation becomes clearer when we rewrite the propagator as − iGαk (η, η′ ) ∝ Θ(η − η′)UkE (η)Uk∗ (η′ ) + Θ(η′ − η)Uk∗ (η)UkE (η′ ). (3.20) Its asymmetric form reflects the fact that signals from the point-source propagate both forward and backwards in time— depending upon the energy of the modes—while the boundary effects only propagate forward, since we never evaluate times earlier than the time at which the conditions are imposed. IV. THE RENORMALIZATION CONDITION In an interacting field theory it is extremely rare that the evolution of a system can be solved exactly, even in flat space, and what is done instead is to describe processes perturbatively. If successive perturbative corrections are sufficiently convergent, then the theory can be predictive as long as the experimental error is larger than the error we make in truncating the series. The energies and the momenta in all parts of the leading term in this series are usually completely finite, being fixed by the actual momenta of the external, physical fields being measured. However, in all the higher corrections appear intermediate processes where a field ranges over all possible momenta, including arbitrarily large ones. Summing over this arbitrarily large momentum behavior, or equivalently the short-distance features of the theory, can produce divergences. But since these divergences are constant, not depending on any measurable quantity, they can be absorbed by a suitable rescaling of the parameters of the theory and in terms of these renormalized quantities, the terms of the perturbation series are finite at each order. In the process, we lose the idea of constant parameters—masses or couplings—and the renormalized parameters depend now on the scale at which they are defined. This scale dependence is not arbitrary, but is is fixed by renormalization conditions which express how or at what scale a particular physical parameter is defined. When a field starts in an initial state which contains some structure at short distances that differs from the naı̈ve idea of a vacuum based on extrapolating the free theory to arbitrary scales, the intermediate processes in the perturbation series also sum over all of this short-distance structure of the initial state which can produce divergences in addition to those associated with the properties of a field propagating through the bulk of space-time. Since they are associated with the structure of the initial state, this class of divergences is localized at the initial time [1] and they are removed by renormalizing the theory at this initial boundary. Once renormalized there, the theory remains finite for all subsequent times. Establishing that these state-dependent divergences can be renormalized is important even if we are only interested initially in evaluating the leading term in the perturbative expansion, where all intermediate momenta are finite. Partially this importance lies in the fact that this leading result only has any meaning if the corrections to it are finite and are small compared to it. But further, even the detailed form of the leading result depends on how the theory propagates forward the information contained in the initial state and whether this propagation is consistent is what is being checked when we renormalize the perturbative corrections. Thus knowing how to renormalize the higher order corrections—for example in the two-point function—tells us also the correct form for the leading, tree-level prediction for the trans-Planckian signature in the cosmic microwave background. For the S-matrix in Minkowski space, a standard set of renormalization conditions is applied, such as the vanishing of the one-particle expectation value or the location and the residue of the pole associated with the physical mass of the particle. During inflation, the physical setting differs quite dramatically from that assumed for the S-matrix so the form of the renormalization conditions must be modified appropriately. A typical inflationary model contains a scalar field, the inflaton, which we divide into a spatially independent classical zero mode φ(η) and a fluctuation ψ(η,~x) about this value, ϕ(η,~x) = φ(η) + ψ(η,~x). (4.1) The zero mode drives the overall inflationary era and its value changes slowly as the field rolls down its potential. The fluctuations, combined with the scalar component of the fluctuations of the gravitational background, produce the pattern of nearly scale-invariant, nearly Gaussian primordial perturba- 9 1 a2 (H ′ + 2H 2 ) m2 + 21 λφ2 − 2 k2 + a2 m2 + 21 λφ2 tions that seed the density perturbations seen both in the temperature fluctuations, observed by WMAP, and the observed large-scale structure, before nonlinear dynamics have set in. Because the zero mode corresponds to the classical expectation value of the field, the vanishing of the expectation value of the fluctuations provides a natural renormalization condition for this setting [31, 32], hαk (η)|ψ(x)|αk (η)i = 0. S= Z √ 1 d 4 x −g 12 gµν ∇µ ϕ∇ν ϕ − 12 ξRϕ2 − 12 m2 ϕ2 − 24 λϕ4 . (4.3) Decomposing the full field into its zero mode and fluctuating components yields S = Sφ + Sψ0 + Sψint (4.4) where Sφ is the classical action for the zero mode obtained by setting ϕ → φ in Eq. (4.3) and Sψ0 and Sψint are the free and the interacting parts of the action for the fluctuations, Sψ0 = Z and Sψint = √ 1 Rψ2 d 4 x −g 21 ∇µ ψ∇µ ψ − 12 − 12 m2 + 12 λφ2 + ξ − 61 R ψ2 Z λ φφ′′ + φ′2 + 4Hφφ′ 4 k2 + a2 m2 + 12 λφ2 2 5 a4 H m2 + 12 λφ2 + 12 λφφ′ + 2 . (4.8) 4 k2 + a2 m2 + 1 λφ2 − 2 (4.2) As we shall see below, because the fluctuation couples to the classical zero mode, this tadpole condition will allow us to define the renormalized mass and the coupling by refering to their values for the zero mode. The method for constructing an effective description of a general initial condition described earlier applies to any interacting field theory so we shall illustrate the boundary renormalization with the relatively simple example provided by a quartic interaction, However at very short distances, where the divergences in loop corrections occur, k2 ≫ a2 m2 , a2 λφ2 , while the second order correction is now h 1i (2) [Ωk (η)]2 = ξ − a2 R 6 (4.9) but with M 2 (η) now given by M 2 (η) ≡ m2 + 21 λφ2 + ξ − 61 R. (4.11) In the interaction picture, the free Hamiltonian determines the evolution of operators while the interaction Hamiltonian, HI , determines the corresponding evolution of the states. When the operator consists of a product of fields, the evolution is already contained in their time-dependence so the evolution of the tadpole from an initial state defined at η = η0 to a later time η f is given in the Schwinger-Keldysh approach5 by hαk (η f )|ψ+ (η f ,~x)|αk (η f )i = hαk |Tα ψ+ (η f ,~x)e hαk |Tα e 4 Notice that in expanding the interaction λϕ4 , we find a term that is quadratic in the fluctuation, λφ2 ψ2 , which acts as a time-dependent correction to the mass. Therefore we have not included this term among the interactions but rather have grouped it with the other quadratic terms to form a time-dependent effective mass term m2 → m2 + 21 λφ2 (η). We would have found the same shift had we treated this effect as an interaction once we summed the entire set of all possible insertions of this term in the free ψ propagator. The contribution of the zero mode to the effective mass similarly shifts the form of the adiabatic modes. For example, the zeroth order adiabatic mode becomes, (0) (4.7) [Ωk (η)]2 = k2 + a2(η) m2 + 12 λφ2 (η) , λ(φφ′′ + φ′2 ) , R the shift in the effective mass simply appears as a corresponding shift in the generalized mass defined earlier. Thus the leading short-distance part of the generalized frequency to second order in the adiabatic expansion is still q (4.10) Ωk (η) = k2 + a2(η)M 2 (η) + · · · (4.5) √ d x −g − ∇2 φ + ξRφ + m2φ + 61 λφ3 ψ 1 (4.6) − 16 λφψ3 − 24 λψ4 . −i R0 R −i η0 dη [HI [ψ+ ]−HI [ψ− ]] |αk i 0 + − η0 dη [HI [ψ ]−HI [ψ ]] (4.12) |αk i where |αk i = |αk (η0 )i. Since both the |αk i state and hαk | state have been time-evolved, we have two parts of the time evolution operator corresponding to each of these components. The “+” fields are associated with the former while the “−” fields are associated with the latter and the relative minus sign between the two appearances of the interaction Hamiltonian is from the Hermitian conjugation of the unitary operator evolving the hαk | state. From the action in Eq. (4.6), the interaction Hamiltonian for a simple quartic theory is HI [ψ± ] = 5 Z d 3~y a4 ∇2 φ + ξRφ + m2φ + 16 λφ3 ψ± 1 + 61 λφψ±3 + 24 λψ±4 . (4.13) A more detailed explanation of the Schwinger-Keldysh approach applied to this setting is given in the Appendix A of [1] which defines the notation used here and explains the correct time-ordering for contractions of any combination of ψ± (x) fields. 10 :::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::::::::: ::::::::::::::: x + y ::::::::::::::::::::::: ::::::: :::: :::: :: :: ::: : : : : : : ::: :: ::: :: : :::: : : :::::::: ::::::::::: : : :: :::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::: x x = y 2 + 2H + ξR + m2 ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: x y ::: + 1 λ ::::::::::::::::::: 6 ::::::::::::: :::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::: ::::: ::::::: :::: :::::: :::::: x y FIG. 2: The shaded blob corresponds to the following two graphs; note that the time derivatives act on the classical φ(η) field. which yields the following leading contribution to the tadpole matrix element, 0 = hαk (η f )|ψ+ (x)|αk (η f )i Z ηf Z < = − dη a4 (η) d 3~y G> 0 (x, y) − G0 (x, y) η0 n λ × ∇2 φ(η) + ξR(η)φ(η) + m2 φ(η) + φ3 (η) 6 o iλ (4.14) − φ(η)G> α (y, y) + · · · 2 where x = (η f ,~x), y = (η,~y) and z = functions, G>,< α (x, y), are defined by G> α (y, z) = i Z (η′ ,~z). The Wightman d 3~k i~k·(~y−~z) e (2π)3 i h ∗ × UkE (η)UkE∗ (η′ ) + eαk UkE (η)UkE (η′ ) G< α (y, z) = i Z d 3~k i~k·(~y−~z) e (2π)3 i h ∗ × UkE∗ (η)UkE (η′ ) + eαk UkE (η)UkE (η′ ) . (4.15) The Wightman functions labeled with a 0 subscript corre∗ spond to those obtained by setting eαk = 0. They appear > here since in taking the difference Gα (x, y) − G< α (x, y), the boundary-dependent terms of each function are the same and cancel each other. Diagrammatically, the leading contribution to the tadpole is shown in Figs. 1–2 where a ψ propagator is represented by a solid line and a dashed line represents the zero mode φ. At this order we shall encounter divergences which require the renormalization of the mass and the coupling of the field, but the leading contribution to the renormalization of the field only appears at two-loop-order through the last diagram shown in Fig. 3. y x y FIG. 1: The leading contributions to the running of the mass m and the coupling λ in a ϕ4 theory. The solid lines represent propagating ψ fields while the dashed lines correspond to the zero mode φ. :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::::: ::::: ::::::::: :::::::::: :: :::: :::: :: ::: ::: : ::::::::::::::::: ::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::::::::: + z ::::::::::::::::::::::: :::: ::: :::: : :: ::: :::: ::::::::::::::::::::::: : : : : : : : : : ::: :: : : : : :: :: :: :: ::: : :::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: z + y x + ::::: :::: :::::::: :::::::::: ::::::::: :::::::::::: :: ::::: :::: :: :::: :: :: ::: ::: ::: :::: : : :::: : : : :::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: x y z :::::::::::::::::::::::: :::: ::::: :: :::: :: :: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: ::: : ::: :: : : ::::: : :::::::::::::::::::::::::::: x y z FIG. 3: Further graphs that contribute to the tadpole at second order. The last of these graphs contains the leading nontrivial correction to the wavefunction renormalization. Substituting the expressions for the Green’s functions in terms of the modes in Eq. (4.15) into the tadpole yields hαk (η f )|ψ+ (x)|αk (η f )i = − Z ηf η0 dη a4 (η)G (η f , η) n λ × ∇2 φ(η) + ξRφ(η) + m2 φ(η) + φ3 (η) 6 Z λ d 3~k E + φ(η) U (η)UkE∗ (η) 2 (2π)3 k Z o d 3~k α∗ E λ k U (η)U E (η) + · · · (4.16) e + φ(η) k k 2 (2π)3 where the external ψ leg has been abbreviated by G (η f , η) ≡ i U0E (η f )U0E∗ (η) − U0E∗ (η f )U0E (η) . (4.17) The external leg is independent of the initial state. The first line within the braces in Eq. (4.16) is the equation of motion for the zero mode before we have included the corrections from its interactions with the fluctuations. At leading order, the tadpole contains two terms with loop integrals, which appear in the last two lines of Eq. (4.16). The first of these contains no dependence on the initial state; it produces the need to renormalize the mass and the coupling of the bulk 3 + 1 dimensional field theory, as is shown in the next section. In the second, the initial state structure function appears explicitly so all of the new divergences associated with the fine structure of the initial state arise at this order from this term. Because of its importance in establishing the renormalizability of an effective description of a state bearing some trans-Planckian information, we shall treat it in much more detail separately, after first examining how the standard bulk renormalization proceeds in this inflationary setting. V. BULK RENORMALIZATION Through the coupling of the fluctuations to the zero mode, demanding that the one-point function for the fluctuations should vanish provides a simple and elegant origin for the renormalization and running of the bulk parameters: m, λ, ξ. 11 Recall that the bare theory we have been considering, L = 21 gµν ∇µ ϕ∇ν ϕ − 21 ξRϕ2 − 12 m2 ϕ2 − 241 λϕ4 , (5.1) can be also expressed in terms of the renormalized parameters by L = 2 4 1 µν 1 1 2 2 1 2 g ∇µ ϕR ∇ν ϕR − 2 ξR RϕR − 2 m ϕR − 24 λR ϕR + 21 (Z3 − 1)gµν∇µ ϕR ∇ν ϕR − 21 (Zξ − 1)ξR Rϕ2R 1 (Z1 − 1)λRϕ4R . (5.2) − 12 (Z0 − 1)m2Rϕ2R − 24 The perturbative corrections to a Green’s function calculated from the first line of this renormalized Lagrangian are very frequently divergent; these divergences arise from internal loops which sum over all possible momenta and contain a sufficiently small number of propagators. Since these divergences appear only in the infinite momentum region of the loop integrals, they equivalently correspond to short-distance divergences and therefore can be cancelled by local counterterms of the form shown in the second two lines of Eq. (5.2). Choosing these counterterms correctly, the total value of the perturbative corrections at a given order is completely finite. The bare and renormalized parameters are related to each other through a simple rescaling, Zξ ξR Z3 Z1 λ = 2 λR . Z3 1/2 ϕ = Z3 ϕR m2 = ξ= Z0 2 m Z3 R (5.3) To leading order in the coupling λ, we only need to determine Z0 , Z1 and Zξ since the first correction to Z3 only appears at two loop order, λ2 . To control the divergences that appear in the perturbative corrections to a process, we must apply a regularization scheme such as dimensional regularization, which we use here. The bulk divergences occur at one loop order in the term λ φ(η) 2 Z d 3~k E U (η)UkE∗ (η) (2π)3 k (5.4) appearing in the integrand of the tadpole in Eq. (4.16); here we have excised the external leg. Substituting in the general form for the Bunch-Davies modes, UkE (η) = Rη ′ ′ e−i η0 dη Ωk (η ) p , a(η) 2Ωk (η) (5.5) and applying the adiabatic approximation to second order to extract the leading behavior as k → ∞ as in Eqs. (4.10–4.11), the divergent part of this loop is contained in λ φ(η) 4 a2 (η) Z 1 d 3~k p + · · ·. 3 (2π) k2 + a2(η)M 2 (η) (5.6) By extending the number of spatial dimensions to 3 − 2ε, λµ2ε φ(η) 4 a2 (η) Z 1 d 3−2ε~k p , 3 2 2 (2π) k + a (η)M 2 (η) (5.7) we can perform this loop integral, extracting the pole and the dependence on the renormalization scale µ, λ 4πµ2 1 2 (5.8) − φ(η)M (η) + 1 − γ + ln 2 2 . 32π2 ε a M Adding this result to the other boundary-independent terms that appear in the tadpole integrand in Eq. (4.16) and substituting in the expression for the effective mass M from Eq. (4.11) produces the following bulk effect corrected to first order, hαk (η f )|ψ+ (x)|αk (η f )i Z ηf dη a4 (η) U0E (η f )U0E∗ (η) − U0E∗(η f )U0E (η) = −i η0 λ 1 4πµ2 2 2 × ∇ φ+m φ 1− + 1 − γ + ln 32π2 ε M2 2 λ 3λ 1 4πµ + φ3 1 − + 1 − γ + ln 2 6 32π ε M2 λ 1 1 4πµ2 +Rφ ξ − ξ − + 1 − γ + ln 32π2 6 ε M2 +··· . (5.9) In this expression we have omitted finite contributions which do not affect the renormalization or the running and we have also not written any of the terms that depend explicitly on the choice of the initial state, since these effects are discussed separately in the next section. To fix the scale-independent part of the rescalings from the bare to the renormalized parameters, we apply the standard MS prescription which sets Z0 = 1 + i λ h1 + 1 − γ + ln4π + ···, 32π2 ε (5.10) i 3λ h 1 − γ + ln4π + ··· 32π2 ε (5.11) Z1 = 1 + and Zξ = 1 − i 1 1 λ h1 ξ+ − γ + ln4π + · · · 2 ξ 6 32π ε (5.12) to leading nontrivial order in the λ. In terms of the renormalized parameters then, the tadpole is given by R hαRk (η f )|ψ+ R (x)|αk (η f )i Z ηf = −i dη a4 (η) U0E (η f )U0E∗ (η) − U0E∗(η f )U0E (η) η0 n h λR µ i × ∇2 φ(η) + m2R φ(η) 1 − ln 2 16π M R (η) i λR 3 h µ 3λR + φ (η) 1 − ln 6 16π2 M R (η) h 1 λR µ i +R(η)φ(η) ξR − ξR − ln 6 16π2 M R (η) o +··· (5.13) 12 again up to finite and boundary-dependent contributions. Despite the added complexity inherent in an inflationary environment, the cancellation of the bulk divergences has proceeded exactly as in a Minkowski space S-matrix calculation. The reason for this simplicity is that although both the background and the initial state are nontrivial, neither has any effect on the bulk renormalization. The divergences occur at infinitesimally short distances where the background curvature is not apparent, except as a classical source such as in the renormalization of the coupling of the field to the curvature. Further, once we are sufficiently far from the boundary, how the field propagates through the bulk should be insensitive to the short-distance structure of that boundary. Therefore, although there can be new divergences associated with the initial state, they are completely disjoint from the bulk divergences. The renormalization scale µ is an artifact of our ignorance of the true, bare theory. If we had calculated a matrix element solely in terms of the properties of the bare theory, the result would be completely independent of this scale, µ d hαk (η f )|ψ(x)|αk (η f )i = 0. dµ (5.14) But as the matrix element calculated in either the bare or the renormalized theory is exactly the same, up to a factor of the wave function rescaling, 1/2 hαk (η f )|ψ(x)|αk (η f )i = Z3 hαRk (η f )|ψR (x)|αRk (η f )i (5.15) we have a similar equation for the scale dependence of the matrix element of the renormalized theory, d µ + γ(λR ) hαRk (η f )|ψR (x)|αRk (η f )i = 0, (5.16) dµ where γ(λR ) is the anomalous dimension of the field, 1 d γ(λR ) = µ ln Z3 . 2 dµ dλR , dµ µ dmR , mR dµ dξR , dµ (5.18) in terms of which we obtain the Callan-Symanzik equation, ∂ ∂ ∂ µ + β(λR) + mR γm (λR ) ∂µ ∂λR ∂mR ∂ + γ(λR ) + · · · hαRk (η f )|ψ(x)|αRk (η f )i = 0 +βξ (λR ) ∂ξR (5.19) γm (λR ) = 3λR + · · ·, 16π2 λR γm (λR ) = + · · ·, 32π2 h 1 i λR + · · ·. βξ (λR ) = ξR − 6 16π2 β(λR ) = (5.20) Notice that the coupling to the curvature does not run at this order when the field is conformally coupled, i.e. ξR = 61 . The Callan-Symanzik equation depends upon the running of both the bulk parameters and those describing the boundary since once we have imposed a renormalization condition there exists a single renormalization scale µ in the theory. The fact that there is a unique scale for both, rather than separate scales, is particularly clear from a Wilsonian perspective [33]. In the functional integral description of the theory [1], the generating functional contains information about the bulk theory, in the action, and the initial state, in the source term. To understand how the short-distance physics affects what we mean by a particular parameter, whether for the bulk physics or the initial state, we start with theory defined up to a cutoff, Λ, which is equivalent to truncating the functional integral for modes above this scale. We can then determine how the parameters flow as we alter the cutoff scale from Λ to Λ′ (< Λ) by integrating out the field modes between these scales. The resulting effective theory based on the cutoff Λ′ will have its bulk parameters affected in the usual way. But the form of our effective description of the initial state will also be shifted as well, since how accurately we can describe the features of the initial state also depends on the field modes available in our functional integral. Thus only one quantity, whether Λ′ /Λ for a cutoff theory or µ for a dimensionally regularized theory, is needed to describe the running of both the bulk and boundary effects since the origin of this running is common to both aspects of theory. (5.17) All the renormalized parameters depend upon the renormalization scale so we introduce the usual functions that describe their running, β(λR ) = µ have the standard running of the bulk parameters at leading order, βξ (λR ) = µ up to corrections which do depend on the boundary conditions. From the expression for the renormalized tadpole, we VI. BOUNDARY RENORMALIZATION Thus far, we have written the structure function for the ini∗ tial state, eαk , without specifying how it varies with the momentum. We shall require a more detailed form to show under what conditions a general initial state, when renormalized, produces a theory with finite perturbative corrections. If this renormalization is genuinely associated only with the details of the initial state, then it should be sufficient to apply the renormalization only at the initial time and the theory should remain finite at all subsequent times. This reasoning suggests that if we regard the rescaling of the initial state as the appropriate addition of counterterms, chosen to cancel the divergences at the initial time, then these counterterms should be expressed through a three dimensional boundary action, Sη=η0 = Z √ d 3~x −hL 3d (ϕ). (6.1) 13 h denotes the determinant of the induced metric hµν on the η = η0 boundary. Using again the time-like normal vector nµ that is orthogonal to the boundary, the induced metric is obtained from the full metric by hµν = gµν − nµ nν . (6.2) For the conformally flat metric, hµν dxµ dxν = −a2 (η) d~x · d~x (6.3) √ and −h = a3 (η). From the induced metric we can construct an induced scalar curvature, which we denote by R̂, while the extrinsic curvature tensor is defined by projecting the covariant derivative of the normal back onto the initial surface, Kµν = hµλ ∇λ nν . (6.4) This tensor as well as its trace, K = hµν Kµν , provide additional dimension 1 ingredients out of which we can construct operators for the boundary action. For an expanding background, it is proportional to the Hubble parameter, K(η) = 3H(η) . a(η) (6.5) The operators contained in the counterterm Lagrangian are classified according to whether their mass dimension is greater or less than, or equal to, the dimension of the boundary, but the fields inherit their scaling dimension from the full 3 + 1 dimensional theory. Thus, for example, in our scalar theory with a ϕ ↔ −ϕ symmetry, there is a unique relevant operator of dimension 2, ϕ2 , (6.6) while there are two marginal operators, ϕ∇n ϕ, Kϕ2 . (6.7) Compared with a flat background, the number of higher dimensional operators rapidly proliferates since we can also build counterterms with curvature-dependent objects such as the extrinsic curvature or the induced curvature. For example, the complete set of irrelevant, dimension 4 operators consists of ϕ4 , ϕ∇2n ϕ, (∇n ϕ)2 , ~∇ϕ · ~∇ϕ, K 2 ϕ2 , K µν Kµν ϕ2 , Kϕ∇n ϕ, (∇n K)ϕ2 , R̂ϕ2 . (6.8) For particular, highly symmetric backgrounds, only a subset of this list might be required. Only the first four are needed in flat space and the complete list for de Sitter space is ϕ4 , ϕ∇2n ϕ, (∇n ϕ)2 , ~∇ϕ · ~∇ϕ, K 2 ϕ2 , Kϕ∇n ϕ. (6.9) The idea then is to label specific aspects of the initial state by the type of counterterms associated with them so that a particular piece of the boundary condition is relevant, marginal or irrelevant according to the dimension of the boundary counterterms needed to remove the divergences it produces. Alternatively, a boundary condition can also be characterized as renormalizable or nonrenormalizable according to whether its difference from the vacuum state diminishes or grows at shorter and shorter distances. The nonrenormalizable initial states are not necessarily nonpredictive—they can be understood as an effective description of the state. For example, if we associate a heavy mass scale M with these effects, it generically will require an infinite set of dimension n > 3 counterterms to render the theory finite. However, since these terms are suppressed by factors of (H/M)n−3 , as long as H/M ≪ 1 only a very small subset of the parameters describing a general nonrenormalizable initial state are required in practice. We shall describe a general initial state by an expansion in the generalized frequency, Ωk (t0 ), evaluated at the initial surface, eα k = ∗ ∞ H n (η0 ) ∞ Ωn (η0 ) ∑ dn Ωn (η0 ) + ∑ cn an(ηk 0 )Mn . n=0 k (6.10) n=1 These two series are respectively associated with the infrared and ultraviolet aspects of the initial state. We have chosen an expansion in the frequency, rather than just the momentum k, since this quantity has several useful properties. It is finite, although of a complicated form, in the k → 0 limit and it has a natural transition in its behavior at k ∼ H(η0 ). Above this scale Ωk (η0 ) → k, while below this scale the curvaturedependent effects become dominant compared with its explicit momentum dependence. In the first series, we need a reasonable dynamical scale for separating the long from the short distances on our initial surface so we have chosen the expansion rate, H(η0 ), to set this scale. Since at extremely short distances, where Ωnk (η0 ) ∼ kn , these terms become diminishingly important once k ≫ H(η0 ). More generally, we should use a linear combination of m and H(η0 ) in the numerators of this expansion to obtain the a quantity which does not vanish entirely in the Minkowski space limit, but in slowly rolling models of inflation m ≪ H(η) so we shall neglect the mass in this expansion. Notice that in the extreme ultraviolet limit, k → ∞, all of the terms in this series vanish except for the marginal term, d0 . Thus the first series describes features of the state which are important at long distances and can be viewed as some nonvacuum ensemble since in this regime the idea of the vacuum defined with respect to the free Klein-Gordon equation holds very well. The signals of trans-Planckian physics lies properly in the second series in Eq. (6.10). Unlike the terms in the first series, these grow in importance at extremely short distances. Assuming that the scale of new physics is above the Hubble scale, M ≫ H(η0 )/a(η0 ), these terms are essentially a series ∗ in k/M. Since the structure function eαk accompanies the initial state contribution to the propagator and to the modes, it might seem that these growing terms do not describe a sensible state at extremely short distances. But if we are measuring some process at a scale much lower than M, the contributions from the nth term in this series is naturally suppressed by the nth power of the ratio of the scale we measure to M. Of course, loop corrections sample over all momenta, so the large difference between the state and the extrapolated vacuum will produce divergences in these corrections; but since these divergences occur from the large momentum—short distance— 14 part of the loop integral evaluated at the initial time, they can always be cancelled by adding the appropriate local counterterms on the initial boundary. In [1] we showed that the divergences from the new initial condition only appear at the initial-time surface, η = η0 . More precisely, the divergences only occur in a boundary loop correction when the term is simultaneously evaluated at η = η0 and we sum over arbitrarily large values of the loop momenta. Since the Bunch-Davies state matches with the flat space vacuum states at large values of the momenta, the theory in a curved background inherits exactly the same divergence structure from the short-distance features of the initial state as flat space. The only new feature is that the invariants associated with the curvature allow for a richer family of boundary counterterms, as was seen in Eq. (6.8). The prescription for extracting the short-distance boundary divergences is then as follows. We first define a family of kernel functions, K (p) (η) ≡ Z Rη d 3~k e−2i η0 dη Ωk (η ) , (2π)2 Ω3−p(η)Ω p (η0 ) k k ′ ′ λ 2 Z ηf η0 Z ηf η0 eα k = ∗ ∞ H n (η0 ) ∑ dn Ωn (η0 ) , n=0 Z (6.12) k we can obtain a sense of which terms will require renormalization at the initial boundary by expanding the initial condition on the modes in Eq. (2.25) at η = η0 and looking at the short distance, k → ∞, limit, 1 ∂ Uk (η0 ) = −iϖkUk (η0 ) a(η0 ) ∂η ≈ −ikUk (η0 ) 2d0 2H(η0 )d1 +i Uk (η0 ). k+ 1 + d0 (1 + d0)2 2 H (η0 ) Uk (η0 ). (6.13) +O k At momenta well above the Hubble scale, k ≫ H(η0 ), all the terms aside from d0 and d1 have no effect. The leading contribution to the tadpole that depends on the details of the initial state appears in the last term included in Eq. (4.16), λ 2 Z ηf d 3~k α∗ E e k Uk (η)UkE (η). (2π)3 η0 (6.14) In terms of the adiabatic modes and the leading terms of our ∗ expansion for eαk given in Eq. (6.12), this contribution becomes − dt a4 (η)G (η f , η)φ(η) Z d 3~k α∗ E e k Uk (η)UkE (η) (2π)3 Rη Rη ′ ′ ′ ′ Z Z d 3~k e−2i η0 dη Ωk (η ) d 3~k e−2i η0 dη Ωk (η ) 2 dη a (η)G (η f , η)φ(η) d0 + d1H(η0 ) (2π)3 Ωk (η) (2π)3 Ωk (η)Ωk (η0 ) dt a4 (η)G (η f , η)φ(η) λ = − 4 Renormalizable boundary conditions correspond to those which may differ substantially from the Bunch-Davies vacuum at long distances but which become indistinguishable from this vacuum at very short distances. These states therefore do not generally contain information about transPlanckian physics and resemble some excited ensemble, from the perspective of the low energy, weakly interacting theory. Despite this simplicity at short distances, in a few cases where the difference between the initial and the Bunch-Davies states does not diminish sufficiently fast, these initial states can produce new divergences isolated at the initial time. The renormalizable states thus provide the simplest example of the boundary renormalization which is necessary for a general initial state and which results in a controlled, finite perturbative description of the interacting theory. If we restrict to an initial condition described only by negative powers of the generalized frequency, Ωk (η0 ), Eq. (6.10), (6.11) which arise from inserting the expansion of the initial state structure function in Eq. (6.10) into a loop integral. In this kernel, the terms evaluated at η0 are associated with the series expansion for the initial state and the remaining η-dependent factors are associated with the loop propagators. These kernel functions have been constructed so that they only contain a mild—integrable—logarithmic singularity at η = η0 . The loop integrals do not always appear exactly in the form of one of these kernels, but they can always be expressed in terms of some nth derivative of them, up to explicitly finite terms. By then integrating these kernel-derivatives by parts n times, we obtain a finite term where K (p) (η) appears in the dη-integrand as well as a set of boundary terms, some evaluated at η f and some at η0 . The former are finite while the latter are divergent and can be regularized using dimensional regularization. The resulting set of divergent terms determines the set of boundary counterterms that we should add to the theory to render it finite. The behavior of these kernels, their regularization and their derivatives, is explained more fully in Appendix A. We begin in the next subsection first with an illustration of this procedure for the renormalizable initial state. Although the prescription applies to any type of initial state, it is simpler in the case of the renormalizable state since fewer integrations by parts are necessary. What we shall find is that for a theory with a ϕ4 interaction, only the first two terms (d0 and d1 ) require any renormalization and are associated with the renormalizable set of counterterms, {ϕ2 , ϕ∇n ϕ, Kϕ2 }. − A. Renormalizable boundary conditions 15 2 + d2 H (η0 ) The terms associated with higher moments are associated with loop integrals that are manifestly finite. For the nth term in the moment expansion, the large momentum region of the accompanying loop integral behaves as dn Z Rη d 3~k e−2i η0 dη Ωk (η ) dn = (2π)3 Ωk (η)Ωnk (η0 ) 2π2 +··· ′ ′ Z dk −2ik(η−η0 ) e n−1 k k→∞ (6.16) which is finite for n > 2. The cases n = 0, 1, 2 produce a quadratic pole, a simple pole and a logarithmic singularity at η = η0 , respectively [1]. Because of the remaining conformal time integration in Eq. (6.15), the logarithmic singularity actually only gives a finite contribution to the tadpole so that only the first two terms produce divergences, as was suggested by examining the short-distance structure of the initial condition. Therefore, for the rest of this section we shall set dn = 0 for n ≥ 2. − λ 2 Z ηf Z Rη ′ ′ d 3~k e−2i η0 dη Ωk (η ) + · · · . (2π)3 Ωk (η)Ω2k (η0 ) (6.15) To show that the divergences associated with the initial state are genuinely confined to the initial surface, let us reexpress the loop integrals as derivatives of the kernels defined earlier in Eq. (6.11). Just as in the case of the d2 term, these kernels are only logarithmically divergent in (η − η0 ), so that with an appropriate number of integrations by parts we can isolate the divergent pieces explicitly. Noting that Z Rη ′ Rη ′ d 3~k e−2i η0 dη Ωk (η ) 1 = − K (0) ′′ (η) + UV finite (6.17) 3 (2π) Ωk (η) 4 ′ and Z d 3~k e−2i η0 dη Ωk (η ) 1 = − K (1) ′ (η)+ UV finite (6.18) (2π)3 Ωk (η)Ωk (η0 ) 2i ′ and substituting these expressions into Eq. (6.15) yields, d 3~k α∗ E e k Uk (η)UkE (η) dt a4 (η)G (η f , η)φ(η) (2π)3 η0 λd0 = −a2 (η0 )G (η f , η0 )φ(η0 )K (0) ′ (η0 ) − a2(η f )∂η G (η f , η) η=η φ(η f )K (0) (η f ) f 16 Z η f 2 (0) (0) 2 2 +∂η a (η)G (η f , η)φ(η) η=η K (η0 ) + dη ∂η a (η)G (η f , η)φ(η) K (η) 0 η0 Z ηf 2 (1) λd1 2 (1) dη ∂η a (η)G (η f , η)φ(η) K (η) + · · · H(η0 ) a (η0 )G (η f , η0 )φ(η0 )K (η0 ) + − 8i η0 λd0 2 λd1 = ∂η a (η)G (η f , η)φ(η) η=η K (0) (η0 ) − H(η0 )a2 (η0 )G (η f , η0 )φ(η0 )K (1) (η0 ) + finite 0 16 8i Z (6.19) after integrating by parts and using that G (η f , η f ) = 0. The singular kernels can be regularized by extending the number of spatial dimensions to 3 − 2ε as has been done in Appendix A which allows us to extract the poles as ε → 0 as well as the dependence on the renormalization scale µ, − λ 2 Z ηf d 3~k α∗ E e k Uk (η)UkE (η) (2π)3 η0 4πµ2 1 λd0 2 a (η) G (η , η)φ(η) ∂ − γ + ln = f η η=η0 ε 64π2 a2 (η0 )M 2 (η0 ) 4πµ2 1 λd1 2 + finite. H(η )a (η ) G (η , η )φ(η ) − γ + ln − 0 0 f 0 0 32iπ2 ε a2 (η0 )M 2 (η0 ) dt a4 (η)G (η f , η)φ(η) Z These are the divergences associated with the “bare” initial condition and can be renormalized by adding the following counterterms to the interaction Hamiltonian HIc.t. = Z z0 δ′ (η − η0) ± φψ d ~y a (η) 2 a2 (η) 3 4 (6.20) 16 + z1 δ(η − η0) Kφψ± 2 a(η) (6.21) which vanish except at the initial boundary. We can equivalently regard these terms as the addition of the following threedimensional boundary action to the theory, Z √ d 3~y −h z0 nµ ∇µ φψ± − z1 − 32 z0 Kφψ± . S3d = 12 η0 (6.22) Note that we could have also included the dimension 2 counterterm, mφψ± , but its role is suppressed in an inflationary background where the extrinsic curvature term provides the dominant contribution. The leading contribution to the tadpole from these counterterms is R hαRk (t f )|ψ+ R (x)|αk (t f )i = · · · + 12 z0 ∂η a2 (η)G (η f , η)φ(η) η 0 − 12 z1 a3 (η0 )G (η f , η0 )φ(η0 )K(η0 ). (6.23) The inclusion of the boundary counterterms is the boundary analogue of scaling the bulk parameters which translates between the bare and the renormalized theories. Because of this role, the counterterms will properly contain two components, z0 = zε0 + ẑ0 (µ) z1 = zε1 + ẑ1 (µ), λR d0 + ··· 16π2 λR d1 β̂1 (λR ) = − + ···. 24iπ2 β̂0 (λR ) = − (6.28) Here we have used the Callan-Symanzik equation ∂ ∂ ∂ + mRγm (λR ) µ + β(λR) ∂µ ∂λR ∂mR ∂ ∂ ∂ +βξ (λR ) + γ(λR) + β̂0(λR ) + β̂1(λR ) ∂ξR ∂ẑ0 ∂ẑ1 R + · · · hαRk (η f )|ψ+ (6.29) R (x)|αk (η f )i = 0, which now includes the renormalizable initial condition parameters as well as the bulk parameters, to determine the running of the boundary conditions. The equation is not yet in its complete form since we have not included the nonrenormalizable initial effects, which we discuss next. B. Nonrenormalizable boundary conditions (6.24) representing the infinite scale-independent piece, which is fixed by the renormalization scheme, and a finite scaledependent piece, which is fixed by the overall scale independence of the matrix element. In the MS scheme, the scaleindependent part is fixed to cancel the pole and the usual finite artifacts of dimensional regularization, i h1 λ zε0 = − d − γ + ln4π 0 32π2 ε h1 i λ ε z1 = − d − γ + ln4π . (6.25) 1 48iπ2 ε The resulting contribution to the tadpole from those parts of the renormalized initial condition that depend on the renormalization scale is contained in the terms, hαRk (η f )|ψ+ (x)|αRk (η f )i R λR d0 µ 1 ẑ0 (µ) + ∂η a2 G (η f , η)φ η=η ln = 0 2 16π2 aM R 1 λR d1 µ − ẑ1 (µ) + ln a3 G (η f , η0 )Kφ η=η 0 2 24iπ2 aM R +···. (6.26) Among the many terms not explicitly written in this equation are all the other boundary-independent, µ-dependent contributions from the bulk, evaluated in Eq. (5.13). If we introduce boundary β-functions for ẑ0 and ẑ1 , d ẑ0 dµ d ẑ1 , β̂1 (λR ) = µ dµ then from Eq. (6.26) we have that β̂0 (λR ) = µ (6.27) The signals of trans-Planckian physics reside in the nonrenormalizable part of the initial state. Such an initial state is one which differs increasingly from the the vacuum state at shorter distances or which equivalently requires nonrenormalizable counterterms in the boundary action to render the theory finite. In terms of our expansion, these features of the initial state are those described by a series of positive powers of the generalized frequency, eα k = ∗ ∞ Ωn (η0 ) ∑ cn an(ηk 0 )Mn . (6.30) n=1 M, as usual, represents the scale at which some new dynamics becomes important. A great advantage of an effective description of an initial state is in its applicability to any idea for how the physics above the expansion scale might be modified. Different ideas—minimum lengths, modified uncertainty relations or new dispersion relations—can be distinguished by the values of their coefficients cn in the series of Eq. (6.30). Even more importantly for observations, since at best only the leading nonvanishing term is likely to be observable, the general trans-Planckian signal is determined by the single quantity c1 /M, assuming c1 is the first nonvanishing coefficient. This property allows us to extract a generic prediction for a transPlanckian signal in the cosmic microwave background which can then be contrasted with other ways for generating departures from the simple vacuum prediction, such as through modifications of the inflaton potential. Since our goal here is to show how renormalization proceeds for a nonrenormalizable state and that perturbative corrections remain small when H(η0 ) ≪ M, we shall examine 17 the renormalization of a state for the leading trans-Planckian effect with 1 a2 (η0 )m2 Ωk (η0 ) α∗k 1+ . (6.31) e = c1 a(η0 )M 2 Ω2k (η0 ) Since limk→0 Ωk (η0 ) 6= 0, the nonrenormalizable terms do not vanish at long distances so we have added an extra renormalizable term, scaling as 1/Ωk (η), to subtract some of this long distance behavior so that the condition represents a genuine trans-Planckian effect. The one loop contribution from this boundary condition to the tadpole is of the form hαk (η f )|ψ+ (x)|αk (η f )i Z ηf λ c1 = − dη a2 (η)G (η f , η)φ(η) 4 a(η0 )M η0 Rη ′ ′ Z 1 a2 (η0 )m2 Ωk (η0 )e−2i η0 dη Ωk (η ) d 3~k 1 + × (2π)3 2 Ω2k (η0 ) Ωk (η) +···. (6.32) ×a3 (η0 )G (η f , η0 )φ3 (η0 ) +···. These divergences are cancelled by adding the following counterterms to the interaction Hamiltonian, Z ′′ 4 c.t. 3 a (η) 1 z2 δ (η − η0 ) φψ± HI = d ~y a(η0 ) 2 M a2 (η) 1 z3 R(η)φψ± + δ(η − η0) ξ − M 6 1 z4 3 ± + (6.36) δ(η − η0 )φ ψ , 4M which can also be regarded as adding the following boundary action to the theory, Z √ 1 3 Sη=η0 = − d ~y −h z2 ∇2n (φψ± ) + 53 z2 K∇n (φψ± ) 2M + 23 z2 (∇n K)φψ± + 23 z2 K 2 φψ± 3 ± ± 1 1 +z3 ξ − 6 Rφψ + 2 z4 φ ψ . Introducing the kernel Rη ′ ′ η0 dη Ωk (η ) d 3~k Ωk (η0 )e−2i (2π)3 Ωk (η) (6.37) 1 (−1) ′′′ K (η) + UV finite 8i (6.33) to represent the divergent part of the first term in the loop integral and K (1) ′ (η) as before in Eq. (6.18) for the second, we integrate by parts until all the divergent contributions appear explicitly as term evaluated on the boundary, Z = hαk (η f )|ψ+ (x)|αk (η f )i λ c1 1 = ∂2 a2 (η)G (η f , η)φ(η) η K (−1) (η0 ) 0 32i M a(η0 ) η λ c1 a(η0 )G (η f , η0 )φ(η0 ) + 32i Mh i × K (−1) ′′ (η0 ) − 2a2(η0 )m2 K (1) (η0 ) +···. (6.35) (6.34) Here we have only retained the terms that would diverge if we performed the remaining loop integrations. Using the expressions for dimensionally regularized kernels from Appendix A, we find that hαk (η f )|ψ+ (x)|αk (η f )i λ c1 1 4πµ2 = − γ + ln 128iπ2 M ε a2 (η0 )M 2 (η0 ) 1 ∂2η a2 (η)G (η f , η)φ(η) η=η × 0 a(η0 ) 2 λ c1 1 4πµ + + 1 − γ + ln 64iπ2 M ε a2 (η0 )M 2 (η0 ) 1 φ(η0 )R(η0 ) ×a3(η0 )G (η f , η0 ) ξ − 6 λ 2 c1 1 4πµ2 + + 1 − γ + ln 2 128iπ2 M ε a (η0 )M 2 (η0 ) From either perspective, the contribution from these boundary counterterms to the tadpole calculation is R hαRk (η f )|ψ+ R (x)|αk (η f )i 1 z2 1 ∂2 a2 (η)G (η f , η)φ(η) η=η = − 0 2 M a(η0 ) η i h 1 z3 3 1 − a (η0 )G (η f , η0 ) ξ − φ(η0 )R(η0 ) 2M 6 1 z4 3 3 a (η0 )G (η f , η0 )φ (η0 ) − 4M +···. (6.38) As before, the coefficients of the counterterms can be broken into a scale-independent infinite part and a scale-dependent finite part, z2 = zε2 + ẑ2 (µ), z3 = zε3 + ẑ3 (µ), z4 = zε4 + ẑ4 (µ), (6.39) where the infinite part is completely fixed by a renormalization scheme such as the MS scheme, λc1 1 − γ + ln4π zε2 = 64iπ2 ε λc1 1 zε3 = + 1 − γ + ln4π 32iπ2 ε λ 2 c1 1 ε z4 = + 1 − γ + ln4π , (6.40) 32iπ2 ε while the finite part is determined by the Callan-Symanzik equation, ∂ ∂ ∂ ∂ + mR γm (λR ) + βξ (λR ) 0 = µ + β(λR) ∂µ ∂λR ∂mR ∂ξR ∞ ∂ R hαRk (η f )|ψ+ +γ(λR ) + ∑ β̂n (λR ) R (x)|αk (η f )i, ∂ẑ n n=0 (6.41) 18 which has been finally expressed in its complete form. The sum has been written with an infinite number of boundary βfunctions to represent the possibility of including higher order nonrenormalizable initial conditions. R hαRk (η f )|ψ+ R (x)|αk (η f )i 1 µ λ R c1 1 1 ẑ2 (µ) − ln ∂2η a2 (η)G (η f , η)φ(η) η=η = − 2 0 2M 32iπ a(η0 )M R (η0 ) a(η0 ) λ R c1 1 1 µ 1 ẑ3 (µ) − φ(η0 )R(η0 ) − ln a3 (η0 )G (η f , η0 ) ξ − 2 2M 16iπ 6 a(η0 )M R (η0 ) 1 1 λ 2 c1 µ − ẑ4 (µ) − R 2 ln a3 (η0 )G (η f , η0 )φ3 (η0 ) + · · ·, 4M 16iπ a(η0 )M R (η0 ) which, from the Callan-Symanzik equation, implies that the boundary parameters run as follows, λ R c1 + ··· 32iπ2 λ R c1 β̂3 (λR ) = + ··· 16iπ2 λ R c1 β̂4 (λR ) = + ···. 16iπ2 β̂2 (λR ) = (6.43) So far we have let the parameter c1 be complex, but if the evolution is to remain unitary, it is clear that c1 should be purely imaginary so that its contribution to the boundary counterterm action is real. By similar reasoning, d1 should also be purely imaginary while d0 is real. VII. Adding together the boundary effects in Eq. (6.34) and the counterterms contribution of Eq. (6.38), after applying the MS scheme, yields all of the renormalization scale dependence that is associated with the initial state, CONCLUSIONS Although the effective theory principle has been widely and advantageously applied in field theory, its role has usually been reserved for describing the evolution of a system from one state to another rather than for describing properties of the actual states involved. For a field theory in Minkowski space, such a simplification is usually sufficient since scales— including that which divides long-distances from the regime where new physics could modify the structure of the true vacuum state away from a vacuum based on extrapolating the eigenstates of the low energy theory—are time-independent. Furthermore, the initial and final states are measured in a rather subdued environment, in an asymptotic past or future where the fields no longer interact with each other and where the states become essentially those of the free Minkowski space Hamiltonian. The conditions prevailing during inflation are dramatically different and can imply a more significant role for the shortdistance properties of the states. In particular, the standard vacuum choice is defined to be that state which resembles the Minkowski vacuum at distances where the curvature is not noticeable, k ≫ H. However, when the Hubble scale H is itself an appreciable fraction of the scale M for the trans-Planckian (6.42) physics, we are defining the vacuum precisely in a regime near where the low energy free theory is no longer applicable. An effective description attempts to provide a generic parameterization of this ambiguity between an extrapolated free vacuum and the true vacuum, without making any assumptions about the physics in the trans-Planckian regime. Because of the time-evolution of the background, it is not appropriate to choose states based upon their properties in an asymptotically distant past. Instead, the state is fixed at an “initial” time during the regime when the effective theory is predictive, that is, when the perturbative corrections to a general process are still small. In essence, an effective theory is an application of the principle of decoupling—that the physics of large scales should be relatively independent of the physics at short distances. To make this idea more precise, we describe the initial state through a power series, as in Eq. (6.10), which contains terms which either diminish or grow at short distances. The former are the analogues of the renormalizable part of a bulk effective field theory and they are fixed by the observed long-distance structure of the state. But it is the latter that contain the signals of trans-Planckian physics and they form the initial-state analogue of the nonrenormalizable operators of a bulk effective theory. It is important to remember that in this setting the detailed structure of the state is not meant to be part of a complete theory, but only an effective one applicable up to the scale of the unknown trans-Planckian physics. The main idea of this article has been to establish the renormalizability of this approach, showing explicitly that once the divergences have been all removed, the signals of the transPlanckian physics are suppressed by powers of the small ratio of the Hubble scale during inflation and the scale of the new physics, H/M. These new divergences result from summing over the short-distance structure of the initial state. Thus they appear in a perturbative correction only when we simultaneously sum over arbitrarily high spatial momenta and we evaluate it at the initial time, and as a consequence the counterterms are local operators confined to the same initial surface at which the state was defined. The power counting for these counterterms parallels that of an ordinary bulk theory, 19 with relevant or marginal boundary counterterms removing the short-distance divergences from the renormalizable part of the initial state and with irrelevant boundary counterterms removing the divergences from the trans-Planckian part. One reason for showing the renormalizability of a state with a nontrivial trans-Planckian component is to demonstrate that tree-level calculations, such as that of the primordial power spectrum of fluctuations, are perturbatively stable. But there is a subtle consequence of this result which has a much more direct effect even on a tree-level calculation. The search for a renormalizable description of a state is equivalent to finding the correct propagator for the short-distance information which is contained in this state that does not lead to uncontrolled divergences—and it is this same propagator that also appears in the tree-level estimate of a process. For example, in the Bunch-Davies vacuum, the two-point function and the propagator evaluated for equal times are equivalent, but this equivalence no longer holds for a more general initial state. The extra term, encoding how the information in the initial state propagates forward, influences the precise calculation of both the primordial power spectrum [11, 13] and the gravitational back-reaction [12, 34]. Quantum mechanically, this term represents the interference between the fields being measured and the initial state information. As a result, the precise calculation of the trans-Planckian signal requires following this quantum mechanical interference as it affects the classical spectrum of primordial fluctuations, as will be done in [23]. Acknowledgments This work was supported in part by DOE grant No. DE-FG0391-ER40682 and the National Science Foundation grant No. PHY02-44801. APPENDIX A: KERNELS In Sec. VI we introduced a family of kernel functions, K (p) (η) ≡ Z Rη d 3~k e−2i η0 dη Ωk (η ) , (2π)2 Ω3−p(η)Ω p (η0 ) k k ′ ′ (A1) which occur generically in the loop corrections to a process with a nonrenormalizable initial condition. The parts that depend on an arbitrary time η come from the loop propagator and the part evaluated only on the boundary at η0 resulted from the form of the power series we used to describe the initial state, as in Eq. (6.10). In fact, our choice for this power series was made to obtain a relatively simple form for the loop integrals, such as the kernel above, when evaluated on the initial boundary. The structure of the kernels is also chosen so that they only diverge logarithmically in (η − η0 ) after we perform the momentum integral. Thus if a kernel function appears within an dη-integral whose integrand, apart from the kernel factor, is well behaved, then the result is finite. The point of introducing these functions is that the loop integrals we encounter can be written in terms of an appropriate number of derivatives of K (p) (η) which we proceed to integrate by parts until all the derivatives have been removed from the kernel still occurring with the conformal time integral. The boundary terms which result from this process that are evaluated at the initial surface isolate all the new divergences associated with having a nonstandard initial condition on the state. The remaining momentum integrals can be then regularized, for example by extending the number of spatial dimensions to 3 − 2ε. In this article, we shall not need to consider more than three derivatives of the kernels, (3 − p)Ω′k (η) d 3~k −2i Rηη dη′ Ωk (η′ ) 2i 0 − 4−p K (η) = − 2−p e p p (2π)3 Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) Z 2i(5 − 2p)Ω′k(η) (3 − p)Ω′′k (3 − p)(4 − p)Ω′k2 d 3~k −2i Rηη dη′ Ωk (η′ ) 4 (p) ′′ 0 K (η) = + 3−p − 4−p + 5−p − 1−p e p p p p (2π)3 Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) Z 12(2 − p)Ω′k (η) 2i(8 − 3p)Ω′′k (η) 6i(3 − p)2Ω′k2 (η) d 3~k −2i Rηη dη′ Ωk (η′ ) 8i 0 + + 3−p − 4−p K (p) ′′′ (η) = e p 2−p p p p (2π)3 Ω−p Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) Ωk (η)Ωk (η0 ) k (η)Ωk (η0 ) 3(3 − p)(4 − p)Ω′k(η)Ω′′k (η) (3 − p)(4 − p)(5 − p)Ω′k3(η) (3 − p)Ω′′′ k (η) + − 4−p . − Ωk (η)Ωkp (η0 ) Ωk5−p(η)Ωkp (η0 ) Ωk6−p (η)Ωkp (η0 ) (A2) (p) ′ Z In the extreme ultraviolet limit, where the leading part of the generalized frequency approaches Ωk (η) → k, it is only the first term in each of these expressions which contains a shortdistance divergence and which is thus important for determining how the initial state should be renormalized. Moreover, these kernel functions are all finite except at η = η0 where there is no longer any oscillatory suppression in the terms that are not already manifestly finite. Because each of the derivatives in Eq. (A2) will be integrated by parts at least once, the actual divergences we encounter are found by setting η = η0 20 in d 3~k 1 3 3 (2π) Ωk (η0 ) Z 1 d 3~k (p) ′ + ··· K (η0 ) = −2i (2π)3 Ω2k (η0 ) Z d 3~k 1 K (p) ′′(η0 ) = −4 + · · ·. (2π)3 Ωk (η0 ) K (p) (η0 ) = encountered and can be regulated by continuing to an arbitrary real number of dimensions, 3 → 3 − 2ε, Z (A3) Note that these divergent parts are independent of the index p. The leading behavior of the adiabatic modes for large momenta is approximated by q (A4) Ωk (η0 ) ≈ k2 + a2(η0 )M 2 (η0 ), as was derived in Eq. (2.37), so that, up to the prefactors, each of the divergent parts of the kernels can be written in the very general form, I(0, α) = Z d 3~k 1 . (2π)3 [k2 + a2 M 2 ]α/2 µ2ε d 3−2ε~k 3−2ε 2 (2π) [k + a2M 2 ]α/2 √ h 2 iε π Γ(ε − 3−α 2 ) 4πµ = [aM ]3−α . (A6) 8π2 Γ( α2 ) a2 M 2 I(ε, α) = (A5) Z Evaluating the infinite and the finite, nonvanishing, terms for the cases α = {3, 1} yields K (p) (η0 ) = 1 1 4πµ2 + ··· − γ + ln 4π2 ε a2 (η0 )M 2 (η0 ) (A7) and K (p) ′′ (η0 ) = a2 (η0 )M 2 (η0 ) 2π2 (A8) 4πµ2 1 + ···. + 1 − γ + ln 2 ε a (η0 )M 2 (η0 ) Except for the fact that we are integrating over three rather than four dimensions, this general integral is of the usual form Note that K (p) ′(η0 ) is finite once its momentum integral has been dimensionally regularized. [1] H. Collins and R. Holman, Phys. Rev. D 71, 085009 (2005) [hep-th/0501158]. [2] A. Linde, Particle Physics and Inflationary Cosmology (Harwood Academic, Chur, Switzerland, 1990) [hep-th/0503203]; E. W. Kolb and M. S. Turner, The Early Universe (AddisonWesley, Reading, MA, 1990); A. Liddle and D. Lyth, Cosmological Inflation and Large-Scale Structure (Cambridge University Press, Cambridge, England, 2000). [3] H. V. Peiris et al., Astrophys. J. Suppl. 148, 213 (2003) [astro-ph/0302225]. [4] J. Martin and R. H. Brandenberger, Phys. Rev. D 63, 123501 (2001) [hep-th/0005209]; R. H. Brandenberger and J. Martin, Mod. Phys. Lett. A 16, 999 (2001) [astro-ph/0005432]. [5] R. Easther, B. R. Greene, W. H. Kinney and G. Shiu, Phys. Rev. D 64, 103502 (2001); R. Easther, B. R. Greene, W. H. Kinney and G. Shiu, Phys. Rev. D 67, 063508 (2003); G. Shiu and I. Wasserman, Phys. Lett. B 536, 1 (2002); R. Easther, B. R. Greene, W. H. Kinney and G. Shiu, Phys. Rev. D 66, 023518 (2002). [6] J. C. Niemeyer, Phys. Rev. D 63, 123502 (2001); A. Kempf, Phys. Rev. D 63, 083514 (2001); J. C. Niemeyer and R. Parentani, Phys. Rev. D 64, 101301 (2001); A. Kempf and J. C. Niemeyer, Phys. Rev. D 64, 103501 (2001); A. A. Starobinsky, Pisma Zh. Eksp. Teor. Fiz. 73, 415 (2001) [JETP Lett. 73, 371 (2001)]; L. Hui and W. H. Kinney, Phys. Rev. D 65, 103507 (2002); S. Shankaranarayanan, Class. Quant. Grav. 20, 75 (2003); S. F. Hassan and M. S. Sloth, Nucl. Phys. B 674, 434 (2003); K. Goldstein and D. A. Lowe, Phys. Rev. D 67, 063502 (2003); V. Bozza, M. Giovannini and G. Veneziano, JCAP 0305, 001 (2003); G. L. Alberghi, R. Casadio and A. Tronconi, Phys. Lett. B 579, 1 (2004); J. Martin and R. Brandenberger, Phys. Rev. D 68, 063513 (2003); U. H. Danielsson, Phys. Rev. D 66, 023511 (2002); U. H. Danielsson, JHEP 0207, 040 (2002); R. H. Brandenberger and J. Martin, Int. J. Mod. Phys. A 17, 3663 (2002). C. P. Burgess, J. M. Cline, F. Lemieux and R. Holman, JHEP 0302, 048 (2003) [hep-th/0210233]; C. P. Burgess, J. M. Cline and R. Holman, JCAP 0310, 004 (2003) [hep-th/0306079]; C. P. Burgess, J. Cline, F. Lemieux and R. Holman, astro-ph/0306236. H. Collins, R. Holman and M. R. Martin, Phys. Rev. D 68, 124012 (2003) [hep-th/0306028]; H. Collins and M. R. Martin, Phys. Rev. D 70, 084021 (2004) [hep-ph/0309265]; H. Collins, hep-th/0312144. R. Easther, W. H. Kinney and H. Peiris, JCAP 0505, 009 (2005) [astro-ph/0412613]. N. Kaloper, M. Kleban, A. E. Lawrence and S. Shenker, Phys. Rev. D 66, 123510 (2002); N. Kaloper, M. Kleban, A. Lawrence, S. Shenker and L. Susskind, JHEP 0211, 037 (2002). K. Schalm, G. Shiu and J. P. van der Schaar, JHEP 0404, 076 (2004) [hep-th/0401164]; B. Greene, K. Schalm, J. P. van der Schaar and G. Shiu, eConf C041213, 0001 (2004) [astro-ph/0503458]. B. R. Greene, K. Schalm, G. Shiu and J. P. van der Schaar, JCAP 0502, 001 (2005) [hep-th/0411217]; K. Schalm, G. Shiu and J. P. van der Schaar, AIP Conf. Proc. 743, 362 (2005) [hep-th/0412288]. R. Easther, W. H. Kinney and H. Peiris, astro-ph/0505426. P. R. Anderson, C. Molina-Paris and E. Mottola, hep-th/0504134. J. A. Tauber on behalf of ESA and the Planck Scientific Collaboration, Adv. Space Res. 34, 491 (2004). Two such examples include the Square Kilometre Array and the [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] 21 [17] [18] [19] [20] [21] [22] [23] [24] [25] Cosmic Inflation Probe: respectively, S. Rawlings, F. B. Abdalla, S. L. Bridle, C. A. Blake, C. M. Baugh, L. J. Greenhill and J. M. van der Hulst, New Astron. Rev. 48, 1013 (2004) [astro-ph/0409479] and G. J. Melnick, G. G. Fazio, V. Tolls, D. T. Jaffe, K. Gebhardt, V. Bromm, E. Komatsu, and R. A. Woodruff, Am. Astron. Soc. Meeting 205 (2004) 10006. H. Georgi, “Weak Interactions And Modern Particle Theory,” (Benjamin/Cummings, Menlo Park, California, 1984); H. Georgi, “Effective field theory,” Ann. Rev. Nucl. Part. Sci. 43, 209 (1993); I. Z. Rothstein, “TASI lectures on effective field theories,” hep-ph/0308266. In de Sitter space, for example, refer to E. Witten, hep-th/0106109. J. S. Schwinger, J. Math. Phys. 2, 407 (1961). L. V. Keldysh, Zh. Eksp. Teor. Fiz. 47, 1515 (1964) [Sov. Phys. JETP 20, 1018 (1965)]. K. T. Mahanthappa, Phys. Rev. 126, 329 (1962); P. M. Bakshi and K. T. Mahanthappa, J. Math. Phys. 41, 12 (1963). S. Weinberg, hep-th/0506236. H. Collins and R. Holman, in progress. F. Larsen and R. McNees, JHEP 0307, 051 (2003) [hep-th/0307026]; F. Larsen and R. McNees, JHEP 0407, 062 (2004) [hep-th/0402050]. T. S. Bunch and P. C. Davies, Proc. Roy. Soc. Lond. A 360, 117 (1978). [26] M. B. Einhorn and F. Larsen, Phys. Rev. D 67, 024001 (2003) [hep-th/0209159]; M. B. Einhorn and F. Larsen, Phys. Rev. D 68, 064002 (2003) [hep-th/0305056]. [27] T. Banks and L. Mannelli, Phys. Rev. D 67, 065009 (2003) [hep-th/0209113]. [28] K. Goldstein and D. A. Lowe, Phys. Rev. D 69, 023507 (2004) [hep-th/0308135]. [29] H. Collins and R. Holman, Phys. Rev. D 70, 084019 (2004) [hep-th/0312143]; H. Collins, Phys. Rev. D 71, 024002 (2005) [hep-th/0410229]; H. Collins, hep-th/0410228. [30] N. A. Chernikov and E. A. Tagirov, Annales Poincare Phys. Theor. A 9, 109 (1968); E. A. Tagirov, Annals Phys. 76, 561 (1973); E. Mottola, Phys. Rev. D 31, 754 (1985); B. Allen, Phys. Rev. D 32, 3136 (1985). [31] D. Boyanovsky, H. J. de Vega, R. Holman, D. S. Lee and A. Singh, Phys. Rev. D 51, 4419 (1995) [hep-ph/9408214]. [32] S. Weinberg, Phys. Rev. D 9, 3357 (1974). [33] K. G. Wilson and J. B. Kogut, Phys. Rept. 12, 75 (1974). [34] M. Porrati, Phys. Lett. B 596, 306 (2004) [hep-th/0402038]. M. Porrati, hep-th/0409210. F. Nitti, M. Porrati and J. W. Rombouts, hep-th/0503247.

Log In

An effective theory of initial conditions in inflation

Related papers

Related papers

Related topics