GEOPHYSICS, VOL. 74, NO. 6 共NOVEMBER-DECEMBER 2009兲; P. WCC127–WCC152, 15 FIGS., 1 TABLE.
10.1190/1.3238367
An overview of full-waveform inversion in exploration geophysics
J. Virieux1 and S. Operto2
ABSTRACT
Full-waveform inversion 共FWI兲 is a challenging data-fitting
procedure based on full-wavefield modeling to extract quantitative information from seismograms. High-resolution imaging at
half the propagated wavelength is expected. Recent advances in
high-performance computing and multifold/multicomponent
wide-aperture and wide-azimuth acquisitions make 3D acoustic
FWI feasible today. Key ingredients of FWI are an efficient forward-modeling engine and a local differential approach, in
which the gradient and the Hessian operators are efficiently estimated. Local optimization does not, however, prevent convergence of the misfit function toward local minima because of the
limited accuracy of the starting model, the lack of low frequencies, the presence of noise, and the approximate modeling of the
INTRODUCTION
Seismic waves bring to the surface information gathered on the
physical properties of the earth. Since the discovery of modern seismology at the end of the 19th century, the main discoveries have arisen from using traveltime information 共Oldham, 1906; Gutenberg,
1914; Lehmann, 1936兲. Then there was a hiatus until the 1980s for
amplitude interpretation, when global seismic networks could provide enough calibrated seismograms to compute accurate synthetic
seismograms using normal-mode summation. Differential seismograms estimated through the Born approximation have been used
as perturbations for matching long-period seismograms, which can
provide high-resolution upper-mantle tomography 共Gilbert and Dziewonski, 1975; Woodhouse and Dziewonski, 1984兲. The sensitivity
or Fréchet derivative matrix, i.e., the partial derivative of seismic
data with respect to the model parameters, is explicitly estimated before proceeding to inversion of the linearized system. The normalmode description allows a limited number of parameters to be in-
wave-physics complexity. Different hierarchical multiscale
strategies are designed to mitigate the nonlinearity and ill-posedness of FWI by incorporating progressively shorter wavelengths
in the parameter space. Synthetic and real-data case studies address reconstructing various parameters, from VP and VS velocities to density, anisotropy, and attenuation. This review attempts
to illuminate the state of the art of FWI. Crucial jumps, however,
remain necessary to make it as popular as migration techniques.
The challenges can be categorized as 共1兲 building accurate starting models with automatic procedures and/or recording low frequencies, 共2兲 defining new minimization criteria to mitigate the
sensitivity of FWI to amplitude errors and increasing the robustness of FWI when multiple parameter classes are estimated, and
共3兲 improving computational efficiency by data-compression
techniques to make 3D elastic FWI feasible.
verted 共a few hundred parameters兲, which makes the optimization
procedure feasible through explicit sensitivity matrix estimation in
spite of the high number of seismograms.
Meanwhile, exploration seismology has taken up the challenge of
high-resolution imaging of the subsurface by designing dense, multifold acquisition systems. Construction of the sensitivity matrix is
too prohibitive because the number of parameters exceed 10,000. Instead, another road has been taken to perform high-resolution imaging. Using the exploding-reflector concept, and after some kinematic corrections, amplitude summation has provided detailed images
of the subsurface for reservoir determination and characterization
共Claerbout, 1971, 1976兲. The sum of the traveltimes from a specific
point of the interface toward the source and the receiver should coincide with the time of large amplitudes in the seismogram. The reflectivity as an amplitude attribute of related seismic traces at the selected point of the reflector provides the migrated image needed for seismic stratigraphic interpretation. Although migration is more a concept for converting seismic data recorded in the time-space domain
Manuscript received by the Editor 7 June 2009; revised manuscript received 8 July 2009; published online 3 December 2009.
Université Joseph Fourier, Laboratoire de Géophysique Interne et Tectonophysique, CNRS, IRD, Grenoble, France. E-mail:
[email protected].
2
Université Nice-Sophia Antipolis, Géoazur, CNRS, IRD, Observatoire de la Côte d’Azur, Villefranche-sur-mer, France. E-mail:
[email protected].
© 2009 Society of Exploration Geophysicists. All rights reserved.
1
WCC127
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC128
Virieux and Operto
into images of physical properties, we often refer to it as the geometric description of the short wavelengths of the subsurface. A velocity
macromodel or background model provides the kinematic information required to focus waves inside the medium.
The limited offsets recorded by seismic reflection surveys and the
limited-frequency bandwidth of seismic sources make seismic imaging poorly sensitive to intermediate wavelengths 共Jannane et al.,
1989兲. This is the motivation behind a two-step workflow: construct
the macromodel using kinematic information, and then the amplitude projection using different types of migrations 共Claerbout and
Doherty, 1972; Gazdag, 1978; Stolt, 1978; Baysal et al., 1983; Yilmaz, 2001; Biondi and Symes, 2004兲. This procedure is efficient for
relatively simple geologic targets in shallow-water environments,
although more limited performances have been achieved for imaging complex structures such as salt domes, subbasalt targets, thrust
belts, and foothills. In complex geologic environments, building an
accurate velocity background model for migration is challenging.
Various approaches for iterative updating of the macromodel reconstruction have been proposed 共Snieder et al., 1989; Docherty et al.,
2003兲, but they remain limited by the poor sensitivity of the reflection seismic data to the large and intermediate wavelengths of the
subsurface.
Simultaneous with the global seismology inversion scheme,
Lailly 共1983兲 and Tarantola 共1984兲 recast the migration imaging
principle of Claerbout 共1971, 1976兲 as a local optimization problem,
the aim of which is least-squares minimization of the misfit between
recorded and modeled data. They show that the gradient of the misfit
function along which the perturbation model is searched can be built
by crosscorrelating the incident wavefield emitted from the source
and the back-propagated residual wavefields. The perturbation model obtained after the first iteration of the local optimization looks like
a migrated image obtained by reverse-time migration. One difference is that the seismic wavefield recorded at the receiver is back
propagated in reverse time migration, whereas the data misfit is back
propagated in the waveform inversion of Lailly 共1983兲 and Tarantola
共1984兲. When added to the initial velocity, the velocity perturbations
lead to an updated velocity model, which is used as a starting model
for the next iteration of minimizing the misfit function. The impressive amount of data included in seismograms 共each sample of a time
series must be considered兲 is involved in gradient estimation. This
estimation is performed by summation over sources, receivers, and
time.
Waveform-fitting imaging was quite computer demanding at that
time, even for 2D geometries 共Gauthier et al., 1986兲. However, it has
been applied successfully in various studies using forward-modeling techniques such as reflectivity techniques in layered media 共Kormendi and Dietrich, 1991兲, finite-difference techniques 共Kolb et al.,
1986; Ikelle et al., 1988; Crase et al., 1990; Pica et al., 1990; Djikpéssé and Tarantola, 1999兲, finite-element methods 共Choi et al.,
2008兲, and extended ray theory 共Cary and Chapman, 1988; Koren et
al., 1991; Sambridge and Drijkoningen, 1992兲. A less computationally intensive approach is achieved by Jin et al. 共1992兲 and Lambaré
et al. 共1992兲, who establish the theoretical connection between raybased generalized Radon reconstruction techniques 共Beylkin, 1985;
Bleistein, 1987; Beylkin and Burridge, 1990兲 and least-squares optimization 共Tarantola, 1987兲. By defining a specific norm in the data
space, which varies from one focusing point to the next, they were
able to recast the asymptotic Radon transform as an iterative leastsquares optimization after diagonalizing the Hessian operator. Applications on 2D synthetic data and real data are provided 共Thierry et
al., 1999b; Operto et al., 2000兲 and 3D extension is possible 共Thierry
et al., 1999a; Operto et al., 2003兲 because of efficient asymptotic forward modeling 共Lucio et al., 1996兲. Because the Green’s functions
are computed in smoothed media with the ray theory, the forward
problem is linearized with the Born approximation, and the optimization is iterated linearly, which means the background model remains the same over the iterations. These imaging methods are generally called migration/inversion or true-amplitude prestack depth
migration 共PSDM兲. The main difference with the waveform-inversion methods we describe is that the smooth background model does
not change over iterations and only the single scattered wavefield is
modeled by linearizing the forward problem.
Alternatively, the full information content in the seismogram can
be considered in the optimization. This leads us to full-waveform inversion 共FWI兲, where full-wave equation modeling is performed at
each iteration of the optimization in the final model of the previous
iteration. All types of waves are involved in the optimization, including diving waves, supercritical reflections, and multiscattered waves
such as multiples. The techniques used for the forward modeling
vary and include volumetric methods such as finite-element methods 共Marfurt, 1984; Min et al., 2003兲, finite-difference methods
共Virieux, 1986兲, finite-volume methods 共Brossier et al., 2008兲, and
pseudospectral methods 共Danecek and Seriani, 2008兲; boundary integral methods such as reflectivity methods 共Kennett, 1983兲; generalized screen methods 共Wu, 2003兲; discrete wavenumber methods
共Bouchon et al., 1989兲; generalized ray methods such as WKBJ and
Maslov seismograms 共Chapman, 1985兲; full-wave theory 共de Hoop,
1960兲; and diffraction theory 共Klem-Musatov and Aizenberg, 1985兲.
FWI has not been recognized as an efficient seismic imaging technique because pioneering applications were restricted to seismic reflection data. For short-offset acquisition, the seismic wavefield is
rather insensitive to intermediate wavelengths; therefore, the optimization cannot adequately reconstruct the true velocity structure
through iterative updates. Only when a sufficiently accurate initial
model is provided can waveform-fitting converge to the velocity
structure through such updates. For sampling the initial model, sophisticated investigations with global and semiglobal techniques
共Koren et al., 1991; Jin and Madariaga, 1993, 1994; Mosegaard and
Tarantola, 1995; Sambridge and Mosegaard, 2002兲 have been performed. The rather poor performance of these investigations that
arises from insensitivity to intermediate wavelengths has led many
researchers to believe that this optimization technique is not particularly efficient.
Only with the benefit of long-offset and transmission data to reconstruct the large and intermediate wavelengths of the structure has
FWI reached its maturity as highlighted by Mora 共1987, 1988兲, Pratt
and Worthington 共1990兲, Pratt et al. 共1996兲, and Pratt 共1999兲. FWI attempts to characterize a broad and continuous wavenumber spectrum at each point of the model, reunifying macromodel building
and migration tasks into a single procedure. Historical crosshole and
wide-angle surface data examples illustrate the capacity of simultaneous reconstruction of the entire spatial spectrum 共e.g., Pratt, 1999;
Ravaut et al., 2004; Brenders and Pratt, 2007a兲. However, robust application of FWI to long-offset data remains challenging because of
increasing nonlinearities introduced by wavefields propagated over
several tens of wavelengths and various incidence angles 共Sirgue,
2006兲.
Here, we consider the main aspects of FWI. First, we review the
forward-modeling problem that underlies FWI. Efficient numerical
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
modeling of the full seismic wavefield is a central issue in FWI, especially for 3D problems.
In the second part, we review the main theoretical aspects of FWI
based on a least-squares local optimization approach. We follow the
compact matrix formalism for its simplicity 共Pratt et al., 1998; Pratt,
1999兲, which leads to a clear interpretation of the gradient and the
Hessian of the objective function. Once the gradient is estimated, we
review different optimization algorithms used to compute the perturbation model. We conclude the methodology section by the source
estimation problem in FWI.
In the third part, we review some key features of FWI. First, we
highlight the relationships between the experimental setup 共source
bandwidth, acquisition geometry兲 and the spatial resolution of FWI.
The resolution analysis provides the necessary guidelines to design
the multiscale FWI algorithms required to mitigate the nonlinearity
of FWI. We discuss the pros and cons of the time and frequency domains for efficient multiscale algorithms. We provide a few words
concerning the parallel implementation of FWI techniques because
these are computer demanding. Then we review some alternatives to
the least-squares criterion and the Born linearization. A key issue of
FWI is the initial model from which the local optimization is started.
We also discuss several tomographic approaches to building a starting model.
In the fourth part, we review the main case studies of FWI subdivided into three categories of case studies: acoustic, multiparameter,
and three dimensional. Finally, we discuss the future challenges
raised by the revival of interest in FWI that has been shown by the
exploration and the earthquake-seismology communities.
THE FORWARD PROBLEM
Let us first introduce the notations for the forward problem, namely, modeling the full seismic wavefield. The reader is referred to
Robertsson et al. 共2007兲 for an up-to-date series of publications on
modern seismic-modeling methods.
We use matrix notations to denote the partial-differential operators of the wave equation 共Marfurt, 1984; Carcione et al., 2002兲. The
most popular direct method to discretize the wave equation in the
time and frequency domains is the finite-difference method
共Virieux, 1986; Levander, 1988; Graves, 1996; Operto et al., 2007兲,
although more sophisticated finite-element or finite-volume approaches can be considered. This is especially true when accurate
boundary conditions through unstructured meshes must be implemented 共e.g., Komatitsch and Vilotte, 1998; Dumbser and Kaser,
2006兲.
In the time domain, we have
M共x兲
d2u共x,t兲
⳱ A共x兲u共x,t兲 Ⳮ s共x,t兲,
dt2
共1兲
where M and A are the mass and the stiffness matrices, respectively
共Marfurt, 1984兲. The source term is denoted by s and the seismic
wavefield by u. In the acoustic approximation, u generally represents pressure, although in the elastic case u generally represents
horizontal and vertical particle velocities. The time is denoted by t
and the spatial coordinates by x. Equation 1 generally is solved with
an explicit time-marching algorithm: The value of the wavefield at a
time step 共n Ⳮ 1兲 at a spatial position is inferred from the value of the
wavefields at previous time steps. Implicit time-marching algorithms are avoided because they require solving a linear system
共Marfurt, 1984兲. If both velocity and stress wavefields are helpful,
WCC129
the system of second-order equations can be recast as a first-order
hyperbolic velocity-stress system by incorporating the necessary
auxiliary variables 共Virieux, 1986兲.
In the frequency domain, the wave equation reduces to a system of
linear equations; the right-hand side is the source and the solution is
the seismic wavefield. This system can be written compactly as
B共x, 兲u共x, 兲 ⳱ s共x, 兲,
共2兲
where B is the impedance matrix 共Marfurt, 1984兲. The sparse complex-valued matrix B has a symmetric pattern, although it is not symmetric because of absorbing boundary conditions 共Hustedt et al.,
2004; Operto et al., 2007兲.
Equation 2 can be solved by a decomposition of B such as lower
and upper 共LU兲 triangular decomposition, leading to direct-solver
techniques. The advantage of the direct-solver approach is that once
the decomposition is performed, equation 2 is efficiently solved for
multiple sources using forward and backward substitutions 共Marfurt, 1984兲. The direct-solver approach is efficient for 2D forward
problems 共Jo et al., 1996; Stekl and Pratt, 1998; Hustedt et al., 2004兲.
However, the time and memory complexities of LU factorization
and its limited scalability on large-scale distributed memory platforms prevent use of the approach for large-scale 3D problems 共i.e.,
problems involving more than 10 million unknowns; Operto et al.,
2007兲.
Iterative solvers provide an alternative approach for solving the
time-harmonic wave equation 共Riyanti et al., 2006, 2007; Plessix,
2007; Erlangga and Herrmann, 2008兲. Iterative solvers currently are
implemented with Krylov subspace methods 共Saad, 2003兲 that are
preconditioned by solving the dampened time-harmonic wave equation. The solution of the dampened wave equation is computed with
one cycle of a multigrid. The main advantage of the iterative approach is the low memory requirement, although the main drawback
results from a difficulty to design an efficient preconditioner because
the impedance matrix is indefinite. To our knowledge, the extension
to elastic wave equations still needs to be investigated. As for the
time-domain approach, the time complexity of the iterative approach increases linearly with the number of sources in contrast to
the direct-solver approach.
An intermediate approach between the direct and iterative methods consists of a hybrid direct-iterative approach based on a domain
decomposition method and the Schur complement system 共Saad,
2003; Sourbier et al., 2008兲. The iterative solver is used to solve the
reduced Schur complement system, the solution of which is the
wavefield at interface nodes between subdomains. The direct solver
is used to factorize local impedance matrices that are assembled on
each subdomain. Briefly, the hybrid approach provides a compromise in terms of memory savings and multisource-simulation efficiency between the direct and the iterative approaches.
The last possible approach to compute monochromatic wavefields is to perform the modeling in the time domain and extract the
frequency-domain solution either by discrete Fourier transform in
the loop over the time steps 共Sirgue et al., 2008兲 or by phase-sensitivity detection once the steady-state regime is reached 共Nihei and Li,
2007兲. One advantage of the approach based on the discrete Fourier
transform is that an arbitrary number of frequencies can be extracted
within the loop over time steps at minimal extra cost. Second, time
windowing can be easily applied, which is not the case when the
modeling is performed in the frequency domain. Time windowing
allows the extraction of specific arrivals for FWI 共early arrivals, reflections, PS converted waves兲, which is often useful to mitigate the
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC130
Virieux and Operto
nonlinearity of the inversion by judicious data preconditioning
共Sears et al., 2008; Brossier et al., 2009a兲.
Among all of these possible approaches, the iterative-solver approach theoretically has the best time complexity 共here, “complexity” denotes how the computational cost of an algorithm grows with
the size of the computational domain兲 if the number of iterations is
independent of the frequency 共Erlangga and Herrmann, 2008兲. In
practice, the number of iterations generally increases linearly with
frequency. In this case, the time complexities of the time-domain and
the iterative-solver approach are equivalent 共Plessix, 2007兲.
The reader is referred to Plessix 共2007, 2009兲 and Virieux et al.
共2009兲 for more detailed complexity analyses of seismic modeling
based on different numerical approaches. A discussion on the pros
and cons of time-domain versus frequency-domain seismic modeling with application to FWI is also provided in Vigh and Starr
共2008b兲 and Warner et al. 共2008兲.
Source implementation is an important issue in FWI. The spatial
reciprocity of Green’s functions can be exploited in FWI to mitigate
the number of forward problems if the number of receivers is significantly smaller than the number of sources 共Aki and Richards, 1980兲.
The reciprocity of Green’s functions also allows matching data emitted by explosions and recorded by directional sensors, with pressure
synthetics computed for directional forces 共Operto et al., 2006兲. Of
note, the spatial reciprocity is satisfied theoretically for the unidirectional sensor and the unidirectional impulse source. However, the
spatial reciprocity of the Green’s functions can also be used for explosive sources by virtue of the superposition principle. Indeed, explosions can be represented by double dipoles or, in other words, by
four unidirectional impulse sources.
A final comment concerns the relationship between the discretization required to solve the forward problem and that required to reconstruct the physical parameters. Often during FWI, these two discretizations are identical, although it is recommended that the fingerprint of the forward problem be kept minimal in FWI.
The properties of the subsurface that we want to quantify are embedded in the coefficients of matrices M, A, or B of equations 1 and
2. The relationship between the seismic wavefield and the parameters is nonlinear and can be written compactly through the operator
G, defined as
u ⳱ G共m兲
共3兲
in the time domain or in the frequency domain.
FWI AS A LEAST-SQUARES LOCAL
OPTIMIZATION
We follow the simplest view of FWI based on the so-called length
method 共Menke, 1984兲. For information on probabilistic maximum
likelihood or generalized inverse formulations, the reader is referred
to Menke 共1984兲, Tarantola 共1987兲, Scales and Smith 共1994兲, and
Sen and Stoffa 共1995兲.
We define the misfit vector ⌬d ⳱ dobs ⳮ dcal共m兲 of dimension N
by the differences at the receiver positions between the recorded
seismic data dobs and the modeled seismic data dcal共m兲 for each
source-receiver pair of the seismic survey. Here, dcal can be related to
the modeled seismic wavefield u by a detection operator R, which
extracts the values of the wavefields computed in the full computational domain at the receiver positions for each source: dcal ⳱ Ru.
The model m represents some physical parameters of the subsurface
discretized over the computational domain.
In the simplest case corresponding to the monoparameter acoustic
approximation, the model parameters are the P-wave velocities defined at each node of the numerical mesh used to discretize the inverse problem. In the extreme case, the model parameters correspond to the 21 elastic moduli that characterize linear triclinic elastic
media, the density, and some memory variables that characterize the
anelastic behavior of the subsurface 共Toksöz and Johnston, 1981兲.
The most common discretization consists of projection of the continuous model of the subsurface on a multidimensional Dirac comb,
although a more complex basis can be considered 共see Appendix A
in Pratt et al. 关1998兴 for a discussion on alternative parameterizations兲. We define a norm C共m兲 of this misfit vector ⌬d, which is referred to as the misfit function or the objective function. We focus below on the least-squares norm, which is easier to manipulate mathematically 共Tarantola, 1987兲. Other norms are discussed later.
The Born approximation and the linearization of the
inverse problem
The least-squares norm is given by
1
C共m兲 ⳱ ⌬d†⌬d,
2
共4兲
where † denotes the adjoint operator 共transpose conjugate兲.
In the time domain, the implicit summation in equation 4 is performed over the number of source-channel pairs and the number of
time samples in the seismograms, where a channel is one component
of a multicomponent sensor. In the frequency domain, the summation over frequencies replaces that over time. In the time domain, the
misfit vector is real valued; in the frequency domain, it is complex
valued.
The minimum of the misfit function C共m兲 is sought in the vicinity
of the starting model m0. The FWI is essentially a local optimization.
In the framework of the Born approximation, we assume that the updated model m of dimension M can be written as the sum of the starting model m0 plus a perturbation model ⌬m:m ⳱ m0 Ⳮ ⌬m. In the
following, we assume that m is real valued.
A second-order Taylor-Lagrange development of the misfit function in the vicinity of m0 gives the expression
M
C共m0兲
⌬m j
j⳱1 m j
C共m0 Ⳮ ⌬m兲 ⳱ C共m0兲 Ⳮ 兺
M
M
1
2C共m0兲
Ⳮ 兺 兺
⌬m j⌬mk Ⳮ O共m3兲.
2 j⳱1 k⳱1 m j mk
共5兲
Taking the derivative with respect to the model parameter ml results
in
M
2C共m0兲
C共m兲 C共m0兲
⳱
Ⳮ兺
⌬m j .
ml
ml
j⳱1 m j ml
共6兲
The minimum of the misfit function in the vicinity of point m0 is
reached when the first derivative of the misfit function vanishes. This
gives the perturbation model vector:
冋
⌬m ⳱ ⳮ
2C共m0兲
m2
册
ⳮ1
C共m0兲
.
m
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
共7兲
Full-waveform inversion
The perturbation model is searched in the opposite direction of the
steepest ascent 共i.e., the gradient兲 of the misfit function at point m0.
The second derivative of the misfit function is the Hessian; it defines
the curvature of the misfit function at m0. Of note, the error term
O共m3兲 in equation 5 is zero when the misfit function is a quadratic
function of m. This is the case for linear forward problems such as u
⫽ G.m. In this case, the expression of the perturbation model of
equation 7 gives the minimum of the misfit function in one iteration.
In FWI, the relationship between the data and the model is nonlinear
and the inversion needs to be iterated several times to converge toward the minimum of the misfit function.
Normal equations: The Newton, Gauss-Newton, and
steepest-descent methods
WCC131
model parameters is zero. Most of the time, this second-order term is
neglected for nonlinear inverse problems. In the following, the remaining term in the Hessian, i.e., Ha ⳱ J†0J0, is referred to as the approximate Hessian. The method which solves equation 11 when only
Ha is estimated is referred to as the Gauss-Newton method.
Alternatively, the inverse of the Hessian in equation 11 can be replaced by a scalar ␣ , the so-called step length, leading to the gradient
or steepest-descent method. The step length can be estimated by a
line-search method, for which a linear approximation of the forward
problem is used 共Gauthier et al., 1986; Tarantola, 1987兲. In the linear
approximation framework, the second-order Taylor-Lagrange development of the misfit function gives
C共m ⳮ ␣ ⵜ C共m0兲兲 ⳱ C共m兲 ⳮ ␣ 具 ⵜ C共m兲兩 ⵜ C共m0兲典
1
Ⳮ ␣ 2Ha共m兲具 ⵜ C共m0兲兩 ⵜ C共m0兲典,
2
Basic equations
The derivative of C共m兲 with respect to the model parameter ml
gives
N
1
C共m兲
⳱ⳮ 兺
2 i⳱1
ml
冋冉 冊
dcali
ml
Ⳮ 共dobsi ⳮ dcali兲
N
⳱ ⳮ兺 R
i⳱1
i
ml
冋冉 冊
dcali *
ml
where we assume a model perturbation of the form ⌬m
⳱ ␣ ⵜ C共m0兲. In equation 12, we replace the second-order derivative of the misfit function by the approximate Hessian in the second
term on the right-hand side. Inserting the expression of the approximate Hessian Ha into the previous expression, zeroing the partial derivative of the misfit function with respect to ␣ , and using m ⳱ m0
gives
共dobsi ⳮ dcali兲*
*
dcal
册
册
共8兲
共dobsi ⳮ dcali兲 ,
where the real part and the conjugate of a complex number are denoted by R and *, respectively. In matrix form, equation 8 translates to
ⵜCm ⳱
C共m兲
⳱ ⳮR
m
冋冉
冊
†
dcal共m兲
共dobs ⳮ dcal共m兲兲
m
⳱ ⳮR关J†⌬d兴,
册
共9兲
where J is the sensitivity or the Fréchet derivative matrix. In equation 9, ⵜCm is a vector of dimension M. Taking m ⳱ m0 in equation 9
provides the descent direction along which the perturbation model is
searched in equation 7.
Differentiation of the gradient expression 8, with respect to the
model parameters gives the following expression in matrix form for
the Hessian 共see Pratt et al. 关1998兴 for details兲:
冋
册
Jt0
2C共m0兲
†
⳱
R关J
J
兴
Ⳮ
R
共⌬d*0 . . . ⌬d*0 兲 . 共10兲
0 0
m2
mt
Inserting the expression of the gradient 共equation 9兲 and the Hessian
共equation 10兲 into equation 7 gives the following for the perturbation
model:
再冋
⌬m ⳱ ⳮ R J†0J0 Ⳮ
Jt0
共⌬d*0 . . . ⌬d*0 兲
mt
册冎
共12兲
ⳮ1
R关J†0⌬d0兴.
共11兲
The method solving the normal equations, e.g., equation 11, generally is referred to as the Newton method, which is locally quadratically
convergent.
For linear problems 共d⫽G.m兲, the second term in the Hessian is
zero because the second-order derivative of the data with respect to
␣⳱
具 ⵜ C共m0兲兩 ⵜ C共m0兲典
.
具共J 共m0兲 ⵜ C共m0兲兩Jt共m0兲 ⵜ C共m0兲兲典
t
共13兲
The term Jt共m0兲 ⵜ C共m0兲 is computed conventionally using a firstorder-accurate finite-difference approximation of the partial derivative of G,
1
G共m0兲
ⵜ C共m0兲 ⳱ 共G共m0 Ⳮ ⵜ C共m0兲兲 ⳮ G共m0兲兲,
m
共14兲
with a small parameter . Estimation of ␣ requires solving an extra
forward problem per shot for the perturbed model m0 Ⳮ ⵜ C共m0兲.
This line-search technique is extended to multiple-parameter classes
by Sambridge et al. 共1991兲 using a subspace approach. In this case,
one forward problem must be solved per parameter class, which can
be computationally expensive. Alternatively, the step length can be
estimated by parabolic interpolation through three points, 共 ␣ ,C共m0
Ⳮ ␣ ⵜ C共m0兲兲兲. The minimum of the parabola provides the desired
␣ . In this case, two extra forward problems per shot must be solved
because we already have a third point corresponding to 共0,C共m0兲兲
共see Figure 1 in Vigh et al. 关2009兴 for an illustration兲.
Pratt et al. 共1998兲 illustrate how quality and rate of convergence of
the inversion depend significantly on the Newton, Gauss-Newton, or
gradient method used. Importantly, they show how the gradient
method can fail to converge toward an acceptable model, however
many iterations, unlike the Newton and Gauss-Newton methods.
They interpret this failure as the result of the difficulty of estimating
a reliable step length. However, gradient methods can be significantly improved by scaling 共i.e., dividing兲 the gradient by the diagonal
terms of Ha or of the pseudo-Hessian 共Shin et al., 2001a兲.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC132
Virieux and Operto
Numerical algorithms: The conjugate-gradient method
Over the last decade, the most popular local optimization algorithm for solving FWI problems was based on the conjugate-gradient method 共Mora, 1987; Tarantola, 1987; Crase et al., 1990兲. Here,
the model is updated at the iteration n in the direction of p共n兲, which is
a linear combination of the gradient at iteration n, ⵜC共n兲, and the direction p共nⳮ1兲:
p共n兲 ⳱ ⵜ Cn Ⳮ  共n兲p共nⳮ1兲 .
共15兲
The scalar  共n兲 is designed to guarantee that p共n兲 and p共nⳮ1兲 are conjugate. Among the different variants of the conjugate-gradient method to derive the expression of  共n兲, the Polak-Ribière formula 共Polak
and Ribière, 1969兲 is generally used for FWI:
 共n兲 ⳱
具 ⵜ C共n兲 ⳮⵜ C共nⳮ1兲兩 ⵜ C共n兲典
.
储 ⵜ C共n兲储2
共16兲
ⳮ1
m
In FWI, the preconditioned gradient W ⵜ C共n兲 is used for p共n兲, where
Wm is a weighting operator that is introduced in the next section
共Mora, 1987兲. Only three vectors of dimension M, i.e., ⵜC共n兲,
ⵜC共nⳮ1兲, and p共nⳮ1兲, are required to implement the conjugate-gradient method.
Numerical algorithms: Quasi-Newton algorithms
Finite approximations of the Hessian and its inverse can be computed using quasi-Newton methods such as the BFGS algorithm
共named after its discoverers Broyden, Fletcher, Goldfarb, and Shanno; see Nocedal 关1980兴 for a review兲. The main idea is to update
the approximation of the Hessian or its inverse H共n兲 at each iteration
of the inversion, taking into account the additional knowledge provided by ⵜC共n兲 at iteration n. In these approaches, the approximation
of the Hessian or its inverse is explicitly formed.
For large-scale problems such as FWI in which the cost of storing
and working with the approximation of the Hessian matrix is prohibitive, a limited-memory variant of the quasi-Newton BFGS method
known as the L-BFGS algorithm allows computing in a recursive
manner H共n兲 ⵜ C共n兲 without explicitly forming H共n兲. Only a few gradients of the previous nonlinear iterations 共typically, 3–20 iterations兲
need to be stored in L-BFGS, which represents a negligible storage
and computational cost compared to the conjugate-gradient algorithm 共see Nocedal, 1980; p. 177–180兲. The L-BFGS algorithm requires an initial guess H共0兲, which can be provided by the inverse of
the diagonal Hessian 共Brossier et al., 2009a兲. For multiparameter
FWI, the L-BFGS algorithm provides a suitable scaling of the gradients computed for each parameter class and hence provides a computationally efficient alternative to the subspace method of Sambridge et al. 共1991兲. A comparison between the conjugate-gradient
method and the L-BFGS method for a realistic onshore application
of multiparameter elastic FWI is shown in Brossier et al. 共2009a兲.
共equation 11兲. Neither the full Hessian nor the sensitivity matrix is
formed explicitly; only the application of the Hessian to a vector
needs to be performed at each iteration of the conjugate-gradient algorithm.
Application of the Hessian to a vector requires performing two
forward problems per shot for the incident and the adjoint wavefields
共Akcelik, 2002兲. Because these two simulations are performed at
each iteration of the conjugate-gradient algorithm, an efficient preconditioner must be used to mitigate the number of iterations of the
conjugate-gradient algorithm. Epanomeritakis et al. 共2008兲 use a
variant of the L-BFGS method for the preconditioner of the conjugate gradient, in which the curvature of the objective function is updated at each iteration of the conjugate gradient using the Hessianvector products collected over the iterations.
Regularization and preconditioning of inversion
As widely stressed, FWI is an ill-posed problem, meaning that an
infinite number of models matches the data. Some regularizations
are conventionally applied to the inversion to make it better posed
共Menke, 1984; Tarantola, 1987; Scales et al., 1990兲. The misfit function can be augmented as follows:
1
1
C共m兲 ⳱ ⌬d†Wd⌬d Ⳮ 共m ⳮ mprior兲†Wm共m ⳮ mprior兲,
2
2
共17兲
where Wd ⳱ StdSd and Wm ⳱ SmtSm. Weighting operators are Wd and
Wm, the inverse of the data and model covariance operators in the
frame of the Bayesian formulation of FWI 共Tarantola, 1987兲. The
operator Sd can be implemented as a diagonal weighting operator
that controls the respective weight of each element of the data-misfit
vector. For example, Operto et al. 共2006兲 use Sd as a power of the
source-receiver offset to strengthen the contribution of large-offset
data for crustal-scale imaging. In geophysical applications where the
smoothest model that fits the data is often sought, the aim of the
least-squares regularization term in the augmented misfit function
共equation 17兲 is to penalize the roughness of the model m, hence defining the so-called Tikhonov regularization 共see Hansen 关1998兴 for
a review on regularization methods兲. The operator Sm is generally a
roughness operator, such as the first- or second-difference matrices
共Press et al., 1986, 1007兲.
For linear problems 共assuming the second term of the Hessian is
neglected兲, the minimization of the weighted misfit function gives
the perturbation model:
⌬m ⳱ ⳮ兵R共J†0WdJ0兲 Ⳮ Wm其ⳮ1R关J†0Wd⌬d0兴, 共18兲
where we use mprior ⳱ m0. Of note, equation 18 is equivalent to
Tarantola 共1987, p. 70兲 and Menke 共1984, p. 55兲:
ⳮ1
ⳮ1 †
†
ⳮ1
⌬m ⳱ ⳮWm
兵R共J0Wm
J0兲 Ⳮ Wⳮ1
d 其 R关J0⌬d0兴.
共19兲
Newton and Gauss-Newton algorithms
The more accurate, although more computationally intensive,
Gauss-Newton and Newton algorithms are described in Akcelik
共2002兲, Askan et al. 共2007兲, Askan and Bielak 共2008兲, and Epanomeritakis et al. 共2008兲, with an application to a 2D synthetic model
of the San Fernando Valley using the SH-wave equation. At each
nonlinear FWI iteration, a matrix-free conjugate-gradient method is
used to solve the reduced Karush-Kuhn-Tucker 共KKT兲 optimal system, which turns out to be similar to the normal equation system
Equation 19 can be more tractable from a computational viewpoint
when N ⬍ M. Because Wm is a roughness operator, Wmⳮ1 is a
smoothing operator. It can be implemented, for example, with a multidimensional adaptive Gaussian smoother 共Ravaut et al., 2004; Operto et al., 2006兲 or with a low-pass filter in the wavenumber domain
共Sirgue, 2003兲.
For the steepest-descent algorithm, the regularized solution for
the perturbation model is given by
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
ⳮ1
⌬m ⳱ ⳮ␣ Wm
R关J†0Wd⌬d0兴,
共20兲
where the scaling performed by the diagonal terms of the approximate Hessian can be embedded in the operator Wmⳮ1 in addition to the
smoothing operator.
A more complete and rigorous mathematical derivation of these
equations is presented in Tarantola 共1987兲.
Alternative regularizations based on minimizing the total variation of the model have been developed mainly by the image-processing and electromagnetic communities. The aim of the total variation
or edge-preserving regularization is to preserve both the blocky and
the smooth characteristics of the model 共Vogel and Oman, 1996; Vogel, 2002兲. Total variation 共TV兲 regularization is conventionally implemented by minimizing the L1-norm of the model-misfit function
RTV ⳱ 共⌬mWm⌬m兲1/2. Alternatively, van den Berg and Abubakar
共2001兲 implement TV regularization as a multiplicative constraint in
the original misfit function. In this framework, the original misfit
function can be seen as the weighting factor of the regularization
term, which is automatically updated by the optimization process
without the need for heuristic tuning. TV regularization is applied to
FWI in Askan and Bielak 共2008兲. The weighted L2-norm regularization applied to frequency-domain FWI is shown in Hu et al. 共2009兲
and Abubakar et al. 共2009兲.
The gradient and Hessian in FWI: Interpretation and
computation
A clear interpretation of the gradient and Hessian is given by Pratt
et al. 共1998兲 using the compact matrix formalism of frequency-domain FWI. A review is given here. Let us consider the forward-problem equation given by equation 2 for one source and one frequency.
In the following, we assume that the model is discretized in a finitedifference sense using a uniform grid of nodes.
Differentiation of equation 2 with respect to the model parameter
ml gives the expression of the partial derivative wavefield u / mᐉ
by solving the following system:
B
u
⳱ f共ᐉ兲,
mᐉ
共21兲
where
f共ᐉ兲 ⳱ ⳮ
B
u.
mᐉ
共22兲
An analogy between the forward-problem equation 2 and equation 21 shows that the partial-derivative wavefield can be computed
by solving one forward problem, the source of which is given by f共ᐉ兲.
This so-called virtual secondary source is formed by the product of
B / mᐉ and the incident wavefield u. The matrix B / mᐉ is built by
differentiating each coefficient of the forward-problem operator B
with respect to mᐉ. Because the discretized differential operators in B
generally have local support, the matrix B / mᐉ is extremely
sparse.
The spatial support of the virtual secondary source is centered on
the position of mᐉ, whereas the temporal support of f共ᐉ兲 is centered
around the arrival time of the incident wavefield at the position of mᐉ.
Therefore, the partial-derivative wavefield with respect to the model
parameter mᐉ can be interpreted as the wavefield emitted by the seismic source s and scattered by a point diffractor located at mᐉ. The radiation pattern of the virtual secondary source is controlled by the
operator B / mᐉ. Analysis of this radiation pattern for different pa-
WCC133
rameter classes allows us to assess to what extent parameters of different natures are uncoupled in the tomographic reconstruction as a
function of the diffraction angle and to what extent they can be reliably reconstructed during FWI. Radiation patterns for the isotropic
acoustic, elastic, and viscoelastic wave equations are shown in Wu
and Aki 共1985兲, Tarantola 共1986兲, Ribodetti and Virieux 共1996兲, and
Forgues and Lambaré 共1997兲.
Because the gradient is formed by the zero-lag correlation between the partial-derivative wavefield and the data residual, these
have the same meaning: They represent perturbation wavefields
scattered by the missing heterogeneities in the starting model m0
共Tarantola, 1984; Pratt et al., 1998兲. This interpretation draws some
connections between FWI and diffraction tomography; the perturbation model can be represented by a series of closely spaced diffractors. By virtue of Huygens’ principle, the image of the model perturbations is built by the superposition of the elementary image of each
diffractor, and the seismic wavefield perturbation is built by superposition of the wavefields scattered by each point diffractor 共McMechan and Fuis, 1987兲.
The approximate Hessian is formed by the zero-lag correlation
between the partial-derivative wavefields, e.g., equation 10. The diagonal terms of the approximate Hessian contain the zero-lag autocorrelation and therefore represent the square of the amplitude of the
partial-derivative wavefield. Scaling the gradient by these diagonal
terms removes from the gradient the geometric amplitude of the partial-derivative wavefields and the residuals. In the framework of surface seismic experiments, the effects of the scaling performed by the
diagonal Hessian provide a good balance between shallow and deep
perturbations. A diagonal Hessian is shown in Ravaut et al. 共2004,
their Figure 12兲. The off-diagonal terms of the Hessian are computed
by correlation between partial-derivative wavefields associated with
different model parameters. For 1D media, the approximate Hessian
is a band-diagonal matrix, and the numerical bandwidth decreases as
the frequency increases. The off-diagonal elements of the approximate Hessian account for the limited-bandwidth effects that result
from the experimental setup. Applying its inverse to the gradient can
be interpreted as a deconvolution of the gradient from these limitedbandwidth effects.
An illustration of the scaling and deconvolution effects performed
by the diagonal Hessian on one hand and the approximate Hessian
on the other hand is provided in Figure 1. A single inclusion in a homogeneous background model 共Figure 1a兲 is reconstructed by one
iteration of FWI using a gradient method preconditioned by the diagonal terms of the approximate Hessian 共Figure 1b兲 and by a GaussNewton method 共Figure 1c兲. The image of the inclusion is sharper
when the Gauss-Newton algorithm is used. The corresponding approximate Hessian and its diagonal elements are shown in Figure 2.
An interpretation of the second term of the Hessian 共equation 10兲 is
given in Pratt et al. 共1998兲. This term accounts for multiscattering
events such as multiples in the reconstruction procedure. Through iterations, we might correct effects caused by this missing term as
long as convergence is achieved.
Although equation 21 gives some clear insight into the physical
sense of the gradient of the misfit function, it is impractical from a
computer-implementation point of view; with the computer explicitly forming the sensitivity matrix with equation 21, it would require
performing as many forward problems as the number of model parameters mᐉ共ᐉ ⳱ 1,M兲 for each source of the survey. To mitigate this
computational burden, the spatial reciprocity of Green’s functions
can be exploited as shown below.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC134
Virieux and Operto
Inserting the expression of the partial derivative of the wavefield
共equation 21兲 in the expression of the gradient of equation 9 gives the
following expression of the gradient:
冋冋 册
册
B t ⳮ1t
B 共P⌬d兲* ,
ⵜCᐉ ⳱ R ut
mᐉ
共23兲
where P denotes an operator that augments the residual data vector
with zeroes in the full computational domain so that the dimension
t
of the augmented vector matches the dimension of the matrix Bⳮ1
共Pratt et al., 1998兲. The column of Bⳮ1 corresponds to the Green’s
functions for unit impulse sources located at each node of the model.
By virtue of the spatial reciprocity of the Green’s functions, Bⳮ1 is
Depth (km)
a)
Distance (km)
0
0
1
2
3
t
symmetric. Therefore, Bⳮ1 can be substituted by Bⳮ1 in equation 23,
which gives
冋冋 册
ⵜCᐉ ⳱ R ut
共24兲
The wavefield rb corresponds to the back-propagated residual wavefield. All of the residuals associated with one seismic source are assembled to form one residual source. The back propagation in time is
indicated by the conjugate operator in the frequency domain. The
number of forward seismic problems for computing the gradient is
reduced to two: one to compute the incident wavefield u and one to
back propagate the corresponding residuals. The underlying imaging principle is reverse-time migration, which relies on the correspondence of the arrival times of the incident wavefield and the
0
2
1
2
3
d)
0
Depth (km)
4.0
VP (km/s)
4.1
4.2
4.3
Row number
Distance (km)
0
0
200
400
600
800
400
600
1
1
800
2
2
b)
Distance (km)
c)
0
1
2
3
e)
0
0
1
1
2
2
0
5
10
Distance (km)
15
20
25
30
0
3
4.0
VP (km/s)
4.1
4.2
4.3
5
10
Depth (km)
3
Depth (km)
0
200
3
b)
Column number
a)
1
册 冋冋 册 册
B t ⳮ1
B t
B 共P⌬d*兲 ⳱ R ut
rb .
mᐉ
mᐉ
15
20
3
3
Figure 1. Reconstruction of an inclusion by frequency-domain FWI.
共a兲 True model and FWI models built 共b兲 by a preconditioned gradient method and 共c兲 by a Gauss-Newton method. Four frequencies 共4,
5, 7, and 10 Hz兲 were inverted. One iteration per frequency was computed. Fourteen shots were deployed along the top and left edges of
the model. Shots along the top edge were recorded by 14 receivers
along the bottom edge; shots along the left edges were recorded by
14 receivers along the right edge. The P-wave velocities in the background and in the inclusion are 4.0 and 4.2 km/s, respectively. Vertical velocity profiles are extracted from the true model 共gray line兲 and
the FWI models 共black line兲 for 共d兲 the gradient and 共e兲 the GaussNewton inversions.
25
30
Figure 2. Hessian operator. 共a兲 Approximate Hessian corresponding
to the 31 ⫻ 31 model of Figure 1 for a frequency of 4 Hz. A close-up
of the area delineated by the yellow square highlights the band-diagonal structure of the Hessian. 共b兲 Corresponding diagonal terms of
the Hessian plotted in the distance-depth domain. The high-amplitude coefficients indicate source and receiver positions. Scaling the
gradient by this map removes the geometric amplitude effects from
the wavefields.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
back-propagated wavefield at the position of heterogeneity 共Claerbout, 1971; Lailly, 1983; Tarantola, 1984兲.
The approach that consists of computing the gradient of the misfit
function without explicitly building the sensitivity matrix is often referred to as the adjoint-wavefield approach by the geophysical community. The underlying mathematical theory is the adjoint-state
method of the optimization theory 共Lions, 1972; Chavent, 1974兲. Interesting links exist between optimization techniques used in FWI
and assimilation methods, widely used in fluid mechanics 共Talagrand and Courtier, 1987兲. A detailed review of the adjoint-state
method with illustrations from several seismic problems is given in
Tromp et al. 共2005兲, Askan 共2006兲, Plessix 共2006兲, and Epanomeritakis et al. 共2008兲. The expression of the gradient of the frequencydomain FWI misfit function 共equation 24兲 is derived from the adjoint-state method and the method of the Lagrange multiplier in Appendix A.
For multiple sources and multiple frequencies, the gradient is
formed by the summation over these sources and frequencies:
N Ns
兺R
i⳱1 s⳱1
ⵜCᐉ ⳱ 兺
冋
t
关Bⳮ1
i s s兴
冋 册
册
Bi t ⳮ1
* 兲兴 .
关Bi 共P⌬di,s
mᐉ
WCC135
mographic problem. Indeed, the superiority of one approach over the
other is highly dependent on the acquisition geometry 共the relative
number of sources and receivers兲 and the number of model parameters.
The formalism in equation 25 has been kept as general as possible
and can relate to the acoustic or the elastic wave equation. In the
acoustic case, the wavefield is the pressure scalar wavefield; in the
elastic case, the wavefield ideally is formed by the components of the
particle velocity and the pressure if the sensors have four components. Equation 25 can be translated in the time domain using Parseval’s relation. The expression of the gradient in equation 25 can be
developed equivalently using a functional analysis 共Tarantola,
1984兲. The partial derivatives of the wavefield with respect to the
model parameters are provided by the kernel of the Born integral that
relates the model perturbations to the wavefield perturbations. Multiplying the transpose of the resulting operator by the conjugate of
the data residuals provides the expression of the gradient. The two
formalisms 共matrix and functional兲 give the same expression, provided the discretization of the partial differential operators are performed consistently in the two approaches. The derivation in the frequency domain of the gradient of the misfit function using the two
formalisms is explicitly illustrated by Gelis et al. 共2007兲.
共25兲
We also need to note that matrices Bⳮ1
i 共i ⳱ 1,N 兲 do not depend on
shots; therefore, any speedup toward resolving systems that involve
these matrices with multiple sources should be considered 共Marfurt,
1984; Jo et al., 1996; Stekl and Pratt, 1998兲.
By comparing the expressions of the gradient in equations 9 and
24, we can conclude that one element of the sensitivity matrix is given by
冋 册
Bt ⳮ1
B ␦ r,
Jk共s,r兲,ᐉ ⳱ ust
mᐉ
Source estimation
Source excitation is generally unknown and must be considered
as an unknown of the problem 共Pratt, 1999兲. The source wavelet can
be estimated by solving a linear inverse problem because the relationship between the seismic wavefield and the source is linear
共equation 2兲. The solution for the source is given by the expression
s⳱
共26兲
where k共s,r兲 denotes a source-receiver couple of the acquisition geometry, with s and r denoting a shot and a receiver position, respectively. An impulse source ␦ r is located at receiver position r. If the
sensitivity matrix must be built, one forward problem for the incident wavefield and one forward problem per receiver position must
be computed. Therefore, the number of simulations to build the sensitivity matrix can be higher than that required by gradient estimation if the number of nonredundant receiver positions significantly
exceeds the number of nonredundant shots, or vice versa. Computing each term of the sensitivity matrix is also required to compute the
diagonal terms of the approximate Hessian Ha 共Shin et al., 2001b兲.
To mitigate the resulting computational burden for coarse OBS surveys, Operto et al. 共2006兲 suggest computing the diagonal terms of
Ha for a decimated shot acquisition. Alternatively, Shin et al.
共2001a兲 propose using an approximation of the diagonal Hessian,
which can be computed at the same cost as the gradient.
Although the matrix-free adjoint approach is widely used in exploration seismology, the earthquake-seismology community tends
to favor the scattering-integral method, which is based on the explicit building of the sensitivity matrix 共Chen et al., 2007兲. The linear
system relating the model perturbation to the data perturbation is
formed and solved with a conjugate-gradient algorithm such as
LSQR 共Paige and Saunders, 1982a兲. A comparative complexity
analysis of the adjoint approach and the scattering-integral approach
is presented in Chen et al. 共2007兲, who conclude that the scatteringintegral approach outperforms the adjoint approach for a regional to-
具gcal共m0兲兩dobs典
,
具gcal共m0兲兩gcal共m0兲典
共27兲
where gcal共m0兲 denotes the Green’s functions computed in the starting model m0. The source function can be estimated directly in the
FWI algorithm once the incident wavefields have been modeled.
The source and the medium are updated alternatively over iterations
of the FWI. Note that it is possible to take advantage of source estimation to design alternative misfit functions based on the differential
semblance optimization 共Pratt and Symes, 2002兲 or to define more
heuristic criteria to stop the iteration of the inversion 共Jaiswal et al.,
2009兲.
Alternatively, new misfit functions have been designed so the inversion becomes independent of the source function 共Lee and Kim,
2003; Zhou and Greenhalgh, 2003兲. The governing idea of the method is to normalize each seismogram of a shot gather by the sum of all
of the seismograms. This removes the dependency of the normalized
data with respect to the source and modifies the misfit function. The
drawback is that this approach requires an explicit estimate of the
sensitivity matrix; the normalized residuals cannot be back propagated because they do not satisfy the wave equation.
SOME KEY FEATURES OF FWI
Resolution power of FWI and relationship to the
experimental setup
The interpretation of the partial-derivative wavefield as the wavefield scattered by the missing heterogeneities provides some connections between FWI and generalized diffraction tomography 共Devaney and Zhang, 1991; Gelius et al., 1991兲. Diffraction tomography
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC136
Virieux and Operto
recasts the imaging as an inverse Fourier transform 共Devaney, 1982;
Wu and Toksoz, 1987; Sirgue and Pratt, 2004; Lecomte et al., 2005兲.
Let us consider a homogeneous background model of velocity c0, an
incident monochromatic plane wave propagating in the direction ŝ
and a scattered monochromatic plane wave in the far-field approximation propagating in the direction r̂ 共Figure 3兲. If amplitude effects
are not taken into account, the incident and scattered Green’s functions can be written compactly as
G0共x,s兲 ⳱ exp共ik0ŝ . x兲,
G0共x,r兲 ⳱ exp共ik0r̂ . x兲,
共28兲
with the relation k0 ⳱ / c0. Inserting the expression of the incident
and scattered plane waves into the gradient of the misfit function of
equation 24 gives the expression 共Sirgue and Pratt, 2004兲
ⵜC共m兲 ⳱ ⳮ 2 兺 兺 兺 R兵exp共ⳮ ik0共ŝ Ⳮ r̂兲 . x兲⌬d其.
s
r
共29兲
Equation 29 has the form of a truncated Fourier series where the
integration variable is the scattering wavenumber vector given by k
⳱ k0共ŝ Ⳮ r̂兲. The coefficients of the series are the data residuals. The
summation is performed over sources, receivers, and frequencies
that control the truncation and sampling of the Fourier series.
We can express the scattering wavenumber vector k0共ŝ Ⳮ r̂兲 as a
function of frequency, diffraction angle, or aperture to highlight the
relationship between the experimental setup and the spatial resolution of the reconstruction 共Figure 4兲:
k⳱
冉冊
2f
cos
n,
c0
2
共30兲
where n is a unit vector in the direction of the slowness vector 共ŝ
Ⳮ r̂兲. Equation 30 was also derived in the framework of the ray ⫹
Born migration/inversion, recast as the inverse of a generalized Radon transform 共Miller et al., 1987兲 or as a least-squares inverse problem 共Lambaré et al., 2003兲.
Several key conclusions can be derived from equation 30. First,
one frequency and one aperture in the data space map one wavenumber in the model space. Therefore, frequency and aperture have redundant control of the wavenumber coverage. This redundancy increases with aperture bandwidth. Pratt and Worthington 共1990兲, Sirgue and Pratt 共2004兲, and Brenders and Pratt 共2007a兲 propose decimating this wavenumber-coverage redundancy in frequency-domain FWI by limiting the inversion to a few discrete frequencies.
This data reduction leads to computationally efficient frequency-do-
a)
Depth (km)
0
0
Distance (km)
1
2
3
4
1
2
b)
0
0
Distance (km)
1
2
3
4
2
0
1
1
s^
c)
^r
0
3
3
4
4
4
S
R
λ = c/f
θ
Distance (km)
1
2
3
4
s^
x
^r
2
3
main FWI and allows managing a compact volume of data, two clear
advantages with respect to time-domain FWI. The guideline for selecting the frequencies to be used in the FWI is that the maximum
wavenumber imaged by a frequency matches the minimum vertical
wavenumber imaged by the next frequency 共Sirgue and Pratt, 2004,
their Figure 3兲. According to this guideline, the frequency interval
increases with the frequency.
Second, the low frequencies of the data and the wide apertures
help resolve the intermediate and large wavelengths of the medium.
At the other end of the spectrum, the maximum wavenumber, constrained by ⫽ 0 and the highest frequency, leads to a maximum resolution of half a wavelength if normal-incidence reflections are recorded. Third, for surface acquisitions, long offsets are helpful for
sampling the small horizontal wavenumbers of dipping structures
such as flanks of salt domes.
A frequency-domain sensitivity kernel for point sources, referred
to as the wavepath by Woodward 共1992兲, is shown in Figure 5. The
interference picture shows zones of equiphase over which the residuals are back projected during FWI. The central zone of elliptical
shape is the first Fresnel zone of width 冑osr, where osr is the sourcereceiver offset. Residuals that match the first arrival with an error
lower than half a period will be back projected constructively over
the first Fresnel zone, updating the large wavelengths of the structure. The outer fringes are isochrones on which residuals associated
with later-arriving reflection phases will be back projected, providing an update of the shorter wavelengths of the medium, just like
PSDM 共Lecomte, 2008兲. The width of the isochrones, which gives
some insight into the vertical resolution in Figure 4, is given by the
modulus of the wavenumber of equation 30.
To illustrate the relationship between FWI resolution and the experimental setup, we show the FWI reconstruction of an inclusion in
a homogeneous background for three acquisition geometries 共Figure
6兲. In the crosshole experiment 共Figure 6a兲, FWI has reconstructed a
low-pass-filtered 共smoothed兲 version of the vertical section of the inclusion and a band-pass-filtered version of the horizontal section of
the inclusion. This anisotropy of the imaging results from the transmission-like reconstruction of the vertical wavenumbers and the reflection-like reconstruction of the horizontal wavenumbers of the inclusion. In the case of the double crosshole experiment 共Figure 6b兲,
the vertical and horizontal wavenumber spectra of the inclusion have
k
Figure 3. Resolution analysis of FWI. 共a兲 Incident monochromatic
plane wave 共real part兲. 共b兲 Scattered monochromatic plane wave
共real part兲. 共c兲 Gradient of FWI describing one wavenumber component 共real part兲 built from the plane waves shown in 共a兲 and 共b兲.
ps
k=fq
q = ps + pr
ps = pr = 1/c
pr
q
Figure 4. Illustration of the main parameters in diffraction tomography and their relationships. Key: , wavelength; , diffraction or aperture angle; c, P-wave velocity; f, frequency; pS, pR, q, slowness
vectors; k, wavenumber vector; x, diffractor point; S and R, source
and receiver positions.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
⌬t
TL
⬍
1
,
N
10
20
30
50
40
60
FWI can be implemented in the time domain or in the frequency
domain. FWI was originally developed in the time domain 共Taran-
a)
Distance (km)
4.1
4.0
Depth (km)
VP (km/s) 0
0
80
90
3
1
2
3
1
2
3
4
5
6
7
8
6
7
8
6
7
8
2
3
b)
Distance (km)
VP (km/s) 0
0
4.2
1
4
5
2
3
4.0
4
c)
Distance (km)
4
5
100
0
1
5
4.0
10
15
20
Depth (km)
Depth (km)
2
1
VP (km/s) 0
0
70
1
4
4.1
Distance (km)
0
Multiscale FWI: Time domain versus frequency domain
共31兲
where TL denotes the duration of the simulation. Condition 31 shows
that the traveltime error must be less than 1% for an offset involving
50 propagated wavelengths, a condition unlikely to be satisfied if
FWI is applied without data preconditioning. Therefore, some studies consider that recording low frequencies 共⬍1 Hz兲 is the best strategy to design well-posed FWI 共Sirgue, 2006兲. Unfortunately, such
a)
low frequencies cannot be recorded during controlled-source experiments. As an alternative to low frequencies, multiscale layer-stripping approaches where longer offsets, shorter apertures, and longer
recording times are progressively introduced in the inversion, have
been designed to mitigate the nonlinearity of the inversion.
Depth (km)
been partly filled because of the combined use of transmission and
reflection wavepaths. Of note, the vertical section shows a lack of
low wavenumbers, whereas the horizontal section exhibits a deficit
of low wavenumbers because the maximum horizontal source-receiver offset is two times higher than the vertical one 共Figure 6b兲.
Therefore, the aperture illuminations of the horizontal and vertical
wavenumbers differ. For the surface acquisition 共Figure 6c兲, the vertical section exhibits a strong deficit of low wavenumbers because of
the lack of large-aperture illumination. Of note, the pick-to-pick amplitude of the perturbation is fully recovered in Figure 6c. The horizontal section of the inclusion is poorly recovered because of the
poor illumination of the horizontal wavenumbers from the surface.
The ability of the wide apertures to resolve the large wavelengths
of the medium has prompted some studies to consider long-offset acquisitions as a promising approach to design well-posed FWI problems 共Pratt et al., 1996; Ravaut et al., 2004兲. For example, equation
30 can suggest that the long wavelengths of the medium can be resolved whatever the source bandwidth, provided that wide-aperture
data are recorded by the acquisition geometry. However, all of the
conclusions derived so far rely on the Born approximation. The Born
approximation requires that the starting model allows matching the
observed traveltimes with an error less than half the period 共Beydoun and Tarantola, 1988兲. If not, the so-called cycle-skipping artifacts will lead to convergence toward a local minimum 共Figure 7兲.
Pratt et al. 共2008兲 translates this condition in terms of relative time
error ⌬t / TL as a function of the number of propagated wavelengths
N, expressed as
WCC137
2
3
3.9
4
b)
0
10
20
30
50
40
60
70
80
90
100
Depth (km)
0
5
10
X
X
s
r
15
20
X
θ
Figure 5. Wavepath. 共a兲 Monochromatic Green’s function for a point
source. 共b兲 Wavepath for a receiver located at a horizontal distance
of 70 km from the source. The frequency is 5 Hz and the velocity in
the homogeneous background is 6 km/s. The dashed red lines delineate the first Fresnel zone and an isochrone surface. The yellow line
is a vertical section across the wavepath. The blue lines represent diffraction paths within the first Fresnel zone and from the isochrone.
Figure 6. Imaging an inclusion by FWI. 共a兲 Crosshole experiment.
Source and receiver lines are in red and blue, respectively. The contour of the inclusion with a diameter of 400 m is delineated by the
blue circle. The true velocity in the inclusion is 4.2 km/s, whereas the
velocity in the background is 4 km/s. Six frequencies 共4, 7, 9, 11, 12,
and 15 Hz兲 were inverted successively, and 20 iterations per frequency were computed. The black and gray curves along the right and
bottom sides of the model are velocity profiles across the center of
the inclusion extracted from the exact model and the reconstructed
model, respectively. 共b兲 Same as 共a兲 for a vertical and horizontal
crosshole experiment 共the shots along the red dashed line are recorded by only the receivers along the vertical blue dashed line兲. 共c兲
Same as 共a兲 for a surface experiment.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC138
Virieux and Operto
tola, 1984; Gauthier et al., 1986; Mora, 1987; Crase et al., 1990兲
whereas the frequency-domain approach was proposed mainly in
the 1990s by Pratt and collaborators 共Pratt, 1990; Pratt and Worthington, 1990; Pratt and Goulty, 1991兲, first with application to
crosshole data and later with application to wide-aperture surface
seismic data 共Pratt et al., 1996兲.
The nonlinearity of FWI has prompted many studies to develop
some hierarchical multiscale strategies to mitigate this nonlinearity.
Apart from computational efficiency, the flexibility offered by the
time domain or the frequency domain to implement efficient multiscale strategies is one of the main criteria that favors one domain
rather than the other. The multiscale strategy successively processes
data subsets of increasing resolution power to incorporate smaller
wavenumbers in the tomographic models. In the time domain,
Bunks et al. 共1995兲 propose successive inversion of subdata sets of
increasing high-frequency content because low frequencies are less
sensitive to cycle-skipping artifacts. The frequency domain provides
a more natural framework for this multiscale approach by performing successive inversions of increasing frequencies. In the frequency
domain, single or multiple frequencies 共i.e., frequency group兲 can be
inverted at a time.
Although a few discrete frequencies theoretically are sufficient to
fill the wavenumber spectrum for wide-aperture acquisitions, simultaneous inversion of multiple frequencies improves the signal-tonoise ratio and the robustness of FWI when complex wave phenomena are observed 共i.e., guide waves, surface waves, dispersive
waves兲. Therefore, a trade-off between computational efficiency and
quality of imaging must be found. When simultaneous multifrequency inversion is performed, the bandwidth of the frequency
group ideally must be as large as possible to mitigate the nonlinearity
of FWI in terms of the nonunicity of the solution, whereas the maximum frequency of the group must be sufficiently low to prevent cycle-skipping artifacts. An illustration of this tuning of FWI is given
in Brossier et al. 共2009a兲 in the framework of elastic seismic imaging
of complex onshore models from the joint inversion of surface
waves and body waves.
T/2
n–1
T/2
n
n+1
n
n–1
n–1
n
n+1
n+1
Time (s)
Figure 7. Schematic of cycle-skipping artifacts in FWI. The solid
black line represents a monochromatic seismogram of period T as a
function of time. The upper dashed line represents the modeled
monochromatic seismograms with a time delay greater than T / 2. In
this case, FWI will update the model such that the n Ⳮ 1th cycle of
the modeled seismograms will match the nth cycle of the observed
seismogram, leading to an erroneous model. In the bottom example,
FWI will update the model such that the modeled and recorded nth
cycle are in-phase because the time delay is less than T / 2.
The regularization effects introduced by hierarchical inversion of
increasing frequencies might not be sufficient to provide reliable
FWI results for realistic frequencies and realistic starting models in
the case of complex structures. This has prompted some studies to
design additional regularization levels in FWI. One of these is to select a subset of arrivals as a function of time. An aim of this time windowing is to remove arrivals that are not predicted by the physics of
the wave equation implemented in FWI 共for example, PS-converted
waves in the frame of acoustic FWI兲. A second aim is to perform a
heuristic selection of aperture angles in the data. Considering a narrow time window centered on the first arrival leads to so-called early-arrival waveform tomography 共Sheng et al., 2006兲. Time windowing the data around the first arrivals is equivalent to selecting the
large-aperture components of the data to image the large and intermediate wavelengths of the medium. Alternatively, time windowing
can be applied to isolate later-arriving reflections or PS-converted
phases to focus on imaging a specific reflector or a specific parameter class, such as the S-wave velocity 共Shipp and Singh, 2002; Sears
et al., 2008; Brossier et al., 2009a兲.
The frequency domain is the most appropriate to select one or a
few frequencies for FWI, but the time domain is the most appropriate
to select one type of arrival for FWI. Indeed, time windowing cannot
be applied in frequency-domain modeling, in which only one or few
frequencies are modeled at a time. A last resort is the use of complexvalued frequencies, which is equivalent to the exponential damping
of a signal p共t兲 in time from an arbitrary traveltime t0 共Sirgue, 2003;
Brenders and Pratt, 2007b兲:
冉 冊
i
P Ⳮ e t0/ ⳱
Ⳮ⬁
冕
p共t兲eⳮ共tⳮt0兲/ ei tdt,
共32兲
ⳮ⬁
where P共 兲 denotes the Fourier transform of p共t兲 and is the damping factor.
A last regularization level can be implemented by layer stripping,
in which the imaging proceeds hierarchically from the shallow part
to the deep part. Layer stripping in FWI can be applied by combined
offset and temporal windowing 共Shipp and Singh, 2002; Wang and
Rao, 2009兲.
These three levels of regularization — frequency dependent, time
dependent, and offset dependent — can be combined in one integrated multiloop FWI workflow. An example is provided in Shin and Ha
共2009兲 and Brossier et al. 共2009a兲, in which the frequency- and timedependent regularizations are implemented into two nested loops
over frequency groups and time-damping factors. In this approach,
the frequencies increase in the outer loop and the damping factors
decrease in the inner loop. In Figure 8, the VP and VS models of the
overthrust model are inferred from the successive inversion of two
groups of five frequencies 共Brossier et al., 2009a兲. The frequencies
of the first group range from 1.7 to 3.5 Hz, whereas those of the second group range from 3.5 to 7.2 Hz. Five damping factors of between 0.67 and 30.0 s were applied hierarchically for data preconditioning during the inversion of each frequency group. Without these
two regularization levels associated with frequency and aperture selections, FWI fails to converge toward acceptable models.
In summary, the implementation of FWI in the frequency domain
allows the easy implementation of multiscale FWI based on the hierarchical inversion of groups of frequencies of arbitrary bandwidth
and sampling intervals. Time-domain modeling provides the most
flexible framework to apply time windowing of arbitrary geometry.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
FWI algorithms must be implemented in parallel to address largescale 3D problems. Depending on the numerical technique for solving the forward problem, different parallel strategies can be considered for FWI. If the forward problem is based on numerical methods
such as time-domain modeling or iterative solvers, which are not demanding in terms of memory, a coarse-grained parallelism that consists of distributing sources over processors is generally used and the
forward problem is performed sequentially on each processor for
each source 共Plessix, 2009兲. If the number of processors significantly exceeds the number of shots, which can be the case if source encoding techniques are used 共Krebs et al., 2009兲, a second level of
parallelism can be viewed by domain decomposition of the physical
computational domain. A comprehensive review of different algorithms to efficiently compute the forward and the adjoint wavefields
in time-domain FWI is presented by Akcelik 共2002兲.
In contract, if the forward problem is based on a method that embeds a memory-expensive preprocessing step, such as LU factorization in the frequency-domain direct-solver approach, parallelism
must be based on a domain decomposition of the computational domain. Each processor computes a subdomain of the wavefields for
all sources. Examples of such algorithms are described in Ben Hadj
Ali et al. 共2008a兲, Brossier et al. 共2009a兲, and Sourbier et al. 共2009a,
2009b兲. A contrast source inversion 共CSI兲 method is described by
Abubakar et al. 共2009兲, which allows a decrease in the number of LU
factorization in frequency-domain FWI at the expense of the number
of iterations.
The choice of the linearization method
The sensitivity matrix is generally computed with the Born approximation, which assumes a linear tangent relationship between
the model and wavefield perturbations 共Woodward, 1992兲. This linear relationship between the perturbations can be inferred from the
assumption that the wavefield computed in the updated model is the
wavefield computed in the starting model plus the perturbation
wavefield.
a)
0
Depth (km)
On the parallel implementation of FWI
L2-norms with different transitions between the norms. Crase et al.
共1990兲 conclude that the most reliable FWI results have been obtained with the Cauchy and the sech criteria.
The L2 and Cauchy criteria are also compared by Amundsen
共1991兲 in the framework of frequency-wavenumber-domain FWI
for stratified media described by velocity, density, and layer thicknesses 共Amundsen and Ursin, 1991兲. They consider random noise
and weather noise and conclude in both cases that the Cauchy criterion leads to the more robust results.
The Huber norm also combines the L1- and the L2-norms; it is
combined with quasi-Newton L-BFGS by Guitton and Symes
共2003兲 and Bube and Nemeth 共2007兲. The Huber norm is also used in
the framework of frequency-domain FWI by Ha et al. 共2009兲 and
shows an overall more robust behavior than the L2-norm.
0
4
Distance (km)
8
12
16
20 VP (km/s)
0
4
8
16
20 VS (km/s)
1
2
3
4
0
Depth (km)
This makes frequency-domain FWI based on time-domain modeling an attractive strategy to design robust FWI algorithms. This is especially true for 3D problems, for which time-domain modeling has
several advantages with respect to frequency-domain modeling 共Sirgue et al., 2008兲.
WCC139
12
1
2
3
4
c)
0
2
2
3
0
2
8
2
3
4
6
8
10
x = 7.5 km
Offset (km)
4 8 12 16
0
Offset (km)
4 8 12 16
4
6
8
12
14
14
0
2
0
10
12
x = 7.5 km
VS (km/s)
2
3
1
4
4
6
10
4
0
Offset (km)
4 8 12 16
Depth (km)
Depth (km)
1
0
Offset (km)
0 4 8 12 16
0
2
Depth (km)
The least-squares norm approach assumes a Gaussian distribution
of the misfit 共Tarantola, 1987兲. Poor results can be obtained when
this assumption is violated, for example, when large-amplitude outliers affect the data. Therefore, careful quality control of the data
must be carried out before least-squares inversion. Crase et al.
共1990兲 investigate several norms such as the least-squares norm L2,
the least-absolute-values norm L1, the Cauchy criterion norm, and
the hyperbolic secant 共sech兲 criterion in FWI 共Figure 9兲. The
L1-norm specifically ignores the amplitude of the residuals during
back propagation of the residuals when gradient building, making
this criterion less sensitive to large errors in the data. The Cauchy and
sech criteria can be considered a combination of the L1- and the
VP (km/s)
3
4
6
Depth (km)
The choice of the minimization criterion
0
Depth (km)
Although the most popular approach of FWI is based on minimizing the least-squares norm of the data misfit on the one hand and on
the Born approximation for estimating partial-derivative wavefields
on the other, several variants of FWI have been proposed over the
last decade. These variants relate to the definition of the minimization criteria, the representation of the data 共amplitude, phase, logarithm of the complex-valued data, envelope兲 in the misfit, or the linearization procedure of the inverse problem.
b)
Depth (km)
Variants of classic least-squares and
Born-approximation FWI
4
6
8
10
12
12
14
14
Figure 8. Multiscale strategy for elastic FWI with application to the
overthrust model. 共a兲 FWI velocity models VP 共top兲 and VS 共bottom兲.
共b兲 Comparison between the logs from the true model 共black兲, the
starting model 共dashed gray兲, and the final FWI model 共solid gray兲.
共c兲 Synthetic seismograms computed in the final FWI models for the
horizontal 共left兲 and vertical 共right兲 components of particle velocity.
The bottom panels are the final residuals between seismograms
computed in the true and in the final FWI models 共image courtesy R.
Brossier兲.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC140
Virieux and Operto
The Rytov approach considers the generalized phase as the wavefield 共Woodward, 1992兲. The Rytov approximation provides a linear
relationship between the complex-phase perturbation and the model
perturbation by assuming that the wavefield computed in the updated model is related to the wavefield computed in the starting model
through u共x, 兲 ⳱ u0共x, 兲exp共⌬ 共x, 兲兲, where ∆ 共x,兲 denotes
the complex-phase perturbation. The sensitivity of the Rytov kernel
is zero on the Fermat raypath because the traveltime is stationary
along this path. A linear relationship between the model perturbations and the logarithm of the amplitude ratio Ln关A共 兲 / A0共 兲兴 is
also provided by the Rytov approximation by taking the real part of
the sensitivity kernel of the Rytov integral instead of the imaginary
part that provides the phase perturbation.
The Born approximation is valid in the case of weak and small
perturbations, but the Rytov approximation is supposed to be valid
for large-aperture angles and a small amount of scattering per wavelength, i.e., smooth perturbations or smooth variation in the phaseperturbation gradient 共Beydoun and Tarantola, 1988兲. Although several analyses of the Rytov approximation have been carried out, it remains unclear to what extent its domain of validity differs significantly from that of the Born approximation. A comparison between
the Born approximation and the Rytov approximation in the framework of elastic frequency-domain FWI is presented in Gelis et al.
共2007兲. The main advantage of the Rytov approximation might be to
provide a natural separation between phase and amplitude 共e.g.,
Woodward, 1992兲. This separation allows the implementation of
phase and amplitude inversions 共Bednar et al., 2007; Pyun et al.,
a)
8
L2 functional
L1 functional
Huber norm functional
Hybrid norm functional
7
Cost function
6
5
4
3
2
1
0
–4
Residual source amplitude
b)
–3
4
–2
–1
0
1
2
3
4
1
2
3
4
L2 residual source
L1 residual source
Huber norm functional
Hybrid norm functional
3
2
1
0
–1
–2
–3
–4
–4
–3
–2
–1
0
Data residual
Figure 9. Different functionals for FWI. 共a兲 Value of four functionals
as a function of real-valued data residual. The L1, L2, Huber, and
mixed L1–L2 functionals are plotted as indicated. 共b兲 Amplitude of
the residual source used to compute the back-propagated adjoint
wavefield for the different functionals shown in 共a兲. Note that the
back-propagated source in the L1-norm is not sensitive to the residual amplitude and therefore is less sensitive to large-amplitude residuals than the L2-norm 共image courtesy R. Brossier兲.
2007兲 from a frequency-domain FWI code using a logarithmic norm
共Shin and Min, 2006; Shin et al., 2007兲, the use of which leads to the
Rytov approximation in the framework of local optimization
problems.
Another approach in the time-frequency domain is developed by
Fichtner et al. 共2008兲 for continental- and global-scale FWI, in
which the misfit of the phase and the misfit of the envelopes are minimized in a least-squares sense. The expected benefit from this approach is to mitigate the nonlinearity of FWI by separating the phase
and amplitude in the inversion and by inverting the envelope instead
of the amplitudes, the former being more linearly related to the data.
Building starting models for FWI
The ultimate goal in seismic imaging is to be able to apply FWI reliably from scratch, i.e., without the need for sophisticated a priori
information. Unfortunately, because multidimensional FWI at
present can only be attacked through local optimization approaches,
building an accurate starting model for FWI remains one of the most
topical issues because very low frequencies 共⬍1 Hz兲 still cannot be
recorded in the framework of controlled-source experiments.
A starting model for FWI can be built by reflection tomography
and migration-based velocity analysis such as those used in oil and
gas exploration. A review of the tomographic workflow is given in
Woodward et al. 共2008兲. Other possible approaches for building accurate starting models, which should tend toward a more automatic
procedure and might be more closely related to FWI, are first-arrival
traveltime tomography 共FATT兲, stereotomography, and Laplace-domain inversion.
FATT performs nonlinear inversions of first-arrival traveltimes to
produce smooth models of the subsurface 共e.g., Nolet, 1987; Hole,
1992; Zelt and Barton, 1998兲. Traveltime residuals are back projected along the raypaths to compute the sensitivity matrix. The tomographic system, augmented with smoothing regularization, generally
is solved with a conjugate-gradient algorithm such as LSQR 共Paige
and Saunders, 1982b兲. Alternatively, the adjoint-state method can be
applied to FATT, which avoids the explicit building of the sensitivity
matrix, just as in FWI 共Taillandier et al., 2009兲. The spatial resolution of FATT is estimated to be the width of the first Fresnel zone
共Williamson, 1991; Figure 5兲.
Examples of applications of FWI to real data using a starting model built by FATT are shown, for example, in Ravaut et al. 共2004兲, Operto et al. 共2006兲, Jaiswal et al. 共2008, 2009兲, and Malinowsky and
Operto 共2008兲 for surface acquisitions; in Dessa and Pascal 共2003兲 in
the framework of ultrasonic experimental data; in Pratt and Goulty
共1991兲 for crosshole data; and in Gao et al. 共2006b兲 for VSP data.
Several blind tests that correspond to surface acquisitions have
been tackled by joint FATT and FWI. Results at the oil-exploration
scale and at the lithospheric scale are presented in Brenders and Pratt
共2007a, 2007b, 2007c兲 and suggest that very low frequencies and
very large offsets are required to obtain reliable FWI results when
the starting model is built by FATT. For example, only the upper part
of the BP benchmark model was imaged successfully by Brenders
and Pratt 共2007c兲 using a starting frequency as small as 0.5 Hz and a
maximum offset of 16 km. Another drawback of FATT is that the
method is not suitable when low-velocity zones exist because these
low-velocity zones create shadow zones.
Reliable picking of first-arrival times is also a difficult issue when
low-velocity zones exist. Fitting first-arrival traveltimes does not
guarantee that computed traveltimes of later-arriving phases, such as
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
rection of the low frequencies by Shin and Cha 共2009兲. An application to real data from the Gulf of Mexico is shown in Shin and Cha
共2009兲. For the real-data application, frequencies between 0 and 2
Hz in combination with nine Laplace damping constants are used for
the Laplace-Fourier-domain inversion, the final model of which is
used as the starting model for standard frequency-domain FWI.
Joint application of Laplace-domain, Laplace-Fourier-domain
and Fourier-domain FWI to the BP benchmark model is illustrated in
Figure 11 共Shin and Cha, 2009兲. The starting model is a simple velocity-gradient model 共Figure 11b兲. A first velocity model of the
large wavelengths is obtained by Laplace-domain inversion 共Figure
11c兲, which is subsequently used as a starting model for inversion in
the Laplace-Fourier-domain inversion, the final model of which is
shown in Figure 11d. During this stage, the starting frequency used
in the inversion of the damped data is as low as 0.01 Hz. The final
model obtained after frequency-domain FWI is shown in Figure 11e.
All of the structures were successfully imaged, beginning with a
very crude starting model.
Distance (km)
Depth (km)
a)
4
5
6
7
8
9
10
11
Distance (km)
c)
0
0
1
1
2
2
3
3
4
4
Depth (km)
b)
4
5
6
7
8
9
10
11
d)
0
0
1
1
2
2
3
3
4
4
4
5
6
7
8
9
10
11
4
5
6
7
8
9
10
11
km/s
1.6 2.0 2.4 2.8 3.2
1
2
3
VP (km/s)
VP (km/s)
VP (km/s)
g)
f)
2.2
1.2
2.2
1.2
2.2
3.2
3.2
3.2
0
0
1
2
3
Depth (km)
1.2
0
Depth (km)
e)
Depth (km)
reflections, will match the true reflection traveltimes with an error
that does not exceed half a period, especially if anisotropy affects the
wavefield. We should stress that FATT can be recast as a phase inversion of the first arrival using a frequency-domain waveform-inversion algorithm within which complex-valued frequencies are implemented 共Min and Shin, 2006; Ellefsen, 2009兲. Compared to FATT
based on the high-frequency approximation, this approach helps account for the finite-frequency effect of the data in the sensitivity kernel of the tomography. Judicious selection of the real and imaginary
parts of the frequency allows extraction of the phase of the first arrival. The principles and some applications of the method are presented
in Min and Shin 共2006兲 and Ellefsen 共2009兲 for near-surface applications. This is strongly related to finite-frequency tomography 共Montelli et al., 2004兲.
Traveltime tomography methods that can manage refraction and
reflection traveltimes should provide more consistent starting models for FWI. Among these methods, stereotomography is probably
one of the most promising approaches because it exploits the slope
of locally coherent events and a reliable semiautomatic picking procedure has been developed 共Lambaré, 2008兲. Applications of stereotomography to synthetic and real data sets are presented in Billette and Lambaré 共1998兲, Alerini et al. 共2002兲, Billette et al. 共2003兲,
Lambaré and Alérini 共2005兲, and Dummong et al. 共2008兲.
To illustrate the sensitivity of FWI to the accuracy of different
starting models, Figure 10 shows FWI reconstructions of the synthetic Valhall model when the starting model is built by FATT and reflection stereotomography 共Prieux et al., 2009兲. In the case of stereotomography, the maximum offset is 9 km and only the reflection
traveltimes are used 共Lambaré and Alérini, 2005兲, whereas the maximum offset is 32 km for FATT 共Prieux et al., 2009兲. Stereotomography successfully reconstructs the large wavelength within the gas
cloud down to a maximum depth of 2.5 km; FATT fails to reconstruct
the large wavelengths of the low-velocity zone associated with the
gas cloud. However, the FWI model inferred from the FATT starting
model shows an accurate reconstruction of the shallow part of the
model. These results suggest that joint inversion of refraction and reflection traveltimes by stereotomography can provide a promising
framework to build starting models for FWI.
A third approach to building a starting model for FWI can be provided by Laplace-domain and Laplace-Fourier-domain inversions
共Shin and Cha, 2008, 2009; Shin and Ha, 2008兲. The Laplace-domain inversion can be viewed as a frequency-domain waveform inversion using complex-valued frequencies 共see equation 32兲, the
real part of which is zero and the imaginary part of which controls the
time damping of the seismic wavefield. In other words, the principle
is the inversion of the DC component of damped seismograms where
the Laplace variable s corresponds to 1 / in equation 32. The DC of
the undamped data is zero, but the DC of the damped data is not and
is exploited in Laplace-domain waveform inversion. The information contained in the data can be similar to the amplitude of the wavefield 共Shin and Cha, 2009兲. Laplace-domain waveform inversion
provides a smooth reconstruction of the velocity model, which can
be used as a starting model for Laplace-Fourier and classical frequency-domain waveform inversions.
The Laplace-Fourier domain is equivalent to performing an inversion of seismograms damped in time. The results shown in Shin and
Cha 共2009兲 suggest that this method can be applied to frequencies
smaller than the minimum frequency of the source bandwidth. The
ability of the method to use frequencies smaller than the frequencies
effectively propagated by the seismic source is called a mirage resur-
WCC141
1
2
3
4
4
4
Figure 10. 共a兲 Close-up of the synthetic Valhall velocity model centered on the gas layer. 共b兲 FWI model built from a starting model obtained by smoothing the true model with a Gaussian filter with horizontal and vertical correlation lengths of 500 m. 共c兲 FWI model from
a starting model built by FATT 共Prieux et al., 2009兲. 共d兲 FWI model
from a starting model built by stereotomography 共Lambaré and
Alérini, 2005兲. 共e兲 Velocity profiles at a distance of 7.5 km extracted
from the true model 共black line兲, from the starting model built by
smoothing the true model 共blue line兲, and from the FWI model of 共b兲
共red line兲. 共f兲 Same as 共e兲 for the starting model built by FATT and 共c兲
the corresponding FWI model. 共g兲 Same as 共e兲 for the starting model
built by stereotomography and 共d兲 the corresponding FWI model.
The frequencies used in the inversion are between 4 and 15 Hz.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC142
Virieux and Operto
CASE STUDIES
Applications of FWI have been applied essentially to synthetic
examples for which high-resolution images have been constructed.
Because FWI is an attractive approach, the number of real-data applications has increased quite rapidly, from monoparameter reconstructions of the VP parameter to multiparameter ones.
Monoparameter acoustic FWI
Most of the recent real-data case studies of FWI at different scales
and for different acquisition geometries have been performed in the
Distance (km)
a)
0
10
20
30
40
50
60
0
4.0
4
3.5
6
3.0
8
2.5
2.0
10
Velocity (km/s)
Depth (km)
4.5
2
1.5
b)
0
10
20
30
40
50
60
0
4.0
4
3.5
6
3.0
8
2.5
2.0
10
Velocity (km/s)
Depth (km)
4.5
2
1.5
c)
0
10
20
30
40
50
60
0
4.0
4
3.5
6
3.0
8
2.5
2.0
10
Velocity (km/s)
Depth (km)
4.5
2
1.5
d)
0
10
20
30
40
50
60
0
4.0
4
3.5
6
3.0
8
2.5
2.0
10
Velocity (km/s)
Depth (km)
4.5
2
1.5
e)
0
10
20
30
40
50
60
0
4.0
4
3.5
6
3.0
8
2.5
10
12
2.0
Velocity (km/s)
Depth (km)
4.5
2
1.5
Figure 11. Laplace-Fourier-domain waveform inversion. 共a兲 BP
benchmark model. 共b兲 Starting model. 共c兲 Velocity model after
Laplace-domain inversion. 共d兲 Velocity model after Laplace-Fourier-domain inversion. 共e兲 Velocity model after frequency-domain
FWI 共image courtesy C. Shin and Y. H. Cha兲.
acoustic isotropic approximation, considering only VP as the model
parameter 共Dessa and Pascal, 2003; Ravaut et al., 2004; Chironi et
al., 2006; Gao et al., 2006a, 2006b; Operto et al., 2006; Bleibinhaus
et al., 2007; Ernst et al., 2007; Jaiswal et al., 2008, 2009; Shin and
Cha, 2009兲. Although the acoustic approximation can be questioned
in the framework of FWI because of unreliable amplitudes, one advantage of acoustic FWI is dealing with less computationally expensive forward modeling than in the elastic case. Moreover, acoustic
FWI is better posed than elastic FWI because only the dominant parameter VP can be involved in the inversion.
Specific waveform-inversion data processing generally is designed to account for the amplitude errors introduced by acoustic
modeling 共Pratt, 1999; Ravaut et al., 2004; Brenders and Pratt,
2007b兲. The amplitude discrepancies in the P-wavefield result from
incorrectly modeling the amplitude-variation-with-offset 共AVO兲 effects and incorrectly modeling the directivity of the source and receiver 共Brenders and Pratt, 2007b兲. Acoustic-wave modeling generally is based on resolving the acoustic-wave equation in pressure;
therefore, the particle-velocity synthetic wavefields might not be
computed 共Hustedt et al., 2004兲. If the receivers are geophones, the
physical measurements collected in the field 共particle velocities兲 are
not the same as those computed by the seismic modeling engine
共pressure兲. A match between the vertical geophone data and the pressure synthetics can, however, be performed by exploiting the reciprocity of the Green’s functions if the sources are explosions 共Operto
et al., 2006兲.
In contrast, if the sources and receivers are directional, the pressure wavefield cannot account for the directivity of the sources and
receivers, and heuristic amplitude corrections must be applied before inversion. Brenders and Pratt 共2007b兲 propose optimizing an
empirical correction law for the decay of the rms amplitudes with
offset. Applying this correction law to the modeled data matches the
main AVO trend of the observed data before FWI. Using this data
preprocessing, Brenders and Pratt 共2007b兲 successfully image the
onshore lithospheric model of the CCSS blind test 共Zelt et al., 2005兲
by acoustic FWI of synthetic elastic data. This strategy is also used
successfully by Jaiswal et al. 共2008, 2009兲 in the framework of
acoustic FWI of real onshore data in the Naga thrust and fold belt in
India.
Successful application of acoustic FWI to synthetic elastic data
computed in the marine Valhall model from an OBC acquisition is
presented by Brossier et al. 共2009b兲. An application of acoustic FWI
to real onshore long-offset data recorded in the southern Apennines
thrust belt is illustrated in Figure 12 共Ravaut et al., 2004兲. The velocity model is validated locally by comparison with a VSP log. Application of PSDM using the final FWI model as a starting model contributes to the validation of the relevance of the velocity structure reconstructed by FWI 共Operto et al., 2004兲. Some guidelines based on
numerical examples of the domain of validity of acoustic FWI applied to elastic data are also provided in Barnes and Charara 共2008兲.
Multiparameter FWI
Because FWI accounts for the full wavefield, the seismic modeling embedded in the FWI algorithm theoretically should honor as far
as possible all of the physics of wave propagation. This is especially
required by FWI of wide-aperture data, in which significant AVO
and azimuthal anisotropic effects should be observed in the data. The
requirement of realistic seismic modeling has prompted some studies to extend monoparameter acoustic FWI to account for parameter
classes other than the P-wave velocity, such as density, attenuation,
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
shear-wave velocity or related parameters, and anisotropy. The fact
that additional parameter classes are taken into account in FWI increases in the ill-posedness of the inverse problem because more degrees of freedom are considered in the parameterization and because
the sensitivity of the inversion can change significantly from one parameter class to the next.
Different parameter classes can be more or less coupled as a function of the aperture angle. This coupling can be assessed by plotting
the radiation pattern of each parameter class using some asymptotic
analyses. Diffraction patterns of the different combination of parameters for the acoustic, elastic, and viscoelastic wave equation are
shown in Wu and Aki 共1985兲, Tarantola 共1986兲, Ribodetti and
Virieux 共1996兲, and Forgues and Lambaré 共1997兲. An alternative is
to plot the sensitivity kernel, i.e., that obtained by summing all of the
rows of the sensitivity matrix for the full acquisition and for different
combinations of parameters and to qualitatively assess which combination provides the best image 共Luo et al., 2009兲. Hierarchical
strategies that successively operate on different parameter classes
should be designed to mitigate the ill-posedness of FWI 共Tarantola,
1986; Kamei and Pratt, 2008; Sears et al., 2008; Brossier et al.,
2009b兲.
Density
Elevation (km)
the same radiation pattern as VP at short apertures but does not scatter
energy at wide apertures because the secondary source fᐉ corresponds to a vertical force for the density. Because VP and have the
same radiation pattern at short apertures, these two parameters are
difficult to reconstruct from short-offset data. For such data, the
P-wave impedance can be considered a reliable parameter for FWI.
If wide-aperture data are available, VP and I P might provide the most
judicious parameterization because they scatter energy for different
aperture bands 共wide and short apertures, respectively; Figure 13b兲.
A successful reconstruction of the density parameter in the case of
the Marmousi case study is presented by Choi et al. 共2008兲. However, the use of an unrealistically low frequency 共0.125 Hz兲 brings into
question the practical implication of these results.
Attenuation
The attenuation reconstruction can be implemented in frequencydomain seismic-wave modeling using complex velocities 共Toksöz
and Johnston, 1981兲. The most commonly applied attenuation/dispersion model is referred to as the Kolsky-Futterman model 共Kolsky, 1956; Futterman, 1962兲. This model has linear frequency dependence of the attenuation coefficient, whereas the deviation from
θ = 0°
a)
Density is difficult to reconstruct 共Forgues and Lambaré, 1997兲.
As an illustration, acoustic radiation patterns are shown in Figure 13
for different combinations of parameters 共IP, 兲, 共IP,VP兲, and 共VP, 兲,
where IP denotes P-wave impedance. The radiation pattern of VP is
isotropic because the operator B / ml reduces to a scalar for VP and
therefore represents an explosion. On the other hand, the density has
a)
WCC143
0
Distance (km)
5
315°
270°
15
10
45°
90°
135°
225°
1
180°
0
I
ρ
P
–1
θ = 0°
b)
–2
315°
45°
Elevation (km)
b)
1
0
90°
270°
–1
–2
Elevation (km)
c)
1
I
–1
P
180°
V
P
θ = 0°
c)
–2
315°
d)
Elevation (km)
135°
225°
0
45°
1
0
–1
270°
90°
–2
2
4
6
VP (km/s)
Figure 12. Two-dimensional seismic imaging of a thrust belt in the
southern Apennines, Italy, from long-offset data by frequency-domain FWI. Thirteen frequencies from 6 to 20 Hz were inverted successively. 共a兲 Starting model for FWI developed by FATT 共Improta
et al., 2002兲. 共b–d兲 FWI model after inversion of 共b兲 the starting
6-Hz, 共c兲 10-Hz, and 共d兲 20-Hz frequencies 共Ravaut et al., 2004兲.
135°
225°
180°
VP
ρ
Figure 13. Radiation pattern of different parameter classes in acoustic FWI. 共a兲 IP- ; 共b兲 IP-VP; 共c兲 VP- .
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC144
Virieux and Operto
constant-phase velocity is accounted for through a term that varies as
the logarithm of the frequency. This model is obtained by imposing
causality on the wave pulse and assuming the absorption coefficient
is strictly proportional to the frequency over a restricted range of frequencies.
Using the Kolsky-Futterman model, the complex velocity c̄ is given by
c̄ ⳱ c
冋冉
1Ⳮ
冊
sgn共 兲
1
兩log共 / r兲兩 Ⳮ i
2Q
Q
册
ⳮ1
,
共33兲
where Q is the attenuation factor and r is a reference frequency.
In frequency-domain FWI, the real and imaginary parts of the velocity can be processed as two independent real-model parameters
without any particular implementation difficulty. However, the reconstruction of attenuation is an ill-posed problem. Ribodetti et al.
共2000兲 show that in the framework of the ray ⫹ Born inversion,
P-wave velocity and Q are uncoupled 共i.e., the Hessian is nonsingular兲 only if a reflector is illuminated from both sides 共with circular
acquisition as in medical imaging; see Figure 5b of Ribodetti et al.,
2000兲. In contrast, the Hessian becomes singular for a surface acquisition. Mulder and Hak 共2009兲 also conclude that VP and Q cannot be
imaged simultaneously from short-aperture data because the Hilbert
transform with respect to depth of the complex-valued perturbation
model for VP and Q produces the same wavefield perturbation in the
framework of the Born approximation as the original perturbation
model. Despite this theoretical limitation, preconditioning of the
Hessian is investigated by Hak and Mulder 共2008兲 to improve the
convergence of the joint inversion for VP and Q.
Assessment and application of viscoacoustic frequency-domain
FWI is presented at various scales by Liao and McMechan 共1995,
1996兲, Song et al. 共1995兲, Hicks and Pratt 共2001兲, Pratt et al. 共2005兲,
Malinowsky et al. 共2007兲, and Smithyman et al. 共2008兲. Kamei and
Pratt 共2008兲 recommend inversion for VP only in a first step and then
joint inversion for VP and Q in a second step because the reliability of
the attenuation reconstruction strongly depends on the accuracy of
the starting VP model. Indeed, accurate VP models are required before reconstructing Q so that the inversion can discriminate the intrinsic attenuation from the extrinsic attenuation.
Elastic parameters
A limited number of applications of elastic FWI have been proposed. Because VP is the dominant parameter in elastic FWI, Tarantola 共1986兲 recommends inversion first for VP and IP, second for VS
and IS, and finally for density. This strategy might be suitable if the
footprint of the S-wave velocity structure on the seismic wavefield is
sufficiently small. This hierarchical strategy over parameter classes
is illustrated by Sears et al. 共2008兲, who assess time-domain FWI of
multicomponent OBC data with synthetic examples. They highlight
how the behavior of FWI becomes ill-posed for S-wave velocity reconstruction when the S-wave velocity contrast at the sea bottom is
small. In this case, the S-wave velocity structure has a minor footprint on the seismic wavefield because the amount of PS conversion
is small at the sea bottom. In this configuration, they recommend inversion first for VP, using only the vertical component; second for VP
and VS from the vertical component; and finally for VS, using both
components. The aim of the second stage is to reconstruct the intermediate wavelengths of the S-wave velocity structure by exploiting
the AVO behavior of the P-waves.
In contrast, Brossier et al. 共2009a兲 conclude that joint inversion
for VP and VS with judicious hierarchical data preconditioning by
time damping is necessary for inversion of land data involving both
body waves and surface waves. The strong sensitivity of the highamplitude surface waves to the near-surface S-wave velocity structure requires inversion for VS during the early stages of the inversion.
This makes the elastic inversion of onshore data highly nonlinear
when the surface waves are preserved in the data.
A recent application of elastic FWI to a gas field in China is presented by Shi et al. 共2007兲. They invert for the Lamé parameters and
unambiguously image Poisson’s ratio anomalies associated with the
presence of gas. They accelerate the convergence of the inversion by
computing an efficient step length using an adaptive controller based
on the theory of model-reference nonlinear control. Several logs
available along the profile confirm the reliability of this gas-layer
reconstruction.
Anisotropy
Reconstruction of anisotropic parameters by FWI is probably one
of the most undeveloped and challenging fields of investigation. Vertically transverse isotropy 共VTI兲 or tilted transversely isotropic
共TTI兲 media are generally considered a realistic representation of
geologic media in oil and gas exploration, although fractured media
require an orthorhombic description 共Tsvankin, 2001兲. The normalmoveout 共NMO兲 P-wave velocity in VTI media depends on only two
parameters: the NMO velocity for a horizontal reflector VNMO共0兲
⳱ V P0冑1 Ⳮ 2␦ and the ⳱ 共 ⳮ ␦ 兲 / 共1 Ⳮ 2␦ 兲 parameter 共Alkhalifah and Tsvankin, 1995兲, which is a combination of Thomsen’s parameters and ␦ 共Thomsen, 1986兲. The dependency of NMO velocity in VTI media on a limited subset of anisotropic parameters suggests that defining the parameter classes to be involved in FWI will
be a key task. Another issue will be to assess to what extent FWI can
be performed in the acoustic approximation knowing that acoustic
media are by definition isotropic 共Grechka et al., 2004兲. The kinematic and dynamic accuracy of an acoustic TTI wave equation for
FWI is discussed in Operto et al. 共2009兲.
A feasibility study of FWI in VTI media for crosswell acquisitions
is presented in Barnes et al. 共2008兲. They invert for five parameter
classes — VP, VS, density, ␦ , and — and show reliable reconstruction of the classes, even with noisy data. Pratt et al. 共2001, 2008兲 apply anisotropic FWI to crosshole real data. The results of Pratt et al.
共2001兲 highlight the difficulty in discriminating layer-induced anisotropy from intrinsic anisotropy in FWI.
Further investigations of anisotropic FWI in the case of surface
seismic data are required. In particular, the benefit of wide apertures
in resolving as many anisotropic parameters as possible needs to be
investigated 共Jones et al., 1999兲.
Three-dimensional FWI
Because of the continuous increase in computational power and
the evolution of acquisition systems toward wide-aperture and wideazimuth acquisition, 3D acoustic FWI is feasible today. In three dimensions, the computational burden of multisource seismic modeling is one of the main issues. The pros and cons of time-domain versus frequency-domain seismic modeling for FWI have been discussed.Another issue is assessing the impact of azimuth coverage on
FWI. Sirgue et al. 共2007兲 show the footprint of the azimuth coverage
in 3D surveys on the velocity model reconstructed by FWI. Their imaging confirms the importance of wide-azimuth surveys for FWI of
coarse acquisitions such as node surveys.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
Sirgue et al. 共2009兲 apply 3D frequency-domain FWI to the hydrophone component of OBC data from the shallow-water Valhall
field. Frequencies between 3.5 and 7 Hz are inverted successively,
using a starting model built by ray-based reflection tomography.
They successfully image a complex network of shallow channels at
150 m depth and a gas cloud between 1000 and 2500 m depth. Although the spacing between the cables is as high as 300 m, a limited
footprint of the acquisition is visible in the reconstructed models.
Comparisons of depth-migrated sections computed from the reflection tomography model and the FWI velocity model show the improvements provided by FWI, both in the shallow structure and at
the reservoir level below the gas cloud. The step-change improvement in the quality of the depth-migrated image results from the
high-resolution nature of the velocity model from FWI and the accounting of the intrabed multiples by the two-way wave-equation
modeling engine. Comparisons between depth slices across the
channels and the gas cloud extracted from the reflection tomography
and the FWI models highlight the significant resolution improvement provided by FWI 共Figure 14兲.
Solving large-scale 3D elastic problems is probably beyond our
current tools because of the computational burden of 3D elastic
modeling for many sources. This has prompted several studies to develop strategies to mitigate the number of forward simulations required during migration or FWI of large data sets. One of these approaches stacks the seismic sources before modeling 共Capdeville et
al., 2005兲. Because the relationship between the seismic wavefield
and the source is linear, stacking sources is equivalent to emitting
each source simultaneously instead of independently. This assem-
Most 3D FWI applications have been limited to low frequencies
共⬍7 Hz兲. At these frequencies, FWI can be seen as a tool for velocity-model-building rather than a self-contained seismic-imaging tool
that continuously proceeds from the velocity-model-building task to
the migration task through the continuous sampling of wavenumbers 共Ben Hadj Ali et al., 2008b兲.
Ben Hadj Ali et al. 共2008a兲 apply a frequency-domain FWI algorithm to a series of synthetic data sets. The forward problem is solved
in the frequency domain with a massively parallel direct solver.
Plessix 共2009兲 presents an application to real ocean-bottom-seismometer 共OBS兲 data. Seismic modeling is performed in the frequency domain with an iterative solver. The inverted frequencies range
between 2 and 5 Hz. The inversion converges to a similar velocity
model down to the top of a salt body using two different starting velocity models, with one a simple velocity-gradient model. An application to a real 3D streamer data set is presented by Warner et al.
共2008兲 for the imaging of a shallow channel. They perform seismic
modeling in the frequency domain using an iterative solver 共Warner
et al., 2007兲.
Three-dimensional time-domain FWI is developed by Vigh and
Starr 共2008a兲, in which the input data in FWI are plane-wave gathers
rather than shot gathers. The main motivation behind the use of
plane-wave shot gathers is to mitigate the computational burden by
decimating the volume of data. The computational cost is reduced by
one order of magnitude for 2D applications and by a factor 3 for 3D
applications when the plane-wave-based approach is used instead of
the shot-based approach.
Dipline (km)
a)
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
Dipline (km)
c)
18
3
3
1700
2200
7
2000
8
9
4
5
6
7
8
9
10
11
12
13
14
15
16
17
d)
18
12
13
14
15
16
17
18
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
3
V (km/s)
P
4
2400
6
2200
7
8
11
9
11
2000
Crossline (km)
1700
Crossline (km)
1800
10
8
11
5
9
1800
3
1900
8
7
10
4
7
6
10
3
V (km/s)
P
6
5
6
b)
5
4
2400
Crossline (km)
1800
V (km/s)
P
5
Crossline (km)
1900
4
3
4
V (km/s)
P
WCC145
5
6
7
8
9
9
1800
10
10
11
11
Figure 14. Imaging of the Valhall field by 3D FWI. Depth slices extracted from the velocity models. 共a,c兲 Built by ray-based reflection tomography. 共b,d兲 Built by 3D FWI. 共c,d兲 The shallow slice at 150 m depth shows a complex network of channels in the FWI model, although the deeper
slice at 1050 m depth shows a much higher resolution of the top of the gas cloud in the FWI model 共image courtesy L. Sirgue and O. I. Barkved,
BP兲.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC146
Virieux and Operto
blage generates artifacts in the imaging that arise from the undesired
correlation between each independent source wavefield with the
back-propagated residual wavefields associated with the other
sources of the stack. Applying some random-phase encoding to each
source before assemblage mitigates these artifacts. The method originally was applied to migration algorithms 共Romero et al., 2000兲.
Promising applications to time- and frequency-domain FWI are presented in Krebs et al. 共2009兲 and Ben Hadj Ali et al. 共2009a, 2009b兲.
Alternatively, Herrmann et al. 共2009兲 propose to recover the sourceseparated wavefields from the simultaneous simulation before FWI.
An illustration of the source-encoding technique is provided in
Figure 15, in which a dip section of the overthrust model is built
three ways: by conventional frequency-domain FWI 共i.e., without
source assemblage; Figure 15b兲, by FWI with source assemblage but
without phase encoding 共Figure 15c兲, and by FWI with source assemblage and phase encoding 共Figure 15d兲. The models were obtained by successive inversions of four groups of two frequencies
ranging between 3.5 and 20 Hz. The number of shots was 200, and
no noise was added to the data. The number of iterations per frequency group to obtain the final FWI models without and with the source
assemblage was 15 and 200, respectively. The time to compute the
model of Figure 15d was seven times less than the time to build the
model of Figure 15b. More details can be found in Ben Hadj Ali et al.
共2009兲. If the phase-encoding technique is seen as sufficiently robust, especially in the presence of noise, it is likely that 3D elastic
FWI can be viewed in the near future using sophisticated modeling
engines.
Distance (km)
0
8
10
12
14
16
18
5000
4000
3000
2000
2
4
6
8
10
12
14
16
18
20
4000
3000
VP (m/s)
5000
2000
0
2
4
6
8
10
12
14
16
18
20
6000
5000
4000
3000
2000
0
0
1
2
3
4
Table 1. Nomenclature, listed as introduced in the text.
Symbol
Description
x
t, f, ⳱ 2 / f
Spatial coordinates 共m兲
Time 共s兲, frequency 共Hz兲, circular frequency
共rad/s兲
Wavenumber vector, wavenumber 共1/m兲,
where c is wavespeed
Wavelength 共m兲
Mass, stiffness, impedance matrices in
the wave equation
Wavefield solution of the wave equation and
seismic source
P-wave and S-wave velocities 共m/s兲, density
共kg/ m3兲
Attenuation factor for P-waves
Anisotropic Thomsen’s parameters
Updated and starting FWI models,
perturbation model
Recorded and modeled data
k, k ⳱ c / f
⳱1/k
M, A, B
V P, V S,
Q
␦,
m, m0, ∆m
6000
0
1
2
3
4
d)
VP (m/s)
6000
0
Most of the FWI methods presented and assessed in the literature
are based on the local least-squares optimization formulation, in
which the misfit between the observed seismograms and the modeled ones are minimized in the time domain or in the frequency domain. Without very low frequencies 共⬍1 Hz兲, it remains very difficult to obtain reliable results from these approaches when considering real data, especially at high frequencies. Clearly, new formulations of FWI are needed to proceed further.
Recent improvements relate to Hessian estimation, which has
been shown to be quite important for better convergence toward the
solution. A systematic strategy since the beginning of FWI investigations has been the progressive introduction of the entire content of
seismograms through multiscale approaches to partially prevent the
convergence toward secondary minima. Transforming the data at the
u共x,t / 兲, s共x,t / 兲
20
VP (m/s)
Depth (km)
6
0
1
2
3
4
c)
Depth (km)
4
0
1
2
3
4
b)
Depth (km)
2
2
4
6
8
10
12
14
16
18
20
6000
5000
4000
3000
VP (m/s)
Depth (km)
a)
DISCUSSION
2000
Figure 15. Application of source encoding in FWI; 200 sources and
receivers were on the surface. 共a兲 Dip section of the overthrust
model. 共b兲 FWI model obtained without source assemblage and
phase encoding; 15 iterations per frequency were computed. 共c兲 FWI
model after assemblage of all the sources in one super shot. No phase
encoding was applied. 共d兲 FWI model obtained with source assemblage and phase encoding; 200 iterations per frequency were computed 共image courtesy of H. Ben Hadj Ali兲.
dobs, dcal共m兲
⌬d ⳱ dobs ⳮ dcal共m兲 Data-misfit vector
Misfit function, gradient of the misfit
C共m兲, ⵜCm
function
J ⫽ 关∂d/∂m兴
Sensitivity or Fréchet derivative matrix
␣
Step length in gradient methods
Iteration number in FWI
n
†
Approximate Hessian
Ha ⳱ J J
Descent direction and Polak and Ribière
p共n兲,  共n兲
coefficient in conjugate
gradients
t
Weighting operators in the data space
W d ⳱ S dS d
Wm ⳱ SmtSm
共ᐉ兲
f
s, r
Weighting operators in the model space
Damping in damped least-squares FWI
Virtual source in FWI for diffractor mᐉ
Source and receiver indices
Diffraction or aperture angle
Damping coefficient for time damping of
seismograms
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
different stages of the local inversion procedure 共particle displacement, particle velocity, logarithms of these quantities, divergence兲
can also provide benefits for global convergence of the optimization
procedure.
Do we need to mitigate the nonlinearity of the inverse problem
through more sophisticated search strategies such as semiglobal approaches or even global sampling of the model space? Because this
search is quite computationally intensive, should we reduce the dimensionality of the parameter models? To perform such global
searches, should we speed up the forward model at the expense of its
precision? If so, can we consider these procedures as intermediate
strategies for approaching the global minimum?
A pragmatic workflow, from low to high spatial-frequency content, should be expected to move hierarchically from imaging the
large medium wavelengths to the short medium wavelengths.Astepby-step procedure for extracting the information, starting from traveltimes and proceeding to true amplitudes, might include many intermediate steps for interpreting new observables 共polarization, apparent velocity, envelope兲.
Another aspect is the quality control of a reconstruction. An objective analysis for uncertainty estimation is important and might
rely on semiglobal investigations once the global minimum has been
found. Various strategies can be considered that would be manageable as statistical procedures 共bootstrapping or jackknifing techniques兲 or as local nondifferential approaches 共simplex, simulation
annealing, genetic algorithms兲.
Finally, solving large-scale 3D elastic problems remains beyond
our present tools, although one must be aware that these aids are
right around the corner. Because massive data acquisition for 3D reconstruction has been achieved, we indeed expect an improvement
in our data-crunching for high-resolution imaging.
An appealing reconstruction is 4D imaging, which is based on
time-lapse evolution of targets inside the earth. Differential data are
available, providing us with new information for tracking the evolution of the subsurface parameters. Thus, fluid tracking and variations
in solid-rock matrices are possible challenges for FWI in the near future.
CONCLUSION
FWI is the last-course procedure for extracting information from
seismograms. We have shown the conceptual efforts that have been
carried out over the last 30 years to provide FWI as a possible tool for
high-resolution imaging. These efforts have been focused on development of large-scale numerical optimization techniques, efficient
resolution of the two-way wave equation, judicious model parameterization for multiparameter reconstruction, multiscale strategies to
mitigate the ill-posedness of FWI, and specific waveform-inversion
data preprocessing.
FWI is mature enough today for prototype application to 3D real
data sets. Although applications to 3D real data have shown promising results at low frequencies 共⬍7 Hz兲, it is still unclear to what extent FWI can be applied efficiently at higher frequencies. To answer
this question, a more quantitative understanding of FWI sensitivity
to the accuracy of the starting model, to the noise, and to the amplitude accuracies is probably required.
If FWI remains limited to low frequencies, it will remain a tool to
build background models for migration. In the opposite case, FWI
will tend toward a self-contained processing workflow that can reunify macromodel building and migration tasks.
WCC147
The present is exciting because realistic applications are becoming possible right now. However, new strategies must be found to
make this technique as attractive as the scientific issues require.
Fields of investigation should address the need to speed up the forward problem by means of providing new hardware 共GPUs兲 and
software 共compressive sensing兲, defining new minimization criteria
in the model and data spaces, and incorporating more sophisticated
wave phenomena 共attenuation, elasticity, anisotropy兲 in modeling
and inversion.
ACKNOWLEDGMENTS
We thank L. Sirgue and O. I. Barkved from BP for providing Figure 14 and for permission to present it. L. Sirgue is acknowledged for
editing part of the manuscript. We thank C. Shin 共Seoul National
University兲 and Y. H. Cha 共Seoul National University兲 for providing
Figure 11. We thank A. Ribodetti 共Géosciences Azur-IRD兲 for discussion on attenuation reconstruction. We are grateful to R. Brossier
共Géosciences Azur-UNSA兲 and H. Ben Hadj Ali 共Géosciences AzurUNSA兲 for providing Figures 8, 9, and 15 and for fruitful discussions on many aspects of FWI discussed in this study, in particular
multiparameter FWI, norms, Rytov approximation, and source encoding. We would like to thank R. E. Plessix 共Shell兲 for his explanations of the adjoint-state method. J. Virieux thanks E. Canet 共LGIT兲
and A. Sieminski 共LGIT兲 for an interesting discussion on the inverse
problem and its relationship with assimilation techniques. We would
like to thank S. Buske 共University of Berlin兲, T. Nemeth 共Chevron兲,
I. Lecomte 共NORSAR兲, and V. Sallares 共CMIMA-CSIC Barcelona兲
for their encouragement during the preparation of the manuscript.
Finally, we thank I. Lecomte and T. Nemeth for the time they dedicated to the review of the manuscript. We thank the sponsors of the
SEISCOPE consortium 共BP, CGGVeritas, ExxonMobil, Shell, and
Total兲 for their support.
APPENDIX A
APPLICATION OF THE ADJOINT-STATE
METHOD TO FWI
In this appendix, we provide some guidelines for the derivation
of the gradient of the misfit function 共equation 24兲 with the adjointstate method and Lagrange multipliers. The reader is referred to Nocedal and Wright 共1999兲 for a review of constrained optimization
and to Plessix 共2006兲 for a review of the adjoint-state method and its
application to FWI.
First, we introduce the Lagrangian function L corresponding to
the misfit function C augmented with equality constraints:
1
L共ū,m,兲 ⳱ 具P共dobs ⳮ Rū兲兩P共dobs ⳮ Rū兲典
2
ⳮ 具兩B共m兲ū ⳮ s典
共A-1兲
The equality constraints correspond to the forward-problem equation, namely, the state equation, which must be satisfied by the seismic wavefield. A realization u of the state equation is the so-called
state variable. In equation A-1, we introduce the variable ū to distinguish any element of the state variable space from a realization of the
state equation 共Plessix, 2006兲.
The vector , the dimension of which is that of the wavefield u, is
the Lagrange multiplier; it corresponds to the adjoint-state variables.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC148
Virieux and Operto
In the framework of the theory of constrained optimization, the firstorder necessary optimality conditions known as the Karush-KuhnTucker 共KKT兲 conditions state that the solution of the optimization
problem is obtained at the stationary points of L.
The first condition, 关 L / 兴ū⳱cste,m⳱cste ⳱ 0, leads to the forward-problem equation: B共m兲u ⫽ s. Resolving the state equation for
m ⳱ m0 provides the incident wavefield for FWI.
The second condition, 关 L / ū兴⳱cste,m⳱cste ⳱ 0, leads to the socalled adjoint-state equation, expressed as
L共ū,m,兲
ū
⳱ P共dobs ⳮ Rū兲 ⳮ B†共m兲 ⳱ 0. 共A-2兲
For the derivation of equation A-2, we use the fact that 具 ,B共m兲ū典
⳱ 具B†共m兲,ū典 and that the source does not depend on ū. Choosing
m ⳱ m0 and ū ⳱ u共m0兲 in equation A-2 leads to
B†共m0兲 ⳱ P共⌬d0兲,
共A-3兲
which can be rewritten equivalently as
* ⳱ Bⳮ1共m0兲P共⌬d*0 兲,
共A-4兲
where we exploit the fact that Bⳮ1 is symmetric by virtue of the spatial reciprocity of Green’s functions. The adjoint-state variables are
computed by solving a forward problem for a composite source
formed by the conjugate of the residuals, which is equivalent to back
propagation of the residuals in the model.
The third condition, 关 L / m兴ū⳱cste,⳱cste ⳱ 0, defines the minimum of L in a comparable way as for the unconstrained minimization of the misfit function C. We have
L共ū,m,兲
m
冓
⳱ 兩
冔
B共m兲
ū .
m
共A-5兲
For any realization of the forward problem u, L共u,m,兲 ⳱ C共m兲.
Therefore, equation A-5 gives the expression of the desired gradient
of C as a function of the state variable and adjoint-state variable
when ū ⳱ u:
冓
冔
C
B共m兲
⳱ 兩
u .
m
m
共A-6兲
Inserting the expression of 共equation A-4兲 into equation A-3 and
choosing m ⳱ m0 gives the expression of the gradient of C at the
point m0 in the opposite direction of which a minimum of C is sought
for in FWI:
C共m0兲
Bt共m0兲 ⳮ1
⳱ ut共m0兲
B 共m0兲P共⌬d*0 兲. 共A-7兲
m
m
Equation A-7 is equivalent to equation 24 in the main text.
REFERENCES
Abubakar, A., W. Hu, T. Habashy, and P. van den Berg, 2009, A finite-difference contrast source inversion algorithm for full-waveform seismic applications: Geophysics, this issue.
Akcelik, V., 2002, Multiscale Newton-Krylov methods for inverse acoustic
wave propagation: Ph.D. thesis, Carnegie Mellon University.
Aki, K., and P. Richards, 1980, Quantitative seismology: Theory and methods: W.H. Freeman & Co.
Alerini, M., S. Le Bégat, G. Lambaré, and R. Baina, 2002, 2D PP- and PS-stereotomography for a multicomponent datset: 72nd Annual International
Meeting, SEG, Expanded Abstracts, 838–841.
Alkhalifah, T., and I. Tsvankin, 1995, Velocity analysis for transversely isotropic media: Geophysics, 60, 1550–1566.
Amundsen, L., 1991, Comparison of the least-squares criterion and the
Cauchy criterion in frequency-wavenumber inversion: Geophysics, 56,
2027–2035.
Amundsen, L., and B. Ursin, 1991, Frequency-wavenumber inversion of
acoustic data: Geophysics, 56, 1027–1039.
Askan, A., 2006, Full waveform inversion for seismic velocity and anelastic
losses in heterogeneous structures: Ph.D. thesis, Carnegie Mellon University.
Askan, A., V. Akcelik, J. Bielak, and O. Ghattas, 2007, Full waveform inversion for seismic velocity and anelastic losses in heterogeneous structures:
Bulletin of the Seismological Society of America, 97, 1990–2008.
Askan, A., and J. Bielak, 2008, Full anelastic waveform tomography including model uncertainty: Bulletin of the Seismological Society of America,
98, 2975–2989.
Barnes, C., and M. Charara, 2008, Full-waveform inversion results when using acoustic approximation instead of elastic medium: 78th Annual International Meeting, SEG, Expanded Abstracts, 1895–1899.
Barnes, C., M. Charara, and T. Tsuchiya, 2008, Feasibility study for an anisotropic full waveform inversion of cross-well data: Geophysical Prospecting, 56, 897–906.
Baysal, E., D. Kosloff, and J. Sherwood, 1983, Reverse time migration: Geophysics, 48, 1514–1524.
Bednar, J. B., C. Shin, and S. Pyun, 2007, Comparison of waveform inversion, Part 3: Amplitude approach: Geophysical Prospecting, 55, 477–485.
Ben Hadj Ali, H., S. Operto, and J. Virieux, 2008a, Velocity model building
by 3D frequency-domain full-waveform inversion of wide-aperture seismic data: Geophysics, 73, no. 5, VE101–VE117.
Ben Hadj Ali, H., S. Operto, J. Virieux, and F. Sourbier, 2008b, 3D frequency-domain full-waveform tomography based on a domain decomposition
forward problem: 78thAnnual International Meeting, SEG, ExpandedAbstracts, 1945–1949.
——–, 2009a, Efficient 3D frequency-domain full waveform inversion with
phase encoding: 71st Conference & Technical Exhibition, EAGE, Extended Abstracts, 5812.
——–, 2009b, Three-dimensional frequency-domain full-waveform inversion with phase encoding: 79th Annual International Meeting, SEG, Expanded Abstracts, 2288–2292.
Beydoun, W. B., and A. Tarantola, 1988, First Born and Rytov approximation: Modeling and inversion conditions in a canonical example: Journal
of the Acoustical Society of America, 83, 1045–1055.
Beylkin, G., 1985, Imaging of discontinuities in the inverse scattering problem by inversion of a causal generalized Radon transform: Journal of
Mathematical Physics, 26, 99–108.
Beylkin, G., and R. Burridge, 1990, Linearized inverse scattering problems
in acoustics and elasticity: Wave Motion, 12, 15–52.
Billette, F., and G. Lambaré, 1998, Velocity macro-model estimation from
seismic reflection data by stereotomography: Geophysical Journal International, 135, 671–680.
Billette, F., S. Le Bégat, P. Podvin, and G. Lambaré, 2003, Practical aspects
and applications of 2D stereotomography: Geophysics, 68, 1008–1021.
Biondi, B., and W. Symes, 2004, Angle-domain common-image gathers for
migration velocity analysis by wavefield-continuation imaging: Geophysics, 69, 1283–1298.
Bleibinhaus, F., J. A. Hole, T. Ryberg, and G. S. Fuis, 2007, Structure of the
California Coast Ranges and San Andreas fault at SAFOD from seismic
waveform inversion and reflection imaging: Journal of Geophysical
Research, doi: 10.1029/2006JB004611.
Bleistein, N., 1987, On the imaging of reflectors in the earth: Geophysics, 52,
931–942.
Bouchon, M., M. Campillo, and S. Gaffet, 1989, A boundary integral equation — Discrete wavenumber representation method to study wave propagation in multilayered media having irregular interfaces: Geophysics, 54,
1134–1140.
Brenders, A. J., and R. G. Pratt, 2007a, Efficient waveform tomography for
lithospheric imaging: Implications for realistic 2D acquisition geometries
and low frequency data: Geophysical Journal International, 168, 152–170.
——–, 2007b, Full waveform tomography for lithospheric imaging: Results
from a blind test in a realistic crustal model: Geophysical Journal International, 168, 133–151.
——–, 2007c, Waveform tomography of marine seismic data: What can limited offset offer?: 75th Annual International Meeting, SEG, Expanded Abstracts, 3024–3029.
Brossier, R., S. Operto, and J. Virieux, 2009a, Seismic imaging of complex
structures by 2D elastic frequency-domain full-waveform inversion: Geophysics, this issue.
——–, 2009b, Two-dimensional seismic imaging of the Valhall model from
synthetic OBC data by frequency-domain elastic full-waveform inversion: 79th Annual International Meeting, SEG, Expanded Abstracts,
2293–2297.
Brossier, R., J. Virieux, and S. Operto, 2008, Parsimonious finite-volume fre-
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
quency-domain method for 2D P-SV wave modelling: Geophysical Journal International, 175, 541–559.
Bube, K. P., and T. Nemeth, 2007, Fast line searches for the robust solution of
linear systems in the hybrid ᐉ1 / ᐉ2 and Huber norms: Geophysics, 72, no. 2,
A13–A17.
Bunks, C., F. M. Salek, S. Zaleski, and G. Chavent, 1995, Multiscale seismic
waveform inversion: Geophysics, 60, 1457–1473.
Capdeville, Y., Y. Gung, and B. Romanowicz, 2005, Towards global earth tomography using the spectral element method: A technique based on source
stacking: Geophysical Journal International, 162, 541–554.
Carcione, J. M., G. C. Herman, and A. P. E. ten Kroode, 2002, Seismic modeling: Geophysics, 67, 1304–1325.
Cary, P., and C. Chapman, 1988, Automatic 1-D waveform inversion of marine seismic refraction data: Geophysical Journal of the Royal Astronomical Society, 93, 527–546.
Chapman, C., 1985, Ray theory and its extension: WKBJ and Maslov seismograms: Journal of Geophysics, 58, 27–43.
Chavent, G., 1974, Identification of parameter distributed systems, in R.
Goodson and M. Polis, eds., Identification of function parameters in partial differential equations: American Society of Mechanical Engineers,
31–48.
Chen, P., T. Jordan, and L. Zhao, 2007, Full three-dimensional tomography:
Acomparison between the scattering-integral and adjoint-wavefield methods: Geophysical Journal International, 170, 175–181.
Chironi, C., J. V. Morgan, and M. R. Warner, 2006, Imaging of intrabasalt and
subbasalt structure with full wavefield seismic tomography: Journal of
Geophysical Research, 11, B05313, doi: 10.1029/2004JB003595.
Choi, Y., D.-J. Min, and C. Shin, 2008, Two-dimensional waveform inversion of multi-component data in acoustic-elastic coupled media: Geophysical Prospecting, 56, 863–881.
Claerbout, J., 1971, Toward a unified theory of reflector mapping: Geophysics, 36, 467–481.
——–, 1976, Fundamentals of geophysical data processing: McGraw-Hill
Book Co., Inc.
Claerbout, J., and S. Doherty, 1972, Downward continuation of moveout corrected seismograms: Geophysics, 37, 741–768.
Crase, E., A. Pica, M. Noble, J. McDonald, and A. Tarantola, 1990, Robust
elastic non-linear waveform inversion: Application to real data: Geophysics, 55, 527–538.
Danecek, P., and G. Seriani, 2008, An efficient parallel Chebyshev pseudospectral method for large-scale 3D seismic forward modelling: 70th Conference & Technical Exhibition, EAGE, Extended Abstracts, P046.
de Hoop, A., 1960, A modification of Cagniard’s method for solving seismic
pulse problems: Applied Scientific Research, 8, 349–356.
Dessa, J. X., and G. Pascal, 2003, Combined traveltime and frequency-domain seismic waveform inversion: A case study on multi-offset ultrasonic
data: Geophysical Journal International, 154, 117–133.
Devaney, A. J., 1982, A filtered backprojection algorithm for diffraction tomography: Ultrasonic Imaging, 4, 336–350.
Devaney, A. J., and D. Zhang, 1991, Geophysical diffraction tomography in
a layered background: Wave Motion, 14, 243–265.
Djikpéssé, H., and A. Tarantola, 1999, Multiparameter ᐉ1-norm waveform
fitting: Interpretation of Gulf of Mexico reflection seismograms: Geophysics, 64, 1023–1035.
Docherty, P., R. Silva, S. Singh, Z.-M. Song, and M. Wood, 2003, Migration
velocity analysis using a genetic algorithm: Geophysical Prospecting, 45,
865–878.
Dumbser, M., and M. Kaser, 2006, An arbitrary high-order discontinuous
Galerkin method for elastic waves on unstructured meshes — II. The
three-dimensional isotropic case: Geophysical Journal International, 167,
319–336.
Dummong, S., K. Meier, D. Gajewski, and C. Hubscher, 2008, Comparison
of prestack stereotomography and NIP wave tomography for velocity
model building: Instances from the Messinian evaporites: Geophysics, 73,
no. 5, VE291–VE302.
Ellefsen, K., 2009, A comparison of phase inversion and traveltime tomography for processing of near-surface refraction traveltimes: Geophysics, this
issue.
Epanomeritakis, I., V. Akçelik, O. Ghattas, and J. Bielak, 2008, A NewtonCG method for large-scale three-dimensional elastic full waveform seismic inversion: Inverse Problems, 24, 1–26.
Erlangga, Y. A., and F. J. Herrmann, 2008, An iterative multilevel method for
computing wavefields in frequency-domain seismic inversion: 78thAnnual International Meeting, SEG, Expanded Abstracts, 1956–1960.
Ernst, J. R., A. G. Green, H. Maurer, and K. Holliger, 2007, Application of a
new 2D time-domain full-waveform inversion scheme to crosshole radar
data: Geophysics, 72, no. 5, J53–J64.
Fichtner, A., B. L. N. Kennett, H. Igel, and H. P. Bunge, 2008, Theoretical
background for continental- and global-scale full-waveform inversion in
the time-frequency domain: Geophysical Journal International, 175,
665–685.
Forgues, E., and G. Lambaré, 1997, Parameterization study for acoustic and
WCC149
elastic ray ⫹ Born inversion: Journal of Seismic Exploration, 6, 253–278.
Futterman, W., 1962, Dispersive body waves: Journal of Geophysics Research, 67, 5279–5291.
Gao, F., A. R. Levander, R. G. Pratt, C. A. Zelt, and G. L. Fradelizio, 2006a,
Waveform tomography at a groundwater contamination site: Surface reflection data: Geophysics, 72, no. 5, G45–G55.
——–, 2006b, Waveform tomography at a groundwater contamination site:
VSP-surface data set: Geophysics, 71, no. 1, H1–H11.
Gauthier, O., J. Virieux, and A. Tarantola, 1986, Two-dimensional nonlinear
inversion of seismic waveforms: Numerical results: Geophysics, 51,
1387–1403.
Gazdag, J., 1978, Wave equation migration with the phase-shift method:
Geophysics, 43, 1342–1351.
Gelis, C., J. Virieux, and G. Grandjean, 2007, 2D elastic waveform inversion
using Born and Rytov approximations in the frequency domain: Geophysical Journal International, 168, 605–633.
Gelius, L. J., I. Johansen, N. Sponheim, and J. J. Stamnes, 1991, A generalized diffraction tomography algorithm: Journal of the Acoustical Society
of America, 89, 523–528.
Gilbert, F., and A. Dziewonski, 1975, An application of normal mode theory
to the retrieval of structural parameters and source mechanisms from seismic spectra: Philosophical Transactions of the Royal Society of London,
278, 187–269.
Graves, R., 1996, Simulating seismic wave propagation in 3D elastic media
using staggered-grid finite differences: Bulletin of the Seismological Society of America, 86, 1091–1106.
Grechka, V., L. Zhang, and J. W. Rector III, 2004, Shear waves in acoustic anisotropic media: Geophysics, 69, 576–582.
Guitton, A., and W. W. Symes, 2003, Robust inversion of seismic data using
the Huber norm: Geophysics, 68, 1310–1319.
Gutenberg, B., 1914, Über erdbenwellen viia. beobachtungen an registrierungen von fernbeben in göttingen und folgerungen über die konstitution
des erdkörpers: Nachrichten von der Könglichen Gesellschaft der Wissenschaften zu Göttinge, Mathematisch-Physikalische Klasse, 125–176.
Ha, T., W. Chung, and C. Shin, 2009, Waveform inversion using a back-propagation algorithm and a Huber function norm: Geophysics, 74, no. 3, R15–
R24.
Hak, B., and W. Mulder, 2008, Preconditioning for linearized inversion of attenuation and velocity perturbations: 78th Annual International Meeting,
SEG, Expanded Abstracts, 2036–2040.
Hansen, C., ed., 1998, Rank-defficient and discrete ill-posed problems —
Numerical aspects of linear inversion: Society for Industrial and Applied
Mathematics.
Herrmann, F. J., Y. Erlangga, and T. T. Y. Lin, 2009, Compressive sensing applied to full-wave form inversion: 71st Conference & Technical Exhibition, EAGE, Extended Abstracts, S016.
Hicks, G. J., and R. G. Pratt, 2001, Reflection waveform inversion using local
descent methods: Estimating attenuation and velocity over a gas-sand deposit: Geophysics, 66, 598–612.
Hole, J. A., 1992, Nonlinear high-resolution three-dimensional seismic travel time tomography: Journal of Geophysical Research, 97, 6553–6562.
Hu, W., A. Abubakar, and T. M. Habashy, 2009, Simultaneous multifrequency inversion of full-waveform seismic data: Geophysics, 74, no. 2, R1–
R14.
Hustedt, B., S. Operto, and J. Virieux, 2004, Mixed-grid and staggered-grid
finite difference methods for frequency domain acoustic wave modelling:
Geophysical Journal International, 157, 1269–1296.
Ikelle, L., J. P. Diet, and A. Tarantola, 1988, Linearized inversion of multioffset seismic reflection data in the -k domain: Depth-dependent reference
medium: Geophysics, 53, 50–64.
Improta, L., A. Zollo, A. Herrero, R. Frattini, J. Virieux, and P. DellÁversana,
2002, Seismic imaging of complex structures by non-linear traveltimes inversion of dense wide-angle data: Application to a thrust belt: Geophysical
Journal International, 151, 264–278.
Jaiswal, P., C. Zelt, A. W. Bally, and R. Dasgupta, 2008, 2-D traveltime and
waveform inversion for improved seismic imaging: Naga thrust and fold
belt, India: Geophysical Journal International, doi: 10.1111/j.1365246X.2007.03691.x.
Jaiswal, P., C. Zelt, R. Dasgupta, and K. Nath, 2009, Seismic imaging of the
Naga thrust using multiscale waveform inversion: Geophysics, this issue.
Jannane, M., W. Beydoun, E. Crase, D. Cao, Z. Koren, E. Landa, M. Mendes,
A. Pica, M. Noble, G. Roeth, S. Singh, R. Snieder, A. Tarantola, and D.
Trézéguet, 1989, Wavelengths of earth structures that can be resolved from
seismic reflection data: Geophysics, 54, 906–910.
Jin, S., and R. Madariaga, 1993, Background velocity inversion by a genetic
algorithm: Geophysical Research Letters, 20, 93–96.
——–, 1994, Non-linear velocity inversion by a two-step Monte Carlo method: Geophysics, 59, 577–590.
Jin, S., R. Madariaga, J. Virieux, and G. Lambaré, 1992, Two-dimensional
asymptotic iterative elastic inversion: Geophysical Journal International,
108, 575–588.
Jo, C. H., C. Shin, and J. H. Suh, 1996, An optimal 9-point, finite-difference,
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC150
Virieux and Operto
frequency-space 2D scalar extrapolator: Geophysics, 61, 529–537.
Jones, K., M. Warner, and J. Brittan, 1999, Anisotropy in multi-offset deepcrustal seismic experiments: Geophysical Journal International, 138, 300–
318.
Kamei, R., and R. G. Pratt, 2008, Waveform tomography strategies for imaging attenuation structure for cross-hole data: 70th Conference & Technical
Exhibition, EAGE, Extended Abstracts, F019.
Kennett, B. L. N., 1983, Seismic wave propagation in stratified media: Cambridge University Press.
Klem-Musatov, K. D., and A. M. Aizenberg, 1985, Seismic modelling by
methods of the theory of edge waves: Journal of Geophysics, 57, 90–105.
Kolb P., F. Collino, and P. Lailly, 1986, Prestack inversion of 1-D medium:
Proceedings of the IEEE, 498–508.
Kolsky, H., 1956, The propagation of stress pulses in viscoelastic solids:
Philosophical Magazine, 1, 693–710.
Komatitsch, D., and J. P. Vilotte, 1998, The spectral element method: An efficient tool to simulate the seismic response of 2D and 3D geological structures: Bulletin of the Seismological Society of America, 88, 368–392.
Koren, Z., K. Mosegaard, E. Landa, P. Thore, and A. Tarantola, 1991, Monte
Carlo estimation and resolution analysis of seismic background velocities:
Journal of Geophysical Research, 96, 20289–20299.
Kormendi, F., and M. Dietrich, 1991, Non-linear waveform inversion of
plane-wave seismograms in stratified elastic media: Geophysics, 56, 664–
674.
Krebs, J., J. Anderson, D. Hinkley, R. Neelamani, A. Baumstein, M. D.
Lacasse, and S. Lee, 2009, Fast full-wavefield seismic inversion using encoded sources: Geophysics, this issue.
Lailly, P., 1983, The seismic inverse problem as a sequence of before stack
migrations: Conference on Inverse Scattering, Theory and Application,
Society for Industrial and Applied Mathematics, Expanded Abstracts,
206–220.
Lambaré, G., 2008, Stereotomography: Geophysics, 73, no. 5, VE25–VE34.
Lambaré, G., and M. Alérini, 2005, Semi-automatic picking PP-PS stereotomography: Application to the synthetic Valhall data set: 75th Annual International Meeting, SEG, Expanded Abstracts, 943–946.
Lambaré, G., S. Operto, P. Podvin, P. Thierry, and M. Noble, 2003, 3-D ray ⫹
Born migration/inversion — Part 1: Theory: Geophysics, 68, 1348–1356.
Lambaré, G., J. Virieux, R. Madariaga, and S. Jin, 1992, Iterative asymptotic
inversion in the acoustic approximation: Geophysics, 57, 1138–1154.
Lecomte, I., 2008, Resolution and illumination analyses in PSDM: A raybased approach: The Leading Edge, 27, 650–663.
Lecomte, I., S.-E. Hamran, and L.-J. Gelius, 2005, Improving Kirchhoff migration with repeated local plane-wave imaging? A SAR-inspired signalprocessing approach in prestack depth imaging: Geophysical Prospecting,
53, 767–785.
Lee, K. H., and H. J. Kim, 2003, Source-independent full-waveform inversion of seismic data: Geophysics, 68, 2010–2015.
Lehmann, I., 1936, P⬘: Publications du Bureau Central Séismologique International, A14, 87–115.
Levander, A. R., 1988, Fourth-order finite-difference P-SV seismograms:
Geophysics, 53, 1425–1436.
Liao, Q., and G. A. McMechan, 1995, 2.5D full-wavefield viscoacoustic inversion: Geophysical Prospecting, 43, 1043–1059.
——–, 1996, Multifrequency viscoacoustic modeling and inversion: Geophysics, 61, 1371–1378.
Lions, J., 1972, Nonhomogeneous boundary value problems and applications: Springer-Verlag, Berlin.
Lucio, P. S., G. Lambaré, and A. Hanyga, 1996, 3D multivalued travel time
and amplitude maps: Pure and Applied Geophysics, 148, 113–136.
Luo, Y., T. Nissen-Meyer, C. Morency, and J. Tromp, 2009, Seismic modeling and imaging based upon spectral-element and adjoint methods: The
Leading Edge, 28, 568–574.
Malinowsky, M., and S. Operto, 2008, Quantitative imaging of the permomesozoic complex and its basement by frequency domain waveform tomography of wide-aperture seismic data from the polish basin: Geophysical Prospecting, 56, 805–825.
Malinowsky, M., A. Ribodetti, and S. Operto, 2007, Multiparameter fullwaveform inversion for velocity and attenuation — Refining the imaging
of a sedimentary basin: 69th Conference & Technical Exhibition, EAGE,
Extended Abstracts, P276.
Marfurt, K., 1984, Accuracy of finite-difference and finite-elements modeling of the scalar and elastic wave equation: Geophysics, 49, 533–549.
McMechan, G. A., and G. S. Fuis, 1987, Ray equation migration of wide-angle reflections from southern Alaska: Journal of Geophysical Research,
92, 407–420.
Menke, W., 1984, Geophysical data analysis: Discrete inverse theory: Academic Press, Inc.
Miller, D., M. Oristaglio, and G. Beylkin, 1987, Anew slant on seismic imaging: Migration and integral geometry: Geophysics, 52, 943–964.
Min, D.-J., and C. Shin, 2006, Refraction tomography using a waveform-inversion back-propagation technique: Geophysics, 71, no. 3, R21–R30.
Min, D.-J., C. Shin, R. G. Pratt, and H. S. Yoo, 2003, Weighted-averaging fi-
nite-element method for 2D elastic wave equations in the frequency domain: Bulletin of the Seismological Society of America, 93, 904–921.
Montelli, R., G. Nolet, F. Dahlen, G. Masters, E. Engdahl, and S. Hung, 2004,
Finite-frequency tomography reveals a variety of plumes in the mantle:
Science, 303, 338–343.
Mora, P. R., 1987, Nonlinear two-dimensional elastic inversion of multi-offset seismic data: Geophysics, 52, 1211–1228.
——–, 1988, Elastic wavefield inversion of reflection and transmission data:
Geophysics, 53, 750–759.
Mosegaard, K., and A. Tarantola, 1995, Monte Carlo sampling of solutions to
inverse problems: Journal of Geophysical Research, 100, 12431–12447.
Mulder, W. A., and B. Hak, 2009, Simultaneous imaging of velocity and attenuation perturbations from seismic data is nearly impossible: 71st Conference & Technical Exhibition, EAGE, Extended Abstracts, S043.
Nihei, K. T., and X. Li, 2007, Frequency response modelling of seismic
waves using finite difference time domain with phase sensitive detection
共TD-PSD兲: Geophysical Journal International, 169, 1069–1078.
Nocedal, J., 1980, Updating quasi-Newton matrices with limited storage:
Mathematics of Computation, 35, 773–782.
Nocedal, J., and S. J. Wright, 1999, Numerical optimization: Springer.
Nolet, G., 1987, Seismic tomography with applications in global seismology
and exploration geophysics: D. Reidel Publ. Co.
Oldham, R., 1906, The constitution of the earth: Quarterly Journal of the
Geological Society of London, 62, 456–475.
Operto, S., G. Lambaré, P. Podvin, and P. Thierry, 2003, 3-D ray-Born migration/inversion, Part 2: Application to the SEG/EAGE overthrust experiment: Geophysics, 68, 1357–1370.
Operto, S., C. Ravaut, L. Improta, J. Virieux, A. Herrero, and P. Dell’
Aversana, 2004, Quantitative imaging of complex structures from multifold wide aperture seismic data: A case study: Geophysical Prospecting,
52, 625–651.
Operto, S., J. Virieux, P. Amestoy, J. Y. L’Excellent, L. Giraud, and H. BenHadj-Ali, 2007, 3D finite-difference frequency-domain modeling of viscoacoustic wave propagation using a massively parallel direct solver: A
feasibility study: Geophysics, 72, no. 5, SM195–SM211.
Operto, S., J. Virieux, J. X. Dessa, and G. Pascal, 2006, Crustal imaging from
multifold ocean bottom seismometers data by frequency-domain fullwaveform tomography: Application to the eastern Nankai trough: Journal
of Geophysical Research, doi: 10.1029/2005JB003835.
Operto, S., J. Virieux, A. Ribodetti, and J. E. Anderson, 2009, Finite-difference frequency-domain modeling of viscoacoustic wave propagation in
two-dimensional tilted transversely isotropic media: Geophysics, 74, no.
5, T75–T95, doi: 10.1190/1.3157243.
Operto, S., S. Xu, and G. Lambaré, 2000, Can we image quantitatively complex models with rays?: Geophysics, 65, 1223–1238.
Paige, C. C., and M. A. Saunders, 1982a, Algorithm 583 LSQR: Sparse linear
equations and least squares problems: Association for Computing Machinery Transactions on Mathematical Software, 8, 195–209.
——–, 1982b, LSQR: An algorithm for sparse linear equations and sparse
least squares: Association for Computing Machinery Transactions on
Mathematical Software, 8, 43–71.
Pica, A., J. Diet, and A. Tarantola, 1990, Nonlinear inversion of seismic reflection data in laterally invariant medium: Geophysics, 55, 284–292.
Plessix, R.-E., 2006, A review of the adjoint-state method for computing the
gradient of a functional with geophysical applications: Geophysical Journal International, 167, 495–503.
——–, 2007, A Helmholtz iterative solver for 3D seismic-imaging problems:
Geophysics, 72, no. 5, SM185–SM194.
——–, 2009, 3D frequency-domain full-waveform inversion with an iterative solver: Geophysics, this issue.
Polak, E., and G. Ribière, 1969, Note sur la convergence de méthodes de directions conjuguées: Revue Française d’Informatique et de Recherche
Opérationnelle, 16, 35–43.
Pratt, R. G., 1990, Inverse theory applied to multi-source cross-hole tomography. Part II: Elastic wave-equation method: Geophysical Prospecting,
38, 311–330.
——–, 1999, Seismic waveform inversion in the frequency domain, Part I:
Theory and verification in a physical scale model: Geophysics, 64,
888–901.
Pratt, R. G., and N. R. Goulty, 1991, Combining wave-equation imaging with
traveltime tomography to form high-resolution images from crosshole
data: Geophysics, 56, 204–224.
Pratt, R. G., F. Hou, K. Bauer, and M. Weber, 2005, Waveform tomography
images of velocity and inelastic attenuation from the Mallik 2002 crosshole seismic surveys, in S. R. Dallimore, and T. S. Collett, eds., Scientific
results from the Mallik 2002 gas hydrate production research well program, Mackenzie Delta, Northwest Territories, Canada: Geological Survey of Canada.
Pratt, R. G., R. E. Plessix, and W. A. Mulder, 2001, Seismic waveform tomography: The effect of layering and anisotropy: 63rd Conference &
Technical Exhibition, EAGE, Extended Abstracts, P092.
Pratt, R. G., C. Shin, and G. J. Hicks, 1998, Gauss-Newton and full Newton
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
Full-waveform inversion
methods in frequency-space seismic waveform inversion: Geophysical
Journal International, 133, 341–362.
Pratt, R. G., L. Sirgue, B. Hornby, and J. Wolfe, 2008, Cross-well waveform
tomography in fine-layered sediments — Meeting the challenges of anisotropy: 70th Conference & Technical Exhibition, EAGE, Extended Abstracts, F020.
Pratt, R. G., Z. M. Song, P. R. Williamson, and M. Warner, 1996, Two-dimensional velocity model from wide-angle seismic data by wavefield inversion: Geophysical Journal International, 124, 323–340.
Pratt, R. G., and W. Symes, 2002, Semblance and differential semblance optimisation for waveform tomography:Afrequency domain implementation:
Sub-basalt imaging conference, Expanded Abstracts, 183–184.
Pratt, R. G., and M. H. Worthington, 1990, Inverse theory applied to multisource cross-hole tomography, Part 1: Acoustic wave-equation method:
Geophysical Prospecting, 38, 287–310.
Press, W. H., B. P. Flannery, S. A. Teukolsky, and W. T. Veltering, 1986, Numerical recipes: The art of scientific computing: Cambridge University
Press.
Prieux, V., S. Operto, R. Brossier, and J. Virieux, 2009, Application of acoustic full waveform inversion to the synthetic Valhall model: 77th Annual International Meeting, SEG, Expanded Abstracts, 2268–2272.
Pyun, S., C. Shin, and J. B. Bednar, 2007, Comparison of waveform inversion, Part 3: Phase approach: Geophysical Prospecting, 55, 465–475.
Ravaut, C., S. Operto, L. Improta, J. Virieux, A. Herrero, and P. Dell’
Aversana, 2004, Multi-scale imaging of complex structures from multifold wide-aperture seismic data by frequency-domain full-wavefield inversions: Application to a thrust belt: Geophysical Journal International,
159, 1032–1056.
Ribodetti, A., S. Operto, J. Virieux, G. Lambaré, H.-P. Valéro, and D. Gibert,
2000, Asymptotic viscoacoustic diffraction tomography of ultrasonic laboratory data: A tool for rock properties analysis: Geophysical Journal International, 140, 324–340.
Ribodetti, A., and J. Virieux, 1996, Asymptotic theory for imaging the attenuation factors Q p and Qs: Conference of Aixe-les-Bains, INRIA — French
National Institute for Research in Computer Science and Control, Expanded Abstracts, 334–353.
Riyanti, C. D., Y. A. Erlangga, R. E. Plessix, W. A. Mulder, C. Vuik, and C.
Oosterlee, 2006, A new iterative solver for the time-harmonic wave equation: Geophysics, 71, no. 5, E57–E63.
Riyanti, C. D., A. Kononov, Y. A. Erlangga, C. Vuik, C. W. Oosterlee, R. E.
Plessix, and W. A. Mulder, 2007, A parallel multigrid-based preconditioner for the 3D heterogeneous high-frequency Helmholtz equation: Journal of Computational Physics, 224, 431–448.
Robertsson, J. O., B. Bednar, J. Blanch, C. Kostov, and D. J. van Manen,
2007, Introduction to the supplement on seismic modeling with applications to acquisition, processing and interpretation: Geophysics, 72, no. 5,
SM1–SM4.
Romero, L. A., D. C. Ghiglia, C. C. Ober, and S. A. Morton, 2000, Phase encoding of shot records in prestack migration: Geophysics, 65, 426–436.
Saad, Y., 2003, Iterative methods for sparse linear systems, 2nd ed.: Society
for Industrial and Applied Mathematics.
Sambridge, M., and G. Drijkoningen, 1992, Genetic algorithms in seismic
waveform inversion: Geophysical Journal International, 109, 323–342.
Sambridge, M., and K. Mosegaard, 2002, Monte Carlo methods in geophysical inverse problems: Reviews of Geophysics, 40, 1–29.
Sambridge, M. S., A. Tarantola, and B. L. N. Kennett, 1991, An alternative
strategy for non linear inversion of seismic waveforms: Geophysical Prospecting, 39, 457–491.
Scales, J. A., P. Docherty, and A. Gersztenkorn, 1990, Regularization of nonlinear inverse problems: Imaging the near-surface weathering layer: Inverse Problems, 6, 115–131.
Scales, J. A., and M. L. Smith, 1994, Introductory geophysical inverse theory: Samizdat Press.
Sears, T. J., S. C. Singh, and P. J. Barton, 2008, Elastic full waveform inversion of multi-component OBC seismic data: Geophysical Prospecting, 56,
843–862.
Sen, M. K., and P. L. Stoffa, 1995, Global optimization methods in geophysical inversion: Elsevier Science Publ. Co., Inc.
Sheng, J., A. Leeds, M. Buddensiek, and G. T. Schuster, 2006, Early arrival
waveform tomography on near-surface refraction data: Geophysics, 71,
no. 4, U47–U57.
Shi, Y., W. Zhao, and H. Cao, 2007, Nonlinear process control of wave-equation inversion and its application in the detection of gas: Geophysics, 72,
no. 1, R9–R18.
Shin, C., and W. Ha, 2008, A comparison between the behavior of objective
functions for waveform inversion in the frequency and Laplace domains:
Geophysics, 73, no. 5, VE119–VE133.
Shin, C., and Y. H. Cha, 2008, Waveform inversion in the Laplace domain:
Geophysical Journal International, 173, 922–931.
——–, 2009, Waveform inversion in the Laplace-Fourier domain: Geophysical Journal International, 177, 1067–1079.
Shin, C., S. Jang, and D. J. Min, 2001a, Improved amplitude preservation for
WCC151
prestack depth migration by inverse scattering theory: Geophysical Prospecting, 49, 592–606.
Shin, C., and D.-J. Min, 2006, Waveform inversion using a logarithmic
wavefield: Geophysics, 71, no. 3, R31–R42.
Shin, C., S. Pyun, and J. B. Bednar, 2007, Comparison of waveform inversion, Part 1: Conventional wavefield vs. logarithmic wavefield: Geophysical Prospecting, 55, 449–464.
Shin, C., K. Yoon, K. J. Marfurt, K. Park, D. Yang, H. Y. Lim, S. Chung, and
S. Shin, 2001b, Efficient calculation of a partial derivative wavefield using
reciprocity for seismic imaging and inversion: Geophysics, 66, 1856–
1863.
Shipp, R. M., and S. C. Singh, 2002, Two-dimensional full wavefield inversion of wide-aperture marine seismic streamer data: Geophysical Journal
International, 151, 325–344.
Sirgue, L., 2003, Inversion de la forme d’onde dans le domaine fréquentiel de
données sismiques grand offset: Ph.D. thesis, Université Paris and
Queen’s University.
——–, 2006, The importance of low frequency and large offset in waveform
inversion: 68th Conference & Technical Exhibition, EAGE, Extended Abstracts, A037.
Sirgue, L., O. I. Barkved, J. P. V. Gestel, O. J. Askim, and J. H. Kommedal,
2009, 2D waveform inversion on Valhall wide-azimuth OBS: 71st Conference & Technical Exhibition, EAGE, Extended Abstracts, U038.
Sirgue, L., J. Etgen, and U. Albertin, 2007, 3D full waveform inversion: Wide
versus narrow azimuth acquisitions: 77th Annual International Meeting,
SEG, Expanded Abstracts, 1760–1764.
——–, 2008, 3D frequency domain waveform inversion using time domain
finite difference methods: 70th Conference & Technical Exhibition,
EAGE, Extended Abstracts, F022.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging:
A strategy for selecting temporal frequencies: Geophysics, 69, 231–248.
Smithyman, B. R., R. G. Pratt, J. G. Hayles, and R. J. Wittebolle, 2008, Near
surface void detection using seismic Q-factor waveform tomography:
70th Annual International Meeting, EAGE, Expanded Abstracts, F018.
Snieder, R., M. Xie, A. Pica, and A. Tarantola, 1989, Retrieving both the impedance contrast and background velocity: A global strategy for the seismic reflection problem: Geophysics, 54, 991–1000.
Song, Z., P. Williamson, and G. Pratt, 1995, Frequency-domain acousticwave modeling and inversion of crosshole data, Part 2: Inversion method,
synthetic experiments and real-data results: Geophysics, 60, 786–809.
Sourbier, F., A. Haiddar, L. Giraud, S. Operto, and J. Virieux, 2008, Frequency-domain full-waveform modeling using a hybrid direct-iterative solver
based on a parallel domain decomposition method: A tool for 3D fullwaveform inversion?: 78th Annual International Meeting, SEG, Expanded Abstracts, 2147–2151.
Sourbier, F., S. Operto, J. Virieux, P. Amestoy, and J. Y. L’Excellent, 2009a,
FWT2D: A massively parallel program for frequency-domain full-waveform tomography of wide-aperture seismic data — Part 1: Algorithm:
Computers and Geosciences, 35, 487–495.
——–, 2009b, FWT2D: A massively parallel program for frequency-domain
full-waveform tomography of wide-aperture seismic data — Part 2: Numerical examples and scalability analysis: Computers & Geosciences, 35,
496–514.
Stekl, I., and R. G. Pratt, 1998, Accurate viscoelastic modeling by frequencydomain finite difference using rotated operators: Geophysics, 63,
1779–1794.
Stolt, R. H., 1978, Migration by Fourier transform: Geophysics, 43, 23–48.
Taillandier, C., M. Noble, H. Chauris, and H. Calandra, 2009, First-arrival
travel time tomography based on the adjoint state method: Geophysics,
this issue.
Talagrand, O., and P. Courtier, 1987, Variational assimilation of meteorological observations with the adjoint vorticity equation, I: Theory: Quarterly
Journal of the Royal Meteorological Society, 113, 1311–1328.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49, 1259–1266.
——–, 1986, A strategy for nonlinear inversion of seismic reflection data:
Geophysics, 51, 1893–1903.
——–, 1987, Inverse problem theory: Methods for data fitting and model parameter estimation: Elsevier Science Publ. Co., Inc.
Thierry, P., G. Lambaré, P. Podvin, and M. Noble, 1999a, 3-D preserved amplitude prestack depth migration on a workstation: Geophysics, 64,
222–229.
Thierry, P., S. Operto, and G. Lambaré, 1999b, Fast 2D ray-Born inversion/
migration in complex media: Geophysics, 64, 162–181.
Thomsen, L. A., 1986, Weak elastic anisotropy: Geophysics, 51, 1954–1966.
Toksöz, M. N., and D. H. Johnston, 1981, Seismic wave attenuation: SEG.
Tromp, J., C. Tape, and Q. Liu, 2005, Seismic tomography, adjoint methods,
time reversal, and banana-doughnut kernels: Geophysical Journal International, 160, 195–216.
Tsvankin, I., 2001, Seismic signature and analysis of reflection data in anisotropic media: Elsevier Scientific Publ. Co., Inc.
van den Berg, P. M., and A. Abubakar, 2001, Contrast source inversion meth-
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/
WCC152
Virieux and Operto
od: State of the art: Progress in Electromagnetics Research, 34, 189–218.
Vigh, D., and E. W. Starr, 2008a, 3D plane-wave full-waveform inversion:
Geophysics, 73, no. 5, VE135–VE144.
——–, 2008b, Comparisons for waveform inversion, time domain or frequency domain?: 78th Annual International Meeting, SEG, Expanded Abstracts, 1890–1894.
Vigh, D., E. W. Starr, and J. Kapoor, 2009, Developing earth model with full
waveform inversion: The Leading Edge, 28, 432–435.
Virieux, J., 1986, P-SV wave propagation in heterogeneous media, velocitystress finite difference method: Geophysics, 51, 889–901.
Virieux, J., S. Operto, H. B. H. Ali, R. Brossier, V. Etienne, F. Sourbier, L. Giraud, and A. Haidar, 2009, Seismic wave modeling for seismic imaging:
The Leading Edge, 28, 538–544.
Vogel, C., 2002, Computational methods for inverse problems: Society of Industrial and Applied Mathematics.
Vogel, C. R., and M. E. Oman, 1996, Iterative methods for total variation denoising: SIAM Journal on Scientific Computing, 17, 227–238.
Wang, Y., and Y. Rao, 2009, Reflection seismic waveform tomography: Journal of Geophysical Research, doi: 10.1029/2008JB005916.
Warner, M., I. Stekl, and A. Umpleby, 2007, Full wavefield seismic tomography — Iterative forward modeling in 3D: 69th Conference & Technical
Exhibition, EAGE, Extended Abstracts, C025.
——–, 2008, 3D wavefield tomography: synthetic and field data examples:
78th Annual International Meeting, SEG, Expanded Abstracts,
3330–3334.
Williamson, P., 1991, A guide to the limits of resolution imposed by scatter-
ing in ray tomography: Geophysics, 56, 202–207.
Woodhouse, J., and A. Dziewonski, 1984, Mapping the upper mantle: Three
dimensional modelling of earth structure by inversion of seismic waveforms: Journal of Geophysical Research, 89, 5953–5986.
Woodward, M. J., 1992, Wave-equation tomography: Geophysics, 57,
15–26.
Woodward, M. J., D. Nichols, O. Zdraveva, P. Whitfield, and T. Johns, 2008,
A decade of tomography: Geophysics, 73, no. 5, VE5–VE11.
Wu, R. S., 2003, Wave propagation, scattering and imaging using dual-domain one-way and one-return propagation: Pure and Applied Geophysics,
160, 509–539.
Wu, R. S., and K. Aki, 1985, Scattering characteristics of elastic waves by an
elastic heterogeneity: Geophysics, 50, 582–595.
Wu, R.-S., and M. N. Toksöz, 1987, Diffraction tomography and multisource
holography applied to seismic imaging: Geophysics, 52, 11–25.
Yilmaz, O., 2001, Seismic data analysis: processing, inversion and interpretation of seismic data: SEG.
Zelt, C., and P. J. Barton, 1998, Three-dimensional seismic refraction tomography: A comparison of two methods applied to data from the Faeroe basin: Journal of Geophysical Research, 103, 7187–7210.
Zelt, C. A., R. G. Pratt, A. J. Brenders, S. Hanson-Hedgecock, and J. A. Hole,
2005, Advancements in long-offset seismic imaging: A blind test of traveltime and waveform tomography: Eos, Transactions of the American Geophysical Union, 86, Abstract S52A-04.
Zhou, B., and S. A. Greenhalgh, 2003, Crosshole seismic inversion with normalized full-waveform amplitude data: Geophysics, 68, 1320–1330.
Downloaded 04 Dec 2009 to 193.50.85.151. Redistribution subject to SEG license or copyright; see Terms of Use at http://segdl.org/