AFSABRFinal
AFSABRFinal
AFSABRFinal
net/publication/264718376
Arbitrage-free SABR
CITATIONS READS
46 13,317
4 authors, including:
Diana Woodward
Saint Thomas University
28 PUBLICATIONS 867 CITATIONS
SEE PROFILE
All content following this page was uploaded by Patrick S Hagan on 12 October 2017.
Abstract. Smile risk is often managed using the explicit implied vol formulas developed for the SABR model. These
asymptotic formulas are not exact, and this can lead to arbitrage for low strike options. Here we provide an alternate method
for pricing options under the SABR model: We use asympotic techniques to reduce the SABR model from two dimensions to
one dimension. This leads to an effective one dimensional forward equation for the probability density which has the same
asymptotic order of accuracy as the explicit implied vol formulas. We obtain arbitrage-free option prices by numerically solving
this PDE. The implied volatilities obtained from the numerical solutions closely match the explicit implied volatility curves,
apart from ultra-low strikes. For very low strikes, the implied absolute (normal) vol dips downwards, closely matching market
observations. We also show how negative rates can be accomodated by replacing the factor with ( + ) .
1. The SABR model. European option prices are often quoted by using the normal model. In this
model the forward asset price ̃ () follows the process
and the (forward) price of European calls and puts works out to be
µ ¶ µ ¶
− √ −
(1.1b) = [ − ] √ + √
µ ¶ µ ¶
− √ −
(1.1c) = [ − ] √ + √
Here (·) is the cumulative normal distribution and (·) is the Gaussian density. Both and are
increasing functions of the volatility . Consequently, European option prices can be quoted by stating the
implied normal vol (aka absolute vol or bps per year vol), the unique value of that yields the option’s
dollar price when substituted into these formulas.
Alternatively, in Black’s model the forward asset price is modeled by
Here, too, there is a one-to-one relation between the European option price and the Black (log normal) vol
, so option prices can be quoted in terms of Implied Black vols and implied normal vols are
equivalent, and the mapping between and is well understood, so in this article we focus on the normal
vols .
If the normal model correctly described the behavior of the asset price, then the same implied volatility
would correctly price options with different strikes and times-to-expiry . In practice, matching
market prices requires substantially different implied vols for options with different strikes and expiries,
= ( ). For a given expiry , the implied vol as a function of is the options’s smile, or
skew. Handling the smile and skew judiciously is critical to an options desk, since the risks of options at
all different strikes have to be consolidated before the risks can be hedged efficiently. Although offsetting
risks of options with different expiries is less common, handling the volatility surface (the dependence of
on both and ) is also be important for correctly pricing path dependent options, such as mid-curve
options.
1
The need for handling smile and skew risk effectively led to the development of the SABR models[1-
SABR]. These are models of the form
with
where most commonly ( ) is taken to be . In [1-SABR], singular perturbation techniques were used
to analyze this class of models in the small volatility regime. To carry out the expansion systematically, a
small parameter was introduced,
The model was analyzed in the 1 limit, and then was set to 1 in the final result1 . This analysis was
used to obtain explict formulas for the implied vols of European options under the SABR model. There are
now several variants of these formulas [1-5, SABR variants], all correct through (2 ), but our favorite is
µ ¶ ½ ∙ ¸ ¾
( − ) 2 1 ( ) − () 2 − 32 2 2
(1.5a) () = · · 1 + + 4 + + · · ·
R 0 () − 24
( 0 )
Here = ̃ (0) = ̃(0) are today’s value of the foward price and the volatility, is the time to exercise,
Z Ãp !
0 1 − 2 + 2 − +
(1.5b) = () = log
( 0 ) 1−
Although there are simpler formulas, this one seems to be the most robust.
The classic SABR model is the special case ( ) = . For this case the implied vol formula reduces
to
µ ¶ ½ ∙ ¸ ¾
(1 − ) ( − ) 2 1 − 2 − 32 2 2
(1.6a) () = · · 1 + + 4 + + · · ·
1− − 1− () − 24
where
Ãp !
1− − 1− 1 − 2 + 2 − +
(1.6b) = () = log
1− 1−
1 Although this appears inconsistent, it is equivalent to non-dimensionalizing the problem, expanding in the low volatility
regime, and then re-writing the answers in terms of the original dimensioned variables.
2
and
2 µ 1−
¶
(1 − ) 2 − 1−
(1.6c) = 2 log ( )
( 1− − 1− ) (1 − ) ( − )
Two problems have developed using these explicit implied vol formulas. First, for some low strike, long
dated options, the explict implied vol formulas can lead to arbitrageable prices; this is discussed in the next
section. Second, the SABR model has a barrier wherever ( ) = 0; this is at = 0 in the classical SABR
model. There is an () thick region next to this boundary, a region in which the original asymptotic
analysis does not pertain. In the current, ultra-low rate environment, this region has a substantial influence
on pricing, especially with the advent of zero strike options.
In this article we address both problems. We use singular pertubation methods to show that the reduced
density,
n ¯ o
¯
(1.7) ( ) = Prob ̃ ( ) + ¯ ̃ () = ̃() =
This reduction is not exact, but is accurate through (2 ), the same accuracy as the explicit implied vol
formulas. Our approach is to solve the PDE numerically to obtain the probability density ( ) at the
exercise date ; the call and put prices are then obtained by integrating to find the expected value of the
payoffs.
To simplify notation, let
p 1 2
(1.9) ( ) = 1 + 2( ) + 2 2 2 ( ) 2 Γ( )( −) ( )
Equations 1.10a - 1.10c form a well posed problem for the density ( ). Since the boundaries are
absorbing, the probability density will develop -functions at the boundaries min , max in addition to the
continuous density ( ). Crudely speaking,
⎧
⎨ ( )( − min ) at = min
(1.11) ( ) = ( ) for min max
⎩
( )( − max ) at = max
The rate at which probability accumulates at min and max is determined by the flux reaching the barriers
from the interior:
£ ¤
(1.12a) = lim+ 12 2 2 2 ( )
→min
£ ¤
(1.12b) = − lim+ 12 2 2 2 ( )
→max
We shall see that this ensures that the combined probability in ( ), ( ), and ( ) totals 1. Clealy
the initial condition is
(1.12c) ( ) → 0 ( ) → 0 as →
Of course if the probability ( ) at the upper boundary is significant, max should be increased.
At first glance, the delta function at min looks unusual. However, consider a situation in which the
( ) has been modified to prevent the forward from becoming negative, so that the effective volatility
½
( ) for
(1.13) ̃( ) =
(0 ) for 0
is used in place of ( ). This situation is analyzed in Appendix B. We find that for ¿ 1, there is a thin
boundary layer in 0 which has very high densities ( ). In the limit → 0 the total probability
in 0 is ( ), and yet 2 ( )( ) goes to zero at we approach from above:
Z
(1.14) lim+ 2 ( )( ) → 0 ( ) → ( ) as → 0
→ 0
We believe that this is representative of the general situation: when the forward ̃ ( ) is near enough to
the boundary, other mechanisms come into play; after all, there must be some reason that ̃ ( ) doesn’t cross
the boundary. For example, interest rates in zero, near zero, and even slightly negative rate environments,
can be expected to behave differently than rates in more moderate regimes. If we put in a detailed model for
these boundary mechanisms, and looked on a fine enough scale, then the delta functons should be resolved
into structured boundary layers. Yet the total probability in the layer should match ( ), as this is required
by conservation, and as we move away from the boundary, ( ) should transition into the solution of
1.10a with the absorbing boundary condition 1.10b, as this is required for ̃ ( ) to be a Martigale.
We use a moment-preserving Crank-Nicholson scheme to solve the PDE 1.10a - 1.10c numerically
for , while simultaneously integrating the ODE’s 1.12a - 1.12c. Once we have obtained
4
( ) ( ) and ( ) we can obtain the option prices for all strikes by integration:
Z max
(1.15a) ( ) = ( − ) ( ) + (max − ) ( )
Z
(1.15b) ( ) = ( − min ) ( ) + ( − ) ( )
min
If we wish, we can then find the implied normal vol that matches these prices at each . That is, by solving
the PDE once, we obtain the smile for all at .
We shall find that the implied volatilities obtained from these numerical solutions closely match the
explicit implied volatility formulas, apart from ultra-low strikes. For very low strikes, the implied absolute
(normal) vol dips downwards, closely matching market behavior.
We also show show how negative rates can be accomodated by replacing with ( + ) to move the
barrier below zero.
2. Arbitrage using the explicit formulas for the SABR model. The explicit implied vol formulas
make the SABR model easy to implement, calibrate, and use. These implied volatility formulas are usually
treated as if they are exactly correct, even though they are derived from an expansion which requires that
√ √ √
, and | − | be not too large. The unstated argument is that instead of treating
these formulas as an (accurate) approximation to the SABR model, they should be treated as the exact
solution to some other model which is well approximated by the SABR model.
This is a valid viewpoint as long as the option prices obtained using the explicit formulas 1.5a-1.5c for
() are arbitrage free. There are two key requirements for these prices to be arbitrage free [6, Dupire].
The first is call-put parity, which holds automatically since we are using the same implied vol () for
both calls and puts. The second is that the probability density implied by the call and put prices needs to
be positive. To explore this, note that the call and put values can be written quite generally as
Z ∞ Z
(2.1) ( ) = ( − ) ( ) () = ( − ) ( )
−∞
where ( ) is the probability density at the exercise date (including any delta functions). Clearly,
2 2
(2.2) 2
() = () = ( ) ≥ 0
2
For the explicit implied vol formulas 1.5a-1.5c to represent an arbitrage free model, then, we need
2 2
(2.3) ( () ) = ( () ) ≥ 0 for all
2 2
That is, there cannot be a “butterfly arbitrage.”
It is not terribly uncommon for this requirement to be violated for very low strike options for sufficiently
large . The problem does not appear to be the quality of the call and put prices obtained from the explicit
implied vol formulas, because these usually remain quite accurate. Rather, the problem seems to be that
implied volatility curves are not a stable representation of option prices for low strike options. It is very
easy to find nearly identical, reasonable looking, volatility curves (), for which some of the curves are
arbitrage free and others violate the arbitrage-free constraint of eq. 2.3.
This is illustrated in Figure 2.1. There the smile () obtained from the explicit formula 1.5a-1.5c is
graphed (explicit) along with a very similar smile (arb free) obtained from the arbitrage free procedure. These
two smiles look very similar, and lead to nearly identical option prices, as shown in Figure 2.2. Differentiating
5
Implied normal vol
100%
80%
60%
40% Explicit
Arb Free
20%
0%
0 0.5 1 1.5 2 2.5 3
Strike
Fig. 2.1. The implied normal vol for the SABR model for = 35% = 025 = −10% and = 100% Shown are
() from the explicit formula and from the arb-free approach for = 1yr.
Fig. 2.2. Call and put values from the SABR model for = 35% = 025 = −10% = 100%. Shown are the put
prices (for ) and call prices (for ) obtained from the explicit formulas and the arb-free approach for = 1yr.
these option prices with respect to the strike yields the probability densities shown in Figure 2.3. The
“explicit” smile () leads to negative probabilities for ultra low strikes, and so is not arbitrage free, whilst
the second curve has only positive probabilities, and is arbitrage free.
Using implied Black (log normal) vols () instead of implied normal vols does not help. Figure 2.4
compares the implied Black vols from the explict formulas for () with those from the arb free approach.
There is no obvious way to discern that one curve leads to arbitrage free prices, while the other does not.
6
Probability density
1.5
Explicit
1.0
Arb Free
0.5
0.0
0 0.5 1 1.5 2 2.5
‐0.5
‐1.0
Strike
Fig. 2.3. Probabiity density for the SABR model for = 35%, = 025 = −10% and = 100% Shown are the
densities obtained from the explicit formulas for () and from the arb free approach for = 1yr.
3.1. The effective forward equation. Here we present an alternative pricing approach which is
arbitrage free and retains the (2 ) accuracy of the original SABR analysis. We believe that variations of
this approach have been used by other firms [7, Anderson, 8, the French team].
Consider the probability density that ̃ ( ) = and ̃( ) = at time , given that we start at ̃ () =
7
and ̃() = at time :
(3.1a) n ¯ o
¯
( ; ) = Prob ̃ ( ) + ̃( ) + ¯ ̃ () = ̃() =
This density satisfies the Fokker-Planck equation (the forwards Kolmogorov equation),
£ ¤ £ ¤ £ ¤
(3.1b) = 12 2 2 2 ( ) + 2 2 ( ) + 12 2 2 2 for
In Appendix A we define the reduced (marginal) probability density,
Z ∞
(3.2) ( ) = ( ; )
0
which is the probability density that ̃ ( ) = at time regardless of the value of ̃( ). Although is also
a function of the backwards variables , for clarity we have omitted explicitly showing this dependence.
In appendix A we use singular perturbation techniques to analyze the Fokker-Planck equation 3.1b. This
analysis shows that the marginal density satisfies the PDE
£ ¤
(3.3a) = 12 2 2 2 ( ) for
for all , so
( Z )
max
(3.7) ( ) + ( ) + ( ) = 0
min
£ ¤
(3.9a) = lim+ 12 2 2 2 ( )
→min
£ ¤
(3.9b) = lim− − 12 2 2 2 ( )
→max
Therefore,
Z max
( ) ( )
(3.11) min + ( ) + max = 0
min
Substituting 3.3a for and integrating by parts twice and using equations 3.9a, 3.9b leads to
¯max
(3.12) 2 ( ) ¯min = 0
£ ¤
(3.14c) = lim+ 12 2 2 2 ( )
→min
£ ¤
(3.14d) = lim− − 12 2 2 2 ( )
→max
3.3. Option pricing. In Appendix C we sketch out a Crank-Nicholson scheme [NumRecInC] for solving
3.14a - 3.14e for ( ), ( ) and ( ). This scheme conserves probability and the expected value of
̃ ( ), so equations 3.6 and 3.10 remain exactly true for the numerical solution. Once we have solved these
equations numerically, the call and put prices are obtained by integrating:
The conservation of probablity 3.6 and Martingale proprerty 3.10 show that call-put parity holds exactly for
the numerical solution:
Since the effective forward equation has only one space dimension, solving the PDE is essentially instan-
taneous. Moreover, solving these equations for yields the option prices for all strikes . Thus
the implied normal vols () for all strikes can be obtained by solving the PDE once.
The maximum principle for parabolic equations [Protter & Weinberg] guarantees that ( ) ≥ 0 for
min max , and that ( ) and ( ) are increasing, and hence positive. For fine enough grids,
the numerical solutions will also be non-negative. Since call-put parity is also satisfied, the numerical option
prices are arbitrage free [6, Dupire].
4. Discussion.
4.1. Results. Singular perturbation techniques can be used to analyze the effective forward equation,
£ ¤
(4.1a) = 12 2 2 2 ( ) for
where
¡ ¢ 2
(4.1b) ( ) = 1 + 2 + 2 2 2 Γ( )( −) 2 ( )
10
with
Z
1 0 ( ) − ( )
(4.1c) ( ) ≡ Γ( ) ≡
( 0 ) −
From this analysis we can obtain explicit option prices, and these prices can be used to find explicit formulas
for the implied volatility (). Away from the boundaries min and max , this analysis would lead to
the same explicit implied vol formulas 1.5a-1.5c as before, at least if we continue to work through (2 ).
Thus, away from the boundaries, the “arbitrage free” implied volatility (obtained by numerically solving the
effective forward equation) should match the explicit implied volatility formulas to within (2 ). This is
indeed our experience. Even in relatively extreme cases, such as Figure 2.1, the “arbitrage free” and explicit
implied volatility smiles match closely, except when the strike or the forward gets too close to the
boundary at min .
For values of within () of the lower boundary min , a non-negligible percentage of paths that would
have reached hit the boundary. This creates an () thick boundary layer at min , in which the explicit
implied volatity formulas do not pertain. Instead, as the strike approaches min , the implied normal volatility
() bends towards zero. See Figure 2.1
Figure 4.1 shows the effect of the boundary layer on the normal volatility of at-the-money options. As
today’s forward decreases to within () of min , an increasing percentage of the paths reach the boundary
before the expiry date, which reduces the ATM volatility. Seeing the “knee” in this graph, one might naively
believe that the market switches from a normal regime to a log normal regime when the forward is sufficiently
small. This is an illusion; this graph comes from the SABR model with = 0 and = 0, which is a stochastic
normal model. The reduction is caused solely by the boundary.
40%
30%
20%
10%
0%
0 0.25 0.5 0.75 1 1.25 1.5
Forward
Fig. 4.1. The at-the-money implied normal vol () at = 1yr for the arbitrage free SABR model with = 35%
= 0 = 0and = 100%.
Below are the smiles () obtained for different values of the forward , using the same SABR
parameters in each case.
Market data for at-the-money swaption volatilities exhibit this “knee”: Historical studies comparing
ATM normal vols with forward rates show that when the forward rate is above a critical value, the ATM
11
Smiles at different forwards
1.2
1.0
0.8 0.1
0.6 0.5
0.4 1
0.2 1.5
0.0 2
0 0.5 1 1.5 2 2.5 3 3.5
Strikes
Fig. 4.2. The smiles () at = 1yr for different values of the forward for the SABR model with = 35% = 0
= 0% and = 100%.
normal vols are reasonably constant; for forward rates below the critical value, the ATM vols decrease linearly
with the rate. See figure ??.
Figure caption: The normal vols for 1Y into 1Y at the money swaptions vs the forward swap rate.
Shown is the historic data for 2002 through 2012 for USD, GBP, EUR, and JPY swaptions. Also shown is
the implied volatility obtained from the SABR model for = 65% = 025 = 0%, and = 75%.
Typically this knee is explained as the market switching from normal to log normal behaviour in ultra-
low rate environments. This belief is often reinforced by using the explicit implied vol formulas to calibrate
the SABR model. Calibrating the explicit implied vol formulas to observed smiles can lead to relatively high
values of and/or for low forward rates. Since high values of and push up the high strike vols, this
can create significant mis-pricing for instruments which are sensitive to high strikes, like constant maturity
caps, floors, and swaps. To counter this, some firms have chosen to use
½
for 0
(4.2) ( ) =
0 for 0
in place of . We believe that our approach provides a more natural explanation, since the knee occurs
automatically, without requiring gross changes between the SABR parameters for low and moderate rate
environments.
4.2. Hedging. The coefficients in the effective forward equation 4.1a - 4.1c depend on the current
forward as well as . This means that
· there is no obvious “effective backwards equation” equivalent to the effective forward equation;
· the effective forward equation is not the Fokker-Planck equation (forward Kolmogorov equation) for some
one dimensional Ito process.
I.e., there is not an effective one dimensional local volatility model corresponding to the effective forward
equation. This should have been anticipated as the SABR model was created because of perceived short-
comings of local volatility models[1, SABR].
12
When the SABR model is calibrated to market data, it is very difficult to distinguish between and
; both control the skew. If we fix , say, and calibrate the rest of the parameters, the quality of the fit is
usually pretty much independent of the particular value of chosen. This is illustrated in Figure 4.3. There
we have chosen = 0, = 12 and = 1, and calibrated the SABR model to the same market data for all
three cases, which yields the following set of SABR parameters.
318% 329% 351%
0 05 1
−183% −455% −644%
0777 0867 0985
Although the tail of the smiles are somewhat different, all three sets of parameters seem to fit the actual
market data.
Calibrated smiles
80%
60% 0
40% 0.5
1
20%
data
0%
0 0.5 1 1.5 2 2.5 3
Strikes
1
Fig. 4.3. The SABR model calibrated to the same market data for = 0 = 2
, and = 1. Because can largely
compensate for , all three fits are well within market noise
The conventional delta risk is calculated by shifting the current forward and keeping the current
volatility fixed:
(4.3) → + ∆ →
If the delta risk is calculated in the conventional way, then the delta depends on the particular value of
used. This is shown in Figure 4.4. There we have calculated the conventional delta of a call option as a
function of the strike for the same three sets of SABR parameters. Even though all three sets lead to
essential the same smile (especially for strikes which are not too extreme), we see that the different choices
of have led to different values of delta. This means that if we hedge our positions using the conventional
delta, choosing a poor may lead to a poor hedge, even though it may lead to a superb fit of the market
data.
This issue led to an alternative approach for calculating delta hedges. Since ̃ and ̃ are correlated,
when ̃ changes then ̃ changes as well, at least on average. It is argued that accounting for this shift should
result in a better hedge [Bartlett]:
(4.4) → + ∆ → + ∆
13
Conventional delta
1.00
0.80
0
0.60
0.5
0.40
1
0.20
0.00
0 0.5 1 1.5 2
Strikes
Fig. 4.4. Conventional delta, as a function of the strike for = 0, = 12 and = 1. For each , the other
three parameters and are chosen to match the arb free SABR smile to the market data.
Therefore, changes in ̃( ) can be split into two independent components, one caused by the changes in
̃ ( ), and one due to the idiosyncratic changes in the volatility ̃( ). Accordingly, the delta hedge should
be calculated with respect to the scenario [Bartlett]
(4.7) → + ∆ →+ ∆
( )
Figure 4.5 shows this alternative delta as a function of the strike for the same three sets of SABR
parameters used above. We calculated these deltas by simply bumping the and values input into the
pricing code according to 4.7. We see that this alternative delta is nearly independent of the particular value
of chosen. This has proven true for all cases we have investigated: as long as the SABR model fits market
data decently, the alternative delta is nearly independent of the particular values of or used in the fitting.
Apparently this new delta depends mostly on the actual market smile, and not how the smiles and skews
are represented in the model.
It is generally accepted that hedges based on the alternative delta are superior to the conventional delta
hedge when a desk is only using delta to hedge. When a desk is hedging both delta and vega, which delta
hedge is used is irrelevent provided one doesn’t double count when putting on the vega hedge.
Appendix A. Derivation of the effective forward equation.
14
Alternative delta
1.00
0.80
0
0.60
0.5
0.40
1
0.20
0.00
0 0.5 1 1.5 2
Strikes
Fig. 4.5. Alternative delta, +[( )] for = 0, = 12 , and = 1. For each , the other three parameters
and are chosen to match the arb free SABR smile to the market data.
Clearly the zeroeth moment (0) is the probability density of being at at time ,
(A.2b) ( ) = (0) ( ; )
regardless of the value of ̃( ).
The density satisfies the Fokker-Planck equation
£ ¤ £ ¤ £ ¤
(A.3a) = 12 2 2 ( )2 + 2 ( )2 + 12 2 2 2 for all
with the initial condition
(A.3b) = ( − )( − ) for all → +
Now
Z ∞£ ¤ £ ¤ ¯=+∞
(A.4a) ( )2 = ( )2 ¯=0 = 0
0
Z ∞
£ 2 ¤ £ 2 ¤ ¯=+∞
(A.4b) = ¯=0 = 0
0
15
for all . This just states that there is no probability flux across the boundaries at = 0 and = ∞; i.e.,
that probability is conserved. Integrating the Fokker-Planck equation across all now yields
h i
(0)
(A.5) = 12 2 2 ( )(2) for
I.e., the evolution of the reduced density (0) depends on the second moment (2) .
Under the SABR model these moments satisfy the backward Kolmogorov equation
() () ()
(A.6a) + 12 2 2 2 ( ) + 2 2 ( ) + 12 2 2 2
()
=0 for
We will successively transform this equation order-by-order until all the derivatives are negligibly small.
This effectively reduces the problem from two dimensions ( and ) to one dimension ( only). Instead
of constructing explicit asymptotic solutions to the resulting one dimensional problem, as was done in the
original SABR paper [1, SABR], here we seek to write (2) in terms of (0) . This provides the “constituitive
law” needed to close the “conservation law” A.5, which is then the effective forward equation. Throughout
we work through (2 ), neglecting higher order terms.
Since the backwards equation is autonomous, the moments () only depend on
(A.7) = −
Then
−1 −1
(A.9a) −→ = −→ −
( ) ()
½ ¾
2 1 2 0 ()
(A.9b) 2
−→ 2 2 2 −
() 2 ()
½ ¾
2 1 2 2 1
(A.9c) −→ − + +
() 2
2 2 2 2 2 2 2
(A.9d) 2
−→ 2
− + 2 2+ 2
16
and
1
(A.9e) ( − ) = (( )) = ()
(0)
−1 ()
(A.11) () ( ) = ̂ ( )
(0)
() 1
¡ ¢ () 0 () ()
̂ = 2 1 + 2 + 2 2 2 ̂ − 12 ̂
¡ ¢ ¡ () ¢
(A.12a) () ()
− + 2 2 ( − 2) ̂ − + 2 2 ̂
n o
() ()
+ 12 2 2 2 ̂ + 2( − 1)̂ + ( − 1)( − 2)̂() for 0
with
To leading order in ()the equation and initial condition for ̂() ( ) are
() 1
¡ ¢ () 0 () ()
̂ = 2 1 + 2 + 2 2 2 ̂ − 12 ̂
¡ ¢ ()
(A.15) ()
− + 2 2 ( − 2) ̂ − ̂
()
Then
½ ¾
p 0 ()
(A.16b) ̂() () 1
= ()(0) + 2
½ µ ¶ ¾
p 0 () 00 0 0
() () 2 2 1 1 ()
(A.16c) ̂ = ()(0) + + 2 −4 2
½ ¾
()
p () 0 () 1 0 () 1 0 () 2
1
(A.16d) ̂ = ()(0) + 2 + 2 + 2 + ( )
So
¡ ¢ () ¡ ¢ 0
(A.17a) () = 1
21 + 2 + 2 2 2 − ( − 2) + 2 2 () + 12 2 ()
½ µ ¶¾
1 2 2 1 2 0 2 2 1
00 0 0
3
+ 2 ( − 1) ( − 2) − 2 ( − 1) + 4 −8 2 ()
() 0
− − 12 2 2 ()
with
then () would be independent of until (2 ), as indicated. Thus, the last two terms in eq. A.17a are
both asymptotically smaller then (2 ), and can be neglected. Hence, we can write
¡ ¢ () ¡ ¢ 0
(A.19a) () = 1
2 1 + 2 + 2 2 2 − ( − 2) + 2 2 () + 12 2 ()
½ µ ¶¾
0 00 0 0
1 2 2 1 2 2 2
+ 2 ( − 1) ( − 2) − 2 ( − 1) + 4 1 3
−8 2 ()
with
through (2 )
There are no longer any derivatives with respect to in eq. A.19a. Therefore can be treated as a
parameter instead of as a variable. I.e., the problem has been reduced to one spatial dimension, at least
through (2 ). In [1, SABR] we constructed explicit asymptotic solutions to eqs. A.19a, A.19b. Here we
take a different approach: We note that for = 2,
¡ ¢ (2) 1 2 0
(A.20a) (2) = 1
2 1 + 2 + 2 2 2 + 2 (2)
µ ¶
1 2 0 (2) 2 2 1
00
00
− 2 + 4 − 38 2 (2)
18
whilst the equation for the = 0 can be written as
h¡ ¢ i 0
(A.20b) (0) = 1
2 1 + 2 + 2 2 2 (0)
+ 12 2 (0)
µ ¶
0 00 0 0
1 2 (0) 2 2 3
+ 2 + 4 1
−8 2 (0)
Both satisfy the same initial condition
0 ()
(A.21b) Γ=− {1 + ()}
()
A precise choice of Γ will be made later. Since 2 Γ = (3 ) and 2 Γ = (3 ), eq. A.20b shows that
satisfies
¡ ¢ 0
(A.22a) = 1
2 1 + 2 + 2 2 2 + 12 2
µ ¶
0 00 0 0
1 2 2 2 1 3
− 2 + 4 −8 2
(A.22b) → () as → 0+
This is identical to the PDE and initial condition for (2) , so uniqueness allows us to conclude that and
(2) are the same through (2 ):
2 ¡ ¢
(A.23) (2) ( ) = (0) ( ) Γ 1 + 2 + 2 2 2
We now chase back through the transformations, and noting that 0 ()() is − 0 ( ) we obtain
© ª 2
(A.24a) (2) ( ; ) = 2 (0) ( ; ) 1 + 2 + 2 2 2 + Γ( −)
(A.24b) Γ = 0 ( ) {1 + ()}
Note that ( ) can be zero where, and only where, ( ) = 0 The change in the total probability in any
interval 1 2 is
Z 2
£ ¤ ¯2 £ ¤ ¯1
(B.3) ( ) = 12 2 2 2 ( ) ¯ − 12 2 2 2 ( ) ¯
1
so clearly the probability flux at any point is
£ ¤
(B.4) ( ) = − 12 2 2 2 ( )
It is natural to place the lower boundary at the barrier, where ( ), and hence ( ), is zero. So let us
first consider the cases where (min ) = 0 with
(B.5) ( ) ∼ const · ( − min ) as → min
Barriers have been studied extensively in stochastic processes and PDEs [Feller, others, Hagan − ].
If 0 12 , it is known that min is a regular boundary. Paths can both enter and leave the barrier,
and it is theoretically possible for probability to diffuse through the barrier, reaching the “forbidden region”
min . We do not consider any models in which paths reach the region / min below the boundary.
Any flux of probability from the interior min to the boundary min must accumulate as a delta function
at the boundary:
£ ¤
(B.6b) = lim+ 12 2 2 2 ( )
→min
For regular boundaries we also need to prescribe a boundary condition at min . Typically this boundary
condition would be absorbing,
+
(B.7a) 2 ( )( ) → 0 as → min
20
no flux,
£ 2 ¤ +
(B.7b) ( ) → 0 as → min
or even mixed
£ 2 ¤ +
(B.7c) ( ) − 2 ( )( ) → 0 as → min
To determine the correct boundary condition, note that the expected value of ̃ ( ) has to be constant,
n ¯ o Z ∞
¯
(B.8) ̃ ( )¯ ̃ () = ̃() = = min ( ) + ( ) =
min
Substituting B.2 for and B.6b for ( ), and integrating by parts shows that
½ Z ∞ ¾ Z ∞
£ 2 ¤
(B.10) min ( ) + ( ) = 12 2 2 ( )
min min
= lim+
− 12 2 2 2 ( )( )
→min
Therefore the requirement that ̃ ( ) be a Martingale means that ( ) must satisfy absorbing boundary
conditions at min ,
(B.11) lim
+
2 ( )( ) = 0
→min
If 12 1, then min is an exit boundary. In this case some paths reach the barrier in finite time,
but no paths leave the barrier. Therefore there is a finite amount of probability at the boundary min ,
£ ¤
(B.12b) = lim 12 2 2 2 ( )
+
→min
For exit boundaries, the probability density automatically satisfies the absorbing boundary condition
+
(B.12c) 2 ( )( ) → 0 as → min
as is well known [Feller]. Theoretically no boundary condition is needed at min , since the absorbing boundary
condition occurs automatically. In practice, however, the absorbing boundary condition should be applied
explicitly when solving the effective forward equation numerically, since most numerical finite difference
schemes engender a slight amount of numerical dispersion, meaning that (min ) is effectively slightly
positive. Even if we could develop and employ a dispersion-free finite difference scheme, applying this
boundary condition would be redundant, and not lead to any contradictions.
21
If ≥ 1, the barrier at min is an inaccessible, or natural, boundary. No paths can reach the boundary,
and the probability and flux both go to zero near the boundary,
£ 2 ¤ +
(B.13) 2 ( )( ) → 0 1
2 ( ) → 0 as → min
In theory, no delta function is needed at min , since no paths reach the boundary. In practice, due to
numerical dispersion, small amounts of probability reach the boundary. By incorporating a delta function
at the boundary,
(B.14a) ( ) = ( )( − min ) at = min
one can keep track of this probability,
£ ¤
(B.14b) = lim+ 12 2 2 2 ( )
→min
This ensures that probability is exactly conserved. Even if we had a dispersion-free numerical scheme, it would
just result in ( ) being exactly zero, so the function would be redundant, but not erroneous. Similarly,
we keep the absorbing boundary conditions at min , even though it should be satisfied automatically.
In summary, whenever there is a barrier at min , we use absorbing boundary bounary conditions
+
(B.15a) 2 ( )( ) → 0 as → min
and use a delta function
(B.15b) ( ) = ( )( − min ) at = min
with
£ ¤
(B.15c) = lim+ 12 2 2 2 ( )
→min
in all cases.
There may be situations in which it makes sense to place the boundary min at a point where (min ) 6=
0. For example, one may wish to put the boundary at min = 0, regardless of whether (min ) is zero or
not. In this case we must still use B.15a - B.15c at the boundary: Since we are not allowing any paths to go
below min , we have to allow a delta function at min with the probability at min increasing according to
B.15c, and to preserve the Martingale property of ̃ ( ) we need to use the absorbing boundary condition
B.15a
The upper boundary max should be set high enough so it has no appreciable effect on option prices;
typically setting max to be roughly 4 to 6 standard deviations above the forward suffices. This requires
Z max
1 0 2
(B.16a) (max ) = = sinh (cosh + sinh )
( 0 )
with
√
(B.16b) = 12 · (4 to 6)
|as shown in Appendix D. Although the boundary condition is irrelevent if max is chosen large enough, we
find it cleanest to treat the boundary at max the same as the boundary at min : We allow a delta function
at max
(B.17a) ( ) = ( )( − max ) at = max
22
where
£ ¤
(B.17b) = − lim− 12 2 2 2 ( )
→max
This ensures that probability is conserved exactly, and by examining the size of ( ), we can determine
whether the boundary max needs to be increased. We also use absorbing boundary conditions,
(B.17c) 2 ( )( ) → 0 −
as → max
where the superscript is being used to denote the continuous part of the density. Then ( ) satisfies
the boundary value problem
£ ¤
(B.19a) = 12 2 2 2 ( ) for min max
£ ¤
(B.19e) = lim+ 12 2 2 2 ( )
→min
£ ¤
(B.19f) = − lim− 12 2 2 2 ( )
→max
B.1. Boundary layer analysis. We believe that the delta function ( ) arises because when the
forward ̃ ( ) is near enough to the boundary, other mechanisms come into play. After all, there must
be some reason that ̃ ( ) doesn’t cross the boundary. To show how this could come about, consider the
effective forward equation
h i
(B.20a) = 12 2 2 ̃2 ( ) for 0 ∞
(B.20b) = ( − ) at → 0
23
where ( ) has been modified near the boundary:
½
() for 0
(B.20c) ̃( ) =
( ) for
In the limit → 0, we will recover the absorbing boundary condition and the delta function ( ).
In the region 0 , the new effective forward equation is
µ ¶2
1 £ 2 ¤
(B.21a) = 2 () for 0
(B.21b) =0 at → 0
µ ¶2
£ ¤ £ ¤
(B.22b) () lim− 2 ( ) = lim+ 2 ( )( )
→ →
We note that
Z µ ¶2 Z µ ¶2
( ) £ 2 ¤ £ ¤
(B.24) = ( ) = () = () lim 2
0 0 →−
so
( ) £ ¤
= lim+ 2 ( )( )
→
In the limit → 0, we have a finite probability ( ) in an infinitely thin region; i.e., a delta function:
with
( ) £ ¤
(B.25b) = lim+ 2 ( )( )
→0
Then
h 2 i
(B.27a) ̃ − ̃ = 0 for 0
2
24
where the constant is
(B.27b) = 12 2 2 2 () 0
where
p
−3 ± 1 + 4 2
(B.28b) 12 =
2
The integral
Z
(B.29) () 2
0
is infinite when Re {} ≥ 0. (It suffices to consider the region Re {} ≥ for any constant ; and then
using analytic continuation. See [CKP].) Therefore 2 () = 0 as ̃( ) must be integrable, and thus
1
(B.30) ̃( ) = 1 () () for 0
Since
and
£ ¤ h i
(B.32) lim+ 2 ( )( ) = (())2 1 () ()2+ 1
→
2
= (2 + 1 ) (()) 1 ()
we have
2 £ ¤
(B.33) 1
2 (())2 ( + ) = lim+ 2 ( )( )
2 + 1 →
In the limit that → 0, the right hand side goes to zero, and we obtain the absorbing boundary condition:
This ensures call-put parity, and provided that ( ) ≥ 0 for all , it also ensures that the numerical
solution itself represents an exactly arbitrage free model.
25
To simplify notation, define
¡ ¢ 2
(C.2a) ( ) = 12 2 2 2 ( ) = 12 2 2 1 + 2 + 2 2 2 Γ 2 ( )
where
Z
1 0 ( ) − ( )
(C.2b) ( ) = Γ=
( 0 ) −
We discretize so that
and define
¡ ¢
(C.4ba) ≡ min + − 12 for = 0 1 + 1
to be the midpoints of the intervals from ( − 1) to . In this process we adjust slightly so that occurs
exactly at the midpoint of it’s interval:
¡ ¢
(C.4c) ≡ 0 = min + 0 − 12
so that is the total probability in the grid cell at time step . We usually use around 200 to 500
points for our grid, and divide 0 into 30 to 100 timesteps.
C.2. Finite difference scheme. We integrate the effective forward equation using a Crank-Nicholson
scheme, which averages explicit and implicit centerred difference equations[8, NumRecInC]. Not only is this
scheme unconditionally stable, it is second order accurate in time.
To advance from timestep to + 1 we need to solve
© +1 +1 ª
(C.6) +1
− = +1 +1 − 2+1 +1
+1 +1
+ −1 −1
22
© ª
+1 +1 − 2 + −1
−1 for = 1 2
22
which we re-write as
© +1 +1 ª
(C.7a) +1
− 2
+1 +1 − 2+1 +1
+1 +1
+ −1 −1 =
2
© ª
+ 2 +1 +1 − 2 + −1
−1 for = 1 2
2
26
The absorbing boundary conditions yield
Note that the points = 0 and = + 1 fall outside the domain; these shadow points simply enable us to
obtain the correct boundary condition at the “true” boundaries min = 12 and max = +12 . Note that
this scheme only requires the solution of a tridiagonal system, so the computational work scales linearly with
and the number of timesteps
We also need to solve for the probabilities () and () at the left and right boundaries. Let
be the boundary probabilities at timestep . At each time step, after solving for +1
0 +1
1 +1
+1
+1 ,
+1 +1
we update the values and
© +1 +1 ª
(C.9a) +1
− = 1 1 − 0+1 +1
0 + 1 1 − 0 0
2
© +1 +1 ª
(C.9b) +1
− = − +1 +1 − +1 +1
+ +1 +1 −
2
Note that this is also second order accurte in time.
To set the initial condition, recall that we adjusted so that
¡ ¢
(C.10) = 0 ≡ min + 0 − 12
for some integer 0 when we set up the grid. The initial condition is simply
½
0 0 0 for 6= 0
(C.11) = 0 = 0 = 0
1 for = 0
C.3. Moments. It is easily seen that this scheme conserves probability. The total probabilty is
Z max
X
(C.12) + ( 0 ) 0 + ≡ + +
min =1
Summing equation C.7a over and using equations C.9a, C.9b, yields
X
¡ ¢ © +1 +1 ª
(C.13) +1
− = +1 +1 − +1 +1
− 1+1 +1
1 + 0+1 +1
0
=1
2
© ª
+ +1 +1 − − 1 1 + 0 0
2
¡ ¢ ¡ ¢
= − +1
− − +1 −
X
(C.15) + + = 1 for all
=1
Z max
X
(C.16) min + 0 ( 0 ) 0 + max ≡ min + + max
min =1
We multiply eq. C.7a by and sum over . Since +1 − 2 + −1 = 0 this yields
X ¡ ¢
(C.17) +1
−
=1
© +1 +1
ª
(C.18) = +1 +1 − 0 1+1 +1
1 − +1 +1 +1
+ 1 0+1 +1
0
2
©
ª
+1 +1 − 0 1 1 − +1 + 1 0 0
2
¡ ¢ X ¡ ¢ ¡ ¢
(C.19) min +1
−
+ +1
− + max +1
−
=1
¡ +1 +1 ¢ ¡ +1 +1 ¢
= 1 1 + 0+1 +1
0 − +1 +1 + +1 +1
4 4
¡ ¢
+ (1 1 + 0 0 ) − +1 +1 +
4 4
The absorbing boundary conditions C.7b, C.7c ensure that the right hand side is zero, so the first moment
is conserved for each time step :
X
X
(C.20) min +1
+ +1
+ max +1
= min + + max
=1 =1
Hence,
X
(C.21) min + + max = 0 00 =
=1
Thus the expected value of ̃ ( ) remains exactly for the numerical solution. This is a key reason we
decided to use a uniform mesh; if we used a non-uniform mesh, a moment preserving finite difference scheme
would yield a linear problem with a matrix of at least five non-zero diagonals instead of three.
28
C.4. Option pricing. We integrate the option prices assuming that the probability
is spread
uniformly in each cell . This yields
(C.22a) ( ) = − for min
X £ ¡ ¢ ¤
(C.22b) ( ) = 1
2 (min + − )2
+ min + − 12 −
=+1
+ (max − )
for min + ( − 1) min +
(C.22c) ( ) = 0 for max
for the call prices, and
(C.23a) ( ) = 0 for min
−1
X £ ¡ ¢ ¤
(C.23b) ( ) = ( − min )
+ − min − − 12
=1
[ − min − ( − 1) ]2
+ 12 for min + ( − 1) min +
(C.23c) ( ) = − for max
for the puts.
Appendix D. Dispersion.
We would like to be able to measure , at least crudely, in terms of standard deviations from today’s
forward . The effective forward equation can be written as
£ ¤
(D.1a) = 12 2 2 2 ( ) for
(D.1b) → ( − ) as →
with
¡ ¢ 2
(D.1c) 2 ( ) = 1 + 2 + 2 2 2 Γ( )( −) 2 ( )
To leading order, the solution is a Gaussian density
1 2
(D.2a) ( ) ≈ p − 2( −)
2 ( − )
with
Z
1 0
(D.2b) ( ) =
( 0 )
2
Since we are working only to leading order, we can neglect the exponential factor Γ( )( −)
in ( )
Integrating then yields
Ãp !
1 1 + 2( ) + 2 2 2 ( ) + + ( )
(D.3a) ( ) = log
1+
where
Z
1 0
(D.3b) ( ) =
( 0 )
29
√
For to be roughly standard deviations above on the exercise date we need to be + − .
This occurs at the where
Z
1 0 2
(D.4a) ( ) = = sinh (cosh + sinh )
( 0 )
with
√
(D.4b) = −
2
Similarly,
√ for to be roughly standard deviations below on the exercise date, we need to be
− − . This occurs where
Z
1 0 2
(D.5a) − ( ) = = sinh (cosh − sinh )
( 0 )
with
√
(D.5b) = −
2
for some 0.
For this model, the explicit implied vol formula is
µ ¶
(1 − ) ( − )
(E.2a) () = 1− 1−
· ·
( + ) − ( + ) ()
⎧ ⎡ ⎤ ⎫
⎪ + ⎪
⎨ (2 − ) (1 − )2 2 log2
( + ) − ( + )
2 − 3 2 ⎥ 2 ⎬
2
⎢ 1 + 1
1 + ⎣− 24 h i2 + 4 + ⎦
⎪
⎩ − 24 ⎪
⎭
( + )1− − ( + )1−
where
Ãp !
( + )1− − ( + )1− 1 − 2 + 2 − +
(E.2b) = () = log
1− 1−
where
( + )1− − ( + )1− ( + ) − ( + )
(E.3b) ( ) = Γ=
(1 − ) −
30
E.1. Stochastic normal model. As → 0, the shifted SABR model simplifies to the stochastic
normal model,
(E.4) ( ) = 1
Here there is no barrier, and the placement of min is an independent modeling decision.
REFERENCES
[1] D.T. Breeden and R. H. Litzenberger, Prices of state-contingent claims implicit in option prices, J. Business, 51
(1994), pp. 621-651.
[2] B. Dupire, Pricing with a smile, Risk, Jan. 1994, pp. 18—20.
[3] B. Dupire, Pricing and hedging with smiles, in Mathematics of Derivative Securities, M.A. H. Dempster and S. R. Pliska,
eds., Cambridge University Press, Cambridge, 1997, pp. 103—111
[4] E. DermaN and I. Kani, Riding on a smile, Risk, Feb. 1994, pp. 32—39.
[5] E. Derman and I. Kani, Stochastic implied trees: Arbitrage pricing with stochastic term and strike structure of volatility,
Int J. Theor Appl Finance, 1 (1998), pp. 61—110.
[6] J.M. Harrison and S. Pliska, Martingales and stochastic integrals in the theory of continuous trading, Stoch. Proc.
Appl, 11 (1981), pp. 215-260.
[7] J.M. Harrison and D. Krebs, Martingales and arbitrage in multiperiod securities markets, J. Econ. Theory, 20 (1979),
pp. 381-408.
[8] I. Karatzas, J.P. Lehoczky, S.E. Shreve, and G.L. Xus, Martingale and duality methods for utility maximization in
an incomplete market, SIAM J. Control Optim, 29 (1991), pp. 702—730.
[9] J. Michael Steele, Stochastic Calculus and Financial Applications, Springer, 2001
[10] F. Jamshidean, Libor and swap market models and measures, Fin. Stoch. 1 (1997), pp. 293-330
[11] F. Black, The pricing of commodity contracts, Jour. Pol. Ec., 81 (1976), pp. 167-179.
[12] John C. Hull, Options, Futures, and Other Derivative Securities, Prentice Hall, 1997.
[13] P. Wilmott, Paul Wilmott on Quantitative Finance, John Wiley & Sons, 2000.
[14] Patrick S. Hagan and Diana E. Woodward, Equivalent Black volatilities, App. Math. Finance, 6 (1999), pp. 147—157.
[15] P. S. Hagan, A. Lesniewski and D. E. Woodward, Geometric optics in finance, in preparation.
[16] F. Wan, Mathematical Models and Their Analysis, Harper-Row, 1989.
[17] J. Hull and A. White, The pricing of options on assets with stochastic volatilities, J. of Finance, 42 (1987), pp. 281-300.
[18] S.L. Heston, A closed-form solution for options with stochastic volatility with applications to bond and currency options,
Rev of Fin Studies, 6 (1993), pp. 327-343.
[19] A. Lewis, Option Valuation Under Stochastic Volatility, Financial Press, 2000.
[20] J.P. Fouque, G. Papanicolaou, K.R. Sirclair, Derivatives in Financial Markets with Stochastic Volatility, Cambridge
Univ Press, 2000.
[21] N. A. Berner, Hedging vanna & volga, DKW, private communicatons
[22] J.D. Cole, Perturbation Methods in Applied Mathematics, Ginn-Blaisdell, 1968.
31
[23] J. Kevorkian and J.D. Cole, Perturbation Methods in Applied Mathematics, Springer-Verlag, 1985.
[24] J.F. Clouets, Diffusion Approximation of a Transport Process in Random Media, SIAM J Appl Math, 58 (1998),
pp. 1604—1621.
[25] I. Karatzas and S. Shreve, Brownian Motion and Stochastic Calculus, Springer, 1988.
[26] B. Okdendal, Stochastic Differential Equations, Springer, 1998.
[27] M. Musiela and M. Rutkowski, Martingale Methods in Financial Modelling, Springer, 1998.
[28] G. B. Whitham, Linear and Nonlinear Waves, Wiley, 1974.
[29] J.C. Neu, Thesis, California Institute of Technology, 1978
32