Wavelet Smoothed Empirical Copula Estimators
Pedro. A. Morettin, Clélia M. C. Toloi,
Chang Chiann and José C. S. de Miranda
February 7, 2008
University of São Paulo, São Paulo, Brazil
Abstract
The purpose of this paper is twofold: Fisrt, we review briefly the methods often used for copula estimation in the context of independent, identically distributed random variables and discuss their use for time series
data. Secondly, we propose a new procedure, based on wavelet expansions.
The proposed estimators are based on empirical copulas. Simulations and
applications to real data are also given.
Keywords: Copula, empirical copula, time series, wavelets, wavelet estimators
1
Introduction
Copulas provide a convenient tool for describing the dependence between variables. Copula techniques have been developed basically for the
independent, identically distributed (i.i.d.) case, which would prevent, at
least theoretically, their applications to dependent data (e.g. time series
data) appearing in economics, finance and other areas. The presence of serial correlation and time-varying heteroscedasticity in financial time series,
for example, claims for the development of new methodologies or extensions
of the existing ones for this kind of data, specially in the field of copula
estimation. For i.i.d. samples of bivariate or in general multivariate distributions, parametric and nonparametric methods are well known.
1
Several approaches are used for copula estimation. If the copula is
assumed to belong to some parametric family of copulas, consistent and
asymptotically normal estimators of the parameters can be obtained by the
method of maximum likelihod (ML). See Genest and Rivest (1993) and Shi
and Louis (1995). A two-step procedure called inference function for margins can be used; first the parameters of the marginals are estimated and
then the parameters of the (parametric) copula are estimated, both via ML.
See for example Joe and Xu (1996). These estimators are consistent and
asymptotically normal and also almost as efficient as fully MLE. Another
possibility is to use the so-called empirical copulas, introduced by Deheuvels
(1979, 1981a,b). These are highly discontinuous, and some form of smoothing is necessary to obtain better estimates. An approach of Fermanian et
al. (2002), using kernel estimates based on empirical copulas, is discussed
in section 3. An approach based on wavelet expansions is introduced also in
section3.
Concerning the estimation of copulas for time series, to our knowledge
there are only the works of Fermanian and Scaillet (2003), using nonparametric techniques with kernels and Morettin et al. (2006), using wavelets. A
related paper is Chen and Fan (2004), but this focuses on stationary Markov
processes of order one and assumes a parametric form for the copula function. One purpose of this paper is to propose wavelet estimators based on
the empirical copulas. Another is to indicate that the methods developed for
i.i.d. samples can be used with time series data, under some circumstances.
The plan of the article is as follows. In section 2 we set down the necessary background on copulas and wavelets. In section 3 we describe the
parametric and nonparametric methods used for copula estimation in the
case of i.i.d. data. In section 4 we do the same for time series data. In section 5 we perform some simulations and in section 6 we apply the proposed
techniques for two sets of real data. The article ends with some further
comments in section 7.
2
Background
In this section we present some basic notions on copulas and wavelets.
2.1
Copulas
We restrict our attention to the bivariate case for ease of notation.
The extensions to the n-dimensional case is straightforward.
2
A copula can be viewed as a function C defined on I 2 = [0, 1]2 with values
in I, satisfying, for 0 ≤ x ≤ 1 and x1 ≤ x2 , y1 ≤ y2 , (x1 , y1 ), (x2 , y2 ) ∈ I 2 ,
the conditions
C(x, 1) = C(1, x) = x, C(x, 0) = C(0, x) = 0,
(1)
C(x2 , y2 ) − C(x2 , y1 ) − C(x1 , y2 ) + C(x1 , y1 ) ≥ 0.
(2)
Property (1) means uniformity of the margins, while (2), the n-increasing
property (with n = 2) means that P (x1 ≤ X ≤ x2 , y1 ≤ Y ≤ y2 ) ≥ 0 for
(X, Y ) having distribution function (d.f.) C.
See Nelsen (1999) for a general definition and further details on copulas.
The following important theorem links the definition of copula with a d.f.
and its marginal distributions. A proof can be found in Sklar (1959).
Theorem (i) Let C be a copula and F1 , F2 univariate d.f.’s. Then
F (x, y) = C(F1 (x), F2 (y)),
(x, y) ∈ IR2
(3)
defines a d.f. F with marginals F1 , F2 .
(ii) Conversely, for a two-dimensional d.f. F with marginals F1 , F2 , there
exists a copula C satisfying (3), and this is unique if F1 , F2 are continuous
and then, for any (u, v) ∈ I 2 ,
C(u, v) = F (F1−1 (u), F2−1 (v)),
(4)
where F1−1 , F2−1 denote the generalized left continuous inverses of F1 , F2 .
It follows that copulas are bivariate (in general multivariate) d.f.’s with
uniform univariate marginals. See also Kolev et al. (2005) and Schweizer
(1991) for good reviews of copulas. In what follows we assume that F1 and
F2 are continuous.
We now introduce empirical copulas. Let (Xi , Yi ), i = 1, . . . , n, be a
sample from (X, Y ) and let
1X
I{Xi ≤ x, Yi ≤ y},
n
n
Fn (x, y) =
i=1
−∞ < x, y < +∞
(5)
be the empirical d.f. and let F1n (x), F2n (y) be the corresponding marginal
d.f.’s, namely
F1n (x) = Fn (x, +∞),
F2n (y) = Fn (+∞, y),
3
−∞ < x, y < +∞.
Then the empirical copula function is defined by
−1
−1
Cn (u, v) = Fn (F1n
(u), F2n
(v)),
0 ≤ u, v ≤ 1,
and the empirical copula process is defined by
Zn (u, v) =
√
n(Cn − C)(u, v),
0 ≤ u, v ≤ 1.
(6)
(7)
Deheuvels (1979) proves uniform consistency of the empirical copula,
while Deheuvels (1981a, 1981b) obtained results concerning limits for Zn in
the case of independent marginals. In particular, he proposed a KolmogorovSmirnov-type statistic for testing the independence hypothesis that C(u, v) =
uv and obtained its asymptotic distribution under the null hypothesis. Fermanian et al. (2002) prove that the empirical copula process converges
weakly to a Gaussian process in L∞ [0, 1]2 (the space of a.e. bounded functions on I 2 with sup-norm), under the asumption that C has continuous
partial derivarives. See also Ibragimov (2005) for a similar result in the case
of a stationary β-mixing process.
2.2
Wavelets
We will need two-dimensional wavelets in this paper, but we mention briefly the one-dimensional case. From a mother wavelet ψ and a father
wavelet φ (or scaling function), an orthonormal system for L2 (IR) is generated by setting φj,k (x) = 2j/2 φ(2j x−k), j ≥ j0 and ψj,k (x) = 2j/2 ψ(2j x−k),
j, k ∈ ZZ, for some coarse scale j0 , which we take as zero. Hence, for any
f ∈ L2 (IR) we may write, uniquely,
f (x) =
X
α0,k φ0,k (x) +
XX
βj,k ψj,k (x),
(8)
j≥0 k
k
where the wavelet coefficients are given by
α0,k =
βj,k =
Z
Z
f (x)φ0,k (x)dx,
(9)
f (x)ψj,k (x)dx.
(10)
In our case, a copula will be assumed to belong to L2 (I 2 ), so we need
first to consider periodized wavelets in the interval [0, 1], defined by
φ̃j,k (x) =
X
n
φj,k (x − n),
ψ̃j,k (x) =
X
n
4
ψj,k (x − n),
see Vidakovic (1999) for details. We will supress the upper tilde from now on.
For any function f ∈ L2 (I 2 ) we may have a similar expansion to (8), where
the wavelets are obtained as products of one-dimensional wavelets. One
possibility is to consider a basis with a single scale. Define the bivariate
scaling function as Φ(x, y) = φ(x)φ(y) and the wavelets by Ψh (x, y) =
φ(x)ψ(y), Ψv (x, y) = ψ(x)φ(y) and Ψd (x, y) = ψ(x)ψ(y), where h, v and
d indicate the horizontal, vertical and diagonal directions, respectively. Let
k = (k1 , k2 ). Then a wavelet expansion for f (x, y) is
f (x, y) = c0,0 +
∞ X X
X
dµj,k Ψµj,k (x, y),
(11)
j=0 k µ=h,v,d
with the wavelet coefficients given by
Z
Z
µ
c0,0 = f (x, y)dxdy, dj,k = f (x, y)Ψµj,k (x, y)dxdy.
(12)
Another possibility is to built a basis as the tensor product of two onedimensional bases with different scales for each dimension. This will not be
considered here. This approach is used by Morettin et al. (2006).
3
Estimation for i.i.d. data
As we mentioned in the Introduction, if a copula belongs to a parametric family of copulas, ML methods can be used. These are all well known
and wil not be discussed further here. The software S+FinMetrics, a module
of S-Plus, implement at least one of these procedures. See Zivot and Wang
(2006) for further details.
We now turn to nonparametric estimators. Fermanian et al. (2002)
proposed to use
1X
Kn (x − Xi , y − Yi ),
n
n
F̂n (x, y) =
(13)
i=1
as a smoothed empirical distribution function estimator. Here Kn (x, y) =
−1
K(a−1
n x, an y) and
Z x Z y
K(x, y) =
k(u, v)dudv,
−∞ −∞
R
for some bivariate kernel function k : IR2 → IR, k(x, y)dxdy = 1, and a
sequence of bandwidths an ↓ 0, as n → ∞. It is easily proved that for small
enough bandwidths an , under mild conditions,
5
√
P
n sup |F̂n (x, y) − Fn (x, y)| → 0.
x,y
Similar smoothed estimators F̂1n and F̂2n can be proposed , using univariate kernels. A smoothed (kernel) empirical copula estimator is then
obtained from (6), namely
−1
−1
Ĉn (u, v) = F̂n (F̂1n
(u), F̂2n
(v)), 0 ≤ u, v ≤ 1.
(14)
We now propose a smoothed empirical copula wavelet estimator.
Assume that the copula C(u, v) ∈ L∞ ([0, 1]2 ) and consider its wavelet
expansion
C(u, v) = c0,0 +
∞ X X
X
dµj,k Ψµj,k (u, v),
(15)
j=0 k µ=h,v,d
with the wavelet coefficients given by
Z
Z
µ
c0,0 = C(u, v)dudv, dj,k = C(u, v)Ψµj,k (u, v)dudv.
We take as the empirical wavelet coefficients,
Z
dˆµj,k = Cn (u, v)Ψµj,k (u, v)dudv,
(16)
(17)
and a similar expression for c0,0 .
We have that the corresponding estimator for C(u, v) is then
Ĉ(u, v) = ĉ0,0 +
XX
j,k
δ(dˆµj,k , λ)Ψµj,k (u, v),
(18)
µ
where δ(·, λ) is a threshold. See Donoho et al. (1995) for details on thresholds. In this paper we will take a high quantile as a threshold. The sums
in (18) are computed for j ≤ J, where J = J(n) is the maximum scale
analyzed, chosen (for theoretical purposes) in such a way that J → ∞ as
n → ∞, but J/n → 0. In turn, k1 , k2 vary from zero to 2j − 1. See Morettin
et al. (2006) for details of copula estimators based on wavelet estimators of
densities.
We will not deal here with properties of these smoothed (kernel and
wavelet) estimators. For kernel estimators we cojecture that properties similar to those derived by Fermanian and Scaillet (2003), for time series, hold
6
in this case of i.i.d. data, probably with milder conditions. For wavelet
estimators, research by the present authors is under way.
Actually, Fermanian et al. (2002) proved that the smoothed empirical
copula process
Ẑn (x, y) =
√
n(Ĉn − C)(u, v), 0 ≤ u, v ≤ 1,
converges to a Gaussian process in L∞ ([0, 1]2 ).
4
Estimation for time series data
As remarked in section 1, most of the results available in the literature
of copulas apply to i.i.d. samples (Xi , Yi ), i = 1, . . . , n from a distribution
function F . As properly remarked by Mikosch (2005), “it is contradictory
that in risk management, where one observes a lot of dependence through
time, copulas are applied most frequently”. For stochastic processes, there
are a few works dealing with time series, namely Fermanian and Scaillet
(2003) and Morettin et al. (2006). The former consider strictly stationary
processes satisfying a strong mixing condition and kernel-type estimators,
the latter consider a larger class of stochastic processes satisfying also some
mixing conditions and wavelet-type estimators.
If we decide to apply the theory developed for i.i.d. sequences to time
series data, at least some care should be taken. We recall that in problems
involving MCMC methods, the ultimate goal is to obtain a (i.i.d.) sample from some target distribution, and this is done by sampling from the
stationary distribution of the simulated Markov chain, taking observations
separated by a number τ , which can be obtained from an inspection of the
autocorrelation function (a.c.f.) of the simulated chain: τ corresponds to
the lag at which the a.c.f. dumps to zero. In order to reduce dependence,
similar approach can be followed in the case of copulas, inspecting the a.c.f.
and cross-correlation function (c.c.f.) of the series involved.
Of course the folowing procedure does not produce i.i.d. samples, but
it is hoped that most of dependence will be eliminated from the series of
interest.
′
Let Xt = (X1t , . . . , Xrt ) be an r-dimensional stochastic process and
suppose we have n observations from this process, Xt , t = 1, . . . , n. Assume
that this process is strictly stationary, with distribution function F (x) and
density function f (x), generally unknown. Let ρ̂(τ ), τ ∈ ZZ be the estimated
correlation matrix. We can plot the a.c.f. ρ̂ii (τ ) and the c.c.f. ρ̂ij (τ ) and
look for the lags τij for which there is decay to zero. Do the same with the
7
correlations of the squared series. Sample the series by taking observations
separated by some τ , which is a compromise value obtained by loking at
the τij values.The rational is that, at least for returns of financial assets,
these are in general non-correlated, but they are dependent, so the reason
for looking at the squared returns.
Another possibility is to consider the (usual) norm of the matrices ρ̂(τ ),
for each τ , and plot these norms against τ ; then look at the lag τ that minimizes this norm. Then take samples from the series at each τ observation.
See also de Miranda (2005).
In what follows, we will apply these simple inspection procedures for
simulated series.
Concerning nonparametric techniques, Fermanian and Scaillet (2003)
and Morettin et al. (2006) proposed kernel and wavelet estimators for the
copula, respectively. Estimators of densities, d.f.’s, quantiles and finally
of C are obtained through the relation (4). Properties of the estimators,
simulations and applications are given in the above mentioned papers.
Here we propose to use the wavelet estimator (18) for time series data,
which is a sample of a strictly stationary process. As a matter of fact, it
is only necessary to consider a wider class of processes, for which the d.f.
Ft (x) = F (x), for all t. Some form of mixing condition will be needed also.
5
Simulations
In this section we present simulation examples of the wavelet estimators proposed in sections 3 and 4 and also estimators based on parametric
copulas. We use the same examples of Fermanian and Scaillet (2003).
(1) First, we consider the smoothed empirical copula estimator (18) in the
case of a stationary bivariate autoregressive process of order one:
Xt = A + BXt−1 + ǫt ,
(19)
where Xt = (X1t , X2t ), with independent components and thus C(u1 , u2 ) =
′
′
u1 u2 , ǫt ∼ N (0, Σ), A = (1, 1) , vec(B) = (0.25, 0, 0, 0.75) and vec(Σ) =
′
(0.75, 0, 0, 1.25) .
The number of Monte Carlo replications is 500, while the data length is
n = 210 = 1024.
Table 1 shows the bias, E ĈJ −C, and mean squered error (MSE), E[(Ĉ −
C)2 ], computed for the Daubechies d8 wavelet using J = 5. All values (true
value of the copula, bias and MSE) are expressed as multiples of 10−4 . The
8
results are satisfactory in terms of bias and MSE, comparable with those of
Fermanian and Scaillet. Figures 1 and 2 show the estimated copula and the
contour plots, respectively.
Table 1: Bias and MSE of d8 wavelet estimator: independent case
C(.01,.01)
1.00
0.11394
0.00079
C(.05,.05)
25.00
-0.05280
0.02238
C(.25,.25)
625.00
-2.29666
0.38856
C(.50,.50)
2500.00
0.86449
0.74728
C(.75,.75)
5625.00
2.01140
0.41745
C(.95,.95)
9025.00
0.04206
0.02339
C(.99,.99)
9801.00
4.66451
0.00313
C(u1,u2)
1
0 0.2 0.4 0.6 0.8
x10−4
True
Bias
MSE
1
0.8
1
0.6
u2 0.4
0.8
0.6
0.4
0.2
0.2
u1
Figure 1: d8 wavelet estimated Copula based on empirical copulas:
independent case.
9
1.0
0.9
0.8
0.8
0.7
0.6
0.6
0.4
u2
0.5
0.4
0.2
0.3
0.2
0.0
0.1
0.0
0.2
0.4
0.6
0.8
1.0
u1
Figure 2: Contour plot of d8 wavelet estimated copulas based
on empirical copulas: independent case.
(2) We now turn to the case where the components of Xt are dependent
′
′
processes, with A = (1, 1) , vec(B) = (0.25, 0.2, 0.2, 0.75) and vec(Σ) =
′
(0.75, 0.5, 0.5, 1.25) .
Since X1t and X2t are positively dependent, we have C(u1 , u2 ) > u1 u2 .
Based on 500 Monte Carlo replications with the data length n = 1024,
results are reported in Table 2. These results are much better than those of
Fermanian and Scaillet and comparable with those of Morettin et al. (2006).
Figure 3 and 4 bring the estimated copula and contour plots, respectively.
Again, we have used the d8 wavelet with J = 5.
Table 2: Bias and MSE of d8 wavelet estimator: dependent case
x10−4
True
Bias
MSE
C(.01,.01)
27.08
-2.0990
0.0188
C(.05,.05)
197.95
-3.0690
0.1153
C(.25,.25)
1511.74
-1.6825
0.4421
C(.50,.50)
3747.68
-2.4729
0.6378
C(.75,.75)
6511.74
-2.8429
0.4762
C(.95,.95)
9197.95
-1.3488
0.0977
C(.99,.99)
9827.08
-1.3236
0.0166
(3) We now consider the case (2) above, but will investigate the use of
i.i.d, methods for it. Since the true underlying copula is a normal one, we
consider to fit this to the associated bivariate AR(1) process above. First
we generate a sample of 40,000 observations of the process. Figure 5 (top
pannel) shows the acf of both components of the bivariate series, showing
10
that there are significant correlations, as expected. Then we consider the
norm of the estimated correlation matrix for several lags and see that this
dies out around lag 34. See the botton pannel of Figure 6. We sample
the above process using this sampling interval (34) and obtain a (bivariate)
series of 1,024 observations. The corresponding acf’s are also shown in
the botton pannel of Figure 5. We see now that the series are practically
non-correlated. The twoo upper pannels of Figure 6 also show the ccf’s
of the original series, showing also significant cross-correlations. Figure 7
shows the acf’s of the squared original and sampled series, showing that
the original bivariate process presents some degree of dependence, which
is almost removed in the sampled series. We cannot guarantee that all
the dependence is removed, but as far as the series and squared series are
concerned the correlation is almost totally removed. This is an indication
that the dependence is reduced (possibly a lot). Reducing this correlation is
a necessary condition for independence, but of course we do not claim that
this sampled series is independent. We have used the norm, acf and ccf of
series and squared series as simple necessary tests, in the sense that if these
requirements are not fulfilled then i.i.d. methods may be very missleading.
Figure 8 shows the contour plot of the estimated normal copula, for the
original (40,000 values) and sampled series (1,024 values), superimposed to
the corresponding empirical copulas. We see that they are practically the
same (except that the empirical copula for the longer series is smoother).
The S+FinMetrics package was used to do the estimation, using the IFM
(inference for margins) approach of Joe and Xu (1996). If we use the same
sampled data to estimate the copula using the smoothed wavelet estimator,
we will obtain a contour plot similar to the ones in Figure 4 and Figure 8,
respectively.
Table 3 presents the values of Kendall’s τ for our simulated examples,
showing that the values are very close.
11
C(u1,u2)
1
0 0.2 0.4 0.6 0.8
1
0.8
1
0.6
u2 0.4
0.8
0.6
0.4
0.2
0.2
u1
1.0
Figure 3: d8 wavelet estimated Copulas based on empirical copulas:
dependent case.
0.8
0.9
0.8
0.6
0.7
u2
0.6
0.4
0.5
0.4
0.2
0.3
0.2
0.0
0.1
0.0
0.2
0.4
0.6
0.8
1.0
u1
Figure 4: Contour plot of d8 wavelet estimated copulas based
on empirical copulas: dependent case.
12
Table 3: Kendall’s τ for the simulated series
τ
T = 40, 000
T = 1, 024
normal copula
0.49974
0.50000
empirical copula
0.49906
0.50297
So we can conclude tentatively that if we can fit a parametric copula
to our time series data, it is safe to apply the usual estimation procedures
that were developed for i.i.d. variables, provided we have a large number of
observations and can sample the series appropriately and obtain a reasonable
number of observations to proceed. The estimators are quite close. But in
situations where we are not sure which parametric copula family to fit, or
we have a not too large sample, it is better to use nonparametric estimators
developed for time series data, like the smoothed wavelet estimator of section
4 or the estimators proposed in Fermanian and Scaillet (2003) and Morettin
et al. (2006). We observe that the simulated process in example (2) may be
considered an m− dependent process, with m ≈ 34. In a practical situation
we would fit the parametric copula to the whole series and then to the
sampled series and compare both fits. If they are close enough the use of
i.i.d. procedures is likely to be a god solution.
13
Series : series2.40000
ACF
0.0
0.0
0.2
0.2
0.4
0.4
ACF
0.6
0.6
0.8
0.8
1.0
1.0
Series : series1.40000
0
10
20
Lag
30
40
0
20
Lag
30
40
Series : series2.1024
ACF
0.4
0.0
0.0
0.2
0.2
ACF
0.4
0.6
0.6
0.8
0.8
1.0
1.0
Series : series1.1024
10
0
5
10
15
Lag
20
25
30
0
5
10
15
Lag
20
25
Figure 5: Acf’s of the original and sampled series,
bivariate AR(1) process: dependent case.
14
30
pho.12
0.2
0.4
0.6
1.0
0.8
0.6
pho.11
0.4
0.0
0.2
0.0
0
10
20
30
40
0
10
30
40
30
40
indice.k
0.8
0.6
pho.22
0.0
0.0
0.2
0.4
0.4
0.2
pho.21
0.6
1.0
indice.k
20
0
10
20
30
40
0
10
20
indice.k
1.0
0.5
0.0
norma.pho
1.5
indice.k
0
10
20
30
40
indice.k
Figure 6: Acf’s, ccf’s and norm of the original series,
bivariate AR(1) process: dependent case.
15
ACF
0.0
0.0
0.2
0.2
0.4
0.4
ACF
0.6
0.6
0.8
0.8
1.0
1.0
Series : series1.squared.40000 Series : series2.squared.40000
0
10
20
Lag
30
40
0
20
Lag
30
40
Series : series2.squared.1024
ACF
0.4
0.0
0.0
0.2
0.2
ACF
0.4
0.6
0.6
0.8
0.8
1.0
1.0
Series : series1.squared.1024
10
0
5
10
15
Lag
20
25
30
0
5
10
15
Lag
20
25
30
Figure 7: Acf’s of the squared original and sampled series,
bivariate AR(1) process: dependent case.
16
1.0
0.1
0.1
0.2
0.2
0.3
0.3
0.4
0.4
0.5
0.6
0.7
0.8
0.9
1.0
1.0
0.6
0.6
0.2
0.2
0.3
0.3
0.4
0.4
0.5
0.5
0.6
0.6
0.7
0.7
0.8
0.9
1.0
V
0.8
V
0.8
0.1
0.1
0.4
0.4
0.2
0.2
0.0
0.0
0.0
0.2
0.4
0.6
0.8
1.0
U
0.0
0.2
0.4
0.6
0.8
1.0
U
Figure 8: Normal copula estimators superimposed to the
empirical copula; T = 40, 000 and T = 1, 024, respectively.
6
Empirical Applications
In this section we illustrate the estimation of copulas using the
smoothed wavelet estimator given by (18), considering two pairs of daily
series: In the first example we consider returns of the São Paulo Stock Exchange index (Ibovespa) and prices of stocks of the Brazilian oil company,
Petrobrás, from January 2, 1995 to February 3, 1999 (T = 1, 024 observations). In the second example we consider daily returns of the Ibovespa
and IPC (Mexico) indexes, from September 4, 1995 to June 5, 2000 (also
T = 1, 024 observations).
Figure 9 shows the scatter plot of Ibovespa and Petrobrás, and we see
a rather strong contemporaneous correlation between both series (Pearson
correlation coefficient is 0.83). In figures 10 and 11 we have the plot of
the estimated copula using (18) and the corresponding contour plot, respectively. We see that these contours resemble the ones corresponding to a
comonotonic copula (perpendicular straight lines on the diagonal). We have
used the 0.90 percentile as the λ parameter of the thresholding procedure:
all empirical wavelet coefficients smaller than λ were discarded.
17
0.3
•
0.2
•
•
0.0
ribv
0.1
•
−0.1
•
•
••
•
•
••
•
•
• •
•• • •
•
• •
• •
•
• • •
•
•• • •
••
•
•
•• • •
•
•
•
•
•
•
•
•
•••••
••
••
•••
••
••••••• •
•••
•
•
•••••
•
••••
•
• ••
••
•••
•
•••••••
•
••
•
••••
••
•
•
•••
••
•
••
••
•
•
•••• • •
•
•••••••
••
•
•
•
•
•
•
•
•••
•
•
•
•
••
•
••
•
•
•
•••
•
•
•
•
•
•
•
•
•
•
• ••
•
•••••
•
•••
••
••••
•
••
•
•
•
•
•
••
•
•
•
•
•
•
•
•
•
••
••
•
••
•
••
•
•
•
•
•
•
•
•
•
•
•
•• •••
•
••
•
•••
•
•
•
•
•
••
•
•
•
•
•
••••
•
•
•
•
••
•
•
••
•
•• •••
•
•••
•
•
••
•
•••
•••
•
•
•
•
•
•
••
•
•• •
•
•
••
•
•
•
••
•
•••
•
••
•
•
•
•
•
•
•
•
••
••
•
•
••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••••
•
•
••
•
•
•
•••
•••
•
•
•
•
•
•
••
•
•
••••
•
•
••
••
•
•
•
•••
•
•• •
•
••
•
•
•••
•
•
•
•
••
•
•
••
• ••
••
••
•••
•
•
•
•
••
•
•
•
•
•
•
•
•
••••
•
•
•
•
•
••
•
•
•
•
•
•
•
••
•
••
•
•
•
••
•
•
•
•
•
•
••
••
•
•••
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•••
••
•
••
•
•
•
•
•
••••••
•••
•
•
•
•
•
•
••
•
••
•
•
•
•• •••• ••
• •
•
•
•
•
•
•
•
••
•
• •
•
•
•
•
••
•
•
•
••
•
•
••
•
••
•
•
•
•
••
•
•
•
•
•
••
•
•
•
•
••
••
•••••
•••
•
••
•
••
•
•
••
•
•••
•
•
•••••
•
•••
•
•••
•
•
••
•••• •
••••
•
•
•
••
•••
•
•
••
•
•
•
•••• •
•
• •••••
••••
•••
•
•
•
•
•
•
•
•
•
•
•
• •••••••••••••••• ••
• ••• •
•
•
•
••• •
•
•
•• • •
•
•
• •• • •• • •
••
• •• • •
•
•
•
• •
•
•
•
−0.2
−0.1
0.0
0.1
0.2
rpetro
C(u1,u2)
0.8 1
−0.2 0 0.2 0.4 0.6
Figure 9: Returns for Ibovespa and Petrobras.
1
0.8
1
0.6
u2 0.4
0.8
0.6
0.4
0.2
0.2
u1
Figure 10: d8 wavelet estimated copula for Ibovespa and Petrobras
based on empirical copulas.
18
1.0
0.8
0.9
0.8
0.6
0.7
u2
0.6
0.4
0.5
0.4
0.2
0.3
0.2
0.0
0.1
0
0.0
0.2
0.4
0.6
0.8
1.0
u1
Figure 11: Contour plot of d8 wavelet estimated copula for Ibovespa and
Petrobras based on empirical copulas.
Figure 12 shows the scatterplot of the returns of Brazilian (Ibovespa)
and Mexican (IPC) indices. The contemporaneous correlation coefficient
is low, 0.552. The plots for the smoothed empirical estimates are given in
Figures 13 and 14. We see the same kind of behavior as in the simulated
dependent case above. The same threshold as before was used.
19
•
0.10
•
•
•
•
•
•
•
•
•
• • • ••
•
•
• •
• •
•
••• •
• • ••• •••
• • •
•• •
•
•
•
•
•
•
•
•
•
•• •••• •••
••
• ••
•••
••
••
••
••• •••
••••
••
•
••••
• •• •
•••
••
••
••
•••
• ••
••
•
••
•
•
••
••
••••••
••
•••
•
•
••
•
••
••
•
•
••
•
• • • ••
•
•
•••
•••
•
•
• •
•
•••
•
•
•••
••
•
•
•••••
•
•
•
• ••
••
•
••
•••••
•
••••
•
••
••
•
•
••
•
•
••
••
••••
•
••
•
•
•
•
• •• •
•
•
•
•
•
•• ••••
•
•
••
•
•
•
••
•
•
••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
•••••••
•
•
•
•
• •
•
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•••
•
••
•
•
•
••
• • •••
•
•
•
•••
•
•••
•
•
•
•
•
•
•
•
•
•
•
•
•
••••
•
••••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
•
••
•
•
•
••
••
•
••
•
•
•
•
•
•
•
•
•
••••••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•••
•
••••••
•••
••
••
•
••
•
••
•
•
•
•
••
•
•
••
•
•••••
•
•
•
•
•
•
•••
•
•
•
•
•
••
•
•
•
•
•
•
•
•
••
•
•
• • • •••
•
•
•
••
•
•
•
••
••
•
•
•
•
•
•
•
•
•
•
••
•
••
•
•
•
•
•
••
•
•
•
•
•
••
• ••
•
•
•
•• •
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
••
••
•
•
•
•••
•••
• •
•
•
•
••
•
•
•
•
•
•
•
•
•
•
••
•
•••
•
••
••
•
•
•
•
•
•
•
•
•
•
••
•
••
•
•
•
•
•
•
•
• •
••••
•
•
•
•
•
•
•
•
•
••
•
•
•
••••
•
•
•
•
••
••
• ••• ••
•
••
••
•••
•
•
•
•
•
•••
•
•
••
•
• • •••
•
•
•
•
••
•
•
••
•
•
•
••
•
•
•
•
•••
•• ••
•
••
•
••
••
•
•• •••
•
•
•
• ••
••
••
••
•
•
•
•
•
•
•
•••
•••••• ••••
•
•
•
•
•
•
•
•
•
•
• •••••
••
•• ••
• ••••
•
••
••••
•••••
••• •
•
••
• • • • ••• •
• ••
•••• •
••
•
•
•• •• •
••
• •
•
••
•
•
• •
•
•
••
•
ipc
0.0
0.05
• •
•
−0.05
•
−0.15
−0.10
•
•
•
•
−0.1
0.0
0.1
0.2
0.3
ibv
C(u1,u2)
0.8 1
−0.2 0 0.2 0.4 0.6
Figure 12: Returns for Ibovespa and IPC.
1
0.8
1
0.6
u2 0.4
0.8
0.6
0.4
0.2
0.2
u1
Figure 13: d8 wavelet estimated copula for Ibovespa and IPC
based on empirical copulas.
20
1.0
0.8
0.9
0.8
0.6
0.7
u2
0.6
0.4
0.5
0.4
0.2
0.3
0.2
0.0
0.1
0
0.0
0.2
0.4
0.6
0.8
1.0
u1
Figure 14: Contour plot of d8 wavelet estimated copulas for Ibovespa and
IPC based on empirical copulas.
7
Further remarks
In this work we have developed wavelet estimators of copulas based
on empirical copulas which can be used for i.i.d. and time series data. We
further have suggested a simple approach to check if the usual procedures
developed for i.i.d. variables can be used with time series data. Of course
this requires that the series have a large number of observations. Preliminary findings indicate that the estimators based on i.i.d. samples (basically
versions of the maximum likelihod estimators) can be used with time series
data provided some simple checks are done. But further studies, mainly
theoretical, based on convergence rates of estimators for i.i.d. variables and
time series, have to be performed in order to derive more precise conclusions.
Acknowledgements.
The authors acknowledge the support of FAPESP grant 03/10105-2. The
fourth author thanks Our Lord and Saviour Jesus Christ.
21
References
Chen, X. and Fan, Y. (2004). Estimation of copula-based semiparametric
time series models. To appear, Journal of Econometrics.
Deheulves, P. (1979). La fonction de dépendance empirique et ses propriétésUn test non paramétrique d’independence. Académie Royale de BelgiqueBulletin de la Classe des Sciences-5e Série, 65, 274–292.
Deheulves,P. (1981a). A non parametric test for independence. Publications
de l’Institut de Statistique de l’Université de Paris, 26, 29–50.
Deheuvels, P. (1981b). A Kolmogorov-Smirnov type test for independence
and multivariate samples. Rev. Roum. Math. Pures at Appl., 26, 213–226.
de Miranda, J.C.S. (2005). Modeling daily extreme log-returns. Proceedings
of the Second Brazilian Conference on Statistical Modelling in Insurance and
Finance (eds. N. Kolev and P.A. Morettin), 290–293.
Donoho, D.L., Johnstone, I., Kerkyacharian, G. and Picard, D. (1995).
Wavelet shrinkage: Asymptopia? Journal of the Royal Statistical Society,
Series B, 57, 301–369.
Fermanian, J.-D., Radulovic, D. and Wegkamp, M. (2002). Weak convergence of empirical copula process. WP, CREST.
Fermanian, J.-D. and Scaillet, O. (2003). Nonparametric estimation of copulas for time series. Journal of Risk, 5, 25–54.
Genest, C. and Rivest, L.-P. (1993). Statistical inference procedures for
bivariate Archimedian copulas. Journal of the American Statistical Association, 88, 1034–1043.
Ibragimov, R. (2005). Copula-based dependence characterizations and modelling for time series. Discussion paper N. 2094, Harvard Institute of Economic Research.
Joe, H. and Xu, J.J. (1996). The estimation method of inference functions
for margins for multivariate models. TR 166, Dept. Statistics, Univ. British
Columbia.
Kolev, N.W., Mendes, B.V.M. and Anjos, U. (2005). Copulas: A review
and recent developments. TR, University of São Paulo.
22
Mikosch, T. (2005). Copulas: Tales and facts. To appear, Extremes.
Morettin, P.A., Toloi, C.M.C., Chiann, C. and de Miranda, J.C.S. (2006).
Wavelet estimation of copulas for time series. Preprint.
Nelsen, R. (1999). An Introduction to Copulas. Lecture Notes in Statistics.
New York: Springer.
Schweizer, B. (1991). Thirty years of copulas. Advances in Probability Distributions with Given Marginals (G. Dall’Aglio, S. Kotz and G. Salinetti,
eds). Kluwer: Dordrecht.
Shi, J. and Louis, T. (1995). Inferences on the association parameter in
copula models for bivariate survival data. Biometrika, 51, 1384-1399.
Sklar, A.(1959). Fonctions de répartition à n dimensions et leurs marges.
Publ. Inst. Statist. Univ. Paris, 8, 229–231.
Vidakovic, B. (1999). Statistical Modeling by Wavelets. New York: Wiley.
Zivot, E. and Wang,J. (2006). Modeling Financial Time Series With S-Plus.
Second Edition. New York: Springer.
23