DEPARTMENT OF ECONOMICS
EUI Working Papers
ECO 2011/23
DEPARTMENT OF ECONOMICS
THE MULTISCALE CAUSAL DYNAMICS
OF FOREIGN EXCHANGE MARKETS
Stelios Bekiros and Massimiliano Marcellino
EUROPEAN UNIVERSITY INSTITUTE, FLORENCE
DEPARTMENT OF ECONOMICS
The Multiscale Causal Dynamics
of Foreign Exchange Markets
STELIOS BEKIROS
and
MASSIMILIANO MARCELLINO
EUI Working Paper ECO 2011/23
This text may be downloaded for personal research purposes only. Any additional reproduction for
other purposes, whether in hard copy or electronically, requires the consent of the author(s), editor(s).
If cited or quoted, reference should be made to the full name of the author(s), editor(s), the title, the
working paper or other series, the year, and the publisher.
ISSN 1725-6704
© 2011 Stelios Bekiros and Massimiliano Marcellino
Printed in Italy
European University Institute
Badia Fiesolana
I – 50014 San Domenico di Fiesole (FI)
Italy
www.eui.eu
cadmus.eui.eu
THE MULTISCALE CAUSAL DYNAMICS OF FOREIGN EXCHANGE MARKETS *
Stelios Bekiros †
Massimiliano Marcellino ‡
European University Institute, Department of Economics,
Via della Piazzuola 43, I-50133 Florence, Italy
ABSTRACT
This paper relies on wavelet multiresolution analysis to capture the dependence structure of currency
markets and reveal the complex dynamics across different timescales. It investigates the nature and direction
of causal relationships among the most widely traded currencies denoted relative to the United States Dollar
(USD), namely Euro (EUR), Great Britain Pound (GBP) and Japanese Yen (JPY). The timescale analysis
involves the estimation of linear vis-à-vis nonlinear and spectral causality of wavelet components and
aggregate series as well as the detection of short- vs. long-run linkages and cross-scale correlations.
Moreover, this study attempts to probe into the micro-foundations of across-scale heterogeneity in the
causality pattern on the basis of trader behavior with different time horizons. New stylized properties
emerge in the volatility structure and the implications for the flow of information across scales are inferred.
The examined period starts from the introduction of the Euro and covers the dot-com bubble, the financial
crisis of 2007-2010 and the Eurozone debt crisis. Technically, this paper presents an invariant discrete
wavelet transform that deals efficiently with phase shifts, dyadic-length and boundary effects. It also
proposes a new entropy-based methodology for the determination of the optimal decomposition level.
Overall, there is no indication of a global causal behavior that dominates at all timescales. When the
nonlinear effects are accounted for, the evidence of dynamical bidirectional causality implies that the pattern
of leads and lags changes over time. These results may prove useful to quantify the process of integration as
well as influence the greater predictability of currency markets.
Keywords: exchange rates; wavelets; timescale analysis; causality; entropy
JEL classification: C14 ; C32 ; C51 ; F31
* We are grateful to Ramazan Gençay and Helmut Lütkepohl as well as seminar participants at various institutions for helpful
comments and discussions. Stelios Bekiros also thanks the Department of Quantitative Economics at the University of Amsterdam
(UvA), the Department of Economics and the Max Weber Programme at the European University Institute (EUI), for having hosted his
research. Earlier versions of this paper were presented at the UvA, the Netherlands Society for Statistics and Operations Research (VVS)
and at the EUI. This research is supported by the Marie Curie Intra European Fellowship (FP7-PEOPLE-2009-IEF, N° 251877) under the
7th European Community Framework Programme. The usual disclaimers apply.
† Corresponding author. Tel.: +39 055 4685916; Fax: +39 055 4685 902; E-mail address:
[email protected]
‡
Tel.: +39 055 4685956; Fax: +39 055 4685 902; E-mail address:
[email protected]
1
1. INTRODUCTION
Since the pioneering work of Grossman and Morlet (1984) wavelet methodology, a
refinement of Fourier analysis, has been introduced in the literature as an alternative for analysing
nonstationary data with “irregularities”. Their contribution was followed by the development of
multiresolution analysis by Mallat (1989) and the introduction of orthogonal wavelet bases by
Daubechies (1992). Even though the wavelet methodology has widespread application in the
natural sciences, it is a rather unexplored area in economics, with the exception of financial
applications4. The wavelet multiscale decomposition, allowing for simultaneous analysis in the
time and frequency domain, is a valuable means of exploring the complex dynamics of financial
time series, which are inherently characterized by chaotic patterns, fat tails and long-memory,
particularly at high sampling frequencies.
In this study we use the wavelet methodology to investigate the nature and direction of
causality among foreign exchange (FX) markets at different timescales. During the Great
Moderation period5 and in particular during the nineties, currency markets have grown more
similar and FX rate volatility decreased (Laopodis, 1998)6. More recently, the Euro behavior against
the US dollar has seriously altered the prior state of market interrelations (Bénassy-Quéré et al.,
2000; Wang et al., 2007). Given the status of the US dollar and Euro as anchor currencies, it is
interesting to examine the nature of the causal linkages between them, as well as with other
currencies7. The existence of causal linkages would suggest that news originating in a specific
market is not country-specific and idiosyncratic, but efficiently transmitted to other foreign
markets, thus providing support to the “meteor shower” notion introduced by Engle et al. (1990).
4 See for example Greenblatt (1998), Jensen (2000), Davidson et al. (1998) and Fernandez (2005). A more detailed literature overview is
provided in Appendix I.
5 According to Stock and Watson (2003) the Great Moderation period initiated around the mid-1980s and lasted until the beginning of
the 2000s. During that period, the growth variance of the G7 countries was considerably lower, from 50% to 80% in comparison to the
pre- and the post –Great Moderation period.
6
A rich empirical literature exists on the volatility spillover mechanism of the USD across other currency markets. The nature of the
transmission mechanism as well as the degree of price information efficiency was already investigated during the 1980s in the
beginning of the higher integration of FX rates vis-à-vis the USD (Hogan and Sharpe, 1984; Ito and Roley, 1987). Additional empirical
evidence by Koutmos and Booth (1995) and Laopodis (1997) suggests that the size or sign of an innovation in USD, in response to a
variation in the Federal Reserve interest rate, may significantly affect the extent of dependence across markets.
7
The transactions involving USD-Euro amount approximately to 40% of global trading (BIS Triennial Survey, 2007).
2
However, it is necessary to go beyond linearity when examining the exchange rate linkages. Meese
and Rogoff (1983) reported in their seminal work the failure of linear exchange rate models, and
several more recent studies have provided further evidence against linearity. According to Ma and
Kanas (2000) nonlinear structures may account for bubbles with self-fulfilling expectations
(Blanchard and Watson, 1982), target zone models (Krugman, 1991), nonlinear monetary policies
(Flood and Isard, 1989) and noise trading (Black, 1986). However, empirical studies that tested for
these kinds of nonlinearities have rather failed to support them (Lindberg and Soderlind, 1994).
Hence, we need to resort to a more general framework, provided by wavelet analysis.
The aim of our paper is to test for the existence of linear, nonlinear and spectral causal
relationships among the three most heavily traded currencies (“FX majors”) denoted relative to
the United States dollar (USD), namely the Euro (EUR), Great Britain Pound (GBP) and Japanese
Yen (JPY)8. This is implemented via the use of the wavelet methodology, which reveals the
inherent dynamics across different timescales. The nature and direction of causality is investigated
for each component of the time series as resulting from the wavelet analysis, and it is compared
against the causality results obtained with the original “aggregate” series. The “palette” of tests
utilized covers the major types of causality reported in the literature, namely the linear Granger
test (Granger, 1969) the Baek and Brock (1992) parametric causality test for nonlinear dynamic
causality and the frequency-domain test by Breitung and Candelon (2006). The investigated time
period covers diverse regimes and “extreme” events including the rise and fall of the tech-market
bubble and the financial crisis of 2007-2010. It also includes the EU debt crisis in early 2010,
associated with the widening of bond yield spreads and the rise of credit default swaps,
concerning Eurozone countries such as Greece, Ireland, and Portugal. This crisis had a significant
effect primarily on the USD-EUR but also on the USD-GBP rate, partly due to the high UK trade
deficit and debt. An attempt is made to shed light on the impact of these events on FX market
linkages.
8
The prime motivation for choosing these particular exchange rates comes from them being the most liquid and widely traded currency
pairs in the world. On the spot market, according to the Bank of International Settlements (BIS, 2007), the USD was involved in 86.3% of
transactions, followed by the EUR (37.0%), the JPY (17.0%) and the GBP (15.0%). Volume percentages for all individual currencies
should add up to 200%, as each transaction involves two currencies..
3
The paper contributes to the literature in various ways. First, rather than using several
datasets at different temporal frequencies, we rely on the wavelet decomposition of daily data to
analyze the dependence structure of the FX markets at different time scales. The results provide
evidence of complex heterogeneous dynamics across and within different scales and strongly
indicate that the interactions between FX markets have different characteristics at different
horizons.
Second, we attempt to probe into the micro-foundations of the detected heterogeneity
across time scales, on the basis of trader behavior. We focus on the impact of the actions of longand short-term traders, as it is plausible that they may have different time horizons for different
trading decisions. Once the causality structure is identified from low-to-high frequency, this has
implications for the flow of information across timescales. In one sense, this idea is just a
generalization of Friedman’s original concept as mentioned in Ramsey and Lampart (1998b)9. The
propagation characteristics of the heterogeneous-driven behavior are investigated by studying the
statistical properties of the information flow across scales.
Third, we introduce new practical guidelines in wavelet implementation methodology.
Specifically, we propose an invariant transform that enables point-to-point comparison among all
scales, contains no phase shifts, relaxes the strict assumption of a “dyadic-length” time series and
deals effectively with “boundary effects”. In addition, beyond the existing practice that has utilized
either economic rationale or subjective judgement in considering the appropriate “depth” of the
wavelet analysis, we introduce a new entropy-based methodology to determine the optimal level
of decomposition. Overall, we show that our new approach unveils the dependence structure of
FX markets in succinctly capturing the non-Gaussian dynamic features of currency rates by
simultaneously modeling multiple time horizons.
The paper develops as follows: section 2 briefly describes the wavelet methodology and
elaborates on our proposed shift-invariant discrete wavelet transform. It also proposes new
practical guidelines in wavelet implementation. Section 3 describes the data and provides a
9 Friedman’s idea essentially was to propose a new concept of “horizon” to reconcile the short- and long-run empirical results on the
permanent-income hypothesis (Friedman, 1963).
4
preliminary statistical analysis. Section 4 presents the empirical results of the wavelet analysis of
the foreign exchange rates. Section 5 reviews causality testing and studies causality among both
the aggregate and wavelet-disaggregated FX returns and volatility series. Finally, section 6
summarizes and concludes. A set of Appendices provide more technical material.
2. WAVELET MULTISCALE ANALYSIS
The multiresolution features of the wavelet decomposition can be useful in econometric
analysis. Often, in financial and macroeconomic applications, the main focus is on the long-run
equilibrium relationships and their interaction. Through wavelet decomposition, the lowfrequency content of the data that “captures” the relevant long-run interactions can be extracted,
and the high-frequency fluctuations that distort the underlying market dependencies can be
removed.
Among the plethora of useful properties of wavelets, three other major facets should be
highlighted namely, the ability to handle nonstationarities, the localization in time, and the
decomposition in various timescales10. The evaluation of the effects of time scaling on the
relationships among economic variables is also crucial to the present study. In the following
subsection we introduce an invariant discrete wavelet transform. After that, we suggest new
practical guidelines for wavelet implementation, and expand the literature that has utilized
subjective reasoning in estimating the appropriate “depth” of the resolution, by proposing a new
entropy-based methodology for the determination of the optimal decomposition level. An outline
of necessary introductory concepts in wavelet analysis is provided in the technical Appendix I.
2.1 The Shift-Invariant Discrete Wavelet Transform (SIDWT)
The classical, decimated Discrete Wavelet Transform (DWT) involves subsampling of the
output of the high- and low-pass filters
and
, to half their original length11. This leads to a
10
The nonstationarity describes a broader notion than merely the existence of a unit-root, such as time variation, structural breaks,
singularities as well as locally temporal effects.
11
The
=
(
−
)
=
finite length wavelet filter and the
quadrature mirror relationship
( )
= −
+
− −
=
(
−
)
complement low-pass (scaling) filter satisfy the
− (see technical Appendix I for details)
5
serious drawback, namely the transform is not invariant in the real-axis. Specifically, the DWT of a
shifted signal is not the shifted version of the DWT of the signal12. Alternatively, an undecimated
DWT can be implemented without the subsampling technique13. According to Coifman and
Donoho (1995) undecimated versions of the DWT present some advantages compared to DWT.
Primarily, they can handle any sample size
to a multiple of
, while the
-th order DWT restricts the sample size
. In addition, since the coefficients of an undecimated DWT are associated with
zero phase filters, the original time series is properly aligned with the wavelet components.
Moreover, they are invariant to circularly shifting the time series, a property that does not hold for
the DWT. Finally, the undecimated wavelet variance estimator is asymptotically more efficient
than the DWT estimator (Percival, 1995). In this study, we propose a new variation of the
undecimated DWT, namely the Shift-Invariant DWT (SIDWT).
The
length
(
-length vector of the wavelet coefficients
) is obtained as
=
=
SIDWT is defined as follows: The
ɶ = ɶ , where ɶ is a
organized into
+
(
+
)
(
=
=
−
is a
and ɶ is a
={
}=
with dyadic
according to the DWT (see Appendix I). Formally, the
+
×
)
- length vector of SIDWT coefficients ɶ is obtained as
matrix. The SIDWT coefficient vector, as in DWT,
is
vectors
ɶ = ɶ
where ɶ
for a time series
ɶ
ɶ ɶ
(1)
- length vector of wavelet coefficients associated with the scale of length
- length vector of scaling coefficients corresponding to a length scale of
(
. The direct conversion to DWT could be implemented for a dyadic length
=
)
sample, via subsampling and rescaling of the SIDWT. The converted DWT wavelet coefficients are
=
ɶ
( + )−
with
=
− ,
and
the
scaling
12 Shifting a signal simply means delaying its start in the real-axis. Mathematically, delaying a function is represented by
coefficients
(
−
).
13 Some undecimated versions of the DWT are encountered in the statistical literature, such as the “maximal overlap DWT“ (Percival
and Mofjeld, 1997; Allan, 1966) and the “stationary DWT” (Nason and Silverman, 1995).
6
=
ɶ
=
( + )−
ɶ
the SIDWT matrix
ɶ =ɶ
ɶ
with
rescaled
+
comprises
submatrices of
×
dimension expressed as
ɶ . The SIDWT utilizes the rescaled filters from DWT, ɶ =
ɶ
ɶ =
− . In correspondence to the orthonormal matrix of the DWT,
(
wavelet
ɶ = ɶ( ) ɶ( ) ɶ( )
of DWT submatrix
) . The
=
filter
ɶ(
−
submatrix ɶ is constructed by circularly shifting the
×
ɶ
vector
) ɶ(
−
) ɶ
and
by
integer
units
to
the
right,
i.e.,
and it can be interpreted as the circularly shifted version
. The other matrices ɶ
ɶ
are similarly constructed through replacing
ɶ by ɶ .
The SIDWT implementation algorithm starts with the data
, that is no longer limited to
dyadic length as opposed to the classical “pyramid algorithm” introduced by Mallat (1989), and
filters with ɶ and ɶ to obtain the
-length vectors of wavelet and scaling coefficients ɶ and ɶ ,
yet without utilizing the downsampling operation14. In the first step the data is convolved with
each
filter
to
obtain
the
=
ɶ
wavelet
−
∑ɶ
=
ɶ =
−
∑ɶ
=
where
−
=
−
and
scaling
coefficients
− . The second step of the SIDWT algorithm uses the
“new” data, namely the scaling coefficients ɶ from the previous step, and proceeds with the
application of filtering to obtain the second level of wavelet and scaling coefficients i.e.,
ɶ
=
−
∑
=
ɶ ɶ
−
and ɶ
=
−
∑ɶ
=
ɶ
with
−
=
decomposition is ɶ = ɶ ɶ ɶ . The procedure is repeated up to
− . The resulting
=
( )
-length
times in order to
provide the full vector of SIDWT coefficients in Eq. (1). In the Inverse transform the final-level
wavelet and scaling coefficients are convolved with their respective filters and the resulting vectors
14
See Appendix I for a detailed description of Mallat’s algorithm.
7
are added up. Therefore, the vectors ɶ
produce
ɶ
=
−
the
vector
of
+
∑ɶ
−
∑ɶ
ɶ
=
+
and ɶ of the final level are filtered and combined to
scaling
coefficients
−
ɶ
=
=
where
+
ɶ
−
in
−
level
− . The length of ɶ
−
is the
same as ɶ . The algorithm is repeated until the first level of coefficients produces the original
vector of observations
=
−
∑
ɶ ɶ
=
+
+
−
∑ɶ
ɶ
=
+
with
=
− .
The SIDWT, in the same way as the classical DWT, results in the additive decomposition of
the time series. Let ɶ = ɶ ɶ define the wavelet detail for the SIDWT corresponding to changes
at scale
in the time series
for each observation
=
+
∑ɶ
for the level
=
. The multiscale decomposition is defined
as the linear combination of wavelet SIDWT detail coefficients, i.e.,
− , where ɶ
=
is the
-th element of ɶ
for
=
. Similarly,
=
ɶ =
+
∑
ɶ
is the cumulative sum of the variations of the details and is defined as the
-th level
= +
SIDWT wavelet approximation for
wavelet rough ɶ = ∑ ɶ ,
≤ ≤
≤ ≤
+
with ɶ
+
being a vector of zeros. The
-th level
incorporates the remaining lower-scale details. Overall,
=
the vector of observations may be decomposed as
= ɶ +∑ɶ = ɶ + ɶ
(2)
=
It is emphasized that the SIDWT associates the wavelet coefficients with zero-phase filters, thus the
details and approximations correspond directly to the original sample in perfect alignment.
Percival and Mofjeld (1997) proved that undecimated transforms are energy (variance) preserving
transforms. Thus, SIDWT is an efficient transform and the total variance of the time series is given
by
=∑ ɶ
+ ɶ
.
=
8
2.2 New practical guidelines in wavelet implementation
It was shown that SIDWT is time-invariant as opposed to the classical DWT which exhibits
some translation in time even after applying a “signal extension” process15. Furthermore, SIDWT is
not an orthogonal basis, thus it produces an over-determined (redundant) representation of the
series that has advantages in regards to statistical inference. Because the SIDWT entails zero-phase
filtering, the details and approximation at each timescale contain the same number of observations
and line up in time with the original series. This property makes SIDWT a particularly useful tool
in the analysis of time-dependent processes.
The selection of a particular wavelet filter class is not trivial in practice and depends upon
the complexity of the spectral density function and the underlying features of the data in the time
domain.
If the spectral density is dynamic, longer filters should be employed in order to
distinguish the frequency activity between scales. Optimally, in most data sets a balance between
frequency localization and time localization should be pursued. According to Gençay et al. (2001)
and Gençay et al. (2002), a moderate length wavelet filter (e.g., length eight) adequately captures
the stylized features of financial data. Moreover, in case the wavelet filter bears no “similarity”
with the underlying features, then the decomposition will be quite inefficient. Given that the
wavelet basis functions are used to represent the information contained in the time series, they
should “mimic” its underlying features16. Usually, smoothness and (a)symmetry are the most
crucial factors in selecting suitable wavelet basis functions (Gençay et al., 2002; Ramsey and
Lampart, 1998b). The SIDWT coefficients in this study are calculated from the Daubechies family
of compactly supported wavelet filters, which are well localized in time (Daubechies, 1992).
Specifically, the Daubechies filter of length eight, (db8) is selected in order to balance smoothness,
length and symmetry (Jensen and Whitcher, 2000; Gençay et al., 2001). This is a widely used
wavelet and is applicable in a wide variety of data types. It achieves an “ideal compromise”
15
In order to deal with time series of non-dyadic length, a “signal extension” process is usually employed for DWT, which involves
"padding" the time series with values and increase its length to the next power of two. Ogden (1997) reports various methods such as
padding with zeros, using higher-order polynomials, periodic extension, and numerical integration.
16
For instance, if the data appear to be constructed of piecewise linear functions, then the Haar wavelet may be the most appropriate
choice, while if the data is fairly smooth, then a longer filter such as the Daubechies asymmetric wavelet filter may be desired (see
Appendix I for a description of Haar and Daubechies filter families).
9
between competing requirements in that it has reasonably narrow support, is fairly smooth, is
twice differential, nearly symmetric and has a moderate degree of flexibility17.
Furthermore, the application of the DWT to finite-length time series brings up the crucial
issue of “boundary distortions”, which concerns the problematic estimation of the remaining
wavelet coefficients when the end of the series is encountered in the wavelet transform18. To deal
with boundary effects in this study, SIDWT employs a specialized “periodic extension” pattern.
Specifically, if the series length is odd, the series is first extended by adding an extra-sample equal
to the last value on the right. Then a minimal periodic extension is performed on each side
(Pesquet et al., 1996). The extension mode used for the inverse SIDWT is the same to ensure a
perfect reconstruction. In addition, using these boundary coefficients, the SIDWT retains its
numerically stability (Herley, 1995).
Finally, in the literature the depth (level) of the multiscale wavelet decomposition is usually
determined arbitrarily or based on some subjective (economic) rationale with regard to the
examined time scales. Alternatively in this study the optimal level of multiscale decomposition is
pursued with respect to the minimization of the Shannon entropy-related criterion19. It is estimated
on the basis of the sample length, the selected wavelet class and the boundary-distortion method.
The entropy of each level is estimated step-wise and it is compared with the one from the previous
level. If it is decreased then the new decomposition “reveals” interesting, non-redundant
information and the decomposition continues (Coifman and Wickerhauser, 1992). The optimal
level is determined at the minimum value of the entropy-related criterion20. In the following
expressions
is the signal and
represents the details and the
-th level approximation
17
Alternative choices of wavelet classes were also applied in the empirical study, but the results were very robust to such changes and
the current selection appeared to be the most balanced.
18
Although various theoretical methods are available to tackle with this issue, they are rather inefficient from a practical viewpoint
(Cohen et al., 1993). A common technique applied in Fourier analysis involves the entire series to be duplicated around the last obs. This
may be reasonable for some series with strong seasonal effects, but cannot be applied in general (Strang and Nguyen, 1996).
19
Classical entropy-based criteria describe information-relevant properties for an accurate representation of a given signal.
20 The Shannon criterion shows a downward trend until a minimum value-corresponding to a “threshold” scale level-is reached and
then it begins to rise revealing that further signal decomposition “contains” redundant information. The maximum level of
decomposition tried in this study is ten, based on the “translation” of the wavelet scales into economic time horizons, as mentioned in
the empirical section.
10
coefficient of
=
for scales
function such that
=
in an orthonormal basis21. The entropy E must be an additive
and
=∑
. The Shannon entropy for the coefficients in each
level is defined as
=−
= −∑
and thus for the entire signal it is
⋅
⋅
(3)
, with the convention
⋅
= .
3. DATA DESCRIPTION AND PRELIMINARY ANALYSIS
The data comprise three time series of daily closing currency rates denoted relative to
United States dollar (USD), namely Euro (EUR), Great Britain Pound (GBP) and Japanese Yen
(JPY). The exact ratios represent EUR/USD, GBP/USD and USD/JPY22. The foreign exchange
returns are defined as
=
( )−
(
−
) , where
is the closing level on day
volatility series is defined as the absolute value of the returns
=
, while the
as in Jensen and Whitcher
(2000) and Gencay et al. ( 2002) . The data span a time period from January 5, 1999 to May 10, 2010
(2960 observations), namely from the introduction of the Euro until the EU ministers and the ECB
agreed on a program of bond purchases and an unprecedented defence package of 750 billion€,
with the contribution of the IMF, in order to deal with the 2010 Eurozone sovereign-debt crisis.
Moreover, the robustness of the results is examined in sub-periods based on economic rationale as
well as identified by the application of stability tests for structural breakpoints. Specifically, three
breakpoints are initially considered for the identification of the sub-periods, hence setting a
platform for departure for causality tests. The first structural break is March 10, 2000 and
corresponds to the date when the dot-com bubble “burst” (Greenspan, 2007). On that day the
technology NASDAQ Composite index peaked at 5,048.62 (intra-day peak 5,132.52), more than
double its value just a year before. Moreover, the financial crisis of 2007-2010 is examined. It was
21 Based on the wavelet decomposition, the reconstructed signal comprises the
-th level wavelet approximation and the details in all
levels. Consequently, these are used to estimate the Shannon entropy criterion.
22 These are the most liquid and widely traded currency pairs in the world (“FX majors”) with 27% market turnover share for
EUR/USD, 13% for USD/JPY and 12% for GBP/USD (BIS Triennial Survey, 2007).
11
triggered by a liquidity shortfall in the US banking system, which resulted in the collapse of large
financial institutions, the "bail out" of banks by national governments and turbulence in stock
markets around the world (Krugman, 2009). The crisis began to affect the financial sector in
February 22, 2007, when HSBC, the world's largest bank of 2008, wrote down its holdings of
subprime-related mortgage-backed-securities by $10.5 billion, the first major subprime related loss
to be reported. This particular date is used as the second breakpoint23. Finally, the EU sovereign
debt crisis in the end of 2009 is also investigated. It led to a crisis of confidence as well as the
widening of bond yield spreads and rise of credit default swaps for Eurozone countries such as
Greece, Ireland and Portugal, which further intensified the fear of a global contagion24. The crisis
deepened towards the end of 2009 when there was an abrupt increase in the spreads due to the
downgrading of Greece's credit rating by all three major international credit agencies (Fitch,
Moody's and S&P). In this paper December 8, 2009 is set as the third breakpoint, corresponding to
the first Greek rating cut by Fitch.
In addition to the economic rationale, the breakpoint selection is statistically tested via the
application of Chow's test (Chow, 1960) for known (imposed) breaks and the cumulative sum
(CUSUM) test (Brown et al., 1975) for unknown points. These tests are sequentially applied both on
return series and on absolute returns of each currency to investigate also for volatility breaks
(McConnell and Perez-Quiros, 2000; van Dijk et al., 2005). The results from the Chow’s test,
indicate that in the majority of cases for the FX returns one breakpoint is statistically identified at
the 5% level of significance, namely February 22, 2007. The CUSUM test also detects parameter
instability at the 5% significance level mostly around the same breakpoint. In case of all volatility
series, the null of no structural change can be rejected for both Chow’s and CUSUM test at the 1%
level. The structural break of February 22, 2007 is finally selected for the return and volatility series
23
On September 15, 2008, the Lehman Brothers Holdings filed for bankruptcy following drastic losses in its stock and devaluation of its
assets by credit rating agencies. The filing marked the largest bankruptcy in U.S. history.
24
At the beginning of 2010, a failed bond auction in Portugal increased the fear of a Portuguese debt default (Blackstone et al., 2010).
The euro was weakened and a global stock and commodity sell off occurred in February 2010 and the following months. Greece was the
focal point of the crisis. The Greek government searched for a bailout plan in case it failed to raise the money to fill its budget gap
through the credit markets. The provided plan failed to reassure investors, thus leading to an agreement on a defence package of 750
billion€ by the EU and the IMF, in order to prevent the relentless speculative attacks on the euro and eventually restore stability.
12
of all FX rates, thus combining statistical and economic rationale25. Overall, the examined subperiods are the following: P1: January 5, 1999 to February 21, 2007 (2122 obs.), P2: February 22, 2007
to May 10, 2010 (838 obs.). In addition, the entire sample period PTotal: January 5, 1999 to May 10,
2010 (2960 obs.) is comparatively investigated.
The descriptive statistics for the return and volatility series of each currency are presented
in Table 1. The return series are zero mean-reverting with low corresponding variance. The JarqueBera statistic for all FX rates in all periods is statistically significant, thereby implying that the
return distributions are not normal. In general, kurtosis for returns in all periods is larger than
normal, which indicates the presence of fat tails, extreme observations and possibly volatility
clustering. Kurtosis is also significantly higher than normal for the distribution of the absolute
returns for all currencies. As indicated by skewness, EUR return series are symmetric while JPY
and GBP have a longer left tail. The volatility series are not normally distributed with a fat right
tail. Based on the Ljung-Box Q-statistic, the hypothesis that all correlation coefficients of the
returns up to 12 are jointly zero is rejected in the majority of cases. Therefore, it can be inferred that
the return series present some linear dependence. In addition, the statistically significant serial
correlations in the volatility series imply nonlinear dependence due possibly to clustering effects or
conditional heteroscedasticity. The differences between the two periods P1 and P2 are quite evident
in Table 1, where a significant increase in variance can be observed in P2 for all exchange rates as
well as increased fat-tailedness of the return and volatility distributions reflected in the higher
kurtosis. Additionally, P2 witnessed many occasional negative spikes as it can be inferred from the
25 In Chow's breakpoint testing, all possible calendar combinations are examined, i.e. one imposed breakpoint of March 10, 2000,
February 22, 2007 or December 8, 2009 separately, then two points (3 cases) and finally all three points/dates of structural change. The
Chow test in this paper uses the methodology of McConnell and Perez-Quiros (2000) who estimate an AR(1) model with a constant for
each sub-sample separately, to see whether there are significant differences in the estimated equations. Two statistics for the Chow test
are used, namely the log-likelihood ratio ( χ ) and the F-statistic, which are both based on the comparison of the restricted and
unrestricted sum of squared residuals. For EUR, GBP and JPY, the null hypothesis of no structural change for the one break of February
22, 2007 as well as for the specific case of the two breaks of February 22, 2007 and December 8, 2009, is rejected at the 5% (for GBP is
rejected at 10% significance level). The CUSUM test is based on the cumulative sum of the recursive residuals. In case of EUR and GBP
rates it detects parameter instability at 5% level though marginally, around the region of the February 22, 2007 break (e.g., 1950 – 2150
observations). For JPY no structural change is observed. Regarding all volatility series, the Chow test rejects the null of no structural
change at 1% level for the February 22, 2007 breakpoint. Moreover, it does not reject the null hypothesis for the one break of December
8, 2009 for all currencies. In addition, for EUR it rejects the null for all date combinations not including March 10, 2000 and for GBP and
JPY, not including December 8, 2009. Finally, the CUSUM test strongly detects parameter instability for all currencies around the
February 22, 2007 breakpoint. The selected breakpoints have also been verified with the Bai and Perron (2003) and Andrews and Zivot
(1992) tests for unknown dates. These results are available upon request.
13
skewness of GBP and JPY returns, as opposed to a longer right tail for the EUR. Volatility series
also present more spikes in P2.
Table 1 also reports the contemporaneous correlation matrix for each period. Significant
sample cross-correlations are noted for EUR and GBP returns, indicating a high interrelationship
between the two markets. JPY shows a low negative correlation (or uncorrelatedness in P2) with
EUR and GBP returns, while all volatility series are positively correlated for all currencies in both
periods. However, since linear correlations cannot be expected to fully capture the linear/nonlinear
linkages in a reliable way, these results should be interpreted with caution. Consequently, what is
needed is a detailed causality analysis, conducted both at the aggregate foreign exchange rates and
on each of the wavelet components. In the following section we present the results of the wavelet
analysis, while in section 6 we focus on the causality analysis.
4. A WAVELET ANALYSIS OF THE FOREIGN EXCHANGE RATES
As the evaluation of the “scaling” effects on the relationships among FX markets is
fundamental to the present study, in this section we offer a thorough investigation of the FX
market dynamics across and within all scales both for returns and volatility via the wavelet
multiscale decomposition analysis introduced in section 2. As mentioned, this technique allows for
locally temporal effects, sharp cusps, structural breaks, time variation and regime switches. We
then identify the impact of changes in long term dynamics, and analyze the implications for the
flow of information across time scales.
4.1 Minimum entropy wavelet decomposition
The results of the optimal minimum-entropy decomposition for the FX returns and
volatility are presented in Table 3. The entropy in each level is compared to that of the raw time
series and to that estimated at the previous level. In most cases for the FX returns the optimal
decomposition level is the seventh, while for the volatility series the minimum value of the
Shannon entropy criterion is calculated at the fourth scale26.
26 The 8th scale is the optimal for EUR returns in period P1 and the 6th for GBP in P2 and JPY in Ptotal. Regarding the volatility series, the
3rd scale provides the minimum entropy for JPY in P2.
14
Additionally, economic reasons can also be identified in considering the appropriate
“depth” of the wavelet analysis, based also on the “translation” of the wavelet scales into time
horizons. Table 4 “translates” the wavelet scales into appropriate time horizons, thus providing
insight on the relation between SIDWT levels and time scales for the time series. Each scale
corresponds to a frequency interval and it is associated with a range of time horizons that span
from several days to one year. For instance, the detail
8 days (0.8-1.6 weeks), while
is associated with a frequency range of 4-
with approximately one month. Scale level
=
corresponds to a
cycle length between 2.1 to 4.3 quarters, or equally between a semester and a yearly variation.
Thereafter the notation
(and not the ɶ used in Section 2) corresponds to the SIDWT details, to
enhance readability.
To sum up, the FX returns series are decomposed at scale level
= , therefore
“containing” up to yearly frequencies, while the volatility series are analyzed up to the
=
scale,
which is associated with a frequency range of 0.8-1.6 weeks. Also in economic terms it is
reasonable to investigate causality relationships for the returns from daily to yearly frequencies,
whereas up to monthly variations for the volatility. In real world applications, quarterly or yearly
volatility is not interesting for the economic analysis of high-frequency (daily) FX series, nor
“traded” in FX markets, as opposed to daily, weekly and monthly volatility. On the contrary, the
causality analysis of the returns up to yearly variations can be very useful in detecting FX market
linkages with macroeconomic fundamentals and in producing multi-step ahead return or price
forecasts.
4.2 Scale-dependent descriptive analysis
Figures 1-2 present the SIDWT wavelet approximation and details estimated from the
Daubechies (db8) class, for the returns and volatility series on EUR. Figures III.1-4 in the Appendix
III provide the same information for the other currencies. To distill information from the wavelet
components, it is crucial to recall that nonzero wavelet coefficients correspond to activity in a
particular range of frequencies over time. Consequently, when the details are rapidly changing,
15
this implies that its corresponding frequency interval contains important information about the
original process27.
For the EUR return series, for all periods there are no significant differences in high- and
low-scale dynamics. All components display a non-cyclical pattern with fairly low oscillation
amplitude. Essentially, there is no notable “activity” in high scales at all levels, which can be
interpreted as a direct result of the trend-removal procedure, albeit in P2 the return fluctuations are
slightly amplified after the first quarter of 2008, i.e. after entering the financial crisis period. A
similar regime switch is also observed in the details of the PTotal and P1. The increased variability is
mostly evident in detail
of the EUR absolute return series in P2, which is associated with
oscillations of 2-4 days period length, but also in the second, third and fourth scale corresponding
to oscillations with a period of approximately 1 week, 1.6-3.2 weeks and 0.8-1.6 months
respectively. Additionally, in PTotal for the EUR volatility there is an increased variability pattern in
the high frequency harmonic (first detail) associated with 2-4 days, which might be interpreted as
the dominant market dynamics and can be attributed to traders with short-term trading horizons.
The regime switch appearing in the EUR return details immediately after the crisis burst is
also depicted in the P2 details of the EUR volatility28. Interestingly, persistent oscillations are
present in all detail components of the EUR volatility in P1, indicating a near-cyclical pattern in low
scales for the pre-crisis period and probably “neutral” mean-reverting trading behavior. This is
also depicted in the
approximation in the volatility series in P1, but not in P2 (and consequently
not at the end of PTotal), where a break in the long-run trend component signifies the entry in the
high volatility regime of the 2007-2010 financial crisis. The same applies for the
low frequency
component of the EUR return series, yet in a smaller extent.
27
When viewing the SIDWT decomposition, it is evident that the wavelet details display a complicated structure that cannot be
attributable to an oscillation at a single frequency. This is due to the fact that the underlying spectrum of these processes is rapidly and
dynamically changing within the frequency intervals induced by the wavelet transform.
28 The occurrence of the structural changes in the wavelet approximations mentioned throughout Section 9 have also been tested with
the Chow's test (Chow, 1960) for known (imposed) breaks, with the cumulative sum (CUSUM) test (Brown et al., 1975) for unknown
points as well as with the Bai and Perron (2003) and Andrews and Zivot (1992) tests. In addition for the details, the switching regimes
have been verified via a Markov-switching model with two regimes. In each regime an AR(1) specification was used.
16
The qualitative characteristics of the low- and high-frequency components of GBP returns
and volatility are directly comparable and analogous to that of EUR. Finally, the Japanese currency
market has almost no high-scale dynamics and only the wavelet scales (
-
) of the JPY
volatility exhibit some activity in P2. The latter demonstrates that agents with short trading
horizons (daily-monthly) are mostly affected by the crisis break.
4.3 Heterogeneous market dynamics and micro-foundations
The motivation behind the investigation of “vertical” (across scales) heterogeneity in the
variability pattern comes from comparative observation. For example, for JPY returns (Figure III.3)
the first two finest scales mostly affect the dynamics appearing in the raw data, while for EUR
(Figure 1) and GBP (Figure III.1) all scales seem to contribute to the raw series variability. Likewise,
the detail
of the GBP volatility (Figure III.2) in all periods dominates the aggregate raw series
oscillation amplitude, whereas other frequency components embed lower information. It is also
noticeable that a low frequency shock (displayed in the long-run approximation wavelet
component), might lead to a high frequency response by a short time span, as in the case of the
crisis emergence depicted in the GBP and JPY volatility series (Figures III.2 and III.4). This vertical
heterogeneity suggests the presence of trader behavior with different time horizons. At the highest
approximation scale the trading mechanism “comprises” fundamentalists who trade on longer
time horizons. Then, at low scales short-term traders and market makers operate with time
horizons of a few days up to a month. Each trader class may possess a homogeneous behavior, but
it is the combination of these classes in all scales that generates the aggregate time series.
Therefore, the underlying dynamics are heterogeneous due to the interaction of all trader classes at
different time scales. In such a market, a low-frequency shock infiltrates through all scales, while a
high-frequency shock runs out quickly and might have no impact whatsoever in the long-run
dynamics. Probably, a characteristic example of a low-frequency shock which penetrated all scales
and market “behaviors” is that of the Eurozone sovereign debt crisis observed in all wavelet
components of the examined currency series.
17
Another aspect worthy of investigation is the scale-dependent duration of regime switches.
Specifically, a high volatility regime initiated by a market information flow appears to persist
longer at the lower frequency associated also with longer trading horizons, as opposed to a highfrequency horizon. For example, this is demonstrated for the JPY volatility in PTotal and P2. Overall,
the duration of regimes seems to be longer for high-scale (low-frequency) trading horizons,
whereas low-scale behavior results in short and frequent regime switching.
Moreover, the “vertical” causality of the regime structure is one-way, in that a low regime
variability state at low frequencies identically affects the oscillation state at higher frequencies.
Indeed, the results in section 4.2 indicate that if e.g., a low volatility regime is observed at a
monthly frequency, it is more likely that there is also a low volatility pattern at the weekly or daily
scale. On the contrary, high variability at a low frequency does not necessarily entail a high
volatility at higher frequencies (e.g., as in the case of JPY market in P1). This result is in accordance
with the empirical evidence that markets “cool off” after a shock at higher frequencies in a much
shorter period than after a lower-frequency, “structural” change.
4.4 Impact of extreme events and structural breakpoints across time scales
The impact of the stock market dot-com bubble is initially investigated, in particular after
the breakpoint of March 10, 2000 (obs. 309 of PTotal). The estimated sequence of wavelet
approximations and details, as depicted in Figures 1-2 for the EUR and III.1-4 (Appendix III) for
GBP and JPY, indicates that the currency markets were not affected seriously by the tech-bubble
“burst” which partly coincided with the after-Euro era. The volatility regime in all FX rates is
relatively “flat” across the scales.
Another extreme event was the terrorist attack at the World Trade Center on September 11,
2001 (obs. 701), which lead to a sharp drop in stock prices worldwide. The analysis of the impact
on the currency markets in all scales (both on returns and volatility series) reveals that it was a
relatively short one with practically imperceptible consequences, thus no evidence of contagion
across scales was observed.
18
Next, the financial crisis of 2007-2010 occurred, starting from the HSBC write-down in
February 22, 2007 (obs. 2123). A close assessment of the wavelet components in Figures 1-2 and
III.1-4 indicates that the effect of the subprime crisis is not as evident in currency markets as in the
stock markets, except for the case of the JPY volatility. Due to the structural break in the long-run
component (
), the Japanese market is gradually entering into a high volatility regime
(associated with all four scales,
-
).
Finally, the Eurozone sovereign debt crisis after December 8, 2009 and during 2010 is
analyzed. The first Greek credit rating cut by Fitch corresponds to obs. 2851 in PTotal. Interestingly,
it can be inferred that international FX markets are experiencing since then a large ongoing
turmoil. The estimated wavelet components at all scales clearly indicate that the EUR, GBP and
JPY markets entered into a high volatility state since the end of 2008. Moreover, the high volatility
state is not uniform across the scales; at lower scales and especially at the finest scale, the time span
of the regime becomes wider29. This could be safely considered as a warning, precursor signal of an
escalating crisis.
5. TIMESCALE CAUSALITY INVESTIGATION
5.1 Causality testing
We perform causality detection via the Granger test, the modified Baek-Brock test and the
Breitung-Candelon test. The conventional approach of causality testing is based on the Granger
test (Granger, 1969), which assumes a parametric, linear model for the conditional mean. This
specification is simple and appealing as the test is reduced to determining whether the lags of one
examined variable significantly enter into the equation of the other, albeit it requires the linearity
assumption. Baek and Brock (1992) noted that the parametric linear Granger causality test has low
power against certain nonlinear alternatives or higher moments. As a result, nonparametric
29
For example, for return and volatility detail
(approximately 2-4 days) of all currencies in periods PTotal and P2, the estimated
wavelet coefficients display a high volatility regime all through the end of the sample. At scales 2-4 (0.8-1.6 weeks to 0.8-1.6 months) the
high state is observed with smaller amplitude. In other words, for short-term traders the currency turbulence continues within 2010,
whereas for longer horizon traders or investors, the turmoil mostly lasts from 2009 until the first quarter of 2010. In fact, there were
several bursts of different oscillation range and amplitude since the end of 2008 and during the period of the Eurozone crisis.
19
causality tests have been introduced in the literature, directly focusing on predictive power
without imposing a linear functional form. Hiemstra and Jones (1994) proposed a causality-inprobability test for nonlinear dynamic relationships which is applied to the residuals of vector
autoregressions and it is based on the conditional correlation integrals of lead–lag vectors of the
variables. This test relaxes Baek and Brock’s assumption of i.i.d time series and instead allows each
series to display weak (or short-term) temporal dependence. It can detect the nonlinear causal
relationship between variables by testing whether past values influence present and future values.
Finally, a test for causality-in-frequency (spectral causality) is also applied. Geweke (1982) and
Hosoya (1991) originally proposed a causality measure based on the decomposition of the spectral
density, while Yao and Hosoya (2000) developed a Wald-type test procedure based on a
complicated set of nonlinear restrictions on the parameters of vector autoregressions. In a latter
study, Yao and Hosoya (2000) applied a numerical method to estimate the nonlinear function of
the autoregressive parameters and the asymptotic covariance matrix. Recently, Breitung and
Candelon (2006) proposed a simplified test procedure that is based on a set of linear hypotheses on
the parameters of a bivariate vector autoregressive model. It allows testing for short- and long-run
causality at a specified range of frequencies. The test by Breitung and Candelon (2006), as well as
those by Geweke (1982) and Hosoya (1991) upon which the Breitung and Candelon test is based,
provides very good causality results – in terms of size and power properties – only at some prespecified frequency range, which depends on the input data frequency. Applying wavelet analysis
could provide an efficient means of overcoming the constraint of reaching a threshold in the lowest
possible frequency investigated, namely probing further in the “long-run” behavior (i.e., in this
study reaching closer to business cycle fluctuations). The three causality tests are formally
described in Appendix II.
5.2 Empirical analysis
The proposed empirical analysis involves three steps. In the first step, the short- and longrun spectral causal relationships are explored at a pre-specified range of frequencies applying the
Breitung-Candelon test on the aggregate log-differenced time series. Next, the Granger causality
20
test is employed on the original and on the “disaggregated” wavelet components in order to
investigate the linear dynamic linkages at various scales. Lastly, bivariate vector autoregressive
filtering is implemented on the raw and decomposed series and the residuals are examined
pairwise by the nonparametric modified Baek-Brock test. Thus, the nature and direction of
causality explored for each scale component of the return and volatility series is compared against
the causality results of the aggregate series.
Linear and spectral causality are investigated in a VAR representation. VAR modelling is
also applied for the volatility series in accordance with previous results derived by Nikkinen et al.
(2006)30. The results from the SIC criterion, taking into consideration many lag specifications for the
bivariate VAR modelling as in Engel and West (2005), indicate in most of the cases two lags for the
FX return series and their wavelet components in all periods. Similarly, four lags are chosen for the
volatility series and the corresponding components. Finally, for the nonlinear causality test, in
what follows the common lag lengths used are ℓ X = ℓ Y = 1 . The test is applied on the VAR
residuals derived from the pairwise linear causality testing and the distance measure is set to
ε=
, as suggested by Hiemstra and Jones (1994)31.
The FX multiscale causality results from all tests employed in the study are reported in
Tables 5 and 6. The simplifying notation “ ** ” is used to indicate that the corresponding p-value of
a particular causality test is smaller than 1% and “ * ” that the p-value of a test is in the range 1-5%.
This was necessitated in order to overcome the difficulty of presenting large tables with numbers.
30
The results from testing nonstationarity with the Augmented Dickey-Fuller (ADF) and Phillips-Perron (PP) unit root tests show that
processes, whilst the returns and volatility series are stationary. In addition, the trace and maximum
FX log-levels log-levels are
eigenvalue statistics (Johansen, 1988; Johansen and Juselius, 1990) applied on the log-prices series did not identified any cointegrating
vectors and the null of no cointegration was not rejected (Table 2). The lag lengths for testing nonstationarity were selected using the
Schwartz Bayesian Information Criterion (SIC), while for the PP test the bandwidth was automatically selected using Newey and West
(1994) method with Bartlett kernel. ADF and PP tests indicate that the null of a unit root cannot be rejected at 1% for all currency loglevels in all periods, regardless of whether a constant and linear trend or only a constant is included in the deterministic component.
Furthermore, both tests show that the log-returns and volatility series are stationary as the null can be soundly rejected for all currencies
and periods. Due to the nature of volatility, it is assumed that there is no time trend in the series in the long run (Nikkinen et al, 2006).
However, the unit root tests were also performed with a time trend and the results remain unchanged. Moreover, the test results are
generally not sensitive to the number of lags used. Based on these results and in order to identify the correct model specification for the
investigation of linear and spectral causality (i.e., VAR or VECM), the trace and maximum eigenvalue statistics were further applied to
the log-prices series to explore possible effects of cointegration. For all pairs the Johansen tests did not identified any cointegrating
vectors and the null of no cointegration was not rejected (Table 2). Thus, linear and spectral causality are investigated with a VAR
representation.
31 In the estimation
ε=
and ε =
were also considered, with no qualitative difference in the results. In addition, evidence from
the second and third common lag lengths did not significantly modified the nonlinear causality results.
21
Directional causalities in the text are denoted by the functional representation “ → ”. The causality
linkages are also depicted diagrammatically in Figures 3 and 4 where strong unidirectional or
bidirectional causality (“ ** ”) is denoted by a double one-sided or two-sided arrow respectively. In
addition, the corresponding results on the cross-correlation of the return and volatility wavelet
components are displayed in Table 7, while the spectral causal relationships of the BreitungCandelon test are graphically illustrated in Figures III.5 and III.6 (Appendix III). These figures
report the test statistics along with their 5% and 1% critical values (broken lines) for all frequencies
in the x-axis interval π as in Breitung and Candelon (2006)32. This interval, based on the
frequency of the input raw data (i.e., daily), corresponds to a frequency range from 1 day to 16
days, or
−
months. The right value represents the lowest frequency upon which the
Breitung-Candelon test can infer on causality. Hence, the results of the frequency-domain test for
the aggregate series can be comparatively analyzed against those of up to the
scale for the
return and volatility series.
The hypothesis of no causality for the Breitung-Candelon spectral test in case of the
unidirectional EUR→GBP return relationship is rejected at the 5% level in the period P1 for
frequencies in the x-axis
, corresponding to a range of 1-2 days or roughly to scale
wavelet decomposition. In the same period JPY causes GBP for return series in
corresponds to a frequency range of 3-6 days or scale
in the
, which
. The volatility series for P1 reveal a strong
causal linkage of JPY→EUR (1% significance level) in the range of 8-16 days directly associated
with the
wavelet scale. In all other periods (P2 and PTotal) after investigating volatility series,
strong causal relationships are detected (1% level) among all currencies mostly in the frequency
range of
, or 6-16 days (comparable to
). Instead, inferring upon the returns, the results
vary across periods. In P2, a bidirectional linkage GBP↔JPY emerges at the 1% level at the range of
2-8 days and a univariate relationship JPY→EUR
(3-16 days), while in PTotal causalities run from
32 It follows from Breitung and Candelon (2006) that for frequencies in
( π)
the effect of the coefficient
power of the test is minimal and the empirical power is very close to the asymptotic power.
22
(section 6.3) in the size and
GBP to JPY (3-16 days) and EUR (6-8 days) as well as from JPY to EUR (3-8 days) at weaker
significance levels.
Next, the analysis is unfolded on the basis of each wavelet scale. The long-run linkages
exhibit the following characteristics: in case of returns, linear and nonlinear feedback relationships
are observed for the
component in all periods except for the absence of the EUR-JPY causality
in P2. The volatility component
provides strong statistical evidence on linear and nonlinear
bidirectional linkages in all periods, with the exception of P2 when only EUR↔GBP is observed.
The correlation of the long-run component is stronger (positively or negatively higher) for plain
and absolute returns compared to the one of the details, for the majority of the examined currency
pairs. The low-frequency components
-
for the return series present identical
interdependencies. Specifically, the causalities run from EUR to GBP in P1, in both directions for
GBP-JPY in P2, while they “add-up” in PTotal. There is also evidence of unidirectional nonlinear
relationships from EUR to GBP and JPY in P1, and strong feedback nonlinear links in P2 and PTotal
for all pairs. Instead, cross-correlation in these high scales is positively high only for EUR-GBP,
whereas around -0.3 for the other two pairs. Furthermore, the third and fourth wavelet scales (
and
) display the same features in terms of investigated causalities. Firstly regarding returns,
linear unidirectional linkages are observed from EUR to JPY and GBP in P1 and a bidirectional
relationship for the GBP-JPY pair in P2. In PTotal all links observed in P1 and P2 exist. The nonlinear
feedback causalities for all pairs observed in P2 are identical to the ones in PTotal, while in P1 a
sequential pattern emerges from EUR to JPY, to GBP and back to EUR
absolute returns, the
and
Secondly, in terms of
scales reveal absence of linear causality in P1, and only a strong
bidirectional link EUR↔JPY in P2 and PTotal. The nonlinear linkages EUR→GBP and EUR→JPY are
observed in P1, while all currencies present bidirectional causalities in the other two periods. The
correlation at
and
scales is similar to the one observed for raw series, but different compared
to the long-run component. At the second wavelet scale
linear bidirectional relationships exist
in all periods, as well as nonlinear bidirectional causalities for all pairs. In volatility series, no
23
currency Granger causes the other in P1 and only a strong EUR↔JPY link is observed in the other
periods. Again, nonlinear links dominate PTotal and P2 , while EUR causes GBP and JPY in P1.
Finally, at the finest scale
(highest frequency) the same as in
applies for the volatility series
regarding the nature and direction of the interdependencies. In the case of returns, a feedback
relationship emerges for the EUR-JPY and GBP-JPY pairs in P2 and PTotal, whereas nonlinear
linkages appear for all pairs, with the exception of GBP-JPY in P1. In addition, the cross-correlation
of the wavelet components for the two finest scales both for returns and volatility, has
approximately the same value as in
, albeit somewhat lower for the more “noisy” scale
(highest frequencies).
Overall, the evidence provided in this section leads to the conclusion that spillovers and
interactions between FX markets have different characteristics at different timescales33. Especially
when the nonlinear effects are accounted for, the evidence of dynamical bidirectional causality
implies that the pattern of leads and lags changes over time. The market agents filter information
relevant to their positions as new information arrives and, at any time point, one FX market may
lead the other and vice versa. Overall, there is no indication of a “global causality” behavior or a
“prevailing pattern” of interdependencies dominating at all scales.
6. CONCLUSIONS
Multiscale wavelet decomposition could become a valuable means of exploring the
complex dynamics of economic time series, as it allows for temporal and frequency analysis at the
same time. In contrast to simple disaggregation at different time horizons, this study relied on
wavelet multiresolution to analyze the controversial issue of the dependence structure of the FX
markets. The aim of the paper was to test for the existence of causal relationships among the most
liquid and widely traded currencies in the world (“FX majors”), namely the EUR, GBP and JPY.
The nature and direction of causality was investigated for each component of the raw return and
volatility series corresponding to a different sampling frequency and was compared against the
33 The inferred results seem also to corroborate with Genҫay et al (2002) and Ramsey and Lampart (1998a) on multi-scale linkages
between macroeconomic variables.
24
results of the original aggregate series. Causality detection was performed with the use of the
linear Granger test, the modified Baek-Brock test for nonlinear causality and the frequency domain
Breitung-Candelon test. The explored period, starting from the introduction of the Euro, covers
diverse regimes including the rise and fall of the “dot-com” bubble, the financial crisis of 2007-2010
and the Eurozone sovereign debt crisis in early 2010. The timescale causality investigation,
including wavelet cross-scale correlation, proceeded complementarily albeit distinctly both on the
basis of period as well as of wavelet scale. It involved the comparative examination of the
aggregate versus component-based causality, of linear vis-à-vis nonlinear and spectral causality as
well as of short- versus long-run linkages.
Moreover, this study attempted to probe into the micro-foundations of across-scale
heterogeneity in the variability pattern, on the basis of trader behavior with different time horizons
and information flow across time scales. The trading pattern of fundamentalists is reflected at the
highest approximation wavelet scale, while at lower scales short-term traders and market makers
operate. Each trader class may possess a homogeneous behavior, but the aggregate underlying
market dynamics are heterogeneous due to the interaction of all trader classes at different time
scales. In such a market, a low-frequency shock infiltrates through all scales, while a highfrequency shock runs out quickly and might have no impact whatsoever in the long-run dynamics.
The propagation properties of this heterogeneous-driven behavior were investigated, the causality
structure from low-to-high frequency was identified, and the implications for the flow of
information across time scales in the FX markets were inferred. In addition, the scale-dependent
duration of regime switches was highlighted. Specifically, a high volatility regime initiated by a
market information flow appeared to persist longer at the lower frequency associated also with
longer trading horizons, as opposed to a high-frequency horizon. Finally, an asymmetry in
volatility dependence across different time horizons was identified as an important stylized
property. The across-scale causality of the various regime structures is one-way, in that a low
regime variability state at low frequencies identically affected the oscillation state at higher
frequencies. On the contrary, high variability at a low frequency did not necessarily entail a high
25
volatility at higher frequencies. This result is in accordance with the empirical evidence that
markets “cool off” after a shock at higher frequencies in a much shorter period than after a
significant structural change.
In technical terms, the present work introduced new practical guidelines for wavelet
implementation and expanded the relevant literature by presenting an invariant discrete wavelet
transform that contains no phase shifts, relaxes the assumption of a “dyadic-length” time series,
enables multi-scale point-to-point comparison and copes effectively with “boundary effects”.
Beyond the existing practice that has utilized subjective judgement or economic reasoning in
estimating the appropriate “depth” of the wavelet analysis, a new entropy-based methodology
was introduced for the determination of the optimal decomposition level.
Overall, the results strongly indicate that interactions between currency markets have
different characteristics at different timescales and that there is no “global causal behavior” that
prevails at all time horizons. When the nonlinear effects are accounted for, neither FX market leads
or lags the other consistently, namely the pattern of leads and lags changes over time. Given that
causality can vary from one direction to the other, a finding of bidirectional causality over the
sample period may be taken to imply a changing pattern of leads and lags over time. In particular,
market participants filter information relevant to their positions as new information arrives and, at
any time point, one FX market may lead the other and vice versa.
An interesting subject for future research is the nature and source of the nonlinear linkages,
as it was shown that volatility effects might partly induce nonlinear causality. Conditional
volatility or statistically significant higher-order moments may account for a part of the
nonlinearity in daily exchange rates, but only in some cases as it is already known by many
studies, including Scheinkman and LeBaron (1989) and Bekiros and Diks (2008). In general, the
detailed knowledge of the nature of interdependency between the currency markets and the
degree of their integration at different timescales will expand the information set available to
practitioners and policymakers. The results of this study, apart from offering a much better
understanding of the dynamic heterogeneous relationships underlying the major currency
26
markets, may have important implications for market efficiency. In that, they may be useful in
future research to quantify the process of financial integration or may influence the greater
predictability of these markets.
REFERENCES
Allan, D.W., 1966, Statistics of atomic frequency standards. Proceedings of the IEEE, 54 (2), 221-230.
Andersen, T.G. and T. Bollerslev, 1997, Intraday periodicity and volatility persistence in financial
markets. Journal of Empirical Finance 4 (2-3), 115-158.
Baek, E. and W. Brock, 1992, A general test for non-linear Granger causality: bivariate model.
Working paper, Iowa State University and University of Wisconsin, Madison, WI.
Bai, J. and P., Perron, 2003, Computation and analysis of multiple structural change models. Journal
of Applied Econometrics 18, 1-22.
Bank for International Settlements (BIS), 2007, Triennial Central Bank Survey: Foreign exchange and
derivatives market activity in 2007. Basel, Switzerland.
Bessler, D.A. and J. Yang, 2003, The structure of interdependence in international stock markets.
Journal of International Money and Finance 22, 261–287.
Bekiros S.D. and C.G.H. Diks, 2008, The nonlinear dynamic relationship of exchange rates:
Parametric and nonparametric causality testing. Journal of Macroeconomics 30, 1641-1650
Bénassy-Quéré, A. and A. Lahrèche-Révil, 2000, The Euro as a Monetary Anchor in the CEECs. Open
Economies Review 11 (4), 303-321.
Black, F., 1986, Noise. Journal of Finance 41, 529–543.
Blackstone, B., T. Lauricella and N. Shah, 2010, Global Markets Shudder: Doubts About U.S.
Economy and a Debt Crunch in Europe Jolt Hopes for a Recovery. The Wall Street Journal.
Blanchard, O. and M.W. Watson, 1982, Bubbles, rational expectations, and financial markets. in: Wachtel,
P. (Ed.) Crises in the economic and financial structure, Lexington, MA.
Breitung, J. and B. Candelon, 2006, Testing for short- and long-run causality: A frequency-domain
approach. Journal of Econometrics 132, 363-378.
Brown, R.L., J. Durbin and J.M. Evans, 1975, Techniques for Testing the Constancy of Regression
Relationships over Time. Journal of the Royal Statistical Society 37, 149–192.
Chow, G.C., 1960, Tests of Equality between Sets of Coefficients in Two Linear Regressions.
Econometrica 28, 591–605.
Cohen, A., I. Daubechies, B. Jawerth and P. Vial, 1993, Multiresolution analysis, wavelets and fast
wavelet transform on an interval. CRAS Paris 316, 417–421.
Coifman, R.R. and M.V. Wickerhauser, 1992, Entropy-based algorithms for best basis selection. IEEE
Transaction on Information Theory 38 (2), 713–718.
Coifman, R.R. and D. Donoho, 1995, Time-invariant wavelet de-noising, in: A. Antoniadis and G.
Oppenheim, (Eds.), Lecture Notes in Statistics, Vol. 103, Springer-Verlag, NY, pp. 125-150.
Daubechies, I., 1992, Ten Lectures on Wavelets. Regional Conference Series in Applied Mathematics
(SIAM), Society for Industrial and Applied Mathematics Vol. 61. Philadelphia, USA.
Engel, C., and K.D. West, 2005, Exchange Rates and Fundamentals. Journal of Political Economy 113,
485-517
27
Engle, R.F., T. Ito and W-L. Lin, 1990, Meteor showers or heat waves? Heteroskedastic intra-daily
volatility in the foreign exchange market. Econometrica 58(3), 525–542.
Flood, R.P. and P. Isard, 1989, Monetary policy strategies. IMF Staff Papers 36, 612–632.
Friedman, M., 1963, Windfalls, the ‘horizon,’ and related concepts in the permanent-income hypothesis, in:
Measurement in Economics: Studies in Mathematical Economics and Econometrics in Memory of Yehuda
Grunfeld, Carl Christ, et al. (Eds.), Stanford, CA, pp. 3–28.
Genҫay, R., B. Whitcher and F. Selҫuk, 2001, Differentiating intraday seasonalities through wavelet
multi-scaling. Physica A 289(3-4), 543–556.
, 2002, An introduction to wavelets and other filtering methods
in finance and economics. Academic Press, San Diego.
Geweke, J., 1982, Measurement of linear dependence and feedback between multiple time series.
Journal of the American Statistical Association 77, 304–324.
Granger, C.W.J., 1969, Investigating causal relations by econometric models and cross-spectral
methods. Econometrica 37(3), 424-438.
Greenspan, A., 2007, The Age of Turbulence: Adventures in a New World. Penguin Press.
Grossmann, A. and J. Morlet, 1984, Decomposition of hardy functions into square integrable
wavelets of constant shape. SIAM J Math Anal 15(4), 723–736.
Herley, C., 1995, Boundary filters for finite-length signals and time-varying filter banks. IEEE
Transactions on Circuits and Systems-II 42, 102-114.
Hiemstra, C. and J.D. Jones, 1994, Testing for linear and nonlinear Granger causality in the stock
price-volume relation. Journal of Finance 49(5), 1639-1664.
Hogan, W.P. and I.G. Sharpe, 1984, On the relationship between the NY closing spot U.S.$/$A rate
and the reserve bank of Australia’s official rate. Economics Letters 14(1), 73–79.
Hosoya, Y., 1991, The decomposition and measurement of the interdependency between secondorder stationary processes. Probability Theory and Related Fields 88(4), 429–444.
Ito, T. and V.V. Roley, 1987, News from the U.S. and Japan: which moves the yen/dollar exchange
rate? Journal of Monetary Economics 19(2), 225–277.
Jensen, J. and B. Whitcher, 2000, Time-Varying Long-Memory in Volatility: Detection and Estimation
with Wavelets. Technical Report, Department of Economics, University of Missouri.
Johansen, S., 1988, Statistical analysis of cointegration vectors. Journal of Economic Dynamics and
Control 12(2-3), 231–254.
Johansen, S. and K. Juselius, 1990, Maximum likelihood estimation and inference on cointegration
with application to the demand for money. Oxford Bulletin of Economics and Statistics 52, 169–209.
Koutmos, G. and G.G. Booth, 1995, Asymmetric volatility transmission in international stock
markets. Journal of International Money and Finance 14(6), 747–762.
Krugman, P., 1991, Target zones and exchange rate dynamics. The Quarterly Journal of Economics 106,
669–682.
, 2009, The Return of Depression Economics and the Crisis of 2008. Norton.
Laopodis, N., 1997, US dollar asymmetry and exchange rate volatility. Journal of Applied Business
Research 13(2), 1–8.
, 1998, Asymmetric volatility spillovers in deutschemark exchange rates. Journal of
Multinational Financial Management 8, 413–430.
Lindberg, H. and P. Soderlind, 1994, Testing the Basic Target Zone Models on the Swedish Data
1982–1990. European Economic Review 38, 1441–1469.
Ma, Y. and A. Kanas, 2000, Testing for a nonlinear relationship among fundamentals and exchange
rates in the ERM. Journal of International Money and Finance 19(1), 135–152.
28
MacKinnon, J., Haug, A. and L. Michelis, 1999, Numerical Distribution Functions of Likelihood Ratio
Tests for Cointegration. Journal of Applied Econometrics 14, 563-577.
Mallat, S., 1989, A theory for multiresolution signal decomposition: the wavelet representation. IEEE
Transactions on Pattern Analysis and Machine Intelligence 11(7), 674–693.
McConnell, M.M. and G. Perez-Quiros, 2000, Output fluctuations in the United States: What has
changed since the early 1980s? American Economic Review 90, 1464–1476.
Meese, R.A. and K. Rogoff, 1983, Empirical exchange rate models of the seventies: Do they fit out of
sample?. Journal of International Economics 14(1-2), 3–24.
Nason, G. P. and B. W. Silverman, 1995, The Stationary Wavelet Transform and Some Statistical
Applications, in: A. Antoniadis and G. Oppenheim (Eds.), Wavelets and Statistics, New York: Springer, pp.
281-299.
Newey, W.K. and K.D. West, 1994, Automatic Lag Selection in Covariance Matrix Estimation. Review of
Economic Studies 61(4), 631-653
Nikkinen, J., P. Sahlström and S. Vähämaa, 2006, Implied volatility linkages among major European
currencies. Journal of International Financial Markets, Institution & Money 16, 87–103.
Ogden, R.T., 1997, On preconditioning the data for the wavelet transform when the sample size is
not a power of two. Communications in Statistics-Simulation & Computation 26, 467–486.
Percival, D.B., 1995, On estimation of the wavelet variance. Biometrika 82(3), 619-631.
Percival, D.B. and H.O. Mofjeld, 1997, Analysis of subtidal coastal sea level fluctuations using
wavelets. Journal of the American Statistical Association 92, 868-880.
Pesquet, J.-C., H. Krim and H. Carfantan, 1996, Time-invariant orthonormal wavelet representations.
IEEE Transactions on Signal Processing 44, 1964-1970.
Ramsey, J.B. and C. Lampart, 1998b, The decomposition of economic relationships by time scale
using wavelets: Expenditure and income. Studies in Nonlinear Dynamics & Econometrics 3(1), 23-42.
Scheinkman, J., and B. LeBaron, 1989, Nonlinear Dynamics and Stock Returns. The Journal of Business
62 (3), 311-337.
Stock J. and M. Watson, 2003, Has the Business Cycle Changed and Why?, NBER Macroeconomics
Annual 17, 159-218
van Dijk, D., D.R. Osborn and M. Sensier, 2005, Testing for causality in variance in the presence of
breaks. Economics Letters 89(2), 193–199.
Wang, Z., J. Yang and Q. Li, 2007, Interest rate linkages in the Eurocurrency market:
Contemporaneous and out-of-sample Granger causality tests. Journal of International Money and Finance 26(1),
86-103.
Yao, F. and Y. Hosoya, 2000, Inference on one-way effect and evidence in Japanese macroeconomic
data. Journal of Econometrics 98(2), 225–255.
Zivot, E., and D. Andrews, 1992, Further Evidence on the Great Crash, the Oil-Price Shock, and the
Unit-Root Hypothesis. Journal of Business and Economic Statistics 10(3), 251–270.
29
TABLE 1: DESCRIPTIVE STATISTICS
Return Statistics
Statistic
PTotal
0.000
0.007
0.128
4.447
266.18*
12.23
Mean
Std. dev.
Skewness
Kurtosis
JB test
Q(12)
EUR
P1
0.000
0.006
0.074
3.722
48.10*
14.85
P2
0.000
0.007
0.218
5.255
184.28*
10.74
GBP
P1
0.000
0.005
-0.008
3.640
36.28*
9.24
PTotal
0.000
0.006
-0.379
5.844
1068.48*
11.38
P2
0.000
0.008
-0.553
5.547
269.17*
16.52
JPY
P1
0.000
0.006
-0.017
5.576
586.74*
4.64
PTotal
0.000
0.007
-0.105
6.742
1732.19*
33.91*
P2
0.000
0.008
-0.141
6.456
419.92*
37.05*
Return Correlation matrix
EUR
GBP
JPY
EUR
1
0.658
-0.232
PTotal
GBP
JPY
1
-0.126
1
EUR
1
0.681
-0.356
P1
GBP
JPY
1
-0.345
1
EUR
1
0.638
-0.039
P2
GBP
JPY
1
0.139
1
Volatility Statistics
Statistic
PTotal
0.005
0.004
1.588
6.669
2905.27*
365.54*
Mean
Std. dev.
Skewness
Kurtosis
JB test
Q(12)
EUR
P1
0.005
0.004
1.293
4.862
898.17*
53.49*
P2
0.005
0.005
1.883
7.852
1317.05*
378.50*
PTotal
0.004
0.004
2.042
10.191
8433.69*
832.81*
GBP
P1
0.004
0.003
1.250
4.720
814.37*
42.96*
P2
0.006
0.005
1.978
8.316
1532.96*
465.12*
PTotal
0.005
0.005
2.270
12.650
14027.58*
403.88*
JPY
P1
0.005
0.004
1.955
9.995
5678.53*
60.91*
P2
0.006
0.006
2.207
11.409
3149.45*
209.61*
Volatility Correlation matrix
EUR
GBP
JPY
EUR
1
0.485
0.291
PTotal
GBP
JPY
1
0.307
1
EUR
1
0.483
0.226
P1
GBP
JPY
1
0.198
1
Notation: The FX exchange returns are defined as
returns
=
EUR
1
0.495
0.376
=
P2
GBP
JPY
1
0.387
1
( )−
(
−
),
where
is the closing level on day
, while the volatility series as the absolute value of the
as in Jensen and Whitcher (2000) and Gencay et al. ( 2002). (*) denotes significance at 5% confidence level. The periods are P1: 01/05/1999-02/21/2007, P2: 02/22/2007-
05/10/2010 and PTotal: 01/05/1999-05/10/2010.
30
TABLE 2: UNIT ROOT AND COINTEGRATION TESTS
Unit Root tests
Periods
Variables
EUR
GBP
JPY
Pt
rt
ut
Pt
rt
ut
Pt
rt
ut
P1
PTotal
ADF
ADFτ
ADFc
0.76
0.23
0.00*
0.00*
0.00*
0.00*
0.65
0.94
0.00*
0.00*
0.00*
0.00*
0.57
0.49
0.00*
0.00*
0.00*
0.00*
PP
PPc
0.77
0.00*
0.00*
0.65
0.00*
0.00*
0.49
0.00*
0.00*
ADF
ADFc
ADFτ
0.88
0.21
0.00*
0.00*
0.00*
0.00*
0.87
0.43
0.00*
0.00*
0.00*
0.00*
0.21
0.49
0.00*
0.00*
0.00*
0.00*
PPτ
0.23
0.00*
0.00*
0.93
0.00*
0.00*
0.44
0.00*
0.00*
P2
PP
PPc
0.87
0.00*
0.00*
0.88
0.00*
0.00*
0.23
0.00*
0.00*
PPτ
0.17
0.00*
0.00*
0.42
0.00*
0.00*
0.53
0.00*
0.00*
ADF
ADFc
ADFτ
0.49
0.77
0.00*
0.00*
0.00*
0.00*
0.87
0.77
0.00*
0.00*
0.00*
0.00*
0.52
0.18
0.00*
0.00*
0.00*
0.00*
PP
PPc
0.47
0.00*
0.00*
0.87
0.00*
0.00*
0.51
0.00*
0.00*
PPτ
0.76
0.00*
0.00*
0.76
0.00*
0.00*
0.25
0.00*
0.00*
Cointegration tests
Pair
Trace statistic
X
Y
EUR
EUR
GBP
GBP
JPY
JPY
PTotal
Maximum Eigenvalue statistic
P1
P2
PTotal
P1
P2
=
≤
=
≤
=
≤
=
≤
=
≤
=
≤
0.993
0.473
0.641
0.489
0.152
0.243
0.501
0.622
0.825
0.343
0.716
0.800
0.766
0.801
0.645
0.457
0.319
0.362
0.997
0.629
0.729
0.489
0.152
0.243
0.004
0.551
0.769
0.343
0.716
0.800
0.761
0.851
0.665
0.459
0.319
0.362
Notation: Price variables are in logarithms and reported numbers for the augmented Dickey–Fuller (ADF) and Phillips-Perron (PP) test are p-values (both are one-sided tests of the null
hypothesis that the variable has a unit root). The index c indicates that the test allows for a constant, while τ for a constant and a linear trend. The number of lags for the ADF was selected
using the Schwarz information criterion. The lag truncation for the PP test was selected using Newey and West (1994) automatic selection with Bartlett kernel. Reported numbers for the
trace and max. eigenvalue statistics are the MacKinnon-Haug-Michelis (1999) p-values. (*) denotes significance at 1% confidence level. The periods are P1: 01/05/1999-02/21/2007, P2:
02/22/2007-05/10/2010 and PTotal: 01/05/1999-05/10/2010
31
TABLE 3: OPTIMAL MINIMUM-ENTROPY DECOMPOSITION
WL
Level
Raw
1
2
3
4
5
6
7
8
9
10
EUR/USD
PTotal
Returns
P1
P2
1.172
0.780
0.392
1.274
1.164
1.215
1.066
GBP/USD
PTotal
Volatility
P1
PTotal
Returns
P1
P2
1.172
0.780
0.392
P2
0.966
0.538
0.428
0.436
0.551
0.508
0.187
0.998
0.789
1.074
0.458
0.519
0.964
0.422
0.474
0.475
0.181
1.033
0.441
0.184
0.988
1.180
1.109
0.383
1.071
0.859
0.523
0.454
0.387
0.141
0.456
0.448
0.174
1.291
1.044
0.546
0.507
0.414
0.207
0.995
0.942
0.298
1.345
0.920
0.956
1.468
0.719
0.887
2.514
0.708
1.796
1.199
1.256
0.724
8.573
3.789
5.997
2.353
1.433
1.111
10.867
5.428
8.625
USD/JPY
PTotal
Volatility
P1
PTotal
Returns
P1
P2
0.966
0.538
0.428
P2
1.228
0.726
0.502
0.446
0.429
0.328
1.228
0.726
0.502
0.199
1.404
1.131
0.628
0.616
0.528
0.248
0.768
0.503
0.453
0.698
0.530
0.448
0.357
0.205
1.318
1.087
0.564
0.622
0.522
0.264
0.364
0.204
1.278
1.057
0.567
0.575
0.519
0.277
1.062
0.777
0.523
1.000
0.863
0.435
0.392
0.289
0.193
1.091
1.057
0.549
0.683
0.474
0.293
0.424
0.295
0.227
1.283
1.012
0.494
0.717
0.483
0.404
0.942
0.912
0.715
0.456
0.367
0.573
0.494
0.287
0.962
0.954
0.506
1.261
0.673
0.898
0.407
1.377
0.643
1.203
0.985
0.622
0.201
2.238
1.525
1.480
1.623
0.533
1.224
3.009
0.512
2.500
1.200
1.347
0.328
2.110
1.707
1.039
1.244
0.788
0.803
7.482
1.320
6.648
1.116
0.937
0.379
4.308
2.964
4.160
1.629
0.939
1.296
15.405
2.029
9.764
1.536
0.974
0.379
7.491
2.914
6.820
PTotal
Volatility
P1
P2
Notation: Shaded numbers report the corresponding optimal level of decomposition for each time series. It indicates the minimum value of the Shannon entropy criterion for the wavelet
details and -th level approximation.
32
TABLE 4: TRANSLATION/CONVERSION OF WAVELET SCALES INTO TIME HORIZONS
WL Scale
Days
Weeks
Time Horizons
Months
Quarters
Years
2-4
4-8
0.8-1.6
8-16
1.6-3.2
16-32
3.2-6.4
0.8-1.6
32-64
6.4-12.8
1.6-3.2
0.5-1.1
64-128
12.8-25.6
3.2-6.4
1.1-2.1
128-256
25.6-51.2
6.4-12.8
2.1-4.3
0.5-1.1
Notation: Each scale of the SIDWT corresponds to a frequency interval, or conversely an interval of periods, and thus each scale is associated with a range of time horizons. The time
horizons are expressed in base units (daily frequency) as follows: Week=5 trading days, Month=20 trading days, Quarter=60 trading days, Year=240 trading days.
33
TABLE 5: CAUSALITY RESULTS (RETURN SERIES)
EUR ↔ GBP
WL
Component
Spectral
Causality
X→Y
Y→X
EUR ↔ JPY
Linear Granger
Causality
X→Y
Y→X
NonLinear
Causality
X→Y
Spectral
Causality
Y→X
X→Y
GBP ↔ JPY
Linear Granger
Causality
Y→X
X→Y
Y→X
NonLinear
Causality
X→Y
Spectral
Causality
Y→X
X→Y
Linear Granger
Causality
Y→X
X→Y
NonLinear
Causality
Y→X
X→Y
Y→X
PTotal P1 P2 PTotal P 1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P 1 P2 PTotal P1 P2 PTotal P 1 P2 PTotal P1 P2 PTotal P1 P2 P Total P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P 1 P2 PTotal P 1 P2 PTotal P1 P2
Raw
A7
*
*
* **
** * ** **
** ** ** ** ** ** ** ** *
D1
*
D2
** **
D3
*
D4
**
** ** **
*
**
** ** ** **
**
*
* **
*
**
** ** ** ** ** ** ** **
*
*
**
**
** **
*
*
**
** ** ** ** ** **
** **
** *
**
**
**
** **
** **
**
* ** **
** **
** ** * **
**
* ** ** * **
*
*
**
** **
**
** ** * **
**
**
*
**
* ** **
* **
**
**
** ** * **
*
**
* ** **
**
** **
** **
** ** * **
*
**
**
** ** * **
**
** **
**
* **
*
**
** ** * **
D5
*
*
**
* ** **
**
**
* ** **
**
** **
*
**
** **
**
D6
*
*
**
* ** **
**
**
* ** **
**
** **
*
**
** **
**
D7
*
**
**
* ** **
**
**
* ** **
**
** **
*
**
** **
**
*
*
*
* **
**
**
Notation: X→Y: rx does not Granger cause ry. Statistical significance represents 5% (*) and 1% (**). The foreign exchange rates Euro (EUR), Great Britain Pound (GBP) and Japanese Yen
(JPY) are denoted relative to United States dollar (USD). The exact ratios represent EUR/USD, GBP/USD and USD/JPY respectively. The periods are P1: 01/05/1999-02/21/2007, P2:
02/22/2007-05/10/2010 and PTotal: 01/05/1999-05/10/2010. The spectral causality is tested only on the raw series as in Breitung and Candelon (2006). For all pairs the Johansen tests did not
identified any cointegrating vectors and the null of no cointegration was not rejected (Table 2). Thus, linear and spectral causality are investigated with a VAR representation. The results
from the SIC criterion, taking into consideration many lag specifications for the bivariate VAR modelling, as in Engel and West (2005), indicate in most of the cases two lags for the FX
return series and their wavelet components in all periods. Finally, for the nonlinear causality test in what follows, the common lag lengths used are ℓ
applied on the VAR residuals derived from the pairwise linear causality testing and the distance measure is set to ε =
34
= ℓ
=
. The nonlinear test is
, as suggested by Hiemstra and Jones (1994).
TABLE 6: CAUSALITY RESULTS (VOLATILITY SERIES)
EUR ↔ GBP
WL
Component
Spectral
Causality
X→Y
Y→X
Linear Granger
Causality
X→Y
Y→X
EUR ↔ JPY
NonLinear
Causality
X→Y
Y→X
Spectral
Causality
X→Y
Y→X
Linear Granger
Causality
X→Y
Y→X
GBP ↔ JPY
NonLinear
Causality
X→Y
Y→X
Spectral
Causality
X→Y
Y→X
Linear Granger
Causality
X→Y
Y→X
NonLinear
Causality
X→Y
Y→X
PTotal P1 P 2 PTotal P1 P2 PTotal P1 P2 P Total P1 P 2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P 1 P 2 PTotal P1 P2 PTotal P 1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 PTotal P1 P2 P Total P1 P2 PTotal P1 P2
Raw
A4
**
** **
** **
** **
** **
** *
**
** ** ** ** ** ** ** * ** ** ** *
D1
** ** ** ** ** ** **
*
** **
** **
** ** ** ** ** ** ** **
** **
*
*
**
*
** ** ** ** **
**
** ** ** ** ** **
**
**
*
** ** ** ** **
**
D3
** * ** **
**
**
** **
** ** * ** **
**
D4
** * ** **
**
** ** **
** **
**
D2
**
** **
* **
** **
**
** **
** ** ** ** ** ** ** **
**
*
** ** ** **
*
*
*
** **
**
** * ** **
**
** * ** **
**
** **
**
**
*
Notation: X→Y: rx does not Granger cause ry. Statistical significance represents 5% (*) and 1% (**). The foreign exchange rates Euro (EUR), Great Britain Pound (GBP) and Japanese Yen
(JPY) are denoted relative to United States dollar (USD). The exact ratios represent EUR/USD, GBP/USD and USD/JPY respectively. The periods are P1: 01/05/1999-02/21/2007, P2:
02/22/2007-05/10/2010 and PTotal: 01/05/1999-05/10/2010. The spectral causality is tested only on the raw series as in Breitung and Candelon (2006). For all pairs the Johansen tests did not
identified any cointegrating vectors and the null of no cointegration was not rejected (Table 2). Thus, linear and spectral causality are investigated with a VAR representation. The results
from the SIC criterion, taking into consideration many lag specifications for the bivariate VAR modelling, as in Engel and West (2005), indicate in most of the cases four lags for the FX
volatility series and their wavelet components in all periods. Finally, for the nonlinear causality test in what follows, the common lag lengths used are ℓ
applied on the VAR residuals derived from the pairwise linear causality testing and the distance measure is set to ε =
35
= ℓ
=
. The nonlinear test is
, as suggested by Hiemstra and Jones (1994).
**
** **
**
**
** **
TABLE 7: CROSS-SCALE CORRELATION RESULTS (RETURN AND VOLATILITY SERIES)
Returns
WL
Component
PTotal
EUR - GBP
Cross-correlation
P1
P2
PTotal
EUR - JPY
Cross-correlation
P1
P2
PTotal
GBP - JPY
Cross-correlation
P1
P2
Raw
0.658
0.681
0.638
-0.232
-0.356
-0.039
-0.126
-0.345
0.139
A7
0.831
0.831
0.891
-0.289
-0.465
-0.026
-0.085
-0.624
0.372
D1
0.671
0.689
0.652
-0.238
-0.389
-0.001
-0.144
-0.383
0.155
D2
0.654
0.675
0.635
-0.227
-0.362
-0.020
-0.137
-0.354
0.129
D3
0.655
0.676
0.638
-0.230
-0.358
-0.032
-0.125
-0.344
0.140
D4
0.655
0.677
0.638
-0.225
-0.353
-0.026
-0.122
-0.342
0.143
D5
0.655
0.677
0.637
-0.229
-0.353
-0.038
-0.125
-0.341
0.137
D6
0.656
0.679
0.636
-0.231
-0.353
-0.040
-0.128
-0.343
0.134
D7
0.656
0.680
0.634
-0.231
-0.354
-0.039
-0.127
-0.343
0.137
Volatility
WL
Component
Raw
PTotal
EUR - GBP
Cross-correlation
P1
P2
0.485
0.483
0.495
PTotal
EUR - JPY
Cross-correlation
P1
P2
0.291
0.226
0.375
PTotal
GBP - JPY
Cross-correlation
P1
P2
0.307
0.198
0.387
A4
0.768
0.533
0.921
0.585
0.302
0.746
0.569
0.076
0.672
D1
0.382
0.475
0.246
0.172
0.201
0.121
0.207
0.183
0.241
D2
0.408
0.479
0.306
0.210
0.208
0.219
0.229
0.199
0.267
D3
0.418
0.479
0.333
0.218
0.204
0.247
0.237
0.199
0.287
D4
0.424
0.478
0.349
0.235
0.216
0.273
0.248
0.208
0.302
Notation: Reported values indicate the wavelet cross-correlation between all pairs of exchange rate returns and volatility.
36
FIGURE 1: WAVELET MULTI-SCALE ANALYSIS (EUR RETURNS)
Raw Signal
0.05
0
-0.05
500
1000
1500
-3
2
0
-2
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 7
x 10
500
1000
1500
Detail 7
0.05
0
-0.05
500
1000
1500
Detail 6
0.05
0
-0.05
500
1000
1500
Detail 5
PTotal
0.05
0
-0.05
500
1000
1500
Detail 4
0.05
0
-0.05
500
1000
1500
Detail 3
0.05
0
-0.05
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.05
0
-0.05
500
1000
1500
Raw Signal
0.05
0
-0.05
1
0
-1
200
x 10
400
600
800
-3
1000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 7
200
400
600
800
1000
Detail 7
0.05
0
-0.05
200
400
600
800
1000
Detail 6
0.05
0
-0.05
200
400
600
800
1000
Detail 5
P1
0.05
0
-0.05
200
400
600
800
1000
Detail 4
0.05
0
-0.05
200
400
600
800
1000
Detail 3
0.05
0
-0.05
200
400
600
800
1000
Detail 2
0.05
0
-0.05
200
400
600
800
1000
Detail 1
0.05
0
-0.05
200
400
600
800
1000
Raw Signal
0.05
0
-0.05
2
0
-2
100
x 10
200
300
-3
400
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 7
100
200
300
400
Detail 7
0.05
0
-0.05
100
200
300
400
Detail 6
0.05
0
-0.05
100
200
300
400
Detail 5
P2
0.05
0
-0.05
100
200
300
400
Detail 4
0.05
0
-0.05
100
200
300
400
Detail 3
0.05
0
-0.05
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.05
0
-0.05
100
200
300
400
Notation: The results of SIDWT (db8) multiresolution wavelet analysis include the
approximation
. The raw signal is also displayed.
37
−
wavelet details and the 7-th level
FIGURE 2: WAVELET MULTI-SCALE ANALYSIS (EUR VOLATILITY SERIES)
Raw Signal
0.04
0.02
0
500
1000
1500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 4
0.02
0.01
0
500
1000
1500
Detail 4
0.05
0
PTotal
-0.05
500
1000
1500
Detail 3
0.05
0
-0.05
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.02
0
-0.02
500
1000
1500
Raw Signal
0.04
0.02
0
200
400
600
800
1000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 4
0.01
0.005
0
200
400
600
800
1000
Detail 4
0.02
0
P1
-0.02
200
400
600
800
1000
Detail 3
0.02
0
-0.02
200
400
600
800
1000
Detail 2
0.02
0
-0.02
200
400
600
800
1000
Detail 1
0.02
0
-0.02
200
400
600
800
1000
Raw Signal
0.04
0.02
0
100
200
300
400
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 4
0.02
0.01
0
100
200
300
400
Detail 4
0.05
0
P2
-0.05
100
200
300
400
Detail 3
0.05
0
-0.05
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.02
0
-0.02
100
200
300
400
Notation: The results of SIDWT (db8) multiresolution wavelet analysis include the
approximation
. The raw signal is also displayed.
38
−
wavelet details and the 4-th level
FIGURE 3: DIAGRAMMATICAL REPRESENTATION OF DIRECTIONAL CAUSALITIES (RETURNS)
PTotal
P1
P2
EUR
EUR
EUR
Raw
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
A7
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D1
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D2
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D3
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D4
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D5
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D6
GBP
JPY
GBP
JPY
GBP
EUR
EUR
JPY
EUR
D7
GBP
JPY
GBP
39
JPY
GBP
JPY
FIGURE 4: DIAGRAMMATICAL REPRESENTATION OF DIRECTIONAL CAUSALITIES (VOLATILITY)
PTotal
P1
P2
EUR
EUR
EUR
Raw
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
A4
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D1
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D2
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D3
GBP
JPY
GBP
EUR
JPY
GBP
EUR
JPY
EUR
D4
GBP
JPY
GBP
JPY
GBP
JPY
Notation (Figure 3 and Figure 4):
The light grey line represents spectral casual relationships; the dotted and solid lines depict linear and nonlinear causal linkages
respectively.
denote unidirectional and bi-directional causality corresponding to 5%
≤ p-value < 1%
≤ 1%
denote unidirectional and bi-directional causality corresponding to p-value
40
APPENDIX I: WAVELET ANALYSIS
I1. LITERATURE REVIEW
The interest in economic applications of wavelets emerged in the mid-90s, mostly
stimulated by the work of Ramsey and his collaborators. Ramsey and Lampart (1998a, 1998b) used
a wavelet-based scaling method to investigate the relationship and causality between money,
income and expenditure. Further work on long memory processes and fractional integration in
financial data can be found in Greenblatt (1998) and Jensen (2000). Davidson et al. (1998) used
wavelets in introducing a semiparametric approach for analysing commodity prices. Wong et al.
(2003) provided an example of using wavelets in forecasting exchange rates, wherein other
conventional time series models were also used in order to compare against the wavelet-based
methodology. In recent works, Almasri and Shukur (2003) address the causal relation between
spending and revenue at different timescales, while Gençay et al. (2002) look into dependencies
between growth and inflation. Fernandez (2005) deals with the estimation of systematic asset risk.
Finally, it is also worth mentioning a stream of papers utilizing wavelet methodology to address
theoretical econometric issues, such as Pan and Wang (2000), Stengos and Sun (2001), Lee and
Hong (2001), Hong and Kao (2004) and Fan and Gençay (2010).
In economics the notion of timescale is related to time period segmentation and the
examined relationships are described as short-run and long-run, or broadly under the term scaling
laws (Brock, 1999). The scale decomposition often reveals the presence of deterministic regularities
or statistical properties of the conditional moments that are seemingly independent of the scale
details. However, in the wavelet literature the concept of time scaling is quite different from that in
economics. Based on the selected function space, the time series are analysed into “fine” and
“coarse” resolution components, namely into low- and high-frequency parts of a signal
respectively. Although at first sight timescale could directly correspond to frequency there is only
an indirect connection between these two concepts, as indicated by Priestley (1996). Intuitively, in a
naïve interpretation, wide-support wavelets can be associated with low frequencies, while highfrequency analysis can be provided by narrow-support components. However, the link between
1
scale and frequency can be directly interpretable only when the data is stationary. In general, a
multiscale decomposition could be estimated by a bank of filters with varying frequencies and
widths. Yet, selecting the proper filters and their parameters so as not to discard important features
of the underlying series is a difficult, subjective task, lacking a solid methodology. The wavelet
analysis provides a sound mathematical framework for designing filters which eventually provide
an adaptive partition of the time-frequency domain.
Fourier and wavelet methods involve the projection of a signal onto an orthonormal set of
components. Fourier projections are most naturally defined for functions restricted to
the set of square integrable functions in the interval
(
(
π ) i.e.,
π ) . Based on the complex superposition of
individual harmonics, the hypothesis is that over any segment of the time series the exact same
frequencies hold at the same amplitudes, namely the signal is homogeneous over time. On the
(ℝ )
contrary, the basis functions in wavelet analysis are defined in
and are not necessarily
homogeneous over time, meaning that they have narrow compact support so that they rapidly
converge to zero as time approaches infinity. The most widely used classes of wavelets are the
orthogonal ones namely the haar wavelets, daubechies, symlets and coiflets (Percival and Walden,
2000).
I2. TECHNICAL OVERVIEW
I2.1 Preliminaries
A function ψ ( ) that is real-valued and continuous such that
∞
∫−∞ ψ ( )
length
=
=(
defines a wavelet. Considering that
−
∞
∫−∞ ψ ( )
=
and
) is a finite wavelet filter with
, the properties of continuous wavelet functions such as integration to zero and unit
−
energy, in discrete time are equivalently given by
∑
=
=
−
and
∑
=
= . If
=(
−
)
denotes the complement low-pass (scaling) filter of the wavelet (high-pass) filter then according to
Gençay et al. (2002) and Percival and Walden (2000), the scaling filter coefficients are estimated
2
based on the quadrature mirror relationship
−
properties of the scaling filter are
∑
)
and
∑
=
. The
×
for a time series
orthonormal matrix
= . The
={
}=
−
=
and
-length vector of the
with dyadic length
(
=
) is obtained as
defines the Discrete Wavelet Transform (DWT). The
=
is a
− . The
=
vector of wavelet coefficients can be further decomposed into
where
=
− −
−
=
=
wavelet coefficients
+
= (−
+
vectors
(I.1)
-length vector of wavelet coefficients corresponding to the scale of length
is a
-length vector of scaling coefficients associated with scale
. The
matrix comprises the wavelet and scaling filter coefficients on a row-by-row representation.
Hereby, the vector of zero-padded unit scale wavelet filter coefficients is defined in reverse order
=
by
()
=
−
. If
−
−
−
etc., then the
circularly shifted versions of
of
matrices
×
, namely
matrix
. Additionally,
(McCoy and Walden, 1996). The
. From matrix
is defined as the collection
() ()
=
are defined by circularly shifting the vector
wavelet filter coefficients) by factors of
equal to
is circularly shifted by factors of two e.g.,
(
−
)
. In general,
(the vector of zero-padded scale
is a column vector with all elements
×
dimensional matrix
the wavelet filter coefficients for scales
computed via the Inverse Discrete Fourier Transform (IDFT).
3
is
are
I2.2 Implementation of the Discrete Wavelet Transform (DWT)
The implementation algorithm of the DWT was introduced by Mallat (1989). The time
series
is filtered using
and
, then the outputs are subsampled to half their original lengths
and the subsampled filter output from
accounts for the wavelet coefficients. This process is
repeated on the subsampled output from the
algorithm
=
begins
by
convolving
the
filter. Specifically, the first step of the pyramid
data
with
−
∑
=
=
and scaling coefficients
+ −
each
filter
to
obtain
−
∑
,
+ −
=
This also includes a downsampling operation. Consequently, the
the
wavelet
=
− .
-length vector of observations
coefficients. The second step of the algorithm
has been high-and low-pass filtered to obtain
starts by “initializing” the sample now to be the scaling coefficients
and apply the
aforementioned filtering procedure to obtain the second level of wavelet and scaling coefficients.
By saving all wavelet coefficients and the final level of scaling coefficients the decomposition
=
becomes
. This procedure is repeated up to
( )
=
times and provides the
vector of wavelet coefficients in Eq. (I.1). The inversion of the DWT is performed by upsampling
the final wavelet and scaling coefficients, convolving them with their respective filters and adding
and
the resulting vectors. Upsampling the vectors
=
vectors
−
=
−
∑
=
=
and
. Now the vector of scaling coefficients
−
+
of the final DWT level produces the new
+∑
with
+
=
=
−
is given by
and it is twice that of
. This is
repeated until the first level of all coefficients has been upsampled, in order to produce the original
vector of data observations, i.e.,
=
−
∑
=
−
+∑
+
=
+
=
The DWT results in the additive decomposition of the time series. Let
the wavelet detail corresponding to changes in the time series
4
at scale
− .
=
for the level
define
=
.
The coefficients
detail
+
represent the part of the signal attributable to scale
=
=
is equal to the sample mean of the
=
Considering that for each observation,
+
∑
. The final wavelet
observations (Gençay et al., 2002).
=
−
is the linear combination of
=
+
∑
=
wavelet detail coefficients, then
is the cumulative sum of the variations of the
= +
details defined as the
-th level wavelet approximation for
=∑
-th level wavelet rough
zeros. The
≤ ≤
,
≤ ≤
+
with
+
being a vector of
incorporates the remaining lower-
=
scale details. The time series may be decomposed as
+∑
=
=
+
(I.2)
=
Orthonormality of the matrix
the
DWT
≤ ≤
=∑
an
)
=(
=
for
is
efficient,
=
(due
+
implies, as in the case of Discrete Fourier Transform, that
to
=
variance
=
orthonormality
preserving
=
. As
of
and
),
and
an
equal
transform
=
i.e.,
apply
decomposition
is
.
=
I2.3 Wavelet classes: Haar and Daubechies
In case of a time series
=
(
)⋅(
−
−
)
and
=
={
}=
(
)⋅(
, the Haar wavelet and scaling coefficients are
+
−
)
respectively. Although the Haar filter is
easy to visualize and implement, it is inadequate for real-world applications as it provides a poor
approximation to an ideal band-pass filter (Gençay et al., 2002). Instead the Daubechies wavelets
improve the frequency domain characteristics of the Haar and also are compactly supported. In
5
general the wavelet and scaling coefficients of the Daubechies class are
=
−
∑
−
=
=
−
∑
−
=
length
and
and
=
=− −
respectively with
=
have as wavelet coefficients
+
=
.
6
. For example, the Daubechies with
−
,
=− +
,
=
+
APPENDIX II: CAUSALITY TESTS
II1. LINEAR CAUSALITY (PARAMETRIC TEST)
The linear Granger causality test (Granger, 1969) is based on a reduced-form vector
autoregression (VAR) model. If
=
ℓ
is the vector of endogenous variables and ℓ the
number of lags, the VAR( ℓ ) model is given by
ℓ
= ∑Φ
=
where Φ
−
+ε
(II.1)
is the ℓ × ℓ parameter matrix and ε the residual vector, for which
ε
ε ε = ε
. In case of two stationary time series
=
≠
{ }
and
{ }
ε =
and
the bivariate VAR
model is given by
=Φ ℓ
=Ψℓ
where Φ ℓ Χ ℓ Ψ ℓ and
+Χ ℓ
+ ℓ
+ε
+ε
=
(II.2)
ℓ are lag polynomials with roots outside the unit circle and the error
terms are i.i.d. processes with zero mean and constant variance. The test whether
Granger causes
strictly
is simply a test of the joint restriction that all coefficients of the lag polynomial
Χ ℓ are zero, whilst a test of whether
strictly Granger causes
is a test regarding Ψ ℓ . In the
unidirectional case the null hypothesis of no Granger causality is rejected if the exclusion
restriction is rejected, whereas if both Χ ℓ and Ψ ℓ joint tests for significance are different from
zero the series are bi-causally related.
However, in order to explore possible effects of cointegration a vector autoregression
model in error correction form (Vector Error Correction Model-VECM) is estimated using the
methodology developed by Engle and Granger (1987) and expanded by Johansen (1988) and
Johansen and Juselius (1990). The bivariate VECM model has the following form
= −!
= −!
−λ ⋅
−λ ⋅
−
−
−
−
+Φ ℓ
+ Ψ ℓ
+Χ ℓ
+ε
+
+ε
=
7
ℓ
(II.3)
where
−λ the cointegration row-vector and λ the cointegration coefficient. Thus, in case of
{ }
cointegrated time series
{ }
and
linear Granger causality should be investigated on Χ ℓ
and Ψ ℓ via the VECM specification.
II2. NONLINEAR CAUSALITY (NONPARAMETRIC TEST)
Let "
(
Θ−
) denote the conditional probability distribution of
set Θ − , which consists of an
-length lagged vector of
-length lagged vector of
,
testing for a given pair of lags
(
and
#
Denoting the
≡
−
−
+
≡
−
(
−
−
+
−
) and an
) . Hiemstra and Jones (1994) consider
−
the following null hypothesis
Θ − = "
(
"
−
,
given the information
)
≡(
-length lead vector of
−
(II.4)
+ −
),
for
Θ− −
+
∈
, the claim made by
Hiemstra and Jones (1994) is that the null hypothesis given in Eq. (II.4) implies for all ε >
−
$
<ε
−
=
−
{ }
For the time series of realizations
−
and
choosing a value for ε typically in
$
<ε
−
−
−
$
<ε
{ },
−
=
−
−
< ε
< ε
−
(II.5)
, the nonparametric test consists of
after unit variance normalization, and testing Eq. (II.5)
by expressing the conditional probabilities in terms of the corresponding ratios of joint
probabilities
%
(
+
%
(
)
ε ≡
)
ε ≡
%
(
+
%
(
+
−
−
−
ε) ≡
ε) ≡
+
−
−
<ε
−
<ε
+
−
−
Thus, Eq. (II.5) can be formulated as
8
−
−
−
−
−
−
−
−
+
< ε
−
< ε
−
< ε
< ε
(II.6)
%
(
+
ε
(
%
ε
)
)=% (
+
(
%
ε)
ε)
(II.7)
Using correlation-integral estimators and under the assumptions that
{ }
and
{ }
are strictly
stationary, weakly dependent and satisfy the mixing conditions of Denker and Keller (1983),
Hiemstra and Jones (1994) show that
(
%
%
(
with σ
+
ε
(
ε
)
) −% (
%
+
(
ε
)
ε
)
(
∼
σ
(
ε
))
(II.8)
)
ε as given in their appendix. One-sided critical values are used based on this
asymptotic result, rejecting when the observed value of the test statistic in Eq. (II.8) is too large.
II3. SPECTRAL CAUSALITY (FREQUENCY DOMAIN TEST)
A two-dimensional vector of time series observed at
= …
assumed that it has a finite-order VAR representation as
Θ(
)=
−Θ
− ⋯ − Θ!
!
⋅
with
=
−
−
is denoted as
Θ(
) ⋅
=ε
, and that Ε (ε ) =
. It is
where
and Ε (ε ε ′ ) = Σ
with Σ positive definite. Let & be the lower triangular matrix of the Cholesky decomposition
& ′& = Σ− such that Ε (η η ′ ) =
and η = & ε . If the system is assumed to be stationary, its MA
representation is the following
' = Ξ(
with Ξ (
( )=
)ε
−
) = Θ( )
Ψ
π
and Ψ (
(( )
−
Ξ
=
Ξ
+ Ψ
(1982) and Hosoya (1991) is
( )
( )
Ξ
Ξ
( ) ε
( ) ε
) = Ξ ( )& −
= Ψ(
)η
Ψ
=
Ψ
( )
( )
Ψ
Ψ
( ) η
( ) η
. Thus, the spectral density of
(II.9)
can be expressed as
(( ) . The measure of spectral causality suggested by Geweke
−
→
( )=
π
()
9
Ψ
( )
(−
, or equally
→
The null hypothesis that
+ Ψ
( )=
( )
(−
does not cause
( )
(−
Ψ
at frequency
and Hosoya (2000) suggested estimating
()
→
(II.10)
is given by #
(( )
−
by replacing Ψ
→
( )=
. Yao
(( )
in Eq.
−
and Ψ
(
)
measure
is
(II.10) with estimates obtained from a fitted VAR model. Considering that γ = )( Θ … Θ! Σ
represents
( )=
→
of
the
→
vector
( )+
→
of
parameters,
(γ )′ (ɵγ − γ ) +
γ
!
the
estimated
, where
−
γ
causality
(γ ) denotes the vector of derivatives
( ) . Under suitable regularity conditions the asymptotic distribution of the Wald statistic
for the null is given by * =
( )
→
(γ )′ + (γ ) γ (γ ) → χ
(( )
−
asymptotic covariance matrix of γ . However, the expression Ψ
function of the VAR parameters and
γ
, where + (γ ) is the
is a complicated nonlinear
(γ ) is difficult to evaluate (Yao and Hosoya, 2000).
Recently, Breitung and Candelon (2006) proposed a simple approach to test the null
hypothesis.
Ψ(
Considering
−
) = Θ( )
of & − and Θ (
if Θ
from
Eq.
& − and specifically Ψ
)
!
Θ
→
( )
!
( )
=
( )=
∑θ
=
=κ
−
. By denoting κ = θ
+ ⋯ + κ!
−!
+
−
+⋯+
Θ(
)
and
!
−!
+ε
10
=θ
(( ) =
−
is the
does not cause
(( ) =
−
Ψ
if
where ,
= , where θ
Hence, a necessary and sufficient set of conditions for Θ
!
( )=
) . They proved that
( )− ∑ θ
=
that
( ) = −,
the determinant of Θ (
(( ) = ∑ θ
−
(II.10)
,
then
(1,2)-element
at frequency
is the (1,2)-element of Θ .
!
is
∑θ
=
the VAR equation for
and thus the hypothesis
( )=
and
is written as
→
( )=
is
!
′
and
A cointegrating framework is also applicable if
is
equivalent to the following linear restriction Η
-(
) =
replaced by
()
()
( )
( )
(! ) .
(! )
⋯
⋯
)
= , with
=
=.
. To study the local power they considered the simple model
where the gain function of the filter .
and
-(
( ) = α
−
()
+
…
( )
−
+
is a Gegenbauer polynomial
{ } { } are mutually independent white noise processes with Ε (
)= σ
and Ε
( )= σ
.
Breitung and Candelon (2006) proved that when the frequency being tested converges to the true
frequency at a suitable rate under the sequence of local alternatives
statistic for the null Η
parameter λ = σ α
-(
( )
)
=
σ
+
=
+
, the Wald
is asymptotically distributed as non-central χ
( ) . This
with
test is used to detect causality and to
explore the short- and long-run relationships in a particular range of frequencies, which is
determined by the input data frequency.
11
APPENDIX III: ADDITIONAL FIGURES
FIGURE III.1: WAVELET MULTI-SCALE ANALYSIS (GBP RETURNS)
Raw Signal
0.05
0
-0.05
5
0
-5
500
x 10
1000
1500
-3
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 7
500
1000
1500
Detail 7
0.05
0
-0.05
500
1000
1500
Detail 6
0.05
0
-0.05
500
1000
1500
Detail 5
PTotal
0.05
0
-0.05
500
1000
1500
Detail 4
0.05
0
-0.05
500
1000
1500
Detail 3
0.05
0
-0.05
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.05
0
-0.05
500
1000
1500
Raw Signal
0.05
0
-0.05
1
0
-1
200
x 10
400
600
800
-3
1000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 7
200
400
600
800
1000
Detail 7
0.02
0
-0.02
200
400
600
800
1000
Detail 6
0.02
0
-0.02
200
400
600
800
1000
Detail 5
P1
0.02
0
-0.02
200
400
600
800
1000
Detail 4
0.02
0
-0.02
200
400
600
800
1000
Detail 3
0.05
0
-0.05
200
400
600
800
1000
Detail 2
0.02
0
-0.02
200
400
600
800
1000
Detail 1
0.02
0
-0.02
200
400
600
800
1000
Raw Signal
0.05
0
-0.05
5
0
-5
100
x 10
200
300
-3
400
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 7
100
200
300
400
Detail 7
0.05
0
-0.05
100
200
300
400
Detail 6
0.05
0
-0.05
100
200
300
400
Detail 5
P2
0.05
0
-0.05
100
200
300
400
Detail 4
0.05
0
-0.05
100
200
300
400
Detail 3
0.05
0
-0.05
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.05
0
-0.05
100
200
300
400
Notation: The results of SIDWT (db8) multiresolution wavelet analysis include the
approximation
. The raw signal is also displayed.
12
−
wavelet details and the 7-th level
FIGURE III.2: WAVELET MULTI-SCALE ANALYSIS (GBP VOLATILITY SERIES)
Raw Signal
0.04
0.02
0
500
1000
1500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 4
0.02
0.01
0
500
1000
1500
Detail 4
0.05
0
PTotal
-0.05
500
1000
1500
Detail 3
0.05
0
-0.05
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.02
0
-0.02
500
1000
1500
Raw Signal
0.04
0.02
0
200
400
600
800
1000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 4
0.01
0.005
0
200
400
600
800
1000
Detail 4
0.02
0
P1
-0.02
200
400
600
800
1000
Detail 3
0.02
0
-0.02
200
400
600
800
1000
Detail 2
0.02
0
-0.02
200
400
600
800
1000
Detail 1
0.01
0
-0.01
200
400
600
800
1000
Raw Signal
0.04
0.02
0
100
200
300
400
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 4
0.02
0.01
0
100
200
300
400
Detail 4
0.05
0
P2
-0.05
100
200
300
400
Detail 3
0.05
0
-0.05
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.02
0
-0.02
100
200
300
400
Notation: The results of SIDWT (db8) multiresolution wavelet analysis include the
approximation
. The raw signal is also displayed.
13
−
wavelet details and the 4-th level
FIGURE III.3: WAVELET MULTI-SCALE ANALYSIS (JPY RETURNS)
Raw Signal
0.1
0
-0.1
2
0
-2
500
x 10
1000
1500
-3
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 7
500
1000
1500
Detail 7
0.1
0
-0.1
500
1000
1500
Detail 6
0.1
0
-0.1
500
1000
1500
Detail 5
PTotal
0.1
0
-0.1
500
1000
1500
Detail 4
0.1
0
-0.1
500
1000
1500
Detail 3
0.1
0
-0.1
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.05
0
-0.05
500
1000
1500
Raw Signal
0.05
0
-0.05
2
0
-2
200
x 10
400
600
800
1000
-3
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 7
200
400
600
800
1000
Detail 7
0.05
0
-0.05
200
400
600
800
1000
Detail 6
0.05
0
-0.05
200
400
600
800
1000
Detail 5
P1
0.05
0
-0.05
200
400
600
800
1000
Detail 4
0.05
0
-0.05
200
400
600
800
1000
Detail 3
0.05
0
-0.05
200
400
600
800
1000
Detail 2
0.05
0
-0.05
200
400
600
800
1000
Detail 1
0.05
0
-0.05
200
400
600
800
1000
Raw Signal
0.1
0
-0.1
1
0
-1
100
x 10
200
300
400
-3
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 7
100
200
300
400
Detail 7
0.1
0
-0.1
100
200
300
400
Detail 6
0.1
0
-0.1
100
200
300
400
Detail 5
P2
0.1
0
-0.1
100
200
300
400
Detail 4
0.1
0
-0.1
100
200
300
400
Detail 3
0.1
0
-0.1
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.05
0
-0.05
100
200
300
400
Notation: As in Figure III.1
14
FIGURE III.4: WAVELET MULTI-SCALE ANALYSIS (JPY VOLATILITY SERIES)
Raw Signal
0.1
0.05
0
500
1000
1500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
2000
2500
Approximation 4
0.02
0.01
0
500
1000
1500
Detail 4
0.05
0
PTotal
-0.05
500
1000
1500
Detail 3
0.05
0
-0.05
500
1000
1500
Detail 2
0.05
0
-0.05
500
1000
1500
Detail 1
0.05
0
-0.05
500
1000
1500
Raw Signal
0.04
0.02
0
200
400
600
800
1000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
1200
1400
1600
1800
2000
Approximation 4
0.01
0.005
0
200
400
600
800
1000
Detail 4
0.05
0
P1
-0.05
200
400
600
800
1000
Detail 3
0.05
0
-0.05
200
400
600
800
1000
Detail 2
0.05
0
-0.05
200
400
600
800
1000
Detail 1
0.05
0
-0.05
200
400
600
800
1000
Raw Signal
0.1
0.05
0
100
200
300
400
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
500
600
700
800
Approximation 4
0.02
0.01
0
100
200
300
400
Detail 4
0.05
0
P2
-0.05
100
200
300
400
Detail 3
0.05
0
-0.05
100
200
300
400
Detail 2
0.05
0
-0.05
100
200
300
400
Detail 1
0.02
0
-0.02
100
200
300
400
Notation: As in Figure III.2
15
FIGURE III.5: SPECTRAL CAUSALITY (RETURN SERIES)
X→Y
X
Y→X
Y
PTotal
EUR
GBP
EUR
JPY
GBP
JPY
P1
P2
PTotal
P1
Notation: The blue (dotted) line corresponds to 95% confidence level for the Breitung and Candelon (2006) frequency domain causality test, while the red (solid) to 99%.
16
P2
FIGURE III.6: SPECTRAL CAUSALITY (VOLATILITY SERIES)
X→Y
X
Y→X
Y
PTotal
EUR
GBP
EUR
JPY
GBP
JPY
P1
P2
PTotal
Notation: As in Figure III.5
17
P1
P2
REFERENCES
(APPENDIX I AND II)
Almasri, G. and A. Shukur, 2003, An illustration of the causality relation between government
spending and revenue wavelet analysis on Finnish data. Journal of Applied Statistics 30(5), 571–584.
Baek, E. and W. Brock, 1992, A general test for non-linear Granger causality: bivariate model.
Working paper, Iowa State University and University of Wisconsin, Madison, WI.
Breitung, J. and B. Candelon, 2006, Testing for short- and long-run causality: A frequency-domain
approach. Journal of Econometrics 132, 363-378.
Brock, W, 1999, Scaling in economics: a reader's guide. Industrial and Corporate Change 8(3), 409-446.
Davidson, R., W.C. Labys and J-B. Lesourd, 1998, Wavelet analysis of commodity price behaviour.
Computational Economics 11(1–2), 103-128.
Denker, M., and G. Keller, 1983, On U-statistics and von-Mises statistics for weakly dependent
processes. Zeitschrift fur Wahrscheinlichkeitstheorie und Verwandte Gebiete 64, 505-522.
Engle, R.F. and C.W.J. Granger, 1987, Co-integration and error correction: representation, estimation,
and testing. Econometrica 55(2), 251–276.
Fan, Y. and R. Genҫay, 2010, Unit Root Tests with Wavelets. Econometric Theory 26, 1305-1331
Fernandez, V., 2005, The International CAPM and a wavelet-based decomposition of Value at Risk.
Studies of Nonlinear Dynamics & Econometrics 9(4), Article 4.
Genҫay, R., B. Whitcher and F. Selҫuk, 2002, An introduction to wavelets and other filtering methods in
finance and economics. Academic Press, San Diego.
Geweke, J., 1982, Measurement of linear dependence and feedback between multiple time series.
Journal of the American Statistical Association 77, 304–324.
Granger, C.W.J., 1969, Investigating causal relations by econometric models and cross-spectral
methods. Econometrica 37(3), 424-438.
Greenblatt, S.A., 1998, Atomic decomposition of financial data. Computational Economics 12(3), 275293.
Hiemstra, C. and J.D. Jones, 1994, Testing for linear and nonlinear Granger causality in the stock
price-volume relation. Journal of Finance 49(5), 1639-1664.
Hong, Y. and C. Kao, 2004, Wavelet-based testing for serial correlation of unknown form in panel
models. Econometrica 72(5), 1519–1563.
Hosoya, Y., 1991, The decomposition and measurement of the interdependency between secondorder stationary processes. Probability Theory and Related Fields 88(4), 429–444.
Jensen, M.J., 2000, An alternative maximum likelihood estimator of long-memory processes using
compactly supported wavelets. Journal of Economic Dynamics and Control 24(3), 361–387.
Johansen, S., 1988, Statistical analysis of cointegration vectors. Journal of Economic Dynamics and
Control 12(2-3), 231–254.
Johansen, S. and K. Juselius, 1990, Maximum likelihood estimation and inference on cointegration
with application to the demand for money. Oxford Bulletin of Economics and Statistics 52, 169–209.
Lee J. and Y. Hong, 2001, Testing for serial correlation of unknown form using wavelet methods.
Econometric Theory 17(2), 386–423.
Mallat, S., 1989, A theory for multiresolution signal decomposition: the wavelet representation. IEEE
Transactions on Pattern Analysis and Machine Intelligence 11(7), 674–693.
McCoy, E. J. and A.T. Walden, 1996, Wavelet analysis and synthesis of stationary long-memory
processes. Journal of Computational and Graphical Statistics 5, 26-56.
18
Pan, X and Z. Wang, 2000, A wavelet-based nonparametric estimator of the variance function.
Computational Economics 15(1–2), 79–87.
Percival, D.B. and A.T. Walden, 2000, Wavelet methods for time series analysis. Cambridge University
Press, Cambridge, UK.
Priestley, M.B., 1996, Wavelets and time-dependent spectral analysis. Journal of Time Series Analysis
17(1), 85–103.
Ramsey, J.B. and C. Lampart, 1998a, The decomposition of economic relationships by time scale
using wavelets: Money and income. Macroeconomic Dynamics 2, 49–71.
, 1998b, The decomposition of economic relationships by time scale
using wavelets: Expenditure and income. Studies in Nonlinear Dynamics & Econometrics 3(1), 23-42.
Stengos, T. and Y. Sun, 2001, A consistent model specification test for a regression function based on
nonparametric wavelet estimation. Econometric Reviews 20(1), 41–60.
Wong, H., I. Wai-Cheung, X. Zhongjie and X. Lui, 2003, Modelling and forecasting by wavelets, and
their application to exchange rates. Journal of Applied Statistics 30(5), 537–553.
Yao, F. and Y. Hosoya, 2000, Inference on one-way effect and evidence in Japanese macroeconomic
data. Journal of Econometrics 98(2), 225–255.
19