Survival of Overconfidence in Currency Markets
Thomas Oberlechner1
Webster University Vienna
Carol Osler2,*
Brandeis International Business School
1
Thomas Oberlechner, Webster University Vienna, Department of Psychology, Berchtoldgasse
1, A-1220 Vienna, Austria, Email:
[email protected]
2
Carol Osler, Brandeis International Business School, Brandeis University, Mailstop 32, 415
South Street, Waltham, MA 02454, USA, Email:
[email protected]
* Corresponding author
Acknowledgments:
The authors gratefully acknowledge helpful comments from Steve Cecchetti, Blake
LeBaron, Lukas Menkhoff, Dagfinn Rime, Paroma Sanyal, Robert Shiller, David Simon, and
Hanno Ulmer, as well as seminar participants at the Expertise in Context Conference in Berlin,
the University of Hannover, Queens University Belfast, the LIEP group at Harvard University,
Norges Bank, and the Federal Reserve Bank of New York. Also, we acknowledge with thanks
each foreign exchange professional who took the time to fill out the survey. We gratefully
acknowledge financial support for this research from Hottinger & Partner and the Austrian
Science Fund (Schroedinger Scholarship #J2219 and #J1955-SOZ).
Survival of Overconfidence in Currency Markets
This paper tests the influential hypothesis, typically attributed to Friedman (1953), that
irrational traders will be driven out of financial markets by trading losses. The paper’s main
finding is that overconfident currency dealers are not driven out of the market. Dealers with
extensive experience are neither more nor less overconfident than their junior colleagues. We set
the stage for this investigation by providing evidence that currency dealers display two forms of
overconfidence: they underestimate uncertainty and they overestimate their professional success.
This is notable since dealers face strong incentives to accuracy, have access to comprehensive
information, and have extensive experience.
[Key words: Overconfidence, imperfect rationality, behavioral, exchange rates, foreign
exchange, survival of imperfect rationality. JEL classifications: G14, G15, F31.]
I.
Introduction
This paper tests whether imperfect rationality survives in financial markets, an issue that
is central to the debate between neoclassical and behavioral economics. According to the
neoclassical view, most famously articulated by Friedman (1953), irrational traders will buy
assets at high prices and sell them at low prices, thus incurring trading losses that ultimately
drive them out of the market. Since extensive evidence now exists for the presence of imperfect
rationality in financial markets (Hirshleifer (2001); Daniel et al. (2002)), this argument provides
an important justification for maintaining an exclusive focus on rational agents: If irrational
traders are eventually forced to cease trading, then imperfect rationality may have at most a
transient influence on prices. Nonetheless, empirical research on this question is at best rare and
perhaps nonexistent. The evidence presented here suggests that one form of imperfect rationality,
2
overconfidence, does survive in financial markets and may in fact be unaffected by trading
experience.
We focus on the world’s biggest market, the foreign exchange market, and on the agents
that set the prices in that market, the dealers. We first provide evidence that currency dealers are
indeed overconfident, on average, as logically required for this investigation. Consistent with
evidence for other financial-market agents (see, for example, Glaser and Weber (2007)), the
dealers display two forms of overconfidence: they underestimate uncertainty and they
overestimate their own abilities. We test whether overconfidence survives the rigors of the
marketplace by comparing overconfidence across dealers with differing levels of experience. We
find that experienced and inexperienced currency dealers are equally overconfident. This robust
result holds for both dimensions of overconfidence; it holds whether we measure experience by
years of trading, age, or institutional rank; and it holds for different types of dealers.
The literature suggests a number of reasons why overconfidence might survive in
financial markets. Overconfident traders might make more money because they take on more
risk and thus earn higher returns (De Long et al. (1991); Hirshleifer and Luo (2001)). They might
make more money because their cognitive performance – and hence their ability to spot
profitable trading opportunities – is enhanced by illusions of success (Taylor and Brown (1988)).
Overconfident traders might be more successful even without making more money if their
overconfidence inspires superiors and customers to regard them highly (Trivers (1985)). Finally,
overconfidence might not enhance success per se but it might instead be necessary for dealers to
persist in this stressful, high-stakes profession (Oberlechner (2004); Oberlechner and Nimgade
(2005)).
3
Our finding that overconfidence survives among currency traders has potential relevance
for issues in both international economics and microstructure. At the intersection of those two,
for example, we find the puzzle of excessive real and nominal exchange-rate volatility under
floating exchange rates (Flood and Rose (1995)). Theoretical work by Odean (1998) suggests
that overconfident agents are likely to trade more and that this excess trading could foster excess
volatility. Deaves et al. (2003) and Glaser and Weber (2007) confirm empirically that
overconfident agents do trade more than others. The survival of overconfidence among currency
traders could also explain the profitability of trend-following technical strategies in currency
markets (Menkhoff and Taylor (2007)), since overconfidence can be the source of trending and
trend reversals (Daniel et al. (1998)).
The forward bias puzzle – meaning the substantial average risk-adjusted returns one can
expect from holding high-interest currencies – is another foreign-exchange anomaly that may be
related to the survival of overconfidence. Decades of effort to trace this puzzle to time-varying
premiums for volatility risk have had at best mixed success (Engel (1996)). An important
alternative explanation involves imperfect rationality, for which substantial evidence has long
existed (Goodman (1979); MacDonald (2000)). This evidence indicates that professional
exchange-rate forecasts are biased, inefficient, incompatible across horizons, and less accurate
than the naïve forecast of no change. As shown in Burnside et al. (2010), overconfidence could
be responsible for some of the forecasts’ shortcomings as well as the overall inverse relationship
between exchange-rate changes and interest differentials.
When studying the rationality of economic agents it is important to have an objective
basis for choice among the infinitude of potential departures from perfect rationality. Otherwise,
deviations from rationality could be tailor-made to explain each anomaly, leaving the overall
4
hypothesis of imperfect rationality essentially impossible to disprove. Our objective basis for
choice is the large body of evidence for overconfidence from the disciplines of psychology and
economics. The widespread human tendency towards overconfidence was noted as early as the
mid-1960s (Oskamp (1965)) and is by now one of the strongest findings in the psychology of
judgment and cognition. In the world of asset management, overconfidence has been identified
among defined contribution pension plan members (Bhandari and Deaves (2006)), pension-plan
decision makers (Gort et al. (2008)), individual online brokerage investors (Glaser and Weber
(2007), (2009)), and financial planners (van de Venter and Michayluk (2008)). More broadly,
overconfidence has been identified in settings ranging from corporate investment decisions to
medical diagnosis to driving skills (Malmendier and Tate (2005); Hoffrage (2004) and Alicke
and Govorun (2005) provide recent surveys). We are not alone, of course, in relying on evidence
from psychology to provide a scientific basis for the study of imperfect rationality (see, e.g.,
Barber and Odean (2000), (2001); Daniel et al. (1998); Biais et al. (2005)).
We focus on two important dimensions of overconfidence: hubris and miscalibration.
Hubris refers to the tendency to overestimate one’s own success (Alicke and Govorun (2005);
Camerer and Lovallo (1999); Roll (1986)). Miscalibration refers to the tendency to overestimate
the precision of one’s information (Lichtenstein et al. (1977); Soll and Klayman (2004)); its
name refers to the way perceived probabilities are poorly calibrated to true probabilities.
Our sample of North American currency dealers brings several strengths to the analysis
of imperfect rationality in finance. First, dealers are the agents who actually set prices; many
studies in behavioral finance focus on amateurs, whose influence on the market might be limited.
Second, dealers face intense financial incentives for accuracy: dealers’ annual bonuses, which
typically reach $100,000 and often exceed $1 million, are largely determined by their personal
5
trading profits and the penalties for losses can include the loss of a job. Third, dealers are experts
in their field who get daily practice and are provided by their employers with comprehensive
information resources. Finally, the extent of the dealers’ experience varies widely: some are
trainees while others have over twenty-five years of experience and have the title of Treasurer.
The paper is structured as follows. Section II, which follows, summarizes academic
perspectives on the survival of overconfidence in financial markets. Section III outlines the
structure of the foreign exchange market and describes our data. Section IV provides evidence
that dealers tend towards miscalibration and hubris. Section V presents the evidence for our main
finding, specifically that overconfident dealers are not driven out of the market. Section VI
concludes.
II.
Overconfidence and the Survival of Imperfect Rationality
By now, most economists assume that departures from rationality exist in financial
markets. Nonetheless, they often doubt that these departures compromise financial market
performance. The view is that even if “investors are irrational …, they [may be] met in the
market by rational arbitrageurs who eliminate their influences on prices” (Shleifer (2000), p. 2),
an argument often expressed in terms of a “rational marginal agent.” The present study examines
whether overconfidence, as one form of imperfect rationality, survives among the market
participants most likely to serve as marginal agents. Foreign exchange dealers set the price in
almost every currency transaction of meaningful size and they undertake speculative and
arbitrage trades with substantial resources at their disposal. If overconfidence survives within
6
this group then, given limits to arbitrage, there may not be sufficient rational marginal agents for
the market to achieve and maintain efficient prices.1
Skepticism about the survival of imperfect rationality is often based on Friedman’s
(1953) depiction of how trading losses could drive imperfectly rational traders out of business.
Though Friedman’s analysis did not cite a specific form of imperfect rationality, it could
logically be expected to apply to overconfidence. Biais et al. (2005) find a direct empirical
relation between miscalibration and reduced trading performance. The connection could operate
through excess trading which in turn can result from either miscalibration (Odean (1998)) and or
hubris (Glaser and Weber (2007)).
Skepticism about the survival of imperfect rationality is also sometimes based on a set of
related ideas about cognitive performance: “(i) that people, through repetition, will learn their
way out of biases; (ii) that experts in a field, such as traders in an investment bank, will make
fewer errors; and (iii) that with more powerful incentives, the effects [of cognitive biases]
disappear” (Barberis and Thaler (2003)). While it seems logical that experience, expertise, and
incentives will overcome irrational tendencies, the evidence for all three points is mixed at best.
For example, studies show that experience sometimes reduces forecasting accuracy (Staël von
Holstein (1972); Glaser et al. (2005)) and raises overconfidence (Bradley (1981)). With respect
to expertise, a few studies find that it reduces miscalibration (Keren (1987); List (2003); Murphy
and Winkler (1984)), but most studies find that it has no effect (Chen et al. (2007); Lichtenstein
et al. (1977)). Likewise, financial incentives appear to attenuate some biases (e.g., Laury and
Holt (2005)) but they do not eliminate miscalibration (Fischhoff et al. (1977); Sieber (1974)).
1
We note that when there is positive-feedback trading, as there is in currency markets (Osler (2009)), the
presumption that rational arbitrageurs eliminate departures from the efficient price is incorrect (De Long et al.
(1990)).
7
The finance literature suggests a number of reasons why overconfident agents might
actually survive. De Long et al. (1991) argue that overconfident traders hold riskier assets than
others and thus earn higher average returns. Kyle and Wang (1997) show that overconfident
individuals involved in bilateral negotiations can survive because other participants compete less
aggressively. The trading strategies of overconfident traders may help them exploit the
mispricing caused by liquidity and noise traders (Hirshleifer and Luo (2001)). Wang (2001)
shows, in a theoretical setting, that in markets with high fundamental volatility both
underconfident and extremely overconfident agents are driven out while modestly overconfident
traders ultimately dominate.
Overconfidence may also enhance success and promote trader survival through a number
of psychological channels. Overconfidence could support trading success because it enhances
cognitive abilities (Greenwald (1980)) and it could support non-trading skills such as
organizational capability and leadership because it promotes self-esteem (Taylor and Brown
(1988)). Overconfidence may also speed professional advancement by inflating others’ opinions
about oneself (Trivers (1985); Bénabou and Tirole (2002)). Finally, overconfidence fosters
persistence, an essential trait in a profession where setbacks occur daily (Felson (1984)).
“Confidence in his abilities and efficacy can help the individual undertake more ambitious goals
and persist in the face of adversity” (Bénabou and Tirole (2002), p. 872). Currency dealers
concur. According to a trading manager at a top-ten institution interviewed by the authors, “If
you are not that self-confident, you are not going to be a good trader.” Dealers may self select for
overconfidence through a "sleep-well-at-night" effect: those that can handle the stress sleep well
at night while the others switch careers.
8
As this summary indicates, the disciplines of both psychology and economics provide
conflicting theoretical and empirical predictions about the survival of imperfect rationality in
financial markets. Identifying which predictions are correct requires empirical tests.
III.
Market Overview and Data
Trading in the foreign exchange market, the world’s largest by any measure, averages
roughly $4.0 trillion per day (Bank for International Settlements (2010)). Dealing banks stand
ready to provide instant liquidity to hedge funds, mutual funds, and other asset managers;
corporations that buy and sell goods internationally; central banks; and other “end-user”
institutions. In this predominantly wholesale market, the dealers are involved in almost every
significant currency trade. The interdealer market, in which banks trade with each other, by itself
represents almost half of total trading (Bank for International Settlements (2007)), and the rest of
trading almost invariably involves customer-bank transactions.
A typical dealing room includes salespeople, interbank traders, and proprietary traders.
Salespeople manage the bank’s relationship with customers. Interbank dealers use the interbank
market to manage inventory accumulated in customer trades, to speculate, and to do arbitrage.
Proprietary traders – usually interbank dealers with extensive experience – speculate with the
bank’s own funds in foreign exchange and other assets. (Foreign exchange market structure is
described more fully in Osler (2009).)
Our analysis of overconfidence is based on the results of a survey distributed on June 25,
2002 to all foreign exchange dealing banks in North America. North American dealers, in
aggregate, account for almost one fifth of world-wide trading (Bank for International Settlements
(2007)). Of the 1,080 questionnaires sent out, 416 were completed, an overall return rate of 38.5
percent; this compares favorably with return rates of other surveys of foreign exchange
9
professionals, which are sometimes in single digits (Cheung and Chinn (2001)). In aggregate, the
North American dealers in our study have vast trading capacity: the sum of their daily position
limits is at least $30 billion.2 Participating institutions include a number of prominent banks with
familiar names, which we label “top-tier banks.” Appendix A lists the membership criteria for
this group.
Research based on survey data has become fairly standard in the analysis of exchange
rates (e.g., Frankel and Froot (1987); MacDonald (2000); Cheung and Chinn (2001); Menkhoff
(2001)). This acceptance reflects, in part, an acknowledgment that carefully structured surveys
provide no incentive to distort the truth. Further, evidence indicates that our results would be
unchanged even with financial incentives for accuracy (Fischhoff et al. (1977); Sieber (1974)).
The survey first assessed a number of personal and professional attributes, from which we
draw the following portrait of participating dealers (see Table 1). About half of them work in
New York City, and most work in the spot market. The participants primarily trade U.S. dollars
against the euro, the yen, and the Canadian dollar, with smaller concentrations trading U.S.
dollars against British pound and Swiss franc; a few trade other currency pairs. Though a
majority of our participants trade in the interbank market, sizable portions work in sales (32
percent) or serve as proprietary traders (20 percent). Many dealers have multiple functions. Over
one in five of the salespeople are female, while only one in thirteen interbank and proprietary
traders are female.
[Table 1 near here]
Since our study focuses on the effects of experience on overconfidence, it is essential that
the dealers in our sample represent a wide range of experience. The Trainees in our sample have
2
The aggregate limit figure is based on dealer responses when asked to locate their position limit on a list with five
ranges: $0 -$10 million, $11-$25 million, $26-$50 million, $51-$100 million, and over $100 million.
10
spent just over one year on a trading floor, on average, and are in their mid-twenties; the Junior
Traders average four years of trading experience and are typically in their early thirties; the
Senior Traders average twelve years of experience and are typically in their late thirties; the
dealers in the Head Trader/Treasurers category average 17 years of experience and are typically
over forty. Across all dealers, years of trading experience ranges up to thirty.
IV.
Existence of Overconfidence
This section sets the stage for our main result by providing evidence that currency dealers
tend towards miscalibration and hubris. Readers willing to accept this conclusion can move
direction to Section V, which concerns the survival of these two tendencies.
A.
Miscalibration
Existing evidence that people overestimate the precision of their information largely
comes from experiments in which individuals provide 90-percent confidence intervals for
general knowledge questions such as “How many miles is it from Paris to Tokyo?” (Lichtenstein
et al. (1977)). Under perfect rationality, 10 percent of these confidence intervals would exclude
the correct answer, but that share is typically above 50 percent among experimental subjects
(Nofsinger (2007)). Our survey took a similar approach but tailored the questions to measure
financial-market calibration. Each trader was asked to “enter today’s exchange rates of the Euro,
the Japanese Yen, the British Pound, the Swiss Franc, and the Canadian Dollar against the U.S.
Dollar. Then give your personal forecasts of these exchange rates on December 1, 2002 and on
June 1, 2003. For each currency, give your actual forecast and the lower and the upper limit of a
range within which you expect these rates to be with a certainty of 90%.”
11
1.
Confidence intervals: Standard tests
The standard test for miscalibration involves comparing the share of confidence intervals
that exclude the correct answer to ten percent, the true share under the null hypothesis of no
overconfidence. We first examine the confidence intervals of proprietary traders, who regularly
keep positions open for a month or more. Since the horizons of other traders are shorter,
proprietary traders might be better calibrated than salespeople or interbank traders. We focus on
the dealers’ five most actively traded currency pairs (“currencies” for brevity), identifying them
by the market’s standard 3-letter abbreviations: euro-dollar (EUR), dollar-yen (JPY), sterlingdollar (GBP), dollar-Swiss franc (CHF), and dollar-Canadian dollar (CAD).
Consistent with the literature, the share of the proprietary traders’ confidence intervals
excluding the realized exchange rate at the end of the forecast horizon exceeds ten percent for all
ten of the tests associated with our proprietary traders (five currencies, two forecast horizons). As
shown in Table 2, Panel A, the share of December confidence intervals excluding the realized
rate ranges from 25 for GBP to 44 percent for CHF; for the June forecasts the share ranges from
44 to 92 percent.
The possibility that these results reflect overconfidence is also supported by the fact that
the share of June intervals excluding the realized rate is consistently higher than the
corresponding share of December intervals. Indeed, the average share for June intervals, 70.7
percent, is roughly double the share for December intervals, 35.2 percent. This pattern is
predicted by the robust finding that overconfidence rises with task difficulty (e.g., Lichtenstein et
al. (1977); Pulford and Colman (1997); Dittrich et al. (2005)).
To examine whether proprietary traders are indeed better calibrated than other traders we
carry out the same test for all traders. As anticipated, the share of forecast intervals excluding the
12
realized rate is generally higher for all traders than for proprietary traders. Nonetheless, the
average difference between the relevant shares for proprietary traders and all traders is quite
small, at 4.6 percentage points. This is consistent with evidence in the literature that
miscalibration responds only modestly to financial incentives (Fischhoff et al. (1977); Sieber
(1974)). It may also reflect the importance to all dealers of having a “view” about exchange
rates. Some customers base trading decisions on these views and good forecasting increases
one’s chance of becoming a proprietary trader, considered a plum job.
These results, while striking, cannot persuasively demonstrate overconfidence because
the placement of the dealers’ confidence intervals depends on their point forecasts, which have
two notable shortcomings. First, the point forecasts themselves may be inaccurate, since studies
consistently show that professional forecasters could improve their accuracy by adopting the
naïve forecast of no change (MacDonald (2000)). Second, the point forecasts are probably not
generated independently, since traders often discuss their views with others. We isolate the
influence of confidence-interval width by re-aligning each interval relative to the rate prevailing
when each dealer submits the survey, maintaining proportionate distances of confidence-interval
endpoints above and below.
The share of confidence intervals excluding realized rates generally declines after this
realignment, suggesting that our dealers’ point forecasts, like those of most professional
forecasters, are less accurate than the no-change forecast. Nonetheless, the results do not
challenge our previous inference that dealers tend to underestimate uncertainty. As shown in
Table 2, Panel A, the share of intervals excluding the realized rate remains significantly higher
than ten percent for nine of our ten currency-horizon pairs. Further, the excess over ten percent
13
remains economically meaningful; the average share across the five currencies is 26.3 for
December forecasts and 66.0 for June forecasts.
We test the statistical significance of these realigned confidence intervals by noting that,
under the null hypothesis of no overconfidence, each interval represents a Bernoulli trial where
p=0.10 is the probability of excluding the realized rate. If n is the number of intervals submitted,
the share of realigned intervals excluding the realized rate has a binomial distribution with mean
np and standard deviation np(1-p). Since n is reasonably large, this can be approximated by the
corresponding normal distribution. In these tests, all but one of the ten shares is significantly
different from 10 percent at standard significance levels.
[Table 2 near here]
Overconfidence is also suggested by the inaccuracy of dealer forecasts relative to the nochange forecast. The conclusion that they lack accuracy is supported by a formal statistical
analysis showing that the dealers’ average RMSE typically exceeds twice the RMSE of the nochange forecasts and the dealers’ average directional accuracy, at 51 percent, is not statistically
better than a coin toss (for details see Oberlechner and Osler (2009)). Since the more accurate
no-change forecast is always available free on the dealers’ computer screens, it is striking that
none of them submitted point forecasts within a few basis points of it. Bradley’s (1981) research
suggests this choice may be another manifestation of miscalibration. That study examines the
responses of novices and experts to questions for which subjects had only an even chance of
providing the correct answer. Though subjects’ claimed expertise was unrelated to correctness,
by construction, their willingness to admit ignorance was inversely related to claimed expertise.
14
2.
Confidence intervals: Comparison against benchmarks
Since rational forecasts are not necessarily accurate ones, the foregoing results could be
unrelated to miscalibration. Suppose, for example, that dealers rationally anticipated low
volatility during the forecast horizon but realized volatility was high. In this case the confidence
intervals would tend to exclude realized rates, as occurs in our data, despite being rational.
Instead of comparing confidence intervals to realized outcomes, we next compare the dealers’
confidence-interval widths to the widths implied by rational volatility forecasts.
We forecast volatility using GARCH models of exchange-rate returns at different
horizons.3 Dealers completing the survey in July evidently had to forecast over longer time
horizons (four and ten months) than those completing the survey in October (one and seven
months). Surveys were submitted in July, August, September, and October, so we create
GARCH models of historical non-overlapping returns measured over all the relevant forecast
time horizons: 1, 2, 3, and 4 months for December forecasts; 7, 8, 9, and 10 months for June
forecasts. Each sample ends in the associated survey-completion month, so each model’s final
variance estimate is an out-of-sample forecast of volatility for the associated forecast time
interval.
Dealers’ access to real-time market information might enhance their forecasting ability
relative to the crude AR(1) of our GARCH models, so we shrink the GARCH variance forecasts
before converting them to confidence intervals. By how much should they be reduced? Our
earlier evidence suggests that dealers do not have special forecasting power at horizons of a
3
The data, which come from Datastream, begin in 1970 for EUR, GBP, and CHF, and in 1978 for JPY and CAD. In
every case we first tried a GARCH(1,1) model. This converged (in Stata) in 26 of 40 cases. If this failed, we tried a
GARCH(1,2) model (converged in ten cases), a GARCH(2,2) model (converged in three cases), and finally an
ARCH(1) model.
15
month or more, but Evans and Lyons (2005) show that by incorporating order-flow data they can
reduce one-month forecast variances by up to 15.7 percent. We therefore assume conservatively
that dealers forecast returns as well as Evans and Lyons and reduce the GARCH-based variance
forecasts by 15.7 percent. From these adjusted forecast variances we calculate 90-percent
confidence intervals using the fact that the 90-percent confidence interval for a normal
distribution equals 3.29 times the standard deviation.
If our proprietary traders are well-calibrated, then about half of their confidence intervals
should be narrower than the GARCH benchmark. Instead, they are almost all narrower than the
benchmark (Table 2, Panel B). This finding is robust to three methodological modifications:
broadening our focus to include all traders; considering only traders at top-tier institutions; and
basing the benchmark confidence-interval widths on the unconditional rather than the conditional
distribution of returns.4
To evaluate the statistical significance of these results, we take the null hypothesis to be
no miscalibration and view each interval as the outcome of a Bernoulli trial in which too-wide
and too-narrow have equal probability. The number of too-narrow confidence intervals under the
null thus has a binomial distribution with p = 0.5 and n ≡ nfcm is the number of responding traders
for a given forecast date f, currency c, and survey completion month m (e.g., nDEJ = 262 for
December 1 forecasts for the euro submitted in July). Since our samples are reasonably large and
p is not extreme, the distribution is well approximated as normal with mean pnfcm and variance
p(1-p)nfcm. Our results are highly significant, with all marginal significance levels below 0.0001.
These test statistics do not provide a direct measure of the share of traders who are
overconfident, but back-of-the-envelope calculations suggest that the share is fairly high. As a
4
The results using the unconditional distribution, unreported to save space, are available upon request.
16
baseline, suppose that there are no underconfident dealers, so all dealers are either wellcalibrated or overconfident. Suppose likewise that half of well-calibrated dealers choose toonarrow confidence intervals. In this case, the 94.5 percent share of too-narrow intervals among
our 355 EUR forecasts for December would be observed only if 90 percent of dealers were
overconfident. If we adopt the more generous assumption that the share of too-narrow intervals
from well-calibrated dealers had only a 5 percent likelihood of occurring by chance, the implied
share of overconfident dealers declines only to 88 percent.
As a further robustness test we construct benchmark confidence intervals from option
implied volatilities. These are sampled at the end of the associated survey-completion months
(e.g., for July forecasts we used implied volatilities from the last trading day in July) and are
adjusted for maturity. They are not reduced to allow for the possibility that our dealers have an
advantage in forecasting exchange rates because implied volatilities are independent of any trend
in the underlying asset. As shown in Table 2, Panel C, a large and statistically significant
majority of the confidence intervals provided by the dealers are again smaller than the
benchmarks.
These tests so far assume that survey participants decide the width of their confidence
intervals independently from other participants. In our final test, we evaluate statistical
significance under the alternative assumption that interval widths are not generated
independently. To do this we bootstrap the distribution of confidence intervals assuming that the
confidence-interval width for each dealer, CIi, is the sum of a shared component, CIS, and an
~
~
idiosyncratic component, d i : CIi = CI S + d i . Appendix B describes the methodology in detail.
The results, shown in Table 2, Panel D, again indicate that our survey participants are
17
significantly overconfident. The null hypothesis of rationality is rejected at the five percent level
or better for nine of ten tests and at the ten percent level in one remaining test.
B.
Hubris
So far we have provided evidence that currency traders tend to overestimate the precision
of their information. We next provide evidence that dealers also tend to overrate their personal
success. Survey participants were asked: “How successful do you see yourself as a foreign
exchange trader?” The top rank of 7 was assigned to “Much more successful than other foreign
exchange traders;” the bottom rank of 1 was assigned to “Much less successful than other foreign
exchange traders”; “Average” was assigned to the middle rank of 4. All categories of the sevenstep rating scale were individually labeled.
The participants’ immediate superiors (i.e., head traders or chief dealers) were also asked
to rate their subordinates. The superiors were specifically instructed to compare subordinates to
others with similar responsibilities at the same institutional rank, since such specificity is shown
in the organizational literature to increase discriminative ratings and to reduce judgment errors
among supervisors (Locklear et al. (1989)).5 The superiors rated their subordinates along three
dimensions: “Overall Contribution to the Organization,” “Trading Profits,” and “Trading
Potential.” Traders’ Overall Contributions to the Organization were comprehensively defined to
include trading profits and broader factors such as the support of other traders and the
completion of tasks for the whole group (Borman and Motowidlo (1993)). Overall Contribution
thus matches the overall success rating requested of the traders themselves. Trading Profits is
self-explanatory. Trading Potential was defined as the degree to which traders have the personal
making of successful traders in their trading area; head traders were specifically instructed to
5
For example, without position-specific standards, supervisors may show an unwanted tendency to rate workers
better the higher their rank (Brandstätter (1970)).
18
separate trading potential from realized profits. The three ratings are highly correlated, with
bilateral correlation coefficients ranging from 0.58 to 0.72.
Our currency market professionals give themselves an average rank of 5.06, or “better
than average,” with standard error of 0.05 (Table 3). In all, three quarters of traders perceive
themselves as more successful than average (ratings 5, 6, or 7), a share that is roughly fourteen
times the share of dealers perceiving themselves as below average.
[Table 3 near here]
Though these shares seem extreme, there is at least one reason why they could
conceivably be realistic: most of our dealers work at top-tier institutions. If this explains our
result then traders at lower-tier institutions should generally rate themselves below average, but
the average self-rating of traders at lower-tier banks, 4.86, is only slightly below the average selfrating of traders at top-tier banks, 5.20, and remains significantly higher than the benchmark for
no hubris, 4.0.
As noted earlier, these results are consistent with extensive evidence for the “better-thanaverage effect” (Alicke and Govorun (2005)), as the common tendency towards hubris is often
labeled. Among traders, this apparent bias could reflect traders choosing favorable yardsticks
when rating themselves. Rather than rate themselves against their current competitors, for
example, they might rate themselves against every trader with whom they ever worked,
including those that failed and dropped out of the profession. Alternatively, traders might assign
different weights to the various skills that are involved in trading, with higher weights assigned
to the skills at which they excel. Whatever traders were thinking, it is still statistically
implausible for three-quarters of a large sample of individuals to be above average overall.
19
To evaluate the statistical significance of these results, we undertake a bootstrap test of
the null hypothesis of no hubris. We assume the superiors’ ratings represent the true distribution
of ratings within rank and responsibility classes (which is less restrictive than the assumption that
superiors correctly rated individual traders.) Only two of the twelve rank-responsibility
categories – senior interbank traders and senior salespeople – include sufficient individuals to
generate a meaningfully powerful test. Fortunately, these two categories include 70 percent of
the dealers with both superior and subordinate ratings.
For each rank-responsibility category we create 1,000 sets of k ratings, where kI =78 is
the number of senior interbank traders and kS = 94 is the number of senior salespeople. Each
rating is drawn at random (with replacement) from the associated sample of superiors’ ratings.
For each set of simulated ratings we calculate the share that are above average, and the
distribution of these simulated shares is compared with the corresponding observed share from
that rank-responsibility category. Note that the tests are biased towards accepting the null
because the supervisors themselves tended to rate their subordinates better-than-average.
Average supervisors' ratings (standard errors) are 4.6 (0.09), 4.5 (0.09), and 4.9 (0.08) for the
Overall Contribution, Trading Profits, and Trading Potential dimensions, respectively.
[Table 4 near here]
The tests unambiguously reject the null hypothesis of no overconfidence (Table 4). In our
baseline test, which uses the Overall Contribution rating by superiors, none of the 1,000 sets of
simulated ratings for senior interbank traders had a share of above-average ratings as high as the
observed share of 77.5; indeed, the highest simulated share was 66.2. Similarly, none of the sets
of simulated ratings for senior interbank traders had a share of below-average ratings as low as
20
the observed fraction of 5.6. Our conclusions are not changed if the superiors' ratings for Trading
Profitability or Trading Potential replace their ratings for Overall Contribution.
V.
Survival of Overconfidence
So far this paper has presented evidence that currency dealers display two widespread
human tendencies: they underestimate uncertainty and they overestimate their abilities. This is
notable since dealers face strong incentives to be accurate, have access to comprehensive
information, and get frequent practice forecasting and assessing their personal trading
competence.6
These results set the stage for the paper’s central question: Is overconfidence driven out
of the foreign exchange market? Overconfidence could disappear if individual dealers become
well-calibrated in response to frequent feedback or strong incentives for accuracy. Alternatively,
it could disappear if overconfident dealers are fired or leave the profession voluntarily. This
section presents evidence that overconfidence is not driven out of the market.
We organize our analysis around two testable implications of the hypothesis that
imperfect rationality will not survive:
• Absolute rationality: The most experienced dealers are not overconfident at all.
• Relative rationality: Overconfidence is moderated by experience.
For this analysis we need measures of each individual’s overconfidence. Our measure of
individual miscalibration focuses on the extent to which a dealer’s confidence intervals fall short
of its value under rationality, taking the rational value to be the width calculated from our
GARCH analysis. We first calculate the ratio between the width of each confidence interval and
the relevant GARCH width. As before, we distinguish dealers according to the month in which
6
Profitability, for example, is measured daily and every month each dealer and boss sign statements to show they
agree on the figure.
21
they submitted the survey. For dealer i’s confidence interval for currency c at forecast date f, wicf,
G
G is the appropriate GARCH width for that dealer’s surveythis ratio is Rcfi ≡ wcfi / wcfm
, where wcfm
completion month. Since each dealer provided up to ten confidence intervals, we take the
average of the submitted ratios, Ri. (The correlation between the dealers’ average December and
average June ratios is a strong 0.82, so little information is lost by combining intervals.) Dealer
i’s miscalibration, Mi, is then defined as the gap between unity and this average: Mi ≡ 1- Ri. For
well-calibrated individuals Mi is near zero and could be slightly negative; for overconfident
individuals Mi ranges upwards to a maximum of one.
To measure each trader’s hubris we take the difference between the trader’s rating of
his/her own performance, PiOwn, and the superior’s Overall Contribution rating, PiSup: Hi ≡ PiOwn PiSup. 7 For agents who are not overconfident Hi will be zero; like miscalibration, Hi rises with
overconfidence. Note that this measure is not available for lower-tier banks because superiors’
ratings were only solicited from top-tier banks.
To ensure robustness we measure experience three ways: years spent trading, age, and
institutional rank. For years spent trading we create three categories: 0-8 years, 9-14 years, and
15 or more years. For age we create four categories: 30 and under, 31-35, 36-40, and over forty.
For rank we create three categories: Trainees and Junior Traders, Senior Traders, and Head
Dealers/Treasurers.
It might be natural to assume that miscalibration and hubris are closely related. Indeed,
Gervais and Odean (2001) describe how, in theory, hubris could be the source of miscalibration.
Nonetheless, the correlation between our dealers’ miscalibration and their hubris, -0.18, is not
only economically small but significantly negative. A weak relation between these two
7
This measure effectively assumes that superiors’ ratings are more accurate than those of their subordinates
22
dimensions of overconfidence is consistent with results from other relevant studies (Deaves et al.
(2003); Régner et al. (2004); Glaser and Weber (2007)).
A.
Absolute rationality: Are the most experienced dealers overconfident?
If imperfect rationality is driven out of the market, the most experienced traders will
display zero overconfidence. Table 5, which reports the averages of measured miscalibration and
hubris for dealers in different experience categories, provides no support for absolute rationality.
Table 5, Panel A shows that average miscalibration is statistically significant among the
most experienced dealers no matter how we measure experience. Among dealers with 15 or more
years of trading experience, confidence intervals are 58 percent smaller, on average, than they
would be under the null hypothesis that dealers are all well-calibrated (t-statistic 28.1).
Confidence intervals among dealers over 40 years old are on average 59 percent smaller than
implied by the null (t-statistic 29.8); confidence intervals among Head Traders/Treasurers are on
average over 40 percent smaller (t-statistic 2.9).
Hubris is also economically and statistically strong for experienced dealers, as shown in
Table 5, Panel B. For dealers with 15 or more years of experience, hubris averages 0.65 (tstatistic 2.2). This represents a substantial distortion of their performance ratings, as it covers
over a quarter of the gap between the superior’s average performance rating for this group, which
is 4.5, and the highest possible rating, which is seven. Average hubris is 0.46 for dealers over 40
years old (t-statistic 2.1), which implies that this group inflates their performance rating almost
20 percent of the way towards ideal performance, on average. Average hubris is 1.13 for Head
Traders and Treasurers, which implies that these individuals inflate their performance rating
almost half-way to ideal performance.
[Table 5 near here]
23
As discussed in Section IV, the incentives to overcome miscalibration are strongest for
proprietary traders, so this group provides a stronger test of whether experienced traders are
overconfident (Table 5, Panel C). The miscalibration and hubris measures for this group provide
no evidence for dealer rationality, however. Among the most experienced proprietary traders,
both miscalibration and hubris are statistically and economically significant. (Note that, to
conserve statistical power, we create fewer age categories when analyzing proprietary traders;
we do not use “rank” to measure experience since almost all proprietary traders have the rank of
Senior Trader or higher.)
For robustness we also examine miscalibration among the most experienced dealers at
top-tier banks (as noted earlier, the hubris measures apply only to top-tier banks). These dealers
are likely to have greater influence over market prices than other dealers because they have
larger position limits and more customers. A comparison of results for top-tier banks (Table 5,
Panel D) and for all banks (Table 5, Panel A) indicates that there is no difference between the
most experienced dealers at top- and lower-tier banks. Our conclusion that the most experienced
dealers tend to be overconfident is also sustained if we focus exclusively on the currencies that
dealers trade most actively.8
B.
Relative rationality: Is overconfidence moderated by experience?
Overconfidence could be moderated by experience even if it doesn’t disappear with
experience. The moderation hypothesis is not supported by our comparison of overconfidence
levels across dealers with varying experience. When experience is measured in terms of years
spent trading, the miscalibration estimates are almost identical for dealers at all experience levels
at 0.59, 0.58, and 0.58 for the traders with least experience, intermediate experience, and greatest
8
Results of this robustness test are available upon request.
24
experience, respectively (Table 5, Panel A). When experience is measured by age, the
miscalibration estimates vary more but the variation is non-monotonic. Estimated miscalibration
averages 0.59 for the two youngest groups, 0.52 for traders in their late 30s, and once again 0.59
for traders over forty. Moreover, the differences across experience groups are again insignificant.
When experience is measured by rank, the point estimates of miscalibration decline modestly
with rising rank – from 0.65 to 0.57 to 0.42 – but once again the differences are insignificant.
As shown in Table 5, Panel B, the hubris measures provide similarly scant support for the
hypothesis that overconfidence declines with experience; indeed, this evidence suggests weakly
that hubris rises with experience. When experience is measured by years spent trading, average
hubris is 0.50, 0.47, and 0.65 for traders with the least experience, intermediate experience, and
greatest experience, respectively, though the differences across groups are insignificant. When
experience is measured by age or rank, however, the evidence that hubris rises with experience is
a bit stronger. For both these experience measures, average hubris is insignificant for the least
experienced traders and it is significant – and higher – for all other groups. The evidence for the
hypothesis that hubris rises with experience is weakened, however, by the insignificance of
differences in hubris across experience categories. Research has revealed that that
overconfidence rises with experience and expertise in various settings (Staël von Holstein
(1972); Glaser et al. (2005)). Most notably, for this paper, overconfidence rises with expertise
when maximum achievable accuracy is extremely low, as in exchange-rate forecasting (Bradley
(1981)).
The hypothesis that overconfidence is moderated by experience likewise receives no
support if the sample is restricted to proprietary traders (Table 5, Panel C) or to dealers at top-tier
banks (Table 5, Panel D).
25
The results of this section suggest consistently that overconfident traders are not driven
out of currency markets. Despite many years of trading, experienced dealers still tend to
overestimate the precision of their information and to overestimate their own abilities.
VI.
Concluding Observations
This paper tests whether overconfident traders are driven out of the foreign exchange
market. Friedman famously claimed that imperfectly rational traders will be driven out of
financial markets by trading losses (1953), a claim that has been influential because it supports
the assumption of trader rationality. Nonetheless, this claim has rarely, if ever, been tested. We
examine whether overconfidence, a well-documented form of imperfect rationality, survives
among those who set prices in the world’s largest financial market.
We set the stage for this analysis by providing evidence that currency dealers, like most
people, display two types of overconfidence; they tend to overestimate the precision of their
information (miscalibration) and they tend to overestimate their professional success (hubris).
We then analyze how overconfidence varies across foreign exchange dealers with different
amounts of experience. The results indicate that even the most experienced dealers are strongly
overconfident and that they are just as overconfident as their junior colleagues.
These conclusions are remarkably robust. They hold for both types of overconfidence;
they hold regardless of whether we measure experience by years spent trading, age, or
institutional rank; they hold for proprietary traders, who get the most experience forecasting at
our horizons, as well as other traders; and they hold for traders at the most prestigious dealing
banks.
Future research could constructively examine the mechanism(s) through which dealer
overconfidence survives. Among the many hypotheses worth exploring, we highlight three:
26
Overconfidence, by fostering persistence and a tolerance for adversity, might enable dealers to
sustain a career with intense competition and frequent setbacks; overconfidence, by leading
others to have inflated opinions of one’s success, might increase one’s likelihood of promotion
within an organization; overconfidence, by enhancing cognitive facility, might enhance dealers’
ability to identify profitable trading opportunities.
27
References
Alicke, M. D., and O. Govorun. “The Better-Than-Average Effect.” In The Self in Social Judgment,
M. D. Alicke, D. A. Dunning and J. I. Krueger, eds. New York, NY: Psychology Press (2005).
Bank for International Settlements. Triennial Central Bank Survey of Foreign Exchange and
Derivatives Market Activity in 2007, Basle (2007).
Bank for International Settlements. Triennial Central Bank Survey of Foreign Exchange and
Derivatives Market Activity in 2010 -- Preliminary Global Results -- Turnover.
http://www.bis.org/publ/rpfx10.htm (2010).
Barber, B. M., and T. Odean. “Trading is Hazardous to Your Wealth: The Common Stock
Investment Performance of Individual Investors.” Journal of Finance, 55 (2000), 773-806.
Barber, B. M., and T. Odean. “Boys Will be Boys: Gender, Overconfidence, and Common Stock
Investment.” The Quarterly Journal of Economics, 116 (2001), 261-292.
Barberis, N., and R. Thaler. “A Survey of Behavioral Finance.” In Handbook of the Economics of
Finance, G. M. Constantinides, M. Harris, and R. Stulz, eds., Elsevier Science (2003).
Bénabou, R., and J. Tirole. “Self-Confidence and Personal Motivation.” Quarterly Journal of
Economics, 117 (2002), 871 -915.
Bhandari, G., and R. Deaves. “The Demographics of Overconfidence.” Journal of Behavioral
Finance, 7 (2006), 5-11.
Biais, B., D. Hilton, K. Mazurier, and S. Pouget. “Judgmental Overconfidence, Self-Monitoring, and
Trading Performance in an Experimental Asset Market.” Review of Economic Studies, 72 (2005),
287-312.
28
Borman, W. C., and S. J. Motowidlo. “Expanding the Criterion Domain to Include Elements of
Contextual Performance.” In Personnel Selection in Organizations, N. Schmidt and W. C.
Borman, eds. San Francisco, CA: Jossey-Bass (1993).
Bradley, J. V. “Overconfidence in Ignorant Experts.” Bulletin of the Psychonomic Society, 17
(1981), 82-84.
Brandstätter, H. “Die Beurteilung von Mitarbeitern.” In Handbuch für Psychologie, Vol. 9,
Betriebspsychologie, A. Mayer and B. Herwing, eds. Hogrefe, DE: Verlag für Psychologie
(1970).
Burnside, C., B. Han, D. Hirshleifer, and T. Y. Wang. “Investor Overconfidence and the Forward
Discount Puzzle.” Review of Economic Studies (2010), forthcoming.
Camerer, C., and D. A. N. Lovallo. “Overconfidence and Excess Entry: An Experimental
Approach.” American Economic Review, 89 (1999), 306-318.
Chen, G., K. A. Kim, J. R. Nofsinger, and O.M. Rui. “Trading Performance, Disposition Effect,
Overconfidence, Representativeness Bias, and Experience of Emerging Market Investors.”
Journal of Behavioral Decision Making, 20 (2007), 425-451.
Cheung, Y.-W., and M. D. Chinn. “Currency Traders and Exchange-Rate Dynamics: A Survey of
the U.S. Market.” Journal of International Money and Finance, 20 (2001), 439-471.
Daniel, K. D., D. Hirshleifer, and A. Subrahmanyam. “Investor Psychology and Security Market
Under- and Overreactions.” Journal of Finance, 53 (1998), 1839-1885.
Daniel, K. D., D. Hirshleifer, and S. H. Teoh. “Investor Psychology in Capital Markets: Evidence
and Policy Implications.” Journal of Monetary Economics, 49 (2002), 139-209.
29
Deaves, R., E. Lüders, and G. Y. Luo. “An Experimental Test of the Impact of Overconfidence and
Gender on Trading Activity.” Working paper, McMaster University (2003).
De Long, J. B., A. Shleifer, L. H. Summers, and R. J. Waldmann. “Positive Feedback Investment
Strategies and Destabilizing Rational Speculation.” Journal of Finance, 45 (1990), 379-395.
De Long, J. B., A. Shleifer, L. H. Summers, and R. J. Waldmann. “The Survival of Noise Traders in
Financial Markets.” Journal of Business, 64 (1991), 1-19.
Dittrich, D. A. V., W. Güth, and B. Maciejovsky. “Overconfidence in Investment Decisions: An
Experimental Approach.” European Journal of Finance, 11 (2005), 471 -491.
Engel, C. “The Forward Discount Anomaly and the Risk Premium: A Survey of Recent Evidence.”
Journal of Empirical Finance, 3 (1996), 123-192.
Evans, M. D., and R. K. Lyons. “Meese-Rogoff Redux: Micro-Based Exchange-Rate Forecasting.”
American Economic Review, 95 (2005), 405-414.
Felson, R. B. “The Effect of Self-Appraisals of Ability on Academic Performance.” Journal of
Personality and Social Psychology, 47 (1984), 944-952.
Fischhoff, B., P. Slovic, and S. Lichtenstein. “Knowing With Certainty: The Appropriateness of
Extreme Confidence.” Journal of Experimental Psychology: Human Perception and
Performance, 3 (1977), 552-564.
Flood, R. P., and A. K. Rose. “Fixing Exchange Rates: A Virtual Quest for Fundamentals.” Journal
of Monetary Economics, 36 (1995), 3-37.
Frankel, J. A., and K. A. Froot. “Using Survey Data to Test Standard Propositions Regarding
Exchange Rate Expectations.” American Economic Review, 77 (1987), 133-153.
30
Friedman, M. Essays in Positive Economics, Chicago: University of Chicago Press (1953).
Gervais, S., and T. Odean. “Learning to be Overconfident.” Review of Financial Studies, 14 (2001),
1-27.
Glaser, M., T. Langer, and M. Weber. “Overconfidence of Professionals and Laymen: Individual
Differences Within and Between Tasks?” Working paper (2005).
Glaser, M., and M. Weber. “Overconfidence and Trading Volume.” Geneva Risk and Insurance
Review, 32 (2007), 1-36.
Glaser, M., and M. Weber. “Which Past Returns Affect Trading Volume?” Journal of Financial
Markets, 12 (2009), 1-31.
Goodman, S. H. “Foreign Exchange Rate Forecasting Techniques: Implications for Business and
Policy.” Journal of Finance, 34 (1979), 415-427.
Gort, C., M. Wang, and M. Siegrist. “Are Pension Fund Managers Overconfident?” Journal of
Behavioral Finance, 9 (2008), 163-170.
Greenwald, A.G. “The Totalitarian Ego: Fabrication and Revision of Personal History.” American
Psychologist, 35 (1980), 603-618.
Hirshleifer, D. “Investor Psychology and Asset Pricing.” Journal of Finance, 56 (2001), 1533-1597.
Hirshleifer, D., and G. Y. Luo. “On the Survival of Overconfident Traders in a Competitive
Securities Market.” Journal of Financial Markets, 4 (2001), 73-84.
Hoffrage, U. “Overconfidence.” In Cognitive Illusions: A Handbook on Fallacies and Biases in
Thinking, Judgement and Memory, R. Pohl, ed. New York, NY: Psychology Press (2004).
31
Holt, C. A., and S.K. Laury. “Risk Aversion and Incentive Effects: New Data Without Order
Effects.” American Economic Review, 95 (2005), 902-904.
Keren, G. “Facing Uncertainty in the Game of Bridge: A Calibration Study.” Organizational
Behavior and Human Decision Processes, 39 (1987), 98-114.
Kyle, A. S., and F. A. Wang. “Speculation Duopoly with Agreement to Disagree: Can
Overconfidence Survive the Market Test?” Journal of Finance, 52 (1997), 2073-2090.
Lichtenstein, S., B. Fischhoff, and L. D. Phillips. “The Calibration of Probabilities: The State of the
Art.” In Decision Making and Change in Human Affairs, H. Jungermann and G. de Zeeuw, eds.
Amsterdam, NL: D. Reidel (1977).
List, J. A. “Does Market Experience Eliminate Market Anomalies?” Quarterly Journal of
Economics, 118 (2003), 41-71.
Locklear, T. S., B. B. Granger, and J. G. Veres. “Evaluation of a Behaviorally-Based Appraisal
System.” Journal of Managerial Issues, 1 (1989), 66-75.
MacDonald, R. “Expectations Formation and Risk in Three Financial Markets: Surveying What the
Surveys Say.” Journal of Economic Surveys, 14 (2000), 69-100.
Malmendier, U., and G. Tate. “Does Overconfidence Affect Corporate Investment? CEO
Overconfidence Measures Revisited.” European Financial Management, 11 (2005), 649-659.
Menkhoff, L. “Short-Term Horizons in Foreign Exchange? Survey Evidence from Dealers and Fund
Managers.” Kyklos, 54 (2001), 27-48.
Menkhoff, L., and M. Taylor. “The Obstinate Passion of Foreign Exchange Professionals: Technical
Analysis.” Journal of Economic Literature, 45 (2007), 936–972.
32
Murphy, A. H., and R. L. Winkler. “Probability Forecasting in Meteorology.” Journal of the
American Statistical Association, 79 (1984), 489-500.
Nicolson, A., ed. Société Générale Dealers Directory, London: Société Générale (2002).
Nofsinger, J. The Psychology of Investing, Englewood Cliffs, NJ: Prentice Hall (2007).
Oberlechner, T. The Psychology of the Foreign Exchange Marke, Chichester, UK: John Wiley
(2004).
Oberlechner, T., and A. Nimgade. “Work Stress and Performance among Financial Traders.” Stress
and Health, 21 (2005), 285-293.
Oberlechner, T., and C. Osler. “Overconfidence in Currency Markets.”
http://people.brandeis.edu/~cosler/ (2009).
Odean, T. “Volume, Volatility, Price, and Profit When All Traders are Above Average.” Journal of
Finance, 53 (1998), 1887-1934.
Oskamp, S. “Overconfidence in Case-Study Judgments.” Journal of Consulting Psychology, 29
(1965), 261-265.
Osler, C. L. “Foreign Exchange Microstructure: A Survey.” In Encyclopedia of Complexity and
System Science, R. A. Meyers, ed. New York, NY: Springer (2009).
Pulford, B. D., and A. M. Colman. “Overconfidence: Feedback and Item Difficulty Effects.”
Personality and Individual Differences, 23 (1997), 125-133.
Régner, I., D. Hilton, L. Cabantous, and S. Vautier. “Judgmental Overconfidence: One Positive
Illusion or Many?” Unpublished Working paper, University of Toulouse (2004).
Roll, R. “The Hubris Hypothesis of Corporate Takeovers.” Journal of Business, 59 (1986), 197-216.
33
Shleifer, A. Inefficient Markets: An Introduction to Behavioral Finance, Oxford, UK: Oxford
University Press (2000).
Sieber, J. E. “Effects of Decision Importance on Ability to Generate Warranted Subjective
Uncertainty.” Journal of Personality and Social Psychology, 30 (1974), 688-694.
Soll, J. B., and J. Klayman. “Overconfidence in Interval Estimates.” Journal of Experimental
Psychology: Learning, Memory, and Cognition, 30 (2004), 299-314.
Staël von Holstein, C. A. “Probabilistic Forecasting: An Experiment Related to the Stock Market.”
Organizational Behavior and Human Decision Processes, 8 (1972), 139-158.
Taylor, S. E., and J. D. Brown. “Illusion and Well-Being: A Social Psychological Perspective on
Mental Health.” Psychological Bulletin, 103 (1988), 193-210.
Trivers, R. Social Evolution, Menlo Park, CA: Benjamin/Cummings (1985).
van de Venter, G., and D. Michayluk. “An Insight into Overconfidence in the Forecasting Abilities
of Financial Advisors.” Australian Journal of Management, 32 (2008), 545-557.
Wang, A. F. “Overconfidence, Investor Sentiment, and Evolution.” Journal of Financial
Intermediation, 10 (2001), 138-170.
34
Appendix A: "Top" and "lower" tier foreign-exchange dealing banks
Top Tier: Leading North American foreign exchange dealing banks. The 26 banks in this
category were defined as institutions included in at least one of the following lists: (i) membership in the
New York Foreign Exchange Committee in 2001 and/or 2002; (ii) the top 10 institutions of the Best
Provider of Foreign Exchange Services Overall annual ranking published by Global Investor Magazine in
March 2001 and/or March 2002; (iii) the top 10 institutions of the Global Top 50 Foreign Exchange
Market Companies by estimated market share annual ranking published by Euromoney Magazine in May
2001; (iv) the top 10 institutions of the annual Best Bank Overall for Foreign Exchange Dealing ranking
published by Foreign Exchange Week in December 2001; (v) the top 10 institutions in the annual ranking
of banks’ foreign exchange revenues 2001 published by Foreign Exchange Week in December 2001.
North American trading floors of the resulting 26 institutions were contacted and invited to
participate in the study. Twenty-one of these trading floors agreed to participate fully, resulting in an
institutional participation rate of 81 percent of the leading market participants. Of 551 questionnaires sent
to these 21 trading floors, 326 were returned, an individual questionnaire return rate of 60 percent.
Lower Tier: Other foreign exchange dealing banks in North America. These are defined as all
other foreign exchange banking institutions in the U.S. and in Canada listed in Societé Générale's Dealer
Directory (Nicolson (2002)). Foreign exchange traders at these institutions were sent questionnaires by
mail. Of 529 questionnaires sent to banks in this group, 90 were returned, resulting in a questionnaire
return rate of 17 percent for this group.
35
Appendix B: Testing significance under null that confidence-interval widths are not independent
This appendix describes the bootstrap procedure used to identify the distribution of
confidence intervals under the assumption that dealer’s confidence interval widths are not
independent. Specifically, it is assumed that each dealer’s confidence-interval width, CIi, is the
~
~
sum of a shared component, CIS, and an idiosyncratic component, d i : CIi = CI S + d i . The
bootstrap simulations are done separately for each combination of forecast date f and currency c.
Within each f-c combination we work separately with each forecast submission month m, since
any shared component would likely differ across surveys submitted in different months. For
concreteness we illustrate our methodology with the December 1, 2002 confidence intervals for
euro submitted in July of 2002.
S
To determine the shared variance components, VDec
, Euro , July , we first sample (with
replacement) 1,000 values from our GARCH model’s fitted variances for the four-month returns,
V Dec , Euro , July . We assume (as in previous tests) that dealers can use order-flow information to
reduce the forecast variance by the full 15.7 percent estimated by Evans and Lyons (2005). We
then calculate the associated 90-percent confidence interval widths: CISDec,Euro,July =
S
3.29 (1 − 0.157)VDec
, Euro , July .
To find the idiosyncratic component of a simulated confidence interval width,
~i
d Dec
, Euro , July , we first calculate the differences between each dealer’s originally-submitted
proportionate confidence-interval width and the mean proportionate width across all dealers for a
given forecast submission month: diDec,Euro,July = widthDec,Euro,July - meanwidthDec,Euro,July. We then
sample (with replacement) nDec,Euro,July of the original differences for each of the 1,000 elements
36
of V Dec , Euro , July to create the nDec,Euro,July simulated widths for that survey completion month:
~i
~
S
CI iDec,Euro,July = CI Dec
, Euro , July + d Dec , Euro , July .
We then sample a second time (with replacement) from the relevant fitted variances to
create a set of 1,000 “true” variances, VTDec,Euro,July, and associated true rational confidence
T
interval widths: CITDec,Euro,July = 3.29 (1 − 0.157)VDec
, Euro , July .
We repeat this procedure for the December 1 forecasts submitted in August, September,
and October. This provides a set of 1,000 “periods.” For each period there are four true
confidence-interval widths, one for each survey-completion month and four sets of idiosyncratic
confidence-interval widths. For each period we calculate how many of the simulated dealer
widths are smaller than their associated true width. We then express the total as a share of the
number of simulated widths for that forecast date and currency, nDec,Euro = ∑ n Dec , Euro ,m . The
m
distribution of these shares is taken to be the true distribution of shares under the null hypothesis
that individuals’ estimated confidence interval widths are unbiased but correlated with each other
to the extent that they share the same mean on a period-by-period basis. We note that this test
implicitly assumes that estimated and actual confidence interval widths are uncorrelated, which
we accept because it favors the hypothesis of no overconfidence and because anything else
would necessarily involve arbitrary assumptions that would make it difficult to interpret the
results.
37
Table 1: Characteristics of survey respondents
The table provides descriptive information concerning the 416 respondents to a June 2002 survey
of North American currency market professionals. Note: totals may exceed 100% because some
individuals fit multiple categories.
Location
New York City 53%
U.S. non-NYC 33%
Canada
14%
Product
Spot
73%
Forward
33%
Derivatives 23%
Money Market 6%
Function
Interbank Trader 59%
Salesperson
32%
Proprietary Trader 20%
Rank
Treas./Head Trader 12%
Senior Trader
75%
Junior Trader
12%
Trainee
1%
Age
Over 40
36-40
31-35
26-30
Less than 26
Currency Focus
EUR/USD
61%
USD/JPY
42%
USD/CAD
39%
GDP/USD
30%
USD/CHF
20%
USD vs. Other
20%
EUR/JPY
11%
EUR/GBP
11%
EUR/CHF
8%
EUR vs. Other
10%
30%
27%
27%
13%
3%
38
Table 2: Confidence intervals are too narrow
Ninety-percent confidence intervals for exchange rates on December 1, 2002 and June 1, 2003
were submitted by North American currency market professionals in response to a survey
distributed in June of 2002. Panel A shows the share of confidence-intervals widths that exclude
the realized exchange rate. Under the null hypothesis of no miscalibration and independence the
true share is 10%. For “realigned confidence intervals,” end-points are aligned relative to the
exchange rate on the day the survey was completed rather than dealers’ point forecasts.
Panels B and C show the share of confidence-intervals widths that exceed objective
benchmarks. Under the null the share is 50%. In Panel B this benchmark is based on GARCH
variance estimates. In Panel C the benchmark is based on option implied volatilities. Under the
null the shares have a binomial distribution with p = 0.10 (Panel A) or p = 0.5 (Panels B and C).
Panel D bootstraps the statistical significance of fractions from Panel B assuming that interval
widths are not generated independently. * means 5% significance; ** means 1% significance.
EUR
JPY
GBP
A. Share of confidence intervals excluding realized exchange rates
Proprietary traders
December 1 forecast
35.2
35.7
25.4
June 1 forecast
87.9
47.7
43.8
All traders
December 1 forecast
29.3
45.1
33.7
June 1 forecast
94.6
48.5
45.9
All traders: Realigned confidence intervals
December 1 forecast
13.8
38.1**
22.4**
June 1 forecast
98.8**
18.7**
20.3**
B. Percent of intervals narrower than GARCH estimates
Proprietary traders
December 1 forecast
93.7**
90.7**
98.1**
June 1 forecast
92.8**
96.3**
98.1**
All traders
December 1 forecast
94.9**
92.5**
99.1**
June 1 forecast
95.2**
99.7**
98.8**
All traders at top-tier banks
December 1 forecast
97.1**
92.1**
99.2**
June 1 forecast
95.6**
100.0**
99.6**
39
CHF
CAD
35.9
82.0
43.9
92.1
34.0
93.8
54.7
95.8
14.7*
94.3**
42.5**
97.7**
96.9**
99.0**
64.7**
85.3**
98.5**
99.1**
70.2**
93.7**
99.6**
99.6**
71.9**
93.4**
C. Percent of intervals narrower than estimates from option implied volatilities
Proprietary traders
December 1 forecast
94.6**
91.6**
93.3**
94.9**
June 1 forecast
97.3**
92.5**
97.1**
95.9**
All traders
December 1 forecast
97.7**
95.1**
97.6**
96.9**
June 1 forecast
99.2**
95.1**
98.8**
98.2**
All traders at top-tier banks
December 1 forecast
98.2**
95.5**
97.7**
98.4**
June 1 forecast
99.6**
95.1**
99.2**
99.2**
D. GARCH benchmarks with variance forecasts that are not independent
All traders
December 1 forecast
94.9**
92.5**
99.1**
98.5**
June 1 forecast
95.2*
99.7**
98.8**
99.1**
40
97.1**
94.1**
97.9**
97.9**
98.4 **
98.0**
70.2
93.7*
Table 3: Most traders rate themselves professionally above-average
The table shows self-assessments of personal success submitted by North American currency
market professionals as part of a survey distributed in June of 2002. Participants rated their own
professional success on a scale of 1 = far below average to 7 = far above average, with 4 =
average. "Top-tier" traders work for "Top-tier" banks.
All traders
Top-tier traders
Other traders
5.06
(0.05)
5.20
(0.06)
4.86
(0.11)
Share above average (5, 6, 7)
73.6
74.9
68.9
Share below average (1, 2, 3)
5.4
4.5
8.9
Number of participants
401
311
90
Average self-rating
(Standard error)
41
Table 4: Bootstrap tests of hubris
The table shows the marginal significance of the share of North American currency market
professionals rating themselves above average. Their superiors were asked to rate them on the
same scale, with respect to three dimensions of success: Overall Contribution, Trading Profits,
and Trading Potential. Distribution of traders’ share under the null of no hubris is bootstrapped
from superiors’ ratings. Values of 0.000 indicate that the most extreme simulated share was less
extreme than the observed share.
Observed
share
Bootstrapped marginal significance levels
Overall
Contribution
Trading
Profits
Trading
Potential
Senior interbank traders
Above average (5, 6, 7)
77.5
0.000
0.000
0.003
Below average (1, 2, 3)
5.6
0.000
0.000
0.003
Above average (5, 6, 7)
76.7
0.009
0.000
0.014
Below average (1, 2, 3)
5.6
0.001
0.001
0.000
Senior salespersons
42
Table 5: Overconfidence unrelated to experience
The table shows average overconfidence for currency dealers with different levels of experience.
A dealer’s miscalibration is measured as the average percent gap between the width of his/her
confidence intervals and GARCH-based benchmarks. A dealer’s hubris is measured as the gap
between his/her self-rating of performance and the superior’s Overall Contribution rating.
“Hubris share” is average hubris as a percent of the gap between 7, the highest possible rating,
and the superior’s average rating for the group. All measures are zero for perfectly calibrated
individuals and rise with overconfidence.
5A: Miscalibration, all traders
Experience = years trading
Miscalibration
t-statistic
Number obs.
0-8 yrs vs. …
9 – 14 yrs vs. …
Experience = age
Miscalibration
t-statistic
Number obs.
Under 30 yrs vs. …
31 – 35 yrs vs. …
36 – 40 yrs vs. …
0-8
9 - 14
0.59
0.58
22.53
27.80
106
102
t-statistics for differences across groups
0.27
≤ 30
31 - 35
0.60
0.58
16.68
24.72
56
96
t-statistics for differences across groups
0.51
15
0.58
28.06
107
0.09
-0.21
36 - 40
41 +
0.52
8.35
95
0.59
29.83
103
1.15
0.92
0.28
-0.34
-1.10
Trainees,
Senior
Head Traders,
Jr. Traders
Traders
Treasurers
Miscalibration
0.65
0.57
0.42
t-statistic
17.69
41.66
2.85
Number obs.
49
257
38
t-statistics for differences across groups
Trainee and Jr. Trader vs. …
1.93
1.48
Senior Trader vs. …
1.01
Experience = rank
43
Table 5B: Hubris, all traders
Experience = years trading
Hubris
t-statistic
Hubris share
Number obs.
0-8 yrs vs. …
9 – 14 yrs vs. …
0-8
0.50
2.25
9 - 14
0.47
2.81
15
0.65
2.19
0.14
0.22
0.26
57
76
t-statistics for differences across groups
0.09
51
-0.40
-0.51
Experience = age
≤ 30
31 - 35
36 - 40
41 +
Hubris
t-statistic
Hubris share
Number obs.
0.23
0.94
0.64
3.70
0.57
2.37
0.46
2.06
0.10
0.27
0.27
0.19
61
60
-0.98
0.24
-0.69
0.64
0.33
Under 30 yrs vs. …
31 – 35 yrs vs. …
36 – 40 yrs vs. …
Experience = rank
Hubris
t-statistic
Hubris share
Number obs.
47
73
t-statistics for differences across groups
-1.36
Trainees,
Jr. Traders
0.41
1.52
Senior
Traders
0.45
3.72
Head Traders,
Treasurers
1.13
2.06
0.15
0.18
0.45
41
183
t-statistics for differences across groups
Trainee and Jr. Trader vs. …
-0.12
Senior Trader vs. …
44
15
-1.17
-1.21
5C: Miscalibration and hubris for traders by functional responsibility
Function
Overconfidence
t-statistic
Number obs.
Sales vs. …
Interbank vs. …
Experience =
years trading
Overconfidence
t-statistic
Number obs.
0 - 7 yrs vs. …
8 – 15 yrs vs. …
Miscalibration
InterProprieta
Sales
Sales
bank
ry
0.64
0.55
0.50
0.16
34.64
15.53
14.05
0.81
108
174
64
72
t-statistics for differences across groups
2.23
3.35
1.04
Proprietary traders only
Hubris
Interbank
0.57
3.68
127
-1.65
-2.70
-1.35
0-8
0-8
9 - 14
15 +
0.47
0.53
0.52
0.78
6.21
7.82
10.24
2.50
22
11
28
17
t-statistics for differences across group
-0.68
-0.55
0.23
0.83
3.54
9
1.09
2.21
11
-0.13
-0.53
-0.47
9 - 14
15 +
Proprieta
ry
0.92
4.58
39
Experience = age
≤ 35
≤ 35
> 35
Overconfidence
t-statistic
Number obs.
0.50
0.51
0.94
7.86
11.56
3.42
25
38
19
t-statistics for differences across group
-0.13
0.90
3.02
20
35-and-under vs. …
> 35
45
0.09
5D: Miscalibration at top-tier banks
Experience = years trading
Miscalibration
t-statistic
Number obs.
0-8 yrs vs. …
9 – 14 yrs vs. …
Experience = age
Miscalibration
t-statistic
Number obs.
Under 30 yrs vs. …
31 – 35 yrs vs. …
36 – 40 yrs vs. …
0-8
9 - 14
15
0.58
0.58
0.62
20.22
25.56
24.42
95
83
65
t-statistics for differences across groups
-0.16
-1.09
-1.05
≤ 30
31 - 35
36 - 40
41 +
0.60
0.58
0.58
15.16
22.77
19.86
51
83
66
t-statistics for differences across groups
0.46
0.32
-0.15
0.60
25.78
72
Trainees,
Senior
Head Traders,
Jr. Traders
Traders
Treasurers
Miscalibration
0.64
0.57
0.52
t-statistic
15.13
33.20
90.28
Number obs.
42
175
23
t-statistics for differences across groups
Trainee and Jr. Trader vs. …
1.43
1.62
Senior Trader vs. …
0.83
Experience = rank
46
-0.06
-0.71
-0.50