Academia.eduAcademia.edu

Learning To Be Overcondent

2001

We develop a multi-period market model describing both the process by which traders learn about their ability, and how a bias in this learning can create overconfident traders. A trader in our model initially does not know his own ability. He infers this ability from his successes and failures. In assessing his ability the trader takes too much credit for his successes, and this leads him to become overconfident. A trader's expected level of overconfidence increases in the early stages of his career. Then, with more experience, he comes to better recognize his own ability. The patterns in trading volume, expected profits, price volatility, and expected prices resulting from this endogenous overconfidence are analyzed.

Learning To Be Overconfident∗ Simon Gervais University of Pennsylvania and Terrance Odean University of California at Davis First Version: 10 May 1996 This Version: 7 June 1999 The authors would like to thank Roger Edelen, Hayne Leland, Andrew Lo, Ananth Madhavan, David Romer, Matthew Spiegel, Brett Trueman, Greg Willard, and two anonymous referees, as well as seminar participants at l’Ecole des Hautes Etudes Commerciales (Montréal), Rutgers University, the University of Texas at Austin, the 1998 meetings of the American Finance Association, and the UCLA conference on the “Market Efficiency Debate” for their comments and suggestions. All remaining errors are the authors’ responsibility. Address correspondence to Simon Gervais, Finance Department, Wharton School, University of Pennsylvania, Steinberg Hall - Dietrich Hall, Suite 2300, Philadelphia, PA 19104-6367, (215) 898-2370. ∗ Abstract We develop a multi-period market model describing both the process by which traders learn about their ability, and how a bias in this learning can create overconfident traders. A trader in our model initially does not know his own ability. He infers this ability from his successes and failures. In assessing his ability the trader takes too much credit for his successes, and this leads him to become overconfident. A trader’s expected level of overconfidence increases in the early stages of his career. Then, with more experience, he comes to better recognize his own ability. The patterns in trading volume, expected profits, price volatility, and expected prices resulting from this endogenous overconfidence are analyzed. Your herds and flocks may increase, and you may amass much silver and gold — everything you own may increase. But your heart may then grow haughty, and you may... say to yourself, ‘It was my own strength and personal power that brought me all this prosperity.’ Deuteronomy 8:13-14,17 The Living Torah translation by Rabbi Aryeh Kaplan An old Wall Street adage advises “Don’t confuse brains with a bull market.” The need for such wisdom stems from traders’ willingness to attribute too much of their success to their own abilities and not enough to their good fortune. Successful traders are, thus, prone to become overconfident in their abilities. It is a common feature of human existence that we constantly learn about our own abilities by observing the consequences of our actions. For most people there is an attribution bias to this learning: we tend to overestimate the degree to which we are responsible for our own successes (Wolosin, Sherman and Till, 1973; Miller and Ross, 1975; Langer and Roth, 1975). As Hastorf, Schneider and Polifka (1970) write, “we are prone to attribute success to our own dispositions and failure to external forces.” People recall their successes more easily than their failures and “even misremember their own predictions so as to exaggerate in hindsight what they knew in foresight” (Fischhof, 1982). In this paper, we develop a multi-period market model describing both the process by which traders learn about their ability and how a bias in this learning can create overconfident traders. Traders in our model initially do not know their ability. They learn about their ability through experience. Traders who successfully forecast next period dividends improperly update their beliefs; they weight too heavily the possibility that their success was due to superior ability. In so doing they become overconfident. In our model, a trader’s level of overconfidence changes dynamically with his successes and failures. A trader is not overconfident when he begins to trade. Ex ante, his expected overconfidence increases over his first several trading periods and then declines.1 Thus the greatest expected overconfidence in a trader’s lifespan comes early in his career. After this he tends to develop a progressively more realistic assessment of his abilities as he ages. 1 More precisely, this will happen for insiders with learning biases that are not too large, where “not too large” is precisely defined in section 3. 1 One criticism of models of non-rational behavior is that non-rational traders will underperform rational traders and eventually be driven to the margins of markets if not out of them altogether.2 This is, however, not always the case. De Long, Schleifer, Summers and Waldmann (1990) present a model where non-rational traders in an overlapping generations model earn higher expected profits than rational traders by bearing a disproportionate amount of the risk that they themselves create. Rational traders are unwilling to take long-term arbitrage positions to eliminate these higher profits because of the risk that they may die before the arbitrage pays off. In our model, the most overconfident and non-rational traders are not the poorest traders. In fact, for any given level of learning bias and trading experience, it is successful traders, though not necessarily the most successful traders, who are the most overconfident. Overconfidence does not make traders wealthy, but the process of becoming wealthy can make traders overconfident. The model also shows that success affects traders’ conditional future expected profits in two ways. First, success is indicative of higher ability and, therefore, greater expected future profits. Second, success can increase overconfidence and thereby lower expected future profits through suboptimal decision-making. The detrimental effect of the more successful traders’ greater overconfidence on their future expected profits may, on occasion, more than offset their greater probable ability. Most models of financial market microstructure assume that all trader characteristics are common knowledge; in particular, traders’ risk aversion, their wealth, and the distribution of their information are known by all market participants. Exceptions include Blume, Easley and O’Hara (1994), Gervais (1996), and Subramanyam (1996). In these papers, the precision of the traders’ information is random. Each traders’ precision is known to himself but is uncertain to other market participants who must infer it from his actions. Our model builds on these works by extending this uncertainty to the trader himself. He initially does not know the precision of his own information, and must infer it by observing his signals and subsequent outcomes. A large literature demonstrates that people are usually overconfident and that, in particular, they are overconfident about the precision of their knowledge.3 Benos (1998), Odean (1998), Kyle and Wang (1997), and Wang (1997) examine models with statically overconfident traders. In these models greater overconfidence leads to greater expected trading volume and greater price volatility.4 2 Early proponents of this view include Alchian (1950), and Friedman (1953). More recently, Blume and Easley (1982, 1992) have reinforced these ideas analytically. 3 See for example Alpert and Raiffa (1982), and Lichtenstien, Fischhoff and Phillips (1982). Odean (1998) provides an overview of this literature. 4 In one exception, Odean (1998) shows that an overconfident, risk-averse market-maker may reduce market volatility. 2 In our model, a greater learning bias causes greater expected overconfidence, which leads to greater expected trading volume and greater price volatility. Daniel, Hirshleifer and Subrahmanyam (1998) look at trader overconfidence in a dynamic model. Our paper differs from theirs in that we concentrate on the dynamics by which self-serving attribution bias engenders overconfidence in traders, and not on the joint distribution of trader ability and the risky security’s final payoff.5 Our approach has the advantage of being analytically tractable. Our analysis differs from that of Daniel, Hirshleifer and Subrahmanyam in that we are interested in the dynamic effects of biased learning on overconfidence, trading volume, price volatility and trading profits, whereas their main objective is to show that overconfidence implies different price correlation patterns in the short run and the long run. Nevertheless, we show that expected prices exhibit the same humped-shape patterns through time in both models. Overconfidence is determined, in our model, endogenously and changes dynamically over a trader’s life. This enables us to make predictions about when a trader is most likely to be overconfident (when he is inexperienced and successful) and how overconfidence will change during a trader’s life (it will, on average, rise early in a trader’s career and then gradually fall). The model also has implications for changing market conditions. For example, most equity market participants have long positions and benefit from upward price movements. We would therefore expect aggregate overconfidence to be higher after market gains and lower after market losses. Since, as we show, greater overconfidence leads to greater trading volume, this suggests that trading volume will be greater after market gains and lower after market losses. Indeed, Statman and Thorley (1998) find that this is the case. We would expect aggregate overconfidence to be particularly high in a market with many young traders who have experienced only a long bull run. Thus, our model predicts that trading volume and volatility should be higher in the late stages of a bull market than in the late stages of a bear market. Rabin and Schrag (1999) develop a model of confirmatory bias, the tendency to interpret new information as confirming one’s previous beliefs. In as much as people tend to have positive selfimages, confirming bias and self-serving attribution bias are related. Our paper differs considerably from Rabin and Schrag’s, in that we analyze the effect of attribution bias on the overconfidence of traders in financial markets. The rest of this paper is organized as follows. In section 1, we introduce a one-security multiperiod economy with one insider, one liquidity trader and one market maker. In section 2, we develop the conditions under which there is a unique linear equilibrium in our economy. This linear equilibrium is used in section 3 to analyze the effects of the insider’s learning bias on his 5 This is because the risky security’s dividend is publicly revealed at the end of every trading round in our model. 3 overconfidence and profits, as well as on the market’s trading volume, volatility, and price patterns. Section 4 discusses the empirical implications of the model and, finally, section 5 concludes. All the proofs are contained in the appendices. 1 The Economy We study a multi-period economy in which only one risky asset is traded among three market participants: an informed trader, a liquidity trader, and a market maker. At the end of period t, the risky asset pays off a dividend v̂t , unknown to all the market participants at the beginning of the period.6 At the beginning of each period t, the risk-neutral informed trader (also called insider ) observes a signal θ̂t which is correlated with v̂t . In particular, the signal θ̂t is given by θ̂t = δ̂t v̂t + (1 − δ̂t )ε̂t , (1) where ε̂t has the same distribution as v̂t , but is independent from it. The variable δ̂t takes the values 0 or 1. Since ε̂t is independent from v̂t , the insider’s information will only be useful when δ̂t is equal to one. We assume that this will happen with probability â, i.e.   1, prob. â δ̂t | â =  0, prob. 1 − â. (2) This last equation shows that, the higher â is, the more likely that δ̂t will be equal to one. For this reason, we call â the insider’s ability.7 We assume that nobody (including the insider himself) knows precisely the insider’s ability â at the outset (i.e. at time zero). Instead, we assume that, a priori, the insider’s ability is drawn from the following distribution: 6 7   H, prob. φ 0 â =  L, prob. 1 − φ0 , Throughout the whole paper, we use a “hat” over a variable to denote the fact that it is a random variable. Equivalently, we could call â the insider’s information precision. 4 (3) where 0 < L < H < 1, and 0 < φ0 < 1. Of course, since the security dividend v̂t is announced at the end of every period t, the insider will know at the end of every period whether his information for that period was real (δ̂t = 1) or was just pure noise (δ̂t = 0).8 For tractability reasons, we also assume that the market maker observes θ̂t at the end of period t, so that his information at the end of every period is the same as the insider’s.9 This information will be useful to both the insider and the market maker in assessing the insider’s ability â. In making this last assumption, we are essentially saying that the insider’s informational advantage over the market maker is one of market timing. Indeed, the insider’s information set at the beginning of every period is always exactly one period ahead of the market maker’s. Since our goal is not to explain the differences between these two information sets, we reduce the information gap between the insider and the market maker to only (and exactly) one period. Our analysis is then simplified in that both the insider and the market maker perform the same one-period updating at the end of every period, except that the insider’s updating will be biased. Preventing the market maker from observing θ̂t at the end of period t would simply result in a more complex (non Markov) updating process for the market maker, but would not affect the insider’s updating process, in which we are ultimately interested. It would, however, increase the insider’s expected profits since the market maker’s informational disadvantage would then be greater. As mentioned above, our model seeks to describe the behavior of an informed trader with a learning bias. In particular we want to model the phenomenon that traders usually think “too much of their ability” when they have been successful at predicting the market in the past. In statistical terms, this will mean that traders update their ability beliefs too much when they are right. Before we formally include this behavior into our model, let us describe how a rational/unbiased insider would react to the information he gathers from past trading rounds. Let ŝt denote the number of times that the insider’s information was real in the first t periods, that is ŝt = t X δ̂u . (4) u=1 It can be shown, using Bayes’ rule, that, at the end of t periods, a rational insider’s updated beliefs 8 This will be the case since ε̂t = v̂t happens with zero probability with the continuous distributions that we will specify later. 9 As will become clear below, we could have equivalently assumed that every trader’s order and identity (insider vs liquidity trader) are revealed at the end of every period. 5 about his own ability will be given by φt (s) ≡ Pr{â = H | ŝt = s} = H s (1 − H)t−s φ0 . H s (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) (5) We denote this rational insider’s updated expected ability by µt (s) ≡ E[â | ŝt = s] = Hφt (s) + L[1 − φt (s)]. (6) In fact, since we do not assume any kind of irrational behavior on the part of the market maker, and since the market maker’s information set is the same as the insider’s at the end of every period, this will be the market maker’s updated belief at the end of period t. In modeling the self-serving attribution bias (which we simply refer to as the learning bias from now on), we assume that a trader who successfully forecasts a dividend, weights this success too heavily when applying Bayes’ rule to assess his own ability. In choosing our updating rule we seek to accurately model the behavior psychologists have observed, to create a simple, tractable model, and to choose an updating rule that departs gradually from rational updating, and can therefore be used to describe traders with different degrees of bias, including unbiased traders. Psychologists find that when people succeed, they are prone to believe that success was due to their personal abilities rather than to chance or outside factors; when they fail, they tend to attribute their failure to chance and outside factors rather than to their lack of ability. They also find that “self-enhancing attributions for success are more common than self-protective attributions for failure” (Fiske and Taylor, 1991; also see Miller and Ross, 1975). This observed behavior can be modeled by assuming that, when a trader applies Bayes’ rule to update his belief about his ability, he overweights his successes, he underweights his failures, and he overweights successes more than he underweights failures. The model is simpler and the qualitative results unchanged if one simply assumes, as we do, that successes are weighted too heavily and failures are weighted correctly. More precisely, we assume that when evaluating his own ability the insider overweights his successes at predicting the security’s dividend (i.e. every time that δ̂t = 1) by a learning bias factor γ ≥ 1 (γ = 1 representing a rational insider). For example, at the end of the first period, if the insider finds that θ̂1 = v̂1 (i.e. δ̂1 = 1), the insider will adjust his beliefs to φ̄1 (1) ≡ Prb {â = H | ŝ1 = 1} = 6 γHφ0 , γHφ0 + L(1 − φ0 ) (7) where the subscript to “Pr” denotes the fact that the probability is calculated by a biased insider. This updated probability is larger than that of a rational insider, i.e. φ̄1 (1) ≥ φ1 (1). Also, as can be seen from (7), φ̄1 (1) will be higher the larger γ is, and φ̄1 (1) → 1 as γ → ∞; in other words, the learning bias dictates by how much the insider adjusts his beliefs towards being a high ability insider. Moreover, our model departs continuously from rationality in the sense that φ̄1 (1) → φ1 (1) as γ → 1. It is easily shown that, in this case, φ̄t (s) ≡ Prb {â = H | ŝt = s} = (γH)s (1 − H)t−s φ0 , (γH)s (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) (8) and the (biased) insider’s updated expected ability is given by µ̄t (s) ≡ Eb [â | ŝt = s] = H φ̄t (s) + L[1 − φ̄t (s)]. (9) At the beginning of every period t, the risk-neutral insider observes his signal θ̂t ; he then chooses his demand for the risky security in order to maximize his expected period t profits,10 π̂t , conditional on both his signal and his ability beliefs µ̄t−1 (ŝt−1 ) (which only depend on ŝt−1 ). We denote this demand by x̂t = Xt (θ̂t , ŝt−1 ). (10) The other trader in the economy is a trader who trades for liquidity purposes in every period. This liquidity trader ’s demand in period t is given exogenously by the random variable ẑt . Both orders, x̂t and ẑt , are sent to a market maker who fills the orders. As in Kyle (1985), we assume that the market maker is risk-neutral and competitive, and will therefore set prices so as to make zero expected profits. So, if we denote the total order flow coming to the market maker in period t by ω̂t = x̂t + ẑt , (11) 10 As we mention above, both the risky dividend and the insider’s signal are announced at the end of every period, so that the market maker is always exactly one period behind the insider in terms of information at the beginning of the next period. This implies that the insider never finds it optimal to suboptimally choose his demand for one period in order to maximize longer-term profits. 7 the market maker will set the security’s price equal to p̂t = Pt (ω̂t , ŝt−1 ) ≡ E[v̂t | ω̂t , ŝt−1 ] (12) in period t. An equilibrium to our model is defined as a sequence of pairs of functions (Xt , Pt ), t = 1, 2, . . . , such that the insider’s demand in period t, Xt (θ̂t , ŝt−1 ) maximizes his expected profits (according to his own beliefs) for that period given that he faces a price curve Pt , while the market maker is expecting zero profit in that period. As it will become obvious later, the main results of this paper are driven by the insider’s updating dynamics. As such, the liquidity trader and the market maker only play minimal roles in this model. In fact, their role is essentially one of market clearing. Indeed, the presence of the liquidity trader introduces noise that will prevent the “no trade equilibrium” described by Milgrom and Stokey (1982) from occurring. The competitive market maker assumption is simply made out of convenience; the presence of a risk-neutral rational trader would serve a similar purpose, as in Daniel, Hirshleifer and Subrahmanyam (1998). Appendix D presents a brief synopsis of alternative specifications that in turn avoid the presence of the liquidity trader and the market maker, introduce the possibility for the market maker to maximize profits, and increase the number of market participants. As argued in that appendix, these specifications do not affect the nature of the insider’s (or insiders’) updating and, therefore, do not change the main results of the paper. 2 A Linear Equilibrium In this section, we show that, when v̂t , ε̂t , and ẑt are jointly and independently normal, there is a linear equilibrium to our economy. We use that linear equilibrium in section 3 to illustrate the properties of the model. More precisely, for this and the next section, we assume that       Σ 0 0       ε̂t  ∼ N  0  ,  0      ẑt 0 0 Σ   0   , t = 1, 2, . . . , Ω v̂t 0 0 (13) and that each such vector is independent of all the others. Note that it is crucial that Var(v̂t ) = Var(ε̂t ), since we do not want the size of θ̂t to reveal anything about the likelihood that δ̂t = 1 until v̂t is announced. In other words, ability updating is only possible when both θ̂t and v̂t become 8 observable. Let us conjecture that, in equilibrium, the function Xt (θ, s) is linear in θ, and that the function Pt (ω, s) is linear in ω: Xt (θ, s) = βt (s) θ, (14a) Pt (ω, s) = λt (s) ω. (14b) Our objective is to find βt (s) and λt (s) which are consistent with this conjecture. We start with the following result. Lemma 2.1 Assume that a linear equilibrium exists in period t, that is assume that (14a) and (14b) hold. Then, in period t, the insider’s demand for the risky security is given by x̂t = µ̄t−1 (ŝt−1 )θ̂t , 2λt (ŝt−1 ) (15) and the market maker’s price schedule is given by p̂t = µt−1 (ŝt−1 )βt (ŝt−1 )Σ ω̂t . βt2 (ŝt−1 )Σ + Ω (16) Proof : See Appendix A. This lemma establishes that we can indeed write x̂t = βt (ŝt−1 )θ̂t with µ̄t−1 (s) , 2λt (s) (17) µt−1 (s)βt (s)Σ . βt2 (s)Σ + Ω (18) βt (s) = and p̂t = λt (ŝt−1 )ω̂t with λt (s) = However, the result relies on the assumption that a linear equilibrium exists. It turns out that this assumption is not always satisfied given the insider’s learning bias. In fact, the following lemma derives the exact condition under which such an equilibrium will exist in a given period t. 9 Lemma 2.2 In any given period t, there exists a linear equilibrium of the form conjectured in (14a) and (14b) if and only if µ̄t−1 (ŝt−1 ) ≤ 2µt−1 (ŝt−1 ). Proof : See Appendix A. This condition essentially states that the biased insider’s beliefs about his ability cannot exceed those of the rational market maker by too much for an equilibrium to exist. It effectively ensures that the insider will on average be making profits, even though he may not be optimizing correctly due to his biased beliefs.11 Were this condition not satisfied, the market maker would know that the insider will take a position that will on average result in negative profits. Our condition that the competitive market maker quotes a price schedule earning him on average zero profits can then never be satisfied. This results in a market breakdown. The following lemma derives a condition that is both necessary and sufficient to always avoid such outcomes. Lemma 2.3 A necessary and sufficient condition for µ̄t−1 (ŝt−1 ) > 2µt−1 (ŝt−1 ) to be avoided for any history up to any period t and any γ > 1 is that H ≤ 2L. Proof : See Appendix A. Note that, for a given fixed value of γ, H ≤ 2L is too strong a condition to avoid market breakdowns for any history, i.e. the condition is then sufficient but not necessary. However, since market breakdowns are outside the scope of this paper, we assume that H ≤ 2L is always satisfied in the rest of our analysis.12 Such an assumption allows us to vary γ throughout the paper without having to worry about the existence of an equilibrium, but does not affect the qualitative aspects of our results.13 We finish this section with the following characterization of the equilibrium. Lemma 2.4 Assuming that H ≤ 2L, there is always a unique linear equilibrium to the economy described in section 1. In this equilibrium, the insider’s demand and the market maker’s price 11 Note that this condition does not violate the absence of arbitrage results established by Bossaerts (1999) for security prices, and by De Finetti (1937) for subjective probabilities. Indeed, although the insider’s learning bias could introduce profit opportunities for other informed traders (were they present in the economy), they do not lead to riskless arbitrage opportunities. In fact, the beliefs of the market maker in our model correspond to the market beliefs in Bossaerts’ work. Note also that, if the insider is risk-averse, this existence condition is not needed; however, the model then loses tractability. 12 For a formal analysis of market breakdowns, consult Bhattacharya and Spiegel (1991). 13 In fact, although this assumption guarantees the survival of the insider by making his expected profits positive, it does not guarantee the survival of overconfidence for an individual, as we show in section 3.2. 10 schedule are given by (14a) and (14b) with βt (s) = λt (s) = s Ω µ̄t−1 (s) , and Σ 2µt−1 (s) − µ̄t−1 (s) r 1 Σ µ̄t−1 (s) [2µt−1 (s) − µ̄t−1 (s)]. 2 Ω (19a) (19b) Proof : See Appendix A. In what follows, we use this equilibrium to study the effects of the insider’s learning bias on the economy. 3 Properties of the Model In this section, we analyze the effects of the insider’s learning bias on the properties and dynamics of the economy in equilibrium. We introduce a measure of overconfidence analogous to that found in static models and show that the learning bias results in dynamically evolving insider overconfidence. We then look at the effect of this changing overconfidence on trading volume, trader profits, price volatility, as well as the expected price patterns. 3.1 Convergence If this financial market game is played to infinity, we would expect both the insider and the market maker to eventually learn the exact ability â of the insider. This in fact would be true for a rational insider (γ = 1). However, since our insider learns his ability with a personal bias, this result is not immediate; in fact, as we shall see, this result is not true for a highly biased insider. When â = H, we expect the insider to correctly guess the one-period dividend a fraction H of the time. So, as we play the game more and more often (as t tends to ∞), we expect his updated posteriors φ̄t (s) = (γH)s (1 − H)t−s φ0 = (γH)s (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) 11 1 1+  L γH s  1−L 1−H t−s 1−φ0 φ0 to behave like 1 1+  L γH Ht  t−Ht 1−L 1−H 1−φ0 φ0 1 = 1+  L γH H  1−L 1−H 1−H t . 1−φ0 φ0 This last quantity will converge to 1 as desired if " L γH H  1−L 1−H 1−H #t H  1−L 1−H → 0 as t → ∞, or equivalently if  L γH 1−H < 1. The following lemma shows that this is indeed the case. Lemma 3.1 When â = H, the updated posteriors of the insider φ̄t (ŝt ) will converge to 1 almost surely as t → ∞. Proof : See Appendix B. So both the insider and the market maker will eventually learn the insider’s ability precisely when it is high (when â = H). Let us now turn to the case where â = L. In this case, we expect the insider to correctly guess the one-period dividend a fraction L of the time. So, as we play the game more and more often (as t tends to ∞), we expect his updated posteriors φ̄t (s) to behave like 1 1+  L γH Lt  1−L 1−H t−Lt 1−φ0 φ0 1 = 1+  L γH L  1−L 1−H 1−L t . 1−φ0 φ0 As the following lemma shows, this quantity only converges to zero when γ is close enough to 1. This means that the market maker will always learn the insider’s ability when it is low (when â = L),14 but the insider will only do so if his learning bias is not too large. 14 This is due to the fact that we assume that the market maker’s learning is unbiased. 12 Lemma 3.2 When â = L, the updated posteriors of the insider φ̄t (ŝt ) will converge as follows:  ∗    0, if γ < γ a.s. φ̄t (ŝt ) −−→ φ0 , if γ = γ ∗    1, if γ > γ ∗ , where γ ∗ = L H  1−L 1−H (1−L)/L . Proof : See Appendix B. One implication of this lemma is that a low ability insider whose learning bias is sufficiently extreme may never acknowledge his low ability, no matter how much experience he has.15 To illustrate this, we show in Figures 1(a) and 1(b) how we expect an insider to adjust his beliefs about his own ability (φ̄t ) when his actual ability is high, and when it is low respectively. As seen in these figures, the biased insider’s beliefs are always on average larger than those of an unbiased but otherwise identical insider. Since unbiased insiders always eventually learn their ability, it is therefore not surprising to find that high ability insiders also always learn their own ability:16 they naturally tend to update towards that high ability. However, as shown in Figure 1(b), a biased insider may not always give in to his observations: more precisely, if γ > γ ∗ , he will never find out if he is a low ability insider. 3.2 Patterns of Overconfidence As we show in section 3.1, the insider will eventually learn his own ability, provided that his learning is not too biased (i.e. provided that γ is not too large). This means that, when the insider’s ability is low, the insider eventually comes to his senses, and recognizes the fact that he is a low ability insider. However, it is always the case that the insider thinks too highly of himself relative to an otherwise identical unbiased insider. This section introduces a measure for this discrepancy; we call it the insider’s overconfidence. The evolution of the insider’s overconfidence throughout his life is central to our study, as this overconfidence essentially measures by how much our model departs from a purely rational setup in any given period. 15 Bossaerts (1999) shows that prior beliefs that are correctly updated using Bayes’ law converge to the right posterior beliefs, whether the priors are biased or not. In contrast, our result shows that correct priors updated incorrectly may not lead to the correct posteriors. 16 In fact, they will do so faster the more biased they are. 13 1.0 ˆ ] E[ t(sˆt) | a=H 0.9 1.0 0.8 1.5 0.7 25 9 5.0 0.6 5 10 15 20 25 Period t (a) Convergence when â = H. 0.7 E[ t(sˆt) | ˆa=L] 0.6 0.5 1.0 0.4 1.5 0.3 25 9 0.2 5.0 0.1 5 10 15 Period t 20 25 (b) Convergence when â = L. Figure 1: Convergence patterns of the insider’s expected beliefs about his own ability when (a) â = H; (b) â = L. Both figures were obtained with H = 0.9, L = 0.5, φ0 = 0.5, and Σ = Ω = 1. Each line was drawn with a different γ, shown in the legends. Note that, with these parameter values, γ ∗ = 25/9 ≈ 2.78. In our model, an insider is considered very overconfident at the end of a particular period t if his updated expected ability at that time (µ̄t (ŝt )) is large compared to the updated expected ability that a rational insider would have reached with the same past history of successes and failures (µt (ŝt )). To measure the insider’s overconfidence at the end of t periods, we therefore define the random variable κ̂t ≡ Kt (ŝt ) ≡ 14 µ̄t (ŝt ) . µt (ŝt ) (20) Of course, when the insider is rational (γ = 1), the numerator is exactly equal to the denominator of this expression, and κ̂t = 1 for all t = 1, 2, . . . . On the other hand, when the insider’s learning is biased (γ > 1), we have µ̄t (ŝt ) ≥ µt (ŝt ), and κ̂t ≥ 1 for all t. As the next proposition shows, the insider’s overconfidence in period t is greater when the insider’s learning bias is large. In other words, the insider’s overconfidence is directly attributable to his learning bias. Proposition 3.1 The function Kt (s) defined in (20) is increasing in γ. Proof : See Appendix B. Our measure of overconfidence at any point in time is therefore increasing in the insider’s learning bias, but is it also increasing in the number of his past successful predictions? Since the insider’s overconfidence results from his learning bias when he is successful, it is tempting to conclude that the more successful an insider is, the more overconfident he will be. As we next show, this intuition is wrong. First, since the insider updates his beliefs incorrectly only after successful predictions, it is always true that µ̄t (0) = µt (0), and therefore Kt (0) = 1. However, as soon as the insider successfully predicts one risky dividend, his learning bias makes him overconfident,17 and µ̄t (1) > µt (1), so that Kt (1) > 1. So, it is always true that the insider’s first successful prediction makes him overconfident.18 However, it is not always the case that an additional successful prediction always makes the insider more overconfident. To see this, suppose that we are at the end of the second period. The insider will then have been successful 0, 1 or 2 times. We already know that K2 (1) > K2 (0) = 1 for any value of the insider’s learning bias parameter γ. Now, suppose that γ is large. This means that if the insider is successful in the first period, he will immediately (and perhaps falsely) jump to the conclusion that he is a high ability insider, i.e. µ̄1 (1) is close to H. Since this one successful period has already convinced the insider that his ability is high, the second period results will not have much of an effect on his beliefs, whether he is successful or not in that period, i.e. µ̄2 (2) is close to µ̄2 (1). On the other hand, if the insider had been rational (γ = 1), he would have adjusted his expected ability beliefs more gradually. In particular, after a first period success, a rational insider does 17 In our model, traders are not overconfident when they begin to trade. It is through making forecasts and trading that they become overconfident. This leads market participants to be, on average, overconfident. In real markets, selection bias may cause even beginning traders to be overconfident. Indeed, since not everyone trades, it is likely that people who rate their own trading abilities most highly will seek jobs as traders or will trade actively on their own account. Those with actual high ability and those with high overconfidence will rate their own ability highest. Thus, even at the entry level, we would expect to find overconfident traders. 18 Also, as section 3.1 shows, he will remain so for the rest of his life. 15 not automatically conclude that his ability is high. Instead, he adjusts his posterior expected ability beliefs towards H, and uses the second period result to further adjust these beliefs: upward towards H if he is successful, and downward towards L otherwise. As a result, µ2 (2) will be somewhat larger than µ2 (1). Therefore, since µ̄2 (2) ≈ µ̄2 (1) and µ2 (2) > µ2 (1), we have K2 (2) ≡ µ̄2 (1) µ̄2 (2) < ≡ K2 (1), µ2 (2) µ2 (1) and we see that K2 (s) decreases when s goes from 1 to 2. In short, the biased insider adjusts his beliefs non-rationally with every successful prediction, making him overconfident. However, when the insider’s past performance is sufficiently good (large number of successful predictions), it is the case that even an unbiased insider would reach the conclusion that he is a high ability insider. In other words, the significance of the insider’s past performance overweights the significance of his learning bias. The following proposition describes this phenomenon in more details. Proposition 3.2 The function Kt (s) defined in (20) is increasing over s ∈ {0, . . . , s∗t } and de- creasing over s ∈ {s∗t , . . . , t}, for some s∗t ∈ {1, . . . , t}. Proof : See Appendix B. Intuitively, this result says that a trader who has been very successful in only a few rounds of trading or one who has been moderately successful in several rounds of trading will have a greatly inflated opinion of his ability. But a trader who has been very successful over many rounds of trading probably does have high ability. And while he may overestimate his expected ability, he does not do so by as much as do moderately successful traders. In this model, traders are rational in all respects except that they have a common learning bias: they tend to attribute their successes disproportionately to their own ability. This leads successful traders to become overconfident. Other learning biases can also lead to overconfidence. For example, it is well known that, when updating beliefs from sequential information, people tend to weight recent information too heavily and older information too little (Anderson, 1959, 1981; Hogarth and Einhorn, 1992).19 We do not introduce this recency effect into our model because 19 In experimental studies, subjects sometimes also exhibit a “primacy effect,” weighting the earliest observations of a time series heavily (Anderson, 1981). This happens most often in situations where subjects lose interest in the data (Hogarth and Einhorn, 1992). It is unlikely that traders would lose interest in their own successes and failures, and so we would not expect to find a large primacy effect in their updating. 16 doing so would negate the Markov property of the insider’s (and the market maker’s) updating process. It is clear though that, if traders weight recent outcomes more heavily than older ones, recently successful traders will tend to become overconfident. The last two propositions describe how the insider’s overconfidence in a particular period depends on his learning bias and on his previous performance. Let us now turn to how his overconfidence is expected to behave over time. To do this, we calculate the ex ante expected period t overconfidence level of the insider, E [κ̂t ]. Since Pr{ŝt = s} = Pr {ŝt = s | â = H} Pr {â = H} + Pr {ŝt = s | â = L} Pr {â = L}     t t s s t−s = H (1 − H) φ0 + L (1 − L)t−s (1 − φ0 ), s s (21) we have t   X  µ̄t (s) t  s , H (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) E [κ̂t ] = µt (s) s (22) s=0 where µt (s) and µ̄t (s) are as in (6) and (9). Figure 2 shows the patterns in the expected level of overconfidence for different values of γ. When γ is relatively small (γ < γ ∗ ), the insider will on average be overconfident at first but, over time, will converge to a rational behavior. This can be explained as follows. Over a small number of trading periods a trader’s success rate may greatly exceed that predicted by his ability. Very successful traders will overestimate the likelihood that success is due to ability rather than luck. But over many trading periods a trader’s success rate is likely to be close to that predicted by his ability. Only those traders with extreme learning bias (or with very unlikely success patterns) will fail to recognize their true ability. Indeed, as γ increases, the insider tends to put more and more weight on his past successes, and so takes a little more time to rationally find his ability. However, if γ is too large (more precisely, if γ > γ ∗ ), it is possible that the insider puts so much weight on his past successes in the stock market that he never becomes rational. It can be shown that E [κ̂t ] then converges to φ0 + (1 − φ0 ) H L. Thus our model predicts that more inexperienced traders will be more overconfident than ex- perienced traders. Less experienced traders are more likely to have success records which are unrepresentative of their abilities. For some, this will lead to overconfidence. By the law of large numbers, older traders are likely to have success records which are more representative of their abilities; they will, on average, have more realistic self assessments. In a sufficiently large group 17 1.30 1.25 1.25 E[ ˆt ] 1.20 1.75 1.15 2.25 25 9 1.10 5.00 1.05 1.00 0 20 40 Period t 60 80 100 Figure 2: Ex ante expected patterns in the level of overconfidence of the insider over time. The figure was obtained with H = 0.9, L = 0.5, φ0 = 0.5, Σ = Ω = 1. Each line was obtained with a different value of γ shown in the legend. Note that, with these parameter values, γ ∗ = 25/9 ≈ 2.78. of traders,20 however, there will be some successful, older, low-ability traders with whom the odds have not yet caught up. These traders are likely to make large mistakes in the future. As we see in Figure 2, on average, a trader’s overconfidence increases during the first part of his trading experience and decreases during the latter part. It is natural to ask what factors determine the point at which a trader’s overconfidence is likely to peak. All other things being equal, the greater a trader’s learning bias, γ, the longer it is likely to take for his overconfidence level to peak. Figure 3 illustrates this effect. In addition to the degree of learning bias, how quickly a trader’s overconfidence peaks (and how quickly he ultimately learns his true ability) depends on the frequency, speed, and clarity of the feedback he receives. A trader who receives frequent, immediate, and clear feedback will, on average, peak in overconfidence early, and quickly realize his true ability. One who receives infrequent, delayed, and ambiguous feedback will peak in overconfidence later, and more slowly realize his true ability. In general, financial markets are difficult environments for learning, as feedback is often ambiguous and comes well after a decision was made. We would expect those who trade most frequently, such as professional traders, and those who keep careful records rather than relying on memory to learn most quickly. In our model a trader receives immediate feedback each period. The greater the difference between the high (H) and the low (L) ability levels, the clearer 20 Appendix D.3 presents an extension of the model with multiple insiders. The learning and overconfidence dynamics are essentially the same as in this single insider model. 18 40 Period of maximum expected overconfidence 35 30 25 20 15 10 5 1.25 1.50 1.75 2.00 2.25 2.50 2.75 Learning bias (g) Figure 3: Period of maximum expected overconfidence as a function of the insider’s learning bias γ. The figure was obtained with H = 0.9, L = 0.5, and φ0 = 0.5. 100 Period of maximum overconfidence 80 60 40 20 0.15 0.2 0.25 0.3 0.35 0.4 0.45 Dispersion of priors (H-L) Figure 4: Period of maximum expected overconfidence as a function of the dispersion H − L of the insider’s prior ability beliefs. The figure was obtained with φ0 = 0.5, γ = 1.1, and keeping the insider’s ex ante expected ability φ0 H + (1 − φ0 )L constant at 0.7. the feedback. Figure 4 illustrates how long it takes on average for the insider’s overconfidence to peak as a function of H − L. 19 3.3 Effects of the Insider’s Learning Bias Section 3.2 shows that the insider’s learning bias has a dynamic impact on his beliefs about his ability and on the way he interprets future private information. This in turn affects the future trading process. In this and the next section, we describe how this trading process, as measured by trading volume, trader profits, and price volatility, is affected. Since the insider’s learning bias is unaffected by his success rate and vice versa,21 our model allows us to analyze the effects of the learning bias on the insider’s behavior and on the properties of the economy in two different ways. First, given a fixed past history of the insider’s successes and failures, we can vary the size of his learning bias to get an idea of the impact of that bias. This is the focus of the current section. Second, we can fix the insider’s learning bias, and determine the effects of different trading histories on the insider and the economy in general. We will turn to this in section 3.4. Before embarking on the effects of the insider’s learning bias, we calculate the three quantities that will help us measure these effects: trading volume, expected insider profits, and price volatility.22 Let ψ̂t denote the trading volume in period t. Since this trading volume comes from both the insider and the liquidity trader, it is given by ψ̂t ≡ 1 (|x̂t | + |ẑt |) . 2 (23) The following lemma shows how the expected one-period trading volume, expected insider profits and price volatility are calculated, conditional on the insider having been successful s times in the first t periods. Lemma 3.3 Conditional on the insider having been successful s times in the first t periods (i.e. conditional on ŝt = s), the expected volume, the expected insider profits, and the price variance in period t + 1 are given by i h √ i 1 h√ Σ βt+1 (s) + Ω , E ψ̂t+1 ŝt = s = √ 2π 21 (24) The learning bias γ is constant in every period, i.e. it does not change based on the insider’s past successes. Also, the insider’s successes, on which his updating is correctly based, are the direct results of his ability whatever the insider’s learning bias may be, i.e. his success rate (unlike his profits) is not affected by his learning bias. 22 We will study the effects of the learning bias on the patterns of prices in a separate section 3.5. 20 E [π̂t+1 | ŝt = s] = p 1√ ΣΩ µ̄t (s) [2µt (s) − µ̄t (s)], 2 (25) and Var [p̂t+1 | ŝt = s] = Σ µ̄t (s) µt (s), 2 (26) respectively. Proof : See Appendix B. Recall from equation (14a) in section 2 that the insider will multiply his period t + 1 signal, θ̂t+1 , by βt+1 (s) to obtain his demand for the risky asset in that period. In other words, βt+1 (s) represents the insider’s trading intensity in period t + 1 after having been successful s times in the first t periods. As the above lemma shows, greater average insider intensity leads to larger expected volume. Moreover, as shown in section 3.2, a biased insider, who has had at least one success, is always overconfident. In other words, the insider thinks that his signal θ̂t+1 in period t + 1 is more informative than it really is. This leads him to use his information more aggressively than he should and leads to higher expected trading volume in the risky security. As the next proposition demonstrates, the greater the learning bias the greater this trading. Proposition 3.3 Given that ŝt = s, the expected volume in period t+1 is increasing in the insider’s learning bias parameter γ. Proof : See Appendix B. As mentioned above, expected volume in a particular period will be larger the larger the expected insider trading intensity is for that period. Notice that we can rewrite βt+1 (s) given in (19a) as βt+1 (s) = s  −1 Ω 2 −1 , Σ Kt (s) (27) where Kt (s) is as defined in equation (20). This tells us that the trading intensity βt+1 (s) of the insider in period t + 1 given that he has been successful s times in the first t periods is a 21 monotonically increasing function of the insider’s overconfidence Kt (s) after t periods. Since we showed in Proposition 3.1 that the insider’s overconfidence in any period, given any number of past successes, is increasing in γ, it is natural to find that expected volume in a particular period will also be increasing in γ. Let us now look at the effect of the learning bias on the insider’s profits. We know that the biased insider trades too aggressively on his information; in other words, the insider’s learning bias makes him act suboptimally. It is therefore not surprising that the insider’s expected profits in any given period are decreasing in his learning bias parameter γ. Proposition 3.4 Given that ŝt = s, the expected insider profits in period t + 1 are decreasing in the insider’s learning bias parameter γ. Proof : See Appendix B. The more overconfident the insider, the more he trades in response to any given signal. This increases his expected trading relative to that of the liquidity trader. Therefore the signal to noise ratio in total order flow increases and the market-maker is able to make better inferences about the insider’s signal. The market-maker is then able to set prices that vary more in response to θ̂t and are closer to the expected dividend conditional on the insider’s signal (E[v̂t | θ̂t ]) and further from its unconditional expectation (zero). This increases price volatility. Proposition 3.5 Given that ŝt = s, the expected price volatility in period t + 1, as measured by the price’s variance, is increasing in the insider’s learning bias parameter γ. Proof : See Appendix B. 3.4 Effects of the Insider’s Past Performance The effects described in section 3.3 are static in the sense that they do not depend on the insider’s past performance. Given any success history, the next period’s trading volume and price volatility are expected to be larger and the next period’s insider profits are expected to be lower when γ is large. These results are analogous to the results documented by Odean (1998) who shows that trader overconfidence has these effects in a one-period economy. Since, as documented in section 3.2, the insider’s learning bias eventually makes him overconfident, our results are natural extensions of Odean’s static results. 22 However, we also know from section 3.2 that the insider’s overconfidence changes dynamically with his past performance. In this section we look at how, for an insider with a particular learning bias, the economy is affected by that insider’s past performance. The monotonic relationship between βt+1 (s) and Kt (s) described in (27) also helps us characterize the conditional expected volume in a particular period t + 1, given different past histories at the end of t periods. It is not surprising to find that the expected one-period volume given s insider successes in the first t periods has the same shape as Kt (s) as a function of s, which we described in Proposition 3.2. Proposition 3.6 The expected volume in period t + 1, conditional on the insider having been successful s times in the first t periods (i.e. given ŝt = s), is increasing over s ∈ {0, . . . , s◦t } and decreasing over s ∈ {s◦t , . . . , t}, for some s◦t ∈ {1, . . . , t}. Proof : See Appendix B. In section 3.3, we saw that the insider’s profits are reduced by his learning bias. As we shall see next, this learning bias can cause a successful insider’s expected future profits to be smaller than a less successful insider’s. This is because two forces affect an insider’s expected future profits: his overconfidence and his expected ability. To disentangle these two forces, let us describe the insider’s expected profits in period t + 1 after he has been successful s times in the first t periods. We know from section 3.2 that the insider’s overconfidence at the end of t periods is at a minimum of 1 when s = 0. We also know from Proposition 3.2 that overconfidence increases with the number s of past insider successful dividend predictions (up to s∗t ). This means that the insider’s decision in period t + 1 will be more and more distorted as s increases.23 At the same time, as s increases, it becomes increasingly likely that the insider’s ability is high,24 though not as likely as the insider thinks. A biased insider who becomes sufficiently overconfident may act so suboptimally that he more than offsets the potential increase in expected profits coming from his probably higher ability. As the insider’s overconfidence comes back down (s > ŝ∗t ), successes decrease the insider’s overconfidence while increasing his expected ability. Thus both forces lead to additional expected future profits. Figures 5(a) and 5(b) illustrate how the insider’s overconfidence and expected ability counterbalance each other. In Figure 5(a), we look at the insider’s expected profits in period 11, as a 23 This was seen to be true in equation (27), where we show that βt+1 (s) is monotonically increasing in Kt (s). Result C.1 of Appendix C shows that more past successes increase the likelihood φ̄t (s) that the insider’s ability is high. 24 23 function of the number of successes he has had in the first 10 periods; we do this for three different values of γ (1, 2, and 5), and otherwise use the same parameters as in Figures 1 and 2. It is clear from that figure that an unbiased insider always benefits from an additional past success, since his expected ability is higher. However, when the insider’s learning is biased, it is possible that his overconfidence (which we plot in Figure 5(b)) prevents him from benefiting from the boost in expected ability that results from an additional success. In fact, for this example, we can see that an insider with γ = 2 or γ = 5 who has had six successes in the first 10 periods does worse than an insider who has yet to predict one dividend correctly! This simple numerical example can in fact be generalized as follows. Proposition 3.7 Given that ŝt = s, the expected insider profits in period t + 1 are increasing over s ∈ {0, . . . , s′t } and s ∈ {s′′t , . . . , t}, but are decreasing over s ∈ {s′t , . . . , s′′t } for some (s′t , s′′t ) ∈ {1, . . . , t}2 such that s′t ≤ s′′t . Proof : See Appendix B. Since the insider can only be perfectly right or completely wrong in any given period, the correct measure of his past performance at the beginning of period t + 1 in this model is the number of his past successes (ŝt ). In reality, traders can be right or wrong to different extents, and so the measure that is typically used to measure their performance is past profits. It is easily shown that, in our model, expected past profits are monotonically increasing in the number of past successes. Figure 5(c) illustrates this for the above numerical example. Therefore future expected insider profits, as a function of past profits, are first increasing, then decreasing, and then increasing again. Corollary 3.1 Conditional on the insider’s cumulative profits π in the first t periods, the expected insider profits in period t + 1 are increasing over π ∈ (−∞, πt′ ] and π ∈ [πt′′ , ∞), but are decreasing over π ∈ [πt′ , πt′′ ] for some (πt′ , πt′′ ) ∈ R2 such that πt′ ≤ πt′′ . In our model traders trade on their own account. We do not model the agency issues associated with money managers investing for others nor do we model the relationship between an individual money manager and the fund for which he may work. These issues may greatly influence money managers’ decisions. Nevertheless, Proposition 3.7 along with Figure 5 may provide some guidance about the choice of managers. We show that a trader with more past successes may have lower expected future profits than a trader with fewer successes. This is because the more successful trader, though objectively more likely to possess high ability, will not make maximum use of his ability due to his overconfidence. An investor choosing a money manager cannot usually observe 24 E[ ˆ | sˆ = s ] 0.45 0.40 0.35 1 0.30 2 0.25 5 0.20 0 2 4 6 8 10 Number of past successes (s) (a) Expected insider profits. 1.7 1.6 K (s) 1.5 1 1.4 1.3 2 1.2 5 1.1 1.0 0 2 4 6 8 10 Number of past successes (s) (b) Overconfidence. 4 1 2  E[t= ˆt | sˆ = s] 6 2 0 5 -2 0 2 4 6 8 10 Number of past successes (s) (c) Expected past profits. Figure 5: Expected insider profits in period 10 (a), expected overconfidence in that period (b), and expected past insider profits (c) as functions the insider’s successes in the first 10 periods. The figure was obtained with H = 0.9, L = 0.5, φ0 = 0.5, Σ = Ω = 1. Each line was obtained with a different value of γ shown in the legend. 25 that manager’s level of overconfidence. If the investor has personal contact with a manager he may try to assess that manager’s overconfidence through social cues, but when such cues are not available, our model suggests that a manager’s success record will be indicative of his overconfidence. Using a manager’s success record as a measure of his overconfidence creates a dilemma for the investor since the investor uses the same success record to assess the manager’s ability. While a trader would always prefer to have as good a success record as possible it is not clear that, when choosing a money manager, an investor will always prefer the one with the best past record. A very successful trader may be too overconfident and therefore trade too aggressively. The effects of overconfidence on trading are likely to be exacerbated when risk aversion and agency issues are introduced to the picture. An overconfident money manager may take risks with his client’s money which the client would not endorse. Investors can try to protect themselves from choosing the most overconfident managers by avoiding managers who are successful but underexperienced. They should also judge managers on their long term performance, rather than their most recent successes.25 In our model, all insiders, even those with low ability, earn positive expected profits from trading with liquidity traders. In real markets traders who have experienced repeated failures are likely to lose their jobs, their money, or their confidence, and quit trading. The traders who remain will be those with the greatest ability and the greatest overconfidence. This survivorship bias, like the selection bias mentioned in footnote 17, will make the overconfidence level of those active in the marketplace higher than that of the general population. This is in contrast to the results of the natural selection literature,26 which argue that overconfident and irrational traders will be driven out of financial markets over time. This does not happen here, since trading profits are what make insiders overconfident. We finish this section by looking at the volatility of prices conditional on the insider’s past performance. Expected volatility is not affected by the insider’s successes in the same way as are overconfidence and volume. Although expected overconfidence and expected trading volume can both be non-monotonic in the number of past insider successes, expected volatility is always increased by one more insider success. More precisely, large posteriors by the biased insider (µ̄t (s)) and the rational market maker (µt (s)) both contribute to more expected volatility: the former by 25 As discussed in section 3.2, while we can identify the factors that determine on average how much time it will take for a trader to achieve his maximum overconfidence, we are unable to say how long this time is for any class of traders such as money managers. Furthermore, since our model allows for only two ability levels, there is not much room for a trader who has a long and successful track record to overestimate his ability. That is, he thinks he is of ability type H, and he is probably right. In real markets, traders can always believe themselves to have more ability than they have. Thus even those with long histories of success may fall prey to hubris. 26 See, for example, Blume and Easley (1982, 1992), and Luo (1998). 26 his unwarranted aggressiveness, and the latter by his steeper price schedule.27 Proposition 3.8 At the end of period t, the conditional expected volatility in period t + 1 is increasing in the number of past successful predictions by the insider in the first t periods. Proof : See Appendix B. 3.5 Price Patterns Daniel, Hirshleifer and Subrahmanyam (1998) argue that a trader’s overconfidence in his private information can result in positive (negative) price autocorrelation in the short (long) run. In their model, overconfident traders initially overreact to their long-lived private information about a security’s payoff, but eventually realize their mistakes through the gradual public revelation of that payoff. In contrast, the information in our model is short-lived, in the sense that each signal obtained by the insider is advantageous to him for one and only one period. In other words, the insider can profit from each piece of information by trading only once, after which the information is instantaneously made public. As a result, consecutive market-clearing prices are independent in our model: Cov(p̂t , p̂t+1 ) = 0, for all t = 1, 2, . . . . Furthermore, this implies that the returns (i.e. the price changes) are spuriously negatively autocorrelated through a bid-ask bounce, an effect originally documented by Roll (1984): Cov(p̂t+1 − p̂t , p̂t+2 − p̂t+1 ) = −Var(p̂t+1 ). We know from the last two sections what effects the insider’s learning bias and past performance have on the variance of prices; these effects are thus simply reversed for the return autocovariances. We can still reconcile our results with those of Daniel, Hirshleifer and Subrahmanyam (1998), and gain some insight into the effects of the insider’s learning bias on the price process by looking at the expected evolution of prices for a given dividend size. In an economy where the insider’s ability is known ex ante (i.e. H = L = µ), the expected price in any period t given a subsequent 27 It should be noted that the increase in volatility is the result of the overconfident insider revealing, through aggressive trading, more of his signal to the market maker than is optimal. This allows the market maker to set prices closer to the actual forthcoming dividend, v̂t , and further from its unconditional mean. If, as here, the insider’s information is revealed in the next period, this increase in volatility is short-lived and will not even be detected if prices are measured only in the periods following public revelations. 27 announcement of v̂t = v is constant at E(p̂t |v̂t = v) = vµ2 2 . As we next show, this is not true when the insider learns about his own ability through time (when H > L), in which case E(p̂t |v̂t = v) =  t−1   v X t−1  s H (1 − H)t−1−s φ0 + Ls (1 − L)t−1−s (1 − φ0 ) µ̄t−1 (s)µt−1 (s). 2 s s=0 First, when the insider learns his ability without a bias (γ = 1), the expected price in period t for a given subsequent positive dividend28 will be increasing (decreasing) when his ability turns out to be high (low). This is not surprising since the insider learns that he should use his information more (less) aggressively over time. Unconditionally, the expected price starts at vµ2 2 and increases to its long-run value. These effects are illustrated by the dotted lines in Figure 6 for the same set of parameters as in previous figures. When the insider learns with a bias, the expected price for a given dividend is always higher than when he learns without the bias. Moreover, it is still the case that the expected price increases to the correct/unbiased value when the insider’s ability turns out to be high. However, when the insider’s ability is low, convergence to the unbiased long-run expected price only occurs for reasonable levels of self-attribution bias (i.e. γ < γ ∗ ); however, if the insider is so biased as to refuse to acknowledge his low ability despite persistent poor performance (i.e. γ ≥ γ ∗ ), the expected price stays too high. Unconditionally, the expected price for a given dividend will therefore converge to the unbiased value for reasonable levels of bias (i.e., γ < γ ∗ ). All these effects are illustrated in Figure 6. Note that in Figure 6(c), even when expected prices converge to their unbiased long-run value, they have a humped-shape pattern similar to that of expected overconfidence in Figure 2, and similar to that found by Daniel, Hirshleifer and Subrahmanyam (1998). The overconfidence developed over time by the insider causes him to push prices too far in the short run but, in the long run, his learning drives prices back to their correct values. 4 Discussion Our model predicts that overconfident traders will increase their trading volume and thereby lower their expected profits. To the extent that trading is motivated by overconfidence, higher trading will correlate with lower profits. Barber and Odean (1998a) find that this is true for individual investors. 28 All the effects are reversed for a negative dividend. 28 ˆ E[ pˆt | vˆt=1,a=H ] 0.40 Legend g=1.0 0.35 g=1.5 g=25/9 0.30 g=5.0 0.25 10 20 30 40 50 Period t (a) Expected prices when â = H. ˆ ] E[ pˆt | vˆt=1,a=L 0.24 Legend 0.22 0.20 g=1.0 0.18 g=1.5 g=25/9 0.16 g=5.0 0.14 10 20 30 40 50 Period t (b) Expected prices when â = L. 0.30 Legend ˆE[ pˆt | vˆt=1] 0.29 g=1.0 0.28 g=1.5 0.27 g=25/9 0.26 g=5.0 0.25 10 20 30 40 50 Period t (c) Unconditional expected prices. Figure 6: Expected patterns of prices for a given dividend size of $1 per period (a) conditional on â = H; (b) conditional on â = L; (c) unconditionally. All three figures were obtained with H = 0.9, L = 0.5, φ0 = 0.5, and Σ = Ω = 1. Each line was drawn with a different γ, shown in the legends. Note that, with these parameter values, γ ∗ = 25/9 ≈ 2.78. 29 While this evidence supports our model, our model does more than simply posit that investors are overconfident. We also describe a dynamic by which overconfidence may wax and wane, both on an individual level and in the aggregate (though the latter is not modeled formally). In times when aggregate success is greater than usual, overconfidence will be higher. In our model, success is measured by how well a trader forecasts dividends. This formulation allows us to present closed form solutions. In many markets, returns will be a trader’s metric of success. Traders who attribute returns from general market increases to their own acumen will become overconfident and therefore trade more actively. Therefore, we would predict that periods of market increases will tend to be followed by periods of increased aggregate trading. Statman and Thorley (1998) find this is so for monthly horizons.29 Taking a longer view, overconfidence and its principal side effect, increased trading, are likely to rise late in a bull market and to fall late in a bear market. A bull market may also attract more investment capital, in part, because investors grow more confident in their personal investment abilities. This increase in investment capital could cause price pressures that sent market prices even higher.30 In our model, investors are most overconfident early in their careers. With more experience, selfassessment becomes more realistic and overconfidence more subdued. Barber and Odean (1998b) find that, after controlling for gender, marital status, children, and income, younger investors trade more actively than older investors while earning lower returns relative to a buy-and-hold portfolio. These results are consistent with our prediction that overconfidence diminishes with greater experience. A further testable empirical prediction of our model is that, on an individual level, investors who realize abnormally good returns in one period will, on average, trade more actively in the next period and in so doing lower their net returns. While the same prediction could be made for professional money managers, their trading decisions will also be influenced by agency considerations that may be difficult to disentangle from overconfidence. Finally, it is worth noting that if other behavioral biases affect asset prices, then overconfidence is likely to magnify those effects by giving investors the fortitude to act more aggressively on their biased impulses. 29 We were informed of Statman and Thorley’s results after writing this paper. Of course other factors can lead to similar results. Investors might upwardly revise their estimate of the market’s expected return during a bull market and therefore invest more. Or a bull market might benefit from demographically driven changes in aggregate savings. 30 30 5 Conclusion We go through life learning about ourselves as well as the world around us. We assess our own abilities not so much through introspection as by observing our successes and failures. Most of us tend to take too much credit for our own successes. This leads to overconfidence. It is in this way that overconfidence develops in our model. When a trader is successful, he attributes too much of his success to his own ability and revises his beliefs about his ability upward too much. In our model overconfidence is dynamic, changing with successes and failures. Average levels of overconfidence are greatest in those who have been trading for a short time. With more experience, people develop better self assessments. Since it is through success that traders become overconfident, successful traders, though not necessarily the most successful traders, are most overconfident. These traders are also, as a result of success, wealthy. Overconfidence does not make traders wealthy, but the process of becoming wealthy can make them overconfident. Thus overconfident traders can play an important long-term role in financial markets. As shown in our model, an overconfident trader trades too aggressively, and this increases expected trading volume. Volatility is increasing in a trader’s number of past successes (for a given number of periods). Both volume and volatility increase with the degree of a trader’s learning bias. Overconfident traders behave suboptimally, thereby lowering their expected profits. A more successful trader is likely to have more information gathering ability but he may not use his information well. Thus the expected future profits of a more successful trader may actually be lower than those of a less successful trader. Successful traders do tend to be good, but not as good as they think they are. The principal goal of this paper is to demonstrate that a simple and prevalent bias in evaluating one’s own performance is sufficient to create markets in which investors are, on average, overconfident. Unlike models such as De Long et al. (1990), in which biased traders survive by earning greater profits, our model describes a market in which overconfident traders realize, on average, lower profits. Though overconfidence does not lead to greater profits, greater profits do lead to overconfidence. A particular trader’s overconfidence will not flourish indefinitely; time and experience gradually rid him of it. However, in a market in which new traders are born every minute, overconfidence will flourish. 31 Appendix A Proof of Lemma 2.1 Assume that Pt (ω̂t , ŝt−1 ) = λt (ŝt−1 ) ω̂t . This means that the insider’s expected period t profits, when sending a market order of x̂t to the market maker, are given by n o Eb [π̂t | θ̂t , ŝt−1 , x̂t ] = Eb x̂t [v̂t − Pt (ω̂t , ŝt−1 )] θ̂t , ŝt−1 , x̂t o n = Eb x̂t [v̂t − λt (ŝt−1 )ω̂t ] θ̂t , ŝt−1 , x̂t n o = Eb x̂t [v̂t − λt (ŝt−1 )(x̂t + ẑt )] θ̂t , ŝt−1 , x̂t h i = x̂t Eb (v̂t | θ̂t , ŝt−1 ) − λt (ŝt−1 )x̂t , (28) where the last equality follows from the fact that ẑt is independent from both θ̂t and ŝt−1 , and has a mean of zero. Differentiating this last expression with respect to x̂t and setting the result equal to zero yields x̂t = Eb (v̂t | θ̂t , ŝt−1 ) . 2λt (ŝt−1 ) (29) Also, a simple use of iterated expectations and the projection theorem for normal variables31 shows that h i Eb (v̂t | θ̂t , ŝt−1 ) = Eb Eb (v̂t |θ̂t , ŝt−1 , δ̂t ) θ̂t , ŝt−1 h i = Eb δ̂t θ̂t + (1 − δ̂t ) · 0 θ̂t , ŝt−1 h i = Eb δ̂t | ŝt−1 θ̂t = Eb [â | ŝt−1 ] θ̂t = µ̄t−1 (ŝt−1 )θ̂t , 31 The projection theorem for normal variables is as follows: suppose that i i h h i h Σ11 Σ12 x̂ µx . , ∼ N µ Σ12 Σ22 ŷ y Then E[x̂|ŷ] = µx + Σ12 (ŷ Σ22 − µy ). 32 (30) where the third equality results from the fact that θ̂t , without v̂t does not contain any information about δ̂t (or, equivalently, about â).32 This yields (15). Next, Assume that Xt (θ̂t , ŝt−1 ) = βt (ŝt−1 ) θ̂t . As discussed in section 1, the market maker’s price is a function of the information he gathers from the order flow that is sent to him. More precisely, p̂t = E[v̂t | ω̂t , ŝt−1 ] i h = E E(v̂t | ω̂t , ŝt−1 , δ̂t ) ω̂t , ŝt−1 n o   = E δ̂t E v̂t ω̂t = βt (ŝt−1 )v̂t + ẑt , ŝt−1 + (1 − δ̂t ) · 0 ω̂t , ŝt . (31) Use of the projection theorem for normal variables shows that   E v̂t ω̂t = βt (ŝt−1 )v̂t + ẑt , ŝt−1 = βt (ŝt−1 )Σ ω̂t , βt2 (ŝt−1 )Σ + Ω so that we can rewrite (31) as p̂t  = E δ̂t βt (ŝt−1 )Σ ω̂t ω̂t , ŝt−1 2 βt (ŝt−1 )Σ + Ω βt (ŝt−1 )Σ = E[δ̂t | ŝt−1 ] 2 ω̂t βt (ŝt−1 )Σ + Ω βt (ŝt−1 )Σ = E[â | ŝt−1 ] 2 ω̂t βt (ŝt−1 )Σ + Ω µt−1 (ŝt−1 )βt (ŝt−1 )Σ = ω̂t , βt2 (ŝt−1 )Σ + Ω  (32) as in (16). Proof of Lemma 2.2 To see this, recall from (28)-(30) that the insider chooses x̂t so as to maximize his expected 32 Again, this is the case since both v̂t and ε̂t have the same distribution. 33 profits in period t, which can be written as h i Eb [π̂t | θ̂t , ŝt−1 , x̂t ] = x̂t Eb (v̂t | θ̂t , ŝt−1 ) − λt (ŝt−1 )x̂t h i = x̂t µ̄t−1 (ŝt−1 )θ̂t − λt (ŝt−1 )x̂t . (33) Assuming that the slope λt (ŝt−1 ) of the market maker’s linear price schedule is positive,33 we can maximize this expression with respect to x̂t to obtain the insider’s demand: x̂t = µ̄t−1 (ŝt−1 )θ̂t . 2λt (ŝt−1 ) (34) Of course, this demand is not the same as that of a rational but otherwise identical insider, who instead would be maximizing unbiased expected profits: h i E[π̂t | θ̂t , ŝt−1 , x̂t ] = x̂t µt−1 (ŝt−1 )θ̂t − λt (ŝt−1 )x̂t . (35) Since the maket maker is rational in this model, he knows that the (biased) insider’s correct expected profits are given by (35), using the suboptimal demand x̂t as calculated in (34): E[π̂t | θ̂t , ŝt−1 , x̂t ] = = " # µ̄t−1 (ŝt−1 )θ̂t µ̄t−1 (ŝt−1 )θ̂t µt−1 (ŝt−1 )θ̂t − λt (ŝt−1 ) 2λt (ŝt−1 ) 2λt (ŝt−1 ) i µ̄t−1 (ŝt−1 )θ̂t2 h 2µt−1 (ŝt−1 ) − µ̄t−1 (ŝt−1 ) . 4λt (ŝt−1 ) (36) Suppose first that the market maker quotes a price schedule with a positive slope. On average, he then expects to profit from the liquidity trader; in fact, his expected profits against the liquidity trader can be shown to be equal to λt (ŝt−1 )Ω. To perform his market clearing duties competitively, it must therefore be the case that the market maker loses that same amount to the insider on average, i.e. it must be the case that (36) is positive. However, when 2µt−1 (ŝt−1 ) < µ̄t−1 (ŝt−1 ), this is not the case, so that an equilibrium with a positively sloped price schedule is impossible. What happens if the market maker quotes a price schedule with a negative slope? From (33), 33 This is needed in order to satisfy the second-order condition. If the slope of the price schedule is negative, then the insider will want to trade an infinite number of shares. 34 we see that the insider’s problem degenerates, as he would then choose an infinite demand. This would not only make his biased expected profits infinite (and positive), but would also make his unbiased expected profits in (35) infinite (and negative). More than that, any negatively sloped price schedule implies that the market maker will also lose against the liquidity trader; his expected losses are given by −λt (ŝt−1 )Ω. It is therefore impossible for the market maker to perform his duties competitively with any negatively sloped price schedule. Proof of Lemma 2.3 The sufficiency part of the argument is obvious as H ≤ 2L implies that 2µt−1 (ŝt−1 ) ≥ 2L ≥ H ≥ µ̄t−1 (ŝt−1 ). To show necessity, we show that if 2L < H, then 2µt (s) < µ̄t (s) for some integers s and t such that 0 < s ≤ t, and some γ > 1. So, suppose that 2L < H. For any ǫ > 0, it is possible to find integers s and t such that 0 < s ≤ t and µt (s) ≤ L + ǫ. Since µ̄t (s) increases to H as γ increases to infinity, it is also possible to find γ > 1 such that µ̄t (s) ≥ H − ǫ. By choosing ǫ to be strictly smaller than H−2L 3 , we have 2µt (s) ≤ 2(L + ǫ) < H − ǫ ≤ µ̄t (s). This completes the proof. Proof of Lemma 2.4 By using (18) in (17) and rearranging, we obtain 2µt−1 (s)Σβt2 (s) = µ̄t−1 (s)Σβt2 (s) + µ̄t−1 (s)Ω, 35 which is quadratic in βt (s). As long as 2µt−1 (s) ≥ µ̄t−1 (s), we can solve for βt (s) and obtain (19a), as desired.34 Also, as argued above, a necessary and sufficient condition for this inequality to be satisfied for any integers s and t such that 0 ≤ s < t and any γ > 1 is that H ≤ 2L. Finally, using (19a) for βt (s) in (18) yields (19b). 34 The other root is rejected, since it represents a minimum, not a maximum. 36 Appendix B Proof of Lemma 3.1 As discussed in the paragraph preceding the lemma, all we need to show is that  L γH H  1−L 1−H 1−H < 1. By taking log’s of both sides and rearranging, this can be shown to be equivalent to showing that fγ,L (H) ≡ (1 − H) log(1 − L) − (1 − H) log(1 − H) − H log γ − H log H + H log L < 0 (37) ′ (H) < 0 for for all H ∈ (L, 1]. First, note that fγ,L (L) = −L log γ ≤ 0. So, if we can show that fγ,L all H ∈ (L, 1], we will have the desired result. Indeed, ′ (H) = − log(1 − L) + log(1 − H) − log γ − log H + log L fγ,L   H(1 − L) < 0, = − log γ − log L(1 − H) since γ ≥ 1 and H(1−L) L(1−H) > 1. Proof of Lemma 3.2 As t → ∞, since â = L, we expect the insider to correctly guess the one-period dividend a fraction L of the times. So, as we play the game more and more often (t tends to ∞), we expect his updated posteriors φ̄t (s) to behave like 1 1+  L γH Lt  1−L 1−H t−Lt 1−φ0 φ0 1 = 1+  L γH L  1−L 1−H 1−L t . 1−φ0 φ0 So φ̄t (s) will converge to 0, φ0 , or 1 according to whether the expression in square brackets is 37 greater than, equal to, or smaller than 1. By taking log’s, this is equivalent to finding whether gγ,H (L) ≡ (1 − L) log(1 − L) − (1 − L) log(1 − H) − L log γ − L log H + L log L is greater than, equal to, or smaller than 0. Let us first check that g1,H (L) > 0 for all L ∈ [0, H), ′ (L) < 0 for all L ∈ [0, H). The first part is which we do by verifying that g1,H (H) = 0, and g1,H easily verified, and ′ (L) = − log(1 − L) + log(1 − H) − log H + log L g1,H   H(1 − L) < 0, = − log L(1 − H) since H(1 − L) > L(1 − H). Now, observe that ∂ ∂γ gγ,H (L) = − Lγ < 0, so that g1,H (L) > gγ,H (L) for all γ ∈ (1, ∞) and limγ→∞ gγ,H (L) = −∞. Since g1,H (L) > 0, this means that there will always be a value γ ∗ such that  ∗    > 0, if γ < γ gγ,H (L) = 0, if γ = γ ∗    < 0, if γ > γ ∗ . This value γ ∗ solves gγ ∗ ,H (L) = 0, and it is easily shown to be given by L γ = H ∗  1−L 1−H (1−L)/L . This completes the proof. Proof of Proposition 3.1 Since the denominator of Kt (s) in (20) is not a function of γ, ∂ µ̄t (s) ∂γ . We show in Result C.2 of Appendix C that ∂ µ̄t (s) ∂γ ∂Kt (s) ∂γ will have the same sign as > 0. Proof of Proposition 3.2 In our model, the number of successes in the first t periods is obviously an integer in {0, 1, . . . , t}, 38 t(s) 1.0 Kt(s)=K1=1 { t(s), t(s)}ts=0 0.8 Kt(s)=K2 0.6 Kt(s)=K3 0.4 Kt(s)=K4 Kt(s)=K5 0.2 Kt(s)=K6 0.2 0.4 0.6 0.8 1.0 t(s) Figure 7: This figure shows φt (s) as a function of φ̄t (s). For any s ∈ [0, t], we have φ̄t (s) ≥ φt (s), so that all the points {φ̄t (s), φt (s)}ts=0 must lie in the grey area. The thin solid lines represent the “iso-confidence” curves Kt (s) = Ki , i = 1, . . . , 6 for 1 = K1 < K2 < · · · < K6 . The thick solid line represents the parametric curve {φ̄t (s), φt (s)}ts=0 , where φt (s) and φ̄t (s) are given in (5) and (8) respectively. but the function Kt (s) is well defined for any s ∈ [0, t]. We first show that this function is increasing for s ∈ [0, s0 ] and decreasing for s ∈ [s0 , t] for some s0 ∈ [0, t]. To show this, recall that Kt (s) = L + (H − L)φ̄t (s) µ̄t (s) = . µt (s) L + (K − L)φt (s) If we define an “iso-confidence” curve by Kt (s) = Ki for some constant Ki ≥ 1, each of these curves can then be written as a straight line in a φ̄t (s)-φt (s) diagram. More precisely, each iso-confidence curve can be expressed as   (Ki − 1)L 1 φ̄t (s) − . φt (s) = Ki H −L These lines are shown as thin solid lines for 1 = K1 < K2 < · · · < K6 in Figure 7. From Result C.1 in Appendix C, we know that the parametric curve {φ̄t (s), φt (s)}ts=0 starts at (0, 0) and is increasing. This curve is shown as a thick solid line in Figure 7. Since the iso-confidence 39 curves are linear in this φ̄t (s)-φt (s) diagram, we only need to show that φt (s) in first concave and then convex as a function of φ̄t (s). Indeed, it will then be the case that each iso-confidence curve is crossed at most twice or, equivalently, that Kt (s) is increasing and then decreasing as a function of s. To show this, we use Result C.1 in Appendix C to obtain   H 1−L φ (s)[1 − φ (s)] log t t L 1−H ∂φt (s)/∂s ∂φt (s) ,  = = γH 1−L ∂ φ̄t (s) ∂ φ̄t (s)/∂s φ̄t (s)[1 − φ̄t (s)] log L 1−H (38) and ∂ ∂s  ∂φt (s) = (39) ∂ φ̄t (s)   1−L      log H L 1−H φ (s)[1 − φt (s)] γH 1 − L H 1−L  t  [1 − 2φt (s)] log − [1 − 2φ̄t (s)] log . 1−L φ̄t (s)[1 − φ̄t (s)] L 1−H L 1−H log γH L 1−H  Using standard calculus results along with (39) and Result C.1 in Appendix C, we have ∂ 2 φt (s) ∂ φ̄t (s)2  = ∂ ∂s  ∂φt (s) ∂ φ̄t (s)  ∂ φ̄t (s)/∂s φt (s)[1 − φt (s)] φ̄t (s)[1 − φ̄t (s)] = 2 [1 − 2φt (s)] log  H 1−L L 1−H log  − [1 − 2φ̄t (s)] log   H 1−L L 1−H  γH 1−L L 1−H  . This last expression will always have the same sign as  D φt (s), φ̄t (s) ≡ [1 − 2φt (s)] log  H 1−L L 1−H  − [1 − 2φ̄t (s)] log  γH 1 − L L 1−H  . (40) Since γ ≥ 1, (40) is negative for (φ̄t (s), φt (s)) = (0, 0) and, since φ̄t (s) ≥ φt (s), it is positive for  φ̄t (s) ≥ 1/2. Therefore, if we can show that D φt (s), φ̄t (s) is increasing in φ̄t (s) for φ̄t (s) ∈ [0, 1/2], 40 we will have the desired result. Using (38), we have  dD φt (s), φ̄t (s) dφ̄t (s)   ∂D φt (s), φ̄t (s) ∂φt (s) ∂D φt (s), φ̄t (s) = + ∂φt (s) ∂ φ̄t (s) ∂ φ̄t (s)     φ (s)[1 − φ (s)] log H 1−L   t t L 1−H H 1−L γH 1 − L  + 2 log  = −2 log . L 1 − H φ̄ (s)[1 − φ̄ (s)] log γH 1−L L 1−H t t L 1−H This last expression is nonnegative if and only if φ̄t (s)[1 − φ̄t (s)] log 2  γH 1 − L L 1−H  ≥ φt (s)[1 − φt (s)] log 2  H 1−L L 1−H  . Since γ > 1, this is definitely true for 0 ≤ φt (s) ≤ φ̄t (s) ≤ 1/2. This shows that Kt (s) is first increasing and then decreasing as a function of s. To complete the proof, we must deal with the fact that, in our problem, we only care about Kt (s) for s ∈ {0, 1, . . . , t}. However, the result is immediate from the shape of Kt (s). Proof of Lemma 3.3 First, a standard result for normal variables is that, if ŷ ∼ N(0, σ 2 ), then E |ŷ| = r 2σ 2 . π We can therefore write h i E ψ̂t+1 ŝt = s = = = = = i 1 h E |x̂t+1 | + |ẑt+1 | ŝt = s 2 i 1 r 2Ω 1 h E |x̂t+1 | ŝt = s + 2 2 π r h i 1 Ω βt+1 (s) E |θ̂t+1 | ŝt = s + 2 2π r r 1 2Σ Ω βt+1 (s) + 2 π 2π √ i √ 1 h √ βt+1 (s) Σ + Ω , 2π 41 and this last expression is equal to (24). The expression for expected profits is derived as follows: E [π̂t+1 | ŝt = s] = E [x̂t+1 (v̂t+1 − p̂t+1 ) | ŝt = s] n h  io  ŝt = s = E βt+1 (ŝt )θ̂t+1 v̂t+1 − λt+1 (ŝt ) βt+1 (ŝt )θ̂t+1 + ẑt+1 h  n h i  io = E E βt+1 (ŝt )θ̂t+1 v̂t+1 − λt+1 (ŝt ) βt+1 (ŝt )θ̂t+1 + ẑt+1 ŝt = s θ̂t+1 , ŝt = s n h o i = E βt+1 (s)θ̂t+1 µt (s)θ̂t+1 − λt+1 (s)βt+1 (s)θ̂t+1 ŝt = s  h i  2 = βt+1 (s) µt (s) − λt+1 (s)βt+1 (s) E θ̂t+1 ŝt = s h i = βt+1 (s) µt (s) − λt+1 (s)βt+1 (s) Σ. Finally, using the expressions derived for βt (s) and λt (s) in Lemma 2.4, it is easy to show that 1 βt+1 (s) [µt (s) − λt+1 (s)βt+1 (s)] = 2 r Ω µ̄t (s) [2µt (s) − µ̄t (s)], Σ so that E [π̂t+1 | ŝt = s] = p 1√ ΣΩ µ̄t (s) [2µt (s) − µ̄t (s)], 2 as in (25). In this economy, the security price in a period reflects only the dividend paid at the end of that period. Since the dividend’s unconditional mean is zero, the price’s unconditional mean is also zero. Therefore, Var(p̂t+1 | ŝt = s)  = E p̂2t+1 | ŝt = s   2 ŝt = s = E λ2t+1 (ŝt ) ω̂t+1   = E λ2t+1 (ŝt )(x̂t+1 + ẑt+1 )2 ŝt = s   h i2 2 = E λt+1 (ŝt ) βt+1 (ŝt )θ̂t+1 + ẑt+1 ŝt = s h i 2 2 2 = λ2t+1 (s) βt+1 (s)E(θ̂t+1 | ŝt = s) + 2βt+1 (s)E(θ̂t+1 ẑt+1 | ŝt = s) + E(ẑt+1 | ŝt = s)  2  (s)Σ + Ω . = λ2t+1 (s) βt+1 42 Now, using the expressions derived for βt (s) and λt (s) in Lemma 2.4, it is easy to show that  2  Σ (s)Σ + Ω = µ̄t (s)µt (s), λ2t+1 (s) βt+1 2 so that Var (p̂t+1 | ŝt = s) = Σ µ̄t (s) µt (s), 2 as in (26). Proof of Proposition 3.3 Given the expression for the conditional expected volume in (24), it is sufficient to prove that ∂βt (s) > 0. ∂γ Straighforward differentiation of the expression for βt (s) in equation (19a) of Lemma 2.4 results in ∂βt (s) = ∂γ s ∂ µ̄t−1 (s) Ω 2µt−1 (s) − µ̄t−1 (s) µt−1 (s) , 2 Σ µ̄t−1 (s) ∂γ [2µt−1 (s) − µ̄t−1 (s)] which in turn shows that it is sufficient to show that ∂ µ̄t−1 (s) > 0. ∂γ This is shown to be true in Result C.2 of Appendix C. Proof of Proposition 3.4 To show the desired result, we only need to show that µ̄t−1 (s) [2µt−1 (s) − µ̄t−1 (s)] is decreasing in γ. This is straightforward to show since ∂ ∂ {µ̄t−1 (s) [2µt−1 (s) − µ̄t−1 (s)]} = −2 [µ̄t−1 (s) − µt−1 (s)] µ̄t−1 (s), ∂γ ∂γ 43 and ∂ ∂γ µ̄t−1 (s) is shown to be positive in Result C.2 of Appendix C. Proof of Proposition 3.5 The result easily follows from the fact that ∂ ∂γ µ̄t−1 (s) > 0, which is shown to be true in Result C.2 of Appendix C. Proof of Proposition 3.6 As shown in Lemma 3.3, the expected volume in period t + 1 is proportional to the expected trading intensity βt+1 (s) of the insider in that period. Since βt (s) is monotonically increasing in Kt (s) (see equation (27)), the result of Proposition 3.2 immediately implies this result. Proof of Proposition 3.7 This result is shown in essentially the same way that Proposition 3.2 was shown earlier, except that the “iso-profit” curves are now quadratic. Proof of Proposition 3.8 In view of (26), this amounts to showing that the product µ̄t (s)µt (s) is increasing in s. However, since both these quantities are increasing in s (see Result C.2 in Appendix C), the result follows easily. 44 Appendix C This appendix contains a few results that are used in the proofs of some propositions in section 3. Result C.1 The partial derivatives of φ̄t (s) in (8) with respect to γ and s are respectively equal to   s ∂ φ̄t (s) = φ̄t (s) 1 − φ̄t (s) ≥ 0, ∂γ γ (41) and   ∂ φ̄t (s) = φ̄t (s) 1 − φ̄t (s) log ∂s  γH 1 − L L 1−H  ≥ 0. (42) Proof : Staightforward differentiation of φ̄t (s) yields ∂ φ̄t (s) ∂γ = = =    sγ s−1 H s (1 − H)t−s φ0 (γH)s (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) −  s t−s s−1 s t−s (γH) (1 − H) φ0 sγ H (1 − H) φ0  2 ÷ (γH)s (1 − H)t−s φ0 + Ls (1 − L)t−s (1 − φ0 ) s s t−s φ Ls (1 − L)t−s (1 − φ ) 0 0 γ (γH) (1 − H) s t−s s t−s [(γH) (1 − H) φ0 + L (1 − L) (1 − φ0 )]2   s φ̄t (s) 1 − φ̄t (s) , γ which is obviously greater than or equal to zero. Now, since we can write φ̄t (s) = 1+  L 1−H γH 1−L 45 1 s  1−L 1−H t 1−φ0 φ0 , we have ∂ φ̄t (s) ∂s (−1) =  L 1−H γH 1−L  1+  s log L 1−H γH 1−L   = φ̄t (s) 1 − φ̄t (s) log  L 1−H γH 1−L  γH 1 − L L 1−H s  1−L 1−H  t 1−L 1−H 1−φ0 φ0  t 1−φ0 φ0 2 . Since γH > L and 1 − L > 1 − H, this last quantity is obviously greater than or equal to zero. Result C.2 The partial derivatives of µ̄t (s) in (9) with respect to γ and s are respectively equal to   ∂ φ̄t (s) s ∂ µ̄t (s) = (H − L) = (H − L) φ̄t (s) 1 − φ̄t (s) ≥ 0, ∂γ ∂γ γ (43) and   ∂ φ̄t (s) ∂ µ̄t (s) = (H − L) = (H − L)φ̄t (s) 1 − φ̄t (s) log ∂s ∂s  γH 1 − L L 1−H Proof : Since we have   µ̄t (s) = H φ̄t (s) + L 1 − φ̄t (s) = L + (H − L)φ̄t (s) and H > L, this result follows immediately from Result C.1 above. 46  ≥ 0. (44) Appendix D In this appendix, we present a few alternative specifications for the model. These specifications show that the results of section 3 are robust to different trading crowds and market clearing mechanisms. The reason why the results of the model are unaffected by these changes is because the learning by the insider(s) is independent from the trading process. More precisely, an insider’s strategy in each period depends directly on who he trades with and on the market clearing mechanism. However, once the end-of-period dividend is announced, this insider updates his beliefs based on that dividend, not based on that period’s trading. In the interest of space, the formal derivations of these alternative models are not included here; however, the derivations are available from the authors upon request. D.1 Hedgers The model of section 1 assumes that some traders will trade randomly for purely exogenous reasons. Of course, since the market maker in that model makes zero profits on average, these liquidity traders fuel the insider’s profits. In this section, we replace these liquidity traders by rational traders who have a hedging motive for trading. The specification we adopt is similar to that of Glosten (1989). Suppose that, at the beginning of every period t, a trader (whom we will refer to as the hedger ) receives a random endowment ẑt of the risky security. Suppose also that this trader has negative exponential utility U (πt ) = −e−Rπt with respect to his profits π̂th in period t. It can be shown that, if (v̂t , ε̂t , ẑt )′ is distributed as in (13), the demand of this hedger in period t will be proportional to ẑt .35 The rest of the model is solved as before, that is the market maker absorbs the total order flow coming from both the insider and the hedger.36 It can be shown that the equilibrium will be achieved as long as the hedger’s risk aversion coefficient R and the variances Σ and Ω are large enough. Of course, the insider’s updating is exactly the same as in our original model, and all the results of section 3 obtain. 35 Of course, given that this hedger is rational, the proportionality constant is decreasing in the slope of the market maker’s price schedule. 36 In fact, in this model, we can even consider a monopolist market maker who seeks to maximize his expected profits, as in Glosten (1989). This extra layer of generality obtains from the elasticity of the hedger’s demand, as opposed to the liquidity trader’s inelastic demand. 47 D.2 A Rational Expectations Economy The presence of the market maker is also not essential to our model. Instead, the market clearing price in each period could be obtained from a rational expectations equilibrium similar to that of Grossman (1976), and Grossman and Stiglitz (1980). Suppose that the economy consists of two risk-averse traders with negative exponential utility function U (πt ) = −e−Rπt with respect to their period t profits π̂t1 and π̂t2 . Suppose that the first trader, the insider, is endowed with the private information and learning bias described in section 1, but that the second trader has no private information. Of course, this second trader can infer some of the insider’s information through the market clearing price. As in Grossman and Stiglitz (1980), we assume that the risky asset supply in period t is a random variable ẑt . If (v̂t , ε̂t , ẑt )′ is distributed as in (13), it can be shown that the market clearing price in period t for this economy is given by p̂t = at (ŝt−1 )θ̂t − bt (ŝt−1 )ẑt , for some positive functions at (·) and bt (·). Moreover, the demand x̂t for the risky asset by the insider is linear in the signal θ̂t and the price p̂t , and the demand ŷt for the risky asset by the uninformed trader is proportional to p̂t : x̂t = βt (ŝt−1 )θ̂t − αt (ŝt−1 )p̂t ; ŷt = −ηt (ŝt−1 )p̂t , where βt (·), αt (·), and ηt (·) are positive functions. As in Grossman (1976), these traders do not trade when they are both rational, that is as long as the insider remains unsuccessful. However, as discussed in section 3.2, the insider’s first successful prediction makes him overconfident for the rest of his existence. Starting then, the two traders will start trading nonzero quantities of the risky asset. In fact, it can even be shown that they would do so without a noisy supply of the risky asset, as they agree to disagree. Also, since the insider’s updating is unaffected by this alternative market-clearing mechanism, all the results from section 3 continue to hold. 48 D.3 Multiple Insiders A potential criticism of the paper’s main model is the fact that only one trader possesses information about the risky security’s dividend process. For example, our results in sections 3.1 and 3.2 show that a (moderately) biased trader will on average be overconfident early in his career, and well calibrated later on. Can we conclude that an economy’s overconfident traders will consist mainly of young traders? Although it is tempting to answer this question in the affirmative from the results derived so far, our one-insider model only addresses this issue indirectly. In this section, we construct a multi-insider version of the model, which will allow us to tackle this question more directly. Suppose that a new insider is born every period t with an ability ât , and lives for T periods. During period t, these insiders are indexed by their age τ = 0, . . . , T − 1 at the beginning of the period. Suppose also that every period consists of N trading rounds (indexed by n), each of which coincides with a dividend payment by the risky security. As in our main model, we assume that a liquidity trader participates in each such trading round, and that orders are cleared through a competitive market maker. This means that, in every period, T insiders, a liquidity trader and the market maker trade a risky security N times. To simplify the analysis without affecting the results, we assume that at most one of the insiders receives valuable information in each trading round. This trader is denoted by ̂tn : ̂tn | (ât , ât−1 , . . . , ât−T +1 ) =            0, prob. 1, .. . prob. ât T ât−1 T    â +1   T − 1, prob. t−T  T   P −1   T, ât−τ . prob. 1 − T1 τT=0 τ received by each trader is given by The signal θ̂tn  τ θ̂tn = 1{̂tn =τ } v̂tn + 1 − 1{̂tn =τ } ε̂τtn , where v̂tn is the risky security’s dividend at the end of the trading round, and ε̂τtn , τ = 0, 1, . . . , T −1, are pure noises. The rest of the model is similar to our main one-insider model in that, at the beginning of every trading round n in period t, the T insiders and the liquidity trader send their 49 demands x̂τtn , τ = 0, 1, . . . , T − 1, and ẑtn for the risky security, and the market maker absorbs the P −1 τ x̂tn + ẑtn at a competitive price order flow ω̂tn = τT=0 h i T −1 p̂tn = E v̂tn ω̂tn , ŝ0t,n−1 , ŝ1t,n−1 , . . . , ŝt,n−1 , −1 where ŝτt,n−1 denotes the number of past successful predictions for insider τ (with ŝτt,0 = ŝτt−1,N ). T −1 all have independent normal distribution with a mean If we assume that v̂tn , ε̂0tn , ε̂1tn , . . . , ε̂tn of zero and a variance of Σ, and that ẑtn is independently normally distributed with a mean of zero and a variance of Ω, a linear equilibrium exists in each trading round. This linear equilibrium consists in every insider sending a demand proportional to their signal,   T −1 τ τ ŝ0t,n−1 , ŝ1t,n−1 , . . . , ŝt,n−1 θ̂tn , x̂τtn = βtn and the market maker quoting a price schedule proportional to the order flow,   T −1 ω̂tn . p̂tn = λtn ŝ0t,n−1 , ŝ1t,n−1 , . . . , ŝt,n−1 Unfortunately, closed form solutions are not easily obtainable for this multi-insider model. As a   τ s0 , s1 , . . . , sT −1 , τ = 0, . . . , T − result, we generally have to solve for λtn s0 , s1 , . . . , sT −1 and βtn 1, numerically. Such numerical solutions reveal that all the properties about overconfidence, trading volume, insider profits, and price volatility still hold. In addition to these results however, we can now tackle the issue of overconfidence as a function of age. This is shown in Figure 8 for a set of parameters similar to that used in Figure 2. This latter figure shows that an insider’s overconfidence on average goes up and then down during his lifetime. Figure 8 shows that, in a particular period t, it is the case that the most overconfident traders are relatively young, and that older traders are better calibrated. 50 Expected Overconfidence E[ kˆtt] 1.07 1.06 1.05 1.04 1.03 1.02 1.01 2 4 6 8 10 Trader Age t Figure 8: Expected insider overconfidence as a function of the insider’s age. In obtaining this figure, every insider’s life is set to 10 periods (T = 10), and each period consists of 50 trading rounds (N = 50). Also, we use the following parameters for all insiders: H = 0.9, L = 0.5, φ0 = 0.5, and γ = 1.5. Finally, we use variances of Σ = Ω = 1. 51 References Alchian, A. A., 1950, “Uncertainty, Evolution, and Economic Theory,” Journal of Political Economy, 58, 211-221. Alpert, M., and H. Raiffa, 1982, “A Progress Report on the Training of Probability Assessors,” in D. Kahneman, P. Slovic, and A. Tversky (eds.), Judgment Under Uncertainty: Heuristics and Biases, Cambridge University Press, Cambridge and New York, 294-305. Anderson, N., 1959, “Test of a Model for Opinion Change,” Journal of Abnormal and Social Psychology, 59, 371-381. Anderson, N., 1981, Foundations of Information Integration Theory, Academic Press, New York. Barber, B., and T. Odean, 1998a, “The Common Stock Investment Performance of Individual Investors,” working paper, University of California, Davis. Barber, B., and T. Odean, 1998b, “Boys will be Boys: Gender, Overconfidence, and Common Stock Investment,” working paper, University of California, Davis. Benos, A., 1998, “Aggressiveness and Survival of Overconfident Traders,” Journal of Financial Markets, 1, 353-383. Bhattacharya, U., and M. Spiegel, 1991, “Insiders, Outsiders, and Market Breakdowns,” Review of Financial Studies, 4, 255-282. Blume, L., and D. Easley, 1982, “Learning To Be Rational,” Journal of Economic Theory, 26, 340-351. Blume, L., and D. Easley, 1992, “Evolution and Market Behavior,” Journal of Economic Theory, 58, 9-40. Blume, L., D. Easley, and M. O’Hara, 1994, “Market Statistics and Technical Analysis: The Role of Volume,” Journal of Finance, 49, 153-181. Bossaerts, P., 1999, “The Dynamics of Limited-Liability Securities Prices in Fallible Markets,” working paper, California Institute of Technology. Daniel, K., D. Hirshleifer, and A. Subrahmanyam, 1998, “Investor Psychology and Security Market Under- and Overreactions,” Journal of Finance, 53, 1839-1885. 52 De Finetti, B., 1937, “La prévision: ses lois logiques, ses sources subjectives,” Annals of the Institut Henri Poincaré, 7, 1-68. De Long, J., A. Shleifer, L. Summers, and R. Waldmann, 1990, “Noise Trader Risk in Financial Markets,” Journal of Political Economy, 98, 703-738. Fischhoff, B., 1982, “For Those Condemned to Study the Past: Heuristics and Biases in Hindsight.” In D. Kahneman, P. Slovic, and A. Tversky (eds.), Judgment Under Uncertainty: Heuristics and Biases, Cambridge University Press, Cambridge and New York, 335-351. Fiske, S., and S. Taylor, 1991, Social Cognition (2nd ed.), McGraw-Hill, New York. Friedman, M., 1953, Essays in Positive Economics, University of Chicago Press, Chicago. Gervais, S., 1996, “Market Microstructure with Uncertain Information Precision: A Multi-Period Analysis,” working paper, Wharton School, University of Pennsylvania. Glosten, L., 1989, “Insider Trading, Liquidity, and the Role of the Monopolist Specialist,” Journal of Business, 62, 211-235. Grossman, S. J., 1976, “On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information,” Journal of Finance, 31, 573-585. Grossman, S. J., and J. E. Stiglitz, 1980, “On the Impossibility of Informationally Efficient Markets,” American Economic Review, 70, 393-408. Hastorf, A. H., D. J. Schneider, and J. Polefka, 1970, Person Perception, Addison-Wesley, Reading, Massachusetts. Hogarth, R., and H. Einhorn, 1992, “Order Effects in Belief Updating: The Belief-Adjustment Model,” Cognitive Psychology, 24, 1-55. Kyle, A., 1985, “Continuous Auctions and Insider Trading,” Econometrica, 53, 1315-1335. Kyle, A., and F. A. Wang, 1997, “Speculation Duopoly With Agreement to Disagree: Can Overconfidence Survive the Market Test?,” Journal of Finance, 52, 2073-2090. Langer, E., and J. Roth, 1975, “Heads I Win, Tails It’s Chance: The Illusion of Control as a Function of the Sequence of Outcomes in a Purely Chance Task,” Journal of Personality and Social Psychology, 32, 951-955. 53 Lichtenstein, S., B. Fischhoff, and L. Phillips, 1982, “Calibration of Probabilities: The State of the Art to 1980,” in D. Kahneman, P. Slovic, and A. Tversky (eds.), Judgment Under Uncertainty: Heuristics and Biases, Cambridge University Press, Cambridge and New York, 306-334. Luo, G. Y., 1998, “Market Efficiency and Natural Selection in a Commodity Futures Market,” Review of Financial Studies, 11, 647-674. Milgrom, P., and N. Stokey, 1982, “Information, Trade and Common Knowledge,” Journal of Economic Theory, 26, 17-27. Miller, D., and M. Ross, 1975, “Self-serving Biases in Attribution of Causality: Fact or Fiction?,” Psychological Bulletin, 82, 213-225. Odean, T., 1998a, “Volume, Volatility, Price, and Profit When All Traders Are Above Average,” Journal of Finance, 53, 1887-1934. Roll, R., 1984, “A Simple Implicit Measure of the Effective Bid-Ask Spread in an Efficient Market,” Journal of Finance, 39, 1127-1139. Statman, M., and S. Thorley, 1998, “Overconfidence and Trading Volume,” working paper, Santa Clara University. Subramanyam, K. R., 1996, “Uncertain Precision and Price Reactions to Information,” The Accounting Review, 71, 207-220. Wang, A., 1997, “Overconfidence, Delegated Fund Management, and Survival,” working paper, Columbia University. Wolosin, R. J., S. J. Sherman, and A. Till, 1973, “Effects of Cooperation and Competition on Responsibility Attribution After Success and Failure,” Journal of Experimental Social Psychology, 9, 220-235. 54