Academia.eduAcademia.edu

Multifaceted aspects of agency relationships

1984

Agency theory has been used to examine the problem of stewardship of an agent who makes decisions on behalf of a principal who cannot observe the agent's actual effort. Effort i s assumed to be personally costly to expend. Therefore, i f an agent acts in his or her own interests, there may be a "moral hazard" problem, in which the agent exerts less effort than agreed upon. This dissertation examines this agency problem when the agent's effort i s multidimensional, such as when the agent controls several production processes or manages several divisions of a firm. The optimal compensation schemes derived suggest that the widely advocated salary-plus-commission scheme may not be optimal. Furthermore, the information from a l l tasks should generally be combined in a nonlinear fashion rather than used separately i n compensating a manager of several divisions, even i f the monetary outcomes are s t a t i s t i c a l l y independent. In situations where effort i s best...

MULTIFACETED ASPECTS OF AGENCY RELATIONSHIPS by ELLA MAE MATSUMURA A.B., The University of C a l i f o r n i a , Berkeley, 1974 M.Sc, The University of B r i t i s h Columbia, 1976 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY in THE FACULTY OF GRADUATE STUDIES Faculty of Commerce and Business Administration We accept t h i s thesis as conforming to the required standard THE UNIVERSITY OF BRITISH COLUMBIA September 1984 © E l l a Mae Matsumura, 1984 In p r e s e n t i n g this requirements f o r an of British it freely available understood that financial shall for reference and study. I for extensive be her copying of g r a n t e d by shall not the be of make further this thesis head o f representatives. copying or p u b l i c a t i o n the University Library h i s or gain the the s c h o l a r l y p u r p o s e s may by f u l f i l m e n t of I agree that permission department or for in partial advanced degree a t Columbia, agree that for thesis this my It is thesis a l l o w e d w i t h o u t my written permission. Department o f Co mme-ree. ^ The U n i v e r s i t y o f B r i t i s h 1956 Main Mall V a n c o u v e r , Canada V6T 1Y3 E-6 (3/81) &us/'ri£SS Columbia Aden,nisir~cut/or. ii ABSTRACT Agency theory has been used to examine the problem of stewardship of an agent who makes decisions on behalf of a p r i n c i p a l who cannot observe the agent's actual e f f o r t . E f f o r t i s assumed to be personally c o s t l y to expend. Therefore, i f an agent acts i n his or her own i n t e r e s t s , there may be a "moral hazard" problem, i n which the agent exerts less e f f o r t than agreed upon. This d i s s e r t a t i o n examines t h i s agency problem when the agent's e f f o r t i s multidimensional, such as when the agent controls several product i o n processes or manages several d i v i s i o n s of a firm. The optimal compen- sation schemes derived suggest that the widely advocated salary-plus-commission scheme may not be optimal. Furthermore, the information from a l l tasks should generally be combined i n a nonlinear fashion rather than used separ a t e l y i n compensating a manager of several d i v i s i o n s , even i f the monetary outcomes are s t a t i s t i c a l l y independent. In situations where e f f o r t i s best interpreted as time, e f f o r t can be viewed as being additive. The analysis i n this s p e c i a l case shows that the nature of the outcome d i s t r i b u t i o n , including the e f f e c t of e f f o r t on the mean of the d i s t r i b u t i o n , i s c r i t i c a l i n determining whether i t i s optimal for the p r i n c i p a l to induce the agent to d i v e r s i f y e f f o r t across tasks. These new r e s u l t s and the already existing agency theory results are applied to the sales force management problem, i n which the firm wishes to motivate a salesperson to optimally allocate time spent s e l l i n g the firm's various products. The agency model i s also expanded to allow f o r the agent's observation of the f i r s t outcome (which i s influenced by the agent's f i r s t e f f o r t ) before choosing the second e f f o r t l e v e l . The optimal compensation schemes iii both i n the absence of and the presence of a moral hazard problem are derived. The behavior of the second e f f o r t strategy i s also examined. It i s shown that the behavior of the agent's second e f f o r t strategy depends on the interaction between wealth and information e f f e c t s of the f i r s t outcome. Results s i m i l a r to those i n the multidimensional e f f o r t case are obtained for the question of optimality of d i v e r s i f i c a t i o n of e f f o r t when e f f o r t i s additive. iv TABLE OF CONTENTS Page ABSTRACT i i LIST OF TABLES vi ACKNOWLEDGEMENTS CHAPTER 1. vii INTRODUCTION 1 CHAPTER 2. NOTATION AND FORMULATION 5 Chapter 2 Footnotes 8 CHAPTER 3. ALLOCATION OF EFFORT 9 3.1 F i r s t Best 10 3.2 Second Best 11 3.3 Value of Additional Information 15 3.4 Additive Separability of the Sharing Rule 22 3.5 Additive E f f o r t 25 3.6 Application to Sales Force Management 36 3.7 Summary and Discussion 48 Chapter 3 Footnotes CHAPTER 4. ONE-PERIOD SEQUENTIAL CHOICE . 54 55 4.1 F i r s t Best 60 4.2 Second Best 63 4.3 Additive Separability of the Sharing Rule 74 4.4 Additive E f f o r t 76 4.5 Summary and Discussion 84 CHAPTER 5. SUGGESTED FURTHER RESEARCH 88 5.1 Theoretical Agency Extensions 88 5.2 Application to Variance Investigation 89 BIBLIOGRAPHY 91 V APPENDIX 1. ONE-PARAMETER EXPONENTIAL FAMILY OF DISTRIBUTIONS 94 APPENDIX 2. NORMAL DISTRIBUTION CALCULATIONS 102 APPENDIX 3. CHAPTER 3 PROOFS 108 APPENDIX 4. CHAPTER 4 PROOFS 132 vi LIST OF TABLES Page I. II. Examples i n One-product Case 40 One-parameter Exponential Family Q 94 vii ACKNOWLEDGEMENTS I would l i k e to thank Professor Jerry Feltham amd Professor John Butterworth for the many hours they spent discussing ideas and results with me. Their comments often pointed me i n new directions or helped me to look at problems from new perspectives. I am grateful to Professor Tracy Lewis for serving on my d i s s e r t a t i o n committee. I would also l i k e to thank my husband, Kam-Wah Tsui, for h i s technical assistance while I was writing this d i s s e r t a t i o n , and for h i s u n f a i l i n g encouragement the entire time I was i n the Ph.D. program. and moral support during F i n a l l y , I gratefuly acknowl- edge the f i n a n c i a l support of the University of B r i t i s h Columbia and of the Social Sciences and Humanities Research Council of Canada. 1 CHAPTER 1 INTRODUCTION Managerial accounting has t r a d i t i o n a l l y been associated with the valuation of inventories for external reporting and with information provision for internal decision making and control. Broadly speaking, the internal decision making relates to the planning of operations and the control of decentralized organizations. Variance analysis, budgeting, cost-volume- p r o f i t analysis, and the development of performance evaluation measures are t y p i c a l components of the planning and control processes. There are a number of different approaches to gaining a better understanding of the role of the accounting system i n the control of decentralized operations. Since an accounting system i s an information system, any research on the value of information, the demand for information, or the roles or uses of information has potential implications for accounting research. The body of research which examines such information issues has come to be known as information economics. Information economics uses f o r - mal economic models i n order to study the demand for information for decision making and performance evaluation purposes. In p a r t i c u l a r , information economics attempts to find economic explanations for why certain phenomena are observed (e.g., Demski and Feltham, 1978), and to uncover insights about behavior thought to be nonoptimal (e.g., Zimmerman, 1979) or behavior thought to be optimal (e.g., Baiman and Demski, 1980b). Much of the early information economics l i t e r a t u r e focused on essent i a l l y single-person decision situations (e.g., Demski and Feltham, 1976, and Feltham, 1977a), where information serves only a d e c i s i o n - f a c i l i t a t i n g purpose. That i s , the decision maker uses information about the uncertain state of nature to revise his or her b e l i e f s about the decision environment. Thus, the demand for this type of information might be called decision-mak- 2 ing demand. The recent information economics l i t e r a t u r e has incorporated agency theory i n e x p l i c i t l y modeling the multiperson nature of accounting problems (e.g., Baiman and Demski, 1980a, 1980b, Gjesdal, 1981, and Holmstrom, 1977). In multiperson situations, information can play a deci- sion-influencing r o l e . For example, i f a manager's actions a f f e c t actual production costs, and the manager i s evaluated and possibly compensated on the basis of the costs, then the manager's actions w i l l be influenced by the existence of the information system which reports the costs. The demand for this type of information might be c a l l e d performance evaluation demand, or stewardship demand. This d i s s e r t a t i o n uses the agency framework to examine some of the issues i n the development of performance evaluation measures for motivat i o n a l purposes. The basic agency model provides a means of studying s i t u a - tions i n which one individual (the principal) delegates the selection of actions to another individual (the agent). Within the context of the firm, the p r i n c i p a l might be the employer or superior and the agent might be the employee or subordinate. The agency theory l i t e r a t u r e (e.g., Harris and Raviv, 1979, Holmstrom, 1979) uses the expected u t i l i t y model to represent the preferences of the p r i n c i p a l and the agent, and generally assumes that the agent's action ( e f f o r t ) and a random state of nature determine the monetary outcome. The sharing rule (contract or compensation scheme) offered by the p r i n c i p a l to the agent specifies how much i s paid to the agent for each possible value of some performance measure or measures. The performance measure i s often taken to be the monetary outcome, or the monetary outcome and an imperfect signal about the agent's e f f o r t . The compensation can be based only on what i s j o i n t l y observable to the p r i n c i p a l and the agent, and the compensation must be adequate enough to induce the agent to work for the 3 principal. Alternative employment opportunities for the agent are thus e x p l i c i t l y considered. The p r i n c i p a l w i l l generally find i t p r o h i b i t i v e l y costly to continuously monitor the agent to determine what action ( e f f o r t ) the agent chooses. Therefore, i f the agent has d i s u t i l i t y for e f f o r t and acts i n his or her own s e l f - i n t e r e s t , the potential for a moral hazard (incentive) problem exists because of the principal's i n a b i l i t y to observe the agent's actions. If the p r i n c i p a l pays the agent a fixed wage, the agent has no economic incentive to perform the agreed l e v e l of e f f o r t , since a low outcome can be blamed on a bad state of nature rather than on shirking by the agent. At the other extreme, i f the p r i n c i p a l rents c a p i t a l or rents the firm to the agent for a fixed fee so that the agent gets the outcome less a fixed fee, the shirking problem can be avoided e n t i r e l y . The shortcoming of this type of contract i s that i t imposes a nonoptimal amount of r i s k on the agent. That i s , the p r i n c i p a l and the agent could be made better o f f i n an expected u t i l i t y sense by using some other contract. Agency theory provides a framework i n which i t i s possible to find compensation schemes which e f f i c i e n t l y motivate the agent to choose the desired actions. The idea i s to create incentives through an employment contract which imposes some r i s k on the agent i n order to provide incentives for the agent to expend some agreed l e v e l of e f f o r t . The consequences of the exis- tence of nonmonetary returns or costs, such as e f f o r t , can thus be analyzed. This i s important for the analysis of performance evaluation and managerial control systems, where incentive effects play a c r i t i c a l r o l e . The choice of variables on which compensation i s to be based can be formally derived, with implications for the design of information systems. Furthermore, the analysis c l e a r l y demonstrates how the information obtained can be incorporated for motivational purposes. 4 Most of the existing agency theory research (see Baiman (1982) for a comprehensive survey) has a rather narrow d e f i n i t i o n of e f f o r t , i n that e f f o r t i s assumed to be single-dimensional. However, people are often faced with several similar tasks which must be performed within one time period. Examples include a salesperson s e l l i n g several products for a firm, an auditor a l l o c a t i n g time to different tasks i n an audit assignment, a manager controlling several production processes, or a manager overseeing several divisions of a company. The problem of motivating the optimal a l l o c a t i o n of effort within one period i s not only interesting i n i t s own right, but also has possible implications for multiperiod problems, where e f f o r t i s a l l o cated across periods. Multiperiod problems are of interest because the eventual goal i s to be able to analyze and understand when there are current and long-term the issues involved consequences of decisions, as there are i n many accounting settings. Chapter 2 of this dissertation contains the notation used i n the remainder of the paper and a formulation of the agency problem with a l l o c a t i o n of e f f o r t . Chapter 3 describes theoretical results and an application in the a l l o c a t i o n setting, and Chapter 4 describes results i n the one-period sequential choice setting. In this scenario, after each effort l e v e l i s exerted, an associated outcome i s observed by the agent before the next e f f o r t l e v e l i s exerted. sequence of outcomes. The agent i s compensated only at the end of the The one-period sequential choice case i s an interme- diate step between the a l l o c a t i o n of e f f o r t case, i n which both the efforts are exerted before the outcomes are known, and the multiperiod case, i n which the f i r s t outcome i s observed and the f i r s t compensation i s paid before the second e f f o r t i s exerted. Chapter 5 concludes the d i s s e r t a t i o n with an outline of proposed future research. proofs appear i n the appendices. A l l technical calculations and 5 CHAPTER 2 NOTATION AND FORMULATION In order to state the agency problem with a l l o c a t i o n of e f f o r t , the following notation w i l l be used: R = the set of a l l real numbers, R_)_ = the set of a l l nonnegative real numbers, X = the set of possible monetary outcomes, x e X CZ. R i s the monetary outcome, x = (x^,...,x ) i s a disaggregation of the monetary outcome x, i . e . , n k n x= Ex. , w e R i=l i s a k-dimensional vector-valued performance measure, e.g., w = x_with k=n, s ( . ) , a real-valued function, i s a sharing rule over the arguments indicated, with s(.) e [SQ,S],*" a^ = effort expended on task i , i=l,...,d, a_ = (a^ ,... ,a^) e A CZ R^_, f(x,w|a) i s the joint density of x and w conditional on a, and i s understood to be f(x|a) i f w i s a function of x; g(«), h ( . ) , and <(>(.) w i l l also be used to denote probability d i s t r i b u t i o n s ; U(.): R + R i s the agent's u t i l i t y function over money, where U' > 0 and U " <_ 0, V(.): R^ •*• R i s the agent's d i s u t i l i t y function over e f f o r t , where 9V/ 3a > 0 and 3 V/ 3a 2 ± > 0, u = the agent's minimum acceptable u t i l i t y W(.): level, R + R i s the principal's u t i l i t y function over money, where W > 0 and W • <_ 0, argmax {. } = the set of arguments maximizing^the expression i n braces. In order to avoid side-betting issues, i t w i l l be assumed that the p r i n c i p a l and the agent have i d e n t i c a l b e l i e f s about the conditional proba- 6 b i l i t y d i s t r i b u t i o n over the outcome and performance measure, given e f f o r t a_. As i n much of the agency l i t e r a t u r e , the agent's u t i l i t y function i s assumed to be of the form U(s) - V(a). In most of the agency l i t e r a t u r e , n=d=l. The principal's problem i s Maximize // W(x-s(w))f(x,w|a) dw dx (2.1) s( .) ,a_ subject to //[U(s(w))-V(a)]f(x,w|a) dw dx > u a e argmax (2.2) {//[U(s(w))-V(a)]f(x,w|a) dw dx }. (2.3) It w i l l be assumed that (2.3) can be replaced with the conditions -JL //[U(s(w))-V(a)]f(x,w|a) dw dx = 0, i = l , . . . , d . (2.4) Furthermore, s u f f i c i e n t regularity to allow d i f f e r e n t i a t i o n inside the i n t e gral i s assumed. This permits the replacement of (2.4) with //U(s(w))f (x,w|a) dw dx = V (a), i=l,...,d, (2.5) i i with subscripts a^ denoting p a r t i a l d i f f e r e n t i a t i o n with respect to a^. a a a The principal's problem Is solved by means of a generalized Lagrangian technique. A Hamiltonian (Lagrangian) i s formed by attaching a m u l t i p l i e r X to (2.2) and m u l t i p l i e r s to each constraint i n (2.5). It w i l l be assumed that the supports of x and w do not vary as a varies, and that the p a r t i a l derivatives of f with respect to each a^ exist and are nondegenerate. The dimension d i s often taken to be equal to n, and the marginal cumulative d i s t r i b u t i o n functions are assumed to s a t i s f y f i r s t order stochastic dominance. That i s , i f F ^ x ^ a ^ ) i s the marginal cumulative d i s t r i b u t i o n func- tion of X j _ , then SF^x^\a^)/Sa^ < 0, i=l,...,n. In a framework where x.^ = h^(a£,9), where 9 represents state uncertainty, i f x^ i s increasing i n a^ (i.e., aa > 0 1 f o r a l l 9), then 3 F ( x | a ) / 3 a 1 1 i i < 0. F i n a l l y , the shar- ing rule i s assumed to be measurable and bounded. For the most part, i n t e r l o r solutions w i l l be examined. 3 7 Some of the results w i l l make use of two special classes of functions. The f i r s t i s the HARA (hyperbolic absolute r i s k aversion) class of u t i l i t y functions, whose r i s k aversion functions are of the form - U (x)/U'(x) = ,, The C = 1 case corresponds l/(Cx + D). (2.6) to U(x) = ln(x+D), the C = 0 case corresponds to U(x) = - exp[-x/D], and the other cases correspond to power u t i l i t y functions . The other class of interest i s the one-parameter exponential family of distributions. This class includes the exponential, gamma (with the shape parameter fixed), normal (with constant variance), and Poisson d i s t r i b u tions. The following representation d i f f e r s s l i g h t l y from the usual one f o r a one-parameter exponential family (see, e.g., DeGroot, 1970). Definition: A probability density function f(x|a) with respect to the measure r ( . ) w i l l be said to belong to the one-parameter exponential family Q i f i t can be written as f(x|a) = exp[z(a)x - B(z(a))]h(x), (2.7) where r ( . ) i s the Lebesgue measure when the random variable x i s absolutely continuous, and r ( . ) i s some counting measure when x i s d i s c r e t e . The representation i n (2.7) has the advantage that closed-form sions can be obtained for E(x|a) and Var(x|a). B'(z(a)) and Var(x|a) = B " ( z ( a ) ) (Peng, 1975). expres- In p a r t i c u l a r , E(x|a) = Table II i n Appendix 1 d e t a i l s the representations of some familiar d i s t r i b u t i o n s . The remainder of Appendix 1 consists of calculations which are useful i n the proofs of the results i n Chapters 3 and 4. 8 CHAPTER 2 FOOTNOTES If the sharing rule i s unbounded, an optimal solution may not exist (Mirrlees, 1974; Holmstrom, 1977, 1979). Furthermore, the agent's wealth places bounds on the possible sharing rules. Gjesdal (1981) has shown that such a u t i l i t y function for the agent ensures that nonrandomized payment schedules are Pareto optimal. His result refers to ex post (after e f f o r t selection by the agent) randomization only. Fellingham, Kwon, and Newman (1983) have shown that ex ante randomization of payment schedules i s optimal under certain conditions. It w i l l be assumed i n what follows that these conditions are not s a t i s f i e d , and hence the focus i s on pure (nonrandomized) payment schedules. That i s , the focus w i l l be on the f i r s t - o r d e r conditions, which apply to i n t e r i o r solutions. 9 CHAPTER 3 ALLOCATION OF EFFORT As stated e a r l i e r , the agency theory framework e x p l i c i t l y recognizes alternative employment opportunities for the agent, d i s u t i l i t y for e f f o r t , r i s k aversion of the agent, and the p o s s i b i l i t y of the p r i n c i p a l obtaining information about the agent's e f f o r t , a l l for situations i n which the agent has one task to perform. However, i n many situations, job e f f o r t i s multi- dimensional; the agent must allocate e f f o r t to several d i f f e r e n t , but possibly related tasks. mensional In spite of the variety of situations i n which m u l t i d i - job e f f o r t occurs, l i t t l e attention has been devoted i z i n g optimal compensation schemes for these situations. to character- S t i g l i t z (1975) considered multidimensional job e f f o r t under linear incentive schemes, and Weinberg (1975) sought an incentive compatible scheme for the problem of sales force management i n multiproduct firms. Radner and Rothschild (1975) examined the properties of three h e u r i s t i c strategies an agent might employ when faced with the problem of a l l o c a t i n g e f f o r t . More recently, Gjesdal (1982) allowed for multidimensional e f f o r t and focused on the value of information. The focus of this chapter i s the characterization of optimal incentive schemes for the agency problem with a l l o c a t i o n of e f f o r t across several tasks. The issues of separability of the optimal sharing rule across tasks and the value of additional information are examined, and the results suggest that certain compensation schemes that are widely advocated may not be optimal. In p a r t i c u l a r , commission schemes and l i n e a r sharing rules are shown not to be optimal, i n general. The special case of additive e f f o r t i s discussed, and the results are applied to the problem of sales force management . 10 3.1 FIRST BEST Suppose that i n addition to observing the aggregated or disaggregated outcome ( i . e . , w = x or w = x), the p r i n c i p a l can observe the agent's effort. These cases may be called complete contractual information cases, since the p r i n c i p a l can observe the agent's choice of e f f o r t . These " f i r s t best" situations are interesting as benchmarks for comparison with "second best" situations, those i n which there i s less than complete contractual information. The characterizations of the optimal sharing rule for these f i r s t best cases are obtained by solving the problem given by (2.1) and (2.2). As i n the single-dimensional e f f o r t case, i f one individual i s r i s k neutral and the other i s r i s k averse, then the r i s k neutral individual bears a l l the r i s k . pal s(x) Thus, i f the agent i s r i s k neutral (U(s) = s) and the p r i n c i - i s r i s k averse, then Pareto optimal sharing rules are s(x) = x - k and = x - k, where k i s a fixed fee paid to the p r i n c i p a l . Conversely, i f the p r i n c i p a l i s r i s k neutral and the agent i s not, the p r i n c i p a l bears the r i s k , receiving a share of x - c, while the agent receives a constant wage c. In the event that both the p r i n c i p a l and the agent are r i s k neutral, the a^'s are chosen so that the agent's marginal d i s u t i l i t y for e f f o r t equals the marginal increases i n the expected outcome ( i . e . , so that 3E(x|a)/3a^ = 3V/3a^, i = l , . . . , d ) , and the sharing rule can be taken as s(.) = u + V(a*), with the p r i n c i p a l receiving E(x|a*) - u - V(a*). If both the agent and the p r i n c i p a l are r i s k averse, then they each bear part of the r i s k , as indicated i n Proposition 3.1.1 Proposition 3.1.1. below. If both the agent and the p r i n c i p a l are r i s k averse and they have homogeneous b e l i e f s , then s(x) varies only with x i n the f i r s t best case. 1 1 Because the optimal sharing rule depends only on x, s(x) Is the same for a l l jc_ that provide the same t o t a l x. The sharing rule S(JC_) therefore varies with x only for risk-sharing purposes - the makeup of x i s unimportant. Moreover, i t i s e a s i l y seen that s(x) i s increasing i n each x^, regardless of the properties of the conditional d i s t r i b u t i o n function on x. This i s i n contrast to the second best solution. 3.2 SECOND BEST Suppose now that the p r i n c i p a l cannot observe the agent's e f f o r t , and hence must present the agent with a sharing rule which induces the desired choice of e f f o r t . Since the focus i n most of what follows i s on motiva- t i o n a l , rather than risk-sharing issues, i t w i l l be assumed that otherwise stated the p r i n c i p a l i s r i s k neutral and unless the agent i s r i s k averse. As remarked above, i f there were no moral hazard problem, the p r i n c i p a l would then bear a l l the r i s k . Whatever risk i s imposed on the agent i n the second best case i s thus imposed not for risk-sharing purposes, but for motivational purposes. Letting f denote 9f/3a. , the optimal sharing rule, given that only x a I 1 is observed, i s characterized d E ji f t U'(s(x)) = X by (x|a*) a f(x|a*) + ' for almost every x such that s(x) the l e f t hand side of = s i f the opposite (3.2.1) (3.2.1) £ [SQ,S]. For a l l other x, s(x) = ~ N ( a , o ) , and x = s. 1 The 1 if s(x) is true. 2 1 SQ i s greater than the right hand side, and For example, suppose that n = 2 , U(s) = l n s, x^ and x rather 2 2 ~N(a ,a ). 2 2 Then x ~ ^(.a l + a , 2 X 2 are independent, 2 2 + a ) and 1/U'(s) 2 i n t e r i o r portion of the optimal sharing rule i s thus^ (See Table II i n Appendix 1 for the normal density) 12 x - a* 2 2— v^) s(x) = X + ( ^ + °l °2 + ( + u )(a* + a*) h ^ + 2 2 ^ 2 °1 2 + + a a 2 l + J a _ 2 2 ^ X » which can be interpreted as a compensation scheme consisting of a fixed portion plus a commission. If the agent's u t i l i t y function i s U(s) = l - e , - s then the i n t e r i o r portion of the optimal sharing rule i s x - a| - a^ s(x) = l n [ X + (^ + v^) j ^—1* °L + °2 In general, i f the p r i n c i p a l i s r i s k neutral and the agent's u t i l i t y function i s i n the HARA c l a s s , with r i s k aversion function given by -U"(s)/U'(s) = l/(Cs+D), then the i n t e r i o r portion of the optimal sharing rule i s + (x|a) Wri)—>-°^ zl c-t<x ^ £ c if <*° s(x) = (3.2.2) d T. u.f (x|a) i=l D l n ( X+ i f(xla) i f C=0. >• (C=l corresponds to U(s) = ln(s+D), C=0 corresponds to U(s) = - e ~ ^ , and s the other cases correspond to power u t i l i t y D functions.) As i n the single-task setting, the f i r s t best solution i s achievable with a fixed fee going to the p r i n c i p a l when the agent i s r i s k neutral. This can be deduced by interpreting e f f o r t to be a vector rather than a scalar i n the single-task setting proofs (e.g., Shavell, 1979). though less than complete contractual information pal Thus, even Is available, the p r i n c i - and the agent can obtain the same expected u t i l i t i e s as they could i n 13 the complete contractual information case. This i s because i n e f f e c t , the risk-neutral agent rents the firm from the p r i n c i p a l for a fixed fee. When U(.) i s s t r i c t l y concave, equation (3.2.1) implies that a neces2 sary and s u f f i c i e n t condition for s(x) to be nondecreasing i n x i s d f a < ls*> "l-Rt ^ x f(x| «) * >- ° a for a l l x corresponding < ' ' 3 2 3 ) to i n t e r i o r solutions, where the y^'s are the Lagrangian m u l t i p l i e r s associated with the optimal solution (a ,s(x)). When a i s one-dimensional, (3.2.3) reduces to f (x|a*) a a -5* forr^n i >- °> < - - > 3 since y i s positive (Holmstrom, 1979). 2 4 If (3.2.4) i s true for a l l a* e A, then f(x|a) has the monotone l i k e l i h o o d ratio property i n x (Lehmann, 1959, p. 111). Many d i s t r i b u t i o n s , including a l l those i n the one-parameter expo- nential family, have the monotone l i k e l i h o o d r a t i o property; this property i s a stronger ordering on distributions than i s f i r s t - o r d e r stochastic dominance (Lehmann, 1959, pp. 73-74). Lagrangian for multipliers If a_ i s multidimensional and the y^ are a l l nonnegative, then a s u f f i c i e n t condition s(x) to be nondecreasing i n x i s f IT 1 a (xja*) f(x| *) a 1 ^ °' f o r a 1 1 x « i = 1 .---» d When a i s single-dimensional, the f i r s t - o r d e r stochastic dominance property means that as a increases, the d i s t r i b u t i o n f(x|a) s h i f t s to the right. It i s this property that accounts for the monotonicity mal sharing r u l e . of the o p t i - When a_ i s multidimensional, the problem of determining an ordering over the e f f o r t vectors a r i s e s . Condition (3.2.3) states that the d i r e c t i o n a l derivative of log f(x[a) i n the d i r e c t i o n of y = (y^,...,u^) at 14 a* be nonnegatlve i n order Thus, u provides f o r the o p t i m a l s h a r i n g r u l e to be monotonic. a d i r e c t i o n i n which to measure the s h i f t i n g of f(x|a_). Because of the c r i t i c a l r o l e that the m u l t i p l i e r s ing r u l e , i t i s of i n t e r e s t to t r y to determine whether they are positive. A p a r t i a l answer i s provided s i t u a t i o n i n which the v e c t o r x_ = P r o p o s i t i o n 3.2.1. F p l a y i n the ( il i) a x a ^» < i= i n P r o p o s i t i o n 3.2.1 It V(a) of with s t r i c t and can be i n e q u a l i t y f o r some x^ v a l u e s . strictly convex i n a, other a2» concave i n a. then both and and g( •) and concave i n a_. h( •) are e x p o n e n t i a l Then at 2^s, utility In t h i s case, i f the p r i n c i p a l i s r i s k n e u t r a l , are p o s i t i v e . P r o p o s i t i o n 3.5.9 and i n s e c t i o n 3.5 sharing Suppose f(xjj0 i s of the form g i v e n i II g (x i=l provides are p o s i t i v e . remark on the c h a r a c t e r i s t i c s of the o p t i m a l be made at t h i s p o i n t . = distribu- r e s p e c t i v e l y , then the agent's expected c o n d i t i o n s under which both A final and Suppose shown that i f the agent's u t i l i t y f o r wealth i s U(s) is strictly the must be p o s i t i v e . t i o n s with means a^ and is below, f o r i s observed. (x^,X2) f u r t h e r that the agent's expected u t i l i t y i s s t r i c t l y l e a s t one strictly Suppose x^ = (x^,X2), f(xj.a) = g(x^|a^)h(x2|a2), l»2> shar- rule can in Proposition d 3.2.1, i . e . , f ( x | a ) = merely i n d i c e s . The with x replaced by _x. 1 to u t ( s ( x optimal |a ), where the s u p e r s c r i p t s on g( •) are 1 sharing r u l e s i s c h a r a c t e r i z e d as i n (3.2.1), In the s p e c i a l case under c o n s i d e r a t i o n , d ) ) = x + \Z 2 a ( x |a*)/g(x ± ± |a*) belongs to the one-parameter e x p o n e n t i a l ( »)/g*( *) each g* a t h i s reduces i s a constant . I f , f u r t h e r , each g i ( •) family Q described m u l t i p l i e d by by (2.7), then (x^ - H j ^ a ^ ) ) , where M j ^ a ^ is i the mean of x^ g i v e n a^. The norms, so t h a t the o p t i m a l standards ( c f . C h r i s t e n s e n , means M^(a^) can be sharing 1982). thought of as standards or r u l e i s a f u n c t i o n of d e v i a t i o n s from T h i s i s c o n s i s t e n t with managerial 15 accounting's focus on variances (deviations from standards) as an aid i n performance evaluation. 3.3 VALUE OF ADDITIONAL INFORMATION A question which naturally arises at this point i s : Would the p r i n c i - pal be better off knowing each rather than only x? That i s , w i l l the p r i n c i p a l always be s t r i c t l y better off with disaggregated tion? or f i n e r informa- More generally, under what conditions w i l l the p r i n c i p a l or the agent be made s t r i c t l y better off by information i n addition to the aggregate outcome, x? I n t u i t i v e l y , the more (imperfect) information the p r i n c i p a l has about the agent's e f f o r t , the more e f f i c i e n t l y the agent can be motivated effort. to exert Consequently, the p r i n c i p a l ' s expected u t i l i t y should increase i n most situations where additional information i s a v a i l a b l e . A number of people have addressed this problem. Holmstrom (1979), for example, showed that i f the additional information i s of value (that i s , i f i t s optimal use w i l l lead to a Pareto superior pair of expected u t i l i t i e s for the p r i n c i p a l and the agent), then the additional information must be informative i n the sense that i t contains information about the agent's e f f o r t that i s not in the output. 1982) The converse was also shown to be true. contained Gjesdal (1981, examined the relationship between Blackwell (1953) informativeness and the value of information. In order to define Blackwell informativeness, l e t ft be a set of possible performance measures to. output, x. space Y. In this section, u> i s assumed to include the An information system n i s a function from ft to some s i g n a l Let y denote an arbitrary element in Y and l e t A, the set of a l l possible actions, be f i n i t e . mative than another system y : Information system n i s Blackwell more i n f o r -»• Z i f and only i f P (z|a) = 16 / P(z|y)dP (y|a) for each action a i n A and each signal z i n Z. Y It should 11 be noted that although may n i s said to be Blackwell more informative than y , n actually be only equally informative as y i s . Amershi (1982, Appendix 1) has generalized the d e f i n i t i o n for the case where A i s i n f i n i t e . Amershi (1982) re-examined the value of additional Information problem, and corrected and generalized the results of Holmstrom (1979, 1982) Gjesdal (1981, 1982). and Amershi (1982) showed that a risk neutral p r i n c i p a l weakly prefers an information system that i s Blackwell more informative than another. That i s , the principal's and the agent's expected u t i l i t i e s are at least as high with the Blackwell more informative system than with the other. A r i s k averse p r i n c i p a l requires that the Blackwell more informative system also provide a s p e c i f i c form of information about the output Proposition 3.3.1 below). (see These results d i f f e r from the single-person deci- sion maker case, where r i s k attitudes are immaterial. Intuitively, a risk neutral p r i n c i p a l i s concerned only with the incentive properties of contracts, whereas a r i s k averse p r i n c i p a l i s concerned with both the incentive and risk-sharing properties of contracts. accounts for the conditions on the output More s p e c i f i c a l l y , Proposition 3.3.1 The risk-sharing aspect i n Proposition 3.3.1 below. says that information system n i s at least as preferred as information system y i f n i s Blackwell more i n f o r mative than y with respect to the e f f o r t , a, and ( i ) there i s no risk-sharing involved, or ( i i ) y says nothing more about the output x than n does, or ( i i i ) the signal provided by n i s enough to determine the Proposition 3.3.1 (Amershi (1982, Theorem 3.1)). output. Let an information system n : ft -»• Y be more informative i n the Blackwell sense than the system y : ft -*• Z with respect to the family of measures P^ = {p(o>|a) : aeA}. pose also, at least one of the following conditions hold: i s r i s k neutral, Sup- ( i ) The p r i n c i p a l ( i i ) The output variable and the information system y are 17 conditionally independent given n. ( I i i ) The output can be expressed as x = h(n(u))) f o r some measurable function h : Y * R. Then the p r i n c i p a l weakly prefers n over y. In this proposition and i n the other propositions i n this section, to i s a vector of performance measures that includes the output, x. e f f o r t variable, a, i s taken to be single dimensional, Although the the proof holds for finite-dimensional e f f o r t vectors as w e l l . Proposition 3.3.1 i d e n t i f i e s conditions under which information system n i s at least as preferred to information system y. It i s of interest to i d e n t i f y conditions under which n i s s t r i c t l y preferred to y. Amershi's (1982) s t r i c t preference ficient statistics. results r e l y on the concept of suf- Using the notation above, a s t a t i s t i c T : ft + K i s suf- f i c i e n t for the family of measures P^ = {P(to|a) : aeAJ i f and only If there exists a nonnegative function h : ft + R + and functions g( *|a) : K •*• R such that f(to|a) = h( to)g(T( to) |a) for a l l weft and aeA, where f( •) i s a density i f the random variable i s continuous, or a mass function i f the random variable i s d i s c r e t e . viewed as an information system. A s u f f i c i e n t s t a t i s t i c may be A minimal s u f f i c i e n t s t a t i s t i c i s a s u f f i - cient s t a t i s t i c T : ft -*• L that i s a function of every other s u f f i c i e n t statistic. An, agency s u f f i c i e n t s t a t i s t i c (Amershi, 1982) "F i s equal to a suf- f i c i e n t s t a t i s t i c T on ft i f the p r i n c i p a l i s risk neutral, or (X,T) i f the p r i n c i p a l i s risk averse. *P i s called a minimal agency s u f f i c i e n t i f the s u f f i c i e n t s t a t i s t i c T i s minimal. statistic F i n a l l y , a contract (s*,a*) i s called a best agency contract i f there i s no other contract based on any information system on ft that i s s t r i c t l y preferred to i t . The proposition below uses the concept of agency s u f f i c i e n t to characterize s t r i c t preferences statistics f o r information systems. E s s e n t i a l l y , 18 the p r i n c i p a l w i l l s t r i c t l y prefer an agency s u f f i c i e n t s t a t i s t i c n over another system y which does not generate a best contract. This i s because a best contract must be a function of the minimal agency s u f f i c i e n t statistic, which extracts a l l relevant information from cu about a (Amershi (1982, Coro l l a r y 3.3)). Proposition 3.3.2 below provides a s i t u a t i o n i n which the information system y cannot generate a best contract. (Amershi (1982, Proposition 3.4)). Proposition 3.3.2 Suppose a best con- tract exists and at each best agency contract, W'(x-s*(S(w))) U'(s*(f3(u>>> where 3 i s an information system which leads to a best agency contract. The p r i n c i p a l s t r i c t l y prefers an agency s u f f i c i e n t s t a t i s t i c n over a system y If -sg- log f(oj|a ^) i s not a function of Y i f the p r i n c i p a l i s r i s k neutral 1 (or not a function of (x, Y) i f the p r i n c i p a l i s r i s k averse). Here (s*,a*) is the optimal contract based on Proposition 3.3.2 Y' holds for the multidimensional e f f o r t case, with cations of Amershi's (1982) c o r o l l a r i e s to his Proposition 3.4. are immediate. Corollary 3.3.3 Their proofs deals with the s i t u a t i o n i n which an addi- tional signal z would be of positive value given an information system which reports the outcome x and another signal y. As i n Proposition 3.3.2, a con- d i t i o n i s provided which implies that Y(x,y,z) = (x,y) cannot generate a best contract. Since t i c , Corollary 3.3.3 3.3.4 n(x,y,z) = (x,y,z) i s t r i v i a l l y a s u f f i c i e n t follows d i r e c t l y from Proposition 3.3.2. statis- Corollary provides a s i t u a t i o n i n which a s u f f i c i e n t s t a t i s t i c i s s t r i c t l y pre- ferred to a nonsufficient s t a t i s t i c . 19 Corollary 3.3.3 (Gjesdal (1982, Proposition 1)). x,y,z are from some spaces }. Letft= {u> = (x,y,z) : Let n be the information system that reports (x,y,z), and l e t Y be the information system that reports (x,y). Assume that f o r 8 = n and 8 = Y, W'(x-s*(B((D))) U'(s*(g(u))) n = X + i = \ \ TT ^ H a * ) f ± (3.3.1) g holds at the contracts (n, s*, a*p and (Y, s*, a*). Then the signal z has marginal value given (x,y) (that i s , the p r i n c i p a l s t r i c t l y prefers n over Y) i f 3 n Z i=l p 1 ^ i log f(o)|a*) ~ (3.3.2) i s not a function of (x,y). Corollary 3.3.4 (Holmstrom (1982, Theorem 6)). Suppose the p r i n c i p a l i s risk neutral, and suppose that for some system y : ft * Z, the expression i n (3.3.2) i s not a function of y at each a eA. prefers any s u f f i c i e n t s t a t i s t i c Then the p r i n c i p a l s t r i c t l y n over y i f equation (3.3.1) holds at any best agency contract generated by information system 8. As Amershi (1982) remarks, these s t r i c t preference results do not establish that an agency s u f f i c i e n t s t a t i s t i c i s always s t r i c t l y preferred to a nonsufficient s t a t i s t i c . In order for a s u f f i c i e n t s t a t i s t i c to be s t r i c t l y preferred to a nonsufficient s t a t i s t i c , the p r i n c i p a l must use Information which i s provided by the s u f f i c i e n t s t a t i s t i c but not provided by the nonsufficient s t a t i s t i c . In addition, the p r i n c i p a l ' s r i s k attitude i s a factor, as shown i n the proposition below. Part (2) of Proposition 3.3.5 says that s u f f i c i e n c y alone cannot determine s t r i c t preference ing of information systems i f the p r i n c i p a l i s r i s k averse. order- 20 Proposition 3 . 3 . 5 (Amershi ( 1 9 8 2 , Proposition 3 . 5 ) ) . Let n be the minimal s u f f i c i e n t s t a t i s t i c and x be the output. (1) A r i s k neutral p r i n c i p a l s t r i c t l y prefers n over any system y which i s not a s u f f i c i e n t s t a t i s t i c i f and only i f every best agency contract (s*,a*) i s such that s* i s a s u f f i c i e n t (2) statistic. y A r i s k averse p r i n c i p a l s t r i c t l y prefers (x,n) over any system such that (x, y) i s not an agency s u f f i c i e n t s t a t i s t i c i f and only i f every best agency contract (s*,a*) Is such that (x,s*) i s an agency sufficient statistic. Again, although the e f f o r t variable i s single-dimensional, the result holds even i f e f f o r t i s multidimensional. Amershi (1982) next developed the following r e s u l t . 3 minimal s u f f i c i e n t s t a t i s t i c . Suppose n(w) Is a 3 If -sg- log f(co|a*) = log k( n( ui) | a*) i s an i n v e r t i b l e function of n(w), then a r i s k neutral p r i n c i p a l s t r i c t l y prefers n over any system y that i s not a s u f f i c i e n t s t a t i s t i c , and a r i s k averse p r i n c i p a l s t r i c t l y prefers (x,n) over any system (x, y ) which i s not an agency s u f f i c i e n t statistic. Unlike Amershi's (1982) previous results, which were easily extended to the multidimensional e f f o r t case, the i n v e r t i b i l i t y result above does not lend i t s e l f to the multidimensional e f f o r t case. I n t u i t i v e l y , the dimension of a s u f f i c i e n t s t a t i s t i c cannot be less than the dimension of the vector of parameters to be estimated. For example, suppose that x^,...,x ( n 2 ) are n observations from a normal d i s t r i b u t i o n with unknown mean 9 and unknown var2 iance o . - 2 2 Then a s u f f i c i e n t s t a t i s t i c for the vector of parameters (9,0" ) - 2 i s (x,s ), where x i s the sample mean and s i s the sample variance. More- over, i t i s obvious that more than one observation i s needed i n order to 2 make inferences about ( 9, a ). Thus, a s u f f i c i e n t s t a t i s t i c i n the m u l t i d i - mensional e f f o r t case w i l l generally be multidimensional. The impossibility 21 of inverting a one-dimensional value to obtain a multidimensional s t a t i s t i c precludes the use of Amershi's i n v e r t i b i l i t y result i n the a l l o c a t i o n of e f f o r t problem. For example, i n the a l l o c a t i o n of e f f o r t problem, x_= (x^,...,x ) i s n p o t e n t i a l l y observable, with the d i s t r i b u t i o n of x parameterized by a = n ( a ^ , . . . , a j ) . The s t a t i s t i c x = E x . can only be s u f f i c i e n t for (x,x) i f a 1-1 i s not r e a l l y multidimensional, i . e . , i f there i s some known functional relationship among the a^s so that knowledge of one a^ Is s u f f i c i e n t to perf e c t l y i n f e r the others. A special case of this type of relationship occurs when i t i s known that the agent w i l l always choose the a^s to be equal. the a l l o c a t i o n problem, i t i s very u n l i k e l y that a_ i s not r e a l l y In multidimen- sional, and therefore i n general, x i s not s u f f i c i e n t for (x,3c_), i . e . , the minimal s u f f i c i e n t s t a t i s t i c i s multidimensional. Continuing with the focus on the value of additional disaggregated information, the principal's weak preference for the additional information i s e a s i l y established. A multidimensional-effort version of Proposition 3.3.1 shows that the information system reporting x_= (x^,...,x ) i s weakly n preferred to the information system reporting only E x., no matter what i=l n the p r i n c i p a l ' s or the agent's r i s k attitudes are. If the p r i n c i p a l can observe _x, the i n t e r i o r portion of the optimal sharing rule i s characterized by . U'(s(x)) d = A + E u, ^ / i 8 / ls) x a 1 g(x|a) ' To i l l u s t r a t e , suppose again that n=2, (x-^.x^ and U(s) = l n s, but l e t jc_ = - N( a_, E), where E i s the covariance matrix . 2 / \ po^Og 2 22 Then the i n t e r i o r portion of the optimal sharing rule i s (see Appendix 2 for b i v a r i a t e normal calculations) 2 8 tel§) a s( ) = x + £ p - J r . 1 X x r = x + F T i l" l Hi z 1 5 P(x2-a ) a n x -a 2 2 2, ~—r-rri ) ^(^(l-p ) 0^(1-p "2 [ „ 2 1 or(l-p ) 0 O (l-p ) 2 ]L 1 a (l-p ) x 2 U jp" " 1 2,'—7—27^ PV^ Uj^ = X + (x -a )[-^ 1 + p(x>a) 2 pi^ 2 77— - + (x -a )[-^ 2 oj^Cl-p ) 2 CT 2 ]. ^ (1-P ^^(l-p ) This compensation scheme may be interpreted as a commission scheme with d i f ferent commission rates for each task. sion rates w i l l be the same for both tasks. e r a l , even i f x^ and x 2 = ov, and If = y , the commis2 It should be noted that i n gen- are independent, the optimal commission rates need not be equal across tasks. This i s because when x^ and x 2 are independent ( P=0), s(x) = X - a ^ / o ^ - \ + x i , J i / ^ + x^/a . c 2 In this case, the commission rate for task i depends only on the variance of x^ and the m u l t i p l i e r u^. Since the sharing rule depends on each Xj>, the 2 s i g n a l jc, obtained i n addition to x, i s valuable (unless M^/o^ 2 = l^/^)* This can be deduced formally from Proposition 3.3.2. 3.4 ADDITIVE SEPARABILITY OF THE SHARING RULE Once the p o s s i b i l i t y of observing each x^ i s introduced, the question of whether or not to reward the agent for each outcome separately a r i s e s . For example, should a manager of two divisions that are geographically d i s persed be rewarded for the performance of each separately? Analytically, the question i s whether the optimal sharing rule i s a d d i t i v e l y separable i n 23 the x^'s. This question w i l l be addressed for the HARA class of u t i l i t y functions. V(x) = \ + E ]i.g (x|a)/g(x|a). As before, i f the agent's Let l a utility ± function i s i n the HARA class, with -U"(s)/U*(s) = l/(Cs+D), then the i n t e r i o r portion of the optimal sharing rule i s given by i((V(x)) s(x) = C - D), if C *0 < (3.4.1) D ln(V(x)), i f C = 0, for almost every jc_ such that s(x) £ . If the p r i n c i p a l i s r i s k [SQ,S] averse, with u t i l i t y function i n the HARA class and with i d e n t i c a l cautiousness C (see (2.6)), then the i n t e r i o r portion of the optimal sharing rule i s ( V(x)) (Cx+D ) - D C 1 2 if C * 0 C(l + ( V ( x ) ) ) C s(x) = < (3.4.2) D D l n V(x) + D x 1 2 2 D where 1 + D i f C = 0, 2 corresponds to the p r i n c i p a l , and T>2 corresponds to the agent. Equation (3.4.1) implies that i f the p r i n c i p a l Is r i s k neutral and the agent's u t i l i t y function i s i n the HARA class, then a necessary condition for the optimal sharing rule to be additively separable i s that C=l, i . e . , that the agent have a log u t i l i t y function. strong form of independence Given that U(s) = l n s, a of the outcomes, x^,...,x , i s a s u f f i c i e n t conn d i t i o n for the optimal sharing rule to be additively separable. cifically, l e t g^(x^|a£) be the density of outcome x^ given effort a^, and l e t g(x|a_) be the joint density of jc_ given a_. g(x ,...,xja ,...,a ) = 1 More spe- 1 n n^g ( x | a ) i i Then (3.4.3) 24 Is a s u f f i c i e n t condition for additive separability of the optimal sharing n rule, given that U(s) = In s. i i s (x ) = u — 1 In this case, s(x) = \ + Is i=l i (x ), where 1 . The example i n Section 3.3 shows that given U(s) = VUja.) l n s, independence i s a s u f f i c i e n t but not a necessary condition for separa b i l i t y of the sharing r u l e . One might conjecture that there are other common d i s t r i b u t i o n s of dependent random variables which, when U(s) = l n s, y i e l d a separable sharing r u l e . However, no other common joint d i s t r i b u - tions which seem appropriate (see, e.g., Johnson and Kotz, 1972), seem to lead to such a r e s u l t . In general, then, the optimal sharing rule w i l l not be additively separable. It i s interesting to note that (3.4.3) i s not s u f f i c i e n t to y i e l d a separable sharing rule i f U(s) l n s. Furthermore, (3.4.3) i s not s u f f i - cient to y i e l d a separable sharing rule i f both the p r i n c i p a l and the agent are risk averse, with HARA-class u t i l i t y functions and i d e n t i c a l cautiousness C. This i s easily seen from equations (3.4.2). Hence, even i f the p r i n c i p a l and agent have i d e n t i c a l log u t i l i t y functions, a separable sharing rule i s not optimal. These results d i f f e r from those i n the cooperative setting, i n which a weighted sum of the p r i n c i p a l ' s and the agent's expected u t i l i t i e s i s maximized (no Nash constraint i s necessary). In the cooperative case, i f b e l i e f s are i d e n t i c a l and the p r i n c i p a l and agent are s t r i c t l y r i s k averse, then the optimal sharing rule i s linear for a l l weights i f and only i f the individuals have HARA-class u t i l i t i e s with i d e n t i c a l cautiousness (Amershi and Butterworth, 1981). accounts Thus, the moral hazard problem p a r t i a l l y for the generally nonlinear form of the optimal sharing r u l e s . One additively separable compensation scheme which i s commonly used i s the commission scheme. This scheme has the further r e s t r i c t i o n that 25 s ^ ( x ^ ) = c^x^ + b^ , a l i n e a r f u n c t i o n of x^. t i o n f o r a commission scheme ( l i n e a r s h a r i n g As above, a necessary r u l e ) to be optimal p r i n c i p a l be r i s k n e u t r a l and that the agent have a l o g u t i l i t y Given U(s) = I n s, whether or not the optimal sharing on the c o n d i t i o n a l d i s t r i b u t i o n of the outcomes g i v e n 3.5 condi- i s that the function. r u l e i s l i n e a r depends effort. ADDITIVE EFFORT T h i s s e c t i o n examines the s p e c i a l case where e f f o r t when e f f o r t represents intrinsic disutility time spent on d i f f e r e n t t a s k s , and where there f o r any p a r t i c u l a r t a s k . f u n c t i o n f o r e f f o r t expended on d tasks T h i s case n e c e s s i t a t e s i s a d d i t i v e , as In t h i s case, the d i s u t i l i t y can be w r i t t e n as V(a^+..,+a^). only minor changes i n the a n a l y s i s ; p a r t i a l t i v e s of V ( . ) with r e s p e c t i s no t o a± a r e r e p l a c e d by V ' ( . ) . deriva- The assumption t h a t the p r i n c i p a l i s r i s k n e u t r a l and the agent i s r i s k averse w i l l be maintained i n t h i s section. Suppose that there i s one outcome x^ a s s o c i a t e d with each a^, and that d the mean of each x^ i s k m ( a ) , so that E(x) i and m|( •) > 0. i In the f i r s t i best t h a t the agent r e c e i v e a constant 8E(x|a)/9 The m i( i) a cates = sk^jlaj) ai simplest a i» ^ order conditions require wage and that = XV'(.), f o r a l l i . case i s that of constant (3.5.1) marginal p r o d u c t i v i t y , where f u r t h e r , k^ = k f o r a l l i , then (3.5.1) i n d i - o r that (3.5.2) i hence any mix of e f f o r t s s a t i s f y i n g (3.5.2) i s e q u a l l y a c c e p t a b l e to both the p r i n c i p a l and the agent. e f f i c i e n c y of e f f o r t The k^'s may be thought of as measures o f ( S h a v e l l , 1979). boundary s o l u t i o n r e s u l t s . The Z k^m^a.^), where k^^ > 0 case, the f i r s t k/ X = V ( Z a ) , and = I f a l l the k^'s are unequal, then a In p a r t i c u l a r , a l l but one of the a^'s a r e z e r o . problem i s thus e s s e n t i a l l y one of choosing on which task of many to 26 expend e f f o r t . Suppose there are two tasks, with k^ > k . 2 In this s i t u a - tion, the optimal solution i s to devote e f f o r t exclusively to task one. These results are summarized as Lemma 3A.2 i n Appendix 3, where the proofs can also be found. Comparison of two one-dimensional e f f o r t situations with k^ > k shows 2 why the p r i n c i p a l Is better off with a^ > 0 and at. = 0 than with a^ = 0 and ASj > 0. Since k^ > k , there i s a higher return per unit of e f f o r t for task 2 one than from task two. Furthermore, i t i s worthwhile for the p r i n c i p a l to induce more e f f o r t f o r task one than for task two (see Proposition 3A.3 and i t s proof i n Appendix 3). The combined productivity gains ( r e c a l l that E(x^) = k^a^) outweigh the required increased fixed wage compensation to the agent, who would receive the same expected u t i l i t y for either task. The principal's s i t u a t i o n can be depicted graphically as follows: ^i J _+ s* k a l * " a i ^^-f^k a 2 ^ ^ & - T * k 2 2 • a s* 1 1 at, a£ For general m^a^), (3.5.1) implies that k^m^(a^) = k^ml(aj), i,j=l d. The marginal impacts of the a^'s on the expected outcomes are balanced, and hence the solution w i l l generally be i n t e r i o r . If the mean functions are i d e n t i c a l , then the optimal e f f o r t s w i l l be equal. Although the agent's u t i l i t y for wealth i s not important i n determining the principal's choice of the a^'s i n the f i r s t best case, i t i s important 27 in the second best case. Assuming an Interior solution, the f i r s t order conditions i n the second best case require that g U ( s Q Q ) _ 3EU(s(x)) I /o c o\ . . , » - i> J _ l > • • • »Q ^ J . J . J ; j Since the agent's effort i s not observable i n this case, the p r i n c i p a l must induce the agent to exert the optimal amount of e f f o r t at one or more tasks. The p r i n c i p a l may at find i t optimal to devote resources to preventing shirking only one task even i f multiple tasks are a v a i l a b l e . the p r i n c i p a l could, by imposing less r i s k , motivate It i s possible that the agent more e f f i - c i e n t l y i f the agent were induced to devote effort to only one task. Since the risk-averse agent must be compensated for bearing r i s k , the p r i n c i p a l may be better off imposing r i s k related to just one task. The propositions i n the remainder of this section describe situations in which a boundary solution or an i n t e r i o r solution w i l l be optimal, and characterize i n t e r i o r solutions. Before stating the propositions, a simple example w i l l be used to introduce the issues. Suppose there are two independent and i d e n t i c a l tasks, whose outcomes are represented by X^ and X 2 is Suppose further that the agent's action space {(2a*,0),(0,2a*),(a*,a*),(a*,0),(0,a*),(0,0) }, where an e f f o r t l e v e l of 0 represents the minimal e f f o r t the agent w i l l exert. b i l i t i e s of X^ given a are: P r o b a b i l i t i e s given that a=0 a=2a* a=a* $1 .10 11/12 1/2 1/12 - .10 1/12 1/2 11/12 x i E(X la) 1.00 0.50 0.00 VarCXi la) 0.11 0.36 0.11 1 Suppose that the proba- 28 The joint outcomes occur with the following p r o b a b i l i t i e s : P r o b a b i l i t i e s given that a^=2a* Reward (X ,X ) 1 a^=a* a =0 a =0 1 1 a =0 a =2a* a =a* 2 a =0 a =a* a =0 2 2 a^=a* 2 2 2 2 si (1.1,1.1) 11/144 11/144 1/4 1/24 1/24 1/144 s 2 (1.1,-.1) 121/144 1/144 1/4 11/24 1/24 11/144 s 3 (-.1,1.1) 1/144 121/144 1/4 1/24 11/24 11/144 s 4 (-.l.-.l) 11/144 11/144 1/4 11/24 11/24 121/144 E(X +X |a) 1.00 1.00 1.00 0.50 0.50 0.0 Var(X +X |a) 0.22 0.22 0.72 0.47 0.47 0.22 1 1 2 2 Let s_= (s^,s ,S3,S4). Suppose the p r i n c i p a l ' s problem i s : 2 Maximize E(250X! + 250X ) - E(s) 2 subject to EU(_s) - V(ai + a ) >^ u 2 (a^,a ) maximizes |EU(J3) - V(a^+a )} • 2 2 Let IKSJ^) = Ss^ and u = 10, and a* = 1. The optimal solution i s for the p r i n c i p a l to induce the agent to exert 2a* at one task, with the reward for the one task as follows: s = 148.84 and s 0 = 96.04, where s i s paid i f the outcome i s 1.1, and SQ i s paid otherwise. If the p r i n c i p a l desired to induce the agent to exert a* at both of the tasks, the following sharing rule would be optimal: B[ = 207.40, s 2 - s 3 = 144, s£ = 92.16. Looking at the variance as a measure of r i s k , we note that the outcome is r i s k i e r when a_ = (a<£,a*,) than when a_ = (2a*,0). However, this r i s k i s not d i r e c t l y of concern to either the p r i n c i p a l or the agent, because the p r i n c i p a l i s r i s k neutral and the agent i s not concerned about the riskiness of the outcomes per se, but rather about the effects on the compensation 29 received. In the example above, Var(_s) = 213.0 while Var(j_') = 1667.2, and E(j_) = 144.44, which i s less than E(s_') = 146.88. The p r i n c i p a l can thus motivate the agent more e f f i c i e n t l y with a boundary solution rather than with an i n t e r i o r solution. In this case, the principal's expected payments to the agent are lower for the sharing rule which imposes less r i s k (as measured by the variance) on the agent. Although the variances of the outcomes are not d i r e c t l y of concern to either the p r i n c i p a l or the agent, they are i n d i r e c t l y of concern. X 2 and are not only outcomes, but also signals about the agent's e f f o r t s ; as such, they provide information about the e f f o r t s . The r e l a t i v e magnitudes of the variances of the outcomes are potential surrogates for measures of informativeness, since the variances indicate how the signals about the e f f o r t s w i l l vary as the e f f o r t s vary. (information) In the example above, a t o t a l e f f o r t l e v e l of 2a* w i l l provide the same t o t a l expected outcome, regardless of whether a* i s devoted to each of two tasks, or 2a* i s devoted to a single task. However, the variance of the outcome i s smaller when 2a* i s devoted to a single task than when the e f f o r t i s allocated Since the expected to two tasks. outcomes are the same, the risk-neutral p r i n c i p a l desires to allocate e f f o r t i n the way that provides the most information about shirking. That i s , information issues become dominant i n the p r i n c i p a l ' s choice of the a l l o c a t i o n of e f f o r t . A s i t u a t i o n similar to the discrete outcome example above occurs when the X^'s are independent and i d e n t i c a l l y distributed with a normal d i s t r i b u tion with mean ka and variance . If e f f o r t a* i s devoted to each of two independent tasks, then E(X +X |a^=a*,a =a*) = 2ka* and 1 Var(X +X 1ai=a*,a =a*) = 2<?. 1 2 2 2 2 If e f f o r t 2a* i s devoted to just one task, say the f i r s t task, then the expected outcome i s 2ka*, which i s equal to E(X-^+X 1a^=a*,a =a*). 2 2 However, i f the agent i s compensated only on the 30 2 the corresponding variance i s cr , which i s s t r i c t l y less than basis of X^, Var(X^+X |a^=a*,a =a*). 2 In this situation, then, we might conjecture that a 2 boundary solution i s optimal. The two examples above had 2E(X |a*) = E ^ ^ a * ) . Clearly, 1 this can hold for a l l e f f o r t levels only when the means are linear i n e f f o r t . The examples also had Var(X +X |a =a*,a =a*) > V a r ( X | = 2 a * ) . 1 2 1 2 x (3.5.4) a] Thus, one might conjecture that a boundary solution i s optimal i n cases where (3.5.4) holds and there are independent and i d e n t i c a l l y distributed outcomes, with the means proportional to e f f o r t . It should be pointed out, however, that the a d d i t i v i t y of e f f o r t would also be c r i t i c a l for this result. If the X ^ s have Poisson d i s t r i b u t i o n with E(X |a =a) = ka = i i Var(X^|a^=a), then the variances change as the e f f o r t s change. If a* i s exerted at each of two independent tasks, then E(Xj+X |a^=a*,a =a*) = 2ka* = 2 Var(Xj+X |a^=a*,a =a*). 2 2 If 2a* i s exerted at one task, say task one, 2 E(X |a =2a*) = 2ka* = E(X +X |a =a*,a =a*) = Var(X |a =2a*). 1 1 X 2 x 2 x then Therefore, we L might expect that the p r i n c i p a l would be indifferent between a boundary solution and an i n t e r i o r one. F i n a l l y , consider the exponential d i s t r i b u t i o n , where E(X^|a^=a) = ka and Var(X^|a^=a) = k a . If a* i s exerted at each of two independent tasks, then E(X +X |a =a*,a =a*) = 2ka* and Var(X +X | =a*,a =a*) = 2 k a * . 2 1 2 1 2 1 2 a] If 2 2 2a* i s exerted at one task, then E(X |a =2a*) = 2ka* = E(X +X |a =a*,a =a*) 1 but Var(X |a =2a*) = 4 k a * 2 1 2 1 1 1 > Var(X +X |a =a*,a =a*). ] 2 x 2 2 1 2 Thus, i n this situa- tion, we might conjecture that an i n t e r i o r solution, rather than a boundary solution, would be optimal. The propositions below substantiate the i n t u i t i v e arguments above concerning when an i n t e r i o r solution or a boundary solution i s optimal, given 31 that the expected outcomes of independent and i d e n t i c a l tasks a r e p r o p o r t i o n a l to e f f o r t expended. effort, then the s i t u a t i o n s become more Initially, complicated. the normal d i s t r i b u t i o n with constant a f u n c t i o n of e f f o r t w i l l est, I f the expected outcomes are n o n l i n e a r i n be c o n s i d e r e d . variance but w i t h mean This case i s o f p a r t i c u l a r i n t e r - s i n c e i t i s the only d i s t r i b u t i o n i n Q (see (2.7)) whose v a r i a n c e i s independent o f the agent's e f f o r t . The f o l l o w i n g p r o p o s i t i o n s t a t e s condi- t i o n s under which a boundary s o l u t i o n i s o p t i m a l . Proposition x^ and x 2 3.5.1. Suppose the p r i n c i p a l i s r i s k n e u t r a l , U(s) = 2v^T, and a r e c o n d i t i o n a l l y independent and i d e n t i c a l l y d i s t r i b u t e d n o r m a l l y w i t h mean ka and constant variance. Suppose f u r t h e r that V(a) = V ( Z a ^ ) . Then a boundary s o l u t i o n i s o p t i m a l . ^ The p r o p o s i t i o n below c h a r a c t e r i z e s o p t i m a l Proposition averse, 3.5.2. and g O ^ a ) = f ( x j j a ^ ) f ( x | a ) , i . e . , x^ and x V( Ea^). 2 I f a unique i n t e r i o r 2 and u 2 2 are c o n d i t i o n a l l y Suppose f u r t h e r that V(a) = s o l u t i o n i s optimal, a^ = a*, and u* = u^, where described solutions. Suppose the p r i n c i p a l i s r i s k n e u t r a l , the agent i s r i s k independent and i d e n t i c a l l y d i s t r i b u t e d . has unique i n t e r i o r then the o p t i m a l solution a r e the L a g r a n g i a n m u l t i p l i e r s earlier. T h i s r e s u l t i s independent of the u t i l i t y agent or the d i s t r i b u t i o n of x^ g i v e n f u n c t i o n of the r i s k - a v e r s e a^; the c r i t i c a l element i s that the outcomes a r e c o n d i t i o n a l l y independent and i d e n t i c a l l y d i s t r i b u t e d . This r e s u l t does n o t say that a l l agency problems such t h a t the p r i n c i p a l i s r i s k neutral, the agent i s r i s k averse, and the outcomes a r e c o n d i t i o n a l l y i n d e - pendent and i d e n t i c a l l y d i s t r i b u t e d have s o l u t i o n s o f a^ = a | and u£ = u|; t h i s i s evident the o p t i m a l from P r o p o s i t i o n 3.5.1. P r o p o s i t i o n 3.5.2 i n d i c a t e s that i f s o l u t i o n has the agent a l l o c a t i n g nonzero e f f o r t then the e f f o r t s should be equal at each task to each i f the t a s k s present task, indepen- 32 dent and i d e n t i c a l expected returns to the p r i n c i p a l . The following propo- s i t i o n , which applies to the one-parameter exponential family (see (2.7)), describes conditions under which an i n t e r i o r solution i s optimal. These conditions are s u f f i c i e n t but not necessary. Proposition 3.5.3. 8 (51 §L) = Suppose the p r i n c i p a l i s r i s k neutral, U(s) = f ( i ! ^ ) f ( 2 l 2^» x a x a w n e r where M(0) _> 0 and M'(a) > 0. e and f('|a) belongs to Q and has mean M(a), Suppose further that V(a_) = V ^ E a ^ . Let a* be the optimal e f f o r t i n the one-task problem, (i) If M(a) i s concave and z (a*)M'(a*)/[z (a*/2)M'(a*/2)] , < 1/2, (3.5.5) < 1/2, (3.5.6) , then a boundary solution i s not optimal. (ii) If M(a) i s s t r i c t l y concave and z'(a*)M'(a*)/[z (a*/2)M (a*/2)] , , then a boundary solution i s not optimal. In both cases, i f a unique inter- i o r solution i s optimal, then a^ = a*, and y£ = u^. As shown i n below i n Corollary 3.5.4, z'(a)/z'(a/2) i s often independent of a, and hence one need not actually solve for the optimal one-task effort. Corollary 3.5.4. Under the conditions i n Proposition 3.5.3 i f M(a) = ka and z'(a*)/z (a*/2) < 1/2, then an i n t e r i o r solution i s optimal. f In p a r t i c u l a r , ( i ) For the exponential d i s t r i b u t i o n with parameter l/(ka), an i n t e r i o r solution i s optimal (z'(a)/z'(a/2) = 1/4). ( i i ) For the gamma d i s t r i b u t i o n with parameters n/(ka) and n, an i n t e r i o r solution i s optimal (z (a)/z'(a/2) = 1/4). f The following cases do not s a t i s f y (3.5.5) but are included for purposes of comparison: ( i i i ) The Poisson d i s t r i b u t i o n with mean ka has z'(a)/z'(a/2) = 1/2. 33 (iv) The normal d i s t r i b u t i o n with mean ka and constant variance has z'(a)/z'(a/2) = 1. The normal d i s t r i b u t i o n should not, of course, s a t i s f y (3.5.5) i n view of Proposition 3.5.1. In each of the cases in Corollary 3.5.4 l i n e a r l y with the agent's e f f o r t s . the expected outcomes increase In case ( i i i ) , the variances of the out- comes also increase l i n e a r l y with the agent's e f f o r t s . variances of the outcomes are unaffected by the e f f o r t s . In case ( i v ) , the In cases ( i ) and ( i i ) , the variances of the outcomes increase quadratically with the e f f o r t s . A boundary solution i s optimal i n case ( i v ) , where the rate of increase i n the variance i s s t r i c t l y less than the rate of increase i n the mean. i n t e r i o r solution i s optimal i n cases ( i ) and An ( i i ) , where the rates of increase in the variances are s t r i c t l y greater than the rates of increases in the means. The following two propositions characterize optimal i n t e r i o r solutions when the means of the outcomes are l i n e a r i n e f f o r t . Proposition 3.5.5. g( l§.) x = Suppose the p r i n c i p a l i s r i s k neutral, U(s) = 2/s, f( lI i)f( 2I 2^» x a x a w n e r belongs to Q and has mean M(a). 2 e Suppose further that V(a_) = V(Ta ). ± If M(a) = ka and z " ( a ) / z ' (a) i s s t r i c t l y monotonic, then an optimal i n t e r i o r solution i s unique and a l = a 2 a n d Hi = ^* ^ e s t r * c t and has monotonicity i s s a t i s f i e d by the exponen- t i a l and gamma d i s t r i b u t i o n s (given that M(a) = ka), but not by the normal or Poisson d i s t r i b u t i o n s . Proposition 3.5.6. Suppose the p r i n c i p a l i s r i s k neutral, U(s) = 2/s g(x|a) = f(x^|a^)f(x2|a2), where f(.|a) has mean M(a). 2 V(a_) = V ^ E a ^ . If M(a) , and Suppose further that = ka and I'(a)/I (a) i s s t r i c t l y monotonic, where 1(a) = /fg/f' dx, then an optimal i n t e r i o r solution i s unique and has a^ = a^ and y* = p*. 34 1(a) i s called Fisher's information about a contained i n x, and i s a useful concept i n mathematical s t a t i s t i c s (see, e.g., Cox and Hinkley, 1974). The next corollary demonstrates that i n part, the shape of the expected outcome function determines whether the optimal solution w i l l be i n t e r i o r . Corollary 3.5.7. Under the conditions i n Proposition 3.5.3 i f M(a) = a", then an i n t e r i o r solution i s optimal i f f(.|a) i s ( i ) Normal (M(a),cr ) and 0 < ct<_ 1/2 or 2 ( i i ) Exponential (1/M(a)) and 0 < a < 1 or ( i i i ) Poisson (M(a)) and 0 < o< 1. It i s well known that knowing f / f i s equivalent to knowing the l i k e l i a hood of a given the observations. For the exponential family Q, f / f i s given by z'(a)(x-M(a)), where M(a) i s the mean of x conditional on a. a It i s z'(a) and M(a) which play an important role i n determining whether a boundary solution or an i n t e r i o r solution i s optimal. for This might be expected, the x^'s are not only outcomes, but also signals about the efforts that have been expended. x, z'(a) and M(a) Since f / f i s s u f f i c i e n t for the likelihood of a given a together measure, to a certain degree, the informativeness of x about a. It i s interesting to compare the results for the second best case with those for the f i r s t best case. In the f i r s t best case, (3.5.1) indicates that i f M(a) = ka, then whatever the d i s t r i b u t i o n of x given a, the p r i n c i pal w i l l be i n d i f f e r e n t between an i n t e r i o r solution or a boundary one, as long as the t o t a l amount of e f f o r t expended i s the same i n both cases. the second best case, however, Proposition 3.5.1 In says that i f the d i s t r i b u - tion i s normal with mean ka and constant variance, then a boundary solution is optimal. On the other hand, i f the d i s t r i b u t i o n i s exponential or gamma with mean ka, then an i n t e r i o r solution i s optimal (Corollary 3.5.4). 35 If, i n the f i r s t best case, the means are concave i n e f f o r t , then (3.5.1) indicates that an i n t e r i o r solution i s optimal, and the optimal efforts are equal i f the mean functions are i d e n t i c a l (M'(a^) = M'(a^) implies that a* = alSf). Corollary 3.5.7 indicates that for a s p e c i f i c second best case with concave means, a similar result concerning the optimality of an i n t e r i o r solution holds. The ally results up to this point have assumed that independent and i d e n t i c a l l y d i s t r i b u t e d . and x The next two 2 are condition- propositions deal with the case of conditionally independent but nonidentically d i s t r i buted x^'s. Proposition 3.5.8. Suppose the p r i n c i p a l i s r i s k neutral, U(s) = 2/s, and g(x|a) = f ( x | a ) h ( x | a ) , where f ( . | a ) and h(.|a ) belong to Q and 1 1 2 E(x |a=a ) = k a±. i i 2 1 Suppose further that V(a) ± 2 = V( Ea.^) . ( i ) If x^ has an exponential d i s t r i b u t i o n with mean k^a^, implies that a^ > a^ and u£ > then k^ > k > k 2 2 implies 2 u£ > uJ. ( i i i ) If x^ has a normal d i s t r i b u t i o n with mean k^a^ and constant then k u^. ( i i ) If x^ has a gamma d i s t r i b u t i o n with mean k^a^, that a^ > ai£ and then k^ > variance, implies that the optimal solution i s a boundary solution, with a* > 0 and a* = 0. (iv) If x^ has a Poisson d i s t r i b u t i o n with mean k^a^, then k^ > k 2 implies that the optimal solution i s a boundary solution with a^ > 0 and a^ =0. The following proposition states that at least for a s p e c i f i c second best case, the optimal Lagrangian multipliers are p o s i t i v e . Signing the multipliers i s of importance because of their c r i t i c a l role i n the determination of the optimal sharing rule. For example, i f the density of x^ given a^ s a t i s f i e s the monotone l i k e l i h o o d ratio property i n x^ for a l l i , then the p o s i t i v i t y of the u!s guarantees that the optimal sharing rule i s 36 increasing i n each x^. It can be shown that under c e r t a i n conditions i n the second best case, not a l l the yjs can be zero, and hence in the situations above where the optimal m u l t i p l i e r s y* are equal, they must be p o s i t i v e . Proposition 3.5.9. g(l§.) x = Suppose the p r i n c i p a l i s r i s k neutra 1, U(s) = l/s, f ( i | i ) h ( x 1 a ) , where f(.|a^) and h(.|a ) belong to Q. x a 2 further that V(a) = V(Ea^). and 2 2 and Suppose If an i n t e r i o r solution i s optimal, then y£ > 0 y* > 0. Note that i n Proposition 3.5.9, x^ and x 2 need not be i d e n t i c a l l y d i s - tributed, although they are conditionally independent. Furthermore, the result holds for general V(a), as long as 9V/3a^ > 0, i=l,2. 3.6 APPLICATION TO SALES FORCE MANAGEMENT In this section, the previous analysis of multidimensional tions i s applied to the problem of sales force management. effort situa- Steinbrink (1978) depicts the c r i t i c a l role of compensation of a sales force as follows: Any discussion with sales executives would bring forth a consensus that compensation i s the most important element i n a program for the management and motivation of a f i e l d sales force. It can also be the most complex. Consider the job of salespeople i n the f i e l d . They face direct and aggressive competition d a i l y . Rejection by customers and prospects is a constant negative force. Success i n s e l l i n g demands a high degree of s e l f - d i s c i p l i n e , persistence, and enthusiasm. As a result, salespeople need extraordinary encouragement, incentive and motivation i n order to function e f f e c t i v e l y . . . .A properly designed and implemented compensation plan must be geared to the needs of the company and to the products or services the company s e l l s . At the same time, i t must attract good salesmen i n the f i r s t place . . . Management of the sales force has been the focus of a great deal of research, much of i t empirical. Steinbrink (1978), i n a survey of 380 com- panies across 34 industries, found that most companies favored a combination of salary, commission, and bonus schemes. Typical commissions used were 1) Fixed commissions on a l l sales 2) Different rates by product category 37 3) On sales above a determined goal 4) On product gross margin. These commission schemes are a l l examples of linear sharing rules. Farley (1964), Berger (1975), and Weinberg (1975, 1978) problem of " j o i n t l y optimal" compensation schemes. studied the They assumed a given compensation system (a commission scheme based on gross margin) and sought to determine i f that system i s incentive compatible, meaning that the salesperson w i l l be induced to choose levels of e f f o r t which the company desires. In these analyses, the measure of e f f o r t i s taken to be time spent s e l l i n g . The t o t a l time available i s assumed to be fixed and the decisions are how to allocate the t o t a l time across several products. Farley (1964) demonstrated that i f a commission system based on gross margin i s used, the commission rates should be the same for a l l products i n the case where both the firm and the salespeople are income maximizers. Weinberg (1975) extended Farley's result to include the choice of discounts on each product as well as the choice of time spent s e l l i n g each product. Both papers assume that the time spent s e l l i n g one product does not affect the sales of any other product. Furthermore, sales are considered to be a deterministic function of time, although the conclusions are unaffected by uncertainty because of the assumed r i s k n e u t r a l i t y of both the firm and the salespeople. Weinberg (1978) maintained the assumption of r i s k neutrality of the salespeople, and further extended his and Farley's analyses by allowing for interdependence of product sales and relaxing the assumption that salespeople maximize income. Even i n these situations, an equal gross mar- gin commission system i s incentive compatible i f the firm's objective i s to maximize expected gross p r o f i t s . Berger (1975) examined the combined effects of uncertainty and non-neut r a l risk attitudes on the part of the salespeople. He retained Weinberg's 38 and Farley's assumption of constant marginal cost per product, but treated sales of each product as a random variable parameterized by the time spent s e l l i n g that product. Berger demonstrated that i f a commission scheme i s used i n this situation, i t may be undesirable for the firm to set equal com- mission rates for a l l products. The agency model allows for many of the important factors i n the sales force managment problem and provides a more complete analysis of the problem by determining an incentive scheme which motivates the salesperson to make decisions that are Pareto optimal, rather than taking the compensation scheme as given. framework.^ The problem w i l l therefore be examined below i n an agency Interdependence of products, for the salesperson provision of enough net benefits to join and stay with the firm, and also the salesper- son's tradeoff between money and time spent s e l l i n g are incorporated. connection In with t h i s , the t o t a l time spent s e l l i n g i n a given time period w i l l be a choice v a r i a b l e . In order to focus on the motivational rather than r i s k sharing aspects of the problem, i t w i l l be assumed that the salesperson i s risk averse, and the firm i s risk neutral and therefore desires to maximize expected p r o f i t . The agency theory analysis isolates conditions under which some sort of commission scheme i s Pareto optimal, and shows that even when a commission scheme i s Pareto optimal, the commissions are genera l l y unequal. In this analysis, e f f o r t w i l l be interpreted as "time spent s e l l i n g , " and n w i l l represent the number of products available to be sold. It w i l l be assumed that the salesperson has no i n t r i n s i c d i s u t i l i t y for s e l l i n g particular product, so that the d i s u t i l i t y function may V(Ea^), with V increasing and convex. The any be taken to be remaining notation w i l l be as defined previously, with x^ denoting the difference between sales revenue and variable noncompensation costs for product i.^ 39 Suppose that the p r i n c i p a l (firm) and the agent (salesperson) have identical beliefs. hired. This might be the case when the salesperson i s f i r s t Suppose also that the firm and the salesperson are i n a f i r s t best s i t u a t i o n , either because they are acting cooperatively or because the firm can perfectly observe the times spent s e l l i n g each product. Recall that i n the f i r s t best situation, i t makes no difference whether the p r i n c i p a l observes only the t o t a l outcome, x, or the vector of outcomes, x_. Since the firm i s r i s k neutral and the salesperson i s r i s k averse, the optimal sharing rules i s a constant salary c = U * ~ ^ ( l / X) for the salesperson, with the firm receiving the remainder, x-c. The firm requires that the salesperson choose sales e f f o r t so that 3E(x|a)/3a = 9E(x|a)/9a^, i i,j=l,...,n. (3.6.1) The interpretation i n the cooperative setting i s that the salesperson happ i l y supplies e f f o r t levels a^ s a t i s f y i n g (3.6.1) i n return for the salary c, since i n doing so, he or she receives the market u t i l i t y , u. In the per- fect observability setting, the firm pays the salesperson a salary c i f e f f o r t levels a^ s a t i s f y i n g (3.6.1) are exerted, and pays nothing otherwise. Observe that i n the f i r s t best case, the salesperson chooses e f f o r t levels according to their effect on mean outcome. Suppose E(x^) = M^c^a^, where represents the contribution margin (sales revenues minus variable noncompensation costs) per unit of product i , and c^a^ represents the expected quantity of product i that w i l l be sold i f e f f o r t a^ i s exerted. ^2 2» i C , e , » If t n e The analysis i n Section 3.5 then applies. If M^c^ = contributions per unit of time spent s e l l i n g are equal, the t o t a l e f f o r t expended i s the only concern. If M^c^ > M2C2, then the Pareto optimal strategy i s for the agent to devote e f f o r t only to the f i r s t product. Under a more general return structure, the efforts a^ and a2 w i l l 40 be nonzero and unequal. If the mean functions are i d e n t i c a l and nonlinear and monotone i n a^, then the optimal e f f o r t s are such that a^ = a « 2 In practice, a straight salary i s seldom used for salespeople because of imperfect observability and imperfect cooperation (moral hazard). such situations, a second best analysis i s appropriate. In Consider f i r s t the one-product case, i n which the i n t e r i o r portion of the sharing rule i s characterized by I g (*|a) _ U'(s(x)) " a X + g(x|a) » P where x and a are univariate and the subscript a denotes d i f f e r e n t i a t i o n with respect to a. Examples of s p e c i f i c sharing rules are provided i n Table I for two members of the HARA class of u t i l i t y functions and two members of the one-parameter exponential family Q. Sharing Rule Given g(x|a) = U(s) l s n (M(a)) exp[-x/M(a)], _1 s l/b X + > b > 1 M'(a) > 0 (2Tra) exp[-(x-M(a)) /2a ], M'(a)>0 ^ i^l(x-M(a*)) M (a*) 2 (^/(b-l) _1 X _ 2 V*«a*)M'(a*) <T + 2 £ M X </ ^ / ( b - l ) Table I. Examples i n One-Product Case Observe that when U(s) = l n s, the sharing rules shown (and others corresponding to d i f f e r e n t members of Q, the one-parameter exponential family) can be interpreted as a salary plus commission on the outcome x, a scheme commonly found i n practice. If the agent's u t i l i t y function i s a concave power function, then the resulting sharing rule i s a convex power function of a l i n e a r form. The compensation schemes which pay a salary plus bonus commis- sions (e.g., s(x) = m + m^x i f x < XQ, S ( X ) = m + m^x + m (x-xo) i f x > X Q ) 2 can be considered as approximations to these sharing r u l e s . 41 The case where n > 1 i s more complicated i f the agent's u t i l i t y t i o n i s a power function, since cross terms i n the x^'s appear. examine conditions under which i t i s optimal to use a func- In order to salary-plus-commission scheme, the agent's u t i l i t y function w i l l be taken to be U(s) = l n s, since this i s the only s i t u a t i o n i n which a linear scheme can be optimal (see Section 3.4). The examples below employ the normal d i s t r i b u t i o n because of i t s convenient representation for dependent random v a r i a b l e s . For purposes of i l l u s t r a t i o n , i t suffices to take n = 2. Suppose then that U(s) = l n s, n = 2, and that x ~ N(9(a),E(a)), where 9(a) = ( 9^(a), 9 (a)) and £(a) i s the covariance matrix I^2( a ) p(a) ^ ( a ) a (a) 2 \ 2 2 o^a) p(a) o^a) a (a) 2 \ / At this l e v e l of generality, the optimal sharing rule i s quite complicated (see Appendix 2). It i s not separable i n x^ and x , salary plus commission scheme. (independent 2 and therefore i s not a A c o r r e l a t i o n c o e f f i c i e n t which i s constant of a) i s not s u f f i c i e n t for the sharing rule to be a salary plus commission scheme, although p = 0 does lead to a sharing rule which i s additively separable i n x^ and x « 2 S u f f i c i e n t conditions for a salary plus commission scheme to be optimal are that both the c o r r e l a t i o n c o e f f i c i e n t 2 and the variances be constant, with P * 1. by The commissions are determined 2 p, marginal increases i n the means 9^(a) at a*, the variances and the multipliers u^. Three especially interesting results of the example above are: (1) In the case of the normal d i s t r i b u t i o n with log u t i l i t y , independence of the products i s enough to guarantee additive separability ( i n x^ and x) 2 of the optimal sharing rule, but i s not enough to guarantee that the optimal 42 sharing rule w i l l be a linear sharing r u l e . That i s , the optimal sharing rule i s not a (salary plus) commission scheme, l e t alone an equal commission rate scheme. (2) It i s not necessary for the products to be independent i n order for a salary plus commission scheme to be optimal, or for a separable sharing rule to be optimal. (3) The optimal commission rates are generally not equal across products. The agency analysis applied to the sales force management problem i n d i cates that only under very special circumstances Pareto optimal. i s a commission scheme In practice, of course, commission schemes are favored because of their simplicity and ease of application, as well as their recognized incentive e f f e c t s . salespeople who If commission rates are used with r i s k averse face uncertainty i n sales, the rates should most l i k e l y not be equal across products, according to the analysis above. The results i n Section 3.5 on the a l l o c a t i o n of additive e f f o r t with independent outcomes provide some further insights about optimal compensation schemes for salespeople. It should be recalled that most of the results i n Section 3.5 were proved only for U(s) = 2v^\ Thus, the remarks that follow are r e s t r i c t e d by the assumption of that p a r t i c u l a r utility function for wealth for the agent. A p r i n c i p l e commonly taught i n managerial accounting texts i s that under certainty, i n order to maximize p r o f i t s given one scarce factor of production, a firm should manufacture the product which returns the highest contribution margin per unit of the scarce factor (see, e.g., Horngren (1982, p. 373)). setting. This principle does not necessarily hold i n the agency In the f i r s t best case, i f the means are linear i n e f f o r t , then the p r i n c i p l e holds. In addition, Proposition 3.5.8 indicates that in the second best case, i f expected returns are linear In e f f o r t , then a l l the 43 agent's e f f o r t should be put into s e l l i n g the product with the highest expected return per unit of e f f o r t i f the underlying d i s t r i b u t i o n i s normal with constant variance, or Poisson. However, i f the underlying d i s t r i b u t i o n i s exponential, then more e f f o r t should be put into s e l l i n g the product with the higher expected return per unit of e f f o r t , but both e f f o r t s w i l l be positive unless the expected returns per unit of e f f o r t are very d i f f e r e n t . (See the discussion after Proposition 3.5.8.) For the exponential d i s t r i b u t i o n with E(x^|a^) = k^a^, the optimal sharing rule i s given by s ( x ,x ) = [ X + t j ( x - k a*) + 2 k If k^ > k2» then L l l a > ^ ( 1 k " 2 2*^ ^ " k x 2 a 2 2 a and a| > a^. Equation [3] i n the proof of Proposi- tion 3.5.8 shows that y*/a* 2 2 2 2 = u|/a^ . Therefore, M*/(k a* ) < p*/(k a* ). x This implies that when k^ i s greater than k 2 2 (the expected return per unit of e f f o r t i s greater for product one than for product two), the agent's compensation per unit of x^ (the return on product one) i s less than the compensation per unit of x . 2 Continuing with the exponential d i s t r i b u t i o n case, i f kj^ = k , then 2 a^ = a^ and ii£ = u| (Proposition 3.5.5). The agent's compensation per unit of x^ i s equal to the compensation per unit of x , and the sharing rule can 2 be written as 2y* s(x ,x ) = [ X + 1 j ( i x 2 k l * a + x 2 ) " 2 ]• 1 Thus, the information (x^,x ) has no value i n addition to x^ + x . 2 lar result holds for more general situations, also. 2 A simi- Proposition 3.5.2 says that i f the p r i n c i p a l i s r i s k neutral, the agent i s r i s k averse, V(a) = V(Ea^), the x^'s given a^ are independent and i d e n t i c a l l y distributed, and a unique i n t e r i o r solution (a*^ > 0, a*, > 0) i s optimal, then a^ = aij and u* = 44 U*,. Under these conditions, the agent's compensation per unit of x^ i s equal to the compensation per unit of x . 2 It i s important to note that i f 2 x^ given a^ has a normal d i s t r i b u t i o n with mean ka^ and variance o" , and the x^'s given a^ are independent, a boundary solution (e.g., a^ > 0, a^ = 0) i s optimal. In this case, the agent would receive no compensation based on x « 2 Up to this point, the focus has been on a single agent exerting multiple e f f o r t s . A related topic i s that of multiple agents, which i s pertinent here because a firm w i l l generally have more than one salesperson. Feltham (1977b) examined the use of penalty contracts when a l l the agents are ident i c a l , and Holmstrom (1982) showed that the effectiveness of group penalties w i l l be hampered by limited endowments of the agents, especially as the number of agents becomes large. An important question in the multiple agent problem i s whether or not each agent should be rewarded independently of the others' performances. Holmstrom (1982) showed that i f the agents' outcomes are correlated with each other through the common uncertainty they face, then basing agent i ' s share on each agent's outcome helps reduce the uncontrollable randomness i n agent i ' s reward. Holmstrom (1982, p. 335) stated that . . .forcing agents to compete with each other i s valueless i f there i s no common underlying uncertainty. In this setting, the benefits from competition i t s e l f are n i l . What i s of value i s the information that may be gained from peer performance. Competition among agents i s a consequence of attempts to exploit this information. Only aggregate information about peer performance i s used i n the optimal sharing rules i f the aggregate measure captures a l l the relevant information about the common uncertainty. Of course, i f the agents' outcomes are inde- pendent of one another, then the optimal sharing rule for agent i depends only on agent i ' s outcome. One of the t r a d i t i o n a l principles i n performance evaluation within the firm Is the p r i n c i p l e that a person should be held responsible only for 45 those factors (e.g., costs or revenues) over which he or she has control. Basing the sharing rule for agent i only on agent i ' s outcome i s c l e a r l y consistent with the c o n t r o l l a b i l i t y p r i n c i p l e . Basing the sharing rule f o r agent i on the outcomes of other agents when there i s common uncertainty i s , at f i r s t glance, inconsistent with the c o n t r o l l a b i l i t y p r i n c i p l e . However, the reason that the compensation for each agent may depend on the outcomes of other agents i s that the p r i n c i p a l can gain information about the random state, and hence gain information about the e f f o r t s expended by each agent. That i s , the p r i n c i p a l can gain information about each agent's input ( e f f o r t ) , over which the agent has direct control. Thus, there i s no con- f l i c t with the c o n t r o l l a b i l i t y p r i n c i p l e i n this case. The apparent con- f l i c t occurs because the focus of the c o n t r o l l a b i l i t y p r i n c i p l e has been transferred from outputs to inputs ( c f . Baiman (1982, pp. 197-198)). The last modification to the standard agency analysis for the problem of sales force management relates to noneffort decisions. Frequently, the salesperson must not only make several e f f o r t decisions, but also make several " r i s k " decisions which do not require expenditures of e f f o r t . The choices of discounts to offer on each product are examples of such r i s k decisions. Weinberg (1975, p. 938) i d e n t i f i e s the following situations i n which an agent might have control over the price: (1) perishable a g r i c u l t u r a l products; . . . (2) sales involving trade-ins i n which the salesman has control over the evaluation of the trade-in, e.g., automobiles; (3) systems s e l l i n g i n which the salesman has a wide range of latitude i n specifying the combination of services to be provided, e.g., contractors and consultants; (4) some r e t a i l situations i n which the l o c a l store manager has control over price of at least some of the items sold i n his store; (5) l i q u i d a t i o n sales of obsolete product lines or r e t a i l e r d i s tress sales; and (6) highly competitive markets i n which customers are price bargainers . . . . One approach to the problem of incorporating both r i s k and e f f o r t decisions was taken by Itami (1979), who examined optimal linear goal-based incentive schemes under uncertainty. In his model, a r i s k decision i s made 46 by the agent before the state of nature i s observed. The agent then chooses an e f f o r t l e v e l based on the r i s k decision and the observed state of nature, resulting i n a deterministic output which i s a function of the agent's two decisions and the state of nature. For example, the d i v i s i o n a l manager of a large corporation might make investment decisions on projects before the environmental conditions are revealed. The e f f o r t expended and the known state then determine the output. The simplest agency theory approach to the problem of incorporating both r i s k and e f f o r t decisions i s to assume that both of the agent's decisions are made before the state of nature (or any other information) i s observed. This approach w i l l now be b r i e f l y discussed. optimal sharing rule i s derived rather than assumed. The form of the Furthermore, because risk-sharing aspects are important i n this setting, both the p r i n c i p a l and the agent are assumed to be r i s k averse. As Itami points out, there i s a direct and an indirect effect of the agent's e f f o r t on his or her u t i l i t y , while there i s only an indirect effect from the r i s k decisions. Up to this point, i t has been assumed that the agent's u t i l i t y i s separable i n e f f o r t and wealth. This assumption leads to a characterization of the optimal sharing rule that i s independent of the agent's d i s u t i l i t y for e f f o r t , although the indirect effects of e f f o r t expended are captured v i a the terms g /g. The more general u t i l i t y func- Si • J tion U(s(x),a) for the agent leads to a characterization of the optimal sharing rule that captures both the direct and Indirect effects of the agent's e f f o r t . for When there are no r i s k decisions, optimality requires that i n t e r i o r solutions, TW'(x-s(x)) TI/ / w U (s(x),a) j j = s X + y [ Ua..s (s(x),a) gB (xia) - '^ \ _ i _ / U (s(x),a) g(x|a) * v a + s 1 47 where the subscripts on U and g denote d i f f e r e n t i a t i o n with respect to a j , and the subscripts s denote d i f f e r e n t i a t i o n with respect to s. The major implication of nonseparability of the u t i l i t y function i s that the role of e f f o r t i s e x p l i c i t , as i s interaction between e f f o r t and compensation. It i s s t i l l true that i f the agent i s r i s k neutral, then the f i r s t best solution i s achievable by a sharing rule of the form x-k. Because the important distinguishing feature of e f f o r t decisions i s their twofold e f f e c t on the u t i l i t y function, the general form of the u t i l i t y function i s used here. Letting r^ denote the r i s k decision for task i and r = ( r ^ , . . . , r ), the p r i n c i p a l ' s problem is''' Maximize s(x),a,r / W(x-s(x))g(x|a,r) dx subject to / U(s(x),a)g(x|a,r) dx > u -Tr— / U(s(x) ,a)g(x|a,r)dx j / U(s(x) ,a)g(x|a,r)dx :A = 0, j=l,...,n :u. = 0, j=l,...,m. : 3. To the right of the constraints above are their associated m u l t i p l i e r s . The i n t e r i o r portion of the optimal sharing rule i s characterized by W'(x-s(x)) U a s ( 5 8 a ( 5 8 r. ( 5 It should be noted that there i s an Implicit assumpti on that the r.» s do not 1 s a t i s f y f i r s t - o r d e r stochastic dominance, since otherwise the p r i n c i p a l and the agent would agree on the choices of the r^'s and there would be no incentive problem with respect to the r ^ ' s . Suppose next, as Weinberg (1975) did, that the gross margin generated by sales of product I i s given by x i = P i ( 1 - r i ) Q i " i i' K Q w h e r e ^ = nominal s e l l i n g price per unit of product i , r^ = discount (decimal) on product i , 48 = quantity (units) of product i sold, = variable nonselling cost per unit of product i , and M£ = P j ^ l - r ^ ) - = gross margin on product i . Weinberg (1975) sought to determine i f an equal-commission scheme i s incen- tive compatible when there both r i s k and e f f o r t decisions. An agency theory analysis suggests that such a scheme i s not Pareto optimal. Q ± ~N( 9 ( a , r ) , a ( r ) ) . i i (P^l-r^-K pal 1 i i Then x ± ~ N(P (l-r )-K ) 8 ( a 1 i i i 1 Suppose ,r ), ± ) cr^(r ) ) . Previous analysis indicates that i f both the p r i n c i 2 and the agent are r i s k averse with u t i l i t y functions i n the HARA class, then the optimal sharing rule i s i n general not a d d i t i v e l y separable i n and x « 2 V(a), If the p r i n c i p a l i s r i s k neutral and the agent's u t i l i t y i s l n s - then previous remarks concerning the optimality of a commission scheme i n the normal d i s t r i b u t i o n example with no r i s k decisions apply. Demski and Sappington (1983) examined the s i t u a t i o n i n which i t i s desired to motivate an individual to obtain and use information which i s personally costly ( i n a pecuniary or nonpecuniary to obtain. Their analysis may sense) for the i n d i v i d u a l provide insights for the sales force manage- ment problem when the salesperson has the option or the a b i l i t y to observe private information before making r i s k decisions. 3.7 SUMMARY AND DISCUSSION This chapter derived optimal incentive schemes when the agent has sev- eral tasks over which to exert e f f o r t , and the p r i n c i p a l and the agent have homogeneous b e l i e f s about the outcome d i s t r i b u t i o n . In the f i r s t best case, where there i s no moral hazard problem, the major issue i s r i s k sharing, and the results are similar i n nature to the one-dimensional e f f o r t case. If one individual i s r i s k neutral and the other i s r i s k averse, then the r i s k neutral individual bears a l l the r i s k , receiving the uncertain outcome less a constant fee. If both individuals are r i s k averse, then the r i s k sharing 49 aspect i s prominent; even i f the disaggregated information, x = ( x ^ , . . . , x ) , n i s observed, the sharing rule depends only on the sum of the x^'s. In the analysis of the second best case, where there i s a moral hazard problem, the p r i n c i p a l was assumed to be r i s k neutral i n order to focus on motivational issues. As i n the single-task case, the f i r s t best solution i s achievable when the agent i s r i s k neutral. When the agent Is r i s k averse, the optimal sharing rule can be as simple as a salary plus commission, or can be more complicated, depending on the d i s t r i b u t i o n of the outcomes and the agent's u t i l i t y function. In general, i t i s much more d i f f i c u l t to determine when the sharing rule w i l l be increasing i n each outcome, x^, than in the single-dimensional e f f o r t and output case. the sign of each of the Lagrangian m u l t i p l i e r s There are two reasons: must be determined, the question of multivariate stochastic dominance must be addressed. and Each of these problems can be analyzed only i n special cases. The analysis of the value of additional information i s also more complicated than i n the single-dimensional e f f o r t and output case. The applic- a b i l i t y of the results of Amershi (1982) for the multidimensional e f f o r t case was discussed. The use of additional disaggregated information was demonstrated by means of examples. It was shown that i n the case where a salary plus commission scheme i s optimal, the commissions related to each task w i l l generally be unequal. The next question addressed was whether a manager should receive separate rewards for the outcomes from the different tasks. It was shown that a strong form of independence (see (3.4.1)) i s neither necessary nor s u f f i cient for an optimal sharing rule to be additively separable i n the outcomes. If the p r i n c i p a l i s r i s k neutral and the agent's u t i l i t y function i s in the HARA c l a s s , then a necessary condition for additive separability of the optimal sharing i s that the agent have a log u t i l i t y function. If the 50 p r i n c i p a l and the agent are i d e n t i c a l l y r i s k averse, with i d e n t i c a l log utility functions, then the optimal sharing rule i s not additively separ- able. The remainder of this chapter focused on situations i n which effort i s additive, as when e f f o r t represents time devoted to differen t tasks. optimal sharing rules i n the f i r s t best and second best situations were examined under various assumptions about the means and distributions outcomes. The of the In the additive e f f o r t case where there i s no i n t r i n s i c d i s u t i l - i t y for any particular task, i t i s of interest to determine whether the p r i n c i p a l can most e f f i c i e n t l y induce a r i s k averse agent to allocate a l l effort to one task, or to d i v e r s i f y by allocating e f f o r t to each task. section showed that the nature of the outcome d i s t r i b u t i o n factor i n determining whether the optimal solution i s an important w i l l be boundary ( a l l e f f o r t devoted to one task) or i n t e r i o r (effort spread across tasks). c r i t i c a l factor, however, appears to be the relationship expended and the mean of the d i s t r i b u t i o n . mal i n t e r i o r solution This The between e f f o r t Conditions under which an o p t i - i s unique were found, and i t was shown that i f an optimal i n t e r i o r solution i s unique, then the optimal efforts for both tasks are equal, as are the Lagrangian multipliers u^. The additive e f f o r t results were applied to the sales force management problem. As remarked earier, simple commission schemes are rarely Pareto optimal; even when they are optimal, the commissions are generally not equal across products. However, i f the p r i n c i p a l i s r i s k neutral, the agent i s risk averse, V(a) = V(Ea^), the x^'s given a^ are independent c a l l y distributed, and a unique i n t e r i o r solution and i d e n t i - (a'J > 0, a*, > 0) i s o p t i - mal, then the agent's compensation per unit of x^ i s equal to the compensation per unit of x . 2 The multiple salesperson firm was b r i e f l y discussed, 51 as was the addition of r i s k decisions (not involving e f f o r t ) by the s a l e s - person. It has long been recognized that dysfunctional behavior on the part of managers can be induced by their focus on short-term personal goals rather than long-term company goals. Moreover, the company may unwittingly pres- sure managers to make decisions which w i l l increase short-term the expense of long-term goals. One p r o f i t s at of the obvious aspects of a solution i s to extend the performance evaluation period from, for example, one year to several years. A brief comparison of the a l l o c a t i o n of e f f o r t problem and a multiperiod problem follows. Consider the s i t u a t i o n i n which the agent chooses one action a^ i n each of n time periods, resulting i n monetary outcomes x^ which are observed by both the p r i n c i p a l and the agent at the end of period i . The agent's action in any period and the sharing rule for each period can then depend on the outcomes from previous periods. izon w i l l be considered here. For ease of exposition, the two-period horThe p r i n c i p a l ' s u t i l i t y for the two-period horizon is now W(x^-s^(x^),x ~s (x^,x )), 2 2 U(s (x ) , s ( x x ) ,a ,a (x )) . 1 1 2 1 } 1 2 2 ,x ,a Let g ^ 1 ,a (x )) = l 2 2 1 The p r i n c i p a l ' s problem i s then h(x |a^,x^,a (x^))f(x^|a^). 2 and the agent's u t i l i t y i s 2 2 Maximize / J w ^ - s ^ x ^ , x - s ( x , x ) ) g ( x , x , a . a ^ x ^ i' i 2 2 1 2 L 2 l )dXjdx (3.7.1) 2 v subject to /]b(s (x ),a ,s (x ,x ),a (x ))g(x ,x ,a ,a (x ))dx dx 1 1 1 2 1 2 2 1 1 2 1 2 1 and a^ and a2(«) maximize the left-hand side of Let E 2 denote expectation with respect to h ( . ) . rules are characterized by 1 2 > u (3.7.2) (3.7.2). The optimal sharing 52 2 \ E < ll l> f x a i a (E2 + l^C^) < V 2 2 E D B a ) 1 1 a » -ff-jj 2 almost every f o r x, l 8 l and W s S 8 2 , . * h U S a ( 0 h l g( .) a + + . . , Wj( ]_) a a % x h ( 0 2 () > t almost every (x^, x ) , f o r 2 2 special cases are of i n t e r e s t . Two The f i r s t i s the case i n which the p r i n c i p a l ' s and agent's u t i l i t i e s are additive over time, with discount factors 3 and ot, respectively. Suppose the p r i n c i p a l and the agent agree on the contract at the beginning of the two-period horizon and each individual is committed to the contract for the entire time horizon. Then the p r i n c i - pal's expected u t i l i t y i s j W ( x - s ( x ) ) f ( x |a )dx 1 1 1 1 1 + 3/JW( 2 2^ 1 » 2 ^ ^ ( x 1 -s x x . x 1 x t&i , a ( ) ) d x d x , x 2 1 2 1 2 where g(.) = h(x 1 a^ ,x , a ( x ) ) f (x-j^ l a ^ . 2 l 2 1 The agent's expected u t i l i t y i s / b ( s ( x ) ) f ( x |a )dx 1 1 L 1 1 + cx//U(s (x , x ) ) g ( x ,x ,a , a ( x ) ) d x d x 2 1 2 1 2 L 2 1 1 2 - V^) - a/v(a (x ))f(x |a ) dxj. 2 1 1 1 The i n t e r i o r portions of the optimal sharing rules are characterized by woy'i^i)) U'Cs^x,)) f 1 = X + + "l i( il i> x . a n fCxJa,) ( 3 ' * 7 3 ) and s z a W ( x 2 ~ s (Xj_ ,x ) ) g ( x ,x ,a ,a ( x ) ) 2 2 x x 2 x 2 L = A + u, U'(s (x ,x )) 2 1 i 2 g(x ,x ,a ,a (x )) 1 2 1 2 1 g (x ,x ,a ,a ,(x )) 2 + ^ X l > g 1 2 1 2 1 (x ,x ,a ,a (x )) • 1 2 1 2 > 1 ( 3 * ' 7 where the subscripts j on the d i s t r i b u t i o n s indicate p a r t i a l derivatives with respect to a^ 4 ) 53 Equation (3.7.A) i s similar to the characterization for single-period sharing rules i n the multidimensional e f f o r t problem. Thus, the multidimen- sional e f f o r t results described e a r l i e r are useful i n extending to a certain class of f i n i t e - h o r i z o n multiperiod problems. the theory Lambert (1981) has examined the model above under the assumption that e f f o r t i n one period has no effect on the outcome i n any other period, and also examined problems which occur when the p r i n c i p a l i s committed to a two-period contract, but the agent can leave the firm a f t e r the f i r s t period. The second special case of interest i s the case i n which the p r i n c i pal's and the agent's expected u t i l i t i e s depend only on the t o t a l return over the entire time horizon. In this case, the p r i n c i p a l ' s u t i l i t y tion i s W(x^ + %2 ~ and the agent's u t i l i t y i s ( ^ ( x ^ ^ ) , a l' 2^*^* a T n ^ s s(x^,X2)), func- structure i s also appropriate for the problem of sequential a l l o c a t i o n of e f f o r t within one time period, where the time period i s said to end when the agent receives his or her compensation. The sequential a l l o c a t i o n aspect would arise because of the agent's opportunity to observe an outcome affected by the f i r s t e f f o r t choice before making any other e f f o r t choices. This s i t u a t i o n i s the focus of the next chapter. 54 CHAPTER 3 FOOTNOTES 1. Although there are technical problems connected with the use of the normal d i s t r i b u t i o n , i t i s used here for i l l u s t r a t i v e purposes because i t i s the only d i s t r i b u t i o n with a convenient representation for dependent random v a r i a b l e s . Detailed calculations and r e s u l t s for the normal d i s t r i b u t i o n appear i n Appendix 2. 2. A modified version of this result holds when the p r i n c i p a l i s r i s k averse. D i f f e r e n t i a t i n g the f i r s t - o r d e r condition characterizing the sharing rule with respect to x shows that , , , 3 a sign (s'(x)) = sign ( ^ — f ~ W"U\ 2~^ * 3 a Thus, V -K- j— > 0 implies that s'(x) > 0, but the converse does not hold. ~~ f 3. As Holmstrom (1979) points out, i f the production function x i s given by x(a, 6), where 0 represents a random state of nature, then 3x/3a > 0 implies that the d i s t r i b u t i o n of x s a t i s f i e s the f i r s t - o r d e r stochastic dominance property (provided that changes i n a have a n o n t r i v i a l effect on the d i s t r i b u t i o n ) . 4. Extending this and the other propositions which depend on the assumption of a square root u t i l i t y function to a more general class of u t i l i t y functions appears to be n o n t r i v i a l . However, i n the discrete-outcome example presented e a r l i e r , the result i s not confined to only the square root u t i l i t y function. Hence, i t appears l i k e l y that this and the other results stated for the square root u t i l i t y function hold for a more gene r a l class of u t i l i t y functions. 5. Lai (1982) also independently applied agency theory to the problem of sales force management. Much of his analysis i s for a special normal d i s t r i b u t i o n and the class of power u t i l i t y functions. He did not analyze the additive e f f o r t case. 6. Let p be the constant sales price of a product, and c be the constant noncompensation cost per unit of product. Further, l e t q be the random quantity sold as a result of the agent's e f f o r t . One question of i n t e r est i s whether the agent's compensation should be based on, for example, sales (pq) or a "contribution margin" (pq-cq). It i s easy to see that ! f (q|a*) the optimal sharing rule i s characterized by TTTT—/ w = A + u ——,—• . . . U (s( •)) f(q|a*) That i s , the optimal sharing rule does not depend e x p l i c i t l y on p or c or p-c. a 7. The formulation i s presented i n order to i l l u s t r a t e the structure of the problem. Technical problems with the properties of the functions to be maximized are not addressed. 55 CHAPTER 4 ONE-PERIOD SEQUENTIAL CHOICE In this chapter, the model i s expanded to include decisions made at different times. The extension i s to sequential decisions within one period, where a period i s defined to end at the time of payment to the agent. The one-period sequential case i s an intermediate step between the a l l o c a t i o n of e f f o r t case, i n which both the e f f o r t s are exerted before the outcomes are known, and the two-period case, i n which the f i r s t outcome i s observed and the f i r s t compensation Is paid before the second e f f o r t i s exerted. The a l l o c a t i o n and sequential situations can be depicted as follows: Allocation of e f f o r t : Agent exerts Principal chooses s ( l>*2) x a l » A P r i n c i p a l and agent observe xi,x > p r i n c i p a l 2 pays s(x^,x ) to the agent. 2 2 One period sequential choices: Principal chooses s(x ,x ) 1 2 Agent exerts a Agent observes l x Agent exerts a (0 l 2 Agent observes X 2 ; p r i n c i p a l observes x^ and X 2 and pays s ( x i , x ) to the agent 2 Two-period sequential choices: Principal chooses s s l( l) x a n d 2^ l» 2 x x Agent exerts a l P r i n c i p a l and agent observe x^; p r i n c i p a l pays s ( x ) the agent. 1 ) 1 Agent exerts a (0 2 P r i n c i p a l and agent observe X 2 ; p r i n c i p a l pays S 2 ( x ^ , X 2 ) to the agent. In each of the cases above, i f the p r i n c i p a l and the agent observe addit i o n a l valuable post-decision information about the agent's e f f o r t s , then the sharing rules w i l l depend on t h i s additional information. 56 A number of situations might be modeled i n the one-period sequential framework. In the sales force management example, the agent might spend a certain amount of time s e l l i n g products i n one t e r r i t o r y and observe the amount of the resultant sales there before beginning work i n another t e r r i tory. If there i s c o r r e l a t i o n between x^ and x , 2 information from x^ which may then the agent obtains be useful i n the decision about a2« t i o n a l post-decision information that the p r i n c i p a l may agent's e f f o r t s might be comments obtained The addi- obtain about the from personally Interviewing the agent's customers. Another one-period sequential decision setting might involve production decisions by an agent, where a^ i s the number of hours of production some sales information i s obtained. of hours of production The agent would then choose the number for the remainder of the period. the additional post-decision information obtained the number of work hours recorded until In t h i s s i t u a t i o n , by the p r i n c i p a l might be on the agent's time cards. More gener- a l l y , a manager i n a decentralized organization w i l l not be monitored d a i l y , but rather w i l l make many decisions during a given time period and w i l l be evaluated only p e r i o d i c a l l y . The one-period sequential model can be thought of as the special case of the f u l l y general two-period model In which the periods are very short, so that the p r i n c i p a l ' s and the agent's expected u t i l i t i e s depend only on their t o t a l return for the entire horizon. The one-period model can incor- porate some of the elements of the f u l l y general two-period model while providing a somewhat s i m p l i f i e d structure for analysis. For example, i n both models, the f i r s t outcome, which i s f i r s t - s t a g e post-decision information, can be used as pre-decision information for the second e f f o r t choice. The agent's precommitment to stay for the entire time horizon i s not a major 57 problem In the one-period model, since the agent i s not paid u n t i l a l l the required e f f o r t s have been exerted. In the f i r s t part of this chapter, the s i m p l i f i e d structure i n the oneperiod sequential model i s used to explore the impact of c o r r e l a t i o n of outcomes i n f i r s t best and second best s i t u a t i o n s . to the a l l o c a t i o n of e f f o r t r e s u l t s . Some comparisons are made The analysis w i l l focus on aspects which were not addressed i n the pre-decision information l i t e r a t u r e or i n Lambert's (1983) analysis of a f i n i t e - h o r i z o n multiperiod agency problem with independent outcomes. for The second part of t h i s chapter develops results the one-period sequential problem that p a r a l l e l two sets of r e s u l t s i n the a l l o c a t i o n of e f f o r t problem, namely additive s e p a r a b i l i t y of the sharing rule and d i v e r s i f i c a t i o n of e f f o r t across tasks when e f f o r t i s additive. The s i m i l a r i t i e s to and differences from the a l l o c a t i o n r e s u l t s are d i s cussed. Before proceeding to the analysis, a b r i e f review of the existing results on pre-decision information w i l l be given and Lambert's (1983) r e s u l t s w i l l be summarized. Unless otherwise stated, the "sequential e f f o r t problem" w i l l refer to the one-period sequential e f f o r t problem. A limited amount of research has been devoted to one-period agency problems with pre-decision information. Baiman (1982, p. 192) comments as follows on the increased complexity with pre-decision information: The role and value of a pre-decision information system i s more complex than that of a post-decision information system. Expanding a post-decision Information system to report an addit i o n a l piece of information w i l l always r e s u l t i n at least a weak Pareto improvement, since the p r i n c i p a l and agent can always agree to a payment schedule that ignores the additional informat i o n . However, expanding a pre-decision information system to report an additional piece of information may not r e s u l t i n even a weak Pareto improvement. The agent generally cannot commit himself to ignore the additional information, and therefore the optimal employment contract without the additional pre-decision information i s no longer necessarily self-enforcing given the additional information. This i s true whether the additional predecision information i s p r i v a t e l y reported or p u b l i c l y reported. 58 Some of the research concerning pre-decision Information focuses on the following question: Given that the agent has private pre-decision informa- t i o n , what i s the value of public post-decision information systems? Holmstrom (1979) showed that an informativeness c r i t e r i o n (f(x,y,z;a) * g(x,y)h(x,z;a), where z i s the pre-decision signal) i s necessary for the post-decision information system which reports a public s i g n a l , y, i n addition to x, the outcome, to provide a Pareto improvement over the information system which reports only x. Christensen (1982) expanded Holmstrom's (1979) model by allowing the agent to communicate to the p r i n c i p a l a message m about the private pre-decision s i g n a l . The agent i s assumed to select the message that maximizes h i s or her expected u t i l i t y . In Christensen's model, a generalization of Holmstrom's (1979) informativeness c r i t e r i o n i s necessary for the post-decision information system which reports y, i n addition to x and m, to provide a Pareto improvement over the information system which reports only x and m. Here, the public post-decision signal i s a s i g - nal about the agent's e f f o r t and the agent's private pre-decision information signal. Another d i r e c t i o n of the research on pre-decision information has been the value of pre-decision information systems. There are both positive and negative effects of private pre-decision information for the agent. On one hand, the agent has more information before choosing an action, and hence should make "better" decisions. On the other hand, more information may reduce the r i s k the agent faces, and hence reduce the motivation for the agent to exert e f f o r t . Christensen (1981) provided an example which shows that the p r i n c i p a l may be worse o f f when the agent has private pre-decision information (with or without communication of^a message), and also provided an example which shows that the p r i n c i p a l may be better o f f when the agent has private pre-decision information and communicates a message to the prin- 59 cipal. Christensen's examples i l l u s t r a t e the d i f f i c u l t y i n obtaining a gen- e r a l preference ordering r u l e over pre-decision information systems. A t h i r d d i r e c t i o n of research on private pre-decision information has been the value of communication of a message about the private information from the agent to the p r i n c i p a l , given the existence of the private pre-dec i s i o n information system. In the accounting context, the focus i s on the value of communication of private information i n the process of p a r t i c i p a tive budgeting. (1983), who The major result i n t h i s area i s that of Baiman and Evans provided necessary and s u f f i c i e n t conditions for communication to result In a Pareto improvement. Baiman (1982, p. 204) summarizes the result as follows: . . . If the agent's private pre-decision information i s perfect, then communication has no value. Observing the firm's output i n that case allows the p r i n c i p a l to i n f e r a l l he needs to know about the agent's private pre-decision information. However, i f the agent's private pre-decision information i s imperfect, a necessary and s u f f i c i e n t condition for communication to be s t r i c t l y valuable i s for the honest revelation of the agent's private pre-decision information to be s t r i c t l y valuable. That i s , i f any value can be achieved with the information being hone s t l y revealed to a l l , then a s t r i c t l y positive part of that value can be achieved by giving the agent sole d i r e c t access to the information and l e t t i n g him communicate i n a manner that maximizes h i s expected u t i l i t y . Lambert (1983) has examined a special case of the f i n i t e - h o r i z o n multiperiod agency problem. He assumed that both the p r i n c i p a l and the agent have u t i l i t y functions (and that the agent has a d i s u t i l i t y function) which are separable across time. independently He further assumed that the state variables are distributed across time, and that e f f o r t i n one period does not influence the monetary outcome i n any other period. Under these condi- tions, Lambert showed that the agent's compensation i n a given period w i l l depend on the outcomes i n previous periods as well as on the outcome i n the present period. He further showed that the incentive problems associated with the agent's e f f o r t choices i n each period are not eliminated. In the 60 notation of t h i s chapter, the result can be stated as (I) (ii) ^ ( x ^ > 0 for almost every x 1 > 0, and ( f i r s t - s t a g e outcome). The remainder of t h i s chapter analyzes the one-period sequential e f f o r t choice problem. The cooperative, or f i r s t best case i s f i r s t considered, and the behavior of the agent's second-stage acterized. e f f o r t choice strategy i s char- The second best case i s then analyzed. The optimal sharing r u l e i s derived and discussed, as i s the behavior of the agent's second-stage choice strategy, with and without independence of the outcomes. It i s then shown that the optimal sharing rule w i l l not be a d d i t i v e l y separable i n the outcomes, even under the conditions which were s u f f i c i e n t for such a r e s u l t in the e f f o r t a l l o c a t i o n problem. F i n a l l y , the special case of additive e f f o r t i s analyzed, and the question of the d e s i r a b i l i t y of d i v e r s i f i c a t i o n of the agent's e f f o r t s across tasks i s examined. The result i s related to the information content of the outcome about the agent's 4.1. FIRST BEST In the f i r s t best case, the p r i n c i p a l ' s problem i s : Maximize / / W(x-s(x^ ,x ) ) <f>(x^ ,x s( •) , a , a ( •) 2 1 2 ,a2( •) ) dx dx^ 2 2 subject to / / {U(s(x ,x )) - V ( a , a ( •))}•( •)dx dx 1 where effort. 2 t 2 2 1 > u, ,x 1 a^ ,a ( •) ) = f (x^^ | a^)g(x \x^ ,a^ ,a ( •) ) and a ( 0 2 2 2 2 2 indicates that e f f o r t i s i n general not a constant, but rather can the agent's second-stage depend on any information available at the time of choice. Letting X be the m u l t i p l i e r for the agent's expected u t i l i t y constraint and d i f f e r e n t i a t i n g the Hamiltonian with respect to s( •) for every (x^,x ) y i e l d s 2 W'(x-s( ,x )) Xl 2 U'(s( ,x )) X l 2 = X 61 for almost every ( x , x ) . 1 This implies that i f one person i s r i s k neutral 2 and the other i s r i s k averse, then the r i s k neutral person w i l l bear the r i s k (see Appendix 4). That i s , i f the p r i n c i p a l i s r i s k neutral and the agent i s r i s k averse, then the optimal sharing rule Is constant; i f the p r i n c i p a l i s r i s k averse and the agent i s r i s k neutral, then the principal's return i s k, a constant, and the agent receives x^+x ~k. 2 are If both individuals r i s k averse, then the r i s k i s shared; the optimal sharing rule i s a func- tion of (x^+x ). Furthermore, 2 9s/9x^ i s positive f o r i = 1,2. Finally, i f both are r i s k neutral, then the optimal sharing rule i s s = u + V(a^,a (»)). 2 These r e s u l t s are the same as those f o r the a l l o c a t i o n of e f f o r t problem. Thus, i n the f i r s t best case, the sequential nature of the e f f o r t decisions does not a f f e c t the characterization of the optimal sharing r u l e s . In this scenario, there are no signals on which the choice of a^ can be based. Whether or not a i s a function of x^ depends on the r i s k attitudes 2 of the individuals and the j o i n t d i s t r i b u t i o n <|>(x^ ,x |a^ ,a ( • ) ) . 2 2 If at l e a s t one of the individuals i s r i s k neutral and (jKxj^ ,x |a^ ,a ( •) ) = f (x^ |a^)g(x | a ( •) ) , then the optimal a ( •) i s indepen2 2 dent of x^. 2 2 2 In this case, the r i s k neutral person e s s e n t i a l l y owns the out- put of the firm, and thus bears a l l the r i s k associated with the uncertainty of x^. Furthermore, x^ conveys no information about x . 2 If both of the individuals are r i s k averse or i f <)>(•) i s the more gene r a l f ( x j j a ^ ) g ( x | x ^ , a ^ , a ( • ) ) , then the optimal a ( •) w i l l generally depend 2 on x^. ual 2 2 In the f i r s t case, the change from the s i t u a t i o n where one i n d i v i d - i s r i s k neutral occurs because each r i s k averse individual's marginal u t i l i t y depends on the f i r s t outcome, since i t determines where on his or her u t i l i t y curve the Individual i s ; a r i s k neutral individual's marginal u t i l i t y , on the other hand, would be the same no matter what the value of x^ is. This f i r s t e f f e c t of x-i can be termed the "wealth" or " r i s k aversion" 62 effect. for x 2 In the second case, i f x^ and x 2 are dependent, then may change according to the f i r s t outcome, x^. therefore wish to induce the agent to choose a («) 2 decreasing expectations The p r i n c i p a l may as an increasing or function of x^, depending on the r i s k attitudes of the p r i n c i p a l and the agent, the agent's d i s u t i l i t y for e f f o r t , and the nature of the corr e l a t i o n between x^ and x . This second effect of x^ can be termed the 2 "information" e f f e c t . The information effect of x^ i s made more precise i n the proposition below. Proposition 4.1.1. Suppose that i n the f i r s t best case, the p r i n c i p a l i s r i s k neutral, the agent i s r i s k averse, and <{>(•) = f(x^ |a^)g(x2 |x^ ,a^ ,a ( • ) ) . 2 In this case, a ( •) w i l l depend on x^. Let M^( •) denote the mean of x^ 2 given a^, and l e t M2(x^,a^,a2( •)) denote the conditional mean of x respect to g(»). Let the second and third subscripts of j on M t i a l d i f f e r e n t i a t i o n of M 2 2 2 with denote par- with respect to the j-th argument of M ( x , a , a ( •))• Then 2 1 1 2 a * « ( ) = -M /[M233 " A[ 9 V( •) / 9a ] ] • 2 Xl 2 231 For example, suppose M2(x^,a^,a2( •)) = x^^Cx^/a^ and V( •) = (a^+a2) . Then a*,'(x^) = l/(2a^A) > 0. In this case, a^(x^) increases l i n e a r l y i n x^. The e f f e c t of the nature of the correlation between x^ and x 2 i s cap- tured i n the derivatives of M ( 0 » and the effect of the d i s u t i l i t y function 2 2 2 i s captured i n the 9 V/9a 2 term. Note that a*,(x^) does not depend on the agent's u t i l i t y function for wealth. This i s because the r i s k averse agent receives a constant wage i n the f i r s t best case, and hence the agent's u t i l i t y for the wage i s constant. Note further that i f M 2 depends only on a (*)> then a*, i s constant. 2 Proposition 4.1.1 and the discussion preceding e f f o r t choice's dependence on x^, the f i r s t outcome. i t focused on the second The second e f f o r t choice, a„(»), might seem to also depend on the f i r s t e f f o r t choice, aj_. 63 However, the agent chooses the e f f o r t a^ and the e f f o r t strategy a^(. •) simultaneously at the beginning of the time horizon. The second e f f o r t choice i s therefore not viewed as a function of a^, although there i s i m p l i c i t recognition that a^ and a ( ") chosen j o i n t l y and therefore a r e 2 influence one another. However, since the f i r s t outcome i s unknown at the beginning of the time horizon, the second e f f o r t choice can p o t e n t i a l l y depend on the f i r s t outcome. 4.2 SECOND BEST In this section, the general formulation of the one-period sequential model i s f i r s t presented. Subsequently, the two extremes of independent outcomes and perfectly correlated outcomes are examined. In the f i r s t case, knowledge of x^ reveals no information about x , whereas i n the second case, 2 x^ reveals perfect information about x . The behavior of the agent's second 2 e f f o r t strategy i s i l l u s t r a t e d In the two extreme cases, and also for the intermediate case of imperfectly correlated outcomes. As before, i n order to focus on motivational issues, i t w i l l be assumed that the p r i n c i p a l i s r i s k neutral and the agent i s r i s k averse. The prin- cipal's problem i s : Maximize s( •),a ,a ( •) 1 / / (x-s(x^ ,x )) <|>(x^ ,x |a^ ,a ( «))dx dx^ 2 2 2 2 2 subject to / [ / U ( s ( x x ) ) g ( x | x , a , a ( «))dx - V ( a ,a ( •) ) ] f ( x |a )dx 1 } 2 / / u(s(-))[g { /U(s(0)g 1 2 1 f + gf ^]dx d a 2 2 X l 2 1 2 - / (V^f + vf ^)d a x Xl 1 2 a 1 1 >u = 0 (*)dx - V ^( • ) } f ( x | a ) - 0 for almost every a ;L x, x where <«x^,x 1a^,a ( •)) = f(x^ |a^)g(x |x^,a^,a ( •)) and d i f f e r e n t i a t i o n 2 respect to a 2 2 2 i s pointwise for each x^. 2 with The i n t e r i o r portion of the optimal 64 sharing rule i s characterized by (s( )) u t = ^i+a ^* X + where X, then A a l f ° ra l m o s t e v e r y (x ,x ), 1 2 and ^ ( x ^ ) are m u l t i p l i e r s for the three constraints above. Here, 6 a " ^ l ^ a ^ * + x /<(> = f l a / f + g /g and $ l l 2 a a /<\> = g 2 /g. If ai does not influence xo, a /<(. = f / f . l a The characterization of the i n t e r i o r portion of the optimal sharing rule i n the sequential e f f o r t case i s similar to that i n the a l l o c a t i o n of e f f o r t case, except that here y*, and a*, may depend on x^. depends on x^. However, i f the agent i s r i s k neutral and <}>(x^ >x |a^ ,a ( •)) 2 = f ( x | a ) g ( x |a ( •)), then a*,( •) does not depend on x^. 1 1 2 In general, a*,(«) 2 2 If the x^s are conditionally correlated, then a*,( •) w i l l depend on x-^ even i f the agent i s r i s k neutral. These r e s u l t s are d i r e c t consequences of the a c h i e v a b i l i t y of the f i r s t best solution i n the second best case i f the agent i s r i s k neutral (see Shavell (1979)). The proposition below describes aspects of the second stage problem f o r a p a r t i c u l a r u t i l i t y function for the agent, and f o r several commonly used d i s t r i b u t i o n s for the independent outcomes. Proposition 4.2.1. Suppose that i n the second best case, the p r i n c i p a l i s r i s k neutral, and the agent's u t i l i t y function for we a l t h i s U(s) = 2/s. Suppose also that <(>(•) = f (x^ | a^)g(x | a ( •)) , where f( •) and g( •) are i n Q^, 2 2 the class consisting of the exponential, gamma, and Poisson d i s t r i b u t i o n s represented i n Appendix 1. Define a^ and a a^ and the mean of g ( x | a ) i s a . 2 2 2 i f 8V/3a i s p o s i t i v e at a* then 2 (a) y (x..) i s p o s i t i v e , and 0 so that the mean of f(x^|a^) i s Then, assuming that the optimal e f f o r t s are nonzero, (i) 2 65 a sufficient (b) c o n d i t i o n f o r the agent's expected second stage net u t i l i t y to be i n c r e a s i n g i n x^ i s t h a t a*,( •) be a d e c r e a s i n g (ii) if f u n c t i o n of x^; i s p o s i t i v e , then (a) the agent's expected u t i l i t y f o r the second stage p e c u n i a r y r e t u r n , E(u(s(x))|x^}, i s an i n c r e a s i n g f u n c t i o n o f x^, (b) the c o n d i t i o n s V > 0, V 2 jointly sufficient x^. 3V/3a c o n d i t i o n that strictive. conditions 2 2 2 > 0, and 2 in (ii)(b). The to the partial > 0 1 2 2 f u n c t i o n of differentia- one, and i s nonre- forms o f d i s u t i l i t y f u n c t i o n s s a t i s f y f o l l o w i n g , f o r example, s a t i s f y the V ( a ^ , a ) = h(a^) + a , where m m 2 are j-th effort variable. be p o s i t i v e i s a standard A number of g e n e r a l V f o r a*,(*) to be a d e c r e a s i n g Here, s u b s c r i p t s j on V r e p r e s e n t t i o n with respect The > 0, V 2 2 and > 1 and a 2 > the conditions: 0, 2 2 V(a^,a ) = 2 a 1 a where a± 2 V(a ,a ) = h(c a 1 2 1 1 a and 2 1 2 c^ and c 2 > 0, h " ' > 0, and (see Appendix 4 ) . T h i s i s c o n s i s t e n t w i t h the results with a r i s k neutral p r i n c i p a l , o f the outcomes. In g e n e r a l , first a r i s k averse agent, and There i s n e i t h e r an i n c e n t i v e problem nor an to induce the dependence o f a though, a 2 2 the positive. are i s z e r o , so t h a t t h e r e i s no i n c e n t i v e problem, then a depend on x^ effect > 0, 2 + c a ) , where h' > 0, h * constants If > 0 and on 2 does not best independence information x^. w i l l depend on x^. t h a t i n some p a r t i c u l a r s e t t i n g s , the o p t i m a l P r o p o s i t i o n 4.2.1 states second stage e f f o r t will decrease as the f i r s t outcome i n c r e a s e s . R e c a l l t h a t x^ determines a p o i n t on the u t i l i t y curve f o r the agent b e f o r e the second stage e f f o r t i s chosen. Because the agent's m a r g i n a l u t i l i t y f o r w e a l t h i s a d e c r e a s i n g function and 66 the agent's marginal d i s u t i l i t y f o r e f f o r t i s an increasing function, i t i s more costly for the p r i n c i p a l to induce a given l e v e l of a2, the higher x^ is. The r e s u l t that a 2 i s decreasing i n x^ should thus hold for other con- cave u t i l i t y functions for wealth, coupled with convex d i s u t i l i t y functions. Proposition A.2.1 also provides conditions under which the agent's second stage expected u t i l i t y w i l l increase as the outcome increases. Under the given conditions, E[U(s(x))|x^] i s increasing i n x^, and -VCa^,a (x^)) 2 i s increasing i n x^ because a 2 i s decreasing i n x^. Thus, the agent's expected second stage net u t i l i t y i s increasing i n x^. The independence of x^ and X2 i n Proposition A.2.1 means that there i s no information e f f e c t of x j . If x^ and X2 are correlated, then the behavior of a*,( •) would depend additionally on the nature of the c o r r e l a t i o n . In order to examine the information effect of xj_, the extreme case of perfect c o r r e l a t i o n of the outcomes w i l l next be analyzed. When the outcomes are perfectly correlated, then a j o i n t density f o r x^ and X2 does not e x i s t . Since the lack of a j o i n t density precludes using the previous analysis d i r e c t l y , a modified approach must be taken i n order to examine the nature of the sharing rule and the agent's second-stage e f f o r t choice when the out- comes are p e r f e c t l y correlated. Let x^ = x^(0,a^), where 0 i s an uncertain state that influences both the outcomes. It w i l l be assumed that for any fixed a^, x^ can be Inverted to obtain 0 = 0(x^,a^), The p r i n c i p a l ' s and the agent's common b e l i e f s about the outcomes w i l l be expressed as <|>(x^ ,x 1 a^ ,a ( •) ) = f(x^|a^) i f 2 X 2 = x ( » 2^ l^ 9 2 a x a n d 8 = ^ x ^ 3 2 ^ otherwise, <{>(») = 0. In order to describe the sharing r u l e , l e t a^ be the agent's f i r s t stage e f f o r t choice that i s induced by the sharing r u l e , and l e t a^(x^) be the agent's second-stage e f f o r t strategy that i s induced i f x^ i s observed and i t i s assumed that a^ = a*. Because of the perfect c o r r e l a t i o n between 67 x^ and x , 2 form: the sharing rule s(x^,x ) can be viewed as being of the following 2 s(x^,x ) = s(x^) i f x 2 = x (9,a*(x^)) and 9 = 9(x^,a*); otherwise, 2 2 s( •) i s a penalty wage which i s possibly negative. The sharing rule can be viewed as being dichotomous with respect to and varying continuously only with x^. 2 2 and the inferred value of 9. 2 A l t e r n a t i v e l y , the sharing rule can be viewed as being a function of the t o t a l output, x^ + x , condition that the observed x x subject to the i s i n agreement with the observed value of x^ In either view of the sharing rule, lack of agreement between the observed values of x and x^ i s taken as evidence of 2 shirking; accordingly, a penalty i s imposed i n such s i t u a t i o n s . If the pen- a l t y i s s u f f i c i e n t l y severe, the penalty need never be imposed, since the agent w i l l choose to avoid the penalty by choosing a*(x^). Determination of the optimal sharing rule can hence be confined to determination of the o p t i mal function s(x^); furthermore, no f i r s t order condition i s required i n order to induce a*,(x^), as long as a^ i s properly induced. The p r i n c i p a l ' s problem can therefore be written as follows: Maximize s(x ),a ,a (x ) 1 1 2 subject to + x ( 9 ^ , 3 ^ . a ^ x ^ ) - sCx^ )f ( x J a ^ d X j ^ / 2 1 /[U(s( )) - V(a ,a (x ))]f(x |a )dx Xl 1 /[U(s( )) - V ( Xl - /V a a i 2 ,a ( 2 1 X l 1 ))]f 1 (x |a )dx 1 a (a ,a (x ))f(x |a )dx 1 2 1 1 > u 1 1 1 1 1 = 0. In order to determine the f i r s t order conditions, l e t X and u be the Lagrangian m u l t i p l i e r s for the f i r s t and second constraints, respectively, and form the Hamiltonian H i n the usual way. D i f f e r e n t i a t i n g H with respect to s(») for every x^ y i e l d s W'( Xl ^ ^ l ' V + x (8(x ,a ),a (x )) - s ^ ) ) 2 1 1 2 U'(s( )) X l 1 = X + U f( X l | a i ) ' 68 which i s o f t h e usual Jw' ( 0 f ( •)dx - 2V a Finally, form. (Of l a differentiating 3x w (•) H with respect (OdXj a + u/{[U(0 - V ( 0 ] f (.)f(0}dx. l a t o a-^ y i e l d s (0 = 0. 1 H with respect to a f o revery 2 yields x ^ ? •) f( -~ - xv ( . ) f ( •) - u [ v ( O f ( 0 + v ( O f ( O] = 0 . a <«a a 2 If + /W( - ) f 1 " V l (0 l a Differentiating thep r i n c i p a l a 2 i s risk focus on m o t i v a t i o n a l rather neutral, a 2 a^ 1 2 a s i s commonly a s s u m e d i n o r d e r t o than r i s k - s h a r i n g i s s u e s , then t h e f i r s t order c o n d i t i o n s above reduce t o f U ' ^ ) ) ~ / 1PT Ia7 f ( A x l l i/{tU(s(x )) + l - 2V ( a i ,a 2 ( X a i ) l ) d l x / + W - ( <'-> 42 1 ) f a i ( x l l - V(a ,a (x ))]f 1 a a L f'xjap ' v + (x | a 2 1 l ))f 1 d l x 1 1 - V 1 ) (x |a )dx 1 (x |a ) a l a (a g & 1 .a^x^ ) f (xj_ | a^ jd^ = x 0, (4.2.2) and ^ f - ( x ^ v Dividing a. 3x l l a 2 l > a ( a i » (4.2.3) a 2 ( a2 x by f 2 W (a ,a (x ))f(x |a ) " i ) > f 1 a 1 ( 1 2 x l l f(x^|a^) a l ) 1 + V a i a 1 2 ( a l ' a 2 and r e a r r a n g i n g ( x l ) ) f ( x l l a l ) ] = °* ( - - ) 4 2 3 yields (x,|a.) a^ l 1' 1 *2 Substituting = 3a 2 (4.2.1) T7T7-F—vT U'(s(x^)) into V a (4.2.4) yields ( a . , a , ( x ) ) + uV 1 2 1 l 2 a (a 2 1 a (x ) ) . 2 1 (4.2.5) 69 It i s e a s i l y seen that i f (4.2.4) i s to hold for almost every X p a ( •) must i n general vary with x-^. then The "wealth" and "information" e f f e c t s 2 of x-j^ described i n the f i r s t best analysis can be seen i n (4.2.4). The wealth e f f e c t of x^ results from the interaction of the agent's marginal utility for wealth and marginal d i s u t i l i t y for e f f o r t . The information e f f e c t of x^ refers to the information that x^ provides about x « 2 In the perfect c o r r e l a t i o n case, the state 9 i s Inferred from x^ and a^, and x hence a deterministic function of a agent's perspectives. the 9 x / 9 a 2 2 is from both the p r i n c i p a l ' s and the 2 The information e f f e c t of x^ i s therefore captured i n term i n (4.2.5). 2 The behavior of a*, as x^ varies can be determined by d i f f e r e n t i a t i n g (4.2.4) with respect to x^ to obtain P 2 v V 9 3 a- 3x. / l> f _3_ ^ 2 39 ^ 3 a / a Q ; f 2 — 3a 2 (4.2.6) (X + u - ^ ) V 2 2 a 1 2 When x 2 J59 3x. a a a a 2 i s l i n e a r i n a , the 3 x / 3 a 2 - uV l 2 2 2 2 term i n the denominator i s zero. Two special cases of interest are ( i ) x^ = 9 + a^, where 9 Is purely noise, and ( i i ) x^ = 9a^, where 9 reveals information about the production technology. In case ( i ) , the marginal output per unit of e f f o r t i s one, regardless of the value of 9. In case ( i i ) , however, the marginal output per unit of e f f o r t i s 9. 2 2 To i l l u s t r a t e the r e s u l t s , suppose that V(«) = a^a,,. For case (i) , 2 2 2 assume that 9 " N(0,cr). Then x^ " N(a^,a ) and f / f = ( X j - a ^ ) a . There& 2 2 fore, the numerator of (4.2.6) i s u(2a^a )/a and the denominator i s 2 2 l ~ l -(X + u — 2 — ) ( 2 a ^ ) - 4a^u. x a a l~ l The term (X + u — — ) x a 2 * s positive by the a f i r s t order condition (4.2.1), and the e f f o r t levels are assumed to be posi- 70 tive. Therefore, a*/(x^) < 0, provided that y > 0. This can also be seen by solving f o r a (x^) d i r e c t l y from (4.2.4) to obtain 2 X "3. a ( 2 ) = [2a (X + y J^i) + 4ua ]" . 2 X l 1 1 a In t h i s case, the sign of a*,' i s the same as i n the independent outcome s i t uation described i n Proposition 4.2.1, where there was no information about x 2 to be gained from x^. The case ( i ) result here can thus be interpreted as indicating that the wealth e f f e c t of x^ dominates any information e f f e c t that exists through perfect f ci or sr tr e lthat a t i o n0 of For case ( i i ) , assume ~ e xthe p ( l outcomes ) . Then .x^ ~ exp(a^) and f fl 2 /f = (Xj-a^)/a^. Equation (4.2.4) becomes l~ l 9 = (X + y — j — ^ x a a 2 a 2 l 2 a ^ + t j a i a 2 * l Sub s t i t u t i n g 0 = x^/a^ and rearranging results i n l *2< l> " — x -a 2 a ^ [ ( X + y - 4 - ^ ) + 2y] x a X 1 1 ai a l Therefore, r x a^X i a 2 , ( x i> = . t 7T 2 a + y l a i 2—) + 2y - x ( y / a ) 1 a 1 i {a (X + y ^ - ^ ) L a + 2y} 2 l The numerator of a*,'(x^) reduces to (a^X - y + 2y), which i s positive x l" l a (assuming y > 0) because a^(X + y — j — ) > 0 for x^ J> 0, and for x^ = 0 i n «! particular. for Thus, a*/(x^) > 0 i n t h i s case. A similar analysis can be done the normal d i s t r i b u t i o n example used i n case ( i ) , with the result that 71 a*,'(x^) > 0. The sign of a*'(x^) would remain the same i n cases ( i ) and ( i i ) for a wide variety of reasonable d i s u t i l i t y functions. For the normal d i s t r i b u t i o n example, the only difference i n the expres9 ^ 2 96 sion for a i ' ( x ) i s the — (——) — — term. 2 z L 90 X 1 In case ( i ) , i t i s zero, and i n a case ( i i ) , i t i s 1/a-^. Although 9 i s purely noise i n case ( i ) , r i s k i s imposed on the agent for motivational purposes i n order to induce a Pareto optimal choice of a^. The e f f o r t strategy a (x^) i s primarily determined by 2 the wealth e f f e c t of x-^, leading to a decreasing function of x^ just as i n the case when the outcomes were assumed to be independent (see Proposition 4.2.1). In case ( i i ) , where the marginal output per unit of e f f o r t i s 9, the agent receives perfect information about the production technology that was not relevant i n case ( i ) . The information effect of x^ overrides the wealth effect i n the case ( i i ) examples above, so that a*, i s now an increasing function of x^. To i l l u s t r a t e the second best r e s u l t s , suppose that the p r i n c i p a l i s r i s k neutral and the agent's u t i l i t y for wealth i s 2/s. Suppose further that x^|a^ and x | a 2 2 are Independent, f(») i s exponential with mean a^, and g(») i s exponential with mean a ( x ^ ) . 2 Then the i n t e r i o r portion of the 2 optimal sharing rule i s characterized by s(x^,x ) = P (x), where 2 x -a*(x ) " { * ). a* ( x ) 2 P(x) = \ + M ^ - ^ ) a* + P (x )( 2 1 1 i (4.2.7) x P(x) must be s t r i c t l y p o s i t i v e i n order to s a t i s f y the f i r s t order condition 1/U' = P(x). In the proof of Proposition 4.2.1, i t i s shown that P ( ) = (9V(a*)/9a )a* ( )/2 , 2 2 X l 2 Xl (4.2.8) which i s p o s i t i v e under the usual assumption that the agent's d i s u t i l i t y function i s increasing In the second e f f o r t . Furthermore, i t i s e a s i l y seen 72 that u '(x^) < 0 under the assumptions i n Proposition 4.2.1, part ( i i ) . 2 I n t u i t i v e l y , the higher the f i r s t outcome i s , the less concerned the r i s k neutral p r i n c i p a l i s about motivating a high choice of a 2 This i s because the higher the outcome x^ i s , the c o s t l i e r i t becomes to induce a given l e v e l of a2« As remarked e a r l i e r , the p r i n c i p a l induces a strategy which i s decreasing a (xi) 2 In xj_. At the time of the second e f f o r t choice, the f i r s t outcome x^ i s known. x -a*,( ) 2 ^ ' 2 As Lambert (1983) notes, P(x) can be viewed as X(x^) + u ( x ^ ) ( 2 Xl which i s as i t would appear i n a one-stage, one-period agency problem, given that x^ i s f i x e d . Thus, i t i s not t o t a l l y surprising that, as i n the one- stage, one-period problem, 3s/3x u (x^) are s t r i c t l y postive. 2 ably more complicated. 2 i s s t r i c t l y p o s i t i v e , since P(x) and The behavior of s( •) as x^ varies i s consider- Substituting (4.2.8) into (4.2.7) and d i f f e r e n t i a - ting shows that f-= l 2P(x) A, a * (fL!^ + _ a-V .)/2]. 2 ( 2 2 Under the assumptions i n Proposition 4.2.1, part ( i i ) , the f i r s t and t h i r d terms i n the brackets are p o s i t i v e . The condition that x 2 < a*,(x^) i s suf- f i c i e n t f o r the sharing rule to be increasing i n the f i r s t outcome. ever, i t i s c l e a r l y possible that x 2 How- 3s/3x^ i s increasing i n x^ even i f > a*,^). An a l t e r n a t i v e approach to an analysis of the sharing rule i s i n s i g h t - ful. Recall that the f i r s t order conditions require that f U^i)T = R ( X ) - X + h a (x |a*) g 1 fUja*) + "2< l> x a (x |a*( )) 2 X l gOc.la*^)) * 73 Taking the conditional expectation of R(x) with respect to g(x | a^x^) ) 2 f a ( , ) 1 results i n the expression X + model, i f ^ ^ . As i n the one-stage, one-period > 0 annd f( •) s a t i s f i e s the monotone likelihood r a t i o property, then (X + ^ ^ Thus, the agent faces a sharing ) i s increasing i n x^. rule with similar characterization for each stage, looking only one step ahead. That i s , at each stage, the shading rule i s characterized by the condition that 1/U' = X + uJh /h. u u a A In order to i l l u s t r a t e the behavior of a (x^) when x^Ja^ and x | a 2 2 are 2 imperfectly correlated, suppose that g(x |x^,a (x^)) i s exponential with 2 mean M (x^) = x ^ a ( x ^ ) . 2 2 Since the exponential d i s t r i b u t i o n i s a one-parame- 2 ter d i s t r i b u t i o n , we may write g(x |x^,a (x^)) = g(x |M (x^)), 2 t i o n 4.2.1 can be applied. 2 2 2 and Proposi2 2 For concreteness, suppose that V(a^,a ) = a 2 2 Then V 2 ^ a 2 ' 2 = 2a^a > 0, V 2 2 2 = 2a 1 > 0, V = 0, and V 2 2 2 1 2 2 = h& 1 > 0, so that the conditions i n Proposition 4.2.1, part ( i i ) ( b ) are s a t i s f i e d . Substitui s p o s i t i v e , then M^Cx^) i s which i s s t i l l p o s i t i v e . Therefore, i f ting a (x^) = M (x^)/x^ into the expression for V yields V = 2a M (x )/x^, decreasing i n x^. That i s , x^a^x^) i s decreasing i n xj_. If b i s positive, 2 2 2 2 2 2 1 then i t i s e a s i l y seen that a (x^) i s decreasing i n x^, as when b i s zero 2 (the "independent" case). In this s i t u a t i o n , as when there i s perfect cor- r e l a t i o n with the normal d i s t r i b u t i o n i n case ( i ) , the wealth effect of x± i s dominant. Recall that case ( i i ) of the perfect c o r r e l a t i o n analysis assumed that x t = Qa.^, so that x 2 = x a (x )/a . 1 2 1 1 This seems similar to the imperfect c o r r e l a t i o n example i n which M (x^) = x ^ a ( x ^ ) . 2 2 However, the signs of a*,'(x^) are opposite i n these perfect and imperfect correlation cases. This can be interpreted as follows: i n the presence of information related to the production technology, the wealth e f f e c t of x^ i s dominant i f 74 the c o r r e l a t i o n i s imperfect; the information effect of x^ i s dominant only i f the c o r r e l a t i o n i s perfect. If b i s negative, then the behavior of a (x^) i s p o t e n t i a l l y much more 2 complex. The condition that M (x^) i s decreasing i n x^ i s equivalent to the 2 condition that bx^ a ( x ) + x^a^x^) < 0. Since b < 0, the f i r s t term i s negative; a ( x p may It could be, for example, that 1 2 2 1 thus be of any sign. because of the interactions of the wealth and information effects of x^, a (x^) i s increasing for low values of x^ and decreasing for high values of 2 x r This concludes the analysis of the effect of the information x^ on the agent's second e f f o r t strategy. The next two sections examine two aspects which were of interest i n the a l l o c a t i o n problem, namely additive separabili t y of the sharing r u l e , and additive e f f o r t . 4.3. ADDITIVE SEPARABILITY OF THE SHARING RULE In this section, the question of whether or not to reward the agent for each outcome separately i s examined. For example, suppose a salesperson exerts e f f o r t s e l l i n g a product i n one t e r r i t o r y , observes the resultant sales, and then devotes e f f o r t to s e l l i n g the same product or a d i f f e r e n t product i n another t e r r i t o r y . Should the firm compensate the salesperson with a d i f f e r e n t reward function for each outcome, as i f he or she were two separate salespeople? That i s , should the sharing rule be a d d i t i v e l y separ- able i n the outcomes? It was shown i n Section 3.4 that i n the e f f o r t a l l o c a t i o n problem, i f the p r i n c i p a l Is r i s k neutral and the agent i s r i s k averse with a HARA-class u t i l i t y for wealth, then j o i n t l y s u f f i c i e n t conditions for the optimal sharing rule to be a d d i t i v e l y separable i n x^ and x 2 are ( i ) the agent has a log u t i l i t y function for wealth and ( i i ) the outcomes are conditionally dent (see equation (3.3.2)). indepen- In the one-period sequential e f f o r t problem, 75 the optimal sharing rule w i l l not be additively separable i n x^ and x , even 2 under conditions ( i ) and ( i i ) above. This i s e a s i l y seen from the charac- t e r i z a t i o n of the i n t e r i o r portion of the optimal sharing rule: ^•[(V(x)) - D ] if C * 0 D l n 7(x) i f C = 0, C 2 s(x ,x ) = 1 2 2 where the agent's r i s k aversion function i s -U''(s)/U'(s) = l/(Cs+D ) and 2 *a ( 0 f ( x a i = X "I* f ( + X l '"a ll l> a | a i ) + ( 0 S ( x | x , a , a ( •)) a i 2 1 1 g(x |x ,a ,a (.)) 2 and d i f f e r e n t i a t i o n with respect to a 1 2 1 g ^ 2 1 + W 2 2< l x ) •) 1T0~' i s pointwise for every x^. Thus, even i f U(s) = In s ( i . e . , C = 1) and g(x |x^,a^,a (x )) = g(x2|a (x^)), the 2 2 1 2 optimal sharing rule w i l l not be a d d i t i v e l y separable i n x^ and x of the l ( i ) 8 J x 2 a /g term unless ^ ( x ^ ) = k, a constant, and g t i v e l y separable i n x^ and x . 2 fl 2 because /g i s addi- Lambert (1981, p. 90) has shown i n a similar situation that ^ ( x ^ ) > 0 for almost every xj_. Since for almost every x^, U (x^) * 0, and i t i s unlikely that ^ ( x ^ ) = k (which would require that 2 3E(x-s( •))/3a 2 2 2 = k3 E(U(s(')) ~ V(»))/3a f o r almost every xj^), the optimal 2 sharing rule w i l l almost c e r t a i n l y not be additively separable i n x^ and x . 2 A c o r o l l a r y of this result Is that i f the p r i n c i p a l i s r i s k neutral and the agent's u t i l i t y f o r wealth i s i n the HARA c l a s s , then the optimal sharing rule w i l l not be l i n e a r . Thus, the simple commission schemes often used i n practice are not the most e f f i c i e n t way to motivate a r i s k averse agent when sequential e f f o r t decisions are involved. The presence of the additional decision information, x^, for the agent, which i s the only difference between the sequential e f f o r t problem and the 76 e f f o r t a l l o c a t i o n problem, introduces more complexity Into the sharing rule (I) the m u l t i p l i e r 1^(0 in two ways: depends on x^, the d i s t r i b u t i o n of x X j j a i and x | a 2 2 depends on xj_, and ( i i ) because a ( •) 2 given a ( •) depends on x^, even i f 2 are s t a t i s t i c a l l y independent. 2 The combination of these features precludes an additively separable sharing r u l e . depends on x^ even i f x-jja^ a n c * x |a 2 Note that a ( •) 2 are s t a t i s t i c a l l y independent. 2 two Hence, a 's dependence on x^ Is not due to information that x^ provides about the 2 l i k e l i h o o d of x . 2 Rather, the dependence i s due to a wealth effect (x^ influences the agent's position on h i s or her u t i l i t y curve before the second e f f o r t i s chosen) which the p r i n c i p a l can use to e f f i c i e n t l y motivate the agent. Recall that i n the f i r s t best case, there i s no motivational problem, and therefore the optimal a 2 does not depend on x^ i f the agent i s r i s k averse, the p r i n c i p a l i s r i s k neutral, and x^|a^ and x | a 2 2 are s t a t i s - t i c a l l y independent. I f , on the other hand, x-jja^ and x | a 2 2 are dependent, then a 2 depends on x^ f o r the additional reason that x^ provides information about the l i k e lihood of x . This i s true i n both the f i r s t best and second best cases. 2 The m u l t i p l i e r u (x^) further complicates the sharing r u l e . 2 Intui- t i v e l y , i t i s a measure of the cost to the p r i n c i p a l of the motivational problem for a . 2 The result that ^ ( x ^ ) > 0 for a l l x^ means that no matter what the f i r s t period outcome i s , the p r i n c i p a l w i l l not find i t optimal to induce as high an e f f o r t l e v e l , a , as he or she could have i f there were no 2 motivational problem. 4.4 ADDITIVE EFFORT In this section, the additive e f f o r t situation described i n Section 3.5 i s examined when sequential choice i s allowed^. The p r i n c i p a l i s assumed to be r i s k neutral and the agent i s assumed to be r i s k averse. The agent i s further assumed to have no i n t r i n s i c d i s u t i l i t y for any particular task, but 77 rather i s assumed to have d i s u t i l i t y only for the t o t a l e f f o r t expended. The agent's d i s u t i l i t y i s thus represented as V(a^+a2(•))• In f i r s t best situations, i f X j j a j and x | a 2 2 are independent, the p r i n - c i p a l i s r i s k neutral, and the agent Is r i s k averse, then the optimal a ( •) 2 does not depend on x^ i n the sequential e f f o r t case. Therefore, the f i r s t best results f o r the a l l o c a t i o n of e f f o r t problem s t i l l hold for the sequent i a l e f f o r t problem. In p a r t i c u l a r , i f the means are l i n e a r i n e f f o r t , that i s , the means are given by ka^, then only the sum of the e f f o r t s i s of importance to the p r i n c i p a l and the agent. where k^ * k j , the If the means are given by k^a^, i * j , then a l l the e f f o r t should be put into the task with largest return per unit of e f f o r t . For more general unequal mean func- tions, the optimal solution w i l l involve nonzero e f f o r t s devoted to a l l tasks. If the mean functions are i d e n t i c a l nonlinear s t r i c t l y increasing functions, then the optimal e f f o r t s are equal. The second best case i s quite d i f f e r e n t because of the dependence of a ( •) on X]_. Recall that i n the a l l o c a t i o n problem, assuming an Interior 2 solution, the constraints require that 3EU(s(x)) 3EU(s(x)) 3a^ 3a 2 ' because each of the marginal expected u t i l i t i e s must equal the marginal d i s utility the from the t o t a l e f f o r t , V'(a^+a ). 2 In the sequential e f f o r t case, constraints become 3EU(s(x)) •3a 1 3EV(a +a ( •)) 1 3a 2 (4.4.1) x and 3E U(s(x)) 2 ^ = V(a^+a^i •)) f o r almost every x^, (4.4.2) 78 where E U(s(x)) = / U( s( •) )g(x | a (x^) )dx . Equation (4.4.1) requires aver2 2 2 2 aging over a l l possible values of x^ and x , because a^ i s chosen before 2 Equation (4.4.2), on the other hand, requires either outcome i s available. averaging only over a l l possible values of x , because x 2 i s the only 2 remaining uncertainty at the time the second e f f o r t l e v e l i s selected. Corollary 4.4.1 below applies Proposition 4.2.1, which characterizes the behavior of the second stage e f f o r t strategy, to the additive e f f o r t Proposition 4.2.1 case. assumed that e f f o r t s were defined such that they were the means of the outcome d i s t r i b u t i o n s . In t h i s section, e f f o r t s are assumed to be additive; assuming that e f f o r t s are simultaneously of the outcome d i s t r i b u t i o n s i s overly r e s t r i c t i v e . 4.4.1 the means Therefore, Corollary allows for a more general s i t u a t i o n i n which the means of the outcome d i s t r i b u t i o n s are functions of the e f f o r t s . This accounts for conditions on the second stage mean, M ( •), i n order to characterize the behavior of the 2 second stage e f f o r t strategy. It should be noted that the d e f i n i t i o n of e f f o r t in turn influences the description of d i s u t i l i t y captured i n the d i s u t i l i t y function V( • ) . Thus, conditions on both V(•) and M (») are either 2 i m p l i c i t l y or e x p l i c i t l y required i n order to characterize the behavior of the second stage e f f o r t strategy. Corollary 4.4.1. Assume that the conditions i n Proposition 4.2.1 except that E(xi_|aj_) = M^a^) > 0, E ( x | a ) = M ( a ) > 0, and V ( a , a ) = 2 V(a +a ), with Mj/ > 0 and M ' 1 2 2 > 0. 2 2 2 1 Let ei = M ^ a ^ and e 2 2 > 0, V " > 0, V " ' > 0, and e*(x^) to be decreasing 2 1 1 2 1 1 1 2 = M (a ). induced d i s u t i l i t y function i s then V * ( e , e ) = V ( M ~ ( e ) + M V hold, 2 (e )). 2 i n x-^ i s that 3M " 2 - ^'''M^ be nonnegative at *2ct For example, suppose M^(a^) = a^ , M ^ a ^ , where 0 < a < 1, 0< If ' < 0, then a s u f f i c i e n t condition for a (a +a„) The 8 <V2 » and a 8 = a 2 , and V(a^+a ) = > 0 for 1=1,2. 2 Then 79 a± = = ^ e a a n d e*,(x^) i s d e c r e a s i n g V" = 2 > 0, 2 2 2 0 2 " 4 - = M 2 ^ 2^ e since V 2 0 2 & 4.4.1 = 2(a^+a ) > 0 f o r a^+a 2 2 2 ^^^^Y 2^^' = M' ' = g(3-l)a 8 (S-l)(B-2)a the agent's expected The 2 i n x^, V " » = 0, 3 3 (0-l) a a " 0 2 " 2 < 0, and > 0 f o r g <V2 4 second stage net u t i l i t y 3M ' » 2 2 • shows t h a t * ®> 2 - M ' ' »M ' 2 = 2 By P r o p o s i t i o n 4.2.1, i s t h e r e f o r e i n c r e a s i n g i n x^. remainder o f t h i s s e c t i o n makes Pareto comparisons between e f f o r t s t r a t e g i e s i n the s e q u e n t i a l e f f o r t model, i n which i n f o r m a t i o n becomes a v a i l a b l e a t a f i x e d p o i n t d u r i n g the p e r i o d . lows, the agent has e f f o r t c h o i c e s f o r two observed. at In the d i s c u s s i o n t h a t tasks b e f o r e any information i s A f t e r o b s e r v i n g the i n f o r m a t i o n ( i f a nonzero e f f o r t i s exerted a t a s k , then the a s s o c i a t e d outcome i s o b s e r v e d ) , the agent can choose to b e g i n , c o n t i n u e , o r d i s c o n t i n u e e x e r t i n g e f f o r t a t the two the f i r s t fol- s u b s c r i p t on a and on x denote time ( t h e stage) tasks. and Letting letting the second s u b s c r i p t denote the t a s k s , the s e q u e n t i a l c h o i c e s c e n a r i o under d i s c u s s i o n can be d e p i c t e d as f o l l o w s : a-Q and/or Information a^ exerted x^ 2 Point: and/or x ^ a 2 1 ^ * ) and/or a 2 2 2 x ^ and/or 2 ( •) e x e r t e d x observed 2 2 observed In the a l l o c a t i o n o f e f f o r t s i t u a t i o n d i s c u s s e d i n Chapter 3, agent's e f f o r t d e c i s i o n s are made and outcomes a r e observed. e f f o r t s are e x e r t e d The the e f f o r t s are e x e r t e d b e f o r e s i t u a t i o n may be viewed as one s e q u e n t i a l l y , or simultaneously. outcomes a r e observed In e i t h e r case, thought o f as a s p e c i a l case of the Point. Neither x ^ nor x ^ 2 Is observed the The allo- situation d e s c r i b e d above, where the n u l l i n f o r m a t i o n system i s i n e f f e c t a t Information the i n which the o n l y a f t e r both e f f o r t s have been e x e r t e d . c a t i o n of e f f o r t case can be the the u n t i l the end of the 80 period. The e f f o r t s a ^ and a 2 are thus independent of x ^ 2 2 the end of the period, a^ = a ^ + a ^ and a 2 exerted, and x^ = x ^ + x ^ and x 2 2 = x^ + x 2 = a^ 2 + a 2 2 2 and x^ . At 2 w i l l have been w i l l have been observed. 2 2 Analysis similar to that i n the proofs of the propositions i n Section 3.5 establishes the sequential e f f o r t results below. 4.4.2 Part ( i ) of Proposition deals with situations with e f f o r t devoted to only one task at a time, while parts (II) and ( i i i ) deal with situations i n which e f f o r t i s devoted to more than one task at a time. Proposition 4.4.2. Suppose the p r i n c i p a l i s r i s k neutral. Suppose further that the agent's u t i l i t y for wealth i s the square root u t i l i t y function and that the agent's d i s u t i l i t y i s a function of the t o t a l e f f o r t expended. F i n a l l y , suppose that for i , j = 1,2, the x^j's given the corresponding a^j's are independent, and E(x^j|a^j) = ka^-j, where k i s a constant. (i) If i t i s optimal for the p r i n c i p a l to induce (1) a ^ a 2 1 ^ l l ^ ^ ^» and a x 2 2 > 0, a ^ 2 = 0, ( x ^ ) = 0, then i t i s also optimal for the prin- c i p a l to induce (2) a ^ > 0, a ^ or to induce (3) a-j^ = 0, a ^ to induce (4) a ^ = 0, a ^ 2 2 2 = 0, a ^(x^^) = 0, and a ( x ^ ^ ) > 0, 2 > 0» a 2 22 ^ ( x ^ ) ^ 0» 2 a n d a ( x ^ ) = 0, or 2 2 2 > 0, a ^ ( x ^ ) = 0, and a ( x ^ ) > 0. 2 2 2 2 That 2 i s , i f one of the four combinations of e f f o r t s (1) through (4) i s optimal, then the p r i n c i p a l i s i n d i f f e r e n t among the four combinations. This result holds no matter what the r i s k averse agent's u t i l i t y for wealth i s . Moreover, means that are l i n e a r i n e f f o r t are not required. 2 (ii) If xjLj|a£j i s normally distributed with mean ka^j and variance a , then (a) the best e f f o r t strategy with a ^ and a a^ 2 2 > 0, a ^ 2 = 0, a ^(x]_^) > 0, 2 ( x ^ ) > 0 i s Pareto i n f e r i o r to some e f f o r t strategy with > 0, a ^ 2 = 0, a ^ ( x ^ ) > 0, and a 2 2 2 ( x ^ ) = 0, and 81 (b) the best e f f o r t strategy with a-^ > 0, a ^ > » u a 2 1 ^ l l ' 1 2 ^ ^ ^» x x a n d a 2 2 ^ l l » 1 2 ^ > 0 i s Pareto i n f e r i o r to x x some e f f o r t strategy with and (iii) > 0, a-^2 > 0> 2 1 ^ l l » 1 2 ^ a x x E ^» a22(xn.X]^) ^ * u If x ^ j l a ^ j i s exponentially d i s t r i b u t e d with mean ka^-j, then (a) the best e f f o r t strategy with > 0, a ^ = 0, a 2 i ( x n ) > 0, 2 and a 2 2 ( x n ) = 0 i s Pareto i n f e r i o r to some e f f o r t strategy with a (b) ll ^ ^» 12 a ®* 2 1 ^ l l ^ ^ ^» = a x a n < * 22^ ll^ ^ » a the best e f f o r t strategy with a 21^ ll' 12^ x x E ^» a n c x u a n < * > 0, a^2 > 0» * 2 2 ^ l l ' 1 2 ^ > 0 i s Pareto i n f e r i o r to a x x some e f f o r t strategy with a ^ > 0, a ^ > 0, 2 l ( l l » 1 2 ) > 0» a x x 2 and 2 2 ( l l > 1 2 ) ^ * a x x u The r e s u l t s i n Proposition 4.4.2 can be depicted as follows, where s o l i d l i n e s indicate nonzero e f f o r t , and dashed l i n e s indicate zero (no) effort. The f i r s t l i n e i n each pair of l i n e s represents the second l i n e i n each pair represents the f i r s t task, and the second task, (i) (1) (2) J I a u > 0 I I a 2l( ll) x > 0 a 1 2 = 0 I 22< ll> x ~ 0 I l l a I a I > 0 a I a 12 I 2l( ll) = 0 22( ll> > 0 x L I = 0 The p r i n c i p a l i s i n d i f f e r e n t between (1) and (2). a x Alternative (4) Is similar to a l t e r n a t i v e (2), with the tasks renumbered, and a l t e r n a t i v e (3) i s s i m i l a r to a l t e r n a t i v e (1). 82 ( i i ) (a) (A.) (B) I I &11 > a 0 I 1 2l( ll) > x a I a = 0 a 1 2 I 0 I 22^ ll^ ^ x J ll > a 0 I 2l( ll) > x I 0 a 12 0 I = a 0 I 22^ ll^ ^ x 0 The p r i n c i p a l prefers some form of (B) to the best possible form of (A). (ID (b) (C) (D) l _ I a ll > 0 a I 2l( ll> 12) x I x > 0 _l I a 1 2 > 0 a 22^ ll» 12) x x > I a 0 ll I > 0 a I a 12 I 2l( ll> 12) = x x 0 1 > 0 a ! 22^ ll» 12^ x x > 0 The p r i n c i p a l prefers some form of (D) to the best possible form of (C). The results i n ( i i ) say that whether e f f o r t i s exerted at one or two tasks i n i t i a l l y , a l l e f f o r t should be concentrated i n only one task at the second stage. Because of the assumed independence, i t does not matter which task i s chosen. In part ( i i i ) of the proposition, the results In ( i i ) ( a ) and (b) are reversed. That i s , whether e f f o r t i s exerted at one or two tasks i n i t i a l l y , e f f o r t should be s p l i t across two tasks at the second stage. It i s prefer- able to Induce the agent to d i v e r s i f y e f f o r t after receipt of the informat i o n x^ when the outcomes are exponentially distributed as described, and i t i s preferable not to induce the agent to d i v e r s i f y e f f o r t when the outcomes are normally distributed as described. In part ( i ) of the proposition, d i v e r s i f i c a t i o n of e f f o r t Is not i n question. Because the outcomes condi- t i o n a l on the e f f o r t s are independent and i d e n t i c a l l y distributed, the p r i n c i p a l i s i n d i f f e r e n t among the four alternatives (1) through (4). 83 As i n Section 3.5, the results i n parts ( i i ) and ( i i i ) are partly explainable i n terms of the variances of the t o t a l outcomes. For simplic- i t y , consider a comparison between a fixed amount of e f f o r t , a, devoted to only one task, or divided across two tasks. comes of the two tasks, and l e t ka^ and k a where a^ i s the e f f o r t devoted to task i . individual 2 Let x^ and x denote the out- 2 denote their respective means, Since the means of each of the outcomes are linear i n e f f o r t , the t o t a l e f f o r t expended i s the only quantity of relevance for the purpose of comparing the means of the t o t a l outcomes (x^ i f e f f o r t i s devoted only to one task, and x^+x i f 2 e f f o r t i s devoted to two tasks). For the normal d i s t r i b u t i o n i n part ( i i ) of Proposition 4.4.2, Var(x-jJai=a) = o , and Var(xi-hx 1a]+a =a) = 2o~. 2 2 the exponential distributions i 2 For i n part ( i i i ) , Var(xjjai=a) = k^a^, and 2 2 Var(xi+x |ai+a2 a) < k^a . = 2 For the normal d i s t r i b u t i o n , the variance of the t o t a l outcome i s smaller when a l l the e f f o r t i s devoted to one task, while for the exponential d i s t r i b u t i o n , the variance of the t o t a l outcome i s smaller when a l l the e f f o r t i s divided across two tasks. can be related This observation to the Information content of the outcomes considered as s i g - nals about the agent's e f f o r t ( s ) . The quantity 1(a) = / f (x|a)/f(x|a)dx, called Fisher's information about the parameter a contained i n the data (see, for example, Cox and Hinkley, 1974), i s used as a measure of information content about a i n x. For both the normal and exponential d i s t r i b u - tions described above, 1(a) i s the reciprocal of the variance. Thus, for the normal case, there i s "more" information about the agent's e f f o r t when a l l e f f o r t i s devoted to one task than there i s when the e f f o r t i s divided across the tasks. The opposite i s true for the exponential distribution. Proposition 4.4.2 does not state what the optimal e f f o r t strategies are i n each case. The comparisons i n parts ( i i ) and ( i i i ) are between situa- tions with the same information available at the beginning of the second 84 stage. For example, i n ( i i ) ( a ) , which only the comparison i s between two situations i n i s available at the beginning of the second stage. Compari- sons of situations with d i f f e r i n g information available at the beginning of the second stage are more d i f f i c u l t to make. 4.5 SUMMARY AND DISCUSSION This chapter examined the problem of sequential e f f o r t decisions within one period. The sequential aspect arose because the agent observed an out- come affected by the f i r s t e f f o r t choice before making the second e f f o r t choice, which affected a second outcome. The agent was paid only a f t e r both e f f o r t s were exerted and both outcomes were observed. In the f i r s t best case, the characterization of the optimal sharing rule i n the sequential e f f o r t case i s similar i n s p i r i t to that i n the a l l o cation of e f f o r t case. That i s , i f one person i s r i s k neutral and the other i s r i s k averse, then the r i s k neutral person bears the r i s k . If both the p r i n c i p a l and the agent are r i s k averse, then the r i s k i s shared, with the sharing rule a function of the sum of the outcomes. The f i r s t best characterization of the optimal e f f o r t s i s different i n the sequential e f f o r t case than i n the a l l o c a t i o n of e f f o r t case. The sec- ond e f f o r t choice may now depend on the f i r s t outcome and the f i r s t choice. effort If both of the individuals are r i s k averse, then the optimal second stage e f f o r t strategy w i l l depend on x j _ , the f i r s t outcome. The second stage e f f o r t strategy w i l l also depend on x^ i f the joint density of the two outcomes given the actions i s f ( x ^ | a ^ ) g ( x | x ^ . a ^ , a ( • ) ) • 2 2 However, i f at least one of the individuals i s r i s k neutral and the joint density of the two outcomes given the actions i s f ( x ^ | a ^ ) g ( x , a ^ , a ( • ) ) > then the o p t i 2 2 mal second stage e f f o r t strategy w i l l be independent of x^. The second stage e f f o r t strategy may depend on x^ because of a "wealth" ("risk aversion") e f f e c t , or because of an "information" e f f e c t . The wealth 85 effect occurs when both individuals are r i s k averse, because a r i s k averse individual's marginal u t i l i t y varies at different points of the u t i l i t y curve. The f i r s t outcome determines where on the u t i l i t y curve the i n d i v i d - ual i s , so the individual w i l l want the second stage e f f o r t adjusted accord- ing to the value of the f i r s t outcome. the two outcomes are dependent. The information effect occurs when Depending on the nature of the c o r r e l a t i o n between the two outcomes, the p r i n c i p a l may wish to induce the agent to choose the second stage e f f o r t strategy to be an increasing or decreasing function of the f i r s t outcome. Proposition 4.1.1 provides a precise expres- sion for the derivative of the second stage e f f o r t strategy with respect to the f i r s t outcome. The analysis i n the second best case allowed for nonindependence of the outcomes. As usual, the p r i n c i p a l was assumed to be r i s k neutral and the agent was assumed to be r i s k averse. The characterization of the optimal sharing rule i n the sequential e f f o r t case i s similar to the characterizat i o n i n the a l l o c a t i o n of e f f o r t case, except that the multipler u e f f o r t strategy a x l» 2 a D e 2 may depend on x^. Although i n general, a 2 a n d t n e 2 w i l l depend on independent of x^ i f the agent i s r i s k neutral and the j o i n t density of the outcomes i s of the form f ( x ^ | a ^ ) g ( x | a ( • ) ) • 2 Proposition 4.2.1 2 assumed a square root u t i l i t y function for the agent and c o n d i t i o n a l l y independent outcomes given the actions. It was shown that the agent's second stage e f f o r t strategy w i l l be decreasing i n x^. Intui- t i v e l y , t h i s i s because the higher x^ i s , the more c o s t l y i t i s for the p r i n c i p a l to induce any p a r t i c u l a r l e v e l of a . 2 ginal u t i l i t y The agent's decreasing mar- for wealth and increasing marginal d i s u t i l i t y for e f f o r t account for the increasing costliness of inducing a . 2 Since these charac- t e r i s t i c s hold i n general, the results i n Proposition 4.2.1 other u t i l i t y functions. should hold for 86 The c a s e rule which generally result the strictly first perfectly incorporates outcome i f about of the correlated a penalty wage f o r i n a second e f f o r t state is outcome i n in the the perfectly behavior is It ing was rule effort 2 next that case sequential is will information, x^, The f i r s t the additively separable guarantee case. best in results Section in t i o n case because of information tions ing for the under which i n xj_, the dary versus the interior with varying of the case The c o n d i t i o n a l sequential effort the that separable the hold for the sequential the differ first choice. in effort is strategy Section 3.5 obtain degrees of diversification information investigation In the in of the effort effort in as case. the alloca- provides will effort. of be condidecreas- the to The the bounthe effort results information effort. problem i s conditional somewhat related investigation The pre-decision Finally, a measure of decision were a p p l i e d of shar- allocation P a r e t o c o m p a r i s o n s between statistic, agent's 4.4.1 additive. correlated. rule. from those Corollary the an o p t i m a l allocation outcome p l a y s the The sharing rule the case of strategy. the sharing first be override additional separable second stage the in the may imperfectly outcomes to problem. effort for sequential outcome about therefore agent's in order Fisher's strategy results solution results strategies content 3 also role can presence of o u t c o m e , when e f f o r t effort to the effort agent's sequential were r e l a t e d in in to information c o n d i t i o n s which guaranteed second e f f o r t the first the reveals sharing shown decreasing outcomes a r e an a d d i t i v e l y additive state case an a d d i t i v e l y Thus, precludes problem analyzed second best shown t h a t is A The i n f o r m a t i o n e f f e c t the more c o m p l e x when t h e not effort of examined. second e f f o r t correlated changing the behavior a the next s e c o n d s t a g e was that the outcome. wealth e f f e c t , of If then first the strategy random n o i s e . production technology, increasing o u t c o m e s was to the problem, the 87 agent exerts e f f o r t , and both the p r i n c i p a l and the agent observe the outcome x. The p r i n c i p a l then has the option of observing y, an additional signal about the agent's e f f o r t . The agent's compensation i s s(x) or t( iy)» depending on what was j o i n t l y observed. x Cost variance Investiga- tion, a f a m i l i a r problem i n accounting, has been modeled as a conditional investigation problem (see, for example, Baiman and Demski, 1980a,b) i n which x i s a cost and y i s the result of an investigation to t r y to determine the reason for the cost's deviation from a preset standard. The prob- lem i s similar to the sequential e f f o r t problem i n that decisions are based on an i n i t i a l outcome. However, a f t e r the i n i t i a l outcome, the p r i n c i p a l chooses an act i n the conditional investigation problem, and the agent chooses an act i n the sequential e f f o r t problem. The major focus i n the conditional investigation problem has been on the determination of the o p t i mal investigation strategy; such a question i s not at a l l relevant i n the sequential e f f o r t choice problem. Some additional comments about the condi- t i o n a l investigation problem w i l l be made i n the next chapter. As remarked at the end of Chapter 3, the sequential e f f o r t case can be viewed as a s p e c i a l case of the two-period agency problem i n which the p r i n cipal's and the agent's expected u t i l i t i e s depend only on the t o t a l return over the entire time horizon. Thus, the sequential e f f o r t results have potential applications i n such multiperiod s i t u a t i o n s . 88 CHAPTER 5 SUGGESTED FURTHER RESEARCH This chapter concludes the thesis with suggestions for further research. The f i r s t section discusses possible extensions to t h e o r e t i c a l agency r e s u l t s , and the second section discusses possible applications of the agency theory results to a t r a d i t i o n a l accounting topic, cost variance investigation. 5.1 THEORETICAL AGENCY EXTENSIONS A number of generalizations of the results In this thesis are d e s i r - able. For example, i n the a l l o c a t i o n of e f f o r t setting with additive e f f o r t , i t i s desirable to obtain r e s u l t s for a more general class of u t i l i t y functions and for nonindependent d i s t r i b u t i o n s of incomes. A similar remark holds for some of the results i n the sequential e f f o r t setting. The situation with multiple agents was discussed b r i e f l y i n Section 3.6, where the agents were salespeople i n a firm. The important problem of c o l l u s i o n among agents i n order to conceal shirking or the theft of assets has l a r g e l y been unexplored. Beck (1982), however, has recently taken an incentive contracting approach to the problem of c o l l u s i o n for the purpose of concealing the theft of assets. As remarked e a r l i e r , many accounting addressed i n a multiperiod s e t t i n g . and other business issues are best Lambert (1981, 1983) has analyzed special case of the multiperiod agency problem i n which u t i l i t i e s tive over time and the outcomes are independent. analyzed a are addi- Chapter 4 of this thesis a d i f f e r e n t special case of the multiperiod problem. The analysis allows for nonindependent outcomes, and assumes that the agent i s paid only at the end of the time horizon, even though the e f f o r t choices and observations for of the outcomes are sequential. short-term the The analysis i s thus suitable horizons i n which the p r i n c i p a l and the agent are concerned 89 only with t h e i r t o t a l shares at the end of the time horizon. more general multiperiod situations are desirable. Results for These situations are, of course, more d i f f i c u l t to analyze. 5.2 APPLICATION TO VARIANCE INVESTIGATION A great deal of attention has been focused on strategies for investiga- ting the underlying causes of cost variances or deviations from standards. Most of the a n a l y t i c a l research has assumed that investigations reveal the state of a mechanistic production process, and that the investigator can return an "out-of-control" state to an " i n - c o n t r o l " state (Kaplan, 1975). Thus, only the correctional purposes of investigations were examined. Cor- rectional benefits occur, for example, when costs are higher for a malfunctioning machine than for a properly functioning machine. In some s i t u a t i o n s , the primary focus i s on evaluating a manager has control over a mechanistic process. In such s i t u a t i o n s , there may who be motivational as well as correctional benefits to investigating variances. The manager's actions can be influenced by the p o s s i b i l i t y of an investigation i f a reward or penalty i s based on the results of the investigation. The motivational purposes of investigations have recently come to attention in the a n a l y t i c a l l i t e r a t u r e . explored Baiman and Demski (1980a, 1980b) have the motivational aspects of variance analysis procedures i n a period agency model, with a single-dimensional the analyses, effort variable. the agent i s responsible for a production one- In both of process which gener- ates a monetary outcome determined by the agent's e f f o r t and some exogenous randomness. The monetary outcome, owned by the p r i n c i p a l , i s assumed to be j o i n t l y observable, while the agent's e f f o r t i s not. The p r i n c i p a l can, however, conduct a costly investigation i n order to obtain a further imperfect signal which i s independent of the outcome but informative about the agent's e f f o r t . The nature of the investigation strategy was characterized, 90 and the use of the information for motivational purposes was demonstrated. Lambert (1984) extended the analysis by allowing for a nonindependent addit i o n a l signal about the agent's e f f o r t , and showed that the investigation strategy would d i f f e r from that obtained by Baiman and Demski. A number of extensions to the Baiman-Demski analysis are possible. extension i s to allow for multiple e f f o r t decisions by the agent. One Feltham and Matsumura (1979), for example, suggested three d i f f e r e n t e f f o r t decisions the agent might be responsible for: control at the beginning 1) bringing the system back into of the period a f t e r detecting that I t i s out of control; 2) keeping the process i n control during the period given that the process i s i n control at the beginning of the period; 3) influencing or con- t r o l l i n g the operating costs or the outcome during the period. Their analy- s i s did not focus e x p l i c i t l y on the tradeoffs between the e f f o r t s expended by the agent. Instead, the focus was on characterizing the optimal investi- gation strategy and sharing rule for an i n f i n i t e - h o r i z o n Markov process. Another extension to the Baiman-Demski analysis i s the extension to multiple periods. horizon model. One approach would be to extend the analysis to a f i n i t e - Another approach would be to extend the analysis to an i n f i n i t e - h o r i z o n model. It has been argued that i n f i n i t e - h o r i z o n multi- period problems involving two players, the factor that overshadows a l l others i s the players' knowledge that they have arrived at the last play. When the players expect that there w i l l always be another "play" of the game, the appropriate concept i s the repeated game, i n which there are an i n f i n i t e number of plays of the single game (Rubinstein, 1979) . 91 BIBLIOGRAPHY Amershi, A., "Valuation of Performance Information Systems i n Agencies," Faculty of Commerce and Business Administration, University of B r i t i s h Columbia, working paper (1982). and J . Butterworth, "Pareto and Core Syndicates: Risk-Sharing i n Perspective," Faculty of Commerce and Business Administration, University of B r i t i s h Columbia, working paper (1981). Baiman, S., "Agency Research i n Managerial Accounting: of Accounting Literature (Spring 1982), 154-213. A Survey," Journal and J. Demski, "Variance Analysis Procedures as Motivational Devices," Management Science (August 1980), 840-848. and , "Economically Optimal Performance Evaluation and Control Systems," Journal of Accounting Research Supplement (1980), 184-220. and Evans, John H. I l l , "Pre-Decision Information and Participative Management Control Systems," Journal of Accounting Research (Autumn 1983), 371-395. Beck, P., "Internal Control and I r r e g u l a r i t i e s : An Incentive Contracting Model," Graduate School of Management, University of C a l i f o r n i a , Los Angeles, working paper (November 1982). Berger, P., "Optimal Attitude Toward E. Mazze (ed.), ence. American Compensation Plans: The Effect of Uncertainty and Risk on the Salesman E f f o r t A l l o c a t i o n Decision," i n Proceedings of the 1975 Marketing Educator's ConferMarketing Association: Chicago (1975), 517-520. Blackwell, D., "Equivalent Comparison of Experiments," Annals of Mathematic a l S t a t i s t i c s (1953), 265-272. Christensen, John, "Communication i n Agencies," B e l l Journal of Economics (Autumn 1981), 661-674. , "The Determination of Performance Standards and P a r t i c i p a t i o n , " Journal of Accounting Research (Autumn 1982, Part I I ) , 589-603. Cox, D. and D. Hinkley, Theoretical S t a t i s t i c s . (1979) . DeGroot, M., Chapman and H a l l : Optimal S t a t i s t i c a l Decisions. McGraw-Hill: Demski, J . and G. Feltham, Cost Determination: State University Press: Ames (1976). London New York (1970). A Conceptual Approach. Iowa and , "Economic Incentives i n Budgetary Control Systems," The Accounting Review ( A p r i l 1978), 336-359. and D. Sappington, "Delegated Expertise," Graduate School of Business, Stanford University, working paper (1983). 92 Farley, J . , "An Optimal Plan for Salesmen's Compensation," Journal of Marketing Research (May 1964), 39-43. Fellingham, J . , Y. Kwon, and D. Newman, "Ex Ante Randomization i n Agency Models," Graduate School of Business, University of Texas at Austin, working paper #82/83-1-12 (February 1983). Feltham, G., "Cost Aggregation: An Information Economic Analysis," Journal of Accounting Research (Spring 1977), 42-70. , "Optimal Incentive Contracts: Penalties, Costly Information and Mult i p l e Workers," Faculty of Commerce and Business Administration, University of B r i t i s h Columbia, working paper #508 (October 1977). and E. Matsumura, "Cost Variance Investigation: An Agency Theory Perspective," Faculty of Commerce and Business Administration, University of B r i t i s h Columbia, working paper (July 1979). Gjesdal, F., "Stewardship Accounting: Controlling Informational E x t e r n a l i t i e s , " Ph.D. d i s s e r t a t i o n , Stanford University (1978). , "Accounting for Stewardship," Journal of Accounting Research (Spring 1981), 208-231. , "Information and Incentives: The Agency Information Problem," Review of Economic Studies (1982), 373-390. H a r r i s , M. and A. Raviv, "Optimal Incentive Contracts with Imperfect Information," Journal of Economic Theory (1979), 231-259. Holmstrom, B., "On Incentives and Control i n Organizations," Ph.D. tion, Stanford University (1977). disserta- , "Moral Hazard and Observability," B e l l Journal of Economics (Spring 1979), 74-91. , "Moral Hazard i n Teams," B e l l Journal of Economics (Spring 1982), 324-340. Horngren, C , Cost Accounting: A Managerial Emphasis ( F i f t h e d i t i o n ) . Prentice H a l l : Englewood C l i f f s (1982). Itami, H., "Analysis of the Optimal Linear Goal-Based Incentive System," Department of Commerce, Hitotsubashi University, working paper (March 1979). Johnson, N. and S. Kotz, Continuous Multivariate D i s t r i b u t i o n s . Wiley: New York (1972). Kaplan, R., "The Significance and Investigation of Cost Variances: Survey and Extensions," Journal of Accounting Research (Autumn 1975), 311-337. L a i , R., "A Theory of Salesforce Compensation Plans," Ph.D. GSIA, Carnegie-Mellon University (1982). dissertation, 93 Lambert, R., "Managerial Incentives In Multiperiod Agency Relationships," Ph.D. d i s s e r t a t i o n , Stanford University (1981). , "Long-Term Contracts and Moral Hazard," B e l l Journal of Economics (Autumn 1983), 441-452. , "Variance Investigation i n Agency Settings," Kellogg Graduate School of Management, Northwestern University, working paper (1984). Lehmann, E., Testing S t a t i s t i c a l Hypotheses. New York (1959). John Wiley & Sons, Inc.: Mirrlees, J . , "Notes on Welfare Economics, Information and Uncertainty," i n M. Balch, D. McFadden, and S. Wu (eds.), Essays on Economic Behavior Under Uncertainty. North-Holland (1974). Peng, J . , "Simultaneous Estimation of the Parameters of Independent Poisson D i s t r i b u t i o n , " Ph.D. d i s s e r t a t i o n , Technical Report #78, Department of S t a t i s t i c s , Stanford University (1975). Radner, R. and M. Rothschild, "On the Allocation of E f f o r t , " Journal of Economic Theory (1975), 358-376. Rubinstein, A., "Offenses That May Have Been Committed by Accident - an Optimal Policy of Retribution," i n S. Brams, A. Schotter, and G. Schrodiauer (eds.), Applied Game Theory. Physica-Verlag: Wurzburg (1979). Shavell, S., "Risk Sharing and Incentives i n the P r i n c i p a l and Agent Relationship," B e l l Journal of Economics (Spring 1979), 55-73. Steinbrink, J . , "How to Pay Your Sales Force," Harvard Business Review (July-August 1978), 111-12 2. S t i g l i t z , J . , "Incentives, Risk, and Information: Notes Toward a Theory of Hierarchy," Bell Journal of Economics (Autumn 1975), 552-579. Weinberg, C., "An Optimal Commission Plan for Salesmen's Control Over Price," Management Science ( A p r i l 1975), 937-943. , " J o i n t l y Optimal Sales Commissions for Nonincome Maximizing Salesforces ," Manajement_S£ience_ (August 1978), 1252-1258. Zimmerman, J . , "The Costs and Benefits of Cost Allocations," The Review (July 1979), 504-521. Accounting Appendix 1 Table I I One-parameter Exponential Family Q f(x|a) = exp[z(a)x - B(z(a))]h(x) Exponential f ( x 1 I v c t ' . -x1 . MUT^W a ) Normal 1 "^= e X-(x-M(a)) Pl_ 1. 2 E(x|a) M(a) M(a) Var(x|a) M ( ) o 2 a Gamma X x " e" n n 1 Poisson Binomial , ._(M(a)) expI-MCa)]^^- Xx x M r(n) / M(a) (= n/X) ,n M(a) ( x )<-^) M(a) M(a) 2 X w d" M(a)"" -^0 M(a) M ( a ) [1 - Hll] n z ( a ) ~ MTaJ X n " HTaT In M(a) l n M(a) - In (n-M(a)) a B(z) B*(z) -In (-z) , ( , n) n In z v 1 -z 2 oz n -z 1 -2 n 7 Note: 2 —a z 2 2 ° 7 , exp (z) x v n e nz - n ln[ i z ] z 1 +e r 1 _i_ , exp (z) v r . e * .. p<z) ne z z 1 +e ne Z ^ 7 The exponential d i s t r i b u t i o n i s a special case of the gamma d i s t r i b u t i o n but i s l i s t e d separately because of i t s wide use. 95 The following calculations for the one-parameter exponential family, Q, given i n Table I I , w i l l be useful i n the proofs of the results i n Chapters 3 and 4. _a f d ln f da = ^ - [z(a)x - B(z(a))] da z'(a)x-B'(z(a))z (a) , z'(a)[x-B'(z(a))] (Al.l) z'(a)(x-E(x|a)). f (x|a) / ( | dx = / z'(a)(x-E(x|a))f (x|a)dx f x a ) a z ' ( a ) f / xf(x|a)dx - z ' ( a ) E ( x | a ) / f (xla)dx da ' ' a 1 J z'(a)^- B'(z(a)), aa since E(x|a) =B'(z(a)) and / f (x|a)dx = 0 = (z'(a)) B"(z(a)). 2 f^(x|a) (A1.2) f ) f(x|a)dx f (x|a) da Note that (z'(a*)) 2 / (x-E(x|a*)) f(x|a)dx 2 / (x-E(x|a*)) f(x|a)dx 2 = / (x-E(x|a) + E(x|a) - E(x|a*)) f(x|a)dx 2 = / (x-E(x|a)) f(x|a)dx + (E(x|a) - E ( x | a * ) ) 2 2 - 2(E(x|a) - E(x|a*)) / (x-E(x|a))f(x|a)dx = Var(x|a) + (E(x|a) - E ( x | a * ) ) . 2 96 Therefore, f!(x|a) dx = ( z ' ( a * ) ) H ^ 2 (B"(z(a))) f (x|a) a* (A1.3) = (z'(a*)) B"'(z(a*)). 3 f (x|a) / f 3 f(x|a) aa z'(a*) - i y / (x-E(x|a*))f(x|a)dx da (xla)dx = z'(a*) - \ da (E(x|a) - E(x|a*)) = z'(a*) B'(z(a)) da = z'(a*) (B"(z(a))z»(a)) = z ' ( a * ) [ B ' " ( z ( a * ) ) ( z ( a * ) ) + B"(z(a*))z»'(a*)]. , 2 Second Best, Additive E f f o r t Example (Section 3.5) Suppose the p r i n c i p a l i s r i s k neutral, <Kx ,x |a ,a ) = f ( x | a ) f ( x | a ) , 1 2 1 2 1 1 2 V ( a , a ) = V ( a + a ) , and 1 2 1 lAs) = 2JT~ Then 2 . s * ( x x ) = R (a*), where l f 2 2 R(a*) - X + ^ ± f a ( f x il l> a i x ^ 2 (A1.4) Let K(s,a_) = / 2 R(a*)4>(x|a) dx EU(s*,a) = K(s*,a) - V ( a and the p r i n c i p a l ' s expected / (x G(s*,a) = The + x L Hamiltonian, with : . + The 2 utility is <Kx|a) dx. 2 s*(x), i s 2 H = G(s*,a) + The 1) A(EU(s*,a) - u) + 3EU(s*,a) Z u k=l — k f i r s t order c o n d i t i o n s are 3G + X»0 + That i s , which imply G a, + % + u,K u Z . , i=l G and »l< ^ + ( - 0, i 3a. 9a . i J " V") K a a K a i a ' "> V 2 + + j=l,2. ^ ( ^ - V") ^ 2 ^ " "> i. a, a, + 11 u»K Hi. a, a- = G n~2 = 0 , j-1,2. =° V • a_ "2 + U.K T. a, a„ *1~2 + P~K a„a 2°2 at a* = ( a i * , a * ) 2 I.e., / 2R(a*) f ( x | a ) f ( x | a ) d x ^ - / 2R(a*) f ( x | a ) f ( x | a ) d x ^ - V (a _ 1 and = 0 that where a l l the f u n c t i o n s are e v a l u a t e d 2) utility a ) - R (a*)) 2 agent's expected ] > 1 1 2 2 2 2 + a ) = 0 + a ) = 0 2 , ] 2 98 Thus, 3a, J [ X+ E i=l 2 a f ] f ( x | a ) f ( x | a ) dx dx 2 ] f ( x | a ) f ( x | a ) dx dx 2 1 f ( 0 1 2 2 1 '> ( 1 1 2 2 L , which implies that f 3a, ' U l ^ I ' V — 3 — since a d x i = 3 -3a7 1 : y f(x„ a„) dx„ 2 2 2 v i • ! V L - T ^(^1^) l Therefore, 1^ / - j — 2 2 f 3 r d x i = for i * j . 0 a d x l = (A1.6) dx « "2 / 2 Let J denote the quantity i n (A1.6) G can be written as a l -JL [ECxJa^ E(x |a )] + 2 * 2 / [ X + 2X^ f f a + 2X 2 "2 — ! 2 ^ + 2 } [ ^ 2 l a 2 .-r-] — ] •f(x | a ) f ( x |a )dx dx ][ 1 2 2 1 2 + 2 I * l 99 9E(x |a ) 1 1 f(x |a )dx 1 . 2 9 /• a j l 1 1 } f(x |a )dx 1 f f a, 1 1 a. fCxJa^ f(x |a)dxdx ] 2 2 1 2 The last term In the sum i s 0 when evaluated at a* because x^ and x 2 are conditionally independent and f ^ l a , * ) ^ f(x |a *) 2 2 ' f< x l 2* a 2 ) d x 2 " °' Thus, at a*, SECxJap - 2XJ - u dx^. 2 f(x |a )dx + 1 1 1 * "i 2 9a, l-r f a a 1 ( x 1 l l | a ) d x l 1 1 / f(x |a )dx ]} 2 2 2 100 and K a f(x |a )dx = K l 2 a a 2 2 2 ] = 0 2 l a Therefore, (A1.5) can be written as 3E(x |a ) 1 2 , 1 - 2 XJ l a dx + 2 2 i a l f a ( i a i x l! l a ) d l x a* 9E(x |a ) 2 2 - 2XJ - 9a U r a a f 2 ( x 2 2 2 a f 2l 2 a ) d x 2 dx„ (A1.7) 2 For the exponential family, using the results i n Table II and equations (A1.2) through (A1.4), (A1.7) can be written as B"( (a *))z'(a *) - ^[(z'Ca^)) 2 1 B ' " ^ ^ * ) ) - 2(z'( *)) 3 1 B "(z(a *)) 3 , a i 1 - 2z'(a *)z"(a *)B"(z(a *))] 1 1 1 = B"(z(a *)z'(a *) - ^ [ ( z ' ( a * ) ) 2 2 2 3 B"'(z(a *)) - 2 ( z ' ( * ) ) 2 - 2z'(a *) z " ( a * ) B " ( z ( a * ) ) ] , 2 or 2 2 a ? 3 B " ' (z(a *) ) 2 101 B"( (a *))z'(a *) + ^ [ ( z ' ( Z 1 1 a i *)) B'"(z( *)) 3 a i + 2z'(a *)z"(a *)B"(z(a *))] 1 1 1 = B"(z(a *))z'(a *) + ^ [ ( z ' ( a * ) ) 2 2 B"'(z(a *)) 3 2 2 + 2z'(a *)z"(a *) B"(z(a *))] 2 2 (A1.8) 2 Equation (A1.6) can be written as (see equation (A1.2)) W (z'(a *)) L 1 2 B " ( z ( * ) ) = P2(z'(a *)) a i 2 2 B"(z(a *)) 2 . (A1.9) 102 Appendix 2 Normal Distribution Calculations This appendix contains calculations for the bivariate normal distribution, the only d i s t r i b u t i o n with a convenient representation for dependent random variables. I 2 X o-^a) p(a) 0 ^ ( 3 ) 0 - 2 ( 3 ) Suppose x ~ N( 6 ( 3 ) , £(§)), with E = p(s) 0^(3)0-2(3) \ where s_= (.a l t 6(a) = ( e ^ a ) , 9 ( a ) ) . and a ) 2 T 2 1 f (x ,x, |a. , 3 , ) = 1 x B = l" l 2 9 X l" l 9 — — X 2" 2 9 x 2" 2 2 a l °1 a 2 =— ], where (A2.1) 9 i ) - 2p(-L_^)(^—±) + (-^-^ Z 2 Then B exp[ 2 o" (§) • °2 Let D denote the argument i n the exponentiation i n (A2.1). The following quantity plays an important role i n the determination of the optimal sharing rule. 9f / f aa. 1 9 9 - 5 — [- log 2TT - log 9a log f = 9a 1 a - log 2 o_ 6 1 x 6 s 1 - \ l o g ( l - p ) + D] 2 (1) (1) (1) \ °2 - — + — + D(1) , °L °2 1-p p p V 2 ; e 103 where the superscript ( i ) denotes d i f f e r e n t i a t i o n with respect to a^. -1 x CD [ i l l ^ B ^ B ^ (1-PV ] and 1-p* r i -^V^rV^ (i)x r e i e 1 o^ x 2 1 _ e 2 2 -9{ a -(x -9.)a5 x,-9 1) 1) 1 - 2 P [ — i — 9 1 n-^-i 2 l) 1 at x.-9 -g-^^-Cx^-g )a\ ~ 2p[4-^1[ ] 2 a. 1 -t£ > -(x -e ) a j x_-9 1 0 + n—r n— 1 ) 7 1 • jL 2 af 2 Case ( a ) : 9(a) =. ( ^ ( a ) , f cf, ov, 1 2 9 (a)) T 2 . , 2 1-p , , 2 2 (1-p ) „ ., 2. 2(l-p ) Since f i s symmetric, simply replace the ( l ) ' s with (2)'s to get f (2) (2) _!?. . _ A - . ^ f °L / 2 _ + ^ _pP\ 1-P 2 P P 0 ( N 2 ) B (1-P ) 2 2 _ 1 B (2) 2(l-p ) 2 The optimal sharing rule i s not separable i n x^ and x 2 and i s not a com- mission scheme. Case (b): p = constant * 0. The optimal sharing rule w i l l c l e a r l y not be a commission scheme. Of course, i f p H 0, the optimal sharing rule w i l l be separable i n x^ and x 2 but w i l l not be a commission (linear) scheme. 104 Case ( c ) : p = constant and a 8 - r • r 2 = constant. (x -9 ) 2 a r t 1-p p ( " -^r (^> x -e ) ( 2 e^c^-e^) + • 2 a a 2 12 + 1-p ) 2 1 2 + J . 2 °2 The optimal sharing rule w i l l be a commission scheme with the c o e f f i c i e n t of X l e q u a l t (l) 0 fl r 1 . 1 2 [ 1-p n f l / (l) 1 * M,( l v Q w 2 —") «— a. 7 2o o^ 1 2 (2) (2) A P ^ i 2 + 2U „ ( 2 o V Oj a,— o—_ ) 1 2 J . The c o e f f i c i e n t of x i s 2 P . 2 1 1-p 2 a 9< 9 ^ a. a ' 1 2 2 K 2 2) a 2 a. a 2 V p0< > 2 0 1 2 Lemma 2A.1 characterizes some properties of the optimal second best solution f o r p a r t i c u l a r u t i l i t y functions when the d i s t r i b u t i o n of the outcomes i s bivariate normal. The calculations i n the proof w i l l be useful i n the proofs of propositions i n Section 3 . 5 . k Lemma 2 A . 1 . Suppose (x^, X 2 ) ~ ^ ] l l a N C A ^ » 2 CT where E = ( 1 p o l °2 _ ) . Suppose further that the p r i n c i p a l i s risk neui o 1 l J0^ t r a l , U(s) = lis, and V(a_) = V(a^ + a.^)' assuming the i n t e r i o r characp a a T n e n 105 t e r i z a t i o n of the optimal sharing rule, s(x ,x_) = (X + Ey.f /f ) , i s 2 1 v a l i d for almost every (x^ ,^2), the following results hold: (1) a j * > 0 , &2* > 0 , k^ = k • and (2) k^*k or a * = 0 . V S — In t h i s case, V =[ W Pk (x -k a ) 1 " 2 2 2 = 2 2 ~ 1 2 2 2 = (X+ 2 £ yf /f y ^ Let ) ^ pk (^(I-P) y2k . 2 5— and x = o (l-p ) Z 2 ^1^1 2 2" a (l-p ) — • a^l-p ) EW = E(x^ + x - s(x!,x )) C 2 = 2 l / d - P) . oTcZ 0 s(xi ,x ) > ' ]/(1_p PkjC^-k^) 2 2 [ 2 i 1 k (x -k a ) 2 2 dTdZ 0^ I " 2 0. 2 Proof of Lemma 2 A . 1 . a 2 implies that the optimal solution i s a boundary solution, 2 i . e . , a^* = and = o" imply that y^ = y « 2 2 2 l 3 l + k a 2 - E( X + = k l 3 l + k a 2 1 - w E( X + 2 6 2 = k 2 "^h principal's expected return i s C L a k l 3 l " o, k l a 1 l * + c g n C A 1 + 2 k 2 2 3 " ) k 0. 2 2* a 2 106 Letting k = ± V r*V i * x C and y ± = £ i~ i i k a , g the p r i n c i p a l ' s expected i return can be written as 2 l + A ) ^ - E( i a i + k a 2 - (X + A 2 - < a, E k i i a + k C x 2 " l 2 t 2> 2 C 2 2 " ( x+ a - C k + k a 2 i a i 2 i A C + A i i + A ) -i-i L = k. SEW 3a, 2 a, > 1 Cov(y ,y ) - C 1 2 - C 2 2 - 2C C p 1 (A2.2) 2 , 1-1,2, and pk p pk V [ a (l-p') " i " 1 , i,j-l,2, i * j. j ± If x^ and x >< C 2 2 Ak 3EW 3a, 2 1 L (yi) 2 2 - 2(A + A V a r Var(y ) - 2 0 ^ k ± c 2 C )^ L o " i ) 2 - (A + Aj_ + A ) 2 3EW k 2 a g < E l l k X k are independent, then p = 0 and = k - 2Ak p /a 2 ± 1 2 i Letting a = a^ + a , the agent's expected u t i l i t y i s 2 EU = 2 E( A + £ p.f / f j 2 k a - k.a * 1- k a 2 2 2 i, . 2 1-p 2 a 2 k v v l h 2 a 2 P2k (- 2 ~ vuk p p.k. P - k a * 1 " P 3EU 3a I ) - V(a) a 2 2 ^ j V ? 2 a l 2 0 p k p -) ] " V(a) 1 1 °i °o 1 2 •) - V'(a), i,j=l,2, i * j. 107 2k C = — i V'(a), 1=1,2. (A2.3) a The f i r s t order conditions require that (A2.3) i s zero for 1=1,2. There- fore, l l k C a If k^ = k \ that C = o^, then (A2.4) implies that implies that Lemma 2A.1. 2 2 °2 and 2 k = = C , which i n turn 2 (assuming p * + 1). This establishes result (1) of Note that i f p = 0, then setting (A2.3) equal to zero shows > 0 and > 0, since V (a) i s assumed to be p o s i t i v e . The Hamiltonian i s H = EW + X(EU-u) + E \i 3EU/3a J-l jw. •—- = k - 2[ X + i fe ± j = j . J 2 k.C. k.C, E (a.-a *) ] j j c 1 CTj - V"(a) E u j-l ± i-1,2. (A2.5) J 3H Setting — = 0 for 1=1,2, and l e t t i n g P denote the quantity i n (A2.4) 1 6 3 yields k t - 2P[ X + P(a +a x = k 2 It i s impossible 2 - a *-a *) ] 1 - 2P[ X+ P(a +a 1 2 2 - *-a *) ] . a]L (A2.6) 2 to s a t i s f y equation (A2.6) unless k^ = k , which estab2 l i s h e s result (2) i n Lemma 2A.1. Q.E.D. 108 Appendix 3 Chapter 3 Proofs Proof of Proposition 3.1.1. The p r i n c i p a l ' s problem i s Maximize EW(x-s(x)) = / W(x-s(x)f(x|a)dx subject to EU(s(x)) - V(a) = u. The f i r s t order condition for s*(x) requires that - W'(x-s*(x))f(x|a) + X U'(s*(x))f(x|a) = 0, or W'(x-s*(x)) = XU'(s*(x)) . This implies that x - s*(x) = W' (X U'(s*(x))) -1 = T(s*(x)), with T'(s*) > 0 since X > 0. Therefore, x = T(s*(x)) + s*(x) =Y(s*(x)), with Y'(s*) = T'(s*) + 1 > 0 . Thus, s*(x) = Y ( x ) . - 1 Q.E.D. Lemma 3A.1 below w i l l be used i n proving Proposition 3.2.1. Lemma 3A.1: Suppose f(x|a) = expected u t i l i t y n II f (x. |a.) and that the risk-averse agent's i=l i s pseudoconcave i n a. Suppose further that F a (x.|a.) < 0, with s t r i c t i 1 3EW x^-values. Then for i = l , . ..,n, i f Proof of Lemma 3A.1: (1) Inequality for some 1 > 0 . < 0, -g^— The f i r s t order conditions are (x|a*)dx + £ u.* { jb(s|x)f _ ( «)dx - V i i j i j j=l i=l,•..,n, and /W(x-s(x))f a 2 a 3 a 3 } = 0, 109 tW'(r(x)) n/ i w fa. (xia*) -'- n U'(x-r(x)) j j f(x|a*) f ( x . a.*) ] 3 « a = A* + Z u.* — r f (x.|a.*) j-l J 1 3 J J because of the independence assumption. Here, subscripts a^ and aj on f ( •) and V( •) denote p a r t i a l d i f f e r e n t i a t i o n with respect to a^ or a j , tively; A* and n *, j=l,...,n, are the optimal values of the multipliers i n the second best problem, and r(_x) = x-s(x). Suppose some < 0. Without loss of generality, l e t j = l . Consider the following a u x i l i a r y problem: Max S J W(x-s (x))f(x|a*)dx + A* [ / U(s (x))f(x|a*)dx - V(a*)] x x A n + Z \x* [ j U(s,(x))f (x|a*)dx - V (a*) ] , J=2 3 3 where a*, A* and i ^ * , . . . , ^ * terized by (1) and ( 2 ) . For x E X = { x with X j such that f ^ x ^ a j * ) > 0 } , 1 + r-yr- U'(x-r(x)) correspond to the optimal solution charac- Let r^(x) = x - s^(x). „,/ / W'(r(x)) rrrp- respec- f (x.|a.*) a. 3 3 j n = A* + Z U.* < A* ( x . | a, . *^) j=l . . 3 f-3. J J J J W'(r (x)) A U'(x-r (x)) * A f (x.|a.*) a. 3 3 j n + 1 Z \1 * j=2 . _ j T 3- f_v (x.|a.*) J J J 110 W(r(x)) Note that 777-; . . U'(x-r(x)) N r ^(5) * s a n i s decreasing i n r(x) for every fixed x. — — Further, increasing function of x^, since 3r 3r W " ( r ( x ) ) _ J u ' ( x - r ( x ) ) + W < •) U " ( • ) ( ! - ^ ) x x 2 1 U' L ° = Z ^X W'U'' implies that ^ - = „, .y, , , , > 0. r + W D Now W'(r(x)) r r r 7777 W(r(x)) 7~~\T decreasing i n r W'(r (x)) x < U'(x-r(x)) 7~sT 7777 a n 7777 d U'(x-r^x)) U'(x-r(x)) implies that r(x) > r^(x), for a l l x e X^ . Correspondingly, + r(x) < r (x) on X j _ = { x with Xj^ such that f ^ C x ^ J a ^ ) x Therefore, /W(r(x))f (x|a*)dx - / w ( r , ( x ) ) f a. ^ }. (x|a*)dx A cl ^ = t ( <*>> - W(r (x)) ] f ^(x|a*)dx + L [ W(r(x)) - W(r (x)) ] f ( »)dx > 0. l l+ w < 0 r a x X A a (x|a*)dx > 0. It remains to show that / W(r,(x))f A 3. ^ ~~* The left-hand side of ~" the expression can be written as j [ / 1 ... / W ( r ( x ) ) f ( x | a * ) . . . f ( x | a * ) d x . . . d x 2 n 2 x x x n 2 2 n n 2 n ] f^^ 1 |a *)dx 1 ]L = L T(x )f ^(x, |a *)dx > 0, as i n the one dimensional case, because of X, l a , 1 1 1 n 1 1 Ill stochastic dominance and the fact that T'(x,) = f W 1 J 3r . « f ...f dx ...dx > 0 . 3x^ 2 n n 0 Q.E.D. Proof of Proposition 3.2.1: Let A = / W(x-s(x))f(x|a)dx and B = / U(s(x),a)f(x|a)dx. Subscripts i and j on A and B w i l l denote p a r t i a l d i f f e r e n t i a t i o n with 3H respect to a^ or a j , respectively. for n=2 are (1) A (2) A + : + 2 The f i r s t order conditions = 0 + UpB^ = 0 and P B L + 2 1 = 0, where the functions are evaluated at the optimal a* and with the optimal s*(x). In matrix notation, A + B y = 0, l " l 0 l l 12 where A = ( ), B = ( ), y = ( ), and 0 ( ). A B B y2 0 A B B 2 2 1 2 2 If B i s s t r i c t l y concave i n a, then | B | * 0 and B -1 1 B 22 B 12 A I.e., A (3) ^ = (4) U2 2 12 B A l 22 B iBl V21 " Vll rgi and l - 1 exists. Therefore, 112 If B i s s t r i c t l y concave i n a_, then | B | > 0 and B ±i Now assume < 0 and < 0. < 0, i=l,2. Then by Lemma 3A.1, A^ > 0 and A 2 > 0. From (3) and (4), we have A 2 12 ~ 1 22 < 0 A 1 21 < B B A B " 2 11 A B These imply that B a n d °* 1 2 < 0 (note: (2) cannot be s a t i s f i e d . B 1 2 = B ). 2 1 But i f B Therefore, not both 1 2 < 0, then (1) and and u can be nonpositive. 2 Q.E.D. Lemma 3A.2 below deals with the problem of a l l o c a t i n g e f f o r t to two tasks considered Lemma 3A.2. simultaneously. ( F i r s t Best, Additive E f f o r t ) Suppose E(x^) = k j ^ * i=l,-..,n. (1) If k^ = k, for a l l i , then k = X V ( Ea^) implies that any nonnegative vector a_such that Ea^ s a t i s f i e s (2) If some k^ [1] below i s Pareto a boundary solution r e s u l t s . are zero except one. optimal. That i s , a l l the a^'s In the n=2 case with k^ > k , a^* > 0 and 2 a * = 0. 2 Proof of Lemma 3A.2. The principal's problem i s Maximize / (x-s(x)) g(x|a) dx s(x), a subject to / [ U(s(x)) - V(a) ] g(x|a) dx > u. H = / (x-s(x)) g(x|a) dx + X { / [ U(s(x)) - V(a) ] g(x|a) dx - u } . 1 31 -g- = -g + XU'g = 0 implies that U'(s(x)) = ~y which implies that s(x) = U ' - 1 ( i ) = C. 113 -g-= /(x-s(x)) g (x|a) dx + X { / [ U(s(x)) g ( •) ] dx - V ( E a . ) } = 0 . 3E(x|a) - 0 + X(0-V'(Ea )) = 0 implies that This establishes result (1) of Lemma k = XV ( E a ) for a l l i . [1] 3A.2. To establish result (2), r e c a l l that a^* and a * are nonnegative by 2 assumption. Let s*(x) = C*, where s*(x) i s the optimal sharing rule corre- sponding to the optimal choices a^* and and a a 2 *« ^ > 0, be a feasible e f f o r t pair given 1 2 ( i'» e t a a 2'^» w n e r e i' * 0 a C*. The agent's expected u t i l i t y for any feasible ( a i , a ) i s 2 C* - V(a + a ) = u . 2 1 Since ( a ' , a ' ) i s f e a s i b l e , C* - V(a ' + a ') 1 2 x ( a " , a " ) = ( a ^ + a ', 0). 1 2 2 2 = u. Consider the pai r This pair i s also f e a s i b l e , since a^ ' + a " 2 = a^' + a ', and the p r i n c i p a l i s s t r i c t l y better o f f with ( a i " , a ' ' ) 2 2 since his expected return i s k ^ " + k a " - C* = ^ ( a ^ + a ') 2 Therefore, 2 2 - C* > k ^ ' + k a ' - C* i f k 2 2 L > k. 2 a ' > 0 Is not optimal, and hence the optimal e f f o r t pair i s 2 such that a^* > 0 and a * = 0 . 2 Q.E.D. Proposition 3A.3 Proposition 3A.3. below compares the solutions to two one-task problems ( F i r s t Best, Additive E f f o r t ) Suppose E(x^) = K^a^, i l > 2 , and that k^ > k . = 2 Consider the two sepa- rate problems where e f f o r t i s devoted only to task i . Then (1) a^* > a * i f V i s increasing and convex, (2) a^* > a * implies that s^* > s * ( i . e . , the agent i s paid more 2 2 2 for exerting a^* at task 1 than for exerting a * at task 2), and 2 114 (3) the p r i n c i p a l i s better off with a^* > 0 and a.^* = 0 than with = 0 and a^* a^* Proof of Proposition 0. > 3A.3. The principal's problem i f e f f o r t i s devoted only to task I i s Problem i : Maximize / (x - s^(x)) g^(x|a^) dx s ^ x ) ^ subject to J u ( ( x ) ) g ( x | a ) dx - V(a ) >u i S l 1 9H. . = 0 implies that s (x) = U'~ 1 . (_ X dS^ 3H ± „ = 0 )= C and k ^ i . 1 '< i> = A V'(a ) - -r- - — r — = - . A-i (c. X" X X i F e a s i b i l i t y requires that U ( U'" which implies that U'" 1 [1] X A^ V implies that . (i i 1 a ( 1 - ) ) - V(a ) i ± ) = U'" [ u + V(a.) 1 A = u , ] . 1 Equation [2] implies that A Result (1). i k k^ > k 2 i implies that a^* > a * 2 i f V i s increasing and convex. Proof. Suppose a^* < * « a 2 U (G _1 Then + V(a *)) < U~ (G 1 L (since U ^ - + V(a *)) and V are 2 increasing) [2] [3] -1 , ' (^ V U' _ implies that ( l * J— l a x -1 '< 2* > ) <- U' ^ i — ) by [3] and [4], , _ ^( 2 ) V a ) f k k which implies that V»( *) V(a *) — r > r 2 l ai K x (since U' i s decreasing), or k 2 '( 2* k~~ * V (a *) k _ 2 V a ) > * ( i s n c e v ' increasing and a^* < a * ) , so that i s 2 ^2 ^ ^1 * Therefore, > k 2 implies that a^* > a * . 2 Result (2). a^* > a * implies that s^* > s * . Proof. a^ > a * implies that u + V ( a ^ ) > u + V ( a * ) , which 2 2 2 2 implies that U ( u + V(a *)) > U ( u + V ( a * ) ) - 1 _ 1 L 2 (since U ^ i s Increasing), so that - U'" ( V 1 ) > ' u sj* > s * by 2 Remark : a^* > a * Result (3). 2 - 1 ik- [1] ) b y I !* 3 Therefore, . also implies that > ^ I f ki > k , the p r i n c i p a l i s better o f f with 2 a j * > 0 and a * = 0 than with a ^ = 0 and a * > 0 . 2 2 U6 Proof. It i s necessary to show that 1 2 / (x - S j * ) g ( x ^ * ) dx > / (x - s * ) g (x|a *) dx, 2 that i s , k ^ * - > k a * - C 2 2 2 . 2 [5] Note that ( a * , s * ) i s feasible for Problem 1: 2 2 / U(s *) g ( x | a * ) dx - V(a *) = U(C ) - V(a *) = u . 1 2 2 2 2 2 Therefore, k j ] * ~ j * > l 2 * ~~ 2 * because °^ f e a s i b i l i t y of ( a * , s * ) 3 s k a s 2 2 for Problem 1 and optimality of ( a ^ * , s^*) for Problem 1. Furthermore, k^a * - s * > k 2 * ~ 2 * s a 2 2 b e c a u s e l > 2 ^ ^» k k 2 a n d n e n c e t l holds. 5 Q.E.D. Proof of Proposition 3.5.1: if x ~N(ka ± and an i n t e r i o r solution (a^* > 0, a * > 0) i s optimal, l t 2 then i t must be that a* = a^* + a 2 In Lemma 2A.1 of Appendix 2, i t was shown that = Uj,- Let u = , a* = (a^*, a * ) , and 2 * ' This i n t e r i o r solution s a t i s f i e s the Nash conditions f (xjlaj) a ) f(x |a )f(x |a )dx x - V'( + a ) = 0, a i 1 2 2 i=l,2. 9 The condition for i=l i s f 2 v {"857 1 f (xja^) a kil*i*> a , 2 f + 3 -£7 I or 2 p - i - / (kx ( x a L * 2' 2* a 1 f ( 2 2 3 l l K ) d x l ) L ' f(x la *) - k x 2 K^W^V** * ) •f(x |a )dx 1 1 1 1" - V ' ^ + a )= 0 2 + 2> " °» a 117 2 pk i.e., - V'(a*) = 0, which would also result from the i=2 condition. Hence, there i s r e a l l y only one Nash condition. The principal's expected utility is f / ( + x X l 2 - ( X+ y Z = ka* - X - 2u k 2 2 (x.la.*) 3 i ^ ) ) f(x |a *)f(x |a *)dx dx 1 1 2 2 (see equation (A2.2) i n Appendix 2 1 2 2). The agent's expected u t i l i t y i f e f f o r t a* i s exerted i s f./x^a.*) 2 / ( X+ y f / | x j j 1 a *) > f(x |a *)f(x |a *)dx dx 1 1 2 2 1 2 - V(a*) = u , j which implies that 2X - V(a*) = u . suppose that a =0, the minimum e f f o r t , and that x Now 2 , f (x|a) Consider s(x) = [ X + u 2 i s ignored for a compensation purposes. fOcTa) and a* are the same as i n the i n t e r i o r solution above. is ) where X, u, The Nash condition now a f (x|a) ) f(x|a)dx - V*(a) = 0 , la" or 2 U - L - / (kx - k a * ) f ( x | a ) d x - V'(a) = 0 , that i s , 2vk - V'(a) = 0 , which i s s a t i s f i e d at a=a* The p r i n c i p a l ' s expected u t i l i t y If a=a* i s f (x|a) a / ( x - ( X+ = ka* - y 2 X 2 f(x|a) - ) ) f(x|a*)dx 2 2 y k 2 2 > ka* - X - 2y k , which i s the p r i n c i p a l ' s expected u t i l i t y with an i n t e r i o r solution. Since the agent's expected u t i l i t y i s 118 unaffected, the p r i n c i p a l i s s t r i c t l y better o f f , and the Nash condition holds, a boundary solution i s optimal. Q.E.D. Proof of Proposition 3.5.2; H = E(x + x L The Hamiltonian for the two-task problem i s - s*(x)) + X [ EU(s*(x)) - 2 + Ji,_ - j ^ - [ EU(s*(x)) - V(a x + a ) ] + ^ x + a ) ] [ EU(s*(x)) - V(a + a ) - u ] 2 2 2 The f i r s t order conditions are 3H "5T7 3H = o. [i] = 0 pointwise, [2] [ EU(s*(x)) - V ( 3 l + a ) ] - 0, 2 [3] EU(s*(x)) - V ( a * + a *) = u and L 2 As before, [2] implies that U'(s*(x)) X + Z j = 1 3H 3a, = /^ V'jIV 2 + x 2 U. -=4 1 r- J ^jl^j) - s*(x))f ( x | a ) f ( x |a )dx L + ^ -1^ [ EU(s*(x)) - V( .) ] 3 a l 1 2 2 + 0 119 = 0 . = -gi / (x + x t + u, 2 - s*(x))f(x |a )f(x |a )dx 1 1 [ EU(s*(x)) - V( •) ] 2 2 + 0 = 0 . 9a2 These imply that 9H 9a, 3H 9a, [5] Similarly, i t i s necessary that / U(s*(x))f(x |a )f(x |a )dx 1 = 1 2 = V'(a * 2 1 + a *) 2 / U(s*(x))f(x |a )f(x |a )dx 1 1 2 2 [6] It i s clear that (a^* = a2* and u^* = v^*) constitute a solution to conditions [5] and [6]. Therefore, i f a unique i n t e r i o r solution i s optimal, then i t has a^* = a2* and u^* = v^*• X are determined from conditions The p a r t i c u l a r values of a^*, u^*, and [1], [3], and [4]. Q.E.D. 120 Proof of Proposition 3.5.3: Consider f i r s t the s i t u a t i o n where a the agent's compensation i s based only on xj_. 2 = 0 and Dropping the subscript for convenience, the optimal sharing rule Is f (x|a) a t ( x ) = [ X 0 + ] *b -fUTaT 2 I * = [ XQ + u z'(a*)(x - E(x|a*)) ] , 2 0 where UQ > 0 (Holmstrom, 1979). Recall that E(x|a) = B ' ( z ( a ) ) . The princi- pal's expected return i s / (x - t(x))f(x|a*)dx = B*(z(a*)) - X, - u j ( z ' ( a * ) ) / (x-E(x| a*)) f (x | a*)dx 2 2 2 = B'(z(a*)) - X, - v g ( z ' ( a * ) ) B " ( z ( a * ) ) 2 since 2 Var(x|a*) = B " ( z ( a * ) ) . The agent's expected u t i l i t y i s f (x|a*) 2 ' ( \) % + f(x|a*) ^ ( l *) f x a d x " ( *) V a u . which implies that 2 XQ - V(a*) = u . The Nash condition i s 2 ' ( \) H> f ( x | a * ) + f f O T 2 V 0 2 f ( |a*)dx x ) - V(a*> = 0, a (x|a*) / f(xla*) d X - '<"> V = °• , [1] 121 Now consider the two-task s i t u a t i o n , where fCx-^ja) = f(x2|a) i f x^ = x « 2 Let 2 s(x ,x ) = ( ^ + u Z 1 V ^ i ' V 2 ) fCx^) i = 1 . 2 _y a* where a' = (a^', a ') and a^' = a , -. 2 2 The Nash conditions are now 2 / 2 ( + f ^ y - V'(a L a < iK'> X fCxJap + a ) = 0 1 ( + ^ a i 1 1 2 2 1 2 y i ! i ' > x " Jl - V*( ai and 2 2 2 >f U |a )f(x |a )dx dx a fUJa^) )f(x |a )f (x |a )dx dx 1 1 a 2 2 2 1 2 + a ) = 0 . 2 When evaluated at a', the Nash conditions reduce to £ a, ( x ll l a f , ) " J fCxJa,') 2 d 'l • '< *> ' T a 2 ^ 2 ^ 2 ^ a » / f(x |a ') 2 2 d x 2 The Nash conditions w i l l thus hold at a' i f ^"l'V* P / fCx^a^) fj(x|a*) d x l = Ho / f(x|a*) d x that i s , i f C = yCz'Ca^)) 2 B (z(a ')) = ^(z'Ca*)) , , 1 (see equation (Al.9) i n Appendix 1). 2 B"(z(a*)) [2] 122 [2] Equation i s true i f (z'(a*)) 2 a* (z'Cy-)) B"(z(a*)) _ z'(a*)M'(a*) a* B"(z(|-)) z (— ; M}M' f — } f 4 r ,. 4 ' The p r i n c i p a l ' s expected return i s / (x + x L - s(x ,x ))f(x |a ')f(x |a ')dx dx 2 1 2 = 2B'(z(^L)) " 1 2 2 1 2 \ j - 2 u ( z ' ( ^ ) ) B"(z(|^)) . 2 Since M(a) i s concave, M(a*) < M(|-) 1 2 2 < 2M(|^-) . because M( •) i s concave. If M(0) (Proof: V2M(0) + !/2M(a*) > 0, thenV2M(a*) < M(|-)). That i s , B'(z(a*)) < 2B'(z(-2^-)) . Suppose that Z '' ^ " ' ^ <V z ' ( f - ) M'(|-) The difference between [4] and 2 [4] [5] [6] . [1] is 2B'(z(4r-)) " B'(z(a*)) - 2pC + i^C > 0 because [5] holds and because [3] H, - 2 - H , 11 0 2 , , U z If M(a) y ) K , and y ) [6] imply that i > o. '(|_)M'<f-> i s s t r i c t l y concave, then the Inequality i n [5] inequality, and hence the s t r i c t inequality i n [6] becomes a s t r i c t can be relaxed to be a nonstrict Inequality (<). F i n a l l y , the agent's expected u t i l i t y i n the two-task situation described above i s s t i l l 2XQ - V(a*) = u. Since the p r i n c i p a l i s better o f f with an i n t e r i o r solution which s a t i s f i e s the Nash conditions, a boundary solution i s not optimal. 123 Proof of Corollary 3.5.4: z'(a*)/z'(^*-) <V2 (1) If M(a) = ka, then (3.5.5) reduces to • For the exponential d i s t r i b u t i o n with mean ka, z(a) = (see Table II i n Appendix 1), and hence z'(a)/z'(f) = - i y / _ 4 T ka ka (ii) For the gamma d i s t r i b u t i o n with mean ka, z(a) = (see Table II i n Appendix 1), and hence z (a)/z'( -) = J y ka , (iii) = V4<V2 • / _ ^ . = l/4< ka a V2 • For the normal d i s t r i b u t i o n with mean ka and unit variance, z(a) = ka (see Table II i n Appendix 1). z'(a)/z (|-) = 1 >V2 . , (iv) Therefore, For the Poisson d i s t r i b u t i o n with mean ka, z(a) = l n ka = l n k + In a z'(a)/z'(f) = (see Table II i n Appendix 1). i/|=V . 2 Q.E.D. Proof of Proposition 3.5.5: In this case, B'(z(a)) = ka, B " ( z ( a ) ) z ' ( a ) = k, and B ' ( z ( a ) ) ( z ( a ) ) ( z ' ( a ) ) 1 , , 2 + B' • ( z ( a ) ) z " (a) = 0 . Equation (A1.8) reduces to k + |£ z ' ( a * ) z " ( a * ) B " ( z ( a * ) ) = k + 1 1 z' ( a * ) z " (a *)B' ' ( z ( a * ) ) , 1 2 2 2 which implies that ^ z"( a i * ) = ^ z"(a *) 2 . [1] Equation (A1.9) reduces to U ' ( * ) = li2z'(a *) . lZ ai 2 [2] 124 [1] and [2] together imply that z"(a*) z"(a*) [3] (z'( a i *)) 2 (z'(a *)) 2 2 ' 2 Let v(a) = z ' ' ( a ) / ( z ' ( a ) ) . that a^* = a *. I f v(a) i s s t r i c t l y monotone, then T h i s i n t u r n i m p l i e s , from [1] or 2 [3] Implies [2], that Q.E.D. Examples 1) -1 z(a) = ^ Exponential: ka : l 2 i, a 4 k 1 -2 , z'(a) = — j , z''(a) = — ^ » ka ka a n a * 3 v 2) e (a) = = -2ka is strictly Gamma: z ( a ) = 7 - ° - , so v (a) i s a constant lea g 2 decreasing and = 0. R If pliers Pj and Poisson: 9 and u. * = u m u l t i p l e of v ( a ) . e z'(a) = k, z''(a) = 0, s o l u t i o n i s r e q u i r e d , [2] i n d i c a t e s that the m u l t i p 2 would have to be z ( a ) = l n ka, z'(a) = equal. , z''(a) = - and _ J_ 2 = — H-1 and case, a boundary s o l u t i o n i s o p t i m a l . a v *. • z ( a ) = ka, R e c a l l that i n t h i s an i n t e r i o r i n a, so a,* = a * p^* = u^* Normal w i t h u n i t v a r i a n c e : v (a) 4) / v (a) T h e r e f o r e , a^* = a * 3) . . T h e r e f o r e , the s o l u t i o n i s not unique and ~2 a cannot say whether p^ = P2 . we 125 Proof of Proposition 3.5.6: Equation (A1.6) can be written as ^1(8^) f (x|a) f( | ) = UjKaj*). 2f f 2 d Note that a I* (a) = S x [1] d = / ( x a f ~ 3 a ) > dx and hence equation (A1.7) can be written as I'( *) = I'(a *) . a i Equations [1] and [2] together [2] 2 imply that I'(a *) _ I'(a *) 1 2 ~2 r( Therefore, 2 3 l *) ' I (a *) 2 i f T.'(a)/I (a) i s s t r i c t l y monotonic, then a j * = a * , which Z 2 implies that u^* = l ^ * (equation [1]). Q.E.D. Proof of Corollary 3.5.7. For cases ( i ) - ( i i i ) , an i n t e r i o r solution i s optimal i f (3.5.6) holds. (i) z(a) = M(a), and hence (3.5.6) requires that ( ^ f f or 2 2 2 <V , 2 which i s s a t i s f i e d i f 0 < a <V • 2 (ii) z(a) = ^ ) a r , and hence (3.5.6) requires that M'(a) . M' (—) 2> .2 K , fM'(a) MC—) 2 ,2 ; 126 2 or • (—|— ) 2 o r ,2cr2-2a <ty> , which i s s a t i s f i e d when 0 < a< 1, since 2ct 1 . 1 4^2* (iii) z(a) = l n M(a) , and hence z'(a) = ^ ' M(a) ( a ) = a Equation (3.5.6) requires that a orl ^ 3 2a i.e.,V • 2 - <V , <V , 0 2 ,a..cc-l 1 2 2 which i s s a t i s f i e d i f 0 < a < 1 . O.E.D. Proof of Proposition 3.5.8. In this case, B*(z(a)) = ka, B " (z)z'(a)=k, and B ' " ( z ) z ( a ) + B " ( z ) z " ( a ) = 0. , y Equation (A1.9) says that 2 k z ( a l l l' l* ) = u 2 2 2 k z , ( a 2* ) » [ 1 ] and equation (A1.8) says that k (i) + l ^Vl" z(a) = - i_ Equations and l * 2 a k i + l* ) = k 2 , z'(a) = ka 4. 2 2 " + k z ( a 2* ) * [ 2 ] , and z " ( a ) = ka [1] and [2] become - 1 a ( a P i 2 * 2 "R " [3] e 2 . _- 2 = k 2 + ^ . _ 2 - ,2 127 which together imply that k. - 2R a * = k - 2R a * , 1 e l 2 e 2 2 2 0 or 2R (a * - a *) = k, - k, > 0 e l 2 1 2 2 since k^ > k . Therefore, 2 a^* > a * , and hence, u-^* > u * 2 2 (from [3]). (ii) z(a) - - £ - , z'(a) = -Ar , and z " ( a ) = ka ' * " ' . 2 ' " ,3 ka ka 2 n v v o / An analysis similar to that i n ( i ) establishes the result. (iii) z(a) = ka, z'(a) = k, and z''(a) = 0. k l Equation [2] becomes = 2 » k which contradicts the assumption that kj > k . 2 optimal solution i s a boundary solution. solution has a^' = 0 and a ' > 0' ( a ± * , a 2 the Suppose the optimal It w i l l be shown that there i s a 2 Pareto superior solution Therefore, with a^* > 0 and a * = 0. * ) , 2 optimal sharing rule i f only task two has nonzero e f f o r t i s f (x |a «) 2 ^2 f(x |a «) ^ 2 a < 2> - ( 5 X X "2 + 1 2 2 Z 2 The Nash condition i s (see equation (A2.3) i n Appendix 2) 2k U2 - V ( a ' ) - 0 . 2 2 The agent's expected u t i l i t y i s 2A - V(a ') = u , 2 and the p r i n c i p a l ' s expected return i s (see equation (A2.2) i n Appendix 2) k 2 2 2' ~ * ~ a 2w 2 2 2 2 ' k The 128 Now consider the pair ( a ^ * , a * ) , where a^* = a2' and a * 2 = u 2 i and consider the sharing rule f l> t ( x where = —jk • ( = h X + (x.la,*) f^xja^) The agent's expected u t i l i t y (with e f f o r t l exerted only at task one) i s s t i l l u, and the Nash condition i s 2 2 s a t i s f i e d , since 2 ^ = 2k and a ' = a ^ . 2 Furthermore, the 2 p r i n c i p a l i s s t r i c t l y better o f f because 2 k k^* - X - l{ k 2 = 2 a ' - X - 2U* k 2 k l > k a ' - X - 2u k 2 2 (iv) 2 = Equations [1] and [2] £• u *k 1 2 become a . ~ —« • 3. 1 2 2 z(a) = ln ka, z'(a) = —,and z''(a) U *k ( J - ) 2 2 l a 2 = ^ 2 a P and 2 k l + "L* ~ ' — 2 a l 2 k = k 2 V + k '—71 l * a 2 2 which together imply that 2 k l K R since k i > ko. P R - -E. = k k 2 K x < ^ " 2 R - _E k * 2 17 > • k i " 2 k > 0 Therefore, -r— ~ 1 7 - > 0, which implies that *2 l k k l ^ 2 (contradiction). Hence, a boundary solution i s optimal. k 129 Suppose the optimal solution has a^' = 0 and a ' > 0. It w i l l 2 be shown that there i s a Pareto superior solution ( a i * , a * ) , 2 with a j * > 0 and a * = 0. The optimal sharing rule i f only task 2 two has nonzero e f f o r t i s f s(x 2> = ( X "2 + (x |a ') 2 f(x la ') 2 » 2 ^ 2 X ' = The Nash condition, evaluated at 8 2 ' » i s E x =0 "2 1 2 f -! u x ( x 2l*2 f ) f(x la ') l 2 a " ; V ' ( a 2 , : > ° » = 2 that i s , (see Appendix 1 ) , ^(z'ta^)) "2 2 "-^2 2 B"(z(a ')) - V ( a ' ) = 0 , 2 k 2 2' ~ a 2 V , ( a 2 , : > - » 0 a "2 2 k which implies that ; a V'(a ') = 0 . 2 The agent's expected u t i l i t y , evaluated at a2', i s 2X - V(a ') = u , 2 and the principal's expected return i s . k Now , 2 2 a .2 " X "2 2 k ~ * consider the pair ( a ^ * , a * ) , where a j * = a ' and a * = 0, 2 and consider the following sharing rule, 2 2 130 where = ^— The agent's expected u t i l i t y (with • effort exerted only at task one) i s s t i l l u, and the Nash condition i s s a t i s f i e d , since k^ principal = k 2 ^2 a n d l* a = 2' * a Furthermore, the i s s t r i c t l y better o f f , because \ i 2 2 \ 1 2 V 2 _ k,a * - A - -!-4 = k,a ' - X - k *1=1 1 2 lV 2 2 "2 2 —-j-. 2 > k a ' ~ X 2 2 Q.E.D, Proof of Proposition 3.5.9. 2 ^ / 2 ( x + j = The Nash conditions require that 4> ( l a ) x a ) <«x|a)dx \ ^-wnr- V ' ( * + a *) = 0, a i 2 j=l,2. For j=l, the condition i s 8 a 2l 2*) g(x |a *) 2 2 \ I f'xja^) d x l + Zv 2 I a ( x 2 2 * f (x !a *)g(x |a *)dx a i 1 1 2 2 - V ' ( * + a *) = 0, a i 2 which reduces to a l l* $ ffrja^) f 2 \ 2 ( x | a > d x l= V'( * a i + a *) . 2 Since f( •) belongs to Q, the condition can be written as (see Appendix 1) 2u (z'(a *)) i 1 2 B"(z( *)) a i = V ' ( a * + a *) . x 2 Since V > 0 and B " ( z ^ * ) ) = V a r ^ ^ a ] * ) > 0, analysis for j =2 shows that j ^ * > 0. j^*> 0 • A similar 132 Appendix 4 F i r s t Best The p r i n c i p a l ' s problem i s to Maximize / / W(x-s(x^ .x^) ) <J>(x^ ,x 1 a^,a ( •) ) dx dx^ s( •) , a , a ( •) 2 1 2 2 2 subject to / / [U(s(x ,x ))-V(a ,a («))]'()(x ,x |a ,a (0) 1 where ^(x^ ,x |a^^ ,a ( •)) 2 2 = 2 1 2 1 2 dx^x^^ > u, 2 1 f (x^ | a ) g ( x \-x^ ,a ( •)) and a ( •) indicates that 1 2 2 2 the agent's second-stage e f f o r t i s i n general not a constant, but rather can depend on any information available at the time of choice. described, a may depend on x^. 2 / / W(x-s(-))<t'(-)dx dx 2 In the scenario The Hamiltonian Is + X / / [U(s(-))-V(-)]<l>(Odx dx . 1 2 1 D i f f e r e n t i a t i n g the Hamiltonian pointwise with respect to s y i e l d s -M'lf + X U'<|> = 0 f o r almost every ( x ^ . x ^ , which implies that W'(x-s(x ,x )) r-r— = X for almost every (x. , x „ ) . ,. , 1 u 1) Q S ( , X ^ , X 2 2 , ) } 1 (A4.1) i. Risk averse p r i n c i p a l , r i s k neutral agent ( i . e . , U' = 1). Equation (A4.1) implies that W ( x - s ( x i , x ) ) = X f o r almost every 2 ( x i , x ) , which implies that x - s ( x i , x ) i s constant for almost every ( x i , x ) , 2 2 2 which i n turn implies that s ( x , x ) = x-c, where c i s a constant. 1 be shown below that a 2 2 i s independent of x^ i f x^ and x 2 It w i l l are c o n d i t i o n a l l y independent, i n which case c = E(x|a*) - V(a*) - u. 2) Risk neutral p r i n c i p a l , r i s k averse agent (W'(x-s(«)) = 1). Equation (A4.1) implies that U ' ( s ( x i , x ) ) = constant for almost every 2 ( x i , x ) , which implies that s( •) i s a constant for almost every ( x j _ , x ) ' « 2 2 x^ and x 2 are c o n d i t i o n a l l y independent, then a s( •) = U ( u + V(a*)). - 1 2 i s independent of x^ and If 133 3) Both individuals r i s k averse. Equation (A4.1) implies that x - s ( x , x ) = W' ( AU' ( s ( x , x ) ) ) = -1 1 G(s(x)), where G' > 0. H' > 0. 4) 2 x 2 Therefore, x = G(s(x)) + s(x) = H(s(x)), where Thus, s(x) = H ( x ) , where H'" _ 1 > 0. 1 Both individuals r i s k neutral. ^ In this case, the agent's expected u t i l i t y constraint implies that s ( , x ) = u + V(a*). X l 2 The choice of the agent's e f f o r t decisions w i l l f i r s t be examined i n the simplest case, where the p r i n c i p a l i s r i s k neutral, the agent i s r i s k averse, and the outcomes are conditionally independent. ( K x - p X ^ a ^ . a ^ •)) That Is, f (x^ | a^)g(x | a ( •) ) > where we allow for the p o s s i b i l i t y = 2 2 that the second e f f o r t decision depends on the f i r s t outcome. Since the optimal sharing rule Is s ( x i , x ) = s (constant), the function to be maxi2 mized i s / / (x-s) ^(x! ,x laj^ ,a ( •))dx dx 2 2 2 1 + A[ / / (U(s) - V ( a a ( 0 ) } « x , x | a , a ( « ) ) d x d x 1 > or, 2 1 2 1 2 2 1 - il] , ignoring constants, / x f(x |a )dx 1 1 1 + / / x f(x!|a )g(x |a (•))dx dx 1 2 1 2 2 2 1 (A4.2) - A / / V(a ,a (.))f(x |a )g(x |a (.))dx dx . 1 2 1 1 2 2 2 1 (A4.2) can be rewritten as / [xj_ + { / [x - X V(a ,a (0)]g(x |a («))dx }]f(x |a )dx . 1 2 2 2 2 2 1 1 1 (A4.3) For each fixed xj_, maximizing the expression inside the braces with respect to a 2 w i l l maximize (A4.3) with respect to a . 2 Since the expression depends on x^ only through a , a ( •) i s the same for almost every x^. 2 2 That i s , a 2 134 does not depend on x^. A s i m i l a r a n a l y s i s can be done f o r the case where the p r i n c i p a l i s r i s k averse and the agent i s risk neutral. Finally, If both i n d i v i d u a l s a r e r i s k a v e r s e , then the f u n c t i o n t o be maximized i s / [ / W(x-s(x))g(x |a (-))dx ]f(x |a )dx 2 2 2 1 1 + X [ / { / U ( s ( x ) ) g ( x | a ( »))dx 2 In t h i s case, a Maximizing 2 1 - V(a 2 1 > a (0)}f(x |a )dx ]. 2 1 1 1 ( 0 w i l l g e n e r a l l y depend on x^. 2 (A4.3) w i t h r e s p e c t t o a^ r e s u l t s i n the c o n d i t i o n t h a t aE(x |a' )/aa 1 Maximizing 1 = X . (A4.3) w i t h r e s p e c t to a 2 (which i s independent of x^) r e s u l t s i n the c o n d i t i o n t h a t 3E(x |a )/aa 2 2 2 = X 3V(-)/3a . 2 Proof of P r o p o s i t i o n 4.1.1. Under the g i v e n assumptions, a ( •) w i l l 2 depend on xj . denote the mean o f x^ g i v e n a^, and l e t M^x^ t i o n a l mean of x 2 w i t h r e s p e c t to g( • ) . J J ( x j + x )<j>( • ) d x d x 2 = = 2 1 - X / / V( , a ( •)) denote the c o n d i 2 The f u n c t i o n to be maximized i s a i , a ( •))•( 2 / X j f U j l a ^ d x j + / M^ •)f(x |a )dx 1 1 1 Odxjdxj - X / V ( - ) f (Xj |aj )dxj M j C a j ) + E M ( •) - X EjVC • ) , 1 2 where E^ r e p r e s e n t s e x p e c t a t i o n with r e s p e c t to f ( • ) . d i t i o n s with respect to e f f o r t 3M (a )/3a 1 Let M ^ a j ) 1 1 + B E ^ C O ^ The f i r s t order con- a r e then = X ffi^C and 3M (»)/3a 2 2 = X 3V(0/aa 2 (A4.5) 135 for almost every and for a^ = ai£. The sign of a^'Cx^) can be determined by taking the derivative of (A4.5) with respect to x^. Let the second and t h i r d subscripts of j on M 2 denote p a r t i a l d i f f e r e n t i a t i o n of M respect to the j - t h argument of M2(xpa^,a2( •)) • 2 with Taking the derivative of (A4.5) with respect to x^ results i n M 233 * ' a 2 + M 231 = [3 V(0/3a ]a « X 2 2 2 or a*'( ) = -M Xl /[M 33 - X[3 V(0/3a2]] 2 231 2 Q.E.D. Second Best Let <(<x ,x |a ,a ) = f ( x ^ | a ) g ( x | x , a , a ( •)) . 1 2 1 1 2 2 x x 2 The agent's expected u t i l i t y i s / [ / U ( s ( x , x ) ) g ( x | x , a , a ( *))dx 1 2 2 1 1 2 ,a < •) ) ] f (x | ) d x . 3 2 2 1 1 1 The Hamiltonian i s H = / / ( x - s C x ^ x ^ H C Odx + X { / [ / U(s(x ,x ))g(x |x ,a ,a («))dx 1 2 2 1 - V(a ,a (-))]f(x |a )dx 1 2 1 + 1^1 / / U ( s ( . ) ) [ g (a) f + gf a { / U(s(0)g + / y (x ) 2 1 1 1 2 2 - u} 1 J d x ^ - / (V <Odx 2 - V f + Vf fli )dx } 1 (•)}f(x |a )dx . 1 1 1 D i f f e r e n t i a t i n g H pointwise with respect to s( •) y i e l d s - <J> + X U'<f> + UjU*^ + p (x )U'g 2 1 a f = 6 for almost every ( x , x ) . 1 2 That i s , 1 u'C(x l t •a , . x )) = X y + 2 where the subscript a each fixed x^. 2 i — 1 . + W , x ^a 2 T' represents d i f f e r e n t i a t i o n with respect to a for 2 136 A t a - a * , | - - // (b) h *rr I / / < < - » f c u + ' + (c) W (-)dx (x-sCxj.Xg))* f s "al^ ^ ' [ * U ( S ( + 8 f ai a i a ) ) 8 i d x ( 0 d x 2 d x i - / <\ + V f > il d x a i 2 " V ( - ) ] f ( x | a ) d x } = 0. a 2 2 f 1 1 f + V 2 f 1 1 At a = a*, and for every fixed x^, -g- = / (x-s(x ,x ))<j) (.)dx 1 2 a2 2 f + g f ]dx + M, { / U(s( - ) ) [ g 1 a a 2 l • a a L ^(x^f + a a 2 /U(8(0)g (Odx - V 2 - (V l 2 a a (OjfCxJa^ Clearly, the strategy a*,(») depends on x^ i n general. = )} 0. However, i f the agent i s r i s k neutral, then the f i r s t best solution can be obtained (see Shavell (1979)). Proof of Proposition 4.2.1: (1981, pp. 104-105).) (Generalization of the derivation by Lambert Since f( •) and g( •) are i n Q, they can be written as f(x |a) = exp[z (a)x 1 - B ^ z ^ a ) ) ] h ( x ) and g(x |a) = e x p [ z ( a ) x 2 - B (z (a))]h (x ). 1 1 2 2 1 2 2 1 2 2 Recall that E(x |a) - B ^ z ^ a ) ) , Var(x |a) = B^'(z ( a ) ) , and ± 1 f / f = zj^(a)(x - M ( a ) ) , where Mj_ = E ( x | a ) . 1 C i 1 1 = y z ^ ( a | ) ( x - M^(a*)) , where u (X + i c l i + c ) . 2 2 1 2 Let denotes (^(x^). Then s(x) = Some h e l p f u l quantities w i l l f i r s t be calculated. (1) / / (x + x ) f ( x | a ) g ( x | a ( x ) ) d x d x 1 2 1 1 2 = / XjfCxj l a ^ d X j 2 1 1 2 + / [ / x g ( x | a ( x ) ) d x ] f ( x |a )dx 2 2 2 1 2 1 1 1 = MjCaj) + / M ( a ( x ) ) f ( x | a ) d x . 2 (2) 2 1 1 1 1 / / sCxj,x )f(xj|a )g(x |a (x ))dx dx 2 1 2 2 1 1 2 = / / (D + F + G ) f ( x | a ) g ( x | a ( •))dx dx , 1 1 2 2 1 where D = ( X + C ^ ) , F = 2 ( X + C ^ C ^ and G = C . 2 E(D) = / / [ X 2 2 + X W J Z J U J K X J - M J U * ) ) 2 2 1 = X 2 + 2 X Xl 1 + M ^) 2 since E(x-a*) X + 2 1 2 2 1 2 - Mj(a*)) U J Z J U J X M J U J ) + ^zJ (a*)tVar(Xl lap = 2 u^zJ (a*)(x -M (a*)) ]f( |a >g(x |a <•))dx dx + E(D) 2 - a t ^ a p M j U j ) + M (a*)] , 2 2 2 2 = Var x + (Ex) - 2a*Ex + a* . uJz^CajOVarCxJa*) 2 E(F) = 2 X / / u (x )z^(a*)[x -M (a*(«))]f(x | )g(x |a ( 2 1 2 2 1 ai 2 •))dx dx 2 1 2 + 2 Uj, / / z ' ( a J ) ( x - M ( a { ) ) u ( x ) z ^ ( a § ( 0 ) ( x ^ M ( a § ( « ) ) ) f ( 0 g ( 0 d x d x 1 1 2 1 2 = 2 X / u (x )z^(a*)[M (a (x )) 2 1 2 2 2 1 2 (a*(x ))]f(x |a )dXj 1 x x y + 2u z^(a*) / z ^ ( a * ( ' ) ) P ( x ) ( x - M ( a * ) ) ( M ( a ( 0 ) " M (a$( 0 ) ) f ( OdXj, 1 2 E(F)| 1 1 1 2 2 2 = 0. Is* E(G) = / / u ( x ) z ( a * ( - ) ) ( x - M ( a * ( - ) ) ) f ( ' ) g ( * ) d x d x 2 2 1 2 2 2 2 1 = / z ( a * ( - ) ) u ( x ) [ V a r ( x | a ( - ) ) +M (a (-)) 2 2 2 2 1 2 2 2 2 2M (a*(0)M (a (0) - + M ( a * ( •)) 1 f ( O d X j . 2 2 2 2 2 /' z ^'( a * ( • ") ) ^ ( x ) V a r ( x | a * ( • ) ) f ( x | a * ) d x = E(G) 2 , Z 1 2 1 1 Therefore, - (2) - ' + u^zj 2 2 2 (a*)Var( |a*) X l * + (3) / z^ (a*( •))^(x )Var(x |a*( •))f(x | * ) d 2 1 2 1 / / 2/s(x) f ( x | a ) g ( x | a ( » ) ) d x d x 1 1 2 2 1 1 • f ( ')g( • ) d x d x 1 = 2X+ 1 - 2 1 - / V ( a j , a ( 0 ) f (xj |aj )dxj 2 2 / V(a a (»))f(x |a )dx 1 > 2y z{(a*)(M (a ) 1 2 1 - (3) - / V( 2X- (4) a i 2 2 2 ,a ( •))f(x |a )dx . / V( 2 a i 1 1 1 Z. ,a ( •))f(x |a )dx = 2 1 1 1 = 2 y z | ( a * ) M j ( a ) + 2 / ^ ( x j ) z ' ( a * ( •)) [ M ( a ( • ) ) 1 - M (a|( 0 ) ] f 2 (4) 1 la* (5) 1 - ML, (a*( •) ) ] f ( O d X j , 1 1 Mj(a*)) 1 + 2 / M (x )z (a*(0)[M (a (.)) 2 X l ( x j ) z ^ ( a * ( • ) ) ( x ^ (a*( •)) ) ] = 2 / / [X+y zj(a*)(x ^ (a*)) + 1 a dxj - / V a = 2w *(a*)M^(a*) lZ f(x |a )dx a 1 Jv 1 x - / V( - ) f & &2 ^p- = |a )dx . 1 1 (x |a*)dx 1 1 1 2H (x )z'(a*(0)M^(a (0)f(x |a ) 2 2 1 1 1 ( • ) f ( x . | a ) = 0, which i m p l i e s t h a t 1 V W Clearly, - / V(-)f ^ ( x j 1 fCxJa-pdXj & 2 1 Fix x . - V 2 1 U^Xj) > 0 if V " & (a*) 2 '(a*(x ))M^(a*(x ))* Z ( •) > 0. 1 1 This e s t a b l i s h e s r e s u l t (i)(a), 139 After i s r e a l i z e d , the agent's expected u t i l i t y g i v e n x j and a^(x^) is 2 / Tsjx) g ( x | a * ( ) ) d x 2 X l - V(a*,a*(x )) 2 1 UjzJCajXxj-MjCa*)) + j^Cxj ) z ' ( a * ( X j ))(x ^M (a*(x ))] = 2 J [\+ 2 • g(x | Odx 2 2 1 - V(a*,a*( )) 2 X l = 2 [ A + UjzJCaJXxj-MjCa*))] - V(a* . a * ^ ) ) . D i f f e r e n t i a t i n g w i t h r e s p e c t t o x^ y i e l d s 2 The Vl< l> agent's s(x)) V a a V - ' ( 2 expected ) a 2 utility * f o r the second stage p e c u n i a r y r e t u r n ( i . e . , i s an i n c r e a s i n g f u n c t i o n o f x^ (assuming ( •) > 0, a s u f f i c i e n t a , ( X l ) p^ > 0 ) . c o n d i t i o n f o r the agent's Assuming t h a t expected second stage n e t 2 utility xj. t o be i n c r e a s i n g i n x^ i s t h a t a^(x^) be a d e c r e a s i n g f u n c t i o n o f This e s t a b l i s h e s r e s u l t s Now f i x x j and l e t a f-=M'(a )f denote a ( x ^ ) , and f denote 2 2 1 2 2u zJ(a*)z^(a*)p (x )(x -M (a*))M^(a )f 1 - z 2 2 2 (a*)p^( - X l 1 1 1 2 ) [ B " ( z ( a ) ) z ^ ( a ) + 2M (a )M (a ) 2 2 2 2 2 2 2 2 2 2 P [2u (x )z (a*)M'(a )f + u (x )[2p (x )z^(a*)M^'(a )f - V 1 2 2 2M (a*)M (a )]f + 3H f(xjja^). 2 -2Xu (x )z'(a*)M'(a )f 2 - ( i ) ( b ) and ( i i ) ( a ) . 2 1 1 2 2 2 - V a i 1 ^ 2 f- V^COf^l (Of]. - 2y (x )[M (a*)Xz (a*) + - H J U * ) 2 1 2 2 u l a a l a a V f 2 1 a i a 2 / f l 140 + (Note y ^ ( x ) [ 2 z ( a * ) M ' ( a * ) - z ( a * ) B ' " ( z ( a * ) ) ] = 0. , 1 3 2 that f a 2 /f 2 - z J ( a * ) ( x - M ( a * ) ) = 0.) 1 1 l the e x p r e s s i o n f o r v^ix^) from (5) above and l e t t i n g Substituting subscripts j on V r e p r e s e n t p a r t i a l d i f f e r e n t i a t i o n w i t h r e s p e c t t o the j - t h e f f o r t variable yields V v 2 22 V V z! (x, -M. ) - y , V 112 ^ V i ^ l 1 0 0 M 2 " W 2 - 2 ^ " V V ' „ 2 + 2z^M 2 4M Z 2 Z : B : * ' f ,2 = 0. D i f f e r e n t i a t i n g w i t h r e s p e c t to x^, fi a' M 2 a XV a 2 2 2 2 2 "l 122 2 [ V a a V + V r 22 2 a 2 ( z ^ 2 z (z-M 2 (z*Mp M + z M ') 2 2 2 a' 2V V M'' + V p[*' ' 2 2 22 2 2 2 + 4 ( . ) z'M' 2 2 V zJ] 2 ) 2 Z 2 ) M 2 2 V 1 Z R e c a l l that M ( a ) = a ^ 1 2 2 2 ( x a' 2V V z'B ' f 2 , 2 22 2 2 4 2 1 2 V + z^2M M ') 2 2 2 V V 2 222 22 2 l r l V M *(z^'M 2 V z M 1 2 2 + 2 z''B' ' 2 2 2 ,2 M, 1 z H z + so t h a t i 2 2 V z'B'' ' ' 2 2 2 ,2 M, V z'B'''2M'' 2 2 2 2 -] = 0. ,3 M 2 B v = 1 and M|' = 0 . Thus, the e x p r e s s i o n above reduces t o £ ., a 2 < X . l> = " "l 2 i V Z ., / D ' W h e r e /, D = (X + U l — ) a U „ V 2 2 + . V 2 22 — + V V 2 222 .. 2 22 2 —j- 141 V V 2 + 1 122 V 2 2 z B "' 2 V 2 2 2 (z 'B " + z 2 2 2 2 B "') 2 + Recall that i t i s assumed that V 2 > 0, V 2 2 > 0, V i s e a s i l y checked that for the exponential, i n 0, z' > 0, z " < 0, B , , , 2 2 2 > 0. It 1 2 2 gamma, and Poisson d i s t r i b u t i o n s » > 0, and z'^'** + z B ' * " > 0. These facts, , 2 plus the f i r s t order condition requiring that X + y. f antee that the denominator of a (x^) i s p o s i t i v e . i s the same as the sign of y^. /f be p o s i t i v e , guar- The sign of the numerator 2 function of x^. > 0, and V Hence, i f y^ > 0, then a*,( •) i s a decreasing This establishes r e s u l t (ii)(b). Q.E.D. Proof of Corollary 4.4.1: V* 2 2 > 0, and V * 22 > 0. It i s necessary to show that V* > 0, V* > 0, 2 The derivatives of M 1 w i l l f i r s t be calculated. Dropping subscripts for convenience, 1) M (M(a)) = a implies that M '(M(a))M'(a) = 1. -1 _1 Therefore, M '(M(a)) = 1/M'(a) > 0. _1 2) M "(M(a))M'(a) = - M " ( a ) / ( M ' ( a ) ) . _1 2 M "(M(a)) = - M"(a)/(M'(a)) _1 3) -1'" M" 'M' M (M(a))M'(a) = - [- 3 1 ™. * Therefore, M \\ 3M (M(a)) = Therefore, > 0. 3 2 - M''3M' M ' ^ ]. M' 1 -M = M , , 2 , , , M' ,:> Let subscripts j on V* denote p a r t i a l d i f f e r e n t i a t i o n with respect to ej. Then V* = V'M^' = V'/M'(a ) > 0, 2 142 V* 22 V"(M -1' 2 -1'' ) + V'M^ 1 V V V* 2 2 2 -1* = i ( a 2 -1' 2 -1'' [V'"(M y +V"M ] i l 2 M ^ l } [ v 2 ,,, (M (a )) 2 V"'(M ) 2 V"M '(a ) —] (M (a )) 2 2 2 2 2 + V"2M M 2 > 0, and 2 L 2 +V"M 2 (M'(a )) 2 2 3 2 (M (a )) 2 4 2 - M^'*Mp > 0 a t a*, then V * M 2 2 M 2 3 V " M " ( a ) ^ V'(3M'' Thus, I f ( 3 M 1 ' 2 (M^(a ))- 2 1 V* 222 V'M '(a ) 1 j (M^(a ))' Z 2 2 + V'M 2 2 - M "M ) 2 2 ,5 > 0, as r e q u i r e d . Q.E.D.