A Solution Concept for Majority Rule in Dynamic
Settings∗
B. Douglas Bernheim
Department of Economics
Stanford University
and
National Bureau of Economic Research
[email protected]
Sita Nataraj Slavov
Department of Economics
Occidental College
[email protected]
May 18, 2007
Abstract
We define and explore the notion of a Dynamic Condorcet Winner (DCW),
which extends the notion of a Condorcet winner to dynamic settings. We show
that, for every DCW, every member of a large class of dynamic majoritarian
games has an equivalent equilibrium, and that other equilibria are not similarly
portable across this class of games. Existence of DCWs is guaranteed when
members of the community are sufficiently patient. We characterize sustainable and unsustainable outcomes, study the effects of changes in the discount
factor, investigate efficiency properties, and explore the potential for achieving
renegotiation-proof outcomes. We apply this solution concept to a standard
one-dimensional choice problem wherein agents have single-peaked preferences,
as well as to one involving the division of a fixed aggregate payoff.
∗
We would like to thank Avinash Dixit, Roger Myerson, Antonio Rangel, members of the Public
Economics Group at Stanford University, participants at various seminar presentations, and several
anonymous referees for helpful comments. This paper was prepared in part while Bernheim was
a Fellow at the Center for Advanced Study in the Behavioral Sciences (CASBS), where he was
supported in part by funds from the William and Flora Hewlett Foundation (Grant #2000-5633),
and while Slavov was a graduate student at Stanford University, where she was supported by an
NSF Graduate Research Fellowship. Bernheim gratefully acknowledges financial support from the
NSF (grants SES-0137129 and SES-0452300).
0
1
1
Introduction
A central objective of Political Economy is to understand collective choice with
majoritarian institutions. For such institutions, political economists often attempt
to infer plausible outcomes directly from the majority preference relation. The
best known and most widely used solution concept of this type is the notion of a
Condorcet winner, defined as a policy that majority-defeats all other policies.1
When Condorcet winners exist, they emerge as equilibrium outcomes in a large
and important class of game forms, including models of electoral competition (Downs
[1957]), representative democracy (Besley and Coate [1997]), and agenda-setting
(Ferejohn, Fiorina, McKelvey, [1987], and Bernheim, Rangel, and Rayo [2006]).
Therefore, even when one is not entirely certain of the best way to model the details
of a complex majoritarian institution, it is still often reasonable to assume that
the institution can implement Condorcet winners.2
Unfortunately, the existence
of Condorcet winners is frequently problematic. The concept suffers from a curse
of dimensionality: with a sufficiently rich policy space, it is virtually certain that a
Condorcet winner does not exist.3
In addition, the notion of a Condorcet winner is not an appropriate solution
concept for dynamic collective choice problems.
This is an important limitation
because dynamic issues are central to numerous public policies issues, such as the
design of social security, the use of public debt, the configuration of an optimal
tax system, the management of monetary policy, and the provision of incentives for
investment and R&D. In recent years, the performance of majoritarian institutions
in these dynamic settings has attracted growing interest and attention.4
The purpose of the current paper is to propose and develop a solution concept
1
Other examples include the Condorcet set (Miller [1977]), defined as the smallest set such that
no element of the set is majority defeated by an element outside the set, and the uncovered set
(Miller [1980], McKelvey [1986]), which consists of policies x such that the set of policies majority
defeated by x is not strictly contained in the set of policies majority defeated by any other policy
y.
2
McKelvey [1986] makes a similar observation concerning the uncovered set. There is, of course,
no guarantee that Condorcet winners are the most plausible equilibrium outcome for any particular
majoritarian game.
3
See, e.g., Plott [1967], Rubinstein [1979], Schofield [1983], Cox [1984], Le Breton [1987], Banks
[1995], and Saari [1997].
4
Pertinent studies include Baron [1996], Barron and Herron [1999], Krusell and Rios-Rull [1999,
1996], Epple and Riordan [1987], Krusell, Quadrini and Rios-Rull [1997], Glomm and Ravikumar
[1992], Perotti [1993], Persson and Tabellini [1994], Alesina and Rodrik [1994], Cooley and Soares
[1999], Grossman and Helpman [1998], Rangel [2003], and Banks and Duggan [2000], among others.
2
that extends the notion of a Condorcet winner to dynamic settings. We summarize
collective choices through a policy program, which specifies the current policy choice
as a function of previous policies, and thereby allows for the possibility that collective
choices are history-dependent. A policy program is a Dynamic Condorcet Winner
(DCW) if, in all periods and for all histories, the prescribed choice for the current
period majority defeats every alternative, in light of the fact that different choices
lead to different prescriptions for future periods. In the current paper, we confine
attention to models involving infinite repetitions of a single-period problem, with
infinitely-lived agents.5 A description of these enrvironments and a formal definition
of the solution concept appear in Section 2, along with a simple motivating example.
Why study DCWs? As mentioned above, the static Condorcet concept is often
justified on the grounds that it identifies outcomes that are equilibria for many
majoritarian game forms. As we explain in Section 3, DCWs are justified on precisely
the same grounds for dynamic versions of the same game forms.
Specifically, for
any DCW, every dynamic majoritarian game within this class has an equivalent
equilibrium.
In contrast, other equilibria do not have the properties that ensure
their portability across institutions.
Thus, dynamic equilibria corresponding to
DCWs are robust with respect to institutional details, while other equilibria are not.
This observation provides an argument for studying DCWs rather than dynamic
games whenever the political institution of interest is difficult to model (perhaps
due to complexity or procedural ambiguities), but one is nevertheless reasonably
confident that it could implement Condorcet winners in a static setting.
As we demonstrate, one can usually restrict attention to DCWs that are completely stationary both on and off the equilibrium path (see Section 4.1).
This
greatly simplifies the task of characterizing DCWs, particularly compared with subgame perfect equilibria for repeated games.6
When a Condorcet winner exists for the single-period problem, the policy program that assigns this outcome irrespective of history is a DCW. When agents are
sufficiently patient, the community can also achieve any outcome that is part of a
Condorcet cycle.
Consequently, in contrast to Condorcet winners, DCWs always
exist when agents are patient. This point, which we develop in Section 4.2., may at
first seem surprising: policy programs are extremely high-dimensional objects, and
5
The concept is easily generalized and extended to other settings such as those with overlapping
generations of agents (e.g., Slavov [2006a]).
6
As shown by Abreu [1986], optimal punishments for repeated games are often non-stationary.
3
Condorcet winners tend not to exist in high dimensional spaces (recall the references
in footnote 4). The key point with respect to existence is that history dependence
can break the curse of dimensionality.
In many familiar environments, every outcome is part of a Condorcet cycle (McKelvey [1976, 1979], Schofield [1978]). In combination with the aforementioned result,
this observation provides us with a “folk theorem.” We discuss the appropriate interpretation of this result in Section 4.2, and argue that the DCW concept remains
useful despite the potential indeterminacy.7
In particular, we identify conditions
under which certain outcomes are unsustainable even when group members are extremely patient (Section 4.3), exhibit assumptions that give rise to those conditions
in the context of the median voter model, and demonstrate generally that the set of
DCWs shrinks as the group members become more impatient (Sections 4.3 and 4.4).
The latter result allows us to associate each DCW with a degree of sustainability,
and to identify the most sustainable DCWs.
The efficiency properties of DCWs are also notable (Section 4.5). The efficient
frontier of the DCW payoff set always lies on the efficient frontier of the feasible
payoff set. Moreover, we show that it is always possible to sustain these efficient
outcomes through DCWs that never depart from the efficient frontier of the feasible
payoff set for any history. The strategic equilibria corresponding to these universally efficient DCWs satisfy a very demanding standard of renegotiation-proofness.
Universal efficiency therefore emerges as a natural refinement criterion — one that
can potentially resolve the indeterminacy associated with our folk theorem.8 For
example, in the case of the repeated median voter problem, we demonstrate with
considerable generality that the repeated static solution is the unique universally
efficient outcome, even with extremely patient group members.
In Section 5, we apply our concept to a problem involving pure distribution (division a fixed payoff). In addition to characterizing the DCW set, we draw implications
for distributional policy.
No alternative is more sustainable than the egalitarian
outcome, and movements toward egalitarianism always (weakly) enhance sustainability. Moreover, penalizing one party requires a measure of equality between the
7
Notably, a folk theorem does not emerge in collective choice problems with overlapping generations, even if agents apply no discounting within their own lifespans. As Slavov [2006a] demonstrates, DCWs yield useful predictions in such contexts.
8
In contrast, for repeated games, it is difficult to assure the existence of equilibria satisfying
reasonable notions of renegotiation-proofness (Bernheim and Ray [1989]; Farrell and Maskin [1989]).
4
other two.
Consequently, political feasibility places upper limits on wealth, but
does not place lower limits on poverty.
This paper is perhaps most closely related to work by Krusell and Rı́os-Rull
[1999], who proposed another dynamic generalization of Condorcet winners, known
as a “recursive political equilibrium.”9 Their concept allows continuation paths to
depend on a structural state variable, but not on any other aspect of history.10 In
the absence of a structural state variable (that is, for the set of environments considered in this paper), the concept effectively treats each period as a separate problem,
selecting a static Condorcet winner when one exists. Thus, for the environments
considered in this paper, recursive political equilibria fail to exist when there is no
Condorcet winner for the static problem.
2
The Problem and the Solution Concept
2.1
The Environment
A finite set of infinitely-lived agents, {1, 2, ..., N } with N ≥ 3, constitutes a commu-
nity. Time unfolds in a sequence of discrete periods, indexed t = 0, 1, 2, .... In each
period t, the community must select some policy x from the set X. Single period
payoffs are given by the mapping η : X → RN , where ηi (x) denotes the payoff to
agent i from policy x.
Let U denote the image of X under η. Without loss of generality, we can think of
the community as selecting u from the set U ; henceforth, we therefore refer to each
u as a policy, and usually (but not always) suppress references to X. We typically
assume that U is compact, convex, or both. These assumptions are satisfied, for
example, if X corresponds to lotteries over some finite set of deterministic policies.
Let M denote the smallest integer not less than
and when N is odd, M =
RN ,
9
such that
w0 Rw00
N +1
2 .
N
2;
when N is even, M =
N
2,
We define a binary relation R on elements of
iff #{i ∈ 1, ..., N | wi0 ≥ wi00 } ≥ M (where # denotes the
Our paper is also related, though somewhat less closely, to work by Roberts [2006]. Roberts
assumes that, each period, the status quo is compared to a randomly chosen alternative based on
majority rule; the winner becomes the status quo in the following period. With limited history
dependence (voters’ choices are conditioned on the current status quo), a number of outcomes can
be sustained. There are also rough parallels to our concept in the literature on time consistency
in dynamic cooperative games (see e.g. Petrosjan and Zenkevich [1996], or Filar and Petrosjan
[2000]).
10
Thus, the relation between our concept and theirs is analogous to the relation between subgame
perfect equilibria and Markov perfect equilibria.
5
cardinality of a set).
Viewing w0 and w00 as vectors of scalar payoffs, this is the
conventional majority preference relation.
relation P defined as follows:
w0 P w00
iff
Corresponding to R, there is a strict
w0 Rw00
and ∼ [w00 Rw0 ].
A policy sequence (u0 , u1 , ...) is feasible iff ut ∈ U for all t.
each individual discounts future payoffs at the rate δ.
We assume that
For any feasible policy
sequences γ and ρ, we say that γ is weakly majority preferred to ρ (alternatively,
£P∞ t t ¤ £P∞ t t ¤
(keep in mind that the superscript on δ is an
γRδ ρ) iff
t=0 δ ρ
t=0 δ γ R
exponent, while the superscripts on γ and ρ are indexes). For the case of δ = 1,
we use the overtaking criterion: γR1 ρ iff there exists some finite T such that, for
¤
¤ £Pτ
£Pτ
δ
δ
t
t
all τ ≥ T , we have
t=0 ρ . If γR ρ and ∼ [ρR γ], we say that γ is
t=0 γ R
strictly majority preferred to ρ (alternatively, γP δ ρ). In most instances, we omit
the modifier “weakly,” stating “strictly” where intended.
2.2
Dynamic Condorcet Winners
In dynamic models, it is natural to think that today’s outcomes may affect outcomes
in the future. This is the case even when there are no structural links across periods
(by way of analogy, consider equilibria based on history dependent strategies in
repeated games). We use ht = (u0 , u1 , ..., ut−1 ) to denote the history of policies
adopted prior to period t (a t-history).
Let Ht denote the set of all feasible t-
(u0 , u1 , ..., ut−1 )
such that us ∈ U for all s ∈ {0, 1, ..., t − 1}).
S
A dynamic policy program is a mapping µ : ∞
For every period t
t=0 Ht → U .
histories (that is,
and every history of policies adopted prior to period t, µ identifies the next policy
to be implemented.
By considering policy programs rather than simply infinite
sequences of policies, we allow for the possibility that the community’s decisions
may be history-dependent.
For any policy program µ, we define the function C µ by
C µ (h) = (µ(h), µ(h, µ(h)), ...) for all h ∈
∞
[
Ht
t=0
For any t-history ht , C µ (ht ) describes the continuation policy sequence generated
by applying the policy program µ.
We are now prepared to define our solution concept for the dynamic collective
choice problem.
Definition: A Dynamic Condorcet Winner (DCW) is a policy program µ such that
C µ (ht )Rδ (u, C µ (ht , u)) for all t = 0, 1, ..., ht ∈ Ht , and u ∈ U .
6
In words, µ is a DCW if and only if, for all feasible t-histories, the prescribed
policy for the current period, µ(ht ), is a Condorcet winner, in the sense that it
majority-defeats all other policies u ∈ U . The important feature of this definition
is that, when comparing the prescribed policy for the current period to any other
possible policy, each agent considers the fact that the current outcome affects the
continuation path.
In subsequent sections, we illustrate the implications of our general results by
returning repeatedly to a familiar application, commonly known as the “median
voter” model, wherein the policy choice is one-dimensional (without loss of generality, X = [0, 1]), each community member’s payoff function, ηi (x), is continuous and
strictly concave (hence “single-peaked”), and N is odd. One can think of x ∈ X
as denoting the level or characteristic of some public good.
We use yi to denote
the optimal level of x from the perspective of agent i. Without loss of generality,
we label the agents so that yi is weakly increasing in i. Agent M is the “median
voter.” If the community makes this choice only once, yM is the unique Condorcet
winner.
2.3
Example
The following example illustrates the concept of a DCW and previews some of our
results.
A community consisting of three agents, indexed i = 1, 2, 3, repeatedly
confronts the problem of selecting some policy x from the set {A, B, C, D, E}. In
any given period, the payoffs to the agents from each of the policies are as follows:
A : (2, 3, 4)
B : (3, 4, 2)
C : (4, 2, 3)
D : (1, 1, 5)
E : (0, 0, 0)
Here the i-th element in each vector is the payoff to individual i. Focusing on any
single period in isolation, we see that A is majority preferred to C, C is majority
preferred to B, and B is majority preferred to A.
Moreover, A, B, and C are
each majority preferred to D and E, and D is majority preferred to E. There is
no Condorcet winner; i.e., there is no policy that is majority preferred to all other
7
policies in pairwise comparison. A, B, and C form a Condorcet cycle, and D and
E lie “below” the cycle.
Members of this community should not, however, focus on any single period in
isolation. Because social outcomes are in principle history-dependent, the majority
preference relation depends on individuals’ expectations concerning continuation
paths.
Consider the following policy program: µ (h0 ) = A, µ (ht , µ (ht )) = µ (ht ) and
for any x 6= µ (ht ) ,
C if µ (ht ) = A
B if µ (ht ) = C .
µ (ht , x) =
A if µ (ht ) = B
Notice that this policy program yields A in every period (that is, C µ (h0 ) = (A, A, ...)).
As long as the individuals are sufficiently patient (δ ≥ 21 ), µ is a DCW. First
consider any history ht for which the policy program prescribes A in the next period
(that is, µ(ht ) = A). For any such history, individuals will expect the following: (1)
the community will select A in every subsequent period (that is, C µ (ht ) = (A, A, ...));
(2) if the community selects some other policy x 6= A in any period, then it will select
C in every subsequent period (that is, C µ (A, .., A, x) = (C, C, ...)). Comparing the
two paths (A, A, ...) and (x, C, C, ...), it is apparent that a majority (individuals 2
and 3) prefers the first to the second as long as individuals are sufficiently patient
(δ ≥ 21 ).11
Now consider any history ht for which the policy program prescribes C in the
next period (that is, µ(ht ) = C). For any such history, individuals will expect the
following: (1) the community will select C in this and all subsequent periods (that
is, C µ (ht ) = (C, C, ...)); (2) if the community selects some other policy x 6= C in the
current period, then the community will select B in every subsequent period (that
is, C µ (ht , x) = (B, B, ...)). Again, if individuals are sufficiently patient (δ ≥ 21 ), a
majority (individuals 1 and 2) prefers (C, C, ...) to (x, B, B, ...).
Finally, consider any history ht for which the policy program prescribes B in the
next period (that is, µ(ht ) = B). For any such history, individuals will expect the
following: (1) the community will select B in this and all subsequent periods, and
(2) if the community selects some other policy x 6= B in the current period, then it
11
In particular, if all individuals discount future payoffs at the rate δ, then for individual 2, the
3
2δ
first path yields a payoff of 1−δ
, while the second yields x2 + 1−δ
(where x2 is the second element in
x). Individual 2 prefers (A, A, ...) to (x, C, C, ...) for any x provided δ ≥ 21 . A similar calculation
shows that individual 3 also prefers (A, A, ...) to (x, C, C, ...) for any x provided that δ ≥ 21 .
8
will select A in every subsequent period (that is, C µ (ht , x) = (A, A, ...)). Again, if
individuals are sufficiently patient (δ ≥ 12 ), a majority (individuals 1 and 3) prefers
(B, B, ...) to (x, A, A, ...).
Putting all of these pieces together, we see that µ is indeed a DCW. The choice
of A in every period is majority preferred to all alternatives when a deviation from
this path results in a permanent shift to C; the choice of C in every period is
majority preferred to all alternatives when a deviation from this path results in a
permanent shift to B; and the choice of B in every period is majority preferred
to all alternatives when a deviation from this path results in a permanent shift to
A. Moreover, if we were to change the initial choice, µ(h0 ), to either B or C, the
policy program would remain a DCW, and it would yield either B or C in every
period. Ironically, the majority preference cycle that undermines the existence of
a Condorcet winner in the static collective choice problem is actually instrumental
in constructing a political equilibrium in the dynamic setting. This observation is
generalized subsequently in Theorem 3.
Although it is possible for the community to select A, B, or C, the community
cannot select either D nor E in every period.12
For the purposes of the current
illustration, we confine attention to stationary paths. However, our formal analysis
also allows for non-stationary outcomes (see in particular Theorem 1). First consider a path along which the community selects E in every every period. Note that
any other continuation path (x, y, y, ...) is majority preferred to (E, E, ...). Consequently, regardless of how the continuation path is chosen, a majority would favor
the choice x 6= E to E in the current period.
Now consider a path along which
the community selects D in every period. Note that any other continuation path
(x, y, y, ...) with x ∈ {A, B, C} is majority preferred to (D, D, ...) unless y = E.
But we have already established that (E, E, ...) is not sustainable. Hence (D, D, ...)
cannot emerge either. This observation is generalized subsequently in Theorem 4.
3
Why Study DCWs?
Dynamic Condorcet Winners, like their static counterparts, are non-game-theoretic
solutions. They do not include any notion of individual strategies or deviations.
12
In contrast, if individuals are sufficiently patient, paths along which the community selects
either D or E in every period are subgame-perfect equilibrium outcomes of a strategic model with
two-party Downsian competition in each period (even when all members always vote as if they are
pivotal).
9
There is also no natural analog of “Nash reversion” (constructing punishments based
on repetitions of a static Nash equilibrium). Attempting to sustain an equilibrium
through “Condorcet reversion” (constructing punishments based on repetitions of a
static Condorcet winner) is pointless, as a majority would always prefer the punishment path to the equilibrium continuation path.
Nevertheless, the main value of the DCW concept lies in its relationship to
game theoretic solutions.
The static Condorcet concept is often justified on the
grounds that it identifies outcomes that are equilibria for many majoritarian game
forms. As we explain in this section, DCWs are justified on precisely the same
grounds for dynamic versions of the same game forms. Specifically, for any DCW,
every member of this large class of dynamic political institutions has an equivalent
equilibrium.
Naturally, like many static institutions that implement Condorcet
winners, a dynamic institution within the class of interest may have other equilibria
that do not correspond to DCWs. However, none of those other equilibria have the
properties that ensure portability across the entire class institutions. Thus, dynamic
equilibria corresponding to DCWs are robust with respect to institutional details,
while other equilibria are not. These observations provide an argument for studying
DCWs rather than dynamic games whenever the political institution of interest is
difficult to model (perhaps due to complexity or procedural ambiguities), but one
is nevertheless reasonably confident that it could implement Condorcet winners in
a static setting.
3.1
Downsian Competition
For concreteness, we begin by illustrating the relationship between the DCW concept
and repeated Downsian competition with two political parties. However, none of
our conclusions are specific to the Downsian model; they apply equally to any static
institution for which the equilibrium set always contains all Condorcet winners (if
any exist).
In the static Downsian model, there are N citizens and two political parties, L
and R. The game consists of two stages. In the first stage, each party proposes
a policy xL , xR ∈ X. Since strategic considerations potentially inject uncertainty
either endogenously (through the selection of mixed strategies) or exogenously (e.g.,
through random tie-breaking), we take X from the outset to consist of lotteries
over some set of deterministic policies. In the second stage, individuals vote for a
10
party. Thus, a strategy for citizen i is a mapping from pairs of policy choices into
votes. Citizens care only about the policy outcome, and candidates care only about
electoral victory (from which they receive a fix payoff).
In equilibrium, we require as-if-pivotal voting behavior, which guarantees that
citizens cast their votes for their preferred outcomes, and subgame perfection, which
guarantees that they do so for any pair of announced policy platforms. As is well
known, if there is a Condorcet winner x ∈ X, then there is an equilibrium in which
both parties propose x.
If there is no Condorcet winner, then no pure strategy
equilibrium exists; however, there may be mixed strategy equilibria.
Now suppose the Downsian game is repeated infinitely. We assume that parties
as well as citizens are infinitely-lived, and that all players discount future payoffs
at the rate δ.
At any point during the game, players can condition their choices
upon the entire history of play. This history includes, for each period, the policy
outcome, both parties’ proposals, and the identity of the winning party.
Let ht
denote the citizen-payoff history (u1 , ..., ut−1 ) as before, and let qt denote all other
historical information.
Consider any DCW, µ.
the repeated game.
We argue that there is an equivalent equilibrium for
¡
¢0
Define D = 1, δ, δ 2 , ... and π (ht ) = C µ (ht ) D. That is,
π µ (ht ) is the vector of discounted payoffs from the continuation path C µ (ht ). For
each history ht , let xµ (ht ) be a policy for which η (xµ (ht )) = µ (ht ). Now construct
strategies for the game as follows:
1. If the history is (ht , qt ), both parties propose xµ (ht ).
2. If the history is (ht , qt ), and if the parties have proposed xL and xR in the
current period, individual i votes for the party, k, that proposed the policy
yielding the largest value of ηi (xk )+δπiµ (ht , η (xk )). If those values are equal,
individual i votes as follows: (i) if neither party has proposed xµ (ht ), or if both
have proposed xµ (ht ), individual i votes for each party with probability 21 ; (ii)
if one and only one party has proposed xµ (ht ), individual i votes for that party.
Let’s verify that these strategies constitute a subgame perfect equilibrium with
as-if pivotal voting.
For the usual reasons, it is sufficient to check deviations
that are limited to a single period.
t.
Suppose the history is (ht , qt ) as of period
Party k expects party j to propose xµ (ht ). Because µ is a DCW, we have
11
C µ (ht ) Rδ (u, C µ (ht , u)) for all u ∈ U . Consequently, for every x 6= xµ (ht ), there
are at least M individuals for whom
ηi (xµ (ht )) + δπiµ (ht , η (xµ (ht ))) ≥ ηi (x) + δπiµ (ht , η (x))
According to the voting strategies, all of those individuals will vote for a party
proposing xµ (ht ) over a party proposing x. Therefore, a deviating party will lose
with certainty in the current period (instead of winning with probability 12 ). Regardless of whether it deviates in the current period, the continuation equilibrium
implies that the party will win with probability
a deviation makes the party worse off.
1
2
in every subsequent period. Thus,
Now consider the voters.
Given the
strategies, if a party wins with a platform x, voter i’s discounted payoff will be
ηi (x) + δπiµ (ht , η (x)).
Thus, voters are behaving optimally, on the assumption
that they are always pivotal.
3.2
Portability
Economists and political theorists study a variety of static collective choice games for
which the equilibrium set always contains all Condorcet winners (if any exist). Aside
from the Downsian game, examples include models of representative democracy
(Besley and Coate [1999]), and of sequential, comprehensive agendas (Ferejohn,
Fiorina, McKelvey, [1987], and Bernheim, Rangel, and Rayo [2006]). Consider the
infinitely repeated version of any member of this class of games. Reasoning as in the
previous section, one can show that, for any DCW, there is an equivalent equilibrium
for the infinitely repeated game.
Therefore, DCWs correspond to equilibria that
are portable across an important class of dynamic collective choice games.
However, for many equilibria of an infinitely repeated collective choice games
belonging to the class described in the previous paragraph, equivalent DCWs do not
exist.
As an illustration, consider a Downsian model with three voters (N = 3).
Assume that the utility possibility set is U = {(u, u, u) | 0 ≤ u ≤ 1}. Note that
the utility possibility set is Condorcet ranked; (1, 1, 1) is the unanimously preferred
outcome.
Yet if δ ≥
1
2,
one can construct subgame-perfect equilibria with as-if-
pivotal voting wherein, on the equilibrium path, both parties always propose the
worst possible outcome, (0, 0, 0).13
13
In fact, Duggan and Fey [2006] show that for δ >
under repeated Downsian competition.
1
,
2
any sequence of policies is sustainable
12
We construct such an equilibrium using the following strategies: if there have
been no deviations, both parties propose (0, 0, 0). If R proposes any other policy
and loses, both parties propose (1, 1, 1) in every subsequent period, regardless of
whether there have been subsequent deviations.
In case of any other deviation,
both parties continue to propose (0, 0, 0) in every period. Individuals vote for the
party proposing the platform they prefer (accounting for the continuation outcome);
if they are indifferent, they vote for L.
Let’s verify that this is a subgame perfect equilibrium with as-if pivotal voting. Since L wins with certainty in every period on the equilibrium path of every
subgame, L has no incentive to deviate. By construction, voters behave optimally
on the assumption that they are always pivotal. Thus, we only need to check R’s
incentives to deviate. We distinguish between two classes of t-histories: those for
which the parties’ strategies prescribe (0, 0, 0) (“type 0 histories”) and those for
which their strategies prescribe (1, 1, 1) (“type 1 histories”). Once again, it is sufficient to check deviations that are limited to a single period. Consider any type 0
history. If R deviates to (u, u, u) with u > 0, each individual receives a discounted
payoff of u if R wins (since both parties revert to (0, 0, 0) in subsequent periods)
and
δ
1−δ
if R loses (since both parties will subsequently propose (1, 1, 1)). Thus, as
long as δ > 12 , no one would vote for R’s alternative proposal u. Since R loses in
every period whether or not it deviates in the current period, it has no incentive to
deviate.
Next consider any type 1 history. If R deviates to (u, u, u) with u < 1,
each individual receives a discounted payoff of u +
δ
1+δ
if R wins and 1 +
δ
1−δ
if
R loses (since both parties will subsequently propose (1, 1, 1) in every subsequent
period regardless of choices in the current period). Thus, no one would vote for R’s
proposal. Since R loses in every period whether or not it deviates in the current
period, it again has no incentive to deviate.
In contrast, no DCW can produce infinite repetitions of the worst outcome,
(0, 0, 0).
Why not?
A collective deviation to any other policy would make a
majority better off regardless of the continuation outcome. In fact, the only outcome
sustainable as a DCW involves infinite repetitions of the best outcome, (1, 1, 1) (this
result is a consequence of Theorem 4, below).
At the outset of this section, we claimed that equilibria with equivalent DCWs
are portable across a large class of important institutions. This portability results
from two properties that other equilibria do not share. First, the continuation pol-
13
icy path for an equilibrium with an equivalent DCW depends only on past policies,
and not on any other aspect of the game’s history, including details of the process
by which those outcomes were selected. A critical feature of the last equilibrium
described above (in which both parties propose the worst possible policy for all voters) is that the continuation policy path depends on aspects of the history other
than past policies (specifically, the platform of the losing party).14 If, in an equilibrium for one institution, the continuation policy path depends on some choice (e.g.,
the platform of a political party) that has no counterpart in a second institution
(e.g., one without political parties), then — unlike DCWs — that equilibrium is not
portable to the second institution.
Second, if an equilibrium has an equivalent DCW, then, accounting for equilibrium continuation payoffs, there is always (for every period and every feasible
history) an unambiguous Condorcet winner in X.
This property is also critical
for portability across institutions: by definition, any institution within the class of
interest is capable of selecting that same Condorcet winner for the same history,
given the same contingent continuation policy paths. For many institutions, there
are also equilibria in which, for some histories, there is no Condorcet winner in X
(accounting for continuation payoffs). However, these equilibria are not generally
portable across institutions.
To see why, consider an institution, an equilibrium,
and a history for which there is no Condorcet winner in X (accounting for continuation payoffs).15 For that history, the institution produces some selection from X.
If, with the same conditional continuation policy paths, another institution would
produce a different selection from X,16 then — unlike DCWs — that equilibrium is
not portable to the second institution.
Thus, DCWs are, in effect, the robust solutions to a large and important class of
dynamic majoritarian games. Consequently, one can also think about the DCW set
as embodying an equilibrium refinement for such games. In some specific instances,
14
The same statement holds for any equilibrium producing an outcome other than (1, 1, 1). Indeed, in demonstrating that it is possible to sustain any sequence of policies under repeated Downsian competition when δ > 21 , Duggan and Fey [2006] rely on strategies that condition choices on
aspects of the game’s history other than policy outcomes.
15
To avoid conflating this property with the previous one, assume that the equilibrium conditions
the continuation policy path only on past policies.
16
For example, models of legislatures with real-time agenda setting and evolving defaults select
Condorcet winners when they exist (provided a sufficient number of legislators have opportunities
to make proposals), but when no Condorcet winner exists the outcome depends on the order in
which legislators are allowed to make proposals (see Bernheim, Rangel, and Rayo [2006]).
14
this refinement rules out equilibria which seem implausible (e.g., for the repeated
Downsian game, the one in which both parties propose the worst policy for all
voters).
We do not mean to suggest, however, that an equilibrium is necessarily
unreasonable when it corresponds to no DCW. On the contrary, it is entirely possible
that a particular institution might give rise to other plausible equilibria. If one is
certain that one has the correct model of the pertinent institution, it would be
inappropriate to discard such an equilibrium in favor of DCWs.
However, if the
institution of interest is difficult to model (perhaps due to complexity or procedural
ambiguities), but one is nevertheless reasonably confident that it could implement
Condorcet winners (when they exist) in a static setting, then DCWs are attractive
because they correspond to valid equilibria regardless of institutional details.
4
General Properties
4.1
Stationary vs. Non-stationary DCWs
Potentially, a policy program is an extremely complicated object. This complexity
could in principle reduce the value of the DCW concept by making it difficult to
establish general properties, or to apply the concept to particular problems. Fortunately, DCWs turn out to be much simpler to analyze than subgame perfect
equilibria. In particular, it is possible in many cases to restrict attention to DCWs
that satisfy a strong stationarity property:
Definition: A policy program µ is stationary if, for every ht , µ(ht ) = µ(ht , µ(ht )).
To understand this definition, imagine that the community finds itself in period
t with some history of policies ht . The policy program µ prescribes some current
choice u = µ(ht ).
If the community adopts u, the policy program prescribes
µ(ht , u) = µ(ht , µ(ht )) in the next period.
If the policy program is stationary,
then µ(ht , µ(ht )) = µ(ht ) = u. Thus, starting from any ht , the continuation path,
(u, u, u, ...), is stationary.
Henceforth, for δ ∈ [0, 1), let
Vδ∗ = {w | ∃ a DCW for which discounted payoffs are w/(1 − δ)}
and
Wδ∗ = {w | ∃ a stationary DCW for which discounted payoffs are w/(1 − δ)}
15
In other words, Vδ∗ represents the set of normalized discounted payoffs sustainable
through DCWs, and Wδ∗ represents the set of normalized discounted payoffs sustainable through stationary DCWs. The normalization involves multiplying discounted
payoffs by 1 − δ to keep everything on the same scale as single-period payoffs. For
δ = 1, we define Vδ∗ using limiting averages, and Wδ∗ using (constant) per-period
payoffs.17
Stationary DCWs are much simpler than non-stationary DCWs, but stationarity
would appear to be an extremely strong requirement (particularly inasmuch as the
restriction is imposed for all histories, both on and off the “equilibrium path”). Our
first result nevertheless establishes that, as long as U is compact and convex, one can
restrict attention to stationary DCWs without loss of generality — nothing is gained
by considering complex non-stationary DCWs. This observation vastly simplifies
the task of finding DCWs, particularly compared with finding the subgame perfect
equilibria of a dynamic voting institution.
Theorem 1: If δ ∈ [0, 1) and U is compact and convex, then the set of normalized
payoffs associated with DCWs is the same as the set of normalized payoffs
associated with stationary DCWs. That is, Vδ∗ = Wδ∗ .
We regard Theorem 1 as somewhat surprising.
By way of analogy, consider
subgame perfect equilibria for stationary repeated games.
It is well-known that
optimal subgame perfect equilibria are not, in general, stationary (see, for example,
Abreu’s [1986] discussion of “stick and carrot” punishments).
This analogy is
pertinent given some of the formal similarities between these concepts (for example,
the proofs of many of our results use an analog of the self-generation mapping,
which was developed by Abreu, Pearce, and Stachetti [1991] to analyze subgame
perfect equilibria), as well as the relationship between DCWs and strategic equilibria
described in Section 3.
Why then does the theorem hold?
Suppose the normalized payoff vector for
some non-stationary DCW is w, reflecting a current-period payoff vector of u0 6= w
followed by a normalized continuation payoff vector of w0 ∈ U (with w = (1 −
δ)u0 + δw0 ).
For each u ∈ U , we know there is some w00 (u) ∈ U such that wi ≥
(1 − δ)ui + δwi00 (u) for at least M agents.
17
Since U is convex, we also know that
By “limiting
we mean that, for an infinite sequence of payoff vectors u1 , u2 , ..., we
Paverages,”
T
t
1
use limT →∞ T t=1 u . Obviously, if ut = ut+1 = u for all t, then the limiting average is the
per-period payoff vector u, so our treatments of Vδ∗ and Wδ∗ are mutually consistent.
16
w ∈ U.
Consequently, it is possible to construct a DCW that selects w in every
period “on the equilibrium path,” where we follow u 6= w with the continuation
payoff w00 (u), and sustain those continuation payoffs exactly as before. While this
new DCW may not prescribe the same policy in every period “off the equilibrium
path” (e.g. w00 (u) may reflect policy choices that differ from one period to the next),
precisely the same reasoning implies that one can, for any history, substitute a DCW
with the same discounted payoffs and a constant policy. Intuitively, we construct
a stationary DCW by making this substitution sequentially for all histories. This
argument breaks down in strategic settings because, unlike here, the utility vectors
achievable through deviations depend on the starting point (that is, switching from
choices generating u0 to choices generating w alters the subset of alternatives in U
against which the current choice is tested).
Though the convexity requirement in Theorem 1 may appear somewhat demanding, one should bear in mind that it is always satisfied when the community is
permitted to randomize over policies. In our view, there is in general no good reason
to rule out randomizations; if we seek to understand why communities appear to
select deterministic policies in practice, this should be a result and not a restriction.
In any case, the following result shows that it is still useful to examine stationary
DCWs with randomized policies even when randomization is not permitted.
Theorem 2: Suppose X is convex, and ηi is concave for each i. Then the set of
normalized payoffs associated with stationary DCWs allowing for randomization contains the set of normalized payoffs associated with DCWs (including
non-stationary ones) not allowing for randomizations.
The intuition for this result is as follows. Permitting randomizations potentially
enlarges the set of sustainable outcomes by expanding both the set of feasible outcomes and the set of available punishments.
It also potentially shrinks the set
of sustainable outcomes by expanding the set of alternatives against which a policy
program must be tested. However, under the stated conditions, no shrinkage occurs
because each new alternative in this latter set (a randomization over X) is weakly
less tempting that some non-random alternative (the expected value of X).
To illustrate the implications of these results, consider the median voter model.
If the community is permitted to select randomized policies (that is, any Borel
probability measure on X), then U is compact and convex. This means Theorem 1
17
applies, and we can confine attention to stationary DCWs without loss of generality.
If the community is not permitted to select random policies, Theorem 1 does not
apply (though X is convex, strict concavity of ηi implies that the set U = η(X) is
not convex). However, Theorem 2 tells us that we can still say what isn’t in the
set of DCW outcomes by studying stationary DCWs with randomized policies. As
we’ll see in Section 4.3, this observation proves useful.
4.2
Some sustainable outcomes
If a Condorcet winner exists in the static problem, then there is always a DCW that
repeats this solution. Formally, suppose U contains a Condorcet winner wC , and
consider the policy program µ(ht ) = wC for all t, ht . Then, for all ht and u ∈ U,
C µ (ht ) = (wC , wC , ...)R(u, wC , wC , ...) = C µ (u, C P (u, ht )).
In a static setting, a Condorcet winner may fail to exist due to cycles in the
majority preference relation. However, our motivating example (Section 2.3) suggests that it is possible to use a Condorcet cycle as the basis for constructing an
equilibrium in a dynamic setting.
In that example, the policies A, B, and C are
sustainable because the community can use each successive element in the cycle as
a contingent consequence for collective deviations from the previous element.
Our next result formalizes both of these points:
Theorem 3: (i) Suppose that wC is a Condorcet winner in U .
δ ∈ [0, 1], there exists a DCW prescribing
wC
Then, for all
for every history.
(ii) Assume that U is bounded above (that is, there exists some vector u with
u ≤ u for all u ∈ U ). Suppose U contains a finite Condorcet cycle, W =
{w1 , w2 , ..., wK } (i.e., w1 P w2 P...P wK P w1 ). Then there exists δ ∗ < 1 such
that, for δ ≥ δ ∗ , each element of the Condorcet cycle is the normalized payoff
18
vector for some stationary DCW (i.e., W ⊆ Wδ∗ ).18
For δ = 1, one can prove the following result (details available upon request). Suppose there
exists some infinite sequence of payoff vectors, W = {w1 , w2 , ...} ∈ U such that wk P wk+1 for all
k ≥ 1. Then each element of W is the per-period payoff vector for some stationary DCW (i.e.,
W ⊆ W1∗ ). This result has the following corollary: if U is strictly convex, and if each agent has
strictly convex preferences, then with δ = 1 it is possible to sustain an outcome u ∈ U as long as
there is some other u0 ∈ U with uP u0 . In other words, any payoff vector not on the lower frontier
of U is sustainable. As an example, consider the policy space U = {(u, u, ..., u)|u ∈ [0, 1]}. With
δ = 1, it is possible to sustain any outcome u in the open interval (0, 1]. The outcome u = 0 is not
sustainable, however.
18
Part (i) of Theorem 3 implies that, for the median voter model, we know there’s
a DCW that replicates the solution of the static choice problem, selecting yM in
every period, regardless of history. The key question is whether this is the only
DCW outcome. We return to this question at various points below.
Part (ii) of Theorem 3 shows that, with sufficient patience (but some discounting), the community can sustain any outcome that is part of a Condorcet cycle. As
an application, consider the median voter model where randomized policies are permitted. It is relatively easy to construct examples in which preferences are cyclic
on the convex hull of η(X).
In light of Theorem 3, this means it is possible to
sustain outcomes other than the median voter solution as DCWs when agents are
sufficiently patient.
Though cycles in η(X) technically involve lotteries, one can
also construct non-stationary DCWs that do not involve lotteries, but that mimic
lottery payoffs by selecting each policy with an appropriate frequency.
As a corollary of Theorem 3, we have the following general existence result:
there is some δ ∗ < 1 such that a DCW exists for δ > δ ∗ .
The proof of the
corollary is simple. If the majority preference relation is acyclic on a compact set
U , then U contains a maximal element, which is necessarily a Condorcet winner.
Existence of a DCW is then assured by Theorem 3(i). Alternatively, if the majority
preference relation cycles within U , Theorem 3(ii) guarantees existence of DCW for
sufficiently large δ.
Existence is somewhat surprising in light of the fact that
Condorcet winners often fail to exist in static settings, particularly with policy sets
involving high dimensionality.
After all, the dimensionality of an intertemporal
policy in an infinite horizon setting is necessarily infinite. One might therefore
think existence would be more difficult to guarantee. The problem is resolved by
allowing for history dependence, which selectively reduces the range of potential
outcomes.
Theorem 3 also provides us with a “folk theorem” for a large class of environments. Various papers (McKelvey [1976, 1979], Schofield [1978]) have shown that,
under relatively weak conditions, all outcomes in the utility possibility set reside
within a large Condorcet cycle. If every outcome is contained in a Condorcet cycle,
then all outcomes are sustainable for sufficiently high δ.
In light of this folk theorem, one might conclude that our solution concept has
disposed of one problem (non-existence of Condorcet winners) only to replace it
with another (vast multiplicity of DCWs). In response, we note the following.
19
First, economists are often concerned with environments in which the folk theorem does not apply, including the following. (1) In many instances, it is natural
to assume that agents are impatient, in which case the set of DCW outcomes is
more narrowly circumscribed (see the example in Section 5). When the appropriate rate of discounting for a particular application is unknown, one can associate
each DCW with a “degree of sustainability,” based on the lowest discount factor
for which it survives.
This allows one to determine the features of an outcome
that contribute to its sustainability, and to provide sharper characterizations of the
most sustainable outcomes (see, again, the example in Section 5, where we argue
that more egalitarian distributions of a fixed prize are weakly more sustainable than
all other outcomes). Trivially, when a Condorcet winner exists in the static problem, this is also the most sustainable dynamic outcome. (2) In certain classes of
infinitely-repeated collective choice problems, the DCW set yields sharp predictions
even when the discount factor is high (see Theorem 4 below, as well as its application
to the median voter model). (3) The DCW concept is also applicable to settings
with overlapping generations, where a folk theorem does not emerge even if agents
apply no discounting within their own lifespans. As Slavov [2006a] demonstrates,
DCWs yield useful predictions in this context; indeed, the approach yields a robust
explanation for the political “clout” of the elderly.
Second, the strategic approach suffers from the same multiplicity problem. There
are a number of well-known folk theorems for repeated games. In political games,
multiplicity can be severe even with moderate rates of discounting. For example,
with repeated Downsian electoral competition, if voters’ discount factors are greater
than 1/2, then every sequence of policies is sustainable (see Duggan and Fey [2006]).
As discussed in Section 3.2, the set of DCW outcomes is a subset of equilibrium outcomes for an important class of dynamic collective choice games, so multiplicity is
actually less severe for DCWs. Moreover, just as game theorists attempt to resolve
multiplicity by considering equilibrium refinements, one can similarly prune the set
of DCWs (see Section 4.5 for analysis along these lines).
Notably, folk theorems are often interpreted as helpful, not problematic, in
strategic settings.
In particular, they tell us that, in certain circumstances, suf-
ficiently patient agents can achieve the same outcomes with self-enforcing agreements as with enforceable contracts. Accordingly, when those agents negotiate over
outcomes, they can ignore the self-enforceability requirement. A similar message
20
emerges here. If one views the DCW concept as a political feasibility constraint,
one can interpret our folk theorem as validating (in some circumstances) the widespread practice among economists of characterizing optimal policy without regard
to political considerations.19
4.3
Some unsustainable outcomes
The motivating example in Section 2.3 also suggests that, in certain instances, some
outcomes are not sustainable. We saw in particular that, while it was possible to
sustain repetitions of A, B, or C, it was not possible to sustain repetitions of the
two policies “below” the A-B-C cycle (D and E). This observation suggests that
DCW outcomes “unravel from the bottom”: E is not sustainable; since E is the
only consequence that would deter collective deviations from D, D is not sustainable
either. Our next theorem generalizes this point.
Theorem 4: Assume that U is compact and convex.
Suppose there is some set
S ⊂ U such that
(i) R is transitive on clos (S), and
(ii) for any w0 ∈ S and w00 ∈ U \S, we have w00 P w0 .
Then, for δ ∈ [0, 1), there does not exist a DCW sustaining any outcome in S (that
is, S ∩ Vδ∗ = ∅).
The proof of Theorem 4 is more subtle than one might expect. Based on our
motivating example, one might be inclined to argue that, if w is sustainable, then
the future consequence of a collective deviation from w must be another policy w0
with wP w0 ; likewise, if w0 is sustainable, then the future consequence of a collective
deviation from w0 must be yet another policy w00 with w0 P w00 , and so forth. Thus,
if w ∈ S, one eventually runs out of potential consequences. This simple intuition
is, however, incomplete, in that the consequence that sustains w could in principle
be another policy w0 with w0 P w.20 Theorem 4 imposes additional structure that
rules out this possibility in the pertinent instances. To see why, suppose we wish
19
As we demonstrate in the Appendix, if U is compact, the set of sustainable DCW payoffs is
also compact. Thus, as long as the social welfare function is continuous, a constrained optimum
exists because the set of politically feasible outcomes is compact.
20
For example, suppose w = (10, 10, 10), u = (20, 11, 0), w0 = (20, 7, 12), and δ = 12 . Then
plainly w0 P w, but wP ((1 − δ)u + δw0 ) = (20, 9, 6).
21
to sustain some w ∈ S. Let’s try to construct a DCW that punishes a deviation to
some u ∈ U \S with a punishment w0 that majority defeats w. Let v = (1−δ)u+δw0
(this is the normalized payoff vector when u is chosen). Obviously, w must majority
defeat v, which means v ∈ S by condition (ii). We can’t have v ∈ S when w0 ∈ U \S
(because v always majority defeats either u or w0 , both of which are in U \S). But
if w0 ∈ S, then v majority defeats w0 (since u majority defeats w0 ), which means v
majority defeats w by transitivity.
Theorem 4 has an immediate corollary: if R is transitive on U , if U is compact and convex, and if U contains a static Condorcet winner, wC , that is strictly
majority preferred to all other alternatives, then wC is the only normalized payoff
vector associated with a DCW.21 To illustrate, consider again the policy space
U = {(u, u, . . . , u)|u ∈ [0, 1]}. With no discounting, we know that any u > 0 is
sustainable in a DCW (footnote 10). However, with any degree of discounting,
Theorem 4 implies that the only possible outcome is u = 1.
For an application of greater economic interest, we return to the median voter
model, but specialize to the case where ηi (x) = −β(yi − x)2 (which implies no
second-order heterogeneity). Suppose that randomized policies are allowed. Banks
and Duggan [2006] show that R coincides exactly with the preference relation of
the median voter (agent M ), RM , over U . Accordingly, Theorem 4 tells us that,
for this special case, the static solution, η(yM ), is the only normalized payoff vector
associated with a DCW. The same conclusion holds when randomization is not
permitted: from Theorems 3 and 4, we know that η(yM ) is the only normalized
payoff vector associated with a stationary DCW when randomization is permitted;
from Theorem 2 we know that no other normalized payoffs are sustainable when
randomization is disallowed; and from Theorem 3 we know that η(yM ) remains
sustainable when randomization is disallowed.
In both of the preceding examples, the unique DCW outcome coincides with the
static Condorcet winner. In fact, whenever the DCW outcome is unique, it must
be a static Condorcet winner.22 Thus, existence of a static Condorcet winner is a
necessary condition for uniqueness of the DCW outcome.
To see why, let S = U \{wC }, and apply the theorem.
Uniqueness implies that the continuation utility vector is independent of the current outcome.
But then, for any DCW µ, the choice prescribed for the current period must majority-defeat all
other alternatives, considering only current payoffs.
21
22
22
Theorem 3 and its corollary show that the existence of a Condorcet cycle helps
to assure the existence DCWs if individuals are sufficiently patient.
Conversely,
when the one-shot payoff space does not contain a Condorcet winner, non-existence
is guaranteed when agents are sufficiently impatient.
Theorem 5: Suppose that U is compact. If U does not contain a Condorcet winner,
then there is some δ 0 > 0 such that DCWs do not exist for δ < δ 0 .
This result is intuitive.
With no Condorcet winner, for every element of the
policy set there is another element that strictly defeats it.
With δ near zero,
members of the group place almost all weight on the first period, so the outcome is
governed by the one-shot comparison.
4.4
Discounting
The general existence result that arises as a corollary of Theorem 3, and the nonexistence result in Theorem 5, underscore the importance of the discount factor for
collective choice. It is natural to wonder how the set of sustainable outcomes changes
with this parameter. As our next result shows, when U is convex, the set of sustainable normalized discounted payoffs expands monotonically as agents become more
patient.
Theorem 6: Assume that U is compact and convex. Consider two discount factors,
δ and δ 0 , with 1 > δ 0 > δ. Then the set of normalized payoffs sustainable
through DCWs is weakly larger with δ 0 than with δ (that is, Vδ∗ ⊆ Vδ∗0 ).
Remark: Since U is convex, the same statement holds for stationary DCWs by
Theorem 1.
When U is compact, convex, and contains no Condorcet winner, Theorems 3 (the
general existence corollary), 5, and 6 together imply that there is a single threshold
(δ ∗ = δ 0 ∈ (0, 1)) above which DCWs exist, and below which they do not. There
is no reason to think that the threshold is particularly high or low; indeed, for the
pure distributional problem considered in Section 5, it is 21 .
How should one interpret non-existence? Recall that our objective in formulating the DCW concept is to identify outcomes that are arguably robust across a range
of majoritarian institutions. The absence of robust outcomes implies that collective
23
choices must hinge on the specific features of those institutions. While our approach
does not completely resolve the problem of non-existence for Condorcet winners, it
considerably expands the range of environments for which the non-game-theoretic
identification of institutionally robust outcomes is a viable alternative to the game
theoretic, and necessarily institution-specific, mode of analysis.
4.5
Efficiency and renegotiation-proofness
In infinitely-repeated games, it is often difficult to sustain the most efficient outcomes
in the utility possibility set unless the discount factor is sufficiently high. By analogy
with subgame perfection, one might expect to have some difficulty sustaining efficient
outcomes as DCWs as well. However, it turns out that the set of DCWs has nice
efficiency properties. In particular, outcomes that are efficient within the set of
DCWs are also efficient within the utility possibility set.
For any compact set W ⊆ U , let F (W ) denote the weakly efficient frontier,
defined as follows:
F (W ) = {w ∈ W | for any other w0 ∈ W , wi ≥ wi0 for some i}
Similarly, let F s (W ) denote the strictly efficient frontier, defined as follows:
F s (W ) = {w ∈ W | for any other w0 ∈ W , wi > wi0 for some i}
Theorem 7: Assume U is compact and convex. The weakly efficient frontier of the
set of normalized payoffs associated with DCWs is weakly efficient within U ,
and the strictly efficient frontier of the set of normalized payoffs associated with
DCWs is strictly efficient within U (that is, F (Vδ∗ ) ⊆ F (U ), and F s (Vδ∗ ) ⊆
F s (U )).
Remark: As is clear from the proof of the theorem, the stated properties hold for
stationary DCWs regardless of whether U is convex.
Theorem 7 also motivates consideration of refinements. In the context of Nash
equilibria, one common refinement is to focus on the efficient frontier of the equilibrium set (the Pareto refinement). Theorem 7 tells us that, in the context of DCWs,
the Pareto refinement always places the community on the efficient frontier of U.
This is a natural refinement inasmuch as no member of the community would oppose
efforts to coordinate on a Pareto improvement.
24
There is, however, a well-known problem with the Pareto refinement: subgame
perfect equilibria that generate efficient payoffs on the equilibrium path often entail
highly inefficient outcomes (used as punishments) off the equilibrium path. These
inefficient outcomes are frequently viewed as vulnerable to renegotiation. The literature has explored various notions of renegotiation-proof equilibria for dynamic
games, beginning with Bernheim and Ray [1989] and Farrell and Maskin [1989].
The definition of a renegotiation-proof equilibrium is relatively uncontroversial for
finite horizon games. Formulating a concept for the infinite horizon case is more
controversial.
One particularly strong notion of renegotiation-proofness requires
outcomes to remain on the efficient frontier of the feasible payoff set, both on and
off the equilibrium path. With such a strong requirement, existence is not generally
guaranteed for infinitely repeated games. For equilibria that correspond to DCWs,
however, it turns out that this requirement is quite easy to satisfy.
To develop this line of analysis, we need some additional definitions. When U
is compact, each agent’s payoff is bounded below. Let ui denote the lowest feasible
payoff for agent i.
Definition: The set U is characterized by free disposal if, for all u ∈ U and u0 6= u
such that u 5 u0 ≤ u, we have u0 ∈ U . The set U is characterized by strict
comprehensiveness if, in addition, there exists u00 > u0 such that u00 ∈ U .
Both definitions are standard. Free disposal implies that it is always possible to
throw away payoffs, subject to the lower bound. Strict comprehensiveness implies
that reductions in one agent’s payoff always permit increases in others’ payoffs.
Notice that strict comprehensiveness implies free disposal.
Definition: A DCW µ satisfies universal weak efficiency (UWE) if, for all ht , the
normalized continuation payoff lies on the weakly efficient frontier of Co(U ).23
A DCW µ satisfies universal strict efficiency (USE) if, for all ht , the normalized continuation payoff vector lies on the strictly efficient frontier of Co(U ).
Let Eδ denote the set of normalized payoff vectors sustainable in DCWs satisfying universal weak efficiency. Likewise, let Eδs denote the set of normalized payoff
vectors sustainable in DCWs satisfying universal strict efficiency.
Note that any
strategic equilibrium corresponding to a universally efficient DCW (weak or strict)
23
For any set A, Co(A) denotes the convex hull of A.
25
is renegotiation-proof in the very strong sense that the community always remains
on the efficient frontier, irrespective of history.
define
Wδ∗ (A)
To state our next result, we also
as the set of normalized payoffs corresponding to stationary DCWs
when the feasible payoff set is A. Clearly, Wδ∗ (U ) and Wδ∗ (as defined in Section
4) are identical; we have simply suppressed the dependence on U until this point to
simplify notation.
Theorem 8: Assume U is compact and convex.
(i) If U is characterized by free disposal, then the following sets of normalized payoff vectors are equivalent: those associated with DCWs satisfying universal
weak efficiency, those associated with weakly efficient DCWs, and those associated with stationary DCWs when choices are restricted to the weakly efficient
frontier of U (in other words, Eδ = F (Vδ∗ ) = Wδ∗ (F (U ))).
(ii) If U is characterized by strict comprehensiveness, then the following sets of
normalized payoff vectors are equivalent: those associated with DCWs satisfying universal strict efficiency, those associated with strictly efficient DCWs,
and those associated with stationary DCWs when choices are restricted to the
strictly efficient frontier of U (in other words, Eδs = F s (Vδ∗ ) = Wδ∗ (F s (U ))).
Theorem 8 has two implications. First, the set of outcomes found in universally
efficient DCWs is exactly the same as the efficient frontier of the DCW set.
In
other words, any efficient DCW outcome can be sustained as a universally efficient
DCW. The intuition for this result is simple. To discourage a collective deviation, a
consequence need only make a majority of individuals worse off. If a consequence is
inefficient, one can replace it with another consequence that gives the same payoffs
to members of the decisive majority, and higher payoffs to individuals who are
not members of this majority.
By the argument used to prove Theorem 7, this
alternative consequence is also sustainable as a DCW. Therefore, one can continue
to increase the payoff to individuals who are not members of the decisive majority
until the punishment lies on the efficient frontier.
Second, Theorem 8 implies that the set of universally efficient DCWs is exactly
equal to the set of DCWs obtained by first throwing away all inefficient feasible
outcomes, and then finding the stationary DCW set.
To illustrate this result,
imagine the community faces the problem of dividing a dollar among its members,
26
and that it is free to throw away any portion of the dollar. Any outcome in which
the entire dollar is divided among agents is on the efficient frontier, and any outcome
in which some of the dollar is thrown away is inefficient. Theorem 8 tells us that
to find the universally efficient DCW set for this problem, we can ignore all the
inefficient outcomes (in which a portion of the dollar is destroyed) and simply solve
for DCWs under the assumption that the policy space is just the unit simplex. Note
that this result holds even when the efficient frontier of the utility possibility set is
not convex (provided the underlying set U is convex).
As an application, we return again to the median voter model.
So far, we’ve
shown that the static Condorcet winner, η(yM ), is always sustainable through a
DCW, and that it’s the only sustainable outcome when preferences exhibit no
second-order heterogeneity. However, we also know that, more generally, there are
other sustainable outcomes. Our next result shows that, with modest assumptions
concerning preferences, universal weak efficiency rules out these other outcomes,
leaving only the repeated static solution.
Theorem 9: Consider the median voter model.
Assume preferences satisfy the
weak single-crossing property (specifically, if ηi (y 0 ) ≥ ηi (y 00 ) for y0 < y00 , then
ηj (y0 ) ≥ ηj (y 00 ) for j < i). Assume also that δ ∈ (0, 1). The static Condorcet
winner is the only normalized payoff vector sustainable in a DCW satisfying
universal weak efficiency (that is, Eδ = {η(yM )}).
5
Application: Pure Distributive Politics
As an application, we now consider the classical problem of dividing a fixed payoff (henceforth a “dollar”) among a number of agents (e.g., Baron and Ferejohn
[1987,1989], Epple and Riordan [1987]). The problem is important because it involves pure distributive politics. It is subtle because there is no Condorcet winner;
for any division of the dollar, there exists another division that makes a majority
strictly better off.
Since every possible choice belongs to a Condorcet cycle, this is an instance where
our folk theorem (Theorem 3) applies.
Our objective here is to characterize the
DCW set as a function of the discount factor. We offer a complete characterization
for the case of N = 3 and a partial characterization for the case of N > 3, and we
identify implications for distributional policy.
27
Formally, we assume that each individual’s utility is equal to the share of the
dollar received. Thus, the set of feasible payoffs is the unit simplex:
(
)
N
X
U = ∆N ≡ u ∈ RN
ui = 1 .
+ |
i=1
Note that we do not allow the community to throw away any fraction of the dollar.
However, as discussed in the previous section, this is equivalent to allowing for free
disposal provided that we restrict attention to universally efficient DCWs. Define
(
)
X
N
N
ui ≥ 1 − δ
∆δ = u ∈ ∆ | ∀A ⊂ {1, ..., N} with |A| = M ,
i∈A
Theorem 10: (i) The set of normalized payoffs associated with DCWs is contained
∗
N
in ∆N
δ (that is, Vδ ⊆ ∆δ ).
(ii) If δ < 12 , no DCWs exist.
(iii) If δ ≥
(that
N−1
N+1 , the set of
is, Vδ∗ = ∆N
δ ).
normalized payoffs associated with DCWs is exactly ∆N
δ
Parts (i) and (iii) of Theorem 10 imply that if N = 3, all policies giving at least
(1 − δ) to every majority are in Wδ∗ provided δ ≥
1
2
(if N = 3,
N −1
N +1
=
1
2 ).
We
illustrate these constraints in Figure 1 for various values of δ. The innermost set
shows the divisions that are sustainable when δ = .5. This is contained in the set
that is sustainable for δ = .8, which in turn is contained in the set that is sustainable
for δ = 1. As discussed in Section 4.2, one can associate each DCW with a “degree
of sustainability,” based on the lowest discount factor for which it survives (with
survival at lower discount factors implying greater sustainability). Here, the most
sustainable outcomes are those in the innermost set.
Note that, for δ ∈ [ 12 , 1), some outcomes are sustainable, and some are not. This
illustrates as point made earlier: our solution concept is more discerning than, say,
repeated Downsian competition, which can produce any outcome as long as δ >
1
2
(see Duggan and Fey [2006], who also refine the equilibrium set).
If we interpret each individual as representing a social class (e.g. poor, middle
class, and wealthy), several interesting and significant conclusions follow from an
inspection of Figure 1. (1) No alternative is more sustainable than the egalitarian
outcome (u∗ ).
(2) Starting from any other alternative, movements toward egali-
tarianism always (weakly) improve sustainability. We illustrate this in the figure
28
Figure 1: Division of a Fixed Payoff
by drawing a straight line from an arbitrary point (u0 ) to the egalitarian outcome
(u∗ ). As we move along the line toward u∗ , we cross into payoff sets sustained at
progressively lower discount factors. These first two conclusions suggest that political considerations favor equality. There are, or course, unequal outcomes that are
as sustainable as perfect egalitarianism (e.g., point u00 ).
However: (3) penalizing
one party requires a measure of equality between the other two. Note, for example,
that as one moves from u∗ to u00 , decreasing the payoff to agent 3, the scope for
inequality between agents 1 and 2 (as indicated by the double-headed arrow) within
the set of most sustainable outcomes declines monotonically. This leads to our next
conclusion: (4) it is easier to sustain outcomes with extremely poor minorities (who
receive essentially nothing) than ones with extremely wealthy minorities. That is,
political feasibility places a limit on wealth, but not on poverty. To see this point,
note that, for any δ ≥ 12 , no individual receives more than δ, but there are DCWs
29
in which one individual is arbitrarily poor. Within the most sustainable set, the
Gini coefficient lies in the interval [0, 12 ]; the highest degree of inequality is attained
for outcomes in which one party receives nothing, and the other two split the prize
equally.
Finding DCW outcomes when N > 3 and
1
2
≤δ ≤
N−1
N+1
is more complicated.
By theorem 10(i), we know that Wδ∗ ⊆ ∆N
δ for δ in this range. However, by way of
example, one can show that not all policies in ∆N
δ are sustainable.
6
Concluding remarks
The value of any solution concept lies in the insights it supplies when applied to
concrete economic problems. Here we have explored two standard applications: the
“median voter” problem, and division of a fixed surplus. For the first application,
we’ve identified both a preference restriction and an equilibrium refinement that justify focusing on the static solution even when the setting is dynamic. For the second
application, we’ve shown that political considerations (weakly) favor equality, and
that political feasibility places upper limits on wealth, but does not place lower limits on poverty. For more elaborate applications to other significant economic issues,
see Bernheim and Slavov [2004] (who study the selection of incentive compatible tax
systems), Slavov [2006a] (who studies the political power of the elderly), and Slavov
[2006b] (who examines private and public provision of public goods).
Although we have defined DCWs for a class of simple repeated collective choice
problems with infinitely-lived agents, our definition extends in a straightforward
manner to other dynamic settings, including those with overlapping generations (see
Slavov [2006a]) and/or state variables. Similarly, the concept is easily generalized
to encompass environments with publicly observed, payoff-irrelevant random events.
This generalization permits one to convexify the continuation payoff set Wδ∗ through
public randomization, thereby facilitating numerical computation of DCW sets (see
Slavov [2006b]).
Other extensions are also possible.
In defining our solution concept, we have
used the majority preference relation. One could define similar concepts for other
relations (e.g. a supermajority requirement).
A careful reading of our proofs
reveals that some results rely only on completeness and continuity of the majority
preference relation, while others also require the Pareto property.
As one of our
referees noted, one could alternatively reinterpret our framework as describing the
30
choices of a single dynamically inconsistent decision-maker whose preferences are
also intransitive.
31
Appendix
To prove several of our theorems, we use an analog of the self-generation map,
which is a tool used to simplify the analysis of subgame-perfect equilibrium (see
Abreu, Pearce, and Stacchetti [1990]). We define the self-generation map for DCWs
as follows: for any set W ⊆ Co(U ) (where Co(U ) denotes the convex hull of U ),
Ψδ (W ) = {w | there exists u00 ∈ U and w00 ∈ W such that w = (1 − δ)u00 + δw00
and, ∀u ∈ U , there exists w0 ∈ W with wR((1 − δ)u + δw0 )}
In words, the self-generation map identifies the set of normalized payoffs that are
achievable when normalized continuation payoffs must lie in the set W . A normalized payoff vector w can be achieved provided that (i) w can be decomposed into
some feasible current outcome u00 and some w00 in the feasible normalized continuation set W , and (ii) for any other current outcome u, there is another feasible
normalized continuation payoff vector w0 such that a majority prefers u00 followed
by w00 to u followed by w0 .
In a stationary environment, the set of feasible nor-
malized continuation payoffs must be the same as the set of achievable normalized
payoffs. Consequently, the set of normalized equilibrium payoffs is a fixed point of
the self-generation map.
For a stationary DCW, the prescribed current payoff and the normalized continuation payoff are always the same.
Accordingly, we define the stationary self-
generation map as follows: for any set W ⊆ U ,
©
ª
Φδ (W ) = w ∈ W | ∀u ∈ U , there exists w0 ∈ W with wR((1 − δ)u + δw0 )
We demonstrate below in Lemma 5 that Vδ∗ and Wδ∗ correspond to the largest
fixed points of the self-generation maps Ψδ and Φδ , respectively.
To obtain the
largest fixed point of Ψδ , one natural strategy is to start with the set of feasible
normalized payoff vectors, Ωδ , and iteratively apply Ψδ , at each step eliminating
infeasible continuation payoffs until convergence is achieved. Formally,
)
(
∞
X
t t
1 2
t
δu
Ωδ ≡ w | ∃(u , u , ...) with u ∈ U ∀t such that w = (1 − δ)
t=0
We note that Ωδ = U when U is convex.
Define Ψ1δ = Ψδ (Ωδ ), and recursively
Ψtδ = Ψδ (Ψt−1
δ ). Let
∞
t
Ψ∞
δ = ∩t=0 Ψδ
32
Likewise, to obtain the largest fixed point of Φδ , one might start with the payoff set
U and iteratively apply Φδ until convergence is achieved. Define Φ1δ = Φδ (U ), and
recursively Φtδ = Φδ (Φt−1
δ ). Let
∞
t
Φ∞
δ = ∩t=0 Φδ
Our analysis begins with a series of lemmas.
Lemma 1: Consider a sequence u1 , u2 , ... ∈ RN converging to u∗ .
second sequence
w1 , w2 , ...
∈
RN
Consider a
with at least one limit point, and un Rwn for
each n. Then, for any limit point w∗ of the second sequence, we have u∗ Rw∗ .
Proof: The lemma reflects a well-known property (continuity of the majority preference relation), but is also easy to establish directly.
we can select some subsequence indexed by nt with
uni t
≥ wint } ≡ I ∗
{i |
we have u∗i ≥ wi∗ for
wnt
→
w∗
Since N is finite,
such that the set
is the same for all t. We know that #I ∗ ≥ M . Taking limits,
i ∈ I ∗. ¥
Lemma 2: (a) If A ⊆ B ⊆ Ωδ , then Ψδ (A) ⊆ Ψδ (B). (b) If A ⊆ B ⊆ U , then
Φδ (A) ⊆ Φδ (B). (c) If A ⊆ Ωδ and A ⊆ Ψδ (A), then A ⊆ Ψ∞
δ . (d) If
t+1
⊆ Ψtδ ⊆ U for all t ≥ 1. (f)
A ⊆ U and A ⊆ Φδ (A), then A ⊆ Φ∞
δ . (e) Ψδ
Φt+1
⊆ Φtδ ⊆ U for all t ≥ 1.
δ
Proof: (a) Consider any w ∈ Ψδ (A). We know there exists u00 ∈ U and w00 ∈ A
with w = (1 − δ)u00 + δw00 ; obviously, w00 ∈ B.
exists
w0
∈ A such that wR ((1
− δ)u + δw0 ).
this holds for all u ∈ U , we have w ∈ Ψδ (B).
Moreover, for each u ∈ U , there
Since A ⊆ B, we have w0 ∈ B. Since
(b) Consider any w ∈ Φδ (A). For each u ∈ U , there exists w0 ∈ A such that
wR ((1 − δ)u + δw0 ). Since A ⊆ B, we have w0 ∈ B. Since this holds for all u ∈ U,
we have w ∈ Φδ (B).
(c) By part (a) (taking B = Ωδ ), A ⊆ Ψδ (A) ⊆ Ψδ (Ωδ ) = Ψ1δ . Now assume that
t−1
t−1
t
A ⊆ Ψt−1
δ . Then, by part (a) (taking B = Ψδ ), A ⊆ Ψδ (A) ⊆ Ψδ (Ψδ ) = Ψδ .
Thus, by induction, A ⊆ Ψtδ for all t, which implies A ⊆ Ψ∞
δ .
(d) Same as (c), substituting U for Ωδ and Φ for Ψ.
(e) By definition, Ψ1δ = Ψδ (Ωδ ) ⊆ Ωδ . Applying part (a) (taking A = Ψ1δ and
B = Ωδ ), we have Ψ2δ ⊆ Ψ1δ .
Now apply induction: assuming Ψtδ ⊆ Ψt−1
and
δ
t+1
applying part (a) (taking A = Ψtδ and B = Ψt−1
⊆ Ψtδ .
δ ), we have Ψδ
33
(f) Follows immediately from the fact that, by construction, Φδ (W ) ⊆ W for
any set W . ¥
Lemma 3: Assume U is compact.
(a) Consider W ⊆ Ωδ ; if W is compact, so
is Ψδ (W ). (b) Consider W ⊆ U ; if W is compact, so is Φδ (W ). (c) Ψtδ is
compact for all t, including t = ∞. (d) Φtδ is compact for all t, including
t = ∞.
Proof: (a) Consider any convergent sequence w1 , w2 , ... ∈ Ψδ (W ) with limit
point w∞ .
For each wt , there exists u00t ∈ U and wt00 ∈ W such that wt = (1 −
δ)u00t + δwt00 . Since U is compact, it contains all limit points of u00t ; likewise, since
W is compact, it contains all limit points of wt00 . Consequently, there exists u00∞ ∈ U
00 ∈ W such that w = (1 − δ)u00 + δw00 .
and w∞
∞
∞
∞
For each wt , there exists a function wt0 : U → W such that, ∀u ∈ U , we
have wt R ((1 − δ)u + δwt0 (u)).
0
quence wt (u).
w∞ R ((1 − δ)u
contains w∞ ).
0 (u) be a limit point of the seFor each u, let w∞
0 (u) ∈ W .
Since W is compact, w∞
0 (u))
+ δw∞
(Lemma 1).
Moreover, ∀u ∈ U , we have
This establishes that Ψδ (W ) is closed (it
It is easy to verify that, since U is compact, Ωδ is compact. For W ⊆ Ωδ , we
have Ψδ (W ) ⊆ Ωδ by construction. Since Ψδ (W ) is a closed subset of a compact
set, it is compact.
(b) We establish that Φδ (W ) is closed using the same argument as in the second
paragraph of the proof of part (a). Since Φδ (W ) ⊆ W by construction, and since
W is compact by assumption, Φδ (W ) is compact.
(c) By induction, part (a) (along with the compactness of Ωδ ) implies that Ψtδ
is compact for all t. Part (e) of Lemma 2 implies that the sequence Ψ1δ , Ψ2δ ,... is
nested. Since Ψ∞
δ is the infinite intersection of nested compact sets, it is compact.
(d) Same as (c), substituting Φ for Ψ, U for Ωδ , part (b) for part (a), and part
(f) of Lemma 2 for part (e). ¥
∞
∞
∞
Lemma 4: If U is compact, then (a) Ψδ (Ψ∞
δ ) = Ψδ and (b) Φδ (Φδ ) = Φδ .
∞
Proof: (a) First we argue that Ψδ (Ψ∞
δ ) ⊆ Ψδ . From part (a) of Lemma 2 and
t+1
t
∞
for all t, from which the
the fact that Ψ∞
δ ⊆ Ψδ for all t, we have Ψδ (Ψδ ) ⊆ Ψδ
conclusion follows directly.
34
∞
∞
Next we argue that Ψδ (Ψ∞
δ ) ⊇ Ψδ . Assume that Ψδ is non-empty; otherwise
∞
the statement is trivial. Take any w ∈ Ψ∞
δ . We show that w ∈ Ψδ (Ψδ ), in three
steps.
Step 1: We claim that, for any sequence w1 , w2 , ... converging to some limit w∞
t
with wt+1 ∈ Ψtδ for each t, we have w∞ ∈ Ψ∞
δ . Since each Ψδ contains wt+1 , and
since Ψτδ ⊆ Ψtδ for τ > t (Lemma 2 part (e)), we have wτ ∈ Ψtδ for all τ ≥ t + 1; since
00 . Since this holds for all t, w00 ∈ Ψ∞
Ψtδ is also compact, it therefore contains w∞
∞
δ
00 ∈ Ψ∞ with w = (1 −
Step 2: We claim that there exists u00∞ ∈ U and w∞
δ
00 . For all t ≥ 1, we know w ∈ Ψt , so for each t there exists u00 ∈ U
δ)u00∞ + δw∞
t
δ
with w = (1 − δ)u00t + δwt00 . Consider any subsequence for which u00t
and wt00 ∈ Ψt−1
δ
00 (existence is assured since Ω is compact).
and wt00 converge to limits u00∞ and w∞
δ
00 ∈ Ψ∞ .
Clearly, u00∞ ∈ U and (from step 1) w∞
δ
0 (u) ∈ Ψ∞ such that
Step 3: We claim that, for each u ∈ U , there exists w∞
δ
0 (u)). For each t ≥ 1, we know w ∈ Ψt , so for each t there exists
wR ((1 − δ)u + δw∞
δ
a function wt0 : U → Ψt−1
such that for all u ∈ U , we have wR ((1 − δ)u + δwt0 (u)).
δ
0
0 (u) be a limit point of the subsequence w (u). By step 1, w0 (u) ∈
For each u, let w∞
t
∞
0
Ψ∞
δ . By Lemma 1, we have wR ((1 − δ)ui + δw∞ (u)).
Together, steps 2 and 3 imply w ∈ Ψδ (Ψ∞
δ ), as required.
∞
(b) To prove Φδ (Φ∞
δ ) ⊆ Φδ , we argue exactly as in part (a), substituting Φ for
Ψ, and Lemma 2, part (b) for Lemma 2, part (a).
∞
To prove Φδ (Φ∞
δ ) ⊇ Φδ , we
argue as in steps 1 and 3 of part (a), substituting Φ for Ψ, and Lemma 2, part (f)
for Lemma 2, part (e). ¥
Henceforth, let D = (1, δ, δ 2 , ...)0 . For any feasible policy sequence γ = (u0 , u1 , ...),
P
t t
the normalized payoff vector is (1 − δ)γD = (1 − δ) ∞
t=0 δ u .
Lemma 5: (a) If W ⊆ Ψδ (W ) for some W ⊆ Ωδ , then W ⊆ Vδ∗ . (b) If W ⊆ Φδ (W )
for some W ⊆ U , then W ⊆ Wδ∗ .
Proof: (a) Consider any w ∈ W . We will construct functions µ : ∪∞
τ =0 Hτ → U
b 0 ) = w, and such that for all τ ≥ 0 and hτ ∈ Hτ
and w
b : ∪∞
τ =0 Hτ → W such that w(h
we have (i) w(h
b τ ) = (1−δ)µ(hτ )+δ w(h
b τ , µ(hτ )), and (ii) w(h
b τ )R ((1 − δ)u + δ w(h
b τ , u))
for all u ∈ U .
Since w ∈ Ψδ (W ), we know that ∃ u00 ∈ U and w00 ∈ W such that w = (1−δ)u00 +
δw00 and, ∀u ∈ U , there exists w0 (u) ∈ W with wR ((1 − δ)u + δw0 (u)). Define µ on
the degenerate domain H0 as follows: µ(h0 ) = u00 ; extend w
b to the domain H0 ∪ H1
35
as follows: w(u
b 00 ) = w00 , and w(u)
b
= w0 (u) for u ∈ U \{u00 }. Clearly, this satisfies
properties (i) and (ii) for τ = 0
Now we complete the definitions of the functions µ and w
b through induction
on t.
Fix t > 0, and assume we’ve defined the functions µ : ∪t−1
τ =0 Hτ → U and
w
b : ∪tτ =0 Hτ → W satisfying properties (i) and (ii) for τ = 0, ...t − 1 and all hτ ∈ Hτ .
b t ) ∈ W , we know that ∃ u00 (ht ) ∈ U and w00 (ht ) ∈
Consider any ht ∈ Ht . Since w(h
W such that w(h
b t ) = (1−δ)u00 (ht )+δw00 (ht ) and, ∀u ∈ U , there exists w0 (ht , u) ∈ W
with w(h
b t )R ((1 − δ)u + δw0 (ht , u)). Extend µ to the domain ∪tτ =0 Hτ by defining
µ(ht ) = u00 (ht ), and extend w
b to the domain ∪t+1
b t , u) as follows:
τ =0 Hτ by defining w(h
b t , u) = w0 (ht , u) for u ∈ U \{u00 (ht )}. This completes
w(h
b t , u00 (ht )) = w00 (ht ), and w(h
the induction step. Note that, by construction, the functions satisfy properties (i)
and (ii) for τ = 0, ...t and hτ ∈ Hτ .
b t ) (that is, the policy program µ generates
By property (i), (1−δ)C µ (ht )D = w(h
the normalized payoff vector w(h
b t ) when initialized at ht ).
that (1 −
δ)C µ (h0 )D
Note in particular
= w, which means the policy program delivers the desired
normalized payoff vector. To complete the argument, we need only show that µ is a
DCW. Consider any ht . Since (1 − δ)C µ (ht , u)D = w(h
b t , u) for all u ∈ U , property
(ii) immediately implies C µ (ht )Rδ (u, C µ (ht , u)) for all u ∈ U . Thus, µ is a DCW.
(b) Consider any w ∈ W .
We will construct a function µ : ∪∞
τ =0 Hτ → U
such that µ(h0 ) = w, and such that for all τ ≥ 0, hτ ∈ Hτ , we have (i) µ(ht ) =
µ(ht , µ(ht )), and (ii) for all u ∈ U , we have µ(hτ )R ((1 − δ)u + δµ(hτ , u)).
Since w ∈ Φδ (W ), we know that, ∀u ∈ U , there exists w0 (u) ∈ W with
wR ((1 − δ)u + δw0 (u)). Extend µ to the domain H0 ∪ H1 as follows: µ(w) = w,
and µ(u) = w0 (u) for u ∈ U \{w}. Clearly, this satisfies properties (i) and (ii) for
τ = 0.
Now we complete the definition of the function µ through induction on t. Fix t >
0, and assume we’ve defined the function µ : ∪t−1
τ =0 Hτ → U satisfying properties (i)
and (ii) for τ = 0, ...t − 1 and all hτ ∈ Hτ . Consider any ht ∈ Ht . Since µ(ht ) ∈ W ,
we know that, ∀u ∈ U , there exists w0 (ht , u) ∈ W with µ(ht )R ((1 − δ)u + δw0 (ht , u)).
Extend µ to the domain ∪t+1
τ =0 Hτ by defining µ(ht , u) as follows: µ(ht , µ(ht )) = µ(ht ),
and µ(ht , u) = w0 (ht , u) for u ∈ U \{µ(ht )}. This completes the induction step. Note
that, by construction, the function satisfies properties (i) and (ii) for τ = 0, ...t and
hτ ∈ Hτ .
By property (i), µ is stationary, which means that, for the history ht , it delivers
36
the normalized payoff (1 − δ)C µ (ht )D = µ(ht ). In particular, (1 − δ)C µ (h0 )D = w,
as required.
To complete the argument, we need only show that µ is a DCW.
Consider any ht .
Since (1 − δ)C µ (ht , u)D = µ(ht , u) for all u ∈ U , property (ii)
immediately implies C µ (ht )Rδ (u, C µ (ht , u)) for all u ∈ U . Thus, µ is a DCW. ¥
Henceforth, for any policy program µ, define
Πµ = {u | u = (1 − δ)C µ (ht )D for some feasible history ht }
In words, Πµ is the collection of normalized payoffs associated with all histories for
the policy program µ.
Lemma 6: Assume U is compact. (a) Ψ∞
δ is identical to the set of normalized payoffs associated with DCWs (Vδ∗ ). (b) Φ∞
δ is identical to the set of normalized
payoffs associated with stationary DCWs (Wδ∗ ).
∗
Proof: (a) First we argue that Ψ∞
δ ⊆ Vδ . From Lemma 4, part (a), we know
∞
that Ψ∞
δ = Ψδ (Ψδ ). The desired conclusion then follows from part (a) of Lemma
5.
Next we argue that Vδ∗ ⊆ Ψ∞
δ . Consider any DCW, µ.
We claim that Πµ ⊆
Ψδ (Πµ ). From Lemma 2 part (c), it then follows that Πµ ⊆ Ψ∞
δ . Since this is true
for every DCW µ, the desired conclusion is immediate.
To establish the claim, consider any w ∈ Πµ . Let ht be some t-history for which
(1 − δ)C µ (ht )D = w. Clearly, w = (1 − δ)µ(ht ) + δ(1 − δ)C µ (ht , µ(ht ))D, where
µ(ht ) ∈ U and (1 − δ)C µ (ht , µ(ht ))D ∈ Πµ . Since µ is a DCW, we know that,
for all u ∈ U , we have [(1 − δ)C µ (ht )D] R [(1 − δ)u + δ(1 − δ)C µ (ht , u)D], where
(1 − δ)C µ (ht , u)D ∈ Πµ . But this implies w ∈ Ψδ (Πµ ), as required.
∗
(b) First we argue that Φ∞
δ ⊆ Wδ . From Lemma 4, part (b), we know that
∞
Φ∞
δ = Φδ (Φδ ). The desired conclusion then follows from part (b) of Lemma 5.
Next we argue that Wδ∗ ⊆ Φ∞
δ . Consider any stationary DCW, µ.
We claim
that Πµ ⊆ Φδ (Πµ ). From Lemma 2 part (d), it then follows that Πµ ⊆ Φ∞
δ . Since
this is true for every stationary DCW µ, the desired conclusion is immediate.
To establish the claim, consider any w ∈ Πµ . Let ht be some t-history for which
µ(ht ) = w.
Since µ is a stationary DCW, we know that, for all u ∈ U , we have
µ(ht )R ((1 − δ)u + δµ(ht , u)), where µ(ht , u) ∈ Πµ . But this implies w ∈ Φδ (Πµ ), as
required. ¥
37
Lemma 7: Assume U is compact. (a) The set of normalized payoffs associated with
DCWs (Vδ∗ ) is compact and identical to Ψ∞
δ . (b) The set of normalized payoffs
associated with stationary DCWs (Wδ∗ ) is compact and identical to Φ∞
δ .
Proof: The result follows directly from Lemma 3 and Lemma 6.¥
Lemma 7 is useful because it provides both a tool for proving other results, and
an algorithm for finding the set of normalized payoffs associated with DCWs in
parameterized applications.
Proof of Theorem 1
We claim that, when U is compact and convex, Vδ∗ ⊆ Φ∞
δ . Since we know that
∗
∗
Φ∞
δ = Wδ ⊆ Vδ (Lemma 6, part (b)), the theorem follows directly.
Consider any DCW, µ. We establish first that Πµ lies in the domain of Φδ : with
U convex, we have U = Ωδ = Co(U ), so Πµ ⊆ U follows from Πµ ⊆ Ωδ . The claim
then follows from Lemma 5, part (b), as long as Πµ ⊆ Φδ (Πµ ).
Consider any w ∈ Πµ . Let ht be some t-history for which (1 − δ)C µ (ht )D = w.
Since µ is a DCW, we know that, for all u ∈ U , we have wR ((1 − δ)u + δ(1 − δ)C µ (ht , u)D),
where (1 − δ)C µ (ht , u)D ∈ Πµ . But this implies w ∈ Φδ (Πµ ), as required. ¥
Proof of Theorem 2
Let U be the feasible utility set, and let HtU ≡ U t−1 be the set of feasible t-
histories, when randomizations are not allowed.
the feasible utility set, and
Co(U )
Ht
With randomizations, Co(U ) is
= [Co(U )]t−1 is the set of feasible t-histories.
U
Consider any policy program µ : ∪∞
k=0 Hk → U that constitutes a DCW when
Co(U)
randomizations are not allowed. We construct a policy program µ
b : ∪∞
k=0 Hk
→
U ⊆ Co(U ) as follows. For any u ∈ Co(U ), there exists a Borel probability measure
λ(u) on X such that Eλ(u) [η(x)] = u (where Eλ is the expectations operator given
£
¤
λ). Let ξ(u) = η Eλ(u) (x) . Since X is convex, Eλ(u) (x) ∈ X, so ξ(u) ∈ U . Since
Co(U)
ηi is concave, ξi (u) ≥ ui . Now consider any t and ht = (u1 , u2 , ..., ut−1 ) ∈ Ht
b t ) ≡ (ξ(u1 ), ξ(u2 ), ..., ξ(ut−1 )) ∈ H U , we can define µ
b t )).
Since ξ(h
b(ht ) = µ(ξ(h
.
t
To complete the proof, we check that µ
b is a DCW when randomizations are
Co(U)
, and u ∈ Co(U ). By conallowed. Consider any t, ht = (u1 , u2 , ..., ut−1 ) ∈ Ht
b t )). Since µ is a DCW, C µ (ξ(h
b t ))Rδ (ξ(u), C µ (ξ(h
b t ), ξ(u))).
struction, C µb (ht ) = C µ (ξ(h
b t ), ξ(u)) = C µb (ht , u) by construction,
But since ξi (u) ≥ ui for all i, and since C µ (ξ(h
we have C µb (ht )Rδ (u, C µb (ht , u)). But this implies that µ
b is a DCW. ¥
38
Proof of Theorem 3
Part (i): Proof appears in the text.
Part (ii): Define χ(k) ≡ k + 1 for k = 1, ..., K − 1, and χ(K) = 1. Choose any
wk ∈ W . Since wk P wχ(k) , there plainly exists δk∗ ∈ [0, 1) such that, for δ ∈ (δk∗ , 1],
¡
¡
¢
¢
we have wk P (1 − δ)u + δwχ(k) , which in turn implies wk P (1 − δ)u + δwχ(k) for
all u ∈ U. Since wχ(k) ∈ W , this means wk ∈ Φδ (W ). Taking δ ∗ = maxk=1,...,K δk∗ ,
we plainly have W ⊆ Φδ (W ) for all δ ∈ (δ ∗ , 1].
The result then follows as an
application of Lemma 5, part (b).24
Proof of Theorem 4
Since U is convex, Vδ∗ = Wδ∗ (Theorem 1). Assume, contrary to the theorem,
that S ∩ Wδ∗ is non-empty.
In light of (i) and the continuity of the majority
preference relation, we can represent R on clos(S) by a continuous “utility” function
ψ(u) (Mas-Colell, Whinston, and Green [1995], Proposition 3.C.1, p. 47). Define
Θ ≡ clos(S) ∩ Wδ∗ . By Lemma 3, part (d), Θ is compact. Thus, ψ(u) reaches a
minimum on Θ. Choose any w0 ∈ arg minu∈Θ ψ(u). Note that w0 ∈ S (if not, then
since S ∩ Wδ∗ is non-empty by hypothesis, there would exist some w ∈ S ∩ Wδ∗ such
that ψ(w) ≥ min ψ(u) = ψ(w0 ), which implies wRw0 ; since w0 ∈
/ S by assumption,
this violates (ii)).
Now consider some u ∈ U \S. Since w0 ∈ Wδ∗ = Φδ (Wδ∗ ), there exists w0 ∈ Wδ∗
such that w0 R(1 − δ)u + δw0 ≡ v. By convexity, v ∈ U , and by (ii), v ∈ S. There
are two cases to consider.
Case 1: w0 ∈ U \S. Since v is a convex combination of w0 and u, we must either
have vRu (which occurs if w0 Ru) or vRw0 (which occurs if uRw0 ). But since v ∈ S,
while u, w0 ∈ U\S, this contradicts (ii).
Case 2: w0 ∈ S. Since u ∈ U \S, (ii) implies uP w0 , which in turn implies vP w0 .
Combining this with w0 Rv yields w0 P w0 (recall that w0 , v, w0 ∈ S, on which R is
transitive). But since w0 ∈ Θ, this contradicts the definition of w0 . ¥
Proof of Theorem 5
For any w, u ∈ U , define α(w, u) as follows: if N is odd, α(w, u) is the M -th
largest value of ui − wi (among i = 1, ..., N ); if M is even, it is the M + 1-th largest
24
It is also possible to prove this result through a direct constructive argument. The proof given
here is considerably simpler, however, given that we’ve already proven Lemma 5, part (b) (using a
constructive argument that covers a wider range of cases).
39
value.
Define α∗ (w) = maxu∈U α(w, u) and select
Notice that α is continuous.
a function u
b(w) ∈ arg maxu∈U α(w, u).
existence of a maximum is assured, so
α∗
Since U is compact and α is continuous,
is well-defined. Moreover, the maximum
theorem tells us that α∗ (w) is continuous. Consider w∗ ∈ arg minw∈U α∗ (w). Since
U is compact and α∗ is continuous, existence of a minimum is assured. Since w∗ is
not a Condorcet winner by assumption, α∗ (w∗ ) > 0.
Select a vector u such that u 5 u for u ∈ U , and a vector u such that u = u for
all u ∈ U (existence is assured for u and u since U is compact).25 Notice that there
exists some δ ∗ > 0 such that, when δ < δ ∗ , every element of the vector
is strictly less than α∗ (w∗ ).
We claim that no DCW exists for δ < δ ∗ .
δ
1−δ
(u − u)
Consider any policy program µ.
Discounted payoffs as of period 0 are no greater than µ(h0 ) +
δ
1−δ u.
If the out-
come in period 0 is u
b (µ(h0 )) rather than µ(h0 ), discounted payoffs are at least
δ
δ
u
b (µ(h0 )) + 1−δ
u. But with every element of 1−δ
(u − u) strictly less than α∗ (w∗ ),
´ ³
´
³
δ
δ
we have u
b (µ(h0 )) + 1−δ
u . This means that u
b (µ(h0 )) and its
u P µ(h0 ) + 1−δ
prescribed continuation is strictly majority preferred to µ(h0 ) and its prescribed
continuation, so µ is not a DCW. ¥
Proof of Theorem 6
Consider any W ⊆ U .
Φδ0 (W ).
We claim that Φδ (W ) ⊆ Φδ0 (W ).
Select any w ∈
/
Then there exists u ∈ U such that, ∀w0 ∈ W , ((1 − δ 0 )u + δ 0 w0 ) P w.
Define u0 = λu + (1 − λ)w, where
µ
¶
¶µ
1 − δ0
δ
λ≡
∈ [0, 1)
δ0
1−δ
Since U is convex, u0 ∈ U . Through some simple algebra, one can verify that, for
any w0 ∈ W ,
µ ¶
¤
δ £
(1 − δ 0 )u + δ 0 w0 − w
(1 − δ)u + δw − w =
0
δ
0
0
But this implies that (1 − δ 0 )ui + δ 0 wi0 > wi iff (1 − δ)u0i + δwi0 > wi . Consequently,
/ Φδ (W ), from
∀w0 ∈ W , we have ((1 − δ)u0 + δw0 ) P w. This is equivalent to w ∈
which the claim follows directly.
25
Throughout, v = w denotes vi ≥ wi for all i, v ≥ w denotes vi ≥ wi for all i with strict
inequality for some i, and v > w denotes vi > wi for all i.
40
Next we argue that Φtδ ⊆ Φtδ0 for all t.
of t = 1 (taking W = U ).
Lemma 2
Φδ (Φt−1
δ0 )
part (b), Φtδ
⊆ Φδ0 (Φt−1
δ0 )
conclusion. Applying
The preceding claim covers the case
Now suppose that the property holds for t − 1.
By
t−1
= Φδ (Φt−1
δ ) ⊆ Φδ (Φδ 0 ). Moreover, by the preceding claim,
= Φtδ0 . Combining these two statements yields the desired
induction establishes Φtδ ⊆ Φtδ0 for all t, from which Vδ∗ ⊆ Vδ∗0
follows directly as an application of Lemma 6, part (b), and Theorem 1. ¥
Proof of Theorem 7
Consider any set W ⊂ U and any w, v ∈ W with v ≥ w.
We claim that
w ∈ Φδ (W ) implies v ∈ Φδ (W ). This follows immediately from the observation that,
for all u ∈ U and w0 ∈ W , we have wR ((1 − δ)u + δw0 ) implies vR(1 − δ)u + δw0 ).
Let f (w) ≡ {v ∈ F (U ) | v ≥ w}. Consider any w ∈ Wδ∗ . We know that w ∈ Φtδ
for all t (Lemma 6, part (b)). Through induction, we will prove that f (w) ⊆ Φtδ
for all t. Consider first t = 1. Since w ∈ Φ1δ = Φδ (U ), the preceding claim implies
v ∈ Φδ (U ) = Φ1δ for all v ∈ f (w) ⊆ U , as desired. Now assume the statement is
true for t − 1. Since w ∈ Φtδ = Φδ (Φt−1
δ ), the preceding claim likewise implies that
t−1
t
v ∈ Φδ (Φt−1
δ ) = Φδ for all v ∈ f (w) ⊆ Φδ , as desired. Thus, applying induction
along with Lemma 6, part (b), we have f (w) ⊆ Wδ∗ .
/ F (U ). Since Wδ∗ ⊆ U , there exists some
Now consider any w ∈ Wδ∗ with w ∈
/ F (Wδ∗ ). Similarly,
v ∈ f (w) such that v > w. Since f (w) ⊆ Wδ∗ , this means w ∈
consider any w ∈ Wδ∗ with w ∈
/ F s (U ). Since Wδ∗ ⊆ U , there exists some v ∈ f (w)
/ F s (Wδ∗ ). The theorem then
such that v ≥ w. Since f (w) ⊆ Wδ∗ , this means w ∈
follows from the fact that Vδ∗ = Wδ∗ when U is compact and convex. ¥
Proof of Theorem 8
We provide a separate argument for each part of the theorem.
Throughout,
recall that, since U is convex, U = Co(U ). For the purpose of this proof, we need
to make certain functional arguments, previously suppressed, explicit:
Φδ (•, A)
refers to the self-generation map when the feasible payoff space is A (so far, we
have written Φδ (•, U ) as Φδ (•)); Φtδ (A) is obtained through t successive applications
Φδ (•, A), starting with A (so far, we have written Φtδ (U ) as Φtδ ).
Part (i): The proof of part (i) proceeds through a series of three steps.
Step 1: Eδ ⊆ F (Vδ∗ ). Suppose w ∈ Eδ . Then, by definition, w ∈ F (U ), and
w ∈ Vδ∗ . Since Vδ∗ ⊆ U , we have w ∈ F (Vδ∗ ).
41
Step 2: F (Vδ∗ ) ⊆ Wδ∗ (F (U )).
t
By Lemma 6, part (b), Wδ∗ (F (U )) = ∩∞
t=1 Φδ (F (U )). To prove the desired result,
we first establish the following property.
Lemma 8: For any W ⊆ U , we have F [Φδ (W, U )] ⊆ Φδ (F (W ), F (U )).
Proof: Consider any w ∈ F [Φδ (W, U )].
proof of Theorem 7, we know that w ∈ F (W ).
that, ∀u ∈ F (U ),
∃w0
From the argument given in the
Since F (U ) ⊆ U , we also know
∈ W such that wR ((1 − δ)u + δw0 ). Take any such u and
the associated w0 . Select any j such that at least M agents other than j weakly
prefer w to (1 − δ)u + δw0 .
w00 = arg maxw∈H
w
ej .
e
Define H = {w
e ∈ W | w
ei = wi0 for i 6= j}, and let
We claim that w00 ∈ F (W ).
Suppose not.
Then ∃v ∈ W such that v > w00 .
Define v0 as follows: vj0 = vj , and vi0 = wi0 for i 6= j. By free disposal, v 0 ∈ W , and
by construction v 0 ∈ H. But since vj0 > wj00 , this contradicts the definition of w00 .
Since wR ((1 − δ)u + δw0 ), and since only the j-th elements of w0 and w00 differ,
we have wR ((1 − δ)u + δw00 ). But this implies w ∈ Φδ (F (W ), F (U )), as required.
¥
Now we continue with the proof of step 2. By Lemma 8, we know that
¡
¢
F Φ1δ (U) = F (Φδ (U, U )) ⊆ Φδ (F (U ), F (U )) = Φ1δ (F (U ))
¢
¡
t−1
Now assume that F Φt−1
δ (U ) ⊆ Φδ (F (U )). Then
¡
¢
¡
¢
F Φtδ (U ) = F Φδ (Φt−1
δ (U ), U )
¡
¢
⊆ Φδ F (Φt−1
δ (U )), F (U)
¡
¢
⊆ Φδ Φt−1
δ (F (U )), F (U)
= Φtδ (F (U ))
¡ t
¢
t
∗
∞
Thus, by induction, ∩∞
t=1 F Φδ (U ) ⊆ ∩t=1 Φδ (F (U )) = Wδ (F (U )) (where the
equality follows from Lemma 6, part (b)).
¢
¡ t
Next we claim that F (Wδ∗ (U )) ⊆ ∩∞
t=1 F Φδ (U ) .
Combining this with the
conclusion in the previous line gives us F (Wδ∗ (U )) ⊆ Wδ∗ (F (U )); in light of Theorem
1, this implies F (Vδ∗ ) ⊆ Wδ∗ (F (U)), which is what we set out to prove.
To establish the claim, suppose that w ∈ F (Wδ∗ (U )). Then w ∈ Φtδ (U ) ⊆ U for
¡
¢
all t, and w ∈ F (U ) (by Theorem 7). Consequently, w ∈ F Φtδ (U ) for all t, which
¡ t
¢
implies w ∈ ∩∞
t=1 F Φδ (U ) .
42
For any w ∈ Wδ∗ (F (U )), there exists a stationary
Step 3: Wδ∗ (F (U )) ⊆ Eδ .
DCW µ such that µ(h0 ) = w and µ(ht ) ∈ F (U ) for all ht , given the utility possibility
set F (U ). We define another stationary policy program µ0 for the utility possibility set U as follows.
Consider any function z(u) such that, for all u ∈ U \F (U ),
z(u) ≥ u, and z(u) ∈ F (U ); for all u ∈ F (U ), z(u) = u. For any t-history
¡
¢
ht = (u1 , ..., ut−1 ), let Z(ht ) = z(u1 ), ..., z(ut−1 ) , and let µ0 (ht ) = µ (Z(ht )).
Plainly, C µ (h0 ) = (w, w, ...), and the associated normalized payoff is w.
More-
µ0
over, for any ht and u ∈ U , we know by construction that C (ht ) = C µ (Z(ht )),
0
that C µ (ht , u) = C µ (Z(ht ), z(u)), that z(u) weakly Pareto dominates u, and that
C µ (Z(ht ))Rδ (z(u), C µ (Z(ht ), z(u))) (since µ is a DCW). Putting these facts to0
0
gether gives us C µ (ht )Rδ (u, C µ (ht , u)), which means µ0 is a DCW when the feasible
payoff set is U . By construction, it is also UWE.
Plainly, steps 1-3 together imply that Eδ = F (Vδ∗ ) = Wδ∗ (F (U )).
Part (ii): First we claim that, with strict comprehensiveness, F s (U ) = F (U ).
Clearly, w ∈ F s (U ) implies w ∈ F (U ).
w ∈
/ F s (U ).
So assume that w ∈ F (U ).
Then there exists some w0 ∈ U such that w0 ≥ w.
there must also exist some
w00
∈ U with
and thereby establishes the claim.
w00
Suppose
But then
> w, which contradicts w ∈ F (U )
It follows immediately that Eδ = Eδs , and
Wδ∗ (F (U )) = Wδ∗ (F s (U )).
To complete the proof, we need to show that F (Wδ∗ (U )) = F s (Wδ∗ (U )) (which
implies F (Vδ∗ ) = F s (Vδ∗ ) by Theorem 1).
Obviously, F s (Wδ∗ (U )) ⊆ F (Wδ∗ (U )).
From Theorem 7, we know that F (Wδ∗ (U)) ⊆ F (U ), and we have just shown that
F (U ) = F s (U ) with strict comprehensiveness, so F (Wδ∗ (U )) ⊆ F s (U ). Thus, for
any w ∈ F (Wδ∗ (U )) and w0 ∈ U with w0 6= w, we have wi > wi0 for some i̇. Since
Wδ∗ (U ) ⊆ U , this implies that, for any w ∈ F (Wδ∗ (U )) and w0 ∈ Wδ∗ (U ) with
w0 6= w, we have wi > wi0 for some i̇. But then w ∈ F s (Wδ∗ (U )), as desired. ¥
Proof of Theorem 9
Let Λ denote the set of Borel probability measures on X.
For each i, define
θi : Λ → R as follows: θi (λ) = Eλ [ηi (x)] (where Eλ denotes the expectation with
respect to the probability measure λ). The utility possibility set is then given by
U = θ(Λ). Standard arguments imply that U is compact and convex.
We claim first that F (U ) ⊆ η(X).
no x ∈ X with θ(λ) = η(x).
Consider any λ ∈ Λ such that there is
Then there exists some open set Q ⊂ X with
43
λ(Q) ∈ (0, 1). Since ηi is strictly concave, θi (λ) = Eλ [ηi (x)] < ηi (Eλ [x]). But this
/ F (U ), as claimed.
implies θi (λ) ∈
Since U is compact, so is F (U ).
Lemma 6, part (b) therefore implies that
Wδ∗ (F (U )) is compact. Consequently, we can define w0 = arg minw∈Wδ∗ (F (U )) wM .
Since F (U ) ⊆ η(X), we know that w0 = η(y 0 ) for some y 0 ∈ [0, 1].
Now assume that y 0 6= yM .
Without loss of generality, assume that y 0 >
By Lemma 4, part (b), we know there exists w0 ∈ Wδ∗ (F (U )) such that
¡
¢
w0 R δwC + (1 − δ)w0 . Since Wδ∗ (F (U )) ⊆ F (U ) ⊆ η(X), we know that w0 = η(y 0 )
yM .
for some y0 ∈ [0, 1], and we also know that ηM (y0 ) ≥ ηM (y0 ) (by definition of y 0 ),
which in turn implies y0 ≤ y0 . But then, by the single crossing property, we have
wi0 = ηi (y 0 ) ≥ ηi (y 0 ) = wi0 for i ∈ {1, ..., M }. Since all i ∈ {1, ..., M } also strictly
¡
¢
prefer wC ≡ η(yM ) to w0 , this contradicts w0 R δwC + (1 − δ)w0 .
It follows that y 0 = yM , which implies Wδ∗ (F (U )) = {η(yM )}. Since U is
compact and convex, Theorem 8 implies that Eδ = Wδ∗ (F (U )) = {η(yM )}. ¥
Proof of Theorem 10
Since U is convex and compact, Vδ∗ = Wδ∗ (Theorem 1), which allows us to focus
attention on stationary DCWs.
Part (i): Suppose not. Without loss of generality, consider w ∈ Wδ∗ with
wi
1−δ
wi < 1 − δ.
PM
i=1
Consider any u ∈ U and some small ε > 0 with ui =
+ ε for
PM
i = 1, ..., M (choosing ε sufficiently small guarantees i=1 ui < 1). Note that,
for i = 1, ..., M , we have wi < (1 − δ)ui + δwi0 for all w0 ∈ U (since wi ≥ 0).
Consequently, w ∈
/ Φδ (U ) ⊇ Wδ∗ .
Part (ii): First consider the case where δ <
∀i), for any coalition A with |A| = M , we have
M−1
N .
P
With equal division (ui =
ui =
1
N
M
N.
Consequently, for
P
all u ∈ U , there exists some coalition A with |A| = M , such that i∈A ui ≤ M
N.
i∈A
< 1 − δ, so u ∈
/ ∆N
/ Wδ∗ .
δ . Therefore, by part (i), u ∈
¢
£
1
∗
∗
Now consider the case where δ ∈ M−1
N , 2 . Suppose Wδ 6= ∅. Since Wδ is
But δ <
M−1
N
implies
M
N
compact, we can define u0 ≡ mini minu∈Wδ∗ (U ) ui . We claim that all w ∈ Wδ∗ satisfy
P
the following property: ∀A ⊂ {1, ..., N } with |A| = M , i∈A wi ≥ 1 − δ + M δu0 .
P
Consider any w such that, for some A ⊂ {1, ..., N } with |A| = M ,
i∈A wi <
−δu0
+ ε for
Take any u ∈ U and some small ε > 0 with ui = wi1−δ
PM
i = 1, ..., M (choosing ε sufficiently small guarantees i=1 ui < 1). Note that,
1 − δ + M δu0 .
for i = 1, ..., M , we have wi < (1 − δ)ui + δwi0 for all w0 ∈ U (since wi0 ≥ u0 ).
44
Consequently, w ∈
/ Φδ (U ) ⊇ Wδ∗ (U ), which establishes the claim.
Now consider w ∈ Wδ∗ such that wi = u0 for some i. Without loss of generality,
assume w1 = w0 . By the preceding claim, for all A ⊂ {2, ..., N } with |A| = M − 1,
P
i∈A wi ≥ 1 − δ + M δu0 − u0 . Consider A1 , A2 ⊂ {2, ..., N } with |Ak | = M − 1
for k = 1, 2, and A1 ∪ A2 = {2, ..., N }. Then
N
X
wi = w1 +
i=1
X
wi +
X
wi
i∈A2
i∈A1
≥ u0 + 2 (1 − δ + M δu0 − u0 )
= 2(1 − δ) + (2M δ − 1) u0
Since δ < 12 , we have 2(1 − δ) > 1. Since δ ≥ M−1
N , we have 2M δ − 1 > 0. Since
PN
u0 ≥ 0, we have i=1 wi > 1, which implies that w is infeasible — a contradiction.
Part (iii): We claim that, for δ ≥
N−1
N+1 ,
N
N
Φδ (∆N
δ ) = ∆δ . Consider any w ∈ ∆δ
and any u ∈ U . Define zi = wi − (1 − δ)ui . Without loss of generality, assume that
the zi are weakly decreasing in i. Note that
N
X
zi =
i=1
N
X
i=1
wi − (1 − δ)
N
X
ui = δ
(1)
i=1
Since the zi are arranged in descending order, 1 implies
M
X
i=1
zi ≥
M
δ
N
(2)
In addition, we claim that zi ≥ 0 for i = 1, ..., M . Suppose not. Then zi < 0 for
i = M, ..., N . But
N
X
i=M
zi =
N
X
i=M
(wi − (1 − δ)ui ) ≥ 0.
PN
This is because w ∈ ∆N
i=M wi ≥ 1 − δ, while u ∈ U
δ and |{M, ..., N }| = M imply
PN
implies i=M ui ≤ 1. Thus we have a contradiction.
Now construct w0 ∈ ∆N
δ as follows:
(
z
P Mi
(1 − δ) for i = 1, ..., M
0
i=1 zi
wi =
δ
M−1 for i = M + 1, ..., N
P
0
N
0
0
Note also that
Note that N
i=1 wi = 1 and that wi ≥ 0 for all i, so w ∈ ∆ .
P
PM 0
0
i∈A wi ≥
i=1 wi = 1 − δ, and for all other A ⊂ {1, ..., N } with |A| = M we have
45
wM+1 =
δ
M−1
≥
N−1
(N+1)(M−1)
that, for i = 1, ..., M ,
= 1−
N−1
N+1
≥ 1 − δ. Thus, w0 ∈ ∆N
δ . Finally, note
(1 − δ) ui + δwi0 = wi − zi + δwi0
!
Ã
δ(1 − δ)
= wi + zi PM
−1
i=1 zi
¶
µ
N
(1 − δ) − 1
≤ wi + zi
M
≤ wi
where the first inequality follows from 2, and the second inequality follows from
δ≤
N−1
N+1 .
Thus, w ∈ Φδ (∆N
δ ).
N
N
∗
By lemma 5, part (b), Φδ (∆N
δ ) = ∆δ implies ∆δ ⊆ Wδ . Combining this with
N
∗
Wδ∗ ⊆ ∆N
δ (part (i)), we have ∆δ = Wδ . ¥
46
References
[1] Abreu, Dilip (1986), “Extremal Equilibria of Oligopolistic Supergames,” Journal of Economic Theory, 39(1), 191-225.
[2] Abreu, Dilip, David Pearce, and Ennio Stacchetti (1990) “Toward a Theory of
Discounted Repeated Games with Imperfect Monitoring,” Econometrica, 58(5),
1041-1063.
[3] Alesina, Alberto and Dani Rodrik (1994) “Distributive Politics and Economic
Growth,” Quarterly Journal of Economics, 109(2), 465-90.
[4] Banks, Jeffrey S. (1995) “Singularity Theory and Core Existence in the Spatial
Model,” Journal of Mathematical Economics, 24, 523-536.
[5] Banks, Jeffrey S. and John Duggan (2006) “A Social Choice Lemma on Voting
Over Lotteries with Applications to a Class of Dynamic Games,” Social Choice
and Welfare, 26(2), 285—304.
[6] Baron, David P. (1996) “A Dynamic Theory of Collective Goods Programs,”
American Political Science Review, 90(2), 316-330.
[7] Baron, David P. and John A. Ferejohn (1989) “Bargaining in Legislatures,”
American Political Science Review, 83(4), 1181-1206.
[8] Baron, David P. and John A. Ferejohn (1987) “Bargaining and Agenda Formation in Legislatures,” American Economic Review, 77(2), 303-309.
[9] Barron, David P. and Michael Herron (1999) “A Dynamic Model of Multidimensional Collective Choice,” Stanford Graduate School of Business Research
Paper Series, No. 1559.
[10] Bernheim, B. Douglas, Antonio Rangel, and Luis Rayo (2006), “The Power of
the Last Word in Legislative Policy Making,” Econometrica, 74(5), 1161-1190.
[11] Bernheim, B. Douglas and Debraj Ray (1989) “Collective Dynamic Consistency
in Repeated Games,” Games and Economic Behavior, 1(4), 295-326.
[12] Bernheim, B. Douglas and Sita Slavov (2004) “Income Distribution with Majoritarian Politics,” mimeo.
47
[13] Besley, Timothy and Stephen Coate (1997) “An Economic Model of Representative Democracy,” Quarterly Journal of Economics, 112(1), 85-114.
[14] Black, Duncan, The Theory of Committees and Elections, London: Cambridge
University Press, 1958.
[15] Cox, G. (1984) “Non-Collegial Simple Games and the Nowhere Denseness of
the Set of Preference Profiles having a Core,” Social Choice and Welfare, 1,
159-164.
[16] Downs, Anthony, An Economic Theory of Democracy, New York: Harper and
Row, 1957.
[17] Duggan, John, and M. Fey (2006) “Repeated Downsian Electoral Competition,”
International Journal of Game Theory (forthcoming).
[18] Epple, Dennis and Michael H. Riordan (1987), “Cooperation and Punishment
under Repeated Majority Voting,” Public Choice, 55(1-2), 41-73.
[19] Farrell, Joseph and Eric Maskin (1989) “Renegotiation in Repeated Games,”
Games and Economic Behavior, 1(4), 327-60.
[20] Ferejohn, John, Morris Fiorina, and Richard D. McKelvey (1987), “Sophisticated Voting and Agenda Independence in the Distributive Politics Setting,”
American Journal of Political Science 31(1), 169-193.
[21] Filar, Jerzy A., and Leon A. Petrosjan (2000), “Dynamic Cooperative Games,”
International Game Theory Review, 2(1), 47-65.
[22] Glomm, Gerhard and B. Ravikumar (1992) “Public versus Private Investment
in Human Capital Endogenous Growth and Income Inequality,” Journal of
Political Economy, 100(4), 813-34.
[23] Grossman, Gene M. and Elhanan Helpman (1998) “Intergenerational Redistribution with Short-Lived Governments,” Economic Journal, 108(450), 1299-1329.
[24] Kramer, Gerald H. (1973) “On A Class of Equilibrium Conditions for Majority
Rule,” Econometrica, 41(2), 285-297.
48
[25] Krusell, Per and Jose Victor Rı́os-Rull (1999), “On the Size of U.S. Government:
Political Economy in the Neoclassical Growth Model,” American Economic
Review, 89(5), 1156-81.
[26] Krusell, Per and Jose-Victor Rı́os-Rull (1996), “Vested Interests in a Positive
Theory of Stagnation and Growth,” Review of Economic Studies, 63(2), 301331.
[27] Krusell, Per, Vincenzo Quadrini and Jose-Victor Rı́os-Rull (1997), “PoliticoEconomic Equilibrium and Economic Growth,” Journal of Economic Dynamics
and Control, 21(1), 243-272.
[28] Le Breton, M. (1987) “On the Core of Voting Games,” Social Choice and Welfare, 4, 295-305.
[29] Mas-Colell, Andreu, Michael D. Whinston, and Jerry R. Green, Microeconomic
Theory, Oxford: Oxford University Press, 1995.
[30] McKelvey, Richard D. (1976) “Intransitivities in Multidimensional Voting Models and some Implications for Agenda Control,” Review of Economic Theory
12(3), 472-482.
[31] McKelvey, Richard D. (1979) “General Conditions for Global Intransitivities in
Formal Voting Models,” Econometrica, 47, 1086-1112.
[32] McKelvey, Richard D. (1986) “Covering, Dominance, and Institution-Free
Properties of Social Choice,” American Journal of Political Science, 30(2), 283314.
[33] Miller, Nicholas R. (1980) “A New Solution Set for Tournaments and Majority Voting: Further Graph-Theoretical Approaches to the Theory of Voting,”
American Journal of Political Science, 24(1), 68-95.
[34] Miller, Nicholas R. (1977) “Graph Theoretical Approaches to the Theory of
Voting,” American Journal of Political Science, 21(4), 769-803.
[35] Perotti, Roberto (1993) “Political Equilibrium, Income Distribution, and
Growth,” Review of Economic Studies, 60(4), 755-76.
49
[36] Persson, Torsten and Guido Tabellini (1994) “Is Inequality Harmful for
Growth?” American Economic Review, 84(3), 600-621.
[37] Petrosjan, L.A., and N. A. Zenkevich (1996). Game Theory. Singapore: World
Scientific.
[38] Plott, C. (1967) “A Notion of Equilibrium and its Possibility under Majority
Rule,” American Economic Review, 57, 787-806.
[39] Rangel, Antonio (2003) “Forward and Backward Intergenerational Goods: Why
is Social Security Good for the Environment?” American Economic Review,
93(3): 813-34.
[40] Roberts, Kevin (2006) “Condorcet cycles? A model of intertemporal voting,”
Social Choice and Welfare (published online).
[41] Roemer, John (1999) “The Democratic Political Economy of Progressive Income Taxation,” Econometrica, 67(1), 1-19.
[42] Rubinstein, Ariel (1979) “A Note on the Nowhere Denseness of Societies having
an Equilibrium under Majority Rule,” Econometrica, 47, 511-514.
[43] Saari, D. (1997) “The Generic Existence of a Cor for q-Rules,” Economic Theory, 9, 219-260.
[44] Schofield, Norman (1983) “Generic Instability of Majority Rule,” Review of
Economic Studies, 50, 695-705.
[45] Schofield, Norman (1987) “Instability of Simple Dynamic Games,” Review of
Economic Studies, 45(3), 575-594.
[46] Shepsle, Kenneth A. and Barry R. Weingast (1984) “Uncovered Sets and Sophisticated Voting Outcomes with Implications for Agenda Institutions,” American Journal of Political Science, 28(1), 49-74.
[47] Slavov, Sita Nataraj (2006) “Age Bias in Fiscal Policy: Why Does the Political
Process Favor the Elderly?” Topics in Theoretical Economics, 6(1), Article 11.
[48] Slavov, Sita Nataraj (2006) “Public versus Private Provision of Public Goods,”
mimeo, Occidental College.