E SSAYS ON E MPIRICAL D YNAMIC G AMES AND I MPERFECT
I NFORMATION
by
Arvind Magesan
A thesis submitted in conformity with the requirements
for the degree of Doctor of Philosophy
Department of Economics
University of Toronto
c Copyright by Arvind Nathan Magesan (2011)
Essays on Empirical Dynamic Games and Imperfect Information
Arvind Magesan
Doctor of Philosophy
Department of Economics
University of Toronto
2011
Abstract
This thesis collects three papers that study applied problems in economics dealing with dynamic strategic behavior and imperfect information. In the first chapter I study the relationship
between participation in United Nations Human Rights Treaties (HRT), foreign aid receipts and
domestic human rights institutions. I provide empirical evidence that countries with relatively
high HRT participation rates receive more foreign aid. Further, countries with high quality institutions are more likely to participate in HRTs, but that high levels of HRT participation leads to a
decline in the quality of domestic human rights institutions. Based on these findings, I propose
and estimate a dynamic game of HRT ratification. The estimates show that economic factors play
an important role in HRT ratification and that the ratification costs countries incur vary significantly across treaties and country regime types. I use the estimated model to evaluate the effects
of counterfactual policies on HRT ratification decisions, human rights behavior, and the distribution of foreign aid.
The second chapter considers environmental regulation under imperfect information and political constraints. We compare the value of two types of information to a regulator: the cost of
pollution and the profitability of firms in the economy. We find that in environments where small
increases in the losses to regulated firms greatly affect the regulator’s ability to implement the
ii
policy, it is most valuable to learn the types of firms, while it is most valuable to learn the cost of
pollution when small increases in losses are relatively ineffectual.
The third chapter deals with the identification and estimation of dynamic games when players
maximize expected payoffs given beliefs about other players’ actions, but their beliefs may not be
in equilibrium. First, we derive conditions for point-identification of structural parameters and
players’ beliefs, and propose a simple two-step estimation method and sequential generalization
of the method that improves its asymptotic and finite sample properties. We also present a procedure for testing the null hypothesis of equilibrium beliefs. Finally, we illustrate our model and
methods with an application of a dynamic game of store location by retail chains.
iii
Acknowledgement
I thank the three members of my supervisory committee, Victor Aguirregabiria, Matthew
Turner and Carlos Serrano, without whom these pages would be completely empty. I thank Victor
for his persistent support and optimism at every step of the way, and most importantly for teaching me everything I know about conducting and presenting empirical research in economics. I
am forever in debt. I thank Matthew for his encouragement and for teaching me how to properly
write a theory paper in economics. Finally, I thank Carlos for his many insightful comments and
criticisms which helped to shape the first chapter. I also thank the rest of my thesis examination
committee, Jordi Mondria, Junichi Suzuki and especially Matt Shum, for useful comments. I thank
those who taught and supported me during my time at the University of Toronto, especially Li,
Hao and Martin Osborne, from whom I have learned a very great deal. I thank Ben Amsel, Branko
Boskovic, David Byrne, Sacha Kapoor and Ali Emre Konukoǧlu for their help, support and friendship and I also gratefully acknowledge financial support from the University of Toronto and The
Royal Bank Graduate Fellowship in Public and Economic Policy.
In addition to those mentioned above, each chapter has benefitted from the comments of several individuals. “Human Rights Treaty Ratification of Aid Receiving Countries" has benefitted
from the comments of Gustavo Bobonis, Patrick Francois, Ken Jackson, Eik Swee, Wang, Hui, and
participants of the 2010 EARIE Conference in Istanbul, the 2009 Meeting of the Canadian Economics Association as well as seminar audiences at the Universities of Calgary, Houston, New
South Wales, Sydney, Toronto and Western Ontario. “The Value of Information In Regulation" has
benefitted from the comments of Patrick Francois, Sumon Majumdar as well as participants of the
2008 Canadian Public Economics Group Meetings and the 2009 Camp Resources Meetings. “Identification and Estimation of Dynamic Games when Players’ Beliefs Are Not in Equilibrium" has
benefitted from the comments of Dan Bernhardt, Sridhar Moorthy, Katsumi Shimotsu, and seminar participants at Carlos III-Madrid, New York University, UI Urbana-Champaign, the Canadian
Econometric Study Group in Ottawa, and the Econometric Society World Congress in Shanghai.
iv
I thank my parents, Murugesapillai Magesan and Sugi Magesan for a lifetime of support and
encouragement, and for always pushing me to expect more from myself. Lastly I thank Deniz
Çorbacı. Deniz, hem bütün kötü günlerin hem de bütün güzel günlerin her saniyesi benim yanimdaydin, sensiz bunu yapamazdim. Verdiǧin sonsuz destek ve yardim için sana nasil teşekkur
edeceǧimi bilemiyorum.
v
Contents
List of Tables
ix
List of Figures
x
1
Human Rights Treaty Ratification of Aid Receiving Countries
1
1.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.2
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
1.3
Aid, Treaty Ratification and Human Rights . . . . . . . . . . . . . . . . . . . . . . . .
8
1.4
Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2
1.4.1
Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.4.2
Formal Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
1.5
Estimation of the Dynamic Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
1.6
Counterfactual Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
1.7
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
1.8
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
1.8.1
Treaty-Specific Cost Heterogeneity and Observable Treaty Characteristics . . 58
1.8.2
Representation of Choice Probabilities . . . . . . . . . . . . . . . . . . . . . . . 63
The Value of Information in Regulation
66
2.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
2.2
Literature Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
2.3
Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
2.4
Planner’s Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
2.5
Optimal taxes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
vi
3
2.6
The Value of Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
2.7
Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
2.8
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
2.9
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
Identification and Estimation of Dynamic Games When Players’ Beliefs are not in Equilibrium
100
3.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
3.2
Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
3.2.1
Basic Framework and Assumptions . . . . . . . . . . . . . . . . . . . . . . . . 105
3.3
Identification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
3.4
Estimation and Inference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
3.5
3.6
3.4.1
Estimation with nonparametric payoff function . . . . . . . . . . . . . . . . . 118
3.4.2
Estimation with parametric payoff function . . . . . . . . . . . . . . . . . . . . 120
3.4.3
Test of Equilibrium Beliefs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
Empirical Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
3.5.1
Data and descriptive evidence . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
3.5.2
Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
3.5.3
Estimation of the structural model . . . . . . . . . . . . . . . . . . . . . . . . . 135
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
References
141
vii
List of Tables
1.1
Summary Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
9
1.2
Sample Quartiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.3
Foreign Aid Receipts: Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . 17
1.4
Human Rights-Democracy Episodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.5
Number of Different Regimes Visited in Country History . . . . . . . . . . . . . . . . 21
1.6
Human Rights: Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
1.7
Treaty Capital: Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
1.8
Treaty Decision: Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
1.9
Structural Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
1.10 Structural Parameter Estimates : Treaty Specific Costs . . . . . . . . . . . . . . . . . . 48
1.11 Structural Parameter Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
1.12 Human Rights Treaties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
2.1
Distribution of probability over states . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
2.2
Information partition Ωθ induced by information about firms . . . . . . . . . . . . . 75
2.3
Information partition Ωc induced by cost information . . . . . . . . . . . . . . . . . . 77
2.4
Optimal taxes in each possible state . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
2.5
Optimal taxes when the planner knows firm types only . . . . . . . . . . . . . . . . . 80
2.6
Optimal taxes when social cost is known . . . . . . . . . . . . . . . . . . . . . . . . . 80
2.7
Optimal taxes in each possible state . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
2.8
Optimal taxes when planner knows who has political power . . . . . . . . . . . . . . 98
2.9
Optimal taxes when planner knows social cost of pollution . . . . . . . . . . . . . . . 98
2.10 Optimal taxes when planner knows firm types . . . . . . . . . . . . . . . . . . . . . . 99
viii
2.11 Optimal taxes in each possible state . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
2.12 Optimal taxes when profit is known . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
2.13 Optimal taxes when the planner knows firm cost types only . . . . . . . . . . . . . . 99
3.1 Descriptive Statistics on Local Markets (Year 1991) . . . . . . . . . . . . . . . . . . . . . 125
3.2 Evolution of the Number of Stores . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
3.3 Transition Probability Matrix for Market Structure . . . . . . . . . . . . . . . . . . . . . 127
3.4 Reduced Form Probits for the Decision to Open a Store . . . . . . . . . . . . . . . . . . 127
3.5 Estimates from Myopic Game of Entry for McDonalds and Burger King Under the
Assumption of Equilibrium Beliefs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
3.6 Estimates from Dynamic Game of Entry for McDonalds and Burger King Under the
Assumption of Equilibrium Beliefs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
3.7 Estimated Bias in BK Beliefs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
ix
List of Figures
1.1
Argentina and Chile Treaty Capital
. . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.2
Kenya and Chad Treaty Capital . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.3
Uganda and Chad Treaty Capital . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.4
Ratification Cost as a Function of Human Rights Institutions . . . . . . . . . . . . . . 45
1.5
Frequency Distribution of Country Specific Treaty Costs . . . . . . . . . . . . . . . . 51
1.6
Number of Ratified Treaties Under Factual and Counterfactual Equilibria . . . . . . 54
1.7
K Under Factual and Counterfactual Equilibria
1.8
Number of Ratified Treaties Under Counterfactual Equilibria with No Heterogeneity 55
1.9
Argentina and Somalia Treaty Capital . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
. . . . . . . . . . . . . . . . . . . . . 54
1.10 Argentina and Somalia ln(Aid) Receipts . . . . . . . . . . . . . . . . . . . . . . . . . . 58
1.11 Argentina Treaty Capital and ln(Aid) Receipts . . . . . . . . . . . . . . . . . . . . . . 59
1.12 Somalia Treaty Capital and ln(Aid) Receipts . . . . . . . . . . . . . . . . . . . . . . . . 59
1.13 Reservations/Objections per Ratifier and Treaty Costs . . . . . . . . . . . . . . . . . . 60
1.14 Treaty Costs and Words per Article . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
x
Chapter 1
Human Rights Treaty Ratification of Aid Receiving Countries
1.1
Introduction
With the horrors of two catastrophic world wars fresh in its collective memory, the United Nations
General Assembly adopted the Universal Declaration of Human Rights (UDHR) on December 10,
1948. In the years since, the UDHR has formed the basis of international human rights norms
and their formal legal embodiment in the United Nations Human Rights Treaties (HRT). The
ratification and in many cases subsequent integration of HRT’s into domestic law by countries
from all over the political and cultural spectrum is considered one of the great achievements of
the international community in the post Second World War era.
At the same time, it is natural to ask why countries ratify HRT’s at all. First, formal enforcement mechanisms generally do not exist. Countries party to a treaty can in practice violate its
terms without formally specified punishment. This stands in contrast to treaties in trade and the
environment, where monitoring is possible and non-compliant parties can be punished. Second,
the benefits to ratifying a human rights treaty are not immediately clear. Treaties in trade and the
environment solve a collective action problem by clearly stipulating and enforcing the rules of the
game, and the mutual benefit associated with treaty participation is clear and tangible. When a
country ratifies an HRT, on the other hand, it agrees not to take actions that affect its own citizens. It is not obvious what one country has to gain by agreeing to limit domestic behaviour while
receiving unenforceable promises from other parties to do the same.1
1 To the best of our knowledge this is the first study in the economics literature to address the question of why countries commit to human rights institutions. However a small but growing literature in political science and international
law, triggered by Hathaway (2003) and Hathaway (2007), have studied the question of why autocratic countries often
participate in HRTs more frequently than democratic countries. Von Stein (2005) studies the extent to which HRTs actually act as a constraint on treaty participant behaviour, and considers the possibility that HRTs simply select countries
that are ex-ante compliant as participants and thus do not constrain behaviour at all.
1
Despite the seemingly inconsequential nature of HRT’s, we observe ratification throughout the2
history of the UN, and perhaps more importantly, substantial and persistent variation in the timing
and frequency of ratification across countries and treaties. Exploiting a panel of 83 foreign aid receiving countries over the years 1973-2001, we investigate who ratifies when and why. Specifically,
we examine the role of economic, political and strategic incentives in the decision to ratify HRT’s
at the UN. Acknowledging that global human rights standards are not constant and evolve over
time, our measure of a country’s treaty participation is always relative to the treaty participation
of other countries at that point in time.
We approach the problem in two stages. In the first stage we utilize dynamic panel data methods to study the relationship between aid, human rights institutions, and human rights treaty
participation. In the first set of results we find that, conditional on domestic human rights institutions, high levels of relative treaty participation has a significant and positive effect on foreign
aid receipts. This result is robust to the inclusion of many political and economic control variables, and to the presence of unobserved country and time heterogeneity. By contrast, the quality
of domestic human rights institutions themselves do not have a significant effect on aid receipts
once we control for other political and economic variables. These findings are a contribution to
the literature on the determinants of foreign aid, at the forefront of which is Alesina and Dollar
(2000) and Alesina and Weder (2002), as well as the literature on the determinants of HRT participation (i.e., Hathaway (2007)). We rationalize this set of results by arguing that aid donors use
HRT
participation as a criteria to distribute foreign aid among recipients, and actually prefer this
“nominal" measure instead of a “real" measure of human rights behavior.
2
Further, we find that countries with high quality human rights institutions tend to participate
more frequently in HRT’s relative to other countries, but that the converse is not true; countries
2 There are several possible reasons aid donors prefer to use HRT participation as the criteria for making aid allocation decisions. For example, the actual recipient of aid, usually the current government of the receiving country, is
often constrained by other domestic political forces (i.e., the military) in affecting immediate change in actual human
rights institutions. Ratifying a treaty is a task that is not only simpler, but often under the government’s direct control.
Additionally, there is often disagreement among outside actors about the quality of a country’s human rights institutions, and further, about how to interpret any given measure and compare it across countries. For this reason, when
a donor needs to justify to its domestic polity its decision of who to allocate aid to, HRT participation may be a more
favorable statistic to point to.
3
with a higher propensity to ratify HRT’s actually experience a subsequent decline in their human
rights institutions. The first finding is intuitive. The costs of complying with the terms of a treaty
will be lower for a country that is already compliant at the time of ratification than for a noncompliant country. Compliant countries have a higher net benefit to ratifying, and ratify more
frequently. The second result, that HRT participation causes a decline in domestic human rights
institutions, is somewhat surprising. Previous studies in political science (i.e., Hathaway (2007))
find a similar result, and argue that countries under scrutiny by the international community for
their practices may use HRT participation to take the “spotlight" off their real behavior. Once the
spotlight is off, so too is the pressure for real improvements to domestic human rights institutions,
and HRT participation may be followed by a decline in real human rights institutions. Finally, we
show that high rates of relative participation in previous years has a strong negative effect on the
country’s current ratification decision. Or, viewed in another light, the probability a country ratifies a treaty increases as countries in the rest of the world increase their relative treaty participation.
This finding is consistent with the hypothesis that countries are strategic and forward-looking in
their decision to ratify an HRT. Countries are strategic in the sense that other countries’ treaty
participation decisions affect their own decision, and countries are forward looking in the sense
that their own past decisions influence current decisions.
These empirical results, while providing new insight regarding the determinants of foreign
aid and the relationship between foreign aid, international human rights treaties, and domestic
human rights institutions, raise three interesting questions that we study here. First, motivated
by the empirical relationship between relative treaty participation and aid receipts, we investigate how much of the patterns of behavior we observe are explained by the economic benefits
to ratification. In other words, is increased foreign aid receipts the motivation behind ratification? Second, finding that treaty participation is rewarded by aid raises the question, why don’t
all countries ratify all available treaties immediately? Treaty ratification must be costly in some
dimension. We explore the sources of the costs of treaty ratification, as understanding where these
costs come from and the role they play in the participation decisions of countries is of interest
4
from a policy perspective. Third, it is clear also that there is significant heterogeneity in participation rates across countries, suggesting that there is heterogeneity in the net benefit to participation
across countries. One other question of interest is, what specifically is driving this heterogeneity
in net benefit?
Absent a theoretical model of treaty ratification, these issues are difficult to investigate. As
such, to further understand how economic and strategic factors drive country behavior, in the
second part of the paper we develop and estimate a structural dynamic game of treaty ratification.
The model we consider is analogous to a model of quality competition in an oligopolistic industry
(eg., Pakes and McGuire (1994)). In our model countries compete for foreign aid by ratifying
costly HRTs in the same way firms compete for consumer demand by investing in costly product
quality improvement. Donor countries make resource allocation decisions based on the relative
treaty participation of recipient countries. Recipient countries “invest" in the quality they offer
donor countries by ratifying treaties at a cost. These investment costs are allowed to vary by
country, treaty and by the type of domestic institutions in place. We estimate the investment costs
as well as the structural parameters of the ratification benefit function. Our first finding is that the
economic benefit to ratification that we found evidence for at least partly explains the ratification
decision. Second, there are significant permanent differences in ratification costs across countries,
and moreover countries with good human rights institutions and democratic institutions in place
find it less costly to participate, all else equal. Finally, we find significant variation in costs across
treaties. We relate this cost variation to variation in institutional characteristics of the treaties,
in particular the verifiability of the treaty terms, and discuss the implications of this finding for
institutional design.
We then use the estimated model to consider several counterfactual experiments. In the first
set of experiments we quantify the relative importance of foreign aid receipts and human rights as
motives in the treaty ratification decision. We find that the average rate of participation drops by
33% when we decrease the aid motive by 10%, confirming that the economic returns to ratification
we find evidence of in the first part of the paper are a significant part of the reason countries ratify
5
in the first place. In the second experiment we investigate the importance of regime heterogeneity in explaining the observed heterogeneity in country ratification behaviour. Existing studies
on HRT ratification emphasize variation across regimes in the “cost of commitment" (Hathaway
(2007)) to treaty terms as the primary source of heterogeneity in ratification decisions. As we allow for heterogeneity in both the economic benefits and cost of treaty ratification, estimating a
structural model lets us compare the relative importance of these two sources of heterogeneity in
explaining ratification behavior. While both sources of heterogeneity are important, we find that
heterogeneity in the benefits to ratification explain a more significant proportion of the variation
in behavior.
As well as contributing to the literatures on the determinants of foreign aid and human rights
treaty participation, this paper contributes to the nascent literature on the estimation of dynamic
structural models in political economy. Several recent key developments in the estimation of dynamic games, in particular Aguirregabiria and Mira (2007), Bajari, Benkard and Levin (2007) and
Pakes, Ostrovsky and Berry (2007), have allowed applied researchers to address questions that,
for mainly computational reasons, were previously unanswerable. But with the exception of the
pioneering work of Merlo (1997) and Diermeier et al (2003) who develop and structurally estimate
a dynamic bargaining model of government formation, and, more relevant to the present study,
Wagner (2008), who estimates a dynamic game of environmental treaty ratification, the tools of
structural dynamic game estimation, and especially the most recently developed tools, have yet
to be applied to questions in political economy. We aim to contribute by bridging the gap between
the most recent methodological developments in empirical industrial organization and questions
of interest in the field of political economy.
1.2
Data
As our study is targeted at the developing world, we restrict ourselves to the countries of Latin
America, Africa, Asia and the Middle East. We consider every country that existed on these con-
6
tinents at any point during the period 1973-2000, was a member of the United Nations, and has a
population of at least 500,000.3 We exclude countries that are major oil exporters throughout the
time horizon of the sample, as these countries were largely non-aid recipients during this time.
Altogether this sample contains 83 countries and 2043 country-year observations.
Treaty ratification dates come from the United Nations Treaty Collection.4 We restrict consideration to treaties that opened after the founding of the United Nations in 1945 (See table 1.12 in the
data appendix for the full list of treaties).
5
From these data we construct all variables related to
ratification. There are dozens of treaties in the collection, many pertaining to human rights related
issues.6 While chapter IV of the treaty collection is titled “Human Rights," many other treaties in
other chapters have an important (often predominantly) human rights dimension, and we thus
do not restrict our attention only to treaties from this chapter. The set of treaties we select is broad
enough that basic human rights (torture, political killings etc.), property rights, civil rights (religious and political freedoms), and emancipatory rights (worker rights, discrimination) are each
considered explicitly by at least one of the treaties in the data set. We consider seven of the eight
“core" human rights treaties,7 as well as eight other treaties considered human rights treaties by
the Encyclopedia of Human Rights (1996).
The data on country level democracy over time comes from the Polity IV data set (Marshall and
Jaggers, 2004). Each of the democracy and autocracy indexes in the Polity data set, which range
from 0-10 (0 being the lowest level of democracy (autocracy) and 10 being the highest level of
democracy (autocracy)) are composites of other political variables. First, democracy is conceived
3 Much of the data we use, most importantly the Polity IV data set, does not provide values for countries with a
population below 500,000. The human rights data starts in 1973.
4 Available at http://treaties.un.org/Pages/Home.aspx?lang=en
5 While treaties dealing with substantive human rights issues did exist in the pre-world war II era, the concept of
international human rights law is widely considered to have been a product of the founding of the UN .
6 There are other international governmental organizations (IGO’s) that have as part of their raison d’etre a well
established treaty regime that is valid under international law. A prominent example is the International Labour Organization (ILO). While we could have included treaties from these organizations in our sample, doing so would have
raised additional problems. For example, there are countries that are members of the United Nations and not the ILO.
Thus there are countries that are “players" in the ratification game associated with UN treaties, but not ILO treaties.
Further, treaty rules vary across organizations. In sum, the nature of the treaty ratification game differs across IGO’s,
and we wish to minimize these differences.
7 We are unable to consider the last of these treaties, the Convention on the Rights of Persons with Disabilities (in
force 3 May 2008), because it opened too late with respect to the scope of our data set.
7
as the composite of three things: the degree to which citizens can freely express preferences over
political leaders and policies, the constraints on the exercise of power by the executive, and the
guarantee of civil liberties to citizens. Autocracy on the other hand is determined by how sharply
political participation and competition is restricted, and how freely the executive, once selected,
exercises power. We follow the literature and use the difference between these two scores (the
Polity Composite Index) as our measure of a country’s level of democracy. Our measure of political stability comes from the “durable" variable in the Polity IV data set. This variable simply
measures the number of years since a major political regime change in the country. Any missing
data from the polity data set is imputed using the suggestions of the authors. The measure of trade
openness we use is the Sachs and Warner (1996) trade openness indicator (updated by Wacziarg
and Welch (2003)). This is a binary variable which designates a country as “closed" if one of five
policy criteria are met: the country has an average tariff rate greater than 40%, non-tariff barriers
cover more than 40% of imports, there is a state monopoly over major exports, the country has a
socialist economic system, or there is a black market exchange rate premium greater than 20%.
Country GDP data is measured in thousands of international Geary-Khamis dollars, and is
taken from Maddison (2003). Population data is also taken from Maddison (2003). The data on
foreign aid comes from the OECD Overseas Development Assistance (net ODA). We consider
both Total Net ODA data itself, as well as (as a robustness check) the Net Aid Transfers (NAT)
data set constructed by David Roodman (2005). While this data is constructed from the same
underlying OECD Overseas Development Assistance (net ODA) data, the NAT data corrects for
two sources of concern with the original net ODA data among practitioners. First, Net ODA
does not account for interest payments paid on past loans by recipients to donors, it is only net
of principle payments received on past ODA loans. NAT is net of both principle and interest
payments. Second, NAT does not account for cancelation of old non-ODA loans, while the ODA
does. The data is in billions of constant 2005 US dollars. While DAC members (aid donors) change
over time, the countries that are members at some point during the time horizon of our sample are
Australia, Austria, Belgium, Canada, Denmark, Finland, France, Germany, Greece, Ireland, Italy,
8
Japan, Luxembourg, Netherlands, New Zealand, Norway, Portugal, Spain, Sweden, Switzerland,
United Kingdom, United States, and the Commission of the European Communities.
Lastly, the human rights data comes from Freedom House’s Comparative Survey of Freedom.
We specifically consider the “Civil Liberties" variable, which rates countries on a scale from 17 according to a checklist (see above) of civil freedoms and rights. The measure is essentially a
grade given to each country each year based on the number of positive answers to a checklist of
fourteen questions. These include (Gastil, 1990): Free media, Open Public Discussion, Freedom
of Assembly and Demonstration, Freedom of Political Organization, Nondiscriminatory Rule of
Law in Politically Relevant Cases, Freedom from Political Terror and Imprisonment, Free Trade
Unions and Peasant Organizations, Free Businesses/cooperatives, Free Professional/Private Organizations, Free Religious Institutions, Personal Social Rights, Socioeconomic Rights, Freedom
From Gross Socioeconomic Inequality, Freedom From Gross Government Indifference or Corruption. Crucially, there is significant overlap between the realms of human rights behavior measured
by the civil liberties variable we use, and the subjects covered by the human rights treaties that
comprise the treaty capital variable (see the appendix for a list of the treaties used in this paper).
Summary statistics for the key variables are provided in table 1.1. We include a variable measuring the rate of treaty participation in the table. This variable measures, for each country at each
point in time, the ratio of the number of treaties it is a member of to the number of treaties open
for ratification. This variable will be key in the analysis below and we discuss it in much more
detail in the next section.
1.3
Aid, Treaty Ratification and Human Rights
In this section we use dynamic panel data techniques to examine the relationship between foreign
aid receipts, human rights treaty ratification and domestic human rights institutions. First, we
establish that aid receiving countries that ratify HRTs more frequently than other aid recipients
experience an increase in foreign aid receipts, but that the quality of domestic human rights in-
9
Table 1.1: Summary Statistics
Variable
Aid per capita
Obs
2043
Mean
51.83335
Std. Dev.
52.73943
Min
-19.45861
Max
459.4547
Measurement Units
2006 US dollars
Human Rights
2043
3.399902
1.5162
1
7
Ratification Rate
2043
0
2.561963
-8.246575
6.5
Discrete Index
Ranging from 1 to 7
Continuous Index
per capita
2043
2.406353
2.109192
.218
12.34
Political Stability
2043
13.8605
14.36674
0
81
Democracy
2043
-1.181596
6.695256
-10
10
Openness
2043
.3220754
.4673861
0
1
ln(Population)
2043
16.25582
1.42699
13.18815
20.95647
GDP
Thousands of “Geary-Khamis"
International Dollars
Number of Years
Discrete Index
Ranging from -10 to 10
Binary Indicator
Natural Log of Number of People
stitutions themselves are inconsequential for aid receipts. We then discuss these findings in the
context of the foreign aid literature. Second, we find that countries with relatively good human
rights institutions in place participate in HRT’s more frequently but that the converse is not true.
Countries with high levels of predetermined treaty participation experience a decline in human
rights institutions. Finally we provide some evidence that countries are strategic and forward
looking in their decisions to participate in HRT’s, and in tying all of these results together, motivate the structural model of treaty ratification that we develop in the following section.
It is useful to conceptualize human rights treaty ratification as a lumpy investment decision.
At any point in time there are several treaties open for ratification by a country, as new treaties
appear throughout the history of our sample (see data section below). Countries rarely ratify
many treaties at once, and instead decisions are spaced over time. As with any other investment
decision, there is a “capital stock" associated with ratification, and we imagine that as countries
ratify treaties they accumulate capital. We now describe the measure of treaty capital that we use
in our analysis.
10
Indexing countries by i, the year by t, and treaties (in chronological order) by r, the “stock" of
treaties ratified by country i before year t is given by:
Tit =
∑ xirt
(1.1)
r
where xirt is a binary variable taking the value 1 if country i has ratified treaty r at any year before
t and 0 otherwise. While Tit clearly measures a country’s treaty participation, it does not suit our
purposes for one important reason. We would like to understand how treaty participation effects
foreign aid receipts. A “high level" of participation is a moving target, however. New treaties
appear regularly, and more importantly, global human rights standards change over time, and
what may have been perceived as a high level of treaty participation in 1970 is not likely to be
regarded as such in 1995. As such, we define:
Kit = Tit −
∑ j6=i Tjt
Ct−1 − 1
(1.2)
where Ct is the number of aid receiving countries in the world at time t.
Kit represents the cumulative number of human rights treaties ratified by country i relative to
the average cumulative number of ratified treaties by other aid receiving countries. As a country
remains idle and other countries ratify, the country’s stock depreciates. The measure Kit is particularly effective in that it allows for depreciation of the capital stock, while also making transparent
in the context of our model how recipient countries compete with one another for economic attention from the more developed world.
It is useful to illustrate some ways in which the path of treaty capital can differ across countries,
and the consequent issues for estimation. Figure 1.1 displays the capital paths over the duration
of the sample for Argentina and Chile, figure 1.2 displays the paths for Kenya and Chad, and
figure 1.3 displays the paths for Uganda and Chad. We purposefully select countries that are
within close geographical proximity to one another to “control" for regional differences. In figure
11
1.1 it is clear that each country has a period of heavy investment (Argentina in the mid 1980’s,
Chile in the Early 1970’s), and each has long periods characterized by stagnation and decline.
But most interesting is that each country’s heavy investment comes at a time when the other’s
treaty capital is well into a period of decline. The striking feature in figure 1.2 is that, while
both country’s investment dynamics are similar, the treaty capital stock of Kenya is persistently
higher than the stock of Chad up until the last few years of our sample. This suggests there is
important permanent country level heterogeneity in ratification patterns as well. Finally in figure
1.3 we see that the investment dynamics of Uganda and Chad track one another fairly closely for
many years, but most importantly, in 1995 both countries experience a significant upward spike in
investment. This suggests the possibility of region and time specific heterogeneity in behavior. As
we discuss below, these pictures are very useful both in terms of understanding what is required
for identification of the effects of interest, as well as for interpreting results.
Regression Analysis
In this section we establish and analyze the empirical relationships between foreign aid, human rights practices and HRT participation. Our two equations of primary interest are given by:
ait = γaa ait−1 + γha hit−1 + γka Kit + β a zit−1 + uita
(1.3)
hit = γah ait−1 + γhh hit−1 + γkh Kit + βh zit−1 + uith
(1.4)
where: ait is foreign aid receipts per capita8 for country i at time t, hit is the quality of human
8
While in principle one could consider total foreign aid receipts instead of foreign aid receipts per capita as the explained variable in equation 1.3, we prefer per-capita aid receipts for two reasons. First, we are interested in discussing
our results in the context of the existing literature on the determinants of foreign aid, in particular the landmark work
of Alesina and Dollar (2000). As the literature has almost exclusively considered per capita foreign aid (or the log of
per capita foreign aid), to facilitate comparison we do so as well. Second, as there are several country-year observations where foreign aid receipts are zero or negative (aid receipts are measured as negative if the amount of principal
a recipient repaid on earlier loans exceeded the amount of aid it received during the year, or if there was a recovery on
grants or other unused funds previously reported as disbursed) the distribution of total aid receipts is much less “wellbehaved" (from an econometric point of view) than the distribution of per-capita foreign aid receipts. Total aid receipts
are typically either zero or a very large number, where as per-capita foreign aid has a more smooth distribution. In
principle we could use a log to compress the total foreign aid receipts variable, but will involve some form of artificial
transformation of the data. For example, we would have to add some constant positive number (greater than 1, since
aid can be negative) to each observation before a log transformation to avoid having to take the log of a number less
than or equal to zero.
12
Figure 1.1: Argentina and Chile Treaty Capital
Figure 1.2: Kenya and Chad Treaty Capital
Figure 1.3: Uganda and Chad Treaty Capital
13
rights institutions in country i at time t, and Kit is the treaty capital of country i at time t. zit is a
j
vector of covariates and uit is the unobserved error term. We assume the following structure for
j
uit ,
j ∈ { a, h}:
j
j
j
j
j
uit = ωi + δt + νr(i),t + ũit
j
(1.5)
j
j
where ωi and δt are country and time fixed effects respectively, and νr(i),t is a region-time fixed
j
effect (the function r (i ) simply selects the region of country i). We thus explicitly allow uit to be
correlated across time and across countries but maintain the usual assumption
j
E ũit | ait−1 , hit−1 , Kit , zit−1 = 0.
(1.6)
We consider the hypothesis that treaty capital Kit Granger causes foreign aid receipts ait and
human rights practices hit . Formally, HRT Granger causes aid receipts and human rights practices
if:
E( ait |Kit , Jt−1 ) 6= E( ait | Jt−1 )
(1.7)
E(hit |Kit , Jt−1 ) 6= E(hit | Jt−1 )
(1.8)
where Jt−1 contains information on past aid, human rights institutions and all other explanatory
variables except Kit (Wooldridge, 2008). In words, these equations say that past treaty behavior
of a country is still informative for current foreign aid and human rights institutions even after
conditioning on past aid, human rights institutions and the other explanatory variables. While we
readily admit that establishing Granger causality is a less ambitious task than establishing strict
causality, we favor identifying the effect of past HRT participation for two reasons. First, identifying a strict causal relationship between variables with complicated contemporaneous feedback
effects in the absence of a valid instrument is not a realistic objective. Second, the timing of political events, particularly with respect to foreign aid giving, is possibly better captured through a
14
model with lagged explanatory variables rather than contemporaneous ones. The OECD aid data
that we consider here measures actual disbursements of aid. Seeing as how most OECD countries
make donation decisions in the fiscal year prior to the year of actual disbursement, aid donation
decisions themselves are lagged by one year (Kuziemko and Werker, 2006). Then if we are trying
to determine how treaty capital investment influences donor aid decisions, it is more correct to
consider current aid disbursement as a function of the previous year’s treaty capital stock.
For completeness, in the regression analysis we also consider the equation:
Kit+1 = γak ait−1 + γhk hit−1 + γkk Kit + βk zit−1 + uitk
(1.9)
While understanding the dynamic relationship between aid receipts, human rights behavior
and HRT participation is a stated goal of this study, and thus considering equation 1.9 together
with equations 1.3 and 1.4 is logical from the point of view of completeness, we should be cautious in interpreting the estimates of equation 1.9. Recall that Kit measures the HRT participation
of country i against all other countries in the world at year t, not simply the decision of country i
at year t. Clearly, Kit will largely be determined by variables other than political and economic
variables of country i. This is part of the motivation for the structural model of treaty ratification
decisions that we develop and estimate later, which fully incorporates the strategic and forward
looking behavior of countries. However, considering equation 1.9 now does help guide the structural model because it allows us to get a better idea of the possible mechanism underlying the
data generating process. Further, to try and address the shortcomings associated with estimating
equation 1.9, we also consider a linear probability model of country i’s ratification decision itself
below.
To aid in the interpretation of the results below, we include in table 1.2 the quartiles of the three
endogenous variables, aid per capita, quality of human rights institutions and HRT participation,
as well as the country-year identity corresponding to each quartile.
Foreign Aid Receipts
15
Table 1.2: Sample Quartiles
Minimum
Aid Receipts/Capita
-19.46 (Malaysia - 1996)
Human Rights Institutions
1.0 (Angola - 1978)
Participation
-8.25 (South Africa - 1994)
25%
14.47 (Indonesia - 1981)
2.0 (Albania - 1995)
-1.69 (Thailand - 1974)
50%
37.34 (The Gambia - 1996)
3.0 (Guinea- 1992)
0.20 (Zimbabwe - 1997)
75%
68.58 (Rwanda - 1981)
5.0 (El Salvador -1993)
1.79 (Venezuela - 1988)
459.50 (Mauritania - 1976)
7.0 (Uruguay - 2001)
6.50 (Philippines - 1996)
Maximum
HRT
We interpret equation (3) as an aid supply function that describes the decision process of aid donor
countries. Our primary interest here is in uncovering how the supply of aid responds to human
rights treaty participation and human rights behavior in recipient countries, holding fixed other
variables that may effect HRT participation, human rights institutions and aid receipts. We are
not the first to try to disentangle the effects of different economic and political variables on aid
receipts. The studies by Alesina and Dollar (2000) and Alesina and Weder (2002) are particularly
relevant to ours, and we use their findings to guide our analysis. Alesina and Dollar (2000) find
that aid flows are as much dictated by strategic and political interest as they are by the economic
conditions and performance in recipient countries. While we do not directly control for strategic
importance and political importance of the recipient to the donor(s), the inclusion of country specific fixed effects is useful in this regard. If strategic importance is relatively time invariant, at least
over the time horizon of our sample, country specific fixed effects allow us to control for the average strategic value of the recipient across the donors (OECD members). Alesina and Dollar also
find that aid donors reward democratization. As many episodes of democratization are accompanied by increased participation in HRTs as well as improvements in human rights institutions,
democracy is an important control variable in our analysis. Alesina and Weder (2002) examine
the relationship between foreign aid and domestic corruption. They find no evidence that less
corrupt governments receive more aid (per capita) than more corrupt governments. Similar to
16
Alesina and Weder (2002) and in contrast to Alesina and Dollar (2000), we focus specifically on a
relatively narrow set of institutions, and try to discern whether, after controlling for other determinants of aid flows, countries with one type of institutions are more likely to receive aid than
countries without them. Table 1.3 displays the parameter estimates of equation (3).
Column 2
.6737234*
(.0362061)
-.3595292
(.5269335)
1.235768*
(.3781004)
YES
NO
NO
.8272
2042
Column 3
.6354525*
(.0375123)
.9460418
(.5122686)
1.490251*
(.3797569)
YES
YES
NO
.8364
2042
Column 4
.6295216*
(.0377596)
1.241887*
(.5183479)
1.438282*
(.3830961)
YES
YES
YES
.8392
2042
.6116126*
(.1664197)
YES
YES
YES
.8402
2042
Column 5
.6207469*
(.0382736)
-.3084337
(.6649017)
1.380379*
(.3811949)
Column 6
.615328*
(.0380997)
-.1835161
(.6735464)
1.311195*
(.3787861)
-2.145677*
(1.092854)
.0182174
(.0755765)
.6137618*
(.169889)
-4.218408*
(1.806804)
.2193509
(12.22033 )
YES
YES
YES
.8412
2042
Notes: Standard errors estimated using Newey-West (allows for autocorrelation and heteroskedasticity)
Country FE
Time FE
Region*Time FE
R2
N
popit−1
openit−1
democit−1
Stabilityit−1
gdpit−1
Kit
hit−1
ait−1
Column 1
.8938801*
(.0163584)
-.2783102
(.3047436)
-.0533879
(.1508015)
NO
NO
NO
.8041
2042
Table 1.3: Foreign Aid Receipts: Parameter Estimates
Column 7
.8187365*
(.0215647)
.303654
(.4906844)
.1241684
(.1527717)
-1.294062*
(.2988315)
.0365486
(.0323007)
.0858431
( .1120373)
-2.520494*
(.9867822)
-3.913687*
(.496932)
NO
NO
NO
.8127
2042
17
18
We include estimates of the regression model with and without fixed effects for illustrative
purposes. The importance of accounting for permanent and unobserved country level heterogeneity in cross country studies is made clear by Acemoglu et al’s (2008) study of the effect of
income on democracy. Permanent and unobserved country level factors simultaneously cause
many country level outcomes of interest, and since pooled OLS only yields consistent estimates of
the parameters of interest if there is no correlation between time invariant country characteristics
and the explanatory variables, not accounting for permanent unobserved heterogeneity can result
in significantly biased estimates. As an example in our context, Argentina is one of the few countries to have ratified every treaty in our sample, and throughout the history of our sample has a
larger stock of treaty capital than Somalia (see figure 1.9 in appendix). However, Somalia annually
receives more aid per capita than nearly every country in our sample, and certainly much more
than Argentina at any point in time ( figure 1.10 in the appendix). By comparing across the two
countries we may conclude that past human rights treaty ratification results in a decrease in foreign
aid. But few would dispute that there is something intrinsically different and time-invariant (at
least over the duration of our sample) between these countries that potentially explain both aid
receipts and human rights treaty participation. One should look at the relationship between aid
and human rights within a country over time to get a clear picture of how aid responds to treaty
participation and human rights behavior (figures 1.11 and 1.12 in the appendix). Controlling for
country specific heterogeneity in the regression model allows us to do precisely this.
For the purposes of this paper, the key result from this section is that, once we control for
country-specific unobserved heterogeneity HRT participation has an economically and statistically
significant positive effect on per capita foreign aid receipts. Looking at the first two columns of
table 1.3, the estimates confirm the message in figures 1.9 - 1.12. In column 1 we regress current
aid receipts on lagged aid, human rights institutions and treaty capital, and in the second column
we add only country level fixed effects to the regression in column 1. In column 1, the only
statistically significant variable is lagged foreign aid receipts. If there is any relationship at all,
treaty capital negatively effects aid receipts. Once we allow for the presence of permanent country
19
level heterogeneity however, we see that the estimated effect of treaty capital becomes positive
and statistically significant. To quantify the economic significance of the effect, the estimate in
column 2 suggests that if country i were to ratify one more treaty, holding the behavior of all
other countries in the world constant, in the short-run country i would receive 1.24 dollars more
of aid per capita. As the model is dynamic, it makes sense to also consider the long run effect
of ratifying an additional treaty. By recursive substitution of equation 1.3, it is straightforward
to see that entering year t having ratified one additional treaty in the previous year implies an
additional total stream of γka ∑sT=1 (γaa )s dollars of aid T years in the future. Thus, far enough into
the future, one additional treaty implies an additional
γka
1−γaa
∼ 3.80 dollars of aid per-capita. By
similar calculations, a one-standard deviation increase in treaty participation implies a short-run
increase of about 3.17 dollars and a long-run increase of 9.62 dollars of aid per-capita. At first
glance the effect of HRT participation on aid seems small given the mean (51.8) and standard
deviation (52.7) of aid per capita, but we note that the most important determinant of differences
in aid receipts across countries is actually permanent unobserved country specific heterogeneity,
and not any of the covariates we consider in the analysis. This is actually in line with Alesina and
Dollar (2000) who find that the colonial heritage and (permanent) strategic political importance of
a country are the most important determinants of aid receipts. Among time-varying covariates,
HRT participation is a relatively important determinant of aid receipts, as we discuss in more detail
below.
The difference in the estimates between columns 1 and 2 suggests that unobserved permanent
country heterogeneity is negatively correlated with treaty participation and positively correlated
with foreign aid receipts. One potential hypothesis is that there are some countries, such as Somalia, with permanent, poor social, political and economic institutions. These countries tend to be
economically poor and rely on foreign aid and moreover, their poor institutions are also inimical
to participation in international human rights initiatives. On the other hand, relatively wealthy
countries like Argentina do not rely on foreign aid, and possess more sound domestic institutions
which are conducive to participation in international institutions. While this argument has intu-
20
itive appeal, we provide some evidence below that the “omitted variable" of column 1 is not likely
to be domestic institutions.
The results in columns 1 and 2 suggest that the quality of real human rights institutions has
no statistically significant effect on aid receipts. The measure of human rights institutions we use
here is the “civil freedoms" variable from the Freedom House Comparative Survey of Freedom
(see Gastil(1990) for details).9 In columns 3 and 4 we add time fixed effects and region-time effects
respectively. While the reasons for including time fixed effects here may be obvious, the choice
to add region-time interactions is motivated by what we observed in figure 1.3 above: regional
political/economic initiatives which simultaneously cause aid and participation in international
treaties are potentially important. As we move to these columns we see that lagged human rights
becomes economically and statistically significant. Before interpreting this as a positive result, we
should be sure we are not simply obtaining an upwards bias in our estimate of the effect of human rights institutions on foreign aid receipts. The quality of domestic human rights institutions is
potentially correlated with other factors that explain foreign aid receipts, such as political institutions. It has been established in the foreign aid literature (Alesina and Dollar, 2000) that countries
with more democratic political institutions are rewarded with more aid. It is also well known
that democratization episodes are often accompanied by improvements in human rights institutions (Hafner-Burton et al. (2008)). Moreover, Alesina and Dollar present some evidence that,
controlling for democracy, human rights institutions have no effect on aid receipts. These findings
motivate column 5 of table 1.3. When we control for domestic political institutions (democracy
score) in column 5, the effect of human rights institutions on HRT participation becomes statistically insignificant, and is negative if anything, while the estimate of the effect of democracy is
economically and statistically significant, perfectly consistent with the results in the existing aid
literature. From an identification perspective one may be concerned about a potentially high level
9 The
Comparative Survey of Freedom contains two variables, a “political freedoms" variable, generally interpreted
as a measure of how democratic a country is, and the civil freedoms variable, which has been generally interpreted in
the political science literature as a measure of human rights (i.e., Hathaway, 2007). The political freedoms variable has
been used extensively in the economics literature, but the civil freedoms variable has not, perhaps owing to the fact that
there are relatively few studies on human rights in economics. One important exception is Alesina and Dollar (2000),
who use both variables.
21
Table 1.4: Human Rights-Democracy Episodes
Democracy
low
high
Total
Human
low
65
28
93
Rights
high
44
51
95
Total
109
79
Table 1.5: Number of Different Regimes Visited in Country History
Number of Regimes
1
2
3
4
Total
Number of Countries
20
29
26
8
83
of correlation between democracy and human rights institutions. While we discuss these two
measures in more detail below, for now as a check, we dichotomize each into “high" and “low" for
each country-time observation 10 and for each of the four possible democracy - human rights combinations, count the number of countries that have an episode at that combination at any point in
the history of our sample. The statistics are presented in table 1.4.
As there are 83 countries in our sample, it is clear that several countries necessarily visited
multiple democracy-human rights pairs in the duration of our sample. To further illustrate this
point, in table 1.5 we provide, for each number of possible regime pairs a country can visit in its
history, a count of the number of countries that visited that many pairs.
Table 1.4 suggests there are many episodes of low democracy-high human rights and high
democracy-low human rights. Moreover, table 1.5 suggests the presence of the most important
type of variation in institutions from an identification perspective, within country, over-time variation. While they are bound to be highly correlated, our democracy and human rights variables
10 For example democracy ranges from −10 to +10, so any country-time observation with greater than or equal to
zero democracy score is labeled “high." We do the analogous relabeling for the human rights variable.
22
are not measuring the exactly the same institutional features.
In column 6 of table 1.3 we include several variables that have been considered in the foreign
aid literature. Not surprisingly, countries with higher GDP per capita receive less aid. Political stability has no significant effect on foreign aid receipts. “Openness" of a country, a general measure
of economic policy created by Sachs and Werner (1995) is associated with less foreign aid. This is
possibly explained by the fact that countries closed to economic activity are more reliant on foreign aid than are more open countries. This finding is also generally consistent with the findings
of Alesina and Dollar (2000). In the final column of table 1.3, we consider the same model as in
column 6, but without allowing for controlling for unobserved heterogeneity of any form. We see
that the coefficient on treaty capital is economically insignificant and far from statistically significant. This provides some evidence against the hypothesis that the “omitted variable" in column 1
is simply permanent differences in the levels of political institutions and wealth across countries.
Even when we control for these factors, unless we allow for country specific heterogeneity, we get
an estimate of the effect of treaty capital on aid that is not statistically different from zero.
The question then is, why do aid receipts respond to treaty participation? On possible explanation is that treaty participation is a costly signal of a country’s preferences and intentions. As
an example:
In Estonia, 28 treaties...were ratified in one session, a month after independence, without even having been translated into Estonian. Estonia “wanted to send a strong signal
that it would respect human rights and was not a part of the Soviet Union anymore."
(Heyns and Viloen, 2002)
That a government of a country would agree to abide by the terms of 28 complicated documents
potentially not even knowing the language they were written in lends credence to the possibility that countries with a dark history signal to the Western world that they are now “open for
business." The link between treaty participation and aid receipts may be even more direct. For example, the EU’s Generalised System of Preferences Plus (GSP Plus) offers large tariff reductions as
23
well as other benefits to developing countries in exchange for the ratification of key international
treaties, 8 of which are UN human rights treaties that we consider here.11 While this particular
example is not one of foreign aid giving per se, it does indicate that HRT participation is used as a
criteria when the developed world decides who to give a “carrot" to.
(b) Human Rights Behavior
To the best of our knowledge this is the first study to consider the economic and political determinants of human rights behavior in the economics literature. In table 1.6 we present estimates of
equation 1.4.
11 See:www.ec.europa.eu/trade/wider-agenda/development/generalised-system-of-preferences/
http://trade.ec.europa.eu/doclib/docs/2005/june/tradoc1 23861.pd f
and
Column 2
.0001513
(.0003201)
.8058864 *
(.0160688 )
-.0157269
(.009243)
YES
NO
NO
.8802
2042
Column 3
.0005233
(.0003461 )
.7900121*
(.0168019 )
-.0167405
(.0090792)
YES
YES
NO
.8867
2042
Column 4
.0006184
(.0003521 )
.7852115*
(.0171473)
-.015276
( .0093057)
YES
YES
YES
.8911
2042
YES
YES
YES
.8925
2042
.021051*
(.0048281)
Column 5
.0003164
(.0003482)
.7318513*
(.0207996)
-.0172689
(.0092487)
Column 6
.000252
(.0003504)
.7282466*
(.0209071)
-.0191326*
(.0092234)
-.0173823
(.016706)
-.0032785
(.0019546)
.0188709*
(.0052418)
.1065236*
( .0458571 )
-.4198933
(.2395068 )
YES
YES
YES
.8931
2042
Column 7
.0002227
(.0002521)
.8624241*
(.0149096)
.0073752
(.0048028)
.019666*
(.0067289)
-.0004692
(.0007621)
.01263*
(.0032013)
.0607946*
(.0275029)
-.0070408
(.0105671)
NO
NO
NO
.8732
2042
Notes: Standard errors estimated using Newey-West (allows for autocorrelation and heteroskedasticity). The null hypothesis of unit root in
hrit is rejected in a standard Dickey Fuller test (the test statistic is almost -10).
Country FE
Time FE
Region*Time FE
R2
N
popit−1
openit−1
democit−1
Stabilityit−1
gdpit−1
Kit
hit−1
ait−1
Column 1
.0000956
(.0002048)
.9248508*
(.0082207)
.0060268
(.004748)
NO
NO
NO
.8703
2042
Table 1.6: Human Rights: Parameter Estimates
24
25
We consider the same set of specifications as in the case of foreign aid. In table 1.6 we see that
in columns 1-4 only lagged human rights institutions is statistically significant. Treaty capital has
an economically (but not statistically) significant negative effect. However, once we control for
democracy in column 5, treaty capital becomes statistically significant at the 10% level, and negative. Not surprisingly the democracy score has a positive estimated effect and is very statistically
significant. Once we add other covariates in column 6 we also see that openness has a positive
and statistically significant estimated effect on human rights behavior, but none of the estimates
from column 5 are affected very much.
The most surprising result here is that HRT participation has a negative effect on human rights
behavior. This would suggest that the institutions designed with the intention of preventing governments from violating the fundamental rights of their own citizenry actually have the opposite
effect. HRT participation makes governments more likely to commit human rights abuses. We are
not the first to find results suggestive of a negative relationship between HRT participation and
human rights behavior. Hathaway (2007) also finds a relative decline in human rights practices
among countries with a higher rate of treaty participation. In rationalizing this result she argues
that HRT participation often relieves “pressure for real change in (human rights) performance in
countries that ratify the treaty." In other words, aid recipients may take the international spotlight
off their real practices by ratifying unenforceable human rights treaties. While this “smokescreen"
argument does have some intuitive appeal, we have reason to be cautious. Goodman and Jinks
(2003) point out an alternative interpretation of Hathaway’s result, in that “greater compliance
with one obligation... can show up as lower compliance with another." Countries may ratify and
comply with the terms of a treaty concerned with one human right, and “substitute" into abuse
of another human right. Thus HRT ratification may be associated with future declines in human
rights behavior. In the case of our study this is not as much of a concern. Both our ratification variable Kit and our human rights institution variable are summary measures of treaty participation
and quality of domestic human rights institutions, and so what we are capturing in equation 1.4
is the effect of an increase in relative treaty participation on human rights institutions in general.
26
The other concern Goodman and Jinks have with Hathaway’s study which is relevant in our case
is the potential “measurement error" in the human rights behavior variable. They argue that the
measurement error is not exogenous, and is instead highly correlated with treaty ratification in
the sense that ratification is associated with improvements in the reporting of abuses. However
the Freedom House “civil liberties" variable is not a violation count variable, and we should be
confident that our study is not subject to this criticism (see Gastil (1990) for details).
The other result of interest in this regression is the positive and significant effect of trade
openness on human rights behavior. This result is not terribly surprising considering that many
trade deals between developed and less-developed countries have human rights related strings
attached, including the EU’s GSP plus program discussed above. Countries that are open to trade
are then more likely to have better human rights practices, because the return to good human
rights practices in the form of trade benefits are higher for these countries.
Treaty Capital
We present estimates of equation 1.9 in table 1.7.
Column 1
-.0004006
( .0002662 )
.0283617*
( .0110861)
.9545335*
(.008096)
NO
NO
NO
.9230
2042
Column 2
.0006869
( .0004728 )
.0455078*
( .018455 )
.819478 *
( .0188985)
YES
NO
NO
.9323
2042
Column 3
.0000689
(.0005001)
.0493893 *
( .0187422)
.8230187*
(.0187433)
YES
YES
NO
.9343
2042
Column 4
.0001304
(.0005024)
.0448022 *
( .0193904)
.8173725*
( .0189641)
YES
YES
YES
.9354
2042
YES
YES
YES
.9354
2042
-9.41e-06
(.0056218)
Column 5
.0001305
(.0005122)
.0448261
(.0247881)
.8173734*
(.0189968)
Notes: Standard errors estimated using Newey-West (allows for autocorrelation and heteroskedasticity)
Country FE
Time FE
Region*Time FE
R2
N
popit−1
openit−1
democit−1
Stabilityit−1
gdpit−1
Kit
hit−1
ait−1
Table 1.7: Treaty Capital: Parameter Estimates
Column 6
.0001309
(.0005052 )
.0422902
( .0247236)
.8116114*
(.0196132 )
-.0174141
( .0212753 )
-.0006018
( .0022304 )
.0003837
(.0058654)
.0288315
(.0570071)
.6424783*
(.3290319 )
YES
YES
YES
.9356
2042
Column 7
-.0004831
(.000326)
.0600241*
(.0169391)
.9547278*
(.0078307)
.004651
( .0090556)
-.0013737
(.000896)
-.0090468*
(.0034549)
-.0471121
(.0390001)
.0003138
(.0129301)
NO
NO
NO
.9251
2042
27
28
As we discussed above, these estimates should be interpreted with caution. A significant
amount of the variation in country i’s Treaty capital Kit is due to aggregate behavior of countries in the rest of the world. We shouldn’t expect to be able to explain this variation with political
and economic variables of country i only. Not surprisingly, the only variable other than lagged
treaty participation that is significant or close to significant throughout the columns of table 5 is
lagged human rights. The effect of human rights institutions on treaty capital is positive.
To get a better understanding of how the key variables of interest affect treaty participation
decisions, we also estimate the following equation:
dirt = γad ait−1 + γhd hit−1 + γkd Kit−1 + βd zit−1 + ωid + δtd + κrd + εdirt
(1.10)
where dirt is country i’s ratification decision (binary) in treaty r at time t, and κrd is a treaty specific
fixed effect. In table 6 we present the estimates of the full specification (including all controls and
fixed effects) of equation 1.10.
The two key results here are the economic and statistical significance of treaty capital and
human rights. Countries with a large stock of treaty capital are less likely to ratify a treaty than a
country with a low stock. Dynamic strategic effects are important, as countries are more compelled
to ratify a given treaty if the number of treaties they have already ratified is low relative to the
number of treaties that other countries in the world have ratified. Countries with good human
rights institutions are more likely to ratify than countries with poor institutions.
Summary of Main Results
The results of the previous section, taken together, are suggestive of the following short run relationships between foreign aid, human rights and human rights treaty participation.
1. Aid donors use relative HRT participation to decide who to donate foreign aid to. In contrast,
29
Table 1.8: Treaty Decision: Parameter Estimates
ait−1
hit−1
Kit
gdpit−1
Stabilityit−1
democit−1
openit−1
popit−1
R2
N
0.000034
(0.000059)
0.007456*
(0.002546)
-0.016046*
(0.001364)
-0.002122
(0.002770)
-0.000021
(0.000261)
-0.000055
(0.000637)
0.005442
(0.006720)
0.089781*
(0.034281)
.094
14112
the quality of a country’s domestic human rights institutions are inconsequential for aid receipts. While rewarding a nominal measure of human rights and ignoring the real measure
seems like strange behavior on the part of donor countries, this behavior is possibly rationalized by the fact that the actual aid recipients themselves often do not have direct control
over actual human rights institutions in the short run, but do have control over the decision
to participate in an HRT. Moreover, from the point of view of donors, HRT participation is an
easier object to quantify than human rights institutions.
2. Countries with good human rights institutions are more likely to participate in HRTs. Surprisingly, countries with a high level of predetermined HRT participation experience a decline in the quality of domestic human rights institutions. Countries with good institutions
are by definition ex-ante compliant, and so ratification is less costly for these countries. Further, in light of point (1), if donors used real human rights institutions instead of the nominal
HRT
participation as the criteria for distributing aid, countries with good domestic human
rights institutions would have no reason to ratify HRTs. It is then intuitive that these coun-
30
tries are more willing to participate. While it is surprising that countries that participate
frequently experience a decline in domestic human rights institutions, it is possible that HRT
participation is used as something of a “smokescreen." Countries under scrutiny for their
practices participate in order to take the spotlight off their real behavior.
3. Countries with a low level of HRT participation are more likely to ratify a given treaty than
countries with a high level of HRT participation. This suggests that dynamic strategic effects
are important. The past ratifications of other countries increase the likelihood of ratification
for a country.
These results raise further questions. We have established that predetermined HRT participation has an economically and statistically significant effect on aid receipts, but is increased foreign
aid receipts the motivation behind ratification? And if so, why don’t all countries ratify all available treaties immediately? There must be some cost to ratification. A study of the decision to
ratify HRTs is not complete without understanding where these costs come from and the role they
play in the participation decision. It is clear also that there is significant heterogeneity in participation rates across countries, suggesting that there is heterogeneity in the net benefit to participation
across countries. One other question of interest is, what specifically is driving this heterogeneity in
net benefit? To address these questions we develop and estimate a structural model that allows for
the dynamic strategic interaction we have found evidence for. We then use the estimated model
to consider counterfactual experiments.
1.4
1.4.1
Model
Discussion
The structural model we propose here has as an analogue in the empirical IO literature to a
dynamic game of oligopoly competition where firms invest in quality (eg.,Pakes and McGuire
(1994)). Before formally laying out the model it is worthwhile to make explicit the relation between a dynamic game of quality competition and the problem we study here. In a model of qual-
31
ity competition, each firm in an industry composed of several firms produces a product which is
indexed by quality. Consumers derive utility only from the quality (net of price) of the good they
choose to purchase, and so demand for a given product in the industry is determined fully by its
quality relative to the quality on offer from the other firms. Firms also have the option to increase
the demand for their products by making a costly investment in quality. Firm profits thus depend
on quality through both the revenue and cost channel: a higher quality means more demand and
the ability to command a higher price, but comes at an economic cost. Firms who do not invest in
quality may see their demand decrease if other firms in the market continue to invest.
In the model we consider here, poorer countries in the world compete with each other to increase aid receipts from OECD countries. We also allow for the possibility that countries have an
intrinsic preference for human rights. Donor countries have a finite amount of aid to distribute
among recipient countries, and rely on K to make allocation decisions. Recipient countries have
the option to increase the aid they receive by making a costly ratification of a human rights treaty.
This is a costly investment because ratifying a treaty commits a country to the terms of the treaty,
at least in principle. Different treaties are allowed to have different ratification costs for a country,
and different countries can have different costs of ratification of a given treaty. For example, countries with different human rights institutions or different political institutions may have different
costs of participation. As in the quality investment model country payoffs depends on quality
through both the benefit and cost channel: a larger treaty capital stock means more aid and economic growth, but comes at a cost. Further, countries that do not invest in quality may see the
economic resources they receive decline if other countries continue to increase their participation
in human rights treaties.
1.4.2
Formal Model
At year t, the international community is configured by Ct recipient countries and Nt treaties.
Countries and treaties are given exogenously in our model. Let xirt ∈ {0, 1} indicate country i’s
status in treaty r at year t. If xirt = 1, we say country i has ratified treaty r at some time τ < t.
32
We can represent country i’s membership status in the set of Rt treaties at time t by the vector
xit = { xirt : r = 1, 2, ..., Nt }, and we can represent the ratification status of the entire international
community as the vector xt = {xit : i = 1, 2, ..., Ct }. Ratification is irreversible: once a country has
ratified a treaty it may not exit (erase its name) from the treaty, an assumption clearly validated by
the data. We represent the ratification decisions at period t as dit ≡ {dirt : r = 1, 2, ..., Nt }.
Country payoffs in year t are the difference between per-period economic payoff Ri and a
ratification “investment" cost ECi :
Πi (xt , zt , dt , ε it ) = Ri (xt , zt ) − ECi (xt , zt , dit , ε it )
(1.11)
zt is a vector of exogenous political and economic variables and ε it is a vector of private information shocks of country i. We specify the benefit and ratification cost functions in turn.
Economic Payoffs
We make the following assumption on the economic payoff function Ri (xt , zt ):
• Assumption (E1) The function Ri (xt , zt ) depends on the vector xt , zt only through its effect
on foreign aid receipts and human rights. Specifically, let ait = ai (xt , zt ) and hit = hi (xt , zt )
represent country i’s foreign aid received and human rights score respectively at year t.
Then:
Ri ( x t , z t ) = α a ai ( x t , z t ) + α h hi ( x t , z t )
ai (xt , zt ) and hi (xt , zt ) are functions that make explicit the fact that these variables depend on
treaty status xt as well as other political and economic variables zt . α a and αh are parameters to
be estimated, representing the relative weights of aid and human rights in country payoffs. What
remains then is to specify the functions ait = ai (xt , zt ) and hit = hi (xt , zt ). While it is natural here
to use the results of the first stage estimation, we considered several versions of the equations aid
and human rights equations 1.3 and 1.4, and we must settle on a specification. The trade-off here
33
is that a richer specification provides a more accurate depiction of the true relationship among
the variables, but comes at the conceptual cost of clarity and computational cost of a larger state
space. We would like a specification rich enough so as not to sacrifice reality but concise enough
so that we can narrowly focus on answering our primary questions of interest. Note that in the
empirical results above, once we condition on all three forms of unobserved heterogeneity as well
as democracy, adding other covariates does not significantly alter the results. With this in mind,
the specification we consider is given by:
ait = γaa ait−1 + γha hit−1 + γka Kit + β a demit−1 + ũita
(1.12)
hit = γah ait−1 + γhh hit−1 + γkh Kit + βh demit−1 + ũith
(1.13)
where:
j
j
j
j
j
uit = ωi + δt + νr(i),t + ũit
(1.14)
We allow for all three forms of unobserved heterogeneity in the transition of the endogenous variables: country-specific across time, time-specific across country, and time-specific within region
across country. This is the specification that we considered in column 5 of tables 1.3 and 1.6 above.
Implicit in this specification of Ri (xt , zt ) is the following assumption:
• Assumption (E2) The vector of ratification statuses xt of the international community enter
the payoff of country i in the dynamic game only through Kit .
Treaties are interconnected in the sense that we allow ratification decisions in one treaty to affect
the payoff to ratifying any other treaty. Country i0 s status, and the status of all other countries in
all other treaties, influences country i’s decision in any given treaty. This is a departure from the
traditional “isolated markets" assumption typically made in the literature in empirical industrial
34
organization. As we discuss below in the estimation of the dynamic model, this specification
of the payoff function plays an important practical role in alleviating the computational burden
associated with the solution and estimation of the dynamic game.
Investment Costs
The ratification cost function ECi (xt , zt , ε it ) is directly analogous to the concept of irreversible investment cost in the IO literature. It should be interpreted as the sum of a one time cost paid upon
ratification of (investment in) the treaty plus the discounted present value of a sequence of fixed
costs of remaining in the treaty. We specify the following investment cost function for ratifying
treaty r in country i at time t:
ECirt = γi + ξ r + γde demit + γh hrit + γdh demit hrit + ε irt
where γi is the country specific component of the entry cost, ξ r is a treaty specific cost, and demit
and hrit are the levels of democracy and human rights institutions in country i at time t. γde
and γhr represent the affect on entry cost to having democratic institutions and to having good
human rights institutions respectively. ε irt is a random shock with distribution G, which is private
information of country i. We discuss the private information shock in more detail below.
It is important to note here that we can not estimate a fixed cost of being a party to a treaty. To
separately identify fixed costs from our investment costs we would need to observe both entry and
exit into treaties. Since ratification decisions are irreversible, we do not observe exit. The choice
to include democracy and human rights in the cost function is not arbitrary. Hathaway (2007)
finds that for a given level of democracy, countries with worse human rights practices have lower
probability of ratifying HRT’s, while for a given level of human rights, more democratic countries
have a lower rate of ratification. Hathaway theoretically links state decisions to ratify a human
rights treaty to the domestic enforceability of the treaty by arguing that ratification is only costly
for those countries that a) are not compliant with the treaty’s terms ex-ante of ratification and b)
35
are to the terms of the treaty by some domestic enforcement mechanism post-ratification. While
the cost of abiding by the terms of a treaty is in principle the same for any ratifier, only countries
that need to change their behavior post ratification actually pay the cost. Our specification of ratification cost allows us to test Hathaway’s hypothesis in the context of our model. We also allow
for treaty and country specific heterogeneity in ratification cost. There is substantial and persistent variation in the rate of ratification across countries and treaties. As an example nearly every
country in the world has ratified the Convention on the Rights of the Child, but a comparatively
small fraction have ratified the Convention on the Protection of the Rights of all Migrant Workers,
even though the latter opened 5 years prior. The country specific differences are often stark as
well. Armenia and Kazakhstan, two countries with similar recent political histories follow very
different HRT ratification paths post independence in 1991. Of the treaties in our data set that both
countries have ratified, Armenia ratified all but one well before Kazakhstan. These suggest that
ignoring the potentially permanent differences across countries and treaties in the estimation of
the model may lead to spurious results. Further, by allowing costs to vary across treaty type for a
given regime we are also able to study the possible relationships between observable institutional
features of the treaties and the cost of ratification.
Implicit in our definition of the payoff function Πi is a time to build assumption:
• Assumption (E3): Ratification costs are paid at the time period of ratification, but the ratification decision is not effective until the following time period.
As we discussed above, countries do not see the benefits of a ratification made in period t until
period t + 1. This is especially justified in the case of aid, as we are considering aid disbursement
data. This means that the aid received in period t + 1 was decided by donors in period t. A treaty
ratification in period t does not affect aid receipts in period t.
Country Strategies and Equilibrium
We assume that countries are forward looking and maximize intertemporal payoffs, and take
36
into account the direct effect of their actions on their own future payoffs as well as the indirect
effect through the expected reaction of other countries. Further, we assume that strategies depend only on payoff relevant variables. That is, we restrict players to use Markov strategies.
Given the above discussion, country i’s payoffs at time t depends on the vector of state variables
ait , hrit , Kit , demit , ε it . The endogenous state variables are ait , hrit , Kit , while the exogenous state
variables are demit , ε it . The model constitutes a dynamic game because the evolution of the variable Kit depends on the actions of all the players.
Keeping in mind that the payoff-relevant variables for player i at time t are ait , hrit , Kit , demit , ε it ,
for notational simplicity, we continue to represent the full state of the game at time t for player i
by the vector {xt , zt , ε it }. Let σi (xt , zt , ε it ) be a strategy function for country i. Given this strategy
function we can define a conditional choice probability (CCP) function as
Pi dit |xt , zt ≡
Z
I σi (xt , zt , ε it ) = dit dG (ε)
In words, this is the probability with which country i takes action dit at time period t given the
ratification status xt and state zt .
We define country i’s value function given equilibrium entry probabilities Pi d−i |xt , zt as
V Pi (xt , zt ,ε it ), and we say that a strategy function σ is a Markov Perfect Equilibrium if for any
possible state (xt , zt ,ε it ):
n
o
σi (xt , zt ,ε it ) = arg max Πi (xt , zt , ε it ) + δE V Pi (xt+1 , zt+1 , ε it+1 )|xt , zt
dit
We discuss the dynamic model and further assumptions in more detail in the estimation section below.
37
1.5
Estimation of the Dynamic Model
The dynamic model we have described above leads to three interrelated dimensionality problems
that render estimation of the model in its current form impossible:
1. (P1) In the model section above we have formally defined the Markov states of our game to
be xt , zt , and thus player strategies depend on the full vector (xt , zt ) of treaty statuses and
exogenous variables. Ignoring zt , the dimension of xt alone is 2 Nt ∗Ct , which, given the data
we consider, can be as large as 21245 . Solving the value functions associated with a game
defined on this space is computationally infeasible.
2. (P2) Player i’s action space at time t, Dit , is the set of treaties yet to be ratified by player i.
This can be as large as 215 .
3. (P3) Player payoffs depend on the behaviour of the other players in the game. Thus the
expectation of future payoffs depend on expected behaviour of all (83) players, and the transition of the state variables has a very high dimension.
We describe in detail how we deal with each of these dimensionality problems so as to facilitate
estimation of the model. As described above, a key benefit of the modeling approach we have
taken is that the economic payoffs of players in the game depends on the vector (xt ) only through
the functions ai (xt , zt ) and hi (xt , zt ). This effectively solves problem (P1) above. Since potentially
many values of xt , zt yield the same value of ait , hit , 12 the space over which player strategies are
defined is considerably reduced.
To deal with problem (P2) we adopt the method proposed by Aguirregabiria and Ho (2009).
We assume first that at each period t, each country appoints a committee to each treaty it has not
ratified yet. This committee observes some private information about the treaty that neither any
other country, nor any other committee in its own country observes. Based on this information,
the state of the game, and beliefs about the strategies of other committees within the country
12 This
of course depends on the coarseness of the grid ait and hit are discretized on, which we discuss later.
38
and committees in other countries, the committee makes a recommendation to the government to
either ratify or not. The government then takes the decision that was recommended. We imagine
that the government finds it too costly to research the implications of ratification of all the treaties
open to it and delegates this task to the committee. Committees within a country can not share
all their private information with each other, and thus do not fully co-ordinate. In this sense we
are moving away from the “state as monolithic decision maker" assumption that has been used
in similar problems (i.e., Wagner (2008)). What does this assumption buy us? It allows us to
treat each country as a “different player" in each treaty, while still relaxing the “isolated markets"
assumption. The country payoff to ratifying each treaty is still affected by the decisions made in
other treaties through the K variable.
13
More formally, we make the following assumptions:
• (D1) Committee r in country i at time t makes recommendation dirt ∈ {0, 1} to maximize the expected discounted value of the stream of country-treaty (committee) payoffs:
s
Et ∑∞
s=1 β Πir,t+s , where:
Πirt ≡ xirt Ri (1, x−irt , zt ) + (1 − xirt ) Ri (0, x−irt , zt ) − dirt (1 − xirt ) ECirt ( xt , zt , ε irt )
• (D2) The shocks {ε irt } are private information of committee r in country i at time t. These
shocks are unknown to the other committees in country i and unknown to all other countries.
Assumption D1 explicitly says that committees that enter period t having ratified the treaty for
which they are responsible (xirt = 1) obtain Ri (1, x−irt , zt ), and committees that have yet to ratify
earn Ri (0, x−irt , zt ). Further, committees that have yet to ratify and recommend ratification in year
t, i.e., (dirt = 1, xirt = 0), pay the cost ECirt ( xt , zt , ε irt ). Assumption D2 says that there is statistical
independence across treaties within a country. Note that committees within a country have the
13 This assumption is motivated partly by the literature on “formal" and “real" authority in organizations, spawned
by Aghion and Tirole (1997). In this literature “formal" authority is defined as the right to decide, while “real" authority
is defined as effective control over a decision. The difference between the two types of authority is generated by private
information. Aghion and Tirole derive conditions under which a principal (here the leader of a country) may allocate
authority over a decision to an agent (here the treaty committees) who possesses private information. Importantly, they
show that when a principal can “trust" an agent (i.e., aligned incentives), authority will be delegated. Here our agents’
payoffs are closely aligned with those of the country.
39
same general objective, as the variables that enter the committee economic payoff are country level
aid and human rights. While committees in the same country are playing “against" one another in
the treaty ratification game, the decisions of committees within the same country and the decisions
of committees in other countries enter the payoffs of a committee differently. Technically speaking,
the portfolio of treaties chosen by the country (the set selected by the committees) will be the
optimal portfolio for the country up to a deviation in one treaty holding decisions in all other
treaties fixed.
14
While he have solved the computational issues associated with P1 and P2, we are still left with
P3, the problems associated with each player conditioning his strategy on the state variables of all
players in the game.
Note now that for any treaty committee r in country i, current payoff is fully determined by
it’s own status xirt , three endogenous variables, treaty capital Kit and the payoff variables ait , hit
and the exogenous variable demit Then define the reduced vector of variables wirt :
wirt ≡ { xirt , Kit , ait , hit , demit }.
(1.15)
To alleviate the computational burden associated with P3, we make the following further assumption:
Assumption (D3): The strategy function of treaty office (i,r) is given by σir (wirt , ε irt ) that maps from
W × R into {0, 1}
As we did above for the full state space, we can now define the vector of conditional choice
probabilities (CCPs) associated with the strategy functions σ as:
Pir (wirt ) ≡
14 The
Z
I {σir (wirt , ε irt ) = 1}dGε (ε irt ).
(1.16)
inability of committees within a country to perfectly co-ordinate and select the best overall portfolio of treaties
for the country to hold at a given time can be alternatively viewed as a bounded rationality assumption. Information
about treaties is costly to aggregate and process for countries, and so each decision is made in some isolation.
40
With these assumptions in hand we now write the value function of committee r in country i at
time t:
VirP (1, wirt ) = Ri (1, wirt ) + δE VirP (1, wirt+1 )|1, wirt
VirP (0, wirt ) = Ri (0, wirt ) + max − ECir (wirt , ε irt ) + δE VirP (1, wirt+1 )|1, wirt , δE VirP (0, wirt+1 )|0, wirt
While the payoffs (and thus strategies) of each player still depend on the treaty statuses of all
other players through K, an issue we deal with in the following subsection, the assumption D3
has considerably reduced the dimension of the vector of variables that each player conditions his
behavior on.
Markov Perfect Equilibrium under Assumptions D1 - D3
We now define a Markov Perfect equilibrium of the treaty ratification game given assumptions
D1 − D3. Note that by the time to build assumption, we can write the flow economic payoff to
choosing action d for treaty office r in country i at time t as Πirt (d) = α a air + αh hir − d ∗ ECirt .
Formally, let œ ≡ {σir (wirt , ε irt ) : i = 1, 2, ..., C; r = 1, 2, ..., R; } be a set of strategy functions, one
for each treaty office in each country, such that σir maps from the space W × R into {0, 1}. Then σ
is an MPE if for every treaty office in every country (i, r ) that has yet to ratify treaty r (i.e., xirt = 0)
and every possible state (wirt , ε irt ):
{σir (wirt , ε irt ) = 1} ⇐⇒
ε irt ≤ − ECirt + δE VirP (1, wirt+1 )|1, wirt − δE VirP (0, wirt+1 )|0, wirt ,
and {σir (xirt , ε irt ) = 1} for every treaty office in every country (i, r ) that has ratified treaty r (xirt =
1). In words, each treaty office that has yet to ratify ratifies the treaty if and only if doing so
maximizes the value of the country given the state and the decisions of all other treaty offices in
all countries. By the assumption of irreversibility treaty offices that have already ratified in a past
period do not make a decision in the current period.
41
Then, a Markov Perfect Equilibrium of our dynamic game can be expressed as a vector P =
{ Pir (w)} of conditional choice probabilities (CCPs) such that for every (i, r, wirt ) such that xirt = 0:
Pir (wirt ) = Gε − ECirt + δE VirP (1, wirt+1 )|1, wirt − δE VirP (0, wirt+1 )|0, wirt
(1.17)
and Pir (wirt ) = 1 otherwise. More explicitly:
Pir (wirt ) = Gε − ECirt + δ ∑ VirP (w0 ) f irw,P (w0 |1, wirt ) − f irw,P (w0 |0, wirt )
(1.18)
w0
where f irw,P (wirt+1 |dirt , wirt ) is the transition probability of the vector of payoff relevant state variables w given equilibrium probabilities P.
Finally, before moving to the estimation of the structural parameters, we need to address further the computational burden caused by the number of players in the game. Here the issue is estimation of the transition probabilities f irw,P (wirt+1 |dirt , wirt ) of the state vector wit . In more standard
applications the transition of the exogenous payoff relevant state variables is estimated separately
from the conditional choice probabilities using simple maximum likelihood methods, while the
transition of the endogenous payoff relevant state variables which depend on the choices made
by the players in the game is estimated jointly with the parameters of the model. In the current
application, the large number of players in the game renders the standard method computationally infeasible. The transition probability of the payoff variables ait = ai (xt , zt ) and hit = hi (xt , zt )
depends on the actions of all players in the game. To minimize the computational burden, we
estimate the transition of wit separately from the choice probabilities P, in a manner analogous to
Hendel and Nevo (2006). Specifically, we consider a VAR process for the vector wi , allowing for
country-specific transitions. Then using the estimated parameters of the VAR model, the transition
probability vectors f irw,P (wirt+1 |dirt , wirt ) are calculated following Tauchen (1996).
42
Now we have the elements we need to estimate the full dynamic model. Given the model we
have described above, the vector of structural parameters θ of interest is given by:
θ = {α a , αh , {γi }iN=1 , {ξ r }rR=−11 , γde , γh , γdh }
that is, the weights on foreign aid and human rights in the countries per-period payoff, the country specific entry cost parameter, the treaty specific entry cost parameters, and institutional cost
parameters.
We assume that the country-treaty private information ε irt is normally distributed with variance σε2 . Following Aguirregabiria and Mira (2007), we express the entry thresholds and conditional choice probabilities in a form that is more convenient for the purposes of estimation (see the
appendix for a detailed derivation):
P
P θ
+ ẽirt
Pir (wirt ) = Φ z̃irt
σε
Estimator
For notational simplicity, let us redefine θ =
θ
σε .
For some arbitrary value of the parameters
and entry probabilities θ, P, define the log likelihood function:
T
L θ, P =
Nt Ct
∑ ∑ ∑ dirt ln Φ
P
P
P
P
z̃irt
θ + ẽirt
+ (1 − dirt ) ln 1 − Φ z̃irt
θ + ẽirt
(1.19)
t =1 r =1 i =1
We estimate the model using the Nested Pseudo-Likelihood Estimator (NPL). As a way of providing a simple description of the estimator and explaining our reasons for choosing it, consider
first the two-step Pseudo-Maximum Likelihood (PML) Estimator. Specifically, let θ0 , P0 represent the true parameter vector and CCP vector in the population. The two-step PML Estimator of
the above likelihood function is a pair θ̂, P̂ such that P̂ is a consistent non-parametric estimator
of P0 , and θ̂ maximizes the Pseudo-Likelihood function L θ, P̂ . In many applications, in partic-
43
ular those involving permanent unobserved heterogeneity, the implementation of this estimator
is problematic, because obtaining an unbiased estimator of the choice probabilities P̂ is not feasible, and thus the PML estimator is not an attractive option. Given the very likely importance
of country and treaty specific permanent unobserved heterogeneity (the rough analogue of firm
and market level permanent unobserved heterogeneity in more standard IO applications), we face
such a problem in our model. The NPL estimator has a clear advantage in such cases, as a consistent estimator of choice probabilities is not required. One can obtain the NPL estimator in the
following way. Given any (consistent or not) initial estimate of the true choice probabilities P0 , say
P1 , one may obtain the vector θ 1 that maximizes the pseudo-likelihood L θ, P1 . This estimate allows us to obtain an updated estimate of the choice probabilities P2 using the mapping described
above:
1
1
P 1
P
P2 = Φ z̃irt
θ + ẽirt
With these estimates of the choice probabilities in hand, we now find the parameters θ that
maximize L θ, P2 , and again obtain a new estimate of the choice probabilities using these parameter estimates. We continue iterating in this fashion until the sequence of probability estimates
converges to the limit P∗ . The vector θ ∗ that maximizes L θ, P∗ is the NPL estimator.1516
Estimation Results
For expositional purposes, we divide the presentation and discussion of the structural parameter
estimates into three subsections, key payoff parameters, treaty specific costs, and country specific
15 See
Aguirregabiria and Mira (2002,2007) for details.
alternative method for estimating the model we have presented here is that of Benkhard, Bajari and Levin
(BBL) (2007). There are two potential reasons a practitioner may prefer to use BBL over NPL. First, when using BBL
one never has to invert a large matrix to solve value functions, whereas in using NPL we must solve the value functions once for each NPL iteration. Second, BBL allows for continuous variables in the state space, while NPL does
not. Since many applications are naturally modeled with continuous state variables, this is an important consideration. However, we prefer the NPL estimator for several reasons. First, implementation of BBL requires a consistent
estimate of the conditional choice probabilities, which as we argued above, is typically not available for applications
with permanent unobserved heterogeneity. Second, NPL generally delivers more efficient estimates of the structural
parameters. Finally, while it is true that not having to incur the computational cost of solving value functions is an
important virtue, we will be using the estimated parameters to perform counterfactuals. In order to perform counterfactuals, regardless of the estimation procedure, one must solve the value functions. Given that we must endure this
cost at the counterfactual stage in any case, the benefit of avoiding it at the estimation stage is not as large.
16 One
Table 1.9: Structural Parameter Estimates
44
γ̄i
5.097
Mean Country Specific Ratification Cost
γde
0.0610
( 0.1611)
0.0509
(0.0394)
-0.1032
(0.0380)
0.0945
( 0.0167 )
1.4390
( 0.6355)
0.1268
14077
Contribution to Cost from Democracy Status
γh
γdh
αa
αh
Pseudo-R2
N
Contribution to Cost from Human Rights Status
Democracy-Human Rights Interaction
Weight on Aid in Payoff Function
Weight on Human Rights in Payoff Function
Notes: Standard errors should be interpreted with caution. We are not accounting for the variability of the
first stage estimates in our calculation of the standard errors.
costs.
(a) Payoff Parameters
The estimates of the weights α a , αh in the revenue function and the γ vector of cost parameters are
presented in table 1.9
We find that countries have a positive preference both for aid and for human rights institutions. That countries have a positive preference for foreign aid in our game confirms that foreign
aid benefits are not only associated with HRT participation, but they motivate a country’s decision
to participate in an HRT. To quantify the relative importance of foreign aid and human rights in a
country’s flow payoff, we can use the estimated parameters to calculate how much foreign aid a
country at the 25th percentile of human rights institutions would be willing to forego in order to
jump to the 75th percentile, and vice versa. We find that a country would be willing to pay $45.71
per capita in foreign aid to improve human rights institutions form the 25th to the 75th percentile
(an increase of 3 on the human rights institutions score), and that a country would be willing to
accept a drop of 3.55 on the human rights institutions score to jump from the 25th to 75th percentile in foreign aid (an increase of $54 per capita of aid). While these numbers are informative
in that they help interpret the estimated weights in the payoff function, we should note that these
calculations do not take into account the fact that changing domestic human rights institutions
45
Figure 1.4: Ratification Cost as a Function of Human Rights Institutions
changes the cost of treaty participation and thus the cost of obtaining aid in the future, and moreover that this effect is heterogeneous across countries depending on political institutions. Further,
the calculations do not take into account that changing human rights institutions, by changing a
country’s payoff to ratification directly through cost, also effects the decisions of other countries
which enter into country payoffs through foreign aid. Generally speaking, the calculations ignore
the dynamic and strategic effects. Below we use counterfactual analysis to more fully quantify
the relative importance of foreign aid and human rights institutions in explaining the ratification
decision.
Our regime cost estimates allow us to directly consider the hypothesis of Hathaway (2007)
that democracies may ratify less frequently if domestic human rights institutions are such that the
country is ex ante non-compliant with the treaty terms. We find that in the context of our model,
this is not the case. Our results suggest that, for two otherwise identical countries, one democratic
and one autocratic, regardless of the human rights record of the countries, the democratic country
always pays a lower cost of ratification than the autocratic country and this difference increases
as the quality of domestic human rights institutions improve. To illustrate this result in figure 1.4
we plot the estimated contribution to ratification cost from being a democracy and from being an
autocracy as a function of human rights behavior, holding all else constant.
46
This result stands in contrast to the findings of Hathaway (2007), who argues that democracies
with poor human rights records find it more costly to ratify than autocracies with poor human
rights records, because democracies actually realize the cost of ratification through their domestic
institutions while autocracies do not. We in fact find that, for a democracy, the cost of ratification
decreases in the quality of human rights institutions, but that the opposite is true for autocracies.
Ratification for an autocracy with good human rights practices is costly relative to an autocracy
with good human rights practices.
We acknowledge that the source of the difference between our findings and Hathaway’s may
lie simply in the fact that we are looking at different sets of countries (we consider only recipients of OECD aid, while Hathaway considers both aid donors and recipients), or in the fact that
she considers specific treaties and human rights individually while we are studying more general
measures of human rights behaviour and treaty participation. However, we must also consider
the possibility that the difference in our results is being driven by differences in our methodological approaches. First, in our analysis we are controlling for a variable that Hathaway “omits" from
hers: the benefits of ratification. Second, we are controlling for country specific heterogeneity both
in costs and benefits to ratification.
17
Ignoring permanent unobserved heterogeneity in ratifica-
tion decisions can result in biased estimates of ratification costs if the unobserved heterogeneity is
also correlated with the democracy and human rights measures in the cost functions.
But supposing the cost estimates presented here are correct, we face the burden of rationalizing
what we see in figure 1.4. The costs of HRT ratification in Hathaway’s theory are largely domestic
costs which depend on the ability of actors within the country to hold the government to the terms
of the treaty. It is plausible that the cost of ratification actually has a significant non-domestic dimension in the sense that the international community monitors and either implicitly or explicitly
punishes non-compliance. As we discussed above, we envision the cost of ratification itself being
the sum of two costs, the cost of complying (given that the ratifier complies) and the cost of not
17 We
are sympathetic to the fact that Hathaway(2007) employs hazard analysis, with which it is difficult to allow for
permanent unobserved heterogeneity.
47
complying. To take an extreme case, suppose that democratic countries comply and autocratic
countries do not. As long as the costs from non-compliance are severe enough, autocracies will
always have a larger cost to participation. While the source of the cost of complying is relatively
clear, this is not the case for the cost of not complying. One possibility is reputation (Goodman
and Jinks, 2004). Countries who ratify a treaty and later flout its terms risk their international reputation and thus their opportunities in other realms of global interaction. However we still need
to explain the widening gap in costs as domestic human rights institutions improve. That the cost
of ratification for a democracy decreases as the quality of domestic human rights institutions improves is intuitive. Democracies are likely to comply with the terms of the treaty, and compliance
is cheaper if the democracy is ex-ante compliant. Then why does the cost of ratification for an autocracy increase in the quality of its human rights institutions? We have argued that an autocracy’s
ratification cost is likely to be comprised primarily by the cost of not complying, which would
be imposed by the international community. Suppose further that non-compliance is not perfectly
observed by the international community. The international community relies on media and NGO
presence, for example, for information about violations of treaty commitment. The human rights
institutions variable measures, among other things, freedom of media, freedom to demonstrate
and open public discussion. A country with high quality human rights institutions, according to
our measure, is thus more likely to have any violations of HRT commitments become known to
the international community, and is thus more likely to realize the cost of non-compliance. Thus,
an autocracy may actually have a higher cost of ratification if it has high quality human rights
institutions. Of course without more detailed data on human rights institutions we can not verify
this hypothesis, but it does present one way to rationalize our findings.
(b) Treaty Specific Costs
The estimates of the treaty specific costs are in table 1.10.
The most striking pattern in the treaty specific cost estimates is the substantial variance in the
costs across treaties. For example, the cost of ratifying the Convention on the Rights of the Child
48
Table 1.10: Structural Parameter Estimates : Treaty Specific Costs
ξ1
ξ2
ξ3
ξ4
ξ5
ξ6
ξ7
ξ8
ξ9
ξ 10
ξ 11
ξ 12
ξ 13
ξ 14
ξ 15
N
-0.0213
( 0.1322)
-0.1811
( 0.1638 )
0.3811
( 0.1607 )
-1.5183
( 0.4390)
0.8208
(0.2079 )
1.2245
( 0.2416 )
0.0385
( 0.1373 )
-0.5286
(0.1787)
0.4105
(0.1469 )
0.5587
(0.1605)
0.7794
(0.1809)
0.3923
(0.1630 )
0.1282
(0.1648 )
-0.0039
(0.1362)
14077
Economic, Social and Cultural Rights
Civil and Political Rights
Elimination of All Forms of Racial Discrimination
Prevention and Punishment of the Crime of Genocide
Rights of Child
Protection of the Rights of All Migrant Workers and Members of their Families
Non-applicability of statutory limitations to war crimes and crimes against Humanity
Suppression and Punishment of the Crime of Apartheid
Elimination of All Forms of Discrimination against Women
Against the Taking of Hostages
Prohibitions or Restrictions on the Use of Certain Conventional Weapons
Protection of Performers, Producers of Phonograms and Broadcasting Organisations
Political Rights of Women
Status of Refugees
Torture and Other Cruel, Inhuman or Degrading Treatment or Punishment
Notes: Standard errors should be interpreted with caution. We are not accounting for the variability
of the first stage estimates in our calculation of the standard errors.
49
is nearly half the cost of ratifying the treaty on The Non-Applicability of Statutory Limitations
to War Crimes and Crimes Against Humanity. To understand this result, let us consider closely
two treaties, the Convention on the Elimination of all Forms of Discrimination against Women
(CEDAW) (1979) , and the Convention on the Political Rights of Women (CPRW) (1952). We choose
these treaties to illustrate our point because they address a similar issue, but have very different
estimated costs of ratification: CPRW is much more costly to ratify than CEDAW is.
This is somewhat puzzling. Ex-ante, if one treaty was more costly than another, we would
naturally expect CEDAW to be more costly. Discrimination against women in the political sphere
is merely one form of discrimination against women. If a country is willing to ratify a treaty
concerned with all forms of discrimination against women (CEDAW), it should be willing to ratify
a treaty concerned with just one type of discrimination against women (CPRW). Consider articles
1-2 of CPRW:
Article 1 : Women shall be entitled to vote in all elections on equal terms with men, without any discrimination.
Article 2 : Women shall be eligible for election to all publicly elected bodies, established by national law, on
equal terms with men, without any discrimination.
By contrast, consider now articles 2 and 3 of CEDAW (article 1 simply defines “discrimination
against women"):
Article 2 : States Parties condemn discrimination against women in all its forms, agree to pursue by all appropriate means and without delay a policy of eliminating discrimination against women, and to this
end undertake:
– To embody the principle of the equality of men and women in their national constitutions or other appropriate legislation...
– To adopt appropriate legislative and other measures, including sanctions where appropriate, prohibiting all discrimination against women...
50
Article 3 : States Parties shall take in all fields, in particular in the political, social, economic and cultural fields,
all appropriate measures, including legislation, to ensure the full development and advancement of
women... on a basis of equality with men.
In particular, note the ease with which articles 1-3 of CPRW can be verified. Election monitoring by reputable organizations deployed by the OECD and the EU occur regularly in countries all
over the world.18 Further, the monitoring is generally done over a long period prior to the election itself. Any failure to comply with any of articles 1-2 of CPRW would not go unnoticed, and
at the very minimum would be brought to the attention of the international community. On the
other hand, what is demanded by CEDAW is to a great extent open to interpretation. Verification
of compliance ex-post of ratification would require consensus in the international community on
whether a violation has occurred. In a world where strategic alliances are important but fluid, the
set of countries who agree that another’s actions violate the terms of a treaty can be small and uncertain. One hypothesis then is that the more open to interpretation, or more difficult a violation
is to verify, the cheaper the ratification cost should be. We discuss this hypothesis and its possible
implications in more detail in the appendix.
(c) Country Specific Costs
There are significant permanent differences across countries in the cost of ratification. To illustrate this variation, in table 1.11 we display the country-specific cost estimates for ten of the
eighty-three countries in our sample, and the variance and coefficient of variation of the costs. In
figure 1.5 we plot a histogram of country specific costs.
Nearly all the country-specific costs are statistically significant. The range over which country
specific costs vary is larger and more economically significant that the range over which treaty
specific costs vary. Mauritania has the smallest estimated cost, and Malaysia has the largest. We
purposely include in the table cost estimates for countries that were discussed above. In particular,
figures 1.1, 1.2, and 1.3 comparing the HRT participation of Argentina and Chile, Kenya and Chad
18 See
for example the Handbook for European Union Election Observation (2008)
51
Table 1.11: Structural Parameter Estimates
Mauritania
El Salvador
Chile
Argentina
Uganda
Kenya
Somalia
Chad
Turkey
Malaysia
Variance
Coefficient of Variation
0.4173
(1.227)
4.260
(0.6075)
4.2903
(0.6023)
4.3504
(0.6041)
4.7963
(0.7281)
5.383
(0.8005)
5.5389
(0.745)
6.1012
(1.003)
7.3958
(1.0141)
11.9950
(1.729)
3.0414
0.3422
Notes: Standard errors should be interpreted with caution. We are not accounting for the variability
of the first stage estimates in our calculation of the standard errors.
Figure 1.5: Frequency Distribution of Country Specific Treaty Costs
52
and Uganda and Chad respectively as well as figure 1.9 in the appendix comparing Argentina and
Somalia all are consistent with these cost estimates.
1.6
Counterfactual Experiments
In this section we use the estimated model to consider counterfactual policy experiments. The
two goals of counterfactual analysis here are first to evaluate the relative importance of potential
competing theories of HRT participation, and second to evaluate equilibrium effects of alternative
policies. Performing counterfactual analysis in dynamic games is often complicated by the potential multiplicity of equilibria. How behaviour responds to a change in the structural parameters
of interest directly depends on the equilibrium played under the counterfactual parameters. If the
model has multiple equilibria, we can not know which equilibrium is played in the counterfactual scenario, and thus we can not make counterfactual predictions.19 Many applications in the
dynamic game literature simply get around this problem by assuming a unique equilibrium, or
assuming that the counterfactual equilibrium is the same as the equilibrium in the data. These
assumptions are strong and in many cases not realistic. Here we adopt the approach of Aguirregabiria (2009). Let (θ̂, Pˆ0 ) represent the estimated parameters and equilibrium choice probabilities
(i.e., P0 = Φ θ̂, P0 ). Then under the assumption that the equilibrium selection mechanism is a
smooth function of the structural parameters, by taking a Taylor expansion around the estimated
structural parameters θ̂ one can obtain an approximation to the counterfactual equilibrium associated with the counterfactual parameters θ ∗ , say P∗app .
20
As we discussed above, it is difficult to assess the relative importance of purported key variables such as foreign aid receipts in the treaty ratification decision simply by looking at the estimated parameters. One of the key benefits of estimating a structural model which allows for
19 If we had a method to solve for every equilibrium in the model we could at least say what all the potential counterfactual scenarios would be. Note however that if the number of equilibria is very large model has little ability to predict
behaviour under counterfactual scenarios.
20 The approximation error can be quite large if the counterfactual is large (i.e., if θ ∗ is very different from θ̂). Supposing the error is small enough so that P∗app lies in the dominion of attraction of the counterfactual equilibrium P∗ , by
Pk
Pk
starting from P∗app and iterating in the mapping Pk+1 = Φ z̃irt
θ + ẽirt
we will reach the counterfactual equilibrium P∗ .
53
several competing theories is that we can “shut down" one theory while allowing for the other(s)
and observe how much the original patterns in the behavioral responses we were interested in
changed. This is precisely what we do in the first set of counterfactual experiments. To get a sense
of the importance of foreign aid receipts in the treaty participation decision, we consider a counterfactual world where the weight on foreign aid receipts in the payoff function is 10% lower than
in the factual world. In the terms discussed above, in this counterfactual world a country would
be willing to forego $50.79 in foreign aid (as opposed to $45.71) for a jump from a 2.0 score on
human rights institutions to 5.0. In figure 1.6 we present a plot of the number of treaties ratified
for each country in the factual and counterfactual equilibrium scenarios. To obtain the number of
treaties ratified we create a dynamic sequence of decisions using the equilibrium choice probabilities in both the factual and counterfactual scenarios, starting from the same state in each case. The
average number of treaties ratified across countries drops from 6.78 under the factual equilibrium
to 4.5 in the counterfactual scenario, a drop of about 33%. The standard deviation of the number
of treaties ratified increases from 2.56 in the factual scenario to 2.89 in the counterfactual scenario.
We also plot treaty capital K in the two scenarios in figure 1.7. While we can not learn anything
by comparing the means across the two scenarios (the mean across countries of K is always 0 by
definition), it is worthwhile to note the difference in standard deviations across the two equilibria.
In the factual case the standard deviation is 2.52 while in the counterfactual case it is 3.42.
Though aid receipts are an important factor in explaining HRT ratification behavior, the same
is not true for human rights. We consider a counterfactual analogous to the one we considered
above, and reduce the weight on human rights in the payoff function by 10%. In this counterfactual world a country is now willing to drop almost 4 points in human rights institutional quality
for an increase from the 25th to 75th percentile in aid receipts. In this case, the mean number of
treaties ratified in the factual case is 6.78, and in the counterfactual case 6.80. In the context of our
model, aid is by far the most significant factor in the treaty ratification decision, at least on the
benefit side.
In the second set of counterfactuals, we try to determine whether country-specific benefit het-
54
Figure 1.6: Number of Ratified Treaties Under Factual and Counterfactual Equilibria
Figure 1.7: K Under Factual and Counterfactual Equilibria
55
Figure 1.8: Number of Ratified Treaties Under Counterfactual Equilibria with No Heterogeneity
erogeneity or cost heterogeneity is more important in explaining the heterogeneity we observe in
ratification decisions. Recall that in the structural model we allow for both permanent differences
in the flow of foreign aid into a country (equation 1.13), and permanent differences in the ratification costs across countries. The experiment we consider here then is to shut down each source of
heterogeneity in turn, and compare ratification behavior in each counterfactual scenario. By comparing the variances in treaty participation across these two counterfactual cases we get an idea of
whether heterogeneity on the cost side or benefit side explains more of the participation decision;
if the variance across countries is larger in the case where there is no cost side heterogeneity we
can conclude that more of the heterogeneity in behaviour is explained by benefit side heterogeneity. Similarly, if the variance is larger in the case where there is no benefit side heterogeneity, we
would conclude the cost side differences are more important. In figure 1.8 we plot the number of
treaties ratified in each case. The key result is that the standard deviation of the number of treaties
ratified in the counterfactual case of no benefit heterogeneity (2.89) is smaller than the standard
deviation of the number of treaties ratified in the case of no cost heterogeneity (3.74). This tells
us that benefit heterogeneity is more significant in explaining heterogeneity in behavior. This is
result is particularly important because benefit heterogeneity has been overlooked as a factor in
explaining heterogeneity in behavior in past studies of HRT participation.
56
1.7
Conclusion
We model the decision of aid receiving countries to participate in human rights treaties at the
United Nations. Using dynamic panel data techniques, we first establish that there are significant
economic returns to treaty ratification. Countries that participate in HRT’s receive more foreign
aid than those that do not. We further establish that countries with good human rights practices
are more likely to participate than countries with bad practices, but that the converse is not true.
Countries with relatively large levels of predetermined treaty participation experience a decline in
human rights practices. Motivated by these findings, we develop and estimate a dynamic game of
treaty ratification. The dynamic model we consider is analogous to a dynamic game of oligopoly
competition where firms invest in quality (eg.,Pakes and McGuire (1994)). Here, aid receiving
countries compete to attract economic resources from the developed world by ratifying costly human rights treaties. We estimate the costs and benefits of ratification, allowing for heterogeneity
across treaties and regimes. We find that the attendant economic returns to ratification induce
countries to ratify. This is a contribution of the paper, as the literature on treaty ratification has
generally focused on the cost of ratification as opposed to the possible benefits in rationalizing observed behaviour. We also find that ratification costs vary significantly across regimes and across
treaties. Specifically, autocracies with poor human rights behavior have a larger cost of ratification
than democracies with poor human rights behavior, but this difference shrinks as human rights
behavior improves.
We are the first to examine how costs may vary across treaties. We find significant variance in
the cost of ratification, and discuss how this variance may be explained by observable institutional
details of the treaties, in particular the verifiability of treaty terms. In future work we hope to
address this issue in more detail, as this result can have interesting policy implications for the
design of international treaties from a welfare perspective.
We then use the estimated model to consider several counterfactual experiments. We evaluate
the relative importance of aid and domestic human rights as factors for explaining ratification
57
behavior, and we find that aid is by far the more significant factor. Decreasing the aid motive
by 10% decreases ratification by 33% while a 10% decrease in the human rights motive hardly
changes ratification behavior at all. We also evaluate the relative importance of benefit side and
cost side heterogeneity in explaining heterogeneity in behavior and find that benefit heterogeneity
is more important.
58
Figure 1.9: Argentina and Somalia Treaty Capital
Figure 1.10: Argentina and Somalia ln(Aid) Receipts
1.8
1.8.1
Appendix
Treaty-Specific Cost Heterogeneity and Observable Treaty Characteristics
There are several candidate characteristics that vary across treaties that measure how “open to
interpretation" a treaty is. For example each treaty contains a section on reservations ratifiers
may have with respect to the contents of the treaty, and objections existing parties may have to
the reservations. Objections and reservations are easier to make (and perhaps more necessary)
when the treaty is less interpretable. Interpreting the treaty specific cost as the choice of the treaty
designer (x-axis) and the rate of reservation/objection as the outcome (y-axis), in figure 1.13 we
59
Figure 1.11: Argentina Treaty Capital and ln(Aid) Receipts
Figure 1.12: Somalia Treaty Capital and ln(Aid) Receipts
60
Figure 1.13: Reservations/Objections per Ratifier and Treaty Costs
Figure 1.14: Treaty Costs and Words per Article
plot the rate of reservation/objection per treaty participant against the estimated treaty specific
entry costs.
There appears to be a negative relationship here; the larger the treaty specific cost, the fewer
reservations and objections made per treaty ratifier. This then begs the question, what exogenous
characteristic of HRT’s determines the costliness of ratification? In light of our example above,
in figure 1.14 we plot the estimated treaty specific entry costs against the number of words per
article for each treaty. Consistent with the example in the text, treaties with more words per article
are cheaper to ratify. This raises potentially interesting optimal institutional design questions.
An optimal treaty design balances verifiability and enforceability of the treaty’s terms with the
61
number of participants to the treaty. While some of these questions could be addressed in the
context of our structural model, we leave this for future work.
Treaty
International Covenant on Economic, Social and Cultural Rights
International Covenant on Civil and Political Rights
International Convention on the Elimination of All Forms of Racial Discrimination
Convention on the Prevention and Punishment of the Crime of Genocide
Convention on the Rights of child
International Convention on the Protection of the Rights of All Migrant Workers and Members of their Families
Convention on the non-applicability of statutory limitations to war crimes and crimes against humanity
International Convention on the Suppression and Punishment of the Crime of Apartheid
Convention on the Elimination of All Forms of Discrimination against Women
International Convention Against the Taking of Hostages
Convention on Prohibitions or Restrictions on the Use of Certain Conventional Weapons which may be deemed to be
Excessively Injurious or to have Indiscriminate Effects
International Convention for the Protection of Performers, Producers of Phonograms and Broadcasting Organisations
Convention on the Political Rights of Women
Convention relating to the Status of Refugees
Convention against Torture and Other Cruel, Inhuman or Degrading Treatment or Punishment
Table 1.12: Human Rights Treaties
10/4/1981
26/10/1961
20/12/1952
28/7/1951
10/12/1984
Date Opened
19/12/1966
19/12/1966
7/3/1966
9/12/1948
20/11/1989
18/12/1990
26/11/1968
30/11/1973
1/3/1980
18/12/1979
62
63
1.8.2
Representation of Choice Probabilities
P θ + ẽP
We show how we arrived at the expression Pir (wirt ) = Φ z̃irt
irt explicitly. For notational
σε
simplicity, let us redefine θ =
θ 21
σε .
Recalling that wirt contains all payoff relevant information for
treaty office r in country i at time t, for the sake of clarity where necessary we will write separately
the ratification status and all other payoff relevant variables in wirt as ( xirt , wirt ), with it being
P (1, w ), vP (0, w ) represent the choice-specific
understood that xirt is included in wirt . Let virt
irt
irt
irt
values to ratification and non-ratification of treaty r in state wirt . The entry threshold for treaty
office r in country i is given by the difference in the choice specific values:
P
P
∆ir = virt
(1, wirt ) − virt
(0, wirt )
Treaty r is ratified in country i when the private information shock ε irt is no larger than this difference, i.e., when
P
P
virt
(1, wirt ) − virt
(0, wirt ) ≥ ε
Then we have the ex-ante probability of ratification in any ( xirt , wirt ) :
P
P
Pir (0, wirt ) = Φ (virt
(1, wirt ) − virt
(0, wirt ))/σε
Pir (1, wirt ) = 1
(1.20)
(1.21)
P (1, w ) − vP (0, w ) = z̃P θ + ẽP . Define the integrated
What we need to show now is that virt
irt
irt
irt
irt
irt
value function of treaty office r in country i as:
P
P
VirP (w) = Eε max virt
(1, wirt ) − ε irt , virt
(0, wirt )
21 Note
however that given the structure of the model we are able to identify σε separately.
64
The model we have described above implies that:
P
virt
(1, wirt ) = Rirt (wirt ; θ ) − (1 − xirt ) ECirt (wirt ; θ ) + δFirw,P (1, wirt )0 Vir
P
virt
(0, wirt ) = Rirt (wirt ; θ ) + δFirw,P (0, wirt )0 Vir
where Vir is the vector of values (as many value functions as there are values of wirt ) and Firw,P (0, wirt )
and Firw,P (1, wirt ) are vectors with transition probabilities. We are able to express the discounted
expected continuation values as δFirw,P ( xirt , wirt )0 Vir by our assumption that the private information shock ε irt is independent from the state variables wirt .
Note that we can express the choice specific values in this way even though we assume that
entry is irreversible, as long as we have Pir (wirt ) = 1 for any vector wirt such that xirt = 1. Then,
by the definition of the integrated value function, we have that:
P
P
P
VirP (wirt ) = 1 − Pir (wirt ) virt
(0, wirt ) + Pir (wirt )virt
(1, wirt ) + eirt
(1.22)
P
P
P
eirt
= Pir (wirt ) E ε irt |virt
(1, wirt ) + ε irt > virt
(0, wirt ) + (1 − Pir (wirt ))0
(1.23)
where
P = σ φ Φ−1 ( P ( w )) (see AguirIn the case of normally distributed error, we have that eirt
ε
ir
irt
regabiria and Mira (2007) for details). Now, by substituting the expressions for the choice-specific
values into the above expression for the integrated value function, we get:
VirP (wirt ) = Rirt (wirt ; θ ) + Pir (wirt ) − (1 − xirt )Cirt (wirt ; θ ) + δFirw,P (1, wirt )0 VirP
+
P
1 − Pir (wirt ) δFirw,P (0, wirt )0 VirP + eirt
65
By stacking the value functions by state w, we can express this more compactly as:
VirP = Rir (θ ) + Pir × [−(1 − xir )Cir (θ ) + δFirw,P (1)VirP
1 − Pir × δFirw,P (0)VirP + eirP
+
where × denotes the element-wise product. Further, define:
Firw,P ≡ (1 − Pir ) × Firw,P (0) + Pir × Firw,P (1)
A ≡
I − δFirw,P
−1
Then we have:
VirP = A ∗ Rir (θ ) − Pir × [(1 − xir )Cir (θ )] + A ∗ eirP
Now recalling the definition of the entry threshold and the expressions for the choice-specific
values, we write the vector of entry thresholds as:
∆irP = virP (1) − virP (0)
= −(1 − x)Cir (θ ) + δ Firw,P (1) − Firw,P (0) Vir
P
= −(1 − x)Cir (θ ) + δFw,P
A
∗
R
(
θ
)
−
P
×
[(
1
−
x
)
C
(
θ
)]
+ δFw,P
ir
ir
ir
D
D A ∗ eir
w,P
w,P
P
= δFw,P
D A ∗ Pir − 1 × (1 − x ) Cir ( θ ) + δF D A ∗ Rir ( θ ) + δF D A ∗ eir
w,P
w,P
where Fw,P
D ≡ Fi,r (1) − Fi,r (0).
Note first that the parameters of the model θ enter into the thresholds in the first two terms
of the expression only. Further, we have assumed that the functions Rirt (wirt , θ ) and ECirt (wirt , θ )
are linear in the parameters θ. We can therefore express the vector of entry thresholds as
∆irP = Z̃Pir θ + ẽPir
Chapter 2
The Value of Information in Regulation
2.1
Introduction
We consider the choice of environmental policy by an imperfectly informed regulator whose
choice of regulation is constrained by the ability of regulated firms to block regulation. We compare the value to the regulator of two distinct types of information, information about the social
cost of pollution and information about the profitability of firms.
Determining when the regulator would prefer learning the social cost of pollution to learning
about firm profitability is important for three reasons. First, cost-benefit analysis has long held
the status as the fundamental yardstick in policymaking, though perhaps by default. As Richard
Posner argues, “My own justification for using cost-benefit analysis... [is based on] ...what I claim
to be the inability of judges to get better results using any alternative approach" (Posner, 2000). The
Clinton administration institutionalized the use of cost-benefit analysis in public decision making
with Executive Order 12866, mandating that agencies “propose or adopt a regulation only upon a
reasoned determination that the benefits of the intended regulation justify its costs" (W.J. Clinton,
1993). Consequently, public policy decisions worth billions of dollars are made on the basis of
cost-benefit analysis each year. Given this, it is important to assess the extent to which public
decisions can be improved by better information. Further, in environments with several sources
of uncertainty, determining what type of information to learn is of practical use.
Second, cost-benefit analyses are notoriously inaccurate. One particularly relevant example is
the uncertainty surrounding the cost of meeting the Kyoto Protocol targets. In a study comparing the cost estimates from eight different sources, the Energy Information Administration1 finds
1 Information
available at the EIA Kyoto website: http://www.eia.doe.gov/oiaf/kyoto/cost.html
66
67
significant variance in estimated costs along many dimensions. For example, estimates of annual
GDP loss in the United States from meeting the targets range from 91 to 311 billion dollars, while
estimated carbon prices range from 147 to 360 1996 dollars per metric ton. Thus, at least seven out
of the eight projections are inaccurate to some degree. This wide range for the estimates indicates
that there is considerable “noise" present in cost-benefit analysis. By characterizing the value of
information we address the question of how much the noise matters, and which type of noise is
most worth reducing.
Finally, executive order 12866 actually requires cost benefit analyses only for regulatory actions
“likely to result in a rule that [has] an annual effect on the economy of 100 million or more," (W.J.
Clinton, 1993). That is, US policy makers have a binding research budget, and this budget is
currently being directed to large regulatory decisions. Our goal is to provide a basis for refining
the way this research budget is allocated. In particular, we determine conditions where equally
costly research capacity is best directed at learning the social cost of pollution and, alternatively,
where this research capacity is best directed at learning the cost structure of regulated firms.
To fix ideas, we consider the specific case of environmental tax policy. Environmental policy is necessarily chosen under uncertainty about both the costs (lost profit to firms) and benefits
(reduced pollution) of any particular regulation. Moreover, environmental regulators are often
constrained by industry influence on the legislative process, making the probability of implementation a central consideration in the choice of tax. Thus, environmental regulation seems a useful
example of our more general problem.
We find that the value of learning costs relative to learning benefits hinges on how politically
powerful firms are, where we say firms are “politically powerful" if a marginal increase in tax has
a large effects on the regulator’s ability to implement the policy. Our primary finding is that when
firms are not politically powerful, information about how costly pollution is to consumer welfare
(benefits of regulation) is more valuable than information about the types of firms in the economy
(costs of regulation). On the other hand, when firms are politically powerful, it pays more to learn
precisely what types of firms are in the economy.
68
The intuition behind this result is best seen by considering an extreme case. If firms possess
no lobbying ability and take any policy as given, all a regulator needs to know to set a first-best
tax is how costly pollution is to consumers. With powerful firms, by contrast, the regulator faces a
trade-off: choosing a tax high enough so that if it passes the legislative process only efficient firms
remain in production, or choosing a regulation that is more likely to pass but that leaves some
inefficient firms in the market. Information about firm types helps the regulator make this trade
off optimally.
2.2
Literature Review
The political economy of regulation has its roots in the work of Olson (1964), Stigler (1971) and
Becker (1985). Broadly speaking, the contribution of this literature is to offer a convincing theory
of “who gets regulated and why". While Olson (1964) frames lobbying by regulated parties as a
collective choice problem and examines how the regulated group’s power to influence regulation
depends on collective action, it is Stigler (1971) who first developed a theory of the demand for
regulation. He argues that the state, which has the unique ability to “prohibit or compel, to take or
give money," is largely at the service of well organized groups. Becker (1985) formalizes the ideas
developed in the earlier papers into a model of strategic competition among pressure (interest)
groups for political influence.
This canonical model of regulation largely neglects the role of information and uncertainty.
Lewis (1996), the foundation of the present work, remedies this by introducing regulatory uncertainty into the Becker model. Lewis’ model incorporates the political constraints of Olson and
Becker in the sense that the probability of implementing a policy decreases in the aggregate losses
incurred by firms. Lewis also assumes the regulator is unable to observe the type (profitability) of
any particular firm in the economy, and shows that in this context a welfare maximizing planner
chooses an inefficient policy. Lewis’ planner would ideally shut down all socially inefficient firms,
but a tax which accomplishes this objective is costly both for firms remaining in the industry and
69
for firms which are forced to shut down. A lower tax, which places a smaller burden on efficient
firms, is more likely to be implemented. On the other hand, with a lower tax, some inefficient
firms remain in the industry.
The optimal (second best) policy is one that does not regulate almost efficient firms but induces
very inefficient firms to exit. The optimal policy is inefficient in the sense that, even if the policy
survives the legislative process, firms whose profit is smaller than the externality cost they create
go unregulated. To determine what type of information is most valuable and when, our model
departs from Lewis’s in that we assume the planner is not only uninformed about firm types,
but also about the social cost of pollution. In Lewis’s model the planner knows the exact cost
of pollution. Our purposes are also different. The punchline of Lewis’s paper is that, although
the first best is achieved by setting tax equal to cost, the regulator chooses a lower tax when he
doesn’t know firm types and must be concerned with lobbying. Lewis is able to characterize the
second best tax in this instance. We start from Lewis’s model knowing that a second-best tax will
be chosen, and show how the second best tax and welfare varies with information.
There is a literature, e.g., Laffont and Tirole (1993), which seeks to characterize an optimal regulatory mechanism under imperfect information. While this literature is broadly related to the
current paper, it seeks to answer a fundamentally different question. Instead of characterizing an
optimal regulatory strategy under imperfect information, the current exercise takes as exogenous
a stylized description of prevailing regulatory institutions and examines the value of information
in the context of these institutions. Informally, the existing literature seeks to inform a more ambitious policy reform than we do. The existing literature asks how we can best constitute our
regulatory agencies. We ask, more simply, how should existing regulatory agencies allocate their
research efforts. That is, we take as given the simpler observed mechanism and ask, given this
mechanism, which type of information is worth more to a welfare maximizing regulator.
Value of Information
The statistical measure of the value of information is the expected value of an informed decision
less the expected value of the uninformed decision. To illustrate this
idea,2
70
consider a risk-neutral
gambler who is offered the following gamble: A fair coin will be tossed once. If the gambler
predicts the side of the coin that faces up correctly, he wins a dollar, while if the prediction is
incorrect, he gets nothing. The gambler can expect to earn 50 cents from the gamble, and so
will pay at most fifty cents to play. Now suppose the gambler knows that before making his
“prediction", the true outcome of the coin toss will be revealed to him. In this case the gambler
expects to earn a dollar from the gamble. Whatever the outcome of the coin toss, he will be able
to predict it correctly. In this simple example, the value of information is fifty cents: the expected
value of a fully informed decision minus the expected value of an uninformed decision.
In the context of our problem, the “gambler" is a welfare maximizing regulator with the option
to learn about either the profitability of the firms present in the economy, or the cost they impose
on society. We perform calculations analogous to the one above in order to determine when each
type of information is more valuable.
2.3
Model
We consider the choice of tax by a regulator who is initially uncertain about firms’ profits and
about the social cost of pollution. The regulator chooses a tax to induce the exit of firms whose
profit does not cover the social cost of production. The implementation of the tax is less likely as
the losses to firms increase. After deriving the optimal taxes under different information structures
we determine what information is most valuable to the regulator in different circumstances.
To be more precise, we consider an economy where heterogeneous firms make a binary decision to either produce or not. Firms that produce earn some profit, and firms that do not earn zero
profit. We assume that the firms face perfectly elastic demand.3 Through productive activity, firms
2 See Arrow and Fisher (1973) for an early application of the value of information in economics. A more rigorous
exposition of the value of information, particularly in the context of irreversibility, is provided in Freixas and Laffont
(1984)
3 While the assumption of perfectly elastic demand is not crucial, it does simplify the value of information calculations considerably. In the absence of this assumption, when the regulator sets a tax he must take into account that the
number of firms in the economy will typically decrease when the tax is higher, and the market price will depend on the
tax.
71
impose a social cost (pollution) on consumers. We assume that, whatever this cost is, it is the same
for all firms. The regulator’s proposed policy may be blocked by firms, and the probability of the
policy being blocked increases in the losses the tax imposes on firms. We obtain optimal taxes and
(expected) social welfare for each possible information set, or ‘type’ of planner, and calculate the
statistical values of different types of information.
Formally, the set of firm types in the population is Θ = {θ L , θ H }. The population frequency of
low types θ L is α ∈ [0, 1]. The θi ’s describe firm profit conditional on entry, with a firm of type θi
earning profit θi gross of any tax. For expositional clarity we suppose that at most two firms may
operate in the economy. Let i ∈ {1, 2} index the realized set of firms. The sample of realized firms,
(θ1 , θ2 ), is binomially distributed, with parameters (2, α), with α ∈ [0, 1]. That is,
Prob (θ1 , θ2 ) = (θ L , θ L ) = α2 ,
Prob (θ1 , θ2 ) = (θ L , θ H ) = Prob (θ1 , θ2 ) = (θ H , θ L ) = α(1 − α),
Prob (θ1 , θ2 ) = (θ H , θ H ) = (1 − α)2 .
The social cost of pollution, c, takes one of two values, {c L , c H }, where the population frequency of low costs is q ∈ [0, 1]. Thus there are six possible states of the world, with representative element s ∈ S. The distribution of probability µ(s) over the set of states is given in table 2.3.
A planner who learns c will be said to “learn the social cost of pollution", while a planner who
learns (θ1 , θ2 ) will be said to “learn firm types". This model is among the simplest in which we
can sensibly talk about more than one type of information.
Under any proposed policy, firms either pay the tax and remain in production, or exit and
pay nothing. The losses firms incur are the sum of profit forgone by firms that exit (those with
profit less than the size of the tax), and the taxes paid by firms who remain in production. The
firm’s stay/exit decision is completely determined by whether profit net of the tax is positive or
negative. We assume that indifferent firms exit.
Alternatively, we can imagine that the regulator faces a single firm which produces 0, 1, or 2
72
Table 2.1: Distribution of probability over states
State s
{θ L , θ L , c L }
{θ L , θ H , c L }
{θ H , θ H , c L }
{θ L , θ L , c H }
{θ L , θ H , c H }
{θ H , θ H , c H }
Prob(s) ≡ µ(s)
α2 q
2α(1 − α)q
(1 − α )2 q
α2 (1 − q )
2α(1 − α)(1 − q)
(1 − α )2 (1 − q )
units of output. θi denotes the firm’s profit from producing unit i. The firm’s production decision
hinges on whether the profit from a marginal unit is sufficiently high to cover the tax. This is
precisely the logic behind the Pigouvian tax. With this alternative interpretation noted, we adhere
to our original interpretation for the remainder of the paper to preserve the parallel with Lewis
(1996), the motivation of the present work.
To capture the idea that lobbying makes implementation uncertain we follow Lewis (1996),
and suppose that if a tax τ imposes aggregate losses L on firms, the probability of the policy being
implemented is P(L), where P0 ( L) < 0 and P00 ( L) < 0.4 In words, as aggregate losses increase
the chances of the policy passing decrease at an increasing rate. From the planner’s perspective,
proposed policy is not implemented with certainty, so that choosing a tax is a gamble. The political power of firms depends on the shape of the P(·) function. If P(·) is steep, small changes in
the losses the policy imposes on firms greatly decrease the probability of the policy being implemented. In other words, marginal changes in L are important, and in such a case we say that firms
are politically powerful.5
We also assume:
Ec > θ H > c L ≥ θ L ,
4 We
are assuming here that while firms have the ability to organize and (probabilistically) affect regulation outcomes, consumers do not. One could imagine a more general model where consumer and industry interest groups
compete over the regulation outcome, as in Grossman and Helpman (2001).
5 One could imagine allowing firms to make a lobbying decision where there is some effort cost to lobbying intensity.
In this case strategic interactions both between firms and between firms and the regulator become important. In some
instances firms may find it profitable to free-ride on the lobbying effort of other firms, which the regulator must consider
when choosing a tax. Further, the regulator must also take into account the fact that lobbying intensity will depend on
the tax he selects. While these issues are potentially interesting in their own right, these extensions render our model
intractable.
73
where Ec denotes the expected cost of pollution, qc L + (1 − q)c H . θ H > c L means that when the
planner draws the sample of firm types and the social cost of pollution, there is strictly positive
probability that at least one firm is efficient. c L ≥ θ L means that, for whatever social cost the
planner has drawn, there is strictly positive probability that the planner has drawn an inefficient
firm. We discuss the role and restrictiveness of each of the three inequalities in turn. First consider
the assumption Ec > θ H . This assumption does not effect the main results and is made in order
to keep the scope for regulation large. Notice that without this assumption, a planner informed
about firm types who realizes {θ H , θ H } would not want to set a tax at all. Avoiding such cases
keeps the analysis more tractable (see section 5 below). The assumption θ H > c L is made to
keep the regulator’s problem interesting. If instead θ H < c L there are no efficient firms. Finally
consider the assumption c L ≥ θ L . If instead c L < θ L , if the planner chooses to learn social cost
first and realizes c = c L , he does not wish to set any tax; any firm draw is efficient. As in the
case of Ec > θ H , this assumption is made to keep the analysis more tractable. We argue that this
assumption is not restrictive, however. To see this, suppose that instead of what was presented
above, our planner draws the firms from a continuum Θ = [θ, θ ] and social cost from a continuum
C = [c, c]. In this alternative framework, as long as Θ ∩ C 6= ∅ and c ≥ θ, this assumption
holds. This alternative setup would also ensure that there is an efficient firm in the population,
i.e., θ H > c L .
2.4
Planner’s Problem
The planner’s objective is to choose a tax to induce the exit of firms whose profit is no larger than
the externality they create. The planner chooses the tax taking into consideration the information
he has, and that policies are not implemented with certainty. The planner’s welfare criterion does
not directly include the loss the tax imposes on firms since the tax is simply a transfer.
74
Let the function 1`i indicate that the profit of firm i is greater than the tax τ. That is,
1 `i =
1 if θi > τ
0 otherwise.
Then for any given realization {θ1 , θ2 , c} ∈ S, the payoff to a planner who chooses tax τ is given
by:
G (τ, s) = P( L(τ, s)) 1θ1 (θ1 − c) + 1θ2 (θ2 − c) + (1 − P( L(τ, s))) ∑(θi − c)
i
where:
L(τ, s) =
∑
1 θi τ + (1 − 1 θi ) θ i
i
At any state s, a fully informed planner or government chooses a tax that solves:
max P( L(τ, s)) 1θ1 (θ1 − c) + 1θ2 (θ2 − c) + (1 − P( L(τ, s))) ∑(θi − c)
τ ∈<
(2.1)
i
Let τF∗ (s) denote the optimal full information tax in state s, and G (τF∗ (s), s) be the planner’s
payoff. We can write the expected full information payoff as:
EGF =
∑ G(τF∗ (s), s)µ(s)
s∈S
In contrast, an uninformed planner chooses a tax independent of the true state of the world.
In particular, he chooses a tax to solve:
max ∑ P( L(τ, s)) 1θ1 (s) (θ1 (s) − c(s)) + 1θ2 (s) (θ2 (s) − c(s))
τ ∈< s∈S
+(1 − P( L(τ, s)))
∑(θi (s) − c(s))
i
µ(s)
(2.2)
75
Table 2.2: Information partition Ωθ induced by information about firms
Event ωθ
{{θ L , θ L , c L }, {θ L , θ L , c H }}
{{θ L , θ H , c L }, {θ L , θ H , c H }}
{{θ H , θ H , c L }, {θ H , θ H , c H }}
Probability µ(ωθ )
α2
2α(1 − α)
(1 − α )2
Let τU∗ denote the optimal uninformed tax, and G (τU∗ , s) be the planner’s payoff to choosing this
tax when the true state is s. Note that here the planner may choose only one tax regardless of what
the true state is, as information about the state will not be revealed to him. We can thus write the
expected uninformed payoff as:
EGU =
∑ G(τU∗ , s)µ(s)
s∈S
To calculate the value of learning the social cost of pollution or firm type, we must state the
partially informed planner’s problem in each case.
To describe how new data changes the planner’s beliefs, it is convenient to use the concept of
information “partitions." Formally, a partition of the sample space S is a collection of subsets, or
events, A1 , ..., An such that ∪in=1 Ai = S and ∩in=1 Ai = ∅. A partition groups realizations {θ1 , θ2 , c}
together, and allows a planner to distinguish across, but not within groups of realizations.
Letting Ωθ represent the partition induced by information on firm types, ωθ an event in the
partition, and µ(ωθ ) the probability measure defined on the partition, the elements of the partition
and their population frequencies are given in table 2.4. Information about firm types partitions the
set of states into three disjoint sets, which the planner can distinguish from one another. Each of
these sets contains two elements, between which the planner is unable to distinguish. In words,
when the planner learns firm types he learns whether he faces two low type firms, two high types,
or one of each type. However, he does not learn whether social cost of pollution is high or low.
A planner who learns the types of firms but is uncertain of the social cost has expected payoff,
max
∑
τ ∈< s∈ω
θ
P( L) 1`1 (θ1 − c(s)) + 1`2 (θ2 − c(s)) + (1 − P( L)) ∑(θi − c(s)) µ(s|s ∈ ωθ ). (2.3)
i
76
Here the planner does not know the realization {θ1 , θ2 , c} exactly, but he knows that the realization
falls in the set ωθ . As before, for any state s and tax τ the payoff G (τ, s) is obtained. The planner
thus chooses τ to maximize the expected value of G (τ, s), given that s ∈ ωθ . Let τΩ∗ θ (ωθ ) be the
solution to this problem, the optimal tax under partial information about firm types. This tax
balances the benefit of shutting down socially inefficient firms against the cost of reducing the
probability of implementation.
We can write the expected partial information payoff from learning firm types as,
EGΩθ =
∑ ∑
ωθ ∈ Ω θ s ∈ ωθ
G (τΩ∗ θ (ωθ ), s)µ(s|s ∈ ωθ )µ(ωθ ).
In words, a planner anticipating information about firm types forms an expectation over the information he could receive (the elements ωθ of the partition Ωθ ), knowing that once he obtains this
information he will choose the tax that maximizes expected social welfare conditional on his information. Note that this partially informed problem is intermediate between the ones the planner
faces when fully informed and uninformed. In (3) the planner can choose one of three taxes, one
for each element of the partition. This contrasts with the fully informed and uninformed planner’s
problem, where the planner chooses six taxes and one tax respectively.
A planner who chooses to be informed about the social cost of pollution learns whether the cost
of pollution is high or low, but not the types of firms that are present. Formally, information about
social cost of pollution c partitions the set of states of the world into two disjoint sets between
which the planner can distinguish. Each set contains three elements, between which the planner
cannot distinguish. Letting Ωc represent this partition and ωc an event in Ωc , table 2.4 gives the
elements of the partition and their population frequencies.
We can now write the partial information payoffs when the planner learns the social cost of
pollution but not firm types,
max
∑
τ ∈< s∈ω
c
P( L(τ, s)) 1`1 (s) (θ1 (s) − c) + 1`2 (s) (θ2 (s) − c) + (1 − P( L(τ, s))) ∑(θi (s) − c) µ(s|s ∈ ωc ).
i
77
Table 2.3: Information partition Ωc induced by cost information
Event ωc
{{θ L , θ L , c L }, {θ L , θ H , c L }, {θ H , θ H , c L }}
{{θ L , θ L , c H }, {θ L , θ H , c H }, {θ H , θ H , c H }}
Probability µ(ωc )
q
1−q
Letting τc∗ (ωc ) denote the optimal tax under partial information about social cost given the
information ωc , the expected partial information payoff is:
EGΩc =
∑ ∑
ωc ∈ Ω c s ∈ ωc
G (τc∗ (ωc ), s)µ(s|s ∈ ωc )µ(ωc ).
The model we have presented is extremely simple. In particular, we have greatly simplified
our task by assuming that all firms impose the same pollution cost on society. In an extension we
consider the possibility that firms have heterogeneous costs of pollution (see section 7). Finally,
we note that we have also assumed the cost of obtaining information to be constant across the two
types of information. One could easily allow for the costs to be different across information types,
and introduce a parameter measuring this difference into the problem. The substantive results
presented below will not change; all comparisons between values of information will be made
relative to this parameter.
2.5
Optimal taxes
To calculate the value of the three types of information, we need to obtain the tax that maximizes
social welfare for a planner with each possible type of information: a fully informed planner,
either type of partially informed planner, and an uninformed planner. We can then characterize
and compare the values of the different types of information.
We first observe that a welfare-maximizing planner may restrict himself to choosing a tax from
{θ L , θ H } without loss. That is, a planner with any information can ignore taxes outside the set
{θ L , θ H } when choosing the optimal tax. To see this, note first that a planner is indifferent between
78
Table 2.4: Optimal taxes in each possible state
State s
{θ L , θ L , c L }
{θ L , θ H , c L }
{θ H , θ H , c L }
{θ L , θ L , c H }
(
{θ L , θ H , c H }
{θ H , θ H , c H }
θL
Optimal Tax τF∗ (s)
{θ L , θ H }
θL
θL
{θ L , θ H }
(θ H −c H )+(θ L −c H )
if
≤ P(Pθ(2θ+Lθ) )
(θ −c )
θ H if
L
H
(θ H −c H )+(θ L −c H )
(θ L −c H )
H
>
L
P(2θ L )
P(θ H +θ L )
θH
taxes τ = θ H and τ ∈ (θ H , ∞). The former leaves a high type firm with zero profit, and we have
assumed that firms exit when they earn zero profit, while the latter set of taxes clearly induces
shut-down. Both types of firm exit for any τ ∈ [θ H , ∞), and aggregate losses are constant over
[θ H , ∞). Notice as well that, as we have assumed c L ≥ θ L , a tax of τ = θ L is strictly preferred by the
planner to any tax τ < θ L . Under the latter, no firms exit when the policy is passed, while under
the former low-types exit, and a planner would always like to induce low firms to exit. Together
these imply that, without any loss in generality, we need only consider taxes τ ∗ ∈ [θ L , θ H ].
It only remains to show that the optimal tax may not lie in the open interval (θ L , θ H ). Suppose
to the contrary that τ ∗ ∈ (θ L , θ H ). By decreasing the tax by ε ∈ (0, τ ∗ − θ L ), the planner reduces
the loss directly incurred by firms while leaving firm exit decisions unchanged. But if this is true,
then the planner has increased the probability of implementing the tax policy without changing
expected welfare, contradicting the optimality of τ ∗ . Therefore the only taxes we need consider
are in the set {θ L , θ H }.
We can now calculate the optimal tax in each of the six possible states in S. Letting τF∗ (s) denote
the optimal full information tax in state s, the optimal taxes are given in table 2.5.
Given the realization {θ L , θ L , c L }, the planner is indifferent between any taxes that induce low
types to exit the market. Any tax at least as large as θ L induces low type firms to exit, and losses
incurred by the low type firms do not increase once tax exceeds θ L . As we can always restrict
ourselves to the set {θ L , θ H } when looking for the optimal tax, this completes the argument. The
79
same argument applies to the realization {θ L , θ L , c H }.
Now consider {θ L , θ H , c L }. In this state the planner optimally shuts down the low-type and
leaves the high type unregulated. A tax of θ L uniquely solves this problem. In the case of
{θ H , θ H , c L } the planner would prefer to leave both firms unregulated. Under a tax of θ L neither
firm exits.6 For the realization {θ H , θ H , c H } the planner optimally shuts down both firms, which
any tax at least as large as θ H does.
Finally, consider {θ L , θ H , c H }. In this case the planner faces a trade-off. On the one hand, both
of the firms are inefficient, and the planner would ideally shut down both of them, which requires
a high tax, τ = θ H . However, while a high tax may shut down both firms, such a tax imposes
larger losses on firms and is less likely to be enacted than a low tax that shuts down only the most
inefficient firm θ L . The optimal tax can be θ L or θ H depending on the importance of lobbying and
the relative efficiencies of the firms.
To see this more clearly, note that when the P(·) function decreases quickly, as is the case when
lobbying is important, the ratio
P(2θ L )
P(θ H +θ L )
is very large. The planner would choose a low tax in this
case, as it is preferable to shut down the more inefficient low type with higher probability than
risk not shutting down either firm by setting a high tax and regulating both firms. Note as well
that when θ H ∼ θ L we have that
(θ H −c H )+(θ L −c H )
(θ L −c H )
>1∼
P(2θ L )
P(θ H +θ L )
and the planner regulates both
firms by setting a high tax. The firms are almost equally efficient and the planner does not benefit
by treating them differently. On the other hand, when θ L < θ H ∼ c H so that the firms are different
and the high type is almost efficient we get that
(θ H −c H )+(θ L −c H )
(θ L −c H )
∼1<
P(2θ L )
,
P(θ H +θ L )
and the planner
prefers a low tax, and thus treats the two firms differently.
We now turn attention to the calculation of optimal taxes when the planner learns firm types
but not the social cost of pollution. Optimal taxes for each element of the relevant partition are
given in table 2.5.
When {{θ L , θ L , c L }, {θ L , θ L , c H }} is realized, whatever the social cost of pollution, both firms
6 Note
enacted.
that for the social welfare criterion we use, in this case the planner does not care whether or not regulation is
80
Table 2.5: Optimal taxes when the planner knows firm types only
Optimal Tax τθ∗ (ωθ )
{θ L , θ H }
(θ H − Ec)+(θ L − Ec)
if
≤ P(Pθ(2θ+Lθ) )
(θ − Ec)
Event ωθ
{{θ L , θ L , c L }, {θ L , θ L , c H }}
(
{{θ L , θ H , c L }, {θ L , θ H , c H }}
θL
L
θ H if
(θ H − Ec)+(θ L − Ec)
(θ L − Ec)
H
>
L
P(2θ L )
P(θ H +θ L )
{{θ H , θ H , c L }, {θ H , θ H , c H }}
θH
Table 2.6: Optimal taxes when social cost is known
Event ωc
{{θ L , θ L , c L }, {θ L , θ H , c L }, {θ H , θ H , c L }}
(
{{θ L , θ L , c H }, {θ L , θ H , c H }, {θ H , θ H , c H }}
θL
Optimal tax τc∗ (ωc )
θL
P(2θ L )(θ L −c H )− P(θ H +θ L )[(θ H −c H )+(θ L −c H )]
1− α
> α
if
P(2θ )[θ −c ]
θ H if
H
H
H
P(2θ L )(θ L −c H )− P(θ H +θ L )[(θ H −c H )+(θ L −c H )]
P(2θ H )[θ H −c H ]
≤
1− α
α
are inefficient. Any tax at least as large as θ L induces the firms to exit, and all such taxes impose
equal losses on the firms. Thus the optimal tax is θ L or θ H .
When {{θ L , θ H , c L }, {θ L , θ H , c H }}, is realized the planner faces a trade-off similar to that when
he is fully informed and the state of the world is given by {θ L , θ H , c H }}. We have assumed that
Ec > θ H so both firms are inefficient in expectation. From the point of view of expected efficiency
a high tax is strictly optimal. However, high taxes are less likely to be implemented than low taxes,
and thus the preferred tax hinges on the relative inefficiencies of the two types of firms. The more
inefficient the low type is relative to the high type, or equivalently, the faster the probability of
implementation decreases in losses, the greater the planner’s inclination to choose a low tax.
Last, when the realization is {{θ H , θ H , c L }, {θ H , θ H , c H }}, both firms are inefficient in expectation. Given the planner’s information it is optimal to regulate both firms and thus a high tax is
chosen.
We now calculate optimal taxes under the assumption that the planner knows the social cost
but not firm types. The optimal taxes under the partition induced by this information are given in
table 2.5.
When costs are low, i.e., {{θ L , θ L , c L }, {θ L , θ H , c L }, {θ H , θ H , c L }} is realized, only the low type is
81
inefficient. A low tax is uniquely optimal since a high tax shuts down efficient firms.
When costs are high, {{θ L , θ L , c H }, {θ L , θ H , c H }, {θ H , θ H , c H }} is realized. If the planner knew
that the true state was an element of {{θ L , θ L , c H }, {θ H , θ H , c H }} the optimal policy would clearly
be a high tax. That the planner cannot distinguish between these states and {θ L , θ H , c H } complicates matters, because the fact that {θ L , θ H , c H } is possible forces the planner to weigh the net
benefit of a low tax against the net benefit of a high tax. The planner must choose between
a higher probability of implementation at the cost of efficiency, and more efficiency at the cost
of lower probability of implementation. If, conditional on the true state being an element of
{{θ L , θ L , c H }, {θ H , θ H , c H }}, the likelihood of drawing the realization {θ L , θ H , c H } is sufficiently
high, and the probability of implementation decreases quickly enough as losses increase, the planner will prefer a low tax to a high one.
Finally, we derive the uninformed planner’s optimal tax. This tax is given by:
τU∗
=
θL
if
P(2θ L )(θ L − Ec)− P(θ H +θ L )[(θ H − Ec)+(θ L − Ec)]
P(2θ H )(θ H − Ec)
>
1− α
α
θ
H otherwise.
Recalling that an uninformed planner must trade-off probability of implementation with efficiency, this condition says that when marginal changes in political resistance are important, an uninformed planner chooses a low tax, and when marginal changes are not so important he chooses
a high tax. Of course the relative frequencies of firm types and social cost play a role as well. For
example if the probability of drawing a high type firm is sufficiently large relative to the probability of drawing a low type firm (i.e., α → 0) it is easy to see from the above expression that a high
tax is optimal.
2.6
The Value of Information
In this section we use the optimal taxes derived above to determine which information is more
valuable to the planner, and when.
82
By definition, the value of learning firm types is given by,
VΩθ = EGΩθ − EGU ,
while the value of learning social cost of pollution is,
VΩc = EGΩc − EGU .
With this notation in place, we can state our first proposition.
Proposition 1
1. The value of learning the types of firms, VΩθ , is zero if firms’ political power is sufficiently small.
Formally:
If
P(2θ L )(θ L − Ec) ≥ P(θ H + θ L )[(θ L − Ec) + (θ H − Ec)]
then
VΩθ = 0.
2. The value of learning social cost of pollution, VΩc , is zero if the firms’ political power is sufficiently
large. Formally:
If
P(2θ L )(θ L − c H ) ≤ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )] +
1−α
P(2θ H )(θ H − c H )
α
then
VΩc = 0.
Beginning with proposition 1.1, consider an uninformed planner who optimally taxes τU∗ . Suppose that once the planner learns the firm types but not social cost of pollution, the tax τU∗ remains
optimal for any possible pair of firm types. That is τU∗ = τθ∗ (ωθ ) for all ωθ ∈ Ωθ . Then, in an exante sense the planner receives the same expected payoff whether he is informed about firm types
or has no information at all. For parameter values such that this is true, information about firm
types cannot have value. Proposition 1 provides conditions under which the planner’s optimal
tax with and without information are the same. When this is the case, information has no value.
83
To understand intuitively when the value of information about firm types can be zero, let us
first determine when it is strictly positive. Suppose first that the planner only regulates the low
type firm when uninformed: τU∗ = θ L . Looking at table 2.5, if ωθ = ({θ H , θ H , c L }, {θ H , θ H , c H })
is realized, that is, the planner learns that the sample contains two high types, the optimal tax
decision is τ ∗ (ωθ ) = θ H . Thus, with probability (1 − α)2 information about firm types changes the
optimal tax decision when the optimal uninformed tax is low, and in this case learning the sample
must be of value.
Now suppose an uninformed planner optimally chooses: τU∗ = θ H . Then, again looking at
table 2.5, if either
ωθ = ({θ L , θ L , c L }, {θ L , θ L , c H }),
or
ωθ = ({θ H , θ H , c L }, {θ H , θ H , c H })
is realized, then τU∗ = θ H remains optimal. In the first case the planner is indifferent over all taxes
that induce low types to exit, and in the second case, since Ec > θ H the planner prefers a high
tax. However, if the realization is ({θ L , θ H , c L }, {θ L , θ H , c H }), the optimal tax may be high or low,
depending on how quickly P(·) decreases. If P(·) decreases quickly, then going from a low tax to
a high tax can drastically reduce the probability of implementing the policy, while if P(·) does not
decrease quickly the optimal tax under this realization is high. If the latter is true, then for any
possible realization of firm types the planner prefers a high tax. If he also optimally chooses a high
tax when he has no information, then information about firm types cannot have value, because it
does not change the optimal tax. The above argument is exactly summarized by the inequality in
proposition 1.1, which simply says that a planner who realizes {{θ H , θ L }, c} prefers a high tax.
Example: Consider the parameterization,
P( L) = 1 −
1 2
L ,
B
where B ∈ [(2θ H
)2 , ∞) so that
84
P( L) is a proper probability distribution. This choice for the function
P( L) provides a convenient way to consider comparative statics with respect to political power.
Note that
P0 ( L) = −
2L
,
B
P00 ( L) = −
2
B
As we noted above, the political power of firms as we have defined it here, depends on the shape
of the P(·) function. If marginal changes in L are important we say that firms are politically
powerful. Changing the parameter B allows us to change how important marginal changes in L
are, holding all else fixed. The way we have set up this example, small values of B correspond to
high political power, while larger values of B correspond to low political power. Now, under this
parameterization, proposition 1.1 says that the value of learning firm types is zero if:
1−
1
1
(2θ L )2 (θ L − Ec) ≥ 1 − (θ L + θ H )2 (θ L − Ec) + (θ H − Ec)
B
B
which, after rearranging yields:
1 − B1 (2θ L )2
1−
1
B (θ L
+ θ H )2
≤ 1+
θ H − Ec
.
θ L − Ec
Note that the right hand side of this expression is greater than 1 by our assumption Ec > θ H .
Then, as B gets large,
1 − B1 (2θ L )2
1 − B1 (θ L + θ H )2
→ 1 < 1+
θ H − Ec
θ L − Ec
In words, as the slope of P( L) gets very flat so that marginal changes in L leave the probability
of implementation unaffected, the inequality holds with certainty and the value of learning firm
types is zero. On the other hand, it is straightforward to find parameter values for which the
inequality does not hold. If θ L ≈ 0 and θ H ≈ Ec, the right hand side becomes 1 while one can find
appropriate values for B and Ec so that the left hand side is larger than 1.
85
To understand proposition 1.2, note that information about social cost can only have zero value
if the planner chooses the same tax for any value of social cost he draws. Further, recall that
information about social cost has value if τU∗ = θ H . This is true because when social cost is low
the optimal tax is τ ∗ = θ L for any possible sample. Thus information about social cost changes
the planner’s tax choice with positive probability (probability q). So for information about social
cost to have no value, the planner must optimally regulate only the low type for any realization
of social cost. In other words, the planner must tax low even in the event c = c H . For this to be
true, lobbying must be very important. Indeed, the inequality in proposition 1.2 says that when
the planner does not know firm types but knows that social cost is high, he prefers a low tax.
Example cont’d
Continuing with the parameterization given above, proposition 1.2 says that the
value of learning the social cost of pollution is zero if:
1−
1 − α
1
1
1
(2θ L )2 (θ L − c H ) ≤ 1 − (θ L + θ H )2 (θ L − c H ) + (θ H − ch ) +
1 − (2θ H )2 (θ H − c H )
B
B
α
B
First, letting B get large we have:
1
(2θ L )2 (θ L − c H ) → θ L − c H
B
1
1 − ( θ L + θ H )2 ( θ L − c H ) + ( θ H − c h ) → ( θ L − c H ) + ( θ H − c H )
B
1 − α
1
1−α
1 − (2θ H )2 (θ H − c H ) →
(θ H − c H )
α
B
α
1−
and the inequality thus approaches:
0 ≤ θH − cH
and as we have assumed θ H < c H , the inequality does not hold. This is indeed consistent with
proposition 1.2, as it shows that when firm political power becomes arbitrarily small the value of
learning social cost of pollution cannot be zero. Now, letting B approach its lower bound, (2θ H )2
86
we have:
1
(2θ L )2
(θ L − c H )
(2θ L )2 (θ L − c H ) → 1 −
B
(2θ H )2
1
( θ L + θ H )2
(
θ
−
c
)
+
(
θ
−
c
)
1 − ( θ L + θ H )2 ( θ L − c H ) + ( θ H − c h ) →
1−
L
H
H
H
B
(2θ H )2
1−α
1
1 − (2θ H )2 (θ H − c H ) → 0
α
B
1−
In this case, the inequality reduces to
1−
(2θ L )2
( θ L + θ H )2
(
θ
−
c
)
≤
1
−
(θ L − c H ) + (θ H − c H )
L
H
2
2
(2θ H )
(2θ H )
This inequality holds for certain parameter values. One example of where it does hold is when
θ L ≈ 0 and θ H ≈ c H .
Proposition 2
1. Learning the social cost of pollution yields the expected full information payoff if firms’ political power
is sufficiently small. Formally:
If
P(2θ L )(θ L − c H ) ≥ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )]
then
VΩc = VF .
2. Learning only the types of firms never yields the expected full information payoff. That is,
VΩθ < VF .
For information about social cost of pollution to yield the full information payoff, learning firm
types once social cost is known must not change the planner’s tax choice in any state of the world.
Similarly, for information about firm types to yield the full information payoff, learning social cost
once firm types are known must not change the planner’s tax choice in any state of the world.
87
Let us first establish that learning firm types can never yield the full information payoff. To see
this, note that if the planner learns that both firms are high types, i.e., {(θ H , θ H , c L ), (θ H , θ H , c H )}
is realized, the planner would strictly prefer to leave the firms unregulated if the true social cost
of pollution is c L , but would optimally regulate them if true social cost of pollution is c H . Thus we
have found a possible realization of firm types where information on social costs would strictly
change the planner’s tax choice. θ does not yield the full information payoff. The planner knows
that there is positive probability of drawing a sample of firm types such that it will still pay to
learn social cost.
Learning social cost of pollution can yield the full information payoff, however. To see this,
note first that the optimal tax does not vary across the states {θ L , θ L , c L }, {θ L , θ H , c L }, {θ H , θ H , c L };
once the planner observes that social cost of pollution is low, regardless of what the types of firms
are, a low tax is optimal. Now suppose that c = c H . The three possible states of the world are
then:
{ θ L , θ L , c H }, { θ L , θ H , c H }, { θ H , θ H , c H }
In the first and last case a high tax is optimal. However, for a mixed sample the level of the tax
depends on how quickly P(·) decreases, or how likely the policy is to pass. If lobbying is not
important, the planner would choose to regulate both firms, and set a high tax. If this is true, then
there is a unique optimal tax across all possible firm pairs for each possible level of social cost, and
learning c yields the full information payoff.
Example cont’d After some re-arranging, under the parameterization above proposition 2.1 says
that learning social cost yields the full information payoff when:
1 − B1 (2θ L )2
1 − B1 (θ L + θ H )2
≤ 1+
θH − cH
.
θL − cH
Clearly, as B becomes arbitrarily large the inequality holds. In words, as the function P( L) becomes
arbitrarily flat and marginal changes in L do not affect the probability of implementation, learning
88
the social cost of pollution yields the full information payoff.
Corollary: Marginal value of information
1. Learning firm types after social cost is known is of no value if firm political power is sufficiently small.
2. Information about social cost always has value when firm types are known.
The “marginal" values of information, (i.e., what the planner would pay to learn social cost of
pollution after information about firm types has been obtained, and vice versa) also depends on
how important lobbying is. When firms possess significant political power, both marginal values
are strictly positive. This is the same as saying that when lobbying is very important, learning only
one type of information is not enough for a first-best solution. When lobbying is not important,
it is always worthwhile to learn c after learning firm types, but it may be of no value to learn the
sample after learning c.
Whether or not the marginal value to either type of information is positive follows immediately
from our discussion about whether partial information can yield the expected full information
payoff. We noted that once the types of firms have been observed the planner is always willing
to pay for information about the social cost of pollution. Thus the marginal value of information
is always positive in this case. We also noted that if lobbying is not important, information about
social cost of pollution yields the expected full information payoff, and thus the marginal value of
information is zero. However if lobbying is important, partial information about social cost can’t
yield the expected full information payoff, and the marginal value of information is positive.
2.7
Extensions
In this section we discuss potential extensions to the model we have presented above.
7.1 Firm Political Power
89
We now consider the possibility that firms differ in their political power. Specifically, we assume
that with some probability γ the losses of firm i do not enter into the probability of implementation
function P( L). Formally we define two indicator functions, 1 Ri , i ∈ {1, 2} such that Pr (1 Ri = 0) =
γ and now write the loss function associated with tax τ in state s as
L(τ, s) =
∑
1 Ri Li (τ, s)
(2.4)
i =1,2
where, as before,
Li (τ, s) = 1θi τ + (1 − 1θi τ )θi .
(2.5)
In this framework there are three possible types of information a planner can acquire, information
about profit types {θ1 , θ2 } , information about social cost of pollution c, and information about
firm political power {1 R1 , 1 R2 }.
Tables 2.7-2.10 in the appendix describe the optimal taxes for a fully informed planner, a planner informed about political power, a planner informed about social costs, and a planner informed
about firm types respectively.
We find that learning which firms have political power never yields the full information payoff,
and has no value at all if (holding all else constant) either α is very small, θ H ∼ θ L or P(·) is
not steep (lobbying is not important). In such cases an uninformed planner chooses a high tax,
and a planner who learns political power, regardless of what he learns, also chooses a high tax.
Further, for γ very small (so that both firms are likely to have power), the propositions in the paper
continue to hold. This is straightforward: As γ → 0 we approach the model in the paper. On the
other hand as γ approaches 1 and neither firm has power, information about social cost yields the
full information payoff. In this case when cost is low the planner only wants to regulate low type
firms, and when the cost is high he wants to regulate both.
90
Notice that a planner who does not directly observe firm profit types or the social cost of production does learn something about the economic environment simply by observing the expenditures/effort undertaken by firms in opposition to a proposed tax policy. In a further extension
to the analysis we have presented one could examine the value that accrues to simply observing
firm expenditures as a response to a tax. Considering such an extension would require further
assumptions and modifications to the model we have presented here. Firstly we do not model
explicitly how firm expenditures in opposition to a policy affect the probability of a policy’s implementation; we simply allow the probability of implementation to be an unspecified function
of the firm types. One would need to explicitly model this relationship to consider the value of
observing expenditures. Second, the timing implicit in the model we have presented is such that
one would need to consider a dynamic version of the model to answer the question at all. Specifically, a planner chooses a tax and then observes expenditures made by firms. In the context of our
model, it is too late to learn from firm expenditures once they occur. One would need to consider
the problem in the context of a dynamic game.
7.2 Heterogeneity in Pollution Costs
Now suppose that the firms earn identical profit θ ∈ {θ L , θ H }, but differ in the costs they
impose on society ci ∈ {c L , c H }, i = 1, 2. We maintain all other parametric assumptions made
in the paper. Tables 2.11-2.13 in the appendix describe the optimal taxes for each possible type
of planner, a fully informed planner, a planner who knows profit θ but not the costs of pollution
{c1 , c2 }, a planner who knows the costs of pollution {c1 , c2 } but not profit θ, and a completely
uninformed planner.
The key result in this extension is that the value of learning the common profit level θ is zero,
and learning the cost types of the firms {c1 , c2 } yields the full information payoff. While firms
are inefficient in expectation by assumption, what is important is that firms differ in terms of how
inefficient they are. This is true both in the model we present in the paper and in the extension
here. In the model presented in the paper, the difference in efficiency across firms is driven by
91
differences in profit type θ. The cost of pollution is common across the firms. By contrast, in the
model we present in the extension profit is common across the two firms and the difference in
efficiency depends on the cost of pollution. Notice that the probability of implementation always
depends only on firm profit types. Thus, in the model in the paper, the planner faces the following
trade-off: set a lower tax and shut down only the most inefficient firm with higher probability (a
lower tax is more likely to be implemented) while leaving the more efficient firm in the market,
or set a higher tax and try to shut down both firms, but with less probability. The probability of
implementation decreases as the planner includes more efficient firms in his regulation. In the
model in the extension however, the nature of the trade-off is quite different. Here firms have the
same profit, and their response to the tax depends only on their profits. Thus it is not possible to
regulate the less efficient firm and not regulate the more efficient firm; the planner cannot regulate
firms according to their efficiency as he can in the original model.
Since both firms are inefficient in expectation, the uninformed planner regulates both by setting
a high tax. Notice that a planner who learns the firm profit type, regardless of whether he learns
that the type is low or high, can do no better than set a high tax. If he learns the common profit
type is low, both firms are inefficient with certainty and he wants to shut down both firms. A tax
of either θ L or θ H does this. If he learns that the profit type is θ H , the firms are still inefficient in
expectation and he shuts down both firms. Thus there is no value to learning the firm profit type;
learning this information does not change the planner’s tax choice. It immediately follows that
learning cost types must yield the full information payoff
2.8
Conclusion
We model an economy where a regulator chooses optimal environmental regulation in the presence of political constraints and hidden information. We use the model to compare the value of
information about the social cost of pollution with the value of information about the types of
firms present in the economy. We find that in environments where firms are politically powerful,
92
it is most valuable to learn the types of firms. On the other hand, when firms are not politically
powerful, it is most valuable to learn the social cost of pollution. While we present our results
using a deliberately simple model, this intuition is general. Indeed, the main results appear to
generalize to an environment with many firm types and levels of social costs.
The relative values of different types of information hinge on the political power of firms.
When the probability of implementing a policy is not sensitive to the losses that firms incur as
a consequence of the policy, choosing the optimal regulation amounts to setting the tax to equal
the externality (pollution) cost of production. This is the first best tax, and so information about
social cost of pollution is much more valuable than is information about firm types. On the other
hand, when a high tax is much less likely to be implemented than a low tax, a welfare-maximizing
planner would treat a sample of high profit, inefficient firms differently from a mixed sample
of high and low profit inefficient firms. In the latter case there is more surplus at stake from
not passing the policy, and information about firm types is more valuable. To say this another
way, information about firm types allows the regulator to choose regulation that affects only firms
that are less able to influence the political process. In an environment where marginal changes
in political resistance are important, this means that information about firm type can be very
valuable. If marginal changes in political resistance are not important, information that leads
to the first best, i.e., information about social cost, is more valuable.
We acknowledge that the assumption of a binary production decision is a strong one. Relaxing
the assumption of binary production choice and allowing firms to choose quantity in the context
of the model we present here comes at great cost in terms of tractability and clarity, and generates
small benefit in intuition over and above the propositions we have discussed here. Finding the
correct way to extend the model in this direction without losing the simplicity inherent to the
binary setup remains a subject for future research.
Given the results we have obtained here, the conclusions of Olson (1982), Becker (1985) and
Stigler (1971) suggest that when a regulator is facing a concentrated industry he would do well
to learn firm types as opposed to the true social cost of pollution, as concentrated industries are
93
conducive to firm mobilization. On the other hand, if the regulator finds himself in an industry
with many small firms that cannot organize effectively, learning the social cost of pollution is
preferable, because the benefit from getting the tax closer to the actual cost of pollution outweighs
the cost from potentially not implementing the tax.
94
2.9
Appendix
Proof of Proposition 1
Proposition 1 asserts that under some conditions on the parameters the value to either type of
information is zero. To prove this result, we need only show that when these conditions hold, the
uninformed planner’s optimal tax choice is the same as the tax choice of a planner that has the
information in question. We prove each of proposition 1.1 and 1.2 in turn.
First we show that:
P(2θ L )(θ L − Ec) ≥ P(θ H + θ L )[(θ L − Ec) + (θ H − Ec)] ⇒ VΩθ = 0.
Note first that if the parameters satisfy:
P(2θ L )(θ L − Ec) ≥ P(θ H + θ L )[(θ L − Ec) + (θ H − Ec)],
then looking at Table 2.5, for the realization {{θ L , θ H , c L }, {θ L , θ H , c H }} the planner prefers a
high tax. This then implies that a high tax is optimal for any of the three samples that are possible
when the planner gains information about firm types. Further,
P(2θ L )(θ L − Ec) ≥ P(θ H + θ L )[(θ L − Ec) + (θ H − Ec)]
⇒ P(2θ L )(θ L − Ec) ≥ P(θ H + θ L )[(θ H − Ec) + (θ L − Ec)] +
1−α
P(2θ H )(θ H − Ec),
α
which implies that the planner prefers a high tax with no information. Altogether, the planner prefers a high tax with no information, and chooses a high tax for any of the three possible
realizations when he gains information about firm types. There cannot be a value to learning firm
types.
95
Now we show that:
P(2θ L )(θ L − c H ) ≤ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )] +
1−α
P(2θ H )(θ H − c H ) ⇒ VΩc = 0
α
First, from table 6, if
P(2θ L )(θ L − c H ) ≤ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )] +
1−α
P(2θ H )(θ H − c H )
α
then the planner chooses a low tax even when the social cost is high. Then, for either realization
of social cost, the optimal tax is low.
Further,
1−α
P(2θ H )(θ H − c H )
α
1−α
⇒ p(2θ L )(θ L − Ec) ≤ P(θ H + θ L )[(θ H − Ec) + (θ L − Ec)] +
P(2θ H )(θ H − Ec)
α
P(2θ L )(θ L − c H ) ≤ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )] +
and thus an uninformed planner optimally taxes low. Altogether, a planner taxes low for
either realization when information about social cost is gained, but taxes low when he has no
information anyways. The value of learning social cost can’t be positive.
Proof of Proposition 2
Proposition 2.1 establishes conditions under which learning only the social cost of pollution
leaves the planner as well off as if he would be with full information. To prove this we need to
show that, for each possible level of social cost, once information about social cost is obtained,
learning the types of firms would not change the planner’s choice of tax.
Now suppose
P(2θ L )(θ L − c H ) ≥ P(θ H + θ L )[(θ L − c H ) + (θ H − c H )]
96
Note that (see table 4) if the parameter values satisfy this condition a fully informed planner
regulates both types for the realization {θ L , θ H , c H }. This implies that a high tax is always optimal
when c = c H , regardless of firm types. Further, we know that if c = c L a low tax is always optimal.
Thus once the planner knows the social cost, learning firm types does not change his choice of tax
regardless of the types, and thus does not change his ex-post payoff. Information about social cost
yields the full information payoff.
We now prove proposition 2.1, that information about firm types can never yield the full information payoff:
Vθ 6= VF
To prove this, we need find only one possible realization of firm types such that learning social
cost in addition to information about firm types changes the planner’s tax choice. If we can find
such a case, information about firm types can’t yield full information payoff, because the planner’s
ex-post payoff can change with more information. So consider the realization
{{θ H , θ H , c L }, {θ H , θ H , c H }}
In this case, without further information the planner chooses a high tax, as both firms are
inefficient in expectation. However, if the planner were to learn that the true social cost is c L , a
low tax would be strictly preferable, as both firms are efficient. Thus information about firm types
alone can’t provide full information payoff.
97
Table 2.7: Optimal taxes in each possible state
State s
{θ L , θ L , c L , 0, 0}
{θ L , θ L , c L , 1, 0}
{θ L , θ L , c L , 0, 1}
{θ L , θ L , c L , 1, 1}
{θ L , θ H , c L , 0, 0}
{θ L , θ H , c L , 1, 0}
{θ L , θ H , c L , 0, 1}
{θ L , θ H , c L , 1, 1}
{θ H , θ L , c L , 0, 0}
{θ H , θ L , c L , 1, 0}
{θ H , θ L , c L , 0, 1}
{θ H , θ L , c L , 1, 1}
{θ H , θ H , c L , 0, 0}
{θ H , θ H , c L , 1, 0}
{θ H , θ H , c L , 0, 1}
{θ H , θ H , c L , 1, 1}
{θ L , θ L , c H , 0, 0}
{θ L , θ L , c H , 1, 0}
{θ L , θ L , c H , 0, 1}
{θ L , θ L , c H , 1, 1}
Optimal Tax τF∗ (s)
{θ L , θ H }
{θ L , θ H }
{θ L , θ H }
{θ L , θ H }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L }
{θ L , θ H }
{θ L , θ H }
{θ L , θ H }
{θ L , θ H }
98
Table 2.7 (cont’d)
Optimal Tax τF∗ (s)
{θ H }
{θ H }
State s
{θ L , θ H , c H , 0, 0}
{θ L , θ H , c H , 1, 0}
(
{θ L , θ H , c H , 0, 1}
(
{θ L , θ H , c H , 1, 1}
(θ −c H )+(θ L −c H )
(θ L −c H )
θ L if H
θ H o.w.
(θ −c H )+(θ L −c H )
(θ L −c H )
θ L if H
θ H o.w.
≤
≤
P(2θ L )
P(θ H +θ L )
{θ H , θ L , c H , 0, 0}
(
{θ H , θ L , c H , 1, 0}
(θ H −c H )+(θ L −c H )
(θ L −c H )
θ L if
θ H o.w.
{θ H , θ L , c H , 0, 1}
(
{θ H , θ L , c H , 1, 1}
(θ H −c H )+(θ L −c H )
(θ L −c H )
θ L if
θ H o.w.
≤
P(θ L )
P(θ H )
≤
{θ H }
P(θ L )
P(θ H )
{θ H }
P(2θ L )
P(θ H +θ L )
{θ H , θ H , c H , 0, 0}
{θ H , θ H , c H , 1, 0}
{θ H , θ H , c H , 0, 1}
{θ H , θ H , c H , 1, 1}
{θ H }
{θ H }
{θ H }
{θ H }
Table 2.8: Optimal taxes when planner knows who has political power
State s
Neither firm has power
(
θ L if P(Lθ )+αP(θ )H <
L
H
( θ H o.w.
P(θ )+ 1−α P(θ )
θ L if P(Lθ )+αP(θ )H <
L
H
θ H o.w.
Firm 1 has power
Firm 2 has power
(
Both firms have power
P(θ )+ 1−α P(θ )
Optimal Tax τR∗ (s)
θH
(θ H − Ec)+(θ L − Ec)
(θ H − Ec)+(θ H − Ec)
(θ H − Ec)+(θ L − Ec)
(θ H − Ec)+(θ H − Ec)
P(2θ L )(θ L − Ec)− P(θ H +θ L )[(θ H − Ec)+(θ L − Ec)]
P(2θ H )(θ H − Ec)
θ L if
θ H o.w.
>
1− α
α
Table 2.9: Optimal taxes when planner knows social cost of pollution
State s
c = cL
c = cH
θL
θ
H
Optimal Tax τω∗ c (s)
θL
2
(1−γ) 2α γ(1− P(θ L ))+α(1−α)[γ( P(θ H )− P(θ L ))+(1−γ)( P(θ H +θ L )− P(θ L ))]
−c H
if γ2 (1−α)+γ(1−γ)(1−α) P(θ )+(1−γ)2 (α(1−α) P(θ +θ )+(1−α)2 P(θ )) > θθHL −
cH
H
L
H
H
o.w.
99
Table 2.10: Optimal taxes when planner knows firm types
State s
{ θ1 , θ2 } = { θ L , θ L }
{ θ1 , θ2 } = { θ L , θ H }
θL
θ
H
if
Optimal Tax τω∗ c (s)
{θ L , θ H }
γ(1−γ)[ P(θ L )− P(θ H )]−(1−γ)2 [ P(θ L +θ H )− P(2θ L )]
1−(1−γ) γ(1− P(θ L )+1− P(θ H ))+(1−γ)(1− P(θ L +θ H ))
o.w.
{ θ1 , θ2 } = { θ H , θ L }
{ θ1 , θ2 } = { θ H , θ H }
−
θH
Table 2.11: Optimal taxes in each possible state
State s
{θ L , c L , c L }
{θ L , c L , c H }
{θ L , c H , c H }
{θ H , c L , c L }
{θ H , c L , c H }
{θ H , c H , c H }
θL
θH
Optimal Tax τF∗ (s)
{θ L , θ H }
{θ L , θ H }
{θ L , θ H }
θL
if [(θ H − c H ) + (θ H − c L )] > 0
o.w.
θH
Table 2.12: Optimal taxes when profit is known
Event ωθ
Optimal tax τθ∗ (ωθ )
{{θ L , c L , c L }, {θ L , c L , c H }, {θ L , c H , c H }}
{θ L , θ H }
θ L if q(θ H − c L ) + (1 − q)(θ H − c H ) > 0
{{θ H , c L , c L }, {θ H , c L , c H }, {θ H , c H , c H }}
θ H o.w.
Table 2.13: Optimal taxes when the planner knows firm cost types only
Event ωc
{{θ L , c L , c L }, {θ H , c L , c L }}
{{θ L , c L , c H }, {θ H , c L , c H }}
{{θ L , c H , c H }, {θ H , c H , c H }}
θL
θH
Optimal Tax τc∗ (ωc )
{θ L }
if θ H − c L + θ H − c H > 0
o.w.
θH
Optimal tax when the planner is uninformed
τU∗
=
θ L if q(θ H − c L ) + (1 − q)(θ H − c H ) > 0
θ H o.w.
Chapter 3
Identification and Estimation of Dynamic Games When Players’ Beliefs
are not in Equilibrium
3.1
Introduction
The principle of revealed preference (Samuelson, 1938) is a cornerstone in the structural empirical
analysis of decision models, either static or dynamic, single-agent decision problems or games.
Under the principle of revealed preference, agents maximize expected payoffs and their actions
reveal information on the structure of payoff functions. This simple but powerful concept has
allowed econometricians to use data on agents’ decisions to identify important structural parameters for which there is very limited information from other sources. Agents’ degree of risk aversion, intertemporal rates of substitution, market entry costs, adjustment costs and switching costs,
consumer willingness to pay, preference for a political party, or the cost of a merger, are just some
examples of the type of structural parameters that have been estimated under the principle of revealed preference. In the context of empirical games, a player’s expected payoff depends on his
beliefs about the behavior of other players. Most empirical applications of games have combined
the principle of revealed preference with the assumption that players’ beliefs are in equilibrium.
There are multiple reasons why the assumption of equilibrium beliefs is useful in the estimation of
games. Equilibrium restrictions have identification power even in models with multiple equilib-
100
ria (Tamer, 2003, and Aradillas-Lopez and Tamer,
2008).1
101
Imposing these restrictions contributes
to improve asymptotic and finite sample properties of estimators (Kasahara and Shimotsu, 2008).
Furthermore, structural models where agents’ beliefs are endogenously determined in equilibrium are attractive for the evaluation of counterfactual policy experiments because they take into
account how agents’ beliefs may change in the counterfactual scenario.
Despite the important and attractive implications of the assumption of equilibrium beliefs,
there are empirical applications of games where the assumption is not realistic and it is of interest
to relax it. To motivate the contributions of this paper, we start by presenting three examples. First,
competition in actual oligopoly industries is often characterized by strategic uncertainty (Besanko
et al., 2010). Firm managers are very secretive about their own strategies and face significant
uncertainty about the strategies of their competitors. In fact, it is often the case that firms have
incentives to misrepresent their own strategies (e.g., entry deterrence). The second example deals
with the empirical evaluation of the effect of a policy change in a strategic environment. Consider
a game of competition in an oligopoly industry where firms choose whether or not to adopt a new
technology. In this game, firms’ adoption decisions are strategic complements and the game has
multiple stable equilibria, e.g., an equilibrium with low probabilities of adoption, and an equilibrium with high probabilities of adoption. We have panel data of firms in this industry over
several periods of time. Suppose that an important policy change occurred in the middle of our
sample period, e.g., the introduction of a new government subsidy that tries to encourage firms’
adoption of the new technology. It seems reasonable to think that it will take time for firms to
learn about the new strategies of competitors after the policy change, and for some time firms’
strategies will be out of equilibrium. Imposing the restriction of equilibrium strategies may bias
our estimates of the effects of the new policy. Finally, there is significant empirical evidence in
1 In games with multiple equilibria, the assumption of equilibrium beliefs is key to have point identification of struc-
tural parameters and beliefs, and for the implementation of relatively simple methods of estimation. That is the case
in games of incomplete information under the assumption that the observations in the data come from the same equilibrium (Aguirregabiria and Mira, 2007). Under this assumption, it is possible to estimate players’ beliefs consistently
using a nonparametric estimator of the distribution of players’ actions. This nonparametric estimator of beliefs can
be used to construct players’ expected payoffs and to obtain an estimator of structural parameters that optimizes a
sample criterion function based on players’ best responses to the estimated beliefs from the data. This simple two-step
approach for identification and estimation cannot be applied when players beliefs are not in equilibrium.
102
experimental economics showing large heterogeneity in agents’ elicited beliefs (Camerer, 2003).
In many laboratory experiments, agents’ heterogeneity in beliefs is one of the most important factors to explain heterogeneity in behavior. This evidence contrasts dramatically with the standard
approach in empirical applications of games using non-experimental data. Most of these applications impose the assumption of equilibrium beliefs, and this implies that beliefs play no role in
explaining heterogeneous behavior.2
In this paper we study identification, estimation, and inference in dynamic discrete games of
incomplete information when we relax the assumption of equilibrium beliefs and the researcher
does not have data on elicited beliefs.3 First, we present new results on the point-identification
of structural parameters and beliefs. If players are rational, in the sense that each player maximizes expected payoff given some arbitrary beliefs, an exclusion restriction and a large-support
condition on one of the explanatory variables are sufficient for point-identification of structural
parameters and players’ beliefs. While the type of exclusion restriction that we need is quite plausible in dynamic games of oligopoly competition, the large support condition is not satisfied in
most applications. However, we show that this condition can be replaced with a "normalization"
restriction on beliefs. Second, we propose a simple two-step estimation method of structural parameters and beliefs, and an extension of this method that provides a sequence of estimators with
asymptotic variances and finite sample bias that decline monotonically. We also present a procedure for testing the null hypothesis of equilibrium beliefs. Finally, we illustrate our model and
methods with an empirical application of a dynamic game of store location by retail chains.
In the class of econometric models that we consider, players’ beliefs are probability distributions over the set of other players’ actions. These distributions are nonparametrically specified
and they are treated as incidental parameters that, together with the structural parameters of the
game, determine the stochastic process followed by players’ actions. When players beliefs are not
in equilibrium they are different from the actual distribution of players’ actions in the population.
2 An
exception is the recent paper by Goldfarb and Xiao (2009) that studies entry decisions in the US local telephone
industry and finds significant heterogeneity in firms’ beliefs about other firms’ strategic behavior.
3 Data on elicited beliefs of firm managers is very rare and typically of low quality.
103
Therefore, without other restrictions, beliefs cannot be identified and estimated by simply using a
nonparametric estimator of the distribution of players’ actions. We show that an exclusion restriction in the one-period payoff function and a large-support condition on one of the state variables
are sufficient conditions for point-identification of structural parameters and players’ beliefs. The
exclusion restriction consists in the assumption that for each player there is an observable state
variable that does not enter in the player’s payoff function but that enters in the payoff function
of the other players. The large-support condition establishes that the ’special’ state variable of the
exclusion restriction has unbounded support on the real line.
To illustrate our model and methods, we consider an empirical application where players are
firms. Most empirical studies on bounded rationality have concentrated on individual behavior,
and there is very little empirical work on bounded rationality of firms. However, there are good
reasons to consider that, in some contexts, firms’ do not have ’correct’ (equilibrium) beliefs about
their competitors’ behavior. For instance, firms tend to be very secretive about their business
strategies and this fact can make it difficult for them to construct correct beliefs about the behavior
of competitors. The hypothesis of social learning, that seems plausible in the context of individual
decision making, is much less convincing in strategic settings involving firms.
We consider a dynamic game of store location between McDonalds (MD) and Burger King
(BK). Using panel data of 422 local markets (districts) in United Kingdom during the period 19911995, we present empirical evidence that documents a puzzling regularity in the entry decisions
of MD and BK. Reduced form estimates show that the probability that a firm opens a new store in
a local market depends significantly and negatively on the own (predetermined) number of stores
in that market, but it does not respond (or even it responds positively) to the number of stores of
the competing firm. This firm behavior cannot be explained by a standard static model of store
location by firms that sell substitute products. After presenting robust empirical evidence showing
that unobserved market heterogeneity alone cannot alone explain this puzzle, we propose and
estimate a structural dynamic game that provides three possible explanations: spillover effects;
forward looking behavior; and biased beliefs about the behavior of the competitor
104
This paper is related to several topics on the econometrics of empirical games. It builds
and extends the literature on estimation of dynamic games of incomplete information, with recent methodological contributions by Aguirregabiria and Mira (2007), Bajari, Benkard and Levin
(2007), Pakes, Ostrovsky and Berry (2007), Pesendorfer and Schmidt-Dengler (2008), and Kasahara
and Shimotsu (2009), and empirical applications by Ryan (2008), Collard-Wexler (2008), Sweeting
(2007), Dunne et al (2009), Xu (2008), or Aguirregabiria and Mira (2009), among others. All the
papers in this literature have assumed that the data come from a Markov Perfect Equilibrium. We
relax that assumption. Our paper is related to that of Aradillas-Lopez and Tamer (2008) who study
the identification power of the assumption of equilibrium beliefs in static games. There are important differences between our model and econometric approach and those in Aradillas-Lopez
and Tamer. First, we study identification and estimation of dynamic games. There are different
aspects in which the assumption of equilibrium beliefs and identification issues are substantially
different in dynamic games than in static ones (Magnac and Thesmar, 2002, Aguirregabiria, 2010).
In particular, the characterization and derivation of bounds on choice probabilities is significantly
more complicated in dynamic than in static games. Some important results in Aradillas-Lopez
and Tamer either do not extend to dynamic games or they are impractical. That is the main reason
why we incorporate additional restrictions that provide point identification of parameters and
beliefs. Second, Aradillas-Lopez and Tamer concentrate on identification, while we propose and
implement new estimators and tests.
The rest of the paper includes the following sections. Section 2 presents the model and basic
assumptions. In section 3, we derive our characterization of the set of players’ rational strategies
in our class of dynamic games. Section 4 presents our identification results. Section 5 describes
estimation methods and testing procedures. The empirical application is described in section 6.
We summarize and conclude in section 7.
105
3.2
3.2.1
Model
Basic Framework and Assumptions
This section presents a dynamic game of incomplete information where two players make binary
choices over T periods.4 The time horizon T can be either finite or infinite. We use the indexes
i ∈ {1, 2} and j ∈ {1, 2} to represent a player and his opponent, respectively. Time is discrete and
indexed by t ∈ {1, 2, ..., T }. Every period t, players choose simultaneously and non-cooperatively
between alternatives 0 and 1. Let Yit ∈ {0, 1} represent the choice of player i at period t. Each
player makes this decision to maximize his expected and discounted payoff, Et (∑sT=0 βsi Πi,t+s ),
where β i ∈ (0, 1) is player i’s discount factor, and Πit is his payoff at period t. The one-period
payoff function has the following structure:5
Πit =
πit (Yjt , Xt ) − ε it if Yit = 1
0
(3.1)
if Yit = 0
Yjt represents the current action of the other player; Xt is a vector of state variables which are common knowledge for both players; variable ε it is private information of firm i at period t; and πit (.)
is a real valued function. We distinguish several components in the vector of common knowledge
state variables: Xt = (Wt , Xit , X jt ). Wt ∈ W is a vector of state variables that evolve exogenously
according to a Markov process with transition probability function f W (Wt+1 |Wt ). The variable
Xit is an endogenous state variable for player i with discrete support {0, 1, 2, ..., K } and it evolves
4 The results in the paper can be generalized to models with more than two players or choice alternatives. A key result in our paper is the characterization of rational beliefs in section 3. That characterization is used in the identification
results and in the estimation methods in sections 4 and 5. It is possible to extend that representation to multinomial
choice models and games with more than two players.
5 For the nonparametric identification of the payoff function we need to impose some "normalization" assumption.
We normalize to zero the one-period payoff of choice alternative 0. In contrast with static decision problems, this
normalization is not innocuous in dynamic decision models and it has implications for some counterfactual experiments using the model (see Aguirregabiria, 2010). However, the results in this paper do not depend on the specific
choice of normalization of the payoff function, and they apply both to models with nonparametric and with parametric
specification of the payoff function.
over time according to a transition probability function f i ( Xit+1 |Yit , Xit
).6
106
We will refer to Xit as
the stock or the "capacity" of player i. The private information shocks ε 1t and ε 2t are independent
of Wt , independent of each other, and independently and identically distributed over time. Their
distribution functions, Λ1 and Λ2 , are absolutely continuous and strictly increasing with respect
to the Lebesgue measure on R.
EXAMPLE: Dynamic game of market entry and exit. Consider two firms competing in a market. Each
firm sells a differentiated product. Every period, firms decide whether to be active in the market
or not. Then, incumbent firms compete in prices. Let Yit ∈ {0, 1} represent the decision of firm
i to be active in the market at period t. Let Xit be a state variable that represents the incumbent
status of firm i at previous period, i.e., Xit = Yit−1 . And let Wt be a vector of exogenous market
characteristics, e.g., market size. The profit function of firm i is:
Πit =
πi Yjt , Xit , Wt − ε it if Yit = 1
0
(3.2)
if Yit = 0
In this example, we have that πi (0, 0, Wt ) (alternatively πi (0, 1, Wt )) is the profit of firm i when
it is a monopolist and a new entrant (an incumbent last period). Similarly, πi (1, 0, Wt ) (alternatively πi (1, 1, Wt )) is the profit of firm i when it is a duopolist and a new entrant (an incumbent
last period). The variable ε it is a private information shock in the fixed operating cost, and it
is i.i.d. normally distributed. Without further restrictions, this model considers a nonparametric specification of the profit function πi . We study identification and estimation in this context.
6 For instance,
this transition probability could be based on the transition rule:
Xit+1
=
max 0 , min K , Xit + κ Yit − δi,t+1 . K and κ are parameters that belong to the natural numbers. κ represents the increment in the stock associated with the investment decision Yit = 1, and K represents the maximum
possible level of the stock. δi,t+1 ∈ {0, 1} is a random variable that captures the exogenous depreciation in the stock,
and it is i.i.d. with a multinomial distribution with discrete and finite support. This specification includes as particular
case a model where the evolution of the state variable Xit is deterministic. i.e., the probability distribution of the
depreciation variable δi,t+1 has all the probability mass at a single point.
107
Alternatively, the model could be parametric. For instance, the profit function could be:
πi Yjt , Xit , Wt
= Wt (1 − Yjt ) θiM + Yjt θiD − θiFC − (1 − Xit ) θiEC
(3.3)
where Wt represents market size, and θiM , θiD , θiFC , and θiEC are structural parameters. θiM and θiD
represent the per capita variable profit of firm i when the firm is a monopolist and when it is a
duopolist, respectively. θiEC and θiFC are parameters that represent the fixed operating cost and the
entry cost, respectively. We also study the estimation of parametric models.
Most previous literature on estimation of dynamic discrete games assumes that the data comes
from a Markov Perfect Equilibrium (MPE). This equilibrium concept incorporates three main assumptions.
ASSUMPTION 1 (Payoff relevant state variables): Players’ strategy functions depend only on payoff relevant state variables: Xt and ε it .
ASSUMPTION 2 (Rational beliefs on own future behavior): Players are forward looking, maximize expected intertemporal payoffs, and have rational expectations on their own behavior in the future.
ASSUMPTION ’EQUIL’: (Rational or equilibrium beliefs on other players’ actions): Strategy functions
are common knowledge, and players’ have rational expectations on the current and future behavior of other
players. That is, players beliefs about other players’ behavior are consistent with the actual behavior of other
players.
Suppose that we impose only Assumption 1. The payoff-relevant information set of player i
is {Xt , ε it }. The space of Xt is X ≡ W × {0, 1, ..., K } × {0, 1, ..., K }. At period t, players observe Xt
and choose their respective actions. Let σit (Xt , ε it ) be a strategy function for player i at period t.7
This is a function from the support of (Xt , ε it ) into the binary set {0, 1}, i.e., σit : X × R → {0, 1}.
Given any strategy function σit , we can define a choice probability function Pit (Xt ) that represents
7 Strategy
functions may change over time. This is the case for the equilibrium strategies in a model with finite time
horizon T. More generally, when we do not impose the assumption of Markov Perfect equilibrium, we allow strategies
to vary over time.
108
the probability of Yit = 1 conditional on Xt given that player i follows strategy σit . That is,
Pit (Xt ) ≡
1 {σit (Xt , ε it ) = 1} dΛi (ε it )
(3.4)
where 1{.} is the binary indicator function. It is convenient to represent players’ behavior and beliefs using these Conditional Choice Probability (CCP) functions. Note that, given that the variables
in Xt have a discrete support we can represent the CCP function Pit (.) using a finite dimension
vector Pit ≡ { Pi (Xt ) : Xt ∈ X }. Throughout the paper we use either the function Pit (.) or the
vector Pit to represent the actual behavior of player i at period t.
Without imposing Assumption ’Equil’ (’Equilibrium Beliefs’), a player’s beliefs about the behavior of other players do not necessarily represent the actual behavior of the other player. Therefore, we need functions other than σjt (.) and Pjt (.) to represent players i’ beliefs about the strat
egy of player j. Let b jt Xt , ε jt be player i’s belief about the strategy function of player j at
period t. And let Bjt (Xt ) be the choice probability associated with b jt Xt , ε jt , i.e., Bjt (Xt ) ≡
1{b jt Xt , ε jt = 1} dΛ j ε jt . We can represent function Bjt (.) using a finite-dimensional vector
B jt ≡ { Bjt (Xt ) : Xt ∈ X }, and B̃ j represents {B jt : t = 1, 2, ..., T }.
Now, suppose that we impose Assumptions 1 and 2, but not Assumption ’Equil’. We say that a
strategy function σit (.) (and the associated CCP function Pit ) is rational if for every possible value of
(Xt , ε it ) ∈ X × R the action σit (Xt , ε it ) maximizes player i’s expected and discounted value given
his beliefs on the opponent’s strategy. Given beliefs B̃ j , player i’s best response is the optimal
solution of a single-agent dynamic programming (DP) problem. A DP problem can be described
in terms of a discount factor, a payoff function, and a transition probability function for the state
variables. In this case, the discount factor is β i , the expected one-period payoff function is Yit
(πitB (Xt ) − ε it ), where:
πitB (Xt ) = (1 − Bjt (Xt )) πit (0, Xt ) + Bjt (Xt ) πit (1, Xt )
(3.5)
109
And the transition probability of the state variables is:
f itB (Xt+1 |Yit , Xt ) =
f i ( Xit+1 |Yit , Xt ) (1 − Bjt (Xt )) f j ( X jt+1 |0, Xt ) + Bjt (Xt ) f j ( X jt+1 |1, Xt )
(3.6)
Let VitB (Xt , ε it ) be the value function for player i’s dynamic programming problem given beliefs
B̃ j . By Bellman’s principle, the sequence of value functions {VitB : t = 1, 2, ..., T } can be obtained
recursively using backwards induction in the following Bellman equation:
VitB (Xt ) =
max Yit (πitB (Xt ) − ε it ) + β i
VitB+1 (Xt+1 , ε it+1 ) dΛi (ε it+1 ) d f itB (Xt+1 |Yit , Xt ) (3.7)
Yit ∈{0,1}
The best response function of player i at period t given beliefs B̃ j is the optimal decision rule
of this DP problem. Define the threshold value function:
vitB (Xt ) ≡ πitB (Xt ) + β i
∑
f itB (Xt+1 |1, Xt ) − f itB (Xt+1 |0, Xt ) V̄itB+1 (Xt+1 )
(3.8)
X t +1
where V̄itB (Xt ) is the integrated value function
R
VitB (Xt , ε it )dΛi (ε it ). We denote vitB a threshold value
function because it represents the threshold value of ε it that makes player i indifferent between the
choice of alternatives 0 and 1. The optimal response function is:
{Yit = 1} iff
ε it ≤ vitB (Xt )
(3.9)
The best response probability function is Λi (vitB (Xt )). Therefore, under Assumptions 1 and 2 the
actual behavior of player i, represented by the CCP function Pit (.), satisfies the following condition:
Pit (Xt ) = Λi vitB (Xt )
(3.10)
This equation summarizes all the restrictions that Assumptions 1 and 2 impose on players’ choice
probabilities. The right hand side of equation (3.10) is the best response function of a rational
110
player. We use
Λi (vitB )
to represent the vector-value function
{Λi vitB (Xt )
: X t ∈ X }.
The concept of Markov Perfect Equilibrium (MPE) is completed with Assumption ’Equil’
(’Equilibrium Beliefs’). Under this assumption, players’ beliefs are in equilibrium, i.e., Bjt (Xt ) =
Pjt (Xt ) for any period t and state Xt . A MPE can be described as a sequence of CCP pairs,
{Pit , P jt : t = 1, 2, ..., T } such that for every player i and any time period t, we have that
P
Pit = Λi vitj
(3.11)
For the rest of the paper, we maintain Assumptions 1 and 2 but we relax Assumption ’Equil’
on ’Equilibrium Beliefs’. Our approach is agnostic about the formation of players’ beliefs. Assumption 2 establishes that every player is rational, in the sense that his strategy maximizes his
expected and discounted payoff given his beliefs on other players’ behavior. For some of our
results, we need a stronger concept of rationality that is defined as level 2 rationalizability: every
player is rational and knows that his opponents are rational as well, such that players’ beliefs are
consistent with other players’ having rational strategies.
3.3
Identification
Suppose that the researcher has a random sample with realizations of the game over multiple
locations and time periods. Using the terminology in empirical applications of games in Industrial
Organization, we employ the term local market to refer to a location. We use the letter m to index
local markets. The researcher observes a random sample of M local markets with information on
{Yimt , Ximt , Wmt } for every player i ∈ {1, 2} and every period t ∈ {1, 2, ..., T }. For the moment, we
consider that the dynamic game has a finite horizon T. The number of local markets, M, is large
and for the identification results in this section we consider that M is infinite. The unobservable
variables {ε imt } are assumed to be independently and identically distributed across markets and
over time.8
8 This
framework can be extended to incorporate unobservable state variables for the econometrician which are
common knowledge to players and have a distribution with finite support (see Kasahara and Shimotsu, 2008b).
111
We want to use this sample to estimate the models structural parameters (functions) {πi , f i ,
f W , Λi , β i : i = 1, 2}. We assume that the distribution of the unobservables, Λi , is known to the
researcher up to a scale parameter. We consider identification of the payoff function πi up to scale,
but for notational convenience we omit the scale parameter. Following the standard approach
in dynamic decision models, we assume that the discount factors, β i , is known to the researcher.
Note that the transition probability functions f i and f W are nonparametrically identified by using
0
0
M
sample frequency estimators, i.e., fˆi ( Xi |Yi , Xi ) = [∑m
=1 1{ Ximt+1 = Xi , Yimt+1 = Yi , Ximt = Xi }]
0
0
M
M
ˆ
/ [∑m
=1 1{Yimt+1 = Yi , Ximt = Xi }], and f W ( W | W ) = [ ∑m=1 1{ Wmt+1 = W , Wmt = W }] /
M
[∑m
=1 1{ Wmt = W }]. Therefore, we concentrate on the identification of the payoff functions πi
and π j and consider that { f i , f W , Λi , β i : i = 1, 2} are known.
0 ( X ) be the true (population) conditional probability function Pr(Y
Let Pimt
imt = 1| Xmt = X ) that
represents the actual behavior of player i in market m at period t. Let B0jmt (X) be the probability
function with player i’s ’true’ beliefs in market m at period t. And let π 0 ≡ {πi0 , π 0j } be the
true payoff functions in the population. Assumption 3 summarizes our conditions on the Data
Generating Process.
0 is the best response of player i given his beliefs B0 and the
ASSUMPTION 3: (A) For every player i, Pimt
jm
payoff functions π 0 , i.e., P0imt = Λi (B0im ; π 0 ). (B) Players’ beliefs may vary over time (in an unrestricted
way) and over markets with the observable characteristics X, but a player has the same beliefs in markets
with the same observable characteristics X at the same time period, i.e., for every market m with Xmt = X,
0 ( X ) = B0 ( X ).
Bimt
it
Assumption 3(A) simply establishes that players are rational in the sense that their actual behavior is the best response given their beliefs. Assumption 3(B) establishes that a player should
have the same beliefs in two markets with the same observable characteristics and at the same
period of time. Note that beliefs can vary across markets according to the observable variables in
Xmt .9 This assumption plays a similar role for identification as the assumption of "no multiple equi9 Our sampling design with large M and small T is the standard case in applications of empirical games in Industrial
Organization. Alternatively, if the sampling design is such that the number of periods T is large and the number of
markets M is small, then we can allow beliefs to vary over markets but being constant over time.
112
libria in the data" in dynamic games under the condition that beliefs are in equilibrium.In dynamic
games where beliefs are in equilibrium, the no multiple equilibria in the data assumption effectively
allows the researcher to identify player beliefs. This is because conditional choice probabilities
are non-parametrically identified under this assumption (Aguirregabiria and Mira (2007)), and if
beliefs are in equilibrium, a player’s belief about his opponents probability of taking any given
action is that probability itself. When beliefs are not in equilibrium we of course can not identify
beliefs in this way, but assumption 3B does allow us to identify CCPS from the data because it
implies that the probability of taking any given action across two markets with the same observable characteristics is the same. This in turn, as we will show, is important for the identification of
beliefs themselves.10
Assumption 3 implies that choice probabilities describing players’ actual behavior (CCPs)
should be the same in two markets with the same observable characteristics, i.e., for any market
0 ( X ) = P0 ( X ). It also implies that, given a large sample of local
m with Xmt = X, we have that , Pimt
it
markets, we can identify these probabilities nonparametrically from the data. For any player i,
any period t, and any value of X ∈X , we have that Pit0 (X) = E(Yimt |Xmt = X), and this conditional
expectation can be estimated consistently using data on Yimt and Xmt . For instance, given that Xmt
M
is a vector of discrete random variables the frequency or ’cell’ estimator ∑m
=1 Yimt 1{ Xmt = X } /
M
0
∑m
=1 1{ Xmt = X } is a consistent estimator of Pit ( X ).
For notational simplicity, we omit the market subindex m for the rest of this section. The model
0
restrictions are summarized in the best response conditions Pit0 (Xt ) = Λi vitB (Xt , π 0 ) . Given
these conditions, we want to identify payoffs π 0 and beliefs B0j . It is simple to verify that, without
further restriction, the order condition for identification is not satisfied. For a given time period
and player, the number of restrictions is |X |, the number of parameters in the payoff function is
2|X |, and the number of parameters in beliefs is |X |, i.e., |X | < 3|X |.
10 See
Assumptions 5(A) and 5’(A) in Aguirregabiria and Mira (2007) and the discussion of these assumptions in
pages 14-15 and 25-26 of that paper. Other recent papers on the estimation of dynamic games, such as Bajari, Benkard,
and Levin (2007), Pakes, Ostrovsky, and Berry (2007), and Pesendorfer ans Schmidt-Dengler (2008) consider similar
assumptions. Aguirregabiria and Mira (2009) relax that assumption to allow for multiple equilibria in the data.
113
The following is our key identification assumption.
ASSUMPTION 4 (Exclusion Restriction): The one-period payoff function of player i depends on the actions
of both players, Yit and Yjt , the common state variables Wt , and the own stock variable, Xit , but it does not
depend on the stock variable of the other player, X jt .
πit (Yjt , Xt ) = πit (Yjt , Wt , Xit )
This type of exclusion restriction appears naturally in some dynamic games. For instance, this
assumption holds in the dynamic game of market entry and exit of our Example in section 2 above.
A firm’s profits depend on whether the competitor is currently in the market or not but, given
that, it does not depend on whether the competitor was in the market or not at previous period.
Similarly, in a dynamic game of Cournot competition with capacity constraints (e.g., Besanko and
Doraszelski, 2004) a firm’s profit depend on the current capacity of competing firms but not on
the capacity of these firms at previous periods. A similar structure appears in the dynamic game
of retail store competition that we present and estimate in section 5.
Though exclusion restriction in Assumption 4 reduces the number of parameters in the payoff
function, it is not enough for the identification of the model. The order condition of identification
is not satisfied. The number of restrictions is still the same, |X | = |W |(K + 1)2 , the number of
parameters in the payoff function is now 2|W |(K + 1), and the number of parameters in beliefs is
still |X | = |W |(K + 1)2 .
ASSUMPTION 5 (Monotonic Beliefs and Large support condition): (A) The beliefs function B0jt (Xt ) is
strictly monotonically decreasing (or increasing) in X jt : i.e., ∂B0jt (Xt )/∂X jt < 0. (B) Conditional on
(Wmt , Ximt ) the variable X jmt has variability over the whole real line.
ASSUMPTION 6 (Normalization of beliefs): There are two values in the support of X j , say X jlow and
high
Xj
, such that for every value of ( Xi , W ) we have that beliefs are in equilibrium, i.e., B0jt ( Xi , X jlow , W ) =
high
Pjt0 ( Xi , X jlow , W ), and B0jt ( Xi , X j
high
, W ) = Pjt0 ( Xi , X j
, W ).
114
We use Assumptions 5 and 6 as two alternative conditions that, together with the exclusion
restriction, can provide nonparametric identification of payoff functions and beliefs. These two
assumptions are related. Assumption 6 may be interpreted as a generalization of Assumption
5. The intuition behind it is that when the opponent’s stock variable is ’bad’ enough (or ’good’
enough) strategic uncertainty disappears such that the player knows perfectly the strategy of his
opponent. Strategic uncertainty may disappear because the opponent’s choice becomes certain
(i.e., the choice probability gets arbitrarily close to zero or one), as under Assumption 5. But the
conditions for the absence of strategic uncertainty may be more general, as under Assumption 6.
The following Proposition summarizes our main identification result.
PROPOSITION: Under Assumptions 1-4 and 5 (and under Assumptions 1-4 and 6) the payoff functions
{πit0 for any i, t} and the beliefs functions { Bit0 for any i, t} are nonparametrically identified.
The proof of this Proposition follows a backwards induction approach. First, we show identification of beliefs and payoff functions at period T. Then, given identification at period T, we apply
backwards induction to prove identification at any period t < T. Through this proof, we assume
that the time horizon T is finite and that the researcher has panel data that covers all the periods
until T. At the end of this section we discuss identification when we relax this conditions.
Consider the identification at the last period T. For notational simplicity we omit the time
subindex. Under Assumptions 1 to 3, the CCP functions Pi0 and Pj0 are identified everywhere
in the support of X. At period T, the threshold function viB (X, π ) is equal to πiB (X) = (1 −
Bj (X)) πi (0, X) + Bj (X) πi (1, X). Therefore, the best response condition is: Pi0 (X) = Λi ((1 −
B0j (X))πi0 (0, X) + B0j (X)πi0 (1, X)). Define the function q0i (X) ≡ Λi−1 ( Pi0 (X)). Given that the distribution function Λi is invertible and it is known (up to scale) to the researcher, the function q0i (.) is
identified everywhere in the support of X. We have that:
q0i (X) = πi0 (0, X) + πi0 (1, X) − πi0 (0, X) B0j (X)
(3.12)
Let X ja , X jb , X jc ,and X jd be four arbitrary values in the support of X j such that X ja 6= X jb and X jc 6= X jd .
Under Assumption 4 (Exclusion Restriction), we have that
πi0 (0, X a )
=
πi0 (0, Xb )
=
115
=
πi0 (0, Xc )
πi0 (0, Xd ), and
{πi0 (0, X a ) + πi0 (1, X a ) − πi0 (0, X a ) Bj (X a )} − {πi0 (0, Xb ) + πi0 (1, Xb ) − πi0 (0, Xb ) Bj (Xb )}
{πi0 (0, Xc ) + πi0 (1, Xc ) − πi0 (0, Xc ) Bj (Xc )} − {πi0 (0, Xd ) + πi0 (1, Xd ) − πi0 (0, Xd ) Bj (Xd )}
=
Bj (X a ) − Bj (Xb )
Bj (Xc ) − Bj (Xd )
Therefore, we have that:
Bj (X a ) − Bj (Xb )
q0i (X a ) − q0i (Xb )
=
Bj (Xc ) − Bj (Xd )
q0i (Xc ) − q0i (Xd )
(3.13)
This expression shows that, under assumptions 1 to 4, there is a function of beliefs that is
identified, without having to impose the assumption of equilibrium beliefs. Note that this result
relies on the exclusion restriction but it does not require the monotonicity of beliefs (assumption
5a), the large support condition (assumption 5b), or the restriction on beliefs in assumption 6. This
result also provides a nonparametric test for the null hypothesis of equilibrium beliefs. Define the
function:
(
δi (X a , Xb , Xc , Xd ) ≡
q0i (X a ) − q0i (Xb )
q0i (Xc ) − q0i (Xd )
)
(
−
Pj0 (X a ) − Pj0 (Xb )
Pj0 (Xc ) − Pj0 (Xd )
)
(3.14)
It is clear that we can nonparametrically identify δi . If beliefs are in equilibrium, δi is equal to zero
for any value of (X a , Xb , Xc , Xd ). Let δ̂i be a root-M consistent and asymptotically nonparametric
estimator of δi . The hypothesis of equilibrium beliefs implies δi = 0, and we can test equilibrium
beliefs using a simple LM test of the null hypothesis of δi = 0. We describe this test in more detail
in section 4.
Let’s consider the identification of the payoff function πi0 from equation (3.12). First, we
present the proof using Assumption 5 (monotonic beliefs and large support condition). By the monotonicity of the beliefs function Bj , as X j → +∞ we have that Bj (X) → 0. Therefore, for arbitrarily large values of X j we have that πi0 (0, Xi , W) = q0i (X) and variables ( Xi , W) have variability over all their support. Therefore, πi0 (0, .) is identified. Similarly, as X j → −∞ we have
that Bj (X) → 1, and πi0 (1, Xi , W) = q0i (X) such that πi0 (1, .) is identified over the whole sup-
πi0
port of ( Xi , W).Given that
116
is identified, then we can obtain the beliefs function as Bj (X) =
[q0i (X) − πi0 (0, X)]/[πi0 (1, X) − πi0 (0, X)] for any value of X with πi0 (1, X) − πi0 (0, X) 6= 0.
We consider now identification of the model when we replace Assumption 5 by Assumption
6. First, note that the order condition of identification is satisfied: the number of restrictions is
|X | = |W |(K + 1)2 , the number of parameters in the payoff function is now 2|W |(K + 1), and
the number of parameters in beliefs is |W |(K + 1)2 − 2|W |(K + 1). The number of unknown
parameters is equal to the number of restrictions. Solving the restrictions imposed by Assumption
6 into equation (3.12), we have that for any value of ( Xi , W):
q0i ( Xi , X jlow , W)
=
πi0 (0, Xi , W) + πi0 (1, Xi , W) − πi0 (0, Xi , W) Pj0 ( Xi , X jlow , W)
(3.15)
high
q0i ( Xi , X j
high
, W) = πi0 (0, Xi , W) + πi0 (1, Xi , W) − πi0 (0, Xi , W) Pj0 ( Xi , X j , W)
high
Combining these two equations, we obtain [πi0 (1, Xi , W) − πi0 (0, Xi , W)] = [q0i ( Xi , X j
high
q0i ( Xi , X jlow , W)] / [ Pj0 ( Xi , X j
, W) −
, W) − Pj0 ( Xi , X jlow , W)]. And solving this expression into the equa-
tions in (3.15), we identify the payoff function as:
πi0 (0, X)
=
q0i (Xlow ) −
Pj0 (Xlow )
q0i (Xhigh )−q0i (Xlow )
Pj0 (Xhigh )− Pj0 (Xlow )
(3.16)
h
πi0 (1, X) = q0i (Xlow ) + 1 − Pj0 (Xlow )
high
where Xlow ≡ ( Xi , X jlow , W) and Xhigh ≡ ( Xi , X j
i
q0i (Xhigh )−q0i (Xlow )
Pj0 (Xhigh )− Pj0 (Xlow )
, W). Again, given the identification of the
payoff function, we can obtain beliefs as Bj (X) = [q0i (X) − πi0 (0, X)] / [πi0 (1, X) − πi0 (0, X)].
The equations in (3.16) deserves further comments. First, these equations contain as a particular case the model with the large support condition in Assumption 5. That is, the conditions of
Assumption 5 can be seen as a particular case of the conditions in Assumption 6. In particular, suphigh
pose that X j
high
is so large that Pj0 ( Xi , X j
, W) = 0, and X jlow is so small that Pj0 ( Xi , X jlow , W) = 1.
high
Then, the equations in (3.16) become πi0 (0, X) = q0i ( Xi , X jlow , W) and πi0 (1, X) = q0i ( Xi , X j
, W ),
117
which are also the expressions that identify the payoff function under the large support conditions
in assumption 6a. Second, the equations in (3.16) relate also to the model with equilibrium beliefs.
When we impose the assumption of equilibrium beliefs, the equations in (3.16) are satisfied not
high
only for the pair of values ( X jlow , X j
) but for every pair of values of X j , say X ja and X jb , such that
Pj0 ( Xi , X ja , W) − Pj0 ( Xi , X jb , W) 6= 0. Therefore, the payoff function is over-identified, and we can
test these over identifying restrictions, as we have mentioned above.
Given beliefs and payoffs at period T, we we use backwards induction to prove the identification of the payoff function and beliefs at any period t < T. Suppose that the functions
πit0 +s and B0jt+s are known for every s ≥ 1. Given this information, we want to identify func0
tions πit0 and B0jt . Player i’s best response implies that Pit0 (Xt ) = Λi (vitB (Xt )), where vitB (Xt ) =
(1 − Bjt (Xt )) πi (0, Xt ) + Bjt (Xt ) πi (1, Xt )+ β i ∑ [ f itB (Xt+1 |1, Xt )− f itB (Xt+1 |0, Xt )] V̄itB+1 (Xt+1 ), and
X t +1
R B
B
V̄it+1 is the integrated values function Vit+1 (Xt+1 , ε it+1 )dΛi (ε it+1 ). Given { Pit0+s , πit0 +s , B0jt+s : s =
1, 2, ..., T − t}, the integrated value function V̄itB+1 is known. Define the function:
q0it (X) ≡ Λi−1 ( Pit0 (X)) − β i
∑ [ fitB (Xt+1 |1, Xt ) − fitB (Xt+1 |0, Xt )] V̄itB+1 (Xt+1 )
0
0
0
(3.17)
X t +1
This function is identified everywhere in the support of X. Furthermore, by the best response
condition and the definition of q0it , we have that:
q0it (X) = πit0 (0, X) + πit0 (1, X) − πit0 (0, X) B0jt (X)
(3.18)
This equation has the same structure as the one at the last period T. Therefore, we can use exactly
the same arguments to show the identification of functions πit0 and B0jt .
So far, we have assumed that the time horizon T is finite and that the researcher has panel data
that covers all the periods until T. What if the time horizon of the decision problem is infinite? For
the sake of saving notation, we continue using T to represent the number of periods in the sample
(but not the time horizon that now is infinite). To obtain identification in this case, we make the
118
following assumption: from period T and forever in the future, agents have rational beliefs on the
behavior of the opponent, i.e., Bit0 (X) = Pit0 (X) for any player i, any state X, and any period t ≥ T.
Aguirregabiria (2005) and Tan (2008) have studied identification in this case.
3.4
Estimation and Inference
The proof of identification in the previous section implicitly suggests a method for the estimation
of the just-identified nonparametric model. We start with a detailed description of that estimation
method. In most empirical applications, the specification of the payoff function involves parametric restrictions. Therefore, we extend the estimation method to deal with parametric models in a
very simple way. Finally, for parametric models, we describe how the two-step method can be
extended recursively to generate a sequence of estimators with better statistical properties.
3.4.1
Estimation with nonparametric payoff function
Step 1: Nonparametric estimation of CCPs, P̂it0 , for every player, time period, and state X, and (if
needed) of the transition probabilities f i and f W .
0 ( X )), the estimated
Step 2: Estimation of preferences and beliefs. At the last period T, q̂0iT (X) = Λi−1 ( P̂iT
payoff function is:
0 (0, X ) = q̂0 ( Xlow ) − P̂0 ( X high )
π̂iT
iT
jT
q̂0iT (Xhigh )−q̂0iT (Xlow )
0 ( X high )− P̂0 ( Xlow )
P̂jT
jT
(3.19)
h
i
B̂0jT (X) =
0 (0,X )
q̂0iT (X)−π̂iT
0
0 (0,X )
π̂iT (1,X)−π̂iT
0 (1, X ) = q̂0 ( Xlow ) + 1 − P̂0 ( X high )
π̂iT
iT
jT
q̂0iT (Xhigh )−q̂0iT (Xlow )
0 ( X high )− P̂0 ( Xlow )
P̂jT
jT
and the estimated beliefs function is:
(3.20)
We also construct an estimate of the integrated value function at period T, that we will use later
119
for the estimation of payoffs and beliefs at periods earlier than T. For instance, if the distribution
Λi is logistic:
n 0
o
B0
B
V̂iT
(X) = ln 1 + exp v̂iT
(X)
(3.21)
0
B ( X ) ≡ (1 − B̂0 ( X )) π̂ 0 (0, X ) + B̂0 ( X ) π̂ 0 (1, X ).
where v̂iT
t
jT
iT
jT
iT
At any period t < T, we construct function:
∑ [ fitB (Xt+1 |1, X) − fitB (Xt+1 |0, X)] V̂itB+1 (Xt+1 )
0
q̂0it (X) = Λi−1 ( P̂it0 (X)) − β i
0
0
(3.22)
X t +1
Given q̂0it , we can estimate payoffs and beliefs using the same type of expression as for period T:
π̂it0 (0, X) = q̂0it (Xlow ) − P̂jt0 (Xhigh )
q̂0it (Xhigh )−q̂0it (Xlow )
P̂jt0 (Xhigh )− P̂jt0 (Xlow )
(3.23)
h
i
B̂0jT (X) =
0 (0,X )
q̂0iT (X)−π̂iT
0
0 (0,X )
π̂iT (1,X)−π̂iT
π̂it0 (1, X) = q̂0it (Xlow ) + 1 − P̂jt0 (Xhigh )
q̂0it (Xhigh )−q̂0it (Xlow )
P̂jt0 (Xhigh )− P̂jt0 (Xlow )
and
(3.24)
And the integrated value function at period t is:
0
V̂itB (X) = β i
∑
0
0
f itB (Xt+1 |0, X)] V̂itB+1 (Xt+1 ) − ln 1 − P̂it0 (X)
(3.25)
X t +1
It is straightforward to show that this estimator is consistent and asymptotically normal. The
derivation of the asymptotic variance is cumbersome. In our empirical application we use the
bootstrap method to obtain standard errors and confidence intervals for the estimates.
120
3.4.2
Estimation with parametric payoff function
In most applications, we consider a parametric specification of the payoff function. A very common class of parametric specifications is the linear in parameters model:
πit (Yjt , Xt ) = zit (Yjt , Xt ) θ
(3.26)
where zit (Yjt , Xt ) is a 1 × K vector of known functions, and θ is a K × 1 vector of unknown structural parameters. Note that, given the subindexes i and t in the vector of functions zit (.), this
specification is very general and it includes models with player-specific and time-specific parameters. Given this specification, the model implies the following relationship:
0
q0it (X) = zitB (X) θ 0
(3.27)
0
where zitB (X) ≡ (1 − B0jt (X)) zit (0, X) + B0jt (X) zit (1, X), and θ 0 is the true vector of parameters in
the population. Expression (3.27) is a key equation in the estimation of θ 0 .
To estimate θ 0 we consider a simple three steps method. The first two-steps are the same as for
the nonparametric model.
B =
Step 3: Given the estimates from step 2, we construct the variables: q̂imt = q̂0it (Xmt ) and ẑimt
(1 − B̂0jt (Xmt )) zit (0, Xmt ) + B̂0jt (Xmt ) zit (1, Xmt ), for every sample observation (i, m, t). Then, we
B . The estimator is root-M consistent and
estimate θ 0 by running an OLS regression of q̂imt on ẑimt
asymptotically normal. Again, the derivation of the asymptotic variance of this estimator is cumbersome, and we use the bootstrap method to obtain standard errors and confidence intervals for
the estimates.
It is possible to apply step 3 recursively to obtain a sequence of estimators of θ 0 and of belief
functions with better asymptotic and finite sample properties. To describe this recursive method, it
is important to take into account the model under Assumption 6 implies the following expression
121
for beliefs at any value of
X:11
h
B0j (X) = Pj0 (Xlow ) + Pj0 (Xhigh ) − Pj0 (Xlow )
i
"
q0i (X) − q0i (Xlow )
q0i (Xhigh ) − q0i (Xlow )
#
(3.28)
Let’s denote steps 1 to 3 as the first stage of the recursive procedure. Let θ̂ (1) be the estimator of
`0 that results from this first stage. Given θ̂ (1) and using backwards induction we can obtain new
estimates of choice probabilities, q’s, beliefs, and value functions.
(1)
(1)
(1)
At period T, the functions q̂iT (.), P̂iT (.), and B̂jT (.) for both players are the result of per(1)
forming R iterations in the following iterative procedure. We initialize B̂jT (.) = B̂0jT (.).At each
iteration, we perform tasks (a) to (d):
h
i
(1)
(1)
(1)
(a) q̂iT (X) = (1 − B̂jT (X)) ziT (0, X) + B̂jT (X) ziT (1, X) θ̂ (1)
(1)
(1)
(b) P̂iT (X) = Λi q̂iT (X)
(c) Updating beliefs:
(1)
B̂jT (X)
=
(1)
P̂jT (Xlow ) +
h
(1)
P̂jT (Xhigh ) −
(1)
P̂jT (Xlow )
i
(1)
"
(1)
q̂iT (X) − q̂iT (Xlow )
(1)
#
(1)
q̂iT (Xhigh ) − q̂iT (Xlow )
(d) Repeat (a)-(c) using the new beliefs functions.
(1)
Given P̂iT (.), we update the integrated value function:
(1)
(1)
V̂iT (X) = − ln 1 − P̂iT (X)
(1)
(1)
(3.29)
(1)
At period t < T, the functions q̂it (.), P̂it (.), and B̂jt (.) for both players are the result of
(1)
performing R iterations in the following iterative procedure. We initialize B̂jt (.) = B̂0jt (.). At each
iteration, we perform tasks (a) to (d):
11 As
B0 (X a )− B0 (Xb )
q0i (X a )−q0i (Xb )
= B0j (Xc )− B0j (Xd ) .
q0i (Xc )−q0i (Xd )
j
j
Xhigh , and Xd = Xlow , we have that
shown in equation (3.13) above, for any vectors X a , Xb , Xc , and Xd , we have that
If we particularize this expression at the vectors X a = X, Xb = Xlow , Xc =
q0i (X)−q0i (Xlow )
q0i (Xhigh )−q0i (Xlow )
B0j (X)− B0j (Xlow )
0
Bj (Xhigh )− B0j (Xlow )
. By Assumption 6, B0j (Xlow ) = Pj0 (Xlow ) and B0j (Xhigh ) = Pj0 (Xhigh ). Thus, solving
h
ih 0
i
q (X)−q0 (Xlow )
for B0j (X) we obtain B0j (X) = Pj0 (Xlow ) + Pj0 (Xhigh ) − Pj0 (Xlow ) q0 (iXhigh )−i q0 (Xlow ) .
=
i
i
122
h
(a)
(1)
q̂it (X)
(b)
(1)
P̂iT (X)
= (1 −
(1)
B̂jt (X)) zit (0, X) +
(1)
B̂jt (X) zit (1, X)
i
θ̂ (1)
!
(1)
q̂it (X) +
= Λi
B0
B0
β i ∑ [ f it (Xt+1 |1, X) − f it (Xt+1 |0, X)]
X t +1
(1)
V̂it+1 (Xt+1 )
(c) Updating beliefs:
(1)
B̂jt (X)
=
(1)
P̂jt (Xlow ) +
h
(1)
P̂jt (Xhigh ) −
(1)
P̂jt (Xlow )
i
"
(1)
(1)
q̂it (X) − q̂it (Xlow )
(1)
#
(1)
q̂it (Xhigh ) − q̂it (Xlow )
(d) Repeat (a)-(c) using the new beliefs functions.
(1)
Given P̂it (.), we update the integrated value function:
0
(1)
(1)
(1)
V̂it (X) = β i ∑ f itB (Xt+1 |0, X)] V̂it+1 (Xt+1 ) − ln 1 − P̂it (X)
X t +1
(1)
(1)
(1)
Finally, given the new estimated functions q̂it (.) and B̂jt (.), we construct the variables: q̂imt =
(1)
(1)
(1)
(1)
q̂it (Xmt ) and ẑimt = (1 − B̂jt (Xmt )) zit (0, Xmt ) + B̂jt (Xmt ) zit (1, Xmt ), for every sample observa(1)
(1)
tion (i, m, t). Then, we estimate θ 0 by running an OLS regression of q̂imt on ẑimt . Let θ̂ (2) be the
estimator of θ 0 that results from this first stage. This same procedure can be repeated to obtain a
sequence of estimators {θ̂ (K) , B̂(K) : K ≥ 1} with better statistical properties.
3.4.3
Test of Equilibrium Beliefs
In this subsection, we present a simple test of the null hypothesis of equilibrium beliefs. In principle, we could considered a standard Lagrange Multiplier (LM) or Score test based on the constrained maximum likelihood estimation (MLE) of structural parameters and beliefs. Define the
log-likelihood function:
M
l (θ, P) ≡
T
2
∑ ∑ ∑ Yit
m =1 t =1 i =1
log Λi (vitB (Xmt , θ )) + (1 − Yit ) log(1 − Λi (vitB (Xmt , θ )))
(3.30)
123
The constrained MLE is defined as a vector (θ̂ MLE , P̂ MLE ) such that:
(θ̂ MLE , P̂ MLE ) = arg max l (θ, P)
(θ,P)
(3.31)
subject to: P =
Λ(vP (θ ))
We want to test the null hypothesis P = Λ(vP (θ )), that consists of 2|X | constrains on (θ, P). The
standard LM statistic for testing this null hypothesis is:
LM( MLE)
∂l (θ̂ MLE , P̂ MLE )
=
∂(θ, P)0
"
∂2 l (θ̂ MLE , P̂ MLE )
∂(θ, P)∂(θ, P)0
# −1
∂l (θ̂ MLE , P̂ MLE )
∂(θ, P)
(3.32)
Under the null hypothesis, this statistic is asymptotically distributed as a chi-square with 2|X |
degrees of freedom.
The test that we propose is the following. Let X ja , X jb , X jc ,and X jd be four arbitrary values in the
support of X j such that X ja 6= X jb and X jc 6= X jd . And define the function δi0 :
(
δi0 (X a , Xb , Xc , Xd )
≡
q0i (X a ) − q0i (Xb )
q0i (Xc ) − q0i (Xd )
)
(
−
Pj0 (X a ) − Pj0 (Xb )
)
(3.33)
Pj0 (Xc ) − Pj0 (Xd )
Given our model under Assumptions 1-4, we have that if player i has rational beliefs then δi0 (X a ,
Xb , Xc , Xd ) = 0 for every value of (X a , Xb , Xc , Xd ). Therefore, testing δi0 (X a , Xb , Xc , Xd ) = 0 implies
testing the null hypothesis of rational beliefs. More precisely, we test the following null hypothesis. Let H be the number of all possible combinations of four different values of X j . We index
(h)
these values by h. Let Xmt the quadruplet h when the values of Xi and W are the ones in ob(h)
(h,a)
(h,b)
(h,c)
(h,d)
(h,a)
servation (m, t): i.e., Xmt = (Xmt , Xmt , Xmt , Xmt ) = ([ Ximt , X j
(h,c)
[ Ximt , X j
(h,d)
, Wmt ], [ Ximt , X j
(h,b)
, Wmt ], [ Ximt , X j
, Wmt ],
, Wmt ]). Under the hypothesis of equilibrium beliefs, we have that
(h)
E(δi0 (Xmt )) = 0 for every h. This is exactly the null hypothesis that we test:
(h)
H0 : E δi0 Xmt
= 0 for every quadruple h
(3.34)
124
Let be the estimator of that we obtain when we replace
and
by nonparametric esti
(h)
(h)
mates of these CCP functions. Define the statistic dˆi that is simply the sample mean of δ̂i0 Xmt ,
(h)
M
T
0 X(h) . Then, define the statistic:
i.e., dˆi = ( MT )−1 ∑m
δ̂
∑
=1 t =1 i
mt
δ̂i0
δi0
Pi0
Ŝ =
∑
H
h =1
(h)
dˆi
(h)
se(dˆ )
Pj0
!2
(3.35)
i
(h)
(h)
where se(dˆi ) is the standard error of dˆi , that we can obtains using nonparametric bootstrap.
Under the null hypothesis, Ŝ is asymptotically distributed as a Chi-square with H degrees of
freedom.
3.5
Empirical Application
We illustrate our model and method with an application of a dynamic game of store location.
There has been significant interest in recent years in the estimation of game theoretic models of
entry in local markets and store location. Most studies have considered static games: see Mazzeo
(2002), Seim (2006), Jia (2008), Nishida (2008), and Zhu and Singh (2009), among others. Recent
applications estimate dynamic games of store location: Holmes (2010), Beresteanu and Ellickson
(2005), Aguirregabiria, Mira, and Roman (2006), Walrath (2008), and Suzuki (2010).
We study store location of McDonalds (MD) and Burger King (BK) using data for United Kingdom during the period 1991-1995. We divide the UK in local markets (districts) and consider these
companies’ decisions of how many stores, if any, to operate in each local market. The profits of
a store in a market depend on local demand and cost conditions and on the degree of competition from other firms’ stores and from stores of the same chain. There are sunk costs associated
with opening a new store, and therefore this decision has implications on future profits. Firms are
forward-looking and maximize the value of expected and discounted profits. Each firm has uncertainty about future demand and cost conditions in local markets. Firms also have uncertainty
about the current and future behavior of the competitor. In this context, the standard assump-
125
tion is that firms have rational expectations about other firms’ strategies, and that these strategies
constitute a Markov Perfect Equilibrium. Here we relax this assumption. The main question that
we want to analyze in this empirical application is whether the beliefs of each of these companies
about the store location strategy of the competitor are consistent with the actual behavior of the
competitor.
3.5.1
Data and descriptive evidence
The dataset was collected by Otto Toivanen and Michael Waterson, who have used it in their
paper Toivanen and Waterson (2005).12 Our working sample is a five year panel that tracks 422
local authority districts (local markets), including the information on the stock and flow of MD and
BK stores into each district. It also contains socio-economic variables at the district level such
as population, density, age distribution, average rent, income per capita, local retail taxes, and
distance to the UK headquarters of each of the firms. The local authority district is the smallest
unit of local government in the UK, and generally consists of a city or a town sometimes with a
surrounding rural area. There are almost 500 local authority districts in Great Britain. Our working
sample of 422 does not included the districts that belong to Greater London.13 The median district
in our sample has an area of 300 square kilometers and a population of 95,000 people.14 Table 1
presents descriptive statistics for socio-economic and geographic characteristics of our sample of
local authority districts.
Table 2 presents descriptive statistics on the evolution of the number of stores for the two firms.
Toivanen and Waterson present a detailed discussion of why the retail chain fast food hamburger
industry in the UK during this period can be considered as a duopoly of BK and MD. In 1990,
MD had more than three times the number of stores of BK, and it was active in more than twice
12 We want to thank Otto Toivanen and Michael Waterson for generously sharing their data with us. See also Yang
(2010) for other empirical application with these data.
13 The reason to exclude from our sample the districts in Greater London is that they do not satisfy the standard
criteria of isolated geographic markets.
14 As a definition of geographic market for the fast food retail industry, the district is perhaps a bit wide. However,
an advantage of using district as definition of local market is that most of the markets in our sample are geographically
isolated. Most districts contain a single urban area. And, in contrast to North America where many fast food restaurants
are in transit locations, in UK these restaurants are mainly located in the centers of urban areas.
126
Table 3.1
Descriptive Statistics on Local Markets (Year 1991)
422 local authority districts (excluding Greater London districts)
Variable
Median
Std. Dev.
Pctile 5%
Pctile 95%
Area (thousand square km)
0.30
0.73
0.03
1.67
Population (thousands)
94.85
93.04
37.10
280.50
Children: Age 5-14 (%)
12.43
1.00
10.74
14.07
Young: 15-29 (%)
21.24
2.46
17.80
25.17
Pensioners: 65-74 (%)
9.01
1.50
6.89
11.82
GDP per capita (thousand £)
92.00
12.14
74.40
112.70
Claimants of UB / Population ratio (%)
2.75
1.27
1.24
5.11
Avg. Weekly Rent per dwelling (£)
25.31
10.61
19.11
35.07
Council tax (thousand £)
0.24
0.05
0.11
0.31
Number of BK stores
0.00
0.62
0.00
1.00
Number of MD stores
1.00
1.16
0.00
3.00
127
the number of local markets than BK. Conditional on being active in a local market, MD had also
significantly more stores per market than BK. These differences between MD and BK have not
declined significantly over the period 1991-1995. While BK has entered in more new local markets
than MD (69 new markets for BK and 48 new markets for MD), MD has open more stores (143
new stores for BK and 166 new stores for MD).
Table 3 presents the annual transition probability of market structure in local markets as described by the number of stores of the two firms. According to this transition matrix, opening a
new store is an irreversible decision, i.e., no store closings are observed during this sample period.
In Britain during our sample period, the fast food hamburger industry was still young and expanding, as shown by the large proportion of observations/local markets without stores (41.6%).
Although there is significant persistence in every state, the less persistent market structures are
those where BK is the leader. For instance, if the state is "BK=1 & MD=0", there is a 20% probability that the next year MD opens at least one store in the market. Similarly, when the state is "BK=2
& MD=1", the chances that MD opens one more store the next year are 31%.
128
Table 3.2
Evolution of the Number of Stores
422 local authority districts (excluding Greater London districts)
1990
#Markets with Stores
Change in #Markets with Stores
# of Stores
Change in # of Stores
Mean #Stores per Market
(Conditional on#Stores > 0)
#Markets with Stores
98
104
118
131
150
-
17
6
14
13
19
79
115
128
153
181
222
-
36
13
25
28
41
1.11
1.17
1.23
1.30
1.38
1.48
1990
1991
McDonalds
1992 1993 1994
1995
206
213
220
237
248
254
7
7
17
11
6
316
344
382
421
447
35
28
38
39
26
1.49
1.56
1.61
1.70
1.76
281
Change in # of Stores
Mean #Stores per Market
(Conditional on#Stores > 0)
1995
71
Change in #Markets with Stores
# of Stores
Burger King
1991 1992 1993 1994
1.36
129
Table 3.3
Transition Probability Matrix for Market Structure
Annual Transitions. Market structure: BK=x & MD=y, where x and y are number of stores
%
Market Structure at t+1
2
BK≥
MD=0
MD=1
MD≥
-
-
0.1
-
7.4
1.0
-
-
1.4
-
-
15.8
-
-
1.4
-
76.0
18.0
2.0
4.0
-
-
-
-
-
87.1
8.1
-
3.3
1.4
-
-
-
-
-
86.5
-
-
13.5
2 &MD=0
-
-
-
-
-
-
84.6
15.4
-
2 &MD=1
-
-
-
-
-
-
-
69.0
31.0
2 &MD≥ 2
-
-
-
-
-
-
-
-
100.0
41.6
23.3
6.6
2.2
10.9
8.8
0.6
1.4
4.5
BK=0
BK=0
MD=0
MD=1
BK=0 &MD=0
95.1
3.6
BK=0 &MD=1
-
BK=1
BK=1
MD=0
MD=1
0.2
1.0
-
87.2
4.2
-
2
-
-
82.7
BK=1 &MD=0
-
-
BK=1 &MD=1
-
2
BK≥
BK≥
BK=0 &MD≥
BK=1 &MD≥
BK≥
Frequency
BK=0
MD≥
2
BK≥
BK=1
MD≥
2
2
BK≥
2
2
130
Table 3.4
Reduced Form Probits for the Decision to Open a Store
Explanatory Variable
Estimated Marginal Effects1 (∆P( x ) when dummy goes from 0 to 1)
Burger King
McDonalds
No FE County FE District FE No FE County FE
District FE
Own number
of stores at t-1
Dummy: Own #stores = 1
Dummy: Own #stores = 2
Dummy: Own #stores ≥ 3
-0.021∗∗
(0.005)
-0.023∗∗
(0.004)
-0.019∗∗
(0.005)
-0.036∗∗
(0.007)
-0.030∗∗
(0.005)
-0.027∗∗
(0.005)
-0.885∗∗
(0.063)
-0.210∗
(0.085)
-0.056
(0.036)
-0.035∗∗
(0.010)
-0.047∗∗
(0.006)
-0.043∗∗
(0.006)
-0.045∗∗
(0.012)
-0.060∗
(0.008)
-0.053∗∗
(0.008)
-0.550∗∗
(0.056)
-0.757∗∗
(0.041)
-0.816∗∗
(0.038)
0.032∗∗
(0.011)
0.045∗
(0.023)
0.089∗
(0.048)
0.037∗
(0.014)
0.052∗
(0.029)
0.101∗
(0.059)
-0.025
(0.055)
-0.017
(0.031)
0.011
(0.084)
0.020
(0.013)
0.041
(0.029)
-0.041∗∗
(0.007)
0.032∗
(0.018)
0.076
(0.046)
-0.050∗∗
(0.009)
0.052∗∗
(0.073)
-0.007∗∗
(0.093)
-0.104∗∗
(0.020)
0.024
0.027
0.014
0.045
0.054
0.085
YES
YES
NO
NO
2110
422
-371.89
0.229
YES
YES
YES
NO
1715
343
-340.26
0.252
YES
YES
NO
YES
535
107
-110.54
0.624
YES
YES
NO
NO
2110
422
-467.46
0.159
YES
YES
YES
NO
1855
371
-449.02
0.161
YES
YES
NO
YES
640
128
-198.50
0.441
Competitor’s number
of stores at t-1
Dummy: Comp.’s #stores = 1
Dummy: Comp.’s #stores = 2
Dummy: Comp.’s #stores ≥ 3
Pred. Prob. Y=1 at mean X
Time dummies
Control variables2
County Fixed Effects
District Fixed Effects
Number of Observations3
Number of Local Districts3
log likelihood
Pseudo R-square
Note 1: Estimated Marginal Effects are evaluated at the mean value of the rest of the explanatory variables.
Note 2: Every estimation includes as control variables log of population, log of GDP per capita, log of population density,
proportion population 5-14, proportion population 15-29, average rent, and proportion of claimants of unemployment benefits.
Note 3: Fixed effects estimations do not include districts for which the dependent variable does not have enough time variation.
131
Table 4 presents estimates of reduced form Probit models for the decision to open a new store.
We obtain separate estimates for MD and BK. Our main interest is in the estimation of the effect of previous year number of stores (own stores and competitor’s stores) on the probability
of opening a new store. We include as control variables population, GDP per capita, population
density, proportion of population 5-14, proportion population 15-29, average rent, and proportion
of claimants of unemployment benefits. To control for unobserved local market heterogeneity
we also present two fixed effects estimations, one with county fixed effects and other with local
district fixed effects. We report only estimates of the marginal effects associated to the dummy
variables that represent previous year number of stores. The main empirical result from Table 4 is
that, regardless the set of control variables that we use, the own number of stores has a strong negative effect on the probability of opening a new store but the effect of the competitor’s number of
stores is either negligible or even positive. This result is very robust to different specification of the
reduced form model is along the lines of the empirical results from the reduced forms in Toivanen
and Waterson’s paper. Controlling for unobserved heterogeneity using fixed effects reveals that
the estimation without fixed effects suffers of a very important upward bias in the marginal effect
of the number of own stores. However, the estimated marginal effect of the number of competitor’s stores barely changes. The estimates show also a certain asymmetry between the two firms:
the absence of response to the competitor’s number of stores is more clear for BK than for MD. In
particular, when BK has three stores in the market there is significant reduction on MD’s probability of opening a new store. That negative effect does not appear in the reduced form probit for
BK. This empirical evidence cannot be explained by a standard static model of store location by
firms that sell substitute products. Here we explore three, non-mutually exclusive, explanations:
(a) spillover effects; (b) forward looking behavior (dynamic game); and (c) biased beliefs about the
behavior of the competitor.
(a) Spillover effects. The competitor’s number of stores may have a positive spillover effect on the
profit of a firm. There are different possible sources of this spillover effect, e.g., informational,
132
product expansion through advertising, costs reductions etc. Since we do not have price and
quantity data at the level of local markets, we do not try to identify the source of the spillover
effect. We allow for this effect in our specification of demand such that the natural interpretation
in the context of our model is a product expansion coming from the advertising effect of retail
stores. However, this should be interpreted as a shortcut or ’reduced form’ specification of different possible spillover effects.
(b) Forward looking behavior. Opening a store is a partly irreversible decision that involves significant sunk costs. Therefore, it is reasonable to assume that firms are forward looking when they
make this decision and that they play a dynamic game. Forward looking behavior might explain
the apparent lack of competitive effects when we look at these decisions from the point of view of
a static model of entry. Suppose that firms anticipate, with some uncertainty, the total number of
hamburger stores that a local market can sustain in the long-run given the size and the socioeconomic characteristics of the market. For simplicity, suppose that this number of "available slots"
does not depend on the ownership of the stores because the products sold by the two firms are
very close substitutes. In this context, firms play a ’racing’ game to fill as many ’slots’ as possible
with their own stores. Diseconomies of scale and scope may generate a negative effect of the own
number of stores on the decision of opening new stores. However, in this model, during most of
the period of expansion the number of slots of the competitor has zero effect on the decision of
opening a new store. Only when the market is filled or close to filled do the competitor’s stores
have a significant effect on entry decisions.
(c) Biased beliefs. As mentioned in the Introduction, competition in actual oligopoly industries is
often characterized by strategic uncertainty. Firm face significant uncertainty about the strategies
of their competitors, and they try to deceive competitors about their own strategies. In the context
of our application, it may be the case that MD’sand/or BK’s beliefs overestimate the negative
effect of the competitor’s stores on the competitor’s entry decisions. For instance, if MD has one
store in a local market, BK may believe that the probability that MD opens a second store is close
133
to zero. This over-optimistic belief about the competitor’s behavior may generate an apparent lack
of response of BK’s entry decisions to the number of MD’s stores.
3.5.2
Model
Consider two retail chains competing in a local market. Each firm sells a differentiated product.
The demand for product i is qit = (St /2)(1 + (bit − b jt ) − ( pit − p jt )) where: qit is the quantity
sold by firm i; St is market size that is exogenous; bit is the ’quality’ of product i at period t;15
and pit is the ’price’ of product i at period t. Production costs are linear in the quantity produced,
i.e., Cit = ci qit , where ci is firm i’s constant marginal cost. The variable profit of firm i is VPit =
( pit − ci )qit . Given market size and qualities at period t, firms compete in prices ala Bertrand to
maximize current variable profit. It is simple to show that in the Bertrand equilibrium prices are
pit = 1 + c j + bit − b jt , and variable profits are:16
VPit =
St
2
1−
1
3
ci − c j +
1
3
bit − b jt
2
(3.36)
A firm’s quality increases with the number of stores that the firm has in the market. There are
at least two ways in which the number of stores in the market affects the willingness to pay of
the average consumer. First, an increase in the number of stores implies a reduction in consumer
transportation cost to visit a store of the chain. Second, stores are like ’advertisements’ in the
sense that they increase the awareness of local consumers about the retail chain. We assume the
following specification:
(0)
bit = bi
(1)
+ bi
Yit
(3.37)
where Yit ∈ {0, 1, ..., K } represents the firm’s choice of number of stores in the local market at
(0)
period t. bi
≥ 0 is a parameter that represents the exogenous quality of firm i in every local
(1)
market. And bi
15 This
≥ 0 is a parameter that measures the effect of the number of stores in the local
’quality’ is the just the willingness to pay for the product of the average consumer in the market.
that this is the total variable profit of the firm in the local market, not the profit of a single store.
16 Note
market on the ’quality’ of the firm in that
134
Firm i is active in the market at period t if Yit is
market.17
strictly positive. Given the specification of firm quality and the previous formula for equilibrium
variable profits, we have the following expression for variable profits in terms of the state variables
and the decision variables of the dynamic game:
VPit = 1{Yit > 0} St
(3.38)
VP + θ VP Y + θ VP Y 2 + θ VP Y + θ VP Y 2 + θ VP Y Y
θ0i
it jt
jt
jt
it
it
5i
4i
3i
2i
1i
VP : k = 0, 1, ..., 5} are structural parameters that are known
1{.} is the indicator function. {θki
(0)
(0)
(1)
(1)
functions of the ’deep’ parameters bi , b j , bi , b j , ci , and c j .18 It is simple to verify that given
VP : k = 0, 1, ..., 5; i = 1, 2} we can (over) identify the ’deep’
a value of the vector of parameters {θki
structural parameters di , d j , and ([bi − ci ] − [b j − c j ]).
In order to distinguish decision and state variables, we use the variable Xit to represent the
number of stores at period t − 1, i.e., Xit ≡ Yit−1 . Every period, the two firms know the current
’stocks’ of stores in the market at previous period and choose simultaneously the new number of
stores. The firm’s total profit function is:
Πit = VPit − θiEC 1{Yit > 0 and Xit = 0}
(3.39)
− 1{Yit > 0} θ0iFC − θ1iFC Yit − θ2iFC Yit 2
− 1{Yit > Xit } ε it
FC , θ FC and θ FC are parameters in the cost function. θ EC is an entry cost that is paid
where θiEC , θ0i
1i
2i
0i
FC is a lump-sum cost associated
the first time that the firm opens a store in the local market. θ0i
FC Y + θ FC Y 2 takes into
with having any positive number of stores in the market. The function θ1i
it
2i it
17 It
is important to note that in the static version of the game, the number of stores of the two firms are strategic
substitutes, i.e., ∂2 VPit /∂Yit ∂Yjt = −di d j St /9 < 0.
18 In
(1)
(0)
VP ≡ (1 + ( b
particular, θ0i
i
(1)
(0)
(1)
(0)
VP ≡ (2/3) b (1 + ( b
− b j )/3 − (ci − c j )/3)2 , θ1i
i
i
(0)
VP ≡ −(2/3) b (1 + ( b
[bi /3]2 , θ3i
j
i
(0)
(1)
(0)
VP ≡
− b j )/3 − (ci − c j )/3), θ2i
(1)
(1)
VP ≡ [ b /3]2 , and θ VP ≡ −2[ b /3][ b /3]2 .
− b j )/3 − (ci − c j )/3), θ4i
5i
j
i
j
135
account that operating costs increase with the number of stores in a linear or quadratic form. The
variable ε it is a private information shock in the cost of opening a new store, and it is i.i.d. normally
distributed. In this example, the vector of structural parameters for firm i is
VP
VP
VP
VP
VP
VP
FC FC
FC 0
θi ≡ (θ0i
, θ1i
, θ2i
, θ3i
, θ4i
, θ5i
, θiEC , θ0i
, θ1i , θ2i
)
(3.40)
And observable part of the payoff function, πit , is:
πit ≡ 1{Yit > 0}
St , St Yit , St Yit 2 ,
(3.41)
St Yjt , St Yjt 2 , St Yit Yjt ,
−1{ Xit = 0}, − 1, − Yit , − Yit 2
θi
Note that our model implies the exclusion restriction that, given Yjt , the profit of firm i does
not depend on X jt = Yjt−1 . That is, a firm’s current profit depends on his own and his opponents
current number of stores in the market, but given these variables it does not depend on the number of stores of the competitors at period t − 1. Of course, a firm’s beleifs about the probability
distribution of Yjt depends on X jt .
3.5.3
Estimation of the structural model
As described in section 5.1 above, we do not observe store closings in our sample. Furthermore,
for almost all the observations with store openings the number of new stores is one. Therefore, we
consider that Yit ∈ { Xit , Xit + 1} or equivalently, Yit − Xit ∈ {0, 1}. The stock variable Ximt that
represents the number of installed stores of firm i in market m at the beginning of the year. The
maximum value of Ximt in the sample is 13, and we consider that the set of possible values of Ximt
is {0, 1, ..., 15}. Therefore, the state space X is {0, 1, ..., 15} × {0, 1, ..., 15} that has 256 grid points.
Yimt − Ximt is the binary indicator of the event "firm i opens a new store in market m at year t". We
136
consider that market characteristics are constant over time. The measure of market size Sm is total
population in the district. For some specifications, we allow the cost of investment to depend on
market characteristics such as average rent, retail taxes, population density, or average income.
Therefore, each market has its own vector of players’ CCPs. The dimension of the vectors Pi in
this model is equal to 108, 032, i.e., 422 markets times 256 states X.
Tables 5 and 6 present estimates of the structural model under the assumption that firms are
myopic, β = 0, and under the assumption that firms are forward looking, β = 0.95, respectively. We report two different sets of point estimates: estimates using a simple two-step method
Pseudo Maximum Likelihood method where the estimator of (equilibrium) players’ beliefs in the
first step is a nonparametric frequency estimator; and estimates using the Nested Pseudo Likelihood (NPL) method proposed in Aguirregabiria and Mira (2007). The NPL method imposes the
equilibrium restrictions in the sample (i.e., the estimated beliefs should be equal to the estimated
best response probabilities), while the two-step method only satisfies the equilibrium restrictions
asymptotically. The NPL estimator has smaller asymptotic variance and finite sample bias than
the two-step method. There are very substantial differences between two models, particularly in
the estimates of the parameters that capture cannibalization and competition effects. While these
effects have the ’wrong’ sign in the myopic model, the signs are the expected ones in the forward
looking model. All the parameter estimates in the forward looking model have the expected signs
and have reasonable magnitudes. Therefore, it seems that forward looking behavior explains part
of the puzzle in the reduced form estimates.
137
Table 3.5
Myopic Game of Entry for McDonalds and Burger King
Under the Assumption that Players’ Beliefs are in Equilibrium
Data: 422 markets, 2 firms, 5 years = 4,220 observations
β = 0.00 (not estimated)
Two Step Estimates
NPL Estimates
Burger King McDonalds Burger King McDonalds
Variable Profits:
θ0VP
4.904 (1.070)∗
7.909 (2.289)∗
4.864 (1.081)∗
7.898 (2.287)∗
θ1VP cannibalization
2.005 (0.869)∗
3.510 (0.659)∗
2.035 (0.831)∗
3.466 (0.647)∗
θ2VP competition
0.014 (0.046)
0.032 (0.051)
0.016 (0.044)
0.037 (0.053)
θ0FC fixed
0.378 (0.212)∗
0.806 (0.248)∗
0.374 (0.212)∗
0.808 (0.247)∗
θ1FC linear
3.099 (0.436)∗
2.662 (0.405)∗
3.103 (0.436)∗
2.659 (0.405)∗
θ2FC quadratic
-0.054 (0.064)
0.085 (0.041)
-0.052 (0.063)
0.087 (0.041)
Fixed Costs:
Pseudo R-square
Log-Likelihood
Distance||PK −PK− ||
# NPL iterations
0.154
0.154
-895.5
-895.4
0.00
1
5
138
Table 3.6
Dynamic Game of Entry for McDonalds and Burger King
Under the Assumption that Players’ Beliefs are in Equilibrium
Data: 422 markets, 2 firms, 5 years = 4,220 observations
β = 0.95 (not estimated)
Two Step Estimates
NPL Estimates
Burger King
McDonalds
Burger King
McDonalds
Variable Profits:
θ0VP
0.5849 (0.1077)∗
0.8303 (0.2968)∗
1.098 (0.2169)∗
0.9737 (0.3091)∗
θ1VP cannibalization
-0.2096 (0.0552)∗
-0.0024 (0.0392)
-0.0765 (0.0725)
0.2874 (0.0986)∗
θ2VP competition
-0.0110 (0.0029)∗
0.0008 (0.0027)
-0.0129 (0.0065)∗
-0.0074 (0.0073)
Fixed Costs:
θ0FC fixed
0.0784 (0.0213)∗
0.0822 (0.0332)∗
0.0788 (0.0307)∗
0.0773 (0.0261)∗
θ1FC linear
0.0790 (0.0420)∗
0.1076 (0.0400)∗
0.1509 (0.0282)∗
0.1302 (0.0185)∗
θ2FC quadratic
-0.0078 (0.0059)
-0.0034 (0.0023)
-0.0054 (0.0026)∗
0.0001 (0.016)
Pseudo R-square
Log-Likelihood
Distance||PK −PK− ||
# NPL iterations
0.323
0.146
-655.7
-893.4
4831.26
0.00
1
31
Table 7 presents results of our test of equilibrium beliefs. We implement separate tests for MD
and BK. We impose the restrictions that beliefs for X jt = 0 and X jt = 3 are unbiased.
3.6
Conclusion
This paper considers a class of dynamic games of incomplete information where players’ beliefs
about the other players’ actions may not be in equilibrium. We present new results on identification, estimation, and inference of structural parameters and beliefs in this class of games when
the researcher does not have data on elicited beliefs. Specifically, we derive sufficient conditions
139
Table 3.7
Estimated Bias in BK Beliefs
Difference Between BMD and PMD
Stores of BK
0
1
Stores of MD
1
-0.17 (0.04)
-0.10 (0.06)
2
-0.08 (0.07)
-0.06 (0.10)
Estimated Bias in MD Beliefs
Difference Between BBK and PBK
Stores of MD
0
1
Stores of BK
1
-0.03 (0.05)
0.02 (0.04)
2
0.03 (0.10)
0.04 (0.12)
140
under which payoffs and beliefs are point identified. These conditions then lead naturally to a
two-step estimator of payoffs and beliefs, which we show can be extended to provide a sequence
of estimators with asymptotic variances and finite sample biases that decline monotonically. We
also present a procedure for testing the null hypothesis that beliefs are in equilibrium.
We illustrate our model and methods with an empirical application of a dynamic game of store
location by McDonalds and Burger King. Our main interest in this application is to explain a puzzling empirical regularity, that the probability a firm opens a new store in a local market depends
negatively on the number of stores it currently has open in the location, and does not depend
(or may even positively depend) on the number of stores its competitor currently has open in the
location. In the context of our model we explore three alternative (but not mutually exclusive) explanations for these: cross-firm spillovers, forward looking behavior, and out of equilibrium (i.e.,
biased) beliefs. We find empirical evidence for the hypotheses of forward looking behavior and
biased beliefs.
References
1. Acemoglu, Daron, Simon Johnson, James A. Robinson and Pierre Yared (2008) “Income and
Democracy" American Economic Review Vol. 98, Issue 3: 808:842.
2. Aguirregabiria, Victor (2009) “A Method for Implementing Counterfactual Experiments in
Models with Multiple Equilibria." Working Paper.
3. Aguirregabiria, Victor (2010) "Another Look at the Identification of Dynamic Discrete Decision Processes: An Application to Retirement Behavior,” Journal of Business and Economic
Statistics, 28(2), 201–218.
4. Aguirregabiria, Victor and C-Y. Ho (2009): "A Dynamic Oligopoly Game of the US Airline
Industry: Estimation and Policy Experiments," Manuscript. University of Toronto.
5. Aguirregabiria, V., P. Mira, and H. Roman (2006) “Entry, Survival and Growth in Oligopoly
Retail Markets: Explaining Cross-Industry Heterogeneity,” manuscript. University of Toronto.
6. Aguirregabiria, Victor and Pedro Mira (2002) “Swapping the Nested Fixed Point Algorithm."
Econmetrica, 70, 1519-1543.
7. Aguirregabiria, Victor and Pedro Mira (2007) “Sequential Estimation of Dynamic Discrete
Games." Econmetrica, 75, 5: 1-53.
8. Aguirregabiria, Victor and Pedro Mira (2009) "Structural Estimation and Counterfactual Experiments in Games when the Data Come from Multiple Equilibria," manuscript. Department of Economics. University of Toronto.
9. Alesina, Alberto and David Dollar (2000) “Who Gives Foreign Aid to Whom and Why?"
Journal of Economic Growth 5: 33-63.
141
142
10. Alesina, Alberto and Beatrice Weder (2002) “Do Corrupt Governments Receive Less Foreign
Aid." American Economic Review, 92, 4: 1126:1137.
11. Arrow, Kenneth and Anthony C. Fisher. “Environmental Preservation, Uncertainty and Irreversibility" The Quarterly Journal of Economics 88 (1974): 312-319.
12. Aradillas-Lopez, A. and E. Tamer (2008) "The Identification Power of Equilibrium in Games,"
Journal of Business and Economic Statistics, 26 (3), 261-283.
13. Kenneth J. Arrow and Robert M. Solow and P. R. Portney and E. E. Leamer and Roy Radner
and H. Schuman. Report of the NOAA Panel on Contingent Valuation, Federal Register 10
(1993) 58: 4601-4614.
14. Bajari, Pat, C. Lanioer Benkard and Jonathan Levin (2007) “Estimating Dynamic Models of
Imperfect Competition." Econometrica 75, 5: 1331Ű1370.
15. Becker, Gary S. “Public Policies, Pressure Groups and Dead Weight Costs" Journal of Public
Economics 28 (1985): 329-47.
16. Beresteanu, A. and P. Ellickson (2005) "The Dynamics of Retail Oligopolies," manuscript.
Duke University.
17. Besanko, D., and U. Doraszelski (2004) "Capacity Dynamics and Endogenous Asymmetries
in Firm Size," RAND Journal of Economics, 35, 23-49.
18. Besanko, D., U. Doraszelski, L. Lu, and M. Satterthwaite (2010) "On the role of demand
and strategic uncertainty in capacity investment and disinvestment dynamics," International
Journal of Industrial Organization, 28(4), 383-389.
19. William J. Clinton. Presidential Documents, Executive Order 12866 of September 30, 1993.
Regulatory Planning and Review, United States Federal Register 58, (October 4th, 1993):55135527
143
20. Camerer, C. (2003) "Behavioral Game Theory: Experiments in Strategic Interaction," Princeton University Press: Princeton NJ.
21. Collard-Wexler, A. (2008): "Demand Fluctuations in the Ready-Mix Concrete Industry," Manuscript.
New York University.
22. Diermeir, Daniel, Hulya Erslan and Antonio Merlo (2003) “A Structural Model of Government Formation," Econometrica, 71: 27-70.
23. Dunne, T., S. Klimek, M. Roberts, and Y. Xu (2009) "Entry, Exit and the Determinants of
Market Structure," Manuscript. Department of Economics. Pennsylvania State University.
24. Freixas, Xavier and J.J. Laffont, “The Irreversibility Effect" in Bayesian Models in Economic
Theory. North Holland. Amsterdam (1984).
25. Gastil, Raymond (1990) “The Comparative Survey of Freedom: Experiences and Suggestions." Studies in Comparative International Development Vol. 25, No. 1 : 25:50.
26. Goodman, Ryan and Derek Jinks (2003) “Measuring the Effects of Human Rights Treaties."
European Journal of International Law Vol 14, No 1.
27. Goldfarb, A., and M. Xiao (2009): "Who thinks about the competition? Managerial ability
and strategic entry in US local telephone markets," manuscript. Rotman School of Management. University of Toronto.
28. Grossman, Gene M. and Elhanan Helpman Special Interest Politics MIT Press. Cambridge
(2002).
29. Hafner-Burton, Emilie Marie, Mansfield, Edward and Pevehouse, Jon C. (2008) “Democratization and Human Rights Regimes." Working Paper, Available at SSRN: http://ssrn.com/abstract=1123771
30. Hathaway, Oona (2003) “The Cost of Commitment." Yale Law School, Public Law Working
Paper No. 47; Yale Law and Economics Research Paper No. 273.
144
31. Hathaway, Oona (2007) “Why Do Countries Commit to Human Rights Treaties?" Journal of
Conflict Resolution 51, 588-621.
32. Heyns, C H and Frans Viljoen (2002) The Impact of the United Nations Human Rights Treaties on
the Domestic Level, The Hague, New York.
33. Holmes, T. (2010) "The Diffusion of Wal-Mart and Economies of Density," Econometrica, forthcoming.
34. Jia, P. (2008): "What Happens When Wal-Mart Comes to Town: An empirical analysis of the
Discount Industry," Econometrica, 76, 1263-1316.
35. Laffont, J.J. and Jean Tirole A Theory of Incentives in Procurement and Regulation MIT Press.
Cambridge (1993).
36. Lewis, Tracy (1996) “Protecting the Environment When Costs and Benefits are Privately
Known" Rand Journal of Economics 27: 819-847.
37. Maddison, Angus (2003) The World Economy: Historical Statistics, Development Centre of the
Organization for Economic Cooperation and Development, Paris, France.
38. Magnac, T. and Thesmar, D. (2002) "Identifying Dynamic Discrete Decision Processes," Econometrica, 70, 801-816.
39. Marshall, Monty G. and Keith Jaggers (2004) “Political Regime Characteristics and Transitions, 1800-2002," Polity IV Project. University of Maryland.
40. Mazzeo, M. (2002): "Product choice and oligopoly market structure," Rand Journal of Economics, 33(2), 221–242.
41. Merlo, Antonio (1997) “Bargaining over Governments in a Stochastic Environment," Journal
of Political Economy 105: 101-131.
145
42. Moravcsik, Andrew (2000) “The Origins of Human Rights Regimes: Democratic Delegation
in Post War Europe," International Organization 54,2: 217-252.
43. Nishida, M. (2008): "Estimating a model of strategic store network choice," Working Paper,
University of Chicago.
44. Olson, Mancur Jr. The Logic of Collective Action, New Haven: Yale University Press, 1982.
45. Pakes, Ariel and Paul McGuire (1994) “Computing Markov-Perfect Nash Equilibria: Numerical Implications of a Dynamic Differentiated Product Model" RAND Journal of Economics
25,4: 555-589.
46. Pakes, Ariel, Michael Ostrovsky and Steve Berry (2007) “ Simple estimators for the parameters of discrete dynamic games (with entry/exit examples)" The RAND Journal of Economics
38,2:373Ű399.
47. Pesendorfer, M. and Schmidt-Dengler (2008) "Asymptotic Least Squares Estimators for Dynamic Games," The Review of Economic Studies, 75, 901-928.
48. Posner, Richard A. “Cost-Benefit Analysis: Legal, Economic, and Philosophical Perspectives" The Journal of Legal Studies 29 (2000): 1153-1177
49. Roodman, David (2006) “An Index of Donor Performance" The Center for Global Development, Working Paper 67.
50. Samuelson, P. (1938) "A Note on the Pure Theory of Consumers’ Behaviour," Economica, 5,
61-71.
51. Seim, K. (2006) "An Empirical Model of Firm Entry with Endogenous Product-Type Choices,"
The RAND Journal of Economics, 37(3), 619–640.
52. Stigler, George J. “The Theory of Economic Regulation " The Bell Journal of Economic Management and Science 2 (1971): 3-21.
146
53. Suzuki, J. (2010) "Land Use Regulation as a Barrier to Entry: Evidence from the Texas Lodging Industry," manuscript. Department of Economics. University of Toronto.
54. Tamer, E. (2003) "Incomplete Simultaneous Discrete Response Model with Multiple Equilibria," Review of Economic Studies, 70, 147-165.
55. Toivanen, O., and M. Waterson (2005): "Market Structure and Entry: Where’s the Beef?,"
RAND Journal of Economics, 36(3), 680-699.
56. Von Stein, Jana (2005) “Do Treaties Constrain or Screen? Selection Bias and Treaty Compliance." The American Political Science Review, 99, 4: 611-622.
57. Wacziarg, Romain and Karen Horn Welch (2003) “Trade Liberalization and Growth: New
Evidence." World Bank Economic Review 22, 2: 187-231.
58. Wagner, Ulrich (2008) “Estimating Strategic Complementarities in a Dynamic Game of Timing: The Case of the Montreal Protocol." Working Paper.
59. “Handbook for European Union Election Observation," European Commission (2008). Available at: http : //ec.europa.eu/externalr elations/humanr ights/eue lection a sso bserv/docs/handbook e n.pd f
60. Walrath, M. (2008) "Religion as an industry: estimating a strategic entry model for churches,"
manuscript. Department of Economics. University of St. Thomas.
61. Xu, D. (2008) "A Structural Empirical Model of R&D, Firm Heterogeneity, and Industry Evolution," Manuscript. New York University.
62. Zhu, T. and V. Singh (2009) "Spatial competition with endogenous location choices: An application to discount retailing," Quantitative Marketing and Economics, 7, 1–35.
View publication stats