Academia.eduAcademia.edu

No evidence on directional vs. proximity voting

1999, Political analysis

The directional and proximity models offer dramatically different theories for how voters make decisions and fundamentally divergent views of the supposed microfoundations on which vast bodies of literature in theoretical rational choice and empirical political behavior have been built. We demonstrate here that the empirical tests in the large and growing body of literature on this subject amount to theoretical debates about which statistical assumption is right. The key statistical assumptions have not been empirically ...

No Evidence on Directional vs. Proximity Voting The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Citation Lewis, Jeffrey, and Gary King. 1999. No evidence on directional vs. proximity voting. Political Analysis 8(1): 21-33. Published Version http://pan.oxfordjournals.org/ Accessed August 9, 2015 12:01:22 AM EDT Citable Link http://nrs.harvard.edu/urn-3:HUL.InstRepos:4125046 Terms of Use This article was downloaded from Harvard University's DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-ofuse#LAA (Article begins on next page) Political Analysis, 8:1 No Evidence on Directional vs. Proximity Voting Jeffrey B. Lewis Princeton University Gary King Harvard University The directional and proximity models offer dramatically different theories for how voters make decisions and fundamentally divergent views of the supposed microfoundations on which vast bodies of literature in theoretical rational choice and empirical political behavior have been built. We demonstrate here that the empirical tests in the large and growing body of literature on this subject amount to theoretical debates about which statistical assumption is right. The key statistical assumptions have not been empirically tested and, indeed, turn out to be effectively untestable with existing methods and data. Unfortunately, these assumptions are also crucial since changing them leads to different conclusions about voter decision processes. 1 Introduction Since Rabinowitz and Macdonald (1989) first proposed their “directional” spatial voting model, over 25 scholarly articles have been devoted to the subject. While some of these papers have sought to articulate the implications of the theory (Merrill 1993) or to generalize it by, for example, explicitly incorporating uncertainty into the model (Macdonald and Rabinowitz 1993b), most debate its empirical merits. Does it work in the United States, France (Pierce 1997), Norway (Macdonald et al. 1991; Westholm 1997), Sweden (Gilljam 1997), Germany (Kramer and Rattinger 1997), or The Netherlands (Macdonald and Rabinowitz 1996a)? Does it explain roll call voting decisions in the U.S. Congress (Platt et al. 1992)? Does it describe all voters or only unsophisticated voters (Macdonald et al. 1995b)? Beyond its theoretical merits, interest in the directional model has been sustained by the prominence of the alternative against which it has generally been tested. What contributors to the directional voting literature refer to as the “proximity model” has so dominated the research on spatial representations of voter preferences that it is generally referred to as “the spatial theory of voting” (see Hinich and Enelow 1984). The familiar spatial model has a long theoretical and empirical tradition. Well-known results including the Median Voter Authors’ note: Our thanks goes to Jim Alt, Mo Fiorina, Torben Iversen, Stuart Elaine Macdonald, George Rabinowitz, and Anders Westholm for helpful comments, and the National Science Foundation (SBR-9729884), the Centers for Disease Control and Prevention (Division of Diabetes Translation), the World Health Organization, and the National Institute of Aging for research support. Copyright 1999 by the Society for Political Methodology 21 22 Jeffrey B. Lewis and Gary King Theorem of Black (1958) and Hotelling (1929), the McKelvey (1976) Chaos Theorem, Shepsle and Weingast’s (1981) Structure-Induced Equilibrium, and Poole and Rosenthal’s (1985) Nominate Scores are predicated on the idea that politics can be represented as a Euclidean space (the proximity model) and not an inner product space (the directional model).1 The contributors to the literature on directional versus proximity voting are fighting over two central political science issues: our understanding of a basic feature of the political world—how voters make decisions—and a prominent aspect of our data collection strategies—how randomly chosen respondents answer imprecisely worded survey questions. On one hand, we have a group of scholars who have championed directional theory for a decade, and on the other we have a large body of theoretical and empirical rational choice research that rests on the veracity of the proximity model. Despite substantial effort and much thoughtful analysis, the scholarly community does not seem much closer to a resolution. We attempt to explain why this debate persists. Our purpose is not to declare a winner, or to play Ted Koppel’s role on Nightline (“Did you hear what he said about your mother?”), but rather to clarify the complicated issues involved in testing these theories and to lay bare the assumptions one would need to conclude that one side, rather than the other, is correct. Our main argument is that the existing data contain insufficient information with which to distinguish the two theories. The result is that a large and supposedly empirical literature, designed to answer the empirical question of how voters make decisions, too often amounts to data-free debates about which untested assumption is right. More specifically, we demonstrate that (1) in the most general formulation, support for each model is marginal at best; (2) each set of authors is able to produce results that seem to favor their position only by making (and justifying in a variety of interesting theoretical ways) different untested methodological assumptions; and (3) the data are not sufficiently rich to allow an appropriate test that can distinguish between the opposing assumptions employed by each. Despite much hard work, it is still not possible to decide who is right using the kinds of data and methodologies that have been employed in the literature. For simplicity, our references to the literature draw on the two most recent articles in this debate when possible. That is, we take Westholm (1997) as our exemplar of the defenders of the proximity model and Macdonald, Rabinowitz, and Listhaug (1998) (hereafter MRL) as our exemplar of the defenders of directional theory. Using identical data, the authors find apparently equally strong but opposing support for their favored model. 2 Definitions One way to interpret the common starting point of both theories of voting is by the common assumption that candidates for office can be arrayed on an underlying dimension, such as liberal to conservative.2 Figure 1 gives an example with some well-known senators from the 104th congress.3 1 References to many notable empirical tests of the spatial voting model in the context of electoral competition are given by Hinich and Enelow (1984, 1989). Countless papers have assumed the spatial model in testing other hypotheses. 2 In many applications, candidates are ordered in multidimensional space. We use one dimension to clarify the underlying concepts. For expository purposes, here and elsewhere, we use the simplest possible examples and forms of the theories at issue. Both bodies of literature elaborate in many useful ways, but the elaborations do not change the essential points. 3 We took the exact ideological positions from Poole and Rosenthal (1997) but the same ideas (and, in the case of well-known senators, nearly the same positions) apply regardless of method. The latter is important since Poole and Rosenthal’s methods happen to be based on proximity theory. Directional vs. Proximity Voting 23 Fig. 1 Senators on an ideological dimension: the dimension ranges from the most liberal at the left to the most conservative at the right. A few senators and “YOU” are located as filled circles on the dimension. The small vertical bar at the middle represents the neutral point required under directional theory. Figure 1 is a familiar representation of an ideological issue space, ranging from Edward Kennedy at the most liberal end to Jesse Helms at the most conservative. For illustrative purposes, we have also added “YOU” to this dimension, with preferences slightly to the left of center. According to the proximity model, voters prefer candidates closest to them on this dimensional scale. So, according to the model, You would prefer Olympia Snowe as your representative to any of the other senators. Also, the farther away from your position, the less you like the representative. Hence, after Snowe, You prefer Nancy Kassebaum, then Sam Nunn, Edward Kennedy, and, finally, Jesse Helms. In contrast, under the directional model, voters prefer candidates “on their side,” and the more on their side the better. A “side” might be a politically meaningful grouping such as a political party or a like-minded set of people who agree with some basic ideological premises. In the figure, “sides” are determined by your position relative to the neutral point (marked by the vertical line in Fig. 1). Thus, according to the model, You prefer those to the left of zero more than those to the right, because You are on the left. Moreover, You have the strongest intensity of preferences for the extreme candidates on your side because they are most clearly members of your “team.” Thus You and everyone else located on the “Liberal” side have the same preference ordering: Kennedy, Nunn, Snowe, Kassabaum, Helms. However “sides” are defined, the position of the neutral point is critical in directional theory (and irrelevant in proximity theory). MRL exclude from this scheme truly extremist candidates who fall outside of what they call the “region of acceptability” that they use to define mainstream politics. The elected U.S. senators who make up our current example all probably fall within the acceptable region. Thus, under directional theory, voters would penalize more extreme candidates (for example, perhaps, the Right-to-Life Party candidates in New York State or the occasional Communist Party candidate elsewhere). In some formulations, directional theory describes something closer to the respondent’s answer to a survey question than a traditional ideological scale. In that situation, the idea is that people decide only what side they are on and how intense they feel, in which case the underlying ideological dimension in Fig. 1 has a different interpretation than it has in a traditional spatial modeling framework. Indeed, whether respondents use the intermediate values of issues scales to convey their preferences for intermediate policy positions or to reflect the strength of their commitments to one or the other of the positions that anchor the scale is central to this debate. Whether the empirical tests presented below and elsewhere in the literature tell us more about how respondents conceive of Likert scales or how they conceive of politics more generally is of course unclear. The debate in the literature is focused on whether it is the proximity or directional model that best explains voter behavior. However, implicit in this debate is the simultaneously determined question of what dimension (or dimensions) people use to decide. This makes the question of proximity vs direction conditional on what dimension is being used by voters 24 Jeffrey B. Lewis and Gary King or analyzed by researchers, since it could be the case that people follow the proximity model on some dimensions and the directional model on others.4 3 A Formal Statement of the Models In this section we formally define the directional and proximity models of voter decision making. We also introduce a general model that includes both as special cases. This model helps us clarify the theories in this section and the methods and assumptions necessary to produce valid empirical tests, in the next. Let vi be the numerical position of voter i (for i = 1, . . . , n) on some issue or ideological dimension like that in Fig. 1 (where the underline points out the mnemonic labeling convention we adopt). Also, let cij be the ideological position of candidate (or party) j on the same dimension as presented to voter i. For this conceptual section, we treat vi and cij as theoretical quantities that could be measured without error. Even without considering measurement error or psychological concepts such as perception, ci j may in fact vary over i if the candidates present themselves differently to different groups, which is after all the point of many targeted campaign appeals. The neutral point is defined as zero. While they differ as to how voter utility is defined, both sides agree that utility theory governs actual decision making. So a voter would choose to cast a ballot for candidate 1 over candidate 2 if the voter’s utility for 1 is higher than for 2. What distinguishes the various models of voter decision making is the voter’s utility function. Under the directional model, the utility of voter i for candidate j is Uijd = α d + β d (vi cij ) (1) where α d and β d (>0) are unknown constants. This equation indicates that when a voter and a candidate are on the same side of the neutral point, the voter’s utility under the directional model is lowest when the voter or the candidate is near the neutral point (i.e., zero) and highest when both are on the extreme of the same side of the neutral point. The second term is negative if the voter and candidate are on opposite sides and is increasingly negative as the voter or the candidate become more extreme. [We sidestep concerns about the region of acceptability in the analyses below by also running a version of our specifications that drop extremist parties; so for simplicity, Eq. (1) excludes penalties for candidates that fall outside this region, as would otherwise be required by directional theory.] Under the proximity model, the voter’s utility is most commonly written as p Uij = α p − β p (vi − cij )2 = α p − βvi2 − β p cij2 + β p (2vi cij ) (2) where β p > 0. Eq. (2) defines a voter’s maximum expected utility as occurring when the voter and candidate are at the same position. The less proximate the two positions are, the lower the voter’s expected utility.5 4 Almost all formal theory work assumes the proximity model. Two interesting papers by Merrill (1993, 1995) explore the theoretical consequences for this literature of assuming instead that the directional model applies. 5 Sometimes the absolute (or “city block”) rather than the squared (“Euclidean”) distance between v and c is i ij used as a distance measure, but these rarely differ empirically by very much. We choose the squared Euclidean distance because it is more commonly chosen and allows us to nest the directional and proximity models within our “general” model. A similar (though slightly better) fit is achieved using Euclidean distances rather than squared Euclidean distances. Note, also, that α and β have different meanings in Eqs. (1) and (2). Directional vs. Proximity Voting 25 Our general encompassing model comes from letting the three β’s in the second line of Eq. (2) differ: g Uij = α − βv vi2 − βc cij2 + β2 (2vi cij ) (3) The special cases of this general model produce the two behavioral models at issue. The g general model equals the directional model (i.e., Uij = Uijd ) when βv = βc = 0 and β2 > 0. p g The general model specializes to the proximity model (Uij = Uij ) when βv = βc = β2 > 0. Unlike the directional and proximity models, Eq. (3) is not meant as a theory of voter behavior, only as a device to understand and subsequently evaluate the two behavioral models as special cases.6 In the simplest possible case, we have only two parties and one issue. If we had a survey with n interviews, our data set constructed for the analysis would have 2n observations (n for each party).7 The effect of additional issues on utility is assumed here and elsewhere in this literature to be additive. Each additional issue enters the model in the same way as the first and receives its own value of vi , cij , and (2vi cij ). 4 Alternative Assumptions for Empirical Testing We now describe three alternative sets of assumptions under which it is possible to estimate the parameters of the model and hence distinguish between the proximity and directional models. Most applications in this literature measure a voter’s utility for a candidate or party with a “feeling thermometer,” a survey question resulting in a scale from 0 (feeling “cold” about a candidate) to 50 (the neutral point) to 100 (a “warm” feeling). The voter’s position vi and the candidate’s position as presented to each voter cij are measured with 7- or 10-point Likert issue scale questions put to voters. Since the parameters of interest are a linear function of squares and products of variables, they can be estimated by least squares regression. 4.1 Optimistic Assumptions Suppose we were willing to believe that all the methodological problems one might raise were sufficiently minor such that, if ignored, they would not change our conclusions about which theory is correct, or, if corrected, the estimates would be no better. This is a useful starting point in most analyses, especially for understanding the relationships in the data and how imposing alternative assumptions affects these relationships. In the present case, an optimistic approach to estimation means we could estimate α, βv , βc , and β2 in Eq. (3) by regressing the feeling thermometer on a constant term, vi2 , cij2 , and (2vi cij ). To provide a feel for the kind of information in the data, we implemented this with the 1989 Norwegian Election Study,8 the same data set used by Westholm and MRL 6 Rabinowitz and Macdonald’s (1989) so-called “mixed” model is also a special case of our general model, when βv = βc and β2 > 0. Its authors sometimes think of the mixed model as useful for understanding and statistical testing, as we do our encompassing model, but they sometimes also treat it as a model of voter behavior in its own right. 7 With more than one issue, our general model becomes U g = α − v′ β v − c′ β c + 2v′ β c , where c and v ij i v i ij c ij i 2 ij are now vectors and the β’s are now diagonal matrices. 8 Data distributed by the Norwegian Social Science Data Services (NSD). Bernt Aardal and Henry Valen were the Principal Investigators and the Norwegian Gallup Institute collected the data. 26 Jeffrey B. Lewis and Gary King (N = 8833).9 (We also replicated the same analyses in U.S. data, which we do not present here; the results are highly consistent with the analyses and conclusions below.10 ) We did the estimation for all seven political parties and six issues (a general left–right scale, the environment, agriculture, immigration, health policy, and alcohol restriction). Continuing the notation developed above, denoting the feeling thermometer scores of voter i evaluating candidate j as Tij and indexing each of the issues by k, we estimate the following equation by least squares: Tij = α − 6 X k=1 2 βvk vik − 6 X βck ci2jk + k=1 6 X β2k (2vik ci jk ) + ǫij (4) k=1 With six issues, we have six βv ’s, βc ’s and β2 ’s to estimate—one for each corresponding vi2 , cij2 , and (2vi cij ). The results, which appear in Table 1, indicate that βv is usually fairly close to zero, but βc and β2 are both large and positive.11 Thus, before correcting any methodological problems, the results do not provide unambiguous support for either the directional model (which requires βv = βc = 0 and β2 > 0) or the proximity model (which requires βv = βc = β2 > 0). If the results in Table 1 are so mixed, how are Westholm and MRL able to draw such strong and opposing conclusions? The answer depends critically on their differing methodological assumptions and resulting statistical corrections, a subject to which we now turn. 4.2 Assumptions that Favor Directional Theory MRL worry that candidate issue placements, as measured, may be endogenous to the feeling thermometer and thus may be systematically biased. To correct for this problem, MRL12 replace each respondent’s perception of a candidate Pn issue position cij with the average issue cij /n. If the true value of cij is indeed position in the sample for each party, c j = i=1 9 MRL (1998) argue that Norway’s Socialist Left and the Progressive parties are outside of the “region of acceptability” and thus voters will not use directional logic when evaluating them. Instead, these parties suffer some “penalty” for their extreme views. Rather than trying to model the “penalty” directly, we refit all of the models in this paper after excluding these parties. The substantive results given in the text are not altered when these parties are omitted, though the number of issues that can be considered is reduced by two due to an identification problem described below. The number of observations is n = 2198 respondents × 7 parties − missing party-respondent pairs = 8833. 10 To be specific, we replicated the analysis with U.S. data using the 1948–1996 NES Cumulative Data File (ICPSR Study No. 8475). In this replication, we used the feeling thermometer scores for all major party presidential candidates from 1972 to 1996 and placements on the liberal–conservative and government guaranteed employment 7-point scales. We choose these two scales because they were included on all NES surveys in the 1972 to 1996 period. As with the Norwegian data, the estimated βc ’s are similar in magnitude to the β2 ’s, while the βv ’s are small. Similar results were obtained using the other issue scales included in the data set (equality for women, relations with Russia, minority aid, social services spending, and defense spending) for the subsets of years in which they were asked. Tables and program files for this replication are available in the replication data set that accompanies this article. 11 The average standard errors given in Table 1 are generated by the usual OLS estimator. One might be concerned that the regression errors are not independent across observations involving the same respondent. If this sort of dependence is present, the OLS standard error estimates are biased and inconsistent. We also employed a Huber sandwich-type estimator that is consistent in the presence of within-respondent dependence. The standard errors estimated in this way were in all cases within ±0.01 of the reported standard errors. 12 MRL argue that, without this correction, projection effects bias the results toward the Euclidean model. [Projection is the tendency for voters who are uncertain of the parties’ issue positions to locate parties they prefer close to their own position (see Brody and Page 1972).] Westholm (1997) argues that it is somewhat inconsistent to assert that voters would locate candidates they prefer close to themselves, but when assuming the directional 27 Directional vs. Proximity Voting Table 1 Estimates of the general model: Roughly speaking, β̂v ≈ 0, β̂c > 0, and β̂2 > 0a Issue Left–right Agriculture Environment Immigration Alcohol Health (Average SE) β̂v −0.15 −0.03 0.04 −0.10 −0.08 0.04 (0.03) β̂2 β̂c 0.32 0.28 0.13 0.29 0.28 −0.23 (0.03) 0.64 0.22 0.23 0.16 0.24 0.23 (0.02) a For this table, and all that follow, N = 8833. constant over i, this procedure averages away much of the measurement error and may go some way toward solving the methodological problem. However, if candidates present different ideological positions to different groups of voters (or different voters receive different messages), so that cij is not constant, the procedure will introduce measurement error. Whether or not cij is in fact constant over i, replacing it with c j has important side effects for estimation. Table 2 repeats the regression in Table 1 with these mean party placements. The results for this table are qualitatively the same, except that estimates of βc are now much less precisely Table 2 Estimates of the general model with mean party placements: Roughly speaking, β̂v ≈ 0, β̂c > 0, and β̂2 > 0a Issue Left–right Agriculture Environment Immigration Alcohol Health (Average SE) β̂v β̂c β̂2 0.02 −0.03 0.01 0.02 −0.09 0.11 (0.03) −0.46 −0.17 −0.26 2.78 −0.31 −1.47 (0.11) 0.90 0.25 0.18 0.35 0.29 0.25 (0.02) a Note that compared to the estimates given in Table 1, βc is much less precisely estimated. model, voters would not in general prefer candidates that are in close proximity to themselves. In any event, Merrill and Grofman (1997) demonstrate empirically that the projection effects are modest. A second reason MRL (1998) give for using mean party location is that each party should be able to take only a single position. Thus, they argue that variation in a party’s location across respondents must be attributable to measurement error. Others, such as Enelow and Hinich (1994), argue that this interrespondent variation in party placements is the result mainly of idiosyncratic misperceptions that do bear on voter’s utilities. Aldrich and McKelvey (1977) suggest that the individual variation in party placements is attributable largely to interrespondent incomparability in the issue scales (that is, a 3 on one respondent’s scale might correspond to a different policy position than a 3 on another respondent’s scale). Aldrich and McKelvey develop a clever technique that rescales respondents’ placements onto a common metric, an approach that is preferable to the mean placement method and may be a reasonable approach to take in future work. However, since the Aldrich and McKelvey method yields unique party locations, it would also fall prey to the identification problems described in the text. 28 Jeffrey B. Lewis and Gary King Table 3 The MRL approach, mean party placements and fixed effects: Under the general model, candidate fixed effects make βc inestimablea General Issue Left–right Agriculture Environment Immigration Alcohol Health (Average SE) β̂v β̂c β̂2 Directional, β̂2 | (β̂v = β̂c = 0) Proximity, β̂2 | (β̂v = β̂c = β̂2 ) 0.02 −0.03 0.01 0.02 −0.09 0.11 (0.03) ? ? ? ? ? ? 0.90 0.25 0.18 0.35 0.29 0.25 (0.02) 0.90 0.26 0.18 0.35 0.29 0.25 (0.02) 0.73 0.14 0.08 0.18 0.16 0.25 (0.02) 19.90 0.36 20.93 0.29 19.87 0.36 σ̂ R2 a The directional model restrictions fit the data slightly better than the proximity model. estimated, more variable across issues, and overall less clearly distinguishable from zero. Since cij was replaced by the statistically less variable and hence less powerful c j , this should come as no surprise. Moreover, since party location variables are fixed for each party, as the number of issues increases, the estimates of βc are increasingly correlated with any party specific effect on the feeling thermometer that might be omitted from the model (as discussed below), making the estimation of βc far more difficult. In fact, the number of parties must exceed the number of issues or βc would be inestimable. Nevertheless, with this methodological adjustment, the results now provide somewhat greater support for the directional theory of voting—the restriction that βc = βv = 0 is more consistent with the data. MRL also worry that there might be omitted party-specific attributes that affect feeling thermometer scores, a standard statistical problem.13 To correct for these possible omitted party attributes, MRL let α vary over the parties and so estimate α j for all j (a so-called party “fixed-effects” model). Unfortunately, these effects are now perfectly collinear with the squared party positions c j and so βc is no longer estimable.14 This is indicated by the question marks in the third column in Table 3, where we estimated the general model with mean party placements and fixed effects. The last two columns in Table 3 specialize our general model under the competing assumptions of the directional and proximity models. The fit statistics indicate that the directional model fits slightly (although “significantly”) better than the proximity model. In the directional model, the restriction that βv = 0 has little effect since the unrestricted 13 For example, parties involved in scandal may be given relatively low evaluations by all voters, ceteris paribus. can show more intuitively why the party locations are linearly dependent on the set of party fixed effects. That is, one way to formulate the fixed effects model is to transform the data by mean deviating all the variables in the data by their within-party means. First, write each observation as 14 We Tij − T̄ j = 6 X k=1 6 6 ¢ X ¡ ¢ X ¡ 2 2 β2k 2(vik c jk − vik c jk ) + e − ē j βck c2jk − c2̄jk + + − v̄ik βvk vik k=1 k=1 where the bars over the variables indicate their within-party means. However, since c2jk does not vary across voters, and c2jk − c2̄jk = 0 for all parties j and issues k, the value of βck will have no effect on the fit of the model. Directional vs. Proximity Voting 29 estimate in the second column is always approximately zero. Since βc is inestimable, the directional model restriction that βc = 0 and the proximity model restriction that βc = βv have no effect whatsoever. Thus, under this set of methodological assumptions, the proximity model restriction that βv = β2 does have an effect, as it slightly reduces the fit of the model. On this basis, then, MRL conclude that the directional model outperforms the proximity model. 4.3 Assumptions that Favor Proximity Theory Unlike MRL, Westholm and most other defenders of proximity theory are not concerned with problems that might arise from the use of respondents’ idiosyncratic party placements. However, Westholm finds other faults with what we have called the standard “optimistic assumptions.” Westholm argues that feeling thermometers should not be directly compared across respondents.15 The incomparability of utility measures, either observed as in this case or latent as in many choice models, is a serious obstacle to the study of political or consumer choice. Standard utility theory explicitly assumes that any two individuals’ utility functions need not be comparable. This incomparability is often unproblematic (though sometimes inconvenient) for developing theoretical results, but for the empirical testing of utility models it is often devastating. Very rarely are we able to observe the same individual in a large number of choice settings. Rather we generally observe only interpersonal, rather than intrapersonal, variation in choice setting. That is, we observe different individuals in different circumstances choosing over the same alternatives. Therefore, we are forced to adopt the dubious “representative consumer” assumption (see Hausman and Wise 1978, pp. 406–407; Kirman 1992). In other words, we treat a set of observations on choices (or, in this case, feeling thermometer ratings) made by different individuals as if they were the choices of a single “average” or “representative” individual. If we observed only one utility value per respondent, we would have little choice but to treat the feeling thermometer scores as directly comparable across respondents. However, because in the present case multiple feeling thermometer ratings are recorded for each respondent, the feeling thermometer scores need not be treated as directly comparable. Westholm uses this panel-like structure of the data to relax the representative consumer assumption. By allowing α to vary by respondent, Westholm’s method effectively treats differences in, and not levels of, feeling thermometers as comparable across respondents.16 As is often the case, the relaxation of an assumption comes at a cost.17 Specifically, by assuming that feeling thermometer scores are strictly comparable across repeated measurements within and across individuals for levels and differences, MRL are able to test implications of the proximity model such as “A voter should be indifferent between being 15 In the simplest case, this would mean that the α’s and β’s vary across respondents. More generally, any monotonic transformation of utility (in this case, as measured by a feeling thermometer) represents the same preferences. Brady (1989) considers the problem of interpersonal comparability of feeling thermometer scores in the context of the traditional spatial model. Brady’s model treats the feeling thermometers as ordinal measures. Transforming the feeling thermometer scores to ranks and applying econometric techniques appropriate for ranked data is a potentially promising line of future research in the directional voting literature. However, such methods would, like Westholm’s method, leave βv unidentified. 16 Variation in the α’s could be introduced by the inclusion of random rather than fixed effects. However, this would require an ex ante assumption to be made about the correlation between the random effects and the independent variables. The usual assumption is that the effects are independent of the regressors. Using a Hausman-type test, we can reject (χ 2 (12) = 115.7) the notion of independent random effects in these data (conditional on either the directional or proximity model). 17 It should be noted that Westholm’s method does in fact require the comparison of differences in feeling thermometer scores across respondents. His analysis is not based solely on intrarespondent utility. 30 Jeffrey B. Lewis and Gary King Table 4 The Westholm approach, voter fixed effects: Under the general model, voter fixed effects make βv inestimablea General Issue β̂v Left–right Agriculture Environment Immigration Alcohol Health (Average SE) ? ? ? ? ? ? β̂c 0.37 0.32 0.17 0.24 0.35 −0.21 (0.04) 18.15 0.50 σ̂ R2 β̂2 Directional, β̂2 | (β̂v = β̂c = 0) Proximity, β̂2 | (β̂v = β̂c = β̂2 ) 0.61 0.22 0.23 0.18 0.25 0.25 (0.02) 0.60 0.29 0.31 0.14 0.26 0.27 (0.02) 0.53 0.22 0.20 0.18 0.26 0.21 (0.02) 19.08 0.43 18.57 0.47 a The proximity model restrictions fit the data slightly better than than the directional model. a strong conservative with Barry Goldwater as president and being a strong liberal with Lyndon Johnson as president.” This prediction is somewhat forced since people normally cannot “choose” to be liberals or conservatives in the same way that they can choose to support Johnson or Goldwater, but nothing in choice theory prevents individuals from having preferences over outcomes they cannot achieve. Just as one might prefer to have been born healthy, wealthy, and wise, one can also prefer to be a liberal in Sweden or a conservative in Utah. Since voter issue positions are fixed across candidates in his model, and he rules out direct comparison of two voters’ evaluations of the same candidate, Westholm’s relaxation of the strong incomparability assumption removes the restriction placed on βv .18 Since the estimates of βv in our general model are near-zero, and the proximity model previously required βv = βc = β2 , removing the restriction on βv greatly increases the empirical fit of the proximity model while making it harder to distinguish it from the directional model. To rule out direct comparisons of the utility levels across voters while still allowing utility differences to be comparable, Westholm includes voter “fixed effects.” This involves estimating a separate αi for each voter (which is equivalent to including a dummy variable for each voter). Since vi2 is fixed over i, αi and βv (the coefficient on vi2 ) are not separately identified. (For example, the estimate of βv could be doubled without changing voters’ predicted feeling thermometer scores for any candidate simply by subtracting βv vi2 /2 from each of the previously estimated αi ). Table 4 provides estimates under Westholm’s assumptions. In our general model, βv is inestimable and hence represented by question marks. Respondent fixed effects have little 18 Because the inclusion of voter fixed effects is equivalent to mean-deviating the data by within-voter means, and because voter positions vik are fixed across parties, we encounter the same identification problem for βv that we described above for βc with included party fixed effects. Using the mean-deviated forms and letting the bars over the variables represent within-voter means, we would have Tij − T̄ j = 6 X k=1 6 6 ¡ ¢ X ¢ X ¡ 2 2 + βck c2jk − c2jk + β2k 2(vik c jk − vik c jk ) + e − ē j − vik βvk vik k=1 k=1 Because vik is fixed for each voter i, vik − v̄ik = 0 for all voters i and βv is not identified. Directional vs. Proximity Voting 31 effect on estimates of βc (compare column 3 in Tables 4 and 1). Since it is not identified, the different restrictions on βv under the directional and proximity models have no effect on estimation. However, since unrestricted estimates from the general model of βc and β2 are both large, the restriction under the proximity model (βc = β2 ) outperforms the restrictions under the directional model (βc = 0), as indicated by the fit statistics in the last two columns. 5 Concluding Suggestions Do voters decide based on proximity or direction? Like all inferences, any decision requires making some untestable assumptions. In this case, the alternative possible methods used to relax the most questionable of these assumptions are far from innocuous. Which assumptions we choose to relax depends on which seem more important. Westholm and MRL make very strong, and diametrically opposed, cases for relaxing different assumptions. They are both right to a degree, in that each of the methodological problems raised could seriously bias the results if left untreated. However, the data available are insufficient to relax all the assumptions at issue in a single model. It is unfortunate that the scholarly community is left with theoretical analyses for deciding these empirical questions, but until survey researchers or experimentalists produce better measurement devices, or political methodologists generate better methodological approaches, the impass will remain. We now offer some suggestions for future researchers looking for ways to put this debate on empirically solid microfoundations. 1. It may be possible to design a survey experiment where vi and cij are manipulated exogenously without error so that Eq. (3) is estimable without assumptions. This would seem to be an important component of any satisfactory solution, since if we cannot design an experiment—even in principle—to answer the critical questions at hand, there is little hope of making inferences from observational data. 2. Scholars could work to demonstrate precisely how feeling thermometer scales are predicted by demographic variables. This may undermine Westholm’s claim that they are interpersonally incomparable. 3. Work by Grofman et al. on “projection bias” might be extended, which might be seen as undermining MRL’s assumptions. 4. It would be very helpful to extend studies that use intensive interviews or “debriefings” with survey respondents to study precisely how people think about, explain, and understand our survey questions (e.g., Kaiser 1984). This kind of work is typically done when pretesting survey instruments, but feeling thermometers and the various issue questions were never designed to test directional and proximity theories. 5. Much of the debate surrounding directional versus proximity voting comes down to disagreements about the appropriateness of various additions to the basic parametric model or to the parameterization itself. For example, some have wondered if, in the operationalized proximity model, the feeling thermometer should be proportional to a Euclidean distance or “city block” metric, rather than to a squared Euclidean distance as we (and most others) have assumed. Thus, another fruitful approach may be to test these theories semi- or nonparametrically (e.g., Pierce 1993). One such approach we have explored is to model a person’s feeling thermometer answers on a set of questions as ordinal rankings (e.g., Bloom and Cavanagh 1986). This type of model entails some interesting statistical issues that we hope to pursue in future research. 32 Jeffrey B. Lewis and Gary King The body of literature attempting to distinguish proximity vs directional voting has been growing fast. Yet in our view, this scholarship has been confined to a relatively narrow methodological tradition, and scholars have analyzed only a few types of data sets. Unfortunately, there exists essentially no evidence within this tradition and these data to distinguish between the two models. Perhaps as a consequence, other scholars have probably paid insufficient attention to this controversy. However, the issue is clearly critical, as it potentially threatens to undermine a large fraction of the work on formal rational choice and empirical voting behavior. That there exists no evidence for one of the most elementary assumptions underlying work in these fields is worthy of much future attention. We hope that our research leads future researchers to pursue these important questions more productively. References Aarts, Kees, Stuart Elaine Macdonald, and George Rabinowitz. 1996a. “Issue Competition in the Netherlands.” Presented at the European Consortium for Political Research Joint Sessions of Workshops, Oslo, Norway. Aarts, Kees, Stuart Elaine Macdonald, and George Rabinowitz. 1996b. “Issue Competition and Party Support in the Netherlands.” Presented at the Annual Meetings of the Political Science Association, San Francisco, CA. Aldrich, John H., and Richard D. McKelvey. 1977. “A Method of Scaling with Application to the 1968 and 1972 Presidential Elections.” American Political Science Review 71(1):111–130. Black, Duncan. 1958. Theory of Committees and Elections. Cambridge: Cambridge Univerisity Press. Bloom, David E., and Christopher L. Cavanagh. 1986. “An Analysis of the Selection of Arbitrators.” The American Economic Review 76(3):408–422. Brady, Henry. 1989. “Factor and Ideal Pont Analysis for Interpersonally Incomparable Data.” Psychometrika 54(2):181–202. Brody, Richard A., and Benjamin J. Page. 1972. “Comment: The Assessment of Policy Voting.” American Political Science Review 66(2):450–458. Dow, Jay. 1998. “Directional and Proximity Models of Voter Choice in Recent U.S. Presidential Elections.” Public Choice 96:259–270. Downs, Anthony. 1957. An Economic Theory of Democracy. New York: Harper & Row. Enelow, James M. 1989. ”The Location of American Presidential Candidates: An Empirical Test of a New Spatial Model of Elections.” Mathematical Computational Modeling 12:461–470. Enelow, James M., and Melvin J. Hinich. 1994. “A Test of the Predictive Dimensions Model in Spatial Voting Theory.” Public Choice 78:155–169. Gilljam, Mikeal. 1997. “The Directional Theory Under the Magnifying Glass: A Reappraisal.” Journal of Theoretical Politics 9:5–12. Hausman, Jerry A., and David A. Wise. 1978. “A Conditional Probit Model for Qualitative Choice: Discrete Decisions Recognizing Interdependence and Heterogeneous Preferences.” Econometrica 46(2):403–426. Hinich, Melvin J., and James M. Enelow. 1984. “The Spatial Theory of Voting: An Introduction.” Cambridge: Cambridge Univerisity Press, 1984. Hinich, Melvin J., and James M. Enelow. 1989. “The Location of American Presidential Candidates: An Empirical Test of a New Spatial Model of Elections.” Mathmatical Computer Modelling 12(4):461–470. Hotelling, Harold. “Stability in Competition.” Economic Journal 39(March):41–57. Iversen, Torben. 1994a. “The Logics of Electoral Politics: Spatial, Directional, and Mobilization Effects.” Comparative Political Studies. 27:196–210. Iversen, Torben. 1994b. “Political Leadership and Representation in Western European Democracies: A Test of the Three Models of Voting.” American Journal of Political Science 38:46–74. Kaiser, Diane Sue. 1984. A Critique of Voting Behavior Research as a Form of Explanation. Ph.D. dissertation. Madison: University of Wisconsin. Kirman, Alan P. 1992. “Whom or What Does the Representative Individual Represent.” Journal of Economic Perspectives 6(2):117–136. Kramer, Jorgen, and Hans Rattinger. 1997. “The Proximity and the Directional Theories of Issue Voting: Comparative Results for the U.S. and Germany.” European Journal of Political Science 32:1–29. Listhaug, Ola, Stuart Elaine Macdonald, and George Rabinowitz. 1990. “A Comparative Spatial Analysis of European Party Systems.” Scandinavian Political Studies 13:327–354. Directional vs. Proximity Voting 33 Listhaug, Ola, Stuart Elaine Macdonald, and George Rabinowitz. 1991. “The Role of Issues in Elections: Voting Decisions in Norway and the United States.” Presented at the Annual Meetings of the American Political Science Association, Washington, DC. Listhaug, Ola, Stuart Elaine Macdonald, and George Rabinowitz. 1994. “Ideology and Political Support in Comparative Perspective.” European Journal of Political Science 25:111–149. Macdonald, Stuart Elaine, and George Rabinowitz. 1993a. “Direction and Uncertainty in a Model of Issue Voting.” Journal of Theoretical Politics 5:61–87. Macdonald, Stuart Elaine, and George Rabinowitz. 1993b. “Ideology and Candidate Evaluation.” Public Choice 76:59–78. Macdonald, Stuart Elaine, Ola Listhaug, and George Rabinowitz. 1991. “Issues and Party Support in Multiparty Systems.” American Journal of Political Science 85:1107–1131. Macdonald, Stuart Elaine, George Rabinowitz, and Ola Listhaug. 1995a. “Issue Competition and Multiparty Politics: Insights from the 1993 Norwegian National Election.” Presented at the Annual Meetings of the American Political Science Association, Chicago, IL. Macdonald, Stuart Elaine, George Rabinowitz, and Ola Listhaug. 1995b. “Political Sophistication and Models of Issue Voting.” British Journal of Political Science 25:453–483. Macdonald, Stuart Elaine, George Rabinowitz, and Ola Listhaug. 1998. “On Attempting to Rehabilitate the Proximity Model: Sometimes the Patient Just Can’t be Helped.” Journal of Politics 60:653–690. McKelvey, Richard D. 1976. “Intransitivities in Multidimensional Voting Models and Some Implications for Agenda Control.” Journal of Economic Theory 12:472–482. Merrill, Samuel. 1992. “An Empirical Test of the Proximity and Directional Models of Spatial Competition: Voting in Norway and Sweden.” Presented at the First Meeting of the of Society for Social Choice and Welfare, Caen, France. Merrill, Samuel. 1993. “Voting Behavior Under the Directional Spatial Model of Electoral Competition.” Public Choice 77:739–756. Merrill, Samuel. 1995. “Discriminating Between the Direction and Proximity Spatial Model of Electoral Competition.” Electoral Studies 14:273–287. Merrill, Samuel, and Bernard Grofman. 1997. “Directional and Proximity Models of Voter Utility and Choice: A New Synthesis and an Illustrative Test of Competing Models.” Journal of Theoretical Politics. 9:25–48. Merrill, Samuel, Bernard Grofman, and Scott Feld. 1996. “Nash Equilibrium Strategies in Directional Models of Two-Candidate Spatial Competition.” Presented at the Annual Meeting of the Public Choice Society, Houston TX. Pierce, Roy. 1993. “Directional Versus Proximity: A Second Opinion.” Presented at the Annual Meetings of the American Political Science Association, Washington, DC. Pierce, Roy. 1995. “Directional Versus Proximity Models of Voter-Candidate Issue Linkages in France and the United States,” Typescript. Ann Arbor: Center for Political Studies, University of Michigan. Pierce, Roy. 1997. “Directional Versus Proximity Models: Verisimilitude as the Criterion.” Journal of Theoretical Politics. 9(January):61–74. Platt, Glenn, Keith T. Poole, and Howard Rosenthal. 1992. “Directional and Euclidean Theories of Voting Behavior: A Legislative Comparison.” Legislative Studies Quarterly 17:561–572. Poole, Keith T., and Howard Rosenthal. 1985. “A Spatial Model for Legislative Roll-Call Analysis.” American Journal of Political Science 29(2):373–399. Rabinowitz, George, and Stuart Elaine Macdonald. 1989. “A Directional Theory of Voting.” American Political Science Review 83:93–121. Rabinowitz, George, Stuart Elaine Macdonald, and Ola Listhaug. 1991. “New Player in an Old Game: Party Strategy in Multiparty Systems.” Comparative Political Studies 4:147–185. Rabinowitz, George, Stuart Elaine Macdonald, and Ola Listhaug. 1993. “Competing Theories of Issue Voting: Is Discounting the Explanation?” Presented at the Annual Meetings of the Political Science Association, Chicago, IL. Shaffer, William. 1994. “A Congruence Model of Issue Voting.” Presented at the Annual Meetings of the American Political Science Association, Chicago, IL. Shepsle, Kenneth, and Barry Weingast. 1981. “Structure-Induced Equilibrium and Legislative Choice.” Public Choice 37:503–519. Westholm, Anders. 1997. “Distance Versus Direction: The Illusory Defeat of the Proximity Theory of Electoral Choice.” American Political Science Review 91(4):865–885.