Academia.eduAcademia.edu

On the status of Savage’s Postulates

This essay will focus on assessing the axioms, or rather postulates as they are called, of Savage's (1954) representation theorem for the Subjective Expected Utility theory. What his representation theorem achieves is to prove the existence of an expected utility representation of the preferences of a rational agent. Getting clear about the status of 1 In the course of this essay we will use the notions of axiom, postulate and principle interchangeably. 2 In more mathematical terms, a representation theorem for a specific class of mathematical 0

On the status of Savage’s Postulates Author: Matthias Würtenberger Abstract This essay will focus on assessing the axioms, or rather postulates as they are called, of Savage’s (1954) representation theorem for the Subjective Expected Utility theory. What his representation theorem achieves is to prove the existence of an expected utility representation of the preferences of a rational agent. Getting clear about the status of Savage’s axioms is of high relevance when one considers the overall project of the theory of Subjective Expected Utility. It is essentially a normative theory, spelling out how choices in the face of uncertainty are ought to be made. Choices among actions should maximise expected utility. Which action to choose from in an uncertain environment or action space, is determined not solely by the consequence of an action but also by the likelihood of the state of the world in which the consequence might be realised. Building upon the works of von Neumann & Morgenstern [1944] who showed that probabilities can be used to construct a utility measure on consequences such that preferences amongst gambles cohere with their expected utilities, Savage’s main achievement is determining conditions which are jointly sufficient to establish the existence of a unique probability representation of a rational agent’s beliefs. These conditions are Savage’s postulates and they represent certain constraints imposed upon preferences among acts. In case all these constraints are met by an agent’s preferences, he is said to be rational. The structure of our essay is as follows: first of all Savage’s strategy in setting up his representation theorem will be discussed. Here the postulates figuring in his representation theorem will be critically assessed. In assessing them we will be discussing which of his axioms represent genuine requirements of rationality and which are mere ‘structure’ axioms motivated by mathematical or theoretical concerns [Suppes (2002)]. The aim is to give an overview of the different reconstructions and discussions, most notably the accounts of Broome [1991], Fishburn [1970], and Joyce [1999]. Next, once their status is approximately settled, will be a critical discussion of all the candidates for rationality axioms. Here the aim is to contrast the arguments for and against each particular rationality-axiom. This is not to repeat the discussion about whether these can be taken to be requirements of rationality or not, this issue is taken for granted at this stage, but rather whether each rationality-axiom under discussion should be taken as a requirement of a theory of practical rationality. CONTENTS Contents 1 Savage’s strategy 0 2 The Axioms - their roles and status 4 3 Rationality Axioms? 3.1 Transitivity . . . . . . . . . . 3.2 Completeness . . . . . . . . . 3.3 Allais’ Paradox & Separability 3.4 State-Independence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 12 15 19 21 4 Conclusion 23 References 24 1 Savage’s strategy The main point of decision theory is to make clear the nature of meansends constraints. More specifically, it specifies the connection between an individual’s preferences about ends and her preferences about means that rationality requires. The aim of Savage, or any decision theory, is to construct an intelligible axiomatisation of a theory of preferences. If successful, such an axiomatisation would achieve to represent the preferences of rational agents. In doing so one is confronted with, what Jeffrey [1965:144] calls, the existence problem. Namely, formulating qualitative conditions on preferences which jointly imply the existence of a quantitative probability- and utility-function representing these preferences. These two kinds of functions merge together, yielding an expected utility function. Savage’s [1954] proof of the existence of such an expected utility representation of preferences involves two major stages. The first one consists in postulating a set of axioms1 that are jointly suffcient to establish the existence of a unique probability representation of the agent’s beliefs. Secondly, it is shown that such probability representations can be used to construct utility measures on outcomes such that preferences amongst lotteries cohere with their expected utilities. Savage’s main achievement in establishing the proof lies in the derivation of a qualitative probability relation over events which is equivalent to the first stage. The construction of an expected utility representation where the probabilities of states are intertwined with the utilities of the consequences has originally been carried out by von Neumann and Morgenstern [1944]. The advancement of Savage’s theory therefore lies in the disentanglement of states’ probabilities from the utilities of their outcomes. Savage constructs a representation theorem for rational action. The strategy here is to go from accepted, local principles of rationality to a global one.2 1 2 In the course of this essay we will use the notions of axiom, postulate and principle interchangeably. In more mathematical terms, a representation theorem for a specific class of mathematical 0 1 Savage’s strategy By local principles are meant certain rational patterns of preference, which (arguably) are Savage’s seven (or eight) postulates. His theorem shows that these postulates are individually necessary and jointly sufficient for an expected utility representation of an individual’s beliefs and desires. The global principle is exactly this, the maximisation of expected utility (hereafter MEU). Therefore this representation theorem is primarily of interest for those who already accept MEU in the first place, commitment to it stands prior to the theory. We will therefore not enter the principal debate about the pros and cons of the doctrine. Nevertheless, there is, vitally, a lot of room for interpreting the MEU. This is because the principle of MEU on its own, as a mere global principle, does not reveal much as to what actions, what preferences among them, are rational. The quintessential question of practical rationality ‘How should I act?’ cannot be satisfactorily answered by that global principle alone. In fact attempting to answer the previous question by saying ‘go and maximise your expected utility’ is close to giving an utterly meaningless answer, since it is not at all clear how one should act in specific cases. And this is where the local principles come into the picture. Not only do these give substance or meaning to the doctrine of MEU - what it demands at the level of individual preferences -, but also give some argument for MEU’s plausibility. The principle of MEU rises and falls with the strength and plausibility of its local principles. It is therefore of upmost importance that the local principles lend their plausibility to independent reasons. Assessing their individual justification will be the main theme of this essay. Before getting into the interplay of Savage’s postulates, we briefly note some peculiar facts about the ontology of his framework. There is a distinction between acts, states of the world, consequences and events.3 Intuitively one might consider the second and third to effectively be the same - an outcome is just a particular way the world turns out. But in the Savage framework they are, strictly speaking, entities of a separate kind, in so far states of the world do not correspond to outcomes or consequences.4 The conceptual clarifications and distinction made in the remainder of this section, will be relevant throughout the upcoming discussion. As we will see Savage was not always precise and definite about what acts, consequences, and states actually stand for. Sure, they definitely relate to our pre-theoretical understanding of them as used in every day language, but besides that they have not much in common. • Acts: or actions are defined as functions from the set of all possible worlds 3 4 structures, establishes “that every structure in that class is isomorphic to some structure in a distinguished proper subclass. Typically, the subclass of structures is in some sense more ‘concrete’ than the class as a whole.” [Makinson 2007:4] A widely used synonym for consequences, which we will here use interchangeably is outcomes. On the contrary Jeffrey’s [1965] account of decision theory is representation-independent. Meaning his ontology consists only of propositions. For instance, actions are simply propositions that can be made true. 1 1 Savage’s strategy to the set of consequences. Acts are assumed to be the relata of an agent’s preferences. When choosing an act one “decides ‘now’ once and for all; there is nothing for [one] to wait for, because [one’s] decision provides for all contingencies” [Savage 1954:17]. Accordingly, one has to interpret acts as choices between complete courses of future actions. Courses of actions determine everything one will eventually care about. This definition might strike one as being very unintuitive and far-removed from the every-day understanding and usage of the concept ‘act’. The definition gives rise to a particular problem. According to Jeffrey [1965:224] the Savage-acts, or possible acts as he called them, lead to the situation in which one “never knows just what act one is performing; for to know that, one would have to know how the act would turn out in every possible world.” It is easy to see that such acts are virtually impossible to conceive. If taken literally, one could never form a preference on any act, since acts would have to be evaluated with respect to every possible world. One would have to consider every eventuality which might become relevant, be it ever so small. This fact led Jeffrey to conclude that Savage-acts cannot be objects of choice [1965:224]. Joyce [1999:61] on other hand, does not so readily dismiss the account and gives a more benevolent reading of Savage-acts. He tries to secure plausibility by interpreting them “as objects of instrumental desire whose realization may or may not lie within the agent’s control.” Acts are instrumental because they are means to the end, the consequences. In addition, there is another problem coming in through the existence of so called constant acts [Savage 1954:25]. As we will see in due course they are made possible by a controversial postulate, namely P0. The merit of this construction is that through it, preferences not only range over acts but also events. These are special cases of acts with the striking feature of yielding the same consequence in each state of the world - thereby extending preferences to outcomes. It is very unclear whether we ever come to face such acts in real life. One plausible candidate for a constant act, would be choosing to commit suicide via a bullet proof method of say, taking in cyanide poison. In such a limiting case an individual could be sure that his act ‘oral intake of Xmg cyanide poison’ will always lead to the desired consequence, namely his death. Apart from such a case there seem to be none which we could bring about.5 This is one reason why we should be careful to interpret aspects of Savage’s framework too literally without good qualification. • States: are descriptions “of the world leaving no relevant aspect undescribed” [Savage 1954:9]. Crucially they have to contain some object about which one is uncertain about, i.e. what the temperature will be in Munich 5 Moreover, if constant acts were all we could choose from, the point of decision theory would be lost. This is because we could choose outcomes directly, the act guarantees a particular outcome. Maximising ones expected utility would not be possible anymore since actions do not come with risk anymore. Instead one should act so as to maximise non-expected utility by choosing the act which yields the best consequences [Joyce 1999:67]. 2 1 Savage’s strategy tomorrow at noon.6 In addition, they are descriptions of situations in the world over which an individual does not have direct control. The crucial part in this characterisation is what exercising ‘no direct control’ precisely means. In a first approximation this could mean that the probabilities of the states obtaining are independent of the relevant individual - more specifically, independent of her desires and choices. A typical example of a ‘state’, in the context of wanting to make an omelette, could be the circumstance of whether a given egg is good or rotten [Savage 1954:14]. The fact of the matter whether a given egg is rotten or not, should neither depend on ones desire that it is good, nor on ones action to break it into the bowl to the other good eggs. • Outcomes: are the consequences of actions, they describe “anything that might happen to the person” performing the act [Savage 1954:13]. A description of an outcome should encompass all the information relevant to the decision maker.7 In other words (descriptions of) outcomes should be specific to everything an individual cares about with respect to the given choice situation consequences are the set of events we care about. • Events: are sets of states [Savage 1954:10], where each event forms a subset of the set of states. How these sets are partitioned, i.e. what events are relevant depends on the interest of the decision maker. Returning to the omelette example above, one might be interested in the event that one egg out of a dozen is rotten - meaning the event would comprise 12 states where in each state a different egg is rotten. Consequently, an event forms a vector of possible outcomes [Broome 1991:90], i.e. all the outcomes which contain one rotten egg out of the twelve under consideration. In such a case, where the same consequence prevails in every state of the world, acts are functions from the set of events to consequences. As we have observed, Savage’s framework comprises elements which - often in their limiting cases but also in principal - do not have any obvious correlate in ordinary language. We arguably never face constant acts or are able to choose among completely specified courses of actions. We face these restrictions in principal, it is important to realise that this fact is not a result of ones bounded rationality. In the following one should be aware of these special devices - at times it will be advisable to abstract from these peculiarities and refrain to interpret certain aspects literally. We hold that, apart from technical considerations, the main challenge for 6 7 This seemingly proves Arrow’s [1971:45] characterisation of Savage-states as an “encryption of the world so complete that, if true and known, the consequences of every action would be known” wrong. States have to contain some uncertainty. An outcome O is said to be underspecified if there exists “a possible circumstance C such that the agent would prefer having O in the presence of C to having O in C’s absence. Whenever there is such a circumstance O must be replaced by two more specific outcomes O1 = (O&C) and O2 = (O&¬C)” [Joyce 1999:52]. 3 2 The Axioms - their roles and status the plausibility of Savage’s theory consists in giving good reasons for holding that each one of his postulates presents a genuine requirement of rationality. For those that do not stand the test of rationality, the question of how essential they are for the Savage theory to hold, remains. The next section will go through each of the postulates, look at their respective roles in the representation theorem with the aim of finding out which ones can be considered genuine principles of rationality. Once an approximate (interim) division among them has been made, the subsequent discussion will turn to a more in-depth discussion of the (more controversial) rationality postulates. 2 The Axioms - their roles and status The exact presentation and formulation of Savage’s axioms varies considerably in the contemporary literature discussing them. Originally Savage presented seven postulates–P1 up to P7– to which one additional postulate has been added; it was found to be implicit in Savage’s theory. Here we denote it by P0. We will stick to Savage’s ordering of his postulates with the exception of adding P0. Additionally, we will make use of a slightly more up to date notation, which has the advantage of being more concise and handy for the task in hand. This section will go through all postulates and discuss their function. Savage’s original Postulates:8 For all states s, s′ ∈ S and subsets A, B, C; consequences f, g, h ∈ F ; acts which are functions from states to consequences - f, g, h from S to F . The relation is not preferred to between acts . P1 Ordering Principle: The relation  is a simple ordering. P2 Separability/Sure-Thing Principle: For every f, g and B, f  g given B or g  f given B. P3 State Independence: If f (s) = g, f ′ (s) = g ′ for every s ∈ B, and B is not null; then f  f ′ given B, if and only if g  g ′ . P4 Probability Principle: For every A, B, A  B or B  A. P5 Non-Triviality: It is false that for every f, f ′ , f  f ′ —— (alternatively) There is at least one pair of consequences f, f ′ s.t. f ≻ f ′ . P6 Non-Atomicity: Suppose it is false that g  h, then for every f there is a (finite) partition of S s.t, if g ′ agrees with g and h′ agrees with h except on an arbitrary element of the partition, g ′ and h′ being equal to f there, then it will be false that g ′  h or g  h′ . P7 Averaging: If f  g(s) given B for every s ∈ B, then f  g given B. 8 Savage [1954:End papers]. 4 2 The Axioms - their roles and status Suppes [2002:250f.] uses a distinction to categorise axioms of a theory. Which axioms are a requirement of rationality and which ones (solely) play a structural role in the proof of Savage’s representation theorem? Structural axioms make existential demands on the environment of a theory, while not being implied by the existence of an expected utility representation of preference. According to Suppes only P5 and P6 are structural ones, Joyce [1999] adds P0 and part of P19 to the list of structural postulates, these are furthermore not essential, that is necessary, for the existence of a numerical expected utility representation. The former because it makes claims about the richness of the action space. The latter, the completeness requirement, because it makes claims about the judgemental state of the agent. One way to argue for the rationality requirement of completeness is to take Suppes’ condition, of it being implied by the expected utility representation, into account. In view of this condition, completeness seems to be a requirement of rationality. But we would not want to stop here and just accept the validity of Suppes’ argument insofar he takes the theory of expected utility for granted. In fact it is not clear whether he can. As we argued above, since the individual postulates give substantive meaning to MEU, each individual postulate should have independent reasons speaking in its favour. In order to really get behind the question of whether completeness is a requirement of rationality we need some other argument. We will come back to this point below (section 3.2). Aside from a rough, preliminary characterisation in this chapter, we will try to avoid entering the discussion about the alleged rationality axioms. At this point we will rather classify some of the axioms as potential rationality candidates whereas we discard the more obvious uncontroversial cases. Here we will point to some of their general implications. In criticising some of the postulates we will draw on arguments regarding their general plausibility. P0 - The Rectangular Field Assumption Broome [1991:115f.] found the the Rectangular Field Assumption (hereafter RFA) to be implicitly contained in Savage’s formulation of P1. According to the RFA each state of the world is arbitrarily assigned one outcome out of the set of all possible outcomes. Any arbitrary prospect constructed that way figures in an individual’s preference ordering! In other words the RFA defines a preference relation over all functions ( = acts) from states of the world to consequences. The reason Broome coined the term in question is because it defines a product set, and “a product set occupies a rectangle or a series of rectangles in a vector space” [1991:80]. The set of functions coming out of this procedure is a set of very rich acts. Consequently Joyce [1999:83] calls the RFA Act Richness. What it comes down to is that any possible combination of states of the world and consequences is contained in the set of acts. The intuitively 9 Namely the completeness condition. Joyce’s ordering of Savage’s postulates is structured differently to the way we present them here. In the terminology of Joyce P0 is called Act-Richness denoted by SAV0 , the completeness part of P1 is denoted by SAV3 , P5 Nontriviality by SAV1 , and P6 Event Richness is SAV7 [1999:83–95]. 5 2 The Axioms - their roles and status ‘artificial’ ontological distinction between states and consequences gives rise to such very implausible acts.10 An example would be ‘making the world hot (the act) when it is cold (the state of the world)’. Or alternatively, ‘arriving dry at work (act) while being soaked with rainwater (state of the world)’. Could this assumption be a candidate requirement of rationality? On a literal reading it seems that it couldn’t. A principle which gives rise to nonsensical prospects cannot be a plausible requirement of rationality. Moreover, one might object that the the examples presented are blatant contradictions. Making the world hot when it is cold is certainly a physical impossibility, nevertheless in Savage’s framework this presents no logical contradiction. The reason for this is that the set of states is ontologically distinct from the set of consequences. More specifically, our example comprises two distinct worlds - one being hot the other one being cold. In ordinary language there exists no such distinction among domains - there is just one domain or world. Acts like ‘making the world hot when it is cold’ present indeed a logical, conceptual contradiction in ordinary language, though in Savage’s formal language it does not. Nevertheless, only the logician would say that P0 cannot yield logical contradictions. Clearly then the motivation for this assumption is theoretical, and cannot be a genuine requirement of rationality. In fact P0 makes the proof of Savage’s representation theorem easier. This is because the proof for separability (P2) depends on this axiom, via the construction of constant acts. Broome [1991:115] raises attention to another problem implied by the RFA. Consider an outcome-description of loosing in a lottery on behalf of a risk-avers individual. An intuitive description of the outcome ‘End up loosing the lottery’ is: ‘Receive no prize and feel disappointed for not having won.’ Broome’s argument for why such a description is in conflict with Savage’s theory is the following: “The rectangular field assumption says your preference ordering includes all arbitrary prospects. Amongst them is the prospect that leads to this particular outcome for sure. This prospect determines, whatever lottery ticket you draw, that you get no money and also feel disappointment. But this feeling of disappointment is supposed to be one you get as a result of bad luck in the draw. It is hard to see how you could feel it if every ticket in the lottery would lead to the same boring result. So this prospect seems causally impossible, and that may make if doubtful that it will have a place in your preferences.” [Broome 1991:116] What Broome’s argument comes down to is that the (random) attribution of consequences involving risk sentiments to states of the world is not permitted by the RFA, because such risk properties intimately depend on the precise combination of states and consequences involved in an act. Steele [2007:145] argues that Broome11 over-interprets the RFA, thus unfairly targets Savage’s 10 11 The RFA also gives rise to constant acts, as mentioned above. Next to Weirich [1986] 6 2 The Axioms - their roles and status theory with respect to risk- and regret-sentiments. Although the RFA “strongly suggests that the description of outcomes should preclude risk sentiments”, she holds that the principle ought not to be interpreted literally. She gives three reasons to support her claim. First, because the RFA is an idealisation which constitutes an ideal agent’s preference space in such a way that it can be represented by “a continuous utility function (unique up to positive linear transformation) and a corresponding unique probabilistic belief function for an agent.” Consequently one could not assume the existence of such an individual “with such extraordinary discerning powers [. . . ] It is impossible for us ordinary mortals to entertain a complete preference ordering over the infinitely rich option space that Savage’s theory requires.” Although her argument does not directly target the completeness assumption of P1, this attack nevertheless feeds from it. We will postpone a critical discussion of completeness until later on. Does the RFA rise and fall with the completeness part of P1? Is the completeness assumption central to the RFA? Not necessarily so, since one of its implausible features comes in through the random, arbitrary assignment of outcomes to states of nature. To us, that seems to be the main factor which makes P0 such a strange thing. Nevertheless it might be argued that completeness is needed, because if it fails what determines which arbitrary combinations of outcomes to states go into one’s incomplete preference relation? Second, she points the finger at the above mentioned physical (and conceptual) impossibility of some Savage-acts. “The actual world constrains the set of actions that any agent, ideal or otherwise, is able to carry out. Just because we can conceive of an abstract map from states to some combination of outcomes doesn’t mean that the act in question is, will be, or ever was, a viable possibility in the actual world.” Third, she adds that many state-consequence combinations will also be “outright contradictory”. This point was already discussed above. For her it is not clear why Broome invokes causal impossibility as an insurmountable obstacle for incorporating risk sentiments into outcomes. Given the ideal nature of the RFA, she concludes that “any attempt to draw from it concrete conclusions about the contents of act outcomes is questionable.” [Steele 2007:145f.]. We will keep her advice in mind. P1 - Ordering This postulate defines the basic properties of the preference relation of a rational decision maker should display [Savage 1954:19]. Savage’s characterisation of the preference relation can be described to involve two parts. Defining a partial order, which is a binary relation  over the set of acts A, yields transitivity, i.e. such that for three acts a, b, c if a ≻ b and b ≻ c then it should be the case that a ≻ c.12 To give an example if I prefer an apple to a banana, and a pineapple 12 A partial order also generates reflexivity - a  a - and antisymmetry - if a  b and b  a then a = b. 7 2 The Axioms - their roles and status to an apple I should therefore prefer a pineapple to a banana. As we will see further below, intransitive preferences expose one to be taken advantage of, hence intransitivity is seen as a requirement of rationality. This constitutes the first part of P1. The second part consists in adding completeness to the properties of the preference relation which generates a complete ordering. It effectively implies that a decision maker has to have a definite preference over every act a in the set of acts A. Joyce [1999:84] argues that this constraint is not strictly required for the existence of an expected utility representation. Furthermore, he argues that it would even be unreasonable to place this requirement on rational agents. While Joyce may see the completeness assumption as not even controversial, we take note of his arguments but will discuss both it and transitivity more extensively in the next chapter (section 3.1). Our motivation for doing so is twofold, first and trivially the lengthy discussion is better placed in a separate section, second because completeness seems to be assumed to be a requirement of rationality by many authors.13 This fact renders completeness, contrary to Joyce, controversial and therefore demands careful examination. P2 - Separability, Independence or the Sure-Thing principle The preference relation is further built up such that it is separable across events. In other words, one’s desirability14 of the consequence of one act in one state of the world should be independent of the desirability of its consequences in any other state of world. It entails that “on no account, should preferences among consequences be modified by the discovery of which event obtains” [Savage 1954:32]. In other words, rational preference among acts should not be sensitive towards situations where these yield identical outcomes. Joyce [1999:86] puts it “that a rational agent should make judgments about the relative desirabilities of acts by treating their common outcomes as “dummy variables” whose effects “cancel out” in her deliberations.” In discussing Allais’ Paradox below (section 3.3) we will see a particular application of how this cancelling out is supposed to work. As P2 is thought to form the core of Savage’s theory, it is fiercely defended as a requirement of rationality. But more about that later on. The upshot of the separability axiom is that, jointly with P0, P1 and a theorem about conjoint additive structures, it implies that there exists an additive utility representation of preferences over acts that is unique up to positive affine transformation. Put differently, the value of each act is the sum of the state-dependent utilities of its consequences. 13 14 Such as Binmore [2009] and any subscriber to revealed preference theory. Meaning the utility one assigns to an outcome. 8 2 The Axioms - their roles and status P3 - State-Independence This postulate ensures the ordinal comparability of state-dependent utilities [Savage 1954:26f.]. In order to make cardinal comparability possible, P3 would have to be stronger. The source of this lies at the fact that utility is interpreted ordinal. Fishburn [1970:193] describes state independence as a companion to P2, which states if an act f yields consequence x, and g yields y given an event E where the probability P r(E) 6= 0, then act f is preferred to g given E if and only if act f ′ is preferred to g ′ when f ′ yields x and g ′ yields y. P3 achieves to establish a correspondence between preferences on consequences (through constant acts) and conditional preferences on events whose obtaining is regarded as possible, ie. non-null. Joyce [1999:87] sees P3 as some form of noncontextuality constraint on an individual’s beliefs and desirabilities. It is constraining in so far ones judgements about the probabilities of states or desirabilities of outcomes are not permitted be dependent on what act happens to be true. This makes it possible that a preference relation is effectively extended to consequences in terms of the relation among acts. P1 up to P3 determine such a fundamental preference relation between acts. Although most authors in the literature see P3 as an obvious requirement of rationality, its status was prominently challenged by Aumann [1971]. We will look at his critique in the next chapter (section 3.4). P4 - Probability Principle P4 introduces a new relation, namely a probability relation with states of the world as its relata [Savage 1954:33]. The binary relation, which we denote by ☎, should be read as ‘more probable than’. Since it is defined on the set of states (or events), it serves to make a probability representation of an agent’s attitude to states possible, e.g. how probable an agent judges the event E ‘It is raining in Munich at 12 am tomorrow’ in contrast to ¬E ‘It is not raining in Munich at 12 am tomorrow’. In order to fully achieve this, Savage constructs certain circumstances which are meant to provide a test for when one event is judged to be more (or less, or equally) probable than another. It is important to note that the notion of probability being used here is a purely subjective one. Savage is not concerned about determining the objective probability of it raining in Munich tomorrow at 12 am, but individuals’ probability. The prescriptive part of P4 is that individuals should prefer the alternatives that are more likely to yield the more desirable outcomes. In other words, given two alternatives with equal desirabilities any rational individual should prefer the more probable one. Once this method of determining ones credence in a state’s obtaining is established, P4 ensures that any two events can be compared to each other with respect to their relative chance of obtaining. One property of the probability relation, which is implicitly contained in this procedure is the completeness 9 2 The Axioms - their roles and status condition. Only with it, all events are comparable.15 The probability measure also needs to be transitive and quasi-additive. The latter property is one of ratios and absolute scales. The latter is needed to establish the link to (mathematical) probability. All these properties, which define a weak order, are sufficient in order to apply a representation theorem for probability [Fishburn 1970:193]. P4 is generally assumed to be a requirement of rationality, and quite uncontroversially so. If satisfied it guarantees coherence among ones beliefs. Since it acts as a constraint on the relation between beliefs and preferences, and not so much on rational preference, we will not discuss it in further detail here. P5 - Non-Triviality P5 demands that agents are not indifferent between at least one pair of (constant) acts [Savage 1954:17]. This postulate is commonly accepted to be purely structural since it, next to P6, is solely needed to ensure that the probability measure ☎ can be represented numerically. It is also required for it to be unique. If P5 were not satisfied then the probability relation would be reflexive which is needed for mathematical reasons [Fishburn 1970:cf.193]. Further evidence that P5 is a structural axiom comes from the fact it can easily be conceived to fail without there being some irrational act implied by doing so. Joyce [1999:84] argues that there are plenty of acts between which one can justifiably be indifferent. Imagine that a selection of outcomes are equally desirable to an agent, i.e. whether having a beer or a wine for breakfast. In this case there is just no decision to make, because both drinks, being alcoholic, seem equally detestable in the morning. P4 and P5 taken together, make it possible that an agent’s belief in the chance of a state’s obtaining can be extrapolated by offering him prizes in a specific way. Together they lead to the introduction of a notion of qualitative personal probability. It is qualitative in so far it signifies what it means for an individual to consider one event more probable than another in terms of the preference relation among acts already introduced. P6 - Non-Atomicity P6 allows the set of states to be partitioned as finely as one wants [Savage 1954:38f.]. Accordingly, there do not exists atomistic, fundamental descriptions of states. It is always permitted to add further details to descriptions of states. Only now can the notion ‘(no) more probable than’ be connected quantitatively with mathematical probability. This postulate has one far-reaching implication; it asserts that there exist no consequences which are so (un)desirable such that these could overpower the 15 In principle, similar objections as the ones raised above apply to completeness in the context of probability assignments to states and events. However we find that it does not have such problematic implications as its (utility-) counterpart. 10 2 The Axioms - their roles and status Acts\States Believe in God Not believe in God God exists Eternal life (∞) A bad situation (z) God does not exist Finite & deluded life (x) Assumed status quo (y) Table 1: Pascal’s Wager (im)possibility of any given event A. Consider the example of Pascal’s Wager, standing proxy for the type of scenarios non-atomicity is directed against.16 Consider an individual, Mr. Jones, who faces the decision whether to start believing in God or remain an atheist. His state-consequence matrix expressing the decision problem is given below, the desirabilities of each outcome are given in brackets, where x and y are of finite values and z is either finite or ∞. In addition, say Mr. Jones takes the probability of God’s existence to be one in a million (Pr(∃God) = 0,000001), it follows that the chance of there being no God is Pr(¬∃God) = 1 - Pr(∃God) = 0,999999, respectively. How should he decide? In this case where non-atomicity fails, he is advised to believe in God irrespective of how small the probability of his existence might be - the probability of one in trillion would yield the same recommendation. This is because in case God indeed exists, this consequence is infinitely more desirable to the individual than anything else which could possibly happen. Additionally, if z is interpreted as −∞ as Pascal in his original argument did, then not believing in God while God exists has infinite undesirability which seems to give further support to Pascal’s recommendation. The reason why such a categorical recommendation takes place, is because infinite (un)desirabilities simply wash away the probabilities. Probabilities are thereby made useless in deliberating what to do. The introduction of non-atomicity ensures that exactly this will not happen - no consequence or prospect is such that it could be infinitely (un)desirable. This is called the Archimedean condition which was first introduced by von Neumann & Morgenstern [1944] in their representation theorem for utility [Fishburn 1970:cf.194]. When the probability relation ☎ is defined upon the basis of the preference relation , P1 up to P6 imply the existence of probability measure on the set of states S that satisfies A ☎ B if and only if P r(A) ≻ P r(B) for all A, B ∈ S where A and B are events. That equivalence constitutes the essence of the representation theorem. The postulates are jointly sufficient to obtain an expected utility representation of preferences, but just for those acts in F that assign a finite number of consequences to all the states in some event A. To cover the infinite case the next postulate is still needed. 16 The example is taken from Jeffrey [1974:12f.]. 11 3 Rationality Axioms? P7 - Averaging P7 ensures that Savage’s representation theorem holds for all acts, including the ones that assign infinite number of consequence to all states [Savage 1954:77]. It is therefore an extension of the expected utility representation to infinite sets of consequences. Averaging also ensures that any desirability u on the set of consequences X is bounded [Fishburn 1970:194]. This postulate is a generalisation of the Sure-Thing Principle to cover infinite wagers. Fishburn [1970:193] argues that this generalised dominance condition, is not required for the derivation of a unique probability measure on the set of all subsets of S, the set of states. This completes our overview of Savage’s postulates. Candidates which are quite certain to be structure axioms are P0 RFA, P5 Non-triviality, P6 Non-atomicity. There is hardly any disagreement about this in the contemporary literature. This leaves us to discuss the remaining ones in more detail - these are P1a.) Transitivity, P1b.) Completeness, P2 Separability, P3 State-Independence. We will not discuss P4 Stochastic dominance and P7 Averaging. The latter because it is just a form of P2. P4 is excluded of the discussion because it does not directly constrain rational preferences. 3 Rationality Axioms? Each of the axioms under consideration in this section has some arguments speaking for being a constraint on rational choice. Not surprisingly for each there also exists arguments pertaining to the opposite. The status of the following arguments is a matter of hot dispute. Each of the following subsections will be concerned with the discussion of one. 3.1 Transitivity Transitivity is a property of a binary relation - the preference relation. Being so defined the relation is a partial order which is a desirable property from a mathematical or theoretical standpoint, since it makes things easier in proving the representation theorem. Besides, what might be the motivation arguing from rationality grounds for transitivity? Any argument attempting to do so has to give a plausible account of why intransitive preferences are undesirable from a rational point of view. It must show that there is some kind of defect in preferences exhibiting intransitivity. There exist two major arguments in the literature contending to establish that transitivity of preferences is a genuine requirement of rationality. These are the Dutch-Book argument and the Money-Pump argument. Although they are similar in structure both address the problem in slightly different ways. Here we will only be concerned with the latter, primarily because Dutch-Book arguments are mainly concerned with the (ir)rationality of probability assignments, a topic not at the centre of our debate. 12 3 Rationality Axioms? The money-pump argument, first established by Davidson et al. [1955], tries to show that an individual exhibiting an intransitive preference relation is liable to give up some of his wealth for no reward. It has the form of a reductio ad absurdum argument. It holds for any intransitive set of binary strict preferences, i.e. x ≻ y, y ≻ z, and z ≻ x. Suppose that an individual Mr. Smith is offered a choice among three different jobs, a, b and c: “He can be a full professor with a salary of $5,000 (alternative a), an associate professor at $5,500 (alternative b) or an assistant professor at $6,000 (alternative c). Mr. [Smith] reasons as follows: [a ≻ b] since the advantage in kudos outweighs the small difference in salary; [b ≻ c] for the same reason; [c ≻ a] since the difference in salary is now enough to outweigh a matter of rank.” [Davidson et al. 1955:145] Is this an irrational set of preferences? The reasons in favour of each pair appear to be plausible. Nevertheless, Davidson et al. argue that “the reasons can never be good enough to justify acceptance of such a set of preferences.” According to them, an intuitive principle of rational choice commands that an alternative should only then be chosen, if nothing else is preferred to it. This principle cannot possibly prescribe a rational choice in the context of Mr. Smith’s decision problem, because his preferences imply that no matter which job he chooses there will always be another job which he prefers to it! This generates the money-pump. To fully see this, imagine that Mr. Smith randomly settles with the assistant professorship, option c. Since he strictly prefers b to c, offering him to switch to b for a payment of a small amount of money, say 1$17 , will indeed result in him accepting the offer, pay 1$ and obtain the associate professorship. Now, since he strictly prefers a to b, a similar offer to switch to a for the price of 1$ will again result in him accepting the offer and pay 1$ to obtain the full professorship. Strictly preferring c to a, again a similar offer can be put forth, which he will accept. At this point Mr. Smith finds himself at his assumed status quo but with 3$ poorer. It is easy to see that this argument can be carried out indefinitely for any number of n-rounds. The conclusion is that Mr. Smith got money pumped which is a undesirable thing to expose oneself to. This behaviour is said to be irrational because one is worse-off than before without having gained anything. Therefore one should have transitive preferences. Let us consider another violation of transitive preferences taken from Broome [1999:70ff.]. Suppose Mr. Wilson is about to decide what to do for his vacation. He is faced with three alternatives, (R) to make a trip to Rome, (M ) to go mountaineering, and (H) to stay a home. Maurice prefers making the trip to Rome to go mountaineering (R ≻ M ) because he is afraid of heights, prefers 17 The exact amount does not matter; because of Mr. Smith’s preference of b over c there must some token of exchange for whose trade he will be willing to switch - may it only be such a small amount as 0,01$. 13 3 Rationality Axioms? staying at home to the trip to Rome (H ≻ R) because old roman ruins bore him, and prefers to go mountaineering to a stay at home (M ≻ H) because he judges avoiding a hiking tour coward-like when nothing else is there to do. Mr. Wilson exhibits intransitive preferences among the pairs of alternatives R, M , and H, thus is liable to getting money pumped over the alternatives. Is there a way to justify or rationalise this instance to intransitivity? Similar to Mr. Jones, Mr. Wilson’s preferences can be said to be menu dependent, meaning actual preferences depend on what alternatives are on offer. By itself this does not do much to avoid the charge of irrationality, but what about the reasons at play? Once again the reasons supplied for each individual preference seem plausible. Whereas Davidson et al. argued that no reason could ever be good enough to justify a violation of transitivity, one reason in this example might stand the test. The following argument has two parts. First, Mr. Wilson could argue that he did not choose among three but four alternatives. Instead of preferring a stay at home over a trip to Rome, Mr. Wilson would choose a stay at home when a trip to Rome was the only other option available - formally Hr instead of just H, yielding Hr ≻ R. On a similar move he re-describes staying at home when mountaineering was the only other option available - denoted by Hm , yielding M ≻ Hm .This is what Broome calls fine individuation of outcomes or alternatives, intransitivity and therefore threat of irrationality vanishes. Staying at home when Rome was the only alternative is thereby different from staying at home when both Rome and mountaineering are available. The general problem of the practice, is that fine individuation renders the requirement of transitivity vacuous, since by it any violation whatsoever could be rationalised!18 This brings us to the second part of the argument. As we stand, some principle according to which the right to refine outcomes is permitted in particular cases would be very desirable. This principle should also counteract Davidson’s et al. argument. Brome [1991:103] suggest two principles: the principle of individuation by justifiers according to which outcomes should be distinguished as different if and only if they differ in a way that makes it rational to have a preference between them. Where a justifier is the difference between two outcomes that makes it rational to have a preference among them. The second principle put forth, is one of rational requirement of indifference, which does not restrict individuation of outcomes, but demands indifference instead. Whereas Broome prefers the latter, we are more inclined towards the former. In the context of Mr. Wilson, the avoidance of cowardice can figure as a justifier. Irrespective of what one might think the truth about cowardice is, if 18 Broome [1999:71] argues that transitivity still has some restrictive power, captured in his distinction between practical and nonpratical preferences. Practical preferences only hold among two prospects if and only if the prospects are such that an individual could have a choice between them. Non-practical preferences are such that no actual choice among them is possible, and these are constrained by transitivity - i.e. because Hr ≻ R and R ≻ M it follows Hr ≻ M meaning Mr. Wilson ought to prefer a stay at home over Rome when mountaineering was the only alternative. This is obviously not a pair of alternatives Mr. Wilson could in any way choose from. 14 3 Rationality Axioms? Mr. Wilson feels he can preserve his dignity through going mountaineering, then this constitutes a sufficient reasons to justify his preference.19 Cowardice, or avoidance thereof, constitutes a relevant difference, or powerful enough reason contrary to Davidson et al., so as to justify fine-individuation on behalf of Mr. Wilson. On a more superficial, non-intentional description of his preferences, transitivity is violated but, as we argue, permissibly so.20 3.2 Completeness As was observed above, the completeness condition on the preference relation is a property of an ordering. An individuals preference relation is said to be complete if and only if for every pair of alternatives, either she prefers one over the other or is indifferent between them. The first feeds from philosophy of science considerations. Broome [1991:92] argues that the completeness axiom serves mainly a representational purpose. Joyce [1999:84] states that it is not even required for an expected utility representation. One way to achieve representability requires the relation to be an ordering, i.e. to be transitive, reflexive, and complete. But representability can also be achieved by using a partial order, with the drawback that it makes the proof of any representation theorem much harder from a mathematical point of view. This fact doesn’t just apply to Savage’s case, but for instance also to the theory of von Neuman & Morgenstern [1944] which also assumes completeness. On top of that Bradley [2012:14f.] observes that representation theorems without completeness do not achieve as much. This pragmatic consideration reveals that on grounds of mathematical expediency the completeness axiom has a lot going for it. How much leverage as a requirement of rationality does completeness have? The theory in support of this axiom goes under the name of revealed preference theory (hereafter RTP). RTP identifies preference with (observed) choice, this philosophical assumption is the driving force behind the argument in favour of completeness. Mandler [2001:15] identifies the choice definition of preferences as the orthodox understanding of preferences in economics, so does Binmore [2009] who at the same time is an advocate of RTP. According to the theory what is meant by having a preference for alternative x over alternative y, is just to choose x over y when both alternatives are available. Any description adhering to intentional, psychological factors is explicitly avoided [Binmore 2009:8f.]. The case for completeness is as follows - given any two alternatives, one constructs a forced choice. This scenario will always yield some positive result from an observational standpoint, because even not making a choice amongst two alternatives is making a choice. When observing that an individual has made no choice in a given decision situation, all that the theory can attribute to the decision maker is indifference. Or alternatively, if the situation is set up such that an individual starts off with an alternative x - the status quo, making 19 20 On the assumption that this issue can be avoided in R ≻ M . For a discussion of the money pump in sequential choice situations with ‘sophisticated’ choice, see McClennen [1990:89ff.] and Anand [2009:166ff.] 15 3 Rationality Axioms? no choice will mean preferring x. All it can attribute is either preferring one alternative over another or indifference - tertium non datur. By definition all individuals inevitably make some choice even though they might not do or want to do so, hence preferences are automatically complete. Irrespective of concerns whether this strikes one as a satisfactory approach to explain human behaviour - the theory is positivistic and behaviourist in spirit - issues we cannot discuss at any length in the present paper, the question remains whether this argument makes a convincing case for completeness to be a requirement of rationality? Suppose one had incomplete preferences among two alternatives - what could this mean and how could it be justified? Does it seem irrational not to have formed a preference over some alternatives? A first argument would hold no, one might not have have formed a preference over some alternatives due to time constraints, or mere lack of interest. Consider the situation of choosing one can of beans out of a bunch in a supermarket. Is it really a requirement of rationality to have made up ones mind about which can to choose? Maybe one of the cans contains two more beans than another, but should we even bother? In order to get closer to an answer, consider the case of Superia, a computationally unbounded being [Morton 1991:39f.]. Due to her extraordinary abilities Superia knows for each possible world - which is a completely detailed specification of how the things could be - how much she wants it - assigns precise utility to it -, and how likely she thinks that the real world will be like that at some point in the future - assigns a probability to it. She is also able to state the exact position of each possible world in her preference ranking. In order to determine her desirability of an alternative, she simply goes through all the possible worlds in which this alternative obtains, to form a sum of them where each is weighted by its individual chance of obtaining. All that is required in order to carry out this task on behalf of Superia, is speed and memory. Morton [1991:41] argues that, while this scenario might be possible for an unbounded being like Superia, the trouble for us ordinary mortals is that we do not and cannot have attitudes to whole possible worlds. We simply lack the time and, more crucially, the computational resources to imagine these. We are computationally bounded, where the meaning of boundedness as used by this example does not include rational boundedness! Assuming the latter whilst rejecting completeness on grounds of an argument which uses this, would prove self-defeating for a normative assessment of the axiom. How would a decision theory look like which takes into account the computational limitations we face? Morton [1991:43] observes an interesting link between complexity and incommensurability of alternatives. “[A] model that accommodates our limited capacity to handle information is likely to generate incomparable preferences for complex situations even given comparable preferences for simple ones”.21 Whereas this argument sees the locus of incom21 Morton’s strategy to overcome the incommensurability problem, which we will not consider here, is by assuming a fundamental, preference underlying, basic value relation. It contains ones fundamental preferences over general aspects or qualities. 16 3 Rationality Axioms? mensurability at the level of individuals, a second argument discerns it in the world. The following critique of the completeness axiom describes it aptly: “However, it is inevitable that people, if put in a position of having to, can make choices between incommensurable alternatives. Given that, the choice-value principle translates their choices into valuations; that is its function. [...] When two alternatives are incommensurable, they are not made commensurable by the mere fact that people can choose between them.” [Broome 1978:332] The argument is that there exist certain alternatives, goods in the world which just are incomparable such that no weighing is possible. Interestingly the incomparability is argued not be a feature deriving from individuals’ assessment. In order to get a tighter grip on this phenomenon Temkin [2012:176ff.] defines the relation of rough comparability which holds among two outcomes/alternatives, if one alternative is neither better than, worse than or equal to another. More formally - there exists alternatives x and y such that x  y, y  x and x ≁ y. Accepting the possibility and justifiability of the relation implies rejecting completeness.22 Is it possible that reasons in favour of one alternative are incommensurable with the reasons in favour of another? Such that no balancing is possible at all? Broome [1978:332] considers a positive answer to this question. If ones assumes that any thing can be compared to any other thing than it is like trying to “measure a good of one sort, say the removal of pain, against another, say money, as a scale. It turns out to be like measuring the brilliance of a painting with a light-meter.” Would we be satisfied with a uniform approach to measurement? Surely there is a limit to the scope of this method! After all it is not plausible to assess pain in terms of money. Nor is it plausible for many other ‘goods’ or alternatives such as freedom of speech, equality or even careers - imagine the choice between either becoming a professional musician or to take up a career in the political sector. 22 To make a failure of completeness even more plausible, we point to an argument from Mandler [2001:21] who argues that the incompleteness problem is not intrinsically tied to normativity. Incomplete preferences are most likely when multi-dimensional aspects of decision problems are considered. One option is to reduce complexity through focusing on one aspect of a decision. While such a reduction enables a ranking and hence re-establishes completeness, it needs a further argument to support the focus on specific, say, hedonistic, criteria. This is because the reduction implies that the specific focus taken is most important or relevant in deciding what to do. This move, if generalised, is questionable in the case of hedonism. Having to rank any act (or consequence) in terms of pain and pleasure does not proves to be intelligible. On the other hand, if this reduction is not carried out there is another alternative. As early proponents as von Neumann and Morgenstern [1944:29] recognised the possibility of incomplete utility functions. They stipulated that the absence of the completeness axiom would yield “a many-dimensional concept of utility”. Failure of completeness for decision under risk, a special case of Savage where the probabilities of states of the world are typically not given exogeneously, are studied under the name of non-archimedean utility representations. These are representations of preference orderings by means of utility functions whose range is a lexicographically ordered vector space [Herzberg 2009:8]. 17 3 Rationality Axioms? A third argument is provided by Mandler [2001:16f.] who gives a sociological explanation about the existence of the completeness axiom. In the course of decision theory the problems with which theorists have been concerned became more and more complex. Whereas completeness might be have been justified vis a vis the early comparably straight forward decision situations, that is not so anymore. He argues that currently, the cases under discussion involve too many aspects. Mandler [2001:20] mentions procedural issues and symbolic importance. Does it, or should it for instance make a difference whether a certain outcome is determined by a democratic decision or a paternalistic one? Should it matter in cases where the outcome is the same? Considerations like these certainly go beyond the scope of this essay but highlight the complexity of the matter surrounding the completeness condition. Fourth, incompleteness of preferences might be justified due to informational issues. It is easy to conceive situations where one expects to receive relevant information in the near future and consequently postpones judgement. One might want to argue that in such cases it would be wrong to reach a judgement. Temkin [2012:240] expresses this concern in his Reflection Principle: “If, on reflection, I know that at some point in the future I’ll have more knowledge than I currently have, and I now know that given that future knowledge it will be reasonable to assess two prospects in a certain way, then it is now reasonable for me to assess the two prospects in that way.” Thus postponing ones judgement is permissible if one knows, or to weaken the condition, is justified to believe that one will receive relevant information.23 We want to conclude this section and also the previous one in considering an interesting point raised by Mandler [2001:15]. He argues that transitivity and completeness are mutually exclusive properties among any account of rational choice has to choose from. The main reason for this fact is conceptual. Arguments for transitivity, such as the money pump, assume a welfare definition of preferences. Examples like it show that preferences if interpreted as rational welfare judgments should adhere to transitivity. As we have seen, on the same interpretation of preferences there are very strong arguments for them to be incomplete in principal. But what about the case where preferences are understood as choice? Is transitivity still a requirement of rationality? Consider again an intransitive set of preferences among the triple a, b and c as presented above. Starting with c one agrees to switch to b and then to a. Mandler interprets the (slightly weaker) relation c  a as merely saying that ‘an agent will accept z when x is available’. Under the choice definition of preference there are no resources within the theory to explain why choice-intransitivity is irrational. In order to 23 For another, fifth argument see Bradley [2012:14f.], who discusses the requirement of coherent extendibility. According to it it should be possible to extend ones current preferences to complete and consistent ones when deliberating about new alternatives. Coherent extendibility guarantees consistency when reaching new judgements. 18 3 Rationality Axioms? do so, some welfare-significance has to be attributed to the preference-relation  (or alternatively ≻) otherwise this conclusion cannot be reached.24 All that can be otherwise surmised is that an agent exhibiting intransitive preferences can end up with an option that is never chosen directly over the original status quo. Although we agree with Mandler on his observation, we hold that nevertheless even the choice-interpretation of preference has to admit some psychological import. Our remark not so much targets Mandler’s argument per se, but is rather part of a bigger argument against the very thin, behaviourist equation of preferences to choice. Although we cannot at length discuss the theory of revealed preference here, we want to highlight the fact one should not forget that we are dealing with agents whose choice-behaviour wants explaining. RPT offers a kind of explanation which completely ignores the role and existence of intentional, psychological states of individuals. Requiring some minimal reference to individuals’ psychology in explaining their behaviour, does not result in an automatic collapse of the theory into a welfarist one. 3.3 Allais’ Paradox & Separability The separability principle is said to constitute the ‘cornerstone’ of Savage’s theory [Joyce 1999:85]. For many theorists the theory of expected utility rises and falls with this principle [Broome 1991:115]. On a normative reading, separability, or as it is also called, the Sure-Thing Principle (hereafter STP) implies restrictions on the objects to which it refers. It requires ones desirability of the consequence of an act in one state of the world to be independent of ones desirability of the act’s consequences in any other state of world.25 Technically speaking, the STP is an instance of weak dominance. Weak dominance demands that if all the consequences of an act a are weakly preferred to those of another act b and a has one consequence which is strictly preferred to one consequence of b, then a ought to be preferred overall. The validity of this reasoning is derived from a pre-theoretical commitment to consequentialism, i.e. the only thing which matters, or one is justified to consider, when assessing the goodness of an action are its consequences. All concerns about the relevance of procedural aspects of decision making are omitted from this view. This is where Allais [1953] critique comes into the picture. In the shape of Allais’ Paradox, it calls the STP’s status as a requirement of rationality into question. Interestingly, the argument is not purely conceptual, but is supported by considerations about the empirical deviation from the prescribed, rational solution. The paradox applies to decision situations with the following structure (see Table 2): An individual placed in this decision situation is confronted with two choices in two distinct lotteries. The first (I) is a choice between lotteries A and B. The second (II) is a choice between lotteries C and D. A substantial proportion of 24 25 In our discussion of the initial money-pump argument, this was achieved by stating that Mr. Jones is worse off having lost 3D. Furthermore, the STP is valid only if the states of the world are probabilistically independent of the acts to which they are compared with. 19 3 Rationality Axioms? I II A B C D 0,01 1.000.000 D 0D 1.000.000 D 0D 0,1 1.000.000 D 5.000.000 D 1.000.000 D 5.000.000 D 0,89 1.000.000 D 1.000.000 D 0D 0D Table 2: Allais’ Paradox. people, some decision theorists among them, have been found to make choices inconsistent with the STP26 - combinations of choices which are inconsistent are either A and D or B and C where the former pair is most often observed in empirical findings and also the option some decision theorists try to justify. Here is a demonstration why this is so. We denote the three different outcomes figuring in the example as follows: ux = u(5.000.000D), uy = u(1.000.000D), and uz = u(0D) to . If the expected utility principle holds, the preference A ≻ B implies uy ≻ 0, 01uz + 0, 1ux + 0, 89uy . Introducing the probabilities of each outcomes we get the following 0, 11uy ≻ 0, 01uz + 0, 10ux . By adding 0, 89uz to both sides, we obtain 0, 11uy + 0, 89uz ≻ 0, 10ux + 0, 9uz , which says C ≻ D.27 The only two pairs of choices consistent with STP would either be a choice of A and C or B and D. Allais explains the deviance from the STP by arguing that individuals do not evaluate what happens in one state of the world independently from what happens in others. Put differently, contrary to what the STP requires of rational decision makers, it can happen that outcomes in different states of the world are not evaluated separately. Is there something special to the choice situation above, which might explain and possibly justify deviance from the STP? With reference to the pair A and B, Allais argues that avoidance of feeling disappointment or regret displays sufficient grounds to render this behaviour rational. Whereas A yields a certain amount, B would yield considerably higher amount but is way more risky. The trade-off between certainty and riskiness is not present in (II). Weirich [1986:436f.] further argues that the riskiness of the alternatives are relevant properties not taken into account when having framed the decision problem. Broome [1991:96 & 110] describes the peculiar structure among the outcomes and lotteries in question, as complementarities between ‘interacting’ states. Their Interaction is due to the specific way the individual lotteries are connected with prizes - a feature the STP cannot take into account because it just considers individual choices. But what can these observations tell us? Should we make our choices by taking into account what might happen in other states, thereby dismissing the STP as a rationality axiom? Is avoidance of regret rationally defensible? This line of argument brings up a general problem though. Would we be likewise justified to feel regret about a fair coin landing heads, if we betted on 26 27 See Maher [1993:64] for a list of empirical studies. The structure of this argument is taken from Gintis [2009:16f.]. 20 3 Rationality Axioms? tails? Samuelson [1952:672] gives a negative answer to the question. He states “either heads or tails must come up: if one comes up the other cannot.” The value one assigns to what happens in one state of the world (the coin landing heads) should not depend on what it would be like if the other state occurs (the coin landing tails). If that state really occurs no other state does. What would have happened in other states should make no difference to the value.28 In the centre of this argument lies the question: How can something that never happens possibly affect the value of something that does happen? Point taken, but isn’t there something special to Allais’ situation? Similar to the case of transitivity, Broome [1991:98] tries to solve the problem by fine-individuation. The reasons trying to justify a violation of the STP, all refer to a feeling, namely regret or disappointment. He argues that this is an indication that the decision problem might have been underspecified. A refinement of the decision problem incorporating reference to feelings achieves to get rid of the paradox. Again, similar to the argumentation above there comes in a slippery slope argument, which points towards the fact that any violation could be defended by fine-individuation. The force of the argument is that if any violation could be explained away by re-describing the decision problem this could make decision theory vacuous; after all decision theory aims at constraining ones choices, specifically so separability. STP is thought to be a constraint ones preference! Could the avoidance of regret be a powerful enough reason to act as a justifier for re-describing outcomes so as to involve feelings about risk sentiments such as Weirich proposed? This could reconcile the STP with attitudes towards regret, thus Allais’ choices came out to be rational. 3.4 State-Independence The status of the remaining axiom P3 is probably the most ambiguous and overlooked one of all seven (eight) postulates. Does it make a claim about rationality or is it a constraint on the interpretation of consequences and states? Consider an act a with the constant consequence x and another act b with the constant consequence of y.29 Suppose one prefers a to b, thus x over y, given some event E.30 Should we show the same preference given any other event E ′ ? State-independence requires one to do so, but it is questionable whether a convincing case can be made for it. In a first approximation one might want to specify that any answer would heavily depend on the specification of background conditions, e.g. the features relevant to an acts evaluation. In order to pull this argument from the abstract to the concrete consider (our reconstruction of) Aumann’s [1971:77f.] objection: 28 29 30 Others in favour of the STP, have reconstructed Allais’ decision problem in a sequential decision tree and then show that in this context the large majority of people adheres to the principle [Maher 1993:70ff.]. Unfortunately this line of defence is not very useful in the context of our discussion since we are interested in normative arguments. This means that acts a and b are constant acts. Event E has to be non-null, i.e. Pr(E) ⊁ 0. 21 3 Rationality Axioms? Suppose, Mr.X loves his wife very much and judges that without her his life would loose much of its purpose. His live would be “less ‘worth living’ ”. Mr.X’s wife falls ill and has to undergo a dangerous operation in order to keep on living and not die. Her chances of surviving the operation are 0.5. Imagine Mr.X be offered a choice between betting $100 on his wife’s survival or on the outcome heads in the toss of a fair coin. Aumann argues that, even though the two bets seem identical with regard to their outcomes and probabilities - thus the rational thing to do would be to be indifferent among them - Mr.X would still be justified to strongly prefer betting on his wife’s survival to anything else. The reason being that the gain of a $100 in the event that the wife dies is “somehow worthless”. If he bets on heads (or tails) he might win the $100 in a situation in which he will not be able to appreciate them. Aumann argues that in this situation there simply is no consequence whose desirability is state-independent. Is Mr.X’s supposed preference irrational? Savage [1971:80] replies to this objection by slightly twisting the example. He demands of Mr. X to imagine the counterfactual situation in which the continuance of his family life would not depend on the outcome of the operation. Only then, once he is detached from his personal feelings towards the situation, is he in the position to “appraise his own probabilities”. Savage further states that a consequence is in the last analysis an experience, where experiences screen out the features of the world causing them and hence have static, independent desirabilities [79]. Doesn’t this move, instead of solving one problem, generate another one? If, at the end of the day consequences are nothing but subjective experiences, what is the difference between such a consequence and one’s evaluation of it? Despite considerations which concern pragmatic difficulties for Mr. X to successfully detach himself from the situation he finds his and his wife in, Savage seems to miss the mark here. The point of Aumann’s example is not about the problem to elicit Mr. X’s beliefs about the chance of his wife’s survival31 , but rather by the very convincing case that desirabilities can be intimately dependent on the state in which it obtains, and rationally so! Why should Mr. X care at all about $100 in the event of his wife dying? To further rest our case consider an example, closer to everyday live experiences. Take the decision between having a cold beer or a hot chocolate. State-independence requires that ones preferences - in this case we focus on just the desirability - between these acts (‘having a cold beer’; ‘having a hot chocolate’) for any given situation to be independent of a particular state obtaining. In the context of this example the relevant states could be ‘A warm summer evening on the beach’ and ‘A cold winter day’. Say one prefers the hot-chocolate on cold winter day; is one required on grounds of rationality to prefer the chocolate over the cold beer on a warm summer evening? It seems not, why couldn’t one be justified to always prefer the hot chocolate on a winter’s day over the cold beer and vice versa? One response could be 31 Ensured by the fact that all states of the world are assumed to have the same probability of 0.5. 22 4 Conclusion that having a cold chocolate is not a properly specified consequence since the description is to coarse, leaving out relevant aspects for its evaluation. Having a cold chocolate when it is cold outside might be more faithful to the real issue. It follows that a precondition for State-Independence to hold, is that consequences have to be described in a sufficiently fine grained manner. If the description of a particular consequence turns out to be too coarse grained, then aspects which might be relevant to the agent’s assessment of the state’s worthiness - which is relative to the consequence considered - are left out. To conclude, the fact that decision makers’ preferences depend on the underlying state of the world is a feature which cannot always argued to be irrational! As a result P3’s scope should be changed so as to only be used in specific situations where occurrence of state-dependent preferences are justifiably ruled out. 4 Conclusion This completes our discussion of the rationality axioms. We conclude by rejecting completeness and accepting the other three principles, although with restrictions. As we have seen Savage’s theory only focuses on factual uncertainty, that is uncertainty about which states of the world obtain. Another kind of uncertainty which is not addressed by Savage, is option uncertainty. It concerns the specific way to describe decision problems. Issues that arise with this kind of uncertainty were touched upon by Broome and his proposal fine-individuation of outcomes. This method, we argue, is absolutely crucial. We have seen how powerful it can be. The problem we faced there, consisted in the absence of clear criteria which define when it is permissible to re-describe outcomes and when not. In discussing the rationality axioms, the general trade-off consisted between either rationalising certain deviant behaviour such as intransitivity of preferences but at the same time making decision-theory less prescriptive and therefore more vacuous. The threat of making decision theory a vacuous enterprise prevails. This makes it even more important to come to terms with option uncertainty in a way that comes up with criteria which determine the permissibility of fine individuation. This could be a relevant area of further research. A third and related kind of uncertainty addresses concerns about the defensibility of preferences address ethical uncertainty, i.e. uncertainty about ones relative desirability assignments to outcomes. The problem of incommensurability of alternatives, discussed in the section about completeness, lets us points the finger at another issue, namely a peculiar futility of non-substantive theories of rationality of which Savage is an instance. Theories like Savage’s which are only concerned about internal consistency cannot serve to guide agents about what preferences are rationally defensible. Individuals who desire some guideline in deliberating which normative criteria are appropriate and how these should be applied get no answer from these. In other words non-substantive theories cannot evaluate the sources of preferences. An answer to the main question of practical rationality, ‘How should I act?’ cannot completely be 23 REFERENCES answered by a formal decision theory alone. References Allais, Maurice (1953): Le comportement de l’homme rationnel devant le risque: Critique des postulats et axiomes de l’école Américaine. In: Econometrica: Journal of the Econometric Society,, 503–546. Anand, Paul (2009): Rationality and Intransitive Preference. In: The Handbook of Rational and Social Choice. Oxford: Oxford University Press, 156–172. Aumann, Robert J. (1987): Letter from Robert Aumann to Leonard Savage, 8 January 1971. In: Dréze, Jacques, editor: Essays on economic decisions under uncertainty. Cambridge: Cambridge University Press, 76–78. Binmore, Ken (2009): Rational Decisions. Princeton/Oxford: Princeton University Press. Bradley, Richard (2013): Decision Theory: A Formal Philosophical Introduction. No address in, LSE. Broome, John (1978): Choice and value in economics. In: Oxford Economic Papers, 303 , 313–333. Broome, John (1991): Weighing goods: Equality, uncertainty and time. Oxford: Basil Blackwell. Broome, John (1999): Can a Humean be moderate? In: Ethics out of Economics. Cambridge: Cambridge University Press, 68–88. Davidson, Donald, McKinsey, John & Suppes, Patrick (1955): Outlines of a Formal Theory of Value. In: Philosophy of Science,, 140—160. Fishburn, Peter C (1970): Utility theory for decision making. New York: Wiley. Gintis, Herbert (2009): The bounds of reason: game theory and the unification of the behavioral sciences. Princeton: Princeton University Press. Herzberg, Frederik (2009): Elementary non-Archimedean utility theory. In: Mathematical Social Sciences 58. Jeffrey, Richard C (1965/1983): The Logic of Decision. 2nd edition. Chicago: University of Chicago Press. Joyce, James M (1999): The Foundations of Causal Decision Theory. Cambridge: Cambridge University Press. 24 REFERENCES Maher, Patrick (1993): Betting on Theories. Cambridge: Cambridge University Press. Makinson, David C. (2007): Completeness theorems, representation theorems: what’s the difference? In: J. Josefsson D. Egonsson, T. RønnowRasmussen, B. Petersson, editor: Hommage à Wlodek. Philosophical Papers Dedicated to Wlodek Rabinowicz. Lunds Universitet. Mandler, Michael (2001): A Difficult Choice in Preference Theory: Rationality implies Completeness or Transitivity but not Both. In: Millgram, E., editor: Varieties of Practical Reasoning. MIT Press. McClennen, Edward F. (1990): Rationality and Dynamic Choice. Cambridge: Cambridge University Press. Morton, Adam (1991): Disasters and Dilemmas. Oxford: Basil Blackwell. Samuelson, Paul A. (1952): Probability, Utility and the Independence Axiom. In: Econometrica 20, Nr. 4. Savage, Leonard (1954/1972): The Foundations of Statistics. 2nd edition. New York: Dover Publications. Savage, Leonard (1987): Letter from Leonard Savage to Robert Aumann, 27 January 1971. In: Dréze, Jacques, editor: Essays on economic decisions under uncertainty. Cambridge: Cambridge University Press, 78–81. Steele, Katie (2007): Precautionary Decision-Making. Ph. D thesis, Department of Philosophy, University of Sydney. Suppes, Patrick (2002): Representation and Invariance of Scientific Structures. Stanford: CSLI Publications. Temkin, Larry S (2012): Rethinking the Good: Moral Ideals and the Nature of Practical Reasoning. Oxford: Oxford University Press. Von Neumann, John & Morgenstern, Oskar (1944/1953): The Theory of Games and economic Behavior. Princeton: Princeton University Press. Weirich, Paul (1986): Expected utility and risk. In: The British journal for the philosophy of science, 37, Nr. 4, 419–442. 25