Academia.eduAcademia.edu

Propositions and Reasoning in Frege and Russell

1998, Pacific Philosophical Quarterly

Criticism of the 'Universalist Conception' of logic.

© Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. PROPOSITIONS AND REASONING IN RUSSELL AND FREGE By Gary Kemp For Frege, and for Russell during the crucial years immediately following his disavowal of idealism, the essential subject-matter of logical theory was not language but the propositions expressed by language (I shall use ‘propositions’ for either Fregean thoughts or Russellian propositions). The point has both a metaphysical and an epistemological side. The proposition is the real bearer of truth and falsity, and its existence and identity are independent not only of language but of the mind. Still, it admits of being apprehended, of being ‘grasped’ by the mind. Thus Frege pronounced that knowledge or cognition is ‘embodied in judgement’, which is nothing but the acceptance of a proposition as true.1 Accordingly, to draw an inference - to reason - is to move from judgement to judgement; it is to accept a proposition as true ‘on the basis of’ other propositions accepted as true.2 It follows that however psychologically dependent any actual creatures may be upon language for their cognitive prowess, thought or reasoning is not essentially something linguistically embodied; its essential condition is just the apprehension of, and the taking of attitudes towards, propositions.3 I shall call this thesis that of the Primacy of the Proposition. Frege was ever more explicitly commited to it. Russell held it during the aforementioned period, except that unlike Frege he was inclined to think of the constituents of propositions as being potential items of apprehension independently of the mind’s engagement with them as parts of whole propositions (and even characterized this so-called acquaintance as constituting a form of knowledge - in this case of ‘things’ rather than truths). Thus Russell, despite having hit upon the technique of defining certain sorts of expressions by defining their sentential contexts, did not accept anything like Frege’s context principle, understood as a bit of epistemology.4 But for my purposes that difference is immaterial. So is Russell’s having rejected Frege’s sense-reference distinction, and so is his having ceased, around 1910, to believe in propositions. For he continued to believe in their parts, and allocated to them what was substantially the same role in judgement: whereas he had thought of judgement as a binary relation between the judger and an independently existing and unified proposition, he now thought of it as a many-termed relation between the judger and what on the old scheme were the components of the proposition, which achieve propositional unity, if ever, only if they should peradventure be combined in some actual judgement (unless the judgement is true, in which case they are combined in the corresponding fact). On either view, the constituents stand before the mind in the act of judging, and on neither view does language enter into the conception of what it is to judge or think. My primary aim in what follows is to show that the thesis of the Primacy of the Proposition is not compatible with the formal treatments of inference expounded in Frege’s or 1 © Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. Russell’s logical works. A further purpose is to cast doubt upon it more generally. I I begin with Frege. The thesis of the Primacy of the Proposition figured ever more prominently in his life-long crusade for logical rigour in mathematics. As noted, Frege conceives an inference as that sort of judgement which is made on the basis of other propositions already accepted as true. Consider, for example, modus ponens: if I have already accepted the propositions that p and that if p then q, then, without further ado, I may judge that q (on pain of a well-known vicious regress, we cannot suppose that such an inference also requires the premise that q does follow from those premises). Now a perfectly presented, ‘systematic science’ is a true and complete account of its subject which, by explaining their truth, explicitly displays the real epistemic grounds for all its assertions. It thus consists confining ourselves to the a priori sciences - of statements expressing a set of true propositions which collectively embody a certain structure: Every proposition is either an axiom - a proposition neither requiring nor admitting of proof - or a theorem derived from axioms by a gap-free chain of purely logical inferences (since definitions are mere abbreviations which add no content, they can be ignored).5 Frege’s commitment to this ideal can be made vivid by briefly considering his central criticism of Hilbert’s Grundlagen der Geometrie (some terminological anachronism will have to be risked here for the sake of concision).6 Hilbert - or more accurately his apologist Korselt - was espousing an incipient formalism or deductivism, whereby mathematical theories are investigated strictly for their logical properties - their customary interpretations, for this purpose, set aside. Some of this was expressed in terms of what for Frege were such alarming ideas as that axioms ‘implicitly define’ the concepts occurring in them, but what we would now describe as a set of axioms determining a set of models. Now a number of related issues, both historical and conceptual, will have to be kept at bay here; but the crucial point is that Frege cannot accept the idea that an axiom or theorem is something which can be given an interpretation - as happens, for example, when Hilbert declares the consistency of Euclidean Geometry by specifying a model with a domain of real numbers. In Frege’s view, since Hilbert’s deductions employ schematic (uninterpreted) predicate letters, the formulae in which they appear do not express propositions, hence do not have truth-values, and hence do not themselves represent genuine advances in knowledge. Since a theorem is simply a true proposition, a formula which admits of more than one interpretation expresses no proposition in particular, and hence no theorem. Thus, if a theory is a system of theorems concerning a certain subject-matter, it makes no sense to speak of its being susceptible to different interpretations. Instead, the idea that an axiomatized theory should be considered purely ‘formally’ - 2 © Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. that is, independently of the concepts expressed by its predicates - should not be explained in terms of signs and their interpretations, but in terms of second-order quantification. In particular, each theorem must be thought of as a second-order, universally quantified conditional whose antecedent is the conjunction of what Hilbert called the ‘axioms’ of the theory, and whose consequent is what Hilbert called the ‘theorem’ (since this will not itself express a proposition, Frege calls it a ‘pseudo-sentence’)7. For example, where Hilbert had derived ‘(x1, ... xn)i’ from a set of axioms {A0, ... An} - with ‘i’ appearing in the axioms and, strictly speaking, serving as a mere schematic predicate - the thing ought to appear as ‘(1 ... n)(x1 ... xn)(A0 & ... & An  i’ - with ‘i’ now a bound variable. The results are not facts of geometry at all. They are simply truths of logic, to the effect that any properties and relations possessing the second-order properties and relations specified in the antecedents of those conditionals must also possess those specified in their consequents. The application to Euclidean Geometry then depends upon the premise that such relations as  is between  and  and  lies upon  do possess the requisite second-order properties and stand in the requisite second-order relations (this argument is independent of Frege’s view that that premise is supplied by our intuitive acceptance of the genuine Euclidean axioms). What is striking is that Frege should have regarded this issue as critical - that, in response to Hilbert, he should have seized upon this. The point, of course, was not that Hilbert’s Grundlagen was of no mathematical importance.8 Frege’s concern, as ever, was that even the best mathematicians were labouring under various false conceptions of the nature of mathematical knowledge, and hence of mathematics itself. In Frege’s view, Hilbert and many others could not adequately address foundational questions because they had yet to recognize the character of truly systematic science. That can only be a philosophical question, and in Frege’s view the answer is strongly constrained by a proper account of judgement: since knowledge is generated by judgement (as Frege conceives it), and a theory is something which actually embodies our knowledge of something, there is simply no such thing as a theory which is not fully interpreted, no such thing as reinterpreting a theory.9 In particular, then, there is no such thing as a kind of reasoning which is merely structural or formal - if that means that it does not consist in advancing from judgement to judgement, from complete proposition to complete proposition.10 What might seem to be mere pettifogging literalism is in Frege’s eyes philosophically inescapable. Let us ask, then, how well Frege manages to obey his principles in his own mathematical work - the relevant work being Begriffsschrift and Grundgesetze. The commitment to the Primacy of the Proposition, in Frege’s work (as well as in Russell’s) is closely allied to a rather heroic conception of the science of logic itself, which Jean van Hejinoort has aptly called the ‘Universalist’ conception. Everyone will agree that the mission of logic is in some sense to formulate universally applicable standards of reasoning. But for Frege, this mission is not to be carried out by devising a truth-preserving calculus, a calculus 3 © Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. ratiocinatur. In the years immediately subsequent to the publication of Begriffsschrift, Frege repeatedly explained that unlike Boole’s or Schröder’s, his logic is represented in a fully interpreted language, not as a mere formalism adaptable for the purposes of the moment. Thus in the footnote to §17 of Grundgesetze, Frege says that in a concept-script, ‘we always proceed directly from one asserted proposition to another’; likewise the heading of §32 is that ‘Every Sentence of Begriffsschrift Expresses a Thought’.11 In claiming this, he is not saying that every formula, once its non-logical symbols are assigned interpretations, expresses a thought; he is saying that every logical formula expresses a thought in its own right. Accordingly, he characterizes the theorems of logic, not as formulae or schemata true on all interpretations, but simply as the ‘most general laws’: whereas the special sciences represent the facts concerning some particular domain of reality - some particular range of objects and concepts - logic represents the most general and abstract facts, ones which hold for all objects and concepts whatsoever (Russell advances a similar view in The Principles of Mathematics12 ). Logic thus serves as the prescriptive ‘laws of thought’, not as a universally applicable method of ensuring validity, but as a set of universally constraining facts. It follows that there is no external perspective from which the deduction of logical laws from others can be thought of as being justified. Every judgement needed in the establishment of a theorem of logic must itself be expressed as a line in the proof, not as part of a separate argument to the effect that some particular mode of inference is correct.13 So Frege presumably believed he had obeyed the strictures he derives from his theory of judgement. And beginning with those in Begriffsschrift, his explanations of generality suggest that he had taken some pains to ensure that he could. Of course he recognizes the necessity, in carrying out inferences involving generality, of dropping quantifiers in order to expose the truth-functional complexity required for the application of inference rules such as modus ponens. But here, as well as repeatedly elsewhere, he explains his ‘Roman letters’ which correspond syntactically to the free variables of more recent formulations of logic - as actually ‘conferring generality’ upon the content of a proposition. Indeed in Begriffsschrift he describes ‘Fx’ merely as an abbreviation for ‘xFx’ (of course I am pretending that Frege employed a notation like those now standard).14 Elsewhere, ‘xFx’ is explained, not as the truth of ‘Fx’ on all assignments of ‘x’, but as the subsumption of the concept denoted by ‘F’ under the second-order concept denoted by the quantifier; it follows that ‘Fx’ itself involves a tacit quantifier. By the theory of Sense and Reference, then, the general proposition is composed of senses corresponding to those entities.15 In Grundgesetze (§8), the matter is turned around: Roman letters are introduced first as conferring generality; it is then pointed out that expressing generality merely by means of Roman letters will not enable one adequately to demarcate the scope of a generalization. In particular, it will not enable one to distinguish the ‘generality of a negation’ from the ‘negation of a generality’: if ‘Fx’ is a way of writing that everything is F, then still we need some way of distinguishing the negation of 4 © Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. that generalization from the assertion that everything is not-F. The notation of quantifiers and bound variables is then introduced for the purpose, and everything suggests that the point is strictly one of symbolism. Roman and Gothic letters (free and bound variables) are said equally to ‘indicate’ rather than ‘denote’, by which Frege means that by flagging the relevant argument-places into which object or function-denoting names may be inserted, they indicate precisely what generalization is being expressed (§17). When Frege acknowledges the necessity of dropping quantifiers in order to carry out the Barbara inference, he says merely that this is the necessity of ‘writing’ the premises in a way that admits the application of modus ponens, again implying that the change is only notational. If one glances over the theorems of Grundgesetze, one might quickly gain the impression that this conception has been realized. The theorems do contain Roman letters, but if one thinks of each theorem as being preceded by implicit quantifiers whose scope extends over the whole proposition, one for each distinct Roman letter (including ones which indicate functions), then the theorem will indeed express the desired general proposition. Even the letters appearing in truth-functional theorems are Roman letters, but that is as it should be: for example, if ‘p  (q  p)’ is to express the envisaged general fact, then we must think of the letters occurring in it as universally quantified variables ranging over all objects; it only makes this more vivid if we express the principle as ‘(p,q)[p  (q  p)]’. Since for Frege sentences are names of truth-values and truth-functions are literally functions on truth-values, any instantiation to sentences expresses a truth.16 The trouble is that procedures of inference are obstructed by this way of regarding theorems. Suppose, for example, we wished to prove (the proposition expressed by) ‘xy[Gy  (Fx  Gy)]’, following the stricter regimen of Begriffsschrift. (Since this is really a generalization about concepts, strictly speaking we should add quantifiers binding the predicate-letters as well, but there is no need to add that complication here.) We substitute from the relevant truth-functional axiom to formulae involving Roman letters, then append the appropriate universal quantifiers, yielding the theorem: (1) (2) (3) (p,q)[p  (q  p)] Gy  (Fx  Gy) xy[Gy  (Fx  Gy)] Since, according to Frege’s pronouncements, (2) and (3) express the same proposition, it all hangs on the inference from (1), a generalization, to (2), another. Clearly the inference cannot be one of Universal Instantiation. For if it were, then, in deriving (2), we must substitute names of truth-values, that is sentences, for ‘p’ and ‘q’, in which case the result would be something truth-functionally complex, not something synonymous with (3). (By ‘truth-functionally complex’ I mean a proposition which is the immediate result of combining 5 © Gary Kemp; penultimate draft; final draft in the Pacific Philosophical Quarterly 1998 79/3: 218-35. one or more propositions with a truth-functional connective.) That is, if (2) is truthfunctionally complex then ‘Gy’ there either expresses some particular ascription of a particular concept to a particular object, or is itself a universal generalization. Since what we are after is certainly not a statement about any particular object or concept, it cannot be the former. But if it’s the latter, then - even ignoring the question of the meaning of the predicate-letter - we cannot maintain that the scope of ‘y’ in (2) is the whole proposition, which we must, if (2) is to express the desired generalization. For if Roman letters express generality, then ‘Gy’ and ‘Gz’ express the same proposition; hence for (2) we could just as well have written ‘Gy  (Fx  Gz)’, which is no help towards (3) (the same reasoning repeated for the predicate variables makes things all the worse). Of course, the normal route nowadays would go from a schematically understood ‘p  (q  p)’ to (2) by substitution, to (3) by Universal Generalization. That is not available to Frege, but it might be suggested that the inference from (1) to (2), where (2) is reckoned synonymous with (3), is after all sanctioned by Frege’s principle of substitution, whereby any Roman letter may be replaced by any expression of the same grammatical category, even if the substitutends themselves contain Roman letters. In fact that is exactly what Frege does. And, ignoring the fact that Frege also regards lines such as (2) as truth-functionally complex, Frege’s procedure is valid, in the sense described above: every line so derived, understood as bound by appropriate quantifiers, is true. But this technical fact does not address the philosophical issue: it does not explain in what manner the inference may be regarded essentially as an operation upon propositions. Substitution as conceived nowadays is a syntactical operation upon logical schemata, not a generalization about objects, concepts, or propositions; it is justified semantically by showing that the validity of the original schema ensures the truth of the universal closure of every substitution instance. Now the substitution procedure in Frege’s logic - explicitly formulated only in Grundgesetze - would be correct from that point of view, but it ought to be puzzling from his. For according to that point of view, the validity of substitution (together with universal generalization) represents a fact about particular languages, not a logical principle in its own right. Frege calls it a ‘simple inference’; but, if conceived as a step from one proposition to another, it is by no means generally simple, and - in contrast with his careful discussions elsewhere of modus ponens and universal instantiation - Frege offers no explanation of it that elucidates its character as a step from one proposition to another.17 At most, there is one variety of substitution that can readily be explained as an operation on propositions. Suppose for example that in a firstorder generalization ‘(p)’ we wish to substitute the expression ‘Fx  Gx’ for ‘p’ (where the former expresses a generalization). This cannot be explained as the instantiation of a generalization, but it might be explained as a reduction of its scope: for it is clear that if something holds for all objects - hence for all truth-values - then it must hold for all those which are values of functions of a given internal structure (though for Frege, this talk of the 6 internal structure of a function would be problematic). But this does nothing to address the problem of cross-referring variables exemplified, at its most elementary, by the inference above from (1) to (2); the simple variety of substitution-instance just described may add syntactical structure within an argument-place, but not add structure which extends across argument-places. Put more intuitively, (1) indicates a truth-functional relationship between two propositions; (3) does not. The grammatical category of ‘Fx’, as it occurs here, is not that of ‘p’ (irreducibly polyadic examples will make this more vivid). The problem is general: given the way that his epistemological imperatives force Frege to think of inference, and of truth-functional theorems - as generalizations about the meanings of complete sentences - it is not at all clear what it is to derive an instance of a truth-functional theorem whose linguistic expression will comprise formulae with variables referring across that truth-functional complexity, bound by (possibly tacit) quantifiers whose scope is the whole proposition.18 The syntactical procedure of substitution-and-universalgeneralization tells us how to obtain sentences which express the required generalizations, but we have as yet no explanation of how this might be made consistent with the Primacy of the Proposition, and certainly none of how it might connect with or be justified by what, on the Universalist conception of logic, could rightly be called logical truth (remember that proofs, on the Universalist conception, cannot be thought of as being backed by a global argument for the soundness of the calculus). I shall return to this, including the question of in what way the thesis of the Primacy of the Proposition demands the Universalist conception. Now the only sign that Frege recognized this difficulty is that, after having said all that I have related concerning the role of Roman letters, he effectively takes it all back towards the end of §17 of Grundgesetze. He now says that combinations of signs containing Roman letters are not names, but expressions which become names when the Roman letters are replaced by names. Hence sentences containing Roman letters - including many theorems of Grundgesetze - are not names of truth-values, and do not express complete propositions.19 Perhaps Frege saw that this was inconsistent with his epistemological strictures, but unlike his frank worries over various related issues such as his worries about Hilbert - and unlike his careful explanations of other, simpler sorts of inference - he never puzzled over it in his writings. II Russell, however, puzzled over it openly and at length. His divagations range widely over the years 1903 to around 1924, and they are highly ramified at each point. Indeed, the Russell I shall consider is not one who existed at any one time. I shall think of Russell as having combined the binary theory of judgement assumed in The Principles of Mathematics (1903), with the account of generality in terms of propositional functions propounded in Principia Mathematica (1910). We shall thus avoid the tangles surrounding the Principles 7 theory of denoting concepts, and the incipient multiple relation theory of judgement of Principia (expounded in more detail, and with various adjustments, in succeeding works).20 I think this will be harmless: nothing I say depends upon features of the binary theory not shared by the multiple-relation theory, and those familiar with the theory of denoting concepts will know that those logical homunculi cannot provide any genuine relief from the sorts of difficulties I shall discuss. I shall, however, stress the epistemological ground common to Russell’s theories of judgement: that any constituent of a judgement must be something with which we are acquainted: that in order to judge that p, the constituents of p must be real mind-independent entities which in some sense stand before the mind.21 Russell retained this view throughout his ever-diminishing estimates of what we may assume ourselves to be acquainted with. Notoriously, Russell never explained how the logical theory of ‘On Denoting’ or Principia was to combine with the principle of acquaintance in the explanation of general judgements, but something like the following seems inevitable. The crucial notion is that of the propositional function. A propositional function x_ (such as _x is wise) is a function whose values are propositions: a for example is the value of x_ for the argument a. To assert that everything is , according to Russell, is to assert the truth of all values of x_. What, then, are the constituents of the proposition that everything is ? It cannot simply be composed of all values of x_, for that would require acquaintance with every object, and the proposition would be of implausibly enormous, if not infinite, complexity. Instead, Russell thinks of a universal generalization as ‘involving’ the propositional function itself. Whereas the proposition a does not itself contain the propositional function x_, the universal generalization does contain it (equivalently: is about it); it says that all its values are true. The conclusion seems inevitable then that for Russell, much as for Frege, the universal quantifier signifies a property of propositional functions, one which forms a constituent of general propositions. Unlike Frege, however, Russell not only sees clearly that quantifiers must be dropped for purposes of logical deduction, he sees from the start that the significance of doing so is not merely symbolic, and he struggles to come to terms with it philosophically. The issue is stated most directly in ‘Mathematical Logic as based on the Theory of Types’ (1908). Of a universal generalization, as just explained, Russell says that it is a ‘determinate proposition’ (in Principia, that it is ‘in no sense ambiguous’). By this Russell means that it does not contain a variable; its linguistic expression contains the letter ‘x’, but this is merely an ‘apparent’ variable, needed, as for Frege, to indicate precisely what is being said to hold in all cases. In carrying out deductions, however, we must operate with what Russell calls ‘real variables’. In particular, where we drop the quantifier and write ‘x’ as a line in a deduction, what we are doing is asserting a propositional function. In this case what we assert does contain a variable; to assert a propositional function is not to assert all its values, but to assert 8 ‘any value’ of it.22 Now in spite of that way of putting the point, and in spite of the slightly misleading first paragraph of Russell’s initial discussion of propositional functions in Principia, Russell does not mean that in such a case the propositional function is itself the object of judgement. Instead, to assert x is to assert something which is just like a, except that it contains a real variable where the latter contains the object a. x is thus not the propositional function but an undetermined value of it, and to assert it amounts to an ‘ambiguous assertion’. Thus Russell writes: When we assert something containing a real variable x, we cannot strictly be said to be asserting a proposition, for we only obtain a definite proposition by assigning a value to the variable. This could be taken in either of two ways. It could mean that there is a certain something containing a real variable x, and which we assert, but that this thing is not to be called a ‘proposition’. Or it could mean that there is no thing in particular that is the object of assertion (or judgement); rather we make an assertion of an arbitrary member of a certain class of propositions, namely the values of the asserted propositional function, in such a way that the truth of what we assert does not depend upon which member of the class happens to be the arbitrary one. On the first alternative there seems to be no good reason not to call the thing asserted a proposition; even if for some reason truth or falsehood can attach directly only to the values of the function, still it has at least derivative truth-conditions, and it is a thing which can be asserted or judged. On the second alternative there is only mystery, which I suspect it would be unremunerative to enter into at any length. Suffice it to say that where judgement is conceived as a relation between mind and a nonlinguistic object which is apprehended by the mind, and assertion as the linguistic expression of judgement, it makes no sense to say that assertions may take place when there is no particular object - especially not when it is quite clear that the conditions of success (the truth-conditions) of such an assertion are exactly specified, as in the present case they certainly are (it should be conceded, however, that the notion of ambiguous assertion is relatively unmysterious so long as sentences are understood to be the objects of assertion). To be driven towards the first alternative, however, is to be driven back to a problem which had bedevilled Russell unmercifully in the Principles. To assert, as a universal generalization, xRx (that everything bears R to itself) is clearly different from asserting, as a universal generalization, that xRy (that everything bears R to everything). Both propositions can be obtained by extracting the relation R from a proposition aRb, and replacing a and b with variables. But if we are discussing propositions rather than sentences, and these 9 propositions contain variables, then the variables x and y must be nonlinguistic objects, and must furthermore be distinct. But how could they be? After all, ‘xRy’ and ‘yRx’ (understood as generalizations) express the same proposition, as do ‘xRx’ and ‘yRy’. If variables enjoy an extralinguistic existence, then each can be signified by any letter, ‘x’ or ‘y’. As Russell had put it in the Principles, the (unrestricted) variable x (or y, or ...) must be ‘any term’, any object whatsoever; equivalently, something which ‘denotes’ but in an ‘impartial distributive manner’.23 It is hardly comprehensible how there could be two such entities - distinct, yet each such as to be any term whatsoever. Moreover, such entities would have to be not only distinct in themselves but distinguishable in thought, such that their appearance before the mind would enable one to distinguish xRx from xRy (without thereby distinguishing xRx from yRy).24 These near absurdities may have contributed to Russell’s having introduced, in the second edition of Principia, a method of quantificational inference which dispenses with real variables. All sentences containing unbound variables, Russell now says, are to be understood as governed by tacit universal quantifiers, one for each variable. Quantificational inference is now to be carried out, not by means of instantiation and generalization, but by means of a new inference rule: from (x)x and (x)(x  x) we may now infer (x)x directly. He then sketches a method for proving general propositions using the new rule. Whether Russell was motivated as I suggest is clouded by his later uncertainty over the status of logic. For example by 1937 - ten years after the second edition of Principia - Russell was saying that logic is ‘much more linguistic then I believed it to be at the time when I wrote the Principles’, and was willing to consider that the logical constants are nothing but symbols, not things expressed or denoted by symbols. He may simply have called it progress to reduce the number of primitive ideas in logic, which, in the first edition, did explicitly include ambiguous assertion. In any case the emendation announced in the second edition is clearly insufficient to make good on the idea of deduction as the movement from whole proposition to whole proposition. For substitution survives in the new system unchanged, and indeed now bears more of the burden; as it emerged earlier in connection with Frege, the real stumbling-block is substitution, not quantificational inference generally. Technically speaking, the discussion of substitution in Principia establishes almost nothing. Philosophically it is not without suggestive nuance, but from any point of view it is clearly inadequate to rest, as Russell does, with saying that to invoke it in deducing fresh theorems is nothing more than to ‘notice that they are instances’ of previous theorems, for the requisite notion of ‘instance’ is left unexplained. Without a precise statement of it there is no fact of the matter as to whether or not even the elementary portion of the system is consistent. For his part, Frege formulated a substitution rule explicitly and, from the point of view of logic as a formalism subject to interpretations, correctly in Grundgesetze. But neither Russell nor Frege had made any effort to come to terms philosophically with its centrality in logic 10 (though it is conceivable that Russell’s strange remark to the effect that substitution is somehow unstateable was philosophically motivated, not mere technical indolence). Again, the only evident path open to a Russell or a Frege, trying not to conceive of logic as being about language, would be to devise a principle of substitution as a generalization about structural relationships amongst propositions. No doubt the reasoning behind such a substitution principle can be plausibly sketched.25 What we need can be explained in terms of the logic of Principia as follows (the simplifying restriction here to oneplace propositional functions is inessential). We want to derive instances of truth-functional formulae with variables referring across their truth-functional structure. Thus suppose that F is some truth-function, a proposition F(p1...pn) is a truth functional theorem, and we want to derive (1 ...n)(x1 ... xn)[F(1x1 ....nxn)]. We can argue: For any choice of functions 1x_ ...nx_, every instance F[1a1 ... nan] of the truth-functional theorem is true, since the a’s are propositions in each case. But then every value of the propositional function F[1x_1 ... nx_n] is true, so by Russell’s definition of generality we may add the quantifiers to bind the ‘x’s, yielding the proposition (x1 ... xn)[F(1x1 ....nxn). But that was any choice of functions 1x_ ...nx_; hence by the same argument we may add the quantifiers to bind the ‘’s, which gives us the desired substitution-instance. That much is straightforward, and seems to generalize about propositions and propositional functions - rather than the symbols which express them - in the requisite way without mentioning propositions containing variables. But it is to no avail. In the first place it gives every appearance of using real variables, such as ‘a proposition F(p1...pn)’, thus assuming the unwanted thing in the attempt to eliminate it. Second, its premises will presuppose what look too much like matters of fact. That’s all right, as well as inevitable, for the philosophical, meta-theoretic enterprise known as logical theory, but not for logic itself. To speak of ‘a proposition F(p1...pn)’ , or ‘any instance F(a...)’ is like speaking of ‘any set {a, ...}’ - insupportable until we have a reassuringly explicit theory of the supposed entities, from which such loose expressions may be said to borrow their significance. The advent of such a theory would only make it clear that we were building logic atop a substantive theory of abstract entities. The needed generalization would have to presuppose a theory of the structure and constitution of propositions, something largely analogous to set theory except intensional rather than extensional. But, whereas Frege, and although less clearly Russell, could distinguish in a principled way between the laws of deduction (the truths of logic) and the theory of the composition of propositions (logical theory), the latter will now be essential to the content of the former. In expounding such a theory, one’s hand may be steadier in imposing some of the restrictions necessary for consistency, but even the most obvious restrictions are not thereby any less substantive; in principle, all the terrors of set theory and more will now be endemic to elementary logic itself, hence implied by whatever is needed for elementary reasoning (even in Russell the principles of type theory are not themselves 11 propositions of logic).26 That would be a lot even for Russell to swallow, and if you believe as Frege did that the basic principles of logic cannot coherently be thought of as being subject to doubt then it is out of the question (Russell too held this view in the Principles, but had definitely hedged it by the time of Principia). Of course, it might be pointed out that even where logic comprises the employment of formal systems and semantical generalizations about them, the validity of substitution must be established. Normally this is achieved indirectly by means of a soundness proof, which indeed one might regard as part of the argument for anything derived within the formal system. But the informal reasoning in such a proof is itself elementary - availing itself, at most, of finitary mathematical induction - and makes explicit reference to nothing more worrisome than linguistic expressions (whose structure is something rigorously stipulated, not out there to be discovered). It is at least arguable that the codification of inference does not thereby rest upon something more doubtful than what is being codified. Such is far from being the case if the codification rests upon a theory of propositions. III Formal first-order logics exist which employ no free variables, such that any interpretation of their predicates ensures that every provable formula is a true sentence. Quine’s system in Mathematical Logic is one such system; the example Tarski employs for a truth-definition of a first-order language is another, and indeed the formalism of the second edition of Principia provides materials for a third. But what must be appreciated is that the possibility of such systems provides no comfort to the thesis of the primacy of the proposition. To make this vivid, consider that decision procedures exist for the validity of a wide range of polyadic quantificational schemata. Choosing some such procedure (or set thereof), and taking the widest manageable class of such schemata as one’s object, one could declare one’s logic to consist of nothing but a specification of its formulae and its semantics, along with a single inference rule: every formula which is identified as valid by the decision procedure is a theorem. No one - and certainly no believer in the primacy of the proposition would say that a proof thereby delivered actually displays the justification for a given assertion. Clearly the justification of any such theorem would be contained in the argument for the reliability of the decision procedure. Yet the case is substantially the same, if less vividly, for systems like those just mentioned due to Quine and Tarski. Confidence in the logic is supplied not as it were from within, as Frege and Russell envisaged, but from without, by metalogical argument. The thesis of the primacy of the proposition requires that all reasoning consist in operating with complete propositions: perhaps the operations themselves cannot at all events be identified with propositions, but it would surely be satisfactory if they could be reduced, say, to nothing more momentous than modus ponens.27 Needless to say, for Quine or Tarski, avoiding open sentences as theorems does not bring a 12 requirement in its train that metalogical or metalinguistic argumentation should avoid such natural language analogues of free variables as ‘any predicate’ or ‘any sentential function’. But for Russell or Frege there must, as it were, be a level at which the buck stops: there must be a way of giving the full justification for any assertion in terms of complete propositions, all on the same level. If formalizing is like ascending a ladder, then we have to be able to draw it up behind us, if not kick it away. I have argued that the only evident way of achieving this by supplementing logic with an explicitly propositional principle of substitution - would be implausibly problematic: it carries too much weight to be passed off as an inference rule on a level with modus ponens, and it insinuates too much into logic to appear as an axiom.28 But that is to say that anyone who accepts the thesis of the Primacy of the Proposition, and thinks of the syntax of Begriffsschrift or Principia as embodying essentially the correct account of logical structure, must face the same difficulty. Some will see the positive wisdom of the Tractatus view of logic in this. The rest of us might settle with the idea that Wittgenstein the younger was right at least as far as saying that reasoning is in some sense calculating or operating with language; for in that case it would not seem ruled out that reasoning should be at least partly schematic.29 Perhaps, consistently with that, the language could be mental; what is more fundamentally missing from the propositional view of Frege or Russell might be more generally expressed as a failure to give proper place to the idea that thought is representational. University of Glasgow 13 Notes 1.PHW p. 144 (NS p. 155); for Frege’s characterizations of judgement, see PHW p. 2, p. 139, p. 185, p. 198 (NS p. 2, p. 151, p. 201, p. 214); CP pp. 164-165, p. 356 (KS pp. 150-1, pp.346-7). 2.PHW p. 3, p. 175, p. 204 (NS p. 3, p. 190, p. 220); GG §14, PMC p. 22 III/3 (WB p. 35 ix/4). 3.See PHW pp. 269-70 (NS pp. 288-9), ‘On the Scientific Justification of a Conceptual Notation’, CN pp. 83-4 (the original of this piece is reprinted in Frege 1964b); cf. PHW p. 259 (NS p. 279). 4.In particular, Frege held that in order to think a thought containing a given part, there need not be a kind of mental apprehension of that part (or of the referent of that part) which can be isolated from the thinking of whole thoughts containing it. See Foundations §§58-60. 5.See the beginning of ‘Logic in Mathematics’, PHW pp. 204-205 (NS pp. 220-1). 6.CP pp. 306-333 (KS pp. 293-317), PHW pp. 247-250 (NS pp. 266-70). 7.In the English translations Frege’s ‘Satz’ is ‘proposition’ (hence ‘uneigentlicher Satz’ is ‘pseudo-proposition’). For obvious reasons, I have avoided this. 8.At CP p. 318 (KS p. 304) Frege admits that Hilbert’s schematic derivations may at least ‘serve as the occasion’ for constructing proper proofs. Hilbert might have been happy to accept that his axiomatization was a work in pure logic; what Frege does not appreciate is that Hilbert, or at least the Hilbert of the 1920's, would nevertheless continue to speak of interpretations where Frege speaks of second-order logic. 9.CP p. 316 (KS p. 302). 10.Of course one can use words such as ‘theory’ as one likes, and the modern idea of a formal theory as precisely a syntactical apparatus subject to varying interpretations is not inconsistent with Frege’s epistemological strictures, so long as one distinguishes clearly between assertions of the theory (under a given interpretation), and metalinguistic assertions about the theory. Clearly Hilbert and Korselt at this time (1903-1906) were well on the way towards that conception, but it had not yet been realized fully (Hilbert’s proof theory was not to emerge completely before the 1920's). Frege himself was on the one hand making fun of those who would advance a ‘theory’ which doesn’t actually mean anything, and saying on the other that if symbols are the objects of a theory then the truths of that theory ought to be discoverable by means of a chemical analysis of (the tokens of) those symbols. 14 11.Other characterizations of inference, especially in his unpublished writings, reinforce this point. See especially PHW pp. 260-1 (NS 280-1). 12.Ch. 1-2, esp. §17, in which Russell observes that basic ‘principles of deduction’ cannot be subject to doubt, on pain of being unable to justify any argument. On this theme see Van Heijenoort, Ricketts, Hylton (pp. 198-204), Kemp (1995). Partly in order to maintain that the Axiom of Reducibility is a logical axiom, Russell eventually dropped the idea that axioms of logic need have any special epistemic distinction. 13.Of course, inference rules cannot themselves be cited as lines of proofs. Frege recognizes this, but he sees also that the choice of inference rules is partly open, hence that there is no one principle that must ever remain ineffable. 14.B §1. 15.PHW pp. 254-5 (NS 274-5). 16.Those letters can be instantiated to any name of an object, but Frege’s definitions ensure that p will be inferrable only when p is in fact the true, as opposed not only to the false but also as opposed to other objects. 17.A further issue is that substitution transmits only validity, and not truth generally. This fact, indeed, is difficult for Frege’s philosophy of logic to assimilate, because a substitutioninference requires that the premise be recognized not merely as true but as logically true. I believe that this will be problematic for Frege, because of the special relationship he envisages between the notions of truth and judgement. But the issue is too complex to enter into here. 18.In fact the same sort of problem arises within truth-functional logic itself: understood as generalizations about truth-values, truth-functional theorems are no longer truth-functionally complex. But these could plausibly be made amenable to inference by adopting a generalized version of modus ponens. 19.Frege does sometimes characterize the use of Roman letters as representing the idea of reasoning about an arbitrary object - e.g., ‘Suppose n is any perfect square; then ...n...’. The nature of the case perhaps makes this inevitable, but this idea finds no reflection in Frege’s philosophical explanations of generality or inference. I have thus suppressed this, since Russell takes up the point explicitly and at length. 20.Principia pp. 43-44, Problems pp. 125-130, Theory of Knowledge pp. 105-118. Also see Candlish. 21.More exactly: if, after adopting the multiple-relation theory, Russell could no longer hold precisely that ‘[every] proposition which we can understand must be composed wholly of 15 constituents with which we are acquainted’ (Problems p. 58), he still maintained, in effect, that ‘judges that’_s can be true of A only if A is acquainted with the meanings of the parts of s (where s is fully analysed, in which case s is significant only if all its parts do stand for something). Cf. Problems pp. 125-130. It may also be noted that, in Russell’s Logical Atomism, sense-data are in the requisite way independent of the mind. See Problems p. 41, and ‘The Relation of Sense-Data to Physics’, reprinted in Mysticism and Logic, pp. 108-131. 22.This account leans upon the account of Russell on generality in Hylton pp. 296-297, though departing from it slightly for the sake of simplicity. 23.Principles 56-60, 86, 88. 24.For greater illumination of this theme see Hylton, pp. 212-218 and pp. 255-6. In a wonderful letter to Jourdain, Frege suggests that this sort of idea is so egregiously incoherent that the expression ‘variable’ ought to be banned outright, since people will continue to talk as if they could be anything but mere letters (PMC VIII/13, pp. 81-84 (WB XXI/13, pp. 12933)). See also PHW pp. 234-8 (NS pp. 252-7); at p. 238 (NS p. 257) Frege makes the point about the indistinguishability of x and y, if variables were nonlinguistic entities. 25.This spells out in slightly more detail the brief discussion in §4 of Section II of the Introduction to the Second Edition of Principia (which is indeed Russell’s most explicit discussion of substitution; it deals only with the instantiation of truth-functional theorems). 26.It should be repeated that, having assumed the Axioms of Reducibility and Choice, Russell denied that logical axioms must be self-evident or peculiarly obvious (see note 12). But, to cut a long story short, it is not as if Russell could expect everyone to agree that those are principles of logic, in advance of showing that, along with principles which are so agreed upon, they suffice to derive mathematics. Russell never strayed so far from plausibility as to suppose that all reasoning - not just that needed for mathematics - assumes among its premises a substantive theory which explicitly quantifies over sets or propositions. It should also be acknowledged that Russell’s truth-functional theorems already generalize about propositions (for the truth-functions are propositional functions which take propositions as arguments); for that purpose, however, no account is needed of their structure. 27.Though erroneously, Frege supposed that Begriffsschrift employs only modus ponens as an inference rule. The situation is clouded because of Frege’s inconsistency in his explanations of Roman Letters (free variables), but if we think of the strictly first-order part of the system as a formalism in the modern sense then it employs universal generalization and substitution (Universal Instantiation is an axiom). 28.One might also worry that if propositional substitution must be an explicit axiom rather than an inference rule, then, where S is the relation of one proposition’s being a substitution instance of another, the principle would have to be of the form ‘if S(p,q) then if p is true then 16 so is q’ - in which case a truth-predicate appears in logic itself, hence consistency only at the price of yet more complexity. 29. For a recent account of inference which copes with this, see Lance. Jim Edwards is also working on a paper which suggests that, if we weaken the constraint on inference from preservation of truth to preservation of Tarski-truth, then although natural deduction systems cannot satisfy the Universalist Conception of logic, Axiomatic systems such as Frege’s might. 17