Academia.eduAcademia.edu

Cause and correlation in language change

Abstract

1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 4

CAUSE AND CORRELATION IN LANGUAGE CHANGE Malte Rosemeyer (KU Leuven) Freek Van de Velde (KU Leuven) 51st Meeting of the Societas Linguistica Europaea Tallinn, 30 August 2018 ! 1 CAUSATION IN HISTORICAL LINGUISTICS 1. INTRO 2. CLEFTING IN BP • What counts as a cause in historical change? • Consider morphosyntactic change: changes in usage contexts of a construction A might be caused by restructuring of constructional networks, for instance the rise or loss of a competing construction B • Empirical problem: correlation does not imply causation • We will present a statistical method that helps to distinguish correlation and cause in language change 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 2 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 3 INVERSE CORRELATION BETWEEN INFLECTION AND WORD ORDER 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! “(...) one of the most banal and familiar (but slippery) precepts in all of linguistics, to wit, that there is some kind of functional tradeoff between morphology and syntax. It is often said, but very difficult to prove, that word order tends to be rigid where morphology is absent, but free where morphology is relatively luxuriant.” Haiman (1985: 66-67) “What we see here is an interaction between two ways of signalling the noun-modifier relation, i.e. NP constituency – by adjacency (...), or by morphological binding, (case and/or gender/number agreement). Both principles are universal, but they can play against each other, so that when one is violated, the other becomes obligatory.” Givon (2001a: 282) “There is a correlation between rigidity of word order and absence of a morphological means of distinguishing A from O, even if it is not true that rigid word order is necessary if A and O are to be distinguished by order.” Mallinson & Blake (1981) “Turning to the relationship between word order and morphological marking, the analyzed data reveal that neither the presence of agreement nor of case marking is a sufficient condition for flexible order, nor does rigid order entail the absence of either form of morphological marking. There is nonetheless a relationship between the two phenomena, namely flexible order tends to be accompanied by the presence of overt agreement and/or case marking and lack of agreement and/or case marking tends to be accompanied by rigid or restricted order.” Siewierska (1998: 525-526) 4 WHAT CAUSED WHAT? 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! • Different suggestions in the literature: – Loss of case system caused the emergence of fixed word order (SVO): Foulet (1930: §50), von Wartburg (1965: 129), Rheinfelder (1967: 45), Harris (1978: 49), Faarlund (2001: 1718), Bentz & Christiansen (2013: 55–56) – Rise of fixed word order caused the loss of the case system Bourciez (1967: §559) – No direct causation Lerch (1934: 267–271), Schösler (1973: 251–254, 2001a: 277–278), Detges (2009) 5 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CLEFT-WH INTERROGATIVES IN BRAZILIAN PORTUGUESE (BP) • Historical increase in the usage frequency of BP cleft-wh interrogatives Lopes Rossi (1993; 1996; 1998), Duarte (1992), Fontes (2012a; 2012b), Kato & Ribeiro (2005; 2009), De Paula (2015; 2016), Rosemeyer (to appear) CAUSALITY 5. RESULTS 6. CONCLUSION! (1) a. b. Onde where Onde where é be.PRS.3.SG você you que você that you foi? go.PST.PFV.3SG foi? go.PST.PFV.3SG ExSitu Cleft 6 DECLARATIVE CLEFTS IN BP 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER • The rise in the usage frequency of cleft-wh interrogatives in turn coincided with a rise of the usage frequency and syntactic productivity of cleft sentences of the ‘that’ type Kato & Ribeiro (2009) CAUSALITY 5. RESULTS 6. CONCLUSION! a Maria (2) É be.PRS.PFV.3SG DET.F.SG Mary ‘It is Mary who arrived’ que chegou. that arrive.PST.PFV.3SG 7 WORD ORDER CHANGES IN BP 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! • Increase in ‘that’-clefts is caused by word order changes, in particular the loss of V2 Kato & Ribeiro (2009: 123) • Possible word orders in Class. Port.: Galves & Paixão Sousa (2017) (3) Christo Senhor nosso, disse a seus Discipulos Christ Master POSS.1 say.PRS.PFV.3SG to POSS.3PL disciples ‘Christ our Lord, said to his Disciples...’ (4) Mártires os chama-vam os companheiros Martirs them call-PST.IPFV.3PL DET.M.PL fellows ‘Their fellows called them martirs’ o despedaça-vam, ou lhe pun-ham o fogo (5) ou either it smash-PST.IPFV.3PL or to-it PUT.PST.IPFV.3PL DET.M.SG fire ‘(they) either smashed it or put fire to it’ 8 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! data from Galves & Paixão Sousa (2017: 157) (n=34,293) 9 POSSIBLE HYPOTHESES 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER LOSS OF V2 1 INCREASE IN 'THAT'CLEFTS 1 2 CAUSALITY 5. RESULTS 6. CONCLUSION! INCREASE IN CLEFT-whINTERROGATIVES H1: Loss of V2 -> more 'that'-clefts -> more cleft-wh interrogatives H2: Loss of V2 -> more clefting in general H3: Loss of V2 -> more cleft-wh interrogatives 10 POSSIBLE HYPOTHESES (INVERTED) 1. INTRO 2. CLEFTING IN BP 3. DATA LOSS OF V2 1 INCREASE IN 'THAT'CLEFTS 1 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! INCREASE IN CLEFT-whINTERROGATIVES H4: More 'that' clefts -> loss of V2 H5: More clefting in general -> loss of V2 H6: More cleft-wh interrogatives -> loss of V2 11 MOTIVATIONS FOR H1-3 (NON-INVERTED) 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! • Word order was used to mark information structure Galves & Paixão (2017), Galves & Gibrael (fc.) – null subject means an “unmarked” information structure – the preverbal position is reserved for “prominent elements of various kinds” --> topicalization – postverbal subjects are interpreted as “familiar” or “continuing” topics (6) Em várias partes das fronteiras fizeram os castelhanos in several parts of.the borders make.PST.PFV.3PL DET.F.PL Castilians fumo smoke ‘The Castilians made fire on several places along the border’ 12 IMPLICATIONS OF THE CHANGE: ‘THAT’-CLEFTS 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! • Loss of word order as a marker of information structure meant that speakers had to mark information structure some other way • While Romance cleft sentences are usually considered markers of contrastive focus or presentational markers, it has recently been suggested that the main function of cleft sentences is to ensure topic continuity Hedberg/Fadden (2007), Dufter (2009), Roggia (2009), De Cesare et al. (2016) • 'That'-clefts may thus have arisen as a compensation strategy for the loss of postverbal subject use in declaratives 13 IMPLICATIONS OF THE CHANGE: WH-QUESTIONS 1. INTRO 3. DATA • The function of subject postposition in wh-interrogatives in Classical Portuguese seems similar to the function in declaratives (“familiar” or “continuing” topics) 4. GRANGER (7) 2. CLEFTING IN BP CAUSALITY 5. RESULTS 6. CONCLUSION! tens? Xanto. Esopo, que escrito e esse, que ai Esopo what document be.PRS.3SG this, that there have.PRS2SG? ‘Esopo, what is this document that you are holding there?’ Esopo. É a carta da menina be.PRS.3SG the letter of.the girl ‘It’s the letter from the girl’ Xanto. Como vai ela com o ler? How go.PRS.3SG she with the read ‘How is her reading going?’ (lit. How does she go with the reading?) (Esopaida ou vida de Esopo, Antonio da Silva, 1734, t980) 14 IMPLICATIONS OF THE CHANGE: WH-QUESTIONS 1. INTRO 3. DATA • Early cleft-wh interrogatives are typically used in contexts in which the question proposition is highly accessible, leading to a discourse-continuing interpretation 4. GRANGER (8) 2. CLEFTING IN BP CAUSALITY 5. RESULTS 6. CONCLUSION! ao Sr. Francisco que aceit-o. Ana. Então diz-ei So say-PST.PFV.1SG to.the Mr. Francisco that accept.PRS.1SG ‘So I told Mr. Francisco that I accept’ Carlos. Que é que aceit-ais? What be.PRS.3SG that accept.PRS.2SG ‘What is it that you accept?’ Ana. Bast-a só que êle o saiba. suffice-PRS.2SG only that he it know.PRS.SBJ.3SG ‘Only he needs to know that’ (Comédia sem título, Martins Pena, 1848, t5978) 15 IMPLICATIONS OF THE CHANGE: WH-QUESTIONS 1. INTRO 2. CLEFTING IN BP 3. DATA • Similarly to 'that'-clefts, cleft-wh interrogatives may have arisen as a compensation strategy for the loss of postverbal subject use in wh-interrogatives 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 16 CORPUS 1. INTRO 2. CLEFTING IN BP 3. DATA • Self-compiled corpus of over 3.2 million words from European and Brazilian Portuguese theater texts between 1736 and 2016 (Rosemeyer to appear) 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 17 DATA COLLECTION 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! • Wh-interrogatives: n=21372 cases, n=1883 cleft-wh • SV word order: all instances of a personal pronoun followed by a verb (identified via regex) --> n=26501 • Declarative 'that'-clefts: all instances of present-tense 'that'clefts (é X que P) --> n=808 18 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 19 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 20 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 21 TIME SERIES ANALYSIS 1. INTRO 2. CLEFTING IN BP 3. DATA • Historical linguistics often makes use of TIME as an independent variable, in correlation studies or in multiple regression • While this gives good results, it flouts the assumption of independence of datapoints • Often high degree of ‘autocorrelation’. This leads to an underestimation of the standard error • We need techniques that take care of the autocorrelation due to consecutiveness of time measurements. • One of these techniques is Time Series Analysis (see Baayen et al. 2018 for an application to experimental data, and Van de Velde & Petré, forthc. for an application to historical linguistics). 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 22 TIME SERIES ANALYSIS 1. INTRO Two time series can be correlated 2. CLEFTING IN BP • • Correlation does not imply causation 3. DATA • How can we decide? 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! ð Granger Causality • Technique from econometrics • Using the prior values of one time series to predict values of another time series 23 TIME SERIES ANALYSIS – GRANGER CAUSALITY 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! 24 TIME SERIES ANALYSIS – GRANGER CAUSALITY 1. INTRO 2. CLEFTING IN BP 3. DATA CAUSALITY 5. RESULTS 6. CONCLUSION! Granger-causes 4. GRANGER Granger causality test Model 1: QCY.ts ~ Lags(QCY.ts, 1:1) + Lags(SVY.ts, 1:1) Model 2: QCY.ts ~ Lags(QCY.ts, 1:1) Res.Df Df F Pr(>F) 1 97 2 98 -1 15.145 0.0001826 *** Granger causality test Model 1: SVY.ts ~ Lags(SVY.ts, 1:1) + Lags(QCY.ts, 1:1) Model 2: SVY.ts ~ Lags(SVY.ts, 1:1) Pr(>F) Res.Df Df F 1 97 2 98 -1 1.5947 0.2097 25 TIME SERIES ANALYSIS – GRANGER CAUSALITY 1. INTRO 2. CLEFTING IN BP Does not Granger-cause 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! Granger causality test Model 1: DCY.ts ~ Lags(DCY.ts, 1:1) + Lags(SVY.ts, 1:1) Model 2: DCY.ts ~ Lags(DCY.ts, 1:1) Res.Df Df F Pr(>F) 1 97 2 98 -1 0.9697 0.3272 Granger causality test Model 1: SVY.ts ~ Lags(SVY.ts, 1:1) + Lags(DCY.ts, 1:1) Model 2: SVY.ts ~ Lags(SVY.ts, 1:1) Res.Df Df F Pr(>F) 1 97 2 98 -1 0.0489 0.8255 26 TIME SERIES ANALYSIS – GRANGER CAUSALITY 1. INTRO 2. CLEFTING IN BP 3. DATA 4. GRANGER CAUSALITY 5. RESULTS 6. CONCLUSION! Granger causality test Model 1: DCY.ts ~ Lags(DCY.ts, 1:1) + Lags(SVY.ts, 1:1) Model 2: DCY.ts ~ Lags(DCY.ts, 1:1) Res.Df Df F Pr(>F) 1 97 2 98 -1 0.1631 0.6872 Granger causality test Model 1: SVY.ts ~ Lags(SVY.ts, 1:1) + Lags(DCY.ts, 1:1) Model 2: SVY.ts ~ Lags(SVY.ts, 1:1) Res.Df Df F Pr(>F) 1 97 2 98 -1 0.0006 0.9806 27 CONCLUSION 1. INTRO 2. CLEFTING IN BP Summary: • • SV-order Granger-causes interrogative clefts, but not the other way around (H3) 3. DATA • SV-order does not Granger-cause declarative clefts 4. GRANGER • Declarative clefts do not Granger-cause interrogative clefts (but: this may be due to the low frequency of declarative clefts in present tense) CAUSALITY 5. RESULTS 6. CONCLUSION! Is this causation? Some caveats: • • Granger-causality does not imply strict causality. There may be deeper causal factors 28 CONCLUSION 1. INTRO 2. CLEFTING IN BP • We hypothesize the following scenario: • Clefts are a compensating strategy for loss of VS word order. But how much functional variation is there in these clefts? • Questions make more eager use of this strategy because it has stronger implications for discourse structure 5. RESULTS • Over time, the cleft-wh constituent became a co-marker of interrogative mood 6. CONCLUSION! • This ties in with other cases of compensation-by-already-existing-strategies, see Van de Velde (2014) on ‘Degeneracy’. 3. DATA 4. GRANGER CAUSALITY 29