Academia.eduAcademia.edu

Health at Birth, Parental Investments, and Academic Outcomes

2018, Journal of Labor Economics

This paper explores the relationship between health at birth and academic outcomes using administrative panel data from Chile. Twins fixed effects models estimate a persistent effect of birth weight on academic achievement while OLS and siblings fixed effects models find this relationship to decline over time. We make sense of these findings in the context of a model of human capital accumulation where parental investments respond to initial endowments. Using detailed data on parental investments, we find that investments are compensatory with regard to initial health, but that within twins, parents do not invest differentially. These findings suggest that initial health shocks significantly affect academic outcomes and that parental investments are a potential channel via which the negative impacts of health shocks can be mitigated over the long run.

Health at Birth, Parental Investments and Academic Outcomes Prashant Bharadwaj, Juan Eberhard & Christopher Neilson† December 2013 Abstract This paper explores the relationship between health at birth and academic outcomes using administrative panel data from Chile. Twins fixed effects models estimate a persistent effect of birth weight on academic achievement while OLS and siblings fixed effects models find this relationship to decline over time. We make sense of these findings in the context of a model of human capital accumulation where parental investments respond to initial endowments. Using detailed data on parental investments, we find that investments are compensatory with regard to initial health, but that within twins, parents do not invest differentially. These findings suggest that initial health shocks significantly affect academic outcomes and that parental investments are a potential channel via which the negative impacts of health shocks can be mitigated over the long run. JEL Codes: I10, I20, J20 † University of California, San Diego; University of Southern California and Universidad Adolfo Ibañez; Yale University. The authors wish to thank Danuta Rajs from the Departamento de Estadisticas e Informacion de Salud del Ministerio de Salud (MINSAL) and Francisco Lagos of the Ministry of Education (MINEDUC) of the government of Chile for facilitating joint work between government agencies that produced the data used in this study. Finally, the authors also wish to thank Joseph Altonji, Michael Boozer, Ryan Cooper, Julie Cullen, Adrian de la Garza, Adam Kapor, Bhash Mazumder, Karthik Muralidharan, Seth Zimmerman, and seminar participants at Yale, ITAM, Oxford, Berkeley and the Chicago Federal Reserve for their comments and suggestions. The authors have no relevant or financial interests related to this project to disclose. Previous versions of this paper were circulated beginning in July 2010 under the title: Do initial endowments matter only initially? Birth Weight, Parental Investments and Academic Achievement in School 1 Introduction Recent empirical work has shown evidence that initial health endowments are important determinants of later life labor market and cognitive outcomes (Almond and Currie 2011b). However there is much less evidence on the relationship between initial health endowments and school outcomes, the evolution of this relationship during early childhood, and how investments in human capital adjust in response to these endowments. We contribute to this literature by examining the relationship between health at birth (as measured by birth weight), subsequent parental investments, and academic outcomes from childhood to early adolescence using administrative data covering the entire student population of Chile. This empirical evidence is important as it sheds light on the mechanisms through which initial health affects later life labor market outcomes (Black, Devereux, and Salvanes 2007). We use administrative data from Chile to link birth records of children born between 1992 and 2002 to their academic records between 2002 and 2012. This panel data set follows cohorts of students from first grade through high school and college entrance exams. In addition to this unique linkage of records, the data allows for the estimation of models with rich heterogeneity as well as models with siblings and twins estimators which have been used in the literature to account for unobserved characteristics affecting both birth weight and the outcome of interest. We supplement this large dataset of birth records and school achievement with data on parental investments recorded at the individual child level, from both parent and child reports. We use this data to examine whether parental investments systematically vary by birth weight, and in particular, whether parents differentially invest within twin pairs. We find that birth weight significantly affects academic outcomes throughout the 2 schooling years. Our estimates that include twins fixed effects (the standard in this literature for estimating causal impacts) suggest that in first grade, a 10% increase in birth weight increases outcomes in math and language scores by 0.04-0.06 standard deviations. We find this result to be stable from first grade through to middle and high school, and even for college entrance exams. This implies a persistent effect of birth weight among twins that is seemingly not undone (or exacerbated) by the behavioral responses of parents and teachers. The effect of being born low birth weight (less than 2500 grams) or very low birth weight (less than 1500 grams) is greater, a decrease of around 0.1-0.2 standard deviations, suggesting non-linearities in the birth weight-academic outcomes relationship. To put the magnitude of our results in perspective, consider that recent examples of large-scale interventions in education in developing countries show increases in test scores between 0.17 SD to 0.47 SD (Duflo and Hanna 2005, Muralidharan and Sundararaman 2009, Banerjee, Cole, Duflo, and Linden 2007). These results contrast with siblings fixed effects and OLS estimators which show a steady decline in the effect of birth weight on test scores. However, the decline is less among siblings who are closer together in age than among siblings who are further apart. Using detailed data on parental investments, we find that education-related investments are negatively correlated with birth weight; i.e. parents invest more via time spent reading, time spent helping out with home work etc, in children with lower birth weight. We find that within twins however, parental investments are not systematically correlated with birth weight, which is the assumption typically made when using twins fixed effects estimators. We present a model of human capital accumulation and parental investments to rationalize the empirical results described above. This model suggests that over time, depending on parental preferences (whether parents compensate or reinforce initial conditions), 3 test score differences within sibling or twin pairs will converge or diverge over time. To this fairly standard model of academic achievement, we add a dimension of public goods in parental investments within the household to explain the differences we observe when using twins and sibling fixed effects. The main intuitive insight of the model is that if there are public goods within the household with regards to parental investments, then test score differences will converge or diverge less over time, compared to a case with no public goods in investments. We argue that in the case of twins the role of public goods in investments could be large (if a parent reads to one twin, it is difficult to actively prohibit the other twin from listening in) implying that even if parents wish to invest differentially, they are unable to do so. Hence, the model would predict that over time, twins fixed effects estimates diverge or converge less than OLS and in this way the twins estimates bring us closer to the causal effect of birth weight over time. We emphasize that the time component is critical to our model and results, as twins fixed effects and OLS differences at any given point in time (in cross sectional data) can be explained by things such as measurement error. This paper bridges a gap in the literature investigating the lasting role of initial endowments, in particular initial health endowments. By examining repeated educational performance outcomes for children between the ages of 6-18, we are able to provide a more complete picture of how initial health affects human capital accumulation, which in turn is a potential mechanism for explaining adult labor market outcomes. Papers by Black, Devereux, and Salvanes (2007), Torche and Echevarrı́a (2011) and Oreopoulos, Stabile, and Walld (2008) look at long term cognitive outcomes in their analysis of the impact of birth weight using twins and sibling estimators. However these papers do not have repeated observations on cognitive achievement to study how the health endowment effect evolves over time. This paper also adds to the literature on parental investments and initial endowments 4 (Aizer and Cunha 2010, Rosenzweig and Zhang 2009, Ashenfelter and Rouse 1998, Adhvaryu and Nyshadham 2012). Like Loughran, Datar, and Kilburn (2004) and others, we use birth weight as a summary measure of initial endowments. We find that parental investments are negatively correlated with birth weight, which viewed through the lens of our model would explain the difference between the sibling fixed effects, OLS, and twins fixed effects estimates. Additionally, this paper partially addresses an important assumption used in many twins based studies. Most twins papers that examine the role of birth weight on long term outcomes, have to assume that parental investments are not related to individual birth weight. We find that while parents in general invest more in lower birth weight children, they do not differentiate based on birth weight within twins. A recent related paper by Figlio, Guryan, Karbownik, and Roth (2013) finds similar persistent effects of birth weight on test scores using data on twins from Florida in elementary through middle school years. We view these two papers as jointly providing a more complete picture of the role of early childhood endowments in determining school outcomes. Their paper focuses on understanding whether the birth weight effect varies by socio-economic background and by school quality. Using twins fixed effects their findings suggest that the effect of initial differences in birthweight is not undone in the long run. While our twins estimates would suggest a similar conclusion, we are able to push on the role of parental investments. Our theoretical model and direct data on parental investments in conjunction with a close comparison of OLS, siblings fixed effects and twins fixed effects estimates suggest that parental investments might have the ability to reduce initial health inequalities among the general population. Arguably, our results can be explained by other unobserved characteristics that drive the differences between OLS, twins and sibling fixed effects models. We therefore like to think of our results on parental investments as a starting point for thinking about the dynamics of early childhood health and 5 its interaction with investments and intra household resource allocation. This approach is important as it highlights that some of the inequalities at birth can potentially be undone through the efforts made by parents and possibly public policies aimed at investing in the health and human capital of children. Providing a framework and empirical evidence for understanding the differences between OLS and twin/sibling fixed effects estimates is a key contribution of this paper. 2 Medical Background 2.1 Birth Weight and Cognitive Development Medical research suggests a few pathways by which birth weight and the incidence of low birth weight affects cognitive development. Hack, Klein, and Taylor (1995) suggest an association between brain damage and low birth weight, leading to poorer performance by low birth weight children on tests. The extent of brain damage and lesions associated with low birth weight can be as severe as resulting in extreme forms of cerebral palsy. Another pathway that is highlighted in Lewis and Bendersky (1989) is that of intraventricular hemorrhage (IVH, or bleeding into the brain’s ventricular system). However IVH is often thought to be due to shorter gestational periods, and therefore less likely to be the mechanism in the case of twins (Annibale and Hill 2008). Using detailed MRI data from very low birth weight and normal birth weight babies, Abernethy, Palaniappan, and Cooke (2002) suggest that learning disabilities might be related to the growth of certain key brain structures like the caudate nuclei (pertaining to learning and memory) and the hippocampus. Hence, it appears from our reading of a sampling of the medical literature that low birth weight is correlated with developmental problems of the brain, which might lead to lower 6 to cognitive ability later in life. Figure 1 shows the distribution of birth weight for the population and for twins. Figure 1: Distribution of Birth Weight Percent of Total Births 1992-2002 0.07 0.06 0.05 0.04 0.03 0.02 0.01 0 0 1000 2000 3000 5000 4000 Birth Weight (grams) 6000 7000 This histogram shows all live births in Chile between 1992 and 2002 and also only twin births. The two vertical lines indicate the 1st and 99th percentile of the distribution respectively. 2.2 Why do twins differ in birth weight? Empirical estimation strategies that use twins fixed effects identify the relationship between birth weight and outcomes using the variation of birth weight between twins. This makes it important to understand why these differences arise. In this section we capitalize on the excellent reviews of the medical literature regarding why differences in birth weight arise within twin pairs provided in Almond, Chay, and Lee (2005) and Black, Devereux, and Salvanes (2007), and summarize their arguments. Figure 2 shows the density of birth 7 weight differentials within twin pairs in our sample of twins. The average birth weight differential is around 175-200 grams. The main reason why birth weight differentials arise within twins is due to IUGR (intrauterine growth retardation).1 The leading reason for differential fetal growth is nutritional intake and in the case where two placentae are present, nutritional differences can arise due to position in the womb. Among monozygotic twins (which most often share a placenta), the placement of the umbilical chord affects nutritional intake. For details and references on the subject, we refer the reader to footnote 13 in Almond, Chay, and Lee (2005). Figure 2 shows the distribution of birth weight differences within twins for our sample. Figure 2: Histogram of Birth Weight Differentials among Twins 1600 Number of twin pairs 1400 1200 1000 800 600 400 200 0 0 200 400 600 800 1000 1200 1400 1600 1800 2000 ∆ grams Note : This histogram shows the distribution of birth weight differentials among twins born in Chile between 1992 and 2002. 1 The other common reason for low birth weight is gestational age, however, gestational age is identical for twins, hence, the birth weight differentials must arise from fetal growth factors. 8 3 Data The data used in this paper is largely similar to the data used for the Chile specific analysis in Bharadwaj, Loken and Neilson (2013). While what follows is a brief summary, we refer the interested reader to the Online Appendix in Bharadwaj, Loken and Neilson (2013) for details on merge rates and attrition across the various data sets used. 3.1 Birth Data The data on birth weight and background information on parents come from a dataset provided by the Health Ministry of the government of Chile. This dataset includes information on all children born between 1992-2002. It provides data on the sex, birth weight, length, weeks of gestation as well as demographic information on parents such as the age, education and occupational status. In addition, the dataset provides a variable describing the type of birth (single or multiple). Twins and siblings are identified by using a motherspecific ID made available for our purposes. Unfortunately. the data does not provide information on zygosity of the twins. 3.2 Education Data The data on school achievement comes from the SIMCE and RECH database that consists of administrative data on the grades and test scores of every student in the country between 2002 and 2008. This database was provided by the Ministry of Education of Chile (MINEDUC). 9 3.2.1 RECH The RECH is the Registro de Estudiantes de Chile (the student registry). This database consists of the grades by subject of each student in a given year and is a census of the entire student population. This database provides the information on the educational results of twins broken up by subjects and allows the construction of the ranking and level measures of academic success at the school/class/grade level. For our purposes, we standardize the grades at the classroom level for each student. While these are classroom grades, we note that performance in the classroom as captured by these grades is highly correlated with performance on national exams such as the SIMCE and PSU. 3.2.2 SIMCE SIMCE tests and surveys began to be used in Chile in 1988 as a way of providing information to parents on the quality of schools. This is important in the Chilean context as the education system is compromised of a large private and voucher school system. The tests are administered to all children in a given grade. Between 1988 to 2005, the test alternated between 4th, 8th and 10th grades. Since 2006, the test is administered to 4th grade every year and alternates between 8th and 10th grade every other year. The total number of children varies between 250,000 and 280,000 across approximately 8000 schools. The response rate to the test is generally over 95%. The SIMCE test covers three main subjects: Mathematics, Science and Language Arts. The education data sets were subsequently matched to the birth data using individual level identifiers. Since we observe grades for all students who take the test in a given year, we standardize the SIMCE scores at the national level. 10 3.2.3 PSU The PSU or Prueba de Selecion Universitaria test is the college entrance exam and is the main criteria used in determining admission to the higher education system in Chile. The data included in this study covers both Mathematics and Language. The test is voluntary but required for most forms of financial aid and for the current years includes the majority of graduating seniors. The test is standardized each year. For more information on the PSU and college admissions in Chile see Hastings, Neilson, and Zimmerman (2013). 3.3 Parental Investments Data The SIMCE test is also accompanied by two surveys, one to parents and one to teachers. The survey to parents include questions about household income and other demographics. The parent survey has a response rate above 80% and is a large endeavor that requires visiting even the most remote schools in the northern and southern regions of the country and substantial efforts are made to evaluate all schools, both private and public. The parent survey covers questions about the demographics of the household as well as the parents’ opinion of the school and the teacher. In some years the survey covered specific questions regarding parental investments. These years are 2002 and 2007. In 2009, the latest year available, SIMCE surveyed not only the parents but also the students. This allowed students to give their opinions regarding their perceptions of school in many dimensions. One component of the survey asked about the help they received from their parents and how they perceived their parents’ role in their education. We use this data in conjunction with the data on parental investments. 11 4 Economic Framework The economic framework described in what follows shares some of the ideas presented in prior work such as Heckman (2007), Almond and Currie (2011a), Conti, Heckman, Yi, and Zhang (2010) and Todd and Wolpin (2007). We begin by specifying a general production function for cognitive achievement that takes into account the history of parental investments; we then specify the intra-household allocation problem and the effects of parental investments on the evolution of test score differences across children when investments have a public good component. We finally derive some testable empirical implications of the model. An important caveat to the model is that we suppress forces other than parental investments in charting out the evolution of school achievement. For example, inputs by teachers and the history of teacher inputs could be just as important as parental inputs. However, not only do we lack such data on other sources of investments in children, our model quickly loses tractability if we were to include say the behaviors of teachers with respect to individual children within the classroom. Hence, while we think our model provides an interesting way to interpret the results, we wish to emphasize that this interpretation is not unique; i.e. other ways of rationalizing the data using different sources of investments might be possible. Given the long history of understanding intra household resource allocation and the importance of parental investments in the development of children, we consider our framework a relevant starting point. 4.1 Model of Human Capital Accumulation We begin by defining production functions for test scores, investments and endowments: 12 Tijg = T (Xijg , θijg ) (1) Xijg = f (θijg , θi ′ jg ) (2) θijg = f (θij( g−1) , Xij( g−1) ) (3) Where Tijg is the achievement in school by student i born to mother j at grade g. Xijg is the vector of all inputs applied in that grade, and θijg is the child’s cognitive endowment. In this framework, the cognitive endowment θijg includes health endowments. Child i has a twin denoted by i ′ . This framework allows us to study the effect of an initial shock eij0 (for example being born low birth weight) on future school achievement. We assume that this initial shock only has a direct effect on the initial cognitive endowment θij0 . Therefore, the effect of a shock in period 0 on test scores at grade g is dTijg ∂T ∂Xijg ∂θijg ∂θij0 ∂T ∂θijg ∂θij0 = · · · + · · deij0 ∂Xijg ∂θijg ∂θij0 ∂eij0 ∂θijg ∂θij0 ∂eij0 ∂Xi ′ jg ∂θijg ∂θij0 dTi ′ jg ∂T · = · · deij0 ∂Xi ′ jg ∂θijg ∂θij0 ∂eij0 (4) (5) We interpret the first term of equation (4) as a resource reallocation effect (this shows that an initial shock to one sibling affects the investments in the other sibling via the intrahousehold reallocation of resources), and the second term as a cognitive-biological effect. We take the cognitive-biological effect as negative (being born with deficiencies leads to poorer test scores). However, the sign of the resource reallocation effect is less clear with13 out a deeper understanding of parental preferences. The reallocation effect will depend on whether parents want to equalize achievement across siblings (“compensating behavior”), or whether parents want to invest more on the child with higher returns (“reinforcing behavior”).2 If parents compensate (lower endowment child gets greater investments), then the effect of the negative biological shock in equation (4) is muted by the positive sign on the reallocation component. Twins effects in this context are used under the assumption that within twins differences “net out” the resource allocation component. In this case, it would amount to differencing equation (4) and (5); what we are left with if we assume lack of resource allocation behavior in the twins case is the just the biological effect of the shock. Herein lies the logic of interpreting differences in OLS and twins fixed effects in our setting: if we believe that twins fixed effects net out the resource allocation effect, then the difference between OLS and twins estimates is the component that is due to resource allocation. With some structure, we can discern important aspects about parental investment behavior by examining whether OLS is larger or smaller than twins estimates. 4.2 Model of Parental Investments To gain a deeper insight into what factors affect the sign of the resource allocation term, we derive optimal parental inputs from a model where parents at each time t, maximize household utility that depends on the test scores of the two children in the house (T1jg and T2jg′ ). Child 1 is in grade g, and child 2 is in grade g′ . Parents use the technology described in equations (8-3) for each child. We define the total investment constraint that parents face 2A good discussion about parents’ strategic reactions and a survey of the empirical evidence supporting both hypothesis can be found in Almond and Mazumder (2013). 14 as TE .3 Formally, max X1jg ,X2jg′ s.t. U (T1jg , T2jg′ ) (6) equations (8) through (3) X1jg + X2jg′ ≤ TE We follow Behrman, Pollak, and Taubman (1982) and use a CES functional form to describe the household’s utility function:  ρ U (T1jg , T2jg′ ) = (T1jg ) + (T2jg′ ) ρ  ρ1 (7) ρ in this case governs what Behrman, Pollak, and Taubman (1982) call “inequality aversion”. This implies that depending on ρ, parents either behave in ways that allocate more investments to the child with the higher returns, or they are “inequality averse” and invest in the child with lower returns in a bid to lower test score gaps. To see this more clearly, we also assume that test scores for each child i are produced according to the following technology: γ 1− γ Tijg = θijg Xijg (8) This implies that households maximize the following utility function 3 We provide a simple model of parental time allocation across educational and non-educational inputs in the Appendix. We can consider TE as the optimal time allocation for educational activities as emerging from this utility maximization problem. 15 max X1jg ,X2jg′  γρ 1− γ θ1jg (X1jg )ρ γρ 1− γ ρ + θ2jg′ (X2jg ′ )  ρ1 (9) Equation (9) shows that each child’s cognitive endowments act as loading factors in the CES utility function. Large positive ρ would suggest that the parents should invest more in the child with better endowments to raise their utility. However, parents may have aversion for inequality, captured by a small, or negative ρ. When ρ → −∞ households invest in order to equalize test scores across siblings. Hence, ρ is the fundamental parameter governing whether parents invest more in the child with lower endowments or whether they invest more in the child with better endowments. For any ρ, the optimal allocations are X1jg = TE  1+  θ2jg′ θ1jg 1+  θ2jg′ θ1jg X2jg′ = TE  γρ ( 1− γ ) ρ −1 γρ ( 1− γ ) ρ −1 16 θ2jg′ θ1jg ! γρ ( 1− γ ) ρ −1 (10) (11) Figure 3: Optimal Investment Time X 0.22 Sibling 1 Sibling 2 Optimal X 0.21 0.2 0.19 0.18 0.17 0.16 -0.8 -0.6 -0.4 -0.2 0 ρ 0.2 0.4 0.6 0.8 Note : This figure displays optimal allocations, for different values of ρ at a specific point in time t. In this case we assume that sibling 1 has a higher cognitive endowment at time t. Figure 3 displays optimal allocations, for different values of ρ at a specific point in time t. In this case we assume that sibling 1 has a higher cognitive endowment at time t. It is clear that when ρ ≥ 0, parents invest in a way that would be considered “reinforcing”; they act in compensating ways when ρ < 0. We extend this framework to multiple time periods to study how test score gaps within siblings evolve over time. According to the original description presented in equations (8) to (3), cognitive endowment evolves endogenously. We adopt a rather general structure for this evolution (investments and endowments can be imperfect substitutes or compliments) : θijg = β θ θij( g−1) + β X Xij( g−1) + β Xθ θij( g−1) Xij( g−1) (12) β θ captures depreciation of the cognitive endowment over time and is hence between 0 and 1; educational inputs increase the cognitive endowment through a cognitive accu17 mulation production function implying that β X is positive;4 and finally β Xθ captures the complementarity of the investment. If β Xθ = 0 cognitive endowment and parental investment are perfect substitutes. Our intuitive theoretical results are preserved without any specific assumption on the sign of β Xθ , as long as β Xθ is small.5 It is important to notice that parental investment plays a dual role in determining test scores. First, as described in equation (8), parental investments directly impact test scores. We discussed that parents can use this investment to either compensate or amplify the differences across siblings. Second, parental investment may also affect the evolution of cognitive endowment, as described in equation (12). It is difficult to test the indirect effect of parental investment on cognitive abilities, therefore, we assume a very flexible theoretical approach, which allows us to obtain our main result under a broad range of assumptions regarding this indirect effect. In particular, parental investment can be complements or substitutes with cognitive endowments when we study the evolution of cognitive abilities. 4.3 Public Good Dimension of Parental Investment Up to this point, we have described the household problem assuming that parents can completely differentiate the educational input dedicated to each child. However, parental investment may have a public good dimension, or spillover effects across siblings. For instance, parents may read books to both children, or they may simultaneously help them with their homework. The fundamental assumption for our model with public 4 We also explore a more general function , in which Xij( g−1) enters with an exponent, similar to a standard Ben-Porath human capital accumulation function. The main results are preserved. 5 Our empirical application, however, has to make several simplifications to this rather general structure. We discuss these in the following section. 18 goods is that when siblings are close in age, we expect the degree of spillover to be greater. Therefore, under certain conditions, it can be difficult for parents to invest differentially across children. Twins are an extreme example of this issue, in the sense that they are of the same age and, if they attend the same school and classroom (85% of twins in our sample are observed in the exact same classroom for example), their homework and other educational needs are probably very similar. For these reasons, we conjecture that it might be difficult for parents to differentially invest across siblings when they are very close together in age. To formalize public goods in parental investments, we use a loading function δ(1, 2) (taking values between 0 and 1) which is larger when sibling age difference is smaller. For example, δ(1, 2) = C(Age difference in siblings) where C is some constant between 0 and 1 would be a candidate loading function.6 This loading function captures the degree of public good dimension of parental investment. If δ(1, 2) is zero, parental investments have no public good dimension, and we return to the problem described in equation (6). The bigger δ(1, 2) is, the more important is the public good dimension in the provision of parental investment. Thus, the effective parental investment in child 1, for example, X̂ is described by X̂1jg = X1jg + δ(1, 2)X2jg′ (13) where X1jg is the optimal parental investment, coming from the problem without public goods in parental investments (6). Note that as far as parents are concerned, a public good dimension in X increases the effective time endowment available for educational 6 C=0.8 in our simulations. 19 activities. T̂E = X̂1jg + X̂2jg = (1 + δ(1, 2))(X1jg + X2jg ) = (1 + δ(1, 2))TE (14) We can derive time endowments from a “first stage” where parents decide between educational and non-educational inputs in the Appendix. Under certain conditions specified in the Appendix, we show that the total time allocation for educational inputs reduces as the public good dimension in educational investment increases. In the case of twins the total time allocation component does not matter for our overall results as the public good dimension simply results in equal investments across both twins. We assume that parents are aware of the public good dimension and solve the following within sibling allocation problem:7 max X1jg ,X2jg′ s.t.  γρ 1− γ γρ 1− γ θ1jg (X̂1jg )ρ + θ2jg′ (X̂2jg′ )ρ X̂1jg = X1jg + δ(1, 2)X2jg′  1ρ (15) X̂2jg′ = X2jg′ + δ(1, 2)X1jg X̂1jg + X̂2jg′ ≤ T̂E Defining TE∗∗ = T̂E , 1+ δ(1,2) the new optimal allocations are 7 Parents solving for the effective parental investment or just the parental investment, but knowing the nature of the public good feature leads to the same solution. 20 X1jg TE∗∗ =  γρ i  h θ2jg′ (1−γ ) ρ−1 (1 − δ(1, 2)) 1 + θ " θ2jg′ θ1jg ! γρ ( 1− γ ) ρ −1 − δ(1, 2) # (16) 1jg X2jg′ = TE∗∗ h (1 − δ(1, 2)) 1 +  θ2jg′ θ1jg  γρ ( 1− γ ) ρ −1 i " 1 − δ(1, 2) θ2jg′ θ1jg ! γρ ( 1− γ ) ρ −1 # (17) For the specific case of twins, where δ(1, 2) = 1, effective parental investment is equal across twins. This is because allocations are not defined for δ(1, 2) = 1 (i.e. the case with no age gap between siblings, which is the twins case), as in that case the problem has infinite solutions for X1jg and X2jg′ . However, parents know that any feasible solution in this case implies equal effective parental investment among twins. Hence, for simplicity we assume that the solution for twins X1jg = X2jg′ . In this case, parents may try to differentiate across twins, but the public good dimension of their investment counters any strategic behavior (either mitigating or reinforcing). Consequently parents simply invest the same amount for each twin. Test scores are a function of initial conditions and the history of educational inputs; hence, in the twins case, initial conditions differ, but educational inputs are the same for both children at any time. The history dependent feature of test scores implies that the relative importance of initial endowment diminishes over time. Therefore, the model implies a slight decrease in the test score gap.8 Siblings offer additional insight about the underlying strategic behavior of parents. In this case, we are able to deduce the evolution of the test score gap between siblings, because δ(1, 2) < 1. Moreover, using variation across families in age differences, we can assess whether the public good dimension decreases with 8 The rate of convergency depends mainly on γ. 21 increasing age difference. We can now graph the evolution of test scores using the structure on optimal parental investments, test score production and endowment evolution for different parameters values of ρ with and without the public good aspect.9 Figure 4 displays the evolution of the gap in test scores (Test score child 1 - Test score child 2), over time, for both different values of ρ. Figure 4: Evolution of Test Scores with Public Good in Parental Investment 0.7 t1 t3 t5 t7 0.6 t1 t3 t5 t7 0.65 TS1 − TS2 TS1 − TS2 0.65 0.55 0.5 0.45 0.4 0.6 0.55 0.5 0.45 0.4 0.35 -1 -0.5 0 ρ 0.5 0.35 1 -0.8-0.6-0.4-0.2 0 0.2 0.4 0.6 0.8 ρ Note : This figure displays how differences in achievement change over time under different assumptions about parental preferences ρ. The right panel assumes parental investment is a public good among twins and on the left panel there is no public good aspect to parental investments. It can be seen in this simulation that differences are muted and change less over time in the presence of public goods. In the left panel we can see that for high values of ρ, the solid black line representing period 1 is below the gray line representing period 3, which in turns is below the dashed blue line representing period 5, and so on. This sequence means that the gap is increasing over time. The original gap is positive because child 1 has higher initial cognitive endowment that child 2. This is the graphical representation of the effect of a reinforcing parental behavior on the dynamics of the gap in test scores. We observe the exactly op9 Other parameters and details of how we create these graphs are presented in the Appendix. 22 posite evolution when ρ < 0. In this case, the gap diminishes over time, as a reaction of the parents’ compensating efforts. Note from this graph that the switch from divergence over time to convergence over time in test score gaps does not occur precisely around ρ=0. This means that just observing whether test scores diverge or converge over time is not enough to discern whether parents want to compensate or reinforce. However, combined with knowledge of the correlation between investments and endowments, we can make an informed guess of whether parents compensate or reinforce initial endowments. The right panel of Figure 4 shows the test score evolution in the presence of some public goods in parental investments (δ = 0.7),10 Y axis scales are purposely kept the same as in Figure 4 to show how the evolution in differences is muted with a higher δ. Hence, the public good dimension diminishes the effectiveness of parental investments in either compensating or reinforcing initial differences. The implications of our model for the evolution of test scores over time can be summarized by a fairly intuitive proposition and two corollaries: Proposition 1 If compensating (reinforcing) parents can fully differentiate the educational inputs allocated to each child, the test score gap between siblings will decrease (increase) over time. If there is only partial parental investment differentiation, the test score gap may decrease (increase), but this decrease (increase) will be less than in the case of full differentiation. Proof 1 Please see Appendix A. Corollary 1 The public good dimension of parental investment implies partial differentiation across children. Thus, the compensating (reinforcing) behavior will take longer to reduce (increase) the test 10 In our calibrations, δ = 0.7 corresponds to an age difference of 1.5 years. 23 score gap, than in the absence of public good dimension. Proof 2 Please see Appendix A. Corollary 2 For twins, in the presence of public goods in parental investments, the test score gap will be quite stable over time.11 Proof 3 In this case the actual (effective) parental investment is equal across twins. Over time, the only change in test score gap comes from the evolution of the cognitive endowment. In particular, when β θ < 1, the depreciation of the initial endowment will imply a convergence of test scores over time. 5 Estimation In what we have shown above, cognitive endowment and parental investments are a function of past inputs and endowments. Therefore, an alternative specification for equation (8) is Tijg = T (Xijg , Xij( g−1) , . . . , Xij0 , θij0 ) (18) Hence, school achievement on grade g depends on the initial cognitive endowment and the whole history of educational inputs. In other words, test score is a cumulative function of past inputs, and endowment at birth θij0 . From here on, we call θij0 birth weight, which is our measured initial endowment. Such a production function is very similar to the one 11 If β θ = 1, and β Xθ = 0 the test score gap will be exactly constant. 24 used by Todd and Wolpin (2007). A linear, estimable version of (18) is: Tijg = λ g θij0 + β 1 Xijg + β 2 Xijg−1 + . . . + β t Xij1 + ǫijg (19) Where T is the test outcome measured with error ǫ, and X is a vector of educational inputs up to grade g. In this exercise, we are interested in estimating λ g and estimating (19) would require detailed parental input histories. We do not have data on the entire sequence of parental inputs (the data we have from the SIMCE surveys measures investments in 4th grade only), hence, the X’s will form part of the error term in the estimating equation. As the model in the above section suggests, parental investments are likely correlated with birth weight and hence, in OLS, λ g will be measured with bias, where the direction of bias is governed by the covariance between inputs and endowments, and as per the model in section 4, by the degree of parental inequality aversion. We estimate OLS for the entire sample in figures, but also focus on the sample that shares a common support with twins between 700-3000 grams (3000 grams represents the 90th percentile of the twin birth weight distribution). Since twins are significantly smaller than the rest of the population, valid comparisons for our purposes are only derived by focussing on singletons on the same birth weight support as twins. 25 5.1 Twins Fixed Effects Before we write down the twins fixed effects estimator, it is useful to rewrite equation 19, with a new error term that captures all the unobservables: Tijg = λ g θij0 + β 1 Xijg + β 2 Xijg−1 + . . . + β t Xij1 + ǫijg {z } | (20) u ijg A twins estimator is particularly useful in estimating λ g from equation (20). As a twins fixed effects estimator essentially differences equation (20) within twins, it would difference out observable and unobservable time invariant family level components (while we have not modeled these variables like parental education explicitly, we believe that they would play a role in the bias that exists in OLS) since these are shared within twin pairs. Calling the other twin i ′ , a twins estimate of equation (20) results in: Tijg − Ti ′ jg = λ g ( BWij − BWi ′ j ) (21) + β1 (Xijg − Xi ′ jg ) + . . . + β t (Xij1 − Xi ′ j1 ) + ǫijg − ǫi ′ jg {z } | (22) u ijg − u i′ jg The model in the previous section would suggest that rather than assuming that parental investments are the same within twins, one way to think of why they might effectively be the same even when parents wish to invest differentially based on birth weight is due to public goods in the parental investment component. Under the conditions of our model in the previous section, if there are perfect spillovers within twins, then the effective parental investment is the same within twins and equation (22) will result in consistent estimation of λ g . In what follows, we estimate equation (22) for first through eighth grade for math and language classroom grades, fourth, eighth and tenth grade SIMCE test scores in math 26 and language and for the college entrance exam, also for math and language. We wish to note an important caveat at this point. Twins fixed effects are useful in estimating λ g only if there are no heterogenous returns to birth weight by parental investment. Empirically, this implies that we cannot have interaction terms between investments and birth weight in equation (19). While the model we presented in section 2 was quite general, the specific empirical application uses stricter functional form assumptions on the production of test scores and the evolution of the endowment. This is however essential to keep the empirical component tractable and meaningful, but we are aware that this is indeed a (perhaps drastic) simplification of reality. 5.2 Siblings Fixed Effects A siblings fixed effects estimator is similar in spirit to the twins fixed effects estimator, with the difference that we expect a “greater” bias if we believe the lesser degree of public goods in parental investment within siblings as per the model in section 2 and proposition 1. For siblings (i and i ′ ) who are observed in grade g, we estimate a siblings fixed estimator of the form: Tijg − Ti ′ jg = λ g ( BWij − BWi ′ j ) (23) + β1 (Xijg − Xi ′ jg ) + . . . + β t (Xij1 − Xi ′ j1 ) + ǫijt − ǫi ′ jg {z } | (24) u ijt − u i′ jg We estimate equation (24) for siblings varying in age difference from 1-5 years. Data limitations do not allow us to estimate this equation for very large age differences. 27 6 Results 6.1 Nonparametric and OLS Results Figure 5 shows the relationship between academic achievement in math and language and birth weight in first and eighth grade. The relationship between birth weight and both math and language achievement is remarkably linear and upward sloping up until approximately 3300 grs (which is approximately the average birth weight) , with higher birth weight babies doing better in both measures. 28 Figure 5: Standardized grades and Birth Weight Grade 1 - Math Grade 8 - Math 0.15 0.15 0.1 0.1 0.05 0.05 0 0 -0.05 -0.05 -0.1 -0.1 2000 2500 30003500 4000 4500 Grade 1 - Language 2000 2500 3000 3500 4000 4500 Grade 8 - Language 0.15 0.15 0.1 0.1 0.05 0.05 0 0 -0.05 -0.05 -0.1 -0.1 2000 2500 3000 3500 4000 2000 2500 3000 3500 4000 Note : This graph shows the relationship between birth weight and achievement in math (top panel) and language arts (bottom panel) for students born from 1992-2002 in Chile. The grades have been standardized at the classroom level. The black solid line represents a local second order polynomial regression. The dots represent a moving average with a centered window width of 30 grams. Further exploration of this relationship via regressions confirms that this correlation is robust to the addition of various controls. The regressions estimated in Table 1 shows the OLS coefficient for the birth weight effect at each grade on standardized math grades for 29 various samples of the data using a specification similar to that in equation (19), with the exception that we do not have controls for the history of parental inputs. Moreover, since twins are quite different from the rest of the population, we wanted to focus our attention to siblings and singletons with the same birth weight support which is between 0-3000 grams. As is evident from Figure (5), most of the effects of birth weight on the outcome of interest is observed within this support. Row 1 shows λ g estimated for the sample that shares the same birth weight support as the twins sample. In all OLS specifications, we control for gestational age, mother’s education, mother’s age at birth and sex of the child. The second row shows the same specifications but restricting the sample to just the twins sample. Across all rows, the results appear fairly similar and the main pattern among the coefficients is the decline in the birth weight effect in later grades. In first grade, the effect of birth weight appears to be around 0.35-0.4 SD and by eighth grade the birth weight effect declines to 0.2 SD. Examining test scores in fourth, eighth and tenth grades we find similar results. The OLS regression coefficient declines over time in each case. 6.1.1 Heterogeneity We also examine whether the OLS relationship between birth weight and standardized math grades varies by observable characteristics of the mother. The following graphs show that students with mothers with college education preform better than those of mothers with lower education levels but that the positive relationship between birth weight and academic achievement is similar in both groups in 1st grade. It can also be seen that over time, this relationship diminishes in strength for both groups with lower birth weight children raising their relative performance. The results from this section show that the simple 30 correlation between initial health endowment and academic outcomes is quite significant but that this relationship seems to weaken over time. 31 Figure 6: Standardized Math and Language grades and Birth Weight by Mothers Education College Mother - Math Less Than High School Mother - Math 0.3 0.3 0.2 0.2 0.1 0.1 0 0 -0.1 -0.1 -0.2 -0.2 -0.3 -0.3 2000 2500 3000 3500 4000 2000 2500 3000 3500 4000 College Mother - Lang Less Than High School Mother - Lang 0.3 0.3 0.2 0.2 0.1 0.1 0 0 -0.1 -0.1 -0.2 -0.2 -0.3 -0.3 2000 2500 3000 3500 4000 2000 2500 3000 3500 4000 Note : This graph shows the relationship between birth weight and achievement in math (top panel) and language arts (bottom panel) for students born from 1992-2002 in Chile to mothers with college education and with less than high school education. The red circles and lines indicate first grade results and the darker colors represent eighth grade achievement. The grades have been standardized at the classroom level. The solid line represents a local second order polynomial regression. The dots represent a moving average with a centered window width of 30 grams. 32 6.2 Twins Fixed Effects Estimates To tackle the problem of unobserved characteristics and inputs, we modify equation (19) by including a dummy for the mother - i.e. a twins fixed effect. As suggested earlier, under certain assumptions, a twins estimate does a good job of recovering the true λ g . Table 2 estimates equation (22) using log birth weight and a dummy variable for low birth weight in separate regressions as the independent variables of interest. In table 2, statistical tests reveal that λ8 and λ1 as obtained under the fixed effects estimation are not different, suggesting that the twins estimates of the impact of birth weight on test scores do not appear to diminish over time. Table 2 suggests that a 10% increase in birth weight (corresponding to a 250 gram increase) raises test scores in math by 0.046 SD in 1st grade and that this effect is largely persistent. This is in sharp contrast to the OLS estimates discussed earlier. Table 2 also shows that the impact of being born low birth weight is fairly severe on math grades - on average, being low birth weight reduces math scores by 0.1 SD. 6.2.1 Heterogeneity We can also examine whether twins fixed effects results vary by observable characteristics of the mother. In Table 3 we show that examining twins fixed effects for mothers with high school and above is very similar to the effects obtained for mothers without a high school degree. To interpret this result in the context for our model, we require some assumptions about whether more educated and less educated mothers have different preferences with regards to inequality aversion across their children. To the extent we think that inequality aversion does not vary across high and low educated mothers, this result is not all together 33 surprising. The next two rows in Table 3 examines the results by type of school and the socioeconomic background of the children at the school. The SIMCE survey categorizes schools into five SES brackets using household data on the parents of the students that attend each school. We take the two lowest levels and designate them as “Low SES”. Twins fixed effects results restricted to this school type shows largely similar results, although the birth weight effect seems to increase slightly over time. The next panel shows the results by private schools in Chile and the while the pattern over time is similar in that the effect remains the same, the levels are quite a bit larger. We interpret these results as evidence that there does appear to be some heterogeneity in the birth weight effect by school type and socioeconomic background. 6.3 Differences in Twins and OLS Estimates: The Role of Parental Investments Twins fixed effects and OLS estimates contrast in patterns that are worth exploring further. In particular, while all estimation methods show a similar effect in grade 1, twins estimates stay persistent, while OLS estimates steadily decline over time (i.e. the effect of birth weight appears to lessen in later grades). Our model in Section 2 suggests that part of the reason for the differences in twins and OLS estimates is the role of parental investments. Recall that under OLS, we estimate λt with bias: g−1 λOLS g = λ g + Cov( BWij , ∑ βs+1 Xijg−s + ǫijg ) s=0 34 (25) g−1 Where ǫijg is the current shock to T and ∑s=0 β s+1 Xijg−s contains the complete history of unobserved parental inputs (the same Xijg ’s from equation (19)). Given that OLS is smaller than twins fixed effects, we can conclude, if twins fixed effects are unbiased, that the direction of bias is negative. The results and the model would imply that parental investments and birth weight are negatively correlated. We can test this correlation in the data. We acknowledge that while we view these correlations as a partial explanation for why the differences in twins and OLS estimations arise, these results are by no means causal, and neither do we attempt to get at a causal relationship between the role of parental investments and test scores. We also recognize that OLS and twins fixed effects can vary for a host of reasons, but within the context of our model and the data, the role of parental inputs appears to be the most tractable. An important aspect of parental investments might be the choice of school that parents send their children to. However, in the case of twins in Chile, nearly 95% of the time both twins attend the same school and grade. Hence, there is simply very little variation in terms of school choice within twin pairs. Statistical tests also reveal that within the context of a twin fixed effects regression, birth weight does not matter for choice of school (in this case the dependent variable examined was whether or not a child attends a private school these results are available upon request). Since we cannot study school choice within families as a credible source of varying parental investments, we turn to other data that more directly measure parental time investments at the individual child level. Table 4 estimates the relationship between parental investments (as reported by parents and children in separate columns) and birth weight for a subset of the data (see the data section on why we only have this data for a subset of our overall sample). The investments (measured in grade 4) are on a scale of 1-5 where 5 denotes “very often” and 1 denotes “never”. We aggregate these responses into a dummy variable that takes on the 35 value of 1 if parents report “often” or “very often” and 0 if parents report “never”, “not often” or “sometimes”. Since there are a wide range of investment questions, we aggregate these into a single index and also perform factor analysis to get summary measures of investments. These factors appear to be easily interpretable (in the parent responses for example) into educational and non-educational inputs. Educational inputs for example include questions like, “How often do you read to your child”, “Do you help your child with homework” etc, whereas non-educational inputs include questions like, “How often do you talk to your child”, “How often do you write messages for your child”, “How often do you run errands with your child”. In the case of child responses about parental investments, the factors lump into what we can term as more straightforward educational inputs and “educational encouragement”. “Educational encouragement” contains questions such as, “Parent congratulates me on good grades in school”, “Parent challenges me to get better grades” etc. A detailed list of the investment questions and its correlation with birth weight appears in Table 5. The broad results from Table 4 and 5 are quite obvious: OLS estimates reveal a negative relationship between investments and birth weight. In particular this appears to be true in the case of educational inputs. What is interesting is that both, parent and child responses to the questions reveal similar correlations. This is important as parents might be more likely to misreport how much they invest in their own children. A crucial assumption for interpreting twins fixed effects (within the context of our model, which admittedly suppresses other behavioral forces as mentioned earlier) as revealing the unbiased effect of birth weight on test scores is that parental investments are the same within twins. The model in section 2 suggested why this might be the case for twins due to public goods and spillovers in investments in households with twins. Given the data on parental investments, we can test within a twins fixed effects framework whether 36 investments vary by birth weight. Table 6 shows that with a twins fixed effect there appears to be no significant correlation between birth weight and parental investments. Hence, in the context of our model and the data, it appears that twins fixed effects do indeed result in an unbiased coefficient of birth weight on test scores. It is important to realize that these parental investments are positively correlated with test scores.12 While the model might suggest that controlling for parental inputs will make the OLS estimates closer to twins estimates, we do not find this to be true. We believe this is due to the fact that ultimately we only observe a small subset of various investments that parents engage in. Moreover, we certainly do not believe that the entire difference between OLS and twins are due to parental investments. There could be other biases at play, such as the role of schools or teachers that could mitigate or exacerbate the role of initial endowments. 12 Correlations between school performance and parental investments (using the parental responses) suggest that moving from ”Never” to ”Often” in terms of studying with the child, is correlated with an increase in test scores of 0.04 SD (these correlations are available upon request). 37 Figure 7: OLS and Fixed Effects Estimates for Twins: Math and Language Math Language Twins OLS 0.6 Coefficient on Log BW Coefficient on Log BW 0.6 0.5 0.4 0.3 0.2 0.1 0 0.5 0.4 0.3 0.2 0.1 1 2 3 5 6 4 Grade Level 7 0 8 1 2 3 5 6 4 Grade Level 7 8 Note : This graph shows how the coefficient on log birth weight changes as children become older using different estimation strategies. These coefficients are from Tables 1, 2 and 8. 6.4 Siblings Fixed Effects Estimates Siblings fixed effects in our case are useful to validate the ”degree” of public goods argument in Section 2. Proposition 1 suggests that over time, in the presence of public goods, test scores should converge less than without public goods or spillovers. Siblings can provide a validation check on this idea by tracking test scores differences within siblings who are close together in age and siblings far apart in age. Table 7 estimates equation (24) for two types of sibling groups - those who are 1 year apart and those between 3 and 4 years 38 apart.13 The results across grades suggest that siblings 1 year apart show patterns quite similar to twins whereas siblings 3-4 years apart show patterns similar to OLS, in that the test score differences over time show declines. Siblings fixed effects, while validating our idea of public goods within the household for parental investments, also show in a more general setting the importance of health at birth in determining school performance. Since twins form a small portion of the overall population, it is useful to show that birth weight matters for school achievement in a setting with sibling fixed effects. Figure 8: Siblings Fixed Effects Estimates for Math Sibs +3 Coefficient on Log BW 0.6 Sibs +1 0.5 0.4 0.3 0.2 0.1 0 1 2 3 5 4 Grade Level 6 7 8 Note : This graph shows how the coefficient on log birth weight changes as children become older using different estimation strategies. These coefficients are from Table 7. 13 Note that our sibling fixed effects estimates only use families that have exactly two children. This choice was made to avoid complications that might arise due to birth order or aspects of being a middle child etc. 39 6.5 Other School Achievement Variables While mathematics grades in school is the main subject we have focussed on, the data allows us to examine the effects of birth weight on language grades as well as nationalized tests such as SIMCE and the PSU. Table 8 shows our main estimates using OLS, twins and sibling fixed effects strategies for language scores between grades 1 and 8. The patterns for language mirror the patterns seen in math. While twins fixed effects estimates show a stable coefficient across each grade, OLS and larger sibling differences show a steady decline. Estimates for siblings 1 year apart are quite close to the twins estimates. Table 9 uses the SIMCE and PSU as the main dependent variable. In each case we have examined both math and language scores. The main difference here is that we are able to examine the birth weight effect up to grade 10 and even up to grade 12 (PSU). Hence, we find that the birth weight effect in the case of twins appears to last throughout the schooling period. The OLS counterpart in these tables show some decline in the effect, but the decline is less than what is seen using classroom standardized grades. Moreover, we are unable to estimate sibling fixed effects models in the case of SIMCE and PSU given the timing of the tests and the data availability. We view these results as supportive of our overall findings, but ultimately given that the tests are only administered in 4th, 8th and 10th grade, we do not view these results as the core of the paper which is focussed on understanding the dynamics of the birth weight effect over time. 40 7 Conclusion This paper examined the relationship between health at birth, subsequent parental investments and academic outcomes from childhood to adolescence using administrative data from Chile, a middle income OECD member country. Using data on all births in the country from 1992 to 2002 merged with schooling records for the entire education system we construct a panel following children from birth to high school graduation. We find a declining correlation between initial health measured by birth weight and academic outcomes as children progress through school. In contrast, siblings and twins fixed effects estimators show a more persistent relationship between initial health and academic outcomes throughout schooling years. In particular twins fixed effects models show strikingly persistent effects throughout 1st to 8th grade - a 10% increase in birth weight is associated with nearly 0.05 standard deviations higher performance in math. Similar results are found for national tests taken in fourth, eighth and tenth grade and as well as for the national college entrance exam after high school graduation. In addition, we find evidence that parental investments are larger for children of lower birth weight across families with similar observable characteristics suggesting a compensatory relationship between initial health and investments. We find suggestive evidence that this differential parental investment is decreasing in the age difference among siblings and is virtually absent among twins. We present a simple model of human capital accumulation and extend existing models of intra household allocations to include a dimension of parental investment spillovers. This model is able to rationalize three empirical features found in the data : 1) the observed behavior of parental investments, 2) declining correlation between birth weight and academic achievement in the population and 3) persistent twins estimates. This framework interprets the different empirical results through the lens of a simple human capital accu- 41 mulation model that implies varying degrees of bias in estimates of the relationship between initial health and later academic outcomes depending on the relationship between parental investments and endowments and how these accumulate over time. Thus this model rationalizes both the observed behavior of parental investments and the different OLS, siblings and twins estimates of the relationship between initial health and academic achievement in school as well as its evolution over time. We conclude that within the context of our model, because parents do not differentially invest among twins, these fixed effects models effectively identify the structural relationship between initial conditions at birth measured by birth weight and later academic outcomes described in the model presented. However, given the evidence presented shows parental investments are compensatory in this context, twins estimates overestimate the empirical relationship in the general population and suggest that differential parental investments seem to mitigate to some extent initial differences in endowments and this becomes more relevant over time as parents have more time to adjust. This result helps put prior empirical work using twins estimators into context with regard to the general population. It also highlights that some of the inequalities at birth can potentially be undone through the efforts made by parents and possibly public policies aimed at investing in the health and human capital of children. A deeper understanding of how parents invest and precisely what types of investments matter more would be a fruitful topic for future research in this area. 42 References A BERNETHY, L., M. PALANIAPPAN , AND R. C OOKE (2002): “Quantitative magnetic reso- nance imaging of the brain in survivors of very low birth weight,” Archives of disease in childhood, 87(4), 279. A DHVARYU , A. R., AND A. N YSHADHAM (2012): “Endowments at Birth and Parents In- vestments in Children,” Unpublished manuscript. Yale University. A IZER , A., AND F. C UNHA (2010): “Child Endowments, Parental Investments and the De- velopment of Human Capital,” Brown University Working Paper. A LMOND , D., K. C HAY, AND D. L EE (2005): “The Costs of Low Birth Weight,” The Quarterly Journal of Economics, 120(3), 1031–1083. A LMOND , D., AND J. C URRIE (2011a): Human Capital Development before Age Fivevol. 4 of Handbook of Labor Economics, chap. 15, pp. 1315–1486. Elsevier. (2011b): “Killing me softly: The fetal origins hypothesis,” The Journal of Economic Perspectives, 25(3), 153–172. A LMOND , D., AND B. M AZUMDER (2013): “Fetal origins and parental responses,” Working Paper Series WP-2012-14, Federal Reserve Bank of Chicago. A NNIBALE , D., AND J. H ILL (2008): “Periventricular Hemorrhage–Intraventricular Hem- orrhage,” URL: www. emedicine. com/ped/topic, 2595. A SHENFELTER , O., AND C. R OUSE (1998): “Income, Schooling, and Ability: Evidence from A New Sample of Identical Twins,” Quarterly Journal of Economics, 113(1), 253–284. 43 B ANERJEE , A., S. C OLE , E. D UFLO , AND L. L INDEN (2007): “Remedying Education: Evi- dence from Two Randomized Experiments in India*,” The Quarterly Journal of Economics, 122(3), 1235–1264. B EHRMAN , J. R., R. A. P OLLAK , AND P. TAUBMAN (1982): “Parental preferences and pro- vision for progeny,” The Journal of Political Economy, pp. 52–73. B LACK , S., P. D EVEREUX , AND K. S ALVANES (2007): “From the Cradle to the Labor Mar- ket? The Effect of Birth Weight on Adult Outcomes*,” The Quarterly Journal of Economics, 122(1), 409–439. C ONTI , G., J. J. H ECKMAN , J. Y I , AND J. Z HANG (2010): “Early Health Shocks, Parental Responses, and Child Outcomes,” Unpublished manuscript, Department of Economics, University of Hong Kong. D UFLO , E., AND R. H ANNA (2005): “Monitoring works: getting teachers to come to school,” NBER Working Paper. F IGLIO , D. N., J. G URYAN , K. K ARBOWNIK , AND J. R OTH (2013): “The Effects of Poor Neonatal Health on Children’s Cognitive Development,” Discussion paper, National Bureau of Economic Research. H ACK , M., N. K LEIN , AND H. TAYLOR (1995): “Long-term developmental outcomes of low birth weight infants,” The future of children, 5(1), 176–196. H ASTINGS , J. S., C. A. N EILSON , AND S. D. Z IMMERMAN (2013): “Are Some Degrees Worth More than Others? Evidence from college admission cutoffs in Chile,” Working Paper 19241, National Bureau of Economic Research. 44 H ECKMAN , J. (2007): “The economics, technology, and neuroscience of human capability formation,” Proceedings of the National Academy of Sciences, 104(33), 13250. L EWIS , M., AND M. B ENDERSKY (1989): “Cognitive and motor differences among low birth weight infants: Impact of intraventricular hemorrhage, medical risk, and social class,” Pediatrics, 83(2), 187. L OUGHRAN , D., A. D ATAR , AND M. K ILBURN (2004): “The Interactive Effect of Birth Weight and Parental Investment on Child Test Scores,” RAND Labor and Population Working Paper WR-168. M URALIDHARAN , K., AND V. S UNDARARAMAN (2009): “Teacher performance pay: Exper- imental evidence from India,” Working Paper. O REOPOULOS , P., M. S TABILE , AND R. WALLD (2008): “Short, Medium, and Long Term Consequences of Poor Infant Health: An Analysis Using Siblings and Twins,” Journal of Human Resources, 43, 88–138. R OSENZWEIG , M., AND J. Z HANG (2009): “Do Population Control Policies Induce More Human Capital Investment? Twins, Birth Weight and China’s One-Child Policy,” Review of Economic Studies, 76(3), 1149–1174. TODD , P., AND K. W OLPIN (2007): “The production of cognitive achievement in children: Home, school, and racial test score gaps,” Journal of Human capital, 1(1), 91–136. TORCHE , F., AND G. E CHEVARR ÍA (2011): “The effect of birthweight on childhood cognitive development in a middle-income country,” International journal of epidemiology, 40(4), 1008–1018. 45 For Online Publication A Appendix: Model A.1 Deciding Between Educational and Non-Educational Inputs In order to better understand what happens to effective time endowments in the case with and without public goods in parental investments, we consider a problem where education is not the only activity in the household, and other competing activities may be also important for raising a child. The second part of the problem is related to the possibility that parents can strategically use investment time to reinforce the difference between siblings, for efficiency motives, or compensate the less endowed child, for inequality aversion motives. In our model, we explore the implications of both cases. We start assuming that parents allocate time among different activities to raise their children. Specifically, parents can allocate time between educational activities TE or non educational activities TNE . We can think of the parents’ problem as max TE ,TNE s.t. V (TE , TNE ) (26) TE + TNE ≤ T Where V is the utility coming from educational and non educational activities14 . T is 14 An alternative formulation consists on assuming that parents maximize the production of “children quality”, that uses time in both educational and non educational inputs. Thus, the allocation of time is related 46 the total time allocated to raise the children in the household. Note that, if there are more than on child in the household, parents use the aggregate educational and non educational times, and utilities to make the allocation decision. ∗ the optimal allocation of time, coming from the solution of We denote TE∗ and TNE the maximization problem in equation (26). Note that the optimal allocation depends on the marginal utilities associated with the educational and non-educational activities. In the main text, for expositional ease, we refer to TE∗ as TE . A.2 Allocations in the presence of public goods max TE ,TNE s.t. V (TE , TNE ) (27) TE + TNE ≤ T ∗ are the optimal allocation, they satisfied the first order conditions: If TE∗ and TNE combined: ∗ ) ∂V (TE∗ , TNE = λ ∂TE ∗ ) ∂V (TE∗ , TNE = λ ∂TNE ∗ ) ∗ ) ∂V (TE∗ , TNE ∂V (TE∗ , TNE = ∂TE ∂TNE (28) to the marginal productivity, in opposite to the marginal utility associated with the formulation presented in the main text. 47 When parents know the public good dimension of parental investment, they realize that their effort TE effectively converts into T̂E = (1 + δ(i, i ′ ))TE . Therefore, they solve the problem max T̂E ,TNE s.t. V (T̂E , TNE ) (29) T̂E + TNE ≤ T 1 + δ(i, i ′ ) In a similar way than in the absence of public good, we combined the first order equations to obtain ∗∗ ) ∗∗ ) ∂V (TE∗∗ , TNE ∂V (TE∗∗ , TNE (1 + δ(i, i ′ )) = ∂TE ∂TNE (30) where TE∗∗ (1 + δ(i, i ′ ) = T̂E∗ . Proposition: If V is a Leontief utility function, the optimal allocation for educational activities when there is a degree of public good dimension in parental investments is smaller than in the case without public goods. Proof 1 ∗ the optimal allocations in the absence of public good dimension on Let’s denote TE∗ and TNE ∗∗ the optimal allocations when public good dimension parental investment, and TE∗∗ and TNE on parental investment is present. Finally, T̂ represents the effective time, when the public good dimension feature is present. 48 V is a Leontief production function, expressed as V (TE , TNE ) = min{ a1 TE , a2 TNE } It is well known that the solution for the optimal allocation for the Leontief utility function is that TE∗ = a2 T a1 + a2 ∧ ∗ TNE = a1 T a1 + a2 (31) The public good dimension of parental investment effectively increases the parameter a1 from its original value to a1 (1 + δ(i, i ′ )). Therefore, the new optimal allocations are TE∗∗ = a2 T a1 (1 + δ(i, i ′ )) + a2 ∧ ∗ TNE = a1 (1 + δ(i, i ′ )) T a1 + a2 (32) Comparing the allocation assigned to educational activities in (31) with the one displayed in (32), it is easy to see that the public good dimension of parental investment induce a decrease in the time assigned to educational activities. A.3 Proof of Proposition 1 from Section 4 If compensating (reinforcing) parents can fully differentiate the educational inputs allocated to each child, the test score gap between siblings will decrease (increase) over time. If there is only partial parental investment differentiation, the test score gap may decrease (increase), but this decrease (increase) will be less than in the case of full differentiation. Proof 2 49 For the case of fully differentiation, equations (10) and (11) indicate that, for given cognitive endowments θ1jg and θ2jg′ , the allocation for child 1 is just a factor of allocation for child 2. In particular, the factor is C(γ, ρ, θ1jg , θ2jg′ ) = θ2jg′ θ1jg ! γρ ( 1− γ ) ρ −1 Without loss of generality, let’s assume that child 1 has a higher cognitive endowment that child 2. Thus, θ2jg′ θ1jg < 1. Additionally, if ρ < 0, or when parents present a compensating behavior, the exponent γρ (1− γ ) ρ − 1 > 0, because numerator and denominator are both negative. We conclude that C(γ, ρ, θ1jg , θ2jg′ ) < 1, and therefore, the parental investment allocation for child 2 is bigger than for child 1, which is consistent with the compensating behavior. Note that, if ρ > 0, γρ (1− γ ) ρ − 1 < 0, and therefore C(γ, ρ, θ1jg , θ2jg′ ) > 1. If child 2 has lower cognitive endowment that child 1, he or she will receive higher educational inputs. Equation (12) captures the evolution of cognitive endowments, and it shows that higher values of educational inputs for child 2 will reduce the gap between the cognitive endowments15 . As θ2jg′ −→ θ1jg , the factor C(γ, ρ, θ1jg , θ2jg′ ) −→ 1, producing the convergency of cognitive endowments, optimal educational inputs and test scores. In the case of partial differentiation, we can assume without loss of generality that the actual parental investment received by the children is a weighted average of the optimal 15 In order to rule out a overshooting behavior from the parents, and to make the evolution of cognitive endowment a relatively persistent process, we assume a specific region for the parameters β X , β Xθ , and T. 50 parental investment expressed in equations (10) y (11). In other words, ∗ ∗ X̃1jg = α1 X1jg + (1 − α1 )X2jg ′ ∗ ∗ X̃2jg = α2 X1jg + (1 − α2 )X2jg ′ where the tilde represents the actual educational input received by each child. Partial differentiation implies that α1 and α2 are in the interval (0,1). From the previous discussion, we know that if child 2 has a lower endowment, X1jg < X2jg′ , and therefore ∗ ∗ X1jg < X̃1jg < X2jg ′ ∧ ∗ ∗ X1jg < X̃2jg′ < X2jg ′ It is easy to conclude that the compensating effort in the partial differentiation case will reduce less the gap in the cognitive endowment dimension than in the case of full ∗ differentiation. This is because X1jg < X̃1jg , or the high endowed child receives more ∗ implies that the low parental investment in the partial differentiation case, and X̃2jg′ < X2jg ′ endowed child receives less parental investment in the partial differentiation case. Corollary 2 The public good dimension of parental investment implies partial differentiation across children. Thus, the compensating (reinforcing) behavior will take longer to reduce (increase) the test score gap, than in the absence of public good dimension. Proof 3 51 According to our model, the public good dimension feature of parental investment implies that the optimal allocation for child 1 (denoted by double stars) satisfies ∗∗ ∗∗ ∗ + δ(1, 2) X2jg = X1jg X̂1jg ′ = = = = # "    γρ  θ ′  (1−γρ   γ ) ρ −1 θ2jg′ (1−γ)ρ−1  TE∗∗ 2jg − δ(1, 2) + δ(1, 2) 1 − δ(1, 2) h  θ ′  (1−γρ i θ1jg θ1jg γ ) ρ −1 2jg (1 − δ(1, 2)) 1 + θ1jg # "  (1−γρ  γ ) ρ −1 ′ θ TE∗∗ 2jg 2 h  θ ′  (1−γρ i (1 − δ(1, 2) ) θ γ ) ρ −1 1jg 2jg (1 − δ(1, 2)) 1 + θ1jg "   γρ # θ2jg′ (1−γ)ρ−1 TE∗∗ T̂E∗ ∗∗ ( 1 + δ ( 1, 2 )) = but T γρ E i h  θ ′  (1 − γ ) ρ −1 θ1jg 1 + δ(1, 2) 1 + θ2jg 1jg   γρ ∗ θ2jg′ (1−γ)ρ−1 TˆE i θ h  θ ′  (1−γρ γ ) ρ −1 1jg 1 + θ2jg 1jg Which is exactly the same expression than in the original case, but with T̂E∗ instead of TE∗ . ∗ can be Furthermore, because T̂E∗ < TE∗ , it is easy to show that there is α1 such that X̂1jg written as ∗ ∗ ∗ X̂1jg = α1 X1jg + (1 − α1 )X2jg ′ ∗ . Similarly for X̂2jg ′ Therefore, the public good dimension is a particular case of partial differentiation, and the results of the proposition can be apply for this case. A.4 Simulation Details All the figures in the main text where constructed using the solutions simulated in Matlab 7.12. 52 The solutions for the optimal allocations are presented in equations (10) and (11). We simulate the solutions with the following parameters: Optimal Allocation Parameters ρ 40 equidistant points in the interval [−0.9, 0.9] γ 0.8 θ1j1 3.0 θ2j1 2.0 TE 0.375 Evolution of Endowments βX 0.3 βθ 1.0 β Xθ 0.01 Public Good Parameters δ(i, i ′ ) δ(age difference) δ 0.8 Age Difference 1.5 Starting with the initial values of θ presented in the table above, and the solution for optimal allocation of parental investment X ∗ , we constructed the evolution of θ over time for each child. Once we have the sequence of optimal X and the implied θ, we calculate the test scores, using the equation γ (1− γ ) Tijg = θijg · Xijg 53 TABLE 1: Birth Weight and Test Scores - OLS Estimates Grade Standardized Math Scores 1 2 3 4 5 6 7 8 OLS: Sample uses same birth weight support as twins (0-3000 grams) Log Birth Weight Observations 0.403 0.384 0.368 0.349 0.277 0.260 0.233 (0.00958)*** (0.00896)*** (0.00891)*** (0.00880)*** (0.00904)*** (0.00965)*** (0.0103)*** 485,991 552,931 581,559 591,286 557,175 491,201 431,634 0.236 (0.0117)*** 357,937 OLS: Twins sample Log Birth Weight Observations 0.357 (0.0322)*** 30,353 0.298 (0.0308)*** 31,586 0.329 (0.0335)*** 31,212 0.333 (0.0321)*** 30,849 0.277 (0.0322)*** 28,478 0.282 (0.0352)*** 24,919 0.244 (0.0380)*** 21,755 0.202 (0.0465)*** 17,874 Robust standard errors in pare *** p<0.01, ** p<0.05, * p< Note: All estimates control for gestational age, mother's age and education and sex of the child. The dependent variable is standarized classroom grades in math. Standardized Math Scores TABLE 2: Birth Weight and Test Scores - Twins Fixed Effect Estimates Grade 1 2 3 4 5 6 7 8 Log Birth Weight 0.468 (0.0410)*** 0.477 (0.0408)*** 0.482 (0.0410)*** 0.560 (0.0415)*** 0.523 (0.0432)*** 0.513 (0.0477)*** 0.538 (0.0524)*** 0.479 (0.0590)*** Low Birth Weight -0.0777 (0.0134)*** -0.0815 (0.0133)*** -0.0861 (0.0134)*** -0.104 (0.0136)*** -0.109 (0.0140)*** -0.0902 (0.0154)*** -0.108 (0.0169)*** -0.103 (0.0189)*** 30,353 15,740 31,586 16,496 31,212 16,350 30,849 16,187 28,478 14,961 24,919 13,160 21,755 11,572 17,874 9,564 Observations Number of Twin Pairs Robust standard errors in pare *** p<0.01, ** p<0.05, * p< Notes: All estimates control for sex of the child. TABLE 3: Birth Weight and Test Scores - Heterogeneity: Twins Estimates Grade Standardized Math Scores 1 2 3 4 5 6 7 8 All coefficients reported are on log birth weight using twin fixed effects Mother with high school and above 0.476 (0.0477)*** 0.520 (0.0473)*** 0.536 (0.0476)*** 0.613 (0.0484)*** 0.541 (0.0503)*** 0.514 (0.0555)*** 0.563 (0.0616)*** 0.465 (0.0697)*** Mothers with less than high school 0.436 (0.0809)*** 0.339 (0.0812)*** 0.302 (0.0820)*** 0.397 (0.0815)*** 0.456 (0.0854)*** 0.497 (0.0935)*** 0.478 (0.101)*** 0.517 (0.112)*** Mother Employed 0.482 (0.0784)*** 0.604 (0.0802)*** 0.572 (0.0805)*** 0.531 (0.0837)*** 0.472 (0.0899)*** 0.454 (0.0976)*** 0.555 (0.108)*** 0.343 (0.123)*** Mother Unemployed 0.459 (0.0477)*** 0.421 (0.0470)*** 0.445 (0.0475)*** 0.569 (0.0476)*** 0.539 (0.0491)*** 0.532 (0.0546)*** 0.533 (0.0599)*** 0.523 (0.0672)*** Santiago 0.486 (0.0643)*** 0.513 (0.0639)*** 0.443 (0.0630)*** 0.544 (0.0644)*** 0.505 (0.0671)*** 0.549 (0.0740)*** 0.497 (0.0835)*** 0.404 (0.0941)*** Non-Santiago 0.454 (0.0531)*** 0.450 (0.0529)*** 0.514 (0.0540)*** 0.572 (0.0541)*** 0.535 (0.0566)*** 0.484 (0.0624)*** 0.565 (0.0672)*** 0.531 (0.0756)*** 0.319 (0.191)* 0.804 (0.194)*** 0.813 (0.182)*** 0.748 (0.179)*** 0.751 (0.187)*** 0.743 (0.195)*** 0.790 (0.205)*** 0.713 (0.254)*** 0.432 (0.0562)*** 0.329 (0.0573)*** 0.339 (0.0590)*** 0.504 (0.0599)*** 0.465 (0.0634)*** 0.483 (0.0721)*** 0.515 (0.0790)*** 0.506 (0.0878)*** Private schools Poor Schools Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Notes: All estimates control for sex of the child. School categories are based on a 2010 categorization of schools in Chile. Hence, a school's classification as of 2010 is assumed to be the same between 2002-2008. Table 4: Parental Investments and Birth Weight - OLS Estimates Standardized Investment 2002 Parent report of Investments Standardized PCA: NonInvestment Educational 2007 Investments PCA: Educational Investments Child's report of parental investments Standardized PCA: PCA: Investment Educational Educational 2009 Investments Encouragement OLS: Full Sample Log Birth Weight -0.0128 (0.0146) -0.0588 (0.0165)*** 0.0240 (0.0147) -0.100 (0.0145)*** -0.0460 (0.0105)*** -0.0367 (0.0119)*** 0.00766 (0.0119) Observations 192,833 169,234 193,017 193,017 377,853 295,137 295,137 OLS: Sample uses same birth weight support as twins Log Birth Weight Observations -0.00989 (0.0276) -0.0813 (0.0347)** 0.0703 (0.0277)** -0.121 (0.0275)*** -0.0736 (0.0210)*** -0.0507 (0.0298)* 0.0146 (0.0313) 58,806 48,010 60,027 60,027 105,893 48,635 48,635 -0.180 (0.117) -0.0936 (0.101) -0.0799 (0.119) -0.309 (0.116)*** 0.0338 (0.0848) 0.00455 (0.132) -0.0784 (0.131) 2,833 2,617 2,900 2,900 2,583 2,583 2,583 OLS: Twins Sample Log Birth Weight Observations Robust standard errors in *** p<0.01, ** p<0.05, * Notes: All regressions control for gestational age, mother's age and education and sex of the child. "Standardized" investments use all investment related questions to create a single composite measure. "PCA" denotes measures obtained from Pricipal Components Analysis. Details of this procedure are available upon request. "PCA" components for parental responses are computed over their responses to the 2002 survey, and child responses are obly available from 2009. All investment meausres are asked of children in grade 4. Table 5: Parental Investments and Birth Weight - OLS Estimates Details Common support sample Review Homework Help with Homework Study with Child Read to Child Give math problems Talk to Child Run errands with child -0.0348 (0.0129)*** -0.0520 (0.0144)*** -0.0450 (0.0148)*** 0.00463 (0.0110) -0.00913 (0.0149) 0.00674 (0.00999) -0.0151 (0.0141) 45,106 0.777 45,106 0.679 45,106 0.634 45,106 0.322 Parent explains things Parent helps study 45,106 0.643 Parent congratulates me on good performance -0.0376 (0.0112)*** -0.0447 (0.0117)*** -0.0344 (0.0119)*** 0.00767 (0.0106) -0.00572 (0.00915) -0.0180 (0.0120) 0.00518 (0.0116) 79,839 0.555 79,762 0.500 78,676 0.484 78,759 0.752 68,489 0.835 73,551 0.408 78,486 0.618 Details on Investments (Parent Responses) Review Homework Help with Homework Study with Child Read to Child Give math problems Talk to Child Run errands with child Log Birth Weight -0.104 (0.0518)** -0.162 (0.0561)*** -0.110 (0.0573)* -0.0834 (0.0305)*** 0.0242 (0.0575) 0.0112 (0.0380) -0.0125 (0.0538) 2,900 0.744 2,900 0.665 2,900 0.633 2,900 0.367 Parent explains things Parent helps study 2,900 0.637 Parent congratulates me on good performance 0.0173 (0.0420) 0.00570 (0.0433) -0.0222 (0.0437) -0.0386 (0.0376) 0.0243 (0.0360) 0.00229 (0.0438) 0.0356 (0.0426) 5,652 0.543 5,641 0.486 5,548 0.467 5,583 0.737 4,857 0.824 5,206 0.405 5,540 0.615 Details on Investments (Parent Responses) Log Birth Weight Observations Mean of dependent variable Details on Investments (Child Responses) Log Birth Weight Observations Mean of dependent variable Parent helps with Parent knows chores grades in school 45,106 45,106 0.882 0.708 Parent challenges me Parent willing to help to get good grades Twins sample Observations Mean of dependent variable Details on Investments (Child Responses) Log Birth Weight Observations Mean of dependent variable Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Parent helps with Parent knows chores grades in school Notes: All regressions control for gestational age, mother's age and education and sex of the child. 2,900 2,900 0.885 0.719 Parent challenges me Parent willing to to get good help grades Table 6: Parental Investments and Birth Weight - Fixed Effects Estimates Overall measures Log Birth Weight Observations Details on Investments (Parent Responses) Log Birth Weight Observations Details on Investments (Child Responses) Log Birth Weight Investment 2002 Parent report of Investments PCA: NonInvestment Educational 2007 Investments PCA: Educational Investments Child's report of parental investments PCA: PCA: Standardized Educational Educational Investment Investments Encouragement 0.109 (0.0835) 0.120 (0.0907) 0.105 (0.0882) -0.0186 (0.101) -0.0397 (0.146) 0.0998 (0.238) 0.299 (0.263) 2,833 2,617 2,900 2,900 5,701 2,583 2,583 Review Homework Help with Homework Study with Child Read to Child Give math problems Talk to Child Run errands with child 0.0502 (0.0466) -0.0699 (0.0490) 0.0382 (0.0495) 0.0249 (0.0270) 0.0499 (0.0488) -0.00482 (0.0355) 0.0449 (0.0430) 2,900 2,900 2,900 8,541 2,900 2,900 2,900 Parent explains things Parent helps study Parent helps with chores 0.0622 (0.0764) -0.0795 (0.0785) -0.0144 (0.0850) -0.107 (0.0727) 0.0285 (0.0713) -0.0604 (0.0847) 0.0893 (0.0812) 5,548 0.467 5,583 0.737 4,857 0.824 5,206 0.405 5,540 0.615 Observations 5,652 5,641 Mean of dependent var 0.543 0.486 Robust standard errors in *** p<0.01, ** p<0.05, * Notes: all regressions control for sex of the child. Parent Parent Parent knows congratulates challenges Parent willing to grades in me on good me to get help school performance good grades TABLE 7: Birth Weight and Test Scores - Sibling Fixed Effect Estimates Grade Standardized Math scores Siblings 1 year apart Log Birth Weight Observations Siblings 3-4 years apart Log Birth Weight Observations 1 2 3 4 5 6 7 8 0.482 (0.122)*** 2383 0.531 (0.102)*** 2659 0.413 (0.101)*** 2796 0.542 (0.0967)*** 3052 0.421 (0.102)*** 2967 0.317 (0.111)*** 2607 0.355 (0.125)*** 2265 0.412 (0.138)*** 1775 0.445 (0.0747)*** 6434 0.319 (0.0719)*** 7062 0.410 (0.0720)*** 7215 0.375 (0.0707)*** 7388 0.228 (0.0728)*** 6647 0.227 (0.0855)*** 5293 0.107 (0.0983) 3989 0.194 (0.139) 2494 Robust standard errors in paren *** p<0.01, ** p<0.05, * p<0. Note: Sample uses siblings on common birth weight support as twins (0-3000 grams). All regressions control for gestational age, mother's age and education and sex of the child. TABLE 8: Birth Weight and Language Test Scores Grade Standardized Language Scores 1 2 3 4 5 6 7 8 Twins FE 0.427 (0.0394)*** 0.386 (0.0392)*** 0.342 (0.0391)*** 0.399 (0.0391)*** 0.322 (0.0400)*** 0.295 (0.0435)*** 0.341 (0.0491)*** 0.349 (0.0553)*** OLS (Twins Sample) 0.278 (0.0316)*** 0.204 (0.0315)*** 0.229 (0.0306)*** 0.186 (0.0326)*** 0.141 (0.0321)*** 0.0918 (0.0353)*** 0.0906 (0.0378)** 0.0569 (0.0478) 0.218 (0.119)* 0.473 (0.0990)*** 0.193 (0.0968)** 0.338 (0.0948)*** 0.236 (0.0970)** 0.0307 (0.107) 0.160 (0.118) 0.272 (0.135)** 0.362 (0.0730)*** 0.327 (0.0690)*** 0.282 (0.0696)*** 0.194 (0.0692)*** 0.170 (0.0713)** 0.155 (0.0839)* 0.0616 (0.0954) -0.110 (0.139) Siblings 1 year apart (FE) Siblings 3-4 years apart (FE) Robust standard errors in paren *** p<0.01, ** p<0.05, * p<0. Notes: All estimates control for sex of the child. OLS and Sibling estimates contain other controls, see notes under Table 1 & 7. Table 9: SIMCE and PSU test scores SIMCE 8th Grade 10th Grade College Entrance 0.601 (0.0503)*** 0.578 (0.0975)*** 0.432 (0.102)*** 0.465 (0.109)*** 0.308 (0.0291)*** 22790 0.306 (0.0598)*** 6180 0.178 (0.0634)*** 5416 0.329 (0.0770)*** 5052 0.397 (0.0531)*** 0.338 (0.101)*** 0.327 (0.112)*** 0.281 (0.109)*** 0.115 (0.0292)*** 22,790 0.102 (0.0607)* 6,180 0.121 (0.0662)* 5,416 0.142 (0.0763)* 5,052 VARIABLES 4th grade All estimates are the coefficient on log birth wei Math Twins FE OLS (Twins Sample) Observations Language Twins FE OLS (Twins Sample) Observations Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Notes: All estimates control for sex of the child. OLS estimates contain other controls, see notes under Table 1.