Academia.eduAcademia.edu

Strategy diversity in early mathematics classrooms

2020, Contemporary Educational Psychology

https://doi.org/10.1016/j.cedpsych.2019.101834

Strategic processes are a form of procedural knowledge in which a child knows how to enact a given strategy that improves their capability in problem solving or learning. The solution strategies children use are critical components of their learning, especially in mathematics. Children vary substantially in their knowledge and use of different strategies, and much research has focused on intraindividual strategy variability. However, we do not know if classrooms that evince a broader variety of strategies across children are related to higher mathematics achievement. We investigated the diversity of arithmetical strategies within classrooms and examined the relations between strategy diversity and mathematical achievement as children moved from preschool to kindergarten and first grade. These analyses were applied to data from a large-scale experiment involving 1305 children from 42 schools and 106 classrooms. We created and applied a new method of measuring classroom strategy diversity and related this measure to children's concurrent and subsequent math achievement. We found that early strategy diversity was strongly related to achievement, but in subsequently, less diversity was so related. We compared these results to the predictions of three theoretical categories and found that our results mainly supported one.

Contemporary Educational Psychology 60 (2020) 101834 Contents lists available at ScienceDirect Contemporary Educational Psychology journal homepage: www.elsevier.com/locate/cedpsych Strategy diversity in early mathematics classrooms☆ a,⁎ a a b a Douglas H. Clements , Denis Dumas , Yixiao Dong , Holland W. Banse , Julie Sarama , Crystal A. Day-Hessa a b T University of Denver, United States University of Alabama, United States A R T I C LE I N FO A B S T R A C T Keywords: Arithmetic Classroom characteristics Early childhood Learning trajectories Mathematics education Strategies Strategic processes are a form of procedural knowledge in which a child knows how to enact a given strategy that improves their capability in problem solving or learning. The solution strategies children use are critical components of their learning, especially in mathematics. Children vary substantially in their knowledge and use of different strategies, and much research has focused on intraindividual strategy variability. However, we do not know if classrooms that evince a broader variety of strategies across children are related to higher mathematics achievement. We investigated the diversity of arithmetical strategies within classrooms and examined the relations between strategy diversity and mathematical achievement as children moved from preschool to kindergarten and first grade. These analyses were applied to data from a large-scale experiment involving 1305 children from 42 schools and 106 classrooms. We created and applied a new method of measuring classroom strategy diversity and related this measure to children’s concurrent and subsequent math achievement. We found that early strategy diversity was strongly related to achievement, but in subsequently, less diversity was so related. We compared these results to the predictions of three theoretical categories and found that our results mainly supported one. Cognitive strategies are goal-directed and effortful procedures that children employ to aid in the regulation, execution, or evaluation of a problem or task (Alexander & Judy, 1988; Alexander, Grossnickle, Dumas, & Hattan, 2018). As such, strategic processes represent a form of procedural knowledge in which a child knows how to enact a given strategy that improves their capability in problem solving or learning (Dinsmore, 2017; Dumas, 2019). The solution strategies children use in mathematics are a particularly critical component of their learning in that domain (e.g., Biddlecomb & Carr, 2011; Carr, Jessup, & Fuller, 1999; Fennema et al., 1996; Pang & Kim, 2018; Sherin & Fuson, 2005; Wansart, 1990). Children’s invented strategies may contribute to accuracy, problem-solving ability, base-ten number concepts, and flexibility in transferring knowledge in arithmetic (Carpenter, Franke, Jacobs, Fennema, & Empson, 1998). When the same tasks are posed to multiple children, they often differ in the strategic procedures they employ to solve them (Magliano, Trabasso, & Graesser, 1999; Rhodes et al., 2019). Most research has focused on these as individual differences of strategy employment as well as on intraindividual strategy variability (although sometimes called “diversity” in the literature, we will use “variability” for intraindividual variety and reserve “diversity” for classroom, or interindividual, variety of strategies for clarity). Far less is known about the influence of classroom-level strategic diversity—the number of different strategies used within the classroom learning context. Further, previous research has perennially considered the differential effectiveness of various strategic processes to improve student performance at the level of the individual student (Hickendorff, van Putten, Verhelst, & Heiser, 2010; Pressley & Harris, 2006). Specific instruction in mathematical strategies has been shown to be effective (Franke, Kazemi, & Battey, 2007; Jacobs & Empson, 2016), especially ☆ The original research was supported by the Institute of Education Sciences, U.S. Department of Education, through grants R305K05157 and R305A110188 and also by the National Science Foundation, through grants ESI-9730804 and REC-0228440 and the present analyses were supported in part by the supported in part by the Heising-Simons Foundation Grant #2015-156. The opinions expressed are those of the authors and do not represent views of the funders. A minor component of the intervention used in this research has been published by two of the authors, who thus could have a vested interest in the results. An external auditor oversaw the research design, data collection, and analyses of the original study (the present analyses have no implications for the curriculum). The authors wish to express appreciation to the original school districts, teachers, and children who participated in this research and to Megan Franke, who encouraged us to explore strategy diversity in classrooms. ⁎ Corresponding author at: University of Denver, Denver, CO 80210, United States. E-mail address: [email protected] (D.H. Clements). https://doi.org/10.1016/j.cedpsych.2019.101834 Available online 23 December 2019 0361-476X/ © 2019 Elsevier Inc. All rights reserved. Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. sophisticated strategies (Kerkman & Siegler, 1993). Further, when provided opportunities to develop content knowledge, the former children’s levels of accuracy, speed, and strategy use were comparable (Siegler, 1993). for children with special needs (e.g., Fuchs et al., 2010; Naglieri & Johnson, 2000; Powell & Fuchs, 2015; Rockwell, Griffin, & Jones, 2011), although there are exceptions (e.g., Clearinghouse, 2017). However, these studies took quite different pedagogical approaches, from direct teaching of strategies to consistent evocation of a variety of strategies. Similarly, teachers receive conflicting messages regarding the benefits of eliciting diverse strategies and the different approaches to strategy instruction (Carpenter et al., 1998). In summary, there is little research that investigates whether classrooms that evince a broader variety of strategies are related to higher mathematics achievement than classrooms with less diversity. That is, if children within a given classroom spontaneously enact strategies that differ from one another, would the level of strategic diversity be a positive or negative predictor of child performance in that classroom, and would that predictive relation vary depending on other factors (e.g., level of schooling, or individual characteristics)? In the current study, we investigate this question by measuring the diversity of mathematical strategies within classrooms and examine the relations between such strategy diversity and mathematical achievement as children moved from preschool to kindergarten and first grade. This age range is particularly appropriate because these are the years that arithmetic strategies are first developing for most children. 1.2. Classroom strategy diversity and children’s learning of arithmetic Although many educators consider strategic development central to learning, their goals for diverse strategy use by children differ substantially. Because distinct learning goals have strong effects on strategy development (Clements, Agodini, & Harris, 2013; McNeal, 1995), we consider three categories of approaches to children’s learning. Before we define these categories, however, we clarify the relations of our classification structure to the more general framework of discovery learning versus learning from direct instruction to avoid any confusion. Theory and research on the latter framework often use three categories as well: unassisted discovery learning, enhanced discovery learning, and learning from direct or explicit instruction (Alfieri, Brooks, Aldrich, & Tenenbaum, 2010). Our research focuses specifically on classroom strategy diversity with possible implications for pedagogical approaches (although we do not measure teacher instruction here, these will be a proper subset of the larger discovery/explicit instruction debate). However, although our categories relate to this general framework, there is not a direct mapping, as we delineate in this and the final section. Our first category values as much diversity as possible. Consistent with this, researchers have shown the benefits of teachers’ noticing, encouraging, and discussing children’s different solution strategies (e.g., Choppin, 2011; Stein, Engle, Smith, & Hughes, 2008). As an example in early arithmetic, some children may solve 3 + 5 with a counting-all procedure, counting out a set of three items, then counting out five more items, and then counting all those starting at “1,” reporting “8.” Others may count on from the initial number to the total, keeping track of the number added on the fingers, as in “3…5, 6, 7, 8!” Still, others might put up three fingers on one hand and five on the other then recognize the total. This category implies the prediction that more diversity at every time point will correlate with higher achievement at the early time point (concurrently) and at subsequent time points because it supports children’s creative thinking and use of strategies that are most meaningful to them (Kam, 2017). Note that in this approach, as well as the other two approaches, such inter-individual diversity does not necessarily imply or depend on intraindividual strategy variability; for example, children may use one “favorite” strategy, but that strategy may differ from child to child. The second category includes the competing perspective that education is more efficient and mathematically rigorous if children learn accurate definitions and demonstrate a single prescribed, accurate mathematical procedure for a certain type of problem, obviating the need for children to invent or experience multiple strategies (see Carnine, Jitendra, & Silbert, 1997; Clark, Kirschner, & Sweller, 2012; Wu, 2011). Research findings on children’s learning is mixed, but there is evidence supporting this approach to children’s learning (Carnine et al., 1997; Clark et al., 2012; Gersten, 1985; Heasty, McLaughlin, Williams, & Keenan, 2012). This category implies the prediction that less diversity at every time point would correlate with higher achievement at early and subsequent time points. A third category is a combination of the first two. Here, strategy diversity is an early goal, with subsequent funneling of children’s strategic use to more effective strategies (not aimed at diversity per se). For example, in a Japanese approach to teaching first grade arithmetic (Murata & Fuson, 2006), teachers first elicit, value, and discuss childinvented strategies and encourage children to use diverse strategies to solve a variety of problems. However, later in the year, teachers focus on and encourage a particularly effective method (Henry & Brown, 2008; Murata & Fuson, 2006). This approach is consistent with the notion that variability supports early phases of learning, but expertise 1. Background We briefly discuss key findings from the literature because these findings contribute to the psychological foundation of our research. To ground our discussion in a specific mathematical context, we use arithmetic learning as a basis for describing how children develop various mathematical strategies. We then turn to issues of children’s learning of arithmetical strategies and classroom strategy diversity. 1.1. Intra-individual variability of strategies As children develop arithmetical competencies, they will use multiple strategies and choose among these strategies adaptively. This process supports their future learning (Coyle, 2001). Learning and development involve an ongoing competition among alternative strategies, with faster and more accurate strategies gradually becoming dominant (Kerkman & Siegler, 1993; Minsky, 1986; Siegler, 1993). Intraindividual variability tends to be greatest during periods of rapid learning (although substantial variability is present in relatively stable periods as well, Siegler, 2006). Intraindividual variability also tends to be cyclical as periods of lesser and greater variability alternate over the course of learning (Siegler, 2006). For instance, a child learning to solve simple addition or subtraction problems by counting all objects versus counting on from one addend may oscillate between the two strategies until the child becomes comfortable exclusively using the more efficient counting on strategy. One can ask whether possessing a larger number of different strategies as an individual is developmentally advantageous. Siegler (1995) reported that early variability predicts later achievement and posited the moderate experience hypothesis: use of multiple strategies is most likely when people have moderate amounts of experience with the problems being addressed. This leads to an inverted-U for the number, and therefore intra-individual variability, of strategies of individual children. However, consideration of variability alone may not be sufficient–strategies must be efficient and especially flexible/adaptable (Baroody & Dowker, 2003; Torbeyns, Verschaffel, & Ghesquière, 2005; Verschaffel, Torbeyns, De Smedt, Luwel, & Dooren, 2007). Adaptation of strategies to different contexts is viewed as a positive aspect of mathematics development (Baroody & Dowker, 2003; Lemaire & Siegler, 1995). Children tend to choose strategies adaptively; even if they have had fewer opportunities to learn and thus use less sophisticated strategies, children are as adaptive as children with more 2 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. 2. Method brings decreasingly variable performance across children (Siegler, 1994). Thus, this category implies the prediction that early diversity would correlate with higher achievement at an early time point (for material children were beginning to learn) as well as at subsequent time points, but that at subsequent time points less diversity in strategies (e.g., because a class coalesced around fewer, sophisticated strategies) would be positively related concurrently. This study compared the implications of these three categories. Thus, our research question was: Which of the three categories’ prediction is most consistent with the empirical results regarding the relations of strategy diversity in classrooms to children’s concurrent and subsequent math achievement? That is, how does classroom-level strategy diversity relate to children’s mathematics achievement, both concurrently and at later time points? Such a question has important implications both for research and practice. Data from the TRIAD cluster randomized experiment was used for the current study, which included 42 schools in two cities that were randomly assigned to one of the three conditions. 2.1. Participants Participants were the 1305 children from the original 42 schools and 106 classrooms in Buffalo, NY, and Boston, MA who had both a pretest and posttest in pre-K (Clements et al., 2011), and the pre-K and kindergarten teachers in those schools. At the pretest of the pre-K year, children ranged in age from 44 to 64 months, with a mean age of 52.06 months (SD = 4.09). 24.5% (n = 319) of children were identified as bilingual, and 51% (n = 664) of them were female. In addition, 2% of children reported their ethnicity as Native American; 53% of children reported their ethnicity as African-American; 4% reported their ethnicity as Asian/Pacific Islander; 22% reported their ethnicity as Hispanic; 19% reported their ethnicity as White non-Hispanic and < 1% other. From the pre-K year to first-grade year, the total attrition rate in this study was 13.74% (n = 179), and the attrition was unrelated to child demographic variables of bilingualism (χ2 = 0.02, df = 1, p = .89), gender (χ2 = 0.20, df = 1, p = .65), and race/ ethnicity (χ2 = 6.75, df = 5, p = .24). 1.3. Present study Although the three categories contribute to the wider debate regarding general categories of unassisted discovery learning, enhanced discovery learning, and learning from direct or explicit instruction (Alfieri et al., 2010), each of the three could be used in tandem with more than one of these general categories, meaning that research on classroom strategy diversity is a theoretically and empirically distinct subfield. Given that both intraindividual strategy variability and inter-individual strategy diversity are theoretically most important from infancy through the early childhood years (Siegler, 1994), we addressed our research questions in the earliest years of mathematics instruction. Data for this study came from the experimental study of the TRIAD (Technology-enhanced, Research-based, Instruction, Assessment, and professional Development) scale-up model. This model evaluated the implementation of a preschool math program (Clements, Sarama, Spitler, Lange, & Wolfe, 2011; Sarama, Lange, Clements, & Wolfe, 2012) and the first year of implementation of a follow-through intervention at the end of kindergarten (Sarama, Clements, Wolfe, & Spitler, 2012). At the preschool level (called pre-K hereafter to denote the prekindergarten year), the two experimental conditions were identical. Evaluations revealed a substantial and significant effect at the end of pre-K (effect size, g = 0.72, Clements et al., 2011). One experimental condition was assigned to experience a follow-through intervention into the kindergarten and first-grade years, in which teachers were taught about the pre-K intervention and ways to build upon it using learning trajectories. Kindergartners in both the follow-through condition (g = 0.38) and non-follow-through condition (g = 0.30) scored statistically significantly higher than children in the control condition, although the effect sizes were about half of that at the end of the pre-K year (Sarama, Clements, et al., 2012). Similarly, first graders in both the follow-through condition scored significantly higher than control children (g = 0.51 for those who received follow through intervention in kindergarten and first grade; g = 0.28 for non-follow through) and follow-through children scored significantly higher than non-followthrough children (g = 0.24, Clements, Sarama, Wolfe, & Spitler, 2013). Analyses used a Rasch score that incorporated two types of scores: correctness for all items and, for those with observable and variable strategies, a strategy-sophistication score. Because the mathematical assessment recorded the specific strategies children used, it allowed us to create a new measure of the diversity of strategies used within each class, at least in an individual assessment context, and relate this measure to the Rasch scores, concurrently and predictively. Most notably, the primary benefit of using extant TRIAD data for the present study is that the implementation of multiple interventions ensured a variety of pedagogical activities and teaching strategies, contributing to our analyses. 2.2. Assessments The Research-based Elementary Math Assessment (REMA, Clements, Sarama, Wolfe, & Day-Hess, 2008/2019) measures core mathematical abilities of children from age 3–8 years using an individual interview format with standardized administration protocol, videotaping, coding and scoring procedures. Abilities are assessed according to theoretically- and empirically-based developmental progressions (National Research Council, 2007; Sarama & Clements, 2009). Topics in number include verbal counting, object counting, subitizing, number comparison, number sequencing, connection of numerals to quantities, number composition and decomposition, adding and subtracting, and place value. Geometry topics include shape recognition, shape composition and decomposition, congruence, construction of shapes, and spatial imagery, as well as geometric measurement, patterning, and reasoning. The developmental progression of items as well as the fit of individual items has been reported in earlier research (Clements, Sarama, & Liu, 2008). The REMA measures mathematical competence as a latent trait in Item Response Theory (IRT), yielding a score that locates children on a common ability scale with a consistent, justifiable metric (allowing accurate comparisons, even across ages and meaningful comparison of change scores, even when initial scores differ, Wright & Stone, 1979). The 225 items are ordered by Rasch item difficulty; children stop after four consecutive errors on each of the number and geometry sections. Beyond correctness, the REMA also collects, codes, and scores children’s strategies when those are observable and relevant (e.g., processes of perceptual subitizing, or quick recognition of the number of objects in a set, would be neither). Arithmetic problems are a main sources of such strategies. An example item, including strategies that are common to many REMA items, is provided in Fig. 1. As an example of quite different strategies, Fig. 2 presents an item that assesses geometric composition of two-dimensional geometric shapes. These strategies are recoded into three levels of sophistication for submission to the Rasch model. Until this study, no other analyses were made of the diversity of the strategies. Training sessions for REMA assessors included orientation, demonstration, and practice, with a focus on standardized delivery. Subsequent individual practice sessions were taped and critiqued, with 98–100% error-free delivery required for certification. All assessment sessions were videotaped, and each item coded by a trained coder for 3 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. Fig. 1. Example arithmetic item with strategies. as 5 + 3. At a higher level in the developmental progression, they understand that starting a counting sequence at the first number can “stand in for” counting out the first addend–the number word substitutes for the counting acts from 1 to 5. That is, saying “fiiiive…” is a way of reifying that quantity. Children can then count on from that number, counting on through second addend (“fiiiive…, six, seven, eight. Eight!”). They may use a “rhythm of three” (“Doo – Day – Doo”) to create the sequence “six, seven, eight.” At the following level of the developmental progression, children use anticipatory thinking to keep track of those counts (especially when a rhythmic pattern is difficult, such as adding 5 + 7). For example, starting at 5 and counting seven counting words, while putting one finger up for each count until a finger pattern of 5 (five fingers on one hand, two on the other) is reached (or putting up seven fingers to begin, and lowering one finger for each count). Later, children learning de/composition strategies, such as 5 + 7 = 5 + 5 + 2 = 10 + 2 = 12 and so forth. Such learning trajectories were intended to develop teachers’ knowledge of children’s developmental progressions in learning that content. They were designed to develop teachers’ knowledge of the instructional activities created to teach children the content and processes defining the level of thinking in those progressions and to inform teachers of the rationale for the instructional design of each (e.g., why the curriculum teaches children to start at numbers other than one and keep counting and why it shows and then hides chips representing the first addend to encourage counting on). The learning trajectories assist curriculum enactment with fidelity in that they connect the developmental progressions to the instructional tasks, providing multiple guidelines or sources of stability in teachers’ instantiation of the instructional activities. They were also designed to motivate and support the use of formative assessment. The TRIAD scale-up model included extensive professional development, both training and coaching (for details see Clements et al., 2011). Kindergarten and first-grade teachers in schools assigned to TRIAD’s correctness and for solution strategy; 10% of the assessments were double-coded. Both assessors and coders were blind to the group membership of the children. Continuous coder calibration by an expert coder (one tape per coder per week) militated against drift. Calibration feedback was sent to coders, alerting them to any variance from coding protocols. Previous analysis of the assessment data showed that its reliability ranged from 0.93 to 0.94 on the total test scores (Clements et al., 2008); the reliability was 0.92 with the present population. In addition, the REMA had a correlation of 0.86 with a different measure of preschool mathematics achievement (Clements et al., 2008), the Child Math Assessment: Preschool Battery (Klein, Starkey, & Wakeley, 2000), and a correlation of 0.74 with the Woodcock-Johnson Applied Problems subscale for a pre-K specific subset of 19 items (Weiland et al., 2012). 2.3. Intervention We briefly describe the intervention used in the original experiment to elucidate the level of understanding early childhood teachers in our sample possessed regarding mathematics instruction. The TRIAD intervention helps schools implement the Building Blocks pre-K mathematics curriculum (Clements & Sarama, 2007/2013) that was designed using a comprehensive Curriculum Research Framework (Clements, 2007) to address numeric/quantitative and geometric/spatial ideas and skills. Woven throughout the Building Blocks curriculum are mathematical processes. That is, Building Blocks is structured on learning trajectories that were designed to develop teachers’ content knowledge by explicating the mathematical concepts, principles, and processes involved in each level and the relationships across levels and topics. As an arithmetical example, children may first “count all” as in solving a story problem involving 3 + 2 by counting out three chips, then counting out two, then counting all five starting at 1. They then extend their counting skills to larger numbers, enabling them to solve problems such 4 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. Fig. 2. Example geometry item with strategies. the treatment conditions were a priori aligned with the three categories. (Further, the impact of intervention on diversity scores across four time-points is small; at T1 and T3 they are statistically significant, but not at T2 and T4. Most important, the effect sizes were small; partial η2 ranges from 0.01 to 0.07). The control teachers taught a curriculum that, like the treatment conditions, was connected to standards and thus addressed the same topics, in approximately the same proportion. Of course, teachers in all three treatment conditions taught mathematics differently, with variance in their evocation, discussion, and support of different mathematical strategies (as indicated by follow-through condition were taught about the pre-K intervention and ways to build upon it. That is, they were shown the mathematics many of their entering children had learned. Teachers were also taught about the learning trajectories that extended through the kindergarten curriculum, including the developmental progressions and how to modify their extant curricula to more closely match the levels of thinking of their children; however, they were not provided with any specific activities (for details see Sarama, Clements, et al., 2012). Although discussion of strategies was encouraged in the TRIAD intervention, there was no focus on diversity of strategies per se at any grade, so none of 5 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. in psychometrics (e.g., Bauer & Hussong, 2009), each of these new dichotomously scored strategy variables were assumed to follow a Bernoulli distribution, a special-case of the binomial distribution for a single variable in which the variance of that dichotomously coded variable is represented by Eq. (1), where p is the mean of the dichotomously coded variable (i.e., the proportion of 1′s coded for that variable in the dataset). Because this investigation was focused on the creation of diversity scores that represented the variety of strategies employed by children within each particular classroom in the dataset, these strategy variances were calculated individually for every dichotomously coded strategy variable (k), which had been dis-aggregated from every REMA item (j) within each individual classroom (c) in the dataset following Eq. (1), below: teacher observations, Clements et al., 2011; Clements et al., 2013; Sarama, Clements, et al., 2012). 2.4. Teacher training and classroom observations Pre-K teachers in schools randomly selected to receive the intervention completed seven days of professional development (PD) the year before data collection, during which time they implemented the Building Blocks curriculum in a non-stressful context. They received an additional five days of professional development during the following year. They also received coaching about two times per month for both years. The kindergarten teachers received seven days of PD that started in the fall when they received the cohort from the preschool intervention. Assessors administered the REMA to all children in the fall of their pre-K year before the intervention started, at the end of their pre-K year, and at the end of their kindergarten year. 2 σ jkc = pjkc × (1 − pjkc ) (1) After calculating these variances following Eq. (1), they were summed within every classroom (c) to create a quantity that represented the total variance on the strategy codes within that classroom. Then, to correct for the fact that classrooms differed in the number of items (N) that children attempted as well as the number of strategies that were available for each of those items, the summed variances were divided by the median number of dichotomously scored strategy variables that were available on the REMA to the children within that classroom. Finally, the resulting decimal was multiplied by 100 for scale. In effect, this strategy diversity score is the degree to which children within a particular classroom varied inter-individually on their strategy usage on the items they were exposed to. The calculation of this diversity score is presented in Eq. (2). 2.5. Quantification of strategic diversity Given that literature currently focuses on intra-individual strategy variability, addressing our research questions meant facing the methodological challenge of quantifying inter-individual strategy diversity displayed by children within a particular classroom or school. Recall that the REMA was designed to be flexibly administered to children of varying levels of mathematical competence and it features previously validated (Clements et al., 2008, 2019) start- and stop-rules that allow the test-administrator to determine which REMA items a child will attempt, given their ability level. For practical considerations in the creation of a scoring algorithm for the quantification of strategic diversity, this aspect of the REMA meant that participants each attempted different items, and different numbers of items at that. In addition, the codes used by test administrators to indicate which strategy a child utilized were nominal in nature and reflected a unique way to solve that math problem (i.e., they were not ordinal, although in later stages of analyses they were recoded into three levels of sophistication for Rasch scoring.). Further, the REMA items also featured different total numbers of possible strategies, such that a single set of nominally coded strategic processes was impossible to define across all the items. Although item response theory (IRT) methods for estimating a latent psychological attribute from nominal data exist (Darrell Bock, 1972), these methods are more adept at estimating latent attributes in relatively large sample sizes both in terms of participants and the number of items administered. Nominal IRT approaches also produce more readily interpretable scores if all participants are administered the same items and those items feature the same nominal categories (Nering & Ostini, 2011), which are not characteristics of our REMA data. In addition, some items on the REMA have 10 or more possible strategic process codes, which would be more nominal categories than are typically recommended for use in a nominal IRT model. Moreover, the focus of this investigation is not on the latent attribute of strategic ability per se, but on the inter-individual variance in the deployment of strategies within a salient group of children (e.g., a classroom). Although nominal IRT models estimate variances for the latent attributes being measured, that latent variance would represent the degree to which children varied along the latent continuum, and would not necessarily indicate strategic diversity within the classroom in the way that is intended in this investigation. Therefore, we conceptualized and developed a novel procedure for quantifying strategic diversity within a classroom based on REMA nominal strategy codes. To enact this method, the existing REMA strategy codes for every item (j) were dis-aggregated into a number of new dichotomously coded variables, where zero indicated the absence of that strategy, and 1 indicated the use of that strategy. In this way, the number of possible strategy codes that were available for any given item dictated the number of newly created dichotomously coded strategy variables (k) for that REMA item. Then, following a common distributional assumption ⎛ ∑ σ2 ⎞ jkc ⎜ ⎟ DSc = i = 1 × 100 ⎜ P50 (Nc ) ⎟ ⎜ ⎟ ⎝ ⎠ N (2) These strategy diversity scores were saved for every classroom in the analytic dataset and analyzed as described in the following Results section. It should be noted that the diversity scores have the common property of being influenced by the number of individual children within a given classroom. For instance, hypothetically, if only one child were present in a classroom, the diversity score would necessarily be zero because inter-individual variance is impossible in that case. Given the focus of this investigation on inter-individual diversity in strategic processing, such a property of the diversity scores was not considered problematic; however, further analysis was limited to classrooms with at least five study children to provide meaningful inferences about strategy diversity. 3. Results Correlations and multi-level linear regression analyses were conducted to examine the three predictions regarding relations of strategy diversity in classrooms to children’s concurrent and subsequent math achievement. To address our research questions, we conducted an analysis of the resulting classroom strategy diversity scores that unfolded in three general stages. First, we examined changes in strategy diversity across time points (i.e. pre-K through first grade) through a comparison of the average diversity scores across classrooms within each time-point. Then, we tested bivariate correlations among strategy diversity and standardized mathematics achievement across the four time-points included in this study. Third, we conducted two-level linear models with random intercepts to examine the predictive power of classroom diversity scores in predicting mathematics achievement at each time-point, while also adjusting for the classroom-nested structure of these educational data. Furthermore, multi-level models incorporating child background characteristics as level-one covariates (i.e., baseline mathematics achievement, age, socio-economic status, 6 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. Table 1 Descriptive statistics and bivariate correlations. Bivariate Correlations Variable Math Achievement Strategy Diversity Mean SD Math Achievement Strategy Diversity T1 T2 T3 T4 T1 T2 T3 T4 −3.22 −1.97 −1.09 −0.07 0.82 0.71 0.68 0.70 1.00 0.57** 0.55** 0.54** 1.00 0.76** 0.73** 1.00 0.81** 1.00 T1 T2 T3 T4 11.13 15.02 10.87 9.33 8.31 5.16 3.67 2.51 0.24** 0.14** −0.11** −0.09* 0.26** 0.11** −0.12** −0.03 0.27** 0.17** −0.16** −0.06 0.26** 0.17** −0.19** −0.08* T1 T2 T3 T4 1.00 0.27** −0.07* 0.06 1.00 0.01 0.03 1.00 0.18** 1.00 Note. T1 = pretest of pre-K year; T2 = posttest of pre-K year; T3 = posttest of Kindergarten year; T4 = posttest of 1st grade year. ** p < .01. * p < .05. intervention versus control condition, and bilingualism) were performed to fully model the pattern of influence of classroom-level strategic diversity scores on math achievement. Results of each stage of this investigation follow. indicated that a non-trivial amount of the variation in math achievement occurred between classrooms. Therefore, multi-level regression analysis was considered appropriate to account for the classroomnested structure of these data. 3.1. Score description 3.3.1. Means-as-outcomes models To initially investigate how diversity scores can contribute to predicting math achievement at the classroom level, means-as-outcomes models that only contained the classroom-level predictor (i.e., classroom strategic diversity scores) were performed. See Table 2 for the specific coefficients discussed in this section. After controlling for the classroom diversity scores, the clustering effects were reduced (conditional ICCT2 = 13.26%, conditional ICCT3 = 14.29%, and conditional ICCT4 = 13.26%), which indicated that less variation of math achievement occurred between classrooms at each time-point, after accounting for classroom level strategic diversity. In the analysis predicting math achievement at the end of the pre-K year, the proportion of variance explained (PVE) by the model with diversity scores at T1 and T2 was equal to 35.93%. That is, about 36% of the between-classroom variance in math achievement was accounted for by the combination of diversity scores at two time-points. Specifically, more classroom strategy diversity at the pretest of pre-K predicted higher achievement scores at the pre-K posttest (B = 0.021, SE = 0.004, z = 5.57, p < .001), and the concurrent diversity score of the pre-K posttest was not a significant predictor. Then, in the multi-level regression analysis predicting math achievement at the end of the kindergarten year, a larger amount of the true between-classroom variance (PVE = 55.57%) in math achievement was accounted by diversity scores at T1, T2, and T3. The three diversity scores were all significant predictors of math achievement at the end of the kindergarten year (BT1 = 0.015, SE = 0.003, z = 4.84, p < .001; BT2 = 0.012, SE = 0.005, z = 2.24, p = .025; BT3 = -0.027, SE = 0.008, z = -3.34, p = .001). However, they yielded different directions in that prediction. Specifically, children from classrooms with more strategy diversity in the earlier learning phases (pre- and posttest of the pre-K year) demonstrated higher math achievement at the end of kindergarten, but children in classrooms with higher diversity at the end of kindergarten demonstrated lower concurrent achievement. Lastly, in the analysis predicting math achievement at the end of the first-grade year, an increasingly larger amount of the between-classroom variance (PVE = 59.78%) in math achievement was accounted for by classroom diversity scores at T1, T2, T3, and T4. The diversity scores at the pre-K and kindergarten year all predicted first-grade math achievement (BT1 = 0.015, SE = 0.004, z = 4.12, p < .001; BT2 = 0.013, SE = 0.006, z = 2.28, p = .023; BT3 = −0.024, SE = 0.007, z = -3.35, p = .001), while the concurrent diversity score Table 1 presents means and standard deviations of math achievement and diversity scores at four-time points: pre- and posttest of the pre-K year (T1 and T2) and posttests at the end of the kindergarten and first-grade years (T3 and T4). As can be seen, children’s average standardized math scores gradually increased across time (M range = −3.22 to −0.07, SD range = 0.68–0.82) while the average classroom strategy diversity scores, as well as their level of variation (i.e., standard deviation), decreased from the end of the pre-K year to the first-grade year (M range = 9.33–11.13, SD range = 2.51–8.31). The decreases in diversity scores were statistically significant, [F (2, 1458) = 410.79, p < 0.001, partial η2 = 0.36]. In other words, strategy use within classrooms was significantly more diverse in the earlier learning phases (T1 and T2) than the later learning phases (T3 and T4), and, meanwhile, the level of strategy diversity across different classrooms became less heterogenous (i.e., SDs decreased). 3.2. Bivariate correlations 3.2.1. Concurrent time point analysis The bivariate correlations (presented in Table 1) revealed positive and significant correlations between classroom strategy diversity and concurrent achievement at pretest (r = 0.24, p < .01) and posttest (r = 0.11, p < .01) in the pre-K year. However, at the later time points (i.e., the end of kindergarten and end of first grade) diversity in strategies turned to be negatively correlated with the concurrent math achievement scores (r = −0.16, p < .01 for kindergarten; r = −0.08, p < .05 for first grade). 3.2.2. Subsequent time point analysis Furthermore, diversity scores at the pre-K year positively and significantly correlated with math achievement at subsequent time points (r ranging from 0.17 to 0.27, p < .01) while strategy diversity of the kindergarten year negatively correlated with achievement at 1st grade (r = −0.19, p < .01). 3.3. Multi-level linear regression analyses The intraclass correlations (ICC, Raudenbush & Bryk, 2002) in the predictions of child-level math achievement at the three time-points (T2, T3, and T4) were 19.29%, 26.13%, and 35.18%, respectively, which 7 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. Table 2 Summary of multi-level linear regression predicting early mathematics (Means-as-Outcomes Models, Level 2 Predictors only). Parameters Estimates SE z p Predicting Math Achievement at the End of Pre-K Year Fixed Effects Diversity Score T1 0.021 Diversity Score T2 0.007 0.004 0.006 5.570 1.130 < 0.001*** 0.259 Random Effects (variance components) 0.063 Intercept, μ0j Residual, rij 0.409 0.014 0.017 4.597 24.366 < 0.001*** < 0.001*** Predicting Math Achievement at the End of Kindergarten Year Fixed Effects Diversity Score T1 0.015 0.003 Diversity Score T2 0.012 0.005 Diversity Score T3 −0.027 0.008 4.840 2.240 −3.340 < 0.001*** 0.025* 0.001** Random Effects (variance components) 0.055 Intercept, μ0j Residual, rij 0.332 0.013 0.016 4.143 20.402 < 0.001*** < 0.001*** Predicting Math Achievement at the End of First-Grade Year Fixed Effects Diversity Score T1 0.015 Diversity Score T2 0.013 Diversity Score T3 −0.024 Diversity Score T4 −0.017 0.004 0.006 0.007 0.015 4.120 2.280 −3.350 −1.110 < 0.001*** 0.023* 0.001** 0.266 Random Effects (variance components) Intercept, μ0j 0.074 Residual, rij 0.325 0.018 0.019 4.015 17.381 < 0.001*** < 0.001*** PVE ρ ( ρ )̂ 35.93% 19.29% (13.26%) 55.57% 26.13% (14.29%) 59.78% 35.81% (18.56%) Note. T1 = pretest of pre-K year; T2 = posttest of pre-K year; T3 = posttest of kindergarten year; T4 = posttest of first-grade year, PVE = portion of variance explained, ρ = intraclass correlation, ρ ̂ = conditional intraclass correlation. *** p < .001. ** p < .01. * p < .05. at T4 was not a significant predictor (BT4 = .−017, SE = 0.015, z = −1.11, p = .266). In terms of the directions of these coefficients, more classroom strategy diversity in the earlier learning phases (T1 and T2) and less during the later learning phases (T3 and T4) appears to support children’s math achievement, given the results of this classroom level model. and/or T4) within math classrooms could help children achieve better math learning outcomes, even after controlling for a variety of childlevel covariates. 4. Discussion and implications Most educators agree that the strategies children use to solve mathematical problems are an important component of their learning (Biddlecomb & Carr, 2011; Carpenter et al., 1998; Carr et al., 1999; Fennema et al., 1996; Pang & Kim, 2018; Sherin & Fuson, 2005; Wansart, 1990). Pedagogically, instruction focused on arithmetic strategies can be effective for all children (Franke et al., 2007; Jacobs & Empson, 2016) including, and perhaps especially, for those with special needs (e.g., Fuchs et al., 2010; Naglieri & Johnson, 2000; Powell & Fuchs, 2015; Rockwell et al., 2011). However, there is little research that investigates whether classrooms that evince a broader diversity of strategies engender higher mathematics achievement. We begin by discussing the results in terms of the predictions of the three categories of approaches to children’s learning of strategies, especially diverse strategies. Although we did not measure teachers’ instruction, the results have important implications because there are quite different pedagogical approaches for teaching strategies to children that have different goals and approaches have a strong effect on strategy development (Clements, Agodini, & Harris, 2013; McNeal, 1995). Therefore, we discuss the pedagogies used in each of the three categories and draw implications for them from our findings. 3.3.2. Full multi-level models with random intercepts To further verify the observed patterns of mathematics achievement prediction, baseline math scores, age, socio-economic status, intervention versus control condition, and bilingualism, were included as childlevel covariates in the two-level models. The mean-centering of predictor variables within multi-level models in educational research has been a much-discussed topic within the methodological literature (for technical treatments of this issue see Plewis, 1989; Raudenbush, 1989). However, there is currently not a clear rule of thumb for this modeling decision (Paccagnella, 2006). In this study, we did not impose centering strategies to the continuous predictors at level-one for a number of reasons. For example, interpreting level-one predictors is not a primary goal of the present study, and there was no strong collinearity among all predictors, making mean-centering extraneous. Table 3 presents the specific coefficients discussed in this section. Compared to the previous models with classroom-level predictors only, the significance and strength of the coefficients associated with the diversity scores was weakened after accounting for all of these child-level control variables in predicting math achievement of children in kindergarten and first-grade years. The child-level covariates altered the pattern of significance of the diversity scores in predicting pre-K math achievement (see Table 3 for specific coefficients). Nevertheless, the directions of diversity score parameters from the kindergarten and first grade regression models remained as in the previous model, which provided further evidence for the observed pattern that more strategy diversity in the earlier years (T1 and T2) and less in the later years (T3 4.1. Classroom strategy diversity predicting children’s learning We examined patterns between class-level diversity and children’s achievement in light of three possible categories. The first category values strategy diversity at every time point of mathematical learning; the second category suggests that children should use a single, preferred 8 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. from classrooms with higher levels of diverse child-generated strategies at early time points have higher math achievement scores. The same set of means-as-outcomes models also indicated that diversity was either unrelated or negatively and significantly related to concurrent achievement (not including the pre-K pretest). For instance, in the pre-K year, spring (posttest) diversity was positively yet nonsignificantly related to spring pre-K math achievement. The same pattern held for spring first grade diversity and spring first grade achievement. In kindergarten, spring diversity was significantly and negatively related to spring achievement. In comparison to the previously described findings between early diversity and later achievement, these findings suggest that concurrent diversity does appear to have the same benefits for concurrent achievement. In the second set of multi-level models, we found that after controlling for baseline score, age, intervention, socio-economic status, and bilingual status, greater classroom strategy diversity at the pretest of pre-K was positively but not significantly related to higher achievement scores at the pre-K posttest. Additionally, the concurrent diversity score of the pre-K posttest was not a significant predictor. Higher diversity in the fall of pre-K predicted higher achievement in the spring of kindergarten, after controlling for the same set of covariates, but concurrent spring diversity in kindergarten was not related to spring achievement. Counter to our earlier findings, only spring first-grade diversity predicted spring first-grade achievement. Recall that these results accounted for a substantial portion of the variance, even after controlling for a variety of child-level covariates. These analyses, therefore, provide support most consistent with the approach of the third category. That is, early diversity predicted higher achievement for material that children were just beginning to learn as well as their subsequent learning. However, in the later phases of learning, less diversity in strategies was positively related concurrently, perhaps because the children in a class coalesced around fewer, more sophisticated strategies. Table 3 Summary of multi-level linear regression analysis (full model) early mathematics. Parameters Estimates SE z p Predicting Math Achievement at the End of Pre-K Year Level 1 Predictors Baseline 0.436 0.022 19.890 Age 0.026 0.004 5.840 Bilingual 0.100 0.042 2.390 FRL −0.155 0.045 −3.430 Intervention 0.424 0.052 8.110 < 0.001*** < 0.001*** 0.017* 0.001** < 0.001*** Level 2 Predictors Diversity Score T1 Diversity Score T2 0.212 0.497 0.004 −0.003 0.003 0.005 1.250 −0.680 Predicting Math Achievement at the End of Kindergarten Year Level 1 Predictors Baseline 0.686 0.024 28.360 Age 0.009 0.004 2.200 Bilingual 0.139 0.037 3.770 FRL −0.127 0.042 −3.050 Intervention −0.114 0.045 −2.500 < 0.001*** 0.028* < 0.001*** 0.002** 0.012* Level 2 Predictors Diversity Score T1 Diversity Score T2 Diversity Score T3 0.046* 0.086 0.144 0.005 0.006 −0.008 0.002 0.004 0.005 1.990 1.720 −1.460 Predicting Math Achievement at the End of First-Grade Year Level 1 Predictors Baseline 0.770 0.027 28.150 Age −0.003 0.004 −0.780 Bilingual 0.008 0.040 0.190 FRL −0.217 0.044 −4.900 Intervention −0.023 0.042 −0.540 < 0.001*** 0.437 0.848 < 0.001*** 0.586 Level 2 Predictors Diversity Score T1 Diversity Score T2 Diversity Score T3 Diversity Score T4 0.286 0.210 0.080 0.035* 0.003 0.004 −0.008 −0.016 0.002 0.004 0.005 0.007 1.070 1.250 −1.750 −2.110 4.2. Teaching arithmetical strategies and classroom strategy diversity Note. T1 = pretest of pre-K year; T2 = posttest of pre-K year; T3 = posttest of kindergarten year; T4 = posttest of first-grade year, FRL = Free/Reduced price lunch status. *** p < .001. ** p < .01. * p < .05. Bilingual, FRL, and whether received TRIAD Intervention were coded as 1(Yes) and 0 (No). Each of the three categories not only make unique predictions regarding the relationship of strategy diversity in classrooms to children’s concurrent and subsequent achievement but have very different implications for curriculum and teaching. These are important, given the variety of messages teachers receive regarding the type of instruction (i.e., ranging from direct instruction to unassisted discovery) that is best for children’s learning. To explicate these pedagogical implications, we use a gardening metaphor that has the goal of creating healthy and beautiful trees and shrubs. The first category aligns with unchecked development, which posits the benefits of providing nutrients and sunshine and otherwise allowing the plants to grow as they will naturally. In education, this pedagogical approach values as much strategy diversity as possible and favors a consistent evocation and sharing of a variety of child-generated strategies and alternative strategies (e.g., Carpenter, Fennema, Franke, Levi, & Empson, 2014; Choppin, 2011; Stein et al., 2008). A goal of such constructivist-based approaches is for children to use and share such different strategies (e.g., Carpenter et al., 2014), thus encouraging classroom strategy diversity, with beneficial effects on learning, presumably based on increases in conceptual understanding (e.g., observing that different strategies yield the same mathematical result) and access to different, often more effective and efficient strategies (Carpenter et al., 1998; Gersten, Beckmann, Clarke, Foegen, Marsh, Star, & Witzel, 2009; Peterson, Carpenter, & Fennema, 1989). This pedagogical approach was advised by Paulos (1991): Stress a few basic principles and leave most of the details to the student (from Baroody, Lai, & Mix, 2006; 1991). This approach may appear aligned with unassisted discovery learning, but it could easily be used in enhanced discovery approaches using the enhancements of generation and elicited explanations (Alfieri et al., 2010). strategy and thus evince less diversity at the classroom-level; the third category indicates that children will use a diverse array of strategies when first learning material and then coalesce around a smaller set of strategies as they master material—and thus, early diversity would be related to mathematical achievement at subsequent time points. We used multi-level regression analyses to (1) investigate how diversity scores can contribute to predicting math achievement at the classroom level and to (2) confirm the patterns of mathematics achievement prediction after controlling for child-level covariates including baseline math scores, age, socio-economic status, intervention versus control condition, and bilingualism using two-level models. Both sets of models reveal similar patterns between class-level diversity scores and children’s achievement. We found that children’s standardized mathematics scores increased through the four time points, but the average classroom strategy diversity scores, as well as their level of variation, increased in the preschool year, and then decreased at the two subsequent time points. In the first set of means-as-outcomes models, pre-K fall strategy diversity was positively and significantly related to spring achievement in the pre-K and kindergarten years. Additionally, kindergarten diversity scores were positively and significantly related to first-grade spring achievement. Taken together, these findings imply that children 9 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. patterns. We note that in the means-as-outcomes models, patterns of findings conformed more precisely to the pruning approach. It is easy for a lay person to see the results of pruning and shearing as indistinguishable in both gardening and education. However, the processes and the effects are distinct. Shearing is a more rigid intervention, including cutting the leader, selecting a new terminal bud, setting the taper of the tree, trimming side branches, and removing growth seen as problematic because it does not fit a pre-determined shape. It can leave mutilated branches and restrict new growth to the outside of the plants, which results in dead, unhealthy, leafless interiors. Educationally, shearing may help children use mathematically accurate procedures, but curiosity, creativity, and individual development may be curtailed (Baroody & Dowker, 2003; Clements & Sarama, 2014; Kamii & Dominick, 1998; Steffe, 1994; Torbeyns et al., 2005). Pruning, on the other hand, adheres to the plant's natural growth habit and selectively abandons branches anywhere in the shrub where growth is stalled, thus opening up the center of the plant, allowing air and light to maintain healthy growth. In education, then, pruning supports children in creatively inventing their own procedures and then later letting go of unsuccessful and less sophisticated strategies to develop those that are both consistent with children’s natural development and stronger mathematically. This could be done via direct instruction but may be more positively with enhanced-discovery instruction featuring carefully selected and scaffolded arithmetical problems, opportunities for children to explain their ideas, and the provision of timely feedback (Alfieri et al., 2010). In both gardening and education, pruning is a more difficult process than shearing. The second category aligns with shearing, or consistently cutting only the top bits of growth to shape the plants into formal hedges for aesthetic or privacy purposes. In education, this competing pedagogy promotes direct instruction as more efficient and mathematically rigorous (see Carnine et al., 1997; Clark et al., 2012; Wu, 2011). In a similar vein, many educators believe that one must teach grade-level goals directly, in contrast to an approach that considers learning trajectories (e.g., see Clements & Sarama, 2014; Sarama & Clements, 2009) in which instruction is based on levels of thinking that most children develop as they build up to competence in such grade-level goals (e.g., Confrey, 2019). The research on such direct teaching and enforcement of specific efficient strategies is contradictory. There is a good body of evidence supporting this approach (Carnine et al., 1997; Clark et al., 2012; Gersten, 1985; Heasty et al., 2012); however, others indicate that teaching formal strategies such as arithmetical algorithms, even with manipulatives, leaves some children even into fourth grade with limited conceptual structures and alternative strategies (Biddlecomb & Carr, 2011). Supporting a more moderate implementation of this second category, studies suggest that explicitly teaching specific strategies can have a positive impact on children’s numerical and arithmetical knowledge (Clarke, Doabler, Nelson, & Shanley, 2014), such as specific teaching of the counting on procedure (Saxton & Cakir, 2006) or teaching a specific sequence of strategies (e.g., Thornton, 1979). These approaches are well aligned with general explicit instruction, although their goals differ as far as teaching one or multiple strategies (Alfieri et al., 2010). The third category aligns with pruning, a synthesis of these two. That is, the goal in gardening is to support natural growth but also to selectively trim branches that are not working and develop an increasingly healthy plant with an aesthetic structure. In education, this would favor an early encouragement and support of child-generated strategies, and only considerably later selectively focusing support on new or strong branches, thus allowing (children’s) abandonment of branches that are not growing or sustaining as well. Thus, pruning combines pedagogical approaches that encourage strategy diversity with subsequent support for children to construct more effective strategies (not aimed at diversity per se). Elaborating on the Japanese approach to teaching first-grade arithmetic mentioned previously, teachers first encourage child- invented strategies then focus on and support a particularly effective method later in the year (Henry & Brown, 2008; Murata & Fuson, 2006), in particular the break-apart to make ten strategy (BAMT, such as 9 + 6—what is needed to add to 9 to make 10, subtract 1 from the 6, then add 10 and 5). The BAMT strategy is carefully extended to more challenging numbers and reintroduced in additional contexts. Thus, the classrooms start with a diversity of strategies, but then narrow their focus to shared, effective strategies. This approach does not align with discovery, enhanced discovery, or explicit instruction (Alfieri et al., 2010), but combines these differently at separate phases of the learning process. In most cases, results from this study aligned more with the pruning approach than with the unchecked development or the shearing approaches. That is, across both sets of models, the earliest phases of learning classroom strategy diversity tended to be positively related to concurrent achievement (contrary to the shearing approach). However, at subsequent time points strategy diversity was usually negatively related or unrelated to concurrent achievement (contrary to the unchecked development approach). Classroom strategy diversity was predictive of achievement from one to three time periods in the future (again contrary to the shearing approach). These relationships were consistent with the implications of pruning approach with one exception in the multi-level linear regression models: we expected kindergarten classroom diversity to positively and significantly predict firstgrade achievement. It may be that the range of problems that most children attempted was too narrow and thus restricted the achievement measure. If true, future research examining a wider range of more difficult (e.g., multiplicative) arithmetic problems may reveal different 4.3. Caveats and future research This is an exploratory study and there are several limitations. First, while the data justify claims about the amount of strategy diversity evident in children’s responses, we did not specifically measure teacher instruction of various mathematical strategies1. Future research can examine the links between strategy instruction, class-level strategy diversity, and children’s mathematical achievement over time. Additionally, although the broader study was a cluster randomized trial, classroom strategy diversity was not randomly assigned, and thus results are only correlational. Further, diversity was examined in children’s responses to an assessment. Future research, therefore, could examine children’s strategies during instruction, as well as randomly assigning classrooms to interventions embodying the three pedagogical approaches identified here. Such research might also investigate generalizability to other-aged children and thus other topics and levels of mathematical content. Of course, not all instruments collect data on strategies; the present results suggest this may be a useful assessment practice for future research. 4.4. Summary Keeping these caveats in mind, the findings have important implications for research and practice. They do not support one goal for strategy diversity or a single pedagogical approach. Indeed, they support the pruning approach that synergistically combines the other two approaches and allows multiple general pedagogical stances. Future research is needed, on strategy diversity per se as in the present study as well as via experimental studies with treatments that align to the three pedagogical approaches, unchecked development, shearing, and pruning. We also caution that some children will likely do quite well in an unchecked development environment and perhaps others with shearing approaches. Nevertheless, the evidence of this study suggests 1 Including, as one reviewer pointed out, variables such as other aspects of teachers' behavior that may be related but not dealing with strategies explicitly, such as autonomy-supportive versus controlling behavior. 10 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. that the combination of approaches—children’s invention in a guided discovery setting with teachers who encourage diverse strategy use for a considerable period, followed by more focused support on mathematically sophisticated and effective strategies–may be superior for most young children. Confrey, J. (2019). A synthesis of research on learning trajectories/progressions in mathematics. Retrieved from Organisation for Economic Co-operation and Development website: http://www.oecd.org/education/2030/A-Synthesis-ofResearch-on-Learning-Trajectories-Progressions-in-Mathematics.pdf. Coyle, T. R. (2001). Factor analysis of variability measures in eight independent samples of children and adults. Journal of Experimental Child Psychology, 78, 330–358. Dinsmore, D. L. (2017). Toward a dynamic, multidimensional research framework for strategic processing. Educational Psychology Review, 29(2), 235–268. https://doi.org/ 10.1007/s10648-017-9407-5. Dumas, D. (2019). Strategic processing within and across domains of learning. In D. L. Dinsmore, L. K. Fryer, & M. M. Parkinson (Eds.). Handbook of strategies and strategic processing: Conceptualization, intervention, measurement, and analysis. New York, NY: Routledge. Fennema, E. H., Carpenter, T. P., Franke, M. L., Levi, L., Jacobs, V. R., & Empson, S. B. (1996). Mathematics instruction and teachers' beliefs: A longitudinal study of using children's thinking. Journal for Research in Mathematics Education, 27, 403–434. Franke, M. L., Kazemi, E., & Battey, D. (2007). Mathematics teaching and classroom practice. In F. K. Lester (Vol. Ed.), Second handbook of research on mathematics teaching and learning: Vol. 1, (pp. 225–256). New York, NY: Information Age Publishing. Fuchs, L. S., Powell, S. R., Seethaler, P. M., Cirino, P. T., Fletcher, J. M., Fuchs, D., & Hamlett, C. L. (2010). The effects of strategic counting instruction, with and without deliberate practice, on number combination skill among students with mathematics difficulties. Learning and Individual Differences, 20(2), 89–100. https://doi.org/10. 1016/j.lindif.2009.09.003. Gersten, R. (1985). Direct instruction with special education students: A review of evaluation research. The Journal of Special Education, 19, 41–58. Gersten, R., Beckmann, S., Clarke, B., Foegen, A., Marsh, L., Star, J. R., & Witzel, B. (2009). Assisting students struggling with mathematics: Response to Intervention (RtI) for elementary and middle schools (NCEE 2009-4060). Retrieved from National Center for Education Evaluation and Regional Assistance, Institute of Education Sciences, U.S. Department of Education website: http://ies.ed.gov/ncee/wwc/ publications/practiceguides/. Heasty, M., McLaughlin, T. F., Williams, R. L., & Keenan, B. (2012). The effects of using direct instruction mathematics formats to teach basic math skills to a third grade student with a learning disability. Academic Research International, 2(3), 382–387. Henry, V. J., & Brown, R. S. (2008). First-grade basic facts: An investigation into teaching and learning of an accelerated, high-demand memorization standard. Journal for Research in Mathematics Education, 39(2), 153–183. Hickendorff, M., van Putten, C. M., Verhelst, N. D., & Heiser, W. J. (2010). Individual differences in strategy use on division problems: Mental versus written computation. Journal of Educational Psychology, 102(2), 438–452. https://doi.org/10.1037/ a0018177. Jacobs, V. R., & Empson, S. B. (2016). Responding to children’s mathematical thinking in the moment: An emerging framework of teaching moves. ZDM Mathematics Education, 48, 185–197. https://doi.org/10.1007/s11858-015-0717-0. Kam, M. (2017). Fostering students' mathematics communication in measurement: A grade 3 teacher's perceptions and practices. (Doctoral Dissertation), Ontario Institute for Studies in Education of the University of Toronto, Toronto, Canada. Retrieved from https://tspace.library.utoronto.ca/bitstream/1807/79024/1/Kam_Mimi_ 201706_PhD_thesis.pdf. Kamii, C., & Dominick, A. (1998). The harmful effects of algorithms in grades 1–4. In L. J. Morrow, & M. J. Kenney (Eds.). The teaching and learning of algorithms in school mathematics (pp. 130–140). Reston, VA: National Council of Teachers of Mathematics. Kerkman, D. D., & Siegler, R. S. (1993). Individual differences and adaptive flexibility in lower-income children's strategy choices. Learning and Individual Differences, 5(2), 113–136. Klein, A., Starkey, P., & Wakeley, A. (2000). Child Math Assessment: Preschool Battery (CMA). Berkeley, CA: University of California, Berkeley. Lemaire, P., & Siegler, R. S. (1995). Four aspects of strategic change: Contributions to children's learning of multiplication. Journal of Experimental Psychology: General, 124(1), 83–97. Magliano, J. P., Trabasso, T., & Graesser, A. (1999). Strategic processing during comprehension. Journal of Educational Psychology, 91(4), 615–629. https://doi.org/10. 1037/0022-0663.91.4.615. McNeal, B. (1995). Learning not to think in a textbook-based mathematics class. Journal of Mathematical Behavior, 14(2), 205–234. Minsky, M. (1986). The society of mind. New York, NY: Simon and Schuster. Murata, A., & Fuson, K. C. (2006). Teaching as assisting individual constructive paths within an interdependent class learning zone: Japanese first graders learning to add using 10. Journal for Research in Mathematics Education, 37(5), 421–456. https://doi. org/10.2307/30034861. Naglieri, J. A., & Johnson, D. (2000). Effectiveness of a cognitive strategy intervention in improving arithmetic computation based on the PASS theory. Journal of Learning Disabilities, 33(6), 591–597. https://doi.org/10.1177/002221940003300607. National Research Council (2007). Taking science to school: Learning and teaching sciences in grades K-8. Washington, DC: National Academies Press. Nering, M. L., & Ostini, R. (2011). Handbook of polytomous item response theory models. New York, NY: Taylor & Francis. Paccagnella, O. (2006). Centering or not centering in multilevel models? The role of the group mean and the assessment of group effects. Evaluation Review, 30(1), 66–85. https://doi.org/10.1177/0193841X05275649. Pang, J. S., & Kim, J. W. (2018). Characteristics of Korean students’ early algebraic thinking: A generalized arithmetic perspective. In C. Kieran (Ed.). Teaching and learning algebraic thinking with 5- to 12-year-olds: The global evolution of an emerging field of research and practice (pp. 141–165). Cham: Springer International Publishing. Declaration of Competing Interest The authors declare that they have no conflicts of interest. References Alexander, P. A., Grossnickle, E. M., Dumas, D., & Hattan, C. (2018). A retrospective and prospective examination of cognitive strategies and academic development: Where have we come in twenty-five years? In A. O’Donnell (Ed.). Oxford Handbook of Educational Psychology. Oxford, UK: Oxford University Press. Alexander, P. A., & Judy, J. E. (1988). The interaction of domain-specific and strategic knowledge in academic performance. Review of Educational Research, 58(4), 75–404. https://doi.org/10.3102/00346543058004375. Alfieri, L., Brooks, P. J., Aldrich, N. J., & Tenenbaum, H. R. (2010). Does discovery-based instruction enhance learning? Journal of Educational Psychology, 103(1), 1–18. https://doi.org/10.1037/a0021017. Baroody, A. J., & Dowker, A. (2003). The development of arithmetic concepts and skills: Constructing adaptive expertise. Mahwah, NJ: Erlbaum. Baroody, A. J., Lai, M.-L., & Mix, K. S. (2006). The development of young children's number and operation sense and its implications for early childhood education. In B. Spodek, & O. N. Saracho (Eds.). Handbook of research on the education of young children (pp. 187–221). Mahwah, NJ: Erlbaum. Bauer, D. J., & Hussong, A. (2009). Psychometric approaches for developing commensurate measures across independent studies: Traditional and new models. Psychological Methods, 14(2), 101–125. https://doi.org/10.1037/a0015583. Biddlecomb, B., & Carr, M. (2011). A longitudinal study of the development of mathematics strategies and underlying counting schemes. International Journal of Science and Mathematics Education, 9, 1–24. Darrell Bock, R. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1), 29–51. https://doi. org/10.1007/BF02291411. Carnine, D. W., Jitendra, A. K., & Silbert, J. (1997). A descriptive analysis of mathematics curricular materials from a pedagogical perspective: A case study of fractions. Remedial and Special Education, 18(2), 66–81. Carpenter, T. P., Fennema, E. H., Franke, M. L., Levi, L., & Empson, S. B. (2014). Children's mathematics: Cognitively guided instruction (2nd ed.). Portsmouth, NH: Heinemann. Carpenter, T. P., Franke, M. L., Jacobs, V. R., Fennema, E. H., & Empson, S. B. (1998). A longitudinal study of invention and understanding in children’s multidigit addition and subtraction. Journal for Research in Mathematics Education, 29(1), 3–20. https:// doi.org/10.2307/749715. Carr, M., Jessup, D. L., & Fuller, D. (1999). Gender differences in first-grade mathematics strategy use: Parent and teacher contributions. Journal for Research in Mathematics Education, 30, 20–46. Choppin, J. (2011). The impact of professional noticing on teachers' adaptations of challenging tasks. Mathematical Thinking and Learning, 13, 175–197. Clark, R. E., Kirschner, P. A., & Sweller, J. (2012). Putting students on the path to learning: The case for fully guided instruction. American Educator, 36(1), 6–11. Clarke, B., Doabler, C. T., Nelson, N. J., & Shanley, C. (2014). Effective instructional strategies for kindergarten and first-grade students at risk in mathematics. Intervention in School and Clinic. https://doi.org/10.1177/1053451214560888. Clements, D. H. (2007). Curriculum research: Toward a framework for ‘research-based curricula. Journal for Research in Mathematics Education, 38(1), 35–70. https://doi. org/10.2307/30034927. Clements, D. H., Agodini, R., & Harris, B. (2013). Instructional practices and student math achievement: Correlations from a study of math curricula. Retrieved from NCEE (National Center for Education Evaluation and Regional Assistance) website: http:// ies.ed.gov/ncee/pubs/20134020/. Clements, D. H., & Sarama, J. (2007/2013). Building Blocks, Volumes 1 and 2. Columbus, OH: McGraw-Hill Education. Clements, D. H., & Sarama, J. (2014). Learning and teaching early math: The learning trajectories approach (2nd ed.). New York, NY: Routledge. Clements, D. H., Sarama, J., & Liu, X. (2008). Development of a measure of early mathematics achievement using the Rasch model: The Research-based Early Maths Assessment. Educational Psychology, 28(4), 457–482. https://doi.org/10.1080/ 01443410701777272. Clements, D. H., Sarama, J., Spitler, M. E., Lange, A. A., & Wolfe, C. B. (2011). Mathematics learned by young children in an intervention based on learning trajectories: A large-scale cluster randomized trial. Journal for Research in Mathematics Education, 42(2), 127–166. https://doi.org/10.5951/jresematheduc.42.2.0127. Clements, D. H., Sarama, J., Wolfe, C. B., & Day-Hess, C. A. (2008/2019). REMA—Research-based Early Mathematics Assessment. Denver, CO: Kennedy Institute, University of Denver. Clements, D. H., Sarama, J., Wolfe, C. B., & Spitler, M. E. (2013). Longitudinal evaluation of a scale-up model for teaching mathematics with trajectories and technologies: Persistence of effects in the third year. American Educational Research Journal, 50(4), 812–850. https://doi.org/10.3102/0002831212469270. 11 Contemporary Educational Psychology 60 (2020) 101834 D.H. Clements, et al. strategy use. In L. A. Penner, G. M. Batsche, H. M. Knoff, & D. L. Nelson (Eds.). Contributions of psychology to science and mathematics education (pp. 341–366). Washington, DC: American Psychological Association. Siegler, R. S. (1994). Cognitive variability: A key to understanding cognitive development. Behaviometrika, 3, 1–50. Siegler, R. S. (1995). How does change occur: A microgenetic study of number conservation. Cognitive Psychology, 28, 255–273. https://doi.org/10.1006/cogp.1995. 1006. Siegler, R. S. (2006). Microgenetic analyses of learning. In W. Damon, R. M. S. E. Lerner, D. Kuhn, & R. S. V. E. Siegler (Eds.). Handbook of child psychology: Volume 2: Cognition, perception, and language (pp. 464–510). (6th ed.). Hoboken, NJ: John Wiley & Sons. Steffe, L. P. (1994). Children’s multiplying schemes. In G. Harel, & J. Confrey (Eds.). The development of multiplicative reasoning in the learning of mathematics (pp. 3–39). Albany, NY: SUNY Press. Stein, M. K., Engle, R. A., Smith, M. S., & Hughes, E. K. (2008). Orchestrating productive mathematical discussions: Five practices for helping teachers move beyond show and tell. Mathematical Thinking and Learning, 10(4), 313–340. Thornton, C. A. (1979). See Emphasis on thinking strategies in basic fact instruction-a response. Journal for Research in Mathematics Education, 10(3), 233–234. Torbeyns, J., Verschaffel, L., & Ghesquière, P. (2005). Simple addition strategies in a firstgrade class with multiple strategy instruction. Cognition and Instruction, 23, 1–21. Verschaffel, L., Torbeyns, J., De Smedt, B., Luwel, K., & Dooren, W. V. (2007). Strategy flexibility in children with low achievement in mathematics. Educational & Child Psychology, 24(2), 16–27. Wansart, W. L. (1990). Learning to solve a problem: A microanalysis of the solution strategies of children with learning disabilities. Journal of Learning Disabilities, 23(164–170), 184. Weiland, C., Wolfe, C. B., Hurwitz, M. D., Clements, D. H., Sarama, J., & Yoshikawa, H. (2012). Early mathematics assessment: Validation of the short form of a prekindergarten and kindergarten mathematics measure. Educational Psychology, 32(3), 311–333. https://doi.org/10.1080/01443410.2011.654190. What Works Clearinghouse. (2017). Self-regulated strategy development. Retrieved from What Works Clearinghouse website: https://ies.ed.gov/ncee/wwc/Docs/ InterventionReports/wwc_srsd_111417.pdf. Wright, B. D., & Stone, M. H. (1979). Best test design: Rasch measurement. Chicago: MESA Press. Wu, H.-H. (2011). Understanding numbers in elementary school mathematics. Providence, RI: American Mathematical Society. Paulos, J. A. (1991). Beyond numeracy: Ruminations of a numbers man. New York, NY: Alfred A. Knopf. Peterson, P. L., Carpenter, T. P., & Fennema, E. H. (1989). Teachers' knowledge of students' knowledge in mathematics problem solving: Correlational and case analyses. Journal of Educational Psychology, 81, 558–569. Plewis, I. (1989). Comment on “centering” predictors in multilevel analysis. Multilevel Modelling Newsletter, 1(3), 6, 11. Powell, S. R., & Fuchs, L. S. (2015). Intensive intervention in mathematics. Learning Disabilities Research & Practice, 30(4), 182–192. https://doi.org/10.1111/ldrp.12087. Pressley, M., & Harris, K. R. (2006). Cognitive strategies instruction: From basic research to classroom instruction. In P. A. Alexander, & P. H. Winne (Eds.). Handbook of educational psychology (pp. 265–286). Mahwah, NJ: Lawrence Erlbaum Associates. Raudenbush, S. W. (1989). “Centering” predictors in multilevel analysis: Choices and consequences. Multilevel Modelling Newsletter, 1(2), 10–12. Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods (2nd ed.). Newbury Park, CA: Sage. Rhodes, K. T., Lukowski, S., Branum-Martin, L., Opfer, J., Geary, D. C., & Petrill, S. A. (2019). Individual differences in addition strategy choice: A psychometric evaluation. Journal of Educational Psychology, 111(3), 414–433. https://doi.org/10.1037/ edu0000294. Rockwell, S. B., Griffin, C. C., & Jones, H. A. (2011). Schema-based strategy instruction in mathematics and the word problem-solving performance of a student with autism. Focus on Autism and Other Developmental Disabilities, 26, 87–95. Sarama, J., & Clements, D. H. (2009). Early childhood mathematics education research: Learning trajectories for young children. New York, NY: Routledge. Sarama, J., Clements, D. H., Wolfe, C. B., & Spitler, M. E. (2012). Longitudinal evaluation of a scale-up model for teaching mathematics with trajectories and technologies. Journal of Research on Educational Effectiveness, 5(2), 105–135 10.1080119345747.2011,627980. Sarama, J., Lange, A., Clements, D. H., & Wolfe, C. B. (2012). The impacts of an early mathematics curriculum on emerging literacy and language. Early Childhood Research Quarterly, 27(3), 489–502. https://doi.org/10.1016/j.ecresq.2011.12.002. Saxton, M., & Cakir, K. (2006). Counting-on, trading and partitioning: Effects of training and prior knowledge on performance on base-10 tasks. Child Development, 77(3), 767–785. Sherin, B., & Fuson, K. C. (2005). Multiplication strategies and the appropriation of computational resources. Journal for Research in Mathematics Education, 36(4), 347–395. Siegler, R. S. (1993). Adaptive and non-adaptive characteristics of low income children's 12