As highlighted throughout the report, Pre-K–12 science, technology, engineering, and mathematics (STEM) education initiatives can have a complex constellation of goals and related outcomes for beneficiaries (e.g., students, teachers). For example, they might be designed to advance the STEM workforce, to develop skills and competency in STEM disciplines, to foster a sense of belonging and identity as a STEM professional, and/or as a way to cultivate just, sustainable, and thriving human communities (National Academies of Sciences, Engineering, and Medicine [NASEM], 2024a). Innovations might be designed to meet multiple goals, some of which are explicitly stated, others that are more implicit. As an innovation is implemented, enactors (educators and administrators) may emphasize specific outcomes or introduce additional goals. Some of the goals may be measured, and others may not be. Regardless, throughout the curricula, programs, and experiences, the intended outcomes are intertwined. Recent research in STEM education has focused on understanding and measuring these broadened goals and outcomes of educational approaches, programs, curricula, and interventions (National Academy of Engineering [NAE] & National Research Council [NRC], 2014; NASEM, 2020, 2021a,b, 2022, 2023, 2024a; NRC, 2011, 2013).
A goal for some Pre-K–12 STEM education innovations is to scale beyond the contexts in which it was designed. Chapters 4 and 5 described some of the approaches to scaling and sustaining that innovations might take and the ways in which innovations can propagate. Throughout these discussions, the committee navigates the tensions between the landscape
of innovation and implementation as well as the tensions between features of the innovation itself that are necessary to support the scaling and sustainability, and broader system-level supports. These issues are necessarily intertwined and highly dependent.
Pre-K–12 STEM education innovations are highly variable in their implementation and reach, which presents challenges for understanding the landscape of access and opportunity. A number of recent reports have attempted to catalogue the patterns of opportunity and access to Pre-K–12 STEM education innovations (see NASEM, 2022, 2023, 2024a,b). Given the variability, the committee was unable to provide a detailed analysis of this opportunity landscape; however, as can be observed through the case summaries in the compendium that was completed by the Education Development Center (EDC; Appendix C), there are a number of innovations designed to increase opportunity and develop a sense of belonging and identity with STEM.
The goal for this chapter is to provide what is known about the features at the innovation level that support the scaling of promising, evidence-based Pre-K–12 STEM education innovations. This is accomplished by drawing on the findings recorded in the compendium that was completed by the EDC. First, the committee describes the development of the compendium. The committee then explores the conditions that support scaling promising, evidence-based, Pre-K–12 STEM education innovations. The chapter concludes with a brief introduction of the factors that were identified as hindrances to scaling innovations, issues that are taken up more fully in the subsequent chapter.
The committee was tasked with producing a compendium of promising, evidence-based Pre-K–12 STEM education innovations. As noted in Chapter 1, the committee recognized the importance of this task and the enormity of the undertaking. Therefore, the committee commissioned the production of the compendium, which was developed by EDC. What follows is a brief outline of the methodology and a brief overview of promising innovations identified through this process.
___________________
1 It should be noted that the compendium was produced while the committee was engaged in its analysis of the literature and reaching consensus on its evidence-based conceptualization of scaling. Therefore, the production of the compendium did not include questions that would allow for a more nuanced analysis and understanding of scaling using the dimensions of scale as described by the committee.
EDC conducted a seven-step process with the goal of identifying promising Pre-K–12 STEM education innovations that have gone through a scaling-up process. This process included these steps:
The full methodology for the development of the compendium can be found in Appendix B, and the full compendium is provided in Appendix C.
Before identifying innovations, EDC first conducted a literature search to identify factors that appear to support innovations in their scaling efforts and those that may hinder them in service of understanding how innovations scale. EDC systematically searched Google Scholar and EBSCO using various combinations of the following keywords: STEM, science, mathematics, engineering, education, innovation, scaling, and scale-up. Results were filtered to identify literature published in the last ten years. In EDC’s review, it was determined that there is a growing body of research that has attempted to uncover the factors that enable or constrain the scaling of innovations (Dearing et al., 2015; Krainer et al., 2019; Looi & Teh, 2015; Lowrie, Leonard, & Fitzgerald, 2018; Maass et al., 2019; Sabelli & Harris, 2015; Young et al., 2016). From the literature, the following promising factors were identified:
These promising factors were intentionally built into the questions asked by EDC in its solicitation and in the rubric so that EDC could provide the committee with an understanding of how the innovations accounted for these factors during scaling and sustainability efforts.
For an innovation to be considered, it needed to meet the following characteristics:
A Google form was created for potential programs to self-nominate. The form requested general information (e.g., program name and point of contact); program information (e.g., Pre-K–12 grade band, subject domain[s], intended audience[s]); evidence of scale; and evidence of impact. Evidence of scale and evidence of impact were collected mostly as open-ended responses to allow programs maximum flexibility to explain how they view these parameters and apply them in their individual contexts.
EDC developed a working list of sources and field experts to assist in identifying programs to directly contact regarding the self-nomination process. Based on this, EDC identified programs through the following:
In order to identify specific programs to reach out to, EDC first solicited suggestions from their personal contacts as well as through conversations with program officers from the ED, NSF, National Aeronautics and Space Administration, and National Oceanic and Atmospheric Administration. This resulted in 41 programs identified.
EDC then utilized publicly available databases to review federally funded projects from the past 15 years. This included programs funded by Education Innovation and Research scale-up grants from ED, as well as Innovative Technology Experiences for Students and Teachers (ITEST) and Discovery Research Pre-K–12 (DRK–12) grants from NSF. EDC included a search for the terms “scale” or “scale-up” in order to limit the results to those referencing scale. The list of programs was then reviewed to include only those programs that had scaled, were focused on a field of STEM, and were connected to formal classrooms. EDC also reviewed the What Works Clearinghouse to identify programs that met criteria. This resulted in the identification of an additional 81 programs.
Finally, EDC’s review of the literature identified an additional 14 programs that had not already been identified through other means. In total, EDC sent invitations to 136 programs inviting them to self-nominate (see Table B-1 in Appendix B).
In addition to reaching out to specific programs, EDC also sent information about this opportunity to organizations asking them to share the opportunity with programs that may be eligible for inclusion in the compendium. In total, EDC contacted individuals representing 64 organizations (see Table B-2 in Appendix B).
Program contacts received an outreach email encouraging them to nominate their programs and share the opportunity with their networks. EDC also utilized social media for outreach with posts shared by EDC, Community for Advancing Discovery Research in Education, and STEM Learning and Research Center. Moreover, EDC sought word-of-mouth recommendations from knowledgeable colleagues and others in the field whenever possible.
All programs were asked to provide a brief summary of evidence of impact to include evidence that (a) STEM learning goals were achieved (e.g., content-specific knowledge), (b) STEM skills were developed (e.g., critical thinking, problem solving, communication), (c) affective goals were achieved (e.g., generated interest in STEM), and/or (d) the innovation helped participants make connections to STEM careers. The various innovations were evaluated against a rubric to determine the strength of evidence for scale, impact, and additional factors related to the scalability of an innovation as noted above (see Appendix B for the Rubric). EDC examined innovations looking for: (a) evidence of fidelity during scaling,2 (b) evidence of scaling to new audiences or contexts, (c) evidence that participant outcomes were achieved or skills developed, and (d) understanding of effectiveness for different contexts/learners. The innovations were also reviewed to determine to what extent there was evidence for the factors identified by the literature search that have contributed to successful scaling efforts.
A total of 65 nominations were received. Four submissions were excluded because they did not include an external evaluation or peer review, another four were excluded because they were not connected to formal classroom settings, and one was excluded because it did not focus on STEM. The remaining 56 programs were reviewed and an overview of their breakdown is summarized below (see Appendix C for individual descriptions of all of the programs).
At least half of the innovations were intended for elementary, middle, and/or high school with two-thirds focused on middle school. Fewer innovations focused on Pre-K (30%). More than half of the innovations (66%) indicated that they focused on two or more grade bands. Most innovations (70%) included a focus on science content, whereas about half (48%) included a focus on engineering or mathematics. More than half of the identified nominations (63%) indicated more than one domain, and 13 innovations (23%) selected all four of the main STEM disciplines.
___________________
2 As previously noted, this work was done in parallel as the committee was reaching consensus on the more nuanced characterization of scaling; the committee recognizes that focusing on “fidelity of implementation” represents a limited view on scaling an innovation.
TABLE 7-1 Nominated Innovations by Domain and Grade Band
| % Nominations | Science | Technology | Engineering | Mathematics | Computer Science | Data Science |
|---|---|---|---|---|---|---|
| Pre-K (n=17) | 65% | 41% | 53% | 82% | 29% | 29% |
| Elementary (n=29) | 62% | 41% | 62% | 59% | 38% | 34% |
| Middle (n=38) | 79% | 37% | 58% | 45% | 34% | 26% |
| High (n=32) | 78% | 38% | 44% | 41% | 34% | 31% |
* Innovations could select more than one response option.
SOURCE: EDC, 2024.
Looking at the distribution of domains across grade bands, as seen in Table 7-1, most Pre-K programs included a mathematics component (82%). Science was a component of the majority of middle and high school programs (79% and 78%, respectively). Elementary school programs had a more even distribution across the domains, but were most likely to include science, engineering, or mathematics.
Innovations can arise from and be implemented in a variety of contexts (see Chapter 5 for more discussion). Although innovations needed to have a connection to Pre-K–12 settings, innovations were also frequently implemented in informal learning and afterschool settings. Twenty-eight (50%) of the innovations indicated that they take place in or are connected to formal Pre-K–12 settings only. Moreover, about half of the innovations (52%) have been in operation for more than 10 years, 23 percent have been operating for 2–5 years, and 20 percent for 6–10 years.
In terms of the intended audiences for the identified innovations, all but one program indicated that teachers/educators were the intended audience in addition to most (86%) also selecting students. At least half of the innovations also identified low-income communities, individuals from ethnic or racial groups traditionally underrepresented in STEM, and underresourced geographic areas as intended audiences (see Table 7-2). Given these intended audiences, many of the innovations (87%) incorporated supports for building teacher capacity (e.g., professional learning, differentiated teaching/learning, cultural/contextual responsiveness) to implement the innovation successfully with the anticipated participants.
As described in the earlier methodology section, EDC conducted a review of the literature to identify conditions that hinder or support the
TABLE 7-2 Nominated Innovations by Audience
| Audiences | n* | % |
|---|---|---|
| Teachers/educators | 55 | 98% |
| Students/youth | 48 | 86% |
| Low-income communities | 34 | 61% |
| Individuals from ethnic or racial groups traditionally underrepresented in STEM | 31 | 55% |
| Under-resourced geographic areas | 29 | 52% |
| Multilingual learners | 27 | 48% |
| Girls | 23 | 41% |
| Parents/caregivers/families | 14 | 25% |
| Other** | 5 | 9% |
* Programs could select more than one choice.
** Other responses: All; School/District Administrators; designed to work for all students taking a science class; Tribal communities and high schools with a minimum of 60% enrollment of Native American, Alaska Native, Native Hawaiian and/or Pacific Islander students; teacher leaders, professional learning providers, administrators, state-level leaders.
SOURCE: EDC, 2024.
scaling of innovations (Dearing et al., 2015; Krainer et al., 2019; Looi & Teh, 2015; Lowrie, Leonard, & Fitzgerald, 2018; Maass et al., 2019; Sabelli & Harris, 2015; Young et al., 2016). As listed above, key factors identified as supportive conditions for scaling were:
These are the factors that are inherent (or need to be considered) in the design and implementation of an innovation and are related to the approaches articulated in Chapter 4. Many of these factors are also related to systems-level factors that can enable or constrain the scaling and sustaining of innovations—these system-level ideas will be taken up in Chapter 8. What follows is a discussion of each factor, using case studies from the compendium to support the ways in which the various factors supported the scaling (and sustainability) of the innovation. In describing each case study, the committee discusses the factors that contributed to the innovation’s success and how challenges were managed.
For innovations to successfully scale, it is important that they include a proven core with ample room for adaptation to different contexts and learners (Looi & Teh, 2015). Wylie and colleagues (2008) note that a “tight but loose” framework focuses on the tension between two opposing factors in school reform: the need for an innovation to stay consistent with its governing principles, while also offering ample opportunity to adapt to the countless STEM learning environments in which it makes sense to take up said innovation. The framework “combines an obsessive adherence to central design principles (the tight part) with accommodations to the needs, resources, constraints, and particularities that occur in any school or district (the loose part), but only where these do not conflict with the theory of action of the intervention” (p. 2). This is akin to the mutual adaptation discussion in Chapter 4.
The flexibility offered by this framework can also support educator autonomy, which can aid in motivation and buy-in. It is important that the core of the innovation be clearly stated and shown to be sufficient to meet desired outcomes (Dearing et al., 2015). As an example of an innovation using the “tight but loose” approach, the framework utilized by SCRIPT supports schools and districts in creating a plan for high-quality computer science implementation in a way that considers their individual contexts and goals (see Box 7-1). The SCRIPT framework not only allows but trains educators to appropriately adapt the innovation to support local relevance, leading to adoption across a variety of settings.
Additionally, adaptations by practitioners can have the added benefit of serving as a source of additional resources that could be used to expand upon the materials offered by the innovation (Sabelli & Harris, 2015). For example, the compendium describes FUSE,3 which is a program that is guided by core design principles while offering individual educators the freedom to design the program according to the needs of the school/organization and their community, including logistics like classroom schedules. Research-based recommendations are shared with all partners as they develop their plans to guide adaptation to fit their local needs. FUSE was first piloted in the Chicago area but has been expanded to additional urban, suburban, and rural partners.
For innovations to have the potential to scale, it is important for the innovation to be aligned with the priorities set by policy, organizations, and
___________________
3 FUSE is not an acronym. The name originated from youth who had participated in the program, which had been called ‘YouSTEM’ (see https://www.fusestudio.net/about).
SCRIPT utilizes a framework to guide strategic planning for school and district leadership teams to implement computer science education. The SCRIPT team works with school and district leadership teams to create a plan for K–12 implementation in a way that considers their individual contexts and goals—so although the SCRIPT framework itself is stable, it is flexible so that the plan can be adapted to fit the context, a factor critical for the scalability of this innovation. As part of the planning, SCRIPT helps leadership teams consider capacity building, including teacher professional learning, technology implementation, and community resources. SCRIPT has worked with over 950 school districts in the United States, increasing its reach by over 25 times since its inception, and has multiple states with facilitators trained and certified and currently offering workshops; the utilization of trained facilitators across the country primed to support adaptation based on local contexts and goals has aided in the scalability of the innovation.
Research has shown that SCRIPT has had impact on buy-in, decision making, goal setting, and actions at the district level (Cobo et al., 2024), with more recent research focusing on longitudinal impact on student course taking and equity.
SOURCE: EDC, 2024.
individual practitioners (Looi & Teh, 2015; Lowrie, Leonard, & Fitzgerald, 2018). As noted earlier, innovations can be developed with multiple goals and purposes—from broadening participation and access, to developing skills and content knowledge, to advancing a sense of belonging and identity as a person who participates in STEM (among others). Each of these goals might be attended to in various ways in the construction and implementation of an innovation, and some of these may be easier to advance when they are aligned with other goals as well as existing policies and practices (this will be taken up more in Chapter 8).
Alignment with goals, policies, and practices helps to facilitate alignment across levels of the system and the different partners (Maass et al., 2019); alignment in this way can lead to lasting organizational and policy related changes (Sabelli & Harris, 2015). For example, showing how the program is aligned with standards can help with buy-in by showing how implementing the program can be incorporated with existing practices and help achieve goals for learners (Sabelli & Harris, 2015). The IRCEDE STEM for Our Youngest Learners (see Box 7-2) is an innovation that is aligned with both the Early Learning Outcomes Framework4 and the Next Generation Science Standards (NGSS Lead States, 2013). By providing aligned materials and
___________________
STEM for our Youngest Learners is a framework for a student-centered, inquiry-based approach to STEM that engages young children in sensemaking as they design solutions to self-defined engineering problems. The framework of STEM for Our Youngest Learners is aligned with both the Early Learning Outcomes Framework and the Next Generation Science Standards. The program is not a stand-alone curriculum or a package of lessons but is an approach to pedagogy that is taught in hands-on teacher play sessions for adults using the same open-ended materials they will offer children in their settings. The goal is for teachers to develop the skills and confidence to enhance their existing science curriculum and sequential lesson plans, adapting to their local contexts and learners while considering other factors such as grade, time schedules, culture, and mandated curricula.
The framework was originated with Pre-K–2 teachers at an inclusive laboratory school serving primarily low-income urban Black families. Additional funding allowed for expansion to the predominantly rural northeast part of the state. The Iowa Governor’s STEM Advisory Council has funded the program annually since 2017 to provide professional learning and classroom materials to over 1,000 of Iowa’s early childhood educators and informal learning programs, in underresourced rural as well as urban sites, across the state. Additional funding has also allowed for ongoing improvement of materials, and dissemination occurs through a partnership with Pre-K–2 teachers at North Tama County Community Schools, a rural public school.
An NSF DRK-12 grant enabled a pilot test and field test in multiple sites in different states. The evaluation revealed that most of the participating teachers created an environment that supports science teaching and learning and that children made gains in physical science content knowledge. Additionally, teachers reported that they had gained important knowledge, skills, and abilities in terms of science content, supporting science inquiry, and facilitating inquiry-based learning.
SOURCE: EDC, 2024.
professional learning, teachers gain important knowledge, skills, and abilities that support student learning across a variety of contexts and student populations. As teachers gain more confidence with STEM, they are able to adapt the program to their local contexts and learners, considering such factors such as grade, time schedules, cultures, and mandated curricula.
To ensure that all learners have access to STEM-related opportunities, there has been increasing focus on equity-related goals. Some studies have revealed that explicit connections to equity-oriented components within innovations yield better outcomes for learners who have been traditionally underrepresented in STEM (Cunningham et al., 2020; NAE & NRC, 2014; NASEM, 2021a,b). Box 7-3 presents the example of the Beauty and Joy of Computing (BJC) course—a high school Advanced Placement Computer
BJC is a College Board-endorsed Advanced Placement (AP) Computer Science Principles (CSP) course designed to provide students with a rigorous and engaging introduction to CS and to support the participation of students from groups historically underrepresented in computer science. The BJC course uses the blocks-based visual programming language Snap! to increase accessibility and engagement for students. It employs a project-centered approach, with projects in a variety of contexts (e.g., games, art/design, mathematics); supports culturally responsive instruction; and incorporates critical social implications of computing in the content of the course.
Findings from a BJC field-test indicate that teachers using the BJC curriculum and participating in summer professional learning made statistically significant pre/post gains in content knowledge, self-efficacy, self-rated preparation/effectiveness, self-rated programming ability, and knowledge/fluency (Price et al., 2016). External evaluation reports revealed that students in the BJC course in 2016–2017 (n = 311) showed significant pre/post gains on a content assessment, with small to medium effect sizes. Findings for student engagement and attitudes included significant gains for confidence and identity sub-scales, but no significant gains for interest and belongingness. Girls and Black and Latinx students achieved similar gains on the content assessment and on engagement and attitude measures as male and non-Black and Latinx students.
Student enrollment data in NYC indicate gains in the percentages of female, Black, and Hispanic students participating in BJC classes, and taking and passing the AP CSP exam. On the 2017 AP CSP exam, 2,854 NYC students took the exam, and 2,076 passed—a 73 percent pass rate compared with 74 percent nationwide, with higher percentages of female, Black, and Hispanic students in NYC taking the AP CSP exam than nationwide (Mark & Klein, 2019). National AP CSP exam data from the College Board in 2021 indicated female BJC students passed at a rate of 6.4 percentage points higher than the national average, Black BJC students passed at 1.2 points higher, and Hispanic BJC students at 4.6 points higher than the national average.
With support from NSF (grant #1441075), the initial cohort of teachers that piloted the high school BJC curriculum in 2015 included 28 NYC high school teachers. Between 2015 and 2022, BJC curriculum training was provided to over 200 NYC teachers from 136 schools. Those teachers, in turn, served over 25,000 NYC high school students. BJC AP CSP has been offered in the NYC Public Schools (NYCPS) continuously since the 2015–2016 school year with teacher professional learning opportunities offered through the NYCPS Computer Science for All (CS4All) initiative led by NYCPS teachers. Hundreds of additional BJC teachers have been trained nationally through annual summer professional development offerings at North Carolina State University. With additional support from EIR program at ED, an additional 38 teachers from 36 schools across the country are using BJC as part of a school CS equity program designed to broaden participation in CS coursework through recruitment, enrollment, and retention of high-need students in AP CSP courses. While the program is currently focused on working and expanding within a specific population (NYC schools), additional funding is currently supporting expansion across the country with a focus on equity.
SOURCE: EDC, 2024.
Science Principles course that has scaled to over 200 New York City (NYC) teachers and 600 non-NYC teachers. It is endorsed by the College Board, aligning to goals which can help administrators and teachers feel confident that this curriculum will help them meet their objectives. The objectives of BJC include providing students with a rigorous and engaging introduction to computer science and supporting participation of students from groups historically underrepresented in computer science. Findings from studies evaluating BJC have shown not only improved academic performance but also significant gains in confidence and identity for girls and Black and Latinx students.
In addition to ensuring that there is a core, adaptable framework and alignment with goals, policies, and practices, there is also a need for capacity building and organizational support. This can come from a variety of different forms, such as providing support for practitioners through professional learning (Maass et al., 2019; Young et al., 2016). Professional learning can support teachers as they navigate the changing landscape with growing expectations for what they need to know and do in their day-to-day practice of teaching (as described in Chapter 1; NASEM, 2020). Teachers are increasingly charged with ensuring that classrooms serve as equitable learning communities, fostering trusting and caring relationships among students and with teachers, and serving as a bridge between the school and families and communities. These increased expectations for learning, combined with the demand to create a responsive learning environment that supports the needs of all students, call for innovative approaches to instruction that may differ substantially from teachers’ own experiences as students or their preservice education (NASEM, 2020). At the same time, in the case of many STEM education innovations designed for students, teachers are considered key players in implementation. Thus, they are often supported through professional learning in an effort to facilitate effective implementation of these STEM education innovations. For example, STeLLA, a program included in the compendium, is a year-long science professional learning program for K–12 teachers, integrating video-based analysis of practice and a focus on student thinking (see Box 7-4). It provides approximately 90 hours of professional learning, including a summer institute and school year study groups. Research has provided evidence of impacts on teacher content knowledge, teacher pedagogical content knowledge, and teacher classroom practice that resulted in improved student achievement.
Beyond ensuring opportunities for professional learning, there is also a need for support from the organization via an investment in resources and continued support/buy-in (Krainer et al., 2019; Maass et al., 2019). In some cases, this might mean focusing on a bigger shift in the organization or
STeLLA, by BSCS Science Learning, began with 32 upper elementary school teachers in California in 2003. This innovation is a year-long science professional learning program for K–12 teachers, integrating video-based analysis of practice and a focus on student thinking. The goal is to improve students’ science achievement by improving teachers’ science content knowledge and their abilities to (a) explain science concepts to students, (b) clearly identify to students the science concepts used in student learning activities, and (c) engage students in thinking about science. Teachers learn to use these strategies by analyzing classroom videos and units, and sharing their thinking in facilitated sessions with other teachers. Throughout the school year, teachers apply what they are learning in their own classrooms. In 2009, STeLLA expanded to Colorado, initially serving 144 elementary teachers and then again in 2015 to reach middle and high school teachers in the same state. In 2019, the program expanded to Kentucky and Tennessee.
STeLLA’s approach to scaling was to steadily grow the project over time through additional funding and partnerships. This allowed them to test out their model in different contexts (e.g., different states and with different grade levels). They also utilize expert mentors to provide in-person, online, or hybrid teacher professional learning experiences.
Randomized controlled trial and quasi-experimental studies of the core program provide evidence of impacts on student achievement (effect size 0.68), teacher content knowledge, teacher pedagogical content knowledge, and teacher classroom practice (e.g., Roth et al., 2011, 2019; Taylor et al., 2017; Wilson et al., 2018). Additional studies have included online adaptations and application in preservice education and across settings (elementary, middle, and high school), as well as small- and large-scale implementation.
SOURCE: EDC, 2024.
policies to establish a system where the essential principles of an innovation can be adopted and sustained (Looi & Teh, 2015; Sabelli & Harris, 2015). An example is Making Sense of Science, a train-the-trainer approach that can support local leaders in facilitating high-quality teacher professional learning that, coupled with support from administrators, can develop systems of support for STEM learning (see Box 7-5). In this way, an organization’s capacity for adopting any new program could be strengthened. These ideas are expanded upon in Chapter 8.
As introduced in Chapters 4 and 5, research has suggested that practitioners can be key to providing developers with a better sense of what works
Making Science of SCIENCE (MSS) provides leadership development, professional learning, and needs-based technical assistance to teachers, coaches, leaders, and administrators. The program has been brought to scale through two key mechanisms: (a) a train-the-trainer model to support local leaders in facilitating high-quality teacher professional learning, and (b) work with administrators and local leaders to develop systems of support for STEM learning. These approaches work hand-in-hand to build local capacity in ways that account for differing contexts and authentically meet the needs of partnering schools, districts, and states. As two examples of state-wide scaling, MSS partnered with the Math and Science Bureau of the New Mexico Public Education Department and the Texas Regional Collaboratives to lead Facilitation Academies for regional leaders on a variety of topics. These trained regional leaders then provided MSS professional learning across the state, reaching 400–1,000 teachers each year for multiple consecutive years.
As reported by Heller and colleagues (2012), the MSS train-the-trainer approach has been shown to produce facilitators who successfully lead impactful teacher professional learning. A national randomized controlled trial (RCT) showed that teacher preparation in just 24 hours of MSS professional learning resulted in statistically significant gains in teacher and student content knowledge. Teacher content knowledge was maintained during the following school year and the new cohort of students also experienced gains in content knowledge. Following the national RCT, a follow-up study analyzed videos of 30 classrooms to better understand shifts in teacher instruction resulting from participation in the professional learning The MSS professional learning had a large, statistically significant effect on the overall quality of classroom instruction. Teacher participation in the professional learning resulted in statistically significant improvements in student cognitive engagement, student engagement in scientific sensemaking practices, and teacher elicitation of and attention to student thinking.
SOURCE: EDC, 2024.
for whom and in what contexts (Looi & Teh, 2015; McLaughlin & Mitra, 2001; Sabelli & Harris, 2015; Sarama & Clements, 2013). Additionally, testing the program in a variety of different settings (e.g., via design-based research) with different learners and including the input of educators whenever possible can provide valuable insights to identify challenges and adaptations (Lowrie, Leonard, & Fitzgerald, 2018). NURTURES (see Box 7-6) is an innovation that began with implementation in urban districts before expanding to rural districts and then military-connected districts in seven states. Throughout the program, research has been conducted to understand how to implement the innovation in different settings and with different audiences.
NURTURES provides professional learning for Pre-K–3 teachers and family engagement for families of young children (take-home Family Science packs and family engagement activities hosted in the community after school or on weekends). NURTURES began with implementation in urban districts in Toledo, Ohio. Building on the success of the initial phase of the program, they received additional NSF funding to implement the program in rural districts in Ohio and Michigan. A particularly important facet of this follow-up project was to research how each component (teacher professional learning versus family engagement) impacts student learning. Further funding from the Department of Defense allowed the program to expand to military-connected districts in seven states. Throughout the life of the program, the NURTURES team has conducted research to understand how to implement the program in different settings and with a wider audience. Continuous federal funding has supported this.
An initial evaluation documented the first five years of the program. Findings verified fidelity of implementation, and corrections or modifications to implementation were made due to early detection of variance from implementation plans. Findings indicated that NURTURES is a successful intervention for improving science teaching and student outcomes in early childhood classrooms as well as for increasing family science participation and the quality of that participation.
A longitudinal case study revealed improved pedagogical practices among teachers, increased science content knowledge and confidence to teach science, and increased use of technology in the classroom. A randomized controlled trial study found that K–2 students whose teachers had participated in NURTURES demonstrated higher achievement than peers whose teachers had not been in the program. Longitudinal studies also demonstrate the long-term impact of the program on student learning with gains in science, early literacy, and mathematics being sustained to grade 5.
Current research, funded through the Department of Defense, focuses on the delivery of program elements, examining methods of program delivery, multi-site facilitation, impacts on student learning, and the feasibility of offering NURTURES at scale.
SOURCE: EDC, 2024.
If the goal is to get many practitioners from a variety of contexts to try out a program, considering the feasibility (“trialability”), usability (“simplicity”), and cost can help increase the chances that the program will be implemented. Dearing and colleagues (2015) described case studies from three community college advanced technological education programs as they were supported in scaling innovations. Several lessons were learned through these case studies. First, it was determined that scaling required critical thinking, trial and error,
and reflection during implementation. That is, factors aiding in scale included the ease with which adopters can experiment with an innovation (i.e., trialability), how easy the innovation is to understand and use (i.e., simplicity), compatibility with diverse settings, and how much implementation of the innovation costs in terms of both time and money. As discussed in Chapter 4, an inherent tension exists between the ease of adoption and the depth of change that is needed to scale and sustain an innovation, and it is important to recognize that just because an innovation might by easy to adopt does not mean that it will result in the depth of change that is needed to be sustained.
Moreover, scaling benefits from encouraging adopters to adapt the innovation (within limits) and utilize these adaptations to improve the innovation. For example, Project Learning Tree (PLT) is an innovation that is supported by a network of state-level coordinators to engage with educators and communities at the local level. Because PLT programs are directed and implemented locally, educators received state-specific supplements to PLT’s educational materials that address the local environment, which aids in the ease of adoption of the materials. As another example, AlgebraByExample (Box 7-7) is a set of supplementary assignments that were explicitly designed to narrow the achievement gap for minority students.
Easily incorporated into any existing Algebra 1 curricula, AlgebraByExample is a set of freely downloadable supplementary Algebra 1 assignments that require students to analyze correctly- and incorrectly-worked examples that target common misconceptions and errors. These assignments can be used for practice, probes, personal reflection, and jumping off points for discussion. Developed via design-based research through the Strategic Education Research Partnership (SERP) and educators from the Minority Student Achievement Network (MSAN) within a year-long random-assignment study in over 300 classrooms with over 6,000 students, this set of assignments was developed with the intent of narrowing the achievement gap for minority students in the MSAN without isolating minority students in intervention settings (see Booth et al., 2015).
On average, students using AlgebraByExample demonstrated statistically significant gains in procedural and conceptual understanding, as well on released items from standardized tests, but the lowest performing students showed the most significant gains. More research needs to be done to see the impact on minority students.
Although not much information was provided about their scaling efforts, the fact that it is freely accessible and designed to supplement classroom teaching may contribute to scaling due to the ease of accessing and integrating the materials.
SOURCE: Booth et al., 2015; EDC, 2024.
However, in striving to develop an innovation that can be scaled across a variety of contexts, it is key to not fall into the “trap of perfection” (Dearing et al., 2015). As described in Chapter 4, there are a number of dimensions for scaling. Maintaining depth (what the program is supposed to do) while also allowing for simplicity and flexibility is desirable for scalability. Therefore, there can be a tradeoff between ease of adoption and impact. That which is easily adopted in terms of curricular innovation may be less likely to push on practice in the way today’s STEM standards require. The committee returns to this issue in Chapter 8.
Partnerships can be valuable for scaling in a variety of ways (Lowrie, Leonard, & Fitzgerald, 2018; Maass et al., 2019; Young et al., 2016). For example, the common online data analysis platform (CODAP) is free educational software for data analysis with wide uptake and acknowledgement among the statistics education community and beyond. This web-based data science tool is designed both as a platform for curriculum developers and as an application for students in grades 6–12. CODAP has utilized partnerships with other projects to incorporate and spread their work and is currently in broad use by approximately 60,000 users monthly. More than 130 learning science researchers have incorporated CODAP into their research and development efforts, resulting in dozens of curricula and lessons nationwide. Through the varied partnerships across multiple communities, CODAP has had the potential to scale across multiple contexts.
Moreover, partnerships can be leveraged to help in scaling culturally specific programming across the country. For example, Seeding Innovation, described in the compendium, is an innovation in which American Indian Science and Engineering Society (AISES) and the Kapor Center have partnered to provide a sequence of culturally revitalizing computer science curricula to partner schools across the country. The partner organization works collaboratively with school sites to create an engaging computer science curriculum, while also working with teachers and, when possible, community members to integrate cultural traditions, language, stories, art and more.5
Although external partners can be valuable sources of resources, funding, and support for those developing innovations, partnering with districts, schools, and teachers can provide in-depth insights into specific contexts to guide development and improvements. Building networks in this way can allow for the innovation to be carried out in a larger number of settings across a larger geographic area. For example, Box 7-8 highlights the
___________________
5 For more information, see Kapor Foundation & AISES (2023) report, State of diversity: The native tech ecosystem, https://www.kaporcenter.org/wp-content/uploads/2023/10/StateofDiversity.TheNativeTechEcosystem.pdf
The Exploratorium has developed a professional network of over 1,100 science education leaders across California who bring high-quality science professional learning and advocacy to over 100,000 teachers statewide. Leaders in the network access a constellation of professional learning opportunities that begins with a multi-day core institute and includes series of stand-alone virtual and in-person workshops, communities of practice, and access to high-quality teaching leadership resources.
The Exploratorium K–12 Science Leader Network spans the state of California and now includes over 950 leaders, 85 percent of whom work with Title I schools. Because this network is aimed at teacher professional learning, their leadership network impact spreads to 95,000 classroom teachers and millions of students. By 2025, they would like to further expand their network to 1,500 leaders serving 150,000 classroom teachers. They provide professional learning programs based on grade band: One to train K–12 Science Leaders, one for K–5 educators, and one for secondary educators.
Based on an external evaluation by Inverness Research Inc., the Exploratorium’s K–12 Science Leader Network and professional learning programs consistently show positive impact on improved leadership capacities and achieving equity in the science classroom.
SOURCE: EDC, 2024.
Exploratorium California K–12 Science Leader Network, which grew from serving a single county office to spanning the state of California.
The complexity of educational environments results in embedded challenges that can negatively affect successful scaling and sustainability efforts. Challenges identified from the literature review in combination with the compendium submissions include:
If an innovation is designed to align with specific policies or to meet specific goals, there can be a large threat to scaling and sustainability when policies or priorities change and are not in line with the direction of the innovation. Scaling and sustainability are aided by buy-in from and capacity
building with individuals and organizations who are the enactors. If individuals championing the innovation at an organization or those trained to implement it leave, it can hinder continuity and efforts to scale the innovation. During scaling efforts, it can be difficult to also update materials as these would require additional development and testing. This can lead to materials that are out-of-date with the latest advances in STEM fields. Working with partners, especially teachers, who adapt and update materials can help mitigate this. Lastly, once a program has scaled broadly, it is challenging to monitor and evaluate. When an innovation is being implemented and adapted in many different contexts, it is difficult to draw conclusions across sites and learners. Additionally, the cost of evaluating something at this scale in a deep way can be prohibitive.
Although these issues were identified as having impacts on the individual innovations, they represent more systemic issues. Chapter 8 provides a more in-depth discussion of these challenges and offers suggestions for how the system can respond.
Pre-K–12 STEM education innovations can be implemented in a variety of contexts to achieve a variety of different goals. As highlighted throughout the compendium commissioned for this report, although there can be notable challenges to scaling and sustaining innovations, there are several factors that support the conditions for scaling, to include a “tight but loose” framework; alignment of goals, policies, and practices; capacity building and organizational support; adaptability to a variety of settings; ease of adoption; and partnerships and networks. It is important to note that these are not the only factors that are needed for scaling, nor are they a requirement for successful scaling efforts. Looking across the various innovations, it is clear that there is not a “one-size-fits- all” approach to scaling, but rather, design of scaling innovations depends on the goals of the innovation, the resources available, and the contexts and learners they are trying to reach. Moreover, the process of evaluating and understanding implementation and outcomes can be challenging at scale, especially harder-to-study aspects such as fidelity of implementation, integrity of implementation (as described in Chapter 4), and implementation in a variety of contexts. Although many innovations in the compendium were able to collect this information and use it to scale the innovation, evidence cannot cover all settings and all participants. Finally, as discussed in Chapter 5, continuous funding, especially larger grants from federal sources, are important to funding these efforts, which can lead to a robust innovation that has scaled across a range of contexts.
Overall, in an effort to best understand the supportive conditions that allow for the scaling and sustaining of innovations, there is a need for
consideration as to how and in what ways evidence of scale and implementation is gathered, what types of evidence and understanding are important for the development and scale of innovations, and how funders can support these efforts. Chapter 8 will unpack these ideas more.
Booth, J. L., Cooper, L. A., Donovan, M. S., Huyghe, A., Koedinger, K. R., & Paré-Blagoev, E. J. (2015). Design-based research within the constraints of practice: AlgebraByExample. Journal of Education for Students Placed at Risk (JESPAR), 20(1–2), 79–100. https://doi.org/10.1080/10824669.2014.986674
Cunningham, C. M., Lachapelle, C. P., Brennan, R. T., Kelly, G. J., Tunis, C. S. A., & Gentry, C. A. (2020). The impact of engineering curriculum design principles on elementary students’ engineering and science learning. Journal of Research in Science Teaching, 57(3), 423–453.
Dearing, J. W., Dede, C., Boisvert, D., Carrese, J., Clement, L., Craft, E., Gardner, P., Hyder, J., Johnson, E., McNeel, D., Phiri, J., & Pleil, M. (2015). How educational innovators apply diffusion and scale-up concepts. Scaling Educational Innovations, 81–104. https://doi.org/10.1007/978-981-287-537-2_5
Education Development Center. (2024). [Compendium of PreK–12 STEM education programs: Impact and scale]. Paper commissioned by the Committee on Pre-K–12 STEM Education Innovations. https://nap.nationalacademies.org/resource/27950/Compendium%20of%20PreK-12%20STEM%20Education%20Programs_EDC.pdf
Heller, J. I., Daehler, K. R., Wong, N., Shinohara, M., & Miratrix, L. W. (2012). Differential effects of three professional development models on teacher knowledge and student achievement in elementary science. Journal of Research in Science Teaching, 49(3), 333–362.
Kapor Foundation & AISES. (2023). State of diversity: The native tech ecosystem. https://www.kaporcenter.org/wp-content/uploads/2023/10/StateofDiversity.TheNativeTechEcosystem.pdf
Krainer, K., Zehetmeier, S., Hanfstingl, B., Rauch, F., & Tscheinig, T. (2019). Insights into scaling up a nationwide learning and teaching initiative on various levels. Educational Studies in Mathematics, 102, 395–415. https://doi.org/10.1007/s10649-018-9826-3
Looi, C. K., & Teh, L. W. (2015). Towards critical discussions of scaling up educational innovations. In C. K. Looi & L. Teh (Eds.), Scaling educational innovations (pp. 1–10). Springer. https://doi.org/10.1007/978-981-287-537-2
Lowrie, T., Leonard, S., & Fitzgerald, R. (2018). STEM practices: A translational framework for large-scale STEM education design. Educational Design Research, 2(1). https://doi.org/10.15460/eder.2.1.1243
Maass, K., Cobb, P., Krainer, K., & Potari, D. (2019). Different ways to implement innovative teaching approaches at scale. Educational Studies in Mathematics, 102, 303–318.
Mark, J., & Klein, K. (2019, February). Beauty and joy of computing: 2016–17 findings from an AP CS Principles course. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education (pp. 627–633).
McLaughlin, M. W., & Mitra, D. (2001). Theory-based change and change-based theory: Going deeper, going broader. Journal of Educational Change, 2, 301–323.
National Academies of Sciences, Engineering, and Medicine (NASEM). (2020). Changing expectations for the K-12 teacher workforce: Policies, preservice education, professional development, and the workplace. The National Academies Press.
___. (2021a). Call to action for science education: Building opportunity for the future. The National Academies Press. https://doi.org/10.17226/26152
___. (2021b). Cultivating interest and competencies in computing: Authentic experiences and design factors. The National Academies Press. https://doi.org/10.17226/25912
___. (2022). Science and engineering in preschool through elementary grades: The brilliance of children and the strengths of educators. The National Academies Press. https://doi.org/10.17226/26215
___. (2023). Closing the opportunity gap for young children. The National Academies Press. https://doi.org/10.17226/26743
___. (2024a). Equity in K–12 STEM education: Framing decisions for the future. The National Academies Press. https://doi.org/10.17226/26859
___. (2024b). A new vision for high-quality preschool curriculum. The National Academies Press. https://doi.org/10.17226/27429
National Academy of Engineering, & National Research Council. (2014). STEM integration in K–12 education: Status, prospects, and an agenda for research. The National Academies Press. https://doi.org/10.17226/18612
National Research Council (NRC). (2011). Successful K-12 STEM education: Identifying effective approaches in science, technology, engineering, and mathematics. The National Academies Press. https://doi.org/10.17226/13158
___. (2013). Monitoring progress toward successful K-12 STEM education: A nation advancing? The National Academies Press. https://doi.org/10.17226/13509
NGSS Leads States. (2013). Next generation science standards: For states, by states. The National Academies Press.
Price, T. W., Cateté, V., Albert, J., Barnes, T., & Garcia, D. D. (2016, February). Lessons learned from “BJC” CS Principles professional development. In Proceedings of the 47th ACM Technical Symposium on Computing Science Education (pp. 467–472). https://bjc.berkeley.edu/documents/2014%20SIGCSE%20-%20Lessons%20Learned%20from%20BJC%20CS%20Principles%20Professional%20Development.pdf
Roth, K. J., Garnier, H. E., Chen, C., Lemmens, M., Schwille, K., & Wickler, N. I. (2011). Videobased lesson analysis: Effective science PD for teacher and student learning. Journal of Research in Science Teaching, 48(2), 117–148.
Roth, K. J., Wilson, C. D., Taylor, J. A., Stuhlsatz, M. A., & Hvidsten, C. (2019). Comparing the effects of analysis-of-practice and content-based professional development on teacher and student outcomes in science. American Educational Research Journal, 56(4), 1217–1253.
Sabelli, N. H., & Harris, C. J. (2015). The role of innovation in scaling up educational innovations. In C. K. Looi & L. Teh (Eds.), Scaling educational innovations. Springer. https://doi.org/10.1007/978-981-287-537-2_2
Taylor, J., Roth, K., Wilson, C., Stuhlsatz, M., & Tipton, E. (2017). The effect of an analysis-of-practice, videocase-based, teacher professional development program on elementary students’ science achievement. Journal of Research on Educational Effectiveness, 10(2), 241–271.
Wilson, C. D., Stuhlsatz, M., Hvidsten, C., & Gardner, A. (2018). Analysis of practice and teacher PCK: Inferences from professional development research. Pedagogical content knowledge in STEM: Research to Practice, 3–16.
Wylie, E. C., Goe, L., Leusner, D. M., Lyon, C. J., Tocic, C., Wylie, E. C., Cleland, D., Gannon, M., Ellsworth, J., Heritage, M., Maher, J., Mardy, D., Popham, W. J., Snodgrass, D., Taylor, G., Thompson, M., & Wiliam, D. (2008). Tight but loose: Scaling up teacher professional development in diverse contexts. ETS Research Report Series, 2008(1), i–141. https://doi.org/10.1002/j.2333-8504.2008.tb02115.x
Young, V. M., House, A., Sherer, D., Singleton, C., & Wang, H. (2016). Scaling up STEM academies statewide: Implementation, network supports, and early outcomes. Teachers College Record, 118(13), 1–26. https://doi.org/10.1177/016146811611801310