Introduction

In response to the global challenges, the promotion of economic development, and the need to meet modern society’s demands for knowledge and skills within the realms of STEM, the emergence of STEM education aimed to develop employment opportunities in STEM fields while bolstering national competitiveness. The acronym STEM education originated from the four subjects (i.e., science, mathematics, engineering, and technology) that were proposed in the report “Undergraduate Science, Mathematics, and Engineering Education” (National Science Board 1986). Essentially, STEM education stands as an innovation-oriented education that prevailed in Western countries, spearheaded by the United States.

Subsequently, Yakman (2008) introduced the addition of the “A” element, representing arts, to STEM education, thereby incorporating humanities subjects such as history, philosophy, and religion. The fundamental objective of STEM education is to amalgamate multiple subjects into a cohesive framework (Morrison 2006). According to the National Science Foundation (2014), STEM entails a comprehensive integration of various disciplines, encompassing not only the subjects of natural sciences (e.g., computers and information, engineering, and mathematics), but also the subjects of social sciences (e.g., psychology, economics, sociology, and political science). With an increasing number of disciplines becoming intertwined in STEM education, its interdisciplinary essence has become progressively prominent. As a result, STEM education is increasingly acknowledged as interdisciplinary education with a focus on engineering, where subject integration plays a central role.

In the past two decades, STEM education has witnessed a large number of research achievements, and many scholars have conducted comprehensive reviews on the topic. These studies have focused either on curriculum reform (Uskoković 2023), teaching methods (Li and Wong 2023), or technology applications (Salas-Pilco et al. 2022; Conde et al. 2021). At the research level, especially in teaching and learning, many researchers have recognized the interdisciplinary nature of STEM education, but almost no research has focused on the development of STEM education from the perspective of subject integration (Perignat and Katz-Buonincontro 2019). The evolution of STEM disciplines and the development of their themes are closely interrelated, but the underlying coupling relationships and reasons for their formation remain unexplored.

Moreover, there exist significant differences in the disciplinary systems of K-12 education and higher education, including teaching objectives, methods, breadth, and depth. As a result, STEM education at different educational levels exhibits distinct characteristics, making it necessary to conduct a segmented analysis. Although some researchers have analyzed the development trends in STEM education from a macro perspective and recognized differences between educational stages, this has not been the primary focus of their work, and there has also been a lack of emphasis on specific disciplines (Zhan et al. 2022a).

Based on these considerations, this study attempts to examine and explore the developmental trajectories and trends of STEM education at various educational stages from the perspective of disciplinary evolution. Specifically, the following questions will be addressed:

RQ1: How were subjects integrated into STEM education in K-12 and higher education?

RQ2: What is the distribution of the subject themes involved in STEM education at the K-12 and higher education levels?

Methods

Keyword search

Papers related to STEM education were searched on 10 July 2023 from the Science Web Core Collection. The query started with the search statement TI = (STEM education) OR TI = (STEAM education) OR AK = (STEM education) OR AK = (STEAM education) OR AK = (STEAM education) OR KP= (STEM education) OR KP = (STEAM education), which yielded a total of 3668 publications. The search results were further refined according to the research area, while duplicates, poorly indexed documents, and documents inconsistent with STEM Education/STEAM Education research were removed, leaving a final total of 2188 publications.

Research process

WOS (Web of Science) was selected as the data source for this study. This database covers a wide range of journals, has a high impact, and can provide a complete sample for this study (Martín-Páez et al. 2019). Then, the following steps were used to analyze the data.

Step 1: Data classification (education stage classification). It has been shown that K-12 and higher education systems have different focuses on STEM education (Zhan et al. 2022a). To clarify the characteristics of the different stages, the data was divided into K-12 and higher education levels based on field information such as title, keywords (including author keywords and keywords plus), journal, and abstract. After discarding the data that could not be categorized, 903 valid data were obtained for the K-12 stage, with the time range from 2009 to 2023, and 873 valid data for the higher education stage, with the time range from 2004 to 2023.

Step 2: Keywords cleaning. In the collected data, some keywords have the same meaning but may be analyzed as different words, such as math, mathematics; model, models, etc., and some words have similar semantics, which may also lead to inaccurate analysis results when analyzed separately, so it was necessary to build a synonym database for synonym replacement so that they could be more accurately counted and visualized.

Step 3: Data classification (time and theme classification). The data at different stages were sub-categorized by time and theme respectively. Time division according to a time slice for a year. The keywords with the top 10 frequency in each subject were screened as alternative theme terms, and the alternative subject terms of each subject were integrated, then the remaining keywords were used as subject terms to participate in the final statistics.

Step 4: Data statistics and visualization. The categorized subject time and subject themes of different sections were counted separately, and the statistical results were visualized and described using heat maps. The heat map used in this study is a kind of statistical chart that shows the frequency of a certain word by the relative shades of color blocks, with dark colors representing the high frequency of occurrence and light colors representing the low frequency of occurrence. Finally, four maps were created to depict the time distribution and theme distribution at the K-12 and higher education levels. The research process is shown in Fig. 1.

Fig. 1: Research process.
figure 1

The entire research process went through five stages: data acquisition, data classification (education stage classification), keyword cleaning, data classification (time and theme classification), and data statistics and visualization.

Research findings

Analysis of the temporal evolution of the subject

STEM education originated from higher education, but in recent years, there has been rapid development in the K-12 stage. Both levels show a similar trend of overall integration, starting with a focus on science, technology, engineering, and mathematics, and later, an increasing involvement of humanities and social sciences. Interdisciplinary integration has become prominent, particularly in higher education. As shown in Fig. 2.

Fig. 2: Time distribution of subjects. (1) Time distribution of subjects in K-12 education. (2) Time distribution of subjects in higher education.
figure 2

The first column displays the subjects involved in STEM education, and the first row is the timeline. This figure illustrates the time and subject distribution of STEM-related literature. Darker colors indicate a greater number of documents related to the corresponding time node and subject.

Subject integration of STEM in K-12 Education

Subject integration refers to the methods and processes of cross-fertilization of different subjects, which is specifically expressed as the mutual integration of a subject with one or more subjects through knowledge, concepts, skills, methods, etc. at a certain time node, so time node is one important element of subject integration path analysis. Figure 2(1) illustrates the integration of different subjects at different time points at the K12 level. In the early stages (2009 to 2014), the subjects of science, technology, engineering, and mathematics played a dominant role, and these subjects were considered to be the core of STEM education. Over time, science subjects such as computer science, arts, physics, and environmental science were gradually incorporated into the STEM education integration pathway. In the post-2019 period, more and more research has emerged in the humanities and social sciences.

Different subjects played different roles in STEM education at the K-12 level. Science and technology provided a rich foundation of knowledge and practice for students involved in STEM education. Engineering developed students’ design thinking and problem-solving skills, while mathematics provided the foundation for quantitative and logical thinking. Early STEM education has not yet shown a clear trend of cross-fertilization of disciplines. Science courses, such as physics, chemistry, and biology, were considered the main foci of STEM education, with students exploring basic science concepts through participation in experiments and educational games.

As time went on, computer science and environmental science became important subjects for STEM education, and they facilitated the development of computational thinking and environmental awareness in students at the basic education level (Zhan et al. 2022b). In 2013, Grover and Pea (2013) published a study entitled “Computational Thinking in K-12: A Review of the State of the Field”, which explored the importance of including computational thinking as a content and goal of STEM education and had a profound impact on subsequent research regarding the integration of computing into STEM education. In 2022, the U.S. Department of Education proposed “Science, Technology, Engineering, and Math, including Computer Science”, also hinting at the importance of computer science in STEM (Department of Education, 2022).

Environmental issues have always been important social topics and are closely related to the development of engineering and technology. The integration of environmental science emphasized the importance of environmental awareness and sustainable development, making students conscious of environmental problems and proposing solutions through scientific and technological means. At the K-12 level, researchers have focused on green skills elements in STEM curricula and the integration of STEM educational approaches in environmental curricula (Sümen and Çalisici 2016).

After 2019, the integration of humanities and social sciences brought more dimensions and diversity to STEM education. At this stage, STEM education showed a clear interdisciplinary character. Compared to science courses that are involved in STEM education in the form of teaching content, humanities, and social sciences are integrated in a way that is more on the level of research methods and educational philosophy.

Psychological research explored the impact of spatial thinking, spatial skills, and spatial abilities on STEM learning, recognizing the importance of students’ mental states and cognitive abilities for learning (Buckley et al. 2018; Gilligan et al. 2017; Taylor and Hutton 2013). The inclusion of arts enhanced students’ understanding of creativity and encouraged them to use their imagination and creative abilities in the practice of science and engineering (Yakman 2010). The inclusion of political science primarily conducted a comparative study of STEM education across different regions from the perspective of policies (Sharma and Yarlagadda 2018).

Philosophy created a framework for analyzing and synthesizing STEM education goals and discourses, encouraging students to think deeply about the value and impact of science and technology (Ortiz-Revilla et al. 2020). The incorporation of history offered students diverse learning objectives that enabled them to understand the context and social impact of the development of science and technology (Park and Cho 2022). The inclusion of linguistics promoted the engagement of culturally and linguistically diverse students in STEM education, encouraging cross-cultural communication and collaboration across linguistic and cultural boundaries (Mallinson and Hudley 2018).

In the K-12 stage, there is a significant concentration of disciplines in STEM education, with computer science and arts receiving the most attention alongside the four main subjects. Additionally, interdisciplinary teaching in this stage is guided by conceptual instruction. In 2013, the United States released the milestone document “K-12 Science Education Framework”, initiating a major reform in science education. This document became the blueprint for the formal launch of the new era of science education reform known as the “Next Generation Science Standards” (NGSS). NGSS proposed a paradigm for science education in the U.S., integrating three dimensions: practices, cross-cutting concepts, and disciplinary core ideas. Seven powerful cross-cutting concepts were selected from these dimensions to bridge the boundaries between different subjects. These concepts include patterns, cause and effect relationships, systems and system models, matter and energy, structure and function, stability and change, and scale, proportion, and quantity (National Research Council 2013). The document brought new guidance and direction to STEM education in the United States, emphasizing comprehensive and interdisciplinary educational principles.

Subject integration of STEM in higher education

Figure 2(2) illustrates the time distribution of subjects at the higher education level. Since 2004, a total of 16 subjects have been involved in STEM studies at the higher education level. Similar to the K-12 level, the integration in higher education also shows an intersection of science, technology, engineering, humanities, and social sciences.

STEM subjects (science, technology, engineering, and mathematics) continued to play an important role at the higher education level, covering a wide range of fields of study. Unlike at the K-12 level, STEM education in higher education has exhibited a blend of disciplines at the beginning because of the strong interdisciplinary nature of the courses offered at universities themselves (for example, biochemistry). Students were exposed to more specialized and in-depth knowledge of science, technology, engineering, and mathematics disciplines in their areas of specialization. The focus of disciplinary integration was on combining theories and methods from different disciplines for cross-disciplinary research and innovation. For example, researchers in the multidisciplinary education (ME) course selected undergraduate students in engineering, pre-nursing, and pre-occupational health to collaborate in a maker space to solve health problems and create practical solutions to health-related problems facing the community through their backgrounds and competencies (Ludwig et al. 2017).

The development and disciplinary integration of STEM education was influenced by educational reform and societal needs. With the continuous advancement of technology and globalization, there was an increasing demand for comprehensive ability and interdisciplinary thinking. Traditional science and engineering education could no longer meet the current social and professional needs. Therefore, the integration of humanities and social science disciplines has become an important trend in the development of STEM education. For example, art subjects have promoted the integration of innovation and esthetics by providing creative expression and the development of design thinking. The prominence of gender, race, and economic issues, cultural background conflict in higher education has called for the inclusion of social science disciplines such as psychology, economics, and philosophy, linguistics, political science.

In higher education, the distribution of subjects was relatively diverse, with engineering receiving significant attention among the four main subjects. Additionally, chemistry has also been highly regarded, while comparatively, computer science’s involvement is not as prominent.

Comparing the temporal evolution of subjects at different educational levels

In summary, the concept of STEM education was gradually evolving from an initial bias toward engineering education to a more integrated and diverse educational paradigm. Since 2004, there have been 16 subjects involved in STEM (i.e., science, technology, engineering, mathematics, art, physics, chemistry, biology, psychology, computer science, environmental science, linguistics, economics, political science, philosophy, and history). In the analysis of subject integration, the overall integration trend was similar between the K-12 stage and the higher education stage. However, there were still some differences between K-12 education and higher education.

First, STEM education arose in higher education, but there seems to be a trend of research focus shifting from higher education to K-12 education. From 2004 to 2009, STEM research was focused on higher education, and after 2016, the number of papers in K-12 surpassed higher education. The reason for this phenomenon may be that the rise of STEM education sprung from the lack of talent in STEM careers, and higher education was directly oriented to society, so it was reasonable for research and reform to start from higher education, while government policies lead and funding investment largely promoted the rapid development of STEM education in K-12 education stage. Higher education points to the current talent needs of society, while K-12 education points to the future talent needs of society. The inclusion of STEM education in the education strategy of several countries also indicates that STEM talents are an important component of future national competitiveness, so it is very necessary to emphasize the K-12 stage.

Second, at the level of pedagogy and practice, disciplinary integration in STEM education at the K-12 level was often achieved through interdisciplinary projects and activities, such as engineering design challenges, science experiments, and mathematical modeling. These activities were usually classroom-cantered, with teachers guiding students through practice and inquiry. In contrast, in STEM education at the higher education level, disciplinary integration was focused more on the integration of research and practice. Students explored and applied integrated disciplinary knowledge in depth through participation in research projects, hands-on internships, and interdisciplinary courses.

In addition, the concept of STEAM education was more popular at the K-12 stage. “STEAM” was more frequently used in the K-12 stage, which could be said to a certain extent that the STEAM education concept was more popular in the K-12 stage, but may not necessarily indicate a deeper level of interdisciplinary integration in this stage.

Analysis of the evolution of subject themes

Research hotspots are reflected, to some extent, by the frequency of scientific theme terms. In this study, 32 keywords were selected as subject themes at the K-12 level and 33 keywords were selected at the higher education level. To facilitate the analysis, these keywords were grouped into “learning outcomes”, “teachers’ professional development”, “technology empowerment”, “social relevance”, and “pedagogy”. As shown in Fig. 3.

Fig. 3: Themes distribution of subjects. (1) Themes distribution of subjects in K-12 education. (2) Themes distribution of subjects in higher education.
figure 3

The first column represents topic categories, the second column contains relevant keywords, and the third row displays the subjects involved in STEM education. This figure illustrates the theme and subject distribution of STEM-related literature. Darker colors indicate a greater number of documents related to the corresponding subject and theme.

Subject theme evolution in K-12 education

Overall, STEM research topics at the K-12 level predominantly emphasize “learning outcomes”, while maintaining a relatively balanced distribution across “teachers’ professional development”, “technology empowerment”, “social relevance”, and “pedagogy”. The dimension of “learning outcomes” primarily encompassed keywords such as students’ academic performance, thinking skills, and associated influencing factors. “Teachers’ professional development” involved aspects related to teachers’ preparedness for STEM education and collaborative efforts among educators. “Technology empowerment” focused on the impact of various technologies such as modeling, robotics, programming, and augmented reality on both the teaching environment and instructional content. “Pedagogy” primarily revolved around inquiry based and game based learning. Furthermore, research related to social themes primarily aimed to foster educational equity from multiple dimensions, including aspects like gender, culture, and policy.

At the K-12 level, the theme of “learning outcomes” account for the largest proportion with 37.69%, under which the theme words included “achievement”, “self-efficacy”, “performance”, “attitudes”, “computational thinking”, “knowledge”, “creativity”, “beliefs”, “design thinking” and “cognitive-load”. In 2009, Obama proposed the “Competing for Excellence” initiative, which aimed to improve students’ achievement in STEM. This initiative has led to more researchers exploring different teaching models, activities, and tools to improve student achievement and performance. Also, students’ attitudes, knowledge, beliefs, self-efficacy, and cognitive-load were important factors influencing STEM performance and interest and have received close attention from researchers. Self-efficacy refers to one’s perceived ability to perform specific behaviors that may contain difficulties and stress (Bandura et al. 1999). Cognitive load is a multidimensional structure that represents the burden placed on a learner’s cognitive system when processing specific tasks, often appearing alongside keywords like motivation, performance, etc., in educational research with technical support (Kao and Ruan 2022).

Computational thinking, creativity (Zhan et al. 2023), and design thinking were goals of STEM education and were closely related to the disciplines. Computational thinking (CT) could be seen as a thinking pattern for solving problems with computational tools, and it is a fundamental skill required in everyday life (Wing 2006). It has the most direct relationship with computers, and the “Next Generation Science Standards” emphasized its significance by considering computational thinking as a core scientific practice. In China, computational thinking is recognized as a core competency in the curriculum standards for information technology. In addition, there is also increasing research focusing on the connection between CT and mathematics (Lv et al. 2023). Weintrop et al. defined computational thinking in mathematical and scientific practices using a taxonomy that includes four main categories: data practices, modeling and simulation practices, computational problem-solving practices, and systems thinking practices, which had a broad impact on K-12 education (Weintrop et al. 2016).

Furthermore, there was a clear association between creativity and the arts, as well as between design thinking and engineering disciplines. Some scholars argued that creativity plays one of three roles that arts assume in STEM education, with the other two being arts/esthetic learning and contextual understanding (Liu et al. 2021). Design is a prerequisite for making and the first step in the formation of STEM work, often found in studies of engineering subjects (Hernandez et al. 2014), and design thinking also plays an important role in engineering education, especially in high school (Li and Zhan 2022).

“Technology empowerment” (18.91%) was the second most popular theme, with the following themes: “modeling”, “robotics”, “programming”, “augmented reality”, and “scratch”. “Technology empowerment” emphasized the development of student literacy such as information awareness and computational thinking on the one hand, and laid the foundation for students’ STEM education practices on the other. Researchers have explored that robotics education has the potential to cultivate transferable skills in the STEM field (Nelson 2014) and narrow the gender gap in STEM, particularly by promoting girls’ learning (Zhong et al. 2023). The use of modeling tools can help students visualize abstract scientific and mathematical concepts or objects, which has a positive impact on learners’ academic and personal growth.

In addition, programming is a fundamental requirement for learning computer subjects, and the development of skills related to computer programming and robotics, as well as the introduction of computational thinking principles in STEM education, were considered by researchers as trends in today’s world (Bermúdez et al. 2019). AR (Augmented Reality) is the technology that allows virtual objects to be overlaid on real images, enriching students’ learning experiences. AR-STEM research was primarily conducted among K-12 students and typically relies on marker-based AR. However, location-based AR has significant advantages in supporting student learning beyond the classroom and facilitating scientific inquiry-based learning (Sırakaya and Alsancak Sırakaya 2022). Scratch is a graphical programming tool. In the K-12 stage, the abstract nature of programming concepts and languages makes it challenging for students to grasp them directly. Graphical programming significantly reduces the complexity of programming, making Scratch widely adopted (Kao and Ruan 2022).

The theme of “social relevance” ranked third with 17.53%, with the main themes related to “gender”, “equity”, “culture”, “policy”, “justice” and “patriotism”. Equality has always been an important topic in education, ensuring that individuals of different genders and races can participate in STEM education without discrimination. The Obama administration launched “the Teach for Innovation program” in 2009, which aimed to increase access to STEM education and employment opportunities for disadvantaged groups, and has contributed in part to researchers’ attention to gender. The topic of justice was multifaceted, with environmental justice being particularly prominent. Its purpose was to encourage readers to reframe societal and environmental issues as an ethical responsibility, fostering the construction of this responsibility through care, recognition, openness, and responsiveness to both human and non-human vitality (Kayumova et al. 2019).

Furthermore, since STEM education was a national priority, many researchers have analyzed the development of STEM education through policy analysis (Zhong et al. 2022), particularly focusing on different countries and regions such as South Korea (Park et al. 2016), the United States, Europe (Subotnik et al. 2017), India, Australia (Sharma and Yarlagadda 2018), etc. In South Korea, researchers have combined history education with traditional STEM education to inspire students’ patriotism (Park and Cho, 2022).

STEM education originated in the United States, and its evolution is determined by a variety of factors, including national economy, politics, and culture (Zhong et al. 2022). As STEM education was increasingly promoted worldwide, it faced challenges of cultural conflicts and international exchanges. “Culture” was a broadly encompassing term, and research about culture could be divided into two categories. First, it served as a research methodology, such as sociocultural theory, exploring social issues like gender and race and aiming at promoting educational equity for students of diverse cultures and languages (Eisenhart and Allen 2020).

Second, culture served as the background and content carrier for STEM activities. In China, researchers have developed C-STEAM, or culturally oriented disciplinary integration education, based on STEM education and considering the reality and needs of China’s development. This concept emphasized exploring and creating cultural concepts using related disciplines in the context of traditional Chinese culture, cultivating students’ humanistic spirit, and enhancing their cultural identity and understanding. At the same time, C-STEAM embodied the nurturing value of cultivating students’ core literacy, the carrying value of passing on excellent traditional culture, and the social value of creating a culture with regional characteristics. On this basis, the researcher proposed the ETIC curriculum classification framework and 6 C implementation model, which provided a reference for promoting the construction and development of the regional C-STEAM curriculum. (Zhan et al. 2020, 2021; Huo et al. 2020).

“Professional development” ranked fourth with 15.62%. The theme words related were “knowledge”, “professional development”, “attitudes”, “conceptions”, “beliefs”, “teacher preparation”, and “teacher collaboration”. Researchers have indicated that changing teachers to interdisciplinary teaching requires first developing the skills and attitudes of interdisciplinary teaching, and professional development (PD) was considered a key component to helping teachers through this transition process (Al Salami et al. 2017). The link between teacher preparation to teach STEM and student STEM achievement has motivated researchers to develop professional development programs to address teacher confidence, attitudes, knowledge, pedagogy, and other preparation issues (Nadelson et al. 2013). Understanding the beliefs held by educators was central to influencing change and improving instruction, so researchers needed to be able to design educational programs that address teachers’ beliefs and work to change them when appropriate (Nathan et al. 2010; Vossen et al. 2020).

Furthermore, there was still considerable uncertainty about “what STEM education is” and “what it means” in terms of curriculum and student achievement, research and discussion on the concept of STEM aimed to create a shared concept of STEM education to facilitate dialog between different stakeholders (Dare et al. 2019; Holmes et al. 2018). The above topics can all be categorized as preparations for STEM education, primarily referring to pre-service and in-service STEM teacher training. In addition to the mentioned content, this also included language training, relevant technical learning, and teaching methods. Furthermore, due to the interdisciplinary nature of STEM education, collaboration among teachers from multiple disciplines was necessary, especially when humanities and social sciences were involved (Park and Cho 2022). Therefore, teacher cooperation was also an important way for teachers’ professional development.

“Pedagogy” received the least attention (10.25%). The theme words related were “inquiry based learning”, “game based learning”, “project based learning”, and “self-regulated learning”. Game based learning demonstrated a close association with technology and computers. Nowadays, students are generally passionate about electronic games, however, they often lack sufficient computer programming knowledge and skills, which limits their development in the computer and technology fields. To address this issue, game based learning has received significant attention in the K-12 stage. The purpose of inquiry based learning was to cultivate students’ inquiry skills, which was also at the core of the science curriculum. In STEM education, this method was considered to have three components: data analysis, interpretive reflection, and critical reflection. Using inquiry based learning could integrate various disciplines, enhance educators’ attitudes, and it’s also suitable for the special needs of gifted students (Abdurrahman et al. 2019).

STEM PBL (STEM Project-Based Learning) is a student-centered teaching approach based on constructivism, characterized by clear outcomes and vaguely defined tasks (Capraro and Slough 2013). STEM PBL activities are fundamentally interdisciplinary, encouraging students to construct knowledge, identify problems independently, and collaborate to solve them (Han et al. 2015). Self-regulated learning (SRL) refers to an active, iterative process in which learners achieve their goals by controlling, monitoring, and adjusting their cognitive/metacognitive processes and learning behaviors. This approach was effective in activating and monitoring learners’ behaviors, cognitions, and emotions, which is crucial for task performance in the STEM field (Li et al. 2020).

Through the above analysis, it is evident that research topics in different disciplines have varying emphases. “Achievement” and “gender” were highly popular topics in the scientific community. Additionally, in the fields of math, physics, chemistry, and biology, there was a greater emphasis on “technological empowerment” and “pedagogy”. Technology placed the most emphasis on “modeling”, while computer science was concerned with “computational thinking”. Engineering exhibited a relatively even distribution of research topics. In contrast, the focus areas within humanities and social sciences were relatively scattered.

Subject theme evolution in higher education

In comparison to the K-12 level, research theme distribution in higher education appeared to be more concentrated. This was primarily manifested in the prevalence of research related to “learning outcomes” and “social relevance”, which collectively account for over three-quarters of the total research. Conversely, research areas focusing on “teachers’ professional development”, “technology empowerment”, and “pedagogy” were relatively scarce. However, from a disciplinary perspective, research topics in the humanities and social sciences at the higher education level exhibited greater diversity and richness.

“Social relevance” was the most popular theme in higher education research (47.31%). The research content could be broadly categorized into three types. The first category was educational equity and justice, including keywords “gender”, “identity”, “stereotype threat”, “race”, “equity”, “minority”, and “marginalized populations”. STEM identity is an expressed connection between one’s self and STEM, which depends on the individual’s beliefs about their abilities and their conceptual and practical knowledge of their particular STEM subject (Charleston et al. 2014). Enhancing the self-identity of minority groups and optimizing the experience of marginalized populations, especially females, contributed to their more active participation in STEM education. Stereotype threat is a risk experienced by individuals in which individuals fear that they will validate negative stereotypes of the group to which they belong (Spencer et al. 1999). Stereotype threat has been shown to have a significant impact on the likelihood of women, minorities, and white men leaving STEM professions (Beasley and Fischer 2012).

The second category was students’ career development, including the keywords “career” and “choice”. Career orientation was more prominent at the higher education level than at the K-12 level, with researchers focusing on career goals, career preparation, the position of STEM talent in the labor market, major selection, and attrition.

The third category was culture-related research, which, in higher education, connected with various humanities and social sciences disciplines such as psychology, philosophy, history, linguistics, and more. Research in this category focused on promoting educational equity and students’ full participation in STEM education by addressing the fair treatment of students from different sociocultural backgrounds and using “culturally responsive pedagogy”. This approach involved leveraging the cultural characteristics, experiences, and perspectives of ethnically diverse students to teach them more effectively, fostering educational equity and comprehensive engagement in STEM education (Gay 2003).

“Learning outcomes” was also a theme that received a lot of attention in higher education, with 30.47%. The related themes included “achievement”, “performance”, “self-efficacy”, “motivation”, “persistence”, “innovation”, “critical thinking”, “computational thinking”, “creativity”, and “digital skills”. It was evident from this that higher education was not only concerned with issues such as students’ achievement, performance, and computational thinking but also paid attention to influencing factors such as students’ self-efficacy and motivation. How to sustain students in STEM majors and reduce attrition of STEM majors, especially among minority and female populations, was a concern in studies related to “persistence” (Burt et al. 2019; Ong et al. 2018).

Compared to the K-12 stage, higher education placed less emphasis on computational thinking and creativity but focused more on innovation and critical thinking. Creativity refers to “the generation of novel and useful ideas by an individual or a small group of individuals” while innovation is “the successful implementation of creative ideas within an organization” (Amabile 1988). The distinction between creativity and innovation lies in the emphasis on products and outcomes in innovation. Higher education demands that students not only have creative ideas but also successfully transform these ideas into scalable products. In contrast, K-12 education placed more emphasis on encouraging students to generate new ideas. Besides, Critical thinking was another important developmental goal at the higher education level. It served as a method and tool for problem-solving, conceptualized as purposeful, self-regulated judgment involving various thinking skills such as analysis, evaluation, and reasoning (Gadot and Tsybulsky 2023).

Digital skill is a concept encompassing skills and specific techniques that are necessary for the use of effective digital technology (van Laar et al. 2019). In research, various terms were used to describe the ability to use digital technology effectively in learning activities, such as digital skills, technical skills, digital literacy, digital competence, digital tools, 21st-century skills, ICT literacy, and ICT skills. Studies have shown a positive correlation between students’ digital skills and their creative self-efficacy, and higher levels of digital skills were often predictive of higher levels of actual performance (Chonsalasin and Khampirat 2022).

“Technology empowerment” was ranked third with 11.53%, and the related themes were “modeling”, “robotics”, “programming”, “augmented reality” and “virtual reality”. Modeling is a useful tool to identify current problem situations, predict future societal changes, and identify possible solutions (Suh and Han 2019). Programming was considered to be related to problem-solving and the main pedagogical challenge was the lack of appropriate methods and tools as well as scaled and personalized instruction (Medeiros et al. 2019). Robots were often used in the classroom to develop students’ human-machine collaboration skills (Mathers et al. 2012).

Augmented Reality (AR) refers to the technology that enhances virtual information in the real environment through ongoing activities and user input, while “Virtual Reality (VR)” is the technology that immerses users in a purely virtual environment. The learning environments created by VR and AR technologies contributed to the formation of collaborative, interactive, and highly immersive learning experiences, thereby enhancing the efficiency of learning for learners (Zhong et al. 2021). Additionally, they demonstrated the potential to help students improve their cross-cultural communication skills (Akdere et al. 2021).

“Teachers’ Professional Development” was ranked fourth with 6.11% of the total, and related terms were “faculty training”, “professional development”, and “educational innovation”. Faculty training and professional development were broadly defined terms, and there was a significant degree of overlap in their research content. They encompassed research related to teacher development (such as teacher reflection and active learning), diversity and equity issues among the teaching staff, curriculum design, teaching methodologies, and pedagogical knowledge. Research related to educational innovation encompassed the introduction of new educational technologies, teaching methods, curriculum designs, and assessment approaches to address evolving learning needs and societal challenges.

“Pedagogy” was the least studied topic (4.58%), with related themes including “collaborative learning”, “active learning”, “experiential learning”, “game based learning”, and “positive learning”. Collaborative learning played a significant role in enhancing the likelihood of successful problem-solving. Additionally, collaborative skills are crucial for individuals pursuing STEM careers. Active learning is a method characterized by students taking control of their learning to some extent through metacognition, self-assessment, and reflection, within student-centered and inquiry based learning approaches (National Research Council et al. 2000; Kuh 2008). The American Association for the Advancement of Science encouraged university science educators to shift their teaching from traditional lectures to active learning (American Association for the Advancement of Science 2011).

Experiential Learning is an educational approach that emphasizes acquiring knowledge and skills through first-hand experiences, practice, and reflection, often in forms such as teaching, research, and internships. Experiential learning can facilitate the transfer of classroom learning to real-world practice and has the potential to enhance students’ learning, motivation, skill development, and graduation rates (Gong et al. 2022). Game based learning was not very common in higher education, and research in this area was quite scattered, covering topics such as computer-based learning and the creation of diverse and inclusive learning environments. The origins of positive learning can be traced back to the early days of the positive psychology movement, to promote students’ overall well-being, not just the imparting of knowledge and skills, but also the cultivation of their positive psychological traits and qualities (White 2016).

Undoubtedly, in higher education, almost all disciplines focused their research on “learning outcomes” and “social relevance”. Among these, the most emphasized areas included students’ performance, diversity, equity, and career development. Furthermore, engineering placed a significant emphasis on programming and robotics technology; mathematics and technology prioritized students’ self-efficacy, motivation, persistence, and programming skills. Chemistry, on the other hand, exhibited a unique pattern by showing less focus on learning outcomes but a greater emphasis on technology integration and pedagogy. The arts concentrated more on technology integration and social relevance. However, many other disciplines lacked a substantial focus on teacher professional development.

Comparing the evolution of subject themes at different educational levels

From the above analysis, it can be found that the distribution of research topics in K-12 education was relatively balanced, while in higher education, it was more concentrated. However, in higher education, research in the humanities and social sciences was more in-depth, and the distribution of themes was more extensive. The research hotspots at the two levels have shown the following differences.

Overall, in the K-12 stage, “learning outcomes” received the most attention, while career education for students was lacking. In higher education, “learning outcomes” and “social relevance” were the most emphasized aspects, while “teachers’ professional development” and “pedagogy” were relatively neglected.

Specifically, concerning “learning outcomes”, achievement, performance, and self-efficacy were common topics across different educational levels. K-12 education placed more emphasis on computational thinking, creativity, and design thinking, while higher education focused more on innovation and critical thinking. Regarding “teachers’ professional development”, higher education paid relatively less attention to teachers and their development, lacking a systematic body of research. In “technology empowerment”, technologies in the research were highly similar, but there was a greater volume of publications in K-12 education. The knowledge or tools learned were also more foundational and straightforward at this level. In the realm of “social relevance” research, gender, equity, and culture were common topics of interest, but higher education delved into students’ career choices and development, an area that lacked emphasis in K-12 education. In terms of “pedagogy” research, K-12 education primarily focused on inquiry based learning and game based learning, while higher education emphasized collaborative learning and active learning.

Discussion

This study analyzed and compared the development of the STEM research field in two aspects: subject integration and subject themes distribution, to clarify the STEM subject orientation and the ecological map of subject integration in the STEM field.

Referring to RQ1, the subject time distribution maps were used to find out how subjects integrated into STEM education at the K-12 and higher education levels. From the above analysis, it is clear that subject integration followed the evolutionary path of science, technology, engineering, and mathematics to the addition of social sciences and humanities. The addition of the latter has qualitatively improved the connotation of STEM education and fundamentally changed the subject integration path. In other words, the field of STEM studies has expanded from science education to the whole education field, and the cross-fertilization of subjects has become its most fundamental feature. This conclusion has been corroborated by existing research and policies (Perignat and Katz-Buonincontro 2019; Zhan et al. 2022a).

Referring to RQ2, the subject themes distribution maps at the K-12 and higher education levels reflected the main research content of STEM education. Research themes were not evenly distributed, especially since the research on “learning outcomes” was much more than the research on “teachers’ professional development” and “pedagogy”, which implied that the current attention to STEM teachers was insufficient. Previous research indicated that teacher education programs lack content related to interdisciplinary integration across different subject areas and do not provide suitable activities for integrating STEM education (Türk et al. 2018). In addition, although K-12 education started late, it has developed rapidly due to the promotion of policies and the future needs of society, but there is still much room for expansion of its research scope, especially career issues. In recent years, with the further development of globalization, student diversity has become evident not only in higher education but also in K-12 education. Research has shown that multicultural education and culturally supportive teaching contribute to addressing the persistent inequalities in the field of STEM education (Charity Hudley and Mallinson 2017).

STEM education has obvious interdisciplinary characteristics, in which different subjects play different roles, as shown in Table 1. The essence of science subjects is to understand the objective laws of the world, and science education aims to help students understand the world through inquiry methods, knowledge is the key to its teaching. The essence of technology is the application of knowledge scenarios, and technology achieves the purpose of transforming the world by manipulating and optimizing the variables that affect the results (products), the key to its teaching is the acquisition of skills. Engineering is the integrated application of technology, and its purpose is also to transform the world, but unlike technology, engineering places more emphasis on the coordination of all elements within the system to find the optimal solution to the problem, and engineering operates and optimizes the variables that affect the system to achieve the purpose of system optimization. The essence of mathematics is measurement and calculation, which develops itself through abstract, non-empirical mathematical operations and heuristic logical deduction, and can provide the logical and calculative basis for other subjects, and the key to its teaching is calculation, measurement, and logical deduction.

Table 1 STEM subject orientation.

Unlike the above subjects, the essence of humanities and social sciences is to feel, interpret, and create the man-made world. It contributes to the all-around development of human beings, the enhancement of moral values and cultural identity, and the development of creative and innovative thinking through the unity of awareness, expression, values, and emotions, the key to teaching is tasting, designing, and creating. In addition, there is a slight difference between the humanities and social sciences. The social sciences involved in STEM fields mainly reflect on the social issues that exist or are raised in STEM education from the perspective of research, but are less reflected in the teaching of the subjects, such as psychology. The involvement of the humanities is mainly reflected in the teaching of the subjects, and the educational goals are achieved through teaching students to appreciate the appeal and value of the arts.

The STEM education research ecosystem comprises two parts. The upper elliptical portion reveals the distribution of disciplines and research topics, while the lower timeline illustrates the timeline of interdisciplinary integration. The central part of the ellipse indicates the disciplinary composition of STEM education. Science, oriented towards exploration, forms the foundation of STEM education. Engineering, driven by creativity and innovation, plays a crucial role in fostering students’ creativity and innovation. Science and engineering mutually reinforce each other and progress together. Technology provides the tools and support for STEM education, while mathematics serves as the computational foundation, collectively facilitating STEM education activities.

STEM education, through interdisciplinary teaching, emphasizes the cultivation of students’ higher-order thinking skills, such as scientific thinking, design thinking, engineering thinking, and computational thinking. The outermost circle includes other disciplines involved in STEM education, such as arts, economics, history, political science, linguistics, psychology, philosophy, physics, biology, computer science, environmental studies, chemistry, and more. This demonstrates the trend in STEM education shifting from STEM to STEAM (Science, Technology, Engineering, Arts, and Mathematics) and the integration of science, technology, engineering, mathematics, and social sciences in education. The pink and blue sections represent the distribution of research topics in the K-12 and higher education stages.

From the above analysis, we could outline the ecological map of STEM subject integration in terms of subject integration and subject themes distribution, as shown in Fig. 4, which demonstrates the subject integration and main research contents of STEM education.

Fig. 4: Ecological map of subject integration in STEM.
figure 4

This figure is composed of two parts, with the upper part representing the content dimension, and the lower part representing the time dimension. The pink area within the ellipse illustrates the most prominent research themes in the K-12 stage, while the blue area illustrates the most prominent research themes in higher education.

Conclusion and future research

Based on the literature related to STEM education in the WOS database from 2004 to 2023, covering 903 papers at the K-12 level and 873 papers at the higher education level, this study conducted a bibliometric analysis from the perspective of subject evolution, including subject timeline evolution analysis and subject theme evolution analysis, to reveal the subject evolution trends and research hotspots in STEM education. The following conclusions were reached.

First, regarding subject integration, the interdisciplinary and cross-subject collaboration in STEM education was constantly expanding and deepening, forming a new situation in which science, engineering, humanities, and social sciences are integrated. Since 2004, a total of 16 subjects have been involved, among them, arts, physics, chemistry, biology, computer science, and environmental science were the main integrated subjects. Interdisciplinary integration promoted the innovation and development of STEM education research.

Second, regarding the research themes, humanism was more and more emphasized in STEM education. In the temporal evolution of subjects in STEM education, it was found that the research outputs of humanities and social science subjects such as arts, psychology, and philosophy kept increasing. The cultural themes have enriched the diversity of participants and the uniqueness of regions in STEM education research, viewed from perspectives such as theory, teaching methods, and regional development. “Social relevance” has garnered significant attention across different educational levels. In K-12 education, research topics were relatively balanced, but there was a lack of research on students’ career choices and development. In higher education, research topics in the humanities and social sciences were more diverse in their distribution.

To sum up, this study analyzed the developmental lineage of STEM education, focusing on the subject roles, and hot topics of research, and summing up potential guidance for subsequent subject integration research. Future work should prioritize the articulation of STEM subject integration between K-12 education and higher education. At the K-12 level, it is necessary to enhance vocational education appropriately, while in higher education, reducing the attrition rate of STEM majors may become a crucial issue. Additionally, attention to multi-discipline teacher collaboration and professional development, high-quality curricula design, and regional policy support should continue to be emphasized. Moreover, different countries present different characteristics in the development of STEM education due to their different cultural, political, and economic backgrounds. In future studies, we aim to conduct a comparative study on the development of STEM education on a country-by-country basis.