The evolution of knowledge within and across fields in modern physics

Sun, Ye; Latora, Vito

doi:10.1038/s41598-020-68774-w

Download PDF

Article
Open access
Published: 21 July 2020

The evolution of knowledge within and across fields in modern physics

Scientific Reports volume 10, Article number: 12097 (2020) Cite this article

7293 Accesses
22 Citations
Metrics details

Subjects

Abstract

The exchange of knowledge across different areas and disciplines plays a key role in the process of knowledge creation, and can stimulate innovation and the emergence of new fields. We develop here a quantitative framework to extract significant dependencies among scientific disciplines and turn them into a time-varying network whose nodes are the different fields, while the weighted links represent the flow of knowledge from one field to another at a given period of time. Drawing on a comprehensive data set on scientific production in modern physics and on the patterns of citations between articles published in the various fields in the last 30 years, we are then able to map, over time, how the ideas developed in a given field in a certain time period have influenced later discoveries in the same field or in other fields. The analysis of knowledge flows internal to each field displays a remarkable variety of temporal behaviours, with some fields of physics showing to be more self-referential than others. The temporal networks of knowledge exchanges across fields reveal cases of one field continuously absorbing knowledge from another field in the entire observed period, pairs of fields mutually influencing each other, but also cases of evolution from absorbing to mutual or even to back-nurture behaviors.

A mathematical model for the process of accumulation of scientific knowledge in the early modern period

Article Open access 25 August 2023

Maryam Zamani, Hassan El-Hajj, … Matteo Valleriani

Papers and patents are becoming less disruptive over time

Article 04 January 2023

Michael Park, Erin Leahey & Russell J. Funk

Investigating patterns of change, stability, and interaction among scientific disciplines using embeddings

Article Open access 22 August 2022

Barbara McGillivray, Gard B. Jenset, … Donna Schut

Introduction

Knowledge creation and knowledge sharing go hand in hand. Knowledge is in fact created through combination and integration of different concepts, and can benefits from social interactions and interdisciplinary collaborations. Recent works have explored from many angles how knowledge flows across scholars^1,2,3,4,5, institutions^6,7,8,9 and disciplines^10,11,12. In particular, it has been shown that knowledge exchange across fields can influence the evolution of culture and language^13,14, strengthen multi-faceted cooperation^3,15, and drive the innovation and development of science^16,17,18,19. Research publications are one of the primary channels of communication for the exchange and spreading of knowledge in science²⁰. By publishing their own articles and citing works by their peers, researchers continuously contribute to the processes of knowledge creation, knowledge sharing and knowledge acquisition²¹, thereby promoting the advancement of science. The presence of a citation between two research articles often denotes a certain transfer of knowledge from the cited article to the citing articles. It is therefore natural to use citations between articles published in different scientific fields to investigate the flow of knowledge across different domains of science. Despite some works in this direction have already started elucidating the main mechanisms of knowledge sharing and diffusion^22,23,24, a systematic study on how knowledge evolves in time²⁵ and of the complex interactions and influences between different fields²⁶ is still lacking.

In this article, we propose a novel framework to detect and quantify relevant transfers of knowledge across disciplines and between different time periods. One of the outcomes of the method is the construction of a time-varying network mapping the structure of knowledge and the relations between disciplines. In particular, we present an application to study the evolution of scientific knowledge in modern physics, namely to investigate how influences from one field of physics to another have evolved over time in the last 30 years. Building on bibliographic information of over 430,000 articles published by the American Physical Society (APS) between 1985 and 2015, and making use of the highest-level Physics and Astronomy Classification Scheme (PACS) codes, which indicate the fields of physics an article belongs to, we construct a temporal network where the nodes represent the fields of modern physics and the directed links denote the presence of a significant dependence of a field on another. Such a network is changing over time and, as we will show below, its analysis by the methods of network science^27,28 is able to reveal essential properties of how knowledge is exchanged among fields and over different time periods. We have found that, overall, knowledge flows have become increasingly homogeneous over the last years, indicating the important role of interdisciplinary research^{10,11,29,30,31}. In spite of this, some typical patterns of influence, such as cases of one field absorbing knowledge from another field, or two fields mutually influencing each other, clearly emerge at the microscopic scale. Our findings provide insights into the basic mechanisms of knowledge exchange in science, and can turn very useful to understand the dynamics of scientific production and the growth of novelties in scientific domains^32,33,34.

Results

The fields of modern physics

An overall insight into the main fields of modern physics can be obtained by a basic analysis of the characteristics of the APS data sets. Considered at their highest level, PACS codes divide modern physics into ten major fields (see Table 1). A measure of the relevance of each field can then be derived from the volume of papers published in each field. Since each paper can be listed with multiple PACS codes we assign it to multiple fields. We therefore consider each paper as one unit of knowledge and define the field composition of the paper as the relative frequency of its PACS codes. For instance, if a paper is listed with three PACS codes ${\textit{89.75.-k}}$, ${\textit{81.05.-t}}$ and ${\textit{05.45.-a}}$, we assign two-thirds of this paper to Interdisciplinary Physics (PACS 80), and the remaining one-third to General Physics (PACS 00). The total number of papers $N_{paper}$ associated to each field over the entire time period of 31 years is reported in Table 1. One can see that the three largest fields are Condensed Matter (PACS 60 and 70) and General Physics (PACS 00), capturing $57\%$ of the entire publications in APS journals. GPE is the smallest field, with only 8325 papers, which is roughly one-fifteenth the size of the largest field CM2. To quantify and compare the growth rate of each field, the average yearly change in the number of papers $\Delta N_{paper}$ is also reported in Table 1. Consistently with the rankings based on field sizes, GEN and CM2 also exhibit the highest growth rate, larger than 100 papers per year, while GPE shows the slowest increase, with only 5 papers per year on average. On the contrary, CM2, the third-largest field by size, is ranked fourth from the bottom according to the average growth $\Delta N_{paper}$. An opposite trend is observed for Interdisciplinary Physics and Astrophysics, which respectively take fifth and sixth place according to their growth rates, although their field sizes are ranked eighth and ninth among these fields, reflecting their rapid development during the observing period.

We found that $91\%$ of the papers have more than one PACS code, with $36\%$ of them reporting PACS codes which are at least from two different fields. To quantify the level of interdisciplinarity of a given field, we have collected all the papers with at least one PACS code from that field, and then calculated the proportion J of these papers which are also classified by at least one PACS code from other fields. The results in Table 1 show that Interdisciplinary Physics is the field with the largest value of J: almost $90\%$ of the Interdisciplinary Physics papers are also classified by PACS codes from other fields of physics. This result is consistent with the expectation that interdisciplinary research combines knowledge from various disciplines. Instead, papers in the fields of Nuclear, Particles and Condensed matter 2 physics are more likely to use PACS codes from their own fields. Summing up, the above analyses indicate that the differences between fields of physics are remarkable, either in terms of the size and growth of the fields, and in terms of their interactions with other fields.

Table 1 The ten fields of modern physics.

Full size table

The knowledge flow network

Interactions among scientific fields can be better characterized by making use of scientific citations. A published article in a scientific field citing articles of another field implies that the cited field reflects a piece of previously existing knowledge that the citing field builds upon. And this, in turn, indicates a flow of knowledge from the cited field to the citing field. Hence, we can construct a network of knowledge flow across fields by analyzing the pattern of citations among papers of different fields. The nodes of such a network represent the ten fields of modern physics as indicated by the PACS codes, while the directed links between fields denote the flows of knowledge from one area of physics to another.

Specifically, for a given citation c there will be a transfer of knowledge from each PACS code in the cited reference to all the PACS codes in the citing article. We hence indicate as $f^{c}_{\alpha \rightarrow \beta }$ the volume of knowledge flow from field $\alpha$ to field $\beta$ due to citation c. As shown in Fig. 1a, this is calculated as the product of the proportion of PACS codes from field $\beta$ in the citing article times the proportion of PACS codes from field $\alpha$ in the cited article. This ensures the normalization $\sum _{\alpha }\sum _{\beta }{f}^{c}_{\alpha \rightarrow \beta }=1$ for each citation c, meaning that each citation contributes a unit of knowledge transfer that is then split to the different fields. For instance, in Fig. 1a, two of the three PACS codes of the cited article belong to field $\alpha$, while one over two of the PACS codes in the citing article is from field $\beta$. Consequently, we assume that the volume of knowledge flowing from field $\alpha$ to field $\beta$, due to citation c, is $f^{c}_{\alpha \rightarrow \beta }=2/3 \times 1/2$. Similarly, we can calculate the quantities $f^{c}_{\alpha \rightarrow \alpha }$, $f^{c}_{\beta \rightarrow \alpha }$ and $f^c_{\beta \rightarrow \beta }$. In order to characterize the flow of knowledge across fields and to study its evolution over the years, we construct yearly aggregated networks by selecting different pairs of years for citing and cited articles respectively. This is done by analyzing all the citations from papers published in a given year t to papers published in year $t-n$, and defining the total volume of knowledge flowing from field $\alpha$ to field $\beta$ as:

$$F_{\alpha \rightarrow \beta }^{t-n \rightarrow t} = \sum _{\mathrm{cit}} f^{c}_{\alpha \rightarrow \beta }$$

(1)

where the sum runs over all citations c from papers published in field $\beta$ in year t to papers published in field $\alpha$ in year $t-n$. Notice that n is a tunable parameter, denoting the relative age of cited papers with respect to the citing year t. Having the possibility to vary both t and n allows to take into account that the probability of a citation is influenced by: (1) the relative age of the two papers³⁵, and by (2) the number of papers published in the cited year $t-n$. The quantities $F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}$ are, however, affected by field-specific characteristics and publishing conventions, such as typical field sizes and time-varying growth rates, which, as shown in Table 1, may vary a lot from field to field. Some other influencing factors that are not discussed here might exist, such as the difference of reference list length across fields, which could be considered in more detailed study. Hence, an increase of $F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}$ over time does not automatically reflect a closer relation between fields $\alpha$ and $\beta$, as it can only be due to a rapid growth in the number of publications in these two fields. In order to account for this, we define the statistical significance $\phi (\alpha _{t-n},\beta _{t})$, which quantifies how the observed knowledge flow $F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}$ exceeds the flow expected in a opportunely chosen null model (see the “Methods” section). Figure 1b illustrates an example of how to calculate the quantities $\phi (\alpha _{t-n},\beta _{t})$ in Eq. (5). Suppose, for instance, field $\alpha$ in year $t-n$ provides a total of 75 units of knowledge to all the fields in year t, with field $\alpha$ itself receiving 50 of these 75 units, and $\beta$ getting 25, i.e. $F_{\alpha \rightarrow \alpha }^{t-n \rightarrow t}=50$ and $F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}=25$. Analogously we assume field $\beta$ in year $t-n$ provides a total of 25 units to year t, 15 to field $\beta$ itself and 10 to $\alpha$. Now, the marginal probability that the citing field is $\beta$ can be obtained as $Pr(X_{t}^{citing}=\beta )=\sum _{\alpha } F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}/\sum _{\alpha ,\beta } F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}=(25+15)/(75+25)$, while $Pr(Y_{t-n}^{cited}=\alpha | X_{t}^{citing}=\beta )= F_{\alpha \rightarrow \beta }^{t-n \rightarrow t}/\sum _{\alpha } F_{\alpha \rightarrow \beta }^{t-n \rightarrow t} = 25/(25+15)$. We therefore have $P(\alpha _{t-n}, \beta _{t}) = 25/40 \cdot 40/100$. Such a probability needs to be compared to that of a null model in which $P^{\mathrm{rand}}(\alpha _{t-n}, \beta _{t})=Pr(Y_{t-n}^{cited}=\alpha ) \cdot Pr(X_{t}^{citing}=\beta )= 80/150 \cdot 40/100$, since the probability $Pr(Y_{t-n}^{cited}=\alpha )$ that the cited paper in year $t-n$ is in field $\alpha$ is equal to $80/(80+70)$. Finally, the ratio $\phi (\alpha _{t-n},\beta _{t})$ in Eq. (5) is equal to $25/40 \cdot 150/80$.

Analogously, we can calculate the statistical significance of all the other flows reported in Fig. 1b. Such quantities allow to capture the intrinsic variation of knowledge flows among fields and also to compare different pair of fields. Finally, to obtain the weights of knowledge flows in year t from a cited time window $\Delta t'$, we define the flow weights $w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}$ for each couple of cited field $\alpha$ and citing field $\beta$ as:

$$\begin{aligned} w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}= \frac{1}{\vert \Delta t' \vert }\sum _{n \in \Delta t'}{\phi (\alpha _{t-n}, \beta _{t})} \end{aligned}$$

(2)

where $\vert \Delta t' \vert$ is the length of the time window. Let $\Delta t'=[1, 5]$ and $\vert \Delta t' \vert =5$, then one can construct the significant knowledge flow network in each year t from the previous 5 years. Furthermore, for each given source period $\Delta t'$, one can also investigate the knowledge flows within an observing period $\Delta t$:

$$\begin{aligned} w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow \Delta t}= \frac{1}{\vert \Delta t \vert }\sum _{t \in \Delta t} w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t} \end{aligned}$$

(3)

For example, let $\vert \Delta t \vert =5$, we can divide the entire time period into five observing time windows, namely [1990, 1994], [1995, 1999], [2000, 2004], [2005, 2009] and [2010, 2014]. The weight of each link in the network reflects how significant the knowledge flows between two related fields. This quantitative framework enable us to investigate the evolution of knowledge flows in two time dimensions: (1) for each given observing period $\Delta t$, the weights of knowledge flows from different time interval $\Delta t'$ can be observed; and also (2) for each fixed $\Delta t'$, the weights of knowledge flows within different observing period $\Delta t$ can be compared. Although in this paper we have studied knowledge flows across the ten major fields of physics, we believe that our framework can also give important information when applied to investigate knowledge transfer among subfields at any possible level of hierarchy.

Temporal analysis of knowledge flow networks

We first investigate how the overall properties of the knowledge flow networks have changed over time. Specifically we have evaluated, for each year, the flows of knowledge from the previous 5 years, i.e we have fixed $\Delta t'=[1,5]$ and $\vert \Delta t \vert =1$ in our framework. To better visualize the temporal changes, the whole knowledge flow networks obtained for the 3 years 1990, 2000 and 2010 are reported in Fig. 1c. Links representing a significant flow of knowledge ($w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}>1$) are shown in red color.

The first thing to notice is that the number of significant links is roughly constant over the years, as also illustrated in Fig. 2a. In addition to this, we observe that more links are reciprocated in 2000 with respect to years 1990 and 2010, which suggests that situation in which couples of fields mutually influence each other are more common 2000. To further examine this, we have computed the network reciprocity (see “Methods” section) for each year. The results reported in Fig. 2b indicate that the value of the reciprocity $\rho$ has increased in the first few years, reached a peak around 1998, and then has begun to decrease in the following years. This has lead us to conclude that the highest levels of mutuality in knowledge transfer among different fields of physics have been experienced between 1995 and 2000.

We have then extracted the typical patterns of knowledge transfer in the network. For this reason, we have focused on the statistically significant three-node motifs in the knowledge flow networks³⁶, i.e. the directed connected subgraphs of three nodes that appear in the network more often than they would occur by chance. Figure 2c illustrates the Z-scores (see the “Methods” section) of two relevant three-node motifs over the years. One can see that the subgraph represented by bi-directed paths (diamond symbol) is the most significant motif throughout the whole time period, with a Z-score on average equal to about 6. Furthermore, complete subgraphs of three nodes, corresponding to three mutually connected fields of physics, are only statistically significant in the period from 1998 to 2000, where the complete subgraph GEN, EOA and IPR appears. Notice that this period also corresponds to the time period of high reciprocity in Fig. 2b.

In addition, Fig. 1c indicates that there are fewer links with large weights in year 2010 than in 1990 and 2000. To further investigate this trend, Fig. 2d reports mean $\overline{w}$ and standard deviation $\sigma _w$ of the weights of significant links between 1990 and 2016. We find that both the values of $\overline{w}$ and $\sigma _{w}$ gradually decrease overtime and eventually stabilize to values slightly above 1 and 0 respectively. This indicates that the exchange of knowledge across domains has increasingly become more homogeneous with respect to the beginning of 1980s, when each field only absorbed knowledge from a handful of close domains. From a different perspective, this also reflects a rise of the interdisciplinary character of research in physics.

Internal knowledge flows

The weights of the internal flows $w_{\alpha \rightarrow \alpha }^{\Delta t' \rightarrow t}$ from a field $\alpha$ to itself are an indication of the degree of self-dependence of the research field. To investigate the evolution of the internal knowledge flows, we have computed, for each of the ten fields of physics, the weights of the internal flows in every observing year t (between 1990 and 2015) from each of the previous 5 years, namely adopting a cited time window $\Delta t'=[1,5]$. Figure 3a indicates that the internal knowledge flows are significant ($w_{\alpha \rightarrow \alpha }^{\Delta t' \rightarrow t}>1$) for all ten fields over the whole 21-year period of time, although the temporal trends can vary from field to field. The two fields with the largest variations are GAA and GPE. Field GAA exhibits a remarkable decrease in the degree of self-reference after 1993, indicating that in this field the internal transfer of knowledge has become less and less significant over time. Conversely, field GPE shows an increasing trend and becomes the most self-referential field after 1995. Other fields exhibit decreasing (EOA and IPR) or increasing (NUC and ATM) patterns, while the contribution of internal flows are relatively low and keeps nearly constant for fields such as GEN, CM1 and CM2.

In Fig. 3b–e we focus on the evolution of internal flows of the four fields EPF, NUC, IPR and GAA. In particular, we perform a two-dimensional analysis in which we change the positions of both the observing time windows $\Delta t$ and the source time window $\Delta t'$. We consider the case where the lengths of the two time windows is the same and is equal to 5 years. The colours in Fig. 3b–e represent the values of internal knowledge flows $w_{\alpha \rightarrow \alpha }^{\Delta t' \rightarrow \Delta t}$. By looking at the variations of colours in each row we find that field NUC shows an increasingly high degree of self-reference over time, while IPR and GAA tend to lower their degree of internal flows, which is consistent with the results in Fig. 3a.

By looking at the variation of colours over each column of Fig. 3b–e we can instead investigate the influence of reference’s age on the internal flows of knowledge. One can see that fields such as NUC and IPR show a decreasing trend from most recent times to the past, in agreement with previous studies stating that the likelihood of a paper being discovered significantly decreases with the papers’ age³⁷. By contrast, we observe an unexpected and very clear pattern for EPF and GAA, since both fields exhibit a maximum of the values along the anti-diagonal line. Notice that each square along the anti-diagonal line represents the same cited time window, namely a time window of 5 years before the period [1990, 1994]. This may be due to important discoveries and the publication of pioneering research works in the fields EPF and GAA during the period [1985, 1990] which would clearly increase the probability for researchers in the field to cite, in the following years, papers published in that period. A possible explanation can be for instance the rapid development in the period [1985, 1989] of the new research area “astroparticle physics”, emerging at the intersection of particle physics, astronomy and astrophysics³⁸, and which mainly combines the knowledge from fields EPF and GAA. As an evidence of this rapid development, notice that even a new journal named “Astroparticle Physics” was established in 1992. Moreover, the fact that the weights of the internal flows in GAA are nearly three times larger than those in EPF, can be due to the Hubble Space Telescope, one of the major scientific breakthroughs in field GAA. The telescope is one of the largest and most productive scientific research tool for astronomy, and it was indeed launched in 1990 (within the period of interest in the anti-diagonal line), greatly promoting the development of astronomy in GAA. We have further examined the evolution of internal flows for the remaining six fields and found similar patterns (see Supplementary Information (SI)).

The evolution of knowledge flows across fields

Examining how the discoveries in a field have contributed to a different field of physics is even more important than studying the flows of knowledge within a given field. In order to get an overall picture of the existing influences across different fields of modern physics, we report in Fig. 4a the average weights of knowledge flows between each couple of fields over the whole period under study. To highlight the mutual exchange of flows, the results are shown in a ($\overline{w}_{\alpha \rightarrow \beta }$)-($\overline{w}_{\beta \rightarrow \alpha }$) plane. Each point refers to a pair of fields, and the distance from the position of the point to the bisector (red line) measures the level of asymmetry in the exchange of knowledge between the two fields. We notice that most of the points are concentrated around the bisector, especially those points in the lower-left corner corresponding to pairs of fields with small significance weights. However, there are also points far from the line, such as the point corresponding to the pair GPE and ATM (red up-triangle in the lower right of the panel), indicating asymmetric transfers of knowledge between two fields.

To investigate the temporal evolution in the exchange of knowledge between two fields in Fig. 4b–e, we have considered the same types of plots over time. In such a case, each pair of fields corresponds to a trajectory joining the points corresponding to the different years from 1990 to 2015. The colors of symbols from light to dark indicates the years from past to the most recent. Although the significance of the links in general decreases over time, the temporal patterns can vary from one pair of fields to another. The four panels illustrate the four major classes of behaviour (modes) we have found, namely: absorbing, absorbing to mutual, back-nurture and mutual mode. Absorbing mode can be seen in field GPE, which has absorbed more knowledge from fields ATM and EOA throughout the whole period under study (Fig. 4b). Fields GEN and EPF show a similar behavior as GPE in the beginning, absorbing more knowledge from GAA, while in the last few years, GEN and EPF tend to mutually exchange knowledge with GAA, although the weights on the links in both directions become less significant (Fig. 4c). More interestingly, we also find a back-nurture mode as shown in Fig. 4d. Field GEN at first absorbs more knowledge from EOA than what it provides to EOA, but later the situation is inverted. Finally, fields IPR and CM1 shows another pattern, the mutual mode, indicating that they have exchanged knowledge in an almost symmetric way over the whole period. Similar evolution modes have also been seen in the remaining six fields (see SI). These different evolution patterns clearly demonstrate that the processes of knowledge creation and transfer across fields can be highly heterogeneous.

Discussion

Knowledge sharing and transfer across scientific disciplines, and cross-fertilization are increasingly recognized as crucial factors to breakthrough innovation in science^12,39,40. The temporal network approach we have proposed in this article can be useful to shed lights on the evolution of knowledge within a field and on the dynamic patterns of influences between different fields. Our study case application has shown that major developments in physics can influence a field for many decades and can even trigger knowledge production in other fields. Indeed, the patterns of cross-fertilization vary greatly among the different disciplines of physics and can also show marked transitions over time. For instance, the physics of gases and plasmas has consistently absorbed knowledge from atomic and molecular physics and from electromagnetism over the last 3 decades. Other fields such as condensed matter and interdisciplinary physics have instead always shared and mutually exchanged knowledge. Finally, we have revealed interesting transitions from absorbing to mutual modes, for instance in the case of the physics of elementary particles, a field of physics that has initially been strongly influenced by astronomy and astrophysics, but in the new century has also contributed to the progress of these latter disciplines. Our findings not only shed new lights on the basic laws governing the development of scientific fields, but can also have practical implications on the future development of economic policies and research strategies.

Methods

Data

The data set contains 435, 717 articles published by the American Physical Society (APS) from year 1985 until the end of 2015. Publication date, Physics and Astronomy Classification Scheme (PACS) codes, and bibliography have been extracted for each article. The PACS codes are grouped into a five-level hierarchy and each of them indicates a very specific field of physics. As an example, the PACS code $\textit{64.60.aq}$, indicating the field ”Networks”, belongs to the broader field ”Equations of state, phase equilibria, and phase transitions” (PACS 64) and further belongs to top-level field ”Condensed Matter: Structural, Mechanical and Thermal Properties” (PACS 60). Here, we consider the PACS codes at the highest level, which classify the physics into ten main fields (Table 1). Each article is associated with up to four PACS codes. With regard to the bibliography, only the citations referring to articles published in the APS journals were considered.

Null model and statistically significant networks

To characterize the flow of knowledge across fields and at different time periods, the statistical significance of each contribution has been validated with respect to an appropriately chosen null model. For each couple of fields all the citations from papers published in citing year t to papers published in cited year $t-n$ have been considered. Let ${\mathrm{X}}_{t}^{\mathrm{citing}}$ be the field of citing papers published in year t, and ${\mathrm{Y}}_{t-n}^{\mathrm{cited}}$ the field of cited papers published in year $t-n$. We indicate as $P (\alpha _{t-n}, \beta _{t}) = {\mathrm{Pr}}({\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha , {\mathrm{X}}_{t}^{\mathrm{citing}}=\beta )$ the joint probability that papers published in year t in field $\beta$ cite papers published in year $t-n$ in field $\alpha$. Such a probability can be written as:

$$P(\alpha _{t-n}, \beta _{t})= {\mathrm{Pr}}(\mathrm{Y}_{t-n}^{\mathrm{cited}}=\alpha | {\mathrm{X}}_{t}^{\mathrm{citing}}=\beta ) \times {\mathrm{Pr}}({\mathrm{X}}_{t}^{\mathrm{citing}}=\beta )$$

(4)

where ${\mathrm{Pr}}({\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha | {\mathrm{X}}_{t}^{\mathrm{citing}}=\beta )$ is the conditional probability of ${\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha$ given that ${\mathrm{X}}_{t}^{\mathrm{citing}}=\beta$, and ${\mathrm{Pr}}({\mathrm{X}}_{t}^{\mathrm{citing}}=\beta )$ is the marginal probability. We then consider a null model in which the papers published in year t in field ${\mathrm{X}}$ randomly select papers published in year $t-n$ as their citations, regardless of which fields they belong to. Hence, the joint probabilities in the null model can be written in terms of the marginal probabilities as:

$$\begin{aligned} P^{\mathrm{rand}}(\alpha _{t-n}, \beta _{t})={\mathrm{Pr}}({\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha ) \cdot {\mathrm{Pr}}({\mathrm{X}}_{t}^{\mathrm{citing}}=\beta ) \end{aligned}$$

(5)

By calculating the ratio $\phi (\alpha _{t-n}, \beta _{t})$ between the two probabilities in Eqs. (4) and (5):

$$\phi (\alpha _{t-n}, \beta _{t})= \frac{ {\mathrm{Pr}}({\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha |{\mathrm{X}}_{t}^{\mathrm{citing}}=\beta )}{{\mathrm{Pr}}({\mathrm{Y}}_{t-n}^{\mathrm{cited}}=\alpha )}$$

(6)

we were able to quantify how the observed flows of knowledge deviate from the flows expected to arise simply from random choices. A value $\phi (\alpha _{t-n},\beta _{t})=1$ has been adopted as critical threshold to distinguish whether the knowledge flow from field $\alpha$ to field $\beta$ is statistically significant, with $\phi (\alpha _{t-n},\beta _{t})>1$ indicating that field $\beta$ in year t is more likely to have extracted knowledge from field $\alpha$ in year $t-n$ than would be expected at random.

Network analysis

To characterize the networks of knowledge flow we have evaluated the network reciprocity and we have performed a motif analysis.

(1)
Reciprocity. For each year t we have computed the reciprocity coefficient⁴¹ of the knowledge flow network as:
$$\begin{aligned} \rho _t=\frac{\sum \limits _{\alpha \ne \beta } (w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}-\overline{w}_{\Delta t' \rightarrow t})(w_{\beta \rightarrow \alpha }^{\Delta t' \rightarrow t}-\overline{w}_{\Delta t' \rightarrow t})}{\sum \limits _{\alpha \ne \beta }(w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}-\overline{w}_{\Delta t' \rightarrow t})^2} \end{aligned}$$
(7)
where the average value $\overline{w}_{\Delta t' \rightarrow t} \equiv \sum _{\alpha \ne \beta }w_{\alpha \rightarrow \beta }^{\Delta t' \rightarrow t}/N(N-1)$ indicates the mean of the link weights and $\Delta t'=[1, 5]$. The reciprocity coefficient $\rho _{t}$ ranges from $-1$ to 1, and allows us to distinguish between antireciprocal ($\rho _t<0$) and reciprocal ($\rho _t>0$) networks. We have only considered in the calculation the top half pairs of mutual links with the largest weights.
(2)
Motifs. In this study, we focus on three-node motif analysis³⁶. There are 13 different possible connected subgraphs of three nodes. In order to measure the statistical significance of each subgraph g, we have computed the Z-score $Z_g$ defined as
$$\begin{aligned} Z_{g} = (N_{g}-\langle N^{rand}_{g} \rangle )/\sigma _{g} \end{aligned}$$
(8)
where $N_{g}$ is the number of times subgraph g appears in the network, and $\langle N^{rand}_{g} \rangle$ and $\sigma _{g}$ are respectively average and standard deviation of the number of times subgraph g occurs in an ensemble of randomized graphs with the same degree distribution as the original network. For each year, we have generated an ensemble of 5000 different randomized samples of the original network using the configuration model.

References

Sekara, V. et al. The chaperone effect in scientific publishing. Proc. Natl. Acad. Sci. USA 115, 12603–12607 (2018).
Article ADS CAS PubMed Google Scholar
Li, W., Aste, T., Caccioli, F. & Livan, G. Early coauthorship with top scientists predicts success in academic careers. Nat. Commun. 10, 1–9 (2019).
Article Google Scholar
Monechi, B., Pullano, G. & Loreto, V. Efficient team structures in an open-ended cooperative creativity experiment. Proc. Natl. Acad. Sci. USA 116, 22088–22093 (2019).
Article ADS CAS PubMed Google Scholar
Armano, G. & Javarone, M. A. The beneficial role of mobility for the emergence of innovation. Sci. Rep. 7, 1781 (2017).
Article ADS PubMed PubMed Central Google Scholar
Milojević, S., Radicchi, F. & Walsh, J. P. Changing demographics of scientific careers: the rise of the temporary workforce. Proc. Natl. Acad. Sci. USA 115, 12616–12623 (2018).
Article PubMed Google Scholar
Clauset, A., Arbesman, S. & Larremore, D. B. Systematic inequality and hierarchy in faculty hiring networks. Sci. Adv. 1, e1400005 (2015).
Article ADS PubMed PubMed Central Google Scholar
Gargiulo, F. & Carletti, T. Driving forces of researchers mobility. Sci. Rep. 4, 4860 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Deville, P. et al. Career on the move: geography, stratification, and scientific impact. Sci. Rep. 4, 4770 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ma, A., Mondragón, R. J. & Latora, V. Anatomy of funded research in science. Proc. Natl. Acad. Sci. USA 112, 14760–14765 (2015).
Article ADS CAS PubMed Google Scholar
Van Noorden, R. Interdisciplinary research by the numbers. Nature 525, 306–307 (2015).
Article ADS PubMed Google Scholar
Sinatra, R., Deville, P., Szell, M., Wang, D. & Barabási, A.-L. A century of physics. Nat. Phys. 11, 791 (2015).
Article CAS Google Scholar
Battiston, F. et al. Taking census of physics. Nat. Rev. Phys. 1, 89–97 (2019).
Article Google Scholar
Bhagat, R. S., Kedia, B. L., Harveston, P. D. & Triandis, H. C. Cultural variations in the cross-border transfer of organizational knowledge: an integrative framework. Acad. Manag. Rev. 27, 204–221 (2002).
Article Google Scholar
Chen, J., Sun, P. Y. & McQueen, R. J. The impact of national cultures on structured knowledge transfer. J. Knowl. Manag. 14, 228–242 (2010).
Article Google Scholar
Bell, G. G. & Zaheer, A. Geography, networks, and knowledge flow. Organ. Sci. 18, 955–972 (2007).
Article Google Scholar
Sorenson, O., Rivkin, J. W. & Fleming, L. Complexity, networks and knowledge flow. Res. Policy 35, 994–1017 (2006).
Article Google Scholar
Agrawal, A., Kapur, D. & McHale, J. How do spatial and social proximity influence knowledge flows? Evidence from patent data. J. Urban Econ. 64, 258–269 (2008).
Article Google Scholar
Meyer, M. Tracing knowledge flows in innovation systems. Scientometrics 54, 193–212 (2002).
Article Google Scholar
Acemoglu, D., Akcigit, U. & Kerr, W. R. Innovation network. Proc. Natl. Acad. Sci. USA 113, 11483–11488 (2016).
Article CAS PubMed Google Scholar
Zeng, A. et al. The science of science: from the perspective of complex systems. Phys. Rep. 714, 1–73 (2017).
Article ADS MathSciNet MATH Google Scholar
Zhang, Q., Perra, N., Gonçalves, B., Ciulla, F. & Vespignani, A. Characterizing scientific production and consumption in physics. Sci. Rep. 3, 1640 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Börner, K., Penumarthy, S., Meiss, M. & Ke, W. Mapping the diffusion of scholarly knowledge among major us research institutions. Scientometrics 68, 415–426 (2006).
Article Google Scholar
Zhuge, H. A knowledge flow model for peer-to-peer team knowledge sharing and management. Expert Syst. Appl. 23, 23–30 (2002).
Article Google Scholar
Yan, E. Disciplinary knowledge production and diffusion in science. J. Assoc. Inf. Sci. Technol. 67, 2223–2245 (2016).
Article CAS Google Scholar
Perc, M. Self-organization of progress across the century of physics. Sci. Rep. 3, 1–5 (2013).
Article Google Scholar
Shen, Z. et al. Interrelations among scientific fields and their relative influences revealed by an input–output analysis. J. Informetr. 10, 82–97 (2016).
Article Google Scholar
Latora, V., Nicosia, V. & Russo, G. Complex Networks: Principles, Methods and Applications (Cambridge University Press, Cambridge, 2017).
Book MATH Google Scholar
Newman, M. Networks (Oxford University Press, Oxford, 2018).
Book MATH Google Scholar
Pan, R. K., Sinha, S., Kaski, K. & Saramäki, J. The evolution of interdisciplinarity in physics research. Sci. Rep. 2, 551 (2012).
Article ADS PubMed PubMed Central Google Scholar
Bonaventura, M., Latora, V., Nicosia, V. & Panzarasa, P. The advantages of interdisciplinarity in modern science. arXiv:1712.07910 (2017).
Pluchino, A. et al. Exploring the role of interdisciplinarity in physics: success, talent and luck. PLoS ONE 14, e0218793 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tria, F., Loreto, V., Servedio, V. D. P. & Strogatz, S. H. The dynamics of correlated novelties. Sci. Rep. 4, 5890 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Iacopini, I., Milojević, S. C. V. & Latora, V. Network dynamics of innovation processes. Phys. Rev. Lett. 120, 048301 (2018).
Article ADS CAS PubMed Google Scholar
Chinazzi, M., Gonçalves, B., Zhang, Q. & Vespignani, A. Mapping the physics research space: a machine learning approach. EPJ Data Sci. 8, 33 (2019).
Article Google Scholar
Zhu, H., Wang, X. & Zhu, J.-Y. Effect of aging on network structure. Phys. Rev. E 68, 056121 (2003).
Article ADS Google Scholar
Milo, R. et al. Network motifs: simple building blocks of complex networks. Science 298, 824–827 (2002).
Article ADS CAS PubMed Google Scholar
Tahamtan, I., Afshar, A. S. & Ahamdzadeh, K. Factors affecting number of citations: a comprehensive review of the literature. Scientometrics 107, 1195–1225 (2016).
Article Google Scholar
Cirkel-Bartelt, V. History of astroparticle physics and its components. Living Rev. Relativ. 11, 2 (2008).
Article ADS PubMed PubMed Central MATH Google Scholar
Rinia, E. J., Van Leeuwen, T. N., Bruins, E. E., Van Vuren, H. G. & Van Raan, A. F. Measuring knowledge transfer between fields of science. Scientometrics 54, 347–362 (2002).
Article Google Scholar
Phene, A., Fladmoe-Lindquist, K. & Marsh, L. Breakthrough innovations in the us biotechnology industry: the effects of technological space and geographic origin. Strat. Manag. J. 27, 369–388 (2006).
Article Google Scholar
Garlaschelli, D. & Loffredo, M. I. Patterns of link reciprocity in directed networks. Phys. Rev. Lett. 93, 268701 (2004).
Article ADS PubMed Google Scholar

Download references

Acknowledgements

This work was funded by the Leverhulme Trust Research Fellowship “CREATE: the network components of creativity and success” and EPSRC grant EP/N013492/1.

Author information

Authors and Affiliations

School of Mathematical Sciences, Queen Mary University of London, London, E1 4NS, UK
Ye Sun & Vito Latora
Dipartimento di Fisica ed Astronomia, Università di Catania and INFN, 95123, Catania, Italy
Vito Latora
The Alan Turing Institute, The British Library, London, NW1 2DB, UK
Vito Latora
Complexity Science Hub Vienna (CSHV), Vienna, Austria
Vito Latora

Authors

Ye Sun
View author publications
You can also search for this author in PubMed Google Scholar
Vito Latora
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.S. and V.L. designed research; Y.S. performed research; Y.S. and V.L. analyzed data; and Y.S. and V.L. wrote the paper.

Corresponding author

Correspondence to Vito Latora.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sun, Y., Latora, V. The evolution of knowledge within and across fields in modern physics. Sci Rep 10, 12097 (2020). https://doi.org/10.1038/s41598-020-68774-w

Download citation

Received: 17 February 2020
Accepted: 24 June 2020
Published: 21 July 2020
DOI: https://doi.org/10.1038/s41598-020-68774-w

This article is cited by

Dynamics and characteristics of interdisciplinary research in scientific breakthroughs: case studies of Nobel-winning research in the past 120 years
- Jingjing Ren
- Fang Wang
- Minglu Li
Scientometrics (2023)
Interdisciplinary researchers attain better long-term funding performance
- Ye Sun
- Giacomo Livan
- Vito Latora
Communications Physics (2021)
Evolution and transformation of early modern cosmological knowledge: a network study
- Maryam Zamani
- Alejandro Tejedor
- Holger Kantz
Scientific Reports (2020)
Knowledge and social relatedness shape research portfolio diversification
- Giorgio Tripodi
- Francesca Chiaromonte
- Fabrizio Lillo
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.