Introduction

The extraordinary growth of a scientific topic occurs when there is a period of unexpected and abnormally large growth in a topic’s impact or size1,2,3. Classical studies of extraordinary growth focus on the specifics of illustrative cases, such as the Copernican revolution and its associated changes in astronomical topics’ productivity and impact on scientific thinking1,4. Newly-available, large-scale data on nearly all scientific topics enables the study of periods of extraordinary growth from a statistical perspective. Here we investigate generalizable factors associated with the onset and magnitude of the extraordinary growth of scientific topics using a large and diverse sample of scientific topics2,5,6,7.

We investigated the statistical dynamics around a possible correlate of the onset of a scientific topic’s extraordinary growth: a topic’s association with a scientific prize8,9. Scientific prizes were originally restricted to certain scientific disciplines and topics, but they have proliferated and are now awarded in nearly all disciplines3,9. Prize research has primarily studied how awards change prizewinners’ careers10,11,12,13,14,15,16 by recognizing17,18, celebrating9,18,19, and changing perceptions of a scholar’s work2,20,21. For example, case studies of Howard Hughes Medical Investigators and John Bates Clark and Fields medalists have shown that prizewinners’ papers published before their prize is conferred gain citations significantly faster than expected10,11,12,14,15,22 after the prize and that winning one prize increases the probability of the same scholar winning future prizes9.

It is uncertain whether the link between prizes and unexpected growth for a single prizewinner’s work extends to changes in the growth of an entire topic, and current theoretical arguments and empirical work are nascent. On the one hand, scholars have argued that the increased interest in a prizewinning scientist’s work may likewise increase interest in the topic associated with a prize11,23,24. On the other hand, scholars have claimed that prizes signal that the best work associated with a topic has been done, which would lessen rather than an expand interest in a topic11,25. An impression reinforced by the fact that prizewinners tend to move onto new topics after winning a prize10,12 and the papers by matched contenders for the prize are concommitently cited less than expected11,12.

As a first step toward understanding the possible links between prizes and a topic’s extraordinary growth, we investigated whether prizes are statistically associated with the onset of a period of abnormal growth in a topic’s productivity, impact, and the migration of scientists into and out of the topic. Our analysis uses new data on hundreds of recognized scientific prizes worldwide9 and longitudinal data on over 10,000 scientific topics.

From various sources, we collected data on 405 scientific prizes conferred 2900 times between 1970 and 2007 with respect to 11,539 scientific topics in 19 disciplines. Scientific prize data was collected from Wikipedia pages on prizes and scientists. The prizes in our sample include awards like the Wolf Prize and Turing Prize, as well as hundreds of other prizes recognized on Wikipedia’s “scientific prizes” page9,16. To validate the Wikipedia data, we manually cross-checked it with prize-related data on dedicated webpages and in print media. Figure 1a illustrates how prizes and topics are linked. Prizes were linked to topics by associating the prizewinning scientist with the topics to which they are considered to have made meaningful contributions, also known as their “known-for” topics. We defined known-for topics empirically and via crowdsourcing. Empirically, known-for topics are the topics on which a scientist has published 10 or more total papers. We cross-validated this operationalization using Wikipedia’s “Known-For” dataset, which lists a scientist’s known-for topics based on the crowdsourced opinions of scientists and Wikipedia users (see Methods section for details). Consistent with our 10-paper threshold, we found that a scholar’s Wikipedia “Known-For” topics were those topics scholars had 10 papers on average (see Supplementary Fig. S1 for alternative thresholds and robustness checks).

Fig. 1: Procedure for defining prizewinning topics.
figure 1

a Our procedure has three steps. First, we collected a large sample of over 400 recognized prizes recorded in Wikipedia. Second, we linked prizes to prizewinners using the prize’s homepage and the prizewinner’s Wikipedia page. Third, we linked topics to prizes via the prizewinner’s Known For (KF) topics. KF topics are the subset of topics that a researcher is known for out of all the topics a researcher has worked on. We defined a researcher’s KF topics as those topics on which the researcher had published 10 papers or more, which was cross-validated with Wikipedia’s KF topics pages. Wikipedia’s KF topic’s page uses crowdsourcing to identify a researcher’s KF topics (Robustness Check, Supplementary Fig. S1). For any known-for topic associated with a prize, we used MAG to verify that the prizewinner had publications on the known-for topic that predated the year of the associated prize. Each prizewinning topic was then matched with five non-prizewinning topics that had statistically indistinguishable year-to-year growth from the prizewinning topic during the 10-year period prior to the prizewinning year as explained in detail in the main text. b Δt measures the difference between the log of the average growth of prizewinning topics minus the log of the average growth of matched topics after the prize year. The inset shows Δt as the relative percentage growth of the prizewinning topic compared to the matched topic group, hence the time prior to prize year, t = 0 is flat. Δt is measured and reported separately for 6 growth measures of a topic as described in the text. Gray panel represents the time before the prize year. Illustration images are from www.pxfuel.com and www.pikpng.com.

Scientific topic data comes from Microsoft Academic Graph (MAG). MAG covers over 172 million publications by 209 million authors in 48,000 journals from 1800 to 2017. It uses crowdsourcing, AI, and NLP to generate meaningful and plausible classifications of scientific topics in ways that integrate human expert observations and empirical replicability. MAG defines the universe of scientific topics using Wikipedia article pages that are classified as being on “scientific topics”; these pages are created and updated by scientists and users through crowdsourcing. MAG then uses NLP and AI to assign research publications to topics by associating a paper’s text (not just keywords) with the linguistic content of Wikipedia’s scientific topics pages26. As a validity check, MAG uses statistical sampling to verify the face validity of the classification of papers to topics27,28 (see SI. Sec.1 for details).

To test whether a prizewinning topic’s growth after the prize is awarded is significantly greater than expected, i.e., “extraordinary,” we used a difference-in-differences (DID) regression design. DID regression tests for the effects of randomly assigned treatments as well as non-causal tests of statistical significance, in which one group experiences an event that the other group does not experience11,12,13,29. Following prior prize research11,12,13, we used DID to test for non-causal statistical relationships between prizewinning and a topic’s extraordinary growth. Also, following the literature, we combined DID with Dynamic Optical Matching (DOM)30,31,32,33. DOM is a longitudinal case control matching procedure that identifies a matched group of non-prizewinning topics. (The Method section reports measurement details and the formal regression equation.)

Using DOM30,31,32,33, we identified five non-prizewinning topics that had growth patterns statistically equivalent to prizewinning topics before the prize year on six growth criteria. Three growth criteria broadly represented a topic’s impact: (a) productivity, (b) number of citations, and (c) number citations of the topic’s leading scientists. Three criteria represent the movement of scientists into and out of a topic: (a) number of incumbent scientists, (b) number of new entrants, and (c) number of star scientists from outside the topic26,34,35,36,37,38. To ensure that prizewinning topics and their matched topics have yearly parallel growth before the prize year, prizewinning and matched topics had to be from the same discipline and had to have statistically indistinguishable growth patterns on all six measures, each year, for 10 years before the prize39,40.

“Productivity” measures a topic’s yearly number of publications and relates to a topic’s output, resources, and publication norms41. “Citations” measures a topic’s yearly number of citations and captures a topic’s impact42. Citation growth was measured on an absolute and per capita basis. “Impact of Topic’s Leading Scientists” measures the mean total citations of the upper 5% of scientists working on a topic and captures the impact of a topic’s intellectual leaders. To measure a topic’s size36, we created the variable “#Incumbents,” which measures a topic’s yearly number of continuing scientists, and the variable “#Entrants,” which measures the yearly number of scientists who publish for the first time on a topic. “#Disciplinary Stars” measures the number of scientists working on a topic that are among the 5% most cited scientists in the topic’s discipline43. The last three variables ensure that matched topics and prizewinning topics have equivalently eminent scientists and likely prizewinners12,44. Supplementary Figure S4 demonstrates that prizewinning and matched topics have no statistical differences on all matching variables prior to the prize year (11 years × 6 measures = 66 tests, all p-values > 0.2).

To test for a statistical relationship between prizes and extraordinary growth, we compared the post-prize growth in prizewinning topics at time t against the average growth rate of the matched topics at time t. We refer to this difference in growth as Δt, formally expressed as:

$${\Delta }_{t}={{{{{\rm{log }}}}}}\left(Y_{t}\right)-{{{{{\rm{log }}}}}}\left(\widetilde{{Y}_{t}}\right),$$
(1)

Yt is the prizewinning topic’s growth at time t and \(\widetilde{{Y}_{t}}\) is the same quantity based on the average growth of the matched topic group (see Fig. 1b for an illustration). Time t’s range is −10 to +10. A Δt = 0.0 indicates no difference in the growth of the prizewinning and matched groups. Time t represents actual time in years (i.e., no rescaling). Extraordinary growth occurs when Δt is statistically and significantly different from zero. We analyze the growth dynamics associated with a topic’s first prize and consider a prize to be correlated with the onset of extraordinary growth if growth starts shortly after the prize year. The SI reports confirmatory robustness checks for alternative measures of Δt (Supplementary Fig. S2).

Results

Prizewinning is strongly and positively related to and extraordinary growth. Relative to matched topics, prizewinning topics have unexpected and significant increases in growth on all six measures of growth. Table 1 shows the DID regression results for all six growth measures net of controls. Prizewinning (β1) is a binary variable equal to one or zero for prizewinning and matched topics, respectively. The estimated β1s all have insignificant p-values greater than 0.05, indicating that prizewinning and matched topics have no statistically significant growth differences before the prizewinning event on all six measures of growth. Post (β2) is a binary variable equal to one for the 10-year period after the prize year and equal to zero before the prize year. Prizewinning*post (β3) is the interaction term. β2 and β3 are significant for all six growth measures (all p-values < 0.001), demonstrating that prizewinning topics grow unexpectedly larger after the prize is awarded relative to matched topics.

Table 1 DID regression of a topic’s comparative post-prizewinning growth on six measures.

Figure 2a–f plot the average magnitudes of extraordinary growth \(\Delta_{t} (e^{{\varDelta }_{t}}-1=({Y}_{t}-\widetilde{{Y}_{t}})/\widetilde{{Y}_{t}})\) for all six growth measures. Notably, extraordinary growth begins the year following the prize and continues for at least the next 10 years. At 5 years after the prize, prizewinning topics have grown an average 17–30% larger than matched topics depending on the growth measure (\({\Delta }_{5},\)all p-values < 0.0001). At 10 years after the prize, the growth gap increases to 25–55% depending on the growth measure (Δ10, all p-values < 0.0001).

Fig. 2: Scientific prizes and extraordinary growth.
figure 2

Panels a through f show the differences in growth rates of prizewinning and matched topics for 10 years prior to and 10 years after the prize year in relation to a productivity, b citation, c topic’s lead scientist impact, d #incumbent scientists, e #entrants, and f #disciplinary stars. Statistically significant growth differentials between prizewinning (gold line) and matched topics (flat black line) begin shortly after the prize (vertical line) and continue yearly following the prize (95% CIs shown as dashed lines). At 10 years, the growth rates of prizewinning topics exceed matched topics by 25–55% depending on the growth variable (two-tailed t-test, p < 0.001. p-value ranges from 0 to \({1.56\times 10}^{-139}\)). Prizewinning and non-prizewinning matched topic groups had not growth differences for 10 years prior to the prize (two-tailed t-test, p > 0.05 for all 6 × 11 = 66 tests, p-values range from 0.1438 to 0.9310).

In comparing the overall impact of prizewinning topics to matched topics, we observe large and consistent growth differences. Prizewinning topics display strong post-prize extraordinary growth in productivity and citation impact. At 10 years after the prize, prizewinning topics are 39.8% more productive in terms of the number of publications (\({\Delta }_{10}=0.3351\), \({e}^{{\varDelta }_{10}}-1=0.3981\)) than matched topics (Fig. 2a). Figure 2b and c show changes in citation impact. Prizewinning topics have 32.6% more yearly citations (\({\Delta }_{10}\) = 0.2825, \({e}^{{\varDelta }_{10}}-1=0.\)3264) and experience a 7.75% average increase in per capita citations per paper at year 10 over matched topics (Supplementary Fig. S3), indicating that the increase in citation impact holds on a per scientist basis as well. Lastly, the citation impact of leading scientists within the topic is 25% greater than the impact of leading scientists working on non-prizewinning matched topics (\(\Delta_{10} = 0.2232, {e}^{{\varDelta }_{10}} - 1 = 0.2500\), Fig. 2c).

The migration of scientists into and out of a topic is an indicator of a topic’s research attractiveness. Prizewinning topics retain significantly more incumbents (Fig. 2d). After the prize year, incumbent scientists continue to routinely publish on that topic at a rate that is 54.8% (\({\Delta }_{10}=0.4366,{e}^{{\Delta }_{10}}-1=0.5475\)) higher than incumbent scientists publishing in matched topics.

New entrants enter prizewinning topics in abnormally high numbers. Prizewinning topics gain over 36.7% more new entrants on average than do matched topics (\({\Delta }_{10}=0.3129\), \({e}^{{\varDelta }_{10}}-1=0.3673\), Fig. 2e). About half of new entrants (46.3%) are rookie scientists who make their first publication on the prizewinning topic. This indicates that prizewinning topics attract young scholars who often make longer-term research commitments45.

Star scientists are at the other end of their career stage relative to rookie scientists. Star scientists are the 5% most highly-cited scholars in their discipline (physics, chemistry, sociology, etc.)46. Star scientists move into prizewinning topics in larger numbers than into matched topics. Counting star scientists working on prizewinning and matched topics before and after the prize, we found that prize prizewinning topics gain over 47% more star scientists from across the whole discipline than do matched topics (\({\Delta }_{10}=0.3878,{e}^{{\Delta }_{10}}-1=0.4737\), Fig. 2f).

Finally, we tested whether extraordinary growth is associated with a topic’s paradigmatic diversification, a relationship hypothesized in the literature1. We created a master list of the topics that new entrants into prizewinning and matched topic groups had published before the prize year. We used Shannon Entropy to quantify the topic diversity of the lists (see SI). The findings suggest that extraordinary growth positively correlates with paradigmatic diversification. As the \({\Delta }_{10}\) of topics increase, the paradigmatic diversification of the prizewinning topic grows systematically greater than the paradigmatic diversification of the matched topic group. For example, for topics with a Δ10 value of 1.5, the prizewinning topic group’s paradigmatic diversity is 11.6% greater than the matched topic group (see Supplementary Fig. S7 for all change comparisons).

Funding and extraordinary growth

Does funding explain the results? Funding can provide resources that may affect a topic’s growth and impact47. Also, funding agencies explicitly ration their funds to the most promising research topics48,49, which implies that topics that receive grants are seen by funders as having special growth potential. To examine whether funding differences between prizewinning and matched topics explain the extraordinary growth of prizewinning topics, we collected funding data on the subset of our data for which it is available. The NIH publishes funding data in the form of a public list of all papers funded by a specific grant. Using this list, we found that a subsample of 2853 out of the 11,539 prizewinning topics in our full sample received NIH funding from 1985 to 2005. For the sample of 2853 topics, we used DOM to create a new matched group of five non-prizewinning topics for each prizewinning topic, just as we did in the main analysis to ensure that the prizewinning and matched topic groups were balanced and met the parallel trends criteria. In this subsample, we used the same six matching criteria as in the main sample. Seventy-six percent of the matched topics turned out to also be NIH grant recipients, which makes sense, since funding should correlate with our six matching criteria.

To further validate the preceding analysis, we conducted a second analysis with a different subsample. In the second subsample, we matched prizewinning NIH funded topics based on the six criteria used in the main analysis plus a seventh criterion: the matched topics had to have received NIH funding. The results replicate the funding analysis shown above and the main findings, further demonstrating that funding differences do not explain the extraordinary growth of prizewinning topics (please see Supplementary Information, Supplementary Fig. S8, and Table S13).

Based on the above findings, we concluded that research funding appears to be uncorrelated with a topic’s extraordinary growth, at least within this subsample of NIH projects. First, prizewinning topics have equivalent or slightly less NIH funding before the prize than matched topics (Fig. 3a–c). Second, the level of NIH funding of prizewinning topics is the same before and after prize year (Fig. 3d). Third, consistent with the main results, after accounting for funding levels of prizewinning and matched topics, we find that the onset of the extraordinary growth of prizewinning topics begins the first year after the prize year (Fig. 3e–j).

Fig. 3: Funding and extraordinary growth are statistically unrelated in NIH subsample.
figure 3

The analysis examines whether funding plays a role in the extraordinary growth of prizewinning topics. In the analysis, we find a subset of 2853 prizewinning topics that were matched on six measures of growth and the receipt of NIH funding, which is publicly available. Repeating the main analysis that was done on the full sample of over 11,000 topics, we found that funding does not explain the extraordinary growth of prizewinning topics. First, ac shows that prizewinning have statistically equivalent or less funding before the prize than matched topics. Second, d shows that the normalized number of NIH grants is largely flat before (gray box) and after (gold box) the prizewinning event for the prizewinning topics. The center line of the box plot is the median of the normalized grants, box limits correspond to the data’s first and third quartiles, notches represent 95% CI, and violin plots represent the data’s distribution. Third, plots ej show that prizewinning topics grow relatively larger than expected after the prize year consistent with the main analysis with the full sample.

Topic-by-topic generalizability and placebo tests

Our main findings compare differences in post-prize growth for prizewinning and matched topic groups. Here, we examined the growth of each prizewinning topics (N = 11,539) and their five matched topics separately. After the prize, 60% of the prizewinning topics show growth larger than that of their specific five matched topics (binomial test, all p-values < 0.001). This finding indicates that the results are not driven by outlier topics and are generalizable (see Supplementary Tab. S1). The binomial tests were reinforced by placebo tests40,50,51. The placebo test examines whether the matched topics showed abnormal growth following the prize year of the prizewinning topic (even if they grew slower than prizewinning topics). Supplementary Figure S5 shows that matched topics have no coincidental extraordinary growth (all p-values > 0.05), reinforcing our main findingprizewinning is associated with the onset of a topic’s sustained period of unexpected and extraordinary growth.

Prize characteristics and the post-prize magnitude of extraordinary growth

A prize’s characteristics and a topic’s growth dynamics may be interrelated9,24. For example, prizes have been explicitly established for specific fields or with money to send “a symbolic message to the general public that perhaps science and scientists really mattered24”. We researched three features of prizes that appear broadly in prize data: money, discipline-specificity, and research recency18,19.

Moneyed prizes provide symbolic cultural value and dedicate valuable resources to a topic19,25. For example, the 2012 Fundamental Physics Prize, 2013 Tang Prize in Chinese Studies, and the Breakthrough Prize in the Life Science are monied prizes created to communicate to scientists and the public that “the best minds should make at least as much as any trader on Wall Street” and that science aims to “contribute to world development52.” Nevertheless, the impact of monetary prizes is unknown. On the one hand, prize money might raise perceptions of a topic’s importance, and on the other hand, extrinsic rewards can reduce intrinsic motivation or make little topic-wide difference if money goes to a single individual, not to science52. We operationalized prize money as a binary — money vs. no money variable (45% of prizes were coded as monied). We also created a three-category money variable defined as (a) no money, (b) money below the median, and (c) money above the median (see Supplementary Tab. S11), which confirmed the simpler binary variable reported below.

Discipline-specific prizes often have greater perceived within-discipline status than do general prizes. For instance, the Fields Medal in math is generally viewed as more prestigious than the general-science National Medal of Science Prize18,19. The discipline-specific variable takes on the value of one if at least 85% of all the winners of the prize come from the same discipline; zero otherwise (78% of prizes were coded as field-specific).

Though prizes can be given for recent or past research, most are generally awarded for contemporary work, not longstanding research; otherwise, many prizes would exclude younger scholars53. For example, the MacArthur Prize website acknowledges that the fellowship “…is not a reward for past accomplishment, but rather an investment in a person’s originality, insight, and potential.” To code a prize for its association with recent or past research, we counted the interevent time in years between a prize year and the first year a prizewinner worked on the prizewinning topic. We then created a distribution of inter-event times across all prizes.

To examine the link between prize characteristics and magnitude of extraordinary growth, we regressed the \({\Delta }_{10}\) of our six growth variables on money, discipline-specificity, and recency, along with control variables. Control variables include lagged values of each growth trend at times t-1, t-2, and t-3 years to account for autoregressive effects of \({\Delta }_{10}\). To account for differences in a prize’s visibility, we added control variables for the prize’s average Wikipedia page views during 2017, the number of past conferrals of a prize up to the prize year, and the prize’s age at the prize year. To control for multiple prize conferrals in the same year, we added a binary variable for whether a topic had multiple prize recipients. To control for the prizewinner’s scientific status, we added a binary variable for whether the prizewinner is a star scientist, a designation measured as being among a topic’s top 5% of cited authors. Fixed effects for discipline and year control differences that vary with the discipline and year of prizewinning. Supplementary Information presents details on variable measurements and descriptive statistics.

Figure 4 plots the raw data relationship between a prize’s features and Δ10 for our six growth variables. Prize characteristics significantly predict the magnitude of Δ10 in 17 out of 18 cases. The 17 positive tests are statistically significant and sometimes substantively large. The only null relationship out of the 18 tests indicates that prize money does not predict changes in the citation impact of a topic’s leading scientists.

Fig. 4: Prize characteristics predict the magnitude of extraordinary growth.
figure 4

a through c show the raw data relationship between a prize’s characteristics and the estimated magnitude of \({\Delta }_{10}\) on 6 growth measures (***p < 0.001; **p < 0.01; and *p < 0.05). The color dots with error bars represent the mean value and 95% CI, the center line of the black box plot is the median, box limits correspond to the data’s first and third quartiles, and violin shades represent the data’s distribution. Two-sided t-test are used in these analyses. The p-value for a ranges from \(2.70\times {10}^{-180}\) to \(5.30\times {10}^{-67}\), for c ranges from \(1.60\times {10}^{-25}\) to \(9.06\times {10}^{-9}\), and for b p-value ranges from 0.0017 to 0.013 except for lead scientist impact (p-value = 0.374). Sample sizes for topics associated with prizes with recency, money, and discipline specific are n = 5672 (46.73%), n = 6102 (50.27%), and n = 8812 (72.59%) respectively. df show the standardized coefficient from a regression of \({\Delta }_{10}\) on all three prize characteristics, indicating the significant and substantive association between prize characteristics and the magnitude of the expected level of extraordinary growth in a topic after it is associated with a prize. Control variables are lagged \({\Delta }_{t}\) at t = −1, −2, −3, discipline, year, prize visibility and reputation, and prizewinner status. Error bars demonstrate 95% CI. Please see Supplementary Tables S2S10 for details of regression analyses and further robustness checks.

The significant relationship between prize characteristics and Δ10 generalizes when controlling for confounds. Figure 4d–f report the standardized coefficients for all the predictor variables net of control variables. Of the three prize features, recency and specialization have the largest substantive associations with Δ10. For example, conditional on the topic being a prizewinner, a single standard deviation increase in recency is associated with a 13.8% increase in Δ10 of new scientists and a 14.6% increase in Δ10 of citations. Moneyed or field-specific prizes predict an increase in Δ10 of 1.9% or 5.3% in citation growth, respectively. These findings indicate that prizes are associated with extraordinary growth in a topic and that the magnitude of extraordinary growth is strongly predicted by a prize’s features. Supplementary Tables S2S10 report all regression-detailed estimates, robustness checks, and other fit statistics54.

Discussion

The rapid, unexpected growth of scientific topics have long held a reputation in science because of their connection to unexpected shifts in research efforts, technology developments, and individual scientific careers. Yet, most work has focused on intricate analyses of specific cases rather than on the statistical dynamics of extraordinarly growth. We conducted a large-scale, science-wide analysis of one factor that may be associated with the onset and magnitude of growth: scientific prizes. We found that scientific prizes predict unexpectedly large changes in the growth of a research topic.

Relative to matched, non-prizewinning topics with equivalent historical growth, disciplinary status, and demography, prizewinning topics are significantly more productive, higher impact, and attractive to incumbent, rookie, and star scientists. At 5-to-10 years after the prize year, prizewinning topics are over 30% more productive and are associated with significantly higher citation impact on average on a per capita basis. Also, we found that prize characteristics predict the magnitude of a topic’s unexpected and extraordinary growth. The magnitude of growth is greatest when the topic of research associated with the prize is recent, the prize is a discipline-specific rather than general science award, and the award is monied. Surprisingly, we found no evidence that funding is related to the extraordinary growth of prizewinning topics. These findings have implications for the study of the mechanisms behind the extraordinary growth of prizewinning scientific topics.

Possible mechanisms behind the extraordinary growth of scientific topics

While our work is among the first to look at how prizewinning relates to the growth of ideas rather than the careers of individual scientists, the mechanisms by which prizewinning plays a role in the abnormal growth of topics remains an important avenue of future research. We focused on the statistical links between prizes and the onset of extraordinary growth, which can provide a basis for examining mechanisms. Our findings relate to what Thomas Kuhn called the “essential tension between tradition and innovation”1,3,55. Kuhn argued that conservative investment and risk-taking are interconnected in scientific advances. Consistent with Kuhn, we found that prizewinning topics retain a base of incumbents who perhaps reinvest in the topic with a conservative orientation that takes current ideas further, while at the same time the new entrants attracted to prizewinning topics bring with them new viewpoints that promote a risk-taking orientation for experimentation.

The new points of view that entrants bring to the prizewinning topic may also be connected to Kuhn’s notion of paradigm shifts. Paradigm shifts have been identified in a few detailed case studies and focus on a topic’s revolutionary growth in size and impact. For example, Kuhn references work on the Copernican revolution that indicated that the earth-centric view of the solar system was replaced with sun-centric view of the solar system over a more than 100-year period. Our work suggests that a possible early source of a paradigm shift maybe the onset of a topic’s unexpected growth, which involves an influx of new scientists whose research expertize and diverse demographics provide the potential building blocks of new paradigms, a notion compatible with the teams, social network, and organizational literatures that have shown that new entrants with diverse backgrounds can transform thinking and stimulate innovation56,57,58,59,60,61.

Our findings also relate to Zuckerman’s prizes-as-signals argument62. In this theoretical argument, Zuckerman argued that prizes may act as signals to scientists that a prizewinning topic offers comparatively strong prospects for professional growth, or conversely, prizes signal the end of a period of growth by suggesting that a topic’s potential opportunities for professional recognition have run their course, leading scientists to move away from the prizewinning topic25.

Examining our findings in light of these crisscrossing arguments suggests conditional support for the claim that a prizewinning topic’s intellectual and professional attractiveness become more rather than less positive after the prize’s conferment. First, we found that the prize characteristics of money, disciplinary-specialization, and recency are positively related to the onset and magnitude of extraordinary growth. Second, we found that rookie and star scientists begin working on prizewinning topics in unexpectedly high numbers after the prize’s award. Nevertheless, a firm conclusion may have to wait for new data or emerging natural experiments that could shed more definitive light on these causal arguments and their contingencies.

Funding stands out as a possible driver of the growth of topics because funding decisions intentionally involve peer and expert reviews designed to steer researchers and resources to topics predicted to be promising research areas47,48,63. For example, special funding was dedicated boost rates of interdisciplinary research57,64,65. Regardless of the stated objectives of funding, Myers (2020) found that funding weakly induces scientists to change their research direction47. Consistent with Myers’ work, we found that available data on prizewinning and non-prizewinning topics do not differ in NIH grants received before the prize’s conferral, and funding remains flat for prizewinning topics after the prize, indicating that funding levels do not explain the association between prizes and a topic’s extraordinary growth. While this result is consistent with other research showing that funding does not materially change research agendas already in process47, the finding does rely only on NIH funding, which is a large and leading source of funding, but does not necessarily generalize to all types of funding or funding practices. Nevertheless, with the number of topics growing each year59 while funding levels have remain relatively flat suggests that the link between funding and extraordinary growth may be an especially important target for future research pursuits.

Future prize research

A central question in the scientific prize literature since its inception has been whether prizes “select” the highest quality research9,62. To date, the answer to this question has remained beyond the reach of empirical work, including in this work, as the answer depends on many factors such as the multiple ways in which an idea’s quality can be defined and measured. What has been determined is that prizes are awarded on average to worthy work. For example, the official statement by the International Congress of Mathematicians, which include the Field’s Medal, noted that on average prizes are not intended to select the best work, just work that is worthy: “We must bear in mind how clearly hindsight shows that past recipients of the Field’s medal were only a selection from a much larger group of mathematicians whose impact on mathematics was at least as great as that of the chosen” (ICM 1994)12.

If one accepts the ICM’s approach to prizes, newly identified linkages among prizewinners and prizes raise related questions about prizes and meritocracy. Ma et al. 9 found that the number of prizes worldwide have proliferated, which presumably offers more opportunity to celebrate scientists and ideas, however, as prizes have proliferated, prizes have become increasingly concentrated within a relatively small network of elite scientists and their mentees. Further, the most successful mentees study topics that differ from their mentor’s topics of study, suggesting that skills associated with conducting prizewinning work can be mentored and applied to new research areas16. Future research may therefore examine ways in which prizewinning tacit knowledge and research skills can be diffused more broadly across scientists and topics, and how vacancy chains within the prize network may open and close.

Another area of prize meritocracy focuses on representation. New research indicates that women are winning prizes more inline with their participation rates in science but continue to be but statistically less likely to win research prizes than their male peers18. Focusing on research areas where women’s participation is large and there are many prizes, such as biomedicine, women are winning prizes a rates closer to their representation in the field but the prizes they win have lower prestige and less money than the prizes won by their male counterparts18. Those trends coupled with the finding that a prize’s prestige and money predict the magnitude of extraordinary growth a prize can have on a topic suggests that future research should investigate how prize committee nomation and decision procedures are linked to diversity, a process already taking place among some prize committees66.

Relatedly, the question of prize meritocracy in science depends on the availability of data that can connect scientists, topics of study, and impact. Here we used the relatively new Microsoft Academic Graph (MAG) data, which uses expert human opinion and algorthims to identify differences in ideas and topics. The relatively new benefits of the MAG data is that it has explicit reproducbile procedures that can be evaluated for bias. Nevertheless, science is an ever changing entity. Year to year, new fields emerge and the semantic meaning of terms change over time and at different rates across different communities of practice, which means MAG provides a defensible snapshot of scientific topics, but not necessarily an immutable one.

In this way, our work contributes to knowledge about changes in scientific thinking, the mobility of scientists between topics, and investment funding that together broaden understanding of how the frontiers of science develop.

Methods

Difference-in-difference regression

To study whether a growth pattern before and after a topic is associated with a topic’s prizewinning event, we combine difference-in-difference regression with matching. This approach has been used frequently in studies of prizes and in other areas where observation data is not randomly assigned and where experimental random assignment may be impossible due to factors such as harm to subjects12,22,29. Borjas and Doran12 studied how Field Medal prizewinners’ career change relative to non-prizewinners. In their case, like ours, prizes were not assumed to be randomly assigned to topic. Hence, they used matching, where matching is done between the prizewinner and a set of “contenders” non-prizewinners whose historical scholarly performance would have made them contenders for the prize. To justify matches between prizewinners and non-prizewinners, Borjas and Doran stated, “In short, while it is tempting to claim that the 52 Fields Medal recipients are in a class by themselves and that there are no losing contenders with equivalent or better early achievements, this view does not correspond with what mathematicians themselves have written12.” For example, The International Congress of Mathematicians67, which awards the Fields Medal, the Nevanlinna Prize, the Gauss Prize, and the Chern Medal during the congress’s opening ceremony has officially stated: “We must bear in mind how clearly hindsight shows that past recipients of the Fields Medal were only a selection from a much larger group of mathematicians whose impact on mathematics was at least as great as that of the chosen” (ICM 1994)12. The arbitrariness in the number, timing, and field distribution of Fields Medal recipients means that a similarly great group of “contenders” should exist that can be contrasted with the winners in a difference-in-differences strategy to determine how winning the medal influences productivity and research choices.” Similarly, the DID regression used in our analysis is specified as follows:

$${Z}_{i,t}={{\beta }_{0}+\beta }_{1}{ {{prizewinnin}}}{ {{g}}}_{i}+{\beta }_{2}{ {{pos}}}{ {{t}}}_{t}+{\beta }_{3}{ {{prizewinnin}}}{ {{g}}}_{i}* { {{pos}}}{ {{t}}}_{t}+{ {{fixed}}\; {{effect}}}+{\epsilon }_{i,t},$$
(2)

where Zi,t is one of our six outcome variables for topic i at time t. Prizewinningi is a dummy variable quantifying whether the topic \(i\) is a prizewinning topic or a non-prizewinning topic from a matched group. \({{{{{{{\mathrm{Post}}}}}}}}_{t}\) is a dummy variable measuring whether time \(t\) is before or after the prizewinning event68. If the topic belongs to the matched group, the prizewinning year of the related prizewinning topic will be used as the reference point. \({\epsilon }_{i,t}\) is an error term. Regression is specified with fixed effects for discipline and prize year and robust standard errors. The parallel trend assumption (PTA) is essential for DID analysis. We met the PTA criteria by matching prizewinning and non-prizewinning topics on six separate growth indices that uses a procedure that ensures that the pre-prize growth indices of the prizewinning and matched group parallel each other (Fig. 2). Moreover, our analysis meets the PTA criteria of “indistinguishability.” “Indistinguishability” is a stronger version of the PTA criteria. PTA requires growth is constant. Indistinguishability requires the growth trends are constant and have zero difference. In our analysis, both groups have constant growth and have zero difference from each other.

Dynamic optimal matching procedure

To select the matched non-prizewinning topics, we used a Dynamic Optimal Matching method, which applies the Optimal Matching Method30,32,33 to a time-series data to simultaneously maximize the closeness and balance characteristics of accurate matching30,31,33,69. To study the relationship between between prizewinning and a topic’s growth pattern, we measured a topic’s growth for a 10-year period prior to the prizewinning event and 10 years after the prizewinning event for a total of 21 years of study per topic. In our periodization, time zero designates the prizewinning event year. The 10-year period prior to time zero was used to find five matched, non-prizewinning topics in the same discipline that grew in a statistically indistinguishable way from the prizewinning topic’s pre-prize growth on six growth measures: (a) productivity, (b) citation, (c) topic’s lead scientist impact, (d) #incumbent scientists, (e) #entrants, and (f) #disciplinary stars. The 10-year period after the prizewinning event was used to measure differences in the growth of the prizewinng topic and its matched, non-prizewinning topics. Our MAG data begins in 1960 and ends in 2017, which permitted us to study prizes and topic growth between 1970 and 2007. Our sample includes 12, 041 prizewinning of which a total of 11,539 topics (>95%) prizewinning topics could be statistically matched to 5 non-prizewinning topic using DOM. If a topic is associated with multiple prizes over time, we studied growth from the first prizewinning event. Limiting the analysis to topics win one prize during the 10-year post-prize period produced same results as the full sample.

First, we selected a matched topic candidate pool32. To achieve this, for each prizewinning topic \(i\), we selected up to 40 close-distance topics in terms of a distance measure (\(\theta_{i,j}\)) from the same discipline, generating a peer non-prizewinning candidate pool. For 95% of the prizewinning topics, a proper non-prizewinning candidate pool was identified (11,539/12,041 = 95.8%). To achieve matching, we defined a distance measure \(\theta_{i,j}\) to quantify the closeness between the prizewinning topic \(i\) and a non-prizewinning topic \(j:\)29,70

$$\theta_{i,j}=\frac{\mathop{\sum}\limits_{n=1}^{N}\mathop{\sum}\limits_{t={t^{* }}-{t_{0}}}^{{t^{*}}}\left({{{{{\rm{log }}}}}}{Y}_{i,n}\left(t\right)-{{{{{\rm{log }}}}}}{Y}_{j,n}\left(t\right)\right)^{2}}{N* ({t}_{0}+1)},$$
(3)

where \({Y}_{i,n}\) indicates the quantity for the topic \(i\) in terms of one of the \(N=\,\)6 matched categories (i.e. Productivity, Citations, Lead Scientist impact, #incumbents, #Entrants and #Disciplinary Stars). \(t\) measures number of years prior to the prizewinning year for topic \(i\), where t* represents the first prizewinning year for topic \(i\), and \({t}_{0}=10\), indicates we traced the growth pattern for topics in an 11-year duration, including 10 years prior to the prizewinning year. To account for possible correlations among different variables, we repeated the analyses using an alternative distance measure - Mahalanobis distance - to quantify the closeness of the topics, finding consistent results (Supplementary Fig. S6).

Second, to ensure the balance between the matched and prizewinning topics for the entire system, we select 5 matching topics from the candidate pool to be the topic’s peer group. In this process, we (1) minimized the distances between the matched and prizewinning topics in terms of \({\theta }_{i,j}\), and (2) made sure the distribution of the matched and prizewinning topics are acceptably and simultaneously close enough for all 66 covariates.

Specifically, we make sure that the differences between the prizewinning and matched topic groups are small enough for each matching category n and for any time t before the prizewinning event, where the differences between the prizewinning topic \(i\) and its expected growth at time \(t\) and category \(n\) are quantified by \({\Delta }_{i,n}(t)=({{{{{\rm{log }}}}}}{Y}_{i,n}\left(t\right)-{{{{{\rm{log }}}}}}{\widetilde{Y}}_{i,n}\left(t\right))\). The expected growth is obtained by averaging the trajectory of the matched topics. This problem is a classical optimization problem, which could be solved with typical Mixed Integer Programming (MIP) methods31,32. We found the best-optimized matching possible where (1) the distance between the prizewinning topics and the matched topics is minimized; at the same time, (2) the difference between the matched and prizewinning groups is not statistically significant for any \(t\in \left[-10,0\right]\) and \(n\in \left[1,6\right]\). Mathematically, we have:

$$\left|\frac{\mathop{\sum }\nolimits_{i=1}^{M}{\Delta }_{i,n}(t)}{M}-0\right| \, < \, 1.96* {SE}\left({\Delta }_{i,n}\left(t\right)\right)\cdot {for}\; \forall 1\le n\le 6,-10\le t\le 0.$$
(4)

Here \({SE}({\Delta }_{i,n}(t))\) measures the standard error of the \({\Delta }_{i,n}(t)\) for the prizewinning topics at time \(t\) and in category \(n\), and \(M\) captures number of prizewinning topics. To prevent outlier bias from topics with a large \({\Delta }_{i,n}(t)\) in the MIP process, we also measured the topic-by-topic growth of each individual topic. Specifically, for any \(t\left(-10\le t\le 0\right)\), we compare the growth pattern of each prizewinning topic and all of its five matched topics. For each of the 66 covariances, we ensure prizewinning topics had equal probability to show larger or smaller growth than their matched topics. This method not only guarantees closeness between the matched and prizewinning topics but also ensure good balancing between and within topic groups. The method has also been validated by an additional placebo test. Please see Supplementary Information and Supplementary Figs. S4S6 for detailed information.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.