The hot streak—loosely defined as ‘winning begets more winnings’—highlights a specific period during which an individual’s performance is substantially better than his or her typical performance. Although hot streaks have been widely debated in sports1,2, gambling3,4,5 and financial markets6,7 over the past several decades, little is known about whether they apply to individual careers. Here, building on rich literature on the lifecycle of creativity8,9,10,11,12,13,14,15,16,17,18,19,20,21,22, we collected large-scale career histories of individual artists, film directors and scientists, tracing the artworks, films and scientific publications they produced. We find that, across all three domains, hit works within a career show a high degree of temporal regularity, with each career being characterized by bursts of high-impact works occurring in sequence. We demonstrate that these observations can be explained by a simple hot-streak model, allowing us to probe quantitatively the hot streak phenomenon governing individual careers. We find this phenomemon to be remarkably universal across diverse domains: hot streaks are ubiquitous yet usually unique across different careers. The hot streak emerges randomly within an individual’s sequence of works, is temporally localized, and is not associated with any detectable change in productivity. We show that, because works produced during hot streaks garner substantially more impact, the uncovered hot streaks fundamentally drive the collective impact of an individual, and ignoring this leads us to systematically overestimate or underestimate the future impact of a career. These results not only deepen our quantitative understanding of patterns that govern individual ingenuity and success, but also may have implications for identifying and nurturing individuals whose work will have lasting impact.
Access optionsAccess options
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
Subscribe to Journal
Get full journal access for 1 year
only $3.90 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
We thank A.-L. Barabasi, R. Burt, J. Chown, J. Evans, C. Jin, L. Nordgren, W. Ocasio, B. Uzzi, Y. Yin, the Kellogg Insights and all members of NICO for invaluable comments. This work is supported by the Air Force Office of Scientific Research (AFOSR) under award number FA9550-15-1-0162 and FA9550-17-1-0089, and Northwestern University’s Data Science Initiative. R.S. acknowledges support from AFOSR grant FA9550-15-1-0364 and from the Central European University Intellectual Themes Initiative ‘Just Data’.Reviewer information
Nature thanks J. Walsh, J. West and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Extended data figures and tables
Extended Data Fig. 1 Additional results on hot streaks in artistic, cultural, and scientific careers.
a–c, The cumulative distribution P(≥Ni/N) for the order of the top three highest impact works within a career for artists (a), directors (b) and scientists (c). Ni denotes the order of the ith highest-impact work within a career. The colours denote different hit works, and the dashed grey line denotes P(≥Ni/N) for a uniform distribution. d–f, ϕ(N**, N***) for the second- and third-highest-impact works within a career. ϕ(N**, N***) is also overrepresented along the diagonal. g–i, ϕ(N*, N***) for the first- and third-highest-impact works within a career. j–r, We shuffled the order of each work in a career while keeping their impact intact. The diagonal patterns in d–i and Fig. 1a–c disappeared for shuffled careers. s–u, ϕ(N*, N**) predicted by the hot-streak model successfully recovered the diagonal patterns observed in a–c. For d–u and Fig. 1a–c, we applied the same binning procedure to data, using bins that ranged from 0 to 1 with increments of 0.1.
a, The distribution of auction price P(Price) for artists. Blue dots denote data, and the red line is a log-normal distribution with average μ = 7.9 and standard deviation σ = 1.5. b, The distribution of film rating P(Rating) for directors. The red line is a normal distribution with average μ = 7.1 and standard deviation σ = 1.2. c, The distribution of raw and rescaled C10 (inset) for scientists. The red line is a log-normal distribution, with μ = 2.3 and σ = 1.3 for c and μ = −0.4 and σ = 0.8 for the inset. d–f, Definitions of the longest streak L for artists (d), directors (e) and scientists (f). Dots are coloured orange above the threshold, blue otherwise. The lower panel highlights the longest streak in a career. g–i, P(L) for real careers and P(Ls) for shuffled careers using the mean impact within a career as the threshold. j–l, As in g–i, but using the top 10% impact as the threshold to calculate L and Ls. In all cases, P(L) has a wider tail than P(Ls), indicating that high-impact works in real careers tend to cluster together.
To test the robustness of our results, we repeated our measurements by controlling for the career length of individuals. a–i, Artists and directors with careers of at least 20 years and scientists with careers of at least 30 years. a–c, P(≥Ni/N) of the top three highest-impact works within a career. d–f, among the top three highest-impact works in a career. g–i, P(L) for real careers and P(Ls) for shuffled careers. j–r, As in a–i but for artists and directors with careers of at least 30 years and scientists with careers of at least 40 years. These results demonstrate that the patterns observed in Fig. 1 hold for individuals with different career lengths.
a, for artists who started their careers before 1850. b, P(L) for real careers and P(Ls) for shuffled careers for artists who started their careers before 1850. c, for artists who started careers between 1850 and 1900. d, P(L) for real careers and P(Ls) for shuffled careers for artists who started their careers between 1850 and 1900. These results demonstrate that the patterns observed in Fig. 1 hold for artists from different eras.
a–c, Correlations between ΓH and Γ0 for artists (a; n = 3,166), directors (b; n = 5,098) and scientists (c; n = 18,121). The blue background denotes the kernel density of data, dots represent binning results of data, and the red lines depict a linear fit. Inset, the relationship between ΔΓ ( = ΓH − Γ0) and Γ0. d–f, The distribution of τH/T, representing the duration of hot streaks over total career lengths. The temporally localized nature of a hot streak is also captured by its proportion over career length τH/T.
a–c, g(t) of three scientists in our data set with mid-career (a), late-career (b) and early-career (c) onset of hot streaks. Red dots denote data, the blue line is the null model’s prediction based on early performance, and the red line captures the predictions from the hot-streak model, with dashed grey lines denoting the start and end of hot streaks. d, The difference between our hot-streak model and the null model for each individual, Δg(t). Dashed lines with corresponding colours denote the start of the hot streak. d illustrates the discrepancies in estimating an individual’s future impact if we ignore the uncovered hot streaks. e, The distribution of the BIC measure, P(BIC), showing that the hot-streak model outperforms the null model in describing g(t) after accounting for model complexity. f, The distribution of the MAPE measure, P(MAPE), showing that the hot-streak model outperforms the null model in describing g(t). g, The uncertainty envelope of g(t) for an individual in our data set. Blue dots denote data, and the red line is the fitting result of equation (4). Shaded area illustrates predicted uncertainty (one standard deviation). h, The fraction of g(t) falling within the envelope for the null model (blue) and our hot-streak model (red). Fraction = 1.0 indicates that the entire g(t) trajectory falls within the envelope. i, Average MAPE of our hot-streak model and the null model for individuals with early-career, mid-career and late-career onset of hot streaks. The difference is largest for individuals with early-onset hot streaks and smallest for those with late-onset ones.
a–f, Illustrative examples of Γ(N) for the hot-streak model (a), right trapezoid function (b), isosceles trapezoid function (c), quadratic function (d), tent function (e) and left trapezoid function (f). g, The distribution of the relative position of the three highest-impact works among the six highest-impact works within a career for artists, where denotes the relative order among the top six hits. h–m, predicted by corresponding models shown in a–f, respectively, according to artists’ real productivity profiles. To test whether data agree with model predictions, we measured their statistical difference using the P value of the Kolmogorov–Smirnov test for discrete distributions. We colour the distributions green if we cannot reject the hypothesis that the data and the model predictions come from the same distributions, and red otherwise. Among the six models, the hot-streak model is the only model whose predictions are consistent with the data in terms of the relative ordering among the six highest-impact works observed in real careers. n, The proportion of real careers that are captured by the model with the smallest BIC among different hypotheses. The hot-streak model again stands out as the best model to describe real careers. We repeated the analyses for directors (p–v) and scientists (x–ad), the conclusions remained the same across all three domains.
Here we test whether the observed patterns can be explained by Markovian dynamics that introduce correlations between neighbouring data points. We first test the assumptions of the Markovian hypothesis from the data (a–f). a–c, The distribution of N, N + 1 differences between adjacent data points observed in real careers for artists (a, n = 3,480), directors (b, n = 6,233) and scientists (c, n = 20,040). d–f, The autocorrelation measured in real careers for artists (d, n = 3,480), directors (e, n = 6,233), and scientists (f, n = 20,040). a–f suggest that there is little short-range correlation in data across the three domains. We test three variants of Markovian models (g–l). The details of these models are outlined in Supplementary Information S6.2. g–i, ϕ(N*, N**) of the top two highest-impact works within a career for three Markovian models using scientists’ profiles as input. j–l, The distribution of the longest streak length P(L) and P(Ls) using median impact within a career as threshold for the three Markovian models. g–l demonstrate that the three Markovian models failed to capture the observed colocations among hits.
a–c, Each subplot denotes the fitting result on Γ sequence for a randomly selected career for artists (a), directors (b) and scientists (c). Blue dots denote the moving average Γ(N) from data and red lines denote the best fitting result of the hot-streak model for each individual.
a–c, The distribution of average impacts for individuals with one or more than one hot streaks for artists (a), directors (b) and scientists (c). Blue dots denote individuals with one hot streak, and red dots denote individuals with at least two hot streaks. d–f, The distribution of the number of works P(N) within a career for individuals with one or more than one hot streak for for artists (d), directors (e) and scientists (f). g–i, The distribution of career length P(τ) for individuals with one or more than one hot streaks for artists (g), directors (h) and scientists (i). j–l, The distribution of P(ΓH) for individuals with one or more than one hot streaks for artists (j), directors (k) and scientists (l). Between those who have one or two hot streaks, there is no detectable difference in terms of typical performance metrics, including impact, productivity and career length, suggesting that the hot streak captures an orthogonal dimension to current metrics characterizing individual careers.