Significant hot hand effect in the game of cricket

Ram, Sumit Kumar; Nandan, Shyam; Sornette, Didier

doi:10.1038/s41598-022-14980-7

Article
Open access
Published: 08 July 2022

Scientific Reports volume 12, Article number: 11663 (2022)

Abstract

We investigate the predictability and persistence of individual and team performance (hot-hand effect) by analyzing the complete recorded history of international cricket. We introduce an original temporal representation of performance streaks, which is suitable to be modelled as a self-exciting point process. We confirm the presence of predictability and hot hands in individual performance and the absence of the same in team performance and game outcome. Thus, cricket is a game of skill for individuals and a game of chance for the teams. Our study contributes to recent historiographical debates concerning the presence of persistence in individual and collective productivity and success. The metrics and methods we introduce can be useful to test and exploit clustering of performance in the study of human behavior and in the design of algorithms for predicting success.

Introduction

The study of what brings success or failure in battles and wars, in politics, in business, in sports, and even in our personal lives has a very long history, being part of the DNA of human evolution that has tended to promote the genes of the “successful ones”¹. The ‘science of success’ has received a boost in recent years with the growing availability of large datasets describing individuals’ careers, from which much can be learned and, importantly, predicted^{2,3,4,5,6,7,8,9,10}. The increasing shift towards collaborative and team-based effort (performance) in recent decades has made it more important to quantify and predict teamwork^{11,12,13,14,15}. However, an understanding of how predictability in individual performance translates to team performance is still missing.

In this study, we develop novel statistical tools to uncover the temporal features that are characteristic of a set of performances. We explore the complete history of international cricket^16,17 to quantify individual and team performances. We study the sequence of consecutive performances of each player and team. By investigating the scores of individual players against the index of the games within the career, we note that success breeds success in individual careers (also supported by the ARIMA model in SM). We further document that the best performances in a given player’s career are clustered in time (see Fig. 1), contrary to previous findings^18,19. However, we cannot say the same for teams. We uncover the presence of hot hands in individual careers in both formats of the game but the absence of the same in team performances. Our proposed Hawkes model applied to the performance time not only outperforms traditional techniques like ARIMA and autocorrelation measures^20,21 (see SM) but is also successful in capturing the ingredients of self-excitation in the patterns of consecutive superior performances. These findings raise intriguing questions regarding the nature and extent of predictability of one’s success and team success in a team game. This is particularly interesting, since these findings refute the well-established narratives of the absence of hot hands in team games^18,19,22,23, where performances are usually driven by stochastic events. Our findings suggest that the hot hand effect is not just a psychological bias^18,19. Part of our results corroborates previous works on hot hands^8,9,24,25,26. One possible explanation for the observation of such peculiar behavior in the game of cricket may be the relatively larger importance of skill in the outcomes of a player’s game and of luck in the outcomes of a team’s game^10,27.

The rest of the article is structured as follows. In section “Literature review”, we present a short literature review to motivate our study and put it in the right context. Section “Methods” includes three subsections: “Dataset”, “Distributions of temporal locations of best performances” and “Clustering point process representation”. Subsection “Dataset” describes the dataset that has been used in the study and the data acquisition methodology. Subsection “Distributions of temporal locations of best performances” summarizes the empirical observations. Subsection “Clustering point process representation” presents our proposed clustering point process representation in the form of a self-excited point process model to quantify and predict the hot hands in the sequences of performances. Section “Results and discussion” presents our main results. We summarize the conclusions of the study in section “Conclusion”.

Literature review

A much-debated question is whether or not a string of successes of an individual or entity makes continued success more likely. When present, this is called the hot hand effect. When it is absent, the belief in it is called the hot-hand fallacy, whereas the belief in the opposite, i.e., that success is less likely after a streak of successes, is called the gambler’s fallacy²⁸. The question of whether the hot hand effect genuinely exists is important, as a positive answer has far-reaching consequences in several research fields, including finance and econometrics^{10,26,29,30,31}, psychology^18,19,32,33 and sociology^2,8,9,34,35. The debate on the “hot-hand fallacy” vs. the “gambler’s fallacy” revolves around a deeper question: ‘to what extent are human beings capable of dealing with inherent systemic stochasticity?’^10,27. In their seminal paper, Gilovich et al. refute the validity of “the hot hand” and “streak shooting” in the game of basketball¹⁸. Their analyses of the shooting records of the Philadelphia 76ers and Boston Celtics, and of a controlled shooting experiment with the men and women of Cornell’s varsity teams, provided no evidence for a positive correlation between the outcomes of successive shots. They further showed that the belief in the hot hand and the detection of streaks in random sequences is nothing but an expression of the general misconception of chance¹⁸, according to which even short random sequences are thought to be highly representative of their generating process. There has been very strong support for this reasoning in the literature, especially in the field of finance and economics^{23,30,31,32,36}. These studies support the idea that the hot-hand effect is a fallacy, stating that the hot hand does not exist and is nothing but a psychological bias based on the “law of small numbers”. Moreover, these studies warn that this fallacy may often lead people to take costly and risky decisions.

On the other side of the debate, Miller and Sanjurjo²⁴ have recently challenged the original findings in¹⁸, with contrasting conclusions revealing significant evidence for streak shooting. Miller and Sanjurjo showed that the method used in¹⁸ introduced a sampling bias because it starts counting after a series of hits/misses. They further showed that the method of¹⁸ is biased towards more misses, thus claiming that the equal rate of hits to misses after a streak presented in¹⁸ is, in fact, a sign of a hot hand. Csapo and Raab³⁷ found evidence for the “hot hand” in that making the first free throw is associated with a significantly higher probability of making the second free throw. The debate about successful streaks has gained fresh prominence in many other fields, with many arguing for the presence of such streaks in large-scale data sets of scientific, artistic and acting careers^8,9,33,38,39.

From the point of view of sport psychology, a belief built out of random sequential events can have positive effects on behavior. In volleyball, athletes believe in the hot hand, and streaks do exist for half of the players. Coaches can detect players' performance variability and use it to make strategic decisions, and playmakers are also sensitive to streaks and tend to use them "adaptively," which results in more hits for a team⁴⁰. The belief in hot streaks can provide valid cues to decide who to give shots to, and this behavior is supported by the fallacious belief in dependency⁴¹. This is further validated in⁴², which analyzed the sequential choices made by expert athletes and found that they were sensitive to base rates and adapted their decision strategies accordingly. Additionally, defensive pressure and shot difficulty increase during hot streaks, so that defenders seem to behave according to the hot-hand belief and try to force hot players into more difficult shots³⁷. Thus, even a single successful shot is enough to increase a player's likelihood of taking the next shot, and also to increase the average distance from which that shot is taken⁴³. Arkes⁴⁴ also found evidence for the “hot hand” in that making the first free throw is associated with a significantly higher probability of making the second free throw. However, the success rate of the next shot can be lower, while the coach is less likely to replace the player⁴³. Additionally, ref.³⁷ also found that shooting percentages of presumably hot players do not increase and that shooting performance is not related to streakiness, so that the defenders' hot-hand behavior cannot be considered ecologically rational.

The above debates revolve around the investigation of the presence or absence of the hot-hand effect in individual performances. However, they fail to show how these effects can be exploited for better prediction or how the aggregated individual performances drive the evolution of team performance. In this study, we present a novel methodology to better understand and predict individual and team performances. We derive our methodology from the self-excited conditional Hawkes point process⁴⁵, which has been applied in a variety of fields, particularly in the description of social diffusion processes^46,47,48, financial systems^49,50,51, and seismological predictions^52,53,54. To the best of our knowledge, this is the first use of Hawkes processes in the domain of the ‘science of success’. We apply our methodology to study the presence (or absence) of the hot hand effect within the performance sequences of individuals in the game of cricket. Our methodology would be useful in predicting and quantifying the hot-hand effect in performance sequences in many other domains.

Methods

Dataset

The dataset we use in this study includes 4178 One Day International (ODI) games starting from January 5, 1971, till July 1, 2019 (48 years) and 2351 international Test games spanning March 1877 to March 2019 (142 years) (see SM for data acquisition and preparation). We record 51,699 batting performances of 2959 Test batsmen and 51,088 bowling performances of 2874 Test bowlers, 90,166 batting performances of 2500 ODI batsmen and 90,754 bowling performances of 2505 ODI bowlers (in total 283,707 records) (see Fig. 2). The dataset further contains the information about the performance of the teams and the outcomes of the games. To have meaningful calibration results, we only analyze the performances of those batsmen who have played at least 30 games (see goodness of fit in SM).

Distributions of temporal locations of best performances

To study the self-excited nature of the scores in an individual’s career, we investigate the relative positions of the best three performances in each player’s career. We first order the games within one's career according to calendar time. We define the index (t) of any game by the rank of this game within this ordered sequence. We denote $t_{j}^{*}$ the index of the best performance in player j’s career, i.e.,

$$\begin{array}{*{20}c} { t_{j}^{*} = argmax_{t} S_{j} \left( t \right)} \\ \end{array}$$

(1)

where $S_{j} ( t )$ is the performance of player $j$ at the $t^{\mathrm{th}}$ attempt. We also define $t_{j}^{**}$ and $t_{j}^{***}$ as the indices of the second- and third-best performances, and $\tau_{j}$ as the length of an individual’s career. We then calculate the relative differences of the indices of the three best performances as

$$\Delta_{j}^{1,2} = \frac{{t_{j}^{*} - t_{j}^{**} }}{{\tau_{j} }},\Delta_{j}^{1,3} = \frac{{t_{j}^{*} - t_{j}^{***} }}{{\tau_{j} }},\Delta_{j}^{2,3} = \frac{{t_{j}^{**} - t_{j}^{***} }}{{\tau_{j} }}$$

(2)

for all players in our dataset and define the marginal probability density functions $P( {\Delta_{j}^{1,2} } ), P( {\Delta_{j}^{1,3} } ), P( {\Delta_{j}^{2,3} } )$ and the joint probability distribution $Q( {\Delta_{j}^{1,2} ,\Delta_{j}^{1,3} } )$. As a control, we shuffle the indices of the performances within the individual’s career and reevaluate these quantities. The primed quantities correspond to the shuffled career, i.e., $t_{j}^{\prime *}$ corresponds to the index of the best performance within the randomly reshuffled career of player j, and so on. We define the marginal probability density functions $P( {\Delta_{j}^{\prime 1,2} } ), P( {\Delta_{j}^{\prime 1,3} } ), P( {\Delta_{j}^{\prime 2,3} } )$, which are the distributions of the shuffled versions $\Delta_{j}^{\prime 1,2} \,{\text{of}}\,\Delta_{j}^{1,2} ,\Delta_{j}^{\prime 1,3} \,{\text{of}}\,\Delta_{j}^{1,3}$ and $\Delta_{j}^{\prime 2,3} \,{\text{of}}\,\Delta_{j}^{2,3}$. We define the ratios $R( {\Delta t} )$ of these marginal probabilities to quantify the temporal colocation of the best performances in an individual career

$$\begin{array}{*{20}c} {R\left( {\Delta t} \right) = \frac{{P\left( {\Delta t} \right)}}{{P^{\prime } \left( {\Delta t^{\prime } } \right)}},\quad {\text{where}}\quad \Delta t = \Delta_{j}^{1,2} ,\Delta_{j}^{1,3} \,{\text{or}}\,\Delta_{j}^{2,3} .} \\ \end{array}$$

(3)

Figure 1 presents the joint probability distribution of the relative difference of indices of the best and second-best performances against that of the best and third-best performances $( {Q( {\Delta_{j}^{1,2} ,\Delta_{j}^{1,3} } )} )$ (top panels), defined by Eq. (2), for the ODI and Test formats over all individuals’ careers. We observe a concentration of high probability around the origin (0,0) in both formats of the game. This concentration is interesting since it is a feature of a self-excited process and is not expected in a purely memoryless Poissonian process. We further compare the joint probability distribution $( {Q( {\Delta_{j}^{1,2} ,\Delta_{j}^{1,3} } )} )$ with the corresponding reshuffled joint probability distribution $( {Q( {\Delta_{j}^{{{\prime }1,2}} ,\Delta_{j}^{{{\prime }1,3}} } )} )$ and present the comparison in figure S2. The p values from the 2D Kolmogorov–Smirnov two-sample tests in figure S2 indicate significant clustering around the origin. This finding constitutes a first line of evidence for the existence of temporal clustering in the performances across players’ careers.

The bottom panels of Fig. 1 show the ratio $R( {\Delta t} )$ (Eq. (3)), which compares the marginal probability distribution of the relative difference of the indices in the real careers against that obtained from shuffled careers. The distinctive peak around 0 in the plots provides additional support for clustering of performance within careers. $R( {\Delta t} )$ is approximately symmetric around the origin, indicating that the highest performance is equally likely to arrive before or after the second-highest and third-highest scores. This pattern is expected from a self-excited process with approximately equal propensity for performance persistence among the best performance streaks^55,56, as was shown in the context of earthquake clustering in time and space. Here, we can think of the highest performance as equivalent to the main shock in a seismic sequence. The main shock can then be shown to be triggered by large events that occur before it (“foreshocks”), and the main shock itself triggers large events (“aftershocks”)^55,56.
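As an illustration of Eq. (2) and of the shuffled control, the following minimal sketch computes the relative index differences of the three best performances for a single career. The function names, the tie-breaking rule (earliest game first) and the example scores are assumptions made for this sketch and are not taken from the study.

```python
import random

def top_three_indices(scores):
    """Return the 1-based indices (t*, t**, t***) of the three best scores.

    Ties are broken in favour of the earlier game (an assumption of this sketch).
    """
    order = sorted(range(len(scores)), key=lambda i: (-scores[i], i))
    return tuple(i + 1 for i in order[:3])

def relative_gaps(scores):
    """Eq. (2): index differences of the top three, normalised by career length tau_j."""
    t1, t2, t3 = top_three_indices(scores)
    tau = len(scores)
    return ((t1 - t2) / tau, (t1 - t3) / tau, (t2 - t3) / tau)

def shuffled_gaps(scores, n_shuffles=100, seed=0):
    """Control: recompute the gaps on randomly reshuffled copies of the career."""
    rng = random.Random(seed)
    gaps = []
    for _ in range(n_shuffles):
        shuffled = list(scores)
        rng.shuffle(shuffled)
        gaps.append(relative_gaps(shuffled))
    return gaps

# Hypothetical career of ten games (scores are invented for illustration).
career = [12, 5, 48, 101, 96, 7, 0, 33, 110, 21]
print(relative_gaps(career))
print(shuffled_gaps(career, n_shuffles=3))
```

Histograms of these gaps over all players, for real and shuffled careers, would give the marginal distributions whose ratio is $R( {\Delta t} )$ in Eq. (3).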

Clustering point process representation

Definition of the “performance time”

We call $S_{j} ( t )$ the performance (see SM for more details about the game of cricket) of player $j$ at the $t$th attempt within the player's career. We define the subordinate time process $H_{j} ( t )$ of the stochastic process $S_{j} ( t )$⁵⁷ as

$$\begin{array}{*{20}c} {H_{j} \left( t \right) = \mathop \sum \limits_{{t_{i} = 1}}^{t} \frac{1}{{S_{j} \left( {t_{i} } \right)}}} \\ \end{array}$$

(4)

The $t \to H_{j} ( t )$ map represents a nonlinear transformation from the index $t$ onto an effective “performance time” of player $j$. $H_{j} ( t )$ denotes a transformed time-stamp at which the $t$th event takes place for player $j$. This defines a point process along “performance time” with the time stamps $\{ {H_{j} ( {t_{1} } ),H_{j} ( {t_{2} } ), \ldots ,H_{j} ( {t_{n} } ), \ldots } \}$. The intuition behind definition (4) is that a series of strong performance values $\{ {S_{j} ( {t_{i} } ), S_{j} ( {t_{i + 1} } ), \ldots } \}$ adds only small increments $1/S_{j}$ and is therefore transformed into closely clustered points in “performance time”. This allows us to analyze the relationship between performances in time using simple one-dimensional techniques. In other words, by transforming $S_{j} ( t )$ into $H_{j} ( t )$, we project the stochastic process described by the sequence $\{ {S_{j} ( t ),t = 1, \ldots } \}$ onto a one-dimensional point process with time stamps $\{ {H_{j} ( {t_{1} } ), H_{j} ( {t_{2} } ), \ldots , H_{j} ( {t_{n} } ), \ldots } \}$. By construction, the $t \to H_{j} ( t )$ transformation preserves the self-excited component of performance scores described by the stochastic process $\{ {S_{j} ( t )} \}$ and amplifies it by the magnitude of the performance values.

Figure 3 presents the example of the career of Sachin Tendulkar, who has the highest sum of performances in both formats of the game. The top panels show the performance time $H( t )$, as defined in Eq. (4), as a function of the attempt index $t$ for the two international cricketing formats, ODI and Test. The bottom panels show the scores $S_{j} ( t )$ as a function of $t$ for the same two formats. The presence of local temporal clustering around the high and low performances is clearly visible in both representations, $H( t )$ and $S( t )$, for this player.
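The transformation in Eq. (4) can be sketched in a few lines. The `offset` parameter below is a hypothetical device added for this sketch so that a score of zero does not cause a division by zero; how zero scores are treated in the study is not stated in this section (see SM), so this choice should be read as an assumption.

```python
from itertools import accumulate

def performance_time(scores, offset=0.0):
    """Eq. (4): H_j(t) is the running sum of 1 / S_j(t_i) for t_i = 1..t.

    `offset` is added to every score before inversion. It is an assumption of
    this sketch (not from the paper) to keep zero scores finite; with the
    default offset=0.0 the function follows Eq. (4) literally and raises
    ZeroDivisionError if any score is zero.
    """
    return list(accumulate(1.0 / (s + offset) for s in scores))

# Hypothetical scores: three strong innings in the middle of a weak stretch.
scores = [4, 3, 80, 95, 120, 5, 2]
H = performance_time(scores, offset=1.0)
gaps = [b - a for a, b in zip(H, H[1:])]
print(H)
print(gaps)  # small gaps correspond to the strong performances
```

Consecutive high scores produce small increments of $H$, so they appear as a tight cluster of points on the performance-time axis.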

Hawkes point process along the “performance time”

The performance time $H_{j} \left( t \right)$ of player j defined by expression (4) allows us to introduce a point process by the performance times $\{ H_{j} \left( {t_{1} } \right),H_{j} \left( {t_{2} } \right), \ldots ,H_{j} \left( {t_{n} } \right), \ldots \}$ along the $H$ axis. In other words, we consider the “performance time” axis $H_{j} \left( t \right)$ and, along this new time axis, we identify “points” at the locations $\{ H_{j} \left( {t_{1} } \right),H_{j} \left( {t_{2} } \right), \ldots ,H_{j} \left( {t_{n} } \right), \ldots \}$. When player $j$ has a series of large scores, this is expressed as a cluster of closely spaced points along the $H$ axis as shown in Fig. 3.

Inspired by the analyses of^46,49,51 using generalized non-homogeneous Poisson processes, we propose to model the clustering of the points along the H axis of each player by using the self-excited stochastic Hawkes point process model^45,50, augmented by some necessary ingredients for constructing a prediction model¹⁹. In other words, we visualize the points for a given player j along the performance time axis $H_{j} \left( t \right)$ as being generated by a Hawkes model with intensity $\lambda \left( t \right)$ given by

$$\begin{array}{*{20}c} {\lambda \left( t \right) = \mu + \mathop \sum \limits_{{t_{i} < t}} \varphi \left( {t - t_{i} } \right)} \\ \end{array}$$

(5)

In expression (5), the first term μ on the right-hand side is the background intensity, which quantifies the “intrinsic” performance level of a player, uninfluenced by his/her past performances. The second term describes how past points can trigger future points along the H axis. This is a convenient and elegant way to account for the possibility of a hot-hand effect, since each next point is a function of the whole history, with a weight quantified by the memory or kernel function $\varphi \left( {t - t_{i} } \right) > 0$, which decays as a function of its argument (points further in the past have a weaker influence). Thus, the sum $\mathop \sum \nolimits_{{t_{i} < t}} \varphi \left( {t - t_{i} } \right)$ quantifies the influence of the history of past performances on a player’s present performance.

Depending on the problem, previous researchers have used different parametric forms for φ; e.g., refs.^46,53,54 use a power law kernel, whereas ref.⁵⁸ uses an exponential kernel. In the present case, as there is no reason to favor any parametric form, we decide to use a non-parametric kernel function for φ^50,59. Thus, shortly after a large performance amplitude, model (5) describes the possibility that the intensity of observing a similar performance is boosted and then decays to the baseline average performance level $\mu$ at long times.
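To make expression (5) concrete, the sketch below evaluates the conditional intensity for a given list of past points. It uses an exponential kernel purely for illustration; this is a swapped-in parametric choice and not the non-parametric kernel adopted in the study. The parameter names `n` and `beta` and the example values are assumptions of the sketch.

```python
import math

def exp_kernel(dt, n=0.5, beta=1.0):
    """Illustrative kernel phi(dt) = n * beta * exp(-beta * dt), for dt > 0.

    Its integral over [0, inf) equals n. The exponential form is an assumption
    of this sketch; the study uses a non-parametric kernel instead.
    """
    return n * beta * math.exp(-beta * dt)

def intensity(t, events, mu, kernel=exp_kernel):
    """Eq. (5): lambda(t) = mu + sum over past points t_i < t of phi(t - t_i)."""
    return mu + sum(kernel(t - ti) for ti in events if ti < t)

# Hypothetical points along the performance-time axis.
points = [0.2, 0.25, 0.3, 2.0]
for t in (0.1, 0.35, 1.5, 2.05, 6.0):
    print(t, round(intensity(t, points, mu=0.3), 4))
```

Right after the cluster of three points the intensity is well above the background level `mu`, and it relaxes back towards `mu` when no new points arrive.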

The self-excited Hawkes conditional point process is one of the simplest models to account for how the past can influence the future, while keeping a very convenient linear dependence of the past onto the future. The most important parameter of the Hawkes model is its branching ratio defined by

$$\begin{array}{*{20}c} {n = \mathop \int \limits_{0}^{\infty } \varphi \left( t \right)dt.} \\ \end{array}$$

(6)

The branching ratio n is the average number of first-generation points (or events) triggered by a given point. It is also the fraction of points (events) that have been triggered by past events⁶⁰. A value of n close to the critical value 1 thus indicates a high level of triggering (strong hot hand effect) and endogeneity. Please see figure S4 for details about the method used.
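For a non-parametric kernel represented as a histogram (piecewise constant on bins and zero beyond the last bin edge), the integral in Eq. (6) reduces to a sum of bin heights times bin widths. The sketch below assumes this histogram representation and uses invented kernel values; it is not the estimator used in the study.

```python
def branching_ratio(bin_edges, kernel_values):
    """Eq. (6) for a piecewise-constant kernel: n = sum_k phi_k * (e_{k+1} - e_k).

    bin_edges     : increasing edges e_0 = 0 < e_1 < ... < e_K
    kernel_values : kernel height phi_k on [e_k, e_{k+1}); zero beyond e_K
    Both the histogram form and the numbers used below are assumptions.
    """
    if len(bin_edges) != len(kernel_values) + 1:
        raise ValueError("need exactly one more edge than kernel values")
    return sum(v * (b - a)
               for v, a, b in zip(kernel_values, bin_edges, bin_edges[1:]))

edges = [0.0, 1.0, 2.0, 4.0]
phi = [0.4, 0.2, 0.05]  # hypothetical decaying kernel
n = branching_ratio(edges, phi)
print(n)  # fraction of points attributed to triggering by earlier points
```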

We use the expectation maximization algorithm as described in ref.⁵⁰ to calibrate the model.
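The general idea of expectation maximization for a Hawkes process can be sketched as follows. This is not the non-parametric procedure of ref.⁵⁰: for brevity the sketch assumes an exponential kernel $\varphi(t) = n\beta e^{-\beta t}$ and ignores edge effects at the end of the observation window, and all names and example data are hypothetical.

```python
import math

def em_hawkes_exp(times, horizon, n_iter=100, mu=None, n=0.5, beta=1.0):
    """Approximate EM for a Hawkes process with kernel n*beta*exp(-beta*dt).

    times   : sorted event times in [0, horizon]
    horizon : length of the observation window
    Edge effects are ignored (assumption), so n is estimated as the expected
    number of triggered events divided by the number of events.
    """
    N = len(times)
    if mu is None:
        mu = 0.5 * N / horizon
    for _ in range(n_iter):
        bg = 0.0    # expected number of background events
        trig = 0.0  # expected number of triggered events
        wdt = 0.0   # probability-weighted sum of parent-child delays
        for i in range(N):
            # E-step: weight of each earlier event j as the parent of event i
            w = [n * beta * math.exp(-beta * (times[i] - times[j]))
                 for j in range(i)]
            lam = mu + sum(w)
            bg += mu / lam
            for j in range(i):
                p = w[j] / lam
                trig += p
                wdt += p * (times[i] - times[j])
        # M-step: closed-form updates
        mu = bg / horizon
        n = trig / N
        if wdt > 0.0:
            beta = trig / wdt
    return mu, n, beta

# Hypothetical clustered points on a performance-time axis of length 10.
events = [0.1, 0.2, 0.25, 3.0, 3.1, 3.15, 3.2, 7.5, 7.6]
mu_hat, n_hat, beta_hat = em_hawkes_exp(events, horizon=10.0)
print(mu_hat, n_hat, beta_hat)
```

In the E-step each point is split probabilistically between "background" and "triggered by an earlier point"; the M-step then re-estimates the background rate, the branching ratio and the decay rate from these soft assignments.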

Results and discussion

Hot individual hands

We partition the career of a player j into a training set and a validation set. We take the first 80% of the performances as the training set and the next 20% as the validation set. We transform the performance sequences in the training and validation sets to the performance time representation (4), as discussed in the “Methods” section. We calibrate the model on the performance time in the training set to determine the background intensity $\mu$ and the memory kernel $\varphi$. We then use the calibrated $\mu$ and $\varphi$ to evaluate the prediction performance in the validation set using the log-likelihood score and call the median value ${\mathcal{L}}_{j}^{model}$.

Similarly, we prepare a control set of log-likelihood estimates for the same player. Keeping the validation set unaltered, we shuffle the sequence of performances in the training set 100 times and use each shuffled sequence to train the model. We evaluate the trained models on the unaltered validation set to determine the corresponding median log-likelihood estimate ${\mathcal{L}}_{j}^{control}$. With the above constructions, we define the relative difference $\delta ( {{\mathcal{L}}_{j}^{model} ,{\mathcal{L}}_{j}^{control} } )$ by

$$\begin{array}{*{20}c} {\delta \left( {{\mathcal{L}}_{j}^{model} ,{\mathcal{L}}_{j}^{control} } \right) = \frac{{{\mathcal{L}}_{j}^{model} - {\mathcal{L}}_{j}^{control} }}{{{\mathcal{L}}_{j}^{control} }} } \\ \end{array}$$

(7)
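The bookkeeping behind this comparison (the 80/20 split, the shuffled training sets and Eq. (7)) can be sketched as below. The model fitting and log-likelihood evaluation are deliberately left out; the log-likelihood numbers in the usage lines are invented placeholders, and all function names are assumptions of the sketch.

```python
import random
import statistics

def split_career(scores, train_fraction=0.8):
    """First 80% of the performances for training, remaining 20% for validation."""
    cut = int(len(scores) * train_fraction)
    return scores[:cut], scores[cut:]

def control_training_sets(train, n_shuffles=100, seed=0):
    """Shuffled copies of the training set; the validation set stays untouched."""
    rng = random.Random(seed)
    sets = []
    for _ in range(n_shuffles):
        shuffled = list(train)
        rng.shuffle(shuffled)
        sets.append(shuffled)
    return sets

def relative_difference(l_model, l_control):
    """Eq. (7): (L_model - L_control) / L_control."""
    return (l_model - l_control) / l_control

career = list(range(1, 51))  # hypothetical career of 50 games
train, valid = split_career(career)
controls = control_training_sets(train, n_shuffles=100)

# Placeholder log-likelihood scores (invented numbers, for illustration only).
l_control = statistics.median([-101.0, -100.0, -99.0])
print(len(train), len(valid), len(controls))
print(relative_difference(-90.0, l_control))
```

Note that, as written, the sign of $\delta$ depends on the sign of ${\mathcal{L}}_{j}^{control}$: with the negative placeholder values above, a model log-likelihood that is higher than the control gives a negative $\delta$. This is a property of the formula and says nothing about the sign convention used in the figures.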

Additionally, we estimate the branching ratios (see Eq. (6))^46,49,53 of the performance time for all players over the duration of their entire careers. For comparison, we construct null estimates by randomly shuffling the performance time 100 times and evaluating the 100 null branching ratios for each of the players.

The relative difference of log-likelihood prediction scores in Eq. (7) is shown in the bottom panels of Fig. 4 for both formats of the game. The insets present the fraction of cases in which the control performs better and the fraction of cases in which the model performs better. The results show a significant improvement in prediction score in the model experiments compared to the control experiments. We plot the distribution of the branching ratios obtained from the data and the null branching ratios and compare them in the top panels of Fig. 4. In the plots, the shaded region marks the fraction of players’ branching ratios that are never found in the null models. This behavior is robust against the number of simulated null models, i.e., the fraction of players’ branching ratios that are never found in the null model remains the same even if we consider 500 and 1000 null models.

We then compare the log-likelihood scores from the 100 control estimates with the log-likelihood score obtained from the data for each of the players. We evaluate the statistical significance of having a better log-likelihood score in the model experiments compared to the control experiments. We perform the Wilcoxon signed-rank test in each career to determine the statistical significance. Considering a significance level of 0.05, we observe that, in 49.6% of Test careers and in 46.8% of ODI careers, the log-likelihood prediction score in the original sequences is significantly higher than the median log-likelihood prediction score in the control experiments. This leads us to conclude that the probability of falsely accepting the null hypothesis (that the control experiments perform equally well) is $< 10^{ - 6}$ (using a binomial probability distribution with success rate 0.05 of a false test result) in both cases. This result is sufficient to support the predictive power of our model. Furthermore, our model performs better than standard techniques like ARIMA and autocorrelation measures^20,21 (please refer to the SM).

We then compare the branching ratios (see Eq. (6)) of the performance time obtained from the data and from null shuffling for each player to quantify the hot hand effect. We perform the Wilcoxon signed-rank test to determine the statistical significance. We observe that in 56.8% of Test careers and in 53.7% of ODI careers, the branching ratio of the original performance time is significantly higher than the median branching ratio in the null performance time (significance level = 0.05). These results suggest a significant presence of hot hands in the players’ careers, as the probability for the absence of hot hands is $< 10^{ - 6}$ (using a binomial probability distribution with success rate 0.05 of a false test result).

Hot team hands

We repeat the above analysis to predict and quantify the team performances (sum of all individual performances in a game) (please see SM for more details). We take the first 80% of the team performances as the training set and validate the model on the next 20%. Using the Wilcoxon signed-rank test with significance level 0.05, we observe that in only 30% of ODI teams and 20% of Test teams are the log-likelihood scores in the model experiments significantly better than in the control experiments. These results suggest a significant reduction in predictability (~ 50% reduction) compared to the predictability of individual performances (please see SM for more details). Further, the probability of falsely accepting the null hypothesis (that the control experiments perform better) increases to $\sim 10^{ - 2}$ and $\sim 10^{ - 1}$, respectively (using a binomial probability distribution with success rate 0.05 of a false test result). The absence of reliable prediction in the above results suggests the absence of exploitable self-excited patterns in team performance.

Hot winning hands

We investigate the presence of hot hands in the team performances by going through the complete history of games played by each team and analyzing the winning streaks (i.e., the number of continuous wins without losing a single game in between). We note down the lengths of the winning streaks and the corresponding frequencies of occurrence of such streaks in each team’s playing history.

Then, we construct a statistical ensemble of possible performance trajectories. We randomly shuffle the original performance sequences to generate 1000 synthetic performance trajectories. Using this statistical ensemble, we evaluate the null probability distribution for the joint occurrence of streaks of length n and of corresponding frequency f. We use this probability distribution for estimating the p values for the observed events. We define the p values $p( n )$ and $p( {n_{f} } )$ according to

$$\begin{array}{*{20}c} {p\left( n \right) = P\left( {n_{i} \ge n} \right), \quad p\left( {n_{f} } \right) = P\left( {n_{i} \ge n|f} \right)} \\ \end{array}$$

(8)

which respectively represent the p value for observation of streaks with length n and streaks with length n conditional on frequency f. To avoid the problem of multiple hypothesis testing⁶¹, because of simultaneous consideration of the multiple individual tests, we correct the error rates of individual tests using multiple hypothesis testing methods^{62,63,64,65,66}. We note down the results from the methods^{62,63,64,65,66} and identify the extreme events (see supporting tables for multiple hypothesis testing in SM).
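The streak counting and the shuffled null ensemble can be sketched as follows. The sketch treats every game as a binary win / not-win outcome, so draws and ties break a streak; that treatment, the reading of $p(n)$ as "the fraction of shuffled trajectories containing at least one streak of length $\ge n$", and the example record are all assumptions of this sketch. The conditional p value $p(n_f)$ and the multiple-testing corrections are not implemented here.

```python
import random
from collections import Counter

def winning_streaks(results):
    """Map streak length -> frequency for a sequence of outcomes (truthy = win)."""
    counts = Counter()
    run = 0
    for won in results:
        if won:
            run += 1
        else:
            if run:
                counts[run] += 1
            run = 0
    if run:  # close a streak that runs to the end of the record
        counts[run] += 1
    return counts

def p_value_length(results, n, n_shuffles=1000, seed=0):
    """Empirical p(n): share of shuffled records with a winning streak of length >= n."""
    rng = random.Random(seed)
    seq = list(results)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(seq)
        if any(length >= n for length in winning_streaks(seq)):
            hits += 1
    return hits / n_shuffles

# Hypothetical team record: 1 = win, 0 = anything else.
record = [1, 1, 0, 1, 0, 1, 1, 1, 0, 0, 1, 0]
print(winning_streaks(record))
print(p_value_length(record, n=3))
```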

Figure 5 presents the position of the realized winning streaks, along with the null distribution of the winning streaks, for the 10 teams in the ODI format (top panel) and in the Test format (bottom panel). The red stars in the figure mark several highly improbable events, i.e., events for which one or both of $p\left( n \right)$ and $p\left( {n_{f} } \right)$ are significant at the 0.05 significance level after multiple testing. A large number of white stars indicate probable events, i.e., events for which neither $p\left( n \right)$ nor $p\left( {n_{f} } \right)$ is significant. In the figure, we present the $p\left( n \right)$ and $p\left( {n_{f} } \right)$ values for the events that pass the multiple hypothesis tests.

We observe that 5 out of 98 (5.1%) streaks in ODI cricket are significantly long, considering both their length (n) and frequency (f). In Test cricket, 6 out of 73 (8%) streaks considering the length and 5 out of 73 (7%) considering the frequency are statistically significant. Because of the considered significance level, we expect an error rate of 0.05 in each individual test. In total, we tested 98 possible streaks in ODI cricket and 73 streaks in Test cricket. The binomial probability of observing exactly 5 hot hands in ODI cricket is 0.18, and of observing more than 5 hot hands is 0.36. For the Test format, the probabilities of observing exactly 5 and exactly 6 hot hands are 0.14 and 0.08, and the probabilities of observing more than 5 and more than 6 are 0.15 and 0.07, respectively. Since these numbers of significant streaks are compatible with false positives alone, we conclude that we do not observe any hot hand effect in the winning streaks of teams in either ODI or Test cricket. The length of winning streaks is an important variable to consider while investigating hot hands. Both the belief and the behavior of performers are sensitive to decision frames that they derive from these streak lengths⁶⁷. This can inform coaches on the importance of how to provide information to athletes.
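The binomial probabilities quoted above can be re-derived from the stated numbers of tested streaks (98 and 73) and the per-test false-positive rate of 0.05. The helper names below are assumptions of this sketch; only the Python standard library is used.

```python
from math import comb

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def binom_sf(k, n, p):
    """P(X > k) for X ~ Binomial(n, p)."""
    return 1.0 - sum(binom_pmf(i, n, p) for i in range(k + 1))

alpha = 0.05  # false-positive rate of each individual test
for label, n_streaks, counts in (("ODI", 98, (5,)), ("Test", 73, (5, 6))):
    for k in counts:
        print(f"{label}: P(X = {k}) = {binom_pmf(k, n_streaks, alpha):.3f}, "
              f"P(X > {k}) = {binom_sf(k, n_streaks, alpha):.3f}")
```

The printed values can be compared with those quoted in the text; small differences in the second decimal may arise from rounding versus truncation.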

Conclusion

In this study, we have quantified the predictability and persistence of individual and collective performances of the teams in a team game. We introduced a number of novel statistical tools to study the hot hand effect in a new dataset on the game of cricket. We quantified and exploited the self-excited patterns in individual and team performances to better predict the future compared to traditional methods like ARIMA.

Our investigation has confirmed the presence of significant hot hands in individual performance. This is supported by the fact that the three highest performances in individual careers cluster in time, particularly when players partake in hundreds of games. Further, the shaded branching ratios in Fig. 4A,B are very rarely found in simulated null data, confirming the strength of the self-excitation that qualifies the presence of the hot-hand effect. The major finding of our work is that these self-excitation patterns can indeed be exploited for predicting future performances. The findings of this investigation complement those of earlier studies supporting the presence of hot hands in individual careers, while raising questions about the validity of those refuting the same.

Additionally, we have shown a significant reduction in the predictability of team performance compared with that of single players, suggesting that stochasticity dominates in determining team performance. While some predictability remains, the outcomes of games cannot be predicted, nor do they cluster in time. This leads us to suggest the somewhat paradoxical conclusion that ‘Cricket is a game of skill for individuals and a game of chance for the teams.’

Our study showed that, while an individual can consistently deal with the environmental systemic stochasticity, it is difficult for the team to perform equally well. These results thus open the door to future research on the impact of group size on the predictability and consistency of performance.

Furthermore, the present study established a quantitative framework for detecting and predicting performance in individual careers. This approach will prove useful in expanding our understanding of the predictability of success in individual careers. This paper contributes to recent historiographical debates concerning the presence of hot hands in the sequence of successes in individual performances. Further work is needed to establish whether the presented methodology for predicting performance can be improved for commercial use and financial gain by exploiting the presence of self-excited patterns in individual careers. The findings of this study have a number of important implications for future research on quantifying self-excited performance patterns in the study of human behavior and on the design of algorithms for predicting success.

Limitations

Our analysis has been performed on players with more than 30 games. Such a minimum sample size is required to accurately fit the Hawkes process to data, as shown in the SM with the measures of dispersion. We acknowledge that this could have influenced the overall result, as we cannot infer the existence or absence of hot hands in shorter careers. Additionally, the Hawkes point process used as a representation of the performance sequence assumes a constant background rate of new events. Thus, our methodology does not account for possible seasonal variation of performance within a player's career. A further study could improve the methodology by considering the temporal variation of the background rate, using the methods developed in ref. ⁶⁸ to account for possible complex seasonality effects. In this study, we analyzed the hot hand effect within the time frame of individual games. A natural extension of our work would be to analyze alternative time frames, such as different sections of a game (half-time, set), a half-season, a season or multiple seasons, to investigate the existence of hot hands.
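
The constant-background assumption can be made explicit in generic Hawkes notation. The symbols below (background rate μ, memory kernel φ, event times t_i, branching ratio n) are assumptions chosen for illustration and may differ from the notation used in the main text and the SM. The first line is the standard conditional intensity with a constant background rate, together with the branching ratio. The second line shows the kind of extension discussed above, in which the background rate is allowed to vary in time.

```latex
\begin{align}
\lambda(t) &= \mu + \sum_{t_i < t} \phi(t - t_i),
\qquad n = \int_0^{\infty} \phi(s)\,\mathrm{d}s, \\
\lambda(t) &= \mu(t) + \sum_{t_i < t} \phi(t - t_i).
\end{align}
```

If seasonal variation is present but μ is held constant, part of that variation may be attributed to the self-excitation term, which is why a time-dependent μ(t) is a natural refinement.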

Data availability

The datasets used in this study are publicly available at https://www.espncricinfo.com and http://howstat.com/. All methods were carried out in accordance with relevant guidelines and regulations. All data, codes, and materials used in the analysis will be made available.

References

Favre, M. & Sornette, D. Strong gender differences in reproductive success variance, and the times to the most recent common ancestors. J. Theor. Biol. 310, 43–54 (2012).
Fraiberger, S. P., Sinatra, R., Resch, M., Riedl, C. & Barabási, A. L. Quantifying reputation and success in art. Science 362, 825–829 (2018).
Sinatra, R., Wang, D., Deville, P., Song, C. & Barabási, A. L. Quantifying the evolution of individual scientific impact. Science 354, aaf5239 (2016).
Deville, P. et al. Career on the move: Geography, stratification, and scientific impact. Sci. Rep. 4, 4770 (2014).
Berger, J. & Pope, D. Can losing lead to winning? Manag. Sci. 57, 817–827 (2011).
Way, S. F., Morgan, A. C., Clauset, A. & Larremore, D. B. The misleading narrative of the canonical faculty productivity trajectory. Proc. Natl. Acad. Sci. 114, E9216–E9223 (2017).
Clauset, A., Arbesman, S. & Larremore, D. B. Systematic inequality and hierarchy in faculty hiring networks. Sci. Adv. 1, e1400005 (2015).
Liu, L. et al. Hot streaks in artistic, cultural, and scientific careers. Nature 559, 396–399 (2018).
Williams, O. E., Lacasa, L. & Latora, V. Quantifying and predicting success in show business. Nat. Commun. 10, 2256 (2019).
Mauboussin, M. J. The Success Equation: Untangling Skill and Luck in Business, Sports, and Investing (Harvard Business Press, 2012).
Carron, A. V., Bray, S. R. & Eys, M. A. Team cohesion and team success in sport. J. Sports Sci. 20, 119–126 (2002).
Wuchty, S., Jones, B. F. & Uzzi, B. The increasing dominance of teams in production of knowledge. Science 316, 1036–1039 (2007).
Cooke, N. J. et al. Enhancing the Effectiveness of Team Science (National Academies Press, 2015).
Wu, L., Wang, D. & Evans, J. A. Large teams develop and small teams disrupt science and technology. Nature 566, 378–382 (2019).
Larivière, V., Gingras, Y., Sugimoto, C. R. & Tsou, A. Team size matters: Collaboration and scientific impact since 1900. J. Assoc. Inf. Sci. Technol. 66, 1323–1332 (2015).
Mukherjee, S. Quantifying individual performance in Cricket—A network analysis of Batsmen and Bowlers. Phys. A Stat. Mech. Its Appl. 393, 624–637 (2014).
Mukherjee, S. Identifying the greatest team and captain—A complex network approach to cricket matches. Phys. A Stat. Mech. Its Appl. 391, 6066–6076 (2012).
Gilovich, T., Vallone, R. & Tversky, A. The hot hand in basketball: On the misperception of random sequences. Cogn. Psychol. 17, 295–314 (1985).
Kahneman, D. & Tversky, A. On the psychology of prediction. Psychol. Rev. 80, 237 (1973).
Hales, S. D. An epistemologist looks at the hot hand in sports. J. Philos. Sport 26, 79–87 (2012).
Bar-Eli, M., Avugos, S. & Raab, M. Twenty years of “hot hand” research: Review and critique. Psychol. Sport Exerc. 7, 525–553 (2006).
Tversky, A. & Kahneman, D. Belief in the law of small numbers. A Handb. Data Anal. Behav. Sci. 1, 341 (2014).
Kahneman, D. & Riepe, M. W. Aspects of investor psychology. J. Portf. Manag. 24, 52 (1998).
Miller, J. B. & Sanjurjo, A. Surprised by the hot hand fallacy? A truth in the law of small numbers. Econometrica 86, 2019–2047 (2018).
Koehler, J. J. & Conley, C. A. The “hot hand” myth in professional basketball. J. Sport Exerc. Psychol. 25, 253–259 (2003).
Hendricks, D., Patel, J. & Zeckhauser, R. Hot hands in mutual funds: Short-run persistence of relative performance, 1974–1988. J. Finance 48, 93–130 (1993).
Sornette, D., Wheatley, S. & Cauwels, P. The fair reward problem: The illusion of success and how to solve it. Adv. Complex Syst. 22, 1950005 (2019).
Roney, C. J. R. & Trick, L. M. Sympathetic magic and perceptions of randomness: The hot hand versus the gambler’s fallacy. Think. Reason. 15, 197–210 (2009).
Fama, E. F. & French, K. R. Luck versus skill in the cross-section of mutual fund returns. J. Finance 65, 1915–1947 (2010).
Hirshleifer, D. Investor psychology and asset pricing. J. Finance 56, 1533–1597 (2001).
Carhart, M. M. On persistence in mutual fund performance. J. Finance 52, 57–82 (1997).
Gigerenzer, G. & Brighton, H. Homo heuristicus: Why biased minds make better inferences. Top. Cogn. Sci. 1, 107–143 (2009).
Merton, R. K. The Matthew effect in science. Science 159, 56–62 (1968).
Lazer, D. et al. Computational social science. Science 323, 721–723 (2009).
Iacopini, I., Milojević, S. & Latora, V. Network dynamics of innovation processes. Phys. Rev. Lett. 120, 48301 (2018).
Heatherton, T. et al. Losing Control: How and Why People Fail at Self-regulation (Academic Press Inc, 1994).
Csapo, P. & Raab, M. “Hand down, Man down.” Analysis of defensive adjustments in response to the hot hand in basketball using Novel defense metrics. PLoS ONE 9, e114184 (2014).
Clauset, A., Larremore, D. B. & Sinatra, R. Data-driven predictions in the science of science. Science 355, 477–480 (2017).
Bol, T., de Vaan, M. & van de Rijt, A. The Matthew effect in science funding. Proc. Natl. Acad. Sci. 115, 4887–4890 (2018).
Raab, M., Gula, B. & Gigerenzer, G. The hot hand exists in volleyball and is used for allocation decisions. J. Exp. Psychol. Appl. 18, 81–94 (2012).
Burns, B. D. Heuristics as beliefs and as behaviors: The adaptiveness of the “hot hand”. Cogn. Psychol. 48, 295–331 (2004).
Link, D. & Raab, M. Experts use base rates in real-world sequential decisions. Psychon. Bull. Rev. 5, 1–8 (2021).
Attali, Y. Perceived hotness affects behavior of basketball players and coaches. Psychol. Sci. 24, 1151–1156 (2013).
Arkes, J. Revisiting the hot hand theory with free throw data in a multivariate framework. J. Quant. Anal. Sport. 6, 2 (2010).
Hawkes, A. G. Spectra of some self-exciting and mutually exciting point processes. Biometrika 58, 83–90 (1971).
Crane, R. & Sornette, D. Robust dynamic classes revealed by measuring the response function of a social system. Proc. Natl. Acad. Sci. USA 105, 15649–15653 (2008).
O’Brien, J. D., Aleta, A., Moreno, Y. & Gleeson, J. P. Quantifying uncertainty in a predictive model for popularity dynamics. Phys. Rev. E 101, 62311 (2020).
Medvedev, A. N., Delvenne, J.-C. & Lambiotte, R. Modelling structure and predicting dynamics of discussion threads in online boards. J. Complex Networks 7, 67–82 (2019).
Filimonov, V. & Sornette, D. Quantifying reflexivity in financial markets: Toward a prediction of flash crashes. Phys. Rev. E 85, 56108 (2012).
Lewis, E. & Mohler, G. A nonparametric EM algorithm for multiscale Hawkes processes. J. Nonparametr. Stat. 1, 1–20 (2011).
Filimonov, V. & Sornette, D. Apparent criticality and calibration issues in the Hawkes self-excited point process model: Application to high-frequency financial data. Quant. Financ. 15, 1293–1314 (2015).
Shcherbakov, R., Zhuang, J., Zöller, G. & Ogata, Y. Forecasting the magnitude of the largest expected earthquake. Nat. Commun. 10, 1–11 (2019).
Nandan, S., Ouillon, G., Wiemer, S. & Sornette, D. Objective estimation of spatially variable parameters of epidemic type aftershock sequence model: Application to California. J. Geophys. Res. Solid Earth 122, 5118–5143 (2017).
Nandan, S., Ouillon, G., Sornette, D. & Wiemer, S. Forecasting the rates of future aftershocks of all generations is essential to develop better earthquake forecast models. J. Geophys. Res. Solid Earth 124, 8404–8425 (2019).
Sornette, D. & Helmstetter, A. Endogenous versus exogenous shocks in systems with memory. Phys. A Stat. Mech. Its Appl. 318, 577–591 (2003).
Helmstetter, A., Sornette, D. & Grasso, J.-R. Mainshocks are aftershocks of conditional foreshocks: How do foreshock statistical properties emerge from aftershock laws. J. Geophys. Res. Solid Earth 108, 2046. https://doi.org/10.1029/2002JB001991 (2003).
Jagielski, M., Kutner, R. & Sornette, D. Theory of earthquakes interevent times applied to financial markets. Phys. A Stat. Mech. Its Appl. 483, 68–73 (2017).
Filimonov, V. & Sornette, D. Spurious trend switching phenomena in financial markets. Eur. Phys. J. B 85, 155 (2012).
Sornette, D. & Utkin, S. Limits of declustering methods for disentangling exogenous from endogenous events in time series with foreshocks, main shocks, and aftershocks. Phys. Rev. E 79, 61110 (2009).
Helmstetter, A. & Sornette, D. Importance of direct and indirect triggered seismicity in the ETAS model of seismicity. Geophys. Res. Lett. https://doi.org/10.1029/2003GL017670 (2003).
Fiévet, L. & Sornette, D. Decision trees unearth return sign predictability in the S&P 500. Quant. Financ. 18, 1797–1814 (2018).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57, 289–300 (1995).
Hochberg, Y. A sharper Bonferroni procedure for multiple tests of significance. Biometrika 75, 800–802 (1988).
Holm, S. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 5, 65–70 (1979).
Šidák, Z. Rectangular confidence regions for the means of multivariate normal distributions. J. Am. Stat. Assoc. 62, 626–633 (1967).
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. 100, 9440–9445 (2003).
Raab, M. & MacMahon, C. Does framing the hot hand belief change decision-making behavior in volleyball? Res. Q. Exerc. Sport 86, 152–162 (2015).
Wehrli, A., Wheatley, S. & Sornette, D. Scale-, time- and asset-dependence of Hawkes process estimates on high frequency price changes. Quant. Financ. 21, 729–752 (2021).

Acknowledgements

We thank Ananya Acharya, Ali Ayoub, Giuseppe-Maria Ferro, Jan-Christian Gerlach and Devendra Shintre for many enlightening discussions during the preparation of the manuscript.

Author information

Authors and Affiliations

Connection Science, Massachusetts Institute of Technology, Cambridge, USA
Sumit Kumar Ram
Department of Management, Technology and Economics, ETH Zürich, Scheuchzerstrasse 7, 8092, Zurich, Switzerland
Sumit Kumar Ram & Didier Sornette
Swiss Seismological Service, ETH Zürich, Sonneggstrasse 5, 8092, Zurich, Switzerland
Shyam Nandan
Institute of Risk Analysis, Prediction and Management (Risks-X), Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology (SUSTech), Shenzhen, China
Didier Sornette

Authors

Sumit Kumar Ram
Shyam Nandan
Didier Sornette

Contributions

S.R. and D.S. designed the experiments. S.R. performed the experiments. S.N. added analytical reagents. S.R., S.N., D.S. wrote the manuscript.

Corresponding authors

Correspondence to Sumit Kumar Ram or Didier Sornette.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ram, S.K., Nandan, S. & Sornette, D. Significant hot hand effect in the game of cricket. Sci Rep 12, 11663 (2022). https://doi.org/10.1038/s41598-022-14980-7

Received: 08 November 2021
Accepted: 16 May 2022
Published: 08 July 2022
DOI: https://doi.org/10.1038/s41598-022-14980-7