Impact of the Euro 2020 championship on the spread of COVID-19

Dehning, Jonas; Mohr, Sebastian B.; Contreras, Sebastian; Dönges, Philipp; Iftekhar, Emil N.; Schulz, Oliver; Bechtle, Philip; Priesemann, Viola

doi:10.1038/s41467-022-35512-x

Download PDF

Article
Open access
Published: 18 January 2023

Impact of the Euro 2020 championship on the spread of COVID-19

Nature Communications volume 14, Article number: 122 (2023) Cite this article

7965 Accesses
3 Citations
301 Altmetric
Metrics details

Subjects

Abstract

Large-scale events like the UEFA Euro 2020 football (soccer) championship offer a unique opportunity to quantify the impact of gatherings on the spread of COVID-19, as the number and dates of matches played by participating countries resembles a randomized study. Using Bayesian modeling and the gender imbalance in COVID-19 data, we attribute 840,000 (95% CI: [0.39M, 1.26M]) COVID-19 cases across 12 countries to the championship. The impact depends non-linearly on the initial incidence, the reproduction number R, and the number of matches played. The strongest effects are seen in Scotland and England, where as much as 10,000 primary cases per million inhabitants occur from championship-related gatherings. The average match-induced increase in R was 0.46 [0.18, 0.75] on match days, but important matches caused an increase as large as +3. Altogether, our results provide quantitative insights that help judge and mitigate the impact of large-scale events on pandemic spread.

Early pandemic COVID-19 case growth rates increase with city size

Article Open access 28 June 2021

Effects of government policies on the spread of COVID-19 worldwide

Article Open access 14 October 2021

Masks and distancing during COVID-19: a causal framework for imputing value to public-health interventions

Article Open access 04 March 2021

Introduction

Passion for competitive team sports is widespread worldwide. However, the tradition of watching and celebrating popular matches together may pose a danger to coronavirus disease 2019 (COVID-19) mitigation, especially in large gatherings and crowded indoor settings (see, e.g., refs. 1,2,3,4,5,6). Interestingly, sports events taking place under substantial contact restrictions had only a minor effect on COVID-19 transmission^7,8,9,10,11. However, large events with massive media coverage, stadium attendance, increased travel, and viewing parties can play a major role in the spread of COVID-19—especially if taking place in settings with few COVID-19-related restrictions. This was the case for the UEFA Euro 2020 Football Championship (Euro 2020 in short), staged from June 11 to July 11, 2021. While stadium attendance might only have a minor effect^12,13,14, it increases TV viewer engagement^15,16,17, and encourages additional social gatherings¹⁸. These phenomena and previous observational analyses¹⁹ suggest that the Euro 2020’s impact may have been considerable. Therefore, we used this championship as a case study to quantify the impact of large events on the spread of COVID-19. Counting with quantitative insights on the impact of these events allows policymakers to determine the set of interventions required to mitigate it.

Two facts make the Euro 2020 especially suitable for the quantification. First, the Euro 2020 resembles a randomized study across countries: The time-points of the matches in a country do not depend on the state of the pandemic in that country and how far a team advances in the championship has a random component as well²⁰. This independence between the time-points of the match and the COVID-19 incidence allows quantifying the effect of football-related social gatherings without classical biasing effects. This is advantageous compared to classical inference studies quantifying the impact of non-pharmaceutical interventions (NPIs) on COVID-19 where implementing NPIs is a typical reaction to growing case numbers^21,22,23. Second, the attendance at match-related events, and thus the cases associated with each match, is expected to show a gender imbalance²⁴. This was confirmed by news outlets and early studies^25,26,27,28. Hence, the gender imbalance presents a unique opportunity to disentangle the impact of the matches from other effects on pathogen transmission rates.

Here we build a Bayesian model to quantify the effect large-scale sports events on the spread of COVID-19, using the Euro 2020 as case study. In the following, we use “case” to refer to a confirmed case of a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in a human and “case numbers” to refer to the number of such cases. Not all infections are detected and represented in the cases and cases come with a delay after the actual infection. Our model simulates COVID-19 spread in each country using a discrete renewal process^22,29 for each gender separately, such that the effect of matches can be assessed through the gender imbalance in case numbers. This is defined as “(male incidence − female incidence)/total incidence”, and through the temporal association of cases to match dates of the countries’ teams. Regarding the expected gender imbalance at football-related gatherings, we chose a prior value of 33% (95% percentiles [18%, 51%]) female participants, which is more balanced than the values reported for national leagues (about 20%)²⁴. However, this agrees with the expected homogeneous and broad media attention of events like the Euro 2020. For the effective reproduction number R_eff we distinguish three additive contributions; the base, NPI-, and behavior-dependent reproduction number R_base, a match-induced boost on it ΔR_football, and a noise term ΔR_noise, such that R_eff = R_base + ΔR_football + ΔR_noise. We assume R_base to vary smoothly over time, while the effect of single matches ΔR_football is concentrated on one day and allows for a gender imbalance. The term ΔR_noise allows the model to vary the relative reproduction number for each gender independent of the football events smoothly over time. We analyzed data from all participating countries in the Euro 2020 that publish daily gender-resolved case numbers (n = 12): England, the Czech Republic, Italy, Scotland, Spain, Germany, France, Slovakia, Austria, Belgium, Portugal, and the Netherlands (ordered by resulting effect size). We retrieved datasets directly from governmental institutions or the COVerAGE-DB³⁰. See Supplementary Section S1 for a list of data sources. Our analyses were carried out following FAIR³¹ principles; all code, including generated datasets, are publicly available (https://github.com/Priesemann-Group/covid19_soccer).

Results

The main impact arises from the subsequent infection chains

We quantified the impact of the Euro 2020 matches on the reproduction number for the 12 analyzed countries (Fig. 1a) and for every single match (Supplementary Fig. S8). On average, a match increases the reproduction number R by 0.46 (95% CI [0.18, 0.75]) (Fig. 1a and Supplementary Table S4) for a single day. In other words, when a country participated in a match of the Euro 2020 championship, every individual of the country infected on average ΔR_match extra persons (see Supplementary Section S2 for more details). The cases resulting from these infections occurring at gatherings on the match days are referred to as primary cases.

**Fig. 1: Quantifying the impact of the Euro 2020 on COVID-19 spread.**

However, primary cases are only the tip of the iceberg; any of these cases can initiate a new infection chain, potentially spreading for weeks (see Supplementary Section S2 for more details). We included all subsequent cases until July 31, which is about two weeks after the final. As expected, subsequent cases outnumber the primary cases considerably at a ratio of about 4:1 on average (Supplementary Table S3). As a consequence, on average, only 3.2% [1.3%, 5.2%] of new cases are directly associated with the match-related social gatherings throughout that analysis period (Fig. 1b). This surge of subsequent cases highlights the long-lasting impact of potential single events on the COVID-19 spread (see Supplementary Table S2).

We find an increase in COVID-19 spread at the Euro 2020 matches in all countries we analyzed, except for the Netherlands. In the Netherlands, a “freedom day" coincided with the analysis period³² and was accompanied by the opposite gender imbalance compared to the football matches, thereby apparently inverted the football effect. Therefore, we exclude the Netherlands from general averages and correlation studies, but still display the results for completeness.

The primary and subsequent cases on average amounted to 2200 (95% CI [986, 3308]) cases per million inhabitants (Fig. 1c and Supplementary Table S2). This amounts to about 0.84 million (CI: [0.39M, 1.26M]) cases related to the Euro 2020 in the 12 countries (cf. Supplementary Table S3). With the case fatality risk of that period, this corresponds to about 1700 (CI: [762, 2470]) deaths, assuming that the primary and subsequent spread affects all ages equally. Most likely this is slightly overestimated since the age groups most at risk from COVID-19-related death are probably underrepresented in football-related social activities and thus more unlikely to be affected by primary championship-related infections. However, the overall number of primary and subsequent cases attributed to the championship is dominated by the subsequent cases, and the mixing of individuals of different age-groups then mitigates this bias. Individually, three countries, England, the Czech Republic, and Scotland showed a significant increase in COVID-19 incidence associated with the Euro 2020, and Spain and France show an increase at the one-sided 90% significance threshold. In other countries such as Germany, only a relatively small contribution of primary cases was associated with the Euro 2020 championship, and a small gender imbalance was observed. Low COVID-19 incidence during the championship or imprecise temporal association between infection and confirmation of it as a case can lead to a loss of sensitivity and hinder the detection of an effect, as can be seen from the large width of several posterior distributions (e.g., Italy and Slovakia, which had particularly low incidence).

The strongest effect is observed in England and Scotland

Overall, the effect of the Euro 2020 was quite diverse across the participating countries, ranging from almost no additional infections to up to 1% of the entire population being infected (i.e., from Portugal to England, Fig. 1). To illustrate this diversity, the comparison between England, Scotland, and the Czech Republic is particularly illustrative (Fig. 2). For all countries, we disentangled the cases that are considered to happen independently of the Euro 2020 (Fig. 2a, gray), the primary cases directly associated with gatherings on the days of the matches (red), and the subsequent infection chains started by the primary cases (orange; see Supplementary Figs. S24–S36 for all countries).

**Fig. 2: Example cases illustrate that the spread associated with the Euro 2020 can encompass a substantial fraction of the observed cases.**

England, being the runner-up of the championship and thus played the maximum number of matches, displays the strongest effect over the longest duration, with a substantial increase in reproduction number ΔR_match towards the last matches of the championship. This reflects the increasing popularity of the later matches, as e.g., quantified by the increase of the search term on Google (Supplementary Fig. S20). Scotland shows a particularly strong effect of a single match (Scotland vs England) staged in London during the group phase, with ΔR_match = 3.5 [2.9, 4.2] (Fig. 2c). This means that on average over the total Scottish population, every single person infected additional 3.5 persons at or around that single day. These are very strong effects. As a consequence, in Scotland the subsequent cases from the single match accounted for about 30% of the cases in the following weeks, illustrating the impact of such gatherings on public health.

Low overall incidence prevents large match-related spread

In the Czech Republic, the situation was different compared to England and Scotland, although the analyses point to similarly strong gatherings on the match days (i.e., large ΔR_match, Fig. 1a). However, because of the overall low incidence much fewer people were infected throughout the championship. The advantage of low incidence or fewer games is illustrated in two counterfactual scenarios. Even under the assumption that the Czech team had continued to the final and the population had gathered exactly like the English (i.e., showing the same ΔR_match in the matches they played), the total number of cases (per million) would have been more than 40 times lower than in England, owing to the lower base incidence and a lower base reproduction number (Fig. 2d). Assuming, as a counterfactual scenario, that England had dropped out in the group stage, the number of cases associated with the Euro 2020 would have been much lower. This suggests that both the success in the championship and the base incidence and behavior in a country influence the public health impact of such large-scale events.

To better understand the impact of the Euro 2020, we quantified the determinants of the spread across countries. From theory, we expect the absolute number of infections generated by Euro 2020 matches to depend non-linearly on a country’s base incidence N₀, which determines the probability to meet an infected person, and on the effective reproduction number prior to the championship ${R}_{{{{{{{{\rm{pre}}}}}}}}}$, as a gauge for the underlying infection dynamics generating the subsequent cases, which determines how strongly an additional infection spreads in the population. We can then define the potential for COVID-19 spread as the number of COVID-19 cases that would be expected during the time T a country is playing in the Euro 2020 (${N}_{0}\cdot {R}_{{{{{{{{\rm{pre}}}}}}}}}^{T/4}$), assuming a generation interval of 4 days. Indeed, we find a clear correlation between the observed and the expected incidence Fig. 3a, R² = 0.77 (95% CI [0.39,0.9]), p < 0.001, with a slope of 1.62 (95% CI [1.0, 2.26]). The strong significance of this correlation relies mainly on England and Scotland. However, the observed slope in an analysis without these two countries (0.76, 95% CI: [−1.46, 3.04]), while not significant at the 95% confidence level, is consistent with the findings including all countries. This is shown in Supplementary Fig. S7.

**Fig. 3: Which variables can predict the extent of the impact of Euro 2020 matches?**

Furthermore, quantifying correlations between N₀ and ${R}_{{{{{{{{\rm{pre}}}}}}}}}$ and the number of primary and subsequent cases related to the Euro 2020, we see a trend for each (Supplementary Fig. S6a, b). However, these are weak and statistically significant only for ${R}_{{{{{{{{\rm{pre}}}}}}}}}$. Altogether, our data suggest that a favorable pandemic situation (low ${R}_{{{{{{{{\rm{pre}}}}}}}}}$ and low N₀) before the gatherings, and low R_base during the period of gatherings jointly minimize the impact of the Euro 2020 on community contagion. A prerequisite for this is that the known preventive measures, such as reducing group size, imposing preventive measures, and minimizing the number of encounters remain encouraged.

Independently on the epidemic situation, Euro 2020’s effect might be influenced by people’s prudence and the team’s popularity and success during the championship. While we do not observe any obvious effect of local mobility as a measure of the prudence of people (Fig. 3b, R² = 0.06 (95% CI [0.00, 0.34]), p = 0.54, and Supplementary Fig. S4), the potential popularity —represented by the number of matches played and hosted by a given country—had a more notable trend (Supplementary Fig. S6c). Still, this correlation was not statistically significant. Moreover, we found no relationship between the effect size and the Oxford governmental response tracker³³ (Supplementary Fig. S5).

Discussion

Large international-scale sports events like the Euro 2020 Football Championship have the potential to gather people like no other type of event. Our quantitative insights on the impact of such gatherings on COVID-19 spread provide policymakers with tools to design the portfolio of interventions required for mitigation (using, e.g., results of refs. 22,23,34). Thereby, our quantification can support society in carefully weighing the positive social, psychological, and economic effects of mass events against the potential negative impact on public health³⁵. Our analysis attributes about 0.84 million (95% CI: [0.39M, 1.26M]) additional infected persons to the Euro 2020 championship. Assuming that the primary and subsequent spread affects all ages equally, this corresponds across the 12 countries to about 1700 (CI: [762, 2470]) deaths. Thus, the public health impact of the EURO 2020 was not negligible.

To prevent the impacts of these events, measures, such as promoting vaccination, enacting mask mandates, and limiting gathering sizes, can be helpful. Besides, the effectiveness of such interventions has already been quantified in different settings (e.g., refs. 22,23) so that policymakers can weigh them according to specific targets and priorities. Furthermore, focused measures that aim to mitigate disease spread in situ, such as testing campaigns and requiring COVID passports to attend sport-related gatherings and viewing parties, present themselves as helpful options. In addition, one could encourage participants of a large gathering to self-quarantine and test themselves afterward. Moreover, the championship distribution of matches every 4–5 days coincides with the mean incubation period and generation interval of COVID-19. This means that individuals who get infected watching a match can turn infectious by the subsequent while potentially pre-symptomatic. Such resonance effects between gathering intervals and incubation time can increase the spread considerably³⁴. It thus depends on the design of the championships, on the precautionary behavior of individuals, and on the basic infection situation how much large-scale events threaten public health, even if the reproduction number is transiently increased during these events.

Previous studies that evaluated the impact of sports events on the spread of COVID-19 and considered the spectator gatherings at match venues were not conclusive^7,8,36. This agrees with our results as we find the impact of hosting a match to be small to non-existent (Supplementary Fig. S9). However, location having little effect may well be specific to the Euro 2020, where matches were distributed across different countries. In the traditional settings of the UEFA European Football Championship or the FIFA World Cup, a single country or a small group of countries hosts the entire championship, and the championship is accompanied by elaborate supporting events, public viewing, and extensive travel of international guests. Hence, for other championships, such as, e.g., the FIFA World Cup 2022 in Qatar or the Euro 2024 in Germany, the impact of location might be considerably larger.

Our model accounts for slow changes in the transmission rates that are unrelated to football matches through the gender-independent reproduction number R_base. We find R_base to increase at least transiently during the championship in all 12 countries except for England and Portugal (Supplementary Figs. S24–S35). The above may suggest that our estimate of the match effect ΔR_match is conservative: The overall increase of COVID-19 spread might in part be attributed to R_base, but will not be incorrectly associated with football matches. Our results might further be biased if the incidence and the teams’ progression in the Euro 2020 are correlated. It is conceivable that high incidence would negatively correlate with team progression through ill or quarantined team members. However, there were only few such cases during the Euro 2020³⁷, and the correlation might also be positive: At higher case numbers the team might be more careful. Hence, the correlation is unclear and probably negligible.

The COVID-19 spread obviously depends on many factors. However, many of those parameters, such as the vaccination rate, the contact behavior or motivation to be tested, are changing slowly over time and hence can be absorbed into the slowly changing base reproduction rate R_base and the gender-asymmetric noise ΔR_noise; other parameters, like social and regional differences, age-structure or specific contact networks are expected to be constant over time and average out across a country. To further test the robustness of our model, we systematically varied the prior assumptions on the central model parameters, among them the delay (Supplementary Fig. S12), the width of the delay kernel (Supplementary Fig. S13), the change point interval (Supplementary Fig. S14), the generation interval (Supplementary Fig. S16) and a range of other priors (Supplementary Fig. S17). Furthermore, when using wider prior ranges for the gender imbalance, football-related COVID-19 cases remain unchanged but the uncertainty increases (Supplementary Fig. S15), thus validating our choice. Even for the case of prior symmetric gender imbalance assumptions, the posterior distribution of the female participation converges for the three most significant countries to median values between 20 and 45%. As last cross-check, we made sure that we found no effect when shifting the match dates by 2 weeks relative to the case numbers (Supplementary Fig. S10) nor by shifting match dates outside the championship range, by more than ±30 days (Supplementary Fig. S11).

Besides quantifying the impact of matches on the reproduction number, our methodology allowed us to estimate the delay between infections and confirmation of positive tests D without a requirement to identify the source of each infection (Supplementary Fig. S19). Our estimates for D in the participating countries were around 3-5 days (England: 4.5 days (95% CI [4.3, 5]), Scotland: 3.5 days (95% CI [3.3, 3.8]), Supplementary Figs. S24 and S33g and Supplementary Table S4). This agrees with available literature and is an encouraging signal for the feasibility of containing COVID-19 with test-trace-and-isolate^{38,39,40,41,42}. However, we expect that some individuals would actively get tested right after a match, thereby increasing the case finding and reporting rates. This can slightly affect our estimates for the delay distribution D and would require additional information to be corrected. Altogether, analyzing large-scale events with precise timing and substantial impact on the spread presents a promising, resource-efficient complement to classical quantification of delays.

Understanding how popular events with major in-person gatherings affect the spreading dynamics of COVID-19 can help us design better strategies to prevent new outbreaks. The Euro 2020 had a pronounced impact on the spread despite considerable awareness of the risks of COVID-19. We estimate that, e.g., about 48% of all cases in England until July 31 are related to the championship. In future, with declining awareness about COVID-19 but potentially better immunity, similar mass events, such as the football world cups, the Super Bowl, or the Olympics, will still unfold their impact. Acute, long-COVID-19 and post-COVID-19 will continue to pose a challenge to societies in the years to come. Our analysis suggest that a combination of low ${R}_{{{{{{{{\rm{pre}}}}}}}}}$ and low initial incidence at the beginning of the event, together with the known preventive measures, can strongly reduce the impact of these events on community contagion. Fulfilling these preconditions and increasing health education in the general population can substantially reduce the adverse health effects of future mass events.

Methods

To estimate the effect of the championship in different countries, we constructed a Bayesian model that uses the reported case numbers in 12 countries. Ethical approval was not sought as we only worked with openly available data. A graphical overview of the inference model is given in Fig. 4 and model variables, prior distributions, indices, country-dependent priors, and sampling performance are summarized in Tables 1, 2, 3, 4 and 5, respectively.

**Fig. 4: Model overview illustrating the relationship between the chosen prior distributions and the disease dynamics.**

Table 1 The intermediate variables of the model and their meaning

Full size table

Table 2 Prior distributions

Full size table

Table 3 Indices

Full size table

Table 4 Country-dependent priors on the delay structure

Full size table

Table 5 Maximal ${{{{{{{\mathcal{R}}}}}}}}$-hat values⁵¹

Full size table

Modeling the spreading dynamics, including gender imbalance

The model simulates the spread of COVID-19 in each country separately using a discrete renewal process^22,29,43. We infer a time-dependent effective reproduction number with gender interactions between genders g and ${g}^{{\prime} }$, ${R}_{{{{{{{{\rm{eff}}}}}}}},g,{g}^{{\prime} }}(t)$, for each country²¹.

Even though participation of women in football fan activity has increased in the last decades⁴⁴, football fans are still predominantly male²⁴. Hence one expects a higher infection probability at the days of the match for the male compared to the female population. Integrating this information into the model by using gender resolved case numbers, allows improved inference of the Euro 2020’s impact. In the following, genders “male” and “female” are denoted by the subscripts •_g=1 and •_g=2, respectively. Furthermore, we modeled the spreading dynamics of COVID-19 in each country separately.

In the discrete renewal process for disease dynamics of the respective country, we define for each gender g a susceptible pool S_g and an infected pool I_g. With N denoting the population size, the spreading dynamics with daily time resolution t reads as

$${I}_{g}(t)=\frac{{S}_{g}(t)}{N}\mathop{\sum }\limits_{{g}^{{\prime} }=1}^{2}{{{{{{{{\bf{R}}}}}}}}}_{{{{{{{{\rm{eff}}}}}}}},g,{g}^{{\prime} }}(t)\mathop{\sum }\limits_{\tau=0}^{10}{I}_{{g}^{{\prime} }}(t-1-\tau )\,G(\tau ),$$

(1)

$${S}_{g}(t)={S}_{g}(t-1)-{E}_{g}(t-1),$$

(2)

$$G(\tau )={{{{{{{\rm{Gamma}}}}}}}}(\tau ;\mu=4,\sigma=1.5).$$

(3)

We apply a discrete convolution in Eq. (1) to account for the latent period and subsequent infection (red box in Fig. 4). This generation interval (between infections) is modeled by a Gamma distribution G(τ) with a mean μ of four days and standard deviation σ of one and a half days. This is a little longer than the estimates of the generation interval of the Delta variant^45,46, but shorter than the estimated generation interval of the original strain^47,48. The impact of the choice of generation interval has negligible impact on our results (Supplementary Fig. S16). The infected compartment (commonly I) is not modeled explicitly as a separate compartment, but implicitly with the assumed generation interval kernel.

The effective spread in a given country is described by the country-dependent effective reproduction numbers for infections of individuals of gender g by individuals of gender ${g}^{{\prime} }$

$${{{{{{{{\bf{R}}}}}}}}}_{{{{{{{{\rm{eff}}}}}}}},g,{g}^{{\prime} }}(t)={R}_{{{{{{{{\rm{base}}}}}}}}}(t){C}_{{{{{{{{\rm{base}}}}}}}},g,{g}^{{\prime} }}+\Delta {R}_{{{{{{{{\rm{football}}}}}}}}}(t){C}_{{{{{{{{\rm{match}}}}}}}},g,{g}^{{\prime} }}+\Delta {R}_{{{{{{{{\rm{noise}}}}}}}}}(t){C}_{{{{{{{{\rm{noise}}}}}}}},g,{g}^{{\prime} }},$$

(4)

where ${C}_{{{{{{{{\rm{base}}}}}}}},g,{g}^{{\prime} }}$, ${C}_{{{{{{{{\rm{match}}}}}}}},g,{g}^{{\prime} }}$, and ${C}_{{{{{{{{\rm{noise}}}}}}}},g,{g}^{{\prime} }}$ describe the entries of the contact matrices C_base, C_match, C_noise respectively (purple boxes in Fig. 4).

This effective reproduction number is a function of three different reproduction numbers (yellow and orange boxes in Fig. 4):

1.
A slowly changing base reproduction number R_base (22) that has the same effect on both genders; besides incorporating the epidemiological information given by the basic reproduction number R₀, it represents the day-to-day contact behavior, including the impact of non-pharmaceutical interventions (NPIs), voluntary preventive measures, immunity status, etc.
2.
The reproduction number associated with social gatherings in the context of a football match R_match(t) (11); this number is only different from zero on days with matches that the respective country’s team participates in and it has a larger effect on men than on women.
3.
A slowly changing noise term ΔR_noise(t) (31), which subsumes all additional effects which might change the incidence ratio between males and females (gender imbalance).

The interaction between persons of specific genders is implemented by effective contact matrices C_match, C_base and C_noise. All three are assumed to be symmetric.

C_base describes non-football related contacts outside the context of Euro 2020 matches (left purple box in Fig. 4):

$${{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{base}}}}}}}}}=\left(\begin{array}{cc}1-{c}_{{{{{{{{\rm{off}}}}}}}}}&{c}_{{{{{{{{\rm{off}}}}}}}}}\\ {c}_{{{{{{{{\rm{off}}}}}}}}}&1-{c}_{{{{{{{{\rm{off}}}}}}}}}\end{array}\right),$$

(5)

$${{{{{{{\rm{with}}}}}}}}\,{c}_{{{{{{{{\rm{off}}}}}}}}} \sim {{{{{{{\rm{Beta}}}}}}}}(\alpha=8,\, \beta=8).$$

(6)

Here, we have the prior assumption that contacts between women, contacts between men, and contacts between women and men are equally probable. Hence, we chose the parameters for the Beta distribution such that c_off has a mean of 50% with a 2.5th and 97.5th percentile of [27%, 77%]. This prior is chosen such that it is rather uninformative. As shown in Supplementary Fig. S17, this and other priors of auxiliary parameters do not affect the parameter of interest if their width is varied within a factor of 2 up and down.

C_match describes the contact behavior in the context of the Euro 2020 football matches (right purple box in Fig. 4). Here, we assume as a prior that the female participation in football-related gatherings accounts for ≃ 33% (95% percentiles [18%,51%]) of the total participation. Hence, we get the following contact matrix

$${{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{match}}}}}}}},{{{{{{{\rm{unnorm}}}}}}}}.}=\left(\begin{array}{cc}{(1-{\omega }_{{{{{{{{\rm{gender}}}}}}}}})}^{2}&{\omega }_{{{{{{{{\rm{gender}}}}}}}}}(1-{\omega }_{{{{{{{{\rm{gender}}}}}}}}})\\ {\omega }_{{{{{{{{\rm{gender}}}}}}}}}(1-{\omega }_{{{{{{{{\rm{gender}}}}}}}}})&{\omega }_{{{{{{{{\rm{gender}}}}}}}}}^{2}\end{array}\right)$$

(7)

$${{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{match}}}}}}}}}=\frac{{{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{match}}}}}}}},{{{{{{{\rm{unnorm.}}}}}}}}}}{{\left|{{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{match}}}}}}}},{{{{{{{\rm{unnorm.}}}}}}}}}\cdot {\left(0.5,0.5\right)}^{T}\right|}_{2}}$$

(8)

$${\omega }_{{{{{{{{\rm{gender}}}}}}}}} \sim {{{{{{{\rm{Beta}}}}}}}}\left(\alpha=10,\, \beta=20\right).$$

(9)

The prior beta distribution of ω_gender is bounded between at 0 and 1 and with the parameter values of α = 10 and β = 20 has the expectation value of 1/3. The robustness of the choice of this parameter is explored in Supplementary Fig. S15. C_match is normalized such that for balanced case numbers (equal case numbers for men and women) and an additive reproduction number R_match = 1 will lead to a unitary increase of total case numbers. The reproduction number of women will therefore increase by 2ω_genderΔR_match(t) on match days whereas the one of men will increase by 2(1 − ω_gender)ΔR_match, assuming balanced case numbers beforehand.

C_noise describes the effect of an additional noise term, which changes gender balance without being related to football matches (middle purple box in Fig. 4). For simplicity, it is implemented as

$${{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{noise}}}}}}}}}=\left(\begin{array}{cc}1&0\\ 0&-1\end{array}\right),$$

(10)

whereby we center the diagonal elements such that the cases introduced by the noise term sum up to zero, i.e. ∑_i,jR_noise ⋅ C_noise,i,j = 0.

Football-related effect

Our aim is to quantify the number of cases (or equivalently the fraction of cases) associated with the Euro 2020, ${{{\Gamma }}}_{g}^{{{{{{{{\rm{Euro}}}}}}}}}$. To that end we assume that infections can occur at public or private football screenings in the two countries participating in the respective match m (parameterized by ΔR_match,m). Note that for the Euro 2020 not a single country, but a set of 11 countries hosted the matches. The participation of a team or the staging of a match in a country may have different effect sizes. Thus, we define the football related additive reproduction number as

$$\Delta {R}_{{{{{{{{\rm{football}}}}}}}}}(t)=\mathop{\sum}\limits_{m}\Delta {R}_{{{{{{{{\rm{match}}}}}}}},m}\cdot \delta ({t}_{m}-t).$$

(11)

We assume the effect of each match to only be effective in a small time window centered around the day of a match m, t_m (light orange box in Fig. 4). Thus, we apply an approximate delta function δ(t_m − t). To guarantee differentiability and hence better convergence of the model, we did not use a delta distribution but instead a narrow normal distribution centered around t_m, with a standard deviation of one day:

$$\delta (t)=\frac{1}{\sqrt{2\pi }}\exp \left(-\frac{{t}^{2}}{2}\right).$$

(12)

We distinguish between the effect size of each match m on the spread of COVID-19. For modeling the effect ΔR_match,m, associated with public or private football screenings in the home country, we introduce one base effect $\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}$ and a match specific offset Δα_m for a typical hierarchical modeling approach (dark orange box in Fig. 4). As prior we assume that the base effect $\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}$ is centered around zero, which means that in principle also a negative effect of the football matches can be inferred:

$$\Delta {R}_{{{{{{{{\rm{match}}}}}}}},m}={\alpha }_{{{{{{{{\rm{prior}}}}}}}},m}\left(\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}+\Delta {\alpha }_{m}\right)$$

(13)

$$\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}} \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,5\right)$$

(14)

$$\Delta {\alpha }_{m} \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,{\sigma }_{\alpha }\right)$$

(15)

$${\sigma }_{\alpha } \sim {{{{{{{\rm{HalfNormal}}}}}}}}\;\left(5\right).$$

(16)

α_prior,m is the m-th element of the vector that encodes the prior expectation of the effect of a match on the reproduction number. If a country participated in a match, the entry is 1 and otherwise 0. The robustness of the results with respect to the hyperprior σ_α is explored in Supplementary Fig. S17.

For Supplementary Fig. S9, we expand the model by including the effect of infections happening in stadiums and in the vicinity of it as well as during travel towards the venue of the match. In detail, we add to the football related additive reproduction number (Eq. (11)) an additive effect ΔR_stadium,m:

$$\Delta {R}_{{{{{{{{\rm{football}}}}}}}}}(t)=\mathop{\sum}\limits_{m}(\Delta {R}_{{{{{{{{\rm{match}}}}}}}},m}+\Delta {R}_{{{{{{{{\rm{stadium}}}}}}}},m})\cdot \delta ({t}_{m}-t).$$

(17)

Analogously to the gathering-related effect we apply the same hierarchy to the effect caused by hosting a match in the stadium – but change the prior of the day of the effect:

$$\Delta {R}_{{{{{{{{\rm{stadium}}}}}}}},m}={\beta }_{{{{{{{{\rm{prior}}}}}}}},m}\left(\Delta {R}_{{{{{{{{\rm{stadium}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}+\Delta {\beta }_{m}\right)$$

(18)

$$\Delta {R}_{{{{{{{{\rm{stadium}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}} \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,5\right)$$

(19)

$$\Delta {\beta }_{m} \sim {{{{{{{\mathcal{N}}}}}}}}(0,{\sigma }_{\beta })$$

(20)

$${\sigma }_{\beta } \sim {{{{{{{\rm{HalfNormal}}}}}}}}\left(5\right).$$

(21)

β_prior,m encodes whether or not a match was hosted by the respective country, i.e equates 1 if the match took place in the country and otherwise equates 0.

Non-football-related reproduction number

To account for effects not related to the football matches, e.g., non-pharmaceutical interventions, vaccinations, seasonality or variants, we introduce a slowly changing reproduction number R_base(t), which is identical for both genders and should map all other not specifically modeled gender independent effects (left yellow box in Fig. 4):

$${R}_{{{{{{{{\rm{base}}}}}}}}}(t)={R}_{0}\exp \left(\mathop{\sum}\limits_{n}{\gamma }_{n}(t)\right)$$

(22)

$${R}_{0} \sim {{{{{{{\rm{LogNormal}}}}}}}}\left(\mu=1,\sigma=1\right)$$

(23)

This base reproduction number is modeled as a superposition of logistic change points γ(t) every 10 days, which are parameterized by the transient length of the change points l, the date of the change point d and the effect of the change point Δγ_n. The subscripts n denotes the discrete enumeration of the change points:

$${\gamma }_{n}(t)=\frac{1}{1+{e}^{-4/{l}_{n}\cdot (t-{d}_{n})}}\cdot \Delta {\gamma }_{n}$$

(24)

$$\Delta {\gamma }_{n} \sim {{{{{{{\mathcal{N}}}}}}}}(0,{\sigma }_{\Delta \gamma })\,\forall n$$

(25)

$${\sigma }_{\Delta \gamma } \sim {{{{{{{\rm{HalfCauchy}}}}}}}}\left(0.5\right)$$

(26)

$${l}_{n}=\log \big(1+\exp ({l}_{n}^{{{{\dagger}}} })\big)$$

(27)

$${l}_{n}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\left(4,1\right)\,\forall n\,({{{{{\rm{unit}}}}}}\;{{{{{\rm{is}}}}}}\;{{{{{\rm{days}}}}}})$$

(28)

$${d}_{n}=2{7}^{{{{{{{{\rm{th}}}}}}}}}\;{{{{{{{\rm{May\;2021}}}}}}}}\;+10\cdot n+\Delta {d}_{n}\,{{{{{{{\rm{for}}}}}}}}\; n=0,\ldots,9$$

(29)

$$\Delta {d}_{n} \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,3.5\right)\,\forall n\,({{{{{\rm{unit}}}}}}\;{{{{{\rm{is}}}}}}\;{{{{{\rm{days}}}}}}).$$

(30)

The idea behind this parameterization is that Δγ_n models the change of R-value, which occurs at times d_n. These changes are then summed in Eq. (24). Change points that have not occurred yet at time t do not contribute in a significant way to the sum as the sigmoid function tends to zero for t < < d_n. The robustness of the results regarding the spacing of the change-points d_n is explored in Supplementary Fig. S14 and the robustness of the choice of the hyperprior σ_Δγ is explored in Supplementary Fig. S17.

Similarly, to account for small changes in the gender imbalance, the noise on the ratio between infections in men and women is modeled by a slowly varying reproduction number (middle yellow box in Fig. 4), parameterized by series of change points every 10 days:

$$\Delta {R}_{{{{{{{{\rm{noise}}}}}}}}}(t)=\Delta {R}_{0,{{{{{{{\rm{noise}}}}}}}}}+\bigg(\mathop{\sum}\limits_{n}{\tilde{\gamma }}_{n}(t)\bigg)$$

(31)

$$\Delta {R}_{0,{{{{{{{\rm{noise}}}}}}}}} \sim {{{{{{{\mathcal{N}}}}}}}}\left(\mu=0,\sigma=0.1\right)$$

(32)

$${\tilde{\gamma }}_{n}(t)=\frac{1}{1+{e}^{-4/{\tilde{l}}_{n}\cdot (t-{\tilde{d}}_{n})}}\cdot \Delta {\tilde{\gamma }}_{n}$$

(33)

$$\Delta {\tilde{\gamma }}_{n} \sim {{{{{{{\mathcal{N}}}}}}}}(0,{\sigma }_{\Delta \tilde{\gamma }})$$

(34)

$${\sigma }_{\Delta \tilde{\gamma }} \sim {{{{{{{\rm{HalfCauchy}}}}}}}}\left(0.2\right)$$

(35)

$${\tilde{l}}_{n}=\log \left(1+\exp ({\tilde{l}}_{n}^{{{{\dagger}}} })\right)$$

(36)

$${\tilde{l}}_{n}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\left(4,1\right)\,\forall n\,({{{{{\rm{unit}}}}}}\,{{{{{\rm{is}}}}}}\,{{{{{\rm{days}}}}}})$$

(37)

$${\tilde{d}}_{n}=2{7}^{{{{{{{{\rm{th}}}}}}}}}\,{{{{{{{\rm{May}}}}}}}}\,{{{{{{{\rm{2021}}}}}}}}+10\cdot n+\Delta {\tilde{d}}_{n}\,{{{{{{{\rm{for}}}}}}}}\,n=0,\ldots,9$$

(38)

$$\Delta {\tilde{d}}_{n} \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,3.5\right)\,\forall n\,({{{{{{{\rm{unit}}}}}}}}\,{{{{{{{\rm{is}}}}}}}}\,{{{{{{{\rm{days}}}}}}}}).$$

(39)

Delay

Modeling the delay between the time of infection and the reporting of it is an important part of the model (blue boxes in Fig. 4); it allows for a precise identification of changes in the infection dynamics because of football matches and the reported cases. We split the delay into two different parts: First we convolved the number of newly infected people with a kernel, which delays the cases between 4 and 7 days. Second, to account for delays that occur because of the weekly structure (some people might delay getting tested until Monday if they have symptoms on Saturday or Sunday), we added a variable fraction that delays cases depending on the day of the week.

Constant delay

To account for the latent period and an eventual apparition of symptoms we apply a discrete convolution, a Gamma kernel, to the infected pool (right blue box in Fig. 4). The prior delay distribution D is defined by incorporating knowledge about the country specific reporting structure: If the reported date corresponds to the moment of the sample collection (which is the case in England, Scotland and France) or if the reported date corresponds to the onset of symptoms (which is the case in the Netherlands), we assumed 4 days as the prior median of the delay between infection and case. If the reported date corresponds to the transmission of the case data to the authorities, we assumed 7 days as prior median of the delay. If we do not know what the published date corresponds to, we assumed a median ${\bar{D}}_{{{{{{{{\rm{country}}}}}}}}}$ of 5 days, with a larger prior standard deviation ${\sigma }_{\log \bar{D}}$ (see Table 4):

$${C}_{g}^{{{{\dagger}}} }\left(t\right)=\mathop{\sum }\limits_{\tau=1}^{T}{E}_{g}(t-\tau )\cdot {{{{{{{\rm{Gamma}}}}}}}}(\tau ;\mu=D,\sigma={\sigma }_{D})$$

(40)

$$D=\log \left({D}^{{{{\dagger}}} }\right)$$

(41)

$${D}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\big(\mu=\exp \big({\bar{D}}_{{{{{{{{\rm{country}}}}}}}}}\big),\sigma={\sigma }_{\log \bar{D}}\big)$$

(42)

$${\sigma }_{D} \sim {{{{{{{\mathcal{N}}}}}}}}\big(\mu=0.2\cdot {\bar{D}}_{{{{{{{{\rm{country}}}}}}}}},\sigma=0.08\cdot {\bar{D}}_{{{{{{{{\rm{country}}}}}}}}}\big).$$

(43)

Here, Gamma represents the delay kernel. We obtain a delayed number of infected persons ${C}_{g}^{{{{\dagger}}} }$ by delaying the newly infected number of persons I_g(t) of gender g from Eq. (1). The robustness of the choice of the width of the delay kernel σ_D is explored in Supplementary Fig. S17.

Weekday-dependent delay

Because of the different availability of testing resources during a week, we further delay a fraction of persons, depending on the day of the week (left blue box in Fig. 4). We model the fraction η_t of delayed tests on a day t in a recurrent fashion, meaning that if a certain fraction gets delayed on Saturday, these same individuals can still get delayed on Sunday (Eq. (44)). The fraction η_t is drawn separately for each individual day. However, the prior is the same for certain days of the week d (Eq. (45)): we assume that few tests get delayed on Tuesday, Wednesday, and Thursday, using a prior with mean 0.67% (Eq. (48)), whereas we assume that more tests might be delayed on Monday, Friday, Saturday and Sunday. Hence compared to ${C}_{g}^{{{{\dagger}}} }$, we obtain slightly more delayed numbers of cases ${\hat{C}}_{g}$, which now include a weekday-dependent delay:

$${\hat{C}}_{g}\left(t\right)=\left(1-{\eta }_{t}\right)\cdot \left({C}_{g}^{{{{\dagger}}} }\left(t\right)+{\eta }_{t-1}{\hat{C}}_{g}\left(t-1\right)\right)\;{{{{{{{\rm{with}}}}}}}}\;{\hat{C}}_{g}\left(0\right)={C}_{g}^{{{{\dagger}}} }\left(0\right)$$

(44)

$${\eta }_{t} \sim {{{{{{{\rm{Beta}}}}}}}}\left(\alpha=\frac{{r}_{d}}{e},\beta=\frac{1-{r}_{d}}{e}\right)\;{{{{{{{\rm{with}}}}}}}}\;d=\,{{{{{\rm{Monday}}}}}}\,,...,\,{{{{{\rm{Sunday}}}}}}\,$$

(45)

$${r}_{d}={{{{{{{\rm{sigmoid}}}}}}}}\left({r}_{d}^{{{{\dagger}}} }\right)$$

(46)

$${r}_{d}^{{{{\dagger}}} }={r}_{{{{{{{{\rm{base}}}}}}}},d}^{{{{\dagger}}} }+\Delta {r}_{d}^{{{{\dagger}}} }$$

(47)

$${r}_{{{{{{{{\rm{base}}}}}}}},d}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\left(-5,1\right)\;{{{{{{{\rm{for}}}}}}}}\;d=\,{{{{{\rm{Tuesday}}}}}},\,{{{{{\rm{Wednesday}}}}}},\,{{{{{\rm{Thursday}}}}}}$$

(48)

$${r}_{{{{{{{{\rm{base}}}}}}}},d}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\left(-3,2\right)\;{{{{{{{\rm{for}}}}}}}}\;d=\,{{{{{\rm{Friday}}}}}},\,{{{{{\rm{Saturday}}}}}},\,{{{{{\rm{Sunday}}}}}},\,{{{{{\rm{Monday}}}}}}$$

(49)

$$\Delta {r}_{d}^{{{{\dagger}}} } \sim {{{{{{{\mathcal{N}}}}}}}}\left(0,{\sigma }_{r}\right)$$

(50)

$${\sigma }_{r} \sim {{{{{{{\rm{HalfNormal}}}}}}}}\left(1\right)$$

(51)

$$e \sim {{{{{{{\rm{HalfCauchy}}}}}}}}\left(0.2\right).$$

(52)

The parameter r_d is defined such that it models the mean of the Beta distribution of Eq. (45), whereas e models the scale of the Beta distribution. r_d is then transformed to an unbounded space by the sigmoid $f\left(x\right)=\frac{1}{1+\exp \left(-x\right)}$ (Eq. (46)). This allows to define the hierarchical prior structure for the different weekdays. We chose the prior of ${r}_{{{{{{{{\rm{base}}}}}}}},d}^{{{{\dagger}}} }$ for Tuesday, Wednesday, and Thursday such that only a small fraction of cases are delayed during the week. The chosen prior in Eq. (48) corresponds to a 2.5th and 97.5th percentile of r_d of [0%; 5%]. For the other days (Friday, Saturday, Sunday, Monday), the chosen prior leaves a lot of freedom for inferring the delay. Equation (49) corresponds to a 2.5th and 97.5th percentile of r_d of [0%; 72%]. The robustness of the other priors σ_r and e is explored in Supplementary Fig. S17.

Likelihood

Next we want to define the goodness of fit of our model to the sample data. The likelihood of that is modeled by a Student’s t-distribution, which allows for some outliers because of its heavier tails compared to a Normal distribution (green box in Fig. 4). The error of the Student’s t-distribution is proportional to the square root of the number of cases, which corresponds to the scaling of the errors in a Poisson or Negative Binomial distribution:

$${C}_{g}(t) \sim {{{{{{{{\rm{StudentT}}}}}}}}}_{\nu \!=\! 4}\left(\mu={\hat{C}}_{g}(t),\sigma=\kappa \sqrt{{\hat{C}}_{g}(t)+1}\right)$$

(53)

$$\kappa \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\sigma=30).$$

(54)

Here C_g(t) is the measured number of cases in the population of gender g as reported by the respective health authorities, whereas ${\hat{C}}_{g}(t)$ is the modeled number of cases (Eq. (44)). The robustness of the prior κ is explored in Supplementary Fig. S17.

Average effect across countries

In order to calculate the mean effect size across countries (Fig. 1b, c), we average the individual effects of each country. To be consistent in our approach, we build an hierarchical Bayesian model accounting for the individual uncertainties of each country estimated from the width of the posterior distributions. As effect size, we use the fraction of primary cases associated with football matches during the championship. Then our estimated mean effect size ${\hat{I}}_{g}$ across all countries c (except the Netherlands) for the gender g is inferred with the following model:

$${\hat{I}}_{g} \sim {{{{{{{\rm{Normal}}}}}}}}(\mu=0,\sigma=2)\,\,\,{{{{{{{\rm{with}}}}}}}}\,\,g=\{{{{{{{{\rm{male}}}}}}}},\,{{{{{{{\rm{female}}}}}}}}\}$$

(55)

$${\tau }_{g} \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10)$$

(56)

$${I}_{c,g}^{{{{\dagger}}} } \sim {{{{{{{\rm{Normal}}}}}}}}(\mu={\hat{I}}_{g},\sigma={\tau }_{g})$$

(57)

$${\hat{\sigma }}_{c,g} \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10)$$

(58)

$${I}_{s,c,g} \sim {{{{{{{{\rm{StudentT}}}}}}}}}_{\nu=4}\left(\mu={I}_{c,g}^{{{{\dagger}}} },\sigma={\hat{\sigma }}_{c,g}\right).$$

(59)

The estimated effect size of each country (the fraction of primary cases) is denoted by ${I}_{c,g}^{{{{\dagger}}} }$ and the effect size of individual samples s from the posterior of the main model is denoted by I_s,c,g.

We applied a similar hierarchical model but without gender dimensions and with slightly different priors to calculate the average mean match effect $\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}$ (Fig. 1a). Here by reusing the same notation:

$$\hat{I} \sim {{{{{{{\rm{Normal}}}}}}}}(\mu=0,\sigma=10)$$

(60)

$$\tau \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10)$$

(61)

$${I}_{c}^{{{{\dagger}}} } \sim {{{{{{{\rm{Normal}}}}}}}}(\mu=\hat{I},\sigma=\tau )$$

(62)

$${\hat{\sigma }}_{c} \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10)$$

(63)

$$\Delta {R}_{{{{{{{{\rm{match}}}}}}}},c,s}^{{{{{{{{\rm{mean}}}}}}}}} \sim {{{{{{{{\rm{StudentT}}}}}}}}}_{\nu=4}\left(\mu={I}_{c}^{{{{\dagger}}} },\sigma={\hat{\sigma }}_{c}\right),$$

(64)

where $\Delta {R}_{{{{{{{{\rm{match}}}}}}}},c,s}^{{{{{{{{\rm{mean}}}}}}}}}$ are the posterior samples from the main model runs of the $\Delta {R}_{{{{{{{{\rm{match}}}}}}}}}^{{{{{{{{\rm{mean}}}}}}}}}$ variable.

Calculating the primary and subsequent cases

We compute the number of primary football related infected I_primary,g(t) as the number of infections happening at football related gathering. The percentage of primary cases f_g is then computed by dividing by the total number of infected I_g(t).

$${I}_{{{{{{{{\rm{primary}}}}}}}},g}(t)=\frac{S(t){R}_{{{{{{{{\rm{football}}}}}}}}}(t)}{N}\mathop{\sum}\limits_{{g}^{{\prime} }}{I}_{{g}^{{\prime} }}(t){{{{{{{{\bf{C}}}}}}}}}_{{{{{{{{\rm{football}}}}}}}},{g}^{{\prime} },g}$$

(65)

$${f}_{g}=\mathop{\sum}\limits_{t}\frac{{I}_{{{{{{{{\rm{primary}}}}}}}},g}(t)}{{I}_{g}(t)}\,t\in [11{{{{{{{\rm{th}}}}}}}}\,{{{{{{{\rm{June}}}}}}}},\,31{{{{{{{\rm{st}}}}}}}}\,{{{{{{{\rm{July}}}}}}}}]$$

(66)

To obtain the subsequent infected I_subsequent,g(t), we subtract infected obtained from a hypothetical scenario without football games I_none,g(t) from the total number of infected.

$${I}_{{{{{{{{\rm{subsequent}}}}}}}},g}={I}_{g}(t)-{I}_{{{{{{{{\rm{primary}}}}}}}},g}(t)-{I}_{{{{{{{{\rm{none}}}}}}}},g}(t)$$

(67)

Specific, we consider a counterfactual scenario, where we sample from our model leaving all inferred parameters the same expect for the football related reproduction number R_football,g(t), which we set to zero.

Sampling

The sampling was done using PyMC3⁴⁹. We use a NUTS sampler⁵⁰, which is a Hamiltonian Monte-Carlo sampler. As random initialization often leads to some chains getting stuck in local minima, we run 32 chains for 500 initialization steps and chose the 8 chains with the highest unnormalized posterior to continue tuning and sampling. We then let these chains tune for additional 2000 steps and draw 4000 samples. The maximum tree depth was set to 12.

The quality of the mixing was tested with the ${{{{{{{\mathcal{R}}}}}}}}$-hat measure⁵¹ (Table 5). The ${{{{{{{\mathcal{R}}}}}}}}$-hat value measures how well chains with different starting values mix; optimal are values near one. We measured twice: (1) for all variables and (2) for the subset of variables encoding the reproduction number. Variables modeling the reproduction number are the central part of our model (lower half of Fig. 4). As such, we are satisfied if the ${{{{{{{\mathcal{R}}}}}}}}$-hat values is sufficiently good for these variables, which it is (≤1.07). The high ${{{{{{{\mathcal{R}}}}}}}}$-hat when calculated over all variables is mostly due to the weekday-dependent delay, which we assume is not central to the results we are interested in.

Robustness tests

In the base model for each country, we only consider the matches in which the respective country participated. It is reasonable to ask whether the matches of foreign countries occurring in local stadium have an effect on the case numbers, caused by transmission in and around the stadium and related travel. To investigate this question we ran a model with an additional parameter (in-country effect) associating the case numbers to the in-country matches (Eq. (17)). In some countries the in-country effect parameter and the original fan gathering effect are covariant, as a large number of matches are played by the country at home, whereas in other countries the additional parameter had no significant effect (Supplementary Fig. S9).

We checked that the inferred fractions of football related cases are robust against changes in the priors of the width σ_D of the delay parameter D (see Supplementary Fig. S13) and the intervals of change points of R_base (Supplementary Fig. S14). The results are also, to a very large degree, robust against a more uninformative prior on the fraction of female participants in the fan activities dominating the additional transmission ω_gender (Supplementary Fig. S15). To reduce CO₂ emissions, we performed fewer runs for these robustness tests: We only ran the models for which the original posterior distributions might indicate that one could find a significant effect. Each country required eight cores for about 10 days to finish sampling.

In order to further test the robustness of the association between individual matches and infections, we varied the dates of the matches, i.e., shifted them forward and backward in time. The results for the twelve countries under investigation are shown in Supplementary Figs. S10 and S12. In the countries where sensitivity to a championship-related case surge exists, a stable association is obtained for shifts by up to 2 days. As shown for the examples of England and Scotland in Fig. S19, such a shift is compensated by the model by a complementary adjustment of the delay parameter D. For larger shifts, the model might associate other matches to the increase of cases, as matches took place approximately every 4 days.

Correlations

In order to calculate the correlation between the effect size and various explainable variables (Fig. 3 and Supplementary Figs. S4 and S6), we built a Bayesian regression model, using the previously computed posterior samples from the individual runs of each country. Let us denote the previously computed cumulative primary and subsequent cases related to the Euro 2020 by Y_s,c, for every sample s and analyzed country c, and the explainable variable from auxiliary data by X_c. We used a simple linear model to check for pairwise correlation between Y_s,c and X_c:

$${\hat{Y}}_{c}={\beta }_{0}+{\beta }_{1}{\hat{X}}_{c}$$

(68)

$${\beta }_{0} \sim {{{{{{{\rm{Normal}}}}}}}}(\mu=0,\sigma=10000)$$

(69)

$${\beta }_{1} \sim {{{{{{{\rm{Normal}}}}}}}}(\mu=0,\sigma=100000)$$

(70)

We used every sample s obtained from the main analysis to incorporate uncertainties on the variable Y_c from our prior results. The auxiliary data X_c might also have errors ϵ_c, which we model using a Normal distribution. Additionally, we allow our estimate for the effect size ${\hat{Y}}_{c}$ to have an error for each country c in a typical hierarchical manner and choose uninformative priors for the scale hyper-parameter τ. As prior we considered 10k a reasonable choice for the β parameter as our data X_c is normally in a range multiple magnitudes smaller:

$${\hat{X}}_{c} \sim {{{{{{{\rm{Normal}}}}}}}}(\mu={X}_{c},\sigma={\epsilon }_{c})\,\forall c$$

(71)

$$\tau \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10000)$$

(72)

$${Y}_{c}^{{{{\dagger}}} } \sim {{{{{{{\rm{Normal}}}}}}}}(\mu={\hat{Y}}_{c},\sigma=\tau )\,\forall c.$$

(73)

Again using uninformative priors for the error, the likelihood to obtain our results given the individual country effect size estimate ${Y}_{c}^{{{{\dagger}}} }$ from the hierarchical linear model is

$${Y}_{s,c} \sim {{{{{{{{\rm{StudentT}}}}}}}}}_{\nu=4}\left(\mu={Y}_{c}^{{{{\dagger}}} },\sigma={\hat{\sigma }}_{c}\right)\,with$$

(74)

$${\hat{\sigma }}_{c} \sim {{{{{{{\rm{HalfCauchy}}}}}}}}(\beta=10000).$$

(75)

Therefore, our regression model includes the “measurement error” ${\hat{\sigma }}_{c}$, which models the heteroscadistic effect size of every country, and an additional model error τ which models the homoscedastic deviations of the country effect sizes from the linear model. In the plots, we plot the regression line ${\hat{Y}}_{c}$ with its shaded 95% CI, and data points (${\hat{X}}_{c}$, ${Y}_{c}^{{{{\dagger}}} }$) where the whiskers correspond to the one standard deviation, modeled here by ϵ_c and ${\hat{\sigma }}_{c}$.

The coefficient of determination, R², is calculated following the procedure suggested by Gelman and colleagues⁵². Their R² measure is intended for Bayesian regression models as it notably uses the expected data variance given the model instead of the observed data variance. For our model, it is defined as

$${R}^{2}=\frac{{{{{{{{\rm{Explained\;variance}}}}}}}}}{{{{{{{{\rm{Residual\;variance}}}}}}}}+{{{{{{{\rm{Explained\;variance}}}}}}}}}=\frac{\frac{1}{{n}_{c}-1}{\sum }_{c}{\hat{Y}}_{c}^{2}}{{\tau }^{2}+\frac{1}{{n}_{c}-1}{\sum }_{c}{\hat{Y}}_{c}^{2}},$$

(76)

where n_c is the number of countries. With this formula, one obtains the posterior distribution of R² by evaluating it for every sample.

As auxiliary data, we used:

1.
Mobility data: We use the mobility index m_c,t provided by the “Google COVID-19 Community Mobility Reports”⁵³ for each country c at day t during the Euro 2020 (t ∈ [June 11 2021, July 11 2021]), where N denotes the number of days in the interval. The error is the standard deviation of the mean:
$${X}_{c}=\frac{1}{N}\mathop{\sum}\limits_{t}{m}_{c,t}$$
(77)
$${\epsilon }_{c}=\sqrt{\frac{1}{{N}^{2}}\mathop{\sum}\limits_{t}{({m}_{c,t}-{X}_{c})}^{2}}$$
(78)
2.
Reproduction number: We use the base reproduction number ${R}_{{{{{{{{\rm{pre}}}}}}}},c}$ for each country c as inferred from our model 2 weeks prior to the Euro 2020 (t ∈ [May 28 2021, June 11 2021]).
$${X}_{c}=\frac{1}{N}\mathop{\sum}\limits_{t}{R}_{{{{{{{{\rm{pre}}}}}}}},c}(t)$$
(79)
$${\epsilon }_{c}=\sqrt{\frac{1}{N}\mathop{\sum}\limits_{t}{({R}_{{{{{{{{\rm{pre}}}}}}}},c}(t)-{X}_{c})}^{2}}$$
(80)
3.
Cumulative reported cases: From the daily reported cases C(t) two weeks prior to the Euro 2020 (t ∈ [May 28 2021, June 11 2021]), we computed the cumulative reported cases normalized by the number of inhabitants p_c in each country c. Note: We also used reported cases without gender assignment here.
$${X}_{c}=\frac{{\sum }_{t}C(t)}{{p}_{c}}$$
(81)
$${\epsilon }_{c}=\overrightarrow{0}$$
(82)
4.
Potential for COVID-19 spread: As for the cumulative cases we used the daily reported cases C(t) two weeks prior to the Euro 2020 (t ∈ [May 28 2021, June 11 2021]), and we computed the cumulative reported cases normalized by the number of inhabitants p_c in each country c. Furthermore, we used the base reproduction number R_base(t) 2 weeks prior to the Euro 2020, as well as the duration of a country participating in the championship T_c (Table S5) to compute the potential for spread:
$${N}_{0}=\frac{{\sum }_{t}C(t)}{{p}_{c}}$$
(83)
$${X}_{c}={N}_{0}\cdot \frac{{\sum }_{t}{R}_{{{{{{{{\rm{pre}}}}}}}},c}^{{T}_{c}/4}(t)}{N}$$
(84)
$${\epsilon }_{c}=\overrightarrow{0}$$
(85)
5.
Proxy for popularity: To represent popularity of the Euro 2020 in country c, we used the union of the number of matches played by each country n_match,c and the number of matches hosted by each country n_hosted,c (Table S5). By “union” we mean the sum without the overlap, i.e., we take the sum of these numbers and subtract the number of home matches n_home,c
$${X}_{c}={n}_{{{{{{{{\rm{match}}}}}}}},c}+{n}_{{{{{{{{\rm{hosted}}}}}}}},c}-{n}_{{{{{{{{\rm{home}}}}}}}},c}$$
(86)
$${\epsilon }_{c}=\overrightarrow{0}$$
(87)

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data from our model runs, i.e., from the sampling is available on G-node https://gin.g-node.org/semohr/covid19_soccer_data. The daily case numbers stratified by age and gender were acquired from the local health authorities (see also Supplementary section S1 from the following sources: Robert Koch Institut, Germany; Santé publique, France; National Health Service, England; Österreichische Agentur für Gesundheit und Ernährungssicherheit GmbH, Austria; Sciensano, Belgium Ministerstvo zdravotnictví, Czech Republic; National Institute for Public Health and the Environment, The Netherlands; and COVerAGE-DB.

Code availability

All code to reproduce the analysis and figures shown in the manuscript as well as in the Supplementary Information is available online on GitHub https://github.com/Priesemann-Group/covid19_soccer or via https://doi.org/10.5281/zenodo.7386313⁵⁴.

References

Chau, N. V. V. et al. Superspreading event of SARS-CoV-2 infection at a Bar, Ho Chi Minh City, Vietnam. Emerg. Infect. Dis. 27, 310 (2021).
Article Google Scholar
Bernheim, B. D., Buchmann, N., Freitas-Groff, Z. & Otero, S. The effects of large group meetings on the spread of COVID-19: the case of Trump rallies. https://siepr.stanford.edu/publications/working-paper/effects-large-group-meetings-spread-covid-19-case-trump-rallies (2020).
Wang, L. et al. Inference of person-to-person transmission of COVID-19 reveals hidden super-spreading events during the early outbreak phase. Nat. Commun. 11, 1–6 (2020).
ADS Google Scholar
Dave, D., McNichols, D. & Sabia, J. J. The contagion externality of a superspreading event: the Sturgis Motorcycle Rally and COVID-19. South Econ. J. 87, 769–807 (2021).
Article Google Scholar
Leclerc, Q. J. et al. What settings have been linked to SARS-CoV-2 transmission clusters?Wellcome Open Res. 5, 83 (2020).
Nordsiek, F., Bodenschatz, E. & Bagheri, G. Risk assessment for airborne disease transmission by poly-pathogen aerosols. PLoS ONE 16, e0248004 (2021).
Article CAS Google Scholar
Fischer, K. Thinning out spectators: did football matches contribute to the second COVID-19 wave in Germany? Ger. Econ. Rev. 23, 595–640 (2022).
Article Google Scholar
Toumi, A., Zhao, H., Chhatwal, J., Linas, B. P. & Ayer, T. The effect of NFL and NCAA football games on the spread of COVID-19 in the United States: an empirical analysis. Preprint at medRxiv https://doi.org/10.1101/2021.02.15.21251745 (2021).
Olczak, M., Reade, J. & Yeo, M. Mass outdoor events and the spread of an airborne virus: English football and Covid-19. https://doi.org/10.2139/ssrn.3682781 (2021).
Alfano, V. COVID-19 diffusion before awareness: the role of football match attendance in Italy. J. Sports Econ. 23, 503–523 (2022).
Article Google Scholar
Gómez, J.-P. & Mironov, M. Using soccer games as an instrument to forecast the spread of covid-19 in europe. Financ. Res. Lett. 43, 101992 (2021).
Article Google Scholar
Jones, B. et al. SARS-CoV-2 transmission during rugby league matches: do players become infected after participating with SARS-CoV-2 positive players? Br. J. Sports Med. 55, 807–813 (2021).
Article Google Scholar
Schumacher, Y. O. et al. Resuming professional football (soccer) during the COVID-19 pandemic in a country with high infection rates: a prospective cohort study. Br. J. Sports Med. 55, 1092–1098 (2021).
Article Google Scholar
Egger, F., Faude, O., Schreiber, S., Gärtner, B. C. & Meyer, T. Does playing football (soccer) lead to SARS-CoV-2 transmission? - a case study of 3 matches with 18 infected football players. Sci. Med. Footb. 5, 2–7 (2021).
Article Google Scholar
Oh, T., Sung, H. & Kwon, K. D. Effect of the stadium occupancy rate on perceived game quality and visit intention. Int. J. Sports Mark. Spons. 18, 166–179 (2017).
Google Scholar
Herold, E., Boronczyk, F. & Breuer, C. Professional clubs as platforms in multi-sided markets in times of COVID-19: the role of spectators and atmosphere in live football. Sustainability 13, 2312 (2021).
Horky, T. No sports, no spectators - no media, no money? The importance of spectators and broadcasting for professional sports during covid-19. Soccer Soc. 22, 96–102 (2021).
Article Google Scholar
Yim, B. H., Byon, K. K., Baker, T. A. & Zhang, J. J. Identifying critical factors in sport consumption decision making of millennial sport fans: mixed-methods approach. Eur. Sport Manag. Q. 21, 484–503 (2021).
Article Google Scholar
Cuschieri, S., Grech, S. & Cuschieri, A. An observational study of the covid-19 situation following the first pan-european mass sports event. Eur. J. Clin. Investig. 52, e13743 (2022).
Article CAS Google Scholar
Feder, T. Soccer obeys bessel-function statistics. Phys. Today 59, 26 (2006).
ADS Google Scholar
Dehning, J. et al. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science 369, eabb9789 (2020).
Brauner, J. M. et al. Inferring the effectiveness of government interventions against COVID-19. Science 371, eabd9338 (2020).
Sharma, M. et al. Understanding the effectiveness of government interventions against the resurgence of COVID-19 in Europe. Nat. Commun. 12, 1–13 (2021).
Article CAS Google Scholar
Lagaert, S. & Roose, H. The gender gap in sport event attendance in Europe: the impact of macro-level gender equality. Int. Rev. Sociol. Sport 53, 533–549 (2018).
Article Google Scholar
Shah, S. A. et al. Predicted COVID-19 positive cases, hospitalisations, and deaths associated with the Delta variant of concern, June–July, 2021. Lancet Digital Health 3, e539–e541 (2021).
Marsh, K., Griffiths, E., Young, J. J., Gibb, C.-A. & McMenamin, J. Contributions of the EURO 2020 football championship events to a third wave of SARS-CoV-2 in Scotland, 11 June to 7 July 2021. Euro Surveill. 26, 2100707 (2021).
Riley, S. et al. REACT-1 round 13 interim report: acceleration of SARS-CoV-2 Delta epidemic in the community in England during late June and early July 2021. Preprint at medRxiv https://doi.org/10.1101/2021.07.08.21260185 (2021).
Roxby, P. Covid: watching Euros may be behind rise in infections in men. BBC News (8 July 2021).
Fraser, C. Estimating individual and household reproduction numbers in an emerging epidemic. PLoS ONE 2, e758 (2007).
Riffe, T. & Acosta, E. Data resource profile: COVerAGE-DB: a global demographic database of COVID-19 cases and deaths. Int. J. Epidemiol. 50, 390–390f (2021).
Article Google Scholar
Wilkinson, M. D. et al. The fair guiding principles for scientific data management and stewardship. Sci. Data 3, 1–9 (2016).
Article Google Scholar
Government of The Netherlands. Reopening society step by step. https://web.archive.org/web/20210627222056/https://www.government.nl/topics/coronavirus-covid-19/plan-to-reopen-society (2022).
Hale, T. et al. A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat. Hum. Behav. 5, 529–538 (2021).
Article Google Scholar
Zierenberg, J., Spitzner, F. P., Priesemann, V., Weigel, M. & Wilczek, M. How contact patterns destabilize and modulate epidemic outbreaks. Preprint at arXiv https://arxiv.org/abs/2109.12180 (2021).
Iftekhar, E. N. et al. A look into the future of the COVID-19 pandemic in Europe: an expert consultation. Lancet Regional Health Eur. 8, 100185 (2021).
Article Google Scholar
Heese, H. et al. Results of the enhanced COVID-19 surveillance during UEFA EURO 2020 in Germany. Epidemiol. Infect. 150, 1–7 (2022).
Article Google Scholar
Reuters. Euro 2020 players affected by COVID-19. https://www.reuters.com/article/soccer-euro-coronavirus-idUKL5N2NS2ED (2021).
Kretzschmar, M. E., Rozhnova, G. & van Boven, M. Isolation and contact tracing can tip the scale to containment of COVID-19 in populations with social distancing. Front. Phys. https://doi.org/10.3389/fphy.2020.622485 (2021).
Kerr, C. C. et al. Controlling COVID-19 via test-trace-quarantine. Nat. Commun. 12, 1–12 (2021).
Article Google Scholar
Contreras, S. et al. The challenges of containing SARS-CoV-2 via test-trace-and-isolate. Nat. Commun. 12, 1–13 (2021).
Article Google Scholar
Contreras, S. et al. Low case numbers enable long-term stable pandemic control without lockdowns. Sci. Adv. 7, eabg2243 (2021).
Salathé, M. et al. COVID-19 epidemic in Switzerland: on the importance of testing, contact tracing and isolation. Swiss Med. Wkly. 150, w20225 (2020).
Google Scholar
Flaxman, S. et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 584, 257–261 (2020).
Meier, H. E., Strauss, B. & Riedl, D. Feminization of sport audiences and fans? Evidence from the German men’s national soccer team. Int. Rev. Sociol. Sport 52, 712–733 (2017).
Article Google Scholar
Zhang, M. et al. Transmission dynamics of an outbreak of the COVID-19 Delta variant B.1.617.2 - Guangdong Province, China, May-June 2021. China CDC Wkly. 3, 584–586 (2021).
Article Google Scholar
Hart, W. S. et al. Generation time of the alpha and delta SARS-CoV-2 variants: an epidemiological analysis. Lancet Infect. Dis. 22, 603–610 (2022).
Article CAS Google Scholar
Hu, S. et al. Infectivity, susceptibility, and risk factors associated with SARS-CoV-2 transmission under intensive contact tracing in Hunan, China. Nat. Commun. 12, 1533 (2021).
Article ADS CAS Google Scholar
Ferretti, L. et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science 368, eabb6936 (2020).
Article CAS Google Scholar
Salvatier, J., Wiecki, T. V. & Fonnesbeck, C. Probabilistic programming in Python using PyMC3. PeerJ Comput. Sci. 2, e55 (2016).
Article Google Scholar
Hoffman, M. D. & Gelman, A. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 15.1, 1593–1623 (2014).
Vehtari, A., Gelman, A., Simpson, D., Carpenter, B. & Bürkner, P.-C. Rank-normalization, folding, and localization: An improved $\widehat{R}$ for assessing convergence of MCMC. Bayesian Analysis16 (2021).
Gelman, A., Goodrich, B., Gabry, J. & Vehtari, A. R-squared for Bayesian Regression Models. Am. Stat. 73, 307–309 (2019).
Article MathSciNet MATH Google Scholar
Google. COVID-19 community mobility reports. https://www.google.com/covid19/mobility/ (2022).
Dehning, J. et al. Software: impact of the Euro 2020 championship on the spread of COVID-19. github https://github.com/Priesemann-Group/covid19_soccer; zenodo https://doi.org/10.5281/zenodo.7386313 (2022).

Download references

Acknowledgements

We thank the Priesemann group for exciting discussions and for their valuable input. We also thank Arne Gottwald and Cornelius Grunwald for their continuous input during the conceptual and finalizing stage of the manuscript. We thank Piklu Mallick for proofreading. All authors received support from the Max-Planck-Society. J.D. and S.B.M. received funding from the “Netzwerk Universitätsmedizin" (NUM) project egePan (01KX2021). S.B.M. received funding from the “Infrastructure for exchange of research data and software" (crc1456-inf) project. S.C. and P.D. received funding by the German Federal Ministry for Education and Research for the RESPINOW project (031L0298) and ENI for the infoXpand project (031L0300A). V.P. was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2067/1-390729940. This work was partly performed in the framework of the PUNCH4NFDI consortium supported by DFG fund “NFDI 39/1”, Germany.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Jonas Dehning, Sebastian B. Mohr.

Authors and Affiliations

Max Planck Institute for Dynamics and Self-Organization, Am Faßberg 17, 37077, Göttingen, Germany
Jonas Dehning, Sebastian B. Mohr, Sebastian Contreras, Philipp Dönges, Emil N. Iftekhar & Viola Priesemann
Max Planck Institute for Physics, Föhringer Ring 6, 80805, München, Germany
Oliver Schulz
Physikalisches Institut, Universität Bonn, Nußallee 12, 53115, Bonn, Germany
Philip Bechtle
Institute for the Dynamics of Complex Systems, University of Göttingen, Friedrich-Hund-Platz 1, 37077, Göttingen, Germany
Viola Priesemann
Institute of Computer Science and Campus Institute Data Science, University of Göttingen, Goldschmidtstraße 7, 24118, Göttingen, Germany
Viola Priesemann

Authors

Jonas Dehning
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian B. Mohr
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Contreras
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Dönges
View author publications
You can also search for this author in PubMed Google Scholar
Emil N. Iftekhar
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Schulz
View author publications
You can also search for this author in PubMed Google Scholar
Philip Bechtle
View author publications
You can also search for this author in PubMed Google Scholar
Viola Priesemann
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: V.P., S.B.M., J.D., S.C., P.D. Methodology: S.B.M., J.D., V.P., P.B., O.S. Software: S.B.M., J.D. Validation: S.B.M., J.D. Formal analysis: S.B.M., J.D. Investigation: S.B.M., J.D., O.S., P.B. Data curation: S.B.M., J.D., P.D., O.S. Writing—original draft: all. Writing—review and editing: all. Visualization: S.B.M., J.D., P.D., S.C., E.I. Supervision: V.P., P.B., O.S., J.D. Funding acquisition: V.P., P.B., O.S.

Corresponding authors

Correspondence to Philip Bechtle or Viola Priesemann.

Ethics declarations

Competing interests

V.P. is a member of the ExpertInnenrat of the German federal government on COVID and is also advising other governmental and non-governmental entities. The other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dehning, J., Mohr, S.B., Contreras, S. et al. Impact of the Euro 2020 championship on the spread of COVID-19. Nat Commun 14, 122 (2023). https://doi.org/10.1038/s41467-022-35512-x

Download citation

Received: 27 June 2022
Accepted: 08 December 2022
Published: 18 January 2023
DOI: https://doi.org/10.1038/s41467-022-35512-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Early pandemic COVID-19 case growth rates increase with city size

Effects of government policies on the spread of COVID-19 worldwide

Masks and distancing during COVID-19: a causal framework for imputing value to public-health interventions

Introduction

Results

The main impact arises from the subsequent infection chains

The strongest effect is observed in England and Scotland

Low overall incidence prevents large match-related spread

Discussion

Methods

Modeling the spreading dynamics, including gender imbalance

Football-related effect

Non-football-related reproduction number

Delay

Constant delay

Weekday-dependent delay

Likelihood

Average effect across countries

Calculating the primary and subsequent cases

Sampling

Robustness tests

Correlations

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links