Regional excess mortality during the 2020 COVID-19 pandemic in five European countries

The impact of the COVID-19 pandemic on excess mortality from all causes in 2020 varied across and within European countries. Using data for 2015–2019, we applied Bayesian spatio-temporal models to quantify the expected weekly deaths at the regional level had the pandemic not occurred in England, Greece, Italy, Spain, and Switzerland. With around 30%, Madrid, Castile-La Mancha, Castile-Leon (Spain) and Lombardia (Italy) were the regions with the highest excess mortality. In England, Greece and Switzerland, the regions most affected were Outer London and the West Midlands (England), Eastern, Western and Central Macedonia (Greece), and Ticino (Switzerland), with 15–20% excess mortality in 2020. Our study highlights the importance of the large transportation hubs for establishing community transmission in the first stages of the pandemic. Here, we show that acting promptly to limit transmission around these hubs is essential to prevent spread to other regions and countries.

B y December 2020, the World Health Organization (WHO) reported 1,813,188 Coronavirus disease 2019 (COVID-19) related deaths globally 1 . Although COVID-19 related deaths are key to monitoring the pandemic's burden, vital statistics generally suffer from issues related to accuracy and completeness 2 . Additionally, they can be subject to changes in definition and different policies regarding testing and reporting 3,4 . At the same time, focusing only on COVID-19 deaths does not provide information about indirect pandemic effects due to disruption to health services and wider economic, social and behavioural changes in the population 5 . An effective way to quantify the total mortality burden of the COVID-19 pandemic is through excess mortality 6 . Excess mortality compares the number of deaths from all causes observed during the pandemic, and the number of deaths expected had the pandemic not occurred, using data from recent pre-pandemic years. Preliminary estimates suggest that the total number of global deaths attributable to the COVID-19 pandemic in 2020 is at least 3 million, with approximately 37% of these deaths occurring in the European region 1 .
Previous studies have examined excess mortality at the national level reporting a disproportionate mortality burden 6,7 . In Europe, England and Wales, Spain and Italy experienced the largest increase in mortality during March-May 2020, with excess mortality estimates for the two sexes ranging from 70 to 102 per 100,000 population 7 . In contrast, from October to December 2020, the impact on mortality was great in Switzerland 6 . Studies focusing on the regional level have also found differential effects on mortality [8][9][10] . In Italy higher excess mortality was observed in some provinces in the North-west 8,9 . In England, the highest excess mortality was observed in London and in the West Midlands during March-May 2020 11,12 . In Greece, Eastern Macedonia and Thrace, Western Macedonia and Central Macedonia reported more than 10% excess mortality, in contrast to the Aegean islands and Crete for which the excess was smaller than 3% 10 . Studying these variations may help our understanding of the transmission patterns and the effectiveness of policies and measures to contain the pandemic. Other factors that may have also contributed to the varying impact on mortality across regions include differences in demographics 13 , the prevalence of comorbidities 13 and environmental factors 12,14,15 .
In this study of five European countries, we examined the impact of the COVID-19 pandemic on mortality in 2020 using weekly regional all-cause mortality data. We included countries from Northern, Western and Southern Europe (England, Greece, Italy, Spain and Switzerland) and analysed regions defined by Eurostat, which are consistent across different European countries, Supplementary Fig. 1. We used a model-based approach to predict deaths for 2020 by specific age-and sex groups, under the counterfactual scenario that COVID-19 had not occurred. To the best of our knowledge, this is the first international study of the COVID-19 regional impact on mortality.

Results
A total of 565,505 deaths were recorded in 2020 in England,132,514 in Greece, 756,450 in Italy, 485,536 in Spain and 77,222 in Switzerland (Table 1). The estimated population in 2020 was 56,702,967 in England,10,718,447 in Greece, 59,641,219 in Italy, 47,332,587 in Spain and 8,681,297 in Switzerland (Table 1). In all countries, the number of deaths in males compared to females was larger for all age groups below 80 years. Comparing the observed number of deaths with the mean number of deaths from 2015 to 2019, there were 40,631 and 27,739 excess deaths in males and in females, respectively, in England, 5,380 and 5,909 in Greece, 59,327 and 52,206 in Italy, 35,868 and 33,208 in Spain and 5788 and 4526 in Switzerland (Table 1). Table 1 Age and sex-specific number of excess and observed deaths and population for 2020 by country of death. The expected number of deaths were calculated as the mean of crude deaths by age and sex during 2015-2019. The population numbers refer to the 1st of January in 2020. Sub-national level patterns: NUTS2 regions. Excess mortality was evident for most regions, but with large intra-country variability, as shown in Fig. 1. Across the five countries, Madrid, Castile-La Mancha, Castile-Leon (Spain) and Lombardia (Italy) are the regions with the highest excess mortality in 2020, ranging from 28% (95% CrI 22%-34%) to 33% (95% CrI 27%-39%) for males and 25% (95% CrI 17%-38%) to 32% (95% CrI 23%-40%) for females ( Fig. 1 and Supplementary Tables 4-5). Ceuta (Spain) experienced a similar median excess for females (31%: 95% CrI 14%-54%), albeit associated with larger uncertainty, Fig. 1  Finer sub-national level patterns: NUTS3 regions. The higher resolution maps in Figs. 2 and 3 show the median relative excess mortality and the posterior probability of a positive excess, allowing appreciation of patterns missed by the lower resolution. In England, the high excess experienced by the West Midlands was driven by Birmingham, the largest urban area in the region, which recorded values above 20%. In Greece, the municipality of Drama, with an excess >20% was responsible for the high excess in Eastern Macedonia and Thrace. For Italy, outside the Northern regions, the model highlights localised excess in the provinces of Rimini, Pesaro-Urbino and Foggia, on the Eastern coast, while central Spain and Catalonia had the highest excess in Spain. In Switzerland, all regions had a relative excess of <20% but the French and Italian speaking regions experienced a posterior probability of a positive excess above 0.95. In contrast, the German-speaking regions were more diverse, Figs. 2 and 3. In Supplementary Figs. 12 to 21 we report the spatial trends by age and sex at the higher geographical resolution. The spatial trends differ depending on the age group, but are similar for men and women for age groups over 60 years.

England
Spatio-temporal trends at the regional level. Figure 4 shows the relative excess mortality and the posterior probability that the excess mortality is greater than 0 for the different NUTS2 regions, for each week in 2020. Across the five countries, relative excess larger than 200% is observed only during the first epidemic period of 2020 (March-May 2020) in England (Greater London), Italy (Lombardia) and Spain (Madrid, Castille-La Mancha, Catalonia), Fig. 4. In Switzerland, during the first epidemic period, the geographical patterns of excess mortality were highly localised, with Ticino experiencing the highest excess mortality. In contrast, the geographical variability in Greece was more diffuse. During the second epidemic period (October-December 2020), the excess mortality in Italy and Switzerland was similar across the country, whereas in Greece it was highly localised, with Central Macedonia experiencing a relative excess mortality between 100-200% during November 2020, Fig

Discussion
To the best of our knowledge, this is the first multi-country study examining excess mortality in 2020 across five European countries at the sub-national level. We found that excess mortality in 2020 varied widely both between countries and within countries. Spain experienced the largest excess mortality among the five countries studied. Within Greece and Italy the northern regions were more affected than other regions. The temporal trends at the sub-national level showed patterns of localised excess mortality in England, Italy, Spain and Switzerland during the first wave, whereas in Greece the excess mortality was homogeneous. During the second wave, excess deaths were overall lower in magnitude and their distribution more homogeneous in England, Italy and Spain. In contrast, in Greece and Switzerland, the second wave was more severe than the first one. Our study has several strengths. It quantifies the short term, direct effect of the COVID-19 pandemic and indirect effects on mortality due to other life-threatening conditions, such as myocardial infarction 17 , in five European countries. Reduced access to or uptake of medical care due to COVID-19 leading to delays in cancer screening, cancer diagnosis, rescheduling of surgery or cancellation of outpatient visits in patients with chronic conditions may have increased mortality during the pandemic, particularly in countries with weaker health systems [18][19][20] . In contrast, a population-based study in the UK found that that primary care contacts for physical and mental health conditions decreased after the introduction of lockdowns in March, 2020, and remained below pre-lockdown levels 21 . These trends, could represent a substantial burden of unmet needs, with potential implications for subsequent morbidity and premature mortality 21 . Our modelling approach accounts for spatial and temporal mortality trends, factors such as temperature and public  holidays, and the different population temporal trends across space, age and sex groups, factors not taken into account in previous reports 10,11 . We carefully validated the model employing a cross-validation approach and found that it had high predictive accuracy. In contrast to previous studies, we stratified by age and sex, thus allowing the spatial and temporal mortality trends to vary across these groups. Weaknesses include the lack of detailed data on the causes of death, which would have allowed insights into the sources of the observed variation in excess deaths. We used population estimates coming from the national statistical institutes, but we did not account for their uncertainty, which is supposed to increase the further we move from the last census. This is likely to underestimate the width of the 95% CrI of the excess mortality 22 . Several previous studies reported nationwide excess mortality for 2020. The Office for National Statistics in England reported a 17.9% increase in male mortality and 11.2% in females 11 . A recent study of 40 industrialised countries covered the period from February 2020 to February 2021 and found an excess mortality of 15% to 20% in England and Wales, Spain and Italy 23 . Our point estimates are lower but when we consider their uncertainty, the differences are relatively small and probably explained by the periods used to train the model, the data sources used, the prediction periods and the different population estimation approaches 24 . Other data-related differences may play a role: for instance, previous analyses considered England and Wales 23 , while our focus is on England only. Our results are in line with estimates from the Hellenic Statistical Authority, which reported a 7.3% increase in the relative excess in Greece during 2020 10 , a Swiss study reporting a 10.6% increase in excess mortality in males and a 7.2% increases in females relative to 2019 25 and the estimates from the Italian National Institute of Statistics 26 . The Fig. 1 Posterior distribution of relative excess deaths (%) across the different countries by NUTS2 region and sex in 2020. a Posterior distribution of relative excess deaths (%) in males in England. b Posterior distribution of relative excess deaths (%) in females in England. c Posterior distribution of relative excess deaths (%) in males in Greece. d Posterior distribution of relative excess deaths (%) in females in Greece. e Posterior distribution of relative excess deaths (%) in males in Italy. f Posterior distribution of relative excess deaths (%) in females in Italy. g Posterior distribution of relative excess deaths (%) in males in Spain. h Posterior distribution of relative excess deaths (%) in females in Spain. i Posterior distribution of relative excess deaths (%) in males in Switzerland and j posterior distribution of relative excess deaths (%) in females in Switzerland. The black dots represent the medians of the posterior distribution of relative excess deaths. The red line highlights the 0% relative excess deaths, which means no observed difference in the 2020 mortality compared to the counterfactual scenario that the pandemic did not occur.  27 . At the regional level, our findings align with the Office for National Statistics in England, which reported a 20% increase in the relative excess mortality in London, the largest relative excess observed nationwide in 2020 11 . Our estimates are also in line with the Hellenic Statistical Authority reports suggesting that Macedonia and Thrace experience the largest relative increase in excess deaths in Greece (14.9% in males and 12.9% in females) 10 . The observed north-to-south geographical gradient in the impact of COVID-19 in Italy is in line with previous studies 8,9 . The Italian National Institute of Statistics reported that the provinces with the highest excess of mortality in Northern Italy were Bergamo (51.5%), Cremona (47.5%), Lodi (39.9%) and Piacenza (35.7%) 26 . In central and southern Italy, Pesaro-Urbino (21.1%) and Foggia (16.1%) were the most affected provinces 26  Selected studies focusing on regional excess mortality during 2020 in countries not included in the current study have also reported geographical discrepancies 23,28 . A study focusing in selected Latin America countries found that the subregions of Roriama, Lima, Puebla and Santa Elena were the subregions most affected in Brazil, Peru, Mexico and Ecuador with the percentage increase in the excess mortality varying from 50 to 160% during 2020 28 . These values are much higher compared to the maximum observed increase in excess mortality in our study which was 33% increase in males across all ages in Madrid. Similarly, a lot of geographical variation was observed in the US with the maximum percentage increase in excess mortality observed in the state of New Jersey during 2020, with the excess being approximately 35%, comparable with the value observed in males in Madrid 23 . Regional studies focusing on the early stages of the pandemic reported geographical discrepancies in Europe and Asia. A study in France reported 63.7% excess mortality in Île-de-France during March and May 2020 29 . A study focusing on the three months of the COVID-19 outbreak in China reported 56% (33%-87%) increase in the excess in three Wuhan districts, whereas no significant excess in the rest of China 30 .
Several factors may have contributed to the differences in excess mortality we observed during the COVID-19 pandemic across countries and regions. Mortality depends on the probability of being infected and mortality among those infected. Both probabilities vary depending on the country's demographic and   Probability that the relative excess deaths is higher than 0% by NUTS3 region in 2020. a Probability that the relative excess deaths is higher than 0% in England. b Probability that the relative excess deaths is higher than 0% in Greece. c Probability that the relative excess deaths is higher than 0% in Italy. d Probability that the relative excess deaths is higher than 0% in Spain and e probability that the relative excess deaths is higher than 0% in Switzerland. Areas in blue indicate areas that observed less deaths than expected had the pandemic not occurred, whereas the different shades of red indicate the higher relative excess mortality. The black solid lines correspond to the NUTS2 region borders.
socio-economic characteristics, including age structure, ethnicity, level of deprivation, and environmental factors 14 . Further, the timeframe of non-pharmaceutical interventions in countries and regions and the resilience and capacity of health care systems have played a role 7 . The mobility of populations across borders and between regions and the timeliness of lockdowns have probably been the most important factors 31,32 . We observed weak, but consistent, evidence of differences in the excess mortality by sex in England, with potential explanations being risk factors that are known to change with sex including differences in occupation, lifestyle, medical comorbidities, or use of medications 33 . The first wave of the pandemic was mainly exogenous, with international airports and transport routes serving as main entry points. Thus, the highest number of excess deaths during the first wave was observed in the areas affected first, i.e., big transit hubs like London, Madrid, Lombardia and Ticino, and Geneva. From the initial point of introduction, SARS-CoV-2 spread to nearby large urban areas where community transmission was established and increased exponentially, spreading to the entire country in the absence of mobility restrictions 34 . Furthermore, during the first wave stochastic super-spreader events like the Champion's League football game between Atalanta and Valencia on February 19, 2020 35 played an important role in establishing community transmission 36 . The lockdowns in Italy, England and Spain were introduced after community transmission was established in the areas first affected. On the day of the national lockdown 1,797 new cases were reported in Italy, 1,159 in Spain and 2,349 in the UK. The lockdown reduced mobility, allowing some areas to maintain lower levels of community transmission. The importance of the timeliness of the lockdown is, among other results, highlighted in the example of Italy, where the large number of COVID-19 cases in the North forced nationwide mobility restrictions, benefiting the south of Italy where community transmission was not established, creating a North to South gradient in the excess mortality 31 . In Greece, a timely nationwide lockdown was imposed on March 13, 2020, before the country reached 100 reported cases per day, potentially explaining the lack of excess mortality nationwide during the first six months of 2020. However, given the data available, it is not trivial to disentangle the specific effect of lockdown measures from other control measures and changes in behaviour that occurred simultaneously in response to the pandemic such as spontaneous social distancing, face mask usage, or contact tracing.
The spatial distribution of excess mortality during the second wave of the pandemic was more homogeneous, reflecting multiple routes of entry and transmission. Factors contributing to this situation included the relaxation of non-pharmaceutical interventions with the reopening of schools, retail and other activities, domestic and international travel, and the public's loosening of preventive behaviours 37 . The timeliness of the lockdowns and population mobility again played a crucial role. Lockdowns during the second wave were slower to be implemented and less rigorous 38 . In Italy and Switzerland, the geographical distribution of the excess deaths was equal nationwide, whereas it was more variable in England, Greece and Spain. In Greece, where community transmission was not established during the first epidemic wave, the patterns observed were highly localised, mimicking the patters observed in the other countries during the first epidemic wave. Central Macedonia (with the transit hub Thessaloniki), Eastern Macedonia and Thrace, and Western Macedonia which border on Bulgaria and Turkey, are the hardest-hit regions. On November 3, 2020, the Greek nationwide lockdown limited transmission in the rest of the country, resulting in lower excess mortality in areas in the south. In Switzerland, the area hit hardest during the second wave was the lake of Geneva region, potentially influenced by the French second wave 37 .
In conclusion, this study provides the first comprehensive analysis of weekly sub-national excess mortality for 2020 across five countries, disaggregated by sex and age groups. Our findings highlight how excess mortality varied largely across countries, within countries and over time. They suggest that a timely lockdown led to reduced community transmissions and, subsequently, lower excess mortality. However, lockdowns have adverse short and long-term health, psychosocial and economic effects that need to be considered 7,39 . Community transmission was established in the transit hubs and nearby large metropolitan areas during the first stages of the pandemic. Therefore, rapid action to limit transmission around these hubs is essential to prevent spread to other regions and countries. The study is about secondary, aggregate anonymised data so no additional ethical permission is required.
All-cause mortality. We retrieved data for all-cause deaths and population counts from the Office for National Statistics in England (derived from the national mortality and birth registrations and the Census), the Hellenic Statistical Authority in Greece, the Italian National Institute of Statistics in Italy, the National Centre of Epidemiology at the Carlos III Health Institute and the Daily Monitoring Mortality System and also the National Statistics Institute and Ministry of Justice in Spain and the Federal Statistical Office in Switzerland, Supplementary Table 8. We selected the current Nomenclature of Territorial Units for Statistics (NUTS) and in particular NUTS3 (small regions for specific diagnoses) as the main spatial unit of our analysis 40 . We also show results at the NUTS2 level, which is defined to reflect basic regions for the application of regional policies (https://ec.europa.eu/eurostat/ web/nuts/background/). The number of deaths from all-causes and the population denominator was available by sex, age, week and NUTS3 region defined as areas with a population varying from 150,000 to 800,000, for 2015-2020. We used the International Organization for Standardization (ISO) week calendar, i.e. the seven consecutive days beginning with a Monday and ending with a Sunday. We aggregated mortality and population data by age groups <40, 40-59, 60-69, 70-79 and 80 years and above to maintain consistency between countries and the literature 12 .  Table 7, which for 2020 were affected by COVID-19 deaths during the first wave. We, therefore, used the data for 2015 to 2019 and estimated the midyear population of 2020 through linear interpolation for England. We estimated population numbers for January 1 2020, and then used linear interpolation to obtain the weekly population of 2019, which we used as a proxy for 2020. Fig. 4 Weekly median relative excess deaths (%) across the different countries by NUTS2 region in 2020 (left) and corresponding probability that the weekly relative excess is larger than 0% (right). a Weekly median relative excess deaths in England. b Posterior probability that the relative excess is higher than 0% in England. c Weekly median relative excess deaths in Greece. d Posterior probability that the relative excess is higher than 0% in Greece. e Weekly median relative excess deaths in Italy. f Posterior probability that the relative excess is higher than 0% in Italy. g Weekly median relative excess deaths in Spain. h Posterior probability that the relative excess is higher than 0% in Spain. i Weekly median relative excess deaths in Switzerland and j posterior probability that the relative excess is higher than 0% in Switzerland. Different shades of red on the panels a, c, e, g and i indicate higher relative excess mortality, whereas the white ones relative excess mortality lower than 0%. The white colour on the panels b, d, f, h and j indicate insufficient evidence of a relative excess larger than 0%, whereas the grey strong evidence.
Covariates. As ambient temperature influences death rates 41 , we retrieved data on temperature from the ERA5 reanalysis data set of the Copernicus climate data 42 . Using data from global in situ and satellite measurements, ERA5 provides hourly estimates of a large number of atmospheric, land and oceanic climate variables, spatially and temporally compatible with our analysis 42 . For each centroid of the grid cells (at 0.25 ∘ × 0.25 ∘ resolution) that fall into the NUTS3 regions, we calculated the daily mean temperature during 2015-2020 and then the weekly mean, align temperature and mortality data. We are modelling weekly deaths due to data availability, hence we are implicitly considering a 0-7 lag. Additionally, as mortality from all causes can be different during national holidays, we also included a binary variable taking the value 1 if the week contains a public holiday and 0 otherwise.
Statistical methods. We used Bayesian hierarchical models to predict deaths in 2020, under the scenario of absence of the pandemic. Let y jtsk be the number of allcause deaths, P jtsk be the population at risk and r jtsk the risk in the j-th week of the t-th year (t = 1, …, 5 with year 1 corresponding to 2015), for the s-th spatial unit (s = 1, …, S) and k-th age-sex group (k = 1, …, 10) (male-female and <40, 40-59, 60-69, 70-79, ≥80). The models in the main analysis excluded the age group below 40 years, based on the cross-validation exercise. We assume a Poisson distribution for the number of deaths y jtsk and modelled the risk r jtsk using the following specification: where β 0t is the year specific intercept given by β 0t = β 0 + ϵ t , with β 0 being the global intercept and ϵ t $ Normal ð0; τ À1 ϵ Þ an unstructured random effect representing the deviation of each year from the global intercept, with τ ϵ denoting the precision of ϵ t . The term β 1 represents the effect of public holidays (i.e., Z j = 1 if week j contains a public holiday and 0 otherwise). The linear predictor includes also a non-linear effect f(⋅) of the average weekly temperature in each area, x jts ; in particular, we assume the following second-order random walk (RW2) model: with τ x denoting the precision. The term b s is a spatial field defined as an extension of the Besag-York-Mollié model given by the sum of an unstructured random effect, v s $ Normal ð0; τ À1 v Þ, and a spatially structured effect u s 43-45 . In particular b s is defined as follows: where u ? s and v ? s are standardised version of u s and v s to have variance equal to 1 46 . The term 0 ≤ ϕ ≤ 1 is a mixing parameter which measures the proportion of the marginal variance explained by the structured effect.
To account for seasonality, we included in the linear predictor a non-linear weekly effect w j , common to all the areas, with a first order random walk (RW1) structure: w j jw jÀ1 ; τ w $ Normal ðw jÀ1 ; τ À1 w Þ; where τ w is the precision of w j .
We specified minimally informative prior distributions, i.e. for the fixed effects β 0 and β 1 . For the spatial field hyperparameters ϕ and τ b we adopted priors that tend to regularise inference while not providing too strong information, the socalled penalize complexity (PC) priors introduced in 46 . In particular, for the standard deviation we selected a prior so that Pr(σ b > 1) = 0.01, implying that it is unlikely to have a spatial relative risk higher than expð2Þ based solely on spatial or temporal variation. For ϕ we set Pr(ϕ < 0.5) = 0.5 reflecting our lack of knowledge about which spatial component, the unstructured or structured, should dominate the field b. Finally, PC priors are also adopted for all the standard such that for each hyperparameter Pr(σ > 1) = 0.01. We train the model using the years 2015-2019 and predict area level weekly mortality for 2020 assuming that the pandemic did not take place. To summarise the results we retrieve samples by age, sex, week and NUTS3 regions for 2020 from the posterior predictive distribution: where θ is the vector of the model parameters and D the observed data. We report the weekly observed number of deaths in 2020 together with p(y jsk6 |D) at the weekly resolution. We also compare p(y jsk6 |D) with the observed number of deaths in 2020 and retrieve the posterior of the relative (percent) increase in mortality (i.e. relative to what is expected had the pandemic not occurred). We summarise the above posterior reporting medians, 95% credible intervals and posterior probability that the relative excess mortality is larger than 0. We also present half-year estimates of the median excess mortality by NUTS3 regions, age and sex and the posterior probability that the excess is larger than 0.NUTS3 regions and weeks were the main spatial and temporal resolution of the analysis. To calculate the different temporal, spatial, sex and age specific aggregations, we combined p(y jsk6 |D) for the different age and sex groups or integrate in time or space accordingly, resulting in the corresponding posteriors.
Model validation. We perform a cross-validation like procedure to examine the validity of our predictions. Using the years 2015-2019, we fit the proposed model multiple times, leaving out one year at a time and predicting the weekly number of deaths by NUTS3 regions for the year left out. We repeat for the different age and sex groups, and different countries. We assess the agreement between the predicted and observed deaths at the year t. We use the following metrics: a) the correlation between the predicted and observed deaths and b) the 95% coverage, defined as the probability that the observed deaths lie within the 95% interval estimated from the model.