Factors associated with the difference between the incidence and case-fatality ratio of coronavirus disease 2019 by country

Kim, Jeehyun; Hong, Kwan; Yum, Sujin; Gómez Gómez, Raquel Elizabeth; Jang, Jieun; Park, Sun Hee; Choe, Young June; Ryu, Sukhyun; Park, Dae Won; Lee, Young Seok; Lee, Heeyoung; Kim, Dong Hyun; Kim, Dong-Hyun; Chun, Byung Chul

doi:10.1038/s41598-021-98378-x

Download PDF

Article
Open access
Published: 23 September 2021

Factors associated with the difference between the incidence and case-fatality ratio of coronavirus disease 2019 by country

Jeehyun Kim^1,2,3,
Kwan Hong^1,2,
Sujin Yum^1,2,
Raquel Elizabeth Gómez Gómez^1,2,
Jieun Jang¹,
Sun Hee Park⁴,
Young June Choe⁵,
Sukhyun Ryu⁶,
Dae Won Park⁷,
Young Seok Lee⁸,
Heeyoung Lee⁹,
Dong Hyun Kim¹⁰,
Dong-Hyun Kim¹¹ &
…
Byung Chul Chun^1,2,3

Scientific Reports volume 11, Article number: 18938 (2021) Cite this article

2452 Accesses
18 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Coronavirus disease (COVID-19) has been spreading all over the world; however, its incidence and case-fatality ratio differ greatly between countries and between continents. We investigated factors associated with international variation in COVID-19 incidence and case-fatality ratio (CFR) across 107 northern hemisphere countries, using publicly available COVID-19 outcome data as of 14 September 2020. We included country-specific geographic, demographic, socio-economic features, global health security index (GHSI), healthcare capacity, and major health behavior indexes in multivariate models to explain this variation. Multiple linear regression highlighted that incidence was associated with ethnic region (p < 0.05), global health security index 4 (GHSI4) (beta coefficient [β] 0.50, 95% Confidence Interval [CI] 0.14–0.87), population density (β 0.35, 95% CI 0.10–0.60), and water safety level (β 0.51, 95% CI 0.19–0.84). The CFR was associated with ethnic region (p < 0.05), GHSI4 (β 0.53, 95% CI 0.14–0.92), proportion of population over 65 (β 0.71, 95% CI 0.19–1.24), international tourism receipt level (β − 0.23, 95% CI − 0.43 to − 0.03), and the number of physicians (β − 0.37, 95% CI − 0.69 to − 0.06). Ethnic region was the most influential factor for both COVID-19 incidence (partial ${R}^{2}$ = 0.545) and CFR (partial ${R}^{2}$ = 0.372), even after adjusting for various confounding factors.

Disease burden and clinical severity of the first pandemic wave of COVID-19 in Wuhan, China

Article Open access 27 October 2020

Juan Yang, Xinhua Chen, … Prof Hongjie Yu

Spatial and temporal fluctuations in COVID-19 fatality rates in Brazilian hospitals

Article Open access 10 May 2022

Andrea Brizzi, Charles Whittaker, … Oliver Ratmann

Global ecological analysis of COVID-19 mortality and comparison between “the East” and “the West”

Article Open access 28 March 2022

Ariel Pablos-Méndez, Simone Villa, … Richard Alan Cash

Introduction

Coronavirus disease (COVID-19), caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was declared as pandemic by World Health Organization (WHO) on 11 March 2020. Until the end of July 2021, pandemic has resulted in 196 million confirmed cases and more than four million deaths worldwide¹. There is variation in the caseloads and severities of COVID-19 across continents and countries^1,2. The Americas reported confirmed cases per one million population approximately 52.7 times higher confirmed cases per one million population than that of Western Pacific as of 16 August 2020 and these regional differences are not vanished even as the pandemic continues, Americas being approximately 33.3 times higher than Western Pacific as of 31 July 2021¹. The reason of the difference should be investigated, considering Western Pacific, where the disease occurred at first, had notably lower incidence and mortality than the Americas and Europe, where most of the industrialized countries with sufficient healthcare capacity and hygiene condition are.

Numerous studies investigating the risk factors for COVID-19 outcomes within countries have been published. The examined risk factors are demographic features, comorbidities, socioeconomic disparity, and environmental features^3,4 at the district level. Specifically, male sex^5,6,7, older age^7,8, comorbidities^6,7,9,10,11 are suggested as factors that increase the risk of negative COVID-19 outcomes. Socioeconomic disparities, such as income level^12,13,14, education^12,13, and unemployment¹⁴, are reported to be associated with COVID-19. Moreover, ethnicity is suggested to be associated with the disparity of COVID-19 outcomes, although it is not verified whether the underlying cause of the disparity results from biological or socio-economic features of the different ethnicities^8,12,15. However, there are few country-level studies investigating the possible factors for international variation in COVID-19 outcomes. Clarifying the potential country-level factors could provide evidence for policy makers to implement appropriate COVID-19 control measures, such as social distancing and lockdowns.

Using publicly available data, this study aims to identify the factors related to the international variation of COVID-19 outcomes at the country level and to measure how much each factor could explain the disease outcome, by adjusting for the national COVID-19 test rate, and the demographic and economic features.

Methods

Data extraction

We obtained the data on COVID-19 outcomes of each country, i.e., total confirmed cases, recovered cases, deaths, and number of tests performed from Worldometer Coronavirus statistics websites¹⁶, one of the most popular COVID-19 data sources, at 14 September 2020. We retrieved data at 14 September 2020 so that we could consider that most of the countries had gone through the first wave of COVID-19^17,18 and chances of biased results caused by possible cocirculation of flu and COVID-19¹⁸ could be reduced. The number of total confirmed cases per million population was used as COVID-19 incidence and the number of total deaths divided by the number of confirmed cases was used as the CFR (%). Only countries in northern hemisphere were included since northern and southern hemispheres had different prevalence duration of COVID-19, and each hemisphere had different seasonality as of 14 September 2020⁴.

Information on country-level indices, namely, geographic, demographic, and socio-economic features, global health security index (GHSI), healthcare capacity, and health behaviors, were examined for possible factors, considering the results of previous studies which investigated association between COVID-19 health outcomes and each variable^{2,3,4,5,6,7,8,9,10,11,12,13,14,15,19,20,21,22,23}. Specifically, information on ethnic region²⁴, proportion of female (%)²⁵, land area (km²)²⁵, median age²⁶, population over 65 years of age (% of total population)²⁵, total population²⁷, population density (P/km²)²⁷, urban population (% of total population)²⁷, education index²⁶, GDP per capita (current US$)²⁵, Gini index²⁵ for detection of income dispersion, international tourism receipts (% of total exports)²⁵, and unemployment (% of total labor force)²⁵ was included in this study. Ethnic region was based on the data of ethnic categories extracted from a previously published article²⁴, because recognized social standards that defined ethnic categories at the national level was absent²⁸. Rawshani et al.²⁴ categorized ethnicity by considering geographical adjoins and evaluating each country’s ethnic composition, economic development, history, and religion. The ethnic region in our research consisted of nine categories: East Asia; Europe (high income), North America and Oceania; Europe (low income), Russia and Central Asia; Latin America and the Caribbean; Mediterranean Basin; Middle East and North Africa; Nordic countries; South Asia; and Sub-Saharan Africa. GHSI was a comprehensive assessment of the health secure capability of a country to prevent and combat epidemic. The index had an overall score and comprised six categories: prevention of pathogen release (GHSI1); detection and reporting for epidemics (GHSI2); rapid response to an epidemic (GHSI3); capability of the health system to treat patients and protect healthcare workers (GHSI4); compliance with international commitments (GHSI5); and, nationwide environmental risk and public health vulnerability to biological threats (GHSI6)²⁹. Each category of the GHSI and the overall scores ranged from 0 to 100, with higher scores indicating better preparedness in the corresponding category.

We collected information on healthcare capacity, such as healthcare access and quality (HAQ) index³⁰, health expenditure (% of GDP)²⁵, out-of-pocket expenditure (% of current health expenditure)²⁵, and the number of hospital beds, nurses, and physicians per 1000 people²⁵. The HAQ index analyzed the 32 causes of death that are considered avoidable in the availability of quality medical services³⁰. Causes of death included various health service areas, such as vaccine-preventable diseases; epidemics and maternal and child health; non-infectious diseases; and, gastrointestinal diseases in which death is preventable by surgery³⁰. The values ranged from 0 to 100, and higher values indicate that the country has a higher quality of and better accessibility to medical care³⁰. Information on comorbidities and health behaviors which can contribute to COVID-19 outcomes was extracted. We included information on obesity prevalence³¹, diabetes prevalence²⁵, smoking prevalence³¹, alcohol consumption²⁵, and water, sanitation, and hygiene (WASH) index³². The WASH index assesses the safety and accessibility to water and sanitation facilities and personal hygiene levels. The indicators are independent but also interdependent. The values ranged from 0 to 100, with higher scores indicating better conditions for the corresponding factor³². The WHO argued that ensuring proper condition of WASH in communities, homes, schools, and medical facilities would help prevent COVID-19 transmission³³. The WASH index that assessed personal hygiene was excluded from our analysis due to an abundance of missing values (81, 59.6%). There was no duplication between variables. All the data used in this study were publicly available.

Statistical analysis

The analysis was conducted in country level. Baseline information of variables was assessed with median, mean, minimum, maximum, 25th and 75th percentile. Medians was used for the imputation of missing values of independent variables as the independent variables were not normally distributed.

Multiple linear regression was used to identify potential factors associated with incidence and CFR. Outcome variables, including incidence and CFR, were log transformed for the multiple linear regression analysis. The zero value in the CFR (%) was imputed with 0.005 for log transformation (corresponding country: Laos, Mongolia, and Cambodia). The continuous independent variables were standardized to properly compare the effects of potential factors, as the scale of each factor was different.

Potential predictors were first identified by univariate linear regression with p < 0.25 (Tables S2 and S3 in Supplement). A backward elimination was implemented. Then, incidence model embedded variables that stood for sex, age, GDP per capita, and COVID-19 test rates, to verify the effects of potential factors on the disease even after adjusting for the national demographic and economic features, and COVID-19 test rates. CFR model embedded COVID-19 incidence instead of COVID-19 test rates, considering incidence could affect mortality by bringing burden to national capacity against COVID-19 and medical system. Multicollinearity was considered (variance inflation factors (VIF) < 10) for the variable selection. Thus, variables with VIF ≥ 10 were excluded for the final model. The outcomes were presented with beta coefficients (β), 95% confidence interval (CI) of beta coefficients, and partial R-squared statistics. Partial R-squared statistics implicated the explanation portion of each variable in the model. The explanatory power of the model was assessed using adjusted R-squared statistics.

The sub-analyses on 136 countries, including countries in both northern and southern hemisphere, selected variables as the main analysis did. Multiple linear regression on log transformed incidence (Table S4 in Supplement) and log transformed CFR (Table S5 in Supplement) were conducted. We also performed the sub-analyses by each ethnic region respectively, except Mediterranean Basin (N = 5), Nordic countries (N = 4), and Sub-Saharan Africa (N = 4) region because the number of countries included in corresponding regions was less than 10. By each ethnic region, COVID-19 incidence and CFR were dichotomized with the median value (0: lower incidence [CFR]; 1: higher incidence [CFR]). A backward elimination process was implemented on the model with potential factors as which identified by univariate logistic regression with p < 0.25. The model on incidence included variables that stood for sex, age, GDP per capita, and COVID-19 test rates, while the model on CFR embedded COVID-19 incidence instead of COVID-19 test rates. Multicollinearity was considered (VIF < 10) when selecting variables for the final model. Multiple logistic regression was conducted, and the results are suggested in Supplementary Tables S6 and S7.

All statistical analyses were performed using R version 4.0.2 (R foundation for Statistical Computing, https://www.r-project.org). We used QGIS version 3.10.13 (QGIS Development Team, http://qgis.osgeo.org) for mapping. The institutional review board (IRB) of Korea University granted exemption for this study (IRB exemption number: KUIRB-2020-0281-01).

Results

Characteristics of total selected countries

There were 215 countries or regions reported on Worldometer site on 14 September 2020. Countries or regions with less than one million population (n = 59), those with lower value than 0.001 for total test per population (n = 17), those with more than 10% missing independent variables (n = 3), and those in the southern hemisphere (n = 29) were excluded. Finally, 107 northern hemisphere countries were included for analysis (Fig. 1). Lists of the countries included in each ethnic region are summarized in Table 1. Among 107 countries, the most frequent ethnic regions were “Middle East and North Africa (21, 19.6%)” whereas “Nordic countries (4, 3.7%)” and “South Asia (4, 3.7%)” were the least frequent.

Table 1 Lists of countries in each ethnic region included in this study.

Full size table

Table 2 summarized the inherent characteristics, namely, the number of tests for COVID-19 performed per one million population (COVID-19 test rate); demographic, socio-economic features; Global Health Security capabilities; healthcare capacities; and, personal health-related features, of the 107 countries. COVID-19 test rate was 55,710.0 (25–75th percentile 14,451.0–136,931.0). The proportion of female was 50.4% (25–75th percentile 49.8–51.2). The median age was 32.5 years (25–75th percentile 25.6–41.7), and the proportion of the population over 65 years of age was 8.6% (25–75th percentile 4.2–17.6). The population density (P/km²) was 103.0 (25–75th percentile 56.0–219.0) and the proportion of urban population was 0.7 (25–75th percentile 0.5–0.8).

Table 2 Characteristics of total selected countries.

Full size table

The median education index was 0.7 (25–75th percentile 0.6–0.8). The GDP per capita was US$ 7808.2 (25–75th percentile 2574.9–23,504.0), and Gini index was 34.7 (25–75th percentile 31.5–40.8). The proportions of international tourism receipts and unemployment was 7.4% (25–75th percentile 3.9–16.1) and 5.3% (25–75th percentile 3.4–8.9), respectively. The overall GHSI was 44.2 (25–75th percentile 35.5–55.4). The GHSI assessing the risk environment (GHSI6) had the highest score (57.0, 25–75th percentile 46.8‒69.6) among the six categories whereas the index assessing health system (GHSI4) had the lowest score (31.6, 25–75th percentile 19.5–45.7).

The HAQ score was 72.0 (25–75th percentile 55.3–81.7). The percentage of health expenditure within the GDP was 6.4% (25–75th percentile 4.4–8.2), and the percentage of out-of-pocket expenditure was 33.5% (25–75th percentile 19.1–49.6). The number of hospital beds, nurses, and physicians per 1000 people were 2.7 (25–75th percentile 1.2–4.7), 4.2 (25–75th percentile 1.4–7.3), and 2.3 (25–75th percentile 0.7–3.3), respectively. The prevalence of alcohol consumption and smoking were 7.1% (25–75th percentile 2.7–10.5) and 23.6% (25–75th percentile 14.2–28.2). The prevalence of diabetes and obesity were 6.8% (25–75th percentile 5.4–9.2) and 21.5% (25–75th percentile 10.2–25.4). The WASH index for water safety was 97.1 (25–75th percentile 89.3–99.0) and index for facility sanitation was 94.2 (25–75th percentile 75.8–99.0).

COVID-19 incidence and case-fatality ratio as of 14 September 2020

The COVID-19 health-related outcomes among 107 countries as of 14 September 2020 are summarized in Table 3. The median value for incidence was 2583.0 (25–75th percentile 783.0–6261.0) and the maximum was 43,358.0, while the minimum was 3.0. The median value of deaths per one million population was 55.0 (25–75th percentile 11.0–152.0) and the maximum value was 855.0 whereas three countries, Laos, Mongolia, and Cambodia, had no deaths. The median CFR was 2.1% (25–75th percentile 1.3–3.2) and the maximum was 12.4% whereas the minimum was 0.0%.

Table 3 COVID-19 health-related outcomes of total selected countries.

Full size table

The median values of incidence and CFR across ethnic region are summarized in Figs. 2 and 3 and Table S1 in Supplement. The median value of incidence in “East Asia” was 95.0 (25–75th percentile 33.0–1223.5) whereas that in “Europe (low income), Russia and Central Asia” was 4810.5 (25–75th percentile 2828.0–7356.5). The CFR in “East Asia” was 1.3 (25–75th percentile 0.0–1.8) whereas that in “Europe (high income), North America and Oceania” was 3.8 (25–75th percentile 2.4–6.5).

Factors related to COVID-19 incidence

The results of the multiple linear regression analysis to investigate the significant factors affecting COVID-19 incidence are presented in Table 4. The explanatory power of the model was 63.7% (adjusted ${R}^{2}$ = 0.637). Ethnic region (p < 0.05), GHSI4 (β 0.50, 95% CI 0.14–0.87), population density (β 0.35, 95% CI 0.10–0.60), and WASH index for water safety (β 0.51, 95% CI 0.19–0.84) had positive associations with incidence, even after adjusting for the national demographic and economic features and COVID-19 test rates. Specifically, all other ethnic regions had significantly higher incidences than “East Asia.” “Latin America and the Caribbean” region had the highest beta coefficient among the regions (β 4.28, 95% CI 3.32–5.23). Ethnic region had the highest partial R-squared statistics among the factors (partial ${R}^{2}$ = 0.545). If the country had a higher GHSI4, population density, and WASH index for water safety, it was likely to have a higher incidence and ethnic region explained the largest part of the model.

Table 4 Multiple linear regression analysis on log transformed COVID-19 incidence.

Full size table

Factors related to COVID-19 case-fatality ratio

The results of the multiple linear regression analysis to investigate the significant factors influencing CFR are presented in Table 5. The model had 49.9% of explanatory power (adjusted ${R}^{2}$ = 0.499). The factors that had positive associations with the CFR were ethnic region (p < 0.05), GHSI4 (β 0.53, 95% CI 0.14–0.92), the proportion of the population over 65 years of age (β 0.71, 95% CI 0.19–1.24) whereas the number of physicians (β − 0.37, 95% CI − 0.69 to − 0.06) and the number of international tourism receipts (β − 0.23, 95% CI − 0.43 to − 0.03) had a negative association with the CFR. Specifically, the CFRs of all the other ethnic regions were significantly higher than that of “East Asia,” even after adjusting for sex, age, economic status, and COVID-19 incidence. The beta coefficient of “Latin America and the Caribbean” region was the highest among the ethnic regions (β 3.77, 95% CI 2.62–4.92). Ethnic region had the highest partial R-squared statistics among the factors (partial ${R}^{2}$ = 0.372). Countries with higher GHSI4, higher proportions of population over 65 years of age, fewer international tourism receipts, and fewer physicians were likely to have higher CFRs and ethnic region explained the largest part of the model.

Table 5 Multiple linear regression analysis on log transformed COVID-19 case-fatality ratio.

Full size table

Discussion

An analysis was conducted with publicly available data to identify the factors associated with COVID-19 incidence and CFR. Possible factors, namely, COVID-19 test rate; geographical, demographical, and socio-economic variables; degree of preparedness for epidemics; healthcare capacity; and, personal health-related variables, were evaluated.

Ethnic region was the most influential factor for both COVID-19 incidence and CFR, even after adjusting for the national demographic and economic features and COVID-19 test rates/incidence. The results of the sub-analysis including countries in both hemispheres also showed that ethnic region accounts for the largest part in the incidence (partial ${R}^{2}$ = 0.511) and CFR models (partial ${R}^{2}$ = 0.322) (Tables S4 and S5 in Supplement). Furthermore, sub-analyses by each ethnic region did not reveal any significant factors related to incidence and CFR consistently (Tables S6 and S7 in Supplement). Our results are possible to support the hypothesis that East Asia could have evolved for a long time to be more resistant to SARS-CoV-2, suggested by Yamamoto and Bauer². Yamamoto and Bauer² proposed that, differences in (1) socio-behavioral aspects, (2) virulency of viruses, (3) evolutionary history related to selection of people by the virus, or (4) hygienic conditions could cause discrepancies in COVID-19 outcomes between Central Europe and East Asia. In our results, ethnic region was the most influential features explaining the international variation of the disease, even after considering socio-behavioral aspects and hygienic aspects, with the WASH index, as possible factors. As COVID-19 control policies were implemented to constrain socio-behavioral aspects, national differences in policies could partly explain the differences in incidence^2,19. However, the national differences in policies could not fully explain the differences in the CFRs across countries². Chaudhry et al.¹⁹ also suggested that government actions, such as rapid border closing and complete lockdowns, could not sufficiently explain COVID-19 mortality. Furthermore, since there are insufficient virological studies investigating SARS-CoV-2 worldwide², the hypothesis that highlighted the differences in pathogenicity of viruses across regions is hardly supported. Therefore, our findings could support the ‘evolutionary hypothesis’ among the four hypotheses to explain these regional variations suggested by Yamamoto and Bauer². That is, the difference in native susceptibility of the hosts in each region may be a possible factor to explain these regional variations of incidence and fatality of COVID-19. Asians living in ‘Asian ethnic region’ including Chinese may have lower susceptibility to SARS-CoV-2, for any reason including the possibility of exposure to a pathogen with a similar antigenicity in the past. However, our data and analysis in this study may be insufficient to rule out other possible hypotheses and explanations. We are not against the results of previous studies³⁴ that the impact of the effective control measures against COVID-19 in East Asia could have resulted in lower incidence and CFR. As our study being country-level ecological study, we aim to suggest a hypothesis, not to prove hypothesis. Therefore, further studies at the individual levels are required to derive direct evidence for different susceptibilities to COVID-19 across ethnic regions, considering collinearity between ethnic region and control measures.

GHSI4, which evaluated the health system, was associated with a higher COVID-19 incidence and CFR. Our results support the argument that GHSI is not sufficiently predictive of pandemic response^35,36, and additional factors that better estimate pandemic preparedness should be embedded in the index³⁶. However, we should be cautious while interpreting the predictiveness of GHSI for the vulnerability to the epidemic as the COVID-19 pandemic is still ongoing.

Countries with better water safety levels were likely to have higher incidence. These results support the hypothesis that poorer hygienic conditions are associated with higher resistance to infectious disease². However, the observed negative effects of the WASH Index should be interpreted with caution. The association between water security and incidence might have resulted because countries with high water security usually had high economic statuses, given that GDP per capita and WASH index for water safety had a positive correlation (r = 0.47, p < 0.001). Therefore, the authors are not convinced of the negative effect of water safety and support that water security should be ensured for tackling the pandemic³⁷.

Countries with higher population densities were expected to have higher incidences. In common perception, dense areas could be vulnerable to closer contact, which leads to higher caseloads in directly transmitted infectious diseases. Our study supports this common perception, which is also supported by Bhadra et al.²⁰ and Coşkun et al.²¹. However, a study that analyzed 913 U.S. metropolitan counties²² disputed this perception by showing that the connectivity between counties was significantly associated with incidence rather than the population density. As studies are usually performed within countries^20,21,22, further studies at the country level are needed to clarify whether population density is associated with the disease outcomes.

As examined by several other studies^19,38, older age was associated with a higher CFR. Older patients with COVID-19 are more vulnerable to progress to severe disease³⁹ and a greater number of patients with severe disease could burden the national economy and healthcare capacity. Therefore, the government should have great interest in older patients with COVID-19.

Countries with fewer healthcare professionals, especially physicians, were vulnerable to CFR. It is possible to consider that an increase in CFR, resulting from the lack of healthcare professionals, could lead to the collapse of the healthcare system. Retaining a sufficient number of healthcare workers is essential to win this war⁴⁰. Therefore, the government should secure the safety and well-being of healthcare professionals in physical and psychological aspects^40,41.

Countries with higher usual tourism receipts were likely to have lower CFRs. Contrastingly, Farzanegan et al.²³ suggested that countries with higher inbound and outbound tourism are more likely to have higher number of confirmed cases and deaths. Most European countries enforced border control measures at a later stage as compared to Asia-Pacific countries⁴². Since the extraction date of COVID-19 outbreak data we used is about five months later than that of a previous study²³, it is possible for the effect of border control to be fully reflected in our study. However, effect of border control could not be fully considered, further studies which consider the characteristics of border controls implemented by countries are required.

Our study has several limitations. As COVID-19 pandemic is still ongoing, the data we used has limitation with respect to reflecting the current situation. Because the information related to COVID-19 was extracted only once, i.e., on 14 September 2020, information after this date cannot be applied in our analysis. However, by setting 14 September 2020 as data capture date, we could consider that most of the countries had gone through the first wave of COVID-19^17,18 and we could reduce the chances of biased results because of possible cocirculation of flu and COVID-19¹⁸, and because of possible effect of vaccination. We did not include national control measures as potential factors, as mitigation policies themselves have limitations in comparing effectiveness. Specifically, each country had various kinds of policies at different intensities^43,44, different initiation times^43,44, and various degrees of compliance of the public to the policy^45,46,47. Age-standardization, which is useful to fairly compare the disease outcomes across countries⁴⁸, could not be implemented in our study. This was because each country reported the outcomes with different age standards, and some countries did not report based on age group. However, including age-representing variables in the analysis models must have adjusted the differences in age structure among countries to some degree. Finally, we hardly support a definitive judgement on the effect of ethnicity across countries, as the categories of ethnic region we used were not based on social consent but were ones used by a single published article²⁴. However, because social standards in ethnic category are absent, the ethnic grouping we used was the best option to handle the ethnic categories. Genetic factors could not be investigated in our study because data regarding genetic factors related to COVID-19 was unavailable.

This study is meaningful in examining the association of ethnicity with COVID-19 health-related outcomes at the country level and highlighting that ethnicity could largely explain COVID-19 incidence and CFR. Moreover, the authors consider that this work could be used as a trigger for further research investigating the effect of different genetic predispositions across ethnicities on COVID-19 outcomes.

Data availability

Information on COVID-19 health-related outcomes is open to public. Data download is available in the following website: https://www.worldometers.info/coronavirus. This research has been conducted using COVID-19 health-related outcomes on 14 September 2020. Information on country-level indices, including demographic, and socio-economic features, global health security index, healthcare capacity, and health behaviors, is publicly available. Data could be downloaded from following websites: World Bank Open data (https://data.worldbank.org); Human Development Data (1990–2018) (http://hdr.undp.org/en/data); Countries in the world by population (https://www.worldometers.info/world-population/population-by-country); 2019 Global Health Security Index (https://www.ghsindex.org); Global Burden of Disease Study 2015 (GBD 2015) (http://ghdx.healthdata.org/record/ihme-data/gbd-2015-healthcare-access-and-quality-index-1990-2015); World Health Data Platform (https://www.who.int/data); and Water, sanitation & hygiene (WASH) data (https://data.unicef.org/resources/dataset/drinking-water-sanitation-hygiene-database). Data used in this work are available upon request to the corresponding author. The Shapefile used for Figs. 2 and 3 was obtained from “Admin 0-Countries” of Natural Earth (https://www.naturalearthdata.com/downloads/110m-cultural-vectors/). The data to create maps for academic publishing are freely available (Term of use: https://www.naturalearthdata.com/about/terms-of-use/).

References

World Health Organization Coronavirus Disease (COVID-19) Dashboard (2020). https://covid19.who.int, accessed 31 July 2021.
Yamamoto, N. & Bauer, G. Apparent difference in fatalities between Central Europe and East Asia due to SARS-COV-2 and COVID-19: Four hypotheses for possible explanation. Med. Hypotheses 144, 110160. https://doi.org/10.1016/j.mehy.2020.110160 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sarmadi, M., Marufi, N. & Kazemi Moghaddam, V. Association of COVID-19 global distribution and environmental and demographic factors: An updated three-month study. Environ. Res. 188, 109748. https://doi.org/10.1016/j.envres.2020.109748 (2020).
Article CAS PubMed PubMed Central Google Scholar
Smit, A. J. et al. Winter is coming: A southern hemisphere perspective of the environmental drivers of SARS-CoV-2 and the potential seasonality of COVID-19. Int. J. Environ. Res. Public Health 17, 5634. https://doi.org/10.3390/ijerph17165634 (2020).
Article CAS PubMed Central Google Scholar
Islam, N., Khunti, K., Dambha-Miller, H., Kawachi, I. & Marmot, M. COVID-19 mortality: A complex interplay of sex, gender and ethnicity. Eur. J. Public Health 30, 847–848. https://doi.org/10.1093/eurpub/ckaa150 (2020).
Article PubMed Google Scholar
Palaiodimos, L. et al. Severe obesity, increasing age and male sex are independently associated with worse in-hospital outcomes, and higher in-hospital mortality, in a cohort of patients with COVID-19 in the Bronx, New York. Metabolism 108, 154262. https://doi.org/10.1016/j.metabol.2020.154262 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lee, H., Lee, J.-R., Jung, H. & Lee, J. Y. Factors associated with incidence, mortality, and case fatality of COVID-19: A natural experimental study in South Korea. https://ssrn.com/abstract=3675411 (2020).
Gold, J. A. W. et al. Race, ethnicity, and age trends in persons who died from COVID-19—United States, May–August 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 1517–1521. https://doi.org/10.15585/mmwr.mm6942e1 (2020).
Article CAS PubMed PubMed Central Google Scholar
Caussy, C. et al. Prevalence of obesity among adult inpatients with COVID-19 in France. Lancet Diabetes Endocrinol. 8, 562–564. https://doi.org/10.1016/S2213-8587(20)30160-1 (2020).
Article CAS PubMed PubMed Central Google Scholar
Caballero, A. E. et al. COVID-19 in people living with diabetes: An international consensus. J. Diabetes Complications 34, 107671. https://doi.org/10.1016/j.jdiacomp.2020.107671 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mantovani, A., Byrne, C. D., Zheng, M.-H. & Targher, G. Diabetes as a risk factor for greater COVID-19 severity and in-hospital death: A meta-analysis of observational studies. Nutr. Metab. Cardiovasc. Dis. 30, 1236–1248. https://doi.org/10.1016/j.numecd.2020.05.014 (2020).
Article CAS PubMed PubMed Central Google Scholar
Abedi, V. et al. Racial, economic, and health inequality and COVID-19 infection in the United States. J. Racial Ethn. Health Disparities https://doi.org/10.1007/s40615-020-00833-4 (2020).
Article PubMed PubMed Central Google Scholar
Drefahl, S. et al. A population-based cohort study of socio-demographic risk factors for COVID-19 deaths in Sweden. Nat. Commun. 11, 5097. https://doi.org/10.1038/s41467-020-18926-3 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hawkins, D. Social determinants of COVID-19 in Massachusetts, United States: An ecological study. J. Prev. Med. Public Health 53, 220–227. https://doi.org/10.3961/jpmph.20.256 (2020).
Article PubMed PubMed Central Google Scholar
Tirupathi, R. et al. COVID-19 disparity among racial and ethnic minorities in the US: A cross sectional analysis. Travel Med. Infect. Dis. 38, 101904. https://doi.org/10.1016/j.tmaid.2020.101904 (2020).
Article PubMed PubMed Central Google Scholar
Worldometer COVID-19 CORONAVIRUS PANDEMIC (2020). https://www.worldometers.info/coronavirus, accessed 14 Sept 2020.
Fan, G. et al. Decreased case fatality rate of COVID-19 in the second wave: A study in 53 countries or regions. Transbound. Emreg. Dis. https://doi.org/10.1111/tbed.13819 (2020).
Article Google Scholar
Burki, T. K. Double threat of COVID-19 and influenza. Lancet Respir. Med. 8, e97. https://doi.org/10.1016/S2213-2600(20)30508-7 (2020).
Article CAS PubMed Central Google Scholar
Chaudhry, R., Dranitsaris, G., Mubashir, T., Bartoszko, J. & Riazi, S. A country level analysis measuring the impact of government actions, country preparedness and socioeconomic factors on COVID-19 mortality and related health outcomes. EClinicalMedicine 25, 100464. https://doi.org/10.1016/j.eclinm.2020.100464 (2020).
Article PubMed PubMed Central Google Scholar
Bhadra, A., Mukherjee, A. & Sarkar, K. Impact of population density on Covid-19 infected and mortality rate in India. Model. Earth Syst. Environ. https://doi.org/10.1007/s40808-020-00984-7 (2020).
Article PubMed PubMed Central Google Scholar
Coşkun, H., Yıldırım, N. & Gündüz, S. The spread of COVID-19 virus through population density and wind in Turkey cities. Sci. Total Environ. 751, 141663. https://doi.org/10.1016/j.scitotenv.2020.141663 (2021).
Article ADS CAS PubMed Google Scholar
Hamidi, S., Sabouri, S. & Ewing, R. Does density aggravate the COVID-19 pandemic?. J. Am. Plann. Assoc. 86, 495–509. https://doi.org/10.1080/01944363.2020.1777891 (2020).
Article Google Scholar
Farzanegan, M. R., Gholipour, H. F., Feizi, M., Nunkoo, R. & Andargoli, A. E. International tourism and outbreak of coronavirus (COVID-19): A cross-country analysis. J. Travel Res. 17, 5409. https://doi.org/10.1177/0047287520931593 (2020).
Article Google Scholar
Rawshani, A. et al. Impact of ethnicity on progress of glycaemic control in 131 935 newly diagnosed patients with type 2 diabetes: a nationwide observational study from the Swedish National Diabetes Register. BMJ Open 5, e007599. https://doi.org/10.1136/bmjopen-2015-007599 (2015).
Article PubMed PubMed Central Google Scholar
World Bank Open Data (2020). https://data.worldbank.org. Accessed 9 September 2020.
United Nations Development Programme Human Development Data (1990–2018) (2020). http://hdr.undp.org/en/data, accessed 5 Sept 2020.
Worldometer Countries in the world by population (2020). https://www.worldometers.info/world-population/population-by-country, accessed 14 Sept 2020.
Bhopal, R. S. Migration, Ethnicity, Race, and Health in Multicultural Societies (Oxford University Press, 2014).
Google Scholar
2019 Global Health Security Index (2020). https://www.ghsindex.org, accessed 31 Aug 2020.
Institute for Health Metrics and Evaluation Global Burden of Disease Study 2015 (GBD 2015) Healthcare Access and Quality Index Based on Amenable Mortality 1990–2015 (2017). http://ghdx.healthdata.org/record/ihme-data/gbd-2015-healthcare-access-and-quality-index-1990-2015, accessed 31 Aug 2020.
World Health Organization World Health Data Platform (2020). https://www.who.int/data, accessed 9 Sept 2020.
United Nations International Children’s Emergency Fund Water, sanitation & hygiene (WASH) data (2020). https://data.unicef.org/resources/dataset/drinking-water-sanitation-hygiene-database, accessed 31 Aug 2020.
World Health Organization. Interim guidance: Water, sanitation, hygiene, and waste management for SARS-CoV-2, the virus that causes COVID-19 (2020). https://www.who.int/publications/i/item/WHO-2019-nCoV-IPC-WASH-2020.4, accessed 31 Aug 2020.
Chen, S., Yang, J., Yang, W., Wang, C. & Bärnighausen, T. COVID-19 control in China during mass population movements at New Year. Lancet 395, 10226. https://doi.org/10.1016/S0140-6736(20)30421-9 (2020).
Article Google Scholar
Haider, N. et al. The Global Health Security index and Joint External Evaluation score for health preparedness are not correlated with countries’ COVID-19 detection response time and mortality outcome. Epidemiol. Infect. 148, e210. https://doi.org/10.1017/s0950268820002046 (2020).
Article CAS PubMed Google Scholar
Abbey, E. J. et al. The Global Health Security Index is not predictive of coronavirus pandemic responses among Organization for Economic Cooperation and Development countries. PLoS ONE 15, e0239398. https://doi.org/10.1371/journal.pone.0239398 (2020).
Article CAS PubMed PubMed Central Google Scholar
Staddon, C. et al. Water insecurity compounds the global coronavirus crisis. Water Int. 45, 416–422. https://doi.org/10.1080/02508060.2020.1769345 (2020).
Article Google Scholar
Gilbert, M. et al. Preparedness and vulnerability of African countries against importations of COVID-19: A modelling study. Lancet 395, 871–877. https://doi.org/10.1016/S0140-6736(20)30411-6 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liu, K., Chen, Y., Lin, R. & Han, K. Clinical features of COVID-19 in elderly patients: A comparison with young and middle-aged patients. J. Infect. 80, e14–e18. https://doi.org/10.1016/j.jinf.2020.03.005 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ehrlich, H., McKenney, M. & Elkbuli, A. Protecting our healthcare workers during the COVID-19 pandemic. Am. J. Emerg. Med. 38, 1527–1528. https://doi.org/10.1016/j.ajem.2020.04.024 (2020).
Article PubMed PubMed Central Google Scholar
Kannampallil, T. G. et al. Exposure to COVID-19 patients increases physician trainee stress and burnout. PLoS ONE 15, e0237301. https://doi.org/10.1371/journal.pone.0237301 (2020).
Article CAS PubMed PubMed Central Google Scholar
Han, E. et al. Lessons learnt from easing COVID-19 restrictions: An analysis of countries and regions in Asia Pacific and Europe. Lancet 396, 1525–1534. https://doi.org/10.1016/s0140-6736(20)32007-9 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chowdhury, R. et al. Dynamic interventions to control COVID-19 pandemic: A multivariate prediction modelling study comparing 16 worldwide countries. Eur. J. Epidemiol. 35, 389–399. https://doi.org/10.1007/s10654-020-00649-w (2020).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. et al. The temporal association of introducing and lifting non-pharmaceutical interventions with the time-varying reproduction number (R) of SARS-CoV-2: A modelling study across 131 countries. Lancet Infect. Dis. https://doi.org/10.1016/S1473-3099(20)30785-4 (2020).
Article PubMed PubMed Central Google Scholar
Roma, P. et al. How to improve compliance with protective health measures during the COVID-19 outbreak: Testing a moderated mediation model and machine learning algorithms. Int. J. Environ. Res. Public Health 17, 7252. https://doi.org/10.3390/ijerph17197252 (2020).
Article CAS PubMed Central Google Scholar
Van Rooij, B. et al. Compliance with COVID-19 mitigation measures in the United States. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3582626 (2020).
Zajenkowski, M., Jonason, P. K., Leniarska, M. & Kozakiewicz, Z. Who complies with the restrictions to reduce the spread of COVID-19?: Personality and perceptions of the COVID-19 situation. Pers. Individ. Dif. 166, 110199. https://doi.org/10.1016/j.paid.2020.110199 (2020).
Article PubMed PubMed Central Google Scholar
Kim, D. H., Choe, Y. J. & Jeong, J. Y. Understanding and interpretation of case fatality rate of coronavirus disease 2019. J. Korean Med. Sci. 35, e137–e137. https://doi.org/10.3346/jkms.2020.35.e137 (2020).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the Research Program funded by the Korea Centers for Disease Control and Prevention (2020-ER5313-00). The authors declare no competing interests.

Author information

Authors and Affiliations

Department of Preventive Medicine, Korea University College of Medicine, Seoul, Republic of Korea
Jeehyun Kim, Kwan Hong, Sujin Yum, Raquel Elizabeth Gómez Gómez, Jieun Jang & Byung Chul Chun
Graduate School of Public Health, Korea University, Seoul, Republic of Korea
Jeehyun Kim, Kwan Hong, Sujin Yum, Raquel Elizabeth Gómez Gómez & Byung Chul Chun
Transdisciplinary Major in Learning Health Systems, Department of Healthcare Sciences, Graduate School, Korea University, Seoul, Republic of Korea
Jeehyun Kim & Byung Chul Chun
Division of Infectious Diseases, Department of Internal Medicine, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
Sun Hee Park
Department of Pediatrics, Korea University Anam Hospital, Seoul, Republic of Korea
Young June Choe
Department of Preventive Medicine, Konyang University College of Medicine, Daejeon, Republic of Korea
Sukhyun Ryu
Division of Infectious Diseases, Department of Internal Medicine, Korea University Ansan Hospital, Ansan, Republic of Korea
Dae Won Park
Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Internal Medicine, Korea University Guro Hospital, Seoul, Republic of Korea
Young Seok Lee
Center for Preventive Medicine and Public Health, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
Heeyoung Lee
Department of Pediatrics, Inha University School of Medicine, Incheon, Republic of Korea
Dong Hyun Kim
Department of Social and Preventive Medicine, Hallym University College of Medicine, Chuncheon, Gangwon, Republic of Korea
Dong-Hyun Kim

Authors

Jeehyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kwan Hong
View author publications
You can also search for this author in PubMed Google Scholar
Sujin Yum
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Elizabeth Gómez Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Jieun Jang
View author publications
You can also search for this author in PubMed Google Scholar
Sun Hee Park
View author publications
You can also search for this author in PubMed Google Scholar
Young June Choe
View author publications
You can also search for this author in PubMed Google Scholar
Sukhyun Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Dae Won Park
View author publications
You can also search for this author in PubMed Google Scholar
Young Seok Lee
View author publications
You can also search for this author in PubMed Google Scholar
Heeyoung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Dong Hyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Hyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Byung Chul Chun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.K. and B.C.C. designed the study. J.K., S.Y., H.K. and R.G. collected data. J.K. analyzed the data and wrote the first draft of the manuscript. B.C.C., K.H., S.Y., R.G., J.J., S.H.P., Y.J.C., S.R., D.W.P., Y.S.L., H.L., D.H.K., and D.-H.K. critically reviewed the manuscript and interpreted the results of analysis. J.K. and B.C.C. revised final manuscript. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Byung Chul Chun.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, J., Hong, K., Yum, S. et al. Factors associated with the difference between the incidence and case-fatality ratio of coronavirus disease 2019 by country. Sci Rep 11, 18938 (2021). https://doi.org/10.1038/s41598-021-98378-x

Download citation

Received: 19 January 2021
Accepted: 06 September 2021
Published: 23 September 2021
DOI: https://doi.org/10.1038/s41598-021-98378-x

This article is cited by

Health system characteristics and COVID-19 performance in high-income countries
- Iris Moolla
- Heikki Hiilamo
BMC Health Services Research (2023)
Time-dependent risk of COVID-19 death with overwhelmed health-care capacity in Japan, 2020–2022
- Katsuma Hayashi
- Hiroshi Nishiura
BMC Infectious Diseases (2022)
Analyzing the GHSI puzzle of whether highly developed countries fared worse in COVID-19
- Sofija Markovic
- Igor Salom
- Marko Djordjevic
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.