Introduction

Since the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China, COVID-19 has become a major global health problem. Clusters of COVID-19 pneumonia cases led to the eventual identification of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The World Health Organization (WHO) declared COVID-19 a pandemic on March 11, 2020. As of December 29, 2020, there were 81,278,104 confirmed cases from 220 countries, with 1,774,388 deaths, and an overall projected case fatality rate of 2.2%1.

The COVID-19 pandemic has led to shortages of medical resources, including hospital beds, ventilators, and personal protective equipment in several countries2,3,4. Therefore, accurate diagnosis and outcome prediction are essential to decrease the burden on national healthcare systems and provide optimal care for COVID-19 patients.

During the early period of the outbreak, some case series from China reported the epidemiological and clinical characteristics of COVID-195,6,7. Since then, some cohort studies have described potential predictors of mortality and poor clinical outcomes among patients with COVID-198,9,10. Recently, some large cohort studies including nationwide data (United Kingdoms, Germany and France, respectively) reported the clinical characteristics and factors associated with outcomes among COVID-19 patients11,12,13. Although this nationwide COVID-19 evidence has emerged, there are still controversies about risk factors of mortality and poor clinical outcomes among COVID-19 patients.

By using the data collected by the Korea Disease Control and Prevention Agency (KDCA), we presented details of all patients with laboratory-confirmed COVID-19 and a definite clinical outcome (survival discharge or death) as of April 30, 2020. We aimed to explore the risk factors of mortality and fatal adverse outcomes among COVID-19 patients in Korea.

Results

Demographic and clinical characteristics

A total of 5628 patients with confirmed COVID-19 were recorded in the KDCA registry during the study period. After excluding 7 patients who underwent laboratory tests for COVID-19 but died before the final confirmation of SARS-CoV-2 [using real-time reverse transcription polymerase chain reaction (RT-PCR) methods], we included 5621 patients in the final analysis (Fig. 1). Of the 5621 patients, 5387 were discharged while 234 died in the hospital (Table 1 and Fig. 1). The most common age group was 50–59 years, followed by 20–29 and 60–69 years. Of the 5621 patients, 2317 (41.2%) were men. Comorbidities were present in nearly half of the patients, including hypertension (21.3%), diabetes (12.3%), dementia (4.0%), chronic cardiac disease (3.2%), and malignancy (2.5%). The most common symptoms at presentation were cough and sputum, followed by fever and headache. Overall, the median isolation period was 24 days [interquartile range (IQR) 18–32]. The median survival (isolation) period among non-survivors was 11 days (IQR 6–21) after confirmation of COVID-19. Comparisons of clinical and epidemiological characteristics between survivors and non-survivors are shown in Table 1. Supplementary Table 1 presents the levels of complications that occurred during the isolation period among all patients. Patients who had no limitation of activity were 4482 (79.7%) while those requiring oxygen supply was 809 (14.4%).

Figure 1
figure 1

Flowchart of the study population.

Table 1 Baseline characteristics of COVID-19 patients in Korea.

Kaplan–Meier survival curves according to age and sex

Mortality was significantly higher in elderly patients (age ≥ 65 years) than in younger patients (age < 65 years) (log-rank test, p < 0.001) (Fig. 2a) and in male patients than in female patients (log-rank test, p < 0.001) (Fig. 2b).

Figure 2
figure 2

Kaplan–Meier survival curves stratified by age (a) and sex (b).

Factors associated with mortality and fatal adverse outcomes

Logistic regression analysis

In univariable logistic regression analysis, the odds of mortality were higher in patients aged ≥ 60 years, and those with altered mentality, dementia, heart failure, and chronic kidney disease (Table 2). Sex, isolation period, obesity, heart rate, body temperature, fever, dyspnoea, diabetes, hypertension, chronic cardiac disease, asthma, chronic obstructive pulmonary disease, malignancy, and leucocytosis, were also associated with mortality (Table 2). Multivariable logistic regression analysis results showed increasing odds of mortality with age ≥ 60 years, short isolation period, dyspnoea, altered mentality, diabetes, malignancy, dementia, intensive care unit (ICU) admission, and lower levels of lymphocyte, haemoglobin, or platelet (Table 2). The multiple regression equation including all the potential variables effectively predicted mortality in COVID-19 patients [area under the curve (AUC) = 0.979, 95% confidence interval (CI) 0.964–0.993].

Table 2 Logistic regression analysis for mortality among COVID-19 patients.

Univariate analysis showed that the odds of fatal adverse outcomes were higher in patients aged ≥ 60 years, and those with altered mentality, dementia, heart failure, and chronic kidney disease (Table 3). Sex, isolation period, obesity, heart rate, body temperature, fever, dyspnoea, diabetes, hypertension, chronic cardiac disease, asthma, chronic obstructive pulmonary disease, malignancy, and leucocytosis, were also associated with fatal adverse outcomes (Table 3). Multivariable logistic regression analysis showed increasing odds of fatal adverse outcomes associated with age ≥ 60 years, short isolation period, dyspnoea, altered mentality, diabetes, hypertension, malignancy, dementia, ICU admission, and lower levels of lymphocytes, haemoglobin, or platelets (Table 3).

Table 3 Logistic regression analysis of factors associated with fatal adverse outcomes (invasive mechanical ventilation, multi-organ failure, ECMO, and death) among COVID-19 patients.

Cox proportional hazards model

The multivariable Cox proportional hazards model showed increasing hazards of mortality with dementia, ICU admission, age ≥ 60 years, malignancy, and dyspnoea (Table 4). The hazard ratio plot for mortality using the multivariable Cox proportional hazards model is presented in Fig. 3a. Body mass index (BMI) 18.5–22.9 or 23.0–24.9, headache, and higher levels of lymphocytes or platelets were associated with decreasing mortality hazards. The multivariable Cox proportional hazards model showed that increasing hazards of fatal adverse outcomes were associated with dementia, ICU admission, age ≥ 60 years, malignancy, and dyspnoea (Table 5). The hazard ratio plot for fatal adverse outcomes using the multivariable Cox proportional hazards model is presented in Fig. 3b. BMI, 18.5–22.9 or 23.0–24.9, and higher levels of lymphocytes or platelets were associated with decreasing hazards of fatal adverse outcomes.

Table 4 Cox proportional hazards model for mortality among COVID-19 patients.
Figure 3
figure 3

Hazards ratio plot in prediction of mortality (a) and fatal adverse outcomes (b).

Table 5 Cox proportional hazards model for fatal adverse outcomes among COVID-19 patients.

Discussion

To our knowledge, this is one of the largest nationwide cohort studies in COVID-19 patients with definite clinical outcomes. This nationwide retrospective cohort study identified the risk factors of mortality and fatal adverse outcomes among patients with laboratory-confirmed COVID-19 in Korea. In particular, dementia, ICU admission, age ≥ 60 years, malignancy, and dyspnoea, were associated with higher odds of mortality. In addition, BMI 18.5–22.9 or 23.0–24.9, and higher levels of lymphocytes or platelets, were associated with lower odds of poor clinical outcomes.

The present study showed that older age (≥ 60 years) was associated with mortality and fatal adverse outcomes in patients with COVID-19. Although individuals > 65 years comprise 17% of the total population in the United States, they account for 31% of COVID-19 patients, 45% of hospitalized patients, 53% of those on intensive care unit admission, and 80% of those that died due to COVID-1914. This suggests that older individuals are susceptible to developing COVID-19 and have poor clinical outcomes compared with the general population. Our results showed that individuals ≥ 60 accounted for 32% of COVID-19 and 92% of deaths due to COVID-19. In accordance with our study, recent studies reported that older age is significantly associated with mortality among COVID-19 patients9,10.

Previous studies reported that male sex is significantly associated with mortality9,15. Our data showed that although mortality was significantly higher in male patients than in female patients in the Kaplan–Meier survival curve analysis, male sex was not a significant risk factor for mortality in the Cox proportional hazards model. We recommend a larger worldwide cohort study to explore this discrepancy between our results and those of previous studies.

The risk of serious disease and mortality in COVID-19 increases with the presence of comorbidities16,17,18,19,20. Our data showed malignancy as an independent risk factor for mortality in the multivariable Cox proportional hazards model with a hazard ratio (HR) of 3.054. According to a recent study from China, cancer patients with COVID-19 had 3.5 times higher risk of requiring ICU admission or mechanical ventilation, compared to the general population21. Cancer patients are generally immunocompromised and are at higher risk of COVID-19-related fatal events in comparison to the cancer-free population21,22.

Our study also showed that dementia was associated with mortality and fatal adverse outcomes. Some reasons can be proposed to explain our results. First, most of the patients suffering dementia are old and have other comorbidities that could result in poor clinical outcomes23. Second, people with dementia would require the support of dementia caregivers in order to keep preventive and healthcare measures. However, this COVID-19 pandemic has made the availability of caregivers limited24. Third, persistent inflammatory state of dementia patients may increase peripheral blood white blood cells and neutrophil counts, which was associated with higher mortality from COVID-1925. Fourth, the ApoE e4 genotype, which were associated with patients with dementia26, can increase the risk of having acute respiratory distress syndrome in COVID-19 patients. Similar to our data, a previous study reported that mortality due to COVID-19 was significantly higher in elderly patients with dementia27. Another study reported an in-hospital mortality of 61% among hospitalized patients with dementia and COVID-1928. In our study, mortality during the isolation period was 33.5% among patients with COVID-19. Two large cohort studies from the United Kingdom showed that pre-existing dementia was a significant predictor of COVID-19 mortality11,20. Furthermore, a recent study, including global big data on dementia and COVID-19 in 185 countries revealed a significant correlation between dementia burden and COVID-19-related death29.

In the present study, although diabetes mellitus (DM) was not a predictor of mortality in the Cox proportional hazards model, it was associated with mortality and fatal adverse outcomes in multivariable logistic regression analysis. Some case series reported that patients with pre-existing DM are vulnerable to infection and are at a higher risk of death from COVID-195,15,30,31,32. According to a recent meta-analysis, diabetes in patients with COVID-19 is associated with a two-fold increase in mortality as well as severity of COVID-19, as compared to those in non-diabetics33. Similar to the prior meta-analysis, our results showed that DM was associated with increased odds of mortality of 2.728 and fatal adverse outcomes of 1.838 in multivariable logistic regression analysis.

Our study demonstrated that dyspnoea and ICU admission were associated with mortality and fatal adverse outcomes in both the multivariable logistic regression analysis and Cox proportional hazards model. Data from urban communities in the United States revealed that patients admitted to the ICU had a higher incidence of respiratory failure requiring invasive mechanical ventilation and mortality compared with patients in the general ward9. Another study showed that dyspnoea at presentation was associated with hospitalization and the need for ICU management9. According to a recent meta-analysis, dyspnoea was the only symptom predictive of severe COVID-19 and ICU admission19. Similar to the previous meta-analysis, our study showed that dyspnoea was the only symptom predictive of fatal adverse outcomes.

A recent study using a systemic review and meta-analysis showed that hematologic and inflammatory markers such as procalcitonin, C-reactive protein, D-dimer, and lactate dehydrogenase and decreased albumin could help predict severe outcomes in COVID-1934. Our study showed that lower levels of lymphocytes or platelets were associated with increasing hazards of mortality and fatal adverse outcomes. COVID-19 patients with low levels of lymphocyte and platelets should be monitored closely to minimize the risk of progression to severe disease.

The present study has some limitations. First, due to the retrospective nature of the study design and data extraction procedure, some laboratory data were missing or unavailable. Therefore, their influence might have been underestimated in predicting mortality and fatal adverse outcomes. Second, our data did not include information regarding radiologic findings, which might reveal prognostic factors35. Third, although treatment strategy can have a significant impact on clinical outcome36,37,38, we assumed that enrolled patients all had adequate and timely management. Fourth, although our study included over 5600 patients with COVID-19 from Korea, there was a lack of time-series analysis for potential variables. Fifth, by excluding patients still under isolation as of April 30, 2020, the case fatality ratio in our data could not reflect the true mortality of COVID-19. Sixth, our study did not include data regarding the status of therapy for each comorbidity. For instance, metformin use was associated with reduced mortality39, but statin use was not40.

Conclusions

This nationwide cohort study showed that dementia, ICU admission, age ≥ 60 years, malignancy, and dyspnoea were strong predictors of death and fatal adverse outcomes in COVID-19 patients. The multiple regression equation including clinical and epidemiological variables predicted mortality in such patients. These findings can help clinicians to identify COVID-19 patients with poor prognosis.

Methods

Study design and population

This retrospective cohort study included a nationwide cohort data of COVID-19 patients collected by KDCA, Korea. All patients who were diagnosed with COVID-19 were screened, and those who died or were discharged between January 20, 2020 (i.e., when the first patients were diagnosed in Korea), and April 30, 2020, were included in the present study. Since KDCA is a governmental institution responsible for surveillance and control of COVID-19 in Korea, our study enrolled all the patients who were diagnosed with COVID-19 and had a definite clinical outcome (discharged or dead) at the early stage of the outbreak.

This study was conducted according to the principles of the Declaration of Helsinki and was approved by the Institutional Review Board (IRB) of the Korea University Ansan Hospital (IRB number 2020AS0245). The requirement for informed consent was waived by the IRB because of the retrospective nature of the study.

Data source

Clinical, epidemiological, demographic, laboratory, and outcome data were collected by KDCA since the diagnosis of the first patients with COVID-19 on January 20, 2020. All data were checked by the physicians and the KDCA preventive medicine experts. We obtained permission to access the encoded KDCA server for three weeks to extract and analyse the data during the limited period. These data cannot be shared publicly according to the KDCA directive, which prohibits researchers from such. KCDA provided anonymised clinical data for the public interest.

Laboratory results

For the laboratory confirmation of SARS-CoV-2, real-time RT-PCR was performed using throat-swab specimens. The criteria for discharge were two negative throat-swab samples for SARS-CoV-2 RNA, absence of fever for at least 3 days, definite improvement of pneumonia on chest computed tomography, and stable vital signs. Routine blood examination included a complete blood count.

Definitions

We defined fever as an axillary temperature of at least 37.5 °C. Fatal adverse outcomes were defined as at least one of invasive mechanical ventilation, multi-organ failure (MOF), ECMO, or death during the isolation period. In the present study, MOF refers to altered function in two or more organ systems after the diagnosis of COVID-19.

Statistical analysis

Statistical analyses were performed using R 4.0.2 (the R Foundation for Statistical Computing, Vienna, Austria). For all analyses, a 2-tailed test with a p value < 0.05 was considered statistically significant. Continuous variables were presented as median (IQR) or mean [standard deviation (SD)], and categorical variables were presented as frequencies (percentages). We used the t test, Mann–Whitney U test, χ2 test, or Fisher’s exact test to compare differences between survivors and non-survivors, as appropriate. The risk factors for death and fatal adverse outcomes in patients were determined using univariable and multivariable logistic regression models and presented as odds ratios (ORs) and 95% CIs. We excluded variables from the univariable logistic regression analysis if their between-group differences (the t test, Mann–Whitney U test, χ2 test, or Fisher’s exact test) were not significant. We considered variables with p ≤ 0.05 in the univariate analysis as candidate variables for the multivariable analysis. Kaplan–Meier curves and log-rank tests for age and sex were also performed for survival analyses. To explore whether potential variables were independently associated with shortened survival, we used univariable and multivariable Cox proportional-hazards models to calculate the HR and 95% CI for each variable.