SARS2 simplified scores to estimate risk of hospitalization and death among patients with COVID-19

Dashti, Hesam; Roche, Elise C.; Bates, David William; Mora, Samia; Demler, Olga

doi:10.1038/s41598-021-84603-0

Download PDF

Article
Open access
Published: 02 March 2021

SARS2 simplified scores to estimate risk of hospitalization and death among patients with COVID-19

Scientific Reports volume 11, Article number: 4945 (2021) Cite this article

8398 Accesses
14 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Although models have been developed for predicting severity of COVID-19 from the medical history of patients, simplified models with good accuracy could be more practical. In this study, we examined utility of simpler models for estimating risk of hospitalization of patients with COVID-19 and mortality of these patients based on demographic characteristics (sex, age, race, median household income based on zip code) and smoking status of 12,347 patients who tested positive at Mass General Brigham centers. The corresponding electronic records were queried (02/26–07/14/2020) to construct derivation and validation cohorts. The derivation cohort was used to fit generalized linear models for estimating risk of hospitalization within 30 days of COVID-19 diagnosis and mortality within approximately 3 months for the hospitalized patients. In the validation cohort, the model resulted in c-statistics of 0.77 [95% CI 0.73–0.80] for hospitalization, and 0.84 [95% CI 0.74–0.94] for mortality among hospitalized patients. Higher risk was associated with older age, male sex, Black ethnicity, lower socioeconomic status, and current/past smoking status. The models can be applied to predict the absolute risks of hospitalization and mortality, and could aid in individualizing the decision making when detailed medical history of patients is not readily available.

A novel scale based on biomarkers associated with COVID-19 severity can predict the need for hospitalization and intensive care, as well as enhanced probabilities for mortality

Article Open access 04 June 2023

Developing and validating COVID-19 adverse outcome risk prediction models from a bi-national European cohort of 5594 patients

Article Open access 05 February 2021

Trends and associated factors for Covid-19 hospitalisation and fatality risk in 2.3 million adults in England

Article Open access 29 April 2022

Introduction

On 29 August 2020, the Centers for Disease Control and Prevention (CDC) reported 291,985 new COVID-19 weekly cases in the U.S. that increased the total number of cases in the U.S. to 5,890,532 patients¹. At the rise of the new surge in cases, designing models for predicting severity of COVID-19 illness is essential for public health strategies, as risk scores could enable allocations of limited medical resources and preparedness of healthcare facilities. The CDC reports age and medical comorbidities (e.g. chronic kidney disease, heart conditions, immunocompromised conditions, obesity, etc.) as leading risk factors of severe illness in patients with COVID-19². The importance of these risk markers has been studied^{3,4,5,6,7,8,9,10,11,12,13,14,15,16}, and significance of associations between severity of illness and different patient characteristics have been demonstrated. These studies reported association between higher age and severe illness, pre-pandemic health disparities and higher risk of severe COVID-19 outcomes in blacks and racial minorities^9,10,17, importance of obesity¹⁸ and its impacts on infected children and adults^8,19,20, increased severity of COVID-19 illness in immunodeficient patients^4,11, the role of preexisting cardiovascular disease (CVD) and the use of cardiovascular medications^21,22,23,24 on severity of outcomes, and effects of kidney and pulmonary diseases³. Smoking has also been associated with COVID-19 outcomes^25,26,27,28. The largest COVID-19 cohort study on more than 10,000 COVID-19 related deaths in the UK¹² indicated a few preexisting medical conditions were significantly associated with severity in non-white and low socioeconomical regions. In another study on mortality of patients with COVID-19 in intensive care units (ICU) in the Lombardy region of Italy, older age, male sex, and measured arterial oxygenation parameters on admission to ICU were independently associated with mortality, while they also identified risk factors from patients’ medical history (chronic obstructive pulmonary disease, hypercholesterolemia, and type 2 diabetes)⁵. In a similar study in the U.S., mortality rate of ICU patients was associated to older age, male sex, high body mass index, arterial oxygenation, liver and kidney disfunction on admission, and medical history of coronary artery disease and active cancer were independently associated with mortality⁷. The 4C mortality risk score²⁹ introduced a model that uses age, sex, respiratory rate, peripheral oxygen saturation, Glasgow coma score, number of comorbidities, urea and C-reactive protein concentrations to estimate risk of mortality among hospitalized patients. Compared to previous models that utilized comprehensive lists of potential severity risk factors, the 4C Mortality Score could achieve high accuracy in the UK²⁹.

In these studies, the list of investigated and recorded risk markers from the medical history of patients varied, which could be due to the complexity and challenges associated with extracting phenotypes from electronic health records (EHR) data^30,31,32,33. In addition, recent surges in the number of patients with COVID-19 combined with the increasingly limited medical resources and challenges related to clinic- or hospital-based assessments highlight the need for simplified home-based prediction models to identify higher risk patients. Hence, simplified models that can accurately predict severity of the illness without the need of detailed examination of medical history could be more practical. In addition, patient characteristics on admission have been demonstrated to be strongly associated with the severity of illness, and the most common risk markers have been demographic variables. Therefore, we hypothesized that simplified models may provide a fast and reliable prediction of hospitalization of patients with COVID-19 and mortality among these patients. We examined this hypothesis using demographic variables and smoking status of patients tested positive for COVID-19 at Mass General Brigham (MGB) medical centers, Massachusetts, USA.

Results

The examined population contained N = 12,347 patients tested positive for COVID-19 at MGB facilities. This population consists of 42.77% white, 15.91% black, 9.05% Hispanic, and 32.28% other/unknown races. Cumulative endpoints were 3401 hospitalized patients, from which 509 were deceased. Characteristics of these patients are shown in Table 1.

Table 1 Characteristics of N = 12,347 patients with COVID-19 from the Mass General Brigham electronic health records.

Full size table

Predicting risk of hospitalization

The fitted generalized linear model (GLM) in the derivation cohort of MGB’s non-employees (N = 10,496, 30.46% hospitalized) indicated significant associations between the examined variables and hospitalization (Table 2). The odds ratios (OR) indicated higher risks of hospitalization for older and male patients. Compared with white patients, Hispanic patients had lower risk of hospitalization while black patients were at the highest risk (test of trend p-value < 0.001). Although the OR of median household income was close to 1, higher income was associated with lower risk of hospitalization. Test for trend in smoking status was significant (p-value < 0.001) with current smokers at the highest risk, followed by former smokers, and finally non-smokers at a lower risk of hospitalization.

Table 2 Adjusted odds ratios of the examined variables for predicting risk of hospitalization among patients with COVID-19 and risk of mortality for the hospitalized patients (N = 10,496 patients; 30.46% hospitalized).

Full size table

Examining this model in the validation cohort of MGB employees (N = 1851, 11.02% hospitalized) showed an area under the curve (AUC) of 0.77 [95% CI 0.73–0.80] (Supplementary Fig. 1a). The optimal predicted probability cutoff for discriminating between the two groups was 0.29, and the second optimal cutoff for identifying an intermediate risk group was 0.16. After applying these cutoffs on the MGB employees, the resulting receiver operating characteristic curve had an AUC of 0.73 [95% CI 0.70–0.76]. The model was well-calibrated in the validation cohort, based on the Hosmer–Lemeshow goodness of fit (GOF) test, p-value of 0.11. The GOF test was conducted after performing recalibration to adjust for different event rates in the derivation and validation cohorts. The corresponding calibration plot is shown in Supplementary Fig. 1b. After categorizing age (0–29, 30–59, 60–79, ≥ 80; years) and median household income (< 60, 60–80, ≥ 80; $1000), a GLM was fit on the derivation cohort and the model performed consistently with the main model (AUC in validation set: 0.75 [95% CI 0.71–0.78]). The ORs of this model were consistent with the main model (Supplementary Table 1). Heatmap of risk scores according to this categorization of age and median income is presented in Fig. 1a. Figure 1b shows the corresponding predicted probabilities of the categorized patient characteristics.

Predicting mortality

The GLM model (Table 2) was fit to predict death among hospitalized patients with COVID-19 (N = 3401, 14.97% deceased). The AUC was 0.841 [95% CI 0.74–0.94] (Supplementary Fig. 1c). The optimal predicted probability cutoff point for distinguishing deceased vs. alive hospitalized patients was 0.10, and the second cutoff was 0.06. Applying these cutoffs resulted in AUC of 0.837 [95% CI 0.75–0.92]. Based on the Hosmer–Lemeshow GOF test, the model is well calibrated, p-value of 0.6 (Supplementary Fig. 1d).

Sensitivity analyses

Effects of MGB’s change of policies in COVID-19 testing criteria before and after April 29, 2020 were considered. Two GLM models were trained on MGB non-employees who were tested for COVID-19 before (N = 6624, 33.57% hospitalized) and after (N = 3872, 25.13% hospitalized) April 29, 2020 that showed similar trends to the main model (Supplementary Table 2). Although the OR for median household incomes remained close to 1, the corresponding OR of the after April 29th cohort showed a different direction (OR 1.04 [95% CI 1.01–1.07], p-value 0.005) compared to the main model (OR 0.98 [95% CI 0.96–0.99], p-value 0.007). The ORs of the other characteristics (age, sex, race, and smoking) from the main model were confirmed in both before and after cohorts. We examined performance of the model predicting mortality of hospitalized patients with reference COVID-19 date before (N = 2379, 16.98% deceased) and after (N = 1022, 10.27% deceased) April 29th, 2020 (results not shown here). This analysis showed a good AUC of 0.76 [95% CI 0.72–0.81] for the model fit in the former group but evaluated in the latter group. For the patients tested after April 29th, the model showed an AUC of 0.81 [95% CI 0.78–0.83], when evaluated among patients tested prior to April 29, 2020. Effect sizes were consistent with the main model reported in Table 2, except income that was somewhat attenuated and no longer significant in the post April 29, 2020 subset (Supplementary Table 4).

Risk groups

The optimal predicted probability cutoffs for hospitalization of patients with COVID-19 were 0.29 and 0.16, and 0.10 and 0.06 when predicting mortality among hospitalized patients. These cutoffs were used to define low, intermediate, and high-risk groups. The beta coefficients of the model were mapped according to 1 unit change that rescaled risk scores for hospitalization to 32–75 and 86–148 for mortality among hospitalized COVID-19 patients (Table 3). The rescaled cutoffs indicated high risk of hospitalization for patients with score ≥ 44, intermediate risk (39 ≤ score < 44), and low risk (score < 39). Similarly, high risk of mortality among hospitalized patients was assigned to scores ≥ 92, intermediate risk to 89 ≤ score < 92, and low risk patients have a score of less than 89. The prevalence of hospitalization within 30 days from COVID-19 diagnosis in the low, intermediate, and high risk groups were 2.75%, 8.47%, 22.85%, respectively. The incidence of mortality over approximately 3 months among hospitalized patients with COVID-19 diagnosis ranged from 0.92% in the low risk group to 4.44% and 26%, respectively, for the intermediate and high risk groups.

Table 3 SARS2 risk scores.

Full size table

Discussion

Currently, the U.S. is one of the epicenters of the pandemic with an increasing number of COVID-19 cases and mortality. The capability of predicting severity of COVID-19 illness in a fast and efficient manner would help healthcare workers to distinguish high risk patients. We utilized MGB EHR data of patients with COVID-19 to design simplified models for predicting hospitalization risk and also risk of mortality among hospitalized patients, where the model requires only demographic variables (age, sex, race, median household income) and smoking status of the patients. Testing the models on the validation cohorts showed high AUC (0.77 and 0.72 for hospitalization and mortality), and applying discrimination cutoffs for distinguishing patients with severe illness resulted in good AUCs as well. The Hosmer–Lemeshow GOF test resulted in p-values > 0.05 indicating good calibration of the SARS2 model.

Model performance characteristics such as AUC and Hosmer–Lemeshow GOF test calculated in set-aside validation cohorts indicated that the model has good discrimination and calibration, and performed well in the population of MGB patients. The odds ratios reported for our model are consistent with the currently available knowledge about association of severity of COVID-19 with demographic characteristics. This model is named “SARS2”, for its input variables: Sex, Age, Race, Socioeconomics status, Smoking status. The proposed SARS2 model is provided as a web interface for seamless calculation of the risk scores and risk categories (https://dashti.bwh.harvard.edu/sars2/).

In the main and the sensitivity analyses, Hispanic patients had a lower risk compared to white and black patients. Although these results align with the lower rate of hospitalized Hispanic patients in the current CDC reports (Hispanic: 22.9%, white: 31.7%, and black: 32.9%)³⁴, analysis on the MGB’s EHR records showed 84.33% of Hispanic patients with COVID-19 are younger than 60 years. The younger age could explain the lower rate of hospitalization, and further investigations on Hispanic patients are needed. The derivation and validation cohorts are from patients tested positive for COVID-19 at MGB medical centers, and further validation of the models on other cohorts is required to establish generalizability beyond our data. Because of the complexity of EHR data, admission diagnoses and causes of death were not considered in this study. Therefore, although non-COVID-19 related admission rates dropped during the pandemic, some of our hospitalization and mortality endpoints may not be due to COVID-19 illness.

We would like to mention important strengths and limitations of this study. This study used registry data collected in large Boston-area hospitals; therefore it captures well medical records data for patients who were seen in this hospital system, but it is also limited to those who have an access to it. Simplicity of the proposed SARS2 model of risk of mortality allows medical professionals to use it in situations when rapid or home-based decisions must be made about the risk of severe outcome due to COVID-19. On the other hand, we intentionally did not use test results such as laboratory values and imaging data such as X-rays and history of comorbid conditions, which when available should be taken into account as well as the severity of symptoms at presentation. However, a parsimonious model such as SARS2 can be used when other information is not available or is not reliable.

In conclusion, the proposed SARS2 model for predicting hospitalization among patients with COVID-19, and mortality among those hospitalized patients is designed based on easily accessible risk markers (age, sex, race, median household income, and smoking status). The SARS2 risk score table can be used for rapid risk stratification and assessment of the severity of hospitalization and mortality risks of patients with COVID-19. The SARS2 scheme successfully identified COVID-19 patients who were at high risk of hospitalization (22.85% observed risk of hospitalization) as well as those at low risk of hospitalization (2.75%), allowing individualized selection of patients who may require closer monitoring or further evaluation. Furthermore, the SARS2 scheme provides a rapid estimate of risk of mortality that could be used immediately on arrival in the Emergency Department, when laboratory or radiological assessments are not available yet. It is well known that extraction of a valid history of medication-use, and diagnoses and preconditions is not always feasible or may result in further delays. Therefore, designing simplified, rapid, home-based models that can be used as prescreening at clinics or at home increases the practicality and efficiency of these models in healthcare facilities. Although there is a limited number of risk scores available for predicting hospitalization or death among patients with COVID-19, the SARS2 models presented here are on par with the c-statistics of more comprehensive models that for example predict mortality in the largest available COVID-19 cohort (average AUC of 0.77)¹², or the 4C mortality score²⁹ that in addition to demographic variables uses biomarkers (e.g. urea and C-reactive protein) and in-hospital measurements (e.g. respiratory and peripheral SO2) that resulted in an AUC of 0.77. Similarly, the survival model developed using cytokines, demographics and comorbidities on patients admitted to the Mount Sinai Health System in New York (AUC ranged from 0.65 to 0.76)¹⁶. The provided web interface for calculating SARS2 scores and estimated absolute risks enables reliable and rapid assessment of risks of hospitalization and mortality to be individualized.

Methods

Study population

On 07/14/2020 a total of 12,460 individuals (outpatients and inpatients) have been diagnosed with COVID-19 at MGB medical centers. Demographic variables (age, sex, race, zip code), smoking status, hospital admission records, and COVID-19 lab results of these patients were queried from MGB’s EHR (Fig. 2). All data were obtained from Electronic Health Records Repository maintained by Mass General Brigham HealthCare Systems in full compliance with the Institutional Review Board (IRB) protocols and met all data access requirements. The study protocols have been reviewed and approved by the Partners Healthcare System IRB, and given the logistical complexities associated with the utilization of EHR data, informed consent was waived by the IRB for this study. Due to the use of EHR data, and the associated serious patient privacy concerns, the data utilized by this study is highly regulated and only released with appropriate IRB approval and under the most restrictive and carefully controlled conditions. As allowed by the IRB, summary statistics of the patient data have been included in the manuscript.

The COVID-19 lab results were dated within 03/04/2019–06/29/2020, and during this period, MGB employees working onsite underwent constant self-monitoring for symptoms and selective COVID-19 testing. The criteria for testing non-employees varied during the examined time interval; before April 29, 2020 symptomatic patients who were defined as high risk (e.g., age ≥70, severe chronic lung disease, sever heart disease, on immunocompromising medications, reside in counties with high number of cases) or of specific categories (e.g., pregnant ≥ 36 weeks, patients being discharged) were tested. However, a more relaxed criteria were applied after April 29, 2020 such that testing was not dependent on older age or preexisting medical conditions, and instead the criteria were defined based on symptoms (e.g., documented fever, cough, anosmia).

For every patient, the earliest positive (positive or presumptive positive) result of their COVID-19 tests was used as a reference date, and the time interval from these reference dates to the time of retrieving data for this study (07/14/2020) has a median follow-up of 84 days [95% IQR 69–96 days]. The EHR contained patients labeled as COVID-19 positive when their lab test results were positive/presumptive positive or patients were diagnosed with COVID-19 infection by the medical staff at MGB centers (COVID-19 ICD codes were used). Those without available COVID-19 lab test results were excluded from this study. The deceased flag and its corresponding date were retrieved from the EHR that indicated date of death among hospitalized patients within 74 days from the date of COVID-19 diagnosis (median date of death: 9 days [95% IQR 4–16 days]). Because of the waiting periods for receiving results of COVID-19 tests, hospital admission records dated between 7 days before until 30 days after patients reference date were queried from the EHR to identify hospitalized patients. Time to hospitalization ranged from − 7 to 29 days with median of 0 days, that reflects a positive COVID-19 diagnosis was a requirement for hospitalization in most cases. We note that the examined patient characteristics (age, sex, race, zip code, smoking status) are independent from time of events (hospitalization or death), and an ideal testing condition, with immediate availability of results, will not change associations between the examined characteristics and the events. Therefore, the events are considered as cumulative endpoints for the examined follow-up duration. We verified that outpatients had no record of admissions (more than 2 days) to MGB medical facilities during the period of − 7 to 30 days of follow-up.

In order to expand applications of the SARS2 models to more diverse regions in the U.S., we mapped patients’ primary zip codes to their median household incomes according to the U.S. Census 2018 data. These median household incomes were used as indicators of socioeconomic status of the patients. The EHR population contained 385 Asian, 18 Hawaiian, 30 American Indian, and 5 Dominicans that were considered as other races in the analysis.

MGB employees (validation cohort) and non-employees (derivation cohort) differed in their demographic characteristics (Supplementary Table 3) and also followed different COVID-19 testing criteria in the limited capacity setting. Presence of these differences between derivation and validation cohorts protects against over-optimism in estimating model performance characteristics and ensures robustness of the model. A logistic regression model (a generalized linear model with logit link (GLM)) was fit to predict hospitalization outcome. The same model was used for predicting mortality among the hospitalized patients.

To derive a model for predicting hospitalization of patients, we trained a GLM on demographic characteristics (sex, age, race, median household income), and smoking among non-employees (N = 10,496, 30.46% hospitalized) and validated the model on MGB employees (N = 1851, 11.02% hospitalized). Because mortality was recorded for inpatients, we examined the model performance for estimating mortality of the hospitalized patients (N = 3401, 14.97% deceased). In addition, because of the relatively lower rates of mortality among MGB employees, an average c-statistics of 5 iterations of validating the prediction model on randomly selected 20% of the hospitalized patients was also reported.

Statistical methods

The EHR data were preprocessed using Python scripts. All variables (sex, age, race, median household income, and smoking status) were used in the R glm function to derive a multivariable model for predicting risk of hospitalization. In this model, linear associations with binomial distribution (logit link function) was used to distinguish between hospitalized vs. outpatient. The default glm convergence criteria on deviances was used to stop the iterations. The DeLong method was used to calculate confidence intervals for the c-statistics. The R coords function with Youden’s ‘best’ method was used to calculate the optimal cutoff points on the receiver operating characteristic curves. Model calibration was evaluated using Hosmer–Lemeshow goodness-of-fit (GOF) test (the R hoslem.test function) in the validation cohort, and the R plotCalibration function was used to plot the GOF calibration. A model was also fit after categorizing age (0–29, 30–59, 60–79, ≥ 80; years) and median household income (< 60, 60–80, ≥ 80; $1000). The cutoffs on the median household income correspond to (< 62%, 62–85%, ≥ 85%) of the US median household incomes, according to the 2018 Census 2018. The beta coefficients of this model were used to design a hospitalization heatmap. In order to enhance readability of the heatmap, risk scores were scaled to the minimum change in the coefficients. The p-values of the test of trend were reported in the derivation cohort. Because of the differences in testing criteria before and after April 29, 2020, a sensitivity analysis was conducted after dividing patients based on their corresponding reference dates. The same procedure as the main model were applied to the derivation and validation cohorts among patients tested before and after April 29, 2020. Additional sensitivity analysis was conducted on the population without discarding the 24 patients who have been hospitalized after the 30 days interval. In this analysis, these patients were considered as outpatients and a GLM was derived and examined.

The optimal cutoff for predicted probabilities was used to categorize patients into high risk category. Patients with estimated risks less than the above cutoff were then analyzed to calculate another optimal cutoff to define an intermediate risk category. Patients with estimated risk less than the second cutoff were reported as low risk. The same procedure was followed to group mortality risks of the hospitalized patients into low, intermediate, and high-risk groups.

A Python implementation of the risk prediction model with categorized age and income is hosted at our website for seamless public access (https://dashti.bwh.harvard.edu/sars2/).

References

CDC. COVID-19 cases in the U.S., https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/cases-in-us.html (2020).
CDC. Assessing Risk Factors for Severe COVID-19 Illness, https://www.cdc.gov/coronavirus/2019-ncov/covid-data/investigations-discovery/assessing-risk-factors.html (2020).
Bajgain, K. T., Badal, S., Bajgain, B. B. & Santana, M. J. Prevalence of comorbidities among individuals with COVID-19: A rapid review of current literature. Am. J. Infect. Control https://doi.org/10.1016/j.ajic.2020.06.213 (2020).
Article PubMed PubMed Central Google Scholar
Gao, Y., Chen, Y., Liu, M., Shi, S. & Tian, J. Impacts of immunosuppression and immunodeficiency on COVID-19: A systematic review and meta-analysis. J. Infect. S0163–4453(0120), 30294. https://doi.org/10.1016/j.jinf.2020.05.017 (2020).
Article CAS Google Scholar
Grasselli, G. et al. Risk factors associated with mortality among patients with COVID-19 in intensive care units in Lombardy Italy. JAMA Internal Med. https://doi.org/10.1001/jamainternmed.2020.3539 (2020).
Article Google Scholar
Grasselli, G. et al. Baseline characteristics and outcomes of 1591 patients infected with SARS-CoV-2 admitted to ICUs of the Lombardy Region Italy. JAMA 323, 1574–1581. https://doi.org/10.1001/jama.2020.5394 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gupta, S. F. et al. Factors associated with death in critically ill patients with coronavirus disease 2019 in the US in the US. JAMA Internal Med. https://doi.org/10.1001/jamainternmed.2020.3596 (2020).
Article Google Scholar
Kalligeros, M. et al. Association of obesity with disease severity among patients with coronavirus disease 2019. Obesity 28, 1200–1204. https://doi.org/10.1002/oby.22859 (2020).
Article CAS PubMed Google Scholar
Pan, D. et al. The impact of ethnicity on clinical outcomes in COVID-19: A systematic review. EClinicalMedicine 23. https://doi.org/10.1016/j.eclinm.2020.100404 (2020).
Price-Haywood, E. G., Burton, J., Fort, D. & Seoane, L. Hospitalization and mortality among black patients and white patients with covid-19. N. Engl. J. Med. 382, 2534–2543. https://doi.org/10.1056/NEJMsa2011686 (2020).
Article CAS PubMed Google Scholar
Siddiqi, H. K. & Mehra, M. R. COVID-19 illness in native and immunosuppressed states: A clinical-therapeutic staging proposal. J. Heart Lung Transplant. 39, 405–407. https://doi.org/10.1016/j.healun.2020.03.012 (2020).
Article PubMed PubMed Central Google Scholar
Williamson, E. J. et al. OpenSAFELY: Factors associated with COVID-19 death in 17 million patients. Nature https://doi.org/10.1038/s41586-020-2521-4 (2020).
Article PubMed PubMed Central Google Scholar
Wu, Z. & McGoogan, J. M. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: Summary of a report of 72,314 cases from the Chinese Center for disease control and prevention. JAMA 323, 1239–1242. https://doi.org/10.1001/jama.2020.2648 (2020).
Article CAS PubMed Google Scholar
McMichael, T. M. et al. Epidemiology of Covid-19 in a long-term care facility in King County Washington. N. Engl. J. Med. 382, 2005–2011. https://doi.org/10.1056/NEJMoa2005412 (2020).
Article CAS PubMed Google Scholar
Berlin, D. A., Gulick, R. M. & Martinez, F. J. Severe Covid-19. N. Engl. J. Med. https://doi.org/10.1056/NEJMcp2009575 (2020).
Article PubMed Google Scholar
Del Valle, D. M. et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nat. Med. https://doi.org/10.1038/s41591-020-1051-9 (2020).
Article PubMed PubMed Central Google Scholar
Selden, T. M. & Berdahl, T. A. COVID-19 and racial/ethnic disparities in health risk, employment And Household Composition. Health Aff. https://doi.org/10.1377/hlthaff.2020.00897 (2020).
Article Google Scholar
Sattar, N., McInnes Iain, B. & McMurray John, J. V. Obesity is a risk factor for severe COVID-19 infection. Circulation 142, 4–6. https://doi.org/10.1161/CIRCULATIONAHA.120.047659 (2020).
Zachariah, P. et al. Epidemiology, clinical features, and disease severity in patients with coronavirus disease 2019 (COVID-19) in a children’s hospital in New York City, New York. JAMA Pediatrics, e202430-e202430. https://doi.org/10.1001/jamapediatrics.2020.2430 (2020).
Lighter, J. et al. Obesity in patients younger than 60 years is a risk factor for COVID-19 hospital admission. Clin. Infect. Dis. https://doi.org/10.1093/cid/ciaa415 (2020).
Article PubMed Google Scholar
Aggarwal, G. et al. Association of cardiovascular disease with coronavirus disease 2019 (COVID-19) severity: A meta-analysis. Curr. Probl. Cardiol. 45, 100617–100617. https://doi.org/10.1016/j.cpcardiol.2020.100617 (2020).
Article PubMed PubMed Central Google Scholar
Mehra, M. R., Desai, S. S., Kuy, S., Henry, T. D. & Patel, A. N. Cardiovascular disease, drug therapy, and mortality in Covid-19. N. Engl. J. Med. 382, e102. https://doi.org/10.1056/NEJMoa2007621 (2020).
Article CAS PubMed Google Scholar
Reynolds, H. R. et al. Renin–angiotensin–aldosterone system inhibitors and risk of Covid-19. N. Engl. J. Med. 382, 2441–2448. https://doi.org/10.1056/NEJMoa2008975 (2020).
Article CAS PubMed Google Scholar
Bandyopadhyay, D. et al. COVID-19 Pandemic: Cardiovascular complications and future implications. Am. J. Cardiovasc. Drugs 1–14. https://doi.org/10.1007/s40256-020-00420-2 (2020).
Lippi, G. & Henry, B. M. Active smoking is not associated with severity of coronavirus disease 2019 (COVID-19). Eur. J. Intern. Med. 75, 107–108. https://doi.org/10.1016/j.ejim.2020.03.014 (2020).
Article CAS PubMed PubMed Central Google Scholar
Organization, W. H. Smoking and COVID-19, https://www.who.int/news-room/commentaries/detail/smoking-and-covid-19 (2020).
Polosa, R. & Caci, G. COVID-19: counter-intuitive data on smoking prevalence and therapeutic implications for nicotine. Intern. Emerg. Med. 1–4. https://doi.org/10.1007/s11739-020-02361-9 (2020).
Rentsch, C. T. et al. Covid-19 testing, hospital admission, and intensive care among 2,026,227 United States veterans aged 54–75 years. medRxiv, 2020.2004.2009.20059964. https://doi.org/10.1101/2020.04.09.20059964 (2020).
Knight, S. R. et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: Development and validation of the 4C Mortality Score. BMJ 370, m3339. https://doi.org/10.1136/bmj.m3339 (2020).
Article PubMed Google Scholar
Zhang, X. A. et al. Semantic integration of clinical laboratory tests from electronic health records for deep phenotyping and biomarker discovery. NPJ Digit. Med. 2, 32. https://doi.org/10.1038/s41746-019-0110-4 (2019).
Xiao, C., Choi, E. & Sun, J. Opportunities and challenges in developing deep learning models using electronic health records data: A systematic review. J. Am. Med. Inform. Assoc. 25, 1419–1428. https://doi.org/10.1093/jamia/ocy068 (2018).
Article PubMed PubMed Central Google Scholar
Heisey-Grove, D., Danehy, L.-N., Consolazio, M., Lynch, K. & Mostashari, F. A national study of challenges to electronic health record adoption and meaningful use. Med. Care 52, 144–148 (2014).
Article PubMed Google Scholar
Bayer, R., Santelli, J. & Klitzman, R. New challenges for electronic health records: Confidentiality and access to sensitive health information about parents and adolescents. JAMA 313, 29–30. https://doi.org/10.1001/jama.2014.15391 (2015).
Article CAS PubMed Google Scholar
CDC. COVID-19 Laboratory-Confirmed Hospitalization, https://gis.cdc.gov/grasp/COVIDNet/COVID19_5.html (2020).

Download references

Acknowledgements

We are grateful for the constructive comments from Dr. Nancy R. Cook, Brigham and Woman’s Hospital and Harvard Medical School. Authors are grateful for the support from the Enterprise Data Warehouse, Research Patient Data Repository, and COVID-19 Data Mart personnel at Mass General Brigham, in particular continuous helps from Stacey A. Duey and Julie M. Fiskio. This work was supported in part by the National Heart Lung and Blood Institute (T32 HL007575 to H.D., K24 HL136852 to S.M., and 5K01HL135342 to O.D.), by 17IGMV33860009 from the American Heart Association to O.D., by the BWH Lerner Junior Faculty Research Award to O.D., and by philanthropic support from the Brigham and Women’s Hospital COVID fund.

Author information

These authors contributed equally: Samia Mora and Olga Demler.

Authors and Affiliations

Division of Preventive Medicine, Center for Lipid Metabolomics, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Hesam Dashti, Elise C. Roche, David William Bates, Samia Mora & Olga Demler
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Hesam Dashti
Division of Cardiovascular Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Samia Mora

Authors

Hesam Dashti
View author publications
You can also search for this author in PubMed Google Scholar
Elise C. Roche
View author publications
You can also search for this author in PubMed Google Scholar
David William Bates
View author publications
You can also search for this author in PubMed Google Scholar
Samia Mora
View author publications
You can also search for this author in PubMed Google Scholar
Olga Demler
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.D., S.M., and O.D. were involved in the planning, conceptualization, and design of the study. H.D., E.C.R., D.W.B., and O.D. conducted data acquisition procedures, and performed the analysis. H.D., S.M., and O.D. were involved in interpretation of the data and analysis, and preparations of the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Samia Mora.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dashti, H., Roche, E.C., Bates, D.W. et al. SARS2 simplified scores to estimate risk of hospitalization and death among patients with COVID-19. Sci Rep 11, 4945 (2021). https://doi.org/10.1038/s41598-021-84603-0

Download citation

Received: 15 September 2020
Accepted: 18 February 2021
Published: 02 March 2021
DOI: https://doi.org/10.1038/s41598-021-84603-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.