Influence of social deprivation index on in-hospital outcomes of COVID-19

Goyal, Parag; Schenck, Edward; Wu, Yiyuan; Zhang, Yongkang; Visaria, Aayush; Orlander, Duncan; Xi, Wenna; Díaz, Iván; Morozyuk, Dmitry; Weiner, Mark; Kaushal, Rainu; Banerjee, Samprit

doi:10.1038/s41598-023-28362-0

Download PDF

Article
Open access
Published: 31 January 2023

Influence of social deprivation index on in-hospital outcomes of COVID-19

Parag Goyal^1,2,
Edward Schenck ORCID: orcid.org/0000-0002-7950-5989^1,2,
Yiyuan Wu³,
Yongkang Zhang³,
Aayush Visaria⁴,
Duncan Orlander³,
Wenna Xi³,
Iván Díaz³,
Dmitry Morozyuk³,
Mark Weiner³,
Rainu Kaushal^1,2,3,5 &
…
Samprit Banerjee^3,6

Scientific Reports volume 13, Article number: 1746 (2023) Cite this article

2886 Accesses
3 Citations
12 Altmetric
Metrics details

Subjects

Abstract

While it is known that social deprivation index (SDI) plays an important role on risk for acquiring Coronavirus Disease 2019 (COVID-19), the impact of SDI on in-hospital outcomes such as intubation and mortality are less well-characterized. We analyzed electronic health record data of adults hospitalized with confirmed COVID-19 between March 1, 2020 and February 8, 2021 from the INSIGHT Clinical Research Network (CRN). To compute the SDI (exposure variable), we linked clinical data using patient’s residential zip-code with social data at zip-code tabulation area. SDI is a composite of seven socioeconomic characteristics determinants at the zip-code level. For this analysis, we categorized SDI into quintiles. The two outcomes of interest were in-hospital intubation and mortality. For each outcome, we examined logistic regression and random forests to determine incremental value of SDI in predicting outcomes. We studied 30,016 included COVID-19 patients. In a logistic regression model for intubation, a model including demographics, comorbidity, and vitals had an Area under the receiver operating characteristic curve (AUROC) = 0.73 (95% CI 0.70–0.75); the addition of SDI did not improve prediction [AUROC = 0.73 (95% CI 0.71–0.75)]. In a logistic regression model for in-hospital mortality, demographics, comorbidity, and vitals had an AUROC = 0.80 (95% CI 0.79–0.82); the addition of SDI in Model 2 did not improve prediction [AUROC = 0.81 (95% CI 0.79–0.82)]. Random forests revealed similar findings. SDI did not provide incremental improvement in predicting in-hospital intubation or mortality. SDI plays an important role on who acquires COVID-19 and its severity; but once hospitalized, SDI appears less important.

Early risk assessment for COVID-19 patients from emergency department data using machine learning

Article Open access 18 February 2021

A retrospective cohort study of risk factors for mortality among nursing homes exposed to COVID-19 in Spain

Article 28 June 2021

Comparing COVID-19 risk factors in Brazil using machine learning: the importance of socioeconomic, demographic and structural factors

Article Open access 02 August 2021

Introduction

Given the profound impact of the COVID-19 pandemic, research on COVID-19 has become an important priority across the world, especially as it relates to prediction of outcomes¹. Beyond patient demographics and traditional clinical characteristics, social factors have emerged as important predictors of outcomes. For example, prior work from our group and others have shown that higher social deprivation index (SDI) and deprived living environment are associated with hospitalization for COVID-19^2,3, with mortality and other poor outcomes both in the US and in the UK.

The SDI is a neighborhood-level marker of social disadvantage related to a dearth of health care resources⁴. SDI is especially important in the United States because individuals from regions with higher SDI (more disadvantaged) have higher risk of disease and often experience limited access to care, resulting in an unmet need in healthcare and poor patient outcomes⁴. High SDI is also associated with worse post-hospitalization outcomes in myriad diseases, and has been postulated to result from limited resources for recovery, limited continuity of care, and a greater burden of comorbidities⁵. While it is well-known that SDI plays an important role on processes prior to a hospitalization and after hospitalization⁵, its impact on in-hospital processes is less well-characterized.

Given emerging evidence regarding the influence of SDI and other social risk indicators on multiple aspects of the COVID-19 pandemic^2,6, we sought to examine the importance of SDI on the prediction of in-hospital COVID-19 outcomes including intubation and in-hospital mortality. Understanding the influence of SDI on in-hospital outcomes’ prediction could provide important insights on care delivery during a pandemic, and potentially identify an important source of significant disparities for in-hospital outcomes of COVID-19. To address this gap in knowledge, we leveraged one of the largest electronic health record (EHR) datasets of hospitalized patients, derived from three major health systems in New York City (NYC)⁷ one of the first epicenters—during multiple phases of the pandemic (from March 1, 2020 to February 8, 2021).

Methods

Inclusion criteria were: (1) adults (≥ 18 years of age) (2) confirmed COVID-19 by positive RT-PCR test or ICD-10 diagnosis (3) admission to emergency department (ED) or hospital between March 1, 2020 and February 8, 2021. Patients living in a nursing home prior to their index presentation were excluded as zip codes in EHR may not represent their residence. The resulting cohort included 30,016 unique patients with confirmed COVID-19.

Exposure: social deprivation index

We linked clinical data using patient’s residential zip-code with social data at zip-code tabulation area (ZCTA) to compute the Social Deprivation Index (SDI)⁸ for 2020 using publicly available sources⁹. SDI is a composite of six socioeconomic characteristics (income, education, employment, housing, household characteristics and transportation) determined at the ZCTA level. We mapped patients’ residential zip codes onto ZCTAs. We categorized all ZCTAs into quintiles based on SDI score.

Outcomes

The two outcomes of interest were in-hospital intubation and in-hospital mortality. Intubation was defined as mechanical ventilation during hospital stay based on the presence of relevant orders and procedure codes. In-hospital mortality was defined as deaths that occurred during the hospitalization recorded in hospital EHR or reflected in the Diagnosis Related Group.

Patient characteristics

We examined demographics, baseline comorbidities, and vital signs at admission. Demographics included age, sex, race (White or non-White), and ethnicity (Hispanic or non-Hispanic). Established diagnosis codes¹⁰ were used to identify baseline comorbidities including hypertension, diabetes, coronary artery disease, heart failure, chronic obstructive pulmonary disease, asthma, cancer, obesity, and hyperlipidemia. Vital signs that were robustly captured by the participating health systems included systolic and diastolic blood pressure, and Body Mass Index (BMI) at admission.

Statistical analysis

To predict the two binary outcomes (intubation and mortality), we considered two methods—a logistic regression and random forests (RF). First, we constructed a sequence of models with logistic regression to evaluate the incremental effect of each group of predictors. Model 1included demographics, comorbidity, and vital signs; we added SDI quintile to construct Model 2, added time since the start of the pandemic to construct Model 3, and finally added an SDI by time interaction to construct Model 4.

Similar models (except Model 4) were constructed with Random Forests, a machine learning algorithm that automatically models complex interactions between predictors. Model performance was estimated by Area Under the Receiver Operating Characteristic curve (AUROC) using a five-fold cross-validation and its 95% confidence interval reported. Missing data in predictors (range 3.6–12%) were imputed with random forest¹¹ to produce an imputed dataset.

Ethical Information

Weill Cornell IRB (#20-04021948) approved this study and determined that this study meets exemption requirements at HHS 45 CFR 46.104(d). All data management and analysis were conducted in a manner that is HIPAA-compliant.

Results

Patient characteristics

Among N = 30,016 COVID-19 patients, the median Inter-quartile range (IQR) age was 59.5 (43.2–72.4) years, 50.8% were males, 63.5% were non-White race and 36.4% had Hispanic ethnicity. The most common comorbid conditions were hypertension (53.6%), hyperlipidemia (38.6%), and diabetes (32.9%) (Table 1). Compared to the group with the lowest SDI (1st quintile), the group with highest SDI (5th quintile) had a higher proportion of non-White race and Hispanic ethnicity, had higher prevalence of each of the comorbid conditions, and presented to the hospital earlier in the pandemic.

Table 1 Baseline characteristics of hospitalized Covid-19 patients included in the study across SDI (Social Deprivation Index) quintiles.

Full size table

Intubation

In a logistic regression model, Model 1 (demographics, comorbidity, and vitals) predicted intubation with moderate accuracy (AUROC = 0.73; 95% CI 0.70–0.75). The addition of SDI in Model 2 did not improve accuracy (AUROC = 0.73; 95% CI 0.71–0.75). The addition of time in Model 3 increased accuracy (AUROC = 0.78; 95% CI 0.76–0.79) compared to Model 2. The addition of an interaction between SDI quintiles and time did not improve prediction (AUROC = 0.78; 95% CI 0.76–0.79). Results from the RF showed similar results (Fig. 1).

In-hospital mortality

In a logistic regression model, Model 1 (demographics, comorbidity, and vitals) predicted mortality accurately (AUROC = 0.80; 95% CI 0.79–0.82). The addition of SDI in Model 2 did not improve prediction (AUROC = 0.81; 95% CI 0.79–0.82). The addition of time in Model 3 increased accuracy (AUROC = 0.84; 95% CI 0.82–0.85) compared to Model 2. The addition of an interaction between SDI quintiles and time did not improve prediction (AUROC = 0.84; 95% CI 0.82–0.85). Results from the RF showed similar results (Fig. 1).

Discussion

This study, of over 30,000 patients hospitalized for confirmed COVID-19 in NYC, found that neither SDI nor its interaction with time provided incremental value in predicting in-hospital intubation or death. This suggests that SDI based on a patient’s neighborhood did not influence outcomes once hospitalized for COVID-19 beyond known clinical risk factors, and this did not change over the course of the pandemic.

The importance of social determinants of health has garnered a spotlight in the United States over the past couple years in the setting of national events including COVID-19¹². Our group previously showed that SDI was associated with hospitalization for COVID-19 and all-cause mortality; but did not consider in-hospital events². This study extends prior findings by indicating that SDI does not predict adverse events beyond demographic and clinical predictors once medical attention is sought.

Our findings are reassuring given concerns about the impact of implicit bias related to social determinants of health on provision of care and associated outcomes during the COVID-19 pandemic^13,14. Concerns about implicit bias were especially relevant at the peak of the pandemic during which some hospitals had to make plans for rationing care^15,16. Our findings reveal that SDI did not have a major influence on in-hospital outcomes at any point of the pandemic including at the peak. Given these observations, interventions to address disparities as they relate to SDI should focus on the community rather than the hospital. For example, increased efforts to improve vaccinations rates in high SDI-regions may be especially important to improve outcomes for vulnerable populations. With emerging data about the long-term sequelae of COVID-19¹⁷, lack of healthcare resources may exacerbate negative impact of “long COVID, suggesting the potential utility of additional resources (e.g. paid sick-leave, housing support etc.) for COVID-19 survivors living in high SDI-regions.”

The strengths of this study are—first, inclusion of several major health systems in NYC, making our findings more generalizable than prior studies based on single institutions; second, the study time-period of 1-year since the beginning of the pandemic, thereby capturing the evolution of clinical knowledge and experience, practice habits, and evidence for therapies over the course of the pandemic; third, use of logistic regression and random forest to model high-level interactions that may not otherwise be easily discerned.

The important limitations are—first, findings are limited to NYC, and may not be generalizable to other regions of the country; second, SDI measures neighborhood level rather than individual level social disadvantage. Consequently, some individual patients may have more or less disadvantage than their neighborhood SDI; third, the complex interplay between SDI, overall health and COVID-19 infection makes risk estimates of predictors biased (collider bias)¹⁸, therefore we focus on prediction accuracy only; finally, clinical predictors were limited to those robustly captured across all health systems and we did not have data on some known predictors (e.g. respiratory rate) and health behaviors (e.g. diet).

Conclusion

SDI did not provide incremental improvement in predicting in-hospital intubation or mortality beyond known demographic and clinical predictors. SDI likely plays an important role on who acquires COVID-19, and its severity; but once hospitalized, SDI appears less important. Future interventions to address SDI-related disparities should focus on improving health of the community before acquiring COVID-19, such as through vaccination efforts.

Data availability

The datasets analyzed in this study are not publicly available because the data involved de-identified Electronic Health Records of patients served by several academic health centers in New York, NY and housed in a secure data warehouse.

References

Wynants, L. et al. Prediction models for diagnosis and prognosis of covid-19: Systematic review and critical appraisal. BMJ 369, m1328. https://doi.org/10.1136/bmj.m1328 (2020).
Article Google Scholar
Zhang, Y. et al. Socioeconomic variation in characteristics, outcomes, and healthcare utilization of COVID-19 patients in New York City. PLoS ONE 16(7), e0255171. https://doi.org/10.1371/journal.pone.0255171 (2021) (In Eng).
Article CAS Google Scholar
Soltan, M. et al. L12 To what extent are social determinants of health, including household overcrowding, air pollution and housing quality deprivation, modulators of presentation, ITU admission and outcomes among patients with SARS-COV-2 infection in an urban catchment area in Birmingham, United Kingdom? Thorax 76(Issue Supplement 1), A237–A238 (2021).
Google Scholar
Butler, D. C., Petterson, S., Phillips, R. L. & Bazemore, A. W. Measures of social deprivation that predict health care access and need within a rational area of primary care service delivery. Health Serv. Res. 48(2 Pt 1), 539–559. https://doi.org/10.1111/j.1475-6773.2012.01449.x (2013) (In Eng).
Article Google Scholar
Salerno, S. et al. Comprehensive evaluation of COVID-19 patient short- and long-term outcomes: Disparities in healthcare utilization and post-hospitalization outcomes. PLoS ONE 16(10), e0258278. https://doi.org/10.1371/journal.pone.0258278 (2021).
Article CAS Google Scholar
Ossimetha, A., Ossimetha, A., Kosar, C. M. & Rahman, M. Socioeconomic disparities in community mobility reduction and COVID-19 growth. Mayo Clin. Proc. 96(1), 78–85. https://doi.org/10.1016/j.mayocp.2020.10.019 (2021) (In Eng).
Article CAS Google Scholar
Richardson, S. et al. Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City Area. JAMA 323(20), 2052–2059. https://doi.org/10.1001/jama.2020.6775 (2020) (In Eng).
Article CAS Google Scholar
Center. TRG. Social Deprivation Index (SDI). https://www.graham-center.org/rgc/maps-data-tools/sdi/social-deprivation-index.html.
Bureau. USC. American Community Survey 2020. https://www.census.gov/programs-surveys/acs.
Adhikari, S. et al. Assessment of community-level disparities in coronavirus disease 2019 (COVID-19) infections and deaths in large US Metropolitan areas. JAMA Netw. Open 3(7), e2016938. https://doi.org/10.1001/jamanetworkopen.2020.16938 (2020) (In Eng).
Article Google Scholar
Buuren, S. V. Flexible Imputation of Missing Data (Chapman and hall/CRC, 2018).
Book MATH Google Scholar
Thakur, N., Lovinsky-Desir, S., Bime, C., Wisnivesky, J. P. & Celedón, J. C. The structural and social determinants of the racial/ethnic disparities in the U.S. COVID-19 pandemic: What’s our role? Am. J. Respir. Crit. Care Med. 202(7), 943–949. https://doi.org/10.1164/rccm.202005-1523PP (2020) (In Eng).
Article CAS Google Scholar
Dorn, A. V., Cooney, R. E. & Sabin, M. L. COVID-19 exacerbating inequalities in the US. Lancet 395(10232), 1243–1244. https://doi.org/10.1016/s0140-6736(20)30893-x (2020) (In Eng).
Article CAS Google Scholar
Webb Hooper, M., Nápoles, A. M. & Pérez-Stable, E. J. COVID-19 and racial/ethnic disparities. JAMA 323(24), 2466–2467. https://doi.org/10.1001/jama.2020.8598 (2020) (In Eng).
Article CAS Google Scholar
Tolchin, B., Hull, S. C. & Kraschel, K. Triage and justice in an unjust pandemic: Ethical allocation of scarce medical resources in the setting of racial and socioeconomic disparities. J. Med. Ethics https://doi.org/10.1136/medethics-2020-106457 (2020) (In Eng).
Article Google Scholar
Gershengorn, H. B. et al. Assessment of disparities associated with a crisis standards of care resource allocation algorithm for patients in 2 US hospitals during the COVID-19 pandemic. JAMA Netw. Open 4(3), e214149–e214149. https://doi.org/10.1001/jamanetworkopen.2021.4149 (2021).
Article Google Scholar
Sudre, C. H. et al. Attributes and predictors of long COVID. Nat. Med. 27(4), 626–631. https://doi.org/10.1038/s41591-021-01292-y (2021) (In Eng).
Article CAS Google Scholar
Griffith, G. J. et al. Collider bias undermines our understanding of COVID-19 disease risk and severity. Nat. Commun. 11(1), 5749. https://doi.org/10.1038/s41467-020-19478-2 (2020).
Article ADS CAS Google Scholar

Download references

Funding

This study was funded by a COVID-19 Enhancement Award PCORI/HSD-1604-35187 entitled “Using Predictive Models to Improve Care for Hospitalized Patients with Novel Coronavirus Disease” from the Patient-Centered Outcomes Research Institute. (Parent Award: “Identifying and Predicting Patients with Preventable High Utilization”). Specifically, the PCORI Enhancement award supported the data curation and effort of authors (PG, ED, YW, DO, ID, MW, RK, SB). In addition, the effort of Drs. Banerjee, Diaz and Ms. Wu was also supported by National Institute of Mental Health P50 MH113838. The content is solely the responsibility of the authors and does not necessarily represent the official views of Patient-Centered Outcomes Research Institute or the National Institutes of Health. The funding sponsors did not contribute to design and conduct of the study, collection, management, analysis, or interpretation of the data or preparation, review, or approval of the manuscript.

Author information

Authors and Affiliations

Department of Medicine, Weill Cornell Medical College, 1320 York Avenue, New York, NY, 10021, USA
Parag Goyal, Edward Schenck & Rainu Kaushal
NewYork-Presbyterian Hospital, 525 East 68th Street, New York, NY, 10065, USA
Parag Goyal, Edward Schenck & Rainu Kaushal
Department of Population Health Sciences, Weill Cornell Medical College, 425 East 61St Street, New York, NY, 10065, USA
Yiyuan Wu, Yongkang Zhang, Duncan Orlander, Wenna Xi, Iván Díaz, Dmitry Morozyuk, Mark Weiner, Rainu Kaushal & Samprit Banerjee
Center for Pharmacoepidemiology and Treatment Sciences, Rutgers Institute for Health, Health Care Policy, and Aging Research, New Brunswick, NJ, USA
Aayush Visaria
Department of Pediatrics, Weill Cornell Medical College, New York, NY, USA
Rainu Kaushal
New York, USA
Samprit Banerjee

Authors

Parag Goyal
View author publications
You can also search for this author in PubMed Google Scholar
Edward Schenck
View author publications
You can also search for this author in PubMed Google Scholar
Yiyuan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yongkang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Aayush Visaria
View author publications
You can also search for this author in PubMed Google Scholar
Duncan Orlander
View author publications
You can also search for this author in PubMed Google Scholar
Wenna Xi
View author publications
You can also search for this author in PubMed Google Scholar
Iván Díaz
View author publications
You can also search for this author in PubMed Google Scholar
Dmitry Morozyuk
View author publications
You can also search for this author in PubMed Google Scholar
Mark Weiner
View author publications
You can also search for this author in PubMed Google Scholar
Rainu Kaushal
View author publications
You can also search for this author in PubMed Google Scholar
Samprit Banerjee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.B., P.G., E.S. and R.K. conceived the idea for the research questions and study design. M.W., D.M. and D.O. extracted and pre-processed E.H.R. data from a central data warehouse. S.B., I.D., Y.Z. and W.X. designed the statistical analysis plan. Y.W. carried out the statistical analysis. P.G., S.B. and A.V. created the first draft of the manuscript. All authors contributed to the interpretation of the findings and critically revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Samprit Banerjee.

Ethics declarations

Competing interests

Dr. Schenck received consulting fees from Axle informatics for the NAID’s subject matter expert COVID vaccine program; the remaining authors have no disclosures to report.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Goyal, P., Schenck, E., Wu, Y. et al. Influence of social deprivation index on in-hospital outcomes of COVID-19. Sci Rep 13, 1746 (2023). https://doi.org/10.1038/s41598-023-28362-0

Download citation

Received: 10 April 2022
Accepted: 17 January 2023
Published: 31 January 2023
DOI: https://doi.org/10.1038/s41598-023-28362-0

This article is cited by

The Impact of Socio-Economic Conditions on Individuals’ Health: Development of an Index and Examination of its Association with Three of the Most Frequently Registered Diseases in Lazio Region of Italy
- Ilaria Valentini
- Mario Cesare Nurchis
- Giuseppe Arbia
Social Indicators Research (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.