Abstract
Heart failure (HF) has no cure and, for HF with preserved ejection fraction (HFpEF), no life-extending treatments. Defining the clinical epidemiology of HF could facilitate earlier identification of high-risk individuals. We define the clinical epidemiology of HF subtypes (HFpEF and HF with reduced ejection fraction [HFrEF]), identified among 2.7 million individuals receiving routine clinical care. Differences in patterns and rates of accumulation of comorbidities, frequency of hospitalization, use of specialty care, were defined for each HF subtype. Among 28,156 HF cases, 8322 (30%) were HFpEF and 11,677 (42%) were HFrEF. HFpEF was the more prevalent subtype among older women. 177 Phenotypes differentially associated with HFpEF versus HFrEF. HFrEF was more frequently associated with diagnoses related to ischemic cardiac injury while HFpEF was associated more with non-cardiac comorbidities and HF symptoms. These comorbidity patterns were frequently present 3 years prior to a HFpEF diagnosis. HF subtypes demonstrated distinct patterns of clinical co-morbidities and disease progression. For HFpEF, these comorbidities were often non-cardiac and manifested prior to the onset of a HF diagnosis. Recognizing these comorbidity patterns, along the care continuum, may present a window of opportunity to identify individuals at risk for developing incident HFpEF.
Similar content being viewed by others
Introduction
Heart failure (HF), a clinical syndrome driven by the heart’s inability to meet the oxygen needs of the body, is a significant cause of annually increasing morbidity and mortality. HF accounted for 1 in 8 deaths in the U.S. in 20171. In 2018, there were about 6.2 million adults with heart failure in the United States, and 379,800 death certificates listed heart failure (13.4%)2. Heart failure is broadly subtyped by left ventricular ejection fraction (LVEF) into HF with reduced ejection fraction (HFrEF, LVEF < 40) and HF with preserved ejection fraction (HFpEF, LVEF ≥ 50)3. Currently, HFpEF patients account for about 50% of all heart failure patients3,4. There is no cure for HF and, for HFpEF, there are currently no life-extending treatments3,4,5,6,7. Hence, prevention is paramount for reducing the burden of HF, especially for HFpEF, which is becoming the more prevalent subtype3,4,5,6,7.
Identifying high risk individuals is essential for effective prevention, and this requires a clear understanding of the evolving epidemiology of HF8,9,10,11. In particular, a clearer understanding of the spectrum and temporality of the clinical manifestations of early HF could better enable primary care providers to identify high risk patients with sufficient lead time to enable early intervention to delay or prevent the onset of HF and sequelae12,13. Defining the relevant clinical epidemiology requires large, contemporary clinical cohorts, such as those who seek care at large medical centers14,15,16,17,18,19,20.
We leveraged electronic health records (EHR) from ~ 2.7 million individuals seeking routine clinical care at Vanderbilt University Medical Center. We identified a large collection of HF cases and characterized the comorbidity profiles of HF subtypes prior to and at the time of their clinical diagnosis. We asked if there were distinct longitudinal patterns of comorbidity accumulation between the HF subtypes, as patients moved through the healthcare system, and if this could have implications for HF management.
Methods
All individuals were derived from Vanderbilt University Medical Center’s Synthetic Derivative database, a research tool for conducting epidemiological studies using de-identified EHR data, especially suited to work with tools of machine learning and big data21,22,23,24. This resource comprises inpatient and outpatient clinical data from multiple sources including diagnostic and procedure codes (ICD-9 [International Classification of Disease, Ninth revision] and CPT [Current Procedural Terminology]), demographics, text from clinical notes, laboratory values, procedural reports (e.g., echocardiograms), and medications extracted individual clinical records23,24,25. The study was reviewed and approved by Vanderbilt’s Institutional Review Board and was determined to be non-human subjects research. These data contain no HIPAA or other personal identifiers.
Clinical phenotypes
Medication data were extracted using the validated MEDEX tool26. Keyword features were extracted from source documents (problem lists and clinical documents) and occurrences were excluded if a negation term (such as “not” or “ruled out”) was within 100 characters of the keyword (Supplementary Tables 1 and 2)27,28. For each individual, sex, race (white, black or other), blood pressure, heart rate, body mass index (BMI), BNP values, and clinician visit dates and types (i.e., inpatient/outpatient, clinic type) were extracted from structured tables in the SD. Diagnoses of myocardial infarction (MI), coronary artery disease (CAD), hypertension, dyslipidemia, atrial fibrillation (AF), type 2 diabetes (T2D), and chronic kidney disease (CKD) were based on previously validated EHR algorithms (Supplementary Table 3)29.
Echocardiographic measures of cardiac structure and function
Measures, extracted from transthoracic echocardiography (TTE) reports, using previously described approaches29,30, included left ventricular wall thickness, left ventricular ejection fraction (LVEF), diastolic function, and cardiac chamber dimensions. For each HF case with one or more LVEF measurements, the following HF subtypes were defined:
-
(1)
HFpEF: All LVEF measurements ≥ 50%.
-
(2)
HFmEF (Heart Failure with Mid-Range Ejection Fraction): Any LVEF < 50% and no LVEF < 40%
-
(3)
HFrEF: Any LVEF ≤ 40%.
PheWAS phenotypes
Phenome-wide association study (PheWAS) codes are validated groupings of related ICD-9 billing codes that capture the extended range of clinical diagnoses within an EHR data set31,32. A complete list of PheWAS codes and their mappings to ICD-9 codes can be found at https://www.vumc.org/cpm/cpm-blog/phewas-phenome-wide-association-studies. For a given PheWAS code, cases are individuals with 1 or more codes and controls are individuals with no related codes. After excluding phenotypes affecting a single sex or with ≤ 100 cases, 1224 clinical phenotypes were included in analyses.
Universal definition of heart failure
We examined how many of the cases used in the Validation set met the Bozkurt et al.20 universal heart failure algorithm criteria based on the features available to the machine learning algorithm. In line with the guidelines, we tested for the presence of any of the following symptoms: pulmonary edema, pleural effusion, orthopnea, paroxysmal nocturnal dyspnea, nocturnal cough, lower extremity edema, dyspnea on exertion, jugular venous distension/elevated jugular venous pressure/elevated JVP, cardiomegaly, third heart sound, hepatomegaly, rales, ejection fraction < 50%, and an outpatient NT-Pro-BNP, pg/ml ≥ 125. The proportion of individuals who met all of these criteria was determined.
Analysis
The random forest machine learning classifier, implemented in ScikitLearn (v0.18.1)33, was used to identify heart failure cases34. The approach and implementation are detailed in the Supplementary Methods. In brief, the machine learning model was developed on a training set (n = 1091) and validated on a testing set (n = 468). Based on the final set of features selected, an independent testing set demonstrated that the model had a positive predictive value (PPV) of 0.92 and a case sensitivity and specificity of 0.67 and 0.99, respectively, as compared to manual clinical record review by a cardiologist (Supplementary Tables 4 and 5). This validated model was then deployed in the EHR population to identify heart failure cases. For cases with TTE measurements, heart failure subtypes were assigned based on LVEF (see above). While a manual record review by a cardiologist confirmed the heart failure diagnosis among all cases used in the Validation set, only 34.8% met the Universal Definition of HF (described above) based on features employed by the random forest classifier.
Descriptive statistics were generated for all HF cases as well as for each subtype (HFpEF, HFrEF, HFmEF) separately. Between group differences were assessed using the Mann–Whitney U test or Chi-square test for continuous and dichotomous variables, respectively.
We also conducted stratified analyses comparing HF cases identified before and after 2005, the approximate midpoint of the time range for which EHR data were available, to characterize secular trends in HF epidemiology. Subjects were classified by LVEF in 10-year age ranges, centered around decades for which LVEF data were present, and stratified by sex and date of HF diagnosis.
PheWAS analyses were used to comprehensively scan the medical phenome to identify clinical diagnoses differentially associated with HFrEF versus HFpEF, since these subtypes had the largest numbers of individuals. Because PheWAS phenotypes are highly correlated, we conducted a 2-step analysis to identify phenotypes independently associated with the HF subtypes. First, multivariable logistic regression analysis adjusting for age at HF diagnosis, sex, and self-reported race was used to test the association between each PheWAS code and the HF subtype (odds-ratios are the association with HFpEF, as compared to HFrEF). Phenotypes with a Benjamini–Hochberg (B–H) false discovery rate (FDR)35 q-value < 0.1 were then jointly analyzed using a multivariable logistic model in conjunction with a stepwise selection feature (using Proc Logistic in SAS) that retained all phenotypes with an independent association p < 0.05. PheWAS analyses were stratified by 10-year age ranges.
The ML algorithm also assigned a diagnoses date for each case. To better define the clinical events leading up to a HF diagnosis, we extracted ICD-9 based diagnoses during each of the three years prior to diagnosis of HF and mapped them to corresponding PheWAS phenotypes. Because ascertainment of diagnoses during this time may be incomplete in individuals referred for specialty care, analyses were limited to those individuals receiving medical care at VUMC prior to diagnosis of HF, defined as two or more primary care or cardiology outpatient visits over the three-year period. Descriptive statistics were generated for PheWAS diagnoses and medical encounters prior to HF diagnosis for both HFpEF and HFrEF subjects and were also stratified by 10-year age groups.
Results
The ML classifier identified 28,156 HF cases (Fig. 1). Of these, 8322 (30%) were classified as HFpEF, 11,677 (42%) as HFrEF, 1958 (7%) HFmEF, and 21% had no LVEF data. HFrEF cases were more likely to be male (66 vs 42%) and have coronary heart disease (78 vs. 63%), as compared to HFpEF cases, (Table 1). Additional results on the ML model are presented in the Supplementary Methods. A significant portion of the comorbidities presented in Table 1 include diagnoses associated after a HF diagnosis, the fractions before and after HF diagnosis are presented in Supplementary Table 6. Many of the comorbidities appeared after a heart failure diagnosis. Conversely, subjects with HFpEF, were older (67 vs 62 years), had higher BMI’s (30.7 [interquartile range, 25.9, 36.9] vs 28.2 [24.6, 33.0] kg/m2), and had a higher prevalence of hypertension (91% vs 87%) (p < 1 × 10–15 for all). Subjects with HFmEF variably differed with respect to HFpEF and HFrEF for these features.
HF cases entered the medical system between 1989 and 2017, with the majority entering after the 2005 (the middle timepoint) [N = 19,475 (69%)]. The relative proportions of HF subtypes, by age and sex, were similar before and after 2005 (Fig. 2). Among both sexes, the proportion of individuals with HFrEF decreased with age. Among men, HFrEF was the most common subtype, though the relative proportion of HFrEF decreased in more recent time periods and with increasing age. Among women, HFpEF was the more prevalent subtype after age 30–40 and this proportion increased with age. Compared to HF cases diagnosed before 2005, subjects entering the health system after 2005 were older (66.0 vs 62.5 years), more likely to be diagnosed with hypertension (89% vs 83%), diabetes (24% vs 18%), CKD (35% vs 28%), and AF (47% vs 43%) (p < 1 × 10–15 for all) (Supplementary Table 6A). The trend in increasing age and diagnoses of hypertension, T2D, dyslipidemia, and CKD was seen across all subclasses. The prevalence of CAD remained similar in all HF subclasses, and median BMI was above 28 and 30 for HFrEF and HFpEF, respectively. Among HFrEF subjects, those entering the medical system after 2005, were more likely to have a history of myocardial infarction (MI) (34% vs 30%; p = 1.96 × 10–8) and a slightly higher LVEF nadir (25% vs 20%; p = 1 × 10–15).
Phenome-wide comparisons identified clinical diagnoses differentially associated with HFpEF versus HFrEF cases across multiple age ranges. Among all age strata, there was a directionally consistent association between an ICD-9 based diagnosis of a heart failure subtype (HFpEF or HFrEF) and the assigned ML subtype (Fig. 3). For instance, individuals age 30–45 with HFpEF were less likely to carry a clinical diagnosis of HFrEF (odds-ratio [OR] = 0.35, 95% confidence interval [0.27–0.47], p = 2.9 × 10–14) more likely to have diagnosis of HFpEF (OR = 3.52 [2.54–4.88], p = 3.4 × 10–14) (Supplementary Table 7).
Across all ages, most diagnoses prevalent among HFrEF cases were related to the circulatory system (Supplementary Fig. 1). HFrEF was consistently associated with ischemic heart disease (IHD) and MI across age groups and was associated with valve disease and sequelae of IHD in older age groups, including ventricular arrhythmias and ECG abnormalities such as left bundle branch block (Fig. 3 and Supplementary Fig. 2).
In contrast, diseases associated with HFpEF cases were more heterogeneous and reflected a higher burden of comorbidities across the clinical disease spectrum. Among these were diagnoses related to symptomatic HF, such as respiratory failure, volume overload and edema, as well as obesity and related complications. For example, HFpEF was associated with sleep apnea at all ages, a diagnosis of obesity after age 45, and non-alcoholic liver disease after age 55 (OR = 1.59 [1.08–2.32], p = 1.8 × 10–2). It was also associated with a diagnosis of hypertrophic obstructive cardiomyopathy among individuals ages 65–75 (OR = 6.37 [3.81–10.7], p = 1.9 × 10–12).
In the youngest age group, the patterns of comorbidities were suggestive of HF etiologies secondary to congenital heart disease, extra-cardiac etiologies (e.g., pulmonary hypertension), as well as metabolic risk factors (Fig. 3, Supplemental Fig. 3).
Hospitalizations and cardiovascular evaluations became increasingly common in the years immediately prior to HF diagnosis (Fig. 4). In the year immediately preceding a HF diagnosis, approximately one quarter of both HFpEF and HFrEF subjects were hospitalized and over half were evaluated in outpatient cardiology clinics. During the three years prior to diagnosis subjects were increasingly diagnosed with cardiovascular comorbidities including IHD, hypertension, dyslipidemia, diabetes, and coronary atherosclerosis as well as symptoms such as shortness of breath, edema, and chest pain (Supplementary Fig. 3).
Overall patterns were similar between HFpEF and HFrEF, though IHD was more common in HFrEF and hypertension more common in HFpEF. HFpEF subjects also accumulated comorbid diagnoses for a slightly longer duration before a HF diagnosis, as compared to subjects with HFrEF (~ 2 years vs ~ 1 year). We examined the timing of the first mention of diuretic use, as a marker of early symptomatic heart failure, in the 3 years prior to a HF diagnosis. For HFpEF and HFrEF, respectively, 98% and 99% of loop diuretics were first mentioned only within the 6 months prior to a diagnosis (Supplementary Fig. 7, Supplementary Table 8).
Discussion
This paper defined the comorbidity profiles and patterns of healthcare utilization for a large collection of HF cases receiving routine healthcare. There were clear differences in the profiles for the HF subtypes; overall, HFpEF-associated comorbidities were less likely to be cardiac diseases, as compared to HFrEF comorbidities (summarized in Fig. 4). Many of these comorbidities were present for up to 3 years before a diagnosis of HF was made. For HFpEF, these comorbidities were often presentations of symptomatic HF as well as complications often attributable to obesity. Recognition of these comorbidity patterns may present a window of opportunity to identify individuals at risk for developing incident HFpEF prior to fulminant disease.
Our rationale for pursuing these studies was motivated by the need to provide a fuller understanding of mechanisms contributing to HF morbidity, especially HFpEF, for which there are no proven life-extending therapies. The 2020 National Heart, Lung, and Blood Institute Working Group Summary on HFpEF research priorities has proposed potential molecular mechanisms of cardiac re-modelling in HFpEF attributable to comorbidities6. In particular, non-cardiac mechanisms underlying HF development need more research, and successful implementation of therapies targeting these mechanisms requires delineation of the presence and timing of these comorbidities in the context of the natural history of the disease4.
The timing and nature of the comorbidities associated with HFpEF was indicative of a frequently insidious disease course characterized by an accumulation of diagnoses related to heart failure symptomology and volume overload such as edema, respiratory symptoms and “fluid overload”. There were also diagnoses indicative of organ damage due to poorly controlled hypertension, such as hypertensive heart disease. Other associations were indicative of advance disease, such as anemia36, and chronic conditions that are downstream sequelae of chronic heart disease such as hearing loss37. The comorbid association patterns among 30–45-year-olds also highlight mechanisms particular to the development of HF symptoms in this age group that are not classically associated with the heart failure syndrome. These include congenital heart defects and primary pulmonary hypertension, though the latter is often misclassified in EHR data sets and typically represents secondary pulmonary hypertension. Thus, this age group is likely enriched in HF subtypes not representative of the more common forms of HF seen in older adults.
HFpEF was also associated with a collection of diagnoses related to chronic obesity including morbid obesity, sleep apnea and non-alcoholic liver cirrhosis, highlighting the significant contribution of obesity to adverse LV remodeling, including diastolic dysfunction38. There was a temporality to these associations. For instance, sleep apnea was more common at a younger age, suggesting that this may be an important early clinical biomarker of risk for future HFpEF risk. In contrast, older ages were associated with the sequelae of end-organ failure, including non-alcoholic liver disease and HFpEF. It has been demonstrated that obesity is associated with an increased natriuretic peptide receptor type C/type A ratio. It has been postulated that this altered ratio may cause breakdown and deficiency of natriuretic peptide, leading to impaired cardiac function and possible progression to HFpEF4,6.
In contrast to HFpEF, HFrEF was associated with cardiovascular conditions that adversely impacted left ventricular systolic function. These associations included intrinsic (e.g., idiopathic) cardiomyopathies among individuals < 55 years old, structural heart disease (e.g., valve disease) at older ages, and ischemic heart disease across all age ranges. HFmEF patients in our study, comprising 10% of the HF cases, were a hybrid of HFpEF (female, HTN, DM2), and HFrEF (CAD burden) features, in agreement with prior observations39.
Our overall findings are corroborated by a recent study that examined the burden of 15 comorbidities among participants in the Atherosclerosis Risk In Communities (ARIC) study diagnosed with acute, decompensated heart failure40. In that study, HFpEF had a higher average burden of comorbidities, as compared to HFrEF. Importantly, a higher comorbidity burden was associated with higher mortality rates. Our study extends that by demonstrating that the higher comorbidity burden extends to a far broader spectrum of comorbidities. We also add to their findings by demonstrating that these comorbidities are antecedent in nature, are along the care continuum, and may carry a differential risk for developing incident HFpEF.
The majority of encounters occurring prior to a HF diagnosis were with non-cardiologist providers, especially for patients who went on to develop HFpEF. Approximately 50% of HFpEF patients received loop diuretics before their HF diagnosis, and most (98%) received them in the 6 months prior to HF diagnosis. Sensitizing primary providers that a loop diuretic requirement may suggest the onset of symptomatic heart failure could advance the lead time for a HF diagnosis and potentially enhance outcomes.
A large portion of research related to HFpEF has focused on adverse cardiac remodeling including diastolic dysfunction, cardiac hypertrophy and myocardial fibrosis4. Less research focuses on the multi-organ system processes that may play a role in the development of HFpEF. Our findings support the notion that HFpEF is a layered continuum characterized by premorbid states, such as obesity, that ultimately can lead to HFpEF (Fig. 4). This period of premorbid conditions may represent preclinical HFpEF, a timepoint where biomarkers like natriuretic peptide may be informative for early risk stratification. Most therapeutic trials recruit participants with fulminant HFpEF, a timepoint where refractory pathological changes have likely set in3,4,5,6,7. Better characterization of the premorbid state may identify timepoints where therapies may demonstrate greater benefit.
There are several novel aspects of the current study. The combination of a large data set, a broad representation of cardiac and non-cardiac phenotypes and large numbers of individuals across the full adult age spectrum provided an opportunity to identify important drivers of HF across the life course. In the younger age group, ischemic heart disease was a less common HF comorbidity, as compared to older ages. Cardiometabolic risk factors were common among this age group, and structural heart disease related to congenital heart defects were also more prevalent. Across all ages, HFpEF is a condition associated with a broad range of comorbidities, many of which are related to obesity. Prior to a HF diagnosis, most clinical encounters were with non-cardiologists for both HF subtypes, especially HFpEF. Importantly, the methods used here are easily portable to other EHR environments, and implementation of this approach across a broad range of EHR data sets will enable efficient, contemporaneous characterization of the evolving epidemiology of HF and will highlight important differences across diverse populations which will directed enhance treatment and prevention.
There are limitations to this study. The VUMC EHR is an observational, single site data set, which can be associated with differentially missing data elements and ascertainment biases. Thus, for instance, TTE measurements performed at outside institutions were not available for these analyses. VUMC is a tertiary care center and may represent a sicker cohort than the general population. The data elements available for analysis are influenced by clinical decision making and practice patterns, which can lead to systematic biases. There was no active follow-up of study subjects, which can lead to differential censoring of data.
Finally, these analyses used a random forest algorithm to identify cases. While the threshold to define HF cases had PPV of 92%, up to 8% of cases may be misclassified. Importantly, the machine-learning algorithm heavily loaded on the frequency of instances that an individual was assigned billing codes for a heart failure diagnosis. Thus, the case definition is heavily based on the summative assessment of the clinical provider using clinical criteria relevant to the time period when the diagnosis was made. When cases used in the Validation set were evaluated using a Universal heart failure definition20, only 35% had sufficient data available in an electronic format to confirm that they met the case definition. Thus, the validity of the case definition with respect to a contemporary epidemiological case definition of heart failure is not well-defined. To fully address the robustness of the approach, it will need to be systematically evaluated in other native clinical environments.
In summary, we characterized the clinical epidemiology of HF subtypes derived from a large clinical data set. HF subtypes demonstrated distinct patterns of clinical co-morbidities and disease progression. Of direct clinical relevance, we demonstrate that, for HFpEF, these comorbidities are often not related to cardiac disease and manifest prior to the onset of a HF diagnosis. Awareness of these stereotypical patterns of clinical presentation will enhance early detection and prevention strategies, which is essential for HFpEF, which has no proven life-extending therapies.
Data availability
The aggregate data supporting the findings of this study are available from the corresponding author upon reasonable request; however, due to institutional data use agreements, access to individual level data is limited.
Abbreviations
- EHR:
-
Electronic health record
- ML:
-
Machine learning
- RF:
-
Random forest
- HF:
-
Heart failure
- HFpEF:
-
Heart failure with preserved ejection fraction
- HFrEF:
-
Heart failure with reduced ejection fraction
- HFmEF:
-
Heart failure with mid-range ejection fraction
- PPV:
-
Positive predictive value
- VUMC SD:
-
Vanderbilt University Medical Center Synthetic Derivative
References
Benjamin, E. J. et al. Heart disease and stroke statistics—2019 update: A report from the American Heart Association. Circulation 139, e56–e66 (2019).
Virani, S. S. et al. Heart disease and stroke statistics—2020 update. Circulation 141, e139–e596 (2020).
Redfield, M. M. Heart failure with preserved ejection fraction. N. Engl. J. Med. 375, 1868–1877 (2016).
Mishra, S. & Kass, D. A. Cellular and molecular pathobiology of heart failure with preserved ejection fraction. Nat. Rev. Cardiol. 18, 1–24. https://doi.org/10.1038/s41569-020-00480-6 (2021).
Gladden, J. D., Chaanine, A. H. & Redfield, M. M. Heart failure with preserved ejection fraction. Annu. Rev. Med. 69, 65–79 (2018).
Shah, S. J. et al. Research priorities for heart failure with preserved ejection fraction. Circulation 141, 1001–1026 (2020).
Borlaug, B. A. Evaluation and management of heart failure with preserved ejection fraction. Nat. Rev. Cardiol. 17, 559–573 (2020).
Christiansen, M. N. et al. Age-specific trends in incidence, mortality and comorbidities of heart failure in Denmark 1995–2012. Circulation 135, 1214–1223 (2017).
Vasan, R. S. et al. Epidemiology of left ventricular systolic dysfunction and heart failure in the Framingham study: An echocardiographic study over 3 decades. JACC Cardiovasc. Imaging 11, 1–11 (2017).
Gerber, Y. et al. A contemporary appraisal of the heart failure epidemic in Olmsted County, Minnesota, 2000 to 2010. JAMA Intern. Med. 175, 996–1004 (2015).
Conrad, N. et al. Temporal trends and patterns in heart failure incidence: A population-based study of 4 million individuals. Lancet 391, 572–580 (2018).
Rastogi, A., Novak, E., Platts, A. E. & Mann, D. L. Epidemiology, pathophysiology and clinical outcomes for heart failure patients with a mid-range ejection fraction. Eur. J. Heart Fail. 19, 1597–1605 (2017).
Lam, C. S. P. & Solomon, S. D. The middle child in heart failure: Heart failure with mid-range ejection fraction (40–50%). Eur. J. Heart Fail. 16, 1049–1055 (2014).
Bowton, E. et al. Biobanks and electronic medical records: Enabling cost-effective research. Sci. Transl. Med. 6, 234 (2014).
Frankovich, J., Longhurst, C. A. & Sutherland, S. M. Evidence-based medicine in the EMR era. N. Engl. J. Med. 365, 1758–1759 (2011).
Roque, F. S. et al. Using electronic patient records to discover disease correlations and stratify patient cohorts. PLoS Comput. Biol. 7, e1002141 (2011).
Blecker, S. et al. Comparison of approaches for heart failure case identification from electronic health record data. JAMA Cardiol. 1, 1014 (2016).
Bielinski, S. J. et al. A robust e-epidemiology tool in phenotyping heart failure with differentiation for preserved and reduced ejection fraction: The Electronic Medical Records and Genomics (eMERGE) Network. J. Cardiovasc. Transl. 8, 475–483 (2015).
Banerjee, D. et al. An electronic medical record report improves identification of hospitalized patients with heart failure. J. Card. Fail. 22, 402–405 (2016).
Bozkurt, B. et al. Universal definition and classification of heart failure a report of the Heart Failure Society of America, Heart Failure Association of the European Society of Cardiology, Japanese Heart Failure Society and Writing Committee of the Universal Definition of Heart Failure. J. Card. Fail. 27, 387–413 (2021).
Baştanlar, Y. & Özuysal, M. Introduction to machine learning. Methods Mol. Biol. 1107, 105–128 (2014).
Beam, A. L. & Kohane, I. S. Big data and machine learning in health care. JAMA 319, 1317 (2018).
Deo, R. C. Machine learning in medicine. Circulation 132, 1920–1930 (2015).
Roden, D. et al. Development of a large-scale de-identified DNA biobank to enable personalized medicine. Clin. Pharmacol. Ther. 84, 362–369 (2008).
Israel, R. A. The International Classification of Disease: Two hundred years of development. Public Health Rep. 93, 150–152 (1978).
Xu, H. et al. MedEx: A medication information extraction system for clinical narratives. J. Am. Med. Inf. Assoc 17, 19–24 (2010).
Chapman, W. W., Bridewell, W., Hanbury, P., Cooper, G. F. & Buchanan, B. G. A simple algorithm for identifying negated findings and diseases in discharge summaries. J. Biomed. Inf. 34, 301–310 (2001).
Kirby, J. C. et al. PheKB: A catalog and workflow for creating electronic phenotype algorithms for transportability. J. Am. Med. Inf. Assoc. 23, 1046–1052 (2016).
Wells, Q. S., Farber-Eger, E. & Crawford, D. C. Extraction of echocardiographic data from the electronic medical record is a rapid and efficient method for study of cardiac structure and function. J. Clin. Bioinform. 4, 12 (2014).
Mosley, J. D. et al. Clinical features associated with nascent left ventricular diastolic dysfunction in a population aged 40 to 55 years. Am. J. Cardiol. 121, 1552–1557 (2018).
Denny, J. C. et al. PheWAS: Demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations. Bioinformatics 26, 1205–1210 (2010).
Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat. Biotechnol. 31, 1102–1111 (2013).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001).
Lam, C. S. P. et al. Cardiac dysfunction and noncardiac dysfunction as precursors of heart failure with reduced and preserved ejection fraction in the community. Circulation 124, 24–30 (2011).
Wattamwar, K. et al. Association of cardiovascular comorbidities with hearing loss in the older old. JAMA Otolaryngol. Head Neck Surg. 144, 623 (2018).
Heymsfield, S. B. & Wadden, T. A. Mechanisms, pathophysiology, and management of obesity. N. Engl. J. Med. 376, 254–266 (2017).
Hsu, J. J., Ziaeian, B. & Fonarow, G. C. Heart failure with mid-range (borderline) ejection fraction clinical implications and future directions. JACC Heart Fail. 5, 763–771 (2017).
Pandey, A. et al. Temporal trends in prevalence and prognostic implications of comorbidities among patients with acute decompensated heart failure: The ARIC study community surveillance. Circulation 142, 230–243 (2020).
Funding
This work was supported by a career development award from the Vanderbilt Faculty Research Scholars Fund (JDM), American Heart Association (AHA) 16FTF30130005 (JDM), AHA 13FTF16810038 (QSW), AHA 17IBDG33780015 (QSW), NIH/NHLBI 1R01HL140074 (QSW). The datasets used for the analyses described were obtained from VUMC’s Synthetic Derivative which is supported by numerous sources: institutional funding, private agencies, and federal grants. These include the NIH funded Shared Instrumentation Grant S10RR025141 and CTSA grants UL1TR002243, UL1TR000445, and UL1RR024975.
Author information
Authors and Affiliations
Contributions
R.T.L.: data analysis, data preparation, data interpretation, manuscript preparation. N.S.V.: data interpretation, data analysis, manuscript preparation. E.F.E.: data acquisition, data analysis, data preparation. D.M.R.: genetic data acquisition, data interpretation. T.A.L.: data analysis, data interpretation. Q.S.W.: data analysis, data preparation, data interpretation, manuscript preparation. J.D.M.: data analysis, data preparation, data interpretation, manuscript preparation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Levinson, R.T., Vaitinidin, N.S., Farber-Eger, E. et al. Heart failure clinical care analysis uncovers risk reduction opportunities for preserved ejection fraction subtype. Sci Rep 11, 18618 (2021). https://doi.org/10.1038/s41598-021-97831-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-97831-1
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.