Deep learning of ECG waveforms for diagnosis of heart failure with a reduced left ventricular ejection fraction

Choi, JungMin; Lee, Sungjae; Chang, Mineok; Lee, Yeha; Oh, Gyu Chul; Lee, Hae-Young

doi:10.1038/s41598-022-18640-8

Download PDF

Article
Open access
Published: 20 August 2022

Deep learning of ECG waveforms for diagnosis of heart failure with a reduced left ventricular ejection fraction

JungMin Choi^1,2^na1,
Sungjae Lee³^na1,
Mineok Chang³,
Yeha Lee³,
Gyu Chul Oh⁴ &
…
Hae-Young Lee^1,2

Scientific Reports volume 12, Article number: 14235 (2022) Cite this article

4857 Accesses
3 Citations
28 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 13 October 2022

This article has been updated

Abstract

The performance and clinical implications of the deep learning aided algorithm using electrocardiogram of heart failure (HF) with reduced ejection fraction (DeepECG-HFrEF) were evaluated in patients with acute HF. The DeepECG-HFrEF algorithm was trained to identify left ventricular systolic dysfunction (LVSD), defined by an ejection fraction (EF) < 40%. Symptomatic HF patients admitted at Seoul National University Hospital between 2011 and 2014 were included. The performance of DeepECG-HFrEF was determined using the area under the receiver operating characteristic curve (AUC) values. The 5-year mortality according to DeepECG-HFrEF results was analyzed using the Kaplan–Meier method. A total of 690 patients contributing 18,449 ECGs were included with final 1291 ECGs eligible for the study (mean age 67.8 ± 14.4 years; men, 56%). HFrEF (+) identified an EF < 40% and HFrEF (−) identified EF ≥ 40%. The AUC value was 0.844 for identifying HFrEF among patients with acute symptomatic HF. Those classified as HFrEF (+) showed lower survival rates than HFrEF (−) (log-rank p < 0.001). The DeepECG-HFrEF algorithm can discriminate HFrEF in a real-world HF cohort with acceptable performance. HFrEF (+) was associated with higher mortality rates. The DeepECG-HFrEF algorithm may help in identification of LVSD and of patients at risk of worse survival in resource-limited settings.

ECG-guided non-invasive estimation of pulmonary congestion in patients with heart failure

Article Open access 09 March 2023

Aniruddh Raghu, Daphne Schlesinger, … Collin M. Stultz

Multi-center retrospective cohort study applying deep learning to electrocardiograms to identify left heart valvular dysfunction

Article Open access 14 February 2023

Akhil Vaid, Edgar Argulian, … Benjamin S. Glicksberg

Development and validation of deep learning ECG-based prediction of myocardial infarction in emergency department patients

Article Open access 15 November 2022

Stefan Gustafsson, Daniel Gedon, … Johan Sundström

Introduction

Left ventricular systolic dysfunction (LVSD) increases the risk of systemic embolism, stroke, and death compared to heart failure (HF) with preserved LV systolic function¹. Although Vasan et al. showed a decline in asymptomatic LVSD over the past three decades, the prognosis of LVSD has remained unchanged, emphasizing the importance of early diagnosis and adequate management of LVSD². While echocardiography is the standard tool for LVSD diagnosis, the results are highly influenced by operator-dependent factors and its interpretation is subjective, resulting in high dependence to assessor’s expertise³. These limitations restrict the routine use of echocardiography in a resource-limited medical setting. Thus, the development of alternative screening tools for LVSD has been attempted, such as biochemical options and electrocardiogram (ECG)^4,5,6,7,8,9.

The use of ECG for LVSD diagnosis has been ongoing since 1996, from identification of simple abnormalities on ECG to the more recent development of artificial intelligence (AI) algorithms^{5,7,8,9,10,11,12,13,14,15}. Various AI algorithms have been developed and performed based on different definitions of LVSD (e.g., ejection fraction (EF) < 35%^7,10,14, < 40%^8,9,11,12,13, or < 50%¹²) and for distinct study populations^9,13. Despite advancement in AI-based LVSD diagnosis, an AI algorithm to identify LVSD patients with an EF < 40% has not been validated in a clinical population of patients with symptomatic HF regardless of EF. To address this gap, we validated the previously developed AI algorithm by Cho et al.⁸ into a deep learning-aided algorithm using ECG for HF with reduced ejection fraction (DeepECG-HFrEF) to identify LVSD, specifically LVSD with an EF < 40% among symptomatic HF patients regardless of EF. For training, we used 12-lead 10 s ECGs recorded from patients with symptomatic HF at Seoul National University Hospital who were enrolled in the Korean Acute HF (KorAHF) Registry. We further evaluated the predictive power of the DeepECG-HFrEF on 5-year all-cause mortality.

Results

Baseline characteristics

A total of 690 patients, contributing 18,449 ECGs, who were hospitalized for acute HF were eligible. Of these, those with no matching echocardiography within one month of enrollment (191 ECGs from 2 patients) and ECGs that were not the closest matching to the echocardiography (16,979 ECGs from 14 patients) were excluded. After exclusion, 675 patients contributing 1291 ECGs were included in the analysis (Fig. 1). The mean time interval between the ECG and echocardiography was 29.1 h, with over 82.1% (1060/1291) of the ECGs matched within 24 h of the index echocardiography.

Characteristics of the patients based on the archived ECGs classified by DeepECG-HFrEF algorithm are presented in Table 1. Characteristics of the study population according to echocardiographic results at enrollment are summarized in Supplemental Table S1. Owing to the usage of multiple ECGs from the same patient, the sum of DeepECG-HFrEF based patient-set was larger than the original patient-set. However, the paired datasets were used per patient mostly once or twice. Those classified in the DeepECG-HFrEF (+) group were more likely to be men, to have more comorbidities, to be admitted for de novo HF than for acute decompensated HF, and to present with more severe symptoms of dyspnea. Among the etiologies of HF, ischemic (45.8%) was the most common etiology in the DeepECG-HFrEF (+) group, whereas valvular heart disease (30.1%) was the most common etiology in the DeepECG-HFrEF (−) group. The most prevalent HF group also differed between the two groups, with HFrEF being the most prevalent in the DeepECG-HFrEF (+) group and HFpEF in the DeepECG-HFrEF (−) group. The echocardiographic values differed between the two groups. The DeepECG-HFrEF (+) group showed worse EF along with worse early diastolic velocity (E/e’), and right ventricle systolic pressure (RVSP). These results were consistently observed when confined to ECGs specifically corresponding to HFrEF patients (Supplement Table S2). The confidence score of DeepECG-HFrEF for each ECG was presented with corresponding left ventricular end systolic dimension (LVESD) as scatterplot (Supplement Figure S1). False-positive cases appeared to have smaller LVESD than true-positive cases and similar pattern was seen on false-negative cases when compared to true-negative cases.

Table 1 Clinical data of the patients according to the DeepECG-HFrEF algorithm.

Full size table

ECG findings

The differences in various ECG parameters between the two DeepECG-HFrEF groups are shown in Table 2. The DeepECG-HFrEF (+) group showed a higher heart rate with longer QRS duration and QTc interval, as well as more prominent QRS widening, QTc prolongation, and Q wave. The two groups did not differ in the PR interval, PR prolongation, or axis. Among causes of QRS widening, left bundle branch block (LBBB) and intraventricular conduction delay (IVCD) were more common in the DeepECG-HFrEF (+) than (−) group (p = 0.001). Similar patterns were observed within the ECGs of HFrEF patients (Supplement Table S3).

Table 2 ECG findings according to the DeepECG-HFrEF algorithm.

Full size table

Performance of the DeepECG-HFrEF algorithm for different EF cut-offs

The performance of the DeepECG-HFrEF algorithm for different EF cut-off values are reported in Supplement Table S4. Using the optimal cut-off, based on Youden's index, the AUC value for identifying HFrEF among patients with HF was 0.845. For an EF < 40% cut-off, the sensitivity was 0.779, with specificity of 0.763, positive predictive value (PPV) of 0.708, negative predictive value (NPV) of 0.824, and accuracy of 0.770. The AUC, sensitivity, PPV, and accuracy increased, while NPV decreased with an increase in EF.

Performance of the DeepECG-HFrEF algorithm according to actual EF

The proportion of patients diagnosed with DeepECG-HFrEF (+) increased when the actual EF was lower (Fig. 2A). The DeepECG-HFrEF algorithm was more likely to yield false-positive and false-negative results when the actual EF was near 40% (Fig. 2B). The scatter plot also shows a higher proportion of correct classifications (true-positives) when the actual EF was lower (Fig. 3).

Performance of DeepECG-HFrEF algorithm in different subpopulations

Figure 4 is a forest plot of the AUC and associated 95% confidence interval (CI) for the DeepECG-HFrEF algorithm according to various clinical patient parameters. The performance of the DeepECG-HFrEF algorithm was slightly better in the subgroups of patients: age ≤ 70 years, without hypertension, non-ischemic HF, sinus rhythm, PR interval ≤ 200 ms, QRS duration ≤ 140 ms, corrected QT interval of ≤ 450 ms for men and ≤ 470 ms for women, and normal axis or LAD.

The 5-year all-cause mortality

Overall, the 5-year survival was worse in the DeepECG-HFrEF (+) than (−) group (p < 0.001; Fig. 5A). The Kaplan–Meier curve also showed a lower survival rate among patients with an actual EF< 40% (Fig. 5B). The crude and adjusted hazard ratios (HRs) for 5-year all-cause mortality for the three different models are reported in Table 3 All components of model 1 showed significantly increased crude HR and multivariable-adjusted HR. In model 2, echocardiographic EF < 40% added to model 1, DeepECG-HFrEF (+) remained as significantly higher HR even after multivariable-adjustment. In model 3, which included a B-type natriuretic peptide (BNP) > 500 pg/mL added to model 1, DeepECG-HFrEF (+) was offset by BNP.

Table 3 Crude and adjusted hazard ratio for 5-year all-cause mortality among 1291 of heart failure ECGs.

Full size table

Discussion

In this study, we validated the DeepECG-HFrEF to identify LVSD in patients with symptomatic HF regardless of EF and evaluated the predictive power of the algorithm for the 5-year all-cause mortality. The DeepECG-HFrEF algorithm showed outstanding performance in discriminating LVSD among patients with HF. DeepECG-HFrEF (+) was associated with a worse 5-year survival, even when compared to using the actual EF value. To our knowledge, this is the first study to validate the performance of a deep learning-based AI algorithm for LVSD detection and to show risk predictability in symptomatic patients with HF.

LVSD is identified in 40–50% of patients with HF¹⁶. Although survival rates of patients with HF have recently improved in developed countries, patients with HF still show an eight-fold higher mortality than an age-matched population^17,18. Not only does HF increase the risk of mortality, but the associated economic burden cannot be overlooked. The economic burden of HF was estimated to be $108 billion per annum globally in 2012, with 60% direct costs to the healthcare system and 40% indirect costs to society through morbidity and others¹⁹. Such burden is even higher in Asian countries compared to the United States, with a large proportion of the HF-related healthcare costs directly associated to hospitalization²⁰. The impact of this burden is accentuated among elderly patients, with almost three-quarters of the total resources assigned to HF being solely devoted to the older population²¹. The increase in the proportion of elderly individuals in the general population, social ageing phenomenon, is consistent throughout the world, with the elderly population projected to double to almost 1.6 billion globally, from 2025 to 2050²². Considering the economic burden of HF in the elderly population, there is a need to improve early diagnosis and treatment of LVSD to slow or even prevent its progression to HF.

A summary of currently developed AI algorithms for the detection of LVSD and the validation of these algorithms is provided in Supplementary Table S5. The definition of LVSD and the primary endpoint differed among studies, with an EF cut-off of 35% to 40% having been used. The study population used for validation also differed between the studies, from using patients at a community general hospital to patients in cardiac intensive care unit and patients with COVID-19^9,12,13. As a result of these differences in the clinical population used, the proportion of patients within the validation population varied between 2 and 20%^7,11. Our study is the first to validate the algorithm to detect LVSD solely using patients with HF. Our results showed the strength of the DeepECG-HFrEF algorithm to discriminate LVSD even when the prevalence of HF is high.

Despite recent advances in HF pharmacotherapy, the mortality and rehospitalization rates of patients with HF are still high. Therefore, the identification of high-risk patients who would benefit the most from comprehensive HF treatment is urgently required²³. A few studies suggested the promising role of AI support for the early diagnosis of low EF¹⁵. Regarding AI for the detection of LVSD, only one study, by Attia et al., reported on the power of an AI algorithm to predict future LVSD development⁷. Our study is the first to show an association between long-term survival and LVSD of patients with HF based on an AI algorithm. Our results show that the AI algorithm can identify abnormalities in ECG before overt LVSD is observed on echocardiography.

The AI algorithms are known for being a “black box” with exact mechanism unexplainable. However, there are some ECG characteristics in the DeepECG-HFrEF (+) group which might have contributed to the prognostic performance of the algorithm. The DeepECG-HFrEF (+) group had significantly increased corrected QT intervals and increased proportions of LBBB and IVCD. A study by Lee et al. showed that LBBB and IVCD were associated with an increased risk of all-cause mortality and rehospitalization due to HF aggravation²⁴. Regarding the QTc interval, a study by Park et al. showed a J-curve association between the corrected QT interval and mortality among patients with acute HF, with a nadir of 440–450 ms in men and 470–480 ms in women²⁵. Thus, such an association might be one of the factors used by the DeepECG-HFrEF algorithm to differentiate between the two groups. Nevertheless, as our study did not specifically differentiate the corrected QT interval according to sex, the application of results by Park et al. should be done with caution²⁵. Thus, we can carefully interpret that the features shown in the DeepECG-HFrEF (+) group, such as LBBB and IVCD, might be factors that the algorithm is searching for group classification.

There is no clear explanation for the increased false-positive and false-negative rates among patients with an EF near 40%. One plausible explanation might be that the clustering near an EF of 40% may be a heterogeneous group. A previous study by Rastogi et al. showed heterogeneity in the underlying demographics of HFmrEF to be associated with changes in EF over time²⁶. Among the HFmrEF groups, improvement in EF tends to be associated with coronary artery disease, while a worsening of EF is more likely to coexist with hypertension and diastolic dysfunction²⁶. Patients with acute coronary syndrome are more likely to have dynamic changes in their ECGs and EF over a short period of time^27,28. As ischemia was the leading cause of acute HF among patients in the KorAHF Registry, such dynamic changes might have contributed to heterogeneity, resulting in a discrepancy between actual EF and DeepECG-HFrEF algorithm results²⁹.

Limitations

The limitations of our study need to be acknowledged in the interpretation of results. First, owing to the retrospective design used, causation between identified factors of LVSD among patients with HF could not be inferred. Further validation of the algorithm using a prospective study design is needed. Second, generalization of our results is limited, and should be cautiously interpreted, as the study population was drawn from a single hospital site in Korea. Further studies on a wider range of race and ethnicity are necessary, as done per the study conducted by the Mayo Clinic using an artificial intelligence-augmented electrocardiogram (AI-ECG) in the United States and Uganda^9,14. Third, although most of the ECGs were matched to echocardiography within 24 h, some were performed within 30 days. Although these time gaps might influence the performance of our model, the mean ± standard deviation of time gaps for true positive, false positive, false negative, and true negative are 22.0 (± 65.6), 30.6 (± 86.4), 31.3 (± 107.3), and 33.6 (± 90.2), respectively, which was not statistically significant (p = 0.192). Also, the performance of the algorithm although the 30-day maximum has generally been accepted in previous studies^10,12. It is important to note that the ECG matched to echocardiography within 24 h comprised 82.1% of the data used in this study. Fourth, HF medication compliance was not considered. As angiotensin-converting enzyme inhibitors and beta-blockers are known to have a favorable prognosis for the treatment of LVSD, data on such medication adherence would have affected survival. Fifth, our study focused on the association between ECG and echocardiography and included multiple ECG and echocardiographic data from one person. This may have had a slight influence on the survival analysis. A sequential study using a single ECG and echocardiography from individual patients would be useful to confirm our results. Lastly, our study used visually estimated EF values documented by the examiners because EF measurement by Simpson’s biplane or other calculated methods were inadequate either by poor echocardiographic window or severely unbalanced myocardial contraction (61 out of 1291 cases).

Conclusions

The DeepECG-HFrEF algorithm showed acceptable performance in distinguishing HFrEF in a real-world HF cohort. Patients with a DeepECG-HFrEF (+) classification had a significantly worse 5-year survival. Application of the DeepECG-HFrEF algorithm may be of specific benefit in resource-limited clinical settings where echocardiography is not readily eligible to identify high-risk patients who may benefit from active therapeutic intervention.

Methods

Statement of ethics

Our Institutional Review Boards approved this retrospective database study at Seoul National University Hospital (No.2012-191-1186). The requirement for informed consent from the study subjects was waived by the IRB of Seoul National University Hospital due to the retrospective study design. All research was performed in accordance with the Declaration of Helsinki. Use of the data from the KorAHF Registry was previously approved (Institutional Review Boards of Seoul National University Hospital No. 2004-166-1119)²⁹.

Study population

This was a retrospective validation study of the AI ECG algorithm for patients with symptomatic HF at Seoul National University Hospital. The ECGs used to validate the DeepECG-HFrEF for the diagnosis of HFrEF were retrieved from the KorAHF Registry. Eligible were patients who had undergone ECG and echocardiography within a 30-day interval. Patients with missing demographics, ECGs, and echocardiographic information were excluded. For patients who underwent repeated ECGs and echocardiography, all records were verified, and the ECGs performed closest to (before or after) the index echocardiography selected for analysis. All ECGs included in the analysis were manually reviewed by two certified cardiologists to confirm the cardiac rhythm diagnosis.

Data management

Demographic and echocardiographic data, and clinical outcomes were obtained from the KorAHF Registry^25,29. The 12-lead ECGs were performed using the MUSE system (MAC 5500 HD, versions 5D to 8, GE Healthcare), at a sampling rate of 500 Hz. The left ventricular EF was determined using the following hierarchical approach: Simpson’s biplane method was used preferentially; if this was not available, then other calculated methods were used; and finally, if EF could not be calculated, then visual estimation was used. HF was classified according to the left ventricular EF, as follows: HFrEF (EF < 40%); HF with mildly reduced EF (HFmrEF, EF: 40–50%); and HF with preserved EF (HFpEF, EF > 50%)³⁰.

AI Algorithm

The original convolutional neural network (CNN)-based algorithm was previously described, developed, and externally validated⁸. The DeepECG-HFrEF algorithm to detect a LVEF < 40% was validated to detect an EF < 40% from 12-lead 10 s ECGs data of HF patients. The algorithm was implemented on the TensorFlow (Google, Mountain View, CA) framework and written in Python (version 3.6; Python Software Foundation, Beaverton, OR). For this study, the algorithm was newly implemented on PyTorch (Facebook, Menlo Park, CA), with no additional training or optimization of the original algorithm. The output for the algorithm is a continuous value between 0 and 1, representing a confidence score for an EF < 40%. Using a certain cut-off value, all tests either had a positive (+) or negative (−) result, and none of the tests were considered intermediate.

Statistical analysis

A comprehensive panel of diagnostic performance metrics was summarized to evaluate the performance of the DeepECG-HFrEF algorithm. In particular, the sensitivity, specificity, PPV, NPV, accuracy, and accuracy of the validation study were determined using the original algorithm positive (+) of greater than or equal to the cut-off of 0.370, indicating that the input ECG had a confidence score of 0.370 to detect a LVEF < 40%⁸. The AUC with confidence interval was evaluated via a 2000-sample bootstrapping method. We examined the optimal threshold, which is defined as the threshold that maximizes the sum of sensitivity and specificity (i.e., Youden’s index). Continuous variables are presented as the mean ± standard deviation and compared using the unpaired Student’s t-test. Categorical variables were expressed as frequencies or percentages and were compared using the chi-squared test. For the secondary objective of exploring the long-term prognostic impact of DeepECG-HFrEF (+), the Kaplan–Meier method was used with between-group differences assessed using the log-rank test. The Cox proportional-hazards regression model was used to identify the predictors of 5-year all-cause mortality. The performance of three models was evaluated: DeepECG-HFrEF (+) model 1 (age > 70 years, diabetes, ischemic heart disease, and chronic kidney disease (CKD) stage 4–5); DeepECG-HFrEF (+) model 2 (echocardiographic results of EF < 40%, age > 70 years, diabetes, ischemic heart disease, and CKD stage 4–5); and DeepECG-HFrEF (+) model 3 (BNP > 500 pg/mL, age > 70 years, diabetes, ischemic heart disease, and CKD stage 4–5). All reported p-values were two-sided, with a p-value < 0.05 considered significant. Statistical analyses were performed using IBM SPSS Statistics version 23 (IBM Co., Armonk, NY, USA).

Data availability

The datasets used and/or analyzed during the current study are all available from the corresponding author on reasonable request.

Change history

13 October 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41598-022-22012-7

References

McMurray, J. J. V. et al. Left ventricular systolic dysfunction, heart failure, and the risk of stroke and systemic embolism in patients with atrial fibrillation. Circ. Heart Fail. 6, 451–460. https://doi.org/10.1161/CIRCHEARTFAILURE.112.000143 (2013).
Article CAS PubMed Google Scholar
Vasan, R. S. et al. Epidemiology of left ventricular systolic dysfunction and heart failure in the framingham study: an echocardiographic study over 3 decades. JACC: Cardiovasc. Imaging 11, 1–11. https://doi.org/10.1016/j.jcmg.2017.08.007 (2018).
Article Google Scholar
Hoffmann, R. et al. Analysis of interinstitutional observer agreement in interpretation of dobutamine stress echocardiograms. J. Am. Coll. Cardiol. 27, 330–336. https://doi.org/10.1016/0735-1097(95)00483-1 (1996).
Article CAS PubMed Google Scholar
McDonagh, T. A. et al. Biochemical detection of left-ventricular systolic dysfunction. Lancet 351, 9–13. https://doi.org/10.1016/S0140-6736(97)03034-1 (1998).
Article CAS PubMed Google Scholar
Davie, A. P. et al. Value of the electrocardiogram in identifying heart failure due to left ventricular systolic dysfunction. BMJ : Br. Med. J. 312, 222. https://doi.org/10.1136/bmj.312.7025.222 (1996).
Article CAS Google Scholar
Potter, E. L. et al. Machine learning of ECG waveforms to improve selection for testing for asymptomatic left ventricular dysfunction. JACC: Cardiovasc. Imaging 14, 1904–1915. https://doi.org/10.1016/j.jcmg.2021.04.020 (2021).
Article Google Scholar
Attia, Z. I. et al. Screening for cardiac contractile dysfunction using an artificial intelligence–enabled electrocardiogram. Nat. Med. 25, 70–74. https://doi.org/10.1038/s41591-018-0240-2 (2019).
Article CAS PubMed Google Scholar
Cho, J. et al. Artificial intelligence algorithm for screening heart failure with reduced ejection fraction using electrocardiography. ASAIO J. 67, 314–321. https://doi.org/10.1097/mat.0000000000001218 (2021).
Article PubMed Google Scholar
Jentzer, J. C. et al. Left ventricular systolic dysfunction identification using artificial intelligence-augmented electrocardiogram in cardiac intensive care unit patients. Int. J. Cardiol. 326, 114–123. https://doi.org/10.1016/j.ijcard.2020.10.074 (2021).
Article PubMed Google Scholar
Adedinsewo, D. et al. Artificial intelligence-enabled ECG algorithm to identify patients with left ventricular systolic dysfunction presenting to the emergency department with dyspnea. Circ. Arrhythm. Electrophysiol. 13, e008437. https://doi.org/10.1161/circep.120.008437 (2020).
Article CAS PubMed Google Scholar
Kashou, A. H. et al. Artificial intelligence-augmented electrocardiogram detection of left ventricular systolic dysfunction in the general population. Mayo. Clin. Proc. 96, 2576–2586. https://doi.org/10.1016/j.mayocp.2021.02.029 (2021).
Article PubMed Google Scholar
Kwon, J. M. et al. Development and validation of deep-learning algorithm for electrocardiography-based heart failure identification. Korean. Circ. J. 49, 629–639. https://doi.org/10.4070/kcj.2018.0446 (2019).
Article PubMed PubMed Central Google Scholar
Attia, Z. I., Kapa, S., Noseworthy, P. A., Lopez-Jimenez, F. & Friedman, P. A. Artificial intelligence ECG to detect left ventricular dysfunction in COVID-19: a case series. Mayo. Clin. Proc. 95, 2464–2466. https://doi.org/10.1016/j.mayocp.2020.09.020 (2020).
Article CAS PubMed Google Scholar
Mondo, C. K. et al. External validation of an electrocardiography artificial intelligence-generated algorithm to detect left ventricular systolic function in a general cardiac clinic in Uganda. Eur. Heart J. https://doi.org/10.1093/ehjci/ehaa946.1013 (2020).
Article Google Scholar
Yao, X. et al. Artificial intelligence–enabled electrocardiograms for identification of patients with low ejection fraction: a pragmatic, randomized clinical trial. Nat. Med. 27, 815–819. https://doi.org/10.1038/s41591-021-01335-4 (2021).
Article CAS PubMed Google Scholar
Davies, M. K. et al. Prevalence of left-ventricular systolic dysfunction and heart failure in the echocardiographic heart of England screening study: a population based study. Lancet 358, 439–444. https://doi.org/10.1016/S0140-6736(01)05620-3 (2001).
Article CAS PubMed Google Scholar
Wilhelmsen, L., Rosengren, A., Eriksson, H. & Lappas, G. Heart failure in the general population of men—morbidity, risk factors and prognosis. J. Intern. Med. 249, 253–261. https://doi.org/10.1111/j.1365-2796.2001.00801.x (2001).
Article CAS PubMed Google Scholar
Rahimi, K., Duncan, M., Pitcher, A., Emdin, C. A. & Goldacre, M. J. Mortality from heart failure, acute myocardial infarction and other ischaemic heart disease in England and Oxford: a trend study of multiple-cause-coded death certification. J. Epidemiol. Community Health 69, 1000–1005. https://doi.org/10.1136/jech-2015-205689 (2015).
Article PubMed Google Scholar
Cook, C., Cole, G., Asaria, P., Jabbour, R. & Francis, D. P. The annual global economic burden of heart failure. Int. J. Cardiol. 171, 368–376. https://doi.org/10.1016/j.ijcard.2013.12.028 (2014).
Article PubMed Google Scholar
Chung, H. & Sohn, I. S. Economic burden of heart failure in Asian countries based on real-world data. Korean Circ. J. 51, 694–695. https://doi.org/10.4070/kcj.2021.0197 (2021).
Article PubMed PubMed Central Google Scholar
Liao, L., Allen, L. A. & Whellan, D. J. Economic burden of heart failure in the elderly. Pharmacoeconomics 26, 447–462. https://doi.org/10.2165/00019053-200826060-00001 (2008).
Article PubMed Google Scholar
Wan He, D. G., and Paul Kowal. An Aging World: 2015, https://www.census.gov/library/publications/2016/demo/P95-16-1.html (2014).
Choi, H. M., Park, M. S. & Youn, J. C. Update on heart failure management and future directions. Korean J. Intern. Med. 34, 944. https://doi.org/10.3904/kjim.2018.428.e1 (2019).
Article PubMed PubMed Central Google Scholar
Lee, J. H. et al. Prognostic implication of ventricular conduction disturbance pattern in hospitalized patients with acute heart failure syndrome. Korean Circ. J. 49, 602–611. https://doi.org/10.4070/kcj.2018.0290 (2019).
Article CAS PubMed PubMed Central Google Scholar
Park, C. S. et al. J-curve relationship between corrected QT interval and mortality in acute heart failure patients. Korean J. Intern. Med. 35, 1371–1384. https://doi.org/10.3904/kjim.2019.030 (2020).
Article PubMed PubMed Central Google Scholar
Rastogi, A., Novak, E., Platts, A. E. & Mann, D. L. Epidemiology, pathophysiology and clinical outcomes for heart failure patients with a mid-range ejection fraction. Eur. J. Heart Fail. 19, 1597–1605. https://doi.org/10.1002/ejhf.879 (2017).
Article CAS PubMed Google Scholar
Nikus, K. et al. Electrocardiographic classification of acute coronary syndromes: a review by a committee of the International society for holter and non-invasive electrocardiology. J. Electrocardiol. 43, 91–103. https://doi.org/10.1016/j.jelectrocard.2009.07.009 (2010).
Article PubMed Google Scholar
Ezekowitz, J. A. et al. Declining in-hospital mortality and increasing heart failure incidence in elderly patients with first myocardial infarction. J. Am. Coll. Cardiol. 53, 13–20. https://doi.org/10.1016/j.jacc.2008.08.067 (2009).
Article PubMed Google Scholar
Lee, S. E. et al. A multicentre cohort study of acute heart failure syndromes in Korea: rationale, design, and interim observations of the korean acute heart failure (KorAHF) registry. Eur. J. Heart Fail. 16, 700–708. https://doi.org/10.1002/ejhf.91 (2014).
Article PubMed Google Scholar
Hsu, J. J., Ziaeian, B. & Fonarow, G. C. Heart failure with mid-range (borderline) ejection fraction: clinical implications and future directions. JACC Heart Fail. 5, 763–771. https://doi.org/10.1016/j.jchf.2017.06.013 (2017).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank Editage (www.editage.co.kr) for English language editing.

Funding

No funding was received in this study. Development of the AI algorithm was supported by VUNO Inc., Seoul, Korea. JMC, GCO, HYL, SL, MC, YL have no financial conflicts of interest. All authors contributed to the data collection, data interpretation, and literature research, and were involved in all stages of manuscript development.

Author information

These authors contributed equally: JungMin Choi and Sungjae Lee.

Authors and Affiliations

Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
JungMin Choi & Hae-Young Lee
Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
JungMin Choi & Hae-Young Lee
VUNO Inc, Seoul, Republic of Korea
Sungjae Lee, Mineok Chang & Yeha Lee
Division of Cardiology, Department of Internal Medicine, Seoul St. Mary’s Hospital, Seoul, Republic of Korea
Gyu Chul Oh

Authors

JungMin Choi
View author publications
You can also search for this author in PubMed Google Scholar
Sungjae Lee
View author publications
You can also search for this author in PubMed Google Scholar
Mineok Chang
View author publications
You can also search for this author in PubMed Google Scholar
Yeha Lee
View author publications
You can also search for this author in PubMed Google Scholar
Gyu Chul Oh
View author publications
You can also search for this author in PubMed Google Scholar
Hae-Young Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the data collection, data interpretation, and literature research, and were involved in all stages of manuscript development.

Corresponding author

Correspondence to Hae-Young Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: The original version of this Article contained an error in Figure 5, where the label of the orange-colored line in panel (b) was incorrectly given as "DeepECG HFrEF (-) among EF < 40%".

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Choi, J., Lee, S., Chang, M. et al. Deep learning of ECG waveforms for diagnosis of heart failure with a reduced left ventricular ejection fraction. Sci Rep 12, 14235 (2022). https://doi.org/10.1038/s41598-022-18640-8

Download citation

Received: 19 February 2022
Accepted: 17 August 2022
Published: 20 August 2022
DOI: https://doi.org/10.1038/s41598-022-18640-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

ECG-guided non-invasive estimation of pulmonary congestion in patients with heart failure

Multi-center retrospective cohort study applying deep learning to electrocardiograms to identify left heart valvular dysfunction

Development and validation of deep learning ECG-based prediction of myocardial infarction in emergency department patients

Introduction

Results

Baseline characteristics

ECG findings

Performance of the DeepECG-HFrEF algorithm for different EF cut-offs

Performance of the DeepECG-HFrEF algorithm according to actual EF

Performance of DeepECG-HFrEF algorithm in different subpopulations

The 5-year all-cause mortality

Discussion

Limitations

Conclusions

Methods

Statement of ethics

Study population

Data management

AI Algorithm

Statistical analysis

Data availability

Change history

13 October 2022

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links