Development and Validation of Deep-Learning Algorithm for Electrocardiography-Based Heart Failure Identification

Joon Myoung Kwon, Kyung Hee Kim, Ki Hyun Jeon, Hyue Mee Kim, Min Jeong Kim, Sung Min Lim, Pil Sang Song, Jinsik Park, Rak Kyeong Choi, Byung Hee Oh, Joon Myoung Kwon, Kyung Hee Kim, Ki Hyun Jeon, Hyue Mee Kim, Min Jeong Kim, Sung Min Lim, Pil Sang Song, Jinsik Park, Rak Kyeong Choi, Byung Hee Oh

Abstract

Background and objectives: Screening and early diagnosis for heart failure (HF) are critical. However, conventional screening diagnostic methods have limitations, and electrocardiography (ECG)-based HF identification may be helpful. This study aimed to develop and validate a deep-learning algorithm for ECG-based HF identification (DEHF).

Methods: The study involved 2 hospitals and 55,163 ECGs of 22,765 patients who performed echocardiography within 4 weeks were study subjects. ECGs were divided into derivation and validation data. Demographic and ECG features were used as predictive variables. The primary endpoint was detection of HF with reduced ejection fraction (HFrEF; ejection fraction [EF]≤40%), and the secondary endpoint was HF with mid-range to reduced EF (≤50%). We developed the DEHF using derivation data and the algorithm representing the risk of HF between 0 and 1. We confirmed accuracy and compared logistic regression (LR) and random forest (RF) analyses using validation data.

Results: The area under the receiver operating characteristic curves (AUROCs) of DEHF for identification of HFrEF were 0.843 (95% confidence interval, 0.840-0.845) and 0.889 (0.887-0.891) for internal and external validation, respectively, and these results significantly outperformed those of LR (0.800 [0.797-0.803], 0.847 [0.844-0.850]) and RF (0.807 [0.804-0.810], 0.853 [0.850-0.855]) analyses. The AUROCs of deep learning for identification of the secondary endpoint was 0.821 (0.819-0.823) and 0.850 (0.848-0.852) for internal and external validation, respectively, and these results significantly outperformed those of LR and RF.

Conclusions: The deep-learning algorithm accurately identified HF using ECG features and outperformed other machine-learning methods.

Keywords: Artificial intelligence; Deep learning; Electrocardiography; Heart failure; Machine learning.

Conflict of interest statement

The authors have no financial conflicts of interest.

Copyright © 2019. The Korean Society of Cardiology.

Figures

Figure 1. Study flow chart.
Figure 1. Study flow chart.
DEHF = deep-learning algorithm for electrocardiography-based heart failure identification; ECG = electrocardiography.
Figure 2. Development and validation of DEHF.
Figure 2. Development and validation of DEHF.
DEHF = deep-learning algorithm for electrocardiography-based heart failure identification; ECG = electrocardiography; HF = heart failure.
Figure 3. AUROC of each algorithm for…
Figure 3. AUROC of each algorithm for identification of HF.
AUROC = area under the receiver operating characteristic curve; EF = ejection fraction; HF = heart failure; HFrEF = heart failure with reduced ejection fraction; HFmrEF = heart failure with mid-range ejection fraction.

References

    1. Ziaeian B, Fonarow GC. Epidemiology and aetiology of heart failure. Nat Rev Cardiol. 2016;13:368–378.
    1. Ponikowski P, Anker SD, AlHabib KF, et al. Heart failure: preventing disease and death worldwide. ESC Heart Fail. 2014;1:4–25.
    1. Ambrosy AP, Fonarow GC, Butler J, et al. The global health and economic burden of hospitalizations for heart failure: lessons learned from hospitalized heart failure registries. J Am Coll Cardiol. 2014;63:1123–1133.
    1. Yancy CW, Jessup M, Bozkurt B, et al. 2017 ACC/AHA/HFSA focused update of the 2013 ACCF/AHA guideline for the management of heart failure: a report of the American College of Cardiology/American Heart Association task force on clinical practice guidelines and the Heart Failure Society of America. J Am Coll Cardiol. 2017;70:776–803.
    1. Bagley SC, White H, Golomb BA. Logistic regression in the medical literature: standards for use and reporting, with particular attention to one medical domain. J Clin Epidemiol. 2001;54:979–985.
    1. Breiman L. Statistical modeling: the two cultures. Stat Sci. 2001;16:199–215.
    1. Nainwal A, Kumar Y, Jha B. Morphological changes in congestive heart failure ECG; 2nd International Conference on Advances in Computing, Communication, & Automation (ICACCA); 2016 Sep 30–Oct 1; Fri, India. Bareilly: Institute of Electrical and Electronics Engineers; 2016.
    1. Hendry PB, Krisdinarti L, Erika M. Scoring system based on electrocardiogram features to predict the type of heart failure in patients with chronic heart failure. Cardiol Res. 2016;7:110–116.
    1. Attia ZI, Kapa S, Lopez-Jimenez F, et al. Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med. 2019;25:70–74.
    1. Sengupta PP, Kulkarni H, Narula J. Prediction of abnormal myocardial relaxation from signal processed surface ECG. J Am Coll Cardiol. 2018;71:1650–1660.
    1. Johnson KW, Torres Soto J, Glicksberg BS, et al. Artificial intelligence in cardiology. J Am Coll Cardiol. 2018;71:2668–2679.
    1. Ting DS, Cheung CY, Lim G, et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes. JAMA. 2017;318:2211–2223.
    1. Kwon JM, Lee Y, Lee Y, Lee S, Park J. An algorithm based on deep learning for predicting in-hospital cardiac arrest. J Am Heart Assoc. 2018;7:e008678.
    1. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444.
    1. Pal SK, Mitra S. Multilayer perceptron, fuzzy sets, and classification. IEEE Trans Neural Netw. 1992;3:683–697.
    1. Nair V, Hinton GE. Rectified linear units improve restricted Boltzmann machines; Proceedings of the 27th International Conference on Machine Learning (ICML-10); 2010 Jun 21–24; Mon, Israel. Haifa: International Machine Learning Society; 2010.
    1. Abadi M, Barham P, Chen J, et al. TensorFlow: a system for large-scale machine learning; Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI' 16); 2016 Nov 2–4; Wen, USA. Savannah (GA): USENIX Association; 2016.
    1. Jayalakshmi T, Santhakumaran A. Statistical normalization and backpropagation for classification. Int J Comput Theory Eng. 2011;3:89–93.
    1. Shouval R, Hadanny A, Shlomo N, et al. Machine learning for prediction of 30-day mortality after ST elevation myocardial infraction: an acute coronary syndrome Israeli survey data mining study. Int J Cardiol. 2017;246:7–13.
    1. Calcagno V, de Mazancourt C. Glmulti: an R package for easy automated model selection with (generalized) linear models. J Stat Softw. 2010;34:1–29.
    1. Khalilia M, Chakraborty S, Popescu M. Predicting disease risks from highly imbalanced data using random forest. BMC Med Inform Decis Mak. 2011;11:51.
    1. Carpenter J, Bithell J. Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat Med. 2000;19:1141–1164.
    1. Son CS, Kim YN, Kim HS, Park HS, Kim MS. Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches. J Biomed Inform. 2012;45:999–1008.
    1. Masetic Z, Subasi A. Congestive heart failure detection using random forest classifier. Comput Methods Programs Biomed. 2016;130:54–64.
    1. Alonso-Betanzos A, Bolón-Canedo V, Heyndrickx GR, Kerkhof PL. Exploring guidelines for classification of major heart failure subtypes by using machine learning. Clin Med Insights Cardiol. 2015;9:57–71.
    1. Isler Y. Discrimination of systolic and diastolic dysfunctions using multi-layer perceptron in heart rate variability analysis. Comput Biol Med. 2016;76:113–119.
    1. Melillo P, De Luca N, Bracale M, Pecchia L. Classification tree for risk assessment in patients suffering from congestive heart failure via long-term heart rate variability. IEEE J Biomed Health Inform. 2013;17:727–733.
    1. Guidi G, Pettenati MC, Melillo P, Iadanza E. A machine learning system to improve heart failure patient assistance. IEEE J Biomed Health Inform. 2014;18:1750–1756.
    1. Fong RC, Vedaldi A. Interpretable explanations of black boxes by meaningful perturbation. Proc IEEE Int Conf Comput Vis. 2017:3449–3457.
    1. Wolpert DH. The supervised learning no-free-lunch theorems. In: Roy R, Köppen M, Ovaska S, Furuhashi T, Hoffmann F, editors. Soft Computing and Industry. London: Springer; 2002. pp. 25–42.

Source: PubMed

3
Subscribe