rECHOmmend: An ECG-Based Machine Learning Approach for Identifying Patients at Increased Risk of Undiagnosed Structural Heart Disease Detectable by Echocardiography

Alvaro E Ulloa-Cerna, Linyuan Jing, John M Pfeifer, Sushravya Raghunath, Jeffrey A Ruhl, Daniel B Rocha, Joseph B Leader, Noah Zimmerman, Greg Lee, Steven R Steinhubl, Christopher W Good, Christopher M Haggerty, Brandon K Fornwalt, Ruijun Chen, Alvaro E Ulloa-Cerna, Linyuan Jing, John M Pfeifer, Sushravya Raghunath, Jeffrey A Ruhl, Daniel B Rocha, Joseph B Leader, Noah Zimmerman, Greg Lee, Steven R Steinhubl, Christopher W Good, Christopher M Haggerty, Brandon K Fornwalt, Ruijun Chen

Abstract

Background: Timely diagnosis of structural heart disease improves patient outcomes, yet many remain underdiagnosed. While population screening with echocardiography is impractical, ECG-based prediction models can help target high-risk patients. We developed a novel ECG-based machine learning approach to predict multiple structural heart conditions, hypothesizing that a composite model would yield higher prevalence and positive predictive values to facilitate meaningful recommendations for echocardiography.

Methods: Using 2 232 130 ECGs linked to electronic health records and echocardiography reports from 484 765 adults between 1984 to 2021, we trained machine learning models to predict the presence or absence of any of 7 echocardiography-confirmed diseases within 1 year. This composite label included the following: moderate or severe valvular disease (aortic/mitral stenosis or regurgitation, tricuspid regurgitation), reduced ejection fraction <50%, or interventricular septal thickness >15 mm. We tested various combinations of input features (demographics, laboratory values, structured ECG data, ECG traces) and evaluated model performance using 5-fold cross-validation, multisite validation trained on 1 site and tested on 10 independent sites, and simulated retrospective deployment trained on pre-2010 data and deployed in 2010.

Results: Our composite rECHOmmend model used age, sex, and ECG traces and had a 0.91 area under the receiver operating characteristic curve and a 42% positive predictive value at 90% sensitivity, with a composite label prevalence of 17.9%. Individual disease models had area under the receiver operating characteristic curves from 0.86 to 0.93 and lower positive predictive values from 1% to 31%. Area under the receiver operating characteristic curves for models using different input features ranged from 0.80 to 0.93, increasing with additional features. Multisite validation showed similar results to cross-validation, with an aggregate area under the receiver operating characteristic curve of 0.91 across our independent test set of 10 clinical sites after training on a separate site. Our simulated retrospective deployment showed that for ECGs acquired in patients without preexisting structural heart disease in the year 2010, 11% were classified as high risk and 41% (4.5% of total patients) developed true echocardiography-confirmed disease within 1 year.

Conclusions: An ECG-based machine learning model using a composite end point can identify a high-risk population for having undiagnosed, clinically significant structural heart disease while outperforming single-disease models and improving practical utility with higher positive predictive values. This approach can facilitate targeted screening with echocardiography to improve underdiagnosis of structural heart disease.

Keywords: cardiomyopathies; echocardiography; electrocardiography; heart valve diseases; machine learning; ventricular dysfunction.

Figures

Figure 1.
Figure 1.
Flow diagram from source data to the study datasets. We processed data from research repositories created using EHR data from Epic, ECG data from MUSE, and echocardiography data from Xcelera. The clinical MUSE database was processed to include 12-lead ECGs sampled at either 250 Hz or 500 Hz, acquired after 1984 from patients >18 years. ECG indicates electrocardiogram; EHR, electronic health record; and Echo, echocardiography.
Figure 2.
Figure 2.
rECHOmmend model diagram showing the classification pipeline for ECG traces and other electronic health record data. The output (gray triangle) of each CNN applied to ECG trace data are concatenated with labs, vitals, and demographics to form a feature vector. This vector is the input to the classification pipeline (min–max scaling, mean imputation, XGBoost classifier, and calibration), which outputs a composite prediction for the patient. AR indicates aortic regurgitation; AS, aortic stenosis; CNN, convolutional neural network; ECG, electrocardiogram; EF, ejection fraction; EHR, electronic health record; IVS, interventricular septum; MR, mitral regurgitation; MS, mitral stenosis; and TR, tricuspid regurgitation.
Figure 3.
Figure 3.
Performance of the rECHOmmend model in cross-validation experiments across various inputs. The plot on the left shows the AUROC while the plot on the right shows the AUPRC. AUPRC indicates area under the precision-recall curve; and AUROC, area under the receiver operating curve.
Figure 4.
Figure 4.
Results of retrospective deployment scenario from 2010. A, Results for all patients. B, Relative results per 100 at-risk patients. These results are based on a threshold yielding 50% sensitivity from the pre-2010 cross-validation experiment, resulting in 41.1% positive predictive value, 96.2% negative predictive value, 95.7% specificity, 44.1% sensitivity, and 6.4% prevalence in 2010. For 100 patients without known history of disease obtaining an ECG, the rECHOmmend model will identify 11 patients at high risk of disease, of which 5 are expected to have true disease within 1 year. The model will identify 89 patients not at high risk of disease, of which 86 are not expected to have true disease within 1 year. ECG indicates electrocardiogram; FN, false negative; FP, false positive; TN, true negative; and TP, true positive.

References

    1. Ross J, Jr, Braunwald E. Aortic stenosis. Circulation. 1968;38(1 Suppl):61–67. doi: 10.1161/01.cir.38.1s5.v-61
    1. Cheitlin MD, Gertz EW, Brundage BH, Carlson CJ, Quash JA, Bode RS, Jr. Rate of progression of severity of valvular aortic stenosis in the adult. Am Heart J. 1979;98:689–700. doi: 10.1016/0002-8703(79)90465-4
    1. Davies SW, Gershlick AH, Balcon R. Progression of valvar aortic stenosis: a long-term retrospective study. Eur Heart J. 1991;12:10–14. doi: 10.1093/oxfordjournals.eurheartj.a059815
    1. Curtis JP, Sokol SI, Wang Y, Rathore SS, Ko DT, Jadbabaie F, Portnay EL, Marshalko SJ, Radford MJ, Krumholz HM. The association of left ventricular ejection fraction, mortality, and cause of death in stable outpatients with heart failure. J Am Coll Cardiol. 2003;42:736–742. doi: 10.1016/s0735-1097(03)00789-7
    1. Martinez-Naharro A, Baksi AJ, Hawkins PN, Fontana M. Diagnostic imaging of cardiac amyloidosis. Nat Rev Cardiol. 2020;17:413–426. doi: 10.1038/s41569-020-0334-7
    1. Otto CM, Nishimura RA, Bonow RO, Carabello BA, Erwin JP, 3rd, Gentile F, Jneid H, Krieger EV, Mack M, McLeod C, et al. . 2020 ACC/AHA guideline for the management of patients with valvular heart disease: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation. 2021;143:e72–e227. doi: 10.1161/CIR.0000000000000923
    1. Cheitlin MD, Alpert JS, Armstrong WF, Aurigemma GP, Beller GA, Bierman FZ, Davidson TW, Davis JL, Douglas PS, Gillam LD. ACC/AHA guidelines for the clinical application of echocardiography. A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines (Committee on Clinical Application of Echocardiography). Developed in collaboration with the American Society of Echocardiography. Circulation. 1997;95:1686–1744. doi: 10.1161/01.cir.95.6.1686
    1. Lang RM, Badano LP, Mor-Avi V, Afilalo J, Armstrong A, Ernande L, Flachskampf FA, Foster E, Goldstein SA, Kuznetsova T, et al. . Recommendations for cardiac chamber quantification by echocardiography in adults: an update from the American Society of Echocardiography and the European Association of Cardiovascular Imaging. Eur Heart J Cardiovasc Imaging. 2015;16:233–270. doi: 10.1093/ehjci/jev014
    1. Nkomo VT, Gardin JM, Skelton TN, Gottdiener JS, Scott CG, Enriquez-Sarano M. Burden of valvular heart diseases: a population-based study. Lancet. 2006;368:1005–1011. doi: 10.1016/S0140-6736(06)69208-8
    1. Alexander KM, Orav J, Singh A, Jacob SA, Menon A, Padera RF, Kijewski MF, Liao R, Di Carli MF, Laubach JP, et al. . Geographic disparities in reported US amyloidosis mortality from 1979 to 2015: potential underdetection of cardiac amyloidosis. JAMA Cardiol. 2018;3:865–870. doi: 10.1001/jamacardio.2018.2093
    1. d’Arcy JL, Coffey S, Loudon MA, Kennedy A, Pearson-Stuttard J, Birks J, Frangou E, Farmer AJ, Mant D, Wilson J, et al. . Large-scale community echocardiographic screening reveals a major burden of undiagnosed valvular heart disease in older people: the OxVALVE Population Cohort Study. Eur Heart J. 2016;37:3515–3522. doi: 10.1093/eurheartj/ehw229
    1. Maron MS, Hellawell JL, Lucove JC, Farzaneh-Far R, Olivotto I. Occurrence of clinically diagnosed hypertrophic cardiomyopathy in the United States. Am J Cardiol. 2016;117:1651–1654. doi: 10.1016/j.amjcard.2016.02.044
    1. Park SJ, Enriquez-Sarano M, Chang SA, Choi JO, Lee SC, Park SW, Kim DK, Jeon ES, Oh JK. Hemodynamic patterns for symptomatic presentations of severe aortic stenosis. JACC Cardiovasc Imaging. 2013;6:137–146. doi: 10.1016/j.jcmg.2012.10.013
    1. Goliasch G, Kammerlander AA, Nitsche C, Dona C, Schachner L, Öztürk B, Binder C, Duca F, Aschauer S, Laufer G, et al. . Syncope: the underestimated threat in severe aortic stenosis. JACC Cardiovasc Imaging. 2019;12:225–232. doi: 10.1016/j.jcmg.2018.09.020
    1. Selzer A. Changing aspects of the natural history of valvular aortic stenosis. N Engl J Med. 1987;317:91–98. doi: 10.1056/NEJM198707093170206
    1. Raghunath S, Pfeifer JM, Ulloa-Cerna AE, Nemani A, Carbonati T, Jing L, vanMaanen DP, Hartzel DN, Ruhl JA, Lagerman BF, et al. . Deep neural networks can predict new-onset atrial fibrillation from the 12-lead ECG and help identify those at risk of atrial fibrillation-related stroke. Circulation. 2021;143:1287–1298. doi: 10.1161/CIRCULATIONAHA.120.047829
    1. Kwon JM, Lee SY, Jeon KH, Lee Y, Kim KH, Park J, Oh BH, Lee MM. Deep learning-based algorithm for detecting aortic stenosis using electrocardiography. J Am Heart Assoc. 2020;9:e014717. doi: 10.1161/JAHA.119.014717
    1. Attia ZI, Kapa S, Lopez-Jimenez F, McKie PM, Ladewig DJ, Satam G, Pellikka PA, Enriquez-Sarano M, Noseworthy PA, Munger TM, et al. . Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med. 2019;25:70–74. doi: 10.1038/s41591-018-0240-2
    1. Cohen-Shelly M, Attia ZI, Friedman PA, Ito S, Essayagh BA, Ko WY, Murphree DH, Michelena HI, Enriquez-Sarano M, Carter RE, et al. . Electrocardiogram screening for aortic valve stenosis using artificial intelligence. Eur Heart J. 2021;42:2885–2896. doi: 10.1093/eurheartj/ehab153
    1. Samad MD, Ulloa A, Wehner GJ, Jing L, Hartzel D, Good CW, Williams BA, Haggerty CM, Fornwalt BK. Predicting survival from large echocardiography and electronic health record datasets: optimization with machine learning. JACC Cardiovas Imaging. 2019;12:681–689.
    1. Ommen SR, Mital S, Burke MA, Day SM, Deswal A, Elliott P, Evanovich LL, Hung J, Joglar JA, Kantor P, et al. . 2020 AHA/ACC guideline for the diagnosis and treatment of patients with hypertrophic cardiomyopathy: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation. 2020;142:e558–e631. doi: 10.1161/CIR.0000000000000937
    1. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning. PMLR. 2015;37:448–456.
    1. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: Association for Computing Machinery; 2016:785–794. doi: 10.1145/2939672.2939785
    1. Platt J. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in Large Margin Classifiers. 1999;10:61–74.
    1. Bodison SA, Wesley YE, Tucker E, Green KJ. Results of screening a large group of intercollegiate competitive athletes for cardiovascular disease. J Am Coll Cardiol. 1987;10:1214–1221.
    1. Hada Y, Sakamoto T, Amano K, Yamaguchi T, Takenaka K, Takahashi H, Takikawa R, Hasegawa I, Takahashi T, Suzuki J. Prevalence of hypertrophic cardiomyopathy in a population of adult Japanese workers as detected by echocardiographic screening. Am J Cardiol. 1987;59:183–184. doi: 10.1016/s0002-9149(87)80107-8
    1. Baumgartner H, Hung J, Bermejo J, Chambers JB, Edvardsen T, Goldstein S, Lancellotti P, LeFevre M, Miller F, Jr, Otto CM. Recommendations on the echocardiographic assessment of aortic valve stenosis: a focused update from the European Association of Cardiovascular Imaging and the American Society of Echocardiography. J Am Soc Echocardiogr. 2017;30:372–392. doi: 10.1016/j.echo.2017.02.009
    1. Davies M, Hobbs F, Davis R, Kenkre J, Roalfe AK, Hare R, Wosornu D, Lancashire RJ. Prevalence of left-ventricular systolic dysfunction and heart failure in the Echocardiographic Heart of England Screening study: a population based study. Lancet. 2001;358:439–444. doi: 10.1016/s0140-6736(01)05620-3
    1. Banovic M, Putnik S, Penicka M, Doros G, Deja MA, Kockova R, Kotrc M, Glaveckaite S, Gasparovic H, Pavlovic N, et al. ; AVATAR Trial Investigators*. Aortic valve replacement versus conservative treatment in asymptomatic severe aortic stenosis: the AVATAR Trial. Circulation. 2022;145:648–658. doi: 10.1161/CIRCULATIONAHA.121.057639
    1. Kagiyama N, Piccirilli M, Yanamala N, Shrestha S, Farjo PD, Casaclang-Verzosa G, Tarhuni WM, Nezarat N, Budoff MJ, Narula J, et al. . Machine learning assessment of left ventricular diastolic function based on electrocardiographic features. J Am Coll Cardiol. 2020;76:930–941. doi: 10.1016/j.jacc.2020.06.061
    1. Wolters FJ. An AI-ECG algorithm for atrial fibrillation risk: steps towards clinical implementation. Lancet. 2020;396:235–236. doi: 10.1016/S0140-6736(20)31062-X
    1. Yao X, Rushlow DR, Inselman JW, McCoy RG, Thacher TD, Behnken EM, Bernard ME, Rosas SL, Akfaly A, Misra A, et al. . Artificial intelligence-enabled electrocardiograms for identification of patients with low ejection fraction: a pragmatic, randomized clinical trial. Nat Med. 2021;27:815–819. doi: 10.1038/s41591-021-01335-4
    1. Noseworthy PA, Attia ZI, Brewer LC, Hayes SN, Yao X, Kapa S, Friedman PA, Lopez-Jimenez F. Assessing and mitigating bias in medical artificial intelligence: the effects of race and ethnicity on a deep learning model for ecg analysis. Circ Arrhythm Electrophysiol. 2020;13:e007988. doi: 10.1161/CIRCEP.119.007988

Source: PubMed

3
Předplatit