Chest radiographs and machine learning - Past, present and future

Catherine M Jones, Quinlan D Buchlak, Luke Oakden-Rayner, Michael Milne, Jarrel Seah, Nazanin Esmaili, Ben Hachey, Catherine M Jones, Quinlan D Buchlak, Luke Oakden-Rayner, Michael Milne, Jarrel Seah, Nazanin Esmaili, Ben Hachey

Abstract

Despite its simple acquisition technique, the chest X-ray remains the most common first-line imaging tool for chest assessment globally. Recent evidence for image analysis using modern machine learning points to possible improvements in both the efficiency and the accuracy of chest X-ray interpretation. While promising, these machine learning algorithms have not provided comprehensive assessment of findings in an image and do not account for clinical history or other relevant clinical information. However, the rapid evolution in technology and evidence base for its use suggests that the next generation of comprehensive, well-tested machine learning algorithms will be a revolution akin to early advances in X-ray technology. Current use cases, strengths, limitations and applications of chest X-ray machine learning systems are discussed.

Keywords: chest X-ray; clinical decision support; deep learning; machine learning; radiomics.

© 2021 Annalise-AI. Journal of Medical Imaging and Radiation Oncology published by John Wiley & Sons Australia, Ltd on behalf of Royal Australian and New Zealand College of Radiologists.

Figures

Fig. 1
Fig. 1
(a) Heatmap investigating an exemplar algorithm in classifying pneumothorax, demonstrating its focus on the right apical pneumothorax rather than the right‐sided intercostal drain. (b) Original image demonstrating right apical pneumothorax.

References

    1. Mould RF. The early history of X‐ray diagnosis with emphasis on the contributions of physics 1895–1915. Phys Med Biol 1995; 40: 1741.
    1. Schaefer‐Prokop C, Neitzel U, Venema HW, Uffmann M, Prokop M. Digital chest radiography: an update on modern technology, dose containment and control of image quality. Eur Radiol 2008; 18: 1818–30.
    1. United Nations Scientific Committee on the Effects of Atomic Radiation. Sources and effects of ionizing radiation, 2008.
    1. Chotas HG, Ravin CE. Chest radiography: estimated lung volume and projected area obscured by the heart, mediastinum, and diaphragm. Radiology 1994; 193: 403–4.
    1. Berlin L. Accuracy of diagnostic procedures: has it improved over the past five decades? Am J Roentgenol 2007; 188: 1173–8.
    1. Garland LH. Studies on accuracy of diagnostic procedures. AJR 1959; 82: 25–38.
    1. White CS, Salis AI, Meyer CA. Missed lung cancer on chest radiography and computed tomography: imaging and medicolegal issues. J Thorac Imaging 1999; 14: 63–8.
    1. Itri JN, Tappouni RR, McEachern RO, Pesch AJ, Patel SH. Fundamentals of diagnostic error in imaging. Radiographics 2018; 38: 1845–65.
    1. Busby LP, Courtier JL, Glastonbury CM. Bias in radiology: the how and why of misses and misinterpretations. Radiographics 2018; 38: 236–47.
    1. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015; 521: 436.
    1. Szegedy C, Liu W, Jia Yet al. Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 2015; 1–9.
    1. Beam AL, Kohane IS. Big data and machine learning in health care. JAMA 2018; 319: 1317–8.
    1. Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genom 2020; 21: 6.
    1. Polanski J, Jankowska‐Polanska B, Rosinczuk J, Chabowski M, Szymanska‐Chabowska A. Quality of life of patients with lung cancer. Onco Targets Ther 2016; 9: 1023.
    1. del Ciello A , Franchi P, Contegiacomo A, Cicchetti G, Bonomo L, Larici AR. Missed lung cancer: when, where, and why? Diagnostic Interv Radiol 2017; 23: 118.
    1. Jang S, Song H, Shin YJet al. Deep learning–based automatic detection algorithm for reducing overlooked lung cancers on chest radiographs. Radiology 2020; 296: 652–61.
    1. Hurt B, Yen A, Kligerman S, Hsiao A. Augmenting interpretation of chest radiographs with deep learning probability maps. J Thorac Imaging 2020; 35: 285–93.
    1. Hwang EJ, Hong JH, Lee KHet al. Deep learning algorithm for surveillance of pneumothorax after lung biopsy: a multicenter diagnostic cohort study. Eur Radiol 2020; 30: 3660–71.
    1. Wang X, Yu J, Zhu Qet al. Potential of deep learning in assessing pneumoconiosis depicted on digital chest radiography. Occup Environ Med 2020; 77: 597–602.
    1. Zhou S, Zhang X, Zhang R. Identifying cardiomegaly in chest X‐ray8 using transfer learning. Stud Health Technol Inform 2019; 264: 482–6.
    1. Zou X‐L, Ren Y, Feng D‐Yet al. A promising approach for screening pulmonary hypertension based on frontal chest radiographs using deep learning: a retrospective study. PLoS One 2020; 15: e0236378.
    1. Pasa F, Golkov V, Pfeiffer F, Cremers D, Pfeiffer D. Efficient deep network architectures for fast chest X‐ray tuberculosis screening and visualization. Sci Rep 2019; 9: 1–9.
    1. Rajpurkar P, Irvin J, Ball RLet al. Deep learning for chest radiograph diagnosis: a retrospective comparison of the CheXNeXt algorithm to practicing radiologists. PLoS Med 2018; 15: e1002686.
    1. Wu JT, Wong KCL, Gur Yet al. Comparison of chest radiograph interpretations by artificial intelligence algorithm vs radiology residents. JAMA Netw Open 2020; 3: e2022779.
    1. Seah J, Tang C, Buchlak QDet al. Radiologist chest X‐ray diagnostic accuracy performance when augmented by a comprehensive deep learning model. Lancet Digit Health 2021.
    1. Lakhani P, Sundaram B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology 2017; 284: 574–82.
    1. Sim Y, Chung MJ, Kotter Eet al. Deep convolutional neural network–based software improves radiologist detection of malignant lung nodules on chest radiographs. Radiology 2020; 294: 199–209.
    1. Kim JH, Kim JY, Kim GHet al. Clinical validation of a deep learning algorithm for detection of pneumonia on chest radiographs in emergency department patients with acute febrile respiratory illness. J Clin Med 2020; 9: 1981.
    1. Hwang EJ, Nam JG, Lim WHet al. Deep learning for chest radiograph diagnosis in the emergency department. Radiology 2019; 293: 573–80.
    1. Souza JC, Diniz JOB, Ferreira JL, da Silva GLF , Silva AC, de Paiva AC . An automatic method for lung segmentation and reconstruction in chest X‐ray using deep neural networks. Comput Methods Programs Biomed 2019; 177: 285–96.
    1. Szucs‐Farkas Z, Schick A, Cullmann JLet al. Comparison of dual‐energy subtraction and electronic bone suppression combined with computer‐aided detection on chest radiographs: effect on human observers’ performance in nodule detection. Am J Roentgenol 2013; 200: 1006–13.
    1. Freedman MT, Lo S‐CB, Seibel JC, Bromley CM. Lung nodules: improved detection with software that suppresses the rib and clavicle on chest radiographs. Radiology 2011; 260: 265–73.
    1. Gordienko Y, Gang P, Hui Jet al. Deep learning with lung segmentation and bone shadow exclusion techniques for chest X‐ray analysis of lung cancer. In: International Conference on Computer Science, Engineering and Education Applications. Springer, 2018; 638–47.
    1. Khan FA, Majidulla A, Tavaziva Get al. Chest x‐ray analysis with deep learning‐based software as a triage test for pulmonary tuberculosis: a prospective study of diagnostic accuracy for culture‐confirmed disease. Lancet Digit Health 2020; 2: e573–81.
    1. Baltruschat I, Steinmeister L, Nickisch Het al. Smart chest X‐ray worklist prioritization using artificial intelligence: a clinical workflow simulation. Eur Radiol 2021; 31: 3837–45.
    1. Zech JR, Badgeley MA, Liu M, Costa AB, Titano JJ, Oermann EK. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross‐sectional study. PLoS Med 2018; 15: e1002683.
    1. Buchlak QD, Yanamadala V, Leveque J‐C, Edwards A, Nold K, Sethi R. The Seattle spine score: predicting 30‐day complication risk in adult spinal deformity surgery. J Clin Neurosci 2017; 43: 247–55.
    1. Futoma J, Simons M, Panch T, Doshi‐Velez F, Celi LA. The myth of generalisability in clinical research and machine learning in health care. Lancet Digit Health 2020; 2: e489–92.
    1. Holzinger A, Langs G, Denk H, Zatloukal K, Müller H. Causability and explainability of artificial intelligence in medicine. Wiley Interdiscip Rev Data Min Knowl Discov 2019; 9: e1312.
    1. Buchlak QD, Esmaili N, Leveque J‐C, Bennett C, Piccardi M, Farrokhi F. Ethical thinking machines in surgery and the requirement for clinical leadership. Am J Surg 2020; 220: 1372–4.
    1. Zhang Q, Zhu S‐C. Visual interpretability for deep learning: a survey. Front Inf Technol Electron Eng 2018; 19: 27–39.
    1. Seah JCY, Tang JSN, Kitchen A, Gaillard F, Dixon AF. Chest radiographs in congestive heart failure: visualizing neural network learning. Radiology 2019; 290: 514–22.
    1. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell 2019; 1: 206–15.
    1. Oakden‐Rayner L, Dunnmon J, Carneiro G, Ré C. Hidden stratification causes clinically meaningful failures in machine learning for medical imaging. In: Proceedings of the ACM Conference on Health, Inference, and Learning, 2020; 151–9.
    1. Rueckel J, Trappmann L, Schachtner Bet al. Impact of confounding thoracic tubes and pleural dehiscence extent on artificial intelligence pneumothorax detection in chest radiographs. Invest Radiol 2020; 55: 792–8.
    1. Oakden‐Rayner L. Exploring large‐scale public medical image datasets. Acad Radiol 2020; 27: 106–12.
    1. Nobel JM, Kok EM, Robben SGF. Redefining the structure of structured reporting in radiology. Insights Imaging 2020; 11: 10.
    1. Eisenhauer EA, Therasse P, Bogaerts Jet al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer 2009; 45: 228–47.
    1. European Society of Radiology . ESR paper on structured reporting in radiology. Insights Imaging 2018; 9: 1–7.
    1. The Royal Australian and New Zealand College of Radiologists . Standards of Practice for Artificial Intelligence. Sydney, Australia, 2020.
    1. Rivera SC, Liu X, Chan A‐W, Denniston AK, Calvert MJ. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT‐AI extension. BMJ 2020; 370: m3210.
    1. United States Government Accountability Office. AI in Healthcare, 2020.
    1. Obermeyer Z, Emanuel EJ. Predicting the future—big data, machine learning, and clinical medicine. N Engl J Med 2016; 375: 1216.

Source: PubMed

3
Subscribe