Benchmarking Eliminative Radiomic Feature Selection for Head and Neck Lymph Node Classification

Zoltan R Bardosi, Daniel Dejaco, Matthias Santer, Marcel Kloppenburg, Stephanie Mangesius, Gerlig Widmann, Ute Ganswindt, Gerhard Rumpold, Herbert Riechelmann, Wolfgang Freysinger, Zoltan R Bardosi, Daniel Dejaco, Matthias Santer, Marcel Kloppenburg, Stephanie Mangesius, Gerlig Widmann, Ute Ganswindt, Gerhard Rumpold, Herbert Riechelmann, Wolfgang Freysinger

Abstract

In head and neck squamous cell carcinoma (HNSCC) pathologic cervical lymph nodes (LN) remain important negative predictors. Current criteria for LN-classification in contrast-enhanced computed-tomography scans (contrast-CT) are shape-based; contrast-CT imagery allows extraction of additional quantitative data ("features"). The data-driven technique to extract, process, and analyze features from contrast-CTs is termed "radiomics". Extracted features from contrast-CTs at various levels are typically redundant and correlated. Current sets of features for LN-classification are too complex for clinical application. Effective eliminative feature selection (EFS) is a crucial preprocessing step to reduce the complexity of sets identified. We aimed at exploring EFS-algorithms for their potential to identify sets of features, which were as small as feasible and yet retained as much accuracy as possible for LN-classification. In this retrospective cohort-study, which adhered to the STROBE guidelines, in total 252 LNs were classified as "non-pathologic" (n = 70), "pathologic" (n = 182) or "pathologic with extracapsular spread" (n = 52) by two experienced head-and-neck radiologists based on established criteria which served as a reference. The combination of sparse discriminant analysis and genetic optimization retained up to 90% of the classification accuracy with only 10% of the original numbers of features. From a clinical perspective, the selected features appeared plausible and potentially capable of correctly classifying LNs. Both the identified EFS-algorithm and the identified features need further exploration to assess their potential to prospectively classify LNs in HNSCC.

Keywords: computed-tomography; extracapsular spread; feature extraction; genetic algorithms; head and neck squamous carcinoma; lymph nodes; radiomics; recursive feature elimination; sparse discriminant analysis.

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Study flow and patient inclusion modified according to STARD criteria [19]. A total of 1110 patients were potentially eligible, of which 823 did not meet the inclusion criteria. Of 287 eligible patients a representative random sample of 28 patients (~10%) was drawn. The clinical characteristics of the 28 included patients are presented in Table 1.
Figure 2
Figure 2
Example of a LN classified as “pathologic” in a staging-CT of a 55-year-old, male HNSCC patient with a tumor of the oral cavity staged cT4a cN2b cM0. Original segmentation in the axial plane (A); sagittal (B) and coronal (C) reformatted views and three-dimensional rendering (D) of the LN are provided by the software. Dashed arrows show central necrosis; no soft tissue infiltration and irregular LN capsule was observed.
Figure 3
Figure 3
Example of a LN classified as “pathologic with ECS” in a Staging-CT of a 53-year-old, male HNSCC-patient of the oral cavity staged cT2 cN3b cM0. Manual segmentation in the axial plane (A); sagittal (B), coronal (C) reformatted views and three-dimensional rendering (D) of the LN are provided by the software. Solid arrows show soft tissue infiltration and an irregular LN capsule. Dashed arrows show central necrosis.
Figure 4
Figure 4
Example of a LN classified as “non-pathologic” in a staging-CT of a 65-year-old, female HNSCC-patient with a tumor of the oral cavity staged cT4a cN2c cM0. Manually segmented LN in the axial plane (A); sagittal (B), coronal (C) reformatted views and three-dimensional rendering (D) of the LN are provided by the software. Neither central necrosis nor soft tissue infiltration nor irregular LN capsule were observed for this LN.
Figure 5
Figure 5
Balanced accuracy distributions of EFS-algorithms trained and evaluated on the LN-label “pathologic”. For reference, the BACC of the LDA classifier without feature count reduction is shown as a dash-dotted green horizontal line (at a value of 0.87). The combination of RFE and GA (black box) appeared to be the potentially most useful EFS-algorithms, retaining a diagnostic accuracy of >80% with only 10% of the features.
Figure 6
Figure 6
Balanced accuracy distributions of EFS-algorithms trained and evaluated on the LN-label “pathologic with ECS”. For reference, the BACC of the LDA classifier without feature count reduction is shown as a dash-dotted green horizontal line (at a value of 0.96). The combination of SDA and GA (black box) appeared to be the potentially most useful EFS-algorithms, retaining a diagnostic accuracy of >90% with only 10% of the features.
Figure 7
Figure 7
Balanced accuracy distributions of EFS-algorithms trained and evaluated on the LN-label “non-pathologic”. For reference, the BACC of the LDA classifier without feature count reduction (dash-dotted green horizontal line at 0.90). The combination of SDA and GA (black box) appeared to be the potentially most useful EFS-algorithms, retaining a diagnostic accuracy of >83% with only 10% of the features.

References

    1. Gor D.M., Langer J.E., Loevner L.A. Imaging of Cervical Lymph Nodes in Head and Neck Cancer: The Basics. Radiol. Clin. 2006;44:101–110. doi: 10.1016/j.rcl.2005.08.006.
    1. Mermod M., Tolstonog G., Simon C., Monnier Y. Extracapsular spread in head and neck squamous cell carcinoma: A systematic review and meta-analysis. Oral Oncol. 2016;62:60–71. doi: 10.1016/j.oraloncology.2016.10.003.
    1. Faisal M., Dhanani R., Ullah S., Bakar M.A., Irfan N., Malik K.I., Loya A., Boban E.M., Hussain R., Jamshed A. Prognostic outcomes of treatment naïve oral tongue squamous cell carcinoa (OTSCC): A comprehensive analysis of 14 years. Eur. Arch. Otorhinolaryngol. 2021;278:3045–3053. doi: 10.1007/s00405-020-06482-x.
    1. Meccariello G., Maniaci A., Bianchi G., Cammaroto G., Iannella G., Catalano A., Sgarzani R., De Vito A., Capaccio P., Pelucchi S., et al. Neck dissection and trans oral robotic surgery for oropharyngeal squamous cell carcinoma. Auris Nasus Larynx. 2021 doi: 10.1016/j.anl.2021.05.007.
    1. Van den Brekel M.W.M., Stel H.V., Castelijns J.A., Nauta J.J., Van der Waal I., Valk J., Meyer C.J., Snow G.B. Cervical lymph node metastasis: Assessment of radiologic criteria. Radiology. 1990;177:379–384. doi: 10.1148/radiology.177.2.2217772.
    1. Eisenhauer E.A., Therasse P., Bogaerts J., Schwartz L.H., Sargent D., Ford R., Dancey J., Arbuck S., Gwyther S., Mooney M., et al. New response evaluation criteria in solid tumours: Revised RECIST guideline (version 1.1) Eur. J. Cancer. 2009;45:228–247. doi: 10.1016/j.ejca.2008.10.026.
    1. Url C., Schartinger V., Riechelmann H., Glückert R., Maier H., Trumpp M., Widmann G. Radiological detection of extracapsular spread in head and neck squamous cell carcinoma (HNSCC) cervical metastases. Eur. J. Radiol. 2013;82:1783–1787. doi: 10.1016/j.ejrad.2013.04.024.
    1. Gillies R.J., Kinahan P.E., Hricak H. Radiomics: Images are more than pictures, they are data. Radiology. 2016;278:563–577. doi: 10.1148/radiol.2015151169.
    1. Rizzo S., Botta F., Raimondi S., Origgi D., Fanciullo C., Morganti A.G., Bellomi M. Radiomics: The facts and the challenges of image analysis. Eur. Radiol. Exp. 2018;2:36. doi: 10.1186/s41747-018-0068-z.
    1. Remeseiro B., Bolon-Canedo V. A review of feature selection methods in medical applications. Comput. Biol. Med. 2019;112:103375. doi: 10.1016/j.compbiomed.2019.103375.
    1. Venkatesh B., Anuradha J. A Review of Feature Selection and Its Methods. Cybern. Inf. Technol. 2019;19:3–26. doi: 10.2478/cait-2019-0001.
    1. Howard F.M., Kochanny S., Koshy M., Spiotto M., Pearson A.T. Machine Learning–Guided Adjuvant Treatment of Head and Neck Cancer. JAMA Netw. Open. 2020;3:e2025881. doi: 10.1001/jamanetworkopen.2020.25881.
    1. LeCun Y., Haffner P., Bottou L., Bengio Y. Shape, Contour and Grouping in Computer Vision. Springer; Berlin/Heidelberg, Germany: 1999. Object recognition with gradient-based learning; pp. 319–345.
    1. Cireşan D., Meier U., Masci J., Schmidhuber J. Multi-column deep neural network for traffic sign classification. Neural Netw. 2012;32:333–338. doi: 10.1016/j.neunet.2012.02.023.
    1. Gichoya J.W., Nuthakki S., Maity P.G., Purkayastha S. Phronesis of ai in radiology: Superhuman meets natural stupidity. arXiv. 20181803.11244
    1. Gilpin L.H., Bau D., Yuan B.Z., Bajwa A., Specter M., Kagal L. Explaining Explanations: An Overview of Interpretability of Machine Learning; Proceedings of the 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA); Turin, Italy. 1–3 October 2018; pp. 80–89.
    1. Mayerhoefer M.E., Materka A., Langs G., Häggström I., Szczypiński P., Gibbs P., Cook G. Introduction to Radiomics. J. Nucl. Med. 2020;61:488–495. doi: 10.2967/jnumed.118.222893.
    1. Von Elm E., Altman D.G., Egger M., Pocock S.J., Gøtzsche P.C., Vandenbroucke J.P. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: Guidelines for reporting observational studies. PLoS Med. 2007;4:e296. doi: 10.1371/journal.pmed.0040296.
    1. Bossuyt P.M., Reitsma J.B., Bruns D.E., Gatsonis C.A., Glasziou P., Irwig L., Lijmer J.G., Moher D., Rennie D., De Vet H.C., et al. STARD 2015: An updated list of essential items for reporting diagnostic accuracy studies. BMJ. 2015;351:h5527. doi: 10.1136/bmj.h5527.
    1. Noor A., Stepan L., Kao S.S.-T., Dharmawardana N., Ooi E., Hodge J.-C., Krishnan S., Foreman A. Reviewing indications for panendoscopy in the investigation of head and neck squamous cell carcinoma. J. Laryngol. Otol. 2018;132:901–905. doi: 10.1017/S0022215118001718.
    1. National Comprehensive Cancer Network Guidelines for Head and Neck Cancers, Version 1.2022. [(accessed on 8 January 2022)]. Available online: .
    1. Shav K.S.V., Ethunanda M. Tumour seeding after fine-needle aspiration and core biopsy of the head and neck—A systematic review. Br. J. Oral Maxillofac. Surg. 2016;54:260–265.
    1. Aja-Fernandez S., García R.L., Tao D., Li X., editors. Tensors in Image Processing and Computer Vision. Springer Science + Business Media; London, UK: 2009.
    1. van Griethuysen J.J.M., Fedorov A., Parmar C., Hosny A., Aucoin N., Narayan V., Beets-Tan R.G.H., Fillion-Robin J.C., Pieper S., Aerts H.J.W. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77:104–107. doi: 10.1158/0008-5472.CAN-17-0339.
    1. PyRadiomics. [(accessed on 19 November 2021)]. Available online: .
    1. Guyon I., Weston J., Barnhill S., Vapnik V. Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 2002;46:389–422. doi: 10.1023/A:1012487302797.
    1. Weich M.L., McIntosh C., Haibe-Kains B., Milosevic M.F., Wee L., Dekker A., Huang S.H., Purdie T.G., O’Sullivan B., Aerts H.J.W., et al. Vulnerabilities of radiomic signature development: The need for safeguards. Radiother. Oncol. 2019;130:2–9.
    1. Costello A.B., Osborn W.J. Best practices in exploratory factor analysis: Four recommendations for getting the most from your analysis. Pract. Assess. Res. Eval. 2005;10:7.
    1. Noes T., Mevik B.H. Understanding the collinearity problem in regression and discriminant analysis. J. Chemom. 2001;15:413–426. doi: 10.1002/cem.676.
    1. Sima C., Dougherty E.R. The peaking phenomenon in the presence of feature-selection. Pattern Recognit. Lett. 2008;29:1667–1674. doi: 10.1016/j.patrec.2008.04.010.
    1. Fisher R.A. The use of multiple measurements in taxonomic problems. Ann. Eugen. 1936;7:179–188. doi: 10.1111/j.1469-1809.1936.tb02137.x.
    1. Mylavarapu S., Kaban A. Random projections versus random selection of features for classification of high dimensional data; Proceedings of the 2013 13th UK Workshop on Computational Intelligence (UKCI); Guildford, UK. 9–11 September 2013; pp. 305–312.
    1. Clemmensen L., Hastie T., Witten D., Ersbøll B. Sparse Discriminant Analysis. Technometrics. 2011;53:406–413. doi: 10.1198/TECH.2011.08118.
    1. Fonseca C.M., Fleming P.J. Genetic algorithms for multiobjective optimization: Formulation, discussion and generalization; Proceedings of the 5th International Conference on Genetic Algorithms; Champaign, IL, USA. 1 June 1993; pp. 416–423.
    1. Müllner D. Modern hierarchical, agglomerative clustering algorithms. arXiv. 20111109.2378v1
    1. Hussein F., Kharma N.N., Ward R.K. Genetic algorithms for feature selection and weighting, a review and study; Proceedings of the 6th International Conference on Document Analysis and Recognition; Seattle, WA, USA. 13 September 2001.
    1. Wutzl B., Leibnitz K., Rattay F., Kronbichler M., Murata M., Golaszweski S.M. Genetic algorithms for feature selection when classifying severe chronic disorders of conciousness. PLoS ONE. 2019;14:e0219683. doi: 10.1371/journal.pone.0219683.
    1. Bommert A., Sun X., Bischl B., Rahnenführer J., Lang M. Benchmark for filter methods for feature selection in high-dimensional classification data. Comput. Stat. Data Anal. 2019;143:106839. doi: 10.1016/j.csda.2019.106839.
    1. Feinstein A.J., Alonso J., Yang S.-E., John M.S. Diagnostic Accuracy of Fine-Needle Aspiration for Parotid and Submandibular Gland Lesions. Otolaryngol. Neck Surg. 2016;155:431–436. doi: 10.1177/0194599816643041.
    1. Dejaco D., Uprimny C., Widmann G., Riedl D., Moser P., Arnold C., Steinbichler T.B., Kofler B., Schartinger V.H., Virgolini I., et al. Response evaluation of cervical lymph nodes after chemoradiation in patients with head and neck cancer—Does additional [18F]FDG-PET-CT help? Cancer Imaging. 2020;20:69. doi: 10.1186/s40644-020-00345-8.
    1. Ho T.-Y., Chao C.-H., Chin S.-C., Ng S.-H., Kang C.-J., Tsang N.-M. Classifying Neck Lymph Nodes of Head and Neck Squamous Cell Carcinoma in MRI Images with Radiomic Features. J. Digit. Imaging. 2020;33:613–618. doi: 10.1007/s10278-019-00309-w.

Source: PubMed

3
Subskrybuj