DNA methylation-based classification of central nervous system tumours
David Capper, David T W Jones, Martin Sill, Volker Hovestadt, Daniel Schrimpf, Dominik Sturm, Christian Koelsche, Felix Sahm, Lukas Chavez, David E Reuss, Annekathrin Kratz, Annika K Wefers, Kristin Huang, Kristian W Pajtler, Leonille Schweizer, Damian Stichel, Adriana Olar, Nils W Engel, Kerstin Lindenberg, Patrick N Harter, Anne K Braczynski, Karl H Plate, Hildegard Dohmen, Boyan K Garvalov, Roland Coras, Annett Hölsken, Ekkehard Hewer, Melanie Bewerunge-Hudler, Matthias Schick, Roger Fischer, Rudi Beschorner, Jens Schittenhelm, Ori Staszewski, Khalida Wani, Pascale Varlet, Melanie Pages, Petra Temming, Dietmar Lohmann, Florian Selt, Hendrik Witt, Till Milde, Olaf Witt, Eleonora Aronica, Felice Giangaspero, Elisabeth Rushing, Wolfram Scheurlen, Christoph Geisenberger, Fausto J Rodriguez, Albert Becker, Matthias Preusser, Christine Haberler, Rolf Bjerkvig, Jane Cryan, Michael Farrell, Martina Deckert, Jürgen Hench, Stephan Frank, Jonathan Serrano, Kasthuri Kannan, Aristotelis Tsirigos, Wolfgang Brück, Silvia Hofer, Stefanie Brehmer, Marcel Seiz-Rosenhagen, Daniel Hänggi, Volkmar Hans, Stephanie Rozsnoki, Jordan R Hansford, Patricia Kohlhof, Bjarne W Kristensen, Matt Lechner, Beatriz Lopes, Christian Mawrin, Ralf Ketter, Andreas Kulozik, Ziad Khatib, Frank Heppner, Arend Koch, Anne Jouvet, Catherine Keohane, Helmut Mühleisen, Wolf Mueller, Ute Pohl, Marco Prinz, Axel Benner, Marc Zapatka, Nicholas G Gottardo, Pablo Hernáiz Driever, Christof M Kramm, Hermann L Müller, Stefan Rutkowski, Katja von Hoff, Michael C Frühwald, Astrid Gnekow, Gudrun Fleischhack, Stephan Tippelt, Gabriele Calaminus, Camelia-Maria Monoranu, Arie Perry, Chris Jones, Thomas S Jacques, Bernhard Radlwimmer, Marco Gessi, Torsten Pietsch, Johannes Schramm, Gabriele Schackert, Manfred Westphal, Guido Reifenberger, Pieter Wesseling, Michael Weller, Vincent Peter Collins, Ingmar Blümcke, Martin Bendszus, Jürgen Debus, Annie Huang, Nada Jabado, Paul A Northcott, Werner Paulus, Amar Gajjar, Giles W Robinson, Michael D Taylor, Zane Jaunmuktane, Marina Ryzhova, Michael Platten, Andreas Unterberg, Wolfgang Wick, Matthias A Karajannis, Michel Mittelbronn, Till Acker, Christian Hartmann, Kenneth Aldape, Ulrich Schüller, Rolf Buslei, Peter Lichter, Marcel Kool, Christel Herold-Mende, David W Ellison, Martin Hasselblatt, Matija Snuderl, Sebastian Brandner, Andrey Korshunov, Andreas von Deimling, Stefan M Pfister, David Capper, David T W Jones, Martin Sill, Volker Hovestadt, Daniel Schrimpf, Dominik Sturm, Christian Koelsche, Felix Sahm, Lukas Chavez, David E Reuss, Annekathrin Kratz, Annika K Wefers, Kristin Huang, Kristian W Pajtler, Leonille Schweizer, Damian Stichel, Adriana Olar, Nils W Engel, Kerstin Lindenberg, Patrick N Harter, Anne K Braczynski, Karl H Plate, Hildegard Dohmen, Boyan K Garvalov, Roland Coras, Annett Hölsken, Ekkehard Hewer, Melanie Bewerunge-Hudler, Matthias Schick, Roger Fischer, Rudi Beschorner, Jens Schittenhelm, Ori Staszewski, Khalida Wani, Pascale Varlet, Melanie Pages, Petra Temming, Dietmar Lohmann, Florian Selt, Hendrik Witt, Till Milde, Olaf Witt, Eleonora Aronica, Felice Giangaspero, Elisabeth Rushing, Wolfram Scheurlen, Christoph Geisenberger, Fausto J Rodriguez, Albert Becker, Matthias Preusser, Christine Haberler, Rolf Bjerkvig, Jane Cryan, Michael Farrell, Martina Deckert, Jürgen Hench, Stephan Frank, Jonathan Serrano, Kasthuri Kannan, Aristotelis Tsirigos, Wolfgang Brück, Silvia Hofer, Stefanie Brehmer, Marcel Seiz-Rosenhagen, Daniel Hänggi, Volkmar Hans, Stephanie Rozsnoki, Jordan R Hansford, Patricia Kohlhof, Bjarne W Kristensen, Matt Lechner, Beatriz Lopes, Christian Mawrin, Ralf Ketter, Andreas Kulozik, Ziad Khatib, Frank Heppner, Arend Koch, Anne Jouvet, Catherine Keohane, Helmut Mühleisen, Wolf Mueller, Ute Pohl, Marco Prinz, Axel Benner, Marc Zapatka, Nicholas G Gottardo, Pablo Hernáiz Driever, Christof M Kramm, Hermann L Müller, Stefan Rutkowski, Katja von Hoff, Michael C Frühwald, Astrid Gnekow, Gudrun Fleischhack, Stephan Tippelt, Gabriele Calaminus, Camelia-Maria Monoranu, Arie Perry, Chris Jones, Thomas S Jacques, Bernhard Radlwimmer, Marco Gessi, Torsten Pietsch, Johannes Schramm, Gabriele Schackert, Manfred Westphal, Guido Reifenberger, Pieter Wesseling, Michael Weller, Vincent Peter Collins, Ingmar Blümcke, Martin Bendszus, Jürgen Debus, Annie Huang, Nada Jabado, Paul A Northcott, Werner Paulus, Amar Gajjar, Giles W Robinson, Michael D Taylor, Zane Jaunmuktane, Marina Ryzhova, Michael Platten, Andreas Unterberg, Wolfgang Wick, Matthias A Karajannis, Michel Mittelbronn, Till Acker, Christian Hartmann, Kenneth Aldape, Ulrich Schüller, Rolf Buslei, Peter Lichter, Marcel Kool, Christel Herold-Mende, David W Ellison, Martin Hasselblatt, Matija Snuderl, Sebastian Brandner, Andrey Korshunov, Andreas von Deimling, Stefan M Pfister
Abstract
Accurate pathological diagnosis is crucial for optimal management of patients with cancer. For the approximately 100 known tumour types of the central nervous system, standardization of the diagnostic process has been shown to be particularly challenging-with substantial inter-observer variability in the histopathological diagnosis of many tumour types. Here we present a comprehensive approach for the DNA methylation-based classification of central nervous system tumours across all entities and age groups, and demonstrate its application in a routine diagnostic setting. We show that the availability of this method may have a substantial impact on diagnostic precision compared to standard methods, resulting in a change of diagnosis in up to 12% of prospective cases. For broader accessibility, we have designed a free online classifier tool, the use of which does not require any additional onsite data processing. Our results provide a blueprint for the generation of machine-learning-based tumour classifiers across other cancer entities, with the potential to fundamentally transform tumour pathology.
Figures
References
- Louis DN, Ohgaki H, Wiestler OD & Cavenee WK WHO Classification of Tumours of the Central Nervous System (revised 4th edition). (IARC, 2016).
- van den Bent MJ Interobserver variation of the histopathological diagnosis in clinical trials on glioma: a clinician’s perspective. Acta Neuropathol . 120, 297–304, doi:10.1007/s00401-010-0725-7 (2010).
- Ellison DW et al. Histopathological grading of pediatric ependymoma: reproducibility and clinical relevance in European trial cohorts. J Negat Results Biomed 10, 7, doi:10.1186/1477-5751-10-7 (2011).
- Sturm D et al. New Brain Tumor Entities Emerge from Molecular Classification of CNS-PNETs. Cell 164, 1060–1072, doi:10.1016/j.cell.2016.01.015 (2016).
- Fernandez AF et al. A DNA methylation fingerprint of 1628 human samples. Genome Res . 22, 407–419, doi:10.1101/gr.119867.110 (2012).
- Hovestadt V et al. Decoding the regulatory landscape of medulloblastoma using DNA methylation sequencing. Nature 510, 537–541, doi:10.1038/nature13268 (2014).
- Moran S et al. Epigenetic profiling to classify cancer of unknown primary: a multicentre, retrospective analysis. Lancet Oncol . 17, 1386–1395, doi:10.1016/S1470-2045(16)30297-2 (2016).
- Hovestadt V et al. Robust molecular subgrouping and copy-number profiling of medulloblastoma from small amounts of archival tumour material using high-density DNA methylation arrays. Acta Neuropathol . 125, 913–916, doi:10.1007/s00401-013-1126-5 (2013).
- Sturm D et al. Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma. Cancer Cell 22, 425–437, doi:10.1016/j.ccr.2012.08.024 (2012).
- Reuss DE et al. Adult IDH wild type astrocytomas biologically and clinically resolve into other tumor entities. Acta Neuropathol . 130, 407–417, doi:10.1007/s00401-015-1454-8 (2015).
- Pajtler KW et al. Molecular Classification of Ependymal Tumors across All CNS Compartments, Histopathological Grades, and Age Groups. Cancer Cell 27, 728–743, doi:10.1016/j.ccell.2015.04.002 (2015).
- Lambert SR et al. Differential expression and methylation of brain developmental genes define location-specific subsets of pilocytic astrocytoma. Acta Neuropathol . 126, 291–301, doi:10.1007/s00401-013-1124-7 (2013).
- Thomas C et al. Methylation profiling of choroid plexus tumors reveals 3 clinically distinct subgroups. Neuro Oncol . 18, 790–796, doi:10.1093/neuonc/nov322 (2016).
- Mack SC et al. Epigenomic alterations define lethal CIMP-positive ependymomas of infancy. Nature 506, 445–450, doi:10.1038/nature13108 (2014).
- Johann PD et al. Atypical Teratoid/Rhabdoid Tumors Are Comprised of Three Epigenetic Subgroups with Distinct Enhancer Landscapes. Cancer Cell 29, 379–393, doi:10.1016/j.ccell.2016.02.001 (2016).
- Wiestler B et al. Integrated DNA methylation and copy-number profiling identify three clinically and biologically relevant groups of anaplastic glioma. Acta Neuropathol . 128, 561–571, doi:10.1007/s00401-014-1315-x (2014).
- van der Maaten L & Hinton G Visualizing data using t-SNE. The Journal of Machine Learning Research 9, 85 (2008).
- Ceccarelli M et al. Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma. Cell 164, 550–563, doi:10.1016/j.cell.2015.12.028 (2016).
- Breiman L Random forests. Machine learning 45, 5–32 (2001).
- Sokolova M & Lapalme G A systematic analysis of performance measures for classification tasks. Inf. Process. Manage . 45, 427–437, doi:10.1016/j.ipm.2009.03.002 (2009).
- Sahm F et al. Next-generation sequencing in routine brain tumor diagnostics enables an integrated diagnosis and identifies actionable targets. Acta Neuropathol . 131, 903–910, doi:10.1007/s00401-015-1519-8 (2016).
- Weller M et al. Molecular classification of diffuse cerebral WHO grade II/III gliomas using genome- and transcriptome-wide profiling improves stratification of prognostically distinct patient groups. Acta Neuropathol . 129, 679–693, doi:10.1007/s00401-015-1409-0 (2015).
- Cancer Genome Atlas Research, N. et al. Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. N. Engl. J. Med . 372, 2481–2498, doi:10.1056/NEJMoa1402121 (2015).
- conumee: Enhanced copy-number variation analysis using Illumina 450k methylation arrays. R package version 0.99.4, (2015).
- Bady P, Delorenzi M & Hegi ME Sensitivity Analysis of the MGMT-STP27 Model and Impact of Genetic and Epigenetic Context to Predict the MGMT Methylation Status in Gliomas and Other Tumors. J. Mol. Diagn . 18, 350–361, doi:10.1016/j.jmoldx.2015.11.009 (2016).
- Korshunov A et al. Histologically distinct neuroepithelial tumors with histone 3 G34 mutation are molecularly similar and comprise a single nosologic entity. Acta Neuropathol . 131, 137–146, doi:10.1007/s00401-015-1493-1 (2016).
- Korshunov A et al. Embryonal tumor with abundant neuropil and true rosettes (ETANTR), ependymoblastoma, and medulloepithelioma share molecular similarity and comprise a single clinicopathological entity. Acta Neuropathol . 128, 279–289, doi:10.1007/s00401-013-1228-0 (2014).
- Holsken A et al. Adamantinomatous and papillary craniopharyngiomas are characterized by distinct epigenomic as well as mutational and transcriptomic profiles. Acta Neuropathol Commun 4, 20, doi:10.1186/s40478-016-0287-6 (2016).
- Heim S et al. Papillary Tumor of the Pineal Region: A Distinct Molecular Entity. Brain Pathol . 26, 199–205, doi:10.1111/bpa.12282 (2016).
- Koelsche C et al. Melanotic tumors of the nervous system are characterized by distinct mutational, chromosomal and epigenomic profiles. Brain Pathol . 25, 202–208, doi:10.1111/bpa.12228 (2015).
- Jones DT et al. Recurrent somatic alterations of FGFR1 and NTRK2 in pilocytic astrocytoma. Nat. Genet . 45, 927–932, doi:10.1038/ng.2682 (2013).
- Jones DT et al. Dissecting the genomic complexity underlying medulloblastoma. Nature 488, 100–105, doi:10.1038/nature11284 (2012).
- Pietsch T et al. Prognostic significance of clinical, histopathological, and molecular characteristics of medulloblastomas in the prospective HIT2000 multicenter clinical trial cohort. Acta Neuropathol . 128, 137–149, doi:10.1007/s00401-014-1276-0 (2014).
- R: A language and environment for statistical computing. (R Foundation for Statistical Computing, Vienna, Austria, 2016).
- Huber W et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods 12, 115–121, doi:10.1038/nmeth.3252 (2015).
- Aryee MJ et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30, 1363–1369, doi:10.1093/bioinformatics/btu049 (2014).
- Leek JT & Storey JD Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS genetics 3, 1724–1735, doi:10.1371/journal.pgen.0030161 (2007).
- Leek JT & Storey JD A general framework for multiple testing dependence. Proc. Natl. Acad. Sci. U. S. A . 105, 18718–18723, doi:10.1073/pnas.0808709105 (2008).
- Breiman L Classification and regression trees. (Chapman & Hall/CRC, 1984).
- Liaw A & Wiener M Classification and Regression by randomForest. R News 2, 18–22 (2002).
- Chen C, Liaw A & Breiman L Using random forest to learn imbalanced data. University of California, Berkeley, 1–12 (2004).
- Kim KI & Simon R Overfitting, generalization, and MSE in class probability estimation with high-dimensional data. Biom J 56, 256–269, doi:10.1002/bimj.201300083 (2014).
- Boström H in Machine Learning and Applicati ons, 2008. ICMLA’08. Seventh International Conference on. 121–126 (IEEE).
- Smola AJ Advances in large margin classifiers. (MIT press, 2000).
- Friedman J, Hastie T & Tibshirani R Regularization paths for generalized linear models via coordinate descent. Journal of statistical software 33, 1 (2010).
- Appel IJ, Gronwald W & Spang R Estimating classification probabilities in high-dimensional diagnostic studies. Bioinformatics 27, 2563–2570 (2011).
- Hand DJ & Till RJ A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine learning 45, 171–186 (2001).
- Simon R Class probability estimation for medical studies. Biom J 56, 597–600, doi:10.1002/bimj.201300296 (2014).
- Brier GW Verification of forecasts expressed in terms of probability. Monthly Weather Review 78, 1–3, doi:10.1175/1520-0493(1950)078<0001:vofeit>;2 (1950).
- Carter SL et al. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol . 30, 41 3-421, doi:10.1038/nbt.2203 (2012).
Source: PubMed