The topography of mutational processes in breast cancer genomes

Sandro Morganella, Ludmil B Alexandrov, Dominik Glodzik, Xueqing Zou, Helen Davies, Johan Staaf, Anieta M Sieuwerts, Arie B Brinkman, Sancha Martin, Manasa Ramakrishna, Adam Butler, Hyung-Yong Kim, Åke Borg, Christos Sotiriou, P Andrew Futreal, Peter J Campbell, Paul N Span, Steven Van Laere, Sunil R Lakhani, Jorunn E Eyfjord, Alastair M Thompson, Hendrik G Stunnenberg, Marc J van de Vijver, John W M Martens, Anne-Lise Børresen-Dale, Andrea L Richardson, Gu Kong, Gilles Thomas, Julian Sale, Cristina Rada, Michael R Stratton, Ewan Birney, Serena Nik-Zainal, Sandro Morganella, Ludmil B Alexandrov, Dominik Glodzik, Xueqing Zou, Helen Davies, Johan Staaf, Anieta M Sieuwerts, Arie B Brinkman, Sancha Martin, Manasa Ramakrishna, Adam Butler, Hyung-Yong Kim, Åke Borg, Christos Sotiriou, P Andrew Futreal, Peter J Campbell, Paul N Span, Steven Van Laere, Sunil R Lakhani, Jorunn E Eyfjord, Alastair M Thompson, Hendrik G Stunnenberg, Marc J van de Vijver, John W M Martens, Anne-Lise Børresen-Dale, Andrea L Richardson, Gu Kong, Gilles Thomas, Julian Sale, Cristina Rada, Michael R Stratton, Ewan Birney, Serena Nik-Zainal

Abstract

Somatic mutations in human cancers show unevenness in genomic distribution that correlate with aspects of genome structure and function. These mutations are, however, generated by multiple mutational processes operating through the cellular lineage between the fertilized egg and the cancer cell, each composed of specific DNA damage and repair components and leaving its own characteristic mutational signature on the genome. Using somatic mutation catalogues from 560 breast cancer whole-genome sequences, here we show that each of 12 base substitution, 2 insertion/deletion (indel) and 6 rearrangement mutational signatures present in breast tissue, exhibit distinct relationships with genomic features relating to transcription, DNA replication and chromatin organization. This signature-based approach permits visualization of the genomic distribution of mutational processes associated with APOBEC enzymes, mismatch repair deficiency and homologous recombinational repair deficiency, as well as mutational processes of unknown aetiology. Furthermore, it highlights mechanistic insights including a putative replication-dependent mechanism of APOBEC-related mutagenesis.

Figures

Figure 1. Distribution of all mutations across…
Figure 1. Distribution of all mutations across the cell cycle.
Replication domains were identified by using conservatively defined transition zones in DNA replication time data. Data were separated into deciles, with each segment containing exactly 10% of the observed replication time signal. Normalized mutation density per decile is presented for early (left) to late (right) replication domains. (a) Aggregated distribution of mutations (green), rearrangements (purple) and indels (orange) across the cell cycle. (b) Distribution of the 12 base substitution signatures across the cell cycle. Dashed grey lines represent the predicted distribution of mutations for each signature based on simulations that take into account mutation burden and sequence characteristics of individual mutations and of the signatures that were estimated to be present in each patient (Methods section). (c) Distribution of the six rearrangement signatures across the cell cycle. Dashed grey lines represent the predicted distribution of mutations for each signature based on simulations. (d) Distribution of the indel signatures across the cell cycle.
Figure 2. Replication and transcriptional strand bias…
Figure 2. Replication and transcriptional strand bias and strand-coordinated mutagenesis of mutational signatures.
Forest plots showing replication (blue) and transcription (orange) strand bias for the 6 base substitution classes (a) and for the 12 base substitution signatures (b). Mutations were oriented in the pyrimidine context (the current convention for characterizing mutational signatures). Observed distribution between strands is shown as a diamond for replication and circle for transcriptional strands with 95% confidence intervals, against an expected probability of 0.5 (Supplementary Table 1 for values). (c) Relationship between processive group lengths (columns) and mutational signatures (rows). Processive groups were defined as sets of adjacent substitutions of the same mutational signature sharing the same reference allele, and the group length indicates the number of adjacent substitutions within each group. The size of each circle represents the number of groups (log10) observed for the specified group length (column) for each signature (row). The intensity of colour of each circle indicates significance of the likelihood of detection of a processive group of a defined length (−log10 of the P value obtained by comparing observed data to simulations, further details in Methods section).
Figure 3. Relationship between mutational signatures and…
Figure 3. Relationship between mutational signatures and nucleosome occupancy.
The distribution of the signal of nucleosome density (y axis) is shown in a 2 kb window centred on each mutation (position 0 on the x axis), for each signature. The averaged signal was calculated as the total amount of signal observed at each point divided by total number of mutations contributing to that signal. (a) Nucleosome density for aggregated substitutions (green), and for deletions observed in MMR-proficient (blue) and MMR-deficient (orange) samples. (b) Nucleosome density for the twelve base substitution signatures (note the degree of variation between substitution signatures relative to aggregated substitutions in a). The grey line shows the distribution predicted by simulations if mutations from each signature were randomly distributed. The analysis reveals that most of the observed distributions showed similar trends to those expected from simulations, apart from signatures 17, 18 and 26 and to a lesser extent signatures 5 and 8.
Figure 4. A replication-related model of mutagenesis…
Figure 4. A replication-related model of mutagenesis for putative APOBEC-related signatures 2 and 13.
1. During replication, transient moments of increased availability of single-stranded DNA (ssDNA) (for example, uncoupling between leading and lagging replicative strands or delays in elongation of the nascent lagging strand by Okazaki fragments) could occur, exposing ssDNA for APOBEC deamination, potentially for long genomic tracts. 2. Uracil-N-glycosylase (UNG) acts to remove undesirable uracils leaving a trail of abasic sites in its wake. Divergence of mutational processes occurs from this point. 3A Earlier in replication, error-prone translesion polymerases such as REV1 have been postulated to insert cytosines opposite abasic sites to avoid detrimental replication fork stalling or collapse. 4A The final outcome is stretches of successive C>G transversions at a TpC sequence context characteristic of signature 13. 3B Alternatively, uracils and abasic sites that are not fixed via REV1, undergo contingency processing, for example, the ‘A' rule. 4B The final outcome is of C>T mutations at a TpC sequence context.

References

    1. De S. & Michor F. DNA replication timing and long-range DNA interactions predict mutational landscapes of cancer genomes. Nat. Biotechnol. 29, 1103–1108 (2011).
    1. Drier Y. et al.. Somatic rearrangements across cancer reveal classes of samples with distinct patterns of DNA breakage and rearrangement-induced hypermutability. Genome Res. 23, 228–235 (2013).
    1. Koren A. et al.. Differential relationship of DNA replication timing to different forms of human mutation and variation. Am. J. Hum. Genet. 91, 1033–1040 (2012).
    1. Kundaje A. et al.. Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements. Genome Res. 22, 1735–1747 (2012).
    1. Liu L., De S. & Michor F. DNA replication timing and higher-order nuclear organization determine single-nucleotide substitution patterns in cancer genomes. Nat. Commun. 4, 1502 (2013).
    1. Supek F. & Lehner B. Differential DNA mismatch repair underlies mutation rate variation across the human genome. Nature 521, 81–84 (2015).
    1. Woo Y. H. & Li W. H. DNA replication timing and selection shape the landscape of nucleotide variation in cancer genomes. Nat. Commun. 3, 1004 (2012).
    1. Stamatoyannopoulos J. A. et al.. Human mutation rate associated with DNA replication timing. Nat. Genet. 41, 393–395 (2009).
    1. Schuster-Bockler B. & Lehner B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature 488, 504–507 (2012).
    1. Polak P. et al.. Cell-of-origin chromatin organization shapes the mutational landscape of cancer. Nature 518, 360–364 (2015).
    1. Nik-Zainal S. et al.. Mutational processes molding the genomes of 21 breast cancers. Cell 149, 979–993 (2012).
    1. Nik-Zainal S. et al.. The life history of 21 breast cancers. Cell 149, 994–1007 (2012).
    1. Helleday T., Eshtad S. & Nik-Zainal S. Mechanisms underlying mutational signatures in human cancers. Nat. Rev. Genet. 15, 585–598 (2014).
    1. Alexandrov L. B. et al.. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
    1. Nik-Zainal S. et al.. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature (2016).
    1. Pena-Diaz J. & Jiricny J. Mammalian mismatch repair: error-free or error-prone? Trends Biochem. Sci. 37, 206–214 (2012).
    1. O'Donnell M., Langston L. & Stillman B. Principles and concepts of DNA replication in bacteria, archaea, and eukarya. Cold Spring Harb. Perspect. Biol. 5, a010108 (2013).
    1. Alberts B. Molecular Biology of the Cell 5th edn Garland Science (2008).
    1. Fragkos M., Ganier O., Coulombe P. & Mechali M. DNA replication origin activation in space and time. Nat. Rev. Mol. Cell Biol. 16, 360–374 (2015).
    1. ENCODE (2012).
    1. Guan Z. et al.. Decreased replication origin activity in temporal transition regions. J. Cell Biol. 187, 623–635 (2009).
    1. Guilbaud G. et al.. Evidence for sequential and increasing activation of replication origins along replication timing gradients in the human genome. PLoS Comput. Biol. 7, e1002322 (2011).
    1. Signatures C. M (2015).
    1. Conticello S. G. The AID/APOBEC family of nucleic acid mutators. Genome Biol. 9, 229 (2008).
    1. Harris R. S., Petersen-Mahrt S. K. & Neuberger M. S. RNA editing enzyme APOBEC1 and some of its homologs can act as DNA mutators. Mol. Cell 10, 1247–1253 (2002).
    1. Byeon I. J. et al.. NMR structure of human restriction factor APOBEC3A reveals substrate binding and enzyme specificity. Nat. Commun. 4, 1890 (2013).
    1. Holtz C. M., Sadler H. A. & Mansky L. M. APOBEC3G cytosine deamination hotspots are defined by both sequence context and single-stranded DNA secondary structure. Nucleic Acids Res. 41, 6139–6148 (2013).
    1. Nik-Zainal S. et al.. Association of a germline copy number polymorphism of APOBEC3A and APOBEC3B with burden of putative APOBEC-dependent mutations in breast cancer. Nat. Genet. 46, 487–491 (2014).
    1. Nouspikel T. DNA repair in mammalian cells : Nucleotide excision repair: variations on versatility. Cell. Mol. Life Sci. 66, 994–1009 (2009).
    1. Pleasance E. D. et al.. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 463, 191–196 (2010).
    1. Hanawalt P. C. & Spivak G. Transcription-coupled DNA repair: two decades of progress and surprises. Nature Rev. Mol. Cell Biol. 9, 958–970 (2008).
    1. Wang Y., Sheppard T. L., Tornaletti S., Maeda L. S. & Hanawalt P. C. Transcriptional inhibition by an oxidized abasic site in DNA. Chem. Res. Toxicol. 19, 234–241 (2006).
    1. Tornaletti S., Maeda L. S. & Hanawalt P. C. Transcription arrest at an abasic site in the transcribed strand of template DNA. Chem. Res. Toxicol. 19, 1215–1220 (2006).
    1. Tolstorukov M. Y., Volfovsky N., Stephens R. M. & Park P. J. Impact of chromatin structure on sequence variability in the human genome. Nat. Struct. Mol. Biol. 18, 510–515 (2011).
    1. Sasaki S. et al.. Chromatin-associated periodicity in genetic variation downstream of transcriptional start sites. Science 323, 401–404 (2009).
    1. Schopf B. et al.. Interplay between mismatch repair and chromatin assembly. Proc. Natl Acad. Sci. USA 109, 1895–1900 (2012).
    1. Lujan S. A. et al.. Heterogeneous polymerase fidelity and mismatch repair bias genome variation and composition. Genome Res. 24, 1751–1764 (2014).
    1. Strauss B. S. The "A" rule revisited: polymerases as determinants of mutational specificity. DNA Repair 1, 125–135 (2002).
    1. Kass E. M. & Jasin M. Collaboration and competition between DNA double-strand break repair pathways. FEBS Lett. 584, 3703–3708 (2010).
    1. Mateos-Gomez P. A. et al.. Mammalian polymerase theta promotes alternative NHEJ and suppresses recombination. Nature 518, 254–257 (2015).
    1. Ceccaldi R. et al.. Homologous-recombination-deficient tumours are dependent on Poltheta-mediated repair. Nature 518, 258–262 (2015).
    1. Costantino L. et al.. Break-induced replication repair of damaged forks induces genomic duplications in human cells. Science 343, 88–91 (2014).
    1. Durant S. T. & Nickoloff J. A. Good timing in the cell cycle for precise DNA repair by BRCA1. Cell Cycle 4, 1216–1222 (2005).
    1. Aguilera A. & Gomez-Gonzalez B. Genome instability: a mechanistic view of its causes and consequences. Nat. Rev. Genet. 9, 204–217 (2008).
    1. Carr A. M. & Lambert S. Replication stress-induced genome instability: the dark side of replication maintenance by homologous recombination. J. Mol. Biol. 425, 4733–4744 (2013).
    1. Simpson L. J. & Sale J. E. Rev1 is essential for DNA damage tolerance and non-templated immunoglobulin gene mutation in a vertebrate cell line. EMBO J. 22, 1654–1664 (2003).
    1. Alexandrov L. B., Nik-Zainal S., Wedge D. C., Campbell P. J. & Stratton M. R. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 3, 246–259 (2013).
    1. Consortium E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
    1. Kaykov A. & Nurse P. The spatial and temporal organization of origin firing during the S-phase of fission yeast. Genome Res. 25, 391–401 (2015).
    1. Gilbert D. M. Making sense of eukaryotic DNA replication origins. Science 294, 96–100 (2001).
    1. Rhind N. DNA replication timing: random thoughts about origin firing. Nat. Cell Biol. 8, 1313–1316 (2006).
    1. Vengrova S. & Dalgaard J. Z. RNase-sensitive DNA modification(s) initiates S. pombe mating-type switching. Genes Dev. 18, 794–804 (2004).
    1. Arcangioli B. & de Lahondes R. Fission yeast switches mating type by a replication-recombination coupled process. EMBO J. 19, 1389–1396 (2000).
    1. Kaykov A., Taillefumier T., Bensimon A. & Nurse P. Molecular combing of single DNA molecules on the 10 megabase scale. Sci. Rep. 6, 19636 (2016).
    1. Gaffney D. J. et al.. Controls of nucleosome positioning in the human genome. PLoS Genet. 8, e1003036 (2012).

Source: PubMed

3
Subscribe