A small-cell lung cancer genome with complex signatures of tobacco exposure

Erin D Pleasance, Philip J Stephens, Sarah O'Meara, David J McBride, Alison Meynert, David Jones, Meng-Lay Lin, David Beare, King Wai Lau, Chris Greenman, Ignacio Varela, Serena Nik-Zainal, Helen R Davies, Gonzalo R Ordoñez, Laura J Mudie, Calli Latimer, Sarah Edkins, Lucy Stebbings, Lina Chen, Mingming Jia, Catherine Leroy, John Marshall, Andrew Menzies, Adam Butler, Jon W Teague, Jonathon Mangion, Yongming A Sun, Stephen F McLaughlin, Heather E Peckham, Eric F Tsung, Gina L Costa, Clarence C Lee, John D Minna, Adi Gazdar, Ewan Birney, Michael D Rhodes, Kevin J McKernan, Michael R Stratton, P Andrew Futreal, Peter J Campbell, Erin D Pleasance, Philip J Stephens, Sarah O'Meara, David J McBride, Alison Meynert, David Jones, Meng-Lay Lin, David Beare, King Wai Lau, Chris Greenman, Ignacio Varela, Serena Nik-Zainal, Helen R Davies, Gonzalo R Ordoñez, Laura J Mudie, Calli Latimer, Sarah Edkins, Lucy Stebbings, Lina Chen, Mingming Jia, Catherine Leroy, John Marshall, Andrew Menzies, Adam Butler, Jon W Teague, Jonathon Mangion, Yongming A Sun, Stephen F McLaughlin, Heather E Peckham, Eric F Tsung, Gina L Costa, Clarence C Lee, John D Minna, Adi Gazdar, Ewan Birney, Michael D Rhodes, Kevin J McKernan, Michael R Stratton, P Andrew Futreal, Peter J Campbell

Abstract

Cancer is driven by mutation. Worldwide, tobacco smoking is the principal lifestyle exposure that causes cancer, exerting carcinogenicity through >60 chemicals that bind and mutate DNA. Using massively parallel sequencing technology, we sequenced a small-cell lung cancer cell line, NCI-H209, to explore the mutational burden associated with tobacco smoking. A total of 22,910 somatic substitutions were identified, including 134 in coding exons. Multiple mutation signatures testify to the cocktail of carcinogens in tobacco smoke and their proclivities for particular bases and surrounding sequence context. Effects of transcription-coupled repair and a second, more general, expression-linked repair pathway were evident. We identified a tandem duplication that duplicates exons 3-8 of CHD7 in frame, and another two lines carrying PVT1-CHD7 fusion genes, indicating that CHD7 may be recurrently rearranged in this disease. These findings illustrate the potential for next-generation sequencing to provide unprecedented insights into mutational processes, cellular repair pathways and gene networks associated with cancer.

Figures

Figure 1
Figure 1
The compendium of somatic mutations in a small cell lung cancer genome. (A) Power calculations showing the number of true somatic substitutions detected (blue) and mis-calls (SNPs called as somatic mutations, burgundy, and sequencing errors called as mutations, green) for different levels of sequence coverage. Calculations are based on a true mutation prevalence of 1/Mb (black line). (B) Histogram of the actual coverage achieved per base of the tumour (blue) and normal (burgundy) genomes. (C) Figurative representation of the catalogue of somatic mutations in the genome of NCI-H209. Chromosome ideograms are shown around the outer ring and are oriented pter-qter in a clockwise direction with centromeres indicated in red. Other tracks contain somatic alterations: validated insertions (light green rectangles); validated deletions (dark green rectangles); heterozygous (light orange bars) and homozygous (dark orange bars) substitutions shown by density per 10 megabases; coding substitutions (coloured squares; silent in grey, missense in purple, nonsense in red and splice site in black); copy number (blue lines); validated intrachromosomal rearrangements (green lines); validated interchromosomal rearrangements (purple lines).
Figure 2
Figure 2
The mutation profile of NCI-H209. (A) Numbers of mutations in each of the 6 possible mutation classes. (B) Fraction of the three classes of guanine mutations occurring at CpG dinucleotides in NCI-H209, with p values reflecting the comparison with the expected fraction across the genome (grey). (C) Fraction of guanine mutations at CpGs which are found in CpG islands for each of the three classes of mutation. P values reflect comparison with the genome-wide fraction (grey) of CpGs found in CpG islands (and hence more likely to be constitutively unmethylated) versus outside CpG islands (high rates of constitutive methylation). (D) Distribution of the four NpA dinucleotides for each of the three classes of adenine mutation in NCI-H209, compared to the expected distribution across the genome (left). (E) Fitted curves showing the effects of gene expression and strand bias on mutation prevalence for the six classes of adenine and guanine mutation in NCI-H209. The y axis is expressed as mutations per Mb of at-risk nucleotides, namely mutations/1,000,000 Gs for G>T.
Figure 3
Figure 3
Localised complexes of somatically acquired genomic rearrangements in NCI-H209. Copy number plots across regions on chromosomes 1 and 4 are shown. Inverted intrachromosmal rearrangements (blue), non-inverted intrachromosomal rearrangements (brown) and interchromosomal rearrangements (red) are shown in relation to copy number changes. The inset shows the representative chromosomes on spectral karyotyping. There are three breakpoints between chromosome 1 (yellow) and 4 (light blue), and a translocation between chromosomes 4 and 5 (tan).
Figure 4
Figure 4
CHD7 rearrangements in SCLC cell lines. (A) A somatically acquired 39.5kb tandem duplication is found in NCI-H209. (B) The LU-135 cell line shows co-amplification of the 3′ portion of CHD7 together with MYC and the 5′ portion of PVT1. Blue lines show locations of genomic rearrangements observed in the amplicons, with the thickness of the line proportional to the number of reads spanning the breakpoint. (C) Transcripts resulting from CHD7 rearrangements are an in-frame duplication of exons 3-8 in NCI-H209 and two amplified PVT1-CHD7 fusion genes in NCI-H2171 and LU-135. (D) CHD7 is over-expressed in SCLC compared to both non-small cell lung cancer and other tumour types. LU-135 and NCI-H2171 show massive over-expression of CHD7 in keeping with the genomic amplification present in these cell lines.

References

    1. Jha P. Avoidable global cancer deaths and total deaths from smoking. Nat Rev Cancer. 2009;9:655–64.
    1. Hecht SS. Progress and challenges in selected areas of tobacco carcinogenesis. Chem Res Toxicol. 2008;21:160–71.
    1. Pfeifer GP, et al. Tobacco smoke carcinogens, DNA damage and p53 mutations in smoking-associated cancers. Oncogene. 2002;21:7435–51.
    1. DeMarini DM. Genotoxicity of tobacco smoke and tobacco smoke condensate: a review. Mutat Res. 2004;567:447–74.
    1. Rodin SN, Rodin AS. Origins and selection of p53 mutations in lung carcinogenesis. Semin Cancer Biol. 2005;15:103–12.
    1. Toh CK. The changing epidemiology of lung cancer. Methods Mol Biol. 2009;472:397–411.
    1. Sher T, Dy GK, Adjei AA. Small cell lung cancer. Mayo Clin Proc. 2008;83:355–67.
    1. Wistuba II, Gazdar AF, Minna JD. Molecular genetics of small cell lung carcinoma. Semin Oncol. 2001;28:3–13.
    1. Horowitz JM, et al. Frequent inactivation of the retinoblastoma anti-oncogene is restricted to a subset of human tumor cells. Proc Natl Acad Sci U S A. 1990;87:2775–9.
    1. Mori N, et al. Variable mutations of the RB gene in small-cell lung carcinoma. Oncogene. 1990;5:1713–7.
    1. Yokomizo A, et al. PTEN/MMAC1 mutations identified in small cell, but not in non-small cell lung cancers. Oncogene. 1998;17:475–9.
    1. Ley TJ, et al. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature. 2008;456:66–72.
    1. Mardis ER, et al. Recurring Mutations Found by Sequencing an Acute Myeloid Leukemia Genome. N Engl J Med. 2009
    1. Campbell PJ, et al. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat Genet. 2008;40:722–9.
    1. Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. 2009;458:719–24.
    1. Carney DN, et al. Establishment and identification of small cell lung cancer cell lines having classic and variant features. Cancer Res. 1985;45:2913–23.
    1. Barbone F, Bovenzi M, Cavallieri F, Stanta G. Cigarette smoking and histologic type of lung cancer in men. Chest. 1997;112:1474–9.
    1. Lubin JH, Blot WJ. Assessment of lung cancer risk factors by histologic category. J Natl Cancer Inst. 1984;73:383–9.
    1. Kaye FJ, Kratzke RA, Gerster JL, Horowitz JM. A single amino acid substitution results in a retinoblastoma protein defective in phosphorylation and oncoprotein binding. Proc Natl Acad Sci U S A. 1990;87:6922–6.
    1. Denissenko MF, Chen JX, Tang MS, Pfeifer GP. Cytosine methylation determines hot spots of DNA damage in the human P53 gene. Proc Natl Acad Sci U S A. 1997;94:3893–8.
    1. Feng Z, Hu W, Hu Y, Tang MS. Acrolein is a major cigarette-related lung cancer agent: Preferential binding at p53 mutational hotspots and inhibition of DNA repair. Proc Natl Acad Sci U S A. 2006;103:15404–9.
    1. Eckhardt F, et al. DNA methylation profiling of human chromosomes 6, 20 and 22. Nat Genet. 2006;38:1378–85.
    1. Greenman C, et al. Patterns of somatic mutation in human cancer genomes. Nature. 2007;446:153–8.
    1. Jones S, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–6.
    1. Fekairi S, et al. Human SLX4 is a Holliday junction resolvase subunit that binds multiple DNA repair/recombination endonucleases. Cell. 2009;138:78–89.
    1. Svendsen JM, et al. Mammalian BTBD12/SLX4 assembles a Holliday junction resolvase and is required for DNA repair. Cell. 2009;138:63–77.
    1. Rozman M, et al. Type I MOZ/CBP (MYST3/CREBBP) is the most common chimeric transcript in acute myeloid leukemia with t(8;16)(p11;p13) translocation. Genes Chromosomes Cancer. 2004;40:140–5.
    1. Carramusa L, et al. The PVT-1 oncogene is a Myc protein target that is overexpressed in transformed cells. J Cell Physiol. 2007;213:511–8.
    1. Zeidler R, et al. Breakpoints of Burkitt's lymphoma t(8;22) translocations map within a distance of 300 kb downstream of MYC. Genes Chromosomes Cancer. 1994;9:282–7.
    1. Guan Y, et al. Amplification of PVT1 contributes to the pathophysiology of ovarian and breast cancer. Clin Cancer Res. 2007;13:5745–55.
    1. Schnetz MP, et al. Genomic distribution of CHD7 on chromatin tracks H3K4 methylation patterns. Genome Res. 2009;19:590–601.
    1. van Haaften G, et al. Somatic mutations of the histone H3K27 demethylase gene UTX in human cancer. Nat Genet. 2009;41:521–3.
    1. Bagchi A, et al. CHD5 is a tumor suppressor at human 1p36. Cell. 2007;128:459–75.
    1. Jackson MA, Lea I, Rashid A, Peddada SD, Dunnick JK. Genetic alterations in cancer knowledge system: analysis of gene mutations in mouse and human liver and lung tumors. Toxicol Sci. 2006;90:400–18.
    1. Shuck SC, Short EA, Turchi JJ. Eukaryotic nucleotide excision repair: from understanding mechanisms to influencing biology. Cell Res. 2008;18:64–72.
    1. Liu Y, et al. Interactions of human replication protein A with single-stranded DNA adducts. Biochem J. 2005;385:519–26.
    1. Lubin JH, et al. Cigarette smoking and cancer risk: modeling total exposure and intensity. Am J Epidemiol. 2007;166:479–89.
    1. Davies H, et al. Somatic mutations of the protein kinase gene family in human lung cancer. Cancer Res. 2005;65:7591–5.
    1. Ding L, et al. Somatic mutations affect key pathways in lung adenocarcinoma. Nature. 2008;455:1069–75.
    1. Venkatraman ES, Olshen AB. A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics. 2007;23:657–63.
    1. Krzywinski M, et al. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19:1639–45.
    1. Hoffmann MM, Birney E. An effective model for natural selection in promoters. 2009 Manuscript submitted.
    1. Vlieghe D, et al. A new generation of JASPAR, the open-access repository for transcription factor binding site profiles. Nucleic Acids Res. 2006;34:D95–7.

Source: PubMed

3
Abonnere