Extrachromosomal DNA is associated with oncogene amplification and poor outcome across multiple cancers

Hoon Kim, Nam-Phuong Nguyen, Kristen Turner, Sihan Wu, Amit D Gujar, Jens Luebeck, Jihe Liu, Viraj Deshpande, Utkrisht Rajkumar, Sandeep Namburi, Samirkumar B Amin, Eunhee Yi, Francesca Menghi, Johannes H Schulte, Anton G Henssen, Howard Y Chang, Christine R Beck, Paul S Mischel, Vineet Bafna, Roel G W Verhaak, Hoon Kim, Nam-Phuong Nguyen, Kristen Turner, Sihan Wu, Amit D Gujar, Jens Luebeck, Jihe Liu, Viraj Deshpande, Utkrisht Rajkumar, Sandeep Namburi, Samirkumar B Amin, Eunhee Yi, Francesca Menghi, Johannes H Schulte, Anton G Henssen, Howard Y Chang, Christine R Beck, Paul S Mischel, Vineet Bafna, Roel G W Verhaak

Abstract

Extrachromosomal DNA (ecDNA) amplification promotes intratumoral genetic heterogeneity and accelerated tumor evolution1-3; however, its frequency and clinical impact are unclear. Using computational analysis of whole-genome sequencing data from 3,212 cancer patients, we show that ecDNA amplification frequently occurs in most cancer types but not in blood or normal tissue. Oncogenes were highly enriched on amplified ecDNA, and the most common recurrent oncogene amplifications arose on ecDNA. EcDNA amplifications resulted in higher levels of oncogene transcription compared to copy number-matched linear DNA, coupled with enhanced chromatin accessibility, and more frequently resulted in transcript fusions. Patients whose cancers carried ecDNA had significantly shorter survival, even when controlled for tissue type, than patients whose cancers were not driven by ecDNA-based oncogene amplification. The results presented here demonstrate that ecDNA-based oncogene amplification is common in cancer, is different from chromosomal amplification and drives poor outcome for patients across many cancer types.

Conflict of interest statement

COMPETING INTERESTS

H.Y.C., P.S.M., V.B. and R.G.W.V. are scientific co-founders of Boundless Bio, Inc. (BBI), and serve as consultants. V.B. is a co-founder, and has equity interest in Digital Proteomics, LLC (DP), and receives income from DP. The terms of this arrangement have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. N.P. N. and K.T. are employees of Boundless Bio, Inc.

Figures

**Extended Data Fig. 1. Amplicon classification**
A. Validation on cell line data. Validation of the classification scheme on cell line data with FISH experiments for detecting ecDNA from the Turner et al. and deCarvalho et al. studies, in addition to newly generated data. FISH probes were designed for selected oncogenes and DAPI staining was performed to determine whether the FISH probe landed on chromosomal DNA or ecDNA. For each cell (represented as an image of the cell in metaphase), the number of positive ecDNA probes were counted, and for each cell line, the average positive ecDNA per cell was reported. For each probe, we report whether it landed in an amplicon (inferred from AmpliconArchitect), and if so, what was the amplicon’s classification. The distribution for the average ecDNA per cell between the Circular and non-circular classes was statistically significantly different (p-value < 1e-9; Wilcoxon rank sum test). **B, C and D.** Whole-genome sequencing derived based Circular amplicon regions (blue) were validated with Circle-seq (red) for three neuroblastoma samples (CB2001, CB2022, and CB2050, respectively) used in the Koche et al. study.

Extended Data Fig. 2. Circular vs amplified… — **Extended Data Fig. 2. Circular vs amplified non-circular amplification comparisons**
A. 24 recurrently amplified oncogenes significantly overlap circular regions (z-score 37.8), especially compared to amplified non-circular regions (z-scores of 30.4, 29.5, 28.0 for Linear, Heavily-rearranged, and BFB). B. For all oncogenes on amplicons with copy number >= 4 and present in at least 5 samples across the cohort, we show the class distribution of that oncogene. The oncogenes are ordered by proportion on circular amplification. C. For the 24 recurrent oncogenes known to be activated via amplification (**Zack et al. Nat Gen. 2013**), we report the average copy number for the oncogenes for circular amplification versus amplified-noncircular amplification. D. Breakpoint location across all samples for each recurrently amplified oncogene. We identified all breakpoints from each sample containing the recurrent oncogene on ecDNA and report the total number of breakpoints across this region in 1kb binned windows. E. Distribution of breakpoint locations across all circular samples for each recurrently amplified oncogene. We identified all breakpoints from each sample containing the recurrent oncogene on ecDNA. Shown is the distribution of the number of breakpoints in each bin, which closely follows a Poisson distribution, suggesting that the breakpoints are mostly randomly distributed across the region.

Extended Data Fig. 3. Genome instability vs… — **Extended Data Fig. 3. Genome instability vs amplicon classes**
A. Chromosome arm aneuploidy scores showing no or marginal difference in chromosomal arm level events between circular and non-circular amplification classes. B. Genome doubling events by amplification class. C. Distribution for total DNA loss segments by amplification class. WGS-inferred CNV data was used to count the total number of DNA losses within a sample. A DNA loss was defined as a segment with CN < 2. D. Distribution for total DNA gain segments by amplification class. WGS-inferred CNV data was used to count the total number of DNA gains within a sample. A DNA gain was defined as a segment with CN > 2. Circular samples contain statistically significantly more DNA gains than BFB, Heavily-rearranged, Linear, and No-fSCNA (p-value <0.03, <0.03, <1e-20, and <1e-111, respectively; Wilcox Rank Sum Test). E. Breakpoint homology by amplification class. F. Comparison of amplicon versus locus-level chromothripsis (Pearson′s Chi-squared test data: X-squared = 4674.7, df = 3, p-value < 2.2e-16). G. Comparison of sample category versus sample-level chromothripsis (Pearson′s Chi-squared test data: X-squared = 21.58, df = 3, p-value 8e-05 (excludes ‘No fSCNA detected’ category)). H. Comparison of sample category versus sample-level tandem duplication (Pearson′s Chi-squared test data: X-squared = 7.39, df = 3, p-value 0.06 (excludes ‘No fSCNA detected’ category)).

Extended Data Fig. 4. Gene expression of… — **Extended Data Fig. 4. Gene expression of amplicon classes**
Copy number of the oncogene versus its fold-change in FPKM for all oncogenes with a copy count greater than 4, for each oncogene on each amplicon. The fold-change in FPKM is computed as the oncogene’s (FPKM-UQ+1) divided by the average of (FPKM-UQ+1) for the same oncogene in all other tumor samples from the same cohort for which the oncogene is not on any amplicon (i.e., not amplified). Linear regression lines, using fold change = m*CNV+b where m and b are selected to minimize error of the fit, are shown for each class. Tukey′s range test shows oncogenes on circular structures are significantly different to oncogenes on non-circular structures (p-value

**Extended Data Fig. 5. Lymph node stage…**

**Extended Data Fig. 5. Lymph node stage vs amplicon classes**

Lymph node stage for primary…

**Extended Data Fig. 5. Lymph node stage vs amplicon classes**
Lymph node stage for primary tumors showing samples with amplification are more likely to have spread to the lymph node at time of diagnosis (Chi-square test; df=4; p-value

**Extended Data Fig. 6. Cell cycle and…**

**Extended Data Fig. 6. Cell cycle and immune infiltrate gene expression signatures vs amplicon classes**

**Extended Data Fig. 6. Cell cycle and immune infiltrate gene expression signatures vs amplicon classes**
A. Cell Cycle gene expression signature single sample GSEA (ssGSEA) scores by amplification category. B. Immune infiltrate gene expression signature single sample GSEA (ssGSEA) scores by amplification category.

**Fig. 1 |. Frequency of circular amplification…**

**Fig. 1 |. Frequency of circular amplification across tumor and non-tumor tissues.**

A. Schematic representation…

**Fig. 1 |. Frequency of circular amplification across tumor and non-tumor tissues.**
A. Schematic representation of the four classification categories. All DNA regions with a copy number of 4 or greater than ploidy and comprising at least 10 kb were classified using a hierarchical scheme based on the AmpliconArchitect amplicon reconstruction as well as the types of discordant breakpoint edges in the region. The four categories are defined as follows - 1) Linear amplicon: an amplicon that contains amplified segments with either no discordant edges or with edges suggesting deletions smaller than 1 Mb. 2) Heavily-rearranged amplicon: an amplicon which contains amplified segments connected by discordant breakpoint edges suggesting higher-order rearrangements beyond small deletions - such as inversions, interchromosomal edges or deletions > 1Mbp. 3) Breakage-fusion-bridge (BFB) amplicon: an amplicon having a proportion of foldback reads in excess of 25%, and which may have signatures of heavily rearranged or circular amplification. 4) Circular amplicon: an amplicon which contains one or more genomic segments forming a cyclic path of at least 10 kbp and 4+ copies. B. Left panel: Comparison of whole-genome sequencing derived circular DNA amplicon and Circle-seq derived segments. Right panel: Circular amplicons detected from whole-genome sequencing with AmpliconArchitect were validated with Circle-seq. N: not validated by Circle-Seq. C. Distribution of circular, BFB, Heavily-rearranged, Linear, and no focal somatic copy number amplification detected (No-fSCNA) amplicon categories by tumor and normal tissue, across 3,731 tumor and non-neoplastic sample derived whole-genomes from TCGA and 1,291 whole-genomes from PCAWG.

**Fig. 2 |. Oncogene content and structural…**

**Fig. 2 |. Oncogene content and structural component of circular amplification.**

A. Genome-wide distribution of…

**Fig. 2 |. Oncogene content and structural component of circular amplification.**
A. Genome-wide distribution of amplification peaks by amplicon class. Amplifications were counted per 1Mb bin and are shown as a fraction of the total number of samples per amplicon class. B. Classification of amplification status by gene. Shown are the 24 most frequently amplified oncogenes. C. Breakpoint locations (right) and distribution of breakpoints (left) across all circular samples with amplified *CCND1* (top), *EGFR* (middle), and *MYC* (bottom). Breakpoints were identified in each sample containing the amplified oncogene region. Shown are the total number of breakpoints across this region in 1kb binned windows (right). The distribution of the number of breakpoints in each bin closely follows a Poisson distribution (left), suggesting that the breakpoints are mostly randomly distributed across the region. D. The number of genome-wide DNA segments within a sample was compared between Circular, BFB, Heavily-rearranged, Linear, and No-fSCNA detected classes. Circular samples contained statistically significantly more DNA segments than non-circular samples (p-value 0.0046, 7.2e-6, 2.4e-19 and 9.4e-125, respectively; Wilcox Rank Sum Test (two-sided)).

**Fig. 3 |. Gene expression and chromatin…**

**Fig. 3 |. Gene expression and chromatin accessibility of amplicon classes.**

A. Copy number of…

**Fig. 3 |. Gene expression and chromatin accessibility of amplicon classes.**
A. Copy number of oncogene versus its fold-change in Fragments Per Kilobase of transcript per Million mapped reads upper quartile (FPKM-UQ) for all oncogenes with a copy count greater than 4, for each oncogene on each amplicon. The fold-change in FPKM-UQ is computed as the oncogene’s (FPKM-UQ+1) divided by the average of (FPKM-UQ+1) for the same oncogene in all other tumor samples from the same cohort for which the oncogene is not on any amplicon (i.e., not amplified). Linear regression lines, using fold change = m*copy number+b, and their 95% confidence level intervals (in grey) are shown for each class. Tukey’s range test shows oncogenes on circular structures are significantly different to oncogenes on non-circular structures (p-value < 1e-7). WGS: whole-genome sequencing. B. For each of the 36 The Cancer Genome Atlas (TCGA) samples with Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) profiles and AmpliconArchitect results, the copy-number normalized fold-change in ATAC-seq signal in each ATAC-seq peak that overlaps with the amplicon relative to tissue types without amplification within the same peak is shown. The distribution of fold-change for Circular amplicons is statistically significantly higher than Linear and Heavily-rearranged amplicons (Wilcoxon rank sum test (two-sided); p-value < 1e-16). Y-axis is on log(2) scale. Box plots are defined as 25th, 50th and 75th percentiles, respectively. Y-axis is on log(2) scale. NS: not significant. C. Circular structures expressed significantly more gene fusions compared to non-circular amplicons, after size normalization. CN: copy number. D. Representative Circos-plot showing (rings from outside to inside) 1) Amplicon regions identified by AmpliconArchitect, where interconnected breakpoints were indicated with arrows; 2) DNA copy-number, where height and color represent level (darker red means higher copy number amplification); 3) FPKM expression values in green, where height and color represent expression level (darker green means higher expression); 4) ATAC-seq chromatin accessibility in blue, where height and color represent expression level (darker blue means more accessible). CNV: Copy Number Variation.

**Fig. 4 |. Presence of circular amplification…**

**Fig. 4 |. Presence of circular amplification associates with poor outcomes.**

A. Kaplan-Meier five-year survival…

**Fig. 4 |. Presence of circular amplification associates with poor outcomes.**
A. Kaplan-Meier five-year survival curves by amplification category. Patients whose tumors contain at least one Circular amplicon have significantly worse outcome compared to patients whose tumors were classified as non-circular. The p-value comparing survival curves was based on a log-rank test. B. Multivariate Cox-Hazard model, incorporating disease and patient cohorts as parameters showing circular amplification results in significantly higher hazard ratios. The error bars represent 95% confidence intervals of the hazard ratio.

All figures (10)

See this image and copyright information in PMC

Comment in

ecDNA within tumors: a new mechanism that drives tumor heterogeneity and drug resistance.
Zeng X, Wan M, Wu J. Zeng X, et al. Signal Transduct Target Ther. 2020 Nov 24;5(1):277. doi: 10.1038/s41392-020-00403-4. Signal Transduct Target Ther. 2020. PMID: 33235201 Free PMC article. No abstract available.

Similar articles

Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity.
Turner KM, Deshpande V, Beyter D, Koga T, Rusert J, Lee C, Li B, Arden K, Ren B, Nathanson DA, Kornblum HI, Taylor MD, Kaushal S, Cavenee WK, Wechsler-Reya R, Furnari FB, Vandenberg SR, Rao PN, Wahl GM, Bafna V, Mischel PS. Turner KM, et al. Nature. 2017 Mar 2;543(7643):122-125. doi: 10.1038/nature21356. Epub 2017 Feb 8. Nature. 2017. PMID: 28178237 Free PMC article.

Extrachromosomal oncogene amplification in tumour pathogenesis and evolution.
Verhaak RGW, Bafna V, Mischel PS. Verhaak RGW, et al. Nat Rev Cancer. 2019 May;19(5):283-288. doi: 10.1038/s41568-019-0128-6. Nat Rev Cancer. 2019. PMID: 30872802 Free PMC article. Review.

Circular ecDNA promotes accessible chromatin and high oncogene expression.
Wu S, Turner KM, Nguyen N, Raviram R, Erb M, Santini J, Luebeck J, Rajkumar U, Diao Y, Li B, Zhang W, Jameson N, Corces MR, Granja JM, Chen X, Coruh C, Abnousi A, Houston J, Ye Z, Hu R, Yu M, Kim H, Law JA, Verhaak RGW, Hu M, Furnari FB, Chang HY, Ren B, Bafna V, Mischel PS. Wu S, et al. Nature. 2019 Nov;575(7784):699-703. doi: 10.1038/s41586-019-1763-5. Epub 2019 Nov 20. Nature. 2019. PMID: 31748743 Free PMC article.

Extrachromosomal DNA (ecDNA) in cancer pathogenesis.
Wu S, Bafna V, Mischel PS. Wu S, et al. Curr Opin Genet Dev. 2021 Feb;66:78-82. doi: 10.1016/j.gde.2021.01.001. Epub 2021 Jan 18. Curr Opin Genet Dev. 2021. PMID: 33477016 Review.

Extrachromosomal DNA: An Emerging Hallmark in Human Cancer.
Wu S, Bafna V, Chang HY, Mischel PS. Wu S, et al. Annu Rev Pathol. 2022 Jan 24;17:367-386. doi: 10.1146/annurev-pathmechdis-051821-114223. Epub 2021 Nov 9. Annu Rev Pathol. 2022. PMID: 34752712 Free PMC article. Review.

See all similar articles

Cited by

Extrachromosomal DNA in the cancerous transformation of Barrett's oesophagus.
Luebeck J, Ng AWT, Galipeau PC, Li X, Sanchez CA, Katz-Summercorn AC, Kim H, Jammula S, He Y, Lippman SM, Verhaak RGW, Maley CC, Alexandrov LB, Reid BJ, Fitzgerald RC, Paulson TG, Chang HY, Wu S, Bafna V, Mischel PS. Luebeck J, et al. Nature. 2023 Apr 12. doi: 10.1038/s41586-023-05937-5. Online ahead of print. Nature. 2023. PMID: 37046089

PLCG2 can exist in eccDNA and contribute to the metastasis of non-small cell lung cancer by regulating mitochondrial respiration.
Yang Y, Yang Y, Huang H, Song T, Mao S, Liu D, Zhang L, Li W. Yang Y, et al. Cell Death Dis. 2023 Apr 8;14(4):257. doi: 10.1038/s41419-023-05755-7. Cell Death Dis. 2023. PMID: 37031207 Free PMC article.

eccDB: a comprehensive repository for eccDNA-mediated chromatin contacts in multi-species.
Yang M, Qiu B, He GY, Zhou JY, Yu HJ, Zhang YY, Li YS, Li TS, Guo JC, Li XC, Xie JJ. Yang M, et al. Bioinformatics. 2023 Apr 3;39(4):btad173. doi: 10.1093/bioinformatics/btad173. Bioinformatics. 2023. PMID: 37018146 Free PMC article.

Somatic mutation landscape in a cohort of meningiomas that have undergone grade progression.
Cain SA, Pope B, Mangiola S, Mantamadiotis T, Drummond KJ. Cain SA, et al. BMC Cancer. 2023 Mar 7;23(1):216. doi: 10.1186/s12885-023-10624-9. BMC Cancer. 2023. PMID: 36882706 Free PMC article.

Extrachromosomal circular DNA in colorectal cancer: biogenesis, function and potential as therapeutic target.
Chen Y, Qiu Q, She J, Yu J. Chen Y, et al. Oncogene. 2023 Mar;42(13):941-951. doi: 10.1038/s41388-023-02640-7. Epub 2023 Mar 1. Oncogene. 2023. PMID: 36859558 Free PMC article. Review.

See all "Cited by" articles

References

deCarvalho AC et al. Discordant inheritance of chromosomal and extrachromosomal DNA elements contributes to dynamic disease evolution in glioblastoma. Nat Genet 50, 708–717 (2018). - PMC - PubMed

Turner KM et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature 543, 122–125 (2017). - PMC - PubMed

Verhaak RGW, Bafna V & Mischel PS Extrachromosomal oncogene amplification in tumour pathogenesis and evolution. Nat Rev Cancer (2019). - PMC - PubMed

Weischenfeldt J et al. Pan-cancer analysis of somatic copy-number alterations implicates IRS4 and IGF2 in enhancer hijacking. Nat Genet 49, 65–74 (2017). - PMC - PubMed

Zack TI et al. Pan-cancer patterns of somatic copy number alteration. Nat Genet 45, 1134–40 (2013). - PMC - PubMed

Show all 43 references

Publication types
Research Support, N.I.H., Extramural
Actions
Search in PubMed
Search in MeSH
Add to Search
Research Support, Non-U.S. Gov't
Actions
Search in PubMed
Search in MeSH
Add to Search
Research Support, U.S. Gov't, Non-P.H.S.
Actions
Search in PubMed
Search in MeSH
Add to Search

MeSH terms
Cell Line, Tumor
Actions
Search in PubMed
Search in MeSH
Add to Search
Chromatin / genetics
Actions
Search in PubMed
Search in MeSH
Add to Search
Chromosomes / genetics*
Actions
Search in PubMed
Search in MeSH
Add to Search
DNA / genetics*
Actions
Search in PubMed
Search in MeSH
Add to Search
Gene Amplification / genetics*
Actions
Search in PubMed
Search in MeSH
Add to Search
Humans
Actions
Search in PubMed
Search in MeSH
Add to Search
Neoplasms / genetics*
Actions
Search in PubMed
Search in MeSH
Add to Search
Oncogenes / genetics*
Actions
Search in PubMed
Search in MeSH
Add to Search

Substances
Chromatin
Actions
Search in PubMed
Search in MeSH
Add to Search
DNA
Actions
Search in PubMed
Search in MeSH
Add to Search

Related information
MedGen
PubChem Compound (MeSH Keyword)

Grant support
RM1 HG007735/HG/NHGRI NIH HHS/United States
R01 CA190121/CA/NCI NIH HHS/United States
R21 NS114873/NS/NINDS NIH HHS/United States
R35 GM133600/GM/NIGMS NIH HHS/United States
R01 NS073831/NS/NINDS NIH HHS/United States
P50 HG007735/HG/NHGRI NIH HHS/United States
R01 CA237208/CA/NCI NIH HHS/United States
P30 CA023100/CA/NCI NIH HHS/United States
R01 GM114362/GM/NIGMS NIH HHS/United States
R35 CA209919/CA/NCI NIH HHS/United States
HHMI/Howard Hughes Medical Institute/United States
P30 CA034196/CA/NCI NIH HHS/United States
Show all 12 grants

LinkOut - more resources
Full Text Sources
Europe PubMed Central
Nature Publishing Group
PubMed Central
Other Literature Sources
The Lens - Patent Citations

**Full text links** [x]
Nature Publishing Group Free PMC article

[x]
Cite

Copy Download .nbib
Format: AMA APA MLA NLM

**Send To**

Clipboard

Email

Save

My Bibliography

Collections

Citation Manager

[x]

Extended Data Fig. 5. Lymph node stage… — **Extended Data Fig. 5. Lymph node stage vs amplicon classes**
Lymph node stage for primary tumors showing samples with amplification are more likely to have spread to the lymph node at time of diagnosis (Chi-square test; df=4; p-value

**Extended Data Fig. 6. Cell cycle and…**

**Extended Data Fig. 6. Cell cycle and immune infiltrate gene expression signatures vs amplicon classes**

**Extended Data Fig. 6. Cell cycle and immune infiltrate gene expression signatures vs amplicon classes**
A. Cell Cycle gene expression signature single sample GSEA (ssGSEA) scores by amplification category. B. Immune infiltrate gene expression signature single sample GSEA (ssGSEA) scores by amplification category.

**Fig. 1 |. Frequency of circular amplification…**

**Fig. 1 |. Frequency of circular amplification across tumor and non-tumor tissues.**

A. Schematic representation…

**Fig. 1 |. Frequency of circular amplification across tumor and non-tumor tissues.**
A. Schematic representation of the four classification categories. All DNA regions with a copy number of 4 or greater than ploidy and comprising at least 10 kb were classified using a hierarchical scheme based on the AmpliconArchitect amplicon reconstruction as well as the types of discordant breakpoint edges in the region. The four categories are defined as follows - 1) Linear amplicon: an amplicon that contains amplified segments with either no discordant edges or with edges suggesting deletions smaller than 1 Mb. 2) Heavily-rearranged amplicon: an amplicon which contains amplified segments connected by discordant breakpoint edges suggesting higher-order rearrangements beyond small deletions - such as inversions, interchromosomal edges or deletions > 1Mbp. 3) Breakage-fusion-bridge (BFB) amplicon: an amplicon having a proportion of foldback reads in excess of 25%, and which may have signatures of heavily rearranged or circular amplification. 4) Circular amplicon: an amplicon which contains one or more genomic segments forming a cyclic path of at least 10 kbp and 4+ copies. B. Left panel: Comparison of whole-genome sequencing derived circular DNA amplicon and Circle-seq derived segments. Right panel: Circular amplicons detected from whole-genome sequencing with AmpliconArchitect were validated with Circle-seq. N: not validated by Circle-Seq. C. Distribution of circular, BFB, Heavily-rearranged, Linear, and no focal somatic copy number amplification detected (No-fSCNA) amplicon categories by tumor and normal tissue, across 3,731 tumor and non-neoplastic sample derived whole-genomes from TCGA and 1,291 whole-genomes from PCAWG.

**Fig. 2 |. Oncogene content and structural…**

**Fig. 2 |. Oncogene content and structural component of circular amplification.**

A. Genome-wide distribution of…

**Fig. 2 |. Oncogene content and structural component of circular amplification.**
A. Genome-wide distribution of amplification peaks by amplicon class. Amplifications were counted per 1Mb bin and are shown as a fraction of the total number of samples per amplicon class. B. Classification of amplification status by gene. Shown are the 24 most frequently amplified oncogenes. C. Breakpoint locations (right) and distribution of breakpoints (left) across all circular samples with amplified *CCND1* (top), *EGFR* (middle), and *MYC* (bottom). Breakpoints were identified in each sample containing the amplified oncogene region. Shown are the total number of breakpoints across this region in 1kb binned windows (right). The distribution of the number of breakpoints in each bin closely follows a Poisson distribution (left), suggesting that the breakpoints are mostly randomly distributed across the region. D. The number of genome-wide DNA segments within a sample was compared between Circular, BFB, Heavily-rearranged, Linear, and No-fSCNA detected classes. Circular samples contained statistically significantly more DNA segments than non-circular samples (p-value 0.0046, 7.2e-6, 2.4e-19 and 9.4e-125, respectively; Wilcox Rank Sum Test (two-sided)).

**Fig. 3 |. Gene expression and chromatin…**

**Fig. 3 |. Gene expression and chromatin accessibility of amplicon classes.**

A. Copy number of…

**Fig. 3 |. Gene expression and chromatin accessibility of amplicon classes.**
A. Copy number of oncogene versus its fold-change in Fragments Per Kilobase of transcript per Million mapped reads upper quartile (FPKM-UQ) for all oncogenes with a copy count greater than 4, for each oncogene on each amplicon. The fold-change in FPKM-UQ is computed as the oncogene’s (FPKM-UQ+1) divided by the average of (FPKM-UQ+1) for the same oncogene in all other tumor samples from the same cohort for which the oncogene is not on any amplicon (i.e., not amplified). Linear regression lines, using fold change = m*copy number+b, and their 95% confidence level intervals (in grey) are shown for each class. Tukey’s range test shows oncogenes on circular structures are significantly different to oncogenes on non-circular structures (p-value < 1e-7). WGS: whole-genome sequencing. B. For each of the 36 The Cancer Genome Atlas (TCGA) samples with Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) profiles and AmpliconArchitect results, the copy-number normalized fold-change in ATAC-seq signal in each ATAC-seq peak that overlaps with the amplicon relative to tissue types without amplification within the same peak is shown. The distribution of fold-change for Circular amplicons is statistically significantly higher than Linear and Heavily-rearranged amplicons (Wilcoxon rank sum test (two-sided); p-value < 1e-16). Y-axis is on log(2) scale. Box plots are defined as 25th, 50th and 75th percentiles, respectively. Y-axis is on log(2) scale. NS: not significant. C. Circular structures expressed significantly more gene fusions compared to non-circular amplicons, after size normalization. CN: copy number. D. Representative Circos-plot showing (rings from outside to inside) 1) Amplicon regions identified by AmpliconArchitect, where interconnected breakpoints were indicated with arrows; 2) DNA copy-number, where height and color represent level (darker red means higher copy number amplification); 3) FPKM expression values in green, where height and color represent expression level (darker green means higher expression); 4) ATAC-seq chromatin accessibility in blue, where height and color represent expression level (darker blue means more accessible). CNV: Copy Number Variation.

**Fig. 4 |. Presence of circular amplification…**

**Fig. 4 |. Presence of circular amplification associates with poor outcomes.**

A. Kaplan-Meier five-year survival…

**Fig. 4 |. Presence of circular amplification associates with poor outcomes.**
A. Kaplan-Meier five-year survival curves by amplification category. Patients whose tumors contain at least one Circular amplicon have significantly worse outcome compared to patients whose tumors were classified as non-circular. The p-value comparing survival curves was based on a log-rank test. B. Multivariate Cox-Hazard model, incorporating disease and patient cohorts as parameters showing circular amplification results in significantly higher hazard ratios. The error bars represent 95% confidence intervals of the hazard ratio.

All figures (10)

See this image and copyright information in PMC

Extended Data Fig. 6. Cell cycle and… — **Extended Data Fig. 6. Cell cycle and immune infiltrate gene expression signatures vs amplicon classes**
A. Cell Cycle gene expression signature single sample GSEA (ssGSEA) scores by amplification category. B. Immune infiltrate gene expression signature single sample GSEA (ssGSEA) scores by amplification category.

Fig. 1 |. Frequency of circular amplification… — **Fig. 1 |. Frequency of circular amplification across tumor and non-tumor tissues.**
A. Schematic representation of the four classification categories. All DNA regions with a copy number of 4 or greater than ploidy and comprising at least 10 kb were classified using a hierarchical scheme based on the AmpliconArchitect amplicon reconstruction as well as the types of discordant breakpoint edges in the region. The four categories are defined as follows - 1) Linear amplicon: an amplicon that contains amplified segments with either no discordant edges or with edges suggesting deletions smaller than 1 Mb. 2) Heavily-rearranged amplicon: an amplicon which contains amplified segments connected by discordant breakpoint edges suggesting higher-order rearrangements beyond small deletions - such as inversions, interchromosomal edges or deletions > 1Mbp. 3) Breakage-fusion-bridge (BFB) amplicon: an amplicon having a proportion of foldback reads in excess of 25%, and which may have signatures of heavily rearranged or circular amplification. 4) Circular amplicon: an amplicon which contains one or more genomic segments forming a cyclic path of at least 10 kbp and 4+ copies. B. Left panel: Comparison of whole-genome sequencing derived circular DNA amplicon and Circle-seq derived segments. Right panel: Circular amplicons detected from whole-genome sequencing with AmpliconArchitect were validated with Circle-seq. N: not validated by Circle-Seq. C. Distribution of circular, BFB, Heavily-rearranged, Linear, and no focal somatic copy number amplification detected (No-fSCNA) amplicon categories by tumor and normal tissue, across 3,731 tumor and non-neoplastic sample derived whole-genomes from TCGA and 1,291 whole-genomes from PCAWG.

Fig. 2 |. Oncogene content and structural… — **Fig. 2 |. Oncogene content and structural component of circular amplification.**
A. Genome-wide distribution of amplification peaks by amplicon class. Amplifications were counted per 1Mb bin and are shown as a fraction of the total number of samples per amplicon class. B. Classification of amplification status by gene. Shown are the 24 most frequently amplified oncogenes. C. Breakpoint locations (right) and distribution of breakpoints (left) across all circular samples with amplified *CCND1* (top), *EGFR* (middle), and *MYC* (bottom). Breakpoints were identified in each sample containing the amplified oncogene region. Shown are the total number of breakpoints across this region in 1kb binned windows (right). The distribution of the number of breakpoints in each bin closely follows a Poisson distribution (left), suggesting that the breakpoints are mostly randomly distributed across the region. D. The number of genome-wide DNA segments within a sample was compared between Circular, BFB, Heavily-rearranged, Linear, and No-fSCNA detected classes. Circular samples contained statistically significantly more DNA segments than non-circular samples (p-value 0.0046, 7.2e-6, 2.4e-19 and 9.4e-125, respectively; Wilcox Rank Sum Test (two-sided)).

Fig. 3 |. Gene expression and chromatin… — **Fig. 3 |. Gene expression and chromatin accessibility of amplicon classes.**
A. Copy number of oncogene versus its fold-change in Fragments Per Kilobase of transcript per Million mapped reads upper quartile (FPKM-UQ) for all oncogenes with a copy count greater than 4, for each oncogene on each amplicon. The fold-change in FPKM-UQ is computed as the oncogene’s (FPKM-UQ+1) divided by the average of (FPKM-UQ+1) for the same oncogene in all other tumor samples from the same cohort for which the oncogene is not on any amplicon (i.e., not amplified). Linear regression lines, using fold change = m*copy number+b, and their 95% confidence level intervals (in grey) are shown for each class. Tukey’s range test shows oncogenes on circular structures are significantly different to oncogenes on non-circular structures (p-value < 1e-7). WGS: whole-genome sequencing. B. For each of the 36 The Cancer Genome Atlas (TCGA) samples with Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) profiles and AmpliconArchitect results, the copy-number normalized fold-change in ATAC-seq signal in each ATAC-seq peak that overlaps with the amplicon relative to tissue types without amplification within the same peak is shown. The distribution of fold-change for Circular amplicons is statistically significantly higher than Linear and Heavily-rearranged amplicons (Wilcoxon rank sum test (two-sided); p-value < 1e-16). Y-axis is on log(2) scale. Box plots are defined as 25th, 50th and 75th percentiles, respectively. Y-axis is on log(2) scale. NS: not significant. C. Circular structures expressed significantly more gene fusions compared to non-circular amplicons, after size normalization. CN: copy number. D. Representative Circos-plot showing (rings from outside to inside) 1) Amplicon regions identified by AmpliconArchitect, where interconnected breakpoints were indicated with arrows; 2) DNA copy-number, where height and color represent level (darker red means higher copy number amplification); 3) FPKM expression values in green, where height and color represent expression level (darker green means higher expression); 4) ATAC-seq chromatin accessibility in blue, where height and color represent expression level (darker blue means more accessible). CNV: Copy Number Variation.

Fig. 4 |. Presence of circular amplification… — **Fig. 4 |. Presence of circular amplification associates with poor outcomes.**
A. Kaplan-Meier five-year survival curves by amplification category. Patients whose tumors contain at least one Circular amplicon have significantly worse outcome compared to patients whose tumors were classified as non-circular. The p-value comparing survival curves was based on a log-rank test. B. Multivariate Cox-Hazard model, incorporating disease and patient cohorts as parameters showing circular amplification results in significantly higher hazard ratios. The error bars represent 95% confidence intervals of the hazard ratio.

References

1. deCarvalho AC et al. Discordant inheritance of chromosomal and extrachromosomal DNA elements contributes to dynamic disease evolution in glioblastoma. Nat Genet 50, 708–717 (2018).
1. Turner KM et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature 543, 122–125 (2017).
1. Verhaak RGW, Bafna V & Mischel PS Extrachromosomal oncogene amplification in tumour pathogenesis and evolution. Nat Rev Cancer (2019).
1. Weischenfeldt J et al. Pan-cancer analysis of somatic copy-number alterations implicates IRS4 and IGF2 in enhancer hijacking. Nat Genet 49, 65–74 (2017).
1. Zack TI et al. Pan-cancer patterns of somatic copy number alteration. Nat Genet 45, 1134–40 (2013).
1. Beroukhim R et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).
1. Alt FW, Kellems RE, Bertino JR & Schimke RT Selective multiplication of dihydrofolate reductase genes in methotrexate-resistant variants of cultured murine cells. J Biol Chem 253, 1357–70 (1978).
1. Kohl NE et al. Transposition and amplification of oncogene-related sequences in human neuroblastomas. Cell 35, 359–67 (1983).
1. Nathanson DA et al. Targeted therapy resistance mediated by dynamic regulation of extrachromosomal mutant EGFR DNA. Science 343, 72–6 (2014).
1. Zheng S et al. A survey of intragenic breakpoints in glioblastoma identifies a distinct subset associated with poor survival. Genes Dev 27, 1462–72 (2013).
1. Trask BJ Fluorescence in situ hybridization: applications in cytogenetics and gene mapping. Trends Genet 7, 149–54 (1991).
1. Deshpande V et al. Exploring the landscape of focal amplifications in cancer using AmpliconArchitect. Nat Commun 10, 392 (2019).
1. Xu K et al. Structure and evolution of double minutes in diagnosis and relapse brain tumors. Acta Neuropathol (2018).
1. Koche RP et al. Extrachromosomal circular DNA drives oncogenic genome remodeling in neuroblastoma. Nat Genet 52, 29–34 (2020).
1. Consortium ITP-CA o.W.G. Pan-cancer analysis of whole genomes. Nature 578, 82–93 (2020).
1. Zakov S, Kinsella M & Bafna V An algorithmic approach for breakage-fusion-bridge detection in tumor genomes. Proc Natl Acad Sci U S A 110, 5546–51 (2013).
1. Rajkumar U et al. EcSeg: Semantic Segmentation of Metaphase Images Containing Extrachromosomal DNA. iScience 21, 428–435 (2019).
1. Storlazzi CT et al. Gene amplification as double minutes or homogeneously staining regions in solid tumors: origin and structure. Genome Res 20, 1198–206 (2010).
1. Moller HD, Parsons L, Jorgensen TS, Botstein D & Regenberg B Extrachromosomal circular DNA is common in yeast. Proc Natl Acad Sci U S A 112, E3114–22 (2015).
1. Moller HD et al. Circular DNA elements of chromosomal origin are common in healthy human somatic tissue. Nat Commun 9, 1069 (2018).
1. Kumar P et al. Normal and Cancerous Tissues Release Extrachromosomal Circular DNA (eccDNA) into the Circulation. Mol Cancer Res 15, 1197–1205 (2017).
1. Shibata Y et al. Extrachromosomal microDNAs and chromosomal microdeletions in normal tissues. Science 336, 82–6 (2012).
1. Turner KM et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature (2017).
1. Davoli T & de Lange T The causes and consequences of polyploidy in normal development and cancer. Annu Rev Cell Dev Biol 27, 585–610 (2011).
1. Bielski CM et al. Genome doubling shapes the evolution and prognosis of advanced cancers. Nat Genet 50, 1189–1195 (2018).
1. Cortes-Ciriano I et al. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat Genet 52, 331–341 (2020).
1. Ly P et al. Chromosome segregation errors generate a diverse spectrum of simple and complex genomic rearrangements. Nat Genet 51, 705–715 (2019).
1. Zhang CZ et al. Chromothripsis from DNA damage in micronuclei. Nature 522, 179–84 (2015).
1. Umbreit NT et al. Mechanisms generating cancer genome complexity from a single cell division error. Science 368(2020).
1. Menghi F et al. The Tandem Duplicator Phenotype Is a Prevalent Genome-Wide Cancer Configuration Driven by Distinct Gene Mutations. Cancer Cell 34, 197–210 e5 (2018).
1. Morton AR et al. Functional Enhancers Shape Extrachromosomal Oncogene Amplifications. Cell 179, 1330–1341 e13 (2019).
1. Wu S et al. Circular ecDNA promotes accessible chromatin and high oncogene expression. Nature 575, 699–703 (2019).
1. Corces MR et al. The chromatin accessibility landscape of primary human cancers. Science 362(2018).
1. Helmsauer K et al. Enhancer hijacking determines intra- and extrachromosomal circular MYCN amplicon architecture in neuroblastoma. bioRxiv, 2019.12.20.875807 (2019).
1. Davoli T, Uno H, Wooten EC & Elledge SJ Tumor aneuploidy correlates with markers of immune evasion and with reduced response to immunotherapy. Science 355(2017).
1. Hadi K et al. Novel patterns of complex structural variation revealed across thousands of cancer genome graphs. bioRxiv, 836296 (2019).
1. Priestley P et al. Pan-cancer whole-genome analyses of metastatic solid tumours. Nature 575, 210–216 (2019).
1. Taylor AM et al. Genomic and Functional Approaches to Understanding Cancer Aneuploidy. Cancer Cell 33, 676–689 e3 (2018).
1. Hu X et al. TumorFusions: an integrative resource for cancer-associated transcript fusions. Nucleic Acids Res 46, D1144–D1149 (2018).
1. Yoshihara K et al. The landscape and therapeutic relevance of cancer-associated transcript fusions. Oncogene 34, 4845–54 (2015).
1. Torres-Garcia W et al. PRADA: pipeline for RNA sequencing data analysis. Bioinformatics 30, 2224–6 (2014).
1. Wala JA et al. SvABA: genome-wide detection of structural variants and indels by local assembly. Genome Res 28, 581–591 (2018).
1. Quinlan AR BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics 47, 11 12 1–34 (2014).

Source: PubMed

Extrachromosomal DNA is associated with oncogene amplification and poor outcome across multiple cancers

Abstract

Conflict of interest statement

Figures

References

スポンサーと協力者

医学的状態

薬物療法