Inverted genomic segments and complex triplication rearrangements are mediated by inverted repeats in the human genome

Claudia M B Carvalho, Melissa B Ramocki, Davut Pehlivan, Luis M Franco, Claudia Gonzaga-Jauregui, Ping Fang, Alanna McCall, Eniko Karman Pivnick, Stacy Hines-Dowell, Laurie H Seaver, Linda Friehling, Sansan Lee, Rosemarie Smith, Daniela Del Gaudio, Marjorie Withers, Pengfei Liu, Sau Wai Cheung, John W Belmont, Huda Y Zoghbi, P J Hastings, James R Lupski, Claudia M B Carvalho, Melissa B Ramocki, Davut Pehlivan, Luis M Franco, Claudia Gonzaga-Jauregui, Ping Fang, Alanna McCall, Eniko Karman Pivnick, Stacy Hines-Dowell, Laurie H Seaver, Linda Friehling, Sansan Lee, Rosemarie Smith, Daniela Del Gaudio, Marjorie Withers, Pengfei Liu, Sau Wai Cheung, John W Belmont, Huda Y Zoghbi, P J Hastings, James R Lupski

Abstract

We identified complex genomic rearrangements consisting of intermixed duplications and triplications of genomic segments at the MECP2 and PLP1 loci. These complex rearrangements were characterized by a triplicated segment embedded within a duplication in 11 unrelated subjects. Notably, only two breakpoint junctions were generated during each rearrangement formation. All the complex rearrangement products share a common genomic organization, duplication-inverted triplication-duplication (DUP-TRP/INV-DUP), in which the triplicated segment is inverted and located between directly oriented duplicated genomic segments. We provide evidence that the DUP-TRP/INV-DUP structures are mediated by inverted repeats that can be separated by >300 kb, a genomic architecture that apparently leads to susceptibility to such complex rearrangements. A similar inverted repeat-mediated mechanism may underlie structural variation in many other regions of the human genome. We propose a mechanism that involves both homology-driven events, via inverted repeats, and microhomologous or nonhomologous events.

Figures

Figure 1. General genomic structure of the…
Figure 1. General genomic structure of the complex rearrangements triplications embedded in duplications
(a) Copy number dosage alteration inferred from aCGH. A typical aCGH experiment is shown for a complex DUP-TRP-DUP rearrangement. Transitions of copy number dosage alterations are demonstrated by black vertical dotted arrows; the size of genomic segments defined by those boundaries vary in each individual complex rearrangement and are denoted a, b, c. The horizontal line below depicts the array data. Duplications are represented in red and triplications in blue; yellow arrows represent inverted repeats. (b) Representative figure of the genomic structure as determined by further analysis of copy number breakpoint junctions (jct1 and jct2) in five independent rearrangements using multiple molecular experimental approaches. The genomic segments involved are denoted a, b, c whereas the respective copy number gains are denoted as a’, b’, c’. These findings were corroborated by three additional independent rearrangements for which we obtained information for either jct1 or jct2 from two genomic loci. Change in orientation of the genomic segment (represented by black arrows) occurred at each of the junctions formed. DUPp: proximal transition/breakpoint of duplications; TRPp: proximal transition/breakpoint of triplication; DUPd: distal transition/breakpoint of duplications; TRPd: distal transition/breakpoint of triplications.
Figure 2. Individuals carrying complex triplications of…
Figure 2. Individuals carrying complex triplications of Xq28
(a) Genomic region harboring the alterations involving MECP2. Duplications are represented in red and triplications in blue. Arrows on top of the BAB2769 and BAB2772 rearrangements indicate the position of the transitions to gain according to array-based Comparative Genomic Hybridization (aCGH); in 6 out of 8 cases (BAB2796 and BAB2980 are brothers), the distal transition/breakpoint of both duplications (DUPd) and triplications (TRPd) (indicated by arrows) cluster within a pair of LCRs termed K1 and K2 contrasting with the scattered nature of the proximal breakpoints of both duplications (DUPp) and triplications (TRPp) in the same group of patients. Vertical lines embedded within the rearrangement bars represent low copy repeat regions (LCRs) for which the copy numbers were not inferred due to poor probe coverage. (b) aCGH (Agilent Technologies, Santa Clara, CA) result for family HOU1217. Carrier mother BAB3115 with a de novo complex triplication that was transmitted to her son (BAB3114).
Figure 3. Southern blot analysis of the…
Figure 3. Southern blot analysis of the region flanked by LCRs K1 and K2 at the Xq28 chromosome
LCRs K1 and K2 are approximately 11.3 kb in length, and are located ~38 kb apart in inverted orientation. Their 99% nucleotide sequence identity is likely maintained by frequent gene conversion. These LCRs flank two genes FLNA and EMD. (a) Yellow arrows indicate an inversion: cent-FLNA/EMD-tel (H1) and the alternative genomic orientation cent-EMD/FLNA-tel (H2). An 18.2 kb band is expected to be produced in either inversion haplotype background (H1dup or H2dup) upon duplication and inversion; we hypothesize that the 18.2 kb band includes the breakpoint junction 1 (jct1). To test the haplotype of our cohort and to map the duplication breakpoints, we performed a Southern blot assay as shown here. Genomic DNA was digested with BglII; EMD was targeted using a PCR-based probe. The reference genome H1 produces a 30.7 kb band whereas the inversion haplotype (H2) yields an 18.2 kb band. (b) Southern blot results for patients carrying triplications embedded within duplications: BAB2769, BAB2772, BAB2796/BAB2980, BAB2797, BAB2801, BAB2805 (left) and family HOU1217 (right). NA10851: male carrying the reference haplotype (H1); NA15510 heterozygous female carrying both reference and inversion haplotypes (H1 and H2). BAB2771: patient carrying MECP2 duplication not involving LCR K1 and LCR K2 (H1).
Figure 4. Rearrangement structure for patients BAB2769,…
Figure 4. Rearrangement structure for patients BAB2769, BAB2772, BAB2796/BAB2980, BAB2797 and BAB2805 based on aCGH, Southern blotting and breakpoint sequencing
(a) Genomic region harboring duplications and triplications spanning chromosome Xq28 according to aCGH: duplications are represented in red, triplications in blue. Arrows on top of the rearrangements indicate the position of the breakpoints; inverted repeats involved in the rearrangement are represented as yellow arrows. Letters a, b and c represent the segments with copy-number gain. (b) Individual genomic structure of the region involved in the rearrangement as inferred by analysis of breakpoint junction 1 (jct1) and breakpoint junction 2 (jct2) for each patient. Jct1* represents those junctions analyzed by Southern blotting (please refer to Fig. 3); all others were sequenced. Genomic positions of each of the junctions are shown. Breakpoint junction sequences are color-coded to highlight their segment of origin in the reference genome (duplications in red and triplications in blue). The triplicated segment (b’) is inserted amid a normal (a, b, c) and a duplicated copy (a’, c’) in inverted orientation as supported by jct1 and jct2 analysis. This structure was further confirmed by FISH for patient BAB2805 (Supplementary Fig. 6). Microhomologies observed at the junctions are represented by underlined black letters; insertions or mismatches at the junctions are represented in green; deletions are represented by dashes; mismatches between the reference sequences and patient sequences are marked with a green asterisk underneath.
Figure 5. Proposed model for generation of…
Figure 5. Proposed model for generation of the common DUP-TRP/INV-DUP rearrangement product
(a) The rearrangement may have occurred during spermatogenesis in the ancestral male on his X chromosome, likely during S or G2 phase. (b) During replication of the sister chromatid, the replication fork may collapse and induce break-induced replication (BIR) (c) BIR uses homologous recombination to re-establish a new fork using ectopic homology provided by inverted repeats forming jct1. (d) This event initiates replication that forms a length of chromatid back in the opposite direction from that in which the fork had been traveling before the collapse, forming a large inverted duplication (e). If the reversed replication again collapses, or if there is a double-strand break in the chromatid carrying the inverted duplication, then there is a new DNA end (f). In our patient cohort, this new end rejoining (jct2 formation) occurred by either non-homologous end joining (NHEJ) requiring a double-strand break (DSB) on the original strand followed by ligation of that segment to the end of the rearranged newly replicated strand, or by a replication mechanism such as microhomology-mediated break-induced replication (MMBIR) (g) that requires a new strand annealing and extension to the end of the replicon or the chromosome. (h) Representative structure obtained after two steps of homologous and nonhomologous mechanisms. Duplicated and triplicated segments are represented in red and blue colors, respectively.

References

    1. Morrow DM, Connelly C, Hieter P. “Break copy” duplication: a model for chromosome fragment formation in Saccharomyces cerevisiae. Genetics. 1997;147:371–82.
    1. Smith CE, Llorente B, Symington LS. Template switching during break-induced replication. Nature. 2007;447:102–5.
    1. McEachern MJ, Haber JE. Break-induced replication and recombinational telomere elongation in yeast. Annu Rev Biochem. 2006;75:111–35.
    1. Hastings PJ, Ira G, Lupski JR. A microhomology-mediated break-induced replication model for the origin of human copy number variation. PLoS Genet. 2009;5:e1000327.
    1. Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet. 2009;10:551–64.
    1. Lee JA, Carvalho CM, Lupski JR. A DNA replication mechanism for generating nonrecurrent rearrangements associated with genomic disorders. Cell. 2007;131:1235–47.
    1. Slack A, Thornton PC, Magner DB, Rosenberg SM, Hastings PJ. On the mechanism of gene amplification induced under stress in Escherichia coli. PLoS Genet. 2006;2:e48.
    1. del Gaudio D, et al. Increased MECP2 gene copy number as the result of genomic duplication in neurodevelopmentally delayed males. Genet. Med. 2006;8:784–92.
    1. Carvalho CM, et al. Complex rearrangements in patients with duplications of MECP2 can occur by fork stalling and template switching. Hum. Mol. Genet. 2009;18:2188–203.
    1. Small K, Iber J, Warren ST. Emerin deletion reveals a common X- chromosome inversion mediated by inverted repeats. Nat. Genet. 1997;16:96–9.
    1. Deem A, et al. Break-induced replication is highly inaccurate. PLoS Biol. 2011;9:e1000594.
    1. Lee JA, et al. Spastic paraplegia type 2 associated with axonal neuropathy and apparent PLP1 position effect. Ann Neurol. 2006;59:398–403.
    1. Ramocki MB, Tavyev YJ, Peters SU. The MECP2 duplication syndrome. Am. J. Med. Genet. A. 2010;152A:1079–88.
    1. Bailey JA, et al. Recent segmental duplications in the human genome. Science. 2002;297:1003–7.
    1. Hu XY, Ray PN, Murphy EG, Thompson MW, Worton RG. Duplicational mutation at the Duchenne muscular dystrophy locus: its frequency, distribution, origin, and phenotypegenotype correlation. Am. J. Hum. Genet. 1990;46:682–95.
    1. Mimault C, et al. Proteolipoprotein gene analysis in 82 patients with sporadic Pelizaeus-Merzbacher Disease: duplications, the major cause of the disease, originate more frequently in male germ cells, but point mutations do not. The Clinical European Network on Brain Dysmyelinating Disease. Am. J. Hum. Genet. 1999;65:360–9.
    1. Whibley AC, et al. Fine-scale survey of X chromosome copy number variants and indels underlying intellectual disability. Am. J. Hum. Genet. 2010;87:173–88.
    1. Bartsch O, et al. Four unrelated patients with Lubs X-linked mental retardation syndrome and different Xq28 duplications. Am. J. Med. Genet. A. 2010;152A:305–12.
    1. Lieber MR. The mechanism of human nonhomologous DNA end joining. J Biol Chem. 2008;283:1–5.
    1. Sheen CR, et al. Double complex mutations involving F8 and FUNDC2 caused by distinct break-induced replication. Hum. Mutat. 2007;28:1198–206.
    1. Zhang F, Carvalho CM, Lupski JR. Complex human chromosomal and genomic rearrangements. Trends Genet. 2009;25:298–307.
    1. Zhang F, et al. The DNA replication FoSTeS/MMBIR mechanism can generate genomic, genic and exonic complex rearrangements in humans. Nat. Genet. 2009;41:849–53.
    1. Yatsenko SA, et al. Molecular mechanisms for subtelomeric rearrangements associated with the 9q34.3 microdeletion syndrome. Hum. Mol. Genet. 2009;18:1924–36.
    1. Vialard F, et al. Mechanism of intrachromosomal triplications 15q11-q13: a new clinical report. Am. J. Med. Genet. A. 2003;118A:229–34.
    1. Wang J, et al. Intrachromosomal triplication of 2q11.2-q21 in a severely malformed infant: case report and review of triplications and their possible mechanism. Am. J. Med. Genet. 1999;82:312–7.
    1. Reddy KS, Logan JJ. Intrachromosomal triplications: molecular cytogenetic and clinical studies. Clin. Genet. 2000;58:134–41.
    1. Lebofsky R, Bensimon A. DNA replication origin plasticity and perturbed fork progression in human inverted repeats. Mol Cell Biol. 2005;25:6789–97.
    1. Voineagu I, Narayanan V, Lobachev KS, Mirkin SM. Replication stalling at unstable inverted repeats: interplay between DNA hairpins and fork stabilizing proteins. Proc Natl Acad Sci U S A. 2008;105:9936–41.
    1. Paek AL, et al. Fusion of nearby inverted repeats by a replication-based mechanism leads to formation of dicentric and acentric chromosomes that cause genome instability in budding yeast. Genes Dev. 2009;23:2861–75.
    1. Mizuno K, Lambert S, Baldacci G, Murray JM, Carr AM. Nearby inverted repeats fuse to generate acentric and dicentric palindromic chromosomes by a replication template exchange mechanism. Genes Dev. 2009;23:2876–86.
    1. Kitamura E, Blow JJ, Tanaka TU. Live-cell imaging reveals replication of individual replicons in eukaryotic replication factories. Cell. 2006;125:1297–308.
    1. Mirkin EV, Mirkin SM. Mechanisms of transcription-replication collisions in bacteria. Mol Cell Biol. 2005;25:888–95.
    1. Merrikh H, Machon C, Grainger WH, Grossman AD, Soultanas P. Co-directional replication-transcription conflicts lead to replication restart. Nature. 2011;470:554–7.
    1. Caceres M, Sullivan RT, Thomas JW. A recurrent inversion on the eutherian X chromosome. Proc Natl Acad Sci U S A. 2007;104:18571–6.
    1. Lee JA, Cheung SW, Ward PA, Inoue K, Lupski JR. Prenatal diagnosis of PLP1 copy number by array comparative genomic hybridization. Prenat Diagn. 2005;25:1188–91.
    1. Singleton AB, et al. alpha-Synuclein locus triplication causes Parkinson's disease. Science. 2003;302:841.
    1. Beunders G, et al. A triplication of the Williams-Beuren syndrome region in a patient with mental retardation, a severe expressive language delay, behavioural problems and dysmorphisms. J. Med. Genet. 2010;47:271–5.
    1. Lee JA, et al. Role of genomic architecture in PLP1 duplication causing Pelizaeus-Merzbacher disease. Hum. Mol. Genet. 2006;15:2250–65.
    1. Allen RC, Zoghbi HY, Moseley AB, Rosenblatt HM, Belmont JW. Methylation of HpaII and HhaI sites near the polymorphic CAG repeat in the human androgen-receptor gene correlates with X chromosome inactivation. Am. J. Hum. Genet. 1992;51:1229–39.
    1. Ramocki MB, et al. Autism and other neuropsychiatric symptoms are prevalent in individuals with MECP2 duplication syndrome. Ann Neurol. 2009;66:771–82.
    1. Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–5.

Source: PubMed

3
구독하다