Spinal muscular atrophy diagnosis and carrier screening from genome sequencing data

Xiao Chen, Alba Sanchis-Juan, Courtney E French, Andrew J Connell, Isabelle Delon, Zoya Kingsbury, Aditi Chawla, Aaron L Halpern, Ryan J Taft, NIHR BioResource, David R Bentley, Matthew E R Butchbach, F Lucy Raymond, Michael A Eberle, Xiao Chen, Alba Sanchis-Juan, Courtney E French, Andrew J Connell, Isabelle Delon, Zoya Kingsbury, Aditi Chawla, Aaron L Halpern, Ryan J Taft, NIHR BioResource, David R Bentley, Matthew E R Butchbach, F Lucy Raymond, Michael A Eberle

Abstract

Purpose: Spinal muscular atrophy (SMA), caused by loss of the SMN1 gene, is a leading cause of early childhood death. Due to the near identical sequences of SMN1 and SMN2, analysis of this region is challenging. Population-wide SMA screening to quantify the SMN1 copy number (CN) is recommended by the American College of Medical Genetics and Genomics.

Methods: We developed a method that accurately identifies the CN of SMN1 and SMN2 using genome sequencing (GS) data by analyzing read depth and eight informative reference genome differences between SMN1/2.

Results: We characterized SMN1/2 in 12,747 genomes, identified 1568 samples with SMN1 gains or losses and 6615 samples with SMN2 gains or losses, and calculated a pan-ethnic carrier frequency of 2%, consistent with previous studies. Additionally, 99.8% of our SMN1 and 99.7% of SMN2 CN calls agreed with orthogonal methods, with a recall of 100% for SMA and 97.8% for carriers, and a precision of 100% for both SMA and carriers.

Conclusion: This SMN copy-number caller can be used to identify both carrier and affected status of SMA, enabling SMA testing to be offered as a comprehensive test in neonatal care and an accurate carrier screening tool in GS sequencing projects.

Keywords: bioinformatics; carrier screening; copy-number analysis; genome sequencing (GS); spinal muscular atrophy (SMA).

Conflict of interest statement

X.C., Z.K., A.C., A.L.H., R.J.T., D.R.B. and M.A.E. are employees of Illumina Inc. The other authors declare no conflicts of interest.

Figures

Fig. 1. Common copy-number variants (CNVs) affecting…
Fig. 1. Common copy-number variants (CNVs) affecting the SMN1/2 loci.
a Depth profiles across the SMN1/SMN2 regions. Samples with a total SMN1 + SMN2 copy number of 2, 3, 4, and 5 (derived from average read depth in the two genes) are shown as green, blue, black, and orange dots, respectively. Depth from 50 samples are summed up for each CN category. Each dot represents normalized depth values in a 100-bp window. Read counts are calculated in each 100-bp window, summing up reads from both SMN1 and SMN2, and normalized to the depth of wild-type samples (CN = 4). The SMN exons are represented as purple boxes. The two x-axes show coordinates (hg19) in SMN1 (bottom) and SMN2 (upper). b Depth profiles aggregated from 50 samples carrying a deletion of exons 7 and 8 are shown as red dots. Read depths are calculated in the same way as in (a).
Fig. 2
Fig. 2
Scatterplot of total SMN (SMN1, SMN2, and SMN2∆7–8) copy number (CN) (x-axis, called by read depth in exons 1–6) and intact SMN copy number (y-axis, called by read depth in exons 7–8).
Fig. 3. Distribution of SMN1 / SMN2…
Fig. 3. Distribution of SMN1/SMN2/SMN2∆7–8 copy numbers in the population.
a Percentage of samples showing copy number (CN) call agreement with c.840C>T across 16 SMN1–SMN2 base difference sites in African and non-African populations. Coordinates of these 16 sites are given in Table S1 and site 13* is the c.840C>T splice variant site. The black horizontal line denotes 85% concordance. b Histogram of the distribution of SMN1, SMN2, and SMN2∆7–8 copy numbers across five populations in 1kGP and the National Institute for Health Research (NIHR) BioResource cohort. cSMN1 CN vs. total SMN2 CN (intact SMN2 + SMN2∆7–8). d Two trios with an SMA proband detected by the caller and orthogonally confirmed in the Next Generation Children project cohort. CNs of SMN1, SMN2, and SMN2∆7–8 are phased and labeled for each member of the trios.

References

    1. Lunn MR, Wang CH. Spinal muscular atrophy. Lancet. 2008;371:2120–2133. doi: 10.1016/S0140-6736(08)60921-6.
    1. Mercuri E, Bertini E, Iannaccone ST. Childhood spinal muscular atrophy: controversies and challenges. Lancet Neurol. 2012;11:443–452. doi: 10.1016/S1474-4422(12)70061-3.
    1. Prior TW. Perspectives and diagnostic considerations in spinal muscular atrophy. Genet Med. 2010;12:145–152. doi: 10.1097/GIM.0b013e3181c5e713.
    1. Ogino S, Leonard DGB, Rennert H, Ewens WJ, Wilson RB. Genetic risk assessment in carrier testing for spinal muscular atrophy. Am J Med Genet. 2002;110:301–307. doi: 10.1002/ajmg.10425.
    1. Hendrickson BC, Donohoe C, Akmaev VR, et al. Differences in SMN1 allele frequencies among ethnic groups within North America. J Med Genet. 2009;46:641–644. doi: 10.1136/jmg.2009.066969.
    1. Sugarman EA, Nagan N, Zhu H, et al. Pan-ethnic carrier screening and prenatal diagnosis for spinal muscular atrophy: clinical laboratory analysis of >72 400 specimens. Eur J Hum Genet. 2012;20:27–32. doi: 10.1038/ejhg.2011.134.
    1. MacDonald WK, Hamilton D, Kuhle S. SMA carrier testing: a meta-analysis of differences in test performance by ethnic group. Prenat Diagn. 2014;34:1219–1226. doi: 10.1002/pd.4459.
    1. Finkel RS, Chiriboga CA, Vajsar J, et al. Treatment of infantile-onset spinal muscular atrophy with nusinersen: a phase 2, open-label, dose-escalation study. Lancet. 2016;388:3017–3026. doi: 10.1016/S0140-6736(16)31408-8.
    1. Mendell JR, Al-Zaidy S, Shell R, et al. Single-dose gene-replacement therapy for spinal muscular atrophy. N Engl J Med. 2017;377:1713–1722. doi: 10.1056/NEJMoa1706198.
    1. Lefebvre S, Bürglen L, Reboullet S, et al. Identification and characterization of a spinal muscular atrophy-determining gene. Cell. 1995;80:155–165. doi: 10.1016/0092-8674(95)90460-3.
    1. Rochette CF, Gilbert N, Simard LR. SMN gene duplication and the emergence of the SMN2 gene occurred in distinct hominids: SMN2 is unique to Homo sapiens. Hum Genet. 2001;108:255–266. doi: 10.1007/s004390100473.
    1. Stabley DL, Harris AW, Holbrook J, et al. SMN1 and SMN2 copy numbers in cell lines derived from patients with spinal muscular atrophy as measured by array digital PCR. Mol Genet Genomic Med. 2015;3:248–257. doi: 10.1002/mgg3.141.
    1. Lorson CL, Hahnen E, Androphy EJ, Wirth B. A single nucleotide in the SMN gene regulates splicing and is responsible for spinal muscular atrophy. Proc Natl Acad Sci USA. 1999;96:6307–6311. doi: 10.1073/pnas.96.11.6307.
    1. Wirth B. An update of the mutation spectrum of the survival motor neuron gene (SMN1) in autosomal recessive spinal muscular atrophy (SMA) Hum Mutat. 2000;15:228–237. doi: 10.1002/(SICI)1098-1004(200003)15:3<228::AID-HUMU3>;2-9.
    1. Burghes AHM, Beattie CE. Spinal muscular atrophy: why do low levels of SMN make motor neurons sick? Nat Rev Neurosci. 2009;10:597–609. doi: 10.1038/nrn2670.
    1. Butchbach MER. Copy number variations in the survival motor neuron genes: implications for spinal Muscular atrophy and other neurodegenerative diseases. Front Mol Biosci. 2016;3:7. doi: 10.3389/fmolb.2016.00007.
    1. Prior TW. Carrier screening for spinal muscular atrophy. Genet Med. 2008;10:840–842. doi: 10.1097/GIM.0b013e318188d069.
    1. Kraszewski JN, Kay DM, Stevens CF, et al. Pilot study of population-based newborn screening for spinal muscular atrophy in New York state. Genet Med. 2018;20:608–613. doi: 10.1038/gim.2017.152.
    1. Feldkötter M, Schwarzer V, Wirth R, Wienker TF, Wirth B. Quantitative analyses of SMN1 and SMN2 based on real-time lightCycler PCR: fast and highly reliable carrier testing and prediction of severity of spinal muscular atrophy. Am J Hum Genet. 2002;70:358–368. doi: 10.1086/338627.
    1. Arkblad EL, Darin N, Berg K, et al. Multiplex ligation-dependent probe amplification improves diagnostics in spinal muscular atrophy. Neuromuscul Disord. 2006;16:830–838. doi: 10.1016/j.nmd.2006.08.011.
    1. Scarciolla O, Stuppia L, De Angelis MV, et al. Spinal muscular atrophy genotyping by gene dosage using multiple ligation-dependent probe amplification. Neurogenetics. 2006;7:269–276. doi: 10.1007/s10048-006-0051-3.
    1. Zhong Q, Bhattacharya S, Kotsopoulos S, et al. Multiplex digital PCR: breaking the one target per color barrier of quantitative PCR. Lab Chip. 2011;11:2167–2174. doi: 10.1039/c1lc20126c.
    1. Ashley EA. The Precision Medicine Initiative: a new national effort. JAMA. 2015;313:2119–2120. doi: 10.1001/jama.2015.3595.
    1. The Genome of the Netherlands Consortium. Francioli LC, Menelaou A, et al. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014;46:818–825. doi: 10.1038/ng.3021.
    1. Turnbull C, Scott RH, Thomas E, et al. The 100 000 Genomes Project: bringing whole genome sequencing to the NHS. BMJ. 2018;361:k1687. doi: 10.1136/bmj.k1687.
    1. Larson JL, Silver AJ, Chan D, Borroto C, Spurrier B, Silver LM. Validation of a high resolution NGS method for detecting spinal muscular atrophy carriers among phase 3 participants in the 1000 Genomes Project. BMC Med Genet. 2015;16:100.. doi: 10.1186/s12881-015-0246-2.
    1. Feng Y, Ge X, Meng L, et al. The next generation of population-based spinal muscular atrophy carrier screening: comprehensive pan-ethnic SMN1 copy-number and sequence variant analysis by massively parallel sequencing. Genet Med. 2017;19:936–944. doi: 10.1038/gim.2016.215.
    1. Luo M, Liu L, Peter I, et al. An Ashkenazi Jewish SMN1 haplotype specific to duplication alleles improves pan-ethnic carrier screening for spinal muscular atrophy. Genet Med. 2014;16:149–156. doi: 10.1038/gim.2013.84.
    1. The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393.
    1. Stabley DL, Holbrook J, Harris AW, et al. Establishing a reference dataset for the authentication of spinal muscular atrophy cell lines using STR profiling and digital PCR. Neuromuscul Disord. 2017;27:439–446. doi: 10.1016/j.nmd.2017.02.002.
    1. French CE, Delon I, Dolling H, et al. Whole genome sequencing reveals that genetic conditions are frequent in intensively ill children. Intensive Care Med. 2019;45:627–636. doi: 10.1007/s00134-019-05552-x.
    1. Vijzelaar R, Snetselaar R, Clausen M, et al. The frequency of SMN gene variants lacking exon 7 and 8 is highly population dependent. PLoS One. 2019;14:e0220211. doi: 10.1371/journal.pone.0220211.
    1. Ruhno C, McGovern VL, Avenarius MR, et al. Complete sequencing of the SMN2 gene in SMA patients detects SMN gene deletion junctions and variants in SMN2 that modify the SMA phenotype. Hum Genet. 2019;138:241–256. doi: 10.1007/s00439-019-01983-0.
    1. Ogino S, Gao S, Leonard DGB, Paessler M, Wilson RB. Inverse correlation between SMN1 and SMN2 copy numbers: evidence for gene conversion from SMN2 to SMN1. Eur J Hum Genet. 2003;11:275. doi: 10.1038/sj.ejhg.5200957.
    1. Chen T-H, Tzeng C-C, Wang C-C, et al. Identification of bidirectional gene conversion between SMN1 and SMN2 by simultaneous analysis of SMN dosage and hybrid genes in a Chinese population. J Neurol Sci. 2011;308:83–87. doi: 10.1016/j.jns.2011.06.002.
    1. Mercuri E, Finkel RS, Muntoni F, et al. Diagnosis and management of spinal muscular atrophy: part 1: recommendations for diagnosis, rehabilitation, orthopedic and nutritional care. Neuromuscul Disord. 2018;28:103–115. doi: 10.1016/j.nmd.2017.11.005.
    1. Dolzhenko E, van Vugt JJFA, Shaw RJ, et al. Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res. 2017;27:1895–1903. doi: 10.1101/gr.225672.117.
    1. Lee S, Wheeler MM, Patterson K, et al. Stargazer: a software tool for calling star alleles from next-generation sequencing data using CYP2D6 as a model. Genet Med. 2019;21:361. doi: 10.1038/s41436-018-0054-0.

Source: PubMed

Подписаться