Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights

Alexander Gusev, Nicholas Mancuso, Hyejung Won, Maria Kousi, Hilary K Finucane, Yakir Reshef, Lingyun Song, Alexias Safi, Schizophrenia Working Group of the Psychiatric Genomics Consortium, Steven McCarroll, Benjamin M Neale, Roel A Ophoff, Michael C O'Donovan, Gregory E Crawford, Daniel H Geschwind, Nicholas Katsanis, Patrick F Sullivan, Bogdan Pasaniuc, Alkes L Price, Alexander Gusev, Nicholas Mancuso, Hyejung Won, Maria Kousi, Hilary K Finucane, Yakir Reshef, Lingyun Song, Alexias Safi, Schizophrenia Working Group of the Psychiatric Genomics Consortium, Steven McCarroll, Benjamin M Neale, Roel A Ophoff, Michael C O'Donovan, Gregory E Crawford, Daniel H Geschwind, Nicholas Katsanis, Patrick F Sullivan, Bogdan Pasaniuc, Alkes L Price

Abstract

Genome-wide association studies (GWAS) have identified over 100 risk loci for schizophrenia, but the causal mechanisms remain largely unknown. We performed a transcriptome-wide association study (TWAS) integrating a schizophrenia GWAS of 79,845 individuals from the Psychiatric Genomics Consortium with expression data from brain, blood, and adipose tissues across 3,693 primarily control individuals. We identified 157 TWAS-significant genes, of which 35 did not overlap a known GWAS locus. Of these 157 genes, 42 were associated with specific chromatin features measured in independent samples, thus highlighting potential regulatory targets for follow-up. Suppression of one identified susceptibility gene, mapk3, in zebrafish showed a significant effect on neurodevelopmental phenotypes. Expression and splicing from the brain captured most of the TWAS effect across all genes. This large-scale connection of associations to target genes, tissues, and regulatory features is an essential step in moving toward a mechanistic understanding of GWAS.

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1. Schematic of TWAS approach
Figure 1. Schematic of TWAS approach
Illustration of the TWAS approach: genetic predictor of gene expression (Eg) is learned in a reference panel (top); integrated with SCZ GWAS association statistics to infer SCZ-Eg association (middle); further integrated with individual-level chromatin phenotypes to infer genes with SCZ and chromatin-Eg associations (bottom). See Supplementary Fig. 1 for detailed analysis flowchart.
Figure 2. SCZ TWAS associations and polygenic…
Figure 2. SCZ TWAS associations and polygenic effects
(top) Manhattan plot of all TWAS associations. Each point represents a single gene tested, with physical position plotted on x-axis and Z-score of association between gene and SCZ plotted on y-axis. Transcriptome-wide significant associations are highlighted as red points, with jointly significant independent associations (see Methods) labeled with gene names and color-coded by expression reference (red CMC; blue METSIM, purple YFS, green NTR, black ALL). (bottom) Polygenic TWAS effects across reference tissues. Out of sample SCZ prediction R2 for gene-based polygenic risk scores (GE-PRS) as a function of significance cutoff. Significant correlations (after Bonferroni correction for number of thresholds tested) are indicated with a (*) and the most significant P-value reported. Right-most panel shows prediction from all tissues jointly (black) and from CMC/brain genes + splicing events jointly (red). R2 computed after subtracting ancestry principal components and converting to liability scale with population prevalence of 1%.
Figure 3. Chromatin TWAS associations compared to…
Figure 3. Chromatin TWAS associations compared to top eSNP/cQTL associations
Number of unique genes significantly associated with a chromatin peak after Bonferroni correction for a given distance from the gene (x-axis): (left) using top eSNP in chromatin cohort; (right) using chromatin TWAS from all reference panels. Results from CEU (YRI) populations shown in top (bottom) panels.
Figure 4. Chromatin and SCZ TWAS association…
Figure 4. Chromatin and SCZ TWAS association at PPP2R3C
Example association of PPP2R3C gene expression and SCZ and four nearby chromatin peaks. (A) locus schematic with all nearby genes and chromatin peaks; TWAS associated features highlighted in blue and green. (B–G left) Manhattan plots of marginal association statistics before and after conditioning on the TWAS predicted expression (colored/dark dots, respectively). Dashed line shows local significance threshold after Bonferroni correction for number of SNPs. (B–G right) Relationship between marginal GWAS/QTL association (y axis) and the correlation (x-axis) between TWAS predicted expression (GEpred estimated in the 1000 Genomes reference) and marginal GWAS/QTL association. The color of each point reflects the eQTL effect size of the expression used for GEpred and size of each point reflects absolute significance of the eQTL. (B) SCZ GWAS association; (C) PPP2R3C expression phenotype used for TWAS prediction and associated with SCZ/chromatin; (D) 1st TWAS associated H3k27ac peak in CEU; (E) 2nd TWAS associated H3k27ac peak in CEU; (F) 1st TWAS associated H3k4me1 peak in CEU; (G) 2nd TWAS associated H3k4me1 peak in CEU. See Supplementary Note, Supplementary Fig. 32, 33, 34 for additional examples and simulations.
Figure 5. Chromatin and SCZ TWAS association…
Figure 5. Chromatin and SCZ TWAS association at KLC1
Example association of KLC1 splice event to SCZ with evidence of chromatin interaction in Hi-C from developing brain. (A) locus schematic with all nearby genes and chromatin peaks; TWAS associated features highlighted in blue and green. Hi-C GZ/CP rows show significance of Hi-C chromatin interaction between the 10kb block containing the associated chromatin peaks (gray, with neighboring white blocks not tested) and every other 10kb block in the region; 10kb being the highest resolution for this Hi-C data. Dark red shading indicates more significant and interactions significant at 0.01 FDR are labeled with stars. The most significant interaction in the locus overlaps the KLC1 promoter. The interactions are shown for fetal brain data from CP (cortical and subcortical plate) and GZ (germinal zone) and corresponding topological domains (TADs) are outlined with solid black lines. (B–F left) Manhattan plots of marginal association statistics before and after conditioning on the TWAS predicted expression (colored/dark dots, respectively). Dashed line shows local significance threshold after Bonferroni correction for number of SNPs. (B–F right) Relationship between marginal GWAS/QTL association (y axis) and the correlation (x-axis) between TWAS predicted expression (GEpred estimated in the 1000 Genomes reference) and marginal GWAS/QTL association. The color of each point reflects the eQTL effect size of the expression used for GEpred and size of each point reflects absolute significance of the eQTL.(B) SCZ GWAS association; (C) KLC1 total expression, both panels show independence from the TWAS predicted expression; (D) KLC1 splicing event phenotype used for TWAS prediction and associated with SCZ/chromatin; (E) TWAS associated H3k4me1 chromatin peak in YRI; (F) TWAS associated H3k4me3 chromatin peak in YRI. See Supplementary Note, Supplementary Fig. 32, 33, 34 for additional examples and simulations.
Figure 6. Suppression of endogenous mapk3 rescues…
Figure 6. Suppression of endogenous mapk3 rescues the microcephaly and neuronal proliferation phenotypes induced by overexpression of wild-type KCTD13
Dorsal view of 4 days post fertilization (dpf) control larvae (A) and embryos injected with either morpholino (MO) against endogenous mapk3 (B), human capped wild-type (WT) KCTD13 mRNA (C) or combinatorial administration of mapk3 MO and WT human KCTD13 mRNA (D). Quantification of the headsize phenotype across the four conditions (E). Dorsal view of 3dpf embryos stained with an antibody against phospho-histone 3 (PH3), a marker of neuronal proliferation of control larvae (F) or embryos injected with either MO against mapk3 (G), human capped wild-type (WT) KCTD13 mRNA (H) or combinatorial administration (I). Graph showing quantification of the proliferating neuronal count across the four conditions (J). Student’s t-test was used to determine statistical significance.

References

    1. Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–427.
    1. Price AL, Spencer CC, Donnelly P. Proc R Soc B. Vol. 282. The Royal Society; 2015. Progress and promise in understanding the genetic basis of common diseases; p. 20151684.
    1. Soldner F, Stelzer Y, Shivalila CS, Abraham BJ, Latourelle JC, Barrasa MI, Goldmann J, Myers RH, Young RA, Jaenisch R. Parkinson-associated risk variant in distal enhancer of α-synuclein modulates target gene expression. Nature. 2016;533:95–99.
    1. Sekar A, Bialas AR, de Rivera H, Davis A, Hammond TR, Kamitaki N, Tooley K, Presumey J, Baum M, Van Doren V, et al. Schizophrenia risk from complex variation of complement component 4. Nature. 2016;530:177–183.
    1. Claussnitzer M, Dankel SN, Kim K-H, Quon G, Meuleman W, Haugen C, Glunk V, Sousa IS, Beaudry JL, Puviindran V, et al. Fto obesity variant circuitry and adipocyte browning in humans. New England Journal of Medicine. 2015;373:895–907.
    1. Grubert F, Zaugg J, Kasowski M, Ursu O, Spacek D, Martin A, Greenside P, Srivas R, Phanstiel D, Pekowska A, et al. Genetic control of chromatin states in humans involves local and distal chromosomal interactions. Cell. 2015;162:1051–1065.
    1. Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J. Systematic localization of common disease-associated variation in regulatory dna. Science. 2012;337:1190–1195.
    1. Trynka G, Sandor C, Han B, Xu H, Stranger BE, Liu XS, Raychaudhuri S. Chromatin marks identify critical cell types for fine mapping complex trait variants. Nature Genetics. 2013;45:124–130.
    1. Pickrell JK. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. The American Journal of Human Genetics. 2014;94:559–573.
    1. Gusev A, Lee SH, Trynka G, Finucane H, Vilhjálmsson BJ, Xu H, Zang C, Ripke S, Bulik-Sullivan B, Stahl E, et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. The American Journal of Human Genetics. 2014;95:535–552.
    1. Kichaev G, Yang WY, Lindstrom S, Hormozdiari F, Eskin E, Price AL, Kraft P, Pasaniuc B. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genetics. 2014;10:e1004722.
    1. Won HH, Natarajan P, Dobbyn A, Jordan DM, Roussos P, Lage K, Raychaudhuri S, Stahl E, Do R. Disproportionate contributions of select genomic compartments and cell types to genetic risk for coronary artery disease. PLoS Genetics. 2015;11:e1005622.
    1. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh PR, Anttila V, Xu H, Zang C, Farh K, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature Genetics. 2015;47:1228–1235.
    1. Degner JF, Pai AA, Pique-Regi R, Veyrieras JB, Gaffney DJ, Pickrell JK, De Leon S, Michelini K, Lewellen N, Crawford GE, et al. Dnase [thinsp] i sensitivity qtls are a major determinant of human expression variation. Nature. 2012;482:390–394.
    1. McVicker G, van de Geijn B, Degner JF, Cain CE, Banovich NE, Raj A, Lewellen N, Myrthil M, Gilad Y, Pritchard JK. Identification of genetic variants that affect histone modifications in human cells. Science. 2013;342:747–749.
    1. Kasowski M, Kyriazopoulou-Panagiotopoulou S, Grubert F, Zaugg JB, Kundaje A, Liu Y, Boyle AP, Zhang QC, Zakharia F, Spacek DV, et al. Extensive variation in chromatin states across humans. Science. 2013;342:750–752.
    1. Kilpinen H, Waszak SM, Gschwind AR, Raghav SK, Witwicki RM, Orioli A, Migliavacca E, Wiederkehr M, Gutierrez-Arcelus M, Panousis NI, et al. Coordinated effects of sequence variation on dna binding, chromatin structure, and transcription. Science. 2013;342:744–747.
    1. Waszak S, Delaneau O, Gschwind A, Kilpinen H, Raghav S, Witwicki R, Orioli A, Wiederkehr M, Panousis N, Yurovsky A, et al. Population variation and genetic control of modular chromatin architecture in humans. Cell. 2015;162:1039–1050.
    1. Taudt A, Colome-Tatche M, Johannes F. Genetic sources of population epigenomic variation. Nat Rev Genet. 2016 advance online publication, –.
    1. Farh KKH, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, Beik S, Shoresh N, Whitton H, Ryan RJ, Shishkin AA, et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature. 2015;518:337–343.
    1. Moyerbrailean GA, Kalita CA, Harvey CT, Wen X, Luca F, Pique-Regi R. Which genetics variants in dnase-seq footprints are more likely to alter binding? PLoS Genetics. 2016;12:e1005875.
    1. Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC, Nicolae DL, Cox NJ, et al. A gene-based association method for mapping traits using reference transcriptome data. Nature Genetics. 2015;47:1091–1098.
    1. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx B, Jansen R, de Geus E, Boomsma D, Wright F, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics. 2016;48:245–252.
    1. Zhu Z, Zhang F, Hu H, Bakshi A, Robinson M, Powell J, Montgomery G, Goddard M, Wray N, Visscher P, et al. Integration of summary data from gwas and eqtl studies predicts complex trait gene targets. Nature Genetics 2016
    1. Fromer M, Roussos P, Sieberts SK, Johnson JS, Kavanagh DH, Perumal TM, Ruderfer DM, Oh EC, Topol A, Shah HR, et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nature Neuroscience. 2016;19:1442. –.
    1. Wright F, Sullivan P, Brooks A, Zou F, Sun W, Xia K, Madar V, Jansen R, Chung W, Zhou YH, et al. Heritability and genomics of gene expression in peripheral blood. Nature Genetics. 2014;46:430–437.
    1. Li Y, van+de+Geijn B, Raj A, Knowles D, Petti A, Golan D, Gilad Y, Pritchard J. Rna splicing is a primary link between genetic variation and disease. Science. 2016;352:600–604.
    1. Zhou X, Carbonetto P, Stephens M. Polygenic modeling with bayesian sparse linear mixed models. PLoS Genetics. 2013;9:e1003264.
    1. Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, Cox NJ. Trait-associated snps are more likely to be eqtls: annotation to enhance discovery from gwas. PLoS Genetics. 2010;6:e1000888.
    1. Yang J, Ferreira T, Morris AP, Medland SE, Madden PA, Heath AC, Martin NG, Montgomery GW, Weedon MN, Loos RJ, et al. Conditional and joint multiple-snp analysis of gwas summary statistics identifies additional variants influencing complex traits. Nature Genetics. 2012;44:369–375.
    1. Nica AC, Montgomery SB, Dimas AS, Stranger BE, Beazley C, Barroso I, Dermitzakis ET. Candidate causal regulatory effects by integration of expression qtls with complex trait genetic associations. PLoS Genetics. 2010;6:e1000895.
    1. Chun S, Casparino A, Patsopoulos NA, Croteau-Chonka DC, Raby BA, De Jager PL, Sunyaev SR, Cotsapas C. Limited statistical evidence for shared genetic effects of eqtls and autoimmune-disease-associated loci in three major immune-cell types. Nat Genet. 2017;49:600–605.
    1. Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLOS Genetics. 2014;10:1–15.
    1. Won H, de la Torre-Ubieta L, Stein JL, Parikshak NN, Huang J, Opland CK, Gandal MJ, Sutton GJ, Hormozdiari F, Lu D, et al. Chromosome conformation elucidates regulatory relationships in developing human brain. Nature. 2016;538:523–527.
    1. Purcell SM, Wray NR, Stone JL, Visscher PM, O’Donovan MC, Sullivan PF, Sklar P, Ruderfer DM, McQuillin A, Morris DW, et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460:748–752.
    1. Vilhjálmsson BJ, Yang J, Finucane HK, Gusev A, Lindström S, Ripke S, Genovese G, Loh PR, Bhatia G, Do R, et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. The American Journal of Human Genetics. 2015;97:576–592.
    1. Palla L, Dudbridge F. A fast method that uses polygenic scores to estimate the variance explained by genome-wide marker panels and the proportion of variants affecting a trait. The American Journal of Human Genetics. 2015;97:250–259.
    1. Daetwyler HD, Villanueva B, Woolliams JA. Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS one. 2008;3:e3395.
    1. The GTEx Consortium. The genotype-tissue expression (gtex) pilot analysis: Multitissue gene regulation in humans. Science. 2015;348:648–660.
    1. Ongen H, Buil A, Brown AA, Dermitzakis ET, Delaneau O. Fast and efficient qtl mapper for thousands of molecular phenotypes. Bioinformatics. 2016;32:1479.
    1. Ryan J, Saffery R. Crucial timing in schizophrenia: role of dna methylation in early neurodevelopment. Genome Biology. 2014;15:495. –.
    1. Akbarian S, Liu C, Knowles JA, Vaccarino FM, Farnham PJ, Crawford GE, Jaffe AE, Pinto D, Dracheva S, Geschwind DH, et al. The psychencode project. Nature neuroscience. 2015;18:1707–1712.
    1. van de Geijn B, McVicker G, Gilad Y, Pritchard JK. Wasp: allele-specific software for robust molecular quantitative trait locus discovery. Nat Meth. 2015;12:1061–1063.
    1. Reilly SK, Yin J, Ayoub AE, Emera D, Leng J, Cotney J, Sarro R, Rakic P, Noonan JP. Evolutionary changes in promoter and enhancer activity during human corticogenesis. Science. 2015;347:1155–1159.
    1. Golzio C, Willer J, Talkowski ME, Oh EC, Taniguchi Y, Jacquemont S, Reymond A, Sun M, Sawa A, Gusella JF, et al. Kctd13 is a major driver of mirrored neuroanatomical phenotypes of the 16p11. 2 copy number variant. Nature. 2012;485:363–367.
    1. Maillard A, Ruef A, Pizzagalli F, Migliavacca E, Hippolyte L, Adaszewski S, Dukart J, Ferrari C, Conus P, Männik K, et al. The 16p11. 2 locus modulates brain structures common to autism, schizophrenia and obesity. Molecular psychiatry. 2015;20:140–147.
    1. McCarthy SE, Makarov V, Kirov G, Addington AM, McClellan J, Yoon S, Perkins DO, Dickel DE, Kusenda M, Krastoshevsky O, et al. Microduplications of 16p11. 2 are associated with schizophrenia. Nature Genetics. 2009;41:1223–1227.
    1. Migliavacca E, Golzio C, Männik K, Blumenthal I, Oh EC, Harewood L, Kosmicki JA, Loviglio MN, Giannuzzi G, Hippolyte L, et al. A potential contributory role for ciliary dysfunction in the 16p11. 2 600 kb bp4-bp5 pathology. The American Journal of Human Genetics. 2015;96:784–796.
    1. Föcking M, Lopez L, English J, Dicker P, Wolff A, Brindley E, Wynne K, Cagney G, Cotter D. Proteomic and genomic evidence implicates the postsynaptic density in schizophrenia. Molecular psychiatry. 2015;20:424–432.
    1. Sibley CR, Blazquez L, Ule J. Lessons from non-canonical splicing. Nature Reviews Genetics 2016
    1. Nelson CE, Hakim CH, Ousterout DG, Thakore PI, Moreb EA, Rivera RMC, Madhavan S, Pan X, Ran FA, Yan WX, et al. In vivo genome editing improves muscle function in a mouse model of duchenne muscular dystrophy. Science. 2016;351:403–407.
    1. Li YI, Knowles DA, Pritchard JK. Leafcutter: Annotation-free quantification of rna splicing. bioRxiv. 2016:044107.
    1. Chen CY, Pollack S, Hunter DJ, Hirschhorn JN, Kraft P, Price AL. Improved ancestry inference using weights from external reference panels. Bioinformatics. 2013:btt144.
    1. Yang J, Lee SH, Goddard ME, Visscher PM. Gcta: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics. 2011;88:76–82.
    1. Yang J, Bakshi A, Zhu Z, Hemani G, Vinkhuyzen AA, Lee SH, Robinson MR, Perry JR, Nolte IM, van Vliet-Ostaptchouk JV, et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nature Genetics 2015
    1. Hormozdiari F, Kostem E, Kang EY, Pasaniuc B, Eskin E. Identifying causal variants at loci with multiple signals of association. Genetics. 2014;198:497–508.
    1. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, Duncan L, Perry JR, Patterson N, Robinson EB, et al. An atlas of genetic correlations across human diseases and traits. Nature Genetics 2015
    1. Jordan DM, Frangakis SG, Golzio C, Cassa CA, Kurtzberg J, Davis EE, Sunyaev SR, Katsanis N. Identification of cis-suppression of human disease mutations by comparative genomics. Nature. 2015;524:225–9.

Source: PubMed

3
Předplatit