Polygenic Epidemiology

Frank Dudbridge

Abstract

Much of the genetic basis of complex traits is present on current genotyping products, but the individual variants that affect the traits have largely not been identified. Several traditional problems in genetic epidemiology have recently been addressed by assuming a polygenic basis for disease and treating it as a single entity. Here I briefly review some of these applications, which collectively may be termed polygenic epidemiology. Methodologies in this area include polygenic scoring, linear mixed models, and linkage disequilibrium scoring. They have been used to establish a polygenic effect, estimate genetic correlation between traits, estimate how many variants affect a trait, stratify cases into subphenotypes, predict individual disease risks, and infer causal effects using Mendelian randomization. Polygenic epidemiology will continue to yield useful applications even while much of the specific variation underlying complex traits remains undiscovered.

Keywords: Mendelian randomization; genetic correlation; genetic risk prediction; missing heritability.

© 2016 The Authors. Genetic Epidemiology published by Wiley Periodicals, Inc.

Figures

Figure 1
P‐values (−log10 scale) for selecting variants into a polygenic score such that the area under the receiver operating characteristic curve (AUC) is maximized. A binary trait with prevalence 10% is assumed, with variants selected from a case/control study with equal numbers of cases and controls. Chip heritability of 40% (solid line) and 20% (dashed line) is distributed among 100,000 independent variants, of which 5% have normally distributed effects and the rest have no effect. The vertical line is at 50,000 cases and 50,000 controls, at which point over 95% of the maximum AUC is achieved.
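
The variant architecture described in this caption, together with a P‐value‐thresholded polygenic score, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the author's code: it assumes independent variants with standardized genotypes, so that the variance explained is the sum of squared effects, and it works on a continuous (liability‐like) scale rather than modelling the 10% prevalence or the case/control sampling. The function names `simulate_effects` and `polygenic_score` are hypothetical.

```python
import numpy as np


def simulate_effects(n_variants=100_000, causal_fraction=0.05, chip_h2=0.40, seed=0):
    """Draw per-variant effects for independent, standardized genotypes.

    Assumption: with independent standardized variants, the variance
    explained equals the sum of squared effects, so the normally
    distributed causal effects are rescaled to sum to chip_h2 exactly.
    """
    rng = np.random.default_rng(seed)
    n_causal = int(round(n_variants * causal_fraction))
    beta = np.zeros(n_variants)
    # Choose which variants have an effect; the rest stay at zero.
    causal = rng.choice(n_variants, size=n_causal, replace=False)
    raw = rng.normal(size=n_causal)
    beta[causal] = raw * np.sqrt(chip_h2 / np.sum(raw**2))
    return beta


def polygenic_score(genotypes, weights, p_values, p_threshold):
    """Weighted sum of genotypes over variants with P-value <= p_threshold.

    genotypes: (n_individuals, n_variants) array
    weights, p_values: (n_variants,) arrays, e.g. from a training study
    """
    selected = p_values <= p_threshold
    return genotypes[:, selected] @ weights[selected]


# Architecture from the caption: 100,000 variants, 5% causal, chip heritability 40%.
beta = simulate_effects()

# Toy scoring example with three variants and two individuals (made-up numbers).
toy_genotypes = np.array([[1.0, 0.0, 2.0],
                          [0.0, 1.0, 1.0]])
toy_weights = np.array([0.5, -0.2, 0.1])
toy_p = np.array([0.01, 0.5, 0.04])
toy_scores = polygenic_score(toy_genotypes, toy_weights, toy_p, p_threshold=0.05)
```

Varying `p_threshold` changes which variants enter the score; the figure reports the threshold that maximizes AUC at each training sample size, which this sketch does not attempt to compute.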

Source: PubMed
