Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations
Amit V Khera, Mark Chaffin, Krishna G Aragam, Mary E Haas, Carolina Roselli, Seung Hoan Choi, Pradeep Natarajan, Eric S Lander, Steven A Lubitz, Patrick T Ellinor, Sekar Kathiresan, Amit V Khera, Mark Chaffin, Krishna G Aragam, Mary E Haas, Carolina Roselli, Seung Hoan Choi, Pradeep Natarajan, Eric S Lander, Steven A Lubitz, Patrick T Ellinor, Sekar Kathiresan
Abstract
A key public health need is to identify individuals at high risk for a given disease to enable enhanced screening or preventive therapies. Because most common diseases have a genetic component, one important approach is to stratify individuals based on inherited DNA variation1. Proposed clinical applications have largely focused on finding carriers of rare monogenic mutations at several-fold increased risk. Although most disease risk is polygenic in nature2-5, it has not yet been possible to use polygenic predictors to identify individuals at risk comparable to monogenic mutations. Here, we develop and validate genome-wide polygenic scores for five common diseases. The approach identifies 8.0, 6.1, 3.5, 3.2, and 1.5% of the population at greater than threefold increased risk for coronary artery disease, atrial fibrillation, type 2 diabetes, inflammatory bowel disease, and breast cancer, respectively. For coronary artery disease, this prevalence is 20-fold higher than the carrier frequency of rare monogenic mutations conferring comparable risk6. We propose that it is time to contemplate the inclusion of polygenic risk prediction in clinical care, and discuss relevant issues.
Figures
References
- Green ED, Guyer MS; National Human Genome Research Institute. Charting a course for genomic medicine from base pairs to bedside. Nature. 470, 204–213 (2011).
- Fisher RA The correlation between relatives on the supposition of Mendelian inheritance. Proc. Roy. Soc. Edinburgh 52, 99–433 (1918).
- Gibson G Rare and common variants: twenty arguments. Nat Rev Genet. 18, 135–45 (2012).
- Golan D, Lander ES, Rosset S. Measuring missing heritability: inferring the contribution of common variants. Proc Natl Acad Sci U S A. 111, E5272–81 (2014).
- Fuchsberger C, et al. The genetic architecture of type 2 diabetes. Nature. 536, 41–47 (2016).
- Abul-Husn NS, et al. Genetic identification of familial hypercholesterolemia within a single U.S. health care system. Science. 354 (2016).
- Nordestgaard BG, et al. Familial hypercholesterolaemia is underdiagnosed and undertreated in the general population: guidance for clinicians to prevent coronary heart disease: consensus statement of the European Atherosclerosis Society. Eur Heart J. 34, 3478–90a (2013).
- Lek M, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 536, 285–91 (2016).
- Estrada K, et al. Association of a low-frequency variant in HNF1A with type 2 diabetes in a Latino population. JAMA. 311, 2305–14 (2014).
- Chatterjee N et al. Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies. Nat Genet. 45, 400–405 (2013).
- Zhang Y, et al. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future. Preprint at: (2017).
- Ripatti S, et al. A multilocus genetic risk score for coronary heart disease: case-control and prospective cohort analyses. Lancet. 327, 1393–400 (2010).
- Vilhjálmsson BJ et al. Modeling linkage disequilibrium increases accuracy of polygenic scores. Am J Hum Genet. 97, 576–592 (2015).
- Sudlow C et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
- Bycroft C, et al. Genome-wide genetic data on ~500,000 UK Biobank participants. Preprint at: (2017).
- Nikpay M et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 47,1121–1130 (2015).
- Tada H, et al. Risk prediction by genetic risk scores for coronary heart disease is independent of self-reported family history. Eur Heart J. 37, 561–7 (2016).
- Abraham G, et al. Genomic prediction of coronary heart disease. Eur Heart J. 37, 3267–3278 (2016).
- Khera AV, et al. Genetic risk, adherence to a healthy lifestyle, and coronary disease. N Engl J Med. 375, 2349–2358 (2016).
- Mega JL, et al. Genetic risk, coronary heart disease events, and the clinical benefit of statin therapy: an analysis of primary and secondary prevention trials. Lancet. 385, 2264–2271 (2015).
- Natarajan P, et al. Polygenic risk score identifies subgroup with higher burden of atherosclerosis and greater relative benefit from statin therapy in the primary prevention setting. Circulation. 135, 2091–2101 (2017).
- January CT, et al. 2014 AHA/ACC/HRS guideline for the management of patients with atrial fibrillation: a report of the American College of Cardiology/American Heart Association Task Force on practice guidelines and the Heart Rhythm Society. Circulation. 130, e199–267 (2014).
- GBD 2015 Disease and Injury Incidence and Prevalence Collaborators. Global, regional, and national incidence, prevalence, and years live with disability for 310 diseases and injuries, 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet. 388, 1545–1602 (2016).
- Knowler WC, et al. Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. N Engl J Med. 346, 393–403 (2002).
- Abraham C & Cho JH Inflammatory bowel disease. N Engl J Med. 361, 2066–78 (2009).
- Pharoah PD, Antoniou AC, Easton DF, Ponder BA. Polygenes, risk prediction, and targeted prevention of breast cancer. N Engl J Med. 358, 2796–803 (2008).
- Fry A, et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol. 186, 1026–34 (2017).
- Khera AV & Kathiresan S Is coronary atherosclerosis one disease or many? Setting realistic expectations for precision medicine. Circulation. 135, 1005–07 (2017).
- Martin AR et al. Human demographic history impacts genetic risk prediction across diverse populations. Am J Hum Genet. 100, 635–649 (2017).
- Christophersen IE, et al. Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation. Nat Genet. 49, 946–952 (2017).
- Scott RA, et al. An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes. 66, 2888–2902 (2017).
- Liu JZ, et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet. 47, 979–986 (2015).
- Michailidou K, et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 551, 92–94 (2017).
- The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 526, 68–74 (2015).
- Chang CC, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 4, 7 (2015).
- Ganna A, et al. Ultra-rare disruptive and damaging mutations influence educational attainment in the general population. Nat Neurosci. 19, 1563–65 (2016).
Source: PubMed