Genetics of blood lipids among ~300,000 multi-ethnic participants of the Million Veteran Program

Derek Klarin, Scott M Damrauer, Kelly Cho, Yan V Sun, Tanya M Teslovich, Jacqueline Honerlaw, David R Gagnon, Scott L DuVall, Jin Li, Gina M Peloso, Mark Chaffin, Aeron M Small, Jie Huang, Hua Tang, Julie A Lynch, Yuk-Lam Ho, Dajiang J Liu, Connor A Emdin, Alexander H Li, Jennifer E Huffman, Jennifer S Lee, Pradeep Natarajan, Rajiv Chowdhury, Danish Saleheen, Marijana Vujkovic, Aris Baras, Saiju Pyarajan, Emanuele Di Angelantonio, Benjamin M Neale, Aliya Naheed, Amit V Khera, John Danesh, Kyong-Mi Chang, Gonçalo Abecasis, Cristen Willer, Frederick E Dewey, David J Carey, Global Lipids Genetics Consortium, Myocardial Infarction Genetics (MIGen) Consortium, Geisinger-Regeneron DiscovEHR Collaboration, VA Million Veteran Program, John Concato, J Michael Gaziano, Christopher J O'Donnell, Philip S Tsao, Sekar Kathiresan, Daniel J Rader, Peter W F Wilson, Themistocles L Assimes, Derek Klarin, Scott M Damrauer, Kelly Cho, Yan V Sun, Tanya M Teslovich, Jacqueline Honerlaw, David R Gagnon, Scott L DuVall, Jin Li, Gina M Peloso, Mark Chaffin, Aeron M Small, Jie Huang, Hua Tang, Julie A Lynch, Yuk-Lam Ho, Dajiang J Liu, Connor A Emdin, Alexander H Li, Jennifer E Huffman, Jennifer S Lee, Pradeep Natarajan, Rajiv Chowdhury, Danish Saleheen, Marijana Vujkovic, Aris Baras, Saiju Pyarajan, Emanuele Di Angelantonio, Benjamin M Neale, Aliya Naheed, Amit V Khera, John Danesh, Kyong-Mi Chang, Gonçalo Abecasis, Cristen Willer, Frederick E Dewey, David J Carey, Global Lipids Genetics Consortium, Myocardial Infarction Genetics (MIGen) Consortium, Geisinger-Regeneron DiscovEHR Collaboration, VA Million Veteran Program, John Concato, J Michael Gaziano, Christopher J O'Donnell, Philip S Tsao, Sekar Kathiresan, Daniel J Rader, Peter W F Wilson, Themistocles L Assimes

Abstract

The Million Veteran Program (MVP) was established in 2011 as a national research initiative to determine how genetic variation influences the health of US military veterans. Here we genotyped 312,571 MVP participants using a custom biobank array and linked the genetic data to laboratory and clinical phenotypes extracted from electronic health records covering a median of 10.0 years of follow-up. Among 297,626 veterans with at least one blood lipid measurement, including 57,332 black and 24,743 Hispanic participants, we tested up to around 32 million variants for association with lipid levels and identified 118 novel genome-wide significant loci after meta-analysis with data from the Global Lipids Genetics Consortium (total n > 600,000). Through a focus on mutations predicted to result in a loss of gene function and a phenome-wide association study, we propose novel indications for pharmaceutical inhibitors targeting PCSK9 (abdominal aortic aneurysm), ANGPTL4 (type 2 diabetes) and PDE3B (triglycerides and coronary disease).

Figures

Figure 1.. GWAS Study Design
Figure 1.. GWAS Study Design
a) DNA sequence variants across 3 separate ancestry groups in the Million Veteran Program were meta-analyzed using an inverse-variance weighted fixed effects method in the discovery phase (Stage 1). Variants with suggestive association were then brought forward for independent replication. b) DNA sequence variants with suggestive association (two-sided linear regression P < 10−4) in discovery (Stage 1) were brought forward for independent replication and tested using summary statistics from the 2017 exome-array focused GLGC meta-analysis (Stage 2a). Only variants with suggestive association in Stage 1 that were not present in the GLGC 2017 exome-array study (Stage 2a) were alternatively replicated in the 2013 GLGC “joint meta-analysis” (Stage 2b). Abbreviations: MVP, Million Veteran Program; GWAS, genome-wide association study; EHR, electronic health record; GLGC, Global Lipids Genetics Consortium
Figure 2.. Comparison of 354 Independent Lipid…
Figure 2.. Comparison of 354 Independent Lipid Associated Variants Across Ethnicities
Allele frequencies observed in white individuals (n=215,196; x-axes) compared to black (a, n=57,280; R = 0.72,) or Hispanic(b, n=24,742; R = 0.96) individuals for lipid-associated variants are shown. Effect estimates for LDL-C association in white individuals (n = 215,196; x-axes) compared to black (c, n = 57,280; β = 1.07) or Hispanic (d, n = 24,742; β = 1.06) individuals are also depicted. Abbreviations: SD, Standard Deviations; LDL-C, Low-Density Lipoprotein Cholesterol; R = Pearson correlation coefficient
Figure 3.. PDE3B Loss of Gene Function,…
Figure 3.. PDE3B Loss of Gene Function, Lipids, and Coronary Disease
Linear regression results for the association of the predicted loss of function mutation p.Arg783Ter in PDE3B with HDL-C(a) and triglycerides (b) for white veterans in MVP with independent replication in the DiscovEHR study. Two-sided P values are displayed. c) Meta-analysis of the association of damagingPDE3B mutations and coronary artery disease across five studies, including three (MIGen, PMBB, DiscovEHR) with exome sequencing. Logistic regression results were pooled in an inverse-variance weighted fixed effects meta-analysis. Minimal evidence of heterogeneity across cohorts was observed (I = 0%). Two-sided P values are displayed. Abbreviations: MVP, Million Veteran Program; HDL-C, High-Density Lipoprotein Cholesterol; TG, Triglycerides; UKBB, UK Biobank; MIGen, Myocardial Infarction Genetics Consortium; PMBB, Penn Medicine Biobank
Figure 4.. ANGPTL4 40Lys Carrier Disease Associations.
Figure 4.. ANGPTL4 40Lys Carrier Disease Associations.
Forest plot for a representative 33 of the 1004 disorders tested in theANGPTL4 p.Glu40Lys PheWAS. Statistically significant logistic regression associations are shown in blue. Two-sided P values are displayed.
Figure 5.. PCSK9 46Leu Carrier Disease Associations
Figure 5.. PCSK9 46Leu Carrier Disease Associations
Forest plot for a representative 33 of the 1004 disorders tested in thePCSK9 p.Arg46Leu PheWAS. Statistically significant logistic regression associations are shown in blue. Two-sided P values are displayed.
Figure 6.. Lipid Associations with Abdominal Aortic…
Figure 6.. Lipid Associations with Abdominal Aortic Aneurysm
Logistic regression association results of the 223 variant lipid genetic risk score with abdominal aortic aneurysm in a multivariable Mendelian randomization analysis. Odds ratios are displayed per 1-standard deviation genetically increased lipid fraction. Two-sided P values are displayed. Abbreviations: HDL, High-Density Lipoprotein; LDL, Low-Density Lipoprotein

References

    1. Collins R What makes UK Biobank special? The Lancet 379, 1173–1174 (2012).
    1. Gaziano JM et al. Million Veteran Program: A mega-biobank to study genetic influences on health and disease. J Clin Epidemiol 70, 214–23 (2016).
    1. Di Angelantonio E et al. Major lipids, apolipoproteins, and risk of vascular disease. Jama 302, 1993–2000 (2009).
    1. Teslovich TM et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–13 (2010).
    1. Global Lipids Genetics Consortium et al. Discovery and refinement of loci associated with lipid levels. Nat Genet 45, 1274–83 (2013).
    1. Chasman DI et al. Forty-three loci associated with plasma lipoprotein size, concentration, and cholesterol content in genome-wide analysis. PLoS Genet 5, e1000730 (2009).
    1. Albrechtsen A et al. Exome sequencing-driven discovery of coding polymorphisms associated with common metabolic phenotypes. Diabetologia 56, 298–310 (2013).
    1. Peloso GM et al. Association of low-frequency and rare coding-sequence variants with blood lipids and coronary heart disease in 56,000 whites and blacks. Am J Hum Genet 94, 223–32 (2014).
    1. Asselbergs FW et al. Large-scale gene-centric meta-analysis across 32 studies identifies multiple lipid loci. Am J Hum Genet 91, 823–38 (2012).
    1. Below JE et al. Meta-analysis of lipid-traits in Hispanics identifies novel loci, population-specific effects, and tissue-specific enrichment of eQTLs. Sci Rep 6, 19429 (2016).
    1. Liu DJ et al. Exome-wide association study of plasma lipids in >300,000 individuals. Nat Genet (2017).
    1. Lu X et al. Exome chip meta-analysis identifies novel loci and East Asian-specific coding variants that contribute to lipid levels and coronary artery disease. Nat Genet (2017).
    1. Sabatine MS et al. Evolocumab and Clinical Outcomes in Patients with Cardiovascular Disease. N Engl J Med (2017).
    1. Myocardial Infarction Genetics and CARDIoGRAM Exome Consortia Investigators. Coding Variation in ANGPTL4, LPL, and SVEP1 and the Risk of Coronary Disease. N Engl J Med 374, 1134–44 (2016).
    1. Dewey FE et al. Inactivating Variants in ANGPTL4 and Risk of Coronary Artery Disease. N Engl J Med 374, 1123–33 (2016).
    1. Barter PJ et al. Effects of torcetrapib in patients at high risk for coronary events. N Engl J Med 357, 2109–22 (2007).
    1. Denny JC et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat Biotechnol 31, 1102–10 (2013).
    1. Crosby J et al. Loss-of-function mutations in APOC3, triglycerides, and coronary disease. N Engl J Med 371, 22–31 (2014).
    1. Cohen JC, Boerwinkle E, Mosley TH Jr. & Hobbs HH Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med 354, 1264–72 (2006).
    1. Abul-Husn NS et al. Genetic identification of familial hypercholesterolemia within a single U.S. health care system. Science 354(2016).
    1. The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
    1. Tishkoff SA et al. The genetic structure and history of Africans and African Americans. Science 324, 1035–44 (2009).
    1. Gusev A et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet 48, 245–52 (2016).
    1. Wright FA et al. Heritability and genomics of gene expression in peripheral blood. Nat Genet 46, 430–7 (2014).
    1. GTEx Consortium. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
    1. Mancuso N et al. Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits. Am J Hum Genet 100, 473–487 (2017).
    1. Lek M et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–91 (2016).
    1. Marouli E et al. Rare and low-frequency coding variants alter human adult height. Nature (2017).
    1. McLaren W et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–70 (2010).
    1. Khera AV et al. Association of Rare and Common Variation in the Lipoprotein Lipase Gene With Coronary Artery Disease. JAMA 317, 937–946 (2017).
    1. Dewey FE et al. Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study. Science 354(2016).
    1. Sidore C et al. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers. Nat Genet 47, 1272–81 (2015).
    1. Purcell SM et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–90 (2014).
    1. Diogo D et al. Phenome-wide association studies (PheWAS) across large “real-world data” population cohorts support drug target validation. bioRxiv (2017).
    1. Mahajan A et al. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes. Nat Genet 50, 559–571 (2018).
    1. Bowden J, Davey Smith G & Burgess S Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol 44, 512–25 (2015).
    1. Klarin D et al. Genetic analysis in UK Biobank links insulin resistance and transendothelial migration pathways to coronary artery disease. Nat Genet (2017).
    1. Nelson CP et al. Association analyses based on false discovery rate implicate new loci for coronary artery disease. Nat Genet 49, 1385–1391 (2017).
    1. Gandotra S et al. Perilipin deficiency and autosomal dominant partial lipodystrophy. N Engl J Med 364, 740–8 (2011).
    1. Rani J et al. T2DiACoD: A Gene Atlas of Type 2 Diabetes Mellitus Associated Complex Disorders. Sci Rep 7, 6892 (2017).
    1. Musunuru K et al. Exome Sequencing, ANGPTL3 Mutations, and Familial Combined Hypolipidemia New England Journal of Medicine (2010).
    1. Graham MJ et al. Cardiovascular and Metabolic Effects of ANGPTL3 Antisense Oligonucleotides. N Engl J Med (2017).
    1. Zhang W & Colman RW Thrombin regulates intracellular cyclic AMP concentration in human platelets through phosphorylation/activation of phosphodiesterase 3A. Blood 110, 1475–82 (2007).
    1. Maass PG et al. PDE3A mutations cause autosomal dominant hypertension with brachydactyly. Nat Genet 47, 647–53 (2015).
    1. Vandeput F et al. Selective regulation of cyclic nucleotide phosphodiesterase PDE3A isoforms. Proc Natl Acad Sci U S A 110, 19778–83 (2013).
    1. Bedenis R et al. Cilostazol for intermittent claudication. Cochrane Database Syst Rev, Cd003748 (2014).
    1. Tsuchikane E et al. Impact of cilostazol on restenosis after percutaneous coronary balloon angioplasty. Circulation 100, 21–6 (1999).
    1. Shinohara Y et al. Cilostazol for prevention of secondary stroke (CSPS 2): an aspirin-controlled, double-blind, randomised non-inferiority trial. Lancet Neurol 9, 959–68 (2010).
    1. Ahmad F et al. Phosphodiesterase 3B (PDE3B) regulates NLRP3 inflammasome in adipose tissue. Sci Rep 6, 28056 (2016).
    1. Chung YW et al. Targeted disruption of PDE3B, but not PDE3A, protects murine heart from ischemia/reperfusion injury. Proc Natl Acad Sci U S A 112, E2253–62 (2015).
    1. Harrison SC et al. Genetic Association of Lipids and Lipid Drug Targets With Abdominal Aortic Aneurysm: A Meta-analysis. JAMA Cardiol 3, 26–33 (2018).
    1. Lu H et al. Hypercholesterolemia Induced by a PCSK9 Gain-of-Function Mutation Augments Angiotensin II-Induced Abdominal Aortic Aneurysms in C57BL/6 Mice-Brief Report. Arterioscler Thromb Vasc Biol 36, 1753–7 (2016).
    1. Voight BF et al. Plasma HDL cholesterol and risk of myocardial infarction: a mendelian randomisation study. The Lancet 380, 572–580 (2012).
    1. Do R et al. Common variants associated with plasma triglycerides and risk for coronary artery disease. Nat Genet 45, 1345–52 (2013).
Online Methods References
    1. Loh PR, Palamara PF & Price AL Fast and accurate long-range phasing in a UK Biobank cohort. Nat Genet 48, 811–6 (2016).
    1. Howie B, Fuchsberger C, Stephens M, Marchini J & Abecasis GR Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 44, 955–9 (2012).
    1. Price AL et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38, 904–9 (2006).
    1. Winkler TW et al. Quality control and conduct of genome-wide association meta-analyses. Nat Protoc 9, 1192–212 (2014).
    1. Hyde CL et al. Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nat Genet 48, 1031–6 (2016).
    1. Zhou X, Carbonetto P & Stephens M Polygenic modeling with bayesian sparse linear mixed models. PLoS Genet 9, e1003264 (2013).

Source: PubMed

3
订阅