Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity
Danish Saleheen, Pradeep Natarajan, Irina M Armean, Wei Zhao, Asif Rasheed, Sumeet A Khetarpal, Hong-Hee Won, Konrad J Karczewski, Anne H O'Donnell-Luria, Kaitlin E Samocha, Benjamin Weisburd, Namrata Gupta, Mozzam Zaidi, Maria Samuel, Atif Imran, Shahid Abbas, Faisal Majeed, Madiha Ishaq, Saba Akhtar, Kevin Trindade, Megan Mucksavage, Nadeem Qamar, Khan Shah Zaman, Zia Yaqoob, Tahir Saghir, Syed Nadeem Hasan Rizvi, Anis Memon, Nadeem Hayyat Mallick, Mohammad Ishaq, Syed Zahed Rasheed, Fazal-Ur-Rehman Memon, Khalid Mahmood, Naveeduddin Ahmed, Ron Do, Ronald M Krauss, Daniel G MacArthur, Stacey Gabriel, Eric S Lander, Mark J Daly, Philippe Frossard, John Danesh, Daniel J Rader, Sekar Kathiresan, Danish Saleheen, Pradeep Natarajan, Irina M Armean, Wei Zhao, Asif Rasheed, Sumeet A Khetarpal, Hong-Hee Won, Konrad J Karczewski, Anne H O'Donnell-Luria, Kaitlin E Samocha, Benjamin Weisburd, Namrata Gupta, Mozzam Zaidi, Maria Samuel, Atif Imran, Shahid Abbas, Faisal Majeed, Madiha Ishaq, Saba Akhtar, Kevin Trindade, Megan Mucksavage, Nadeem Qamar, Khan Shah Zaman, Zia Yaqoob, Tahir Saghir, Syed Nadeem Hasan Rizvi, Anis Memon, Nadeem Hayyat Mallick, Mohammad Ishaq, Syed Zahed Rasheed, Fazal-Ur-Rehman Memon, Khalid Mahmood, Naveeduddin Ahmed, Ron Do, Ronald M Krauss, Daniel G MacArthur, Stacey Gabriel, Eric S Lander, Mark J Daly, Philippe Frossard, John Danesh, Daniel J Rader, Sekar Kathiresan
Abstract
A major goal of biomedicine is to understand the function of every gene in the human genome. Loss-of-function mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such 'human knockouts' can provide insight into gene function. Consanguineous unions are more likely to result in offspring carrying homozygous loss-of-function mutations. In Pakistan, consanguinity rates are notably high. Here we sequence the protein-coding regions of 10,503 adult participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS), designed to understand the determinants of cardiometabolic diseases in individuals from South Asia. We identified individuals carrying homozygous predicted loss-of-function (pLoF) mutations, and performed phenotypic analysis involving more than 200 biochemical and disease traits. We enumerated 49,138 rare (<1% minor allele frequency) pLoF mutations. These pLoF mutations are estimated to knock out 1,317 genes, each in at least one participant. Homozygosity for pLoF mutations at PLA2G7 was associated with absent enzymatic activity of soluble lipoprotein-associated phospholipase A2; at CYP2F1, with higher plasma interleukin-8 concentrations; at TREH, with lower concentrations of apoB-containing lipoprotein subfractions; at either A3GALT2 or NRG4, with markedly reduced plasma insulin C-peptide concentrations; and at SLC9A3R1, with mediators of calcium and phosphate signalling. Heterozygous deficiency of APOC3 has been shown to protect against coronary heart disease; we identified APOC3 homozygous pLoF carriers in our cohort. We recruited these human knockouts and challenged them with an oral fat load. Compared with family members lacking the mutation, individuals with APOC3 knocked out displayed marked blunting of the usual post-prandial rise in plasma triglycerides. Overall, these observations provide a roadmap for a 'human knockout project', a systematic effort to understand the phenotypic consequences of complete disruption of genes in humans.
Conflict of interest statement
The authors do not declare competing financial interests.
Figures
References
- Eisenberg D, Marcotte EM, Xenarios I, Yeates TO. Protein Function in the Post-Genomic Era. Nature. 2000;405:823–826. doi: 10.1038/35015694.
- Bittles AH, Mason WM, Greene J, Rao NA. Reproductive Behavior and Health in Consanguineous Marriages. Science. 1991;252:789–794.
- Saleheen D, et al. The Pakistan Risk of Myocardial Infarction Study: A Resource for the Study of Genetic, Lifestyle and Other Determinants of Myocardial Infarction in South Asia. Eur J Epidemiol. 2009;24:329–338. doi: 10.1007/s10654-009-9334-y.
- Crosby J, et al. Loss-of-Function Mutations in APOC3, Triglycerides, and Coronary Disease. N Engl J Med. 2014;371:22–31. doi: 10.1056/NEJMoa1307095.
- Jorgensen AB, Frikke-Schmidt R, Nordestgaard BG, Tybjaerg-Hansen A. Loss-of-Function Mutations in Apoc3 and Risk of Ischemic Vascular Disease. N Engl J Med. 2014;371:32–41. doi: 10.1056/NEJMoa1308027.
- Narasimhan VM, et al. Health and Population Effects of Rare Gene Knockouts in Adult Humans with Related Parents. Science. 2016 doi: 10.1126/science.aac8624.
- Sulem P, et al. Identification of a Large Set of Rare Complete Human Knockouts. Nat Genet. 2015 doi: 10.1038/ng.3243.
- Lek M, et al. Analysis of Protein-Coding Genetic Variation in 60,706 Humans. Nature. 2016;536:285–291. doi: 10.1038/nature19057.
- Di Angelantonio E, et al. Lipid-Related Markers and Cardiovascular Disease Prediction. JAMA. 2012;307:2499–2506. doi: 10.1001/jama.2012.6571.
- Gregson JM, et al. Genetic Invalidation of Lp-Pla2 as a Therapeutic Target: Large-Scale Study of Five Functional Lp-Pla2-Lowering Alleles. Eur J Prev Cardiol. 2016 doi: 10.1177/2047487316682186.
- Polfus LM, Gibbs RA, Boerwinkle E. Coronary Heart Disease and Genetic Variants with Low Phospholipase A2 Activity. N Engl J Med. 2015;372:295–296. doi: 10.1056/NEJMc1409673.
- White HD, et al. Darapladib for Preventing Ischemic Events in Stable Coronary Heart Disease. N Engl J Med. 2014;370:1702–1711. doi: 10.1056/NEJMoa1315878.
- O’Donoghue ML, et al. Effect of Darapladib on Major Coronary Events after an Acute Coronary Syndrome: The Solid-Timi 52 Randomized Clinical Trial. JAMA. 2014;312:1006–1015. doi: 10.1001/jama.2014.11061.
- Carr BA, Wan J, Hines RN, Yost GS. Characterization of the Human Lung Cyp2f1 Gene and Identification of a Novel Lung-Specific Binding Motif. J Biol Chem. 2003;278:15473–15483. doi: 10.1074/jbc.M300319200.
- Standiford TJ, et al. Interleukin-8 Gene Expression by a Pulmonary Epithelial Cell Line. A Model for Cytokine Networks in the Lung. J Clin Invest. 1990;86:1945–1953. doi: 10.1172/JCI114928.
- Murray IA, Coupland K, Smith JA, Ansell ID, Long RG. Intestinal Trehalase Activity in a Uk Population: Establishing a Normal Range and the Effect of Disease. Br J Nutr. 2000;83:241–245.
- Christiansen D, et al. Humans Lack Igb3 Due to the Absence of Functional Igb3-Synthase: Implications for Nkt Cell Development and Transplantation. PLoS Biol. 2008;6:e172. doi: 10.1371/journal.pbio.0060172.
- Dahl K, Buschard K, Gram DX, d’Apice AJ, Hansen AK. Glucose Intolerance in a Xenotransplantation Model: Studies in Alpha-Gal Knockout Mice. APMIS. 2006;114:805–811. doi: 10.1111/j.1600-0463.2006.apm_393.x.
- Casu A, et al. Insulin Secretion and Glucose Metabolism in Alpha 1,3-Galactosyltransferase Knock-out Pigs Compared to Wild-Type Pigs. Xenotransplantation. 2010;17:131–139. doi: 10.1111/j.1399-3089.2010.00572.x.
- Schneider MR, Wolf E. The Epidermal Growth Factor Receptor Ligands at a Glance. J Cell Physiol. 2009;218:460–466. doi: 10.1002/jcp.21635.
- Wang GX, et al. The Brown Fat-Enriched Secreted Factor Nrg4 Preserves Metabolic Homeostasis through Attenuation of Hepatic Lipogenesis. Nat Med. 2014;20:1436–1443. doi: 10.1038/nm.3713.
- Murtazina R, et al. Tissue-Specific Regulation of Sodium/Proton Exchanger Isoform 3 Activity in Na(+)/H(+) Exchanger Regulatory Factor 1 (Nherf1) Null Mice. Camp Inhibition Is Differentially Dependent on Nherf1 and Exchange Protein Directly Activated by Camp in Ileum Versus Proximal Tubule. J Biol Chem. 2007;282:25141–25151. doi: 10.1074/jbc.M701910200.
- Karim Z, et al. Nherf1 Mutations and Responsiveness of Renal Parathyroid Hormone. N Engl J Med. 2008;359:1128–1135. doi: 10.1056/NEJMoa0802836.
- Huff MW, Hegele RA, Apolipoprotein C., III Going Back to the Future for a Lipid Drug Target. Circ Res. 2013;112:1405–1408. doi: 10.1161/CIRCRESAHA.113.301464.
- Pollin TI, et al. A Null Mutation in Human APOC3 Confers a Favorable Plasma Lipid Profile and Apparent Cardioprotection. Science. 2008;322:1702–1705. doi: 10.1126/science.1161524.
- Gaudet D, et al. Antisense Inhibition of Apolipoprotein C-III in Patients with Hypertriglyceridemia. N Engl J Med. 2015;373:438–447. doi: 10.1056/NEJMoa1400283.
- Gaudet D, et al. Targeting APOC3 in the Familial Chylomicronemia Syndrome. N Engl J Med. 2014;371:2200–2206. doi: 10.1056/NEJMoa1400284.
- Graham MJ, et al. Antisense Oligonucleotide Inhibition of Apolipoprotein C-III Reduces Plasma Triglycerides in Rodents, Nonhuman Primates, and Humans. Circ Res. 2013;112:1479–1490. doi: 10.1161/circresaha.111.300367.
- Brown SD, Moore MW. Towards an Encyclopaedia of Mammalian Gene Function: The International Mouse Phenotyping Consortium. Dis Model Mech. 2012;5:289–292. doi: 10.1242/dmm.009878.
- Scott EM, et al. Characterization of Greater Middle Eastern Genetic Variation for Enhanced Disease Gene Discovery. Nat Genet. 2016 doi: 10.1038/ng.3592.
- Kooner JS, et al. Genome-Wide Association Study in Individuals of South Asian Ancestry Identifies Six New Type 2 Diabetes Susceptibility Loci. Nat Genet. 2011;43:984–989. doi: 10.1038/ng.921.
- Purcell S, et al. Plink: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795.
- Tennessen JA, et al. Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes. Science. 2012;337:64–69. doi: 10.1126/science.1219240.
- Do R, et al. Exome Sequencing Identifies Rare Ldlr and Apoa5 Alleles Conferring Risk for Myocardial Infarction. Nature. 2015;518:102–106. doi: 10.1038/nature13917.
- Fisher S, et al. A Scalable, Fully Automated Process for Construction of Sequence-Ready Human Exome Targeted Capture Libraries. Genome Biol. 2011;12:R1. doi: 10.1186/gb-2011-12-1-r1.
- Li H, Durbin R. Fast and Accurate Short Read Alignment with Burrows-Wheeler Transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324.
- McKenna A, et al. The Genome Analysis Toolkit: A Mapreduce Framework for Analyzing Next-Generation DNA Sequencing Data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110.
- DePristo MA, et al. A Framework for Variation Discovery and Genotyping Using Next-Generation DNA Sequencing Data. Nat Genet. 2011;43:491–498. doi: 10.1038/ng.806.
- Van der Auwera GA, et al. From Fastq Data to High Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. Curr Protoc Bioinformatics. 2013;11:11 10 11–11 10 33. doi: 10.1002/0471250953.bi1110s43.
- McLaren W, et al. Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor. Bioinformatics. 2010;26:2069–2070. doi: 10.1093/bioinformatics/btq330.
- Karczewski KJ. Loftee (Loss-of-Function Transcript Effect Estimator) 2015 .
- Jun G, et al. Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data. Am J Hum Genet. 2012;91:839–848. doi: 10.1016/j.ajhg.2012.09.004.
- Manichaikul A, et al. Robust Relationship Inference in Genome-Wide Association Studies. Bioinformatics. 2010;26:2867–2873. doi: 10.1093/bioinformatics/btq559.
- Gold L, et al. Aptamer-Based Multiplexed Proteomic Technology for Biomarker Discovery. PLoS One. 2010;5:e15004. doi: 10.1371/journal.pone.0015004.
- Hunter-Zinck H, et al. Population Genetic Structure of the People of Qatar. Am J Hum Genet. 2010;87:17–25. doi: 10.1016/j.ajhg.2010.05.018.
- Lander ES, Botstein D. Homozygosity Mapping: A Way to Map Human Recessive Traits with the DNA of Inbred Children. Science. 1987;236:1567–1570.
- Purcell SM, et al. A Polygenic Burden of Rare Disruptive Mutations in Schizophrenia. Nature. 2014;506:185–190. doi: 10.1038/nature12975.
- Wright S. Coefficients of Inbreeding and Relationship. Am Nat. 1922;56:330–338.
- De Rubeis S, et al. Synaptic, Transcriptional and Chromatin Genes Disrupted in Autism. Nature. 2014;515:209–215. doi: 10.1038/nature13772.
- Samocha KE, et al. A Framework for the Interpretation of De Novo Mutation in Human Disease. Nat Genet. 2014;46:944–950. doi: 10.1038/ng.3050.
- Wang T, et al. Identification and Characterization of Essential Genes in the Human Genome. Science. 2015;350:1096–1101. doi: 10.1126/science.aac7041.
- Eppig JT, et al. The Mouse Genome Database (Mgd): Facilitating Mouse as a Model for Human Biology and Disease. Nucleic Acids Res. 2015;43:D726–736. doi: 10.1093/nar/gku967.
- Georgi B, Voight BF, Bucan M. From Mouse to Human: Evolutionary Genomics Analysis of Human Orthologs of Essential Genes. PLoS Genet. 2013;9:e1003484. doi: 10.1371/journal.pgen.1003484.
- Fuchs M, et al. The P400 Complex Is an Essential E1a Transformation Target. Cell. 2001;106:297–307.
- Fazzio TG, Huff JT, Panning B. An Rnai Screen of Chromatin Proteins Identifies Tip60-P400 as a Regulator of Embryonic Stem Cell Identity. Cell. 2008;134:162–174. doi: 10.1016/j.cell.2008.05.031.
- Price AL, et al. Principal Components Analysis Corrects for Stratification in Genome-Wide Association Studies. Nat Genet. 2006;38:904–909. doi: 10.1038/ng1847.
- Sambrook J, Russell DW. Purification of Nucleic Acids by Extraction with Phenol:Chloroform. CSH Protoc. 2006;2006 doi: 10.1101/pdb.prot4455.
- Mosteller RD. Simplified Calculation of Body-Surface Area. N Engl J Med. 1987;317:1098. doi: 10.1056/NEJM198710223171717.
- Maraki M, et al. Validity of Abbreviated Oral Fat Tolerance Tests for Assessing Postprandial Lipemia. Clin Nutr. 2011;30:852–857. doi: 10.1016/j.clnu.2011.05.003.
Source: PubMed