Classification of common human diseases derived from shared genetic and environmental determinants
Kanix Wang, Hallie Gaitsch, Hoifung Poon, Nancy J Cox, Andrey Rzhetsky
Abstract
In this study, we used insurance claims for over one-third of the entire US population to create a subset of 128,989 families (481,657 unique individuals). We then used these data to (i) estimate the heritability and familial environmental patterns of 149 diseases and (ii) infer the genetic and environmental correlations for disease pairs from a set of 29 complex diseases. The majority (52 of 65) of our study's heritability estimates matched earlier reports, and 84 of our estimates appear to have been obtained for the first time. We used correlation matrices to compute environmental and genetic disease classifications and corresponding reliability measures. Among unexpected observations, we found that migraine, typically classified as a disease of the central nervous system, appeared to be most genetically similar to irritable bowel syndrome and most environmentally similar to cystitis and urethritis, all of which are inflammatory diseases.
Conflict of interest statement
The authors declare no competing financial interests.
Source: PubMed