Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies

Joshua C Denny, Dana C Crawford, Marylyn D Ritchie, Suzette J Bielinski, Melissa A Basford, Yuki Bradford, High Seng Chai, Lisa Bastarache, Rebecca Zuvich, Peggy Peissig, David Carrell, Andrea H Ramirez, Jyotishman Pathak, Russell A Wilke, Luke Rasmussen, Xiaoming Wang, Jennifer A Pacheco, Abel N Kho, M Geoffrey Hayes, Noah Weston, Martha Matsumoto, Peter A Kopp, Katherine M Newton, Gail P Jarvik, Rongling Li, Teri A Manolio, Iftikhar J Kullo, Christopher G Chute, Rex L Chisholm, Eric B Larson, Catherine A McCarty, Daniel R Masys, Dan M Roden, Mariza de Andrade, Joshua C Denny, Dana C Crawford, Marylyn D Ritchie, Suzette J Bielinski, Melissa A Basford, Yuki Bradford, High Seng Chai, Lisa Bastarache, Rebecca Zuvich, Peggy Peissig, David Carrell, Andrea H Ramirez, Jyotishman Pathak, Russell A Wilke, Luke Rasmussen, Xiaoming Wang, Jennifer A Pacheco, Abel N Kho, M Geoffrey Hayes, Noah Weston, Martha Matsumoto, Peter A Kopp, Katherine M Newton, Gail P Jarvik, Rongling Li, Teri A Manolio, Iftikhar J Kullo, Christopher G Chute, Rex L Chisholm, Eric B Larson, Catherine A McCarty, Daniel R Masys, Dan M Roden, Mariza de Andrade

Abstract

We repurposed existing genotypes in DNA biobanks across the Electronic Medical Records and Genomics network to perform a genome-wide association study for primary hypothyroidism, the most common thyroid disease. Electronic selection algorithms incorporating billing codes, laboratory values, text queries, and medication records identified 1317 cases and 5053 controls of European ancestry within five electronic medical records (EMRs); the algorithms' positive predictive values were 92.4% and 98.5% for cases and controls, respectively. Four single-nucleotide polymorphisms (SNPs) in linkage disequilibrium at 9q22 near FOXE1 were associated with hypothyroidism at genome-wide significance, the strongest being rs7850258 (odds ratio [OR] 0.74, p = 3.96 × 10(-9)). This association was replicated in a set of 263 cases and 1616 controls (OR = 0.60, p = 5.7 × 10(-6)). A phenome-wide association study (PheWAS) that was performed on this locus with 13,617 individuals and more than 200,000 patient-years of billing data identified associations with additional phenotypes: thyroiditis (OR = 0.58, p = 1.4 × 10(-5)), nodular (OR = 0.76, p = 3.1 × 10(-5)) and multinodular (OR = 0.69, p = 3.9 × 10(-5)) goiters, and thyrotoxicosis (OR = 0.76, p = 1.5 × 10(-3)), but not Graves disease (OR = 1.03, p = 0.82). Thyroid cancer, previously associated with this locus, was not significantly associated in the PheWAS (OR = 1.29, p = 0.09). The strongest association in the PheWAS was hypothyroidism (OR = 0.76, p = 2.7 × 10(-13)), which had an odds ratio that was nearly identical to that of the curated case-control population in the primary analysis, providing further validation of the PheWAS method. Our findings indicate that EMR-linked genomic data could allow discovery of genes associated with many diseases without additional genotyping cost.

Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Figures

Figure 1
Figure 1
Genome-wide Association Analysis of Individuals with Primary Hypothyroidism versus Controls SNP tests of association (logistic regression) under the assumption of an additive genetic model adjusted for sex, birth decade, and study site; the tests incorporated 522,164 SNPs. The red horizontal line indicates p = 5 × 10−8, the threshold for genome-wide significance.
Figure 2
Figure 2
Regional Associations Near FOXE1
Figure 3
Figure 3
Manhattan Plot of Phenome-wide Association Study for rs965513 The PheWAS considered 866 phenotypes via single SNP tests of association (logistic regression) adjusted for age and sex. The red line indicates Bonferroni significance for these associations, p = 5.8 × 10−5. The blue line indicates p = 0.05. For labeled associations, the dot size varies by the magnitude of the odds ratio. For purposes of display, we normalized all odds ratios by taking the multiplicative absolute value such that all values were greater than 1 (e.g., an odds ratio of 0.5 becomes 2).

Source: PubMed

3
구독하다