Comparisons of Polyexposure, Polygenic, and Clinical Risk Scores in Risk Prediction of Type 2 Diabetes

Yixuan He, Chirag M Lakhani, Danielle Rasooly, Arjun K Manrai, Ioanna Tzoulaki, Chirag J Patel, Yixuan He, Chirag M Lakhani, Danielle Rasooly, Arjun K Manrai, Ioanna Tzoulaki, Chirag J Patel

Abstract

Objective: To establish a polyexposure score (PXS) for type 2 diabetes (T2D) incorporating 12 nongenetic exposures and examine whether a PXS and/or a polygenic risk score (PGS) improves diabetes prediction beyond traditional clinical risk factors.

Research design and methods: We identified 356,621 unrelated individuals from the UK Biobank of White British ancestry with no prior diagnosis of T2D and normal HbA1c levels. Using self-reported and hospital admission information, we deployed a machine learning procedure to select the most predictive and robust factors out of 111 nongenetically ascertained exposure and lifestyle variables for the PXS in prospective T2D. We computed the clinical risk score (CRS) and PGS by taking a weighted sum of eight established clinical risk factors and >6 million single nucleotide polymorphisms, respectively.

Results: In the study population, 7,513 had incident T2D. The C-statistics for the PGS, PXS, and CRS models were 0.709, 0.762, and 0.839, respectively. Individuals in the top 10% of PGS, PXS, and CRS had 2.00-, 5.90-, and 9.97-fold greater risk, respectively, compared to the remaining population. Addition of PGS and PXS to CRS improved T2D classification accuracy, with a continuous net reclassification index of 15.2% and 30.1% for cases, respectively, and 7.3% and 16.9% for controls, respectively.

Conclusions: For T2D, the PXS provides modest incremental predictive value over established clinical risk factors. However, the concept of PXS merits further consideration in T2D risk stratification and is likely to be useful in other chronic disease risk prediction models.

© 2021 by the American Diabetes Association.

Figures

Figure 1
Figure 1
Study design. PXS, CRS, and PGS were calculated and compared for predictive accuracy. PGS was calculated using previously published weights. CRS factors included sex, age, family history, BMI, systolic blood pressure, serum glucose levels, serum HDL-C, and serum triglycerides. PXS factors were selected using a lasso-based method that relied on summary statistics from XWAS. CRF, clinical risk factor.
Figure 2
Figure 2
Reclassification of predicted T2D risk. The reclassified predicted risk with addition of PGS (A), PXS (B), or PGS + PXS (C) to the CRS model in the continuous case or the categorical case with a threshold of 12.5% risk. The overall NRI is the sum of the net reclassifications for cases (P[up|case] − P[down|case]) and noncases (P[down|noncase] − P[up|noncase]). A positive NRI indicates improved reclassification.

References

    1. Morgan CL, Currie CJ, Peters JR. Relationship between diabetes and mortality: a population study using record linkage. Diabetes Care 2000;23:1103–1107
    1. Meigs JB, Shrader P, Sullivan LM, et al. . Genotype score in addition to common risk factors for prediction of type 2 diabetes. N Engl J Med 2008;359:2208–2219
    1. Khera AV, Chaffin M, Aragam KG, et al. . Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet 2018;50:1219–1224
    1. Mars N, Koskela JT, Ripatti P, et al. .; FinnGen . Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat Med 2020;26:549–557
    1. Pan A, Wang Y, Talaei M, Hu FB, Wu T. Relation of active, passive, and quitting smoking with incident type 2 diabetes: a systematic review and meta-analysis. Lancet Diabetes Endocrinol 2015;3:958–967
    1. Neuenschwander M, Ballon A, Weber KS, et al. . Role of diet in type 2 diabetes incidence: umbrella review of meta-analyses of prospective observational studies. BMJ 2019;366:l2368.
    1. Martinell M, Pingel R, Hallqvist J, et al. . Education, immigration and income as risk factors for hemoglobin A1c >70 mmol/mol when diagnosed with type 2 diabetes or latent autoimmune diabetes in adult: a population-based cohort study. BMJ Open Diabetes Res Care 2017;5:e000346
    1. Smith GD, Lawlor DA, Harbord R, Timpson N, Day I, Ebrahim S. Clustered environments and randomized genes: a fundamental distinction between conventional and genetic epidemiology. PLoS Med 2007;4:e352.
    1. Ioannidis JPA, Loy EY, Poulton R, Chia KS. Researching genetic versus nongenetic determinants of disease: a comparison and proposed unification. Sci Transl Med 2009;1:7ps8
    1. van der Meer T, Wolffenbuttel B, Patel CJ. Data-driven assessment, contextualization and implementation of 134 variables in their risk for type 2 diabetes: An analysis of Lifelines, a prospective cohort study in the Netherlands. Diabetologia. In press
    1. Sudlow C, Gallacher J, Allen N, et al. . UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 2015;12:e1001779.
    1. Bycroft C, Freeman C, Petkova D, et al. . The UK Biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–209
    1. Wilson PWF, Meigs JB, Sullivan L, Fox CS, Nathan DM, D’Agostino RB Sr. Prediction of incident diabetes mellitus in middle-aged adults: the Framingham Offspring Study. Arch Intern Med 2007;167:1068–1074
    1. American Diabetes Association . Standards of Medical Care in Diabetes-2018 abridged for primary care providers. Clin Diabetes 2018;36:14–37
    1. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015;4:7.
    1. Millard LAC, Davies NM, Gaunt TR, Davey Smith G, Tilling K. Software application profile: PHESANT: a tool for performing automated phenome scans in UK Biobank. Int J Epidemiol 2018;47:29–35
    1. Patel CJ, Bhattacharya J, Butte AJ. An environment-wide association study (EWAS) on type 2 diabetes mellitus. PLoS One 2010;5:e10746.
    1. Patel CJ, Cullen MR, Ioannidis JP, Butte AJ. Systematic evaluation of environmental factors: persistent pollutants and nutrients correlated with serum lipid levels. Int J Epidemiol 2012;41:828–843
    1. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc B 1996;58:267–288
    1. Balding DJ, Moltke I, Marioni J, Eds. Handbook of Statistical Genomics. 4th ed. Hoboken, NJ, Wiley, 2019
    1. Therneau TM, Grambsch PM. Modeling Survival Data: Extending the Cox Model. New York, Springer, 2000
    1. Benjamini Y, Hochberg Y. Controlling the false Discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B 1995;57:289–300
    1. Conway JR, Lex A, Gehlenborg N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics 2017;33:2938–2940
    1. Patel CJ, Ioannidis JPA. Studying the elusive environment in large scale. JAMA 2014;311:2173–2174
    1. Patel CJ, Ioannidis JPA. Placing epidemiological results in the context of multiplicity and typical correlations of exposures. J Epidemiol Community Health 2014;68:1096–1100
    1. Manrai AK, Patel CJ, Gehlenborg N, Tatonetti NP, Ioannidis JPA, Kohane IS. Methods to enhance the reproducibility of precision medicine. Pac Symp Biocomput 2016;21:180–182
    1. Vermeulen R, Schymanski EL, Barabási A-L, Miller GW. The exposome and health: where chemistry meets biology. Science 2020;367:392–396
    1. Figueroa JF, Frakt AB, Jha AK. Addressing social determinants of health: time for a polysocial risk score. JAMA 2020;323:1553–1554
    1. Elliott J, Bodinier B, Bond TA, et al. . Predictive accuracy of a polygenic risk score-enhanced prediction model vs a clinical risk score for coronary artery disease. JAMA 2020;323:636–645
    1. Abbasi A, Peelen LM, Corpeleijn E, et al. . Prediction models for risk of developing type 2 diabetes: systematic literature search and independent external validation study. BMJ 2012;345:e5900.
    1. Ioannidis JPA. Why most published research findings are false. PLoS Med 2005;2:e124.
    1. Tzoulaki I, Elliott P, Kontis V, Ezzati M. Worldwide exposures to cardiovascular risk factors and associated health effects: current knowledge and data gaps. Circulation 2016;133:2314–2333
    1. Johansen MY, MacDonald CS, Hansen KB, et al. . Effect of an intensive lifestyle intervention on glycemic control in patients with type 2 diabetes: a randomized clinical trial. JAMA 2017;318:637–646
    1. Borgnakke WS. “Non-modifiable” risk factors for periodontitis and diabetes. Curr Oral Health Rep 2016;3:270–281
    1. Ioannidis JPA. The challenge of reforming nutritional epidemiologic research. JAMA 2018;320:969–970
    1. Aschard H, Lutz S, Maus B, et al. . Challenges and opportunities in genome-wide environmental interaction (GWEI) studies. Hum Genet 2012;131:1591–1613
    1. Patel CJ, Chen R, Kodama K, Ioannidis JPA, Butte AJ. Systematic identification of interaction effects between genome- and environment-wide associations in type 2 diabetes mellitus. Hum Genet 2013;132:495–508
    1. Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet 2019;51:584–591
    1. Braveman PA, Cubbin C, Egerter S, Williams DR, Pamuk E. Socioeconomic disparities in health in the United States: what the patterns tell us. Am J Public Health 2010;100(Suppl. 1):S186–S196
    1. Drope J, Liber AC, Cahn Z, et al. . Who’s still smoking? Disparities in adult cigarette smoking prevalence in the United States. CA Cancer J Clin 2018;68:106–115
    1. Denny JC, Rutter JL, Goldstein DB, et al. .; All of Us Research Program Investigators . The “All of Us” research program. N Engl J Med 2019;381:668–676
    1. Chen Z, Chen J, Collins R, et al. .; China Kadoorie Biobank (CKB) collaborative group . China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol 2011;40:1652–1666

Source: PubMed

3
Sottoscrivi