Machine Learning-Derived Prenatal Predictive Risk Model to Guide Intervention and Prevent the Progression of Gestational Diabetes Mellitus to Type 2 Diabetes: Prediction Model Development Study

Mukkesh Kumar, Li Ting Ang, Cindy Ho, Shu E Soh, Kok Hian Tan, Jerry Kok Yen Chan, Keith M Godfrey, Shiao-Yng Chan, Yap Seng Chong, Johan G Eriksson, Mengling Feng, Neerja Karnani, Mukkesh Kumar, Li Ting Ang, Cindy Ho, Shu E Soh, Kok Hian Tan, Jerry Kok Yen Chan, Keith M Godfrey, Shiao-Yng Chan, Yap Seng Chong, Johan G Eriksson, Mengling Feng, Neerja Karnani

Abstract

Background: The increasing prevalence of gestational diabetes mellitus (GDM) is concerning as women with GDM are at high risk of type 2 diabetes (T2D) later in life. The magnitude of this risk highlights the importance of early intervention to prevent the progression of GDM to T2D. Rates of postpartum screening are suboptimal, often as low as 13% in Asian countries. The lack of preventive care through structured postpartum screening in several health care systems and low public awareness are key barriers to postpartum diabetes screening.

Objective: In this study, we developed a machine learning model for early prediction of postpartum T2D following routine antenatal GDM screening. The early prediction of postpartum T2D during prenatal care would enable the implementation of effective strategies for diabetes prevention interventions. To our best knowledge, this is the first study that uses machine learning for postpartum T2D risk assessment in antenatal populations of Asian origin.

Methods: Prospective multiethnic data (Chinese, Malay, and Indian ethnicities) from 561 pregnancies in Singapore's most deeply phenotyped mother-offspring cohort study-Growing Up in Singapore Towards healthy Outcomes-were used for predictive modeling. The feature variables included were demographics, medical or obstetric history, physical measures, lifestyle information, and GDM diagnosis. Shapley values were combined with CatBoost tree ensembles to perform feature selection. Our game theoretical approach for predictive analytics enables population subtyping and pattern discovery for data-driven precision care. The predictive models were trained using 4 machine learning algorithms: logistic regression, support vector machine, CatBoost gradient boosting, and artificial neural network. We used 5-fold stratified cross-validation to preserve the same proportion of T2D cases in each fold. Grid search pipelines were built to evaluate the best performing hyperparameters.

Results: A high performance prediction model for postpartum T2D comprising of 2 midgestation features-midpregnancy BMI after gestational weight gain and diagnosis of GDM-was developed (BMI_GDM CatBoost model: AUC=0.86, 95% CI 0.72-0.99). Prepregnancy BMI alone was inadequate in predicting postpartum T2D risk (ppBMI CatBoost model: AUC=0.62, 95% CI 0.39-0.86). A 2-hour postprandial glucose test (BMI_2hour CatBoost model: AUC=0.86, 95% CI 0.76-0.96) showed a stronger postpartum T2D risk prediction effect compared to fasting glucose test (BMI_Fasting CatBoost model: AUC=0.76, 95% CI 0.61-0.91). The BMI_GDM model was also robust when using a modified 2-point International Association of the Diabetes and Pregnancy Study Groups (IADPSG) 2018 criteria for GDM diagnosis (BMI_GDM2 CatBoost model: AUC=0.84, 95% CI 0.72-0.97). Total gestational weight gain was inversely associated with postpartum T2D outcome, independent of prepregnancy BMI and diagnosis of GDM (P=.02; OR 0.88, 95% CI 0.79-0.98).

Conclusions: Midgestation weight gain effects, combined with the metabolic derangements underlying GDM during pregnancy, signal future T2D risk in Singaporean women. Further studies will be required to examine the influence of metabolic adaptations in pregnancy on postpartum maternal metabolic health outcomes. The state-of-the-art machine learning model can be leveraged as a rapid risk stratification tool during prenatal care.

Trial registration: ClinicalTrials.gov NCT01174875; https://ichgcp.net/clinical-trials-registry/NCT01174875.

Keywords: Asian populations; diabetes management; digital health; gestational diabetes mellitus; machine learning; prediction models; prenatal care; public health; risk factors; type 2 diabetes.

Conflict of interest statement

Conflicts of Interest: NK, KMG, SYC, and YSC are part of an academic consortium that has received research funding from Abbott Nutrition, Nestec Inc, BenevolentAI Bio Ltd, and Danone. MF was partially supported by the National Research Foundation Singapore under its AI Singapore Program (award AISG-GC-2019-001-2A). Other authors declare no conflicts of interest.

©Mukkesh Kumar, Li Ting Ang, Cindy Ho, Shu E Soh, Kok Hian Tan, Jerry Kok Yen Chan, Keith M Godfrey, Shiao-Yng Chan, Yap Seng Chong, Johan G Eriksson, Mengling Feng, Neerja Karnani. Originally published in JMIR Diabetes (https://diabetes.jmir.org), 05.07.2022.

Figures

Figure 1
Figure 1
Pearson Correlation heatmap for abnormal glucose metabolism (AGM). GDM: gestational diabetes mellitus.
Figure 2
Figure 2
Pearson Correlation heatmap for type 2 diabetes (T2D). GDM: gestational diabetes mellitus.
Figure 3
Figure 3
SHapley Additive exPlanations (SHAP) summery plot of feature selection model. WHO: World Health Organization.
Figure 4
Figure 4
SHapley Additive exPlanations (SHAP) summary plot of BMI_GDM model. WHO: World Health Organization.
Figure 5
Figure 5
Validation curve with CatBoost algorithm–Varying learning rate. AUC: area under the receiver operating characteristic curve.
Figure 6
Figure 6
Validation curve with CatBoost algorithm–Varying L2 leaf regularization. AUC: area under the receiver operating characteristic curve.
Figure 7
Figure 7
Validation curve with CatBoost algorithm–Varying random strength. AUC: area under the receiver operating characteristic curve.

References

    1. IDF Diabetes Atlas 9th edition. International Diabetes Federation. 2019. [2022-05-24].
    1. Vounzoulaki E, Khunti K, Abner SC, Tan BK, Davies MJ, Gillies CL. Progression to type 2 diabetes in women with a known history of gestational diabetes: systematic review and meta-analysis. BMJ. 2020 May 13;369:m1361. doi: 10.1136/bmj.m1361.
    1. Chen L, Soh SE, Tint M, Loy SL, Yap F, Tan KH, Lee YS, Shek LP, Godfrey KM, Gluckman PD, Eriksson JG, Chong Y, Chan S. Combined analysis of gestational diabetes and maternal weight status from pre-pregnancy through post-delivery in future development of type 2 diabetes. Sci Rep. 2021 Mar 03;11(1):5021. doi: 10.1038/s41598-021-82789-x. doi: 10.1038/s41598-021-82789-x.10.1038/s41598-021-82789-x
    1. American Diabetes Association 2. Classification and Diagnosis of Diabetes: Standards of Medical Care in Diabetes-2020. Diabetes Care. 2020 Jan 20;43(Suppl 1):S14–S31. doi: 10.2337/dc20-S002.43/Supplement_1/S14
    1. Nouhjah S, Shahbazian H, Amoori N, Jahanfar S, Shahbazian N, Jahanshahi A, Cheraghian B. Postpartum screening practices, progression to abnormal glucose tolerance and its related risk factors in Asian women with a known history of gestational diabetes: A systematic review and meta-analysis. Diabetes Metab Syndr. 2017 Dec;11 Suppl 2:S703–S712. doi: 10.1016/j.dsx.2017.05.002.S1871-4021(17)30133-9
    1. Sanderson H, Loveman E, Colquitt J, Royle P, Waugh N, Tan BK. Improving uptake of postnatal checking of blood glucose in women who had gestational diabetes mellitus in universal healthcare settings: a systematic review. J Clin Med. 2018 Dec 20;8(1):4. doi: 10.3390/jcm8010004. jcm8010004
    1. Allalou A, Nalla A, Prentice KJ, Liu Y, Zhang M, Dai FF, Ning X, Osborne LR, Cox BJ, Gunderson EP, Wheeler MB. A Predictive metabolic signature for the transition from gestational diabetes Mellitus to type 2 diabetes. Diabetes. 2016 Sep 23;65(9):2529–39. doi: 10.2337/db15-1720. db15-1720
    1. Joglekar MV, Wong WKM, Ema FK, Georgiou HM, Shub A, Hardikar AA, Lappas M. Postpartum circulating microRNA enhances prediction of future type 2 diabetes in women with previous gestational diabetes. Diabetologia. 2021 Jul 23;64(7):1516–1526. doi: 10.1007/s00125-021-05429-z.10.1007/s00125-021-05429-z
    1. Diagnostic criteria and classification of hyperglycaemia first detected in pregnancy. WHO. 2013. [2022-05-20]. .
    1. Definition and diagnosis of diabetes mellitus and intermediate hyperglycaemia: report of a WHO/IDF consultation. WHO. 2006. [2022-05-20]. .
    1. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020 Jan;2(1):56–67. doi: 10.1038/s42256-019-0138-9.
    1. Prokhorenkova L, Gusev G, Vorobev A, Veronika DA, Gulin A. CatBoost: unbiased boosting with categorical features. ArXiv. Preprint posted online on June 28, 2017. 2022 doi: 10.48550/arXiv.1706.09516.
    1. Zeng Z, Liu F, Li S. Metabolic adaptations in pregnancy: a review. Ann Nutr Metab. 2017 Mar 16;70(1):59–65. doi: 10.1159/000459633. 000459633
    1. GUSTO_Type-2-Diabetes-Mellitus. GitHub. [2022-05-31]. .
    1. Machine learning derived prenatal predictive risk model to guide intervention and prevent the progression of Gestational Diabetes Mellitus to Type 2 Diabetes. Group Mornin. [2022-05-31]. .

Source: PubMed

3
Abonnieren