| Literature DB >> 35682375 |
Mukkesh Kumar1,2,3, Li Ting Ang1,2, Hang Png1,2, Maisie Ng1,2, Karen Tan1, See Ling Loy4,5, Kok Hian Tan4,6, Jerry Kok Yen Chan4,5,7,8, Keith M Godfrey9, Shiao-Yng Chan1,8, Yap Seng Chong1,8, Johan G Eriksson1,8,10,11, Mengling Feng3,12, Neerja Karnani1,2,13.
Abstract
The increasing prevalence of gestational diabetes mellitus (GDM) is contributing to the rising global burden of type 2 diabetes (T2D) and intergenerational cycle of chronic metabolic disorders. Primary lifestyle interventions to manage GDM, including second trimester dietary and exercise guidance, have met with limited success due to late implementation, poor adherence and generic guidelines. In this study, we aimed to build a preconception-based GDM predictor to enable early intervention. We also assessed the associations of top predictors with GDM and adverse birth outcomes. Our evolutionary algorithm-based automated machine learning (AutoML) model was implemented with data from 222 Asian multi-ethnic women in a preconception cohort study, Singapore Preconception Study of Long-Term Maternal and Child Outcomes (S-PRESTO). A stacked ensemble model with a gradient boosting classifier and linear support vector machine classifier (stochastic gradient descent training) was derived using genetic programming, achieving an excellent AUC of 0.93 based on four features (glycated hemoglobin A1c (HbA1c), mean arterial blood pressure, fasting insulin, triglycerides/HDL ratio). The results of multivariate logistic regression model showed that each 1 mmol/mol increase in preconception HbA1c was positively associated with increased risks of GDM (p = 0.001, odds ratio (95% CI) 1.34 (1.13-1.60)) and preterm birth (p = 0.011, odds ratio 1.63 (1.12-2.38)). Optimal control of preconception HbA1c may aid in preventing GDM and reducing the incidence of preterm birth. Our trained predictor has been deployed as a web application that can be easily employed in GDM intervention programs, prior to conception.Entities:
Keywords: Asian populations; HbA1c; digital health; gestational diabetes mellitus; machine learning; preconception care; prediction; preterm birth; public health; risk factors
Mesh:
Year: 2022 PMID: 35682375 PMCID: PMC9180245 DOI: 10.3390/ijerph19116792
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Sample Participant characteristics. Sample selection flowchart of 222 preconception women who had complete data on demographics, medical/obstetric history, physical measures, blood-derived markers, lifestyle factors and antenatal OGTT for machine learning models.
Participant characteristics at preconception baseline. Participant characteristics table on demographics, medical/obstetric history, physical measures, blood-derived markers, lifestyle factors, metabolic indices, prediabetes status, antenatal OGTT and adverse birth outcomes. Continuous variables are presented as group mean value and standard deviation. Categorical variables are presented as count and percentage.
| S-PRESTO | |
|---|---|
|
| |
| Age (years), mean ± SD | 30.51 ± 3.11 |
| Ethnicity, | |
| Chinese | 176 (79.28) |
| Malay | 30 (13.51) |
| Indian | 16 (7.21) |
|
| |
| Family history of diabetes mellitus, | |
| Yes | 63 (28.38) |
| No | 159 (71.62) |
| History of GDM, | |
| Yes | 6 (2.70) |
| No | 216 (97.30) |
| Parity, | |
| Nulliparous | 140 (63.06) |
| Multiparous | 82 (36.94) |
| Medical history of high blood pressure, | |
| Yes | 0 (0.00) |
| No | 222 (100.00) |
|
| |
| Pre-pregnancy weight (kg), mean ± SD | 59.31 ± 11.80 |
| Maternal height (cm), mean ± SD | 159.96 ± 5.55 |
| Pre-pregnancy BMI (kg/m2), mean ± SD | 23.18 ± 4.52 |
| Waist circumference (cm), mean ± SD | 81.35 ± 10.10 |
| Mid-upper arm circumference (cm), mean ± SD | 27.27 ± 4.08 |
| Systolic blood pressure (mm Hg), mean ± SD | 104.15 ± 8.92 |
| Diastolic blood pressure (mm Hg), mean ± SD | 67.38 ± 7.51 |
| Mean arterial blood pressure (mm Hg), mean ± SD | 79.63 ± 7.48 |
|
| |
| HbA1c (mmol/mol), mean ± SD | 31.80 ± 2.73 |
| Fasting glucose (mmol/L), mean ± SD | 4.72 ± 0.33 |
| Fasting insulin (mU/L), mean ± SD | 5.97 ± 4.83 |
| Triglycerides (mmol/L), mean ± SD | 0.81 ± 0.38 |
| High density lipoprotein cholesterol (mmol/L), mean ± SD | 1.48 ± 0.28 |
| Gamma-glutamyl transferase (U/L), mean ± SD | 18.99 ± 14.28 |
|
| |
| Self-reported smoking, | |
| Yes | 6 (2.70) |
| No | 216 (97.30) |
| Self-reported alcohol consumption, | |
| Yes | 159 (71.62) |
| No | 63 (28.38) |
|
| |
| Homeostasis model assessment-insulin resistance (HOMA-IR) index, mean ± SD | 1.27 ± 1.08 |
| Triglycerides/high density lipoprotein cholesterol ratio | 0.59 ± 0.41 |
| Fatty liver index, mean ± SD | 5.61 ± 10.38 |
| Metabolic syndrome, | |
| Yes | 7 (3.15) |
| No | 215 (96.85) |
|
| |
| Impaired fasting glucose (IFG), | 0 (0.00) |
| Impaired glucose tolerance (IGT), | 11 (5.00) |
| Type 2 diabetes (T2D), | 0 (0.00) |
| Normal glucose metabolism, | 209 (95.00) |
|
| |
| Glucose measures (mmol/L), mean ± SD | |
| Fasting glucose | 4.28 ± 0.35 |
| 1-hour glucose | 7.99 ± 1.52 |
| 2-hour glucose | 6.68 ± 1.27 |
| GDM, | |
| IADPSG/WHO 2013 criteria | 29 (13.06) |
|
| |
| Preterm birth, | |
| Yes | 10 (4.50) |
| No | 212 (95.50) |
| Low birthweight at term, | |
| Yes | 7 (3.24) |
| No | 209 (96.76) |
| Large for gestational age infant, | |
| Yes | 34 (15.74) |
| No | 182 (84.26) |
Figure 2SHAP Global Importance Plot. Global importance of individual features and their correlation with GDM/non-GDM outcomes estimated using the Shapley values computed from coalitional game theory. SHAP values represent a change in log odds ratio. SHAP values of zero means that the feature does not contribute to the prediction.
Construction of preconception predictive risk model. The preconception predictive risk model for GDM was sequentially constructed using top predictors with SHAP value magnitudes greater than zero in the AutoML feature selection model. The optimal machine learning pipeline for each model and area under the receiver operating characteristic curve (AUC) performance metric are reported. The proposed AutoML model was also robust when replacing HbA1c with fasting glucose (AUC: 0.87), replacing mean arterial blood pressure with systolic blood pressure (AUC: 0.91) and replacing fasting insulin with HOMA-IR index (AUC: 0.91) (Supplementary Table S1). HbA1c had the greatest impact on model performance changes (ΔAUC = −0.06), followed by mean arterial blood pressure (ΔAUC = −0.02) and fasting insulin (ΔAUC = −0.02). Given these observations, maternal insulin resistance around conception can be postulated as an important determinant in the pathophysiology of metabolic diseases and fetal programming.
| Features | Optimal Machine Learning Pipeline | AUC |
|---|---|---|
| 1: HbA1c | Gradient boosting classifier. | 0.81 |
| 2: HbA1c + fatty liver index | Stacked ensemble model with logistic regression classifier, multinomial naïve Bayes classifier and multi-layer perceptron classifier. | 0.78 |
| 3: HbA1c + fatty liver index + mean arterial blood pressure | Stacked ensemble model with k-nearest neighbors classifier and decision tree classifier. | 0.82 |
| 4: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin | Stacked ensemble model with k-nearest neighbors classifier and decision tree classifier. | 0.88 |
|
| Extra trees classifier. |
|
| 6: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height | Stacked ensemble model with logistic regression classifier (stochastic gradient descent training) and k-nearest neighbors classifier. | 0.89 |
| 7: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age | Multi-layer perceptron classifier. | 0.88 |
| 8: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference | Stacked ensemble model with Bernoulli naïve Bayes classifier, gaussian naïve Bayes classifier, multinomial naïve Bayes classifier and linear support vector machine classifier. | 0.93 |
| 9: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference + BMI | Stacked ensemble model with extra trees classifier, Bernoulli naïve Bayes classifier and gaussian naïve Bayes classifier. | 0.85 |
| 10: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference + BMI + parity | Stacked ensemble model with k-nearest neighbors classifier and multi-layer perceptron classifier. | 0.85 |
| 11: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference + BMI + parity + alcohol consumption | Stacked ensemble model with gradient boosting classifier, multi-layer perceptron classifier and linear support vector machine classifier. | 0.90 |
| 12: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference + BMI + parity + alcohol consumption + family history of diabetes mellitus | Stacked ensemble model with multinomial naïve Bayes classifier and multi-layer perceptron classifier. | 0.87 |
| 13: HbA1c + fatty liver index + mean arterial blood pressure + fasting insulin + TG/HDL ratio + height + age + mid-upper arm circumference + BMI + parity + alcohol consumption + family history of diabetes mellitus + Chinese ethnicity | Stacked ensemble model with multinomial naïve Bayes classifier and multi-layer perceptron classifier. | 0.87 |
| 14: Mean arterial blood pressure + height + age + mid-upper arm circumference + BMI + parity + alcohol consumption + family history of diabetes mellitus + Chinese ethnicity | Stacked ensemble model with linear support vector machine classifier (stochastic gradient descent training), Bernoulli naïve Bayes classifier, multinomial naïve Bayes classifier, multi-layer perceptron classifier and linear support vector machine classifier. | 0.81 |
Associations of top predictors and GDM outcome. Associations of top predictors identified from AutoML feature selection model and GDM outcome. Statistical tests were conducted two-sided with a significance level of 5%. All confidence intervals (CIs) are presented two-sided with a confidence level of 95%. The odds ratios (ORs) with 95% CI are presented. A resultant p-value of less than 0.05 is considered statistically significant.
| Feature | GDM ( |
|---|---|
| OR (95% CI) | |
| HbA1c (mmol/mol) | OR: 1.31 (1.12–1.53) |
| Fatty liver index | OR: 1.01 (0.98–1.05) |
| Mean arterial blood pressure (mm Hg) | OR: 0.99 (0.94–1.04) |
| Fasting insulin (mU/L) | OR: 1.05 (0.99–1.12) |
| Triglycerides/high density lipoprotein cholesterol ratio | OR: 1.45 (0.65–3.28) |
| Maternal height (cm) | OR: 0.96 (0.90–1.04) |
| Age (years) | OR: 0.97 (0.86–1.10) |
| Mid-upper arm circumference (cm) | OR: 1.05 (0.96–1.15) |
| BMI (kg/m2) | OR: 1.05 (0.97–1.13) |
| Parity | OR: 0.74 (0.32–1.71) |
| Self-reported alcohol consumption | OR: 2.06 (0.75–5.67) |
| Family history of diabetes mellitus | OR: 1.39 (0.61–3.18) |
| Chinese vs. Malay/Indian ethnicity | OR: 1.29 (0.47–3.60) |
| Feature | GDM ( |
| OR (95% CI) | |
| HbA1c (mmol/mol) ^ | OR: 1.34 (1.13–1.60) |
| Feature | GDM ( |
| OR (95% CI) | |
| HbA1c (mmol/mol) #,^ | OR: 1.32 (1.10–1.59) |
* Statistically significant feature. ^ Adjusted for maternal ethnicity, age, parity, family history of diabetes mellitus and pre-pregnancy BMI. # After excluding 11 women with prediabetes (impaired glucose tolerance) based on preconception OGTT.
Associations of top predictors and adverse birth outcomes (preterm birth, low birthweight at term and large for gestational age infant). Associations of top predictors identified from AutoML feature selection model and adverse birth outcomes (preterm birth, low birthweight at term and large for gestational age infant). Statistical tests were conducted two-sided with a significance level of 5%. All confidence intervals (CIs) are presented two-sided with a confidence level of 95%. The odds ratios (ORs) with 95% CI are presented. A resultant p-value of less than 0.05 is considered statistically significant.
| Feature | Preterm Birth | Low Birthweight at Term | Large for Gestational Age Infant |
|---|---|---|---|
| OR (95% CI) | OR (95% CI) | OR (95% CI) | |
| HbA1c (mmol/mol) | OR: 1.28 (1.01–1.62) | OR: 1.13 (0.86–1.49) | OR: 1.06 (0.92–1.21) |
| Fatty liver index | OR: 1.00 (0.94–1.06) | OR: 0.89 (0.68–1.16) | OR: 1.06 (1.03–1.10) |
| Mean arterial blood pressure (mm Hg) | OR: 1.02 (0.94–1.11) | OR: 0.96 (0.86–1.06) | OR: 1.03 (0.98–1.08) |
| Fasting insulin (mU/L) | OR: 1.04 (0.96–1.14) | OR: 1.05 (0.95–1.15) | OR: 1.08 (1.01–1.16) |
| Triglycerides/high density lipoprotein cholesterol ratio | OR: 0.79 (0.13–4.76) | OR: 1.42 (0.34–6.00) | OR: 2.85 (1.30–6.21) |
| Maternal height (cm) | OR: 0.95 (0.84–1.07) | OR: 0.91 (0.78–1.05) | OR: 0.99 (0.93–1.06) |
| Age (years) | OR: 1.05 (0.86–1.29) | OR: 0.97 (0.76–1.24) | OR: 1.07 (0.95–1.20) |
| Mid-upper arm circumference (cm) | OR: 0.97 (0.82–1.15) | OR: 0.90 (0.71–1.14) | OR: 1.22 (1.12–1.33) |
| BMI (kg/m2) | OR: 1.00 (0.88–1.16) | OR: 0.84 (0.64–1.10) | OR: 1.18 (1.09–1.27) |
| Parity | OR: 1.75 (0.49–6.25) | OR: 1.29 (0.28–5.90) | OR: 1.89 (0.90–3.95) |
| Self-reported alcohol consumption | OR: 0.38 (0.11–1.35) | OR: 2.47 (0.29–20.97) | OR: 0.44 (0.21–0.94) |
| Family history of diabetes mellitus | OR: 1.73 (0.47–6.35) | OR: 1.91 (0.41–8.78) | OR: 1.95 (0.92–4.17) |
| Chinese vs. Malay/Indian ethnicity | OR: 0.59 (0.15–2.39) | OR: 0.33 (0.07–1.51) | OR: 0.55 (0.24–1.26) |
| Feature | Preterm Birth | ||
| OR (95% CI) | |||
| HbA1c (mmol/mol) ^ | OR: 1.63 (1.12–2.38) | ||
| Feature | Preterm Birth | ||
| OR (95% CI) | |||
| HbA1c (mmol/mol) #,^ | OR: 1.75 (1.14–2.67) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| Fatty liver index ^ | OR: 1.02 (0.96–1.08) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| Fasting insulin (mU/L) ^ | OR: 1.01 (0.92–1.10) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| Triglycerides/high density lipoprotein cholesterol ratio ^ | OR: 1.98 (0.76–5.10) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| Mid-upper arm circumference (cm) ^ | OR: 1.21 (0.93–1.58) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| BMI (kg/m2) ~ | OR: 1.20 (1.10–1.31) | ||
| Feature | Large for Gestational Age Infant | ||
| OR (95% CI) | |||
| Self-reported alcohol consumption ^ | OR: 0.47 (0.17–1.28) | ||
* Statistically significant feature. ^ Adjusted for maternal ethnicity, age, parity, family history of diabetes mellitus, pre-pregnancy BMI, GDM diagnosis, total gestational weight gain (derived by subtracting first antenatal visit weight from last antenatal visit weight) and child sex. ~ Adjusted for maternal ethnicity, age, parity, family history of diabetes mellitus, GDM diagnosis, total gestational weight gain (derived by subtracting first antenatal visit weight from last antenatal visit weight) and child sex. # After excluding 11 women with prediabetes (impaired glucose tolerance) based on preconception OGTT.