| Literature DB >> 27338739 |
Amina Allalou1, Amarnadh Nalla2, Kacey J Prentice3, Ying Liu3, Ming Zhang3, Feihan F Dai3, Xian Ning4, Lucy R Osborne5, Brian J Cox6, Erica P Gunderson7, Michael B Wheeler8.
Abstract
Gestational diabetes mellitus (GDM) affects 3-14% of pregnancies, with 20-50% of these women progressing to type 2 diabetes (T2D) within 5 years. This study sought to develop a metabolomics signature to predict the transition from GDM to T2D. A prospective cohort of 1,035 women with GDM pregnancy were enrolled at 6-9 weeks postpartum (baseline) and were screened for T2D annually for 2 years. Of 1,010 women without T2D at baseline, 113 progressed to T2D within 2 years. T2D developed in another 17 women between 2 and 4 years. A nested case-control design used 122 incident case patients matched to non-case patients by age, prepregnancy BMI, and race/ethnicity. We conducted metabolomics with baseline fasting plasma and identified 21 metabolites that significantly differed by incident T2D status. Machine learning optimization resulted in a decision tree modeling that predicted T2D incidence with a discriminative power of 83.0% in the training set and 76.9% in an independent testing set, which is far superior to measuring fasting plasma glucose levels alone. The American Diabetes Association recommends T2D screening in the early postpartum period via oral glucose tolerance testing after GDM, which is a time-consuming and inconvenient procedure. Our metabolomics signature predicted T2D incidence from a single fasting blood sample. This study represents the first metabolomics study of the transition from GDM to T2D validated in an independent testing set, facilitating early interventions.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27338739 PMCID: PMC5001181 DOI: 10.2337/db15-1720
Source DB: PubMed Journal: Diabetes ISSN: 0012-1797 Impact factor: 9.461
Figure 1Study design and metabolic assay work flow. A: Study design of the SWIFT prospective cohort, a total of 1,035 women in whom GDM was diagnosed were enrolled at 6–9 weeks postpartum (baseline) and screened via 2-h 75-g OGTTs. At baseline (V1), 21 women with T2D and 4 ineligible women were excluded from the follow-up. The study observed 1,010 participants without diabetes who were rescreened annually via OGTTs with retention rates of 85% and 83% for 1 and 2 years, respectively. Prospective cohort sample sizes for non-T2D and incident T2D are as follows: T2D developed in 59 women at 1 year and in 54 women at 2 years; T2D developed in another 17 women beyond 2–4 years postbaseline. B: Work flow of metabolomics assay. A total of 182 metabolites were assayed in plasma from V1 (baseline) using LC-MS/MS, gas chromatography–mass spectrometry (GC-MS), and ELISA. For further methodology, please refer to Supplementary Table 1.
Baseline (6–9 weeks postpartum) and follow-up characteristics of SWIFT women with GDM in the training and testing sets (n = 122 pairs)
| Characteristics | Training set | Testing set | ||
|---|---|---|---|---|
| Non-T2D
( | Incident T2D
( | Non-T2D
( | Incident T2D
( | |
| Sociodemographic/clinical | ||||
| Age, years | 33.1 (4.5) | 33.3 (5.2) | ||
| Race/ethnicity, | ||||
| Non-Hispanic white | 13 (16) | 12 (15) | 8 (19) | 9 (21) |
| Asian (East, South, Southeast) | 26 (33) | 26 (33) | 13 (31) | 10 (24) |
| Non-Hispanic black | 10 (12) | 10 (12) | 2 (5) | 5 (12) |
| Hispanic | 31 (39) | 31 (39) | 17 (41) | 17 (41) |
| Other | 0 (0) | 1 (1) | 2 (5) | 1 (2) |
| Parity, | ||||
| Primiparous (1 birth) | 31 (39) | 26 (33) | 13 (31) | 16 (38) |
| Biparous (2 births) | 27 (34) | 29 (36) | 14 (33) | 16 (38) |
| Multiparous (>2 births) | 22 (27) | 25 (31) | 15 (36) | 10 (24) |
| GDM prenatal treatment, | ||||
| Diet only | 50 (63) | 29 (69) | ||
| Oral medications | 28 (35) | 13 (31) | ||
| Insulin | 2 (2) | 0 (0) | ||
| Gestational age at GDM diagnosis (weeks) | 24.4 (7.5) | 22.0 (8.6) | 25.0 (7.1) | 23.3 (8.1) |
| Prepregnancy BMI, kg/m2 | 33.3 (8.3) | 33.5 (8.4) | 32.6 (7.5) | 33.1 (7.6) |
| Postpartum 6–9 weeks BMI, kg/m2 | 33.2 (7.8) | 33.5 (7.7) | 32.4 (6.6) | 33.3 (7.6) |
| Hypertension history, | 16 (20) | 19 (24) | 8 (19) | 8 (19) |
| Family history of diabetes, | 42 (53) | 45 (56) | 19 (33) | |
| 6–9 weeks postpartum, lifestyle | ||||
| Smoker, | 2 (3) | 4 (5) | 1 (2) | 1 (2) |
| Physical activity, met-h/week | 47.4 (21.0) | 54.2 (25.1) | 49.4 (21.6) | 48.8 (24.9) |
| Total energy intake, kcal/day | 811 (319) | 805 (338) | 774 (340) | 900.4 (297) |
| Lactation intensity groups, | ||||
| Exclusive lactation | 20 (25) | 10 (12) | 8 (19) | 8 (19) |
| Mostly lactation | 30 (38) | 28 (35) | 15 (36) | 17 (41) |
| Mostly formula/mixed | 18 (22) | 19 (24) | 10 (24) | 12 (29) |
| Exclusive formula | 12 (15) | 23 (29) | 9 (21) | 5 (12) |
| 6–9 weeks postpartum, plasma | ||||
| FPG, mg/dL | 95 (8.4) | 93.5 (7.8) | ||
| 2hPG, mg/dL | 109 (25.9) | 116 (28.5) | ||
| Fasting insulin, µU/mL | 26 (14.8) | 25.6 (12.1) | 29.1 (20) | |
| Fasting triglycerides, mg/dL | 128 (90.7) | 150 (105.2) | 134 (79.6) | 151.3 (106) |
| Fasting HDL-C, mg/dL | 49 (13.2) | 49 (13.0) | 51.5 (13.0) | 49.4 (10.9) |
| HOMA-IR | 6.1 (3.7) | 5.97 (3.0) | 7.47 (5.9) | |
| HOMA-B | 299 (183) | 305 (156) | 313 (153) | 284 (193) |
| Postbaseline, 2-year follow-up | ||||
| Subsequent birth, | 5 (6) | 5 (6) | 9 (21) | |
| Follow-up in months, median (IQR) | 22.4 (1.9) | 21.8 (2.8) | 18.3 (12.5) | |
Data are presented as the mean (SD) unless otherwise noted. Plasma values are from the SWIFT database (26).
*P < 0.05 between incident T2D and non-T2D groups;
‡Determined by χ2 test;
†P < 0.05 between training and testing sets. Specific differences between specific characteristics are shown in boldface type.
§Determined by Wilcoxon sum rank tests for medians.
Metabolites that significantly differ in incident T2D in the training set (n = 80 pairs)
| No. | Metabolites | Non-T2D | Incident T2D | Uncorrected | Corrected |
|---|---|---|---|---|---|
| 1 | 2-AAA | 1.06 ± 0.44 | 1.27 ± 0.54 | 8.02E-03 | 1.01E-01 |
| 2 | Gly | 311.1 ± 112.63 | 279.14 ± 71.7 | 3.38E-02 | 2.31E-01 |
| 3 | Ile | 46.94 ± 9.09 | 51.39 ± 11.8 | 8.30E-03 | 1.01E-01 |
| 4 | Leu | 115.05 ± 21.79 | 126.34 ± 29.01 | 6.05E-03 | 9.50E-02 |
| 5 | Thr | 141.13 ± 27.78 | 154.77 ± 43.81 | 1.99E-02 | 1.83E-01 |
| 6 | Trp | 66.76 ± 8.31 | 70.52 ± 10.99 | 1.57E-02 | 1.57E-01 |
| 7 | Tyr | 94.82 ± 17.48 | 106.33 ± 24.51 | 7.95E-04 | |
| 8 | Val | 230.79 ± 35.52 | 252.44 ± 45.63 | 1.01E-03 | |
| 9 | xLeu | 200.69 ± 29.18 | 220.64 ± 43.67 | 8.63E-04 | |
| 10 | Hexoses | 4.7 ± 0.51 | 5.16 ± 0.63 | 1.13E-06 | |
| 11 | SM (OH) C16:1 | 2.87 ± 0.69 | 2.62 ± 0.8 | 3.87E-02 | 2.31E-01 |
| 12 | SM (OH) C22:2 | 7.13 ± 1.45 | 6.59 ± 1.83 | 3.90E-02 | 2.31E-01 |
| 13 | SM C18:0 | 17.21 ± 3.83 | 15.82 ± 4.19 | 2.98E-02 | 2.31E-01 |
| 14 | SM C18:1 | 8.91 ± 2.01 | 7.94 ± 2.21 | 4.11E-03 | 7.54E-02 |
| 15 | SM C20:2 | 0.42 ± 0.12 | 0.34 ± 0.12 | 1.33E-04 | |
| 16 | SM C24:1 | 26.86 ± 5.52 | 24.52 ± 6.44 | 1.47E-02 | 1.57E-01 |
| 17 | PC ae C40:5 | 4.81 ± 1.21 | 4.36 ± 1.59 | 4.32E-02 | 2.31E-01 |
| 18 | PC ae C42:5 | 2.27 ± 0.46 | 2.08 ± 0.59 | 2.42E-02 | 2.05E-01 |
| 19 | PC ae C44:5 | 1.18 ± 0.25 | 1.09 ± 0.32 | 4.47E-02 | 2.31E-01 |
| 20 | AC10 | 0.25 ± 0.08 | 0.22 ± 0.06 | 4.63E-02 | 2.31E-01 |
| 21 | AC3 | 0.28 ± 0.08 | 0.31 ± 0.1 | 4.55E-02 | 2.31E-01 |
| 22 | Palmitoleic acid (C16:1 n9) | 2.76 ± 0.96 | 2.45 ± 0.86 | 3.86E-02 | 2.31E-01 |
Data are presented as the mean ± SD, unless otherwise noted. Concentrations of metabolites are in μmol/L except for hexoses (mmol/L).
*P values are corrected for multiple comparisons with the Benjamini-Hochberg method and significant metabolites are shown in boldface type.
+Metabolites assayed using both Biocrates plate technology and in-house method, but xLeu was excluded for prediction analysis. Significant differences after Benjamini-Hochberg correction for multiple comparisons are shown in boldface type.
Figure 2Decision tree and ROC for the prediction of incident T2D. A: Decision tree by the J48 machine learner based on the combined AUCs and F scores of all algorithms. The gray boxes indicate the metabolite chosen for the node, whereas the clear numbered boxes indicate the concentration threshold in μmol/L for PC ae C40:5, BCAA, and SM (OH) C14:1 and in mmol/L for hexoses. The percentage below each group indicates percent of instances correctly classified. B: ROC curve of the J48 machine learner algorithm for the training and testing sets, performing with discriminative power of 0.830 (P < 0.000001) and 0.769 (P < 0.0001), respectively, which is greater than for FPG alone 0.724 (P < 0.0001) and 0.706 (P < 0.01), as well as for 2hPG alone 0.726 (P < 0.000001) and 0.661 (P < 0.05), respectively. Data are presented as the AUC.
Comparison of FPG, 2hPG and metabolites optimized machine learning performance, indicating greatest performance in the metabolite model
| Sets | Parameters | Optimized machine learner algorithm | AUC | Se | Sp | Accuracy | P | Best model score ( | |
|---|---|---|---|---|---|---|---|---|---|
| Training | FPG | LR | 0.724 (0.645–0.803) | 60.00% | 75.00% | 67.50% | 70.60% | 64.90% | 1.373 |
| 2hPG | LR | 0.726 (0.648–0.804) | 58.75% | 72.50% | 65.63% | 68.12% | 63.09% | 1.3569 | |
| Metabolite model | DT | 0.830 (0.765–0.894) | 86.30% | 68.80% | 77.50% | 73.40% | 79.30% | 1.623 | |
| Testing | FPG | LR | 0.706 (0.596–0.816) | 57.10% | 66.70% | 61.90% | 63.20% | 60.00% | 1.306 |
| 2hPG Model | LR | 0.661 (0.543–0.779) | 57.10% | 71.40% | 64.30% | 66.70% | 61.50% | 1.276 | |
| Metabolite model | DT | 0.769 (0.667–0.871) | 73.80% | 69.10% | 71.40% | 70.50% | 72.10% | 1.490 | |
| Glucose model (FPG and 2hPG) | DT | 0.732 | 88.10% | 47.60% | 67.90% | 62.70% | 73.30% | 1.465 | |
| Combined model | NB | 0.754 | 54.80% | 76.20% | 65.50% | 69.70% | 61.30% | 1.367 |
DT, J48 decision tree; LR, logistic regression, NB, naive Bayes.
*Data are presented as the mean and 95% CI.
Figure 3Comparison of Venn diagrams and contingency tables of model predictions of future diabetes. A: Venn diagrams of correct and incorrect predictions of the testing data set for all patients; only patients with incident T2D and patients with non-T2D (Non) are shown. Intersection of correct predictions (green) and incorrect (red) indicates that one or more models had identical prediction of a patient, and the other models did not. Although the correct and incorrect patient predictions appear similar across all three models (left), the glucose and combined models have worse performance for the prediction of future diabetes (middle). The combined model has worse prediction for control subjects (right). B: Contingency tables of the three different models against the testing data set. Columns are known group labels, and rows are predicted group labels. The metabolite model (left) shows the higher P and Sp compared with the glucose model. The combined model (right) has overall poorer Se and Sp compared with both the metabolite and glucose models alone.
Pearson correlation coefficients (r) between 22 metabolites that significantly differ in incident T2D compared with non-T2D, as well as metabolite selected by machine learning (SM (OH) C14:1), in the training set (80 pairs) at baseline and clinical parameters BMI, FPG, 2hPG, fasting insulin, and HOMA-IR at baseline
| Parameter and metabolite | BMI (kg/m2) | Fasting glucose (mg/dL) | 2hPG (glucose mg/dL) | Fasting insulin (μU/mL) | HOMA-IR |
|---|---|---|---|---|---|
| 2-AAA | 0.210 | 0.283 | 0.115 | 0.335 | 0.353 |
| Gly | −0.151 | −0.192 | −0.173 | −0.279 | −0.281 |
| Ile | 0.230 | 0.278 | 0.144 | 0.415 | 0.437 |
| Leu | 0.055 | 0.242 | 0.15 | 0.343 | 0.367 |
| Thr | 0.218 | 0.156 | 0.025 | 0.150 | 0.153 |
| Trp | −0.161 | 0.22 | 0.061 | 0.171 | 0.187 |
| Tyr | 0.205 | 0.252 | 0.028 | 0.335 | 0.353 |
| Val | 0.073 | 0.235 | 0.161 | 0.409 | 0.418 |
| AC10 | −0.022 | −0.165 | 0.139 | −0.201 | −0.202 |
| AC3 | 0.104 | 0.306 | 0.184 | 0.362 | 0.387 |
| xLeu+ | 0.118 | 0.311 | 0.197 | 0.481 | 0.508 |
| Hexoses | 0.16 | 0.826 | 0.211 | 0.311 | 0.429 |
| Palmitoleic acid (C16:1n9) | 0.246 | −0.1 | −0.009 | 0.098 | 0.068 |
| PC ae C40:5 | −0.252 | −0.054 | 0.081 | −0.329 | −0.311 |
| PC ae C42:5 | −0.115 | −0.033 | 0.018 | −0.266 | −0.252 |
| PC ae C44:5 | −0.006 | −0.177 | −0.182 | −0.204 | −0.217 |
| SM C18:0 | −0.181 | −0.150 | 0.028 | −0.266 | −0.272 |
| SM C18:1 | −0.049 | −0.157 | −0.039 | −0.254 | −0.263 |
| SM C20:2 | −0.092 | −0.284 | −0.122 | −0.358 | −0.376 |
| SM C24:1 | −0.277 | −0.246 | −0.025 | −0.475 | −0.475 |
| SM (OH) C14:1 | −0.136 | −0.207 | −0.175 | −0.257 | −0.279 |
| SM (OH) C16:1 | −0.161 | −0.199 | −0.087 | −0.315 | −0.329 |
| SM (OH) C22:2 | −0.201 | −0.226 | −0.034 | −0.378 | −0.385 |
+P = 0.05.
*P < 0.05.
**P < 0.005.
***P < 0.0005.