| Literature DB >> 36104698 |
Lauren D Liao1, Assiamira Ferrara2, Mara B Greenberg3,4, Amanda L Ngo2, Juanran Feng2, Zhenhua Zhang5, Patrick T Bradshaw6, Alan E Hubbard1, Yeyi Zhu7,8.
Abstract
BACKGROUND: Gestational diabetes (GDM) is prevalent and benefits from timely and effective treatment, given the short window to impact glycemic control. Clinicians face major barriers to choosing effectively among treatment modalities [medical nutrition therapy (MNT) with or without pharmacologic treatment (antidiabetic oral agents and/or insulin)]. We investigated whether clinical data at varied stages of pregnancy can predict GDM treatment modality.Entities:
Keywords: Gestational diabetes; Machine learning; Pharmacologic treatment; Prediction; Pregnancy; Risk stratification; Treatment modality
Mesh:
Substances:
Year: 2022 PMID: 36104698 PMCID: PMC9476287 DOI: 10.1186/s12916-022-02499-7
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 11.150
Fig. 1Flowchart for developing pregnancies cohort with gestational diabetes 2007–2017. GDM,: gestational diabetes mellitus
Characteristics of individuals with gestational diabetes at Kaiser Permanente Northern California, 2007–2017
| Age at childbirth, mean (SD), y | < 0.001 | |||
| 15–24 | 1624 (5.3) | 1497 (5.5) | 127 (3.9) | |
| 25–29 | 6057 (19.9) | 5522 (20.3) | 535 (16.5) | |
| 30–34 | 11,295 (37.1) | 10,018 (36.8) | 1277 (39.5) | |
| ≥ 35 | 11,498 (37.7) | 10,203 (37.5) | 1295 (40.0) | |
| Race/Ethnicity, | 0.028 | |||
| White | 6866 (22.5) | 6174 (22.7) | 692 (21.4) | |
| Hispanic | 8506 (27.9) | 7655 (28.1) | 851 (26.3) | |
| African American | 1319 (4.3) | 1174 (4.3) | 145 (4.5) | |
| Asian/Pacific Islander | 12,377 (40.6) | 10,990 (40.3) | 1387 (42.9) | |
| Other | 1406 (4.6) | 1247 (4.6) | 159 (4.9) | |
| Pre | < 0.001 | |||
| Underweight | 399 (1.3) | 344 (1.3) | 55 (1.7) | |
| Normal | 6850 (22.5) | 6147 (22.6) | 703 (21.7) | |
| Overweight | 10,095 (33.1) | 9106 (33.4) | 989 (30.6) | |
| Obese | 13,130 (43.1) | 11,643 (42.7) | 1487 (46.0) | |
| Median household income, annual, | < 0.001 | |||
| < $25,000 | 813 (2.7) | 562 (2.1) | 251 (7.8) | |
| $25,000–39,999 | 2816 (9.2) | 2495 (9.2) | 321 (9.9) | |
| $40,000–59,999 | 7169 (23.5) | 6463 (23.7) | 706 (21.8) | |
| $60,000–79,999 | 7796 (25.6) | 7010 (25.7) | 786 (24.3) | |
| ≥ $80,000 | 11,880 (39.0) | 10,710 (39.3) | 1170 (36.2) | |
| Nulliparity, n (%) | 12,419 (40.8) | 11,117 (40.8) | 1302 (40.3) | 0.559 |
| Gestational age at delivery, mean (SD), weeks | 38.3 (1.9) | 38.3 (1.9) | 38.2 (1.9) | 0.05 |
1Obtained by Student’s t test for continuous variables or Pearson’s chi-squared test for categorical variables
Area under the receiver operating characteristic curve prediction results predictors at varied stages of pregnancy
| Predictor levelsa | Dataset | AUC (95% CI) | |||
|---|---|---|---|---|---|
| 1 | Discovery set | 0.613 (0.603–0.622) | 0.670 (0.663–0.676) | 0.673 (0.667–0.679) | 0.683 (0.676–0.689) |
| Validation set | 0.592 (0.567–0.616) | 0.634 (0.615–0.653) | 0.635 (0.615–0.654) | 0.634 (0.615–0.653) | |
| 1, 2 | Discovery set | 0.618 (0.609–0.628) | 0.685 (0.678–0.691) | 0.688 (0.682–0.695) | 0.761 (0.756–0.767) |
| Validation set | 0.588 (0.563–0.613) | 0.647 (0.628–0.666) | 0.645 (0.626–0.664) | 0.648 (0.630–0.667) | |
| 1, 2, 3 | Discovery set | 0.740 (0.732–0.748) | 0.785 (0.780–0.791) | 0.790 (0.785–0.796) | 0.869 (0.865–0.873) |
| Validation set | 0.703 (0.682–0.724) | 0.750 (0.733–0.767) | 0.749 (0.733–0.766) | 0.754 (0.739–0.772) | |
| 1, 2, 3, 4 | Discovery set | 0.785 (0.777–0.792) | 0.849 (0.845–0.854) | 0.852 (0.848–0.857) | 0.934 (0.931–0.936) |
| Validation set | 0.745 (0.722–0.767) | 0.809 (0.794–0.823) | 0.808 (0.794–0.823) | 0.815 (0.800–0.829) | |
AUC, area under the receiver operating characteristic curve; CART, classification and regression tree; LASSO, least absolute shrinkage and selection operator
aLevel 1: 1-year preconception to last menstrual period; level 2: last menstrual period to before diagnosis of gestational diabetes; level 3: at the time of diagnosis of gestational diabetes; level 4: 1 week after diagnosis of gestational diabetes
bCandidate algorithms in simple super learner included response-mean, LASSO regression, and CART
cCandidate algorithms in complex super learner included response-mean, LASSO regression, CART, random forest, and extreme gradient boosting
Fig. 2Variable importance for predictors at level(s) A 1, B 1–2, C 1–3, and D 1–4. BP, blood pressure; C–C, Carpenter-Coustan’s criteria; GCT, glucose challenge test; GDM, gestational diabetes mellitus; HDL, high-density lipoprotein; LASSO, least absolute shrinkage and selection operator; OGTT, oral glucose tolerance test; PCOS, polycystic ovary syndrome; SMBG, self-monitored blood glucose. Level 1: 1-year preconception to last menstrual period; level 2: last menstrual period to before diagnosis of gestational diabetes; level 3: at the time of diagnosis of gestational diabetes; level 4: 1 week after diagnosis of gestational diabetes
Prediction results using final simplified logistic regression models with predictors at varied stages of pregnancy
| Level 1a | 0.632 (0.623–0.640) | 0.609 (0.587–0.632) | 0.073 | 0.609 (0.587–0.632) |
| Levels 1–2b | 0.648 (0.640–0.656) | 0.621 (0.599–0.643) | 0.075 | 0.621 (0.599–0.643) |
| Levels 1–3c | 0.770 (0.764–0.775) | 0.746 (0.730–0.763) | 0.072 | 0.752 (0.734–0.77) |
| Levels 1–4d | 0.825 (0.820–0.830) | 0.798 (0.783–0.813) | 0.038 | 0.802 (0.786–0.818) |
AUC, area under the receiver operating characteristic curve; CI, confidence interval; GDM, gestational diabetes
aPredictors included history of GDM, pre-pregnancy obesity, and prediabetes before pregnancy
bPredictors included history of GDM, pre-pregnancy obesity, glucose levels at 50-g, 1-h glucose challenge test for GDM screening (≥ 200 mg/dL), and prediabetes before pregnancy, in addition to three pairwise interactions between the first three predictors
cPredictors included fasting glucose value at 100-g, 3-h oral glucose tolerance test, gestational week at GDM diagnosis (continuous), and GDM diagnosis by Carpenter-Coustan criteria (versus by fasting hyperglycemia)
dPredictors included gestational week at GDM diagnosis (continuous), fasting glucose value at 100-g, 3-h oral glucose tolerance test, self-monitored glycemic control status at fasting, number of fasting self-monitored blood glucose measurements, and an interaction term between last two variables
Final models developed by simplified logistic regression
| Level 1 | − 0.856 to 0.005 * history of GDM + 0.741 * BMI obese + 0.800 * prediabetes before pregnancy |
| Levels 1–2 | − 1.001 + 0.572 * history of GDM + 0.579 * pre-pregnancy obesity + 0.774 * prediabetes before pregnancy + 0.733 * screening valuea − 0.323 * history of GDM * pre-pregnancy obesity − 0.577 * history of GDM * screening valuea + 0.480 * pre-pregnancy obesity * screening valuea |
| Levels 1–3 | − 4.468 + 0.074 * oral glucose tolerance testb − 0.063 * week of gestational agec − 1.435 diagnosis by C–C criteriad |
| Levels 1–4 | − 2.645 to 0.810 * meeting glycemic control goale + 0.167 * number of SMBG tests taken − 0.076 * week of gestational agec + 0.044 * oral glucose tolerance testb − 0.234 * meeting glycemic control goale * number of SMBG tests taken |
BMI, body mass index; GDM, gestational diabetes mellitus; SMBG, self-monitored blood glucose
The outcome is in log odds form, and coefficients have been rounded to the third decimal point
aGlucose levels at 50-g, 1-h glucose challenge test for GDM screening (≥ 200 mg/dL)
bFasting glucose value at 100-g, 3-h oral glucose tolerance test
cGestational week at GDM diagnosis (continuous)
dGDM diagnosis by Carpenter-Coustan criteria (versus by fasting hyperglycemia)
eSMBG control status for the fasting test measured during first week after GDM diagnosis
Fig. 3Pre- and post-calibration plots using logistic regression with level 1–4 predictors on the validation set. The simpler logistic regression model included gestational week at diagnosis of gestational diabetes, the diagnostic fasting glucose value, the status and frequency of self-monitored glycemic control at fasting during 1-week post diagnosis, and an interaction term of the last two variables. The dashed line indicates a perfectly calibrated model
Fig. 4Prediction results from A complex super learner and B logistic regression at varied pregnancy stages. (1) Complex super learner algorithm included response-mean, LASSO regression, CART, random forest, and extreme gradient boosting. The simpler logistic regression models were developed based on predictors selected in the complex super learner algorithms at each level, aiming to include a minimum set of predictors for easier interpretability and higher clinical uptake. (2) Level 1: 1-year preconception to last menstrual period; level 2: last menstrual period to before diagnosis of gestational diabetes; level 3: at the time of diagnosis of gestational diabetes; level 4: 1 week after diagnosis of gestational diabetes. (3) The corresponding difference in AUC by Delong’s test between the complex super learner and simpler logistic regression models using level 1, levels 1–2, levels 1–3, and levels 1–4 are 0.073, 0.049, 0.831, and 0.264 respectively. AUC, area under the receiver operating characteristic curve; LASSO: least absolute shrinkage and selection operator