| Literature DB >> 35294369 |
Zheqing Zhang1, Luqian Yang1, Wentao Han1, Yaoyu Wu1, Linhui Zhang1, Chun Gao1, Kui Jiang1, Yun Liu2,3, Huiqun Wu1.
Abstract
BACKGROUND: Gestational diabetes mellitus (GDM) is a common endocrine metabolic disease, involving a carbohydrate intolerance of variable severity during pregnancy. The incidence of GDM-related complications and adverse pregnancy outcomes has declined, in part, due to early screening. Machine learning (ML) models are increasingly used to identify risk factors and enable the early prediction of GDM.Entities:
Keywords: digital health; gestational diabetes mellitus; machine learning; prediction model; prognostic model
Mesh:
Year: 2022 PMID: 35294369 PMCID: PMC8968560 DOI: 10.2196/26634
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Figure 1PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow diagram for study selection. CNKI: China National Knowledge Infrastructure.
The most frequent factors included in risk prediction models for gestational diabetes mellitus.
| Study first author, year | Factors included in models | ||||||||||||
|
| MAa (n=19) | FHDb (n=14) | BMI (n=12) | FPGc (n=11) | PBMId (n=8) | HDe (n=8) | Ethnicity (n=6) | TGf (n=5) | HbA1cg (n=4) | SBPh (n=3) | Height (n=3) | hsCRPi (n=3) | |
| Gao, 2020 [ | ✓j | ✓ | ✓ |
|
|
|
|
|
| ✓ | ✓ |
| |
| Liu, 2020 [ | ✓ |
|
| ✓ | ✓ |
|
|
|
|
|
|
| |
| Miao, 2020 [ |
| ✓ |
| ✓ |
|
|
| ✓ |
|
|
|
| |
| Tan, 2020 [ | ✓ | ✓ |
| ✓ | ✓ |
|
|
|
|
|
|
| |
| Wu, 2020 [ | ✓ | ✓ |
| ✓ |
| ✓ |
|
|
|
|
|
| |
| Xiong, 2020 [ |
|
|
|
|
|
|
|
|
|
|
|
| |
| Ye, 2020 [ | ✓ |
| ✓ | ✓ | ✓ | ✓ |
| ✓ | ✓ |
|
|
| |
| Zhang, 2020 [ | ✓ | ✓ |
| ✓ | ✓ | ✓ |
|
| ✓ | ✓ |
| ✓ | |
| Snyder, 2020 [ | ✓ |
|
|
| ✓ |
| ✓ |
|
|
|
|
| |
| Cui, 2019 [ | ✓ | ✓ |
|
| ✓ |
|
|
|
|
|
|
| |
| Zheng, 2019 [ | ✓ |
| ✓ | ✓ |
|
|
| ✓ |
|
|
|
| |
| Nombo, 2018 [ |
| ✓ | ✓ |
|
|
|
|
|
|
|
|
| |
| Sweeting, 2018 [ |
| ✓ | ✓ |
|
| ✓ | ✓ | ✓ |
|
|
|
| |
| Xiao, 2018 [ | ✓ |
| ✓ | ✓ |
|
|
| ✓ | ✓ |
| ✓ |
| |
| Huang, 2017 [ | ✓ | ✓ |
| ✓ | ✓ |
|
|
|
|
|
|
| |
| Wu, 2017 [ | ✓ |
| ✓ | ✓ |
|
|
|
|
|
| ✓ | ✓ | |
| Gabbay-Benziv, 2015 [ | ✓ |
| ✓ |
|
| ✓ | ✓ |
|
| ✓ |
|
| |
| Thériault, 2015 [ | ✓ | ✓ | ✓ |
|
| ✓ | ✓ |
| ✓ |
|
| ✓ | |
| Eleftheriades, 2014 [ | ✓ |
|
|
|
|
|
|
|
|
|
|
| |
| Pintaudi, 2013 [ |
| ✓ |
| ✓ | ✓ |
|
|
|
|
|
|
| |
| Savona-Ventura, 2013 [ | ✓ |
|
| ✓ |
|
|
|
|
|
|
|
| |
| Tran, 2013 [ | ✓ |
| ✓ |
|
|
|
|
|
|
|
|
| |
| Teede, 2011 [ | ✓ | ✓ |
|
|
| ✓ | ✓ |
|
|
|
|
| |
| Vanleeuwen, 2009 [ |
| ✓ | ✓ |
|
| ✓ | ✓ |
|
|
|
|
| |
| Caliskan, 2004 [ | ✓ | ✓ | ✓ |
|
|
|
|
|
|
|
|
| |
aMA: maternal age.
bFHD: family history of diabetes.
cFPG: fasting plasma glucose.
dPBMI: prepregnancy BMI.
eHD: history of diabetes.
fTG: triglyceride.
gHbA1c: hemoglobin A1c.
hSBP: systolic blood pressure.
ihsCRP: high-sensitivity C-reaction protein.
jA checkmark (✓) indicates that the factor was included.
Figure 2The overall pooled area under the receiver operating characteristic curve (AUROC) of machine learning models for gestational diabetes mellitus prediction. Q*: the sensitivity at the intersection of the SROC curve and the straight line (sensitivity=specificity); SROC: summary receiver operating characteristic.
Figure 3The overall pooled sensitivity of machine learning models for gestational diabetes mellitus prediction. First authors for each study are listed along the y-axis. The vertical red dotted lines are the 95% CIs of the pooled sensitivity. BYS: Bayesian; DNN: deep neural network; GA-CB: GA-CatBoost (genetic algorithm category boosting); GBDT: gradient-boosting decision tree; KNN: k-nearest neighbors; LGB: LightGBM (light gradient boosting machine); LR: logistic regression; SVM: support vector machine; Tnet: TreeNet; XGB: XGBoost (extreme gradient boosting).
Figure 4The overall pooled specificity of machine learning for gestational diabetes mellitus prediction. First authors for each study are listed along the y-axis. The vertical red dotted line is the 95% CI of the pooled specificity. BYS: Bayesian; DNN: deep neural network; GA-CB: GA-CatBoost (genetic algorithm category boosting); GBDT: gradient-boosting decision tree; KNN: k-nearest neighbors; LGB: LightGBM (light gradient boosting machine); LR: logistic regression; SVM: support vector machine; Tnet: TreeNet; XGB: XGBoost (extreme gradient boosting).
The comparison of performance of machine learning models in gestational diabetes mellitus (GDM) prediction applied to different subgroups.
| Subgroup | Models (N=30), n (%) | AUROCa | Sensitivity (95% CI) | Specificity (95% CI) | PLRb (95% CI) | NLRc (95% CI) | DORd (95% CI) |
| Overall | 30 (100) | 0.8492 | 0.69 (0.68-0.69) | 0.75 (0.75-0.75) | 4.02 (3.13-5.17) | 0.31 (0.26-0.38) | 13.78 (9.53-19.94) |
| 0-13 weeks before diagnosis | 16 (53) | 0.8667 | 0.74 (0.73-0.75) | 0.64 (0.64-0.64) | 3.89 (2.92-5.19) | 0.28 (0.22-0.36) | 16.55 (9.52-28.77) |
| 14-28 weeks before diagnosis | 14 (47) | 0.8365 | 0.64 (0.63-0.65) | 0.85 (0.84-0.85) | 3.90 (2.76-5.53) | 0.35 (0.25-0.48) | 11.67 (7.59-18.02) |
| With GDM history | 11 (37) | 0.8759 | 0.67 (0.66-0.68) | 0.85 (0.85-0.86) | 5.29 (3.39-8.25) | 0.28 (0.18-0.44) | 19.82 (11.49-34.13) |
| Without GDM history | 19 (63) | 0.8330 | 0.70 (0.66-0.68) | 0.65 (0.64-0.65) | 3.12 (2.52-3.86) | 0.35 (0.30-0.41) | 8.27 (5.14-13.29) |
| Logistic regression | 19 (63) | 0.8151 | 0.71 (0.70-0.72) | 0.67 (0.67-0.67) | 3.04 (2.37-3.89) | 0.37 (0.32-0.43) | 8.73 (5.99-12.73) |
| Non–logistic regression | 11 (37) | 0.8891 | 0.66 (0.65-0.67) | 0.85 (0.85-0.86) | 6.80 (4.45-10.37) | 0.24 (0.15-0.38) | 31.85 (15.93-63.69) |
aAUROC: area under receiver operating characteristic curve.
bPLR: positive likelihood ratio.
cNLR: negative likelihood ratio.
dDOR: diagnostic odds ratio.
Figure 5The overall pooled area under the receiver operating characteristic curve (AUROC) of logistic regression models for gestational diabetes mellitus prediction. Q*: the sensitivity at the intersection of the SROC curve and the straight line (sensitivity=specificity); SROC: summary receiver operating characteristic.
Figure 6The overall pooled area under the receiver operating characteristic curve (AUROC) of non–logistic regression models for gestational diabetes mellitus prediction. Q*: the sensitivity at the intersection of the SROC curve and the straight line (sensitivity=specificity); SROC: summary receiver operating characteristic.