| Literature DB >> 36124028 |
Jinbo Zhang1, Xiaozhi Wu1, Qingqing Song2.
Abstract
Method: A retrospective selection of 93 women who were hospitalized in our hospital from March 2019 to May 2022 with a singleton pregnancy and delivered at term with macrosomia were the study group. And 356 women who delivered a normal size baby during the same period were the control group. The variables that were associated with the onset of macrosomia were screened from maternal medical records. Logistic regression models, random forest, and CART decision tree models were developed using the screened variables as input variables and whether they were macrosomia as outcome variables, respectively. The performance of the three models was evaluated by accuracy, precision, recall, F1 score, and receiver operating characteristic curve (ROC). Result: The risk prediction models for the onset of macrosomia, logistic regression model, random forest model, and decision tree, were successfully developed, with accuracies of 0.904, 1.000, and 0.901 in the training set and 0.926, 0.582, and 0.852 in the validation set, respectively. The AUC in the training set were 0.898, 1.000, and 0.789, and in the validation set were 0.906, 0.913, and 0.731, respectively. In general, the logistic regression model has the highest diagnostic efficiency, followed by the random forest model.Entities:
Mesh:
Year: 2022 PMID: 36124028 PMCID: PMC9482546 DOI: 10.1155/2022/9073043
Source DB: PubMed Journal: Dis Markers ISSN: 0278-0240 Impact factor: 3.464
Assignment table of each input variable.
| Variable | Assignment | Number |
|---|---|---|
| Age | Actual value entry | X1 |
| Pregnancy BMI (kg/m2) | 0 = Normal, 1 = overweight/obese | X2 |
| Gestational weight gain (kg) | Actual value entry | X3 |
| Number of maternity | 0 = Primipara, 1 = parity | X4 |
| TG (mmol/L) | Actual value entry | X5 |
| HDL-C (mmol/L) | Actual value entry | X6 |
| LDL-C (mmol/L) | Actual value entry | X7 |
| FT4 (pmol/L) | Actual value entry | X8 |
| TPOAb positive | 0 = no, 1 = yes | X9 |
| Gestational diabetes | 0 = no, 1 = yes | X10 |
| HDP | 0 = no, 1 = yes | X11 |
| Sex of newborn | 0 = female, 1 = male | X12 |
. LR model results.
| Variable | Coeff | Standard error |
|
|
|---|---|---|---|---|
| Constant | -10.455 | 1.890 | -5.50 | <0.05 |
| Age | 0.166 | 0.037 | 4.46 | <0.05 |
| Pregnancy overweight/obese | 1.410 | 0.267 | 5.29 | <0.05 |
| Gestational weight gain | 0.074 | 0.042 | 1.77 | >0.05 |
| Number of maternity | 1.340 | 0.388 | 3.45 | <0.05 |
| TG | 0.153 | 0.142 | 1.08 | >0.05 |
| HDL-C | 1.175 | 0.500 | 2.35 | <0.05 |
| LDL-C | -0.046 | 0.261 | -0.17 | >0.05 |
| FT4 | -0.172 | 0.043 | -4.00 | <0.05 |
| TPOAb positive | 2.311 | 0392 | 5.90 | <0.05 |
| Gestational diabetes | 0.879 | 0.393 | 2.23 | <0.05 |
| HDP | 0.781 | 0.424 | 1.84 | >0.05 |
| Sex of newborn | 0.478 | 0.359 | 1.33 | >0.05 |
Figure 1(a) OOB trend of the random forest model; (b) the classification contribution of each variable.
The contribution of each variable.
| Number | Variable | Mean decrease accuracy | Mean decrease Gini |
|---|---|---|---|
| 1 | TPOAb positive | 19.608 | 6.953 |
| 2 | Gestational weight gain | 18.486 | 1.619 |
| 3 | Age | 17.662 | 12.293 |
| 4 | Pregnancy BMI | 17.115 | 9.660 |
| 5 | FT4 | 14.323 | 12.865 |
| 6 | Parity | 13.260 | 4.214 |
| 7 | HDL-C | 8.498 | 13.020 |
| 8 | Gestational diabetes | 7.498 | 2.142 |
| 9 | TG | 2.357 | 8.094 |
| 10 | Sex of newborn | 1.509 | 1.282 |
| 11 | LDL-C | 0.658 | 7.573 |
| 12 | HDP | -2.248 | 1.161 |
Figure 2Decision tree model of macrosomia.
Performance comparison of the three prediction models.
| Models | LR | RF | DT | |||
|---|---|---|---|---|---|---|
| Training set | Validation set | Training set | Validation set | Training set | Validation set | |
| Accuracy | 0.904 | 0.926 | 1.000 | 0.582 | 0.901 | 0.852 |
| Sensitivity | 0.968 | 0.990 | 1.000 | 0.406 | 0.541 | 0.438 |
| Specificity | 0.639 | 0.719 | 1.000 | 0.990 | 0.988 | 0.981 |
| Recall rate | 0.968 | 0.990 | 1.000 | 0.406 | 0.541 | 0.438 |
| Accurate rate | 0.918 | 0.919 | 1.000 | 0.929 | 0.917 | 0.875 |
| F1 score | 0.942 | 0.953 | 1.000 | 0.565 | 0.680 | 0.583 |
| AUC | 0.898 | 0.906 | 1.000 | 0.913 | 0.789 | 0.731 |
Figure 3ROC curve of the logistic regression model. (a) ROC curve of training set; (b) ROC curve of validation set.
Figure 4ROC curve of the random forest model. (a) ROC curve of training set; (b) ROC curve of validation set.
Figure 5ROC curve of the decision tree model. (a) ROC curve of training set; (b) ROC curve of validation set.