| Literature DB >> 35308550 |
Wenwen Wang1, Yang Xu2, Suzhen Yuan1, Zhiying Li3, Xin Zhu4, Qin Zhou1, Wenfeng Shen2, Shixuan Wang1.
Abstract
Endometrial carcinoma (EC) is a common cause of cancer death in women, and having an early accurate prediction model to identify this disease is crucial. The aim of this study was to develop a new machine learning (ML) model-based diagnostic prediction model for EC. We collected data from consecutive patients between November 2012 and January 2021 at tertiary hospitals in central China. Inclusion criteria included women undergoing endometrial biopsy, dilation and curettage, or hysterectomy. A total of 9 features, including patient demographics, vital signs, and laboratory and ultrasound results, were selected in the final analysis. This new model was combined with three top optimal ML methods, namely, logistic regression, gradient-boosted decision tree, and random forest. A total of 1,922 patients were eligible for final analysis and modeling. The ensemble model, called TJHPEC, was validated in an internal validation cohort and two external validation cohorts. The results showed that the AUC values were 0.9346, 0.8341, and 0.8649 for the prediction of total EC and 0.9347, 0.8073, and 0.871 for prediction of stage I EC. Nine clinical features were confirmed to be highly related to the prediction of EC in TJHPEC. In conclusion, our new model may be accurate for identifying EC, especially in the early stage, in the general population of central China.Entities:
Keywords: endometrial carcinoma (EC); ensemble method; machine learning; model; prediction
Year: 2022 PMID: 35308550 PMCID: PMC8931475 DOI: 10.3389/fmed.2022.851890
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Flow chart of group allocation. EC, endometrial carcinoma.
Figure 2Feature selection using LASSO. Feature coefficients revealed high-risk (red bar) and low-risk (blue bar) features, which were selected using LASSO with an optimal lambda value of 2.5 × 10−3. BMI, body mass index; TBIL, total bilirubin; ALP, alkaline phosphatase; BUN, blood urea nitrogen; γ-GGT, γ-glutamyl transpeptidase; ET, endometrial thickness; TC, total cholesterol; AST, aspartate aminotransferase; CA-125, cancer antigen 125; UA, uric acid; Diabetes, type 2 diabetes; Family history, family history of malignant diseases; WBC, white blood cell; RBC, red blood cell; ALT, alanine aminotransferase; HGB, hemoglobin; PCT, thrombocytocrit; MCH, mean corpuscular hemoglobin; MCV, mean corpuscular volume; HCT, hematocrit; eGFR, estimated glomerular filtration rate; MCHC, mean corpuscular hemoglobin concentration; TP, total protein; DBIL, direct bilirubin; ALB, albumin; , concentrations of bicarbonate.
Demographic characteristics in the validation cohorts.
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Age, years | 40 ± 11 | 54 ± 8 | 43 ± 10 | 54 ± 10 | 42 ± 11 | 54 ± 8 |
| BMI, kg/m2 | 22.1 ± 3.3 | 24.5 ± 4.0 | 22.5 ± 3.0 | 24.3 ± 3.7 | 22.8 ± 3.4 | 24.9 ± 3.6 |
|
| ||||||
| Menopause status | 32 (10.3) | 42 (56.8) | 23 (11.9) | 25 (65.8) | 63 ( | 78 (54.5) |
|
| ||||||
| Vaginal bleeding | 108 (34.7) | 68 (91.9) | 106 (54.6) | 32 (84.2) | 227 (39.7) | 128 (89.5) |
|
| ||||||
| ALB, g/L | 41.7 ± 4.4 | 38.8 ± 5.7 | 41.9 ± 6.0 | 40.7 ± 4.8 | 42.9 ± 4.2 | 41.8 ± 3.9 |
| ALP, U/L | 54 ± 19 | 65 ± 22 | 60 ± 20 | 78 ± 23 | 62 ± 26 | 71 ± 20 |
| γ-GGT, U/L | 18 ± 17 | 27 ± 27 | 17 ± 15 | 21 ± 13 | 20 ± 25 | 26 ± 23 |
| 24.0 ± 2.3 | 23.7 ± 3.1 | 25.9 ± 2.3 | 27.2 ± 2.5 | 23.7 ± 2.2 | 23.7 ± 2.3 | |
|
| ||||||
| ET, mm | 8.2 ± 4.7 | 8.1 ± 7.7 | 9.7 ± 4.3 | 14.9 ± 6.9 | 7.3 ± 4.4 | 6.1 ± 5.8 |
BMI, body mass index; ET, endometrial thickness; CA-125, cancer antigen 125.
Performance indices of the four predictive models for total EC in the validation cohorts.
|
|
|
|
| |
|---|---|---|---|---|
|
| ||||
| AUC (95% CI) | 0.9346 (0.9108–0.9584) | 0.9175 (0.8901–0.9449) | 0.932 (0.9077–0.9563) | 0.9335 (0.9095–0.9575) |
| Accuracy (95% CI) | 91.17% (88.33–94.00%) | 88.83% (85.68–91.98%) | 89.61% (86.56–92.66%) | 90.65% (87.74–93.56%) |
| Sensitivity (95% CI) | 86.49% (78.70–94.28%) | 79.73% (70.57–88.89%) | 89.19% (82.11–96.26%) | 86.49% (78.70–94.28%) |
| Specificity (95% CI) | 92.28% (89.32–95.25%) | 91.00% (87.82–94.18%) | 89.71% (86.33–93.09%) | 91.64% (88.56–94.72%) |
| PPV (95% CI) | 72.73% (63.42–82.03%) | 67.82% (58.00–77.63%) | 67.35% (58.06–76.63%) | 71.11% (61.75–80.48%) |
| NPV (95% CI) | 96.63% (94.58–98.68%) | 94.97% (92.48–97.45%) | 97.21% (95.31–99.12%) | 96.61% (94.55–98.68%) |
| F1 | 0.7901 | 0.7329 | 0.7674 | 0.7805 |
| Kappa | 0.7347 | 0.6629 | 0.7022 | 0.7218 |
| Brier score | 0.088 | 0.112 | 0.104 | 0.094 |
|
| ||||
| AUC (95% CI) | 0.8341 (0.777–0.8912) | 0.831 (0.7732–0.8888) | 0.8196 (0.7593–0.8799) | 0.8265 (0.7677–0.8853) |
| Accuracy (95% CI) | 81.03% (75.99–86.08%) | 78.02% (72.69–83.35%) | 77.16% (71.75–82.56%) | 80.60% (75.52–85.69%) |
| Sensitivity (95% CI) | 57.89% (42.20–73.59%) | 60.53% (44.99–76.07%) | 71.05% (56.63–85.47%) | 60.53% (44.99–76.07%) |
| Specificity (95% CI) | 85.57% (80.62–90.51%) | 81.44% (75.97–86.91%) | 78.35% (72.56–84.15%) | 84.54% (79.45–89.62%) |
| PPV (95% CI) | 44.00% (30.24–57.76%) | 38.98% (26.54–51.43%) | 39.13% (27.61–50.65%) | 43.40% (30.05–56.74%) |
| NPV (95% CI) | 91.21% (87.09–95.32%) | 91.33% (87.14–95.52%) | 93.25% (89.40–97.10%) | 91.62% (87.56–95.68%) |
| F1 | 0.5 | 0.4742 | 0.5047 | 0.5055 |
| Kappa | 0.3857 | 0.3434 | 0.372 | 0.3889 |
| Brier score | 0.19 | 0.22 | 0.228 | 0.194 |
|
| ||||
| AUC (95% CI) | 0.8649 (0.8377–0.8921) | 0.8574 (0.8293–0.8855) | 0.8544 (0.8259–0.8829) | 0.8607 (0.833–0.8884) |
| Accuracy (95% CI) | 80.98% (78.10–83.86%) | 78.88% (75.89–81.87%) | 79.30% (76.33–82.27%) | 81.12% (78.25–83.99%) |
| Sensitivity (95% CI) | 80.42% (73.92–86.92%) | 81.82% (75.50–88.14%) | 86.01% (80.33–91.70%) | 81.12% (74.70–87.53%) |
| Specificity (95% CI) | 81.12% (77.91–84.33%) | 78.15% (74.76–81.53%) | 77.62% (74.21–81.04%) | 81.12% (77.91–84.33%) |
| PPV (95% CI) | 51.57% (45.01–58.13%) | 48.35% (42.05–54.64%) | 49.00% (42.82–55.19%) | 51.79% (45.24–58.33%) |
| NPV (95% CI) | 94.31% (92.26–96.36%) | 94.50% (92.45–96.56%) | 95.69% (93.84–97.54%) | 94.50% (92.48–96.52%) |
| F1 | 0.6284 | 0.6078 | 0.6244 | 0.6322 |
| Kappa | 0.5087 | 0.4761 | 0.4959 | 0.5133 |
| Brier score | 0.19 | 0.211 | 0.207 | 0.189 |
GBDT, gradient-boosted decision tree; LR, logistic regression; RF, random forest; AUC, area under the receiver-operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; 95% CI, 95% confidence interval.
Figure 3Evaluation of the efficiency of the diagnostic prediction of models (LR, SVM, RF, NN, KNN, GBDT, and TJHPEC) using ROC curves for the total EC cohorts (A, TJH1 internal validation set; B, RHH external validation set; C, TJH2 external validation set) and stage I EC cohorts (D, TJH1 internal validation set; E, RHH external validation set; F, TJH2 external validation set).
Performance indices of the four predictive models for stage I EC in validation cohorts.
|
|
|
|
| |
|---|---|---|---|---|
|
| ||||
| AUC (95% CI) | 0.9347 (0.9102–0.9592) | 0.9149 (0.8859–0.9439) | 0.9301 (0.9046–0.9556) | 0.9347 (0.9102–0.9592) |
| Accuracy (95% CI) | 91.44% (88.61–94.28%) | 89.04% (85.87–92.20%) | 89.84% (86.78–92.90%) | 90.91% (88.00–93.82%) |
| Sensitivity (95% CI) | 87.30% (79.08–95.52%) | 79.37% (69.37–89.36%) | 90.48% (83.23–97.72%) | 87.30% (79.08–95.52%) |
| Specificity (95% CI) | 92.28% (89.32–95.25%) | 91.00% (87.82–94.18%) | 89.71% (86.33–93.09%) | 91.64% (88.56–94.72%) |
| PPV (95% CI) | 69.62% (59.48–79.76%) | 64.10% (53.46–74.75%) | 64.04% (54.08–74.01%) | 67.90% (57.73–78.07%) |
| NPV (95% CI) | 97.29% (95.43–99.14%) | 95.61% (93.27–97.94%) | 97.89% (96.23–99.56%) | 97.27% (95.40–99.14%) |
| F1 | 0.7746 | 0.7092 | 0.75 | 0.7639 |
| Kappa | 0.7227 | 0.6426 | 0.6886 | 0.7087 |
| Brier score | 0.086 | 0.11 | 0.102 | 0.091 |
|
| ||||
| AUC (95% CI) | 0.8073 (0.7335–0.8811) | 0.8129 (0.7405–0.8853) | 0.782 (0.7019–0.8621) | 0.7966 (0.7201–0.8731) |
| Accuracy (95% CI) | 82.11% (77.02–87.20%) | 78.44% (72.98–83.90%) | 77.06% (71.48–82.65%) | 81.65% (76.51–86.79%) |
| Sensitivity (95% CI) | 54.17% (34.23–74.10%) | 54.17% (34.23–74.10%) | 66.67% (47.81–85.53%) | 58.33% (38.61–78.06%) |
| Specificity (95% CI) | 85.57% (80.62–90.51%) | 81.44% (75.97–86.91%) | 78.35% (72.56–84.15%) | 84.54% (79.45–89.62%) |
| PPV (95% CI) | 31.71% (17.46–45.95%) | 26.53% (14.17–38.89%) | 27.59% (16.08–39.09%) | 31.82% (18.06–45.58%) |
| NPV (95% CI) | 93.79% (90.23–97.34%) | 93.49% (89.77–97.21%) | 95.00% (91.62–98.38%) | 94.25% (90.79–97.71%) |
| F1 | 0.4 | 0.3562 | 0.3902 | 0.4118 |
| Kappa | 0.3032 | 0.2445 | 0.2778 | 0.314 |
| Brier score | 0.179 | 0.216 | 0.229 | 0.183 |
|
| ||||
| AUC (95% CI) | 0.871 (0.8435–0.8985) | 0.8577 (0.8284–0.887) | 0.8597 (0.8307–0.8887) | 0.8685 (0.8407–0.8963) |
| Accuracy (95% CI) | 81.29% (78.40–84.19%) | 78.71% (75.66–81.75%) | 79.14% (76.12–82.16%) | 81.44% (78.55–84.33%) |
| Sensitivity (95% CI) | 82.11% (75.34–88.89%) | 81.30% (74.41–88.19%) | 86.18% (80.08–92.28%) | 82.93% (76.28–89.58%) |
| Specificity (95% CI) | 81.12% (77.91–84.33%) | 78.15% (74.76–81.53%) | 77.62% (74.21–81.04%) | 81.12% (77.91–84.33%) |
| PPV (95% CI) | 48.33% (41.55–55.10%) | 44.44% (37.95–50.94%) | 45.30% (38.92–51.68%) | 48.57% (41.81–55.33%) |
| NPV (95% CI) | 95.47% (93.62–97.32%) | 95.11% (93.16–97.06%) | 96.31% (94.59–98.03%) | 95.67% (93.86–97.48%) |
| F1 | 0.6084 | 0.5747 | 0.5938 | 0.6126 |
| Kappa | 0.4962 | 0.4485 | 0.4711 | 0.5013 |
| Brier score | 0.187 | 0.213 | 0.209 | 0.186 |
GBDT, gradient-boosted decision tree; LR, logistic regression; RF, random forest; AUC, area under the receiver-operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; 95% CI, 95% confidence interval.
Figure 4Preponderance ranking of the top 20 features was performed using four models (LR, RF, GBDT, and TJHPEC) for the prediction of EC. The size of each circle represents its relative importance. The depth of the color indicates the importance of each feature in each model.