| Literature DB >> 35795827 |
Xiaoyuan Qian1, Du He2, Li Qin3, Lin Lai2, Hongli Wang4, Yukun Zhang2.
Abstract
Purpose: Deep myometrial invasion (DMI) is an independent high-risk factor for lymph node metastasis and a prognostic risk factor in early-stage endometrial cancer (EC-I) patients. Thus, we developed a machine learning (ML) assistant model, which can accurately help define the surgical area.Entities:
Keywords: endometrial cancer; gray level co-occurrence matrix; machine learning; myometrial invasion; prediction
Year: 2022 PMID: 35795827 PMCID: PMC9252192 DOI: 10.2147/CMAR.S370477
Source DB: PubMed Journal: Cancer Manag Res ISSN: 1179-1322 Impact factor: 3.602
Figure 1The flow chart of patient selection and data process.
Baseline Demographic and Clinicopathological Characteristics of Patients
| Variables | Training Set | Testing Set | ||||||
|---|---|---|---|---|---|---|---|---|
| Overall(N=243) | Yes(N=28) | No(N=215) | P-value | Overall(N=105) | Yes(N=20) | No(N=85) | P-value | |
| Age (median [IQR]),year | 55.00 [44.00, 65.50] | 60.50 [45.50, 65.00] | 55.00 [43.00, 66.00] | 0.306 | 53.00 [43.00, 64.00] | 53.50 [43.00, 63.25] | 53.00 [41.00, 65.00] | 0.838 |
| BMI (median [IQR]),kg/m2 | 22.90 [20.60, 25.10] | 24.60 [22.82, 26.40] | 22.60 [20.25, 25.00] | 0.005 | 23.50 [21.40, 25.70] | 22.85 [22.25, 27.05] | 23.80 [21.30, 25.60] | 0.315 |
| Smoking (%) | ||||||||
| Yes | 130 (53.5) | 13 (46.4) | 117 (54.4) | 0.551 | 49 (46.7) | 7 (35.0) | 42 (49.4) | 0.361 |
| No | 113 (46.5) | 15 (53.6) | 98 (45.6) | 56 (53.3) | 13 (65.0) | 43 (50.6) | ||
| Drinking (%) | ||||||||
| Yes | 129 (53.1) | 13 (46.4) | 116 (54.0) | 0.583 | 50 (47.6) | 6 (30.0) | 44 (51.8) | 0.132 |
| No | 114 (46.9) | 15 (53.6) | 99 (46.0) | 55 (52.4) | 14 (70.0) | 41 (48.2) | ||
| Hypertension (%) | ||||||||
| Yes | 120 (49.4) | 14 (50.0) | 106 (49.3) | 1 | 54 (51.4) | 12 (60.0) | 42 (49.4) | 0.546 |
| No | 123 (50.6) | 14 (50.0) | 109 (50.7) | 51 (48.6) | 8 (40.0) | 43 (50.6) | ||
| Diabetes (%) | ||||||||
| Yes | 119 (49.0) | 19 (67.9) | 100 (46.5) | 0.054 | 52 (49.5) | 8 (40.0) | 44 (51.8) | 0.485 |
| No | 124 (51.0) | 9 (32.1) | 115 (53.5) | 53 (50.5) | 12 (60.0) | 41 (48.2) | ||
| Menopause (%) | ||||||||
| Yes | 190 (78.2) | 19 (67.9) | 171 (79.5) | 0.244 | 74 (70.5) | 16 (80.0) | 58 (68.2) | 0.444 |
| No | 53 (21.8) | 9 (32.1) | 44 (20.5) | 31 (29.5) | 4 (20.0) | 27 (31.8) | ||
| CA199 (median [IQR]),U/mL | 32.70 [27.80, 38.25] | 43.45 [37.15, 51.00] | 32.00 [27.10, 36.80] | <0.001 | 34.20 [28.10, 38.30] | 44.60 [32.80, 54.05] | 33.10 [27.20, 37.10] | <0.001 |
| CA125 (median [IQR]),U/mL | 42.30 [37.45, 51.80] | 58.50 [48.90, 70.90] | 41.60 [36.90, 50.05] | <0.001 | 48.20 [41.00, 53.90] | 62.75 [50.55, 68.62] | 47.50 [39.80, 52.20] | <0.001 |
| Pathological type (%) | ||||||||
| Adenocarcinoma | 175 (72.0) | 26 (92.9) | 149 (69.3) | 0.017 | 71 (67.6) | 18 (90.0) | 53 (62.4) | 0.035 |
| Non-adenocarcinoma | 68 (28.0) | 2 (7.1) | 66 (30.7) | 34 (32.4) | 2 (10.0) | 32 (37.6) | ||
| FIGO stage (%) | ||||||||
| I | 215 (88.5) | 0 (0.0) | 215 (100.0) | <0.001 | 85 (81.0) | 0 (0.0) | 85 (100.0) | <0.001 |
| IA | 16 (6.6) | 16 (57.1) | 0 (0.0) | 10 (9.5) | 10 (50.0) | 0 (0.0) | ||
| IB | 12 (4.9) | 12 (42.9) | 0 (0.0) | 10 (9.5) | 10 (50.0) | 0 (0.0) | ||
| Correlation_AllDirection_offset1 (median [IQR]) | 0.98 [0.93, 1.04] | 2.46 [2.17, 2.88] | 0.96 [0.92, 1.02] | <0.001 | 0.99 [0.95, 1.05] | 2.35 [2.21, 2.93] | 0.98 [0.94, 1.02] | <0.001 |
| Correlation_angle0_offset1 (median [IQR]) | 1.03 [0.98, 1.06] | 2.47 [1.94, 3.14] | 1.01 [0.98, 1.05] | <0.001 | 1.04 [1.01, 1.08] | 2.75 [1.91, 3.80] | 1.03 [1.00, 1.06] | <0.001 |
| Correlation_angle45_offset1 (median [IQR]) | 1.09 [1.04, 1.13] | 2.19 [1.70, 2.65] | 1.08 [1.04, 1.11] | <0.001 | 1.09 [1.04, 1.13] | 2.38 [1.90, 2.78] | 1.07 [1.04, 1.10] | <0.001 |
| Correlation_angle90_offset1 (median [IQR]) | 1.14 [1.08, 1.21] | 2.60 [2.11, 2.97] | 1.12 [1.07, 1.18] | <0.001 | 1.16 [1.08, 1.22] | 2.92 [2.13, 3.40] | 1.13 [1.06, 1.18] | <0.001 |
| Inertia_AllDirection_offset1 (median [IQR]) | 230.00 [207.50, 253.50] | 183.00 [159.75, 201.25] | 233.00 [213.00, 256.00] | <0.001 | 228.00 [206.00, 253.00] | 182.00 [152.00, 197.25] | 235.00 [218.00, 259.00] | <0.001 |
| Inertia_AllDirection_offset1_SD (median [IQR]) | 4860.00 [3365.50, 6333.50] | 3365.50 [2079.00, 4945.25] | 5052.00 [3638.00, 6367.00] | 0.003 | 4437.00 [2839.00, 6103.00] | 3673.50 [2344.00, 5724.50] | 4548.00 [3024.00, 6133.00] | 0.133 |
| Inertia_angle0_offset1 (median [IQR]) | 199.00 [175.00, 231.50] | 126.00 [95.75, 158.50] | 205.00 [180.50, 235.50] | <0.001 | 190.00 [168.00, 217.00] | 138.00 [114.50, 181.00] | 196.00 [175.00, 226.00] | <0.001 |
| Inertia_angle45_offset1 (median [IQR]) | 282.00 [247.00, 314.50] | 209.00 [166.75, 282.00] | 287.00 [252.00, 319.00] | <0.001 | 283.00 [238.00, 320.00] | 193.00 [164.50, 250.50] | 301.00 [246.00, 326.00] | <0.001 |
| Inertia_angle90_offset1 (median [IQR]) | 133.00 [111.00, 154.00] | 130.50 [111.25, 139.25] | 135.00 [111.00, 155.00] | 0.229 | 132.00 [117.00, 156.00] | 123.00 [111.50, 136.00] | 135.00 [120.00, 156.00] | 0.057 |
| IDMoment_AllDirection_offset1 (median [IQR]) | 0.08 [0.07, 0.08] | 0.10 [0.10, 0.12] | 0.08 [0.07, 0.08] | <0.001 | 0.08 [0.07, 0.08] | 0.10 [0.09, 0.12] | 0.08 [0.07, 0.08] | <0.001 |
| IDMoment_angle0_offset1 (median [IQR]) | 0.08 [0.07, 0.08] | 0.11 [0.09, 0.12] | 0.08 [0.07, 0.08] | <0.001 | 0.08 [0.08, 0.08] | 0.12 [0.10, 0.14] | 0.08 [0.08, 0.08] | <0.001 |
| IDMoment_angle45_offset1 (median [IQR]) | 0.06 [0.05, 0.06] | 0.10 [0.07, 0.12] | 0.06 [0.05, 0.06] | <0.001 | 0.06 [0.05, 0.07] | 0.10 [0.09, 0.10] | 0.06 [0.05, 0.06] | <0.001 |
| IDMoment_angle90_offset1 (median [IQR]) | 0.07 [0.05, 0.08] | 0.11 [0.10, 0.13] | 0.06 [0.05, 0.07] | <0.001 | 0.07 [0.06, 0.08] | 0.13 [0.11, 0.14] | 0.07 [0.06, 0.08] | <0.001 |
Abbreviations: IQR, inter-quartile range; BMI, body mass index; CA199, carbohydrate antigen199; CA125, carbohydrate antigen125; FIGO, Federation International of Gynecology and Obstetrics.
Figure 2Variable screening and weight allocation. (A) Correlation matrix analysis of candidate features. (B) The weight distribution of the candidate variables of each ML-based model.
Figure 3Predictive model visualization based on ML-based algorithm. (A) RFC model. (B) DT model. The candidate factors associated with myometrial invasion were ordered via RFC algorithm (A, B) prediction node, and weight was allocated via DT algorithm.
Figure 4Predictive model visualization based on ANN algorithm. The candidate factors associated with myometrial invasion were ordered via the ANN algorithm. Red represents the positive weight, and blue represents the negative weight.
Figure 5Prediction performance of candidate models based on ML-based algorithm. (A) DCA for five ML-based models in the training set. (B) DCA for five ML-based models in the testing set.
The ROC Curve Analyses for Predicting Myometrial Invasion in Each ML-Based Model
| Model | Training Set | Testing Set | ||||
|---|---|---|---|---|---|---|
| AUC Mean | AUC 95% CI | Variables& | AUC Mean | AUC 95% CI | Variables& | |
| 0.877 | 0.316–1.438 | 7 | 0.862 | 0.301–1.423 | 7 | |
| 0.765 | 0.191–1.339 | 8 | 0.716 | 0.142–1.290 | 8 | |
| 0.787 | 0.264–1.310 | 7 | 0.739 | 0.212–1.266 | 7 | |
| 0.842 | 0.295–1.389 | 7 | 0.804 | 0.257–1.351 | 7 | |
| 0.768 | 0.204–1.332 | 8 | 0.715 | 0.151–1.279 | 8 | |
| 0.835 | 0.267–1.403 | 0 | 0.816 | 0.248–1.384 | 0 | |
Notes: &Variables included in the model.
Abbreviations: RFC, random forest classifier; SVM, support vector machine; DT, decision tree; ANN, artificial Neural Network; XGboost, eXtreme gradient boosting; AUC, area under curve; 95% CI, 95% confidence interval;.