| Literature DB >> 35267599 |
Reid Shaw1, Anna E Lokshin2, Michael C Miller1, Geralyn Messerlian-Lambert3, Richard G Moore1.
Abstract
OBJECTIVE: To identify the most predictive parameters of ovarian malignancy and develop a machine learning (ML) based algorithm to preoperatively distinguish between a benign and malignant pelvic mass.Entities:
Keywords: CA125; HE4; ovarian cancer; pelvic mass
Year: 2022 PMID: 35267599 PMCID: PMC8909341 DOI: 10.3390/cancers14051291
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Patient demographics and tumor characteristics.
|
| |||||
| Training ( | Testing ( | All Patients ( | |||
| Age | |||||
| Years (range) | 54.7 (20–91) | 55.3 (25–82) | 54.9 (20–91) | ||
| Race | |||||
| Black | 2 (1.9%) | 1 (2.9%) | 3 (2.1%) | ||
| Hispanic | 1 (0.9%) | 1 (2.9%) | 2 (1.4%) | ||
| Other | 6 (5.7%) | 1 (2.9%) | 7 (5.0%) | ||
| White | 97 (91.5%) | 31 (91.2%) | 128 (91.4%) | ||
| Menopausal Status | |||||
| Post-menopausal | 68 (64.2%) | 22 (64.7%) | 90 (64.3%) | ||
| Pre-menopausal | 38 (35.8%) | 12 (35.3%) | 50 (35.7%) | ||
| Histological Diagnosis | |||||
| Benign | 60 (56.6%) | 19 (55.9%) | 79 (56.4%) | ||
| Borderline/LMP | 3 (2.8%) | 1 (2.9%) | 4 (2.9%) | ||
| Cancer—EOC I–II | 17 (16.0%) | 4 (11.8%) | 21 (15.0%) | ||
| Cancer—EOC III–IV | 15 (14.2%) | 9 (26.5%) | 24 (17.1%) | ||
| Cancer—Metastatic | 5 (4.7%) | 1 (2.9%) | 6 (4.3%) | ||
| Cancer—EOC Unstaged | 1 (0.9%) | 0 (0.0%) | 1 (0.7%) | ||
| Cancer—Non-EOC | 2 (1.9%) | 0 (0.0%) | 2 (1.4%) | ||
| Cancer—Other Gyn | 3 (2.8%) | 0 (0.0%) | 3 (2.1%) | ||
| Stage | |||||
| IA | 8 (7.5%) | 2 (5.9%) | 10 (7.1%) | ||
| IB | 1 (0.9%) | 0 (0.0%) | 1 (0.7%) | ||
| IC | 6 (5.7%) | 1 (2.9%) | 7 (5.0%) | ||
| II | 1 (0.9%) | 0 (0.0%) | 1 (0.7%) | ||
| IIA | 3 (2.8%) | 0 (0.0%) | 3 (2.1%) | ||
| IIB | 1 (0.9%) | 0 (0.0%) | 1 (0.7%) | ||
| IIC | 1 (0.9%) | 1 (2.9%) | 2 (1.4%) | ||
| IIIA | 5 (4.7%) | 2 (5.9%) | 7 (5.0%) | ||
| IIIB | 1 (0.9%) | 1 (2.9%) | 2 (1.4%) | ||
| IIIC | 12 (11.3%) | 7 (20.6%) | 19 (13.6%) | ||
| IV | 5 (4.7%) | 1 (2.9%) | 6 (4.3%) | ||
| Unstaged | 1 (0.9%) | 0 (0.0%) | 1 (0.7%) | ||
|
| |||||
| EOC Type | Stage I | Stage II | Stage III | Unstaged | All |
| Clear Cell | 2 (33.3%) | 1 (16.7%) | 3 (50.0%) | 0 (0.0%) | 6 (13.0%) |
| Endometrioid | 1 (50.0%) | 0 (0.0%) | 1 (50.0%) | 0 (0.0%) | 2 (4.3%) |
| Mucinous | 7 (87.5%) | 1 (12.5%) | 0 (0.0%) | 0 (0.0%) | 8 (17.4%) |
| Serous | 4 (13.8%) | 4 (13.8%) | 20 (69.0%) | 1 (3.4%) | 29 (63.0%) |
| Mixed | 0 (0.0%) | 1 (100%) | 0 (0.0%) | 0 (0.0%) | 1 (2.2%) |
| All EOC | 14 (30.4%) | 7 (15.2%) | 24 (52.2%) | 1 (2.2%) | 46 (100%) |
|
| |||||
| All patients | Pre-Menopausal | Post-Menopausal | |||
| EOC Grade | |||||
| Grade 1 | 9 (19.6%) | 3 (30.0%) | 6 (16.7%) | ||
| Grade 2 | 3 (6.5%) | 1 (10.0%) | 2 (5.6%) | ||
| Grade 3 | 34 (73.9%) | 6 (60.0%) | 28 (77.8%) | ||
| Total | 46 (100%) | 10 (21.7%) | 36 (78.3%) | ||
Figure 1(A) Dot plot of ROC AUC for each of the three tuned random forest (RF) hyperparameters using Bayes optimization. (B) ROC curves of the RF hyperparameters averaged across each of the 25 bootstrap resamples. (C) Boxplot of variable importance as calculated by Gini impurity across 1000 permutations of the RF model fitting. “P” indicates plasma, whereas “U” indicates urine; unlabeled biomarkers are from serum. (D) Boxplot of the three selected predictive parameters: CA125, HE4, and transferrin.
Figure 2(A) 25 ROC curves for each of the nine unique ML classifiers. One curve represents the average of all tested hyperparameters for one of the bootstrap resamples. (B) ROC AUC values for each of the individual ML hyperparameters averaged across each of the 25 bootstrap resamples. (C) Ensemble stack ML classifier weights as defined by LASSO regression.
Ensemble model statistics on testing dataset.
| Ensemble Model | AUC | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|
| CA125 + HE4 + TRF | 0.951 | 97.1% | 93.3% | 100.0% | 100.0% | 95.0% |
| CA125 + HE4 | 0.937 | 85.3% | 86.7% | 84.2% | 81.2% | 88.9% |
| Top 10% | 0.926 | 82.4% | 66.7% | 94.7% | 90.9% | 78.3% |
| All predictors | 0.909 | 88.2% | 80.0% | 94.7% | 92.3% | 85.7% |
Figure 3(A) ROC curves of the four ensemble stack models. No statistical difference was observed using DeLong’s method. (B) Ensemble stack confusion matrix for the final testing data (n = 34). (C) Dot plot of the testing samples colored by prediction result. CA125 is on the X-axis, HE4 is on the Y-axis, and transferrin is illustrated by the size of each dot.