| Literature DB >> 34040391 |
Jun Ma1, Jiani Yang1, Yue Jin1, Shanshan Cheng1, Shan Huang1, Nan Zhang1, Yu Wang1.
Abstract
OBJECTIVE: We aimed to develop an ovarian cancer-specific predictive framework for clinical use platinum-sensitivity and prognosis using machine learning methods based on multiple biomarkers, including circulating tumor cells (CTCs). PATIENTS AND METHODS: We enrolled 156 epithelial ovarian cancer (EOC) patients, randomly assigned into the training and validation cohorts. Eight machine learning classifiers, including Random Forest (RF), Support Vector Machine, Gradient Boosting Machine, Conditional RF, Neural Network, Naive Bayes, Elastic Net, and Logistic Regression, were used to derive predictive information from 11 peripheral blood parameters, including CTCs. Through the advanced CanPatrol CTC-enrichment technique, we detect CTCs and classify them into subpopulations: epithelial, mesenchymal, and hybrids. Survival curves were generated by Kaplan-Meier method and calculated through the Log rank test.Entities:
Keywords: artificial intelligence; blood biomarkers; circulating tumor cell; epithelial ovarian cancer
Year: 2021 PMID: 34040391 PMCID: PMC8140950 DOI: 10.2147/OTT.S307546
Source DB: PubMed Journal: Onco Targets Ther ISSN: 1178-6930 Impact factor: 4.147
Figure 1The flowchart of the study. (A) We detected the circulating tumor cells (CTCs) through the advanced CanPatrolTM technique. After collecting 5 mL of peripheral blood samples, we used a nanofiltration system for CTCs isolation. Then, CTCs were detected by RNA-In Situ Hybridization (RNA-ISH). (B) We enrolled in 156 epithelial ovarian cancer (EOC) patients according to the inclusion and exclusion criteria. Patients were then randomly assigned to a training group (n=106) and a validated group (n=50) for machine learning model development.
Characteristics Between Patients in the Training Cohort and the Validated Cohort
| Variable | Total Patients (n = 156) | Training Cohort (n = 106) | Validated Cohort (n = 50) | p-value | |
|---|---|---|---|---|---|
| Age (years) | 57.89 ± 9.01 | 57.78 ± 8.46 | 58.10 ± 9.32 | 0.831 | |
| BMI (kg/m2) | 22.97 ± 0.89 | 23.19 ± 1.35 | 22.87 ± 0.95 | 0.134 | |
| Tumor size (cm) | 6.58 ± 3.98 | 6.73 ± 3.48 | 6.45 ± 4.21 | 0.662 | |
| Pathological grade, n (%) | 0.494 | ||||
| G1-2 | 43 (27.56%) | 31 (19.87%) | 12 (7.69%) | - | |
| G3 | 113 (72.44%) | 75 (48.08%) | 38 (24.36%) | - | |
| Clinical stage, n (%) | 0.396 | ||||
| I–II | 52 (33.33%) | 33 (21.15%) | 19 (12.18%) | - | |
| III–IV | 104 (66.67%) | 73 (46.79%) | 31 (19.87%) | - | |
| Histological type, n (%) | 0.906 | ||||
| Serous | 98 (62.8%) | 67 (42.9%) | 31 (19.9%) | - | |
| Mucinous | 25 (16.0%) | 16 (10.3%) | 9 (5.8%) | - | |
| Endometrioid | 14 (9.0%) | 9 (5.8%) | 5 (3.2%) | - | |
| Others | 19 (12.2%) | 14 (9.0%) | 5 (3.2%) | - | |
| CTCs, n (%) | 8.70 ± 3.85 | 8.73 ± 4.58 | 8.65 ± 3.49 | 0.913 | |
| M-CTC | 0.26 ± 0.18 | 0.26 ± 0.14 | 0.25 ± 0.20 | 0.719 | |
| Neutrophil (10^9/L) | 5.13 ± 1.82 | 5.28 ± 1.69 | 5.10 ± 1.93 | 0.554 | |
| Lymphocyte (10^9/L) | 1.32 ± 0.79 | 1.31 ± 0.84 | 1.40 ± 0.66 | 0.506 | |
| Platelet (10^9/L) | 342.16 ± 90.41 | 351.73 ± 77.38 | 339.94 ± 92.65 | 0.406 | |
| Albumin (g/L) | 41.97 ± 9.35 | 40.94 ± 8.37 | 42.28 ± 9.83 | 0.379 | |
| CA-125 (U/mL) | 996.57 ± 392.04 | 1003.24 ± 412.43 | 994.39 ± 379.56 | 0.898 | |
| CA-199 (U/mL) | 130.29 ± 52.30 | 123.73 ± 59.04 | 135.28 ± 47.57 | 0.228 | |
| AFP (ng/mL) | 5.82 ± 3.94 | 6.04 ± 3.32 | 5.36 ± 4.25 | 0.278 | |
| CEA (ng/mL) | 3.29 ± 2.63 | 3.09 ± 2.74 | 3.32 ± 2.48 | 0.615 | |
| HE4 (pmol/L) | 527.39 ± 73.01 | 535.39 ± 70.38 | 524.28 ± 80.39 | 0.381 | |
| CRP (mg/L) | 8.39 ± 2.10 | 8.20 ± 1.93 | 8.75 ± 2.13 | 0.110 | |
| HB (g/L) | 118.27 ± 18.39 | 120.38 ± 20.31 | 116.38 ± 16.47 | 0.226 | |
| Fibrinogen (g/L) | 4.27 ± 1.29 | 4.11 ± 0.83 | 4.32 ± 1.07 | 0.182 | |
| LDH (U/L) | 187.17 ± 19.83 | 190.38 ± 20.19 | 185.87 ± 15.48 | 0.165 | |
| ALT (U/L) | 28.38 ± 3.93 | 29.18 ± 5.20 | 27.91 ± 4.79 | 0.147 | |
| AST (U/L) | 30.37 ± 2.04 | 29.49 ± 4.47 | 30.74 ± 3.93 | 0.093 | |
| TBA (μmol/L) | 9.82 ± 2.09 | 10.18 ± 2.74 | 9.76 ± 3.38 | 0.409 | |
Abbreviations: BMI, body mass index; CTCs, circulating tumor cells; M-CTC, mesenchymal–CTC percentage; HB, hemoglobin; HE4, Human epididymis protein 4, CEA, Carcinoembryonic antigen, AFP, Alpha-fetoprotein; CRP, C-reaction protein; LDH, Lactate dehydrogenase; ALT, Alanine aminotransferase; AST, Aspartate aminotransferase; TBA, total bile acid.
Figure 2Differentiation of epithelial ovarian cancer (EOC) prognosis based on multiple preoperative blood biomarkers. (A) receiver operating characteristic (ROC) curves derived from logistic regression for single blood biomarkers. (B) the ROC curves derived from 8 supervised machine learning methods. The progression-free survival (PFS) analysis among (C) all patients; patients stratified by (D) circulating tumor cell (CTC) counts and (E) mesenchymal–CTC (M-CTC) percentage. The overall survival (OS) analysis among (F) all patients; patients stratified by (G) CTCs counts and (H) M-CTC percentage.
Correlation Between Preoperative Circulating Tumor Cell (CTC) Count and Clinicopathological Features of Epithelial Ovarian Cancer Patients
| Variable | CTC Count < 5 | CTC Count ≥ 5 | p-value |
|---|---|---|---|
| Age (years) | 57.92 ± 7.31 | 58.24 ± 8.02 | 0.795 |
| BMI (kg/m2) | 22.97 ± 0.98 | 23.06 ± 1.25 | 0.616 |
| Tumor size (cm) | 5.72 ± 1.43 | 6.22 ± 1.09 | 0.016 |
| Pathological grade, n (%) | 0.188 | ||
| G1-2 | 26 (16.67%) | 17 (10.90%) | - |
| G3 | 55 (35.26%) | 58 (37.18%) | - |
| Clinical stage, n (%) | 0.007 | ||
| I–II | 35 (22.44%) | 17 (20.90%) | |
| III–IV | 46 (29.49%) | 58 (37.18%) | |
| Histological type, n (%) | 0.849 | ||
| Serous | 53 (33.97%) | 45 (28.85%) | - |
| Mucinous | 12 (7.69%) | 13 (8.33%) | - |
| Endometrioid | 6 (3.85%) | 8 (5.13%) | - |
| Others | 10 (6.41%) | 9 (5.77%) | - |
| Neutrophil (10^9/L) | 5.18 ± 2.92 | 5.27 ± 2.41 | 0.298 |
| Lymphocyte (10^9/L) | 1.37 ± 0.79 | 1.53 ± 1.02 | 0.273 |
| Platelet (10^9/L) | 371.28 ± 86.02 | 359.43 ± 79.20 | 0.373 |
| Albumin (g/L) | 40.82 ± 8.02 | 41.39 ± 9.38 | 0.683 |
| CA-125 (U/mL) | 897.92 ± 293.59 | 1013.01 ± 385.24 | 0.037 |
| CA-199 (U/mL) | 129.40 ± 49.31 | 136.38 ± 40.48 | 0.338 |
| AFP (ng/mL) | 5.83 ± 3.35 | 6.02 ± 4.72 | 0.771 |
| CEA (ng/mL) | 3.28 ± 2.18 | 3.62 ± 1.97 | 0.310 |
| HE4 (pmol/l) | 539.48 ± 74.20 | 529.40 ± 80.38 | 0.417 |
| CRP (mg/L) | 7.89 ± 1.83 | 8.43 ± 2.18 | 0.095 |
| HB (g/L) | 120.26 ± 20.18 | 117.39 ± 17.72 | 0.348 |
| Fibrinogen (g/L) | 4.10 ± 1.53 | 4.31 ± 1.07 | 0.326 |
| LDH (U/L) | 184.27 ± 16.38 | 189.54 ± 20.30 | 0.075 |
| ALT (U/L) | 28.04 ± 3.18 | 29.18 ± 4.26 | 0.059 |
| AST (U/L) | 30.72 ± 1.98 | 30.29 ± 2.13 | 0.193 |
| TBA (μmol/L) | 9.27 ± 2.19 | 9.94 ± 3.72 | 0.169 |
Abbreviations: BMI, body mass index; CTC, circulating tumor cell; HB, hemoglobin; HE4, Human epididymis protein 4, CEA, Carcinoembryonic antigen, AFP, Alpha-fetoprotein; CRP, C-reaction protein; LDH, Lactate dehydrogenase; ALT, Alanine aminotransferase; AST, Aspartate aminotransferase; TBA, total bile acid.
Univariate and Multivariate Regression Analyses with Clinicopathologic Parameters for Epithelial Ovarian Cancer (EOC) Patient’s Prognosis
| Variables | Univariate Analysis | Multivariate Analysis | ||
|---|---|---|---|---|
| HR (95% CI) | P-value | HR (95% CI) | P-value | |
| Age | 1.28(1.09–1.47) | 0.033 | 1.19(1.04–1.49) | 0.112 |
| BMI | 1.10(0.85–2.19) | 0.341 | - | - |
| Tumor size | 1.32(1.10–1.79) | 0.042 | 1.27(0.95–1.86) | 0.113 |
| Pathological grade | ||||
| G1-2 vs G3 | 1.47(1.23–1.64) | 0.038 | 1.38(1.23–1.94) | 0.042 |
| Clinical stage | ||||
| I–II vs III–IV | 2.11(1.28–3.73) | 0.009 | 1.94(1.26–3.73) | 0.015 |
| Histological type | ||||
| Serous vs others | 1.19(0.90–2.20) | 0.235 | - | - |
| CTC count | ||||
| <5 vs ≥5 | 2.03(1.64–4.04) | 0.002 | 1.95(1.55–3.96) | 0.007 |
| M-CTC percentage | ||||
| <0.3 vs ≥0.3 | 1.74(1.54–2.57) | 0.005 | 1.84(1.48–2.64) | 0.009 |
| Neutrophil | 1.22(0.84–1.92) | 0.328 | - | - |
| Lymphocyte | 0.94(0.54–2.48) | 0.281 | - | - |
| Platelet | 1.43(0.89–1.74) | 0.136 | - | - |
| Albumin | 0.84(0.54–0.93) | 0.016 | 0.89(0.64–1.02) | 0.083 |
| CA-125 | 1.43(1.04–1.74) | 0.029 | 1.34(1.03–1.84) | 0.038 |
| CA-199 | 0.98(0.85–1.35) | 0.348 | - | - |
| AFP | 1.19(0.85–1.43) | 0.193 | - | - |
| CEA | 1.25(0.94–1.86) | 0.379 | - | - |
| HE4 | 1.34(0.84–1.63) | 0.283 | - | - |
| CRP | 1.47(1.04–2.92) | 0.037 | 1.36(1.29–2.80) | 0.041 |
| HB | 1.28(0.89–1.73) | 0.326 | - | - |
| Fibrinogen | 1.58(1.18–2.10) | 0.041 | 1.39(0.99–2.39) | 0.126 |
| LDH | 1.03(0.93–2.38) | 0.275 | - | - |
| ALT | 0.94(0.75–2.02) | 0.362 | - | - |
| AST | 0.99(0.85–1.95) | 0.286 | - | - |
| TBA | 1.25(0.85–2.05) | 0.321 | - | - |
Abbreviations: HR, hazard ratio; 95% CI, 95% confidence interval; BMI, body mass index; CTC, circulating tumor cell; M-CTC, mesenchymal–CTC; HB, hemoglobin; HE4, Human epididymis protein 4, CEA, Carcinoembryonic antigen, AFP, Alpha-fetoprotein; CRP, C-reaction protein; LDH, Lactate dehydrogenase; ALT, Alanine aminotransferase; AST, Aspartate aminotransferase; TBA, total bile acid.
Figure 3Prediction of clinical stages of epithelial ovarian cancer (EOC) with Random Forest (RF) classifier. (A) Receiver operating characteristic (ROC) curve for RF prediction of clinical stage based on circulating biomarkers with/without CTCs. (B) Variable importance for RF prediction of clinical stages measured by mean decrease in Gini index. (C) The box plot show distribution of the top eight important blood markers for RF prediction of clinical stages.
Figure 4Prediction of platinum-resistance of epithelial ovarian cancer (EOC) with the Random Forest (RF) classifier. (A) The receiver operating characteristic (ROC) curve for RF prediction of platinum-resistance based on circulating biomarkers with/without CTCs. (B) Variable importance for RF prediction of platinum-resistance measured by mean decrease in the Gini index. (C) The box plot shows the distribution of the top eight important blood markers for RF prediction of platinum-resistance.
Figure 5Unsupervised machine learning clustering associated with EOC prognosis. EOC patients were clustered into two groups by the unsupervised clustering analysis with RF classifier. Kaplan–Meier curves indicating progression-free survival (PFS) of each cluster in (A) all EOC patients, (B) early clinical stage group, and (C) advanced clinical stage group. Kaplan–Meier curves indicating overall survival (OS) of each cluster in (D) all EOC patients, (E) early clinical stage group, and (F) advanced clinical stage group. (G) Box plots showed the distribution of the top eight peripheral blood biomarkers between two clusters.