| Literature DB >> 34923833 |
Seri Jeong1, Dae-Soon Son2, Minseob Cho2, Nuri Lee1, Wonkeun Song1, Saeam Shin3, Sung-Ho Park4, Dong Jin Lee5, Min-Jeong Park1.
Abstract
BACKGROUND: The differential diagnosis of ovarian cancer is important, and there has been ongoing research to identify biomarkers with higher performance. This study aimed to evaluate the diagnostic utility of combinations of cancer markers classified by machine learning algorithms in patients with early stage ovarian cancer, which has rarely been reported.Entities:
Keywords: cancer antigen 125; human epididymis protein 4; lactate dehydrogenase; machine learning; neutrophil-to-lymphocyte ratio; ovarian cancer
Mesh:
Substances:
Year: 2021 PMID: 34923833 PMCID: PMC8704186 DOI: 10.1177/10732748211033401
Source DB: PubMed Journal: Cancer Control ISSN: 1073-2748 Impact factor: 3.302
Basic Characteristics and Laboratory Results Related to Ovarian Cancer of the Study Population.
| Variable
| Ovarian cancer | Non-cancer control | |
|---|---|---|---|
| Age, years | 54.0 (48.7-62.0) | 49.0 (37.0-55.3) | <.001 |
| Menopause | 40 (75.5) | 342 (50.5) | .003 |
| BMI, kg/m2 | 23.4 (20.8-26.1) | 22.9 (20.9-25.2) | .490 |
| Cancer marker | |||
| ROMA, % | 15.3 (5.4-87.8) | 6.0 (3.5-10.5) | <.001 |
| HE4, pmol/L | 48.3 (34.5-210.2) | 36.9 (30.9-45.8) | <.001 |
| CA 125, U/mL | 27.4 (14.7-420.3) | 18.4 (11.5-37.9) | .003 |
| CEA, ng/mL | .8 (.1-1.4) | .7 (.3-1.3) | .607 |
| Hematology | |||
| Hemoglobin, g/dL | 12.6 (10.8-13.5) | 12.8 (11.8-13.5) | .364 |
| WBC, ×109/L | 6.3 (4.9-7.4) | 6.3 (5.1-8.1) | .789 |
| Neutrophil, % | 66.9 (58.0-77.5) | 63.6 (56.1-69.7) | .040 |
| Lymphocyte % | 23.3 (14.7-32.5) | 27.7 (21.9-33.5) | .027 |
| Neutrophil-to-lymphocyte ratio | 2.9 (1.8-5.5) | 2.2 (1.7-3.2) | .008 |
| Monocyte, % | 4.2 (3.7-5.7) | 4.7 (3.8-5.6) | .607 |
| Monocyte-to-lymphocyte ratio | .2 (.1-.3) | .2 (.1-.2) | .027 |
| Platelet, ×109/L | 275.0 (223.7-320.7) | 255.0 (215.0-296.0) | .051 |
| Chemistry | |||
| Creatinine, mg/dL | .6 (.6-.7) | .6 (.6-.7) | .963 |
| Albumin, g/dL | 4.3 (4.0-4.6) | 4.5 (4.3-4.7) | .027 |
| LD, IU/L | 202.0 (176.7-261.7) | 183.0 (165.0-208.0) | <.001 |
| Smoking | 2 (3.8) | 48 (7.1) | .485 |
Abbreviations: BMI, body mass index; CA125, cancer antigen 125; CEA, carcinoembryonic antigen; HE4, human epididymis protein 4; LD, lactate dehydrogenase; ROMA, risk of ovarian malignancy algorithm; WBC, white blood cell.
aData are expressed as median (first to third quartiles) or number (percentage).
bAdjusted using the Benjamini–Hochberg method after Pearson’s chi-square test for nominal variables and the Mann–Whitney U test for continuous variables.
Performance of Single Ovarian Cancer Markers.
| Menopause state | Markers | ROC-AUC | Sensitivity
|
|---|---|---|---|
| Total | ROMA | .707 (.623-.792) | 60.4 (46.0-73.6) |
| HE4 | .680 (.596-.765) | 54.7 (40.5-68.4) | |
| CA125 | .643 (.553-.733) | 49.1 (35.1-63.2) | |
| LD | .657 (.577-.737) | 49.1 (35.1-63.2) | |
| NLR | .624 (.533-.716) | 45.3 (31.6-60.0) | |
| Pre-menopause | ROMA | .580 (.411-.749) | 30.8 (9.1-61.4) |
| HE4 | .589 (.423-.755) | 38.5 (13.9-68.4) | |
| CA125 | .540 (.372-.708) | 30.8 (9.1-61.4) | |
| LD | .586 (.413-.759) | 30.8 (9.1-61.4) | |
| NLR | .609 (.433-.786) | 46.2 (19.2-74.9) | |
| Post-menopause | ROMA | .685 (.577-.793) | 57.5 (40.9-73.0) |
| HE4 | .684 (.586-.782) | 55.0 (38.5-70.7) | |
| CA125 | .693 (.590-.796) | 52.5 (36.1-68.5) | |
| LD | .635 (.539-.730) | 45.0 (29.3-61.5) | |
| NLR | .623 (.515-.732) | 47.5 (31.5-63.9) |
Abbreviations: CA125, cancer antigen 125; HE4, human epididymis protein 4; LD, lactate dehydrogenase; NLR, neutrophil-to-lymphocyte ratio; ROC-AUC, areas under the receiver operating characteristic curve; ROMA, risk of ovarian malignancy algorithm.
aData are shown as value (95% confidence interval).
bSensitivities at 75.0% specificities are presented.
Performance of Ovarian Cancer Markers in Combination.
| Menopause state | Markers | ROC-AUC | Sensitivity
|
|---|---|---|---|
| Total | ROMA+LD | .709 (.624-.794) | 58.5 (44.1-71.9) |
| HE4+LD | .692 (.611-.773) | 58.5 (44.1-71.9) | |
| CA125+LD | .698 (.616-.780) | 56.6 (42.3-70.2) | |
| NLR+LD | .690 (.608-.771) | 50.9 (36.8-64.9) | |
| ROMA+NLR+LD | .708 (.621-.795) | 58.5 (44.1-71.9) | |
| HE4+CA125+LD | .705 (.621-.789) | 58.5 (44.1-71.9) | |
| HE4+NLR+LD | .698 (.615-.782) | 52.8 (38.6-66.7) | |
| CA125+NLR+LD | .690 (.605-.775) | 49.1 (35.1-63.2) | |
| HE4+CA125+NLR+LD | .696 (.610-.783) | 52.8 (38.6-66.7) | |
| Pre-menopause | ROMA+LD | .556 (.379-.734) | 30.8 (9.1-61.4) |
| HE4+LD | .584 (.406-.761) | 30.8 (9.1-61.4) | |
| CA125+LD | .572 (.396-.749) | 30.8 (9.1-61.4) | |
| NLR+LD | .600 (.414-.786) | 38.5 (13.9-68.4) | |
| ROMA+NLR+LD | .583 (.399-.767) | 30.8 (9.1-61.4) | |
| HE4+CA125+LD | .568 (.389-.747) | 30.8 (9.1-61.4) | |
| HE4+NLR+LD | .600 (.411-.789) | 46.2 (19.2-74.9) | |
| CA125+NLR+LD | .591 (.401-.781) | 46.2 (19.2-74.9) | |
| HE4+CA125+NLR+LD | .593 (.402-.784) | 46.2 (19.2-74.9) | |
| Post-menopause | ROMA+LD | .694 (.589-.798) | 60.0 (43.3-75.1) |
| HE4+LD | .687 (.593-.781) | 55.0 (38.5-70.7) | |
| CA125+LD | .708 (.614-.802) | 55.0 (38.5-70.7) | |
| NLR+LD | .689 (.596-.781) | 50.0 (33.8-66.2) | |
| ROMA+NLR+LD | .702 (.599-.805) | 57.5 (40.9-73.0) | |
| HE4+CA-125+LD | .718 (.623-.814) | 62.5 (45.8-77.3) | |
| HE4+NLR+LD | .701 (.607-.796) | 55.0 (38.5-70.7) | |
| CA125+NLR+LD | .703 (.606-.799) | 52.5 (36.1-68.5) | |
| HE4+CA125+NLR+LD | .712 (.614-.809) | 57.5 (40.9-73.0) |
Abbreviations: CA125, cancer antigen 125; HE4, human epididymis protein 4; LD, lactate dehydrogenase; NLR, neutrophil-to-lymphocyte ratio; ROC-AUC, areas under the receiver operating characteristic curve; ROMA, risk of ovarian malignancy algorithm.
aData are shown as value (95% confidence interval).
bSensitivities at 75.0% specificities are presented.
Figure 1.Performance of combined markers classified using machine learning for predicting ovarian cancer. (A) ROC curves of ROMA + LD determined by classification tree, bagging, random forest, adaptive boosting, support vector machine, and K-nearest neighbor analyses for distinguishing ovarian cancer from the non-cancer controls, (B) ROC curve of HE4 + CA125 + NLR + LD in the postmenopausal group. The areas under the ROC curves (AUCs) of combined markers are presented in brackets. Abbreviations: AdaBoost, adaptive boosting; CA125, cancer antigen 125; HE4, human epididymis protein 4; KNN, K-nearest neighbor; LD, lactate dehydrogenase; ROC, Receiver operating characteristic; ROMA, risk of ovarian malignancy algorithm; SVM, support vector machine; Tree, classification tree.