| Literature DB >> 35515104 |
Wenle Li1,2, Yafeng Liu3,4, Wencai Liu5, Zhi-Ri Tang6, Shengtao Dong7, Wanying Li2, Kai Zhang1,2, Chan Xu2, Zhaohui Hu8, Haosheng Wang9, Zhi Lei10, Qiang Liu1, Chunxue Guo11, Chengliang Yin12.
Abstract
Background: Regional lymph node metastasis is a contributor for poor prognosis in osteosarcoma. However, studies on risk factors for predicting regional lymph node metastasis in osteosarcoma are scarce. This study aimed to develop and validate a model based on machine learning (ML) algorithms.Entities:
Keywords: SEER; lymph node metastasis; machine learning algorithm; multicenter; osteosarcoma; web calculator
Year: 2022 PMID: 35515104 PMCID: PMC9067126 DOI: 10.3389/fonc.2022.797103
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Description of the study population according to the presence of lymph node metastases.
| Variables | levels | Overall (N=1201) | lymph node metastases |
| |
|---|---|---|---|---|---|
| No (N=1088) | Yes* (N=113) | ||||
| Category [n(%)] | Multicenter | 107 (8.9) | 91 (8.4) | 16 (14.2) | 0.059 |
| SEER | 1094 (91.1) | 997 (91.6) | 97 (85.8) | ||
| Race [n(%)] | black | 163 (13.6) | 143 (13.1) | 20 (17.7) | 0.089 |
| other | 216 (18.0) | 190 (17.5) | 26 (23.0) | ||
| white | 822 (68.4) | 755 (69.4) | 67 (59.3) | ||
| Age(years) [M(Q1, Q3)] | 32.98 (24.08) | 32.41 (23.70) | 38.53 (26.89) | 0.01 | |
| Sex [n(%)] | female | 549 (45.7) | 492 (45.2) | 57 (50.4) | 0.336 |
| male | 652 (54.3) | 596 (54.8) | 56 (49.6) | ||
| Primary site [n(%)] | Axis bone | 313 (26.1) | 278 (25.6) | 35 (31.0) | 0.241 |
| Limb bone | 787 (65.5) | 721 (66.3) | 66 (58.4) | ||
| other | 101 (8.4) | 89 (8.2) | 12 (10.6) | ||
| Laterality [n(%)] | left | 514 (42.8) | 477 (43.8) | 37 (32.7) | 0.074 |
| Not a paired site | 161 (13.4) | 144 (13.2) | 17 (15.0) | ||
| right | 526 (43.8) | 467 (42.9) | 59 (52.2) | ||
| Stage group [n(%)] | I | 198 (16.5) | 193 (17.7) | 5 (4.4) | <0.001 |
| II | 562 (46.8) | 542 (49.8) | 20 (17.7) | ||
| III | 51 (4.2) | 48 (4.4) | 3 (2.7) | ||
| IV | 278 (23.1) | 211 (19.4) | 67 (59.3) | ||
| UNK stage | 112 (9.3) | 94 (8.6) | 18 (15.9) | ||
| T [n(%)] | T1 | 420 (35.0) | 395 (36.3) | 25 (22.1) | <0.001 |
| T2 | 562 (46.8) | 520 (47.8) | 42 (37.2) | ||
| T3 | 40 (3.3) | 34 (3.1) | 6 (5.3) | ||
| TX | 179 (14.9) | 139 (12.8) | 40 (35.4) | ||
| M [n(%)] | M0 | 931 (77.5) | 873 (80.2) | 58 (51.3) | <0.001 |
| M1 | 270 (22.5) | 215 (19.8) | 55 (48.7) | ||
| Surgery [n(%)] | No | 225 (18.7) | 175 (16.1) | 50 (44.2) | <0.001 |
| Yes | 976 (81.3) | 913 (83.9) | 63 (55.8) | ||
| Radiation [n(%)] | No | 1054 (87.8) | 959 (88.1) | 95 (84.1) | 0.269 |
| Yes | 147 (12.2) | 129 (11.9) | 18 (15.9) | ||
| Chemotherapy [n(%)] | No | 236 (19.7) | 202 (18.6) | 34 (30.1) | 0.005 |
| Yes | 965 (80.3) | 886 (81.4) | 79 (69.9) | ||
| Bone metastases [n(%)] | No | 1144 (95.3) | 1045 (96.0) | 99 (87.6) | <0.001 |
| Yes | 57 (4.7) | 43 (4.0) | 14 (12.4) | ||
| Survival times(month) [M(Q1, Q3)] | 24 (12, 48) | 25(13, 49) | 16(7, 31) | <0.001 | |
SEER, surveillance epidemiology and end results; T, stage; M, metastasis.
*indicates patients with unable to evaluate lymph node metastases were also included.
Single and multi-factor logistic regression analysis for the modeling group.
| Variables | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|
| OR (95% CI) | p value | OR (95% CI) | p value | |
| Age(years) | 1.010(1.002-1.018) | <0.05 | 0.997(0.986-1.008) | 0.574 |
| Survival time(month) | 0.984(0.973-0.994) | <0.01 | 0.999(0.999-1.011) | 0.835 |
| Race | ||||
| White | Ref | Ref | Ref | Ref |
| Black | 1.576(0.927-2.679) | 0.093 | / | / |
| Other | 1.138(0.567-2.285) | 0.716 | / | / |
| Sex | ||||
| Male | Ref | Ref | Ref | Ref |
| Female | 1.241(0.818-1.883) | 0.311 | / | / |
| Primary site | ||||
| Limb bones | Ref | Ref | Ref | Ref |
| Axis of a bone | 1.143(0.905-2.286) | 0.124 | / | / |
| other | 1.562(0.787-3.102) | 0.203 | / | / |
| Laterality | ||||
| Left | Ref | Ref | Ref | Ref |
| Right | 1.615(1.015-2.572) | <0.05 | 1.559(0.955-2.547) | 0.076 |
| Other | 1.625(0.865-3.052) | 0.131 | 0.997(0.487-2.039) | 0.992 |
| T | ||||
| T1 | Ref | Ref | Ref | Ref |
| T2 | 1.131(0.657-1.148) | 0.657 | 0.948(0.537-1.675) | 0.855 |
| T3 | 2.787(0.984-7.892) | 0.054 | 1.498(0.492-4.567) | 0.477 |
| TX | 4.179(2.371-7.363) | <0.001 | 2.330(1.265-4.292) | <0.01 |
| M | ||||
| M0 | Ref | Ref | Ref | Ref |
| M1 | 3.842(2.505-5.892) | <0.001 | 3.182(1.606-6.302) | <0.01 |
| Surgery | ||||
| No | Ref | Ref | Ref | Ref |
| Yes | 0.250(0.162-.0387) | <0.001 | 0.455(0.267-0.774) | <0.01 |
| Radiation | ||||
| No | Ref | Ref | Ref | Ref |
| Yes | 1.634(0.947-2.821) | 0.078 | / | / |
| Chemotherapy | ||||
| No | Ref | Ref | Ref | Ref |
| Yes | 0.491(0.312-0.771) | <0.01 | 0.510(0.287-0.906) | <0.05 |
| Bone metastases | ||||
| No | Ref | Ref | Ref | Ref |
| Yes | 3.378(1.707-6.682) | <0.001 | 1.371(0.631-1.655) | 0.425 |
| Lung metastases | ||||
| No | Ref | Ref | Ref | Ref |
| Yes | 2.823(1.800-4.428) | <0.001 | 0.826(0.412-1.655) | 0.589 |
OR, odds ratio; CI, confident interval; T, stage; M, metastasis.
Figure 110-fold cross-validation of machine learning algorithms.
Figure 2ROC curves for six ML algorithm models in predicting the risk of lung metastasis in osteosarcoma patients.
Performance of the validation group models.
| Models | AUC | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| LR | 0.753 | 0.720 | 0.625 | 0.736 |
| MLP | 0.783 | 0.766 | 0.750 | 0.769 |
| RF | 0.777 | 0.729 | 0.750 | 0.725 |
| DT | 0.784 | 0.720 | 0.688 | 0.725 |
| GBM | 0.810 | 0.729 | 0.813 | 0.714 |
| XGB | 0.874 | 0.851 | 0.750 | 0.868 |
AUC, area under the curve; LR, logistic regression; MLP, multilayer perceptron; RF, random forest; DT, the decision tree; GBM, gradient boosting machine; XGBoost, extreme gradient boosting.
Figure 3Relative importance ranking of features in six ML algorithms for predicting lymph node metastasis.
Figure 4The Web calculator built on the XGB model predicting lymph node metastasis in patients with osteosarcoma.