| Literature DB >> 36203663 |
Tianhao Li1, Honghong Huang2, Shuocun Zhang3, Yongdan Zhang4,5, Haoren Jing4,5, Tianwei Sun6, Xipeng Zhang4,5,7,8, Liangfu Lu2, Mingqing Zhang4,5,7,8.
Abstract
Background: This study aimed to develop an artificial intelligence predictive model for predicting the probability of developing BM in CRC patients.Entities:
Keywords: artificial intelligence; bone metastasis; colorectal cancer; machine learning; predictive model
Mesh:
Year: 2022 PMID: 36203663 PMCID: PMC9531117 DOI: 10.3389/fpubh.2022.984750
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1The analytical cohort and exclusion criteria.
Figure 2Feature correlation heatmap after initial preprocessing.
Figure 3The influence weight of each factor calculated by the random forest algorithm.
Figure 4Schematic diagram of SVM.
Figure 5Performance of SVM models with different kernel functions.
Clinical and pathological characteristics of training and test sets.
|
|
|
| |||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Age | 0.714 | ||||
| <60 | 13847(34.4) | 78(38.2) | 3447(34.2) | 15(40.5) | |
| >60 | 26401(65.6) | 126(61.8) | 6630(65.8) | 222(59.5) | |
| Sex | 0.182 | ||||
| Male | 21857(54.3) | 116(56.9) | 5399(53.6) | 20(54.1) | |
| Female | 18391(45.7) | 88(43.1) | 4678(46.4) | 17(45.9) | |
| Primary tumor site | 0.922 | ||||
| Colon | 30131(74.9) | 135(66.2) | 7552(74.9) | 20(54.7) | |
| Rectal | 10117(25.1) | 69(33.8) | 2525(25.1) | 17(45.9) | |
| Size | 0.91 | ||||
| <2 cm | 4936(11.5) | 1(0.5) | 1150(11.4) | 2(5.4) | |
| 2–5 cm | 21397(53.2) | 108(52.9) | 5385(53.4) | 16(43.2) | |
| >5 cm | 14215(35.3) | 95(46.6) | 3542(35.1) | 19(51.4) | |
| Histology | 0.947 | ||||
| Adenocarcinoma | 37405(92.9) | 189(92.6) | 9366(92.9) | 34(91.9) | |
| Mucosal adenocarcinoma | 2542(6.3) | 8(3.9) | 633(6.3) | 1(2.7) | |
| Signet-ring cell carcinoma | 301(0.7) | 7(3.4) | 78(0.8) | 2(5.4) | |
| T stage | 0.839 | ||||
| T1/2 | 10411(25.9) | 28(13.7) | 2616(26) | 4(10.8) | |
| T3/4 | 29837(74.1) | 176(86.3) | 7461(74) | 33(89.2) | |
| N stage | 0.108 | ||||
| N0 | 22046(54.8) | 60(29.4) | 5607(55.6) | 10(27) | |
| N1/2 | 18201(45.2) | 144(70.6) | 4470(44.4) | 27(73) | |
| Grade | 0.566 | ||||
| Grade I–II | 34486(85.7) | 140(68.6) | 8655(85.9) | 25(67.6) | |
| Grade III–IV | 5762(14.3) | 64(31.4) | 1422(14.1) | 12(32.4) | |
| CEA level | 0.242 | ||||
| Negative | 23446(58.3) | 32(15.7) | 5930(58.8) | 5(13.5) | |
| Positive | 16802(41.7) | 172(84.3) | 4147(41.2) | 32(86.5) | |
| Extraosseous metastases | 0.012 | ||||
| No | 36227(90) | 64(31.4) | 9153(90.8) | 6(90.6) | |
| Yes | 4021(10) | 140(68.6) | 924(9.2) | 31(9.4) | |
| Brain metastasis | 0.497 | ||||
| No | 40219(99.9) | 196(96.1) | 10071(99.9) | 36(97.3) | |
| Yes | 29(0.1) | 8(3.9) | 6(0.1) | 1(2.7) | |
| Liver metastasis | 0.019 | ||||
| No | 36655(91.1) | 79(38.7) | 9249(91.8) | 11(29.7) | |
| Yes | 3593(8.9) | 125(61.3) | 828(8.2) | 26(70.3) | |
| Lung metastasis | 0.704 | ||||
| No | 39263(97.6) | 138(67.6) | 9838(97.6) | 20(54.1) | |
| Yes | 985(2.4) | 66(32.4) | 239(2.4) | 17(45.9) | |
Multivariable logistic regression model with enter variable selection.
|
|
|
|
|---|---|---|
|
| ||
| <60 years | Reference | |
| >60 years | 0.155(0.862–1.548) | 0.333 |
|
| ||
| Male | Reference | 0.72 |
| Female | 0.949(0.715–1.261) | |
|
| <0.001 | |
| Colon | Reference | |
| Rectal | 1.88(1.39–2.543) | |
|
| ||
| <2 cm | Reference | |
| 2–5 cm | 11.96(1.661–86.47) | 0.014 |
| >5 cm | 10.868(1.504–78.531) | 0.018 |
|
| ||
| Adenocarcinoma | Reference | |
| Mucosal adenocarcinoma | 0.699(0.341–1.433) | 0.328 |
| Signet-ring cell carcinoma | 3.035(1.316–6.998) | 0.009 |
|
| ||
| N0 | Reference | |
| N1 | 1.123(0.815–1.548) | 0.479 |
|
| <0.001 | |
| Grade I–II | Reference | |
| Grade III–IV | 2.118(1.537–2.92) | |
|
| <0.001 | |
| Negative | Reference | |
| Positive | 2.879(1.908–4.344) | |
|
| <0.001 | |
| No | Reference | |
| Yes | 12.207(8.805–16.923) |
Figure 6Results of Pearson correlation analysis between all variables. The heatmap shows the correlation between the variables.
Comparing the prediction performances of different models for BM.
|
|
|
|
|
|---|---|---|---|
| SVM |
|
| 0.838 |
| LR | 0.918 | 0.865 |
|
| DT | 0.770 | 0.850 | 0.703 |
| RF | 0.770 | 0.850 | 0.676 |
| XGB | 0.873 | 0.882 | 0.838 |
Figure 7ROC curve, in which the new model refers to SVM and the old one refers to LR.
Figure 8IDI curve, in which the new model refers to SVM and the old one refers to LR.