| Literature DB >> 35844460 |
Yanfeng Wang1, Wenhao Zhang1, Junwei Sun1, Lidong Wang2, Xin Song2, Xueke Zhao2.
Abstract
Esophageal squamous cell carcinoma (ESCC) is one of the highest incidence and mortality cancers in the world. An effective survival prediction model can improve the quality of patients' survival. Therefore, a parameter-optimized deep belief network based on the improved Archimedes optimization algorithm is proposed in this paper for the survival prediction of patients with ESCC. Firstly, a combination of features significantly associated with the survival of patients is found by the minimum redundancy and maximum relevancy (MRMR) algorithm. Secondly, a DBN network is introduced to make predictions for survival of patients. Aiming at the problem that the deep belief network model is affected by parameters in the construction process, this paper uses the Archimedes optimization algorithm to optimize the learning rate α and batch size β of DBN. In order to overcome the problem that AOA is prone to fall into local optimum and low search accuracy, an improved Archimedes optimization algorithm (IAOA) is proposed. On this basis, a survival prediction model for patients with ESCC is constructed. Finally, accuracy comparison tests are carried out on IAOA-DBN, AOA-DBN, SSA-DBN, PSO-DBN, BES-DBN, IAOA-SVM, and IAOA-BPNN models. The results show that the IAOA-DBN model can effectively predict the five-year survival rate of patients and provide a reference for the clinical judgment of patients with ESCC.Entities:
Mesh:
Year: 2022 PMID: 35844460 PMCID: PMC9286952 DOI: 10.1155/2022/1924906
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Population proportion information of the dataset.
| Project | Category | Number of population | Percentage of population |
|---|---|---|---|
| Genders | Male | 186 | 62.4% |
| Female | 112 | 37.6% | |
| Ages | ≤61.5 | 192 | 64.4% |
| 61.5 | 112 | 37.6% | |
| T stages | T1 | 42 | 14.1% |
| T2 | 89 | 29.9% | |
| T3 | 165 | 55.4% | |
| T4 | 2 | 0.6% | |
| N stages | N0 | 170 | 57.1% |
| N1 | 80 | 26.8% | |
| N2 | 34 | 11.4% | |
| N3 | 14 | 4.7% | |
| TNM stages | 1 | 37 | 12.4% |
| 2 | 139 | 46.6% | |
| 3 | 106 | 35.6% | |
| 4 | 16 | 5.4% |
Basic information about seventeen blood indicators.
| Variable | Mean | Median (range) | Variance | Standard deviation |
|---|---|---|---|---|
| BASO | 0.050 | 0 (0-1) | 0.014 | 0.118 |
| EO | 0.144 | 0.1 (0-3) | 0.074 | 0.272 |
| FIB | 379.262 | 362.811 (167.613-909.725) | 924.038 | 30.398 |
| PTL | 226.289 | 227.5 (45-448) | 62.902 | 7.931 |
| ALB | 42.077 | 42 (27-59) | 25.055 | 5.005 |
| HGB | 137.742 | 139 (95-189) | 227.216 | 15.074 |
| WBC | 6.564 | 6.1 (2.18-15.3) | 4.077 | 2.019 |
| MONO | 0.406 | 0 (0-1) | 0.091 | 0.301 |
| APTT | 35.929 | 35.1 (15.4-78.5) | 62.479 | 7.904 |
| GLOB | 29.077 | 29 (17-45) | 26.240 | 5.122 |
| RBC | 4.452 | 4.5 (2.93-6.04) | 0.224 | 0.473 |
| PT | 10.322 | 10.2 (7-16.5) | 2.834 | 1.684 |
| LYMPH | 1.930 | 1.905 (0-8) | 0.479 | 0.692 |
| NEUT | 3.864 | 3.5 (0-10.6) | 2.829 | 1.682 |
| TP | 71.070 | 71 (50-92) | 51.971 | 7.209 |
| INR | 0.796 | 0.78 (0.45-1.64) | 0.034 | 0.185 |
| TT | 15.569 | 15.7 (1.3-46.5) | 6.629 | 2.575 |
The unit of WBC, LYMPH, GLOB, ALB, RBC, BASO, EO, NEUT, TP, HGB, and PLT is g/L. The unit of PT, TT, APTT is second(s). The unit of FIB is mg/L.
Algorithm 1Framework of MRMR.
Figure 1The change of classification accuracy with the number of feature variables.
Figure 2Survival function at the mean of the covariate. The survival years are taken as the time, the eleven indicators obtained from minimum redundancy and maximum relevancy algorithm.
Algorithm 2Framework of IAOA.
Baseline test functions.
| Funs | Name | Range | Optimum |
|---|---|---|---|
| F1 | Sphere | [−100,100] | Min = 0 |
| F2 | Schwefel2.22 | [−10, 10] | Min = 0 |
| F3 | Schwefel1.20 | [−100,100] | Min = 0 |
| F4 | Schwefel2.21 | [−100,100] | Min = 0 |
| F5 | Quartic | [−1.28,1.28] | Min = 0 |
| F6 | Rastraign | [−32, 32] | Min = 0 |
| F7 | Ackley | [−600,600] | Min = 0 |
| F8 | Griewank | [−600,600] | Min = 0 |
| F9 | Penalized 1 | [−50, 50] | Min = 0 |
| F10 | Penalized 2 | [−50, 50] | Min = 0 |
| F11 | Kowalik's | [−5, 5]2 | 3.08 |
| F12 | Six-hump | [−5, 5]2 | −1.03 |
| F13 | Branin | [-5,10]∪[0, 15]2 | 3.98 |
Algorithm parameter settings.
| Algorithm | Main parameters |
|---|---|
| AOA | |
| IAOA |
|
| SSA | ST = 0.6, PD = 0.7, SD = 0.2 |
| BES |
|
Comparison with the results of 3 metaheuristic algorithms.
| Statistics | Algorithm | F1 | F2 | F3 | F4 | F5 | F6 | F7 |
|---|---|---|---|---|---|---|---|---|
| Best | IAOA | 0.00 | 0.00 | 0.00 | 0.00 | 5.26 | 0.00 | 8.88 |
| AOA | 4.93 | 5.61 | 1.27 | 1.55 | 9.09 | 0.00 | 8.88 | |
| SSA | 0.00 | 0.00 | 0.00 | 0.00 | 3.54 | 0.00 | 8.88 | |
| BES | 5.31 | 8.70 | 8.10 | 8.13 | 5.36 | 0.00 | 8.88 | |
| Mean | IAOA | 0.00 | 0.00 | 0.00 | 0.00 | 9.89 | 0.00 | 8.88 |
| AOA | 2.96 | 3.69 | 9.36 | 1.23 | 6.76 | 5.46 | 2.31 | |
| SSA | 1.58 | 3.17 | 7.63 | 1.62 | 7.50 | 0.00 | 8.88 | |
| BES | 4.38 | 1.17 | 4.47 | 1.51 | 2.39 | 3.64 | 8.03 | |
| Worst | IAOA | 0.00 | 0.00 | 0.00 | 0.00 | 4.39 | 0.00 | 8.88 |
| AOA | 5.17 | 1.06 | 2.74 | 2.28 | 2.35 | 1.64 | 4.44 | |
| SSA | 4.74 | 9.51 | 2.29 | 4.87 | 2.04 | 0.00 | 8.88 | |
| BES | 4.59 | 1.93 | 1.02 | 6.02 | 8.91 | 1.49 | 2.41 | |
| Std | IAOA | 0.00 | 0.00 | 0.00 | 0.00 | 1.03 | 0.00 | 0.00 |
| AOA | 8.66 | 1.94 | 5.00 | 4.44 | 5.35 | 2.99 | 1.60 | |
| SSA | 1.55 | 1.74 | 4.18 | 8.89 | 5.47 | 0.00 | 0.00 | |
| BES | 1.08 | 3.55 | 1.93 | 2.23 | 2.11 | 5.61 | 4.40 |
Continuation of Table 5.
| Statistics | Algorithm | F8 | F9 | F10 | F11 | F12 | F13 |
|---|---|---|---|---|---|---|---|
| Best | IAOA | 0.00 | 1.04 | 4.32 | 3.19 | −1.03 | 3.98 |
| AOA | 0.00 | 4.85 | 2.60 | 3.35 | −1.03 | 3.98 | |
| SSA | 0.00 | 5.02 | 2.21 | 3.08 | −1.03 | 3.98 | |
| BES | 0.00 | 7.53 | 8.05 | 3.07 | −1.03 | 3.98 | |
| Mean | IAOA | 0.00 | 7.03 | 6.96 | 4.68 | −1.03 | 3.98 |
| AOA | 0.00 | 8.16 | 2.89 | 9.10 | −1.03 | 3.98 | |
| SSA | 0.00 | 2.42 | 1.49 | 3.74 | −1.03 | 3.98 | |
| BES | 0.00 | 1.04 | 5.03 | 3.18 | −1.03 | 3.98 | |
| Worst | IAOA | 0.00 | 3.86 | 3.93 | 1.32 | −1.03 | 3.98 |
| AOA | 0.00 | 1.15 | 2.99 | 5.91 | −1.03 | 3.98 | |
| SSA | 0.00 | 8.30 | 1.32 | 1.22 | −1.03 | 3.98 | |
| BES | 0.00 | 1.04 | 2.33 | 2.04 | −1.03 | 3.98 | |
| Std | IAOA | 0.00 | 1.09 | 1.16 | 1.68 | 1.79 | 5.48 |
| AOA | 0.00 | 1.78 | 8.39 | 9.73 | 3.93 | 3.89 | |
| SSA | 0.00 | 1.87 | 3.88 | 2.53 | 4.72 | 8.16 | |
| BES | 0.00 | 3.16 | 7.47 | 6.86 | 1.65 | 7.18 |
Figure 3Schematic diagram of the DBN structure.
Figure 4Schematic diagram of RBM structure.
Figure 5The framework of IAOA-DBN.
Figure 6ESCC patient survival prediction model.
Comparison of different algorithms for predicting five-year survival of patients with esophageal squamous cell carcinoma.
| Algorithm | 10-fold cross-validation accuracy | |
|---|---|---|
| Eleven indicators | All indicators | |
| IAOA-DBN | 89.66% | 88.13% |
| AOA-DBN | 87.46% | 86.24% |
| SSA-DBN | 88.14% | 86.93% |
| PSO-DBN | 86.78% | 85.46% |
| BES-DBN | 87.29% | 86.12% |
| IAOA-SVM | 86.27% | 85.19% |
| IAOA-BPNN | 86.61% | 85.32% |
Comparison of the results of different algorithms.
| Algorithm | 10-fold cross-validation accuracy |
|---|---|
| IAOA-DBN | 97.538% |
| AOA-DBN | 96.974% |
| SSA-DBN | 97.177% |
| PSO-DBN | 96.629% |
| BES-DBN | 96.829% |
| IAOA-SVM | 95.602% |
| IAOA-BPNN | 95.038% |