| Literature DB >> 30472689 |
Noushin Jafarpisheh1, Mohammad Teshnehlab2.
Abstract
In the present era, enormous factors contribute to causing cancer. So cancer classification cannot rely only on doctor's thoughts. As a result, intelligent algorithms concerning doctor's help are inevitable. Therefore, the authors are motivated to suggest a novel algorithm to classify three cancer datasets; colon, ALL-AML, and leukaemia cancers. Their proposed algorithm is based on the deep neural network and emotional learning process. First of all, by applying the principal component analysis, they had a feature reduction. Then, they used deep neural as a feature extraction. Then, they implemented different classifiers; multi-layer perceptron, support vector machine (SVM), decision tree, and Gaussian mixture model. In the end, because in the real world, especially when working on systems biology, unpredictable events, and uncertainties are undeniable, the robustness of their model against uncertainties is important. So they added Gaussian noise to the input features of the first encoder in each dataset, then, they applied the stacked denoising method. Experimental results disclosed that, generally, using emotional learning increased the accuracy. In addition, the highest accuracy was gained by SVM, 91.66, 92.27, and 96.56% for colon, ALL-AML, and leukaemia, respectively. However, GMM led to the lowest accuracy. The best accuracy gained by GMM was 60%.Entities:
Mesh:
Year: 2018 PMID: 30472689 PMCID: PMC8687421 DOI: 10.1049/iet-syb.2018.5002
Source DB: PubMed Journal: IET Syst Biol ISSN: 1751-8849 Impact factor: 1.615
Number of whole samples and number of samples in each class for different cancer datasets
| Dataset | Number of samples | Number of features | Number of samples in each dataset |
|---|---|---|---|
| colon cancer [ | 62 | 2000 | 40 (tumour) and 22 (normal) |
| ALL‐AML [ | 72 | 7129 | 47 (ALL) and 25 (AML) |
| leukaemia [ | 72 | 7070 |
47 (−1) 25 (1) |
Fig. 1General overview of autoencoder and decoder structures in DNN
Fig. 2Number of features selected in each autoencoder. This structure is fixed for all three cancer datasets that are used here. Only the number of whole features (x) will be different for each dataset
Number of training and testing samples in each dataset
| Dataset | Number of samples for training | Number of samples for testing |
|---|---|---|
| colon cancer | 44 | 18 |
| ALL‐AML | 55 | 22 |
| leukaemia | 55 | 22 |
Results of classifying colon cancer dataset with 18 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
64.45% (12) |
67.78% (12) | 3.3778 | 2.1778 | 15 | 15 |
| SVM |
89.44% (16) |
91.66% (16) | 0.77 | 0.72 | 17 | 18 |
| DT |
70.56% (13) |
73.34% (13) | 5.79 | 3.73 | 16 | 16 |
| GMM |
47.78% (9) |
54.44% (10) | 17.6 | 16.18 | 13 | 17 |
Results of classifying ALL‐AML cancer dataset with 22 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
64.54% (14) |
65.45% (14) | 7.0667 | 3.8222 | 18 | 17 |
| SVM |
88.63% (19) |
92.27% (20) | 2.9444 | 0.9 | 21 | 22 |
| DT |
72.27% (16) |
74.09% (16) | 6.5444 | 6.0111 | 19 | 21 |
| GMM |
26.82% (6) |
29.09% (6) | 15.8778 | 12.0444 | 13 | 11 |
Results of classifying leukaemia cancer dataset with 22 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
86.36% (19) |
87.22% (19) | 2.011 | 1.12 | 21 | 21 |
| SVM |
95.45% (21) |
96.56% (21) | 1.8222 | 0.84 | 22 | 22 |
| DT |
81.82% (18) |
83.18% (18) | 2.011 | 5.57 | 21 | 22 |
| GMM |
50.09% (13) |
51.82% (11) | 12.2667 | 19.38 | 16 | 18 |
Results of classifying colon cancer dataset by stacked denoising method with 18 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
60.56% (11) |
70% (13) | 2.9889 | 4.2667 | 13 | 16 |
| SVM |
86.67% (16) |
90% (16) | 3.16 | 2.4 | 18 | 18 |
| DT |
72.22% (13) |
76.67% (14) | 6.0444 | 2.84 | 16 | 16 |
| GMM |
44.44% (8) |
53.89% (10) | 17.3889 | 25.34 | 15 | 17 |
Results of classifying ALL‐AML cancer dataset by stacked denoising method with 22 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
60.00% (13) |
64.55% (14) | 0.6222 | 3.0667 | 14 | 17 |
| SVM |
88.18% (19) |
92.27% (20) | 0.9333 | 1.1222 | 21 | 22 |
| DT |
65.91% (14) |
74.09% (16) | 1.8333 | 7.7889 | 20 | 20 |
| GMM |
35% (8) |
35.05% (8) | 16.2333 | 8.2667 | 15 | 12 |
Results of classifying leukaemia cancer dataset by stacked denoising method with 22 test samples
| Classifier | Mean of results ( |
| Variance of results ( |
| Best result ( |
|
|---|---|---|---|---|---|---|
| MLP |
81.82% (18) |
86% (19) | 3.4333 | 3.21 | 21 | 22 |
| SVM |
94.1% (21) |
94.54% (21) | 1.12 | 0.62 | 22 | 22 |
| DT |
77.27% (17) |
83.63% (18) | 3.5111 | 2.27 | 21 | 21 |
| GMM |
54.45% (10) |
60% (13) | 10.9444 | 8.84 | 15 | 20 |