| Literature DB >> 27642363 |
Maolong Xi1, Jun Sun2, Li Liu3, Fangyun Fan2, Xiaojun Wu2.
Abstract
This paper focuses on the feature gene selection for cancer classification, which employs an optimization algorithm to select a subset of the genes. We propose a binary quantum-behaved particle swarm optimization (BQPSO) for cancer feature gene selection, coupling support vector machine (SVM) for cancer classification. First, the proposed BQPSO algorithm is described, which is a discretized version of original QPSO for binary 0-1 optimization problems. Then, we present the principle and procedure for cancer feature gene selection and cancer classification based on BQPSO and SVM with leave-one-out cross validation (LOOCV). Finally, the BQPSO coupling SVM (BQPSO/SVM), binary PSO coupling SVM (BPSO/SVM), and genetic algorithm coupling SVM (GA/SVM) are tested for feature gene selection and cancer classification on five microarray data sets, namely, Leukemia, Prostate, Colon, Lung, and Lymphoma. The experimental results show that BQPSO/SVM has significant advantages in accuracy, robustness, and the number of feature genes selected compared with the other two algorithms.Entities:
Mesh:
Year: 2016 PMID: 27642363 PMCID: PMC5013239 DOI: 10.1155/2016/3572705
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Binary coding of particle's position.
Pseudocode 1Pseudocode for obtaining mbest.
Pseudocode 2Pseudocode of the transformation.
Pseudocode 3The psudocode the BPSO/SVM.
Description for the test databases.
| Number | Name of data set | Number of examples | Number of genes | Classes |
|---|---|---|---|---|
| 1 | Leukemia | 72 | 7129 | 2 |
| 2 | Prostate | 102 | 12600 | 2 |
| 3 | Colon | 62 | 2000 | 2 |
| 4 | Lung | 181 | 12533 | 2 |
| 5 | Lymphoma | 77 | 7129 | 2 |
Description for the test databases.
| Number | Name of data set | Class 1 (quantity) | Class 2 (quantity) |
|---|---|---|---|
| 1 | Leukemia | AML (25) | ALL (47) |
| 2 | Prostate | N1 (50) | PC2 (52) |
| 3 | Colon | N3 (22) | CC4 (40) |
| 4 | Lung | MPM5 (31) | ADCA6 (150) |
| 5 | Lymphoma | DLBCL7 (58) | FL8 (19) |
1: normal, 2: prostate cancer, 3: normal, 4: colon cancer, 5: malignant pleural mesothelioma, 6: adenocarcinoma, 7: diffuse large B-cell lymphoma, and 8: follicular lymphoma.
BQPSO, BPSO, and GA parameters for gene subset selection and classification.
| BQPSO | |
|
| |
| Swarm size | 20 |
| Iteration | 100 |
| Dimension of particle | 1 |
|
| 1 |
|
| |
| BPSO | |
|
| |
| Swarm size | 20 |
| Iteration | 100 |
| Maximum of velocity | 6 |
| ( | (0.5, 2, 2) |
|
| |
| GA | |
|
| |
| Swarm size | 20 |
| Iteration | 100 |
| Probability of crossover | 0.9 |
| Probability of mutation | 0.04 |
Comparison of accuracy with the proposed algorithm, BPSO/SVM, and GA/SVM.
| Data set | BQPSO/SVM | BPSO/SVM | GA/SVM | |||
|---|---|---|---|---|---|---|
| Best | Mean | Best | Mean | Best | Mean | |
| Leukemia | 100 | 100 | 100 | 100 | 100 | 99.61 |
| Prostate | 100 | 99.25 | 99.02 | 99.02 | 98.04 | 96.00 |
| Colon | 93.55 | 92.52 | 91.94 | 91.94 | 91.94 | 88.65 |
| Lung | 100 | 99.96 | 100 | 99.96 | 100 | 99.87 |
| Lymphoma | 100 | 99.79 | 100 | 99.74 | 98.70 | 98.18 |
Figure 2The average number of genes selected by BQPSO/SVM, BPSO/SVM, and GA/SVM, respectively.
Comparison in terms of statistical results of BQPSO/SVM, BPSO/SVM, and GA/SVM.
| Data set | BQPSO/SVM | BPSO/SVM | GA/SVM | |||
|---|---|---|---|---|---|---|
| Best | Std. dev. | Best | Std. dev. | Best | Std. dev. | |
| Leukemia | 100 | 0 | 100 | 0 | 100 | 0.64 |
| Prostate | 100 | 0.43 | 99.02 | 0 | 98.04 | 1.20 |
| Colon | 93.55 | 0.79 | 91.94 | 0 | 91.94 | 1.89 |
| Lung | 100 | 0.15 | 100 | 0.15 | 100 | 0.24 |
| Lymphoma | 100 | 0.49 | 100 | 0.53 | 98.70 | 0.75 |
Top 5 genes with the highest selection frequency of colon data set.
| Data set | Accession number | Gene description |
|---|---|---|
| Colon | Z50753 |
|
| R87126 | Myosin heavy chain, nonmuscle ( | |
| X63629 |
| |
| M76378 | Human cysteine-rich protein (CRP) gene, exons 5 and 6 | |
| X53586 | Human mRNA for integrin alpha 6 |