| Literature DB >> 22924059 |
Mohammad Javad Abdi1, Seyed Mohammad Hosseini, Mansoor Rezghi.
Abstract
We develop a detection model based on support vector machines (SVMs) and particle swarm optimization (PSO) for gene selection and tumor classification problems. The proposed model consists of two stages: first, the well-known minimum redundancy-maximum relevance (mRMR) method is applied to preselect genes that have the highest relevance with the target class and are maximally dissimilar to each other. Then, PSO is proposed to form a novel weighted SVM (WSVM) to classify samples. In this WSVM, PSO not only discards redundant genes, but also especially takes into account the degree of importance of each gene and assigns diverse weights to the different genes. We also use PSO to find appropriate kernel parameters since the choice of gene weights influences the optimal kernel parameters and vice versa. Experimental results show that the proposed mRMR-PSO-WSVM model achieves highest classification accuracy on two popular leukemia and colon gene expression datasets obtained from DNA microarrays. Therefore, we can conclude that our proposed method is very promising compared to the previously reported results.Entities:
Mesh:
Year: 2012 PMID: 22924059 PMCID: PMC3424529 DOI: 10.1155/2012/320698
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Solution representation.
Figure 2The process of classification by mRMR-PSO-WSVM.
The values of the statistical parameters of the classifiers.
| Methods/datasets | Leukemia | Colon | ||
|---|---|---|---|---|
| Acc (%) | Selected genes | Acc (%) | Selected genes | |
| SVM | 90.28 | 7129 | 83.87 | 2000 |
| mRMR-SVM | 97.22 | 50 | 83.87 | 50 |
| PSO-SVM1 | 94.44 | 22.5 | 85.48 | 20.1 |
| PSO-SVM2 | 93.06 | 7129 | 87.01 | 2000 |
| mRMR-PSO-SVM | 100 | 17.7 | 90.32 | 10.3 |
|
|
|
|
|
|
Detailed information of gene expression datasets.
| Dataset name | Number of | |||
|---|---|---|---|---|
| Samples | Categories | Genes | ||
| Leukemia | Acute myeloid leukemia | 25 | 2 | 7129 |
| Acute lymphoblastic leukemia | 47 | |||
|
| ||||
| Colon | Cancerous colon tissues | 40 | 2 | 2000 |
| Normal colon tissues | 22 | |||
PSO parameters.
| Parameters | Values |
|---|---|
| Swarm size | 50 |
| The inertia weight | 0.9 |
| Accelration constants | 2 |
| Maximum number of iterations | 70 |
Classification accuracy of our method with other methods from literature (under 10-fold cross validation).
| (Authors, year) | Method | Leukemia | Colon | ||
|---|---|---|---|---|---|
| Acc (%) | S. G. | Acc (%) | S. G. | ||
| (Ruiz et al., 2006) [ | NB-FCBF | 95.9 | 48.5 | 77.6 | 14.6 |
| (Shen et al., 2007) [ | PSOSVM | N. C. | N. C. | 91.67 | 4.00 |
| (Li et al., 2008) [ | Single PSO | 94.6 | 22.3 | 87.1 | 19.8 |
| (Li et al., 2008) [ | Single GA | 94.6 | 23.1 | 87.1 | 17.5 |
| (Li et al., 2008) [ | Hybrid PSO/GA | 97.2 | 18.7 | 91.90 | 18.00 |
| (Shen et al., 2008) [ | HPSOTS | 98.61 | 7.00 | 93.32 | 8.00 |
|
|
|
|
|
|
|
∗S. G. and N. C. denote selected genes and not considered, respectively.
Classification accuracy of our method with other methods from literature (under LOOCV).
| (Authors, year) | Method | Leukemia | Colon | ||
|---|---|---|---|---|---|
| Acc (%) | S. G. | Acc (%) | S. G. | ||
| (Mohamad et al., 2007) [ | IG + NewGASVM | 94.71 | 20.00 | N. C. | N. C. |
| (El Akadi et al., 2011) [ | mRMR-GA | 100 | 15.00 | 85.48 | 15.00 |
|
|
|
|
|
|
|
∗S. G. and N. C. denote selected genes and not considered, respectively.