| Literature DB >> 26484222 |
D Ramyachitra1, M Sofia1, P Manikandan1.
Abstract
Microarray technology allows simultaneous measurement of the expression levels of thousands of genes within a biological tissue sample. The fundamental power of microarrays lies within the ability to conduct parallel surveys of gene expression using microarray data. The classification of tissue samples based on gene expression data is an important problem in medical diagnosis of diseases such as cancer. In gene expression data, the number of genes is usually very high compared to the number of data samples. Thus the difficulty that lies with data are of high dimensionality and the sample size is small. This research work addresses the problem by classifying resultant dataset using the existing algorithms such as Support Vector Machine (SVM), K-nearest neighbor (KNN), Interval Valued Classification (IVC) and the improvised Interval Value based Particle Swarm Optimization (IVPSO) algorithm. Thus the results show that the IVPSO algorithm outperformed compared with other algorithms under several performance evaluation functions.Entities:
Keywords: Gene selection; Interval-value based Particle Swarm Optimization classification; Interval-value classification; Microarray; Particle swarm optimization; Tissue sample classification
Year: 2015 PMID: 26484222 PMCID: PMC4583628 DOI: 10.1016/j.gdata.2015.04.027
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Performance comparison of existing and proposed methods for the leukemia dataset.
| Algorithms/performance metrics | TP rate | FP rate | Precision | Accuracy |
|---|---|---|---|---|
| Support Vector Machine | 70.97 | 28.61 | 43.75 | 69.01 |
| K-Nearest Neighbor | 80.27 | 22.2 | 90.0 | 71.28 |
| Interval Valued Classification | 85.0 | 60.0 | 94.4 | 78.26 |
| Particle Swarm Optimization | 90.0 | 22.6 | 83.35 | 81.8 |
| Interval Value based Particle Swarm Optimization | 100 | 0.0 | 90.0 | 96.88 |
Fig. 2Performance comparison of existing and proposed methods for the leukemia dataset.
Performance comparison of existing and proposed methods for breast cancer dataset.
| Algorithms/performance metrics | TP rate | FP rate | Precision | Accuracy |
|---|---|---|---|---|
| Support Vector Machine | 71.26 | 29.45 | 70.75 | 71.87 |
| K Nearest Neighbor | 76.8 | 27.24 | 75.95 | 67.29 |
| Interval Valued Classification | 80.1 | 25.24 | 75.66 | 74.86 |
| Particle Swarm Optimization | 82.8 | 20.86 | 79.87 | 84.63 |
| Interval Value based Particle Swarm Optimization | 90.16 | 17.17 | 83.9 | 92.24 |
Fig. 3Performance comparison of existing and proposed methods for the breast cancer dataset.
Performance comparison of existing and proposed methods for lung cancer dataset.
| Algorithms/performance metrics | TP rate | FP rate | Precision | Accuracy |
|---|---|---|---|---|
| Support Vector Machine | 71.30 | 28.4 | 71.4 | 70.55 |
| K Nearest Neighbor | 77.8 | 26.24 | 73.95 | 65.29 |
| Interval Valued Classification | 79.19 | 24.08 | 76.67 | 79.03 |
| Particle Swarm Optimization | 83.27 | 21.03 | 79.83 | 80.02 |
| Interval Value based Particle Swarm Optimization | 89.24 | 19.42 | 82.12 | 94.68 |
Fig. 4Performance comparison of existing and proposed methods for the lung cancer dataset.
Performance comparison of existing and proposed methods for the blood cancer dataset.
| Algorithms/performance metrics | TP rate | FP rate | Precision | Accuracy |
|---|---|---|---|---|
| Support Vector Machine | 66 | 22.2 | 70 | 66.66 |
| K Nearest Neighbor | 72 | 25 | 72.3 | 72.82 |
| Interval Valued Classification | 56.25 | 43.75 | 81.2 | 78.26 |
| Particle Swarm Optimization | 78.6 | 22.1 | 79.2 | 80.26 |
| Interval Value based Particle Swarm Optimization | 81.26 | 18.19 | 83.6 | 90.86 |
Fig. 5Performance comparison of existing and proposed methods for the blood cancer dataset.
Fig. 1Comparison of accuracy on the leukemia, breast cancer, lung cancer and blood cancer datasets.