Wei Du, Zhongbo Cao, Tianci Song, Ying Li, Yanchun Liang.
Abstract
BACKGROUND: With the development of high-throughput technology, researchers can acquire large amounts of expression data of different types from several public databases. Because most of these data sets have a small number of samples and hundreds or thousands of features, extracting informative features from expression data effectively and robustly with feature selection techniques is both challenging and crucial. So far, many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods have been evaluated on only a single type of expression data, using classification accuracy or error rate.
Year: 2017 PMID: 28184251 PMCID: PMC5288949 DOI: 10.1186/s13040-017-0124-x
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
The detailed information of mRNA microarray datasets
| Cancer Type | Dataset IDs | Number of Samples |
|---|---|---|
| Liver | GSE5364, GSE22058, GSE14520, GSE12941 | 132 |
| Pancreatic | GSE15471, GSE16515, GSE22780 | 63 |
| Lung | GSE5364, GSE19804, GSE22058, GSE10072, GSE7670, GSE2514 | 249 |
| Colon | GSE5364, GSE8671, GSE25070, GSE21510, GSE23878, GSE18105 | 70 |
| Gastric | GSE13911, GSE13195, GSE5081, GSE19826 | 93 |
| Breast | GSE5364, GSE15852, GSE10810, GSE16873, GSE5764, GSE14548 | 113 |
| Thyroid | GSE5364, GSE3678 | 23 |
| Prostate | GSE6919, GSE6956, GSE17951 | 88 |
The detailed information of mRNA Sequencing and miRNA Sequencing datasets
| Cancer Type | Number of Samples |
|---|---|
| KIDNEY¹ | 88 |
| BRCA | 71 |
| LUNG² | 47 |
| HNSC | 37 |
| LIHC | 46 |
| PRAD | 43 |
| STAD | 29 |
| THCA | 56 |
1: KIDNEY contains KIRC and KIRP
2: LUNG contains LUSC and LUAD
The results of mean effectiveness on mRNA microarray (top 10)
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | OSFS | FGM | SMKL-FS |
|---|---|---|---|---|---|---|---|---|
| Liver | 0.913 | 0.860 | | 0.825 | 0.831 | 0.750 | 0.867 | 0.963 |
| Pancreatic | 0.689 | 0.777 | | 0.784 | 0.673 | 0.707 | 0.729 | 0.804 |
| Lung | 0.731 | 0.786 | 0.942 | 0.814 | 0.708 | 0.704 | 0.860 | |
| Gastric | 0.614 | 0.724 | 0.688 | 0.566 | 0.636 | 0.533 | 0.640 | |
| Colon | 0.736 | 0.888 | 0.941 | 0.803 | 0.794 | 0.682 | 0.812 | |
| Breast | 0.745 | 0.776 | 0.832 | 0.545 | 0.693 | 0.728 | 0.769 | |
| Thyroid | 0.835 | 0.897 | 0.838 | 0.633 | 0.743 | 0.517 | 0.802 | |
| Prostate | 0.577 | | 0.750 | 0.560 | 0.682 | 0.629 | 0.679 | 0.717 |
| Mean | 0.730 | 0.809 | 0.847 | 0.691 | 0.720 | 0.656 | 0.770 | |
The results of mean effectiveness on mRNA Sequencing (top 10)
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | OSFS | FGM | SMKL-FS |
|---|---|---|---|---|---|---|---|---|
| KIDNEY | 0.912 | 0.952 | | 0.949 | 0.898 | 0.914 | 0.951 | 0.957 |
| BRCA | 0.938 | 0.982 | 0.973 | 0.953 | 0.871 | 0.934 | 0.928 | |
| LUNG | 0.957 | 0.977 | 0.993 | 0.932 | 0.942 | 0.867 | 0.931 | |
| HNSC | 0.930 | 0.949 | | 0.908 | 0.844 | 0.900 | 0.977 | 0.948 |
| LIHC | 0.893 | 0.937 | | 0.919 | 0.900 | 0.798 | 0.952 | 0.958 |
| PRAD | 0.932 | 0.928 | | 0.893 | 0.779 | 0.764 | 0.966 | 0.953 |
| STAD | 0.907 | 0.895 | | 0.945 | 0.758 | 0.848 | 0.898 | 0.963 |
| THCA | 0.945 | 0.954 | | 0.933 | 0.883 | 0.844 | 0.903 | 0.970 |
| Mean | 0.927 | 0.947 | | 0.929 | 0.859 | 0.859 | 0.938 | 0.966 |
The results of mean effectiveness on miRNA Sequencing (top 10)
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | OSFS | FGM | SMKL-FS |
|---|---|---|---|---|---|---|---|---|
| KIDNEY | 0.922 | 0.832 | 0.987 | 0.901 | 0.896 | 0.893 | 0.916 | |
| BRCA | 0.839 | 0.963 | 0.979 | 0.817 | 0.973 | 0.893 | 0.953 | |
| LUNG | 0.891 | 0.946 | 0.979 | 0.953 | 0.831 | 0.945 | 0.946 | |
| HNSC | 0.979 | 0.955 | 0.991 | 0.879 | 0.874 | 0.920 | 0.874 | |
| LIHC | 0.906 | 0.836 | 0.911 | 0.813 | 0.871 | 0.789 | | 0.917 |
| PRAD | 0.897 | 0.933 | 0.930 | 0.892 | 0.905 | 0.794 | 0.836 | |
| STAD | 0.855 | 0.870 | 0.853 | 0.790 | 0.823 | 0.760 | 0.827 | |
| THCA | 0.925 | 0.901 | | 0.842 | 0.876 | 0.878 | 0.928 | 0.967 |
| Mean | 0.902 | 0.904 | 0.950 | 0.861 | 0.881 | 0.859 | 0.901 | |
Fig. 1 The results of independent stability on different mRNA microarray datasets
Fig. 2 The results of independent stability on different mRNA Sequencing datasets
Fig. 3 The results of independent stability on different miRNA Sequencing datasets
The results of similarity on mRNA microarray
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | SMKL-FS |
|---|---|---|---|---|---|---|
| Liver | 6.33 | 1.17 | | 1.33 | 1.00 | 15.17 |
| Pancreatic | 4.67 | 0.83 | 11.17 | 1.83 | 3.00 | |
| Lung | 3.83 | 21.83 | 20.67 | 0.17 | 2.17 | |
| Colon | 7.17 | 0.67 | 19.17 | 0.67 | 2.67 | |
| Gastric | 3.83 | 0.83 | 16.00 | 0.50 | 3.50 | |
| Breast | 9.83 | 32.83 | 31.83 | 0.00 | 1.67 | |
| Thyroid | 10.83 | 29.00 | 20.17 | 0.00 | 1.67 | |
| Prostate | 5.50 | 27.50 | 20.00 | 0.50 | 1.17 | |
| Mean | 6.50 | 14.33 | 19.35 | 0.63 | 2.10 | |
The results of similarity on mRNA Sequencing
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | SMKL-FS |
|---|---|---|---|---|---|---|
| KIDNEY | 1.33 | 8.00 | 11.00 | 2.83 | 1.67 | |
| BRCA | 5.67 | 16.83 | 14.83 | 3.67 | 0.83 | |
| LUNG | 6.50 | 23.17 | 11.50 | 2.83 | 0.67 | |
| HNSC | 1.17 | | 11.67 | 2.50 | 1.17 | 23.00 |
| LIHC | 9.50 | 26.67 | 17.50 | 1.33 | 2.33 | |
| PRAD | 9.83 | 26.67 | 19.17 | 3.33 | 0.83 | |
| STAD | 7.83 | | 15.17 | 16.67 | 0.33 | 29.50 |
| THCA | 5.17 | 14.33 | 12.50 | 4.83 | 0.50 | |
| Mean | 5.88 | 21.19 | 14.17 | 4.75 | 1.04 | |
The results of similarity on miRNA Sequencing
| Methods | SVM-RFE | SVM-RCE | mRMR | IMRelief | SlimPLS | SMKL-FS |
|---|---|---|---|---|---|---|
| KIDNEY | 43.00 | 33.00 | 48.50 | 29.17 | 28.00 | |
| BRCA | 39.67 | 39.33 | 50.83 | 25.83 | 33.00 | |
| LUNG | 41.50 | 38.83 | 50.17 | 29.50 | 21.67 | |
| HNSC | 42.17 | 38.83 | 50.50 | 32.50 | 22.50 | |
| LIHC | 42.33 | 35.50 | 46.50 | 24.67 | 25.17 | |
| PRAD | 42.33 | 40.33 | 53.17 | 27.00 | 30.83 | |
| STAD | 43.50 | 35.33 | 48.83 | 28.67 | 20.67 | |
| THCA | 37.33 | 37.50 | 47.50 | 26.50 | 25.50 | |
| Mean | 41.48 | 37.33 | 49.50 | 27.98 | 25.92 | |
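The Mean row of the miRNA Sequencing similarity table appears to be the plain arithmetic mean of each method's values over the eight cancer types. The following minimal sketch checks that reading against the numbers transcribed from the table. The names `similarity`, `reported_mean`, `column_means`, and `matches_reported` are illustrative and not taken from the paper, and the SMKL-FS column is left out because its values are not present in the table.

```python
# Values transcribed from the miRNA Sequencing similarity table, in row order:
# KIDNEY, BRCA, LUNG, HNSC, LIHC, PRAD, STAD, THCA.
# SMKL-FS is omitted: its cells are not available in the table.
similarity = {
    "SVM-RFE":  [43.00, 39.67, 41.50, 42.17, 42.33, 42.33, 43.50, 37.33],
    "SVM-RCE":  [33.00, 39.33, 38.83, 38.83, 35.50, 40.33, 35.33, 37.50],
    "mRMR":     [48.50, 50.83, 50.17, 50.50, 46.50, 53.17, 48.83, 47.50],
    "IMRelief": [29.17, 25.83, 29.50, 32.50, 24.67, 27.00, 28.67, 26.50],
    "SlimPLS":  [28.00, 33.00, 21.67, 22.50, 25.17, 30.83, 20.67, 25.50],
}

# The "Mean" row as printed in the table.
reported_mean = {
    "SVM-RFE": 41.48,
    "SVM-RCE": 37.33,
    "mRMR": 49.50,
    "IMRelief": 27.98,
    "SlimPLS": 25.92,
}


def column_means(table):
    """Arithmetic mean of each method's values across cancer types."""
    return {method: sum(values) / len(values) for method, values in table.items()}


def matches_reported(computed, reported, tol=0.005):
    """True per method when the computed mean agrees with the reported one
    to within rounding at two decimal places (assumed tolerance)."""
    return {m: abs(computed[m] - reported[m]) <= tol for m in reported}


if __name__ == "__main__":
    means = column_means(similarity)
    agreement = matches_reported(means, reported_mean)
    for method in similarity:
        print(f"{method:9s} computed={means[method]:.2f} "
              f"reported={reported_mean[method]:.2f} match={agreement[method]}")
```

The same check can be applied to the other tables by swapping in their columns, skipping any method that has an empty cell.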