| Literature DB >> 22759422 |
Pingzhao Hu1, Shelley B Bull, Hui Jiang.
Abstract
BACKGROUND: Molecular predictor is a new tool for disease diagnosis, which uses gene expression to classify diagnostic category of a patient. The statistical challenge for constructing such a predictor is that there are thousands of genes to predict for the disease categories, but only a small number of samples are available.Entities:
Mesh:
Year: 2012 PMID: 22759422 PMCID: PMC3314572 DOI: 10.1186/1471-2105-13-S10-S17
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Descriptive characteristics of data sets used for classification
| Disease | Response type | No. Samples | No. genes/features | Reference |
|---|---|---|---|---|
| Colon cancer | Tumor/Normal | 40 / 22 | 2000 | [ |
| Prostate cancer | Tumor/Normal | 50 / 38 | 12635 | [ |
| Lung cancer | Tumor/Normal | 60 / 69 | 22215 | [ |
Mean error rates of classification methods applied to colon cancer data set
| No. genes | DQDA | DLDA | 1NN | Tree | SVM | MPCLR | MLDA |
|---|---|---|---|---|---|---|---|
| 5 | 0.113 | 0.113 | 0.210 | 0.226 | 0.113 | ||
| 10 | 0.177 | 0.177 | 0.161 | 0.290 | 0.129 | 0.129 | |
| 15 | 0.129 | 0.129 | 0.242 | 0.145 | |||
| 20 | 0.145 | 0.129 | 0.161 | 0.258 | 0.129 | 0.129 | |
| 30 | 0.145 | 0.129 | 0.161 | 0.194 | 0.145 | 0.129 | |
| 40 | 0.145 | 0.145 | 0.210 | 0.145 | |||
| 50 | 0.145 | 0.145 | 0.194 | 0.226 | 0.145 | 0.145 |
Mean error rates of classification methods applied to prostate cancer data set
| No. genes | DQDA | DLDA | 1NN | Tree | SVM | MPCLR | MLDA |
|---|---|---|---|---|---|---|---|
| 5 | 0.227 | 0.239 | 0.261 | 0.227 | 0.216 | 0.216 | |
| 10 | 0.205 | 0.193 | 0.284 | 0.318 | 0.193 | 0.182 | |
| 15 | 0.250 | 0.261 | 0.295 | 0.261 | 0.239 | ||
| 20 | 0.216 | 0.227 | 0.250 | 0.273 | 0.205 | 0.205 | |
| 30 | 0.216 | 0.239 | 0.295 | 0.216 | 0.216 | ||
| 40 | 0.261 | 0.250 | 0.295 | 0.318 | 0.250 | 0.261 | |
| 50 | 0.227 | 0.227 | 0.341 | 0.330 | 0.216 | 0.250 |
Mean error rates of classification methods applied to lung cancer data set
| No. genes | DQDA | DLDA | 1NN | Tree | SVM | MPCLR | MLDA |
|---|---|---|---|---|---|---|---|
| 5 | 0.170 | 0.170 | 0.186 | 0.201 | 0.170 | ||
| 10 | 0.170 | 0.186 | 0.193 | 0.170 | |||
| 15 | 0.162 | 0.162 | 0.201 | 0.178 | 0.155 | 0.147 | |
| 20 | 0.147 | 0.162 | 0.170 | 0.193 | 0.178 | 0.155 | |
| 30 | 0.132 | 0.125 | 0.132 | 0.193 | 0.147 | 0.125 | |
| 40 | 0.178 | 0.147 | 0.162 | 0.186 | |||
| 50 | 0.147 | 0.178 | 0.147 |
Mean error rates of classification methods applied to lung cancer data set
| No. genes | DQDA | DLDA | 1NN | Tree | SVM | MPCLR | MLDA |
|---|---|---|---|---|---|---|---|
| 5 | 0.178 | 0.170 | 0.193 | 0.225 | 0.178 | 0.170 | |
| 10 | 0.170 | 0.170 | 0.209 | 0.193 | 0.178 | 0.155 | |
| 15 | 0.186 | 0.147 | 0.201 | 0.225 | 0.146 | 0.132 | |
| 20 | 0.147 | 0.162 | 0.186 | 0.178 | 0.186 | 0.155 | |
| 30 | 0.147 | 0.178 | 0.132 | 0.193 | |||
| 40 | 0.178 | 0.178 | 0.186 | ||||
| 50 | 0.162 | 0.162 | 0.186 | 0.155 | 0.147 |