| Literature DB >> 23966761 |
Lingkang Huang1, Hao Helen Zhang, Zhao-Bang Zeng, Pierre R Bushel.
Abstract
BACKGROUND: Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity.Entities:
Keywords: cancer classification; classification; microarray; multi-class SVM; shrinkage methods; support vector machine (SVM); variable selection
Year: 2013 PMID: 23966761 PMCID: PMC3740816 DOI: 10.4137/CIN.S10212
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Selection frequency of individual variables over 100 runs for the linear example.
| Method | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| L2 MSVM (C&S) | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| L1 MSVM | 100 | 100 | 13 | 4 | 26 | 21 | 31 | 28 | 29 | 20 | 22 | 22 | 28 | 24 | 22 | 20 | 25 | 32 | 26 | 23 |
| Sup MSVM | 100 | 100 | 10 | 5 | 15 | 16 | 14 | 14 | 21 | 13 | 13 | 16 | 13 | 16 | 9 | 9 | 14 | 18 | 16 | 17 |
| Adapt-L1 MSVM | 100 | 100 | 11 | 13 | 14 | 8 | 12 | 13 | 14 | 13 | 11 | 10 | 16 | 8 | 9 | 12 | 11 | 11 | 11 | 11 |
| Adapt-Sup MSVM | 100 | 100 | 6 | 4 | 7 | 8 | 8 | 6 | 9 | 10 | 9 | 4 | 9 | 5 | 6 | 5 | 6 | 8 | 7 | 8 |
Average test error and model size for the linear example.
| Method | Test error (SE) | Selected variables | Model size | |
|---|---|---|---|---|
|
| ||||
| Important 2 var. | Noise or informative but redundant 18 var. | |||
| L2 MSVM (C&S) | 0.296 (1.4 × 10−3) | 2 | 18 | 20 |
| L1 MSVM | 0.263 (1.4 × 10−3) | 2 | 4.16 | 6.16 |
| Adapt-L1 MSVM | 0.262 (1.0 × 10−3) | 2 | 2.08 | 4.08 |
| Sup MSVM | 0.258 (1.2 × 10−3) | 2 | 2.49 | 4.49 |
| Adapt-Sup MSVM | 0.255 (6.1 × 10−4) | 2 | 1.25 | 3.25 |
| Bayes | 0.246 (−) | 2 | 0 | 2 |
Average test error and model size for the nonlinear example.
| Method | Test error (SE) | Selected variables | Model size | |
|---|---|---|---|---|
|
| ||||
| Important 3 var. | Noise 227 var. | |||
| L2 MSVM (C&S) | 0.441 (2.1 × 10−3) | 3 | 218.87 | 221.87 |
| L1/Sup MSVM | 0.160 (2.4 × 10−3) | 3 | 18.34 | 21.34 |
| Adapt-L1 MSVM | 0.152 (2.1 × 10−3) | 2.98 | 6.08 | 9.06 |
| Adapt-Sup MSVM | 0.147 (1.9 × 10−3) | 3 | 5.58 | 8.58 |
| Bayes | 0.120 (−) | 3 | 0 | 3 |
The variable selection frequencies of adaptive sup-norm MSVM over 100 runs for the nonlinear example.
| List | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 100 | 1 | 0 | 2 | 0 | 3 | 0 | 1 | 0 | 0 | 1 | 0 | 2 | 2 | 2 | 0 | 1 | 0 | 0 | 1 | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | ||
| 9 | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | ||
| 0 | 5 | 6 | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| 1 | 4 | 3 | 8 | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| 4 | 3 | 3 | 1 | 4 | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| 1 | 1 | 1 | 2 | 1 | 7 | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| 3 | 1 | 1 | 4 | 2 | 3 | 5 | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| 3 | 4 | 5 | 2 | 2 | 0 | 0 | 5 | . | . | . | . | . | . | . | . | . | . | . | . | |
| 2 | 6 | 2 | 2 | 1 | 2 | 2 | 2 | 6 | . | . | . | . | . | . | . | . | . | . | . | |
| 1 | 4 | 4 | 2 | 4 | 2 | 1 | 3 | 2 | 7 | . | . | . | . | . | . | . | . | . | . | |
| 1 | 8 | 1 | 4 | 2 | 2 | 1 | 4 | 1 | 1 | 7 | . | . | . | . | . | . | . | . | . | |
| 2 | 3 | 3 | 1 | 3 | 0 | 4 | 2 | 2 | 2 | 1 | 9 | . | . | . | . | . | . | . | . | |
| 2 | 5 | 2 | 0 | 3 | 2 | 2 | 4 | 2 | 3 | 0 | 4 | 9 | . | . | . | . | . | . | . | |
| 2 | 3 | 3 | 2 | 1 | 5 | 1 | 4 | 1 | 4 | 3 | 3 | 3 | 5 | . | . | . | . | . | . | |
| 2 | 2 | 3 | 0 | 6 | 3 | 1 | 4 | 3 | 3 | 3 | 1 | 2 | 1 | 5 | . | . | . | . | . | |
| 0 | 6 | 0 | 5 | 4 | 2 | 0 | 4 | 2 | 4 | 0 | 2 | 4 | 0 | 3 | 4 | . | . | . | . | |
| 1 | 5 | 0 | 2 | 1 | 4 | 2 | 2 | 2 | 1 | 3 | 1 | 1 | 1 | 3 | 2 | 4 | . | . | . | |
| 0 | 4 | 3 | 2 | 1 | 0 | 2 | 3 | 2 | 2 | 4 | 3 | 3 | 1 | 2 | 0 | 2 | 4 | . | . | |
| 0 | 4 | 3 | 4 | 6 | 3 | 3 | 0 | 1 | 3 | 1 | 2 | 2 | 3 | 0 | 1 | 3 | 2 | 9 | . | |
| 1 | 3 | 2 | 4 | 3 | 1 | 3 | 3 | 3 | 0 | 0 | 0 | 3 | 1 | 2 | 0 | 2 | 1 | 1 | 6 |
Classification and selection results for the leukemia study.
| Method | Test error | No. of genes |
|---|---|---|
| L2 MSVM (C&S) | 1/34 | 429 |
| L1/Sup MSVM | 2/34 | 14 |
| Adapt-L1 MSVM | 3/34 | 9 |
| Adapt-Sup MSVM | 3/34 | 4 |
Selected genes by various methods for the leukemia study.
| Probe set ID | Adapt-sup | Adapt-L1 | L1/Sup | Rank of F-test | Gene name/description |
|---|---|---|---|---|---|
| X00437_s_at | 1 | 1 | 1 | 1 | TCRB (T-cell receptor, beta cluster) |
| X76223_s_at | 1 | 1 | 1 | 3 | MAL gene |
| M27891_at | 1 | 1 | 1 | 12 | CST3 (Cystatin C) |
| X82240_rna1_at | 1 | 1 | 1 | 19 | TCL1 (T-cell leukemia/lymphoma) |
| X59871_at | – | 1 | 1 | 8 | TCF7 (Transcription factor 7; T-cell specific) |
| M11722_at | – | 1 | 1 | 157 | Terminal transferase mRNA |
| U89922_s_at | – | 1 | 1 | 324 | LTB (Lymphotoxin-beta) |
| Z14982_rna1_at | – | 1 | 1 | 527 | MHC-encoded proteasome subunit gene LAMP 7-E1 gene |
| M21624_at | – | 1 | – | 462 | TCRD (T-cell receptor, delta) |
| U05259_rna1_at | – | – | 1 | 10 | MB-1 gene |
| X58529_at | – | – | 1 | 27 | IGHM Immunoglobulin mu |
| M74719_at | – | – | 1 | 46 | SEF2-1A mRNA, 5′ end |
| Y00787_s_at | – | – | 1 | 58 | Interleukin-8 precursor |
| M19507_at | – | – | 1 | 112 | MPO (Myeloperoxidase) |
| U01317_cds4_at | – | – | 1 | 390 | Delta-globin gene |
Figure 1Hierarchical clustering of all training and test samples based on 4 selected genes in leukemia study.
Notes: All samples, including 38 training and 34 test samples, are plotted. Each column represents a sample from one of classes: AML, ALL_T or ALL_B. Each row represents the expression profile of a gene (labeled as a probe set ID) across all samples. The color scale ranges from green for an expression value less than the mean to red for an expression value greater than the mean. The hierarchical clustering result is generated using the public software Cluster ( http://rana.lbl.gov/EisenSoftware.htm ) and viewed by the Java TreeView ( http://jtreeview.sourceforge.net/ ). The hierarchical clustering used Pearson correlation for gene similarity metric and average-linkage algorithm for clustering correlation matrixes.
Classification and selection results for the SRBCT study.
| Method | Test error | Selected genes | |
|---|---|---|---|
|
| |||
| Top 333 genes | Bottom 300 genes | ||
| L2 MSVM (C&S) | 0/20 | 194 | 124 |
| L1 MSVM | 1/20 | 31 | 0 |
| Sup MSVM | 0/20 | 36 | 0 |
| Adapt-L1 MSVM | 0/20 | 31 | 0 |
| Adapt-Sup MSVM | 0/20 | 28 | 0 |