| Literature DB >> 23433084 |
Abstract
BACKGROUND: PAM, a nearest shrunken centroid method (NSC), is a popular classification method for high-dimensional data. ALP and AHP are NSC algorithms that were proposed to improve upon PAM. The NSC methods base their classification rules on shrunken centroids; in practice the amount of shrinkage is estimated minimizing the overall cross-validated (CV) error rate.Entities:
Mesh:
Year: 2013 PMID: 23433084 PMCID: PMC3687811 DOI: 10.1186/1471-2105-14-64
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Probability of classification of a new sample in the minority class and the classification error as a function of the number of variables. The figure shows the probability of classification of a new sample in the minority class (left panel) and the classification error (right panel) as a function of the number of variables for the example presented in the main text.
Figure 2Classification results under the alternative hypothesis for the NSC and GM-NSC classifiers. The figure shows class specific predictive accuracies (PA1 and PA2) for different levels of class-imbalance (k1) in the training set. The differences between the classes were small (upper panel: μ2=0.5) or moderate (lower panel: μ2=1). See text for details.
Performance of the classifiers under the alternative hypothesis with large class-imbalance () and moderate differences between classes ()
| PAM | 0.05 | 99.94 | 9184.1 [92.77] | 0.99 | 0.95 | 0.11 | 0.31 | 0.6 |
| | (0.08) | (0.87) | (1136.28) | (0.00) | (0.02) | (0.05) | (0.07) | (0.04) |
| GM-PAM | 1.29 | 58.81 | 602.1 [6.08] | 0.62 | 0.63 | 0.64 | 0.62 | 0.69 |
| | (0.51) | (43.6) | (1004.73) | (0.38) | (0.1) | (0.16) | (0.08) | (0.1) |
| ALP | 0.07 | 99.96 | 9004.2 [90.95] | 0.99 | 0.95 | 0.11 | 0.3 | 0.61 |
| | (0.19) | (0.54) | (2232.18) | (0.01) | (0.03) | (0.07) | (0.08) | (0.04) |
| GM-ALP | 3.76 | 63.08 | 408.8 [4.13] | 0.59 | 0.67 | 0.61 | 0.63 | 0.68 |
| | (2.21) | (41.09) | (816.3) | (0.36) | (0.1) | (0.17) | (0.09) | (0.09) |
| AHP | 0.36 | 96.89 | 6438.9 [65.04] | 0.95 | 0.94 | 0.14 | 0.34 | 0.62 |
| | (1.57) | (12.84) | (4235.52) | (0.11) | (0.04) | (0.1) | (0.1) | (0.05) |
| GM-AHP | 5.29 | 37.45 | 266.5 [2.69] | 0.42 | 0.78 | 0.5 | 0.6 | 0.69 |
| (3.94) | (37.96) | (1275.07) | (0.4) | (0.08) | (0.18) | (0.12) | (0.1) |
The table reports the estimated optimal threshold (λ∗), the number [%] of active non-informative variables (# non-info [%], selected out of 9,900 non-informative variables) and the number (also equal to %) of active informative variables (#, % info, selected out of 100 informative variables, equal to 100(1-false negative rate)), false discovery rate (FDR, # non-info/(# active)), class specific predictive accuracies, g-means and AUC, averaged over 500 repetitions; standard deviations are reported in brackets. The simulation settings are the same as in Figure 2. a For AHP and GM-AHP only λ was optimized while λ was set to zero.
Multi-class classification results for the class-imbalanced scenario in the alternative case
| PAM | 6.6 | 59.23 | 0 [0.00] | 0 | 0.86 | 0.04 | 0.86 | 0.29 |
| | (0.58) | (33.68) | (0) | (0.00) | (0.02) | (0.02) | (0.02) | (0.05) |
| GM-PAM | 1.4 | 99.64 | 834.56 [17.03] | 0.5 | 0.75 | 0.31 | 0.75 | 0.55 |
| | (0.96) | (5.75) | (1262.68) | (0.41) | (0.04) | (0.06) | (0.04) | (0.04) |
| ALP | 58.76 | 99.98 | 49 [1.00] | 0.01 | 0.82 | 0.15 | 0.82 | 0.46 |
| | (15.12) | (0.14) | (490) | (0.1) | (0.04) | (0.05) | (0.03) | (0.04) |
| GM-ALP | 12.77 | 99.64 | 415.68 [8.48] | 0.19 | 0.74 | 0.34 | 0.74 | 0.57 |
| | (12.88) | (3.6) | (1330.05) | (0.31) | (0.05) | (0.08) | (0.05) | (0.05) |
| AHP | 59.05 | 99.99 | 98 [2.00] | 0.02 | 0.82 | 0.15 | 0.82 | 0.45 |
| | (15.74) | (0.1) | (689.46) | (0.14) | (0.04) | (0.05) | (0.04) | (0.05) |
| GM-AHP | 13.55 | 100 | 316.73 [6.46] | 0.17 | 0.74 | 0.34 | 0.74 | 0.57 |
| (13.05) | (0) | (1164.99) | (0.28) | (0.05) | (0.08) | (0.05) | (0.04) |
The table reports the same information as Table 1; # non-info [%] was selected out of 4,900 non-informative variables and #, % info was selected out of 100 informative variables; see text for details.
aFor AHP and GM-AHP only λ was optimized while λ was set to zero.
Gene expression breast cancer data sets
| Ivshina | 22,283 | ER- or ER+ | 34 | 211 | | 0.14 |
| | | Grade 1, 2 or 3 | 68 | 166 | 55 | 0.19 |
| | | Grade 1, 2 or 3 | 40 | 10 to 80 | 40 | 0.25 to 0.50 |
| Wang | 22,283 | Relapse or not | 179 | 107 | | 0.37 |
| Korkola | 9,524 | Good or bad prognosis | 34 | 21 | | 0.38 |
| Sotiriou | 7,650 | ER+ or ER- | 10 to 50 | 10 | | 0.50 to 0.17 |
| Grade 1-2 or 3 | 10 to 40 | 10 | 0.50 to 0.20 |
a is a proportion of the minority class samples in the training set.
Performance of the classifiers on real gene expression data sets for the two class classification tasks
| Ivshina | PAM | 0 | 22283 | 0.84 | 0.79 | 0.85 | 0.82 | 0.85 |
| (ER) | GM-PAM | 4.83 | 51 | 0.82 | 0.91 | 0.81 | 0.86 | 0.90 |
| | ALP | 0 | 22283 | 0.84 | 0.82 | 0.84 | 0.83 | 0.86 |
| | GM-ALP | 58.24 | 26 | 0.85 | 0.91 | 0.84 | 0.87 | 0.88 |
| | AHP | 185.56 | 20 | 0.89 | 0.88 | 0.89 | 0.88 | 0.90 |
| | GM-AHP | 69.58 | 115 | 0.89 | 0.88 | 0.89 | 0.89 | 0.91 |
| Wang | PAM | 3.71 | 14 | 0.61 | 0.60 | 0.62 | 0.61 | 0.62 |
| | GM-PAM | 3.71 | 14 | 0.60 | 0.60 | 0.62 | 0.61 | 0.63 |
| | ALP | 8.26 | 654 | 0.56 | 0.57 | 0.55 | 0.56 | 0.63 |
| | GM-ALP | 8.26 | 654 | 0.56 | 0.56 | 0.56 | 0.56 | 0.63 |
| | AHP | 21.95 | 135 | 0.56 | 0.56 | 0.56 | 0.56 | 0.65 |
| | GM-AHP | 21.95 | 135 | 0.56 | 0.56 | 0.55 | 0.56 | 0.63 |
| Korkola | PAM | 0.19 | 7073 | 0.65 | 0.71 | 0.57 | 0.64 | 0.64 |
| | GM-PAM | 0.19 | 7073 | 0.65 | 0.71 | 0.57 | 0.64 | 0.64 |
| | ALP | 4.87 | 155 | 0.64 | 0.65 | 0.62 | 0.63 | 0.64 |
| | GM-ALP | 4.87 | 155 | 0.69 | 0.74 | 0.62 | 0.67 | 0.69 |
| | AHP | 0.76 | 1308 | 0.58 | 0.68 | 0.43 | 0.54 | 0.58 |
| GM-AHP | 0.76 | 1308 | 0.62 | 0.71 | 0.48 | 0.58 | 0.60 |
The table reports the same information as Table 1; # genes is the number of active genes. Optimal thresholds were estimated with 5-fold CV and the accuracy measures with LOOCV; see text for details.
aFor AHP and GM-AHP only λ was optimized while λ was set to zero.
Figure 3Classification results on the Sotiriou data set. The figure shows PA for ER+ class (PAER+) and PA for ER- class (PAER-) for different number of ER+ samples in the training set (nER+). There were 10 ER- samples in each training set. See text for more details.
Results on the Ivshina data set for classification of Grade of the tumor
| 10 | PAM | 4.25 | 963 | 0.29 | 0.90 | 0.12 | 0.89 | 0.45 |
| | GM-PAM | 2.76 | 5768 | 0.34 | 0.86 | 0.20 | 0.88 | 0.51 |
| | ALP | 10.26 | 7425 | 0.33 | 0.88 | 0.18 | 0.88 | 0.51 |
| | GM-ALP | 28.39 | 3295 | 0.37 | 0.70 | 0.26 | 0.88 | 0.50 |
| | AHP | 25.61 | 9508 | 0.39 | 0.83 | 0.27 | 0.85 | 0.57 |
| | GM-AHP | 76.40 | 640 | 0.45 | 0.75 | 0.36 | 0.86 | 0.61 |
| 20 | PAM | 3.14 | 3023 | 0.34 | 0.89 | 0.18 | 0.88 | 0.51 |
| | GM-PAM | 1.27 | 12299 | 0.42 | 0.83 | 0.30 | 0.85 | 0.59 |
| | ALP | 11.20 | 5541 | 0.40 | 0.85 | 0.26 | 0.88 | 0.57 |
| | GM-ALP | 10.67 | 9991 | 0.42 | 0.81 | 0.30 | 0.86 | 0.58 |
| | AHP | 30.66 | 7733 | 0.47 | 0.78 | 0.37 | 0.83 | 0.62 |
| | GM-AHP | 48.52 | 3811 | 0.48 | 0.76 | 0.39 | 0.83 | 0.62 |
| 40 | PAM | 1.55 | 9832 | 0.46 | 0.86 | 0.32 | 0.87 | 0.61 |
| | GM-PAM | 0.49 | 17439 | 0.52 | 0.83 | 0.41 | 0.84 | 0.65 |
| | ALP | 12.43 | 6885 | 0.45 | 0.82 | 0.31 | 0.87 | 0.59 |
| | GM-ALP | 2.00 | 16919 | 0.51 | 0.80 | 0.41 | 0.83 | 0.64 |
| | AHP | 37.43 | 6366 | 0.52 | 0.73 | 0.44 | 0.83 | 0.64 |
| | GM-AHP | 32.21 | 7148 | 0.53 | 0.74 | 0.44 | 0.82 | 0.64 |
| 80 | PAM | 0.28 | 19617 | 0.58 | 0.77 | 0.48 | 0.83 | 0.67 |
| | GM-PAM | 0.32 | 19311 | 0.58 | 0.78 | 0.47 | 0.83 | 0.67 |
| | ALP | 0.68 | 19149 | 0.59 | 0.75 | 0.50 | 0.82 | 0.67 |
| | GM-ALP | 0.76 | 18727 | 0.59 | 0.75 | 0.50 | 0.82 | 0.67 |
| | AHP | 15.20 | 11231 | 0.58 | 0.73 | 0.48 | 0.82 | 0.66 |
| GM-AHP | 14.69 | 10928 | 0.57 | 0.73 | 0.48 | 0.82 | 0.66 |
The table reports the same information as Table 4. There were 40 Grade 1 and Grade 3 samples and the number of Grade 2 samples varied; see text for more details.
aFor AHP and GM-AHP only λ was optimized while λ was set to zero.