| Literature DB >> 18578891 |
Koji Kadota1, Yuji Nakai, Kentaro Shimizu.
Abstract
BACKGROUND: Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the dataset, and so on. In addition, when using the Affymetrix GeneChip(R) system, researchers must select a preprocessing algorithm from a number of competing algorithms such as MAS, RMA, and DFW, for obtaining expression-level measurements. To achieve optimal performance for detecting DEGs, a suitable combination of gene selection method and preprocessing algorithm needs to be selected for a given probe-level dataset.Entities:
Year: 2008 PMID: 18578891 PMCID: PMC2464587 DOI: 10.1186/1748-7188-3-8
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
AUC (percent) values for Datasets 1 and 2 for eight methods
| MAS | RMA | DFW | ||||
| Method | Dataset 1 | Dataset 2 | Dataset 1 | Dataset 2 | Dataset 1 | Dataset 2 |
| WAD | 98.240(4) | |||||
| AD | 83.381(6) | 96.430(8) | 99.897(6) | 98.631(2) | 99.948(2) | |
| FC | 83.092(7) | 96.445(7) | 99.655(8) | 98.617(3) | 99.948(2) | |
| RP | 81.981(8) | 96.626(6) | 99.757(7) | 99.993(4) | 99.938(3) | |
| modT | 93.257(4) | 97.561(4) | 99.928(5) | 98.109(7) | 99.983(7) | 98.459(6) |
| samT | 94.002(3) | 97.547(5) | 99.944(3) | 98.139(6) | 99.988(5) | 98.656(4) |
| shrinkT | 92.379(5) | 97.617(3) | 99.955(2) | 97.846(8) | 99.984(6) | 98.558(5) |
| ibmT | 94.693(2) | 97.618(2) | 99.941(4) | 98.183(5) | 99.983(7) | 98.455(7) |
Numbers in parentheses show the rankings. Signal intensities smaller than 1 in the MAS-preprocessed data were set to 1 so that the logarithm of the data could be taken.
AUC (percent) values for Dataset 1 (MAS) for different signal intensity thresholds
| Signal intensity threshold | ||||
| Method | 1 | 5 | 10 | 15 |
| WAD | ||||
| AD | 83.381(6) | 89.215(6) | 92.996(7) | 94.915(6) |
| FC | 83.092(7) | 88.353(8) | 92.381(8) | 94.455(7) |
| RP | 81.981(8) | 88.516(7) | 93.131(6) | 95.456(5) |
| modT | 93.257(4) | 94.776(2) | 95.284(3) | 95.977(4) |
| samT | 94.002(3) | 94.731(4) | 95.074(5) | 96.028(3) |
| shrinkT | 92.379(5) | 94.114(5) | 95.537(2) | 96.437(2) |
| ibmT | 94.693(2) | 94.770(3) | 95.260(4) | 94.318(8) |
AUC values when floor signal values in MAS-preprocessed data were 1, 5, 10, and 15, corresponding to substitutions of 4.1, 24.7, 40.2, and 50.8% of signals.
Figure 1Effect of the weight (. AUC values for the weight term (w, light blue circle) in WAD, AD (black circle), and WAD (red circle) are shown. Analyses of Datasets 3–26 and Datasets 27–38 were performed using MAS- and RMA-preprocessed data, respectively, following the choice of preprocessing algorithm in the original papers. The average AUC values for their respective methods as well as the other methods are shown in Table 3. Note that WAD statistics (AD with the w term) can overall give higher AUC values than AD statistics.
Results for Dataset 3–38 using eight methods
| Method | Average AUC (%) | No. of datasets best performed |
| WAD | ||
| AD | 94.758 | 1 |
| FC | 94.659 | 4 |
| RP | 93.182 | 2 |
| modT | 95.541 | 1 |
| samT | 95.866 | 7 |
| shrinkT | 95.439 | 4 |
| ibmT | 96.060 | 5 |
Analyses of Datasets 3–26 and Datasets 27–38 were performed using MAS- and RMA-preprocessed data, respectively, following the choice of preprocessing algorithm in the original papers. Accordingly, the average AUC value was calculated from those for Datasets 3–26 (MAS) and Datasets 27–38 (RMA). The best performing methods for each dataset is given in the additional file [see Additional file 1].
Average AUC values for Datasets 3–26 and 27–38
| Datasets 3–26 | Datasets 27–38 | ||||||
| Method | MAS | RMA | DFW | MAS | RMA | DFW | Average |
| WAD | 91.373(6) | 91.407(5) | 96.732(2) | 94.090(4) | |||
| AD | 93.755(6) | 93.098(2) | 92.239(2) | 87.411(7) | 94.222(2) | 92.915 | |
| FC | 93.625(7) | 92.239(2) | 88.230(6) | 96.726(3) | 94.221(3) | 93.026 | |
| RP | 91.511(8) | 92.540(3) | 84.552(8) | 96.526(4) | 92.055 | ||
| modT | 95.673(5) | 91.381(5) | 90.109(7) | 90.895(4) | 95.277(7) | 92.355(7) | 92.615 |
| samT | 95.947(3) | 91.231(8) | 89.959(8) | 90.305(5) | 95.702(5) | 92.052(8) | 92.533 |
| shrinkT | 95.733(4) | 91.316(7) | 91.451(4) | 90.968(3) | 94.851(8) | 93.684(5) | 93.001 |
| ibmT | 96.344(2) | 91.771(4) | 90.252(6) | 91.921(2) | 95.491(6) | 92.427(6) | 93.034 |
| Average | 94.916 | 91.978 | 91.274 | 89.587 | 96.009 | 93.465 | |
Analyses of Datasets 3–26 and Datasets 27–38 were performed using MAS- and RMA-preprocessed data, respectively, following the choice of preprocessing algorithm in the original papers.
Average number of genes common to each pair of methods for Datasets 3–38
| (a) MAS | AD | FC | RP | modT | samT | shrinkT | ibmT |
| WAD | 52.0 | 39.1 | 49.7 | 37.7 | 45.2 | 39.8 | 42.8 |
| AD | 61.9 | 84.1 | 34.4 | 47.1 | 37.2 | 33.2 | |
| FC | 58.2 | 29.5 | 39.1 | 31.2 | 28.1 | ||
| RP | 30.5 | 41.8 | 32.4 | 29.8 | |||
| modT | 79.9 | 92.7 | 78.1 | ||||
| samT | 83.5 | 65.0 | |||||
| shrinkT | 74.8 | ||||||
| (b) RMA | AD | FC | RP | modT | samT | shrinkT | ibmT |
| WAD | 62.2 | 50.4 | 60.2 | 31.7 | 32.6 | 30.8 | 33.4 |
| AD | 78.8 | 84.7 | 35.2 | 36.2 | 33.9 | 38.0 | |
| FC | 72.3 | 32.2 | 32.8 | 30.8 | 34.5 | ||
| RP | 36.8 | 37.6 | 35.4 | 39.4 | |||
| modT | 88.3 | 93.0 | 88.4 | ||||
| samT | 87.6 | 83.6 | |||||
| shrinkT | 85.1 | ||||||
| (c) DFW | AD | FC | RP | modT | samT | shrinkT | ibmT |
| WAD | 84.3 | 83.9 | 72.1 | 13.6 | 13.4 | 14.6 | 13.9 |
| AD | 98.6 | 77.3 | 13.7 | 13.5 | 14.7 | 14.0 | |
| FC | 77.0 | 13.6 | 13.4 | 14.6 | 13.9 | ||
| RP | 18.1 | 17.9 | 20.1 | 18.8 | |||
| modT | 94.1 | 83.2 | 93.4 | ||||
| samT | 81.1 | 91.0 | |||||
| shrinkT | 83.0 | ||||||
The averages were calculated from top 100 genes. Due to the symmetric nature of the matrix only the upper triangular part is presented.
Average number of common genes in results of three preprocessing algorithms for Datasets 3–38
| Method | Top 20 | Top 50 | Top 100 | Top 200 |
| WAD | ||||
| AD | 4.5 | 10.7 | 20.0 | 37.9 |
| FC | 5.0 | 12.2 | 22.0 | 40.6 |
| RP | 4.6 | 11.1 | 20.6 | 40.5 |
| modT | 4.4 | 13.1 | 27.6 | 60.3 |
| samT | 4.0 | 11.9 | 24.4 | 52.4 |
| shrinkT | 4.5 | 13.6 | 29.0 | 62.4 |
| ibmT | 5.3 | 15.2 | 32.0 | 66.7 |
Statistical significance between two methods for Datasets 3–38
| a) MAS | Inferior | ||||||||
| WAD | AD | FC | RP | modT | samT | shrinkT | ibmT | ||
| Superior | WAD | - | 1.8E-01 | ||||||
| AD | 1.0E+00 | - | 8.1E-01 | 1.0E+00 | 1.0E+00 | 1.0E+00 | 1.0E+00 | ||
| FC | 1.0E+00 | 1.9E-01 | - | 1.0E+00 | 1.0E+00 | 1.0E+00 | 1.0E+00 | ||
| RP | 1.0E+00 | 1.0E+00 | 1.0E+00 | - | 1.0E+00 | 1.0E+00 | 1.0E+00 | 1.0E+00 | |
| modT | 9.8E-01 | - | 4.7E-01 | 9.0E-01 | 1.0E+00 | ||||
| samT | 9.8E-01 | 5.3E-01 | - | 6.9E-01 | 1.0E+00 | ||||
| shrinkT | 9.8E-01 | 1.0E-01 | 3.1E-01 | - | 1.0E+00 | ||||
| ibmT | 8.2E-01 | - | |||||||
| (b) RMA | Inferior | ||||||||
| WAD | AD | FC | RP | modT | samT | shrinkT | ibmT | ||
| Superior | WAD | - | 9.8E-01 | 9.8E-01 | 8.9E-01 | 3.0E-01 | 3.1E-01 | 2.5E-01 | 4.4E-01 |
| AD | - | 4.7E-01 | 8.3E-02 | ||||||
| FC | 5.3E-01 | - | 8.8E-02 | ||||||
| RP | 1.1E-01 | 9.2E-01 | 9.1E-01 | - | 8.4E-02 | 9.7E-02 | 6.6E-02 | 1.7E-01 | |
| modT | 7.0E-01 | 9.9E-01 | 9.9E-01 | 9.2E-01 | - | 5.6E-01 | 6.5E-02 | 1.0E+00 | |
| samT | 6.9E-01 | 9.9E-01 | 9.9E-01 | 9.0E-01 | 4.4E-01 | - | 2.1E-01 | 8.3E-01 | |
| shrinkT | 7.5E-01 | 9.9E-01 | 9.9E-01 | 9.3E-01 | 9.4E-01 | 7.9E-01 | - | 1.0E+00 | |
| ibmT | 5.6E-01 | 9.7E-01 | 9.7E-01 | 8.3E-01 | 1.7E-01 | - | |||
| (c) DFW | Inferior | ||||||||
| WAD | AD | FC | RP | modT | samT | shrinkT | ibmT | ||
| Superior | WAD | - | 1.0E+00 | 1.0E+00 | 1.0E+00 | 1.3E-01 | 1.2E-01 | 4.5E-01 | 1.6E-01 |
| AD | - | 1.6E-01 | 9.6E-01 | 5.1E-02 | 2.1E-01 | 6.9E-02 | |||
| FC | 8.4E-01 | - | 9.6E-01 | 5.1E-02 | 2.1E-01 | 6.9E-02 | |||
| RP | - | 1.1E-01 | |||||||
| modT | 8.7E-01 | 9.5E-01 | 9.5E-01 | 9.7E-01 | - | 8.6E-02 | 9.9E-01 | 8.5E-01 | |
| samT | 8.8E-01 | 9.5E-01 | 9.5E-01 | 9.7E-01 | 9.1E-01 | - | 9.9E-01 | 1.0E+00 | |
| shrinkT | 5.5E-01 | 7.9E-01 | 7.9E-01 | 8.9E-01 | - | ||||
| ibmT | 8.4E-01 | 9.3E-01 | 9.3E-01 | 9.6E-01 | 1.5E-01 | 9.7E-01 | - | ||
The p-values between the 36 AUC values from a possibly superior method and those from a possibly inferior method were calculated by a one-tail paired t-test. The null hypothesis is that the mean of the 36 AUC values for one method is the same as that for the other method. There are two p-values for two methods compared. For example, in (a) MAS-preprocessed data, the p-value is 1.8E-01 when the alternative hypothesis is that the mean of the 36 AUC values for WAD is greater than that for ibmT while the p-value is 8.2E-01 when the alternative hypothesis is that the mean of the 36 AUC values for ibmT is greater than that for WAD. Combinations having p < 0.05 are highlighted in bold.