| Literature DB >> 30200879 |
Itziar Irigoien1, Concepción Arenas2.
Abstract
BACKGROUND: An important issue in microarray data is to select, from thousands of genes, a small number of informative differentially expressed (DE) genes which may be key elements for a disease. If each gene is analyzed individually, there is a big number of hypotheses to test and a multiple comparison correction method must be used. Consequently, the resulting cut-off value may be too small. Moreover, an important issue is the selection's replicability of the DE genes. We present a new method, called ORdensity, to obtain a reproducible selection of DE genes. It takes into account the relation between all genes and it is not a gene-by-gene approach, unlike the usually applied techniques to DE gene selection.Entities:
Keywords: Differentially expressed gene; Multivariate statistics; Outlier; Quantile
Mesh:
Substances:
Year: 2018 PMID: 30200879 PMCID: PMC6131896 DOI: 10.1186/s12859-018-2318-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1General outline of the proposed ORdensity approach. In green the first step of the method and in red the second step of the method
Fig. 2Visualization of differences for p∈C={0.25,0.5,0.75} for two genes. In the left side, a gene whose expressions in conditions X and Y are not differentially expressed (No DE gene); in the right side, a gene that is differentially expressed in conditions X and Y (DE gene)
Fig. 3Illustrative example. a First two principal components of data corresponding to fold 1 (99.3% of explained variability), and there are represented: the potential genes (genes in S0.05) by circles; the false positives genes (genes in R) by “p"s, and the differentially expressed genes (genes generated as truly DE genes) by crosses. b Representation of the potential genes based on OR (vertical axis), FP (horizontal axis) and dFP (size of the circle is inversely proportional to its value). Truly DE genes are marked with a cross; in red and blue, genes belonging to cluster 1 and cluster 2, respectively
Illustrative example
| Cluster 1 ( | Cluster 2 ( | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Min | 23.5 | 0.0 | 0.00 | 6.3 | 7.1 | 8.13 |
|
| 53.8 | 0.0 | 0.00 | 6.7 | 9.0 | 24.78 |
|
| 79.3 | 0.0 | 0.00 | 7.3 | 9.1 | 29.13 |
| Mean | 104.5 | 0.4 | 0.25 | 8.5 | 9.2 | 28.51 |
|
| 122.1 | 0.0 | 0.00 | 9.6 | 9.7 | 32.78 |
| Max | 412.0 | 6.7 | 5.22 | 19.8 | 10.0 | 43.21 |
Basic description for the OR,FP and dFP values in the two clusters obtained using PAM and silhouette procedures
Simulation study 1, scenario 1 with n=n=30 and 100 runs
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.08 | 0.20 | 0.80 |
| % | 99.84 (0.54) | 99.56 (0.92) | 97.20 (2.2) | |
| 100 |
| 0.13 | 0.45 | 0.99 |
| % | 99.87 (0.34) | 99.34 (0.66) | 95.20 (2.3) | |
| 200 |
| 0.80 | 0.99 | 1.00 |
| % | 99.27 (0.56) | 96.70 (1.5) | 78.80 (4.0) | |
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.00 | 0.00 | 0.00 |
| % | 100 | 100 | 100 | |
| 100 |
| 0.00 | 0.00 | 0.05 |
| % | 100 | 100 | 99.95 (0.22) | |
| 200 |
| 0.01 | 0.03 | 0.95 |
| % | 100 | 99.98 (0.09) | 98.50 (0.83) | |
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.00 | 0.00 | 0.00 |
| % | 100 | 100 | 100 | |
| 100 |
| 0.00 | 0.00 | 0.00 |
| % | 100 | 100 | 100 | |
| 200 |
| 0.00 | 0.00 | 0.00 |
| % | 100 | 100 | 100 |
Evaluation of the first step of the ORdensity method using different values of α. The Table shows the estimated probability, , of no considering as potential DE gene at least one gene that it really is, and the mean proportion of DE genes (row named “%”) that the procedure considered as potential DE genes. Corresponding standard deviations are in brackets
Fig. 4Simulation study 1, scenario 1 with n=n=30 and 100 runs. Evaluation of the first step of the ORdensity method using different values of α. Top: in x axis number of DE genes; in y axis estimated probability, , of no considering as potential DE gene at least one gene that it really is. Bottom: in x axis number of DE genes; in y axis mean proportion of DE genes that the procedure considered as potential DE genes
Simulation study 1, scenario 1 with n=30, n=10 and 100 runs
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.85 | 1 | 1 |
| % | 97.2 (2.8) | 91.1 (2.1) | 69.6 (6.7) | |
| 100 |
| 1 | 1 | 1 |
| % | 93.7 (2.5) | 86.0 (3.5) | 60.1 (5.0) | |
| 200 |
| 1 | 1 | 1 |
| % | 84.9 (2.6) | 70.5 (3.5) | 34.9 (3.4) | |
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.13 | 0.33 | 0.98 |
| % | 99.7 (0.7) | 99.2 (1.2) | 93.2 (3.6) | |
| 100 |
| 0.38 | 0.88 | 1 |
| % | 99.6 (0.6) | 98.4 (1.0) | 88.4 (3.2) | |
| 200 |
| 0.99 | 1 | 1 |
| % | 97.8 (1.1) | 92.1 (1.9) | 65.3 (4.0) | |
| Nb. of |
| |||
| DE genes | 0.1 | 0.05 | 0.01 | |
| 50 |
| 0.00 | 0.00 | 0.02 |
| % | 100 | 100 | 96.0 (0.3) | |
| 100 |
| 0.00 | 0.00 | 0.14 |
| % | 100 | 100 | 99.8 (0.5) | |
| 200 |
| 0.01 | 0.11 | 0.97 |
| % | 100.0 (0.1) | 99.9 (0.2) | 98.1 (1.1) |
Evaluation of the first step of the ORdensity method using different values of α. The Table shows the estimated probability, , of no considering as a potential DE gene at least one gene that it really is, and the mean proportion of DE genes (row named “%”) that the procedure considered as potential DE genes. Corresponding standard deviations are in brackets
Fig. 5Simulation study 1, scenario 1 with n=30, n=10 and 100 runs. Evaluation of the first step of the ORdensity method using different values of α. Top: in x axis number of DE genes; in y axis estimated probability, , of no considering as potential DE gene at least one gene that it really is. Bottom: in x axis number of DE genes; in y axis mean proportion of DE genes that the procedure considered as potential DE genes
Simulation study 1, scenario 1 with n=n=30 and 100 runs
|
| DE | 10 |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| ( | ( | ( | ( | ( | ||||
|
| 30.7 | 45.7 | 0.5 | 0.4 | 0.2 | |||
| (7.9) | (6.7) | (0.3) | (0.3) | (0.9) | ||||
| 50 | 8.5 |
| 21.8 | 19.4 | 5.4 | 7.5 | 20.7 | |
| (5.9) | (5.0) | (2.1) | (4.4) | (20.9) | ||||
|
| 32.9 | 8.4 | 9.2 | 29.0 | 93.7 | |||
| (6.3) | (0.6) | (0.2) | (3.0) | (5.8) | ||||
|
| 34.1 | 46.5 | 0.0 | 0.0 | 0.0 | |||
| (11.3) | (6.3) | (0.1) | (0.1) | (0.2) | ||||
| 1.5 | 100 | 8.5 |
| 56.2 | 22.5 | 1.5 | 1.8 | 1.6 |
| (10.3) | (2.7) | (0.67) | (9.0) | (2.0) | ||||
|
| 32.1 | 9.1 | 8.7 | 21.5 | 68.9 | |||
| (5.6) | (0.5) | (0.35) | (2.0) | (8.9) | ||||
|
| 59.3 | 29.5 | 0.0 | 0.0 | 0.0 | |||
| (12.5) | (2.8) | (0.0) | (0.0) | (0) | ||||
| 200 | 8.5 |
| 113.0 | 14.8 | 0.5 | 0.8 | 0.4 | |
| (13.0) | (1.0) | (0.2) | (0.2) | (0.6) | ||||
|
| 25.3 | 8.0 | 6.0 | 9.9 | 14.9 | |||
| (6.5) | (0.5) | (0.9) | (1.8) | (8.2) | ||||
|
| 22.3 | 86.3 | 0.0 | 0.0 | 0.0 | |||
| (9.2) | (17.1) | (0.1) | (0.0) | (0) | ||||
| 50 | 8.0 |
| 27.8 | 42.0 | 1.4 | 1.5 | 5.8 | |
| (6.9) | (9.2) | (2.0) | (3.3) | (18.0) | ||||
|
| 36.0 | 9.1 | 9.2 | 27.8 | 97.0 | |||
| (5.3) | (0.61) | (0.19) | (2.0) | (2.7) | ||||
|
| 35.5 | 74.7 | 0.0 | 0.0 | 0.0 | |||
| (7.9) | (8.1) | (0.0) | (0.0) | (0) | ||||
| 2 | 100 | 8.0 |
| 63.1 | 37.3 | 0.4 | 0.4 | 0.3 |
| (7.9) | (3.1) | (0.2) | (0.2) | (0.7) | ||||
|
| 26.0 | 8.8 | 9.3 | 24.9 | 93.7 | |||
| (4.7) | (0.6) | (0.3) | (1.9) | (4.5) | ||||
|
| 69.2 | 46.6 | 0.0 | 0.0 | 0.0 | |||
| (14.1) | (3.9) | (0.0) | (0.0) | (0) | ||||
| 200 | 8.0 |
| 122.9 | 23.8 | 0.2 | 0.2 | 0.0 | |
| (17.0) | (2.4) | (0.1) | (0.1) | (0.3) | ||||
|
| 13.1 | 8.9 | 8.1 | 13.0 | 58.0 | |||
| (17.3) | (2.4) | (2.0) | (4.1) | (23.1) | ||||
|
| 18.9 | 191.2 | 0.0 | 0.0 | 0.0 | |||
| (6.6) | (22.7) | (0.0) | (0.0) | (0) | ||||
| 50 | 7.2 |
| 31.5 | 99.9 | 0.29 | 0.4 | 2.1 | |
| (4.9) | (16.1) | (1.2) | (2.6) | (14.1) | ||||
|
| 37.0 | 9.0 | 9.2 | 27.7 | 99.8 | |||
| (5.1) | (0.6) | (0.19) | (1.5) | (0.8) | ||||
|
| 38.6 | 155.7 | 0.0 | 0.0 | 0.0 | |||
| (10.0) | (15.1) | (0.0) | (0.0) | (0) | ||||
| 3 | 100 | 7.1 |
| 61.3 | 83.4 | 0.0 | 0.0 | 0.0 |
| (10.0) | (6.6) | (0.1) | (0.0) | (0) | ||||
|
| 25.1 | 8.7 | 9.3 | 25.8 | 99.6 | |||
| (4.5) | (0.49) | (0.26) | (1.7) | (1.1) | ||||
|
| 74.1 | 95.8 | 0 | 0 | 0.0 | |||
| (16.4) | (8.2) | (0) | (0) | (0) | ||||
| 200 | 7.1 |
| 115.5 | 53.1 | 0.0 | 0.0 | 0.0 | |
| (21.3) | (6.6) | (0.0) | (0.0) | (0) | ||||
|
| 16.1 | 12.5 | 8.3 | 16.4 | 86.0 | |||
| (24.8) | (11.9) | (3.2) | (6.8) | (33.6) |
Evaluation of the second step of the ORdensity method with α=0.05. In the first two columns, delta (Δ) values and number of total simulated DE genes. In column 3, the 10×f0 values where f0 is the average proportion of permuted cases in sets U. In column 4, the “*” indicates the clusters considered by the procedure. Columns 5–8 contain for each cluster: the mean number of genes (), the mean of OR values (), the mean of FP values (), the mean of dFP values (). In the last column the mean of False Positives genes per cluster in % (). Corresponding standard deviations are in brackets
Fig. 6Simulation study 1, scenario 1 with n=n=30 and 100 runs. Evaluation of the second step of the ORdensity method with α=0.05. In x axis number of DE genes; in y axis the mean of False Positives genes per cluster in % (). In red cluster 1 (C1), in green cluster 2 (C2) and in blue cluster 3 (C3)
AUC mean values for Simulation study 1, scenario 1 and 100 runs. In first column: n indicates the number of DE gens; Δ the Δ values; A the ORdensity method; B the limma method and C the SAM method
|
| 50 | 100 | ||||
|
| 1.5 | 2 | 3 | 1.5 | 2 | 3 |
| A | 0.998 | 0.998 | 0.997 | 0.998 | 0.997 | 0.995 |
| (0.002) | (0.001) | (0.002) | (0.001) | (0.002) | (0.003) | |
| B | 0.995 | 0.993 | 0.993 | 0.997 | 0.993 | 0.993 |
| (0.002) | (0.000) | (0.000) | (0.003) | (0.000) | (0.000) | |
| C | 0.994 | 0.993 | 0.993 | 0.996 | 0.993 | 0.993 |
| (0.003) | (0.000) | (0.000) | (0.002) | (0.000) | (0.000) | |
|
| 200 | 50 | ||||
|
| 1.5 | 2 | 3 | 1.5 | 2 | 3 |
| A | 0.996 | 0.997 | 0.993 | 0.974 | 0.996 | 0.998 |
| (0.000) | (0.000) | (0.000) | (0.001) | (0.001) | (0.001) | |
| B | 0.997 | 0.992 | 0.992 | 0.967 | 0.996 | 0.993 |
| (0.003) | (0.001) | (0.000) | (0.011) | (0.003) | (0.001) | |
| C | 0.998 | 0.992 | 0.992 | 0.940 | 0.994 | 0.993 |
| (0.002) | (0.001) | (0.000) | (0.016) | (0.003) | (0.001) | |
|
| 100 | 200 | ||||
|
| 1.5 | 2 | 3 | 1.5 | 2 | 3 |
| A | 0.973 | 0.994 | 0.998 | 0.959 | 0.992 | 0.997 |
| (0.001) | (0.002) | (0.003) | (0.000) | (0.000) | (0.000) | |
| B | 0.981 | 0.997 | 0.993 | 0.993 | 0.998 | 0.992 |
| (0.006) | (0.003) | (0.000) | (0.002) | (0.002) | (0.000) | |
| C | 0.946 | 0.995 | 0.993 | 0.960 | 0.996 | 0.992 |
| (0.010) | (0.003) | (0.000) | (0.007) | (0.002) | (0.000) | |
|
| 50 | 100 | ||||
|
| 1.5 | 2 | 3 | 1.5 | 2 | 3 |
| A | 0.925 | 0.974 | 0.998 | 0.921 | 0.971 | 0.997 |
| (0.014) | (0.011) | (0.011) | (0.010) | (0.001) | (0.008) | |
| B | 0.894 | 0.980 | 0.994 | 0.931 | 0.989 | 0.994 |
| (0.023) | (0.009) | (0.001) | (0.013) | (0.004) | (0.002) | |
| C | 0.859 | 0.959 | 0.994 | 0.865 | 0.960 | 0.994 |
| (0.023) | (0.015) | (0.001) | (0.016) | (0.009) | (0.002) | |
|
| 200 | |||||
|
| 1.5 | 2 | 3 | |||
| A | 0.900 | 0.956 | 0.996 | |||
| (0.002) | (0.001) | (0.001) | ||||
| B | 0.968 | 0.996 | 0.993 | |||
| (0.007) | (0.001) | (0.003) | ||||
| C | 0.876 | 0.974 | 0.995 | |||
| (0.017) | (0.007) | (0.003) | ||||
Simulation study 2 with n=n=30 and 100 runs
| Number of | 1 block | 2 blocks | |||||
| DE genes |
|
| |||||
| per block | 0.1 | 0.05 | 0.01 | 0.1 | 0.05 | 0.01 | |
| 5 |
| 0 | 0 | 0 | 0 | 0 | 0 |
| % | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | |
| 20 |
| 0 | 0 | 0 | 0 | 0 | 0 |
| % | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | |
| 100 |
| 0 | 0 | 0 | 0 | 0 | 0 |
| % | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | 100 (0) | |
| Number of | 3 blocks | ||||||
| DE genes |
| ||||||
| per block | 0.1 | 0.05 | 0.01 | ||||
| 5 |
| 0 | 0 | 0 | |||
| % | 100 (0) | 100 (0) | 100 (0) | ||||
| 20 |
| 0 | 0 | 0.01 | |||
| % | 100 | 100 | 99.9 (0.7) | ||||
| 100 |
| 0 | 0.01 | 0.04 | |||
| % | 100 (0) | 100.0 (0.0) | 99.8 (1.9) | ||||
Evaluation of the first step of the ORdensity method using different values of α. The Table shows the estimated probability, , of no considering as a potential DE gene at least one gene that it really is, and the mean proportion of DE genes (row named “%”) that the procedure considered as potential DE genes. Corresponding standard deviations are in brackets
Simulation study 2, with n=n=30 and 100 runs
| B | DE | 10 |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| ( | ( | ( | ( | ( | ||||
|
| 27.9 | 160.4 | 6.6 | 8.1 | 32.0 | |||
| (31.2) | (102.2) | (1.1) | (10.8) | (43.9) | ||||
| 5 | 9.1 |
| 92.1 | 10.1 | 9.0 | 24.1 | 100 | |
| (25.5) | (0.8) | (0.4) | (4.0) | (0) | ||||
|
| 82.5 | 7.8 | 9.4 | 41.6 | 100 | |||
| (17.9) | (0.3) | (0.3) | (2.2) | (0) | ||||
|
| 20.0 | 233.0 | 0.0 | 0.0 | 0.0 | |||
| (0.0) | (35.1) | (0.0) | (0.0) | (0) | ||||
| 1 | 20 | 9.1 |
| 96.5 | 10.4 | 9.0 | 22.0 | 100 |
| (12.7) | (0.5) | (0.1) | (1.5) | (0) | ||||
|
| 90.1 | 7.7 | 9.2 | 41.1 | 100 | |||
| (14.2) | (0.2) | (0.1) | (1.4) | (0) | ||||
|
| 100.0 | 209.5 | 0.0 | 0.0 | 0.0 | |||
| (0.0) | (32.9) | (0.0) | (0.0) | (0) | ||||
| 100 | 8.9 |
| 73.7 | 10.8 | 9.1 | 20.7 | 100 | |
| (10.9) | (0.59) | (0.2) | (1.5) | (0) | ||||
|
| 69.1 | 8.2 | 9.2 | 37.5 | 100 | |||
| (11.2) | (0.25) | (0.2) | (1.6) | (0) | ||||
|
| 9.9 | 119.7 | 1.0 | 0.2 | 0.0 | |||
| (0.6) | (16.9) | (0.2) | (0.0) | (0.0) | ||||
| 10 | 9.1 |
| 102.2 | 10.5 | 9.0 | 22.1 | 99.9 | |
| (14.2) | (0.7) | (0.1) | (1.5) | (0.7) | ||||
|
| 94.2 | 7.7 | 9.2 | 41.7 | 100 | |||
| (13.2) | (0.2) | (0.1) | (1.4) | (0.0) | ||||
|
| 40.0 | 118.6 | 0.0 | 0.0 | 0.0 | |||
| (0.1) | (20.0) | (0.0) | (0.0) | (0.0) | ||||
| 2 | 40 | 9.1 |
| 97.0 | 10.3 | 9.1 | 22.2 | 100 |
| (12.0) | (0.5) | (0.1) | (0.4) | (0.0) | ||||
|
| 87.4 | 7.7 | 9.2 | 41.2 | 100 | |||
| (12.9) | (0.2) | (0.1) | (1.2) | (0.0) | ||||
|
| 115.9 | 120.1 | 0.0 | 0.0 | 0.0 | |||
| (38.9) | (18.9) | (0.0) | (0.0) | (0.0) | ||||
| 200 | 8.6 |
| 95.3 | 72.7 | 1.6 | 3.7 | 16.7 | |
| (15.5) | (31.3) | (3.5) | (8.2) | (37.8) | ||||
|
| 118.5 | 8.9 | 9.3 | 31.2 | 100 | |||
| (28.3) | (0.5) | (0.1) | (3.2) | (0.0) | ||||
|
| 14.6 | 86.2 | 0.26 | 0.09 | 0.0 | |||
| (1.14) | (14.1) | (0.31) | (0.11) | (0) | ||||
| 15 | 9.0 |
| 103.9 | 9.0 | 8.4 | 22.1 | 99.6 | |
| (11.8) | (0.13) | (0.48) | (1.5) | (1.1) | ||||
|
| 92.8 | 7.2 | 9.2 | 41.6 | 100 | |||
| (10.7) | (0.77) | (0.13) | (1.4) | (0.0) | ||||
|
| 59.9 | 81.2 | 0.03 | 0.03 | 0.0 | |||
| (0.29) | (12.1) | (0.07) | (0.06) | (0) | ||||
| 3 | 60 | 8.9 |
| 93.8 | 10.3 | 9.1 | 22.3 | 99.9 |
| (12.3) | (0.43) | (0.13) | (1.4) | (0.3) | ||||
|
| 87.2 | 7.7 | 9.2 | 41.3 | 100 | |||
| (11.4) | (0.16) | (0.13) | (1.3) | (0.0) | ||||
|
| 142.4 | 81.0 | 0.00 | 0.00 | 0.0 | |||
| (31.0) | (13.7) | (0.00) | (0.00) | (0) | ||||
| 300 | 8.4 |
| 157.5 | 54.6 | 0.03 | 0.04 | 0.0 | |
| (31.0) | (11.1) | (0.06) | (0.09) | (0) | ||||
|
| 119.2 | 8.9 | 9.4 | 30.3 | 99.9 | |||
| (11.4) | (0.28) | (0.10) | (0.96) | (0.02) |
Evaluation of the second step of the ORdensity method with α=0.05. In the first column number of blocks. In column 2, the number of total simulated DE genes. In column 3, the 10×f0 values where f0 is the average proportion of permuted cases in sets U. In column 4, the “*” indicates the clusters considered by the procedure. Columns 5–8 contain for each cluster: the mean number of genes (), the mean of OR values (), the mean of FP values (), the mean of dFP values (). In the last column the mean of False Positives genes per cluster in % (). Corresponding standard deviations are in brackets
Fig. 7Simulation study 2 with n=n=30 and 100 runs. Evaluation of the second step of the ORdensity method with α=0.05. In x axis number of DE genes; in y axis the mean of False Positives genes per cluster in % (). In red cluster 1 (C1), in green cluster 2 (C2) and in blue cluster 3 (C3)
AUC mean values for Simulation study 2, with n=n=30 and 100 runs
| 1 block | 2 blocks | |||||
| Nb. DE genes | 5 | 20 | 100 | 5 | 20 | 50 |
| ORdensity | 0.995 | 0.995 | 0.980 | 0.993 | 0.998 | 0.985 |
| (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | |
| limma | 0.701 | 0.854 | 0.903 | 0.796 | 0.789 | 0.917 |
| (0.0004) | (0.001) | (0.001) | (0.000) | (0.001) | (0.045) | |
| SAM | 0.701 | 0.854 | 0.902 | 0.796 | 0.789 | 0.918 |
| (0.001) | (0.001) | (0.00002) | (0.000) | (0.0003) | (0.045) | |
| 3 block | ||||||
| Nb. DE genes | 5 | 20 | 50 | |||
| ORdensity | 0.983 | 0.997 | 0.990 | |||
| (0.002) | (0.000) | (0.000) | ||||
| limma | 0.982 | 0.997 | 0.990 | |||
| (0.001) | (0.003) | (0.069) | ||||
| SAM | 0.825 | 0.994 | 0.876 | |||
| (0.000) | (0.003) | (0.070) | ||||
Lymphoma cancer data set
| Ns | ||||||
|---|---|---|---|---|---|---|
| 19 | 24 | 64 | 88 | 96 | ||
| I | ORdensity |
| 100 | 100 | 100 | 100 |
| limma | 95.24 |
| 100 |
| - | |
| SAM | 100 | 100 |
| - | - | |
| II | A vs B | 13 | 17 | 14 | 17 | - |
| A vs C | 14 | 16 | 42 | 63 | - | |
| B vs C | 17 | 23 | 58 | - | - | |
| III | ORdensity | 10.2 | 13.2 | 40.4 | 61.4 | 65.3 |
| (1.48) | (2.84) | (2.01) | (2.27) | (3.13) | ||
| limma | 7.9 | 9.7 | 27.4 | 36.7 | - | |
| (0.87) | (1.57) | (2.17) | (1.34) | - | ||
| SAM | 14.2 | 17.3 | 51.4 | - | - | |
| (2.82) | (3.97) | (8.85) | - | - | ||
Results for different number (Ns) of selected genes: 19 with ORdensity relaxed selection; 24 with limma and Bonferroni; 64 with SAM; 88 with limma and BH, and 96 total potential DE genes. Rows I present the leave-one-out cross-validation correct classification rate. In bold, the results for the genes selected under standard criteria for ORdensity, limma and SAM procedures; in rows II, number of common selected genes between the ORdensity, limma and SAM approaches; in rows III, mean and standard deviation (in brackets) of the number of genes that for 10 subsamples were always kept selected
Golub cancer data set
| Ns | ||||
|---|---|---|---|---|
| 4 | 193 | 291 | ||
| I | ORdensity (A) |
| 97.22 |
|
| limma (B) | 97.22 |
| 97.22 | |
| SAM (C) | 97.22 | 97.22 | 97.22 | |
| II | A vs B | 1 | 127 | 206 |
| A vs C | 1 | 130 | 213 | |
| B vs C | 4 | 175 | 265 | |
| III | ORdensity | 0.5 (0.71) | 150.4 (3.89) | 241.8 (6.94) |
| limma | 0 | 12.3 (2.26) | 4.3 (1.83) | |
| SAM | 3.3 (0.48) | 151.8 (24.18) | 220.7 (35.88) | |
| Ns | ||||
| 403 | 556 | 938 | ||
| I | ORdensity (A) | 97.22 | 97.22 | - |
| limma (B) | 97.22 | 97.22 |
| |
| SAM (C) |
| - | - | |
| II | A vs B | 274 | 358 | - |
| A vs C | 280 | - | - | |
| B vs C | 368 | - | - | |
| III | ORdensity | 334.1 (6.03) | 473.3 (9.56) | - |
| limma | 029.2 (2.44) | 55.5 (3.10) | 149.3 (6.36) | |
| SAM | 316.3 (50.06) | - | - | |
Results for different number (Ns) of selected genes: 4 with ORdensity strong selection; 193 with limma and Bonferroni; 291 with ORdensity relaxed selection; 403 with SAM; 556 total potential DE genes and 938 with limma and BH. Rows I present the leave-one-out cross-validation correct classification rate. In bold, the results for the genes selected under the standard criteria for ORdensity, limma and SAM procedures; in rows II, number of common selected genes between the ORdensity, limma and SAM approaches; in rows III, mean and standard deviation (in brackets) of the number of genes that for 10 subsamples were always kept selected
Colon cancer data set
| Ns | ||||
|---|---|---|---|---|
| 12 | 49 | 59 | ||
| I | ORdensity (A) |
| 90.32 |
|
| limma (B) | 91.94 |
| 88.71 | |
| SAM (C) | 85.48 | 88.71 | 88.71 | |
| II | A vs B | 7 | 32 | 38 |
| A vs C | 0 | 14 | 22 | |
| B vs C | 2 | 26 | 29 | |
| III | ORdensity | 7.5 (1.27) | 35.4 (4.09) | 43.0 (3.56) |
| limma | 0.1 (0.32) | 3.5 (0.85) | 4.1 (0.57) | |
| SAM | 1.8 (1.14) | 23.7 (2.98) | 29.4 (3.57) | |
| Ns | ||||
| 166 | 186 | 366 | ||
| I | ORdensity (A) | 87.10 | 87.10 | - |
| limma (B) | 87.10 | 87.10 |
| |
| SAM (C) |
| - | - | |
| II | A vs B | 119 | 134 | - |
| A vs C | 118 | - | - | |
| B vs C | 155 | - | - | |
| III | ORdensity | 127.2 (4.49) | 142.5 (4.84) | - |
| limma | 13.1 (1.37) | 16.7 (2.21) | 72.1 (4.72) | |
| SAM | 136.7 (5.23) | - | - | |
Results for different number (Ns) of selected genes: 12 with ORdensity strong selection; 49 with limma and Bonferroni; 59 with ORdensity relaxed selection; 166 with SAM; 186 total potential DE genes and 366 with limma and BH. Rows I present the leave-one-out cross-validation correct classification rate. In bold, the results for the genes selected under standard criteria for ORdensity, limma and SAM procedures; in rows II, number of common selected genes between the ORdensity, limma and SAM approaches. In rows III, mean and standard deviation (in brackets) of the number of genes that for 10 subsamples were always kept selected
Prostate cancer data set
| Ns | ||||
|---|---|---|---|---|
| 131 | 263 | 990 | ||
| I | ORdensity (A) |
| 74.51 |
|
| limma (B) | 86.27 |
| 71.57 | |
| SAM (C) | 82.35 | 76.47 | 70.59 | |
| II | A vs B | 64 | 170 | 691 |
| A vs C | 64 | 159 | 746 | |
| B vs C | 54 | 211 | 791 | |
| III | ORdensity | 72.4 (7.12) | 184.1 (23.56) | 787.6 (29.40) |
| limma | 108.6 (4.99) | 220.2 (10.07) | 827.2 (39.17) | |
| SAM | 0.7 (0.67) | 4.4 (0.84) | 66.1 (4.38) | |
| Ns | ||||
| 1531 | 2684 | 3322 | ||
| I | ORdensity (A) | 66.67 | - | - |
| limma (B) | 70.59 |
| - | |
| SAM (C) | 70.59 | 66.67 |
| |
| II | A vs B | 961 | - | - |
| A vs C | 1035 | - | - | |
| B vs C | 1321 | 2422 | - | |
| III | ORdensity | 1215.4 (99.66) | - | - |
| limma | 1297.9 (50.64) | 2299.5 (86.19) | - | |
| SAM | 165.1 (6.15) | 563 (19.87) | 862.2 (19.41) | |
Results for different number (Ns) of selected genes: 131 with ORdensity strong selection; 263 with limma and Bonferroni; 990 with ORdensity relaxed selection; 1531 total potential DE genes; 2684 with limma and BH, and 3322 with SAM. Rows I present the leave-one-out cross-validation correct classification rate. In bold, the results for the genes selected under standard criteria for ORdensity, limma and SAM procedures; in rows II, number of common selected genes between the ORdensity, limma and SAM approaches; in rows III, mean and standard deviation (in brackets) of the number of genes that for 10 subsamples were always kept selected