| Literature DB >> 26244016 |
Abstract
The rank product statistic has been widely used to detect differentially expressed genes in replicated microarrays and a one-class setting. The objective of this article is to apply a rank product statistic to approximate the P-value of differential expression in a two-class setting, such as in normal and cancer cells. For this purpose, we introduce a simple statistic that compares the P-values of each class's rank product statistic. Its null distribution is straightforwardly derived using the change-of-variable technique.Entities:
Keywords: change of variable; chi-squared approximation; log-transformation; rank product statistic; two-class setting
Year: 2015 PMID: 26244016 PMCID: PMC4507469 DOI: 10.4137/BBI.S26414
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
False-positive rates of the proposed statistic for various nominal α-levels and numbers of samples, where m1 and m2 are the sample numbers of the control group and the treatment group, respectively.
| m1, m2 | ||||
|---|---|---|---|---|
| 0.01 | 0.05 | 0.10 | 0.25 | |
| 10,10 | 0.0097 | 0.0494 | 0.0997 | 0.2501 |
| 20,20 | 0.0099 | 0.0496 | 0.0993 | 0.2495 |
| 30,30 | 0.0095 | 0.0492 | 0.0994 | 0.2491 |
| 10,20 | 0.0097 | 0.0496 | 0.0994 | 0.2493 |
| 20,10 | 0.0095 | 0.0496 | 0.0997 | 0.2499 |
Note: The numbers denote the rates of genes that were identified by the proposed statistic as differentially expressed at α.
Power of the proposed statistic for various nominal α-levels.
| ADDED CONSTANT | ||||
|---|---|---|---|---|
| 0.25 | 0.5 | 1.0 | 1.5 | |
| 0.01 | 0.08 (0.06) | 0.32 (0.20) | 0.91 (0.74) | 1.0 (0.98) |
| 0.05 | 0.24 (0.19) | 0.56 (0.44) | 0.97 (0.91) | 1.0 (1.0) |
| 0.1 | 0.35 (0.30) | 0.69 (0.58) | 0.98 (0.96) | 1.0 (1.0) |
| 0.2 | 0.57 (0.54) | 0.83 (0.80) | 0.99 (0.99) | 1.0 (1.0) |
Notes: We simulated 10,000 genes such that the gene expression in 40 microarrays for each gene was simulated independently from a standard normal distribution, and where the first 20 samples were the control group and the second 20 were the treatment group. We randomly selected 5% of genes and added a constant of 0.25 to their treatment group. These selected genes had a higher average expression in the treatment group; however, there was no difference between the two groups for the remaining 95% of genes. We repeated the same procedure by adding larger constants: 0.5, 1.0, and 1.5. The numbers denote the percentages of differentially expressed genes that were identified by the proposed statistic as differentially expressed. For comparison, the numbers inside parentheses denote the percentages of differentially expressed genes identified by the Wilcoxon rank-sum statistic.
Our P-values obtained under the overall null hypothesis that the expression levels are exchangeable within each of the independent microarrays.
| AFFYMETRIX ID | DESCRIPTION | OUR TOP 25 P-VALUES | WILCOXON’S |
|---|---|---|---|
| Y00787* | interleukin-8 precursor | 6.07 × 10−11 | 3.39 × 10−06 |
| M27891* | CST3 Cystatin C | 9.69 × 10−09 | 3.32 × 10−09 |
| M96326* | Azurocidin gene | 6.69 × 10−08 | 8.28 × 10−06 |
| M28130* | Interleukin 8 gene | 2.85 × 10−07 | 2.67 × 10−06 |
| M63438 | glutamine synthase | 7.17 × 10−07 | 1.10 × 10−04 |
| X17042* | PRG1 Proteoglycan 1, secretory granule | 2.51 × 10−06 | 2.74 × 10−05 |
| U01317 | Delta-globin gene | 4.47 × 10−06 | 6.42 × 10−04 |
| M19507 | mpo myeloperoxidase | 5.95 × 10−06 | 1.53 × 10−05 |
| M91036 | G-gamma globin | 8.83 × 10−06 | 1.37 × 10−03 |
| M87789 | hybridoma H210 | 1.00 × 10−05 | 2.06 × 10−04 |
| X95735* | Zyxin | 1.14 × 10−05 | 8.31 × 10−10 |
| M19045* | LYZ | 1.27 × 10−05 | 2.67 × 10−06 |
| X14008 | Lysozyme gene | 1.81 × 10−05 | 6.67 × 10−06 |
| X64072 | SELL Leukocyte adhesion protein beta subunit | 2.09 × 10−05 | 2.74 × 10−05 |
| J04990 | cathepsin g precursor | 2.38 × 10−05 | 1.53 × 10−05 |
| J03801 | LYZ | 2.59 × 10−05 | 1.63 × 10−06 |
| X62320 | GRN Granulin | 4.60 × 10−05 | 4.16 × 10−07 |
| X04085* | Catalase 5′flank and exon 1 mapping to chr 11 | 5.59 × 10−05 | 2.67 × 10−06 |
| M21119 | LYZ | 7.99 × 10−05 | 9.49 × 10−04 |
| M84526* | DF D component of complement | 1.09 × 10−04 | 3.30 × 10−05 |
| M57710* | galectin 3 | 1.11 × 10−04 | 9.37 × 10−05 |
| L09209 | APLP2 Amyloid beta (A4) precursor-like protein 2 | 1.33 × 10−04 | 5.56 × 10−08 |
| L08246* | induced myeloid leukemia cell differentiation protein mcl1 | 1.53 × 10−04 | 2.67 × 10−06 |
| X62654 | ME491 | 2.21 × 10−04 | 1.15 × 10−07 |
| X65965 | manganese superoxide dismutase | 3.26 × 10−04 | 7.78 × 10−03 |
Notes: The top 25 P-values for AML-specific genes from the leukemia data of Golub et al from Equation (3). Among them, 11 genes marked with * were reported among the top 25 AML-specific genes in Golub et al. Our P-values are compared with P-values of Wilcox rank-sum test. Ten genes of the Wilcoxon rank-sum statistic were reported among the top 25 AML-specific genes in Golub et al.