Li-Xuan Qin, Huei-Chung Huang, Qin Zhou.
Keywords: log transformation; microRNA; microarray; normalization; preprocessing; probe set summarization
Year: 2015 PMID: 26380547 PMCID: PMC4560483 DOI: 10.4137/CIN.S21630
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Orderings of preprocessing steps applied to the test data.
| ORDERING | FIRST | SECOND | THIRD |
|---|---|---|---|
| A | Quantile normalization | Log2 | Median |
| B | Log2 | Quantile normalization | Median |
| C | Median | Quantile normalization | Log2 |
| D | Median | Log2 | Quantile normalization |
| Reference | Log2 | Median | – |
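The orderings in the table above can be sketched as a small pipeline. This is a minimal illustration, not the authors' code: the function names, the `ORDERINGS` dictionary, the row-per-probe matrix layout, and the simple tie handling in the quantile step are all assumptions made for the example.

```python
import math
from statistics import median

def log2_transform(matrix):
    """Apply log2 to every intensity (rows = probes, columns = arrays)."""
    return [[math.log2(x) for x in row] for row in matrix]

def quantile_normalize(matrix):
    """Give every array (column) the same empirical distribution.

    Ties are broken by row order here; real implementations usually average them.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    columns = [[row[j] for row in matrix] for j in range(n_cols)]
    sorted_cols = [sorted(col) for col in columns]
    # Mean of the i-th smallest value across arrays.
    rank_means = [sum(sc[i] for sc in sorted_cols) / n_cols for i in range(n_rows)]
    out = [[0.0] * n_cols for _ in range(n_rows)]
    for j, col in enumerate(columns):
        order = sorted(range(n_rows), key=lambda i: col[i])
        for rank, i in enumerate(order):
            out[i][j] = rank_means[rank]
    return out

def median_summarize(matrix, probe_sets):
    """Collapse replicate probes into one row per probe set (median per array)."""
    groups = {}
    for row, name in zip(matrix, probe_sets):
        groups.setdefault(name, []).append(row)
    names = sorted(groups)
    return [[median(col) for col in zip(*groups[n])] for n in names], names

# Step sequences taken from the orderings table.
ORDERINGS = {
    "A": ["quantile", "log2", "median"],
    "B": ["log2", "quantile", "median"],
    "C": ["median", "quantile", "log2"],
    "D": ["median", "log2", "quantile"],
    "Reference": ["log2", "median"],
}

def preprocess(matrix, probe_sets, ordering):
    """Run the steps of one ordering in sequence; returns (matrix, row labels)."""
    labels = list(probe_sets)
    for step in ORDERINGS[ordering]:
        if step == "log2":
            matrix = log2_transform(matrix)
        elif step == "quantile":
            matrix = quantile_normalize(matrix)
        else:  # "median"
            matrix, labels = median_summarize(matrix, labels)
    return matrix, labels

# Toy data (hypothetical): four probes from two probe sets on two arrays.
toy = [[2, 4], [8, 16], [4, 8], [16, 32]]
toy_sets = ["m1", "m1", "m2", "m2"]
result_d, labels_d = preprocess(toy, toy_sets, "D")
```

Because median summarization changes the number of rows, the sketch carries the row labels along with the matrix so that any step can follow any other.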
Statistical measures for method comparison.
| | DECLARED SIGNIFICANT IN BENCHMARK | DECLARED INSIGNIFICANT IN BENCHMARK |
|---|---|---|
| Declared significant in test data | True positive (TP) | False positive (FP) |
| Declared insignificant in test data | False negative (FN) | True negative (TN) |
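The three rates reported in the results tables follow from these four counts. The sketch below assumes the standard definitions, which agree with the fractions shown in the results tables (for example, TPR uses the number of benchmark-significant markers as its denominator); the function name `comparison_metrics` is illustrative.

```python
def comparison_metrics(tp, fp, fn, tn):
    """Return TPR, FPR and FDR as fractions, treating the benchmark as truth."""
    return {
        "TPR": tp / (tp + fn),  # benchmark-significant markers recovered in the test data
        "FPR": fp / (fp + tn),  # benchmark-insignificant markers called significant
        "FDR": fp / (tp + fp),  # test-data calls not supported by the benchmark
    }

# Ordering C at the 0.01 cutoff: 710 calls, 326 of them among the 351 benchmark markers.
metrics = comparison_metrics(tp=326, fp=384, fn=351 - 326, tn=3172 - 384)
percentages = {name: round(100 * value, 1) for name, value in metrics.items()}
print(percentages)  # {'TPR': 92.9, 'FPR': 12.1, 'FDR': 54.1}
```

The false negative and true negative counts are not listed in the results tables directly; they are obtained here by subtracting from the column totals (351 and 3172).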
Results of differential expression analysis for the test dataset. A P-value cutoff of 0.01 was used to claim significant markers in both the benchmark dataset and the test dataset. At this cutoff, the number of significant markers was 351 in the benchmark dataset.
| ORDERING | NUMBER OF SIGNIFICANT MARKERS | TPR% | FPR% | FDR% |
|---|---|---|---|---|
| A | 710 | | 12.0 (382/3172) | 53.8 (382/710) |
| B | 708 | | 12.0 ( | |
| C | 710 | 92.9 (326/351) | 12.1 (384/3172) | 54.1 (384/710) |
| D | 712 | 92.9 (326/351) | 12.2 (386/3172) | 54.2 (386/712) |
| Reference | 1934 | 52.7 (185/351) | 55.1 (1749/3172) | 90.4 (1749/1934) |
Results of differential expression analysis for the test dataset. A P-value cutoff of 0.001 was used to claim significant markers in both the benchmark dataset and the test dataset. At this cutoff, the number of significant markers was 186 in the benchmark dataset.
| ORDERING | NUMBER OF SIGNIFICANT MARKERS | TPR% | FPR% | FDR% |
|---|---|---|---|---|
| A | 443 | 94.6 (176/186) | 8.00 (267/3337) | 60.3 (267/443) |
| B | 441 | 94.6 (176/186) | 7.94 (265/3337) | 60.1 (265/441) |
| C | 441 | 94.6 (176/186) | 7.94 (265/3337) | 60.1 (265/441) |
| D | 442 | 94.6 (176/186) | 7.97 (266/3337) | 60.2 (266/442) |
| Reference | 281 | 54.8 (102/186) | 5.36 (179/3337) | 63.7 (179/281) |
Figure 1. Receiver operating characteristic (ROC) curves comparing the differential expression P-values for the test dataset with those for the benchmark dataset, treating the latter as the gold standard.
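A ROC curve of this kind can be traced by sweeping a cutoff over the test-data P-values and recording the false and true positive rates at each step. The sketch below assumes the benchmark has already been reduced to a significant/insignificant call per marker; the function names and toy values are illustrative only.

```python
def roc_points(test_pvalues, benchmark_significant):
    """Return (FPR, TPR) points as the P-value cutoff grows from 0 to 1."""
    n_pos = sum(1 for s in benchmark_significant if s)
    n_neg = len(benchmark_significant) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need both significant and insignificant benchmark calls")
    ranked = sorted(zip(test_pvalues, benchmark_significant))
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        cutoff = ranked[i][0]
        # Markers sharing one P-value cross the cutoff together.
        while i < len(ranked) and ranked[i][0] == cutoff:
            if ranked[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points

def area_under_curve(points):
    """Trapezoidal area under a list of (FPR, TPR) points."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Toy example (hypothetical values): the two benchmark markers have the smallest P-values.
pts = roc_points([0.001, 0.02, 0.2, 0.5], [True, True, False, False])
print(area_under_curve(pts))  # 1.0
```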
Figure 2. Results of the simulation study. Dots represent the means, and error bars the standard deviations, of each summary statistic (TPR, FPR, and FDR) across the 100 simulated datasets for each simulation setting. The x-axis shows the value of σ, the standard deviation of the zero-mean Gaussian distribution from which the extra noise added to the probe-level data of the test dataset was generated.
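The simulation design described in the caption can be sketched as follows. This is an illustration under assumptions: the caption does not say on which scale the noise was added, and `evaluate` is a hypothetical callback standing in for the full preprocessing and differential expression analysis, returning the summary statistics for one noisy dataset.

```python
import random
from statistics import mean, stdev

def add_noise(matrix, sigma, rng):
    """Add independent N(0, sigma^2) noise to every probe-level value."""
    return [[x + rng.gauss(0.0, sigma) for x in row] for row in matrix]

def summarize_simulations(matrix, sigma, evaluate, n_sim=100, seed=0):
    """Return {statistic: (mean, standard deviation)} over n_sim noisy datasets.

    `evaluate` is a hypothetical function mapping a noisy matrix to a dict
    such as {"TPR": ..., "FPR": ..., "FDR": ...}.
    """
    rng = random.Random(seed)  # fixed seed so the runs are reproducible
    results = [evaluate(add_noise(matrix, sigma, rng)) for _ in range(n_sim)]
    return {key: (mean(r[key] for r in results), stdev(r[key] for r in results))
            for key in results[0]}
```

Calling `summarize_simulations` once per value of σ would yield the dot (mean) and error bar (standard deviation) for each statistic at that noise level.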