| Literature DB >> 19417058 |
Min Zhang1, Lin Zhang, Jinfeng Zou, Chen Yao, Hui Xiao, Qing Liu, Jing Wang, Dong Wang, Chenguang Wang, Zheng Guo.
Abstract
MOTIVATION: According to current consistency metrics such as percentage of overlapping genes (POG), lists of differentially expressed genes (DEGs) detected from different microarray studies for a complex disease are often highly inconsistent. This irreproducibility problem also exists in other high-throughput post-genomic areas such as proteomics and metabolism. A complex disease is often characterized with many coordinated molecular changes, which should be considered when evaluating the reproducibility of discovery lists from different studies.Entities:
Mesh:
Year: 2009 PMID: 19417058 PMCID: PMC2940240 DOI: 10.1093/bioinformatics/btp295
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Consistence scores between DEG lists for the same disease when using 0.1% FDR control for significant correlations
| Datasets | DEGs | POG | nPOG | POGR | Max | DEGs | POG | POGR | Max | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| From 102 samples to 103 samples | From 103 samples to 102 samples | |||||||||||
| Top 10 | 0.30 | 0.30 | 0.90 | 0.89 | 0.94 | Top 10 | 0.30 | 0.30 | 1.00 | 1.00 | 0.94 | |
| Prostate cancer | Top 50 | 0.14 | 0.14 | 0.78 | 0.69 | 0.70 | Top 50 | 0.14 | 0.14 | 0.92 | 0.89 | 0.73 |
| Top100 | 0.15 | 0.14 | 0.80 | 0.66 | 0.60 | Top100 | 0.15 | 0.14 | 0.94 | 0.90 | 0.63 | |
| 1054 vs. 1343 | 0.38 | 0.30 | 0.90 | 0.74 | 0.38 | 1343 vs. 1054 | 0.30 | 0.23 | 0.90 | 0.76 | 0.40 | |
| From 38 samples to 18 samples | From 18 samples to 38 samples | |||||||||||
| Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | |
| Lung cancer | Top 50 | 0.20 | 0.19 | 0.20 | 0.19 | 0.99 | Top 50 | 0.20 | 0.19 | 0.20 | 0.19 | 0.99 |
| Top100 | 0.31 | 0.30 | 0.32 | 0.31 | 0.99 | Top100 | 0.31 | 0.30 | 0.31 | 0.30 | 0.99 | |
| 2157 vs. 336 | 0.13 | 0.09 | 0.13 | 0.09 | 0.96 | 336 vs. 2157 | 0.82 | 0.75 | 0.82 | 0.75 | 0.72 | |
| From 24 samples to 36 samples | From 36 samples to 24 samples | |||||||||||
| Top 10 | 0.20 | 0.20 | 0.70 | 0.70 | 1.00 | Top 10 | 0.20 | 0.20 | 0.80 | 0.80 | 1.00 | |
| DMD | Top 50 | 0.42 | 0.42 | 0.92 | 0.92 | 0.99 | Top 50 | 0.42 | 0.42 | 0.80 | 0.80 | 0.99 |
| Top100 | 0.54 | 0.54 | 0.94 | 0.94 | 0.98 | Top100 | 0.54 | 0.54 | 0.85 | 0.85 | 0.98 | |
| 805 vs. 800 | 0.53 | 0.50 | 0.72 | 0.68 | 0.89 | 800 vs. 805 | 0.53 | 0.49 | 0.65 | 0.61 | 0.89 | |
aMax means the maximum potential agreement beyond chance (see ‘Methods’ section).
bTwo datasets for each disease are marked by their sample sizes. ‘From dataset1 to dataset2’ means that the reproducibility of a DEG list detected in dataset1 is evaluated in dataset2.
Fig. 1.The effect of list lengths on POG, nPOG, POGR and nPOGR. The x-axis represents list lengths ranging from 100 to 2000, and the y-axis represents the average scores of 10 000 pairs of gene lists randomly selected from the original datasets. Here, 0.1% FDR level is used for detecting significant correlations. The legend for all the six subgraphs is the same as shown within the last one.
Fig. 2.The effect of FDR control levels on POG, nPOG, POGR and nPOGR. The x-axis represents FDR cotnrol levels ranging from 0.01% to 5%. The y-axis represents the average scores of 10 000 pairs of gene lists with a given length (1000) randomly selected from the original datasets. The legend for all the six subgraphs is the same as shown within the last one.
Fig. 3.The distributions of the PCC of expressions of gene pairs in three pairs of datasets. The legend for all the six subgraphs is the same as shown within the last one.
Consistence scores between DEG lists for different diseases when using 0.1% FDR control for significant correlations
| Datasets | DEGs | POG | nPOG | POGR | Maxa | DEGs | POG | POGR | Maxa | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| From Prostate to DMDb | From DMD to Prostateb | |||||||||||
| Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.98 | Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.98 | |
| Prostate vs. DMD | Top 50 | 0.02 | 0.02 | 0.04 | 0.00 | 0.93 | Top 50 | 0.02 | 0.02 | 0.04 | 0.00 | 0.93 |
| Top100 | 0.01 | 0.00 | 0.05 | 0.00 | 0.89 | Top100 | 0.01 | 0.00 | 0.06 | 0.00 | 0.89 | |
| 1842 vs. 801 | 0.05 | 0.00 | 0.16 | 0.00 | 0.68 | 801 vs. 1842 | 0.11 | 0.01 | 0.37 | 0.00 | 0.52 | |
| From Lung to DMDb | From DMD to Lungb | |||||||||||
| Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | |
| Lung vs. DMD | Top 50 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | Top 50 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 |
| Top100 | 0.00 | 0.00 | 0.00 | 0.00 | 0.99 | Top100 | 0.00 | 0.00 | 0.00 | 0.00 | 0.99 | |
| 3303 vs. 797 | 0.05 | 0.00 | 0.05 | 0.00 | 0.95 | 797 vs. 3303 | 0.21 | 0.02 | 0.21 | 0.02 | 0.81 | |
| From Prostate to Lungb | From Lung to Prostateb | |||||||||||
| Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | Top 10 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | |
| Prostate vs. lung | Top 50 | 0.02 | 0.02 | 0.02 | 0.02 | 1.00 | Top 50 | 0.02 | 0.02 | 0.02 | 0.02 | 1.00 |
| Top100 | 0.04 | 0.03 | 0.04 | 0.03 | 0.99 | Top100 | 0.04 | 0.03 | 0.04 | 0.03 | 0.99 | |
| 1353 vs. 2763 | 0.35 | 0.15 | 0.35 | 0.15 | 0.77 | 2763 vs. 1353 | 0.17 | 0.06 | 0.17 | 0.07 | 0.89 | |
Indicated as in Table 1.