| Literature DB >> 15904488 |
Xing Qiu1, Andrew I Brooks, Lev Klebanov, Ndrei Yakovlev.
Abstract
BACKGROUND: Stochastic dependence between gene expression levels in microarray data is of critical importance for the methods of statistical inference that resort to pooling test-statistics across genes. It is frequently assumed that dependence between genes (or tests) is sufficiently weak to justify the proposed methods of testing for differentially expressed genes. A potential impact of between-gene correlations on the performance of such methods has yet to be explored.Entities:
Mesh:
Year: 2005 PMID: 15904488 PMCID: PMC1156869 DOI: 10.1186/1471-2105-6-120
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The histogram of correlation coeffcients for overlapping pairs of t-statistics associated with individual genes in the SJCRH data. A: data before normalization, B: GEO, C: RANK, D: QUANT, E: simulated set of data SIMU1.
Figure 2The effect of the normalization procedure QUANT as applied to the SIMU2N data. A: data without noise (SIMU2), B: data with noise (SIMU2N), C: SIMU2 after normalization, D: SIMU2N after normalization.
Figure 3The behavior of the standard deviation of the sample mean as a function of the number of involved genes. 1. Raw biological data; 2. Quantile normalization; 3. Independent simulations (SIMU1).
Long-range correlation analysis for the SIMU2N data.
| Gene Label | GEO | QUANT | RANK | SIMU2N |
| 1 | 743 | 746 | 741 | 12558 |
| 2 | 754 | 750 | 756 | 12558 |
| 3 | 723 | 723 | 721 | 12558 |
| 4 | 705 | 698 | 718 | 12558 |
| 5 | 736 | 734 | 754 | 12558 |
| 6 | 751 | 763 | 765 | 12558 |
| 7 | 702 | 695 | 709 | 12558 |
| 8 | 667 | 665 | 679 | 12558 |
| 9 | 747 | 747 | 759 | 12558 |
| 10 | 728 | 730 | 736 | 12558 |
| 11 | 713 | 717 | 713 | 12558 |
| 12 | 696 | 699 | 685 | 12558 |
| 13 | 743 | 750 | 762 | 12558 |
| 14 | 725 | 721 | 733 | 12558 |
| 15 | 691 | 691 | 740 | 12558 |
| 16 | 789 | 789 | 799 | 12558 |
| 17 | 724 | 725 | 669 | 12558 |
| 18 | 716 | 712 | 722 | 12558 |
| 19 | 762 | 762 | 720 | 12558 |
| 20 | 676 | 673 | 708 | 12558 |
| Mean | 724.6 | 724.5 | 729.5 | 12558 |
| STD | 30.1 | 31.8 | 31.9 | 0 |
Long-range correlation analysis for the SIMU3N data.
| Gene Label | GEO | QUANT | RANK | SIMU3N |
| 1 | 483 | 520 | 512 | 12297 |
| 2 | 471 | 582 | 591 | 10656 |
| 3 | 436 | 523 | 614 | 12506 |
| 4 | 644 | 643 | 744 | 11031 |
| 5 | 677 | 739 | 765 | 11320 |
| 6 | 610 | 543 | 570 | 12413 |
| 7 | 612 | 863 | 788 | 12429 |
| 8 | 802 | 727 | 711 | 12077 |
| 9 | 1743 | 1406 | 1077 | 11898 |
| 10 | 975 | 895 | 920 | 12001 |
| 11 | 1352 | 1330 | 1543 | 12453 |
| 12 | 670 | 707 | 686 | 12480 |
| 13 | 1874 | 1849 | 1890 | 6913 |
| 14 | 1858 | 1765 | 1808 | 9371 |
| 15 | 1925 | 1790 | 1974 | 12469 |
| 16 | 1792 | 1718 | 1796 | 12520 |
| 17 | 1764 | 1526 | 1679 | 12499 |
| 18 | 1769 | 1684 | 1821 | 12509 |
| 19 | 1476 | 1300 | 1569 | 12514 |
| 20 | 2223 | 2307 | 2148 | 12507 |
| Mean | 1207.8 | 1170.9 | 1210.3 | 11743.2 |
| STD | 617.3 | 557.5 | 576.5 | 1402 |
Long-range correlation analysis for the SJCRH data.
| Gene Label | GEO | QUANT | RANK | raw data |
| 1 | 5644 | 462 | 494 | 12481 |
| 2 | 7330 | 3175 | 1431 | 12486 |
| 3 | 4189 | 1480 | 2062 | 12496 |
| 4 | 5218 | 2728 | 1548 | 12493 |
| 5 | 8169 | 1888 | 1064 | 12451 |
| 6 | 8140 | 956 | 1162 | 12482 |
| 7 | 323 | 1169 | 839 | 12480 |
| 8 | 6774 | 1479 | 839 | 12497 |
| 9 | 7676 | 1832 | 2140 | 12390 |
| 10 | 8234 | 794 | 1440 | 12384 |
| 11 | 7652 | 930 | 466 | 12498 |
| 12 | 8266 | 1329 | 708 | 12476 |
| 13 | 8197 | 1343 | 2045 | 12391 |
| 14 | 7422 | 2118 | 2513 | 12501 |
| 15 | 1588 | 1467 | 1011 | 12494 |
| 16 | 7861 | 1931 | 1133 | 12429 |
| 17 | 1292 | 1477 | 1445 | 12489 |
| 18 | 6389 | 2949 | 1456 | 12481 |
| 19 | 7359 | 490 | 514 | 12469 |
| 20 | 4384 | 970 | 787 | 12488 |
| Mean | 6105.4 | 1548.4 | 1254.9 | 12467.8 |
| STD | 2545 2512 | 756 | 589.5 | 38.2 |