| Literature DB >> 18271965 |
Tristan Mary-Huard1, Julie Aubert, Nadera Mansouri-Attia, Olivier Sandra, Jean-Jacques Daudin.
Abstract
BACKGROUND: In individually dye-balanced microarray designs, each biological sample is hybridized on two different slides, once with Cy3 and once with Cy5. While this strategy ensures an automatic correction of the gene-specific labelling bias, it also induces dependencies between log-ratio measurements that must be taken into account in the statistical analysis.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18271965 PMCID: PMC2277403 DOI: 10.1186/1471-2105-9-98
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Three different balanced reverse dye designs for the comparison of 2 treatments
| 1 | array | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Cy5 | A1 | B5 | A3 | B9 | A5 | B6 | A7 | B10 | A9 | B9 | |
| Cy3 | B3 | A2 | B8 | A4 | B2 | A6 | B1 | A8 | B4 | A10 | |
| 2 | array | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Cy5 | A1 | B1 | A2 | B2 | A3 | B3 | A4 | B4 | A5 | B5 | |
| Cy3 | B1 | A2 | B2 | A3 | B3 | A4 | B4 | A5 | B5 | A1 | |
| 3 | array | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Cy5 | A1 | B1 | A2 | B2 | A3 | B3 | A4 | B4 | A5 | B5 | |
| Cy3 | B1 | A1 | B2 | A2 | B3 | A3 | B4 | A4 | B5 | A5 | |
Three different balanced reverse dye designs for the comparison of 2 treatments (A and B), with an equal number of slides. Astands for the ibiological sample in condition A. (1) Globally balanced design, with 10 biological samples per condition. (2) Individually-balanced design with 5 biological samples per condition. (3) Dye-swap design with 5 biological samples per condition.
Actual level of the 5 test procedures in one simulation of 10 000 genes
| Method | 5 | 10 | 20 | 30 | 5 | 10 | 20 | 30 |
| Naive | 6.9 (0.2) | 7.3 (0.2) | 7.3 (0.2) | 7.5 (0.2) | 13.2 (0.3) | 13.9 (0.3) | 14.0 (0.3) | 14.2 (0.3) |
| Unbiased Paired | 5.2 (0.2) | 5.2 (0.2) | 5.2 (0.2) | 5.3 (0.2) | 8.2 (0.3) | 6.9 (0.2) | 6 (0.2) | 5.8 (0.2) |
| Unbiased Unpaired | 2.1 (0.1) | 1.3 (0.1) | 1.0 (0.1) | 1 (0.1) | 4.6 (0.2) | 3.4 (0.2) | 2.7 (0.1) | 2.9 (0.2) |
| ML | 8.5 (0.3) | 8.6 (0.3) | 8.3 (0.3) | 8.3 (0.3) | 12.5 (0.4) | 11.1 (0.3) | 9.9 (0.3) | 9.8 (0.3) |
| REML | 4.7 (0.2) | 4.2 (0.2) | 4.5 (0.2) | 4.9 (0.2) | 14.7 (0.4) | 8.5 (0.3) | 5.9 (0.2) | 5.5 (0.2) |
Actual mean level (standard error) of the 5 test procedures, for low ( = 0.5, left) and high ( = 2, right) values of biological variance, and different number of samples n in each condition (in column). The requested nominal threshold is 5%.
Power of the UU, UP and REML test procedures
| Nb Samples | UU | UP | REML | UU | UP | REML | |
| 5 | 0.5 | 5.6 | 13.6 | 10.6 | 55.5 | 92.1 | 86.75 |
| 5 | 2 | 2.8 | 5.0 | 12.95 | 17.9 | 29.4 | 34.75 |
| 10 | 0.5 | 13.2 | 39.3 | 33.97 | 77.8 | 100.0 | 99.64 |
| 10 | 2 | 3.5 | 7.8 | 9.06 | 45.0 | 63.5 | 63.06 |
| 20 | 0.5 | 35.0 | 80.1 | 78.13 | 98.8 | 100.0 | 100.0 |
| 20 | 2 | 7.3 | 14.5 | 13.93 | 82.6 | 94.8 | 94.53 |
| 30 | 0.5 | 51.9 | 95.5 | 95.05 | 100.0 | 100.0 | 100.0 |
| 30 | 2 | 12.1 | 22.5 | 21.74 | 96.2 | 99.6 | 99.53 |
Power (probability of rejecting H0 × 100) of the different test procedures to detect a low (μ = 1, left) or high (μ = 3, right) differential expression.
CPU times of procedures UP and REML
| UP CPU | REML CPU | No REML CV | |
| 5 | 2.3 | 787 | 56.9 |
| 10 | 2.6 | 212 | 5 |
| 20 | 2.8 | 467 | 0 |
| 30 | 3.2 | 1046 | 0.16 |
User CPU time of procedures (UP) and (REML), for σ2 = 0.5 and different numbers of samples. The last column provides the average number of genes for which REML did not converge.
Figure 1Venn diagram for the embriogenomics experiment. Comparison of the DE genes obtained by four methods. Vertical right rectangle: REML, horizontal low rectangle: UP, bone: N and circle: ML.
Figure 2Comparison of the standard errors obtained with ML, REML and UP for the REML-DE genes of the embriogenomics experiment. Left: REML estimates (y-axis) versus UP estimates (x-axis) of the standard error. Center: REML estimates versus ML estimates. Right: UP estimates versus ML estimates.
Figure 3Mean difference versus standard error for the REML-differentially expressed genes of the embriogenomics experiment. Standard error of the difference obtained by REML (y-axis) versus mean difference between the two conditions (x-axis). Black points are not found DE by other methods than REML.
Figure 4The Teleofish experiment design.
Lists of genes for the Teleofish experiment
| Oleksiak list [8] | UP list |
| RAN GTP binding protein hypo P FLJ20727 ribosomal protein L27 dihydrolipoamide dehydrogenase GTP binding protein | |
| Steroidogenic acute regulatory protein hypo P FLJ11275 capping protein muscle Z line orla C4 surface glycoprotein HT7 precursor methionine adeno. regulatory Von Willebrand factor succinate dehydrogenase complex KIAA1481 protein protein disulfide isomerase annexin V | Thioredoxin nascent polypeptide associated dnaK type molec. chap. prec. ribosomal protein S16 |
Lists of genes whose expression was significantly different between populations. The first 5 genes are found differentially expressed by both methods.
Figure 5Clustering tree for the Teleofish dataset. Clustering tree for the Teleofish dataset, obtained from the second list of differentially expressed genes of Table 5.