| Literature DB >> 18173834 |
Ana C Fierro1, Raphael Thuret, Kristof Engelen, Gilles Bernot, Kathleen Marchal, Nicolas Pollet.
Abstract
BACKGROUND: As an alternative to the frequently used "reference design" for two-channel microarrays, other designs have been proposed. These designs have been shown to be more profitable from a theoretical point of view (more replicates of the conditions of interest for the same number of arrays). However, the interpretation of the measurements is less straightforward and a reconstruction method is needed to convert the observed ratios into the genuine profile of interest (e.g. a time profile). The potential advantages of using these alternative designs thus largely depend on the success of the profile reconstruction. Therefore, we compared to what extent different linear models agree with each other in reconstructing expression ratios and corresponding time profiles from a complex design.Entities:
Mesh:
Year: 2008 PMID: 18173834 PMCID: PMC2265676 DOI: 10.1186/1471-2105-9-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Experimental microarray designs used in this study. Circles represent samples or time points, and arrows represent a direct hybridization between two samples. The arrows point from the time point labeled with Cy3 to the time point labeled with Cy5. (a) Interwoven design (first dataset). Grey arrows were removed to generate a single loop design (see (b)). (b) Loop design (second dataset).
Pairwise correlation between ratios estimates for the loop design
| Method 1 | Method 2 | T3/T1 | T4/T1 | T5/T1 | T6/T1 | Mean |
| lmbr/lmbr_dye | limmaQual | 0.9966 | 0.9998 | 0.9648 | 0.9909 | 0.9880 |
| lmbr/lmbr_dye | anovaFix | 0.9899 | 0.9913 | 0.9721 | 0.9856 | 0.9848 |
| lmbr/lmbr_dye | anovaMix | 0.9829 | 0.9751 | 0.9420 | 0.9549 | 0.9637 |
| limmaQual | anovaFix | 0.9889 | 0.9918 | 0.9298 | 0.9775 | 0.9720 |
| limmaQual | anovaMix | 0.9810 | 0.9758 | 0.8950 | 0.9467 | 0.9496 |
| anovaFix | anovaMix | 0.9936 | 0.9847 | 0.9726 | 0.9694 | 0.9801 |
Correlation between ratios estimated by each pair of applied methods (column 1 and column 2) for a loop design. Each ratio corresponds to a time dependent change in expression as compared to the first time point (T1). The last column corresponds to the mean correlation of the 4 estimates. Since the loop design is balanced with respect to the dyes, the results for lmbr and lmbr_dye were the same (see 'Methods' section), which is why they are not treated differently.
Mean similarity between profiles for both the interwoven and the loop design
| Interwoven design | Loop design | ||||||
| lmbr | lmbr_dye | limmaQual | anovaFix | lmbr/lmbr_dye | limmaQual | anovaFix | |
| lmbr_dye | 0.9477 | 1.0000 | |||||
| limmaQual | 0.9940 | 0.9359 | 0.9844 | ||||
| anovaFix | 0.9321 | 0.9572 | 0.9157 | 0.9514 | 0.9252 | ||
| anovaMix | 0.9138 | 0.9373 | 0.8989 | 0.9767 | 0.9186 | 0.8934 | 0.9611 |
Values in the table correspond to the similarity between any two methods, expressed as the mean profile similarity of the genes. Since the loop design is balanced with respect to the dyes, the results for lmbr and lmbr_dye were the same (see 'Methods' section), which is why they are not treated differently. No filtering applied and similarity is assessed for all 2999 profile estimates.
Figure 2Examples of reconstructed profiles for two representative genes from the interwoven design. For the gene-specific methods (based on log-ratios estimate) the ratios are expressed relative to T1 and the ratio T1/T1 is set to zero. For the two-stage (ANOVA) methods, the estimated VG effect (gene-variety) is plotted. (a) Estimated profiles for the Ribosomal like gene under the interwoven design. (b) Reconstructed time profile for the gene MGC85244 under the interwoven design.
Mean similarity between profiles after filtering for both the interwoven and the loop design
| Interwoven design | Loop design | ||||||
| lmbr | lmbr_dye | limmaQual | anovaFix | lmbr/lmbr_dye | limmaQual | anovaFix | |
| lmbr_dye | 0.9834 | 1.0000 | |||||
| limmaQual | 0.9980 | 0.9799 | 0.9920 | ||||
| anovaFix | 0.9800 | 0.9948 | 0.9755 | 0.9961 | 0.9851 | ||
| anovaMix | 0.9748 | 0.9905 | 0.9701 | 0.9957 | 0.9866 | 0.9728 | 0.9905 |
Mean profile similarity using a filtering threshold of 0.4 on all profiles estimated by each of the methods. A pairwise similarity comparison is made for all profile pairs (corresponding to the same gene) estimated by each of the two methods compared, for which at least one profile is above the filtering threshold.
Assessing the effect of array failure on estimated ratios for the interwoven design
| Array removed | lmbr | lmbr_dye | limmaQual | anovaFix | anovaMix | Conditions affected |
| 1 | 0.9615 | 0.9812 | 0.9623 | 0.9824 | 0.9784 | T2/T3 |
| 2 | 0.8905 | 0.9277 | 0.8768 | 0.9317 | 0.9431 | T1/T3 |
| 3 | 0.9606 | 0.8790 | 0.9521 | 0.8951 | 0.8943 | T3/T5 |
| 4 | 0.9080 | 0.9353 | 0.8821 | 0.9478 | 0.9539 | T4/T1 |
| 5 | 0.9601 | 0.9632 | 0.9505 | 0.9569 | 0.9592 | T6/T1 |
| 6 | 0.9773 | 0.9863 | 0.9742 | 0.9852 | 0.9847 | T5/T2 |
| 7 | 0.9238 | 0.9585 | 0.9317 | 0.9549 | 0.9662 | T5/T6 |
| 8 | 0.9615 | 0.9599 | 0.9615 | 0.9617 | 0.9690 | T2/T4 |
| 9 | 0.9816 | 0.9859 | 0.9613 | 0.9836 | 0.9845 | T4/T6 |
| Mean | 0.9472 | 0.9530 | 0.9392 | 0.9555 | 0.9592 | |
The different methods for which the influence of array failure was assessed are represented in the columns. Each row shows the mean correlation between the corresponding estimated ratios from the complete design and those obtained from a defect design (where one array was removed compared to the complete design). Mean: shows the overall mean correlation for a given method.
Assessing the effect of array failure on estimated ratios for the loop design
| Array removed | lmbr | anovaFix | anovaMix | Conditions affected |
| 1 | 0.5964 | 0.6461 | 0.5312 | T1/T3 |
| 2 | 0.7161 | 0.7545 | 0.6559 | T3/T5 |
| 3 | 0.6713 | 0.8883 | 0.7606 | T5/T6 |
| 4 | 0.5697 | 0.7883 | 0.6637 | T6/T4 |
| 5 | 0.4359 | 0.6042 | 0.5534 | T4/T1 |
| Mean | 0.5979 | 0.7363 | 0.6330 | |
The different methods for which the influence of array failure was assessed are represented in the columns. Each row shows the mean correlation between the corresponding estimated ratios from the complete design and those obtained from a defect design (where one array was removed compared to the complete design). Mean: shows the overall mean correlation for a given method. Lmbr_dye and limmaQual were not evaluated as in this particular case they lose their main differing characteristic compared to lmbr.
Effect of additional normalization procedures on estimating gene profiles from both the interwoven design and the loop design
| Interwoven design | |||||
| Normalization methods | lmbr | lmbr_dye | limmaQual | anovaFix | anovaMix |
| none/quantile | 0.9602 | 0.9576 | 0.9599 | 0.9605 | 0.9584 |
| none/scale | 0.9900 | 0.9881 | 0.9914 | 0.9867 | 0.9848 |
| scale/quantile | 0.9540 | 0.9531 | 0.9557 | 0.9547 | 0.9566 |
| Loop design | |||||
| lmbr | lmbr_dye | limmaQual | anovaFix | anovaMix | |
| none/quantile | 0.9520 | 0.9520 | 0.9534 | 0.9601 | 0.9481 |
| none/scale | 0.9822 | 0.9822 | 0.9793 | 0.9835 | 0.9778 |
| scale/quantile | 0.9355 | 0.9355 | 0.9361 | 0.9433 | 0.9370 |
Similarities between profiles were assessed using the mean cosine similarity measure. Rows indicate the different normalization procedures. Columns indicate the different models used to reconstruct the profiles.
Concentration (copies per cell) of the control clones spiked
| Spike No. | Spike Mix 1 | Spike Mix 2 | Spike Mix 3 | Spike Mix 4 | Spike Mix 5 | Spike Mix 6 | Spike Mix 7 | Reference Mix |
| 1, 2 | 10,000 | 0 | 0.1 | 1 | 10 | 100 | 1,000 | 100 |
| 3, 4 | 1,000 | 10,000 | 0 | 0.1 | 1 | 10 | 100 | 100 |
| 5, 6 | 100 | 1,000 | 10,000 | 0 | 0.1 | 1 | 10 | 100 |
| 7, 8 | 10 | 100 | 1,000 | 10,000 | 0 | 0.1 | 1 | 100 |
| 9, 10 | 1 | 10 | 100 | 1,000 | 10,000 | 0 | 0.1 | 100 |
| 11, 11a | 0.1 | 1 | 10 | 100 | 1,000 | 10,000 | 0 | 100 |
| 12, 13 | 0 | 0.1 | 1 | 10 | 100 | 1,000 | 10,000 | 100 |
From the total of 14 arrays, 7 were hybridized with the respective spike mixes labeled in Cy5 against the reference mix labeled in Cy3. The remaining 7 arrays were hybridized with the spike mixes labeled in Cy3 against the reference mix labeled in Cy5. Spike 11a was removed from analysis due to quality issues (Allemeersch et al., 2005 [17]).
Figure 3Spike-in expression ratio estimates. Reconstructed expression ratio estimates of spikes 7 (+ markers) and 8 (x markers) are plotted for lmbr/lmbr_dye/limmaQual (solid line), anovaFix (dotted line), and anovaMix (dashed line). Concentrations (cpc; copies per cell) of 104 cpc (panel A), 10 cpc (panel B), and 10-1 cpc (panel C) were used as reference points. Estimated ratios were sorted from low to high concentrations. The solid grey line (o-markers) corresponds to the expected ratios for the known concentrations.