| Literature DB >> 12659661 |
Vigdis Nygaard1, Anders Løland, Marit Holden, Mette Langaas, Håvard Rue, Fang Liu, Ola Myklebost, Øystein Fodstad, Eivind Hovig, Birgitte Smith-Sørensen.
Abstract
BACKGROUND: A limiting factor of cDNA microarray technology is the need for a substantial amount of RNA per labeling reaction. Thus, 20-200 micro-grams total RNA or 0.5-2 micro-grams poly (A) RNA is typically required for monitoring gene expression. In addition, gene expression profiles from large, heterogeneous cell populations provide complex patterns from which biological data for the target cells may be difficult to extract. In this study, we chose to investigate a widely used mRNA amplification protocol that allows gene expression studies to be performed on samples with limited starting material. We present a quantitative study of the variation and noise present in our data set obtained from experiments with either amplified or non-amplified material.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12659661 PMCID: PMC153514 DOI: 10.1186/1471-2164-4-11
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Experimental design. For a particular cell line, the material was divided into six batches; two batches of non-amplified RNA were directly labeled (white shapes) and four batches of diluted total RNA were amplified before labeling (grey shapes). These batches were split in two and labeled Cy5 (red) or Cy3 (green). In total, four array experiments were based on non-amplified material, and eight experiments were based on amplified material.
Overview of filtered genes.
| Description | Number of genes |
| 1Genes on an array | 12497 (100%) |
| 2Genes filtered out from all arrays | 1738 (14%) |
| 3Total number of genes after filtering | 10759 (86%) |
| 4Genes only detected in non-amplified material (at least one array) | 64 (0.5%) |
| 5Genes only detected in amplified material (at least one array) | 4377 (35%) |
| 6Genes detected for both groups | 6318 (50.5%) |
1Number of genes printed on an array. 2Genes not detected on any of the twelve arrays. 3Genes with signals on at least on array. 4Genes that were filtered out from all arrays with amplified material, but present on at least one array with non-amplified material. 5Genes that were filtered out from all arrays with non-amplified material, but present on at least one array with amplified material. 6Number of genes that were detected both in arrays with non-amplified material and in arrays with amplified material. These genes are again split into two groups; one with genes fulfilling the criterion of 3 observations in each group (4331 genes); and one with those not fulfilling this criterion (1987 genes).
Filtering per array.
| Arrays hybridized with amplified material | Arrays hybridized with non-amplified material | |||||||||||
| Array number | 1 | 2 | 5 | 6 | 7 | 9 | 10 | 11 | 3 | 4 | 8 | 12 |
| Genes flagged by Genepix | 5192 | 2155 | 1007 | 2152 | 3151 | 2209 | 3212 | 1073 | 4750 | 4909 | 2903 | 4374 |
| 41.55% | 17025% | 8.06% | 17.22% | 25.21% | 17.68% | 25.70% | 8.59% | 38.01% | 39.28% | 23.23% | 35.00% | |
| Genes flagged manually | 15 | 1 | 3 | 5 | 2 | 6 | 10 | 1 | 116 | 15 | 4223 | 16 |
| 0.12% | 0.008% | 0.024% | 0.04% | 0.016% | 0.048% | 0.08% | 0.008% | 0.93% | 0.12% | 33.79% | 0.13% | |
| Additionally Filtered by (spot-background) / background > 1 | 2078 | 1621 | 1055 | 2081 | 2081 | 2033 | 1680 | 2596 | 3226 | 2223 | 2117 | 2521 |
| 16.63% | 12.97% | 8.44% | 16.65% | 16.15% | 16.27% | 13.44% | 20.77% | 25.81% | 17.79% | 16.94% | 20.17% | |
| Sum | 7285 | 3777 | 2065 | 4238 | 5171 | 4238 | 4902 | 3670 | 8092 | 7147 | 9243 | 6911 |
| 58.30% | 30.22% | 16.52% | 33.91% | 41.38% | 33.40% | 39.22% | 29.37% | 64.75% | 57.19% | 73.96% | 55.30% | |
Parameter estimates for the total data set consisting of 10759 genes and for the "conservative set" of 2721 genes.
| Parameter | Type of effect1 | Estimate Total set | Estimate Conserved set |
| μ | Fixed | 10.56 | 12.18 |
| Fixed | 0.086 | -0.028 | |
| Fixed | -0.026 | -0.022 | |
| Fixed | -0.74 | -1.14 | |
| Fixed | -0.0043 | 0.0089 | |
| σ | Random | 0.28 | 0.30 |
| σ | Random | 0.041 | 0.041 |
| σ | Random | 1.36 | 0.98 |
| σ | Random | 0.039 | 0.040 |
| σ | Random | 0.85 | 0.79 |
| σ | Random | 0.74 | 0.54 |
| σ | Random | 0.44 | 0.44 |
| σ | Random | 0.048 | 0.050 |
| σ | Random | 0.22 | 0.028 |
| σε | Random | 0.22 | 0.22 |
Annotations: μ = overall mean signal intensity, C = cell line type, D = labeled dye, P = protocol. 1 Of the fixed effects, C, D, and P have two levels while CP is an interaction between two fixed effects. The random effects were modeled as independently normally distributed random effects with zero mean and constant standard deviation. σis the cell line specific GP effect and is the effect of primary interest.
Signal to Noise Ratio (SNR) for the two data sets.
| Data set | SNR without σ | SNR with σ |
| Total set1 | 9.91 | 5.22 |
| Conservative set2 | 5.38 | 5.30 |
1The total data set included 10579 genes detected on at least one of the twelve arrays. 2The conservative data set included genes found to have equal ratio expression between amplified and non-amplified material. σis the cell line specific GP effect and is the effect of primary interest.
Figure 2(A) Plot of the difference in log2-ratio between the amplified and the non-amplified material for the log2-ratio of the two cell-lines (MT-1, U2OS) vs. the p-value for each of the 4331 genes, calculated for hypothesis testing. (B) Plot of the average log2-ratio of the two cell-lines (MT-1 and U2OS), for the amplified and the non-amplified material for each of the 4331 genes investigated during hypothesis testing. The genes in the "conservative set" are in green, the genes in the "rejected set" are in red and the genes in the "undetermined set" are in black.