| Literature DB >> 19936079 |
Werner Van Belle1, Nancy Gerits, Kirsti Jakobsen, Vigdis Brox, Marijke Van Ghelue, Ugo Moens.
Abstract
To perform a quantitative analysis with gene-arrays, one must take into account inaccuracies (experimental variations, biological variations and other measurement errors) which are seldom known. In this paper we investigated amplification and noise propagation related errors by measuring intensity dependent variations. Based on a set of control samples, we create confidence intervals for up and down regulations. We validated our method through a qPCR experiment and compared it to standard analysis methods (including loess normalization and filtering methods based on genetic variability). The results reveal that amplification related errors are a major concern.Entities:
Keywords: Microarray analysis; confidence intervals; differential expression; downregulation; gene-array; measurement errors; upregulation
Year: 2007 PMID: 19936079 PMCID: PMC2759122
Source DB: PubMed Journal: Gene Regul Syst Bio ISSN: 1177-6250
Overview of the different experiments.
| Experiment | Constitutive Active MK5 | FKRP Knockdown | TAF4 Knockdown | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CY5/CY3 | DIG | DIG | ||||||||
| KTH Rat27K Oligo m.a, OperonV3.0 Tecan HS 4800 Genepix 4000B Genepix Pro 6.1.0.2 | Applied Biosystems | Applied Biosystems | ||||||||
| Normal (Cy5) | MK5 Induced (Cy3) | Control (Both) | siRNA #1 | siRNA #2 | Scrambled | SiRNA | Scrambled | siRNA | Scrambled | |
| 3 | 3 | 1 | 3 | 2 | 3 | 4 | 4 | 4 | 3 | |
| PC12 TetOff for MK5L337A | C2C12 | SK-N-DZ | HeLa | |||||||
| 1. Normal vs MK5 Induced | 2. siRNA#1 vs Scrambled | 4. siRNA vs Scrambled | 5. siRNA vs Scrambled | |||||||
| Microarray facilityTromsø Loess normalization | UNIGEN (Trondheim) Quantile Normalization | UNIGEN (Trondheim) Quantile Normalization | ||||||||
| 27468 reported of which 4007 in agreement | 0 | 0 | not submitted | 70 | ||||||
| Both Quantile and no normalization | Applied Biosystems Inter-array normalization | Applied Biosystems Inter-array normalization | ||||||||
| 1422 | 2977 | 576 | 661 | 2497 (22 validated through qPCR) | ||||||
| 311, with 10 wrong in the standard analysis | 0 | 0 | not applicable | 65 | ||||||
Figure 1Scatterplot of the control slides and the two measurements of the MK5 experiments. The red points are from slide 1. The green points are from slide 3. The blue points are from the control slide. Horizontally the red channel is set out, vertically the green. The bend is due to quenching (Kubista, 1994). The variance of the control slide can be observed in the width of the blue area. It increases up to 32768 (indicated with gray dotted lines), after which it decreases again. In a perfect world, the control sample should have the same red as green value, and be a straight line.
Figure 2Error Distribution of various up/down regulation experiments. Horizontally the spot intensity is set out. Vertically the measurement error is set out as a cumulative distribution function. The cumulative distribution expresses the probability that a specific difference will occur due to experimental, biological or measurement variations. The colors are more intense within the 95% confidence interval. With such a diagram one can to determine the limits in which a regulation is very likely to fall. The multiple diagrams are measurement errors obtained from different experiments and different machines. The MK5 sample was Cy5/Cy3 stained and scanned on a Tecan scanner. All other samples were DIG labeled and scanned on an Applied Biosystems 1700 microarray scanner. As an example how to read the diagrams: in the MK5 diagram (top right) we find that the biological variation is larger for spots with intensity 32768. If a measured spot has intensity 32768, then its 95% confidence interval on the difference between the two channels is around[−9000, 9000] (marked with a white arrow).
Gene regulation induced by MK5 activation. Each regulation is listed as a term with a confidence interval covering 95% of the real values. Gene regulation is calculated as the mean of all the measured oligosequences/probes. The reported confidence interval is the result of a convolution of the respective error distributions. The yellow row is explained in detail in the text.
| Difference | Summed Values | Regulation Ratio | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene # | Confidence Interval | At least | Measured | At most | Green | Red | Count | Direction | At least | Measured | At most |
| 1 | [−6430.72:6840.32] | −39267.7 | −32837 | −25996.7 | 39613 | 6776 | 2 | down | 4.84 | 5.85 | 114.73 |
| 2 | [−2447.36:2242.56] | −7807.36 | −5360 | −3117.44 | 6191 | 831 | 2 | down | 4.75 | 7.45 | inf |
| 3 | [−2355.2:2129.92] | −5122.2 | −2767 | −637.08 | 3807 | 1040 | 2 | down | 1.61 | 3.66 | inf |
| 4 | [−2775.04:2754.56] | 1531.96 | 4307 | 7061.56 | 2531 | 6838 | 6 | up | 1.61 | 2.7 | inf |
| 5 | [−2437.12:2447.36] | −5919.12 | −3482 | −1034.64 | 5215 | 1733 | 4 | down | 1.6 | 3.01 | inf |
| 6 | [−2037.76:2457.6] | 472.24 | 2510 | 4967.6 | 809 | 3319 | 2 | up | 1.58 | 4.1 | inf |
| 7 | [−3532.8:3430.4] | 2701.2 | 6234 | 9664.4 | 4818 | 11052 | 4 | up | 1.56 | 2.29 | inf |
| 8 | [−1812.48:1536] | −3697.48 | −1885 | −349 | 2514 | 629 | 2 | down | 1.55 | 4 | inf |
| 9 | [−2590.72:2621.44] | −6302.72 | −3712 | −1090.56 | 5684 | 1972 | 6 | down | 1.55 | 2.88 | inf |
| 10 | [−2170.88:2314.24] | 969.12 | 3140 | 5454.24 | 1854 | 4994 | 6 | up | 1.52 | 2.69 | inf |
| 11 | [−3461.12:3686.4] | 2038.88 | 5500 | 9186.4 | 3982 | 9482 | 2 | up | 1.51 | 2.38 | 32.08 |
| 12 | [−2283.52:2048] | −5168.52 | −2885 | −837 | 4528 | 1643 | 2 | down | 1.51 | 2.76 | inf |
| 13 | [−8448:8704] | 10789 | 19237 | 27941 | 21540 | 40777 | 2 | up | 1.5 | 1.89 | 3.18 |
| 14 | [−2754.56:3368.96] | 765.44 | 3520 | 6888.96 | 1555 | 5075 | 2 | up | 1.49 | 3.26 | inf |
| 15 | [−1771.52:1986.56] | 438.48 | 2210 | 4196.56 | 914 | 3124 | 2 | up | 1.48 | 3.42 | inf |
| 16 | [−6082.56:5898.24] | 5740.44 | 11823 | 17721.2 | 12046 | 23869 | 2 | up | 1.48 | 1.98 | 3.88 |
| 17 | [−2078.72:2211.84] | −4762.72 | −2684 | −472.16 | 3708 | 1024 | 2 | down | 1.46 | 3.62 | inf |
| 18 | [−2119.68:2037.76] | −4787.68 | −2668 | −630.24 | 4044 | 1376 | 2 | down | 1.46 | 2.94 | inf |
| 19 | [−1781.76:1792] | 314.24 | 2096 | 3888 | 688 | 2784 | 2 | up | 1.46 | 4.05 | inf |
| 20 | [−3932.16:4259.84] | 2675.84 | 6608 | 10867.8 | 5984 | 12592 | 2 | up | 1.45 | 2.1 | 7.3 |
| 21 | [−10455:10915.8] | −36683 | −26228 | −15312.2 | 85832 | 59604 | 2 | down | 1.26 | 1.44 | 1.75 |
| 22 | [−7700.48:7782.4] | 5041.52 | 12742 | 20524.4 | 20556 | 33298 | 2 | up | 1.25 | 1.62 | 2.61 |
| 23 | [−2140.16:2273.28] | 320.84 | 2461 | 4734.28 | 1321 | 3782 | 2 | up | 1.24 | 2.86 | inf |
| 24 | [−2621.44:2979.84] | −6161.44 | −3540 | −560.16 | 5883 | 2343 | 2 | down | 1.24 | 2.51 | inf |
| 25 | [−3450.88:3952.64] | −8665.88 | −5215 | −1262.36 | 10529 | 5314 | 2 | down | 1.24 | 1.98 | 5.65 |
| 26 | [−2232.32:2600.96] | −5150.32 | −2918 | −317.04 | 4264 | 1346 | 4 | down | 1.24 | 3.17 | inf |
| 27 | [−2181.12:2099.2] | 202.88 | 2384 | 4483.2 | 867 | 3251 | 2 | up | 1.23 | 3.75 | inf |
| 28 | [−3758.08:3768.32] | 1212.92 | 4971 | 8739.32 | 5296 | 10267 | 4 | up | 1.23 | 1.94 | 6.72 |
| 29 | [−4925.44:5857.28] | 2682.56 | 7608 | 13465.3 | 11941 | 19549 | 2 | up | 1.22 | 1.64 | 3.21 |
| 30 | [−2426.88:2887.68] | 418.12 | 2845 | 5732.68 | 1909 | 4754 | 2 | up | 1.22 | 2.49 | inf |
| 31 | [−5980.16:5867.52] | −14564.2 | −8584 | −2716.48 | 20997 | 12413 | 2 | down | 1.22 | 1.69 | 3.26 |
| 32 | [−4423.68:4966.4] | −11228.7 | −6805 | −1838.6 | 15221 | 8416 | 4 | down | 1.22 | 1.81 | 3.81 |
| 33 | [−1771.52:1484.8] | −3399.52 | −1628 | −143.2 | 2307 | 679 | 2 | down | 1.21 | 3.4 | inf |
| 34 | [−3491.84:3481.6] | 331.16 | 3823 | 7304.6 | 5513 | 9336 | 4 | up | 1.06 | 1.69 | 4.6 |
Figure 3Plots illustrating the difference between standard filtered results (based on loess normalization and a consensus for both slides) and the filtering based on the confidence intervals for the MK5 experiment. A) the red spots are reported by the standard method but no longer by the confidence interval method. The green spots are the control slide, illustrating the large variance of the measurement. All spots omitted in the confidence interval method were too close to the measurement error to be useful. B) The red spots are those reported in the confidence interval method but not in the standard analysis. The green spots again represent the control slide.
Quantitative PCR analysis to verify differentially expressed genes. A number of the genes that were reported to be expressed differentially by the microarray analysis were measured using quantitative PCR.
| qPCR results | Microarray results | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| TAF4 | # | Mean | CT | Ratio | Fixed | Ratio | least | most | Comments |
| Hela Cells | 1 | 29.88 | down | 1.33 | 1.6 | down | 1.2 | 2.45 | OK |
| 2 | 29.72 | down | 1.32 | 1.59 | up | 1.07 | 1.66 | NO, | |
| 3 | 29.41 | up | 1.03 | 1.24 | up | 1.22 | 1.78 | OK | |
| 4 | 30.84 | up | 1.09 | 1.32 | up | 7.84 | inf | NO, | |
| 5 | 25.46 | up | 2.76 | 3.34 | up | 2.64 | 5.01 | OK, | |
| 6 | down | large | large | down | 122.53 | inf | OK, | ||
| 7 | 38.93 | down | 2.67 | 3.23 | down | 3.57 | inf | OK, | |
| 8 | 38.26 | down | 1.25 | 1.52 | down | 3.18 | 8.5 | OK, | |
| 9 | 34.02 | up | 1.04 | 1.26 | up | 1.13 | 1.88 | OK | |
| 10 | 31.1 | down | 1.2 | 1.45 | down | 1.22 | 2 | OK | |
| 11 | 26.09 | down | 1.02 | 1.23 | down | 1.23 | 1.91 | OK | |
| 12 | 35.48 | up | 1.38 | 1.67 | up | 1.03 | 1.59 | NO, | |
| 13 | 34.03 | down | 1.05 | 1.27 | up | 1.1 | 1.65 | NO, | |
| 14 | 35.99 | up | 1.03 | 1.25 | down | 1.11 | 1.54 | NO, | |
| 15 | 31.38 | down | 2.04 | 2.47 | down | 1.5 | 2.23 | NO, | |
| 16 | 31.01 | up | 1.06 | 1.28 | up | 1.08 | 1.65 | OK | |
| 17 | 34.67 | up | 1.49 | 1.8 | up | 1.36 | 3.32 | OK | |
| SK-N-DZ Cells | 18 | 28.73 | down | 1.47 | 1.47 | down | 1.16 | 1.7 | OK |
| 19 | 28.15 | down | 1.52 | 1.52 | down | 1.03 | 1.98 | OK | |
| 20 | 35.02 | up | 1.38 | 1.38 | up | 1.06 | 2.96 | OK | |
| 21 | 33.11 | down | 1.49 | 1.49 | down | 1.09 | 1.87 | OK | |
| 22 | 38.04 | up | 1.24 | 1.24 | down | 14.99 | inf | NO, | |
All results are reported as a ratio from the scrambled siRNA to the specific siRNA
1) HeLa cells results have been multiplied to account for transfection efficiency; 2) Regulation direction reported wrong; 3) qPCR result difficult to obtain due to large CP values; 4) Microarray upperbound too low; 5) Difficult consensus on PCR results; 6) Also listed in the genvar analysis
Wrongly reported datapoints in the loess normalized data. We compared the regulations of our method to a standard loess normalization and found 10 spots for which the two methods disagreed qualitatively. Each case contains the data as found on the non-normalized microarray (reported in the two first green/red columns). The reported log ratio after loess normalization is given in the second row of each case. The reported confidence interval is presented in the first row of each case.
| confidence intervals | C.I. | Difference | Values | Regulation | Factor | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Low | Norm | Hi | Green | Red | Count | Lo | Mes | ||||
| loess | D4D1 | D6D3 | |||||||||
| non-normalized | Slide 1 | Slide 3 | |||||||||
| Green | Red | Green | Red | ||||||||
| Rn 30026543 | [− 3983.36:3993.6] | −277.64 | −4261 | −8254.6 | 10661 | 6400 | 2 | down | 1.03 | 0.6 | confidence intervals |
| 0.49 | 0.55 | up | loess | ||||||||
| 4336 | 2218 | 7262 | 3661 | non-normalized | |||||||
| Rn 30009746 | [−1904.64:1812.48] | −41.36 | −1946 | −3758.48 | 2743 | 797 | 2 | down | 1.02 | 0.29 | confidence intervals |
| 0.12 | 0.02 | up | loess | ||||||||
| 911 | 683 | 2001 | 113 | non-normalized | |||||||
| Rn 30025831 | [−2918.4:3246.08] | −545.6 | −3464 | −6710.08 | 8138 | 4674 | 2 | down | 1.09 | 0.57 | confidence intervals |
| 0.21 | 0.41 | up | loess | ||||||||
| 2274 | 1383 | 6508 | 2910 | non-normalized | |||||||
| Rn 30026511 | [−8212.48:8407.04] | −1460.52 | −9673 | −18080 | 41854 | 32181 | 2 | down | 1.04 | 0.77 | confidence intervals |
| 0.43 | 0.06 | up | loess | ||||||||
| 10631 | 8385 | 34489 | 20727 | non-normalized | |||||||
| Rn 30023124 | [−5539.84:5683.2] | 11256.8 | 5717 | 33.8 | 14262 | 19979 | 2 | up | 1 | 1.4 | confidence intervals |
| −0.13 | −0.11 | down | loess | ||||||||
| 7556 | 8168 | 8065 | 10364 | non-normalized | |||||||
| Rn 30026938 | [−2959.36:2826.24] | −580.64 | −3540 | −6366.24 | 5297 | 1757 | 2 | down | 1.18 | 0.33 | confidence intervals |
| 0.02 | 0.02 | up | loess | ||||||||
| 2104 | 827 | 3590 | 880 | non-normalized | |||||||
| Rn 30026618 | [−7618.56:8785.92] | 17364.6 | 9746 | 960.08 | 109415 | 119161 | 2 | up | 1.01 | 1.09 | confidence intervals |
| −0.01 | −0.13 | down | loess | ||||||||
| 53944 | 57496 | 57493 | 60704 | non-normalized | |||||||
| Rn 30026891 | [−2939.88:3481.6] | −444.12 | −3383 | −6864.6 | 7455 | 4072 | 2 | down | 1.08 | 0.55 | confidence intervals |
| 0.01 | 0.1 | up | loess | ||||||||
| 2347 | 1004 | 5737 | 2757 | non-normalized | |||||||
| Rn 30000378 | [−6338.56:7075.84] | 13860.6 | 7522 | 446.16 | 114279 | 121801 | 2 | up | 1 | 1.07 | confidence intervals |
| −0.26 | −0.12 | down | loess | ||||||||
| 63294 | 64294 | 53346 | 56126 | non-normalized | |||||||
| Rn 30018614 | [−1904.64:1853.44] | 3883.64 | 1979 | 125.56 | 851 | 2830 | 2 | up | 1.07 | 3.33 | confidence intervals |
| −0.1 | 0 | down | loess | ||||||||
| 528 | 958 | 311 | 1711 | non-normalized | |||||||