| Literature DB >> 17059609 |
André Fujita1, João Ricardo Sato, Leonardo de Oliveira Rodrigues, Carlos Eduardo Ferreira, Mari Cleide Sogayar.
Abstract
BACKGROUND: With the development of DNA hybridization microarray technologies, nowadays it is possible to simultaneously assess the expression levels of thousands to tens of thousands of genes. Quantitative comparison of microarrays uncovers distinct patterns of gene expression, which define different cellular phenotypes or cellular responses to drugs. Due to technical biases, normalization of the intensity levels is a pre-requisite to performing further statistical analyses. Therefore, choosing a suitable approach for normalization can be critical, deserving judicious consideration.Entities:
Mesh:
Year: 2006 PMID: 17059609 PMCID: PMC1636075 DOI: 10.1186/1471-2105-7-469
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Smoothing parameters used for each microarray dataset. For Loess, it is the span value, for Splines and Wavelets it is the number of functions, for Kernel and SVR it is the maximum value minus the minimum value multiplied by the number described in the table.
| 0.30 | 0.10 | 0.10 | |
| 10.00 | 20.00 | 20.00 | |
| 16.00 | 64.00 | 16.00 | |
| 0.50 | 0.50 | 0.70 | |
| 0.20 | 0.20 | 0.60 |
Figure 1The minimum mean square error for three different simulated microarray datasets. From left to right: 1) sinusoid shape; 2) banana shape; 3) mix shape. The Kernel regression was not included in this figure because its MSE is 103 orders of magnitude greater than the other normalization methods.
The mean square errors of estimated gene expression levels for simulated cDNA microarray data with differentially expressed genes inserted under the low expression levels condition.
| Percentage of DEG | Sinusoid | Banana | Mixed | |||||||
| Method | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | |
| 5% | Loess | 0.00038 | 0.00039 | 0.00040 | 0.00029 | 0.00029 | 0.00029 | 0.04806 | 0.04816 | 0.04825 |
| Splines | 0.00035 | 0.00036 | 0.00037 | 0.00027 | 0.00028 | 0.00028 | 0.04828 | 0.04839 | 0.04848 | |
| Wavelets | 0.00033 | 0.00034 | 0.00035 | 0.00127 | 0.00128 | 0.00128 | 0.04816 | 0.04827 | 0.04835 | |
| Kernel | 0.03781 | 0.03782 | 0.03783 | 0.14368 | 0.14404 | 0.14416 | 0.19869 | 0.19888 | 0.20098 | |
| SVR | 0.00031 | 0.00032 | 0.00033 | 0.00016 | 0.00016 | 0.00017 | 0.04729 | 0.04733 | 0.04738 | |
| 10% | Loess | 0.00047 | 0.00048 | 0.00050 | 0.00038 | 0.00038 | 0.00039 | 0.04631 | 0.04646 | 0.04661 |
| Splines | 0.00044 | 0.00045 | 0.00047 | 0.00037 | 0.00038 | 0.00039 | 0.04649 | 0.04663 | 0.04678 | |
| Wavelets | 0.00042 | 0.00043 | 0.00045 | 0.00117 | 0.00118 | 0.00118 | 0.04637 | 0.04651 | 0.04666 | |
| Kernel | 0.03780 | 0.03781 | 0.03783 | 0.15535 | 0.15574 | 0.15602 | 0.19177 | 0.19337 | 0.19596 | |
| SVR | 0.00040 | 0.00041 | 0.00043 | 0.00031 | 0.00032 | 0.00033 | 0.04543 | 0.04548 | 0.04556 | |
| 15% | Loess | 0.00055 | 0.00057 | 0.00059 | 0.00037 | 0.00038 | 0.00040 | 0.05157 | 0.05177 | 0.05194 |
| Splines | 0.00053 | 0.00055 | 0.00057 | 0.00036 | 0.00036 | 0.00038 | 0.05180 | 0.05199 | 0.05215 | |
| Wavelets | 0.00050 | 0.00053 | 0.00054 | 0.00085 | 0.00087 | 0.00088 | 0.05165 | 0.05184 | 0.05199 | |
| Kernel | 0.03779 | 0.03781 | 0.03784 | 0.16922 | 0.16965 | 0.17000 | 0.18852 | 0.18989 | 0.19142 | |
| SVR | 0.00048 | 0.00050 | 0.00052 | 0.00033 | 0.00034 | 0.00035 | 0.05057 | 0.05069 | 0.05078 | |
| 20% | Loess | 0.00064 | 0.00066 | 0.00068 | 0.00042 | 0.00043 | 0.00044 | 0.04780 | 0.04797 | 0.04819 |
| Splines | 0.00061 | 0.00063 | 0.00066 | 0.00040 | 0.00042 | 0.00043 | 0.04799 | 0.04818 | 0.04837 | |
| Wavelets | 0.00059 | 0.00061 | 0.00064 | 0.00138 | 0.00140 | 0.00142 | 0.04786 | 0.04807 | 0.04825 | |
| Kernel | 0.03778 | 0.03781 | 0.03785 | 0.14796 | 0.14841 | 0.14864 | 0.18435 | 0.18606 | 0.18721 | |
| SVR | 0.00056 | 0.00058 | 0.00060 | 0.00035 | 0.00036 | 0.00037 | 0.04630 | 0.04638 | 0.04647 | |
| 40% | Loess | 0.00098 | 0.00102 | 0.00104 | 0.00057 | 0.00061 | 0.00065 | 0.07937 | 0.07985 | 0.08031 |
| Splines | 0.00096 | 0.00099 | 0.00103 | 0.00060 | 0.00064 | 0.00069 | 0.07965 | 0.08014 | 0.08059 | |
| Wavelets | 0.00095 | 0.00098 | 0.00101 | 0.00178 | 0.00182 | 0.00187 | 0.07954 | 0.08003 | 0.08047 | |
| Kernel | 0.03771 | 0.03780 | 0.03786 | 0.14208 | 0.14235 | 0.14271 | 0.20321 | 0.20363 | 0.20426 | |
| SVR | 0.00088 | 0.00091 | 0.00094 | 0.00047 | 0.00048 | 0.00049 | 0.06863 | 0.06901 | 0.06943 | |
DEG: Differentially expressed genes
The mean square errors of estimated gene expression levels for simulated cDNA microarray data with differentially expressed genes inserted under the medium expression levels condition.
| Percentage of DEG | Sinusoid | Banana | Mixed | |||||||
| Method | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | |
| 5% | Loess | 0.00356 | 0.00373 | 0.00389 | 0.00379 | 0.00392 | 0.00407 | 0.05214 | 0.05234 | 0.05259 |
| Splines | 0.00354 | 0.00370 | 0.00387 | 0.00379 | 0.00393 | 0.00407 | 0.05237 | 0.05258 | 0.05283 | |
| Wavelets | 0.00351 | 0.00368 | 0.00384 | 0.00438 | 0.00450 | 0.00466 | 0.05227 | 0.05247 | 0.05272 | |
| Kernel | 0.03799 | 0.03816 | 0.03838 | 0.17380 | 0.17441 | 0.17487 | 0.18858 | 0.18882 | 0.18904 | |
| SVR | 0.00337 | 0.00353 | 0.00369 | 0.00357 | 0.00368 | 0.00382 | 0.05034 | 0.05049 | 0.05067 | |
| 10% | Loess | 0.00709 | 0.00723 | 0.00743 | 0.00758 | 0.00780 | 0.00799 | 0.05483 | 0.05506 | 0.05532 |
| Splines | 0.00707 | 0.00721 | 0.00741 | 0.00763 | 0.00787 | 0.00805 | 0.05507 | 0.05532 | 0.05556 | |
| Wavelets | 0.00705 | 0.00718 | 0.00739 | 0.00858 | 0.00881 | 0.00898 | 0.05497 | 0.05522 | 0.05547 | |
| Kernel | 0.03837 | 0.03857 | 0.03886 | 0.15366 | 0.15461 | 0.15523 | 0.18801 | 0.18830 | 0.18867 | |
| SVR | 0.00672 | 0.00688 | 0.00707 | 0.00709 | 0.00731 | 0.00750 | 0.05150 | 0.05164 | 0.05183 | |
| 15% | Loess | 0.01041 | 0.01061 | 0.01094 | 0.01108 | 0.01136 | 0.01165 | 0.05964 | 0.05985 | 0.06012 |
| Splines | 0.01039 | 0.01060 | 0.01091 | 0.01109 | 0.01136 | 0.01165 | 0.05990 | 0.06013 | 0.06039 | |
| Wavelets | 0.01038 | 0.01058 | 0.01089 | 0.01251 | 0.01276 | 0.01310 | 0.05978 | 0.06003 | 0.06029 | |
| Kernel | 0.03867 | 0.03897 | 0.03927 | 0.12923 | 0.13026 | 0.13111 | 0.19337 | 0.19367 | 0.19414 | |
| SVR | 0.00986 | 0.01006 | 0.01032 | 0.01027 | 0.01056 | 0.01081 | 0.05499 | 0.05526 | 0.05550 | |
| 20% | Loess | 0.01393 | 0.01418 | 0.01444 | 0.01487 | 0.01519 | 0.01542 | 0.06362 | 0.06398 | 0.06432 |
| Splines | 0.01393 | 0.01415 | 0.01442 | 0.01486 | 0.01518 | 0.01542 | 0.06390 | 0.06425 | 0.06460 | |
| Wavelets | 0.01390 | 0.01414 | 0.01440 | 0.01631 | 0.01666 | 0.01689 | 0.06375 | 0.06410 | 0.06445 | |
| Kernel | 0.03915 | 0.03957 | 0.04004 | 0.12265 | 0.12366 | 0.12464 | 0.19808 | 0.19858 | 0.19909 | |
| SVR | 0.01310 | 0.01334 | 0.01365 | 0.01365 | 0.01399 | 0.01416 | 0.05809 | 0.05835 | 0.05858 | |
| 40% | Loess | 0.02772 | 0.02813 | 0.02862 | 0.02969 | 0.03004 | 0.03043 | 0.07856 | 0.07910 | 0.07975 |
| Splines | 0.02774 | 0.02814 | 0.02861 | 0.02966 | 0.03002 | 0.03038 | 0.07884 | 0.07937 | 0.08002 | |
| Wavelets | 0.02771 | 0.02811 | 0.02859 | 0.03012 | 0.03049 | 0.03092 | 0.07873 | 0.07926 | 0.07995 | |
| Kernel | 0.04195 | 0.04261 | 0.04316 | 0.15640 | 0.15816 | 0.16012 | 0.20368 | 0.20443 | 0.20518 | |
| SVR | 0.02545 | 0.02581 | 0.02614 | 0.02656 | 0.02685 | 0.02724 | 0.06786 | 0.06830 | 0.06862 | |
DEG: Differentially expressed genes
The mean square errors of estimated gene expression levels for simulated cDNA microarray data with differentially expressed genes inserted under the high expression levels conditions.
| Percentage of DEG | Sinusoid | Banana | Mixed | |||||||
| Method | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | 25% Quantile | Median | 75% Quantile | |
| 5% | Loess | 0.00038 | 0.00039 | 0.00040 | 0.00081 | 0.00087 | 0.00094 | 0.04633 | 0.04639 | 0.04648 |
| Splines | 0.00035 | 0.00036 | 0.00037 | 0.00079 | 0.00086 | 0.00092 | 0.04658 | 0.04665 | 0.04674 | |
| Wavelets | 0.00033 | 0.00034 | 0.00035 | 0.00115 | 0.00121 | 0.00129 | 0.04643 | 0.04650 | 0.04660 | |
| Kernel | 0.03781 | 0.03782 | 0.03783 | 0.16417 | 0.16453 | 0.16513 | 0.17428 | 0.17436 | 0.17447 | |
| SVR | 0.00031 | 0.00032 | 0.00033 | 0.00080 | 0.00087 | 0.00092 | 0.04527 | 0.04534 | 0.04544 | |
| 10% | Loess | 0.00146 | 0.00153 | 0.00168 | 0.00145 | 0.00156 | 0.00167 | 0.04733 | 0.04748 | 0.04758 |
| Splines | 0.00142 | 0.00149 | 0.00160 | 0.00147 | 0.00157 | 0.00168 | 0.04756 | 0.04767 | 0.04779 | |
| Wavelets | 0.00140 | 0.00147 | 0.00159 | 0.00160 | 0.00170 | 0.00182 | 0.04725 | 0.04736 | 0.04748 | |
| Kernel | 0.02662 | 0.03454 | 0.03789 | 0.23086 | 0.23135 | 0.23245 | 0.18441 | 0.19003 | 0.19176 | |
| SVR | 0.00126 | 0.00133 | 0.00144 | 0.00142 | 0.00154 | 0.00166 | 0.04678 | 0.04687 | 0.04696 | |
| 15% | Loess | 0.00203 | 0.00217 | 0.00234 | 0.00199 | 0.00212 | 0.00224 | 0.04963 | 0.04976 | 0.04991 |
| Splines | 0.00198 | 0.00211 | 0.00223 | 0.00200 | 0.00211 | 0.00222 | 0.04987 | 0.05001 | 0.05014 | |
| Wavelets | 0.00196 | 0.00209 | 0.00224 | 0.00219 | 0.00240 | 0.00257 | 0.04975 | 0.04989 | 0.05001 | |
| Kernel | 0.02318 | 0.02992 | 0.03729 | 0.17578 | 0.20066 | 0.22763 | 0.18472 | 0.18898 | 0.18923 | |
| SVR | 0.00178 | 0.00190 | 0.00200 | 0.00170 | 0.00180 | 0.00189 | 0.04885 | 0.04898 | 0.04912 | |
| 20% | Loess | 0.00260 | 0.00275 | 0.00293 | 0.00259 | 0.00272 | 0.00289 | 0.04917 | 0.04930 | 0.04944 |
| Splines | 0.00254 | 0.00268 | 0.00286 | 0.00260 | 0.00270 | 0.00289 | 0.04933 | 0.04946 | 0.04961 | |
| Wavelets | 0.00253 | 0.00267 | 0.00284 | 0.00289 | 0.00304 | 0.00320 | 0.04919 | 0.04933 | 0.04947 | |
| Kernel | 0.02224 | 0.02819 | 0.03468 | 0.16500 | 0.17817 | 0.20385 | 0.18207 | 0.18716 | 0.19141 | |
| SVR | 0.00226 | 0.00239 | 0.00255 | 0.00247 | 0.00258 | 0.00272 | 0.04839 | 0.04850 | 0.04863 | |
| 40% | Loess | 0.00501 | 0.00520 | 0.00545 | 0.00518 | 0.00538 | 0.00555 | 0.04980 | 0.04999 | 0.05020 |
| Splines | 0.00498 | 0.00519 | 0.00539 | 0.00520 | 0.00538 | 0.00558 | 0.05002 | 0.05022 | 0.05038 | |
| Wavelets | 0.00496 | 0.00517 | 0.00537 | 0.00535 | 0.00551 | 0.00572 | 0.04984 | 0.05004 | 0.05020 | |
| Kernel | 0.02155 | 0.02487 | 0.02810 | 0.18250 | 0.20140 | 0.22296 | 0.17524 | 0.18072 | 0.18433 | |
| SVR | 0.00446 | 0.00467 | 0.00489 | 0.00464 | 0.00483 | 0.00505 | 0.04809 | 0.04829 | 0.04860 | |
DEG: Differentially expressed genes
Figure 2Fitted normalization curves for actual cDNA microarray data using the five different normalization methods (Loess, Splines, Wavelets, Kernel, SVR).