| Literature DB >> 16432259 |
Bettina Harr1, Christian Schlötterer.
Abstract
Oligonucleotide microarrays are an informative tool to elucidate gene regulatory networks. In order for gene expression levels to be comparable across microarrays, normalization procedures have to be invoked. A large number of methods have been described to correct for systematic biases in microarray experiments. The performance of these methods has been tested only to a limited extend. Here, we evaluate two different types of microarray analyses: (i) the same gene in replicate samples and (ii) different, but co-expressed genes in the same sample. The reliability of the latter analysis needs to be determined for the analysis of regulatory networks and our report is the first attempt to evaluate for the accuracy of different microarray normalization methods in this respect. Consistent with previous results we observed a large effect of the normalization method on the outcome of the expression analyses. Our analyses indicate that different normalization methods should be performed depending on whether a study is aiming to detect differential gene expression between independent samples or whether co-expressed genes should be identified. We make recommendations about the most appropriate method to use.Entities:
Mesh:
Year: 2006 PMID: 16432259 PMCID: PMC1345700 DOI: 10.1093/nar/gnj010
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Graphical representation of the analyses performed in this study. (A) Each sample is analyzed with four different normalization methods and correlation coefficients are calculated for pairwise combinations of normalization methods within one sample. (B) Replicate samples within each strain are analyzed with four different normalization methods and correlation coefficients are calculated between the replicate samples analyzed with a single method. (C) Each of the eight samples (i.e. four E.coli strains × two hybridizations per strain) was analyzed independently under 54 different normalization methods. For each method the correlation in expression level between two operon member genes is calculated.
Spearman correlation coefficients calculated on two levels
| Method | Sample | DHα | DHαF | MG1655 | MG1655F | |
|---|---|---|---|---|---|---|
| gcrma | Sample 1 versus Sample 2 | within method | 0.98 | 0.99 | 0.99 | 0.97 |
| rma | Sample 1 versus Sample 2 | within method | 0.96 | 0.98 | 0.98 | 0.96 |
| Li–Wong | Sample 1 versus Sample 2 | within method | 0.95 | 0.97 | 0.98 | 0.95 |
| mas5.0 | Sample 1 versus Sample 2 | within method | 0.94 | 0.96 | 0.97 | 0.93 |
| Sample 1 | between method | 0.94 | 0.95 | 0.96 | 0.91 | |
| Sample 1 | between method | 0.89 | 0.91 | 0.92 | 0.87 | |
| Sample 1 | between method | 0.87 | 0.90 | 0.92 | 0.83 | |
| Sample 1 | between method | 0.81 | 0.86 | 0.85 | 0.79 | |
| Sample 1 | between method | 0.74 | 0.78 | 0.79 | 0.68 | |
| Sample 1 | between method | 0.72 | 0.78 | 0.78 | 0.69 | |
| Sample 2 | between method | 0.95 | 0.94 | 0.96 | 0.95 | |
| Sample 2 | between method | 0.92 | 0.89 | 0.93 | 0.90 | |
| Sample 2 | between method | 0.91 | 0.88 | 0.93 | 0.89 | |
| Sample 2 | between method | 0.87 | 0.82 | 0.86 | 0.86 | |
| Sample 2 | between method | 0.79 | 0.73 | 0.80 | 0.78 | |
| Sample 2 | between method | 0.78 | 0.73 | 0.80 | 0.77 |
I. Within method, between two replicates analyzed with the same normalization method; II. Between method, calculated within a single replicate analyzed for different pairwise combinations of microarray normalization methods. Note that only four different normalization schemes were used (mas5.0, rma, gcrma and Li–Wong, see text for references).
Spearman correlation coefficients between operon member genes
| MG1 | MG2 | MGF1 | MGF2 | DH1 | DH2 | DHF1 | DHF2 | Average | Method | Common name | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| r | r | r | r | r | r | r | r | Background | Normalize | pm correct | Summary | ||
| 0.69 | 0.73 | 0.71 | 0.67 | 0.71 | 0.72 | 0.63 | 0.68 | 0.6921 | mas | invariantset | mas | liwong | |
| 0.68 | 0.73 | 0.71 | 0.67 | 0.71 | 0.73 | 0.62 | 0.67 | 0.6877 | none | quantiles | mas | liwong | |
| 0.68 | 0.73 | 0.71 | 0.67 | 0.71 | 0.72 | 0.62 | 0.67 | 0.6877 | mas | quantiles | mas | liwong | |
| 0.68 | 0.73 | 0.71 | 0.66 | 0.70 | 0.72 | 0.62 | 0.66 | 0.6867 | none | invariantset | mas | liwong | |
| 0.66 | 0.70 | 0.69 | 0.64 | 0.70 | 0.72 | 0.60 | 0.65 | 0.6700 | mas | quantiles | pmonly | liwong | |
| 0.67 | 0.71 | 0.69 | 0.65 | 0.69 | 0.71 | 0.60 | 0.64 | 0.6697 | none | constant | mas | liwong | |
| 0.66 | 0.70 | 0.69 | 0.64 | 0.70 | 0.72 | 0.60 | 0.65 | 0.6694 | rma | quantiles | pmonly | liwong | |
| 0.65 | 0.70 | 0.68 | 0.65 | 0.69 | 0.70 | 0.60 | 0.64 | 0.6651 | mas | constant | mas | liwong | |
| 0.65 | 0.69 | 0.68 | 0.63 | 0.70 | 0.72 | 0.60 | 0.64 | 0.6632 | rma | invariantset | pmonly | liwong | |
| 0.65 | 0.69 | 0.68 | 0.63 | 0.69 | 0.71 | 0.59 | 0.64 | 0.6610 | none | quantiles | pmonly | liwong | |
| 0.65 | 0.69 | 0.67 | 0.62 | 0.69 | 0.71 | 0.59 | 0.63 | 0.6574 | mas | invariantset | pmonly | liwong | |
| 0.63 | 0.68 | 0.65 | 0.61 | 0.68 | 0.70 | 0.58 | 0.63 | 0.6456 | none | constant | pmonly | liwong | |
| 0.64 | 0.68 | 0.66 | 0.60 | 0.69 | 0.70 | 0.56 | 0.62 | 0.6443 | none | invariantset | pmonly | liwong | Li–Wong |
| 0.63 | 0.68 | 0.65 | 0.60 | 0.68 | 0.70 | 0.57 | 0.63 | 0.6415 | rma | constant | pmonly | liwong | |
| 0.63 | 0.67 | 0.65 | 0.59 | 0.67 | 0.70 | 0.57 | 0.62 | 0.6397 | mas | constant | pmonly | liwong | |
| 0.65 | 0.67 | 0.64 | 0.61 | 0.66 | 0.68 | 0.60 | 0.60 | 0.6379 | gcrma | invariantset | pmonly | liwong | |
| 0.65 | 0.68 | 0.65 | 0.59 | 0.65 | 0.67 | 0.60 | 0.60 | 0.6362 | gcrma | constant | pmonly | mas | |
| 0.64 | 0.68 | 0.65 | 0.59 | 0.65 | 0.67 | 0.60 | 0.60 | 0.6352 | gcrma | quantiles | pmonly | mas | |
| 0.64 | 0.68 | 0.65 | 0.59 | 0.65 | 0.67 | 0.60 | 0.60 | 0.6352 | gcrma | invariantset | pmonly | mas | |
| 0.63 | 0.68 | 0.63 | 0.59 | 0.65 | 0.68 | 0.59 | 0.61 | 0.6336 | none | invariantset | mas | mas | |
| 0.62 | 0.68 | 0.63 | 0.59 | 0.66 | 0.69 | 0.59 | 0.61 | 0.6335 | none | quantiles | mas | mas | |
| 0.65 | 0.67 | 0.63 | 0.61 | 0.66 | 0.67 | 0.58 | 0.59 | 0.6316 | gcrma | quantiles | pmonly | liwong | |
| 0.62 | 0.68 | 0.63 | 0.59 | 0.65 | 0.68 | 0.57 | 0.61 | 0.6302 | none | constant | mas | mas | |
| 0.62 | 0.68 | 0.63 | 0.58 | 0.66 | 0.68 | 0.58 | 0.61 | 0.6301 | mas | invariantset | mas | mas | |
| 0.62 | 0.68 | 0.63 | 0.58 | 0.66 | 0.68 | 0.58 | 0.61 | 0.6295 | mas | quantiles | mas | mas | |
| 0.62 | 0.68 | 0.63 | 0.58 | 0.66 | 0.68 | 0.56 | 0.61 | 0.6265 | mas | constant | mas | mas | mas5.0 |
| 0.63 | 0.66 | 0.63 | 0.59 | 0.64 | 0.67 | 0.58 | 0.60 | 0.6256 | none | constant | mas | medianpolish | |
| 0.65 | 0.66 | 0.64 | 0.60 | 0.63 | 0.64 | 0.59 | 0.58 | 0.6249 | gcrma | quantiles | pmonly | medianpolish | gcrma |
| 0.65 | 0.66 | 0.64 | 0.60 | 0.62 | 0.65 | 0.59 | 0.58 | 0.6243 | gcrma | invariantset | pmonly | medianpolish | |
| 0.63 | 0.66 | 0.63 | 0.59 | 0.64 | 0.67 | 0.58 | 0.59 | 0.6239 | mas | constant | mas | medianpolish | |
| 0.65 | 0.66 | 0.64 | 0.59 | 0.62 | 0.65 | 0.58 | 0.59 | 0.6228 | gcrma | constant | pmonly | medianpolish | |
| 0.62 | 0.66 | 0.63 | 0.59 | 0.64 | 0.67 | 0.58 | 0.60 | 0.6226 | none | quantiles | mas | medianpolish | |
| 0.64 | 0.65 | 0.63 | 0.60 | 0.64 | 0.66 | 0.57 | 0.58 | 0.6224 | gcrma | constant | pmonly | liwong | |
| 0.62 | 0.66 | 0.62 | 0.59 | 0.64 | 0.66 | 0.58 | 0.59 | 0.6208 | mas | quantiles | mas | medianpolish | |
| 0.62 | 0.66 | 0.62 | 0.58 | 0.64 | 0.66 | 0.57 | 0.59 | 0.6183 | none | invariantset | mas | medianpolish | |
| 0.62 | 0.66 | 0.62 | 0.58 | 0.63 | 0.66 | 0.57 | 0.59 | 0.6181 | mas | invariantset | mas | medianpolish | |
| 0.61 | 0.66 | 0.62 | 0.57 | 0.64 | 0.68 | 0.55 | 0.60 | 0.6167 | none | invariantset | pmonly | mas | |
| 0.61 | 0.66 | 0.62 | 0.57 | 0.65 | 0.68 | 0.55 | 0.60 | 0.6166 | none | quantiles | pmonly | mas | |
| 0.61 | 0.66 | 0.62 | 0.57 | 0.64 | 0.68 | 0.55 | 0.60 | 0.6158 | none | constant | pmonly | mas | |
| 0.61 | 0.66 | 0.62 | 0.57 | 0.64 | 0.68 | 0.55 | 0.60 | 0.6157 | mas | constant | pmonly | mas | |
| 0.61 | 0.67 | 0.62 | 0.57 | 0.64 | 0.68 | 0.55 | 0.60 | 0.6156 | mas | quantiles | pmonly | mas | |
| 0.61 | 0.67 | 0.61 | 0.57 | 0.64 | 0.68 | 0.55 | 0.60 | 0.6150 | mas | invariantset | pmonly | mas | |
| 0.61 | 0.67 | 0.61 | 0.57 | 0.64 | 0.68 | 0.55 | 0.59 | 0.6150 | rma | quantiles | pmonly | mas | |
| 0.61 | 0.66 | 0.61 | 0.57 | 0.64 | 0.68 | 0.54 | 0.59 | 0.6145 | rma | constant | pmonly | mas | |
| 0.62 | 0.65 | 0.62 | 0.57 | 0.63 | 0.67 | 0.57 | 0.59 | 0.6145 | mas | constant | pmonly | medianpolish | |
| 0.61 | 0.66 | 0.61 | 0.57 | 0.64 | 0.68 | 0.54 | 0.60 | 0.6141 | rma | invariantset | pmonly | mas | |
| 0.62 | 0.65 | 0.62 | 0.58 | 0.63 | 0.67 | 0.57 | 0.58 | 0.6134 | mas | quantiles | pmonly | medianpolish | |
| 0.61 | 0.65 | 0.61 | 0.57 | 0.63 | 0.67 | 0.57 | 0.59 | 0.6129 | rma | constant | pmonly | medianpolish | |
| 0.62 | 0.65 | 0.62 | 0.58 | 0.63 | 0.67 | 0.56 | 0.58 | 0.6128 | rma | quantiles | pmonly | medianpolish | rma |
| 0.62 | 0.64 | 0.61 | 0.58 | 0.63 | 0.67 | 0.56 | 0.58 | 0.6118 | rma | invariantset | pmonly | medianpolish | |
| 0.61 | 0.64 | 0.62 | 0.57 | 0.63 | 0.67 | 0.57 | 0.58 | 0.6114 | mas | invariantset | pmonly | medianpolish | |
| 0.62 | 0.64 | 0.61 | 0.57 | 0.63 | 0.66 | 0.57 | 0.58 | 0.6112 | none | quantiles | pmonly | medianpolish | |
| 0.62 | 0.64 | 0.61 | 0.57 | 0.63 | 0.66 | 0.57 | 0.58 | 0.6109 | none | invariantset | pmonly | medianpolish | |
| 0.62 | 0.64 | 0.61 | 0.56 | 0.63 | 0.66 | 0.55 | 0.59 | 0.6088 | none | constant | pmonly | medianpolish | |
Values are given for each strain individually and as an average across strains. The right part of the table describes the combination of normalization steps to yield the different methods. Boldface highlight the correlation coefficients for the commonly employed normalization methods.
Spearman correlation coefficients between replicate samples
| MG | MGF | DH | DHF | Average | Method | Common name | |||
|---|---|---|---|---|---|---|---|---|---|
| r | r | r | r | Background | Normalize | pmcorrect | Summary | ||
| 0.9531 | 0.9853 | 0.986 | 0.9657 | 0.973 | gcrma | invariantset | pmonly | medianpolish | |
| 0.9571 | 0.9821 | 0.9842 | 0.9634 | 0.972 | none | quantiles | pmonly | medianpolish | |
| 0.9646 | 0.98 | 0.9827 | 0.9577 | 0.971 | mas | invariantset | pmonly | medianpolish | |
| 0.9549 | 0.9822 | 0.9843 | 0.9636 | 0.971 | none | invariantset | pmonly | medianpolish | |
| 0.9628 | 0.9795 | 0.9823 | 0.9568 | 0.970 | mas | quantiles | pmonly | medianpolish | |
| 0.9622 | 0.9792 | 0.9819 | 0.9567 | 0.970 | mas | constant | pmonly | medianpolish | |
| 0.9348 | 0.9867 | 0.9869 | 0.9682 | 0.969 | gcrma | quantiles | pmonly | medianpolish | gcrma |
| 0.9547 | 0.981 | 0.982 | 0.9587 | 0.969 | none | constant | pmonly | medianpolish | |
| 0.9489 | 0.9832 | 0.9831 | 0.9607 | 0.969 | gcrma | constant | pmonly | medianpolish | |
| 0.9578 | 0.9792 | 0.9821 | 0.9568 | 0.969 | rma | invariantset | pmonly | medianpolish | |
| 0.9567 | 0.9788 | 0.9819 | 0.9561 | 0.968 | rma | constant | pmonly | medianpolish | |
| 0.9542 | 0.979 | 0.982 | 0.9571 | 0.968 | rma | quantiles | pmonly | medianpolish | rma |
| 0.9734 | 0.9696 | 0.9784 | 0.9493 | 0.968 | mas | invariantset | mas | medianpolish | |
| 0.9567 | 0.9729 | 0.9813 | 0.9565 | 0.967 | mas | quantiles | pmonly | liwong | |
| 0.9552 | 0.9728 | 0.9807 | 0.9577 | 0.967 | mas | constant | pmonly | liwong | |
| 0.9554 | 0.9722 | 0.9811 | 0.9575 | 0.967 | rma | constant | pmonly | liwong | |
| 0.9719 | 0.9786 | 0.949 | 0.967 | mas | quantiles | mas | medianpolish | ||
| 0.9557 | 0.9721 | 0.9802 | 0.9565 | 0.966 | mas | invariantset | pmonly | liwong | |
| 0.9616 | 0.9733 | 0.979 | 0.9507 | 0.966 | mas | constant | mas | medianpolish | |
| 0.9687 | 0.9705 | 0.9771 | 0.9443 | 0.965 | none | invariantset | pmonly | mas | |
| 0.9684 | 0.9705 | 0.977 | 0.9446 | 0.965 | none | quantiles | pmonly | mas | |
| 0.9411 | 0.978 | 0.9805 | 0.9588 | 0.965 | gcrma | quantiles | pmonly | liwong | |
| 0.9557 | 0.9727 | 0.9797 | 0.9502 | 0.965 | none | invariantset | mas | medianpolish | |
| 0.9519 | 0.9704 | 0.9791 | 0.9569 | 0.965 | mas | constant | mas | liwong | |
| 0.952 | 0.9749 | 0.9801 | 0.9511 | 0.965 | none | constant | mas | medianpolish | |
| 0.9545 | 0.969 | 0.9794 | 0.9549 | 0.964 | mas | quantiles | mas | liwong | |
| 0.9542 | 0.9732 | 0.9797 | 0.9505 | 0.964 | none | quantiles | mas | medianpolish | |
| 0.9453 | 0.9732 | 0.9816 | 0.957 | 0.964 | rma | quantiles | pmonly | liwong | |
| 0.9641 | 0.9703 | 0.9769 | 0.944 | 0.964 | none | constant | pmonly | mas | |
| 0.9626 | 0.9707 | 0.9769 | 0.945 | 0.964 | rma | quantiles | pmonly | mas | |
| 0.9455 | 0.9722 | 0.9808 | 0.9565 | 0.964 | rma | invariantset | pmonly | liwong | |
| 0.9459 | 0.9729 | 0.9795 | 0.9558 | 0.964 | none | quantiles | pmonly | liwong | |
| 0.9615 | 0.9706 | 0.9768 | 0.9449 | 0.963 | rma | invariantset | pmonly | mas | |
| 0.9412 | 0.977 | 0.979 | 0.956 | 0.963 | gcrma | constant | pmonly | liwong | |
| 0.955 | 0.9662 | 0.978 | 0.9538 | 0.963 | mas | invariantset | mas | liwong | |
| 0.9443 | 0.9719 | 0.9792 | 0.9556 | 0.963 | none | constant | pmonly | liwong | |
| 0.9573 | 0.9705 | 0.9769 | 0.9446 | 0.962 | rma | constant | pmonly | mas | |
| 0.937 | 0.9723 | 0.9797 | 0.9588 | 0.962 | none | constant | mas | liwong | |
| 0.9414 | 0.9751 | 0.9779 | 0.9521 | 0.962 | gcrma | invariantset | pmonly | liwong | |
| 0.9359 | 0.9713 | 0.9804 | 0.9579 | 0.961 | none | quantiles | mas | liwong | |
| 0.9351 | 0.9693 | 0.9801 | 0.9569 | 0.960 | none | invariantset | mas | liwong | |
| 0.9336 | 0.9724 | 0.9789 | 0.9547 | 0.960 | none | invariantset | pmonly | liwong | Li–Wong |
| 0.946 | 0.9708 | 0.977 | 0.9447 | 0.960 | mas | constant | pmonly | mas | |
| 0.9464 | 0.9706 | 0.9771 | 0.9445 | 0.960 | mas | invariantset | pmonly | mas | |
| 0.9458 | 0.9708 | 0.977 | 0.9447 | 0.960 | mas | quantiles | pmonly | mas | |
| 0.9773 | 0.9585 | 0.9686 | 0.931 | 0.959 | gcrma | quantiles | pmonly | mas | |
| 0.9758 | 0.9589 | 0.9685 | 0.9315 | 0.959 | gcrma | invariantset | pmonly | mas | |
| 0.971 | 0.9586 | 0.9684 | 0.9315 | 0.957 | gcrma | constant | pmonly | mas | |
| 0.9588 | 0.9591 | 0.9705 | 0.9325 | 0.955 | none | constant | mas | mas | |
| 0.9585 | 0.9573 | 0.9697 | 0.9324 | 0.954 | none | quantiles | mas | mas | |
| 0.9587 | 0.9563 | 0.9693 | 0.9317 | 0.954 | none | invariantset | mas | mas | |
| 0.9562 | 0.9564 | 0.969 | 0.9306 | 0.953 | mas | quantiles | mas | mas | |
| 0.9455 | 0.9544 | 0.9685 | 0.9297 | 0.950 | mas | invariantset | mas | mas | |
| 0.9362 | 0.9584 | 0.9697 | 0.9318 | 0.949 | mas | constant | mas | mas | mas5.0 |
Values are given for each strain individually and as an average across strains. The right part of the table describes the combination of normalization steps to yield the different methods. Boldface highlight the correlation coefficients for the commonly employed normalization methods.
Figure 2The influence of different methods is shown for each step in the normalization procedure independently. (A) Mean and 95% confidence intervals of Spearman rank correlation coefficients calculated between the two member genes of an operon. (B) Mean and 95% confidence intervals of Spearman rank correlation coefficients calculated between all genes on replicate arrays.