| Literature DB >> 12950995 |
Taesung Park1, Sung-Gon Yi, Sung-Hyun Kang, SeungYeoun Lee, Yong-Sung Lee, Richard Simon.
Abstract
BACKGROUND: Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. This novel technique helps us to understand gene regulation as well as gene by gene interactions more systematically. In the microarray experiment, however, many undesirable systematic variations are observed. Even in replicated experiment, some variations are commonly observed. Normalization is the process of removing some sources of variation which affect the measured gene expression levels. Although a number of normalization methods have been proposed, it has been difficult to decide which methods perform best. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.Entities:
Mesh:
Year: 2003 PMID: 12950995 PMCID: PMC200968 DOI: 10.1186/1471-2105-4-33
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flowchart of normalization
Abbreviation for Normalization Methods
| Abbreviation | Description |
| O | Original data |
| G | Global median normalization |
| L | Intensity dependent linear normalization |
| N | Intensity dependent nonlinear normalization (LOWESS) |
| P | Print-tip normalization |
| S | Print-tip scale normalization |
| .s | Between-slide scale normalization |
List of Normalization Methods
| List of normalization methods including print-tip scale normalization | ||
| Method | Notation | Description |
| O | Original data | |
| Global | G | Global median normalization |
| GP | Global median normalization on each print-tip | |
| GPS | Global median normalization on each print-tip with scale normalization | |
| G.s | Global median normalization and between-slide scale normalization | |
| GP.s | Global median normalization on each print-tip and between-slide scale normalization | |
| GPS.s | Global median normalization on print-tip with scale normalization and between-slide scale normalization | |
| Linear | L | Intensity dependent linear regression normalization |
| LP | Intensity dependent linear regression normalization on each print-tip | |
| LPS | Intensity dependent linear regression normalization on each print-tip with scale normalization | |
| L.s | Intensity dependent linear regression normalization and between-slide scale normalization | |
| LP.s | Intensity dependent linear regression normalization on each print-tip and between-slide scale normalization | |
| LPS.s | Intensity dependent linear regression normalization on each print-tip with scale normalization and between-slide scale normalization | |
| Nonlinear | N | Intensity dependent nonlinear regression normalization (LOWESS) |
| NP | Intensity dependent nonlinear regression normalization (LOWESS) on each print-tip | |
| NPS | Intensity dependent nonlinear regression normalization (LOWESS) on each print-tip with scale normalization | |
| N.s | Intensity dependent nonlinear regression normalization (LOWESS) and between-slide scale normalization | |
| NP.s | Intensity dependent nonlinear regression normalization (LOWESS) on each print-tip and between-slide scale normalization | |
| NPS.s | Intensity dependent nonlinear regression normalization (LOWESS) on each print-tip with scale normalization and between-slide scale normalization | |
Figure 2(A) The original slide with a non-linear pattern. (B-D) Three normalized slides (global median, intensity dependent linear regression, intensity dependent non-linear regression.
Figure 3Dot Plots of Log-transformed Variance Estimates for Cortical Stem Cells Data. The Y-axis represents normalization methods and the X-axis represents the mean values of log-transformed variance estimates. (A) Dot plots for O, G, G.s, L, L.s, N, and N.s, (B) Dot plots for Global Normalization Methods, (C) Dot plots for Intensity-dependent Linear Normalization Methods, (D) Dot plots for Intensity-dependent Nonlinear Normalization Methods.
Figure 4Four types of (logG, logR) plots for the simulated microarray data: Type I (A), Type II (B), Type III (C), Type IV (D).
Figure 5Dot Plots of Log-transformed Variance Estimates for Simulated Data. The Y-axis represents normalization methods and the X-axis represents the mean values of log-transformed variance estimates.