| Literature DB >> 16297240 |
Mehrnoush Khojasteh1, Wan L Lam, Rabab K Ward, Calum MacAulay.
Abstract
BACKGROUND: In two-channel competitive genomic hybridization microarray experiments, the ratio of the two fluorescent signal intensities at each spot on the microarray is commonly used to infer the relative amounts of the test and reference sample DNA levels. This ratio may be influenced by systematic measurement effects from non-biological sources that can introduce biases in the estimated ratios. These biases should be removed before drawing conclusions about the relative levels of DNA. The performance of existing gene expression microarray normalization strategies has not been evaluated for removing systematic biases encountered in array-based comparative genomic hybridization (CGH), which aims to detect single copy gains and losses typically in samples with heterogeneous cell populations resulting in only slight shifts in signal ratios. The purpose of this work is to establish a framework for correcting the systematic sources of variation in high density CGH array images, while maintaining the true biological variations.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16297240 PMCID: PMC1310623 DOI: 10.1186/1471-2105-6-274
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Data description. In this table, the array data of this study are summarized.
| MM-1 to MM-4 | Male genomic | Male genomic | Self-self hybridizations | S.d. for each array |
| H526-1 to H526-8 | H526 cell line | Male genomic | Replicate H526 cell line experiments | 1. Correlation coefficient for each pair of arrays |
| MF-1 and MF-2 | Female genomic | Male genomic | Single copy change | T-test |
| T1 to T5 | Female genomic | Male/Female mixture (see | Single copy loss with normal cell contamination | T-test |
| T6 to T10 | Male genomic | Male/Female mixture (see | Single copy gain with normal cell contamination | T-test |
| X1 to X5 | Female DNA | cell lines containing varying numbers of X chromosomes (see | varying levels of gene amplification and deletion for each of the X-chromosomal genes | T-test |
Figure 1A smoothed M-XY plot illustrating spatial bias. The plot displays representation of log2 ratios based on the corresponding spot location on the microarray, the plot is smoothed with a moving median filter.
Summary of normalization methods. Each of the normalization methods in this table will be denoted by its number through out the text. For full description of methods refer to "Methods of bias removal" section in Results and Discussion and the "Normalization methods" section in Methods.
| No normalization | Raw ratios | |
| Global median Ratio | Ratios scaled by their median | |
| Intensity LOWESS, 10% span | Global Intensity LOWESS, span = 10% | |
| Intensity LOWESS, 25% span | Global Intensity LOWESS, span = 25% | |
| Intensity LOWESS, 40% span | Global Intensity LOWESS, span = 40% | |
| Print tip mean Ratio | Ratios of each print-tip group scaled by the mean ratio of that group | |
| Spatial | median of log2 ratios for the spots within a spatial neighbourhood window of size 11 rows by 11 columns centred on that spot | |
| Spatial + Median Plate Ratio | Method 8 followed by plate normalization | |
| Print Tip Intensity LOWESS, span = 40% | LOWESS performed on the ratios from each print-tip group | |
| Intensity LOWESS + Spatial | Stepwise Method 4 and 8 | |
| Intensity LOWESS + Spatial + Median Plate Ratio | Stepwise Methods 4 and 9 | |
| Spatial + Median Plate Ratio + intensity LOWESS | Stepwise Methods 9 and 4 | |
| No Normalization | See Method 1, but without background subtraction | |
| Global median Ratio | See Method 2, but without background subtraction | |
| Global Intensity LOWESS, span = 10% | See Method 4, but without background subtraction | |
| Print tip Mean Ratio | See Method 3, but without background subtraction | |
| Spatial | See Method 8, but without background subtraction | |
| Intensity LOWESS + Spatial | See Method 10, but without background subtraction | |
| Intensity LOWESS + Spatial + Median Plate Ratio | See Method 11, but without background subtraction | |
Figure 2Normalization of self-self hybridization data. Relative standard deviation (s.d.) of log2 ratios averaged across arrays MM-1 through MM-4 using all data points are shown in blue. The repeated analysis of relative s.d. after removal of the weakest 10% of spots is shown in red. The numbers on the horizontal axis refer to the methods used for normalization listed on Table 2.
Figure 3Normalization of hybridization data from replicate experiments. 8 replicate array CGH experiments were done comparing sample DNA from H526 cell line and the reference normal male genomic DNA. A. Graph shows the average of the standard deviations of log2 ratios for the same spot across 8 replicate arrays. B. shows the ICC and Average correlation coefficient of replicate arrays. Horizontal axis represents the method number listed in Table 2.
Figure 4Normalization of hybridization data from male and female DNA. For each of arrays MF-1 and MF-2, a T-test was performed on the two groups of log ratios, i.e. log ratios for the autosomal clones and those for the X chromosome clones. Values of T-statistic after each normalization method are shown. Horizontal axis represents the method number listed in Table 2.
Figure 5Normalization of hybridization data from samples mimicking heterogeneous cell populations and single copy alterations. Array CGH data were generated for samples mimicking single copy loss (deletion) or single copy gain (amplification) with contamination of increasing proportion of reference DNA, indicated as percentage on the horizontal axis. The experimental procedure for the array CGH experiments was previously described [5]. Global median normalization (method 1), stepwise normalization (method 12), global median normalization with background subtraction (method 13), and 3 step normalization with background subtraction (method 19) were applied. T-statistic values computed before and after normalization for arrays T1-T10 are summarized.
Figure 6Normalization of hybridization data from cDNA arrays. Array CGH data were generated for samples simulating varying levels of gene amplification and deletion for X-chromosomal genes on the array. Global median normalization (method 1), stepwise normalization (method 12), global median normalization with background subtraction (method 13), and 3 step normalization with background subtraction (method 19) were applied. T-statistic values computed before and after normalization for arrays X1-X5 are summarized.
Figure 7Chromosome plots before and after normalization. Plot of log2 signal ratios for clones (from chromosome 1 in A and chromosome 2 in B) versus their location across the chromosome. The profiles from left to right are: H526-1 data with global median normalization (method 1), H526-1 data with stepwise normalization (method 12), H526-5 data with global median normalization (method 13), H526-5 data with stepwise normalization (method 19). Each dot on the SeeGH plot represents a BAC clone. A shift in signal ratio to the left of center line indicates a copy number reduction, while a shift to the right indicates a gain. Blue arrow points to a high level segmental amplification. The arrow in part B points to the micro-amplification.