| Literature DB >> 17363414 |
Jayne Y Hehir-Kwa1, Michael Egmont-Petersen, Irene M Janssen, Dominique Smeets, Ad Geurts van Kessel, Joris A Veltman.
Abstract
Recently, comparative genomic hybridization onto bacterial artificial chromosome (BAC) arrays (array-based comparative genomic hybridization) has proved to be successful for the detection of submicroscopic DNA copy-number variations in health and disease. Technological improvements to achieve a higher resolution have resulted in the generation of additional microarray platforms encompassing larger numbers of shorter DNA targets (oligonucleotides). Here, we present a novel method to estimate the ability of a microarray to detect genomic copy-number variations of different sizes and types (i.e. deletions or duplications). We applied our method, which is based on statistical power analysis, to four widely used high-density genomic microarray platforms. By doing so, we found that the high-density oligonucleotide platforms are superior to the BAC platform for the genome-wide detection of copy-number variations smaller than 1 Mb. The capacity to reliably detect single copy-number variations below 100 kb, however, appeared to be limited for all platforms tested. In addition, our analysis revealed an unexpected platform-dependent difference in sensitivity to detect a single copy-number loss and a single copy-number gain. These analyses provide a first objective insight into the true capacities and limitations of different genomic microarrays to detect and define DNA copy-number variations.Entities:
Mesh:
Year: 2007 PMID: 17363414 PMCID: PMC2779891 DOI: 10.1093/dnares/dsm002
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Detection of known and validated submicroscopic copy-number variations by high-density BAC, SNP, and oligonucleotide arrays. Individual chromosome plots are shown for Patient 8 (chromosome 9), with the log2 T/R (test-over-reference) values plotted on the y-axis versus the genomic position on chromosome 9 on the x-axis. Results are shown for the tiling-resolution 32k BAC array (A), the 100k SNP array (B), the 250k SNP array (C), and the 385k oligonucleotide array (D). A known and validated microdeletion of 0.54 Mb on 9q33.1 is detected by all four genomic microarray platforms (see black arrow). In addition, a previously undetected microduplication is clearly visible on the chromosome profile obtained by the 250k SNP array (see grey arrow). This figure also shows the different levels of microarray noise present for the different microarray platforms.
Detection of known and validated submicroscopic copy-number variations onto high-density BAC, SNP and oligonucleotide microarrays
| Patient | Copy number | Chromosome | Size (Mb) | No. of targets in region | Average ratio targets in region | Detected by HMMa | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 32k | 100k | 250k | 385k | 32k | 100k | 250k | 385k | 100k | 250k | 385k | ||||
| BACs | SNPs | SNPs | Oligos | BACs | SNPs | SNPs | Oligos | SNPs | SNPs | Oligos | ||||
| 1 | Loss | 1 | 3.93 | 42 | 125 | 230 | n.d. | −0.41 | −0.51 | −0.45 | n.d. | Yes | Yes | n.d. |
| 2 | Gain | 1 | 2.12 | 21 | 50 | 120 | 176 | 0.44 | 0.49 | 0.77 | 0.59 | Yes | Yes | Yes |
| 3 | Loss | 2 | 0.92 | 11 | 47 | 64 | 127 | −0.59 | −0.43 | −0.52 | −0.45 | Yes | Yes | Yes |
| 4 | Gain | 5 | 1.24 | 16 | 40 | n.d. | n.d. | 0.29 | 0.30 | n.d. | n.d. | Yes | n.d. | n.d. |
| 5 | Loss | 7 | 0.35 | 18 | 1 | 13 | 30 | −0.23 | −0.52 | −0.40 | −0.24 | No | No | Yes |
| 6 | Gain | 9 | 0.23 | 5 | 23 | 38 | 40 | 0.35 | 0.30 | 0.47 | 0.27 | Yes | Yes | Yes |
| 7 | Loss | 9 | 2.85 | 30 | 145 | 320 | n.d. | −0.39 | −0.45 | −0.44 | n.d. | Yes | Yes | n.d. |
| 8 | Loss | 9 | 0.54 | 6 | 22 | 70 | 88 | −0.50 | −0.44 | −0.44 | −0.37 | Yes | Yes | Yes |
| 9 | Loss | 11 | 9.15 | 80 | 551 | 923 | 1299 | −0.35 | −0.47 | −0.47 | −0.50 | Yes | Yes | Yes |
| 10 | Gain | 12 | 2.30 | 39 | 69 | n.d. | 353 | 0.32 | 0.26 | n.d. | 0.23 | Yes | n.d. | Yes |
| 11 | Loss | 15 | 1.65 | 16 | 4 | 40 | 204 | −0.33 | −0.36 | −0.50 | −0.35 | No | Yes | Yes |
| 12 | Gain | 17 | 2.89 | 28 | 64 | 151 | 420 | 0.37 | 0.26 | 0.46 | 0.29 | Yes | Yes | Yes |
| 12 | Gain | 17 | 1.43 | 14 | 18 | 91 | 198 | 0.36 | 0.19 | 0.44 | 0.31 | Yes | Yes | Yes |
| 12 | Gain | 17 | 2.88 | 30 | 205 | 279 | 442 | 0.41 | 0.29 | 0.45 | 0.28 | Yes | Yes | Yes |
| 12 | Gain | 17 | 1.48 | 24 | 21 | 64 | 189 | 0.33 | 0.26 | 0.52 | 0.31 | Yes | Yes | Yes |
| 13 | Loss | 22 | 2.66 | 35 | 36 | 130 | 306 | −0.41 | −0.47 | −0.41 | −0.23 | Yes | Yes | Yes |
aAll copy-number variations were initially detected by an automated HMM on the 32k BAC array.
Signal-to-noise parameters of the four genomic copy-number profiling platforms
| Patient | 32k BAC array | Duplicate 32k BAC array | Affymetrix 100k SNP array | Affymetrix 250k SNP array | NimbleGen 385k Oligonucleotide array | ||||
|---|---|---|---|---|---|---|---|---|---|
| Mean Xa | Auto STDb | Auto STD | Mean X | Auto STD | Mean X | Auto STD | Mean X | Auto STD | |
| 1 | 0.47 | 0.10 | 0.06 | 0.48 | 0.15 | 0.49 | 0.18 | n.d. | n.d. |
| 2 | 0.49 | 0.14 | 0.08 | 0.48 | 0.13 | 0.48 | 0.14 | 0.42 | 0.20 |
| 3 | 0.49 | 0.12 | 0.07 | 0.48 | 0.16 | 0.45 | 0.16 | 0.35 | 0.25 |
| 4 | 0.42 | 0.13 | 0.07 | 0.47 | 0.14 | n.d. | n.d. | n.d. | n.d. |
| 5 | 0.40 | 0.09 | 0.05 | 0.45 | 0.16 | 0.46 | 0.18 | 0.29 | 0.24 |
| 6 | 0.45 | 0.10 | 0.05 | 0.47 | 0.17 | 0.43 | 0.13 | 0.38 | 0.28 |
| 7 | 0.46 | 0.12 | 0.06 | 0.48 | 0.14 | 0.43 | 0.13 | n.d. | n.d. |
| 8 | 0.47 | 0.11 | 0.06 | 0.46 | 0.13 | 0.42 | 0.15 | 0.35 | 0.24 |
| 9 | 0.40 | 0.12 | 0.06 | 0.48 | 0.24 | 0.47 | 0.16 | 0.51 | 0.21 |
| 10 | 0.49 | 0.11 | 0.07 | 0.48 | 0.13 | n.d. | n.d. | 0.39 | 0.22 |
| 11 | 0.43 | 0.09 | 0.06 | 0.45 | 0.16 | 0.49 | 0.14 | 0.41 | 0.25 |
| 12 | 0.56 | 0.13 | 0.07 | 0.48 | 0.16 | 0.44 | 0.14 | 0.43 | 0.21 |
| 13 | 0.50 | 0.10 | 0.06 | 0.48 | 0.17 | 0.48 | 0.17 | 0.32 | 0.20 |
| Average | 0.46 | 0.11 | 0.06 | 0.47 | 0.16 | 0.46 | 0.15 | 0.38 | 0.23 |
aMean log2-transformed test-over-reference ratio of the X chromosome, excluding the pseudo-autosomal regions, obtained from calculations in sex-mismatched hybridization experiments. For the BAC and the NimbleGen platforms, data were obtained within each two-color experiment, for the Affymetrix SNP platform, data were combined in silico from different one-color experiments. For this analysis, four reference pool samples (two of each sex) were processed in parallel with the patient samples.
bStandard deviation calculated over the log2-transformed test-over-reference ratios for all autosomal targets, excluding the genomic regions known to harbor submicroscopic copy-number variations.
Result of the statistical power analysis: How many consecutive targets are required to detect a single copy-number loss or gain?
| Patient | 32k BAC array | Duplicate 32k BAC array | Affymetrix 100k SNP array | Affymetrix 250k SNP array | NimbleGen 385k Oligonucleotide array | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Loss | Gain | Loss | Gain | Loss | Gain | Loss | Gain | Loss | Gain | |
| 1 | 4 | 4 | 3 | 3 | 4 | 6 | 5 | 7 | n.d. | n.d. |
| 2 | 4 | 11 | 3 | 4 | 4 | 6 | 4 | 6 | 5 | 13 |
| 3 | 4 | 5 | 3 | 3 | 4 | 7 | 5 | 7 | 10 | 19 |
| 4 | 4 | 6 | 3 | 4 | 4 | 6 | n.d. | n.d. | n.d. | n.d. |
| 5 | 4 | 5 | 3 | 3 | 5 | 8 | 5 | 8 | 11 | 31 |
| 6 | 3 | 5 | 3 | 3 | 5 | 8 | 4 | 6 | 9 | 27 |
| 7 | 4 | 5 | 3 | 3 | 4 | 6 | 4 | 6 | n.d. | n.d. |
| 8 | 4 | 5 | 3 | 3 | 4 | 6 | 5 | 8 | 8 | 24 |
| 9 | 4 | 5 | 3 | 4 | 7 | 9 | 4 | 7 | 5 | 8 |
| 10 | 3 | 5 | 3 | 4 | 4 | 6 | n.d. | n.d. | 7 | 14 |
| 11 | 3 | 5 | 3 | 4 | 5 | 7 | 4 | 6 | 7 | 18 |
| 12 | 3 | 5 | 3 | 4 | 5 | 8 | 4 | 5 | 5 | 16 |
| 13 | 3 | 4 | 3 | 4 | 5 | 7 | 4 | 7 | 9 | 14 |
| Average | 4 | 5 | 3 | 4 | 5 | 7 | 4 | 7 | 8 | 18 |
For this analysis, we used a power of 95%.
Figure 2Result of the power analysis of the four genomic microarray platforms for detection of a single copy-number gain or loss contained by different numbers of consecutive targets. The resulting power for a single copy-number gain (dotted) and a single copy-number loss (line) are displayed for the 32k BAC array platform (A), the 100k SNP array (B), the 250k SNP array (C), and 385k oligonucleotide array platform (D). The increase in number of targets has a varying impact on the resulting power across the four different microarray platforms. In addition, the number of consecutive targets required to detect single copy-number gains differs considerably from the number of targets needed to detect a single copy-number loss, and this difference appears to be platform-dependent.
Probability to detect a single copy-number gain or loss with different genomic sizes onto the four platforms
| 32k BAC array | Affymetrix 100k SNP array | Affymetrix 250k SNP array | NimbleGen 385k Oligonucleotide array | |||||
|---|---|---|---|---|---|---|---|---|
| Loss | Gain | Loss | Gain | Loss | Gain | Loss | Gain | |
| 10 kb | 0.00 | 0.00 | 0.03 | 0.01 | 0.12 | 0.02 | 0.00 | 0.00 |
| 50 kb | 0.01 | 0.00 | 0.28 | 0.14 | 0.68 | 0.38 | 0.28 | 0.00 |
| 100 kb | 0.02 | 0.01 | 0.56 | 0.39 | 0.88 | 0.75 | 0.94 | 0.01 |
| 200 kb | 0.11 | 0.05 | 0.81 | 0.71 | 0.94 | 0.91 | 0.95 | 0.93 |
| 300 kb | 0.32 | 0.16 | 0.88 | 0.83 | 0.95 | 0.94 | 0.95 | 0.94 |
| 400 kb | 0.60 | 0.36 | 0.91 | 0.88 | 0.95 | 0.94 | 0.95 | 0.94 |
| 500 kb | 0.83 | 0.60 | 0.93 | 0.91 | 0.95 | 0.95 | 0.95 | 0.95 |
| 1 Mb | 0.95 | 0.95 | 0.94 | 0.94 | 0.95 | 0.95 | 0.95 | 0.95 |
This table combines the results from Table 3 with those of Supplementary Table 2. For this analysis, we used a power of 95%.