| Literature DB >> 15961730 |
Thomas S Price1, Regina Regan, Richard Mott, Asa Hedman, Ben Honey, Rachael J Daniels, Lee Smith, Andy Greenfield, Ana Tiganescu, Veronica Buckle, Nicki Ventress, Helena Ayyub, Anita Salhan, Susana Pedraza-Diaz, John Broxholme, Jiannis Ragoussis, Douglas R Higgs, Jonathan Flint, Samantha J L Knight.
Abstract
Comparative genome hybridization (CGH) to DNA microarrays (array CGH) is a technique capable of detecting deletions and duplications in genomes at high resolution. However, array CGH studies of the human genome noting false negative and false positive results using large insert clones as probes have raised important concerns regarding the suitability of this approach for clinical diagnostic applications. Here, we adapt the Smith-Waterman dynamic-programming algorithm to provide a sensitive and robust analytic approach (SW-ARRAY) for detecting copy-number changes in array CGH data. In a blind series of hybridizations to arrays consisting of the entire tiling path for the terminal 2 Mb of human chromosome 16p, the method identified all monosomies between 267 and 1567 kb with a high degree of statistical significance and accurately located the boundaries of deletions in the range 267-1052 kb. The approach is unique in offering both a nonparametric segmentation procedure and a nonparametric test of significance. It is scalable and well-suited to high resolution whole genome array CGH studies that use array probes derived from large insert clones as well as PCR products and oligonucleotides.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15961730 PMCID: PMC1151590 DOI: 10.1093/nar/gki643
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Known map positions of monosomies in test samples
| Case ID | Position of monosomy (kb from 16p) | References |
|---|---|---|
| 1 | 0–223 | ( |
| 2 | 33–300 | ( |
| 3 | 0–300 | D.R. Higgs and Dr V. Buckle, personal communication |
| 4 | 0–700 | ( |
| 5 | 0–775 | ( |
| 6 | 0–900 | ( |
| 7 | 0–999 | ( |
| 8 | 0–1052 | ( |
| 9 | 0–1100 | ( |
| 10 | 0–1159 | ( |
| 11 | 0–1159 | ( |
| 12 | 0–1183 | ( |
| 13 | 0–1400 | D.R. Higgs and Dr V. Buckle, personal communication |
| 14 | 0–1567 | ( |
| 15 | 0–1000 | ( |
| 16 | 0–1160 | D.R. Higgs and Dr V. Buckle, personal communication |
Figure 1Normalized ratio data (blue data points) from the 16p probes for three control versus control hybridizations (a–c) and mean ratio and SD values (shown as black error bars) for six control versus control hybridizations (d). The data points are joined by lines to help visualize the similarity between hybridization profiles.
Figure 2(a) and (b) Charts showing the mean, normalized hybridization ratios across the terminal 2 Mb of 16p for two patients, monosomic for different portions of 16p (pink data points) and in each case compared with the mean, normalized hybridization ratios and corresponding SD values (black error bars) from the control versus control hybridizations (dark blue data points). Each data point represents data for a single arrayed probe and has been corrected for probe to probe variation. The known region of monosomy for each patient is indicated by the red box in the ideogram of the terminal 2 Mb of chromosome 16p above the relevant chart. The yellow boxes on the ideograms represent known gene locations. The probes giving false negative and a false positive result are annotated and indicated by the arrows. (c) Example of the graphical output obtained by applying SW-ARRAY on one data set analysed initially using the probe by probe global thresholding approach [this initial analysis is shown in (a)]. Each black data point indicates the mean, normalized LOG2 fluorescence ratio for a single 16p probe. The significance value is given in the top right of the chart. The robustness values are plotted as blue lines, the threshold value as a green line. The genomic regions identified as monosomic with robustness values >0.5 are indicated by the black double-ended arrow. The known region of monosomy for each patient is indicated by the red box in the ideogram of the terminal 2 Mb of chromosome 16p above the chart and the yellow box on the ideogram is known to be disomic.
Summary of 16p array CGH results analysed probe by probe
| Location of monosomy (kb from 16p telomere) | Correct monosomy result probe (%) | Correct disomy result probe (%) | False positive in monosomic region | False negative in monosomic region probe (%) | Overall false positive results |
|---|---|---|---|---|---|
| Test samples | |||||
| 34–300 | 86 | 59 | 0 | 14 | 37 |
| 0–775 | 63 | 68 | 0 | 37 | 20 |
| 0–1000 | 61 | 82 | 0 | 39 | 9 |
| 0–1052 | 97 | 91 | 0 | 3 | 5 |
| 0–1160 | 86 | 96 | 0 | 14 | 2 |
| 0–1183 | 70 | 82 | 0 | 30 | 8 |
| 0–1567 | 83 | 100 | 0 | 17 | 0 |
| Control samples | |||||
| 0 | NA | 95 | NA | NA | 5 |
| 0 | NA | 100 | NA | NA | 0 |
| 0 | NA | 100 | NA | NA | 0 |
| 0 | NA | 98 | NA | NA | 2 |
| 0 | NA | 88 | NA | NA | 12 |
| 0 | NA | 98 | NA | NA | 2 |
aCopy-number increases rather than decreases.
bIncludes false positives in disomic regions.
NA = Not applicable.
Non-blind array CGH results obtained using SW-ARRAY
| Known location of monosomy (kb from 16p telomere) | Overall significance ( | Algorithm location with robustness >0.5 (kb from 16p telomere) |
|---|---|---|
| Test samples | ||
| 33–300 | 0.0578 | NA |
| 0–775 | 0.0042 | 27–775 |
| 0–1000 | 0.0000 | 27–1055 |
| 0–1052 | 0.0000 | 27–1055 |
| 0–1160 | 0.0000 | <27–1026 |
| 0–1183 | 0.0000 | 27–1121 |
| 0–1567 | 0.0008 | 356–1358 |
| Control samples | ||
| 0 | 0.9688 | NA |
| 0 | 0.3915 | NA |
| 0 | 0.0840 | NA |
| 0 | 0.1459 | NA |
| 0 | 0.8497 | NA |
| 0 | 0.1529 | NA |
NA = Not applicable.
Figure 3Results of applying SW-ARRAY on data sets with 14 blind control versus test hybridizations. Each black data point indicates the mean, normalized LOG2 fluorescence ratio for a single 16p probe. The significance values are given in the top right of each chart. The robustness values are plotted as blue lines, the threshold values as green lines. Genomic regions identified as monosomic with robustness values >0.5 are indicated by the black double-ended arrows. The known region of monosomy for each patient is indicated by the red box in the ideogram of the terminal 2 Mb of chromosome 16p above the relevant chart. The black box on the ideograms represent disomic regions.