| Literature DB >> 23216884 |
Yang Du1, Eduard Murani, Siriluck Ponsuksili, Klaus Wimmers.
Abstract
BACKGROUND: As a powerful tool in whole genome analysis, tiling array has been widely used in the answering of many genomic questions. Now it could also serve as a capture device for the library preparation in the popular high throughput sequencing experiments. Thus, a flexible and efficient tiling array design approach is still needed and could assist in various types and scales of transcriptomic experiment.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23216884 PMCID: PMC3583072 DOI: 10.1186/1471-2105-13-323
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic diagram of low MUS coverage and long MUS. Hypothetically extreme cases showing low MUS coverage (above) and long MUS (below).
Example of 60-mer probes from Agilent ChIP-on-Chip Set
| A_17_P01761220 | 2 | 45 | 17.55 | 22 | 6.85 |
| A_17_P17428106 | 2 | 44 | 16.42 | 24 | 7.97 |
| A_17_P04386305 | 2 | 59 | 20.86 | 22 | 6.59 |
| A_17_P10898621 | 12 | 55 | 20.86 | 22 | 6.14 |
BLAT hits, number of hybridization-quality alignment to the reference genome. MUS coverage is the length of the region on the probe covered by MUS. MUS Avg. Len, is the average length of all MUS within the probe. U, the original uniqueness score. U, the penalized uniqueness score.
Summary of MUS coverage ratio and average length (in bp) of Agilent ChIP-on-Chip Set probes
| 0.23 | 0.96 | 0.97 | 0.97 | 0.98 | 1 | |
| 13.26 | 16.91 | 17.32 | 17.33 | 17.76 | 30 |
Figure 2Back-to-back box-plot. Back-to-back box-plot of original uniqueness score (U, left) and the penalized uniqueness score (U, right) distribution within different BLAT hits groups (1, aligned to 1 position; 2, aligned to 2 positions; ≥3, aligned to at least 3 positions).
Figure 3Receiver operating characteristic curve. ROC curves of using original uniqueness score (U, red) and penalized uniqueness score (U, blue) for BLAT hits group classification.
Figure 4Palindromic content and uniqueness scores of Agilent ChIP-on-Chip Set. Distributions of the original uniqueness score (U, left) and penalized uniqueness score (U, right) of the Agilent Human Whole Genome ChIP-on-Chip Set 244K (1 to25) probes, using different level of palindromic content as cutoff.
Design summary and coverage comparison
| Agilent* | 73373 | 53/52.85 | 202/426 | 11.11% | 51.25% |
| -o −250† | 75036 | 60/57.16 | 246/411.9 | 12.29% | 53.60% |
| -o −275 | 72087 | 60/57.11 | 269/431.1 | 11.80% | 53.66% |
| -o −300 | 69491 | 60/57.02 | 292/449.4 | 11.35% | 53.74% |
| -o −325 | 67074 | 60/56.92 | 313/467.7 | 10.94% | 53.81% |
* Agilent Human Whole Genome ChIP-on-Chip Set 244K (Chromosome 22 only).
† Overlapping size set to −250, probe length range [45, 60], Tm range [69, 74], U range [9, Inf), palindromic content range [0, 30%], GC content range [30%, 50%].
‡ Table cell is formatted as median/mean.
¶ Using the length of all unit-sized (1000 bp) windows in which at least one probe was placed.
Figure 5Palindromic content and uniqueness scores of experimental data. Distributions of the original uniqueness score (upper-left) and penalized uniqueness score (upper-right) using different level of palindromic content as cutoff for 60-mer candidate probes on pig chromosome 14 (Sus Scrofa Build 10, NC_010456.3) region from 74377028 bp to 78176022 bp, and on the lower panel shows their distributions after filtering with standard probe selection criteria.
ArrayExpress experiments summary and penalized uniqueness score comparison
| 22072 | 5 | 2 | 18 | 9.306/8.798 | 3.235 | 9.131/10.04 | NA |
| 23131 | 31 | 3 | 61 | 8.735/9.796 | 3.492 | 9.132/10.04 | NA |
| 23558 | 32 | 2 | 136 | 4.784/3.503 | 4.246 | 9.146/10.05 | 2.2E-16 |
| 23697 | 70 | 2 | 1 | 0/0 | NA | 9.131/10.04 | NA |
| 24536 | 52 | 5 | 11 | 7.508/9.957 | 4.973 | 9.132/10.04 | NA |
| 25623 | 32 | 3 | 1 | 4.655/4.655 | NA | 9.131/10.04 | NA |
| 27915 | 20 | 5 | 6 | 9.633/9.899 | 2.824 | 9.131/10.04 | NA |
| 29288 | 132 | 9 | 9 | 8.641/9.957 | 3.965 | 9.131/10.04 | NA |
| 32155 | 21 | 7 | 23 | 8.48/9.613 | 3.642 | 9.132/10.04 | NA |
| 32988 | 48 | 10 | 145 | 6.618/8.636 | 4.383 | 9.14/10.04 | 1.34E-10 |
| 33264 | 49 | 16 | 31 | 9.428/9.957 | 2.76 | 9.131/10.04 | NA |
| 35635 | 57 | 3 | 1 | 12.79/12.79 | NA | 9.131/10.04 | NA |
| 35756 | 32 | 8 | 458 | 4.229/3.396 | 3.876 | 9.187/10.04 | 2.2E-16 |
| 37827 | 87 | 29 | 32 | 9.078/9.701 | 2.837 | 9.131/10.04 | NA |