| Literature DB >> 22950736 |
Colin D Veal1, Peter J Freeman, Kevin Jacobs, Owen Lancaster, Stéphane Jamain, Marion Leboyer, Demetrius Albanes, Reshma R Vaghela, Ivo Gut, Stephen J Chanock, Anthony J Brookes.
Abstract
BACKGROUND: For many analytical methods the efficiency of DNA amplification varies across the genome and between samples. The most affected genome regions tend to correlate with high C + G content, however this relationship is complex and does not explain why the direction and magnitude of effects varies considerably between samples.Entities:
Mesh:
Year: 2012 PMID: 22950736 PMCID: PMC3469336 DOI: 10.1186/1471-2164-13-455
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Correlated weak signal regions in Illumina Infinium array data. Signal intensity data from Illumina genotyping arrays (expressed as copy number counts per SNP), are shown in red above four example chromosome ideograms for two independently processed DNAs (S1 and S2), the data for which failed standard quality control checks. Whereas most markers can be seen to have produced normal strength signals (inferred diploid copy number of 2, indicated by the most prominent horizontal row of data points), many other markers produced far weaker signals (inferred copy number of one, indicated by the row of data points plotted one step lower), and these weak signal regions are highly correlated between the two samples (and between many others not shown here).
Primers and co-ordinates for all PCR amplicons and Probes (hg18, GRCh36)
| HDLBP | GAGCTCATCCTCCACTTGGG | GAACTTGGTGAGAAGTGCGG | chr2:241,855,412-241,860,011 | 4600 | TUF |
| HDAC4 | AGGTGCTAGATTTGGACGGG | GTGTGTGTTAGGGGGTCAGG | chr2:239,860,758-239,863,570 | 2813 | TUF |
| CAPN10 | ATCTGGCTACAGGCATGGGC | GAGAGCCCAGAAGTTCCAGC | chr2:241,173,030-241,175,779 | 2750 | TUF |
| PSCDBP | GAGGCAATCACATGAGCAGG | CTGCTAAGTGGATGAATGGTGG | chr2:158,003,567-158,006,035 | 2469 | Non-TUF |
| MARCH7 | GGGAAATATGGGTTGGGAAACTG | ATGGTCTCCGTCTTCTTCGG | chr2:160,329,568-160,332,075 | 2508 | Non-TUF |
| RBMS1 | AGTAAGGAGATGAGGGGTGG | ACAGGTTTTGGTGGGAGAGG | chr2:160,889,938-160,892,713 | 2776 | Non-TUF |
| 2n13 | GCAGACTAATGGGGATGAGG | GCCTATCTGGAAAAATAGAC | chr2:241,151,005-241,151,733 | 729 | TUF |
| | | | chr13:31,949,825-31,950,179 | 355 | Non-TUF |
| 8n6 | TTGAGTCAGCCACAGAGG | CCTGGTGACAGAATGACC | chr8:142,286,472-142,286,937 | 466 | TUF |
| | | | chr6:89,689,978-89,690,367 | 390 | Non-TUF |
| 8n3 | GCTTCATCCAGCTTCAACC | AGCAAAGTGACACTCAGTGC | chr8:145,234,022-145,234,464 | 443 | TUF |
| | | | chr3:134,693,119-134,693,550 | 432 | Non-TUF |
| 2n1 | CACCCCAGTGAGTAAGCTGC | AGGGTGATCGCTTCTGACC | chr2:241,707,017-241,707,258 | 242 | TUF |
| | | | chr1:37,721,785-37,722,026 | 242 | Non-TUF |
| 5nX | ATCTAGGCTCAGGAGAGAG | TAAACATCTTAAAATGGCCT | chr5:179,593,910-179,594,270 | 361 | TUF |
| | | | chrX:63,694,585-63,694,959 | 375 | Non-TUF |
| 9n14 | CAGAGAGCAACCTGGCTC | CTGCCTCCTTGTTTGGC | chr9:139,562,111-139,562,372 | 262 | TUF |
| chr14:94,217,995-94,218,256 | 262 | Non-TUF |
Figure 2Southern blot data showing DNA fragments that resist denaturation. Data is shown for Southern blots in which freshly prepared genomic DNAs were cut with the indicated restriction enzymes and processed as normal (‘a’ tracks), or heated for one minute in water at 100°C and snap cooled on ice prior to gel electrophoresis (‘b’ tracks), or similarly heated and cooled before restriction enzyme digestion and electrophoresis (‘c’ tracks). Arrow heads indicate the expected position of Southern blot bands. The ‘Control’ probe (PSCDBP, Table 1), which is from a genome region that gives consistently strong Illumina Infinium signals, produces no bands in any heated sample. In contrast, the ‘Test’ probe (CAPN10, Table 1), which originates from a genome region that tends to give weak Illumina Infinium signals, produces strong bands in all the tracks, indicating the detected genomic fragments are not effectively denatured by the conditions applied prior to running on the agarose gel. Equivalent results were produced for denaturation attempts involving heating at 37°C for 10 minutes in 0.32 M NaOH, followed by pH neutralisation (data not shown).
Figure 3Pre-heating samples to improve PCR amplification efficiency. This graph plots the test:reference product ratio generated by PRT assay ‘2n13’ (Y-axis), amplicon sizes 729 bp and 355 bp, against temperatures used to pre-heat the input genomic DNAs (X-axis). After heating, samples were snap cooled on ice before adding them to the PRT reaction mix. Each condition was run on 2–4 samples in duplicate, and maximum and minimum ratios are plotted as error bars. The ‘Control’ ratio at the start of the chart indicates the ratio produced by amplifying non-heated input DNA, and the dotted horizontal line at 1.43 indicates the ratio that would be produced if the slightly different sized test and reference amplicons amplified with exactly equal efficiency.
Figure 4Illumina Infinium weak signals regions aligned with CpG and C + G maps. This image uses chromosome 2 to provide typical evidence of the degree of correlation between copy number inferences per SNP for samples that genotyped poorly on Illumina Infinium arrays (first data row below the ideogram), long-range averaged C + G content on a scale of 30-70% (middle data rows, data from UCSC genome browser), and the location of CpG islands (bottom data rows, data from UCSC genome browser). Weak Illumina signal regions are not simply correlated with CpG islands, nor with generally high C + G content, but only with regions containing the highest peaks of C + G content.
Figure 5TUF (C + G rich) sequences impair the analysis of neighbouring DNA regions. The lower image shows a restriction map surrounding the ‘test’ fragment of PRT assay A2n13, including a 1100 bp region with 73% C + G content (green box). The graph above shows average test:reference product ratios that were run in triplicate, with maximum and minimum plotted as error bars. ‘Control’ indicates the use of undigested DNA. Remaining columns show the ratios produced upon pre-digesting with the indicated restriction enzymes. The absolute degree of reference fragment amplification did not vary significantly across these treatments. Treatments that break the DNA to physically separate the test fragment from the C + G rich sequence clearly provide the best improvement in test fragment amplification efficiency. This reaches 1.43, which is the theoretical maximum assuming exactly equal molar amplification of test and reference amplicons (as indicated by the dotted line).
Figure 6The residual LRR variance prior to and after adjustment for C + G and CpG content. The log probe intensity ratio (LRR) values for each SNP or CNV assay provides data on probe intensity relative to that of the estimated genotype-specific cluster location. We implemented a method similar to that described in Staaf et al . [29] to re-estimate LRR after a quantile-normalization, with an enhanced multiple linear regression model, incorporating within-chip signal re-scaling terms and a polynomial correction for GC and CpG waves. This scatterplot shows the pre-normalization LRR variance against the LRR variance post-normalization.
Association between PRT performance and Illumina Infinium intensity correlations with C + G and CpG for 54 samples
| | ||||
|---|---|---|---|---|
| 1 mb | 0.7 | 0.8 | 0.37 | 0.59 |
| 100 kb | ||||
| 50 kb | 0.13 | 0.41 | 0.4 | |
| 10 kb | 0.36 | 0.29 | 0.49 | 0.28 |
| 5 kb | 0.39 | 0.36 | ||
| 1 kb | 0.66 | 0.52 | ||
| 500 bp | 0.55 | 0.51 | ||
| 100 bp | ||||
Figure 7Ligase treatment drastically reduces PCR efficiency in TUF regions. The charts indicate the amplicon product ratios for two PRT assays for four ‘old’ DNA samples (C7, C8, D4, D5) that were untreated (blue) and treated with PreCR (red), which includes a ligase to repair ssDNA nicks. Treated samples have greatly reduced amplification efficiency at the test amplicon in the ‘TUF’ region compared to the untreated DNA samples.
Figure 8Sonicated DNA improves the quality of WGA and Illumina Infinium genotyping. (A) This graph shows test:reference product ratios (Y-axis) for PRT assay 2n13 performed using equal amounts of various input DNAs as labelled (X-axis). All PRT reactions were run in quadruplicate, with maximum and minimum plotted as error bars. The dotted horizontal line at 1.43 indicates the ratio that would be produced if the test and reference amplicons amplified with exactly equal efficiency. ‘Control’ indicates that the PRT employed freshly prepared genomic DNA. ‘Intact’ indicates that the same genomic sample was first subjected to WGA using the MDA method (QIAGEN Repli-g Mini kit applied to 50 ng of DNA). ‘Sonicated’ indicates that the same genomic sample was first sonicated to less than 1 kb average size and WGA processed. The blue data points show data produced by the above regimes, whereas the red data points are from an equivalent experiment where the DNA was additionally digested with NcoI immediately prior to inclusion in the PRT reactions. As indicated in Figure 4, NcoI cuts the genomic DNA just upstream of the test sequence target region, and separates it from a nearby region of high C + G content. The data points for the ‘intact’ column demonstrate that after WGA the test locus is still subject to reduced amplification efficiency. Importantly, correction by digestion is substantially reduced compared to the control. Sonication prior to WGA dramatically enhances amplification to almost the efficiency of the references locus even without correction by digestion. (B) Using chromosome 7 as a typical example, log R ratio plots (a measure of relative signal strength) are shown for Illumina Infinium genotyping data generated by assaying a freshly prepared intact genomic DNA sample (log R ratio plot in the upper box) and from a portion of that sample sonicated to 0.3 – 3 kbp in size (log R ratio plot in the second box). The data tracks below these boxes show the apparently reduced signal strength regions (as copy number inferences) generated on the same platform for two poorly performing DNA samples (those mentioned in Figure 1), the C + G content and CpG island maps, and the chromosome 7 ideogram.