| Literature DB >> 28245591 |
Jing Tu1, Na Lu2, Mengqin Duan3, Mengting Huang4, Liang Chen5, Junji Li6, Jing Guo7, Zuhong Lu8.
Abstract
Multiple displacement amplification (MDA) is considered to be a conventional approach to comprehensive amplification from low input DNA. The chimeric reads generated in MDA lead to severe disruption in some studies, including those focusing on heterogeneity, structural variation, and genetic recombination. Meanwhile, the generation of by-products gives a new approach to gain insights into the reaction process of φ29 polymerase. Here, we analyzed 36.7 million chimeras and screened 196 billion chimeric hotspots in the human genome, as well as evaluating the hotspot selective preference of chimeras. No significant preference was captured in the distributions of chimeras and hotspots among chromosomes. Hotspots with overlaps for 12-13 nucleotides (nt) were most likely to be selected as templates in chimera generation. Meanwhile, a regularly selective preference was noticed in overlap GC content. The preferences in overlap length and GC content was shown to be pertinent to the sequence denaturation temperature, which pointed out the optimization direction for reducing chimeras. Distance preference between two segments of chimeras was 80-280 nt. The analysis is beneficial for reducing the chimeras in MDA, and the characterization of MDA chimeras is helpful in distinguishing MDA chimeras from chimeric sequences caused by disease.Entities:
Keywords: chimeras; chimeric hotspots; multiple displacement amplification (MDA); φ29 polymerase
Mesh:
Year: 2017 PMID: 28245591 PMCID: PMC5372508 DOI: 10.3390/ijms18030492
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1A chimeric read with the two segments located on the complementary strands and an 8-nucleotide (nt) overlap. The chimeric read is 101 base pairs (bp) in length. The two segments are mapped to the different strands of chromosome 10 and have an 8-nt overlap. In the sense strand, the sequences of the 8-nt overlap in the two locations are reverse complementary sequences; CTCTATTC and GAATAGAG. The sequence of segments are drawn in red and blue, and the red parts are the sequences of overlap.
Figure 2Scattergraph of number of chimeras or hotspots and chromosome lengths. (a) Scattergraph of number of chimeras and chromosome lengths. The number of chimeras in chromosome X and Y are doubled in the illustration because the other chromosomes are diploid; (b) Scattergraph of number of chimeric hotspots and chromosome lengths.
Average GC content and estimation denaturation temperature of overlaps in chimeras.
| Overlap Length (nt) | Number of Chimeras | Average GC Content | |
|---|---|---|---|
| 1 | 11,992 | 36.77% | 2.74 |
| 2 | 22,292 | 44.40% | 5.78 |
| 3 | 71,262 | 49.88% | 8.99 |
| 4 | 208,729 | 45.31% | 11.62 |
| 5 | 336,932 | 44.26% | 14.43 |
| 6 | 476,409 | 42.84% | 17.14 |
| 7 | 547,136 | 41.82% | 19.86 |
| 8 | 528,555 | 40.53% | 22.49 |
| 9 | 459,454 | 39.19% | 25.05 |
| 10 | 356,470 | 37.82% | 27.56 |
| 11 | 253,356 | 36.17% | 29.96 |
| 12 | 163,502 | 34.35% | 32.24 |
| 13 | 97,992 | 32.71% | 34.51 |
| 14 | 54,711 | 31.08% | 36.70 |
| 15 | 28,218 | 29.93% | 38.98 |
| 16 | 14,630 | 28.59% | 41.15 |
| 17 | 7429 | 28.05% | 43.54 |
| 18 | 4189 | 29.79% | 46.73 |
| 19 | 2310 | 32.13% | 50.21 |
| 20 | 1661 | 33.99% | 53.60 |
| 21 | 1200 | 35.80% | 57.04 |
| 22 | 932 | 37.78% | 60.62 |
| 23 | 740 | 40.11% | 64.45 |
| 24 | 587 | 38.96% | 66.70 |
| 25 | 480 | 40.10% | 70.05 |
Figure 3Chimera and chimeric hotspot distribution of overlap length. (a) Absolute number of chimeras with different overlap lengths; (b) Absolute number of chimeric hotspots with different overlap lengths. Hotspots for 1–2 nt were only screened in chromosome 10, and the numbers representing the whole genome were calculated by multiplying the size difference between chromosome 10 and the whole genome; (c) The ratio between the number of chimeras and hotspots with different overlap lengths.
Figure 4Average overlap GC content in overlap length. (a) Average overlap GC content in chimeras and chimeric hotspots in overlap length; (b) ΔGC between chimeras and chimeric hotspots in different overlap lengths.
Figure 5Chimera and chimeric hotspot distribution in distance of two segments. This scale takes 20 nt as one distance step. (a) Chimera distribution in distance of two segments; (b) Chimeric hotspot distribution in distance of two segments.