| Literature DB >> 25887558 |
Hiroshi Shinozuka1,2, Noel O I Cogan3,4, Maiko Shinozuka5,6, Alexis Marshall7,8, Pippa Kay9, Yi-Han Lin10,11, German C Spangenberg12,13,14, John W Forster15,16,17.
Abstract
BACKGROUND: Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25887558 PMCID: PMC4396059 DOI: 10.1186/s12896-015-0139-7
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Figure 1MspJI-enzymatic digestion of 5mC-containing PCR and Φ29 products. (a) DNA fragments amplified with the locus-specific PCR primers for the Agro_gc50 sequence under the presence of 5-methyl-dCTP (0, 2, 4 or 8 μM). (b) MspJI-digested DNA fragments derived from PCR products with each locus-specific primers and Agro gDNA as DNA template. Molar concentration denotes the 5-methyl-dCTP-concentration in PCR solution. (c) MspJI-enzymatic digestion of Φ29 enzyme-amplified DNA with randomly incorporated 5mC from a range of DNA templates. 0, 10, 15 and 20 μM denote final concentrations of 5-methyl-dCTP in the REPLI-g WGA mixture.
Figure 2Illumina MiSeq short read-sequencing results of the libraries constructed from MspJI-digested and physically sheared DNA. The sorted alignment was visualised using the Tablet viewers. (a) Alignment results of Φ29 enzyme-amplified Agro gDNA-derived reads and physically sheared Agro gDNA-derived reads to the reference Agrobacterium genome sequences for the 1,000-1,250 and 2,000-2250 kb regions of the circular and linear chromosomes. (b) Alignment results of the Arabidopsis genome with the MspJI-enzymatic fragmentation and physical shearing methods. Read coverage depth for the 10,000-10,250, 20,000-20,250 and 30,000-30,250 kb regions of chromosome 1 was visualised.
Result of and genome resequencing using the two different DNA fragmentation methods
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| REPLI-g/MspJI | Circular | 1,380,029 | 99.9 | 64.3 |
| Linear | 1,219,389 | 99.9 | 77.8 | ||
| Physical shearing | Circular | 546,292 | 99.8 | 27.4 | |
| Linear | 496,662 | 99.9 | 34.1 | ||
|
| REPLI-g/MspJI | 1 | 4,611,463 | 97.3 | 36.4 |
| 2 | 4,759,134 | 98 | 57.7 | ||
| 3 | 4,702,657 | 98.3 | 47.9 | ||
| 4 | 3,088,231 | 98 | 39.9 | ||
| 5 | 4,222,266 | 98 | 37.8 | ||
| Physical shearing | 1 | 5,116,095 | 98.9 | 41.5 | |
| 2 | 6,143,569 | 99.5 | 75 | ||
| 3 | 6,083,236 | 99.5 | 62.4 | ||
| 4 | 3,776,130 | 99.5 | 49.3 | ||
| 5 | 4,980,370 | 99.4 | 45.1 |
Ratio of coverage denotes the value for specific nucleotides located on each chromosome.
Figure 3Short read-sequencing results of the libraries constructed from MspJI, FspEI and LpnPI-digested DNA. Read coverage depth was visualised using the Tablet viewer.
Figure 4BAC clone sequencing based on the MspJI-based DNA fragmentation approach. (a) Amplified BAC DNA directly from glycerol stocks, using the Φ29 enzyme. In the amplification solution, 30 μM 5-methyl-dCTP were included. (b) Resequencing results of the BAC clones. Read coverage depth for each clone was visualised on the Tablet software.
Resequencing results of the BAC clones
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| MXK3 | 81 kb | 395,498 | 27,585 (7.0%) | 99.6% | 82.3 | 7086 | 98.6% | 21.3 |
| F20D21 | 143 kb | 558,276 | 86,861 (15.6%) | 99.8% | 149.4 | 15382 | 99.2% | 26.5 |
| F20B17 | 90 kb | 478,300 | 46,409 (9.7%) | >99.9% | 126.2 | 9173 | 99.9% | 24.9 |
Ratio of coverage denotes the value for specific nucleotides located on each clone. A subset of 100,000 reads was obtained using the Seqtk software.
Figure 5Sequencing of the fungal endophyte genome with the MspJI-based DNA fragmentation method. Two independent experiments (rep1 and rep2) were performed using the single endophyte strain. (a) WGA of the endophyte genome with the Φ29 enzyme in the presence of 5-methyl-dCTP (X μM). (b) MspJI digestion of the endophyte genome-derived amplicons. (c) Flowchart of the MspJI- and physical shearing-based library prep procedures from WGA products.
Resequencing results of the ryegrass endophyte genome
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| gi|347366940 | 1,598,175 | 81,237 | 88.7 | 10.1 | 749 | 1.5 |
| gi|347366939 | 1,323,136 | 59,445 | 86.7 | 8.8 | 304 | 1.4 |
UI denotes the unique identifier of the NCBI GenBank. Short read sequencing data from two experiments (rep1 and rep2) were combined for read alignment.
Figure 6Resequencing of PCR amplicons with the MspJI-based DNA fragmentation method. (a) Pooled PCR amplicons containing 5mC. Locus-specific amplification was performed for each sequence independently. (b) MspJI digestion of the PCR amplicons. (c) Flowchart of the MspJI- and physical shearing-based library prep procedures from PCR amplicons.