| Literature DB >> 25478564 |
Momchilo Vuyisich1, Ayesha Arefin1, Karen Davenport1, Shihai Feng1, Cheryl Gleasner1, Kim McMurry1, Beverly Parson-Quintana1, Jennifer Price2, Matthew Scholz1, Patrick Chain1.
Abstract
Sequencing bacterial genomes has traditionally required large amounts of genomic DNA (~1 μg). There have been few studies to determine the effects of the input DNA amount or library preparation method on the quality of sequencing data. Several new commercially available library preparation methods enable shotgun sequencing from as little as 1 ng of input DNA. In this study, we evaluated the NEBNext Ultra library preparation reagents for sequencing bacterial genomes. We have evaluated the utility of NEBNext Ultra for resequencing and de novo assembly of four bacterial genomes and compared its performance with the TruSeq library preparation kit. The NEBNext Ultra reagents enable high quality resequencing and de novo assembly of a variety of bacterial genomes when using 100 ng of input genomic DNA. For the two most challenging genomes (Burkholderia spp.), which have the highest GC content and are the longest, we also show that the quality of both resequencing and de novo assembly is not decreased when only 10 ng of input genomic DNA is used.Entities:
Year: 2014 PMID: 25478564 PMCID: PMC4247979 DOI: 10.1155/2014/434575
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1Evenness of coverage for B. anthracis (a) and E. coli (b) genome sequencing using NEBNext library preparation with 100 ng DNA input (top graph) and TruSeq library preparation with 1,000 ng DNA input (bottom graph). Plots are normalized by the average coverage in each figure.
Figure 2Evenness of coverage plots for B. thailandensis A ((a) is chromosome 1 and (b) is chromosome 2) and B. thailandensis B ((c) is chromosome 1 and (d) is chromosome 2) genome sequencing. Top graph within each panel shows NEBNext library preparation data with 10 ng or 100 ng DNA input (black color is for 10 ng samples and red color is for 100 ng samples). Bottom graph within each panel shows TruSeq library preparation data with 1,000 ng DNA input. Plots are normalized by the average coverage in each figure.
Comparison of gap counts by organism, replicon, and library preparation method. Input DNA amount for each sample follows the name of the library preparation method.
| Gap counts for | |||||
|---|---|---|---|---|---|
| Library preparation method-DNA input | Reference | Replicon length, bp | Cutoff based on average coverage | ||
| 0 | 1% | 10% | |||
| NEBNext-100 ng | Plasmid 1 | 109,274 | 0 | 0 | 24 |
| Plasmid 2 | 74,213 | 45 | 145 | 318 | |
| Plasmid 3 | 1,549 | 0 | 0 | 8 | |
| Chromosome | 5,253,138 | 26 | 206 | 1262 | |
| TruSeq-1000 ng | Plasmid | 109,274 | 0 | 1 | 49 |
| Plasmid | 74,213 | 44 | 200 | 491 | |
| Plasmid | 1,549 | 0 | 0 | 25 | |
| Chromosome | 5,253,138 | 23 | 449 | 2833 | |
|
| |||||
| Gap counts for | |||||
| Library preparation method-DNA input | Reference | Replicon length, bp | Cutoff based on average coverage | ||
| 0 | 1% | 10% | |||
|
| |||||
| NEBNext-10 ng | Chromosome 1 | 3,805,980 | 47 | 72 | 329 |
| Chromosome 2 | 2,870,750 | 86 | 125 | 498 | |
| NEBNext-100 ng | Chromosome 1 | 3,805,980 | 3 | 35 | 669 |
| Chromosome 2 | 2,870,750 | 78 | 151 | 1625 | |
| TruSeq-1000 ng | Chromosome 1 | 3,805,980 | 0 | 153 | 2747 |
| Chromosome 2 | 2,870,750 | 30 | 135 | 1937 | |
|
| |||||
| Gap counts for | |||||
| Library preparation method-DNA input | Reference | Replicon length, bp | Cutoff based on average coverage | ||
| 0 | 1% | 10% | |||
|
| |||||
| NEBNext-10 ng | Chromosome 1 | 3,805,980 | 37 | 158 | 1297 |
| Chromosome 2 | 2,870,750 | 19 | 38 | 717 | |
| NEBNext-100 ng | Chromosome 1 | 3,805,980 | 0 | 0 | 0 |
| Chromosome 2 | 2,870,750 | 6 | 158 | 2328 | |
| TruSeq-1000 ng | Chromosome 1 | 3,805,980 | 0 | 114 | 1874 |
| Chromosome 2 | 2,870,750 | 0 | 28 | 1177 | |
Figure 3Box and whisker plots of the variation of coverage when mapping reads to the reference genomes. Variation was calculated for nonoverlapping 10 kbp windows. Evenness of coverage is calculated as 1 − (standard deviation of coverage/median coverage).
Figure 4(a) IDBA assemblies of bacterial genomes using only Illumina short insert paired data. (b) Allpaths assemblies of bacterial genomes using Illumina short insert paired data and 454 long insert paired data.
Genome assembly statistics and comparison to reference genomes for insertions/deletions and rearrangements.
| Organism | Assembly method | Library preparation method | Starting DNA quantity, ng | # of contigs | Total bps | Relative to the reference genomes | |||
|---|---|---|---|---|---|---|---|---|---|
| Deletions | Insertions | Total indels | Rearrangements | ||||||
|
| Allpaths | NEBNext | 100 | 21 | 5,358,817 | 170 | 6,518 | 6,688 | 80 |
| TruSeq | 1000 | 23 | 5,348,296 | 170 | 50,762 | 50,932 | 92 | ||
| IDBA | NEBNext | 100 | 48 | 5,349,320 | 1,525 | 16,827 | 18,352 | 102 | |
| TruSeq | 1000 | 41 | 5,483,709 | 87,019 | 1,838 | 88,857 | 104 | ||
|
| |||||||||
|
| Allpaths | NEBNext | 100 | 82 | 5,312,046 | 758 | 48,753 | 49,511 | 222 |
| TruSeq | 1000 | 96 | 5,269,849 | 760 | 80,227 | 80,987 | 238 | ||
| IDBA | NEBNext | 100 | 199 | 5,216,101 | 111 | 152,190 | 152,301 | 414 | |
| TruSeq | 1000 | 207 | 5,242,710 | 224 | 75,926 | 76,150 | 446 | ||
|
| |||||||||
|
| Allpaths | NEBNext | 10 | 26 | 6,652,405 | 100 | 12,213 | 12,313 | 80 |
| NEBNext | 100 | 27 | 6,650,888 | 143 | 13,042 | 13,185 | 154 | ||
| TruSeq | 1000 | 77 | 6,603,636 | 6,294 | 27,237 | 33,531 | 252 | ||
| IDBA | NEBNext | 10 | 119 | 6,575,406 | 206 | 38,419 | 38,625 | 276 | |
| NEBNext | 100 | 117 | 6,575,500 | 351 | 38,382 | 38,733 | 280 | ||
| TruSeq | 1000 | 271 | 6,582,658 | 8,320 | 35,337 | 43,657 | 586 | ||
|
| |||||||||
|
| Allpaths | NEBNext | 10 | 23 | 6,660,010 | 144 | 12,576 | 12,720 | 82 |
| NEBNext | 100 | 27 | 6,651,385 | 163 | 12,278 | 12,441 | 86 | ||
| TruSeq | 1000 | 53 | 6,655,446 | 157 | 16,517 | 16,674 | 204 | ||
| IDBA | NEBNext | 10 | 129 | 6,579,849 | 749 | 45,201 | 45,950 | 264 | |
| NEBNext | 100 | 125 | 6,579,749 | 236 | 44,279 | 44,515 | 262 | ||
| TruSeq | 1000 | 281 | 6,587,931 | 8,694 | 45,997 | 54,691 | 612 | ||