| Literature DB >> 26496746 |
Robert M Bowers1, Alicia Clum2, Hope Tice3, Joanne Lim4, Kanwar Singh5, Doina Ciobanu6, Chew Yee Ngan7, Jan-Fang Cheng8, Susannah G Tringe9, Tanja Woyke10.
Abstract
BACKGROUND: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composition and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for large initial DNA template quantities during library preparation.Entities:
Mesh:
Year: 2015 PMID: 26496746 PMCID: PMC4619416 DOI: 10.1186/s12864-015-2063-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Sample overview. Each tube on this plot represents a mock metagenomic library preparation. The control library is an unamplified TruSeq library of the same mock community sample generated from 200 ng input DNA
Fig. 2a Insert size and (b) GC profiles of Illumina sequence data from each of the three different library preparation methods. Unamplified control library is represented by the red dashed line
Fig. 3Relative abundances of each mock community member and the unmapped reads with the exception of Nocardiopsis dassonvillei (too few sequences mapped to reference). Individual GC plots corresponding to the high and low GC references are displayed in Additional file 1: Figure S1, which further illustrates the shift in relative abundance from low to high GC organisms across the dilution series. Unamplified control library is represented by the red diamond
Fig. 4a Mock community relative abundances across each library prep kit and across each dilution including the TruSeq 200 ng Control library (). b Principal coordinates analyses of Euclidean distances derived from mapping reads to the mock community reference genomes () and of k-mer frequencies (, averaged sample k-mer frequencies from k2-k10 sample by sample k-mer distance matrices). Individual samples are colored by either library preparation () or starting input quantity (). The unamplified 200 ng control library is represented by the red point in each ordination
Fig. 5Heatmap noting the completeness of genomic bins extracted from each of the low input metagenomes. The color bar on top of the figure refers to each of the three tested library types and control library (Control = red, Nextera XT = green, Mondrian = blue and MALBAC = yellow). Samples are also arranged where the highest input quantity is located on the left side of each library type