| Literature DB >> 29373999 |
Garold Fuks1, Michael Elgart2, Amnon Amir3, Amit Zeisel4, Peter J Turnbaugh5, Yoav Soen2, Noam Shental6.
Abstract
BACKGROUND: Most of our knowledge about the remarkable microbial diversity on Earth comes from sequencing the 16S rRNA gene. The use of next-generation sequencing methods has increased sample number and sequencing depth, but the read length of the most widely used sequencing platforms today is quite short, requiring the researcher to choose a subset of the gene to sequence (typically 16-33% of the total length). Thus, many bacteria may share the same amplified region, and the resolution of profiling is inherently limited. Platforms that offer ultra-long read lengths, whole genome shotgun sequencing approaches, and computational frameworks formerly suggested by us and by others all allow different ways to circumvent this problem yet suffer various shortcomings. There is a need for a simple and low-cost 16S rRNA gene-based profiling approach that harnesses the short read length to provide a much larger coverage of the gene to allow for high resolution, even in harsh conditions of low bacterial biomass and fragmented DNA.Entities:
Keywords: 16S rRNA gene; High resolution; Microbial profiling; Microbiome
Mesh:
Substances:
Year: 2018 PMID: 29373999 PMCID: PMC5787238 DOI: 10.1186/s40168-017-0396-x
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1A schematic description of SMURF. a A comparison between single and multiple region profiling. b The SMURF flow diagram describing the internal steps of a typical analysis
The set of six primer pairs used in this study. The number of Greengenes sequences (out of the total 1,402,801 sequences) that have a perfect match with each primer pair. The primers’ sequences and their median start locations along the forward strand are shown. The set of primers was used for profiling our mock mixture, the D. melanogaster samples, and for in silico simulations
| Region | # amplified | Forward primer | Reverse primer | ||
|---|---|---|---|---|---|
| Sequence | Position | Sequence | Position | ||
| 1 | 196415 | 5′-TGGCGGACGGGTGAGTAA-3′ | 74 | 5′- CTGCTGCCTCCCGTAGGA-3′ | 315 |
| 2 | 1122118 | 5′-TCCTACGGGAGGCAGCAG-3′ | 316 | 5′- TATTACCGCGGCTGCTGG-3′ | 484 |
| 3 | 660912 | 5′-CAGCAGCCGCGGTAATAC-3′ | 486 | 5′- CGCATTTCACCGCTACAC-3′ | 650 |
| 4 | 660342 | 5′-AGGATTAGATACCCTGGT-3′ | 752 | 5′- GAATTAAACCACATGCTC-3′ | 911 |
| 5 | 591604 | 5′-GCACAAGCGGTGGAGCAT-3′ | 901 | 5′- CGCTCGTTGCGGGACTTA-3′ | 1057 |
| 6 | 783882 | 5′-AGGAAGGTGGGGATGACG-3′ | 1143 | 5′- CCCGGGAACGTATTCACC-3′ | 1336 |
Fig. 2Theoretical resolution and in silico simulation results. a The resolution for one region and six regions. b Weighted precision/recall of simulated communities in six regions, one region, and V4
Fig. 3Experimental mock mixture. a Frequency (percent) of correctly detected bacteria and of false positive detections (one and six regions). b Ambiguity as a measure of resolution (one and six regions). c The average ambiguity (~ 100fold smaller for six regions vs. one region)
Fig. 4Re-analysis of an HMP “even” mock mixture. a Reconstruction of either V1-V3 or V6-V9 or the two regions jointly. b Ambiguity when using V1-V3 or V6-V9 regions and by their combination. c Average ambiguity is reduced ~ 2-fold when combining the two regions
Fig. 5Reconstruction of bacterial populations in D. melanogaster following toxic treatment. a Naïve flies using six regions (left) and one region (right). b Same for flies reared on medium containing the G418 toxin. c Ambiguity of the three most abundant species. Profiling based on a single region (hollow bars) and six regions (full bars). d Similarity between L. plantarum predicted strains and the Sanger sequence of the strain isolated from the detected colony (top 100 sequences)