| Literature DB >> 25653722 |
Giovanni Bacci1, Alessia Bani2, Marco Bazzicalupo2, Maria Teresa Ceccherini3, Marco Galardini2, Paolo Nannipieri3, Giacomo Pietramellara3, Alessio Mengoni2.
Abstract
Here we report a benchmark of the effect of bootstrap cut-off values of the RDP Classifier tool in terms of data retention along the different taxonomic ranks by using Illumina reads. Results provide guidelines for planning sequencing depths and selection of bootstrap cut-off in taxonomic assignments.Entities:
Keywords: 16S rRNA; OTU clustering; bacterial communities.; metabarcoding; ribosomal database project
Year: 2015 PMID: 25653722 PMCID: PMC4316179 DOI: 10.7150/jgen.9204
Source DB: PubMed Journal: J Genomics
Description of the datasets used*.
| BioProject* | rRNA region | Average reads length | Number of samples | Average number of | Environment |
|---|---|---|---|---|---|
| PRJEB6047 | V3 | 302bp | 72 | 61023 | Subgingival, supragingival, and tongue plaque from healthy and periodontal subjects |
| PRJNA245381 | V3 | 300bp | 100 | 28634 | Soil contaminated with increasing level of ionic Ag |
| PRJNA217938 | V4 | 288bp | 25 | 476230 | Samples from the surface to depth in Upper Mystic Lake, Winchester, MA |
| PRJNA238275 | V4 | 251bp | 6 | 759518 | Soil associated with the rhizosphere of the coffee plant ( |
| PRJNA188383 | V6 | 200bp | 48 | 66887 | Seawater and surface sediments retrieved from the Arctic Ocean |
* The ID of the accession (http://www.ncbi.nlm.nih.gov/bioproject/), the variable region sequenced, the type of reads, the number of different samples analyzed and the number of reads is shown.
Figure 1Effect of bootstrap cut-off thresholds on the number of reads. The percentage of trimmed reads assigned to each taxonomic level is reported versus RDP bootstrap cut-off values. Shaded lines correspond to the 95% confidence interval assuming normality.
Figure 2Percentage of assigned reads with respect to bootstrap cut-off thresholds at the genus level. Plots report the assigned reads for all dataset analyzed. Shaded lines correspond to the 95% confidence interval assuming normality.