| Literature DB >> 30018864 |
Keri Ann Lydon1, Erin K Lipp1.
Abstract
Next-generation sequencing has provided powerful tools to conduct microbial ecology studies. Analysis of community composition relies on annotated databases of curated sequences to provide taxonomic assignments; however, these databases occasionally have errors with implications for downstream analyses. Systemic taxonomic errors were discovered in Greengenes database (v13_5 and 13_8) related to orders Vibrionales and Alteromonadales. These orders have family level annotations that were erroneous at least one taxonomic level, e.g., 100% of sequences assigned to the Pseudoalteromonadaceae family were placed improperly in Vibrionales (rather than Alteromonadales) and >20% of these sequences were indeed Vibrio spp. but were improperly assigned to the Pseudoalteromonadaceae family (rather than to Vibrionaceae). Use of this database is common; we identified 68 peer-reviewed papers since 2013 that likely included erroneous annotations specifically associated with Vibrionales and Pseudoalteromonadaceae, with 20 explicitly stating the incorrect taxonomy. Erroneous assignments using these specific versions of Greengenes can lead to incorrect conclusions, especially in marine systems where these taxa are commonly encountered as conditionally rare organisms and potential pathogens.Entities:
Keywords: 16S rRNA gene; Alteromonadales; Greengenes; Marine microbiology; Microbial ecology; Next-generation sequencing; Pseudoalteromonadaceae; Taxonomy; Vibrionaceae; Vibrionales
Year: 2018 PMID: 30018864 PMCID: PMC6044269 DOI: 10.7717/peerj.5248
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Phylogenetic tree of Pseudoalteromonadaceae representative sequences in the Greengenes database (n = 164).
Each branch is labeled by the assigned ID in the representative sequences within Greengenes. Color strips indicated the assigned taxonomy (Order and Family) from different curated databases: Greengenes (GG), SILVA, RDP, and NCBI. The accepted Order for the family Pseudoalteromonadaceae is Alteromonadales (Ivanova, Ng & Webb, 2014) not Vibrionales, as featured in all Greengenes taxonomic assignments.
Incorrect taxonomic annotation of Pseudoalteromonadaceae in Vibrionales as published in peer reviewed literature.
| Search term queries | Greengenes + Vibrionales + Pseudoalteromonadaceae | Greengenes + Vibrionales | Total |
|---|---|---|---|
| All papers fitting search criteria | 22 | 63 | 85 |
| Papers confirmed using Greengenes versions 13_5 or 13_8 with known taxonomy errors | 14 | 41 | 55 |
| Papers explicitly stating taxonomic mismatch (in text or supplemental material) | 10 | 10 | 20 |
| Papers stating correct assignment or using earlier version of Greengenes | 7 | 10 | 17 |
| Papers using Greengenes but with no information on database version. | 1 | 12 | 13 |
Notes:
Search was conducted between March 23 and 28, 2018 using Google Scholar. Search results included in the analysis were peer-reviewed papers published between 2013 and 2018 that were accessible in English and used a 16S rRNA gene next-generation sequencing approach.
Counts do not include papers appearing in the “Greengenes + Vibrionales + Pseudoalteromonadaceae” search.
These papers are also included in the tally for papers using Greengenes 13_5 or 13_8.