| Literature DB >> 29320804 |
Aimilia A Stavrou1,2, Verónica Mixão3,4, Teun Boekhout1,2, Toni Gabaldón3,4,5.
Abstract
Online sequence databases such as NCBI GenBank serve as a tremendously useful platform for researchers to share and reuse published data. However, submission systems lack control for errors such as organism misidentification, which once entered in the database can be propagated and mislead downstream analyses. Here we present an illustrating case of misidentification of Candida albicans from a clinical sample as Naumovozyma dairenensis based on whole-genome shotgun data. Analyses of phylogenetic markers, read mapping and single nucleotide polymorphisms served to correct the identification. We propose that the routine use of such analyses could help to detect misidentifications arising from unsupervised analyses and correct them before they enter the databases. Finally, we discuss broader implications of such misidentifications and the difficulty of correcting them once they are in the records.Entities:
Keywords: Candida albicans; Naumovozyma dairenensis; misidentification; public databases
Mesh:
Year: 2018 PMID: 29320804 PMCID: PMC6001429 DOI: 10.1002/yea.3303
Source DB: PubMed Journal: Yeast ISSN: 0749-503X Impact factor: 3.239
BLASTn results for Naumovozyma dairenensis CBS 421 against Naumovozyma dairenensis strain 763_NDAI
| Strain | Locus | Accession number | NCBI database | Query coverage | Identity |
|---|---|---|---|---|---|
| CBS 421 | Partial rDNA | AJ229072 | WGS | 37% | 88% |
| CBS 421 | Actin1 | AF527937 | WGS | 100% | 86% |
| CBS 421 | RPB2 | AF527908 | WGS | 97% | 67% |
| CBS 421 | TEF1 | AF402046 | WGS | 99% | 89% |
WGS = Whole‐genome shotgun.
Blastn results for aligned regions of Naumovozyma dairenensis strain 763_NDAI against the Nucleotide database
| Locus | NCBI Database | Species | Query Coverage | Identity |
|---|---|---|---|---|
| Partial rDNA | Nucleotide |
| 100% | 100% |
| Actin1 | Nucleotide |
| 100% | 99% |
| RPB2 | Nucleotide |
| 100% | 100% |
| TEF1a | Nucleotide |
| 100% | 99% |