| Literature DB >> 25607802 |
Roxana Beaz-Hidalgo1, Mohammad J Hossain2, Mark R Liles2, Maria-Jose Figueras1.
Abstract
Around 27,000 prokaryote genomes are presently deposited in the Genome database of GenBank at the National Center for Biotechnology Information (NCBI) and this number is exponentially growing. However, it is not known how many of these genomes correspond correctly to their designated taxon. The taxonomic affiliation of 44 Aeromonas genomes (only five of these are type strains) deposited at the NCBI was determined by a multilocus phylogenetic analysis (MLPA) and by pairwise average nucleotide identity (ANI). Discordant results in relation to taxa assignation were found for 14 (35.9%) of the 39 non-type strain genomes on the basis of both the MLPA and ANI results. Data presented in this study also demonstrated that if the genome of the type strain is not available, a genome of the same species correctly identified can be used as a reference for ANI calculations. Of the three ANI calculating tools compared (ANI calculator, EzGenome and JSpecies), EzGenome and JSpecies provided very similar results. However, the ANI calculator provided higher intra- and inter-species values than the other two tools (differences within the ranges 0.06-0.82% and 0.92-3.38%, respectively). Nevertheless each of these tools produced the same species classification for the studied Aeromonas genomes. To avoid possible misinterpretations with the ANI calculator, particularly when values are at the borderline of the 95% cutoff, one of the other calculation tools (EzGenome or JSpecies) should be used in combination. It is recommended that once a genome sequence is obtained the correct taxonomic affiliation is verified using ANI or a MLPA before it is submitted to the NCBI and that researchers should amend the existing taxonomic errors present in databases.Entities:
Mesh:
Year: 2015 PMID: 25607802 PMCID: PMC4301921 DOI: 10.1371/journal.pone.0115813
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Intra and inter-species ANI values for the 44 Aeromonas genomes and identification on the basis of the MLPA and ANI (mislabeled genomes in bold).
|
| ||||
|---|---|---|---|---|
|
|
|
|
|
|
|
| ||||
|
| Canned milk (US) | 97.14 | 100 |
|
|
| Diseased outbreak in catfish | 100 | 97.14 |
|
|
| Diseased catfish (USA) | 99.99 | 97.24 |
|
|
| Soil of catfish pond (USA) | 99.99 | 97.30 |
|
|
| Clinical strain (Malaysia) | 97.13 | 97.30 |
|
|
| Kidney of a moribund carp (Japan) | 97.20 | 97.22 |
|
|
| Kidney of moribund salmon (Korea) | 97.11 | 97.10 |
|
|
| Wetland sediments (USA) | 97.13 | 97.16 |
|
|
| Necroziting fascitiis (USA) | 96.94 | 97.29 |
|
|
| Necroziting fascitiis (USA) | 96.79 | 97.15 |
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Blood of a patient with septicemia and necrotizing fasciitis (Taiwan) | 93.73 | 97.52 |
|
|
| Sludge of a dairy treatment plant (India) | 93.71 | 97.57 |
|
|
| ||||
|
| Aquaculture pond sediment (China) | 88.56 | 100 |
|
|
| Human blood (USA) | 88.38 | 96.76 |
|
|
| Human blood (USA) | 88.35 | 99.72 |
|
|
| Under eye wound (USA) | 88.33 | 97.21 |
|
|
| Leech digestive tract (France) | 88.47 | 96.83 |
|
|
| Fish intestine (India) | 88.51 | 96.75 |
|
|
| Stool (Malaysia) | 88.40 | 96.75 |
|
|
|
|
|
| “ |
|
| ||||
|
| Brown trout with furunculosis (France) | 88.23 | 100 |
|
|
| Infected brook trout (Canada) | 88.28 | 99.98 |
|
|
| Heavily polluted river (Argentina) | 88.33 | 97.40 |
|
|
| Not specified (Japan) | 88.35 | 99.82 |
|
|
| Ulcer from crucian carp (Korea) | 88.57 | 99.79 |
|
|
| ||||
|
| Feces from a 17month old boy with gastroenteritis (Brazil) | 89.00 | 100 |
|
|
| Compost (Malaysia) | 89.02 | 98.27 |
|
|
|
|
|
|
|
|
| ||||
|
| Lake (China) | 88.55 | 100 |
|
|
|
|
|
| “ |
|
| River water (USA) | 88.61 | ND |
|
|
| Leg wound (USA) | 84.39 | ND |
|
|
| Bivalve molluscs (Spain) | 85.40 | ND |
|
|
| Wound (Taiwan) | 88.85 | ND |
|
|
| Cerebrospinal fluid (Brazil) | 88.17 | ND |
|
aGenomes also included in the EZ-Genome database up to 23 July 2013 (http://www.ezbiocloud.net/ezgenome/browse_db). Given genomes names at the Ez-Genome for b-e:
b A. hydrophila
c A. veronii
d A. cavernicola.
e A. enteropelogenes (a later synonym of A. trota). ND, No data can be calculated because no other genome of this species was available.
Figure 1Phylogenetic tree constructed with partial sequences of six concatenated housekeeping genes (gyrB, rpoD, recA, dnaJ, gyrA, dnaX, 3879 bp) from 41 Aeromonas genomes (in bold as well as their species affiliation) and from 48 strains representing the 27 species included in the genus.
Sequences of three genomes (Aeromonas sp. 159, A. enteropelogenes (A. trota) 1991CR and A. salmonicida subsp. salmonicida 34melT) are not included in the tree as some genes were missing. Abbreviations for the species names: A. hyd., A. hydrophila; A. dhak., A. dhakensis; A. cav., A.caviae; A. jan., A. jandaei; A. ver., A. veronii; A. med., A. media; A. salm., A. salmonicida.
ANI results in function of the employed calculating tool of 15 selected Aeromonas genomes against A. hydrophila strain ML09-119.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
|
|
| 96.94 | 96.77 | 96.77 | 96.53 |
|
|
| 96.79 | 96.61 | 96.63 | 96.05 |
|
|
| 93.73 | 92.75 | 92.76 | 93.12 |
|
|
| 93.76 | 92.80 | 92.82 | 93.13 |
|
|
| 93.77 | 92.80 | 92.85 | 93.22 |
|
|
| 88.38 | 85.29 | 85.31 | 87.62 |
|
|
| 88.47 | 85.29 | 85.31 | 87.59 |
|
|
| 88.51 | 85.58 | 85.60 | 87.67 |
|
|
| 88.40 | 85.02 | 85.04 | 87.58 |
|
| “ | 88.54 | 85.22 | 85.23 | 87.84 |
|
|
| 88.33 | 86.45 | 86.43 | 86.43 |
|
|
| 88.35 | 86.28 | 86.40 | 87.78 |
|
|
| 89.02 | 86.18 | 86.19 | 88.21 |
|
|
| 89.22 | 86.24 | 86.26 | 88.25 |
|
| “ | 88.48 | 86.95 | 87.76 | 85.85 |
ANI results in function of the employed calculating tool for 15 selected Aeromonas genomes against a representative one from the same species.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
|
|
| 97.29 | 96.84 | 96.94 | 96.76 |
|
|
| 97.15 | 96.77 | 96.77 | 96.66 |
|
| |||||
|
|
| 97.52 | 97.12 | 97.24 | 97.09 |
|
|
| 97.53 | 97.34 | 97.30 | 97.36 |
|
|
| 97.54 | 97.22 | 97.29 | 97.09 |
|
| |||||
|
|
| 96.76 | 96.36 | 96.41 | 96.52 |
|
|
| 96.83 | 96.60 | 96.38 | 96.38 |
|
|
| 96.75 | 96.23 | 96.30 | 96.48 |
|
|
| 96.75 | 96.26 | 96.15 | 96.27 |
|
| “ | 94.68 | 93.51 | 93.68 | 94.17 |
|
| |||||
|
|
| 97.40 | 96.88 | 96.98 | 96.98 |
|
|
| 99.82 | 99.76 | 99.73 | 99.51 |
|
| |||||
|
|
| 98.27 | 98.12 | 97.86 | 97.70 |
|
|
| 98.44 | 98.13 | 98.13 | 98.07 |
|
| |||||
|
| “ | 93.92 | 93.10 | 93.10 | 93.25 |