Literature DB >> 25477398

Taxonomic affiliation of new genomes should be verified using average nucleotide identity and multilocus phylogenetic analysis.

María José Figueras1, Roxana Beaz-Hidalgo2, Mohammad J Hossain3, Mark R Liles3.   

Abstract

The average nucleotide identity (ANI) determines if two genomes belong to the same species. Using ANI, we detected mislabeled genomes and recommend verifying with ANI and multilocus phylogenetic analysis the species affiliations of the announced genomes. The slightly different results obtained with different ANI calculation software can potentially mislead taxonomic inferences.
Copyright © 2014 Figueras et al.

Entities:  

Year:  2014        PMID: 25477398      PMCID: PMC4256179          DOI: 10.1128/genomeA.00927-14

Source DB:  PubMed          Journal:  Genome Announc


COMMENTARY

The average nucleotide identity (ANI) is a similarity index between a given pair of genomes that can be applicable to prokaryotic organisms independently of their G+C content, and a cutoff score of >95% indicates that they belong to the same species (1, 2). Despite the fact that some of the genomes published in Genome Announcements include ANI values with closely related genomes (3–6), these data are missing in many studies. The genome sequencing studies may use different software packages for ANI determination (3–6). The currently available software tools for ANI calculation include programs that have to be downloaded, like JSpecies (http://www.imedea.uib.es/jspecies) and Gegenees (http://www.gegenees.org/documentation.html), as well as online calculation tools, like the one at the EzGenome (http://www.ezbiocloud.net/ezgenome/ani) and the ANI calculator (http://enve-omics.ce.gatech.edu/ani/index). However, there is a lack of information indicating whether these tools provide consistent results. In a recent study using ANI calculations and a multilocus phylogenetic analysis (MLPA), we discovered that 14 (36%) of the 39 Aeromonas genomes deposited in the GenBank genome database were taxonomically mislabeled and that ANI values can vary using different calculation tools (7). Of the different tools employed, the ANI calculator always provided higher values than the JSpecies and the EzGenome tools that provided very similar results, but the magnitude of the differences depended on the specific genomes that were compared. For instance, the ANI calculator provided results between the genomes Aeromonas hydrophila HZM (GenBank accession no. JEMQ01) and A. hydrophila ATCC 7966T (GenBank accession no. CP000462) of 89.0%, but when using JSpecies, the ANI was 86.6%. These results were <95% (the species ANI cutoff value) and therefore showed that the genome A. hydrophila HZM does not affiliate with A. hydrophila. This was also confirmed by the MLPA, because in the obtained tree, A. hydrophila HZM did not cluster with the type strain of A. hydrophila but with that of Aeromonas caviae instead. In fact, the ANI comparison between the genome of A. hydrophila HZM with that of A. caviae Ae398 (GenBank accession no. CACP01) showed results of >95% (98.4% with ANI calculator and 98.1% with JSpecies), clearly indicating that HZM belongs to the species A. caviae. Typically, the differences between the ANI values obtained with the ANI calculator and JSpecies or EzGenome did not affect the species classification. The only exception might be a borderline ANI value (94.7%) obtained with the ANI calculator for the genomes of Aeromonas veronii AMC34 (GenBank accession no. AGWU01) and A. veronii B565 (GenBank accession no. CP002607) that might lead investigators to conclude that the two genomes belonged to the same species. However, the ANI values determined with JSpecies (93.7%) and EzGenome (93.5%), together with the MLPA results, confirmed that these two strains (AMC34 and B565) do not belong to the same species. Therefore, the use of other ANI determination methods in parallel with the ANI calculator is required to reinforce the correct interpretation of the results in borderline cases. Nevertheless, the ANI enabled us to easily detect wrongly labeled genomes. The mistakes in the species names of deposited genomes are relevant because they may lead to incorrect conclusions in comparative genomic studies. Therefore, these data should motivate researchers to implement measures to prevent introducing taxonomical errors in public genome databases. This commentary alerts authors about the ANI variations in relation to the calculation tool used and of the importance of using multiple approaches to confirm the taxonomic affiliations of announced genomes.
  7 in total

1.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities.

Authors:  Johan Goris; Konstantinos T Konstantinidis; Joel A Klappenbach; Tom Coenye; Peter Vandamme; James M Tiedje
Journal:  Int J Syst Evol Microbiol       Date:  2007-01       Impact factor: 2.747

2.  Shifting the genomic gold standard for the prokaryotic species definition.

Authors:  Michael Richter; Ramon Rosselló-Móra
Journal:  Proc Natl Acad Sci U S A       Date:  2009-10-23       Impact factor: 11.205

3.  Draft Genome Sequence of Commensalibacter papalotli MX01, a Symbiont Identified from the Guts of Overwintering Monarch Butterflies.

Authors:  Luis E Servín-Garcidueñas; Andrés Sánchez-Quinto; Esperanza Martínez-Romero
Journal:  Genome Announc       Date:  2014-03-06

4.  Strategies to avoid wrongly labelled genomes using as example the detected wrong taxonomic affiliation for aeromonas genomes in the GenBank database.

Authors:  Roxana Beaz-Hidalgo; Mohammad J Hossain; Mark R Liles; Maria-Jose Figueras
Journal:  PLoS One       Date:  2015-01-21       Impact factor: 3.240

5.  Draft Genome Sequence of Bacillus pumilus 7P, Isolated from the Soil of the Tatarstan Republic, Russia.

Authors:  Elena I Shagimardanova; Anna A Toymentseva; Nelly P Balaban; Ayslu M Mardanova; Yulia V Danilova; Oleg A Gusev; Elena Kostryukova; Irina Karpova; Aleksandr Manolov; Dmitriy Alexeev; Margarita R Sharipova
Journal:  Genome Announc       Date:  2014-06-12

6.  Draft Genome Sequence of Marine-Derived Actinomycete Nocardiopsis sp. Strain TP-A0876, a Producer of Polyketide Pyrones.

Authors:  Hisayuki Komaki; Natsuko Ichikawa; Akira Hosoyama; Nobuyuki Fujita; Yasuhiro Igarashi
Journal:  Genome Announc       Date:  2014-07-10

7.  Whole-Genome Sequence of Brucella canis Strain SVA13, Isolated from an Infected Dog.

Authors:  Rene Kaden; Joakim Agren; Sevinc Ferrari; Martina Lindberg; Stina Bäckman; Tara Wahab
Journal:  Genome Announc       Date:  2014-07-17
  7 in total
  48 in total

1.  Thalassococcus arenae sp. nov. isolated from sea sand.

Authors:  Veeraya Weerawongwiwat; Jong-Hwa Kim; Jung-Sook Lee; Jung-Hoon Yoon; Ampaitip Sukhoom; Wonyong Kim
Journal:  Arch Microbiol       Date:  2022-06-21       Impact factor: 2.552

2.  Ornithinimicrobium sediminis sp. nov., a novel actinobacterium isolated from a saline lake sediment.

Authors:  Lei Gao; Bao-Zhu Fang; Yong-Hong Liu; Li Li; Yin Huang; Jian-Yu Jiao; André Antunes; Wen-Jun Li
Journal:  Arch Microbiol       Date:  2022-04-23       Impact factor: 2.552

3.  Phylogenomic analysis of the Erwiniaceae supports reclassification of Kalamiella piersonii to Pantoea piersonii comb. nov. and Erwinia gerundensis to the new genus Duffyella gen. nov. as Duffyella gerundensis comb. nov.

Authors:  Craig D Soutar; John Stavrinides
Journal:  Mol Genet Genomics       Date:  2022-01-05       Impact factor: 3.291

4.  Whole genome sequencing of the halophilic Halomonas qaidamensis XH36, a novel species strain with high ectoine production.

Authors:  Tiantian Zhang; Tianqi Cui; Yaning Cao; Yongzhen Li; Fenghui Li; Derui Zhu; Jiangwa Xing
Journal:  Antonie Van Leeuwenhoek       Date:  2022-03-03       Impact factor: 2.271

5.  Draft Genome Sequence of Cupriavidus basilensis SRS, a Bacterium Isolated from Stream Sediments.

Authors:  He Fu; Alex Kugler; Patrick Huyck; Robin L Brigmon; Elizabeth A Ottesen
Journal:  Microbiol Resour Announc       Date:  2022-09-08

6.  Identification and genome analysis of Deinococcus actinosclerus SJTR1, a novel 17β-estradiol degradation bacterium.

Authors:  Weiliang Xiong; Wanli Peng; Rubing Liang
Journal:  3 Biotech       Date:  2018-10-01       Impact factor: 2.406

7.  A High-Throughput Short Sequence Typing Scheme for Serratia marcescens Pure Culture and Environmental DNA.

Authors:  Thibault Bourdin; Alizée Monnier; Marie-Ève Benoit; Emilie Bédard; Michèle Prévost; Caroline Quach; Eric Déziel; Philippe Constant
Journal:  Appl Environ Microbiol       Date:  2021-09-29       Impact factor: 5.005

8.  Comparative genomics of a cannabis pathogen reveals insight into the evolution of pathogenicity in Xanthomonas.

Authors:  Jonathan M Jacobs; Céline Pesce; Pierre Lefeuvre; Ralf Koebnik
Journal:  Front Plant Sci       Date:  2015-06-16       Impact factor: 5.753

9.  Draft genome sequence of Sphingomonas paucimobilis strain LCT-SP1 isolated from the Shenzhou X spacecraft of China.

Authors:  Lei Pan; Hong Zhou; Jia Li; Bing Huang; Jun Guo; Xue-Lin Zhang; Long-Cheng Gao; Chou Xu; Chang-Ting Liu
Journal:  Stand Genomic Sci       Date:  2016-02-24

10.  Comprehensive genome analyses of Sellimonas intestinalis, a potential biomarker of homeostasis gut recovery.

Authors:  Marina Muñoz; Enzo Guerrero-Araya; Catalina Cortés-Tapia; Angela Plaza-Garrido; Trevor D Lawley; Daniel Paredes-Sabja
Journal:  Microb Genom       Date:  2020-11-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.