Michael Richter1, Ramon Rosselló-Móra2, Frank Oliver Glöckner3, Jörg Peplies1. 1. Ribocon GmbH, Fahrenheitstraße. 1, 28359 Bremen, Germany. 2. Marine Microbiology Group, Mediterranean Institute for Advanced Studies, C/ Miquel Marqués, 21, 07190, Esporles, Spain. 3. Max Planck Institute for Marine Microbiology, Microbial Genomics and Bioinformatics Research Group, Celsiusstraße 1, 28359 Bremen, Germany and Jacobs University gGmbH Bremen, Department of Life Sciences and Chemistry, Campus Ring 1, 28759 Bremen, Germany.
Abstract
UNLABELLED: JSpecies Web Server (JSpeciesWS) is a user-friendly online service for in silico calculating the extent of identity between two genomes, a parameter routinely used in the process of polyphasic microbial species circumscription. The service measures the average nucleotide identity (ANI) based on BLAST+ (ANIb) and MUMmer (ANIm), as well as correlation indexes of tetra-nucleotide signatures (Tetra). In addition, it provides a Tetra Correlation Search function, which allows to rapidly compare selected genomes against a continuously updated reference database with currently about 32 000 published whole and draft genome sequences. For comparison, own genomes can be uploaded and references can be selected from the JSpeciesWS reference database. The service indicates whether two genomes share genomic identities above or below the species embracing thresholds, and serves as a fast way to allocate unknown genomes in the frame of the hitherto sequenced species. AVAILABILITY AND IMPLEMENTATION: JSpeciesWS is available at http://jspecies.ribohost.com/jspeciesws SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. CONTACT: mrichter@ribocon.com.
UNLABELLED: JSpecies Web Server (JSpeciesWS) is a user-friendly online service for in silico calculating the extent of identity between two genomes, a parameter routinely used in the process of polyphasic microbial species circumscription. The service measures the average nucleotide identity (ANI) based on BLAST+ (ANIb) and MUMmer (ANIm), as well as correlation indexes of tetra-nucleotide signatures (Tetra). In addition, it provides a Tetra Correlation Search function, which allows to rapidly compare selected genomes against a continuously updated reference database with currently about 32 000 published whole and draft genome sequences. For comparison, own genomes can be uploaded and references can be selected from the JSpeciesWS reference database. The service indicates whether two genomes share genomic identities above or below the species embracing thresholds, and serves as a fast way to allocate unknown genomes in the frame of the hitherto sequenced species. AVAILABILITY AND IMPLEMENTATION: JSpeciesWS is available at http://jspecies.ribohost.com/jspeciesws SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. CONTACT: mrichter@ribocon.com.
DNA-DNA-hybridization (DDH) measures the degree of genomic similarity between two pools of DNA molecules. Despite that distinct DDH determination methods frequently lead to different results, since nearly 50 years it is considered as the gold standard for species delineation (Rosselló-Móra and Amann, 2015). In times of cost-efficient sequencing with scientists having easy access to thousands of publicly available genome sequences, the species delineation based on user friendly computational pairwise genome comparisons to determine overall genome relatedness indices [OGRI, (Chun and Rainey, 2014)] has emerged. This approach has already been widely accepted by the scientific community (Colston ; Li ; Lugli ; Oren and Garrity, 2014) and specialized tools like the stand-alone software JSpecies (Richter and Rosselló-Móra, 2009) or similar solutions (Meier-Kolthoff ) have proven to provide valuable OGRI results.Here we present the JSpecies Web Server (JSpeciesWS) a web-based implementation of the core features of the original JSpecies software for easy user-access and extended functionality. It represents the latest development of JSpecies in terms of usability, flexibility, and efficiency. An online reference database of all published whole and draft genomes (about 32 000) with pre-calculated OGRI results has been added and can be searched in seconds based on genome comparison using the new Tetra Correlation Search (TCS) function.
2 Methods
JSpeciesWS has been implemented with the Google Web Toolkit (http://www.gwtproject.org/) version 2.7.0 and uses a Twitter-Bootstrap clone (https://github.com/gwtbootstrap3/gwtbootstrap3) for the user interface elements. Imported sequences get validated by using the FastaValidator version 1 (Waldmann ).
2.1 Average nucleotide identity based on BLAST+
The average nucleotide identity based on BLAST + (ANIb) calculation is implemented as described by Goris . In contrast to the original JSpecies stand-alone implementation, the web server uses BLAST version 2.2.29 + (Camacho ) instead of BLAST version 2.2.26 (Altschul ).
2.2 Average nucleotide identity based on MUMmer
For average nucleotide identity based on MUMmer (ANIm) calculation, the dnadiff script of the MUMmer tool version 3.0 (Kurtz ) is used instead of a self-written parser used by the original JSpecies stand-alone implementation.
2.3 Tetra-nucleotide signature correlation index
Tetra is an alignment-free parameter correlating with ANI, and implemented in the same way as the original JSpecies stand-alone (Richter and Rosselló-Móra, 2009).
2.4 Tetra Correlation Search
TCS is a new feature of JSpeciesWS that allows rapid comparisons against the JSpeciesWS reference database, providing a list of the most similar genomes based on their resulting Tetra-nucleotide signature correlation index.
2.5 Genomes reference database
The internal JSpeciesWS genome database resembles NCBI’s genomic sequence data taken from ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank and includes all primary submissions of assembled genome sequences and their associated annotation data. Currently, more than 32 000 prokaryotic genomes at different stages of finishing are included. For each entry, taxonomic information is provided for the field’s domain, phylum, class, order, family, genus and species. Further contextual data extracted from the original entry (if available) include genome size, guanine-cytosine content, Gram staining, shape, arrangement, endospores formation, motility, salinity, oxygen, habitat and temperature range.
3 Usage
JSpeciesWS does not require user registration. By visiting the website, a session is generated linked to a dedicated genome cart. The genome cart can be filled with own genome uploads and/or genomes can be added from the JSpeciesWS reference database. The cart (session) expires automatically after a period of 14 days of inactivity (no revisit based on session code provided) and all data will be deleted. The limit of genomes per cart is currently set to 15.Pairwise ANIb, ANIm or Tetra calculations can be performed between all genomes in the cart (all vs all) or, alternatively, only one genome of the cart can be selected for comparison against all other genomes in the cart to reduce calculation time. In addition, a TCS can be performed for each genome in the cart against the complete JSpeciesWS reference database of all published whole and draft genomes.After starting an analysis, the status field is regularly updated and the user is finally informed about a finished calculation by a pop-up message. In addition, the user can provide an e-mail address to become informed about the completion of the calculations. A all versus all comparison of 15 genomes (210 calculations) takes on the current system about 2 h based on ANIb, 30 min based on ANIm and 10 min based on Tetra. All results can be revisited, exported for further processing and shared among users.
4 Conclusion
Recent studies have shown that electronic (in silico) pairwise OGRI can replace standard laboratory-based DDH in order to simplify, and at the same time substantiate, this central aspect of prokaryotic species delineation. In addition, OGRI allows taxonomists to quickly verify the species affiliation of any public genome sequence and to identify wrongly annotated submissions. JSpeciesWS will help to further propagate this approach by providing fast and easy access to the corresponding analyses and a comprehensive genomes reference database.
Funding
This research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement no 311975. This publication reflects the views only of the author, and the European Union cannot be held responsible for any use which may be made of the information contained therein.Conflict of Interest: none declared.Click here for additional data file.
Authors: Johan Goris; Konstantinos T Konstantinidis; Joel A Klappenbach; Tom Coenye; Peter Vandamme; James M Tiedje Journal: Int J Syst Evol Microbiol Date: 2007-01 Impact factor: 2.747
Authors: Gabriele Andrea Lugli; Christian Milani; Francesca Turroni; Sabrina Duranti; Chiara Ferrario; Alice Viappiani; Leonardo Mancabelli; Marta Mangifesta; Bernard Taminiau; Véronique Delcenserie; Douwe van Sinderen; Marco Ventura Journal: Appl Environ Microbiol Date: 2014-08-08 Impact factor: 4.792
Authors: Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg Journal: Genome Biol Date: 2004-01-30 Impact factor: 13.583
Authors: Sophie M Colston; Matthew S Fullmer; Lidia Beka; Brigitte Lamy; J Peter Gogarten; Joerg Graf Journal: mBio Date: 2014-11-18 Impact factor: 7.867
Authors: Dimitri V Meier; Petra Pjevac; Wolfgang Bach; Stephane Hourdez; Peter R Girguis; Charles Vidoudez; Rudolf Amann; Anke Meyerdierks Journal: ISME J Date: 2017-04-04 Impact factor: 10.302
Authors: Julia A Maresca; Jessica L Keffer; Priscilla P Hempel; Shawn W Polson; Olga Shevchenko; Jaysheel Bhavsar; Deborah Powell; Kelsey J Miller; Archana Singh; Martin W Hahn Journal: J Bacteriol Date: 2019-04-24 Impact factor: 3.490