Literature DB >> 22426345

GOHTAM: a website for 'Genomic Origin of Horizontal Transfers, Alignment and Metagenomics'.

Sabine Ménigaud1, Ludovic Mallet, Géraldine Picord, Cécile Churlaud, Alexandre Borrel, Patrick Deschavanne.   

Abstract

MOTIVATION: This website allows the detection of horizontal transfers based on a combination of parametric methods and proposes an origin by researching neighbors in a bank of genomic signatures. This bank is also used to research an origin to DNA fragments from metagenomics studies.
RESULTS: Different services are provided like the possibility of inferring a phylogenetic tree with sequence signatures or comparing two genomes and displaying the rearrangements that happened since their separation.
AVAILABILITY AND IMPLEMENTATION: http://gohtam.rpbs.univ-paris-diderot.fr/

Entities:  

Mesh:

Year:  2012        PMID: 22426345      PMCID: PMC3338014          DOI: 10.1093/bioinformatics/bts118

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Horizontal transfers (HTs) are a major force of evolution (Keeling and Palmer, 2008; Ochman ) and this website proposes methods for their detection. The genomic signature was demonstrated to be species-specific (Deschavanne ; Sandberg ) and allows HT detection in terms of tetranucleotide frequencies (Dufraigne ). Parametric methods were designed to work only with the information contained in genomic sequences. They rely either on the whole set of genes or on local variations of genomic signature (Dufraigne ; Mallet ). Recently, a benchmark has determined the most efficient parametric methods in different conditions and has proposed to use a combination of methods to analyze HTs in genomes (Becq ). This site provides user-friendly access to such methods as well as some unique features including signature-based phylogeny and potential origin of a set of metagenomics sequences.

2 GOHTAM SERVICES

2.1 HT detection

The two methods proposed can be used alone or in combination. The first is a window-based signature method as described in Dufraigne ), except that the distance used is the Jensen–Shannon divergence, a symmetric version of the Kullback–Leibler divergence (Azad and Lawrence, 2007; Becq ). Either sensitivity or specificity can be increased by adjustable classification process (Azad and Lawrence, 2007). A gene-based method is also proposed with the same distance (Becq ). Up to now, these methods were never proposed for online genome analysis (Fig. 1A).
Fig. 1.

Some partial screens of the website. (A) Window-based HT detection; (B) table of neighbors; (C) signature-based phylogenetic tree; (D) species signature; and (E) genome alignment.

Some partial screens of the website. (A) Window-based HT detection; (B) table of neighbors; (C) signature-based phylogenetic tree; (D) species signature; and (E) genome alignment.

2.2 Bank of genomic signatures

A key feature of GOHTAM is the biggest bank of genomic signatures to date. Instead of using only complete genomes (van Passel ; Teeling ), this bank is based on the whole set of sequences of Genbank (release 188, only sequences <1 kb were discarded) and contains ~248 000 tetranucleotide species signatures. The bank is updated at each major release.

2.3 Origin of transferred regions

Each detected region signature is compared with the signatures of the bank and the 10 closest neighbors are displayed with a confidence rating depending on the length of both query and reference sequences and the distance between the two signatures (Fig. 1B).

2.4 Metagenomics

In the case of a metagenomics study, a sequence or a set of sequences (multi-Fasta) is loaded; the signatures of these sequences are compared as above to propose a species of origin.

2.5 Oligonucleotide content

The whole set of tetranucleotides of a sequence represents the signature of a sequence (Deschavanne ). This signature of the 256 possible tetranucleotides is under the form of a 16×16 frequency matrix and can be displayed as a signature image (Fig. 1D).

2.6 Phylogenetic tree of sequence signatures

It was shown that the species specificity of genomic signatures could be used to infer phylogenetic trees (Chapus ). Loading a multi-Fasta file of sequences leads to build a neighbor-joining phylogenetic tree (Fig. 1C; Felsenstein, 2005).

2.7 Genome alignment

The website uses maximum unique matches (MUMs) to align genomes. All rearrangements superiors to 1 kb between two genomes are graphically displayed with the possibility to choose a region or modify the length of MUMs (Fig. 1E; Delcher ).

3 IMPLEMENTATION

Except for use of programs like the Phylip package (http://evolution.gs.washington.edu/phylip.html) or Mummer (http://mummer.sourceforge.net/), the original programs are written in Python, Perl or R and available at: http://gohtam.rpbs.univ-paris-diderot.fr:8080/Data/bin/GOHTAM_bin.tgz An online help is available. Some analyses require time; HT detection lasts ~6 min and the research for neighbors ~2 min depending on the server load and the sequence length. This site provides some unique features in terms of HT detection, origin of HT regions, metagenomics studies as well as for phylogenetic analyses of homologous or non-homologous sequences due to its extended reference database and improves the analyses proposed by other sites of genome analysis (van Passel ; Teeling ). Funding: This work was supported by a grant from ANR MIE/TB-Hits 2010. Conflict of Interest: none declared.
  12 in total

1.  Genomic signature: characterization and classification of species assessed by chaos game representation of sequences.

Authors:  P J Deschavanne; A Giron; J Vilain; G Fagot; B Fertil
Journal:  Mol Biol Evol       Date:  1999-10       Impact factor: 16.240

Review 2.  Lateral gene transfer and the nature of bacterial innovation.

Authors:  H Ochman; J G Lawrence; E A Groisman
Journal:  Nature       Date:  2000-05-18       Impact factor: 49.962

3.  Deltarho-web, an online tool to assess composition similarity of individual nucleic acid sequences.

Authors:  M W J van Passel; A C M Luyf; A H C van Kampen; A Bart; A van der Ende
Journal:  Bioinformatics       Date:  2005-04-26       Impact factor: 6.937

Review 4.  Horizontal gene transfer in eukaryotic evolution.

Authors:  Patrick J Keeling; Jeffrey D Palmer
Journal:  Nat Rev Genet       Date:  2008-08       Impact factor: 53.242

5.  Capturing whole-genome characteristics in short sequences using a naïve Bayesian classifier.

Authors:  R Sandberg; G Winberg; C I Bränden; A Kaske; I Ernberg; J Cöster
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

6.  Alignment of whole genomes.

Authors:  A L Delcher; S Kasif; R D Fleischmann; J Peterson; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-06-01       Impact factor: 16.971

7.  A benchmark of parametric methods for horizontal transfers detection.

Authors:  Jennifer Becq; Cécile Churlaud; Patrick Deschavanne
Journal:  PLoS One       Date:  2010-04-01       Impact factor: 3.240

8.  Exploration of phylogenetic data using a global sequence analysis method.

Authors:  Charles Chapus; Christine Dufraigne; Scott Edwards; Alain Giron; Bernard Fertil; Patrick Deschavanne
Journal:  BMC Evol Biol       Date:  2005-11-09       Impact factor: 3.260

9.  TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences.

Authors:  Hanno Teeling; Jost Waldmann; Thierry Lombardot; Margarete Bauer; Frank Oliver Glöckner
Journal:  BMC Bioinformatics       Date:  2004-10-26       Impact factor: 3.169

10.  Detecting laterally transferred genes: use of entropic clustering methods and genome position.

Authors:  Rajeev K Azad; Jeffrey G Lawrence
Journal:  Nucleic Acids Res       Date:  2007-06-25       Impact factor: 16.971

View more
  19 in total

1.  Biochemical diversity of carboxyl esterases and lipases from Lake Arreo (Spain): a metagenomic approach.

Authors:  Mónica Martínez-Martínez; María Alcaide; Anatoli Tchigvintsev; Oleg Reva; Julio Polaina; Rafael Bargiela; María-Eugenia Guazzaroni; Alvaro Chicote; Albert Canet; Francisco Valero; Eugenio Rico Eguizabal; María del Carmen Guerrero; Alexander F Yakunin; Manuel Ferrer
Journal:  Appl Environ Microbiol       Date:  2013-03-29       Impact factor: 4.792

2.  Identification and characterization of carboxyl esterases of gill chamber-associated microbiota in the deep-sea shrimp Rimicaris exoculata by using functional metagenomics.

Authors:  María Alcaide; Anatoli Tchigvintsev; Mónica Martínez-Martínez; Ana Popovic; Oleg N Reva; Álvaro Lafraya; Rafael Bargiela; Taras Y Nechitaylo; Ruth Matesanz; Marie-Anne Cambon-Bonavita; Mohamed Jebbar; Michail M Yakimov; Alexei Savchenko; Olga V Golyshina; Alexander F Yakunin; Peter N Golyshin; Manuel Ferrer
Journal:  Appl Environ Microbiol       Date:  2015-01-16       Impact factor: 4.792

3.  Biochemical studies on a versatile esterase that is most catalytically active with polyaromatic esters.

Authors:  Mónica Martínez-Martínez; Iván Lores; Carlina Peña-García; Rafael Bargiela; Dolores Reyes-Duarte; María-Eugenia Guazzaroni; Ana Isabel Peláez; Jesús Sánchez; Manuel Ferrer
Journal:  Microb Biotechnol       Date:  2014-01-13       Impact factor: 5.813

4.  Comparative genomic and phylogenetic analyses of Gammaproteobacterial glg genes traced the origin of the Escherichia coli glycogen glgBXCAP operon to the last common ancestor of the sister orders Enterobacteriales and Pasteurellales.

Authors:  Goizeder Almagro; Alejandro M Viale; Manuel Montero; Mehdi Rahimpour; Francisco José Muñoz; Edurne Baroja-Fernández; Abdellatif Bahaji; Manuel Zúñiga; Fernando González-Candelas; Javier Pozueta-Romero
Journal:  PLoS One       Date:  2015-01-21       Impact factor: 3.240

5.  Complete genome sequence of 'Halanaeroarchaeum sulfurireducens' M27-SA2, a sulfur-reducing and acetate-oxidizing haloarchaeon from the deep-sea hypersaline anoxic lake Medee.

Authors:  Enzo Messina; Dimitry Y Sorokin; Ilya V Kublanov; Stepan Toshchakov; Anna Lopatina; Erika Arcadi; Francesco Smedile; Gina La Spada; Violetta La Cono; Michail M Yakimov
Journal:  Stand Genomic Sci       Date:  2016-05-13

6.  Metaproteomics and metabolomics analyses of chronically petroleum-polluted sites reveal the importance of general anaerobic processes uncoupled with degradation.

Authors:  Rafael Bargiela; Florian-Alexander Herbst; Mónica Martínez-Martínez; Jana Seifert; David Rojo; Simone Cappello; María Genovese; Francesca Crisafi; Renata Denaro; Tatyana N Chernikova; Coral Barbas; Martin von Bergen; Michail M Yakimov; Manuel Ferrer; Peter N Golyshin
Journal:  Proteomics       Date:  2015-08-27       Impact factor: 3.984

7.  Analysis of three genomes within the thermophilic bacterial species Caldanaerobacter subterraneus with a focus on carbon monoxide dehydrogenase evolution and hydrolase diversity.

Authors:  F H Sant'Anna; A V Lebedinsky; T G Sokolova; F T Robb; J M Gonzalez
Journal:  BMC Genomics       Date:  2015-10-07       Impact factor: 3.969

8.  Deciphering Genome Content and Evolutionary Relationships of Isolates from the Fungus Magnaporthe oryzae Attacking Different Host Plants.

Authors:  Hélène Chiapello; Ludovic Mallet; Cyprien Guérin; Gabriela Aguileta; Joëlle Amselem; Thomas Kroj; Enrique Ortega-Abboud; Marc-Henri Lebrun; Bernard Henrissat; Annie Gendrault; François Rodolphe; Didier Tharreau; Elisabeth Fournier
Journal:  Genome Biol Evol       Date:  2015-10-09       Impact factor: 3.416

9.  Lateral Gene Transfer in a Heavy Metal-Contaminated-Groundwater Microbial Community.

Authors:  Christopher L Hemme; Stefan J Green; Lavanya Rishishwar; Om Prakash; Angelica Pettenato; Romy Chakraborty; Adam M Deutschbauer; Joy D Van Nostrand; Liyou Wu; Zhili He; I King Jordan; Terry C Hazen; Adam P Arkin; Joel E Kostka; Jizhong Zhou
Journal:  MBio       Date:  2016-04-05       Impact factor: 7.867

10.  Structural and Functional Characterization of a Ruminal β-Glycosidase Defines a Novel Subfamily of Glycoside Hydrolase Family 3 with Permuted Domain Topology.

Authors:  Mercedes Ramírez-Escudero; Mercedes V Del Pozo; Julia Marín-Navarro; Beatriz González; Peter N Golyshin; Julio Polaina; Manuel Ferrer; Julia Sanz-Aparicio
Journal:  J Biol Chem       Date:  2016-09-27       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.