Literature DB >> 15713731

Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases.

Jean-François Dufayard1, Laurent Duret, Simon Penel, Manolo Gouy, François Rechenmann, Guy Perrière.   

Abstract

MOTIVATION: Comparative sequence analysis is widely used to study genome function and evolution. This approach first requires the identification of homologous genes and then the interpretation of their homology relationships (orthology or paralogy). To provide help in this complex task, we developed three databases of homologous genes containing sequences, multiple alignments and phylogenetic trees: HOBACGEN, HOVERGEN and HOGENOM. In this paper, we present two new tools for automating the search for orthologs or paralogs in these databases.
RESULTS: First, we have developed and implemented an algorithm to infer speciation and duplication events by comparison of gene and species trees (tree reconciliation). Second, we have developed a general method to search in our databases the gene families for which the tree topology matches a peculiar tree pattern. This algorithm of unordered tree pattern matching has been implemented in the FamFetch graphical interface. With the help of a graphical editor, the user can specify the topology of the tree pattern, and set constraints on its nodes and leaves. Then, this pattern is compared with all the phylogenetic trees of the database, to retrieve the families in which one or several occurrences of this pattern are found. By specifying ad hoc patterns, it is therefore possible to identify orthologs in our databases.

Mesh:

Year:  2005        PMID: 15713731     DOI: 10.1093/bioinformatics/bti325

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  73 in total

1.  Premature terminator analysis sheds light on a hidden world of bacterial transcriptional attenuation.

Authors:  Magali Naville; Daniel Gautheret
Journal:  Genome Biol       Date:  2010-09-29       Impact factor: 13.583

2.  Development of High Affinity and High Specificity Inhibitors of Matrix Metalloproteinase 14 through Computational Design and Directed Evolution.

Authors:  Valeria Arkadash; Gal Yosef; Jason Shirian; Itay Cohen; Yuval Horev; Moran Grossman; Irit Sagi; Evette S Radisky; Julia M Shifman; Niv Papo
Journal:  J Biol Chem       Date:  2017-01-13       Impact factor: 5.157

3.  COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations.

Authors:  Raja Jothi; Elena Zotenko; Asba Tasneem; Teresa M Przytycka
Journal:  Bioinformatics       Date:  2006-01-24       Impact factor: 6.937

4.  Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

5.  EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.

Authors:  Albert J Vilella; Jessica Severin; Abel Ureta-Vidal; Li Heng; Richard Durbin; Ewan Birney
Journal:  Genome Res       Date:  2008-11-24       Impact factor: 9.043

6.  Dealing with incongruence in phylogenomic analyses.

Authors:  Nicolas Galtier; Vincent Daubin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

7.  Reconciliation with non-binary species trees.

Authors:  Benjamin Vernot; Maureen Stolzer; Aiton Goldman; Dannie Durand
Journal:  J Comput Biol       Date:  2008-10       Impact factor: 1.479

8.  Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.

Authors:  Romain A Studer; Simon Penel; Laurent Duret; Marc Robinson-Rechavi
Journal:  Genome Res       Date:  2008-06-18       Impact factor: 9.043

9.  Computational methods for Gene Orthology inference.

Authors:  David M Kristensen; Yuri I Wolf; Arcady R Mushegian; Eugene V Koonin
Journal:  Brief Bioinform       Date:  2011-06-19       Impact factor: 11.622

10.  Lipid transfer proteins in coffee: isolation of Coffea orthologs, Coffea arabica homeologs, expression during coffee fruit development and promoter analysis in transgenic tobacco plants.

Authors:  Michelle G Cotta; Leila M G Barros; Juliana D de Almeida; Fréderic de Lamotte; Eder A Barbosa; Natalia G Vieira; Gabriel S C Alves; Felipe Vinecky; Alan C Andrade; Pierre Marraccini
Journal:  Plant Mol Biol       Date:  2014-01-28       Impact factor: 4.076

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.