Literature DB >> 11836216

Automated ortholog inference from phylogenetic trees and calculation of orthology reliability.

Christian E V Storm1, Erik L L Sonnhammer.   

Abstract

MOTIVATION: Orthologous proteins in different species are likely to have similar biochemical function and biological role. When annotating a newly sequenced genome by sequence homology, the most precise and reliable functional information can thus be derived from orthologs in other species. A standard method of finding orthologs is to compare the sequence tree with the species tree. However, since the topology of phylogenetic tree is not always reliable one might get incorrect assignments.
RESULTS: Here we present a novel method that resolves this problem by analyzing a set of bootstrap trees instead of the optimal tree. The frequency of orthology assignments in the bootstrap trees can be interpreted as a support value for the possible orthology of the sequences. Our method is efficient enough to analyze data in the scale of whole genomes. It is implemented in Java and calculates orthology support levels for all pairwise combinations of homologous sequences of two species. The method was tested on simulated datasets and on real data of homologous proteins.

Mesh:

Substances:

Year:  2002        PMID: 11836216     DOI: 10.1093/bioinformatics/18.1.92

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  69 in total

1.  Comprehensive analysis of orthologous protein domains using the HOPS database.

Authors:  Christian E V Storm; Erik L L Sonnhammer
Journal:  Genome Res       Date:  2003-10       Impact factor: 9.043

2.  Phylogenetic molecular function annotation.

Authors:  Barbara E Engelhardt; Michael I Jordan; Susanna T Repo; Steven E Brenner
Journal:  J Phys Conf Ser       Date:  2009

3.  COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations.

Authors:  Raja Jothi; Elena Zotenko; Asba Tasneem; Teresa M Przytycka
Journal:  Bioinformatics       Date:  2006-01-24       Impact factor: 6.937

Review 4.  Homology and phylogeny and their automated inference.

Authors:  Georg Fuellen
Journal:  Naturwissenschaften       Date:  2008-02-21

5.  A hierarchical model for incomplete alignments in phylogenetic inference.

Authors:  Fuxia Cheng; Stefanie Hartmann; Mayetri Gupta; Joseph G Ibrahim; Todd J Vision
Journal:  Bioinformatics       Date:  2009-01-15       Impact factor: 6.937

6.  Reconciliation revisited: handling multiple optima when reconciling with duplication, transfer, and loss.

Authors:  Mukul S Bansal; Eric J Alm; Manolis Kellis
Journal:  J Comput Biol       Date:  2013-09-14       Impact factor: 1.479

7.  Computational methods for Gene Orthology inference.

Authors:  David M Kristensen; Yuri I Wolf; Arcady R Mushegian; Eugene V Koonin
Journal:  Brief Bioinform       Date:  2011-06-19       Impact factor: 11.622

8.  Genome-scale phylogenetic function annotation of large and diverse protein families.

Authors:  Barbara E Engelhardt; Michael I Jordan; John R Srouji; Steven E Brenner
Journal:  Genome Res       Date:  2011-07-22       Impact factor: 9.043

9.  The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.

Authors:  Ishita K Khan; Qing Wei; Samuel Chapman; Dukka B Kc; Daisuke Kihara
Journal:  Gigascience       Date:  2015-09-14       Impact factor: 6.524

10.  The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.

Authors:  Raffaele Mazza; Francesco Strozzi; Andrea Caprera; Paolo Ajmone-Marsan; John L Williams
Journal:  BMC Genomics       Date:  2009-12-14       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.