Literature DB >> 14525933

Comprehensive analysis of orthologous protein domains using the HOPS database.

Christian E V Storm1, Erik L L Sonnhammer.   

Abstract

One of the most reliable methods for protein function annotation is to transfer experimentally known functions from orthologous proteins in other organisms. Most methods for identifying orthologs operate on a subset of organisms with a completely sequenced genome, and treat proteins as single-domain units. However, it is well known that proteins are often made up of several independent domains, and there is a wealth of protein sequences from genomes that are not completely sequenced. A comprehensive set of protein domain families is found in the Pfam database. We wanted to apply orthology detection to Pfam families, but first some issues needed to be addressed. First, orthology detection becomes impractical and unreliable when too many species are included. Second, shorter domains contain less information. It is therefore important to assess the quality of the orthology assignment and avoid very short domains altogether. We present a database of orthologous protein domains in Pfam called HOPS: Hierarchical grouping of Orthologous and Paralogous Sequences. Orthology is inferred in a hierarchic system of phylogenetic subgroups using ortholog bootstrapping. To avoid the frequent errors stemming from horizontally transferred genes in bacteria, the analysis is presently limited to eukaryotic genes. The results are accessible in the graphical browser NIFAS, a Java tool originally developed for analyzing phylogenetic relations within Pfam families. The method was tested on a set of curated orthologs with experimentally verified function. In comparison to tree reconciliation with a complete species tree, our approach finds significantly more orthologs in the test set. Examples for investigating gene fusions and domain recombination using HOPS are given.

Mesh:

Substances:

Year:  2003        PMID: 14525933      PMCID: PMC403726          DOI: 10.1101/gr1305203

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  23 in total

1.  NIFAS: visual analysis of domain evolution in proteins.

Authors:  C E Storm; E L Sonnhammer
Journal:  Bioinformatics       Date:  2001-04       Impact factor: 6.937

Review 2.  Orthologs, paralogs and genome comparisons.

Authors:  J P Gogarten; L Olendzenski
Journal:  Curr Opin Genet Dev       Date:  1999-12       Impact factor: 5.578

Review 3.  Homology a personal view on some of the problems.

Authors:  W M Fitch
Journal:  Trends Genet       Date:  2000-05       Impact factor: 11.639

Review 4.  Genome annotation: from sequence to biology.

Authors:  L Stein
Journal:  Nat Rev Genet       Date:  2001-07       Impact factor: 53.242

5.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

Review 6.  Horizontal gene transfer in prokaryotes: quantification and classification.

Authors:  E V Koonin; K S Makarova; L Aravind
Journal:  Annu Rev Microbiol       Date:  2001       Impact factor: 15.500

7.  Investigating 42 candidate orthologous protein groups by molecular evolutionary analysis on genome scale.

Authors:  T Xie; D Ding
Journal:  Gene       Date:  2000-12-31       Impact factor: 3.688

8.  Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.

Authors:  M Remm; C E Storm; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-12-14       Impact factor: 5.469

9.  Large-scale taxonomic profiling of eukaryotic model organisms: a comparison of orthologous proteins encoded by the human, fly, nematode, and yeast genomes.

Authors:  A R Mushegian; J R Garey; J Martin; L X Liu
Journal:  Genome Res       Date:  1998-06       Impact factor: 9.043

10.  The Pfam protein families database.

Authors:  Alex Bateman; Ewan Birney; Lorenzo Cerruti; Richard Durbin; Laurence Etwiller; Sean R Eddy; Sam Griffiths-Jones; Kevin L Howe; Mhairi Marshall; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

View more
  17 in total

1.  Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

2.  Sequence alignment by cross-correlation.

Authors:  Alan L Rockwood; David K Crockett; James R Oliphant; Kojo S J Elenitoba-Johnson
Journal:  J Biomol Tech       Date:  2005-12

3.  Simultaneous Bayesian gene tree reconstruction and reconciliation analysis.

Authors:  Orjan Akerborg; Bengt Sennblad; Lars Arvestad; Jens Lagergren
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-19       Impact factor: 11.205

4.  Computational methods for Gene Orthology inference.

Authors:  David M Kristensen; Yuri I Wolf; Arcady R Mushegian; Eugene V Koonin
Journal:  Brief Bioinform       Date:  2011-06-19       Impact factor: 11.622

5.  The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.

Authors:  Raffaele Mazza; Francesco Strozzi; Andrea Caprera; Paolo Ajmone-Marsan; John L Williams
Journal:  BMC Genomics       Date:  2009-12-14       Impact factor: 3.969

6.  DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection.

Authors:  Ting-wen Chen; Timothy H Wu; Wailap V Ng; Wen-chang Lin
Journal:  BMC Bioinformatics       Date:  2010-10-15       Impact factor: 3.169

7.  Metabolic analysis of the soil microbe Dechloromonas aromatica str. RCB: indications of a surprisingly complex life-style and cryptic anaerobic pathways for aromatic degradation.

Authors:  Kennan Kellaris Salinero; Keith Keller; William S Feil; Helene Feil; Stephan Trong; Genevieve Di Bartolo; Alla Lapidus
Journal:  BMC Genomics       Date:  2009-08-03       Impact factor: 3.969

8.  Evaluating ortholog prediction algorithms in a yeast model clade.

Authors:  Leonidas Salichos; Antonis Rokas
Journal:  PLoS One       Date:  2011-04-13       Impact factor: 3.240

9.  Signalogs: orthology-based identification of novel signaling pathway components in three metazoans.

Authors:  Tamás Korcsmáros; Máté S Szalay; Petra Rovó; Robin Palotai; Dávid Fazekas; Katalin Lenti; Illés J Farkas; Péter Csermely; Tibor Vellai
Journal:  PLoS One       Date:  2011-05-03       Impact factor: 3.240

Review 10.  Ortholog identification in the presence of domain architecture rearrangement.

Authors:  Kimmen Sjölander; Ruchira S Datta; Yaoqing Shen; Grant M Shoffner
Journal:  Brief Bioinform       Date:  2011-06-28       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.