COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations.

Raja Jothi, Elena Zotenko, Asba Tasneem, Teresa M Przytycka.

Abstract

MOTIVATION: Determining orthology relations among genes across multiple genomes is an important problem in the post-genomic era. Identifying orthologous genes can not only help predict functional annotations for newly sequenced or poorly characterized genomes, but can also help predict new protein-protein interactions. Unfortunately, determining orthology relations through computational methods is not straightforward due to the presence of paralogs. Traditional approaches have relied on pairwise sequence comparisons to construct graphs, which were then partitioned into putative clusters of orthologous groups. These methods do not attempt to preserve the non-transitivity and hierarchical nature of the orthology relation.
RESULTS: We propose a new method, COCO-CL, for hierarchical clustering of homology relations and identification of orthologous groups of genes. Unlike previous approaches, which are based on pairwise sequence comparisons, our method explores the correlation of evolutionary histories of individual genes in a more global context. COCO-CL can be used as a semi-independent method to delineate the orthology/paralogy relation for a refined set of homologous proteins obtained using a less-conservative clustering approach, or as a refiner that removes putative out-paralogs from clusters computed using a more inclusive approach. We analyze our clustering results manually, with support from literature and functional annotations. Since our orthology determination procedure does not employ a species tree to infer duplication events, it can be used in situations when the species tree is unknown or uncertain.
CONTACT: jothi@mail.nih.gov, przytyck@mail.nih.gov
SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online.
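
The RESULTS paragraph speaks of correlating the evolutionary histories of genes without giving the computation. The following is only a minimal sketch of one way such a correlation could be scored, not the authors' procedure. It assumes that each candidate sub-cluster is summarized by pairwise evolutionary distances between the species it covers, and that two sub-clusters are compared by the Pearson correlation of those distances over the species pairs they share. The function names, the dictionary layout, and the toy distances are all hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant distances")
    return cov / sqrt(var_x * var_y)


def history_correlation(dist_a, dist_b):
    """Correlate two sub-clusters over the species pairs they share.

    dist_a, dist_b: dicts mapping a sorted (species, species) tuple to a
    pairwise distance. This layout is an assumption for illustration.
    """
    shared_pairs = sorted(set(dist_a) & set(dist_b))
    if len(shared_pairs) < 3:
        raise ValueError("need at least three shared species pairs")
    return pearson(
        [dist_a[pair] for pair in shared_pairs],
        [dist_b[pair] for pair in shared_pairs],
    )


# Toy distances (made up): sub-clusters a and b show the same pattern of
# divergence between species, while c shows a different one.
dist_a = {("fly", "human"): 0.80, ("fly", "mouse"): 0.82, ("human", "mouse"): 0.10}
dist_b = {pair: 2.0 * d for pair, d in dist_a.items()}
dist_c = {("fly", "human"): 0.10, ("fly", "mouse"): 0.15, ("human", "mouse"): 0.90}

r_similar = history_correlation(dist_a, dist_b)    # close to 1
r_different = history_correlation(dist_a, dist_c)  # negative
```

How such a score would be thresholded and turned into decisions at each level of the hierarchy is not stated in the abstract, so it is left out of the sketch.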

Year:  2006        PMID: 16434444      PMCID: PMC1620014          DOI: 10.1093/bioinformatics/btl009

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


