Family relationships: should consensus reign?--consensus clustering for protein families.

Macha Nikolski, David J Sherman.

Abstract

MOTIVATION: Reliable identification of protein families is key to phylogenetic analysis, functional annotation and the exploration of protein function diversity in a given phylogenetic branch. As more and more complete genomes are sequenced, there is a need for powerful and reliable algorithms that facilitate the construction of protein families.
RESULTS: We have formulated the problem of protein family construction as an instance of consensus clustering, for which we designed a novel algorithm that is computationally efficient in practice and produces high-quality results. Our algorithm uses an election method to construct consensus families from competing clustering computations. It is tailored to the specific needs of comparative genomics projects. First, it provides a robust means of incorporating results from different and complementary clustering methods, thus avoiding the need for an a priori choice that may introduce computational bias into the results. Second, it is suited to large-scale projects owing to its practical efficiency. Third, it produces high-quality results in which families tend to represent groupings by biological function.
AVAILABILITY: This method has been used by the Génolevures project to compute protein families of Hemiascomycetous yeasts. The data are available online at http://cbi.labri.fr/Genolevures/fam/
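The abstract does not spell out the election method, so the following is only a minimal sketch of one generic way to build consensus families from competing clusterings: pairwise majority voting followed by merging of the agreed pairs. It is not the authors' algorithm. The function name `consensus_families`, the `threshold` parameter and the toy protein IDs are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def consensus_families(clusterings, threshold=0.5):
    """Hypothetical majority-vote consensus (not the published method).

    clusterings: list of clusterings, each a list of sets of protein IDs.
    Two proteins are joined when strictly more than `threshold` of the
    clusterings place them in the same family.
    """
    proteins = sorted(set().union(*(fam for c in clusterings for fam in c)))

    # Each clustering casts one vote for every pair it groups together.
    votes = Counter()
    for clustering in clusterings:
        for family in clustering:
            for pair in combinations(sorted(family), 2):
                votes[pair] += 1

    # Union-find: merge pairs that won the vote.
    parent = {p: p for p in proteins}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for (a, b), n in votes.items():
        if n / len(clusterings) > threshold:
            parent[find(a)] = find(b)

    # Collect the merged components as sorted lists of protein IDs.
    groups = {}
    for p in proteins:
        groups.setdefault(find(p), []).append(p)
    return sorted(groups.values())


# Three toy clusterings of five made-up proteins.
c1 = [{"A", "B", "C"}, {"D", "E"}]
c2 = [{"A", "B"}, {"C", "D", "E"}]
c3 = [{"A", "B", "C"}, {"D"}, {"E"}]
print(consensus_families([c1, c2, c3]))
```

In this toy input, the pairs A-B, A-C, B-C and D-E each receive at least two of three votes, so the sketch yields the families {A, B, C} and {D, E}. Merging winning pairs transitively is a simplification; a real election scheme may resolve conflicts between clusterings differently.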

Year:  2007        PMID: 17237108     DOI: 10.1093/bioinformatics/btl314

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

