Literature DB >> 12855445

Annotation of bacterial genomes using improved phylogenomic profiles.

F Enault1, K Suhre, C Abergel, O Poirot, J-M Claverie.   

Abstract

MOTIVATION: Phylogenomic profiling is a large-scale comparative genomic method used to infer protein function from evolutionary information first described in a binary form by Pellegrini et al. (1999). Here, we propose improvements of this approach including the use of normalized Blastp bit scores, a normalization of the matrix of profiles to take into account the evolutionary distances between bacteria, the definition of a phylogenomic neighborhood based on continuous pairwise distances between genes and an original annotation procedure including the computation of a p-value for each functional assignment.
RESULTS: The method presented here increases the number of Ecocyc enzymes identified as being evolutionarily related by about 25% with respect to the original binary form (absent/present) method. The fraction of 'false' positives is shown to be smaller than 20%. Based on their phylogenomic relationships, genes of unknown function can then be automatically related to annotated genes. Each gene annotation predicted is associated with a p-value, i.e. its probability to be obtained by chance. The validity of this method was extensively tested on a large set of genes of known function using the MultiFun database. We find that 50% of 3122 function attributions that can be made at a p-value level of 10(-11) correspond to the actual gene annotation. The method can be readily applied to any newly sequenced microbial genome. In contrast to earlier work on the same topic, our approach avoids the use of arbitrary cut-off values, and provides a reliability estimate of the functional predictions in form of p-values.

Mesh:

Substances:

Year:  2003        PMID: 12855445     DOI: 10.1093/bioinformatics/btg1013

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Phydbac2: improved inference of gene function using interactive phylogenomic profiling and chromosomal location analysis.

Authors:  François Enault; Karsten Suhre; Olivier Poirot; Chantal Abergel; Jean-Michel Claverie
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

Review 2.  Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution.

Authors:  Philip R Kensche; Vera van Noort; Bas E Dutilh; Martijn A Huynen
Journal:  J R Soc Interface       Date:  2008-02-06       Impact factor: 4.118

3.  Single-pass classification of all noncoding sequences in a bacterial genome using phylogenetic profiles.

Authors:  Antonin Marchais; Magali Naville; Chantal Bohn; Philippe Bouloc; Daniel Gautheret
Journal:  Genome Res       Date:  2009-02-23       Impact factor: 9.043

4.  Inferring genome-wide functional linkages in E. coli by combining improved genome context methods: comparison with high-throughput experimental data.

Authors:  Sailu Yellaboina; Kshama Goyal; Shekhar C Mande
Journal:  Genome Res       Date:  2007-03-05       Impact factor: 9.043

5.  Phydbac (phylogenomic display of bacterial genes): An interactive resource for the annotation of bacterial genomes.

Authors:  François Enault; Karsten Suhre; Olivier Poirot; Chantal Abergel; Jean-Michel Claverie
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process.

Authors:  Malay K Basu; Jeremy D Selengut; Daniel H Haft
Journal:  BMC Bioinformatics       Date:  2011-11-09       Impact factor: 3.307

7.  MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction.

Authors:  Torsten Blum; Sebastian Briesemeister; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2009-09-01       Impact factor: 3.169

8.  Effect of reference genome selection on the performance of computational methods for genome-wide protein-protein interaction prediction.

Authors:  Vijaykumar Yogesh Muley; Akash Ranjan
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

9.  An improved hypergeometric probability method for identification of functionally linked proteins using phylogenetic profiles.

Authors:  Appala Raju Kotaru; Khader Shameer; Pandurangan Sundaramurthy; Ramesh Chandra Joshi
Journal:  Bioinformation       Date:  2013-04-13

10.  Evaluation of physical and functional protein-protein interaction prediction methods for detecting biological pathways.

Authors:  Vijaykumar Yogesh Muley; Akash Ranjan
Journal:  PLoS One       Date:  2013-01-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.