Literature DB >> 18263643

ConFunc--functional annotation in the twilight zone.

Mark N Wass1, Michael J E Sternberg.   

Abstract

MOTIVATION: The success of genome sequencing has resulted in many protein sequences without functional annotation. We present ConFunc, an automated Gene Ontology (GO)-based protein function prediction approach, which uses conserved residues to generate sequence profiles to infer function. ConFunc split sets of sequences identified by PSI-BLAST into sub-alignments according to their GO annotations. Conserved residues are identified for each GO term sub-alignment for which a position specific scoring matrix is generated. This combination of steps produces a set of feature (GO annotation) derived profiles from which protein function is predicted.
RESULTS: We assess the ability of ConFunc, BLAST and PSI-BLAST to predict protein function in the twilight zone of sequence similarity. ConFunc significantly outperforms BLAST & PSI-BLAST obtaining levels of recall and precision that are not obtained by either method and maximum precision 24% greater than BLAST. Further for a large test set of sequences with homologues of low sequence identity, at high levels of presicision, ConFunc obtains recall six times greater than BLAST. These results demonstrate the potential for ConFunc to form part of an automated genomics annotation pipeline. AVAILABILITY: http://www.sbg.bio.ic.ac.uk/confunc

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18263643     DOI: 10.1093/bioinformatics/btn037

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  51 in total

1.  Real-time ligand binding pocket database search using local surface descriptors.

Authors:  Rayan Chikhi; Lee Sael; Daisuke Kihara
Journal:  Proteins       Date:  2010-07

2.  Structure- and sequence-based function prediction for non-homologous proteins.

Authors:  Lee Sael; Meghana Chitale; Daisuke Kihara
Journal:  J Struct Funct Genomics       Date:  2012-01-22

3.  Analysis of the peroxiredoxin family: using active-site structure and sequence information for global classification and residue analysis.

Authors:  Kimberly J Nelson; Stacy T Knutson; Laura Soito; Chananat Klomsiri; Leslie B Poole; Jacquelyn S Fetrow
Journal:  Proteins       Date:  2010-12-22

4.  Protein structure prediction on the Web: a case study using the Phyre server.

Authors:  Lawrence A Kelley; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

5.  Prediction of ligand binding sites using homologous structures and conservation at CASP8.

Authors:  Mark N Wass; Michael J E Sternberg
Journal:  Proteins       Date:  2009

6.  Comparison of structure-based and threading-based approaches to protein functional annotation.

Authors:  Michal Brylinski; Jeffrey Skolnick
Journal:  Proteins       Date:  2010-01

7.  Genome-scale phylogenetic function annotation of large and diverse protein families.

Authors:  Barbara E Engelhardt; Michael I Jordan; John R Srouji; Steven E Brenner
Journal:  Genome Res       Date:  2011-07-22       Impact factor: 9.043

8.  The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.

Authors:  Ishita K Khan; Qing Wei; Samuel Chapman; Dukka B Kc; Daisuke Kihara
Journal:  Gigascience       Date:  2015-09-14       Impact factor: 6.524

9.  Sequence-based feature prediction and annotation of proteins.

Authors:  Agnieszka S Juncker; Lars J Jensen; Andrea Pierleoni; Andreas Bernsel; Michael L Tress; Peer Bork; Gunnar von Heijne; Alfonso Valencia; Christos A Ouzounis; Rita Casadio; Søren Brunak
Journal:  Genome Biol       Date:  2009-02-02       Impact factor: 13.583

10.  A threading-based method for the prediction of DNA-binding proteins with application to the human genome.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-11-13       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.