Literature DB >> 14764575

Statistically rigorous automated protein annotation.

Werner G Krebs1, Philip E Bourne.   

Abstract

MOTIVATION: Assignment of putative protein functional annotation by comparative analysis using pre-defined experimental annotations is performed routinely by molecular biologists. The number and statistical significance of these assignments remains a challenge in this era of high-throughput proteomics. A combined statistical method that enables robust, automated protein annotation by reliably expanding existing annotation sets is described. An existing clustering scheme, based on relevant experimental information (e.g. sequence identity, keywords or gene expression data) is required. The method assigns new proteins to these clusters with a measure of reliability. It can also provide human reviewers with a reliability score for both new and previously classified proteins.
RESULTS: A dataset of 27 000 annotated Protein Data Bank (PDB) polypeptide chains (of 36 000 chains currently in the PDB) was generated from 23 000 chains classified a priori. AVAILABILITY: PDB annotations and sample software implementation are freely accessible on the Web at http://pmr.sdsc.edu/go

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 14764575     DOI: 10.1093/bioinformatics/bth039

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Probabilistic annotation of protein sequences based on functional classifications.

Authors:  Emmanuel D Levy; Christos A Ouzounis; Walter R Gilks; Benjamin Audit
Journal:  BMC Bioinformatics       Date:  2005-12-14       Impact factor: 3.169

2.  MACSIMS: multiple alignment of complete sequences information management system.

Authors:  Julie D Thompson; Arnaud Muller; Andrew Waterhouse; Jim Procter; Geoffrey J Barton; Frédéric Plewniak; Olivier Poch
Journal:  BMC Bioinformatics       Date:  2006-06-23       Impact factor: 3.169

3.  Publishing proteomic data.

Authors:  Martin Latterich
Journal:  Proteome Sci       Date:  2006-04-28       Impact factor: 2.480

4.  EST2Prot: mapping EST sequences to proteins.

Authors:  Paul Shafer; David M Lin; Golan Yona
Journal:  BMC Genomics       Date:  2006-03-04       Impact factor: 3.969

5.  CORRIE: enzyme sequence annotation with confidence estimates.

Authors:  Benjamin Audit; Emmanuel D Levy; Wally R Gilks; Leon Goldovsky; Christos A Ouzounis
Journal:  BMC Bioinformatics       Date:  2007-05-22       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.