
Protein family classification and functional annotation.

Cathy H Wu, Hongzhan Huang, Lai-Su L Yeh, Winona C Barker.

Abstract

With the accelerated accumulation of genomic sequence data, there is a pressing need to develop computational methods and advanced bioinformatics infrastructure for reliable and large-scale protein annotation and biological knowledge discovery. The Protein Information Resource (PIR) provides an integrated public resource of protein informatics to support genomic and proteomic research. PIR produces the Protein Sequence Database of functionally annotated protein sequences. The annotation problems are addressed by a classification-driven and rule-based method with evidence attribution, coupled with an integrated knowledge base system being developed. The approach allows sensitive identification, consistent and rich annotation, and systematic detection of annotation errors, as well as distinction of experimentally verified and computationally predicted features. The knowledge base consists of two new databases, sequence analysis tools, and graphical interfaces. PIR-NREF, a non-redundant reference database, provides a timely and comprehensive collection of all protein sequences, totaling more than 1,000,000 entries. iProClass, an integrated database of protein family, function, and structure information, provides extensive value-added features for about 830,000 proteins with rich links to over 50 molecular databases. This paper describes our approach to protein functional annotation with case studies and examines common identification errors. It also illustrates that data integration in PIR supports exploration of protein relationships and may reveal protein functional associations beyond sequence homology.

Year:  2003        PMID: 12798038     DOI: 10.1016/s1476-9271(02)00098-1

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  31 in total

1.  High-throughput protein analysis integrating bioinformatics and experimental assays.

Authors:  Coral del Val; Alexander Mehrle; Mechthild Falkenhahn; Markus Seiler; Karl-Heinz Glatting; Annemarie Poustka; Sandor Suhai; Stefan Wiemann
Journal:  Nucleic Acids Res       Date:  2004-02-03       Impact factor: 16.971

2.  PIRSF: family classification system at the Protein Information Resource.

Authors:  Cathy H Wu; Anastasia Nikolskaya; Hongzhan Huang; Lai-Su L Yeh; Darren A Natale; C R Vinayaka; Zhang-Zhi Hu; Raja Mazumder; Sandeep Kumar; Panagiotis Kourtesis; Robert S Ledley; Baris E Suzek; Leslie Arminski; Yongxing Chen; Jian Zhang; Jorge Louie Cardenas; Sehee Chung; Jorge Castro-Alvear; Georgi Dinkov; Winona C Barker
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  UniProt: the Universal Protein knowledgebase.

Authors:  Rolf Apweiler; Amos Bairoch; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Sequence analysis and organization of the Neodiprion abietis nucleopolyhedrovirus genome.

Authors:  Simon P Duffy; Aaron M Young; Benoit Morin; Christopher J Lucarotti; Ben F Koop; David B Levin
Journal:  J Virol       Date:  2006-07       Impact factor: 5.103

5.  Evolution of exceptionally large genes in prokaryotes.

Authors:  Min-Chieh Kuo; Li-Fang Chou; Hwan-You Chang
Journal:  J Mol Evol       Date:  2008-03-06       Impact factor: 2.395

6.  The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.

Authors:  Evelyn Camon; Michele Magrane; Daniel Barrell; Vivian Lee; Emily Dimmer; John Maslen; David Binns; Nicola Harte; Rodrigo Lopez; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

7.  Genome-wide comparative gene family classification.

Authors:  Christian Frech; Nansheng Chen
Journal:  PLoS One       Date:  2010-10-15       Impact factor: 3.240

8.  GAIA: a gram-based interaction analysis tool--an approach for identifying interacting domains in yeast.

Authors:  Kelvin X Zhang; B F Francis Ouellette
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

9.  SoyDB: a knowledge database of soybean transcription factors.

Authors:  Zheng Wang; Marc Libault; Trupti Joshi; Babu Valliyodan; Henry T Nguyen; Dong Xu; Gary Stacey; Jianlin Cheng
Journal:  BMC Plant Biol       Date:  2010-01-18       Impact factor: 4.215

10.  Family classification without domain chaining.

Authors:  Jacob M Joseph; Dannie Durand
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937
