Literature DB >> 11932249

Predicting gene ontology functions from ProDom and CDD protein domains.

Jonathan Schug1, Sharon Diskin, Joan Mazzarelli, Brian P Brunk, Christian J Stoeckert.   

Abstract

A heuristic algorithm for associating Gene Ontology (GO) defined molecular functions to protein domains as listed in the ProDom and CDD databases is described. The algorithm generates rules for function-domain associations based on the intersection of functions assigned to gene products by the GO consortium that contain ProDom and/or CDD domains at varying levels of sequence similarity. The hierarchical nature of GO molecular functions is incorporated into rule generation. Manual review of a subset of the rules generated indicates an accuracy rate of 87% for ProDom rules and 84% for CDD rules. The utility of these associations is that novel sequences can be assigned a putative function if sufficient similarity exists to a ProDom or CDD domain for which one or more GO functions has been associated. Although functional assignments are increasingly being made for gene products from model organisms, it is likely that the needs of investigators will continue to outpace the efforts of curators, particularly for nonmodel organisms. A comparison with other methods in terms of coverage and agreement was performed, indicating the utility of the approach. The domain-function associations and function assignments are available from our website http://www.cbil.upenn.edu/GO.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11932249      PMCID: PMC187511          DOI: 10.1101/gr.222902

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  16 in total

1.  Increased coverage of protein families with the blocks database servers.

Authors:  J G Henikoff; E A Greene; S Pietrokovski; S Henikoff
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  SMART: a web-based tool for the study of genetically mobile domains.

Authors:  J Schultz; R R Copley; T Doerks; C P Ponting; P Bork
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites.

Authors:  R Apweiler; T K Attwood; A Bairoch; A Bateman; E Birney; M Biswas; P Bucher; L Cerutti; F Corpet; M D Croning; R Durbin; L Falquet; W Fleischmann; J Gouzy; H Hermjakob; N Hulo; I Jonassen; D Kahn; A Kanapin; Y Karavidopoulou; R Lopez; B Marx; N J Mulder; T M Oinn; M Pagni; F Servant; C J Sigrist; E M Zdobnov
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

Review 4.  The LIM domain: regulation by association.

Authors:  I Bach
Journal:  Mech Dev       Date:  2000-03-01       Impact factor: 1.882

5.  Creating the gene ontology resource: design and implementation.

Authors: 
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

6.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

7.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

8.  Protein family classification based on searching a database of blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Genomics       Date:  1994-01-01       Impact factor: 5.736

9.  Functional annotation of a full-length mouse cDNA collection.

Authors:  J Kawai; A Shinagawa; K Shibata; M Yoshino; M Itoh; Y Ishii; T Arakawa; A Hara; Y Fukunishi; H Konno; J Adachi; S Fukuda; K Aizawa; M Izawa; K Nishi; H Kiyosawa; S Kondo; I Yamanaka; T Saito; Y Okazaki; T Gojobori; H Bono; T Kasukawa; R Saito; K Kadota; H Matsuda; M Ashburner; S Batalov; T Casavant; W Fleischmann; T Gaasterland; C Gissi; B King; H Kochiwa; P Kuehl; S Lewis; Y Matsuo; I Nikaido; G Pesole; J Quackenbush; L M Schriml; F Staubli; R Suzuki; M Tomita; L Wagner; T Washio; K Sakai; T Okido; M Furuno; H Aono; R Baldarelli; G Barsh; J Blake; D Boffelli; N Bojunga; P Carninci; M F de Bonaldo; M J Brownstein; C Bult; C Fletcher; M Fujita; M Gariboldi; S Gustincich; D Hill; M Hofmann; D A Hume; M Kamiya; N H Lee; P Lyons; L Marchionni; J Mashima; J Mazzarelli; P Mombaerts; P Nordone; B Ring; M Ringwald; I Rodriguez; N Sakamoto; H Sasaki; K Sato; C Schönbach; T Seya; Y Shibata; K F Storch; H Suzuki; K Toyo-oka; K H Wang; C Weitz; C Whittaker; L Wilming; A Wynshaw-Boris; K Yoshida; Y Hasegawa; H Kawaji; S Kohtsuki; Y Hayashizaki
Journal:  Nature       Date:  2001-02-08       Impact factor: 49.962

10.  Accurate prediction of protein functional class from sequence in the Mycobacterium tuberculosis and Escherichia coli genomes using data mining.

Authors:  R D King; A Karwath; A Clare; L Dehaspe
Journal:  Yeast       Date:  2000-12       Impact factor: 3.239

View more
  27 in total

1.  Gene discovery in the apicomplexa as revealed by EST sequencing and assembly of a comparative gene database.

Authors:  Li Li; Brian P Brunk; Jessica C Kissinger; Deana Pape; Keliang Tang; Robert H Cole; John Martin; Todd Wylie; Mike Dante; Steven J Fogarty; Daniel K Howe; Paul Liberator; Carmen Diaz; Jennifer Anderson; Michael White; Maria E Jerome; Emily A Johnson; Jay A Radke; Christian J Stoeckert; Robert H Waterston; Sandra W Clifton; David S Roos; L David Sibley
Journal:  Genome Res       Date:  2003-03       Impact factor: 9.043

2.  A literature-based method for assessing the functional coherence of a gene group.

Authors:  Soumya Raychaudhuri; Russ B Altman
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

3.  The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro.

Authors:  Evelyn Camon; Michele Magrane; Daniel Barrell; David Binns; Wolfgang Fleischmann; Paul Kersey; Nicola Mulder; Tom Oinn; John Maslen; Anthony Cox; Rolf Apweiler
Journal:  Genome Res       Date:  2003-03-12       Impact factor: 9.043

4.  Automated Gene Ontology annotation for anonymous sequence data.

Authors:  Steffen Hennig; Detlef Groth; Hans Lehrach
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  OntoBlast function: From sequence similarities directly to potential functional annotations by ontology terms.

Authors:  Günther Zehetner
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  Predicting gene function from patterns of annotation.

Authors:  Oliver D King; Rebecca E Foulger; Selina S Dwight; James V White; Frederick P Roth
Journal:  Genome Res       Date:  2003-04-14       Impact factor: 9.043

7.  A semantic analysis of the annotations of the human genome.

Authors:  Purvesh Khatri; Bogdan Done; Archana Rao; Arina Done; Sorin Draghici
Journal:  Bioinformatics       Date:  2005-06-14       Impact factor: 6.937

8.  The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.

Authors:  Evelyn Camon; Michele Magrane; Daniel Barrell; Vivian Lee; Emily Dimmer; John Maslen; David Binns; Nicola Harte; Rodrigo Lopez; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

9.  A plant orthologue of RNase L inhibitor (RLI) is induced in plants showing RNA interference.

Authors:  Antonio Sergio Kimus Braz; Jean Finnegan; Peter Waterhouse; Rogério Margis
Journal:  J Mol Evol       Date:  2004-07       Impact factor: 2.395

10.  Identification of salt treated proteins in sorghum using gene ontology linkage.

Authors:  Manoj Kumar Sekhwal; Ajit Kumar Swami; Renu Sarin; Vinay Sharma
Journal:  Physiol Mol Biol Plants       Date:  2012-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.