Literature DB >> 18591194

Predicting protein function from domain content.

Kristoffer Forslund1, Erik L L Sonnhammer.   

Abstract

MOTIVATION: Computational assignment of protein function may be the single most vital application of bioinformatics in the post-genome era. These assignments are made based on various protein features, where one is the presence of identifiable domains. The relationship between protein domain content and function is important to investigate, to understand how domain combinations encode complex functions.
RESULTS: Two different models are presented on how protein domain combinations yield specific functions: one rule-based and one probabilistic. We demonstrate how these are useful for Gene Ontology annotation transfer. The first is an intuitive generalization of the Pfam2GO mapping, and detects cases of strict functional implications of sets of domains. The second uses a probabilistic model to represent the relationship between domain content and annotation terms, and was found to be better suited for incomplete training sets. We implemented these models as predictors of Gene Ontology functional annotation terms. Both predictors were more accurate than conventional best BLAST-hit annotation transfer and more sensitive than a single-domain model on a large-scale dataset. We present a number of cases where combinations of Pfam-A protein domains predict functional terms that do not follow from the individual domains. AVAILABILITY: Scripts and documentation are available for download at http://sonnhammer.sbc.su.se/multipfam2go_source_docs.tar

Mesh:

Substances:

Year:  2008        PMID: 18591194     DOI: 10.1093/bioinformatics/btn312

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  44 in total

1.  A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling.

Authors:  Hong-Dong Li; Gilbert S Omenn; Yuanfang Guan
Journal:  Brief Bioinform       Date:  2016-01-06       Impact factor: 11.622

2.  Genome-wide computational function prediction of Arabidopsis proteins by integration of multiple data sources.

Authors:  Yiannis A I Kourmpetis; Aalt D J van Dijk; Roeland C H J van Ham; Cajo J F ter Braak
Journal:  Plant Physiol       Date:  2010-11-22       Impact factor: 8.340

Review 3.  The emerging era of genomic data integration for analyzing splice isoform function.

Authors:  Hong-Dong Li; Rajasree Menon; Gilbert S Omenn; Yuanfang Guan
Journal:  Trends Genet       Date:  2014-06-17       Impact factor: 11.639

4.  Proteome-wide remodeling of protein location and function by stress.

Authors:  KiYoung Lee; Min-Kyung Sung; Jihyun Kim; Kyung Kim; Junghyun Byun; Hyojung Paik; Bongkeun Kim; Won-Ki Huh; Trey Ideker
Journal:  Proc Natl Acad Sci U S A       Date:  2014-07-15       Impact factor: 11.205

5.  PoGO: Prediction of Gene Ontology terms for fungal proteins.

Authors:  Jaehee Jung; Gangman Yi; Serenella A Sukno; Michael R Thon
Journal:  BMC Bioinformatics       Date:  2010-04-29       Impact factor: 3.169

6.  Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms.

Authors:  Stanley Kimbung Mbandi; Uljana Hesse; Peter van Heusden; Alan Christoffels
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

7.  Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.

Authors:  Hsin-Nan Lin; Ching-Tai Chen; Ting-Yi Sung; Shinn-Ying Ho; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2009-12-03       Impact factor: 3.169

8.  FACT: functional annotation transfer between proteins with similar feature architectures.

Authors:  Tina Koestler; Arndt von Haeseler; Ingo Ebersberger
Journal:  BMC Bioinformatics       Date:  2010-08-09       Impact factor: 3.169

9.  Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

Authors:  Kunjie Fan; Yuanfang Guan; Yan Zhang
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

10.  d-Omix: a mixer of generic protein domain analysis tools.

Authors:  Duangdao Wichadakul; Somrak Numnark; Supawadee Ingsriswang
Journal:  Nucleic Acids Res       Date:  2009-05-21       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.