Literature DB >> 23514456

Protein function prediction using domain families.

Robert Rentzsch1, Christine A Orengo.   

Abstract

Here we assessed the use of domain families for predicting the functions of whole proteins. These 'functional families' (FunFams) were derived using a protocol that combines sequence clustering with supervised cluster evaluation, relying on available high-quality Gene Ontology (GO) annotation data in the latter step. In essence, the protocol groups domain sequences belonging to the same superfamily into families based on the GO annotations of their parent proteins. An initial test based on enzyme sequences confirmed that the FunFams resemble enzyme (domain) families much better than do families produced by sequence clustering alone. For the CAFA 2011 experiment, we further associated the FunFams with GO terms probabilistically. All target proteins were first submitted to domain superfamily assignment, followed by FunFam assignment and, eventually, function assignment. The latter included an integration step for multi-domain target proteins. The CAFA results put our domain-based approach among the top ten of 31 competing groups and 56 prediction methods, confirming that it outperforms simple pairwise whole-protein sequence comparisons.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23514456      PMCID: PMC3584934          DOI: 10.1186/1471-2105-14-S3-S5

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  32 in total

1.  An insight into domain combinations.

Authors:  G Apic; J Gough; S A Teichmann
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  Gene3D: structural assignment for whole genes and genomes using the CATH domain structure database.

Authors:  Daniel W A Buchan; Adrian J Shepherd; David Lee; Frances M G Pearl; Stuart C G Rison; Janet M Thornton; Christine A Orengo
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043

3.  Program description: Strategies for biological annotation of mammalian systems: implementing gene ontologies in mouse genome informatics.

Authors:  D P Hill; A P Davis; J E Richardson; J P Corradi; M Ringwald; J T Eppig; J A Blake
Journal:  Genomics       Date:  2001-05-15       Impact factor: 5.736

4.  Evolution of the protein repertoire.

Authors:  Cyrus Chothia; Julian Gough; Christine Vogel; Sarah A Teichmann
Journal:  Science       Date:  2003-06-13       Impact factor: 47.728

5.  Gotrees: predicting go associations from protein domain composition using decision trees.

Authors:  Boris Hayete; Jadwiga R Bienkowska
Journal:  Pac Symp Biocomput       Date:  2005

6.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

7.  Gene ontology functional annotations at the structural domain level.

Authors:  Daniel Lopez; Florencio Pazos
Journal:  Proteins       Date:  2009-08-15

8.  Domain-based and family-specific sequence identity thresholds increase the levels of reliable protein function transfer.

Authors:  Sarah Addou; Robert Rentzsch; David Lee; Christine A Orengo
Journal:  J Mol Biol       Date:  2008-12-25       Impact factor: 5.469

Review 9.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

10.  Reorganizing the protein space at the Universal Protein Resource (UniProt).

Authors: 
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

View more
  35 in total

1.  A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling.

Authors:  Hong-Dong Li; Gilbert S Omenn; Yuanfang Guan
Journal:  Brief Bioinform       Date:  2016-01-06       Impact factor: 11.622

2.  Evolution and structural diversification of Nictaba-like lectin genes in food crops with a focus on soybean (Glycine max).

Authors:  Sofie Van Holle; Pierre Rougé; Els J M Van Damme
Journal:  Ann Bot       Date:  2017-03-01       Impact factor: 4.357

Review 3.  The emerging era of genomic data integration for analyzing splice isoform function.

Authors:  Hong-Dong Li; Rajasree Menon; Gilbert S Omenn; Yuanfang Guan
Journal:  Trends Genet       Date:  2014-06-17       Impact factor: 11.639

4.  INGA: protein function prediction combining interaction networks, domain assignments and sequence similarity.

Authors:  Damiano Piovesan; Manuel Giollo; Emanuela Leonardi; Carlo Ferrari; Silvio C E Tosatto
Journal:  Nucleic Acids Res       Date:  2015-05-27       Impact factor: 16.971

5.  Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

Authors:  Kunjie Fan; Yuanfang Guan; Yan Zhang
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

6.  dSPRINT: predicting DNA, RNA, ion, peptide and small molecule interaction sites within protein domains.

Authors:  Anat Etzion-Fuchs; David A Todd; Mona Singh
Journal:  Nucleic Acids Res       Date:  2021-07-21       Impact factor: 16.971

7.  Computational approach to unravel the impact of missense mutations of proteins (D2HGDH and IDH2) causing D-2-hydroxyglutaric aciduria 2.

Authors:  D Thirumal Kumar; L Jerushah Emerald; C George Priya Doss; P Sneha; R Siva; W Charles Emmanuel Jebaraj; Hatem Zayed
Journal:  Metab Brain Dis       Date:  2018-07-09       Impact factor: 3.655

8.  CATH FunFHMMer web server: protein functional annotations using functional family assignments.

Authors:  Sayoni Das; Ian Sillitoe; David Lee; Jonathan G Lees; Natalie L Dawson; John Ward; Christine A Orengo
Journal:  Nucleic Acids Res       Date:  2015-05-11       Impact factor: 16.971

9.  Completeness and Consistency in Structural Domain Classifications.

Authors:  R Dustin Schaeffer; Lisa N Kinch; Jimin Pei; Kirill E Medvedev; Nick V Grishin
Journal:  ACS Omega       Date:  2021-06-08

10.  Functional site plasticity in domain superfamilies.

Authors:  Benoit H Dessailly; Natalie L Dawson; Kenji Mizuguchi; Christine A Orengo
Journal:  Biochim Biophys Acta       Date:  2013-03-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.