Literature DB >> 21445639

A new approach to assess and predict the functional roles of proteins across all known structures.

Elchin S Julfayev1, Ryan J McLaughlin, Yi-Ping Tao, William A McLaughlin.   

Abstract

The three dimensional atomic structures of proteins provide information regarding their function; and codified relationships between structure and function enable the assessment of function from structure. In the current study, a new data mining tool was implemented that checks current gene ontology (GO) annotations and predicts new ones across all the protein structures available in the Protein Data Bank (PDB). The tool overcomes some of the challenges of utilizing large amounts of protein annotation and measurement information to form correspondences span>n class="Gene">between protein structure and function. Protein attributes were extracted from the Structural Biology Knowledgebase and open source biological databases. Based on the presence or absence of a given set of attributes, a given protein's functional annotations were inferred. The results show that attributes derived from the three dimensional structures of proteins enhanced predictions over that using attributes only derived from primary amino acid sequence. Some predictions reflected known but not completely documented GO annotations. For example, predictions for the GO term for copper ion binding reflected used information a copper ion was known to interact with the protein based on information in a ligand interaction database. Other predictions were novel and require further experimental validation. These include predictions for proteins labeled as unknown function in the PDB. Two examples are a role in the regulation of transcription for the protein AF1396 from Archaeoglobus fulgidus and a role in RNA metabolism for the protein psuG from Thermotoga maritima.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21445639      PMCID: PMC3089730          DOI: 10.1007/s10969-011-9105-3

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


  58 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The ENZYME database in 2000.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Crystal structure of an indigoidine synthase A (IndA)-like protein (TM1464) from Thermotoga maritima at 1.90 A resolution reveals a new fold.

Authors:  Inna Levin; Mitchell D Miller; Robert Schwarzenbacher; Daniel McMullan; Polat Abdubek; Eileen Ambing; Tanya Biorac; Jamison Cambell; Jaume M Canaves; Hsiu-Ju Chiu; Ashley M Deacon; Michael DiDonato; Marc-André Elsliger; Adam Godzik; Carina Grittini; Slawomir K Grzechnik; Joanna Hale; Eric Hampton; Gye Won Han; Justin Haugen; Michael Hornsby; Lukasz Jaroszewski; Cathy Karlak; Heath E Klock; Eric Koesema; Andreas Kreusch; Peter Kuhn; Scott A Lesley; Andrew Morse; Kin Moy; Edward Nigoghossian; Jie Ouyang; Rebecca Page; Kevin Quijano; Ron Reyes; Alyssa Robb; Eric Sims; Glen Spraggon; Raymond C Stevens; Henry van den Bedem; Jeff Velasquez; Juli Vincent; Xianhong Wang; Bill West; Guenter Wolf; Qingping Xu; Olga Zagnitko; Keith O Hodgson; John Wooley; Ian A Wilson
Journal:  Proteins       Date:  2005-06-01

4.  Inference of protein function from protein structure.

Authors:  Debnath Pal; David Eisenberg
Journal:  Structure       Date:  2005-01       Impact factor: 5.006

5.  A method to identify protein sequences that fold into a known three-dimensional structure.

Authors:  J U Bowie; R Lüthy; D Eisenberg
Journal:  Science       Date:  1991-07-12       Impact factor: 47.728

6.  The CATH Database provides insights into protein structure/function relationships.

Authors:  C A Orengo; F M Pearl; J E Bray; A E Todd; A C Martin; L Lo Conte; J M Thornton
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

7.  A combined algorithm for genome-wide prediction of protein function.

Authors:  E M Marcotte; M Pellegrini; M J Thompson; T O Yeates; D Eisenberg
Journal:  Nature       Date:  1999-11-04       Impact factor: 49.962

8.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

9.  Crystal structure of a novel shikimate dehydrogenase from Haemophilus influenzae.

Authors:  Sasha Singh; Sergey Korolev; Olga Koroleva; Thomas Zarembinski; Frank Collart; Andrzej Joachimiak; Dinesh Christendat
Journal:  J Biol Chem       Date:  2005-02-25       Impact factor: 5.157

10.  E-MSD: an integrated data resource for bioinformatics.

Authors:  S Velankar; P McNeil; V Mittard-Runte; A Suarez; D Barrell; R Apweiler; K Henrick
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  3 in total

1.  KB-Rank: efficient protein structure and functional annotation identification via text query.

Authors:  Elchin S Julfayev; Ryan J McLaughlin; Yi-Ping Tao; William A McLaughlin
Journal:  J Struct Funct Genomics       Date:  2012-01-21

2.  Quality of computationally inferred gene ontology annotations.

Authors:  Nives Skunca; Adrian Altenhoff; Christophe Dessimoz
Journal:  PLoS Comput Biol       Date:  2012-05-31       Impact factor: 4.475

Review 3.  Data to knowledge: how to get meaning from your result.

Authors:  Helen M Berman; Margaret J Gabanyi; Colin R Groom; John E Johnson; Garib N Murshudov; Robert A Nicholls; Vijay Reddy; Torsten Schwede; Matthew D Zimmerman; John Westbrook; Wladek Minor
Journal:  IUCrJ       Date:  2015-01-01       Impact factor: 4.769

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.