Literature DB >> 30239678

THE-DB: a threading model database for comparative protein structure analysis of the E. coli K12 and human proteomes.

Justin S Diamond1,2, Yang Zhang1.   

Abstract

New methodology must be developed to improve the ability to characterize the growing number of amino acid sequences, which vastly exceeds the number of experimentally determined protein structures. Homologous proteins can be used as structural templates for modeling proteins that do not have experimentally determined structures. However, in many cases, there are no homologous proteins (typically <30% sequence identity) with determined structures from which a query sequence can be reliably modeled. The aim of protein threading is to use features, such as secondary structure, solvent accessibility and torsional angles, in addition to sequence patterns to identify structural templates from the protein databank to assist for full-length atomic-level structural modeling. However, there are still numerous protein sequences for which correct templates cannot be recognized. This raises the question as to what attributes allow query sequences to be matched to the correct but distantly homologous templates. To aid the investigation into this question and to provide genome-score protein structure for the biological community, a database called THE-DB (threading hard and easy protein database) has been developed in which it becomes possible to analyze over 15 000 query sequences from the Escherichia coli (E. coli) K12 and human proteomes, as well as to find their three-dimensional templates derived from the state-of-the-art threading algorithms which is not feasible with existing protein template databases. The E. coli K12 and human data can be downloaded in bulk from the THE-DB page.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30239678      PMCID: PMC6146127          DOI: 10.1093/database/bay090

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  17 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

3.  Further evidence for the likely completeness of the library of solved single domain protein structures.

Authors:  Jeffrey Skolnick; Hongyi Zhou; Michal Brylinski
Journal:  J Phys Chem B       Date:  2012-02-13       Impact factor: 2.991

4.  The protein structure prediction problem could be solved using the current PDB library.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-14       Impact factor: 11.205

5.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions.

Authors:  K T Simons; C Kooperberg; E Huang; D Baker
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

6.  Principles that govern the folding of protein chains.

Authors:  C B Anfinsen
Journal:  Science       Date:  1973-07-20       Impact factor: 47.728

7.  The Protein Model Portal--a comprehensive resource for protein structure and model information.

Authors:  Juergen Haas; Steven Roth; Konstantin Arnold; Florian Kiefer; Tobias Schmidt; Lorenza Bordoli; Torsten Schwede
Journal:  Database (Oxford)       Date:  2013-04-26       Impact factor: 3.451

8.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

9.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

10.  The SWISS-MODEL Repository and associated resources.

Authors:  Florian Kiefer; Konstantin Arnold; Michael Künzli; Lorenza Bordoli; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2008-10-18       Impact factor: 16.971

View more
  1 in total

1.  Protein structure prediction using sparse NOE and RDC restraints with Rosetta in CASP13.

Authors:  Georg Kuenze; Jens Meiler
Journal:  Proteins       Date:  2019-07-18
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.