Literature DB >> 17456745

DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile.

Hongyi Zhou1, Bin Xue, Yaoqi Zhou.   

Abstract

Dividing protein structures into domains is proven useful for more accurate structural and functional characterization of proteins. Here, we develop a method, called DDOMAIN, that divides structure into DOMAINs using a normalized contact-based domain-domain interaction profile. Results of DDOMAIN are compared to AUTHORS annotations (domain definitions are given by the authors who solved protein structures), as well as to popular SCOP and CATH annotations by human experts and automatic programs. DDOMAIN's automatic annotations are most consistent with the AUTHORS annotations (90% agreement in number of domains and 88% agreement in both number of domains and at least 85% overlap in domain assignment of residues) if its three adjustable parameters are trained by the AUTHORS annotations. By comparison, the agreement is 83% (81% with at least 85% overlap criterion) between SCOP-trained DDOMAIN and SCOP annotations and 77% (73%) between CATH-trained DDOMAIN and CATH annotations. The agreement between DDOMAIN and AUTHORS annotations goes beyond single-domain proteins (97%, 82%, and 56% for single-, two-, and three-domain proteins, respectively). For an "easy" data set of proteins whose CATH and SCOP annotations agree with each other in number of domains, the agreement is 90% (89%) between "easy-set"-trained DDOMAIN and CATH/SCOP annotations. The consistency between SCOP-trained DDOMAIN and SCOP annotations is superior to two other recently developed, SCOP-trained, automatic methods PDP (protein domain parser), and DomainParser 2. We also tested a simple consensus method made of PDP, DomainParser 2, and DDOMAIN and a different version of DDOMAIN based on a more sophisticated statistical energy function. The DDOMAIN server and its executable are available in the services section on http://sparks.informatics.iupui.edu.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17456745      PMCID: PMC2206635          DOI: 10.1110/ps.062597307

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  33 in total

1.  Protein domain decomposition using a graph-theoretic approach.

Authors:  Y Xu; D Xu; H N Gabow; H Gabow
Journal:  Bioinformatics       Date:  2000-12       Impact factor: 6.937

2.  PDP: protein domain parser.

Authors:  Nickolai Alexandrov; Ilya Shindyalov
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

3.  Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction.

Authors:  Hongyi Zhou; Yaoqi Zhou
Journal:  Protein Sci       Date:  2002-11       Impact factor: 6.725

4.  Automated prediction of CASP-5 structures using the Robetta server.

Authors:  Dylan Chivian; David E Kim; Lars Malmström; Philip Bradley; Timothy Robertson; Paul Murphy; Charles E M Strauss; Richard Bonneau; Carol A Rohl; David Baker
Journal:  Proteins       Date:  2003

5.  Improving the performance of DomainParser for structural domain partition using neural network.

Authors:  Jun-tao Guo; Dong Xu; Dongsup Kim; Ying Xu
Journal:  Nucleic Acids Res       Date:  2003-02-01       Impact factor: 16.971

6.  GlobPlot: Exploring protein sequences for globularity and disorder.

Authors:  Rune Linding; Robert B Russell; Victor Neduva; Toby J Gibson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

7.  PISCES: a protein sequence culling server.

Authors:  Guoli Wang; Roland L Dunbrack
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

8.  Partitioning protein structures into domains: why is it so difficult?

Authors:  Timothy A Holland; Stella Veretnik; Ilya N Shindyalov; Philip E Bourne
Journal:  J Mol Biol       Date:  2006-06-22       Impact factor: 5.469

9.  Exhaustive enumeration of protein domain families.

Authors:  Andreas Heger; Liisa Holm
Journal:  J Mol Biol       Date:  2003-05-02       Impact factor: 5.469

10.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more
  24 in total

1.  Highly accurate and high-resolution function prediction of RNA binding proteins by fold recognition and binding affinity prediction.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  RNA Biol       Date:  2011-11-01       Impact factor: 4.652

2.  IS-Dom: a dataset of independent structural domains automatically delineated from protein structures.

Authors:  Teppei Ebina; Yuki Umezawa; Yutaka Kuroda
Journal:  J Comput Aided Mol Des       Date:  2013-05-29       Impact factor: 3.686

3.  Definition and classification of evaluation units for CASP10.

Authors:  Todd J Taylor; Chin-Hsien Tai; Yuanpeng J Huang; Jeremy Block; Hongjun Bai; Andriy Kryshtafovych; Gaetano T Montelione; Byungkook Lee
Journal:  Proteins       Date:  2013-11-22

4.  Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates.

Authors:  Yuedong Yang; Eshel Faraggi; Huiying Zhao; Yaoqi Zhou
Journal:  Bioinformatics       Date:  2011-06-11       Impact factor: 6.937

5.  FUpred: detecting protein domains through deep-learning-based contact map prediction.

Authors:  Wei Zheng; Xiaogen Zhou; Qiqige Wuyun; Robin Pearce; Yang Li; Yang Zhang
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

6.  An automated procedure for detecting protein folds from sub-nanometer resolution electron density.

Authors:  Reza Khayat; Gabriel C Lander; John E Johnson
Journal:  J Struct Biol       Date:  2009-12-22       Impact factor: 2.867

7.  CASP11 statistics and the prediction center evaluation system.

Authors:  Andriy Kryshtafovych; Bohdan Monastyrskyy; Krzysztof Fidelis
Journal:  Proteins       Date:  2016-03-09

8.  dConsensus: a tool for displaying domain assignments by multiple structure-based algorithms and for construction of a consensus assignment.

Authors:  Kieran Alden; Stella Veretnik; Philip E Bourne
Journal:  BMC Bioinformatics       Date:  2010-06-09       Impact factor: 3.169

9.  Evaluation system and web infrastructure for the second cryo-EM model challenge.

Authors:  Andriy Kryshtafovych; Paul D Adams; Catherine L Lawson; Wah Chiu
Journal:  J Struct Biol       Date:  2018-07-12       Impact factor: 2.867

10.  A threading-based method for the prediction of DNA-binding proteins with application to the human genome.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-11-13       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.