Literature DB >> 16649034

Fast prediction of protein domain boundaries using conserved local patterns.

Rajani R Joshi1, Vivekanand V Samant.   

Abstract

We have found certain conserved motifs and secondary structural patterns present in the vicinity of interior domain boundary points (dbps) by a data-driven approach without any a priori constraint on the type and number of such features, and without any requirement of sequence homology. We have used these motifs and patterns to rerank the solutions obtained by the well-known domain guess by size (DGS) algorithm. We predict, overall, five solutions. The average accuracy of overall (i.e., top five) predictions by our method [domain boundary prediction using conserved patterns (DPCP)] has improved the average accuracy of the top five solutions of DGS from 71.74 to 82.88 %, in the case of two-continuous-domain proteins, and from 21.38 to 80.56 %, for two-discontinuous-domain proteins. Considering only the top solution, the gains in accuracy are from 0 to 72.74 % for two-continuous-domain proteins with chain lengths up to 300 residues, and from 0 to 62.85 % for those with up to 400 residues. In the case of discontinuous domains, top_min solutions (the minimum number of solutions required for predicting all dbps of a protein) of DPCP improve the average accuracy of DGS prediction from 12.5 to 76.3 % in proteins with chain lengths up to 300 residues, and from 13.33 to 70.84 % for proteins with up to 400 residues. In our validation experiments, the performance of DPCP was also found to be superior to that of domain identification from secondary structure element alignment (DomSSEA), the best method reported so far for efficient prediction of domain boundaries using predicted secondary structure. The average accuracies of the topmost solution of DomSSEA are 61 and 52 % for proteins with up to 300 residues and 400, respectively, in the case of continuous domains; the corresponding accuracies for the discontinuous case are 28 and 21 %.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16649034     DOI: 10.1007/s00894-006-0116-0

Source DB:  PubMed          Journal:  J Mol Model        ISSN: 0948-5023            Impact factor:   1.810


  16 in total

1.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

2.  Universal similarity measure for comparing protein structures.

Authors:  M R Betancourt; J Skolnick
Journal:  Biopolymers       Date:  2001-10-15       Impact factor: 2.505

3.  DomCut: prediction of inter-domain linker regions in amino acid sequences.

Authors:  Mikita Suyama; Osamu Ohara
Journal:  Bioinformatics       Date:  2003-03-22       Impact factor: 6.937

4.  Rapid protein domain assignment from amino acid sequence using predicted secondary structure.

Authors:  Russell L Marsden; Liam J McGuffin; David T Jones
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

Review 5.  Computational tools for protein modeling.

Authors:  D Xu; Y Xu; E C Uberbacher
Journal:  Curr Protein Pept Sci       Date:  2000-07       Impact factor: 3.272

6.  Parameterization and classification of the protein universe via geometric techniques.

Authors:  Ashish V Tendulkar; Pramod P Wangikar; Milind A Sohoni; Vivekanand V Samant; Chetan Y Mone
Journal:  J Mol Biol       Date:  2003-11-14       Impact factor: 5.469

7.  Structure prediction of a multi-domain EF-hand Ca2+ binding protein by PROPAINOR.

Authors:  Subramanian Jyothi; Sourajit M Mustafi; Kandala V R Chary; Rajani R Joshi
Journal:  J Mol Model       Date:  2005-08-11       Impact factor: 1.810

8.  Domain assignment for protein structures using a consensus approach: characterization and analysis.

Authors:  S Jones; M Stewart; A Michie; M B Swindells; C Orengo; J M Thornton
Journal:  Protein Sci       Date:  1998-02       Impact factor: 6.725

9.  PROMOTIF--a program to identify and analyze structural motifs in proteins.

Authors:  E G Hutchinson; J M Thornton
Journal:  Protein Sci       Date:  1996-02       Impact factor: 6.725

10.  A new approach to clustering the amino acids.

Authors:  L E Stanfel
Journal:  J Theor Biol       Date:  1996-11-21       Impact factor: 2.691

View more
  2 in total

1.  Quantitative characterization of protein tertiary motifs.

Authors:  Rajani R Joshi; S Sreenath
Journal:  J Mol Model       Date:  2014-01-26       Impact factor: 1.810

2.  Identifying foldable regions in protein sequence from the hydrophobic signal.

Authors:  Chi N I Pang; Kuang Lin; Merridee A Wouters; Jaap Heringa; Richard A George
Journal:  Nucleic Acids Res       Date:  2007-12-01       Impact factor: 16.971

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.