Literature DB >> 15632283

Descriptor-based protein remote homology identification.

Ziding Zhang1, Sunil Kochhar, Martin G Grigorov.   

Abstract

Here, we report a novel protein sequence descriptor-based remote homology identification method, able to infer fold relationships without the explicit knowledge of structure. In a first phase, we have individually benchmarked 13 different descriptor types in fold identification experiments in a highly diverse set of protein sequences. The relevant descriptors were related to the fold class membership by using simple similarity measures in the descriptor spaces, such as the cosine angle. Our results revealed that the three best-performing sets of descriptors were the sequence-alignment-based descriptor using PSI-BLAST e-values, the descriptors based on the alignment of secondary structural elements (SSEA), and the descriptors based on the occurrence of PROSITE functional motifs. In a second phase, the three top-performing descriptors were combined to obtain a final method with improved performance, which we named DescFold. Class membership was predicted by Support Vector Machine (SVM) learning. In comparison with the individual PSI-BLAST-based descriptor, the rate of remote homology identification increased from 33.7% to 46.3%. We found out that the composite set of descriptors was able to identify the true remote homolog for nearly every sixth sequence at the 95% confidence level, or some 10% more than a single PSI-BLAST search. We have benchmarked the DescFold method against several other state-of-the-art fold recognition algorithms for the 172 LiveBench-8 targets, and we concluded that it was able to add value to the existing techniques by providing a confident hit for at least 10% of the sequences not identifiable by the previously known methods.

Mesh:

Year:  2005        PMID: 15632283      PMCID: PMC2253398          DOI: 10.1110/ps.041035505

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  64 in total

1.  Crystal structure of alginate lyase A1-III from Sphingomonas species A1 at 1.78 A resolution.

Authors:  H J Yoon; B Mikami; W Hashimoto; K Murata
Journal:  J Mol Biol       Date:  1999-07-09       Impact factor: 5.469

2.  Characterization of novel proteins based on known protein structures.

Authors:  W A Koppensteiner; P Lackner; M Wiederstein; M J Sippl
Journal:  J Mol Biol       Date:  2000-03-03       Impact factor: 5.469

3.  MaxSub: an automated measure for the assessment of protein structure prediction quality.

Authors:  N Siew; A Elofsson; L Rychlewski; D Fischer
Journal:  Bioinformatics       Date:  2000-09       Impact factor: 6.937

4.  Classification of G-protein coupled receptors by alignment-independent extraction of principal chemical properties of primary amino acid sequences.

Authors:  Maris Lapinsh; Alexandrs Gutcaits; Peteris Prusis; Claes Post; Torbjörn Lundstedt; Jarl E S Wikberg
Journal:  Protein Sci       Date:  2002-04       Impact factor: 6.725

5.  Assessment of the CASP4 fold recognition category.

Authors:  M J Sippl; P Lackner; F S Domingues; A Prlić; R Malik; A Andreeva; M Wiederstein
Journal:  Proteins       Date:  2001

6.  Targeting novel folds for structural genomics.

Authors:  Liam J McGuffin; David T Jones
Journal:  Proteins       Date:  2002-07-01

7.  In silico protein recombination: enhancing template and sequence alignment selection for comparative protein modelling.

Authors:  Bruno Contreras-Moreira; Paul W Fitzjohn; Paul A Bates
Journal:  J Mol Biol       Date:  2003-05-02       Impact factor: 5.469

8.  Competitive assessment of protein fold recognition and alignment accuracy.

Authors:  M Levitt
Journal:  Proteins       Date:  1997

9.  Evaluation of threading specificity and accuracy.

Authors:  S H Bryant
Journal:  Proteins       Date:  1996-10

10.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

View more
  15 in total

1.  Comparison of a homology model and the crystallographic structure of human 11beta-hydroxysteroid dehydrogenase type 1 (11betaHSD1) in a structure-based identification of inhibitors.

Authors:  Laurence Miguet; Ziding Zhang; Maryse Barbier; Martin G Grigorov
Journal:  J Comput Aided Mol Des       Date:  2006-04-20       Impact factor: 3.686

2.  eThread: a highly optimized machine learning-based approach to meta-threading and the modeling of protein tertiary structures.

Authors:  Michal Brylinski; Daswanth Lingam
Journal:  PLoS One       Date:  2012-11-21       Impact factor: 3.240

3.  Outer membrane proteins can be simply identified using secondary structure element alignment.

Authors:  Ren-Xiang Yan; Zhen Chen; Ziding Zhang
Journal:  BMC Bioinformatics       Date:  2011-03-17       Impact factor: 3.169

4.  Medicago truncatula transporter database: a comprehensive database resource for M. truncatula transporters.

Authors:  Zhenyan Miao; Daofeng Li; Zhenhai Zhang; Jiangli Dong; Zhen Su; Tao Wang
Journal:  BMC Genomics       Date:  2012-02-06       Impact factor: 3.969

5.  TIM-Finder: a new method for identifying TIM-barrel proteins.

Authors:  Jing-Na Si; Ren-Xiang Yan; Chuan Wang; Ziding Zhang; Xiao-Dong Su
Journal:  BMC Struct Biol       Date:  2009-12-14

6.  DescFold: a web server for protein fold recognition.

Authors:  Ren-Xiang Yan; Jing-Na Si; Chuan Wang; Ziding Zhang
Journal:  BMC Bioinformatics       Date:  2009-12-14       Impact factor: 3.169

7.  Relationships between kinetic constants and the amino acid composition of enzymes from the yeast Saccharomyces cerevisiae glycolysis pathway.

Authors:  Peteris Zikmanis; Inara Kampenusa
Journal:  EURASIP J Bioinform Syst Biol       Date:  2012-08-06

8.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

9.  Human Pol II promoter recognition based on primary sequences and free energy of dinucleotides.

Authors:  Jian-Yi Yang; Yu Zhou; Zu-Guo Yu; Vo Anh; Li-Qian Zhou
Journal:  BMC Bioinformatics       Date:  2008-02-24       Impact factor: 3.169

10.  Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs.

Authors:  Yong-Zi Chen; Yu-Rong Tang; Zhi-Ya Sheng; Ziding Zhang
Journal:  BMC Bioinformatics       Date:  2008-02-18       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.