Literature DB >> 17115254

Sequence representation and prediction of protein secondary structure for structural motifs in twilight zone proteins.

Lukasz Kurgan1, Kanaka Durga Kedarisetti.   

Abstract

Characterizing and classifying regularities in protein structure is an important element in uncovering the mechanisms that regulate protein structure, function and evolution. Recent research concentrates on analysis of structural motifs that can be used to describe larger, fold-sized structures based on homologous primary sequences. At the same time, accuracy of secondary protein structure prediction based on multiple sequence alignment drops significantly when low homology (twilight zone) sequences are considered. To this end, this paper addresses a problem of providing an alternative sequences representation that would improve ability to distinguish secondary structures for the twilight zone sequences without using alignment. We consider a novel classification problem, in which, structural motifs, referred to as structural fragments (SFs) are defined as uniform strand, helix and coil fragments. Classification of SFs allows to design novel sequence representations, and to investigate which other factors and prediction algorithms may result in the improved discrimination. Comprehensive experimental results show that statistically significant improvement in classification accuracy can be achieved by: (1) improving sequence representations, and (2) removing possible noise on the terminal residues in the SFs. Combining these two approaches reduces the error rate on average by 15% when compared to classification using standard representation and noisy information on the terminal residues, bringing the classification accuracy to over 70%. Finally, we show that certain prediction algorithms, such as neural networks and boosted decision trees, are superior to other algorithms.

Mesh:

Substances:

Year:  2006        PMID: 17115254     DOI: 10.1007/s10930-006-9029-0

Source DB:  PubMed          Journal:  Protein J        ISSN: 1572-3887            Impact factor:   2.371


  50 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

4.  Fold recognition and accurate query-template alignment by a combination of PSI-BLAST and threading.

Authors:  Y Shan; G Wang; H X Zhou
Journal:  Proteins       Date:  2001-01-01

5.  Predicting protein structural class by functional domain composition.

Authors:  Kuo-Chen Chou; Yu-Dong Cai
Journal:  Biochem Biophys Res Commun       Date:  2004-09-03       Impact factor: 3.575

6.  Development of hydrophobicity parameters to analyze proteins which bear post- or cotranslational modifications.

Authors:  S D Black; D R Mould
Journal:  Anal Biochem       Date:  1991-02-15       Impact factor: 3.365

7.  Porter: a new, accurate server for protein secondary structure prediction.

Authors:  Gianluca Pollastri; Aoife McLysaght
Journal:  Bioinformatics       Date:  2004-12-07       Impact factor: 6.937

8.  Predicting protein secondary structure content. A tandem neural network approach.

Authors:  S M Muskal; S H Kim
Journal:  J Mol Biol       Date:  1992-06-05       Impact factor: 5.469

9.  Prediction of the helix/strand content of globular proteins based on their primary sequences.

Authors:  C T Zhang; Z S Lin; Z Zhang; M Yan
Journal:  Protein Eng       Date:  1998-11

10.  Structural classification of alphabetabeta and betabetaalpha supersecondary structure units in proteins.

Authors:  N S Boutonnet; A V Kajava; M J Rooman
Journal:  Proteins       Date:  1998-02-01
View more
  2 in total

1.  Structural alphabets for protein structure classification: a comparison study.

Authors:  Quan Le; Gianluca Pollastri; Patrice Koehl
Journal:  J Mol Biol       Date:  2008-12-25       Impact factor: 5.469

2.  Protein-segment universe exhibiting transitions at intermediate segment length in conformational subspaces.

Authors:  Kazuyoshi Ikeda; Takatsugu Hirokawa; Junichi Higo; Kentaro Tomii
Journal:  BMC Struct Biol       Date:  2008-08-13
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.