Literature DB >> 34320027

A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction.

Teng-Ruei Chen1,2, Sheng-Hung Juan1,2, Yu-Wei Huang1,2, Yen-Cheng Lin3,4, Wei-Cheng Lo1,2,3,4,5.   

Abstract

Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM.

Entities:  

Year:  2021        PMID: 34320027     DOI: 10.1371/journal.pone.0255076

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  89 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Automated structure prediction of weakly homologous proteins on a genomic scale.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-04       Impact factor: 11.205

3.  Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training.

Authors:  Ofer Dor; Yaoqi Zhou
Journal:  Proteins       Date:  2007-03-01

4.  MUPRED: a tool for bridging the gap between template based methods and sequence profile based methods for protein secondary structure prediction.

Authors:  Rajkumar Bondugula; Dong Xu
Journal:  Proteins       Date:  2007-02-15

Review 5.  Circular permutation of polypeptide chains: implications for protein folding and stability.

Authors:  U Heinemann; M Hahn
Journal:  Prog Biophys Mol Biol       Date:  1995       Impact factor: 3.667

6.  MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  Proteins       Date:  2018-03-12

7.  SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.

Authors:  Eshel Faraggi; Tuo Zhang; Yuedong Yang; Lukasz Kurgan; Yaoqi Zhou
Journal:  J Comput Chem       Date:  2011-11-02       Impact factor: 3.376

8.  A simple and fast secondary structure prediction method using hidden neural networks.

Authors:  Kuang Lin; Victor A Simossis; Willam R Taylor; Jaap Heringa
Journal:  Bioinformatics       Date:  2004-09-17       Impact factor: 6.937

9.  A simple strategy to enhance the speed of protein secondary structure prediction without sacrificing accuracy.

Authors:  Sheng-Hung Juan; Teng-Ruei Chen; Wei-Cheng Lo
Journal:  PLoS One       Date:  2020-06-30       Impact factor: 3.240

10.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

View more
  5 in total

1.  Role of lupeol in chemosensitizing therapy-resistant prostate cancer cells by targeting MYC, β-catenin and c-FLIP: in silico and in vitro studies.

Authors:  Homa Fatma; Santosh Kumar Maurya; Akhilesh Kumar Maurya; Nidhi Mishra; Hifzur R Siddique
Journal:  In Silico Pharmacol       Date:  2022-09-04

2.  Discovering the Ultimate Limits of Protein Secondary Structure Prediction.

Authors:  Chia-Tzu Ho; Yu-Wei Huang; Teng-Ruei Chen; Chia-Hua Lo; Wei-Cheng Lo
Journal:  Biomolecules       Date:  2021-11-03

3.  Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced G-Gap Dipeptide Composition.

Authors:  Quang H Nguyen; Hoang V Tran; Binh P Nguyen; Trang T T Do
Journal:  ACS Omega       Date:  2022-08-30

4.  Energy Profile Bayes and Thompson Optimized Convolutional Neural Network protein structure prediction.

Authors:  Varanavasi Nallasamy; Malarvizhi Seshiah
Journal:  Neural Comput Appl       Date:  2022-10-07       Impact factor: 5.102

5.  Identifying SNARE Proteins Using an Alignment-Free Method Based on Multiscan Convolutional Neural Network and PSSM Profiles.

Authors:  Quang-Hien Kha; Quang-Thai Ho; Nguyen Quoc Khanh Le
Journal:  J Chem Inf Model       Date:  2022-09-27       Impact factor: 6.162

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.