Literature DB >> 22316305

Predicting protein structural class by incorporating patterns of over-represented k-mers into the general form of Chou's PseAAC.

Yu-Fang Qin1, Chun-Hua Wang, Xiao-Qing Yu, Jie Zhu, Tai-Gang Liu, Xiao-Qi Zheng.   

Abstract

Computational prediction of protein structural class based on sequence data remains a challenging problem in current protein science. In this paper, a new feature extraction approach based on relative polypeptide composition is introduced. This approach could take into account the background distribution of a given k-mer under a Markov model of order k-2, and avoid the curse of dimensionality with the increase of k by using a T-statistic feature selection strategy. The selected features are then fed to a support vector machine to perform the prediction. To verify the performance of our method, jackknife cross-validation tests are performed on four widely used benchmark datasets. Comparison of our results with existing methods shows that our method provides satisfactory performance for structural class prediction.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22316305     DOI: 10.2174/092986612799789350

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  7 in total

Review 1.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

2.  Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

Authors:  Samad Jahandideh; Vinodh Srinivasasainagendra; Degui Zhi
Journal:  J Theor Biol       Date:  2012-08-03       Impact factor: 2.691

3.  iNuc-PhysChem: a sequence-based predictor for identifying nucleosomes via physicochemical properties.

Authors:  Wei Chen; Hao Lin; Peng-Mian Feng; Chen Ding; Yong-Chun Zuo; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-10-29       Impact factor: 3.240

4.  PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

Authors:  Liqi Li; Xiang Cui; Sanjiu Yu; Yuan Zhang; Zhong Luo; Hua Yang; Yue Zhou; Xiaoqi Zheng
Journal:  PLoS One       Date:  2014-03-27       Impact factor: 3.240

5.  Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach.

Authors:  Taigang Liu; Yufang Qin; Yongjie Wang; Chunhua Wang
Journal:  Int J Mol Sci       Date:  2015-12-24       Impact factor: 5.923

6.  Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Authors:  Guillermin Agüero-Chapin; Deborah Galpert; Reinaldo Molina-Ruiz; Evys Ancede-Gallardo; Gisselle Pérez-Machado; Gustavo A de la Riva; Agostinho Antunes
Journal:  Biomolecules       Date:  2019-12-23

7.  PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.

Authors:  Pufeng Du; Shuwang Gu; Yasen Jiao
Journal:  Int J Mol Sci       Date:  2014-02-26       Impact factor: 5.923

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.