Literature DB >> 25248192

An Improved Protein Structural Classes Prediction Method by Incorporating Both Sequence and Structure Information.

.   

Abstract

Protein structural classes information is beneficial for secondary and tertiary structure prediction, protein folds prediction, and protein function analysis. Thus, predicting protein structural classes is of vital importance. In recent years, several computational methods have been developed for low-sequence-similarity (25%-40%) protein structural classes prediction. However, the reported prediction accuracies are actually not satisfactory. Aiming to further improve the prediction accuracies, we propose three different feature extraction methods and construct a comprehensive feature set that captures both sequence and structure information. By applying a random forest (RF) classifier to the feature set, we further develop a novel method for structural classes prediction. We test the proposed method on three benchmark datasets (25PDB, 640, and 1189) with low sequence similarity, and obtain the overall prediction accuracies of 93.5%, 92.6%, and 93.4%, respectively. Compared with six competing methods, the accuracies we achieved are 3.4%, 6.2%, and 8.7% higher than those achieved by the best-performing methods, showing the superiority of our method. Moreover, due to the limitation of the size of the three benchmark datasets, we further test the proposed method on three updated large-scale datasets with different sequence similarities (40%, 30%, and 25%). The proposed method achieves above 90% accuracies for all the three datasets, consistent with the accuracies on the above three benchmark datasets. Experimental results suggest our method as an effective and promising tool for structural classes prediction. Currently, a webserver that implements the proposed method is available on http://121.192.180.204:8080/RF_PSCP/Index.html.

Year:  2014        PMID: 25248192     DOI: 10.1109/TNB.2014.2352454

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  22 in total

1.  Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier.

Authors:  Xiaotong Guo; Fulin Liu; Ying Ju; Zhen Wang; Chunyu Wang
Journal:  Sci Rep       Date:  2016-06-21       Impact factor: 4.379

2.  Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion.

Authors:  Weizhong Lu; Xiaoyi Chen; Yu Zhang; Hongjie Wu; Yijie Ding; Jiawei Shen; Shixuan Guan; Haiou Li
Journal:  Comput Math Methods Med       Date:  2022-06-02       Impact factor: 2.809

3.  A deformation energy-based model for predicting nucleosome dyads and occupancy.

Authors:  Guoqing Liu; Yongqiang Xing; Hongyu Zhao; Jianying Wang; Yu Shang; Lu Cai
Journal:  Sci Rep       Date:  2016-04-07       Impact factor: 4.379

4.  Identification of Bacterial Cell Wall Lyases via Pseudo Amino Acid Composition.

Authors:  Xin-Xin Chen; Hua Tang; Wen-Chao Li; Hao Wu; Wei Chen; Hui Ding; Hao Lin
Journal:  Biomed Res Int       Date:  2016-06-29       Impact factor: 3.411

5.  DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.

Authors:  Bin Liu; Shanyi Wang; Xiaolong Wang
Journal:  Sci Rep       Date:  2015-10-20       Impact factor: 4.379

6.  DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.

Authors:  Xiaofeng Wang; Renxiang Yan; Jiangning Song
Journal:  Sci Rep       Date:  2016-03-22       Impact factor: 4.379

7.  Identification of Multi-Functional Enzyme with Multi-Label Classifier.

Authors:  Yuxin Che; Ying Ju; Ping Xuan; Ren Long; Fei Xing
Journal:  PLoS One       Date:  2016-04-14       Impact factor: 3.240

8.  Recombination spot identification Based on gapped k-mers.

Authors:  Rong Wang; Yong Xu; Bin Liu
Journal:  Sci Rep       Date:  2016-03-31       Impact factor: 4.379

9.  Identification of apolipoprotein using feature selection technique.

Authors:  Hua Tang; Ping Zou; Chunmei Zhang; Rong Chen; Wei Chen; Hao Lin
Journal:  Sci Rep       Date:  2016-07-22       Impact factor: 4.379

10.  SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity.

Authors:  Ying Hong Li; Jing Yu Xu; Lin Tao; Xiao Feng Li; Shuang Li; Xian Zeng; Shang Ying Chen; Peng Zhang; Chu Qin; Cheng Zhang; Zhe Chen; Feng Zhu; Yu Zong Chen
Journal:  PLoS One       Date:  2016-08-15       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.