Literature DB >> 19278932

Prediction of protein folds: extraction of new features, dimensionality reduction, and fusion of heterogeneous classifiers.

Pradip Ghanty1, Nikhil R Pal.   

Abstract

Here, we consider a two-level (four classes in level 1 and 27 folds in level 2) protein fold determination problem. We propose several new features and use some existing features including frequencies of adjacent residues, frequencies of residues separated by one residue, and triplets (trio) of amino acid compositions (AACs). The dimensionality of the trio AAC features is drastically reduced using a neural network based novel online feature selection scheme. We also propose new sets of features called trio potential computed using the hydrophobicity values considering only the selected trio AACs. We demonstrate that the proposed features including the selected trio AACs and trio potential have good discriminating power for protein fold determination. As machine learning tools, we use multilayer perceptron network, radial basis function network, and support vector machine. To improve the recognition accuracies further, we use fusion of different classifiers using the same set of features as well as different sets of features. The effectiveness of our schemes is demonstrated with a benchmark structural classification of proteins (SCOP) dataset. Our system achieves 84.9% test accuracy for the SCOP structural class (four classes) determination and 68.6% test accuracy for the fold recognition with 27 folds. In order to demonstrate the consistency of feature sets and fusion schemes, we also perform the fivefold cross-validation experiments.

Mesh:

Substances:

Year:  2009        PMID: 19278932     DOI: 10.1109/TNB.2009.2016488

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  10 in total

1.  A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.

Authors:  Mohammad Saleh Refahi; A Mir; Jalal A Nasiri
Journal:  Sci Rep       Date:  2020-09-01       Impact factor: 4.379

2.  Recognition of 27-class protein folds by adding the interaction of segments and motif information.

Authors:  Zhenxing Feng; Xiuzhen Hu
Journal:  Biomed Res Int       Date:  2014-07-21       Impact factor: 3.411

3.  Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information.

Authors:  Kuldip K Paliwal; Alok Sharma; James Lyons; Abdollah Dehzangi
Journal:  BMC Bioinformatics       Date:  2014-12-08       Impact factor: 3.169

4.  The recognition of multi-class protein folds by adding average chemical shifts of secondary structure elements.

Authors:  Zhenxing Feng; Xiuzhen Hu; Zhuo Jiang; Hangyu Song; Muhammad Aqeel Ashraf
Journal:  Saudi J Biol Sci       Date:  2015-12-11       Impact factor: 4.219

Review 5.  Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition.

Authors:  Leyi Wei; Quan Zou
Journal:  Int J Mol Sci       Date:  2016-12-16       Impact factor: 5.923

6.  Identification of self-interacting proteins by exploring evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix.

Authors:  Ji-Yong An; Zhu-Hong You; Xing Chen; De-Shuang Huang; Zheng-Wei Li; Gang Liu; Yin Wang
Journal:  Oncotarget       Date:  2016-12-13

7.  Recognition of Protein Network for Bioinformatics Knowledge Analysis Using Support Vector Machine.

Authors:  Arshpreet Kaur; Abhijit Chitre; Kirti Wanjale; Pankaj Kumar; Shahajan Miah; Arnold C Alguno
Journal:  Biomed Res Int       Date:  2022-04-23       Impact factor: 3.246

8.  A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition.

Authors:  Alok Sharma; Kuldip K Paliwal; Abdollah Dehzangi; James Lyons; Seiya Imoto; Satoru Miyano
Journal:  BMC Bioinformatics       Date:  2013-07-24       Impact factor: 3.169

9.  Evaluation of sequence features from intrinsically disordered regions for the estimation of protein function.

Authors:  Alok Sharma; Abdollah Dehzangi; James Lyons; Seiya Imoto; Satoru Miyano; Kenta Nakai; Ashwini Patil
Journal:  PLoS One       Date:  2014-02-24       Impact factor: 3.240

10.  Prediction of Drug-Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures.

Authors:  Fan-Rong Meng; Zhu-Hong You; Xing Chen; Yong Zhou; Ji-Yong An
Journal:  Molecules       Date:  2017-07-05       Impact factor: 4.411

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.