Literature DB >> 22185508

Discriminating outer membrane proteins with Fuzzy K-nearest Neighbor algorithms based on the general form of Chou's PseAAC.

Maqsood Hayat1, Asifullah Khan.   

Abstract

Outer membrane proteins (OMPs) play important roles in cell biology. In addition, OMPs are targeted by multiple drugs. The identification of OMPs from genomic sequences and successful prediction of their secondary and tertiary structures is a challenging task due to short membrane-spanning regions with high variation in properties. Therefore, an effective and accurate silico method for discrimination of OMPs from their primary sequences is needed. In this paper, we have analyzed the performance of various machine learning mechanisms for discriminating OMPs such as: Genetic Programming, K-nearest Neighbor, and Fuzzy K-nearest Neighbor (Fuzzy K-NN) in conjunction with discrete methods such as: Amino acid composition, Amphiphilic Pseudo amino acid composition, Split amino acid composition (SAAC), and hybrid versions of these methods. The performance of the classifiers is evaluated by two datasets using 5-fold crossvalidation. After the simulation, we have observed that Fuzzy K-NN using SAAC based-features makes it quite effective in discriminating OMPs. Fuzzy K-NN achieves the highest success rates of 99.00% accuracy for discriminating OMPs from non-OMPs and 98.77% and 98.28% accuracies from α-helix membrane and globular proteins, respectively on dataset1. While on dataset2, Fuzzy K-NN achieves 99.55%, 99.90%, and 99.81% accuracies for discriminating OMPs from non- OMPs, α-helix membrane, and globular proteins, respectively. It is observed that the classification performance of our proposed method is satisfactory and is better than the existing methods. Thus, it might be an effective tool for high throughput innovation of OMPs.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22185508     DOI: 10.2174/092986612799789387

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  30 in total

1.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

2.  Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  Mol Biol Rep       Date:  2018-09-20       Impact factor: 2.316

3.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

Review 4.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

5.  A multilabel model based on Chou's pseudo-amino acid composition for identifying membrane proteins with both single and multiple functional types.

Authors:  Chao Huang; Jing-Qi Yuan
Journal:  J Membr Biol       Date:  2013-04-02       Impact factor: 1.843

6.  Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

Authors:  Samad Jahandideh; Vinodh Srinivasasainagendra; Degui Zhi
Journal:  J Theor Biol       Date:  2012-08-03       Impact factor: 2.691

7.  Detecting epileptic seizure with different feature extracting strategies using robust machine learning classification techniques by applying advance parameter optimization approach.

Authors:  Lal Hussain
Journal:  Cogn Neurodyn       Date:  2018-01-25       Impact factor: 5.082

8.  Analysis of protein determinants of host-specific infection properties of polyomaviruses using machine learning.

Authors:  Myeongji Cho; Hayeon Kim; Hyeon S Son
Journal:  Genes Genomics       Date:  2021-03-01       Impact factor: 1.839

9.  Machine learning classification of texture features of MRI breast tumor and peri-tumor of combined pre- and early treatment predicts pathologic complete response.

Authors:  Lal Hussain; Pauline Huang; Tony Nguyen; Kashif J Lone; Amjad Ali; Muhammad Salman Khan; Haifang Li; Doug Young Suh; Tim Q Duong
Journal:  Biomed Eng Online       Date:  2021-06-28       Impact factor: 2.819

10.  Naïve Bayes classifier with feature selection to identify phage virion proteins.

Authors:  Peng-Mian Feng; Hui Ding; Wei Chen; Hao Lin
Journal:  Comput Math Methods Med       Date:  2013-05-15       Impact factor: 2.238

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.