Literature DB >> 21445589

Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition.

Tariq Habib Afridi1, Asifullah Khan, Yeon Soo Lee.   

Abstract

Mitochondria are all-important organelles of eukaryotic cells since they are involved in processes associated with cellular mortality and human diseases. Therefore, trustworthy techniques are highly required for the identification of new mitochondrial proteins. We propose Mito-GSAAC system for prediction of mitochondrial proteins. The aim of this work is to investigate an effective feature extraction strategy and to develop an ensemble approach that can better exploit the advantages of this feature extraction strategy for mitochondria classification. We investigate four kinds of protein representations for prediction of mitochondrial proteins: amino acid composition, dipeptide composition, pseudo amino acid composition, and split amino acid composition (SAAC). Individual classifiers such as support vector machine (SVM), k-nearest neighbor, multilayer perceptron, random forest, AdaBoost, and bagging are first trained. An ensemble classifier is then built using genetic programming (GP) for evolving a complex but effective decision space from the individual decision spaces of the trained classifiers. The highest prediction performance for Jackknife test is 92.62% using GP-based ensemble classifier on SAAC features, which is the highest accuracy, reported so far on the Mitochondria dataset being used. While on the Malaria Parasite Mitochondria dataset, the highest accuracy is obtained by SVM using SAAC and it is further enhanced to 93.21% using GP-based ensemble. It is observed that SAAC has better discrimination power for mitochondria prediction over the rest of the feature extraction strategies. Thus, the improved prediction performance is largely due to the better capability of SAAC for discriminating between mitochondria and non-mitochondria proteins at the N and C terminus and the effective combination capability of GP. Mito-GSAAC can be accessed at http://111.68.99.218/Mito-GSAAC . It is expected that the novel approach and the accompanied predictor will have a major impact to Molecular Cell Biology, Proteomics, Bioinformatics, System Biology, and Drug Development.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21445589     DOI: 10.1007/s00726-011-0888-0

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  10 in total

Review 1.  A Treatise to Computational Approaches Towards Prediction of Membrane Protein and Its Subtypes.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  J Membr Biol       Date:  2016-11-19       Impact factor: 1.843

2.  Robust segmentation and intelligent decision system for cerebrovascular disease.

Authors:  Asmatullah Chaudhry; Mehdi Hassan; Asifullah Khan
Journal:  Med Biol Eng Comput       Date:  2016-04-07       Impact factor: 2.602

3.  Predicting the binding patterns of hub proteins: a study using yeast protein interaction networks.

Authors:  Carson M Andorf; Vasant Honavar; Taner Z Sen
Journal:  PLoS One       Date:  2013-02-19       Impact factor: 3.240

4.  An ensemble method with hybrid features to identify extracellular matrix proteins.

Authors:  Runtao Yang; Chengjin Zhang; Rui Gao; Lina Zhang
Journal:  PLoS One       Date:  2015-02-13       Impact factor: 3.240

5.  Analysis and prediction of single-stranded and double-stranded DNA binding proteins based on protein sequences.

Authors:  Wei Wang; Lin Sun; Shiguang Zhang; Hongjun Zhang; Jinling Shi; Tianhe Xu; Keliang Li
Journal:  BMC Bioinformatics       Date:  2017-06-12       Impact factor: 3.169

6.  A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification.

Authors:  Wafa Alameen Alsanousi; Nosiba Yousif Ahmed; Eman Mohammed Hamid; Murtada K Elbashir; Mohamed Elhafiz M Musa; Jianxin Wang; Noman Khan
Journal:  PLoS One       Date:  2022-10-06       Impact factor: 3.752

7.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

8.  A computational pipeline for the development of multi-marker bio-signature panels and ensemble classifiers.

Authors:  Oliver P Günther; Virginia Chen; Gabriela Cohen Freue; Robert F Balshaw; Scott J Tebbutt; Zsuzsanna Hollander; Mandeep Takhar; W Robert McMaster; Bruce M McManus; Paul A Keown; Raymond T Ng
Journal:  BMC Bioinformatics       Date:  2012-12-08       Impact factor: 3.169

9.  JPPRED: Prediction of Types of J-Proteins from Imbalanced Data Using an Ensemble Learning Method.

Authors:  Lina Zhang; Chengjin Zhang; Rui Gao; Runtao Yang
Journal:  Biomed Res Int       Date:  2015-10-26       Impact factor: 3.411

10.  Prediction of endoplasmic reticulum resident proteins using fragmented amino acid composition and support vector machine.

Authors:  Ravindra Kumar; Bandana Kumari; Manish Kumar
Journal:  PeerJ       Date:  2017-09-04       Impact factor: 2.984

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.