
Predicting membrane protein types using various decision tree classifiers based on various modes of general PseAAC for imbalanced datasets.

E Siva Sankari, D Manimegalai.

Abstract

Predicting membrane protein types is an important and challenging research area in bioinformatics and proteomics. Membrane protein types have traditionally been classified with biophysical methods. Because databases now hold a large number of uncharacterized protein sequences, these traditional methods are time-consuming, expensive, and susceptible to errors. Hence, it is highly desirable to develop a robust, reliable, and efficient method to predict membrane protein types. Decision tree classifiers often handle imbalanced and large datasets well. Since the datasets used here are imbalanced, the performance of various decision tree classifiers is analysed: Decision Tree (DT), Classification And Regression Tree (CART), C4.5, Random tree, and REP (Reduced Error Pruning) tree, as well as ensemble methods such as AdaBoost, RUSBoost (Random Under-Sampling Boost), Rotation forest, and Random forest. Among these, Random forest performs well in less time, with an accuracy of 96.35%. A further finding is that the RUSBoost decision tree classifier is able to classify one or two samples in the class with very few samples, whereas other classifiers such as DT, AdaBoost, Rotation forest, and Random forest are not sensitive to the classes with fewer samples. The performance of the decision tree classifiers is also compared with that of SVM (Support Vector Machine) and Naive Bayes classifiers.
Copyright © 2017 Elsevier Ltd. All rights reserved.
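
The abstract's point about classes with very few samples can be illustrated with a minimal sketch. It is not the authors' pipeline: the class names, class counts, and the always-predict-the-majority classifier below are hypothetical, chosen only to show why a high overall accuracy can hide a minority class that is never recognised, and what a single random under-sampling step does to class counts. RUSBoost itself repeats such a sampling step inside each boosting round, which this sketch does not attempt.

```python
import random
from collections import Counter, defaultdict


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """Recall for each class: correct predictions divided by class size."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: correct[label] / total[label] for label in total}


def random_under_sample(samples, labels, seed=0):
    """Randomly keep as many samples per class as the smallest class has."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    target = min(len(members) for members in by_class.values())
    out_x, out_y = [], []
    for label in sorted(by_class):
        out_x.extend(rng.sample(by_class[label], target))
        out_y.extend([label] * target)
    return out_x, out_y


# Hypothetical imbalanced data: 95 majority samples, 5 minority samples.
y_true = ["majority_type"] * 95 + ["minority_type"] * 5
samples = list(range(len(y_true)))  # stand-ins for feature vectors

# Hypothetical classifier that ignores the minority class entirely.
y_pred = ["majority_type"] * len(y_true)

overall = accuracy(y_true, y_pred)
recalls = per_class_recall(y_true, y_pred)

# One under-sampling pass equalises the class counts.
balanced_x, balanced_y = random_under_sample(samples, y_true)
balanced_counts = Counter(balanced_y)

print(f"overall accuracy: {overall:.2f}")
print(f"per-class recall: {recalls}")
print(f"class counts after under-sampling: {dict(balanced_counts)}")
```

In this toy case the overall accuracy is high although the minority class has zero recall, which is why per-class results matter when comparing classifiers on imbalanced data.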

Keywords:  Decision tree classifiers; Membrane protein types; Prediction

Year:  2017        PMID: 28941868     DOI: 10.1016/j.jtbi.2017.09.018

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  5 in total

1.  Clinical analysis and artificial intelligence survival prediction of serous ovarian cancer based on preoperative circulating leukocytes.

Authors:  Ying Feng; Zhixiang Wang; Ran Cui; Meizhu Xiao; Huiqiao Gao; Huimin Bai; Bert Delvoux; Zhen Zhang; Andre Dekker; Andrea Romano; Shuzhen Wang; Alberto Traverso; Chongdong Liu; Zhenyu Zhang
Journal:  J Ovarian Res       Date:  2022-05-24       Impact factor: 5.506

2.  CNNLSTMac4CPred: A Hybrid Model for N4-Acetylcytidine Prediction.

Authors:  Guiyang Zhang; Wei Luo; Jianyi Lyu; Zu-Guo Yu; Guohua Huang
Journal:  Interdiscip Sci       Date:  2022-02-01       Impact factor: 2.233

3.  Large-Scale Assessment of Bioinformatics Tools for Lysine Succinylation Sites. (Review)

Authors:  Md Mehedi Hasan; Mst Shamima Khatun; Hiroyuki Kurata
Journal:  Cells       Date:  2019-01-28       Impact factor: 6.600

4.  Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning.

Authors:  Lei Guo; Shunfang Wang; Mingyuan Li; Zicheng Cao
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

5.  iBLP: An XGBoost-Based Predictor for Identifying Bioluminescent Proteins.

Authors:  Dan Zhang; Hua-Dong Chen; Hasan Zulfiqar; Shi-Shi Yuan; Qin-Lai Huang; Zhao-Yue Zhang; Ke-Jun Deng
Journal:  Comput Math Methods Med       Date:  2021-01-07       Impact factor: 2.238

