Literature DB >> 17045354

LogitBoost classifier for discriminating thermophilic and mesophilic proteins.

Guangya Zhang1, Baishan Fang.   

Abstract

A novel classifier, the so-called LogitBoost classifier, was introduced to discriminate the thermophilic and mesophilic proteins according to their primary structures. When the 20-amino acid composition was chosen as the feature vector, the overall accuracy of the self-consistency check and a five-fold cross-validation procedure was 97.0% and 86.6%, respectively. To test if the method was also applicable to a wide range of biological targets, an independent testing dataset was also used. The method based on LogitBoost algorithm has achieved an overall classification accuracy of 88.9%. According to the three different validation check approaches, it was demonstrated that LogitBoost outperformed AdaBoost and performed comparably with RBF neural network and support vector machine. The influence of protein size on discrimination was addressed.

Mesh:

Substances:

Year:  2006        PMID: 17045354     DOI: 10.1016/j.jbiotec.2006.07.020

Source DB:  PubMed          Journal:  J Biotechnol        ISSN: 0168-1656            Impact factor:   3.307


  9 in total

Review 1.  Empirical comparison and analysis of machine learning-based predictors for predicting and analyzing of thermophilic proteins.

Authors:  Phasit Charoenkwan; Nalini Schaduangrat; Md Mehedi Hasan; Mohammad Ali Moni; Pietro Lió; Watshara Shoombuatong
Journal:  EXCLI J       Date:  2022-03-02       Impact factor: 4.022

2.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

3.  Establishment and evaluation of prediction model for multiple disease classification based on gut microbial data.

Authors:  Sohyun Bang; DongAhn Yoo; Soo-Jin Kim; Soyun Jhang; Seoae Cho; Heebal Kim
Journal:  Sci Rep       Date:  2019-07-15       Impact factor: 4.379

4.  A novel sequence-based predictor for identifying and characterizing thermophilic proteins using estimated propensity scores of dipeptides.

Authors:  Phasit Charoenkwan; Warot Chotpatiwetchkul; Vannajan Sanghiran Lee; Chanin Nantasenamat; Watshara Shoombuatong
Journal:  Sci Rep       Date:  2021-12-10       Impact factor: 4.379

5.  Engineering and screening of novel β-1,3-xylanases with desired hydrolysate type by optimized ancestor sequence reconstruction and data mining.

Authors:  Bo Zeng; ShuYan Zhao; Rui Zhou; YanHong Zhou; WenHui Jin; ZhiWei Yi; GuangYa Zhang
Journal:  Comput Struct Biotechnol J       Date:  2022-06-27       Impact factor: 6.155

6.  Classification of premalignant pancreatic cancer mass-spectrometry data using decision tree ensembles.

Authors:  Guangtao Ge; G William Wong
Journal:  BMC Bioinformatics       Date:  2008-06-11       Impact factor: 3.169

7.  Changing relative risk of clinical factors for hospital-acquired acute kidney injury across age groups: a retrospective cohort study.

Authors:  Lijuan Wu; Yong Hu; Xiangzhou Zhang; Weiqi Chen; Alan S L Yu; John A Kellum; Lemuel R Waitman; Mei Liu
Journal:  BMC Nephrol       Date:  2020-08-02       Impact factor: 2.388

8.  Discrimination of Thermophilic Proteins and Non-thermophilic Proteins Using Feature Dimension Reduction.

Authors:  Zifan Guo; Pingping Wang; Zhendong Liu; Yuming Zhao
Journal:  Front Bioeng Biotechnol       Date:  2020-10-22

9.  iThermo: A Sequence-Based Model for Identifying Thermophilic Proteins Using a Multi-Feature Fusion Strategy.

Authors:  Zahoor Ahmed; Hasan Zulfiqar; Abdullah Aman Khan; Ijaz Gul; Fu-Ying Dao; Zhao-Yue Zhang; Xiao-Long Yu; Lixia Tang
Journal:  Front Microbiol       Date:  2022-02-22       Impact factor: 5.640

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.