Literature DB >> 33505515

iBLP: An XGBoost-Based Predictor for Identifying Bioluminescent Proteins.

Dan Zhang1, Hua-Dong Chen2, Hasan Zulfiqar1, Shi-Shi Yuan1, Qin-Lai Huang1, Zhao-Yue Zhang1, Ke-Jun Deng1.   

Abstract

Bioluminescent proteins (BLPs) are a class of proteins that widely distributed in many living organisms with various mechanisms of light emission including bioluminescence and chemiluminescence from luminous organisms. Bioluminescence has been commonly used in various analytical research methods of cellular processes, such as gene expression analysis, drug discovery, cellular imaging, and toxicity determination. However, the identification of bioluminescent proteins is challenging as they share poor sequence similarities among them. In this paper, we briefly reviewed the development of the computational identification of BLPs and subsequently proposed a novel predicting framework for identifying BLPs based on eXtreme gradient boosting algorithm (XGBoost) and using sequence-derived features. To train the models, we collected BLP data from bacteria, eukaryote, and archaea. Then, for getting more effective prediction models, we examined the performances of different feature extraction methods and their combinations as well as classification algorithms. Finally, based on the optimal model, a novel predictor named iBLP was constructed to identify BLPs. The robustness of iBLP has been proved by experiments on training and independent datasets. Comparison with other published method further demonstrated that the proposed method is powerful and could provide good performance for BLP identification. The webserver and software package for BLP identification are freely available at http://lin-group.cn/server/iBLP.
Copyright © 2021 Dan Zhang et al.

Entities:  

Year:  2021        PMID: 33505515      PMCID: PMC7808816          DOI: 10.1155/2021/6664362

Source DB:  PubMed          Journal:  Comput Math Methods Med        ISSN: 1748-670X            Impact factor:   2.238


  64 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

2.  iATP: A Sequence Based Method for Identifying Anti-tubercular Peptides.

Authors:  Wei Chen; Pengmian Feng; Fulei Nie
Journal:  Med Chem       Date:  2020       Impact factor: 2.745

Review 3.  Perspectives on Bioluminescence Mechanisms.

Authors:  John Lee
Journal:  Photochem Photobiol       Date:  2016-12-03       Impact factor: 3.421

4.  Pro54DB: a database for experimentally verified sigma-54 promoters.

Authors:  Zhi-Yong Liang; Hong-Yan Lai; Huan Yang; Chang-Jian Zhang; Hui Yang; Huan-Huan Wei; Xin-Xin Chen; Ya-Wei Zhao; Zhen-Dong Su; Wen-Chao Li; En-Ze Deng; Hua Tang; Wei Chen; Hao Lin
Journal:  Bioinformatics       Date:  2017-02-01       Impact factor: 6.937

5.  Zika and Flaviviruses Phylogeny Based on the Alignment-Free Natural Vector Method.

Authors:  Yongkun Li; Lily He; Rong Lucy He; Stephen S-T Yau
Journal:  DNA Cell Biol       Date:  2016-12-15       Impact factor: 3.311

6.  iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features.

Authors:  Dan Zhang; Zhao-Chun Xu; Wei Su; Yu-He Yang; Hao Lv; Hui Yang; Hao Lin
Journal:  Bioinformatics       Date:  2021-04-19       Impact factor: 6.937

Review 7.  Predicting membrane protein types using various decision tree classifiers based on various modes of general PseAAC for imbalanced datasets.

Authors:  E Siva Sankari; D Manimegalai
Journal:  J Theor Biol       Date:  2017-09-20       Impact factor: 2.691

Review 8.  Recent Development of Computational Predicting Bioluminescent Proteins.

Authors:  Dan Zhang; Zheng-Xing Guan; Zi-Mei Zhang; Shi-Hao Li; Fu-Ying Dao; Hua Tang; Hao Lin
Journal:  Curr Pharm Des       Date:  2019       Impact factor: 3.116

9.  BLProt: prediction of bioluminescent proteins based on support vector machine and relieff feature selection.

Authors:  Krishna Kumar Kandaswamy; Ganesan Pugalenthi; Mehrnaz Khodam Hazrati; Kai-Uwe Kalies; Thomas Martinetz
Journal:  BMC Bioinformatics       Date:  2011-08-17       Impact factor: 3.169

10.  Classifying Included and Excluded Exons in Exon Skipping Event Using Histone Modifications.

Authors:  Wei Chen; Pengmian Feng; Hui Ding; Hao Lin
Journal:  Front Genet       Date:  2018-10-01       Impact factor: 4.599

View more
  23 in total

1.  ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.

Authors:  Shihu Jiao; Zheng Chen; Lichao Zhang; Xun Zhou; Lei Shi
Journal:  Amino Acids       Date:  2022-03-14       Impact factor: 3.520

2.  Prediction and Screening Model for Products Based on Fusion Regression and XGBoost Classification.

Authors:  Jiaju Wu; Linggang Kong; Ming Yi; Qiuxian Chen; Zheng Cheng; Hongfu Zuo; Yonghui Yang
Journal:  Comput Intell Neurosci       Date:  2022-07-31

Review 3.  Application of Multilayer Network Models in Bioinformatics.

Authors:  Yuanyuan Lv; Shan Huang; Tianjiao Zhang; Bo Gao
Journal:  Front Genet       Date:  2021-03-31       Impact factor: 4.599

4.  A Novel Framework Based on Deep Learning and ANOVA Feature Selection Method for Diagnosis of COVID-19 Cases from Chest X-Ray Images.

Authors:  Hamid Nasiri; Seyed Ali Alavi
Journal:  Comput Intell Neurosci       Date:  2022-01-07

5.  SNAREs-SAP: SNARE Proteins Identification With PSSM Profiles.

Authors:  Zixiao Zhang; Yue Gong; Bo Gao; Hongfei Li; Wentao Gao; Yuming Zhao; Benzhi Dong
Journal:  Front Genet       Date:  2021-12-20       Impact factor: 4.599

6.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

7.  i4mC-EL: Identifying DNA N4-Methylcytosine Sites in the Mouse Genome Using Ensemble Learning.

Authors:  Yanjuan Li; Zhengnan Zhao; Zhixia Teng
Journal:  Biomed Res Int       Date:  2021-05-29       Impact factor: 3.411

8.  Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy.

Authors:  Jian Zhou; Suling Bo; Hao Wang; Lei Zheng; Pengfei Liang; Yongchun Zuo
Journal:  Front Cell Dev Biol       Date:  2021-07-16

9.  Identification of Helicobacter pylori Membrane Proteins Using Sequence-Based Features.

Authors:  Mujiexin Liu; Hui Chen; Dong Gao; Cai-Yi Ma; Zhao-Yue Zhang
Journal:  Comput Math Methods Med       Date:  2022-01-12       Impact factor: 2.238

10.  VTP-Identifier: Vesicular Transport Proteins Identification Based on PSSM Profiles and XGBoost.

Authors:  Yue Gong; Benzhi Dong; Zixiao Zhang; Yixiao Zhai; Bo Gao; Tianjiao Zhang; Jingyu Zhang
Journal:  Front Genet       Date:  2022-01-03       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.