Literature DB >> 34396388

NeuroPpred-Fuse: an interpretable stacking model for prediction of neuropeptides by fusing sequence information and feature selection methods.

Mingming Jiang1, Bowen Zhao1, Shenggan Luo1, Qiankun Wang1, Yanyi Chu1, Tianhang Chen1, Xueying Mao1, Yatong Liu1, Yanjing Wang1, Xue Jiang1, Dong-Qing Wei1, Yi Xiong1.   

Abstract

Neuropeptides acting as signaling molecules in the nervous system of various animals play crucial roles in a wide range of physiological functions and hormone regulation behaviors. Neuropeptides offer many opportunities for the discovery of new drugs and targets for the treatment of neurological diseases. In recent years, there have been several data-driven computational predictors of various types of bioactive peptides, but the relevant work about neuropeptides is little at present. In this work, we developed an interpretable stacking model, named NeuroPpred-Fuse, for the prediction of neuropeptides through fusing a variety of sequence-derived features and feature selection methods. Specifically, we used six types of sequence-derived features to encode the peptide sequences and then combined them. In the first layer, we ensembled three base classifiers and four feature selection algorithms, which select non-redundant important features complementarily. In the second layer, the output of the first layer was merged and fed into logistic regression (LR) classifier to train the model. Moreover, we analyzed the selected features and explained the feasibility of the selected features. Experimental results show that our model achieved 90.6% accuracy and 95.8% AUC on the independent test set, outperforming the state-of-the-art models. In addition, we exhibited the distribution of selected features by these tree models and compared the results on the training set to that on the test set. These results fully showed that our model has a certain generalization ability. Therefore, we expect that our model would provide important advances in the discovery of neuropeptides as new drugs for the treatment of neurological diseases.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  feature analysis; feature selection; machine learning; neuropeptide prediction; stacking

Mesh:

Substances:

Year:  2021        PMID: 34396388     DOI: 10.1093/bib/bbab310

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  Ensemble Learning-Based Feature Selection for Phage Protein Prediction.

Authors:  Songbo Liu; Chengmin Cui; Huipeng Chen; Tong Liu
Journal:  Front Microbiol       Date:  2022-07-15       Impact factor: 6.064

2.  Prediction of anti-inflammatory peptides by a sequence-based stacking ensemble model named AIPStack.

Authors:  Hua Deng; Chaofeng Lou; Zengrui Wu; Weihua Li; Guixia Liu; Yun Tang
Journal:  iScience       Date:  2022-08-17

3.  Exome sequencing identifies novel somatic variants in African American esophageal squamous cell carcinoma.

Authors:  Hayriye Verda Erkizan; Shrey Sukhadia; Thanemozhi G Natarajan; Gustavo Marino; Vicente Notario; Jack H Lichy; Robert G Wadleigh
Journal:  Sci Rep       Date:  2021-07-20       Impact factor: 4.996

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.