Literature DB >> 29893128

Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.

Balachandran Manavalan1, Sathiyamoorthy Subramaniyam2, Tae Hwan Shin1,3, Myeong Ok Kim4, Gwang Lee1,3.   

Abstract

Cell-penetrating peptides (CPPs) can enter cells as a variety of biologically active conjugates and have various biomedical applications. To offset the cost and effort of designing novel CPPs in laboratories, computational methods are necessitated to identify candidate CPPs before in vitro experimental studies. We developed a two-layer prediction framework called machine-learning-based prediction of cell-penetrating peptides (MLCPPs). The first-layer predicts whether a given peptide is a CPP or non-CPP, whereas the second-layer predicts the uptake efficiency of the predicted CPPs. To construct a two-layer prediction framework, we employed four different machine-learning methods and five different compositions including amino acid composition (AAC), dipeptide composition, amino acid index, composition-transition-distribution, and physicochemical properties (PCPs). In the first layer, hybrid features (combination of AAC and PCP) and extremely randomized tree outperformed state-of-the-art predictors in CPP prediction with an accuracy of 0.896 when tested on independent data sets, whereas in the second layer, hybrid features obtained through feature selection protocol and random forest produced an accuracy of 0.725 that is better than state-of-the-art predictors. We anticipate that our method MLCPP will become a valuable tool for predicting CPPs and their uptake efficiency and might facilitate hypothesis-driven experimental design. The MLCPP server interface along with the benchmarking and independent data sets are freely accessible at www.thegleelab.org/MLCPP .

Entities:  

Keywords:  cell-penetrating peptides; extremely randomized tree; feature selection; machine learning; random forest; uptake efficiency

Mesh:

Substances:

Year:  2018        PMID: 29893128     DOI: 10.1021/acs.jproteome.8b00148

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  40 in total

1.  Development of an RHEB-Targeting Peptide To Inhibit mTORC1 Kinase Activity.

Authors:  Raef Shams; Yoshihiro Ito; Hideyuki Miyatake
Journal:  ACS Omega       Date:  2022-06-24

2.  Extremely-randomized-tree-based Prediction of N6-Methyladenosine Sites in Saccharomyces cerevisiae.

Authors:  Rajiv G Govindaraj; Sathiyamoorthy Subramaniyam; Balachandran Manavalan
Journal:  Curr Genomics       Date:  2020-01       Impact factor: 2.236

3.  Deep learning to design nuclear-targeting abiotic miniproteins.

Authors:  Carly K Schissel; Somesh Mohapatra; Justin M Wolfe; Colin M Fadzen; Kamela Bellovoda; Chia-Ling Wu; Jenna A Wood; Annika B Malmberg; Andrei Loas; Rafael Gómez-Bombarelli; Bradley L Pentelute
Journal:  Nat Chem       Date:  2021-08-09       Impact factor: 24.427

4.  Beyond Tripeptides Two-Step Active Machine Learning for Very Large Data sets.

Authors:  Alexander van Teijlingen; Tell Tuttle
Journal:  J Chem Theory Comput       Date:  2021-04-27       Impact factor: 6.006

Review 5.  MASS: predict the global qualities of individual protein models using random forests and novel statistical potentials.

Authors:  Tong Liu; Zheng Wang
Journal:  BMC Bioinformatics       Date:  2020-07-06       Impact factor: 3.169

6.  A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites.

Authors:  Haixia Long; Bo Liao; Xingyu Xu; Jialiang Yang
Journal:  Int J Mol Sci       Date:  2018-09-18       Impact factor: 5.923

7.  iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2018-10-24       Impact factor: 7.271

8.  FEGS: a novel feature extraction model for protein sequences and its applications.

Authors:  Zengchao Mu; Ting Yu; Xiaoping Liu; Hongyu Zheng; Leyi Wei; Juntao Liu
Journal:  BMC Bioinformatics       Date:  2021-06-03       Impact factor: 3.169

9.  iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction.

Authors:  Balachandran Manavalan; Rajiv Gandhi Govindaraj; Tae Hwan Shin; Myeong Ok Kim; Gwang Lee
Journal:  Front Immunol       Date:  2018-07-27       Impact factor: 7.561

10.  RFAmyloid: A Web Server for Predicting Amyloid Proteins.

Authors:  Mengting Niu; Yanjuan Li; Chunyu Wang; Ke Han
Journal:  Int J Mol Sci       Date:  2018-07-16       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.