Literature DB >> 30239616

CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning.

Xiaoli Qiang1, Chen Zhou2, Xiucai Ye3, Pu-Feng Du4, Ran Su4, Leyi Wei2.   

Abstract

Cell-penetrating peptides (CPPs) have been shown to be a transport vehicle for delivering cargoes into live cells, offering great potential as future therapeutics. It is essential to identify CPPs for better understanding of their functional mechanisms. Machine learning-based methods have recently emerged as a main approach for computational identification of CPPs. However, one of the main challenges and difficulties is to propose an effective feature representation model that sufficiently exploits the inner difference and relevance between CPPs and non-CPPs, in order to improve the predictive performance. In this paper, we have developed CPPred-FL, a powerful bioinformatics tool for fast, accurate and large-scale identification of CPPs. In our predictor, we introduce a new feature representation learning scheme that enables one to learn feature representations from totally 45 well-trained random forest models with multiple feature descriptors from different perspectives, such as compositional information, position-specific information and physicochemical properties, etc. We integrate class and probabilistic information into our feature representations. To improve the feature representation ability, we further remove redundant and irrelevant features by feature space optimization. Benchmarking experiments showed that CPPred-FL, using 19 informative features only, is able to achieve better performance than the state-of-the-art predictors. We anticipate that CPPred-FL will be a powerful tool for large-scale identification of CPPs, facilitating the characterization of their functional mechanisms and accelerating their applications in clinical therapy.

Year:  2018        PMID: 30239616     DOI: 10.1093/bib/bby091

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  25 in total

1.  DWPPI: A Deep Learning Approach for Predicting Protein-Protein Interactions in Plants Based on Multi-Source Information With a Large-Scale Biological Network.

Authors:  Jie Pan; Zhu-Hong You; Li-Ping Li; Wen-Zhun Huang; Jian-Xin Guo; Chang-Qing Yu; Li-Ping Wang; Zheng-Yang Zhao
Journal:  Front Bioeng Biotechnol       Date:  2022-03-21

2.  AMYPred-FRL is a novel approach for accurate prediction of amyloid proteins by using feature representation learning.

Authors:  Phasit Charoenkwan; Saeed Ahmed; Chanin Nantasenamat; Julian M W Quinn; Mohammad Ali Moni; Pietro Lio'; Watshara Shoombuatong
Journal:  Sci Rep       Date:  2022-05-11       Impact factor: 4.996

3.  Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.

Authors:  Md Mehedi Hasan; Sho Tsukiyama; Jae Youl Cho; Hiroyuki Kurata; Md Ashad Alam; Xiaowen Liu; Balachandran Manavalan; Hong-Wen Deng
Journal:  Mol Ther       Date:  2022-05-06       Impact factor: 12.910

4.  iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool.

Authors:  Xiao Yang; Xiucai Ye; Xuehong Li; Lesong Wei
Journal:  Front Genet       Date:  2021-03-31       Impact factor: 4.599

5.  iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2018-10-24       Impact factor: 7.271

6.  ImmunoSPdb: an archive of immunosuppressive peptides.

Authors:  Salman Sadullah Usmani; Piyush Agrawal; Manika Sehgal; Pradeep Kumar Patel; Gajendra P S Raghava
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

7.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

8.  AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2019-07-03       Impact factor: 7.271

9.  4mCpred-EL: An Ensemble Learning Framework for Identification of DNA N4-methylcytosine Sites in the Mouse Genome.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Da Yeon Lee; Leyi Wei; Gwang Lee
Journal:  Cells       Date:  2019-10-28       Impact factor: 6.600

10.  The Spectrum of Design Solutions for Improving the Activity-Selectivity Product of Peptide Antibiotics against Multidrug-Resistant Bacteria and Prostate Cancer PC-3 Cells.

Authors:  Davor Juretić; Anja Golemac; Denise E Strand; Keshi Chung; Nada Ilić; Ivana Goić-Barišić; François-Xavier Pellay
Journal:  Molecules       Date:  2020-08-01       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.