Literature DB >> 32692832

ToxDL: deep learning using primary structure and domain embeddings for assessing protein toxicity.

Xiaoyong Pan1,2,3, Jasper Zuallaert2,4, Xi Wang3, Hong-Bin Shen1, Elda Posada Campos3, Denys O Marushchak3, Wesley De Neve2,4.   

Abstract

MOTIVATION: Genetically engineering food crops involves introducing proteins from other species into crop plant species or modifying already existing proteins with gene editing techniques. In addition, newly synthesized proteins can be used as therapeutic protein drugs against diseases. For both research and safety regulation purposes, being able to assess the potential toxicity of newly introduced/synthesized proteins is of high importance.
RESULTS: In this study, we present ToxDL, a deep learning-based approach for in silico prediction of protein toxicity from sequence alone. ToxDL consists of (i) a module encompassing a convolutional neural network that has been designed to handle variable-length input sequences, (ii) a domain2vec module for generating protein domain embeddings and (iii) an output module that classifies proteins as toxic or non-toxic, using the outputs of the two aforementioned modules. Independent test results obtained for animal proteins and cross-species transferability results obtained for bacteria proteins indicate that ToxDL outperforms traditional homology-based approaches and state-of-the-art machine-learning techniques. Furthermore, through visualizations based on saliency maps, we are able to verify that the proposed network learns known toxic motifs. Moreover, the saliency maps allow for directed in silico modification of a sequence, thus making it possible to alter its predicted protein toxicity.
AVAILABILITY AND IMPLEMENTATION: ToxDL is freely available at http://www.csbio.sjtu.edu.cn/bioinf/ToxDL/. The source code can be found at https://github.com/xypan1232/ToxDL. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 32692832     DOI: 10.1093/bioinformatics/btaa656

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Deep Learning-Powered Prediction of Human-Virus Protein-Protein Interactions.

Authors:  Xiaodi Yang; Shiping Yang; Panyu Ren; Stefan Wuchty; Ziding Zhang
Journal:  Front Microbiol       Date:  2022-04-15       Impact factor: 6.064

2.  Reliability of the In Silico Prediction Approach to In Vitro Evaluation of Bacterial Toxicity.

Authors:  Sung-Yoon Ahn; Mira Kim; Ji-Eun Bae; Iel-Soo Bang; Sang-Woong Lee
Journal:  Sensors (Basel)       Date:  2022-08-31       Impact factor: 3.847

Review 3.  Machine Learning in Healthcare.

Authors:  Hafsa Habehh; Suril Gohel
Journal:  Curr Genomics       Date:  2021-12-16       Impact factor: 2.689

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.