Literature DB >> 30462171

DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction.

Abdurrahman Elbasir1, Balasubramanian Moovarkumudalvan2, Khalid Kunji3, Prasanna R Kolatkar2, Raghvendra Mall3, Halima Bensmail1,3.   

Abstract

MOTIVATION: Protein structure determination has primarily been performed using X-ray crystallography. To overcome the expensive cost, high attrition rate and series of trial-and-error settings, many in-silico methods have been developed to predict crystallization propensities of proteins based on their sequences. However, the majority of these methods build their predictors by extracting features from protein sequences, which is computationally expensive and can explode the feature space. We propose DeepCrystal, a deep learning framework for sequence-based protein crystallization prediction. It uses deep learning to identify proteins which can produce diffraction-quality crystals without the need to manually engineer additional biochemical and structural features from sequence. Our model is based on convolutional neural networks, which can exploit frequently occurring k-mers and sets of k-mers from the protein sequences to distinguish proteins that will result in diffraction-quality crystals from those that will not.
RESULTS: Our model surpasses previous sequence-based protein crystallization predictors in terms of recall, F-score, accuracy and Matthew's correlation coefficient (MCC) on three independent test sets. DeepCrystal achieves an average improvement of 1.4, 12.1% in recall, when compared to its closest competitors, Crysalis II and Crysf, respectively. In addition, DeepCrystal attains an average improvement of 2.1, 6.0% for F-score, 1.9, 3.9% for accuracy and 3.8, 7.0% for MCC w.r.t. Crysalis II and Crysf on independent test sets.
AVAILABILITY AND IMPLEMENTATION: The standalone source code and models are available at https://github.com/elbasir/DeepCrystal and a web-server is also available at https://deeplearning-protein.qcri.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30462171     DOI: 10.1093/bioinformatics/bty953

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity.

Authors:  Qizhi Zhu; Lihua Wang; Ruyu Dai; Wei Zhang; Wending Tang; Yannan Bin; Zeliang Wang; Junfeng Xia
Journal:  Interdiscip Sci       Date:  2021-06-18       Impact factor: 2.233

2.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

3.  A deep learning approach for orphan gene identification in moso bamboo (Phyllostachys edulis) based on the CNN + Transformer model.

Authors:  Xiaodan Zhang; Jinxiang Xuan; Chensong Yao; Qijuan Gao; Lianglong Wang; Xiu Jin; Shaowen Li
Journal:  BMC Bioinformatics       Date:  2022-05-05       Impact factor: 3.307

4.  DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors.

Authors:  Lezheng Yu; Fengjuan Liu; Yizhou Li; Jiesi Luo; Runyu Jing
Journal:  Front Microbiol       Date:  2021-01-21       Impact factor: 5.640

5.  An Interpretable Double-Scale Attention Model for Enzyme Protein Class Prediction Based on Transformer Encoders and Multi-Scale Convolutions.

Authors:  Ken Lin; Xiongwen Quan; Chen Jin; Zhuangwei Shi; Jinglong Yang
Journal:  Front Genet       Date:  2022-04-01       Impact factor: 4.772

6.  TLCrys: Transfer Learning Based Method for Protein Crystallization Prediction.

Authors:  Chen Jin; Zhuangwei Shi; Chuanze Kang; Ken Lin; Han Zhang
Journal:  Int J Mol Sci       Date:  2022-01-16       Impact factor: 5.923

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.