Literature DB >> 28430949

Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility.

Rhys Heffernan1, Yuedong Yang2, Kuldip Paliwal1, Yaoqi Zhou2.   

Abstract

MOTIVATION: The accuracy of predicting protein local and global structural properties such as secondary structure and solvent accessible surface area has been stagnant for many years because of the challenge of accounting for non-local interactions between amino acid residues that are close in three-dimensional structural space but far from each other in their sequence positions. All existing machine-learning techniques relied on a sliding window of 10-20 amino acid residues to capture some 'short to intermediate' non-local interactions. Here, we employed Long Short-Term Memory (LSTM) Bidirectional Recurrent Neural Networks (BRNNs) which are capable of capturing long range interactions without using a window.
RESULTS: We showed that the application of LSTM-BRNN to the prediction of protein structural properties makes the most significant improvement for residues with the most long-range contacts (|i-j| >19) over a previous window-based, deep-learning method SPIDER2. Capturing long-range interactions allows the accuracy of three-state secondary structure prediction to reach 84% and the correlation coefficient between predicted and actual solvent accessible surface areas to reach 0.80, plus a reduction of 5%, 10%, 5% and 10% in the mean absolute error for backbone ϕ , ψ , θ and τ angles, respectively, from SPIDER2. More significantly, 27% of 182724 40-residue models directly constructed from predicted C α atom-based θ and τ have similar structures to their corresponding native structures (6Å RMSD or less), which is 3% better than models built by ϕ and ψ angles. We expect the method to be useful for assisting protein structure and function prediction.
AVAILABILITY AND IMPLEMENTATION: The method is available as a SPIDER3 server and standalone package at http://sparks-lab.org . CONTACT: yaoqi.zhou@griffith.edu.au or yuedong.yang@griffith.edu.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28430949     DOI: 10.1093/bioinformatics/btx218

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  74 in total

1.  Probabilistic divergence of a template-based modelling methodology from the ideal protocol.

Authors:  Ashish Runthala
Journal:  J Mol Model       Date:  2021-01-07       Impact factor: 1.810

2.  Estimating the Influence of Physicochemical and Biochemical Property Indexes on Selection for Amino Acids Usage in Eukaryotic Cells.

Authors:  Giovani B Fogalli; Sergio R P Line
Journal:  J Mol Evol       Date:  2021-03-24       Impact factor: 2.395

3.  A deep dense inception network for protein beta-turn prediction.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  Proteins       Date:  2019-07-23

4.  Predicting Hot Spot Residues at Protein-DNA Binding Interfaces Based on Sequence Information.

Authors:  Lingsong Yao; Huadong Wang; Yannan Bin
Journal:  Interdiscip Sci       Date:  2020-10-17       Impact factor: 2.233

5.  MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  Proteins       Date:  2018-03-12

6.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

Review 7.  A guide to machine learning for biologists.

Authors:  Joe G Greener; Shaun M Kandathil; Lewis Moffat; David T Jones
Journal:  Nat Rev Mol Cell Biol       Date:  2021-09-13       Impact factor: 94.444

8.  Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.

Authors:  Zhen Chen; Xuhan Liu; Fuyi Li; Chen Li; Tatiana Marquez-Lago; André Leier; Tatsuya Akutsu; Geoffrey I Webb; Dakang Xu; Alexander Ian Smith; Lei Li; Kuo-Chen Chou; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

9.  Prediction of Protein Backbone Torsion Angles Using Deep Residual Inception Neural Networks.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2018-03-12       Impact factor: 3.710

10.  DNSS2: Improved ab initio protein secondary structure prediction using advanced deep learning architectures.

Authors:  Zhiye Guo; Jie Hou; Jianlin Cheng
Journal:  Proteins       Date:  2020-09-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.