| Literature DB >> 34159204 |
Guohua Huang1, Qingfeng Shen1, Guiyang Zhang1, Pan Wang1, Zu-Guo Yu2.
Abstract
Lysine succinylation is a typical protein post-translational modification and plays a crucial role of regulation in the cellular process. Identifying succinylation sites is fundamental to explore its functions. Although many computational methods were developed to deal with this challenge, few considered semantic relationship between residues. We combined long short-term memory (LSTM) and convolutional neural network (CNN) into a deep learning method for predicting succinylation site. The proposed method obtained a Matthews correlation coefficient of 0.2508 on the independent test, outperforming state of the art methods. We also performed the enrichment analysis of succinylation proteins. The results showed that functions of succinylation were conserved across species but differed to a certain extent with species. On basis of the proposed method, we developed a user-friendly web server for predicting succinylation sites.Entities:
Year: 2021 PMID: 34159204 PMCID: PMC8188601 DOI: 10.1155/2021/9923112
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Flowchart of the proposed method.
Number of parameters and shape of output in the LSTMCNNsucc.
| Layers | Parameters | Output |
|---|---|---|
| Embedding | 1472 | (None, 31, 64) |
| Bidirectional LSTM | 197632 | (None, 31, 256) |
| Dropout | 0 | (None, 31, 256) |
| Flatten | 0 | (None, 7936) |
| 1D convolution | 10272 | (None, 27, 32) |
| Pooling | 0 | (None, 32) |
| Dense (16) | 127504 | (None, 16) |
| Dense (1) | 17 | (None, 1) |
Figure 2The structure of neural networks: (a) for RNN, (b) for LSTM, and (c) for directional LSTM.
Comparison with state of the art methods.
| Method | SN | SP | ACC | MCC |
|---|---|---|---|---|
| LSTMCNNsucc | 0.5916 | 0.7957 | 0.7789 | 0.2508 |
| SuccinSite [ | 0.3977 | 0.8635 | 0.8272 | 0.1925 |
| iSuc-PseAAC [ | 0.1258 | 0.8929 | 0.8296 | 0.0165 |
| DeepSuccinylSite [ | 0.7438 | 0.6879 | 0.6923 | 0.2438 |
Figure 3The numbers of shared terms (a) for biological process, (b) cellular component, and (c) molecular function.
Significant KEGG pathway terms.
| Species | KEGG terms | Benjamini |
|---|---|---|
| E. coli | Metabolic pathways | 3.30 |
| Biosynthesis of amino acids | 1.00 | |
| Biosynthesis of secondary metabolites | 2.40 | |
| Biosynthesis of antibiotics | 7.40 | |
| Lysine biosynthesis | 3.30 | |
|
| ||
| H. sapiens | Biosynthesis of antibiotics | 3.70 |
| Metabolic pathways | 2.80 | |
| Ribosome | 3.40 | |
| Valine, leucine, and isoleucine degradation | 1.30 | |
| Carbon metabolism | 6.20 | |
| Oxidative phosphorylation | 1.10 | |
| Parkinson's disease | 2.60 | |
| Citrate cycle (TCA cycle) | 1.00 | |
| Huntington's disease | 4.10 | |
| Alzheimer's disease | 7.80 | |
| Aminoacyl-tRNA biosynthesis | 1.00 | |
| Butanoate metabolism | 3.40 | |
| Proteasome | 8.20 | |
|
| ||
| M. musculus | Metabolic pathways | 6.20 |
| Parkinson's disease | 8.50 | |
| Oxidative phosphorylation | 3.40 | |
| Nonalcoholic fatty liver disease (NAFLD) | 1.00 | |
| Huntington's disease | 2.80 | |
| Alzheimer's disease | 1.40 | |
| Ribosome | 3.30 | |
| Peroxisome | 1.80 | |
| Glycine, serine, and threonine metabolism | 1.50 | |
| Pyruvate metabolism | 9.00 | |
| Propanoate metabolism | 2.40 | |
| Valine, leucine, and isoleucine degradation | 1.90 | |
| Glyoxylate and dicarboxylate metabolism | 3.10 | |
| Biosynthesis of antibiotics | 5.60 | |
|
| ||
| M. tuberculosis | Metabolic pathways | 1.00 |
| Microbial metabolism in diverse environments | 2.50 | |
| Biosynthesis of antibiotics | 4.40 | |
| Biosynthesis of secondary metabolites | 1.00 | |
| Propanoate metabolism | 1.00 | |
|
| ||
| S. cerevisiae | Metabolic pathways | 5.20 |
| Biosynthesis of amino acids | 3.30 | |
| 2-Oxocarboxylic acid metabolism | 7.90 | |
| Biosynthesis of antibiotics | 3.50 | |
| Oxidative phosphorylation | 3.50 | |