| Literature DB >> 36166351 |
Quang-Hien Kha1, Quang-Thai Ho2,3, Nguyen Quoc Khanh Le4,5,6.
Abstract
Background: SNARE proteins play a vital role in membrane fusion and cellular physiology and pathological processes. Many potential therapeutics for mental diseases or even cancer based on SNAREs are also developed. Therefore, there is a dire need to predict the SNAREs for further manipulation of these essential proteins, which demands new and efficient approaches.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36166351 PMCID: PMC9554904 DOI: 10.1021/acs.jcim.2c01034
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 6.162
Figure 1Flowchart of our proposed method.
Detailed Statistics of Dataset Used in this Studya
| training data | independent data #1 | independent data #2 | |
|---|---|---|---|
| SNAREs | 644 | 38 | 15 |
| non-SNAREs | 2234 | 349 | 126 |
Training data and independent data #1 were retrieved from the previous study.[22] Independent data #2 is newly discovered data (from November 1, 2018) that were manually collected in this study.
Comparison to Other Sequence-Based Features in Proteina
| feature | sensitivity | specificity | accuracy | MCC |
|---|---|---|---|---|
| AAC | 0.844 | 0.811 | 0.814 | 0.430 |
| DPC | 0.838 | 0.860 | 0.858 | 0.492 |
| PAAC | 0.782 | 0.877 | 0.869 | 0.485 |
| APAAC | 0.781 | 0.883 | 0.874 | 0.493 |
| GAAC | 0.755 | 0.718 | 0.721 | 0.285 |
| CKSAAP | 0.824 | 0.872 | 0.868 | 0.502 |
| CKSAAGP | 0.802 | 0.805 | 0.805 | 0.397 |
| PSSM | 0.845 | 0.955 | 0.930 | 0.800 |
All of the results were obtained using CNN architecture on the training set via a cross-validation scheme. SMOTE algorithm was applied to resolve imbalance problems.
Figure 2Comparison among different models. (A) ROC curve and (B) precision–recall Curve.
Figure 3Feature representation of multiscan PSSM profiles. (A) t-SNE analysis and (B) UMAP analysis.
Comparison to Previous Predictors Using the Same Independent Dataseta
| predictor | sensitivity | specificity | accuracy | MCC |
|---|---|---|---|---|
| SNARE-CNN[ | 0.658 | 0.903 | 0.879 | 0.460 |
| SNAREs-SAP[ | 0.680 | 0.940 | 0.920 | 0.480 |
| proposed method | 0.842 | 0.968 | 0.955 | 0.767 |
All of the results were obtained on independent dataset #1.