| Literature DB >> 35897954 |
Jie Liu1, Dongdong Peng1, Jinlong Li1, Zong Dai2, Xiaoyong Zou3, Zhanchao Li1,4,5.
Abstract
Parkinson's disease (PD) is a serious neurodegenerative disease. Most of the current treatment can only alleviate symptoms, but not stop the progress of the disease. Therefore, it is crucial to find medicines to completely cure PD. Finding new indications of existing drugs through drug repositioning can not only reduce risk and cost, but also improve research and development efficiently. A drug repurposing method was proposed to identify potential Parkinson's disease-related drugs based on multi-source data integration and convolutional neural network. Multi-source data were used to construct similarity networks, and topology information were utilized to characterize drugs and PD-associated proteins. Then, diffusion component analysis method was employed to reduce the feature dimension. Finally, a convolutional neural network model was constructed to identify potential associations between existing drugs and LProts (PD-associated proteins). Based on 10-fold cross-validation, the developed method achieved an accuracy of 91.57%, specificity of 87.24%, sensitivity of 95.27%, Matthews correlation coefficient of 0.8304, area under the receiver operating characteristic curve of 0.9731 and area under the precision-recall curve of 0.9727, respectively. Compared with the state-of-the-art approaches, the current method demonstrates superiority in some aspects, such as sensitivity, accuracy, robustness, etc. In addition, some of the predicted potential PD therapeutics through molecular docking further proved that they can exert their efficacy by acting on the known targets of PD, and may be potential PD therapeutic drugs for further experimental research. It is anticipated that the current method may be considered as a powerful tool for drug repurposing and pathological mechanism studies.Entities:
Keywords: Parkinson’s disease; convolutional neural network; drug repositioning; multi-source data fusion
Mesh:
Substances:
Year: 2022 PMID: 35897954 PMCID: PMC9369596 DOI: 10.3390/molecules27154780
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Figure 1Similarity values and statistical results. (A–C) The similarity values of any two drugs, two LProts and two drug–LProt association pairs, respectively. (D) The statistical distribution of drugs, LProts and drug–LProt associations similarity values.
Average values of Acc, Sen, Spe, Mcc, Auroc, Auprc.
| Dimension | Acc (%) | Spe (%) | Sen (%) | Mcc | Auroc | Auprc | |
|---|---|---|---|---|---|---|---|
| Drug | LProt | ||||||
| 100 | 200 | 90.76 | 94.54 | 86.98 | 0.8187 | 0.9709 | 0.9709 |
| 100 | 300 | 91.51 | 94.88 | 88.13 | 0.8328 | 0.9730 | 0.9729 |
| 100 | 400 | 91.57 | 95.26 | 87.24 | 0.8303 | 0.9731 | 0.9726 |
| 100 | 600 | 91.25 | 95.43 | 87.08 | 0.8287 | 0.9721 | 0.9708 |
| 200 | 300 | 91.08 | 94.51 | 89.65 | 0.8252 | 0.9711 | 0.9710 |
| 300 | 500 | 91.06 | 95.49 | 86.63 | 0.8251 | 0.9715 | 0.9702 |
| 500 | 500 | 91.50 | 94.77 | 88.15 | 0.8320 | 0.9706 | 0.9702 |
Figure 2RSDs of Acc, Auroc, Auprc, Sen, Spe, Mcc.
Figure 3Statistical average results from datasets with various ratios between positive and negative sample.
10-fold cross-validation test results on different non-redundant datasets.
| Threshold | Acc (%) | Sen (%) | Spe (%) | Mcc | Auroc | Auprc |
|---|---|---|---|---|---|---|
| 0.9 | 91.57 | 95.27 | 87.24 | 0.8304 | 0.9732 | 0.9727 |
| 0.8 | 91.81 | 95.21 | 88.41 | 0.8387 | 0.9735 | 0.9726 |
| 0.7 | 91.34 | 94.92 | 87.76 | 0.8294 | 0.9708 | 0.9691 |
| 0.6 | 90.74 | 95.41 | 86.07 | 0.8193 | 0.9699 | 0.9691 |
| 0.5 | 90.11 | 94.61 | 85.61 | 0.8065 | 0.9658 | 0.9654 |
| 0.4 | 88.49 | 94.73 | 82.24 | 0.7769 | 0.9598 | 0.9587 |
10-fold cross-validation test results on different non-redundant drug datasets.
| Threshold | Acc (%) | Sen (%) | Spe (%) | Mcc | Auroc | Auprc |
|---|---|---|---|---|---|---|
| 0.9 | 91.92 | 94.85 | 89.05 | 0.8408 | 0.9726 | 0.9728 |
| 0.8 | 90.23 | 94.32 | 86.13 | 0.8087 | 0.9666 | 0.9661 |
| 0.7 | 90.24 | 94.37 | 86.11 | 0.8090 | 0.9667 | 0.9654 |
| 0.6 | 90.73 | 93.62 | 87.85 | 0.8171 | 0.9652 | 0.9658 |
| 0.5 | 90.59 | 94.20 | 86.98 | 0.8151 | 0.9660 | 0.9654 |
| 0.4 | 91.90 | 90.84 | 92.95 | 0.8395 | 0.9645 | 0.9692 |
10-fold cross-validation results of the non-redundant drug–LProt association datasets.
| Threshold | Acc (%) | Sen (%) | Spe (%) | Mcc | Auroc | Auprc |
|---|---|---|---|---|---|---|
| 0.9 | 91.87 | 95.58 | 88.56 | 0.8405 | 0.9746 | 0.9764 |
| 0.8 | 91.84 | 95.58 | 88.25 | 0.8397 | 0.9757 | 0.9760 |
| 0.7 | 91.75 | 94.67 | 88.11 | 0.8366 | 0.9720 | 0.9741 |
Figure 4(A) receiver operating characteristic curve with different folds. (B) precision–recall curve with different folds.
Figure 5Performance comparison with existing methods. (A) Auroc value. (B) Auprc value.
Comparison results of 10-fold cross-validation with different methods.
| Acc (%) | Sen (%) | Spe (%) | Mcc | Auroc | Auprc | |
|---|---|---|---|---|---|---|
| Logistic regression | 86.51 | 86.84 | 86.19 | 0.7303 | 0.9341 | 0.9248 |
| KNN | 87.29 | 94.40 | 80.16 | 0.7534 | 0.9446 | 0.9495 |
| NB | 77.62 | 71.46 | 83.82 | 0.5569 | 0.8605 | 0.8707 |
| RF | 90.57 | 91.12 | 90.02 | 0.8114 | 0.9653 | 0.9659 |
| SVM | 86.92 | 88.12 | 85.72 | 0.7386 | 0.9270 | 0.9094 |
| Current | 91.57 | 95.27 | 87.24 | 0.8304 | 0.9731 | 0.9727 |
Figure 6ROC and PRC curves for various methods. (A) ROC Curve. (B) PRC Curve.
Top ten drug information.
| Number | Drug | Indication |
|---|---|---|
| 1 | Topotecan | Treat ovarian cancer, small cell lung cancer or cervical cancer. |
| 2 | Loperamide | Control nonspecific and chronic diarrhea caused by inflammatory bowel disease or gastroenteritis. |
| 3 | Artenimol | Treatment of artemisinin derivatives and the antimalarial agent Plasmodium falciparum infection. |
| 4 | Mitotane | Treatment of inoperable adrenal cortical tumors; Cushing’s syndrome. |
| 5 | Estramustine | The palliative treatment of patients with metastatic and/or progressive carcinoma of the prostate. |
| 6 | Quercetin | A flavonol widely distributed in plants. It is an antioxidant, like many other phenolic heterocyclic compounds. |
| 7 | Nortriptyline | A tricyclic antidepressant used to treat major depressive disorder and also to treat chronic pain and other conditions. |
| 8 | Bacitracin | Topical preparations for acute and chronic topical skin infections. |
| 9 | Minocycline | Treatment of inflammatory lesions of acne vulgaris. |
| 10 | Doxepin | A psychotropic agent with antidepressant and anxiolytic properties. |
Molecular docking results of pimavanserin, loperamide, topotecan, artemisinol and PD target (HTR2A).
| Ligand | Target Protein | Binding Energy | Inhibition Constant |
|---|---|---|---|
| Pimavanserin | HTR2A | −6.4 | 20.49 |
| Loperamide | −7.76 | 2.05 | |
| Topotecan | −7.96 | 1.47 | |
| Artenimol | −7.65 | 2.46 |
Figure 7Visualization of the docking results of small molecule drugs (Pimawanserin, Topotecan) and PD target (5-HT2A). (A,B) Pimavanserin docked to receptor proteins as well as their interactions (Positive Control) (C,D) Topotecan docked to receptor proteins and their interactions.
The detail information of the drugs and proteins.
| Information | Number | Sources |
|---|---|---|
| drug–chemical structure | 6587 | DrugBank Database |
| drug–ATC | 4636 | |
| drug–enzyme | 4828 | |
| drug–target | 15,504 | |
| drug–side effect | 755,165 | SIDES Database |
| PPI | 353,550 | HIPPIES Database |
| PD targets | 157 | TTD Database |
| PD drugs | 30 | |
| PD associated targets (LProt) | 5295 | PPI |
| LProt–pathway | 13,947 | CTD Database |
| LProt–sequence | 5295 | Uniprot Database |
| drugs | 6587 | DrugBank Database |
The framework and parameters of convolutional neural network.
| Layer | Size |
|---|---|
| Input | 500*1 |
| Convolutional | 4 filters with 5*1, stride 1*1 |
| ReLU | - |
| Convolutional | 8 filters with 10*1, stride 1*1 |
| ReLU | - |
| Max-Pooling | 2*1, stride 2*1 |
| ReLU | - |
| Fully connected | 256, dropout = 0.5 |
| Sigmoid | - |
| Classification | 2 |
Figure 8MSDF-CNN workflow.