| Literature DB >> 28542505 |
Jiansong Fang1,2, Ling Wang3, Yecheng Li3, Wenwen Lian4, Xiaocong Pang4, Hong Wang1, Dongsheng Yuan1, Qi Wang1,2, Ai-Lin Liu4, Guan-Hua Du4.
Abstract
Alzheimer's disease (AD) is a complicated progressive neurodegeneration disorder. To confront AD, scientists are searching for multi-target-directed ligands (MTDLs) to delay disease progression. The in silico prediction of chemical-protein interactions (CPI) can accelerate target identification and drug discovery. Previously, we developed 100 binary classifiers to predict the CPI for 25 key targets against AD using the multi-target quantitative structure-activity relationship (mt-QSAR) method. In this investigation, we aimed to apply the mt-QSAR method to enlarge the model library to predict CPI towards AD. Another 104 binary classifiers were further constructed to predict the CPI for 26 preclinical AD targets based on the naive Bayesian (NB) and recursive partitioning (RP) algorithms. The internal 5-fold cross-validation and external test set validation were applied to evaluate the performance of the training sets and test set, respectively. The area under the receiver operating characteristic curve (ROC) for the test sets ranged from 0.629 to 1.0, with an average of 0.903. In addition, we developed a web server named AlzhCPI to integrate the comprehensive information of approximately 204 binary classifiers, which has potential applications in network pharmacology and drug repositioning. AlzhCPI is available online at http://rcidm.org/AlzhCPI/index.html. To illustrate the applicability of AlzhCPI, the developed system was employed for the systems pharmacology-based investigation of shichangpu against AD to enhance the understanding of the mechanisms of action of shichangpu from a holistic perspective.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28542505 PMCID: PMC5460905 DOI: 10.1371/journal.pone.0178347
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The schematic workflow of AlzhCPI to predict cheimical-protein interactions toward Alzheimer's disease based on the multitarget quantitative structure-activity relationships (mt-QSAR).
Fig 2Summary of 51 key targets in AlzhCPI.
Detailed statistical description of the entire data set based on the multi-label classification strategy.
| Encoding Gene | Training set (ECFP2) | Test set (ECFP2) | ||||||
|---|---|---|---|---|---|---|---|---|
| Inhibitors | decoys | Total | Tanimoto index | Inhibitors | decoys | Total | Tanimoto index | |
| HTR2A | 2200 | 6600 | 8800 | 0.288 | 742 | 2226 | 2968 | 0.198 |
| ADORA2A | 2360 | 7080 | 9440 | 0.279 | 783 | 2349 | 3132 | 0.179 |
| CHRM2 | 380 | 1140 | 1520 | 0.249 | 128 | 384 | 512 | 0.15 |
| PDE9A | 110 | 330 | 440 | 0.114 | 33 | 99 | 132 | 0.046 |
| GRM2 | 310 | 930 | 1240 | 0.28 | 106 | 318 | 424 | 0.234 |
| GRM3 | 50 | 150 | 200 | 0.305 | 16 | 48 | 64 | 0.203 |
| MAPK8 | 780 | 2340 | 3120 | 0.192 | 266 | 798 | 1064 | 0.091 |
| MAPK9 | 330 | 990 | 1320 | 0.13 | 108 | 324 | 432 | 0.06 |
| MAPK10 | 510 | 1530 | 2040 | 0.183 | 174 | 522 | 696 | 0.056 |
| MAPK14 | 40 | 120 | 160 | 0.181 | 19 | 57 | 76 | 0.171 |
| HS90AA1 | 750 | 2250 | 3000 | 0.215 | 248 | 744 | 992 | 0.1361 |
| PIN1 | 60 | 180 | 240 | 0.125 | 23 | 69 | 92 | 0.0544 |
| MAPT | 40 | 120 | 160 | 0.1125 | 12 | 36 | 48 | 0.0209 |
| PTGS2 | 1760 | 5280 | 7040 | 0.542 | 583 | 1749 | 2332 | 0.164 |
| NOS2 | 570 | 1710 | 2280 | 0.33 | 184 | 552 | 736 | 0.288 |
| MPO | 60 | 180 | 240 | 0.338 | 19 | 57 | 76 | 0.211 |
| CHUK | 120 | 360 | 480 | 0.173 | 41 | 123 | 164 | 0.098 |
| IKBKB | 600 | 1800 | 2400 | 0.22 | 198 | 594 | 792 | 0.123 |
| TNF | 560 | 1680 | 2240 | 0.184 | 192 | 576 | 768 | 0.083 |
| ALOX12 | 120 | 360 | 480 | 0.2 | 40 | 120 | 160 | 0.119 |
| CTSD | 1250 | 3750 | 5000 | 0.246 | 423 | 1269 | 1692 | 0.093 |
| PDK1 | 440 | 1320 | 1760 | 0.261 | 149 | 447 | 596 | 0.2 |
| HMGCR | 600 | 1800 | 2400 | 0.233 | 199 | 597 | 796 | 0.136 |
| IDE | 60 | 180 | 240 | 0.054 | 20 | 60 | 80 | 0.013 |
| PPARG | 1730 | 5190 | 6920 | 0.264 | 582 | 1746 | 2328 | 0.171 |
| CES1 | 290 | 870 | 1160 | 0.305 | 100 | 300 | 400 | 0.27 |
Fig 3Targets (A) and active compounds (B) classification within the entire data set in AlzhCPI.
Performance of the 5-fold cross-validation for 26 targets towards Alzheimer disease using NB and RP classifiers.
| Encoding Gene | ECFP6 | MACCS | ||||||
|---|---|---|---|---|---|---|---|---|
| NB | RP | NB | RP | |||||
| MCC | AUC | MCC | AUC | MCC | AUC | MCC | AUC | |
| HTR2A | 0.992 | 1 | 0.944 | 0.988 | 0.732 | 0.948 | 0.938 | 0.989 |
| ADORA2A | 0.989 | 1 | 0.947 | 0.989 | 0.89 | 0.981 | 0.984 | 0.995 |
| CHRM2 | 0.984 | 0.999 | 0.877 | 0.976 | 0.779 | 0.963 | 0.928 | 0.978 |
| PDE9A | 0.994 | 0.999 | 0.913 | 0.97 | 0.939 | 0.993 | 0.947 | 0.971 |
| GRM2 | 0.989 | 1 | 0.955 | 0.987 | 0.754 | 0.962 | 0.892 | 0.979 |
| GRM3 | 1 | 1 | 0.882 | 0.968 | 0.906 | 0.984 | 0.889 | 0.961 |
| MAPK8 | 0.991 | 1 | 0.916 | 0.973 | 0.707 | 0.941 | 0.893 | 0.966 |
| MAPK9 | 0.98 | 0.996 | 0.852 | 0.961 | 0.763 | 0.945 | 0.822 | 0.939 |
| MAPK10 | 0.952 | 0.993 | 0.866 | 0.956 | 0.65 | 0.915 | 0.849 | 0.943 |
| MAPK14 | 1 | 1 | 0.905 | 0.935 | 0.916 | 0.98 | 0.795 | 0.897 |
| HS90AA1 | 0.975 | 0.997 | 0.928 | 0.984 | 0.689 | 0.941 | 0.911 | 0.97 |
| PIN1 | 0.978 | 0.999 | 0.914 | 0.964 | 0.978 | 0.998 | 0.812 | 0.922 |
| MAPT | 0.937 | 0.998 | 0.725 | 0.886 | 0.794 | 0.904 | 0.724 | 0.815 |
| PTGS2 | 0.956 | 0.997 | 0.93 | 0.982 | 0.698 | 0.935 | 0.965 | 0.991 |
| NOS2 | 0.976 | 0.999 | 0.886 | 0.968 | 0.702 | 0.929 | 0.887 | 0.97 |
| MPO | 0.956 | 0.996 | 0.914 | 0.963 | 0.781 | 0.956 | 0.918 | 0.953 |
| CHUK | 0.983 | 0.992 | 0.955 | 0.961 | 0.729 | 0.971 | 0.882 | 0.947 |
| IKBKB | 0.993 | 1 | 0.932 | 0.983 | 0.775 | 0.954 | 0.905 | 0.967 |
| TNF | 0.867 | 0.985 | 0.814 | 0.933 | 0.564 | 0.854 | 0.798 | 0.938 |
| ALOX12 | 0.989 | 1 | 0.924 | 0.98 | 0.88 | 0.986 | 0.936 | 0.989 |
| CTSD | 0.961 | 0.994 | 0.976 | 0.994 | 0.729 | 0.949 | 0.942 | 0.992 |
| PDK1 | 0.995 | 0.997 | 0.981 | 0.996 | 0.985 | 0.994 | 0.983 | 0.991 |
| HMGCR | 0.991 | 1 | 0.974 | 0.996 | 0.935 | 0.998 | 0.97 | 0.995 |
| IDE | 0.851 | 0.988 | 0.679 | 0.881 | 0.68 | 0.923 | 0.753 | 0.829 |
| PPARG | 0.981 | 0.998 | 0.955 | 0.991 | 0.745 | 0.947 | 0.934 | 0.988 |
| CES1 | 0.956 | 0.999 | 0.934 | 0.972 | 0.676 | 0.913 | 0.89 | 0.969 |
Performance of the test set validation for 25 targets towards Alzheimer disease using NB and RP classifiers.
| Encoding Gene | ECFP6 | MACCS | ||||||
|---|---|---|---|---|---|---|---|---|
| NB | RP | NB | RP | |||||
| MCC | AUC | MCC | AUC | MCC | AUC | MCC | AUC | |
| HTR2A | 0.953 | 0.997 | 0.884 | 0.967 | 0.678 | 0.931 | 0.838 | 0.959 |
| ADORA2A | 0.653 | 0.949 | 0.681 | 0.911 | 0.553 | 0.868 | 0.26 | 0.714 |
| CHRM2 | 0.797 | 0.961 | 0.738 | 0.889 | 0.664 | 0.915 | 0.651 | 0.939 |
| PDE9A | 0.96 | 0.994 | 0.836 | 0.954 | 0.643 | 0.982 | 0.771 | 0.855 |
| GRM2 | 0.956 | 0.989 | 0.893 | 0.955 | 0.544 | 0.876 | 0.687 | 0.917 |
| GRM3 | 0.832 | 0.897 | 0.797 | 0.911 | 0.785 | 0.874 | 0.788 | 0.847 |
| MAPK8 | 0.927 | 0.991 | 0.801 | 0.928 | 0.651 | 0.903 | 0.746 | 0.898 |
| MAPK9 | 0.829 | 0.956 | 0.681 | 0.869 | 0.633 | 0.901 | 0.615 | 0.874 |
| MAPK10 | 0.787 | 0.937 | 0.695 | 0.879 | 0.541 | 0.852 | 0.594 | 0.84 |
| MAPK14 | 0.965 | 0.984 | 0.894 | 0.921 | 0.75 | 0.935 | 0.393 | 0.7 |
| HS90AA1 | 0.821 | 0.935 | 0.807 | 0.897 | 0.585 | 0.88 | 0.745 | 0.857 |
| PIN1 | 0.854 | 0.964 | 0.791 | 0.906 | 0.728 | 0.899 | 0.698 | 0.887 |
| MAPT | 0.832 | 0.97 | 0.408 | 0.748 | 0.591 | 0.854 | 0.415 | 0.779 |
| PTGS2 | 0.854 | 0.983 | 0.756 | 0.919 | 0.587 | 0.874 | 0.898 | 0.976 |
| NOS2 | 0.893 | 0.983 | 0.752 | 0.901 | 0.543 | 0.841 | 0.668 | 0.894 |
| MPO | 0.787 | 0.994 | 0.666 | 0.865 | 0.383 | 0.629 | 0.492 | 0.752 |
| CHUK | 0.735 | 0.939 | 0.731 | 0.856 | 0.726 | 0.928 | 0.677 | 0.921 |
| IKBKB | 0.895 | 0.973 | 0.832 | 0.911 | 0.696 | 0.907 | 0.718 | 0.915 |
| TNF | 0.697 | 0.915 | 0.501 | 0.791 | 0.171 | 0.722 | 0.502 | 0.814 |
| ALOX12 | 0.849 | 0.97 | 0.752 | 0.906 | 0.718 | 0.901 | 0.804 | 0.932 |
| CTSD | 0.885 | 0.974 | 0.92 | 0.95 | 0.647 | 0.913 | 0.867 | 0.941 |
| PDK1 | 0.946 | 0.959 | 0.955 | 0.976 | 0.923 | 0.961 | 0.937 | 0.955 |
| HMGCR | 0.964 | 1 | 0.963 | 0.987 | 0.913 | 0.995 | 0.929 | 0.984 |
| IDE | 0.864 | 0.983 | 0.321 | 0.729 | 0.114 | 0.69 | 0.401 | 0.704 |
| PPARG | 0.897 | 0.965 | 0.884 | 0.948 | 0.661 | 0.916 | 0.803 | 0.928 |
| CES1 | 0.683 | 0.929 | 0.809 | 0.919 | 0.472 | 0.792 | 0.662 | 0.861 |
Fig 4Boxplot shows the minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), and maximum of Matthews correlation coefficient (MCC) on test sets based on four types of classifiers (A) and different fingerprints and algorithms (B).
Fig 5The compound–target–mechanism network of shichangpu based on AlzhCPI.
Ellipse, hexagon and triangle represent drug nodes, protein nodes and mechanism nodes, respectively.