| Literature DB >> 29943074 |
Jie Dong1,2,3, Ning-Ning Wang1, Zhi-Jiang Yao1, Lin Zhang3, Yan Cheng1, Defang Ouyang4, Ai-Ping Lu5, Dong-Sheng Cao6,7.
Abstract
Current pharmaceutical research and development (R&D) is a high-risk investment which is usually faced with some unexpected even disastrous failures in different stages of drug discovery. One main reason for R&D failures is the efficacy and safety deficiencies which are related largely to absorption, distribution, metabolism and excretion (ADME) properties and various toxicities (T). Therefore, rapid ADMET evaluation is urgently needed to minimize failures in the drug discovery process. Here, we developed a web-based platform called ADMETlab for systematic ADMET evaluation of chemicals based on a comprehensively collected ADMET database consisting of 288,967 entries. Four function modules in the platform enable users to conveniently perform six types of drug-likeness analysis (five rules and one prediction model), 31 ADMET endpoints prediction (basic property: 3, absorption: 6, distribution: 3, metabolism: 10, elimination: 2, toxicity: 7), systematic evaluation and database/similarity searching. We believe that this web platform will hopefully facilitate the drug discovery process by enabling early drug-likeness evaluation, rapid ADMET virtual screening or filtering and prioritization of chemical structures. The ADMETlab web platform is designed based on the Django framework in Python, and is freely accessible at http://admet.scbdd.com/ .Entities:
Keywords: ADMET; ADMET database; ADMETlab; Cheminformatics; Drug discovery; Drug-likeness
Year: 2018 PMID: 29943074 PMCID: PMC6020094 DOI: 10.1186/s13321-018-0283-x
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1An overview of ADMET properties that can be evaluated by ADMETlab
The statistical results of the datasets for modeling
| Category | Property | Total | Positive | Negative | Train | Test |
|---|---|---|---|---|---|---|
| Basic physicochemical property | LogS | 5220 | – | – | 4116 | 1104 |
| LogD7.4 | 1031 | – | – | 773 | 258 | |
| LogP | ||||||
| Absorption | Caco-2 | 1182 | – | – | 886 | 296 |
| Pgp-inhibitor | 2297 | 1372 | 925 | 1723 | 574 | |
| Pgp-substrate | 1252 | 643 | 609 | 939 | 313 | |
| HIA | 970 | 818 | 152 | 728 | 242 | |
| F (20%) | 1013 | 759 | 254 | 760 | 253 | |
| F (30%) | 1013 | 672 | 341 | 760 | 253 | |
| Distribution | PPB | 1822 | – | – | 1368 | 454 |
| VD | 544 | – | – | 408 | 136 | |
| BBB | 2237 | 540 | 1697 | 1678 | 559 | |
| Metabolism | CYP 1A2-inhibitor | 12,145 | 5713 | 6432 | 9145 | 3000 |
| CYP 1A2-substrate | 396 | 198 | 198 | 297 | 99 | |
| CYP 3A4-inhibitor | 11,893 | 5047 | 6846 | 8893 | 3000 | |
| CYP 3A4-substrate | 1020 | 510 | 510 | 765 | 255 | |
| CYP 2C9-inhibitor | 11,720 | 3960 | 7760 | 8720 | 3000 | |
| CYP 2C9-substrate | 784 | 278 | 506 | 626 | 156 | |
| CYP 2C19-inhibitor | 12,272 | 5670 | 6602 | 9272 | 3000 | |
| CYP 2C19-substrate | 312 | 156 | 156 | 234 | 78 | |
| CYP 2D6-inhibitor | 12,726 | 2342 | 10,384 | 9726 | 3000 | |
| CYP 2D6-substrate | 816 | 352 | 464 | 611 | 205 | |
| Excretion | Clearance | 544 | – | – | 408 | 136 |
| T1/2 | 544 | – | – | 408 | 136 | |
| Toxicity | hERG | 655 | 451 | 204 | 392 | 263 |
| H-HT | 2171 | 1435 | 736 | 1628 | 543 | |
| Ames | 7619 | 4252 | 3367 | 5714 | 1905 | |
| Skin sensitivity | 404 | 274 | 130 | 323 | 81 | |
| Rat oral acute toxicity | 7397 | – | – | 5917 | 1480 | |
| DILI | 475 | 236 | 239 | 380 | 95 | |
| FDAMDD | 803 | 442 | 361 | 643 | 160 |
The molecular descriptors that were used in modeling process
| Descriptor type | Description | Number |
|---|---|---|
| Constitution | Constitutional descriptors | 30 |
| Topology | Topological descriptors | 35 |
| Connectivity | Connectivity indices | 44 |
| E-state | E-state descriptors | 79 |
| Kappa | Kappa shape descriptors | 7 |
| Basak | Basak information indices | 21 |
| Burden | Burden descriptors | 64 |
| Autocorrelation | Morgan autocorrelation | 32 |
| Charge | Charge descriptors | 25 |
| Property | Molecular property | 6 |
| FP2 | A path-based fingerprint which indexes small molecule fragments based on linear segments of up to 7 atoms | 2048 |
| MACCS | MACCS keys | 167 |
| ECFP2 | An ECFP feature represents a circular substructure around a center atom with diameter is 1 | 2048 |
| ECFP4 | An ECFP feature represents a circular substructure around a center atom with diameter is 2 | 2048 |
| ECFP6 | An ECFP feature represents a circular substructure around a center atom with diameter is 3 | 2048 |
The best regression models for some ADMET related properties (Part 1)
| Property | Method | mtry | R2 | Q2 | RT2 | RMSEF | RMSECV | RMSET |
|---|---|---|---|---|---|---|---|---|
| LogS | RF | 10 | 0.980 | 0.860 | 0.979 | 0.095 | 0.698 | 0.712 |
| LogD7.4 | RF | 14 | 0.983 | 0.877 | 0.874 | 0.228 | 0.614 | 0.605 |
| Caco-2 | RF | 14 | 0.973 | 0.845 | 0.824 | 0.121 | 0.289 | 0.290 |
| PPB | RF | 8 | 0.954 | 0.691 | 0.682 | 7.124 | 18.443 | 18.044 |
| VD | RF | 10 | 0.950 | 0.634 | 0.556 | 0.281 | 0.762 | 0.948 |
The best regression models for some ADMET related properties (Part 2)
| Property | Method | Features | mtry | Twofold rate (CV/test) | Threefold rate (CV/test) |
|---|---|---|---|---|---|
| CL | RF | 2D | 10 | 0.760/0.816 | 0.877/0.897 |
| T1/2 | RF | 2D | 12 | 0.762/0.699 | 0.897/0.824 |
| LD50 | RF | 2D | 5 | 0.986/0.987 | 0.998/0.997 |
The best classification models for some ADME/T related properties
| Property | Method | Features | Fivefold cross validation | External validation dataset | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Accuracy | AUC | Sensitivity | Specificity | Accuracy | AUC | |||
| HIA | RF | MACCS | 0.820 | 0.743 | 0.782 | 0.846 | 0.801 | 0.743 | 0.773 | 0.831 |
| F (20%) | RF | MACCS | 0.731 | 0.647 | 0.689 | 0.759 | 0.680 | 0.663 | 0.671 | 0.746 |
| F (30%) | RF | ECFP6 | 0.743 | 0.605 | 0.669 | 0.715 | 0.751 | 0.601 | 0.667 | 0.718 |
| BBB | SVM | ECFP2 | 0.962 | 0.813 | 0.926 | 0.948 | 0.993 | 0.854 | 0.962 | 0.975 |
| Pgp-inhibitor | SVM | ECFP4 | 0.887 | 0.789 | 0.848 | 0.908 | 0.863 | 0.802 | 0.838 | 0.913 |
| Pgp-substrate | SVM | ECFP4 | 0.839 | 0.807 | 0.824 | 0.899 | 0.826 | 0.854 | 0.840 | 0.905 |
| CYP1A2-inhibitor | SVM | ECFP4 | 0.833 | 0.864 | 0.849 | 0.928 | 0.853 | 0.880 | 0.867 | 0.939 |
| CYP1A2-substrate | RF | ECFP4 | 0.768 | 0.636 | 0.702 | 0.801 | 0.768 | 0.637 | 0.702 | 0.802 |
| CYP3A4-inhibitor | SVM | ECFP4 | 0.759 | 0.858 | 0.817 | 0.901 | 0.788 | 0.860 | 0.829 | 0.909 |
| CYP3A4-substrate | RF | ECFP4 | 0.798 | 0.716 | 0.757 | 0.835 | 0.819 | 0.679 | 0.749 | 0.835 |
| CYP2C19-inhibitor | SVM | ECFP2 | 0.826 | 0.819 | 0.822 | 0.893 | 0.812 | 0.825 | 0.819 | 0.899 |
| CYP2C19-substrate | RF | ECFP2 | 0.735 | 0.744 | 0.740 | 0.816 | 0.871 | 0.667 | 0.769 | 0.853 |
| CYP2C9-inhibitor | SVM | ECFP4 | 0.719 | 0.898 | 0.837 | 0.900 | 0.730 | 0.882 | 0.830 | 0.894 |
| CYP2C9-substrate | RF | ECFP4 | 0.746 | 0.709 | 0.728 | 0.819 | 0.746 | 0.709 | 0.734 | 0.824 |
| CYP2D6-inhibitor | RF | ECFP4 | 0.770 | 0.811 | 0.793 | 0.868 | 0.771 | 0.812 | 0.795 | 0.882 |
| CYP2D6-substrate | RF | ECFP4 | 0.765 | 0.73 | 0.748 | 0.823 | 0.792 | 0.73 | 0.76 | 0.833 |
| hERG | RF | 2D | 0.908 | 0.700 | 0.844 | 0.879 | 0.888 | 0.762 | 0.848 | 0.873 |
| H-HT | RF | 2D | 0.780 | 0.520 | 0.689 | 0.710 | 0.785 | 0.487 | 0.681 | 0.683 |
| Ames | RF | MACCS | 0.800 | 0.841 | 0.820 | 0.890 | 0.848 | 0.816 | 0.834 | 0.897 |
| SkinSen | RF | MACCS | 0.685 | 0.727 | 0.706 | 0.760 | 0.715 | 0.727 | 0.731 | 0.774 |
| DILI | RF | MACCS | 0.866 | 0.813 | 0.840 | 0.904 | 0.830 | 0.857 | 0.843 | 0.910 |
| FDAMDD | RF | ECFP4 | 0.848 | 0.812 | 0.832 | 0.904 | 0.853 | 0.782 | 0.821 | 0.892 |
Web tools related with ADMET prediction
| Tools | Availability | Batch computation | Endpoints | Database | Druglikeness rules | Druglikeness model | Systematic evaluation | Suggestions |
|---|---|---|---|---|---|---|---|---|
| ADMETlab | Free | Yes | Number: 31 | Yes | Yes | Yes | Yes | Yes |
| lazar [ | Free | No | Number: 3 | No | No | No | No | No |
| admetSAR [ | Free | No | Number: 27 | Yes | No | No | Yes | No |
| PreADMET [ | Free or commercial | No | Number: 19 | No | Yes | No | No | No |
| FAF-Drugs4 [ | Free | Yes | Mainly filtering compounds by their descriptors and basic properties | No | Yes | No | No | No |
| pkCSM [ | Free | Yes | Number: 30 | No | No | No | Yes | No |
| SwissADME [ | Free | Yes | Number: 19 | No | Yes | No | Yes | No |
| VCCLAB [ | Free | Yes | Number: 14 | No | No | No | No | No |
| Molinspiration [ | Free | No | 5 bioactivities, miLogP and 8 molecular descriptors | No | No | No | No | No |
| vNN-ADMET [ | Registration required | No | Number: 14 | No | No | No | No | No |
*The “B, A, D, M, E, T” refers the contents in the “Documentation” section of our website. A tool that marked “A” means it covers some endpoints of class “A”, not all endpoints of class “A”