| Literature DB >> 32030204 |
Benquan Liu1, Huiqin He1, Hongyi Luo1, Tingting Zhang1, Jingwei Jiang1.
Abstract
Different kinds of biological databases publicly available nowadays provide us a goldmine of multidiscipline big data. The Cancer Genome Atlas is a cancer database including detailed information of many patients with cancer. DrugBank is a database including detailed information of approved, investigational and withdrawn drugs, as well as other nutraceutical and metabolite structures. PubChem is a chemical compound database including all commercially available compounds as well as other synthesisable compounds. Protein Data Bank is a crystal structure database including X-ray, cryo-EM and nuclear magnetic resonance protein three-dimensional structures as well as their ligands. On the other hand, artificial intelligence (AI) is playing an important role in the drug discovery progress. The integration of such big data and AI is making a great difference in the discovery of novel targeted drug. In this review, we focus on the currently available advanced methods for the discovery of highly effective lead compounds with great absorption, distribution, metabolism, excretion and toxicity properties. © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.Entities:
Keywords: artificial intelligence; big data; targeted drug
Year: 2019 PMID: 32030204 PMCID: PMC6979871 DOI: 10.1136/svn-2019-000290
Source DB: PubMed Journal: Stroke Vasc Neurol ISSN: 2059-8696
Figure 1Schematic procedure of artificial intelligence (AI)-assisted virtual screening. Millions of structurally diverse chemical compounds are docked to a specific therapeutic target. AI scoring function is used to select the best hits from millions of docked results.
Figure 2Schematic procedure of artificial intelligence (AI)-assisted reverse docking. More than 100 000 structurally diverse protein structures are reversely docked to a specific chemical compound/natural product. AI scoring function is used to select the best hits from millions of docked results.
Figure 3AI-assisted ADMET properties prediction. (A) Deep learning algorithm to calculate logBB for a specific chemical compound. (B) Deep learning algorithm to calculate logPapp for a specific chemical compound. (C) PCA(Principal Component Analysis) analysis on 48 186 reverse-docked proteins for 55 FDA-approved drugs (yellow dots) and 224 FDA-withdrawn drugs (blue dots). (D) PLS-DA(Partial Least Squares Discriminant Analysis) analysis on 48 186 reverse-docked proteins for 55 FDA-approved drugs (blue dots) and 224 FDA-withdrawn drugs (yellow dots). ADMET, absorption, distribution, metabolism, excretion and toxicity; AI, artificial intelligence; FDA, Food and Drug Administration;TPSA,total polar surface area.