| Literature DB >> 36176461 |
Asma Sellami1, Manon Réau1, Matthieu Montes1, Nathalie Lagarde1.
Abstract
Being in the center of both therapeutic and toxicological concerns, NRs are widely studied for drug discovery application but also to unravel the potential toxicity of environmental compounds such as pesticides, cosmetics or additives. High throughput screening campaigns (HTS) are largely used to detect compounds able to interact with this protein family for both therapeutic and toxicological purposes. These methods lead to a large amount of data requiring the use of computational approaches for a robust and correct analysis and interpretation. The output data can be used to build predictive models to forecast the behavior of new chemicals based on their in vitro activities. This atrticle is a review of the studies published in the last decade and dedicated to NR ligands in silico prediction for both therapeutic and toxicological purposes. Over 100 articles concerning 14 NR subfamilies were carefully read and analyzed in order to retrieve the most commonly used computational methods to develop predictive models, to retrieve the databases deployed in the model building process and to pinpoint some of the limitations they faced.Entities:
Keywords: QSAR; docking; endocrine disrupting chemicals; in silico; nuclear receptors; pharmacophore model
Mesh:
Substances:
Year: 2022 PMID: 36176461 PMCID: PMC9513233 DOI: 10.3389/fendo.2022.986016
Source DB: PubMed Journal: Front Endocrinol (Lausanne) ISSN: 1664-2392 Impact factor: 6.055
Review of the different initiatives dedicated to Steroid Hormones nuclear receptors.
|
|
|
|
|
|
|
|
|
|
|
| COMPARA | both | 1,746 compounds from ToxCast/Tox21 | Medium | retrospective | Toxicological | 2020 | ( |
|
| QSAR : Machine learning methods (kNN, lazy IB1, and ADTree methods) | LB | In house data (292 compounds) and collection from literature (231 compounds) | Medium | prospective | Toxicological | 2010 | ( |
|
| Docking and 3D QSAR (CoMSIA) | both | Collected from the literature (76 compounds) | Medium | retrospective | Toxicological | 2013 | ( |
|
| Docking, MD, and 3D QSAR (Comsia) | both | In house database of flavonoids (21 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| Docking | SB | Collected from the literature (20 bisphenols compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| Docking | SB | EPA (1689 compounds) | Medium | prospective | Toxicological | 2017 | ( |
|
| Docking + molecular dynamics | SB | NR-List BDB (3233 compounds) | Medium | retrospective | Toxicological | 2018 | ( |
|
| QSAR : Machine learning methods (Bernoulli Naive Bayes, RF, NNN) | LB | COMPARA calibration set (1689 compounds from EPA) and external validation set (3882 compounds from EPA) | High | retrospective | Toxicological | 2019 | ( |
|
| QSAR : machine learning methods (ANNs,SVM,DT) | LB | CoMPARA dataset (1689 compounds from EPA), EDKB (202 compounds) | Medium | retrospective | Toxicological | 2019 | ( |
|
| Docking and LB Pharmacophore | both | NR-DBIND (812 compounds), Tox21 (5690 compounds) | High | retrospective | Both | 2019 | ( |
|
| QSAR : Machine learning (Bayesian models, RF, kNN,SVM, naïve Bayesian, AdaBoosted DT) and DL | LB | Toxcast (8645 compounds) | High | retrospective | Toxicological | 2020 | ( |
|
| QSAR: KNN + Local method ( lazy learning) + RF | LB | METI (900 compounds), EDKB (87 compounds) | Medium | prospective | Toxicological | 2010 | ( |
|
| QSAR: ML (kNN, DT, NB SVM) | LB | Collected from the literature (1157 compounds in the training set and 121 compounds in the external validation set) | High | retrospective | Toxicological | 2014 | ( |
|
| QSAR: ANN | LB | Collected from the literature (879 compounds for ER, 930 compounds for AR) | High | Retrospective | Toxicological | 2015 | ( |
|
| 3D QSAR and bayesian statistics | LB | Toxcast (1853 compounds) + 42 compounds | Medium | retrospective | Toxicological | 2016 | ( |
|
| Hierarchical charactarestic fragments, docking and MD simulations | both | ToxCast/Tox21 and ChEMBL (2458 compounds for ER, 2843 compounds for AR) | Medium | prospective | Toxicological | 2020 | ( |
|
| Similarity | LB | Toxcast (7027 compounds for AR, 7329 compounds for GR) | High | retrospective | Toxicological | 2020 | ( |
|
| CERAPP | both | Collected from the literature (1677 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| QSAR: ANN | LB | Collected from the literature (174 compounds) | Medium | Both | Toxicological | 2010 | ( |
|
| QSAR (single task an multi task learning KNN) and docking | both | Collected from the literature including EDKB and ChEMBL (QSAR data sets: 546 compounds for ERa, 137 compounds for ERb; docking data sets: 106 binders/ 4018 decoys for ERa, 80 binders/ 2000 for ER b) | Medium | Both | Toxicological | 2013 | ( |
|
| QSAR:machine learning methods (LDA / CART/SVM) | LB | Toxcast (1814 compounds) and Tox21 (8303 compounds) | Low | retrospective | Toxicological | 2013 | ( |
|
| Docking | SB | EPA (1677 compounds) | Medium | retrospective | Toxicological | 2015 | ( |
|
| Docking and QSAR :machine learning methods (LDA, decision tree, SVM) | both | Collected from the literature (440 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| QSAR: Machine learning (Bernoulli Naive Bayes, AdaBoost Decision Tree, RF, SVM) and deep learning (DNN) methods | LB | Collected from the literature (1677 compounds from the CERAPP data set, 7351 compounds from Tox21, 3474 compounds for ERa, 2775 compounds for ERb) | High | retrospective | Toxicological | 2018 | ( |
|
| QSAR: Machine learning method (GkNN) | LB | ToxCast and CERAPP databases (1677 compounds) | Low | retrospective | Toxicological | 2018 | ( |
|
| QSAR: Machine learning method (Bayesian models) | LB | "Toxcast2019" and two publications | Medium | Both | Toxicological | 2020 | ( |
|
| QSAR: Machine-learning (BNB, kNN, RF, and SVM) and deep learning (DNN) methods | LB | ToxCast and Tox21 (7576 compounds) | Medium | retrospective | Toxicological | 2020 | ( |
|
| 3D QSAR + 2D QSAR : machine learning methods (PLS, SVR, LR) | LB | In house (68 raloxifene's derivatives) | Low | prospective | Therapeutic | 2013 | ( |
|
| Docking | SB | Ligands extracted from cristallographic complexes (66 compounds) and DUD-E's set (106 binders, 4018 decoys) | Medium | retrospective | Therapeutic | 2014 | ( |
|
| Docking and aggregated potential field similarity | both | NCTRER binding database, ChEMBL, DUD (1691 active and 4785 inactive/decoy compounds) and Tox21 for prospective screening | Medium | prospective | Toxicological | 2014 | ( |
|
| Docking | SB | Drug-Bank Database and collection from literature (105 compounds) | Medium | prospective | Therapeutic | 2019 | ( |
|
| QSAR : Machine learning (RF) | LB | EABD (3308 compounds) and Toxcast (1641 compounds) | Medium | retrospective | Toxicological | 2015 | ( |
|
| QSAR: ANN | LB | Collected from the literature (170 compounds) | Low | retrospective | Toxicological | 2011 | ( |
|
| LB Pharmacophore modeling and QSAR (MLR) | LB | Collected from the literature (119 compounds) and NCI list of compounds for prospective screening | Medium | prospective | Therapeutic | 2010 | ( |
|
| LB pharmacophore modeling and docking | both | Maybridge and Enamine | Low | prospective | Therapeutic | 2014 | ( |
|
| docking and MD simulations | SB | 18 ligands from crystal structures, 40 compounds collected from the literature and 2570 DUD decoys, 400000 compounds from commercial databases for prospective screening | Medium | prospective | Therapeutic | 2014 | ( |
|
| QSAR: Machine learning methods (Naïve bayes, KNN, RF, SVM) | LB | CHEMBL20 (356 active compounds and 107 inactive compounds) + 249 DUD-E decoys | Medium | retrospective | Therapeutic | 2016 | ( |
|
| QSAR: Machine learning (RF) | LB | EADB (2492 compounds) and ToxCast (1805 compounds) | Medium | retrospective | Both | 2017 | ( |
|
| Docking, MD, Binding energy calculation | SB | Collected from the literature (12 compounds); ZINC db (20000 compounds) for prospective screening | High | prospective | Therapeutic | 2018 | ( |
Review of the different initiatives dedicated to projects targeting several NR.
| Receptor | methods | Approach | Database | Reproducibility | Prospective or Retrospective | Application | Year | Ref |
|---|---|---|---|---|---|---|---|---|
|
| QSAR : Deep learning method (molecular image-based method) | LB | Tox21 Data Challenge 2014 (~7000 compounds / NRs) | Low | prospective | Toxicological | 2020 | ( |
|
| QSAR: Machine learning (RF) and Deep Learning (Deep Neural Network) methods | LB | Tox21 (10255 compounds curated from the original 12707 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| QSAR :Deep learning method (DNN) | LB | Tox21 (8694 compounds curated from the original 12707 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| QSAR: Machine learning (RF and SVM)/Molecular similarity/ SB Pharmacophore modeling | LB | Tox21 (~7000 compounds / NRs) | High | retrospective | Toxicological | 2018 | ( |
|
| Docking | SB | DUDE-E, ChEMBL | High | retrospective | Toxicological | 2014 | ( |
|
| QSAR : Machine learning methods (SVM, RF) | LB | Tox21 (7248 compounds) | High | retrospective | Toxicological | 2019 | ( |
|
| Virtual Tox Lab software (docking and mQSAR) | both | Collected from the literature (1016 compounds) | Medium | prospective | Toxicological | 2012 and 2014 | ( |
|
| Docking | SB | Collected from the literature (157 compounds) | High | retrospective | Therapeutic | 2010 | ( |
Figure 1Number of publications related to each studied hNR subfamily described in the review.
Figure 2Distribution of the computational approach (SB: structure-based, LB: ligand-based and both: combination of SB and LB methods) in the reviewed publications for the different hNR.
Review of the different initiatives dedicated to RXR and its partners NR.
| Receptor | methods | Approach | Database | Reproducibility | Prospective or Retrospective | Application | Year | Ref |
|---|---|---|---|---|---|---|---|---|
|
| SB pharmacophores | SB | ChEMBL (221 compounds); NCI database (247041 compounds) for prospective screening | Medium | prospective | Therapeutic | 2011 | ( |
|
| SB pharmacophores | SB | in-house Chinese Herbal Medicine database (10216 compounds) for prospective screening | Low | prospective | Therapeutic | 2011 | ( |
|
| LB Pharmacophore and free energy calculations | LB | ChemBridge (~520000 compounds) for prospective screening | Low | prospective | Therapeutic | 2015 | ( |
|
| QSAR: Machine learning methods (SVM, C4.5 DT, k-NN, RF, NV), MoSS and SARpy | LB | Tox21 (688 compounds), ChEMBL (460 compounds), D3R CG2 (76 compounds) | Low | retrospective | Toxicological | 2018 | ( |
|
| QSAR: Machine Learning (counter-propagation artificial neural network, kNN) | LB | ChEMBL (896 compounds), Asinex (3383942 compounds) for prospective screening | Medium | prospective | Therapeutic | 2018 | ( |
|
| SB pharmacophore and shape similarity | both | Collected from the literature 41 compounds + 67059 decoys from Derwent World Drug Index); NCI database (250761 compounds) for prospective screening | Medium | prospective | Therapeutic | 2012 | ( |
|
| self-organizing maps (SOM) | LB | ChEMBL (458 compounds); DrugBank (1280 compounds) for prospective screening | Low | prospective | Therapeutic | 2017 | ( |
|
| 2D fragment-based HQSAR and HQSSR (structure selectivity) and Docking | both | Collected from the literature (62 quinolines and cinnolines) | Medium | prospective | Therapeutic | 2012 | ( |
|
| Docking and MD | SB | ChEMBL database + DecoyFinder (769 compounds for LXRa, 570 compounds for LXRb); MolMall subset of the ZINC (~20000 compounds) for prospective screening | High | prospective | Therapeutic | 2018 | ( |
|
| QSAR (MLR) and Docking | both | Collected from the literature (53 compounds with dual activity LRα/β) | Medium | prospective | Therapeutic | 2018 | ( |
|
| 2D-, 3D-QSAR and docking | both | In-house library (22 compounds) | Medium | prospective | Therapeutic | 2013 | ( |
|
| QSAR, SB pharmacophore modelling and docking | both | In-house library (46 phenylpropanoic acid derivatives) | Medium | prospective | Therapeutic | 2016 | ( |
|
| docking and MD | SB | Asinex (292,724 compounds) for prospective screening | Medium | prospective | Therapeutic | 2018 | ( |
|
| docking, binding energy calculations, MD | SB | ChemDiv database (7476 compounds) for prospective screening | Low | prospective | Therapeutic | 2019 | ( |
|
| Docking and MD | SB | Ligand Expo components database | Medium | prospective | Therapeutic | 2020 | ( |
|
| LB Pharmacophores and 3D QSAR | LB | Collected from the library (88 compounds) | Medium | retrospective | Therapeutic | 2010 | ( |
|
| QSAR :Machine learning methods (MLR, SVM and Bayes Network Toolbox (BNT)), docking and MD | both | Traditional Chinease Medicine (TCM) database (9,029 compounds) | High | prospective | Therapeutic | 2014 | ( |
|
| Docking and MD | SB | Compounds collected from the literature (51 compounds + 3600 DUD decoys); "clean-leads" ZINC's subset for prospective screening (740000 compounds) | Low | prospective | Therapeutic | 2015 | ( |
|
| SB and LB pharmacophore-, shape similarity and docking | both | Collected from the literature (51 partial agonists, 14 agonists + 812 inactives from ToxCast and literature); Maybridge database (52000 compounds) for prospective screening | Low | prospective | Therapeutic | 2016 | ( |
|
| docking and MD | SB | Zbc subset of ZINC database (180313 compounds) | Low | prospective | Therapeutic | 2018 | ( |
|
| Docking, binding energy calculations and MD simulations | SB | Seaweed Metabolite Database (1110 compounds) | Medium | prospective | Therapeutic | 2021 | ( |
|
| docking,SB and LB pharmacophore, QSAR (SVM) | both | Collected from the literature (392 compounds) | Low | retrospective | Therapeutic | 2017 | ( |
|
| Docking | SB | Collected from the literature (106 compounds) | Medium | retrospective | Toxicological | 2017 | ( |
|
| Docking and QSAR (Bayesian classification) | both | Toxcast (308 compounds) | medium | prospective | Toxicological | 2010 | ( |
|
| QSAR : C5.0 | LB | In-house collection (202 compounds) and collection from the literature (434 compounds) | High | retrospective |
| 2012 | ( |
|
| QSAR: partial logistic regression(PLR) | LB | Collected from the literature (631 compounds) | medium | retrospective | Both* (PXR activation is an unwanted side effects of drugs) | 2012 | ( |
|
| QSAR, similarity | LB | Prestwick Chemical Library (1120 compounds) | Low | prospective | Both* (PXR activation is an unwanted side effects of drugs) | 2015 | ( |
|
| SB Pharmacophore and docking | SB | Binding DB (266 compounds); PubChem (820 herbs compounds) for prospective screening | Medium | prospective | Both* (PXR activation is an unwanted side effects of drugs) | 2015 | ( |
|
| SB Pharmacophore | SB | Collected from the literature (18 compounds), Mitsubishi Tanabe Pharma Corporation (68 compounds), NPC (2816 compounds) | Low | retrospective | Both* (PXR activation is an unwanted side effects of drugs) | 2017 | ( |
|
| Docking and MD simulations | both | Collected from the literature (16 HO-PBDEs compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| QSAR (C4.5 ,SVM and Random Forest) | LB | Collected from the literature (258 compounds) | Medium | retrospective | Toxicological | 2019 | ( |
|
| Docking and MD | SB | DUD-E (7556 compounds), in-house indoor dust contaminant inventory (485 compounds) | Medium | retrospective | Toxicological | 2016 | ( |
|
| 3D QSAR, Docking and MD | both | Collected from the literature (33 compounds) | Medium | retrospective | Therapeutic | 2015 | ( |
|
| Docking and QSAR (PLS) | both | Collected from the literature (18 HO-PBDEs compounds) | Medium | prospective | Toxicological | 2010 | ( |
|
| de novo design, docking, MD, free energy calculation | both | Fragments extracted from 6 VDR agonists collected from the literature | Low | prospective | Therapeutic | 2012 | ( |
|
| Docking, LB Pharmacophores, 3D QSAR, MD | both | ChEMBL (478 compounds) | Medium | retrospective | Therapeutic | 2020 | ( |
|
| LB pharmacophore, molecular docking, binding free energy calculation, Density Functional Theory (DFT) study and MD | both | Binding database (31 compounds); for prospective screening: Life chemicals, Enamine, MayBridge, and TCM | Low | prospective | Therapeutic | 2020 | ( |
Review of the different initiatives dedicated to monomeric orphan receptors.
| Receptor | methods | Approach | Database | Reproducibility | Prospective or Retrospective | Application | Year | Ref |
|---|---|---|---|---|---|---|---|---|
|
| Combination of QSAR models | LB | Tox21 (5077 compounds for ERR agonism, 6526 compounds for ERR inhibition); HMDB (3092 compounds) and EU pesticides dataset (888 compounds) for prospective screening | high | prospective | Toxicological | 2019 | ( |
|
| molecular similarity and docking | both | KEGG COMPOUND database (10739 compounds) | high | prospective | Therapeutic | 2013 | ( |
|
| Docking | SB | ZINC database (5.2 million compounds) for prospective screening | Low | prospective | Therapeutic | 2013 | ( |
|
| docking and similarity | both | ChEMBL (502 compounds); Specs commercial database (116495 compounds) for prospective screening | Medium | prospective | Therapeutic | 2018 | ( |
|
| SB pharmacophore and Docking | SB | Asinex Gold–Platinum (289174 compounds) | Medium | prospective | Therapeutic | 2020 | ( |
Figure 3Structure-based (SB) and Ligand-based (LB) screening methods (155).
Example of databases including or dedicated to nuclear receptors.
| Database | Link | Composition | Specific to NR only |
|---|---|---|---|
| Binding DB |
| As of November 8, 2021, BindingDB contains | No |
| ChEMBL |
| manually curated database: 2.1 M compounds | No |
| Drugbank |
| 14,585 drugs and several targets like enzyme, transporters and carriers | No |
| ZINC database |
| contains over 230 million purchasable compounds in ready-to-dock, 3D formats. | No |
| Tox21 |
| The list of ToxCast and Tox21 chemicals suspected to be a hazard for human and environmental health and associated | No |
| ToxCast |
| No | |
| DUD-E |
| 22,886 active compounds and their affinities against 102 targets, an average of 224 ligands per target and | No |
| DSSTox |
| launched in 2004, currently exceeds 875K substances spanning hundreds of lists of interest. | No |
| EDKB |
| Data for more than 3200 chemicals | Yes (ER and AR) |
| EABD |
| 18,114 estrogenic-activity data points collected for 8,212 chemicals tested in 1,284 binding assays, reporter-gene assays, cell-proliferation assays, and in-vivo assays in 11 different species. | Yes (ER) |
| NR-DBIND |
| 15,116 positive and negative interactions data are provided for 28 NRs together with 593 PDB structures | Yes |
| NR-List BDB |
| 9,905 compounds and 339 structures of the NRLiSt BDB | Yes |
| ONRLDB |
| ∼11 000 ligands, of which ∼6500 are unique. | Yes |
| NURA |
| bioactivity data for 15,247 molecules and 11 NRs | Yes (ERα andβ, PPARGαγ, and δ, AR, GR,PR, FXR,RXR and PXR) |