| Literature DB >> 30044400 |
Larisa Ivanova1, Mati Karelson2, Dimitar A Dobchev3.
Abstract
The aim of this study was to identify new potentially active compounds for three protein targets, tropomyosin receptor kinase A (TrkA), N-methyl-d-aspartate (NMDA) receptor, and leucine-rich repeat kinase 2 (LRRK2), that are related to various neurodegenerative diseases such as Alzheimer's, Parkinson's, and neuropathic pain. We used a combination of machine learning methods including artificial neural networks and advanced multilinear techniques to develop quantitative structure⁻activity relationship (QSAR) models for all target proteins. The models were applied to screen more than 13,000 natural compounds from a public database to identify active molecules. The best candidate compounds were further confirmed by docking analysis and molecular dynamics simulations using the crystal structures of the proteins. Several compounds with novel scaffolds were predicted that could be used as the basis for development of novel drug inhibitors related to each target.Entities:
Keywords: CADD; LRRK2; NMDA; TrkA; artificial neural networks; molecular docking; molecular dynamics; natural compounds
Mesh:
Substances:
Year: 2018 PMID: 30044400 PMCID: PMC6222649 DOI: 10.3390/molecules23081847
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Statistical and topological parameters of the artificial neural network (ANN) models. LRRK2, leucine-rich repeat kinase 2; NMDA, N-methyl-d-aspartate; TrkA, tropomyosin receptor kinase A; RMS, root mean squared error.
| Target | Alpha | Eta | Ntr | Nval | Epoch | R2tr | R2val | RMStr | RMSval | Architecture | Input Descriptors |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 0.01 | 0.02 | 81 | 20 | 3525 | 0.788 | 0.565 | 0.195 | 0.381 | 5-5-3-1 | Highest coulombic interaction (AM1); HPSA polar (AM1) part of SASA; max nucleophilic reactivity index (AM1) for C atoms; avg. electrophilic reactivity index (AM1) for C atoms; max electrophilic reactivity index (AM1) for N atoms |
|
| 0.01 | 0.02 | 107 | 26 | 1651 | 0.752 | 0.519 | 0.273 | 0.444 | 5-5-5-1 | Kier and Hall index (order 2); charged (Zefirov) surface area of O atoms; min nucleophilic reactivity index (AM1) for O atoms; highest e-e repulsion (AM1) for C–O bonds; square root of charged (Zefirov) surface area of N atoms |
|
| 0.01 | 0.03 | 121 | 30 | 1641 | 0.781 | 0.798 | 0.216 | 0.230 | 5-5-4-1 | Lowest total interaction (AM1) for N–H bonds; relative number of aromatic bonds; HASA-2/TMSA (AM1); square root of partial charged (AM1) surface area of C atoms; lowest n-n repulsion (AM1) for C–N bonds |
Figure 1Predicted vs. experimental logIC50 for training and validation sets: (A) ANN model for LRRK2; (B) ANN model for NMDA; (C) ANN model for TrkA. Trend lines used for training data.
Best multilinear regression (BMLR) models for logIC50 and their statistical parameters.
|
|
| ||||
|
|
|
|
|
| |
| 0 | −11.779 | 2.126 | −5.541 | Intercept | |
| 1 | −0.276 | 0.028 | −9.947 | Highest coulombic interaction (AM1) | |
| 2 | 8.364 | 1.087 | 7.697 | Max bonding contribution of one MO (AM1) | |
| 3 | 1.955 | 0.954 | 2.048 | RNCG relative negative charge (QMNEG/QTMINUS) (Zefirov) | |
| 4 | 2883.443 | 389.061 | 7.411 | Max electrophilic reactivity index (AM1) for H atoms | |
| 5 | −35.897 | 8.661 | −4.145 | Max nucleophilic reactivity index (AM1) for C atoms | |
|
| |||||
| (AB,C): R2ab = 0.673 | R2ab_cv = 0.607 | R2c_pred = 0.729 | |||
| (BC,A): R2bc = 0.722 | R2bc_cv = 0.662 | R2a_pred = 0.756 | |||
| (CA,B): R2ca = 0.677 | R2ca_cv = 0.567 | R2b_pred = 0.689 | |||
|
|
| ||||
| 0 | 52.126 | 11.055 | 4.715 | Intercept | |
| 1 | −333.432 | 34.324 | −9.714 | HACA-2/TMSA (Zefirov) | |
| 2 | −0.760 | 0.029 | −26.281 | Kier and Hall index (order 2) | |
| 3 | −10.315 | 0.785 | −13.141 | Average bonding information content (order 0) | |
| 4 | −0.084 | 0.008 | −10.313 | Lowest e-n attraction (AM1) for C–N bonds | |
| 5 | −17.241 | 3.022 | −5.706 | Average valency (AM1) for C atoms | |
|
| |||||
| (AB,C): R2ab = 0.905 | R2ab_cv = 0.883 | R2c_pred = 0.894 | |||
| (BC,A): R2bc = 0.920 | R2bc_cv = 0.900 | R2a_pred = 0.884 | |||
| (CA,B): R2ca = 0.888 | R2ca_cv = 0.860 | R2b_pred = 0.926 | |||
|
|
| ||||
| 0 | 31.850 | 2.351 | 13.545 | Intercept | |
| 1 | −0.267 | 0.017 | −15.942 | Highest coulombic interaction (AM1) for N–H bonds | |
| 2 | −0.586 | 0.041 | −14.412 | Number of F atoms | |
| 3 | 0.123 | 0.010 | 12.637 | Highest e-n attraction (AM1) for C–C bonds | |
| 4 | 0.214 | 0.021 | 10.270 | Charged (Zefirov) surface area of N atoms | |
| 5 | −0.066 | 0.017 | −3.915 | Total point-charge component of molecular dipole (AM1) | |
|
| |||||
| (AB,C): R2ab = 0.859 | R2ab_cv = 0.842 | R2c_pred = 0.880 | |||
| (BC,A): R2bc = 0.854 | R2bc_cv = 0.837 | R2a_pred = 0.861 | |||
| (CA,B): R2ac = 0.861 | R2ac_cv = 0.841 | R2b_pred = 0.843 | |||
Figure 2Linear fit between experimental and observed logIC50 of the BMLR models: (A) LRRK2 set; (B) NMDA set; (C) TrkA set. Trend lines indicated by straight lines. BMLR equations in Table 2 are also incorporated in the boxes.
Binding energies (kcal/mol) and binding modes of small-molecule ligands to receptors (LRRK2, NMDA, and TrkA).
| No. | Structure | ZINC Code | Binding Energy ΔG, kcal/mol | Ligand Efficiency | Binding Mode, Including H-Bonds (Residue of Amino Acid Group or Atom in a Compound) |
|---|---|---|---|---|---|
|
| |||||
| GNE-5729 |
| ==== | −11.3 | −0.42 | Ile116, Val128, Pro129, Phe130, Pro141, Lys143, Tyr144, Thr242, Leu262, Leu263, Val266, Leu270 |
|
|
| ZINC98363705 | −9.8 | −0.39 | Pro129, Phe130, Val131, Glu132, Pro141, Tyr144, Leu263, Val266, Met271, Glu275 |
| 2N |
| ZINC67658347 | −9.7 | −0.40 | Ile128, Pro129, Val131, Glu132, Pro141, Tyr144, Thr242, Gly243, Gly250, Val266, His273 (N...HN) |
|
|
| ZINC98364250 | −9.0 | −0.39 | Ile128, Pro129, Glu132, Pro141, Tyr144, Thr242, Gly250 (NH...N), Leu263, Val266 |
|
| |||||
| PF-06447475 |
| ==== | −9.0 | −0.39 | Ile30, Glu31, Ala51, Lys53, Glu100 (O...HN), Leu102 (NH...N), Leu151, Asp162 |
|
|
| ZINC2115150 | −9.0 | −0.36 | Ile30, Val38, Lys53, Glu70, Tyr101, Leu102, Asp162 |
|
|
| ZINC12901845 | −8.5 | −0.37 | Leu102 (NH...O), Ser106 (HO....H2N), Gly105, Asp109, Leu151 |
|
|
| ZINC518729 | −8.7 | −0.38 | Ile30, Val38, Tyr101, Leu102 (NH...OH), Gly105, Leu151 |
| TrkA | |||||
|
|
| ZINC35077985 | −8.7 | −0.32 | Leu516, Val524, Ala542, Phe589, Met592, Gly595, Asp596, Leu657 (O....HN) |
|
|
| ZINC85880015 | −9.80 | −0.38 | Leu516, Gly519, Lys544, Phe589, Met592, Gly595, Leu657, Asp668 |
|
|
| ZINC1323099 | −8.6 | −0.54 | Val524, Ala542, Phe589, Met592, Leu657 |
|
|
| ZINC489632 | −9.5 | −0.38 | |
Figure 3Calculated binding modes of ligands in the active site of NMDA (PDB ID: 5TP9): (A) compound GNE-5729, (B) compound 1N, (C) compound 2N, (D) compound 3N. The amino acid residues of NMDA are colored gray (carbon), blue (nitrogen), red (oxygen), and white (hydrogen). Hydrogen bonds formed between compounds and residues of NMDA are represented by green dashed lines.
Figure 42D summary of molecular dynamics calculated contacts between NMDA and compounds (A) GNE-5729, (B) 1N, (C) 2N, and (D) 3N.
Figure 5Calculated binding modes of (A) compound PF-06447475, (B) compound 1L, (C) compound 2L, and (D) compound 3L in the active site of LRRK2 (PDB ID: 4U8Z). The amino acid residues of LRRK2 are colored gray (carbon), blue (nitrogen), red (oxygen), and white (hydrogen). Hydrogen bonds formed between compound and residues of LRRK2 are represented by green dashed lines.
Figure 62D summary of the molecular dynamics calculated contacts between compounds (A) PF-06447475, (B) 1L, (C) 2L, (D) 3L and LRRK2.
Figure 7Calculated binding modes of (A) compound AZ-23, (B) compound 1T, (C) compound 2T, and (D) compound 3T in the active site of TrkA (PDB ID: 4AOJ). The amino acid residues of TrkA are colored gray (carbon), blue (nitrogen), red (oxygen), and white (hydrogen). Hydrogen bonds formed between compound and residues of TrkA are represented by green dashed lines.
Figure 82D summary of molecular dynamics calculated contacts between compounds (A) AZ-23, (B) 1T, (C) 2T, and (D) 3T and TrkA.