| Literature DB >> 33104327 |
Sergey Gusarov1, Stanislav R Stoyanov2.
Abstract
Chemical similarity-based approaches employed to repurpose or develop new treatments for emerging diseases, such as COVID-19, correlates molecular structure-based descriptors of drugs with those of a physiological counterpart or clinical phenotype. We propose novel descriptors based on a COSMO-RS (short for conductor-like screening model for real solvents) σ-profiles for enhanced drug screening enabled by machine learning (ML). The descriptors' performance is hereby illustrated for nucleotide analogue drugs that inhibit the ribonucleic acid-dependent ribonucleic acid polymerase, key to viral transcription and genome replication. The COSMO-RS-based descriptors account for both chemical reactivity and structure, and are more effective for ML-based screening than fingerprints based on molecular structure and simple physical/chemical properties. The descriptors are evaluated using principal component analysis, an unsupervised ML technique. Our results correlate with the active monophosphate forms of the leading drug remdesivir and the prospective drug EIDD-2801 with nucleotides, followed by other promising drugs, and are superior to those from molecular structure-based descriptors and molecular docking. The COSMO-RS-based descriptors could help accelerate drug discovery for the treatment of emerging diseases.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33104327 PMCID: PMC7605243 DOI: 10.1021/acs.jpclett.0c02836
Source DB: PubMed Journal: J Phys Chem Lett ISSN: 1948-7185 Impact factor: 6.475
Figure 1(a) Noncovalent bonding of remdesivir (ball and stick) to mRNA (helix) in RdRp (green ribbon) optimized using QM/MM. Green lines are the hydrogen bonds with U10 (target, two upper bonds) and U20 (pattern, two lower bonds). (b) 2D noncovalent bonding network of remdesivir bonded to mRNA. (c) COSMO-RS σ-profile of remdesivir and its projections W1–W4, with the hydrogen bonds highlighted.
Charge Density (e/Å2) for the Hydrogen Bonds Highlighted in Figure c
| Nrmsd-ALA688 | Nrmsd-T:U10 | Ormsd-P:U20 | Hrmsd-T:U10 | Hrmsd-T:U20 |
|---|---|---|---|---|
| 0.0092 | 0.0130 | 0.0062 | –0.0140 | –0.0159 |
Principal Component (PC) Analysis Results for Descriptor Sets I and II
| parameter | PC1 | PC2 | PC3 | PC4 |
|---|---|---|---|---|
| Descriptor Set I | ||||
| COSMO-RS Solvent accessible surface (Å2) | –0.286 | 0.242 | 0.567 | –0.400 |
| Charge on | –0.265 | 0.453 | –0.138 | –0.154 |
| COSMO-RS solvation energy (kcal/mol) | 0.522 | 0.232 | –0.048 | 0.031 |
| Dipole moment (Debye) | –0.393 | –0.281 | –0.111 | –0.544 |
| COSMO-RS σ-profile projection W1 (Å2) | –0.488 | 0.212 | –0.200 | 0.253 |
| COSMO-RS σ-profile projection W2 (Å2) | –0.035 | –0.396 | 0.694 | 0.256 |
| COSMO-RS σ-profile projection W3 (Å2) | –0.075 | 0.595 | 0.323 | 0.294 |
| COSMO-RS σ-profile projection W4 (Å2) | –0.420 | –0.220 | –0.137 | 0.550 |
| Latency | 2.887 | 1.751 | 1.190 | 0.902 |
| Descriptor Set II | ||||
| AlogP | –0.361 | 0.232 | 0.333 | –0.010 |
| Molecular weight (g/mol) | 0.272 | 0.233 | 0.526 | 0.242 |
| Number of hydrogen bond donors | 0.486 | 0.021 | –0.008 | 0.359 |
| Number of hydrogen bond acceptors | 0.446 | 0.113 | 0.070 | –0.164 |
| Number of rotatable bonds | –0.199 | 0.238 | –0.382 | 0.803 |
| Number of rings | 0.095 | 0.459 | 0.381 | 0.024 |
| Number of aromatic rings | –0.048 | 0.540 | –0.295 | –0.246 |
| Molecular fractional polar surface area (Å2) | 0.508 | –0.102 | –0.369 | –0.127 |
| Molecular surface area (Å2) | –0.048 | 0.540 | –0.295 | –0.246 |
| Molecular polar surface area (Å2) | 0.470 | 0.135 | –0.080 | 0.055 |
| Latency | 0.430 | 0.280 | 0.133 | 0.062 |
Figure 2(a) Principal components (PC) 1–3 for descriptor set I (Table ) with labels of the COSMO-RS-derived projections W1–W4 of the σ-profile, COSMO-RS solvent accessible surface (SAS), ECOSMO-RS, P atom induced charge (QP), and dipole moment (DM). (b) PC 1–3 for descriptor set II (Table ). (green squares, remdesivir; green diamonds, EIDD-2801; green circles, AMP; yellow squares, GMP; yello triangles, IDX-184; yello circles, mizobirine; blue squares, ribavirin, penciclovir, acyclovir; red circles, remaining drugs and nucleotides.)
Figure 3(a) Docking of the drugs and nucleotides on the mRNA of RdRp, showing the localization of the drugs of Group 1. (b) LibDock score of the 22 pyrimidine (Pyr) and purine nucleotides and drug analogues.