| Literature DB >> 33101604 |
Nur Syatila Ab Ghani1, Reeki Emrizal2, Haslina Makmur2, Mohd Firdaus-Raih1,2.
Abstract
Structures of protein-drug-complexes provide an atomic level profile of drug-target interactions. In this work, the three-dimensional arrangements of amino acid side chains in known drug binding sites (substructures) were used to search for similarly arranged sites in SARS-CoV-2 protein structures in the Protein Data Bank for the potential repositioning of approved compounds. We were able to identify 22 target sites for the repositioning of 16 approved drug compounds as potential therapeutics for COVID-19. Using the same approach, we were also able to investigate the potentially promiscuous binding of the 16 compounds to off-target sites that could be implicated in toxicity and side effects that had not been provided by any previous studies. The investigations of binding properties in disease-related proteins derived from the comparison of amino acid substructure arrangements allows for effective mechanism driven decision making to rank and select only the compounds with the highest potential for success and safety to be prioritized for clinical trials or treatments. The intention of this work is not to explicitly identify candidate compounds but to present how an integrated drug repositioning and potential toxicity pipeline using side chain similarity searching algorithms are of great utility in epidemic scenarios involving novel pathogens. In the case of the COVID-19 pandemic caused by the SARS-CoV-2 virus, we demonstrate that the pipeline can identify candidate compounds quickly and sustainably in combination with associated risk factors derived from the analysis of potential off-target site binding by the compounds to be repurposed.Entities:
Keywords: COVID-19; Drug repositioning; Off-target sites; SARS-CoV-2; Side chain similarity
Year: 2020 PMID: 33101604 PMCID: PMC7575501 DOI: 10.1016/j.csbj.2020.10.013
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Sub-structural similarities of known drug binding sites in SARS-CoV-2 protein structures.
| Alternate target in SARS-CoV-2 | PDBID | Drug ID | Known target | % seq. identity | Docking Score |
|---|---|---|---|---|---|
| ADP ribose phosphatase | 6w02B | CLQ * | 4fglB (Quinone reductase 2) | 21.54 | −7.5 |
| LOC | 3ut5B (Tubulin beta chain) | 18.02 | −7.6 | ||
| 017 * | 6dh3A (HIV protease) | 17.92 | −9.5 | ||
| AB1 | 2qhcA (HIV protease retropepsin) | 18.50 | −17.6 | ||
| RIT | 1rl8A (HIV protease retropepsin) | 17.92 | −12.0 | ||
| 6w6yB | NPS | 3nt1 (Prostaglandin-endoperoxide synthase 2) | 14.09 | −6.7 | |
| Spike protein | 6vybC | 017 * | 3lzvA (HIV protease) | 6.21 | −6.6 |
| 6w41C/H | RIT * | 1rl8A (HIV protease retropepsin) | 21.88 | −12.2 | |
| Main protease | 6lu7A | VIA * | 2h42A (PDE5A) | 22.34 | −7.8 |
| 6y2gA/B | FOL | 4i13A (Dihydrofolate reductase) | 17.92 | −8.2 | |
| NSP10/16 | 6w75B/D | RIT * | 1sh9 (Pol polyprotein) | 20.00 | −7.6 |
| NPS10 | 6w4hB | IMN | 1z9hD (Membrane-associated prostaglandin E synthase-2) | 20.00 | −6.7 |
| NSP15 | 6vwwA | LSN | 5x24A (Cytochrome P450) | 21.59 | −7.5 |
| 6w01B | NPS | 4po0A (Serum albumin) | 19.09 | −7.3 | |
| Nucleocapsid | 6m3mA/D | LSN | 5x24A (cytochrome P450) | 16.34 | −8.5 |
| 6m3mB | RIT | 3tneB (Aspartic protease) | 19.01 | −7.2 | |
| 6m3mB/C | 017 | 3so9 (HIV protease) | 23.85 | −8.6 | |
| NSP7/NSP8 | 6yhuB | AB1 * | 2rkgB (Protease retropepsin) | N.A | −17.9 |
| 6yhuC/D | IMN | 4coxA (Cyclooxygenase-2) | N.A | −7.2 | |
| 6wiqA/B | FOL | 1rb2B (Dihydrofolate reductase) | N.A | −6.9 | |
| NSP8 | 7bv1D | FOL | 3tqbA (Dihydrofolate reductase) | N.A | −7.5 |
Drug ID is represented as follows: CLQ: chloroquine; LOC: colchicine; RIT: ritonavir; 017: darunavir; AB1: lopinavir; VIA: sildenafil; NPS: naproxen; LSN: losartan; AIN: aspirin; IMN: indomethacin; FOL: folic acid. *Denotes that the interaction had been previously reported by the accompanying citation.
Fig. 1Sub-structural similarity and poses of docked ligands from Autodock Vina. Predicted binding residues to docked ligands are indicated in orange, while ball and stick representations of atoms colored in orange indicate the residues identified by Drug ReposER that are similarly arranged to binding sites in known targets (green). The docked ligand is presented on the potential target protein from SARS-CoV-2 (light blue). (A) ADP ribose phosphatase (PDBID: 6w02) bound to docked darunavir (017) with green colored stick representation of similarly arranged residues from HIV protease retropepsin (PDBID: 2qhc). (B) NSP10 (PDBID: 6w4h) bound to docked indomethacin (IMN) with green colored stick representation of similarly arranged residues from Membrane-associated prostaglandin E synthase-2 (PDB: 1z9h). (C) NSP15 (PDBID: 6w01) bound to docked naproxen (NPS) with similarly arranged residues from serum albumin highlighted in green (D) Main protease (PDBID: 6lu7) bound to docked sildenafil (VIA) with superposed residues from PDE5A (PDBID: 2h42). (E) Docked losartan (LSN) in nucleocapsid (PDBID: 6m3m) with superposed losartan binding residues cytochrome P450 (PDBID: 5x24) indicated in green. (F) Docked folic acid (FOL) bound to NSP8 (PDBID: 7bv1) that has similar arrangement to folic acid sites in dihydrofolate reductase (PDBID: 3tqb). The locations of proposed binding sites are highlighted in orange color. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Docked drug molecules on SARS-CoV-2 ADP ribose phosphatase (PDBID: 6w02). (A) Superposed drugs in ADP ribose phosphatase obtained from docking simulations in Autodock Vina. White shaded areas indicate that the residues are within 4.0 Å to the docked drug molecules. (B-F, left) Superpositions of known drug targets to ADP ribose phosphatase based on sub-structural similarity of drug binding sites. (B-F, right) Residues that are similarly arranged in ADP ribose phosphatase and binding sites for known drugs derived from protein-drug complexes. (B) Chloroquine (CLQ) binding site in quinone reductase (PDBID: 4fgl). (C) Colchicine (LOC) binding site in tubulin chain B (PDBID: 3ut5). (D) Darunavir (017) binding site in HIV protease (PDBID: 6dh3). (E) Lopinavir (AB1) binding site in HIV protease retropepsin (PDBID: 2qhc). (F) Ritonavir (RIT) binding site in HIV protease retropepsin (PDBID: 1rl8). The location of proposed binding sites are highlighted in surface representation colored in orange. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Human protein-drug complexes available in the PDB for the approved drugs proposed for COVID-19.
| Bound drug (Drug ID1) | PDB ID of protein-drug complex | Protein structure annotation | Known / potential effects from drug binding |
|---|---|---|---|
| CLQ | 4v2o | Saposin B | Impaired lipid degradation |
| 4fgl | Quinone reductase 2 | Known treatment for malaria | |
| LOC | 4lzr | Bromodomain-containing protein | Potential repurposing for cancer |
| 5nkn | Neutrophil gelatinase-associated lipocalin | Potentially reduce poisoning effects of colchicine | |
| 017 | No drug-bound proteins from human is available. | ||
| AB1 | |||
| RIT | |||
| VIA | Conserved in phosphodiesterase family | Potential repurposing of sildenafil in multiple diseases related to PDE activities | |
| FOL | 1drf | Dihydrofolate reductase | Known target for folic acid |
| 4lrh | Folate receptor alpha | ||
| 4kmz | Folate receptor beta | ||
| LSN | 5x24 | Cytochrome P50 2C9 | Known enzyme that binds to losartan |
| NPS | 3r58 | Aldo-keto reductase family 1 member C3 | Potential repurposing of naproxen for prostate cancer |
| 4jq1 | Aldo-keto reductase family 1 member C2 | ||
| AIN | No drug-bound proteins from human is available. | ||
| IMN | 2zb8 | Prostaglandin reductase 2 | Potential use of indomethacin to improve insulin sensitivity |
| 3ads | PPAR-gamma | Potential repurposing of indomethacin for obesity and lipodystrophy | |
Drug IDs are indicated as in Table 1.
Potential off-target effects for selected drugs according to sub-structural similarity of binding patterns from Drug ReposER application.
| Potential off-targets that may cause adverse effects | ||||
|---|---|---|---|---|
| Drug ID | Query PDBID of structure with known binding site | Hit PDB of structure with a potential alternate / off-target site (Docking score) | Macromolecule and its associated pathways or mechanisms for off-target sites / involvement in antiviral activity | Potential/reported outcomes associated with the off-target site |
| LOC | 5nm5B | 2gk1I (−7.5) | Beta-hexosaminidase subunit alpha (HEXA) - Tay Sachs disease (TSD) | Neurodegeneration |
| 6dh0A | 3kciA (−9.3) | E3 ubiquitin-protein ligase (HERC2) - neurodevelopmental disorder | Neurological complications | |
| 3oxwB | 2r7eB (−7.6) | Coagulation factor VIII (C8) - hemophilia | Increased risk of hemorrhage in hemophilia patients | |
| 5veuA | 3d7uA (−8.5) | Tyrosine protein kinase (CSK) - suppress SRC tyrosine kinase (SFK) activity that cause cancer such as colorectal cancer | Increased risk of colorectal cancer | |
| 1rl8A | 3hhdA (−13.1) | Fatty acid synthase (FAS/FASN) - lipid mechanism | Lipodystrophy | |
| 2vdbA | 3fgqA (−6.1) | Neuroserpin (SERPIN1) - stroke | Increased risk of stroke | |
| 5x24A | 1kcwA (−7.4) | Ceruloplasmin (CP) - Parkinsonism | Worsens the effects of parkinsonism | |
| 5x24A | 3gzdA (−6.7) | Selenocysteine lyase (SCLY) - glucose and lipid metabolism | Prevents metabolic syndromes | |
| 4po0A | 1qm9A (−6.9) | Polypyrimidine tract-binding protein 1 (PTBP1) - allow viral replication through its RNA binding domain | Potentially inhibit viral replication | |
| 2qqtA | 1d7wC (−6.1) | Myloperoxidase (MPO) - antiviral properties toward influenza virus | Potentially decrease lung viral loads | |
| 3ogwA | 6azpA (−8.2) | |||
| 1rl8A | 3hhdA (−13.1) | Fatty acid synthetase (FAS/FASN) - facilitate viral replication through generation of membrane compartments | Potentially inhibit viral replication | |
Drug IDs are indicated as in Table 1.
Fig. 3Examples of potential off-target sites for selected drug molecules. Residues in cornflower blue are within 4.0 Å to a drug molecule derived from protein-drug complexes in the PDB; tan residues are potential protein targets that are similarly arranged to binding residues from protein-drug complexes. All docked ligands are colored in red. (A) The indomethacin binding site in lactoperoxidase that are similarly arranged to residues in myeloperoxidase. (B) Superposition of the losartan binding site in cytochrome 450 docked onto the binding pose in ceruloplasmin (C) Similarly arranged residue patterns between the binding site of naproxen in serum albumin and residues within 4.0 Å to the docked naproxen in polypyrimidine tract-binding protein 1. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Human proteins with more than 30% sequence identity to SARS-CoV-2 proteins retrieved by a blastp search of the PDB database.
| SARS-CoV-2 protein | PDBID of structure homolog | Protein function / disease mechanism Potential side effects / benefits |
|---|---|---|
| NSP3 / ADP ribose phosphatase | 3q6zA (31.25%) | Poly [ADP-ribose] polymerase 14, catalyze the mono-ADP-ribosylation of STAT1, functions in innate immune response |
| Spike protein | 4z7iA (48.51%) | Insulin-regulated amino peptidase – binds angiotensin IV in the brain |
| 4bkfC (72.00%) | EPHRIN-B3 – serves as receptor for Nipah virus | |
| 5ojmA (94.44%) | Gamma-aminobutyric acid receptor subunit alpha-5 – implicated in neurological disorders | |
| NSP15 | 4ewqA (41.67%) | Mitogen-activated protein kinase 14 – plays a role in neuroinflammatory responses |
| 4tntA (94.44%) | Mineralocorticoid receptor - plays a role in inflammatory responses through regulation of macrophage and T-cells, and is implicated in cardiac hypertrophy | |
| Nucleocapsid | 2vxsA (37.84%) | Interleukin-17a – involves in inflammatory responses and plays a role in cardiovascular complications |
| Main protease | No human homolog found, conserved in viruses. | |
| NSP7 | ||
| NSP8 | ||
| NSP10 | ||
| NSP16 | ||
1Drug IDs are indicated as in Table 1.
Sequence similarity percentages are provided in brackets.
Fig. 4Examples of potential off-target sites for selected drug molecules based on a blastp search for human proteins with more than 30% sequence identity to SARS-CoV-2 proteins. (A) A superposition of structures between the Poly [ADP-ribose] polymerase 14 from human (tan color) and ADP ribose in NSP3 of SARS-CoV-2 (blue color) resulting in a shared binding site for substrate (APR) and proposed site for binding to ritonavir (red). (B) Superposition of human EPHRIN-B3 (tan color) and spike protein (blue color) with the proposed site for darunavir (cyan). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
The drug molecules in Drug ReposER that exhibit similar structure with the proposed drug molecules for COVID-19 clinical trials that have the similar potential ability of binding to SARS-CoV-2 proteins.
| Drug molecules proposed for COVID-19 clinical trials (proposed drugs) | Drug molecules exhibiting similar structure with drug molecules undergoing trials (matched drugs) | Shared SARS-CoV-2 protein targets predicted by Drug ReposER | Similar binding sites on the target structures from SARS-CoV-2 identified by Drug ReposER | Molecular docking analysis | ||
|---|---|---|---|---|---|---|
| Presence of binding conformation that is close to the predicted binding site (<4 Å) | Binding affinity (kcal/mol) | |||||
| Chloroquine (CLQ) | Quinacrine (QUN) | Angiotensin-converting enzyme 2 (ACE2) (6m17) | – | CLQ | None | −6.2 |
| QUN | None | −6.5 | ||||
| Sildenafil (VIA) | Vardenafil (VDN) | Non-structural protein 16 (NSP16) (6w75) | MET A 6929 | VIA | Yes | −7.9 |
| ILE A 6951 | ||||||
| TYR A 6979 | ||||||
| ALA A 6990 | ||||||
| HIS A 7023 | VDN | Yes | −7.8 | |||
| Thalidomide (EF2) | Lenalidomide (LVY) | ACE2 and Receptor Binding Domain (RBD) (6m17) | TYR B 41 | EF2 | None | −7.6 |
| ASN E 439 | ||||||
| PHE E 497 | ||||||
| PRO E 507 | ||||||
| LVY | None | −7.4 | ||||
| Pomalidomide (Y70) | NSP16 (6w4h) | HIS A 6867 | EF2 | None | −7.7 | |
| THR A 6891 | ||||||
| TRP A 6922 | ||||||
| PHE A 6954 | Y70 | None | −7.6 | |||
| Darunavir (017) | Amprenavir (478) | NSP3 (6w02) | PRO B 125 | 17 | Yes | −9.4 |
| GLY B 130 | ||||||
| ILE B 131 | ||||||
| VAL B 155 | ||||||
| ASP B 157 | 478 | Yes | −9.4 | |||
| Folic Acid (FOL) | Methotrexate (MTX) | Main protease (Mpro) (7buy) | THR A 199 | FOL | Yes | −7.5 |
| LEU A 205 | ||||||
| VAL A 233 | ||||||
| SER A 267 | ||||||
| LEU A 271 | MTX | Yes | −7.3 | |||
Fig. 5Quinacrine (QUN), vardenafil (VDN), lenalidomide (LVY), pomalidomide (Y70), amprenavir (478), and methotrexate (MTX) share structural similarities with the corresponding drug molecules that have been proposed for COVID-19 clinical trials: chloroquine (CLQ), sildenafil (VIA), thalidomide (EF2), darunavir (017) and folic acid (FOL) respectively. Structural alignment of QUN (red) (A), LVY (yellow) (B), Y70 (gold) (C), VDN (cyan) (D), 478 (green) (E), and MTX (black) (F) with CLQ (pink) (A), EF2 (brown) (B, C), VIA (purple) (D), 017 (blue) (E), and FOL (gray) (F) respectively. Molecular docking for the structurally similar drug molecules and the corresponding drug molecules that have been proposed for COVID-19 clinical trials on their shared protein target from SARS-CoV-2 predicted by Drug ReposER (ACE2 and RBD complex (PDBID: 6 m17, green) (A,B), NSP16 (PDBID: 6w4h, gray) (C), NSP16 (PDBID: 6w75, blue) (D), NSP3 (PDBID: 6w02, magenta) (E), and Mpro (PDBID: 7buy, brown) (F)). The white shaded areas indicate regions containing residues within less than 4.0 Å to docked drug molecules. The orange shaded areas indicate regions containing residues that form the binding sites identified by Drug ReposER. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)