| Literature DB >> 34834924 |
Xuan Xu1, Qing-Ye Zhang1, Xin-Yi Chu1, Yuan Quan1, Bo-Min Lv1, Hong-Yu Zhang1.
Abstract
Over the course of human history, billions of people worldwide have been infected by various viruses. Despite rapid progress in the development of biomedical techniques, it is still a significant challenge to find promising new antiviral targets and drugs. In the past, antiviral drugs mainly targeted viral proteins when they were used as part of treatment strategies. Since the virus mutation rate is much faster than that of the host, such drugs feature drug resistance and narrow-spectrum antiviral problems. Therefore, the targeting of host molecules has gradually become an important area of research for the development of antiviral drugs. In recent years, rapid advances in high-throughput sequencing techniques have enabled numerous genetic studies (such as genome-wide association studies (GWAS), clustered regularly interspersed short palindromic repeats (CRISPR) screening, etc.) for human diseases, providing valuable genetic and evolutionary resources. Furthermore, it has been revealed that successful drug targets exhibit similar genetic and evolutionary features, which are of great value in identifying promising drug targets and discovering new drugs. Considering these developments, in this article the authors propose a host-targeted antiviral drug discovery strategy based on knowledge of genetics and evolution. We first comprehensively summarized the genetic, subcellular location, and evolutionary features of the human genes that have been successfully used as antiviral targets. Next, the summarized features were used to screen novel druggable antiviral targets and to find potential antiviral drugs, in an attempt to promote the discovery of new antiviral drugs.Entities:
Keywords: evolution; genetics; host receptors
Mesh:
Substances:
Year: 2021 PMID: 34834924 PMCID: PMC8626054 DOI: 10.3390/v13112117
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Basic information of approved antiviral host targets.
| Gene | Uniprot ID | Protein Name | SCG Virus | Subcellular Location | Origin | Associated Virus Diseases Name |
|---|---|---|---|---|---|---|
|
| P02745 | Complement C1q subcomponent subunit A | 1 | Secreted | Vertebrata | Herpes Simplex |
|
| P02746 | Complement C1q subcomponent subunit B | 1 | Secreted | Vertebrata | Herpes Simplex; Carcinoma, Merkel Cell |
|
| P02747 | Complement C1q subcomponent subunit C | 1 | Secreted | Vertebrata | Herpes Simplex; Adenoviridae Infections |
|
| P00736 | Complement C1r subcomponent, EC 3.4.21.41 | 3 | Secreted | Vertebrata | AIDS Dementia Complex |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| P20839 |
|
| Cytoplasm | Cellular_ |
|
|
| P12268 | Inosine-5’-monophosphate dehydrogenase 2 | 1 | Nucleus | Cellular_ | Sarcoma, Kaposi |
|
|
|
|
|
|
|
|
|
| Q9Y3R4 |
|
| Cytoplasm | Euk + Bac |
|
|
| O75469 | Nuclear receptor subfamily 1 group I member 2 | 1 | Nucleus | Eumetazoa | Hepatitis; Measles; Paramyxoviridae Infections… |
|
| P49902 | Cytosolic purine 5’-nucleotidase, EC 3.1.3.5 | 1 | Cytoplasm | Eukaryota | AIDS Dementia Complex; Cytomegalovirus Infections; Sarcoma, Kaposi |
|
| P00491 | Purine nucleoside phosphorylase, PNP, EC 2.4.2.1 | 1 | Cytoplasm | Euk + Bac | Cytomegalovirus Infections; AIDS Dementia Complex; Epstein-Barr Virus Infections |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| O14746 | Telomerase reverse transcriptase, EC 2.7.7.49 | 1 | Nucleolus | Eukaryota | Carcinoma, Merkel Cell; Leukoencephalopathy, Progressive Multifocal; Influenza, Human |
|
| P11388 | DNA topoisomerase 2-alpha, EC 5.6.2.2 | 1 | Nucleoplasm | Eukaryota | Leukoencephalopathy, Progressive Multifocal; Carcinoma, Merkel Cell; Hepatitis, Viral, Human |
|
| P68366 | Tubulin alpha-4A chain | 4 | Cytoskeleton | Eukaryota | AIDS Dementia Complex; Leukoencephalopathy, Progressive Multifocal |
|
| P07437 | Tubulin beta chain | —— | Cytoplasm, cytoskeleton | Eukaryota | —— |
|
| P04818 | Thymidylate synthase, TS, TSase, EC 2.1.1.45 | 1 | Nucleus | Euk + Bac | Sarcoma, Kaposi; Influenza, Human; Hepatitis |
Genes located on the cell membrane are shown in bold. * Complete candidate targets and diseases association information are listed in Table S1. Disease names are listed in descending order of SCG scores.
Figure 1The statistical histogram of evolutionary features of approved antiviral receptors and cell membrane receptors. The results of the evolutionary information statistics showed that the 36 approved targets were distributed mainly in genes belonging to class 6 (Eumetazoa) (14/36, p = 0.04, hypergeometric assay) (a). As for the 21 cell membrane receptors, 13 of them were from the Eumetazoan period, and the tendency for enrichment was more pronounced (p = 0.00034, hypergeometric test) (b).
Figure 2Antiviral host targets screening by genetic, subcellular location, and evolutionary biology features (different shapes and colors of symbols represent various genes related to viral diseases). Based on the analysis of the genetic features, subcellular localization, and evolutionary features of the approved antiviral host targets, it is summarized that an effective antiviral target generally exhibits the following three characteristics: (1) a median of genetic scores associated with viral disease (s) equal to or greater than three; (2) subcellular localization on the cell membrane; (3) origins in the Eumetazoan period.
Basic information of candidate targets.
| Gene | Uniprot ID | Protein | Drug | Recognized | SCG Virus Disease(S) Median Score | Associated Virus Disease(s) Name | Evolutionary Age | Pubmed |
|---|---|---|---|---|---|---|---|---|
|
| P08913 | Alpha-2A adrenergic receptor | known | Yes | 3 | AIDS Dementia Complex; Hepatitis | Eumetazoa | 13 |
|
| P07550 | Beta-2 adrenergic receptor | known | Yes | 3 | Leukoencephalopathy, Progressive Multifocal; Paramyxoviridae Infections; Picornaviridae Infections | Eumetazoa | 52 |
|
| P13945 | Beta-3 adrenergic receptor | known | Yes | 3 | Hepatitis; AIDS Dementia Complex; Picornaviridae Infections | Eumetazoa | 1 |
|
| Q15109 | Advanced glycosylation end product-specific receptor | unknown | No | 3 | Leukoencephalopathy, Progressive Multifocal; Hepatitis; AIDS Dementia Complex | Eumetazoa | 27 |
|
| P30556 | Type-1 angiotensin II receptor | known | Yes | 3 | Leukoencephalopathy, Progressive Multifocal; Paramyxoviridae Infections; Hepatitis | Eumetazoa | 3 |
|
| P78348 | Acid-sensing ion channel 1, ASIC1 | known | Yes | 3 | Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 3 |
|
| O00499 | Myc box-dependent-interacting protein 1 | unknown | No | 6 | Fatigue Syndrome, Chronic; AIDS Dementia Complex | Eumetazoa | 14 |
|
| P35613 | Basigin | unknown | No | 3 | Hepatitis; West Nile Fever; Epstein–Barr Virus Infections | Eumetazoa | 19 |
|
| Q8IU99 | Calcium homeostasis modulator protein 1 | unknown | No | 9 | AIDS Dementia Complex | Eumetazoa | 1 |
|
| Q9NNX6 | CD209 | unknown | Yes | 3 | West Nile Fever; Hepatitis; Hemorrhagic Fever, Ebola | Eumetazoa | 65 |
|
| P42081 | T-lymphocyte activation antigen CD86 | known | Yes | 3 | Respiratory Syncytial Virus Infections; Picornaviridae Infections; Hepatitis D | Eumetazoa | 758 |
|
| Q13324 | Corticotropin-releasing factor | unknown | Yes | 3 | Hepatitis; Fatigue Syndrome, Chronic; West Nile Fever | Eumetazoa | 4 |
|
| P21917 | D(4) dopamine | known | Yes | 3 | AIDS Dementia Complex | Eumetazoa | 5 |
|
| Q13474 | Dystrophin-related protein 2 | unknown | No | 3 | AIDS Dementia Complex | Eumetazoa | 1 |
|
| P25445 | Tumor necrosis factor receptor superfamily member 6 | unknown | Yes | 3 | Picornaviridae Infections; AIDS Dementia Complex; Carcinoma, Merkel Cell | Eumetazoa | 1588 |
|
| Q92847 | Growth hormone secretagogue | known | Yes | 3 | Carcinoma, Merkel Cell | Eumetazoa | 7 |
|
| P78333 | Glypican-5 | unknown | No | 3 | Leukoencephalopathy, Progressive Multifocal; | Eumetazoa | 1 |
|
| Q8IYL9 | Psychosine receptor | unknown | No | 6 | Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 1 |
|
| Q14833 | Metabotropic | known | Yes | 3 | Leukoencephalopathy, Progressive Multifocal; Hepatitis; Picornaviridae Infections | Eumetazoa | 1 |
|
| P30550 | Gastrin-releasing peptide receptor | unknown | Yes | 3 | AIDS Dementia Complex | Eumetazoa | 1 |
|
| P14778 | Interleukin-1 | known | Yes | 3 | Leukoencephalopathy, Progressive Multifocal; Picornaviridae Infections; HIV Infections | Eumetazoa | 17 |
|
| Q01638 | Interleukin-1 | unknown | No | 3 | Influenza, Human; Hepatitis D; West Nile Fever | Eumetazoa | 31 |
|
| P51787 | Potassium voltage-gated channel | known | Yes | 3 | Picornaviridae Infections; AIDS Dementia Complex; Measles | Eumetazoa | 5 |
|
| O75473 | Leucine-rich repeat-containing G-protein coupled receptor 5 | unknown | Yes | 3 | HIV Infections; Influenza, Human; Paramyxoviridae Infections | Eumetazoa | 25 |
|
| P20916 | Myelin-associated glycoprotein | unknown | No | 3 | Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 100 |
|
| Q6W5P4 | Neuropeptide S | known | Yes | 3 | Hepatitis; Respiratory Syncytial Virus Infections; Influenza, Human | Eumetazoa | 1 |
|
| Q9NPB6 | Partitioning defective 6 homolog alpha | unknown | No | 3 | AIDS Dementia Complex | Eumetazoa | 2 |
|
| Q96FX8 | p53 apoptosis effector related to PMP-22 | unknown | No | 3 | Carcinoma, Merkel Cell; Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 5 |
|
| P49802 | Regulator of G-protein signaling 7 | unknown | No | 3 | Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 1 |
|
| Q7L0Q8 | Rho-related GTP-binding protein RhoU | unknown | No | 3 | Carcinoma, Merkel Cell | Eumetazoa | 4 |
|
| Q16586 | Alpha-sarcoglycan, Alpha-SG | unknown | No | 14 | Fatigue Syndrome, Chronic | Eumetazoa | 7 |
|
| Q15464 | SH2 domain-containing adapter protein B | unknown | No | 3 | AIDS Dementia Complex | Eumetazoa | 19 |
|
| Q8WXH0 | Nesprin-2 | unknown | No | 7 | Fatigue Syndrome, Chronic; | Eumetazoa | 4 |
|
| P36897 | TGF-beta receptor type-1 | known | Yes | 3 | Hepatitis; Hepatitis B; Carcinoma, Merkel Cell | Eumetazoa | 29 |
|
| P50591 | Tumor necrosis factor ligand superfamily member 10 | unknown | No | 3 | Hepatitis D; HIV Infections; Leukoencephalopathy, Progressive Multifocal | Eumetazoa | 224 |
a Refers to the number of NCBI Pubmed (https://www.ncbi.nlm.nih.gov/, accessed on 20 October 2021) studies related to this gene and virus. * Complete candidate targets and diseases association information are listed in Table S2. Disease names are listed in descending order of SCG scores.
Druggability information of candidate targets.
| Gene Symbol | PDB ID | Protein Name | Resolution (Å) | Ligand(s) a | Number of Pockets | Number of Druggable Pockets (Score ≥ 0.5) | Best Druggable Pocket Score |
|---|---|---|---|---|---|---|---|
|
| 3O3U | Advanced glycosylation end product-specific receptor | 1.50 | MLR, SO4 | 24 | 13 | 1.00 |
|
| 2FIC | Myc box-dependent-interacting protein 1 | 1.99 | XE | 11 | 7 | 0.91 |
|
| 3I84 | Basigin | 2.00 | CL | 2 | 0 | —— |
|
| 2XR6 | CD209 antigen | 1.35 | 07B, MAN, AE9, CA, CL | 2 | 2 | 0.78 |
|
| 3N93-AB | Corticotropin-releasing factor receptor 2 | 2.50 | MAL, GOL | 41 | 18 | 1.00 |
|
| 3TJE-F | Tumor necrosis factor receptor superfamily member 6 | 1.93 | CD, EDO, CL | 4 | 1 | 0.84 |
|
| 4KC3-B | Interleukin-1 receptor-like 1 | 3.27 | NAG, MSE | 11 | 7 | 0.98 |
|
| 4UFR-AC | Leucine-rich repeat-containing G-protein coupled receptor 5 | 2.20 | NAG, CL | 34 | 21 | 0.99 |
|
| 1WMH-B | Partitioning defective 6 homolog alpha | 1.50 | —— | 4 | 3 | 0.99 |
|
| 2A72 | Regulator of G-protein signaling 7 | 2.00 | CL | 7 | 2 | 0.60 |
|
| 2Q3H | Rho-related GTP-binding protein RhoU | 1.73 | GDP, MG | 6 | 1 | 0.93 |
|
| 1DG6-A | Tumor necrosis factor ligand superfamily member 10 | 1.30 | ZN, CL | 2 | 1 | 0.65 |
a The ligand information is derived from the RCSB PDB (https://www.rcsb.org/, accessed on 20 October 2021) [105].
Figure 3The best druggable pocket (shown in cyan surface mode) of each target predicted by PockDrug. A total of 11 candidate genes with known protein crystal structures and with one or more druggable pockets are presented, including CD209 (a), LGR5 (b), CRHR2 (c), FAS (d), AGER (e), BIN1 (f), IL1RL1 (g), PARD6A (h), RGS7 (i), RHOU (j) and TNFSF10 (k). All the predicted information by PockDrug is summarized in Table 3 and Table S3. Pockets with druggability scores of 0.5 or more were considered druggable.