| Literature DB >> 35885993 |
Yulin Dai1, Hui Yu2, Qiheng Yan1,3, Bingrui Li1,4, Andi Liu1,5, Wendao Liu1,6, Xiaoqian Jiang7, Yejin Kim7, Yan Guo2, Zhongming Zhao1,5,6,8.
Abstract
The coronavirus disease 2019 (COVID-19) pandemic has caused a dramatic loss of human life and devastated the worldwide economy. Numerous efforts have been made to mitigate COVID-19 symptoms and reduce the death rate. We conducted literature mining of more than 250 thousand published works and curated the 174 most widely used COVID-19 medications. Overlaid with the human protein-protein interaction (PPI) network, we used Steiner tree analysis to extract a core subnetwork that grew from the pharmacological targets of ten credible drugs ascertained by the CTD database. The resultant core subnetwork consisted of 34 interconnected genes, which were associated with 36 drugs. Immune cell membrane receptors, the downstream cellular signaling cascade, and severe COVID-19 symptom risk were significantly enriched for the core subnetwork genes. The lung mast cell was most enriched for the target genes among 1355 human tissue-cell types. Human bronchoalveolar lavage fluid COVID-19 single-cell RNA-Seq data highlighted the fact that T cells and macrophages have the most overlapping genes from the core subnetwork. Overall, we constructed an actionable human target-protein module that mainly involved anti-inflammatory/antiviral entry functions and highly overlapped with COVID-19-severity-related genes. Our findings could serve as a knowledge base for guiding drug discovery or drug repurposing to confront the fast-evolving SARS-CoV-2 virus and other severe infectious diseases.Entities:
Keywords: COVID-19; drug treatment; drug-target network; text mining
Mesh:
Year: 2022 PMID: 35885993 PMCID: PMC9316565 DOI: 10.3390/genes13071210
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Literature mining and subnetwork extraction workflow: text mining of COVID-19 drugs, drug target curation, and Steiner tree network analysis. Abstracts with matching keywords were downloaded from PubMed. Drug names were extracted from the downloaded abstracts using Med7 and a cutoff was applied based on the empirical distribution, to further narrow down the drug list. Target gene information for each drug was collected from DrugBank. Starting from the target genes of ten credible drugs ascertained by CTDbase, Steiner tree algorithm was applied to a human protein–protein interaction network to extract a core target interaction subnetwork.
The top ten genes with the highest degree in the COVID-19-related parental PPI network.
| Gene Symbol | Degree | Betweenness |
|---|---|---|
| HSP90AA1 | 196 | 37,757.90 |
| TP53 | 148 | 17,451.41 |
| APP | 145 | 32,447.83 |
| NTRK1 | 141 | 15,363.83 |
| MYC | 134 | 12,058.01 |
| EGFR | 125 | 19,208.06 |
| ESR1 | 114 | 7602.044 |
| ESR2 | 89 | 8696.631 |
| EGLN3 | 83 | 5614.034 |
| XPO1 | 82 | 4984.01 |
The number of target genes of 10 common COVID-19-related drugs between our compilation and CTDbase.
| Drug | Compilation | CTDbase | Overlapping | Union | PharmGKB Annotation |
|---|---|---|---|---|---|
| acalabrutinib † | 1 | 3 | 0 | 4 | other |
| aliskiren | 1 | 3 | 0 | 4 | NA |
| Argatroban ‡ | 1 | 1 | 0 | 2 | NA |
| baricitinib †‡ | 4 | 11 | 3 | 12 | anti-cytokine/anti-inflammatory |
| bicalutamide | 1 | 4 | 0 | 5 | NA |
| dapagliflozin †‡ | 1 | 3 | 0 | 4 | other |
| Ibrutinib † | 1 | 4 | 1 | 4 | NA |
| montelukast †‡ | 2 | 6 | 0 | 8 | NA |
| ruxolitinib † | 4 | 4 | 0 | 8 | other |
| tofacitinib †‡ | 4 | 8 | 0 | 12 | anti-cytokine/anti-inflammatory |
We queried the clinical trials information of the 10 drugs on clinicaltrials.gov as of 28 February 2022. † Indicates that the drug was in phase 2/3 of clinical trial(s) for testing to treat COVID-19. ‡ Indicates that the drug was in phase 4 of clinical trial(s) for testing to treat COVID-19.
Figure 2Steiner-tree-inferred protein–protein interaction network that interconnects 25 convincing COVID-19 drug target genes and functional enrichment. (A) Red node: credible COVID-19 drug target genes as terminals of the inferred Steiner tree. Blue node: a minimum set of genes (mediators) through which the interconnected subnetwork was formed. Node size was proportional to the degree. (B) Top 20 significant enrichment results for Gene Ontology (GO) analysis of biological process, molecular function, and cellular component. Each row is the GO term. The color of the circle is proportional to the value of −log10 (PBH) for each term, from blue to red. The circle size is proportional to the number of intersected genes between the 34 COVID19-DrugNET genes and the term genes.
Plausible COVID-19 drugs with target genes significantly enriched in COVID19-DrugNET.
| Drug | p.hyper | Target Reservation Rate | Reserved Target Genes |
|---|---|---|---|
| adalimumab | 0 | 1/1 | TNF |
| bromhexine | 0.0018 | 1/2 | TMPRSS2 |
| canakinumab | 0 | 1/1 | IL1B |
| deferoxamine | 0 | 1/1 | APP |
| epinephrine | 0.0036 | 2/8 | ADRB2, TNF |
| formoterol | 0.0053 | 1/3 | ADRB2 |
| infliximab | 0 | 1/1 | TNF |
| leflunomide | 0.0053 | 1/3 | PTK2B |
| progesterone | 0.0073 | 2/10 | ESR1, AR |
| tocilizumab | 0 | 1/1 | IL6R |
Figure 3Cell-type-specificity of COVID19-DrugNET genes. The red dashed line indicates the Bonferroni-corrected significant threshold −log10 (p = 3.69 × 10−5). The grey solid line indicates the nominal significance −log10 (p = 1 × 10−3). (A) In each category of organ systems, each dot represents one tissue cell type from that organ system, in a different color by column. We highlighted the top cell type, i.e., lung mast cell in respiratory system. (B) In each category of tissue, each dot represents one cell type from that tissue, in a different color by column. We highlighted the top cell type, i.e., lung mast cell in one lung single-cell RNA sequencing (scRNA-seq) study. (C) Heatmap for the COVID19-DrugNET gene cell-type-specific enrichment analysis results in one lung scRNA-seq panel. The color is proportional to the p-values. The first column is the tissue cell type in this scRNA-seq panel. The second column is the raw p-values. The third column is the combined p-value calculated by WebCSEA.
Figure 4Comparison of COVID19-DrugNET genes with the differentially expressed genes (DEGs) from single-cell RNA-seq COVID-19 bronchoalveolar lavage fluid dataset. (A) In the severe and healthy groups, the UpSet plot shows the shared and uniqued components among the top 5 cell types with overlapping genes between the corresponding DEGs and COVID19-DrugNET genes. The set size indicates the overlapping genes. NK: natural killer cell; mDC: myeloid dendritic cell. (B) In the severe and moderate groups, the UpSet plot shows the shared and unique components among the top 5 cell types with overlapping genes between the corresponding DEGs and COVID19-DrugNET genes. The set size indicates the overlapping genes. mDC: myeloid dendritic cells; pDC: plasmacytoid dendritic cells.