| Literature DB >> 33611340 |
Tasnimul Alam Taz1, Kawsar Ahmed2, Bikash Kumar Paul3, Fahad Ahmed Al-Zahrani4, S M Hasan Mahmud1, Mohammad Ali Moni5.
Abstract
This study aimed to identify significant gene expression profiles of the human lung epithelial cells caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections. We performed a comparative genomic analysis to show genomic observations between SARS-CoV and SARS-CoV-2. A phylogenetic tree has been carried for genomic analysis that confirmed the genomic variance between SARS-CoV and SARS-CoV-2. Transcriptomic analyses have been performed for SARS-CoV-2 infection responses and pulmonary arterial hypertension (PAH) patients' lungs as a number of patients have been identified who faced PAH after being diagnosed with coronavirus disease 2019 (COVID-19). Gene expression profiling showed significant expression levels for SARS-CoV-2 infection responses to human lung epithelial cells and PAH lungs as well. Differentially expressed genes identification and integration showed concordant genes (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1) for both SARS-CoV-2 and PAH samples, including S100A9 and S100A8 genes that showed significant interaction in the protein-protein interactions network. Extensive analyses of gene ontology and signaling pathways identification provided evidence of inflammatory responses regarding SARS-CoV-2 infections. The altered signaling and ontology pathways that have emerged from this research may influence the development of effective drugs, especially for the people with preexisting conditions. Identification of regulatory biomolecules revealed the presence of active promoter gene of SARS-CoV-2 in Transferrin-micro Ribonucleic acid (TF-miRNA) co-regulatory network. Predictive drug analyses provided concordant drug compounds that are associated with SARS-CoV-2 infection responses and PAH lung samples, and these compounds showed significant immune response against the RNA viruses like SARS-CoV-2, which is beneficial in therapeutic development in the COVID-19 pandemic.Entities:
Keywords: COVID-19; SARS-CoV; SARS-CoV-2; pulmonary arterial hypertension; transcriptomic profiling
Mesh:
Substances:
Year: 2021 PMID: 33611340 PMCID: PMC7929374 DOI: 10.1093/bib/bbab026
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Figure 1The workflow of current analysis. Genomic differences between SARS-CoV and SARS-CoV-2 are visualized through a phylogenetic analysis. Two datasets GSE147507 and GSE117261 are collected according to SARS-CoV-2 infection in human lung epithelial cells and PAH lung, respectively. Differentially expressed genes (DEGs) were identified using R programming language and similar DEGs were identified from total DEGs of both the datasets. Corresponding similar DEGs were used to perform transcriptomic analyses. The gene expression profiling was performed for both the datasets, and gene ontology (GO) terms, cell informative pathways, PPIs network, hub gene identification and TF–miRNA-based analyses were performed. According to the corresponding similar DEGs, drug compounds were predicted.
Figure 2Phylogram of SARS-CoV and SARS-CoV-2, which provides genomic differences between human coronaviruses of 2003–2018 (SARS-CoV) and 2019–2020 (SARS-CoV-2). Two colors are implemented to differentiate SARS-CoV (purple) and SARS-CoV-2 (green).
Figure 3Gene expression profiling of SARS-CoV-2 infection in human lung epithelial cells for the top 20 genes and selected 24 samples from the GSE147507 dataset.
Figure 4(A) Gene expression visualization of healthy controls (GSM3290083, GSM3290086 and GSM3290085) and PAH samples. (B) Volcano plot shows the regulation of genes (upregulated and downregulated) for GSE117261.
Figure 5(A) Concordant gene identification between GSE147507 and GSE117261 dataset that provide evidence of six common differentially expressed genes in between 108 genes of GSE147507 (COVID-19) and 59 genes of GSE117261 (PAH) dataset. (B) Heat map according to the log fold changes for the shared common DEGs between COVID-19 dataset and PAH dataset.
Figure 6(A) Heat map for the identification of highly risk prone nature of S100A9 and S100A8 genes. (B) Risk group comparisons between the shared common genes of SARS-CoV-2 and PAH.
The association of concordant genes in GO terms and GO pathways and the proportional P-values
| Category | GO ID | Term |
| Genes |
|---|---|---|---|---|
| GO biological process | GO:0030593 | Neutrophil Chemotaxis | 6.563(e-10) | SAA1, S100A12, S100A9, S100A8 |
| GO:0071621 | Granulocyte Chemotaxis | 8.230(e-10) | SAA1, S100A12, S100A9, S100A8 | |
| GO:1990266 | Neutrophil Migration | 9.506(e-10) | SAA1, S100A12, S100A9, S100A8 | |
| GO:0050832 | Defense response to fungus | 1.018(e-8) | S100A12, S100A9, S100A8 | |
| GO:0050727 | Regulation of inflammatory response | 6.777(e-8) | SAA1, S100A12, S100A9, S100A8 | |
| GO:0051091 | Positive regulation of sequence-specific DNA-binding transcription factor activity | 1.915(e-7) | EDN1, S100A12, S100A9, S100A8 | |
| GO:0050729 | Positive regulation of inflammatory response | 9.257(e-7) | S100A12, S100A9, S100A8 | |
| GO:0031349 | Positive regulation of defense response | 9.647(e-7) | S100A12, S100A9, S100A8 | |
| GO:0070486 | Leukocyte aggregation | 0.000001574 | S100A9, S100A8 | |
| GO:0032103 | Positive regulation of response to external stimulus | 0.000001745 | S100A12, S100A9, S100A8 | |
| GO molecular function | GO:0050786 | RAGE receptor binding | 1.259(e-9) | S100A12, S100A9, S100A8 |
| GO:0035325 | Toll-like receptor binding | 0.000002697 | S100A9, S100A8 | |
| GO:0005509 | Calcium ion binding | 0.00005490 | S100A12, S100A9, S100A8 | |
| GO:0008270 | Zinc ion binding | 0.00006592 | S100A12, S100A9, S100A8 | |
| GO:0046914 | Transition metal ion binding | 0.0001507 | S100A12, S100A9, S100A8 | |
| GO:0046872 | Metal ion binding | 0.0002040 | S100A12, S100A9, S100A8 | |
| GO:0008017 | Microtubule binding | 0.001383 | S100A9, S100A8 | |
| GO:0015631 | Tubulin binding | 0.002348 | S100A9, S100A8 | |
| GO:0005507 | Copper ion binding | 0.01224 | S100A12 | |
| GO cellular component | GO:0060205 | Cytoplasmic vesicle lumen | 2.453(e-8) | SAA1, S100A12, S100A9, S100A8 |
| GO:0071682 | Endocytic vesicle lumen | 0.005388 | SAA1 | |
| GO:0005881 | Cytoplasmic microtubule | 0.01135 | SAA1 | |
| GO:0034774 | Secretory granule lumen | 0.00007614 | S100A12, S100A9, S100A8 | |
| GO:0045111 | Intermediate filament cytoskeleton | 0.02111 | S100A8 | |
| GO:0005856 | Cytoskeleton | 0.0003296 | S100A12, S100A9, S100A8 | |
| GO:0030139 | Endocytic vesicle | 0.03197 | SAA1 | |
| GO:0005874 | Microtubule | 0.06138 | SAA1 |
The association of concordant genes in KEGG, WikiPathways, Reactome and BioCarta databases and the proportional P-values
| Databases | Pathways |
| Genes |
|---|---|---|---|
| KEGG | Interleukin 17 (IL-17) signaling pathway | 0.0003170 | S100A9, S100A8 |
| Renin secretion | 0.02052 | EDN1 | |
| Hypertrophic cardiomyopathy (HCM) | 0.02523 | EDN1 | |
| AGE–RAGE signaling pathway in diabetic complications | 0.02963 | EDN1 | |
| HIF-1 signaling pathway | 0.02963 | EDN1 | |
| Melanogenesis | 0.02992 | EDN1 | |
| Tumor necrosis factor (TNF) signaling pathway | 0.03255 | EDN1 | |
| Relaxin signaling pathway | 0.03838 | EDN1 | |
| Vascular smooth muscle contraction | 0.03896 | EDN1 | |
| Fluid shear stress and atherosclerosis | 0.04099 | EDN1 | |
| WikiPathways | Vitamin B12 metabolism WP1533 | 0.00009129 | SAA1, SAA2 |
| Folate metabolism WP176 | 0.0001595 | SAA1, SAA2 | |
| IL1 and megakaryocytes in obesity WP2865 | 0.007179 | S100A9 | |
| Physiological and pathological hypertrophy of the heart WP1528 | 0.007477 | EDN1 | |
| Selenium micronutrient network WP15 | 0.0002711 | SAA1, SAA2 | |
| Endothelin pathways WP2197 | 0.009860 | EDN1 | |
| Photodynamic therapy-induced HIF-1 survival signaling WP3614 | 0.01105 | EDN1 | |
| Melatonin metabolism and effects WP3298 | 0.01105 | EDN1 | |
| Prostaglandin synthesis and regulation WP98 | 0.01343 | EDN1 | |
| Vitamin D receptor pathway WP2877 | 0.001206 | S100A9, S100A8 | |
| Reactome | Advanced glycosylation endproduct receptor signaling | 0.000005841 | SAA1, S100A12 |
| DEx/H-box helicases activate type I IFN and inflammatory cytokines production | 0.000005841 | SAA1, S100A12 | |
| Scavenging by Class B receptors | 0.001499 | SAA1 | |
| RIP-mediated NFkB activation via ZBP1 | 0.00001571 | SAA1, S100A12 | |
| TRAF6-mediated NF-kB activation | 0.00002064 | SAA1, S100A12 | |
| ZBP1(DAI)-mediated induction of type I IFNs | 0.00002430 | SAA1, S100A12 | |
| TAK1 activates NFkB by phosphorylation and activation of IKKs complex | 0.00002430 | SAA1, S100A12 | |
| Formyl peptide receptors bind formyl peptides and many other ligands | 0.002398 | SAA1 | |
| Cytosolic sensors of pathogen-associated DNA | 0.0001595 | SAA1, S100A12 | |
| TRAF6-mediated induction of proinflammatory cytokines | 0.0001899 | SAA1, S100A12 | |
| BioCarta | G-protein signaling through tubby proteins | 0.002997 | EDN1 |
| Activation of PKC through G-protein-coupled receptors | 0.003296 | EDN1 | |
| Hypoxia-inducible factor in the cardiovascular system | 0.004791 | EDN1 | |
| Cystic fibrosis transmembrane conductance regulator (CFTR) and beta 2 adrenergic receptor (b2AR) pathway | 0.005986 | EDN1 | |
| Corticosteroids and cardioprotection | 0.007477 | EDN1 | |
| Beta-arrestins in GPCR desensitization | 0.008372 | EDN1 | |
| Activation of cAMP-dependent protein kinase, PKA | 0.008670 | EDN1 | |
| Role of beta-arrestins in the activation and targeting of MAP kinases | 0.008967 | EDN1 | |
| Role of EGF receptor transactivation by GPCRs in cardiac hypertrophy | 0.009860 | EDN1 | |
| Roles of beta-arrestin-dependent recruitment of Src kinases in GPCR signaling | 0.01016 | EDN1 |
Figure 7(A) GO terms regarding biological process, molecular function and cellular component according to the associative P-values. (B) Cell informative pathways (KEGG, BioCarta, Reactome and WikiPathways) analysis result regarding associative P-values.
Figure 8PPIs network for identified common DEGs that refers to SARS-CoV-2 infections in human lung and PAH lung. The common genes are highlighted with purple node (SAA2, S100A9, S100A8, SAA1 and S100A12). The network consists of 125 nodes and 136 edges.
Figure 9Hub gene detection from the similar DEGs based on the PPIs network. The highlighted nodes S100A9 (red), S100A8 (orange) and SAA1 (yellow) are regarded as highly interconnected nodes, considered as hub nodes. The network is made up of 124 nodes and 135 edges.
Figure 10Highly interconnected regions (module) identification network that consists of 13 nodes and 13 edges. The hub genes S100A9 (orange) and S100A8 (orange) are visualized in the corresponding module network.
Exploration of topological results for top three hub genes
| Hub gene | Degree | Stress | Closeness centrality | Betweenness centrality |
|---|---|---|---|---|
| S100A9 | 83 | 14 008 | 102.66667 | 13 258 |
| S100A8 | 45 | 7370 | 82.75 | 7117 |
| SAA1 | 4 | 738 | 41.5 | 732 |
Figure 11TF–miRNA co-regulatory network visualization. The network includes 69 nodes and 77 edges. According to the network, there exist 35 TF genes (blue) and 28 are miRNAs (red) and they are interacted with six common DEGs (green).
Predictive drug compounds according to the concordant genes of SARS-CoV-2 and PAH samples
| Name of drugs |
| Adjusted | Genes |
|---|---|---|---|
| MIGLITOL CTD 00002031 | 0.000004943 | 0.01990 | S100A12, S100A9 |
| Bosentan CTD 00003071 | 0.003296 | 0.5529 | EDN1 |
| Coenzyme Q10 CTD 00001167 | 0.003595 | 0.5789 | EDN1 |
| Metoprolol HL60 UP | 0.00007383 | 0.04954 | S100A12, S100A9 |
| 9-(2-Phosphonomethoxypropyl)adenine CTD 00003259 | 0.004193 | 0.5821 | EDN1 |
| (+)-Chelidonine HL60 DOWN | 0.00009129 | 0.05250 | S100A9, S100A8 |
| Sildenafil CTD 00003367 | 0.004492 | 0.6028 | EDN1 |
| Norepinephrine CTD 00006417 | 0.00009879 | 0.04972 | S100A9, S100A8 |
| Dydrogesterone CTD 00005882 | 0.004791 | 0.6028 | EDN1 |
| 1,3-Dimethylthiourea CTD 00001818 | 0.004791 | 0.5845 | EDN1 |