| Literature DB >> 33024988 |
Tasnimul Alam Taz1, Kawsar Ahmed2, Bikash Kumar Paul3, Md Kawsar1, Nargis Aktar4, S M Hasan Mahmud5, Mohammad Ali Moni6.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is accountable for the cause of coronavirus disease (COVID-19) that causes a major threat to humanity. As the spread of the virus is probably getting out of control on every day, the epidemic is now crossing the most dreadful phase. Idiopathic pulmonary fibrosis (IPF) is a risk factor for COVID-19 as patients with long-term lung injuries are more likely to suffer in the severity of the infection. Transcriptomic analyses of SARS-CoV-2 infection and IPF patients in lung epithelium cell datasets were selected to identify the synergistic effect of SARS-CoV-2 to IPF patients. Common genes were identified to find shared pathways and drug targets for IPF patients with COVID-19 infections. Using several enterprising Bioinformatics tools, protein-protein interactions (PPIs) network was designed. Hub genes and essential modules were detected based on the PPIs network. TF-genes and miRNA interaction with common differentially expressed genes and the activity of TFs are also identified. Functional analysis was performed using gene ontology terms and Kyoto Encyclopedia of Genes and Genomes pathway and found some shared associations that may cause the increased mortality of IPF patients for the SARS-CoV-2 infections. Drug molecules for the IPF were also suggested for the SARS-CoV-2 infections.Entities:
Keywords: SARS-CoV-2; differentially expressed genes; drug molecule; gene ontology; hub gene; idiopathic pulmonary fibrosis; protein–protein interactions
Mesh:
Year: 2021 PMID: 33024988 PMCID: PMC7665362 DOI: 10.1093/bib/bbaa235
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Figure 1Methodical workflow for the current investigation. Two type of samples (control cells, SARS-CoV-2 infected cells) were collected from SARS-CoV-2 infected lung epithelial cells and both are included in the GSE147507 dataset. GSE147507 dataset contains a sample of SARS-CoV-2 infected lung epithelial cells and the GSE35145 dataset contains IPF affected lung samples. Common DEGs were identified from both the datasets using the R programming language. From the common DEGs, GO identification, KEGG pathway, PPIs network, TF and miRNA analysis, hub gene identification and module analysis was designed and based on those analysis drug molecule identification was performed.
Figure 2Common differentially expressed genes representation through a Venn diagram. Eleven genes were found common from the 108 differentially expressed genes of SARS-CoV-2 infection and 359 differentially expressed genes of IPF patients. The common differentially expressed genes were 2.4% among total 467 differentially expressed genes.
GO terms, GO pathways and their corresponding P-values and genes for common differentially expressed genes
| Category | GO ID | GO pathways |
| Genes |
|---|---|---|---|---|
| GO biological process | GO:0070486 | Leukocyte aggregation | 0.000005766 | S100A9, S100A8 |
| GO:0050832 | Defense response to fungus | 8.380e-8 | S100A12, S100A9, S100A8 | |
| GO:0030593 | Neutrophil chemotaxis | 1.430e-8 | SAA1, S100A12, S100A9, S100A8 | |
| GO:0071621 | Granulocyte chemotaxis | 1.792e-8 | SAA1, S100A12, S100A9, S100A8 | |
| GO:1990266 | Neutrophil migration | 2.069e-8 | SAA1, S100A12, S100A9, S100A8 | |
| GO:1900122 | Positive regulation of receptor binding | 0.003296 | MMP9 | |
| GO:0002693 | Positive regulation of cellular extravasation | 0.003296 | ICAM1 | |
| GO:0032364 | Oxygen homeostasis | 0.003296 | SOD2 | |
| GO:0003085 | Negative regulation of systemic arterial blood pressure | 0.003296 | SOD2 | |
| GO:0051549 | Positive regulation of keratinocyte migration | 0.003296 | MMP9 | |
| GO Molecular Function | GO:0050786 | RAGE receptor binding | 1.038e-8 | S100A12, S100A9, S100A8 |
| GO:0035325 | Toll-like receptor binding | 0.000009879 | S100A9, S100A8 | |
| GO:0046914 | Transition metal ion binding | 0.000001290 | S100A12, SOD2, MMP9, S100A9, S100A8 | |
| GO:0008270 | Zinc ion binding | 0.00001547 | S100A12, MMP9, S100A9, S100A8 | |
| GO:0004866 | Endopeptidase inhibitor activity | 0.001625 | SERPINA3, PI3 | |
| GO:0030145 | Manganese ion binding | 0.01909 | SOD2 | |
| GO:0030414 | Peptidase inhibitor activity | 0.01963 | PI3 | |
| GO:0061135 | Endopeptidase regulator activity | 0.01963 | PI3 | |
| GO:0005507 | Copper ion binding | 0.02233 | S100A12 | |
| GO:0046872 | Metal ion binding | 0.00006865 | S100A12, SOD2, S100A9, S100A8 | |
| GO Cellular Component | GO:0060205 | Cytoplasmic vesicle lumen | 4.624e-9 | SERPINA3, SAA1, S100A12, S100A9, S100A8 |
| GO:0071682 | Endocytic vesicle lumen | 0.009858 | SAA1 | |
| GO:0034774 | Secretory granule lumen | 0.00001872 | SERPINA3, S100A12, S100A9, S100A8 | |
| GO:0005881 | Cytoplasmic microtubule | 0.02071 | SAA1 | |
| GO:1904724 | Tertiary granule lumen | 0.02984 | MMP9 | |
| GO:0031093 | Platelet alpha granule lumen | 0.03625 | SERPINA3 | |
| GO:0045111 | Intermediate filament cytoskeleton | 0.03837 | S100A8 | |
| GO:0005856 | Cytoskeleton | 0.002467 | S100A12, S100A9, S100A8 | |
| GO:0031091 | Platelet alpha granule | 0.04841 | SERPINA3 | |
| GO:0035578 | Azurophil granule lumen | 0.04841 | SERPINA3 |
Top pathways from KEGG, WikiPathways, Reactome and BioCarta databases and their corresponding P-values and genes for common differentially expressed genes
| Databases | Pathways |
| Genes |
|---|---|---|---|
| KEGG | IL-17 signaling pathway | 0.00001563 | MMP9, S100A9, S100A8 |
| TNF signaling pathway | 0.001596 | MMP9, ICAM1 | |
| Leukocyte transendothelial migration | 0.001654 | MMP9, ICAM1 | |
| African trypanosomiasis | 0.02017 | ICAM1 | |
| Bladder cancer | 0.02233 | MMP9 | |
| Fluid shear stress and atherosclerosis | 0.002531 | MMP9, ICAM1 | |
| Malaria | 0.02663 | ICAM1 | |
| Viral myocarditis | 0.03198 | ICAM1 | |
|
| 0.03678 | ICAM1 | |
| Peroxisome | 0.04473 | SOD2 | |
| WikiPathways | Vitamin B12 Metabolism WP1533 | 3.630e-11 | SERPINA3, SAA1, SAA2, SOD2, ICAM1 |
| IL1 and megakaryocytes in obesity WP2865 | 2.489e-7 | MMP9, S100A9, ICAM1 | |
| Folate Metabolism WP176 | 1.525e-10 | SERPINA3, SAA1, SAA2, SOD2, ICAM1 | |
| Selenium Micronutrient Network WP15 | 5.914e-10 | SERPINA3, SAA1, SAA2, SOD2, ICAM1 | |
| Mammary gland development pathway - Involution (Stage 4 of 4) WP2815 | 0.005488 | MMP9 | |
| Photodynamic therapy-induced NF-kB survival signaling WP3617 | 0.0001620 | MMP9, ICAM1 | |
| Osteopontin Signaling WP1434 | 0.007129 | MMP9 | |
| Platelet-mediated interactions with vascular and circulating cells WP4462 | 0.009313 | ICAM1 | |
| Cells and Molecules involved in local acute inflammatory response WP4493 | 0.009313 | ICAM1 | |
| Extracellular vesicles in the crosstalk of cardiac cells WP4300 | 0.01040 | MMP9 | |
| Reactome | DEx/H-box helicases activate type I IFN and inflammatory cytokines production | 0.00002138 | SAA1, S100A12 |
| Advanced glycosylation endproduct receptor signaling Homo sapiens R-HSA-879415 | 0.00002138 | SAA1, S100A12 | |
| Scavenging by Class B Receptors Homo sapiens R-HSA-3000471 | 0.002747 | SAA1 | |
| RIP-mediated NFkB activation via ZBP1 Homo sapiens R-HSA-1810476 | 0.00005742 | SAA1, S100A12 | |
| TRAF6 mediated NF-kB activation Homo sapiens R-HSA-933542 | 0.00007540 | SAA1, S100A12 | |
| ZBP1(DAI) mediated induction of type I IFNs Homo sapiens R-HSA-1606322 | 0.00008873 | SAA1, S100A12 | |
| TAK1 activates NFkB by phosphorylation and activation of IKKs complex Homo sapiens R-HSA-445989 | 0.00008873 | SAA1, S100A12 | |
| Formyl peptide receptors bind formyl peptides and many other ligands Homo sapiens R-HSA-444473 | 0.004392 | SAA1 | |
| Cytosolic sensors of pathogen-associated DNA Homo sapiens R-HSA-1834949 | 0.0005787 | SAA1, S100A12 | |
| TRAF6 Mediated Induction of proinflammatory cytokines Homo sapiens R-HSA-168180 | 0.0006883 | SAA1, S100A12 | |
| BioCarta | Inhibition of Matrix Metalloproteinases Homo sapiens h reckPathway | 0.004392 | MMP9 |
| Cardiac Protection Against ROS Homo sapiens h flumazenilPathway | 0.006035 | SOD2 | |
| Erythropoietin mediated neuroprotection through NF-kB Homo sapiens h eponfkbPathway | 0.007129 | SOD2 | |
| The IGF-1 Receptor and Longevity Homo sapiens h longevity pathway | 0.008767 | SOD2 |
Figure 3(A) Biological process, molecular function and cellular component related GO terms identification result according to combined score. The higher the enrichment score, the higher number of genes are involved in a certain ontology. (B) Pathway analysis result identification through KEGG, WikiPathways, Reactome and BioCarta. The results of the pathway terms were identified through the combined score.
Figure 4Protein–protein interactions (PPIs) network for identified common differentially expressed genes that are shared by two diseases (COVID-19 and IPF). Nodes in orange color indicate common differentially expressed genes and edges specify the interconnection in the middle of two genes. The analyzed network holds 60 nodes and 403 edges.
Figure 5Detection of hub genes from the PPIs network of common differentially expressed genes. The highlighted five genes are VEGFA, AKT1, MMP9, ICAM1 and CD44. These five genes are considered as hub genes according to their degree value. The network has 53 nodes and 378 edges. According to topological analysis, the degree value of VEGFA and AKT1 was 38. The degree value of MMP9, ICAM1 and CD44 were 34, 29 and 27, respectively.
Figure 6Module analysis network obtained from Figure 4 PPIs network. ICAM1 and MMP9 are highlighted in red color as these two hub nodes are common between GSE147507 and GSE35145. The network represents highly interconnected regions of the PPIs network. The network holds 16 nodes and 106 edges.
Topological result exploration for top five hub genes where the network density is 0.274, network diameter 3 and network radius 2
| Hub gene | Degree | Stress | Closeness centrality | Betweenness centrality | Distance | Eccentricity | Edge betweenness | Transitivity |
|---|---|---|---|---|---|---|---|---|
| VEGFA | 38 | 2248 | 48.1667 | 362.6209 | 1.269230 | 2 | 0.121695 | 0.34424 |
| AKT1 | 38 | 2502 | 48.1667 | 396.249 | 1.269230 | 2 | 0.133888 | 0.35277 |
| MMP9 | 34 | 2126 | 46.5 | 343.5242 | 1.346153 | 2 | 0.095339 | 0.38324 |
| ICAM1 | 29 | 1594 | 44 | 242.5716 | 1.442307 | 2 | 0.064545 | 0.42365 |
| CD44 | 27 | 1060 | 42.3333 | 189.3317 | 1.480769 | 2 | 0.064571 | 0.4359 |
Figure 7Network for TF-gene interaction with common differentially expressed genes. The highlighted blue color node represents the common genes and other nodes represent TF-genes. The network consists of 142 nodes and 180 edges.
Figure 8The network presents the TF-miRNA coregulatory network. The network consists of 101 nodes and 131 edges including 53 TF-genes, 39 miRNA and nine differentially expressed genes. The nodes in pink color are the differentially expressed genes, a yellow node represents miRNA and other nodes indicate TF-genes.
Suggested top drug compounds for the IPF-2 infections
| Name of drugs |
| Adjusted | Genes |
|---|---|---|---|
| MIGLITOL CTD 00002031 | 0.00001810 | 0.004285 | S100A12, S100A9 |
| CHEMBL55802 CTD 00003118 | 0.00002876 | 0.005514 | MMP9, ICAM1 |
| Hesperidin CTD 00006087 | 0.00004187 | 0.007024 | MMP9, ICAM1 |
| Cytochalasin D CTD 00007076 | 0.00005197 | 0.007472 | MMP9, ICAM1 |
| Prolinedithiocarbamate CTD 00002658 | 0.00007540 | 0.008928 | MMP9, ICAM1 |
| Parthenolide CTD 00000087 | 0.000002540 | 0.001705 | SAA1, MMP9, ICAM1 |
| FEXOFENADINE HYDROCHLORIDE CTD 00003191 | 0.00008193 | 0.009163 | MMP9, ICAM1 |
| Hydroxytyrosol CTD 00000267 | 0.00008193 | 0.008915 | MMP9, ICAM1 |
| Antimycin A CTD 00005427 | 0.00008873 | 0.009401 | SOD2, ICAM1 |
| Anacardic acid C15:3 CTD 00003117 | 0.00008873 | 0.009160 | MMP9, ICAM1 |