Identification of biomarkers and pathways for the SARS-CoV-2 infections that make complexities in pulmonary arterial hypertension patients.

Tasnimul Alam Taz, Kawsar Ahmed, Bikash Kumar Paul, Fahad Ahmed Al-Zahrani, S M Hasan Mahmud, Mohammad Ali Moni.

Abstract

This study aimed to identify significant changes in the gene expression profiles of human lung epithelial cells caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. We performed a comparative genomic analysis of SARS-CoV and SARS-CoV-2. A phylogenetic tree constructed for this analysis confirmed the genomic variance between SARS-CoV and SARS-CoV-2. Transcriptomic analyses were performed on SARS-CoV-2 infection responses and on the lungs of pulmonary arterial hypertension (PAH) patients, because a number of patients have been identified who developed PAH after being diagnosed with coronavirus disease 2019 (COVID-19). Gene expression profiling showed significant expression changes both in human lung epithelial cells responding to SARS-CoV-2 infection and in PAH lungs. Identification and integration of differentially expressed genes revealed six concordant genes (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1) for the SARS-CoV-2 and PAH samples; of these, S100A9 and S100A8 showed significant interactions in the protein-protein interactions network. Extensive gene ontology and signaling pathway analyses provided evidence of inflammatory responses to SARS-CoV-2 infection. The altered signaling and ontology pathways that emerged from this research may inform the development of effective drugs, especially for people with preexisting conditions. Identification of regulatory biomolecules revealed the presence of an active promoter gene of SARS-CoV-2 in the transcription factor-microRNA (TF-miRNA) co-regulatory network. Predictive drug analyses provided concordant drug compounds that are associated with both SARS-CoV-2 infection responses and PAH lung samples, and these compounds showed a significant immune response against RNA viruses such as SARS-CoV-2, which is beneficial for therapeutic development during the COVID-19 pandemic.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  COVID-19; SARS-CoV; SARS-CoV-2; pulmonary arterial hypertension; transcriptomic profiling

Year:  2021        PMID: 33611340      PMCID: PMC7929374          DOI: 10.1093/bib/bbab026

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


Introduction

Coronavirus disease 2019 (COVID-19) is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a virus that belongs to the Coronaviridae family [1]. The rapid spread of this virus has driven up the death rate and made COVID-19 the most destructive global epidemic of the 21st century. SARS-CoV-2 enters human host cells by binding to human angiotensin-converting enzyme 2 (ACE2), which eventually leads to the intense spread of this lethal virus among humans [2]. The spike protein is considered a potential therapeutic target against SARS-CoV-2 [3, 4]. The first death from a severe case of COVID-19 was reported on 11 January 2020 [5]. As of 10 September 2020, there were 27 688 740 confirmed COVID-19 cases worldwide, including 899 315 deaths (https://covid19.who.int/). A majority of COVID-19 patients are male (54.3%), and the mortality rate is higher among elderly patients (15%) than among younger patients [6]. Because COVID-19 has spread so rapidly, vaccine production has not kept pace with demand. Person-to-person transmission of SARS-CoV-2 occurs mostly through respiratory droplets [7]. The prevalence of SARS-CoV-2 is increasing because presymptomatic infections are difficult to detect [8]. Pulmonary arterial hypertension (PAH) is a progressive disorder that affects the arteries of the lungs and causes right heart affliction [9]. Dyspnea, fatigue and chest pain are among the major symptoms of PAH, which is closely associated with the pulmonary vasculature and causes premature death [10]. Although early diagnosis and treatment can reduce the death rate of PAH [11], COVID-19 has caused many people to suffer from cardiac, age-related and pulmonary diseases, including PAH [12].
Meanwhile, researchers have shown that SARS-CoV-2 promotes pulmonary microthrombi and vascular leak through several mechanisms, including inflammation, DNA damage and mitochondrial dysfunction [13, 14]. Based on these studies, PAH can be considered a major risk factor for COVID-19. For these reasons, COVID-19 and PAH may share a number of pathological features. To explore this overlap, we sought to identify altered pathways that are common to SARS-CoV-2 infection and PAH-affected samples, using large-scale transcriptomic datasets. Large-scale microarray datasets are important for uncovering gene expression-based biological information [15]. High-throughput sequencing has greatly advanced biomedical research by contributing to the rapidly growing field of genome sequencing [16]. High-throughput sequencing-based analysis has already been applied to SARS-CoV, where it also produced remarkable gene expression results [17]. The significance of this research is that we performed the largest comparative and transcriptomic study of SARS-CoV-2 infection responses in human lung epithelial cells. The potential biomarkers we identified proved significant in terms of appropriate immune responses. The following analyses attempt to find cell informative pathways and drug compounds based on the transcriptomic analysis of SARS-CoV-2 and PAH. First, however, a genomic analysis was carried out to identify the genomic differences between SARS-CoV and SARS-CoV-2, both of which infect Homo sapiens. This genomic-level study allowed the research to focus on SARS-CoV-2 and its major risk factors. Two datasets (GSE147507 and GSE117261) were then selected for the transcriptomic-level study.
Hence, differentially expressed genes (DEGs) were identified from GSE147507 and GSE117261. The DEGs shared by both datasets (the similar DEGs) were then used as input for further molecular-level analyses, including gene ontology (GO) term identification and predictive analysis of cell informative pathways. Visualization of the protein–protein interactions (PPIs) network is the focal point of the analysis, because hub nodes and significant modules were identified from the PPIs network. Transcriptional regulators were also traced based on the similar DEGs of GSE147507 and GSE117261. Finally, potential drug compounds are suggested. The workflow of the research is presented in Figure 1.
Figure 1

The workflow of the current analysis. Genomic differences between SARS-CoV and SARS-CoV-2 are visualized through a phylogenetic analysis. Two datasets, GSE147507 and GSE117261, were collected for SARS-CoV-2 infection in human lung epithelial cells and PAH lung, respectively. Differentially expressed genes (DEGs) were identified using the R programming language, and similar DEGs were identified from the total DEGs of both datasets. The similar DEGs were used to perform transcriptomic analyses. Gene expression profiling was performed for both datasets, followed by gene ontology (GO) term, cell informative pathway, PPIs network, hub gene identification and TF–miRNA-based analyses. Drug compounds were predicted from the similar DEGs.

Methodology

Comprehensive genomic-level phylogenetic study

SARS-CoV and SARS-CoV-2 were compared at the viral genomic level using a collection of genome sequences gathered from the Virus Pathogen Database and Analysis Resource (https://www.viprbrc.org/). A total of 32 sequences were analyzed, 16 each for SARS-CoV and SARS-CoV-2. The sequences for SARS-CoV are as follows: JN247391, JN247392, JN247393, JN247394, JN247395, JN247396, JN247397, GU553363, GU553364, AY274119, MK062179, MK062180, MK062181, MK062182, MK062183 and MK062184. The sequences for SARS-CoV-2 are as follows: MT008022, MT008023, MN988668, MN988669, LC521925, LC522972, LC522973, LC522974, LC522975, MN938385, MN938387, MN938384, MN938388, MN938386, MN938389 and MN938390. From these sequences, a PHYLIP-formatted phylogenetic guide tree was built using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/). Clustal Omega offers significant features and exploits comprehensive information based on sequence alignments [18]. The phylogenetic tree was then redrawn using the Interactive Tree of Life (iTOL) (https://itol.embl.de/). iTOL provides customizable graphical representations of phylogenetic trees [19].
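
As a small bookkeeping aid, the accession lists above can be checked before they are submitted for alignment. The sketch below is only an illustration: the accessions are copied from the text, while the variable names and the check_accessions helper are hypothetical and are not part of the original workflow.

```python
# Accessions copied from the text; the variable and function names are illustrative.
SARS_COV = [
    "JN247391", "JN247392", "JN247393", "JN247394", "JN247395", "JN247396",
    "JN247397", "GU553363", "GU553364", "AY274119", "MK062179", "MK062180",
    "MK062181", "MK062182", "MK062183", "MK062184",
]
SARS_COV_2 = [
    "MT008022", "MT008023", "MN988668", "MN988669", "LC521925", "LC522972",
    "LC522973", "LC522974", "LC522975", "MN938385", "MN938387", "MN938384",
    "MN938388", "MN938386", "MN938389", "MN938390",
]


def check_accessions(group_a, group_b):
    """Return (size_a, size_b, total) after checking for duplicates and overlap."""
    if len(set(group_a)) != len(group_a) or len(set(group_b)) != len(group_b):
        raise ValueError("duplicate accession within a group")
    if set(group_a) & set(group_b):
        raise ValueError("accession appears in both groups")
    return len(group_a), len(group_b), len(group_a) + len(group_b)


print(check_accessions(SARS_COV, SARS_COV_2))  # (16, 16, 32)
```

The check confirms the 16 + 16 = 32 sequences stated in the text and that no accession is assigned to both viruses.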

Detailed information on the datasets

The GSE147507 and GSE117261 datasets were obtained from the Gene Expression Omnibus (GEO) database [20]. GEO, hosted by the National Center for Biotechnology Information, provides gene expression data for analysis [21]. The GSE147507 dataset captures host transcriptional responses to SARS-CoV-2 in lung epithelial cells. Its RNA sequencing data were generated on the GPL18573 Illumina NextSeq 500 (H. sapiens) platform. The GSE147507 dataset was contributed by Blanco-Melo et al. [22]. The GSE117261 dataset, in contrast, represents a transcriptomic and systems biology analysis of PAH lung. It uses the GPL6244 platform, [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array [transcript (gene) version]. GSE117261 consists of 83 samples in total: 58 PAH lung samples and 25 control lung samples.

Data filtering and retrieval of DEGs, and identification of common DEGs between SARS-CoV-2 and PAH

The transcriptomic datasets GSE147507 (SARS-CoV-2 infection in human lung epithelial cells) and GSE117261 (PAH lung) are used for this research. The initial preprocessing phase retrieves DEGs for both datasets. DEGs for GSE147507 were identified in the R programming language, using the limma [23] and DESeq2 [24] packages. An absolute log2 fold change >1.0 and an adjusted P-value <0.05 were used as cutoff criteria for significant DEGs in the GSE147507 dataset. GEO2R (https://www.ncbi.nlm.nih.gov/geo/geo2r/), a web-based platform for the analysis of microarray datasets, was used to identify DEGs for the GSE117261 dataset. GEO2R compares affected samples against control samples using the limma and GEOquery [25] packages from the Bioconductor [26] project in R. The Benjamini–Hochberg method was applied to both the GSE147507 and GSE117261 datasets to control the false discovery rate [27]. Similar DEGs were also identified using the R programming language.
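
The cutoff logic described above can be sketched in a few lines. This is a minimal illustration in plain Python, not the limma/DESeq2/GEO2R code used in the study: the function names, gene labels and numeric values are hypothetical, and only the thresholds (absolute log2 fold change >1.0, Benjamini–Hochberg adjusted P-value <0.05) come from the text.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_degs(genes, log2_fc, p_values, fc_cutoff=1.0, alpha=0.05):
    """Keep genes with |log2FC| > fc_cutoff and adjusted P-value < alpha."""
    adjusted = benjamini_hochberg(p_values)
    return [
        gene
        for gene, fc, q in zip(genes, log2_fc, adjusted)
        if abs(fc) > fc_cutoff and q < alpha
    ]


# Hypothetical toy input, not values from GSE147507 or GSE117261.
genes = ["geneA", "geneB", "geneC", "geneD"]
log2_fc = [2.3, -0.4, -1.8, 1.2]
p_values = [0.001, 0.0005, 0.02, 0.2]
print(select_degs(genes, log2_fc, p_values))  # ['geneA', 'geneC']
```

In the toy input, geneB fails the fold-change cutoff despite a small P-value, and geneD fails the adjusted P-value cutoff despite a large fold change, which is why both criteria are applied together.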

GO and cell informative pathways analysis

Gene set enrichment analysis is a computational and statistical method that determines whether a defined set of genes shows statistically significant differences between biological conditions [28]. GO resources provide structured, computable information about the functions of gene products [29, 30]. For the annotation of gene products, GO is divided into three categories: molecular function, biological process and cellular component [31]. GO terms for the current study were obtained using the Enrichr (https://amp.pharm.mssm.edu/Enrichr/) platform. Enrichr is a web-based program that contains a large collection of gene sets organized into 102 libraries and supports genome-based analyses [32]. For cell informative pathway analysis, the Kyoto Encyclopedia of Genes and Genomes (KEGG) [33], Reactome [34], WikiPathways [35] and BioCarta databases were employed. The results from these databases were also obtained through the Enrichr platform.
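
To make the idea of term enrichment concrete, the sketch below computes a generic over-representation P-value from the hypergeometric distribution. It is an illustration under stated assumptions, not a reproduction of Enrichr's scoring: the function name, the term size of 50 and the background of 20 000 genes are all hypothetical.

```python
from math import comb


def enrichment_p_value(overlap, query_size, term_size, background_size):
    """P(X >= overlap) when query_size genes are drawn from the background
    and term_size of the background genes are annotated to the term."""
    upper = min(query_size, term_size)
    total = comb(background_size, query_size)
    tail = sum(
        comb(term_size, k) * comb(background_size - term_size, query_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical example: 4 of 6 query genes fall in a term with 50 annotated
# genes, against an assumed background of 20 000 genes.
p = enrichment_p_value(overlap=4, query_size=6, term_size=50, background_size=20000)
print(f"{p:.3e}")
```

With a query of only six genes, even a modest overlap with a small term yields a very small P-value, which is consistent with the small P-values reported for terms that contain three or four of the six concordant genes.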

Designing of PPIs network

Analyzing protein interactions yields important information about protein function and is regarded as a primary step in drug discovery and systems biology [36]. Many complex biological processes can be characterized through the study of PPIs networks [37, 38]. The similar DEGs identified for SARS-CoV-2 and PAH lung were provided as input to InnateDB [39] through the NetworkAnalyst (https://www.networkanalyst.ca/) web-based platform. NetworkAnalyst supports visual analysis of many kinds of omics data, including complex PPIs networks [40]. The network was further refined using Cytoscape (https://cytoscape.org/). Cytoscape is a prominent tool for integrating protein interactions and genetic interactions [41].

Establishment of the topological algorithm on the PPIs network and detection of hub nodes

Hub nodes are generally defined as the highly interconnected nodes in a large-scale complex PPIs network [42]. The hub nodes in the current research were determined by the degree topological algorithm, applied to the PPIs network using cytoHubba (http://apps.cytoscape.org/apps/cytohubba), a Cytoscape plugin. cytoHubba provides 11 topological algorithms to rank the nodes in a network [43]. Regions of the PPIs network where the hub genes are highly interconnected are regarded as prominent modules. Separating these modules from the PPIs network gives a clearer view of the hub nodes within each module. Module analysis of the PPIs network was performed with ClusterViz (http://apps.cytoscape.org/apps/clusterviz), which is also a Cytoscape plugin. ClusterViz identifies clusters and detects functional modules in several types of networks, including PPIs, metabolic and gene networks [44].
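
Degree-based ranking is simple enough to sketch directly. The example below is a plain-Python illustration of the idea, not cytoHubba itself: the rank_by_degree function and the partner labels P1 to P5 are hypothetical, and the toy edge list is not the network from this study.

```python
from collections import Counter


def rank_by_degree(edges, top_n=3):
    """Rank nodes of an undirected network by their number of distinct neighbours."""
    # Normalise edge direction, drop self-loops and duplicate edges.
    unique = {tuple(sorted(edge)) for edge in edges if edge[0] != edge[1]}
    degree = Counter()
    for a, b in unique:
        degree[a] += 1
        degree[b] += 1
    # Sort by descending degree, then by name for a deterministic order.
    return sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


# Hypothetical toy network; P1..P5 stand in for unnamed interaction partners.
toy_edges = [
    ("S100A9", "P1"), ("S100A9", "P2"), ("S100A9", "P3"),
    ("S100A9", "S100A8"), ("S100A8", "P1"), ("S100A8", "P4"),
    ("SAA1", "P5"), ("P1", "S100A9"),  # duplicate of the first edge, reversed
]
print(rank_by_degree(toy_edges))  # [('S100A9', 4), ('S100A8', 3), ('P1', 2)]
```

Deduplicating edges first matters because interaction exports often list the same pair in both directions, which would otherwise inflate the degree of well-studied proteins.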

Analysis of TF–miRNA co-regulatory network

The RegNetwork repository was used for the analysis of the TF–miRNA co-regulatory network [45]. The miRNAs and TFs that regulate the DEGs at the transcriptional and posttranscriptional levels were identified from the co-regulatory network. The network was visualized using the NetworkAnalyst web-based platform. NetworkAnalyst has become a leading bioinformatics tool for system-level understanding of data, in response to the rapid growth of gene expression datasets [46, 47].

Therapeutic drug compounds prediction

Based on the similar DEGs, a number of drug compounds were predicted from the Drug Signatures Database (DSigDB) using the Enrichr platform. DSigDB consists of 22 527 gene sets, 19 531 genes and 17 389 unique compounds [48]. DSigDB predominantly predicts drugs from gene expression-based datasets, and each gene set is regarded as the set of target genes of a compound [48]. Using the Enrichr web platform, genome-based characterization, including RNA-, DNA- and protein-based biomedical, pharmacological and biological information, can be gathered more accurately and at low cost [49].

Results

Genomic and phylogram differences between SARS-CoV and SARS-CoV-2

Genomic differences were observed through phylogenetic analysis of SARS-CoV and SARS-CoV-2. The 16 SARS-CoV genome sequences date from 2003 to 2018 and come from human hosts. The other 16 genome sequences, for SARS-CoV-2, date from 2019 to 2020 and also come from human hosts. The phylogenetic analysis shows that SARS-CoV and SARS-CoV-2 do not form a common clade, although the samples share an ancestral origin among themselves. This distinguishes SARS-CoV and SARS-CoV-2 at the genomic level. The phylogenetic visualization of the SARS-CoV and SARS-CoV-2 genome sequences is displayed in Figure 2.
Figure 2

Phylogram of SARS-CoV and SARS-CoV-2, which provides genomic differences between human coronaviruses of 2003–2018 (SARS-CoV) and 2019–2020 (SARS-CoV-2). Two colors are implemented to differentiate SARS-CoV (purple) and SARS-CoV-2 (green).

Gene expression analysis of PAH patients and SARS-CoV-2 infected human lung epithelial and associative cells

From the GSE147507 dataset, 24 samples were selected; these samples involve SARS-CoV-2 infection of primary human bronchial epithelial cells, lung adenocarcinoma cells and lung biopsy cells. The expression of the top 20 genes across the selected samples is visualized in Figure 3, which shows high expression of the S100A9 and KRT5 genes. In addition, of the 83 PAH lung and healthy control samples, gene expression is presented for 20 samples: three healthy controls (GSM3290083, GSM3290086 and GSM3290085), with the remainder being PAH samples. Hierarchical clustering separates the PAH samples from the healthy controls into distinct groups, and comparing the two at the RNA level shows a different response in PAH samples than in healthy controls (Figure 4A). A volcano plot, drawn with an adjusted P-value <0.05 threshold, shows the upregulated and downregulated genes identified by comparing PAH samples with normal samples in the GSE117261 dataset (Figure 4B).
Figure 3

Gene expression profiling of SARS-CoV-2 infection in human lung epithelial cells for the top 20 genes and selected 24 samples from the GSE147507 dataset.

Figure 4

(A) Gene expression visualization of healthy controls (GSM3290083, GSM3290086 and GSM3290085) and PAH samples. (B) Volcano plot shows the regulation of genes (upregulated and downregulated) for GSE117261.

Common DEGs identifications for further molecular analysis and ensuring the efficiency of predictive drugs

To observe SARS-CoV-2 infection responses in human lung epithelial cells, the DEGs of the GSE147507 dataset were identified. A total of 108 DEGs were found, of which 93 are upregulated and the remaining 15 are downregulated. The comparison between PAH lung and healthy controls for GSE117261 yields a total of 59 DEGs, of which 27 are upregulated and 32 are downregulated. Comparing SARS-CoV-2 infection responses and PAH samples, six DEGs (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1) are concordant; these were used to identify GO terms and pathways, to build the PPIs network, to identify hub nodes and modules, to analyze TF–miRNA regulation and to predict drug compounds. The concordance between the two datasets is visualized using a Venn diagram (Figure 5A). The heat map of log fold change for the genes shared between SARS-CoV-2 and PAH shows an unparalleled transcriptional signature induced by SARS-CoV-2 infection (Figure 5B). Gene validation is provided by a heat map of gene risk groups, which indicates that S100A9 and S100A8 are highly prone to inflammation (Figure 6A). The boxplot of the risk group comparison also indicates that S100A9 and S100A8 are highly risk prone (Figure 6B).
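
The Venn-diagram counts follow from a plain set intersection. The sketch below is illustrative only: the six shared gene symbols and the list sizes (108 and 59) come from the text, whereas the pad helper and the placeholder names such as covid_only_0 are hypothetical stand-ins for the dataset-specific DEGs, which are not listed here.

```python
SHARED = {"SAA2", "S100A9", "S100A8", "SAA1", "S100A12", "EDN1"}


def pad(shared, prefix, total):
    """Add placeholder names so the set reaches the reported DEG count."""
    return shared | {f"{prefix}_{i}" for i in range(total - len(shared))}


def venn_counts(set_a, set_b):
    """Return (only in A, in both, only in B)."""
    return len(set_a - set_b), len(set_a & set_b), len(set_b - set_a)


covid_degs = pad(SHARED, "covid_only", 108)  # 93 up + 15 down in the text
pah_degs = pad(SHARED, "pah_only", 59)       # 27 up + 32 down in the text

print(sorted(covid_degs & pah_degs))
print(venn_counts(covid_degs, pah_degs))  # (102, 6, 53)
```

The three counts correspond to the two outer regions and the overlap of the Venn diagram: 102 DEGs specific to GSE147507, six shared, and 53 specific to GSE117261.
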
Figure 5

(A) Concordant gene identification between the GSE147507 and GSE117261 datasets, showing six common differentially expressed genes among the 108 genes of GSE147507 (COVID-19) and the 59 genes of GSE117261 (PAH). (B) Heat map of the log fold changes for the common DEGs shared between the COVID-19 dataset and the PAH dataset.

Figure 6

(A) Heat map for the identification of highly risk prone nature of S100A9 and S100A8 genes. (B) Risk group comparisons between the shared common genes of SARS-CoV-2 and PAH.

GO and pathway analysis based on the similar DEGs

After the identification of the unique DEGs aligned with the SARS-CoV-2 infection profile in lung epithelial cells, a number of databases (KEGG, Reactome, WikiPathways, BioCarta and GO) were used to identify GO terms and cell informative pathways. Among all the GO terms, the top 10 biological processes, cellular components and molecular functions were predicted (Table 1). The biological process analysis highlights neutrophil chemotaxis, granulocyte chemotaxis and regulation of inflammatory response, according to the number of interacting genes. The molecular function analysis shows enrichment of calcium ion binding, zinc ion binding, transition metal ion binding and metal ion binding. Among cellular components, cytoplasmic vesicle lumen is significantly associated with the identified DEGs, which relate to SARS-CoV-2 infection responses in the human lung. The top pathways based on the DEGs are also listed (Table 2). The IL-17 signaling pathway, the TNF signaling pathway and Vitamin B12 metabolism are among the top pathways identified through the analysis of the curated databases. The comparison of GO terms is shown in Figure 7A, and the comparison of pathways across databases is shown in Figure 7B.
Table 1

The association of concordant genes in GO terms and GO pathways and the proportional P-values

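| Category | GO ID | Term | P-value | Genes |
|---|---|---|---|---|
| GO biological process | GO:0030593 | Neutrophil chemotaxis | 6.563e-10 | SAA1, S100A12, S100A9, S100A8 |
| GO biological process | GO:0071621 | Granulocyte chemotaxis | 8.230e-10 | SAA1, S100A12, S100A9, S100A8 |
| GO biological process | GO:1990266 | Neutrophil migration | 9.506e-10 | SAA1, S100A12, S100A9, S100A8 |
| GO biological process | GO:0050832 | Defense response to fungus | 1.018e-8 | S100A12, S100A9, S100A8 |
| GO biological process | GO:0050727 | Regulation of inflammatory response | 6.777e-8 | SAA1, S100A12, S100A9, S100A8 |
| GO biological process | GO:0051091 | Positive regulation of sequence-specific DNA-binding transcription factor activity | 1.915e-7 | EDN1, S100A12, S100A9, S100A8 |
| GO biological process | GO:0050729 | Positive regulation of inflammatory response | 9.257e-7 | S100A12, S100A9, S100A8 |
| GO biological process | GO:0031349 | Positive regulation of defense response | 9.647e-7 | S100A12, S100A9, S100A8 |
| GO biological process | GO:0070486 | Leukocyte aggregation | 0.000001574 | S100A9, S100A8 |
| GO biological process | GO:0032103 | Positive regulation of response to external stimulus | 0.000001745 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0050786 | RAGE receptor binding | 1.259e-9 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0035325 | Toll-like receptor binding | 0.000002697 | S100A9, S100A8 |
| GO molecular function | GO:0005509 | Calcium ion binding | 0.00005490 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0008270 | Zinc ion binding | 0.00006592 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0046914 | Transition metal ion binding | 0.0001507 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0046872 | Metal ion binding | 0.0002040 | S100A12, S100A9, S100A8 |
| GO molecular function | GO:0008017 | Microtubule binding | 0.001383 | S100A9, S100A8 |
| GO molecular function | GO:0015631 | Tubulin binding | 0.002348 | S100A9, S100A8 |
| GO molecular function | GO:0005507 | Copper ion binding | 0.01224 | S100A12 |
| GO cellular component | GO:0060205 | Cytoplasmic vesicle lumen | 2.453e-8 | SAA1, S100A12, S100A9, S100A8 |
| GO cellular component | GO:0071682 | Endocytic vesicle lumen | 0.005388 | SAA1 |
| GO cellular component | GO:0005881 | Cytoplasmic microtubule | 0.01135 | SAA1 |
| GO cellular component | GO:0034774 | Secretory granule lumen | 0.00007614 | S100A12, S100A9, S100A8 |
| GO cellular component | GO:0045111 | Intermediate filament cytoskeleton | 0.02111 | S100A8 |
| GO cellular component | GO:0005856 | Cytoskeleton | 0.0003296 | S100A12, S100A9, S100A8 |
| GO cellular component | GO:0030139 | Endocytic vesicle | 0.03197 | SAA1 |
| GO cellular component | GO:0005874 | Microtubule | 0.06138 | SAA1 |
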
Table 2

The association of concordant genes in KEGG, WikiPathways, Reactome and BioCarta databases and the proportional P-values

| Database | Pathway | P-value | Genes |
|---|---|---|---|
| KEGG | Interleukin 17 (IL-17) signaling pathway | 0.0003170 | S100A9, S100A8 |
| KEGG | Renin secretion | 0.02052 | EDN1 |
| KEGG | Hypertrophic cardiomyopathy (HCM) | 0.02523 | EDN1 |
| KEGG | AGE–RAGE signaling pathway in diabetic complications | 0.02963 | EDN1 |
| KEGG | HIF-1 signaling pathway | 0.02963 | EDN1 |
| KEGG | Melanogenesis | 0.02992 | EDN1 |
| KEGG | Tumor necrosis factor (TNF) signaling pathway | 0.03255 | EDN1 |
| KEGG | Relaxin signaling pathway | 0.03838 | EDN1 |
| KEGG | Vascular smooth muscle contraction | 0.03896 | EDN1 |
| KEGG | Fluid shear stress and atherosclerosis | 0.04099 | EDN1 |
| WikiPathways | Vitamin B12 metabolism WP1533 | 0.00009129 | SAA1, SAA2 |
| WikiPathways | Folate metabolism WP176 | 0.0001595 | SAA1, SAA2 |
| WikiPathways | IL1 and megakaryocytes in obesity WP2865 | 0.007179 | S100A9 |
| WikiPathways | Physiological and pathological hypertrophy of the heart WP1528 | 0.007477 | EDN1 |
| WikiPathways | Selenium micronutrient network WP15 | 0.0002711 | SAA1, SAA2 |
| WikiPathways | Endothelin pathways WP2197 | 0.009860 | EDN1 |
| WikiPathways | Photodynamic therapy-induced HIF-1 survival signaling WP3614 | 0.01105 | EDN1 |
| WikiPathways | Melatonin metabolism and effects WP3298 | 0.01105 | EDN1 |
| WikiPathways | Prostaglandin synthesis and regulation WP98 | 0.01343 | EDN1 |
| WikiPathways | Vitamin D receptor pathway WP2877 | 0.001206 | S100A9, S100A8 |
| Reactome | Advanced glycosylation endproduct receptor signaling H. sapiens R-HSA-879415 | 0.000005841 | SAA1, S100A12 |
| Reactome | DEx/H-box helicases activate type I IFN and inflammatory cytokines production H. sapiens R-HSA-3134963 | 0.000005841 | SAA1, S100A12 |
| Reactome | Scavenging by Class B receptors H. sapiens R-HSA-3000471 | 0.001499 | SAA1 |
| Reactome | RIP-mediated NFkB activation via ZBP1 H. sapiens R-HSA-1810476 | 0.00001571 | SAA1, S100A12 |
| Reactome | TRAF6-mediated NF-kB activation H. sapiens R-HSA-933542 | 0.00002064 | SAA1, S100A12 |
| Reactome | ZBP1(DAI)-mediated induction of type I IFNs H. sapiens R-HSA-1606322 | 0.00002430 | SAA1, S100A12 |
| Reactome | TAK1 activates NFkB by phosphorylation and activation of IKKs complex H. sapiens R-HSA-445989 | 0.00002430 | SAA1, S100A12 |
| Reactome | Formyl peptide receptors bind formyl peptides and many other ligands H. sapiens R-HSA-444473 | 0.002398 | SAA1 |
| Reactome | Cytosolic sensors of pathogen-associated DNA H. sapiens R-HSA-1834949 | 0.0001595 | SAA1, S100A12 |
| Reactome | TRAF6-mediated induction of proinflammatory cytokines H. sapiens R-HSA-168180 | 0.0001899 | SAA1, S100A12 |
| BioCarta | G-protein signaling through tubby proteins H. sapiens h tubbyPathway | 0.002997 | EDN1 |
| BioCarta | Activation of PKC through G-protein-coupled receptors H. sapiens h pkcPathway | 0.003296 | EDN1 |
| BioCarta | Hypoxia-inducible factor in the cardiovascular system H. sapiens h hifPathway | 0.004791 | EDN1 |
| BioCarta | Cystic fibrosis transmembrane conductance regulator (CFTR) and beta 2 adrenergic receptor (b2AR) pathway H. sapiens h cftrPathway | 0.005986 | EDN1 |
| BioCarta | Corticosteroids and cardioprotection H. sapiens h gcrPathway | 0.007477 | EDN1 |
| BioCarta | Beta-arrestins in GPCR desensitization H. sapiens h bArrestinPathway | 0.008372 | EDN1 |
| BioCarta | Activation of cAMP-dependent protein kinase, PKA H. sapiens h gsPathway | 0.008670 | EDN1 |
| BioCarta | Role of beta-arrestins in the activation and targeting of MAP kinases H. sapiens h barr-mapkPathway | 0.008967 | EDN1 |
| BioCarta | Role of EGF receptor transactivation by GPCRs in cardiac hypertrophy H. sapiens h cardiacegfPathway | 0.009860 | EDN1 |
| BioCarta | Roles of beta-arrestin-dependent recruitment of Src kinases in GPCR signaling H. sapiens h bArrestin-srcPathway | 0.01016 | EDN1 |

Figure 7

(A) GO terms regarding biological process, molecular function and cellular component according to the associative P-values. (B) Cell informative pathways (KEGG, BioCarta, Reactome and WikiPathways) analysis result regarding associative P-values.

PPIs network construction to perceive hub nodes

The six DEGs (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1) were provided as input to the NetworkAnalyst platform, and the generated network file was further customized in Cytoscape. The PPIs network shows extensive interactions for the S100A9 and S100A8 genes, which is evidence of the enrichment of S100A9 and S100A8 in SARS-CoV-2 responses in the human lung. Hub gene identification, module analysis and prediction of effective drug compounds all build on this PPIs network. The PPIs network, shown in Figure 8 with customized visualization, contains 125 nodes and 136 edges.
Figure 8

PPIs network for identified common DEGs that refers to SARS-CoV-2 infections in human lung and PAH lung. The common genes are highlighted with purple node (SAA2, S100A9, S100A8, SAA1 and S100A12). The network consists of 125 nodes and 136 edges.

Hub nodes identification based on the topological analyses and module detection from the PPIs network

Among the similar DEGs, hub nodes of the PPIs network were identified using cytoHubba. The top three hub nodes are S100A9, S100A8 and SAA1. The degree algorithm was used for identification; it ranks nodes by their number of interactions in the network. The hub genes are highlighted in the hub node identification network in Figure 9, which consists of 124 nodes and 135 edges. The regions of the PPIs network where the hub nodes are located are considered the prominent modules. The module analysis network, which consists of 13 nodes and 13 edges, is shown in Figure 10. Topological analysis results for the top three hub genes are presented in Table 3.
Figure 9

Hub gene detection from the similar DEGs based on the PPIs network. The highlighted nodes S100A9 (red), S100A8 (orange) and SAA1 (yellow) are regarded as highly interconnected nodes, considered as hub nodes. The network is made up of 124 nodes and 135 edges.

Figure 10

Highly interconnected regions (module) identification network that consists of 13 nodes and 13 edges. The hub genes S100A9 (orange) and S100A8 (orange) are visualized in the corresponding module network.

Table 3

Exploration of topological results for top three hub genes

| Hub gene | Degree | Stress | Closeness centrality | Betweenness centrality |
|---|---|---|---|---|
| S100A9 | 83 | 14 008 | 102.66667 | 13 258 |
| S100A8 | 45 | 7370 | 82.75 | 7117 |
| SAA1 | 4 | 738 | 41.5 | 732 |
Interactions of TFs and miRNAs with the DEGs can help explain how the expression of the DEGs is regulated. The TF–miRNA co-regulatory network was generated using the NetworkAnalyst platform and then imported into Cytoscape for better visualization. The network includes 69 nodes and 77 edges. Of the 69 nodes, six are common DEGs, 35 are TF genes and 28 are miRNAs. The customized representation of the TF–miRNA co-regulatory network is presented in Figure 11.
Figure 11

TF–miRNA co-regulatory network visualization. The network includes 69 nodes and 77 edges. It contains 35 TF genes (blue) and 28 miRNAs (red), which interact with the six common DEGs (green).

Predictive drug compounds

Drug compounds were proposed from the DSigDB database using the Enrichr web platform, based on the six identified DEGs (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1). The results were ranked by P-value and adjusted P-value. MIGLITOL CTD 00002031 and metoprolol HL60 UP are the two most prominent drug compounds; each is connected with two of the DEGs (S100A12 and S100A9). Moreover, the top hub gene S100A9 is connected with both compounds, which strengthens their standing as candidates. The predicted drug compounds are presented in Table 4.
Table 4

Predictive drug compounds according to the concordant genes of SARS-CoV-2 and PAH samples

| Name of drug | P-value | Adjusted P-value | Genes |
|---|---|---|---|
| MIGLITOL CTD 00002031 | 0.000004943 | 0.01990 | S100A12, S100A9 |
| Bosentan CTD 00003071 | 0.003296 | 0.5529 | EDN1 |
| Coenzyme Q10 CTD 00001167 | 0.003595 | 0.5789 | EDN1 |
| Metoprolol HL60 UP | 0.00007383 | 0.04954 | S100A12, S100A9 |
| 9-(2-Phosphonomethoxypropyl)adenine CTD 00003259 | 0.004193 | 0.5821 | EDN1 |
| (+)-Chelidonine HL60 DOWN | 0.00009129 | 0.05250 | S100A9, S100A8 |
| Sildenafil CTD 00003367 | 0.004492 | 0.6028 | EDN1 |
| Norepinephrine CTD 00006417 | 0.00009879 | 0.04972 | S100A9, S100A8 |
| Dydrogesterone CTD 00005882 | 0.004791 | 0.6028 | EDN1 |
| 1,3-Dimethylthiourea CTD 00001818 | 0.004791 | 0.5845 | EDN1 |
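As a rough illustration of how the values in Table 4 could be screened, the sketch below filters the compounds by adjusted P-value. The cut-off of 0.05 is an assumption made for this example and is not a threshold stated by the authors; the rows are transcribed from Table 4, and the function name is hypothetical.

```python
# Rows transcribed from Table 4: (compound, P-value, adjusted P-value, genes).
TABLE_4 = [
    ("MIGLITOL CTD 00002031", 0.000004943, 0.01990, ("S100A12", "S100A9")),
    ("Bosentan CTD 00003071", 0.003296, 0.5529, ("EDN1",)),
    ("Coenzyme Q10 CTD 00001167", 0.003595, 0.5789, ("EDN1",)),
    ("Metoprolol HL60 UP", 0.00007383, 0.04954, ("S100A12", "S100A9")),
    ("9-(2-Phosphonomethoxypropyl)adenine CTD 00003259", 0.004193, 0.5821, ("EDN1",)),
    ("(+)-Chelidonine HL60 DOWN", 0.00009129, 0.05250, ("S100A9", "S100A8")),
    ("Sildenafil CTD 00003367", 0.004492, 0.6028, ("EDN1",)),
    ("Norepinephrine CTD 00006417", 0.00009879, 0.04972, ("S100A9", "S100A8")),
    ("Dydrogesterone CTD 00005882", 0.004791, 0.6028, ("EDN1",)),
    ("1,3-Dimethylthiourea CTD 00001818", 0.004791, 0.5845, ("EDN1",)),
]

ALPHA = 0.05  # assumed significance cut-off, not taken from the paper


def significant_compounds(rows, alpha=ALPHA):
    """Keep rows whose adjusted P-value is below alpha, most significant first."""
    hits = [row for row in rows if row[2] < alpha]
    return sorted(hits, key=lambda row: row[2])


selected = significant_compounds(TABLE_4)
for name, p_value, adj_p, genes in selected:
    print(f"{name}: adjusted P = {adj_p}, genes = {', '.join(genes)}")
```

Under this assumed cut-off, the compounds that remain are all linked to S100A9, while every compound connected only to EDN1 is filtered out.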

Discussion

Recent studies have demonstrated that SARS-CoV-2 affects the human lungs and complicates their functioning, which can eventually lead to diseases like PAH. This study attempts to identify genomic differences between SARS-CoV and SARS-CoV-2 and to characterize the transcriptomic effects of SARS-CoV-2 on PAH through a number of bioinformatics approaches. Given the lethal effect of SARS-CoV-2 on humankind, the current research can be regarded as the most comprehensive transcriptomic and genomic research on the novel coronavirus to date. According to the GO terms, inflammatory responses dominate the infection response to SARS-CoV-2. Among biological processes, neutrophil chemotaxis, granulocyte chemotaxis, neutrophil migration and regulation of inflammatory response are among the top GO terms. During SARS-CoV-2 infection of the human lung, neutrophil chemotaxis induces uncontrolled inflammation driven by proinflammatory cytokines [50]. Granulocyte chemotaxis reflects a strongly upregulated inflammatory response in human lung epithelial cells [51]. Among molecular functions, receptor for advanced glycation end products (RAGE) receptor binding, calcium ion binding and zinc ion binding are the most significant terms. RAGE acts as a mediator and biomarker of inflammatory illness during SARS-CoV-2 infection [52]. The top cellular components are cytoplasmic vesicle lumen, secretory granule lumen and cytoskeleton. Identification of cell signaling pathways by unbiased database screening also shows inflammatory responses to SARS-CoV-2. The IL-17 signaling pathway was identified from the KEGG database. IL-17 is a member of a cytokine family that has been linked to SARS-CoV-2 infection and its cytokine storm [53, 54]. In PAH, high expression and significant hypomethylation associated with IL-17 responses have been identified [55].
A recent study identified the TNF signaling pathway in SARS-CoV-2 infection of human lung epithelial cells [56]. The PPIs network provides proteomic information regarding SARS-CoV-2 and PAH and shows 136 interactions among 125 nodes. The analysis was performed for the six common DEGs (SAA2, S100A9, S100A8, SAA1, S100A12 and EDN1), and the highly interconnected nodes and regions point to S100A9 and S100A8. S100 calcium-binding protein A9 (S100A9) and S100 calcium-binding protein A8 (S100A8) are both associated with respiratory disorders or lung diseases [57]. Studies have reported a number of immunocytochemical responses of S100A9 and S100A8 in PAH lung samples [58]. Based on the hub nodes, highly interconnected modules were also identified from the PPIs network. Regulatory biomolecules can serve as potential biological markers in a number of complex diseases. The regulation of the six common DEGs was examined through the TF–miRNA co-regulatory network by assessing the TF genes and miRNAs in that network. A total of 28 miRNAs and 35 TF genes are shown interacting with the six common DEGs. Among the TF genes, androgen receptor (AR) has the most interactions. The TMPRSS2 gene is considered to be an active promoter for the spike protein of SARS-CoV-2, and AR is a required factor for transcription of the TMPRSS2 gene [59]. Drug compounds were suggested for the six common DEGs from the DSigDB database, and the top 10 drugs were identified for this study. MIGLITOL CTD 00002031, Bosentan CTD 00003071, Coenzyme Q10 CTD 00001167, metoprolol HL60 UP, chelidonine HL60 DOWN, sildenafil CTD 00003367, norepinephrine CTD 00006417, dydrogesterone CTD 00005882 and 1,3-Dimethylthiourea CTD 00001818 are among the significant candidate drugs from the current prediction.
Recent studies have reported activity of MIGLITOL against RNA viruses. MIGLITOL performed well as an inhibitor of the spike protein (S1) of SARS-CoV-2 in a study that combined molecular dynamics and virtual screening of MIGLITOL and a number of other approved drugs [60]. Coenzyme Q10 may be supportive for COVID-19 patients, as it increases energy levels and immunity and reduces oxidative stress. Fatigue is one of the major symptoms of COVID-19, and coenzyme Q10 has shown significant potential to reduce fatigue and pain in fibromyalgia patients [61]. Recent studies have suggested that sildenafil is suitable for COVID-19 patients, as its principal role is to inhibit neointimal formation and platelet aggregation [62]. Adults are at greater risk from COVID-19, and norepinephrine is suggested for infected adults with shock [63]. The identified DEGs show inflammatory and cytokine responses and are associated with a number of pathways, which generally relate to SARS-CoV-2 infection in human lung epithelial cells and PAH-affected lungs. The transcriptomic results of this research are based on a limited number of samples for both SARS-CoV-2 and PAH. A larger number of samples would likely yield more concordant genes and a broader transcriptomic picture in the near future.

Conclusions

In this study, biological domains, regulatory elements and identified biomarkers were briefly discussed; these are expected to accelerate the pace of therapeutics development against the ongoing COVID-19 pandemic. Our study can be considered by far the largest genomic and transcriptomic study on SARS-CoV-2 to date. We provided multiple lines of analysis, including a comparative genomic analysis of SARS-CoV and SARS-CoV-2, followed by transcriptomic analyses of SARS-CoV-2 and its PAH comorbidity. Phylogenetic analysis showed genomic differences between SARS-CoV and SARS-CoV-2. We identified the concordant genes between SARS-CoV-2 and PAH, which yielded further molecular results and show the association of the DEGs in SARS-CoV-2-affected human lung epithelial cells and PAH patients’ lungs. SARS-CoV-2 infection of human lung epithelial cells produced a distinct transcriptional response enriched in inflammatory responses and neutrophil chemotaxis, and inflammatory responses were also seen in PAH patients. The PPIs network captured the interactions of the genes shared between COVID-19 and PAH, and topological analysis of the network revealed the highly interconnected nodes and singled out specific genes from the concordant genes. The predicted drug compounds show activity against RNA viruses and against the inflammatory responses associated with SARS-CoV-2 infection, and the identified pathways provide molecular information for both SARS-CoV-2 and PAH.