| Literature DB >> 35205254 |
Mohd Murshad Ahmed1, Almaz Zaki2, Alaa Alhazmi3, Khalaf F Alsharif4, Hala Abubaker Bagabir5, Shafiul Haque6, Kailash Manda7, Shaniya Ahmad2, Syed Mansoor Ali2, Romana Ishrat1.
Abstract
Sepsis is a clinical syndrome with high mortality and morbidity rates. In sepsis, the abrupt release of cytokines by the innate immune system may cause multiorgan failure, leading to septic shock and associated complications. In the presence of a number of systemic disorders, such as sepsis, infections, diabetes, and systemic lupus erythematosus (SLE), cardiorenal syndrome (CRS) type 5 is defined by concomitant cardiac and renal dysfunctions Thus, our study suggests that certain mRNAs and unexplored pathways may pave a way to unravel critical therapeutic targets in three debilitating and interrelated illnesses, namely, sepsis, SLE, and CRS. Sepsis, SLE, and CRS are closely interrelated complex diseases likely sharing an overlapping pathogenesis caused by erroneous gene network activities. We sought to identify the shared gene networks and the key genes for sepsis, SLE, and CRS by completing an integrative analysis. Initially, 868 DEGs were identified in 16 GSE datasets. Based on degree centrality, 27 hub genes were revealed. The gProfiler webtool was used to perform functional annotations and enriched molecular pathway analyses. Finally, core hub genes (EGR1, MMP9, and CD44) were validated using RT-PCR analysis. Our comprehensive multiplex network approach to hub gene discovery is effective, as evidenced by the findings. This work provides a novel research path for a new research direction in multi-omics biological data analysis.Entities:
Keywords: cardiorenal syndrome; differentially expressed genes; miRNA; sepsis; systemic lupus erythematosus
Mesh:
Year: 2022 PMID: 35205254 PMCID: PMC8872348 DOI: 10.3390/genes13020209
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Details of GSE series sample and their DEGs and DEMs. Column 2 shows the total number of samples available in the particular GSE series, and column 7 indicates the fold change (FC) values. These GSE series were preprocessed using GEO2R, with many in-built functions for normalization and batch effect removal using empirical methods in limma. After applying the limma methods, the resultant data were obtained in the form of various calculated values, such as B value, p-Value, adj p-Value, fold change, and t values. The total up- and downregulated genes in each GSE series were written in columns 5 and 6, respectively. In this table, column 11 indicates the platform of the GSE series, which means that different platforms have different numbers of transcripts (Affymetrix Probe Set ID). GPL-570 always has 54675 transcripts, whereas GPL-4274 contains 18666 only.
| SERIES | #Sample | Normal | Disease | Up | Down | FC | Illness | Country | Year | Platform | Author Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GSE6535 | 72 | 17 | 55 | 13 | 29 | 2 | Sepsis | Australia | 2007 | GPL-4274 | Tang BM [ |
| GSE5772 | 94 | 23 | 71 | 28 | 19 | 0.5 | Sepsis | Australia | 2007 | GPL-4274 | Tang BM [ |
| GSE28750 | 41 | 20 | 21 | 86 | 42 | 2 | Sepsis | Australia | 2011 | GPL-570 | Gareth price [ |
| GSE64457 | 23 | 8 | 15 | 53 | 63 | 1.5 | Sepsis | France | 2015 | GPL-570 | Julien TT [ |
| GSE12624 | 70 | 36 | 34 | 41 | 30 | 1 | Sepsis | Germany | 2010 | GPL-4204 | Hamid Hossain |
| GSE13205 | 21 | 8 | 13 | 107 | 81 | 0.5 | Sepsis | UK | 2008 | GPL-570 | Iain Gallagher [ |
| GSE51997 | 36 | 22 | 14 | 662 | 205 | 0.5 | SLE | Germany | 2014 | GPL-570 | Powel Durek [ |
| GSE13887 | 27 | 17 | 10 | 129 | 45 | 1.5 | SLE | USA | 2009 | GPL-570 | Frank A M [ |
| GSE50772 | 81 | 20 | 61 | 169 | 53 | 1 | SLE | USA | 2015 | GPL-570 | Michael TS [ |
| GSE30153 | 26 | 9 | 17 | 61 | 62 | 0.5 | SLE | France | 2003 | GPL-570 | Jean Nicolas [ |
| GSE103760 | 16 | 8 | 8 | 489 | 103 | 1.5 | SLE | USA | 2018 | GPL-17585 | Morel L |
| GSE99967 | 59 | 17 | 42 | 65 | 11 | 0.5 | SLE | Canada | 2018 | GPL-21970 | Prokopec S [ |
| GSE17582 | 48 | 12 | 36 | 11 | 10 | 1.5 | CRS | Canada | 2009 | GPL-6883 | Braam B [ |
| GSE125898 | 6 | 3 | 3 | 362 | 422 | 2 | CRS | Mexico | 2019 | GPL-10739 | Rangel LA |
| GSE89699 | 8 | 1 | 7 | 0 | 81 | 0.5 | CRS | India | 2017 | GPL-18402 | Ramanathan k |
| GSE87885 | 5 | 3 | 2 | 93 | 78 | 0.5 | CRS | China | 2019 | Gpl-22555 | Rangjian Xu |
Figure 1Methodology adopted for the series used in the study. SLE, sepsis, and CRS data were extracted from NCBI geo datasets, and these terms were also searched for in the Array Express database. The GSE series were available in the public databases, and we retrieved only human data by applying exclusion criteria. Data were filtered via GEO2R and selected after final mining, applying normalization and log 2 transformations. The resulting data are in the Excel sheet used for DEGs (based on fold change and p-value). The data were merged to find overlapping DEGs and DEMs, and finally, the PPIN network was constructed via overlapping DEGs-DEMs (shared network).
Real time pcr primers (mouse).
| Genes | Forward | Reverse |
|---|---|---|
|
| GAATTAGCTGGACACTCAAG | CACCTTCTCCTACTATTGACC |
|
| CAGAGTCCTTTTCTGACATC | GAGAAGCGGCCAGTATAG |
|
| CTTCCAGTACCAAGACAAAG | ACCTTGTTCACCTCATTTTG |
Figure 2Boxplot of GSE94717. For all GSE series, the boxplot indicates data noise/data duplicity and normalized/non-normalized data. If the data are not normalized, the whisker boxes are up and down (not in proper lines). If data are random due to noise mixing or redundancy, data should be normalized in order to solve this problem. If data are non-normalized, there may be false results; the upregulated genes may be portrayed as downregulated, and vice-versa. After data are normalized, we can rely on the data, and consider the DEGs for further studies. The blue color box plot shows the normal samples and the pink color box plot shows the disease samples. In the boxplot, we can see the number of samples and the GSM number of the GSE series (if the GSE series selection was wrong, then the 12 disease samples and the 3 control samples will also give false results). The pre-processing of the data is an essential step for data accuracy when working with manipulated data. The Y-axis of the boxplot indicates a range of data values, and the X-axis indicates data samples in the GSE series. (a) Non-normalized data; (b) normalized data.
Figure 3Network representing downregulated genes of sepsis, SLE, and CRS. This downregulated network contains 1351 nodes and 52,455 edges. The CRS GSE series GSE125898 contains 422 downregulated genes and GSE17582 contains 10 downregulated genes. Sepsis and SLE together contribute 13 downregulated genes. Green colors indicate our seed genes and blue are their interacting partner. A zoom view shows clear gene–gene interactions (green colors for seed genes). The nodes are the gene entity, and the edges are the interactions/connections. A gene has multiple connections in the network: gene to gene, gene to itself, and gene to many genes interactions. This complex network is undirected (no direction of edges) and unweighted. To retrieve biological information from the complex network, module analysis methods are used, which reduces complexity.
Figure 4Network representing upregulated genes of sepsis, SLE and CRS. This upregulated network contains 1348 nodes and 45,699 edges. The CRS GSE series GSE125898 contains 362 upregulated genes and GSE17582 contains 11 upregulated genes. Sepsis and SLE together contribute 47 upregulated genes. The red color indicates our seed genes, and blue are their interacting partner. A highlighted (zoom view) in the right corner highlights the clear interaction between seed genes and non-seed genes. The nodes are the genes, and edges are the interactions/connections (G = V, E).
Figure 5Merged network of upregulated genes and downregulated genes. This network comprises 2091 nodes and 75,957 edges. Green colors nodes and red colors nodes indicates our key genes (downregulated and upregulated, respectively), and blue nodes are their interacting partners (other genes). Nodes are the gene entity and edges are the interactions/connections. This merged network is prepared using the Network Analyzer function in Cytoscape. The genes have a common connection in both networks and merge into one network: both networks, upregulated and downregulated, have 1306 and 1348 nodes, respectively; after merging, the total genes should be 2698, but in the merged network, this becomes 2091, because some of the genes are the same in both networks (i.e., we removed duplicate genes).
Figure 6A miRNA–mRNA network of three key miRNAs from two CRS miRNA series. This represents a network of three miRNAs (selected on the basis of common miRNAs obtained from the two series). This network represents miRNAs with their respective targets. The total number of target genes of all three miRNAs are 1500 and the edges are 1609.
Figure 7Shared network: this shared network is formed from all the above networks (upregulated network, downregulated network, and miRNA–mRNA network). It contains 2091 nodes and 75,957 edges. The red color indicates upregulated genes, the green color indicates downregulated genes, the yellow color indicates the target genes of miRNAs, and the blue color indicates the interacting genes.
Part A: List of modules and their corresponding top 50 genes. Part B: Seed genes in top 10.
|
| ||
|
|
|
|
| module 1 | 22 |
|
| module 2 | 22 |
|
| module 3 | 35 |
|
| module 4 | 24 |
|
| module 5 | 20 |
|
| module 6 | 36 |
|
| module 7 | 10 |
|
| module 8 | 49 |
|
| module 9 | 1 |
|
| module 10 | 33 |
|
|
| ||
|
|
|
|
| module 2 | 1 |
|
| module 3 | 7 |
|
| module 4 | 1 |
|
| module 6 | 2 |
|
| module 7 | 5 |
|
| module 8 | 5 |
|
| module 10 | 6 |
|
Figure 8Top ten modules selected from MCODE analysis. Network/modules/sub-modules at different network levels that accommodate leading hubs (red) (fundamental key regulators) and blue nodes indicate interactive partners (other genes).
Figure 9Topological properties of shared network: (a) clustering co-efficient (C(k)), (b) the behaviors of degree distributions (P(k)), (c) neighborhood connectivity (CN(k)), (d) closeness (CC(k)), (e) betweenness (CB(k)), and (f) eigenvector (CE(k)) measurements as a function of degree k for original shared network.
Figure 10The figure depicts the top 10 gene ontology of key genes with GO term analysis ID and the padj molecular function, biological process, cellular components, and KEGG pathways.
Figure 11Validation through real-time qRT-PCR analysis of gene expression (MMP9, CD44, and EGR1). Sepsis groups are CLP operated and SHAM groups are control groups. Real-time analysis data showed their upregulated expression. Data are represented as mean ± S.E.M. * p < 0.05 and *** p < 0.001.
Twenty-seven hub genes were identified in the shared network based on degree centrality.
| Sr. No. | Genes Symbol | Degree |
|---|---|---|
| 1 |
| 149 |
| 2 |
| 275 |
| 3 |
| 194 |
| 4 |
| 242 |
| 5 |
| 132 |
| 6 |
| 151 |
| 7 |
| 170 |
| 8 |
| 135 |
| 9 |
| 254 |
| 10 |
| 91 |
| 11 |
| 37 |
| 12 |
| 188 |
| 13 |
| 126 |
| 14 |
| 138 |
| 15 |
| 120 |
| 16 |
| 116 |
| 17 |
| 57 |
| 18 |
| 246 |
| 19 |
| 134 |
| 20 |
| 39 |
| 21 |
| 87 |
| 22 |
| 38 |
| 23 |
| 29 |
| 24 |
| 21 |
| 25 |
| 22 |
| 26 |
| 22 |
| 27 |
| 24 |