Literature DB >> 34103966

Identification of an Alveolar Macrophage-Related Core Gene Set in Acute Respiratory Distress Syndrome.

Chunling Zhao1, Jingjia Mo2, Xiaowen Zheng1, Zimeng Wu1, Qian Li1, Jihua Feng1, Jiefeng Luo1, Junyu Lu3, Jianfeng Zhang1.   

Abstract

PURPOSE: Acute respiratory distress syndrome (ARDS) is a rapidly progressive diffuse lung injury that is characterized by high mortality and acute onset. The pathological mechanisms of ARDS are still unclear. But alveolar macrophages have been shown to play an important role in inflammatory responses during ARDS. We aimed to find the biomarkers for ARDS for early diagnosis, to give ARDS patients timely treatment.
METHODS: Gene expression profiles were downloaded from Gene Expression Omnibus (GEO) and screened for differentially expressed genes (DEGs). The common upregulated genes in all the datasets were defined as circulating ARDS alveolar macrophage-related genes (cARDSAMGs). We performed a functional enrichment analysis to explore potential biological functions of cARDSAMGs, and we built protein-protein interaction networks. Gene set variation analysis (GSVA) was used to calculate the core gene set variation analysis (CGSVA) score for individual samples. Receiver operating characteristic (ROC) curve analysis was applied on the CGSVA score to evaluate its ability for diagnosis of ARDS.
RESULTS: A total of 60 genes were upregulated in all ARDS datasets and were therefore denominated as cARDSAMGs. The cARDSAMGs were significantly involved in multiple inflammation-, immunity- and phagocytosis-related biological processes and pathways. In the protein-protein interaction network associated with host responses to ADRS, eight genes were identified as a core gene set: PTCRA, JAG1, C1QB, ADAM17, C1QA, MMP9, VSIG4 and TNFAIP3. ROC curve analysis showed that the CGSVA score may be considered as a biomarker for ARDS: it was significantly higher in patients with ARDS than those in healthy in both alveolar lavage fluid and whole blood.
CONCLUSION: The ARDS alveolar macrophage-related CGSVA score may be useful as a biomarker for ARDS.
© 2021 Zhao et al.

Entities:  

Keywords:  acute respiratory distress syndrome; differentially expressed genes; gene expression omnibus; lung injury

Year:  2021        PMID: 34103966      PMCID: PMC8179830          DOI: 10.2147/JIR.S306136

Source DB:  PubMed          Journal:  J Inflamm Res        ISSN: 1178-7031


Introduction

Sepsis is a syndrome in which the body’s immune system overreacts to infection. There may be several life-threatening complications. One of them is acute respiratory distress syndrome (ARDS).1,2 ARDS is an acute diffuse lung injury with high incidence. It has been shown that the pathophysiological basis of ARDS includes an excessive and protracted systemic inflammation,3 but the exact mechanisms remain unknown. The main risk factors that may cause ARDS include drinking, smoking, air pollution, hypoproteinemia, and diabetes mellitus.4–6 Currently, urgent treatment strategies mainly include low tidal mechanical ventilation, but the outcome is very poor.3 Although some progress has been made in treatment options in recent years, the mortality rate remains high.7 Previous studies have identified some genes used as biomarkers for ARDS. The high expression of interleukin (IL)-33 can elevate the expression of matrix metallopeptidases (MMP) 2 and 9 in patients with acute lung injury.8 SLC2A6 may be a critical biomarker for predicting survival of sepsis patients.9 V-set immunoglobulin-domain-containing 4 (VSIG4) may be involved in lung injury by inducing phagocytosis.10 Activation of ERK1/2 by NGF1B depends on MAPKKK c-Raf, eventually inducing inflammation. The NGF1B in the cytoplasm may be involved in the occurrence and development of ARDS.11 Down-regulation of the long non-coding RNA GAS5 decreases the expression of angiotensin-converting enzyme 2 (ACE 2), leading to increased levels of microRNA (miR) miR-200c-3p, which promotes the progression of ARDS.12 MYC and STAT3 may be the key regulatory genes in the underlying dysfunction of sepsis-induced ARDS.13 In addition, a previous study showed that gene set variation index can be used the potential diagnostic tool for sepsis.14 And gene set variation index may be considered as the biomarker of bacterial and fungal sepsis.15 However, we found that a majority of previous studies mostly focused on single genes or molecules for ARDS. There are only a few studies involving in a set of genes related to ARDS. Several studies confirmed that alveolar macrophage transcriptional programs are associated with ARDS.16–18 We hypothesized that these alveolar macrophage-related ARDS-specific transcriptional programs may be also reflected in blood cells. To investigate our hypothesis in the present study, we identified genes upregulated both in alveolar macrophages and blood cells and constructed a corresponding protein–protein interaction network associated with host response to ADRS. We found that eight genes formed a core gene set, and that the core gene set variation score may be used as a circulating biomarker for ARDS.

Materials and Methods

Data Collection and Processing

The data was downloaded from the Gene Expression Omnibus (GEO) database (): GSE116560,16 GSE89953,19 GSE7629320 and GSE32707.21 GSE116560 and GSE89953 were based on GPL6883. GSE116560 was obtained from alveolar macrophages of 68 ARDS samples, while GSE89953 was obtained from peripheral blood monocytes of 26 ARDS samples. The two datasets were combined and then batch effects were removed using the sva routine in R.22 From GSE76293, based on GPL570, the gene profiles of neutrophils from 12 ARDS samples and 12 healthy controls were extracted for subsequent analysis. From GSE32707, based on GPL10558, the gene profiles of whole blood cells were used to perform the subsequent analysis, from 18 ARDS samples and 34 healthy controls. The data were normalized using the “normalizeBetweenArrays” function of the limma routine23 in R. The probe set was converted into gene symbols, according to the annotation information of the three platforms. The workflow of the study is shown in Figure 1.
Figure 1

Workflow chart of the study.

Workflow chart of the study.

Principal Component Analysis (PCA) and Screening Differentially Expressed Genes (DEGs)

The ggbiplot routine24 in R was applied to perform the PCA.25 In GPL6883 datasets (GSE116560 and GSE89953), the DEGs in alveolar macrophages were screened in comparison with peripheral blood monocytes. These DEGs may reflect the response of host monocyte-macrophages to ARDS.18 In GSE76293, DEGs in ARDS neutrophils were screened compared to healthy controls, since neutrophils are the most frequent nucleated cell in whole blood. In GSE32707, we screened the DEGs in whole blood between ARDS patients and healthy controls. A total of the DEGs were screened using the limma routine in R. The cut-off criteria are a false discovery rate (FDR)-adjusted P < 0.05 and |log2 (fold-change)| > 1. In the study, the common upregulated genes were defined as circulating ARDS alveolar macrophage-related genes (cARDSAMGs).

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway Enrichment Analysis

To explore the potential biological functions of the cARDSAMGs, GO and KEGG pathway enrichment analysis were performed using the clusterProfiler routine in R.26 GO and KEGG networks were clustered using the plug-in ClueGO26 in Cytoscape software.27 Differences associated with an FDR-adjusted P < 0.05 was considered statistically significant.

Analysis of Core Genes

Using STRING database28 () and KEGG pathway enrichment analysis, a protein–protein interactions (PPIs) network was constructed for the cARDSAMGs. If genes were involved in ARDS-related pathways of interest, we defined them as core genes in the network. The GOsemsim routine inR29 was applied to analyze the semantic similarities among GO terms for the cARDSAMGs, applying the following formula: Pi = -lg (p) * |log2 (fold-change)|, weight (W) value = semantic similarities * mean value of Pi.

Gene Set Variation Analysis (GSVA) and Receiver Operating Characteristic (ROC) Curve Analysis

The GSVA routine inR30 was applied to calculate the core gene set variation analysis (CGSVA) score for individual samples to assess the concerted functional behavior of the core gene set. To further evaluate the potential diagnostic value of the CGSVA score for ARDS, an ROC curve analysis was performed using the pROC routine in R.31

Results

Multiple Genes Identified as cARDSAMGs

PCA showed that global gene expression patterns could distinguish ARDS from controls in GSE116560, GSE89953 and GSE76293, but not in GSE32707 (). There were 8522 DEGs in GSE116560 and GSE89953, 6064 in GSE76293, and 7713 in GSE32707 (Figure 2A). A set of 60 DEGs was overexpressed in all the datasets, defined as cARDSAMGs (Figure 2B). Moreover, we obtained the p.value and rank of 60 cARDSAMGs in above three datasets and the ranking based on the p.value (). PCA showed that the expression patterns of the cARDSAMGs could not distinguish ARDS from healthy controls (). However, we found that cARDSAMGs can better distinguish ARDS from control compared with the global gene expression patterns ().
Figure 2

Multiple genes identified as cARDSAMGs. (A) Manhattan plot for DEGs. The three most significantly overexpressed genes are marked in yellow and labeled with their names. (B) Venn diagram to identify upregulated DEGs common to datasets from the three platforms.

Multiple genes identified as cARDSAMGs. (A) Manhattan plot for DEGs. The three most significantly overexpressed genes are marked in yellow and labeled with their names. (B) Venn diagram to identify upregulated DEGs common to datasets from the three platforms.

Biological Significance of cARDSAMGs

GO enrichment analysis showed that changes in expression involved mainly genes related to metabolism- and inflammation-related biological processes (Figure 3A). KEGG result showed that DEGs were mainly involved in pathways related to ARDS, such as Notch the tumor necrosis factor (TNF) and the tumor necrosis factor (TNF) signaling pathways, as well as complement and coagulation cascades (Figure 3B). In addition, we performed the enrichment analysis using the 60 top ranked up-regulated genes in each of the datasets. As shown in , we found that the biological processes (BPs) and KEGG pathways were different in each of datasets. But most of them were related to immunity, inflammation and phagocytosis. Furthermore, we randomly selected 60 genes in the three datasets and thereby calculated the enriched GO term. We found that the enriched GO terms mostly were not significantly related with inflammation, immunity and phagocytosis, which further evidenced that the gene set-cARDAMGs was not simply a randomly overlapping set (). The ClueGo analysis indicated that DEGs were clustered mainly in biological processes, such as bone resorption, collagen catabolic process, regulation of cartilage development, positive regulation of ATP biosynthetic process, negative regulation of IL-2 production and positive regulation of animal organ morphogenesis (Figure 3C). No significant clustering of KEGG pathways was observed, based on a threshold of FDR-adjusted P < 0.05.
Figure 3

Biological significance of cARDSAMGs. (A) The significantly enriched biological processes, (B) KEGG pathways and (C) GO network in circulating ARDS involving alveolar macrophage-related genes in the three datasets. Different colors of nodes represent different functional groups.

Biological significance of cARDSAMGs. (A) The significantly enriched biological processes, (B) KEGG pathways and (C) GO network in circulating ARDS involving alveolar macrophage-related genes in the three datasets. Different colors of nodes represent different functional groups.

Core Genes in the PPIs Network of ARDS

We constructed the PPIs network of cARDSAMGs based on the STRING database. There were 25 nodes and 189 interaction pairs (Figure 4A). Previous studies showed that some of the biological processes and pathways were related to ARDS, including immunity, inflammation and phagocytosis.32,33 Therefore, we selected the following ARDS-related pathways as pathways of interest: Notch, TNF and IL-17 signaling pathways, as well as complement and coagulation cascades. Subsequently, we identified eight cARDSAMGs as core genes involved in these pathways. The core genes included PTCRA, JAG1, C1QB, ADAM17, C1QA, MMP9, VSIG4 and TNFAIP3. In the complement and coagulation cascades, antigen-antibody complexes act on C1QA and C1QB. They further act on VSIG4, thereby inducing phagocytosis.34,35 In the TNF signaling pathway, TNF acts on TNF receptor 1 (TNFR1), ultimately inducing tissue remodeling, autoimmune pathology, neutrophil recruitment and immunity to extracellular pathogens.36,37 In the Notch signaling pathway, ADAM17 and JAG1 act on the cell surface receptor Notch, which in turn affects PTCRA. In the TNF signaling pathway, TNF binds to TNFR1, which signals to surface receptors, intracellular signaling and remodeling of extracellular matrix.38,39 In the IL-17 signaling pathway, the IL-17 family signals activate downstream pathways that include NF-kappaB, MAPKs and C/EBPs, which further induces the expression of chemokines, cytokines and antimicrobial peptides40,41 (Figure 4B). The semantic similarities among the GO terms of the eight core genes are ranked in Figure 4C, and the W value of core genes, which may suggest the importance of molecules in this dysfunction, are displayed in Figure 4D.
Figure 4

Core genes in the PPIs network of ARDS. (A) PPIs network for alveolar macrophage-related genes in circulating acute respiratory distress syndrome. Ellipses represent genes; diamonds, pathways; and font size, relative degree. (B) Core genes and pathways of interest. (C) Semantic similarities among GO terms for the eight core genes. (D) Weight value of the eight core genes in the three datasets.

Core genes in the PPIs network of ARDS. (A) PPIs network for alveolar macrophage-related genes in circulating acute respiratory distress syndrome. Ellipses represent genes; diamonds, pathways; and font size, relative degree. (B) Core genes and pathways of interest. (C) Semantic similarities among GO terms for the eight core genes. (D) Weight value of the eight core genes in the three datasets.

CGSVA Score May Be Used as a Circulating Biomarker for ARDS

As shown in , the C1QA, C1QB, MMP9, PTCRA and VSIG4 may be serve as circulating diagnostic markers for ARDS. Their areas under the ROC curves (AUCs) were higher than 0.7 in the three datasets. In addition, the CGSVA score was higher in the ARDS group than in the control group in all datasets (Figure 5A). The expression of the core genes is shown in Figure 5A. AUC for CGSVA score was higher than 0.7 in all three datasets, which suggests that the score may be useful as a diagnostic marker for ARDS (Figure 5B). The score in macrophages from the alveolar lavage fluid (GSE116560 and GSE89953) and whole blood (GSE76293 and GSE32707) was higher in ARDS individuals than in healthy controls (Figure 5C).
Figure 5

CGSVA score may serve as a circulating biomarker for ARDS. (A) a, GSVA scores of the core gene set in the three datasets. Expression of core genes in (b) GSE76293, (c) GSE116560 & GSE89953 and (d) GSE32707. (B) ROC curves of the gene set GSVA score in the three datasets. (C) The GSVA scores in alveolar whole blood (GSE76293 and GSE32707) and lavage fluid (GSE116560 and GSE89953) were higher for patients with ARDS than for controls.

CGSVA score may serve as a circulating biomarker for ARDS. (A) a, GSVA scores of the core gene set in the three datasets. Expression of core genes in (b) GSE76293, (c) GSE116560 & GSE89953 and (d) GSE32707. (B) ROC curves of the gene set GSVA score in the three datasets. (C) The GSVA scores in alveolar whole blood (GSE76293 and GSE32707) and lavage fluid (GSE116560 and GSE89953) were higher for patients with ARDS than for controls.

Discussion

In this study, the differences in gene expression were explored between whole blood and alveolar lavage in ARDS samples and healthy controls. We selected four gene expression datasets from three platforms, where we found a common set of 60 DEGs upregulated in ARDS patients compared to healthy controls. Some among them have been previously informed to be related to ARDS. For example, immunohistochemical results showed that CD163 is overexpressed in ARDS patients.42 The highly expressed IL-33 can drive expression of MMP9 and MMP2 by activating STAT3. The process mainly happens in alveolar macrophages for acute lung injury patients induced by lipopolysaccharide.8 Mutations in the ALOX5 gene appear to be most likely cause of inter-individual differences in ARDS progression.43 Subsequently, we performed a function enrichment analysis for cARDSAMGs. The results showed that these cARDSAMGs were involved mainly in biological processes and pathways related to immunity, inflammation, phagocytosis and nucleic acid turnover. Some of these pathways have been related with ARDS in previous studies. Production of IL-1 and low concentrations of anti-inflammatory cytokines, such as IL-10 and IL-1 receptor antagonist, in bronchoalveolar lavage fluid of early ARDS patients have been closely related to poor prognosis.44 Numerous studies have linked inflammation and phagocytosis to ARDS.32,33,45,46 Moreover, we constructed a protein–protein interaction network to analyze the relationship among genes and pathways. We focused on TNF, IL-17 and Notch signaling pathways, as well as complement and coagulation cascades. In complement and coagulation cascades, complement activation may induce the occurrence of phagocytosis,47 which plays an essential role in ARDS.48 TNF acts as an inflammatory factor, up-regulating the expression of surface receptors, promoting intracellular signaling and remodeling of the extracellular matrix.49 Some biomarkers are closely related to ARDS, including surfactant-related proteins and cytokines (IL-1, −2, −6, −8, −10 and −15, and TNF-α), as well as neutrophil activation markers (MMP9, leukotriene B4 and ferritin).32 In the IL-17 signaling pathway, IL-17A and IL-17B act through surface receptors IL-17-17RA and IL-17RC to induce immunity and recruit neutrophils.49,50 Neutrophil-derived mediators induce epithelial cell death by oxidation of soluble Fas ligand.33,45 Eight genes corresponding to the selected pathways were identified as core genes: PTCRA, JAG1, C1QB, ADAM17, C1QA, MMP9, VSIG4 and TNFAIP3. VSIG4 is the main part of phagocytic system: it rapidly removes C3-modified particles.10 MMP9 is a secreted zinc metallopeptidase.51 MMP9 mediates protein degradation in the extracellular matrix of alveolar epithelial cells, mainly in intercellular junction proteins and proteins that anchor cells to the basement membrane.52 In addition, MMP9 is enriched mainly in macrophages and neutrophils,53,54 and involved in the occurrence and development of emphysema and asthma.55,56 TNFAIP3 was identified as a gene whose expression is rapidly induced by TNF. The protein encoded by this gene has been shown to inhibit nuclear factor (NF)-kappa B activation as well as TNF-mediated apoptosis.57 PTCRA encodes a single channel type 1 membrane protein in mature T cells. This protein forms a T-cell pre-receptor complex together with CD3 and T cell receptor beta chain (TCRB) complexes, thereby regulating the early development of T cells.58–60 C1QB may be associated with the immune response after injury.61 In addition, CGSVA score may be used as a biomarker for ARDS patients. Furthermore, the CGSVA scores associated with ARDS both in whole blood and alveolar macrophages were significantly higher in ARDS patients than in healthy controls. This may suggest that these alveolar macrophage-related ARDS-specific transcriptional programs are also reflected in blood cells. Our study presents several limitations. Firstly, our predictions are based on bioinformatic analyses, and therefore further experimental verification is needed. Secondly, most of alveolar macrophage genes have a dynamic pattern, but we only select genes that are significantly up-regulated in all time point without the consider of time changes. Thirdly, the whole transcriptome sequencing data of macrophages in whole blood and alveolar lavage fluid are difficult to find, currently. Whether these 8 cARDSAMGs may be used as diagnostic markers needs to verify in a larger independent clinical sample. Fourthly, it is shown that the gene set has a diagnostic performance in each of datasets in this study. Therefore, we believe that this gene set is specific for ARDS to some extent. But whether it is specific enough to ARDS, any inflammation or infection, still needs to be further clarified. Finally, we need to further explore the upstream regulators of core genes to reveal in more detail the imbalance network in ARDS.

Conclusions

We identified a core gene set (PTCRA, JAG1, C1QB, ADAM17, C1QA, MMP9, VSIG4 and TNFAIP3) that may help predict ARDS. The ARDS alveolar macrophage-related CGSVA score may be helpful for the diagnosis of ARDS.
  61 in total

1.  A critical role for the cytoplasmic tail of pTalpha in T lymphocyte development.

Authors:  Iannis Aifantis; Christine Borowski; Fotini Gounari; H Daniel Lacorazza; Janko Nikolich-Zugich; Harald von Boehmer
Journal:  Nat Immunol       Date:  2002-04-01       Impact factor: 25.606

Review 2.  Notch signaling in mammary development and oncogenesis.

Authors:  Robert Callahan; Sean E Egan
Journal:  J Mammary Gland Biol Neoplasia       Date:  2004-04       Impact factor: 2.673

3.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

Review 4.  The acute respiratory distress syndrome.

Authors:  Michael A Matthay; Lorraine B Ware; Guy A Zimmerman
Journal:  J Clin Invest       Date:  2012-08-01       Impact factor: 14.808

5.  Association of complement activation and elevated plasma-C5a with adult respiratory distress syndrome. Pathophysiological relevance and possible prognostic value.

Authors:  D E Hammerschmidt; L J Weaver; L D Hudson; P R Craddock; H S Jacob
Journal:  Lancet       Date:  1980-05-03       Impact factor: 79.321

6.  The association between mortality rates and decreased concentrations of interleukin-10 and interleukin-1 receptor antagonist in the lung fluids of patients with the adult respiratory distress syndrome.

Authors:  S C Donnelly; R M Strieter; P T Reid; S L Kunkel; M D Burdick; I Armstrong; A Mackenzie; C Haslett
Journal:  Ann Intern Med       Date:  1996-08-01       Impact factor: 25.391

7.  Cloning and comparative analysis of the human pre-T-cell receptor alpha-chain gene.

Authors:  P Del Porto; L Bruno; M G Mattei; H von Boehmer; C Saint-Ruf
Journal:  Proc Natl Acad Sci U S A       Date:  1995-12-19       Impact factor: 11.205

8.  Elevated IL-33 promotes expression of MMP2 and MMP9 via activating STAT3 in alveolar macrophages during LPS-induced acute lung injury.

Authors:  Yafeng Liang; Nengli Yang; Guoquan Pan; Bingxin Jin; Shufen Wang; Wei Ji
Journal:  Cell Mol Biol Lett       Date:  2018-10-31       Impact factor: 5.787

Review 9.  Acute respiratory distress syndrome.

Authors:  Michael A Matthay; Rachel L Zemans; Guy A Zimmerman; Yaseen M Arabi; Jeremy R Beitler; Alain Mercat; Margaret Herridge; Adrienne G Randolph; Carolyn S Calfee
Journal:  Nat Rev Dis Primers       Date:  2019-03-14       Impact factor: 52.329

10.  Screening of Key Genes of Sepsis and Septic Shock Using Bioinformatics Analysis.

Authors:  Xiaoliang Zeng; Jihua Feng; Yanli Yang; Ruzhi Zhao; Qiao Yu; Han Qin; Lile Wei; Pan Ji; Hongyuan Li; Zimeng Wu; Jianfeng Zhang
Journal:  J Inflamm Res       Date:  2021-03-11
View more
  3 in total

Review 1.  The role of lung macrophages in acute respiratory distress syndrome.

Authors:  Wenpei Dang; Yiming Tao; Xinxin Xu; Hui Zhao; Lijuan Zou; Yongsheng Li
Journal:  Inflamm Res       Date:  2022-10-20       Impact factor: 6.986

2.  Component 1 Inhibitor Missense (Val480Met) Variant Is Associated With Gene Expression and Sepsis Development in Neonatal Lung Disease.

Authors:  Enas F Elngar; Mona A Azzam; Ayman A Gobarah; Eman A Toraih; Manal S Fawzy; Nouran B AbdAllah
Journal:  Front Pediatr       Date:  2022-05-20       Impact factor: 3.569

3.  Bioinformatics approach to identify the influences of SARS-COV2 infections on atherosclerosis.

Authors:  Jiuchang Zhang; Liming Zhang
Journal:  Front Cardiovasc Med       Date:  2022-08-18
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.