Lingxiao Qiu1,2,3, Gencheng Gong4, Wenjuan Wu5, Nana Li1, Zhaonan Li6, Shanshan Chen1,3, Ping Li1,3,7, Tengfei Chen1, Huasi Zhao1, Chunling Hu8, Zeming Fang9, Yan Wang1, Hongping Liu1, Panpan Cui10, Guojun Zhang1,3,7. 1. Department of Respiratory Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. 2. Academy of Medical Sciences, Zhengzhou University, Zhengzhou, China. 3. Henan Provincial Medical Key Laboratory for Interstitial Lung Disease and Lung Transplantation, Zhengzhou, China. 4. State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, the First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China. 5. Department of Geriatric Medicine, Henan Provincial People's Hospital, Zhengzhou, China. 6. Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. 7. Zhengzhou Key Laboratory for Chronic Respiratory Disease, Zhengzhou, China. 8. Department of Respiratory Intensive Care Unit, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. 9. Department of Thoracic Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. 10. School of Nursing and Heath, Zhengzhou University, Zhengzhou, China.
Abstract
BACKGROUND: Idiopathic pulmonary fibrosis (IPF) is a highly fatal lung disease of unknown etiology with a median survival after diagnosis of only 2-3 years. Its poor prognosis is due to the limited therapy options available as well as the lack of effective prognostic indicators. This study aimed to construct a novel prognostic signature for IPF to assist in the personalized management of IPF patients during treatment. METHODS: Differentially-expressed genes (DEGs) in IPF patients versus healthy individuals were analyzed using the "limma" package of R software. Immune-related genes (IRGs) were obtained from the ImmPort database. Univariate Cox regression analysis was adopted to screen significantly prognostic IRGs for IPF patients. Multiple Cox regression analysis was used to identify optimal prognostic IRGs and construct a prognostic signature. RESULTS: Compared with healthy individuals, there were a total of 52 prognosis-related DEGs in the bronchoalveolar lavage (BAL) samples of IPF patients, of which 37 genes were identified as IRGs. Of these, five genes (CXCL14, SLC40A1, RNASE3, CCR3, and RORA) were significantly associated with overall survival (OS) in IPF patients, and were utilized for establishment of the prognostic signature. IPF patients were divided into high- and low-risk groups based on the prognostic signature. Marked differences in the OS probability were observed between high- and low-risk IPF patients. The area under curves (AUCs) of the receiver operating characteristic (ROC) curve for the prognostic signature in the training and validation cohorts were 0.858 and 0.837, respectively. The expression levels between RNASE3 and SLC40A1 (P<0.01, r=0.394), between RORA and CXCL14 (P<0.01, r=-0.355), between CCR3 and CXCL14 (P<0.01, r=0.258), as well as between RNASE3 and CCR3 (P<0.01, r=0.293) were significantly correlated. CONCLUSIONS: We developed a validated and reproducible IRG-based prognostic signature that should be helpful in the personalized management of patients with IPF, providing new insights into the relationship between the immune system and IPF. 2021 Annals of Translational Medicine. All rights reserved.
BACKGROUND: Idiopathic pulmonary fibrosis (IPF) is a highly fatal lung disease of unknown etiology with a median survival after diagnosis of only 2-3 years. Its poor prognosis is due to the limited therapy options available as well as the lack of effective prognostic indicators. This study aimed to construct a novel prognostic signature for IPF to assist in the personalized management of IPF patients during treatment. METHODS: Differentially-expressed genes (DEGs) in IPF patients versus healthy individuals were analyzed using the "limma" package of R software. Immune-related genes (IRGs) were obtained from the ImmPort database. Univariate Cox regression analysis was adopted to screen significantly prognostic IRGs for IPF patients. Multiple Cox regression analysis was used to identify optimal prognostic IRGs and construct a prognostic signature. RESULTS: Compared with healthy individuals, there were a total of 52 prognosis-related DEGs in the bronchoalveolar lavage (BAL) samples of IPF patients, of which 37 genes were identified as IRGs. Of these, five genes (CXCL14, SLC40A1, RNASE3, CCR3, and RORA) were significantly associated with overall survival (OS) in IPF patients, and were utilized for establishment of the prognostic signature. IPF patients were divided into high- and low-risk groups based on the prognostic signature. Marked differences in the OS probability were observed between high- and low-risk IPF patients. The area under curves (AUCs) of the receiver operating characteristic (ROC) curve for the prognostic signature in the training and validation cohorts were 0.858 and 0.837, respectively. The expression levels between RNASE3 and SLC40A1 (P<0.01, r=0.394), between RORA and CXCL14 (P<0.01, r=-0.355), between CCR3 and CXCL14 (P<0.01, r=0.258), as well as between RNASE3 and CCR3 (P<0.01, r=0.293) were significantly correlated. CONCLUSIONS: We developed a validated and reproducible IRG-based prognostic signature that should be helpful in the personalized management of patients with IPF, providing new insights into the relationship between the immune system and IPF. 2021 Annals of Translational Medicine. All rights reserved.
Idiopathic pulmonary fibrosis (IPF) is a deadly interstitial lung disorder of unknown etiology (1). It is characterized by irreversible fibrogenesis in the lung parenchyma, leading to progressive respiratory function failure and eventually death (2,3). IPF is the most common interstitial lung disease and has the worst prognosis in pulmonary fibrosis (4). Nearly half of IPF patients die within 2–3 years after diagnosis (3,4), and the 5-year survival rate is less than 30% (5). IPF is a highly heterogeneous disease with a greatly variable natural history (6,7). The course of this disease in an individual patient is difficult to predict (4,8); some patients with IPF experience rapid decline, while others experience much slower development (3,8). For a long time, the lack of effective prognostic indicators has made it difficult to accurately track and evaluate the prognosis of IPF, which has led to the poor prognosis of IPF to a certain extent. Hence, the development of applicable prognostic signatures is urgently needed for the clinical treatment of IPF.The pathophysiological pathogenesis of IPF involves aberrant transcription and gene expression (9-14). Molecular genomic features based on lung tissue have been used to predict the development of IPF (15,16). Though previous studies have identified some genes and pathways may play an important role in the occurrence and development of IPF, and may be expected to be biomarkers or therapeutic targets for the diagnosis of IPF (17,18). However, the lack of verification of survival information is the biggest short board in these papers. Meanwhile, the resources required to perform a lung biopsy and the risks associated with the procedure limit the applicability of such genomic features. Molecular models have also been established based on peripheral blood mononuclear cell (PBMC) transcription profile data to predict the disease state of IPF (19,20). However, in the absence of lung biopsies, it is difficult to explain the correlation between abnormal PBMC transcription and pulmonary fibrosis course. Bronchoalveolar lavage (BAL) is a method of obtaining alveolar surface lining fluid with fiberoptic bronchoscopy for evaluating inflammation, immune cells, and soluble substances. BAL plays a vital role in assisting IPF diagnosis and has been recommended as the auxiliary diagnostic reference by the American Thoracic Society (ATS) (21). The advantages of utilizing the gene expression profiles of BAL cells to depict the molecular features of IPF include lung localization, ease of accessibility, and dynamic assessment of disease status through longitudinal sample collection. Previous studies have revealed that Innate and adaptive immune responses disorders possess an important role in the pathogenesis of lung fibrosis (22). The differentially-expressed immune-related genes (IRGs) also have been reported associated with the development of IPF (23,24). The immPort database is funded by the National Institutes of Health (NIH), National Institute of Allergy and Infectious Diseases (NIAID), Health and Human Services (HHS) in support of the NIH mission to share data with the public. It provides information about the immune-related genes of humans. Therefore, using the GSE70866 gene expression data set of the Gene Expression Synthesis (GEO) database and the IRGs list of the ImmPort database, we aim to combine the survival information of IPF patients to establish a new molecular genome feature screening from IRGs, to predict the prognosis of IPF patient. We present the following article in accordance with the STARD reporting checklist (available at https://dx.doi.org/10.21037/atm-21-4545).
Methods
Acquisition and analysis of datasets
Microarray profile data from the GSE70866 gene expression dataset were downloaded from the GEO (http://www.ncbi.nlm.nih.gov/geo/) database. The platform was a GPL14550 Agilent-028004 SurePrint G3 Human GE 8x60K Microarray (Agilent Technologies Inc., California, U.S.). A total of 132 BALF samples, including 20 samples from healthy individuals and 112 samples from IPF patients, were used to analyze the microarray data. All 112 IPF patients had detailed sociodemographic characteristics and complete survival information. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
The criteria of differentially-expressed genes (DEGs) and differentially-expressed immune-related genes (IRGs)
The filtration of DEGs was performed in 112 IPF patients versus healthy individuals. In this study, DEGs between IPF and healthy individuals were defined using a log2 fold change (FC) >1 and an adjusted P value (adj. P) <0.05 as thresholds. A total of 1,793 IRGs were obtained from the ImmPort (https://www.immport.org/shared/genelists) database. Taking the intersection through the Venn algorithm (http://bioinformatics.psb.ugent.be/webtools/Venn/), 52 differentially-expressed IRGs were filtered, which remained and were used as candidates for subsequent analysis.
Construction and validation of the prognostic IRG-based signature
The 112 included patients were randomly divided into a training cohort (50%) and validation cohort (50%) using the random numbers method. The construction of prognostic gene-based signatures was carried out in the training cohort, and verification was performed in the verification cohort. Univariate Cox regression analysis was used to screen for immune genes that were significantly associated with prognosis, with a cut-off of P<0.05. Next, multivariate Cox-regression analysis was performed on the training cohort to further determine the best prognostic IRG signature using the “survival” package (URL: https://github.com/therneau/survival) in R software (version 4.0.3) (URL: https://cran.r-project.org/mirrors.html), with a cut-off of P<0.05. The formula of IPF patient’s risk score was established as follows: score = sum (each gene’s expression × corresponding coefficient). The patients were stratified into high-risk and low-risk groups based on the median value of the risk score. Based on the risk score groups, survival differences between high-risk and low-risk groups were carried out with the “survival” R package (URL: https://github.com/therneau/survival).
Statistical analysis
Baseline characteristics such as age, sex, race, days to death, and vital status were collected. Continuous variables were reported as the mean (± standard deviation) and compared using the Student’s t-test. Categorical variables were reported as counts n (%) and compared using the chi-square test. The comparison of sociodemographic features between the training and validation cohorts was carried out using GraphPad Prism (version 7.0; GraphPad Software, La Jolla, CA, USA).The other statistical analyses were carried out using R software (version 4.0.3) (URL: https://cran.r-project.org/mirrors.html) and considered significant when the corresponding P<0.05. The adjusted P<0.05 was used for screening DEGs, and P<0.05 was used as a significance threshold in the remaining statistical analyses. The analysis of DEGs was conducted by utilizing the “limma” package (URL: http://www.strimmerlab.org/software/st/). Univariate Cox regression analysis was used to screen for DEGs that were significantly associated with overall survival (OS). Multivariate Cox regression analysis was performed on the training cohort to further determine the best prognostic IRG model. A multivariate Cox regression model was conducted for the variables with P<0.05 in the univariate analyses. A gene-based signature was built with the coefficients of each factor in the multivariate Cox analysis. The “survival” package (URL: https://github.com/therneau/survival) calculated the survival curve function, and the “survminer” package (URL: https://mirror.lzu.edu.cn/CRAN/bin/windows/contrib/4.0/survminer_0.4.9.zip) executed the visualization. The heat map was drawn using the “pheatmap” (pretty heatmap) package (URL: https://mirror.lzu.edu.cn/CRAN/bin/windows/contrib/4.0/pheatmap_1.0.12.zip). The volcano map was drawn using the “ggplot2” package (URL: https://cran.r-project.org/web/packages/ggplot2movies/index.html).
Results
Baseline characteristics of patient with IPF
summarizes the sociodemographic information of the included IPF patients. A total of 112 IPF patients were identified, with a median age of 69.5 (±10.1) years. IPF was more common in older populations (67.0% of patients were older than 65 years versus 33.0% of patients less than 6 years). The incidence of IPF was higher in men than in women (83.0% male patients versus 17.0% female patients).
Table 1
The sociodemographic information of patients
Characteristics
Total (n=112)
Training cohort (n=56)
Validation cohort (n=56)
P value
Age, mean (± SD)
67.97 (±10.1)
67.0 (±10.4)
69.0 (±9.7)
0.300
Age, n (%)
<65
37 (33.0)
18 (32.1)
19 (34.0)
≥65
75 (67.0)
38 (67.9)
37 (66.0)
0.841
Gender, n (%)
Female
19 (17.0)
7 (12.5)
12 (21.4)
Male
93 (83.0)
49 (87.5)
44 (78.6)
0.208
Days to death, mean (± SD)
698.1 (±555.9)
656.7 (±551.9)
739.5 (±561.7)
0.433
Vital status, n (%)
Alive
36 (32.1)
20 (35.7)
16 (28.6)
Dead
76 (67.9)
36 (64.3)
40 (71.4)
0.418
Sample contact country, n (%)
Germany
112 (100.0)
56 (100.0)
56 (100.0)
NA
SD, standard deviation.
SD, standard deviation.These 112 IPF patients were randomly divided into training (50%) and validation (50%) cohorts, with 56 patients in each group. No significant differences between the two cohorts were observed in terms of age, sex, days to death, and vital status (P>0.05). Qualified survival information for all of the included IPF patients was available for further analysis.
Identification of DEGs
DEGs of the IPF and healthy individuals from the GPL14550 platform of the GSE70866 gene expression dataset were analysed using the “limma” package. In this dataset, a total of 379 DEGs met the criteria, of which 207 genes were upregulated and 172 genes were downregulated (Table S1). is a volcano map of 379 DEGs in the IPF group compared to the healthy individuals group. The profiling of all the DEGs is shown in and presented in the form of a non-cluster analysis expression heatmap. SPP1, PPBP, and MMP7 were the top three most significantly upregulated genes in the IPF group, while NALCN, C8B, and ITIH5 were the three most downregulated genes in the IPF group.
Figure 1
Comparison of the gene expression profile between the IPF group and the healthy individuals group. (A) Heatmap of significantly DEGs. (B) Volcano map of DEGs; red dots represent upregulated DEGs, grey dots represent non-differentially expressed genes, and green dots represent downregulated DEGs. IPF, idiopathic pulmonary fibrosis; DEGs, differentially-expressed genes.
Comparison of the gene expression profile between the IPF group and the healthy individuals group. (A) Heatmap of significantly DEGs. (B) Volcano map of DEGs; red dots represent upregulated DEGs, grey dots represent non-differentially expressed genes, and green dots represent downregulated DEGs. IPF, idiopathic pulmonary fibrosis; DEGs, differentially-expressed genes.
Identification of differential expression IRGs
Combining the results of DEGs (Table S1) and the IRGs from the ImmPort database, 52 differentially expressed IRGs were identified. A volcano map was constructed to present the differential expression of all IRGs (). shows the expression of the 52 differential IRGs in the form of a heatmap. SPP1, PPBP, TUBB3, CCL2, and S100A12 were the five most significantly upregulated IRGs, while the top five downregulated IRGs were PTGER3, CD40LG, CAMP, IGF1, and CXCL9.
Figure 2
Comparison of the IRG expression profile between the IPF group and the healthy individuals group. (A) Heatmap of significantly differentially-expressed IRGs. (B) Volcano map of IRGs; red dots represent upregulated differentially expressed IRGs, grey dots represent non-differentially expressed IRGs, and green dots represent downregulated differentially expressed IRGs. IRG, immune-related gene; IPF, idiopathic pulmonary fibrosis.
Comparison of the IRG expression profile between the IPF group and the healthy individuals group. (A) Heatmap of significantly differentially-expressed IRGs. (B) Volcano map of IRGs; red dots represent upregulated differentially expressed IRGs, grey dots represent non-differentially expressed IRGs, and green dots represent downregulated differentially expressed IRGs. IRG, immune-related gene; IPF, idiopathic pulmonary fibrosis.
Prognostically relevant IRGs filtration
Prognostically relevant IRGs for IPF were selected based on the results of univariate Cox regression analysis. A forest plot was drawn to show the 37 obtained prognostically relevant IRGs, including prognostically protective IRGs such as RORA [hazard ratio (HR): 0.613, 95% confidence interval (CI): (0.474–0.794)] and ICOS [HR: 0.672, 95% CI: (0.560–0.809)] (). Conversely, MPO [HR: 1.287, 95% CI: (1.139–1.454)], RNASE3 [HR: 1.711, 95% CI: (1.338-2.188)], PDGFA [HR: 1.228, 95% CI: (1.030–1.465)], PPBP [HR: 1.154, 95% CI: (1.002–1.330)], and FABP3 [HR: 1.522, 95% CI: (1.216–1.905)] were prognostic factors of worse survival ().
Figure 3
Forest plot of the differentially-expressed IRGs related to prognosis. IRGs, immune-related genes.
Forest plot of the differentially-expressed IRGs related to prognosis. IRGs, immune-related genes.
An IRGs prognostic model of IPF
Multivariate Cox regression analysis was performed based on 37 prognostic factors of OS to establish a model to predict the outcomes of IPF patients. CXCL14, SLC40A1, RNASE3, CCR3, and RORA were ultimately identified to build a five-IRG-based prognostic signature to predict the survival time of patients with IPF in the training cohort.shows the survival outcomes of IPF patients stratified by CXCL14, SLC40A1, RNASE3, CCR3, and RORA. The survival curve revealed that IPF patients with higher expression levels of CXCL14, SLC40A1, RNASE3, and CCR3 had much worse survival outcomes. Patients with a relatively lower expression of RORA had markedly longer OS.
Figure 4
OS of patients with IPF stratified by the genes included in our novel signature, including (A) CXCL14, (B) SLC40A1, (C) RNASE3, (D) CCR3, and (E) RORA. OS, overall survival; IPF, idiopathic pulmonary fibrosis.
OS of patients with IPF stratified by the genes included in our novel signature, including (A) CXCL14, (B) SLC40A1, (C) RNASE3, (D) CCR3, and (E) RORA. OS, overall survival; IPF, idiopathic pulmonary fibrosis.Detailed results of the multivariate Cox regression analysis, including coefficients, P values, hazard ratios, etc., are provided in Table S2. Accordingly, the patient’s risk score representing the risk for OS was calculated as follows: risk score = 0.1970 × expression value of CXCL14 + 0.3280 × expression value of SLC40A1 + 0.5852× expression value of RNASE3 + 0.2802 × expression value of CCR3 − 0.6504 × expression value of RORA. According to the median risk score, IPF patients were divided into high- and low-risk groups. Individuals with risk scores beyond 0.711 were recognized as high-risk; otherwise, they were considered low-risk (, Table S3). There was a significant decrease in the OS of IPF patients as the risk score increased (). displays the expression level of the five IRGs between the high- and low-risk groups. As shown in , CXCL14, SLC40A1, RNASE3, and CCR3 were more highly expressed, while RORA expression exhibited relatively lower expression in the high-risk IPF patients than in the low-risk individuals. The survival curve constructed by the five-IRG-based prognostic signature in the training cohort showed that there was an extremely significant difference between the high- and low-risk groups (). A validation cohort was utilized to verify the five-IRG-based signature, and notable differential survival outcomes were observed between the high- and low-risk groups (). The area under curves (AUC) of the five-IRG-based prognostic signature for IPF in the training model was 0.858 (). The AUC of this predictive five-gene-based signature in the validation was 0.837 (), indicating that this predictive signature could be trusted.
Figure 5
The risk score could effectively predict IPF patient prognosis. (A) Scatter plot of the risk score distribution of the samples. One point refers to a sample, red points are samples with higher risk scores, green points are samples with lower risk scores, and the intersecting point represents the median risk score. (B) Scatter plot of the survival outcome distribution of the samples. One point refers to a sample, red points represent live samples, green points represent dead samples with lower risk scores, and the intersecting point represents the median risk score. (C) Heatmap of signature-based genes (CXCL14, SLC40A1, RNASE3, CCR3, and RORA) between the high- and low-risk groups. IPF, idiopathic pulmonary fibrosis.
Figure 6
Signature of predicting survival probability for IPF patients. (A) Survival curve of the risk score distribution of the training cohort, which also shows the 1-, 2-, 3-, 4-, 5-, and 6-year survival rates of IPF patients. (B) Survival curve of the risk score distribution of the validation cohort, which also shows the 1-, 2-, 3-, 4-, 5-, and 6-year survival rates of IPF patients. (C) ROC curve of the signature in the training cohort. (D) ROC curve of the signature in the validation cohort. IPF, idiopathic pulmonary fibrosis; ROC, receiver operating characteristic.
The risk score could effectively predict IPF patient prognosis. (A) Scatter plot of the risk score distribution of the samples. One point refers to a sample, red points are samples with higher risk scores, green points are samples with lower risk scores, and the intersecting point represents the median risk score. (B) Scatter plot of the survival outcome distribution of the samples. One point refers to a sample, red points represent live samples, green points represent dead samples with lower risk scores, and the intersecting point represents the median risk score. (C) Heatmap of signature-based genes (CXCL14, SLC40A1, RNASE3, CCR3, and RORA) between the high- and low-risk groups. IPF, idiopathic pulmonary fibrosis.Signature of predicting survival probability for IPF patients. (A) Survival curve of the risk score distribution of the training cohort, which also shows the 1-, 2-, 3-, 4-, 5-, and 6-year survival rates of IPF patients. (B) Survival curve of the risk score distribution of the validation cohort, which also shows the 1-, 2-, 3-, 4-, 5-, and 6-year survival rates of IPF patients. (C) ROC curve of the signature in the training cohort. (D) ROC curve of the signature in the validation cohort. IPF, idiopathic pulmonary fibrosis; ROC, receiver operating characteristic.
Correlation expression map of the five genes included in the predictive signature
A correlation map of the five included prognostic IRGs expression levels is described in . The strongest expression correlations were observed between RNASE3 and SLC40A1 (P<0.01, r=0.394), as well as between RORA and CXCL14 (P<0.01, r=−0.355). Meanwhile, the expression level of CCR3 was significantly positively correlated with the expression of CXCL14 (P<0.01, r=0.258). There was an intimate positive association between RNASE3 and CCR3 (P<0.01, r=0.293).
Figure 7
Gene co-expression network of 5 genes: CXCL14, SLC40A1, RNASE3, CCR3, and RORA.
Gene co-expression network of 5 genes: CXCL14, SLC40A1, RNASE3, CCR3, and RORA.
Discussion
IPF is the most prevalent subtype of interstitial lung disease (ILD) worldwide (25). However, it has the poorest prognosis among the various ILD subtypes, with a median survival of 2–3 years after diagnosis (3,4). Lung transplantation is the only intervention that has been shown to prolong survival for patients with IPF (26). Pirfenidone and nintedanib have emerged as effective therapies that can significantly slow the decline in forced vital capacity (FVC) and disease progression in IPF patients (27,28). However, the prognosis of IPF remains unfavourable. The poor prognosis of IPF is partly due to a lack of effective prognostic biomarkers to guide treatment. Without the ability to forecast disease progression, it is difficult to determine which IPF patients are likely to benefit from new therapies or lung transplantation. Therefore, we constructed a molecular genomic signature to predict the prognosis of IPF patients using the GSE70866 gene expression dataset from the GEO database.Previous studies have revealed that the immune system possesses an actual effect on the IPF process (22,29,30). All stages of fibrogenesis are accompanied by innate and adaptive immune responses (22). More importantly, increasing evidence has appeared over the last few years establishing the meaningful role of IRGs in the pathogenesis and treatment of lung fibrosis (23,24,31,32). It has been shown that regulating the expression of IRGs can ameliorate pulmonary fibrogenesis in bleomycin-induced (BLM-induced) mouse models (31,32). Furthermore, data from clinical trials of newly developed drugs for the treatment of IPF have demonstrated the active role of IRG-targeting drugs in slowing disease progression. For instance, IRG-targeting drugs have been shown to play a positive role in reducing fibrogenesis (33). These previous studies highlight the importance of IRGs in the pathophysiological mechanism of IPF. In the present study, we were interested in the role of IRGs in the prognosis of IPF.In total, 112 IPF patients and 20 healthy individuals were included in our study. The included IPF patients were predominantly older males (aged >65 years old). This demographic feature, as well as the fact that the prevalence of IPF is higher in men than in women, are consistent with previous studies (1,3). In this comparative microarray profile of an IPF cohort versus a healthy individual cohort, a total of 379 DEGs were identified. The genes involved in encoding extracellular matrix (ECM) components, tissue architecture remodeling, and ECM accumulation (SPP1, MMP7, MMP10, CCL2, and ITGB3) were observed to be significantly upregulated (34-37). Of the 379 DEGs, 52 were filtered as IRGs based on the ImmPort database. Next, 37 of these 52 differentially-expressed IRGs were recognized as significant prognostic biomarkers for patients with IPF. More than 70% of the differentially-expressed IRGs had notable associations with survival. Our results further suggested that there was a close association between IRGs and the progression of IPF, which was consistent with previous studies. Based on these findings, a five IRG-based prognostic signature (CXCL14, SLC40A1, RNASE3, CCR3, and RORA), was built in the training cohort in this study. This signature presented an excellent predictive prognostic effect, with an AUC value of 0.858. In addition, the risk score was significantly different between the high- and low-risk groups. Meanwhile, the risk score was significantly correlated with the OS of IPF patients. CXCL14, SLC40A1, CXCL14, and CCR3 were differentially-upregulated genes between IPF patients and healthy individuals. The expression levels of these four genes in the high-risk IPF group were significantly higher than those in the low-risk group. RORA was detected at a lower expression level in the healthy individuals group compared to the IPF group. Consistently, the expression level of RORA was lower in the high-risk IPF group than in the low-risk group.Fibroblast foci represent the main pathogenic lesions of IPF, including abnormally activated fibroblasts and myofibroblasts. Myofibroblasts are the main effector cells of IPF. They can secrete a large amount of ECM protein and promote the abnormal hardening of ECM, which leads to the remodeling of lung structure and the gradual loss of lung function (38-40). Previous studies have confirmed that knockdown of CXCL14 could inhibit lung fibrogenesis by suppressing lung fibroblasts proliferation and downregulating MMP2/9 (31). Zagai et al. found eosinophil cationic protein (ECP, also known as RNASE3) could stimulate human lung fibroblasts to secrete extracellular matrix, thereby leads to airway fibrosis (41). The concentration of RNASE3 in bronchoalveolar lavage fluid (BALF) is markedly increased in IPF patients compared with healthy individuals and is highly correlated with acute exacerbation during the preceding 3- to 6-month period (42,43). CCR3 can increase the activation, migration and proliferation ability of lung fibroblasts, and the ability of myofibroblasts to secrete ECM protein (44,45). In addition, CCR3 is notably expressed in the lungs of BLM-induced mice and is expressed not only by eosinophils but also by neutrophils (44). CCR3 plays a key role in the recruitment of granulocytes and is an important suppressor of fibrogenesis in BLM-treated lungs (44). These studies on the pathophysiological mechanisms between IPF and CXCL14, RNASE3, and CCR3 increase the credibility of the signature constructed in our study. Our research also showed that there is a meaningful correlation between the expression of RNASE3 and CCR3. Meanwhile, a significant expression correlation between CXCL14 and CCR3 was also observed in this study. For the SLC40A1 and RORA, no relevant studies have been conducted to determine the association with lung fibrosis. We first reported that there may be some potential associations between the pathological mechanism of IPF and SLC40A1 along with RORA. The specific pathophysiological mechanism is worthy of further study.Finally, we evaluated the performance of the genomic signature in the validation cohort. The signature showed an equally excellent ability to distinguish between high- and low-risk patient groups. The AUC value of the Receiver Operating Characteristic Curve (ROC) curve was 0.837, demonstrating the potential applicability of our findings for real-world use.While the genomic model developed in this study was successfully validated, there were still some potential limitations that should be noted. Firstly, this research was based on the gene expression profiles from the GEO database. Due to the difficult of recruitment of a large number of IPF patients, no validation of the 5 genes in real world data in this paper. Also, the IPF patients included in this study were all from Germany. Thus, our results might only represent patients in Germany and might not applicable to all IPF patients worldwide. Finally, due to limited data on treatment, our study did not subgroup IPF patients according to the different treatment choices. Consequently, the reliability and accuracy of our results might be affected and needs to be re-evaluated by future studies.
Conclusions
In conclusion, our study identified a novel five-IRG-based signature that is a reproducible predictor of outcome in IPF patients. This novel signature benefits the personalized management of patients with IPF. Furthermore, this finding provides new insights into the relationship between the immune system and IPF, offering incremental clinical value for IPF prognosis and therapy.The article’s supplementary files as
Authors: Chih-Hsi S Kuo; Stelios Pavlidis; Jie Zhu; Matthew Loza; Fred Baribaud; Anthony Rowe; Ioannis Pandis; David Gibeon; Uruj Hoda; Ana Sousa; Susan J Wilson; Peter Howarth; Dominick Shaw; Stephen Fowler; Barbro Dahlen; Pascal Chanez; Norbert Krug; Thomas Sandstrom; Louise Fleming; Julie Corfield; Charles Auffray; Ratko Djukanovic; Peter J Sterk; Yike Guo; Ian M Adcock; Kian Fan Chung Journal: Allergy Date: 2019-02-11 Impact factor: 13.146
Authors: Martina Korfei; Daniel von der Beck; Ingrid Henneke; Philipp Markart; Clemens Ruppert; Poornima Mahavadi; Bahil Ghanim; Walter Klepetko; Ludger Fink; Silke Meiners; Oliver Holger Krämer; Werner Seeger; Carlo Vancheri; Andreas Guenther Journal: J Proteomics Date: 2013-05-06 Impact factor: 4.044
Authors: Ganesh Raghu; Martine Remy-Jardin; Jeffrey L Myers; Luca Richeldi; Christopher J Ryerson; David J Lederer; Juergen Behr; Vincent Cottin; Sonye K Danoff; Ferran Morell; Kevin R Flaherty; Athol Wells; Fernando J Martinez; Arata Azuma; Thomas J Bice; Demosthenes Bouros; Kevin K Brown; Harold R Collard; Abhijit Duggal; Liam Galvin; Yoshikazu Inoue; R Gisli Jenkins; Takeshi Johkoh; Ella A Kazerooni; Masanori Kitaichi; Shandra L Knight; George Mansour; Andrew G Nicholson; Sudhakar N J Pipavath; Ivette Buendía-Roldán; Moisés Selman; William D Travis; Simon Walsh; Kevin C Wilson Journal: Am J Respir Crit Care Med Date: 2018-09-01 Impact factor: 21.405
Authors: Johnny Luo; Trisiah Tugade; Emmy Sun; Ana Maria Pena Diaz; David B O'Gorman Journal: J Cell Commun Signal Date: 2022-04-12 Impact factor: 5.782
Authors: Sabina A Guler; Sara Cuevas-Ocaña; Mouhamad Nasser; Wim A Wuyts; Marlies S Wijsenbeek; Antoine Froidure; Elena Bargagli; Elisabetta A Renzoni; Marcel Veltkamp; Paolo Spagnolo; Hilario Nunes; Cormac McCarthy; Maria Molina-Molina; Francesco Bonella; Venerino Poletti; Michael Kreuter; Katerina M Antoniou; Catharina C Moor Journal: ERJ Open Res Date: 2022-05-23