Literature DB >> 33028329

Development and validation of a robust immune-related prognostic signature in early-stage lung adenocarcinoma.

Pancheng Wu1, Yi Zheng2, Yanyu Wang1, Yadong Wang1, Naixin Liang3.   

Abstract

BACKGROUND: The incidence of stage I and stage II lung adenocarcinoma (LUAD) is likely to increase with the introduction of annual screening programs for high-risk individuals. We aimed to identify a reliable prognostic signature with immune-related genes that can predict prognosis and help making individualized management for patients with early-stage LUAD.
METHODS: The public LUAD cohorts were obtained from the large-scale databases including 4 microarray data sets from the Gene Expression Omnibus (GEO) and 1 RNA-seq data set from The Cancer Genome Atlas (TCGA) LUAD cohort. Only early-stage patients with clinical information were included. Cox proportional hazards regression model was performed to identify the candidate prognostic genes in GSE30219, GSE31210 and GSE50081 (training set). The prognostic signature was developed using the overlapped prognostic genes based on a risk score method. Kaplan-Meier curve with log-rank test and time-dependent receiver operating characteristic (ROC) curve were used to evaluate the prognostic value and performance of this signature, respectively. Furthermore, the robustness of this prognostic signature was further validated in TCGA-LUAD and GSE72094 cohorts.
RESULTS: A prognostic immune signature consisting of 21 immune-related genes was constructed using the training set. The prognostic signature significantly stratified patients into high- and low-risk groups in terms of overall survival (OS) in training data set, including GSE30219 (HR = 4.31, 95% CI 2.29-8.11; P = 6.16E-06), GSE31210 (HR = 11.91, 95% CI 4.15-34.19; P = 4.10E-06), GSE50081 (HR = 3.63, 95% CI 1.90-6.95; P = 9.95E-05), the combined data set (HR = 3.15, 95% CI 1.98-5.02; P = 1.26E-06) and the validation data set, including TCGA-LUAD (HR = 2.16, 95% CI 1.49-3.13; P = 4.54E-05) and GSE72094 (HR = 2.95, 95% CI 1.86-4.70; P = 4.79E-06). Multivariate cox regression analysis demonstrated that the 21-gene signature could serve as an independent prognostic factor for OS after adjusting for other clinical factors. ROC curves revealed that the immune signature achieved good performance in predicting OS for early-stage LUAD. Several biological processes, including regulation of immune effector process, were enriched in the immune signature. Moreover, the combination of the signature with tumor stage showed more precise classification for prognosis prediction and treatment design.
CONCLUSIONS: Our study proposed a robust immune-related prognostic signature for estimating overall survival in early-stage LUAD, which may be contributed to make more accurate survival risk stratification and individualized clinical management for patients with early-stage LUAD.

Entities:  

Keywords:  Immune-related genes; Lung adenocarcinoma; Overall survival; Prognosis; Risk score

Mesh:

Substances:

Year:  2020        PMID: 33028329      PMCID: PMC7542703          DOI: 10.1186/s12967-020-02545-z

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   5.531


Background

Lung cancer is the leading cause of death from cancer. In the United States, there will be approximately 228,820 newly diagnosed cases and 135,720 deaths in 2020 [1]. Lung adenocarcinoma (LUAD) is the most common histological type and accounts for nearly 60% of non-small cell lung cancer (NSCLC), which comprises approximately 85% of lung cancer [2-4]. Surgical lobectomy remains the preferred treatment strategy for patients with operable early-stage LUAD [5]. Although patients with early-stage LUAD have a relatively superior prognosis, nearly 10–44% of these patients still die within 5 years after surgical intervention [6, 7]. Recently, several studies revealed that adjuvant chemotherapy brought a clear 5-year survival benefit ranged from 4 to 10% for patients with resected stage II LUAD and can be considered for stage IB LUAD with primary tumor more than 4 cm [8-11], but not for patients with stage IA because of the potential detrimental effect [12]. Thus, besides the traditional clinical factors, it is imperative to develop a novel prognostic signature to perform personalized survival risk stratification and identify the high-risk early-stage patients who might benefit from additional systemic therapy. In recent years, numerous studies have reported prognostic signatures to make survival stratification and predict prognosis for patients with LUAD using genomics and transcriptomics data [13-15]. Unfortunately, the signatures proposed by these studies have not been incorporated into clinical practice owing to the problems such as small sample size and insufficient independent validation [16, 17]. Nowadays, the available public, large-scale databases containing enough gene expression data, such as TCGA (The Cancer Genome Atlas) and GEO (Gene Expression Omnibus) database, bring the opportunity to make more reliable prognostic signatures for lung cancer [18]. Immune system has been shown to play a crucial role in cancer initiation and progression [19, 20]. In addition, avoiding immune destruction has been accepted as a novel hallmark of cancer [21]. Recently, immunotherapies have achieved a notably and durable response in LUAD by targeting specific immune checkpoints like PD-1 or PD-L1 [22, 23]. Several studies have reported immune-related gene signatures which could predict prognosis and provide potential targets for immunotherapy in patients with LUAD [24-26]. However, few prognostic models have focused on immune-related genes in early-stage LUAD. In this study, we used the gene expression data sets from GEO and TCGA to develop and validate a prognostic prediction model for early-stage LUAD based on immune-related genes. A novel 21-gene based prognostic immune signature with robust prediction power for early-stage LUAD was developed, which allows clinicians to evaluate the prognosis of patients with early-stage LUAD and might provide promise for individualized therapeutic interventions.

Methods

Data preprocessing

We downloaded four independent NSCLC microarray data sets from GEO database (https://www.ncbi.nlm.nih.gov/geo/) using the GEOquery package [27]. Only early-stage LUAD patients were included. Patients without survival status or whose overall survival time shorter than 30 days were removed from the study. Among these data sets, the gene expression data of GSE30219 [28], GSE31210 [29, 30] and GES50081 [31] were generated by the same platform GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array). These data sets were defined as training set and selected to screen for the candidate prognostic genes, while GSE72094 [32], another microarray data set, was chose for independent validation. Besides, The gene expression data and corresponding clinical information of TCGA-LUAD cohort, a RNA-seq data set, were downloaded by the UCSC Xena platform [33, 34], which was used for another independent validation. The general information of these datasets was summarized in Additional file 1: Table S1. The gene expression data of GEO and TCGA-LUAD data sets were normalized by the limma and DESeq2 package, respectively [35, 36]. Overall, a total of 1091 patients were enrolled in our study, including 82 patients from GSE30219, 204 patients from GSE31210, 127 patients from GSE50081, 311 patients from GSE72094 and 367 patients from TCGA-LUAD. The baseline characteristics of the patients enrolled in the study were described in Additional file 2: Table S2.

Development of the prognostic gene signature

We constructed the prognostic gene signature by focusing on the immune-related genes, which were downloaded from the InnateDB database (https://innatedb.com/) [37]. The list of the immune-related genes was summarized in Additional file 3: Table S3. The flow chart of this study was presented in Fig. 1. Firstly, univariate cox proportional hazards regression model was performed to screen for the candidate prognostic genes (p < 0.05) associated with OS in GSE30219, GSE31210 and GSE50081 cohort, respectively. Candidate genes with Hazard ratio (HR) > 1 were considered as risky prognostic genes, while HR < 1 as protective prognostic genes. The overlapped candidate prognostic genes were selected to develop the prognostic signature based on risk score model. In addition, the three microarray data sets were merged into 1 combined data set for further analysis.
Fig. 1

Flow chart of this study

Flow chart of this study Then a risk score for each patient was established based on a linear combination of the overlapped candidate prognostic genes expression levels weighted by the regression coefficient (β) derived from the univariate cox regression analysis [38, 39]. The risk score formula was defined as the following: The n, exp and β in the above formula represent the number of prognostic genes, the expression value and the coefficient of gene i, respectively [40]. Optimal cutoff value of the risk score in each data set was determined by the survminer package in R [41]. According to the cutoff value, patients were classified into high- and low-risk groups.

Evaluation of the immune-related prognostic signature

To assess the prognostic value of this prognostic signature, we firstly estimated the survival curves between the high- and low-risk groups by the Kaplan–Meier method using the survival and survminer package in GSE30219, GSE31210, GSE50081 and the combined data set, respectively [41, 42], with log-rank test to determine the statistical significances in OS between two groups. Meanwhile, time-dependent receiver operating characteristic (ROC) curve was conducted to evaluate the performance of this signature by calculating the area under the ROC curves (AUC) using timeROC package [43]. Furthermore, the same risk score formula was employed on GSE72094 and TCGA-LUAD cohort, which were served as independent validation data sets, to further evaluate and validate the efficiency of this signature.

Functional annotation and enrichment analysis

To acquire the potential biological processes of the overlapped prognostic genes, Gene Ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed using clusterProfiler package [44].

Statistical analysis

All statistical analyses were performed using R (version 3.6.2; R Foundation for Statistical Computing) [45] and RStudio (version 1.2.1335) (https://rstudio.com/). To investigate whether the gene signature was an independent prognostic factor for early-stage LUAD, univariate analysis was performed to evaluate the association of the gene signature and other clinical parameters with overall survival. Risk factors (p < 0.2) derived from univariate analysis were selected for further analysis in multivariate cox regression model [46, 47]. Heatmap was generated using the pheatmap package [48]. The detailed information of the system, software and packages using in the study were summarized in Additional file 4: Table S4. p < 0.05 deemed statistically significant.

Results

Identification of 21 immune-related prognostic genes in the training set

A total of 1091 patients with early-stage LUAD (533 men [49%], 558 women [51%]; median age [range], 66 [30-89] years), including 413 patients in the training set and 678 patients in the validation set, were enrolled in the analysis. Among 1051 immune-related genes from the innateDB database, 920 genes were measured in the training set. Under the cutoff value of p < 0.05, 173 genes in GSE30219, 300 genes in GSE31210 and 146 genes in GSE50081 were identified as candidate prognostic genes which were significantly associated with OS. After overlapping these prognostic genes among these data sets, 21 overlapped genes were finally screened, including 14 risky genes and 7 protective genes. The general information of the overlapped genes and corresponding coefficients were summarized in Additional file 5: Table S5.

Development of the 21-gene based immune-related prognostic signature

We calculated the 21-gene based risk score for each patient in the training set using the risk score formula (Additional file 6: Table S6). Patients were classified into high- and low-risk groups using the optimal cutoff analyzed by the survminer package. The cutoff value in each cohort was summarized in Additional file 7: Table S7. The distribution of the risk scores, survival status and the expression levels of the 21 genes in the training set were shown in Additional file 8: Figure. S1. Kaplan–Meier survival curves revealed that patients in the high-risk group shown significantly poorer OS than patients in the low-risk group (Fig. 2a). Moreover, the AUCs for 1-year, 3-year and 5-year were 0.75, 0.80 and 0.82 in GSE30219, 0.78, 0.75 and 0.81 in GSE31210 and 0.73, 0.75 and 0.74 in GSE50081, respectively (Fig. 2b), suggesting that this 21-gene signature achieved a relatively high performance for early-stage LUAD survival prediction. Furthermore, we conducted survival analysis in the combined data set to assess the reliability of this signature. Consistent with the results of single data set in the training set, Kaplan–Meier curve showed that patients in the high-risk group exhibited shorter OS than those in the low-risk group (p < 0.0001) (Fig. 2a). The AUCs for 1-year, 3-year and 5-year were 0.66, 0.66 and 0.70, respectively (Fig. 2b), implying that this gene signature also had a good performance for prognosis prediction in the combined data set.
Fig. 2

Correlation between the 21-gene signature and Overall survival in the training set (early-stage LUAD). a Kaplan–Meier survival curves between high- and low-risk groups, b ROC curves for 1-year, 3-year and 5-year survival prediction by the 21-gene signature

Correlation between the 21-gene signature and Overall survival in the training set (early-stage LUAD). a Kaplan–Meier survival curves between high- and low-risk groups, b ROC curves for 1-year, 3-year and 5-year survival prediction by the 21-gene signature

External validation of the 21-gene prognostic signature

To further validate the robustness of the 21-gene signature, the risk score for each patient was calculated using the same risk score formula in two independent data sets, including TCGA-LUAD and GSE72094 cohort. We divided the patients into high- and low-risk group according to the optimal cutoff. Consistent with the results in the training set, patients of high-risk group shown conspicuously poorer OS than those of low-risk group in both TCGA-LUAD and GSE72094 cohort (p < 0.0001, Fig. 3a). The AUCs for 1-year, 3-year and 5-year were 0.61, 0.66 and 0.62 in TCGA-LUAD, and 0.70, 0.64 and 0.94 in GSE72094 (Fig. 3b), which implies that the prognostic signature has a valid performance for OS prediction in validation data sets. The distribution of the risk scores, survival status and the expression levels of the 21 genes in the validation set were shown in Additional file 8: Figure. S1. The data for Kaplan–Meier survival analysis and ROC analysis were summarized in Additional file 9: Table S8. Taken together, these results suggest that this 21-gene based prognostic signature is robust in prognosis prediction for early-stage LUAD and can be used in both microarray and RNA-sequencing data sets.
Fig. 3

Correlation between the 21-gene signature and Overall survival in the validation set (early-stage LUAD). a Kaplan–Meier survival curves between high- and low-risk groups in TCGA-LUAD and GSE72094 cohort, respectively. b ROC curves for 1-year, 3-year and 5-year survival prediction by the 21-gene signature in TCGA-LUAD and GSE72094 cohort, respectively

Correlation between the 21-gene signature and Overall survival in the validation set (early-stage LUAD). a Kaplan–Meier survival curves between high- and low-risk groups in TCGA-LUAD and GSE72094 cohort, respectively. b ROC curves for 1-year, 3-year and 5-year survival prediction by the 21-gene signature in TCGA-LUAD and GSE72094 cohort, respectively

The 21-gene prognostic signature is an independent prognostic factor

Univariate and multivariate cox analysis were performed in both training and validation sets to investigate whether this 21-gene prognostic signature could be served as an independent prognostic factor for patients with early-stage LUAD. The prognostic signature and other available clinicopathological factors were included for analysis. Univariate regression analysis indicated that the prognostic signature was significantly associated with OS for early-stage LUAD in GSE30219 (HR = 4.31, 95% CI 2.29–8.11, P = 6.16E−06), GSE31210 (HR = 11.91, 95% CI 4.15–34.19; P = 4.10E−06), GSE50081 (HR = 3.63, 95% CI 1.90–6.95; P = 9.95E−05), combined data set (HR = 3.15, 95% CI 1.98–5.02; P = 1.26E−06) (Table 1), TCGA-LUAD (HR = 2.16, 95% CI 1.49–3.13; P = 4.54E−05) and GSE72094 (HR = 2.95, 95% CI 1.86–4.70; P = 4.79E−06) (Table 2). Then, risk factors (P < 0.2) derived from the univariate analysis were selected for further multivariate analysis. The results shown that there was a significantly association between the prognostic signature and OS in GSE30219 (HR = 5.01, 95% CI 2.50–10.06; P = 5.75E−06), GSE31210 (HR = 8.82, 95% CI 2.86–27.14; P = 1.48E−04), GSE50081 (HR = 2.74, 95% CI 1.37–5.46; P = 4.24E−03), combined data set (HR = 3.01, 95% CI 1.86–4.85; P = 6.44E−06) (Table 1), TCGA-LUAD (HR = 1.91, 95% CI 1.28–2.85; P = 1.55E−03) and GSE72094 (HR = 2.94, 95% CI 1.81–4.79; 1.47E−05) (Table 2). These results demonstrated that the 21-gene based prognostic signature was an independent prognostic factor for patients with early-stage LUAD in both training set and validation set after adjusting for other clinical and pathologic factors.
Table 1

Univariate and multivariate Cox regression analyses of the 21-gene signature and OS in the training set (early-stage LUAD)

VariablesPatients (N)Univariate analysisMultivariate analysis
HR95% CIp valueHR95% CIp value
GSE30219
 Age > 65/ ≤ 6523/591.7650.946–3.2957.42E−022.361.179–4.7241.53E−02
 GenderMale/female64/181.1480.532–2.4787.24E−01
 T stageT2/T112/692.0461.019–4.1074.40E−020.9820.352–1.949.63E−01
 Risk scoreHigh/low34/484.3082.287–8.1146.16E−065.0122.498–10.0585.75E−06
GSE31210
 Age > 65/ ≤ 6547/1572.7791.349–5.7245.59E−033.1721.445–6.9644.01E−03
 GenderMale/female95/1091.6860.818–3.4761.57E−011.0330.347–3.0789.54E−01
 StageII/I42/4624.2972.092–8.8287.21E−052.4441.152–5.1861.99E−02
 SmokingYes/no99/1051.9080.918–3.9668.35E−021.7260.565–5.2713.38E−01
 ALK fusion ± 7/1971.0570.144–7.7779.56E−01
 EGFR mutation ± 116/880.460.222–0.9563.76E−021.6580.512–5.3753.99E−01
 KRAS mutation ± 19/1850.9920.3–3.2829.89E−01
 Triple negativeYes/no62/1420.4360.212–0.8942.34E−020.5430.174–1.6982.94E−01
 MYC copyHigh/low16/1870.7670.183–3.220.716942752
 Risk scoreHigh/low81/12311.9144.151–34.1934.10E−068.8172.864–27.141.48E−04
GSE50081
 Age > 65/ ≤ 6587/401.4550.774–2.7352.44E−01
 GenderMale/female65/621.410.807–2.4632.28E−01
 StageII/I35/922.4431.383–4.3162.10E−031.9051.068–3.3992.90E−02
 T stageT2/T182/302.4351.214–4.8831.22E−021.6250.785–3.3651.91E−01
 SmokingYes/no92/231.6820.75–3.7762.07E−01
 Risk scoreHigh/low72/553.6321.897–6.9539.95E−052.7361.373–5.4554.24E−03
Combined data set
 Age > 65/ ≤ 65157/2562.3141.621–3.3043.82E−062.4141.686–3.4561.49E−06
 GenderMale/female224/1891.5741.081–2.2911.80E−021.5821.084–2.3071.73E−02
 StageII/I90/3232.791.934–4.0264.08E−082.0651.42–3.0041.49E−04
 Risk scoreHigh/low270/1433.1531.981–5.0161.26E−063.0051.863–4.8466.44E−06
Table 2

Univariate and multivariate Cox regression analyses of the 21-gene signature and OS in the validation set (early-stage LUAD)

VariablesPatients (N)Univariate analysisMultivariate analysis
HR95% CIp valueHR95% CIp value
GSE72094
Age > 65/ ≤ 65219/921.6380.952–2.8177.47E−021.8071.042–3.1353.52E−02
 GenderMale/female141/1701.4490.917–2.2891.12E−011.3860.871–2.2071.69E−01
 StageII/I65/2462.1371.313–3.4792.24E−032.2581.38–3.6951.20E−03
 SmokingYes/no237/261.3340.482–3.6965.79E−01
 EGFR mutationYes/no35/2760.0920.013–0.6591.76E−020.1370.019–1.0045.05E−02
 KRAS mutationYes/no105/2061.7011.075–2.6912.33E−021.5540.976–2.4756.33E−02
 TP53 mutationYes/no77/2341.4610.893–2.3921.31E−011.1540.69–1.9325.86E−01
 STK11 mutationYes/no48/2630.7990.41–1.5595.11E−01
 Risk scoreHigh/low89/2222.9521.857–4.6954.79E−062.9421.806–4.7931.47E−05
TCGA-LUAD
 Age > 65/ ≤ 65190/1681.3080.897–1.9071.62E−011.5131.018–2.2494.05E−02
 GenderMale/female168/1991.0690.739–1.5467.25E−01
 StageII/I113/2542.6071.795–3.7854.84E−072.1861.444–3.3092.19E−04
 T stageT2/T1201/1411.4230.942–2.1479.35E−021.1310.775–1.8134.34E−01
 SmokingYes/no273/490.9230.54–1.5797.70E−01
 Risk scoreHigh/low141/2262.161.492–3.1284.54E−051.9091.279–2.8491.55E−03
Univariate and multivariate Cox regression analyses of the 21-gene signature and OS in the training set (early-stage LUAD) Univariate and multivariate Cox regression analyses of the 21-gene signature and OS in the validation set (early-stage LUAD)

Prognosis prediction by combining the 21-gene prognostic signature with stage

Multivariate analysis revealed that the prognostic signature and stage were both independent prognostic factors in the combined data set, suggesting a complementary value. Therefore, we attempted to develop an integrated prognostic model for survival prediction by combining the prognostic signature with tumor stage in the combined data set. Based on the risk and stage, patients were classified into six groups: group 1 (stage IA with low-risk), group 2 (stage IA with high-risk), group 3 (stage IB with low-risk), group 4 (stage IB with high-risk), group 5 (stage II with low-risk) and group 6 (stage II with high-risk) (Fig. 4). Kaplan–Meier survival analysis were performed between different groups. The results revealed that patients in group 2, group3, group 4, group5 and group 6 had worse prognosis compared with patients in group 1, with group 1 exhibited the best prognosis and group 6 showed the worst (Fig. 4). Nevertheless, there was no significant difference between patients in group 2 and group 3/4/5 (Fig. 4). These results display that patients of stage IA with high-risk have similar prognosis to those of stage IB and stage II with low-risk, suggesting adjuvant chemotherapy might be beneficial for stage IA LUAD with high-risk. Additionally, patients of early-stage LUAD could be divided into six different groups based on the stage and prognostic signature, which might be a more precise scheme to predict prognosis for patients with early-stage LUAD in the future practice.
Fig. 4

Kaplan–Meier curves of overall survival for patients grouped by stage and 21-gene signature combination (early-stage LUAD)

Kaplan–Meier curves of overall survival for patients grouped by stage and 21-gene signature combination (early-stage LUAD)

Functional annotation and enrichment analysis of the 21-gene prognostic signature

To identify the underlying biological processes and pathways within this 21-gene signature, we performed GO enrichment and KEGG pathway analysis. The results indicated these genes were mainly enriched in biological processes such as positive regulation of cytokine production (GO:0001819), regulation of immune effector process (GO:0002697) and intrinsic apoptotic signaling pathway (GO:0097193) (Fig. 5). In addition, KEGG analysis revealed that several pathways like viral carcinogenesis, proteoglycans in cancer and Fc-gamma R-mediated phagocytosis (Fig. 5) were enriched among these genes.
Fig. 5

Functional enrichment analysis of the 21 prognostic genes. a Gene Ontology analysis, b Kyoto Encyclopedia of Genes and Genomes pathway analysis

Functional enrichment analysis of the 21 prognostic genes. a Gene Ontology analysis, b Kyoto Encyclopedia of Genes and Genomes pathway analysis

Discussion

Previous studies have reported different prognostic biomarkers for patients with early-stage LUAD [28, 49–51]. However, none of these studies focused on the immune-related genes in prognosis prediction. Recently, several studies have proposed prognostic signatures using immune-related genes for LUAD [24-26]. Nevertheless, some concerns hamper the prediction power of these prognostic signatures, such as insufficient sample size, lack of external independent validation or effective validation. In the present study, we developed a novel prognostic signature based on 21 immune-related genes for early-stage LUAD and validated it in two independent cohorts. Our prognostic signature was significantly associated with OS for early-stage LUAD and could further identify the high- and low-risk early-stage LUAD patients with significant differences in OS. Besides, the 21-gene prognostic signature showed a good prediction performance in all enrolled studies including GEO and TCGA-LUAD data sets, suggesting that our signature had a cross-platform compatibility. Multivariate regression analysis revealed that the 21-gene prognostic signature was an independent prognostic factor for all enrolled studies. These results suggest that the 21-gene prognostic signature could effectively predict the overall survival for early-stage LUAD. In the aera of immunotherapy, it may hold great promise to discover prognostic and predictive biomarkers that are related to tumor immune microenvironment, which can be used for identifying novel molecular targets for patients [52]. Our functional enrichment analysis suggests that the genes in our prognostic signature are widely involved in the immune process. Among all the 21 prognostic genes, 14 (e.g., AQP3, BIRC5, C5AR1, HMOX1, IL32, IL6ST, MIF, MMP12, PLAUR, PMAIP1, RAC1, SMAD6, SPHK1, USP7) have been reported to be served as a prognostic biomarker or suggested to be a novel therapeutic targets for lung adenocarcinoma [53-67]. The remaining 7 genes, including ARF6, C7, ELF4, ITPR1, MOV10, PTCH1 and RIPK2, have not been previously reported to be associated with LUAD prognosis and might act as potential biomarker. We were particularly interested in studying ARF6, which was a member of small GTPases ADP-ribosylation factor family, and its downstream effector AMAP1 have been reported overexpressed in several types of cancer and could promote cancer cell proliferation, invasion and migration [68-71]. For example, KRAS and TP53 oncogenes could promote PD-L1 recycling and cell surface expression through ARF6-AMAP1 pathway, which is significantly involved in the immune evasion of pancreatic ductal adenocarcinoma cells [72]. Currently, tumor staging system has been widely used for prognosis prediction and treatment design for LUAD. However, prognosis might vary in patients with same stage owing to the variabilities in clinical behavior caused by genomic changes [73, 74]. Thus, it is critically needed to develop reliable prognostic biomarkers to predict prognosis and help clinical oncologists optimally select early-stage patients who might obtain survival benefit from additional system therapy. In the integrated prediction model analysis, early-stage LUAD patients could be stratified into six different groups by combining our 21-gene prognostic signature with tumor stage. Besides, no statistical significance exists in prognosis between stage IA patients with high-risk and stage II patients with low-risk. These findings may help clinicians identify high-risk patients and make individualized treatment design for these patients. The limitations in our study need to be noted. First, although different cohorts from GEO and TCGA databases have been included in our study to develop and validate the immune-related prognostic signature, the study presents a retrospective design. Future large-scale prospective clinical studies needed to confirm our findings. Second, the data of specific mutations such as EGFR, KRAS and TP53 were only available in GSE31210 and GSE72094 cohort, thus it might be insufficient to assess the 21-gene prognostic signature with the specific mutations. Finally, the biological mechanisms of these prognostic genes in early-stage LUAD and the association of the prognostic signature with several prognostic biomarker such as PD-L1, IL-7R [75], CD8+ [76], are still unknown, Future studies are required to explore and clarify molecular functions of these immune-related prognostic genes during early-stage LUAD progression and the association between these genes with above prognostic biomarkers.

Conclusions

In summary, we developed and validated a promising immune-related prognostic signature comprising of 21 immune-related genes, which could serve as an independent prognostic biomarker for OS prediction in early-stage LUAD. Furthermore, a prediction model by combining our prognostic signature with tumor stage could more accurately evaluate patient’s prognosis. These findings might provide novel therapeutic targets and be used for making individualized management and hold promise for improving survival for patients with early-stage LUAD. Additional file 1: Table S1. The general information of the enrolled datasets. Additional file 2: Table S2. The baseline characteristics of patients included in this study. Additional file 3: Table S3. The list of the immune-related genes downloaded from the InnateDB database. Additional file 4: Table S4. The detailed information of the system, software and packages using in this study. Additional file 5: Table S5. The general information of the overlapped prognostic genes and corresponding coefficients. Additional file 6: Table S6. The specific risk score formula using in this study. Additional file 7: Table S7. The cutoff values for each data set. Additional file 8: Figure S1. The distribution of the risk scores, survival status and gene expression levels in the enrolled data set. Additional file 9: Table S8. The data for Kaplan-Meier survival analysis and ROC analysis.
  72 in total

1.  A practical molecular assay to predict survival in resected non-squamous, non-small-cell lung cancer: development and international validation studies.

Authors:  Johannes R Kratz; Jianxing He; Stephen K Van Den Eeden; Zhi-Hua Zhu; Wen Gao; Patrick T Pham; Michael S Mulvihill; Fatemeh Ziaei; Huanrong Zhang; Bo Su; Xiuyi Zhi; Charles P Quesenberry; Laurel A Habel; Qiuhua Deng; Zongfei Wang; Jiangfen Zhou; Huiling Li; Mei-Chun Huang; Che-Chung Yeh; Mark R Segal; M Roshni Ray; Kirk D Jones; Dan J Raz; Zhidong Xu; Thierry M Jahan; David Berryman; Biao He; Michael J Mann; David M Jablons
Journal:  Lancet       Date:  2012-01-27       Impact factor: 79.321

2.  An immune-related lncRNA signature for patients with anaplastic gliomas.

Authors:  Wen Wang; Zheng Zhao; Fan Yang; Haoyuan Wang; Fan Wu; Tingyu Liang; Xiaoyan Yan; Jiye Li; Qing Lan; Jiangfei Wang; Jizong Zhao
Journal:  J Neurooncol       Date:  2017-11-23       Impact factor: 4.130

Review 3.  Adjuvant chemotherapy of completely resected early stage non-small cell lung cancer (NSCLC).

Authors:  Ying Liang; Heather A Wakelee
Journal:  Transl Lung Cancer Res       Date:  2013-10

4.  Gefitinib or carboplatin-paclitaxel in pulmonary adenocarcinoma.

Authors:  Tony S Mok; Yi-Long Wu; Sumitra Thongprasert; Chih-Hsin Yang; Da-Tong Chu; Nagahiro Saijo; Patrapim Sunpaweravong; Baohui Han; Benjamin Margono; Yukito Ichinose; Yutaka Nishiwaki; Yuichiro Ohe; Jin-Ji Yang; Busyamas Chewaskulyong; Haiyi Jiang; Emma L Duffield; Claire L Watkins; Alison A Armour; Masahiro Fukuoka
Journal:  N Engl J Med       Date:  2009-08-19       Impact factor: 91.245

5.  Validation of a histology-independent prognostic gene signature for early-stage, non-small-cell lung cancer including stage IA patients.

Authors:  Sandy D Der; Jenna Sykes; Melania Pintilie; Chang-Qi Zhu; Dan Strumpf; Ni Liu; Igor Jurisica; Frances A Shepherd; Ming-Sound Tsao
Journal:  J Thorac Oncol       Date:  2014-01       Impact factor: 15.609

6.  Epidermal growth factor receptor tyrosine kinase defines critical prognostic genes of stage I lung adenocarcinoma.

Authors:  Mai Yamauchi; Rui Yamaguchi; Asuka Nakata; Takashi Kohno; Masao Nagasaki; Teppei Shimamura; Seiya Imoto; Ayumu Saito; Kazuko Ueno; Yousuke Hatanaka; Ryo Yoshida; Tomoyuki Higuchi; Masaharu Nomura; David G Beer; Jun Yokota; Satoru Miyano; Noriko Gotoh
Journal:  PLoS One       Date:  2012-09-19       Impact factor: 3.240

7.  Lysophosphatidic acid activates Arf6 to promote the mesenchymal malignancy of renal cancer.

Authors:  Shigeru Hashimoto; Shuji Mikami; Hirokazu Sugino; Ayumu Yoshikawa; Ari Hashimoto; Yasuhito Onodera; Shotaro Furukawa; Haruka Handa; Tsukasa Oikawa; Yasunori Okada; Mototsugu Oya; Hisataka Sabe
Journal:  Nat Commun       Date:  2016-02-08       Impact factor: 14.919

8.  TFAP2C increases cell proliferation by downregulating GADD45B and PMAIP1 in non-small cell lung cancer cells.

Authors:  Hyunhee Do; Dain Kim; JiHoon Kang; Beomseok Son; Danbi Seo; HyeSook Youn; BuHyun Youn; Wanyeon Kim
Journal:  Biol Res       Date:  2019-07-11       Impact factor: 5.612

9.  Systematic construction and validation of an immune prognostic model for lung adenocarcinoma.

Authors:  Chenghan Luo; Mengyuan Lei; Yixia Zhang; Qian Zhang; Lifeng Li; Jingyao Lian; Shasha Liu; Liping Wang; Guofu Pi; Yi Zhang
Journal:  J Cell Mol Med       Date:  2019-11-28       Impact factor: 5.310

10.  An eight-miRNA signature as a potential biomarker for predicting survival in lung adenocarcinoma.

Authors:  Xuelian Li; Yunrui Shi; Zhihua Yin; Xiaoxia Xue; Baosen Zhou
Journal:  J Transl Med       Date:  2014-06-04       Impact factor: 5.531

View more
  9 in total

1.  The DDR-related gene signature with cell cycle checkpoint function predicts prognosis, immune activity, and chemoradiotherapy response in lung adenocarcinoma.

Authors:  Quan Li; Pan Zhang; Huixiao Hu; Hang Huang; Dong Pan; Guangyun Mao; Burong Hu
Journal:  Respir Res       Date:  2022-07-15

2.  Identification of a methylomics-associated nomogram for predicting overall survival of stage I-II lung adenocarcinoma.

Authors:  Heng Wang; Chuangye Wei; Peng Pan; Fengfeng Yuan; Jiancheng Cheng
Journal:  Sci Rep       Date:  2021-05-11       Impact factor: 4.379

3.  Immune landscape and a promising immune prognostic model associated with TP53 in early-stage lung adenocarcinoma.

Authors:  Chengde Wu; Xiang Rao; Wei Lin
Journal:  Cancer Med       Date:  2020-12-12       Impact factor: 4.452

4.  High DSCC1 Level Predicts Poor Prognosis of Lung Adenocarcinoma.

Authors:  Sisi Chang; Yahui Zhu; Yutan Xi; Fuyan Gao; Juanjuan Lu; Liang Dong; Chunzheng Ma; Honglin Li
Journal:  Int J Gen Med       Date:  2021-10-20

5.  Characterization of the Different Subtypes of Immune Cell Infiltration to Aid Immunotherapy.

Authors:  Zhenqing Li; Kai Mao; Bo Ding; Qun Xue
Journal:  Front Cell Dev Biol       Date:  2022-03-15

6.  Necroptosis Identifies Novel Molecular Phenotypes and Influences Tumor Immune Microenvironment of Lung Adenocarcinoma.

Authors:  Chen Zhao; Kewei Xiong; Abdalla Adam; Zhiqiang Ji; Xiangpan Li
Journal:  Front Immunol       Date:  2022-07-14       Impact factor: 8.786

7.  Identification of metabolic genes for the prediction of prognosis and tumor microenvironment infiltration in early-stage non-small cell lung cancer.

Authors:  Jing Li; Yun Guan; Rongrong Zhu; Yang Wang; Huaguang Zhu; Xin Wang
Journal:  Open Life Sci       Date:  2022-08-11       Impact factor: 1.311

8.  Prognosis and recurrence patterns in patients with early stage lung cancer: a multi-state model approach.

Authors:  Won Gi Jeong; Hyemi Choi; Kum Ju Chae; Jinheum Kim
Journal:  Transl Lung Cancer Res       Date:  2022-07

9.  Assessing the Prognostic Capability of Immune-Related Gene Scoring Systems in Lung Adenocarcinoma.

Authors:  Wenhao Liu; Ruihong Dong; Shuai Gao; Xiaodi Shan; Mian Li; Zhaoyan Yu; Liang Sun
Journal:  J Oncol       Date:  2022-07-31       Impact factor: 4.501

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.