Literature DB >> 32175651

Mast cell-based molecular subtypes and signature associated with clinical outcome in early-stage lung adenocarcinoma.

Xuanwen Bao1,2, Run Shi3, Tianyu Zhao4,5,6,7, Yanfang Wang8.   

Abstract

Mast cells are a major component of the immune microenvironment in tumour tissues and modulate tumour progression by releasing pro-tumorigenic and antitumorigenic molecules. Regarding the impact of mast cells on the outcomes of patients with lung adenocarcinoma (LUAD) patient, several published studies have shown contradictory results. Here, we aimed at elucidating the role of mast cells in early-stage LUAD. We found that high mast cell abundance was correlated with prolonged survival in early-stage LUAD patients. The mast cell-related gene signature and gene mutation data sets were used to stratify early-stage LUAD patients into two molecular subtypes (subtype 1 and subtype 2). The neural network-based framework constructed with the mast cell-related signature showed high accuracy in predicting response to immunotherapy. Importantly, the prognostic mast cell-related signature predicted the survival probability and the potential relationship between TP53 mutation, c-MYC activation and mast cell activities. The meta-analysis confirmed the prognostic value of the mast cell-related gene signature. In summary, this study might improve our understanding of the role of mast cells in early-stage LUAD and aid in the development of immunotherapy and personalized treatments for early-stage LUAD patients.
© 2020 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.

Entities:  

Keywords:  early-stage lung adenocarcinoma; immunotherapy; mast cell; prognosis

Mesh:

Substances:

Year:  2020        PMID: 32175651      PMCID: PMC7191192          DOI: 10.1002/1878-0261.12670

Source DB:  PubMed          Journal:  Mol Oncol        ISSN: 1574-7891            Impact factor:   6.603


differently expressed gene Gene Expression Omnibus gene ontology gene set enrichment analysis hazard ratio lung adenocarcinoma principal component analysis single‐sample gene set enrichment analysis The Cancer Genome Atlas weighted correlation network analysis

Introduction

Lung adenocarcinoma (LUAD) is one of the most complex and heterogeneous malignancies, both in molecular and phenotypic terms (Li et al., 2016; Mao et al., 2016; Wang et al., 2019a). The incidence of LUAD has been increasing in recent years. The treatment for early‐stage LUAD includes operation, chemotherapy and radiotherapy (Besse et al., 2015). Additionally, immunotherapy serves a promising therapeutic strategy in many cancer types (Bao et al., 2019b; Couzin‐Frankel, 2013; Purwar et al., 2012; Schumacher and Schreiber, 2015; Wasiuk et al., 2012) as well. However, there is still a long way to go for immunotherapy in LUAD. A clear understanding of the tumour immune microenvironment may aid in the development of immunotherapy for LUAD patients. Mast cell is widely distributed in different tissues and is a major component of the immune microenvironment in tumour tissues. Mast cells modulate tumour initiation and progression through the secretion of pro‐tumorigenic and antitumorigenic molecules (Varricchi et al., 2017). The controversial roles of mast cells result in conflicting effects among different tumour types (Alì et al., 2009; Carlini et al., 2010; Gounaris et al., 2007; Jeong et al., 2013; Nordlund and Askenase, 1983; Sinnamon et al., 2008; Welsh et al., 2005; Yang et al., 2011). Regarding the impact of mast cells on LUAD patient clinical outcomes, several contradictory results have been published (Carlini et al., 2010; Imada et al., 2000; Kurebayashi et al., 2016; Li et al., 2018; Nagata et al., 2003; Takanami et al., 2000). Although the results of these studies are quite different with respect to the prognostic values of mast cells, previous studies have shown that mast cell infiltration is more intensive in well‐differentiated tumours and low‐grade histologic subtypes than in poorly differentiated and high‐grade subtypes (Carlini et al., 2010; Nagata et al., 2003). Thus, in this study, we focused on the effect of mast cells in early‐stage LUAD patients. We analysed the potential role of mast cell, mast cell‐related genes and immunotherapy outcomes in early‐stage LUAD using bioinformatics models and machine learning methods.

Method

Data processing

The Cancer Genome Atlas (TCGA) transcriptome data, mutation data and clinical information were downloaded via the UCSC Xena Browser (https://xenabrowser.net/). http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE11969, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13213, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29013, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE30219, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31210, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37745, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE42127, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE50081 and http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE72094 were downloaded from the Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/geo/). The detailed TCGA clinical information is summarized in Table 1 and Appendix S1.
Table 1

Patient information.

VariableNumber
Gender (female/male)166/142
TNM stage (stage I/stage II)212/96
Lymph node metastasis (positive/negative)61/247
Age (> 60/≤ 60/missing)221/78/9
KRAS status (WT/MUT)210/98
EGFR status (WT/MUT)271/37
Smoking (no/yes/missing)15/93/200
Patient information.

Estimation of the abundance of immune cell populations and implementation of weighted correlation network analysis

Transcriptome file of TCGA early‐stage LUAD was applied on xCell to estimate the abundance of different immune cell populations (Aran et al., 2017; Newman et al., 2015). Weighted correlation network analysis (WGCNA) was accomplished with the r package ‘WGCNA’ (Bao et al., 2019a; Langfelder and Horvath, 2008; Wang et al., 2019b). The expression profile of immune‐related gene (from https://www.innatedb.com/redirect.do?go=resourcesGeneLists) was applied as the input of WGCNA. Gene significance quantified the association of individual genes with mast cell density, and module membership represented the correlation between module eigengenes and gene expression profiles. A power of β = 3 and a scale‐free R 2 = 0.95 were set as soft‐threshold parameters to ensure a signed scale‐free co‐expression gene network. A total of six nongrey modules were generated. Among these modules, the yellow module depicting the highest correlation (r = 0.92, P = 4.2e‐115) was considered the most correlated with mast cell density. Survival analysis was performed using the r package ‘survival’. Cox regression analysis was used to determine the hazard ratio (HR). All genes in the yellow module were subjected to univariate Cox regression. The 110 genes that significantly associated with the survival of early‐stage LUAD patients in the yellow module were identified as the mast cell‐related gene signature. These identified genes were applied on gene ontology (GO) analysis with the ‘clusterProfiler’ package (Yu et al., 2012) to elucidate the potential mechanism behind the gene signature. r software (version: 3.5.3) was used for all the analyses in the manuscript.

Molecular subtype identification

The r package ‘CancerSubtypes’ was applied to perform molecular subtype identification (Xu et al., 2017). Transcriptome profile and gene mutation data sets were used to perform cancer subtype analysis. The default parameters were used to perform the classification. The cluster number was selected as 2. Gene set enrichment analysis (GSEA) was performed with GSEA software from Broad Institute.

Differently expressed gene analysis

The differently expressed gene (DEG) analysis was performed with ‘Limma’ package (Smyth, 2005). An empirical Bayesian method was applied to estimate the fold change between the molecular subtype 1 and 2 using moderated t‐tests. The adjusted P‐value for multiple testing was calculated using the Benjamini–Hochberg correction. The genes with an adjusted P‐value < 0.05 and absolute log2 (log to base two of) fold change > 1.5 were identified as DEGs between two molecular subtypes. GO analysis was performed based on the significant genes.

Prognostic gene signature‐based risk score and ssGSEA implementation

The genes in the WGCNA yellow module were analysed with univariate Cox regression analysis. The comprehensive mast cell‐related signature was calculated by principal component analysis (PCA). The PCA‐based risk score MastCellpca was derived from the first principal component of the 110 genes from mast cell‐related gene signature. Let E represent the log2(RSEM + 1) value of the key gene i in tumour sample j, and C represents the corresponding coefficient of the mast cell‐related genes. The risk score MastCellpca was calculated as follows:

ssGSEA implementation and clinical response prediction

The enrichment scores of the hallmark genes were evaluated using single‐sample GSEA (ssGSEA) with r package ‘GSVA’ (Hänzelmann et al., 2013). The hallmark gene sets were obtained from MSigDB. Spearman's coefficient analysis was performed to analyse the correlation between prognostic gene signature‐based risk score and each hallmark. The Tumor Immune Dysfunction and Exclusion algorithm was used to predict the clinical response to immune checkpoint blockade (Jiang et al., 2018).

Neural network construction

PyTorch was employed to construct the neural network to predict the immunotherapy response by the mast cell‐related gene signature in python (Version: 3.5) (Paszke et al., 2017). Stochastic gradient descent method and learning rate 0.001 were chosen for the optimizer of the model. Five layers were built with different input and output numbers. Batch normalization was performed in each layer. Dropout function (dropout rate: 0.2) was used in the training process but not in the testing process. Relu function was applied as the activate function. A logistic sigmoid function was used in the output layer. The Python script is provided in Appendix S2.

Random forest algorithm for feature importance ranking

A random forest algorithm was applied to find the most critical mutations associated with the mast cell signature‐based risk score. Briefly, the gene mutation data set (Appendix S3) and mast cell signature‐based risk score were applied to find the most important gene mutations associated with the mast cell signature‐based risk score. First, the ‘ranger’ package was used to find the best hyperparameter in the regression process (Wright and Ziegler, 2015). Then, the ‘randomforest’ package was applied for the construction of the regression model (Liaw and Wiener, 2002). The r code for the analysis in the manuscript is provided in Appendix S4.

Results

High mast cell abundance in early‐stage LUAD benefits the survival of patients

The workflow of the manuscript is shown in Fig. 1A. To illustrate the correlation between mast cells and survival in early‐stage LUAD patients, we first analysed the abundance of immune cell populations in early‐stage LUAD tumour samples. We identified twenty‐two immune cell populations, and the correlations between these populations are shown in Fig. 1B. We found that high mast cell abundance benefited the survival of early‐stage LUAD patients in the TCGA cohorts (Fig. 1C). To further confirm the association between mast cells and the survival of early‐stage LUAD patients, we estimated the abundance of mast cells in two external cohorts (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31210 and http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE50081). The results showed that high mast cell abundance is associated with prolonged survival of early‐stage LUAD patients, as we observed in the TCGA early‐stage LUAD cohort (Fig. 1D,E).
Fig. 1

The association between mast cell abundance and clinical outcomes in early‐stage LUAD patients. (A) Schematic diagram of the study design. (B) The correlation among immune cell populations. (C–E) Kaplan–Meier curves for the OS of early‐stage LUAD patients showed that the patients with high mast cell abundance had a favourable outcome compared with the patients with low mast cell abundance in the TCGA, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31210 and http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE50081 cohorts.

The association between mast cell abundance and clinical outcomes in early‐stage LUAD patients. (A) Schematic diagram of the study design. (B) The correlation among immune cell populations. (C–E) Kaplan–Meier curves for the OS of early‐stage LUAD patients showed that the patients with high mast cell abundance had a favourable outcome compared with the patients with low mast cell abundance in the TCGA, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31210 and http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE50081 cohorts.

Identification of a gene signature associated with mast cells

The immune‐related genes were determined with WGCNA. Genes were clustered into seven modules (Fig. 2A). The correlation between the modules and mast cell abundance was calculated by Pearson's correlation coefficient (Fig. 2B). The yellow module showed the highest correlation coefficient with mast cells (cor: 0.73). The plots of module membership and gene significance illustrated a significant correlation for each gene in the yellow module (cor: 0.92; Fig. 2C). Then, each gene in the yellow module was analysed with a univariate Cox regression analysis. We identified 110 genes that were significantly associated with the survival of early‐stage LUAD patients (Fig. 2D). The heat map shows the expression level of the 110 genes (Fig. 2E). The 110 genes were defined as a mast cell‐related gene signature (Appendix S5) in early‐stage LUAD patients. GO analysis revealed that cellular metabolic pathways, WNT signalling, antigen processing and presentation, and other enriched pathways were associated with the mast cell‐related key genes (Fig. 2F).
Fig. 2

Mast cell‐related gene signature identification. (A) WGCNA was performed to identify seven modules by unsupervised clustering. (B) A total of six modules (nongrey) were identified. The yellow module had the highest correlation (r = 0.73, P = 8e−53) and was considered the most correlated with mast cells. (C) The gene significance and module membership of the genes in the yellow module exhibited a high correlation. (D) A total of 110 mast cell‐related genes were identified among the hub genes extracted from the yellow module. (E) The expression profile of the 110 mast cell‐related genes. (F) GO analysis was performed based on the 110 mast cell‐related genes.

Mast cell‐related gene signature identification. (A) WGCNA was performed to identify seven modules by unsupervised clustering. (B) A total of six modules (nongrey) were identified. The yellow module had the highest correlation (r = 0.73, P = 8e−53) and was considered the most correlated with mast cells. (C) The gene significance and module membership of the genes in the yellow module exhibited a high correlation. (D) A total of 110 mast cell‐related genes were identified among the hub genes extracted from the yellow module. (E) The expression profile of the 110 mast cell‐related genes. (F) GO analysis was performed based on the 110 mast cell‐related genes.

Molecular subtype identification based on the mast cell‐related gene signature in early‐stage LUAD

As we observed, two expression patterns were identified in the expression profiles of mast cell‐related genes from expression heat map of mast cell‐related gene signature. We asked whether the mast cell‐related gene signature could distinguish the molecular subtypes of early‐stage LUAD. Using a combination of gene mutation data sets (genome characteristics) and the expression profiles of mast cell‐related key gene signature (genetic characteristics), we performed molecular subtype identification on early‐stage LUAD patients. Three methods were applied to show the classification effect of the molecular subtypes: (a) a clustering heat map was generated to intuitively visualize the effect of sample clustering (Fig. 3A); (b) univariate Cox and Kaplan–Meier analyses were used to evaluate the significance of the difference in survival profiles between subtypes (HR = 0.59; Fig. 3B); and (b) the average silhouette width, a measure of cluster coherence, was calculated to appraise whether samples were more similar within or across subtypes (Fig. 3C). The results above indicated that mast cell‐related key genes could stratify early‐stage LUAD into two molecular subtypes (subtype 1 and subtype 2) with distinct clinical and molecular characteristics. Tumours of molecular subtype 2 had greater average mast cell densities compared with tumours of molecular subtype 1.
Fig. 3

Molecular subtype identification according to the mast cell‐related gene signature. (A) Clustering heat map for intuitively visualizing the effect of sample clustering. (B) Univariate Cox analysis and Kaplan–Meier curves were used to evaluate the survival difference between the two molecular subtypes. (C) Average silhouette width between the two molecular subtypes. (D) The DEGs between two molecular subtypes. (E) GO analysis. (F) Upregulated hallmarks in the GSEA. (G) Downregulated hallmarks in the GSEA. (H) The immune cell population distribution in the subtype 1 and subtype 2. (I) The difference in immune cell population scores and the significances between the subtype 1 and subtype 2.

Molecular subtype identification according to the mast cell‐related gene signature. (A) Clustering heat map for intuitively visualizing the effect of sample clustering. (B) Univariate Cox analysis and Kaplan–Meier curves were used to evaluate the survival difference between the two molecular subtypes. (C) Average silhouette width between the two molecular subtypes. (D) The DEGs between two molecular subtypes. (E) GO analysis. (F) Upregulated hallmarks in the GSEA. (G) Downregulated hallmarks in the GSEA. (H) The immune cell population distribution in the subtype 1 and subtype 2. (I) The difference in immune cell population scores and the significances between the subtype 1 and subtype 2. Differently expressed gene analysis was performed to identify the DEGs between the subtype 1 and subtype 2 molecular subtypes. The heat map shows the expression profile of the DEGs (adjusted P‐value < 0.05 and log2 (FC) > 1.5; Fig. 3D). Then, the DEGs were subjected to GO analysis (Fig. 3E). The results revealed enrichments in cell cycle‐related terms. GSEA was performed on the subtype 1 and subtype 2 of early‐stage LUAD. Upregulated pathways included pathways related to coagulation, inflammatory response and myogenesis in the subtype 1 (Fig. 3F). Downregulated pathways included pathways related to E2F targets, G2M checkpoints and MYC targets in the subtype 1 (Fig. 3G). The immune cell population distribution in the subtype 1 and subtype 2 further illustrated the different tumour immune microenvironments in the two molecular subtypes of early‐stage LUAD (Fig. 3H). Among all immune cell populations, mast cells showed the most significant difference between the subtype 1 and subtype 2 (Fig. 3I).

Neural network‐based model to identifying immunotherapy treatment outcomes

To further utilize the mast cell‐related gene signature we identified, we built a neural network‐based framework to predict which patient would respond to immunotherapy according to mast cell‐related key genes. The detailed code is provided in Appendix S2. Figure 4A illustrates a diagram of the neural network. Briefly, the early‐stage LUAD data set was divided into training and testing data sets. We constructed the neural network with the mast cell‐related gene signature by the training data set. The test data set was applied to evaluate the accuracy of the neural network. With the increased epoch number for training, the loss value of the model in the testing set decreased (Fig. 4B). The confusion matrix showed only one sample was recognized wrongly in the testing set (Fig. 4C). The receiver operating characteristic (ROC) curve illustrated a high accuracy rate with the area under the curve reaching 98.7% (Fig. 4D).
Fig. 4

Neural network‐based framework construction with the mast cell‐related gene signature. (A) Schematic diagram of the neural network. (B) The loss value in each epoch during training process in the validation cohort. (C) The confusion matrix in the testing cohort validated the accuracy of the network's prediction capacity. (D) The ROC plot in the testing data set validated the accuracy of the network's prediction capacity.

Neural network‐based framework construction with the mast cell‐related gene signature. (A) Schematic diagram of the neural network. (B) The loss value in each epoch during training process in the validation cohort. (C) The confusion matrix in the testing cohort validated the accuracy of the network's prediction capacity. (D) The ROC plot in the testing data set validated the accuracy of the network's prediction capacity.

Mast cell‐related signature predicts the prognosis and clinical outcome of early‐stage LUAD patients

The mast cell‐related gene signature was employed to calculate a prognostic risk score. The risk score MastCellpca was calculated for each patient using the PCA method. Figure 5A shows the first principal component (PCA1) score for each key mast cell‐related gene. MastCellpca was calculated with the expression level of each gene and the PCA1 score. The results showed a highly negative correlation between MastCellpca and mast cell abundance, which further confirmed the correlation between the mast cell‐related key genes and mast cells (Fig. 5B). The Kaplan–Meier plot revealed that patients with a low‐risk score had a better prognosis than patients with a high‐risk score (Fig. 5C). ssGSEA results revealed a high association of DNA repair and the c‐MYC pathway with the mast cell‐related risk score (Fig. 5D). TP53 mutations can regulate the activation of c‐MYC pathway (Frazier et al., 1998). Due to the high correlation between the c‐MYC pathway and the mast cell‐related risk score, we selected TP53 mutation as an example given its role in regulating the activation of c‐MYC pathway (Frazier et al., 1998) and analysed the mast cell‐related risk score in TP53‐mutated and wild‐type patients (Fig. 5E,F). The results showed a high‐risk score in the TP53‐mutated patients.
Fig. 5

Mast cell‐related signature‐based risk score calculation and the potential mechanism underlying the mast cells in early‐stage LUAD. (A) PCA of the key mast cell‐related genes. (B) The correlation between the prognostic signature‐based risk score and the mast cell ssGSEA score in early‐stage LUAD patients. (C) Univariate Cox analysis and Kaplan–Meier curves showed prolonged survival in patients with low‐risk scores compared with patients with high‐risk scores. (D) The correlation between the ssGSEA score of each hallmark gene and the risk score. (E) The correlation between the risk score and ssGSEA score in early‐stage LUAD patients. (F) The risk score distribution in patients with wild‐type or mutated TP53. P‐value was calculated with Mann–Whitney U‐test. (G) A Sankey plot was used to reveal the correlation between mast cell scores, prognostic signature‐based risk scores, immunotherapy response and clinical outcome. (H, I) Patients who received adjuvant therapies, including chemo(radio)therapy and targeted therapy, with low‐risk scores, exhibited prolonged overall survival.

Mast cell‐related signature‐based risk score calculation and the potential mechanism underlying the mast cells in early‐stage LUAD. (A) PCA of the key mast cell‐related genes. (B) The correlation between the prognostic signature‐based risk score and the mast cell ssGSEA score in early‐stage LUAD patients. (C) Univariate Cox analysis and Kaplan–Meier curves showed prolonged survival in patients with low‐risk scores compared with patients with high‐risk scores. (D) The correlation between the ssGSEA score of each hallmark gene and the risk score. (E) The correlation between the risk score and ssGSEA score in early‐stage LUAD patients. (F) The risk score distribution in patients with wild‐type or mutated TP53. P‐value was calculated with Mann–Whitney U‐test. (G) A Sankey plot was used to reveal the correlation between mast cell scores, prognostic signature‐based risk scores, immunotherapy response and clinical outcome. (H, I) Patients who received adjuvant therapies, including chemo(radio)therapy and targeted therapy, with low‐risk scores, exhibited prolonged overall survival. Furthermore, patients with high mast cell abundance had a low mast cell‐related risk score and responded to immunotherapy (Fig. 5G). In patients who received chemo(radio)therapy and molecular therapy, the patients with low mast cell‐related risk scores had better survival outcomes than those with high mast cell‐related risk scores (Fig. 5H,I).

The association between mast cell‐related signature and gene mutation in early‐stage LUAD

The random forest algorithm was employed to determine the importance of gene mutations associated mast cell‐related risk score (Fig. 6). The results revealed that TP53 and CSMD3 were the most important gene mutations associated with the mast cell‐related risk score. The patients with TP53 mutations had significantly higher mast cell‐related risk scores than the patients without TP53 mutations.
Fig. 6

Association of the immune signature with early‐stage LUAD gene mutations. The distribution of gene mutations correlated with the prognostic signature‐based risk score. TP53 was the most important mutation according to the importance of ranking.

Association of the immune signature with early‐stage LUAD gene mutations. The distribution of gene mutations correlated with the prognostic signature‐based risk score. TP53 was the most important mutation according to the importance of ranking.

External validation and meta‐analysis

Nine external cohorts were used to confirm the association between the mast cell‐related gene signature and survival outcomes in early‐stage LUAD patients. The detailed information for each cohort is shown in the Kaplan–Meier plot (Fig. 7A). A meta‐analysis was performed with a random‐effects model, and the results showed that patients with a high mast cell‐related risk score had poor survival outcomes in the overall data set (HR = 3.79; Fig. 7B).
Fig. 7

Meta‐analysis and external validation of the prognostic value of the mast cell‐related signature. (A) Detailed information for the nine external validation cohorts. (B) A meta‐analysis revealed the overall prognostic value of the mast cell‐related signature.

Meta‐analysis and external validation of the prognostic value of the mast cell‐related signature. (A) Detailed information for the nine external validation cohorts. (B) A meta‐analysis revealed the overall prognostic value of the mast cell‐related signature.

Discussion

Previous studies have investigated the relationship between immune cell populations and the clinical outcomes of cancer patients (Bao et al., 2019b; Bindea et al., 2013; Chung et al., 2017; Homma et al., 2014). The heterogeneity of immune cell populations in different cancer types leads to a complicated immune network in the tumour microenvironment and differentially influences tumour initiation and progression. As a major component of the immune microenvironment in tumour tissues, mast cells may play a pro‐tumorigenic or antitumorigenic role by releasing different mediators (Varricchi et al., 2017). For instance, angiogenic and lymphangiogenic factors secreted by mast cells promote tumour angiogenesis and lymphangiogenesis (Detoraki et al., 2010; Detoraki et al., 2009; Theoharides et al., 2010). Several matrix metalloproteinases released by mast cells regulate the digestion of tumour extracellular matrix and favour the distant metastasis of cancer cells (Baram et al., 2001). Specifically, the activation of MYC triggers rapid recruitment of mast cells to the tumour site to promote tumour expansion in pancreatic cancer. MYC directly commandeers and instructs tissue remodelling, angiogenesis and inflammation by activation of mast cells (Soucek et al., 2007). Mast cells release tryptase AB1 and interleukin‐1β, which in turn induced pleural vasculature leakiness and triggered NF‐κB activation in pleural tumour cells, thereby fostering pleural fluid accumulation and tumour growth (Giannou et al., 2015). In contrast, mast cells can exhibit antitumour activity directly through tumour cell cytotoxicity mediated by TNF‐α and ROS or indirectly through the release of interleukin‐9 and heparin and the stimulation of dendritic cell maturation (Varricchi et al., 2017). The complicated roles of mast cells allow them to play different functions in different cancer types and stages. Regarding the impact of mast cells on LUAD patient outcomes, several contradictory studies have been published. One study revealed that mast cells correlated with angiogenesis and poor outcome in stage I LUAD (Baram et al., 2001). Another study has revealed KIT‐competent mast cells fuel KRAS‐mutant LUAD formation, growth and metastasis by providing interleukin‐1β and are associated with LUAD progression (Lilis et al., 2019). However, one research indicated that only mast cells were found by univariate analysis to be associated with better prognosis in LUAD (Kurebayashi et al., 2016). Although the results of these studies are quite different with respect to the prognostic value of mast cells, previous studies have shown that mast cell infiltration is more intensive in low‐grade histologic subtypes than in high‐grade subtypes (Carlini et al., 2010). Understanding the potential mechanism and roles of mast cells in early‐stage LUAD may be helpful for the development of immunotherapy. Thus, in this study, we analysed the potential role and mast cell‐related genes in early‐stage LUAD. The abundance of mast cells was estimated in several cohorts. Cox regression was performed to identify the prognostic value of mast cells in early‐stage LUAD. WGCNA was employed to identify the mast cell‐related gene signature. Molecular subtypes (subtype 1 and subtype 2) were identified according to the mast cell‐related gene signature in a mutation data set of early‐stage LUAD. A neural network‐based framework was constructed to predict the immunotherapy outcome of early‐stage LUAD patients according to the mast cell‐related gene signature. A mast cell‐related risk score MastCellpca was calculated by the expression levels of mast cell‐related gene signature using the PCA method. ssGSEA was performed to identify the potential molecular mechanism associated with the mast cell‐related prognostic signature. The association between gene mutations and the risk scores was identified by a random forest algorithm. A meta‐analysis was performed to validate the mast cell‐related signature in external cohorts. In our analysis, we revealed that a high abundance of mast cells was associated with prolonged survival in early‐stage LUAD patients. Two external cohorts confirmed this conclusion. The differences and controversial conclusions in different studies (Baram et al., 2001; Kurebayashi et al., 2016; Lilis et al., 2019) may be due to the mixture of activated and resting mast cells. The function of activated mast cells may be masked by the resting mast cells. Therefore, it is essential to analyse the activated and resting mast cells separately. In an alternative way, we employed the following workflow to identify the potential mechanisms and genes associated with mast cells. First, the immune‐related genes were clustered into several modules by unsupervised clustering. The yellow module was identified as the most important module correlated with mast cells according to Pearson's correlation coefficient. The mast cell‐related gene signature was obtained from the yellow module. The genes in the mast cell‐related gene signature were highly associated with the mast cell density in early‐stage LUAD. According to the mast cell‐related gene signature and genome characteristics, the early‐stage LUAD tissues were stratified into two molecular subtypes (subtype 1 and subtype 2). Interestingly, the GO analysis and GSEA both indicated enrichments in cell cycle and c‐MYC‐related pathways in the subtype 2. Thus, we concluded the potential involvement of mast cells in the c‐MYC pathway in early‐stage LUAD. One previous study has demonstrated the important roles of mast cells in MYC activation and the potential tumour expansion promoted by mast cells in pancreatic cancer. MYC is a highly pleiotropic transcription factor whose aberrant activation links tightly with tumour progression, including both cell‐intrinsic proliferation and extracellular microenvironment alterations such as tissue remodelling, angiogenesis and invasion (Gabay et al., 2014). Aberrant MYC activities induce the dysregulated expression of a chemokine‐encoding gene cluster, therefore chemoattracting mast cells into the islets of pancreatic cancer (Soucek et al., 2007). In consistent with the pancreatic cancer study, the transcriptomic and downstream analyses underscore the importance of mast cell in MYC activation in early‐stage LUAD. Moreover, the differences in mast cells between the two subtypes were the most significant of all the immune cell populations studied, which confirmed the relationship between the mast cell‐related gene signature and mast cell abundance. The mast cell‐related gene signature may represent targets for further study to aid in the understanding of the mechanism of mast cells in early‐stage LUAD. To further utilize the mast cell‐related gene signature, we built a neural network‐based framework to predict response to immunotherapy. The confusion matrix and ROC plot confirmed the accuracy of the network's prediction capability. Hence, we were able to apply the expression profile of the mast cell‐related genes to predict the response to immunotherapy using the neural network framework. In the next step, we calculated the risk score MastCellpca according to the expression level of the gene signature for each patient. ssGSEA revealed a significant correlation between DNA repair, the c‐MYC pathway and the signature‐based risk score. The ssGSEA results further confirmed the results from the canonical GSEA of the molecular subtypes. c‐MYC stimulates the expression of target genes that play important roles in cell proliferation, growth arrest and apoptosis in lung cancer cells (Dang et al., 2006; Tong et al., 2004). Additionally, we further identified TP53 as the most critical mutation associated with the mast cell‐related signature. Dysregulation of the c‐MYC pathway induces the expression of endogenous TP53. As a cellular gatekeeper, TP53 plays crucial role in cell cycle arrest and apoptosis (Mogi and Kuwano, 2011). The close link between the mast cell‐related signature, the c‐MYC pathway and TP53 mutation in our analysis may highlight the roles of mast cells in early‐stage LUAD. However, as we suggested, analysing the activated and resting mast cells separately would be a promising way for understanding the molecular mechanism of mast cells in early‐stage LUAD. The mast cell‐related gene signature we obtained may therefore prove useful information for further study in this field. The mast cell‐related signature also served as a promising marker to predict the survival of early‐stage LUAD patients. We performed a meta‐analysis by combining nine cohorts. The results revealed in both each cohort and the meta‐analysis that the mast cell‐related signature stratified the survival of patients with high and low signature‐based risk scores. The results above also confirmed the pivotal roles of mast cells in early‐stage LUAD. A problem with the mast cell‐related signature of early‐stage LUAD as shown is that only in silico analysis is performed. Experimental studies are required to further elucidate the biological functions underlying the mast cell‐related signature in early‐stage LUAD. Large, well‐designed prospective population‐based studies should be conducted to investigate the complex role of mass cell and testify our results on mast cell‐related signature.

Conclusion

In this study, we depicted the correlation between mast cell populations and prognosis in early‐stage LUAD patients. A mast cell‐related gene signature was identified. A novel molecular subtype classification and a mast cell‐related gene signature‐based neural network were built to help understanding of mast cell activities in early‐stage LUAD and aid in the development of immunotherapy for early‐stage LUAD patients. Potential pathways associated with the mast cell‐related gene signature provide new directions for determining novel mechanisms in mast cells in early‐stage LUAD. The results above may facilitate personalized medicine for early‐stage LUAD patients.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could role as a potential conflict of interest.

Author contributions

XB, YW and RS conceived and designed the experiments. XB and RS performed the analysis. XB and YW wrote the paper. XB, TZ, RS and YW reviewed the draft. All authors read and approved the final manuscript. Appendix S1. Patient clinical information. Click here for additional data file. Appendix S2. The Python script for building the neural network‐based framework. Click here for additional data file. Appendix S3. The gene mutation data set. Click here for additional data file. Appendix S4. The r script. Click here for additional data file. Appendix S5. The gene list of the mast cell‐related gene signature. Click here for additional data file.
  46 in total

1.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

2.  Breakthrough of the year 2013. Cancer immunotherapy.

Authors:  Jennifer Couzin-Frankel
Journal:  Science       Date:  2013-12-20       Impact factor: 47.728

3.  Human mast cells release metalloproteinase-9 on contact with activated T cells: juxtacrine regulation by TNF-alpha.

Authors:  D Baram; G G Vaday; P Salamon; I Drucker; R Hershkoviz; Y A Mekori
Journal:  J Immunol       Date:  2001-10-01       Impact factor: 5.422

4.  Mast cells correlate with angiogenesis and poor outcome in stage I lung adenocarcinoma.

Authors:  A Imada; N Shijubo; H Kojima; S Abe
Journal:  Eur Respir J       Date:  2000-06       Impact factor: 16.671

5.  Vascular endothelial growth factors synthesized by human lung mast cells exert angiogenic effects.

Authors:  Aikaterini Detoraki; Rosaria I Staiano; Francescopaolo Granata; Giorgio Giannattasio; Nella Prevete; Amato de Paulis; Domenico Ribatti; Arturo Genovese; Massimo Triggiani; Gianni Marone
Journal:  J Allergy Clin Immunol       Date:  2009-03-10       Impact factor: 10.793

Review 6.  MYC activation is a hallmark of cancer initiation and maintenance.

Authors:  Meital Gabay; Yulin Li; Dean W Felsher
Journal:  Cold Spring Harb Perspect Med       Date:  2014-06-02       Impact factor: 6.915

7.  Mast cells impair the development of protective anti-tumor immunity.

Authors:  Anna Wasiuk; Dyana K Dalton; William L Schpero; Radu V Stan; Jose R Conejo-Garcia; Randolph J Noelle
Journal:  Cancer Immunol Immunother       Date:  2012-06-10       Impact factor: 6.968

8.  Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response.

Authors:  Peng Jiang; Shengqing Gu; Deng Pan; Jingxin Fu; Avinash Sahu; Xihao Hu; Ziyi Li; Nicole Traugh; Xia Bu; Bo Li; Jun Liu; Gordon J Freeman; Myles A Brown; Kai W Wucherpfennig; X Shirley Liu
Journal:  Nat Med       Date:  2018-08-20       Impact factor: 53.440

9.  CancerSubtypes: an R/Bioconductor package for molecular cancer subtype identification, validation and visualization.

Authors:  Taosheng Xu; Thuc Duy Le; Lin Liu; Ning Su; Rujing Wang; Bingyu Sun; Antonio Colaprico; Gianluca Bontempi; Jiuyong Li
Journal:  Bioinformatics       Date:  2017-10-01       Impact factor: 6.937

10.  Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer.

Authors:  Woosung Chung; Hye Hyeon Eum; Hae-Ock Lee; Kyung-Min Lee; Han-Byoel Lee; Kyu-Tae Kim; Han Suk Ryu; Sangmin Kim; Jeong Eon Lee; Yeon Hee Park; Zhengyan Kan; Wonshik Han; Woong-Yang Park
Journal:  Nat Commun       Date:  2017-05-05       Impact factor: 14.919

View more
  11 in total

1.  Identification of a Novel Immune Landscape Signature for Predicting Prognosis and Response of Colon Cancer to Immunotherapy.

Authors:  Zheng Wang; Jingru Song; Nisma Lena Bahaji Azami; Mingyu Sun
Journal:  Front Immunol       Date:  2022-04-28       Impact factor: 8.786

2.  Gene Coexpression Network Characterizing Microenvironmental Heterogeneity and Intercellular Communication in Pancreatic Ductal Adenocarcinoma: Implications of Prognostic Significance and Therapeutic Target.

Authors:  Chengsi Wu; Yizhen Liu; Dianhui Wei; Li Tao; Lili Yuan; Tiantian Jing; Boshi Wang
Journal:  Front Oncol       Date:  2022-06-01       Impact factor: 5.738

3.  Immune landscape and a promising immune prognostic model associated with TP53 in early-stage lung adenocarcinoma.

Authors:  Chengde Wu; Xiang Rao; Wei Lin
Journal:  Cancer Med       Date:  2020-12-12       Impact factor: 4.452

4.  Neutrophils correlate with hypoxia microenvironment and promote progression of non-small-cell lung cancer.

Authors:  Chunyan Zhang; Bingxiang Tang; Jianping Hu; Xiang Fang; Hongzhi Bian; Junlei Han; Congxia Hou; Fang Sun
Journal:  Bioengineered       Date:  2021-12       Impact factor: 3.269

5.  5mC regulator-mediated molecular subtypes depict the hallmarks of the tumor microenvironment and guide precision medicine in bladder cancer.

Authors:  Jiao Hu; Belaydi Othmane; Anze Yu; Huihuang Li; Zhiyong Cai; Xu Chen; Wenbiao Ren; Jinbo Chen; Xiongbing Zu
Journal:  BMC Med       Date:  2021-11-26       Impact factor: 8.775

6.  Immune landscape and a novel immunotherapy-related gene signature associated with clinical outcome in early-stage lung adenocarcinoma.

Authors:  Xuanwen Bao; Run Shi; Tianyu Zhao; Yanfang Wang
Journal:  J Mol Med (Berl)       Date:  2020-04-25       Impact factor: 4.599

7.  Development of a novel prognostic score combining clinicopathologic variables, gene expression, and mutation profiles for lung adenocarcinoma.

Authors:  Guofeng Li; Guangsuo Wang; Yanhua Guo; Shixuan Li; Youlong Zhang; Jialu Li; Bin Peng
Journal:  World J Surg Oncol       Date:  2020-09-19       Impact factor: 2.754

8.  Analysis of the molecular nature associated with microsatellite status in colon cancer identifies clinical implications for immunotherapy.

Authors:  Xuanwen Bao; Hangyu Zhang; Wei Wu; Shaobing Cheng; Xiaomeng Dai; Xudong Zhu; Qihan Fu; Zhou Tong; Lulu Liu; Yi Zheng; Peng Zhao; Weijia Fang; Fanglong Liu
Journal:  J Immunother Cancer       Date:  2020-10       Impact factor: 13.751

Review 9.  Relationship between microsatellite status and immune microenvironment of colorectal cancer and its application to diagnosis and treatment.

Authors:  Junge Bai; Hongsheng Chen; Xuefeng Bai
Journal:  J Clin Lab Anal       Date:  2021-05-03       Impact factor: 2.352

10.  Construction and validation of a 15-gene ferroptosis signature in lung adenocarcinoma.

Authors:  Guangxu Tu; Weilin Peng; Qidong Cai; Zhenyu Zhao; Xiong Peng; Boxue He; Pengfei Zhang; Shuai Shi; Yongguang Tao; Xiang Wang
Journal:  PeerJ       Date:  2021-07-07       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.