Lijing Wang1, Hongjun Zhao2, Lemeng Zhang3,4, Hui Luo2, Qiong Chen1, Xiaoxia Zuo2. 1. Department of Gerontology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, P.R. China. 2. Department of Rheumatology and Immunology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, P.R. China. 3. Department of Thoracic Oncology, Hunan Cancer Hospital, Central South University, Changsha, Hunan 410008, P.R. China. 4. Department of Thoracic Oncology, The Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, Hunan 410008, P.R. China.
As the third leading cause of mortality in the United States in 2018, chronic obstructive pulmonary disease (COPD) is a chronic inflammatory lung disease characterized by obstruction of lung airflow (1). In 2017, squamous cell lung cancer, or squamous cell carcinoma of the lung (SCC), which is a type of non-small cell lung cancer (NSCLC), accounted for ~30% of all lung cancer cases worldwide (2). However, the connection between COPD and SCC remains poorly defined (3).Current literature has extended the idea that inflammation is a basic part of tumorigenesis (3). Inflammatory cells, which interact with the tumor environment, foster proliferation, survival and migration of tumor cells (4). Tumor cells use certain signaling molecules of the innate immune system, such as selectins and chemokines, to foster proliferation, migration and metastasis (4). However, little is known about the role of inflammation in lung cancer progression.Systems biology has been applied to biological systems for the investigation of cancer (5). A biological system is any system that applies to organic frameworks (6). Systems biology enables the identification of the molecular and cellular networks involved in disease progression (7–9). The present study generated a genetic interaction network using microarray gene expression data from patients with SCC and COPD, and explored the potential molecular mechanisms involved using a gene regulatory networks approach.
Materials and methods
Microarray dataset
The microarray dataset (GSE12472) (6) used in the current study was downloaded from the Gene Expression Omnibus repository (www.ncbi.nlm.nih.gov/geo). All differently expressed genes associated with SCC with or without COPD were screened from this dataset. In the original study, the authors compared gene expression profiles in bronchial epithelia from patients with SCC with and without COPD (6). Bronchial tissues from the larger airways (bronchus diameter >2 mm, surrounded by cartilage) were resected from 10 individuals without COPD and 18 with COPD. Additionally, centrally located primary tumor tissues were collected from 35 patients with SCC, including 17 patients without COPD and 18 with COPD. The most recent post-bronchodilator spirometric results prior to surgery were used to assess the GOLD stage for each patient. Bronchial tissues from 17 patients without COPD were obtained during thoracotomy for NSCLC. Bronchial tissues from 18 patients with COPD were obtained during thoracotomy for NSCLC (n=11) or lung transplantation (n=7). Total RNA was isolated and purified from the samples, and the Agilent Whole Human Genome Oligo microarray (GPL1708; Agilent Technologies, Inc.) was used to determine gene expression levels.
Identification of differentially expressed genes
The raw microarray data was downloaded directly from GSE12472 as the matrix .txt files. Significance analysis of microarrays (SAM; version 2.11; statweb.stanford.edu/~tibs/SAM) was used to identify upregulated and downregulated genes from the comparison between bronchial epithelia of patients with SCC with and without COPD (7). To identify COPD-associated genes that drive SCC progression, genes identified in both the normal bronchial epithelium and COPD tissue were excluded from the list of genes in SCC with COPD using the Advanced Network Merge function in Cytoscape.
Differently expressed gene visualization
Differently expressed genes were visualized using Cluster and TreeView (version 3.0; bonsai.hgc.jp/~mdehoon/software/cluster/software.htm) as previously described (8).
Gene-gene interaction network construction
The COPD-associated genetic interaction network in SCC was expanded using the BisoGenet plug-in in Cytoscape 3.0 (www.cytoscape.org) as previously reported (8). BisoGenet allows users to customize searches by specifying a target set of genes/proteins to retrieve interaction data, and allows searching for molecular interactions in SysBiomics embedded within Cytoscape (9). SysBiomics integrates data from well-known interaction databases, including the Database of Interacting Proteins (dip.mbi.ucla.edu/dip), BIOGRID (thebiogrid.org), the Human Protein Reference Database (www.hprd.org), the Biomolecular Interaction Network Database (bind.ca), the Molecular INTeraction database (mint.bio.uniroma2.it/mint) and EMBL-EBI INTACT (https://www.ebi.ac.uk/intact/).
Network topological parameters
Topological parameters were calculated using the Network Analysis function of Cytoscape 3.0. Node size and color was mapped based on the degree of connectivity (8).
Survival analysis
The prognostic value of gene signatures in SCC was assessed using PROGgene version 2.0 (watson.compbio.iupui.edu/chirayu/proggene/database/index.php) (10,11). The overall survival rate was compared between patients with high and low mRNA expression of the selected genes using the median gene expression as the cut-off value. For survival analysis, Kaplan-Meier survival curves and the log-rank test were used in this study.
Statistical analysis
For SAM, gene expression was considered significantly different when the false discovery rate (FDR) was ≤5% and the fold change was ≥2.
Results
Differentially expressed genes associated with SCC progression
Gene expression in patients with SCC and normal tissues with COPD was compared. As presented in Fig. 1A, a total of 385 upregulated and 855 downregulated genes were identified by SAM when a FDR ≤5% and fold change ≥2 were employed.
Figure 1.
SAM of patients with SCC patients with or without COPD. First, patients with SCC and the normal tissues with COPD were compared. Subsequently, patients with SCC and the normal tissues without COPD were compared. (A) Representative SAM plot for patients with SCC without COPD. (B) Representative SAM plot for patients with SCC and COPD. Red represents upregulated genes and green represents downregulated genes. Black represents gene with no changes in expression. SAM, significance analysis of microarrays; SCC, squamous cell lung carcinoma; COPD, chronic obstructive pulmonary disease.
Differentially expressed genes associated with COPD-related SCC progression
Patients with SCC and normal tissues without COPD were compared. According to the SAM plot (Fig. 1B), a total of 1,243 differently expressed genes (466 upregulated and 777 downregulated) were identified.
To identify COPD-associated genes that drive SCC progression, genes identified in both the normal bronchial epithelium and COPD tissue were excluded from the list of genes in SCC with COPD. A total of 281 differentially expressed genes for SCC with COPD remained (Fig. 2).
Figure 2.
Hierarchical heatmap clustering visualization of genes specifically expressed in patients with SCC and COPD. (A) Venn diagram of the differentially expressed COPD-associated SCC genes. (B) Heatmap of differentially expressed genes in COPD-associated SCC tissues. Red represents upregulated genes and green represents downregulated genes. SCC, squamous cell lung carcinoma; COPD, chronic obstructive pulmonary disease.
Genetic interaction network of genes in SCC and COPD
A genetic interaction network investigating the interactions between genes in SCC with COPD was constructed and expanded using BisoGenet. The interaction network consisted of 220 nodes and 163 edges (Fig. 3). The clustering coefficient and network diameter were 0.04 and 10, respectively. The node degree distribution across the entire network was calculated (Fig. 4A). The four nodes with the greatest degree of connectivity [Heat shock protein 90 α family class A member 1 (HSP90AA1); adrenoceptor β2 (ADRB2); transducin β like 1 X-linked receptor 1 (TBL1XR1) and heat shock protein family B (small) member 1 (HSPB1)] were identified (Fig. 4B), and were selected for evaluation of their effect on survival.
Figure 3.
Genetic interaction network in chronic obstructive pulmonary disease-related squamous cell lung carcinoma using the BisoGenet plug-in in Cytoscape. Circles represent genes and the connections represent interactions. Colors represent the node degree of connectivity; red represents genes high connectivity, green represents low connectivity. Only the largest clusters were considered as the genetic interaction network in chronic obstructive pulmonary disease-related squamous cell lung carcinoma. The nodes without connections and/or single nodes (bottom) were not included in the network analysis.
Figure 4.
Network parameter calculation and hub identification. (A) Node degree distribution result for the genetic network. (B) Top nodes that may be involved in chronic obstructive pulmonary disease-associated squamous cell lung carcinoma pathogenesis. ADRB2, adrenoceptor β2; TBL1XR1, transducin β like 1 X-linked receptor 1; HSPB1, heat shock protein family B (small) member 1.
Association of HSP90AA1, ADRB2, TBL1XR1 and HSPB1 with the overall survival rate in COPD-related SCC
The association between the expression of the aforementioned genes and overall survival rate was evaluated using PROGgene. Compared with patients with SCC with high gene expression, patients with lower gene expression of ADRB2 (Fig. 5A) and HSP90AA1 (Fig. 5B) had a decreased overall survival rate. However, high expression of HSPB1 (Fig. 5C) and TBL1XR1 (Fig. 5D), which were highly expressed in SCC compared with COPD tissues, was associated with decreased overall survival rate compared with low expression. The survival analysis also revealed that patients with low expression of HSPB1 in SCC were still alive at the end of the follow-up period (13.6 years), suggesting that HSPB1 may be a potential therapeutic target in SCC. The results obtained in the present study demonstrate the clinical significance of HSP90AA1, ADRB2, TBL1XR1 and HSPB1 in COPD-related SCC.
Figure 5.
Overall survival analysis according to (A) Gene ID 206170_at (ADRB2), (B) Gene ID 214328_at_(HSP90AA1), (C) Gene ID 201841_s_at (HSPB1) and (D) Gene ID 235890_s_(TBL1XR1) expression, based on a clinical microarray dataset using PROGgene. Left panel represents gene expression distribution in SCC samples. Right panel represents overall survival time. Red and blue represent high and low expression levels, respectively. ADRB2, adrenoceptor β2; HSP90AA1, heat shock protein 90 α family class A member 1; TBL1XR1, transducin β like 1 X-linked receptor 1; HSPB1, heat shock protein family B (small) member 1.
Discussion
COPD is a chronic lung disease that results in decreased airflow in the lungs. Symptoms include trouble with breathing, coughing, mucus (sputum) generation and wheezing (12). This is caused by exposure to irritant compounds or particulate matter, predominantly from tobacco smoke (13). Individuals with COPD are at an increased risk of developing lung tumors and other conditions, such as emphysema and chronic bronchitis (13). COPD is a risk factor for SCC and the association between the two remains unclear (13). Therefore, the association between COPD and SCC needed to be elucidated the present study identified marker genes uniquely associated with SCC with COPD. The results of the present study indicated that high levels of ADRB2 and HSP90AA1 were associated with a longer survival time compared with the respective low expression groups. However, high expression of TBL1XR1 and HSPB1 predicted poor overall survival in patient with SCC.The microarray analysis employed in the current study revealed that ADRB2 and HSP90AA1 are downregulated in SCC tissues compared with healthy adjacent non-cancerous tissue, suggesting that they may serve important roles in the progression of SCC. Previous studies reported that ADRB2 is an important regulator of airway smooth muscle tone (14–16). High levels of ADRB2 are associated with reduced lung function, asthma and COPD (14). Additionally, ADRB2 is an anti-inflammatory gene and was previously reported to be involved in hyperinflammation (17) and autoimmune disorders such as rheumatoid arthritis (18). Furthermore, ADRB2 has been demonstrated to be downregulated in prostate and breast cancer, and functional gain of ADRB2 decreases cancer progression (19,20). However, the association between ADRB2 and COPD-associated SCC remains unknown. In the present study, TBL1XR1 mRNA was highly expressed in samples from patients with SCC with COPD, but not SCC without COPD. TBL1XR1 is a member of the WD40 repeat-containing gene family. The protein encoded by this gene composes an integral subunit of both nuclear receptor corepressor and histone deacetylase 3 complexes, and serves an essential role in regulating the activation of transcriptional and intracellular signaling pathways, such as JNK and ribonuclease pathways (21,22). Upregulation of TBL1XR1 gene expression has been observed in various cancer cell lines and solid tumors from patients (23,24), lymphoma (25) and acute leukemia (26), and are associated with advanced tumor stage, metastasis and poor prognosis. In SCC, elevated mRNA and protein expression of TBL1XR1 was confirmed in cells lines and patient samples (27). While a direct association between COPD and TBL1XR1 has not been revealed, this gene has been reported to regulate several important signaling pathways that modulate the growth of lung α-smooth muscle actin-positive cells and airway reconstruction (28). Previously, Zou et al (29) reported that HSP90AA1 affects the response of patients with systemic lupus erythematosus to glucocorticoid treatment and the lipopolysaccharide-induced inflammatory response, including tumornecrosis factor secretion by monocytes. Furthermore, the role of HSP90AA1 in cancer pathogenesis is currently an area of active investigation. A study suggested that high expression of HSP90AA1 was an independent factor associated with mortality in patients with breast cancer (triple-negative and humanepidermal growth factor receptor 2-negative/estrogen receptor-positive subtypes) (30).HSPB1 encodes HSP20, a member of the heat shock protein family (31,32). The association between increased expression levels of HSPA1A and HSPB1 in plasma and lymphocytes and the risk of COPD has been reported and verified by lung tissue proteomics (31). Additionally, upregulated expression of HSPB1 was observed in SCC and the endothelial-to-mesenchymal transition, and was associated with poor survival (33). Upregulation of HSPB1 has also been observed in leukemia, cervical cancer, breast cancer and NSCLC. In the present study, the possible clinical importance of HSPB1 in COPD-related SCC are in agreement with previous studies, which suggests that it requires further research (34).In the present study, patients with SCC who exhibited low expression of HSPB1 were still alive after 13.6 years, suggesting that HSPB1 may be a potential therapeutic target in SCC. The efficacy of HSPB1 as a therapeutic target requires further investigation and validation by in vivo experiments.In conclusion, ADRB2, HSP90AA1, TBL1XR1 and HSPB1 may serve important roles in COPD-associated SCC and may provide novel therapeutic targets for the treatment of lung cancer. The limited number of studies existing on these four genes in COPD-related SCC progression is a limitation that impacts the interpretation of the findings from the current in silico study and limits the applications of these signatures within clinical practice.
Authors: David W Scott; Karen L Mungall; Susana Ben-Neriah; Sanja Rogic; Ryan D Morin; Graham W Slack; King L Tan; Fong Chun Chan; Raymond S Lim; Joseph M Connors; Marco A Marra; Andrew J Mungall; Christian Steidl; Randy D Gascoyne Journal: Blood Date: 2012-04-11 Impact factor: 22.113
Authors: Christine Desmedt; Fanny Piette; Sherene Loi; Yixin Wang; Françoise Lallemand; Benjamin Haibe-Kains; Giuseppe Viale; Mauro Delorenzi; Yi Zhang; Mahasti Saghatchian d'Assignies; Jonas Bergh; Rosette Lidereau; Paul Ellis; Adrian L Harris; Jan G M Klijn; John A Foekens; Fatima Cardoso; Martine J Piccart; Marc Buyse; Christos Sotiriou Journal: Clin Cancer Res Date: 2007-06-01 Impact factor: 12.531
Authors: Mario Pérez-Sayáns; José Manuel Somoza-Martín; Francisco Barros-Angueira; Pilar Gayoso Diz; José Manuel Gándara Rey; Abel García-García Journal: Oncol Res Date: 2010 Impact factor: 5.574