Gang Tian1, Xiang-Xiao Lin1, Xue-Mei Zhang1, Zhun He2, Zhi-Liang Hu2. 1. Department of Respiratory Medicine, Affiliated Hospital of Jining Medical University, Jining 272000, China. 2. Department of Thoracic Surgery, Affiliated Hospital of Jining Medical University, Jining 272000, China.
Abstract
BACKGROUND: Lung adenocarcinoma (AD) remains one of the most common cancers. Early diagnosis of AD improves therapeutic strategy and lengthens survival time. The objective of this study is to identify hub genes influencing the process of lung AD. METHODS: The microarray profiles of GSE43458 were extracted from the Gene Expression Omnibus (GEO) database to screen potential targets during lung AD. Differentially expressed genes (DEGs) between AD patients and normal controls were detected. Then gene ontology (GO) and Kyoto Encyclopedia of Gene and Genome (KEGG) pathway analysis were performed. Moreover, the major modules of protein-protein interaction (PPI) network of those DEGs were performed using the MCODE plug of the Cytoscape. The hub genes were validated in the Oncomine and GEPIA datasets. Additionally, the prognostic values of hub genes were evaluated in Kaplan Meier plotter and GEPIA databases. RESULTS: Totally, 859 DEGs were identified, including 278 up-regulated and 581 down-regulated genes. Functional annotation suggested those DEGs were related to cell adhesion, migration and motility. Besides, helicase lymphoid-specifics (HELLs) and selenoprotein P1 (SEPP1) were regarded as hub genes in AD. Then, the upregulation of HELLs and downregulation of SEPP1 were validated in the Oncomine and GEPIA databases, respectively. Moreover, Kaplan-Meier and GEPIA databases also suggested both HELLs and SEPP1 could affect the prognosis of lung AD patients. CONCLUSIONS: Our study demonstrated HELLs and SEPP1 were hub genes contributing to the progress of lung AD. They could be potential target genes for the diagnosis and therapy of lung AD. 2019 Translational Cancer Research. All rights reserved.
BACKGROUND: Lung adenocarcinoma (AD) remains one of the most common cancers. Early diagnosis of AD improves therapeutic strategy and lengthens survival time. The objective of this study is to identify hub genes influencing the process of lung AD. METHODS: The microarray profiles of GSE43458 were extracted from the Gene Expression Omnibus (GEO) database to screen potential targets during lung AD. Differentially expressed genes (DEGs) between AD patients and normal controls were detected. Then gene ontology (GO) and Kyoto Encyclopedia of Gene and Genome (KEGG) pathway analysis were performed. Moreover, the major modules of protein-protein interaction (PPI) network of those DEGs were performed using the MCODE plug of the Cytoscape. The hub genes were validated in the Oncomine and GEPIA datasets. Additionally, the prognostic values of hub genes were evaluated in Kaplan Meier plotter and GEPIA databases. RESULTS: Totally, 859 DEGs were identified, including 278 up-regulated and 581 down-regulated genes. Functional annotation suggested those DEGs were related to cell adhesion, migration and motility. Besides, helicase lymphoid-specifics (HELLs) and selenoprotein P1 (SEPP1) were regarded as hub genes in AD. Then, the upregulation of HELLs and downregulation of SEPP1 were validated in the Oncomine and GEPIA databases, respectively. Moreover, Kaplan-Meier and GEPIA databases also suggested both HELLs and SEPP1 could affect the prognosis of lung AD patients. CONCLUSIONS: Our study demonstrated HELLs and SEPP1 were hub genes contributing to the progress of lung AD. They could be potential target genes for the diagnosis and therapy of lung AD. 2019 Translational Cancer Research. All rights reserved.
At present, lung cancer remains to be one of the most common malignant tumors and leads to increasing mortality around the world (1,2). During the past decades, the occurrence and mortality of lung cancer are rising rapidly worldwide, especially in those countries with advanced industry (3). Lung adenocarcinoma (AD) is the main histologic subtype of lung carcinoma (4). Although numerous studies verify multiple oncogenes participating in the pathogenesis of lung cancer, the overall 5-year survival rate of lung patients is still very lower (4,5). Thus, identification of molecular mechanisms and pathways for the development of lung tumors will benefit to the therapeutic effect and improve the outcomes of lung AD patients.High throughput technology provides promising methods for cancer research, such as the molecular diagnosis and prognosis prediction of cancer (6,7). Previous studies have reported numerous potential biomarkers and therapeutic targets of lung cancer through microarrays or sequencing data (8). Girard et al. developed an mRNA expression signature including 62 genes to discriminate non-small cell lung cancer (NSCLC) patients. The researchers demonstrated high predictive accuracies (93–95%) had been obtained in the TCGA and other public database using their signature (9). Meanwhile, Chen et al. developed a malignancy-risk gene signature that is significant related to overall survival (OS) of NSCLC patients. This signature could be used for the early identification of NSCLC patients (10). Therefore, microarray data provides us new opportunities to identify novel target genes and construct prediction models for lung cancer, which may promote the treatments of lung cancer patients.In our current study, we downloaded original microarray data from Gene Expression Omnibus (GEO) to screen differentially expressed genes (DEGs) between AD and normal controls. Functional levels of DEGs were subsequently performed through gene ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Moreover, protein-protein interaction (PPI) network was performed to pick out hub genes in the developments of AD. In addition, prognostic effects of hub genes were evaluated in the Kaplan Meier plotter and GEPIA databases. We hoped to have an in-deep understanding of the pathogenesis of AD, then to identify potential candidate targets or biomarkers for the early prediction and diagnosis of AD patients.
Methods
Data processing of DEGs
In our study, GSE43458 gene expression profiles were extracted from the GEO dataset. GSE43458, submitted by Kabbout et al. (11), was performed by GPL6244 platform (Affymetrix Human Gene 1.0 ST Array). Only never smoking (n=40) AD samples and 30 normal tissues were included in our current study. Limma package was adopted to screen the DEGs between normal tissues and AD samples. |log2foldchange (FC)| >1 was considered as the threshold to determine the significant difference of gene expression. GO and KEGG pathway analysis were carried out to investigate the DEGs at the functional level using the Database for Annotation, Visualization and Integrated Discovery (DAVID). False discovery rate (FDR) <0.05 was set as significantly difference.
PPI network analysis
The Search Tool for the Retrieval of Interacting Gene (STRING) database was performed to construct interaction network among DEGs (12). Combined score more than 0.4 was defined as the threshold. What’s more, the network was visualized by Cytoscape software. The MCODE plug of Cytoscape was applied to display the major modules of PPI network. The criteria were considered as following: MCODE score ≥10 and number of nodes ≥4. MCODE was based on vertex weighting by local neighborhood density and outward traversal from a locally dense seed protein to isolate the dense regions according to given parameters (13). The algorithm identified seed nodes for expansion by computing a score of local density for each node in the graph. The algorithm expands highly scoring seed nodes in a local search procedure by adding highly scoring nodes connected to the module (14).
Validation of the hub genes
The Oncomine database (www.oncomine.org) includes 500 cancer types with gene expression and sample data. Currently, more than 490 datasets and nearly 40,000 measured samples were contained in the Oncomine database. According to a large amount of data, the Oncomine provides several analytical tools, including differential expression analysis, co-expression analysis, and comparing analysis. To validate the dysregulation of the hub genes, we applied the Oncomine database to confirm the differentially expression levels of hub genes. In our current study, the expression levels of cancer and normal control were compared with the Students’ t-test. The P value was established at 1×10−4 and FC was defined as 2. At the same time, the data type was limited to mRNA. Moreover, GEPIA dataset (http://gepia.cancer-pku.cn/) (15), another public database, was also used to explore the differential expression of hub genes.
Survival analysis of hub genes
The Kaplan Meier plotter database (www.kmplot.com), was applied to evaluate the prognostic values of hub genes (16). Up to now, 2,437 lung cancer patients were included in this database. Depending on the expression levels (high vs. low) of hub genes, patients were divided into two groups. We analyzed the survival time of AD patients using a Kaplan-Meier survival plot. The hazard ratio (HR) with 95% confidence intervals and log rank P value were calculated and displayed on the plot. Meanwhile, the effect of hub genes on the prognosis of lung AD patients were validated in the GEPIA database.
Statement of ethics approval
Our study was investigated using public database. Participants had been giving informed consent before taking part in the Oncomine, GEPIA and Kaplan Meier plotter databases. Moreover, this study had got approval from the Institutional Ethics Review Board of Affiliated Hospital of Jining Medical University (ID: 20181026B).
Results
Gene expression profiling in AD
In our current study, a comparative analysis between AD samples and normal lung tissues identified 589 DEGs (278 up-regulated genes and 581 down-regulated genes) according to the cut-off criteria (FDR <0.05 and log |FC|>1), as shown in . The heat map of top 100 genes was displayed in .
Figure 1
DEGs in AD patients. (A) The volcano plot of DEGs (up-regulated genes: red; down-regulated genes: green; not DEGs: blue); (B) the heat map of the top 100 DEGs. Red: up-regulation; green: down-regulation; (C,D) HELLs and SEPP1 were identified as hub genes for the development of lung cancer. DEGs, differentially expressed gene; AD, lung adenocarcinoma.
DEGs in AD patients. (A) The volcano plot of DEGs (up-regulated genes: red; down-regulated genes: green; not DEGs: blue); (B) the heat map of the top 100 DEGs. Red: up-regulation; green: down-regulation; (C,D) HELLs and SEPP1 were identified as hub genes for the development of lung cancer. DEGs, differentially expressed gene; AD, lung adenocarcinoma.
GO and KEGG pathway enrichment analysis
All DEGs were uploaded to the online DAVID database to identify the most significantly GO terms and KEGG pathways. GO results suggested DEGs were markedly related to biological processes, including cell adhesion, regulation of cell migration and regulation of cell motility (). For molecular function, the DEGs were involved in calcium ion binding, glycosaminoglycan binding and carbohydrate binding (). For cellular component, the DEGs were belonged to extracellular space, extracellular region, and extracellular region part (). KEGG analysis suggested the most significant pathways of the DEGs were extracellular matrix (ECM)-receptor interaction, complement and coagulation cascades, and vascular smooth muscle contraction ().
Based on the information in the STRING database, PPI network was presented by Cytoscape software. According to the MCODE score, only two modules were with more than 10 scores. Especially, helicase lymphoid-specific (HELLs) and selenoprotein P1 (SEPP1) were considered as the seed genes of two modules, which might function as key genes for lung AD occurrence and development (). In the GSE43458 dataset, HELLs upregulated 2.05 folds and SEPP1 downloaded 2.01 folds in the AD patients.
Different transcription levels of HELLs and SEPP1
Up to now, four studies showed AD patients had higher HELLs expression in Oncomine database, compared with normal controls. In Su’s dataset, HELLs mRNA level increased 2.53 folds (P=2.60×10−5) in the AD samples (). Okayama’s dataset revealed HELLs mRNA level increased 2.44 folds (P=2.96×10−12) in the AD samples (). In Garber’s dataset, the transcription levels of HELLs up-regulated 2.59 folds in the lung AD (P=1.45×10−5) (). The HELLs mRNA level also increased 2.56 folds in the Hou’s AD group (P=4.76×10−12), compared with normal samples (). In addition, 483 AD patients and 347 normal controls were included to compare the expression levels of HELLs mRNA in the GEPIA database. The results also revealed HELLs mRNA level increased in the AD patients (P<0.01) (Figure S1A).
Figure 2
Elevated HELLs in the Oncomine dataset. (A) In Su’s dataset, HELLs mRNA level increases 2.53 folds (P=2.60×10−5); (B) in Okayama’s dataset, HELLs mRNA level increases 2.44 folds (P=2.96×10−12); (C) in Garber’s dataset, the transcription levels of HELLs up-regulates 2.59 folds in the lung AD (P=1.45×10−5); (D) in Hou’s AD group, HELLs mRNA up-regulates 2.56 folds (P=4.76×10−12). HELL, helicase lymphoid-specific.
Elevated HELLs in the Oncomine dataset. (A) In Su’s dataset, HELLs mRNA level increases 2.53 folds (P=2.60×10−5); (B) in Okayama’s dataset, HELLs mRNA level increases 2.44 folds (P=2.96×10−12); (C) in Garber’s dataset, the transcription levels of HELLs up-regulates 2.59 folds in the lung AD (P=1.45×10−5); (D) in Hou’s AD group, HELLs mRNA up-regulates 2.56 folds (P=4.76×10−12). HELL, helicase lymphoid-specific.There were six studies revealing the downregulated SEPP1 in the Oncomine database. In Stearman’s study, SEPP1 mRNA level decreased 2.80 folds in AD patients, compared with normal tissues (P=2.95×10−5) (). In Selamat’s study, SEPP1 mRNA level decreased 6.63 folds in AD patients (P=6.84×10−31) (). Compared with normal lung tissues, Bhattacharjee’s study indicated SEPP1 mRNA level decreased 5.36 folds (P=4.12×10−6) (). Beer’s study suggested SEPP1 mRNA level decreased 3.30 folds (P=1.10×10−23) in the AD patients (). In Hou’s study, SEPP1 mRNA level decreased 3.17 folds (P=2.89×10−18), compared with normal lung samples (). Moreover, Su’s study suggested SEPP1 mRNA level decreased 2.47 folds (P=6.92×10−6) in AD patients (), when compared with normal lung tissues. Meanwhile, our results also suggested SEPP1 mRNA level decreased in the 483 AD patients, when compared with 347 normal controls in the GEPIA database (P<0.01) (Figure S1B).
Figure 3
Downregulated SEPP1 in the Oncomine dataset. (A) In Stearman’s study, SEPP1 mRNA level decreases 2.80 folds in AD patients (P=2.95×10−5); (B) in Selamat’s study, SEPP1 mRNA level decreases 6.63 folds (P=6.84×10−31); (C) in Bhattacharjee’s study, SEPP1 mRNA level decreases 5.36 folds (P=4.12×10−6); (D) in Beer’s study, SEPP1 mRNA level decreases 3.30 folds (P=1.10×10−23); (E) in Hou’s study, SEPP1 mRNA level decreases 3.17 folds (P=2.89×10−18); (F) in Su’s study, SEPP1 mRNA level decreases 2.47 folds (P=6.92×10−6). SEPP1, selenoprotein P1.
Downregulated SEPP1 in the Oncomine dataset. (A) In Stearman’s study, SEPP1 mRNA level decreases 2.80 folds in AD patients (P=2.95×10−5); (B) in Selamat’s study, SEPP1 mRNA level decreases 6.63 folds (P=6.84×10−31); (C) in Bhattacharjee’s study, SEPP1 mRNA level decreases 5.36 folds (P=4.12×10−6); (D) in Beer’s study, SEPP1 mRNA level decreases 3.30 folds (P=1.10×10−23); (E) in Hou’s study, SEPP1 mRNA level decreases 3.17 folds (P=2.89×10−18); (F) in Su’s study, SEPP1 mRNA level decreases 2.47 folds (P=6.92×10−6). SEPP1, selenoprotein P1.
The prognostic effect of hub genes for the lung AD patients
We explored the prognostic value of HELLs in the Kaplan Meier plotter and GEPIA databases. In lung AD patients, Kaplan-Meier analysis results indicated high-HELLs patients exhibited shorter OS periods than low-HELLs patients [HR 1.32 (1.03–1.68), P=0.025] (). Kaplan-Meier analysis also indicated that high-HELLs exhibited shorter progression-free survival (PFS) periods than low-HELLs patients [HR 1.98 (1.42–2.77), P=4.0×10−5] (). GEPIA database results also showed AD patients with higher HELLs exhibited shorter OS periods than those with lower HELLs (HR 1.6, P=0.0024) (Figure S2A). Meanwhile, prognostic value of the SEPP1 was also evaluated by Kaplan-Meier analysis. Kaplan-Meier analysis suggested low-SEPP1 AD patients had shorter OS periods than high-SEPP1 patients [HR 0.5 (0.40–0.64), P=6.5×10−9] (). Results also demonstrated patients with low-SEPP1 exhibited shorter PFS periods than patients with high-SEPP1 [HR 0.49 (0.36–0.67), P=8.3×10−6] in lung AD patients (). In addition, GEPIA database results also showed low-SEPP1 patients exhibited shorter OS periods than high-SEPP1 patients (HR 0.73, P=0.034) (Figure S2B)
Figure 4
The prognostic values of HELLs and SEPP1 for lung cancer patients in the Kaplan-Meier plot database. (A) Patients with high-HELLs levels exhibit shorter OS periods than patients with low-HELLs levels; (B) high-HELLs AD patients exhibit shorter PFS periods than low-HELLs patients; (C) patients with high-SEPP1 exhibit longer OS periods than those with low-SEPP1; (D) high-SEPP1 AD patients exhibit longer PFS periods than those patients with low-SEPP1. HELL, helicase lymphoid-specific; SEPP1, selenoprotein P1; PFS, progression-free survival; OS, overall survival; AD, lung adenocarcinoma.
The prognostic values of HELLs and SEPP1 for lung cancer patients in the Kaplan-Meier plot database. (A) Patients with high-HELLs levels exhibit shorter OS periods than patients with low-HELLs levels; (B) high-HELLs AD patients exhibit shorter PFS periods than low-HELLs patients; (C) patients with high-SEPP1 exhibit longer OS periods than those with low-SEPP1; (D) high-SEPP1 AD patients exhibit longer PFS periods than those patients with low-SEPP1. HELL, helicase lymphoid-specific; SEPP1, selenoprotein P1; PFS, progression-free survival; OS, overall survival; AD, lung adenocarcinoma.
Discussion
Although the occurrence of lung cancer has declined, it still results in the majority of cancer deaths all over the world, which is mainly caused by the failure of early prediction and diagnosis of lung cancer patients. Therefore, it was urgently important to explore the deep pathogenesis of lung cancer. In this study, we compared the gene expression between 40 AD samples and 30 normal tissues in GSE43458. Totally, 859 DEGs were identified, including 278 up-regulated genes and 581 down-regulated genes. To deepen our understanding of those DEGs, GO function and KEGG pathway analysis were carried out. Our results demonstrated those DEGs were involved in cell adhesion, migration and motility. Moreover, upregulated HELLs and downregulated SEPP1 were considered as hub genes in the progress of AD. Furthermore, prognostic analysis results also indicated upregulated HELLs and downregulated SEPP1 were associated with the survival time of AD patients.HELLs is one of members of the SNF2 (Sucrose Non-Fermenter) family of helicase proteins, which contribute to chromatin recombination, remodeling, transcription and methylation (17). High levels of HELLs have been identified in several human cancers, including leukemia, NSCLC, breast cancer, and melanoma (18). Waseem et al. demonstrated HELLs mRNA and protein expression were significantly correlated with the progression of head and neck squamous cell carcinoma. This study provided evidence that HELLs might be hub gene for early cancer discrimination and indicator of malignant conversion and progression. Moreover, the high expression of HELLs was validated in renal cell carcinoma. High mRNA level of HELLs was an independent predictor of poor outcome in renal cell carcinoma patients (19). Yano et al. suggested downregulation of HELLs by allelic loss and abnormal proteins produced by tumors specific exon might result in malignancy or progression of lung cells (20). Furthermore, HELLs activity was considered as a critical component of the malignant progression of cancer, and HELLs could be a promising therapeutic target for cancer patients (21).SEPP1 gene, with 10 selenocysteine residues, contributes to the selenium transport and production of other selenoproteins (22). Previous studies demonstrated SEPP1 played an important role in cancer prevention through its function in mediating oxidative damage (23,24). When the SEPP1 gene reduced, the oxidative stress increased and resulted in carcinogenesis (25). At present, the downregulation of plasma SEPP1 in various cancers has been established, including prostate and colorectal cancer (26,27). Gresner’s study (28) suggested SEPP1 gene expression downregulated in malignant NSCLC lung tissue, compared with paired non-malignant control tissue. In our study, we found SEPP1 was downregulated in the AD patients, which was consistent with previous reports (26,27). However, those studies about the deeply mechanism of SEPP1 to affect the development of lung cancer were limited.Our study had several attributes that strengthen its validity. Using bioinformatics methods, we analyzed the DEGs from the global gene levels, and identified the important roles of HELLs and SEPP1 in the development of lung AD. Dysregulation of HELLs and SEPP1 and their clinical effects were validated in large online databases. However, some limitations also were obvious. First, AD just represents one subgroup of lung cancer, so we do not analyze the DEGs in patients with different subtype, such as lung squamous cell carcinoma and small cell lung cancer. Therefore, the HELLs and SEPP1 might have a variety of functions on the progression of different lung cancer subgroups. Second, although we validated the dysregulation of HELLs and SEPP1 mRNA in the Oncomine and GEPIA database, the protein levels of HELLs and SEPP1 were not studied in our study. Moreover, further studies were needed to deeply investigate the mechanisms of HELLs and SEPP1 affecting the development of lung cancer.
Conclusions
Our current study performed a comprehensive analysis of DEGs contributing to the progress of AD. HELLs and SEPP1 were identified as core genes of AD. However, further experiments were needed to investigate the molecular biological function.
Authors: Dong Chen; Matthias Maruschke; Oliver Hakenberg; Wolfgang Zimmermann; Christian G Stief; Alexander Buchner Journal: Urology Date: 2017-01-06 Impact factor: 2.649
Authors: Alfonso Calvo; Nianqing Xiao; Jason Kang; Carolyn J M Best; Isabel Leiva; Michael R Emmert-Buck; Cheryl Jorcyk; Jeffrey E Green Journal: Cancer Res Date: 2002-09-15 Impact factor: 12.701
Authors: Andrea Franceschini; Damian Szklarczyk; Sune Frankild; Michael Kuhn; Milan Simonovic; Alexander Roth; Jianyi Lin; Pablo Minguez; Peer Bork; Christian von Mering; Lars J Jensen Journal: Nucleic Acids Res Date: 2012-11-29 Impact factor: 16.971