Literature DB >> 29266838

Identification of differential protein-coding gene expressions in early phase lung adenocarcinoma.

Li-Na Zhou1, Shi-Cheng Li2, Xue-Ying Li1, Hong Ge1, Hong-Mei Li1.   

Abstract

BACKGROUND: The diagnosis of early phase lung adenocarcinoma (LADC) is associated with therapeutic strategy, effect, and survival time. However, the sensitive biomarkers of early phase LADC are still unclear. This study aimed to identify protein-coding genes that can be used as biomarkers of early stage LADC.
METHODS: Gene microarray analysis was performed to identify key hub genes that show different expression in lung adenocarcinoma compared to normal tissues. The microarray data of lung adenocarcinoma in stages IA, IB, IIA, IIB, and normal tissues (GSE10072) were downloaded from a free online database, Gene Expression Omnibus (GEO).
RESULTS: A total of 572 differentially expressed genes (DEGs) were identified between early phase lung adenocarcinoma and normal tissues using R software. Database for Annotation, Visualization and Integrated Discovery online tools were used to obtain Gene Ontology analysis and the Kyoto Encyclopedia of Genes and Genomes was used to analyze DEGs. Cytoscape software was used to express the protein-protein interaction network. We found that some cancer-related Gene Ontology terms and pathways (e.g. cell adhesion, cell surface receptor signaling pathway, PI3K-Akt signaling pathway) were significantly enriched in DEGs.
CONCLUSION: Protein-coding genes JUN, FYN, CAV1, and SFN may play vital roles in the progress of early-stage lung adenocarcinoma. Consequently, through bioinformatics analysis, the key genes could be established to provide more potential references for the therapeutic targets of lung adenocarcinoma.
© 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.

Entities:  

Keywords:  Biomarkers; genes expression profiling; lung adenocarcinoma; survival

Mesh:

Substances:

Year:  2017        PMID: 29266838      PMCID: PMC5792719          DOI: 10.1111/1759-7714.12569

Source DB:  PubMed          Journal:  Thorac Cancer        ISSN: 1759-7706            Impact factor:   3.500


Introduction

Lung cancer is the most common cancer in the world, associated with significant morbidity and mortality.1 The incidence of lung cancers continues to rise in China. Like other cancers, lung cancer is a heterogeneous disease that carries high rates of genetic somatic mutations, copy number alterations, and chromosomal rearrangement.2 Lung adenocarcinoma (LADC) is a familiar histological type that accounts for 40% of all lung cancers and is one of the best genetically characterized human epithelial malignancies.3 Although treatment methods, including surgery, chemotherapy, radiotherapy, and targeting therapy are improving, the five‐year overall survival rate remains lower than 15%.1 Consequently, elucidating the molecular mechanism involved in the proliferation of LADC is important for the development of efficacious diagnosis and treatment strategies. Gene expression analysis based on high‐throughput techniques, such as microarrays, is widely used. This useful research method is increasingly being utilized as a promising tool to improve the molecular diagnosis and classification of cancers, as well as to determine new drug targets. Meanwhile, microarray technology can simultaneously detect thousands of genes to locate the differentially expressed genes (DEGs) involved in different pathways, molecular functions, or biological characteristics. Many DEGs play an important role in lung adenocarcinoma proliferation and progression, and could be valued as potential molecular targets or efficacious diagnosis markers. A recent study analyzed data with expression array analysis, Gene Ontology (GO), and pathway analyses to selected genes, and after a series of trials, finally found that TRIM58 overexpression suppresses LADC cell proliferation and oncogenesis in vitro and in vivo, suggesting that this gene acts as a tumor suppressor gene (TSG) during the initiation and/or early progression stages of LADC.4 Using microarray technology combining bioinformatics analysis, Griesing et al. demonstrated that miR‐532‐5p is directly regulated by TTF‐1 and plays a vital role in tumor suppression by targeting KRAS and MKL2 in LADC.5 To date, microarray screening has been utilized to identify DEGs in various types of cancer to better understand the mechanisms underlying cancer progression. Original data (GSE10072) for our study was downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/). We compared DEGs between LADC and normal samples, and performed function and pathway enrichment analysis. Additionally, a protein–protein interaction (PPI) network of all of the DEGs was constructed. Genes that showed higher degrees in the PPI networks were selected as key genes in LADC. Further, several transcription factors and their binding sites were studied.

Methods

Microarray data

The microarray data obtained for this study were available in the GEO database under the expression dataset profile of GSE10072, based on the GPL96 platform. The GSE10072 dataset consists of 107 ADC and non‐tumor tissue samples, and includes tumor node metastasis (TNM) stages and survival data.

Identification of differentially expressed genes (DEGs)

Analysis of the GSE10072 dataset was carried out using Affymetrix Human Genome U133A Array (ThermoFisher Scientific, Waltham, MA, USA). A processed series matrix file was then downloaded for microarray data analysis. After the series matrix data was background normalized using the Robust Multichip Averaging (RMA) method, the data was subjected to DEG selection. The DEGs were defined by the Limma algorithm (R software, R Foundation for Statistical Computing, Vienna, Austria). A P value of < 0.05 and |logFC| > 2 were considered as significant.

Bioinformatic analysis

The GO (http://www.geneontology.org) is a widely used bioinformatics resource that supplies information about the annotation of genes, molecular functions, and cellular components.6, 7 The Kyoto Encyclopedia of Genes and Genomes (KEGG, http://www.genome.jp/) is an integrated database that provides the molecular functions of genes and proteins, genome sequences, and high‐throughput data.8 The Database for Annotation, Visualization and Integrated Discovery (DAVID, https://david.ncifcrf.gov/) is an essential foundation for visualization, annotation, and integrated findings.9 We performed GO enrichment analysis and KEGG pathway enrichment for DEGs using the DAVID online tool. The Retrieval of Interacting Genes (STRING) database, was then used to evaluate protein–protein interaction (PPI) information to identify the interaction relationships between DEGs. Cytoscape software (National Institute of General Medical Sciences, Bethesda, MD, USA) was used to construct PPI networks for the identified hub genes between DEGs.

Survival analysis

Kaplan–Meier plots and Cox regression were employed to visualize the association between the genes under investigation and survival to explore the prognostic value of differentially expressed coding genes. Before analysis, the samples were classified into two groups: high expression or low expression. Survival analysis was then compared between the groups.10 We conducted statistical analysis using ONCOMINE and the Kaplan–Meier Plotter database. Statistical significance was examined using the Student's t‐test. P < 0.05 or P<0.01 were considered statistically significant.

Results

DEGs in lung adenocarcinoma

A total of 13 987 gene expression values were obtained from 107 samples after raw data pretreatment using the RMA method. R statistical analysis software was used to define the DEGs. A total of 572 DEGs were revealed by comparing 31 gene expression profiles from normal tissues and 33 profiles from LADC, of which 420 were upregulated and 152 were downregulated in LADC. A volcano map of DEG expression in the samples is shown in Figure 1.
Figure 1

A volcano map using log2 as a standard to demonstrate the expression of differentially expressed genes (DEGs).

A volcano map using log2 as a standard to demonstrate the expression of differentially expressed genes (DEGs).

Functional enrichment analysis

GO or KEGG pathway enrichment analysis could reveal molecular functions or biological pathways and were performed for the DEGs (Fig 2). GO analysis showed that DEGs were significantly enriched in biological processes (BP), molecular functions (MF), and cell components (CC), including cell communication, cell adhesion molecule activity, and plasma membrane. Some cancer‐related pathways were enriched in the DEGs, such as Beta1 integrin cell surface interactions, glypican pathways, and the ErbB receptor signaling network (Fig 2).
Figure 2

Gene Ontology analysis results showed that differentially expressed genes (DEGs) were significantly enriched in (a) biological processes (BP), (b) molecular functions (MF), and (c) cell components (CC). (d) Some cancer‐related pathways were enriched in the DEGs, such as Beta1 integrin cell surface interactions, glypican pathways, and the ErbB receptor signaling network.

Gene Ontology analysis results showed that differentially expressed genes (DEGs) were significantly enriched in (a) biological processes (BP), (b) molecular functions (MF), and (c) cell components (CC). (d) Some cancer‐related pathways were enriched in the DEGs, such as Beta1 integrin cell surface interactions, glypican pathways, and the ErbB receptor signaling network.

Protein‐protein interaction network construction and identification of hub genes

The STRING database and Cytoscape software were used to obtain further insight into the interaction between DEGs. The PPI network consisted of 423 nodes interacting at 3217 edges, while the remaining 149 DEGs did not match the PPI pairs. Meanwhile, the top four hub nodes with higher degrees were screened. These hub genes, JUN, FYN, CAV1 and SFN, could code relative proteins, considered the hub proteins, which are more likely to play a vital role in the genesis and growth of cancer. The hub proteins and their interactions are depicted in Figure 3.
Figure 3

(a) The protein–protein network, which consisted of 423 nodes interacting at 3217 edges; and (b) the hub proteins and the number of interactions.

(a) The protein–protein network, which consisted of 423 nodes interacting at 3217 edges; and (b) the hub proteins and the number of interactions.

Survival analysis of hub genes

To test the potential prognostic roles of these hub genes, we analyzed whether there was any association to survival in lung cancer patients. Kaplan–Meier survival and univariate Cox proportional hazards regression analyses were performed for the hub genes, and results showed that expression levels of three genes, JUN, CAV1 and FYN, were significantly associated with survival in lung cancer patients. High expression of these three genes led to prolonged survival (Fig 4). To further confirm our findings, we consulted ONCOMINE. The results showed that SFN was significantly highly expressed in lung cancer tissues, thus high expression of SFN was associated with shorter survival (Fig 4).
Figure 4

A patient with high (a) JUN, (b) CAV1, and (c) FYN expression can achieve longer survival. (d) A patient with high SFN expression would experience shorter survival. HR, hazard ratio.

A patient with high (a) JUN, (b) CAV1, and (c) FYN expression can achieve longer survival. (d) A patient with high SFN expression would experience shorter survival. HR, hazard ratio.

Discussion

Lung cancer is a heterogeneous disease, a product of genetic somatic mutation, copy number alteration, and chromosomal rearrangement. LADC incidence is the most common among different pathological types. Therefore, understanding the molecular mechanism of LADC is vital for accurate diagnosis, treatment, and prognosis. Microarray and high‐throughput analysis could provide expression levels of thousands of human genes and thus has been widely used to determine potential diagnosis biomarkers and therapeutic targets for LADC. In our study, we analyzed the microarray data from GSE10072 and identified 420 upregulated and 152 downregulated DEGs between LADC and normal tissue profiles. Using DAVID, bioinformation analysis showed that these DEGs were mainly involved in enzyme inhibitor activity, chemokine signaling pathways, and cytokine‐cytokine receptor interaction. A PPI network was used to obtain further insight into the association between DEGs. The top four hub nodes with higher degrees were screened. These hub genes, JUN, FYN, CAV1, and SFN, may represent potential biomarkers or provide new ideas for therapeutic studies. In the present study, gene expression of 107 samples was retrieved from the GEO dataset under the accession GSE10072, including TNM stage and survival data. Using an RMA method, a total of 13 987 genes were obtained from these samples. After comparing the gene expression profiles of 31 normal tissues and 33 LADC, 572 DEGs were revealed, among which 420 were upregulated and 152 downregulated. Cumulative evidence has demonstrated that a group of genes with similar expression were made up of co‐expression genes, which also take part in the biological process.11 In order to gain deeper insight into the interaction of DEGs, we performed GO and KEGG pathway analyses. The results of GO analysis showed that upregulated DEGs were significantly enriched in BP, including cellular component organization or biogenesis, biological regulation, and the developmental process; downregulated DEGs were significantly enriched in BP, including the metabolic process, cell communication, and cell cycle. This finding proved that cell component organization or biogenesis, the cell metabolic process and cell cycle is associated with tumorigenesis and prognosis. The cell cycle is a tightly integrated process that is frequently aberrant in non‐small cell lung cancer (NSCLC).12 It has been reported that for rapid cell growth and proliferation, tumors have high energetic and anabolic needs. Therefore, cell metabolic component organization and biogenesis is an important source of metabolic intermediates. Furthermore, KEGG analysis showed that upregulated DEGs were significantly enriched in ECM‐receptor interaction, PI3K‐AKT signaling, and other cancer‐related pathways. Previous studies have shown that EMC plays a vital role in maintaining stem cell properties and regulating stem cell differentiation. Dysregulation of EMC regulates the development and progression of tumors by promoting the tumor microenvironment.13, 14, 15 Moreover, a disturbance in the PI3K‐AKT signaling pathway is associated with resistance to EGFR‐TKI therapy and lower survival in NSCLC patients.16 Downregulated DEGs were associated with focal adhesion, the p53 signaling pathway, and other functions. Dy et al. reported that focal adhesion kinase is expressed in more than 50% of stage I NSCLC patients.17 In addition, other articles have demonstrated that focal adhesion kinase is a promising factor to predict aggressive behavior and prognosis in NSCLC patients, particularly with the ADC subtype.18 It has been reported that cisplatin causes p53 overexpression, which may represent a predictive marker for chemotherapy efficacy in LADC.19 p53 is also involved in the immune response by regulating PD‐L1.20 Therefore, monitoring these singling pathways may improve the measure of diagnosis and the prediction of therapy. After functional enrichment analysis, we constructed a PPI network with DEGs, and then determined the top degree hub genes: JUN, FYN, CAV1, and SFY. The first hub gene, JUN, plays an important role in cell proliferation, the suppression of apoptosis, and increased angiogenesis. c‐JUN integrates signals of several developmental pathways, including EGFR‐TKI and activin B‐MAP3K1‐JNK for cell proliferation and differentiation.21 The second hub gene, FYN, belongs to the Src family kinase (SFKs), which is involved in signal transduction or the susceptibility of cancer cells to some anti‐cancer treatments. In tamoxifen‐resist breast cancer cell lines, FYN regulates and plays a key role in resistant mechanisms.22 Recent studies have demonstrated that FYN together with other SFK members is a mediator of growth‐factor induced anti‐apoptotic activity of Akt/PKB. Several cancers, including breast, ovarian, and oral carcinomas are associated with FYN, as such FYN inhibitors are being investigated.23 The last hub gene, SFN, a potent anti‐cancer compound, inhibits cancer stem‐like cell properties and enhances the therapeutic efficacy of cisplatin in human NSCLC.24 Some articles have reported that SFN can induce apoptosis in a variety of cancers via multiple mechanisms. SFN‐Cys has been used to inhibit invasion in human prostate cancer cells. Other trials have found that SFN‐Cys triggers ERK1/2 activation to mediate the mitochondria signaling pathway with upregulation and α‐tubulin downregulation leading to NSCLC cell apoptosis.25 In conclusion, our data identified four key genes and some pathways related to LADC proliferation, progression, and apoptosis using a series of bioinformation analyses on DEGs between LADC and normal tissues. Previous studies have indicated that early‐stage LADC corrects the CAM pathway.26 The results of this study provide a series of useful targets for future investigation into the molecular mechanisms and biomarkers of LADC. Previous studies that used the complementary DNA microarray system found that numerous alterations of gene expression were present in early‐stage cancer.27 However, some of functions of those genes are still unclear, therefore, further molecular biological experiments are required to determine the biomarkers and potential therapeutic targets of early stage LADC.

Disclosure

No authors report any conflict of interest.
  27 in total

Review 1.  Adhesion within the stem cell niches.

Authors:  Karine Raymond; Marie-Ange Deugnier; Marisa M Faraldo; Marina A Glukhova
Journal:  Curr Opin Cell Biol       Date:  2009-06-15       Impact factor: 8.382

Review 2.  Fyn kinase in brain diseases and cancer: the search for inhibitors.

Authors:  S Schenone; C Brullo; F Musumeci; M Biava; F Falchi; M Botta
Journal:  Curr Med Chem       Date:  2011       Impact factor: 4.530

3.  Epidemiology: The dominant malignancy.

Authors:  Eric Bender
Journal:  Nature       Date:  2014-09-11       Impact factor: 49.962

4.  Overexpression of focal adhesion kinase correlates with increased lymph node metastasis and poor prognosis in non-small-cell lung cancer.

Authors:  Hong-Fei Ji; Da Pang; Song-Bin Fu; Yan Jin; Lei Yao; Ji-Ping Qi; Jing Bai
Journal:  J Cancer Res Clin Oncol       Date:  2012-11-11       Impact factor: 4.553

5.  Adult SVZ stem cells lie in a vascular niche: a quantitative analysis of niche cell-cell interactions.

Authors:  Qin Shen; Yue Wang; Erzsebet Kokovay; Gang Lin; Shu-Mien Chuang; Susan K Goderie; Badrinath Roysam; Sally Temple
Journal:  Cell Stem Cell       Date:  2008-09-11       Impact factor: 24.633

6.  cDNA microarray analysis of gene expression in pathologic Stage IA nonsmall cell lung carcinomas.

Authors:  Haruhiko Nakamura; Hisashi Saji; Akihiko Ogata; Makoto Hosaka; Masaru Hagiwara; Takamoto Saijo; Norihito Kawasaki; Harubumi Kato
Journal:  Cancer       Date:  2003-06-01       Impact factor: 6.860

7.  Sulforaphane-cysteine-induced apoptosis via phosphorylated ERK1/2-mediated maspin pathway in human non-small cell lung cancer cells.

Authors:  Kai Lin; Ronghui Yang; Zhongnan Zheng; Yan Zhou; Yang Geng; Yabin Hu; Sai Wu; Wei Wu
Journal:  Cell Death Discov       Date:  2017-07-03

8.  Thyroid transcription factor-1-regulated microRNA-532-5p targets KRAS and MKL2 oncogenes and induces apoptosis in lung adenocarcinoma cells.

Authors:  Sebastian Griesing; Taisuke Kajino; Mei Chee Tai; Zhuoran Liu; Masahiro Nakatochi; Yukako Shimada; Motoshi Suzuki; Takashi Takahashi
Journal:  Cancer Sci       Date:  2017-06-10       Impact factor: 6.716

9.  Sulforaphane inhibits cancer stem-like cell properties and cisplatin resistance through miR-214-mediated downregulation of c-MYC in non-small cell lung cancer.

Authors:  Qian-Qian Li; You-Ke Xie; Yue Wu; Lin-Lin Li; Ying Liu; Xiao-Bo Miao; Qiu-Zhen Liu; Kai-Tai Yao; Guang-Hui Xiao
Journal:  Oncotarget       Date:  2017-02-14

10.  PDL1 Regulation by p53 via miR-34.

Authors:  Maria Angelica Cortez; Cristina Ivan; David Valdecanas; Xiaohong Wang; Heidi J Peltier; Yuping Ye; Luiz Araujo; David P Carbone; Konstantin Shilo; Dipak K Giri; Kevin Kelnar; Desiree Martin; Ritsuko Komaki; Daniel R Gomez; Sunil Krishnan; George A Calin; Andreas G Bader; James W Welsh
Journal:  J Natl Cancer Inst       Date:  2015-11-17       Impact factor: 13.506

View more
  3 in total

1.  Intravenous Transplantation of Mesenchymal Stem Cells Reduces the Number of Infiltrated Ly6C+ Cells but Enhances the Proportions Positive for BDNF, TNF-1α, and IL-1β in the Infarct Cortices of dMCAO Rats.

Authors:  Yunqian Guan; Xiaobo Li; Wenxiu Yu; Zhaohui Liang; Min Huang; Renchao Zhao; Chunsong Zhao; Yao Liu; Haiqiang Zou; Yanli Hao; Zhiguo Chen
Journal:  Stem Cells Int       Date:  2018-10-02       Impact factor: 5.443

2.  Lentivirus-mediated shRNA targeting MUTYH inhibits malignant phenotypes of bladder cancer SW780 cells.

Authors:  Qunjun Gao; Yuhan Liu; Haibiao Xie; Yucheng Zhong; Xinhui Liao; Hengji Zhan; Qun Zhou; Mengting Ding; Kang Yang; Aolin Li; Yuchen Liu; Hongbing Mei; Zhiming Cai
Journal:  Onco Targets Ther       Date:  2018-09-21       Impact factor: 4.147

3.  Identification of key genes and evaluation of clinical outcomes in lung squamous cell carcinoma using integrated bioinformatics analysis.

Authors:  Yangfeng Shi; Yeping Li; Chao Yan; Hua Su; Kejing Ying
Journal:  Oncol Lett       Date:  2019-09-30       Impact factor: 2.967

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.