Literature DB >> 27994470

Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data.

Ying Liu1, Ran Ni1, Hui Zhang1, Lijun Miao1, Jing Wang1, Wenqing Jia1, Yuanyuan Wang1.   

Abstract

This study aimed to identify the genes and pathways associated with smoking-related lung adenocarcinoma. Three lung adenocarcinoma associated datasets (GSE43458, GSE10072, and GSE50081), the subjects of which included smokers and nonsmokers, were downloaded to screen the differentially expressed feature genes between smokers and nonsmokers. Based on the identified feature genes, we constructed the protein-protein interaction (PPI) network and optimized feature genes using closeness centrality (CC) algorithm. Then, the support vector machine (SVM) classification model was constructed based on the feature genes with higher CC values. Finally, pathway enrichment analysis of the feature genes was performed. A total of 213 down-regulated and 83 up-regulated differentially expressed genes were identified. In the constructed PPI network, the top ten nodes with higher degrees and CC values included ANK3, EPHA4, FGFR2, etc. The SVM classifier was constructed with 27 feature genes, which could accurately identify smokers and nonsmokers. Pathways enrichment analysis for the 27 feature genes revealed that they were significantly enriched in five pathways, including proteoglycans in cancer (EGFR, SDC4, SDC2, etc.), and Ras signaling pathway (FGFR2, PLA2G1B, EGFR, etc.). The 27 feature genes, such as EPHA4, FGFR2, and EGFR for SVM classifier construction and cancer-related pathways of Ras signaling pathway and proteoglycans in cancer may play key roles in the progression and development of smoking-related lung adenocarcinoma.

Entities:  

Keywords:  feature genes; lung adenocarcinoma; pathway; support vector machine (SVM) classification

Year:  2016        PMID: 27994470      PMCID: PMC5153289          DOI: 10.2147/OTT.S114230

Source DB:  PubMed          Journal:  Onco Targets Ther        ISSN: 1178-6930            Impact factor:   4.147


Introduction

Lung cancer is the leading cause of cancer deaths worldwide.1 Non-small cell lung cancer (NSCLC) represents about 85% of all diagnosed lung cancer cases2 and is usually diagnosed in advanced or metastatic stages with a poor overall 5-year survival rate.3 Lung adenocarcinoma is a common histological form of NSCLC and nearly half of the lung cancers diagnosed in the USA are adenocarcinoma. Most cases of lung adenocarcinoma are associated with smoking.4 Bryant and Cerfolio5 have reported that cigarette smoking is responsible for ~90% of lung cancers. Up to now, many studies have been done to explore the gene expression altered by cigarette smoking. For instance, Spira et al6 reported a smoking-related alteration of CYP1B1. NEK2 and CENPF have also been found to be differentially expressed in smoking-related lung cancer.7 Additionally, polymorphisms of CYPIA1 and OST1 have been suggested to be associated with susceptibility to lung cancer in relation to cigarette smoking.8 A recent study by Vucic et al9 found that microRNAs disrupted in a smoking status-dependent manner affected distinct cellular pathways and differentially influenced lung cancer patient prognosis in current, former, and never smokers. Moreover, Karlsson et al10 identified some genomic and transcriptional alterations in lung adenocarcinoma in relation to smoking history. In spite of these findings, we think it is not enough in the clinical practice. Therefore, in this study, we used three lung adenocarcinoma associated datasets, the subjects of which included smokers and nonsmokers, to screen the differentially expressed feature genes between smokers and nonsmokers. Based on the identified feature genes, we constructed the protein–protein interaction (PPI) network and optimized feature genes using closeness centrality (CC) algorithm. Then, the support vector machine (SVM) classification model was constructed based on the feature genes with higher CC values. Finally, we performed pathway enrichment analysis for the feature genes. To the best of our knowledge, the current approaches, such as PPI network analysis, feature genes optimization, and SVM classification model construction, have not been comprehensively applied in the relevant studies. We aimed to identify the genes associated with smoking in lung adenocarcinoma.

Data and methods

Microarray data

We searched the expression profile datasets from the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) database based on the keywords of lung cancer, homo sapiens, and smoke. The datasets that met the following criteria were included in this study: 1) the data were gene expression profile data; 2) the data were identified from the lung cancer tissues samples in patients with lung adenocarcinoma; 3) the lung adenocarcinoma patients included smokers and nonsmokers; and 4) the number of samples in each dataset was ≥50. After screening, three gene expression profile datasets, GSE43458, GSE10072, and GSE50081, were selected in this study. GSE43458 contained 110 samples, including 40 smokers, 40 nonsmokers and others (only 80 samples were used in this study); GSE10072 contained 107 samples, including 16 smokers, 42 nonsmokers and others (only 58 samples were used in this study); and GSE50081 contained 116 samples, including 23 smokers and 93 nonsmokers.

Data preprocessing and feature gene identification

In the original microarray data in CEL format, background correction,11 and quartile data normalization12 using the Affy package (http://www.bioconductor.org/packages/release/bioc/html/affy.html)13 in R were carried out. For the original data in TXT format, the probes were converted into gene symbols through the expression annotation platform and the empty probes were removed. If multiple probes corresponded to the same gene symbol, the mean value was calculated as the gene expression value of this gene. Then the data in GSE10072 and GSE43458 were integrated and the differentially expressed genes (DEGs) were selected using the limma package (http://www.bioconductor.org/packages/release/bioc/html/limma.html).14 The P-value was adjusted according to the Benjamini-Hochberg15 method. The adjusted P-value <0.05 and |log2 (fold change)| >0.585 were regarded as the cut-off values. The identified DEGs were considered as feature genes and clustering analysis was then performed. The GSE50081 dataset was used for verification.

PPI network construction

Human protein reference database (HPRD, http://www.hprd.org/)16 is a database of experimentally derived human proteomic information, which includes PPIs, post-translational modifications, and tissue expression. In the present study, we downloaded this database and mapped the identified feature genes to the PPI network. The PPI network was visualized using the Cytoscape (http://www.cytoscape.org/)17 software.

Feature gene optimization

The centrality study is a popular subject in the analysis of networks. CC highlights the players who will be able to contact all other members of the network easily.18 In this study, the close connectivity degree of nodes in the PPI network was calculated based on the CC algorithm as follows: where V represents the node set; t represents a certain node in the node set; and dG (v,t) represents the sum of the distance from node t to the other nodes. The CC value is between 0 and 1. The greater the value, the stronger the CC of the node.

SVM classification model construction and classification efficiency evaluation

SVM has become a popular classification tool. In this study, based on the CC values of the feature genes, we sorted these genes in the descending order. The integrated data of GSE10072 and GSE43458, including 56 nonsmokers and 82 smokers, were used as the training dataset, on which was then performed optimal SVM classifier training using the R package e1071 (version: 1.6–7).19 The significant feature genes in classifier were used for further analysis. The remaining dataset of GSE50081 was used as the verification dataset to evaluate the classification efficiency of the constructed optimal SVM classifier. The evaluation indexes included sensitivity, specificity, positive predictive value, negative predictive value, as well as areas under the receiver operating characteristic (ROC) curve.20 In addition, based on the survival time and terminal state of the clinical samples in GSE50081, we conducted the Kaplan–Meier (KM) survival analysis and drew the KM curve.21

Pathways enrichment analysis

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis of the feature genes were carried out using the fisher algorithm.22 The formula was as follows: where N represents the number of genes in whole genome; M represents the number of genes in pathway gene set; K represents the number of DEGs. The Fisher’s score represents the probability of at least x genes being pathway genes in K DEGs.

Results

DEGs analysis

A total of 296 DEGs were selected, including 213 downregulated and 83 upregulated DEGs. The distribution of DEGs is shown in the volcano plot (Figure 1A). In addition, hierarchical clustering analysis of the DEGs and samples showed that DEGs could cluster most of the similar sample together (Figure 1B).
Figure 1

(A) The distribution of DEGs. The orange color represents upregulated genes and blue color represents downregulated genes. (B) The tree diagram of hierarchical clustering analysis of DEGs for smoker and nonsmoker samples. The purple and orange vertical bars above the heat map, respectively, represent lung adenocarcinoma samples of smokers and nonsmokers.

Abbreviations: DEGs, differentially expressed gene; FC, fold change.

Based on the HPRD database and Cytoscape software, the PPI network was constructed, which included 249 nodes and 263 edges (Figure 2A). There were two kinds of gene nodes: feature gene nodes (58) and extension gene nodes (191). Extension gene was the gene that had direct interactions with at least five feature genes. Additionally, the result of node degree analysis showed that the number of nodes decreased with the increase of node degree, indicating that the PPI network had scale-free feature (Figure 2B).
Figure 2

(A) The constructed PPI network with DEGs. The red node represents upregulated feature genes (DEGs) and the blue node represents downregulated feature genes. The pink node represents extension gene that had direct interactions with at least five feature genes. (B) Node degree distribution of genes in the PPI network. Horizontal axis represents the log-transformed degree and vertical axis represents the number of nodes.

Abbreviations: DEG, differentially expressed gene; PPI, protein–protein interaction.

The CC value of each node was calculated based on the CC algorithm. The top 10 nodes with higher CC values, such as ankyrin 3, node of Ranvier (ankyrin G) (ANK3), EPH receptor A4 (EPHA4), fibroblast growth factor receptor 2 (FGFR2), and midline 1 (MID1), are shown in Table 1.
Table 1

The top 10 nodes with high CC

GeneCCDegreelogFCP-valueAdjusted P-value
ANK311−0.741392.01E-061.65E-05
EPHA411−0.67077.49E-042.70E-03
FGFR211−0.758817.30E-065.02E-05
MID111−0.597861.66E-047.39E-04
MID1IP111−0.456035.77E-042.16E-03
MLPH11−0.60763.68E-041.47E-03
NGFRAP111−0.473941.02E-056.70E-05
PLA2G1B11−0.647661.47E-023.43E-02
PTPN1311−0.650546.96E-031.82E-02
RAB27B11−0.793464.23E-031.19E-02

Abbreviations: CC, closeness centrality; FC, fold change.

After sorting the CC values of gene nodes in the descending order, we selected the top 200 gene nodes (including 27 feature genes [Table 2]) and performed optimal SVM classifier training based on GSE10072 and GSE43458. As shown in Figure 3A, the constructed SVM classifier could accurately identify 55 nonsmokers (55/56, 98.21%) and 80 smokers (80/82, 97.56%). The overall classification accuracy was 97.83% (135/138). Then the expression values of 27 feature were extracted and performed hierarchical clustering analysis for all the training set samples to differentiate the lung adenocarcinoma patients into smokers and nonsmokers. As shown in Figure 4, the expression values of 27 feature genes could classify the samples.
Table 2

The expression difference parameters of 27 feature genes

GeneP-valueAdjusted P-valuelogFC
ANK3−0.741392.01E-061.65E-05
EPHA4−0.67077.49E-042.70E-03
FGFR2−0.758817.30E-065.02E-05
MID1−0.597861.66E-047.39E-04
MID1IP1−0.456035.77E-042.16E-03
MLPH−0.60763.68E-041.47E-03
NGFRAP1−0.473941.02E-056.70E-05
PLA2G1B−0.647661.47E-023.43E-02
PTPN13−0.650546.96E-031.82E-02
RAB27B−0.793464.23E-031.19E-02
SCNN1B−0.5549.94E-032.45E-02
SELENBP1−0.947688.51E-054.15E-04
AURKB0.4602287.62E-065.20E-05
BGN0.6104296.64E-042.43E-03
CENPA0.500711.22E-045.69E-04
TRIP130.5552814.02E-031.14E-02
PRKCA−0.746741.18E-057.57E-05
EGFR−0.426271.37E-023.21E-02
CFTR−1.522434.31E-063.19E-05
SDC2−0.414289.01E-032.26E-02
PAM−0.543217.43E-065.09E-05
SDC4−0.396449.56E-043.32E-03
MUC1−0.597941.22E-022.92E-02
SHC3−0.555974.11E-031.16E-02
ATP1A1−0.48175.83E-064.16E-05
CD82−0.504152.26E-051.32E-04
PTPRJ−0.428951.11E-033.78E-03

Abbreviation: FC, fold change.

Figure 3

Sample classification in (A) training set and (B) validation set by SVM classifier.

Abbreviation: SVM, support vector machine.

Figure 4

The tree diagram of hierarchical clustering analysis of 27 feature genes for smoker and nonsmoker samples.

Note: The purple and orange vertical bars above the heat map respectively represent lung adenocarcinoma samples of smokers and nonsmokers.

We used the GSE50081 dataset to verify the repeatability and portability of the classifier constructed by the selected 27 feature genes. As displayed in Figure 3B, the SVM classifier could identify 89 smokers (89/93, 95.7%) and 19 nonsmokers (19/23, 82.61%), and the classification accuracy was 93.10% (108/116). The classification scatter plots of the training and validation sets are shown in Figure 5A and B.
Figure 5

Classification spot diagrams of (A) training set and (B) validation dataset, and receiver operating characteristic curves of (C) training set and (D) validation dataset. Green and orange nodes respectively represent lung adenocarcinoma samples of smokers and nonsmokers.

Abbreviation: AUC, area under the curve.

In addition, we evaluated the classification efficiency of the constructed optimal SVM classifier using the indexes of correct rate, sensitivity, specificity, positive predictive and negative predictive values, and areas under the ROC curve. The evaluation result is shown in Table 3 and Figure 5C and D. Furthermore, based on the clinical information of samples in the GSE50081 dataset, which was divided into two groups (nonsmokers and smokers) by SVM classifier, we performed KM survival analysis. There were significant differences in survival time between the samples of smokers and nonsmokers identified from SVM classifier (P=0.0198). The survival rates of smokers decreased very fast in a comparatively short time, while the survival rates of nonsmokers decreased more slowly than that of smokers (Figure 6).
Table 3

The evaluation indexes of SVM classifier in (A) training set and (B) validation set

DatasetsNumber of samplesCorrect rateSensitivitySpecificityPPVNPVAUROC
GSE43458 + GSE10072 (A)1380.9780.9750.9820.9880.9650.975
GSE50081 (B)1160.9310.9570.8260.9570.8260.945

Abbreviations: AUROC, areas under receiver operating characteristic curves; NPV, negative predictive value; PPV, positive predictive value; SVM, support vector machine.

Figure 6

Kaplan–Meier survival curve of samples in GSE50081 dataset.

Notes: Black imaginary line represents lung adenocarcinoma samples of nonsmokers and red dotted line represents lung adenocarcinoma samples of smokers. “+” represents sample in this time.

The 27 feature genes were significantly enriched in five pathways, including pancreatic secretion (PLA2G1B, CFTR, ATP1A1, etc.), aldosterone-regulated sodium reabsorption (ATP1A1, SCNN1B, and PRKCA), proteoglycans in cancer (SDC4, ANK3, EGFR, SDC2, and PRKCA) (http://www.kegg.jp/kegg-bin/show_pathway?hsa05205+8066513+7933772+8132860+8147461+8009301), Ras signaling pathway (FGFR2, PLA2G1B, EGFR, etc.) (http://www.kegg.jp/kegg-bin/show_pathway?hsa04014+7936734+7967034+8132860+8162216+8009301), and ErbB signaling pathway (EGFR, SHC3, and PRKCA) (Table 4).
Table 4

Pathways enriched by 27 feature genes

PathwayIDInput numberP-valueInput
Pancreatic secretionhsa0497250.000529PLA2G1B, CFTR, ATP1A1, RAB27B, PRKCA
Aldosterone-regulated sodium reabsorptionhsa0496030.002589ATP1A1, SCNN1B, PRKCA
Proteoglycans in cancerhsa0520550.012092SDC4, ANK3, EGFR, SDC2, PRKCA
Ras signaling pathwayhsa0401450.018508FGFR2, PLA2G1B, EGFR, SHC3, PRKCA
Ras signaling pathwayhsa0401230.021123EGFR, SHC3, PRKCA

Discussion

Cigarette smoke consists of a complex mixture of chemicals that can cause direct or indirect damage to the respiratory epithelium and its genome.23 Accumulation of genomic alterations has been observed in lung cancers arising in smokers compared to never smokers.24 In this study, we identified 296 DEGs between smokers and nonsmokers. Based on these DEGs and genes in the HPRD database, the PPI network was constructed. The top ten nodes with higher degrees and CC values included ANK3, EPHA4, FGFR2, etc. The SVM classifier constructed with 27 feature genes could accurately identify smokers and nonsmokers. Additionally, KM survival analysis of nonsmokers and smokers indicated that smoking had significant reduction effect on the survival rates of lung adenocarcinoma patients. Therefore, the 27 feature genes may provide reliable data basis and research direction for the diagnosis and treatment of smoking-related lung adenocarcinoma. Pathway enrichment analysis for the 27 feature genes revealed that they were significantly enriched in proteoglycans found in cancer (EGFR, SDC4, ANK3, SDC2, and PRKCA) and Ras signaling pathway (FGFR2, PLA2G1B, EGFR, etc.). EPHA4 was one of the significant feature genes for SVM classifier construction. EPHA4 is a member of the Eph receptor kinase family, which constitutes one of the largest groups of transmembrane receptor tyrosine kinases.25 The receptor tyrosine kinases play important roles in the regulation of cellular proliferation and differentiation.26 Importantly, studies have found that several members of Eph receptor kinase family have been found to be overexpressed in various cancers.27,28 For instance, EPHA4 expression is frequently functionally altered in breast cancer, gastric cancer, pancreatic adenocarcinoma, and lung adenocarcinoma.29–32 Given the role of EPHA4 in SVM classifier, we speculated that EPHA4 may serve as a smoking-related biomarker in lung adenocarcinoma. In this study, EPHA4 was downregulated in smokers and upregulated in nonsmokers. Therefore, the changes of EPHA4 expression (from upregulation to downregulation) may reflect the development of lung adenocarcinoma from nonsmokers to smokers. The fibroblast growth factor 2 and its transmembrane tyrosine kinase receptors (FGFRs) make up a complex family of signaling molecules, the dysregulation of which has been implicated in the progression and development of cancer.33 Overexpression of FGFR2 has been detected in NSCLC cell lines.34 In this study, FGFR2 was a key feature gene in the PPI network with higher degree and CC value. Behrens et al35 found that in lung adenocarcinoma specimens, the expression of FGFR2 is significantly higher in never smokers than in smokers. In accordance with the previously mentioned findings, our study also found that FGFR2 was upregulated in nonsmokers compared with smokers. Therefore, the downregulation of this gene may be a biomarker of smoking-related lung cancer. Interestingly, FGFR2 was enriched in Ras signaling pathway. Ras signaling pathway is one of the best characterized pathways in cancer biology that can be activated by somatic mutation and gene amplification.36 Ras signaling pathway is involved in growth factor receptor activation in tumors.37 Alteration of this pathway has been reported in cancers frequently because of gain-of-function mutations mainly in Ras gene.38 Activating mutations in Ras protein result in constitutive signaling, thereby inhibiting apoptosis and stimulating cell proliferation. Oncogenic mutations in Ras gene have been shown in about 30% of human cancers.39 Presently, Ras signaling pathway has been used as a target in cancer therapy, including lung cancer therapy.40 In addition to FGFR2, EGFR was also enriched in Ras signaling pathway. EGFR is an attractive candidate for a receptor tyrosine kinase mediating autocrine growth in NSCLC.41 Molecular analysis of the responsive lung tumors reveals a significant enrichment for gain-of-function mutations in EGFR.42 About 10%–40% of lung adenocarcinoma displays activating mutations in EGFR.43 Especially, a study by Yanagawa et al44 found that smoking was correlated with the frequencies of EGFR mutations in lung adenocarcinoma. In the PPI network of the present study, EGFR interacted with a large number of genes and occupied the hub position, suggesting its important role in smoking-related lung adenocarcinoma. Taken together, FGFR2 and EGFR may play important roles in the development of smoking-related lung adenocarcinoma through Ras signaling pathway. Furthermore, in this study, EGFR was also found to be involved in the pathway of Proteoglycans in cancer (hsa05205). Proteoglycans control many normal and pathological processes, such as cell proliferation, adhesion, tissue repair, vascularization, inflammation, and cancer metastasis. Due to the diverse functions, proteoglycans are implicated in tumorigenesis in human cancers.45,46 Altered proteoglycans expression in tumors can affect cancer cell signaling, growth, migration, and angiogenesis.47 The role of proteoglycans in cancer pathways in lung cancer has not been widely investigated, but we speculated that this pathway and its enriched feature genes (EGFR, SDC4, ANK3, SDC2, and PRKCA) may play an important role in smoking-related lung adenocarcinoma. In conclusion, changes in the expression levels for the 27 feature genes, such as EPHA4, FGFR2, and EGFR for SVM classifier construction may play key roles in the progression and development of smoking-related lung adenocarcinoma, and may be useful biomarkers and therapeutic targets for the treatment of this cancer. Additionally, cancer-related pathways of Ras signaling and proteoglycans found in cancer may also play important roles in smoking-related lung adenocarcinoma. However, validation experiments are needed in the future to confirm our results.
  42 in total

1.  FDR control by the BH procedure for two-sided correlated tests with implications to gene expression data analysis.

Authors:  Anat Reiner-Benaim
Journal:  Biom J       Date:  2007-02       Impact factor: 2.207

2.  Global evaluation of Eph receptors and ephrins in lung adenocarcinomas identifies EphA4 as an inhibitor of cell migration and invasion.

Authors:  Pierre Saintigny; Shaohua Peng; Li Zhang; Banibrata Sen; Ignacio I Wistuba; Scott M Lippman; Luc Girard; John D Minna; John V Heymach; Faye M Johnson
Journal:  Mol Cancer Ther       Date:  2012-07-17       Impact factor: 6.261

3.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

4.  Inverse correlation between EGFR mutation and FHIT, RASSF1A and RUNX3 methylation in lung adenocarcinoma: relation with smoking status.

Authors:  Naoki Yanagawa; Gen Tamura; Hiroyuki Oizumi; Makoto Endoh; Mitsuaki Sadahiro; Teiichi Motoyama
Journal:  Anticancer Res       Date:  2011-04       Impact factor: 2.480

Review 5.  Ras signaling and therapies.

Authors:  Amy Young; Jesse Lyons; Abigail L Miller; Vernon T Phan; Irma Rangel Alarcón; Frank McCormick
Journal:  Adv Cancer Res       Date:  2009       Impact factor: 6.242

6.  The IASLC Lung Cancer Staging Project: proposals for the revision of the TNM stage groupings in the forthcoming (seventh) edition of the TNM Classification of malignant tumours.

Authors:  Peter Goldstraw; John Crowley; Kari Chansky; Dorothy J Giroux; Patti A Groome; Ramon Rami-Porta; Pieter E Postmus; Valerie Rusch; Leslie Sobin
Journal:  J Thorac Oncol       Date:  2007-08       Impact factor: 15.609

7.  Differences in epidemiology, histology, and survival between cigarette smokers and never-smokers who develop non-small cell lung cancer.

Authors:  Ayesha Bryant; Robert James Cerfolio
Journal:  Chest       Date:  2007-06-15       Impact factor: 9.410

8.  Influence of basic fibroblast growth factor on the proliferation of non-small cell lung cancer cell lines.

Authors:  Hartmut Kuhn; Carmen Köpff; Jana Konrad; Alexander Riedel; Christian Gessner; Hubert Wirtz
Journal:  Lung Cancer       Date:  2004-05       Impact factor: 5.705

9.  Polymorphisms of the CYP1A1 and glutathione S-transferase genes associated with susceptibility to lung cancer in relation to cigarette dose in a Japanese population.

Authors:  K Nakachi; K Imai; S Hayashi; K Kawajiri
Journal:  Cancer Res       Date:  1993-07-01       Impact factor: 12.701

10.  EphA4-deleted microenvironment regulates cancer development and leukemoid reaction of the isografted 4T1 murine breast cancer via reduction of an IGF1 signal.

Authors:  Xuefeng Jing; Takashi Sonoki; Masayasu Miyajima; Takahiro Sawada; Nanako Terada; Shigeki Takemura; Kazushige Sakaguchi
Journal:  Cancer Med       Date:  2016-02-29       Impact factor: 4.452

View more
  6 in total

1.  Identification of genes highly downregulated in pancreatic cancer through a meta-analysis of microarray datasets: implications for discovery of novel tumor-suppressor genes and therapeutic targets.

Authors:  Nalin C W Goonesekere; Wyatt Andersen; Alex Smith; Xiaosheng Wang
Journal:  J Cancer Res Clin Oncol       Date:  2017-12-29       Impact factor: 4.553

Review 2.  Group 1B phospholipase A2 in metabolic and inflammatory disease modulation.

Authors:  David Y Hui
Journal:  Biochim Biophys Acta Mol Cell Biol Lipids       Date:  2018-07-09       Impact factor: 4.698

3.  Comparative Network Analysis of Patients with Non-Small Cell Lung Cancer and Smokers for Representing Potential Therapeutic Targets.

Authors:  Mehrdad Pazhouhandeh; Fatemeh Samiee; Tahereh Boniadi; Abbas Fadaei Khedmat; Ensieh Vahedi; Mahsa Mirdamadi; Naseh Sigari; Seyed Davar Siadat; Farzam Vaziri; Abolfazl Fateh; Faezeh Ajorloo; Elham Tafsiri; Mostafa Ghanei; Fereidoun Mahboudi; Fatemeh Rahimi Jamnani
Journal:  Sci Rep       Date:  2017-10-23       Impact factor: 4.379

4.  Differential protein-coding gene and long noncoding RNA expression in smoking-related lung squamous cell carcinoma.

Authors:  Shicheng Li; Xiao Sun; Shuncheng Miao; Jia Liu; Wenjie Jiao
Journal:  Thorac Cancer       Date:  2017-09-26       Impact factor: 3.500

5.  Comprehensive analysis of gene expression and DNA methylation data identifies potential biomarkers and functional epigenetic modules for lung adenocarcinoma.

Authors:  XiaoCong Wang; YanMei Li; HuiHua Hu; FangZheng Zhou; Jie Chen; DongSheng Zhang
Journal:  Genet Mol Biol       Date:  2020-06-01       Impact factor: 1.771

6.  Identification and validation of smoking-related genes in lung adenocarcinoma using an in vitro carcinogenesis model and bioinformatics analysis.

Authors:  Jin Wang; Tao Chen; Xiaofan Yu; Nan OUYang; Lirong Tan; Beibei Jia; Jian Tong; Jianxiang Li
Journal:  J Transl Med       Date:  2020-08-14       Impact factor: 5.531

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.