| Literature DB >> 32195055 |
Wei Zhu1, Lin Lin Li1, Yiyan Songyang1, Zhan Shi2, Dejia Li1.
Abstract
Although lung cancer is one of the greatest threats to human health, its signaling pathway and related genes are still unknown. This study integrates data from three groups of people to study potential key candidate genes and pathways related to lung cancer. Expression profiles (GSE18842, GSE19188 and GSE27262), including 162 tumor tissue and 135 adjacent normal lung tissue samples, were integrated and analyzed. Differentially expressed genes (DEGs) and candidate genes were identified, their expression pathways were analyzed, and the diethylene glycol-related protein-protein interaction (PPI) network was analyzed. We identified 232 shared DEGs (40 upregulated and 192 down-regulated) from the three GSE datasets. The DEGs were clustered according to function and signaling pathway for significant enrichment analysis. In total, 129 nodes/DEGs were identified from the DEG PPI network complex. An improved prognosis was associated with increased Helicase, Lymphoid-Specific (HELLS) and decreased Intercellular adhesion molecule 1 (ICAM1) mRNA expression in lung cancer patients. In conclusion, we used integrated bioinformatics analysis to identify candidate genes and pathways in lung cancer to show that HELLS and ICAM1 might be the key genes related to tumorigenesis or tumor progression in lung cancer. Additional studies are needed to further explore the involved functional mechanisms.Entities:
Keywords: Bioinformatics analysis; HELLS; ICAM1; Lung cancer; Prognosis
Year: 2020 PMID: 32195055 PMCID: PMC7067188 DOI: 10.7717/peerj.8731
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Functional and pathway enrichment analysis of DEGs.
(A) GO terms and KEGG pathway were presented, and each band represents one enriched term or pathway colored according to the −log 10 p value. (B) Network of the enriched terms and pathways. Nodes represent enriched terms or pathways with node size indicating the number of DEGs involved in. Nodes sharing the same cluster are typically close to each other, and the thicker the edge displayed, the higher the similarity is represented.
Figure 2The distribution of differential genes between GSE18842, GSE19188 and GSE27262.
(A) DEGs were selected with p < 0.05 and [log FC] > 1 among the mRNA expression profiling sets. (B and C) The PPI network of DEGs was constructed using Cytoscape (upregulated genes are marked in light red; downregulated genes are marked in light blue).
Two hundred thirty-two differentially expressed genes (DEGs) were screened from three profile datasets.
| DEGs | Genes symbol |
|---|---|
| Upregulated (40) | G2E3, GDA, EPT1, KPNA2, IGH, ANLN, BIRC5, HSPD1, PTK2, BC017398, MIR3934, SAPCD2, IGHD, AGO2, GTSE1, CBX2, PTBP3, ADAM28, CCNB2, LRP8, NFE2L3, KRT16, IGHG1, NME1, COL12A1, EYA2, LEPREL4, LRRC15, BCL2L11, ATAD2, MTA3, HIST1H2BG, PCDH19, SLC1A4, HELLS, GBP6, FERMT1, KRT5, HOXC6, CDKN2A |
| Downregulated (192) | PFKP, ERG, HIGD1B, VGLL3, IPW, MAOA, CDK1, RSRC1, SPTBN1, SNX25, UNC13B, PPBP, QKI, SPG20, MAD2L1, SORBS2, CCM2L, WIF1, GIMAP6, SOCS3, MAGI1, LMO7, CCL23, PCDP1, OGN, KRT4, SFTA3, CEACAM5, PDE5A, SLC16A4, PDZD2, WISP1, HBB, ITGAX, TM9SF3, LPL, COL11A1, ODF3B, CASP4, ROR1, SYNM, UGT8, FKBP11, SESTD1, SLC4A4, RP699M1.2, CTTN, NEK2, SMAD6, MEF2C, ERBB4, RP115C23.1, MACF1, MAGEA10, MAGEA5, ITIH5, SIGLEC17P, TBX5, PARVA, PPAP2C, AQP4, SLC47A1, SERPINA1, COL4A3, IL1RL1, MCEMP1, CYP4V2, TRPV2, STOM, KIAA1244, EDNRB, ST3GAL6, SOX17, TNS1, CAMK2N1, POLQ, CACNA1D, RGS5, PTGDS, GAGE12B, EIF5, SERPINB1, PTPN21, CST6, STRBP, NAV1, SHROOM3, CNTNAP2, ZNF280B, LMO3, MBIP, IL33, ARHGAP6, RNF125, CYP2B6, ANK3, DAPL1, KCTD1, TACC2, MITF, LILRB2, HOXA1, CSF3R, LOC643733, HNMT, GNAS, SLC27A3, SERPINB9, C3, SERPING1, AFAP1L1, SULT1A2, ZFPM2, SEC63, ADRB1, SVEP1, FYN, COL5A2, LOC101928198, PLAC9, MSR1, LST1, DOCK4, FRMD4A, KLF9, PDK4, EMCN, TSPAN12, CA4, SRPX2, SIRPA, APOH, CLEC7A, NCKAP1L, LHFP, GLS, CFLAR, ACACB, RUFY3, SOBP, PMP22, P2RX7, LEPREL1, LPCAT1, SOX2, IQGAP2, OTUD1, FRMD3, DOCK2, BTNL9, UCHL1, CLEC2B, TBX2, TMEM237, PPARG, HLADQB1, LMOD1, SUGT1, LRRN4, RGCC, ADRB2, CMAHP, SEMA6A, HMGA2, CCDC68, SREK1IP1, MYLIP, DOCK9, MARCKS, RORA, SORBS1, GUCY1A2, STXBP6, STX12, CASC5, CALML3, CKAP2, ICAM1, FGF18, ZEB2, DSG3, LGI3, TTN, AKAP13, SLC34A2, STAC, TAPT1, SEMA5A, SYNPO, CAST, SLCO4C1, HBA1 |
Pathway and process enrichment analysis.
| GO | Category | Description | Count | % | log 10 ( |
|---|---|---|---|---|---|
| Upregulated | |||||
| M236 | Canonical Pathways | PID DELTA NP63 PATHWAY | 4 | 10.53 | −6.05 |
| M176 | Canonical Pathways | PID FOXM1 PATHWAY | 3 | 7.89 | −4.47 |
| R-HSA-5693606 | Reactome Gene sets | DNA Double Strand Break Response | 3 | 7.89 | −3.62 |
| M66 | Canonical Pathways | PID MYC ACTIV PATHWAY | 4 | 7.89 | −3.58 |
| GO Biological Processes | Ell-matrix adhesion | 5 | 10.53 | −3.37 | |
| GO Biological Processes | Negative regulation of cellular protein localization | 3 | 7.89 | −3.16 | |
| GO Biological Processes | Apoptotic mitochondrial changes | 3 | 7.89 | −3.01 | |
| R-HSA-9006925 | Reactome Gene sets | Intracellular signaling by second messenger | 3 | 10.53 | −2.89 |
| GO Biological Processes | Defense response to bacterium | 4 | 10.53 | −2.78 | |
| GO Biological Processes | Cell division | 5 | 13.16 | −2.65 | |
| GO Biological Processes | Protein import | 3 | 7.89 | −2.48 | |
| GO Biological Processes | Homeostasis of number of cells | 3 | 7.89 | −2.15 | |
| Downregulated | |||||
| GO Biological Processes | Developmental growth | 23 | 12.17 | −8.59 | |
| GO Biological Processes | Actin cytoskeleton organization | 22 | 11.64 | −8.04 | |
| GO Biological Processes | Blood vessel development | 22 | 11.64 | −6.79 | |
| GO Biological Processes | Myeloid leukocyte activation | 19 | 10.05 | −6.01 | |
| GO Biological Processes | Regulation of endopeptidase activity | 15 | 7.94 | −5.93 | |
| GO Biological Processes | Mesenchyme development | 12 | 6.35 | −5.70 | |
| R-HSA-109582 | Reactome Gene sets | Hemostasis | 18 | 9.52 | −5.67 |
| GO Biological Processes | Cellular component assembly involved in morphogenesis | 8 | 4.23 | −5.60 | |
| GO Biological Processes | Muscle system process | 15 | 7.94 | −5.36 | |
| GO Biological Processes | Cell surface receptor signaling pathway involved in cell–cell signaling | 17 | 8.99 | −5.08 | |
| GO Biological Processes | Negative regulation of cell proliferation | 19 | 10.05 | −5.00 | |
| hsa04924 | KEGG Pathway | Renin secretion | 6 | 3.17 | −4.92 |
| GO Biological Processes | Developmental cell growth | 10 | 5.29 | −4.86 | |
| GO Biological Processes | Positive regulation of lipid storage | 4 | 2.12 | −4.62 | |
| GO Biological Processes | Circulatory system process | 15 | 7.94 | −4.55 | |
| Reactome Gene Sets | Posithive regulation of hydrolase activity | 18 | 9.52 | −4.49 | |
| GO Biological Processes | Regulation of MAPK cascade | 18 | 9.52 | −4.44 | |
| R-HSA-1247673 | GO Biological Processes | Erythrocytes take up oxygen release | 3 | 1.59 | −4.42 |
| GO Biological Processes | Protein localization to axon | 3 | 1.59 | −4.42 | |
| GO Biological Processes | Fat cell differentiation | 9 | 4.76 | −4.21 |
Figure 3Functional and pathway enrichment analysis of the PPI module.
(A) GO terms and KEGG pathway were presented, and each band represents one enriched term or pathway colored according to the −log 10 p value. (B) Network of the enriched terms and pathways. Nodes represent enriched terms or pathways with node size indicating the number of DEGs involved in.
Functional roles of 10 hub genes.
| Genes symbol | Full name | Function |
|---|---|---|
| MAD2L1 | Mitotic arrest deficient 2 like 1 | Preventing the onset of anaphase |
| POLQ | DNA polymerase θ | Alternative nonhomologous end joining |
| HELLS | Helicase, lymphoid specific | DNA strand separation |
| ANLN | Aniline actin binding protein | Cell growth and migration |
| BIRC5 | Bucovina IAP repeat containing 5 | Preventing apoptotic cell death |
| ATAD2 | ATPase family AAA domain containing 2 | Chaperone-like functions |
| CCNB2 | Cyclin B2 | The cell cycle regulatory machinery |
| PTK2 | Protein tyrosine kinase 2 | Cytoplasmic protein tyrosine kinase |
| ICAM1 | Intercellular adhesion molecule 1 | Endothelial cells and cells of the immune system |
| ITGAX | Integrin subunit alpha X | Encoding the integrin alpha X chain protein |
Figure 4Hub gene selection and analysis.
(A) Hub genes and their co-expression genes were analyzed using cBioPortal. Nodes with bold black outline represent hub genes. Nodes with thin black outline represent the co-expression genes. (B) Hierarchical clustering of hub genes was constructed using UCSC. (C) The biological process analysis of hub genes was constructed using BiNGO.
Figure 5Overall survival and disease-free survival analyses of different HELLS expression lung cancer patients.
(A–F) Overall survival and disease-free survival analyses of the HELLS were performed in TCGA online website. (G and H) The mRNA level of HELLS was evaluated in lung cancer using GEPIA analysis, p < 0.05 was considered to indicate a statistically significant difference.
Figure 6Overall survival and disease-free survival analyses of different ICAM1 expression lung cancer patients.
(A–F) Overall survival and disease-free survival analyses of the ICAM1 were performed in TCGA online website. (G and H) The mRNA level of ICAM1 was evaluated in lung cancer using GEPIA analysis p < 0.05 was considered to indicate a statistically significant difference.
Figure 7The mRNA level of HELLS and ICAM1, the PPI network of HELLS and ICAM1 were constructed.
(A and B) The mRNA level of HELLS and ICAM1 were evaluated in lung cancer among four studies using ONCOMINE analysis. (C and D) PPI network of HELLS and ICAM1 were constructed by STRING database.
Figure 8The expression levels of ICAM1 and HELLS in the lung cancer samples and the lung cancer cells.
(A and B) The expression levels of ICAM1 and HELLS in the lung cancer samples. (C and D) The expression levels of ICAM1 and HELLS in the lung cancer cells.