| Literature DB >> 30664219 |
Tingting Long1, Zijing Liu2, Xing Zhou1, Shuang Yu1, Hui Tian1, Yixi Bao1.
Abstract
Lung cancer is the leading cause of cancer‑associated mortality worldwide. The aim of the present study was to identify the differentially expressed genes (DEGs) and enriched pathways in lung cancer by bioinformatics analysis, and to provide potential targets for diagnosis and treatment. Valid microarray data of 31 pairs of lung cancer tissues and matched normal samples (GSE19804) were obtained from the Gene Expression Omnibus database. Significance analysis of the gene expression profile was used to identify DEGs between cancer tissues and normal tissues, and a total of 1,970 DEGs, which were significantly enriched in biological processes, were screened. Through the Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis, 77 KEGG pathways associated with lung cancer were identified, among which the Toll‑like receptor pathway was observed to be important. Protein‑protein interaction network analysis extracted 1,770 nodes and 10,667 edges, and identified 10 genes with key roles in lung cancer with highest degrees, hub centrality and betweenness. Additionally, the module analysis of protein‑protein interactions revealed that 'chemokine signaling pathway', 'cell cycle' and 'pathways in cancer' had a close association with lung cancer. In conclusion, the identified DEGs, particularly the hub genes, strengthen the understanding of the development and progression of lung cancer, and certain genes (including advanced glycosylation end‑product specific receptor and epidermal growth factor receptor) may be used as candidate target molecules to diagnose, monitor and treat lung cancer.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30664219 PMCID: PMC6390056 DOI: 10.3892/mmr.2019.9878
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Figure 1.Identification of DEGs. (A) Pearson's correlation (signal) map. The correlation coefficient was close to 1.0, indicating higher repeatability or similar to distribution. (B) Relative Signal Box Plot map. The red line is the base line; more similar distribution implies higher repeatability of the data. (C) Volcano plot comparing all of the DEGs. The red dots indicate DEGs that were significant at |Log2 (FC)|>1. DEGs, differentially expressed genes; T, lung cancer tissues; N, normal lung tissues.
Statistical metrics for key differentially expressed genes.
| Probe Set ID | Gene | P-value | Fold change (tumor vs. normal) |
|---|---|---|---|
| 210081_at | AGER | 1.08×10−19 | −42.35560 |
| 232578_at | CLDN18 | 9.04×10−14 | −41.43548 |
| 204712_at | WIF1 | 1.82×10−12 | −34.45463 |
| 209875_s_at | SPP1 | 2.01×10−14 | 34.41545 |
| 203980_at | FABP4 | 4.45×10−17 | −27.01589 |
| 219230_at | TMEM100 | 4.84×10−14 | −24.15423 |
| 37892_at | COL11A1 | 1.10×10−10 | 22.98382 |
| 209469_at | GPM6A | 5.90×10−25 | −22.56606 |
| 205725_at | SCGB1A1 | 1.21×10−10 | −22.50023 |
| 213317_at | CLIC5 | 1.04×10−16 | −22.22463 |
AGER, advanced glycosylation end-product specific receptor; CLDN18, claudin 18; WIF1, WNT inhibitory factor 1; SPP1, secreted phosphoprotein 1; FABP4, fatty acid binding protein 4; TMEM100, transmembrane protein 100; COL11A1, collagen type XI α1 chain; GPM6A, glycoprotein M6A; SCGB1A1, secretoglobin family 1A member 1; CLIC5, chloride intracellular channel 5.
Figure 2.Hierarchical clustering analysis of the 1,970 differentially expressed genes. Red and green indicate higher and lower gene expression, respectively. T, lung cancer tissues; N, normal lung tissues.
Gene Ontology analysis of differentially expressed genes.
| Gene set name | Gene counts | % | P-value | |
|---|---|---|---|---|
| GOTERM | GO:0016477-cell migration | 257 | 14.1 | 3.64×10−40 |
| -BP-FAT | GO:0072358-cardiovascular system development | 222 | 12.2 | 5.09×10−40 |
| GO:0072359-circulatory system development | 222 | 12.2 | 5.09×10−40 | |
| GO:0048870-cell motility | 277 | 15.2 | 5.51×10−40 | |
| GO:0051674-localization of cell | 277 | 15.2 | 5.51×10−40 | |
| GOTERM | GO:0005102-receptor binding | 222 | 12.2 | 3.09×10−14 |
| -MF-FAT | GO:0005539-glycosaminoglycan binding | 56 | 3.1 | 4.52×10−13 |
| GO:0019838-growth factor binding | 39 | 2.1 | 4.62×10−11 | |
| GO:0098772-molecular function regulator | 199 | 10.9 | 3.11×10−10 | |
| GO:0008201-heparin binding | 42 | 2.3 | 1.20×10−9 | |
| GOTERM | GO:0005576-extracellular region | 635 | 34.9 | 4.41×10−23 |
| -CC-FAT | GO:0044421-extracellular region part | 551 | 30.3 | 1.42×10−22 |
| GO:0005615-extracellular space | 253 | 13.9 | 2.95×10−20 | |
| GO:0005578-proteinaceous extracellular matrix | 97 | 5.3 | 8.71×10−20 | |
| GO:0031012-extracellular matrix | 123 | 6.8 | 3.43×10−19 |
BP, biological processes; FAT, functional annotation tool; MF, molecular function; CC, cellular component.
Gene Ontology functional enrichment analyses of differentially expressed genes associated with lung cancer.
| A, Up-regulated | ||||
|---|---|---|---|---|
| Category | Term/gene function | Gene count | % | P-value |
| GOTERM_BP_FAT | GO:1903047-mitotic cell cycle process | 62 | 12.5 | 2.47×10−12 |
| GOTERM_BP_FAT | GO:0000278-mitotic cell cycle | 64 | 12.9 | 9.70×10−12 |
| GOTERM_BP_FAT | GO:0051301-cell division | 45 | 9.1 | 1.16×10−10 |
| GOTERM_BP_FAT | GO:0007059-chromosome segregation | 33 | 6.7 | 1.83×10−10 |
| GOTERM_BP_FAT | GO:0044243-multicellular organism catabolic process | 16 | 3.2 | 3.58×10−10 |
| GOTERM_MF_FAT | GO:0005201-extracellular matrix structural constituent | 12 | 2.4 | 7.34×10−6 |
| GOTERM_MF_FAT | GO:0042802-identical protein binding | 60 | 12.1 | 3.73×10−5 |
| GOTERM_MF_FAT | GO:0008574-ATP-dependent microtubule motor activity, plus-end-directed | 6 | 1.2 | 5.13×10−5 |
| GOTERM_MF_FAT | GO:1990939-ATP-dependent microtubule motor activity | 6 | 1.2 | 6.96×10−5 |
| GOTERM_MF_FAT | GO:0016758-transferase activity, transferring hexosyl groups | 16 | 3.2 | 2.89×10−4 |
| GOTERM_CC_FAT | GO:0005578-proteinaceous extracellular matrix | 31 | 6.3 | 1.75×10−7 |
| GOTERM_CC_FAT | GO:0005576-extracellular region | 176 | 35.5 | 5.35×10−6 |
| GOTERM_CC_FAT | GO:0098643-banded collagen fibril | 6 | 1.2 | 1.26×10−5 |
| GOTERM_CC_FAT | GO:0005583-fibrillar collagen trimer | 6 | 1.2 | 1.26×10−5 |
| GOTERM_CC_FAT | GO:0000779-condensed chromosome, centromeric region | 14 | 1.9 | 1.87×10−5 |
| GOTERM_BP_FAT | GO:0072359-circulatory system development | 190 | 14.3 | 3.52×10−44 |
| GOTERM_BP_FAT | GO:0072358-cardiovascular system development | 190 | 14.3 | 3.52×10−44 |
| GOTERM_BP_FAT | GO:0001944-vasculature development | 146 | 11.0 | 9.36×10−43 |
| GOTERM_BP_FAT | GO:0001568-blood vessel development | 137 | 10.3 | 9.96×10−40 |
| GOTERM_BP_FAT | GO:0040011-locomotion | 242 | 18.3 | 3.86×10−38 |
| GOTERM_MF_FAT | GO:0005102-receptor binding | 179 | 13.5 | 7.17×10−16 |
| GOTERM_MF_FAT | GO:0005539-glycosaminoglycan binding | 45 | 3.4 | 6.57×10−12 |
| GOTERM_MF_FAT | GO:0098772-molecular function regulator | 155 | 11.7 | 2.41×10−10 |
| GOTERM_MF_FAT | GO:0008201-heparin binding | 36 | 2.7 | 3.61×10−10 |
| GOTERM_MF_FAT | GO:0003779-actin binding | 62 | 4.7 | 9.32×10−10 |
| GOTERM_CC_FAT | GO:0044421-extracellular region part | 401 | 30.3 | 6.04×10−18 |
| GOTERM_CC_FAT | GO:0005576-extracellular region | 459 | 34.6 | 8.97×10−18 |
| GOTERM_CC_FAT | GO:0005615-extracellular space | 187 | 14.1 | 4.24×10−16 |
| GOTERM_CC_FAT | GO:0009986-cell surface | 118 | 8.9 | 1.97×10−15 |
| GOTERM_CC_FAT | GO:0031012-extracellular matrix | 89 | 6.7 | 7.14×10−14 |
BP, biological processes; FAT, functional annotation tool; MF, molecular function; CC, cellular component.
Figure 3.Top 10 most significantly enriched pathways of differentially expressed genes associated with lung cancer as analyzed by Kyoto Encyclopedia of Genes and Genomes pathway analysis. ECM, extracellular matrix; PI3K, phosphoinositide-3-kinase; Akt, protein kinase B; Rap1, Ras-proximate-1.
Top 10 most overrepresented Kyoto Encyclopedia of Genes and Genomes pathways of differentially expressed genes.
| Gene set name | Count | % | P-value |
|---|---|---|---|
| hsa04512: ECM-receptor interaction | 28 | 1.5 | 1.52×10−7 |
| hsa05144: Malaria | 20 | 1.1 | 2.20×10−7 |
| hsa04610: Complement and coagulation cascades | 24 | 1.3 | 3.03×10−7 |
| hsa04510: Focal adhesion | 47 | 2.6 | 5.97×10−7 |
| hsa04974: Protein digestion and absorption | 25 | 1.4 | 9.42×10−6 |
| hsa04514: Cell adhesion molecules (CAMs) | 33 | 1.8 | 2.59×10−5 |
| hsa04151: PI3K-Akt signaling pathway | 61 | 3.3 | 6.57×10−5 |
| hsa04015: Rap1 signaling pathway | 40 | 2.2 | 3.29×10−4 |
| hsa04530: Tight junction | 29 | 1.6 | 4.68×10−4 |
| hsa04115: p53 signaling pathway | 18 | 1.0 | 4.77×10−4 |
ECM, extracellular matrix; PI3K, phosphoinositide-3-kinase; Akt, protein kinase B.
Figure 4.Protein-protein interaction network of differentially expressed genes identified by Search Tool for the Retrieval of Interacting Genes.
Top 10 hub nodes with highest degrees of interaction in lung cancer.
| Name | Node degree | Betweenness centrality | Closeness centrality | Stress centrality | Clustering coefficient |
|---|---|---|---|---|---|
| EGFR | 198 | 0.088 | 0.464 | 2,403,514 | 0.085 |
| JUN | 185 | 0.055 | 0.457 | 1,936,238 | 0.116 |
| FOS | 164 | 0.038 | 0.447 | 1,365,032 | 0.120 |
| IL6 | 139 | 0.031 | 0.427 | 1,039,056 | 0.138 |
| MYC | 136 | 0.030 | 0.441 | 1,089,702 | 0.147 |
| MMP9 | 135 | 0.028 | 0.433 | 1,011,434 | 0.148 |
| CDK1 | 121 | 0.032 | 0.427 | 936,828 | 0.105 |
| CDH1 | 119 | 0.035 | 0.426 | 1,062,348 | 0.111 |
| FYN | 118 | 0.035 | 0.410 | 1,089,798 | 0.127 |
| FGF2 | 114 | 0.013 | 0.420 | 577,568 | 0.184 |
Figure 5.Top 3 modules from the protein-protein interaction network. Nodes and links represent human proteins and protein interactions. (A) The enriched pathways of module 1; (B) The enriched pathways of module 2; (C) The enriched pathways of module 3.
Top 3 modules from the protein-protein interaction network.
| Modules | Term | P-value | False discovery rate | Genes |
|---|---|---|---|---|
| 1 | Chemokine signaling pathway | 6.11×10−12 | 6.23×10−9 | ADCY4, CXCL5, PPBP, CCL21, CXCL3, CXCL2, CXCR2, GNG2, CCL5, CXCL12, CCL28 |
| Neuroactive ligand-receptor interaction | 5.62×10−5 | 0.057248 | P2RY13, S1PR1, C5AR1, SSTR1, P2RY14, FPR1, FPR2 | |
| Cytokine-cytokine receptor interaction | 2.51×10−4 | 0.255655 | PPBP, CCL21, CXCR2, CCL5, CXCL12, CCL28 | |
| 2 | Cell cycle | 2.53×10−9 | 1.80×10−6 | CCNB1, CDK1, CCNB2, MAD2L1, BUB1B, CDC20, CCNA2 |
| Progesterone-mediated oocyte maturation | 2.90×10−6 | 0.002056 | CCNB1, CDK1, CCNB2, MAD2L1, CCNA2 | |
| Oocyte meiosis | 2.99×10−4 | 0.212472 | CDK1, MAD2L1, CDC20, AURKA | |
| 3 | Pathways in cancer | 7.30×10−7 | 8.72×10−4 | AGTR1, EDNRB, VEGFC, IL6, CDKN2A, PLCB4, GNAQ, PPARG, CDH1, FGF2, PIK3R1, TGFB2 |
| Chagas disease (American trypanosomiasis) | 9.65×10−6 | 0.011527 | ACE, IL6, PLCB4, GNAQ, PIK3R1, TGFB2 | |
| African trypanosomiasis | 2.00×10−5 | 0.023928 | ICAM1, IL6, PLCB4, GNAQ, F2RL1 |
Figure 6.Gene network associated with Toll-like receptor pathways. Red and green dots represent up- and downregulated differentially expressed genes, respectively. Grey diamonds marked with yellow stars represent the associated genes, which were added by the system. The dot size represents the difference of one gene between lung cancer tissues and normal tissues.
Main genes associated with Toll-like receptors pathways.
| Gene ID | Gene | P-value | Fold change |
|---|---|---|---|
| 6696 | SPP1 | 2.01×10−14 | 34.41545 |
| 3569 | IL6 | 1.05×10−12 | −8.05396 |
| 1326 | MAP3K8 | 5.29×10−15 | −4.17527 |
| 51311 | TLR8 | 5.01×10−10 | −3.84351 |
| 2353 | FOS | 5.20×10−12 | −3.81020 |
| 6351 | CCL4 | 5.31×10−9 | −2.77206 |
| 7099 | TLR4 | 3.40×10−7 | −2.50968 |
| 6352 | CCL5 | 1.51×10−6 | −2.42054 |
| 3725 | JUN | 1.05×10−8 | −2.36435 |
| 5295 | PIK3R1 | 8.86×10−10 | −2.26284 |
| 10000 | AKT3 | 9.98×10−11 | −2.22438 |
| 5603 | MAPK13 | 7.48×10−11 | 2.14546 |