| Literature DB >> 33285674 |
Qian Zhao1, Yan Zhang1, Xue Zhang1, Yeqing Sun2, Zhengkui Lin1.
Abstract
To explore the gene modules and key genes of head and neck squamous cell carcinoma (HNSCC), a bioinformatics algorithm based on the gene co-expression network analysis was proposed in this study.Firstly, differentially expressed genes (DEGs) were identified and a gene co-expression network (i-GCN) was constructed with Pearson correlation analysis. Then, the gene modules were identified with 5 different community detection algorithms, and the correlation analysis between gene modules and clinical indicators was performed. Gene Ontology (GO) analysis was used to annotate the biological pathways of the gene modules. Then, the key genes were identified with 2 methods, gene significance (GS) and PageRank algorithm. Moreover, we used the Disgenet database to search the related diseases of the key genes. Lastly, the online software onclnc was used to perform the survival analysis on the key genes and draw survival curves.There were 2600 up-regulated and 1547 down-regulated genes identified in HNSCC. An i-GCN was constructed with Pearson correlation analysis. Then, the i-GCN was divided into 9 gene modules. The result of association analysis showed that, sex was mainly related to mitosis and meiosis processes, event was mainly related to responding to interferons, viruses and T cell differentiation processes, T stage was mainly related to muscle development and contraction, regulation of protein transport activity processes, N stage was mainly related to mitosis and meiosis processes, while M stage was mainly related to responding to interferons and immune response processes. Lastly, 34 key genes were identified, such as CDKN2A, HOXA1, CDC7, PPL, EVPL, PXN, PDGFRB, CALD1, and NUSAP1. Among them, HOXA1, PXN, and NUSAP1 were negatively correlated with the survival prognosis.HOXA1, PXN, and NUSAP1 might play important roles in the progression of HNSCC and severed as potential biomarkers for future diagnosis.Entities:
Mesh:
Year: 2020 PMID: 33285674 PMCID: PMC7717835 DOI: 10.1097/MD.0000000000022655
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1Flow-chart of data analysis in this paper. The rectangular boxes represent the processing steps, and the parallelogram boxes represent the method or database.
Figure 2Data preprocessing and identification of DEG. (A) Sample clustering was conducted to detect outliers, whlie TCGA-D6-A6ES and TCGA-IQ-7631 were removed. (B) X-axis represents log2 fold-changes and Y-axis represents negative logarithm to the base 10 of the P-values. Black vertical and horizontal dashed lines reflect filtering criteria (FC= ± 1 and P value = .05). (C) Red and blue bars are number of significantly down-regulated (n = 1547) and up-regulated genes (n = 2600) in HNSCC compared with its adjacent tissues.
Figure 3Construction of GCN and mining of gene modules. (A) The i-GCN was constructed by Pearson correlation analysis. (B) Division result obtained by the multilevel algorithm. The multilevel algorithm divided i-GCN into 13 communities. (C) The heat map of the correlation between modules and clinical indicators. The row corresponds to module, the column corresponds to clinical indicator. The modules m2, m3, m7, m8, and m9 were highly correlated with clinical indicators.
The modularities of five algorithms.
| Algorithm | Modularity |
| multilevel | 0.5256592 |
| eigenvector | 0.5137625 |
| label-propagation | 0.5155226 |
| map-equation | 0.01748193 |
| edge-betweenness | 0.5223523 |
Network density of nine communities containing more than 20 genes.
| Module | Densities |
| m1 | 0.220859773 |
| m2 | 0.221835782 |
| m3 | 1.219824805 |
| m4 | 0.141935484 |
| m5 | 0.367359229 |
| m6 | 0.158419958 |
| m7 | 0.106810852 |
| m8 | 0.10614192 |
| m9 | 0.063424239 |
GO analysis of DEGs in highly correlative module.
| ID | Description | Count | |
| Sex-related biological processes | |||
| GO:0007059 | chromosome segregation | 1.23E-71 | 86 |
| GO:0000819 | sister chromatid segregation | 9.62E-70 | 73 |
| GO:0006260 | DNA replication | 4.52E-58 | 70 |
| GO:0000280 | nuclear division | 1.05E-57 | 80 |
| GO:0140014 | mitotic nuclear division | 3.93E-56 | 67 |
| GO:0034340 | response to type I interferon | 1.07E-42 | 33 |
| GO:0051607 | defense response to virus | 3.92E-38 | 36 |
| GO:0000082 | G1/S transition of mitotic cell cycle | 7.00E-35 | 50 |
| GO:0009615 | response to virus | 2.41E-34 | 37 |
| GO:0071103 | DNA conformation change | 4.06E-33 | 50 |
| Event-related biological processes | |||
| GO:0034340 | response to type I interferon | 1.07E-42 | 33 |
| GO:0051607 | defense response to virus | 3.92E-38 | 36 |
| GO:0009615 | response to virus | 2.41E-34 | 37 |
| GO:0045071 | negative regulation of viral genome replication | 2.49E-23 | 17 |
| GO:0070268 | cornification | 5.43E-22 | 21 |
| GO:0019079 | viral genome replication | 5.97E-18 | 17 |
| GO:0043900 | regulation of multi-organism process | 6.39E-17 | 26 |
| GO:0035455 | response to interferon-alpha | 3.10E-13 | 8 |
| GO:0018149 | peptide cross-linking | 1.45E-12 | 11 |
| GO:0032480 | negative regulation of type I interferon production | 2.87E-09 | 8 |
| T-related biological processes | |||
| GO:0006936 | muscle contraction | 3.56E-56 | 68 |
| GO:0055001 | muscle cell development | 2.25E-40 | 44 |
| GO:0030239 | myofibril assembly | 2.50E-40 | 32 |
| GO:0030049 | muscle filament sliding | 6.02E-35 | 24 |
| GO:0033275 | actin-myosin filament sliding | 6.02E-35 | 24 |
| GO:0010927 | cellular component assembly involved in morphogenesis | 2.24E-34 | 33 |
| GO:0007517 | muscle organ development | 4.19E-34 | 53 |
| GO:0031032 | actomyosin structure organization | 5.69E-28 | 35 |
| GO:0014706 | striated muscle tissue development | 7.28E-25 | 43 |
| GO:0050879 | multicellular organismal movement | 8.75E-21 | 18 |
| N-related biological processes | |||
| GO:0007059 | chromosome segregation | 1.23E-71 | 86 |
| GO:0000819 | sister chromatid segregation | 9.62E-70 | 73 |
| GO:0006260 | DNA replication | 4.52E-58 | 70 |
| GO:0000280 | nuclear division | 1.05E-57 | 80 |
| GO:0140014 | mitotic nuclear division | 3.93E-56 | 67 |
| GO:0000082 | G1/S transition of mitotic cell cycle | 7.00E-35 | 50 |
| GO:0071103 | DNA conformation change | 4.06E-33 | 50 |
| GO:1901990 | regulation of mitotic cell cycle phase transition | 1.99E-31 | 57 |
| GO:0051983 | regulation of chromosome segregation | 2.04E-30 | 31 |
| GO:0007088 | regulation of mitotic nuclear division | 1.56E-27 | 35 |
| M-related biological processes | |||
| GO:0034340 | response to type I interferon | 1.07E-42 | 33 |
| GO:0051607 | defense response to virus | 3.92E-38 | 36 |
| GO:0009615 | response to virus | 2.41E-34 | 37 |
| GO:0045071 | negative regulation of viral genome replication | 2.49E-23 | 17 |
| GO:0019079 | viral genome replication | 5.97E-18 | 17 |
| GO:0043900 | regulation of multi-organism process | 6.39E-17 | 26 |
| GO:0035455 | response to interferon-alpha | 3.10E-13 | 8 |
| GO:0032480 | negative regulation of type I interferon production | 2.87E-09 | 8 |
| GO:0032606 | type I interferon production | 3.53E-09 | 11 |
| GO:0050688 | regulation of defense response to virus | 9.98E-08 | 8 |
Figure 4Significant correlation between key genes expression and survival. Survival curves of genes HOXA1, EVPL, PXN, and NUSAP1, X-axis represented survival time and Y-axis represented survival rate.