| Literature DB >> 35028418 |
Chengrui Li1, Yufeng Wan2, Weijun Deng3, Fan Fei4, Linlin Wang5, Fuwei Qi4, Zhong Zheng4.
Abstract
Lung adenocarcinoma (LUAD) is the most common subtype of non-small cell lung cancer associated with an unstable prognosis. Thus, there is an urgent demand for the identification of novel diagnostic and prognostic biomarkers as well as targeted drugs for LUAD. The present study aimed to identify potential new biomarkers associated with the pathogenesis and prognosis of LUAD. Three microarray datasets (GSE10072, GSE31210, and GSE40791) from the Gene Expression Omnibus database were integrated to identify the differentially expressed genes (DEGs) in normal and LUAD samples using the limma package. Bioinformatics tools were used to perform functional and signaling pathway enrichment analyses for the DEGs. The expression and prognostic values of the hub genes were further evaluated by Gene Expression Profiling Interactive Analysis and real-time quantitative polymerase chain reaction. Furthermore, we mined the "Connectivity Map" (CMap) to explore candidate small molecules that can reverse the tumoral of LUAD based on the DEGs. A total of 505 DEGs were identified, which included 337 downregulated and 168 upregulated genes. The PPI network was established with 1,860 interactions and 373 nodes. The most significant pathway and functional enrichment associated with the genes were cell adhesion and extracellular matrix-receptor interaction, respectively. Seven DEGs with high connectivity degrees (ZWINT, RRM2, NDC80, KIF4A, CEP55, CENPU, and CENPF) that were significantly associated with worse survival were chosen as hub genes. Lastly, top 20 most important small molecules which reverses the LUAD gene expressions were identified. The findings contribute to revealing the molecular mechanisms of the initiation and progression of LUAD and provide new insights for integrating multiple biomarkers in clinical practice.Entities:
Keywords: bioinformatics analysis; candidate small molecules; lung adenocarcinoma; novel biomarkers; prognosis
Year: 2021 PMID: 35028418 PMCID: PMC8692660 DOI: 10.1515/med-2021-0375
Source DB: PubMed Journal: Open Med (Wars)
Figure 1Study workflow for the identification of primary genes and pathways in LUAD.
Figure 2(a) Volcano plot of the gene expression profile data in LUAD and normal tissues in each dataset. Black dot: non-DEGs; green dot: substantially downregulated genes; red dot: substantially upregulated genes. |log 2 FC| >1 and P <0.05 were considered significant and (b) ((a) Venn diagram of the 505 overlapping DEGs from the GSE10072, GSE31210, and GSE40791 datasets. (b) Upregulated overlapping DEGs; and (c) Downregulated overlapping DEGs).
Functional and pathway enrichment analysis of the overlapping DEGs
| Category | Term | Count |
|
|---|---|---|---|
| GOTERM_BP_FAT | GO:0007155∼cell adhesion | 64 | 2.02 × 10−14 |
| GOTERM_BP_FAT | GO:0022610∼biological adhesion | 64 | 2.12 × 10−14 |
| GOTERM_BP_FAT | GO:0001944∼vasculature development | 32 | 5.91 × 10−11 |
| GOTERM_BP_FAT | GO:0001568∼blood vessel development | 31 | 1.50 × 10−10 |
| GOTERM_BP_FAT | GO:0042127∼regulation of cell proliferation | 57 | 5.75 × 10−9 |
| GOTERM_CC_FAT | GO:0044421∼extracellular region part | 95 | 1.11 × 10−23 |
| GOTERM_CC_FAT | GO:0005576∼extracellular region | 139 | 2.88 × 10−20 |
| GOTERM_CC_FAT | GO:0005578∼proteinaceous extracellular matrix | 43 | 4.65 × 10−15 |
| GOTERM_CC_FAT | GO:0031012∼extracellular matrix | 44 | 1.39 × 10−14 |
| GOTERM_CC_FAT | GO:0005615∼extracellular space | 63 | 6.59 × 10−14 |
| GOTERM_MF_FAT | GO:0030246∼carbohydrate binding | 38 | 1.07 × 10−10 |
| GOTERM_MF_FAT | GO:0005539∼glycosaminoglycan binding | 23 | 3.83 × 10−10 |
| GOTERM_MF_FAT | GO:0001871∼pattern binding | 23 | 2.46 × 10−9 |
| GOTERM_MF_FAT | GO:0030247∼polysaccharide binding | 23 | 2.46 × 10−9 |
| GOTERM_MF_FAT | GO:0008201∼heparin binding | 18 | 1.83 × 10−8 |
| KEGG_PATHWAY | hsa04512:ECM-receptor interaction | 12 | 1.82 × 10−4 |
| KEGG_PATHWAY | hsa04510:Focal adhesion | 18 | 8.11 × 10−4 |
| KEGG_PATHWAY | hsa04514:Cell adhesion molecules (CAMs) | 14 | 8.37 × 10−4 |
| KEGG_PATHWAY | hsa04115:p53 signaling pathway | 9 | 0.00281 |
| KEGG_PATHWAY | hsa04670:Leukocyte transendothelial migration | 12 | 0.003238 |
Figure 3Functional and signaling pathway analyses of the overlapping DEGs in LUAD. (a) Biological process, (b) cellular component, (c) molecular function, and (d) KEGG pathways.
Figure 4(a) PPI network construction and module analysis. (b) The most significant module. (c) The biological process analysis of the module genes by BiNGO. The color depth of the nodes represents the P value correction. The node size represents the total genes involved.
Functional and pathway enrichment analysis of genes in the most significant modules
| Term | ID | Input number |
|
|---|---|---|---|
| Cell cycle | hsa04110 | 8 | 3.31 × 10−13 |
| Oocyte meiosis | hsa04114 | 6 | 1.86 × 10−9 |
| Progesterone-mediated oocyte maturation | hsa04914 | 5 | 3.72 × 10−8 |
| p53 signaling pathway | hsa04115 | 4 | 5.87 × 10−7 |
| DNA replication | hsa03030 | 2 | 0.000548 |
Figure 5(a) Expression levels of hub genes in the normal and LUAD tissues in the three datasets. (b) The heatmap of the module genes in the normal and LUAD samples based on the UCSC database.
Figure 6The (a) expression levels and (b) prognostic values of hub genes based on the GEPIA database.
Figure 7(a) Results of the representative immunohistochemistry staining showing the protein level expressions of the hub genes in normal and LUAD tissues. (b) List of 20 significant small-molecule drugs with the ability to reverse the tumoral status of LUAD.
Figure 8(a) Hub genes and their co-expression gene networks constructed using the cBioPortal tool. Hub genes and their co-expression genes are represented by nodes with thick and thin outlines, respectively. (b) Pop plot of the top 20 small molecules with the ability to reverse the gene expression in LUAD. (c) qPCR validation of hub genes in the seven paired LUAD samples. *P <0.05.
The full name and functional roles of eight hub genes
| Gene symbol | Full name | Function |
|---|---|---|
| CENPF | Centromere protein F | This gene encodes a protein that associates with the centromere-kinetochore complex. The protein is a component of the nuclear matrix during the G2 phase of interphase. In late G2 the protein associates with the kinetochore and maintains this association through early anaphase. It localizes to the spindle midzone and the intracellular bridge in late anaphase and telophase, respectively, and is thought to be subsequently degraded. The localization of this protein suggests that it may play a role in chromosome segregation during mitotis. It is thought to form either a homodimer or heterodimer. Autoantibodies against this protein have been found in patients with cancer or graft versus host disease |
| CENPU | Centromere protein U | The centromere is a specialized chromatin domain, present throughout the cell cycle, that acts as a platform on which the transient assembly of the kinetochore occurs during mitosis. All active centromeres are characterized by the presence of long arrays of nucleosomes in which CENPA (MIM 117139) replaces histone H3 (see MIM 601128). MLF1IP, or CENPU, is an additional factor required for centromere assembly |
| CEP55 | Centrosomal protein 55 | CEP55 (Centrosomal Protein 55) is a Protein Coding gene. Diseases associated with CEP55 include Multinucleated Neurons, Anhydramnios, Renal Dysplasia, Cerebellar Hypoplasia, And Hydranencephaly and Meckel Syndrome, Type 1. Among its related pathways are Cytoskeletal Signaling and DNA Damage. |
| KIF4A | Kinesin family member 4A | This gene encodes a member of the kinesin 4 subfamily of kinesin related proteins. The encoded protein is an ATP dependent microtubule-based motor protein that is involved in the intracellular transport of membranous organelles. This protein also associates with condensed chromosome arms and may be involved in maintaining chromosome integrity during mitosis. This protein may also be involved in the organization of the central spindle prior to cytokinesis. A pseudogene of this gene is found on chromosome X. |
| NDC80 | NDC80, kinetochore complex component | This gene encodes a component of the NDC80 kinetochore complex. The encoded protein consists of an N-terminal microtubule binding domain and a C-terminal coiled-coiled domain that interacts with other components of the complex. This protein functions to organize and stabilize microtubule-kinetochore interactions and is required for proper chromosome segregation |
| RRM2 | Ribonucleotide Reductase Regulatory Subunit M2 | This gene encodes one of two non-identical subunits for ribonucleotide reductase. This reductase catalyzes the formation of deoxyribonucleotides from ribonucleotides. Synthesis of the encoded protein (M2) is regulated in a cell-cycle dependent fashion. Transcription from this gene can initiate from alternative promoters, which results in two isoforms that differ in the lengths of their N-termini. Related pseudogenes have been identified on chromosomes 1 and X. |
| ZWINT | ZW10 interacting kinetochore protein | This gene encodes a protein that is clearly involved in kinetochore function although an exact role is not known. It interacts with ZW10, another kinetochore protein, possibly regulating the association between ZW10 and kinetochores. The encoded protein localizes to prophase kinetochores before ZW10 does and it remains detectable on the kinetochore until late anaphase. It has a uniform distribution in the cytoplasm of interphase cells. Alternatively spliced transcript variants encoding different isoforms have been found for this gene |