Literature DB >> 32578582

Comprehensive Analysis of Candidate Diagnostic and Prognostic Biomarkers Associated with Lung Adenocarcinoma.

Jingyuan Li1, Xingyuan Liu2,3, Zan Cui1, Guanying Han1.   

Abstract

BACKGROUND We aimed to screen and identify central genetic and molecular targets involved in advancement of lung adenocarcinoma (LUAD) and to perform an integrated analysis and clinical validation. MATERIAL AND METHODS The GEO2R technique was utilized to assess differentially expressed genes (DEGs) among the gene sets GSE75037, GSE85716, and GSE118370. Subsequently, gene Ontology (GO) analyses and Kyoto Encyclopedia of Genes and Genomes (KEGG) analytical methods were executed to determine related biofunctions and signaling pathways, which were annotated with tools from the Database for Annotation, Visualization and Integrated Discovery (DAVID) resource. Then, a protein-protein interaction (PPI) network complex consisting of all detected DEGs was built with the STRING web interface. Cytohubba and MCODE plug-ins for Cytoscape software and Gene Expression Profiling Interactive Analysis (GEPIA) were employed to identify the hub genes. Finally, the mRNA expression of the identified hub genes was quantitatively validated by The Cancer Genome Atlas (TCGA) database analysis and real-time quantitative polymerase chain reaction (RT-qPCR). RESULTS We screened 146 upregulated DEGs and 431 downregulated DEGs with the criteria of |logFC| >1 and P<0.05, and the GO analysis indicated that DEGs were implicated in mitotic nuclear division (biological process, BP), the nucleus (cellular component, CC), and protein binding (molecular function, MF) and were associated with multiple KEGG pathways, such as the p53 signaling pathway in cancer. Then, the top 8 genes that predicted significantly different outcomes in LUAD patients were filtered from the DEGs and selected as hub genes. The TCGA database analysis and RT-qPCR results demonstrated that these genes were differentially expressed with the same trends in LUAD tissues compared with normal tissues. CONCLUSIONS Overall, we propose that 8 genes (PECAM1, CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, and RRM2) might be novel hub genes strongly associated with the progression and prognosis of LUAD.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32578582      PMCID: PMC7331474          DOI: 10.12659/MSM.922070

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

Lung cancer remains the principal culprit of cancer-related morbidity and mortality, accounting for 18.4% of cancer deaths based in a worldwide survey, corresponding to approximately 1.8 million deaths [1]. Lung adenocarcinoma (LUAD), which habitually occurs in peripheral portions of the lungs, is overtaking lung squamous cell carcinoma (LUSC) as the top prevailing pathological type of lung cancer and is characterized by glandular differentiation and/or mucin production [2]. IASLC/ATS/ERS corporately released a current multidisciplinary categorization of LUAD in 2011 that includes concordant expert terminology and comprehensive diagnostic criteria [3]. It reveals the correlation between the histological subtypes of LUAD and the risk of poor prognosis through comparison and analysis of big data from patients [4]. At present, the 5-year survival rate of MIA (minimally invasive adenocarcinoma) and AIS (adenocarcinoma in situ) after complete removal is approximately 100% and thus is associated with favorable prognosis [5]; however, the outcomes of other LUAD subtypes are unfavorable, and the 5-year survival rate is barely 21% [6]. Dissimilar clinical factors, including clinicopathological T classification and OSR (overall survival rate), exist between LUAD and LUSC owing to differences in genomic alternations, including conversion rate, mutation characteristics and frequently mutated genes [7]. The occurrence and progression of LUAD are closely related to driver mutations [8]. High-throughput sequencing, with extended read length and appropriate stability, has become more frequently applied for the exploration of candidate genes in cancer, diabetes, autism, and other genetic diseases and further increased the probability of identifying genes in non-model species. In the field of non-small-cell lung cancer (NSCLC)-targeted therapeutic strategies, studies concentrating on MET, RAT and BRAF as targets are being carried out in addition to studies on targeting ALK, EGFR, and ROS1 [9]. LUAD, which only shows symptoms similar to general respiratory diseases in the early stage, is probably overlooked, so uncovering biomarkers will substantially augment the potential to identify molecular targets in clinical practice. In addition, screening predictive biomarkers has been confirmed to be a necessary condition for discovering targeted anticancer drugs. In addition, patients with specific subtypes could potentially be recognized by applying biomarkers that can predict targeted drug response, invasive or malignant behavior, and drug resistance mechanisms. Accordingly, the effort of unrevealing beneficial and effective biomarkers for the preclinical diagnosis and prognostic prediction of LUAD is extremely urgent. Comprehensive analysis of the gene expression profile array using bioinformatics techniques was carried out to pinpoint various differentially expressed genes (DEGs). In the current study, upregulated DEGs (uDEGs) and downregulated DEGs (dDEGs) were filtered among thousands of DEGs in 3 GSE datasets of patients with LUAD. Subsequently, we performed Gene Ontology (GO) analyses, including results in the BP, CC and MF categories, to explore biofunctions and singling pathway enrichment; Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and protein-protein interaction (PPI) network construction was also performed with the indicated uDEGs and dDEGs. After the candidate genes were selected under certain conditions, the hub genes were obtained by survival analysis. Then, the differential messenger RNA (mRNA) expression levels of selected hub genes in LUAD tissues were confirmed by The Cancer Genome Atlas (TCGA) database analysis and real-time quantitative polymerase chain reaction (RT-qPCR). We intend to further clarify the intricate molecular biology of the pathogenesis of LUAD and substantiate potential pivotal genes that may be encouraging candidate biomarkers of diagnosis and prognosis, targets for the development of novel anticancer agents, or markers for drug resistance and precise treatment.

Material and Methods

Data source

GSE75037 data were detected with an Illumina HumanWG-6 v3.0 expression beadchip [10], GSE85716 analysis performed on an Agilent-062918 OE Human lncRNA Microarray V4.0 02800 [11], and GSE118370 analysis was performed on an Affymetrix Human Genome U133 Plus 2.0 Array [12]; these data were extracted from the GEO server, a free open genome database based on microarrays and sequences [13]. The characteristics of the 3 gene expression datasets are described in Table 1.
Table 1

The main features of 3 studies of GEO gene expression microarray data.

GEO accessionPlatformCase/ControlSampleDate
GSE75037GPL688483/83LUAD2016
GSE85716GPL196126/6LUAD2017
GSE118370GPL5706/6LUAD2019

LUAD – lung adenocarcinoma.

Data preprocessing

GEO2R () is based on R packages GEOquery and Limma, which is a GEO online analysis tool used to screen DEGs between normal lung tissue and LUAD samples in the current study. The genes that satisfied the conditions of |logFC| >1 and P<0.05 were determined to be statistically significant DEGs. The DEGs we obtained were grouped into 2 categories depending on logFC: uDEGs, logFC >1 and dDEGs, logFC <–1. The common intersection of identified uDEGs and dDEGs from GSE75037, GSE85716, and GSE118370 was separately acquired using Venny diagram plotter ().

Functional enrichment of overlapping uDEGs and dDEGs

The Database for Annotation, Visualization and Integrated Discovery (DAVID), a bioinformatics database that provides integrated and detailed biofunctional annotation information for sizable cohorts of genes or proteins [14], was applied to reveal the functional enrichment of uDEGs and dDEGs, including terms related to BP, CC, and MF. KEGG is a habitually used database for systematically uncovering the principal metabolic signaling pathways of gene expression products in cells. The DAVID web server was utilized to investigate the KEGG pathways of uDEGs and dDEGs in the current research.

PPI network complex construction and module analysis

STRING, a database to inspect interactions between recognized proteins [15], was implemented to assemble the PPI network complex of uDEGs and dDEGs. Cytoscape is a public serviceable program for improved visualization of interaction networks between molecules with integrated data [16]. The cytoHubba plug-in in Cytoscape software explores considerable nodes based on the score of all 11 methods in each node in the network. Accordingly, the PPI network module with a significant gene pair (combination score >0.2) was obtained with the MCODE plug-in. MCODE clustering score >5 and number of nodes >5 were used as the cutoff values for investigating meaningful modules. DAVID was applied to explore the functional enrichment of, and pathways involved with the genes in each module.

Determination of hub genes

GEPIA, which is implemented to assess the impact of hub genes on the prognostic outcome of the occurrence and progression of LUAD, is a well-performing web server that allows for interactive analysis of the specific functions of key genes, including profile analysis of differential expression in various tumors, pathological staging analysis, and survival analysis of patients according to DEGs [17]. In the course of survival analysis, patients were separated into high and low groups according to the median expression level of each selected gene. The significance between 2 groups was determined by the log-rank T test, and hazard ratio (HR) was estimated to evaluate the association between gene expression and survival rate. Kaplan-Meier charts with valid log-rank P values (<0.05 as the cutoff criterion) were applied to assess overall survival rate (OSR) and filter the hub genes between the 2 groups. GEPIA was used to observe the recognized differentially expressed hub genes in specific clinicopathological stages of LUAD. The expression of DEGs in different pathological stages was evaluated with one-way analysis of variance.

TCGA database validation

TCGA was utilized, which has 2.5 petabytes (PB) of detailed data on 33 cancers from no less than 10 000 patients. The hub genes were verified using the GEPIA tool based on TCGA data of LUAD and corresponding paracancerous tissues.

Patient sample collection

To further validate the robustness of the hub genes identified, 28 LUAD specimens and 8 benign marginal normal lung tissues were collected in the First Affiliated Hospital of Jinzhou Medical University from January 2016 to December 2018. The characteristics of the patients participating in the study are shown in Table 2. Prior to the use of tissue samples, informed consent was obtained from all patients. The study complied with the Helsinki Declaration and was approved by the Medical Ethics Committee of Jinzhou Medical University. All specimens were classified histologically by clinical pathologists according to the 2015 World Health Organization (WHO) classification of lung tumors. Each biopsy specimen was divided into 2 parts, one for routine histological diagnosis and the other for rapid freezing in RNAlater (Invitrogen) and storage at –20°C until total RNA separation and analysis.
Table 2

Clinicopathologic features of patients with LUAD in JMU.

VariablesAll patient
Patient number36
Age, years59.64±6.78
Body mass index, kg/m223.3±2.99
Gender
 Male17
 Female19
Clinicopathologic diagnosis (WHO)
 LUAD I7
 LUAD II7
 LUAD III7
 LUAD IV7
 LT8
Lymph node involvement
 N13
 Y15

JMU – Jinzhou Medical University; LUAD – lung adenocarcinoma; LT – lung tissue; WHO – World Health Organization.

RT-qPCR

Total RNA was obtained from 28 LUAD cases and 8 benign pulmonary tissue cases with TRIzol RNA separation reagent (Invitrogen) following the manufacturer’s steps, and then the SuperScript IV System kit (Invitrogen) was used to synthesize cDNA. As described in a previous study [18], TB Green Premix Ex Taq II m (Takara) was used for RT-qPCR, and a 7500 Real-Time PCR Applied Bio-system (Thermo Fisher) was utilized to examine the expression difference with GAPDH as an internal reference. The primers used to amplify PECAM1, CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, RRM2, and GAPDH were synthesized and constructed by GenePharma Corporation (Shanghai, China). The primer sequences are shown in Table 3. The 2−ΔΔCt method was utilized to compare the relative mRNA expression of the tumor samples and the control.
Table 3

Primer pair sequences for RT-qPCR of PECAM1, CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, RRM2, and GAPDH.

GeneForward (5′-3′)Reverse (5′-3′)
PECAM1atgccagtggaaatgtcctcagaagtggtactggtg
CDK1ccgtcgtaacctgttgagtaactatgtctacccttatacaccacaccgtaa
MKI67cgcgaattcagagagcttttccagacaccatgcgagcctcgaggaagattgttggggtacccac
SPP1ttctgattgggacagccgtgtctcatcattggctttccgct
TOP2Aggtgagaaggactggcagaaatcttgtcgatgaagtacagggcta
CHEK1ccagatgctcagagattcttccatgttcaacaaacgctcacgatta
CCNB1ttcgcctgagcctattttggtaagctccatcttctgcatccacat
RRM2tttagtgagcttagcacagcgggaaaatctgcgttgaagcagtgaggc
GAPDHtccaccaccctgttgctgtagacttcaacagcaactcccac

Results

DEGs screening

Criteria (|logFC| >1 and P<0.05) were adopted as thresholds for DEG filtration. Relevant data from GSE75037, GSE85716 and GSE118370 were used to estimate differentially expressed genes (DEGs) using GEO2R. A total of 577 DEGs were identified, among which 146 genes were upregulated (uDEGs) and 431 genes were downregulated (dDEGs), as demonstrated in the Venn diagram (Figure 1). The number of dDEGs exceeded the number of uDEGs. The top 5 most significant DEGs among the GSE75037, GSE85716 and GSE118370 datasets are presented in Table 4.
Figure 1

Venn diagram presenting the overlapping (A) uDEGs and (B) dDEGs in the 3 GEO datasets. In total, 146 uDEGs and 431 dDEGs were commonly identified between LUAD and corresponding normal adjacent lung samples. uDEGs – upregulated differentially expressed genes; dDEGs – downregulated differentially expressed genes; LUAD – lung adenocarcinoma.

Table 4

The most significant 5 uDEGs and 5 dDEGs in GSE75037, GSE85716, and GSE118370 datasets.

Gene symbolGSE75037Gene symbolGSE85716Gene symbolGSE118370
P valuelogFCP valuelogFCP valuelogFC
The most significant 5 uDEGs
MMP111.90E-535.12GLB1L39.61E-055.08254515SPINK12.45E-054.7511875
EEF1A25.17E-295.05CD1A1.78E-033.63170602HS6ST27.92E-034.3473223
GCNT32.88E-424.92TMPRSS42.04E-033.87381578TOX31.69E-044.1766185
CST14.27E-314.75TUBB33.19E-034.16514507CP3.11E-044.1578082
FAM83A4.68E-454.68XAGE1A3.46E-035.40792158MSMB2.86E-044.1265297
The most significant 5 dDEGs
FABP41.45E-85−6.03AGER5.93E-09−5.94009052CD300LG4.19E-04−4.9800372
CLDN181.16E-38−6.03SLC6A41.20E-06−5.934542FLJ345039.35E-05−4.9718172
ITLN22.71E-71−6.02FAM107A2.11E-06−5.53509465SLC19A35.58E-04−4.9164017
AGER6.39E-68−5.72SOSTDC16.81E-05−6.41373342FABP45.03E-05−4.8213644
SFTPA11.85E-36−5.54CLDN183.02E-04−5.67266395SLC6A47.30E-06−4.7189991

uDEGs – upregulated differentially expressed genes; dDEGs – downregulated differentially expressed genes.

Pathway enrichment analysis

The outcomes of uDEGs GO analysis demonstrated that mitotic nuclear division, nucleus, and protein binding were the most crucial terms in the BP, CC, and MF categories, respectively. The dDEGs GO analysis results indicated that positive regulation of transcription from the RNA polymerase II promoter, integral component of the membrane, and protein binding were the most significant terms in the BP, CC, and MF categories (Table 5). KEGG analysis was conducted on the uDEGs and dDEGs to further explore the key pathways involved. The uDEGs were generally implicated in the p53 signaling pathway, cell cycle, and oocyte meiosis signaling pathway, and the dDEGs were substantially implicated in the pathways in cancer, cAMP signaling pathway, and neuroactive ligand-receptor interaction signaling pathway (Table 6). These significantly enriched GO terms and pathways could be revealed to further apprehend the function of these DEGs in the occasion and progress of LUAD.
Table 5

TOP 5 GO function annotation of common uDEGs and dDEGs associated with LUAD.

CategoryGO-IDGO-termCount%P-value
GO analysis of uDEGs (TOP5)
BPGO: 0007067Mitotic nuclear division128.225.6E-06
BPGO: 0006260DNA replication96.163.8E-05
BPGO: 0000086G2/M transition of mitotic cell cycle85.481.3E-04
BPGO: 0051301Cell division128.221.3E-04
BPGO: 0006977DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest64.111.4E-04
CCGO: 0005634Nucleus5336.304.5E-02
CCGO: 0005737Cytoplasm5235.623.6E-02
CCGO: 0016324Apical plasma membrane85.487.9E-03
CCGO: 0005813Centrosome85.484.9E-02
CCGO: 0030496Midbody74.795.2E-04
MFGO: 0005515Protein binding7954.113.3E-02
MFGO: 0005524ATP binding2215.074.6E-03
MFGO: 0042802Identical protein binding149.594.8E-03
MFGO: 0004674Protein serine/threonine kinase activity96.168.3E-03
MFGO: 0019901Protein kinase binding96.168.3E-03
GO Analysis of dDEGs (TOP5)
BPGO: 0045944Positive regulation of transcription from RNA polymerase II promoter4510.472.6E-05
BPGO: 0007165Signal transduction4410.231.9E-03
BPGO: 0007155Cell adhesion337.674.0E-08
BPGO: 0001525Angiogenesis327.441.2E-15
BPGO: 0000122Negative regulation of transcription from RNA polymerase II promoter306.983.2E-03
CCGO: 0016021Integral component of membrane15736.514.7E-05
CCGO: 0005886Plasma membrane15636.281.5E-11
CCGO: 0070062Extracellular exosome8018.603.3E-02
CCGO: 0005887Integral component of plasma membrane7116.518.2E-10
CCGO: 0005576Extracellular region7016.283.3E-07
MFGO: 0005515Protein binding22251.637.6E-03
MFGO: 0005509Calcium ion binding286.516.2E-03
MFGO: 0042803Protein homodimerization activity276.281.4E-02
MFGO: 0046982Protein heterodimerization activity194.421.8E-02
MFGO: 0008134Transcription factor binding184.192.5E-04

GO – Gene Ontology; uDEGs – upregulated differentially expressed genes; dDEGs – downregulated differentially expressed genes; LUAD – lung adenocarcinoma, BP – biological process, CC – cellular component MF – molecular function.

Table 6

KEGG pathway analysis of uDEGs and dDEGs.

IDKEGG_PATHWAY descriptionCount%P-value
uDEGs
 hsa04115p53 signaling pathway85.481.6E-06
 hsa04110Cell cycle85.489.1E-05
 hsa04114Oocyte meiosis53.421.6E-02
 hsa04914Progesterone-mediated oocyte maturation42.744.0E-02
 hsa05219Bladder cancer32.054.9E-02
dDEGs
 hsa05200Pathways in cancer235.356.4E-04
 hsa04024cAMP signaling pathway153.497.2E-04
 hsa04080Neuroactive ligand-receptor interaction153.491.5E-02
 hsa04550Signaling pathways regulating pluripotency of stem cells122.791.1E-03
 hsa04514Cell adhesion molecules (CAMs)122.791.3E-03

KEGG – Kyoto Encyclopedia of Genes and Genomes; uDEGs – upregulated differentially expressed genes; dDEGs – downregulated differentially expressed genes.

PPI network and module analysis

The PPI network was constructed by using STRING to retrieve interactions between genes. The established PPI network consisted of 577 edges and 2166 nodes (Figure 2A), with a local clustering coefficient=0.4. The top 20 highest-level central nodes, including IL6, CDH1, VWF, PECAM1, CDK1, CD34, CXCL12, JUN, BMP2, MKI67, PPARG, CTGF, COL1A1, SPP1, CAV1, TOP2A, CHEK1, CCNB1, and BDNF, were screened out with the cytoHubba plug-in. To facilitate our understanding of the DEGs, we visualized the network complex in Cytoscape software and modularized it using the plug-in MCODE to obtain 3 consequential modules including module 1 (28 nodes and 343 edges, cluster score=25.407), module 2 (28 nodes and 118 edges, cluster score=8.741) and module 3 (5 nodes and 10 edges, cluster score=5) (Figure 2B–2D). The KEGG analysis laid out 4 related signaling pathways that were principally involved in module 1, including the cell cycle, p53 signaling pathway, and oocyte meiosis; 15 related signaling pathways were considerably involved in module 2, such as pathways in cancer; 9 related signaling pathways were significantly implicated in module 3, including basal cell carcinoma and others (Table 7).
Figure 2

Construction of the protein-protein interaction (PPI) network complex with the STRING database and acquisition of 3 significant modules on the basis of the topological structure of the PPI network of the common uDEGs and dDEGs. (A) PPI network of uDEGs and dDEGs in LUAD tissues compared with corresponding normal adjacent tissues; (B) module 1, cluster score=25.407, 28 nodes and 343 edges; (C) module 2, cluster score=8.741, 28 nodes and 118 edges; (D) module 3, cluster score=5, 5 nodes and 10 edges. The blue squares indicate a node degree less than 30. uDEGs – upregulated differentially expressed genes; dDEGs – downregulated differentially expressed genes; LUAD – lung adenocarcinoma.

Table 7

KEGG pathway analysis of 3 key module in PPI network complex.

IDKEGG_PATHWAYCount%P-valueGenes
Module 1hsa04110Cell cycle621.431.3E-06CCNB1, CDK1, MAD2L1, CHEK1, CDC25C, MCM4
hsa04115p53 signaling pathway517.863.8E-06CCNB1, CDK1, RRM2, CHEK1, GTSE1
hsa04114Oocyte meiosis517.862.9E-05CCNB1, CDK1, MAD2L1, AURKA, CDC25C
Module 2hsa05200Pathways in cancer80.191.1E-04ADCY4, BMP2, CDKN2A, PPARG, CDH1, GNG2, GNG11, MMP1
hsa04151PI3K-Akt signaling pathway60.143.2E-03CSF3, TEK, ANGPT1, GNG2, GNG11, SPP1
hsa05219Bladder cancer30.076.8E-03CDKN2A, CDH1, MMP1
Module 3hsa05217Basal cell carcinoma5100.003.4E-09WNT7B, FZD1, AXIN2, FZD4, WNT2B
hsa04310Wnt signaling pathway5100.001.6E-07WNT7B, FZD1, AXIN2, FZD4, WNT2B
hsa04550Signaling pathways regulating pluripotency of stem cells5100.001.6E-07WNT7B, FZD1, AXIN2, FZD4, WNT2B

KEGG – Kyoto Encyclopedia of Genes and Genomes; PPI – protein–protein interaction

Recognition of hub genes

To better define hub genes in LUAD, 3 conditions were set for hub genes: 1) the degree score of DEGs tested by cytohHubba was in the top 15; 2) DEGs were in the 3 modules obtained by MCODE; and 3) DEGs were associated with the survival rate of the patients in the survival curve (Figure 3A–3H). We used the top 15 genes identified by cytoHubba and the common genes in the 3 modules as candidate genes in the PPI network and then verified whether candidate genes influenced survival, as shown by the Kaplan-Meier chart. The obtained hub genes, including PECAM1 (F value=0.308), CDK1 (F value=5.91), MKI67 (F value=6.38), SPP1 (F value=1.31), TOP2A (F value=2.88), CHEK1 (F value=7.27), CCNB1 (F value=8.88), and RRM2 (F value=9.01), had altered expression, as shown by violin charts according to the pathological stage of the patient using GEPIA (Figure 3I–3P). Among these genes, PECAM1 was downregulated, while others were upregulated in LUAD compared to normal samples.
Figure 3

Kaplan-Meier survival curves indicated the prognostic value of hub genes in the form of overall survival (months) in human LUAD patients; (A) PECAM1, (B) CDK1, (C) MKI67, (D) SPP1, (E) TOP2A, (F) CHEK1, (G) CCNB1, (H) RRM2. Violin plots demonstrated an association between the pathological stage of human LUAD and the expression of 8 hub genes based on TCGA data analysis using GEPIA; (I) PECAM1, (J) CDK1, (K) MKI67, (L) SPP1, (M) TOP2A, (N) CHEK1, (O) CCNB1, (P) RRM2. LUAD – lung adenocarcinoma; TCGA – The Cancer Genome Atlas.

Verification of hub genes by TCGA database analysis and RT-qPCR

The mRNA expression level of hub genes was verified based on TCGA database analysis, and it was revealed that the mRNA expression levels of CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, and RRM2 in LUAD samples were statistically higher than those in non-lung cancer samples (Figure 4A–4H). This is consistent with previous bioinformatics research.
Figure 4

DEGs were validated in LUAD and normal lung tissue. The gene expression of (A) PECAM1, (B) CDK1, (C) MKI67, (D) SPP1, (E) TOP2A, (F) CHEK1, (G) CCNB1, (H) RRM2 in 483 LUAD patients and 347 normal tissues was evaluated by GEPIA. The relative mRNA expression levels of (I) PECAM1, (J) CDK1, (K) MKI67, (L) SPP1, (M) TOP2A, (N) CHEK1, (O) CCNB1, and (P) RRM2 between 28 LUAD patients and 8 normal lung tissues were determined by RT-qPCR. All data are shown as the mean±SD, ### P<0.001 versus the normal group. DEGs – differentially expressed genes; LUAD – lung adenocarcinoma; GEPIA – Gene Expression Profiling Interactive Analysis; mRNA – messenger RNA; RT-qPCR – real-time quantitative polymerase chain reaction; SD – standard deviation.

To determine whether hub genes identified in gene chip analysis can be non-selectively used to identify LUAD patients in clinical practice, the mRNA expression discrepancy of selected hub genes in LUAD samples and normal lung cancer tissues was verified by RT-qPCR examination (Figure 4I–4P). These results showed that CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, and RRM2 mRNA expression levels were significantly upregulated and PECAM1 was significantly downregulated, which was consistent with the gene chip analysis results. The results of RT-qPCR confirmed that the mRNA expression of selected hub genes in human LUAD tissues was higher than that in the control tissues, suggesting that these 8 hub genes might be new genetic markers for LUAD patients.

Discussion

Previous preclinical and clinical studies revealed a partial understanding of the underlying genetic mechanism of LUAD; nevertheless, the incidence and mortality rates of LUAD are continually increasing [5]. The rapid advancement of high-throughput technology has led to the identification of massive cohorts of applicable biomarkers, such as microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), that can be used for early diagnosis, prognosis, and treatment decision making for many diseases, including LUAD [19]. In the current research, 3 gene expression profiling sets were subjected to integrated analysis, and 577 DEGs were identified, including 146 uDEGs and 431 dDEGs. Functional annotation of uDEGs and dDEGs was divided into 3 groups according to GO terminology (BP, CC, and MF), and KEGG pathway enrichment of uDEGs and dDEGs was analyzed by the DAVID database. In line with KEGG pathway enrichment analysis, the uDEGs were mainly implicated in the pathways in cancer, cAMP signaling pathway, and neuroactive ligand-receptor interaction signaling pathway; the dDEGs were significantly involved in the p53 signaling pathway, cell cycle, and oocyte meiosis signaling pathway. According to previous studies [20-22], DEGs identified from LUAD are enriched in signaling pathways, such as the p53 signaling pathway, cell cycle, oocyte meiosis, pathways in cancer, cAMP signaling pathway, and neuroactive ligand-receptor interaction, which are related pathways involved in the advancement of malignant tumors. A PPI network of uDEGs and dDEGs was constructed, and the top 15 candidate genes and the most important modules were screened based on the topological structures of the network. Through comprehensive bioinformatics analysis, 8 hub genes, including PECAM1, CDK1, MKI67, SPP1, TOP2A, CHEK1, CCNB1, and RRM2, were identified to have high connectivity and could distinguish the stage of LUAD from both benign lung disease and normal tissue. At present, the treatment options for LUAD include surgery, chemotherapy, radiotherapy and targeted drug therapy, which improve the survival of patients with mild effects [23]. Platelet and endothelial cell adhesion molecule 1 (PECAM-1) is commonly detected in vascular endothelial cells and is involved in various physiological processes, including platelet aggregation, angiogenesis, and protection of the endothelium from endotoxin stress [24]. PECAM-1 is mediated by regulation of the tumor microenvironment (TME) and tumor cell proliferation, which is related to advanced tumor metastasis progression [25], and has a distinct differential expression in hepatocellular carcinoma (HCC) [26]. Cyclin-dependent kinase-1 (CDK1), a critical regulator of the G2/M checkpoint, can directly phosphorylate HIF-1α, upregulate HIF-1α transcription, and increase the invasion and migration of tumor cells [27]. Research has revealed that the nuclear/cytoplasmic expression ratio of CDK1 was determined as a self-sufficient prognostic reference point of colorectal cancer with histochemical staining [28]. CDK1, which is an effective prognostic indicator in patients with pancreatic ductal adenocarcinoma, could promote tumor progression [29]. Marker of proliferation Ki-67 (MKI67), a peripheral part of the mitotic chromosome, keeps the chromosome from collapsing into a single chromatin mass after the disintegration of the nuclear membrane; consequently, chromosomes can independently mobilize and effectively interact with mitotic spindles. The high expression of MKI67 mRNA is actually correlated with the advanced stage of non-muscle invasive bladder cancer [30]. Ki-67 functions in the proliferation of malignant cells at the front of infiltrating tumors, which is highly correlated with the invasiveness and poor clinical outcomes of triple-negative breast cancer [31], and has great potential as a biomarker of oral squamous cell carcinoma (OSCC) [32]. Decreased expression of secreted phosphoprotein 1 (SPP1), associated with EGFR mutation, is interconnected with the improvement of the overall survival rate and recurrence-free survival rate of LUAD [33]. SPP1 in hydrothorax can be applied in the detection of malignant pleural effusion and judgment of patients with NSCLC [34]. Additionally, the overexpression of SPP1 in OSCC and HCC is related to the occurrence and progression of tumors [35,36]. The overexpression of DNA topoisomerase II alpha (TOP2A) has a negative effect on the prognosis of breast cancer patients; hence, TOP2A-targeted therapy might be beneficial to the treatment and prognosis prediction of breast cancer [37]. Patients with TOP2A-positive breast cancer are more sensitive to anthracyclines than TOP2A-negative patients [38]. TOP2A is highly expressed in more advanced uterine leiomyosarcoma with a high mitotic index but not in nonmalignant uterine diseases [39]. It has been previously reported that the protein encoded by the checkpoint kinase-1 (CHEK1) gene, which is essential for cell cycle arrest in the presence of DNA damage or unreplicated chromatid, belongs to a conserved serine/threonine kinase family. It is a meaningful oncogene associated with poor prognosis and is overexpressed in both esophageal squamous cell carcinoma and HCC [40,41]. The expression of CHEK1 can be applied not only as a prognostic indicator but also as a marker for the selection of CHEK1 inhibitors in patients with acute myeloid leukemia [42]. The high expression rate of CHEK1 in breast cancer was found to be 61%, and high expression was related to tumor size, triple-negative subtype, basal phenotype, epithelial-stromal transformation, dysfunction of the DNA homologous repair pathway and poor prognosis [43]. Abnormal expression of cyclin B1 (CCNB1) has been observed in a variety of tumors, including pituitary adenomas, and increases moderately with the escalation of invasiveness [18]. CCNB1 is a pivotal factor in the proliferation of hepatocellular carcinoma cells, and FOXM1 binds the CCNB1 promoter region to regulate the transcription of CCNB1 [44]. CCNB1 is an influential biomarker for the prognosis of estrogen receptor (ER)+ breast cancer; targeting CCNB1 can prevent or even reverse resistance to hormone therapy and facilitate personalized treatment [45]. The expression of ribonucleotide reductase regulatory subunit M2 (RRM2) correlates with clinical stage and is increased significantly in neuroblastoma compared to adjacent benign tissues [46]. HPVE7 facilitates to RRM2 upregulation and promotes the occurrence of cervical carcinoma through angiogenesis induced by Ros-ERK1/2-HIF-1, and α-VEGF. The overexpression of RRM2 is intimately related to the occurrence and progression of human ovarian carcinoma and breast cancer [47,48].

Conclusions

8 hub genes in LUAD were identified from hundreds of candidate DEGs. The hub genes were markedly correlated with the overall survival of LUAD patients, and their expression was histologically validated. Studies have shown that dysregulated gene expression can lead to the occurrence of tumors. This study improves our understanding of the molecular determinants of LUAD progression and the expression reliability of biomarkers and provides new biomarkers that may potentially support the diagnosis and precise identification of treatment targets for LUAD. However, in vivo and in vitro trials and multicenter randomized controlled clinical trials are still needed before these biomarkers can be accurately applied in clinical laboratory diagnostics.
  48 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Associations between mutations and histologic patterns of mucin in lung adenocarcinoma: invasive mucinous pattern and extracellular mucin are associated with KRAS mutation.

Authors:  Kyuichi Kadota; Yi-Chen Yeh; Sandra P D'Angelo; Andre L Moreira; Deborah Kuk; Camelia S Sima; Gregory J Riely; Maria E Arcila; Mark G Kris; Valerie W Rusch; Prasad S Adusumilli; William D Travis
Journal:  Am J Surg Pathol       Date:  2014-08       Impact factor: 6.394

3.  High expression of CDK1 and BUB1 predicts poor prognosis of pancreatic ductal adenocarcinoma.

Authors:  Junjie Piao; Lianhua Zhu; Jie Sun; Nan Li; Bing Dong; Yang Yang; Liyan Chen
Journal:  Gene       Date:  2019-03-18       Impact factor: 3.688

Review 4.  Emerging cell cycle inhibitors for treating metastatic castration-resistant prostate cancer.

Authors:  Anupam Batra; Eric Winquist
Journal:  Expert Opin Emerg Drugs       Date:  2018-11-28       Impact factor: 4.191

5.  Upregulation of cyclin B1 plays potential roles in the invasiveness of pituitary adenomas.

Authors:  Peng Zhao; Pengfei Zhang; Wei Hu; Hongyun Wang; Guoqiang Yu; Zhuang Wang; Chuzhong Li; Jiwei Bai; Yazhuo Zhang
Journal:  J Clin Neurosci       Date:  2017-06-07       Impact factor: 1.961

6.  Checkpoint kinase 1 is negatively regulated by miR-497 in hepatocellular carcinoma.

Authors:  Yin Xie; Rong-Rong Wei; Guo-Liang Huang; Mei-Yin Zhang; Yun-Fei Yuan; Hui-Yun Wang
Journal:  Med Oncol       Date:  2014-01-25       Impact factor: 3.064

7.  CDK1 stabilizes HIF-1α via direct phosphorylation of Ser668 to promote tumor growth.

Authors:  Noel A Warfel; Nathan G Dolloff; David T Dicker; Jozef Malysz; Wafik S El-Deiry
Journal:  Cell Cycle       Date:  2013-10-25       Impact factor: 4.534

8.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

9.  High nuclear/cytoplasmic ratio of Cdk1 expression predicts poor prognosis in colorectal cancer patients.

Authors:  Wen-Wei Sung; Yueh-Min Lin; Pei-Ru Wu; Hsu-Heng Yen; Hung-Wen Lai; Tzu-Cheng Su; Ren-Hung Huang; Chun-Kai Wen; Chia-Yu Chen; Chih-Jung Chen; Kun-Tu Yeh
Journal:  BMC Cancer       Date:  2014-12-15       Impact factor: 4.430

10.  Involvement of TIMP-1 in PECAM-1-mediated tumor dissemination.

Authors:  Valsamma Abraham; Gaoyuan Cao; Andrew Parambath; Fareedah Lawal; Chakkrapong Handumrongkul; Robert Debs; Horace M DeLisser
Journal:  Int J Oncol       Date:  2018-05-25       Impact factor: 5.650

View more
  3 in total

1.  CTHRC1 expression is a novel shared diagnostic and prognostic biomarker of survival in six different human cancer subtypes.

Authors:  Nuzhat Sial; Mukhtiar Ahmad; Muhammad Safdar Hussain; Muhammad Junaid Iqbal; Yasir Hameed; Mehran Khan; Mustansar Abbas; Rizwan Asif; Jalil Ur Rehman; Muhammad Atif; Muhammad Rashid Khan; Zahid Hameed; Hina Saeed; Rida Tanveer; Saba Saeed; Aneeqa Sharif; Hafiz Muhammad Asif
Journal:  Sci Rep       Date:  2021-10-06       Impact factor: 4.379

2.  Multi-Omics Analysis Identified TMED2 as a Shared Potential Biomarker in Six Subtypes of Human Cancer.

Authors:  Nuzhat Sial; Saba Saeed; Mukhtiar Ahmad; Yasir Hameed; Abdul Rehman; Mustansar Abbas; Rizwan Asif; Hamad Ahmed; Muhammad Safdar Hussain; Jalil Ur Rehman; Muhammad Atif; Muhammad Rashid Khan
Journal:  Int J Gen Med       Date:  2021-10-21

3.  Construction and Analysis of a circRNA-Mediated ceRNA Network in Lung Adenocarcinoma.

Authors:  Zhaojun Wang; Hanzhong Pei; Hongsen Liang; Qiwei Zhang; Li Wei; Donglei Shi; Yun Chen; Junhang Zhang
Journal:  Onco Targets Ther       Date:  2021-06-08       Impact factor: 4.147

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.