Literature DB >> 34925645

Integrative Molecular Analyses of an Individual Transcription Factor-Based Genomic Model for Lung Cancer Prognosis.

Rong Yao1, Leilei Zhou1, Zhongying Guo2, Dahong Zhang1, Tiecheng Zhang1.   

Abstract

OBJECTIVE: Precision medicine with molecular profiles has revolutionized the management of lung cancer contributing to improved prognosis. Herein, we aimed to uncover the gene expression profiling of transcription factors (TFs) in lung cancer as well as to develop a TF-based genomic model.
METHODS: We retrospectively curated lung cancer patients from public databases. Through comparing mRNA expression profiling between lung cancer and normal specimens, specific TFs were determined. Thereafter, a TF genomic model was developed with univariate Cox regression and stepwise multivariable Cox analyses, which was verified through the GSE72094 dataset. Gene set enrichment analyses (GSEA) were presented. Downstream targets of TFs were predicted with ChEA, JASPAR, and MotifMap projects, and their biological significance was investigated through the clusterProfiler algorithm.
RESULTS: In the TCGA cohort, we proposed a TF-based genomic model, comprised of SATB2, HLF, and NPAS2. Lung cancer individuals were remarkably stratified into high- and low-risk groups. Survival analyses uncovered that high-risk populations presented unfavorable survival outcomes. ROCs confirmed the excellent predictive potency in patients' prognosis. Additionally, this model was an independent prognostic indicator in accordance with multivariate analyses. The clinical implication of the model was well verified in an independent dataset. High risk score was in relation to carcinogenic pathways. Downstream targets were characterized by immune and carcinogenic activation.
CONCLUSION: The proposed TF genomic model acts as a promising marker for estimation of lung cancer patients' outcomes. Prospective research is required for testing the clinical utility of the model in individualized management of lung cancer.
Copyright © 2021 Rong Yao et al.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34925645      PMCID: PMC8672105          DOI: 10.1155/2021/5125643

Source DB:  PubMed          Journal:  Dis Markers        ISSN: 0278-0240            Impact factor:   3.434


1. Introduction

Lung cancer occupies the major cause of cancer incidence and mortality across the globe, with estimated 2.1 million newly diagnosed cases as well as 18.4% of all cancer-related deaths in 2018 [1]. It is primarily classified into small cell lung cancer (SCLC) as well as non-small cell lung cancer (NSCLC). Among them, NSCLC occupies around 85% of lung cancer cases, with an undesirable five-year survival rate below 16%. It is comprised of lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC), as well as large cell carcinoma histological subtypes [2]. Chemotherapy represents the major therapeutic option against lung cancer, and platinum-based dual treatment is the standard treatment against advanced patients [3]. Nevertheless, chemotherapeutic resistance remains challenging, impeding much therapeutic success. Though tyrosine kinase-based targeted therapy interferes with the oncogenic pathway in NSCLC, acquired resistance evolves contributing to rapid disease recurrence and progression [4]. Immunotherapy like PD-1 and PD-L1 inhibitors has displayed superiority to traditional chemotherapy, which has become the standard for treating NSCLC [5-7]. Nevertheless, in past clinical trials, the objective response rate of immunotherapy remains low [8]. Transcription factors (TFs), DNA-binding proteins, recognize the promoter sequences of genes as well as subsequently guide gene expression [9]. Recent studies demonstrate that patterns of TF programs as well as immune pathway activation characterize four main SCLC subtypes with diverse treatment vulnerabilities [10]. Additionally, system-epigenomics inference of TF activities indicates inactivated aryl-hydrocarbon-receptor as an important event during lung carcinogenesis [11]. Experimental evidences suggest the lung carcinogenic roles of TFs. For instance, Oct4, a critical stemness TF, controls expression of lncRNAs NEAT1 and MALAT1 for promoting lung carcinogenesis [12]. TF NFIB is in relation to tumor aggressiveness in LUAD [13]. TF OCT1 triggers the Warburg effect through upregulating hexokinase 2 in NSCLC [14]. TF YY1 mediates lung cancer progression through activation of lncRNA-PVT1 [15]. Hence, characterization of the genomic profiling of TFs may assist in offering an in-depth understanding of the precise treatment of lung cancer. Herein, we developed a TF genomic model for lung cancer outcomes and uncovered the relevant signaling pathways.

2. Materials and Methods

2.1. Public mRNA Expression Cohorts

The Cancer Genome Atlas- (TCGA-) LUAD and TCGA-LUSC datasets containing mRNA expression profiling, survival and clinicopathologic data (n = 522) were curated from the Genomic Data Commons tool (https://portal.gdc.cancer.gov/) [16], as a training set. Fragments per kilobase million (FPKM) were converted to transcripts per million (TPM). Gene expression arrays of 442 LUAD patients were harvested from GSE72094 [17] via the Gene Expression Omnibus (GEO) repository (https://www.ncbi.nlm.nih.gov/gds/), which acted as a testing set. This dataset was based on the GPL15048 platform. Probe IDs were transformed into gene symbols in accordance with platform annotation file. The standardized expression value was logarithmically converted as well as scaled. Thereafter, the mean expression of genes with various probes was utilized as their expression value. The corresponding patient's clinical data in TCGA and GSE72094 cohorts are displayed in Supplementary table 1. The gene sets of TFs were harvested from published literature (Supplementary table 2) [9].

2.2. Analyses of Lung Cancer-Specific TFs

DESeq2 package [18] was adopted for differential analyses of sequencing data. TFs with ∣log2 fold change (FC) | >1 and adjusted p < 0.05 were set as thresholds of lung cancer-specific TFs. Volcano plots and heat map were conducted for visualizing the distribution of lung cancer-specific TFs between lung cancer and normal specimens.

2.3. Construction of a TF Genomic Model

Univariate Cox regression models were conducted for evaluation of the interactions of lung cancer-specific TFs with patients' prognosis in the TCGA cohort. Thereafter, stepwise multivariate Cox regression analyses were utilized for shrinking the variables as well as screening the most predictive biomarkers. The candidate TF-specific TFs were utilized for creating a risk scoring formula that was determined through a linear integration of mRNA expression as well as matched regression coefficients derived from the stepwise multivariate Cox analyses. Through ranking of the risk scoring system, patients were stratified into high- and low-risk subpopulations. Kaplan–Meier methods were presented for estimating the survival outcomes between high- and low-risk subpopulations, and log-rank tests were adopted for the calculation of the discrepancy in prognosis between subpopulations. The receiver operator characteristic (ROC) curve was depicted through the pROC package [19]. The prognostic potency of the TF genomic model was externally verified in the GSE72094 cohort.

2.4. Uni- and Multivariate Cox Regression Analyses

Through univariate Cox regression analyses, the interactions of clinicopathological indicators (age, gender, and pathological staging) and TF genomic model with lung cancer prognosis were evaluated across lung cancer individuals. Thereafter, multivariate Cox regression models were established for uncovering the independent prognostic indicators.

2.5. Gene Set Enrichment Analyses (GESA)

Through the Java program (http://software.broadinstitute.org/gsea/index.jsp), GSEA [20] was carried out on the basis of the “c2.cp.kegg.v7.2.symbols” gene set curated from the Molecular Signatures Database (MSigDB) project [21]. Cytoscape software [22] was adopted for visualizing our GSEA results. The interactions of specific gene sets with risk score for all genes were investigated, and positively and negatively correlated ones to the enrichment score were calculated. In total, 1000 permutations were conducted, and pathways with false discovery rate (FDR) < 0.05 were regarded as having prominent enrichment.

2.6. Prediction of Downstream Targets of TFs

Three web-based interactive applications, containing ChIP Enrichment Analysis (ChEA; http://amp.pharm.mssm.edu/lib/chea.jsp) [23], JASPAR (http://jaspar.genereg.net), and MotifMap (http://motifmap.igb.uci.edu/) databases, were adopted for estimating the downstream targets of TFs. The ChEA project offers data on eukaryotic TFs, consensus binding sequences, and experimentally validated binding sites as well as modulated genes [24]. The JASPAR project represents an open-access project of curated, nonredundant TF-binding profiling stored as a position frequency matrix for TFs among diverse species in six taxonomic populations [25]. MotifMap provides integrative genome-wide maps of regulated motif sites for model species [26].

2.7. Functional Enrichment Analyses

Gene Ontology (GO) as well as Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were carried out for determining the biological functions of downstream targets of TFs through the clusterProfiler package [27]. GO depicted three complementary biological concepts containing biological process (BP) and molecular function (MF) as well as cellular component (CC). Meanwhile, KEGG may assist uncover high-level functions and utilities of biological systems.

2.8. Statistical Analyses

All statistical analyses were managed through R software (version 3.6.3). Student's t-test or Wilcoxon test was adopted for statistical comparisons, with p < 0.05 as statistical significance.

3. Results

3.1. Identification of Lung Cancer-Specific TFs

This study retrospectively curated mRNA expression profiling, survival, and clinicopathologic data of lung cancer from the TCGA project. With the ∣log2 FC | >1 and adjusted p < 0.05 thresholds, 320 upregulated TFs as well as 103 downregulated TFs were determined in lung cancer in comparison with normal tissues (Figures 1(a) and 1(b); Supplementary table 3). The above deregulated TFs were regarded as lung cancer-specific TFs.
Figure 1

Identification of lung cancer-specific TFs utilizing mRNA expression profiling from TCGA cohort. (a) Volcano plots display lung cancer-specific TFs through comparison of differential TFs between lung cancer and normal specimens in accordance with ∣log2 FC | >1 and adjusted p < 0.05 thresholds. Red dots are indicative of upregulated TFs, while green dots are indicative of downregulated TFs. x-axis represents log2 FC while y-axis represents -log10 (adjusted p). (b) Hierarchical clustering analyses show the deregulated expression patterns of lung cancer-specific TFs in lung cancer (T) and normal (N) specimens. x-axis represents each sample while y-axis represents the RNA expression of TFs.

3.2. Determination of Prognostic Lung Cancer-Specific TFs

On the basis of mRNA expression profiling and survival information of lung cancer patients from the TCGA cohort, we conducted the interactions of lung cancer-specific TFs with clinical prognosis through univariate Cox regression models. In accordance with p < 0.05, 13 lung cancer-specific TFs displayed remarkable associations with lung cancer outcomes (Figure 2(a); Table 1). Among them, GFI1B, HLF, and ZNF750 acted as protective factors of lung cancer prognosis. Meanwhile, CTCFL, TFAP2A, CENPA, VAX1, E2F7, FOXM1, SATB2, ARNTL2, NPAS2, and HMGA1 acted as risk factors of lung cancer prognosis.
Figure 2

Development of a TF genomic model for lung cancer prognosis in TCGA cohort. (a) Forest diagram displays the prognostic lung cancer-specific TFs through univariate Cox regression analyses. Vertical dashed line indicates HR = 1. Green represents HR < 1 while red represents HR > 1. (b) Risk score of lung cancer patients is quantified in line with the regression coefficient derived from multivariate Cox regression models and expression value of SATB2, HLF, and NPAS2. Vertical dashed line expresses the grouping cutoff. Red dots represent high-risk specimens, while blue dots represent low-risk specimens. (c) Scatter diagram displays the survival duration and status of lung cancer patients that are ranked by risk score. Vertical dashed line expresses the grouping cutoff. Red triangle is indicative of high-risk specimen, while blue triangle is indicative of low-risk specimen. (d) Hierarchical clustering analyses present the expression patterns of SATB2, HLF, and NPAS2 in two groups. Red indicates high expression, and blue indicates low expression. (e) Survival analyses are conducted between high- and low-risk groups, and survival difference is estimated with log-rank test. (f) ROC curves are conducted on the basis of risk score for lung cancer outcomes.

Table 1

Univariate Cox regression models identify prognostic lung cancer-specific TFs.

TFsHRLower 95% CIUpper 95% CI P value
ARNTL21.3022811.1694751.4501681.49E-06
VAX11.2102741.1318591.2941222.35E-08
CENPA1.199741.0847741.3268890.000395
SATB21.2784891.1121241.4697410.000552
GFI1B0.7764270.6836370.8818129.75E-05
FOXM11.2721981.1392851.4206171.90E-05
E2F71.2638521.1358981.4062211.71E-05
HLF0.862930.8010830.9295520.000102
TFAP2A1.1887191.0922551.2937026.24E-05
ZNF7500.8810950.8217780.9446940.000371
HMGA11.3564361.1841811.5537471.08E-05
NPAS21.3028381.1436881.4841356.90E-05
CTCFL1.1296071.0568551.2073670.000333

3.3. Development of a TF Genomic Model for Lung Cancer Prognosis

Further multivariate Cox regression analyses displayed that SATB2, HLF, and NPAS2 were independently predictive of lung cancer prognosis. In accordance with the regression coefficient derived from multivariate Cox regression models and expression value of SATB2, HLF, and NPAS2, a TF genomic model was conducted for lung cancer prognosis. The risk score of each lung cancer patient in the TCGA cohort was calculated in line with the following formula: risk score = 0.215678015946362∗SATB2 expression + (−0.133926273065041)∗HLF expression + 0.22354783465585∗NPAS2 expression. Thereafter, lung cancer patients in the TCGA cohort were separated into two groups following the median risk score. Patients with >median risk score were classified into the high-risk group, while those with

3.4. External Verification of the TF Genomic Model

The clinical application potential of the TF genomic model was externally verified in the GSE72094 cohort. In accordance with the same formula, the risk score of each lung cancer patient was quantified. Thereafter, we clustered patients into high- and low-risk groups (Figure 3(a)). As expected, more dead patients were noted in the high-risk group (Figure 3(b)). There was enhanced expression of HLF as well as weakened expression of SATB2 and NPAS2 in the low-risk group (Figure 3(c)). Additionally, the high risk score presented more undesirable survival outcomes (Figure 3(d)). ROC curves confirmed the favorable predictive potency of the TF genomic model in lung cancer outcomes (AUC = 0.619; Figure 3(e)).
Figure 3

External verification of the TF genomic model for lung cancer prognosis in GSE72094 cohort. (a) Risk score distribution of lung cancer patients is displayed in line with the regression coefficient derived from multivariate Cox regression models and expression value of SATB2, HLF, and NPAS2. Vertical dashed line shows the grouping cutoff. Red dots are indicative of high-risk specimens, while blue dots are indicative of low-risk specimens. (b) Scatter plots present the survival duration and status of lung cancer patients that are ranked by risk score. Vertical dashed line represents the grouping cutoff. Red triangle expresses high-risk individual, while blue triangle expresses low-risk individual. (c) Hierarchical clustering analyses show the expression patterns of SATB2, HLF, and NPAS2 in two groups. Red represents upregulated expression, while blue represents downregulated expression. (d) Survival analyses are carried out between groups, and survival difference is estimated through log-rank test. (e) ROC curves display the predictive potency of risk score for lung cancer outcomes.

3.5. Analyses and Verification of the TF Genomic Model as an Independent Prognostic Indicator of Lung Cancer

In the TCGA cohort, uni- and multivariate Cox regression models were presented for investigation of the interaction of conventional clinicopathological indicators and risk score with lung cancer outcomes. As a result, staging as well as risk score was independently predictive of patients' prognosis (Figures 4(a) and 4(b)). The predictive independency was externally verified in the GSE72094 cohort. Our data confirmed that gender and staging as well as risk score were independent risk factors of lung cancer outcomes (Figures 4(c) and 4(d)).
Figure 4

Analyses and verification of the TF genomic model as an independent prognostic indicator of lung cancer. (a) Forest diagram depicts the interactions of age, gender, and staging as well as risk score with lung cancer outcomes through univariate Cox regression models in TCGA cohort. (b) Forest diagram presents the independency of stage and risk score in prediction of lung cancer prognosis in TCGA cohort. (c) Forest diagram shows the associations of age, gender, staging, and risk score with lung cancer prognosis in GSE72094 cohort. (d) Forest diagram displays the independency of gender and staging as well as risk score in predicting patients' outcomes in GSE72094 cohort.

3.6. Analyses of the TF Genomic Model Associated with Activation of Signaling Pathways

GSEA was conducted for investigating the activated signaling pathways correlated to the TF genomic model. In accordance with FDR < 0.05, ECM receptor interaction, small cell lung cancer, axon guidance, chronic myeloid leukemia, and adherens junction as well as regulation of actin cytoskeleton were remarkably activated in high-risk specimens (Figures 5(a)–5(f)). Additionally, clusters of relevant genes linked to the high risk score were determined, containing genes relating to thyroid cancer, type diabetes mellitus, valine leucine isoleucine, and ECM receptor interaction as well as alpha linolenic acid (Figure 6).
Figure 5

Analyses of the TF genomic model associated with activation of signaling pathways. (a–f) GSEA results present activated KEGG signaling pathways in high-risk specimens, containing (a) ECM receptor interaction, (b) small cell lung cancer, (c) axon guidance, (d) chronic myeloid leukemia, (e) adherens junction, and (f) regulation of actin cytoskeleton.

Figure 6

GSEA of KEGG pathways correlated to the TF genomic model. Enriched gene set is expressed through each node that is grouped in accordance with the similarity for creating a network. The size of each node displays the positive association with the number of genes as well as the thickness of line-interacting nodes indicating the proportions of common genes between nodes.

3.7. Determination of Downstream Targets of TFs: SATB2, HLF, and NPAS2

Through integration analyses of the ChEA, JASPAR, and MotifMap databases, we determined 307 downstream targets of HLF, 4 downstream targets (CRY2, PER1, PER2, and CRY1) of NPAS2, and 2 downstream targets (UPF3B and TP63) of SATB2 (Supplementary table 4). GO enrichment analyses uncovered that the rhythmic process was remarkably enriched by downstream targets (Figure 7(a); Table 2). Additionally, we noted that the above downstream targets presented remarkable interactions with circadian rhythm and immune activation pathways (like allograft rejection, T cell receptor signaling pathway, PD-L1 expression and PD-1 checkpoint pathway in cancer, inflammatory mediator regulation of TRP channels, and inflammatory bowel disease) as well as carcinogenic pathways (like transcriptional misregulation in cancer, FoxO signaling pathway, non-small cell lung cancer, MAPK signaling pathway, AMPK signaling pathway, Rap1 signaling pathway, and apoptosis; Figure 7(b) and Table 3).
Figure 7

Functional enrichment analyses of downstream targets of TFs: SATB2, HLF, and NPAS2. (a) GO enrichment results enriched by downstream targets of three TFs. (b) KEGG pathway enrichment results enriched by downstream targets of three TFs.

Table 2

GO enrichment results enriched by downstream targets of three TFs.

DescriptionGene ratioBgRatio p valueFDR q valueCount
Rhythmic process20/277294/188621.48E-083.08E-052.45E-0520
Circadian rhythm17/277212/188621.67E-083.08E-052.45E-0517
Circadian regulation of gene expression10/27767/188624.71E-085.21E-054.14E-0510
Anterior/posterior pattern specification16/277203/188625.64E-085.21E-054.14E-0516
Regionalization20/277326/188628.22E-086.07E-054.83E-0520
Positive regulation of protein localization to cell periphery9/27767/188625.67E-070.0003490.0002779
Regulation of circadian rhythm11/277118/188621.34E-060.0006650.00052911
Pattern specification process21/277426/188621.44E-060.0006650.00052921
Entrainment of circadian clock by photoperiod6/27729/188623.40E-060.0013940.0011086
Muscle organ development17/277317/188624.90E-060.0017160.00136517
Early endosome membrane11/283162/195202.53E-050.0088470.00816811
Early endosome15/283378/195200.000440.0769410.0710415
Nuclear speck14/283411/195200.0028340.2551650.23559614
Presynaptic active zone5/28370/195200.0034720.2551650.2355965
Recycling endosome8/283190/195200.0066570.2551650.2355968
Mitotic spindle7/283157/195200.0080670.2551650.2355967
Recycling endosome membrane5/28387/195200.0086950.2551650.2355965
Catalytic step 2 spliceosome5/28387/195200.0086950.2551650.2355965
Cell-substrate junction13/283423/195200.0091080.2551650.23559613
Ribonucleoprotein granule9/283244/195200.0094820.2551650.2355969
DNA-binding transcription activator activity, RNA polymerase II-specific19/285443/183376.93E-050.0081150.00744819
DNA-binding transcription activator activity19/285447/183377.80E-050.0081150.00744819
E-box binding6/28547/183378.41E-050.0081150.0074486
Cytokine receptor binding14/285270/183378.94E-050.0081150.00744814
Transcription corepressor binding5/28530/183379.07E-050.0081150.0074485
DNA-binding transcription repressor activity, RNA polymerase II-specific15/285307/183379.81E-050.0081150.00744815
DNA-binding transcription repressor activity15/285309/183370.0001050.0081150.00744815
Protein-cysteine S-palmitoyltransferase activity4/28528/183370.0008720.0522060.0479194
Protein-cysteine S-acyltransferase activity4/28528/183370.0008720.0522060.0479194
Tumor necrosis factor receptor binding4/28531/183370.0012920.0696130.0638964
Table 3

KEGG pathway enrichment results enriched by downstream targets of three TFs.

DescriptionGene ratioBgRatio p valueFDR q valueCount
Circadian rhythm10/12631/81011.94E-114.71E-093.93E-0910
Allograft rejection5/12638/81010.000280.0338380.0282595
Oxytocin signaling pathway9/126154/81010.0006360.0380620.0317879
Phosphatidylinositol signaling system7/12697/81010.0007510.0380620.0317877
Transcriptional misregulation in cancer10/126192/81010.0007860.0380620.03178710
FoxO signaling pathway8/126131/81010.0009590.0386850.0323088
Neurotrophin signaling pathway7/126119/81010.0024720.0652250.0544727
Longevity regulating pathway6/12689/81010.0025470.0652250.0544726
Longevity regulating pathway—multiple species5/12662/81010.0026790.0652250.0544725
Hypertrophic cardiomyopathy6/12690/81010.0026950.0652250.0544726
Non-small cell lung cancer5/12672/81010.0051210.0959970.0801715
Inositol phosphate metabolism5/12673/81010.0054310.0959970.0801715
T cell receptor signaling pathway6/126104/81010.005510.0959970.0801716
Fluid shear stress and atherosclerosis7/126139/81010.0058360.0959970.0801717
MAPK signaling pathway11/126294/81010.005950.0959970.08017111
Insulin resistance6/126108/81010.0066080.0999530.0834766
AMPK signaling pathway6/126120/81010.0108570.1545560.1290776
Asthma3/12631/81010.0120220.1571270.1312243
PD-L1 expression and PD-1 checkpoint pathway in cancer5/12689/81010.0123360.1571270.1312245
GnRH signaling pathway5/12693/81010.014710.1646440.1375025
Rap1 signaling pathway8/126210/81010.0164730.1646440.1375028
Circadian entrainment5/12697/81010.0173730.1646440.1375025
Inflammatory mediator regulation of TRP channels5/12698/81010.0180860.1646440.1375025
Inflammatory bowel disease4/12665/81010.0182120.1646440.1375024
Lipid and atherosclerosis8/126215/81010.0187150.1646440.1375028
Apoptosis6/126136/81010.0191380.1646440.1375026
African trypanosomiasis3/12637/81010.0194290.1646440.1375023
Insulin signaling pathway6/126137/81010.0197670.1646440.1375026
Yersinia infection6/126137/81010.0197670.1646440.1375026
Estrogen signaling pathway6/126138/81010.020410.1646440.1375026

4. Discussion

High-throughput sequencing technologies may assist in determining more biomarkers that present close interactions with patients' outcomes at the genetic levels. Herein, we proposed a 3-TF genomic model linked to lung cancer progression through conducting reliable bioinformatic analyses. Additionally, the 3-TF genomic model acted as an independent molecular marker for prediction of lung cancer patients' survival outcomes. Our findings might be of great significance to elucidate the underlying biological mechanisms of lung carcinogenesis as well as to develop innovative prognostic indicators and molecular therapeutic targets. Previously, a 7-TF gene model has been established for prediction of colon adenocarcinoma outcomes [28]. A 9-TF signature can be predictive of breast cancer recurrence for optimizing clinical management [29]. Additionally, Yang et al. proposed a TF-based prognostic signature that reliably predicts endometrial cancer individuals' survival outcomes [30]. Here, through univariate analyses followed by stepwise multivariate Cox regression analyses, we developed a TF genomic model for lung cancer outcomes. In accordance with the formula, a TF genomic model was conducted, containing SATB2, HLF, and NPAS2. ROC curves confirmed the reliability of this model in prediction of patients' prognosis. Following integration of clinicopathological indicators, the model was independently predictive of clinical prognosis. To our knowledge, the 3-TF genomic model's potential as a predictor has not been proposed in previous evidences, though research might offer a novel guide of lung cancer outcomes. In routine clinical practice, pathological staging acts as an important survival determinant concerning oncologists as well as lung cancer individuals. Nevertheless, diverse patients' survival outcomes with the same staging are indicative of the deficient pathological staging system for prognosis on the basis of the anatomic scope and staging system of the disease, in which pathological changes reflect the biological heterogeneity within lung cancer. The issues influence the predictive potency of the conventional system for lung cancer individuals. Our GSEA uncovered that the 7-TF genomic model presented remarkable interactions with ECM receptor interaction, SCLC, axon guidance, chronic myeloid leukemia, and adherens junction as well as regulation of actin cytoskeleton, indicative of the interactions of the 7-TF genomic model with lung carcinogenesis. Previous evidences suggest the biological significance of SATB2, HLF, and NPAS2 within the TF-based genomic model in lung carcinogenesis. For instance, SATB2 reduces NSCLC invasiveness through modulation of EMT-relevant proteins as well as histone methylation of G9a [31]. Further, hypoxic tumor-derived exosomal miR-31-5p triggers LUAD metastases through negative modulation of SATB2-reversed EMT as well as activation of the MEK/ERK pathway [32]. Downregulated HLF facilitates multiple-organ distant metastases of NSCLC via the PPAR/NF-κb pathway NSCLC [33]. Reduced HLF expression is predictive of undesirable clinical outcomes of LUAD [34]. NPAS2 polymorphism independently predicts NSCLC patients' prognosis [35]. Further analyses determined the downstream targets of TFs: SATB2, HLF, and NPAS2. Our further biological function analyses demonstrated the interactions of these downstream targets with circadian rhythm and immune activation pathways (like allograft rejection, T cell receptor signaling pathway, PD-L1 expression and PD-1 checkpoint pathway in cancer, inflammatory mediator regulation of TRP channels, and inflammatory bowel disease) and carcinogenic pathways (like transcriptional misregulation in cancer, FoxO signaling pathway, NSCLC, MAPK signaling pathway, AMPK signaling pathway, Rap1 signaling pathway, and apoptosis), indicating that SATB2, HLF, and NPAS2 modulated the above pathways to participate in lung carcinogenesis. Nevertheless, a few limitations of this study need to be pointed out. Our findings were primarily on the basis of integrated bioinformatic analyses. However, sufficient experimental verification of our results remains lacking. In future studies, we will conduct in-depth in vitro and in vivo experiments to verify our conclusion. Because all patients were retrospectively harvested, the underlying bias linked to unbalanced clinicopathological characteristics cannot be ignored. Additionally, the reliability of the 3-TF genomic model for predicting survival outcomes of lung cancer individuals remains a key issue in the clinic. In particular, the guideline for the clinical application of our 3-TF genomic model requires an in-depth definition in our future studies.

5. Conclusion

Collectively, our findings proposed and verified a 3-TF genomic model (SATB2, HLF, and NPAS2) for prediction of lung cancer outcomes. This genomic model acted as an independent indicator as well as a complement prognostic factor for clinicopathological features of lung cancer.
  35 in total

1.  SATB2 suppresses non-small cell lung cancer invasiveness by G9a.

Authors:  Yi-Nan Ma; Hai-Yan Zhang; Liang-Ru Fei; Mei-Yu Zhang; Cong-Cong Wang; Yuan Luo; Yu-Chen Han
Journal:  Clin Exp Med       Date:  2017-06-30       Impact factor: 3.984

2.  Atezolizumab versus docetaxel in patients with previously treated non-small-cell lung cancer (OAK): a phase 3, open-label, multicentre randomised controlled trial.

Authors:  Achim Rittmeyer; Fabrice Barlesi; Daniel Waterkamp; Keunchil Park; Fortunato Ciardiello; Joachim von Pawel; Shirish M Gadgeel; Toyoaki Hida; Dariusz M Kowalski; Manuel Cobo Dols; Diego L Cortinovis; Joseph Leach; Jonathan Polikoff; Carlos Barrios; Fairooz Kabbinavar; Osvaldo Arén Frontera; Filippo De Marinis; Hande Turna; Jong-Seok Lee; Marcus Ballinger; Marcin Kowanetz; Pei He; Daniel S Chen; Alan Sandler; David R Gandara
Journal:  Lancet       Date:  2016-12-13       Impact factor: 79.321

3.  Pembrolizumab versus docetaxel for previously treated, PD-L1-positive, advanced non-small-cell lung cancer (KEYNOTE-010): a randomised controlled trial.

Authors:  Roy S Herbst; Paul Baas; Dong-Wan Kim; Enriqueta Felip; José L Pérez-Gracia; Ji-Youn Han; Julian Molina; Joo-Hang Kim; Catherine Dubos Arvis; Myung-Ju Ahn; Margarita Majem; Mary J Fidler; Gilberto de Castro; Marcelo Garrido; Gregory M Lubiniecki; Yue Shentu; Ellie Im; Marisa Dolled-Filhart; Edward B Garon
Journal:  Lancet       Date:  2015-12-19       Impact factor: 79.321

4.  The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Authors:  Arthur Liberzon; Chet Birger; Helga Thorvaldsdóttir; Mahmoud Ghandi; Jill P Mesirov; Pablo Tamayo
Journal:  Cell Syst       Date:  2015-12-23       Impact factor: 10.304

5.  Patterns of transcription factor programs and immune pathway activation define four major subtypes of SCLC with distinct therapeutic vulnerabilities.

Authors:  Carl M Gay; C Allison Stewart; Elizabeth M Park; Lixia Diao; Sarah M Groves; Simon Heeke; Barzin Y Nabet; Junya Fujimoto; Luisa M Solis; Wei Lu; Yuanxin Xi; Robert J Cardnell; Qi Wang; Giulia Fabbri; Kasey R Cargill; Natalie I Vokes; Kavya Ramkumar; Bingnan Zhang; Carminia M Della Corte; Paul Robson; Stephen G Swisher; Jack A Roth; Bonnie S Glisson; David S Shames; Ignacio I Wistuba; Jing Wang; Vito Quaranta; John Minna; John V Heymach; Lauren Averett Byers
Journal:  Cancer Cell       Date:  2021-01-21       Impact factor: 31.743

6.  Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma.

Authors:  M B Schabath; E A Welsh; W J Fulp; L Chen; J K Teer; Z J Thompson; B E Engel; M Xie; A E Berglund; B C Creelan; S J Antonia; J E Gray; S A Eschrich; D-T Chen; W D Cress; E B Haura; A A Beg
Journal:  Oncogene       Date:  2015-10-19       Impact factor: 9.867

7.  Systems-epigenomics inference of transcription factor activity implicates aryl-hydrocarbon-receptor inactivation as a key event in lung cancer development.

Authors:  Yuting Chen; Martin Widschwendter; Andrew E Teschendorff
Journal:  Genome Biol       Date:  2017-12-20       Impact factor: 13.583

8.  Oct4 transcriptionally regulates the expression of long non-coding RNAs NEAT1 and MALAT1 to promote lung cancer progression.

Authors:  Jayu Jen; Yen-An Tang; Ying-Hung Lu; Che-Chung Lin; Wu-Wei Lai; Yi-Ching Wang
Journal:  Mol Cancer       Date:  2017-06-14       Impact factor: 27.401

9.  JASPAR 2020: update of the open-access database of transcription factor binding profiles.

Authors:  Oriol Fornes; Jaime A Castro-Mondragon; Aziz Khan; Robin van der Lee; Xi Zhang; Phillip A Richmond; Bhavi P Modi; Solenne Correard; Marius Gheorghe; Damir Baranašić; Walter Santana-Garcia; Ge Tan; Jeanne Chèneby; Benoit Ballester; François Parcy; Albin Sandelin; Boris Lenhard; Wyeth W Wasserman; Anthony Mathelier
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.