Literature DB >> 36176415

A pan-cancer analysis of the expression of STAT family genes in tumors and their relationship to the tumor microenvironment.

Min Zhou1, Ping Zhang1, Mengting Da2, Rui Yang3, Yulian Ma4, Jiuda Zhao2, Tao Ma1, Jiazeng Xia5, Guoshuang Shen2, Yu Chen3, Daozhen Chen1,3,4.   

Abstract

Background: The signal transducer and activator of transcription (STAT) protein family, a group of seven members (STAT1, STAT2, STAT3, STAT4, STAT5A, STAT5B, and STAT6), has been widely used to investigate numerous biological functions including cell proliferation, differentiation, apoptosis, and immune regulation. However, not much is known about the role of the STAT family genes in pan-cancer.
Methods: Tumor Immune Estimation Resource (TIMER), Sangerbox, cBioPortal, GSCALite, Xena Shiny, GeneMANIA, Gene Expression Profiling Interactive Analysis (GEPIA), and Metascape were used to analyze the relationship between STAT gene expression, clinical outcome, gene variation, methylation status, pathway activity, tumor immune infiltration, and microenvironment in different cancer types and screened drugs that could potentially influence STATs.
Results: The Cancer Genome Atlas (TCGA) pan-cancer data showed that most STAT family genes were extensively changed in most tumors compared to the adjacent normal tissues. We also found that STAT gene expression could be used to predict patient survival in various cancers. The STAT gene family formed a network of interaction networks that was associated with several pathways. By mining the of Genomics Drug Sensitivity in Cancer (GDSC) database, we discovered a number of potential drugs that might target STAT regulators. Importantly, the close correlation between STATs and immunocell infiltration suggested the important role of dysregulation of STATs in tumor immune escape. Finally, the relation between STAT gene expression and the tumor microenvironment (TME) indicated that the higher expression of STAT regulators, the higher the degree of tumor stem cells.
Conclusion: Considering these genomic alterations and clinical features of STAT family members across cancer types, it will be possible to change the relationship between STATs and tumorigenesis. It was beneficial to treat cancer by targeting these STAT regulators.
Copyright © 2022 Zhou, Zhang, Da, Yang, Ma, Zhao, Ma, Xia, Shen, Chen and Chen.

Entities:  

Keywords:  STAT; pan-cancer; pathway analysis; prognosis; tumor microenvironment

Year:  2022        PMID: 36176415      PMCID: PMC9513395          DOI: 10.3389/fonc.2022.925537

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   5.738


Introduction

The signal transducer and activator of transcription (STAT) protein, which includes seven members (STAT1, STAT2, STAT3, STAT4, STAT5A, STAT5B, and STAT6), has been widely used to analyze different biological functions (1–3). Proteins contribute significantly to the pathogenesis of diseases, including cancer, autoimmune diseases, and infections, and their involvement in many signaling pathways downstream of cytokines, interleukin, and growth factors (4, 5). They function as cytokines, transcription factors, and regulating target genes, regulating the tumor suppressors or oncogenes. In cancers, the activation of STAT genes is often observed and postulated that dysregulation of these factors may contribute to tumor progression at several levels (6, 7). STATs have different expression patterns and physiological functions. Numerous cancer cells, including breast, head and neck, and gastric, are abnormally high in STAT1 (8–10). A high expression of STAT1 is associated with improved clinical outcomes in most studies (11, 12). Despite that, results of other clinical trials with high STAT1 expression in cancer tissues are worse than those with low STAT1 expression in the cancerous tissues (13, 14). Over 70% of human cancers are estimated to have aberrant STAT3 activity (15). In many publications, the association between STAT3 and tumor growth and immune evasion has been clearly documented (16). Among the massive tumors reported, STAT3 dysregulation occurred in bladder, breast, cervix, head and neck, kidney, and stomach (17–22). The mechanism for STAT5 proteins is activated by numerous processes in human cancers, including alterations to epigenetic mechanisms, hormone-regulated transcription factors, proteolytic pathways, gene amplification, and aberrant expression of growth factors (23–25). Activated STAT5 causes oncogenic changes through transcriptional modifications and protein–protein interactions (PPIs). The STAT5 protein provides aberrant responses to DNA damage, invasion, metastasis, and epithelial-to-mesenchymal transition (EMT) (26). Recent studies have shown that STAT6 signaling reduces cancerous growth and/or metastasis in solid tumors of the gastrointestinal tract, the breast, the lung, and the prostate, suggesting that STAT6 signaling could prevent these cancers (27–30). Historically, there are limited numbers of reports relating STAT2 and STAT4 dysregulation with the clinical characteristics and prognosis of human cancer. The role of STAT family genes and their mechanisms of action are still not fully explained. The family genes for STAT were previously studied but only for the individual cancer type and did not include multitypes of cancer compared. In this pan-cancer study, we attempted to establish a role of STATs in different cancer types and determine the cellular mechanisms and functions of each STAT family gene and its interacting molecules in carcinogenesis. Our study examined the interrelationships between STAT expression, clinical outcome, gene variation, methylation status, pathway activity, immune infiltration, microenvironment of different cancers, and the potential impact of drugs on STATs. The workflow chart of this study was summarized in .
Figure 1

Workflow chart of the study.

Workflow chart of the study.

Methods

Gene expression and survival analysis

The TIMER database (https://cistrome.shinyapps.io/timer/) includes 10,897 samples across 32 cancer types from The Cancer Genome Atlas (TCGA) to estimate the STAT family gene expression patterns (31, 32). We analyzed the differential expression of any gene of interest in tumors and adjacent normal tissues using the Diff Exp module provided by TCGA. In box plots, gene expression levels were captured, in Wilcoxon tests being used to determine if there were significant differences between levels. Based on the expression status of the STATs family, we performed a survival analysis using the Sangerbox database (http://vip.sangerbox.com/home.html). A median gene RNA-seq by Expectation Maximization (RSEM) value was used to classify tumor samples into high and low groups. The overall survival (OS) outcomes were analyzed by calculating the Hazard Ratio (HR) and 95% CI, along with a log-rank p-value.

Genomic alteration analysis

cBioPortal (http://www.cbioportal.org/) was used to analyze genomic changes to STAT genes in pan-cancer (33). All types of Pan-Cancer Atlas available (32 studies and 10,967 samples) were selected for calculation. According to standard software parameters, altered frequencies of each gene were determined and the fraction of genome altered and survival curves were calculated.

Gene set cancer analysis

By using GSCALite (http://bioinfo.life.hust.edu.cn/web/GSCALite/), the Genotype-Tissue Expression (GTEx) database was analyzed for expression levels of STAT family genes in each tissue (34). The STAT family genes were investigated for single-nucleotide variations (SNVs), copy number variations (CNVs), methylation, pathway function, and drug sensitivity. In selected cancer types, the SNV module displays the frequency and variant types of each gene set. The statistics of heterozygous and homozygous CNV of each cancer type were displayed as pie charts for gene sets on the CNV module. For selected cancer types, the Methylation module examined differences in methylation between tumors and matching normal tissue, the relationship between methylation and expression, and the impact of methylation on OS. The Pathway Activity module displayed differences in gene expression across pathway activity categories (activation and inhibition), which were determined by pathway scores. Drug sensitivity and gene expression profiling data from cancer cell lines in GDSC were combined in the Drug Sensitivity module. Spearman’s correlation analysis was used to compare the expression of each gene in the gene set to the sensitivity to small molecules and drugs [50% inhibiting concentration (IC50)].

Immune cell infiltration

With Xena Shiny (https://shiny.hiplot.com.cn/ucsc-xena-shiny/), we could analyze immune cell infiltration across 32 cancer types from TCGA. We used the module TCGA: Associations between Molecular Profile and Immune Signature module of Xena Shiny to evaluate the interrelationships between STAT expression in 20 immune cells and their infiltration by using “CIBERSORT.” Pearson’s correlation coefficient of gene and immune infiltration score in each tumor was also generated.

Tumor microenvironment analysis

The stromal and immune scores are positively related to the stromal and immune components in the tumor microenvironment (TME). In light of estimate scores, a composite score combining stromal scores and immune scores can determine the relative proportion of stromal and immune components in the TME. Sangerbox tool was used to calculate stromal, immune, and estimate scores. Spearman’s correlation coefficient of gene and immune infiltration score in each tumor was also generated by the Sangerbox tool. The Spearman correlation test was used to determine the association between tumor stemness and each STAT expression. An RNA stemness score (RNAss) based on mRNA expression and a DNA stemness score (DNAss) based on DNA methylation patterns were used in this pan-cancer study to measure tumor stemness.

GeneMANIA analysis

GeneMANIA was used in bioinformatics methods to predict functions for genes or gene lists, as well as to construct PPI networks (35).

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment

To perform enrichment analyses, related genes of STAT families were discovered using GEPIA (http://gepia.cancer-pku.cn/index.html) (36). Metascape (http://metascape.org) was used to analyze the function of STAT members and related genes (37). Gene Ontology (GO) analysis in Metascape could identify STATs and the similar genes by three categories, including molecular function (MF), cellular component (CC), and biological process (BP). By analyzing the Kyoto Encyclopedia of Genes and Genomes (KEGG), we identified the signaling pathways associated with STAT factors, as well as related genes.

Statistical analysis

The Spearman’s correlation test or the Pearson’s correlation test was used to analyze correlations. Cox proportional hazards models were calculated to determine survivorship risk and HR. Each variable was analyzed using survival plots and compared with log-rank tests. Once two sets of data were detected, a p-value of 0.05 was declared significant.

Results

Expression of STAT genes that were extensively changed in pan-cancer

We examined the expression levels of STAT family genes in all 33 cancer types available in TCGA pan-cancer data. Compared to other adjacent normal tissues, STATs tended to be extensively changed, suggesting a statistically significant difference in nearly half of all pan-cancers. STAT1 was highly expressed in bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cholangiocarcinoma (CHOL), colon adenocarcinoma (COAD), esophageal adenocarcinoma (ESCA), head and neck squamous cell carcinoma (HNSC), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), stomach adenocarcinoma (STAD), thyroid carcinoma (THCA), and uterine corpus endometrial carcinoma (UCEC) and poorly expressed in kidney chromophobe (KICH), and the difference was statistically significant ( ). STAT2 was significantly upregulated in BLCA, CHOL, ESCA, HNSC, kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), LIHC, LUAD, LUSC, prostate adenocarcinoma (PRAD), and THCA but significantly downregulated in CHOL ( ). STAT3 expression remained high in CHOL, ESCA, HNSC, and STAD and low in BLCA, BRCA, COAD, KICH, KIRC, KIRP, LUAD, LUSC, and PRAD, and the difference was statistically significant ( ). STAT4 was significantly upregulated in ESCA, HNSC, KIRC, KIRP, STAD, and THCA but significantly downregulated in BRCA, COAD, KICH, LUSC, and rectum adenocarcinoma (READ) ( ). STAT5A was highly expressed in CHOL, ESCA, HNSC, LIHC, STAD, and THCA and poorly expressed in BLCA, BRCA, KICH, LUAD, LUSC, PRAD, and UCEC, and the difference was statistically significant ( ). STAT5B expression remained high in CHOL and LIHC and low in BLCA, BRCA, KICH, KIRP, LUAD, LUSC, PRAD, and THCA, and the difference was statistically significant ( ). STAT6 was significantly upregulated in CHOL, ESCA, HNSC, KIRC, KIRP, LIHC, STAD, and THCA but significantly downregulated in BLCA, BRCA, COAD, LUAD, LUSC, PRAD, and UCEC ( ).
Figure 2

Differential expression of STAT genes in human normal organ tissues. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6. *p < 0.05, **p < 0.01, ***p < 0.001.

Differential expression of STAT genes in human normal organ tissues. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6. *p < 0.05, **p < 0.01, ***p < 0.001.

Prognostic potential of STAT family genes in pan-cancer

To investigate the prognostic value of the seven STAT factors in pan-cancer, survival analysis was performed. High expression of STAT1 was significantly linked with the shortened OS in glioma (GBMLGG) [HR = 1.72 (1.54, 1.92), p = 9.9e-22], brain lower-grade glioma (LGG) [HR = 1.84 (1.56, 2.17), p = 2.1e-13], LUAD [HR = 1.17 (1.01, 1.35), p = 0.04], KIRP [HR = 1.54 (1.18, 2.01), p = 1.6e-3], pan-kidney cohort (KICH+KIRC+KIRP) (KIPAN) [HR = 1.25 (1.12, 1.38), p = 5.0e-5], uveal melanoma (UVM) [HR = 1.37 (1.09, 1.70), p = 4.4e-3], acute myeloid leukemia (LAML) [HR = 1.22 (1.06, 1.41), p = 5.3e-3], and adrenocortical carcinoma (ACC) [HR = 1.79 (1.24, 2.59), p = 2.0e-3], while lower STAT1 expression was significantly associated with lower OS rates in sarcoma (SARC) [HR = 0.82 (0.68, 0.98), p = 0.03], skin cutaneous melanoma (SKCM) [HR = 0.78 (0.71, 0.85), p = 1.1e-7], and ovarian serous cystadenocarcinoma (OV) [HR = 0.86 (0.79, 0.94), p = 1.3e-3] ( ). Higher STAT2 expression was significantly associated with poorer OS in GBMLGG [HR = 1.71 (1.45, 2.01), p = 2.5e-10], LGG [HR = 1.95 (1.55, 2.46), p = 3.3e-8], KIPAN [HR = 1.69 (1.44, 1.98), p = 3.1e-10], KIRC [HR = 1.62 (1.32, 1.99), p = 4.9e-6], LAML [HR = 1.22 (1.05, 1.42), p = 7.5e-3], and ACC [HR = 1.81 (1.08, 3.04), p = 0.02] ( ). GBMLGG, LGG, KIPAN, glioblastoma multiforme (GBM), and UVM patients with high STAT3 expression had worse OS [HR = 2.50 (1.98, 3.15), p = 1.4e-14; HR = 2.56 (1.84, 3.56), p = 4.0e-8; HR = 1.20 (1.03, 1.41), p = 0.02; HR = 1.49 (1.08, 2.06), p = 0.01; HR = 1.88 (1.13, 3.14), p = 0.01, respectively] than those with low STAT3 expression, while colon adenocarcinoma/rectum adenocarcinoma esophageal carcinoma (COADREAD) and SKCM patients with low STAT3 expression had worse OS [HR = 0.71 (0.51, 0.99), p = 0.04; HR = 0.79 (0.67, 0.93), p = 6.1e-3), respectively] than those with high STAT3 expression ( ). High expression of STAT4 was significantly linked with the shortened OS in KIRP [HR = 1.50 (1.15, 1.96), p = 3.2e-3], KIPAN [HR = 1.30 (1.17, 1.45), p = 1.4e-6], and GBM [HR = 1.28 (1.07, 1.53), p = 7.4e-3], while lower STAT4 expression was significantly associated with lower OS rates in BRCA [HR = 0.88 (0.79, 0.98), p = 0.02], SARC [HR = 0.85 (0.74, 0.98), p = 0.02], SKCM [HR = 0.81 (0.75, 0.88), p = 1.3e-7], OV [HR = 0.88 (0.80, 0.96), p = 6.0e-3], and pancreatic adenocarcinoma (PAAD) [HR = 0.82 (0.69, 0.98), p = 0.03] ( ). Higher STAT5A expression was significantly associated with poorer OS in GBMLGG [HR = 1.86 (1.61, 2.15), p = 4.7e-17], LGG [HR = 1.79 (1.46, 2.18), p = 6.3e-9], and testicular germ cell tumors (TGCTs) [HR = 7.02 (1.07, 46.31), p = 0.04], while low expression of STAT5A was significantly associated with lower OS rates in SARC [HR = 0.64 (0.49, 0.82), p = 4.9e-4], KIRP [HR = 0.63 (0.46, 0.87), p = 5.8e-3], HNSC [HR = 0.86 (0.75, 0.98), p = 0.03], SKCM [HR = 0.86 (0.75, 0.99), p = 0.04], mesothelioma (MESO) [HR = 0.54 (0.33, 0.89), p = 0.01], and UVM [HR = 0.52 (0.28, 0.99), p = 0.05] ( ). GBMLGG, KIRC, SKCM, and PAAD patients with low STAT5B expression had worse OS [HR = 0.51 (0.41,0.64), p = 5.4e-9; HR = 0.67 (0.56, 0.80), p = 1.2e-5; HR = 0.75 (0.61, 0.91), p = 3.6e-3; HR = 0.77 (0.60, 0.97), p = 0.03, respectively] than those with high STAT5B expression ( ). High expression of STAT6 was significantly linked with the shortened OS in GBMLGG [HR = 1.99 (1.68, 2.37), p = 4.8e-15], LGG [HR = 1.97 (1.53, 2.52), p = 1.2e-7], UVM [HR = 2.83 (1.34, 5.97), p = 5.5e-3], and LAML [HR = 1.39 (1.19, 1.63), p = 3.2e-5], while lower STAT6 expression was significantly associated with lower OS rates in SARC [HR = 0.67 (0.54, 0.84), p = 5.3e-4] and BLCA [HR = 0.83 (0.70, 0.98), p = 0.03) ( ).
Figure 3

Prognostic role of STAT expression in pan-cancer. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6.

Prognostic role of STAT expression in pan-cancer. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6.

Genetic alteration analysis of the STATs in pan-cancer

Using the cBioPortal tool on 10,967 samples in 32 studies, we calculated the mutation frequency of the seven STAT genes. The gene of STAT genes was altered in 1,003 (9%) samples; STAT1, STAT2, STAT3, STAT4, STAT5A, STAT5B, andSTAT6 were altered in 2.3%, 1.7%, 2.0%, 2.4%, 1.6%, 1.9%, and 1.9% of the demanded pan-cancer samples ( ). We found that mutations of STAT family genes more frequently occur in melanoma, mature B-cell neoplasms, UCEC, esophagogastric adenocarcinoma, BLCA, NSCLC, COADREAD, CHOL, and cervical squamous cell carcinoma (>10%) ( ). An analysis of SNVs found the highest mutation levels in STATs in tumor tissues in UCEC and SKCM. These mutations increased expression levels in genes. In the greatest number of tumors, STAT4 was correlated with tumor growth, then STAT1, STAT3, STAT6, STAT5B, STAT2, and STAT5A ( ). The alterations were not related to the survival of STAT family gene mutations, and the OS and disease-free survival (DFS) of patients with alterations were not shortened as opposed to those without alterations (all p > 0.05; ).
Figure 4

Alterations of STAT family genes. (A) STAT genetic alterations across TCGA cancer studies. (B) The alteration frequency of STAT genes plotted in pan-cancer with different cancer types. (C) SNV oncoplot using GSCALite website. (D, E) Genetic alterations in STATs were not associated with overall survival (OS) and disease-free survival (DFS).

Alterations of STAT family genes. (A) STAT genetic alterations across TCGA cancer studies. (B) The alteration frequency of STAT genes plotted in pan-cancer with different cancer types. (C) SNV oncoplot using GSCALite website. (D, E) Genetic alterations in STATs were not associated with overall survival (OS) and disease-free survival (DFS).

CNV of STAT genes

According to the CNV pie chart, most types of CNV were heterozygous amplifications and deletions ( ). ACC and TGCT showed heterozygous amplification of STAT6; KIRP displayed STAT5A, STAT5B, and STAT3; and ACC and TGCT showed greater than 50% amplification of STAT2 (p < 0.05, ). OV and KICH exhibited heterozygous deletions of STAT5B, STAT5A, and STAT3; STAT4 and STAT1 in KICH were both greater than 50% deleted (p < 0.05, ). As a result of CNV, expression levels of every STAT factor increased. This interrelationship has been found in the largest number of tumors for STAT5B, followed by STAT3, STAT6, STAT2, STAT5A, STAT1, and STAT4 ( ).
Figure 5

CNV underlies the dysregulation of STAT members. (A) CNV distribution in pan-cancer. (B) The percentage of amplification and deletion of heterozygous CNVs for each STAT gene in each cancer. (C) CNV correlation with mRNA expression.

CNV underlies the dysregulation of STAT members. (A) CNV distribution in pan-cancer. (B) The percentage of amplification and deletion of heterozygous CNVs for each STAT gene in each cancer. (C) CNV correlation with mRNA expression.

Methylation analysis of STAT regulators

To identify the epigenetic regulation in TCGA database, the methylation characteristics of STAT genes were examined. In different tumors, methylation of STAT genes was heterogeneous: in PRAD, LUSC, BRCA, UCEC, and COAD, more highly hypermethylated genes could be found, while in THCA, KIRC, and LIHC, more highly hypomethylated genes can be found ( ). A correlation analysis of the methylation levels and mRNA expression levels revealed that a large proportion of genes, especially STAT1, STAT5A, and STAT6, correlated negatively with the methylation levels ( ). A survival study of various types of tumors showed that hypermethylation of STAT5A and STAT5B was mainly linked to poorer survival, while hypomethylation of STAT1, STAT2, and STAT4 was generally linked to shorter survival (p < 0.05, ).
Figure 6

Methylation of melatonergic regulators. (A) Methylation difference between tumor and normal samples. (B) Spearman’s correlation coefficient of methylation and STAT gene expression. (C) Overall survival difference between hypermethylation and hypomethylation.

Methylation of melatonergic regulators. (A) Methylation difference between tumor and normal samples. (B) Spearman’s correlation coefficient of methylation and STAT gene expression. (C) Overall survival difference between hypermethylation and hypomethylation.

Pathway activity analysis

The linked pathway network indicated that STAT family genes acted significantly in nine famous cancer-related signaling pathways. For STAT1, the principal inactivated pathways were apoptosis, cell cycle, and EMT. STAT2 was mostly involved in the activation of apoptosis and EMT, while the key pathways inactivated were cell cycle, DNA damage response, and hormone Androgen Receptor (AR). STAT3 was mostly involved in the activation of EMT, hormone Estrogen Receptor (ER), rat sarcoma/mitogen activated protien kinase RAS/MARK, and Receptor tyrosine kinase (RTK), while the major inactivated pathways were cell cycle and DNA damage response. STAT4 was generally involved in the inhibition of DNA damage response and hormone AR, while the main activated pathways were apoptosis, EMT, and hormone ER. STAT5A was mainly involved in the activation of EMT and hormone ER, while the main inactivated pathways were cell cycle and PI3K/AKT. In STAT5B, the primary inactivated pathways were apoptosis and cell cycle. STAT6 was mostly involved in RAS/MARK activation, while the main inactivated pathways were apoptosis, cell cycle, DNA damage response, and EMT (all p < 0.05, ).
Figure 7

Pathway network between STAT family regulators.

Pathway network between STAT family regulators.

Drug sensitivity analysis

According to the drug sensitivity analysis, low levels of STAT5B and STAT5A showed resistance to 56 and 42 drugs, respectively. Drug resistance toward vorinostat, tubastatin A, NPK76-II-72-1, I-BET-762, TPCA-1, TL-1-85, NG-25, navitoclax, and methotrexate negatively correlated with STAT5B expression (correlation coefficient >-0.20). Similarly, drug resistance toward BHG712, TPCA-1, TL-1-85, and NG-25 negatively related to the STAT5A expression (correlation coefficient >-0.20) ( ).
Figure 8

Drug sensitivity analysis of STAT family regulators from GDSC IC50 drug data.

Drug sensitivity analysis of STAT family regulators from GDSC IC50 drug data.

Correlation between the expression levels of STATs and immune cell infiltration in pan-cancer

Scatterplot analysis revealed Spearman’s correlation coefficient between each STAT family gene and the tumor infiltration of each cell type analyzed in different cancer types ( ). STAT1 expression positively correlated with the infiltration of T regulatory cells (Tregs), T follicular helper cells (TFHs), CD8+ and activated memory CD4+ T cells, M1 macrophages, resting dendritic cells (DCs), and naive B cells in most tumors and negatively related to the infiltration of naive CD4+ T, plasma, resting NK, monocyte, resting and activated mast cells, M2 and M0 macrophages, activated DCs, and memory B cells in the majority of tumors ( ). Infiltration of resting memory CD4+ T cells and M1 macrophages positively associated with STAT2 expression; however, STAT2 negatively correlated with the infiltration of activated mast and memory B cells in most cancer types ( ). STAT3 expression positively correlated with the infiltration of resting memory CD4+ T and naive B cells in most tumors and negatively associated with the infiltration of TFHs, CD8+ T, activated NK, and memory B cells in different cancer types ( ). Infiltration of Tregs, TFHs, CD8+, resting and activated memory CD4+ T cells, M1 macrophages, resting DCs, and naive B cells positively correlated with STAT4 expression; however, STAT4 negatively correlated with the infiltration of naive CD4+ T, resting NK and resting mast cells, M0 macrophages, activated DCs, and memory B cells in most tumor types ( ). STAT5A expression positively correlated with the infiltration of Tregs, CD8+, resting and activated memory CD4+ T cells, M1 macrophages, resting DCs, and naive B cells in most tumors and negatively correlated with the infiltration of naive CD4+ T cells, M0 macrophages, activated DCs, and memory B cells in the majority of tumors ( ). The levels of STAT5B positively associated with the infiltration of resting memory CD4+ T, resting mast, and naive B cells in most tumors; however, STAT5B negatively correlated with the infiltration of Tregs, gammadelta T cells, TFHs, activated NK cells, and memory B cells in different cancer types ( ). STAT6 expression positively associated with the infiltration of resting memory CD4+ T, monocyte, and resting mast cells in most tumors and negatively related to the infiltration of gammadelta T cells, activated memory CD4+ T cells, and M0 macrophages in the majority of tumors ( ).
Figure 9

Correlation between the expression levels of signal transducer and activator of transcription (STAT) genes and immune infifiltrationin pan-cancer. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6. *p < 0.05, **p < 0.01, ***p < 0.001.

Correlation between the expression levels of signal transducer and activator of transcription (STAT) genes and immune infifiltrationin pan-cancer. (A) STAT1; (B) STAT2; (C) STAT3; (D) STAT4; (E) STAT5A; (F) STAT5B; (G) STAT6. *p < 0.05, **p < 0.01, ***p < 0.001.

TME analysis

Higher stromal or immune scores indicated that stromal or immune elements have infiltrated deeper into the TME. Among all STAT genes, but not STAT6, expression was positively related to stromal scores. Specifically, STAT1 was strongly correlated with READ, LAML, and COAD. STAT2 was strongly correlated with COAD, LAML, and READ. STAT3 was strongly correlated with TGCT, PAAD, and LAML. STAT4 was strongly correlated with ACC, BLCA, BRCA, COAD, ESCA, HNSC, KICH, LUSC, OV, PAAD, PRAD, READ, SARC, SKCM, THCA, and UCEC. STAT5A was strongly correlated with CHOL, LGG, LUAD, PAAD, PRAD, and TGCT. STAT5B was strongly correlated with PAAD, READ, and STAD, and STAT6 was strongly correlated with GBM and TGCT. The expressions of STAT1, STAT2, STAT3, STAT4, and STAT5A were mostly positively related to immune scores, and STAT5B and STAT6 expressions were partly positively related to immune scores. To be specific, STAT1 was closely related to BLCA, CESC, COAD, DLBC, HNSC, KIRC, READ, SKCM, TGCT, THCA, and UVM. STAT2 was closely related to COAD, DLBC, and READ. STAT3 was closely related to COAD, DLBC, and READ. STAT4 was closely related to ACC, BLCA, BRCA, CESC, COAD, DLBC, ESCA, HNSC, KICH, KIRC, LUAD, LUSC, MESO, OV, PAAD, PRAD, READ, SARC, SKCM, STAD, TGCT, THCA, UCEC, and UCS. STAT5A was closely related to CHOL, DLBC, HNSC, KIRC, LUAD, LUSC, PAAD, PRAD, and TGCT. STAT5B was positively correlated with DLBC but negatively related to GBM and SARC. All STAT genes except STAT6 exhibited a positive correlation with estimate scores. In particular, STAT1 was highly associated with BLCA, COAD, DLBC, KIRC, READ, SKCM, TGCT, THCA, and UVM. STAT2 was highly associated with COAD and READ. STAT3 was highly associated with COAD, DLBC, LGG, PAAD, and READ. STAT4 was highly associated with ACC, BLCA, BRCA, CESC, COAD, DLBC, ESCA, HNSC, KICH, KIRC, LUAD, LUSC, MESO, OV, PAAD, PRAD, READ, SARC, SKCM, STAD, TGCT, THCA, UCEC, and UCS. STAT5A was highly associated with CHOL, LUAD, LUSC, PAAD, PRAD, and TGCT. STAT5B was highly associated with READ and DLBC ( ; ).
Table 1

Association of the STAT family gene expression with stromal scores, immune scores and estimate scores across 33 different cancer types.

ACCBLCABRCACESCCHOLCOADDLBCESCAGBMHNSCKICHKIRCKIRPLAMLLGGLIHCLUADLUSCMESOOVPAADPCPGPRADREADSARCSKCMSTADTGCTTHCATHYMUCECUCSUVM
STAT1stromal scoresr0.010.350.200.230.090.500.400.210.000.260.230.300.100.510.380.260.260.310.000.090.41-0.020.200.570.150.350.110.270.410.160.060.140.41
P9.03E-017.19E-131.08E-109.47E-056.16E-011.38E-196.45E-033.68E-039.79E-012.91E-096.94E-021.94E-129.89E-021.05E-153.29E-195.74E-072.35E-091.54E-129.69E-016.49E-021.82E-087.82E-014.80E-064.05E-091.75E-021.91E-142.52E-021.58E-031.59E-218.78E-023.90E-013.06E-011.60E-04
immune scoresr-0.110.610.470.560.090.700.790.370.110.530.160.560.040.260.320.380.440.470.370.280.390.150.330.640.380.600.370.690.59-0.060.170.390.53
P3.41E-015.94E-431.90E-593.18E-255.86E-018.61E-437.56E-112.59E-071.80E-014.23E-392.01E-015.55E-455.37E-011.07E-047.42E-141.23E-136.18E-256.14E-284.64E-046.67E-098.77E-084.58E-022.53E-145.97E-122.63E-102.45E-467.01E-143.74E-208.69E-485.11E-012.76E-023.04E-034.43E-07
estimate scoresr-0.050.500.390.460.100.630.740.310.070.440.200.520.060.400.350.360.390.420.230.210.410.080.300.630.320.540.250.660.550.000.150.320.53
P6.56E-011.98E-278.90E-411.48E-165.69E-011.00E-324.01E-091.79E-053.88E-012.17E-261.08E-012.92E-383.26E-011.72E-094.19E-162.86E-121.83E-194.04E-223.05E-021.32E-059.46E-092.73E-011.57E-112.88E-111.25E-073.85E-365.30E-073.92E-181.46E-409.74E-014.28E-021.61E-026.30E-07
STAT2stromal scoresr0.000.180.330.190.200.590.260.370.040.270.030.260.030.550.040.030.100.180.140.130.45-0.100.110.680.170.070.350.190.220.240.020.140.20
P9.91E-012.93E-041.21E-288.45E-042.47E-013.73E-288.29E-022.15E-076.66E-012.66E-108.10E-019.71E-106.46E-019.86E-194.15E-015.33E-011.73E-024.34E-051.86E-016.61E-032.52E-101.78E-011.68E-026.96E-145.02E-031.60E-017.66E-133.14E-023.04E-079.23E-037.37E-012.98E-017.72E-02
immune scoresr-0.010.330.310.300.190.660.510.310.070.390.010.37-0.010.330.100.010.170.200.250.150.33-0.060.160.620.130.240.370.260.21-0.34-0.040.140.28
P9.19E-019.88E-124.13E-252.00E-072.63E-012.10E-362.90E-041.64E-054.04E-014.28E-209.00E-017.45E-198.45E-018.11E-073.33E-027.72E-019.95E-058.72E-062.24E-022.09E-035.33E-063.87E-014.56E-044.28E-113.47E-021.65E-077.10E-142.48E-031.51E-061.96E-046.50E-012.94E-011.10E-02
estimate scoresr-0.010.270.360.290.190.660.490.370.060.370.030.370.000.450.070.030.150.200.220.160.42-0.080.150.690.170.190.380.300.23-0.120.000.130.27
P9.45E-014.75E-081.49E-345.52E-072.56E-011.75E-365.01E-043.06E-074.91E-012.07E-188.23E-011.66E-189.50E-013.73E-121.04E-016.17E-014.81E-047.12E-064.37E-029.48E-048.19E-093.02E-011.11E-033.85E-145.65E-034.06E-053.52E-154.67E-041.01E-071.77E-019.55E-013.22E-011.47E-02
STAT3stromal scoresr-0.010.210.13-0.020.170.380.310.0200.120.340.310.10.460.420.220.040.15-0.010.220.460.250.160.36-0.040.190.310.480.120.26-0.10.110.33
P9.54E-011.24E-051.40E-057.83E-013.15E-014.03E-113.69E-028.03E-019.82E-015.32E-035.50E-035.40E-139.16E-029.16E-131.27E-222.15E-053.32E-016.15E-049.11E-013.62E-067.66E-118.29E-043.94E-043.82E-044.70E-013.80E-052.47E-104.59E-097.79E-033.93E-031.77E-014.36E-012.90E-03
immune scoresr-0.010.33-0.050.020.090.440.440.05-0.010.170.280.03-0.060.110.40.12-0.10.05-0.060.220.310.160.130.45-0.140.150.19-0.060.04-0.3-0.120.150.37
P9.44E-018.87E-129.13E-027.56E-015.88E-014.19E-152.08E-035.16E-019.03E-019.00E-052.51E-025.34E-013.49E-011.14E-013.39E-212.56E-022.45E-022.38E-015.65E-015.40E-062.05E-052.86E-023.63E-038.30E-062.25E-021.34E-031.16E-044.64E-014.05E-018.24E-041.17E-012.73E-018.60E-04
estimate scoresr00.280.030.010.140.440.490.030.010.160.320.160.010.270.420.17-0.040.1-0.050.240.410.220.150.43-0.110.180.270.160.07-0.08-0.110.130.38
P9.70E-015.81E-093.56E-018.83E-014.04E-011.41E-145.50E-046.41E-019.22E-011.93E-041.01E-021.89E-049.26E-017.25E-051.00E-229.88E-043.92E-012.37E-026.30E-015.26E-071.40E-083.81E-035.79E-042.22E-058.08E-021.22E-046.62E-086.69E-021.04E-013.80E-011.27E-013.35E-014.92E-04
STAT4stromal scoresr0.550.570.500.350.320.600.270.580.450.520.670.360.370.10-0.030.390.400.580.390.510.57-0.160.500.590.610.590.490.370.620.250.560.420.41
P1.66E-071.12E-353.05E-707.93E-106.13E-021.46E-286.49E-021.65E-178.21E-093.48E-379.53E-104.79E-181.71E-101.27E-014.61E-015.50E-151.06E-202.49E-462.02E-042.26E-292.40E-162.87E-022.22E-337.41E-103.49E-271.13E-434.97E-251.13E-059.77E-557.26E-037.39E-161.44E-032.10E-04
immune scoresr0.610.800.760.670.310.810.790.740.420.680.770.570.470.18-0.230.490.600.710.630.690.670.100.730.790.780.770.740.780.69-0.080.600.620.38
P3.98E-095.93E-921.60E-2011.92E-396.97E-021.58E-665.53E-117.95E-337.08E-081.74E-701.08E-134.90E-473.69E-177.31E-032.14E-072.16E-231.79E-491.83E-761.24E-103.09E-611.38E-242.01E-019.45E-849.29E-212.24E-533.66E-911.01E-694.52E-282.38E-713.98E-015.37E-193.66E-075.98E-04
estimate scoresr0.620.720.730.600.320.740.650.710.440.690.770.550.450.17-0.160.480.540.690.630.670.65-0.030.680.720.770.760.670.760.700.110.640.580.41
P1.48E-096.02E-668.24E-1771.17E-295.80E-026.71E-509.42E-071.43E-291.51E-086.98E-731.08E-132.62E-436.30E-161.45E-022.29E-042.81E-221.27E-392.04E-701.50E-106.77E-558.33E-236.47E-013.49E-671.43E-151.93E-511.04E-841.26E-521.78E-267.31E-752.26E-013.82E-222.52E-061.88E-04
STAT5Astromal scoresr0.310.330.300.160.590.380.090.200.360.280.380.300.410.340.690.370.500.49-0.070.190.53-0.050.530.380.270.030.450.580.250.360.150.410.02
P5.33E-031.41E-111.16E-237.68E-031.44E-045.18E-115.60E-017.45E-035.42E-066.34E-112.03E-032.85E-128.42E-134.38E-071.33E-714.33E-131.93E-322.90E-315.08E-011.38E-043.14E-144.80E-012.23E-372.46E-048.18E-064.68E-012.82E-201.88E-131.12E-086.09E-053.92E-021.83E-038.30E-01
immune scoresr0.280.440.290.440.570.430.540.410.350.540.320.510.46-0.010.730.400.620.580.100.290.570.100.580.450.350.160.440.620.26-0.140.370.43-0.03
P1.37E-022.76E-203.31E-222.55E-152.62E-047.38E-141.25E-041.29E-081.01E-051.06E-398.30E-031.48E-363.96E-168.36E-011.27E-841.20E-154.26E-541.60E-453.84E-019.14E-108.85E-172.04E-011.33E-457.14E-065.09E-096.01E-049.18E-202.66E-155.04E-091.27E-014.62E-078.89E-047.69E-01
estimate scoresr0.310.410.340.350.610.420.390.330.360.480.360.490.450.140.730.420.610.570.010.270.580.040.600.420.360.120.480.740.260.150.310.490.00
P6.90E-031.59E-175.04E-305.00E-106.83E-051.70E-137.84E-034.89E-064.81E-065.31E-313.46E-034.26E-336.21E-163.57E-021.25E-852.67E-174.18E-521.46E-439.53E-013.29E-081.52E-176.01E-012.84E-504.23E-052.44E-098.00E-036.87E-242.01E-241.73E-091.07E-012.14E-051.16E-049.89E-01
STAT5Bstromal scoresr0.040.030.180.030.240.330.180.20-0.360.320.160.18-0.060.39-0.21-0.120.170.020.200.140.43-0.110.300.46-0.280.130.470.19-0.090.11-0.030.280.23
P7.03E-015.36E-011.38E-095.99E-011.58E-011.68E-082.44E-015.64E-036.75E-063.94E-142.11E-012.59E-053.01E-015.03E-092.88E-062.74E-021.33E-046.04E-016.51E-025.61E-033.71E-091.31E-018.20E-124.76E-064.39E-067.25E-035.35E-232.60E-023.37E-022.35E-017.02E-013.61E-024.51E-02
immune scoresr0.06-0.03-0.05-0.080.190.160.470.06-0.370.200.09-0.09-0.210.01-0.11-0.260.17-0.03-0.070.020.30-0.100.180.34-0.320.130.250.19-0.24-0.05-0.180.130.20
P5.85E-015.88E-011.20E-011.49E-012.74E-017.68E-031.03E-034.10E-011.95E-065.10E-064.55E-012.96E-022.81E-048.78E-011.13E-023.77E-072.05E-044.62E-015.08E-017.36E-013.70E-051.95E-014.84E-059.39E-042.06E-075.44E-035.20E-073.34E-028.11E-085.60E-011.94E-023.51E-017.31E-02
estimate scoresr0.060.000.07-0.030.210.260.410.15-0.370.290.130.02-0.160.17-0.15-0.220.19-0.010.040.080.39-0.100.250.43-0.320.140.390.24-0.190.05-0.120.220.24
P5.76E-019.77E-012.44E-026.02E-012.24E-016.58E-064.93E-034.73E-023.47E-061.10E-113.13E-016.86E-015.40E-031.21E-026.90E-043.45E-052.76E-058.14E-017.24E-018.40E-026.70E-081.74E-011.05E-081.65E-051.04E-072.66E-031.04E-155.53E-031.51E-056.24E-011.13E-019.70E-023.28E-02
STAT6stromal scoresr-0.12-0.300.13-0.10-0.060.000.08-0.130.540.01-0.170.03-0.090.520.600.04-0.08-0.08-0.150.00-0.060.140.130.05-0.21-0.05-0.030.640.010.24-0.070.080.42
P2.95E-015.36E-102.43E-051.03E-017.37E-019.49E-016.14E-017.26E-021.09E-128.42E-011.84E-014.39E-011.34E-015.45E-169.59E-514.50E-017.25E-028.06E-021.61E-019.51E-014.17E-016.24E-023.15E-036.58E-018.91E-043.37E-015.26E-011.28E-168.97E-019.26E-033.88E-015.55E-011.39E-04
immune scoresr-0.19-0.18-0.03-0.02-0.110.070.230.010.470.07-0.10-0.02-0.130.290.48-0.090.06-0.02-0.140.080.030.180.130.10-0.18-0.07-0.03-0.11-0.04-0.33-0.050.310.37
P9.85E-023.88E-043.20E-017.12E-015.12E-012.44E-011.26E-018.42E-011.01E-099.41E-024.16E-016.42E-012.33E-021.20E-058.87E-309.91E-021.96E-017.25E-011.95E-011.05E-016.71E-011.45E-025.35E-033.52E-012.95E-031.21E-015.56E-011.90E-013.45E-012.30E-044.72E-012.08E-026.96E-04
estimate scoresr-0.17-0.260.05-0.05-0.110.030.21-0.070.500.04-0.13-0.01-0.120.410.53-0.03-0.01-0.05-0.200.05-0.010.170.130.07-0.20-0.07-0.040.18-0.03-0.09-0.060.220.41
P1.42E-011.48E-071.20E-014.19E-015.38E-016.65E-011.61E-013.58E-013.30E-113.47E-012.85E-018.74E-013.55E-026.90E-103.22E-385.07E-017.92E-012.92E-017.05E-023.24E-018.67E-012.46E-023.41E-035.30E-011.35E-031.66E-014.54E-013.40E-025.39E-013.42E-014.39E-011.01E-011.80E-04

STAT, signal transducer and activator of transcription; ACC, adrenocortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; DLBC, lymphoid neoplasm diffuse large B-cell lymphoma; ESCA, esophageal carcinoma; GBM, glioblastoma multiforme; HNSC, head and neck squamous cell carcinoma; KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukemia; LGG, brain lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; MESO, mesothelioma; OV, ovarian serous cystadenocarcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; SARC, sarcoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; TGCT, testicular germ cell tumors; THCA, thyroid carcinoma; THYM, thymoma; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma; UVM, uveal melanoma. The red values mean that the differences are statistically significant.

Figure 10

Association of the STAT family gene expression with stromal scores, immune scores, and estimate scores across 33 different cancer types.

Association of the STAT family gene expression with stromal scores, immune scores and estimate scores across 33 different cancer types. STAT, signal transducer and activator of transcription; ACC, adrenocortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; DLBC, lymphoid neoplasm diffuse large B-cell lymphoma; ESCA, esophageal carcinoma; GBM, glioblastoma multiforme; HNSC, head and neck squamous cell carcinoma; KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukemia; LGG, brain lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; MESO, mesothelioma; OV, ovarian serous cystadenocarcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; SARC, sarcoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; TGCT, testicular germ cell tumors; THCA, thyroid carcinoma; THYM, thymoma; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma; UVM, uveal melanoma. The red values mean that the differences are statistically significant. Association of the STAT family gene expression with stromal scores, immune scores, and estimate scores across 33 different cancer types. In DNAss, the expression of STAT1 was positively related to GBMLGG, LGG, PRAD, THYM, THCA, and PCPG, while STAT1 expression was negatively related to CESC, ESCA, Stomach and Esophageal carcinoma (STES), SARC, KIPAN, STAD, UCEC, HNSC, KIRC, LUSC, LIHC, UCS, and BLCA ( ). The expression of STAT2 was positively related to GBMLGG, LGG, KIRP, PRAD, THYM, THCA, and PCPG, while STAT2 expression was negatively related to COAD, COADREAD, BRCA, ESCA, STES, STAD, HNSC, MESO, and BLCA ( ). The expression of STAT3 was positively related to GBMLGG, LGG, THYM, THCA, ACC, and CHOL, while STAT3 expression was negatively related to COAD, COADREAD, LAML, BRCA, SARC, STAD, PAAD, TGCT, and BLCA ( ). The expression of STAT4 was positively related to LGG, PARD, THCA, and PCPG, while STAT4 expression was negatively related to CESC, LUAD, COAD, ESCA, COADREAD, BRCA, STES, SARC, HNSC, KIPAN, STAD, UCEC, KIRC, LUSC, LIHC, PAAD, UCS, BLCA, KICH, and DLBC ( ). The expression of STAT5A was positively related to GBMLGG, LGG, PARD, THYM, THCA, PCPG, and ACC, while STAT5A expression was negatively related to GBM, LUAD, COAD, UCEC, COADREAD, BRCA, STES, SARC, KIPAN, STAD, HNSC, LUSC, LIHC, OV, TGCT, UCS, BLCA, and DLBC ( ). The expression of STAT5B was positively related to LIHC and PCPG, while STAT5B expression was negatively related to GBMLGG, LGG, COAD, LAML, COADREAD, BRCA, STES, SARC, STAD, UCEC, HNSC, LUSC, PAAD, and BLCA ( ). The expression of STAT6 was positively related to GBMLGG, LGG, LUSC, THYM, THCA, PCPG, and KICH, while STAT6 expression was negatively correlated with LUAD, BRCA, SARC, PAAD, and TGCT ( ).
Figure 11

Correlation between signal transducer and activator of transcription (STAT) gene expression and cancer stemness scores based on Spearman’s correlation tests. (A–G) DNAss. (H–N) RNAss.

Correlation between signal transducer and activator of transcription (STAT) gene expression and cancer stemness scores based on Spearman’s correlation tests. (A–G) DNAss. (H–N) RNAss. In RNAss, the expression of STAT1 was positively associated with LUAD, BRCA, STES, STAD, and PCPG, while STAT1 expression was negatively associated with GBMLGG, LGG, COAD, COADREAD, KIPAN, HNSC, THYM, LIHC, THCA, READ, and PAAD ( ). The expression of STAT2 was almost negatively associated with all tumor types except ACC, SKCM, and PCPG ( ). The expression of STAT3 was negatively associated with all tumor types ( ). The expression of STAT4 was positively associated with PCPG, while STAT4 expression was negatively related to all other tumor types ( ). The expression of STAT5A was positively associated with PCPG, while STAT5A expression was negatively related to all other tumor types ( ). The expression of STAT5B was positively associated with GBMLGG, LGG, and PCPG, while STAT5B expression was negatively related to all other tumor types ( ). The expression of STAT6 was negatively associated with GBM, GBMLGG, LGG, CESC, LUAD, COAD, COADREAD, LAML, BRCA, SARC, KIRP, KIPAN, PRAD, UCEC, KIRC, LUSC, THYM, LIHC, THCA, PAAD, TGCT, PCPG, SKCM, and BLCA ( ).

PPI network construction

We constructed a PPI network for each of the STAT genes using the GeneMANIA database. A hub node representing each member of the STAT family was surrounded by 20 nodes corresponding to genes that were significantly correlated with their members ( ).
Figure 12

PPI network, GO and KEGG enrichment analysis in pan-cancer. (A) The PPI network for STATs and the 20 most frequently altered neighbor genes. (B) The network of enriched GO terms. (C) The network of enriched KEGG terms. (D) The network of GO pathways colored by p-value. (E) The network of KEGG pathways colored by p-value.

PPI network, GO and KEGG enrichment analysis in pan-cancer. (A) The PPI network for STATs and the 20 most frequently altered neighbor genes. (B) The network of enriched GO terms. (C) The network of enriched KEGG terms. (D) The network of GO pathways colored by p-value. (E) The network of KEGG pathways colored by p-value.

GO and KEGG enrichment analysis

GEPIA was used to identify the top 20 genes that are similar to those of each STAT member in order to explore possible mechanisms through which STATs can contribute to pan-cancer pathology ( ). After removing duplicates, there were 135 genes, including seven STAT family factors and 128 similar genes. To catch the relationship between terms, the Metascape database was used in GO and KEGG enrichment analyses of STATs and their relative genes. Three aspects, including MF, CC, and BP, were considered in GO enrichment analysis for predicting target host gene functions. As displayed in , we found that GO:0045087 (innate immune response), GO:0001819 (positive regulation of cytokine production), GO:0046649 (lymphocyte activation), GO:0034097 (response to cytokine), GO:0032479 (regulation of type I interferon production), GO:0034340 (response to type I interferon), GO:0043122 (regulation of I-kappaB kinase/NF-kappaB signaling), GO:0140289 (protein mono-ADP-ribosylation), GO:0045648 (positive regulation of erythrocyte differentiation), GO:0042803 (protein homodimerization activity), GO:0042393 histone binding, GO:0061629 (RNA polymerase II-specific DNA-binding transcription factor binding), GO:0008047 (enzyme activator activity), GO:0001228 (DNA-binding transcription activator activity, RNA polymerase II-specific), GO:0050778 (positive regulation of immune response), GO:0000209 (protein polyubiquitination), GO:0001779 (NK cell differentiation), GO:0060216 (definitive hemopoiesis), GO:0043124 (negative regulation of I-kappaB kinase/NF-kappaB signaling), and GO:0042113 (B cell activation) were prominently related to STAT factors and their similar genes. Furthermore, six pathways related to the functions of the STAT family were found through KEGG analysis; pathways such as hsa05162: measles, hsa05161: hepatitis B, hsa00310: lysine degradation, hsa05340: primary immunodeficiency, hsa05145: toxoplasmosis, and hsa04664: Fc epsilon RI signaling pathway were involved in the tumorigenesis and pathogenesis of pan-cancer ( ). Additionally, the networks of enriched GO terms and KEGG pathways were displayed in .
Table 2

The top 20 similar genes of each STAT family gene.

STAT1STAT2STAT3STAT4STAT5ASTAT5BSTAT6
TAP1XAF1TRIP12AC067945.4NRROSEZH1TRIM38
UBE2L6TRIM22GOSR1SLFN12LFAM78ALRRC37A17PSP1
IFIH1PARP14SPTY2D1RP11-1094M14.5STK10KMT2ACASP8
LAP3SP110MAP3K2RP11-686D22.10NCKAP1LCARFNUMB
PARP14STAT1KPNA6RP11-1094M14.8DOCK2PIKFYVEBAZ2A
APOL6SAMD9LMFAP3LINC00861RIN3SENP7IRAK4
GBP1SP100FBXW2NLRC3TMEM106AZNRD1-AS1PLEKHM1P
PARP9OAS2SEC24BSLA2SPNNKTRRREB1
OAS2ZFYVE26EFCAB14TOMM20P2TMC8TSSK4ARHGAP27
DTX3LBTN3A1ACBD3SAMD3INPP5DCTD-2647L4.4ALPK1
OAS3RNF213STX17AKNAGIT2CREBRFNONOP2
IRF1ADARSTAM2CD40LGPIK3CDZZEF1KMT2D
TAP2OAS3SUSD6LY9CARD8NR2C2GMIP
SAMD9LDTX3LASXL2RASAL3LYL1KANSL1ELF1
GBP1P1BAZ2ASLAIN2PARP15SETDB2GIT2TRIM56
EPSTI1PARP12THRAP3E2F3P1FMNL1FAM13BRP11-295D4.3
DDX60FKBP15ADARGVINP1GAB3ZNF445PHYKPL
TRIM21MX2SP3CD6RRN3P3TTBK2GIT2
IFIT3PARP9RBM12SLAMF6BTKPARGP1RP11-274B21.2
GBP5IFIH1SEC23IPCLEC2DRP4-682C21.2PAPD4ITSN2
The top 20 similar genes of each STAT family gene.

Discussion

In many malignancies, a dysregulation of STAT family genes has been reported, but the value of STATs in the tumorigenesis and prognosis of a number of cancers has been partly established. In the present study, the gene expression and survival data, gene variation, methylation status, pathway activity, and drug sensitivity, as well as analyses of immune cell infiltration, identified STAT members as potential biomarkers, with great significance for pan-cancer research. TCGA pan-cancer data showed that STAT family genes were extensively changed in BLCA, BRCA, CHOL, COAD, ESCA, HNSC, KICH, KIRC, KIRP, LIHC, LUAD, LUSC, STAD, THCA, and UCEC compared to the matched adjacent normal tissues. The survival plots revealed that high expressions of STAT1, STAT2, STAT3, STAT5A, and STAT6 were risk factors in GBMLGG and LGG; the higher expression of STAT1, STAT2, STAT3, STAT4, STAT5B, and STAT6 correlated with the lower OS rates in LAML. In KIPAN, upregulated STAT1, STAT2, STAT3, and STAT4 were linked to the shortened OS. High expression of STAT1, STAT3, and STAT6 correlated with better OS in UVM. However, the expression of STAT1, STAT3, STAT4, STAT5A, and STAT5B was a protective factor in SKCM. STAT1 and STAT4 were correlated with a high clinical prognosis. We also found that lower STAT5B expression was significantly correlated with poor prognosis in GBMLGG, which was different from other STATs. Thus, the STAT family was a prognostic biomarker in many cancers, with special emphasis on GBMLGG, LGG, LAML, KIPAN, UVM, and SKCM, which has not been previously reported. A high frequency of CNV was found for STAT family genes. Expression analysis for STAT regulators showed a positive correlation between CNV and expression, particularly for STAT5B, STAT6, and STAT3. In addition, CNVs could affect STAT gene expression, a process that can cause tumorigenesis. In one example, the STAT2 gene was often amplified in KICH, KIRC, and KIRP (all heterozygous amplification >25%, p < 0.05) and was related to a lower patient survival in KIPAN. It was found that STAT methylation varied significantly among PRAD, LUSC, BRCA, UCEC, COAD, THCA, KIRC, and LIHC. According to a survival analysis of various types of tumors, hypermethylated STAT5A and STAT5B were mainly associated with poorer survival while hypomethylated STAT1, STAT2, and STAT4 were mainly associated with shorter survival. It is the first time that a change in STAT methylation status has been expected to predict survival outcomes. An interaction network of STATs was linked to activation of cell apoptosis, EMT, hormone ER, and RAS/MAPK pathway and inhibition of cell cycle, DNA damage response, and hormone AR. Thus, STAT factors promoted tumorigenesis via a variety of mechanisms. Chemotherapy and target therapy clinical results were affected by molecular abnormalities. According to the sensitivity analysis, STAT5A and STAT5B at low levels showed high resistance to several drugs, indicating that they might serve as biomarkers for screening drugs. To take a few examples, I-BET762 has been used in PAAD and COADREAD (38, 39); navitoclax has been enrolled in phase II clinical trials (40, 41). Hence, these drugs are susceptible to anticancer effects via STAT regulation. Functional enrichment analyses showed that STAT genes were functionally enriched in lymphocyte activation and immune response. In this study, STAT family members mainly influenced the infiltration of Tregs, CD8+, resting memory CD4+ T cells, M1 macrophages, and naive B cells and positively related to the infiltration of naive CD4+ T and NK cells, M0 macrophages, activated DCs, and memory B cells in most STAT family genes. In tumor immunity, STATs also played an important role. The relationship between immunity infiltration and expression of STAT family genes was also discussed. It was not unexpected that STATs were significantly influenced by the abundance of B, CD8+ and CD4+ T cells, macrophages, and DCs in the infiltration. Many published articles have shown that pSTAT3 acted to negatively affect T cells and DCs while positively regulating Tregs (42–45). STAT4 specifically mediated IL-12 signaling, affecting a wide range of immune cells. The biologic effects of IL-12 included induction of interferon expression in NK and activated T cells, increase in cytotoxic responses in both T and NK cells, and induction of proliferation of activated T cells (46, 47). By signaling with IL-4 and IL-13, STAT6 triggered an immune cell response, causing B- and T-cell proliferation and differentiation of macrophages (48, 49). In addition, the transcription factor STAT6 was the key to Th2 cell development as it decreased the production of IL-10 and increased IL-12 in DCs (50, 51). The study showed that STATs played a major role in tumor immune escape, as shown by the close correlation between STATs and immunocellular infiltration. Based on the present study, STAT gene expression was positively affected by immune, stromal, and estimate scores in 33 tumors. According to the findings, the greater the number of immune and stromal cells, the greater the number of tumor cells. It was demonstrated that stromal and immune cells were involved in cancer growth, metastasis, and drug resistance, suggesting that STATs can regulate tumor behavior by interacting with the TME (52). In addition, most STAT gene expressions correlated positively with DNAss and RNAss in 33 tumors from TCGA. Increased expression of STATs, improved tumor stemness scores, stronger tumor stem cell activity, and decreased tumor differentiation were observed. Despite these relevant strengths, it is nevertheless important to acknowledge the limitation of our study. Preclinical studies are expected to determine the influence of these immune-specific and tumor-specific STAT family genes on driving tumor infiltration and survival differences. Further biological experiments are needed to verify some important results in this study. We have collected tissue specimens of breast and cervical cancer, and in the future, validated experiments will be conducted to examine these findings. In conclusion, considering these genomic alterations and clinical features of STAT family members across cancer types, it will be possible to change the relationship between STATs and tumorigenesis. It was beneficial to treat cancer by targeting these STAT regulators.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.

Author contributions

DC, YC, GS and MZ contributed to the conception and design of the study. MZ, PZ, MD drafted the manuscript. RY, YM, JZ, JX and TM collected and analyzed the data. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Natural Science Foundation of Qinghai Province (No. 2022-ZJ-912).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor BH and reviewer MX declared a shared parent affiliation with the author(s) MZ, PZ, RY, TM, JX, YC, DC at the time of review.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  52 in total

Review 1.  Regulation of Th2 differentiation and Il4 locus accessibility.

Authors:  K Mark Ansel; Ivana Djuretic; Bogdan Tanasa; Anjana Rao
Journal:  Annu Rev Immunol       Date:  2006       Impact factor: 28.527

Review 2.  Alternative activation of macrophages: an immunologic functional perspective.

Authors:  Fernando O Martinez; Laura Helming; Siamon Gordon
Journal:  Annu Rev Immunol       Date:  2009       Impact factor: 28.527

Review 3.  Infectious Risks Associated with Biologics Targeting Janus Kinase-Signal Transducer and Activator of Transcription Signaling and Complement Pathway for Inflammatory Diseases.

Authors:  Esther Benamu
Journal:  Infect Dis Clin North Am       Date:  2020-06       Impact factor: 5.982

Review 4.  STAT4: a critical regulator of inflammation in vivo.

Authors:  Mark H Kaplan
Journal:  Immunol Res       Date:  2005       Impact factor: 2.829

5.  TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells.

Authors:  Taiwen Li; Jingyu Fan; Binbin Wang; Nicole Traugh; Qianming Chen; Jun S Liu; Bo Li; X Shirley Liu
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

6.  Colorectal cancer inhibition by BET inhibitor JQ1 is MYC-independent and not improved by nanoencapsulation.

Authors:  Thibaut Fourniols; Valentina Maggio; Diana Rafael; Ariana Colaco; Elia García Vidal; Alessandra Lopes; Simo Schwartz; Águeda Martínez-Barriocanal; Veronique Preat; Diego Arango
Journal:  Eur J Pharm Biopharm       Date:  2022-01-05       Impact factor: 5.571

Review 7.  Mining for JAK-STAT mutations in cancer.

Authors:  Stefan N Constantinescu; Michael Girardot; Christian Pecquet
Journal:  Trends Biochem Sci       Date:  2008-03       Impact factor: 13.807

Review 8.  Emerging strategies for cancer immunoprevention.

Authors:  J C Roeser; S D Leach; F McAllister
Journal:  Oncogene       Date:  2015-09-14       Impact factor: 9.867

9.  Normal Basal Epithelial Cells Stimulate the Migration and Invasion of Prostate Cancer Cell RM-1 by TGF-β1/STAT3 Axis in vitro.

Authors:  Chun-Yan Li; Chun-Ya Chen; Jian-Hong An; Jian-Bin Wu; Hong Shen
Journal:  Cancer Manag Res       Date:  2021-05-07       Impact factor: 3.989

10.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.