| Literature DB >> 31227749 |
Tham H Hoang1, Yue Zhao2, Yiu Lam2, Stephanie Piekos3, Yueh-Chiang Han4, Cameron Reilly4, Pujan Joshi2, Seung-Hyun Hong2, Chang Ohk Sung5, Charles Giardina4, Dong-Guk Shin6.
Abstract
Transcriptome data can provide information on signaling pathways active in cancers, but new computational tools are needed to more accurately quantify pathway activity and identify tissue-specific pathway features. We developed a computational method called "BioTarget" that incorporates ChIP-seq data into cellular pathway analysis. This tool relates the expression of transcription factor TF target genes (based on ChIP-seq data) with the status of upstream signaling components for an accurate quantification of pathway activity. This analysis also reveals TF targets expressed in specific contexts/tissues. We applied BioTarget to assess the activity of TBX21 and GATA3 pathways in cancers. TBX21 and GATA3 are TF regulators that control the differentiation of T cells into Th1 and Th2 helper cells that mediate cell-based and humoral immune responses, respectively. Since tumor immune responses can impact cancer progression, the significance of our pathway scores should be revealed by effective patient stratification. We found that low Th1/Th2 activity ratios were associated with a significantly poorer survival of stomach and breast cancer patients, whereas an unbalanced Th1/Th2 response was correlated with poorer survival of colon cancer patients. Lung adenocarcinoma and lung squamous cell carcinoma patients had the lowest survival rates when both Th1 and Th2 responses were high. Our method also identified context-specific target genes for TBX21 and GATA3. Applying the BioTarget tool to BCL6, a TF associated with germinal center lymphocytes, we observed that patients with an active BCL6 pathway had significantly improved survival for breast, colon, and stomach cancer. Our findings support the effectiveness of the BioTarget tool for transcriptome analysis and point to interesting associations between some immune-response pathways and cancer progression.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31227749 PMCID: PMC6588588 DOI: 10.1038/s41598-019-45304-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Molecular pathways of Th1 and Th2 Cell Differentiation are modeled into two parts: Upstream and Downstream of transcription factor (TF), in this case, TBX21 and GATA3, respectively. The model also categorizes the TF target genes into two types - Up targets and Down targets.
Figure 2T cells are differentiated into multiple T cell subtypes including Th1, Th2, T-fh, Th17 and T-reg cells. Each cellular differentiation pathway is known to be controlled by key transcription factors. Examples include TBX21, GATA3, BCL6, RORγt, and FOXP3 for Th1, Th2, T-fh, Th17, and T-reg cells, respectively. Th1 cells and Th2 cells interact with each other to balance immune responses to cancer.
Target genes of transcription factor TBX21 identified in Th1 Cell Differentiation pathway.
| Study | Up/Down-regulated genes and their functionality |
|---|---|
| STAD | Up: APOBEC3H[ Down: CDCA7L[ |
| BRCA | Up: AGAP2, CD28[ Down: ARFIP2, BTRC, COPB1[ |
| COAD | Up: IL21R[ Down: AARSD1, COX17, F11R, LYSMD1, MACC1, MRPL9, PON2, RNF43, RPL12, RPL14, RPL23, RPL5, RPS18, SMG7, SNRPE, TARS2, ZNF774 |
| LUAD | Up: ARHGAP30[ Down: BYSL, COPB1[ |
| LUSC | Up: ARHGAP30, CCL4, CD28[ Down: ACTL6A, COPS2, DDX18, ERAL1, GSTA4, METTL2A, MRPL30, PAK1IP1, PDCD10, PHF5A, SLC35F5 |
Figure 3(A) Correlation of transcription factor TBX21 and CD48/TTC26 as Up/Down targets of TBX21 identified by our approach in breast cancer cohort. (B) BRCA breast cancer cohort survival analyses for the extended Th1 Cell Differentiation pathway. (C) KM Analyses conduced on five TCGA cancer cohorts for the extended Th1 Cell Differentiation pathway.
Target genes of transcription factor GATA3 identified in Th2 Cell Differentiation pathway.
| Study | Up/Down genes and their functionality |
|---|---|
| STAD | Up: CAMK2D, CD226[ Down: PRKCH, RASGRP1, RASSF5, SEMA4D[ |
| BRCA | Up: ARHGAP26[ Down: ZFPM1, AEBP2, AGPAT5[ |
| COAD | Up: ANK1, CAMK2D, CBLB, CHD3, FAM124B, HIVEP2, MAP3K5, NHSL2, POLR3GL, PPP3CA, RASGRP4, RGS2, RPS6KA Down: BZRAP1, CDK8, CHEK2, HSF4, MAPKAPK |
| LUAD | Up: ASB2, CAMK2D, CCR6, CD226, CD247, IL18R1, IL2RB[ Down: CCDC12, LSM4, MED29, MYL6B, PAF1, CX3CR1, FAM124B, GNA15[ |
| LUSC | Up: ARHGAP26[ Down: AGPAT5[ |
Figure 4Cohort-based study for the extended Th2 Cell Differentiation pathway.
Figure 5Th1/Th2 balancing can be explained from these results obtained from the KM analyses conducted using M scores. For example, Th1 and Th2 cells are known to eliminate tumor cell translation reducing its evasion effects on cancer cells and patients with highly activated immune pathways may have a better survival chance.
R-based and M-based survival analysis in Th1/Th2 balance.
| Study | KM P-value by R | KM P-value by M |
|---|---|---|
| STAD | 2.3e-01 | 4.6e-03 |
| BRCA | 4.3e-02 | 4.3e-05 |
| COAD | 1.1e-02 | 7.9e-05 |
| LUAD | 5.5e-02 | 2.1e-02 |
| LUSC | 8.8e-03 | 1.5e-02 |
Figure 6The results of studying the Th1/Th2 balancing using cancer stage data show meaningful outcomes in case of STAD and COAD. Same color coding was applied as in Fig. 5.
Figure 7(A) Pathway visualization of BCL6 pathway. (B) Survival analysis based on the pathway scores for Lung Adenocarcinoma cancer with R and M thresholded at zero (e.g., R > 0vs.R < 0 and M > 0 vs. M < 0). (C) Cohort-based study for BCL6 pathway. Most of the studies have KM p-values smaller for M compared with R. Theistribution is shifted to bottom. The extended pathway outperforms the original one on all five cohorts in this KM survival analysis.
BCL6 pathway scores based survival analysis.
| Study | Threshold | KM P-value by R | KM P-value by M |
|---|---|---|---|
| STAD | 0 | 4.7e-1 | 6.1e-02 |
| BRCA | −0.5 | 3.4e-3 | 1.4e-3 |
| COAD | −0.2 | 2.2e-2 | 3.9e-4 |
| LUAD | 0 | 4.7e-1 | 1.6e-3 |
| LUSC | 0.5 | 6.9e-1 | 2.6e-1 |
Significant tests for Th1 Cell Differentiation decoy pathways (Random i) and a tailored pathway for gastric cancer cohort (Th1*).
| Pathway scores | TTest | Wilcoxon | Correlation |
|---|---|---|---|
| Random1 and Th1* | 6.90E-13 | 6.42E-14 | 0.14 |
| Random2 and Th1* | 1.98E-26 | 2.95E-23 | 0.17 |
| Random3 and Th1* | 4.43E-18 | 5.87E-19 | 0.07 |
| Random4 and Th1* | 3.68E-05 | 1.36E-06 | 0.34 |
| Random5 and Th1* | 4.48E-21 | 9.67E-20 | 0.19 |
| Random6 and Th1* | 1.87E-22 | 2.11E-21 | 0.11 |
| Random7 and Th1* | 7.81E-33 | 9.94E-27 | 0.01 |
| Random8 and Th1* | 1.01E-25 | 6.65E-23 | 0.23 |
| Random9 and Th1* | 1.51E-10 | 8.21E-11 | 0.02 |
| Random10 and Th1* | 3.19E-20 | 2.63E-22 | 0.26 |
Figure 8(A) Venn diagrams reveal significant overlaps among Up targets but little overlap among Down targets. (B) Box-plots for pathway scores for three cohorts demonstrate significant variance. R and M scores range from −1 to 1. (C) Comparing the outcomes of KM survival analysis suggests the strength of using M score for the pathway analysis.
Figure 9Data visualization of simplified Th1 Cell Differentiation pathway of two patients in different stages of breast cancer. A typical pathway includes multiple components including ligands, receptor, kinase, transcription factors, target genes, and biological process as mentioned in Fig. 1. RNA-seq fold change is ranging from −5 to 5. Each pathway component is assigned with RNA-seq fold change (tumor vs. control log2 ratio). For example, TBX21 gene of TCGA-E2-A1LH-01 has RNA-seq fold change of 2.18, presented in red color. Clinical and other information of subjects have been added to examine the model. The pathway’s activation/suppression status is more clear from the color coding of the genes appearing as TF targets.
Figure 10(A) ChIP-seq peaks located near 2Kb upstream distance from transcription start site are suggestive of direct target genes of a transcription factor with high probability. Direct and indirect targets can be added from literature survey. (B) Pipeline for identifying TF target genes for a pathway and extending the pathway with the identified up-regulated and down-regulated genes. The significance of the extended pathway is assessed by performing KM survival analyses and literature survey.
Extending Th1 Cell Differentiation pathway with estimated targets of TBX21.
| Study | Cohort Size | Thresholds | # Genes | Correlation | |||||
|---|---|---|---|---|---|---|---|---|---|
| ID | Name | C | C′ |
|
| # | # | CorC | CorC′ |
| STAD | Stomach Adenocarcinoma | 414 | 183 | 0.7 | −0.3 | 30 | 17 | 0.657 | 0.787 |
| BRCA | Breast Carcinoma | 1101 | 642 | 0.7 | −0.5 | 22 | 7 | 0.718 | 0.824 |
| COAD | Colorectal Adenocarcinoma | 614 | 332 | 0.7 | −0.3 | 4 | 17 | 0.522 | 0.743 |
| LUAD | Lung Adenocarcinoma | 507 | 175 | 0.4 | −0.3 | 10 | 5 | 0.584 | 0.78 |
| LUSC | Lung Squamous Cell Carcinoma | 490 | 195 | 0.7 | −0.3 | 35 | 11 | 0.493 | 0.661 |