| Literature DB >> 33859327 |
Shulan Tian1, Henan Zhang2, Pan Zhang3,4, Michael Kalmbach3, Jeong-Heon Lee5, Tamas Ordog6, Paul J Hampel7, Timothy G Call7, Thomas E Witzig7, Neil E Kay7, Eric W Klee8, Susan L Slager8, Huihuang Yan8, Wei Ding9.
Abstract
T cell prolymphocytic leukemia (T-PLL) is a rare disease with aggressive clinical course. Cytogenetic analysis, whole-exome and whole-genome sequencing have identified primary structural alterations in T-PLL, including inversion, translocation and copy number variation. Recurrent somatic mutations were also identified in genes encoding chromatin regulators and those in the JAK-STAT signaling pathway. Epigenetic alterations are the hallmark of many cancers. However, genome-wide epigenomic profiles have not been reported in T-PLL, limiting the mechanistic study of its carcinogenesis. We hypothesize epigenetic mechanisms also play a key role in T-PLL pathogenesis. To systematically test this hypothesis, we generated genome-wide maps of regulatory regions using H3K4me3 and H3K27ac ChIP-seq, as well as RNA-seq data in both T-PLL patients and healthy individuals. We found that genes down-regulated in T-PLL are mainly associated with defense response, immune system or adaptive immune response, while up-regulated genes are enriched in developmental process, as well as WNT signaling pathway with crucial roles in cell fate decision. In particular, our analysis revealed a global alteration of regulatory landscape in T-PLL, with differential peaks highly enriched for binding motifs of immune related transcription factors, supporting the epigenetic regulation of oncogenes and genes involved in DNA damage response and T-cell activation. Together, our work reveals a causal role of epigenetic dysregulation in T-PLL.Entities:
Mesh:
Year: 2021 PMID: 33859327 PMCID: PMC8050249 DOI: 10.1038/s41598-021-87890-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinical information about the nine T-PLL cases.
| ID | Date of sample | Treatment* | Time+ | Phenotype | Key FISH results |
|---|---|---|---|---|---|
| P1 | 1/28/16 | CampathX2, romidepsin | < 1 year | CD3+ , CD4 + , CD45RA+ , CD45RO-, CCR7+ , CD69+ | 88.5% of nuclei had a rearrangement involving |
| P2 | 9/29/13 | Untreated, observed from 2009 to 2013 | 4 years | CD3 , CD8+ , CD45RO-, CD45RA+ , CCR7 + | 35.5% nuclei with inv(14) or t(14;14) involving |
| P3 | 4/13/16 | Campath, methotrexate, pentostatin, pralatrexate. CHOP, Benda | < 2 months | CD3+ , CD4+ , CD45RA+ , CCR7+ , CD62L+ | 74% nuclei has |
| P4 | 2012 | Untreated, found in 2008, observed to 2012 | 4 years | CD3 + , CD4 + , CD45RA+ , CCR7 + , CD62L+ | 70% nuclei had rearrangement involving |
| P5 | 7/14/14 | Untreated, but in half a year required therapy for skin rash and cytopenia | 0.5 year | CD3+ , CD8+ , CD45RO+ , CCR7+ , CD62L+ | 14q32.1 ( |
| P6 | 9/11/15 | Untreated, required therapy in 2 months | 2 months | CD3+ , CD4 + , CD45RO+ , CCR7+ , CD62L + | 82% of nuclei had a rearrangement involving |
| P7 | 1/23/12 | Untreated, with 2 months started alemtuzumab and pentostatin, followed by Hyper-CVAD, Fludarabine and cyclophosphamide | 2 months | CD3+ , CD8 + , CD45RA+ , CCR7+ , pCD62L + , CD95+ | 98% nuclei rearrangement involved |
| P8 | 7/10/10 | Untreated, within 5 months, started alemtuzumab and pentostatin | 5 months | CD3+ , CD4 + , CD45RO+ , CCR7+ , pCD62L + , pCD95+ | T |
| P9 | 11/1/04 | Untreated, within 2 months, started alemtuzumab, pentostatin and cyclophosphamide | 2 months | CD3 + , CD4+ , CD45RA+ , CCR7 + , pCD62L+ , pCD95+ | 46,X,-X,t(6;6)(q27;q15),inv(14)(q13q24), + mar[15]/ |
*Treatment history prior to sample collection. Patient P1 received a three-month romidepsin treatment, which was discontinued one month prior to sample collection.
+Time from diagnosis to therapy.
Figure 1Gene expression changes in T-PLL. (a) Unsupervised clustering of expression level from the top 10,000 genes. Gene expression was quantified by total RNA sequencing as RPKM values and transformed to log2 scale with an offset of 0.1, following by quantile normalization. These genes were selected from autosomes that had RPKM ≥ 0.5 in at least one of the samples and showed the largest variation across samples. Unlike the three normal, the T-PLL cases showed remarkable heterogeneity in expression profile. N1-N3, normal individuals; P1, P3, P5 and P6, T-PLL patients. (b) Protein-coding genes differentially expressed between T-PLL and normal. Raw reads were aligned to the hg19 reference genome using STAR and the number of reads in exons per gene was estimated with featureCounts. The 1672 (807 + 865) differentially expressed genes were identified using edgeR at the cutoff of 5% FDR and threefold change. Y-axis, fold-change at the log2 scale; x-axis, sum of the normalized read count per M from the normal and T-PLL group.
Enriched pathways and GO terms for genes down-regulated in T-PLL.
| Name | No. gene | Q value | Source |
|---|---|---|---|
| Immune System | 214 | 7.34E−26 | Reactome |
| Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell | 43 | 1.31E−21 | Reactome |
| Neutrophil degranulation | 70 | 8.87E−14 | Reactome |
| Innate Immune System | 129 | 1.96E−13 | Reactome |
| Adaptive Immune System | 87 | 2.47E−09 | Reactome |
| Cytokine Signaling in Immune system | 80 | 7.29E−09 | Reactome |
| Costimulation by the CD28 family | 17 | 3.58E−06 | Reactome |
| PD-1 signaling | 10 | 4.66E−06 | Reactome |
| Chemokine receptors bind chemokines | 13 | 2.19E−05 | Reactome |
| Interferon gamma signaling | 18 | 2.69E−05 | Reactome |
| Immune response (GO:0006955) | 185 | 4.48E−38 | GO |
| Immune system process (GO:0002376) | 242 | 4.48E−38 | GO |
| Cell activation (GO:0001775) | 115 | 2.29E−24 | GO |
| Defense response (GO:0006952) | 162 | 5.41E−27 | GO |
| Regulation of immune system process (GO:0002682) | 158 | 3.03E−31 | GO |
| Leukocyte activation (GO:0045321) | 97 | 1.89E−21 | GO |
| Regulation of immune response (GO:0050776) | 110 | 4.00E−23 | GO |
| Immune effector process (GO:0002252) | 76 | 8.04E−13 | GO |
| Leukocyte mediated immunity (GO:0002443) | 41 | 4.75E−10 | GO |
| Myeloid leukocyte activation (GO:0002274) | 30 | 2.84E−10 | GO |
Gene expression was quantified by total RNA sequencing for three normal (N1-N3) and four T-PLL (P1, P3, P5, and P6).
Figure 2Global alteration of regulatory regions in T-PLL. (a) Unsupervised clustering of 20,000 H3K27ac peaks. Peaks from all nine samples were first merged if they were overlapped by at least 1 bp. For each merged peak, the input-subtracted read counts were normalized to 10 M mapped reads, log2 transformed and quantile normalized. The top 20,000 merged peaks were selected from autosomes that were each present in at least two samples, not in the TSS ± 2.5 kb regions, and had the largest between-sample variation. (b) Unsupervised clustering of 10,000 H3K4me3 peaks. The H3K4me3 peaks were processed similarly as above, except that only those in the TSS ± 2 kb regions of protein-coding genes were selected. (c) Average read density profile over TSS ± 2 kb for genes down-regulated in T-PLL. H3K4me3 and H3K27ac signals in 40-bp bins were estimated using the ngs.plot software. RPM, reads per M. (d) Average read density profile over TSS ± 2 kb for genes up-regulated in T-PLL.
Association of differential H3K27ac and H3K4me3 occupancy with differential gene expression.
| Mark | Region | Differential peaks | Non-differential peaks | Enrichment | |||
|---|---|---|---|---|---|---|---|
| Type | Total | Linked to up-regulated gene | Total | Linked to up-regulated gene | |||
| H3K27ac | > TSS ± 2.5 kb | Up in normal | 5462 | 1001 (265) | 35,788 | 1372 (323) | 4.78 |
| H3K27ac | > TSS ± 2.5 kb | Up in T-PLL | 2833 | 583 (167) | 35,788 | 1112 (287) | 6.62 |
| H3K27ac | ≤ TSS ± 2.5 kb | Up in normal | 524 | 212 (136) | 16,866 | 355 (213) | 19.22 |
| H3K27ac | ≤ TSS ± 2.5 kb | Up in T-PLL | 543 | 166 (123) | 16,866 | 270 (187) | 19.1 |
| H3K4me3 | > TSS ± 2.0 kb | Up in normal | 1959 | 272 (175) | 17,596 | 635 (321) | 3.83 |
| H3K4me3 | > TSS ± 2.0 kb | Up in T-PLL | 1086 | 165 (102) | 17,596 | 626 (300) | 4.27 |
| H3K4me3 | ≤ TSS ± 2.0 kb | Up in normal | 742 | 212 (186) | 15,775 | 471 (403) | 9.57 |
| H3K4me3 | ≤ TSS ± 2.0 kb | Up in T-PLL | 148 | 85 (84) | 15,775 | 509 (394) | 17.8 |
Total RNA-seq differential expression analysis and ChIP-seq DiffBind were performed on four T-PLL (P1, P3, P5 and P6) and three normal (N1-N3). Peaks were split into those that showed differential occupancy between T-PLL and normal and others that showed no differential occupancy based on DiffBind analysis (FDR ≤ 0.05, fold-change ≥ 2). Peaks were assigned to the nearest genes. For differential peaks showing increased occupancy in normal (“Up in normal”), the number in parentheses indicates the number of up-regulated genes in normal. For differential peaks showing increased occupancy in T-PLL (“Up in T-PLL”), the number in parentheses indicates the number of up-regulated genes in T-PLL. Enrichment level is estimated as: (Number of differential peaks linked to differentially-expressed genes/total differential peaks) / (Number of non-differential peaks linked to differentially-expressed genes/total non-differential peaks).
Figure 3Heatmap of differential peaks and enriched TF binding motifs. (a) H3K27ac peaks with increased (upper panel) or decreased (lower panel) signal in T-PLL. There are 717 peaks (middle panel) that show increased occupancy in three of the T-PLL (P3/P1/P6). (b) H3K4me3 peaks with increased (upper panel) or decreased (lower panel) signal in T-PLL. There are 124 and 129 peaks (two middle panels) that show increased signals in P2/P4/P5 and P3/P1/P6, respectively. Differential peaks were identified using the diffbind package, using merged peaks present in at least two of the nine samples. Discriminatory peaks identified in k-means clustering were shown in the heatmap. K-means clustering and heatmap were performed using the number of reads per kb per 10 M, after input subtraction, log2 transformation and quantile normalization. (c) TF motifs enriched in the differential H3K27ac peaks gaining (upper panel) or losing occupancy (lower panel) in T-PLL. (d) TF motifs enriched in the differential H3K4me3 peaks gaining (upper panel) or losing occupancy (lower panel) in T-PLL. Top 50 motifs with the lowest p values from Homer analysis (http://homer.ucsd.edu/homer/) were displayed. Basic leucine zipper TF family members BATF, JUNB and AP-1 were shown in blue.
Figure 4IGV snapshots showing the changes of histone marks and gene expression in T-PLL. (a) Increased expression of TCL1A and the associated gains of H3K4me3 and H3K27ac. TCL1A was actively expressed in T-PLL but not in normal. All T-PLL cases gained both H3K4me3 and H3K27ac peaks (~ 6-kb) that covered the promoter and gene body. Based on the read density in the inputs, there are copy number gains in two T-PLL (P1 and P5). (b) Increased expression of MYC and the associated increase of H3K4me3 and H3K27ac. MYC expression was up-regulated in T-PLL. There was an overall increase in H3K4me3 and H3K27ac occupancy over an ~ 6 kb region that covers the promoter and gene body. The signal level in the inputs suggested a copy number gain in P1, P5 and P6. Promoter capture Hi-C data from 17 blood cell types suggested that an ~ 50-kb enhancer located > 500 kb upstream interacted with MYC promoter in total B, naïve B, fetal thymus and five types of T cells. This enhancer showed a marked increase of H3K27ac in three of the T-PLL (P1-P3) compared to the normal. (c) Markedly reduced expression of CTLA4 and the associated loss of H3K4me3 and H3K27ac. The signal in the inputs suggested no copy number alteration within CTLA4 in T-PLL. N1-N3, normal; P1-P6, T-PLL. Y-axis indicates the number of reads per 200-bp (for ChIP-seq and input) or per base pair (for RNA-seq).