| Literature DB >> 35042235 |
Georgios Asimomitis1,2,3, André G Deslauriers1,2,4,5,6,7, Andriana G Kotini4,5,6,7, Elsa Bernard1,2, Davide Esposito4,5,7, Malgorzata Olszewska4,5,6,7, Nikolaos Spyrou4,5,6,7, Juan Arango Ossa1,2, Teresa Mortera-Blanco8, Richard Koche9, Yasuhito Nannya10,11, Luca Malcovati12,13, Seishi Ogawa10, Mario Cazzola13, Stuart A Aaronson4,5,7, Eva Hellström-Lindberg8, Elli Papaemmanuil1,2, Eirini P Papapetrou4,5,6,7.
Abstract
SF3B1K700E is the most frequent mutation in myelodysplastic syndrome (MDS), but the mechanisms by which it drives MDS pathogenesis remain unclear. We derived a panel of 18 genetically matched SF3B1K700E- and SF3B1WT-induced pluripotent stem cell (iPSC) lines from patients with MDS with ring sideroblasts (MDS-RS) harboring isolated SF3B1K700E mutations and performed RNA and ATAC sequencing in purified CD34+/CD45+ hematopoietic stem/progenitor cells (HSPCs) derived from them. We developed a novel computational framework integrating splicing with transcript usage and gene expression analyses and derived a SF3B1K700E splicing signature consisting of 59 splicing events linked to 34 genes, which associates with the SF3B1 mutational status of primary MDS patient cells. The chromatin landscape of SF3B1K700E HSPCs showed increased priming toward the megakaryocyte- erythroid lineage. Transcription factor motifs enriched in chromatin regions more accessible in SF3B1K700E cells included, unexpectedly, motifs of the TEA domain (TEAD) transcription factor family. TEAD expression and transcriptional activity were upregulated in SF3B1-mutant iPSC-HSPCs, in support of a Hippo pathway-independent role of TEAD as a potential novel transcriptional regulator of SF3B1K700E cells. This study provides a comprehensive characterization of the transcriptional and chromatin landscape of SF3B1K700E HSPCs and nominates novel mis-spliced genes and transcriptional programs with putative roles in MDS-RS disease biology.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35042235 PMCID: PMC9131920 DOI: 10.1182/bloodadvances.2021006325
Source DB: PubMed Journal: Blood Adv ISSN: 2473-9529
Figure 1.Derivation and phenotypic characterization of MDS-RS patient-derived (A) Schematic overview of the derivation of iPSC lines with isolated SF3B1 mutation and genetically matched normal WT lines from 3 patients with MDS-RS (BMMCs, bone marrow mononuclear cells). (B) Percentage and absolute number of iPSC colonies of each genotype derived from each patient sample. (C) Methylcellulose assays on day 14 of hematopoietic differentiation. The number of colonies from 5000 seeded cells is shown per iPSC line (CFU-GEMM, colony-forming unit-granulocyte, erythrocyte, monocyte, megakaryocyte; CFU-GM, CFU-granulocyte, monocyte; CFU-G, colony-forming unit-granulocyte; CFU-M, CFU-monocyte; BFU-E, burst-forming unit-erythrocyte); n.s., not significant; **P ≤ .01; ***P ≤ .001. (D) Competitive growth assay. The cells were mixed 1:1.5 at the onset of hematopoietic differentiation with a genetically matched WT iPSC line stably expressing green fluorescent protein (GFP). The relative population size was estimated as the percentage of GFP− cells (measured by flow cytometry) at each time point (days 4-18 of differentiation), relative to the percentage of GFP− cells on day 2. (E) Percent viable (4′,6-diamidino-2-phenylindole−) cells on day 14 of hematopoietic differentiation of the indicated SF3B1 and SF3B1 iPSC-HSPCs. Mean and standard error of the mean (SEM) of 1 to 3 independent differentiation experiments with 2 to 3 iPSC lines from each group are shown. *P ≤ .05.
Figure 2.Integrative gene expression, alternative splicing, and transcript usage analyses categorize gene targets of mutant SF3B1. (A) PCA plots based on gene expression of the 3000 most highly variable genes color-coded by SF3B1 mutation status and sign-coded by patient ID. (B) Heatmap showing distance of the indicated iPSC-HSPCs based on pairwise Pearson correlation of their gene expression profiles, color-coded by SF3B1 mutation status and patient ID. (C) PCA plot based on inclusion levels of the differentially spliced events between SF3B1 and SF3B1 iPSC-HSPCs. (D) Scatterplots comparing the mean inclusion levels of the differentially spliced events in SF3B1 vs SF3B1 iPSC-HSPCs with different event types broken down by color, as indicated. (E) Schematic summarizing the integrative analysis used to derive a mutant SF3B1 signature of splicing events and genes and scatterplot showing inclusion level difference of all 59 signature splicing events, corresponding to 34 genes. A positive y axis value indicates that the event is more frequently found in SF3B1 vs SF3B1 cells.
Figure 3.Events of the mutant SF3B1 splicing signature. Heatmap showing the row normalized inclusion levels of the 59 signature events across HSPCs from all iPSC lines. For each row, color-coded side panels present metadata relevant to each event, including the log2fc of expression of the respective genes, the biotypes of the up- and downregulated transcripts that are associated with the splicing events, and the presence of the events in the MDS patient dataset of Pellagatti et al,[9] encoded as not present (signature events not present in any comparison); present but not significant (signature events that were not statistically significant or/and had an absolute inclusion level difference < 0.1 in both comparisons [SF3B1mut vs SF-WT and SF3B1mut vs WT, ie, healthy individuals]); present only in SF3B1mut vs SF-WT (signature events statistically significant [FDR < 0.05] and with an absolute inclusion level difference > 0.1 only in the SF3B1mut vs SF-WT MDS patient comparison); and present in SF3B1mut vs SF-WT and SF3B1mut vs WT (signature events statistically significant [FDR < 0.05] and with an absolute inclusion level difference > 0.1 in both comparisons). The annotations of the transcript biotypes are derived from the Ensembl GRCh37 gtf annotation file. Each row represents one event labeled with the respective gene name followed by a number indicating distinct events.
Figure 4.Splicing event signature separates PCA plot based on the inclusion levels of the signature splicing events in the patient samples of Pellagatti et al,[9] separating MDS SF3B1-mutated patients (K700E SF3B1mut MDS) and patients with SF3B1 mutations other than K700E (non-K700E SF3B1mut MDS) from patients without SF mutations (SF-WT MDS) and healthy individuals (WT). The asterisk marks 1 patient annotated as SF-WT. Clustering of this sample together with the SF3B1-mutated cases prompted us to more closely interrogate the sequence of the SF3B1 locus for any previously unidentified mutations. We thus discovered an in-frame 6-bp deletion (SF3B1p.K700_V701delKV) removing 2 amino acids, including the K700 hotspot.
Figure 5.(A) PCA based on accessibility of all peaks in the ATAC-seq atlas color-coded by SF3B1 mutation status and sign-coded by patient ID. (B) Heatmap showing the distance of the HSPCs from the indicated iPSC lines, based on pairwise Pearson correlation of their chromatin accessibility landscapes, color-coded by SF3B1 mutation status and patient ID. (C) Scatterplot showing the accessibility log2fc and the adjusted P value of the differentially accessible peaks between SF3B1 and SF3B1 iPSC-HSPCs per chromosome, color-coded by the adjusted P value. Each point represents a peak. (D) Cumulative distribution function (CDF) of the expression log2fc of genes more accessible in SF3B1 HSPCs, genes less accessible in SF3B1 HSPCs, and all genes, showing that genes more accessible in SF3B1 HSPCs are upregulated (Kolmogorov–Smirnov [KS] test, P = 1.17e-07) and genes less accessible in SF3B1 HSPCs are downregulated (KS test, P = 3.13e-16) compared with background. (E) Scatterplot showing the log2fc accessibility value of the differentially accessible peaks and the log2fc expression value of the linked gene (genes for which P value could not be calculated were excluded). (F) Heatmap showing Pearson correlation values of normalized read counts for ATAC-seq peaks that overlap between the indicated iPSC-HSPCs and primary normal hematopoietic cell subpopulations (hematopoietic stem cells [HSC], multipotent progenitors [MPP], common myeloid progenitors [CMP], lymphoid-primed multipotent progenitors [LMPP], granulocyte- monocyte progenitors [GMP], megakaryocyte-erythrocyte progenitors [MEP], common lymphoid progenitors [CLP], monocytes [Mono], erythroid cells [Ery], natural killer cells [NK], and B cells) from Corces et al.[36]
Figure 6.Increased transcriptional activity of TEAD TFs in (A) TF motifs enriched in peaks more accessible in SF3B1 compared with SF3B1 HSPCs and linked to upregulated genes. (B) Most statistically significant TF motifs enriched in peaks more accessible in SF3B1 compared with SF3B1 HSPCs and linked to upregulated genes. (C) Tornado plots showing the normalized accessibility signal in peaks more accessible in SF3B1 compared with SF3B1 HSPCs and linked to upregulated genes that contain TEAD motifs. (D) Expression levels of TEAD family genes in iPSC-HSPCs. Mean and SEM of transcripts per million (TPM) values from RNA-seq are shown. **Padj ≤ .01; ***Padj ≤ .001. (E) TEAD reporter activity in HSPCs from the indicated iPSC lines. Mean and SEM of 2 to 5 independent differentiation and transduction experiments per line are shown. n.s., not significant; **P ≤ .01.