| Literature DB >> 31863590 |
Tuan M Nguyen1, Elena B Kabotyanski2, Lucas C Reineke3, Jiaofang Shao4, Feng Xiong4, Joo-Hyung Lee4, Julien Dubrulle5, Hannah Johnson5, Fabio Stossi5, Phoebe S Tsoi6, Kyoung-Jae Choi6, Alexander G Ellis7, Na Zhao2, Jin Cao2, Oluwatoyosi Adewunmi1,2, Josephine C Ferreon6, Allan Chris M Ferreon6, Joel R Neilson8, Michael A Mancini5, Xi Chen2, Jongchan Kim9, Li Ma9, Wenbo Li4, Jeffrey M Rosen2.
Abstract
Transposable elements (TEs) comprise a large proportion of long non-coding RNAs (lncRNAs). Here, we employed CRISPR to delete a short interspersed nuclear element (SINE) in Malat1, a cancer-associated lncRNA, to investigate its significance in cellular physiology. We show that Malat1 with a SINE deletion forms diffuse nuclear speckles and is frequently translocated to the cytoplasm. SINE-deleted cells exhibit an activated unfolded protein response and PKR and markedly increased DNA damage and apoptosis caused by dysregulation of TDP-43 localization and formation of cytotoxic inclusions. TDP-43 binds stronger to Malat1 without the SINE and is likely 'hijacked' by cytoplasmic Malat1 to the cytoplasm, resulting in the depletion of nuclear TDP-43 and redistribution of TDP-43 binding to repetitive element transcripts and mRNAs encoding mitotic and nuclear-cytoplasmic regulators. The SINE promotes Malat1 nuclear retention by facilitating Malat1 binding to HNRNPK, a protein that drives RNA nuclear retention, potentially through direct interactions of the SINE with KHDRBS1 and TRA2A, which bind to HNRNPK. Losing these RNA-protein interactions due to the SINE deletion likely creates more available TDP-43 binding sites on Malat1 and subsequent TDP-43 aggregation. These results highlight the significance of lncRNA TEs in TDP-43 proteostasis with potential implications in both cancer and neurodegenerative diseases.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31863590 PMCID: PMC7049706 DOI: 10.1093/nar/gkz1176
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Malat1 ΔSINE exhibits increased cytoplasmic localization and forms diffuse nuclear speckles. (A) Relative position of the SINE within the genomic locus of Malat1. (B) Generation and validation of ΔSINE cells. Upper: Schema of CRISPR strategy for deletion of the SINE. Middle: Gel electrophoresis image screening for clones with complete SINE deletion. Genomic DNA isolated from each clone was amplified with PCR using primers (red arrows) flanking the deleted region. The upper band is approximately 150-bp larger than the lower band, which corresponds with the length of the SINE. WT, wildtype; Het, heterozygous clone with both WT and SINE-deleted Malat1. Lower: Alignment of Sanger sequencing reads confirming complete SINE deletion. Red brackets denote the exact location of the SINE. (C) Representative Malat1 and Neat1 fluorescent in situ hybridization (FISH) images of WT and ΔSINE cells (scale bar = 10 μm). Note the prevalence of cytoplasmic Malat1 speckles and their diffuse pattern in ΔSINE cells without mitotic features. (D and E) Quantification of cells with cytoplasmic Malat1 (upper) or Neat (lower) speckles, respectively. Pearson's chi-squared test was used for comparisons. ***P < 0.001; n ≈ 365 cells for each group. (F) CER, but not SINE is involved in formation of intramolecular duplexes. Sequences of PARIS RNA duplexes extracted from Lu et al. (14) were aligned to Malat1’s sequence. Thin black lines and blue arcs connect regions that form duplexes as emphasized in black boxes. Purple dashed lines indicate locations of the SINE (red) and the CER (blue) relative to the duplexes. Scissors symbols demarcate regions deleted with CRISPR. Green track illustrates phastCons conservation score across 60 vertebrates. (G) Interphase ΔSINE cells have significantly more cytoplasmic Malat1 than interphase WT cells. Cumulative distribution of the ratio of cytoplasmic over total Malat1 speckle number in interphase (left), mitotic (middle), and all-phases (right) cells. Interphase and mitotic cells were classified based on DAPI signal intensity and texture, using a custom-made MATLAB script. WT, ΔSINE52, ΔSINE14, ΔSINE50 and ΔCER are shown in blue, purple, red, green, and orange, respectively. The cytoplasmic Malat1 ratio was transformed with x1/5, where x is the ratio, to approximate normalization for non-normally distributed data. Two different non-parametric tests, Mann–Whitney U test (MW) and Kolmogorov–Smirnov test (KS), were used showing comparable P values. Note that although more cytoplasmic Malat1 is detected in both interphase and mitotic ΔSINE cells, more significant P values are observed for interphase cells. (H) Malat1 ΔSINE forms diffuse nuclear speckles (scale bar = 5 μm). Shown are representative FISH images of ≈200 cells for each group. (I) Malat1 nuclear speckles are smaller in ΔSINE cells but not Neat1. Box plots comparing Malat1 and Neat1 speckle sizes across cell groups. Speckle sizes were quantified using a custom-made MATLAB script as described in Materials and Methods. *P < 0.05; NS, not significant (P > 0.05); n ≈ 365 cells for each group. (J) Malat1 expression increases in the absence of the SINE. Comparison of Malat1 RNA levels between ΔSINE and ΔCER versus WT cells. RNAs isolated from the cells were analyzed with qPCR using primers probing four different regions of Malat1 from 5′ to 3′. *P < 0.05; ***P < 0.05; n = 3 biological replicates.
Figure 2.SINE-deleted Malat1 cells are highly apoptotic with severe ER stress and exhibit a cell cycle delay in S/G2 and M phases. (A) ΔSINE cells are highly apoptotic with increased DNA damage and phosphorylation of eIF2α (p-eIF2α). Western blots on lysates from WT, ΔSINE and ΔCER cells. Normalized relative changes in band intensity for individual targets were quantified by ImageJ 1.48v and are shown above each blot. Shown is a representative of three biological replicates. (B) ΔSINE cells have increased PERK phosphorylation (p-PERK), a major sensor of ER stress. Western blots on lysates from WT, ΔSINE and ΔCER cells together with Tunicamycin- (Tu) treated cells as positive controls for PERK phosphorylation induction. The upward shift in PERK indicates PERK phosphorylation. Shown is a representative of three biological replicates. L, ladder. (C and D) ΔSINE cells upregulate GADD34 and CHOP, downstream targets of phosphorylated PERK. Bar graphs comparing relative mRNA levels of GADD34 and CHOP in (D) and (E), respectively. *P < 0.05; **P < 0.01; ***P < 0.001; error bars indicate standard deviation; n = 3 biological replicates. (E) SINE deletion promotes formation of spliced XBP1 (XBP1s), a downstream target of activated IRE1α, another major sensor of ER stress. Gel electrophoresis images for visualization of XBP1s from XBP1u (unspliced XBP1) bands. Shown is a representative of 3 biological replicates. (F) Comparisons of XBP1s/XBP1u ratios between ΔSINE and ΔCER versus WT cells. *P < 0.05; NS, not significant (P > 0.05); error bars indicate standard deviation; n = 3 biological replicates. (G) ΔSINE cells have elevated levels of autophagosomes and cytoplasmic vacuoles. Fixed cells were visualized with transmission electron microscopy (TEM) (scale bar = 10 μm). Black arrows indicate a few examples of autophagosomes. (H) ΔSINE cells have swollen ER and condensed mitochondria, a feature of apoptotic cells. TEM images of WT, ΔSINE and ΔCER cells (scale bar = 500 nm). Arrow heads indicate ER regions. M, mitochondria; A, autophagosome. (I) ΔSINE cells exhibit decreased growth compared to WT cells. Growth curves of all cell types, measured by crystal violet assay. ***P < 0.001; error bars indicate standard deviation; n = 3 biological replicates. (J) ΔSINE-cells have shortened G1 and prolonged S/G2/M. Live-cell images of Fucci-expressing cells. Red (RFP) indicates G1 cells; green (GFP) indicates S/G2/M cells. Shown is a representative of 20 cells for each group. (K) Both S/G2 and mitosis are delayed in SINE-deleted cells. Relative fluorescent intensity of RFP (red and orange) and GFP (green and blue) in Fucci-tagged cells during the time course of one cell cycle (scale bar = 10 μm). Shown is a representative of 20 cells for each group. (L) Quantification of the number of hours the cell stays in each phase of cell cycle. Cell cycle was tracked using Fucci reporters. **P < 0.01; ***P < 0.001; error bars indicate standard deviation; n = 20 individual cells for each group.
Figure 3.Malat1 ΔSINE induces apoptosis through induction of TDP-43 aggregation. (A) Exogenously expressed full-length TDP-43 forms nuclear and cytoplasmic speckles with irregular shapes in ΔSINE cells, indicative of aggregate formation instead of liquid–liquid phase separation, which has a round shape. Live images of cells transfected with the full-length TDP-43-GFP construct (scale bar = 10 μm). N, cells with only nuclear TDP-43 but no speckle or just a few tiny round ones; LINS, cells with large irregular shaped nuclear TDP-43 speckles; CS, cells with cytoplasmic TDP-43 speckles. Note that cells with these speckles appear unhealthy, possibly undergoing apoptosis. (B) Cells with large irregular shaped nuclear and cytoplasmic TDP-43 speckles are significantly more prevalent in ΔSINE cells, and cells with only nuclear TDP-43 are not as common in ΔSINE cells compared with WT cells, suggesting an increase in cytoplasmic localization of TDP-43. Quantification of cells with different TDP-43-GFP localization patterns. N, cells with only nuclear TDP-43 but no speckle or just a few tiny round ones; LINS, cells with large irregular shaped nuclear TDP-43 speckles; CS, cells with cytoplasmic TDP-43 speckles; C, cells with only cytoplasmic TDP-43 but no detectable speckle. Pearson's chi-squared test was used for comparison. **P < 0.01; ***P < 0.001; NS, not significant (P > 0.05); n ≈ 155 cells for each group. (C) Malat1 ΔSINE dramatically promotes the formation of insoluble aggregates of TDP-43,. Images of FRAP analysis for TDP-43-GFP in WT and ΔSINE cells (scale bar = 1 μm). Yellow arrows indicate photobleached areas. White arrows indicate areas without photobleaching as references. Note that both nuclear and cytoplasmic TDP-43-GFP in ΔSINE cells could not recover GFP signal even 60 s post bleaching, whereas 30 s was sufficient for GFP to fully recover in WT cells. (D) Time-lapse quantification of TDP-43-GFP fluorescence recovery after photobleaching, demonstrating a drastic reduction in the liquid-like property of TDP-43 in ΔSINE cells. (E) TDP-43 depletion rescues apoptosis in ΔSINE cells, suggesting that TDP-43 aggregation is the cause of cellular toxicity. Western blots on lysates from ΔSINE cells stably expressing scrambled shRNA or TDP-43 shRNA. Shown is a representative of 3 biological replicates. (F) Stable overexpression of WT Malat1 and the SINE alone in ΔSINE cells. Overexpression relative to the empty vector was confirmed with qPCR. *P < 0.05; **P < 0.01; error bars indicate standard deviation; n = 3 biological replicates. (G) WT Malat1 overexpression rescues apoptosis and reduces eIF2α phosphorylation in cells with the mutant Malat1 ΔSINE, suggesting that SINE deletion causes a loss of function of Malat1. WT and ΔSINE cells serve as reference controls. Shown is a representative of three biological replicates. (H) Overexpression of WT Malat1 reduces formation of cytotoxic TDP-43 cytoplasmic speckles in ΔSINE cells. Live images of cells transfected with the full-length TDP-43-GFP construct (scale bar = 10 μm). (I) Quantification of the percentage of TDP-43-GFP cells with cytoplasmic speckles in ΔSINE cells with overexpression of WT Malat1 or the SINE alone. Pearson's chi-squared test was used for comparison. *P < 0.05; NS, not significant (P > 0.05); n ≈ 100 cells for each group.
Figure 4.Malat1 ΔSINE binds stronger to TDP-43 and reprograms TDP-43′s binding on mRNAs of genes involved in mitotic regulation and membrane trafficking. (A) Schematic overview of TDP-43 eCLIP-seq on WT, ΔSINE and ΔCER cells. SMInput, size-matched input; RBP, RNA-binding protein; RT, reverse transcription. (B) Although the binding pattern of TDP-43 on Malat1 does not change in SINE-deleted cells, Malat1 ΔSINE binds significantly stronger to TDP-43 than WT and ΔCER Malat1. Top: TDP-43 eCLIP read density tracks along Malat1 in reads per million (RPM) with paired SMInput for each cell type. Note that SINE and CER regions have almost no reads in ΔSINE and ΔCER cells, respectively and that TDP-43 does not bind to the SINE region. Bottom: Bars comparing enriched RPM folds of TDP-43 eCLIP reads across all Malat1 significant peak regions (Malat1 Peaks) or the full Malat1 gene body (Malat1 Gene). RPM fold was quantified by dividing normalized RPMs of TDP-43 IP over that of SMInput. **P < 0.01; ***P < 0.001; NS, not significant (P > 0.05). P values were calculated using Pearson's chi-squared test. (C) Both ΔSINE and ΔCER cells have TDP-43 binding sites that are distinct from WT cells. Venn diagram demonstrating significant TDP-43 eCLIP peaks. Significant peaks are >2 fold-enriched, P < 0.01 above SMInput. (D) TDP-43 predominantly binds to genes involved in cell morphogenesis and junction organization. Cellular functions and pathways enriched in genes with TDP-43 binding sites that are shared between WT, ΔSINE and ΔCER cells using Metascape. (E) Many TDP-43 binding sites unique for SINE-deleted cells are found in genes involved in mitotic regulation and membrane trafficking. Cellular functions and pathways enriched in genes with ΔSINE-specific TDP-43 binding using Metascape. (F and G) SINE deletion promotes binding of TDP-43 to an exon of Ywhae, a mitotic regulator, and an intron of Tnpo2, a nuclear-cytoplasmic transport regulator, respectively. Shown are the Ywhae and Tnpo2 read densities. Average enriched folds above SMInput are shown adjacent to significant peaks (P < 0.01). Arrow indicates the region with increased TDP-43 binding.
Genes with SINE-specific peaks involved in cell cycle and cellular transport regulation (>2 fold-enriched, P < 0.01 above SMInput)
| Function | Gene and description |
|---|---|
| Mitosis, cell cycle | Incenp: inner centromere protein |
| Spc24: SPC24, NDC80 kinetochore complex component, homolog ( | |
| Ahctf1: AT hook containing transcription factor 1 | |
| Nek2: NIMA (never in mitosis gene a)-related expressed kinase 2 | |
| Ywhag: tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, gamma polypeptide | |
| Smc1a: structural maintenance of chromosomes 1A | |
| Rangap1: RAN GTPase activating protein 1 | |
| Mcm3: minichromosome maintenance deficient 3 ( | |
| Nup93: nucleoporin 93 | |
| Nup188: nucleoporin 188 | |
| Cenpc1: centromere protein C1 | |
| Tuba1a: tubulin, alpha 1A | |
| Ppp2r2d: protein phosphatase 2, regulatory subunit B, delta isoform | |
| Cdkn1a: cyclin-dependent kinase inhibitor 1A (P21) | |
| Dyrk1a: dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 1a | |
| Rcc2: regulator of chromosome condensation 2 | |
| Cdc25b: cell division cycle 25B | |
| Ppp2r2a: protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha isoform | |
| Mcm5: minichromosome maintenance deficient 5, cell division cycle 46 ( | |
| Ywhae: tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, epsilon polypeptide | |
| E2f3: E2F transcription factor 3 | |
| Cenpi: centromere protein I | |
| Rfc3: replication factor C (activator 1) 3 | |
| Kif23: kinesin family member 23 | |
| Ccnd1: cyclin D1 | |
| Csnk1d: casein kinase 1, delta | |
| Taok1: TAO kinase 1 | |
| Uba52: ubiquitin A-52 residue ribosomal protein fusion product 1 | |
| Membrane tracking, | Tnpo2: transportin 2 (importin 3, karyopherin beta 2b) |
| nuclear-cytoplasmic | Lrp12: low density lipoprotein-related protein 12 |
| transport | Ddx58: DEAD (Asp-Glu-Ala-Asp) box polypeptide 58 |
| Snx9: sorting nexin 9 | |
| Mon2: MON2 homolog (yeast) | |
| Sft2d3: SFT2 domain containing 3 | |
| Atp2b1: ATPase, Ca2+ transporting, plasma membrane 1 | |
| Atad1: ATPase family, AAA domain containing 1 | |
| Micall2: MICAL-like 2 | |
| Dynll2: dynein light chain LC8-type 2 | |
| Rab14: RAB14, member RAS oncogene family | |
| Scfd1: Sec1 family domain containing 1 | |
| Osbpl6: oxysterol binding protein-like 6 | |
| Slc25a33: solute carrier family 25, member 33 | |
| Ipo5: importin 5 | |
| Ubr5: ubiquitin protein ligase E3 component n-recognin 5 | |
| Pard3: par-3 (partitioning defective 3) homolog ( | |
| Slc26a11: solute carrier family 26, member 11 | |
| Naif1: nuclear apoptosis inducing factor 1 | |
| Osbpl3: oxysterol binding protein-like 3 | |
| Nup93: nucleoporin 93 | |
| Strada: STE20-related kinase adaptor alpha | |
| Uaca: uveal autoantigen with coiled-coil domains and ankyrin repeats | |
| Golim4: golgi integral membrane protein 4 | |
| Calm2: calmodulin 2 | |
| Kcnh5: potassium voltage-gated channel, subfamily H (eag-related), member 5 | |
| Cfl1: cofilin 1, non-muscle | |
| Smurf1: SMAD specific E3 ubiquitin protein ligase 1 | |
| Ipo11: importin 11 | |
| Slc10a7: solute carrier family 10 (sodium/bile acid cotransporter family), member 7 | |
| Akt1: thymoma viral proto-oncogene 1 | |
| Atp2b2: ATPase, Ca2+ transporting, plasma membrane 2 | |
| Slc7a1: solute carrier family 7 (cationic amino acid transporter, y+ system), member 1 | |
| Bcl2l1: BCL2-like 1 | |
| Bmp4: bone morphogenetic protein 4 | |
| Ldlrad3: low density lipoprotein receptor class A domain containing 3 | |
| Hdlbp: high density lipoprotein (HDL) binding protein | |
| Ccnd1: cyclin D1 | |
| Cse1l: chromosome segregation 1-like ( | |
| Slc18a1: solute carrier family 18 (vesicular monoamine), member 1 | |
| Cdkn1a: cyclin-dependent kinase inhibitor 1A (P21) | |
| Chuk: conserved helix-loop-helix ubiquitous kinase | |
| Pex10: peroxisomal biogenesis factor 10 | |
| Cp: ceruloplasmin | |
| Csf3: colony stimulating factor 3 (granulocyte) | |
| Wasf2: WAS protein family, member 2 | |
| Egfr: epidermal growth factor receptor | |
| Derl1: Der1-like domain family, member 1 | |
| Stx2: syntaxin 2 | |
| Epn2: epsin 2 | |
| F2rl1: coagulation factor II (thrombin) receptor-like 1 | |
| Fkbp1a: FK506 binding protein 1a | |
| Aktip: thymoma viral proto-oncogene 1 interacting protein | |
| Fzd1: frizzled homolog 1 (Drosophila) | |
| Grb2: growth factor receptor bound protein 2 | |
| Grin2a: glutamate receptor, ionotropic, NMDA2A (epsilon 1) | |
| Hk2: hexokinase 2 | |
| Agfg1: ArfGAP with FG repeats 1 | |
| Hspa4: heat shock protein 4 | |
| Hspa9: heat shock protein 9 | |
| Rrbp1: ribosome binding protein 1 | |
| Kcnk1: potassium channel, subfamily K, member 1 | |
| Kif5b: kinesin family member 5B | |
| Kif5c: kinesin family member 5C | |
| Uhmk1: U2AF homology motif (UHM) kinase 1 | |
| Psen1: presenilin 1 | |
| Ldlr: low density lipoprotein receptor | |
| Lif: leukemia inhibitory factor | |
| Anxa1: annexin A1 | |
| Lrp5: low density lipoprotein receptor-related protein 5 | |
| Smad2: SMAD family member 2 | |
| Smad3: SMAD family member 3 | |
| Smad4: SMAD family member 4 | |
| Slc3a2: solute carrier family 3 (activators of dibasic and neutral amino acid transport), member 2 | |
| Laptm4a: lysosomal-associated protein transmembrane 4A | |
| Myo10: myosin X | |
| Nnat: neuronatin | |
| Mybbp1a: MYB binding protein (P160) 1a | |
| Pam: peptidylglycine alpha-amidating monooxygenase | |
| Prkcd: protein kinase C, delta | |
| Sec22c: SEC22 vesicle trafficking protein homolog C ( | |
| Ppard: peroxisome proliferator activator receptor delta | |
| Tnk2: tyrosine kinase, non-receptor, 2 | |
| Cnksr3: Cnksr family member 3 | |
| Nus1: nuclear undecaprenyl pyrophosphate synthase 1 homolog ( | |
| Ptpn11: protein tyrosine phosphatase, non-receptor type 11 | |
| Ptpn14: protein tyrosine phosphatase, non-receptor type 14 | |
| Rab7: RAB7, member RAS oncogene family | |
| Rac1: RAS-related C3 botulinum substrate 1 | |
| Rangap1: RAN GTPase activating protein 1 | |
| Trappc10: trafficking protein particle complex 10 | |
| Eea1: early endosome antigen 1 | |
| Agap1: ArfGAP with GTPase domain, ankyrin repeat and PH domain 1 | |
| Ncor1: nuclear receptor co-repressor 1 | |
| Itsn2: intersectin 2 | |
| Vps4b: vacuolar protein sorting 4b (yeast) | |
| Slc7a5: solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 | |
| Tmc6: transmembrane channel-like gene family 6 | |
| Tgfbr2: transforming growth factor, beta receptor II | |
| Thbs1: thrombospondin 1 | |
| Tnks: tankyrase, TRF1-interacting ankyrin-related ADP-ribose polymerase | |
| Tsg101: tumor susceptibility gene 101 | |
| Vegfa: vascular endothelial growth factor A | |
| Ywhae: tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, epsilon polypeptide | |
| Ywhag: tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, gamma polypeptide | |
| Ywhaz: tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, zeta polypeptide | |
| Litaf: LPS-induced TN factor | |
| Sufu: suppressor of fused homolog (Drosophila) | |
| Xpo4: exportin 4 | |
| Chd7: chromodomain helicase DNA binding protein 7 | |
| Slc3a1: solute carrier family 3, member 1 | |
| Tcirg1: T cell, immune regulator 1, ATPase, H+ transporting, lysosomal V0 protein A3 | |
| Btbd9: BTB (POZ) domain containing 9 | |
| Atp5o: ATP synthase, H+ transporting, mitochondrial F1 complex, O subunit | |
| Nipa2: non imprinted in Prader-Willi/Angelman syndrome 2 homolog (human) | |
| Casc3: cancer susceptibility candidate 3 | |
| Flna: filamin, alpha | |
| Slc10a6: solute carrier family 10 (sodium/bile acid cotransporter family), member 6 | |
| Ehd2: EH-domain containing 2 | |
| Atf4: activating transcription factor 4 | |
| Ahctf1: AT hook containing transcription factor 1 | |
| Prickle1: prickle homolog 1 (Drosophila) | |
| Slc2a6: solute carrier family 2 (facilitated glucose transporter), member 6 | |
| Nup188: nucleoporin 188 | |
| Tlk1: tousled-like kinase 1 | |
| Syt6: synaptotagmin VI | |
| Stx16: syntaxin 16 | |
| Xpo7: exportin 7 | |
| Xpot: exportin, tRNA (nuclear export receptor for tRNAs) |
Figure 5.The SINEB1 in Malat1 is enriched in RNAs with high ratios of nuclear to cytoplasmic copy numbers, and Malat1 without the SINE has decreased binding to HNRNPK, which likely binds Malat1 as RBP complexes. (A) Copies of the Malat1 SINEB1, either the sense or antisense strand, are enriched in mRNAs with high nuclear to cytoplasmic copy number ratios. A boxplot comparing the ratio of nuclear to cytoplasmic copy numbers of mRNAs without or with at least one sense or antisense SINEB1 in mouse liver and pancreatic beta cells. The copy number values for the mRNAs in the nucleus and the cytoplasm of the two cell types were determined in Halpern et al. (12). NS, not significant (P > 0.05). (B) Malat1 ΔSINE has decreased binding to HNRNPK. Left: Gel electrophoresis images of Malat1 RNA immunoprecipitation (RIP) assay with HNRNPK antibody on lysates from WT and ΔSINE cells. Right: Comparison of Malat1’s enrichment in HNRPK IP over IgG IP in WT and ΔSINE cells. Normalized Malat1 RIP band intensity from gel electrophoresis images was quantified by ImageJ 1.48v. Shown is a representative of three biological replicates. (C) A number of RBPs bind to the SINEB1-aligned region on human Malat1. Bars indicate eCLIP peak numbers in the top 10 RBPs with the highest number of significant eCLIP peaks on the human SINEB1-aligned region. All significant eCLIP peaks for 120 RBPs in K562 cells were extracted from the ENCODE database and then overlapped with the human SINEB1-aligned region. Note that HNRNPK does not have any eCLIP peaks that overlap the human SINEB1-aligned region. (D) Among the top 10 human-SINEB1-bound RBPs, KHDRBS1, TRA2A and TAF15 have been experimentally determined to interact with HNRNPK. Protein-protein interaction network analysis of the top 10 human-SINEB1-bound RBPs together with HNRNPK using STRING database. (E) KHDRBS1 and TRA2A’s major binding sites overlap with the SINE but not the CER region on human Malat1. KHDRBS1 and TRA2A eCLIP read density tracks along human Malat1. Boxes underneath each track highlight the most significant peaks. Black arrow heads indicate peaks that overlap with the human SINE-aligned region. (F) Other than interacting with KHDRBS1 and TRA2A, HNRNPK is known to bind other RBPs such as HNRNPA1 and PTBP1, both of which also have binding sites in the SINE region as shown in Figure 5G. Protein–protein interaction network analysis of the top 5 direct binding partners of HNRNPK using STRING database. (G) HNRNPK, HNRNPA1 and PTBP1 have binding sites that overlap with TDP-43 binding sites on human Malat1. Top: eCLIP read density tracks along human Malat1. Boxes underneath each track highlight significant peaks. Black arrow heads indicate peaks that overlap with the human SINE-aligned region. Bottom: Quantification of the percentage of TDP-43 eCLIP peaks that overlap with HNRNPK, HNRNPA and PTBP1’s significant eCLIP peaks.
Figure 6.Proposed Model for Malat1 regulation of TDP-43 proteostasis through the SINEB1. In normal cells with the SINE, Malat1 is bound to HNRNPK, which binds to Malat1 in complexes together with other nuclear-speckle-localized RBPs known to interact with HNRNPK such as HNRNPA1/PTBP1 and KHDRBS1/TRA2A. In the absence of the SINE, Malat1 loses binding to HNRNPK likely through dissociation of anchor RBPs such as KHDRBS1/TRA2A, which bind predominantly to the SINE region. Note that HNRNPK has been shown to bind RNA indirectly through forming a super complex with KHDRBS1 (i.e. SAM68) (58). HNRNPK dissociation from Malat1 generates more binding sites for TDP-43 in the absence of the SINE. Moreover, HNRNPK dissociation causes Malat1 to mislocalize to the cytoplasm, hijacking TDP-43 to the cytoplasm (1), as implicated by the diffuse Malat1 speckle phenotype with cytoplasmic localization. Both the increase in binding of Malat1 to TDP-43 and the cytoplasmic mislocalization of TDP-43 have been observed in neurons with cytotoxic TDP-43 inclusions in ALS patients (39, 57). Cytoplasmic TDP-43 inclusions, known to shut down mitochondrial RNAs’ translation and subsequent mitochondrial fragmentation (42), trigger apoptosis and activate caspase-3 (2). Activated caspase-3 cleaves TDP-43 FL into its truncated form TDP-43 CTF35 (3), that predominantly localizes to the cytoplasm (4) and forms more cytotoxic TDP-43 inclusions (5). TDP-43 truncation causes a reduction in nuclear TDP-43 FL that is critical for (6) many RNA metabolism processes as well as DNA repair (54), which disrupts the cell cycle, induce ER stress, PKR activation and subsequent eIF2α phosphorylation.