| Literature DB >> 31336384 |
Alisha N Jones1,2, Michael Sattler1,2.
Abstract
Following the discovery of numerous long non-coding RNA (lncRNA) transcripts in the human genome, their important roles in biology and human disease are emerging. Recent progress in experimental methods has enabled the identification of structural features of lncRNAs. However, determining high-resolution structures is challenging as lncRNAs are expected to be dynamic and adopt multiple conformations, which may be modulated by interaction with protein binding partners. The X-inactive specific transcript (Xist) is necessary for X inactivation during dosage compensation in female placental mammals and one of the best-studied lncRNAs. Recent progress has provided new insights into the domain organization, molecular features, and RNA binding proteins that interact with distinct regions of Xist. The A-repeats located at the 5' end of the transcript are of particular interest as they are essential for mediating silencing of the inactive X chromosome. Here, we discuss recent progress with elucidating structural features of the Xist lncRNA, focusing on the A-repeats. We discuss the experimental and computational approaches employed that have led to distinct structural models, likely reflecting the intrinsic dynamics of this RNA. The presence of multiple dynamic conformations may also play an important role in the formation of the associated RNPs, thus influencing the molecular mechanism underlying the biological function of the Xist A-repeats. We propose that integrative approaches that combine biochemical experiments and high-resolution structural biology in vitro with chemical probing and functional studies in vivo are required to unravel the molecular mechanisms of lncRNAs.Entities:
Keywords: Xist; chemical probing; computational structure prediction; enzymatic footprinting; lncRNA; structural biology
Year: 2019 PMID: 31336384 PMCID: PMC6917512 DOI: 10.1093/jmcb/mjz086
Source DB: PubMed Journal: J Mol Cell Biol ISSN: 1759-4685 Impact factor: 6.216
Secondary structures of lncRNAs (supported by experimental data).
|
|
|
|
| |
|---|---|---|---|---|
|
| 0.86 kb | SHAPE, enzymatic, and inline probing |
|
|
|
| 17 kb | SHAPE, Targeted Structure-Seq, | X-chromosomal inactivation |
|
|
| 2.2 kb | SHAPE, dimethyl sulfate (DMS), and | Protein ubiquitination |
|
|
| 8 kb | SHAPE, chemical, and inline probing | Nuclear speckle formation |
|
|
| 3.2 kb | SHAPE | A scaffolding factor for nuclear |
|
|
| 3.8 kb; 0.6 kb | SHAPE and parallel analysis of RNA | Dosage compensation |
|
|
| 0.6 kb | SHAPE and inline probing | Regulation of cardiovascular |
|
|
| 0.4 kb; 0.7 kb | SHAPE and inline probing | Downregulation of FLC |
|
|
| 0.62 kb | SHAPE-seq | Intranuclear organization of |
|
|
| 0.4 kb; 0.7 kb | SHAPE | Transcriptional repression of |
|
|
| 3 kb | SHAPE | Regulation of p53-mediated |
|
|
| 1.7 kb | mFold and deletion analysis | Transcriptional gene regulation |
|
|
| 1.1 kb | SHAPE-MaP | Expression suppression of |
|
Figure 2(A) A single 26-nucleotide A-repeat region comprising two predicted hairpins. (B and C) Similarity of one-dimensional imino spectra of the dimeric single and tandem A-repeats suggests the formation of inter-repeat dimers involving the theoretical ‘hairpin 2’. (D and E) NMR structure of the stable AUCG hairpin 1 suggests this a basic folding unit of the complete A-repeat (Duszczyk et al., 2008, 2011).
Figure 1Structural arrangements of the Xist A-repeats. (A) The Xist A-repeats are located on the 5′ end of the Xist transcript. Each repeat (7.5 in mouse and 8.5 in human) is separated by a U/A-rich linker. Wutz et al. (2002) first predicted that each A-repeat formed two stable hairpins using free energy minimization; however, Duszczyk et al. (2011) showed that only the AUCG hairpin is stable while the latter drives duplex formation. (B–G) Distinct structural models for the A-repeats. Modular arrangements (where the A-repeats assemble in a modular fashion by inter-repeat duplex formation) (B–D) and non-modular arrangements (where the A-repeats are base paired in a variety of ways) (E–G) of the A-repeats. (B) Model based on NMR analysis of single and tandem repeats in vitro (see Figure 2). (C) Mouse (left) and human (right) models based on in vitro experiments: enzymatic cleavage (V1, T1, T2), chemical probing (DMS, CMCT), FRET, and comparative sequence analysis. (D) Mouse in vivo: icSHAPE (NAI-N3), PARIS (AMT), and comparative sequence analysis. (E) Mouse in vivo: Targeted Structure-Seq (DMS) and comparative sequence analysis. (F) Mouse in vitro: chemical probing using DMS and SHAPE (1 M7) and comparative sequence analysis. (G) Mouse in vivo: SHAPE-MaP (1M6, 1M7, NMIA).
Experimental and computational methods to probe and predict RNA structure.
|
|
| |
|---|---|---|
|
| RNAse A ( | Cleaves 3′ of single stranded C, U |
| RNAse T1 ( | Cleaves 3′ of single stranded G | |
| RNAse T2 ( | Cleaves 3′ of single stranded N (preference for A) | |
| S1 nuclease ( | Cleaves all single stranded nucleotides | |
| RNAse V1 ( | Cleaves double stranded RNA | |
| Hydroxyl radicals (Fe (II)-EDTA, H2O2) ( | Degradation of ribose backbone based on solvent accessibility of phosphodiester bonds | |
| In-line probing ( | RNA allowed to degrade over time, single-stranded regions typically degrade before structured regions | |
|
| Kethoxal ( | Modifies single stranded G (N1) |
| DEPC ( | Modifies single stranded A (N7) | |
| CMCT ( | Acylation of single stranded U (N3) and G (N1) | |
| DMS ( | Methylation of single stranded A, C (N1, N3) | |
| SHAPE ( | Acylation of flexible 2′OH ribose groups, performed with several different molecules such as 1M7 | |
| PARIS ( | Reversible crosslinking of base paired nucleotides using AMT | |
|
| R-scape | Comparative sequence analysis |
| WAR | Comparative sequence analysis |
*Stern et al. (1988); Ziehler and Engelke (2001); Torarinsson and Lindgreen (2008); Weeks (2010); Kwok (2016); Rivas et al. (2017).
DEPC, diethyl pyrocarbonate; CMCT, 1-cyclohexyl-3-(2-morpholinoethyl) carbodiimide metho-p-toluenesulfonate; SHAPE, selective 2′ hydroxyl acylation analyzed by primer extension; PARIS, psoralen analysis of RNA interactions and structures; R-scape, RNA structural covariation above phylogenetic expectation; WAR, webserver for aligning structural RNAs; 1M7, 1-methyl-7-nitroisatoic anhydride; AMT, 4′-aminomethyltrioxsalen.
Experimental conditions used to probe Xist A-repeat secondary structure.
|
|
|
|---|---|
|
| 25°C/4°C; 10 mM NaH2PO4/Na2HPO4 pH 6.0, 100 mM NaCl, 0.02 mM EDTA, 0.02% azide |
|
| 25°C; 20 mM Hepes-KOH pH 7.9, 100 mM KCl, 0.2 mM EDTA pH 8.0, 0.5 mM DTT, 0.5 mM PMSF, 20% glycerol, 3.25 mM MgCl2 |
|
| 37°C; HeLa/HEK293T/mES cells |
|
| 37°C; MEF cells |
|
| 37°C/25°C; 25 mM K-Hepes pH 7.0, 0.1 mM Na-EDTA, 150 mM KCl, 15 mM MgCl2 |
|
| 37°C; mouse TSCs/100 mM Hepes pH 8.0, 100 mM NaCl, 10 mM MgCl2 |
A-repeat interactions with RNA binding proteins and complexes.
|
|
|
|
|
|
|---|---|---|---|---|
|
| Alternative splicing factor/splicing factor 2 | CAUCGGG, CUGCGGA | Required for spliced Xist RNA accumulation |
|
|
| ATP dependent helicase; X-linked helicase II | UUAGGG | Promotes loading of PRC2 on the Xist A-repeats |
|
|
| Fused in sarcoma | CGCGC, GGUG, GUGGU | Not yet identified, but known to regulate transcription |
|
|
| Heterogeneous nuclear ribonucleoprotein U | GUGG | Localization of XIST to the X chromosome |
|
|
| Human antigen R | poly U, poly A, AU rich elements | Not yet identified, but known to increase mRNA stability |
|
|
| Lamin B receptor | non-sequence specific | Repositioning of Xist-targeted loci to the lamina |
|
|
| PRC2 | poly G | XIST upregulation/initiation and spread of XCI |
|
|
| Polypyrimidine tract binding protein | UCUU(C), UUCUCU, CUCUCU | Stabilization of A-repeat repeat structure |
|
|
| SMART/HDAC1 associated repressor protein | GUGUG, ACACA | Recruitment of the SMRT co-repressor to the Xist A-repeat for activation of HDAC3 |
|
|
| RNA binding motif protein 15 | poly U | Required for transcriptional repression |
|
|
| Ring finger protein 20 | unknown | Not yet identified, but known to be an E3 ubiquitin ligase involved in H2BK120 ubiquitylation |
|
|
| Wilms’ tumor 1-associating protein | RRACH | m6A methylation-promoting effects |
|
|
| YT521-B homology domain containing 1 | GG(m6A)C | Promotes XIST-mediated gene silencing |
|
Figure 3RNA binding proteins that have been reported to bind to the Xist A-repeats by CLIP and binding shift assays.
Figure 4Integrated approach for tertiary structure determination of lncRNAs and their RNPs. (A) SHAPE chemical probing and NMR define RNA secondary structure. (B) SAXS/SANS provide global and subdomain shapes. (C) NMR PREs yield long-range distance restraints. Crystallography can be performed in parallel. (D) Structural analysis of holo lncRNA and RNPs can be performed using cryo-EM, and dynamics and spatial arrangements can be obtained from FRET.