| Literature DB >> 31169884 |
Evgenia Ntini1,2, Annalisa Marsico1,2,3.
Abstract
Tight regulation of gene expression is orchestrated by enhancers. Through recent research advancements, it is becoming clear that enhancers are not solely distal regulatory elements harboring transcription factor binding sites and decorated with specific histone marks, but they rather display signatures of active transcription, showing distinct degrees of transcription unit organization. Thereby, a substantial fraction of enhancers give rise to different species of non-coding RNA transcripts with an unprecedented range of potential functions. In this review, we bring together data from recent studies indicating that non-coding RNA transcription from active enhancers, as well as enhancer-produced long non-coding RNA transcripts, may modulate or define the functional regulatory potential of the cognate enhancer. In addition, we summarize supporting evidence that RNA processing of the enhancer-associated long non-coding RNA transcripts may constitute an additional layer of regulation of enhancer activity, which contributes to the control and final outcome of enhancer-targeted gene expression.Entities:
Keywords: RNA processing; chromatin; cotranscriptional RNA splicing; enhancer; long non-coding RNA (lncRNA); transcription
Year: 2019 PMID: 31169884 PMCID: PMC6884709 DOI: 10.1093/jmcb/mjz047
Source DB: PubMed Journal: J Mol Cell Biol ISSN: 1759-4685 Impact factor: 6.216
Glossary.
|
|
|
|
|---|---|---|
| ChromHMM | Software developed to learn and characterize chromatin states, by integrating chromatin datasets, such as ChIP-seq of histone modifications. It employs a multivariate Hidden Markov model, which models the presence or absence of each histone mark. |
|
| CAGE | Capped analysis of gene expression is a technique that maps the 5′ ends of capped RNA transcripts, thereby capturing transcription initiation sites genome-wide. |
|
| GRO-seq | Global run on sequencing is a technique that maps transcriptionally engaged RNA polymerase II (Pol II) genome-wide, capturing nascent RNA transcription. In this method, Pol II is allowed to run on in the presence of a labeled nucleotide analog (5′-bromouridine), which is incorporated into newly transcribed RNA. |
|
| ATAC-seq | Assay for transposable accessible chromatin followed by deep sequencing is a technique that captures open chromatin sites. It is used to identify regions of open chromatin, such as nucleosome-free positions in regulatory regions. It is based on the transposition of sequencing adapters into native chromatin |
|
| STARR-seq | Self-transcribing active regulatory region sequencing measures enhancer activity (enhancer strength) genome-wide, by assaying candidate sequences from any source of DNA using a reporter system |
|
| ChIA-PET | Chromatin interaction analysis by paired-end tag sequencing provides large-scale analysis of long-range chromatin interaction networks. In this technique, cross-linked chromatin interaction sites bound by specific proteins are enriched by chromatin immunoprecipitation, and the remote DNA elements interacting in close spatial proximity are connected through proximity ligation. |
|
| TADs | Topologically associating domains are considered the basic organizational units of chromosome architecture, covering hundreds of kilobases to several million bases in length. DNA sequences within a TAD physically interact with each other more frequently than with sequences outside the TAD. |
|
| Hi-C | Hi-C is based on chromosome conformation capture (3C) technique. It captures the 3D architecture of entire genomes, by coupling proximity ligation of DNA fragments tethered together in spatial proximity (enriched by crosslinking) with high-throughput sequencing, to produce genome-wide interaction maps. |
|
Figure 1Strategy used for enhancer annotation and identification of enhancer-associated lncRNAs. (A) The annotation of enhancers starts from the mapping of DHSs that reflect increased chromatin accessibility. CAGE (Andersson et al., 2014a) and/or nascent RNA sequencing data like GRO-seq (Hah et al., 2013) or PRO-seq (Mahat et al., 2016) are used to detect substantial Pol II transcription emanating bidirectionally from the DHS. The unstable eRNAs, which are short, non-spliced, and non-polyadenylated, are the products of bidirectional Pol II transcription activity at the enhancer and can be detected using de novo transcript assembly in GRO-seq data. Apart from Pol II and general transcription factors (GTFs) like CBP/p300 binding at the DHS, the presence of active histone marks like H3K27Ac and H3K4me1 validates the enhancer. eRNAs are terminated early by polyadenylation (pAS)-like signals and are rapidly turned over by the exosome; hence they are not readily detectable in steady-state RNA-seq data. (B) In some cases (estimated ~5% of active enhancers), Pol II transcription initiating intrinsically bidirectionally at active enhancers is preferentially elongated in one direction, most probably due to the accumulation of activating random mutations leading to enrichment of U1 sites and splicing signals, which suppress early pAS (Wu and Sharp, 2013; Gil and Ulitsky, 2018). The production of long, spliced, and polyadenylated lncRNAs, which are stable and thus readily detectable in steady-stade RNA-seq data, is associated with higher enhancer activity; the latter is reflected by increased chromatin accessibility (DHS) and enrichment of positive histone modifications, i.e. H3K27Ac (enhancer mark), H3K4me3 (promoter mark), and H3K36me3 (a hallmark of transcription elongation). (C) The genomic topology of the predicted active enhancer is also examined, and its position relative to anchor points of chromosomal loops is characterized from Pol II ChIA–PET interactions and/or TAD boundaries. One or more putative target genes are identified based on long-range 3D interactions, and induced effects on target gene expression are examined in conditions of enhancer perturbation, including genetic manipulation of the enhancer-associated lncRNA locus. (D) Flow diagram of the steps employed for enhancer annotation and enhancer-associated lncRNA identification.
Figure 2Possible mechanisms explaining the functional impact of lncRNA splicing on enhancer activity. (A) Splicing factors cotranscriptionally engaged at the conserved lncRNA splice sites may in turn recruit activating factors like histone modification enzymes and chromatin remodelers, resulting in an overall chromatin opening of the locus and a positive effect on cis-gene regulation. This happens while the lncRNA is tethered on chromatin, at its site of transcription, during cotranscriptional processing. (B) Splicing permits dissociation of the nascent lncRNA transcript from chromatin, which in turn interacts with additional proteins contributing to spatial amplification of the cis-regulatory signal. The two distinct possible mechanisms are not mutually exclusive.