| Literature DB >> 28690838 |
David Santiago-Algarra1, Lan T M Dao1, Lydie Pradel1, Alexandre España1, Salvatore Spicuglia1.
Abstract
The regulation of gene transcription in higher eukaryotes is accomplished through the involvement of transcription start site (TSS)-proximal (promoters) and -distal (enhancers) regulatory elements. It is now well acknowledged that enhancer elements play an essential role during development and cell differentiation, while genetic alterations in these elements are a major cause of human disease. Many strategies have been developed to identify and characterize enhancers. Here, we discuss recent advances in high-throughput approaches to assess enhancer activity, from the well-established massively parallel reporter assays to the recent clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-based technologies. We highlight how these approaches contribute toward a better understanding of enhancer function, eventually leading to the discovery of new types of regulatory sequences, and how the alteration of enhancers can affect transcriptional regulation.Entities:
Keywords: CRISPR; MPRA; STARR-seq; enhancer function; gene transcription
Year: 2017 PMID: 28690838 PMCID: PMC5482341 DOI: 10.12688/f1000research.11581.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Principle of high-throughput assays for enhancer activity.
( A) Overview of massively parallel reporter assay (MPRA). The test sequences (wild-type, variants, etc.) are generally synthesized in silico by massive oligonucleotide synthesis with unique barcode tags and cloned into the plasmid backbone. Tags can be synthesized along with the test sequences or added after synthesis by polymerase chain reaction (PCR) amplification. A basal promoter and a reporter open reading frame (ORF) are inserted between the tested element and tag sequences. The reporter library is then transfected into cultured cells. Subsequently, mRNA is isolated and cDNA synthesized. The tags are sequenced before (plasmid library pool, for normalization) and after the transfection. The difference in the enrichment of each barcode is proportional to the enhancer activity of the test sequence. In the case of post-synthesis addition of barcodes, an additional sequencing step is required at the first cloning step. ( B) Overview of self-transcribing active regulatory region sequencing (STARR-Seq). A genomic or bacterial artificial chromosome (BAC) library is cloned in the reporter plasmid, downstream of the ORF and upstream of the polyadenylation site (pAS). Alternatively, the regions of interest might be enriched by a capture approach. The reporter library is transfected into cultured cells. Subsequently, mRNA is isolated and cDNA synthesized. The cloned regions are sequenced from the plasmid library pool (input) and the cDNA. Differences in the enrichment with respect to the input are proportional to the enhancer activity. In both panels, the effect of the enhancer on the basal promoter is indicated by an arrow.
Examples of high-throughput functional assays of mammalian enhancers.
| Original
| Specific Name | DNA
| Application | Specific features | No. of
| Size
| Cell lines | Promoter | Species | Ref. |
|---|---|---|---|---|---|---|---|---|---|---|
| MPRA | Synthetic | Characterize putative enhancers
| Centered on TF sites | 2,104 | 145 | K562, HepG2 | SV40 | Human |
| |
| MPRA | CRE-seq | Synthetic | Characterize genomic regions
| Mutation of TF binding sites | 2,100 | 130 | K562 | Hsp68 | Human |
|
| FIREWACh | Genomic | Identification of enhancers
| Isolation of GFP-positive
| 84,240 | 154 | ESC | Minimal
| Mouse |
| |
| STARR-seq | CapSTARR-seq | Genomic | Identification of enhancers
| Capture of DHS | 7,542 | 330–
| P5424, 3T3 | SCP1 | Mouse |
|
| STARR-seq | Genomic | Characterize genetic variants
| Capture of DHS | 104 | 402 | HepG2 | SCP1 | Human |
| |
| MPRA | CRE-seq | Genomic | Identification of regulatory
| Capture of DHS | 4,000 | 464 | Retina | Minimal
| Mouse |
|
| MPRA | Sharpr-MPRA | Synthetic | Characterize nucleotides as
| Tiled oligonucleotide
| 15,720 | 145 | HepG2, K562 | Minimal
| Human |
|
| MPRA | Synthetic | Identification of regulatory
| Centered on eQTL variants | 3,642 | 150 | Lymphoblastoid,
| Minimal
| Human |
| |
| MPRA | Synthetic | Identification of regulatory
| Three sliding windows with
| 2,756 | 145 | K562,
| Minimal
| Human |
| |
| STARR-seq | CapSTARR-seq | Genomic | Identification of promoter
| Capture of -200 to +50 bp
| 20,719 | 250 | K562, HeLa | SCP1 | Human |
|
| MPRA | LentiMPRA | Synthetic | Identification of enhancers by
| Centered on several ChIP-
| 2,236 | 171 | HepG2 | pGL4.23
| Human |
|
| CRISPR-Cas9 | Synthetic | Identify endogenous enhancers
| Target p53 binding site in
| 685 | N/A | BJ-RASg12v | N/A | Human |
| |
1Number of targeted DNA sequences, not necessarily the number of unique fragments that are tested
Bp, base pair; capSTARR-seq, capture-based self-transcribing active regulatory region sequencing; CHIP-seq, chromatin immunoprecipitation sequencing; CRE, cis-regulatory elements; CRISPR, clustered regularly interspaced short palindromic repeats; DHS, DNase I hypersensitive sites; eQTL, expression quantitative trait loci; ERα, estrogen receptor alpha; ESC, embryonic stem cell; FIREWACh, functional identification of regulatory elements within accessible chromatin; GFP, green fluorescent protein; GWAS, genome-wide association study; Hsp68, heat shock promoter 68; lentiMPRA, lentiviral massively parallel reporter assay; MPRA, massively parallel reporter assay; N/A, not applicable; Sharpr, systematic high-resolution activation and repression profiling with reporter-tiling; STARR-seq, self-transcribing active regulatory region sequencing; SCP1, super core promoter 1; SV40, simian virus 40; TF, transcription factor; TSS, transcription start site.