| Literature DB >> 31503413 |
Christoph Neumayr1, Michaela Pagani1, Alexander Stark1,2, Cosmas D Arnold1.
Abstract
The identification of transcriptional enhancers and the quantitative assessment of enhancer activities is essential to understanding how regulatory information for gene expression is encoded in animal and human genomes. Further, it is key to understanding how sequence variants affect enhancer function. STARR-seq enables the direct and quantitative assessment of enhancer activity for millions of candidate sequences of arbitrary length and origin in parallel, allowing the screening of entire genomes and the establishment of genome-wide enhancer activity maps. In STARR-seq, the candidate sequences are cloned downstream of the core promoter into a reporter gene's transcription unit (i.e., the 3' UTR). Candidates that function as active enhancers lead to the transcription of reporter mRNAs that harbor the candidates' sequences. This direct coupling of enhancer sequence and enhancer activity in cis enables the straightforward and efficient cloning of complex candidate libraries and the assessment of enhancer activities of millions of candidates in parallel by quantifying the reporter mRNAs by deep sequencing. This article describes how to create focused and genome-wide human STARR-seq libraries and how to perform STARR-seq screens in mammalian cells, and also describes a novel STARR-seq variant (UMI-STARR-seq) that allows the accurate counting of reporter mRNAs for STARR-seq libraries of low complexity.Entities:
Keywords: MPRA; STARR-seq; UMI-STARR-seq; cis-regulatory element; enhancer; functional genomics; gene expression; gene regulation; massively parallel reporter assay; transcriptional regulation
Mesh:
Year: 2019 PMID: 31503413 PMCID: PMC9286403 DOI: 10.1002/cpmb.105
Source DB: PubMed Journal: Curr Protoc Mol Biol ISSN: 1934-3647
Figure 1Step‐wise schematic overview of STARR‐seq and UMI‐STARR‐seq. A highly complex library insert (Illumina adapter flanked candidate fragments) is cloned into the 3′UTR of the reporter transcription unit of the STARR‐seq screening plasmid (Basic Protocol 1). After STARR‐seq library transfection into cells, active enhancers transcribe themselves as part of the 3’UTR of the reporter transcripts. The reporter transcripts are isolated as part of the cellular mRNA (isolation of mRNA) and selectively reverse transcribed using a reporter‐specific RT primer (green arrow). Standard STARR‐seq (Basic Protocol 2; left): To selectively amplify reporter cDNAs and not residual input DNA (library plasmid), a nested 2‐step PCR strategy is employed. During the junction PCR step, reporter cDNAs are specifically amplified using a forward primer (junction primer) that exclusively binds to the spliced reporter cDNA. Using the Illumina index primers (i5 and i7), the candidate sequences (Illumina adapter flanked) are amplified during the sequencing‐ready PCR step and subjected to deep sequencing. UMI‐STARR‐seq (Alternate Protocol; right): For low‐complexity candidate libraries, unique molecular identifiers (UMI) can be introduced prior to amplification, allowing the precise counting of reporter mRNAs (Alternate Protocol). To render the reporter cDNAs competent for UMI introduction, second‐strand synthesis is performed with a reporter‐specific forward primer, resulting in double‐stranded reporter DNA. The UMI is introduced by linear PCR with a modified Illumina i7 primer that harbors the UMI at the position of the i7 index. Up to this step, no amplification occurred, such that the UMIs identify individual reporter mRNAs. Only now, the UMI bearing reporter DNAs are amplified, first by the junction PCR and then by the sequencing‐ready PCR. Note that in UMI‐STARR‐seq, indexing is only possible using the i5 index, as the i7 index is replaced by the UMI. Consequently, the UMI is read as Index 1 during deep sequencing on an Illumina platform. STARR‐seq signal over input directly and quantitatively reports on the enhancer activity. ORI, bacterial origin of replication; ORF, open reading frame included in the reporter transcript with no specific function except RNA stabilization. pA site, poly‐adenylation site; P5 and P7, P5 (forward) and P7 (reverse) Illumina adapters.
Figure 2Test PCR on 1% agarose gel (lanes): left lane, Fermentas GeneRuler 1 kb plus; (1) 10 µl of sequencing‐ready PCR with five cycles; (2) 10 µl of sequencing‐ready PCR with nine cycles; and (3) 10 µl of sequencing‐ready PCR with nine cycles of minus RT.
Figure 3Sequencing‐ready PCR on 1% agarose gel (lanes): left lane, Fermentas GeneRuler 1 kb plus; (1) 10 µl of the pooled PCRs; (2) 10 µl of the supernatant, taken after PCR has bound to the beads; and (3) 250 ng cleaned‐up PCR product.
Buffer Preparation
| Stock | Binding buffer (2.5 × per screen) | Washing buffer (2 × per screen) | Storage buffer (5 × per screen) | Reconditioning buffer (3 × per screen) | 10 mM Tris·Cl | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Tris·Cl, pH 7.5 | 1 M | 20 mM | 10 ml | 10 mM | 5 ml | 250 mM | 125 ml | 10 mM | 1 ml | ||
| LiCl | 5 M | 1 M | 100 ml | 0.15 M | 15 ml | ||||||
| EDTA, pH 8 | 0.5 M | 2 mM | 2 ml | 1 mM | 1 ml | 20 mM | 20 ml | ||||
| Tween 20 | 0.10% | 500 μl | |||||||||
| NaN3 | 10% | 0.02% | 1 ml | ||||||||
| DEPC‐treated MonoQ H2O | 388 ml | 479 ml | 353.5 ml | 490 ml | 99 ml | ||||||
| NaOH | 5 M | 0.1 M | 10 ml | ||||||||
| 500 ml | 500 ml | 500 ml | 500 ml | 100 ml | |||||||
DEPC‐treated MonoQ H2O: add 750 µl DEPC to 500 ml MonoQ H2O and incubate overnight. Autoclave to inactivate DEPC.
Troubleshooting
| Problem | Possible cause | Solution |
|---|---|---|
| Low number of cells | Cells have too low density on transfection day | Check growth rate of cell line. Seed more densely prior to transfection. |
| Low complexity of STARR‐seq plasmid library |
Inefficient In‐Fusion HD reaction or reaction cleanup Low efficiency of bacterial transformation |
Follow “In‐Fusion HD reaction” steps of Basic Protocol Use recommended electrocompetent bacteria Determine and ensure maximum efficiency of bacteria transformation Check complexity of cloned library before use by deep sequencing Try Gibson Assembly for library cloning (some inserts work better with Gibson cloning) |
| Low transfection efficiency of human/mammalian cells |
Wrong transfection conditions Low viability of cells Issue with culturing of cells Issue with transfection device/method |
Optimize transfection conditions (manufacturer's guidelines!) Optimize culturing of cells |
| High number of PCR cycles for “sequencing‐ready PCR” (>12 cycles) |
Low transfection efficiency Inefficient cell lysis Degradation of RNA (incl. reporter transcripts) Loss of sample during protocol |
Monitor transfection efficiency by FACS (GFP control transfection) Homogenize cells for 4.5 min Ensure RNase‐free working environment |
| Low complexity of STARR‐seq screen (after deep sequencing) |
Low transfection efficiency Inefficient cell lysis Degradation of RNA (incl. reporter transcripts) Loss of sample during protocol Sequencing depth low |
Ensure sufficient number of reads; Confirm input library complexity by deep‐sequencing Use UMI‐STARR‐seq protocol ( |
| 1 cycle: | 45 s | 98°C | (initial denaturation) |
| 5 or 9 cycles: | 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 45 s | 72°C | (extension) | |
| 1 cycle: | 120 s | 72°C | (final extension). |
| 1 cycle: | 45 s | 98°C | (initial denaturation) |
| 9 cycles: | 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 45 s | 72°C | (extension) | |
| 1 cycle: | 120 s | 72°C | (final extension). |
| 1 cycle: | 45 s | 98°C | (initial denaturation) |
| 16 cycles: | 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 70 s | 72°C | (extension) | |
| 1 cycle: | 120 s | 72°C | (final extension). |
| 1 cycle: | 45 s | 98°C | (initial denaturation) |
| 5 or 9 cycles: | 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 45 s | 72°C | (extension) | |
| 1 cycle: | 120 s | 72°C | (final extension). |
| 1 cycle: | 45 s | 98°C | (initial denaturation) |
|
| 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 45 s | 72°C | (extension) | |
| 1 cycle: | 120 sec | 72°C | (final extension). |
*Number of cycles as determined in test PCR.
| Denaturation: | 98°C for 60 s |
| Annealing: | 65°C for 30 s |
| Elongation: | 72°C for 90 s. |
| Denaturation: | 98°C for 60 s |
| Annealing: | 65°C for 30 s |
| Elongation: | 72°C for 90 s. |
| 1 cycle: | 45 s | 98°C | (denaturation) |
| 16 cycles: | 15 s | 98°C | (denaturation) |
| 30 s | 65°C | (annealing) | |
| 70 s | 72°C | (extension) | |
| 1 cycle | 2 min | 72°C | (final extension). |