| Literature DB >> 27025950 |
Jijun Cheng1,2, Christine A Roden1,2,3, Wen Pan1,2, Shu Zhu4, Anna Baccei2,5, Xinghua Pan1, Tingting Jiang6,7, Yuval Kluger6,7, Sherman M Weissman1, Shangqin Guo2,5, Richard A Flavell4, Ye Ding8, Jun Lu1,2,7,9.
Abstract
Clustered regularly-interspaced palindromic repeats (CRISPR)-based genetic screens using single-guide-RNA (sgRNA) libraries have proven powerful to identify genetic regulators. Applying CRISPR screens to interrogate functional elements in noncoding regions requires generating sgRNA libraries that are densely covering, and ideally inexpensive, easy to implement and flexible for customization. Here we present a Molecular Chipper technology for generating dense sgRNA libraries for genomic regions of interest, and a proof-of-principle screen that identifies novel cis-regulatory domains for miR-142 biogenesis. The Molecular Chipper approach utilizes a combination of random fragmentation and a type III restriction enzyme to derive a densely covering sgRNA library from input DNA. Applying this approach to 17 microRNAs and their flanking regions and with a reporter for miR-142 activity, we identify both the pre-miR-142 region and two previously unrecognized cis-domains important for miR-142 biogenesis, with the latter regulating miR-142 processing. This strategy will be useful for identifying functional noncoding elements in mammalian genomes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27025950 PMCID: PMC4820989 DOI: 10.1038/ncomms11178
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Cloning of a miRNA sgRNA library using the Molecular Chipper method.
(a) Overview of the Molecular Chipper method to generate a sgRNA library from pieces of input DNA. (b) Detailed schematics of the Molecular Chipper procedure. Briefly, an EcoP15I-site-containing adaptor is ligated to randomly fragmented DNA ends, and enzymatically released 20 bases (a G base plus 19 bases from ends of DNA fragments) are cloned as a pool into a viral vector. (c) Seventeen murine miRNAs (or miRNA cluster) and their flanking genomic sequences were used to generate a sgRNA library. Length distribution of the targeting portions of sgRNAs within the library is shown. Note that the length was calculated by one base G (in adaptor) plus the length of random ends of fragments from input DNA. The counts for each length are normalized to those of the 20-base-targeting motif sgRNAs within each biological replicate. Error bars represent s.d. N=3 biological replicates. (d) The distributions of the distances between neighbouring sgRNAs with NGG-PAM, based on all sgRNAs detected in deep sequencing, are shown (red line). The median neighbour distance is 8 bp. Theoretical distribution assumes all possible NGG-PAM sgRNAs (blue line) are present. (e) Top: diagram showing that the 17 murine miRNAs (or miRNA cluster) and their flanking genomic sequences were used to generate a sgRNA library. Bottom: representative graphs of sgRNA counts mapping to the miR-142 region or to the miR-126 region from one out of three neg-GFP samples is shown, with blue and red indicating mapping to sense and antisense strands, respectively. The positions of sgRNAs plotted were only based on positions of the last targeting domain base.
Figure 2A screen using the Molecular-Chipper-generated sgRNA library to identify both known and unknown functional cis-elements for miR-142 expression.
(a) A diagram showing the miR-142 reporter design and the screen rationale. (b) Representative flow cytometry plots (out of three biological replicates) are shown for BaF3 miR-142-3p reporter cells transduced with a control vector or the sgRNA library. Number indicates the percentage of gated population. (c) Neg-, low-, med- and high-GFP cells were FACS sorted, and then resorted to improve purify. A representative flow cytometry plot is shown for the four indicated populations after sorting and resorting. (d) Competitive proliferation of high-GFP cells and neg-GFP cells was determined. Neg-GFP and high-GFP BaF3 miR-142-3p reporter cells (both mCherry positive) were FACS sorted and mixed with mCherry-negative BaF3 cells. The relative ratio of mCherry-positive to mCherry-negative cells was determined by flow cytometry at the indicated days. Data from high-GFP cells were normalized against those from low-GFP cells. N=3 biological replicates. Error bars represent s.d. Note the absence of strong selection against high-GFP cells. (e) Mouse miR-142-3p, miR-142-5p and miR-222-3p expression levels in neg-, low-, med- and high-GFP populations (from samples in (c)) were determined by qRT–PCR. The relative expression levels are labelled relative to that in neg-GFP samples. Note that data are shown in log scale. Also note that the miR-222-3p expression is shown as a control. N=3 technical replicates. Error bars represent s.d. Data are from a representative experiment out of two performed.
Figure 3Identification and validation of the 5′- and 3′-hit regions of miR-142.
(a) Log2 enrichment of sgRNAs in low-GFP cells versus neg-GFP cells is shown for miR-142 in biological triplicates. X axis indicates position in bp. Horizontal black bars indicate the locations of mature miRNAs. Blue and red indicate enriched sgRNAs that were mapped to sense and antisense strand, respectively. Note that the positions of sgRNAs plotted were based on positions of the last targeting motif base. Blue and red boxes indicate 5′- and 3′-hit regions. (b) Log2 enrichment of sgRNAs in low-GFP cells versus neg-GFP cells is shown for miR-126, as a control, in biological triplicates. (c) Single sgRNAs from the hit regions were transduced into BaF3 miR-142-3p reporter cells. The distribution of GFP levels was determined by flow cytometry. Representative flow cytometry plots are shown, with numbers indicating the percentage of cells within the gate. Note that five single sgRNAs were tested and colour coded in the figure, including one from mature miR-142-5p region, two in the 5′-hit region and two in the 3′-hit region. (d) Mouse miR-142-3p expression levels in neg-, low-, med- and high-GFP populations sorted from reporter cells transduced with the four single sgRNAs (as in c) were determined by qRT–PCR. The relative expression levels were normalized to that in neg-GFP samples. Note that data are shown in log scale. N=3 technical replicates. Error bars represent s.d. Data are from a representative experiment out of two performed. (e) Low-GFP and high-GFP populations transduced with the four sgRNAs were sorted, and genomic DNA was PCR amplified around miR-142 locus and TA cloned. The deletions in low-GFP (top) and high-GFP (bottom) cells are shown within a schematic diagram depicting the miR-142 locus. Horizontal black bars represent mature miR-142 miRNAs. Deletion alleles are colour coded as in c, with short vertical bars in deletion regions indicating the positions of sgRNAs. Positions of sgRNAs correspond to the positions of the last base in the targeting domain.
Figure 4The 5′- and 3′-hit regions of pri-miR-142 regulate miR-142 biogenesis.
(a) Designs of miRNA processing reporters for control (ctrl), wild-type (WT) miR-142 and its deletion mutants. The narrow vertical blue bar upstream of the 3′-hit region depicts a putative CNNC site, which was not disrupted by the deletions. (b) Cleavage efficiencies of the indicated mouse miR-142 processing reporters were determined in the indicated cell lines. *P<0.05; **P<0.01; NS, not significant; Student's t-test. N=3 biological replicates. Data from a representative experiment out of two performed. Error bars represent s.d. (c) NIH 3T3 cells with very low endogenous miR-142 expression were transduced with the indicated mouse miR-142 processing reporters. The expression levels of mature mouse miR-142-3p were determined. **P<0.01; Student's t-test. N=3 biological replicates. Data from a representative experiment out of two performed. Error bars represent s.d.