| Literature DB >> 29734294 |
Kevin R Roy1,2,3,4, Justin D Smith1,4, Sibylle C Vonesch5, Gen Lin5, Chelsea Szu Tu5, Alex R Lederer5, Angela Chu1,6, Sundari Suresh1,6, Michelle Nguyen1,4, Joe Horecka1,6, Ashutosh Tripathi7, Wallace T Burnett1,4, Maddison A Morgan1,4, Julia Schulz1,4, Kevin M Orsley1,4, Wu Wei1,4, Raeka S Aiyar1, Ronald W Davis1,4,6, Vytas A Bankaitis7,8,9, James E Haber10, Marc L Salit2,3, Robert P St Onge1,6, Lars M Steinmetz1,3,4,5.
Abstract
Our understanding of how genotype controls phenotype is limited by the scale at which we can precisely alter the genome and assess the phenotypic consequences of each perturbation. Here we describe a CRISPR-Cas9-based method for multiplexed accurate genome editing with short, trackable, integrated cellular barcodes (MAGESTIC) in Saccharomyces cerevisiae. MAGESTIC uses array-synthesized guide-donor oligos for plasmid-based high-throughput editing and features genomic barcode integration to prevent plasmid barcode loss and to enable robust phenotyping. We demonstrate that editing efficiency can be increased more than fivefold by recruiting donor DNA to the site of breaks using the LexA-Fkh1p fusion protein. We performed saturation editing of the essential gene SEC14 and identified amino acids critical for chemical inhibition of lipid signaling. We also constructed thousands of natural genetic variants, characterized guide mismatch tolerance at the genome scale, and ascertained that cryptic Pol III termination elements substantially reduce guide efficacy. MAGESTIC will be broadly useful to uncover the genetic basis of phenotypes in yeast.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29734294 PMCID: PMC5990450 DOI: 10.1038/nbt.4137
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1The MAGESTIC pipeline for multiplexed precision genome editing
(a) Linking guide-donors to short DNA barcodes. (1) A complex pool of array-synthesized oligonucleotides encoding guide-donors is amplified and cloned to generate the step 1 library (see Methods). The reverse primer introduces a semi-random 31-mer barcode into each ligation product, and NGS enables sequence validation and computational mapping of each guide-donor sequence in the step 1 library to a unique barcode. (b) Insertion of the Cas9 structural guide component plus yeast (HIS3) and bacterial (kanR) selection markers in between the guide and donor. (1) This final step 2 library is transformed into yeast cells such that the vast majority of transformants uptake a single plasmid which accumulates to high-copy number. Each cell harbors a barcode integration locus with a counter-selectable marker (FCY1). Guide-donor plasmids harbor a second guide expression unit (guide X) to promote barcode integration, as guide X cleavage sites flank FCY1. Cas9 and guide expression results in simultaneous cleavage of the guide-donor plasmid at a guide X site adjacent to the downstream homology (DH), target site editing (right), and genomic integration of the guide-marker-donor-barcode cassette (left). (c) Library-scale genome editing and competitive growth phenotyping. (1) The guide-donor plasmids allow editing throughout the genome, while the barcode integration site is constant. (2) Pooled growth in different conditions results in enrichment or depletion of variants that affect fitness. (3) Variant fold-changes are calculated based on barcode sequencing counts in treated vs. untreated conditions.
Figure 2Simultaneous genome editing, guide-donor barcode integration, and plasmid self-destruction
(a) WT and nej1Δ were transformed with GAL-Cas9 and a guide-donor cassette to introduce a premature termination codon (PTC) in the ADE2 gene. Cas9 expression was induced by galactose and aliquots were harvested at the indicated generations. The ADE2 locus was analyzed by NGS and the fractions of WT sequence, NHEJ indels, and donor DNA-directed editing (either perfect or imperfect repair) were calculated (see Methods). The line graph shows the mean percentages at each generation from duplicate experiments. (b) Integration of the guide-donor barcode was assayed by amplification targeting the chromosomal barcode locus for the single ADE2 guide-donor plasmid (top) as well as a complex pool of >100,000 barcoded guide-donor plasmids (bottom). The uncropped gel image indicates an absence of detectable NHEJ indel events at the barcode locus. Self-destruction of the guide-donor plasmids was assessed by a three-primer PCR, with a common forward primer and either a guide-donor plasmid-specific primer (top band) or a Cas9-plasmid specific primer (bottom band). (c) Cultures at the indicated generations of galactose induction were plated in quadruplicate at a density of ~1000 cells per plate on rich medium (YPD) and FCY1 counter-selectable medium (5-FC). The fraction of surviving colonies on plates are shown. All experiments were repeated with three biological replicates starting from independent transformations of the guide-donor plasmids.
Figure 3Active recruitment of donor DNA to Cas9-induced dsDNA breaks increases HR efficiency
(a) A protein fusion of Fkh1p to the LexA DNA-binding domain (LexA-Fkh1p) enables recruitment of donor DNA directly to dsDNA breaks (DSBs). DSBs result in the accumulation of proteins phosphorylated on specific threonine residues (pT) near the site of the break. The interaction between Fkh1p and various pT-containing proteins (including Mph1p, Fdo1p, and additional unidentified proteins) recruits LexA-Fkh1p to DSBs, which in turn recruits donor DNA via LexA binding sites the plasmid. (b) ADE2-guide donor plasmids with (bottom) or without (top) LexA sites were mixed with a non-functional ADE2 guide-donor plasmid at a ratio of 17:3, and transformed into a strain pre-expressing TEF1-Cas9 with (right) or without (left) LexA-Fkh1p. Red colonies indicate cells that received a functional ADE2 guide-donor and survived editing, while white colonies represent cells that received the non-functional ADE2 guide. The bar chart depicts the mean percentage of red colonies (y-axis) determined by counting 3 plates per condition (x-axis). Error bars represent the standard deviation. (c) The ADE2 locus was analyzed as in Fig. 2. Because ade2 is a detrimental mutation, ade2 null colonies are smaller and thus contribute slightly less sequence reads per colony relative to white colonies. The stacked bar chart (left) indicates that >99.5% of the sequence is WT or perfect donor repair. The inset bar chart (right) shows the remaining <0.5% of editing events. (see Methods and Supplementary Table 1).
Figure 4Saturation mutagenesis of an essential eukaryotic gene and structure-function mapping of drug resistance
(a) Synonymous codon spreading strategy for complete saturation mutagenesis of ORFs (see Supplementary Figure 3). (b) Suppressor strategy for assaying function of Sec14p mutants (see Methods). (c) Heat-maps depicting the fitness cost of each mutation in SEC14 on protein function (top) and resistance to the Sec14 small-molecule inhibitor NPPM (bottom). The normalized relative abundance of each variant in the diploid library was assessed by sequencing (top), and read counts following ~12 generations of growth in 8μM NPPM versus DMSO control were used to calculate relative resistance. Rows indicate specific amino acid mutations, and columns indicate amino acid position in Sec14. Color intensity indicates the degree to which each amino acid mutation negatively impacted Sec14 function (orange, top panel), and increased (blue) or decreased (red) Sec14 resistance to NPPM (bottom panel). The light grey blocks indicate variants with insufficient read counts (see Methods). The screen was conducted in biological replicates. (d) The Sec14 α-carbon backbone (grey) and the mutated window (orange), with the NPPM modeled into both the open (left) and closed (right) conformers of Sec14. Side chains critical for protein function (top panel) and NPPM resistance (middle and bottom panels) are highlighted relative to the predicted binding position for NPPM. (e) The indicated mutants were independently reconstructed and grown with 8 μM NPPM. OD600 was followed over 20 hours of growth, with WT growth plotted as reference (black line). Growth assays were conducted in biological triplicate with produced nearly identical results (Supplementary Table 2).
Figure 5Global profiles of guide efficacy and mismatch tolerance for engineering of natural variants
(a) Number of individual genetic variants between RM11 and S288c (b) Log2-fold change (logFC) of barcodes in each guide class post-editing vs. pre-editing (dead guides: indels within 15 bp from PAM or >=2 mismatches within 18 bp; mutated guides: 1 mismatch within 18 bp or indel from 16–18 bp; near-perfect: mismatches at positions 19 or 20). High T-score is defined as >= 5 (see Methods). (c) Dead and perfect guide-donor logFC by distance of the variant allele from the PAM. Variants shown in the N of the NGG consisted solely of indels or linked variants and harbor additional disruption positions upstream or downstream. (d) Boxplots of logFC by T-score bin with Azimuth score bins of the same size (one-sided Wilcoxon p-values: * = 0.03182, *** = 8.12E-14). (e) Normalized RNA abundances as a function of T-score for sequence-perfect guides (left) and guides with synthesis-derived indels (right). RNA levels were determined by targeted RT-PCR of the guide RNAs and PCR of the guide DNA sequences followed by high-throughput sequencing. RNA and DNA levels were analyzed in biological triplicates and similar results were obtained with random hexamers and with a structural guide-specific primer for reverse-transcription. Box and violin plots show median value, and 25th and 75th quantiles. The number of barcodes (N) analyzed in each group is shown at the top of each plot for panel b and d, and in the Methods, section statistical analysis, for panel c.