| Literature DB >> 30654827 |
Christopher L Nobles1, Shantan Reddy1, January Salas-McKee2,3,4,5, Xiaojun Liu2,3,4,5, Carl H June2,3,4,5, J Joseph Melenhorst2,3,4,5, Megan M Davis2,3,4,5, Yangbing Zhao2,3,4,5, Frederic D Bushman6.
Abstract
Genome engineering methods have advanced greatly with the development of programmable nucleases, but methods for quantifying on- and off-target cleavage sites and associated deletions remain nascent. Here, we report an improvement of the GUIDE-seq method, iGUIDE, which allows filtering of mispriming events to clarify the true cleavage signal. Using iGUIDE, we specify the locations of Cas9-guided cleavage for four guide RNAs, characterize associated deletions, and show that naturally occurring background DNA double-strand breaks are associated with open chromatin, gene dense regions, and chromosomal fragile sites. iGUIDE is available from https://github.com/cnobles/iGUIDE .Entities:
Mesh:
Year: 2019 PMID: 30654827 PMCID: PMC6337799 DOI: 10.1186/s13059-019-1625-3
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Diagram of the method, illustrating the strategy for improving specificity and examples of output. Procedure for GUIDE-seq (a) and iGUIDE (b). A dsODN is incorporated into DNA breaks. Amplification of flanking DNA, by nested-PCR, produces sequence copies indistinguishable from genomic mispriming when using the GUIDE-seq design. The modified dsODN of iGUIDE uses a reporter present in sequence output to identify correctly primed molecules. c Alignment of amplification primer and upstream sequence from uniquely identified sites in either GUIDE-seq or iGUIDE samples. We reasoned that amplification products resulting from mispriming should be just adjacent to sequences in the human genome with adventitious matches to the amplification primer sequence. Evidence for greater matching to primer sequences in a sample thus provides evidence for more mispriming. In the figure, the x-axis scores the match of the inferred human flanking DNA to the amplification primer (marked 2 in a and b); higher numbers of matching bases than seen for random sequences (light red) indicates probable mispriming. P values compare the distributions of the matches to the primer sequences in DNA samples detected for GUIDE-seq (top) and iGUIDE (bottom). d Sequence coverage of an on-target CRISPR site from iGUIDE data (gRNA targeting B2M)
Fig. 2Distributions of DNA double-strand breaks in human cells analyzed by GUIDE-seq and iGUIDE. Sequences of suspected Cas9 edited sites associated with either the B2M (a), TRAC5 (b), or VEGFA guideRNAs (c, d). The number of guideRNA mismatches are annotated to the right of the associated sequence, as well as the number of inferred cells sampled, as reported by GUIDE-seq (Gs) or iGUIDE (iG) data. e–g Analysis of the distribution of spontaneous DNA double-strand breaks in cells relative to genomic annotation. Each column shows, from left to right, analysis of sites of dsDNA breaks inferred by iGUIDE and GUIDE-seq. The third column shows sites of lentiviral vector integration in T cells from Fraietta et al. [22] for comparison—HIV favors integration in active transcription units, which is reflected in the integration site preferences [23–25]. Rows summarize the relationship of each form of genomic annotation on the human genome to mapped sites. To generate the heat maps, sites are correlated with the density of genomic annotation in intervals along the genome, and co-occurrence summarized as receiver operating characteristic (ROC) curves. Positive associations (> 0.5) are shown by the higher values (red), negative associations (< 0.5) by the lower values (blue). No association (0.5) is shown white. Because the relevant widow size for comparison is unknown, multiple window sizes were tested. Asterisks on each tile compare the statistical significance for comparison to no association. * indicates 0.05 > p > 0.01; ** indicates 0.01 > p > 0.001; *** indicates p < 0.001. e, f: as in (g), but associations are shown relative to epigenetic marks mapped in T cells. In the analysis, 10 Kb chromosomal intervals were used for the comparison