| Literature DB >> 27363488 |
Jonas Dehairs1, Ali Talebi1, Yacine Cherifi2, Johannes V Swinnen1.
Abstract
The advent of next generation gene editing technologies has revolutionized the fields of genome engineering in allowing the generation of gene knockout models and functional gene analysis. However, the screening of resultant clones remains challenging due to the simultaneous presence of different indels. Here, we present CRISP-ID, a web application which uses a unique algorithm for genotyping up to three alleles from a single Sanger sequencing trace, providing a robust and readily accessible platform to directly identify indels and significantly speed up the characterization of clones.Entities:
Mesh:
Year: 2016 PMID: 27363488 PMCID: PMC4929496 DOI: 10.1038/srep28973
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Input, processing and output of the CRISP-ID application.
After uploading a trace file, CRISP-ID draws a chromatogram that displays the sequence peaks and base calls. The user can tweak the base calling by trimming the start and the end of the chromatogram and by adjusting the background cut-off. To obtain the sequences, a “frame primer” is constructed, containing only the homozygous base calls after the start of the spectral shift (A). This frame primer runs over the entire length of the overlapping spectrum. Provided that the frame primer is sufficiently long, it will align with the reference sequence once for each sequence that is present in the mix (B). If no alignments can be found initially, the primer is iteratively trimmed until either alignments are found, or a minimal size of 10 homozygous base calls is reached. Trimming the 3′ end of the frame primer might be necessary due to poor quality base calls near the end of the sequence run, trimming the 5′ end is necessary in case of insertions, and is set to 10 bases by default (not shown in figure). Initially, a “first guess” of the sequences is constructed based on peak height, with the first sequence containing the highest peaks (C). The first sequence is then compared to the reference sequence according to the first frame found in step B. In case of a base mismatch, the base is swapped with the second sequence. During this process of matching the first sequence to the reference sequence, the swapping inherently results in simultaneously solving the second sequence (D). Finally, the user is presented with a multiple alignment of the de-convoluted sequences and the reference sequence, revealing the size and locus of the indels.
Validation of CRISP-ID compared to single colony cloning.
| Clone | Indel location/size CRISP-ID | Indel location/size single colony | base identity (%) |
|---|---|---|---|
| 83_163 del | identical | 100 | |
| 157 del | identical | 100 | |
| 157_158 Ins 1 | identical | 100 | |
| 157_158 del | identical | 100 | |
| 157 del | identical | 100 | |
| 157_158 Ins 1 | identical | 100 | |
| 157_158 Ins 1 | identical | 100 | |
| 157_162 del | identical | 100 | |
| 157_160 del | identical | 100 | |
| 157_165 del | identical | 100 | |
| 156_158 Ins 1 | identical | 100 | |
| 157_160 del | identical | 99.5 | |
| 152_161 del | identical | 100 | |
| 157 del | identical | 99.5 | |
| 157_158 Ins 1 | identical | 99 | |
| 40_42 del | identical | 100 | |
| 40_41 del | identical | 100 | |
| 41_42 del | identical | 100 | |
| 40_42 del | identical | 100 | |
| 42_45 del | identical | 100 | |
| 42_43 del | identical | 100 | |
| 42 del | identical | 100 | |
| 40_45 del | identical | 100 | |
| 43_44 del | identical | 100 | |
| 42_43 del | identical | 100 | |
| 42 del | identical | 100 | |
| 42 Ins 1 | identical | 100 | |
| 41_42 del | identical | 100 | |
| 41 del | identical | 100 | |
| 17_20 del | identical | 100 | |
| 13 del | identical | 100 | |
| 19_20 Ins 1 | identical | 99.3 | |
| 20_28 del | identical | 100 | |
| 19_20 Ins 1 | identical | 100 | |
| 20_28 del | identical | 100 | |
| 19_20 Ins 1 | identical | 100 | |
| −29_18 del | identical | 100 | |
| 17 del | identical | 100 | |
| 22_29 del | identical | 100 | |
| 28_29 Ins 1 | identical | 98.5 | |
| 28_29 Ins 6 | identical | 99.3 | |
| 104_151 del | identical | 100 | |
| 105_151 del | identical | 100 | |
| 104_151 del | identical | 100 | |
| 104_150 del | identical | 100 | |
| 104_149 del | identical | 100 | |
| 105_154 del | identical | 100 | |
| 105_151 del | identical | 100 | |
| wild type | identical | 99.5 | |
| 66_84 del | identical | 100 | |
| wild type | identical | 100 | |
| 75_76 Ins 1 | identical | 100 |
22 single cell-derived clones with a CRISPR-Cas9 mediated KO were analyzed by Sanger sequencing and CRISP-ID, or by the single colony method followed by Sanger Sequencing.