| Literature DB >> 33096238 |
Diane Gleeson1, Debarati Sethi1, Radka Platte1, Jonathan Burvill1, Daniel Barrett1, Shaheen Akhtar1, Michaela Bruntraeger1, Joanna Bottomley1, Sanger Mouse Genetics Project1, James Bussell1, Edward Ryder2.
Abstract
Genotyping of knockout alleles in mice is commonly performed by end-point PCR or gene-specific/universal cassette qPCR. Both have advantages and limitations in terms of assay design and interpretation of results. As an alternative method for high-throughput genotyping, we investigated next generation sequencing (NGS) of PCR amplicons, with a focus on CRISPR-mediated exon deletions where antibiotic selection markers are not present. By multiplexing the wild type and mutant-specific PCR reactions, the genotype can be called by the relative sequence counts of each product. The system is highly scalable and can be applied to a variety of different allele types, including those produced by the International Mouse Phenotyping Consortium and associated projects. One potential challenge with any assay design is locating unique areas of the genome, especially when working with gene families or regions of high homology. These can result in misleading or ambiguous genotypes for either qPCR or end-point assays. Here, we show that genotyping by NGS can negate these issues by simple, automated filtering of undesired sequences. Analysis and genotype calls can also be fully automated, using FASTQ or FASTA input files and an in-house Perl script and SQL database.Entities:
Keywords: CRISPR; Genotyping; Mouse; Mutant; NGS; QC
Mesh:
Year: 2020 PMID: 33096238 PMCID: PMC8205115 DOI: 10.1016/j.ymeth.2020.10.011
Source DB: PubMed Journal: Methods ISSN: 1046-2023 Impact factor: 3.608
Fig. 1Common knockout designs. End-point PCR assays for EUCOMM/KOMP-CSD mutants are designed to amplify either the wild type or mutant alleles and fail if the allele is not present. Exon deletions follow this same premise, although the mutant assay may amplify a larger size if the exon is sufficiently small. For SNP detection, a single assay spans the region of interest.
Genes and allele types used in the study which were successfully amplified in the NGS pipeline. Genes highlighted in bold are characterized as high-homology (greater than 90% homology over the assay design area).
| Mutation type | Genes |
|---|---|
| EUCOMM/KOMP tm1b | Afap1l1, Id2 |
| SNP | Pcdh15 |
| Exon deletion | Ceacam15, Clic3, Crip1, Etv3, Exo5, Gm17750, Itgam, Msl1, |
Fig. 2NGS genotyping strategy for CRISPR-derived exon deletions. A) In PCR 1, tailed primers containing a linker sequence are amplified in a WT and mutant-specific multiplex reaction. A second round of PCR is then performed using additional tailed primers containing flow cell adaptors and an 8 bp barcode index sequence at each end. B) By using plate-specific and well-specific barcodes, many plates can be multiplexed into one sequencing run. After the sequencing by synthesis is complete, the MiSeq then deconvolutes the clusters and assigns the results to FASTQ files per sample, based on the barcode sequence.
Fig. 3Data analysis pipeline. Mouse ID and strain information is retrieved from the MySQL database by processing the FASTQ filename. Sequences are compared to a database of genotype-specific motifs, any matches reported, and the genotype called on the ratio. FASTQ files can be pre-filtered on the quality score if required.
Ceacam15 genotyping, comparing the Qiaxcel end-point PCR genotypes, LoA qPCR and NGS results. NGS ratios are consistent with previous LoA qPCR genotype calls. Mice are identified by colony prefix, mating, litter and individual. The LoA score is reported here as the relative quantification (RQ) value, where 1 = amplification identical to WT controls and 0 = no amplification (homozygote). Heterozygotes are called where the RQ value is in a defined range of 0.5. PCR gel sizes: WT = 141 bp, mutant = 162 bp.
Fig. 4Genotyping results per mouse strain using next generation sequencing. Ratios are calculated by comparing the number of wild type motif sequence hits against mutant motif hits. The shaded area represents the ideal range for heterozygous genotypes.
Genotype comparisons between LoA qPCR and NGS sequence ratios. High homology genes have a lower genotype concordance than those with no homology, suggesting an issue with the specificity of the design in one or either assay.
| High homology gene | ||
|---|---|---|
| NGS and LoA qPCR agree? | No | Yes |
| Yes | 90% | 82% |
| No | 10% | 18% |
Psg21 genotyping results comparison. Two mice were genotyped as heterozygotes by LoA qPCR and from the PCR 1 reaction used in the NGS experiment. Further analysis, however, revealed that the WT band was spurious, and the true genotype was homozygous (shaded regions). PCR gel sizes: WT = 191 bp, mutant = 157 bp.