| Literature DB >> 26449678 |
Michelle M Simon1, Eva Marie Y Moresco2, Katherine R Bull3,4, Saumya Kumar5, Ann-Marie Mallon5, Bruce Beutler2, Paul K Potter5.
Abstract
Mutagenesis-based screens in mice are a powerful discovery platform to identify novel genes or gene functions associated with disease phenotypes. An N-ethyl-N-nitrosourea (ENU) mutagenesis screen induces single nucleotide variants randomly in the mouse genome. Subsequent phenotyping of mutant and wildtype mice enables the identification of mutated pathways resulting in phenotypes associated with a particular ENU lesion. This unbiased approach to gene discovery conducts the phenotyping with no prior knowledge of the functional mutations. Before the advent of affordable next generation sequencing (NGS), ENU variant identification was a limiting step in gene characterization, akin to 'finding a needle in a haystack'. The emergence of a reliable reference genome alongside advances in NGS has propelled ENU mutation discovery from an arduous, time-consuming exercise to an effective and rapid form of mutation discovery. This has permitted large mouse facilities worldwide to use ENU for novel mutation discovery in a high-throughput manner, helping to accelerate basic science at the mechanistic level. Here, we describe three different strategies used to identify ENU variants from NGS data and some of the subsequent steps for mutation characterisation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26449678 PMCID: PMC4602060 DOI: 10.1007/s00335-015-9603-x
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Fig. 1Overview of ENU mutation detection methods used on DNA-Seq data. Method 1 Male C57BL/6J mice mutagenized with ENU are bred to produce 50–100 third generation (G3) mice carrying mutations mostly in the heterozygous state. The G1 male founder of each pedigree is sent for whole genome sequencing. The G3 mice are put through a phenotyping screen and affected mice are genotyped with a SNP panel to identify ENU regions. Specific ENU SNPs within the candidate region are validated via Sanger Sequencing. After secondary phenotyping and inheritance testing a copy of the potential causative mutation may be generated with CRISPR/Cas9 targeting. Method 2 Two C57BL/6J mice are mutageneised with ENU, each are paired with WT C57BL/6J females to produce third generation mice carrying 4 possible haplotypes, ENU1, ENU2, WT1 and WT2. After phenotype testing 3 phenovariant G3 mice are sent for low coverage whole genome sequencing. Shared homozygous ENU variants seen in all 3 mice cluster in an IBD region, detected using the Lander-Green algorithm. Coding variants within the IBD are validated via Sanger Sequencing. Alternative alleles may be generated using CRISPR/Cas9 targeting. Method 3 Male C57BL/6J mice mutagenized with ENU are bred to produce 30–50 third generation (G3) mice carrying mutations in homozygous and heterozygous state. The G1 male founder of each pedigree is subjected to exome sequencing, and data are used to generate Ampliseq panel primers for amplification of mutated loci from G2 and G3 mouse DNA, followed by Ion PGM 200-bp sequencing. Genotyping data are uploaded to Mutagenetix prior to phenotypic screening. Quantitative phenotype data are entered into Mutagenetix and used with genotype data for mapping by Linkage Analyzer. Calculated P values for non-linkage, Manhattan plots, and scatter plots of phenotypic data for every mutant allele are displayed by Linkage Explorer. Confirmation of candidate genes depends on duplication of the mutant phenotype by a second allele, which may be generated by CRISPR/Cas9 targeting
Fig. 2Identification of ENU mutations using polymorphic markers on a mixed background. a WGS of 3 G1 samples showing heterozygous inbred SNP sites, which are shared among all samples. These sites are eliminated from the ENU mutation list; the remaining SNPs (b) are novel or ENU-induced. c Illustrates a simplistic view of randomly distributed ENU SNVs in a chromosome of a G1 mouse. The WGS of the G1 denotes the genomic location of the ENU SNVs in the candidate region of an affected G3 mouse
Fig. 3Identification of IBD regions using a modified Lander–Green Algorithm, a pedigree in strain APFN1015-1017, the sequenced mice are shaded. The gene and genotype for the candidate mutation is shown for each sequenced individual. 1/1 indicates homozygous for mutation, ./. indicates insufficient coverage to call the genotype at that locus in an individual. b Plot showing IBD homozygous (red) and IBD heterozygous (blue) regions predicted by the Lander–Green-based algorithm in APFN1015-1017. c Pedigree for strain ENU22 with genotypes for the Ighm mutation. d Plot showing IBD regions for ENU22
Fig. 4Presentation of mapping data by Linkage Explorer. A portion of a typical results table (top) displays P values for all three transmission models for each mutation, here sorted by phenotype. P values are linked directly to the Manhattan plot (lower left), where mousing over data points reveals the gene name and associated P value. Clicking a data point opens the scatter plot of phenotypic data graphed versus genotype (REF, homozygous for wild type allele; HET, heterozygous for mutant allele; or VAR, homozygous for mutant allele) for the mutation in question (lower right). μ mean, σ standard deviation
Parameters that may be specified in linkage explorer
| Parameter | Notes |
|---|---|
|
| |
| Gene | Will return all phenotypes linked to mutations of the specified gene(s), along with associated |
| Phenotypic screen | When specified, will return mutations linked to the phenotype(s) tested in the specified screen(s) |
| Pedigree or mouse/mice | Will return all genotype–phenotype associations identified in the specified pedigree or the pedigree of which the specified mouse (mice) is (are) part, along with associated |
| Total mouse numbers | Will restrict linkage analysis to pedigrees containing a specified range or number of G3 mice |
| Allele name (phenotype) | Will return all mutations linked to the specified phenotype, along with associated |
| Mutation type | Will restrict linkage analysis to the specified mutation type(s): nonsense, missense, makesense, critical splicing, noncritical splicing |
| Predicted effect of mutation | Will restrict linkage analysis to the specified mutation effect: probably null (corresponds to nonsense and critical splicing mutations); or probably damaging, possibly damaging, probably benign as determined by PolyPhen-2 |
|
| Will display genotype–phenotype associations with |
| Minimum number of HET or VAR mice screened | Will return genotype–phenotype associations tested with at least the specified number of HET (heterozygous) or VAR (homozygous mutant) mice |
| ‘Raw + Norm’ switch | When applied, enforces |
| Direction of phenovariance | Quantitative phenotype scores either higher than or lower than wild type scores |
| Number of linkage peaks | Will return genotype–phenotype associations for which a specified number of linkage peaks exceed the specified −log10[ |
| Date of data collection |
Tools used to predict the functional or structural impact of SNVs
| Tool | URL | Notes | Organism | Reference |
|---|---|---|---|---|
|
| ||||
| SiFT |
| Predicts effect of SNVs | Human and known mouse SNPs (dbSNP) | (Kumar et al. |
| MutationAssessor |
| Predicts effect of SNVs | Human data: cancer studies | (Reva et al. |
| Provean |
| Predicts effect of SNVs, insertions and deletions | Organism independent | (Choi et al. |
|
| ||||
| SNPs3D |
| Predictions based on sequence, 3-D structure, biological networks | Human, useful for association studies | (Yue et al. |
|
| ||||
| Polyphen-2 |
| Implements MSA, amino acid changes, evolutionary conservation, SNV site hypermutability. Uses a naïve Bayes classifier | Human, can be adapted for mouse genome (standalone) | (Adzhubei et al. |
| MutationTaster2 |
| Machine learning on evolutionary conservation, splice site changes, gene expression and protein features. Uses a Bayes classifier | Human, uses 1000G data | (Schwarz et al. |
| SNAP |
| Uses neural networks for evolutionary conservation, secondary structure, solvent accessibility | Human | (Bromberg and Rost |
| Site Directed Mutator (SDM) |
| Uses a potential free energy function for protein stability; algorithm uses environment-specific substitution tables to calculate stability, predicts disease association | Organism independent | (Worth et al. |
|
| ||||
| PhosSNP |
| Predicts SNV effect on PTM | Human | (Ren et al. |
| SNPeffect |
| Predicts SNV effect on PTM, structural features of proteins, subcellular localization and interactions | Human | (De Baets et al. |
|
| ||||
| MuSiC |
| Predicts SNV effect on pathways (Cancer studies). To segregate passenger mutations from truly significant mutations | Human | (Dees et al. |
MSA multiple sequence alignment, PTM post-translational modifications