| Literature DB >> 22555591 |
Ignaty Leshchiner1, Kristen Alexa, Peter Kelsey, Ivan Adzhubei, Christina A Austin-Tse, Jeffrey D Cooney, Heidi Anderson, Matthew J King, Rolf W Stottmann, Maija K Garnaas, Seungshin Ha, Iain A Drummond, Barry H Paw, Trista E North, David R Beier, Wolfram Goessling, Shamil R Sunyaev.
Abstract
Genetic mapping of mutations in model systems has facilitated the identification of genes contributing to fundamental biological processes including human diseases. However, this approach has historically required the prior characterization of informative markers. Here we report a fast and cost-effective method for genetic mapping using next-generation sequencing that combines single nucleotide polymorphism discovery, mutation localization, and potential identification of causal sequence variants. In contrast to prior approaches, we have developed a hidden Markov model to narrowly define the mutation area by inferring recombination breakpoints of chromosomes in the mutant pool. In addition, we created an interactive online software resource to facilitate automated analysis of sequencing data and demonstrate its utility in the zebrafish and mouse models. Our novel methodology and online tools will make next-generation sequencing an easily applicable resource for mutation mapping in all model systems.Entities:
Mesh:
Year: 2012 PMID: 22555591 PMCID: PMC3409267 DOI: 10.1101/gr.135541.111
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Schematic representation of mutation analysis by NGS and SNP mapping.
Figure 2.Mapping of two alleles of the cloche mutant. (A) Homozygosity scoring of the clo and clo alleles reveals corresponding overlapping peaks on chromosome 13, where the mutation has been previously mapped. (B) Sequence pileup of the clo allele confirms the previously established deletion and reveals the boundaries of the deletion interval. (C) Log likelihood scoring of the clo alleles achieves a higher resolution of the mutation region with a narrower interval than the homozygosity scoring method.
Figure 3.Mapping of ca1. (A) Homozygosity scoring reveals a single peak on chromosome 3. (B) Log likelihood analysis (red line) returns a ∼7-Mb interval for the presumptive mutation, while the homozygosity score (green graph) misses the mutation region due to very low SNP frequency in some areas of the interval. (C) Sequencing of exon 6 of the flr gene reveals a premature stop codon.
Figure 4.Mapping of malbec. (A) Homozygosity scoring maps the mutation to chromosome 7. (B) Log likelihood calculation reveals a ∼23-Mb high-scoring interval. (C) Meiotic map of the mlb locus on chromosome 7. The tightly linked genetic marker snp1 was identified by a traditional chromosomal walk. (D) NGS sequence analysis of mlb and control embryos identified a nonsense mutation in a candidate gene disrupted in mlb. This nonsense mutation is nonrecombinant with the mutant phenotype and resolved all remaining 12 genetic recombinants to a resolution <1/5440 meioses.
Figure 5.Improving the genome assembly using NGS-based SNP mapping. (A) Analysis of homozygous and heterozygous SNPs in the clo mutation region demonstrates abrupt changes in the frequency of heterozygous SNPs, corresponding with three distinct scaffolds. (B) Removal of the scaffolds indicated above improves the HMM log likelihood score.
Figure 6.Application to ENU-induced mouse mutant. (A) Homozygosity scoring reveals a single peak on chromosome 2. (B) Log likelihood analysis highlights a 5.5-Mb interval harboring a single nonsynonymous change in the candidate gene Lrp2.