| Literature DB >> 27765810 |
Andrew P Morgan1, John P Didion1, Anthony G Doran2, James M Holt3, Leonard McMillan3, Thomas M Keane2, Fernando Pardo-Manuel de Villena4.
Abstract
Wild-derived mouse inbred strains are becoming increasingly popular for complex traits analysis, evolutionary studies, and systems genetics. Here, we report the whole-genome sequencing of two wild-derived mouse inbred strains, LEWES/EiJ and ZALENDE/EiJ, of Mus musculus domesticus origin. These two inbred strains were selected based on their geographic origin, karyotype, and use in ongoing research. We generated 14× and 18× coverage sequence, respectively, and discovered over 1.1 million novel variants, most of which are private to one of these strains. This report expands the number of wild-derived inbred genomes in the Mus genus from six to eight. The sequence variation can be accessed via an online query tool; variant calls (VCF format) and alignments (BAM format) are available for download from a dedicated ftp site. Finally, the sequencing data have also been stored in a lossless, compressed, and indexed format using the multi-string Burrows-Wheeler transform. All data can be used without restriction.Entities:
Keywords: Robertsonian translocations; inbred mouse strains; karyotype evolution; wild-derived mouse strains
Mesh:
Year: 2016 PMID: 27765810 PMCID: PMC5144988 DOI: 10.1534/g3.116.034751
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Sequencing coverage. (A) Profiles of normalized read depth across autosomes and X chromosome. Blue and purple regions show coverage by reads with mapping quality (MQ) > 20; dark gray regions show coverage by reads with MQ < 20. Gray dashed line indicates expected haploid depth. Boxes below axis show positions of segmental duplications > 100 kb in the mm10 reference genome. Note that the results for ZALENDE/EiJ are projections onto the reference genome, since this strains has only 13 chromosomes; chromosome pairs involved in Robertsonian fusions are indicated at right. (B) Histograms of coverage by reads with MQ > 20 across the genome. Gray dashed line indicates median coverage in each sample. (C) Estimated ancestry proportions (dom, M. m. domesticus; mus, M. m. musculus; cas, M. m. castaneus; ?, masked) on autosomes and X chromosome.
Variant-calling statistics
| LEWES/EiJ | ZALENDE/EiJ | Shared | ||||
|---|---|---|---|---|---|---|
| Private SNVs | 403,770 | 612,721 | 102,561 | |||
| Coding | 3619 | 0.90% | 5525 | 0.90% | 672 | 0.66% |
| Damaging | 39 | 0.01% | 67 | 0.01% | 3 | 0.00% |
| Private indels | 92,082 | 157,366 | 19,684 | |||
| Coding | 51 | 0.06% | 86 | 0.05% | 9 | 0.05% |
| Damaging | 49 | 0.05% | 77 | 0.05% | 7 | 0.04% |
Deletions of protein-coding genes
| Strain | Ensembl Gene ID | Gene Symbol | Chromosome |
|---|---|---|---|
| LEWES/EiJ | ENSMUSG00000070868 | 4 | |
| ENSMUSG00000055960 | 4 | ||
| ZALENDE/EiJ | ENSMUSG00000073609 | 1 | |
| ENSMUSG00000094651 | 1 | ||
| ENSMUSG00000093805 | 1 | ||
| ENSMUSG00000089951 | 2 | ||
| ENSMUSG00000070868 | 4 | ||
| ENSMUSG00000055960 | 4 | ||
| ENSMUSG00000049972 | 4 | ||
| ENSMUSG00000055594 | 6 | ||
| ENSMUSG00000067599 | 6 | ||
| ENSMUSG00000091620 | 6 | ||
| ENSMUSG00000094298 | 7 | ||
| ENSMUSG00000094981 | 7 | ||
| ENSMUSG00000093941 | 7 | ||
| ENSMUSG00000091195 | 11 | ||
| ENSMUSG00000091275 | 14 | ||
| ENSMUSG00000096345 | 17 | ||
| ENSMUSG00000079342 | 19 | ||
| ENSMUSG00000079387 | X |
Figure 2Phylogenetic tree of Mouse Genomes Project strains. Tree was constructed from genotypes at 30,000 ancestry-informative SNPs from the Mouse Diversity Array identified in Yang . Filled dots, inbred strains sequenced by Sanger MGP; open dots, wild-caught mice from Yang , identified by two-letter country code. Samples are colored according to subspecies of origin: blue, M. m. domesticus; red, M. m. musculus; maroon, M. m. molossinus (MOLF/EiJ); green, M. m. castaneus. The tree was rooted using SPRET/EiJ (M. spretus) as the outgroup.