| Literature DB >> 28487860 |
Fuyi Xu1, Tianzhu Chao1, Yiyin Zhang1, Shixian Hu1, Yuxun Zhou1, Hongyan Xu2, Junhua Xiao1, Kai Li1.
Abstract
The Chinese Kunming (KM) mouse is a widely used outbred mouse stock in China. However, its genetic structure remains unclear. In this study, we sequenced the genome of the C57BL/6J-Chr1KM (B6-Chr1KM) strain, the chromosome 1 (Chr 1) of which was derived from one KM mouse. With 36.6× average coverage of the entire genome, 0.48 million single nucleotide polymorphisms (SNPs) and 96,679 indels were detected on Chr 1 through comparison with reference strain C57BL/6J. Moreover, 46,590 of them were classified as novel mutations. Further functional annotation identified 155 genes harboring potentially functional variants, among which 27 genes have been associated with human diseases. We then performed sequence similarity and Bayesian concordance analysis using the SNPs identified on Chr 1 and their counterparts in three subspecies, Mus musculus domesticus, M. m. musculus, and M. m. castaneus. Both analyses suggested that the Chr 1 sequence of B6-Chr1KM was predominantly derived from M. m. domesticus while 9.7% of the sequence was found to be from M. m. musculus. In conclusion, our analysis provided a detailed description of the genetic variations on Chr 1 of B6-Chr1KM and a new perspective on the subspecies origin of KM mouse which can be used to guide further genetic studies with this mouse strain.Entities:
Year: 2017 PMID: 28487860 PMCID: PMC5401761 DOI: 10.1155/2017/1712530
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1The characteristics of B6-Chr1KM genome background. Blue bars represent B6 chromosome while the red represents KM mouse chromosome.
Figure 2Distribution of SNP density on B6-Chr1KM Chr 1. The SNP density is represented by the number of SNPs mapped within 100 kb physical intervals across Chr 1.
Predictions of functional consequences of SNPs and indels.
| Consequences | SNPs | Novel SNPs | Indels | Novel indels |
|---|---|---|---|---|
| splice_donor_variant | 29 | 9 | 2 | 0 |
| splice_acceptor_variant | 11 | 0 | 4 | 0 |
| stop_gained | 19 | 8 | 1 | 1 |
| frameshift_variant | 0 | 0 | 22 | 7 |
| stop_lost | 5 | 1 | 1 | 0 |
| start_lost | 11 | 3 | 2 | 0 |
| missense_variant | 2378 | 486 | — | 0 |
| inframe_insertion | 0 | 0 | 28 | 2 |
| inframe_deletion | 0 | 0 | 26 | 2 |
| splice_region_variant | 1117 | 63 | 244 | 18 |
| synonymous_variant | 4238 | 281 | 0 | 0 |
| stop_retained_variant | 3 | 0 | 0 | 0 |
| coding_sequence_variant | 1 | 0 | 1 | 0 |
| mature_miRNA_variant | 4 | 2 | 2 | 0 |
| 5_prime_UTR_variant | 1210 | 86 | 198 | 31 |
| 3_prime_UTR_variant | 6563 | 484 | 1617 | 191 |
| non_coding_transcript_exon_variant | 11,955 | 640 | 2140 | 290 |
| intron_variant | 575,013 | 36,838 | 139,815 | 21,458 |
| NMD_transcript_variant | 42,052 | 2609 | 10,291 | 1372 |
| non_coding_transcript_variant | 143,110 | 8770 | 32,985 | 5190 |
| upstream_gene_variant | 96,888 | 8357 | 24,321 | 3992 |
| downstream_gene_variant | 93,184 | 5752 | 23,557 | 3390 |
| intergenic_variant | 224,557 | 13,198 | 48,568 | 8312 |
Consequences were predicted using Ensembl VEP and gene models from Ensembl version 76. Novel SNPs or indels are defined as variants that were not in MGP and dbSNP142 data sets.
List of human disease-associated genes with loss of function variants in B6-Chr1KM Chr 1.
| Gene | Ensembl ID | Variant type | Phenotype | OMIM ID |
|---|---|---|---|---|
| Col4a3 | ENSMUSG00000079465 | Frameshift | Alport syndrome, autosomal dominant |
|
| Alport syndrome, autosomal recessive |
| |||
| Hematuria, benign familial; BFH |
| |||
| Fn1 | ENSMUSG00000026193 | Frameshift | Glomerulopathy with fibronectin deposits 2; GFND2 | 601894 |
| Plasma fibronectin deficiency | 614101 | |||
| Pde6d | ENSMUSG00000026239 | Splice donor | Joubert syndrome 22; JBTS22 | 615665 |
| Hmcn1 | ENSMUSG00000066842 | Frameshift | Macular degeneration, age-related, 1; ARMD1 | 603075 |
| Cd244 | ENSMUSG00000004709 | Stop gain; splice donor | Rheumatoid arthritis; RA | 180300 |
| Rab3gap2 | ENSMUSG00000039318 | Splice acceptor, missense | Martsolf syndrome | 212720 |
| Warburg micro syndrome 2; WARBM2 | 614225 | |||
| Lamb3 | ENSMUSG00000026639 | Splice acceptor | Amelogenesis imperfecta, type IA; AI1A | 104530 |
| Epidermolysis bullosa, junctional, Herlitz type |
| |||
| Epidermolysis bullosa, junctional, non-Herlitz type |
| |||
| Dst | ENSMUSG00000026131 | Missense | Epidermolysis bullosa simplex, autosomal recessive 2; EBSB2 | 615425 |
| Neuropathy, hereditary sensory and autonomic, type VI; HSAN6 |
| |||
| Ercc5 | ENSMUSG00000026048 | Missense | Xeroderma pigmentosum, complementation group G; XPG | 278780 |
| Casp8 | ENSMUSG00000026029 | Missense | CASPase 8 deficiency |
|
| Dermatitis, atopic |
| |||
| Tmem237 | ENSMUSG00000038079 | Missense | Joubert syndrome 1; JBTS1 | 213300 |
| Joubert syndrome 14; JBTS14 | 614424 | |||
| Bard1 | ENSMUSG00000026196 | Missense | Breast cancer |
|
| Bcs1l | ENSMUSG00000026172 | Missense | Bjornstad syndrome; BJS | 262000 |
| Gracile syndrome |
| |||
| Leigh syndrome; LS | 256000 | |||
| Mitochondrial complex III deficiency, nuclear type 1; MC3DN1 | 124000 | |||
| Obsl1 | ENSMUSG00000026211 | Missense | Three M syndrome 2; 3 M2 | 612921 |
| Tm4sf20 | ENSMUSG00000026149 | Missense | Specific language impairment 5; SLI5 | 615432 |
| Dis3l2 | ENSMUSG00000053333 | Missense | Perlman syndrome; PRLMNS | 267000 |
| Chrng | ENSMUSG00000026253 | Missense | Multiple pterygium syndrome, Escobar variant; EVMPS | 265000 |
| Multiple pterygium syndrome, lethal type; LMPS | 253290 | |||
| Ugt1a1 | ENSMUSG00000089960 | Missense | Crigler-Najjar syndrome, type I | 218800 |
| Crigler-Najjar syndrome, type II | 606785 | |||
| Gilbert syndrome | 143500 | |||
| Hyperbilirubinemia, transient familial neonatal; HBLRTFN | 237900 | |||
| Steap3 | ENSMUSG00000026389 | Missense | Anemia, hypochromic microcytic, with iron overload 2; AHMIO2 | 615234 |
| Ube2t | ENSMUSG00000026429 | Missense | Fanconi anemia, complementation group T; FANCT | 616435 |
| Ppox | ENSMUSG00000062729 | Missense | Porphyria variegata |
|
| Ackr1 | ENSMUSG00000037872 | Missense | Malaria, susceptibility to | 611162 |
| Spta1 | ENSMUSG00000026532 | Missense | Elliptocytosis 2; EL2 | 130600 |
| Pyropoikilocytosis, hereditary; HPP | 266140 | |||
| Spherocytosis, type 3; SPH3 |
| |||
| Ephx1 | ENSMUSG00000038776 | Missense | Epoxide hydrolase 1, microsomal; EPHX1 | 132810 |
| Hypercholanemia, familial; FHCA | 607748 | |||
| Preeclampsia/eclampsia 1; PEE1 | 189800 | |||
| Rd3 | ENSMUSG00000049353 | Missense | Leber congenital amaurosis 12; LCA12 |
|
| Cd46 | ENSMUSG00000016493 | Missense | Hemolytic uremic syndrome, atypical, susceptibility to, 2; AHUS2 | 612922 |
| Cr2 | ENSMUSG00000026616 | Missense | Immunodeficiency, common variable, 2; CVID2 | 240500 |
| Immunodeficiency, common variable, 7; CVID7 | 614699 | |||
| Systemic lupus erythematosus, susceptibility to, 9; SLEB9 | 610927 |
OMIM: online Mendelian inheritance in man. Numbers in italic in OMIM ID column indicate that these diseases have mouse models. Human disease-related phenotypes come from “Human-Mouse: Disease Connection” database (http://www.informatics.jax.org/humanDisease.shtml) in Mouse Genome Informatics website.
Figure 3Sequence similarity between B6-Chr1KM and WSB, PWK, and CAST Chr 1. (a) Distribution of the numbers of 100 kb blocks of the B6-Chr1KM Chr 1 with sequence similarities (%) to the corresponding blocks of the WSB, PWK, and CAST Chr 1. (b) Sliding window analysis of the similarities of Chr 1 sequences between B6-Chr1KM and WSB, CAST, or PWK. The B6-Chr1KM Chr 1 sequence was compared using 500 kb windows and 100 kb sliding intervals. The horizontal line indicates the level of 99.7% sequence similarity.
Figure 4Phylogenetic analysis of B6-Chr1KM Chr 1. (a) Fine-scale phylogenetic discordance of B6-Chr1KM Chr 1 (PP indicates posterior probability). Red represents WSB, green indicates PWK, and white represents unknown. (b) Phylogenetic tree of the WSB-derived or PWK-derived sequences of B6-Chr1KM and the wild-derived inbred mouse strain sequences. A neighbor-joining tree was generated using MEGA6 software. Red and green indicate regions supporting a single topology for KM/WSB and KM/PWK, respectively, which are both associated with a high posterior probability, as determined by BCA.