| Literature DB >> 22916792 |
Kim Wong, Suzannah Bumpstead, Louise Van Der Weyden, Laura G Reinholdt, Laurens G Wilming, David J Adams, Thomas M Keane.
Abstract
BACKGROUND: The FVB/NJ mouse strain has its origins in a colony of outbred Swiss mice established in 1935 at the National Institutes of Health. Mice derived from this source were selectively bred for sensitivity to histamine diphosphate and the B strain of Friend leukemia virus. This led to the establishment of the FVB/N inbred strain, which was subsequently imported to the Jackson Laboratory and designated FVB/NJ. The FVB/NJ mouse has several distinct characteristics, such as large pronuclear morphology, vigorous reproductive performance, and consistently large litters that make it highly desirable for transgenic strain production and general purpose use.Entities:
Mesh:
Year: 2012 PMID: 22916792 PMCID: PMC3491372 DOI: 10.1186/gb-2012-13-8-r72
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Distribution of SNPs, indels and structural variants in the FVB/NJ genome, relative to the reference C57BL/6J genome. Each rectangle represents a chromosome of the mouse genome. Shown in the top panel of each chromosome is the ideogram representation, followed by genome features, as indicated in the legend. SV, structural variant; TE, transposable element.
Predicted functional consequence of SNPs and indels
| Consequence | SNPs | Indels |
|---|---|---|
| 5 kb upstream or downstream | 345,321 | 86,229 |
| 5' or 3' UTR | 19,402 | 3,769 |
| Intronic | 864,790 | 173,237 |
| Synonymous coding | 13,697 | NA |
| Non-synonymous coding | 7,880 | NA |
| Essential splice site | 10a (19) | 0a (3) |
| Stop gain | 42a (71) | NA |
| Stop lost | 6a (16) | NA |
| In-frame codon insertion or deletion | NA | 140 |
| Frameshift | NA | 126 |
| Two or more consequences | 689,535 | 136,895 |
| Within non-coding gene or mature microRNA | 115,453 | 22,735 |
| 5 kb upstream or downstream of non-coding gene | 163,280 | 31,988 |
| Intergenic (>5 kb from a coding or non-coding gene) | 2,247,487 | 416,289 |
A SNP or indel is annotated with a single consequence if the SNP or indel has the same effect on all protein-coding transcripts of a gene. A SNP or indel has two or more consequences if it has different effects on different transcripts of a gene. For example, a SNP may be in the intron of one transcript, and a synonymous SNP in a second transcript. Consequences were predicted using the Ensembl Variant Effect Predictor (VEP) [23] and gene models from Ensembl version 64. Note that a SNP may belong to more than one category. For example, a SNP may be a synonymous SNP in one gene and also within 5 kb of a second gene. aThe number of true stop codon gain, stop loss, and essential splice site consequences after manual inspection. The number in parentheses is the predicted number from VEP (Tables s1, s2 and s3 in Additional file 1). NA, not applicable.
Enrichment of PANTHER ontology terms from a list of 394 genes with radical amino acid substitutions (Grantham matrix score >150)
| PANTHER term | Genes | Percentage of total | Fold enrichment | |
|---|---|---|---|---|
| MF00002:G-protein coupled receptor | 36 | 9.7 | 2.2 | 1.50E-03 |
| MF00001:Receptor | 50 | 13.4 | 1.7 | 4.10E-02 |
| MF00224:KRAB box transcription factor | 20 | 5.4 | 2.8 | 4.50E-03 |
Values were calculated using the Database for Annotation, Visualization and Integrated Discovery (DAVID) web-based tools [24,25]. The background gene list used to determine enrichment is all genes in the mouse genome. The P-value refers to the enrichment P-value, which is a modified Fisher's exact test. A Bonferroni multiple testing correction has been applied.
Non-synonymous single nucleotide variants unique to FVB/NJ, relative to the 17 Mouse Genomes Project strains [14], with Grantham matrix score >150
| Chromosome | Position | Ref/FVB | Gene ID | Gene name | Description | GMS |
|---|---|---|---|---|---|---|
| 9 | 119518884 | C/A | ENSMUSG00000034533 | Sodium channel, voltage-gated, type X, alpha | 205 | |
| 4 | 138523178 | G/T | ENSMUSG00000070661 | Ring finger protein 186 | 205 | |
| 19 | 13485488 | G/T | ENSMUSG00000063777 | Olfactory receptor 1469 | 205 | |
| 7 | 19600268 | T/C | ENSMUSG00000040891 | Forkhead box A3 | 194 | |
| 15 | 101542878 | C/A | ENSMUSG00000061527 | Keratin 5 | 184 | |
| 9 | 85738244 | C/T | ENSMUSG00000035274 | Trophoblast glycoprotein | 180 | |
| 7 | 112715121 | G/A | ENSMUSG00000037032 | Amyloid beta (A4) precursor protein-binding, family B, member 1 | 180 | |
| 6 | 43230658 | C/T | ENSMUSG00000033542 | Rho guanine nucleotide exchange factor (GEF) 5 | 180 | |
| 2 | 84642975 | C/T | ENSMUSG00000027078 | Ubiquitin-conjugating enzyme E2L 6 | 180 | |
| 19 | 3356468 | C/T | ENSMUSG00000024900 | Carnitine palmitoyltransferase 1a, liver | 180 | |
| 19 | 16100043 | C/T | ENSMUSG00000049247 | Ribosomal protein 37, pseudogene 1 | 180 | |
| 4 | 118859970 | C/G | ENSMUSG00000028644 | Erythroblast membrane-associated protein | 177 | |
| 1 | 152466145 | C/A | ENSMUSG00000066842 | Hemicentin 1 | 160 | |
| 19 | 13137960 | G/T | ENSMUSG00000060593 | Olfactory receptor 1454 | 159 | |
| 19 | 12566519 | C/A | ENSMUSG00000039982 | Deltex 4 homolog ( | 159 | |
| 18 | 14002581 | C/A | ENSMUSG00000024420 | Zinc finger protein 521 | 159 | |
| 9 | 38623499 | C/T | ENSMUSG00000043911 | Olfactory receptor 922 | 155 | |
| 1 | 168031022 | T/A | ENSMUSG00000026564 | Dual specificity phosphatase 27 (putative) | 152 |
The C57BL/6J reference allele and FVB/NJ allele are shown (Ref/FVB). Predicted genes with unknown function are excluded. Gene IDs, names, and descriptions are from Ensembl version 64. GMS, Grantham matrix score.
Structural variants in the FVB/NJ genome
| Deletion | Duplication | Inversion | Insertion | Deletion in gain | Deletion + insertion | Inversion + deletion or insertion | |
|---|---|---|---|---|---|---|---|
| Total | 15,358 | 137 | 67 | 14,246 | 58 | 111 | 71 |
| Genes | 280 | 109 | 1 | 20 | 2 | 0 | 4 |
Listed are the number of SVs by type, and the number of genes with at least one protein-coding exon overlapping a SV. The following SV types involve more than one SV and have been described by Yalcin et al. [15,28]: a 'deletion in gain' is a deletion that is found within a duplicated region; a 'deletion plus insertion' is a deletion that co-occurs with an insertion; and an 'inversion plus a deletion or insertion' is an inversion flanked by one or more deletions or insertions.
Figure 2SNPs, indels, and structural variants (SVs) in the FVB/NJ mouse, in two narrowed regions of the chromosome 10 . (a) The 10a locus from 1 to 7.3 Mb. (b) The 10b locus from 20.1 to 21.9 Mb. (c) Within the 10a region is the Mthdf1l gene, with non-synonymous and synonymous SNPs shown. Shown in red are non-synonymous SNPs or SNPs that affect intronic splice sites. In green are synonymous SNPs. In blue are gene models based on Ensembl version 64. Previously known FVB/NJ SNPs from dbSNP version 128 are also shown. The 10a region displayed in (a) begins at 3 Mb since the sequence from 1 bp to 3 Mb has not been assembled in the reference genome. Shown in the 10b region are the genes described in Wolfrum et al. [34] (Pde7b, Ahi1, Myb, Hbs1l, Aldh8a1, Sgk1), and additional predicted genes from Ensembl (1700020N01Rik, 1700021A07Rik, Gm5420, 4930444G20Rik, E030030I06Rik).
Non-synonymous SNPs between C57BL6/J and FVB/NJ in Ath11
| Chromosome 10 position | Ref/FVB | dbSNP ID | MGP | Gene name, Ensembl gene ID and description | Protein domain | |
|---|---|---|---|---|---|---|
| 5016217 | G/C | None | 8 | 1/4 | Spectrin repeat | |
| 5380387 | C/A | rs29315913 | 13 | 1/5 | Nuclear hormone receptor, ligand-binding, core | |
| 5977100 | A/C | None | 16 | 2/2 | None | |
| 6338741 | A/G | rs29378605 | 15 | 3/3 | Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain | |
| 6367833 | A/G | rs29320259 | 14 | 3/3 | Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain | |
| 7309964 | T/C | rs29376214 | 6 | 4/6 | ||
| 21108744 | A/G | rs49757404 | 12 | 1/2 | Aldehyde dehydrogenase domain | |
| 21787374 | C/T | rs47037014 | 8 | 1/1 | None | |
| 21787451 | A/C | rs50088182 | 9 | 1/1 | None | |
| 21864292 | C/T | rs49453000 | 8 | 1/1 | None | |
| 21868313 | T/C | None | 9 | 1/1 | None | |
| 21868315 | T/A | None | 9 | 1/1 | None |
Wolfrum et al. [34] characterized two narrowed regions, 10a and 10b, within Ath11 containing 28 genes. This table describes additional non-synonymous SNPs discovered by sequencing of the FVB/NJ genome. Listed are the reference allele and FVB/NJ allele (Ref/FVB), the dbSNP ID (build 128) for sites previously identified as polymorphic in other mouse strains, and the number of strains in the Mouse Genomes Project (MGP) with the same allele as FVB/NJ. Also listed is the number of protein-coding transcripts (Trans.) affected and the protein domain involved. Gene and protein domain models were obtained from Ensembl version 64.