| Literature DB >> 30407599 |
Carol J Bult1, Judith A Blake1, Cynthia L Smith1, James A Kadin1, Joel E Richardson1.
Abstract
The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the community model organism genetic and genome resource for the laboratory mouse. MGD is the authoritative source for biological reference data sets related to mouse genes, gene functions, phenotypes, and mouse models of human disease. MGD is the primary outlet for official gene, allele and mouse strain nomenclature based on the guidelines set by the International Committee on Standardized Nomenclature for Mice. In this report we describe significant enhancements to MGD, including two new graphical user interfaces: (i) the Multi Genome Viewer for exploring the genomes of multiple mouse strains and (ii) the Phenotype-Gene Expression matrix which was developed in collaboration with the Gene Expression Database (GXD) and allows researchers to compare gene expression and phenotype annotations for mouse genes. Other recent improvements include enhanced efficiency of our literature curation processes and the incorporation of Transcriptional Start Site (TSS) annotations from RIKEN's FANTOM 5 initiative.Entities:
Year: 2019 PMID: 30407599 PMCID: PMC6323923 DOI: 10.1093/nar/gky1056
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data for which MGD serves as an authoritative source
| Data type | Description |
|---|---|
| Unified mouse genome feature catalog | MGD integrates predictions from Gencode and NCBI to generate a single, comprehensive catalog |
| Gene Ontology (GO) annotations for mouse | MGD expertly curates data from literature and integrates from others |
| Mouse Phenotype annotations | MGD expertly curates data from literature and integrates from large scale projects |
| Mouse models of human disease | MGD expertly curates mouse models of human disease using terms from the Disease Ontology |
| Gene to nucleotide sequence association | MGD collaborates with NCBI and Gencode |
| Gene to protein sequence association | MGD collaborates with UniProt and Protein Ontology |
| Mammalian Phenotype (MP) Ontology | MGD develops and distributes MP |
| Symbols, names, and stable accession identifiers for genes, alleles and mouse strains | MGD implements standards set by the International Committee on Standardized Genetic Nomenclature in Mice and coordinates with human and rat gene nomenclature committees |
Summary of MGD content September 2017–2018
| Data type | 2017 | 2018 |
|---|---|---|
| Genes and genome features with nucleotide sequence data | 47 693 | 49 244 |
| Genes with protein sequence data | 24 317 | 24 408 |
| Mouse genes with human orthologs | 17 089 | 17 094 |
| Mouse genes with rat orthologs | 18 509 | 18 512 |
| Genes with GO annotations | 24 502 | 24 581 |
| Total number of GO annotations | 312 109 | 316 240 |
| Mutant alleles in mice | 51 378 | 56 254 |
| Genes with mutant alleles in mice | 12 401 | 13 455 |
| QTL records | 6257 | 6605 |
| Genotypes with phenotype annotation (MP) | 60 951 | 62 551 |
| Total number of MP annotations | 315 657 | 326 292 |
| Mouse models (genotypes) associated with human diseases | 6027 | 6374 |
| References in the MGD bibliography | 237 578 | 258 926 |
Figure 1.A screenshot of MGD’s Multiple Genome Viewer showing the display of genome annotations across multiple strains of mice. (A) Users may select one or more genomes to be displayed. (B) Equivalent genome features across the strains are highlighted by ‘swim lanes’ when a user clicks on one or more genome features. (C) Genome feature types (protein coding gene, pseudogene, etc.) are indicated by color; classes of genome features can be toggled on and off in the display.
Figure 2.Screenshot of the Gene Expression-Phenotype Matrix. The first column (gold highlight) summarizes the wild-type expression pattern of the Pax3 gene. The color of matrix cells in the column indicates the type and number of expression annotations for each tissue; the conventions are defined in the matrix legend (inset). Genotype summary data associated with alleles of Pax3 are displayed in the adjacent columns. The tissues where each mutation/genotype has phenotypic effects are indicated by the presence of colored matrix cells. Cells with an ‘N’ indicate that an expected abnormality was not detected. A red exclamation point indicates the phenotype is affected by changes in mouse strain background. Clicking on blue toggles next to term names expands and collapses the anatomy vocabulary tree. Annotation details are displayed when users click in the cells of the matrix.