| Literature DB >> 31696236 |
Molly A Bogue1, Vivek M Philip1, David O Walton1, Stephen C Grubb1, Matthew H Dunn1, Georgi Kolishovski1, Jake Emerson1, Gaurab Mukherjee1, Timothy Stearns1, Hao He1, Vinita Sinha1, Beena Kadakkuzha1, Govindarajan Kunde-Ramamoorthy1, Elissa J Chesler1.
Abstract
The Mouse Phenome Database (MPD; https://phenome.jax.org) is a widely accessed and highly functional data repository housing primary phenotype data for the laboratory mouse accessible via APIs and providing tools to analyze and visualize those data. Data come from investigators around the world and represent a broad scope of phenotyping endpoints and disease-related traits in naïve mice and those exposed to drugs, environmental agents or other treatments. MPD houses rigorously curated per-animal data with detailed protocols. Public ontologies and controlled vocabularies are used for annotation. In addition to phenotype tools, genetic analysis tools enable users to integrate and interpret genome-phenome relations across the database. Strain types and populations include inbred, recombinant inbred, F1 hybrid, transgenic, targeted mutants, chromosome substitution, Collaborative Cross, Diversity Outbred and other mapping populations. Our new analysis tools allow users to apply selected data in an integrated fashion to address problems in trait associations, reproducibility, polygenic syndrome model selection and multi-trait modeling. As we refine these tools and approaches, we will continue to provide users a means to identify consistent, quality studies that have high translational relevance.Entities:
Mesh:
Year: 2020 PMID: 31696236 PMCID: PMC7145612 DOI: 10.1093/nar/gkz1032
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Scatterplot and Correlations tool. Below the diagonal, female (red) and male (blue) data points are seen along with regression lines in thumbnail scatterplots. Above the diagonal in cells corresponding to scatterplots, positive correlations are in blue and negative correlations are in red. Pearson correlation coefficients (r) and P-values are indicated for each associated scatterplot. Clicking on the measure name along the diagonal will take users to a plot of that measure with summary statistics (not shown). Clicking on any other cell above or below the diagonal will take users to a larger, more detailed scatterplot with various viewing options (not shown).
Figure 2.Side-by-side strain comparison view of data in a series. Strain-specific phenotypic profiles and trends in the data are made obvious in this plot type, where a repeated measure is plotted over time.
Figure 3.Diversity Outbred mice with eight founder inbred strains. The histogram shows the distribution of DO mice for this measure, while means and standard errors of the founder strains are shown above the histogram. Strains are color-coded based on community standards (see plot legend).
Figure 4.GxL Replicability Adjuster Tool: Comparison Plot. Differences of strain means are plotted for pairwise comparisons (see abbreviated strain name along the y-axis). Inner segments are unadjusted confidence intervals while the outer segments indicate the GxL adjusted confidence intervals. Orange indicates significant results (P≤ 0.05) while blue indicates non-significant.
Figure 5.Polygenic Syndromic Model Selection Tool. User-selected measures are analyzed simultaneously for multivariate outliers. Results are presented in a 2D plot where outliers are indicated below a red cut-off line. The tool allows the user to select strains of interest (red box) whereby a visualization of results appears below the plot (see inset). The table is color-coded so that users can quickly identify strains of interest based on their phenotypic profiles across the measures.
Figure 6.GWAS tool based on PyLMM. A Manhattan plot of the results is provided (top panel). Data points (SNPs) can be selected (red box) and viewed in a searchable and sortable table (bottom panel). Chromosome, location, dbSNP functional annotation, rs (reference SNP accession) number, alleles and PyLMM output data are available in the table. Users can gather genes or rs numbers in a list to use as input for external research applications.