| Literature DB >> 32140729 |
Constance M Smith1, James A Kadin1, Richard M Baldarelli1, Jonathan S Beal1, Olin Blodgett1, Sharon C Giannatto1, Joel E Richardson1, Martin Ringwald1.
Abstract
The Gene Expression Database (GXD), an extensive community resource of curated expression information for the mouse, has developed an RNA-Seq and Microarray Experiment Search (http://www.informatics.jax.org/gxd/htexp_index). This tool allows users to quickly and reliably find specific experiments in ArrayExpress and the Gene Expression Omnibus (GEO) that study endogenous gene expression in wild-type and mutant mice. Standardized metadata annotations, curated by GXD, allow users to specify the anatomical structure, developmental stage, mutated gene, strain and sex of samples of interest, as well as the study type and key parameters of the experiment. These searches, powered by controlled vocabularies and ontologies, can be combined with free text searching of experiment titles and descriptions. Search result summaries include link-outs to ArrayExpress and GEO, providing easy access to the expression data itself. Links to the PubMed entries for accompanying publications are also included. More information about this tool and GXD can be found at the GXD home page (http://www.informatics.jax.org/expression.shtml). Database URL: http://www.informatics.jax.org/expression.shtml.Entities:
Mesh:
Year: 2020 PMID: 32140729 PMCID: PMC7058436 DOI: 10.1093/database/baaa002
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Field labels used by data submitters to describe the field containing sample ‘age’ information
| age | age (days old) |
| Age | age_days |
| AGE | age days postnatal |
| adult age | age description |
| age and sex | age/gender |
| age (day) | age group |
| agedays | age_group |
| age days | age in days |
| age (days) | age in months |
A total of 18 field labels are included in this table. We identified at least 43 other variants.
Data content as of 28 October 2019
| 16 550 | Experiments downloaded from ArrayExpress |
| 13 428 | Experiments incompatible with GXD’s scope |
| 13 | Experiments lacking publication required to identify allele used |
| 3109 | Experiments consistent with GXD’s scope/metadata included in index |
| 2043 | WT vs mutant studies |
| 1066 | Baseline studies |
*This total includes 9443 experiments manually evaluated by GXD curators. A linear support vector classifier, a machine learning algorithm, was used to predict that a further 3985 microarray experiments were outside GXD’s scope. Manual evaluation of a subset of the predictions suggests that few, if any, relevant experiments have been missed.
Figure 1Search. Illustrated is a search for experiments studying gene expression in the skeletal muscle of dystrophin (Dmd) mutants. It takes advantage of two of the curated sample attribute fields: anatomical structure (‘skeletal muscle’) and mutant gene (‘Dmd’). Additional curated fields available for searching are developmental stage, strain and sex. Users can also do free text searching of experiment titles and descriptions, as well as search by ArrayExpress or GEO id.
Figure 3Sample table. The sample information is displayed in a pop-up table that can be accessed by using the View button (Figure 2). Samples that match the search criteria (Figure 1) are highlighted in pink. The matching samples are annotated to tissues that are ontological children of the search term skeletal muscle and carry mutations of the dystrophin (Dmd) gene.
Figure 2Search return. Pictured is the return for the search in Figure 1. Filters that allow for further refinement of the return are circled. The red arrow indicates the button to access the pop-up sample table (Figure 3; discussed below). The display also includes the annotated experimental variable(s) and study type, as well as link-outs to the data at ArrayExpress and GEO and the publication at PubMed.