| Literature DB >> 19426459 |
Tim Beck1, Hugh Morgan, Andrew Blake, Sara Wells, John M Hancock, Ann-Marie Mallon.
Abstract
BACKGROUND: Large-scale international projects are underway to generate collections of knockout mouse mutants and subsequently to perform high throughput phenotype assessments, raising new challenges for computational researchers due to the complexity and scale of the phenotype data. Phenotypes can be described using ontologies in two differing methodologies. Traditionally an individual phenotypic character has either been defined using a single compound term, originating from a species-specific dedicated phenotype ontology, or alternatively by a combinatorial annotation, using concepts from a range of disparate ontologies, to define a phenotypic character as an entity with an associated quality (EQ). Both methods have their merits, which include the dedicated approach allowing use of community standard terminology, and the combinatorial approach facilitating cross-species phenotypic statement comparisons. Previously databases have favoured one approach over another. The EUMODIC project will generate large amounts of mouse phenotype data, generated as a result of the execution of a set of Standard Operating Procedures (SOPs) and will implement both ontological approaches to capture the phenotype data generated.Entities:
Mesh:
Year: 2009 PMID: 19426459 PMCID: PMC2679402 DOI: 10.1186/1471-2105-10-S5-S2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Levels of ontology annotation of mouse phenotype data. The relationships between the levels of annotation are shown along with real annotations taken from the Hotplate SOP. a) The SOP is annotated using MP. Each parameter, representing a mouse trait, is defined using EQ. At the point of annotating individual mouse data, qualitative and quantitative parameters are handled differently. Qualitative parameters have a quality assigned to them, with a child-to-parent "is a" relationship to the parameter quality, and may be described using MP where a relevant concept exists. Quantitative parameters have a numerical value assigned to them. After comparison of the mutant line (cohort of individual mice) to the baseline data, statistically significant lines are annotated dynamically. Qualitative EQ data is annotated as being present at an increased or decreased frequency and quantitative data annotation using an increased or decreased based EQ statement, where the quality is a child of the parameter quality. In both cases, if a relevant MP term exists to define the direct phenotype it is assigned. The annotation of inferred phenotypes using MP terms is explained in the Discussion. b) Ontology terms are used to define two Hotplate SOP parameters. Example data is used to illustrate possible annotations of the mouse and the mutant strain. The annotations of the direct phenotypes allow association with an inferred phenotype.
Figure 2Relationship between qualitative parameter and data. An example dysmorphology parameter and the corresponding value options are shown. The central photo shows a mouse with a sparse distribution of coat hair. A portion of the MA ontology is shown on the left and a portion of PATO (Revision 1.118) is shown on the right (visualised used OBO-Edit [21]). The highlighted terms are used to define the coat hair distribution parameter and the resulting phenotype annotation, illustrating the relationship between the parameter quality and the phenotype quality.
Figure 3Incorporation of assay data into the annotation framework for an individual mouse. A simplified operating procedure is shown marked up using EXACT ontology concepts (left box). The phenotype and procedural data capture framework, to describe an instance of tail length, incorporates the experimental action from within the SOP where the phenotype data was obtained. Also shown is the relationship of parameter assayed_by SOP returns_value value, which is a child quality of the parameter quality qualified_by raw data.