| Literature DB >> 28387809 |
Georgios V Gkoutos1, Paul N Schofield2, Robert Hoehndorf3.
Abstract
The past decade has seen an explosion in the collection of genotype data in domains as diverse as medicine, ecology, livestock and plant breeding. Along with this comes the challenge of dealing with the related phenotype data, which is not only large but also highly multidimensional. Computational analysis of phenotypes has therefore become critical for our ability to understand the biological meaning of genomic data in the biological sciences. At the heart of computational phenotype analysis are the phenotype ontologies. A large number of these ontologies have been developed across many domains, and we are now at a point where the knowledge captured in the structure of these ontologies can be used for the integration and analysis of large interrelated data sets. The Phenotype And Trait Ontology framework provides a method for formal definitions of phenotypes and associated data sets and has proved to be key to our ability to develop methods for the integration and analysis of phenotype data. Here, we describe the development and products of the ontological approach to phenotype capture, the formal content of phenotype ontologies and how their content can be used computationally.Entities:
Mesh:
Year: 2018 PMID: 28387809 PMCID: PMC6169674 DOI: 10.1093/bib/bbx035
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Landscape of phenotype ontologies
| Domain | Ontology | # Classes | Used by |
|---|---|---|---|
| Human and biomedical | DO [ | 11 663 | DisGeNet [ |
| HPO [ | 15 381 | HPO Database [ | |
| International Classification of Disease version 10, Clinical Modification (ICD10CM) [ | 92 168 | Various EHR systems | |
| ICD9CM [ | 22 533 | Various EHR systems | |
| Medical Subject Headings Thesaurus [ | 261 990 | Comparative Toxicogenomic Database [ | |
| UMLS [ | SIDER [ | ||
| NCI Thesaurus [ | 118 941 | NCI, NIH multiple projects | |
| Ontology of Adverse Events [ | 5514 | ||
| Orphanet Rare Disease Ontology [ | 13 105 | Orphanet [ | |
| Read Codes Clinical Terminology version 3 [ | 140 065 | UK General Practice | |
| SNOMED-CT [ | 324 129 | Various EHR systems | |
| Animal model organism | Dictyostelium Phenotype Ontology [ | 1058 | DictyBase [ |
| DPO [ | 506 | FlyBase [ | |
| MPATH [ | 889 | PathBase [ | |
| MPO [ | 30 316 | MGI [ | |
| Worm Phenotype Ontology (WBPhenotype) [ | 2435 | WormBase [ | |
| Plants and fungi | Ascomycete Phenotype Ontology | 619 | Saccharomyces Genome Database [ |
| Flora Phenotype Ontology [ | 28 430 | African Plants Database [ | |
| FYPO [ | 9870 | PomBase [ | |
| TO [ | 1433 | iPlant Collaborative Databases [ | |
| Solanaceae PATO [ | 397 | Sol Genomics Network [ | |
| Thesaurus Of Plant traits [ | 950 | TRY Database [ | |
| Cell | Cell Microscopical Phenotype Ontology [ | 813 | Cellular Phenotype Database [ |
| Ontology for Microbial Phenotypes (OMP) [ | 1120 | Microbialphenotypesȯrg [ |
Figure 1A schematic representation of the PATO framework. Quality classes from PATO are combined with entity classes from multiple ontologies (such as GO or Uberon) to provide formal definitions for species (and sometimes domain specific) phenotype ontologies. Examples of such ontologies are depicted in the outer ring and include the HPO [118], MPO [119], Cellular Phenotype Ontology (CPO) [120], Drosophila phenotype ontology (DPO) [53], Plant Trait Ontology (TO) [117] FYPO [121] and Wormbase Phenotype Ontology [57]. This provides an interoperability layer between these ontologies and facilitates the integration of the data annotated to them in the species-specific databases around the outside edge.
Figure 2Direct and comparative phenotype statements, and conversions between them. The upper part of the figure shows how phenotype statements are made in case/control experiments, in which the comparative phenotype statement expressed the difference of an observation to an explicitly or implicitly specified control. In the bottom figure, direct observations are made to two organisms, Organism 1 and Organism 2, and the comparative phenotype statements are derived by designating one of the two organisms as control and computing the differences between the other organism and the control.
Figure 3Using phenotype similarity to understand similarity between molecular mechanisms.