| Literature DB >> 29186578 |
Laurel Cooper1, Austin Meier1, Marie-Angélique Laporte2, Justin L Elser1, Chris Mungall3, Brandon T Sinn4, Dario Cavaliere4, Seth Carbon3, Nathan A Dunn3, Barry Smith5, Botong Qu6, Justin Preece1, Eugene Zhang6, Sinisa Todorovic6, Georgios Gkoutos7, John H Doonan8, Dennis W Stevenson4, Elizabeth Arnaud2, Pankaj Jaiswal1.
Abstract
The Planteome project (http://www.planteome.org) provides a suite of reference and species-specific ontologies for plants and annotations to genes and phenotypes. Ontologies serve as common standards for semantic integration of a large and growing corpus of plant genomics, phenomics and genetics data. The reference ontologies include the Plant Ontology, Plant Trait Ontology and the Plant Experimental Conditions Ontology developed by the Planteome project, along with the Gene Ontology, Chemical Entities of Biological Interest, Phenotype and Attribute Ontology, and others. The project also provides access to species-specific Crop Ontologies developed by various plant breeding and research communities from around the world. We provide integrated data on plant traits, phenotypes, and gene function and expression from 95 plant taxa, annotated with reference ontology terms. The Planteome project is developing a plant gene annotation platform; Planteome Noctua, to facilitate community engagement. All the Planteome ontologies are publicly available and are maintained at the Planteome GitHub site (https://github.com/Planteome) for sharing, tracking revisions and new requests. The annotated data are freely accessible from the ontology browser (http://browser.planteome.org/amigo) and our data repository.Entities:
Mesh:
Year: 2018 PMID: 29186578 PMCID: PMC5753347 DOI: 10.1093/nar/gkx1152
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Planteome reference ontologies and vocabularies
| Ontology name | Knowledge domain | Source URL |
|---|---|---|
| Plant Ontology (PO) | plant structures and developmental stages |
|
| Plant Trait Ontology (TO) | plant traits |
|
| Plant Experimental Conditions Ontology (PECO) | treatments and growth conditions used in plant science experiments |
|
| Gene Ontology (GO) | molecular functions, biological processes, cellular components |
|
| Phenotypic Qualities Ontology (PATO) | qualities and attributes |
|
| Chemical Entities of Biological Interest (ChEBI) | molecular entities of biological interest focusing on ‘small’ chemical compounds |
|
| Evidence and Conclusion Ontology (ECO) | evidence types for supporting conclusions in scientific research |
|
| Planteome NCBI Taxonomy* | taxonomic hierarchy |
|
*The Planteome NCBI Taxonomy is a ‘slice’ or portion of the NCBI taxonomy file, with only those terms needed to create annotation to plants in the Planteome database. It is converted to an OWL file for loading into the Planteome Database.
Planteome reference ontologies cover a range of knowledge domains and are used to annotate plant genomics and phenomics data.
Planteome database contents by bioentity type and number of annotations
| Bioentity type | # Unique bioentities | # Ontology annotations |
|---|---|---|
| protein | 1 674 962 | 14 984 245 |
| germplasm | 161 858 | 4 091 394 |
| gene_model | 35 988 | 1 501 291 |
| mRNA | 59 442 | 307 995 |
| gene | 42 995 | 222 205 |
| QTL | 13 873 | 49 296 |
| gene_product | 10 072 | 44 905 |
| Noncoding RNAs (tRNA, miRNA, snoRNA, rRNA, snRNA) | 1475 | 4786 |
| Total number of unique bioentities and annotations | 2 000 665 | 21 206 117 |
Each bioentity (gene, QTL, germplasm, etc.) may have more than one ontology annotation from the reference ontologies.
Total number of annotations present in the Planteome database supported by the respective evidence type
|
|
Planteome database annotations by source database
| Database source | URL | Annotations total# |
|---|---|---|
| Ensembl Plants |
| 13 436 306 |
| Genetic Resources Information Management System (GRIMS) |
| 2 978 495 |
| Maize Genetics and Genomics Database (MaizeGDB) |
| 1 520 941 |
| Germplasm Resources Information Network (GRIN) |
| 1 031 431 |
| Planteome |
| 824 309 |
| The Arabidopsis Information Resource (TAIR) |
| 736 816 |
| Gramene |
| 247 349 |
| The |
| 224 640 |
| The Rice Annotation Project Database (RAP-DB) |
| 73 952 |
| The International Rice Informatics Consortium (IRIC) |
| 54 727 |
| Genome Database for Rosaceae (GDR) |
| 38 519 |
| Sol Genomics Network (SGN) |
| 20 282 |
| Jaiswal_lab |
| 9561 |
| Grape Genome Database (CRIBI_Vitis) |
| 3410 |
| SoyBase and the Soybean Breeders Toolbox |
| 2472 |
| The European Arabidopsis Stock Centre (NASC) |
| 1897 |
| The Global Gateway to Genetic Resources (Genesys-pgr) |
| 389 |
| AgBase |
| 262 |
| GenBank |
| 210 |
| Legume Information System (LIS) |
| 146 |
| National Center for Biotechnology Information (NCBI_gi) |
| 3 |
| Total number of annotations | 21 206 117 |
Figure 1.An overview of the Planteome ontology and data annotation browser. (A) The drill-down browser allows users to explore the ontology hierarchy and the associated annotation data. Gray circles next to the ontology term names show the number of bioentities annotated to that ontology term either directly or accumulated indirectly from its children terms guided by the ontology tree and the term–term relationship types (B) Bioentities can be filtered by type and source taxon by selecting the red (exclude from search) or green (restrict search to) boxes on the left hand side. (C) Term information window appears if one clicks on an ontology term and displays the alphanumeric identifier, term name, definition, ontology source, synonyms and alternate IDs (if any). (D) The term detail page can be accessed by clicking on the term name in the popup window, with additional information and links to direct and indirect annotations. (E) A full list of all the bioentities associated with the selected term can be opened by selecting the ‘Retrieve Bioentities’ link in the popup box. (F) Free text search box. (G) Faceted search menu.
Figure 2.Faceted Searches for (A) Ontology terms, (B) Bioentities or (C) Annotations. Results can be filtered using the drop down menus on the left hand sides of each view.
Planteome ontologies are integrated into genomics platforms
| Site name | link | comments |
|---|---|---|
| Annotare |
| Array Express experiment submission tool- samples may be tagged with PO terms during submission process |
| Arapheno |
| Database of Arabidopsis thaliana phenotypes can be browsed using TO or PECO |
| Arabidopsis Information Portal (Araport) |
| Search in ThaleMine by PO or GO terms |
| Brassica Information Portal (BIP) |
| TO and PO terms can be used to search population and trait scoring information related to the Brassica breeding community |
| BRENDA Enzyme Database |
| PO, TO and EO are on their Ontology Explorer |
| Gramene Archive site |
| Browse PO, TO, PECO and GO on the Ontology browser |
| Gramene Biomart |
| Can filter by PO and GO |
| Grape Genome Database Interface |
| Grape genes are annotated to PO terms |
| MAize Gene expressIon Compendium |
| Publicly available microarray data from Gene Expression Omnibus (GEO), and ArrayExpress, annotated with PO terms |
| MaizeGDB |
| PO annotations are listed on Gene pages |
| Maize Cell Genomics Database (MAGIC) |
| Maize images tagged with PO terms |
| Manually Curated Database of Rice Proteins |
| Browse annotated data by PO, PECO, TO or GO term |
| Metabolomic Repository Bordeaux (MeRy-B) |
| Plant metabolomics platform database of Nuclear Magnetic Resonance metabolic profiles, browse by PO and PECO |
| The Compositae Genome Project (CGP) |
| Annotated data from lettuce and sunflower, can be browsed by PO hierarchy |
| Oryzabase PO site |
| Japanese version of PO- plant structure terms translated to Japanese |
| Phenopsis DB |
| Arabidopsis Phenotype database, annotated with PO terms |
| Plant Ontology Enrichment Analysis Server (POEAS) |
| Plant phenomic analysis using PO terms based on genes from Arabidopsis thaliana |
| RIKEN Arabidopsis Genome Encyclopedia (RARGEII) |
| Search mutant lines by phenotypes- use PO or PATO |
| Solanaceae Genome network (SGN) |
| Browse and search by PO, GO, PATO terms |
| SuperFamily Browser |
| Database of structural and functional annotation of protein domains and genomes- can browse by PO hierarchy |
| Virtual plant |
| Browse the PO to see Arabidopsis genes annotated to that term |
| The Arabidopsis Information Resource (TAIR) |
| Browse by GO and PO ontology terms, annotation data can be downloaded |
| Wheat Data Interoperability Guidelines |
| Use of ontologies recommended by the Wheat Data Interoperability Working Group, of the Research Data Alliance |
| Cross Species Plant Phenotype Network |
| Results of the analysis conducted in the frame of the Plant Phenotype Pilot Project study with annotation files from six plant species ( |
Figure 3.A view of a model under development in Planteome Noctua. Planteome Noctua (http://noctua.planteome.org/) is a web-based tool for collaborative curation and gene annotation supported by published literature or empirical data. Individuals from the reference ontologies are linked to one another through relationships and these assertions are supported by an evidence code from the Evidence and Conclusion Ontology. Once the model is complete, the information is saved and can be exported as a formatted file which can be processed to add the information to the database.
Figure 4.A view of the ontology hierarchy around Trait Ontology term plant height (TO:0000207). Crop Ontology (CO) terms for plant height from the lentil, wheat, rice and cassava ontologies are mapped to the Trait Ontology term for data integration.