POEAS: Automated Plant Phenomic Analysis Using Plant Ontology.

Khader Shameer, Mahantesha Bn Naika, Oommen K Mathew, Ramanathan Sowdhamini.

Abstract

Biological enrichment analysis using gene ontology (GO) provides a global overview of the functional role of genes or proteins identified from large-scale genomic or proteomic experiments. Phenomic enrichment analysis of gene lists can provide an important layer of information in addition to the cellular components, molecular functions, and biological processes associated with those lists. Plant phenomic enrichment analysis will be useful for designing new experiments to better understand plant systems and for interpreting genes or proteins identified from high-throughput experiments. Plant ontology (PO) is a compendium of terms that define the diverse phenotypic characteristics of plant species, including plant anatomy, morphology, and development stages. Adoption of this highly useful ontology is limited, compared with GO, because of the lack of user-friendly tools that enable the use of PO for statistical enrichment analysis. To address this challenge, we introduce the Plant Ontology Enrichment Analysis Server (POEAS) in the public domain. POEAS takes a simple list of genes as input and performs enrichment analysis using Ontologizer 2.0, providing results at two levels: enrichment results, and visualization utilities that generate publication-quality ontological graphs. POEAS also offers interactive options to specify user-defined background population sets, various multiple-testing correction methods, different enrichment calculation methods, and resampling tests to improve statistical significance. The availability of such a tool to perform phenomic enrichment analyses using plant genes as a complementary resource will permit the adoption of PO-based phenomic analysis as part of analytical workflows. POEAS can be accessed using the URL http://caps.ncbs.res.in/poeas.

Keywords:  Arabidopsis thaliana; phenomics; phenotype enrichment; plant genomics; plant ontology

Year: 2014; PMID: 25574136; PMCID: PMC4274039; DOI: 10.4137/BBI.S19057

Source DB: PubMed; Journal: Bioinform Biol Insights; ISSN: 1177-9322


Introduction

Phenomics is a recently coined term for the measurement of the phenotypic characteristics of biological entities, including the physical and biochemical traits of an organism [1,2]. A phenome is a catalog of all phenotypes compiled from an experiment or from the collective phenomic knowledge of an organism. Plant phenomics [3–7] refers to the systematic study of plant phenotypes. Ontologies, such as plant ontology (PO), play an important role as translational resources between experimental and in silico phenotyping. Ontologies can be used to capture an existing library of phenotypes and map it to a list of new entities (for example, genes, proteins, and metabolites). Biomedical ontologies have improved the unified interpretation of groups of genes (gene lists), proteins, RNA, or metabolites identified from high-throughput genomics, proteomics, transcriptomics, or metabolomics studies. Gene ontology (GO), the association of GO terms with gene products, and statistical enrichment analyses have contributed to the interpretation of gene or protein lists for more than a decade. Ontologies are currently being developed to address highly specific domains or subdomains in the biomedical knowledge universe. To illustrate this growth, a total of 329 ontologies are currently available from BioPortal, an ontology repository of the National Center for Biomedical Ontology (NCBO) [8]. Along with the steady growth of broad-spectrum and widely used ontologies, such as GO [9,10], various other biomedical ontologies are under active development [11,12]. While these resources are available as reference tools, a large subset of biomedical ontologies does not have direct association data to connect different biological entities.
Apart from the primary goal of unifying concepts, definitions, and knowledge in biomedical science, a prominent application of biomedical ontologies is enrichment analysis [13–15]. Biological enrichment analysis is a collective term for a broad family of knowledge-based statistical approaches. These approaches are designed to identify terms that are significantly over-represented among the biological molecules identified from an experiment when compared with a background distribution (annotations of genes in the genome or genes on the experimental platform). Enrichment analysis can be implemented with an ontology or an annotation repository, such as Pfam domains and Swiss-Prot annotations, to understand the functional trend of biological phenomena [6,16,17]. Ontology-based phenomic mappings have been used for human phenotypes [18,19], cellular phenotypes [20], fission yeast [21], disease annotations [22], and plants [23]. Plant phenomics has been employed to study several aspects of plants, including the phenomic impact of stress-responsive genes [6,24].
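The comparison of a study set against a background distribution can be made concrete with a small worked example. The sketch below assumes the simplest formulation, a one-sided hypergeometric test of a single term (the form usually associated with term-for-term analysis); it is not taken from the POEAS or Ontologizer source code, and the function name and toy counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(pop_total, pop_term, study_total, study_term):
    """Upper-tail hypergeometric probability P(X >= study_term).

    pop_total   -- genes in the background (population) set
    pop_term    -- background genes annotated to the term
    study_total -- genes in the study set (drawn from the background)
    study_term  -- study genes annotated to the term
    """
    upper = min(pop_term, study_total)
    denominator = comb(pop_total, study_total)
    # Sum the probability of seeing study_term or more annotated genes
    # when study_total genes are drawn without replacement.
    tail = sum(
        comb(pop_term, k) * comb(pop_total - pop_term, study_total - k)
        for k in range(study_term, upper + 1)
    )
    return tail / denominator


# Toy numbers (hypothetical): 20 background genes, 5 annotated to a term,
# and a study set of 5 genes of which 3 carry the annotation.
print(hypergeom_enrichment_p(20, 5, 5, 3))  # about 0.073
```

A small P-value here indicates that the term is seen in the study set more often than expected from the background alone; in practice one such test is run per ontology term, which is why multiple-testing correction is needed.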

Plant Phenomic Enrichment Analyses Using PO

Plant phenomics is the collective measurement of phenomes, which include the physical and biochemical traits of an organism, and the phenome of an organism can be effectively described using ontologies. PO is a compendium of terms that define the diverse phenotypic characteristics of plant species in two categories: plant anatomy and morphology, and plant development stages. PO definitions and related annotations are available for several model plant genomes and are integrated into several key plant genome databases, such as The Arabidopsis Information Resource (TAIR), NASC/NASCArrays [25], Gramene/GrameneMart [26], Sol Genomics Network (SGN) [27], and MaizeGDB [28]. Additional terms, annotations, and genomes are being added to PO through the collective effort of experimental biologists, computational biologists, and biocurators [29]. However, tools designed specifically to utilize the growing plant phenomic knowledgebase are required to leverage it in large-scale plant phenomic studies. Currently, generic meta-analysis tools, such as DAVID [30] or PANTHER [31], do not provide enrichment analyses using PO. A tool reported by Yi et al. [29] provides enrichment analysis using PO terms, but it does not offer options to select enrichment methods or multiple-testing correction methods, or to visualize results as directed acyclic graphs. Recently, while performing a large-scale comparative analysis of stress-responsive genes (n = 3091) in Arabidopsis thaliana [6], we encountered this challenge and adapted a widely used GO term enrichment analysis tool (Ontologizer 2.0) to perform phenomic enrichment analyses using genes from STIFDB2 [17]. In this manuscript, we describe a web-based version of this utility, the Plant Ontology Enrichment Analysis Server (POEAS), which has been developed and provided in the public domain for phenomic analyses.

Materials and Methods

POEAS is currently available for A. thaliana; additional genomes will be added in future updates. The latest versions of the PO files (.obo and .assoc) and TAIR annotations are fetched periodically from the PO and TAIR FTP servers, respectively. Currently, POEAS accepts lists of gene names, locus names, or TAIR identifiers (IDs) as input data. The POEAS web interface (Fig. 1) is developed using JavaScript, HTML, and CSS. Enrichment analysis is implemented using Ontologizer 2.0, a biomedical ontology enrichment analysis tool that offers multiple options for the enrichment method and statistical approach. The following multiple-testing correction methods are available in the current version of POEAS: Bonferroni, Bonferroni–Holm, Benjamini–Hochberg, Benjamini–Yekutieli, Westfall and Young step-down, and Westfall and Young single-step. An option is also provided to run enrichment analyses without multiple-testing correction, to test potential enrichment in small gene lists. Six enrichment calculation methods are available in the current version of POEAS: Model-Based Gene-Set Enrichment Analyses (MGSA) [32], Parent–Child-Intersection, Parent–Child-Union [33], Term-For-Term [13], Topology-Elim, and Topology-Weighted [33,34]. On the backend, the server uses a scheduler script to retrieve updated PO annotations and associations. POEAS also offers interactive options to specify user-defined background population sets, various multiple-testing correction methods, different enrichment calculation methods, and resampling tests to improve statistical significance (Fig. 1).

Figure 1

Web interface of POEAS. (A) Options to input list of genes identified from an experiment (microarray, next-generation sequencing, proteomics, etc.) and background list of genes from the study (for example, list of genes in a microarray, genes in a given genome, etc.). (B) Option to select multiple-testing correction method. (C) Option to select enrichment calculation method. (D) Option to select resampling steps for multiple-testing corrections.
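To illustrate how three of the multiple-testing corrections named in Materials and Methods adjust a list of raw P-values, a minimal sketch follows. It uses the textbook definitions of the Bonferroni, Bonferroni–Holm, and Benjamini–Hochberg procedures; it is not the Ontologizer implementation, the function names are hypothetical, and the resampling-based Westfall and Young methods are not covered.

```python
def bonferroni(pvals):
    """Multiply each P-value by the number of tests, capped at 1."""
    m = len(pvals)
    return [min(1.0, p * m) for p in pvals]


def holm(pvals):
    """Bonferroni-Holm step-down adjustment (controls family-wise error)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])  # ascending P-values
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):
        # The k-th smallest P-value is multiplied by (m - k + 1);
        # the running maximum keeps the adjusted values monotone.
        running_max = max(running_max, min(1.0, (m - rank) * pvals[i]))
        adjusted[i] = running_max
    return adjusted


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg step-up adjustment (controls false discovery rate)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for pos, i in enumerate(order):
        rank = m - pos  # 1-based rank of this P-value in ascending order
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


raw = [0.01, 0.04, 0.03, 0.005]  # hypothetical raw P-values for four terms
print(bonferroni(raw))
print(holm(raw))
print(benjamini_hochberg(raw))
```

Bonferroni is the most conservative of the three, which is consistent with its use for the stringent use-case reported below; Benjamini–Hochberg trades family-wise error control for more power on long term lists.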

Web server construction, the application features, and performance of POEAS

POEAS provides a web platform for performing enrichment analyses of PO terms using genes from A. thaliana. The user can submit a list of differentially expressed gene IDs from expression profiling (RNA-Seq or microarray experiments). Where available, a list of background genes tested in the experiment can also be provided. The user can then select the multiple-testing correction method, the enrichment calculation method, and the number of resampling steps for the enrichment analysis (Fig. 1). A successful POEAS run provides tables of the enriched PO terms associated with the gene list, together with a visualization of the enriched terms in a PO tree diagram. Files are also provided to download the enrichment results, annotation tables, and PO diagrams in SVG format. The downloadable files can be used to filter the associated PO terms, and the genes associated with each PO term, according to user requirements (Fig. 2).

Figure 2

Features of POEAS. (A) Results table from a phenomic enrichment analysis using POEAS. The table lists the PO ID, PO term (name), and P-values, both unadjusted and adjusted by the selected multiple-testing correction method (p.adjusted). (B) Directed acyclic graph of terms associated with the input gene list. (C) Options to download data in various formats after a phenomic enrichment analysis using POEAS.
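As an illustration of how a downloaded results table might be filtered, the sketch below reads a tab-separated table and keeps the terms whose adjusted P-value passes a threshold. The file layout is an assumption: the column names `ID`, `name`, and `p.adjusted` are modeled on the fields mentioned in the Figure 2 caption and may not match the actual download exactly. The three sample rows reuse adjusted P-values reported in Table 1.

```python
import csv
import io


def filter_enriched(tsv_text, alpha=0.05, p_column="p.adjusted"):
    """Return rows whose adjusted P-value is <= alpha, most significant first.

    Column names are assumptions about the downloaded table layout.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    hits = [row for row in reader if float(row[p_column]) <= alpha]
    return sorted(hits, key=lambda row: float(row[p_column]))


# Small in-memory stand-in for a downloaded file (assumed layout).
SAMPLE_TSV = "\n".join(
    "\t".join(fields)
    for fields in [
        ("ID", "name", "p.adjusted"),
        ("PO:0025023", "collective phyllome structure", "2.4E-02"),
        ("PO:0020030", "Cotyledon", "8.9E-31"),
        ("PO:0000025", "root tip", "1.3E-04"),
    ]
)

for row in filter_enriched(SAMPLE_TSV, alpha=1e-3):
    print(row["ID"], row["name"], row["p.adjusted"])
```

For a real download, the text could be read from the file with `open(path).read()` and passed to the same function after checking the header names.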

A use-case for POEAS: phenomic features of stress-responsive genes upregulated by abscisic acid (ABA)

POEAS can be used for the phenomic inference of genes from different types of experiments. To illustrate its application, we discuss a use-case here. We obtained 700 A. thaliana genes responsive to ABA stress from the Stress Responsive Transcription Factor Database, version 2 (STIFDB2); each is targeted by one or more stress-responsive transcription factors [6,17,35]. This list of 700 TAIR locus IDs was used as input, with the multiple-testing correction method set to “Bonferroni,” the enrichment calculation method set to “Term-For-Term,” and the resampling steps set to “1000.” The output from this analysis provided extensive information on the plant phenotypic characteristics represented by these genes. Phenomic analytics revealed that a subset of genes influences plant phenotypes at multiple levels of plant structure development stages (temporal) and plant anatomy. A total of 65 plant anatomy terms (Table 1) and 20 temporal terms (Table 2) were enriched at a significance threshold of P = 0.05 (Bonferroni corrected). The most significant terms associated with genes that respond to ABA stress treatments include “cotyledon,” “pollen,” “microgametophyte,” and “pollen sac.” ABA is a key regulatory plant hormone that mediates various physiological processes, including seed dormancy, plant growth, and the secondary stress response to various abiotic stressors, such as drought, cold, light, and temperature. Increased levels of ABA have been used to replicate environmental stress in the laboratory setting [36–38]. Biological and functional term enrichment analyses of the 700 genes responsive to ABA treatment provided insights into the key biological processes and molecular functions mediated by the genes [6,16,17]. However, such analyses did not provide insights into the plant-specific anatomical or developmental regions where the ABA-responsive genes were localized.
PO-based enrichment analyses provided information on sets of genes that are enriched in different anatomical or developmental regions. This information can help plant or crop biologists design experiments that target a specific anatomical or developmental region and further analyze its role in stress response, tolerance, and adaptation [39,40].

Table 1

PO (anatomy/plant anatomical entity) terms associated with ABA responsive genes in A. thaliana identified using POEAS.

PO ID       PO TERM                          P-VALUE+
PO:0020030  Cotyledon                        8.9E-31
PO:0025281  Pollen                           1.3E-27
PO:0025280  Microgametophyte                 1.3E-27
PO:0025277  Pollen sac                       1.4E-27
PO:0025202  Microsporangium                  2.9E-27
PO:0025094  Sporangium                       3.6E-27
PO:0009007  portion of plant tissue          6.9E-26
PO:0009002  plant cell                       1.5E-25
PO:0020038  Petiole                          2.0E-22
PO:0025066  Stalk                            3.9E-22
PO:0025139  phyllome apex                    9.3E-22
PO:0020137  leaf apex                        9.3E-22
PO:0008019  leaf lamina base                 2.1E-21
PO:0000230  inflorescence meristem           2.3E-21
PO:0020039  leaf lamina                      2.5E-21
PO:0025060  Lamina                           2.5E-21
PO:0000293  guard cell                       8.3E-21
PO:0000013  cauline leaf                     9.3E-21
PO:0009013  meristem                         1.0E-20
PO:0002000  stomatal complex                 3.4E-20
PO:0025165  shoot epidermal cell             6.1E-20
PO:0000005  cultured plant cell              1.1E-18
PO:0000004  in vitro plant structure         1.2E-18
PO:0006035  shoot epidermis                  1.7E-18
PO:0009061  Androecium                       9.2E-18
PO:0009029  Stamen                           9.2E-18
PO:0009025  vascular leaf                    2.2E-17
PO:0004013  epidermal cell                   3.2E-17
PO:0025034  Leaf                             4.7E-17
PO:0009027  Megasporophyll                   6.6E-16
PO:0009030  Carpel                           6.6E-16
PO:0025195  pollen tube cell                 7.1E-16
PO:0005679  Epidermis                        1.3E-15
PO:0005005  shoot internode                  1.8E-15
PO:0020142  stem internode                   1.8E-15
PO:0020100  hypocotyls                       1.8E-15
PO:0009062  Gynoecium                        8.1E-14
PO:0009001  Fruit                            1.5E-13
PO:0009028  Microsporophyll                  2.3E-13
PO:0009081  inflorescence branch             1.4E-11
PO:0009052  Pedicel                          1.4E-11
PO:0020122  inflorescence axis               1.8E-11
PO:0009010  Seed                             2.4E-11
PO:0000037  shoot apex                       2.9E-11
PO:0025001  cardinal organ part              3.1E-10
PO:0009026  Sporophyll                       1.4E-09
PO:0005052  plant callus                     1.7E-09
PO:0009009  plant embryo                     2.2E-09
PO:0009005  Root                             3.4E-09
PO:0025025  root system                      6.0E-09
PO:0009047  Stem                             9.4E-09
PO:0009049  Inflorescence                    1.3E-08
PO:0009059  Corolla                          1.5E-08
PO:0009032  Petal                            1.5E-08
PO:0009060  Calyx                            1.1E-07
PO:0009031  Sepal                            1.1E-07
PO:0000084  plant sperm cell                 7.3E-07
PO:0025006  Gamete                           3.9E-06
PO:0009058  Perianth                         1.0E-05
PO:0025022  collective leaf structure        1.1E-05
PO:0000003  whole plant                      1.1E-05
PO:0006001  Phyllome                         2.0E-05
PO:0000025  root tip                         1.3E-04
PO:0025029  shoot axis                       9.2E-04
PO:0025023  collective phyllome structure    2.4E-02

Note: + indicates Bonferroni-adjusted P-values.

Table 2

PO (temporal/plant structure development stage) terms associated with ABA responsive genes in A. thaliana identified using POEAS.

PO ID       PO TERM                          P-VALUE+
PO:0007095  LP.08 eight leaves visible       6.9E-17
PO:0007123  LP.06 six leaves visible         2.0E-15
PO:0007064  LP.12 twelve leaves visible      3.3E-14
PO:0007103  LP.10 ten leaves visible         7.8E-13
PO:0007098  LP.02 two leaves visible         2.8E-12
PO:0001017  M germinated pollen stage        5.2E-12
PO:0001054  4 leaf senescence stage          4.1E-11
PO:0001050  leaf development stages          4.8E-11
PO:0007131  seedling development stage       5.9E-11
PO:0001007  pollen developmental stages      9.3E-11
PO:0007605  androecium developmental stages  9.7E-11
PO:0007134  A vegetative growth              7.5E-10
PO:0007033  whole plant growth stage         9.2E-10
PO:0001016  L mature pollen stage            1.9E-09
PO:0007115  LP.04 four leaves visible        2.9E-08
PO:0007133  leaf production                  3.6E-06
PO:0007112  1 main shoot growth              3.8E-06
PO:0001185  C globular stage                 5.2E-04
PO:0001081  F mature embryo stage            1.8E-03
PO:0004507  D bilateral stage                5.5E-03

Note: + indicates Bonferroni-adjusted P-values.

The server can also be used for phenomic interpretation of Arabidopsis gene lists from a wide array of experimental methods, including gene expression analysis using microarrays, transcriptomic profiling using next-generation sequencing technologies, and differential abundance analysis using proteomic profiling technologies.

Discussion

A large number of high-throughput resources offer information on genes in plant genomics. However, there is currently no standard tool that integrates PO with GO data to conveniently analyze large sets of genes of interest. We report the development and availability of POEAS, a web server that automatically connects PO and GO for gene products of A. thaliana. We will soon update this server for other plant genomes as well. Starting from a list of gene names, TAIR identifiers, or locus names, the user can obtain these connections through enrichment analysis, together with publication-quality visualization outputs. The user can also include additional layers of information, such as a background dataset; select statistical options, such as Bonferroni correction; and apply resampling to improve plant phenomic enrichment analyses.

Conclusion

We have designed a public web server called POEAS for automated phenomic enrichment analyses of the genes of A. thaliana. As phenomic analyses are gaining interest in the plant community, the availability of POEAS should enable the use of phenomic enrichment as a routine analytical step in automated and custom annotation workflows.

References (10 of 40 shown)

1.  Bettina Berger, Bas de Regt, Mark Tester. Trait dissection of salinity tolerance with plant phenomics. Methods Mol Biol, 2012.

2.  Sebastian Bauer, Steffen Grossmann, Martin Vingron, Peter N Robinson. Ontologizer 2.0--a multifunctional tool for GO term enrichment analysis and data exploration. Bioinformatics, 2008-05-29.

3.  Mahantesha Naika, Khader Shameer, Ramanathan Sowdhamini. Comparative analyses of stress-responsive genes in Arabidopsis thaliana: insight from genomic data mining, functional enrichment, pathway analysis and phenomics. Mol Biosyst, 2013-05-03.

4.  Aravind Subramanian, Pablo Tamayo, Vamsi K Mootha, Sayan Mukherjee, Benjamin L Ebert, Michael A Gillette, Amanda Paulovich, Scott L Pomeroy, Todd R Golub, Eric S Lander, Jill P Mesirov. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A, 2005-09-30.

5.  Wanneng Yang, Lingfeng Duan, Guoxing Chen, Lizhong Xiong, Qian Liu. Plant phenomics and high-throughput phenotyping: accelerating rice functional genomics using multidisciplinary technologies (review). Curr Opin Plant Biol, 2013-04-08.

6.  Jaturon Harnsomburana, Jason M Green, Adrian S Barb, Mary Schaeffer, Leszek Vincent, Chi-Ren Shyu. Computable visually observed phenotype ontological framework for plants. BMC Bioinformatics, 2011-06-24.

7.  Lynn Marie Schriml, Cesar Arze, Suvarna Nadendla, Yu-Wei Wayne Chang, Mark Mazaitis, Victor Felix, Gang Feng, Warren Alden Kibbe. Disease Ontology: a backbone for disease semantic integration. Nucleic Acids Res, 2011-11-12.

8.  Laurel Cooper, Ramona L Walls, Justin Elser, Maria A Gandolfo, Dennis W Stevenson, Barry Smith, Justin Preece, Balaji Athreya, Christopher J Mungall, Stefan Rensing, Manuel Hiss, Daniel Lang, Ralf Reski, Tanya Z Berardini, Donghui Li, Eva Huala, Mary Schaeffer, Naama Menda, Elizabeth Arnaud, Rosemary Shrestha, Yukiko Yamazaki, Pankaj Jaiswal. The plant ontology as a tool for comparative plant anatomy and genomic analyses. Plant Cell Physiol, 2012-12-05.

9.  Mahantesha Naika, Khader Shameer, Oommen K Mathew, Ramanjini Gowda, Ramanathan Sowdhamini. STIFDB2: an updated version of plant stress-responsive transcription factor database with additional stress signals, stress-responsive transcription factor binding sites and stress-responsive genes in Arabidopsis and rice. Plant Cell Physiol, 2013-01-10.

10.  Xin Yi, Zhou Du, Zhen Su. PlantGSEA: a gene set enrichment analysis toolkit for plant community. Nucleic Acids Res, 2013-04-30.
