| Literature DB >> 23293646 |
Swetlana Friedel1, Björn Usadel, Nicolaus von Wirén, Nese Sreenivasulu.
Abstract
Understanding the global abiotic stress response is an important stepping stone for the development of universal stress tolerance in plants in the era of climate change. Although co-occurrence of several stress factors (abiotic and biotic) in nature is found to be frequent, current attempts are poor to understand the complex physiological processes impacting plant growth under combinatory factors. In this review article, we discuss the recent advances of reverse engineering approaches that led to seminal discoveries of key candidate regulatory genes involved in cross-talk of abiotic stress responses and summarized the available tools of reverse engineering and its relevant application. Among the universally induced regulators involved in various abiotic stress responses, we highlight the importance of (i) abscisic acid (ABA) and jasmonic acid (JA) hormonal cross-talks and (ii) the central role of WRKY transcription factors (TF), potentially mediating both abiotic and biotic stress responses. Such interactome networks help not only to derive hypotheses but also play a vital role in identifying key regulatory targets and interconnected hormonal responses. To explore the full potential of gene network inference in the area of abiotic stress tolerance, we need to validate hypotheses by implementing time-dependent gene expression data from genetically engineered plants with modulated expression of target genes. We further propose to combine information on gene-by-gene interactions with data from physical interaction platforms such as protein-protein or TF-gene networks.Entities:
Keywords: Arabidopsis; abiotic stress; reverse engineering; stress tolerance; systems biology; yield
Year: 2012 PMID: 23293646 PMCID: PMC3533172 DOI: 10.3389/fpls.2012.00294
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Bioinformatic tools that are related to the analysis of gene regulatory networks.
| Name | Mathematical model | Description | Reference | |
|---|---|---|---|---|
| WGCNA | Weighted correlation | The R software package WGCNA is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis. | Langfelder and Horvath ( | |
| qp-Graph | Partial correlations | q-Order partial correlation graphs, or qp-graphs is useful for learning undirected graphical Gaussian Markov models from data sets in which the number of random variables | ||
| GeneReg | Linear model fitting | GeneReg is used to reconstruct time-course gene regulatory network. R package GeneReg reconstructs a gene regulatory network from short time-course gene expression data. A suitable application is the study of time-dependent biological processes such as cell cycle, cell differentiation, or causal inference | Huang et al. ( | |
| BoolNet | Boolean Networks | BoolNet or Boolean networks inference R package developed methods for synchronous, asynchronous, and probabilistic Boolean Network. It can be applied for reconstructing networks from time series and can be used for robustness analysis via perturbation (environmental or genetic) and identification and visualization of attractors in networks | Mussel et al. ( | |
| BNArray | Bayesian Network | BNArray: An R package for constructing gene regulatory networks from microarray data by using Bayesian network modeling. | Chen et al. ( | |
| GRENITS | Dynamic Bayesian network | The package GRENITS (Gene Regulatory Network Inference Using Time Series) offers four network inference statistical models using Dynamic Bayesian Networks and Gibbs Variable Selection: a linear interaction model, two linear interaction models with added experimental noise for the case where replicates are available and a non-linear interaction model | Morrissey et al. ( | |
| Minet | Mutual information | The R package | (Meyer et al., | |
| Parmigene | Mutual information | The R package parmigene (PARallel Mutual Information estimation for GEne NEtwork reconstruction) implements a mutual information estimator based on k-nearest neighbor distances that is minimally biased with respect to the other methods and uses a parallel computing paradigm to reconstruct gene regulatory networks. parmigene gives more precise results than existing softwares at strikingly lower computational costs. | (Sales and Romualdi, | |
| DTI | Mutual information | An R package including functions for inference of gene regulatory networks from microarray data. Directed information approach enables to reconstruct a directed graph. | (Kaleta et al., | |
| C3NET | Mutual information | R package Conservative cause core (C3NET) algorithm is based on mutual information and composed of two steps. The first step confers an elimination of non-significant edges, in the second step genes are only connected if their shared significant mutual information value is at least for one of these two genes maximal with respect to all other genes | Altay and Emmert-Streib ( | |
| Name | Kind of interactions network | Description | Taxon | Reference |
| CORNET | TF-to-gene | The TF tool retrieves regulatory interactions from AGRIS and from CORNET microarray data. The resulting network is represented in Cytoscape with possibilities to see localization, TAIR functional descriptions, Gene Ontology, Plant Ontology, MapMan pathways and processes, protein domains, PubMed IDs and phenotypes. Link out to other external databases by right-clicking the nodes in Cytoscape | De Bodt et al. ( | |
| AtRegNet | TF-to-gene | AtRegNet tool of AGRIS: | Yilmaz et al. ( | |
| CORNET | Protein–protein interaction | PPI tool interrogates available protein–protein interaction databases (both experimental and predicted interactions) and the AraNet probabilistic functional gene network. It includes, e.g., ArathReactome, AtPID, MINT, Yeast-2-hybrid Interactome, and so on. The resulting network is represented in Cytoscape with possibilities to see localization, TAIR functional descriptions, Gene Ontology, Plant Ontology, MapMan pathways and processes, protein domains, PubMed IDs, and phenotypes. Link out to other external databases by right-clicking the nodes in Cytoscape | De Bodt et al. ( | |
| PMN | Metabolic network | The Plant Metabolic Network (PMN) provides a broad network of plant metabolic pathway databases that contain curated information from the literature and computational analyses about the genes, enzymes, compounds, reactions, and pathways involved in primary and secondary metabolism in plants | Over 350 plant species | Mueller et al. ( |
| MetNet | Metabolic network, TF-to-gene, evidence network | The MetNet database (MetNetDB) contains integrative information on networks of metabolic and regulatory interactions. Types of interactions in MetNetDB include transcription, translation, protein modification, assembly, allosteric regulation, translocation from one subcellular compartment to another. Other fields describing the interactions are subcellular localization, confidence, directionality, references, evidence, and synonyms. Data on entities (DNA, RNA, polypeptides, protein complexes, metabolites) are derived from web databases (TAIR, GO, MapMan/GabiPD, PPDB, AMPDB, AtNoPDB, AraPerox, PLprot, BRENDA, ChEBI, PubChem, KEGG, NCI, NIST MS library), in some cases with additional annotation by experts | Sucaet and Wurtele ( | |
| AtCOECis | TF-to-gene | Transcriptional regulatory network is reconstructed here based on cis-regulatory elements and coexpression network | Vandepoele et al. ( | |
| ATTED-II | Coexpression | ATTED-II builds coexpression network allows searching coexpression network for individual target genes of interest or co-expressed network for each functional bin. Also abiotic, biotic, hormone treated gene expression data deposited for deriving coexpression network | Obayashi and Kinoshita ( | |
| Genevestigator | Coexpression | Pearson correlation coefficient is taken as the measure of similarity between genes, both for identifying co-expressed genes as well as to define the pairwise correlation between genes in the plot. This score is calculated based on log2-scaled expression data that is processed from the Genevestigator database | Zimmermann et al. ( | |
| MapMan | Visualization | MapMan brings its own hand-curated ontology which helps in exploring and understanding plant networks. The ontology aims to be redundancy-free aiding in simple graphical network visualization. | Lohse et al. ( | |
| Planet | Coexpression | Using multiple different plant species and mutual best ranked coexpression as well as domain information and MapMan terms it is possible to find conserved correlations between plants | Mutwil et al. ( | |
| Corto | Coexpression | Corto is a network building and visualization tool which comes pre-loaded with several plant specific array sets. Its prime role is network reconstruction of user supplied data sets however. CorTo applies simple correlation, partial correlation, lasso regression, and mutual information. A query centered target gene network can be visualized and explored. All nodes can be color coded based on MapMan categories | Any species where high throughput data is available | Giori unpublished. Available from |
| CORNET | Coexpression | Using one or more precompiled expression datasets the correlation between gene expression profiles will be calculated. There are possibilities to identify threshold values for acceptance of interactions. The result can be visualized as a graph in Cytoscape providing all possible gene annotations | De Bodt et al. ( | |
| AraNet | Evidence network | AraNet yields all neighbors to query genes, based on coherence of query genes, which is measured by the area under the ROC curve AUC from 0.5 to 1. Top neighbors would be good candidates for your follow-up screen, network-guided focused screen. As evidence, AraNet uses a probabilistic functional gene network of | Lee et al. ( | |
| STRING | Evidence network | STRING is a database of known and predicted protein–protein interactions. The interactions include direct (physical) and indirect (functional) associations; they are derived from genomic context, conserved coexpression, HT experiments, and text mining | 1133 organism including plants | Szklarczyk et al. ( |
| EVEX | Biomolecular interaction based on textmining | EVEX or Event Extraction is a text mining resource built on top of PubMed abstracts and PubMed Central full texts. It contains over 34 million biomolecular events among more than 67 million automatically extracted gene/protein name mentions. The text mining data has been enriched with gene normalization results, covering more than 42% of all gene/protein names. EVEX presents both direct and indirect associations between genes and proteins, enabling explorative browsing of relevant literature | Broad range of species, including plant species | Van Landeghem et al. ( |
| MetaCrop | Metabolic network | MetaCrop is a manually curated repository of high-quality data about plant metabolism, providing different levels of details from overview maps of primary metabolism to kinetic data of enzymes. It can be accessed via web, web services and an add-on to the Vanted software | It contains seven major crop plants and two model plants | Schreiber et al. ( |
Advantages and disadvantages of reverse engineering methods.
| Model | Advantages | Disadvantages |
|---|---|---|
| Boolean network | Large-scale network | Deterministic description |
| Bayesian network | Handle incomplete and noisy data | Computational complexity |
| Dynamic Bayesian network | Handling of feedback-loops and incomplete and noisy data | Computational complexity |
| Differential equations | Handling of negative feedback-loops | Computational complexity |
| Correlation analysis | Large-scale network | Dependency of accuracy on the set of thresholds |
| Mutual Information | Large-scale network | Dependency of accuracy on the set of thresholds |
Figure 1Reverse engineering strategies to simulate gene regulatory networks and reconstructed relationship at other physical interactions such as TF-to-gene, protein–protein, and metabolic networks. The application of relevant softwares and algorithms has been summarized in Table 1.
Figure 2Global overview of abiotic stress responses in roots and shoots of . Abiotic Stress Matrix for shoot (A) and root tissues (C). Values are calculated as a sum of absolute log-fold changes over all time points. The score is calculated as median over all abiotic stress conditions. Between Group Analysis for the whole transcriptome of shoot (B) and root (D). The larger distance between stress and control, the stronger the transcriptome reprograming under a given stress. Higher variance for shoots is noted for Osmotic > Salt > UV > Wounding. Higher variance for roots is noted for Salt > Osmotic > Heat.
Figure 3Gene ontology enrichment of top 100 abiotic stress-regulated genes in shoots (A) and roots (B) using AgriGO database.
Figure 4Gene regulatory network of shoots (A) and roots (B) highlights “cross-talk” between the top 100 stress-inducible genes which have been defined from Abiotic Stress Matrix. Within the shoot gene regulatory network two hub genes AT2G28400 (unknown protein) and AT1G80840 (WRKY40) and their connected sub-networks are shown in detail. Within the root gene regulatory network three hub genes (1) AT2G46400: ATWRKY46, (2) AT5G10040 (unknown protein), (3) AT1G26380 FAD-binding domain-containing protein and their connected sub-networks are shown in detail.
Figure 5Coexpression gene network and protein–protein interaction network of WRKY40 derived from CORNET database (De Bodt et al., . Within the coexpression network blue lines represent correlations of >0.9. The protein–protein interaction network is represented by black solid lines (validated protein interactions) or by dotted black lines (predicted protein–protein interactions).
Figure 6Gene coexpression network and knock-out phenotypes characterized in surrounding network of AtWRKY40 derived from PLANET database (Mutwil et al., .