Literature DB >> 19321735

In silico analysis of promoter regions from cold-induced genes in rice (Oryza sativa L.) and Arabidopsis thaliana reveals the importance of combinatorial control.

Angelica Lindlöf1, Marcus Bräutigam, Aakash Chawade, Olof Olsson, Björn Olsson.   

Abstract

MOTIVATION: Cold acclimation involves a number of different cellular processes that together increase the freezing tolerance of an organism. The DREB1/CBFs are transcription factors (TFs) that are prominent in the regulation of cold responses in Arabidopsis thaliana, rice and many other crops. We investigated if the expression of DREB1/CBFs and co-expressed genes relies on combinatorial control by several TFs. Our results support this notion and indicate that methods for studying the regulation of complex cellular processes should include identification of combinations of motifs, in addition to searching for individual overrepresented binding sites.

Entities:  

Mesh:

Year:  2009        PMID: 19321735      PMCID: PMC2682518          DOI: 10.1093/bioinformatics/btp172

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

The productivity and growth of many plant species is severely limited by low temperatures (Mahajan and Tuteja, 2005). Cold stress, which includes chilling (0–12○C) and freezing (<0○C) temperatures, adversely affects crop yields by restraining sowing time, causing tissue damage and stunted growth. In order to overcome such limitations and increase the yields for agronomically important crops an improved tolerance to cold stress is desired. Plants growing in temperate regions have evolved traits that make it possible for them to cope with freezing temperatures. In response to mild cold stress, ∼4−6○C, a cascade of genetic reactions are triggered that greatly enhance the tolerance to later, more severe sub-zero temperatures. This ability to cope with low temperatures is known as cold acclimation (Guy, 1999). Moreover, this is a complex trait, involving numerous genes that interact in an intricate regulatory network. The activation and de-activation of genes participating in signaling pathways is achieved by several mechanisms, including the binding of transcription factors (TFs) to cis-elements in the promoter region of affected genes, which thereby become activated or suppressed. Arabidopsis thaliana and rice (Oryza sativa L.) are two major model species for plants with apparent differences in overall physiology, adaptation to temperature variation and cold tolerance. Arabidopsis thaliana is a dicot with an ability to cold acclimate, whereas rice is a monocot that is susceptible to cold. There are three factors in A.thaliana that are prominent in the regulation of cold responses in this species: the AtCBF1-3 (A.thaliana C-repeat/dehydration responsive element-binding factor) (At4g25490, At4g25470 and At4g25480), which are all highly induced within the first hours of the response (Gilmour et al., 1998). In rice, two of the identified orthologs, OsDREB1A-B (O.sativa dehydration-responsive element-binding factor) (Os09g35010 and Os09g35030), have been shown to be induced by cold stress, but also increase the tolerance to high-salt and drought when being overexpressed (Dubouzet et al., 2003; Ito et al., 2006). Since it has been established that the expression of many genes rely on the combinatorial control of several TFs (Remenyi et al., 2004; Singh, 1998), we wanted to investigate if this could be the case for the CBF/DREB factors. While conducting this study, we made observations that support the importance of combinatorial control of the response to cold stress. Consequently, we argue that identifying combinations of motifs is a necessary complement to searching for individual overrepresented binding sites when analyzing complex processes such as responses to cold stress.

2 METHODS

We have previously conducted a whole-genome expression profiling on chilling-susceptible rice, O.sativa, cv. Indica v Jumla Marshi (M.Bräutigam et al., submitted for publication), where we identified 1450 genes as differentially expressed during cold stress. Rice seedlings were stressed at 4C for 30 min, 2, 4, 8 and 24 h, and samples were thereafter hybridized to the Affymetrix GeneChip® Rice Genome Array. Additionally, data from a similar experiment for A.thaliana (D'Angelo et al., 2005) was downloaded and analyzed in the same way. In this second dataset, 1753 genes were identified as differentially expressed. In more detail, probes were considered differentially expressed if their signal value was at least 3-fold changes different and had a Benjamini–Hochberg adjusted P-value of ≤0.05 in at least one time step. Genes corresponding to these probes were identified by matching each probe sequence to a Japonica and A.thaliana locus, respectively; through a BLASTn search against TIGR Rice (cv. Japonica) 5.0 pseudomolecules (TIGR pseudomoleuces, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/) and TAIR A.thaliana 7.0 cds sequences, (TAIR cds sequences, ftp://ftp.arabidopsis.org/home/tair/Sequences/blast_datasets/TAIR7_blastsets/) respectively. The full-length sequence of each probe was used, which was downloaded from Affymetrix (Affymetrix, www.affymetrix.com) web site. For genes having a significant match against several probes only the one with the lowest E-value was kept. The promoter regions, 1 Kb upstream, of the identified cold-responsive genes in rice and A.thaliana were thereafter downloaded from TIGR (TIGR pseudomoleuces, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/) and TAIR (TAIR cds sequences, ftp://ftp.arabidopsis.org/home/tair/Sequences/blast_datasets/TAIR7_blastsets/), respectively. In order to derive cis-elements that relate to TFs that are plausibly important in the regulation of the CBF/DREB factors, we looked at motifs present in genes that are co-expressed with the CBF/DREBs. One cluster per CBF/DREB factor was derived by identifying genes having highly similar expression profiles to one of the CBF/DREBs. This was done by calculating the Pearson correlation (PC) between each CBF/DREB and all other cold-responsive genes in each species. Only genes that had PC ≥ 0.95 with at least one CBF/DREB were included in the cluster. In order to extract regulatory motifs that are overrepresented among genes in a cluster, we used a database of previously characterized motifs and searched for occurrences of these motifs in the promoter regions of the genes. The database was a merge of motifs from the PLACE (Higo et al., 1999) and plantCARE (Lescot et al., 2002) databases as well as motifs extracted from the literature. We choose to use consensus motifs, since this provided us with a high coverage of previously characterized plant motifs (in total 946 motifs). Thereafter, subsequences matching the motifs were searched for in both the forward and reverse strand of the promoter regions of the genes, by using an in-house developed Perl script, and each such match was considered as an occurrence of the motif. Only perfect matches to the motif were considered, i.e. the search did not include a scoring function. Additionally, we only regarded motifs with >4 IUPAC letters. To identify significantly overrepresented individual motifs, we calculated how many genes in the cluster had ≥T (T=1, 2,…, 25) occurrences of a motif m and the number that had

3 RESULTS

The clustering procedure resulted in one cluster for each of the CBF/DREB factors. The clusters are somewhat overlapping, since the CBFs have highly similar expression profiles. Moreover, the number of detected individual overrepresented motifs (IOMs) differed extensively, since the number of genes in each cluster differed, and ranged from 15 to 58 detected motifs. When studying the presence of each IOM in more detail, it was revealed that motifs that are relatively common among genes in a cluster are also common among genes in the remaining genome, i.e. also in genes that are not cold-responsive. We calculated the relative occurrence fold-change (FC) of each motif as where m is the motif in consideration, x1 is the number of genes in a cluster having at least T occurrences of the motif, x2 the number of genes in the cluster having
Fig. 1.

Relative frequency of motif occurrence. Percentage number of genes in the AtCBF1 cluster and the remaining genome, respectively, having at least T occurrence(s) of a motif. The y-axis indicates percentages and the x-axis represents the 58 significantly overrepresented (P ≤0.01) motifs that were identified in this cluster.

Relative frequency of motif occurrence. Percentage number of genes in the AtCBF1 cluster and the remaining genome, respectively, having at least T occurrence(s) of a motif. The y-axis indicates percentages and the x-axis represents the 58 significantly overrepresented (P ≤0.01) motifs that were identified in this cluster. Regarding motif combinations, we derived all significantly overrepresented two-, three- and four-combinations and made observations supporting the importance of combinatorial control. In general, the P-value for the detected motif combinations decreased as the complexity of the motif combinations increased, i.e. the median P-value of the significantly overrepresented four-combinations was much lower than for the two-combinations (Fig. 2). This is explained by the fact that the number of genes having a significant combination decreased with increasing order of the combination to a greater extent among the genes in the remaining genome than in the cluster (compare middle boxplots with rightmost boxplots in Fig. 2). For example, using the cluster exemplified in Figure 2, the median number of genes having a combination decreased by 30% when comparing two- and four-combinations (from 10 to 7 genes), whereas in the remaining genome the decrease was 42% (from 3891 to 1647 genes). In many cases, a specific combination of four motifs frequently occurs among the genes in a specific cluster, but is totally absent among the genes in the remaining genome (w.r.t. T, in some cases a motif combination occurs in the remaining genome, but in those cases with a smaller value on T for one or several of the motifs included in the combination). These results indicate that a specific combination of TFs is important for the regulation.
Fig. 2.

Motif combinations. The results of significantly overrepresented motif combinations (SOMC) are shown for one cluster (AtCBF2). Lower row shows SOMC with FDR-adjusted P ≤0.01 and upper row with standard P ≤0.01. The left box-plots show the P-values, the middle box-plots show the number of genes in the cluster having a SOMC and the right box-plots show the number of genes in the remaining genome having a SOMC.

Motif combinations. The results of significantly overrepresented motif combinations (SOMC) are shown for one cluster (AtCBF2). Lower row shows SOMC with FDR-adjusted P ≤0.01 and upper row with standard P ≤0.01. The left box-plots show the P-values, the middle box-plots show the number of genes in the cluster having a SOMC and the right box-plots show the number of genes in the remaining genome having a SOMC. We further analyzed those significant four-combinations having P≤0.01 (BH FDR-adjusted), in order to investigate if the motifs occurring in these combinations had previously been coupled to cold stress. Applying the BH FDR procedure decreased the number of significant combinations, for example the number of four-combinations decreased with 55–88%. Additionally, to avoid the redundancy and the combinatorial complexity of higher order combinations, we considered only the most significant non-redundant, non-overlapping motif (MSNM) combinations from each cluster. Regarding A.thaliana, the MSNM four-combinations for the AtCBF1 cluster were combinations of six different motifs, and for the AtCBF2 and AtCBF3 clusters they were combinations of six and seven motifs, respectively (Table 1 and Supplementary Table S1). For the OsDREB1A and OsDREB1B clusters, the MSNM four-combinations were based on seven and eight motifs, respectively.
Table 1.

Overrepresented four motif combinations

ClusterNameMotifT
AtCBF1GT-1 bindingGRWAAW7
TATA-boxTAATA2
TATA-boxTATAA5
ABRE-relatedMACGYGB1
POLASIG3AATAAT1
ROOTMOTIFTAPOX1ATATT3
AtCBF2GT-1 bindingGRWAAW6
HSF (heat shock factor)NTTCNNGAAN4
TATA-boxTTTTA5
TATA-boxTATATA1
W-boxTTGAC1
ABRE-relatedMACGYGB1
AtCBF3WRKY bindingYTGACY1
CAAT-boxATTGG2
TATA-boxATATAT1
ABRE-relatedMACGYGB1
TATA-boxTTATTT2
W-boxTTGAC1
TATA-boxTAATA3
OsDREB1ALTRECCGAC1
UnnamedCGTGG2
bZIP bindingACACNNG3
Root-hair-specificKCACGW2
Light responsiveCCRCCC1
Sugar starvationCGACG1
MADS bindingCW8G1
OsDREB1BPollen-relatedAGAAA3
TATA-boxTATAA1
SEF3 bindingAACCCA1
Sugar starvationCGACG2
E2 bindingWTTSSCSS1
REalphaAACCAA1
GT-1 bindingGRWAAW3
ABRE-relatedMACGYGB1

Motifs included in the most significant non-redundant, non-overlapping motif (MSNM) four combinations found in each cluster. Cluster, the cluster; Name, a descriptive name of the motif; Motif, the sequence pattern of the regulatory motif; T, the lowest number of occurrences for which the motif is regarded as statistically overrepresented (BH FDR adjusted Fisher's exact one-sided test, P < 0.01).

Overrepresented four motif combinations Motifs included in the most significant non-redundant, non-overlapping motif (MSNM) four combinations found in each cluster. Cluster, the cluster; Name, a descriptive name of the motif; Motif, the sequence pattern of the regulatory motif; T, the lowest number of occurrences for which the motif is regarded as statistically overrepresented (BH FDR adjusted Fisher's exact one-sided test, P < 0.01). In the list of motifs, the ABRE-related MACGYGB motif is distinguished, as it occurs in all but one cluster. This motif has previously been identified among genes responsive to cytosolic Ca2+, which is a major secondary messenger for triggering cold acclimation signaling pathways (Kaplan et al., 2006). Furthermore, the GT-1 binding site is represented in two of the A.thaliana clusters and one of the rice clusters. This motif has been found in many light-regulated genes in various plants, such as oat and rice. Additionally, GT-1 is activated by a calcium-dependent phosphorylation in response to light (Marechal et al., 1999). Since cold stress peaks during the night it also coincides with light stress, which plausibly explains the presence of the GT-1 binding site. In two of the A.thaliana clusters W-box and/or WRKY binding sites are represented, to which WRKY TFs bind (Eulgem et al., 2000), but not in any of the rice clusters. Many of these TFs are rapidly and transiently induced by various abiotic stresses, including cold stress. However, WRKY motifs also occur frequently in the rice genome, but because of this ‘general’ occurrence, they are not detected as overrepresented individual motifs and thus not included in the identified combinations. In the A.thaliana clusters, there are several different TATA-boxes present in the combinations, whereas this is not as common in rice. There are also other AT-rich motifs in the A.thaliana clusters, e.g. the POLASIG3 motif. Moreover, the AT-composition of the promoter sequences for the A.thaliana genes is 68.6 ±3.9%, whereas in the rice promoters it is 54.4 ±6.8%. Whether this has any major impact on the regulation of the acclimation pathways remains to be elucidated. Several of the motifs in rice have been shown to be important in ABA and/or light stress signaling pathways, such as the bZIP, light responsive and REalpha motifs (Baker et al., 1994; Degenhardt and Tobin, 1996; Finkelstein and Lynch, 2000). Moreover, the LTRE motif CCGAC present in one of the rice clusters is overlapping with the DRE motif RCCGAC, which is actually the target of the CBF/DREB factors (Stockinger et al., 1997). This motif has also been coupled to drought and light signaling, amongst others, besides cold (Kim et al., 2002).

4 DISCUSSION

Biologically, cold acclimation is a complex trait that involves the up- and down-regulation of numerous genes. In addition, a portion of these genes also encode activities involved in protecting the cell from related stresses such as drought, light, salt and wounding (Fujita, et al., 2006). This complicates the elucidation of signaling pathways directly coupled to an increased cold tolerance. As previously mentioned, activation and deactivation of genes participating in a specific biological process is achieved, amongst others, by the binding of TFs to cis-elements in the promoter region of affected genes. In addition, the coordination of multiple TFs in a combinatorial fashion makes it possible for the plant to control the expression of genes in response to a variety of environmental stimuli by using a limited number of TFs. Several studies address the importance of considering combinatorial control in order to understand the underlying regulatory network (Beer and Tavazoie, 2004; Chawade, et al., 2007; Hannenhalli and Levy, 2002; Kato, et al., 2004; Pilpel, et al., 2001; Zhou and Wong, 2004). Our results further support this notion and we argue that the focus should therefore be on identifying motif combinations when studying complex processes such as cold acclimation. For A.thaliana, the results indicate that a combination of an ABRE-related, GT-1, WRKY and AT-rich motif is important in the regulation of the cold response. Interesting is also that some of the previously identified motifs coupled to cold stress, such as the ICEr3 and MYB15 motifs (Agarwal et al., 2006; Benedict et al., 2006), are present in the A.thaliana combinations but not in rice (data not shown). The derived combinations in rice indicate instead on a coupling to ABA, drought and/or light stress signaling pathways. In addition, in our microarray experiment the OsDREB1s are differentially expressed to a much lower level than the AtCBFs (M.Bräutigam et al., submitted for publication; data not shown), which further indicates an overall difference in response patterns. The OsDREB1s have previously been shown to not only enhance tolerance to cold, but also drought and high-salt, indicating that they are more universal TFs responding to various stresses (Dubouzet, et al., 2003). This is plausibly reflected in the identified motif combinations as well as the observed difference in the overall response patterns in the microarray experiments. Funding: Swedish Research School in Genomics and Bioinformatics; Swedish Research Council (VR). Conflict of Interest: none declared.
  26 in total

Review 1.  Molecular responses of plants to cold shock and cold acclimation.

Authors:  C Guy
Journal:  J Mol Microbiol Biotechnol       Date:  1999-11

2.  PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences.

Authors:  Magali Lescot; Patrice Déhais; Gert Thijs; Kathleen Marchal; Yves Moreau; Yves Van de Peer; Pierre Rouzé; Stephane Rombauts
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Identifying regulatory networks by combinatorial analysis of promoter elements.

Authors:  Y Pilpel; P Sudarsanam; G M Church
Journal:  Nat Genet       Date:  2001-10       Impact factor: 38.330

4.  Predicting transcription factor synergism.

Authors:  Sridhar Hannenhalli; Samuel Levy
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

5.  Predicting gene expression from sequence.

Authors:  Michael A Beer; Saeed Tavazoie
Journal:  Cell       Date:  2004-04-16       Impact factor: 41.582

6.  The Arabidopsis abscisic acid response gene ABI5 encodes a basic leucine zipper transcription factor.

Authors:  R R Finkelstein; T J Lynch
Journal:  Plant Cell       Date:  2000-04       Impact factor: 11.277

7.  OsDREB genes in rice, Oryza sativa L., encode transcription activators that function in drought-, high-salt- and cold-responsive gene expression.

Authors:  Joseph G Dubouzet; Yoh Sakuma; Yusuke Ito; Mie Kasuga; Emilyn G Dubouzet; Setsuko Miura; Motoaki Seki; Kazuo Shinozaki; Kazuko Yamaguchi-Shinozaki
Journal:  Plant J       Date:  2003-02       Impact factor: 6.417

8.  Light signalling mediated by phytochrome plays an important role in cold-induced gene expression through the C-repeat/dehydration responsive element (C/DRE) in Arabidopsis thaliana.

Authors:  Hyoun-Joung Kim; Yun-Kyoung Kim; Jin-Young Park; Jungmook Kim
Journal:  Plant J       Date:  2002-03       Impact factor: 6.417

9.  Putative cold acclimation pathways in Arabidopsis thaliana identified by a combined analysis of mRNA co-expression patterns, promoter motifs and transcription factors.

Authors:  Aakash Chawade; Marcus Bräutigam; Angelica Lindlöf; Olof Olsson; Björn Olsson
Journal:  BMC Genomics       Date:  2007-09-02       Impact factor: 3.969

10.  Identifying combinatorial regulation of transcription factors and binding motifs.

Authors:  Mamoru Kato; Naoya Hata; Nilanjana Banerjee; Bruce Futcher; Michael Q Zhang
Journal:  Genome Biol       Date:  2004-07-28       Impact factor: 13.583

View more
  12 in total

1.  EgRBP42 encoding an hnRNP-like RNA-binding protein from Elaeis guineensis Jacq. is responsive to abiotic stresses.

Authors:  Wan-Chin Yeap; Tony Eng Keong Ooi; Parameswari Namasivayam; Harikrishna Kulaveerasingam; Chai-Ling Ho
Journal:  Plant Cell Rep       Date:  2012-06-15       Impact factor: 4.570

2.  Over-represented promoter motifs in abiotic stress-induced DREB genes of rice and sorghum and their probable role in regulation of gene expression.

Authors:  Amrita Srivasta; Sameet Mehta; Angelica Lindlof; Sujata Bhargava
Journal:  Plant Signal Behav       Date:  2010-07-01

Review 3.  Plant promoters: an approach of structure and function.

Authors:  Milena Silva Porto; Morganna Pollynne Nóbrega Pinheiro; Vandré Guevara Lyra Batista; Roseane Cavalcanti dos Santos; Péricles de Albuquerque Melo Filho; Liziane Maria de Lima
Journal:  Mol Biotechnol       Date:  2014-01       Impact factor: 2.695

4.  Organ- and stress-specific expression of the ASR genes in rice.

Authors:  Jorge Pérez-Díaz; Tsung-Meng Wu; Ricardo Pérez-Díaz; Simón Ruíz-Lara; Chwan-Yang Hong; José A Casaretto
Journal:  Plant Cell Rep       Date:  2013-10-02       Impact factor: 4.570

5.  Development of a model system to identify differences in spring and winter oat.

Authors:  Aakash Chawade; Pernilla Lindén; Marcus Bräutigam; Rickard Jonsson; Anders Jonsson; Thomas Moritz; Olof Olsson
Journal:  PLoS One       Date:  2012-01-09       Impact factor: 3.240

6.  Global Gene-Expression Analysis to Identify Differentially Expressed Genes Critical for the Heat Stress Response in Brassica rapa.

Authors:  Xiangshu Dong; Hankuil Yi; Jeongyeo Lee; Ill-Sup Nou; Ching-Tack Han; Yoonkang Hur
Journal:  PLoS One       Date:  2015-06-23       Impact factor: 3.240

7.  Comparative Transcriptomics of Sijung and Jumli Marshi Rice during Early Chilling Stress Imply Multiple Protective Mechanisms.

Authors:  Angelica Lindlöf; Aakash Chawade; Per Sikora; Olof Olsson
Journal:  PLoS One       Date:  2015-05-14       Impact factor: 3.240

8.  Genome-wide prediction and functional validation of promoter motifs regulating gene expression in spore and infection stages of Phytophthora infestans.

Authors:  Sourav Roy; Meenakshi Kagda; Howard S Judelson
Journal:  PLoS Pathog       Date:  2013-03-14       Impact factor: 6.823

9.  Deciphering Cis-Regulatory Element Mediated Combinatorial Regulation in Rice under Blast Infected Condition.

Authors:  Arindam Deb; Sudip Kundu
Journal:  PLoS One       Date:  2015-09-01       Impact factor: 3.240

10.  Global expression profiling of low temperature induced genes in the chilling tolerant japonica rice Jumli Marshi.

Authors:  Aakash Chawade; Angelica Lindlöf; Björn Olsson; Olof Olsson
Journal:  PLoS One       Date:  2013-12-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.