| Literature DB >> 25844119 |
Benildo G de Los Reyes1, Bijayalaxmi Mohanty2, Song Joong Yun3, Myoung-Ryoul Park1, Dong-Yup Lee2.
Abstract
Dissecting the upstream regulatory architecture of rice genes and their cognate regulator proteins is at the core of network biology and its applications to comparative functional genomics. With the rapidly advancing comparative genomics resources in the genus Oryza, a reference genome annotation that defines the various cis-elements and trans-acting factors that interface each gene locus with various intrinsic and extrinsic signals for growth, development, reproduction and adaptation must be established to facilitate the understanding of phenotypic variation in the context of regulatory networks. Such information is also important to establish the foundation for mining non-coding sequence variation that defines novel alleles and epialleles across the enormous phenotypic diversity represented in rice germplasm. This review presents a synthesis of the state of knowledge and consensus trends regarding the various cis-acting and trans-acting components that define spatio-temporal regulation of rice genes based on representative examples from both foundational studies in other model and non-model plants, and more recent studies in rice. The goal is to summarize the baseline for systematic upstream sequence annotation of the rapidly advancing genome sequence resources in Oryza in preparation for genus-wide functional genomics. Perspectives on the potential applications of such information for gene discovery, network engineering and genomics-enabled rice breeding are also discussed.Entities:
Keywords: Cis-elements; Comparative genomics; Regulatory network; Spatio-temporal regulation; Trans-acting factors
Year: 2015 PMID: 25844119 PMCID: PMC4385054 DOI: 10.1186/s12284-015-0041-x
Source DB: PubMed Journal: Rice (N Y) ISSN: 1939-8425 Impact factor: 4.783
Figure 1Comparison of the core promoter architectures of rice genes with the prototype core promoters of metazoan and Arabidopsis genes. In addition to the TATA-box, the composite prototype model of metazoan core promoter shows all previously identified cis-elements (BRE= TFIIB recognition element; DPE= Downstream promoter element; Inr= Initiator sequence) occurring in various combinations with the TATA-box. In Arabidopsis genes (A, B, C), TATA-box-containing core promoters (A) represent about 30% of all the protein-coding genes encoded by the genome. Non-TATA-box-containing promoters (B, C) represent a larger proportion of Arabidopsis genes. Small groups of genes contain novel motifs (cis?) that appear to be specific to higher plants. Evidence of the importance of DPE-like and/or BRE-like sequences in the functionality of Arabidopsis core promoters have not been established so far. In rice (D, E, F), TATA-box-containing core promoters (D) represent only about 18% of all protein-coding genes in the rice genome. Y-Patch is found in as much as 50% of the total protein-coding genes either in combination with or independent of TATA-box (E, F). DPE-like and BRE-like sequences are insignificantly represented in the core promoters of rice genes.
Figure 2Models of the regulatory sequence architectures of well characterized promoters of constitutively expressed rice genes such as actin ( ), ubiquitin ( ), phosphogluconate dehydrogenase ( ), Initiation factor ( ), ascorbic peroxidase ( ), cytochrome-c ( ), and R1G1-domain-containing protein ( ). Locations of critical cis-elements are indicated with colored symbols in both the upstream regions (gray lines) and downstream regions (5’UTR-exon, 5’UTR-intron; blue lines) relative to the location of the transcription start site (TSS).
List of experimentally or computationally defined upstream regulatory sequences, and their cognate transcriptional regulators that determine basal and constitutive expression of rice genes
|
|
|
|
|
|---|---|---|---|
|
| CTATAWAWA | TATA-binding protein ( | Core promoter; Pre-initiation complex |
|
| CYTCYYCCYC | Unknown | Core promoter; Pre-initiation complex |
| (Pyrimidine patch) | |||
|
| CCCAA (tandem octa-repeats) | Unknown | Fine-tuner of constitutive |
| CAAT (CCAT-box) | Unknown | Positive regulator of constitutive | |
| GTGAC (intronic) | Unknown | Positive regulator of constitutive | |
| CCGCGTTGGC (ABA-responsive) | Unknown | Positive regulator of constitutive | |
| ATTAAT, CACGTA (light-responsive) | Unknown | Positive regulator of constitutive | |
| (A/T)GCC (dideca-repeats) | Unknown | Positive regulator of constitutive | |
|
| GTTGTGGTTTG | Unknown | Positive regulator of constitutive |
List was based on seminal studies published in the literature and annotated in public databases.
List of experimentally and computationally defined upstream regulatory sequences, and their cognate transcriptional regulators that determine spatio-temporal control of rice genes
|
|
|
|
|
|---|---|---|---|
|
| TGTCTC | Auxin responsefactor ( | Primary regulator of auxin-regulated expression |
|
| GACACA | Auxin responsefactor ( | Primary regulator of auxin-regulated expression |
|
| GACActGACA | Auxin responsefactor ( | Primary regulator of auxin-regulated expression |
|
| TAACCACC |
| Primary regulator of GA-regulated expression; primary regulator of sugar sensitivity of GA-responsive genes |
|
| AACA/ACGT | Unknown | Regulator of endosperm-specific expression of rice seed storage protein glutelin ( |
|
| ATATCATGAGTCACTTCA |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| CTTTCGTGTACCACA |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| ACAATGCTGCTCAATTA |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| ATTATCCAATGTCATATTG |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| TAAGTCACGTTTGATGA |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| ATATCATGAGTCACTTCA |
| Positive regulator of endosperm-specific expression of rice glutelin ( |
|
| AAGCAACACACAAC | Dof-type P-boxBinding factor ( | Positive regulator of endosperm-specific expression |
|
| TGTAGAA, TGTTTTAATATGACGTGG | Dof-type P-boxbinding factor ( | Positive regulator of endosperm-specific expression |
|
| TACTATT, TCACTATT, |
| Positive regulator of endosperm-specific expression |
|
| ATTTGAGCTGCC | Unknown | Positive spatial regulator of expression in green leaves, stem and young panicle |
|
| TTGATATATTTGT | Unknown | Positive regulator of expression on green leaves and negative regulator of expression in young panicle and roots |
|
| CGGCGCGCCAC | Unknown | Positive regulator of expression in green leaves and negative regulator of expression in panicle and stem |
|
| TTAGATAATGGA | Unknown | Positive spatial regulator of expression in green leaves, stem root, and young panicles |
|
| TTTATCTATTTCC | Unknown | Negative regulator of expression in young panicle and stem |
|
| TCTTTGGCAGAG | Unknown | Positive regulator of expression in green leaves and negative regulator of expression in stem and young panicles |
|
| GCC-CORE | Ethylene response factor | Regulator of senescence-induced gene expression ( |
|
| TTCGAAATTGAACGTGCT-TAACCAAGAGAACAC |
| Senescence box |
|
| CGMCACGTB |
| ABA-response element for ABA regulated gene expression during stress response and seed maturation |
|
| TGCCACCGG |
| ABRE coupling factor |
|
| ACGCGTGTCCTC |
| ABRE coupling factor |
|
| (a/g)CCGAC |
| Dehydration response element for abiotic stress regulated gene expression independent of ABA |
|
| CACG |
| Regulator of abiotic stress-responsive gene expression independent of the DREB regulon |
|
| AATTTGAT, TAATTTGA |
| Regulator of chilling mediated oxidative stress signaling |
|
| AAAGAAAAA, TAGTTTTT |
| Regulator of chilling mediated oxidative stress signaling |
|
| CGTCA, TGACG, VCGCGB | Ethylene response factors ( | Jasmonic acid response element; regulator of responses to pathogens and herbivores |
|
| TAAGAGCCGCC | Ethylene response factors ( | Negative regulator of responses to |
|
| CCGCCCTCCA | Ethylene response factors ( | Disease responsive element |
|
| TTGAC element |
| Disease responsive |
|
| CAGTCG, CACCTG |
| Pathogen and herbivore response element; regulator of responses to |
List was based on seminal studies published in the literature and annotated in public databases.
Partial list of recently annotated regulatory sequences from rice based on recent updates in public cis-element and transcription factor databases
|
|
|
|
|
|---|---|---|---|
|
| AACAAAC |
| Regulator of endosperm-specific expression |
|
| GTACGTGGCGC | Unknown | Positive regulator of GA response; Negative regulator of ABA response |
| ACGTGKC | Unknown | ABA-response element for ABA regulated gene expression | |
| GCCGCGTGGC | Unknown | ABA-response element for ABA regulated gene expression | |
| AGTACGTGGC | Unknown | ABA-response element for ABA regulated gene expression | |
|
| GGACGCGTGGC |
| ABA-response element for ABA regulated gene expression |
|
| TACGTGTC |
| ABA-response element for ABA regulated gene expression |
|
| RTACGTGGCR |
| Positive regulator for response to ABA and desiccation in vegetative tissues |
| CTTCCACGTGGCA |
| ABA-response element for ABA regulated gene expression | |
|
| CATGCATG | Unknown | ABA-response element for ABA regulated gene expression |
|
| CAATTATTG |
| Enhancer of provascular-specific expression |
|
| AGATT |
| Non-symbiotic hemoglobin-2 promoter |
|
| GCATCTTTACTTTAGCATC | as1-like box binding factor | Phloem-specific expression |
|
| TGTGGGACCATG |
| Enhancer of meristem-specific expression |
|
| TGTCA |
| Pathogen-regulated expression |
|
| AACGT |
| Regulation of cell proliferation |
|
| GCGGGAAA |
| Enhancer in actively dividing cells |
|
| GATA |
| Light regulated expression |
|
| AGCCGCC |
| Pathogen-regulated expression |
| CATAAGAGCCGCCACT |
| Response to biotic and abiotic stress | |
|
| ATCAAGCATGCTTCTTGC |
| Iron deficiency regulated expression |
|
| CACGTGG |
| Iron deficiency regulated expression |
|
| ACGCTGCCG |
| Root meristem-specific expression |
|
| TGTCA |
| Pathogen-regulated expression |
|
| TTGAC |
| Pathogen-regulated expression |
Figure 3Hypothetical and simplistic models of upstream regulatory information content of spatio-temporally regulated genes of rice. (A) Prototype inducible promoter is comprised of combinations of cis-elements (i.e., colored boxes upstream from the core promoter identified by lowercase letters) that directly interface gene induction/represssion with various environmental and developmental signals. This occurs by virtue of their respective cognate regulatory transcription factors (i.e., TF1, TF2, etc.) that respond to hormonal or other physico-chemical changes in the cell. The spatio-temporal properties of genes are therefore defined by the integration of external and developmental cues through the synergistic interactions of various cis-elements and their cognate regulatory transcription factors. (B) Example of a pattern/trend that could be revealed by phylogenetic footprinting of upstream regulatory sequences of rice genes with potential significance to comparative functional genomics and allele mining. Phylogenetic footprinting has suggested that functional differences between the promoters of genes that vary in spatio-temporal regulation may be due to combinations of divergent regulatory fine-tuner elements (colored squares) and highly conserved core module elements (black squares). Variation in cis-regulatory information content may therefore reveal the genomic basis for variant alleles with unique spatio-temporal properties. This concept may be extrapolated for interspecific comparisons of orthologs and paralogs towards the discovery of potential novel alleles in the genus Oryza. Differences in expression in the absence of apparent upstream sequence variation may be used as basis for targeted analysis of differential methylation (shown with asterisks*) of homologous promoters. This concept may have potential applications for the discovery of epialleles that determine phenotypic variation.