| Literature DB >> 28276452 |
Prabhakaran Munusamy1, Yevgen Zolotarov1, Louis-Valentin Meteignier2, Peter Moffett2, Martina V Strömvik1.
Abstract
Gene regulation at the transcriptional and translational level leads to diversity in phenotypes and function in organisms. Regulatory DNA or RNA sequence motifs adjacent to the gene coding sequence act as binding sites for proteins that in turn enable or disable expression of the gene. Whereas the known DNA and RNA binding proteins range in the thousands, only a few motifs have been examined. In this study, we have predicted putative regulatory motifs in groups of untranslated regions from genes regulated at the translational level in Arabidopsis thaliana under normal and stressed conditions. The test group of sequences was divided into random subgroups and subjected to three de novo motif finding algorithms (Seeder, Weeder and MEME). In addition to identifying sequence motifs, using an in silico tool we have predicted microRNA target sites in the 3' UTRs of the translationally regulated genes, as well as identified upstream open reading frames located in the 5' UTRs. Our bioinformatics strategy and the knowledge generated contribute to understanding gene regulation during stress, and can be applied to disease and stress resistant plant development.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28276452 PMCID: PMC5343461 DOI: 10.1038/srep43861
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Number of translationally regulated genes in Arabidopsis plants treated with (Defense) and without (Control) dexamethasone.
| Group | NYup | NYdown | YNup | YNdown | YYup | YYdown |
|---|---|---|---|---|---|---|
| Genes with differential translational efficiency (total 514) | 90 | 265 | 65 | 58 | 12 | 24 |
| Sub-set genes with 3′ UTR sequence (total 470) | 85 | 241 | 59 | 54 | 10 | 21 |
| Sub-set genes with 5′ UTR sequence (total 455) | 79 | 237 | 58 | 53 | 6 | 22 |
Up, up-regulated genes; down, down-regulated genes.
NY represents genes translationally regulated in control but normally regulated in treated plants.
YN represents genes translationally regulated in treated plants but normal in control plants.
YY represents genes translationally regulated in both control and treated plants.
Number of uORFs found in the 5′ UTRs of translationally regulated genes during defense response.
| Dataset | Number of 5′ UTR sequences | Number of 5′ UTRs containing uORFs | Number of predicted uORFs |
|---|---|---|---|
| YNdown | 53 | 15 | 18 |
| YNup | 59 | 8 | 9 |
| NYdown | 241 | 50 | 62 |
| NYup | 80 | 13 | 14 |
| YYdown | 23 | 2 | 2 |
| YYup | 7 | 1 | 1 |
Examples of translationally regulated genes containing one or more uORFs in their 5′ UTR.
| Gene | Gene description | Number of uORFs | uORF location in the 5′ UTR |
|---|---|---|---|
| AT2G46830 | Circadian clock associated 1, a transcription factor | 1 | [147–232] |
| AT1G54260 | Winged-helix DNA-binding transcription factor family protein | 2 | [23–97], [156–245] |
| AT4G02280 | Sucrose synthase enzyme 3 | 2 | [3–107], [145–222] |
| AT3G29575 | ABI five binding protein 3 | 2 | [86–169], [178–267] |
| AT1G51620 | protein kinase family protein | 2 | [35–169], [237–365] |
| AT1G74088 | Unknown protein | 3 | [45–116], [132–197], [213–326] |
Figure 1Position of upstream open reading frames (uORFs) possessing a strong Kozak sequence context in plant genes translationally regulated under stress.
The Kozak signal is the nucleotides [A/G] at position −3 and G at +4, where A of the uORF AUG codon is designated +1. 0 indicates the start of the main protein coding sequence.
Translationally regulated genes containing miRNA target sites in their 3′ UTR predicted using psRNATarget server.
Figure 2Two genes in the group predicted to be translationally down-regulated under stress were identified in the analysis to contain a binding site for the microRNA (ath-miR5021) in the 3′UTR.
The psRNA target web server was used to identify microRNA target sites in translationally regulated genes. (a) Gene AT3G25660 has three nucleotide mismatches present on either end of their binding region which could lead to miRNA cleavage activity; whereas the gene AT3G54220 has a nucleotide mismatch in the central region of the binding site possibly leading to mRNA translation inhibition. (b) In addition, a conserved sequence motif discovered by MEME was found within the miRNA binding region. The motif is present between 2–11 nucleotides from the 5′ end of the miRNA, which is known as the seed region - important for miRNA activity. This motif was conserved in the 3′ UTR of 26 out of 54 genes that are translationally regulated under stress.
Number of significant motifs identified in the 3′ UTRs of translationally regulated genes.
| Motif length | NYdown | NYup | YNdown | YNup | YYdown | YYup | |
|---|---|---|---|---|---|---|---|
| 6-mer | Seeder | 3 | 16 | 3 | 1 | 3 | None |
| MEME | 1 | 1 | None | None | None | None | |
| Weeder | None | None | None | 1 | 1 | 1 | |
| 8-mer | Seeder | 56 | 62 | 29 | 7 | 22 | None |
| MEME | 6 | 3 | 2 | None | None | None | |
| Weeder | 1 | 1 | 1 | 1 | 1 | 1 |
Number of significant motifs identified in the 5′ UTRs of translationally regulated genes.
| Motif length | NYdown | NYup | YNdown | YNup | YYdown | YYup | |
|---|---|---|---|---|---|---|---|
| 6-mer | Seeder | 36 | 91 | 79 | 36 | 86 | None |
| MEME | 87 | 28 | 81 | 140 | 27 | None | |
| Weeder | None | None | None | 1 | 1 | 1 | |
| 8-mer | Seeder | 57 | 39 | 50 | 42 | 49 | None |
| MEME | 6 | 8 | 8 | 7 | 4 | None | |
| Weeder | 1 | 1 | 1 | 1 | 1 | 1 |
Translationally regulated genes with motifs [U(G/A)U(A/G)U(G/A)U] recognized by Bruno-like protein in their 3′ UTR.
| Gene ID | Gene description | Position | Sequence that matches to Bruno-like protein motif |
|---|---|---|---|
| AT2G06520 | Encodes a protein similar to spinach photosystem II subunit PsbX | −94 | C |
| AT4G38460 | Geranylgeranyl reductase involved in isoprenoid biosynthetic process | −169 | C |
| AT5G57350 | Arabidopsis H(+)-ATPase | −230 | C |
| AT2G06520 | Encodes a protein similar to spinach photosystem II subunit PsbX | −93 | |
| AT5G53030 | Unknown protein | −70 | |
| AT2G29670 | Tetracopeptide repeat-like superfamily protein | −256 | U |
| AT4G38460 | Geranylgeranyl reductase involved in isoprenoid biosynthetic process | −19 | C |
| AT4G38460 | Geranylgeranyl reductase involved in isoprenoid biosynthetic process | −15 | G |
| AT4G25570 | Alpha/beta-Hydrolases superfamily protein | −143 | U |
| AT4G14500 | Polyketide cyclase/dehydrase and lipid transport superfamily protein | −54 | A |
| AT1G56280 | Unknown protein | −289 | C |
| AT3G54220 | Similar to DNA binding protein containing basic-leucine zipper region | −95 | U |
| AT4G30400 | RING/U-box superfamily protein | 111 | |
| AT3G46450 | Cytosolic factor family protein/Phosphoglyceride transfer family protein | 59, 95 | |
| AT3G54220 | Similar to DNA binding protein containing basic-leucine zipper region | 138 | |
| AT2G34070 | Member of TRICHOME BIREFRINGENCE-LIKE gene family | 11 | A |
| AT2G29670 | Tetracopeptide repeat-like superfamily protein | 66 | U |
*Bold and underlined represents the sequence that matches to motif.
Translationally regulated genes with motifs [UGUA (A/C/U) AUA] recognized by Pumilio protein in their 3′ UTR.
| Gene ID | Gene description | Position | Sequence that matches to Pumilio protein motif |
|---|---|---|---|
| AT4G25570 | Encodes cytochrome b561 | −85 | UG |
| AT3G01400 | ARM repeat superfamily protein | −165 | GG |
| AT5G53030 | Unknown protein | −70 | UG |
| AT1G78630 | Embryo defective 1473 involved in embryo development | 206 | UG |
| AT3G49670 | Encodes a CLAVATA1-related kinase-like protein | 204 | A |
| AT4G30400 | RING/U-box superfamily protein | 111 | AU |
| AT4G00490 | Encodes a chloroplast beta-amylase enzyme | −220 | UU |
| AT4G17080 | Histone H3 K4-specific methyltransferase SET7/9 family protein | −132 | GU |
*Bold and underlined represents the sequence that matches to motif.