| Literature DB >> 22645502 |
Rajesh Mehrotra1, Amit Yadav, Purva Bhalothia, Ratna Karan, Sandhya Mehrotra.
Abstract
Transcription control of gene expression depends on a variety of interactions mediated by the core promoter region, sequence specific DNA-binding proteins, and their cognate promoter elements. The prominent group of cis acting elements in plants contains an ACGT core. The cis element with this core has been shown to be involved in abscisic acid, salicylic acid, and light response. In this study, genome-wide comparison of the frequency of occurrence of two ACGT elements without any spacers as well as those separated by spacers of different length was carried out. In the first step, the frequency of occurrence of the cis element sequences across the whole genome was determined by using BLAST tool. In another approach the spacer sequence was randomized before making the query. As expected, the sequence ACGTACGT had maximum occurrence in Arabidopsis thaliana genome. As we increased the spacer length, one nucleotide at a time, the probability of its occurrence in genome decreased. This trend continued until an unexpectedly sharp rise in frequency of (ACGT)N25(ACGT). The observation of higher probability of bigger size motif suggests its directed evolution in Arabidopsis thaliana genome.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22645502 PMCID: PMC3354754 DOI: 10.1100/2012/983528
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Frequency of occurrence of the various promoter sequences in which spacer sequence length between two ACGT palindromes is gradually increased from 5 to 25 nucleotides.
| Cis element | Chromosome 1 | Chromosome 2 | Chromosome 3 | Chromosome 4 | Chromosome 5 | Total | |
|---|---|---|---|---|---|---|---|
| (ACGT) 2 | ACGTACGT | 469 | 312 | 367 | 327 | 410 | 1885 |
| (ACGT) 8 | ACGTACGTACGTACGTACGTACGTACGTACGT | 70 | 31 | 12 | 28 | 59 | 200 |
| (ACGT)N5(ACGT) | ACGTGGCTAACGT | 16 | 11 | 13 | 13 | 19 | 72 |
| (ACGT)N10(ACGT) | ACGTGGCTATGGCGACGT | 8 | 5 | 10 | 4 | 12 | 39 |
| (ACGT)N25(ACGT) | ACGTGGCTATGGCGGAGCAAGATTCACTCACGT | 15 | 12 | 13 | 9 | 13 | 62 |
| (ACGT)RN5(ACGT) | ACGT–GCTAG–ACGT | 7 | 5 | 5 | 2 | 4 | 23 |
| (ACGT)RN10(ACGT) | ACGT–TGGGGCCGAT–ACGT | 2 | 2 | 4 | 3 | 3 | 14 |
| (ACGT)RN25(ACGT) | ACGTAGACACGTTGGGGGAACTTACTGCCACGT | 3 | 1 | 7 | 5 | 5 | 21 |
| (ACGT)RN25(ACGT) | ACGT-ATATGAGATCGGCGCTTCACGGAGC-ACGT | 4 | 14 | 6 | 4 | 4 | 32 |
| (ACGT)N5(ACGT) randomized | GGAATCCTTGGCA | 41 | 24 | 30 | 19 | 23 | 137 |
| (ACGT)N10(ACGT) randomized | GCGGGCTATCGGTAGCAT | 2 | 5 | 2 | 0 | 1 | 10 |
| (ACGT)N25(ACGT) randomized | TAAGGCTTAGCCACGCTTAGGGTGTGAGCACAC | 6 | 6 | 3 | 0 | 3 | 18 |
| (TGCA)N25(TGCA) | TGCAGGCTATGGCGGAGCAAGATTCACTCTGCA | 13 | 12 | 9 | 12 | 9 | 55 |
N5, N10, N25 denote sequence length between two ACGT palindromes. RN5, RN10, RN25—signify only spacer sequence being randomized. (ACGT) N_(ACGT) randomized—signify complete sequence being randomized.
Frequency of occurrence of nitrogenous bases when spacer sequence length between two ACGT palindromes is gradually increased from 5 to 25 nucleotides.
| A | C | G | T | Seq. used | Gap | Count | ||
|---|---|---|---|---|---|---|---|---|
| (ACGT)N5(ACGT) | ACGTGGCT_ACGT | 72 | 42 | 33 | 34 | 72 | 5 | 690 |
| (ACGT)N6(ACGT) | ACGTGGCTA_ACGT | 98 | 65 | 45 | 44 | 44 | 6 | 611 |
| (ACGT)N7(ACGT) | ACGTGGCTAT_ACGT | 92 | 91 | 77 | 80 | 77 | 7 | 824 |
| (ACGT)N8(ACGT) | ACGTGGCTATG_ACGT | 97 | 30 | 64 | 55 | 64 | 8 | 852 |
| (ACGT)N9(ACGT) | ACGTGGCTATGG_ACGT | 39 | 32 | 22 | 32 | 32 | 9 | 602 |
| (ACGT)N10 (ACGT) | ACGTGGCTATGGC_ACGT | 34 | 36 | 39 | 66 | 39 | 10 | 600 |
| (ACGT)N11(ACGT) | ACGTGGCTATGGCG_ACGT | 36 | 23 | 38 | 29 | 38 | 11 | 681 |
| (ACGT)N12(ACGT) | ACGTGGCTATGGCGG_ACGT | 56 | 54 | 65 | 45 | 56 | 12 | 638 |
| (ACGT)N13(ACGT) | ACGTGGCTATGGCGGA_ACGT | 78 | 50 | 77 | 59 | 77 | 13 | 652 |
| (ACGT)N14(ACGT) | ACGTGGCTATGGCGGAG_ACGT | 86 | 53 | 96 | 52 | 53 | 14 | 841 |
| (ACGT)N15(ACGT) | ACGTGGCTATGGCGGAGC_ACGT | 56 | 67 | 44 | 66 | 56 | 15 | 709 |
| (ACGT)N16(ACGT) | ACGTGGCTATGGCGGAGCA_ACGT | 60 | 34 | 52 | 34 | 60 | 16 | 843 |
| (ACGT)N17(ACGT) | ACGTGGCTATGGCGGAGCAA_ACGT | 39 | 41 | 42 | 39 | 42 | 17 | 830 |
| (ACGT)N18(ACGT) | ACGTGGCTATGGCGGAGCAAG_ACGT | 49 | 47 | 58 | 48 | 49 | 18 | 719 |
| (ACGT)N19(ACGT) | ACGTGGCTATGGCGGAGCAAGA_ACGT | 50 | 38 | 49 | 44 | 44 | 19 | 695 |
| (ACGT)N20(ACGT) | ACGTGGCTATGGCGGAGCAAGAT_ACGT | 34 | 30 | 44 | 37 | 37 | 20 | 821 |
| (ACGT)N21(ACGT) | ACGTGGCTATGGCGGAGCAAGATT_ACGT | 36 | 40 | 42 | 43 | 40 | 21 | 717 |
| (ACGT)N22(ACGT) | ACGTGGCTATGGCGGAGCAAGATTC_ACGT | 53 | 42 | 42 | 46 | 53 | 22 | 726 |
| (ACGT)N23(ACGT) | ACGTGGCTATGGCGGAGCAAGATTCA_ACGT | 91 | 55 | 60 | 61 | 55 | 23 | 771 |
| (ACGT)N24(ACGT) | ACGTGGCTATGGCGGAGCAAGATTCAC_ACGT | 77 | 64 | 57 | 53 | 53 | 24 | 1171 |
| (ACGT)N25(ACGT) | ACGTGGCTATGGCGGAGCAAGATTCACT_ACGT | 76 | 62 | 58 | 69 | 62 | 25 | 708 |
Alterations in transcription factor binding sites when spacer sequence length between two ACGT palindromes is gradually increased from 5 to 25 nucleotides.
| Minimal promoter sequence (MPS) | (ACGT) | (ACGT)(MPS) | (ACGT)2(MPS) | (ACGT)N5(ACGT)(MPS) | (ACGT)N10(ACGT)(MPS) | (ACGT)N25(ACGT)(MPS) | |
|---|---|---|---|---|---|---|---|
| Model name | Frequency | ||||||
|
| |||||||
| ARR10 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| AGL3 | 2 | 0 | 2 | 2 | 2 | 2 | 2 |
| ATHB-5 | 1 | 0 | 1 | 2 | 1 | 1 | 1 |
| bZIP910 | 0 | 0 | 0 | 0 | 1 | 1 | 1 |
| Dof3 | 1 | 0 | 1 | 1 | 1 | 1 | 2 |
| EmBP-1 | 2 | 0 | 2 | 1 | 2 | 2 | 2 |
| Gamyb | 5 | 0 | 5 | 5 | 5 | 5 | 5 |
| HAT5 | 2 | 0 | 2 | 2 | 2 | 2 | 2 |
| HMG-1 | 6 | 0 | 6 | 6 | 6 | 6 | 6 |
| HMG-I/Y | 6 | 0 | 6 | 6 | 6 | 6 | 6 |
| id1 | 5 | 0 | 5 | 5 | 5 | 5 | 5 |
| myb.Ph3 | 1 | 0 | 1 | 1 | 2 | 1 | 1 |
| PEND | 1 | 0 | 1 | 1 | 1 | 1 | 1 |
| squamosa | 2 | 0 | 3 | 3 | 3 | 3 | 3 |
| TGA1A | 1 | 0 | 1 | 1 | 2 | 2 | 2 |
|
|
|
|
|
|
|
| |