| Literature DB >> 19843335 |
François Fauteux1, Martina V Strömvik.
Abstract
BACKGROUND: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19843335 PMCID: PMC2770497 DOI: 10.1186/1471-2229-9-126
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
DNA motifs discovered in the promoters of plant seed-storage protein genes
| B1 | 1.60e-07 | RYREPEATBNNAPA | 5.02-08 | ||
| B2 | 5.45e-04 | GADOWNAT | 6.86e-08 | ||
| B3 | 1.84e-02 | RYREPEATBNNAPA | 9.68e-08 | ||
| F1 | 1.43e-07 | LEGUMINBOXLEGA5 | 4.73e-12 | ||
| F2 | 2.06-03 | RYREPEATLEGUMINBOX | 1.05e-09 | ||
| F3 | 7.24e-03 | TATABOX1 | 1.58e-12 | ||
| F4 | 9.05e-03 | E2FBNTRNR | 9.03e-05 | ||
| F5 | 4.53e-02 | ACGTSEED2 | 1.32e-08 | ||
| P1 | 6.84e-08 | GLMHVCHORD | 1.52e-13 | ||
| P2 | 2.17e-05 | PROLAMINBOXOSGLUB1 | 4.41e-06 | ||
| P3 | 1.85e-04 | EMHVCHORD | 6.45e-11 | ||
| P4 | 2.94e-03 | TATABOX1 | 6.12e-10 | ||
| P5 | 9.17e-03 | BIHD1OS | 6.65e-06 | ||
| P6 | 1.14e-02 | GCAACREPEATZMZEIN | 5.47e-03 | ||
| P7 | 2.82e-02 | O2F3BE2S1 | 1.55e-05 |
q-value, statistical significance of motif
PLACE ID, identifier of PLACE consensus sequence matching motif
STAMP alignment, alignment of motif consensus sequence (top) with PLACE consensus sequence (bottom)
E value, expectation value of the STAMP alignment
Figure 1Sequence logos of motifs enriched in seed storage protein gene promoter sequences. A) Sequence logos of significant DNA motifs discovered in SSP gene promoter sequence from Brassicaceae (B1-3), Fabaceae (F1-5) and Poaceae (P1-P7). B) Sequence logos of the TATA-box motif identified in Brassicaceae SSP gene promoter sequences. Left, forward motif, right, reverse complement of motif.
Figure 2Position of . The positions of the best matching subsequence to motifs discovered in SSP gene promoters from (a) Brassicaceae, (b) Fabaceae and (c) Poaceae are mapped onto promoter sequences. (Motifs in Brassicaceae (B1-3), Fabaceae (F1-5) and Poaceae (P1-P7), respectively).
Top-ten scoring soybean promoters for the presence of Fabaceae seed-storage protein gene promoter motifs
| Glyma03 g32030.1 | 1 | P04776 | Glycinin Gy1 | 0.0 |
| Glyma19 g34780.1 | 2 | P11828 | Glycinin Gy3 | 0.0 |
| Glyma20 g28650.2 | 3 | P13916 | β-conglycinin, alpha chain | 0.0 |
| Glyma03 g32020.2 | 4 | P04405 | Glycinin Gy2 | 2.0e-251 |
| Glyma10 g04280.1 | 5 | P02858 | Glycinin Gy4 | 0.0 |
| Glyma13 g18450.1 | 6 | P04347 | Glycinin Gy5 | 3.0e-285 |
| Glyma10 g39150.1 | 7 | P11827 | Beta-conglycinin, alpha' chain | 3.0e-105 |
| Glyma20 g28640.1 | 8 | P25974 | β-conglycinin, beta chain | 7.0e-298 |
| Glyma20 g28460.2 | 9 | P25974 | β-conglycinin, beta chain | 1.0e-269 |
| Glyma03 g39940.1 | 10 | P13917 | Basic 7S globulin 1 | 7.0e-302 |
Gene ID, gene identifier (soybean genome assembly and annotation Glyma1)
PWM rank, total PWM matching score rank
C-B rank, total Cluster-Buster score rank
Hit ID, hit identifier (Uniprot/Swiss-Prot)
Description, hit description
E value, hit alignment expectation value.
Figure 3PWM score and rank of . The PWM matrix score associated with Fabaceae SSP gene promoter motifs in 14 soybean SSP gene promoters is compared to the average score obtained in 66,155 soybean promoters (baseline). Gy (1-5), glycinin subunit (1-5); βc (α', α, β), β-conglycinin subunit (α', α, β).