| Literature DB >> 35890495 |
Małgorzata Majewska1, Łukasz Kuźma1, Piotr Szymczyk1.
Abstract
Salvia miltiorrhiza synthesises tanshinones with multidirectional therapeutic effects. These compounds have a complex biosynthetic pathway, whose first rate limiting enzyme is 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGR). In the present study, a new 1646 bp fragment of the S. miltiorrhiza HMGR4 gene consisting of a promoter, 5' untranslated region and part of a coding sequence was isolated and characterised in silico using bioinformatics tools. The results indicate the presence of a TATA box, tandem repeat and pyrimidine-rich sequence, and the absence of CpG islands. The sequence was rich in motifs recognised by specific transcription factors sensitive mainly to light, salicylic acid, bacterial infection and auxins; it also demonstrated many binding sites for microRNAs. Moreover, our results suggest that HMGR4 expression is possibly regulated during flowering, embryogenesis, organogenesis and the circadian rhythm. The obtained data were verified by comparison with microarray co-expression results obtained for Arabidopsis thaliana. Alignment of the isolated HMGR4 sequence with other plant HMGRs indicated the presence of many common binding sites for transcription factors, including conserved ones. Our findings provide valuable information for understanding the mechanisms that direct transcription of the S. miltiorrhiza HMGR4 gene.Entities:
Keywords: HMGR4; Salvia miltiorrhiza; microRNA; promoter; transcription factor; transcription factor binding site
Year: 2022 PMID: 35890495 PMCID: PMC9318348 DOI: 10.3390/plants11141861
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Isolated S. miltiorrhiza HMGR4 promoter sequence (1499 bp), 5′ untranslated region (5′UTR) (96 bp) and coding sequence fragment (51 bp). The potential TATA box, transcription start site (TSS), pyrimidine-rich sequence (PRS), tandem repeat and consensus sequences for hormone-, pathogen-, wounding-, light-, and anaerobic-responsive elements are signed and marked in pink on the strands. ABRE, abscisic acid-responsive element; ARE, anaerobic-responsive element; AuxRE, auxin-responsive element; BRRE, brassinosteroid-responsive element; ERE, ethylene-responsive element; LRE, light-responsive element; MJRE, methyl jasmonate-responsive element.
Transcription factor binding sites (TFBSs) with the potential to form dimers detected in the S. miltiorrhiza HMGR4 promoter sequence.
| Transcription Factor (TF) Family Name | Fragment of Promoter Sequence with Underlined TFBSs a and Potentially Interacting TF Pairs |
|---|---|
| ˗ | |
| ˗ | |
| HD-ZIP | |
| WRKY | |
| Dof |
a Most conserved positions within a matrix were written in capital letters.
Figure 2Classification of TFBSs found in the proximal S. miltiorrhiza HMGR4 promoter with regard to their response to biotic and abiotic factors.
Figure 3Classification of TFBSs found in the proximal S. miltiorrhiza HMGR4 promoter with regard to their biological functions.
Identification of microRNAs (miRNAs) potentially interacting with the S. miltiorrhiza HMGR4 promoter sequence and 5′UTR identified by the miRBase tool.
| miRNA Name and Source | miRNA Sequence | Sequence Alignment Position Start/End | Strand | e-Value |
|---|---|---|---|---|
|
| ||||
| miR1128 | UACUACUCCCUCCGUCCCAAA | 350/368 | + | 0.75 |
| miR6462c-5p | AAGGGACAAAAAUGGCAUAAGA | 259/279 | − | 3.5 |
| miR1128 | UACUACUCCCUCCGUCCGAAA | 350/368 | + | 4.2 |
| miR1436 | ACAUUAUGGGACGGAGGGAGU | 354/368 | − | 6.2 |
| miR5205a | CAUACAAUUUGGGACGGAGGGAG | 355/374 | − | 9.1 |
| miR8740 | UAAUGAUGUGGCACAAUAUUA | 634/653 | − | 9.1 |
| miR11573a and miR11573b | UUGGGGAGCGUAUUGUAGAUU | 197/216 | − | 9.1 |
|
| ||||
| miR477 | CGAAGUCUUGGAAGAGAGUAA | 59/75 | − | 3.2 |
| miR6180 | AGGGUGGAAGAAAGAGGGCG | 55/69 | − | 3.9 |
| miR4993 | GAGCGGCGGCGGUGGAGGAUG | 13/30 | − | 6.9 |
| miR12107-5p | CUGAUGAGAGAGCGAAUGAUA | 51/66 | − | 8.4 |
TFs common between in silico analysis of the S. miltiorrhiza HMGR4 promoter and microarray co-expression studies with A. thaliana HMGR1.
| TF Family Name | TF Gene Name and Locus | Processes in Which TF is Involved a | r-Value b | TFBS Motif and Localisation c,d | |
|---|---|---|---|---|---|
| Homeodomain; HD-ZIP | cotyledon and leaf morphogenesis; primary root development; sucrose-signalling pathway | 0.594 | ATAAT 310; 309 | AATAA 308; 307 | |
| regulation of timing of transition from vegetative to reproductive phase; repression of cell expansion during plant development; response to blue light | 0.562 | ATAAT 309 | |||
| maintenance of floral organ identity | 0.507 | ATTAA 161 |
| ||
| regulation of tissue-specific accumulation of anthocyanins; cellular organisation of primary root; cuticle hydrocarbon biosynthetic process; plant-type cell wall modification; root hair cell differentiation | 0.524 |
| ATTAA 1161 | ||
| cotyledon development; epidermal cell differentiation; seed dormancy and germination | 0.583 | ||||
| cotyledon development; epidermal cell differentiation; seed dormancy and germination; maintenance of floral organ identity | 0.587 | ||||
| Homeodomain; bZIP; HD-ZIP | leaf morphogenesis; response to blue light and salt stress | 0.521 | ATAAT 307; 310 | AATAA 308; 307 | |
| bZIP | positive regulation of seed maturation | 0.666 | CCACG 822 | AACGT 290;1188 | |
| WRKY | regulation of basal cell division patterns during early embryogenesis; establishment of cell polarity; longitudinal axis specification; pollen development | 0.575/ 0.557 | TGACT 5; 832 | TTGAC 831 | |
| ˗ | 0.576 | ||||
| response to osmotic stress, salt stress and water deprivation | 0.504 | ||||
| phosphate ion transport | 0.515 | TTGAC 830; 831 | |||
| ˗ | 0.564 | ||||
| Myb/SANT; ARR-B | His-to-Asp phosphorelay signal transduction system; expression of nuclear genes for components of mitochondrial complex I; ethylene- and cytokinin-activated signalling pathways; promotion of cytokinin-mediated leaf longevity; root meristem growth; seed growth; stomatal movement | 0.552/0.611 | AATCT 15; 197; 919 | AATCC 179; 204; 548 | |
| His-to-Asp phosphorelay signal transduction system; activation of some type-A response regulators in response to cytokinins | 0.509/ | ||||
| Myb/SANT; MYB | response to ethylene, abscisic acid (ABA), indole-3-acetic acid, and | 0.599 | ACCTA 886 | ||
| MYB-related | morning-phased TF integrating circadian clock and auxin pathways; regulation of free indole-3-acetic acid level in time-of-day specific manner; negative regulation of freezing tolerance | 0.501 | ATATC 1166 | ||
| regulation of circadian rhythm | 0.545/ | ATATC 1166 |
| ||
| EIN3; EIL | positive regulation of ethylene response pathway; cellular response to iron ion; defence response to bacterium | 0.554 | TGTAT 374; 759 | ATACA 391 | |
| ethylene response pathway; sulphur metabolic process; cellular response to iron ion | 0.525 | ATGTA 757 | |||
| MADS box; MIKC | negative regulation of flowering and short-day photoperiodism; pollen development | 0.567 | TTTCC 804; 801 | ||
| inhibition of floral transition in autonomous flowering pathway; identity of floral meristem; response to temperature stimulus | 0.537 | TTTCC 801 |
| ||
| NAC; NAM | jasmonic acid-mediated signalling pathway; response to water deprivation | 0.594 | TACGT 44; 718 | TTGAC 829 | |
| activator in ABA-mediated dehydration response | 0.543 | CGTA 720 | ACAT 644 | ||
| NF-YB | protein heterodimerization activity | 0.558 | CTAAT 42 | ||
| NF-YC | 0.523 | ||||
| TBP | required for basal transcription (facilitating the recruitment of TFIID to the promoter, forming a preinitiation complex with RNA polymerase) | 0.645 | ATATA 746; 739 | ||
| TCP | positive regulation of circadian clock | 0.546 | CCCAC 818 | ||
| AP2; ERF | cell death; heat acclimation; ethylene-activated signalling pathway; response to cytokinin, jasmonic acid and other organism | 0.502 | TAAGA 494 | ||
| C2H2 | inhibition of plant growth under abiotic stress conditions; negative regulation of ABA signalling during seed germination; positive regulation of leaf senescence; jasmonate early signalling response; response to chitin and water deprivation; plants overexpressing AZF2 have increased sensitivity to salt stress and barely survive under high salt conditions | 0.503 | |||
| Dof | metal ion binding; binding of OBF TFs to OCS elements | 0.727 | CGTTA 685 | ACCTT 158; 184; 605; 612 | |
| SBP | promotion of vegetative phase change and flowering; vegetative- to reproductive-phase transition of meristem | 0.596 | AGTAC 51; 972 | CTTAC 275; 427; 625 | |
a The roles of the TFs were assumed based on the UniProt database. b r-value between the TFBSs found in the S. miltiorrhiza HMGR4 promoter and those detected in A. thaliana ranged from 0.5 to 1.0. c For TFBSs, only the most conserved positions within a matrix were listed. d TFBSs localised in proximal promoter region were put in bold.
Figure 4Interactions between TFs potentially binding to the S. miltiorrhiza HMGR4 promoter found with the Pathway System tool. The presented dependencies are based on co-citation (dashed line) or expert-curation (solid line). Diamond-ended lines indicate that a given TF has a predicted binding site in dependent promoter sequence. The number in the lower right corner of TF indicates the number of interactions within the network (including those not displayed). EBP = RAP 2-3, NAC3 = NAC055, RD26 = NAC072, ZF2 = AZF2, HB-1 = HAT5.
Figure 5Distribution of matrix families common to the S. miltiorrhiza promoter sequences, i.e., HMGR1 (GU367911.1), HMGR2 (KF297286.1) and HMGR4 (KT921337.1) identified by the Common TFs tool. Black lines correspond to the promoter sequences. Each matrix family is marked with a semicircular coloured symbol. The figure shows families found on the positive and negative strands. P$AHBP, Arabidopsis homeobox proteins; P$L1BX, L1 box; P$MIIG, MYB IIG-type binding sites; P$DOFF, DNA binding with one finger; P$GTBX, GT box elements; P$MADS, MADS box proteins; P$MYBL, MYB-like proteins; P$MYBS, MYB proteins with single DNA binding repeat; P$NTMF, NAC factors with transmembrane motif; P$NACF, plant specific NAC proteins; P$KAN1, transcription repressor KANADI; P$WBXF, W box family; P$TODS, time-of-day-specific regulatory elements; P$NCS1, nodulin consensus sequence 1; P$SPF1, sweet potato DNA-binding factor with two WRKY domains; P$ZFAT, zinc finger proteins; P$LREM, light-responsive elements; P$PSPE, protein secretory pathway elements; P$CGCG, CGCG box binding proteins; P$PCDR, proteins involved in programmed cell death response; P$PNRE, plant nitrate-responsive elements; P$SCAP, stomatal carpenter.
Common TF matrix families found during in silico analysis of the S. miltiorrhiza HMGR1, HMGR2, HMGR4 promoter sequences using the Common TFs tool.
| TF Matrix Family | Processes in Which TF Is Involved a |
|---|---|
| root, leaf and anther development; seed maturation; meristem initiation and growth; xylem and phloem pattern formation; cell differentiation; determination of bilateral symmetry; transition from vegetative to reproductive phase; glucosinolate metabolic process; response to: auxin, gibberellin, ABA, water deprivation, blue light and salt stress | |
| L1 box | cotyledon development; seed germination and dormancy; epidermal cell differentiation; maintenance of floral organ identity |
| MYB IIG-type binding sites | root, seed, stamen and xylem development; cellular cadmium ion homeostasis; gibberellin and flavonol biosynthesis; defence response to fungi; response to: ABA, chitin, salt stress, cold, water deprivation, phosphate starvation, potassium ion and light |
| DNA binding with one finger | secondary shoot, cotyledon and seed development; cell wall modification; cell cycle; gibberellin biosynthesis; response to: SA, auxin, chitin, red light and cold |
| GT box elements | shoot system and stomatal complex development; trichome morphogenesis; seed maturation and germination; cell size and growth; response to: auxin and water deprivation |
| MADS box proteins | flower, ovule and seed coat development; seed maturation; meristem structural organisation; transition from vegetative to reproductive phase; short-day photoperiodism; circadian rhythm; response to auxin |
| MYB-like proteins | integument, anther and pollen development; leaf morphogenesis; seed growth and dormancy; endothelial cell proliferation; vacuole organisation; wax biosynthesis; long-day photoperiodism; defence response to bacteria and fungi; response to: SA, brassinosteroid, gibberellin, ABA, jasmonic acid, chitin, salt, water deprivation and cold |
| MYB proteins with single DNA binding repeat | leaf and lateral root development; leaf senescence; circadian rhythm; peroxidase activity; auxin and sulphate ion homeostasis; response to: ABA, phosphate starvation, absence of light and high light intensity |
| NAC factors with transmembrane motif | leaf and trichome morphogenesis; xylem development; seed germination; photoperiodism; membrane protein proteolysis; response to: gibberellin, salt stress |
| plant specific NAC proteins | leaf and secondary shoot development; primary shoot apical meristem specification; formation of organ boundary; regulation of timing of organ formation; response to water deprivation |
| transcription repressor KANADI | phenylpropanoid metabolic process |
| W box family | induced systemic resistance; JA-mediated signalling pathway; phosphate ion transport; defence response to: bacteria, fungi and viruses; response to: SA, chitin and wounding |
| time-of-day-specific regulatory elements | circadian rhythm; red or far-red light signalling pathway; response to temperature |
| nodulin consensus sequence 1 | nodule-specific expression |
| zinc finger proteins | regulation of root development; phosphate ion homeostasis |
| light-responsive elements | response to hypoxia |
| protein secretory pathway elements | SA induction of secretion-related genes via NPR1 |
| CGCG box binding proteins | leaf senescence; defence response to: bacteria, fungi and insects; response to: cold, auxins and water deprivation |
| proteins involved in programmed cell death response | regulation of expression of vacuolar processing enzyme |
| plant nitrate-responsive elements | nitrate assimilation; stomatal movement; response to: nitrate and water deprivation |
| stomatal carpenter | stomatal movement |
| sweet potato DNA-binding factor with two WRKY domains | ˗ |
a The roles of the TFs were assumed based on the MatInspector (Genomatix) database.
Figure 6Selected 10-element-frameworks of TFBSs obtained for S. miltiorrhiza promoter sequences, i.e., HMGR1 (GU367911.1), HMGR2 (KF297286.1) and HMGR4 (KT921337.1) using the FrameWorker tool. Black lines correspond to the promoter sequences. Each matrix family is marked with a semicircular coloured symbol. The figure shows families found on the positive and negative strands. (A) P$SBPD, SBP-domain proteins; P$AHBP, Arabidopsis homeobox proteins; P$SUCB, sucrose box; P$WTBX, WT box; P$HEAT, heat shock factors; P$NACF, plant specific NAC proteins; (B) P$SCAP, stomatal carpenter; P$L1BX, L1 box; P$MIIG, MYB IIG-type binding sites; P$AHBP, Arabidopsis homeobox proteins; P$TCXF, CRC domain containing tesmin/TSO1-like CXC (TCX) factors; P$TGAF, basic/leucine zipper-type TFs of the TGA-family.
Figure 7Alignment of proximal promoter regions and 5′UTRs of Arabidopsis thaliana (At), Arabidopsis lyrata (Al), Gossypium hirsutum (Gh), Glycine max (Gm), Oryza sativa (Os), Solanum lycopersicum (Sl), Zea mays (Zm) and Salvia miltiorrhiza (Sm) HMGR genes. Conserved nucleotides are marked in blue. The darker the colour, the greater the degree of conservation within the analysed sequences. The proximal promoters are present at the beginning and the 5′UTRs at the end of the displayed sequences, respectively. Conserved TFBSs identified by DiAlign TF tool are highlighted in red. P$AHBP, Arabidopsis homeobox protein; P$GCCF, GCC box family; P$TDTF, transposase-derived proteins; P$MYBL, MYB-like proteins; P$L1BX, L1 box; P$DREB, dehydration responsive element binding factors; P$ROOT, root hair-specific cis-elements in angiosperms.