| Literature DB >> 32365489 |
Massimiliano Chetta1, Lorena Di Pietro2, Nenad Bukvic3, Wanda Lattanzi2,4.
Abstract
Gene expression regulation is achieved through an intricate network of molecular interactions, in which trans-acting transcription factors (TFs) and small noncoding RNAs (sncRNAs), including microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), play a key role. Recent observations allowed postulating an interplay between TFs and sncRNAs, in that they may possibly share DNA-binding sites. The aim of this study was to analyze the complete subset of miRNA and piRNA sequences stored in the main databases in order to identify the occurrence of conserved motifs and subsequently predict a possible innovative interplay with TFs at a transcriptional level. To this aim, we adopted an original in silico workflow to search motifs and predict interactions within genome-scale regulatory networks. Our results allowed categorizing miRNA and piRNA motifs, with corresponding TFs sharing complementary DNA-binding motifs. The biological interpretation of the gene ontologies of the TFs permitted observing a selective enrichment in developmental pathways, allowing the distribution of miRNA motifs along a topological and chronological frame. In addition, piRNA motifs were categorized for the first time and revealed specific functional implications in somatic tissues. These data might pose experimental hypotheses to be tested in biological models, towards clarifying novel in gene regulatory routes.Entities:
Keywords: binding motifs; epigenetics; gene expression regulation; in silico; miRNA; piRNA; regulatory pathways; transcriptional modulation
Mesh:
Substances:
Year: 2020 PMID: 32365489 PMCID: PMC7290514 DOI: 10.3390/genes11050482
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Experimental workflow. A schematic representation of the pipeline integrating the different analytical software is provided. The details of the analytical steps are described in the text (Section 2).
MicroRNA (miRNA) conserved motifs.
| Motif Sequence | miRNA-Contained Sequence Motif | miRNA-Contained Complement Sequence Motif | Transcription Factors |
|---|---|---|---|
| RAAAGWAA | 100 | 8 | AR, BCL6, ETS2, FOXA1, FOXA3, FOXJ3, FOXK1, FOXM1, FOXP1, FOXQ1, GATA3, IRF1, IRF2, IRF3, IRF7, IRF8, IRF9, LEF1, LHX3, MEF2B, NFATC1, NR2E3, NR4A2, PRDM1, PRDM6, RORG, SMARCA1, SOX2, SOX3, SOX4, SRY, STAT2, ZFP28, ZIM3, ZNF274, ZNF354A, ZNF394, ZNF85 |
| UACUUWUG | 59 | 2 | CDX1, FOXA1, FOXA2, FOXA3, FOXC1, FOXM1, LEF1, LHX3, MEF2A, MEF2B, MEF2C, MEF2D, NR2E3, POU5F1, ZNF708 |
| ACCAACC | 33 | 1 | ARI5B, FOXI1, GLI3, NFYA, RUNX2, SALL4, Z324A, ZIC1, ZN449 |
| CYUUCUG | 106 | 33 | ATOH1, BCL6, EHF, ERG, ETS1, ETV4, NDF1, NDF2, NR1D1, OLIG2, OSR2, PRGR, SMAD4, STAT2, THA11, ZIC3, ZIM3, ZN143, ZN436, ZN528, ZN547, ZN768, ZNF76, ZSC31 |
| ASGGAAG | 60 | 10 | E2F6, ELF1, ELF2, ELK1, ELK4, ETS2, ETV1, FEV, GABPA, IRF3, NFKB1, OLIG2, PAX6, T, ZNF257, ZNF341, ZNF436, ZNF528, ZSCAN31 |
| GCUUCCHU | 41 | 4 | ELF3, ERG, ETS2, FEV, FEZF1, FLI1, FOXK1, NHLH1, NFKB1, NFKB2, OLIG2, SMARCA1, SMARCA5, SOX10, ZNF341, ZNF394, ZNF436, ZNF502, ZNF528, ZNF582, ZNF85 |
| CAAAAACY | 35 | 2 | HOXA10, NFATC3, RUNX2, ZIM3, ZNF384 |
| UUACBGU | 40 | 4 | ELK1, ETS1, ETV4, OVOL1, RFX3, ZIM3 |
| YGGUUUUU | 37 | 3 | AIRE, FOXO4, FOXQ1, HXA10, HXA13, MEF2C, NR2E3, RUNX1, RUNX2, RUNX3, TWST1, ZN384 |
| UGUGAY | 179 | 90 | ERR1, RXRB, GFI1, GFI1B, MITF, TFEB, JUNB, TFE3, ESR1, RARG, FOXI1, USF1, ZIC3 |
| KAGGUUG | 74 | 21 | DUX4, HIC1, ZIM3, ZN136, ZN768 |
| GGKUGGGG | 48 | 9 | E2F4, E2F6, EGR1, EGR2, ESR2, GLI3, KLF1, KLF12, KLF15, KLF3, KLF4, KLF5, KLF6, KLF9, MAZ, MXI1, PATZ1, PRDM14, RXRA, SALL4, SP1, SP2, SP3, SP4, SREBF1, SREBF2, TAL1, TBX3, VEZF1, WT1, ZBTB17, ZIC1, ZNF281, ZNF449, ZNF467 |
| GGAMAG | 153 | 78 | BCL6, E2F1, E2F4, E2F6, E2F7, ERG, ETS1, ETV4, GATA1, IRF3, MEIS2, MYOD1, MYOG, NFATC1, NFATC2, NFATC3, NFATC4, NFKB1, NFKB2, NR1D1, NR1I3, PBX1, PRDM1, RELB, REST, STAT1, RELA, TFDP1, TGIF1, ZNF257, ZNF274, ZNF335 |
| AUUACUUU | 25 | 1 | ALX1, CDX2, DUX4, EVX2, FOXA1, FOXA2, FOXA3, FOXC1, FOXK1, FOXM1, HNF6, IRF1, IRF2, IRF7, IRF8, LHX2, LHX3, NR2E3, PBX2, PRDM1, ZFP28, ZN394, ZNF85 |
PIWI-interacting RNA (piRNA) conserved motifs.
| Motif Sequence | piRNA-Contained Sequence Motif | piRNA-Contained Complement Sequence Motif | Transcription Factors |
|---|---|---|---|
| HTTCY | 11887 | 9673 | BCL6, E2F1, E2F3, EHF, ELF1, ELF2, ELF3, ELF5, ELK1, ELK4, ERG, ETS1, ETS2, ETV1, ETV2, ETV4, ETV5, FEV, FLI1, GABPA, HSF1, NFATC2, NFATC3, NFKB1, OSR2, PRDM6, SMARCA1, SP4, STAT5A, STAT5B, STAT1, STAT3, STAT4, STAT6, TFDP1, ZFP28, ZNF317, ZNF394, ZNF418, ZNF528, ZNF680 |
| CAYCW | 11906 | 10059 | PDX1, CREB1, SNAI1, TBX3, CEBPG, ATF4, SNAI2, REST, ZEB1, NDF1, NDF2 |
| ACTCGYG | 345 | 55 | CLOCK |
| CGTWCCCA | 187 | 9 | NFKB1, NFKB2, RFX1, RFX2, RFX3, RBPJ |
| CACGK | 1749 | 1151 | ARNT, ATF3, ATF6A, BHLHE40, BMAL1, CLOCK, EPAS1, HIF1A, MAX, MITF, MTF1, MXI1, MYC, MYCN, TFE3, TFEB, USF1, USF2 |
| GACKCCTC | 204 | 47 | BACH2, CRX, FOSB, FOSL1, FOSL2, JUN, JUND, ZNF317 |
| ACCWY | 4967 | 4083 | NR2F1, NR2F2, ERR2, ESRRG, GLI3, NR2C1, NR4A1, NR4A2, PPARG, REST, RUNX1, RUNX2, RUNX3, RXRA, RXRG, TBX21, TBX3, ZIC1, ZNF250, ZNF8 |
| AGGTTKGA | 162 | 26 | TBX3, PRDM14, ZNF324, ZNF449, TBX21, TEAD1 |
| AAAVTGC | 412 | 215 | MAFF, NKX3-1, MAF, IRF1, IRF2, ZNF85, IRF7, PRDM1, BATF3, HNF4A, STAT2, HNF4G, FEZF1, MYB, HIF1A, RORG, ZFP42 |
| GGTTCCGA | 56 | 2 | NR2C2 |
| GCAGAYAC | 143 | 43 | CLOCK, SMAD2, ZNF708, MAFB, FOXM1, MXI1, ZFP42, ZNF547 |
| AGCTSCTG | 250 | 111 | BHLHA15, MYOG, TCF3, MYOD1, MYF6, HTF4, ITF2, ATOH1, ZBTB18, ASCL1, OSR2, PTF1A, NFE2L2, NHLH1, MAF, RFX5, ZNF563, CEBPG, ATF4, LYL1, ZIC3, NEUROD2, MAFG, OLIG2, TFAP4 |
| CACTTAGS | 116 | 30 | NKX3-1, ISL1, NKX3-2, DLX3, FOXA2, PRRX2, BACH2, NFE2, NOBOX, BACH1, MYC, FOXA1, NFE2L2, MYCN, ARNT |
| HATCCTA | 290 | 147 | ZNF586, CRX, ZNF324, ZNF274 |
| AGGYAG | 1145 | 843 | ZNF335, ZNF490, MYB, RFX5, ZNF770, ETS1, ERG, FLI1, SMAD3, ZNF257, ZNF549, PTF1A, ZNF563, ETV2 |
| CCAAAK | 911 | 649 | HNF4A, HNF4G, FOXA1, ISL1 |
| ATGAACTC | 70 | 13 | NR1I3, NR1I2, NR2C1, VDR, ATF2, ZNF18, RARA, ZNF549, RXRB, NR6A1, ZNF354A, NR1H4 |
| CKGCTAAA | 67 | 12 | T, ZNF322 |
| BCATTTC | 217 | 105 | BCL11A, EHF, ELF1, ELF2, ELF3, ELF5, ELK1, ELK4, ERG, ETS1, ETV1, ETV2, ETV4, ETV5, FEZF1, GABPA, MAFF, NFKB1, POU2F1, POU2F2, POU3F1, POU3F2, REL, SPI1, SPIB, STAT5B, STAT1, STAT2, STAT3, RELA, THRA, ZNF354A, ZFP42, ZFP82, ZNF140, ZNF528 |
| TAAGGGTA | 48 | 5 | ZNF264, NKX3-2, CRX, ZNF667, DUX4 |
| GTACGWCA | 45 | 4 | EPAS1, ARNT, CREB1, ATF1, RORG, HIF1A, CREM, BHE40, ATF3 |
| CGRTGCCC | 44 | 4 | HIC1, NFIA, CUX1 |
| GAACGGGY | 41 | 3 | HIF1A, ZBT14, ZNF18 |
| CVTGGA | 1343 | 1057 | ZNF436, TEAD4, BCL6, RARG, REST, TEAD1, STAT1, STAT5A, STAT3, ZBTB6, STAT4, SNAI1, POU3F2, TP63, ESRRG |
| GTAGCTAS | 55 | 9 | RFX5, BHLHA15 |
| ATCGCTGA | 47 | 6 | NFE2L2, MAFK, MAFG, OZF, ZFP42, ZNF335, ZNF528 |
| ABGTTTA | 124 | 50 | FOXC1, FOXJ3, FOXA3, FOXK1, FOXO4, FOXQ1, FOXA1, FOXA2, MEF2D, HNF1B, FOXJ2, FOXP2, HNF1A, FOXO3, SRY, MEF2C, MEF2A, FOXP1, MEF2B, FOXO1, FOXM1 |
| CCAMTAAC | 60 | 14 | CREM, ETS1, FOXI1, HOXA13, HOXB13, HOXB4, ISL1, RUNX2, VDR |
| KGGCTTA | 219 | 120 | ZNF528, ZNF41, OTX2, ZNF449, ZNF214 |
| GTGTYTA | 202 | 108 | CLOCK, FOXA1, FOXA2, FOXJ2, FOXJ3, FOXK1, FOXO3, FOXO4, FOXP1, FOXP2, FOXQ1, MEF2B, NKX3-1, PRDM14 |
| ATTGCACG | 23 | // | CEBPA, CEBPD, CEBPB, ATF4, CEBPG |
| CCTARAG | 186 | 99 | BCL6, ZNF436, STAT5A |
| AKGAGGAC | 108 | 46 | ZNF816, ZNF586, THRB, ELF5, ZKSCAN1, CRX, ATF2, ZNF263, ETV5, ZIC3, SPI1, RXRB |
| AAGGSCAC | 80 | 29 | ZNF667, ZNF264, NR4A2, ERR1, NR4A1, STF1, RXRG, NR5A2, TFAP2C, ERR2, ZNF214, ESRRG, HNF4G, SOX9, TFAP2A, NR1H3, RXRB, HNF4A |
| RATGGAA | 87 | 34 | ZNF502, SMARCA5, ZFP82, ZNF582, ZNF394, POU2F1, NFATC1, ZNF264, ZNF354A, FOXK1, ZFP28, HOXA1, BATF3, POU3F1, ISL1, ZNF8, ZNF85, TWIST1, STAT2, NFKB1, IRF3, SOX2, NFATC3, ATF4, NFATC2, PRDM6, CEBPG, ETS2 |
| ACTGWTCG | 34 | 5 | ZNF691, CUX1 |
| GAACWCA | 338 | 226 | NR3C1, VDR, RXRB, PGR, SOX4, ZKSCAN1, FOXP1 |
| CCGTAGCY | 29 | 3 | MYOG, RFX1, RFX2, MYCN |
| ACCDACTG | 104 | 45 | ZKSCAN1, MYB, NEUROD1, SNAI1, KLF8, NEUROD2, ATOH1, TFAP4, HTF4, BHLHA15, NR1D1 |
Figure 2Schematic representation of spatial and temporal distribution of putative DNA-binding motifs identified in microRNAs (miRNAs). The functional clusters identified with the computational pathway analysis of transcription factors (TFs) sharing the motifs identified in miRNAs are depicted as colored boxes distributed along the timeline (arrow on the left side) of human embryo development. The colored boxes (right side; same color ID scale) group the motifs identified in miRNAs for which developmental functions were enriched, according to GeneMANIA functional interpretation (see text for details).
Figure 3Schematic representation of the enriched functions for PIWI-interacting RNA (piRNA) motifs. The scheme provides an overview of the motifs associated with each functional cluster derived from the computational pathway analysis (based on GeneMANIA tool) of transcription factors (TFs) sharing the same domains with piRNAs (see text for details). A specific colored box is assigned to each piRNA motif.
Figure 4Several steps of microRNA (miRNA) biogenesis and nucleus–cytoplasm transport. Mature miRNAs derive from longer double-stranded primary transcripts (pri-miRNA), which are recognized and processed in the nucleus by the Drosha protein/DGCR8 microprocessor complex subunit (DGCR8) complex into shorter precursors folded in a hairpin loop structure (pre-miRNA). Pre-miRNAs are then exported to the cytoplasm (through Exportin 5) where they are first cleaved by Dicer and later processed by RNA-induced silencing complex (RISC) to form the mature miRNAs. Transactivation response element RNA-binding protein (TRBP) intervenes in the stabilization of Dicer. RISC, which includes Protein argonaute-2 (Ago2), also participates in the identification of miRNAs’ targets. The integrative miRNA network highlights, in the yellow circle, the import of Ago2 with mature miRNA into the nucleus via Importin 8 and trinucleotide repeat-containing gene 6A protein (TNRC6), another component of RISC complex, via Importin β. Nuclear RISC is again assembled to elicit pleiotropic effects by regulating multiple pathways with a direct interaction on DNA transcription factor binding sites (TFBSs) and possible formation of triple-helix structures.
Figure 5Different stages of PIWI-interacting RNA (piRNA) biogenesis and function. Mature piRNAs are derived from precursor RNAs following a post-transcriptional processing through two alternative mechanisms. The primary maturation pathway involves cleavage of long, single-stranded piRNA clusters and the binding with PIWI proteins in the cytoplasm. The second mechanism is a self-amplifying loop (termed “ping-pong” cycle), in which an antisense piRNA binds PIWI proteins and triggers production of a sense piRNA that binds to Protein argonaute-3 (Ago 3). Nuclear PIWI/piRNA complexes regulate gene and transposon expression by epigenetic modifications. Once the piRNAs are loaded onto PIWI, the activity and/or expression of DNA methyltransferases (Hen 1) is increased, promoting methylation of promoter regions, preventing transcription factor binding, and interacting with histone methyltransferase. Cytoplasmic mature piRNA promotes mRNA decay by interacting with deadenylation complex, inhibits translation by directly binding with translation factors, and modulates cellular signaling by directly regulating the post-translational modifications.