| Literature DB >> 25867481 |
Julia Starega-Roslan1, Tomasz M Witkos2, Paulina Galka-Marciniak3, Wlodzimierz J Krzyzosiak4.
Abstract
The deep-sequencing of small RNAs has revealed that different numbers and proportions of miRNA variants called isomiRs are formed from single miRNA genes and that this effect is attributable mainly to imprecise cleavage by Drosha and Dicer. Factors that influence the degree of cleavage precision of Drosha and Dicer are under investigation, and their identification may improve our understanding of the mechanisms by which cells modulate the regulatory potential of miRNAs. In this study, we focused on the sequences and structural determinants of Drosha and Dicer cleavage sites, which may explain the generation of homogeneous miRNAs (in which a single isomiR strongly predominates) as well as the generation of heterogeneous miRNAs. Using deep-sequencing data for small RNAs, we demonstrate that the generation of homogeneous miRNAs requires more sequence constraints at the cleavage sites than the formation of heterogeneous miRNAs. Additionally, our results indicate that specific Drosha cleavage sites have more sequence determinants in miRNA precursors than specific cleavage sites for Dicer and that secondary structural motifs in the miRNA precursors influence the precision of Dicer cleavage. Together, we present the sequence and structural features of Drosha and Dicer cleavage sites that influence the heterogeneity of the released miRNAs.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25867481 PMCID: PMC4425070 DOI: 10.3390/ijms16048110
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The nucleotide frequencies at hairpin positions adjacent to the Drosha cleavage site (Na, Nb, Nc, Nd, N5, N6, N7, and N8) and the Dicer cleavage site (Ne, Nf, Ng, Nh, N1, N2, N3, and N4) in the most abundant miRNA variant is presented as a WebLogo sequence. (A) For each sequence logo, the y-axis denotes the frequency of that position being a specific nucleotide, with the size of the nucleotide correlating to its frequency. The nucleotides that are significantly different from the background are marked with solid-color letters (p < 0.00078, two-sided Fisher’s exact test with Bonferroni correction), and the p-values are provided in Figure 1C. The letter n denotes the number of analyzed miRNAs; (B) The background was defined as the average frequencies of nucleotides that were identified for the human pri-miRNA sequences that were deposited in miRBase ver. 14 (the excluded miRNAs are described in the Experimental Section). The fraction of each nucleotide is presented on the right; (C) p-value for the statistically significant differences that were observed between total miRNA population vs. background. Positions are designated as in (A).
Figure 2The nucleotide frequencies at hairpin positions surrounding the main (A,B) and the second-most-frequent (C,D) miRNA variants among the homogenous (A,C) and heterogeneous miRNAs (B,D) are presented as WebLogo sequences. The nucleotides that are significantly different from the background (Figure 1B) are marked with solid-color letters or appropriate designations (p < 0.00078, two-sided Fisher’s exact test with the Bonferroni correction). The colored # and outlined letters correspond to nucleotides that are significantly different between the homogeneous and heterogeneous miRNA groups (Part A vs. Part B and Part C vs. Part D in Figure 2). The colored asterisk (*) and outlined letters correspond to nucleotides that are significantly different between the main- and second-most frequent miRNA variants (Part A vs. Part C and Part B vs. Part D in Figure 2). All of the statistically significant p-values are provided in Table S1. The other designations are the same as in Figure 1; (E) The distribution of the miRNA lengths based on the nucleotide specificities of the Dicer cleavages. The column scatter plot and the mean values with a 95% confidence interval are shown. The x- and y-axes denote each isomiR’s terminal nucleotide and its length, respectively. The analyzed isomiRs were derived from the 5' arm of the pre-miRNA whose 5' end was generated through homogeneous Drosha cleavage (regardless of the heterogeneity/homogeneity at the 3' end of the miRNA). Each of the colored symbols represents one isomiR (most frequent).
Figure 3Sequence specificities of the nucleases Drosha and Dicer. (A) Relationship between the cleavage sites and the generated miRNA variants. A similar description as for the 5' arm should be applied to the 3' pri-miRNA arm; (B–D) The nucleotide frequencies that were observed at hairpin positions surrounding the Drosha and Dicer cleavage sites are presented as WebLogo sequences. Separate analyses were performed for the main (B) and second-most-frequent (C) Drosha cleavage sites within the group of miRNAs with a homogenous end that was generated by Drosha (regardless of the homogeneity or heterogeneity of the end that was generated by Dicer). For miRNAs with an end that was generated by the main homogenous Drosha cleavage (analyzed in B), the subsequent Dicer cleavage sites were analyzed both for the main (D) and second-most-frequent (E) homogeneous Dicer cleavage sites. The other designations are the same as in Figure 2.
Figure 4Schematic representation of the pool of isomiRs released from (A) homogeneously and (B) heterogeneously processed pri-miRNAs. The cleavage sites of the Drosha and Dicer RNase domains (RIIIA and RIIIB) are indicated with arrowheads whose size corresponds to the intensity of cleavage at these positions. The background colors at different cleavage sites indicate the occurrence of sequence bias at these sites, which affects the generation of homogeneous and heterogeneous miRNAs (according to the legend).