Literature DB >> 21909094

Weak seed-pairing stability and high target-site abundance decrease the proficiency of lsy-6 and other microRNAs.

David M Garcia¹, Daehyun Baek, Chanseok Shin, George W Bell, Andrew Grimson, David P Bartel.

Abstract

Most metazoan microRNAs (miRNAs) target many genes for repression, but the nematode lsy-6 miRNA is much less proficient. Here we show that the low proficiency of lsy-6 can be recapitulated in HeLa cells and that miR-23, a mammalian miRNA, also has low proficiency in these cells. Reporter results and array data indicate two properties of these miRNAs that impart low proficiency: their weak predicted seed-pairing stability (SPS) and their high target-site abundance (TA). These two properties also explain differential propensities of small interfering RNAs (siRNAs) to repress unintended targets. Using these insights, we expand the TargetScan tool for quantitatively predicting miRNA regulation (and siRNA off-targeting) to model differential miRNA (and siRNA) proficiencies, thereby improving prediction performance. We propose that siRNAs designed to have both weaker SPS and higher TA will have fewer off-targets without compromised on-target activity.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2011 PMID： 21909094 PMCID： PMC3190056 DOI： 10.1038/nsmb.2115

Source DB: PubMed Journal: Nat Struct Mol Biol ISSN： 1545-9985 Impact factor: 15.369

Introduction

MicroRNAs are ~22-nucleotide RNAs that pair to the messages of protein-coding genes to direct posttranscriptional repression of these target mRNAs[1,2]. In animals, numerous studies using a wide range of methods, including comparative sequence analysis, site-directed mutagenesis, genetics, mRNA profiling, co-immunoprecipitation, and proteomics, have repeatedly shown that perfect pairing to miRNA nucleotides 2–7, known as the miRNA seed, is important for the recognition of many if not most miRNA targets[3]. To impart more than marginal repression of mammalian targets, this seed pairing is usually augmented by either a match to miRNA nucleotide 8 (7mer-m8 site)[4-7] or an A across from nucleotide 1 (7mer-A1 site)[4,7] or by both (8mer site)[4,7]. In relatively rare instances, targeting also occurs through 3′-compensatory sites[4,5,8] and centered sites[9], for which substantial pairing outside the seed region compensates for imperfect seed pairing. A single miRNA can target hundreds of distinct mRNAs through seed-matched sites[10]. Indeed, most human mRNAs are conserved regulatory targets[8], and many additional regulatory interactions occur through nonconserved sites[11-13]. However, not every site is effective; 8-nucleotide sites are more often effective than 7-nucleotide sites, which are more often effective than 6-nucleotide sites[7,14]. Another factor is site context. For example, sites in the 3′UTRs are more often effective than those in the path of the ribosome[7]. Among 3′UTR sites, those away from the centers of long UTRs and those within high local A–U sequence context are more often effective[7], consistent with reports that sites predicted to be within more accessible secondary structure tend to be more effective[15-19]. Also influencing site efficacy is proximity to other miRNA-binding sites[7,20], to protein-binding sites[21], and to sequences that can pair to the 3′ region of the miRNA, particularly nucleotides 13–17 (ref [7]). Studies of site efficacy have focused primarily on different sites to the same miRNA, without systematic investigation of whether some miRNA sequences might be intrinsically more proficient at targeting than others. Broadly conserved miRNAs typically have many more conserved targeting interactions than do other miRNAs[4,8], and highly or broadly expressed miRNAs appear to target more mRNAs than do others[22], but these phenomena reflect evolutionary happenstance more than intrinsic targeting proficiency. Our interest in targeting proficiency was spurred by intriguing results regarding the lsy-6 miRNA. When tested in C. elegans, only one of 14 predicted targets with 7–8-nucleotide seed-matched sites responds to lsy-6, which was interpreted to show that perfect seed-pairing is not a generally reliable predictor for miRNA–target interactions[23]. An alternative interpretation, which seemed more parsimonious with findings for many other miRNAs in other contexts[3], is that the results for lsy-6 might not be generally applicable to other miRNAs because lsy-6 might have unusually high targeting specificity because of unusually low targeting proficiency. A similar rationale might explain results for mammalian miR-23, another miRNA that confers unusually weak responses from most reporters designed to test predicted targets. When considering properties that might confer a low targeting proficiency, we noted that both lsy-6 and miR-23 have unusually AU-rich seed regions, which could lower the stability of seed-pairing interactions. Perhaps a threshold of seed-pairing stability (SPS) is required for the miRNA to remain associated with targets long enough to achieve widespread seed-based targeting. Indeed, predicted SPS correlates with the propensity of siRNAs to repress unintended targets[24], a process called “off-targeting,” which occurs through the same seed-based recognition as that for endogenous miRNA targeting[10]. Potentially confounding this interpretation, however, was that miRNAs with AU-rich seed regions have more 3′UTR binding sites, a consequence of the AU-rich nucleotide composition of 3′UTRs, which could dilute the effect on each target message. Indeed, target-site abundance (TA) can be manipulated to titrate miRNAs away from their normal targets[25,26], and natural TA has been proposed to play a role in miRNA targeting and siRNA off-targeting[27,28], although these reported TA effects have not been fully disentangled from potential SPS effects. Here, we find that both SPS and TA have a substantial impact on targeting proficiency, and then apply these insights to improve miRNA target predictions.

Results

The targeting specificity of lsy-6 is recapitulated in HeLa cells

Lsy-6 targeting was originally examined in a C. elegans neuron, whereas the more proficient targeting by other miRNAs was experimentally demonstrated in other systems, sometimes in vertebrate tissues or primary cells[11,13,29,30] but more often in cell lines[3]. To test whether differences in targeting proficiency might be attributed to the very different biological contexts in which the miRNAs had been examined, we ported the 14 3′UTRs tested in C. elegans into a luciferase reporter system typically used in mammalian cell lines and introduced the lsy-6 miRNA by co-transfecting an imperfect RNA duplex representing the miRNA and miRNA* sequences (Supplementary Fig. 1a). As observed in worms, only the cog-1 3′UTR responded in HeLa cells (Fig. 1b). Repression was lost when a control miRNA (miR-1) replaced lsy-6 or when the two cog-1 sites were mutated, introducing either mismatches (Fig. 1b) or G:U wobbles (Supplementary Fig. 1b,c)

Figure 1

Increasing SPS while decreasing TA imparted typical targeting proficiency to lsy-6 and miR-23 miRNAs. (a) Sequences of miRNAs and target sites tested in reporter assays of this figure. Each miRNA was co-transfected with reporter plasmids as a duplex designed to represent the miRNA paired with its miRNA* strand (Supplementary Fig 1a). (b) Response of reporters with 3′UTRs of predicted lsy-6 targets following co-transfection with lsy-6. As a specificity control, the experiment was also performed using a non-cognate miRNA, miR-1 (grey bars). Geometric means are plotted relative to those of reporters in which the predicted target sites were mutated after also normalizing for the repression observed for miR-1 (grey bars). The mutant sites of this experiment were the cognate sites of Figure 2d. Error bars represent the third largest and third smallest values among 12 replicates from 4 independent experiments. Statistically significant differences in repression by the cognate miRNA compared to that by the non-cognate miRNA are indicated. (*p < 0.01, **p < 0.001, Wilcoxon rank-sum test). (c) Distribution of predicted SPSs for 7mer-m8 sites of 60 conserved nematode miRNA families[36] (Supplementary Table 7). Values were rounded down to the next half-integer unit. (d) SPS distribution for 7mer-m8 sites of 87 conserved vertebrate miRNA families[8] (Supplementary Table 7). (e) Distributions of predicted genome TA for 7mer-m8 3′UTR sites of 60 conserved nematode miRNA families (Supplementary Table 7). Values were rounded up to the next tenth of a unit. (f) Distributions of predicted genome TA for 7mer-m8 3′UTR sites of 87 conserved vertebrate miRNA families (Supplementary Table 7). (g) Response of reporters mutated such that their sites matched the miR-142 seed. The cognate miRNA was the miR-142lsy-6 chimera; non-cognate sites were lsy-6 sites. Otherwise, as in b. (h) As in g, except showing the response to miR-142 transfection. (i) Response of reporters with 3′UTRs of predicted miR-23 targets following co-transfection with miR-23a. Non-cognate sites were for miR-CGCG. Otherwise, as in b. (j) Response of reporters mutated such that their sites matched the seed of miR-CGCG, which was co-transfected as the cognate miRNA. Non-cognate sites were for miR-23. Otherwise, as in i.

Each of the 14 3′UTRs had at least one canonical 7–8-nucleotide lsy-6 site, and 11 UTRs had a site conserved in three sequenced nematodes (Supplementary Table 1). When evaluated using the context-score model, some sites had scores comparable to those of sites that mediate repression in this assay[7] (Supplementary Table 1). Moreover, the C27H6.9 3′UTR had two 8mer sites with scores matching those of the two cog-1 sites. The close match between the results in our heterologous reporter assay and previous results in C. elegans neurons indicated that the exquisite specificity for targeting the cog-1 3′UTR did not require the endogenous cellular context of lsy-6 repression; it was operable in HeLa cell culture and thereby attributable to the intrinsic properties of lsy-6 and its targets. This unifying result also implied that these properties could be investigated in mammalian cell culture, which is easier than using stable reporter lines in worms.

Increasing SPS while decreasing TA elevates targeting proficiency

As expected for a miRNA with sequence UUUGUAU at nucleotides 2–8, the calculated free energy (ΔG°) of the predicted SPS for the lsy-6 8mer or 7mer-m8 sites (both seven base pairs) was −3.65 kcal mol−1, which was weaker than that of all but one conserved nematode miRNA (Fig. 1c). The lsy-6 predicted SPS was also weaker than that of the weakest of 87 broadly conserved vertebrate miRNAs (Fig. 1d). The predicted ΔG° of an 8mer or 7mer-m8 seed match for miR-23 was −5.85 kcal mol−1, which fell in the bottom quintile for broadly conserved vertebrate miRNAs (Fig. 1d). Similar results are observed for 7mer-A1 or 6mer sites (both 6 base pairs) for both miRNAs (Supplementary Fig. 1d,e). Lsy-6 also falls at the extreme end of the distribution of TA for miRNAs in nematodes and human (Fig. 1e, f). To predict the TA in a genome, we counted the number of sites in a curated set of distinct 3′UTRs. When considering a particular cell type, the genome TA was converted to a transcriptome TA by considering the relative levels of each mRNA bearing a site, although in practice the genome and transcriptome TA levels were highly correlated. For example, the transcriptome TA for HeLa cells (TAHeLa) correlated nearly perfectly with the genome TA (R = 0.98, P < 10−100, Spearman’s correlation test, Supplementary Fig. 1f). When considering 8mer and 7mer-m8 sites (which both pair to nucleotides 2–8), lsy-6 had a genome TA that ranked 2nd among 60 C. elegans miRNA families and a TAHeLa that would place it beside miR-23, which ranks 5th among the 87 vertebrate families (Fig. 1e and Supplementary Fig. 1g). To test the hypothesis that either the weak SPS or the high TA of lsy-6 influences its targeting proficiency, we made three substitutions in the lsy-6 seed that changed both properties. The three substitutions converted the lsy-6 seed to that of miR-142-3p (Fig. 1a; Supplementary Fig. 1a), which changed the predicted SPS to −7.70 kcal mol−1, which was −4.05 kcal mol−1 stronger than that of lsy-6 and near the median values for conserved nematode and vertebrate miRNAs (Fig. 1c, d). The substitutions also changed the predicted TA to 102.957 sites in C. elegans and 103.207 sites in human, values below the median of conserved miRNAs in both genomes (Fig. 1e, f). When assayed using reporters with compensatory substitutions in their seed matches, co-transfecting this miR-142lsy-6 chimeric miRNA repressed nine of 14 reporters, a fraction within, if not exceeding, the range expected in this system when using reporters with the site types and contexts assayed (Fig. 1g). Repeating the experiment using the full-length miR-142-3p sequence (Fig. 1a; Supplementary Fig. 1a) gave similar results, indicating that miRNA sequence outside the seed region was irrelevant for repression of both the cog-1 3′UTR and the other C. elegans 3′UTRs (Fig. 1h). Like lsy-6, miR-23 also had low targeting proficiency in our system. A survey of 17 human 3′UTR fragments, randomly chosen from a set with two 7–8-nucleotide miR-23 sites (conserved or nonconserved) spaced within 700 nt of each other, found only one fragment to be repressed by miR-23 endogenous to either HeLa or HepG2 cells (data not shown). Subsequent experiments focusing on the six UTRs with the most favorable context scores (Supplementary Table 1) showed that co-transfecting additional miR-23a imparted marginal if any repression (Fig. 1i). To test if increasing SPS while decreasing TA might also improve the targeting proficiency of miR-23a, we converted two A:U seed pairs into two G:C pairs (Fig. 1a; Supplementary Fig. 1a), which boosted the predicted SPS from −5.85 kcal mol−1 to −8.67 kcal mol−1 while reducing the TA from the 5th highest of the 87 vertebrate families to below the lowest. When assaying this miRNA, called miR-CGCG, using reporters with compensatory substitutions in their seed matches, the sporadic and marginal repression observed with the wild-type UTRs became much more robust (Fig. 1j). These results indicated that miR-23a had low targeting proficiency because of either its weak SPS or its high TA, or both, thereby extending our findings to a mammalian miRNA and mammalian 3′UTRs.

Separating the effects of SPS and TA on miRNA targeting

To begin to differentiate the potential effects of SPS from those of TA, we considered the relationship between these two properties for all 16,384 possible heptamers. When examining the C. elegans 3′UTRs, these properties were highly anti-correlated (Fig. 2a, R = 0.680, P < 10−100, Spearman’s correlation test). When examining mammalian 3′UTRs the relationship was still highly significant, but the substantial depletion of CG dinucleotides in the vertebrate transcriptome[31] created more spread in TA, which lowered correlation coefficients for both human (Fig. 2b, R = 0.121, P < 10−100) and mouse (Supplementary Fig. 2a, R = 0.081, P < 10−100). In general, each additional CG dinucleotide imparted an additional log10 reduction in TA.

Figure 2

Separating the effects of SPS and TA on miRNA targeting proficiency. (a) The relationship between predicted SPS and genomic TA for lsy-6 and the 59 other conserved nematode miRNAs (red squares), and all other heptamers (light blue, blue, dark blue, or purple squares indicating 0, 1, 2, or 3 CpG dinucleotides within the heptamer respectively). TA was defined as the total number of canonical 7–8-nucleotide sites (8mer, 7mer-m8, and 7mer-A1) in annotated 3′UTRs. SPS values were predicted using the respective 7mer-m8 sites. (b) The relationship between predicted SPS and TA in human 3′UTRs for miR-23 and the 86 other broadly conserved vertebrate miRNA families (red squares). Otherwise, as in a. (c) Sequences of miRNAs and target sites tested in reporter assays of this figure. (d) Response of reporters with 3′UTRs of predicted lsy-6 targets mutated such that their sites matched the seed of LTA-lsy-6, which was co-transfected as the cognate miRNA. Non-cognate sites were for lsy-6. Otherwise, as in Figure 1b. (e) 2,6-di-aminopurine (DAP or D)—uracil base pair. (f) Response of reporters used in d after co-transfecting D-LTA-lsy-6 as the cognate miRNA. Otherwise, as in d. (g) Response of reporters used in Figure 1i after co-transfecting D-miR-23a as the cognate miRNA, alongside results for miR-23a that was repeated in parallel. Otherwise, as in Figure 1i.

To test the influence of TA on lsy-6 targeting proficiency, we designed the low-TA (LTA) version of lsy-6, which had two point substitutions in the lsy-6 seed (Fig. 2c; Supplementary Fig. 1a). Substituting U4 with a C (substitution U4C) introduced a CG dinucleotide, whereas the other substitution, U2A, facilitated later investigation of SPS. Because of the CG dinucleotide, LTA-lsy-6 had a predicted TAHeLa 95% lower than that of lsy-6, a value that would be 3rd lowest among the conserved vertebrate miRNA families. Although the substitutions also increased SPS, the predicted SPS of −5.49 kcal mol−1 was still slightly weaker than that of miR-23 and well below the median for both nematode and vertebrate conserved miRNAs (Fig. 1c, d). When assayed using reporters with compensatory substitutions in their seed matches, LTA-lsy-6 repressed the cog-1 reporters and only three others (Fig. 2d). Two (F55G1.12 and C27H6.9) were repressed only marginally (<1.3 fold), which was reminiscent of the marginal repression imparted by miR-23 when using its cognate sites, and for the third, T20G5.9, much of the apparent repression was attributed to normalization to the miR-1 results, which in the case of this UTR were unusual (Supplementary Fig. 2d). Taken together, the LTA-lsy-6 results indicated that lowering TA was not sufficient on its own to confer robust targeting proficiency. To increase SPS without changing TA, we replaced each of the two seed adenines of LTA-lsy-6 with 2,6-di-aminopurine (DAP or D). DAP is an adenine analog with an exocyclic amino group at position 2, which enables it to pair with uracil with geometry and thermodynamic stability resembling that of a G:C pair (Fig. 2e). Because nearest-neighbor parameters had not been determined for model duplexes containing D:U pairs, we estimated SPS using the values for A:U pairs and adding −0.9 kcal mol−1 for each D:U pair, as this is the value attributed to an additional hydrogen bond in model duplexes[32]. With this approximation, the D-LTA-lsy-6 miRNA had a predicted SPS of −7.29 kcal mol−1, which approached −7.87 kcal mol−1, the median predicted SPS of the conserved vertebrate miRNAs. When assayed using the same reporters as used for LTA-lsy-6, D-LTA-lsy-6 repressed seven of fourteen reporters (Fig. 2f). Although less proficient than that observed with the miR-142 seed (Fig. 1g, h), repression was greater than that observed for LTA-lsy-6 and on par with that expected for mammalian miRNAs in this system when using reporters with the site types and site contexts assayed. We next tested D-miR-23, which also had two seed adenines replaced by DAP, thereby boosting the predicted SPS from −5.85 kcal mol−1 to −7.65 kcal mol−1. Five of the six reporters with miR-23 sites were repressed significantly greater by D-miR-23a than by wild-type miR-23a (Fig. 2g), thereby demonstrating a favorable effect for increasing SPS in the context of very high TA (93rd percentile). However, repression was still considerably lower than that conferred by miR-CGCG, presumably because miR-CGCG had lower TA and somewhat stronger SPS (−8.67 kcal mol−1), although we cannot exclude the possibility that the non-natural DAP in the miRNA compromised activity. In summary, the results with DAP-substituted miRNAs show that for miRNAs with weak SPS, increasing SPS can enhance targeting proficiency, regardless of whether these miRNAs have high or low TA. Because DAP-substitution changed the predicted SPS without changing the sites in the UTRs, these results indicated that the low proficiency was due to weak SPS rather than occlusion of the sites by RNA-binding proteins that recognized the miRNA seed matches. Taken together, our reporter results also suggest that lowering TA can further enhance targeting proficiency, particularly for miRNAs with moderate-to-strong SPS.

Global impact of TA and SPS on targeting proficiency

To examine the global impact of TA and SPS on targeting, we collected 175 published microarray datasets that monitored the response of transfecting miRNAs or siRNAs (sRNAs) into HeLa cells (Supplementary Table 2). Datasets reporting the effects of sRNAs with the same seed region were combined, yielding results for 102 distinct seeds that covered a broad spectrum of TA and predicted SPS (Fig. 3a). For each of these 102 datasets, we determined the mean repression of mRNAs with a single 3′UTR 8mer site and no other sites in the message, and plotted these values with respect to both the TAHeLa and predicted SPS of the transfected sRNA (Fig. 3b, top). sRNAs with lower TAHeLa were more effective than those with higher TAHeLa, and those with stronger predicted SPS were more effective than those with weaker predicted SPS (P=0.0006 and 0.0054 for TAHeLa and SPS, respectively, Pearson’s correlation test; Table 1). When using multiple linear regression to account for the cross-correlation between TAHeLa and SPS, correlations were still at least marginally significant for the individual features (P = 0.005 and 0.05, t test; Table 1), which indicated that both properties were independently associated with the proficiency of targeting 3′UTR sites. Similar results were observed for targeting 7mer-m8, 7mer-A1, and 6mer sites (Fig. 3b and Table 1).

Figure 3

Impact of TA and SPS on sRNA targeting proficiency, as determined using array data.(a) Distribution of TAHeLa and predicted SPS for the sRNAs from the 102 array datasets analyzed in this study (orange squares), and sRNAs from datasets that passed the motif-enrichment analysis (red squares). Otherwise, plotted as in Figure 2b. (b) Response of expressed mRNAs with a single 3′UTR site to the cognate sRNA, shown with respect to TAHeLa and predicted SPS. Fold-change values are plotted according the key to the right of each plot, comparing mRNAs with a single site of the type indicated (and no additional sites to the cognate sRNA elsewhere in the mRNA) to those with no site to the cognate sRNA; note different scales for different plots. In areas of overlap, mean values are plotted. Correlation coefficients and P values are in Table 1. (c) Response of expressed mRNAs with a single ORF site to the cognate sRNA, shown with respect to TAHeLa and predicted SPS. Otherwise, as in b. (d) Response of mRNAs with the indicated single sites when binning the cognate sRNA by TAHeLa (top panel) or predicted SPS (bottom panel). The key indicates the data considered, with the first quartiles of the top panel comprising data for sRNAs with the lowest TAHeLa and those of the bottom panel comprising data for sRNAs with the strongest predicted SPS. Error bars indicate 95% confidence intervals.

Table 1

Relationship between mean mRNA repression and either TA or predicted SPS for the indicated site types, as determined from microarray data (Fig. 3b,c)

Site location and type	Multiple linear regression			Simple linear regression
	Multiple R²	P value		TA_HeLa		SPS
	Multiple R²	TA_HeLa	SPS	R²	P value	R²	P value
3′UTR 8mer	0.149	0.0049	0.051	0.115	0.0006	0.076	0.0054
3′UTR 7mer-m8	0.190	0.0081	0.0047	0.122	0.0003	0.131	0.0002
3′UTR 7mer-A1	0.335	0.0009	2 × 10 ⁻⁵	0.196	3 × 10 ⁻⁶	0.256	6 × 10 ⁻⁸
3′UTR 6mer	0.177	0.039	0.0025	0.097	0.0014	0.141	0.0001
ORF 8mer	0.104	0.018	0.14	0.085	0.0030	0.052	0.021
ORF 7mer-m8	0.171	0.019	0.0054	0.103	0.0010	0.123	0.0003
ORF 7mer-A1	0.135	0.010	0.073	0.106	0.0008	0.076	0.0052
ORF 6mer	0.228	0.010	0.0008	0.133	0.0002	0.174	1 × 10 ⁻⁵
5′UTR 8mer	0.004	0.75	0.68	0.002	0.64	0.003	0.59
5′UTR 7mer-m8	0.003	0.63	0.72	0.002	0.70	0.000	0.84
5′UTR 7mer-A1	0.012	0.60	0.49	0.007	0.41	0.009	0.35
5′UTR 6mer	0.011	0.97	0.32	0.001	0.74	0.011	0.29

Although both TA and SPS each significantly influenced targeting proficiency, together they explained only a minority of the variability (Table 1). Most of the variability might be from factors unrelated to targeting, such as array noise, differential transfection efficiencies, or differential sRNA loading or stability. To reduce variability from these sources, we focused on 74 datasets for which responsive messages were significantly enriched in 3′UTR sites to the transfected sRNA (Fig. 3a, red squares; Supplementary Table 2). With these filtered datasets, correlations between proficiency and both TAHeLa and SPS were stronger and observed with similar statistical significance, even though the filtering reduced the quantity of data analyzed and might have preferentially discarded datasets for which high TA or weak SPS prevented detectable repression (Supplementary Fig. 3a,b and Supplementary Table 3). Studies monitoring global effects of miRNAs on target repression have concluded that sites in open reading frames (ORFs) can mediate repression but that the efficacy of these sites is generally less than that of sites in 3′UTRs[7,30,33,34]. To examine the impact of TA and SPS on targeting in ORFs, we considered expressed messages that had a single ORF site but no additional sites in the rest of the message. For 7mer-m8 and 6mer sites, mean repression significantly correlated with both TAHeLa and predicted SPS, and for the other two sites in ORFs, mean repression significantly correlated with TAHeLa (Fig. 3c and Table 1). The response of sites in 5′UTRs did not significantly correlate with either TA or predicted SPS (Table 1), consistent with the idea that 5′UTRs harbor relatively few effective sites[3]. We next examined the quantitative impact of TA and SPS on targeting proficiency. The same sets of mRNAs with single sites to the cognate sRNAs were considered, and for each site type and each mRNA region, mRNAs were binned into quartiles ranked by either low TA or strong predicted SPS. For each site type, messages in the top quartile responded more strongly than those in the bottom (Fig. 3d). The differences usually were substantial. For example, repression of the top quartile of mRNAs with 7mer-A1 sites matched the mean repression of mRNAs with 7mer-m8 sites, whereas repression of the bottom quartile resembled the mean repression of mRNAs with 6mer sites.

Improved miRNA target prediction

One of the more effective tools for mammalian miRNA target prediction is the context score[30]. Context scores are used to rank mammalian miRNA target predictions by modeling the relative contributions of previously identified targeting features, including site type, site number, site location, local AU content, and 3′-supplementary pairing, to predict the relative repression of mRNAs with 3′UTR sites[7]. However, the context-score model was not designed to consider differences between sRNAs, such as TA or SPS, which can cause sites of one miRNA to be more robustly targeted compared to those of another (assuming equal expression of the two miRNAs). To build a model appropriate for predicting the relative response of targets of different miRNAs, we considered TA and SPS as two independent variables when performing multiple linear regression on the 11 microarray datasets used previously for the initial development and training of the context-score model[7]. The other parameters were local A–U content, the location of the site within the 3′UTR, and 3′-supplementary pairing[7]. For each site type, TA and/or SPS robustly contributed (Supplementary Table 4). The scores generated by these models were called context+ scores, because they consider site type and context plus sRNA proficiency. We then generated the total context+ score for each mRNA with 3′UTR sites, relying on the observation that multiple sites typically act independently with respect to each other[7]. The predictive value of the new model was tested using data from array datasets not used to train the model, comparing the performance of the predicted targets ranked using the total context+ scores to those ranked using scores of the original model. To examine if any improvement over the original model was from training the model with multiple linear regression rather than simple linear regression, we also used multiple linear regression to build a model that considered only the three parameters used to build the original model(context-only scores, Supplementary Table 5). For each model, predicted targets with 7–8-nucleotide sites were ranked by score and assigned to 10 bins. The context+ scores performed better than the old context scores at predicting the response to the sRNAs (Fig. 4a), yielding significantly stronger mean repression for the top two bins (P = 5 × 10−56 and 3 × 10−8 for bins 1 and 2, respectively) and significantly weaker repression in the bottom four bins (P = 6 × 10−10, 1.5 × 10−5, 1 × 10−7, and 3 × 10−4 for bins 7, 8, 9, and 10, respectively, Wilcoxon’s rank sum test). Improved specificity was also illustrated in ROC curves (Supplementary Fig. 4a).

Figure 4

Predictive performance of the context+ model, which considers miRNA or siRNA proficiency in addition to site context. (a) Improved predictions for mRNAs with canonical 7–8-nucleotide 3′UTR sites. Predicted interactions between mRNAs and cognate sRNA were distributed into 10 equally populated bins based on total context scores generated using the model indicated (key), with the first bin comprising interactions with the most favorable scores. Plotted for each bin is the mean mRNA change on the arrays (error bars, 95% confidence intervals). (b) Prediction of responsive interactions involving mRNAs with only 3′UTR 6mer sites. Otherwise, as in a. (c) Prediction of responsive interactions involving mRNAs with at least one 8mer ORF site but no 3′UTR sites. Otherwise, as in a. (d) Impact of TA and SPS on siRNA-directed knock-down of the desired target. Efficacy in luciferase activity knock-down is plotted for 2,431 siRNAs transfected into H1299 cells[38]. Efficacy is linearly scaled (key), with positive and negative controls having values of 0.900 and 0.354, respectively[38].

Because most 6mer sites and ORF sites are either nonresponsive or only marginally responsive to the miRNA, algorithms that achieve useful prediction specificity do so at the expense of ignoring these sites[3]. Having found that low TA and strong SPS correlated with substantially greater efficacy of these marginal sites (Fig. 3c, d), we extended the context+ scores to 6mer sites. For the context+ model, the top bin of mRNAs with 6mer 3′UTR sites but no larger sites (Fig. 4b) had average repression resembling that observed for the third bin of mRNAs with 7–8-nucleotide 3′UTR sites (Fig. 4a; ROC curves, Supplementary Fig. 4b). Context-only and context+ scores were also generated for ORF sites, changing only the parameter of site location, which was not applicable for ORF sites because it accounts for the lower efficacy of sites near the middle of long 3′UTRs[7]. In ORFs, we found that sites further from the stop codon tended to be less effective, and thus the distance from the stop codon (linearly scaled distance of 0 to ≥1500 nt) was included as a parameter. Although this context+ model was not substantially better than the context-only model for ORF sites (perhaps because data from only 11 miRNAs were used in the regression), both models had predictive value. When comparing mRNAs with at least one 8mer ORF site (Fig. 4c), those ranked in the top bin had average repression resembling that observed for the second or third bins of mRNAs with 7–8-nucleotide 3′UTR sites (Fig. 4a). Overall, our findings showed that taking TA and SPS into account could significantly improve miRNA target prediction when pooling results from multiple sRNAs. Training on the 11 miRNA transfection datasets that had been used for the original context scores was appropriate for demonstrating the improvement that could be achieved by taking TA and SPS into account. We reasoned, however, that training on the 74 filtered datasets could generate a more precise context+ model to be used to quantitatively predict repression. As expected, correlations for all four parameters had even greater statistical significance when training the model on more data (Supplementary Table 6). Although an SVM (support vector machine) approach should in principle yield even better results by capturing effects lost in multiple linear regression due to multicollinearity, enhanced performance was not observed with SVM (Supplementary Fig. 4c–e). Therefore, we used multiple linear regression because it enabled more convenient calculation of context+ scores (Supplementary Fig. 5a). We will use these new scores in version 6.0 of TargetScan (targetscan.org).

Additional Considerations

A caveat of the reporter experiments was that miRNA sequence changes designed to alter TA or SPS might have inadvertently influenced other factors, such as miRNA stability or its loading into the silencing complex. However, our computational analyses of 102 array datasets also showed that TA and SPS each independently influence targeting efficacy. Therefore, if differences in sRNA stability or loading have confounded interpretation of our results, these differences must correlate with either predicted SPS or TA. Analysis of published miRNA over-expression data countered this possibility, revealing no correlation between miRNA accumulation and predicted SPS or TA (Supplementary Fig. 3c,d). Furthermore, experiments examining the RNAs co-purifying with AGO2 indicated that the difference in proficiency observed between lsy-6 and miR-142lsy-6 was not merely attributable to less accumulation of lsy-6 in the silencing complex (Supplementary Fig. 1m–s).

Discussion

The correlation between strong SPS and low TA confounded previous efforts to examine the influence of these parameters on targeting efficacy, with one study implicating SPS and not TA[24], and others implicating TA and not SPS[27,28]. Our results indicated that both parameters influence efficacy and solved one of the mysteries in miRNA targeting: the failure of lsy-6 to repress all but one of the 14 examined seed-matched mRNAs. Previous solutions hypothesized that the seed-based targeting model is unreliable[23], or that sites of the 13 non-responsive mRNAs fall in inaccessible UTR structure[18]. Our work shows that the actual solution is the unusually weak SPS and high TA of the lsy-6 miRNA. Changing these parameters to resemble those of more typical miRNAs imparted typical seed-based targeting proficiency, even though the sites were in their original UTR contexts, thereby demonstrating that neither the reliability of seed-based targeting nor the accessibility of the sites were at issue. MicroRNAs with unusually weak predicted SPS and unusually high TA, such as miR-23 and lsy-6, appear to have relatively few targets. Indeed, lsy-6 might have only a single biological target, the cog-1 mRNA—an extreme exception to the well-supported finding that metazoan miRNAs generally have dozens if not hundreds of preferentially conserved targets[4,8,35,36]. Solving the mystery of why so few mRNAs respond to lsy-6 brings to the fore a second mystery, still unsolved: How is the cog-1 3′UTR so efficiently recognized and repressed by a miRNA with such weak targeting proficiency? This UTR has two 8mer sites, which by virtue of their conservation make cog-1 the top predicted target of lsy-6 (ref. [3]), but this is only part of the answer[37]. Improving the context-score model to take into account the differential SPS and TA of different miRNAs will help focus attention on the predicted targets of miRNAs with more typical proficiencies, but leaves unsolved the problem of how to predict the few biological sites of the less proficient miRNAs without recourse to considering site conservation. MicroRNAs with very high TA, such as lsy-6 or miR-23, and those with very low TA, such as miR-100 or miR-126, two broadly conserved vertebrate miRNAs containing CG dinucleotides in their seeds (Supplementary Table 7), appear to represent two strategies for targeting very few genes, accomplished at opposite ends of the TA spectrum. For miRNAs with very high TA, other UTR features flanking the seed sites are required for regulation, as illustrated for lsy-6 regulation of cog-1 (ref. [37]), whereas miRNAs with very low TA simply have far fewer potential target sites to begin with. Our results also have implications for how siRNA might be designed to reduce off-targets. Previous studies have proposed that off-targets could be reduced by designing siRNAs with low TA[27] or weak SPS[24], and our results implied that off-targets could be largely eliminated by designing siRNAs with both high TA and weak SPS. One concern, though, is that such siRNAs might also be ineffective at recognizing the desired mRNA target because pairing to this target would nucleate on a match with weak SPS and might be titrated by the many other mRNAs with seed matches. To investigate this concern, we examined a published dataset of high-throughput luciferase assays reporting the response to 2,431 different siRNAs[38]. siRNAs with weak predicted SPS knocked down the desired target more effectively than did those with strong predicted SPS (Fig. 4d; P = <10−100, Pearson’s correlation test), presumably because of preferential loading into the silencing complex[39,40]. Moreover, high TA did not compromise the desired targeting efficacy, even after correcting for the cross-correlation between TA and SPS (P= 0.16, Pearson’s correlation test ). Therefore, designing siRNAs with high TA and weak SPS should minimize off-target effects without compromising knockdown of the desired target. Highly expressed mRNAs tend to be evolutionarily depleted in sites for co-expressed miRNAs, a phenomenon partly attributed to the possibility that these mRNAs might otherwise titrate the miRNAs from their intended targets[12,41,42]. Titration can also provide a useful mechanism for cells to regulate miRNA activity, as illustrated by IPS1 titration of miR-399 in Arabidopsis[25]. Beneficial titration has even been proposed to explain why so many miRNA sites are conserved[43]. However, because most preferentially conserved sites fall in lowly-to-moderately expressed mRNAs, and because these sites each comprise only a tiny fraction of the TA, each could impart at most a correspondingly tiny effect on the effective miRNA concentration—much less than that required to selectively retain the site. A though titration functions cannot explain most site conservation, TA could be dynamic during development, with interesting consequences. For example, the increase of a miRNA during development will often be accompanied by a decrease in its transcriptome TA, a consequence of the evolutionary depletion of sites in mRNAs co-expressed at high levels with the miRNA[12,42]. This accompanying TA decrease would sharpen the transition between the non-repressed and repressed states of targets. When predicting SPS we used parameters derived from model RNA duplexes, which presumably underestimated the actual affinity of RNA segments pairing to Argonaute-bound seed regions[2,3,44,45]. The extent to which Argonaute enhances affinity might vary for different seed sequences. These potential differences, however, did not obscure our detection of an influence of SPS on targeting proficiency. Thus, our study provided a lower bound on the actual influence of SPS, as well as an approach for learning its full magnitude once accurate SPSs of Argonaute-bound complexes are known.

Methods

Reporter assays

For lsy-6 reporter assays, HeLa cells were plated in 24-well plates at 5 × 104 cells per well. After 24 hours, each well was transfected with 20 ng TK-Renilla-luciferase reporter (pIS1)[46], 20 ng firefly-luciferase control reporter (pIS0)[46], and 25 nM miRNA duplex (Dharmacon) (Supplementary Fig. 1a), using Lipofectamine 2000 (Invitrogen). For miR-23 reporter assays, conditions were the same except for transfected DNA: 10 ng SV40-Renilla-luciferase reporter (pIS2)[46], 25 ng firefly-luciferase control reporter (pIS0), 1.25 ug pUC19 carrier DNA. Luciferase activities were measured 24 hours after transfection with the Dual-Luciferase Assay (Promega) and a Veritas microplate luminometer (Turner BioSystems). For every construct assayed, four independent experiments, each with three biological replicates, were performed. To control for transfection efficiency, firefly activity was divided by Renilla activity. Renilla values for constructs with sites matching the cognate miRNA were then normalized to the geometric mean of values for otherwise identical constructs in which the sites were mutated. To control for differences not attributable to the cognate miRNA, the ratios were further normalized to ratios for the same constructs tested with a non-cognate miRNA, miR-1. These double-normalized results are presented in the main figures; singly normalized results are presented in Supplementary Figures 1h–l and 2d–f.

Constructs

3′UTRs of lsy-6 predicted targets[23] were subcloned into XbaI and EagI sites in pIS1, and 3′UTRs of miR-23 predicated targets were cloned into SacI and SpeI sites in pIS2 after amplification (UTR sequences, Supplementary Table 1). Mutations were introduced using Quikchange (Stratagene) and confirmed by sequencing.

Predicted SPS

SPS was predicted using nearest-neighbor thermodynamic parameters, including the penalty for terminal A:U pairs[32]. The contribution of the A at position 1 of 8mer and 7mer-A1 sites was not included because this A does not pair to the miRNA[4] and thus its contribution is not expected to differ in a predictable way for different miRNAs. When performing linear regression analyses, the predicted SPS of positions 2–8 was used for 8mer and 7mer-m8 sites, and the predicted SPS of positions 2–7 was used for 7mer-A1 and 6mer sites. When assigning a single value for 7–8-nucleotide sites (7mer-A1, 7mer-m8, and 8mer), a mean weighted value of the three site types was used. This mean SPS was calculated as [(6mer SPS)(7mer-A1 TA) + (7mer-m8 SPS)(7mer-m8 TA + 8mer TA)] ÷ (7mer-A1 TA + 7mer-m8 TA + 8mer TA).

Reference mRNAs

To generate a list of unique mRNAs, human full-length mRNAs obtained from RefSeq[47] and H-Invitational[48] databases were aligned to the human genome[49] (hg18) using BLAT[50] software and processed as described to represent each gene by the mRNA isoform with the longest UTR[30]. These unique full-length mRNAs, which were each represented by the genomic sequence of their exons (since the genomic sequence was of higher quality than the mRNA sequence), were the “reference mRNAs” (Supplementary Table 8). Mouse full-length mRNAs were obtained from RefSeq[47] and FANTOM DB[51] databases, aligned against the mouse genome[52] (mm9), and processed similarly. For C. elegans and D. melanogaster, we obtained 3′UTR sequences from TargetScan (targetscan.org)[22,53]. Mature miRNA sequences were downloaded from the miRBase web site[54].

Microarray processing and mapping to reference mRNAs

We collected published datasets reporting the response of HeLa mRNAs 24 hours after 100 nM sRNA transfection using Agilent arrays (two-color platform), excluding datasets for which either multiple sRNAs were simultaneously transfected or the transfected RNAs contained chemically modified nucleotides (Supplementary Table 2). If probe sequences for an array platform were available, they were mapped to genomic locations in the human genome using BLAT[50] software. For some arrays (e.g., GSE8501), probe sequences were unavailable, but associated cDNA or EST sequence IDs were available. In such cases, genomic coordinates of cDNAs and ESTs obtained from the UCSC Genome Browser[55] were used as if they were coordinates of array probes. Each probe and its associated mRNA fold-change value were mapped to the reference mRNA sharing the greatest overlap with the probe’s genomic coordinates, ≥15 bases. When multiple probes were mapped to a single reference mRNA, the median fold change was used. To avoid analysis of mRNAs not expressed in HeLa cells, only mRNAs with signal above the median in the mock-transfection samples were considered. For each array, the median fold change of reference mRNAs without any 6–8-nucleotide site was used to normalize the fold changes of all reference mRNAs. To correct for the global association between mRNA fold change and AU content of the mRNA transcript, the LOWESS filtering was applied by using malowess() function within MATLAB (Supplementary Table 9). For some arrays, the transfected sRNA is designed to target nearly perfectly matching ( ≥18 nucleotides) mRNAs, in which case, these intended targets were excluded from analysis.

Motif-enrichment analysis for array filtering

To evaluate array datasets, we performed motif-enrichment analysis using the Fisher’s exact test for a 2×2 contingency table, populated based on whether the reference mRNA had a 7mer motif for the cognate sRNA in its 3′UTR and whether it was among the top 5% most down-regulated mRNAs. If multiple arrays examined the effects of transfecting sRNAs with identical seed regions (positions 2–8), the P value of the Fisher’s exact test for site enrichment (considering either of the two 7mer sites and picking the one with the lower P value) was assessed for each array, and the array with the median P value was chosen to represent that seed region, yielding 102 representative arrays (Supplementary Table 2). To obtain a filtered dataset, this test was reiterated for the 16,384 7mers, and arrays were retained if the motif most significantly associated with down-regulation was the 7mer-m8 or 7mer-A1 site of the transfected sRNA; 74 arrays passed this filter (Supplementary Table 2). Results of multiple linear regression and other analyses were robust to cutoff choice (other cutoffs tested, 10, 15, and 20%; data not shown).

TA

TA in the human transcriptome was calculated as the number of non-overlapping 3′UTR 8mer, 7mer-m8, and 7mer-A1 sites in the reference mRNAs. An analogous process was used to calculate TA in mouse, C. elegans, and D. melanogaster. To calculate TAHeLa, each site was weighted based on mRNA-Seq data[33]. Predicted SPS and TA values for all heptamers in C. elegans, human and HeLa, mouse, and D. melanogaster are provided in Supplementary Table 10.

miRNA target prediction and analysis of siRNA efficacy

Context scores were calculated for the cognate sites of the reference mRNAs using the simple linear-regression parameters reported previously[7]. Prior to fitting, scores for each parameter were scaled from 0 to 1 (Supplementary Fig. 5b). To account for site type without the complication of multiple sites, models were developed for each type individually, using mRNAs with only a single site to the cognate miRNA (Supplementary Fig. 5c). The multiple linear regression models for context-only and context+ were computed by using lm() function in the R package version 2.11.1

55 in total

1. Introducing RefSeq and LocusLink: curated human genome resources at the NCBI.

Authors: K D Pruitt; K S Katz; H Sicotte; D R Maglott
Journal: Trends Genet Date: 2000-01 Impact factor: 11.639

2. Initial sequencing and analysis of the human genome.

Authors: E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal: Nature Date: 2001-02-15 Impact factor: 49.962

Review 3. DNA methylation patterns and epigenetic memory.

Authors: Adrian Bird
Journal: Genes Dev Date: 2002-01-01 Impact factor: 11.361

4. BLAT--the BLAST-like alignment tool.

Authors: W James Kent
Journal: Genome Res Date: 2002-04 Impact factor: 9.043

5. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs.

Authors: Y Okazaki; M Furuno; T Kasukawa; J Adachi; H Bono; S Kondo; I Nikaido; N Osato; R Saito; H Suzuki; I Yamanaka; H Kiyosawa; K Yagi; Y Tomaru; Y Hasegawa; A Nogami; C Schönbach; T Gojobori; R Baldarelli; D P Hill; C Bult; D A Hume; J Quackenbush; L M Schriml; A Kanapin; H Matsuda; S Batalov; K W Beisel; J A Blake; D Bradt; V Brusic; C Chothia; L E Corbani; S Cousins; E Dalla; T A Dragani; C F Fletcher; A Forrest; K S Frazer; T Gaasterland; M Gariboldi; C Gissi; A Godzik; J Gough; S Grimmond; S Gustincich; N Hirokawa; I J Jackson; E D Jarvis; A Kanai; H Kawaji; Y Kawasawa; R M Kedzierski; B L King; A Konagaya; I V Kurochkin; Y Lee; B Lenhard; P A Lyons; D R Maglott; L Maltais; L Marchionni; L McKenzie; H Miki; T Nagashima; K Numata; T Okido; W J Pavan; G Pertea; G Pesole; N Petrovsky; R Pillai; J U Pontius; D Qi; S Ramachandran; T Ravasi; J C Reed; D J Reed; J Reid; B Z Ring; M Ringwald; A Sandelin; C Schneider; C A M Semple; M Setou; K Shimada; R Sultana; Y Takenaka; M S Taylor; R D Teasdale; M Tomita; R Verardo; L Wagner; C Wahlestedt; Y Wang; Y Watanabe; C Wells; L G Wilming; A Wynshaw-Boris; M Yanagisawa; I Yang; L Yang; Z Yuan; M Zavolan; Y Zhu; A Zimmer; P Carninci; N Hayatsu; T Hirozane-Kishikawa; H Konno; M Nakamura; N Sakazume; K Sato; T Shiraki; K Waki; J Kawai; K Aizawa; T Arakawa; S Fukuda; A Hara; W Hashizume; K Imotani; Y Ishii; M Itoh; I Kagawa; A Miyazaki; K Sakai; D Sasaki; K Shibata; A Shinagawa; A Yasunishi; M Yoshino; R Waterston; E S Lander; J Rogers; E Birney; Y Hayashizaki
Journal: Nature Date: 2002-12-05 Impact factor: 49.962

6. Initial sequencing and comparative analysis of the mouse genome.

Authors: Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander
Journal: Nature Date: 2002-12-05 Impact factor: 49.962

7. Asymmetry in the assembly of the RNAi enzyme complex.

Authors: Dianne S Schwarz; György Hutvágner; Tingting Du; Zuoshang Xu; Neil Aronin; Phillip D Zamore
Journal: Cell Date: 2003-10-17 Impact factor: 41.582

8. Functional siRNAs and miRNAs exhibit strand bias.

Authors: Anastasia Khvorova; Angela Reynolds; Sumedha D Jayasena
Journal: Cell Date: 2003-10-17 Impact factor: 41.582

Review 9. MicroRNAs: genomics, biogenesis, mechanism, and function.

Authors: David P Bartel
Journal: Cell Date: 2004-01-23 Impact factor: 41.582

10. Prediction of mammalian microRNA targets.

Authors: Benjamin P Lewis; I-hung Shih; Matthew W Jones-Rhoades; David P Bartel; Christopher B Burge
Journal: Cell Date: 2003-12-26 Impact factor: 41.582

482 in total

1. A genome-wide transgenic resource for conditional expression of Drosophila microRNAs.

Authors: Fernando Bejarano; Diane Bortolamiol-Becet; Qi Dai; Kailiang Sun; Abil Saj; Yu-Ting Chou; David R Raleigh; Kevin Kim; Jian-Quan Ni; Hong Duan; Jr-Shiuan Yang; Tudor A Fulga; David Van Vactor; Norbert Perrimon; Eric C Lai
Journal: Development Date: 2012-06-28 Impact factor: 6.868

2. Computational functional genomics based analysis of pain-relevant micro-RNAs.

Authors: Jörn Lötsch; Ellen Niederberger; Alfred Ultsch
Journal: Hum Genet Date: 2015-09-18 Impact factor: 4.132

3. microRNAs Modulate Spatial Memory in the Hippocampus and in the Ventral Striatum in a Region-Specific Manner.

Authors: F Capitano; J Camon; V Ferretti; V Licursi; F De Vito; A Rinaldi; S Vincenti; C Mannironi; P Fragapane; I Bozzoni; A Oliverio; R Negri; C Presutti; Andrea Mele
Journal: Mol Neurobiol Date: 2015-08-26 Impact factor: 5.590

4. Independent regulation of vertebral number and vertebral identity by microRNA-196 paralogs.

Authors: Siew Fen Lisa Wong; Vikram Agarwal; Jennifer H Mansfield; Nicolas Denans; Matthew G Schwartz; Haydn M Prosser; Olivier Pourquié; David P Bartel; Clifford J Tabin; Edwina McGlinn
Journal: Proc Natl Acad Sci U S A Date: 2015-08-17 Impact factor: 11.205

5. MicroRNA-27a decreases the level and efficiency of the LDL receptor and contributes to the dysregulation of cholesterol homeostasis.

Authors: M Lucrecia Alvarez; Mahdieh Khosroheidari; Elena Eddy; Stefania C Done
Journal: Atherosclerosis Date: 2015-08-20 Impact factor: 5.162

6. Prenatal arsenic exposure and the epigenome: altered microRNAs associated with innate and adaptive immune signaling in newborn cord blood.

Authors: Julia E Rager; Kathryn A Bailey; Lisa Smeester; Sloane K Miller; Joel S Parker; Jessica E Laine; Zuzana Drobná; Jenna Currier; Christelle Douillet; Andrew F Olshan; Marisela Rubio-Andrade; Miroslav Stýblo; Gonzalo García-Vargas; Rebecca C Fry
Journal: Environ Mol Mutagen Date: 2013-12-10 Impact factor: 3.216

7. Post-transcriptional regulation of human breast cancer cell proteome by unliganded estrogen receptor β via microRNAs.

Authors: Giovanni Nassa; Roberta Tarallo; Giorgio Giurato; Maria Rosaria De Filippo; Maria Ravo; Francesca Rizzo; Claudia Stellato; Concetta Ambrosino; Marc Baumann; Niina Lietzèn; Tuula A Nyman; Alessandro Weisz
Journal: Mol Cell Proteomics Date: 2014-02-13 Impact factor: 5.911

8. The effects of a MAP2K5 microRNA target site SNP on risk for anxiety and depressive disorders.

Authors: Kevin P Jensen; Henry R Kranzler; Murray B Stein; Joel Gelernter
Journal: Am J Med Genet B Neuropsychiatr Genet Date: 2014-01-16 Impact factor: 3.568

9. Non-inhibited miRNAs shape the cellular response to anti-miR.

Authors: John R Androsavich; B Nelson Chau
Journal: Nucleic Acids Res Date: 2014-05-07 Impact factor: 16.971

10. Epstein-Barr virus microRNAs reduce immune surveillance by virus-specific CD8+ T cells.

Authors: Manuel Albanese; Takanobu Tagawa; Mickaël Bouvet; Liridona Maliqi; Dominik Lutter; Jonathan Hoser; Maximilian Hastreiter; Mitch Hayes; Bill Sugden; Larissa Martin; Andreas Moosmann; Wolfgang Hammerschmidt
Journal: Proc Natl Acad Sci U S A Date: 2016-10-03 Impact factor: 11.205