Literature DB >> 22140118

SeqTar: an effective method for identifying microRNA guided cleavage sites from degradome of polyadenylated transcripts in plants.

Yun Zheng1, Yong-Fang Li, Ramanjulu Sunkar, Weixiong Zhang.   

Abstract

In plants, microRNAs (miRNAs) regulate their mRNA targets by precisely guiding cleavages between the 10th and 11th nucleotides in the complementary regions. High-throughput sequencing-based methods, such as PARE or degradome profiling coupled with a computational analysis of the sequencing data, have recently been developed for identifying miRNA targets on a genome-wide scale. The existing algorithms limit the number of mismatches between a miRNA and its targets and strictly do not allow a mismatch or G:U Wobble pair at the position 10 or 11. However, evidences from recent studies suggest that cleavable targets with more mismatches exist indicating that a relaxed criterion can find additional miRNA targets. In order to identify targets including the ones with weak complementarities from degradome data, we developed a computational method called SeqTar that allows more mismatches and critically mismatch or G:U pair at the position 10 or 11. Precisely, two statistics were introduced in SeqTar, one to measure the alignment between miRNA and its target and the other to quantify the abundance of reads at the center of the miRNA complementary site. By applying SeqTar to publicly available degradome data sets from Arabidopsis and rice, we identified a substantial number of novel targets for conserved and non-conserved miRNAs in addition to the reported ones. Furthermore, using RLM 5'-RACE assay, we experimentally verified 12 of the novel miRNA targets (6 each in Arabidopsis and rice), of which some have more than 4 mismatches and have mismatches or G:U pairs at the position 10 or 11 in the miRNA complementary sites. Thus, SeqTar is an effective method for identifying miRNA targets in plants using degradome data sets.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22140118      PMCID: PMC3287166          DOI: 10.1093/nar/gkr1092

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

MicroRNAs (miRNAs) are non-coding RNAs that regulate the expression of protein-coding genes mainly at the post-transcriptional level in plants and animals (1). In plants, miRNAs are known to induce cleavages of their mRNA targets between the 10th and 11th nucleotides within nearly perfect complementary sites (2,3). This nearly perfect complementarity has extensively been used to predict miRNA targets in plants (2,4–13). However, such sequence complementarity-based methods often produce a large number of false positive predictions, which makes it costly to experimentally validate, e.g. using modified 5′-RACE assay (14). With the advance of next-generation sequencing technologies, a genome-wide strategy, namely the degradome or PARE (14,15), has been developed to directly profile the mRNA cleavage products induced by small regulatory RNAs, shorthanded as sRNAs that include miRNAs and small interfering RNAs (siRNAs). In this method, the 5′-ends of polyadenylated products of sRNA-mediated mRNA decay are sequenced and subsequently aligned to the cDNA sequences to detect mRNA cleavage sites and quantify the abundance of cleavage products to determine the effects of sRNA-guided gene expression regulation. Currently, CleaveLand (16) is the only publicly available computational method for identifying plant miRNA targets from degradome data (15,17–22). Cleaveland scores sRNA complementary sites based on a mismatch-based scoring scheme (4,6), i.e. (i) a mismatch in an sRNA complementary sites is given a score of 1 and a G:U pair is given a score of 0.5; (ii) a mismatch or a G:U pairs in the core region from 2 to 13 nt receives a double score (6,15); (iii) neither mismatch nor G:U pair at positions 10 and 11 in a complementary site is allowed (7). Generally, sRNA complementary sites with scores of ≤4 were used in identifying miRNA targets (6,15). In sharp contrast to this restrictive scheme, some miRNA complementary sites with scores of ≥4 can also guide the cleavage of their target transcripts. For instance, ath-miR390 is able to guide the cleavage at its 3′ complementary site of TAS3b transcript despite having a score of 7 (corresponding to 6.5 mismatches) (9,23); ath-miR159a can induce the cleavage of AT5G18100 although their complementary site has a score of 6.5 (corresponding to 4.5 mismatches) (14); miR398-guided cleavage of CCS1 is detected despite having a score of 6 (corresponding to 5.5 mismatches) (19); miR167 can lead to the cleavage of Os06g03830 despite having a mismatch at position 11 (19); and ath-miR173 can lead to the cleavage of AT1G50055 even the position 10 of their binding site is a mismatch (6). These observations suggest that the criteria adopted in CleaveLand are too stringent and omit many genuine targets, and relaxation of current criteria can identify additional novel targets for miRNAs from the degradomes. In order to fully utilize the large amount of degradome data for identifying miRNA targets particularly those with more mismatches, we developed a novel method called SeqTar (SEQuencing-based sRNA TARget prediction). To reduce the false positive predictions when allowing more mismatches, two P-values were introduced in the method to control the qualities of its predictions. Particularly, the number of mismatches in an sRNA complementary site is assigned a P-value, P, based on the shuffled sRNA sequences against randomly chosen target sequences, and the number of reads accumulated at the central region of the sRNA complementary site, the 9–11th nt from the 5′-end of miRNA, is given another P-value, P, by a Binomial-test. The reads mapped to the 9–11th nt are named as valid reads. On two degradome data sets from Arabidopsis (14) and one from rice (19), SeqTar identified 231 and 268 novel sRNA:target pairs with less than 3.5 mismatches and with at least 5 valid reads, respectively. Among these pairs, 103 and 92 sRNA:target pairs have significant numbers of valid reads with P < 10−5 in Arabidopsis and rice, respectively. Using a modified 5′-RACE (see ‘Materials and Methods’ section), we experimentally validated six sRNA targets each for Arabidopsis and rice, respectively. Most of these 12 sRNA:target pairs have more than 4 mismatches. More importantly, some of these verified miRNA:target pairs have mismatches or G:U pairs at positions 10 or 11. Furthermore, we identified thousands of sRNA:target pairs that showed strong accumulations of reads in the central regions (P < 10−5) but had more than three mismatches in both Arabidopsis and rice. These results demonstrated that SeqTar is an effective method for finding sRNA targets from plant degradome. Our analysis also revealed that more transcripts are cleaved by sRNA guided RISC in both Arabidopsis and rice than previously reported.

MATERIALS AND METHODS

Degradome and sequence data sets used

The two Arabidopsis degradome data sets (GSM280226, denoted as WT, and GSM280227, named as xrn4) (14) and one rice degradome data set (GSE17398, called as osa) (19) were downloaded from the NCBI GEO database. Two other studies (18,20) also generated degradome data from rice but both of them produced substantially less reads than the data set of Li et al. (19). Thus, the rice degradome of Li et al. (19) was chosen for analysis. The cDNA sequences of Arabidopsis and rice were downloaded from the TAIR database (r9, http://www.tair.org) and the Rice Genome Annotation Project (r6.1, http://rice.plantbiology.msu.edu/), respectively. The sequences of TAS3a/b/c of rice were retrieved from the NCBI EST database, under the accession numbers EU293144, AU100890 and CA765877 (19), respectively. The sequences of mature miRNAs were obtained from the miRBase (24) (version 16, http://www.mirbase.org/) and the unique miRNA sequences were used in the analysis. TasiRNAs of Arabidopsis TAS1 to TAS4 were collected from the Arabidopsis Small RNA Project Database (http://asrp.cgrb.oregonstate.edu). Some Arabidopsis small RNAs derived from PPR genes [reported in (15)] were also used in this study. The rice tasiRNAs were obtained from (19). All small RNA sequences used were provided in Supplementary Table S12.

Sequence alignment

SeqTar used a modified Smith–Waterman algorithm to align an sRNA to a target sequence. Briefly, instead of performing alignments with matched nucleotides, e.g. A-A and C-C, SeqTar found complementary nucleotides, i.e. G-C, A-U and G-U Wobble pairs that had rewards of +6, +4 and +2, respectively, in alignment. The affine gap penalty, i.e. the penalty increasing linearly with the length of gap after the initial gap opening penalty, was used for gap opening (−8) and gap extension (−4). The algorithm gave a penalty of −3 to a known mismatch and a penalty of −1 to a mismatch of unspecified nucleotides (i.e. ‘N’) in mRNAs. SeqTar next used shuffled sRNA sequences to evaluate predicted sRNA complementary sites, which was a standard way to evaluate predicted binding sites of plant sRNAs (2,4). One hundred dinucleotide shuffled sRNAs were generated for a given sRNA sequence. Each of these shuffled sRNAs was used to predict complementary sites on one target sequence randomly chosen from the pool of all target sequences. Finally, the number of mismatches of these 100 sRNA:target pairs were used to evaluate the P-values of the mismatches, P, of the mismatches of sRNA's complementary sites, m, by assuming a Student's t-distribution.

Reads distributions

The unique sequences of a degradome data set were aligned to the transcript (cDNA) sequences with the BLASTN program. Then, the abundance of a matched locus was obtained by averaging the number of a unique sequence to the number of its perfectly matched loci in all transcript sequences. Initially, SeqTar scanned the BLASTN results to obtain the normalized abundance in each position on a transcript. Then, SeqTar calculated the accumulation of reads in the central region of an sRNA complementary site, i.e. reads starting at positions opposite to 9–11 nt region from 5′-end of sRNA. Although major cleavages often took place between the 10th and 11th nt, minor cleavages between 9th and 10th or 11th and 12th nt had also been reported (6,11,25). Among the reads mapped to different positions on the target transcript, some reads could have been generated by sRNA-guided cleavage events and were named as valid reads, v. Thus, it was assumed that the degradation products of a target followed a Binomial distribution, where the reads mapped to the central region of an sRNA complementary site were treated as preferred (positive) samples and other reads as control (negative) ones. The probability of valid reads, P, was calculated by Equation 1. where x = max(n9, n10, n11), n9–n11 were the number of reads mapped to the positions opposite to the 9–11th nt of the sRNA, respectively, n was the total number of reads that were mapped to the whole target sequence, and q was a constant that stands for the probability that a mapped read was from any nucleotide of the target sequence. If no sRNA was involved in the degradation of a target, there was no reason to assume that one position would be more likely to break down than other positions. Therefore, each position of the target sequence was assumed to have the same probability to produce a degradation product by assuming a Uniform distribution on the degradation products of a transcript. Therefore, q in Equation 1 was assigned a value of 1/(l − (r − 1)), where l was the length of the target sequence and r was the length of a degradome read, since the last r − 1 position of the target sequence could not be detected with the sequencing reads. In current implementation of SeqTar, P < 10−300 were regarded as 0. It was important to note that although the valid reads, v, were all the reads mapped to the 9–11th positions, P was calculated from the largest number of reads of these three positions. This was because P was used to evaluate whether the major cleavage position was preferred by the sRNA-guided RISC complex.

The computational steps and outputs of SeqTar

The major steps of SeqTar were shown in Supplementary Methods. All computational steps of SeqTar had been integrated into a whole script whose major steps including SeqTar were implemented with the Java programming language. SeqTar had been used in the Linux operating system and was available for non-commercial purposes upon request. SeqTar produced six output files: the first listed the sRNA:target pairs; the second showed the alignments of sRNA complementary sites; the third provided the MatLab scripts for generating the T-plots of target mRNAs; the fourth gave the number of reads perfectly mapped to target mRNAs; the fifth listed the scores of shuffled sRNAs used to evaluate the P values; and the last provided the potential novel sRNA candidates. As suggested by German et al. (14), SeqTar predicted a potential sRNA if an accumulation of reads was found at a specific position, named as a peak, on a target but no input sRNAs contributed to this accumulation. Additional details of outputs were given in the Supplementary Methods. The first file consisted of 33 columns to show the information of a miRNA:target pair, such as the number of valid reads, the P-value of valid reads P, the number of mismatches, the P-value of mismatches P and the percentage of valid reads. A detailed description of these columns were also given in Supplementary Methods.

Performance evaluation

To evaluate the performance of SeqTar, we compared its prediction results with that reported in the literature. The verified or predicted Arabidopsis sRNA targets (2,4,6,7,9,14,15,26–29) were combined and duplicate pairs were removed and a resulting list of 428 sRNA:target pairs were obtained for Arabidopsis (Supplementary Table S1). A total of 230 of these 428 pairs were validated targets of 28 conserved sRNA families and summarized in Table 1. Similarly, 458 sRNA:target pairs of rice (Supplementary Table S2) were obtained from the reported results (18–20,28,30–38). Of these, 123 targets of 21 conserved sRNA families were previously validated and summarized in Table 1. We also compared the SeqTar's results with those of the CleaveLand pipeline (16) reported recently in the starBase (39).
Table 1.

The conserved miRNA targets of A. thaliana and O. sativa

miR familyTarget familyA.t.WTWT Newxrn4xrn4 NewO.s.osaosa New
miR156/157SBP11110111(1)10100
miR159/319MYB77(5)3(3)7(5)4(4)223
miR159/319TCP551(1)51(1)44(2)0
miR160ARF33030441
miR161PPR4040(25)46(40)40(25)90(83)000
miR162DCL11010110
miR163SAMT66(6)4(4)6(2)6(5)000
miR164NAC77(1)4(3)7(1)6(4)66(1)18(14)
miR165/166HD-Zip66161440
miR167ARF221(1)23(3)442
miR168Argonaute11010660
miR169HAP2773(2)73(3)880
miR170/171SCL44(1)14(1)1(1)55(2)0
miR172AP2664(4)63(3)55(1)4(3)
miR173TAS1/244040000
miR390/391TAS333030330
miR393F-Box55050224(1)
miR394F-Box1111(11)111(11)113
miR395APS33(1)03(1)0110
miR395SO2 Transp.111(1)11(1)33(1)0
miR396GRF771711212(2)0
miR397Laccase333(3)34(4)1616(14)4(4)
miR398CSD22020221
miR398CCS111010100
miR399PO4 Transp.11(1)6(6)1(1)3(3)44(4)7(7)
miR399E2-UBC1116(16)113(12)113
miR400PPR3939(32)48(43)39(33)46(42)000
miR403Argonaute212(2)12(2)000
miR408Plantacyanin3303077(2)8(5)
miR408Laccase33(3)03(3)022(2)1(1)
miR444MADS-box000004416(14)
miR4472-PGK22(2)02(2)0000
miR858MYB5536(26)5(1)56(45)000
miR859F-Box3531(28)72(68)31(30)72(72)000
TAS3-siRARF33030550
Total230225(105)264(234)225(105)328(300)123122(31)75(49)

The A.t. and O.s. columns list the number of targets of A. thaliana and O. sativa that were reported in literature, respectively. The WT, xrn4 and osa columns list the number of targets in the A.t. and O.s. column that are predicted by SeqTar in the three data sets, respectively. The WT New, xrn4 New and osa New columns list the number of targets that belong to the same family and are newly predicted by SeqTar. The numbers in parentheses are the number of targets whose miRNA complementary sites are predicted but these miRNA complementary sites have no valid reads. A potential target is counted if it is targeted by at least one member of the miRNA family.

The conserved miRNA targets of A. thaliana and O. sativa The A.t. and O.s. columns list the number of targets of A. thaliana and O. sativa that were reported in literature, respectively. The WT, xrn4 and osa columns list the number of targets in the A.t. and O.s. column that are predicted by SeqTar in the three data sets, respectively. The WT New, xrn4 New and osa New columns list the number of targets that belong to the same family and are newly predicted by SeqTar. The numbers in parentheses are the number of targets whose miRNA complementary sites are predicted but these miRNA complementary sites have no valid reads. A potential target is counted if it is targeted by at least one member of the miRNA family.

Experimental validation using 5′-RACE assay

The RLM 5′-RACE assay was performed to experimentally validate 19 predicted targets listed in Supplementary Table S13 by using the GeneRacer Kit (Invitrogen). Briefly, total RNA from Arabidopsis and rice were ligated with a 5′-RNA adapter and a reverse transcription was performed using oligodT. The resulting cDNA was used as a template for nested PCR. The first PCR was performed using GeneRacer 5′ primer and a gene-specific primer. The second PCR was performed using GeneRacer 5′ nested primer and a gene-specific nested primer. The amplified products were gel purified, cloned into pGEM T-easy vector and sequenced. Gene-specific primers used in this study were listed in Supplementary Table S13.

Transient co-expression of miR172 and novel target genes (AT5G16480 and Os10g08580) in Nicotiana benthamiana leaves

We chose miR172 and two of its putative novel target genes, one in Arabidopsis, AT5G16480 and the other in rice, Os10g08580, and experimentally analyzed their transient co-expression in N. benthamiana leaves. Arabidopsis MIR172a (the italic font means a sequence used in a construct) was amplified using locus-specific primers. Similarly, full length of AT5G16480 and partial gene product of Os10g08580 (∼600 bp) harboring miR172 complementary sites were amplified from Arabidopsis and rice, respectively (primer sequences were listed in Supplementary Table S17). The clones were initially cloned into TA-vector and sequenced and confirmed that no mutations/errors were introduced during the process. Then the genes were inserted into XbaI and KpnI sites of binary vector pBIB under the control of super promoter. The constructs harboring Ath-MIR172a, AT5G16480 or Os10g08580 were transformed into A. tumefaciens strain GV3101 and these cell cultures were infiltrated into N. benthamiana leaves as described by English et al. (40). For co-expression analysis, equal amount of Agrobacterium culture containing Ath-MIR172a and AT5G16480 or Os10g08580 were mixed before infiltration into N. benthamiana leaves.

RESULTS

Summary of the predictions from SeqTar

We analyzed three degradome data sets, two from Arabidopsis (WT and xrn4) and one from rice (osa) (see ‘Materials and Methods’ section) using SeqTar. SeqTar predicted a total of 235 695, 240 107 and 667 009 sRNA:target pairs in the WT, xrn4 and osa data sets, respectively (Figure 1). After removing duplicate and redundant pairs of different mature miRNAs and alternatively spliced transcripts, 183 194, 188 109 and 461 877 sRNA:target pairs were obtained from the WT, xrn4 and osa data sets, respectively (see Supplementary Methods for details). In addition to the 428 Arabidopsis sRNA:target pairs summarized in Supplementary Table S1, Howell et al. (9) reported that ath-miR161-1, ath-miR161-2, ath-miR400 and seven tasiRNAs derived from athTAS1/2 transcripts can regulate a total of 40 PPR transcripts. We thus did not treat the pairs consisting of these 10 sRNAs and these 40 PPR transcripts from the non-redundant pairs as novel targets in Figure 1. After removing the reported pairs, there were 1 82 673, 1 87 582 and 4 61 505 newly identified pairs in the WT, xrn4 and osa data sets, respectively. These pairs were classified into Category I (with P < 0.1 and P < 10−5) and Category II (with P < 0.1 and P ≥ 10−5). Many new sRNA:target pairs, specifically 3386, 925 and 3101 pairs in the WT, xrn4 and osa datasets respectively, belonged to Category I (see Figure 2d–f). These numbers were further reduced to 2809, 859 and 3036 (in Supplementary Tables S6–S8) after considering a minimum of five valid reads as a cutoff. Some pairs in Category I (i.e. 88, 39 and 92 in WT, xrn4 and osa, respectively) only had ≤3 mismatches. After combining results from the WT and xrn4 data sets, we found 103 novel Category I sRNA:target pairs with ≤3 mismatches for Arabidopsis. Many newly identified targets (solid diamonds in Figure 2d–f) in Category I had >3 mismatches, but had strong accumulations of valid reads as indicated by their P values. Among these identified targets, 4 and 6 with >3 mismatches from Arabidopsis and rice, respectively, were validated (red solid diamonds in Figure 2d–f; Figures 3 and 4; Tables 2 and 3).
Figure 1.

The numbers of predicted targets. m and v stand for the number of mismatches and the number of valid reads, respectively. Cat. I and Cat. II are the Category I and Category II sRNA:target pairs classified by their P and P-values, respectively, as shown in Figure 2. Boxes with thin and thick edges are operations and results, respectively. ‘Reported’ means the number of miRNA:target pairs reported in literature, as summarized Supplementary Tables S1 and S2. The predicted targets in the blue dashed box are used to find combinatorially regulated targets. Cat. I and Cat. II miRNA:target pairs in this box are given in the Supplementary Tables S6–S8 and S14–S16 for the WT, xrn4 and osa data sets, respectively.

Figure 2.

The P and P of sRNA:targets pairs. (a) The sRNA:targets pairs of WT and WT New in Table 1. (b) The sRNA:targets pairs of xrn4 and xrn4 New in Table 1. (c) The sRNA:target pairs of osa and osa New in Table 1. (d) The new sRNA:target pairs in the WT data set that are not shown in (a). (e) The new sRNA:target pairs in the xrn4 data set that are not shown in (b). (f) The new sRNA:targets in the osa data set that are not shown in (c). Circles stand for reported sRNA:target pairs, black diamonds stand for newly identified sRNA:target pairs, and red diamonds stand for newly identified sRNA:target pairs that had been verified with the RLM 5′-RACE experiments, respectively. Green circles and green diamonds stand for reported siRNA:target and new siRNA:target pairs, respectively. I, II, III and IV are the four Categories of sRNA:target pairs classified by their P and P values.

Figure 3.

The experimentally verified novel miRNA targets of Arabidopsis. (a) ath-miR172ab:AT1G24793. (b) ath-miR396b:AT1G53910. (c) ath-miR779-2:AT5G17240. (d) ath-miR172ab:AT5G16480. (e) ath-miR398a:AT3G27200. (f) The conservation of ath-miR398a site on AT3G27200. Abbreviated names, Aly, Zma, Bol, Nta, Rra and Sbi stand for A. lyrata PID:484503, Zea mays DQ245243, Brassica oleracea DK501936, N. tabacum FS399926, Raphanus raphanistrum subsp. maritimus FD965811, and Sorghum bicolor Sb05g007160, respectively. In Part (a) to (e), the x-axis is the position on the transcript, and y-axis is the number of reads detected from a position. The arrows in the upper parts correspond to the positions pointed by the arrows of the same colors in the lower parts. The numbers above the arrows are the number of reads detected at those positions on the WT data set. The numbers in the parenthesis are the cleavage frequencies determined by the RLM 5′-RACE experiments.

Figure 4.

The experimentally verified novel miRNA targets of rice Oryza sativa. (a) osa-miR1319:Os06g01304. (b) osa-miR171h:Os07g36170. (c) osa-miR1852:Os02g27400. (d) osa-miR530-3p:Os05g34720. (e) osa-miR172d:Os10g08580 and osa-miR1425:Os10g08580. (f) osa-miR1867:Os07g22930 and osa-miR1436:Os07g22930. For details refer to the legend of Figure 3. The T-plots and numbers of reads are the results on the osa data set. In part (f), the underlined nucleotides indicate the overlapped regions of different miRNA binding sites.

Table 2.

Some newly found sRNA targets of A. thaliana that belong to Category I

sRNALocusMVRPvPercentageTarget (cDNA)
ath-miR157a-cAT5G248705121.3E-1310.9Zinc finger (C3HC4-type) family protein
ath-miR158aAT1G011603.5121.9E-103.0GIF2; transcription co-activator-related
ath-miR167abAT1G178704223.0E-169.2ATEGY3; Ethylene-Dependent Gravitropism-Deficient And Yellow-Green-Like 3
ath-miR172abAT1G247934.5342.8E-4617.7UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase
ath-miR172abAT5G164805304.4E-5022.9Tyrosine-specific protein phosphatase family protein
ath-miR172b*AT1G604803.5119.0E-119.1Pseudogene, putative ADP-ribosylation factor
ath-miR172cdAT1G51405571.3E-1731.8Myosin-related
ath-miR393abAT1G49260491.0E-1818.4Unknown protein
ath-miR396aAT4G322503.563.3E-103.4Protein kinase family protein
ath-miR396bAT1G539103961.1E-1465.9RAP2.12; transcription factor
ath-miR396bAT2G291604134.6E-3539.4Pseudogene, similar to tropinone reductase I
ath-miR396bAT3G141102.5241.4E-387.2FLU (Fluorescent In Blue Light)
ath-miR396bAT5G430602367.5E-7919.8Cysteine proteinase, putative / thiol protease
ath-miR398aAT2G295603.561.2E-093.5Enolase, putative
ath-miR398aAT3G272004.54850.0E+0073.6Plastocyanin-like domain-containing protein
ath-miR400AT2G338604.5823.1E-1249.1ARF3; transcription factor
ath-miR413AT4G37730474.6E-1412.7AtbZIP7; transcription factor
ath-miR414AT3G012603.5123.0E-3685.7Aldose 1-epimerase
ath-miR414AT3G484704113.0E-2416.4EMB2423; EMBRYO DEFECTIVE 2423
ath-miR414AT5G1040042064.6E-20520.9Histone H3
ath-miR415AT5G175801.5155.5E-114.2Phototropic-responsive NPH3 family protein
ath-miR420AT2G319454.5137.7E-2319.7Unknown protein
ath-miR776AT5G505654.54110.0E+0019.8Unknown protein
ath-miR779-2AT5G172404.5313.4E-6113.7SDG40 (SET DOMAIN GROUP 40)
ath-miR780-1AT1G536503.576.8E-138.3CID8; RNA binding / protein binding
ath-miR783AT1G514204113.8E-1419.0SPP1; Sucrose-Phosphatase 1
ath-miR828AT3G02940562.9E-1213.0AtMYB107; transcription factor
ath-miR829-2AT4G131203.562.3E-126.8Transposable element gene
ath-miR831AT3G272904.586.1E-1919.5F-box family protein-related
ath-miR833-3pAT1G71160561.5E-1317.1KCS7; 3-Ketoacyl-Coa Synthase 7
ath-miR834AT1G77095564.1E-1316.2Transposable element gene
ath-miR834AT5G136804.5261.0E-359.8ABO1; ABA-Overly Sensitive 1, transcription elongation regulator
ath-miR835-5pAT1G714903.561.2E-1519.4PPR protein
ath-miR847AT1G017504.573.1E-1421.2ADF11 (Actin Depolymerizing Factor 11)
ath-miR850AT1G305005146.7E-2015.6NF-YA7; transcriptional repressor (factor)
ath-miR850AT3G50390562.3E-1422.2Transducin/WD-40 repeat family protein
ath-miR854a-dAT1G014903.5511.4E-645.1Heavy-metal-associated domain-containing protein
ath-miR858AT3G626103.5115.2E-117.5ATMYB11; transcription factor
ath-miR858AT5G608903.5105.6E-1311.9MYB34; transcription factor
ath-miR860AT5G260300.572.7E-063.8FC1 (ferrochelatase 1); ferrochelatase
ath-miR870AT1G061903102.0E-133.2TP binding / ATPase
ath-miR1887AT1G528272.5169.2E-133.9Unknown protein
ath-miR2934AT3G13610563.2E-1533.3Oxidoreductase, 2OG-Fe(II) oxygenase family protein
ath-miR2937AT3G42670564.8E-1512.0CHR38, CLSY; DNA binding
ath-miR3434AT1G74420451.35E-1010.2FUT3 (fucosyltransferase 3)
ath-miR3434AT1G67970451.55E-1011.9AT-HSFA8; DNA binding / transcription factor
ath-miR3434*AT1G343553.576.49E-154.7Forkhead-associated domain-containing protein
ath-miR3440b-3pAT1G048305291.37E-334.1RabGAP/TBC domain-containing protein
ath-miR3932abAT1G267305133.29E-2010.9EXS family protein
ath-miR3932abAT2G306204811.55E-15212.4Histone H1.2
ath-miR3933AT1G773304.568.48E-1875.01-aminocyclopropane-1-carboxylate oxidase
ath-miR3933AT1G089805412.69E-574.4AMI1 (amidase 1); amidase/ hydrolase
ath-miR4228AT4G370205249.18E-4414.6Unknown protein
ath-miR4239AT1G708304.51514.92E-1342.9MLP28 (MLP-LIKE PROTEIN 28)
ath-miR4239AT1G702504.562.37E-1310.2Receptor serine/threonine kinase
TAS1a_D4(+)AT3G06940362.6E-114.4Transposable element gene
TAS1a_D9(-)AT4G145103.582.9E-143.4RNA binding
TAS1c_D6(-)AT2G3968121743.9E-2295.4TAS2; other RNA
TAS2_D9(-)AT2G3968102614.36E-3198.5TAS2; other RNA
TAS3c_D4(+)AT2G192604.564.6E-139.1ELM2 domain-containing protein; PHD finger
AT1G62910-tasi4AT4G165702.581.6E-136.3PRMT7; protein Arginine methyltransferase 7

The Columns, M, VR, P and Percentage, mean the mismatches in the sRNA complementary sites, the number of valid reads, the P-value of valid reads, and the percentage of valid reads. In the Target column, PPR protein stands for pentatricopeptide (PPR) repeat-containing protein. The sRNA:target pairs that are verified by the 5′-RACE assay are shown in bold face. The VR, P and Percentage values are calculated from either the WT or the xrn4 data set where the larger accumulation of valid reads is found.

Table 3.

Some newly found sRNA targets of Oryza sativa that belong to Category I

sRNALocusMVRPv(%)Target (cDNA)
miR159cOs03g084805505.2E-385.3rho termination factor, N-terminal domain containing protein
miR168bOs01g059004.5355.6E-145.5Core histone4 H2A/H2B/H3/H4 domain containing protein
miR171hOs07g361704.53920.0E+0028.8Chitin-inducible gibberellin-responsive protein
miR171iOs01g722505282.2E-358.2Uridine 5-monophosphate synthase
miR171iOs03g541005507.5E-366.2Potassium channel protein
miR172dOs04g222705544.2E-725.4Expressed protein
miR172dOs10g0858053190.0E+0011.4FAD binding domain of DNA photolyase domain containing protein
miR319aOs03g342804.5209.9E-145.8Expressed protein
miR398aOs06g425404.5382.2E-2610.1Expressed protein
miR415Os02g222803.5184.7E-288.4Retrotransposon protein, unclassified
miR415Os07g423544.5141.2E-245.6PPR repeat domain containing protein
miR417Os09g315064.5374.1E-296.1Dihydroflavonol-4-reductase
miR419Os04g469905145.0E-226.3cis-zeatin O-glucosyltransferase
miR439a-jOs04g478204.5192.5E-1014.4Expressed protein
miR444bc-1Os03g230504.5173.1E-2611.4Expressed protein
miR444bc-1Os07g324604481.5E-466.9src homology-3 domain protein 3
miR444bc-2Os02g354804.5264.6E-426.3Expressed protein
miR446Os09g275005194.8E-4222.1Cytochrome P450
miR446Os09g300504193.6E-3427.9Expressed protein
miR528Os06g017203.5171.3E-2315.0Expressed protein
miR530-3pOs01g5292051788.2E-2947.7Expressed protein
miR530-3pOs05g024204.51081.8E-1817.4Expressed protein
miR530-3pOs05g347203.52870.0E+0025.5Transcriptional regulator
miR807a-cOs02g266605236.0E-209.0Exonuclease
miR808Os10g267202.5448.2E-4012.9Exonuclease
miR809a-hOs02g291401.5182.8E-2912.1Ankyrin, putative, expressed
miR809a-hOs04g456653191.3E-2428.8Expressed protein
miR810b-1Os12g020405333.3E-395.0Hypoxia-responsive family protein
miR818a-eOs12g318604.5123.4E-2131.6Ureide permease
miR1319Os06g013045.54360.0E+0020.2Spotted leaf 11
miR1423bOs01g192705161.1E-3950.0Expressed protein
miR1428bcdOs10g266003.5151.7E-1112.9Soluble inorganic pyrophosphatase
miR1429-3pOs01g506904586.6E-537.6WD domain, G-beta repeat domain containing protein
miR1436Os01g015204.5161.6E-225.4Transferase family protein
miR1436Os07g229303271.3E-202.3Starch synthase
miR1437Os07g361405301.3E-1312.5Core histone H2A/H2B/H3/H4
miR1438Os06g071005101.5E-1111.5RING-H2 finger protein
miR1439Os03g114904.5627.9E-8820.7Expressed protein
miR1851Os08g036305245.5E-428.1Acyl-activating enzyme 14
miR1852Os02g2740041880.0E+0018.7OsFBX49 - F-box domain containing protein
miR1857-3pOs05g337105531.1E-606.4WD domain, G-beta repeat domain containing protein
miR1857-5pOs11g037204.5255.4E-2316.0Expressed protein
miR1858abOs06g453404283.9E-185.7Peptidyl-prolyl cis-trans isomerase, FKBP-type
miR1861ekmOs10g328105164.1E-247.1Beta-amylase
miR1862dOs07g22930496.2E-050.8Starch synthase
miR1872Os02g487905.5991.0E-1234.9AML1, putative, expressed
miR2099-5pOs03g551644.51233.2E-8110.0OsWRKY4 - Superfamily of TFs having WRKY and zinc finger domains
miR2123a-cOs02g349501544.6E-839.0ATP binding protein, putative, expressed
miR2862Os08g017104.5199.4E-2610.7GLTP domain containing protein
miR2863bOs04g467304.5124.9E-175.4Thioesterase family protein
miR2874Os12g443505341.2E-427.8Actin
miR2878-3pOs02g409005.51802.5E-31837.7RNA recognition motif containing protein
miR2878-5pOs03g071105.5181.3E-4630.0Calmodulin-binding protein
miR2878-5pOs11g191005871.4E-1015.2Retrotransposon protein
miR2925Os08g035903.5389.2E-5410.2Expressed protein
miR2926Os07g336604433.1E-516.3Expressed protein
miR2926Os05g290204259.1E-4910.5Expressed protein
miR2929Os03g192404.5175.1E-244.6AMP-binding enzyme, putative, expressed
miR2930Os02g448704.5732.7E-342.6Dehydrin, putative, expressed
miR2931Os10g309513.5361.5E-351.5Expressed protein

For details refer to the legend of Table 2.

The numbers of predicted targets. m and v stand for the number of mismatches and the number of valid reads, respectively. Cat. I and Cat. II are the Category I and Category II sRNA:target pairs classified by their P and P-values, respectively, as shown in Figure 2. Boxes with thin and thick edges are operations and results, respectively. ‘Reported’ means the number of miRNA:target pairs reported in literature, as summarized Supplementary Tables S1 and S2. The predicted targets in the blue dashed box are used to find combinatorially regulated targets. Cat. I and Cat. II miRNA:target pairs in this box are given in the Supplementary Tables S6–S8 and S14–S16 for the WT, xrn4 and osa data sets, respectively. The P and P of sRNA:targets pairs. (a) The sRNA:targets pairs of WT and WT New in Table 1. (b) The sRNA:targets pairs of xrn4 and xrn4 New in Table 1. (c) The sRNA:target pairs of osa and osa New in Table 1. (d) The new sRNA:target pairs in the WT data set that are not shown in (a). (e) The new sRNA:target pairs in the xrn4 data set that are not shown in (b). (f) The new sRNA:targets in the osa data set that are not shown in (c). Circles stand for reported sRNA:target pairs, black diamonds stand for newly identified sRNA:target pairs, and red diamonds stand for newly identified sRNA:target pairs that had been verified with the RLM 5′-RACE experiments, respectively. Green circles and green diamonds stand for reported siRNA:target and new siRNA:target pairs, respectively. I, II, III and IV are the four Categories of sRNA:target pairs classified by their P and P values. The experimentally verified novel miRNA targets of Arabidopsis. (a) ath-miR172ab:AT1G24793. (b) ath-miR396b:AT1G53910. (c) ath-miR779-2:AT5G17240. (d) ath-miR172ab:AT5G16480. (e) ath-miR398a:AT3G27200. (f) The conservation of ath-miR398a site on AT3G27200. Abbreviated names, Aly, Zma, Bol, Nta, Rra and Sbi stand for A. lyrata PID:484503, Zea mays DQ245243, Brassica oleracea DK501936, N. tabacum FS399926, Raphanus raphanistrum subsp. maritimus FD965811, and Sorghum bicolor Sb05g007160, respectively. In Part (a) to (e), the x-axis is the position on the transcript, and y-axis is the number of reads detected from a position. The arrows in the upper parts correspond to the positions pointed by the arrows of the same colors in the lower parts. The numbers above the arrows are the number of reads detected at those positions on the WT data set. The numbers in the parenthesis are the cleavage frequencies determined by the RLM 5′-RACE experiments. The experimentally verified novel miRNA targets of rice Oryza sativa. (a) osa-miR1319:Os06g01304. (b) osa-miR171h:Os07g36170. (c) osa-miR1852:Os02g27400. (d) osa-miR530-3p:Os05g34720. (e) osa-miR172d:Os10g08580 and osa-miR1425:Os10g08580. (f) osa-miR1867:Os07g22930 and osa-miR1436:Os07g22930. For details refer to the legend of Figure 3. The T-plots and numbers of reads are the results on the osa data set. In part (f), the underlined nucleotides indicate the overlapped regions of different miRNA binding sites. Some newly found sRNA targets of A. thaliana that belong to Category I The Columns, M, VR, P and Percentage, mean the mismatches in the sRNA complementary sites, the number of valid reads, the P-value of valid reads, and the percentage of valid reads. In the Target column, PPR protein stands for pentatricopeptide (PPR) repeat-containing protein. The sRNA:target pairs that are verified by the 5′-RACE assay are shown in bold face. The VR, P and Percentage values are calculated from either the WT or the xrn4 data set where the larger accumulation of valid reads is found. Some newly found sRNA targets of Oryza sativa that belong to Category I For details refer to the legend of Table 2. Predicted targets in Category II with ≤3 mismatches (3700, 3762 and 7148 in the WT, xrn4 and osa data sets, respectively) may not express or express at low level in the sequenced tissues (Supplementary Tables S14–S16). Nevertheless, 81, 67 and 176 sRNA:target pairs from the WT, xrn4 and osa data sets, respectively, had at least five valid reads. After combining the results from the WT and xrn4 datasets, we had 128 novel targets belonging to Category II with ≤3 mismatches and ≥5 valid reads from Arabidopsis.

Validation of the results from SeqTar

In order to verify that SeqTar functions as expected, we first analyzed its performance on the Arabidopsis and rice degradome data sets for identification of reported sRNA targets. Of the 428 reported targets of Arabidopsis, SeqTar recovered 402 and 405 pairs (a total of 412 when merged) from the WT and xrn4 data set (Supplementary Table S1), respectively, with a P threshold of 0.1; the remaining 16 reported targets could be identified with a relaxed P threshold. Consequently, SeqTar achieved a sensitivity of 96.3% (412/428) with a P threshold of 0.1 in identifying the reported pairs of Arabidopsis. In rice, SeqTar identified 381 out of the 457 reported sRNA:target pairs (Supplementary Table S2), achieving a sensitivity of 83.4% with a P threshold of 0.1. After relaxing the P threshold, SeqTar could predict 17 additional reported pairs in rice. We further analyzed SeqTar's capability in identifying of conserved sRNA targets in Table 1. SeqTar successfully found most of these targets, 225/230 for the WT and xrn4 data sets and 122/123 for the osa data set, respectively, as shown in the last row of Table 1. The missing miRNA:target pairs included miR-403:AT1G31290, four miR895:F-Box pairs in Arabidopsis and miR398:CCS1 pair in rice. But these pairs were found with a relaxed P. These results indicate that SeqTar is sensitive in identifying conserved sRNA targets.

Comparisons with CleaveLand

We compared the results of SeqTar with those of CleaveLand (16) reported in the starBase (39). The two degradome data sets of ref. (14) and four degradome data sets of ref. (15) from Arabidopsis were combined and used in the starBase. Similarly, in the starBase, rice miRNA target prediction were performed by combining the degradome data sets in refs (18,20). CleaveLand (version 2) (16) was used in the starBase to predict miRNA:target pairs with at least one read from these combined degradome data sets (39). The duplicate miRNA:target pairs from starBase/CleaveLand, due to individual members of a miRNA family and alternatively spliced target transcripts, were removed to obtain 13 399 and 13 279 unique miRNA:target pairs in Arabidopsis and rice, respectively. The duplicate pairs from SeqTar prediction were also removed; the remaining pairs, collectively named as SeqTar-All, were then compared with CleaveLand's results. Here, SeqTar's results on the WT and xrn4 data sets were combined to form its results for Arabidopsis. In order to compare the ability of SeqTar for finding miRNA:target pairs with valid reads, we also compared CleaveLand's results to the pairs with at least one valid read predicted by SeqTar, named as SeqTar-VR. Then, the results of CleaveLand and SeqTar were further checked against the reported pairs summarized in Supplementary Tables S1 and S2 to compare their performances on detecting the known targets. SeqTar has a better performance in identifying the reported pairs than CleaveLand. On Arabidopsis, SeqTar identified 50 more reported miRNA:target pairs with valid reads than CleaveLand even though four more degradome data sets were used in ref. (15) (Table 4). On rice, similarly, SeqTar outperformed CleaveLand by identifying 28 additional reported miRNA:target pairs with valid reads (Table 4). When taking the pairs without valid reads into account, SeqTar had a significantly better performance than CleaveLand by identifying about 43% and 42% more reported pairs in Arabidopsis and rice, respectively (Table 4).
Table 4.

The comparisons between the CleaveLand Pipeline and the SeqTar pipeline

SeqTar-AllSeqTar-VRstarBase/CLReportedTotal
Arabidopsis
    SeqTar-All41 0207215412246 227
    SeqTar-VR41 020596627741 020
    starBase/CL7215596622713 399
    Reported412277227428
Rice
    SeqTar-All76 4977375382487 305
    SeqTar-VR76 497493821876 497
    starBase/CL7375493819013 279
    Reported382218190458

The number in a cell means the common non-redundant miRNA:target pairs predicted by the methods in the line and the column of the cell. SeqTar-All, SeqTar-VR, starBase/CL and Reported stand for pairs of SeqTar, SeqTar with at least one valid read, starBase/CleaveLand and literature summarized in Supplementary Table S1 (Arabidopsis) and S2 (rice), respectively. SeqTar's results on the WT and xrn4 data sets were combined to form the SeqTar-All and SeqTar-VR in Arabidopsis. The ‘Total’ column listed the total numbers of pairs of SeqTar-All, SeqTar-VR, starBase/CL and Reported.

The comparisons between the CleaveLand Pipeline and the SeqTar pipeline The number in a cell means the common non-redundant miRNA:target pairs predicted by the methods in the line and the column of the cell. SeqTar-All, SeqTar-VR, starBase/CL and Reported stand for pairs of SeqTar, SeqTar with at least one valid read, starBase/CleaveLand and literature summarized in Supplementary Table S1 (Arabidopsis) and S2 (rice), respectively. SeqTar's results on the WT and xrn4 data sets were combined to form the SeqTar-All and SeqTar-VR in Arabidopsis. The ‘Total’ column listed the total numbers of pairs of SeqTar-All, SeqTar-VR, starBase/CL and Reported. The numbers of common predictions from SeqTar-All, SeqTar-VR, starBase/CleaveLand, and reported pairs were summarized in Table 4. In both Arabidopsis and rice, ∼54% of CleaveLand's pairs were overlapped with SeqTar-All. The rest pairs of CleaveLand that were not found in SeqTar-All had an average score of 6.7 in both species. We thus speculated that the P threshold of 0.1 of SeqTar might be too stringent to identify these pairs. After relaxing P to 0.2, SeqTar identified more pairs overlapped with CleaveLand's results: 2004 new pairs in Arabidopsis and 2585 new pairs in rice in addition to those in Table 4.

Conserved miRNAs target additional members of known target gene families

SeqTar's results were analyzed to find whether the conserved miRNAs targeted additional members of the same gene families. Thirty, twenty-eight and twenty-six new targets for the conserved miRNA families had valid reads in the three data sets respectively (see the WT New, xrn4 New and osa New columns of Table 1), suggesting that additional members of these target gene families were also cleaved. These newly found targets generally had more mismatches in their complementary sites (≥4) than those reported, which could explain why these targets could not be identified in previous studies (2,4,6,7,9,14,15,26–29). Details of these newly found targets, along with the previously reported, were listed in Supplementary Tables S3–S5. We also examined the P-values of the complementary sites and valid reads of these conserved sRNA targets (Figures 2a–c). Most conserved targets have very small P values (<10−5) and almost all conserved targets have P values <0.1. The only exception was the 3′ targeting sites of miR390 on TAS3b(AT5G49615) with 6.5 mismatches (9,23). A proper threshold of P needs to be established in order to remove those targets that only had a few valid reads, which might be random degradation products. Because the P values of most conserved sRNA targets with valid reads (106/120, 107/120 and 73/89 for the WT, xrn4 and osa data sets, respectively) were <10−5 (Supplementary Tables S3 to S5, respectively), we used a P value of 10−5 to identify reliable sRNA:target pairs, as indicated by the blue lines in Figure 2. Based on the criteria of P = 0.1 and P = 10−5, all predicted targets could be grouped into four categories: Category I with P < 0.1 and P < 10−5, Category II with P < 0.1 and P ≥ 10−5, Category III with P ≥ 0.1 and P ≥ 10−5, and Category IV with P ≥ 0.1 and P < 10−5 (Figure 2). The miRNA:target pairs in Category I were the most reliable among all four categories because this category had both satisfactory complementary sites and enriched valid reads. The pairs in Category II, such as ath-miR163:SAMT in the WT data set, might also be genuine targets but with no or limited valid reads, which resulted in insignificant P values. Only one reported pair (miR390:AtTAS3b) belonged to Category III (Figure 2a) and IV (Figure 2b) in the WT and xrn4 data sets, respectively. We identified additional targets in Category I (Figures 2a–c and Supplementary Tables S3–S5). These targets included seven MYB family members (targeted by miR858, also see Table 2), two PPR members (targeted by miR400) in Arabidopsis (after combining results of the WT and xrn4 data sets), and an F-Box member (Os05g37690, targeted by miR393) in rice. These newly found targets had more than three mismatches when aligned with the respective miRNAs. Some other MYB family transcription factors were reported to be targets of miR828 (41) and miR858 in Arabidopsis (14,15), respectively. Our results suggest that more MYB family members are targets of these two miRNA families (Table 2).

Novel targets of conserved miRNAs and experimental validations

It is known that conserved miRNAs target members of the same gene families (as summarized in Table 1). To identify additional targets for conserved miRNAs and to determine whether non-conserved miRNAs were functional, we chose the top two targets that has the largest number of reads at their complementary sites (with the smallest P values) for each sRNA in Arabidopsis and rice, respectively. The obtained pairs were manually inspected based on the number of valid reads and the number of mismatches. The resulted miRNA:target pairs in Arabidopsis and rice were listed in Table 2 and 3, respectively. As mentioned in the ‘Materials and Methods’ section, we selected a total of 19 predicted targets, 7 from Arabidopsis and 12 from rice, for experimental validation. Of these genes, four were not amplified in the tissue tested, which could be due to low abundance below detectable level. Of the 15 amplified genes, 12 genes were cleaved at the expected sites, as shown in Figures 3, 4 and Supplementary Figure S4e. Our analyses revealed that conserved miRNAs target new gene families that have more mismatches at the miRNA complementary sites (Tables 2 and 3). For instance, ath-miR398a targets AT3G27200, a plastocyanin-like domain-containing protein, with 4.5 mismatches (Table 2 and Figure 3e). Homologs of this gene in many plant species, but not all, possess miR398 complementary sites (Figure 3f). These results indicated that the miR398 family in some plant species target three conserved gene families, in addition to the two reported families, CSD and CCS1 (Table 1). Ath-miR172ab targets five N-acetylglucosamine deacetylase family transcripts (with 4.5 mismatches, see Supplementary Tables S6 and S7), and one of them (AT1G24793) is validated (Figure 3a); ath-miR172ab targets AT5G16480 (a tyrosine-specific protein phosphatase), which is also validated (with five mismatches, see Figure 3d). Similarly, osa-miR171h:Os07g36170 (a chitin-inducible gibberellin-responsive protein) has 4.5 mismatches and osa-miR172d:Os10g08580 (a FAD binding domain of DNA photolyase domain containing protein) has five mismatches (Table 3), and both are validated (Figure 4b and e). The miR396 family targets the GRF (Growth-Regulating Factor) family (15,18). In our study, we found that ath-miR396 can also regulate RAP2.12, a member of the ERF/AP2 transcription factor family. The miR396b cleavage site on AT1G53910 (RAP2.12) was validated using the 5′-RACE assay although there is a mismatch at position 11 (Figure 3b and Table 2). These examples illustrated that some of the conserved miRNA families can target more than one gene families in Arabidopsis and rice. As shown in Figures 3d and 4e, AT5G16480 in Arabidopsis and Os10g08580 in rice are miR172 targets. To provide further experimental evidence on the accuracy of SeqTar, we infiltrated A. tumefaciens harboring the ath-miR172a primary transcript and two target genes, one from Arabidopsis (AT5G16480) and the other from rice (Os10g08580), into N. benthamiana leaves for transient co-expression analysis. The result confirmed the expression of miR172 in the mock, miR172, AT5G16480/Os10g08580 and miR172+AT5G16480/Os10g08580 infiltrated leaves. As expected, miR172 accumulation is significantly higher in leaves infiltrated with miR172 and miR172+AT5G16480/Os10g08580 than in leaves infiltrated with mock and AT5G16480/Os10g08580 (Figure 5a and b). miR172 is a highly conserved miRNA in plants, so that the detection of miR172 in mock and AT5G16480/Os10g08580 infiltrated N. benthamiana leaves is not surprising and the detected signal in these cases may also be due to endogenous miR172 in N. benthamiana (Figure 5a and b). Transcripts of AT5G16480 or Os10g08580 have been detected in tobacco leaves infiltrated with the respective constructs. Similarly, these transcripts were also detected in leaves infiltrated with AT5G16480/Os10g08580 along with miR172, but not in mock and miR172 infiltrated leaves (Figure 5a and b). AT5G16480/Os10g08580 expression levels were very high in leaves infiltrated with AT5G16480/Os10g08580 alone, but their levels were substantially reduced in the leaves when miR172 and AT5G16480/Os10g08580 were co-expressed (Figure 5a and b). These results indicated that the targets identified by SeqTar are indeed genuine and miR172 can target and cleave the AT5G16480/Os10g08580 transcripts in Arabidopsis/rice.
Figure 5.

The validation of AT5G16480 and Os10g08580 as targets of miR172 using the transient co-expression assay. N. benthamiana leaves were infiltrated with infiltration medium (mock); Agrobacteria harboring Ath-MIR172a alone (miR172); Agrobacteria harboring Arabidopsis transcript AT5G16480/rice transcript Os10g08580 alone (AT5G16480/Os10g08580); co-expression Ath-MIR172a and target genes (miR172+AT5G16480/miR172+Os10g08580). For the co-expression, equal amount of Agrobacterium culture containing Ath-MIR172a and AT5G16480 or Os10g08580 were mixed before infiltration into N. benthamiana leaves. U6 and actin are served as loading controls for miR172 and target gene (AT5G16480 or Os10g08580) detection, respectively. (a) The validation of AT5G16480. (b) The validation of Os10g08580.

The validation of AT5G16480 and Os10g08580 as targets of miR172 using the transient co-expression assay. N. benthamiana leaves were infiltrated with infiltration medium (mock); Agrobacteria harboring Ath-MIR172a alone (miR172); Agrobacteria harboring Arabidopsis transcript AT5G16480/rice transcript Os10g08580 alone (AT5G16480/Os10g08580); co-expression Ath-MIR172a and target genes (miR172+AT5G16480/miR172+Os10g08580). For the co-expression, equal amount of Agrobacterium culture containing Ath-MIR172a and AT5G16480 or Os10g08580 were mixed before infiltration into N. benthamiana leaves. U6 and actin are served as loading controls for miR172 and target gene (AT5G16480 or Os10g08580) detection, respectively. (a) The validation of AT5G16480. (b) The validation of Os10g08580.

Identification of new targets of non-conserved miRNAs and siRNAs

Many non-conserved miRNAs in Arabidopsis and rice were found to have cleavable targets, e.g. ath-miR779-2:AT5G17240 (Figure 3c), ath-miR3932b:AT2G30620, ath-miR3933:AT1G08980, and ath-miR4239:AT1G70830 (Table 2) and osa-miR1319:Os06g01304 (Figure 4a), osa-miR1852:Os02g27400 (Figure 4c), osa-miR2878-3p:Os02g40900 and osa-miR2878-5p:Os11g19100 (Table 3). Some of the pairs, such as ath-miR860:AT5G26030 with 0.5 mismatches (Table 2) and osa-miR2123a-c:Os02g34950 with 1 mismatch (Table 3), were highly complementary. Unlike the conserved miRNAs targeting many transcription factors, a few transcription factors were identified as targets of non-conserved sRNAs in Arabidopsis and rice. As listed in Table 2, only seven targets in Arabidopsis, i.e. ARF3 (AT2G33860, targeted by miR400), bZIP7 (AT4G37730, targeted by miR413), MYB107 (AT3G02940, targeted by miR828), NF-YA7 (AT1G30500, targeted by miR850), MYB11 (AT3G62610, targeted by miR858), MYB34 (AT5G60890, targeted by miR858) and HSFA8 (AT1G67970, targeted by miR3434), are transcription factors. In rice, a non-conserved miRNA osa-miR530-3p targeted Os05g34720, a transcription factor, which was also validated in this study (Figure 4d and Table 3). The non-conserved miRNAs, osa-miR1436 and osa-miR1867, target Os07g22930, a starch synthase protein (Figure 4f and Table 3). osa-miR1439 also has a complementary site with 3.5 mismatches on Os07g22930, which has 3 valid reads (P = 0.06), at 3 nt upstream of osa-miR1436 complementary site (Figure 4f). Interestingly, our analysis suggest that osa-miR1436 and osa-miR1439 can also combinatorially regulate another starch synthase, Os06g06560 (Supplementary Figure S2). These results suggested that osa-miR1436, osa-miR1439 and probably osa-miR1867 can regulate genes implicated in starch synthesis pathways in rice. Furthermore, our analysis also suggested that some siRNAs derived from both TAS1/2 and PPR transcripts might also target other transcripts. For examples, TAS1a_D4(+) can target AT3G06940, a transposable element, and AT1G62910-tasi4 (an siRNA derived from AT1G62910) can target AT4G16570, Protein Arginine Methyltranferase 7 (Table 2).

The combinatorial regulations of mRNA targets

In order to investigate potential combinatorial regulations by different miRNA families, we examined the previously reported miRNA:targets pairs (Supplementary Tables S1 and S2) and the pairs in the dashed box of Figure 1 (Supplementary Tables S6–S8 for Category I pairs, and S14–S16 for Category II pairs, respectively). Some of the combinatorially regulated targets are shown in Figures 6 and 7. For instance, AT3G26810 (an F-box family protein) was a known target of ath-miR393 (15,28). Our analysis suggested that AT3G28160 could also be regulated by ath-miR396b (Figure 6b). Zhou et al. (20) reported that osa-miR806 guided cleavage on Os02g43370 (Table S2). We find that osa-miR2123 can also regulate Os02g43370. The complementary sites of osa-miR806 and osa-miR2123 on Os2g43370 are partially overlapping (Figure 7b). Similarly, osa-miR446 can regulate Os02g29140 (19,20) (Supplementary Table S2). Our analysis shows that osa-miR809 can target Os02g29140 transcript with a partially overlapping complementary site (Figure 7h). We also recognize that osa-miR809, osa-miR446 and osa-miR808 combinatorially regulate several other transcripts, such as Os01g15520, Os06g19990, Os08g40440, Os10g26720 and Os12g12950 (Supplementary Table S8), indicating the existence of several common targets of these three miRNAs. Furthermore, AT5G38480 was found to be cleaved by AT1G62910-tasi4 and ath-miR167 (Figure 6f), suggesting a combinatorial regulation resulting from PPR-derived siRNA and miRNA. TAS3 derived siRNAs are known to target ARF3 (AT2G33860) transcript (6,15,26). Additionally, our analysis revealed that ath-miR400 could also target ARF3 transcript but at a different site with 4.5 mismatches (Supplementary Figure S1). These results, together with many other examples in the current study (Figures 6 and 7 and Supplementary Tables S6–S8) suggested that one transcript could be targeted by two or more different sRNA in Arabidopsis and rice.
Figure 6.

The predicted Arabidopsis targets that are combinatorially regulated. (a) AT5G11260. (b) AT3G26810. The blue binding site of ath-miR393ab was a reported site. (c) AT1G17650. (d) AT3G07990. (e) AT2G27530. (f) AT5G38480. For details refer to the legend of Figure 3. WT and xrn4 in parenthesis indicate the sample where the T-plots and number of reads were obtained.

Figure 7.

The predicted rice targets that are combinatorially regulated. (a) Os01g44990. (b) Os02g43370. (c) Os03g06960. (d) Os03g55164. (e) Os04g44800. (f) Os08g08190. (g) Os05g02420. (h) Os02g29140. (i) Os04g41620. For details refer to the legend of Figure 3. The blue sites were published sites, see Supplementary Table S2. In part (b), (d) and (h), the underlined nucleotides indicate the overlapped regions of different miRNA binding sites, and the numbers above start and end of the target sequences are the start and end positions of the binding sites, respectively.

The predicted Arabidopsis targets that are combinatorially regulated. (a) AT5G11260. (b) AT3G26810. The blue binding site of ath-miR393ab was a reported site. (c) AT1G17650. (d) AT3G07990. (e) AT2G27530. (f) AT5G38480. For details refer to the legend of Figure 3. WT and xrn4 in parenthesis indicate the sample where the T-plots and number of reads were obtained. The predicted rice targets that are combinatorially regulated. (a) Os01g44990. (b) Os02g43370. (c) Os03g06960. (d) Os03g55164. (e) Os04g44800. (f) Os08g08190. (g) Os05g02420. (h) Os02g29140. (i) Os04g41620. For details refer to the legend of Figure 3. The blue sites were published sites, see Supplementary Table S2. In part (b), (d) and (h), the underlined nucleotides indicate the overlapped regions of different miRNA binding sites, and the numbers above start and end of the target sequences are the start and end positions of the binding sites, respectively.

Self- and cross-repression of TAS/PPR transcripts

Mapping 20 nt reads to the TAS transcripts suggested that TAS1a (AT2G27400), TAS1c (AT2G39675) and TAS2 (AT2G39681) transcripts are subjected to cleavages guided by the siRNAs derived from their own precursors (Supplementary Figure S4). In addition to ath-miR173 cleavage sites, all these transcripts are regulated by at least one other siRNA, TAS1c_D6(−). The regulation of TAS2 by TAS1c_D6(−) siRNA was validated using the 5′-RACE assay (Supplementary Figure S4e). TAS1c was regulated by two other siRNAs, TAS1c_D10(−) and TAS1a_D9(−) (Supplementary Figure S4c and d). TAS2 was regulated by three siRNAs derived from its own transcript, TAS2_D6(−), TAS2_D9(−) and TAS2_D11(−) (Supplementary Figure S4e and f). Similarly, cleavage on TAS4 (AT3G25795) was guided by one of the self-derived tasiRNA, TAS4_D4(−) (P < 10−4 in the WT data set, see Supplementary Table S9). These results suggested that tasiRNAs derived from TAS1, TAS2 and probably TAS4, regulate and repress their own transcripts. AT1G62910, a PPR transcript, possessed three target sites for five different sRNAs (Supplementary Figure S5a and b). Among the three sites, one had a major peak and the other two had minor peaks. TAS2_D6(−) could contribute the major peak and the other two minor peaks could be attributed to AT1G62910−tasi3/ath−miR161−1 and AT1G63400−tasi1/ath−miR161−2, where AT1G62910−tasi3 and AT1G63400−tasi1 were miR-161-like siRNA derived from PPR transcripts (Figure 8b). Similar regulations on AT1G62930 and AT1G62860 were also identified (Supplementary Figure S5c–f).
Figure 8.

The self-repression of TAS and PPR transcripts. (a) A schematic view of ath-miR173/TAS1,TAS2/PPR sRNA generating cascade. The green arrows stand for the sRNA-mediated regulation that are required to generate sRNAs. The two red dull arrows stand for the cleavages of transcripts to repress the ever-expanding cascade at the TAS1/2 and PPR level, respectively. (b) The ath-miR161 and ath-miR161-like sRNAs that are derived from the PPR transcripts. The underlined nucleotides are identical in all four sRNAs.

The self-repression of TAS and PPR transcripts. (a) A schematic view of ath-miR173/TAS1,TAS2/PPR sRNA generating cascade. The green arrows stand for the sRNA-mediated regulation that are required to generate sRNAs. The two red dull arrows stand for the cleavages of transcripts to repress the ever-expanding cascade at the TAS1/2 and PPR level, respectively. (b) The ath-miR161 and ath-miR161-like sRNAs that are derived from the PPR transcripts. The underlined nucleotides are identical in all four sRNAs. AT1G63080 was targeted by TAS2_D6(−), miR161-1 and miR161-2, and it has been predicted that miR400, TAS2_D9(−) and TAS2_D11(−) can also target AT1G63080 (6). Our analysis confirmed that TAS2_D11(−) indeed induced a major cleavage site on AT1G63080 transcript. TAS2_D6(−) and miR161-1/AT1G62910-tasi3 contribute to another two minor cleavage sites, respectively (see Supplementary Table S10). Sixteen other PPR transcripts, i.e. AT1G06580, AT1G12775, AT1G19720, AT1G26460, AT1G62590, AT1G62860, AT1G62910, AT1G62930, AT1G63080, AT1G63130, AT1G63150, AT1G63330, AT1G63400, AT5G08510, AT5G16640 and AT5G41170, were found to be cleaved by at least two different sRNAs at different positions (Supplementary Table S10). As reported in (9), ath-miR161-1 and ath-miR161-2 can regulate as many as 40 PPR transcripts. Our results suggested that several siRNAs derived from PPR genes, especially the two ath-miR161 like siRNAs, AT1G62910-tasi3 and AT1G63400-tasi1, were involved in self- or cross-repression of many PPR transcripts (see Supplementary Table S10). Our results also suggested that a pseudogene of PPR proteins, AT1G62860, was cleaved by TAS2_D12(−), TAS2_D9(−), ath-miR161-1 and AT1G62910-tasi3 (Supplementary Figure S5e and f). In summary, these results suggest that there are complex combinatorial self- and cross-repression in the ath-miR173/TAS/PPR siRNA regulation cascade.

Self-repression of miRNAs in Arabidopsis

German et al. (14) found that ath-miR172 can self-repress the primary transcript of ath-miR172b. Four other miRNAs, ath-miR390a, ath-miR398b, ath-miR396a and ath-miR396b, also have similar self-repression guided by their own mature miRNAs (14). We found that four more miRNA families, ath-miR163, ath-miR860, ath-miR166f and ath-miR393b (Supplementary Figure S3) also self-repressed their own precursors (P < 10−3), suggesting that the self-repression of pre-miRNAs is more prevalent in Arabidopsis than previously reported.

The false discovery rate of SeqTar

We used the method introduced by Storey and Tibshirani (42) to evaluate the False Discovery Rate (FDR) of SeqTar's results. We estimated the FDR and q-values of P and P, respectively. The q-value is a measure of significance in terms of the FDR (42). The FDR and q-values of all new predictions were <0.05 when the thresholds of P and P were set to 0.1, except for the P of new and Category II predictions of the osa data (Supplementary Table S11). But these measures were <0.05 if a slightly more stringent P-value, P ≤ 0.07, was used. Because P and P were calculated independently, FDR and q-values of P and P were also supposed to be independent. Therefore, it was reasonable to expect the FDR and q of a predicted sRNA:target pair were <0.0025 (0.052) when both P < 0.1 and P < 0.1 (or P < 0.05 for large number of predictions such as the osa data set) were satisfied. This suggested that the FDR of newly predicted sRNA:target pairs were much <0.01 when both P < 0.1 and P < 0.1 (or P < 0.05 for a large number of predictions) were satisfied. The FDRs of the pairs of Category I were <10−4 (in Supplementary Table S11), indicating that the predictions of Category I were highly reliable. The FDR and q-values of P of reported pairs were <0.01, which was consistent with the preference of intensively matched complementary sites in the reported pairs. The FDR and q-values of P of reported pairs were smaller than pairs in Category II but larger than pairs in Category I (see Supplementary Table S11). In summary, the FDR values suggested that the results of SeqTar were reliable and had a very low ratio of false positives if both P and P were set to 0.05, or even P < 0.1 in all cases and P < 0.1 in most cases (see Supplementary Table S11).

Efficiency of SeqTar

SeqTar used about 1000 and 2000 CPU seconds of an Intel Xeon 2.66 GHz 64 bit CPU to search potential targets of one sRNA against all transcripts of Arabidopsis and rice, respectively. In addition to a few efficient supporting steps (see Supplementary Methods), it took a modest number of hours to perform target predictions on all annotated transcript cDNA sequences for all miRNAs and siRNAs in both of these two species on a normal server computer with multiple CPUs.

DISCUSSION

SeqTar's improved performance

In this study, we have demonstrated that SeqTar is a more effective and efficient computational method for identification of miRNA/siRNA targets from the degradome data sets in plants. By relaxing the number of mismatches, SeqTar found many new targets for conserved and non-conserved miRNAs in Arabidopsis and rice. The improved performance of SeqTar could be attributed to three major facts. First, instead of setting a subjective criterion such as the number of mismatches in its prediction, SeqTar used the P-values of mismatches generated with shuffled sRNA sequences. Because different miRNA families have varied number of targets and conserved miRNAs tend to bind to regions with high complementarities in their targets, P could have a better capability in differentiating true complementary sites from false ones. It is also better to use P-values than a specified number of mismatches for miRNAs of different lengths because longer miRNAs should be able to tolerate a few more mismatches than shorter ones. For example, 24 nt miRNAs such as ath-miR829-1 (Figure 6e), osa-miR1867 (Figure 4f), osa-miR1874-5p (Figure 7e) and osa-miR1862 (Figure 7f) could cleave their targets despite having >5 mismatches in the complementary sites. Second, SeqTar treated mismatches and G:U pairs in different positions of sRNA complementary sites equally. In previous studies, mismatches and G:U pairs in the 2 nt to 13 nt region received more penalties (6,15,16) and were not allowed at positions 10 and 11 (7). However, our results indicated that some sRNA complementary sites with mismatches and G:U pairs at these positions are also subjected to sRNA-guided cleavages. Eight verified miRNA:target pairs (Figures 3a–d and 4a, b, d and e) had at least two mismatches within the regions of the 2–13th nt. Among these eight pairs, osa-miR171h:Os07g36170 and ath-miR396b:AT1G53190 also had a mismatch at position 10 and 11, respectively (in Figures 3b and 4b). Two published work (6,43) also support our findings. Allen et al. (6) verified that ath-miR173 can cleave AT1G50055 (TAS1b) even the positions 10 and 9 of their complementary site are mismatches; Mallory et al. (43) demonstrated that a mutated miR165 complementary site with a mismatch at position 10 can be cleaved. More importantly, SeqTar took advantage of the abundance of valid reads, i.e. reads mapped to the 9–11 nt region, to perform a statistical analysis of sRNA complementary sites. In particular, the P values were calculated to evaluate the abundance of valid reads at the predicted cleavage sites. By combining the P and P-values, SeqTar's sensitivity and specificity were enhanced to outperform the methods that only used sequence information alone. Our results clearly suggest that the existing criteria of predicting targets for sRNA in plants may be too stringent to successfully identify genuine targets with weak complementarities. Finally, as a rule of thumb for using SeqTar, if P < 10−5, a P threshold of 0.1 can be used to find miRNA:target pairs with a good sensitivity and reasonable specificity. If P ≥ 10−5, it is better to use a stringent P value of ≤0.05 (or 0.01), or alternatively to restrict the number of mismatches m ≤ 4 as a criterion as proposed in early studies. For instance, by using P < 10−5 and P < 0.1, 41.6% and 45.0% reported pairs in Supplementary Table S1 could be identified on the WT and xrn4 data sets, respectively. Then, by using P < 0.05 alone, additional 43% pairs in Supplementary Table S1 were identified on both the WT and xrn4 data sets. Similarly, 132 and 245 out of the 458 reported pairs of rice in Supplementary Table S2 could be identified on the osa data set by using the same criteria.

More sRNA targets exist than previously reported

Even with a very strict criterion of P < 10−5 and ≤3 mismatches in complementary sites, SeqTar found 103 and 92 novel sRNA targets in Arabidopsis and rice, respectively. Another 128 and 176 novel target sites in Arabidopsis and rice, respectively, had ≤3 mismatches and at least five valid reads. If using P < 0.1, instead of restricting the number of mismatches m ≤ 3, and P < 10−5, >3000 novel miRNA:target pairs could be detected in both species (see Category I predictions in Figure 1 and Supplementary Tables S6–S8). Our results suggest that several newly identified non-conserved miRNAs are functional. As shown in Supplementary Tables S6–S8 and Figures 6 and 7, as well as Supplementary Tables S14–S16, a small percentage of targets are combinatorially regulated by more than one sRNA in these two species.

sRNA induced self- and cross-repression

The tasiRNAs derived from TAS1a/c and TAS2 may self- and/or cross-target their own transcripts (Figure 8a). Two ath-miR161 like siRNAs (Figure 8b) are derived from AT1G62910, AT1G62930, AT1G63130 and AT1G63400, which are close paralogs of the PPR-P clade proteins (9). As shown in Supplementary Figures S5a–f, they might potentially target their own transcripts and many other PPR transcripts (see Supplementary Table S10). As reported by Howell et al. (9), ath-miR161 might target as many as 40 PPR transcripts, including the 28 genes in the PPR-P clade. These observations suggested that the ath-miR161 like siRNAs derived from these closely related PPR paralogs repressed the ever-enlarging sRNA generation cascade originated from ath-miR173 at the PPR level (Figure 8a). Current model of ath-miR173/TAS/PPR cascade suggests that the ath-miR173 guided cleavage leads to the generation of tasiRNAs on TAS1 and TAS2, and some of these tasiRNAs induce the generation of siRNAs from PPR transcripts. But our analysis suggested that some tasiRNAs repressed their own transcripts at the TAS1 and TAS2 level (Figure 8a), and some siRNAs generated from PPR genes could potentially be involved in the silencing of PPR-P clade transcripts as also reported by Howell et al. (9). Furthermore, some siRNAs derived from both TAS1/2 and PPR transcripts might also target other transcripts. As listed in Table 2, TAS1a_D4(+) targeted AT3G06940, a transposable element, and AT1G62910-tasi4 targeted AT4G16570, Protein Arginine Methyltransferase 7. These results suggested that some siRNAs generated from the ath-miR173/TAS/PPR cascade might also have other targets, similar to the TAS3-siRNAs targeting the ARF family members (Table 1). As shown in Supplementary Figure S5e and f, our results suggested that a pseudogene of PPR proteins, AT1G62860, was regulated by TAS2_D12(−), TAS2_D9(−), ath-miR161-1 and AT1G62910-tasi3. Poliseno et al. (44) recently found that transcripts produced from pseudogene PTENP1, named as miRNA decoys, regulated the expression level of tumor suppressor gene PTEN by absorbing miRNAs that had complementary sites on both PTENP1 and PTEN transcripts. The case of AT1G62860 demonstrated that the so-called miRNA decoys were also applicable to trans-acting siRNAs, which made the miR173/TAS/PPR pathway even more complicated than previously thought (Figure 8a). Besides tasiRNAs, our analyses suggested that several additional miRNA families, ath-miR163, ath-miR860, ath-miR166 and ath-miR393 of Arabidopsis thaliana self-repressed their own primary or precursor transcripts, in addition to the ath-miR172, ath-miR390, ath-miR398 and ath-miR396 families reported in ref. (14).

CONCLUSIONS

The contributions of this study are 3-fold. First, it introduced a novel algorithm, called SeqTar, for identifying sRNA-induced cleavages captured in degradomes. Second, SeqTar identified many new sRNA targets in Arabidopsis and rice that could be missed when using stringent criteria. Finally, the use of P-value for evaluating the abundance of valid reads is a better means to identify sRNA guided cleavage sites on mRNA targets that have >4 mismatches than the existing criteria. The extra penalties to mismatches in the 2–13 th nt region and disallowing mismatch and G:U Wobble pair at positions 10 and 11 used in the existing criteria may miss these targets. By simultaneously taking into consideration the P-value of mismatches and P-value of valid reads, the false positive rate of SeqTar was further reduced than the other methods that only used alignment information. Our results suggested the existence of more targets with more mismatches and with mismatches at position 10 or 11. Our study offered novel insights into the principles that sRNAs follow in recognizing and degrading their targets in plants.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Tables 1–17, Supplementary Figures 1–5 and 7, Supplementary Methods and Supplementary Reference [45].

FUNDING

The research was supported in part by a start-up grant of Fudan University and a grant of the Science and Technology Commission of Shanghai Municipality (10ZR1403000 to Y.Z.); by NSF-EPSCOR award EPS0814361 and Oklahoma Agricultural Experiment Station (to R.S.); and by NSF (grant DBI-0743797) and NIH (grants R01GM086412 and RC1AR058681 (to W.Z.) Conflict of interest statement. None declared.
  42 in total

1.  OsSPL14 promotes panicle branching and higher grain productivity in rice.

Authors:  Kotaro Miura; Mayuko Ikeda; Atsushi Matsubara; Xian-Jun Song; Midori Ito; Kenji Asano; Makoto Matsuoka; Hidemi Kitano; Motoyuki Ashikari
Journal:  Nat Genet       Date:  2010-05-23       Impact factor: 38.330

2.  Regulation of OsSPL14 by OsmiR156 defines ideal plant architecture in rice.

Authors:  Yongqing Jiao; Yonghong Wang; Dawei Xue; Jing Wang; Meixian Yan; Guifu Liu; Guojun Dong; Dali Zeng; Zefu Lu; Xudong Zhu; Qian Qian; Jiayang Li
Journal:  Nat Genet       Date:  2010-05-23       Impact factor: 38.330

3.  MicroRNA gene evolution in Arabidopsis lyrata and Arabidopsis thaliana.

Authors:  Noah Fahlgren; Sanjuro Jogdeo; Kristin D Kasschau; Christopher M Sullivan; Elisabeth J Chapman; Sascha Laubinger; Lisa M Smith; Mark Dasenko; Scott A Givan; Detlef Weigel; James C Carrington
Journal:  Plant Cell       Date:  2010-04-20       Impact factor: 11.277

4.  Arabidopsis lyrata small RNAs: transient MIRNA and small interfering RNA loci within the Arabidopsis genus.

Authors:  Zhaorong Ma; Ceyda Coruh; Michael J Axtell
Journal:  Plant Cell       Date:  2010-04-20       Impact factor: 11.277

5.  Rice MicroRNA effector complexes and targets.

Authors:  Liang Wu; Qingqing Zhang; Huanyu Zhou; Fangrui Ni; Xueying Wu; Yijun Qi
Journal:  Plant Cell       Date:  2009-11-10       Impact factor: 11.277

6.  The sequential action of miR156 and miR172 regulates developmental timing in Arabidopsis.

Authors:  Gang Wu; Mee Yeon Park; Susan R Conway; Jia-Wei Wang; Detlef Weigel; R Scott Poethig
Journal:  Cell       Date:  2009-08-21       Impact factor: 41.582

7.  Sliced microRNA targets and precise loop-first processing of MIR319 hairpins revealed by analysis of the Physcomitrella patens degradome.

Authors:  Charles Addo-Quaye; Jo Ann Snyder; Yong Bum Park; Yong-Fang Li; Ramanjulu Sunkar; Michael J Axtell
Journal:  RNA       Date:  2009-10-22       Impact factor: 4.942

8.  Expression analysis of phytohormone-regulated microRNAs in rice, implying their regulation roles in plant hormone signaling.

Authors:  Qing Liu; Yu-Chan Zhang; Cong-Ying Wang; Yu-Chun Luo; Qiao-Juan Huang; Shao-Yu Chen; Hui Zhou; Liang-Hu Qu; Yue-Qin Chen
Journal:  FEBS Lett       Date:  2009-01-22       Impact factor: 4.124

9.  Cloning and characterization of small RNAs from Medicago truncatula reveals four novel legume-specific microRNA families.

Authors:  Guru Jagadeeswaran; Yun Zheng; Yong-Fang Li; Lata I Shukla; Jessica Matts; Peter Hoyt; Simone L Macmil; Graham B Wiley; Bruce A Roe; Weixiong Zhang; Ramanjulu Sunkar
Journal:  New Phytol       Date:  2009-06-23       Impact factor: 10.151

10.  Identification of precursor transcripts for 6 novel miRNAs expands the diversity on the genomic organisation and expression of miRNA genes in rice.

Authors:  Séverine Lacombe; Hiroshi Nagasaki; Carole Santi; David Duval; Benoît Piégu; Martine Bangratz; Jean-Christophe Breitler; Emmanuel Guiderdoni; Christophe Brugidou; Judith Hirsch; Xiaofeng Cao; Claire Brice; Olivier Panaud; Wojciech M Karlowski; Yutaka Sato; Manuel Echeverria
Journal:  BMC Plant Biol       Date:  2008-12-02       Impact factor: 4.215

View more
  44 in total

1.  New technologies for 21st century plant science.

Authors:  David W Ehrhardt; Wolf B Frommer
Journal:  Plant Cell       Date:  2012-02-24       Impact factor: 11.277

Review 2.  The use of high-throughput sequencing methods for plant microRNA research.

Authors:  Xiaoxia Ma; Zhonghai Tang; Jingping Qin; Yijun Meng
Journal:  RNA Biol       Date:  2015       Impact factor: 4.652

3.  StarScan: a web server for scanning small RNA targets from degradome sequencing data.

Authors:  Shun Liu; Jun-Hao Li; Jie Wu; Ke-Ren Zhou; Hui Zhou; Jian-Hua Yang; Liang-Hu Qu
Journal:  Nucleic Acids Res       Date:  2015-05-18       Impact factor: 16.971

4.  MiRNAting control of DNA methylation.

Authors:  Ashwani Jha; Ravi Shankar
Journal:  J Biosci       Date:  2014-06       Impact factor: 1.826

5.  Global Analysis of Truncated RNA Ends Reveals New Insights into Ribosome Stalling in Plants.

Authors:  Cheng-Yu Hou; Wen-Chi Lee; Hsiao-Chun Chou; Ai-Ping Chen; Shu-Jen Chou; Ho-Ming Chen
Journal:  Plant Cell       Date:  2016-10-14       Impact factor: 11.277

6.  Exploring the evolutionary differences of SBP-box genes targeted by miR156 and miR529 in plants.

Authors:  Li-Zhen Ling; Shu-Dong Zhang
Journal:  Genetica       Date:  2012-09-29       Impact factor: 1.082

7.  The RNA degradome: a precious resource for deciphering RNA processing and regulation codes in plants.

Authors:  Xiaoxia Ma; Xiaopu Yin; Zhonghai Tang; Hidetaka Ito; Chaogang Shao; Yijun Meng; Tian Xie
Journal:  RNA Biol       Date:  2020-04-26       Impact factor: 4.652

8.  Widespread Exon Junction Complex Footprints in the RNA Degradome Mark mRNA Degradation before Steady State Translation.

Authors:  Wen-Chi Lee; Bo-Han Hou; Cheng-Yu Hou; Shu-Ming Tsao; Ping Kao; Ho-Ming Chen
Journal:  Plant Cell       Date:  2020-01-27       Impact factor: 11.277

9.  Transcriptome and Degradome Sequencing Reveals Dormancy Mechanisms of Cunninghamia lanceolata Seeds.

Authors:  Dechang Cao; Huimin Xu; Yuanyuan Zhao; Xin Deng; Yongxiu Liu; Wim J J Soppe; Jinxing Lin
Journal:  Plant Physiol       Date:  2016-10-19       Impact factor: 8.340

10.  Widespread long noncoding RNAs as endogenous target mimics for microRNAs in plants.

Authors:  Hua-Jun Wu; Zhi-Min Wang; Meng Wang; Xiu-Jie Wang
Journal:  Plant Physiol       Date:  2013-02-21       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.