Literature DB >> 26626050

Identification of Nicotiana benthamiana microRNAs and their targets using high throughput sequencing and degradome analysis.

Ivett Baksa1, Tibor Nagy2, Endre Barta3, Zoltán Havelda4, Éva Várallyay5, Dániel Silhavy6, József Burgyán7, György Szittya8.   

Abstract

BACKGROUND: Nicotiana benthamiana is a widely used model plant species for research on plant-pathogen interactions as well as other areas of plant science. It can be easily transformed or agroinfiltrated, therefore it is commonly used in studies requiring protein localization, interaction, or plant-based systems for protein expression and purification. To discover and characterize the miRNAs and their cleaved target mRNAs in N. benthamiana, we sequenced small RNA transcriptomes and degradomes of two N. benthamiana accessions and validated them by Northern blots.
RESULTS: We used a comprehensive molecular approach to detect and to experimentally validate N. benthamiana miRNAs and their target mRNAs from various tissues. We identified 40 conserved miRNA families and 18 novel microRNA candidates and validated their target mRNAs with a genomic scale approach. The accumulation of thirteen novel miRNAs was confirmed by Northern blot analysis. The conserved and novel miRNA targets were found to be involved in various biological processes including transcription, RNA binding, DNA modification, signal transduction, stress response and metabolic process. Among the novel miRNA targets we found the mRNA of REPRESSOR OF SILENCING (ROS1). Regulation of ROS1 by a miRNA provides a new regulatory layer to reinforce transcriptional gene silencing by a post-transcriptional repression of ROS1 activity.
CONCLUSIONS: The identified conserved and novel miRNAs along with their target mRNAs also provides a tissue specific atlas of known and new miRNA expression and their cleaved target mRNAs of N. benthamiana. Thus this study will serve as a valuable resource to the plant research community that will be beneficial well into the future.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26626050      PMCID: PMC4667520          DOI: 10.1186/s12864-015-2209-6

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Nicotiana benthamiana is a widely used model plant species for research on plant-pathogen interactions as well as other areas of plant science. It is particularly popular among plant virologist as it is susceptible to a large number of different plant viruses. It is also widely used in studies requiring protein localization, interaction, or plant-based systems for protein expression and purification. In addition, its susceptibility to pathogens has been used to transiently express proteins, using either engineered viruses or syringe-infiltration of Agrobacterium tumefaciens [1, 2]. The syringe-infiltration method proved to be extremely powerful tool to identify and characterize many viral silencing suppressor proteins [3], pathogens effector proteins [4] and also components of the plant nonsense-mediated mRNA decay system [5]. N. benthamiana is an herbaceous plant endemic to Australia [2], belonging to the Suaveolentes section of the Nicotiana genus of Solanaceae family. N. benthamiana is an allotetraploid species with 19 chromosome pairs, resulting from the hybridization of two unknown progenitors. The estimated genome size of N. benthamiana is 3 GB [2]. The draft genome sequence of the N. benthamiana Nb-1 accession has been published recently [1]. The origins of the N. benthamiana accessions used for research are generally not known. However it has been suggested that only a single accession of N. benthamiana, or a collection of closely related accessions, are being used by the plant research community today, since they are very similar in both susceptibility to pathogens and also in phenotype [2]. The genomes of economically important close relatives of N. benthamiana, such as potato (Solanum tuberosum), tomato (Solanum lycopersicum) and pepper (Capsicum annuum), genomes have been also sequenced recently [6-9]. These species has either smaller or very similar genome size to N. benthamiana. The Potato (Solanum tuberosum) genome size is 844 Mb [6], the tomato (Solanum lycopersicum) genome size is approximately 900 Mb [8] and both species has twelve chromosomes. Other economically important members of the Solanaceae family is the pepper and the eggplant. The widely cultivated C. annuum accession Zunla-1 genome size is estimated to be 3.26 Gb [9], while the genome size of the eggplant is approximately 833.1 Mb [10]. Gene expression is regulated at several layers to ensure optimal spatial and temporal accumulation of proteins. RNA silencing is an evolutionarily conserved, important gene expression regulator of various cellular processes in eukaryotes, whereby 21-24 nt small RNAs (sRNAs) direct silencing of endogenous genes or pathogens. Plant sRNAs have two major categories based on their biogenesis: small interfering RNA (siRNA) processed from perfectly double stranded RNA; and microRNA (miRNA) derived from single stranded RNA transcripts that form imperfectly double-stranded stem loop precursor structures [11]. MIR genes encode the precursor transcripts which are cleaved into miRNAs by Dicer-like 1 (DCL-1), and subsequently loaded into an RNA Induced Silencing Complex (RISC), containing an ARGONAUTE (AGO) protein. miRNA guided RISCs are able to inhibit expression of genes with at least partial homology to the miRNA by either transcript cleavage or translational arrest [12]. miRNAs have important roles in different cellular processes including developmental/ physiological processes and abiotic/biotic stress [13, 14]. miRNAs are classified into families, whereby miRNAs with the same or very similar sequences are grouped into the same family. It has been shown that only a small number of miRNA families are present across phylogenetically distant species [15-17]. The majority of plant miRNAs are evolutionarily young and are only present in one to few species and it is likely that MIR loci in plant genomes are in a constant dynamic evolutionary state [15, 16]. Although the draft genome of N. benthamiana is available and transcriptome-wide expression studies have been reported [1, 18–20], the miRNA profile of N. benthamiana has not been analysed. In this study, we used a comprehensive molecular and bioinformatic approach to detect and experimentally validate miRNAs and their target mRNAs from five different tissues of two N. benthamiana lines (the N. benthamiana plant, which is used in our laboratory and the sequenced Nb-1 accession). In our study, we used Illumina sequencing to identify small RNAs, their cleaved target mRNAs followed by Northern blot validation of the miRNAs. Furthermore, we examined cleaved miRNA targets from five different tissues of two aforementioned N. benthamiana accessions. We identified 40 conserved and 18 novel microRNAs and validated their target mRNAs in N. benthamiana with a genomic scale method. The targets were found to be involved in various biological processes including transcription, RNA binding, DNA modification, signal transduction, stress response and metabolic processes. We anticipate that this study will serve as a valuable resource of N. benthamiana miRNAs and their validated target mRNAs to the whole plant research community.

Results and discussion

High-throughput sequencing of N. benthamiana small RNAs

To identify and comprehensively describe the expression of endogenous sRNAs in N. benthamiana, sRNAs containing 5′-phosphate and 3′-OH (likely to be a products of Dicer-like proteins) were cloned from roots, stems, leaves, flowers and small seedlings (containing both root tissue and aerial parts) of N. benthamiana plants. The libraries were then sequenced by high-throughput Illumina sequencing. By using biological replicates of five tissues, ten cDNA libraries were made from the N. benthamiana line that is used in our laboratory. To compare the sRNA profile of our N. benthamiana line and the line, which has been sequenced (Nb-1 accession), two additional sRNA libraries were prepared and sequenced from the leaves and small seedlings of Nb-1 accession. These twelve cDNA libraries yielded approximately 90 million reads. Sequences flanked by the 5′ and 3′ Solexa adaptors, with a minimum and maximum length of 16 and 28 nt, respectively, were compared with the draft N. benthamiana genome sequences. More than 15 million non-redundant reads were perfectly matched to at least one locus, and were analysed further (Table 1). The size distribution of sequence reads showed that the majority of sRNAs in our libraries belonged to the 24 nt size class and it was followed by the 21–22 nt size classes (Fig. 1). Our sequence data are in agreement with previous observations which show that in plants, this 24 nt size class is the overall most abundant class of sRNAs [15].
Table 1

High-throughput sequencing statistics of N. benthamiana sRNAs

RedundantNon-redundantRedundantNon-redundant
ReadsMatching N. benthamiana genomeReadsMatching N. benthamiana genomeReadsMatching N. benthamiana genomeReadsMatching N. benthamiana genome
Seedling 1Raw reads4 353 615Stem 1Raw reads2 673 839
Adaptor removed3 192 5631 106 502Adaptor removed2 468 386858 314
Filter by sequence properties3 187 5581 102 305Filter by sequence properties2 464 474854 744
t/rRNA removed3 034 0092 791 6551 085 271942 175t/rRNA removed2 404 4892 205 954846 701746 712
Match known miRNAs262 998259 3941 531792Match known miRNAs121 361119 8191 376860
Seedling 2Raw reads13 565 620Stem 2Raw reads13 390 095
Adaptor removed9 931 8411 048 202Adaptor removed12 682 1603 280 430
Filter by sequence properties9 916 4971 036 381Filter by sequence properties12 662 1173 263 087
t/rRNA removed9 461 1358 656 7901 016 371841 123t/rRNA removed12 195 39611 213 5103 246 5172 773 011
Match known miRNAs1 132 8481 117 7582 4231 030Match known miRNAs664 082654 9843 4551 969
Root 1Raw reads4 416 396Flower 1Raw reads17 715 434
Adaptor removed3 567 326673 485Adaptor removed15 107 4624 920 011
Filter by sequence properties3 561 838669 035Filter by sequence properties15 084 0844 899 263
t/rRNA removed3 250 8832 926 826652 495553 372t/rRNA removed14 374 98913 270 3864 872 7744 186 250
Match known miRNAs368 908363 7931557827Match known miRNAs517 965510 9303 4612 128
Root 2Raw reads6 227 488Flower 2Raw reads8 784 135
Adaptor removed5 472 2372 020 365Adaptor removed8 156 0332 71814
Filter by sequence properties5 463 7912 013 081Filter by sequence properties8 143 554262 570
t/rRNA removed5 041 7194 673 6081 992 0061 746 022t/rRNA removed7 295 5966 508 891253 399185 299
Match known miRNAs308 179304 2762 4021 474Match known miRNAs817 620807 3171 526547
Leaf 1Raw reads4 481 051Leaf BTI Raw reads2 646 118
Adaptor removed3 752 7331 675 147Adaptor removed2 373 816445 460
Filter by sequence properties3 746 9431 670 236Filter by sequence properties2 370 062442 372
t/rRNA removed3 567 9873 252 3431 644 9091 440 195t/rRNA removed2 277 1892 092 901433 297375 749
Match known miRNAs258 610255 1792 3681 502Match known miRNAs234 022231 0821 025489
Leaf 2Raw reads3 801 050Seedling BTI Raw reads7 874 296
Adaptor removed2 925 9451 149 340Adaptor removed7 174 3262 105 046
Filter by sequence properties2 921 4081 145 456Filter by sequence properties7 163 0482 095 876
t/rRNA removed2 834 3542 596 0261 131 710982 432t/rRNA removed6 906 7766 088 7622 070 845287 724
Match known miRNAs274 028270 4761 434737Match known miRNAs786 266423 5812205113
Fig. 1

The size distribution of sequenced small RNA reads. The size class distribution of redundant and non redundant small RNA sequences in the twelve tissue samples. The percentage of the different size classes (y-axis) and the different tissues, which includes seedling, root, leaf, stem and flower with biological replicates and BTI seedling and leaf cDNA libraries (x-axis) represented in N. benthamiana

High-throughput sequencing statistics of N. benthamiana sRNAs The size distribution of sequenced small RNA reads. The size class distribution of redundant and non redundant small RNA sequences in the twelve tissue samples. The percentage of the different size classes (y-axis) and the different tissues, which includes seedling, root, leaf, stem and flower with biological replicates and BTI seedling and leaf cDNA libraries (x-axis) represented in N. benthamiana

Identification of known and conserved miRNAs in N. benthaniana

miRNAs are a well-studied class of regulatory sRNAs. They are present in all plant species studied so far and have important functions in plant development and stress responses. Conserved families of miRNAs were found in many plant species and analysis of data from high-throughput sequencing approaches identified eight deeply conserved miRNA families that are present in all embryophytes [16]. First we looked for known miRNAs by comparing our libraries to known miRNAs from other plant species and we identified 40 known or conserved miRNA families in the twelve sRNA libraries generated (Fig. 2, Additional file 1: Table S1). As expected, all of the deeply conserved miRNA families (miR156/157, miR159/319, miR160, miR165/166, miR171, miR408, miR390/391 and miR395) were present in our data sets (Fig. 2). We analysed the number of reads for conserved miRNAs and miR156, 159, 166, 168, 319, 396 and 403 were represented most frequently in the libraries (Fig. 2). As expected miRNA families expressed at very different levels and some miRNA families showed clear tissue-specific expression changes (Fig. 2). For example, the expression level of miR159 and miR166 families were high, while the expression of miR395 and miR399 families were similarly low in all tissues (Fig. 2). However, for miR395 and miR399 low expression is not surprising because their expression level is generally low under normal growth conditions as they are induced by sulphur or phosphate deficiency respectively [21, 22]. A recent comprehensive atlas of miRNAs from 31 representative species across vascular plants identified 82 known miRNA families that were present across several terrestrial plants. These families formed eight groups depending on their distribution across lineages and species. miRNAs belonging to group 1 were ubiquitous and generally highly expressed across all terrestrial species (miR156, miR166, miR167, miR168 and miR172) while group 2 miRNA families were represented in all taxonomic lineages but could be absent or present at very low abundance levels in some species (miR158, miR159, miR160, miR162, miR164, miR169, miR171, miR319, miR390, miR393, miR394, miR396, miR397, miR529, miR535 and miR4414) [15]. Most of the group 1 and group 2 miRNA families were present in our data sets except the miR529, miR535 and miR4414 families (Fig. 2). Similar to our result, these three miRNA families were absent or their abundance was very low (1–10 reads per million (RPM) or in some cases 11–100 RPM) in sRNA libraries prepared from different Solanaceae species such as petunia, pepper, potato, tomato and tobacco [15]. Furthermore, the remaining six miRNA groups were distributed across species with diverse lineage enrichment and one of them (group 8, containing miR1919, miR4376, miR5300, miR53001, miR6022, miR6023, miR6024, miR6025, miR6026, miR6027 and miR6149) was shown to be predominant in Solanaceae species [15]. As expected, in our libraries we identified several miRNA families that belong to group 8 such as miR1919, miR4376, miR6025 and miR6149. Among the group 8 miRNA families the miR6149 abundance level was the highest in our libraries similar to the tobacco libraries reported by Chavez Montes et al. [15]. Some of the group 8 miRNA families were missing from our libraries (miR5300, miR5301, miR6022, miR6023, miR6024, miR6026 and miR6027), however those miRNA families also showed low abundance (0–100 RPM) in the reported tobacco libraries (such as miR5300, miR5301, miR6023 and miR6027) [15].
Fig. 2

Conserved and other known miRNAs in N. benthamiana. Normalized read numbers of conserved and other known miRNAs across the tissues included in this study. Expression profiles are expressed in reads per million (RPM) genome matching reads. Heat map colours represents absolute normalized levels of miRNA expression ranging from less than 1 RPM (white) to more than 1000 RPM (red) as indicated in the colour key. We have identified 40 known or conserved miRNA families in the twelve sRNA libraries generated. All of the deeply conserved miRNA families (miR156/157, miR159/319, miR160, miR165/166, miR171, miR408, miR390/391 and miR395) were present in our data sets. The expression levels of different miRNA families were different and we also observed clear tissue-specific expressional changes within some miRNA families as it was expected

Conserved and other known miRNAs in N. benthamiana. Normalized read numbers of conserved and other known miRNAs across the tissues included in this study. Expression profiles are expressed in reads per million (RPM) genome matching reads. Heat map colours represents absolute normalized levels of miRNA expression ranging from less than 1 RPM (white) to more than 1000 RPM (red) as indicated in the colour key. We have identified 40 known or conserved miRNA families in the twelve sRNA libraries generated. All of the deeply conserved miRNA families (miR156/157, miR159/319, miR160, miR165/166, miR171, miR408, miR390/391 and miR395) were present in our data sets. The expression levels of different miRNA families were different and we also observed clear tissue-specific expressional changes within some miRNA families as it was expected We also identified several miRNA families which were identified previously as tobacco (N. tabacum) specific miRNAs such as miR6019, miR6020, miR6147, miR6151, miR6153, miR6155, miR6157 and miR6161 [23, 24]. These miRNA families generally showed low abundance across all tissues in our libraries with the exception of miR6153, which was expressed at medium levels in all samples (160–450 RPM across all tissue) and miR6155, which was relatively abundant in leaf and seedling samples (175–525 RPM) (Fig. 2). We checked the size distribution and starting nucleotide of the conserved and known miRNA families in our libraries. In 30 miRNA families, 21 nt long miRNAs were the most significant size class (Fig. 3a). In several cases they were accompanied by additional size classes, mostly 20 nucleotide long sequences or in some cases 22 nucleotide sequences were present in significant proportions (miR4376, miR393, miR167 and miR6157). Three miRNA families exhibited a strong preference for 20 nucleotide sequences and in seven miRNA families the 22 nucleotide long sequences were the most abundant size class (Fig. 3a). Our observation that the miR394 has a predominant 20 nucleotide size in our libraries is consistent with previous result and it might suggest a functional consequence since its size and sequence is conserved in terrestrial plant species [15]. It was shown in several cases that size specificity has a functional consequence, like the 22 nucleotide preference of phase initiating miRNAs associated with the generation of tasiRNAs [25, 26]. We found seven miRNA families with the 22 nucleotide sequence dominance and two of them (miR6019 and miR482) belongs to the described phase initiating miRNAs [23, 27, 28]. It is likely they are also involved in tasiRNA generation in N. benthamiana.
Fig. 3

Size distribution and starting nucleotide of the conserved and other known miRNAs. The relative abundance of different size categories (a), from 20 to 24 nucleotides is shown for the conserved and other known miRNAs presented in Fig. 2. The relative abundance of the 5′-nucleotide (b) is shown for the conserved and other known miRNAs presented in Fig. 2. In 31 miRNA families, 21 nt long miRNAs were the most significant size class. In 32 of the 40 conserved miRNA families, the majority of sequences started with uracyl at their 5′-end, although a portion of these have a different nucleotide composition at position 1 to a variable degree. Three miRNA families exhibited a preference for an adenine at position 1

Size distribution and starting nucleotide of the conserved and other known miRNAs. The relative abundance of different size categories (a), from 20 to 24 nucleotides is shown for the conserved and other known miRNAs presented in Fig. 2. The relative abundance of the 5′-nucleotide (b) is shown for the conserved and other known miRNAs presented in Fig. 2. In 31 miRNA families, 21 nt long miRNAs were the most significant size class. In 32 of the 40 conserved miRNA families, the majority of sequences started with uracyl at their 5′-end, although a portion of these have a different nucleotide composition at position 1 to a variable degree. Three miRNA families exhibited a preference for an adenine at position 1 Sorting of sRNAs into AGO complexes is influenced by their 5′ terminal nucleotide. Hence, we checked the 5′-nucleotide distribution of the conserved and known miRNA families in our libraries, since it is an important feature that is correlating to their biogenesis and function [29]. In 34 of the 40 conserved miRNA families, the majority of sequences started with uracyl at their 5′-end, although a portion of these have a different nucleotide composition at position 1 to a variable degree (Fig. 3b). Three miRNA families exhibited a preference for an adenine at position 1, although one of them has a different nucleotide composition at position 1 to a variable degree (miR172). In three miRNA families the most frequently found first nucleotide at their 5′-end was cytosine (miR395, miR479, miR477). miRNAs mostly are incorporated into the AGO1 effector complex to give sequence specificity to either slice or translationally repress their target mRNAs [12]. AGO1 harbours miRNAs that favour a 5′ terminal uridine [30]. In line with this observation, most of the conserved and known miRNA families in our libraries started with uracyl at their 5′-end. We also found several miRNAs that start with either adenine or cytosine (miR172, miR390 and miR395). It was reported in many different plant species that miRNAs belonging to these miRNA families are starting with the same nucleotide at the 5′-position [15]. Many plant miRNAs show tissue-specific and developmental-stage-specific expression patterns and they have important roles during development and stress adaptation [13]. To validate the tissue-specific expression patterns of selected conserved and known miRNAs we carried out Northern blot analysis using seedling (Se), root (R), leaf (L), stem (St) and flower (F) samples isolated from our N. benthamiana line; and leaf (LBTI) and seedling (SeBTI) samples from N. benthamiana Nb-1 accession that were obtained from the Boyce Thompson Institute for Plant Research. Northern analysis usually was in broad agreement with the sequencing data as in the cases of miR156, miR159, miR162, miR164, miR165, miR168, miR390 and miR393 (Fig. 4). In certain cases we observed a discrepancy between the Northern blot and sequencing data, as in the cases of miR167, miR171, miR482, miR6025 and miR6149 (Fig. 4; Fig. 2). However, these differences are not unexpected as it has been reported that Northern analysis are not always in full agreement with the sequencing data, presumably due to the inherent biases in next-generation sequencing technologies [31-36]. Theoretically, the oligoprobes used for Northern analysis might have detected heterogeneous miRNAs. However, we expect that Northern analysis should reveal more accurate in vivo expression patterns. We also performed RT-qPCR analysis to validate our Northern results by checking the expression pattern of miR167 and miR482 in N. benthamiana tissues. We found that the results of Northern analysis and RT-qPCR were identical (Additional file 2 Figure S1.). Furthermore, comparing the Northern analysis data of our N. benthamiana line to the Nb-1 accession to detect tissue specific expression pattern of miRNAs, we were not able to confirm the differences occasionally observed in the miRNA read numbers at certain miRNA families (in seedlings: miR156, miR168, miR319, miR396, miR482 and miR6149; in leaf: miR156, miR319, miR398, miR403 and miR1919) (Fig. 4). Our miRNA expression data therefore might indicate that these N. benthamiana lines are closely related based on their very similar miRNA expression patterns.
Fig. 4

Expression analysis of selected conserved and other known miRNAs in different N. benthamiana tissues. Total RNA was extracted from different tissues including, seedling (Se), root (R), leaf (L), stem (St), flower (F) from N. benthamiana plants used in our laboratory and from plants from Boyce Thompson Institute (leafBTI - LBTI, seedlingBTI - SeBTI). The RNA was separated on PAGE and transferred to nylon membranes for Northern blot analysis of the miRNAs. Oligonucleotide probes were used to detect specific miRNAs, and an U6-specific probe was used to detect U6 RNA as a loading control for each membrane

Expression analysis of selected conserved and other known miRNAs in different N. benthamiana tissues. Total RNA was extracted from different tissues including, seedling (Se), root (R), leaf (L), stem (St), flower (F) from N. benthamiana plants used in our laboratory and from plants from Boyce Thompson Institute (leafBTI - LBTI, seedlingBTI - SeBTI). The RNA was separated on PAGE and transferred to nylon membranes for Northern blot analysis of the miRNAs. Oligonucleotide probes were used to detect specific miRNAs, and an U6-specific probe was used to detect U6 RNA as a loading control for each membrane

Targets of conserved and known miRNAs

To generate a miRNA cleaved target library (degradome) from N. benthamiana we applied a high-throughput experimental approach that can identify mRNAs targeted by sRNAs [37]. The poly-A fraction of total RNA extracted from seedling (Se), root (R), leaf (L), stem (St), flower (F) of N. benthamiana plants; and leaf (LBTI) and seedling (SeBTI) of the Nb-1 accession were analysed for the identification of target transcripts of conserved, other known, and new miRNAs. We obtained a total number of 47 million short sequencing reads representing the 5′ ends of uncapped, poly-adenylated RNAs. After initial processing, equal numbers of 20- and 21-nt sequence reads were obtained, and 48 % of the short sequencing reads could be mapped to the N. bentamiana transcriptome. In plants, miRNA mediated mRNA cleavage is highly specific and miRNAs have been shown to bind with near perfect complementarity to their mRNA targets, which generally leads to the slicing of the mRNA between positions 10 and 11 of the AGO1 bound miRNA. As a consequence, the cleaved mRNA targets should have distinct peaks in the degradome sequence tags at the predicted cleavage site relative to the other regions of the transcript [38, 39]. In our analysis we have applied the PAREsnip pipeline [40] to identify cleaved targets for both known (conserved and non-conserved) and new miRNAs in N. benthamiana. Abundance of the sequenced tags was plotted on each transcript and the results are shown on Additional file 3: Figure S2. The cleaved target transcripts have been categorized into five classes (categories 0, 1, 2, 3 and 4) as it was defined previously in CleaveLand (version2) [41]. In our target list we kept high-confidence miRNA-target gene interactions (categories 0, 1, 2) and only category 3 targets as low–confidence miRNA-target pairs. Category 4 targets are defined as only one raw read at the expected cleavage position, however it might also be the result of the degradation of the target RNA, therefore they were omitted from our target list. This approach identified a total of 55 target mRNAs, for conserved and other known miRNAs (Table 2, Additional file 4: Table S2).
Table 2

Targets of conserved and other known miRNAs in Nicotiana benthamiana a

miRNA familymiRNA sequenceGene IDClevage positionCategoryNormalized abundancea Annotation
Conserved miRNAs and their targets
Nb_miR156TGACAGAAGAGAGTGAGCACcomp77119_c0_seq6 2667_2942178702,40Squamosa promoter-binding-like protein 6
Nb_miR156TGACAGAAGAGAGTGAGCACcomp69399_c0_seq4 393_94465021,28Squamosa promoter-binding-like protein 16
Nb_miR159TTTGGATTGAAGGGAGCTCTAcomp71582_c0_seq2 1_1230673040,63No annotationb
Nb_miR159TTTGGATTGAAGGGAGCTCTAcomp69815_c0_seq4 1_89460100,39Transcription factor GAMYB
Nb_miR159TTTGGATTGAAGGGAGCTCTAcomp73942_c0_seq1 927_2375183902,14Transcription factor GAMYB
Nb_miR160TGCCTGGCTCCCTGTATGCCAcomp78286_c0_seq3 693_278320650307,20Auxin response factor 18
Nb_miR160TGCCTGGCTCCCTGTATGCCAcomp79462_c0_seq1 2477_3379265806,41Auxin response factor 16
Nb_miR164TGGAGAAGCAGGGCACGTGCAcomp72325_c0_seq2 310_130599403,43Protein CUP-SHAPED COTYLEDON 2
Nb_miR165/166TCGGACCAGGCTTCATTCCCCcomp80060_c0_seq5 1298_38171868051,20Homeobox-leucine zipper protein REVOLUTA
Nb_miR165/166TCGGACCAGGCTTCATTCCTCcomp79517_c0_seq6 1043_355616010246,61Homeobox-leucine zipper protein ATHB-15
Nb_miR165/166TCGGACCAGGCTTCATTCCTCcomp79347_c3_seq8 336_61154611,60No annotationb
Nb_miR167AGATCATGTGGTAGCTTCACCcomp79742_c0_seq4 124_2421170025,905-methyltetrahydropteroyltriglutamate--homocysteine methyltransferaseb
Nb_miR167AGATCATGTGGTAGCTTCACCcomp74516_c0_seq2 292_2619116023,54Subtilisin-like proteaseb
Nb_miR168TCGCTTGGTGCAGGTCGGGACCcomp79927_c1_seq7 1_2438761,60Protein argonaute 1A
Nb_miR169TAGCCAAGGATGACTTGCCTcomp72459_c1_seq2 1230_1673172420,19Nuclear transcription factor Y subunit A-5/CCAAT-binding transcription factor
Nb_miR169TAGCCAAGGATGACTTGCCTcomp72338_c0_seq4 1046_1396143920,29Nuclear transcription factor Y subunit A-5/CCAAT-binding transcription factor
Nb_miR169TAGCCAAGGATGACTTGCCTcomp73705_c1_seq8 1254_1739183613,15Nuclear transcription factor Y subunit A-8/CCAAT-binding transcription factor
Nb_miR169TAGCCAAGGATGACTTGCCTcomp67524_c0_seq9 544_1167124601,96Nuclear transcription factor Y subunit A-10/CCAAT-binding transcription factor
Nb_miR171TTGAGCCGCGCCAATATCACGcomp76988_c0_seq2 277_1911896026,05Scarecrow-like protein 15
Nb_miR171TTGAGTCGCGCCAATATCACTcomp80119_c0_seq4 672_25731552034,34Scarecrow-like protein 6
Nb_miR171TTGAGTCGCGCCAATATCACTcomp75200_c0_seq2 459_271716960109,89Scarecrow-like protein 6
Nb_miR172AGAATCTTGATGATGCTGCAGcomp78340_c2_seq4 552_1409220004,42Floral homeotic protein APETALA 2
Nb_miR319TTGGACTGAAGGGAGCTCCCTcomp68096_c0_seq1 79_1191104601,02Transcription factor TCP4
Nb_miR319TTGGACTGAAGGGAGCTCCCTcomp73714_c2_seq12 2093_231720060,98Transcription factor TCP2
Nb_miR393TCCAAAGGGATCGCATTGATCCcomp80206_c0_seq2 468_4724423520,29No annotationb
Nb_miR394TTGGCATTCTGTCCACCTCCcomp60101_c0_seq1 5351_5605529122,16Tripeptidyl-peptidase 2b
Nb_miR395CTGAAGTGTTTGGGGGAACTCcomp75863_c0_seq1 200_160955320,53ATP sulfurylase 1, chloroplastic
Nb_miR395CTGAAGTGTTTGGGGGAACTCcomp29318_c1_seq1 108_62025401,07Sulfate transporter 2.1
Nb_miR396TTCCACATCTTTCTTGAACTGcomp71133_c0_seq1 1089_135593522,10Protein THYLAKOID FORMATION1, chloroplasticb
Nb_miR396TTCCACAGCTTTCTTGAACTGcomp77639_c0_seq4 942_2765172304,42Growth-regulating factor 6
Nb_miR396TTCCACAGCTTTCTTGAACTGcomp67678_c0_seq1 205_573279012,19Thioredoxin H-type 1b
Nb_miR396TTCCACAGCTTTCTTGAACTGcomp50987_c1_seq1 2_37914410,4840S ribosomal protein S15b
Nb_miR396TTACACAGCTTTCTTGAACTGcomp69468_c0_seq1 204_1724142520,58Protein IQ-DOMAIN 14b
Nb_miR396TTCCACAGCTTTCTTGAACTGcomp77795_c0_seq2 2_178015460,58Pentatricopeptide repeat-containing protein At5g04810, chloroplasticb
Nb_miR396cTTCCACAGCTTTCTTGAACTGcomp77334_c1_seq28 1342_2196137003,74Growth-regulating factor 5
Nb_miR396TTCCACAGCTTTCTTAAACTGcomp79406_c0_seq1 393_3506193220,29No annotationb
Nb_miR396TTCCACAGCTTTCTTGAACTGcomp76253_c0_seq1 643_155776700,19Growth-regulating factor 9
Nb_miR396TTCCACAGCTTTCTTGGACTGcomp69128_c0_seq2 114_80652120,19DNA-directed RNA polymerase V subunit 5A-l ikeb
Nb_miR397TCATTGAGTGCAGCGTTGATGcomp73318_c2_seq1 87_110988324,42Glyceraldehyde-3-phosphate dehydrogenase, cytosolicb
Nb_miR397TCATTGAGTGCAGCGTTGATGcomp75344_c1_seq2 127_183981020,24Laccase-7
Nb_miR397TCATTGAGTGCAGCGTTGATGcomp77652_c0_seq1 1261_317717620,29Probable serine/threonine-protein kinase abkCb
Nb_miR398TATGTTCTCAGGTCGCCCCTGcomp100607_c0_seq1 102_5577809,46Umecyanin
Nb_miR408TGCACTGCCTCTTCCCTGGCTcomp85708_c0_seq1 121_624630015,30Uclacyanin-2
Nb_miR408TGCACTGCCTCTTCCCTGGCTcomp79061_c1_seq3 184_14732346012,38Copper-transporting ATPase PAA2, chloroplastic
Other known miRNAs and their targets
Nb_miR482TTCCAATTCCACCCATTCCTAcomp75828_c0_seq3 249_1742160921,07No annotationb
Nb_miR827TTAGATGAACATCAACAAACTcomp75426_c0_seq1 2_226198221,36UPF0496 protein 4
Nb_miR4376TACGCAGGAGAGATGATACTGcomp79500_c0_seq4 305_356228420,98Calcium-transporting ATPase 8, plasma membrane-type
Nb_miR6020AAATGTTCTTCGAGTATCTTCcomp70698_c0_seq1 752_155599610,68GPN-loop GTPase 3 homologb
Nb_miR6020AAATGTTCTTCGAGTATCTTCcomp74553_c0_seq5 265_70259601,47No annotationb
Nb_miR6020AAATGTTCTTCGAGTATCTTCcomp75715_c0_seq5 271_2055154420,27Probable methylenetetrahydrofolate reductaseb
Nb_miR6149TTGATACGCACCTGAATCGGGcomp80883_c0_seq1 2_3101420,27Tubulin alpha chainb
Nb_miR6149TTGATACGCACCTGAATCGGGcomp75221_c0_seq2 455_1678164621,77Serine/threonine-protein kinase HT1b
Nb_miR6151TGAGTGTGAGGCGTTGGATTGAcomp84169_c0_seq1 105_112733422,10Probable carboxylesterase 8b
Nb_miR6157TGGTAGACGTAGGATTTGAAAcomp66224_c0_seq2 136_7802590,68Ras-related protein RABA2bb
Nb miR6161TGCTGGACCGACATACTTTGTcomp74264_c0_seq1 147_105234320,4860S ribosomal protein L5b

a The abundance for the degradome fragment (tag) indicating cleavage at that position/total number of fragments x 1,000,000

b New target in N. benthamiana for conserved and other known miRNAs

Targets of conserved and other known miRNAs in Nicotiana benthamiana a a The abundance for the degradome fragment (tag) indicating cleavage at that position/total number of fragments x 1,000,000 b New target in N. benthamiana for conserved and other known miRNAs Altogether we found 44 targets for conserved miRNAs, from which 23 were classed as category 0, six as category 1 and fifteen as category 2. For other known miRNAs we have found 11 targets, from which one was classed as category 0, two as category 1 and eight as category 2. We identified targets for 18 conserved miRNA families out of 23. Some of the conserved miRNAs without any identified targets (miR390 and miR399) were expressed at a low level in all of the sRNA libraries (Fig. 2), which may explain the lack of detection of cleaved targets. However, both miR162 and miR403 were expressed at a considerable level in at least some of the tissues (Fig. 2) without mRNA targets identified in the degradome libraries. As an alternative explanation these miRNAs might be engaged in translational repression, however in different plant species cleaved target mRNAs were identified as their target which makes this explanation less plausible [13]. We have confirmed 29 conserved targets of conserved miRNAs already identified in different plants [13]. Many of the identified conserved targets are members of different families of transcription factors, such as squamosal promoter-binding (SBP), MYB, ARF, NAC, HD-ZIPIII, CCAAT-binding, AP2 and TCP. In addition to these targets we also identified 15 new targets in N. benthamiana for conserved miRNAs (Table 2). We found one target of the non-conserved miR4376 which was already identified in tomato [42]. We also identified nine new targets of seven non-conserved miRNAs (Table 2). Most of the non-conserved miRNA families for which no target had been identified were expressed at a very low level in all tissues (miR170, miR477, miR1446, miR6019, miR6025, miR6147), except miR479, miR1919, miR6153 and miR6155 (expressed at moderate level at least in one sample) (Table 2, Fig. 2).

N. benthamiana specific miRNAs

We used strict criteria to identify new miRNAs. Briefly, the sRNA reads were mapped to the N. benthamiana genome and secondary structures were predicted for each locus. Based on the hairpin prediction, the presence of miRNA* strands in the sRNA libraries and validated target mRNA in the degradome libraries we found 18 new miRNA candidates (Fig. 5, Additional file 5: Figure S3.). Nb_miRC1_3p was the most abundant Nicotiana specific miRNA in our sRNA libraries and it expressed in all of the tissues. The sequence of Nb_miRC1_3p was also reported previously from a large scale sRNA dataset as endogenous small RNA of Nicotiana attenuata [43], however it was not further investigated which small RNA class it belongs to. The second most abundant miRNA was the Nb_miRC2_3p and it was also present in all of the small RNA libraries. For the remaining N. benthamiana specific miRNAs the relative number of reads in all tissues was low (Fig. 5). However, this result is in line with the observation shown previously, that miRNA abundance decreases as the conservation of the sequence decreases [15].
Fig. 5

Novel miRNAs in N. benthamiana. Normalized read numbers of novel miRNAs across the tissues included in this study. Expression profiles are expressed in reads per million genome matching reads. Heat map colours represents absolute normalized levels of miRNA expression ranging from less than 1 RPM (white) to more than 1000 RPM (red) as indicated in the colour key

Novel miRNAs in N. benthamiana. Normalized read numbers of novel miRNAs across the tissues included in this study. Expression profiles are expressed in reads per million genome matching reads. Heat map colours represents absolute normalized levels of miRNA expression ranging from less than 1 RPM (white) to more than 1000 RPM (red) as indicated in the colour key As a next step, we checked the size distribution and starting nucleotide of the new N. benthamiana specific miRNAs in our libraries. The 21 nt long miRNAs were the most significant size class and half of the new miRNAs belong to this group (Fig. 6a). In several cases they were accompanied by additional size class, 20 nucleotide long sequences were present in significant proportions (Nb_miRC18_3p and Nb_miRC11_5p). Three miRNAs exhibited a strong preference for 22 nucleotide sequences (Nb_miRC6_3p, Nb_miRC4_3p and Nb_miRC14_5p) and in five miRNA families the 24 nucleotide long sequences were the most abundant size class (Nb_miRC8_3pa, Nb_miRC17_5p, Nb_miRC7_3p, Nb_miRC13_3p and Nb_miRC10_3p). We found that two miRNA families showed the coexistence of 20–24 nucleotide variants with the 24 nucleotides being the most abundant variants (Nb_miRC5_3p and Nb_miRC9_5p) (Fig. 6a).
Fig. 6

Size distribution and starting nucleotide of the novel N. benthamiana specific miRNAs. The relative abundance of different size categories (a), from 20 to 24 nucleotides is shown for the novel miRNAs presented in Fig. 5. The relative abundance of the 5′-nucleotide (b) is shown for the novel miRNAs presented in Fig. 5

Size distribution and starting nucleotide of the novel N. benthamiana specific miRNAs. The relative abundance of different size categories (a), from 20 to 24 nucleotides is shown for the novel miRNAs presented in Fig. 5. The relative abundance of the 5′-nucleotide (b) is shown for the novel miRNAs presented in Fig. 5 Next we checked the 5′-nucleotide distribution of the new miRNA families in our libraries. Seven started with uracil at their 5′-end (Nb_miRC16_3p, Nb_miRC14_5p, Nb_miRC1_3p, Nb_miRC18_3p, Nb_miRC15_5p, Nb_miRC3_5p and Nb_miRC2_3p) (Fig. 6b). Eight miRNA families exhibited a preference for an adenine at position 1, although six of these have a different nucleotide composition at position 1 to a variable degree (NB_miRC9_5p, NB_miRC5_3p, NB_miRC8_3p_a, NB_miRC13_3p, NB_miRC7_3p and NB_miRC10_3p). In three miRNA families the most frequently found first nucleotide at their 5′-end was cytosine (Nb_miRC11_5p, Nb_miRC4_3p and Nb_miRC12_5p). One miRNA family exhibited a preference for a guanine at position 1, although a significant fraction has uracil at the 5′-position (Nb_miRC6_3p) (Fig. 6b). Although the majority of plant miRNAs have a uracil at position 1, it is not unusual that some miRNA families contain adenine, cytosine and even guanine in significant proportions at the 5′-position [15]. Young miRNAs are often weakly expressed, processed imprecisely or lack targets [16]. Our data fits well with these previous observations since most of the N. benthamiana specific miRNAs showed low relative abundance in our libraries (below 25 RPM; except Nb_miRC1_3p, Nb_miRC2_3p and Nb_miRC3_5p) and in several cases they were accompanied by additional size classes for various degrees. Furthermore, some of the new miRNAs identified in our sRNA libraries are produced from long hairpin precursors that may also indicate the young evolutionary origin (Nb_miRC2_3p, Nb_miRC3_5p, Nb_miRC5_3p, Nb_miRC8_3pa and Nb_miRC18_3p) (Additional file 5: Figure S3). This situation is well observable on the Nb_miRC8 precursor where two miRNA duplexes were both identified in the sRNA libraries and detected by Northern blots (Additional file 6: Figure S4). However, we did not identify miRNAs without target mRNAs in our data set, since we used the presence of cleaved target mRNA as a criterion to select new miRNAs. We used Northern blot analysis to check the expression level of N. benthamiana specific miRNAs in five tissues (seedling, root, leaf, stem and flower). The miRNAs that gave signal on the Northern blot, in most cases were detectable in all of the tissues tested (Fig. 7). Some of the new miRNAs were abundant in specific tissues (Nb_miRC10_3p in stem and flower; Nb_miRC16_3p in seedling and flower), and many of them showed differential expression in the tissues analysed (Nb_miRC1_3p, Nb_miRC2_5p, Nb_miRC10_3p, Nb_miRC11_5p and Nb_miRC16_3p) (Fig. 7). The relative number of reads in all tissues was very low for most of the miRNAs (Nb_miRC1_3p, Nb_miRC2_3p and Nb_miRC3_5p was an exception) and this was also reflected in the Northern blot analysis, since we did not obtain signal for six miRNAs (Nb_miRC5_5p, Nb_miRC12_5p, Nb_miRC13_3p, Nb_miRC14_5p, Nb_miRC15_5p and Nb_miRC18_3p). The expression profiles obtained by Northern blot analysis were different from the sequencing data for Nb_miRC3_5p, Nb_miRC6_3p, Nb_miRC8_3pa, Nb_miRC11_5p and Nb_miRC17_5p. However, for the other new miRNAs (Nb_miRC1_3p, Nb_miRC7_3p, Nb_miRC9_5p and Nb_miRC10_3p), sequencing data and expression profiles obtained by Northern blot analysis showed good correlation (Fig. 5 and Fig. 7). Although the relative read numbers for most of the new miRNAs were low, we were able to confirm the presence of 13 N. benthamiana specific miRNAs by Northern blot.
Fig. 7

Expression patterns of novel miRNAs found in N. benthamiana. Total RNA was extracted from different tissues including, seedling (Se), root (R), leaf (L), stem (St), flower (F) from N. benthamiana plants used in our laboratory and from plants from Boyce Thompson Institute (leafBTI - LBTI, seedlingBTI - SeBTI). The RNA was separated on PAGE and transferred to nylon membranes for Northern blot analysis of the novel miRNAs. Oligonucleotide probes were used to detect specific miRNAs, and an U6-specific probe was used to detect U6 RNA as a loading control for each membrane

Expression patterns of novel miRNAs found in N. benthamiana. Total RNA was extracted from different tissues including, seedling (Se), root (R), leaf (L), stem (St), flower (F) from N. benthamiana plants used in our laboratory and from plants from Boyce Thompson Institute (leafBTI - LBTI, seedlingBTI - SeBTI). The RNA was separated on PAGE and transferred to nylon membranes for Northern blot analysis of the novel miRNAs. Oligonucleotide probes were used to detect specific miRNAs, and an U6-specific probe was used to detect U6 RNA as a loading control for each membrane

Targets of N. benthamiana specific miRNAs

We identified 32 targets for new miRNAs using our degradome libraries. Interestingly, in the case of Nb_miRC5 we found target for both the mature and the star strand of the new miRNA (Table 3; Additional file 7: Figure S5; Additional file 8: Table S3). Based on the degradome data, among the identified target mRNAs of the specific, new N. benthamiana miRNAs, three were classed as category 0, two as category 1, twenty-two as category 2 and five as category 3. Comparing the target list of the known (conserved and non-conserved) and new miRNAs the most obvious difference between them is that the known miRNA targets mostly belong to category 0 and 1 (Tables 2 and 3). Based on their molecular function both groups regulate various genes that are included in transcriptional regulation or enzymatic activity (Additional file 9: Figure S6.).
Table 3

Targets of novel miRNAs found in Nicotiana benthamiana

new miRNA IDGene IDCleavage positionCategoryAlignment scoreNormalized abundancea Annotation
Nb_miRC1_3pcomp79937_c1_seq1 312_105886233,50,2Elongation factor 1-alpha 4
Nb_miRC1_3pcomp78325_c0_seq3 335_2797112811,00,2Protein ROS1
Nb_miRC2_3pcomp75577_c0_seq17 1347_198240524,01,2Down syndrome critical region protein 3 homolog
Nb_miRC3_5pcomp79768_c0_seq4 287_3412186823,51,1Elongation factor Ts
Nb_miRC4_3pcomp76033_c0_seq2 367_170191724,01,9CBL-interacting serine/threonine-protein kinase 5
Nb_miRC5_3pcomp73317_c0_seq28 1673_2236296703,52,1Peroxisome biogenesis factor 10
Nb_miRC5_3pcomp69461_c0_seq1 371_982138020,50,2Protein CREG1
Nb_miRC5_3pcomp76292_c1_seq18 500_808163221,50,2Probable WRKY transcription factor 53
Nb_miRC5_3pcomp72235_c0_seq2 239_1444172022,50,3Cytochrome b561 and DOMON domain-containing protein At5g47530-like
Nb_miRC5_5pcomp70843_c0_seq1 178_102352824,01,4Protein TIC 21, chloroplastic
Nb_miRC6_3pcomp75362_c2_seq1 7645_8472239923,50,2G-type lectin S-receptor-like serine/threonine-protein kinase At1g11330
Nb_miRC7_3pcomp73495_c0_seq6 312_1019128823,51,4Proteasome subunit alpha type-2-A
Nb_miRC7_3pcomp56753_c0_seq1 215_45751702,05,5WAT1-related protein At5g40240-like
Nb_miRC8_3p_acomp71383_c0_seq6 99_857131732,50,6Aquaporin TIP1-3
Nb_miRC8_3p_acomp75266_c2_seq4 3193_3486345421,00,7Probable isoaspartyl peptidase/L-asparaginase 3
Nb_miRC9_5pcomp71496_c0_seq1 489_146346224,00,2Serine/threonine-protein phosphatase 2A regulatory subunit B” subunit alpha
Nb_miRC9_5pcomp64986_c0_seq1 125_487123222,50,260S ribosomal protein L34
Nb_miRC10_3pcomp77607_c3_seq4 3_1151231724,00,2Suberization-associated anionic peroxidase
Nb_miRC10_3pcomp80078_c3_seq4 193_2097213121,01,1Formate--tetrahydrofolate ligase
Nb_miRC10_3pcomp72702_c0_seq4 152_2431257532,01,0Beta-amyrin synthase
Nb_miRC11_5pcomp82078_c0_seq1 3_127793823,52,7Threonine deaminase
Nb_miRC11_5pcomp78978_c0_seq3 374_2785255723,00,4Transcription factor RF2a/ Probable transcription factor PosF21
Nb_miRC11_5pcomp89465_c0_seq1 218_123753234,00,2Ervatamin-B-like
Nb_miRC12_5pcomp79295_c0_seq1 283_3321311423,52,7Probably inactive leucine-rich repeat receptor-like protein kinase At3g28040
Nb_miRC12_5pcomp56752_c0_seq156813,50,7ZRT/IRT-like protein 1
Nb_miRC13_3pcomp77835_c0_seq3 61_62179722,50,3Disease resistance response protein 206
Nb_miRC14_5pcomp74263_c0_seq1 230_202996624,03,8Chaperonin 60 subunit beta 2, chloroplastic
Nb_miRC15_5pcomp78088_c1_seq1 1536_2366325134,00,7Peroxisome biogenesis protein 19-1
Nb_miRC15_5pcomp85070_c0_seq1 3_2648215122,51,0(E,E)-geranyllinalool synthase
Nb_miRC16_3pcomp88815_c0_seq1 2_1909184522,50,9Polyphenol oxidase, chloroplastic
Nb_miRC17_5pcomp63365_c1_seq3 3_2422901,01,3Heterotrimeric GTP binding protein alpha subunit
Nb_miRC18_3pcomp67145_c0_seq1 106_93036123,52,050S ribosomal protein L3, chloroplastic

a The abundance for the degradome fragment (tag) indicating cleavage at that position/total number of fragments x 1,000,000

Targets of novel miRNAs found in Nicotiana benthamiana a The abundance for the degradome fragment (tag) indicating cleavage at that position/total number of fragments x 1,000,000 The new miRNAs target different genes with a wide variety of predicted functions. One of the three identified category 0 targets of the new miRNAs is the mRNA of the heterotrimeric GTP binding protein (G proteins) alpha subunit (targeted by Nb_miR17_5p). G proteins are highly conserved in eukaryotes. Plant G proteins play an important regulatory role in multiple physiological processes. They are involved in signal transduction from hormone receptors, including the plant hormones gibberellin and abscisic acid that regulate gene expression; secretion, defence, stomata movements, channel regulation, sugar sensing and cell death [44]. The heterotrimeric G protein complex is minimally composed of three subunits: alpha, beta and gamma. Plants have only a small number of genes encoding the G proteins subunits. Arabidopsis has only one canonical G protein alpha subunit, one beta subunit and three gamma subunit [45]. Plant alpha subunits are highly conserved between species, it binds and hydrolyse GTP and relay signals by interacting with downstream effector proteins [46]. Interestingly, during convergent evolution human G protein alpha-13 (GNA13) expression is also regulated by miRNA (miR-31) in breast cancer cells [47]. Another category 0 target is the mRNA of Peroxisome Biogenesis Factor 10 (PEX10). It is cleaved by Nb_miRC5_3p and has been demonstrated to be a component of the peroxisomal matrix protein import machinery. Peroxisomes are eukaryotic organelles and they are involved in a wide range of processes. Plant peroxisomes are important in primary and secondary metabolism, development, and responses to abiotic and biotic stresses. Previous studies identified 22 PEX genes in the Arabidopsis genome and demonstrated that they play a pivotal role in the import of peroxisomal matrix proteins [48]. PEX 10 perform multiple functions, including the biogenesis of ER-derived protein and oil bodies [49], the maintenance of ER morphology, the formation of cuticular wax [50], peroxisome and chloroplast connections [51]. One of the category 1 targets is REPRESSOR OF SILENCING (ROS1) which is cleaved by Nb_miRC1_3p. Interestingly, the sequence of Nb_miRC1_3p can also be found in Nicotiana attenuata [43] and its target site on ROS1 is conserved in other Nicotiana species as well. ROS1 encodes a nuclear DNA glycosylase domain protein that catalyse the release of 5-methylcytosine (5-meC) from DNA via a base excision repair reaction [52]. DNA cytosine methylation is an important epigenetic mark conserved in many eukaryotes. It has important roles in transposon silencing, genome organization, genomic imprinting and regulation of gene expression [53]. During continuous adaptation to variable external and internal environments DNA methylation patterns are dynamically controlled by methylation and demethylation reactions. ROS1 plays an important role in active DNA demethylation and helps to maintain gene expression [54, 55]. The targeting of ROS1 by a miRNA provides a new regulatory layer to reinforce transcriptional gene silencing by a post-transcriptional repression of its activity. The targeting of a DNA glycosylase (DEMETER-LIKE protein3) by a miRNA (miR402) was first predicted in Arabidopsis [56] and later it was shown that the expression of DEMETER-LIKE protein3 mRNA was down-regulated in miR402-overexpressing transgenic plants [57]. The other category 1 target is a zinc-regulated transporter (ZRT), iron-regulated transporter (IRT)-like protein 1 (ZRT/IRT-like protein 1) and is cleaved by NB-miRC12_5p. ZRT/IRT-like protein family has been demonstrated to be involved in metal uptake and transport. ZRT/IRT-like proteins generally contribute to metal ion homeostasis by transporting cations into the cytoplasm [58]. It has been reported in several plant species that various miRNAs are vital for maintaining nutrient homeostasis by regulating the expression of transporters that are involved in nutrient uptake and mobilization [59]. Other targets that are worthy to note are the WAT1-related protein At5g40240-like gene targeted by Nb_miRC7_3p; the CBL-interacting serine/threonine-protein kinase gene targeted by Nb_miRC4_3p, proteasome subunit alpha type-2-A gene targeted by Nb_miRC7_3p, suberization-associated anionic peroxidase targeted by Nb_miRC10_3p and beta-amyrin synthase targeted by Nb_miRC10_3p.

Conclusions

In this study we have confirmed the expression of known and new miRNAs in various tissues of N. benthamiana using high-throughput sequencing and Northern blot analysis. As a result, this work represents a tissue specific atlas of known and new miRNA expression of N. benthamiana. In addition, we have experimentally identified many targets of both known and new miRNAs using a high-throughput method for the global identification of miRNAs targets. This study provides a valuable resource of miRNAs and their validated target mRNAs to the plant research community that will be beneficial well into the future.

Methods

Plant materials

Wild-type Nicotiana benthamiana plants and Nicotiana benthamiana plants from Boyce Thompson Institute were grown in soil or, for seedling and root samples, on half-strength Murashige and Skoog plates under standard growth conditions at 24 °C with 16 h of illumination per day. Total RNA was extracted from 14 day old seedlings and root tissues grown on plates. Furthermore total RNA was also extracted from leaves, stems and flowers without sepals collected from plants grown in soil. Total RNA was isolated by using the phenol/chloroform extraction method [60]. The quality and the quantity of the isolated total RNAs were checked using a nanodrop spectrophotometer and the integrity of RNAs was checked via non-denaturing agarose gel electrophoresis.

RNA sequencing (RNA-Seq) library preparation for Solexa sequencing

cDNA library for RNA-Seq was generated from 1 μg total RNA using TruSeq RNA Sample Preparation Kit (Illumina, San Diego, CA, USA) according to the manufacturer’s protocol. Briefly, poly-A tailed RNAs were purified by oligodT conjugated magnetic beads and fragmented on 94 °C for 8 min, then 1st strand cDNA was transcribed using random primers and SuperScript II reverse transcriptase (Lifetechnologies, Carslbad, CA, USA). Following this step second strand cDNA were synthesized, then double stranded cDNA molecules were end repaired and 3′ ends adenylated then Illumina index adapters were ligated. After adapter ligation step enrichment PCR was performed to amplify the adapter-ligated cDNA fragments. Fragment size distribution and molarity of libraries were checked on Agilent BioAnalyzer DNA1000 chip (Agilent Technologies, Santa Clara, CA, USA). 10pM of denatured libraries were used for cluster generation on cBot instrument, and then sequencing run was performed on Illumina HiSeq 2000 instrument (100 bp paired-end) (Illumina, San Diego, CA, USA).

Small RNA sequencing libraries for Solexa sequencing

The small RNA cDNA libraries were constructed using the Small RNA Sample Prep Kit (Illumina, CA, US), according to the manufacturer’s instructions. Briefly, 1 μg of total RNA from wild-type Nicotiana benthamiana samples, including, seedling, root, leaf, stem and flower were ligated to Solexa adaptors and then converted to cDNA by RT-PCR. The resulting cDNAs were then amplified by PCR, gel-purified and submitted for Illumina/Solexa sequencing. High-throughput sequencing was performed using HiScan SQ platform (50 bp single-end).

Degradome cDNA libraries for Solexa sequencing

The degradome cDNA library was prepared using a total RNA from wild-type Nicotiana benthamiana samples, including, seedling, root, leaf, stem and flower following the procedures previously described by German et al. [39]. Conversely to the small RNA libraries, the degradome cDNA library was sequenced with a custom-made sequencing primer (5′-CCACCGACAGGTTCAGAGTTCTACAGT-3′) [36] on HiScan SQ platform (50 bp single-end).

Northern blot analysis

A total amount of 5 μg of RNA from each Nicotiana benthamiana tissues (seedling, root, leaf, stem, flower, leaf BTI, seedling BTI) were individually separated on a 8 M UREA/12 % polyacrylamid 1 x TBE gel. After gel run we transferred the RNAs from dPAGE onto the neutral nylon membrane, Hybond NX (Amersham/GE Healthcare) by semi-dry electroblotter. The transferred molecules were cross-linked by the enhanced sensitivity 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide (EDC) chemical cross-linking method at 60 °C for 90 min as described previously by Pall et al. [61]. The membranes were prehybridized at 38 °C for 1 h in ULTRAhyb® Ultrasensitive Hybridization Buffer (Ambion/thermo Scientific). DNA oligo probes were end labelled by the forward reaction using T4 polynucleotide kinase (Thermo Scientific) and [γ32P] ATP for 1 h at 37 °C. Hybridization was performed at 40 °C overnight. Each membrane was stripped than rehybridized with different probes (The sequences of the used probes are available in Additional file 10: Table S4.). The membranes were exposed to a phosphorimager.

Real-time PCR quantification of miRNAs

We also used real-time looped RT-qPCR procedure for detection and quantification of miRNAs. Stem-loop RT primers and forward qPCR primers were designed according to Chen et al. [62]. (The sequence data is aviable in Additional file 10: Table S4). As internal reference gene, nucleolar small RNA U6 was used [63] . We followed the protocol by Varkonyi-Gasic et al. [64] with optimalisation to our samples. For DNase treatment of the RNA samples 3 units of DNase I (Thermo Scientific) were used to 250 ng total RNA from the different tissues represented in this study. After 30 min incubation at 37 °C, a phenol/chloroform extraction was performed to inactivate the enzyme and purificate the template. For the elution 20 μl MQ water was used. In the next step, a multiplexed cDNA synthesis of U6 and miRNA was performed. Stem-loop pulsed reverse transcription protocol was performed in a 20 μl reaction volume with each sample for RT and NO RT reactions (0.5 μl (10 mM) dNTP’s, 1–1 μl (1 μM) U6 reference and miRNA specific primer and MQ water was mixed and heated to 65 °C for 5 min and incubate on ice for 2 min. Then 4 units of RNaseOUT (Invitrogen), 50 units of RevertAid H- Reverse transcriptase, 4 μl 5x RT Buffer (Thermo Scientific) and 1 μl RNA template from the previous DNase treatment the was added to the reaction tubes.). The pulsed RT was preformed in thermo cycler: 30 min at 16 °C incubation, followed by pulsed RT of 60 cycles at 30 °C for 30 s, 42 °C for 30 s and 50 °C for 1 s, then at 85 °C for 5 min to inactivate the enzymes. To test the primer specificity and cDNA quality end-point PCR was performed with Pfu polymerase (Thermo Scientific) by 40 cycles following the Várkonyi-Galic et al. protocol. To perform real-time PCR the following components were used: 5 μl 2x FastStart Essential DNA Green Master Mix (Roche), 0,5 μl (10 μM) forward primer, 0,5 μl (10 μM) Universal reverse primer, 3 μl water and 1 μl RT product. PCR tubes were incubated at 95 °C for 10 min, followed by a 2 Step amplicifation by 40 cycles of 95 °C for 15 s, 60 °C for 60 s. For melting curve analysis, the samples were denaturated at 95 °C for 10 s, 65 °C for 60 s, 97 °C 1 s, then cool to 37 °C in a Roche LightCycles 96 instrument. To analyse results the LightCycler software was used.

Small RNA processing

Raw sequence reads quality check was performed using FastQC (version 0.11.3). For adaptor trimming cutadapt (version 1.2.1) was used with the following parameters: minimal sequence length after adaptor removal was 16, maximal sequence length was 28 [65]. We used UEA Small RNA Workbench (version 2.5.0) for all miRNA processing tasks [66]. MiRProf calculated the expression of known miRNAs. This program uses Mirbase (version 20). We allowed 2 mismatches during the analysis. We grouped together mismatches, variants and miRNAs from different organisms and worked only with families. MirCat identified candidant microRNAs. We used the default plant specific parameters. PAREsnip was used to process degradome sequences for finding microRNA targets [40] (Additional file 11). During this analysis, we discarded results from category 4, the maximal mismatch number was 4, mismatches at position 10–11 were not allowed and more than two mismatches at adjacent positions also not allowed. Every other parameter was the default. We have created a non-redundant scaffold set from the available N. benthamiana genome sequencing projects using Blast. We used this set as genome in all programs where genome sequence was a mandatory requirement. Results from these programs were analysed further with custom made Perl and Python scripts. For statistical analysis and visualization R was used [67].

Transcript processing and annotation

To generate the corresponding mRNA transcriptome we have sequenced Illumina paired end TruSeq libraries from the same N. benthamiana leaf, stem, germ, root, flower and seed samples as at the small RNAs. We pooled together 315 million 2x100 bp reads and used Trinity de novo assembly program (r2013-02-16) with default parameters [68]. To find biological function for de-novo transcripts, Trinotate was used (version: 20130225) according to the manual [68]. All related data were pushed into an SQLight database for further processing. To perform the Gene Ontology (GO) analysis of miRNA targets the agriGO web-based tool and database version 1.2 was used [69]. The analysis setting was the following: Singular Enrichment Analysis (SEA) with the Arabidopsis thaliana TAIR9 reference gene model. The graphical results showing the molecular functions.

Data access

The RNA-seq, sRNA and degradome data discussed in this publication have been deposited in National Center for Biotechnology Informations (NCBI) BioProject database under Accession number: PRJNA288746.
  67 in total

1.  Construction of Parallel Analysis of RNA Ends (PARE) libraries for the study of cleaved miRNA targets and the RNA degradome.

Authors:  Marcelo A German; Shujun Luo; Gary Schroth; Blake C Meyers; Pamela J Green
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

2.  Function and evolution of a MicroRNA that regulates a Ca2+-ATPase and triggers the formation of phased small interfering RNAs in tomato reproductive growth.

Authors:  Ying Wang; Asuka Itaya; Xuehua Zhong; Yang Wu; Jianfeng Zhang; Esther van der Knaap; Richard Olmstead; Yijun Qi; Biao Ding
Journal:  Plant Cell       Date:  2011-09-13       Impact factor: 11.277

3.  agriGO: a GO analysis toolkit for the agricultural community.

Authors:  Zhou Du; Xin Zhou; Yi Ling; Zhenhai Zhang; Zhen Su
Journal:  Nucleic Acids Res       Date:  2010-04-30       Impact factor: 16.971

4.  AthPEX10, a nuclear gene essential for peroxisome and storage organelle formation during Arabidopsis embryogenesis.

Authors:  Uwe Schumann; Gerhard Wanner; Marten Veenhuis; Markus Schmid; Christine Gietl
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

5.  Genome sequence and analysis of the tuber crop potato.

Authors:  Xun Xu; Shengkai Pan; Shifeng Cheng; Bo Zhang; Desheng Mu; Peixiang Ni; Gengyun Zhang; Shuang Yang; Ruiqiang Li; Jun Wang; Gisella Orjeda; Frank Guzman; Michael Torres; Roberto Lozano; Olga Ponce; Diana Martinez; Germán De la Cruz; S K Chakrabarti; Virupaksh U Patil; Konstantin G Skryabin; Boris B Kuznetsov; Nikolai V Ravin; Tatjana V Kolganova; Alexey V Beletsky; Andrei V Mardanov; Alex Di Genova; Daniel M Bolser; David M A Martin; Guangcun Li; Yu Yang; Hanhui Kuang; Qun Hu; Xingyao Xiong; Gerard J Bishop; Boris Sagredo; Nilo Mejía; Wlodzimierz Zagorski; Robert Gromadka; Jan Gawor; Pawel Szczesny; Sanwen Huang; Zhonghua Zhang; Chunbo Liang; Jun He; Ying Li; Ying He; Jianfei Xu; Youjun Zhang; Binyan Xie; Yongchen Du; Dongyu Qu; Merideth Bonierbale; Marc Ghislain; Maria del Rosario Herrera; Giovanni Giuliano; Marco Pietrella; Gaetano Perrotta; Paolo Facella; Kimberly O'Brien; Sergio E Feingold; Leandro E Barreiro; Gabriela A Massa; Luis Diambra; Brett R Whitty; Brieanne Vaillancourt; Haining Lin; Alicia N Massa; Michael Geoffroy; Steven Lundback; Dean DellaPenna; C Robin Buell; Sanjeev Kumar Sharma; David F Marshall; Robbie Waugh; Glenn J Bryan; Marialaura Destefanis; Istvan Nagy; Dan Milbourne; Susan J Thomson; Mark Fiers; Jeanne M E Jacobs; Kåre L Nielsen; Mads Sønderkær; Marina Iovene; Giovana A Torres; Jiming Jiang; Richard E Veilleux; Christian W B Bachem; Jan de Boer; Theo Borm; Bjorn Kloosterman; Herman van Eck; Erwin Datema; Bas te Lintel Hekkert; Aska Goverse; Roeland C H J van Ham; Richard G F Visser
Journal:  Nature       Date:  2011-07-10       Impact factor: 49.962

6.  Endogenous siRNA and miRNA targets identified by sequencing of the Arabidopsis degradome.

Authors:  Charles Addo-Quaye; Tifani W Eshoo; David P Bartel; Michael J Axtell
Journal:  Curr Biol       Date:  2008-05-08       Impact factor: 10.834

7.  PAREsnip: a tool for rapid genome-wide discovery of small RNA/target interactions evidenced through degradome sequencing.

Authors:  Leighton Folkes; Simon Moxon; Hugh C Woolfenden; Matthew B Stocks; Gyorgy Szittya; Tamas Dalmay; Vincent Moulton
Journal:  Nucleic Acids Res       Date:  2012-03-29       Impact factor: 16.971

Review 8.  The role of G-proteins in plant immunity.

Authors:  Huajian Zhang; Zhimou Gao; Xiaobo Zheng; Zhengguang Zhang
Journal:  Plant Signal Behav       Date:  2012-08-20

9.  Protocol: a highly sensitive RT-PCR method for detection and quantification of microRNAs.

Authors:  Erika Varkonyi-Gasic; Rongmei Wu; Marion Wood; Eric F Walton; Roger P Hellens
Journal:  Plant Methods       Date:  2007-10-12       Impact factor: 4.993

10.  Optimizing stem-loop qPCR assays through multiplexed cDNA synthesis of U6 and miRNAs.

Authors:  Marie Turner; Sajag Adhikari; Senthil Subramanian
Journal:  Plant Signal Behav       Date:  2013-05-15
View more
  19 in total

1.  Quantitative expression of microRNAs in Brassica oleracea infected with Xanthomonas campestris pv. campestris.

Authors:  Lucas Souza Santos; Mariana Rocha Maximiano; Esaú Megias; Marília Pappas; Simone Graça Ribeiro; Angela Mehta
Journal:  Mol Biol Rep       Date:  2019-04-03       Impact factor: 2.316

2.  A Viral Satellite DNA Vector (TYLCCNV) for Functional Analysis of miRNAs and siRNAs in Plants.

Authors:  Zheng Ju; Dongyan Cao; Chao Gao; Jinhua Zuo; Baiqiang Zhai; Shan Li; Hongliang Zhu; Daqi Fu; Yunbo Luo; Benzhong Zhu
Journal:  Plant Physiol       Date:  2017-02-22       Impact factor: 8.340

3.  Multiple cellular compartments engagement in Nicotiana benthamiana-peanut stunt virus-satRNA interactions revealed by systems biology approach.

Authors:  Barbara Wrzesińska; Agnieszka Zmienko; Lam Dai Vu; Ive De Smet; Aleksandra Obrępalska-Stęplowska
Journal:  Plant Cell Rep       Date:  2021-05-24       Impact factor: 4.570

4.  Genome-Wide Identification of MicroRNAs and Their Targets in the Leaves and Fruits of Eucommia ulmoides Using High-Throughput Sequencing.

Authors:  Lin Wang; Hongyan Du; Ta-Na Wuyun
Journal:  Front Plant Sci       Date:  2016-11-08       Impact factor: 5.753

5.  Characterization of microRNAs of Beta macrocarpa and their responses to Beet necrotic yellow vein virus infection.

Authors:  Jun-Ying Liu; Hui-Yan Fan; Ying Wang; Yong-Liang Zhang; Da-Wei Li; Jia-Lin Yu; Cheng-Gui Han
Journal:  PLoS One       Date:  2017-10-16       Impact factor: 3.240

6.  Distinct Effects of p19 RNA Silencing Suppressor on Small RNA Mediated Pathways in Plants.

Authors:  Levente Kontra; Tibor Csorba; Mario Tavazza; Alessandra Lucioli; Raffaela Tavazza; Simon Moxon; Viktória Tisza; Anna Medzihradszky; Massimo Turina; József Burgyán
Journal:  PLoS Pathog       Date:  2016-10-06       Impact factor: 6.823

7.  Development of Incompletely Fused Carpels in Maize Ovary Revealed by miRNA, Target Gene and Phytohormone Analysis.

Authors:  Hongping Li; Ting Peng; Qun Wang; Yufeng Wu; Jianfeng Chang; Moubiao Zhang; Guiliang Tang; Chaohai Li
Journal:  Front Plant Sci       Date:  2017-04-03       Impact factor: 5.753

8.  Small RNA Sequencing Reveals Differential miRNA Expression in the Early Development of Broccoli (Brassica oleracea var. italica) Pollen.

Authors:  Hui Li; Yu Wang; Mei Wu; Lihong Li; Chuan Jin; Qingli Zhang; Chengbin Chen; Wenqin Song; Chunguo Wang
Journal:  Front Plant Sci       Date:  2017-03-24       Impact factor: 5.753

9.  Systematic analysis of DEMETER-like DNA glycosylase genes shows lineage-specific Smi-miR7972 involved in SmDML1 regulation in Salvia miltiorrhiza.

Authors:  Jiang Li; Caili Li; Shanfa Lu
Journal:  Sci Rep       Date:  2018-05-08       Impact factor: 4.379

10.  Genome-Wide Identification of miRNAs and Their Targets Involved in the Developing Internodes under Maize Ears by Responding to Hormone Signaling.

Authors:  Zhan Zhao; Yadong Xue; Huili Yang; Huimin Li; Gaoyang Sun; Xiaofeng Zhao; Dong Ding; Jihua Tang
Journal:  PLoS One       Date:  2016-10-03       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.