Literature DB >> 23894571

Transcriptome-wide identification and characterization of microRNAs from castor bean (Ricinus communis L.).

Wei Xu1, Qinghua Cui, Fei Li, Aizhong Liu.   

Abstract

BACKGROUND: MicroRNAs (miRNAs) are endogenously encoded small RNAs that post-transcriptionally regulate gene expression and play essential roles in numerous developmental and physiological processes. Currently, little information on the transcriptome and tissue-specific expression of miRNAs is available in the model non-edible oilseed crop castor bean (Ricinus communis L.), one of the most important non-edible oilseed crops cultivated worldwide. Recent advances in sequencing technologies have allowed the identification of conserved and novel miRNAs in many plant species. Here, we used high-throughput sequencing technologies to identify and characterize the miRNAs in castor bean.
RESULTS: Five small RNA libraries were constructed for deep sequencing from root tips, leaves, developing seeds (at the initial stage, seed1; and at the fast oil accumulation stage, seed2) and endosperms in castor bean. High-throughput sequencing generated a large number of sequence reads of small RNAs in this study. In total, 86 conserved miRNAs were identified, including 63 known and 23 newly identified. Sixteen miRNA isoform variants in length were found from the conserved miRNAs of castor bean. MiRNAs displayed diverse organ-specific expression levels among five libraries. Combined with criteria for miRNA annotation and a RT-PCR approach, 72 novel miRNAs and their potential precursors were annotated and 20 miRNAs newly identified were validated. In addition, new target candidates for miRNAs newly identified in this study were proposed.
CONCLUSIONS: The current study presents the first high-throughput small RNA sequencing study performed in castor bean to identify its miRNA population. It characterizes and increases the number of miRNAs and their isoforms identified in castor bean. The miRNA expression analysis provides a foundation for understanding castor bean miRNA organ-specific expression patterns. The present study offers an expanded picture of miRNAs for castor bean and other members in the family Euphorbiaceae.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23894571      PMCID: PMC3722108          DOI: 10.1371/journal.pone.0069995

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The castor bean (Ricinus communis L., Euphorbiaceae, 2 n = 20) is one of most important non-edible oilseed crops and its seed derivatives are often used in aviation oil, lubricants, nylon, dyes, inks, soaps, adhesive and biodiesel. Among all the vegetable oils, seed oil of castor bean is distinctive due to its high level of ricinoleic acid (over 85%), a fatty acid consisting of 18 carbons, a double bond between C9 and C10, and a hydroxyl group attached to C12. In particular, owing to its excellent solubility in ethanol or methanol, seed oil of castor bean was considered as an ideal and unique feedstock for biodiesel production [1]–[3]. Because of its high economic value, castor bean is widely cultivated in tropical, sub-tropical and warm-temperate countries, particularly India, China and Brazil [4]. Due to increased demand for production of castor bean seed oils in many countries, breeding and improvement of varieties are drawing a great attention from breeders [5]. Particularly, genetic improvement of varieties by genetic engineering techniques offers great promises in castor bean [6], [7]. Enhanced efforts should be paid for elucidating the molecular mechanism underlying the regulation of growth and development. The microRNAs (miRNAs) are endogenous noncoding small RNAs which play significant roles in the regulation of gene expression. Post-transcriptional gene regulation by miRNAs constitutes one of the most conserved and well characterized gene regulatory mechanisms. In higher plants, miRNAs play significant roles in different developmental stages by regulating gene expression at transcriptional and post-transcriptional levels [8]–[12]. Identification and characterization of miRNAs and their targets in diverse species has been a major focus in recent years [13]–[16]. Although a number of miRNAs have been identified from diverse plants, information on identification and characterization of miRNAs in the family Euphorbiaceae, an important resource plant group, is very limited. So far, the miRNA database miRBase[17]–[19] (Release 19, January 2013, http://www.mirbase.org) contains 63 miRNAs identified from castor bean, 28 miRNAs identified from rubber tree (Heven brasiliensis) and 10 miRNAs identified from Manihot esculenta in Euphorbiaceae. Although 63 miRNAs had been identified from castor bean in the previous study [20], little information on the transcript level and their tissue-specific expression of miRNAs, however, is available in castor bean. Identification and characterization of miRNAs will contribute to the understanding of the molecular basis of regulating developmental and physiological processes in castor bean. Recently, high-throughput sequencing technologies have been proven to be a powerful strategy to profile miRNA expression pattern and detect novel miRNAs at unprecedented perspectives [21]–[24]. In particular, high-throughput sequencing technologies can be reliably used to measure modest changes in miRNA abundance among different samples; such changes are unlikely to be identified by sequencing low numbers of clones (i.e., traditional small RNA library sequencing) or hybridization-based methods such as small RNA blot and miRNA array analyses. High-throughput sequencing technologies can not only discover novel miRNAs (which produce transcripts in low abundance) due to their ability to generate millions of reads with a determined length, but also characterize their expression among tissues according to their relative abundance. MiRNAs of diverse plants such as maize [25], common bean [26], peanut [27], safflower [28], cucumber [29], soybean [30], cabbage [31], Panax ginseng [32] and Pinus densata [33], have been investigated using high-throughput sequencing technologies in recent years. In this study, we performed deep sequencing and bioinformatic analyses of caster bean tissues (leaves, roots, developing seeds and endosperms) to identify and characterize conserved and novel miRNAs, as well as expression patterns of miRNAs among different tissues and at different stages of seed development. We expected that the conserved, novel and differentially expressed miRNAs obtained in this study provide a basis for further investigation of the physiological roles of identified miRNAs and the molecular mechanism underlying the regulation of growth and development in castor bean.

Results

Library Construction, Sequencing and Characterization of Small RNA Transcriptomes in Castor Bean

In order to identify and characterize conserved and novel miRNAs in castor bean, we constructed five small RNA libraries from leaves, root tips, developing seeds at the initial stage (seed1) and at the oil fast accumulation stage (seed2) and endosperms, and obtained sequence reads through Solexa high-throughput sequencing technologies. Initially, a total number of 14,259,011 (leaf), 13,467,037 (root tip), 11,423,439 (seed1), 11,334,893 (seed2), 12,955,198 (endosperm) raw reads were obtained. After filtering the low quality reads, adaptor and contaminant sequences, the clean reads were 14,187,024, 13,317,609, 11,098,154, 11,089,507 and 12,553,234 for leaf, root tip, seed1, seed2 and endosperm libraries, respectively. Based on these sequences we analyzed the length distribution and found that among the unique size distribution pattern, most of the reads were distributed between 21 and 24 nt (Figure 1). This observation was consistent with the typical size of miRNA from Dicer digestion products. Among which, sequences with the length of the 21 nt and 24 nt were shown to be significantly in abundance, specifically, the sequences with length of 21 nt was highest abundance in leaf, root tip and seed1 libraries, accounting for 56.82%, 37.22% and 28.42% of the sequence number, respectively, whereas sequences with the length of 24 nt were the highest abundance in seed2 and endosperm, accounting for 33.35% and 33.17% of the sequence number.
Figure 1

The length size distribution of small RNAs in root, leaf, seed1, seed2 and endosperm libraries in castor bean.

Subsequently, we annotated all the reads fall into the length of 16–26 nt from all the five libraries (including leaf, root, seed1, seed2 and endosperm) and obtained 1,742,976, 2,758,394, 2,411,289, 2,944,394, 3,557,270 unique reads (the sequence of a particular type with non-redundancy) for leaf, root, seed1, seed2 and endosperm libraries, respectively. Among them, non-coding small RNAs annotated (snRNAs, snoRNAs, tRNAs, rRNAs and miRNAs) occupy 7,050,077, 5,569,288, 4,714,941, 3,012,704 and 3,742,076 reads in leaf, root, seed1, seed2 and endosperm, respectively (Table 1). In addition, a small proportion of reads could be mapped to coding sequences, which are likely to be RNA degradation products; a small proportion of reads could be mapped to intron sequences, which are likely to be related to the splicing of the host gene to produce pre-miRNA molecules.
Table 1

Reads abundance of small RNAs in leaf, root, seed1, seed2 and endosperm libraries.

CategoryReads abundance of total small RNAs
LeafRootSeed1Seed2Endosperm
Total reads1425901113467037114234391133489312955198
Clean reads1418702413317609110981541108950712553234
Unique reads17429762758394241128929443943557270
exon_antisense117549108113623408822098610
exon_sense484447269154271996281762265621
intron antisense212386328599233083272340317912
intron_sense878033768915608664691092764507
rRNA406058146399118872176906931152370
snRNA87598069563141976062
snoRNA444421301635084069969
tRNA437711398694410336208648290630
miRNA61931053677233240540721007602283045
perfect miRNA matching reads57161062938562863191564291814579
miRNA isoform reads476999738671154221615364691468466
unannotated54445326273540520713067433897364508

Identification of Conserved miRNAs in Castor Bean

To identify conserved miRNAs in castor bean small RNA libraries, the unique reads (excluded reads mapped to snRNAs, snoRNAs, tRNAs and coding sequence or intron sequence) 11,637,637, 9,950,773, 7,612,537, 8,844,149, 9,647,553 from five libraries were subjected to the homolog search against miRBase 19. A number of 6,193,105, 3,677,233, 2,405,407, 2,100,760, and 2,283,045 reads in leaf, root, seed1, seed2 and endosperm libraries respectively, were homologous with known castor bean miRNAs, which accounts for 53.2%, 36.9%, 31.6%, 23.8%, and 23.7% of unique reads from each library, respectively (see Table 1). These observations suggest that known miRNAs are only a small portion and there still may be complicated ingredients in Solexa sequenced data. In total, 86 conserved miRNAs were detected, covering 26 miRNA families. As shown in Table 2 and Table S1, the most abundant are miR169 (12 members), miR170/171 (nine members), and miR156/157 (eight members). Of the 86 miRNAs, 69 miRNAs were expressed in all five libraries, which accounted for 80.2%; 13 miRNAs (including one miR159/319, nine miR169s, one miR172, two miR399s) were not detected in the leaf library; seven miRNAs (including one miR160, four miR169s, one miR398 and miR2111) were not detected in the root library; ten miRNAs (including nine miR169s and one miR170/171) were not detected in the seed1 library; six miR169s were not detected in the seed2 library; and four miRNAs (including two miR169s, one miR170/171 and one miR399) were not detected in the endosperm tissue.
Table 2

Conserved miRNAs and their expression levels among different tissues.

miRNAfamilyReferencemiRNASequence (5′–3′)Length (nt)Reads
leafrootseed1seed2endosperm
156rco-miR156a TGACAGAAGAGAGTGAGCAC 2011596755493820415397950259577
rco-miR156b TGACAGAAGAGAGTGAGCAC 2011638255542320532398971261537
rco-miR156c TGACAGAAGAGAGTGAGCAC 2011596755493820414397950259577
rco-miR156d TGACAGAAGAGAGTGAGCAC 2011701055551820426398098259709
rco-miR156e TTGACAGAAGAGAGAGAGCAC 2117342320819809928064912186
rco-miR156f TTGACAGAAGATAGAGAGCAC 2145815482141241170320254303517291
rco-miR156g TTGACAGAAGATAGAGAGCAC 2145742072143679169362253236517521
rco-miR156h TTGACAGAAGATAGAGAGCAC 2145812052140983170329254302517216
159rco-miR159 TTTGGATTGAAGGGAGCTCTA 21112468102895043591711
160rco-miR160a TGCCTGGCTCCCTGTATGCCA 211003892871
rco-miR160b TGCCTGGCTCCCTGTATGCCA 21938082871
rco-miR160c TGCCTGGCTCCCTGAATGCCA 2163042237
162rco-miR162 TCGATAAACCTCTGCATCCAG 2111259251682866713
164rco-miR164a TGGAGAAGCAGGGCACGTGCA 2117925372306834482143
rco-miR164b TGGAGAAGCAGGGCACGTGCA 2117885256306134472141
rco-miR164c TGGAGAAGCAGGGCACGTGCA 2118125269307834472150
rco-miR164d TGGAGAAGCAGGGCACATGCT 2125114735
166rco-miR166a TCGGACCAGGCTTCATTCCCC 211041527630125521438192914200808
rco-miR166b TCGGACCAGGCTTCATTCCCC 211040582629444521029192753200683
rco-miR166c TCGGACCAGGCTTCATTCCCC 211044631634097533841193804201735
rco-miR166d TCGGACCAGGCTTCATTCCCC 211077411712533540960199357206911
rco-miR166e TCGGACCAGGCTTCATTCCCC 211040582629429521021192738200671
167rco-miR167a TGAAGCTGCCAGCATGATCTA 212525817649724902143232506
rco-miR167b TGAAGCTGCCAGCATGATCTAA 2219972626315657226165097258009
rco-miR167c TGAAGCTGCCAGCATGATCTG 211571125421044169677664
rco-miR167d* TAAAGCTGCCAGCATGATCTA 21808456604292148
168rco-miR168 TCGCTTGGTGCAGGTCGGGAA 2164409112339191097888352530
169rco-miR169a CAGCCAAGGATGACTTGCCGG 2124417879456
rco-miR169b CAGCCAAGGATGACTTGCCGG 2124406764414
rco-miR169c TGAGCCAAGGATGACTTGCCG 21222006331
rco-miR169d CAGCCAAGGATGACTTGCCGA 210152005
rco-miR169e CAGCCAAGGATGACTTGCCGA 210152000
rco-miR169f CAGCCAAGGATGACTTGCCGA 210154010
rco-miR169g TAGCCAAGGATGACTTGCCTG 2100009
rco-miR169h TAGCCAAGGATGACTTGCCTG 2100008
rco-miR169i TAGCCAAGGATGACTTGCCCA 2100005
rco-miR169j TAGCCAAGGATGACTTGCCCG 2102701030
rco-miR169k TAGCCAAGGATGACTTGCCCG 210190820
rco-miR169l TAGCCAAGGATGACTTGCCCA 21000010
171rco-miR171a TGATTGAGCCGTGCCAATATC 215392601953641274
rco-miR171b TGATTGAGCCGTGCCAATATC 215392601953641274
rco-miR171c TGATTGAGCCGTGCCAATATC 215112681933741294
rco-miR171d TGATTGAGCCGTGCCAATATC 215102571913641261
rco-miR171e TGATTGAGCCGTGCCAATATC 215164352414321487
rco-miR171f TGATTGAGCCGTGCCAATATC 215102571913641261
rco-miR171g TTGAGCCGCGCCAATATCACT 214107650
rco-miR171h TTGAGCCGCGTCAATATCTCC 212714302460
rco-miR171i CGAGCCGAATCAATATCACTC 2124487845626635
172rco-miR172a GGAATCTTGATGATGCTGCAG 21017022482
rco-miR172b AGAATCTTGATGATGCTGCAT 2116659964116028653119
rco-miR172c AGAATCTTGATGATGCTGCAT 2116659964116028653119
rco-miR172d AGAATCTTGATGATGCTGCAT 2116659964116028653119
319rco-miR319a TTGGACTGAAGGGAGCTCCC 202110792
rco-miR319b TTGGACTGAAGGGAGCTCCC 20219792
rco-miR319c TTGGACTGAAGGGAGCTCCC 202221892
rco-miR319d TTGGACTGAAGGGAGCTCCTT 21010141
390rco-miR390a AAGCTCAGGAGGGATAGCGCC 2130866716698246554
rco-miR390b AAGCTCAGGAGGGATAGCGCC 2132667596844254562
393rco-miR393a TCCAAAGGGATCGCATTGATCT 2231876101135
rco-miR393b TCCAAAGGGATCGCATTGATCC 2224735821
394rco-miR394a* TTGGCATTCTGTCCACCTCC 20901025108
rco-miR394b* TTGGCATTCTGTCCACCTCC 201401325110
395rco-miR395a CTGAAGTGTTTGGGGGAACTC 2134158101430
rco-miR395b CTGAAGTGTTTGGGGGAACTC 2134158101430
rco-miR395c CTGAAGTGTTTGGGGGAACTC 2133758101330
rco-miR395d CTGAAGTGTTTGGGGGAACTC 2133658101330
rco-miR395e CTGAAGTGTTTGGGGGAACTC 2134158101430
396rco-miR396a TTCCACAGCTTTCTTGAACTT 21171369519496
rco-miR396b TTCCACAGCTTTCTTGAACTG 211859100110885791032
rco-miR396c TTCCACAGCTTTCTTGAACTG 211854100510925801035
397rco-miR397 TCATTGAGTGCAGCGTTGATG 213683350509582237
398rco-miR398a TTCTCAGGTCACCCCTTTGGG 2110213
rco-miR398b TGTGTTCTCAGGTCGCCCCTG 2147601313
399rco-miR399a TGCCAAAGGAGAGTTGCCCTG 2117716347095
rco-miR399b TGCCAAAGGAGATTTGCCCGG 2112842715
rco-miR399c TGCCAAAGGAGATTTGCCCGG 2112802715
rco-miR399d TGCCAAAGGAGAGCTGCCCTG 2101110
rco-miR399e TGCCAAAGGAGATTTGCC 1805147
403rco-miR403a TTAGATTCACGCACAAACTCG 2176130936822931061
rco-miR403b TTAGATTCACGCACAAACTCG 2176130936822931061
408rco-miR408 CTGCACTGCCTCTTCCCTGGC 21772502439469
482rco-miR482* GGAATGGGCGGTTTGGGAAAG 2134671784925481295534179
535rco-miR535 TGACAACGAGAGAGAGCACGC 214447737580569702769221852
827rco-miR827* TTAGATGACCATCAACAAACA 2123381328615201
2111rco-miR2111* TAATCTGCATCCTGAGGTTTA 211800132109153
4414rco-miR4414* TATGAATGATGCGGGAGATAA 2130332202260190551

Note: *: New conserved miRNA in known miRBase in other species. The loci on genome were identified for six miRNAs newly identified in this study. rco-miR167d, 29883∶144402:144497:+; homologue: Arabidopsis thaliana miR167a; rco-miR394a,b, 30170∶3866594:3866721:+; 30116∶128336:128443:+; homologue: Arabidopsis thaliana miR394a,b; rco-miR482, 29586∶144986:145094:-; homologue: Malus domestica miR482a; rco-miR827, 28266∶68399:68502:+; homologue: Gossypium hirsutum miR827a; rco-miR2111, 29973∶58727:58830:+; homologue: Arabidopsis thaliana miR2111a; rco-miR4414, 29729∶702439:702549:+; homologue: Medicago truncatula miR4414b.

Note: *: New conserved miRNA in known miRBase in other species. The loci on genome were identified for six miRNAs newly identified in this study. rco-miR167d, 29883∶144402:144497:+; homologue: Arabidopsis thaliana miR167a; rco-miR394a,b, 30170∶3866594:3866721:+; 30116∶128336:128443:+; homologue: Arabidopsis thaliana miR394a,b; rco-miR482, 29586∶144986:145094:-; homologue: Malus domestica miR482a; rco-miR827, 28266∶68399:68502:+; homologue: Gossypium hirsutum miR827a; rco-miR2111, 29973∶58727:58830:+; homologue: Arabidopsis thaliana miR2111a; rco-miR4414, 29729∶702439:702549:+; homologue: Medicago truncatula miR4414b. Compared with the known 63 miRNAs from castor bean in the miRNA database, 23 conserved miRNAs (see Table S1) were newly identified including one miR167 member (rco-miR167d), nine miR169 members (rco-miR169d-i), two miR170/171 members (rco-miR171 h,i), three miR172 members (rco-miR172b-d), one miR393 (rco-miR393b), one miR394 member (rco-miR394b), two miR396 members (rco-miR396b.c), one miR482 (rco-miR482), one miR827 member (rco-miR827), one miR2111 member (rco-miR2111) and one miR4414 member (rco-miR4414). The second structures of 23 new conserved miRNAs were predicted and results were shown in Figure S1. Further, we compared with the miRNAs predicted by Zeng et al. [20] based on genome sequences of castor bean and found that six (including rco-miR167d, rco-miR394, rco-miR482, rco-miR827, rco-miR2111, rco-miR4414) of the 23 miRNAs newly identified in our analyses were reported for the first time in castor bean (see Table 2 and Table S1). Seventy-eight of 83 miRNAs predicted in previous study were confirmed. Five (including one miR169 and four miR399) of 83 miRNAs predicted were not identified in our analysis, probably because the expression of the five miRNAs is related to environmental stress. The sequencing frequencies for miRNAs in the library can be used as an index for estimating the relative abundance of miRNAs. High-throughput sequencing produced a large number of miRNA sequences, allowing us to determine the relative abundance of miRNAs in castor bean; the frequencies of miRNA families varied largely in different libraries, e.g. most members of miRNA156, miRNA167, miRNA168, miRNA535 were abundant in all libraries, whereas members of miRNA160, miRNA169, miRNA319, miRNA393, miRNA395, miRNA398 and miRNA399 were scarce in all libraries (see Table 2), indicating that expression level of miRNAs varies significantly among different miRNA families in castor bean. In addition, most of the miRNA members displayed a tissue- or developmental stage-specific expression, e.g. miR156e has a low expression in leaf and root libraries and a high expression in the seed libraries; the miR156f, miR156g and miR156 h have the highest expression in the leaf library and the lowest expression in seed1 library. When analyzing the miRNA/miRNA* duplex structure for all conserved miRNAs identified in castor bean, we found that 60 of 86 conserved miRNAs displayed the miRNA/miRNA* duplex structure (Figure 2 for examples), involving 23 families (see Table 3). In contrast, the abundance of miRNA* is significantly lower than their reference miRNAs, except for rco-miR171e* and rco-miR408* (which has abundances higher than their references rco-miR171e and rco-miR408).
Figure 2

The secondary structures of rco-miR482, rco-miR2111, rcomiR827 and rco-miR167 miRNAs identified from castor bean.

Sequences shaded in red and blue, corresponding to miRNA and predicated miRNA*, respectively.

Table 3

Conserved mature-star miRNAs from castor bean.

miRNAfamilyReferencemiRNAStar sequence(5′–3′)Length (nt)Reads
rootleafseed1seed2endosperm
156rco-miR156a GCTCACCCTCTATCTGTCGCC 211821515
rco-miR156b GCTCACTTCTCTTTCTGTCAAG 221851546
rco-miR156c GCTTACTCTCTATCTGTCACC 217079293156
rco-miR156d TGCTCACCTCTCTTTCTGTCAGC 231024127529658
rco-miR156e TGCTCTCTCCTCTTCTGTCATC 22001621109
rco-miR156f TTTTGTGCTCTTTTTTCTTCTG 22020000
rco-miR156g GCTCTCTAGTCTTCTGTCATC 218210729
rco-miR156h GCTCTCTATGCTTCTGTCATC 21481062882
160rco-miR160b GCGTGCGAGGAGCCAAGCATA 21494020
rco-miR160c ATGAGGGGAGTCATGCAGGCC 2101001
162rco-miR162 TGGAGGCAGCGGTTCATCGATC 229843233220
164rco-miR164a CACGTGCTCCACTTCTCCAAC 2570001
rco-miR164c CATGTGCCCGTCTTCCCCATC 2118126058
166rco-miR166b GGAATGTTGTCTGGCTCGAGG 2175331685158120781428
rco-miR166c TGAATGTTGTCTGGTTCGATG 2113146174918
rco-miR166d GGGAATGCTGTCTGGTTCGAG 2106514
rco-miR166e GGAATGTTGTCTGGCTCGAGG 2175331685158120781428
167rco-miR167a GGTCATGCTCTGACAGCCTCACT 23910024
rco-miR167b AGATCATGTGGCAGTTTCACC 217594225779
rco-miR167c AGATCATGTGGCAGTTTCACC 217594225779
rco-miR167d GATCATGTGGTAGCTTCACC 202391111
168rco-miR168 CCCGCCTTGCATCAACTGAAT 21165055511617291276
169rco-miR169a CGGCAAGCTGTTCTTGGCTAT 21207543126503
rco-miR169b GGCAAGTTGTTCTTGGCTACA 2141001
rco-miR169c GCAAGACATTCTTGGCTCTAC 2159200021
rco-miR169d GGCAAGTTGTCCTTGGCTACA 2104005
rco-miR169e GGCAGGTTGTCCTTGGCTAC 200354000
rco-miR169f GGCGAGCTGTTCTTGGCTACA 2104100130
rco-miR169g GGCAGTCTCCTTGGCTAAC 1900003
rco-miR169i GGCAGTCAACTTGGCTAAT 19000010
rco-miR169j GGCATGTCACCTTGGCTAAT 2002022
171rco-miR171a ATATTGGTCCGGTTCAATAAG 21545191
rco-miR171b CGAGATATTGGTGCGGTTCAA 21125714128
rco-miR171e TGTTGGAATGGCTCAATCAAA 212488754583554104
rco-miR171g CGATGTTGGTGAGGTTCAATC 21210010
rco-miR171h GAAGGTATTGGCGCGTCTCAATC 23211037
rco-miR171i CGTGATATTGGTCCGACTCATC 22230184518124
172rco-miR172a GGAGCATCATCAAGATTCACA 210011920512
rco-miR172b GGAGCATCATCAAGATTCACA 214293113
rco-miR172c GTAGCATCATCAAGATTCACA 21162026
rco-miR172d GCGGCATCATCAAGATTCACA 2114032156
390rco-miR390a CGCTATCCATCCTGAGTTTCA 2194316168
rco-miR390b CGCTATCCATCCTGAGTTTCA 2194316168
393rco-miR393a ATCATGCGATCCCTTAGGAAG 2111134
rco-miR393b ATCATGCTATCCCTTTGGATT 21704211
394rco-miR394a AGGTGGGCATACTGCCAACT 202037913
396rco-miR396a TTCAAGAAAGCTGTGGGAGA 20171725778
rco-miR396b TTCAATAAAGCTGTGGGAAG 20899684402371407
rco-miR396c GTTCAAGAAAACTGTGGAAAA 000030
397rco-miR397 CACCAGCGCTGCATTCAATCA 2010000
398rco-miR398a CAGAGGAGTGGCTCCCTGAGAACA 240326317
rco-miR398b GGAGCGACCTGAGAATCACATG 2212722212
399rco-miR399d GGGCATCTCTCGCTTGGCAGG 2101014
403rco-miR403a AGTTTGTGTGTGAATCTAATT 2102013
rco-miR403b TCTCTAGTTTGTGCGTGAATC 2153051
408rco-miR408 AAGACTGGGAACAGGCAGTGC 211770357544239337
482rco-miR482 TTCCCAATTCCGCCCATTCCGA 2287143720931289
535rco-miR535 GTGCTCCCTATCGTTGTCAAT 2193022184858901272
827rco-miR827 TTTGTTGATAGTCACCTAGTT 214714210342
2111rco-miR2111 GCCCTCGGGTTGCAGATTACC 2110105

The secondary structures of rco-miR482, rco-miR2111, rcomiR827 and rco-miR167 miRNAs identified from castor bean.

Sequences shaded in red and blue, corresponding to miRNA and predicated miRNA*, respectively.

Identification of miRNA Isoforms

MiRNAs were initially thought to have a specific sequence of a defined length. Identification of miRNAs from different species has revealed that there are variations in pre-miRNA processing, which could result in miRNA isoforms with one or two nucleotide variation in length or structure from the same locus [26]. Ehrhardt et al. (2010) demonstrated that one fifth of the annotated Arabidopsis thaliana miRNAs (miRBase 14) have a stable miRNA isoform of one or two nucleotides longer [34]. Previous studies have revealed that these miRNA isoforms may have functional divergence due to differential associations with AGO proteins [35]–[36]. To identify miRNA isoforms from our transcriptome data, all miRNA reads (including 6,193,105, 3,677,233, 2,405,407, 2,100,760, 2,283,045 reeds from leaf, root, seed1, seed2 and endosperm, respectively) obtained from previous analyses were aligned against miRBase 19), allowing at most two mismatches or four nucleotides in length difference. The total number of isoform variants found for each library was subjected to a filter that consisted of choosing variants that had a total number of reads 50% greater than the number of total reads of their reference miRNA previously reported, so that low-abundance and probable non-functional variants were discarded. Compared with the length and sequences of the reference miRNAs identified from castor bean genome based on computational prediction in previous study [20], 16 isoform variants from five libraries were detected totally, involving ten families (miRNAs 156, 167, 171, 319, 393, 395, 396, 398, 399 and 403; see Table 4). In the case of miR156, the isoform variant iso-miR156a-d with the 21A absent was detected from four loci (a, b, c and d); the isoform iso-miR156e with a 5′ single nucleotide U/T extension from one locus (e). For the miR167 family, two isoforms with a 3′ single nucleotide A (iso-miR167b) extension or G (isomiR167c) deletion were detected from two loci (b and c). In the case of miR319, two isoform variants with a 3′ single nucleotide T (iso-miR319a-c) and a 5′ single T extension and a 3′di- nucleotide TT deletion (iso-miR319d) were detected from different loci. In the case of miR395, the isoform variant iso-miR319a-e with a 3′ tri- nucleotide TCT deletion were detected from all miR395 loci identified (a, b, c, d and e). Similarly, in the case of miR399, the isoform variant with a 3′ bi- nucleotide GG deletion (isomiR399b-d) was detected from three loci (b, c and d), and the isoform variant with a 3′ tri- nucleotide CAG deletion (iso-miR399e) was detected from the e locus. In the cases of miR171 and miR398, two isoform variants (iso-miR171a,b and iso-miR171g, and iso-miR398a and iso-miR398b) with a 5′ tri- or tetra- nucleotide addition and a 3′ tri- or tetra- nucleotide deletion were detected from different loci. In the other cases such as miR393, miR396 and miR403, isoform variants were produced due to the 1–3 nucleotide addition or deletion in the 3 strand of miRNAs. These results indicated that the isoform variants mainly occurred in several specific miRNA families such as miR156 (isoforms were detected from five loci), miR395 (isoforms were detected from five loci) and miR399 (isoforms were detected from four loci) in castor bean. The variation in length of isoforms identified involved two types: 1) single or several nucleotides addition or deletion in the 3′ strand only (such as miR167 and iso-miR167, miR395 and iso-miR395, miR399 and iso-miR399); and 2) single or several nucleotides addition or deletion both in the 5′ and 3′ strands simultaneously (such as miR156 and iso-miR156, miR171 and iso-miR171, miR398 and iso-miR398).
Table 4

miRNA isoforms identified from castor bean.

miRNASequence (5′–3′)Length (nt)Reads
leafrootseed1seed2endosperm
rco-miR156a-d TGACAGAAGAGAGTGAGCACA 21183271238316936492407
TGACAGAAGAGAGTGAGCAC 209702854106320192393009255941
rco-miR156e TGACAGAAGAGAGAGAGCACA 222414184112586
T TGACAGAAGAGAGAGAGCAC 2115601493817557925113906579
rco-miR167b TGAAGCTGCCAGCATGATCTA 212453717445703552080731417
TGAAGCTGCCAGCATGATCTAA 221741868647582840143017224446
rco-miR167c TGAAGCTGCCAGCATGATCTGG 221074678265613521578
TGAAGCTGCCAGCATGATCTG 21136491167550850405006
rco-miR171a,b TTGAGCCGTGCCAATATCACG 2160001
TGA TTGAGCCGTGCCAATATC 215052491843551228
rco-miR171g AGA TTGAGCCGCGCCAATATC 2101000
TTGAGCCGCGCCAATATCACT 21487550
rco-miR319a-c TTGGACTGAAGGGAGCTCCCT 21113351
TTGGACTGAAGGGAGCTCCC 2082321
rco-MIR319d TTGGACTGAAGGGAGCTCCTT 2207011
A TTGGACTGAAGGGAGCTCC 2000111
rco-miR393 TCCAAAGGGATCGCATTGATC 2110132119
TCCAAAGGGATCGCATTGATCT 22215274249
rco-MIR395a-e CTGAAGTGTTTGGGGGAACTC 21296478516
CTGAAGTGTTTGGGGGAA 18122276
rco-miR396 TTCCACAGCTTTCTTGAACTT 219917161315
TTCCACAGCTTTCTTGAA 181575477160
rco-miR398a TGTG TTCTCAGGTCACCCCTT 2100102
TTCTCAGGTCACCCCTTTGGG 21101181
rco-miR398b TGTGTTCTCAGGTCGCCCCTG 2137561011
TCA TGTGTTCTCAGGTCGCCC 2162102
rco-miR399b-d TGCCAAAGGAGATTTGCCCGG 211275139
TGCCAAAGGAGATTTGCCC 1901135
rco-miR399e TGCCAAAGGAGATTTGCCCAG 2101001
TGCCAAAGGAGATTTGCC 1803115
rco-miR403a,b TTAGATTCACGCACAAACTCG 21688278137358267
TTAGATTCACGCACAAACT 194381841601639
When inspecting the expression of these isoform variants among five libraries, we unexpectedly found that the expression of these isoforms among different libraries had significant divergence, e.g., in the cases of miR156a-d, miR156e, miR167c and miR171a,b, the variants iso-miR156a-d, iso-miR156e, iso-miR167c and iso-miR171a,b were more highly expressed in all libraries than the rco-miR156a-d, rco-miR156e, rco-miR167c and rco-miR171a,b (Figure 3 for examples); in the case of miR395, rco-miR395a-e had relatively higher expression in the leaf library than its expression in other libraries, whereas the iso-miR395a-e was weakly expressed in all libraries; in the case of miR399, rco-miR399b-d was relatively higherly expressed in root library than other tissues, whereas the rco-miR399b-d was weakly expressed in all libraries; in the case of miR403, rco-miR403a,b was relatively higherly expressed in the leaf library than other libraries, whereas the iso-miR403a,b was higherly expressed in the seed2 library than others; in the case of miR171, rco-miR171g was only present in the root library, whereas the iso-miR171g was present in all libraries except for the endosperm library (see Table 4).
Figure 3

Differential processing of castor bean pre-miRNAs.

Stem-loop precursors of rco-miR156a and rco-miR167c pre-miRNAs were aligned against mature (red) and isoform (blue) miRNA sequences. Count data number represents the total number of reads found in leaf libraries.

Differential processing of castor bean pre-miRNAs.

Stem-loop precursors of rco-miR156a and rco-miR167c pre-miRNAs were aligned against mature (red) and isoform (blue) miRNA sequences. Count data number represents the total number of reads found in leaf libraries.

Expression Patterns of miRNAs among Tissues

Preferential expression of a miRNA in specific tissues might provide clues about its physiological function. To investigate the expression patterns of miRNAs among leaf, root, developing seeds and endosperm in castor bean, read count of each identified miRNA was normalized to the total number of miRNA read count in each library. Based on the relative abundance, we found that the expression of certain members within the miRNA families varied greatly in the given tissues, suggesting functional divergence within the family in castor bean. For example, abundance of the miR156 family varied from 122 reads (rco-miR156e) to 322,939 reads (rco-miR156e) in the leaf library, similar to the case for miR167 family varied from 941 reads to 59,219 reads in the seed1 library (see Table S2). These results indicate that miRNA members in one given miRNA family display clearly different expression levels, probably implying their functional divergence. We compared the expressional differentiation of conserved miRNAs identified between the leaf and seed1, root and seed1, seed2 and seed1, and endosperm and seed1, respectively. We found that 49 out of 69 miRNAs detected between the leaf and seed1 were significantly differentially expressed (log2ratio fold-change >1.0 and P value <0.001, see Figure 4a and Table S2) with 15 miRNAs up-regulated and 34 miRNAs down-regulated in leaf. Similarly, 42 out of 69 miRNAs between the seed1 and root were significantly differentially expressed with 17 miRNAs up-regulated and 25 miRNAs down-regulated in root (see Figure 4b and Table S2). When comparing the expressional differentiation of miRNAs between the seed2 and seed1, endosperm and seed1, respectively, we found that 42 out of 65 miRNAs detected between the seed1 and seed2, and 60 out of 68 miRNAs detected between the endosperm and seed1, were significantly differentially expressed (log2ratio fold-change >1.0 and P value <0.001, see Figure 4c-d and Table S2) with 23 miRNAs up-regulated and 19 miRNAs down-regulated in the seed2, and 23 miRNAs up-regulated and 37 miRNAs down-regulated in the endosperm. It is worthy to note that some families such as miR166 and miR165 were of abundance cross the five libraries, whereas many families such as miR160, miR169, miR171, miR395 were lowly expressed in five libraries. Based on their abundance in the libraries, most members of miR156 family were of higher abundance in vegetable tissues (leaf and root), whereas the rco-miR156e had higher expression in developing seeds than in the leaf and root; the members of miR167 and miR164 had obviously preferential expression among tissues (see Table 2 and Table S2).
Figure 4

Comparison of expression patterns of miRNAs identified between seed1/root (a), seed1/leaf (b), seed1/seed2 (c), and seed1/endosperm (d).

Novel miRNA Detection

One of the most important features for high-throughput sequencing is that it can be employed to detect novel miRNAs in small RNA transcriptome [22], [37]. In the previous study, 83 miRNAs were predicted based on genome sequences in castor bean and 63 of 83 miRNAs predicted were validated and released in the miRNA database [20]. In this study, remaining unannotated reads (5,444,532, 6,273,540, 5,207,130, 6,743,389 and 7,364,508 from leaf, root, seed1, seeds and endosperm, respectively) were mapped to reference genome of castor bean for identifying the genomic location and retrieving the adjoining sequence to help with secondary structure prediction of a miRNA precursor using the MIREAP pipeline (developed by BGI). The resulting reads, with a characteristic hairpin structure, a maximum free energy of ∼25kcal/mol, minimal matched base pairs of miRNA and miRNA* exceeding 16 nt and the sequence length of 20–23 nt, and reads abundance more than 100 at least in one independent library were considered as novel miRNA candidates. As a result, 72 potential miRNA candidates were identified with typical stem-loop structure (Figure S1), the negative folding free energies ranged from 25.4 to 103 (kcal/mol), and diverse loci in castor bean genome (see Table 5 and Table S3). Of the 72 potential miRNAs, 24 represented both the miRNA and miRNA* and 48 were miRNA*-deficient cases (having only the 5′ arm or 3′ arm sequences) (see Table 5 and Table S3). Fifty-three of these novel miRNA candidates were expressed in at least two independent libraries, and 19 of these candidates were expressed in a single library. A recently published article proposed precise and strict new miRNA annotation criteria by Meyers et al. [38]. Besides the primary criteria used by Mireap, two elementary requirements are demanded in high-throughput sequencing data analysis: (i) high-throughput sequencing data should represent both the miRNA and miRNA*; and (ii) in miRNA*-deficient cases, isolation and sequencing of the candidate miRNA should come from multiple and independent libraries. Based on these precise criteria, 58 of 72 novel miRNA candidates were categorized as highly confident. Fourteen miRNA candidates identified by Mireap did not meet Meyers et al.’s criteria (see Table 5).
Table 5

Novel miRNAs identified from castor bean.

miRNASequences (5′–3′)Length(nt)ReadsRNA*No of loci
leafrootseed1seed2endosperm
Rco-miR001a TTGGAGGATAGTTTCAGGCCGG 2201270190no1
Rco-miR002a GTGGACGTGCCGGAGTGGTTA 2115653188012211656no2
Rco-miR003b TCTGATAGCAAAAGATAGAAC 218140000no1
Rco-miR004a CAACGGATAGGTATACAGTTTT 2230204121000no1
Rco-miR005a TCTGAAATTGCAGAGCCTAAA 21225372124205343no1
Rco-miR006a TCTTTGTAGTTTTGATCCGGAG 2213122054173513941314no1
Rco-miR007a AGAGAAGGATGGTAGAGATGGTT 23100270276no2
Rco-miR008a TATCTTTGTAGTTTTGATCCGG 2232270554700no1
Rco-miR009a TGAAGATGAAGAGCTATGTTTGA 2386714101310no1
Rco-miR010a TGAGGAAGAGGATGACTTTGGA 22011005922no1
Rco-miR011a TCTCTAATTCGCTTGGTGCAG 21193178437158yes1
Rco-miR012a CAATTGGATCGTTATTTGCTA 2111313787167132no1
Rco-miR013a AGGTGCAGGTGTGAGTGCAGG 211712396018yes1
Rco-miR014a TAATCTTGCTAACGGACTAAA 21291630055yes1
Rco-miR015a GCCGCTATGGTGAAATCGGT 204070000yes1
Rco-miR016a AAGCCTGCGAGAGAGAGTTGG 21000371346yes1
Rco-miR017a AGGCCGATGACGATTAGAGGACG 230147000yes2
Rco-miR018b TTCAAAAGGAGAACAAGGATAA 224570000no1
Rco-miR019a ACATCCTTGAAGCTAACTCTA 214519465386573yes1
Rco-miR020a AGGCAGTCATCTCTTGGCTAC 210000163yes1
Rco-miR021a CGAGTCATCTGACAGAAGTAG 210443000yes1
Rco-miR022b AGTGGGCGGAAAGGGGGGGTA 211890000no1
Rco-miR023a TTTTATCACCGTCAGATTCTA 2112733377221185no1
Rco-miR024a TTTTGCCTACACCACCCATTCC 22063762100no4
Rco-miR025a AATAGTGATTGTGATATTGGCC 22323010100yes1
Rco-miR026a ATTTTAGGAAGGGAATGAACA 21249768368653431yes1
Rco-miR027b TTATTTTGATTTTGGACGTTTC 221800000no5
Rco-miR028a TCTTATAGCAATCAGGGGACTTG 23016651000yes1
Rco-miR029a TATGGGGGGATCGGGCAATAT 2130798498619142222431yes1
Rco-miR030a GTCTGGGTGGTGTAGTCGGTT 2138423735521350045369no1
Rco-miR031a TGTCGCTGGAGAGATGGCGCCA 221321146400no1
Rco-miR032a GAGGTCCTGTAGGGAGAGTGG 21143311443029yes1
Rco-miR033a TCCGGAGAGATTTGTGGACGA 2123704180285no1
Rco-miR034a TCAGGTGGAGAATCAAACAGA 211710167600419no1
Rco-miR035a TCCGGAGAGATTTGTGGACGAT 22004180285no1
Rco-miR036a CATGGACCAGAAGGCATATAC 211038208466no1
Rco-miR037a CTGAGACTTGAGGGATAGGTGTT 23057911100no5
Rco-miR038a TGACGTGGCATGAACTTCGGCA 2292364137610131707no1
Rco-miR039a TAGAGCCAAGAATGACTTGCCGG 23000204411yes1
Rco-miR040a ACTCTCTCTGAAGGCTTCAAA 213199117993545834525no1
Rco-miR041a TCCGGAGAGATTTGTGGACGAT 2241805150285yes1
Rco-miR042a TCTGTCGCAGGAAAGATGGTAC 2203225760863yes1
Rco-miR043a TTTGCATGACCTGGGAGACGT 218196172432845425073no1
Rco-miR044a TGGAAATTTCTGGGTTGGAGG 21027892941896314no1
Rco-miR045b ATCAAATAAGGAAGAATCGAG 210001210no1
Rco-miR046b TCGAAAGAGATATCAAGGACTG 2200017890no1
Rco-miR047a GGAGGCCTTTGAGCAGAGTGGA 2200118400yes1
Rco-miR048b TTGGCATCAGAGGAGTCAAGC 211050000no1
Rco-miR049a TAGGCAAAGCATCAGGATTCAT 222121434000no2
Rco-miR050a TGTTTTTTGATCAGGACCATAA 2217416720168139no1
Rco-miR051a CTGTCGCAGGAGCGGTGGCACC 226875232300yes1
Rco-miR052a GGTATTGGACGGGTTGGCAAGA 22912719777438981401429yes1
Rco-miR053a TCGAACCCAACTAGAAGATCTC 2200122522811379no4
Rco-miR054a TATGGGAGGCATGGTCAGAAA 212905820886867417no1
Rco-miR055a TGGACAAGTAGAGGTTACTAAT 220214244422472no1
Rco-miR056b TCTGGATGAAGGCTGGAGTGAT 220054900no1
Rco-miR057a GCCGCTATGGTGAAATCGGT 20407170150no1
Rco-miR058b TGAGGTTGGGTTGGACGACATA 2201470000no1
Rco-miR059a CAGCAAGGATTAAGGGACATTT 22296055600no1
Rco-miR060b TCTGAAGCTGTGAATGGGAAT 210002770no2
Rco-miR061a GAACGGCATTTGTAGCCCAGGAG 231013517100yes1
Rco-miR062a TCTGAATCAGGCTCTATATTAG 2205301590yes1
Rco-miR063b TTGAACAGTAGGAAGAGGGTTT 220003280no1
Rco-miR064a TCTTTATATAGAGGTCTCGGAG 2225951375110316001864no1
Rco-miR065a TTTTGTGCCAAGAACGTTGTTT 22237121480198no5
Rco-miR066a TGGATAAGTTTCAGGAGATCTC 226678337958220yes1
Rco-miR067b TGGGCTTTGAAGAAGAAGGTA 210011000no1
Rco-miR068a TCATCAGATGAAGAGCATGACC 221064093300no1
Rco-miR069b TGGGCTAGAGCATTAGAAGTTT 220012900no1
Rco-miR070a TCTGGGAGTAGATTGAAGTGAA 2211820014750no1
Rco-miR071b ATTGAGTTGGTAGAAGGTGCAA 220014000no1
Rco-miR072a TTAGGAAAGCAGCTTGACACGTG 2300361890yes1

Note: a: these candidates meet Meyers et al.’s criteria; b: these candidates do not meet Meyers et al.’s criteria.

Note: a: these candidates meet Meyers et al.’s criteria; b: these candidates do not meet Meyers et al.’s criteria.

Predicted Targets of Castor Bean miRNAs

According to Allen et al. and Schwab et al.’s methods [39], [40], we predicted targets of the 95 miRNA candidates (including 23 new conserved and 72 novel miRNAs) using the currently annotated mRNAs of genes in the castor bean (from the CBGD database http://castorbean.jcvi.org). As a result, 80 of 95 miRNA candidates were identified to have their target genes, involving 482 miRNA:target pairs. The function of these target genes were broadly involved in the growth and development process of castor bean. The predicted target genes of these 95 miRNA candidates and their potential functional annotations are listed in Table S4.

Validation of the Putative miRNAs Newly Identified in Castor Bean

To validate the 95 miRNA candidates newly identified by high-throughput sequencing results, RT-PCR analysis was performed according to the method described in “Materials and Methods”. Using first-strand cDNAs obtained respectively from leaves, root tips and developing seeds, 20 primer pairs showed clean amplification bands for miRNAs PCR products including five conserved miRNA families (rco-miR172bc-d, 396b-c, 482, 827 and 4414) and fifteen novel putative miRNAs (Rco-miR002, Rco-miR006, Rco-miR029, Rco-miR030, Rco-miR032, Rco-miR038, Rco-miR040, Rco-miR043, Rco-miR044, Rco-miR052, Rco-miR053, Rco-miR054, Rco-miR058, Rco-miR064 and Rco-miR068, see Figure 5), suggesting the 20 miRNAs newly identified were validated by RT-PCR amplification. When comparing the abundance of these miRNAs validated in five miRNA libraries, we found these miRNAs were relatively more abundant than other miRNAs newly identified (see Table 2 and Table 5). Those miRNAs newly identified with low abundances were not validated by RT-PCR amplification probably because of their low expression levels in these tissues tested. These RT-PCR results exhibited the same expression profiles as the original high-throughput sequencing results.
Figure 5

Validation of the 20 miRNAs newly identified using the RT-PCR method.

The numbers 1, 2 and 3 showed that the bands amplified using cDNAs as templates obtained from developing seeds, root tips and leaves, respectively. The number 4 showed a negative control (NTC, i.e., no template in PCR reaction). M denoted markers. The amplified bands were separated in 1.5% agarose gel.

Validation of the 20 miRNAs newly identified using the RT-PCR method.

The numbers 1, 2 and 3 showed that the bands amplified using cDNAs as templates obtained from developing seeds, root tips and leaves, respectively. The number 4 showed a negative control (NTC, i.e., no template in PCR reaction). M denoted markers. The amplified bands were separated in 1.5% agarose gel.

Discussion

Although miRNAs have been studied extensively in diverse plant species in these years, limited knowledge is known for plant species in the family Euphorbiaceae. Based on complete genome data of castor bean, the study on a genome-scale computational prediction of miRNAs combined with experimental analysis [20] provided a basis for further characterization and functional analysis of miRNAs in Euphorbiaceae species. The current study using high-throughput sequencing method greatly enriches our knowledge in identifying miRNAs in castor bean and facilitates more particular and specific miRNA studies castor bean and other members of the family Euphorbiaceae as well. High-throughput sequencing analyses have become one of the major sources supporting miRNA annotations [22]–[24]. This study is the first report on identification and characterization of miRNAs and generates a large number of small RNA sequence reads using high-throughput sequencing techniques in castor bean. Studies to elucidate the number of miRNA molecules sequenced from these small RNA sequence reads are still needed for more accurate small RNA profiling studies. In term of reads, the small RNA libraries sequenced finally yielded a large number of unannotated reads after new miRNA screen in this study. These remaining unannotated reads could remain for further analyzing characterization of siRNA populations in castor bean. Usually, miRNA isoform variants are considered to be a consequence of inaccuracies in Dicer pre-miRNA processing [41]. However, sequence length variation often have been overlooked, as small variations in the sequence length might not have been thought to alter the function of individual miRNAs, as they are directed to their target genes by base pairing [34]. Recent studies had showed that miRNAs and their isoform variants in length broadly co-existed and these variants might lead to functional differentiation, in particular, when the variation occurs in the 5′-end and gives rise to a alternation of the miRNA and argonaute (AGO) binding [36], [42]. A decrease in abundance of the 21 nt isoform variant reduces miR168 homeostasis and leads to developmental defects in Arabidopsis and sequence length heterogeneity for plant miRNAs often is essential for correct plant development and environmental responses [36]. Although most of the isoform variants identified from the length variant group exhibit 3′ heterogeneity, little is known about the biological interest of the variation in length occurring in 3′-end of miRNAs. In this study, small RNA sequences from libraries were considered as miRNA isoforms only if they were similar to a reference miRNA identified in miRBase and had a significantly greater number of reads compared to those found for the reference miRNA in all five libraries. From these analyses for isoform identification, 16 miRNA isoforms involving 10 miRNA families were added to the total number of conserved miRNA families identified in castor bean. Six miRNA isoforms displayed 5′ heterogeneity and ten displayed 3′ heterogeneity. Whether these isoform variants detected in castor bean have functional differentiation and play different regulatory roles in plant growth and environmental responses are yet unknown. The expressional differentiation of these isoform variants and their references among tissues, however, imply their functional divergence, if these isoform variants have their biological interest. In addition, those variant sequences with missing bases and low frequencies produced from high-throughput sequencing could be viewed as degradation products or pyrophosphate sequencing errors. Application of deep sequencing technology can shed considerable novel lights hidden in the small RNA transcriptome data not only for identification of new conserved miRNAs, but also for successful discovery of novel miRNAs with high accuracy and efficiency [41]. Our current study has led to the discovery of 23 new conserved and 72 novel miRNA candidates in castor bean. These new miRNA candidates largely enriched the miRNA database for castor bean and Euphorbiaceae members. However, only seven new conserved and 15 novel miRNAs were validated using experimental RT-PCR method, though 58 of 72 novel miRNA candidates had been categorized as highly confident according to previous strict miRNA annotation criteria, with 35 represented both the miRNA and miRNA*. Most of novel miRNA candidates identified in this study have not been validated. The most likely reason is due to the limit of RT-PCR method when target miRNAs tested have a low expression [23], [37]. Thus, validity of these novel miRNA candidates need to be further confirmed. When comparing the numbers of miRNAs identified using the same high-throughput sequencing approach between rubber tree [43] and castor bean, we found that castor bean appeared to have less conserved miRNAs (86) involving 27 miRNA families than rubber tree which had 115 conserved miRNAs, covering 56 families. Further, we found that all homologs of 27 conserved miRNA families of castor bean in rubber tree, but we did not find any homolog of the 72 novel miRNAs identified from castor bean in other members of Euphorbiaceae including rubber tree [43], [44], Jatropha curcas [45] and Manihot esculenta [46], implying that the 72 novel miRNAs detected might represent castor bean species-specific miRNAs. Compared to the target genes identified in other plants, rco-miR167, rco-miR172 and rco-miR482 exhibited similar targets to their homologs in Arabidopsis [47] and maize [25]. However, four conserved miRNAs newly identified (including rco-miR396, rco-miR827, rco-miR2111 and rco-miR4414) and most of the novel miRNAs in castor bean displayed species-specific targets. In addition, high-throughput sequencing technologies can serve as a powerful miRNA expression profiling tool to identify the differentially expressed miRNAs, providing the basis for future analysis of miRNA functions and elucidating underlying mechanisms in regulating diverse molecular and physiological pathways [12], [37]. In the study, comparison of their expression patterns among different tissues shows that 49, 42, 42 and 60 of 86 conserved miRNAs are significantly differentially expressed between seed1/leaf, seed1/root, seed1/seed2 and seed1/endosperm, respectively. Similarly, many of the miRNA*, isoform variants and novel miRNAs identified in this study presented differential expression patterns among tissues sampled. Although the biological function of miRNAs in castor bean is unclear the expressional differentiation of these miRNAs among tissues provides a clue for further investigation of the physiological roles of miRNAs in castor bean. Castor bean is of an important oilseed crop worldwide, containing significant amounts of lipid and protein. In this study, we searched for miRNAs that might play a function in regulating biological processes related to the biosynthesis of lipid and protein in developing seeds and endosperms. Our results demonstrated that ten miRNAs (rco-miR156f,e, rco-miR159, rco-miR168, rco-miR390a, rco-miR393a, rco-miR396a, rco-miR408, rco-miR003 and rco-miR020) had 21 target genes, which were involved in amino acid metabolism, fatty acid metabolism and lipid metabolism with differential expressions at different stages of seed development. These results imply that the ten miRNAs might have a physiological role in regulating lipid and protein biosynthesis in castor bean. In summary, we have identified and characterized a large number of miRNAs from castor bean, analyzed their expression and predicted the putative targets of these miRNAs. It will be very important to experimentally characterize these miRNAs and their downstream targets, as this will lead to a better understanding of the function relationship and mechanism of miRNAs in the regulation network. In particular, our high-throughput sequencing approach to miRNA discovery suggests that a significant number of novel miRNAs remain to be further analyzed and characterized. The current study is the first report on identification and characterization of miRNA using the high-throughput sequencing approach in castor bean.

Materials and Methods

Ethics Statement

No specific permits were required for the described field studies. No specific permissions were required for these locations and activities. The location is not privately-owned or protected in any way and the field studies did not involve endangered or protected species.

Sample Preparation and Total RNA Extraction

Seeds of castor bean var. ZB306 elite inbred line (provided kindly by Zibo Academy of Agricultural Sciences, Shandong, China) were cultivated in the greenhouse of Xishuangbanna tropical botanical garden (Kunming branch) with the temperature of day at 24–26°C and night at 18–20°C with the humidity controlled at 60–80%. Leaf tissue was collected from a fully expanded young leaf and root tips were collected, washed and dissected. Immature seeds at two different stages, i.e. seed1 at the initial stage (15 days after pollination) and seed2 at the fast oil accumulation stage (35 days after pollination) of seed development, were collected. Endosperm tissue was dissected from the immature seeds (40 days after pollination). The developing seeds did not start to accumulate TAG at the initial stage (seed1) and fast accumulated TAG at the fast oil accumulation stage (seed2, see Figure S2). Total RNA was extracted from the leaf, root tip, immature seed (seed1 and seed2) and endosperm tissues separately using Trizol (TaKaRa, Dalian, China) following the manufacturer’s protocol. The quality of total RNA samples was tested using both the NanoDrop Spectrometer (ND-1000 Spectrophotometer, Peqlab) and agarose gel (1.5%) electrophoresis.

Small RNA Library Construction and Sequencing

Total RNA samples were firstly processed by 15% denaturing polyacrylamide gel electrophoresis (PAGE). The small RNA fragments in the range of 16–30 nt in length were isolated from the gel and purified by sRNAs gel extraction Kit (TaKaRa Bio, Otsu, Japan). Then, the 5′ and 3′ termini of the small RNA were linked with proprietary adapters sequentially and RT-PCR was performed to amplify RNA to DNA, which can be used as templates to produce sequencing libraries. At last, approximately 20 µg sequencing libraries were produced and Illumina Solexa Genome Analyzer was employed to sequence the generated libraries.

Small RNA Sequencing Analysis

After sequencing, we trimmed the adaptor sequences, filtered out the low quality tags and eliminated contamination of adaptor sequences. Non-coding RNAs including rRNA, tRNA, snRNA and snoRNA were identified by reads alignment to the Pfam 10.1 (http://www.sanger.ac.uk/software/Rfam) and GeneBank databases. After removing non-coding RNAs, the clean small RNA sequences ranging from 16–28 nt were collected and mapped to the castor bean genome for getting the unique reads with abundance and position on the genome using SOAP 2.0 program (http://soap.genomics.org.cn/). The unique RNA sequences that perfectly matched the castor bean genome were subjected to subsequent analysis. Sequence reads overlapping with exons and introns of mRNA were excluded to avoid DNA contamination or mRNA degradation products.

Identification of Conserved, Isoform and Novel miRNAs

In order to determine conserved miRNAs, the trimmed unique reads were aligned against the mature or precursor of conserved castor bean miRNAs in the miRBase [48]. Only the small RNA sequences that perfectly matched known castor bean miRNAs were considered to be conserved miRNAs. To find new conserved miRNAs, the remaining reads were aligned with mature plant miRNA sequences in miRBase allowing at most two mismatches. According to the genomic positions of new conserved miRNA candidates identified, we retrieved the flanking genomic sequences around matched loci to form possible precursors of candidate miRNAs with the Mfold program [49]. Those candidate sequences containing a typical RNA stem-loop with at least 18 bp in matched regions and having folding energy no greater than −18 kcal/mol were considered as new conserved miRNAs. Meanwhile, we inspected stem-loop structures for each miRNAs identified in castor bean and defined the star miRNA sequences based on Dicer-cleavage rules as implemented in the miRDeep software tool [50]. With the purpose of identifying miRNA isoforms, the sequence reads from all libraries that perfectly mapped in the annotated miRNA precursor sequences but not representing annotated miRNA mature and star sequences, were not shifted more than four positions from their original mature or star 5′ position and have a total number of reads 50% greater than the total reads of their reference miRNA were considered as isoform miRNAs in castor bean. If no reference miRNA for a variant was previously detected in all libraries, the variant with the highest frequency was considered. To identify the novel miRNAs, the unannotated reads that were identical to genome sequence were collected and the flanking sequences around matched position were retrieved. The MIREAP pipeline (https://sourceforge.net/projects/mireap/) was used to analyze their characteristic hairpin structure of miRNA precursor. Those reads which could meet criteria including having a characteristic hairpin structure and the Dicer cleavage site with a maximum free energy of ∼25kcal/mol, minimal matched base pairs of miRNA and miRNA* exceeding 16 nt, the sequence length of 20–23 nt and the reads abundance >100, were considered as novel miRNAs. The filtered pre-miRNA sequences were folded again using Mfold and checked manually.

Validation of miRNAs Newly Identified

To validate castor bean miRNAs newly identified in this study, a modified oligo (dT) primers RT-PCR approach as described by Fiedler et al. [51] was performed. Briefly, after total miRNAs were extracted from plant tissues, polyA tails to all transcript miRNAs were added, and then transcript miRNAs with polyA tails were reversely transcribed into cDNAs using a set of 12 modified oligo(dT) primers containing a unique sequence tag at the 5′ end and two bases at the 3′ end. This step reaction converts all miRNAs into cDNAs with ∼90bp length. Further, RT-PCR amplification is achieved using a primer specific to the miRNA in interest and a primer specific to the tag. In our study, total miRNA was isolated from leaves, root tips and developing seeds of castor bean using Plant MicroRNA Extraction Kit (BIOTEKE, Beijing, China), following the manufacturer’s instructions. MiRNA reverse transcription reactions were performed using One Step miRNA 1st cDNA Synthesis Kit (HaiGene Biotech, Haerbin, China) in a 20 µL reaction solution containing 1000 ng miRNAs, 4 µL 4x One Step miRNA RT solution, 2 µL 10x miRNA RT Primers, and RNase- free water was used to adjust the total volume of the reverse transcription reaction to 20 µL. The miRNA reverse transcription reactions were incubated in an Eppendorf Mastercycler (Eppendorf North America, Westbury, NY) for 60 min at 37°C, followed by 5 min at 95°C, and then 4°C until further use. For PCR amplification, 86 specific primers were designed based on mature miRNA sequences for amplifying 95 miRNAs new indentified (see Table S4). The RT-PCR reactions were performed in a 10 µL volume containing 1 µL diluted reverse transcription product, 1×PCR buffer, 0.2 mM dNTPs, 2.0 U EasyTaq DNA polymerase (TransGen Biotech, Beijing, China), and 0.5 µM specific miRNA primer and universal primer (5′-TTACCTAGCGTATCGTTGAC-3′) on Eppendorf Mastercycler. The PCR reaction conditions used were as follows: 2 min at 95°C, followed by 38 cycles of denaturation for 5 s at 95°C, annealing for 5s at 55–60°C, extension for 35s at 70°C, and then 4°C. PCR amplification products were confirmed on 1.5% agarose gel.

Differential Expression Analysis

To investigate the differentially expressed miRNAs among castor bean leaf, root, seed1, seed2 and endosperm, miRNAs considered for this analysis were the conserved miRNAs (Table 2). Firstly, each miRNAs read count was normalized against the total number of miRNA reads in each given sample. Subsequently, the fold-change (log2(sample1/sample2) and P-value were calculated from the normalized expression, and significantly difference of a given miRNA was determined by the P≤0.001 and fold-change ≥1 in two samples.

Prediction of miRNA Targets

The whole genome and transcript databases of castor bean (http://castorbean.jcvi.org/index.php) provide a rich resource for predictions of miRNA targets. The putative target sites of miRNA candidates were identified by aligning the miRNA sequences with the genome and transcript database of castor bean. Allen et al.’s and Schwab et al’s criteria [39], [40] were used in our analysis, i.e.: each G:U wobble pairing was assigned 0.5 point; each indel was assigned 2.0 points; all other noncanonical Watson-Crick pairings were each assigned 1.0 point; no more than two adjacent mismatches in the miRNA/target duplex with a minimum free energy (MFE) of the miRNA/target duplex 75% greater than the MFE of the miRNA bound to it’s perfect complement. The second structures of newly identified 95 miRNAs including 23 conserved (*) miRNAs and 72 novel pre-miRNAs in castor bean. (DOC) Click here for additional data file. Developing seeds of castor bean and lipid (triacylglycerols, TAG) accumulation at two different developmental stages. (DOC) Click here for additional data file. The conserved miRNAs identified from castor bean and their distribution among miRNA families. (DOC) Click here for additional data file. The expressional differentiation of conserved miRNAs identified between seed1/leaf, seed1/root, seed1/seed2, seed1/endosperm, respectively. (XLS) Click here for additional data file. Novel rco-miRNAs identified and their expression levels in castor bean. (XLS) Click here for additional data file. Putative targets for the conserved 23 miRNAs newly identified (*) and 72 novel miRNAs in castor bean. (XLS) Click here for additional data file. The 86 primers designed for RT-PCR amplification of 95 miRNAs newly identified in this study. (XLS) Click here for additional data file.
  45 in total

1.  The microRNA Registry.

Authors:  Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  High-throughput sequencing discovery of conserved and novel microRNAs in Chinese cabbage (Brassica rapa L. ssp. pekinensis).

Authors:  Fengde Wang; Libin Li; Lifeng Liu; Huayin Li; Yihui Zhang; Yingyin Yao; Zhongfu Ni; Jianwei Gao
Journal:  Mol Genet Genomics       Date:  2012-05-29       Impact factor: 3.291

3.  Specific effects of microRNAs on the plant transcriptome.

Authors:  Rebecca Schwab; Javier F Palatnik; Markus Riester; Carla Schommer; Markus Schmid; Detlef Weigel
Journal:  Dev Cell       Date:  2005-04       Impact factor: 12.270

4.  Computational and analytical framework for small RNA profiling by high-throughput sequencing.

Authors:  Noah Fahlgren; Christopher M Sullivan; Kristin D Kasschau; Elisabeth J Chapman; Jason S Cumbie; Taiowa A Montgomery; Sunny D Gilbert; Mark Dasenko; Tyler W H Backman; Scott A Givan; James C Carrington
Journal:  RNA       Date:  2009-03-23       Impact factor: 4.942

5.  Investigation of the microRNAs in safflower seed, leaf, and petal by high-throughput sequencing.

Authors:  Haiyan Li; Yuanyuan Dong; Yepeng Sun; Erle Zhu; Jing Yang; Xiuming Liu; Ping Xue; Yanshuang Xiao; Shulin Yang; Jinyu Wu; Xiaokun Li
Journal:  Planta       Date:  2010-12-07       Impact factor: 4.116

6.  Regulation of flowering time and floral organ identity by a MicroRNA and its APETALA2-like target genes.

Authors:  Milo J Aukerman; Hajime Sakai
Journal:  Plant Cell       Date:  2003-10-10       Impact factor: 11.277

7.  Small RNA diversity in plants and its impact in development.

Authors:  Christine Lelandais-Brière; Céline Sorin; Marie Declerck; Abdelali Benslimane; Martin Crespi; Caroline Hartmann
Journal:  Curr Genomics       Date:  2010-03       Impact factor: 2.236

Review 8.  MicroRNA biogenesis and function in plants.

Authors:  Xuemei Chen
Journal:  FEBS Lett       Date:  2005-08-09       Impact factor: 4.124

9.  Transcriptome-wide identification and characterization of miRNAs from Pinus densata.

Authors:  Li-Chuan Wan; Haiyan Zhang; Shanfa Lu; Liang Zhang; Zongbo Qiu; Yuanyuan Zhao; Qing-Yin Zeng; Jinxing Lin
Journal:  BMC Genomics       Date:  2012-04-06       Impact factor: 3.969

10.  AGO1 homeostasis involves differential production of 21-nt and 22-nt miR168 species by MIR168a and MIR168b.

Authors:  Hervé Vaucheret
Journal:  PLoS One       Date:  2009-07-30       Impact factor: 3.240

View more
  18 in total

1.  Unravelling the complexity of microRNA-mediated gene regulation in black pepper (Piper nigrum L.) using high-throughput small RNA profiling.

Authors:  Srinivasan Asha; Sweda Sreekumar; E V Soniya
Journal:  Plant Cell Rep       Date:  2015-09-23       Impact factor: 4.570

2.  Mining NGS transcriptomes for miRNAs and dissecting their role in regulating growth, development, and secondary metabolites production in different organs of a medicinal herb, Picrorhiza kurroa.

Authors:  Ira Vashisht; Prashant Mishra; Tarun Pal; Sreekrishna Chanumolu; Tiratha Raj Singh; Rajinder Singh Chauhan
Journal:  Planta       Date:  2015-02-07       Impact factor: 4.116

3.  Genomic DNA Methylation Analyses Reveal the Distinct Profiles in Castor Bean Seeds with Persistent Endosperms.

Authors:  Wei Xu; Tianquan Yang; Xue Dong; De-Zhu Li; Aizhong Liu
Journal:  Plant Physiol       Date:  2016-04-28       Impact factor: 8.340

4.  Transcriptome-Wide Identification of miRNAs and Their Targets from Typha angustifolia by RNA-Seq and Their Response to Cadmium Stress.

Authors:  Yingchun Xu; Lingling Chu; Qijiang Jin; Yanjie Wang; Xian Chen; Hui Zhao; Zeyun Xue
Journal:  PLoS One       Date:  2015-04-29       Impact factor: 3.240

5.  Genomic imprinting, methylation and parent-of-origin effects in reciprocal hybrid endosperm of castor bean.

Authors:  Wei Xu; Mengyuan Dai; Fei Li; Aizhong Liu
Journal:  Nucleic Acids Res       Date:  2014-05-05       Impact factor: 16.971

6.  Genome-Wide Identification of MicroRNAs and Their Targets in the Leaves and Fruits of Eucommia ulmoides Using High-Throughput Sequencing.

Authors:  Lin Wang; Hongyan Du; Ta-Na Wuyun
Journal:  Front Plant Sci       Date:  2016-11-08       Impact factor: 5.753

7.  Identification of bolting-related microRNAs and their targets reveals complex miRNA-mediated flowering-time regulatory networks in radish (Raphanus sativus L.).

Authors:  Shanshan Nie; Liang Xu; Yan Wang; Danqiong Huang; Everlyne M Muleke; Xiaochuan Sun; Ronghua Wang; Yang Xie; Yiqin Gong; Liwang Liu
Journal:  Sci Rep       Date:  2015-09-15       Impact factor: 4.379

8.  Virus versus host plant microRNAs: who determines the outcome of the interaction?

Authors:  Fatemeh Maghuly; Rose C Ramkat; Margit Laimer
Journal:  PLoS One       Date:  2014-06-04       Impact factor: 3.240

9.  Genome-wide survey and expression profiles of the AP2/ERF family in castor bean (Ricinus communis L.).

Authors:  Wei Xu; Fei Li; Lizhen Ling; Aizhong Liu
Journal:  BMC Genomics       Date:  2013-11-13       Impact factor: 3.969

10.  Genome-Wide Identification, Evolutionary Analysis, and Stress Responses of the GRAS Gene Family in Castor Beans.

Authors:  Wei Xu; Zexi Chen; Naeem Ahmed; Bing Han; Qinghua Cui; Aizhong Liu
Journal:  Int J Mol Sci       Date:  2016-06-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.