
Genome-wide analysis of small RNA and novel microRNA discovery during fiber and seed initial development in Gossypium hirsutum L.

Hua Zhang, Qun Wan, Wenxue Ye, Yuanda Lv, Huaitong Wu, Tianzhen Zhang.

Abstract

Cotton is the world's most important source of renewable natural textile fiber and also a source of oil. MicroRNAs (miRNAs) are endogenous, non-coding RNAs approximately 18-24 nucleotides long that negatively regulate their target genes. Two largely overlapping small RNA libraries were constructed and sequenced, and served as replicate data sets for identifying miRNAs involved in fiber initiation and seed development. The D genome sequence of Gossypium raimondii was used together with EST sequences to predict miRNA precursors. Overall, 93 new miRNA precursors were identified, of which 28 belonged to 10 known families and the other 65 were considered to be novel miRNAs. Nearly 700 EST sequences were proposed as candidate target genes, which encode transcription factors and proteins with diverse functions. Some of the novel miRNAs and candidate target genes were validated by Northern blot and rapid amplification of 5' cDNA ends (5' RACE).

Year:  2013        PMID: 23922789      PMCID: PMC3726788          DOI: 10.1371/journal.pone.0069743

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

MicroRNAs (miRNAs) are endogenous, non-coding RNA molecules approximately 18–24 nucleotides in length. In plant cells, Dicer-like 1 (DCL1) carries out both catalytic processing steps in the nucleus, producing a ∼21-nucleotide mature miRNA/miRNA* duplex (miRNA* being the strand complementary to the miRNA), which is then exported to the cytoplasm [1]–[5]. Increasing evidence shows that plant miRNAs negatively regulate their target genes, which function in a wide range of developmental processes [6], by mRNA cleavage or translational repression [7]. A steadily growing number of plant miRNAs populate databases such as miRBase [8], and sets of precursors and mature miRNAs have been discovered in many different plant genomes. From 2007 onwards, almost the entire growth of miRBase has been driven by deep sequencing experiments [8]. Cotton is the source of the most important renewable natural textile fiber in the world and is of great importance to the textile industry. Cotton ‘fibers’ are trichomes derived from epidermal cells of the developing seed [9]. These trichomes share many similarities with those found on Arabidopsis thaliana leaves, which could therefore serve as a model for elucidating the genetic mechanisms that control cotton fiber and seed development [10]. In Arabidopsis and maize (Zea mays L.), it has been shown that miR172, which targets the Apetala 2 (AP2) transcription factor, and miR156, which targets a series of Squamosa promoter binding protein-like (SPL) transcription factors, regulate developmental transitions and guide various aspects of reproductive development in complementary patterns [11]–[14]. In Arabidopsis, miR156 temporally controls trichome distribution during flowering via its target SPL9; together they define an endogenous flowering pathway and establish a direct link between developmental programming and trichome distribution [15].
Plants that overexpressed miR156 developed ectopic trichomes on the stem and floral organs [15]. MiR165/166 targets the class III Homeodomain leucine zipper (HD-ZIP III) family of transcription factors, which have been widely investigated because of their important functions in regulating organ polarity and morphogenesis [6]. The abnormal distribution of trichomes in two mutants of the Incurvata (Icu) gene, which encodes ATHB15, is probably due to a single-nucleotide mutation within the region of miRNA-mRNA complementarity [16]. Auxin plays a critical role in cotton fiber initiation, and the initiating fiber cells may be the site of indole-3-acetic acid (IAA) accumulation [17]; auxin also promotes seed development [18]. Both miR167 and miR160 can target AUXIN RESPONSE FACTOR (ARF) mRNAs. Nevertheless, the targets of miR160 and miR167 appear to have opposite roles in controlling the expression of the auxin homeostatic enzyme GH3 [6]. It has also been documented that miR167 is essential for the fertility of both ovules and anthers [19]. Mutations in the miR167 target sites of ARF6 or ARF8 cause ectopic expression of these genes in ovules, which results in arrested development of the integuments and ovule sterility [19]. During early embryogenesis, miRNAs repress the expression of LEAFY COTYLEDON2 and FUSCA3, while ARABIDOPSIS 6B-INTERACTING PROTEIN1-LIKE1 (ASIL1), ASIL2 and the histone deacetylase HDA6/SIL1 act as downstream components of miRNAs to repress maturation [20]. At present, 43 cotton miRNA genes are registered in miRBase (Release 19.0, August 2012), encoding 45 mature miRNAs including two miRNA* sequences. In addition, 348 miRNAs were proposed based on the genome of G. raimondii released by the Cotton Genome Project (CGP) [21].
To further understand the roles miRNAs play in young cotton ovules during fiber cell initiation, we constructed two small RNA libraries from cotton ovules at critical stages of fiber cell initiation and obtained about 33 million clean reads in total. We aligned the small RNAs with EST sequences released on NCBI and with the D genome sequence of G. raimondii [21], [22], and predicted candidate precursors. After strict filtration, 93 new miRNA precursors were identified; 28 of these produce mature miRNAs that belong to 10 miRNA families, and the other 65 were proposed as novel cotton miRNAs. A total of 693 EST sequences were identified as candidate target genes, which are involved in diverse functions in addition to transcription factors.

Materials and Methods

Plant materials preparation and total RNA isolation

The Upland cotton, Gossypium hirsutum acc. Texas Marker-1 (TM-1), was grown at Jiangpu Breeding Station, Nanjing (JBS/NAU) in 2010. Flowers were tied up one day before anthesis (−1 days post anthesis, DPA) to ensure self-pollination. The −3 DPA and −1 DPA flowers were identified by flower size and shape. Flowers and bolls were harvested at around 4 pm after pollination and stored on ice. On the same day, ovules were carefully dissected from each boll, frozen in liquid nitrogen and stored at −70°C. Total RNA was extracted using the CTAB method [23]. Total RNAs from −3, −1, 0 and 1 DPA ovules were pooled in equal proportions to form the TM-1L-A library. The −1, 0, 1, 3 and 5 DPA samples were pooled in the same way to form the TM-1L-B library.

Small RNA sequencing and annotation

Solexa sequencing technology was used to sequence the small RNAs from the pooled cotton ovule samples. After the length distribution of the clean reads was summarized, the small RNA reads were mapped to cotton ESTs from NCBI and to the D genome sequence to analyze their expression and distribution on the reference sequences. In the standard bioinformatics analysis, clean reads were annotated against rRNA, scRNA, snoRNA, snRNA and tRNA sequences from GenBank and Rfam, and matching reads were removed. The remaining mapped sequences were used to predict miRNA precursors based on common criteria [24].
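The length-distribution summary mentioned above can be illustrated with a minimal sketch. The function name `length_distribution` and the toy read counts are assumptions made for illustration only; they are not taken from the sequencing pipeline used in this study.

```python
from collections import Counter

def length_distribution(reads):
    """Return {read length: percent of all reads} for (sequence, count) pairs."""
    totals = Counter()
    for seq, count in reads:
        totals[len(seq)] += count
    all_reads = sum(totals.values())
    return {n: 100.0 * c / all_reads for n, c in sorted(totals.items())}

# Toy input: real mature sequences from Table 2, but the counts are invented.
toy_reads = [
    ("UCGGACCAGGCUUCAUUCCCC", 3),     # 21 nt
    ("UUGCCUACUCCACCCAUGCCAC", 2),    # 22 nt
    ("UUACAAUUCCAUUGAUUAAACCGU", 5),  # 24 nt
]
dist = length_distribution(toy_reads)
```

Applied to the full set of clean reads, a summary of this kind would give the per-length percentages reported in Figure 1.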

Criteria for the prediction of new miRNAs

The prediction software Mireap (http://sourceforge.net/projects/mireap), developed by BGI (Beijing Genome Institute at Shenzhen, China), was used to predict new miRNAs by examining the secondary structure, the Dicer cleavage site and the minimum free energy (MFE) of small RNA reads that could be mapped to genome or EST sequences. The primary criterion was that the small RNA was precisely excised from the stem of a stem-loop precursor [24]. Specifically, only dominant mature sequences were considered that were counted more than 10 times, were located within the stem region of the stem-loop structure, were 20–24 nt long and had an MFE of at most −20 kcal·mol⁻¹. A maximum of four unpaired nucleotides between the miRNA and miRNA* was allowed. To enhance the reliability of the results and to support consistent processing of miRNAs at the 5′-end, we retained only candidates for which the majority of reads had an identical 5′-end. The selected sequences were then folded into a secondary structure using the RNA-folding program mFold 3.2 [25]. Candidates were accepted when a perfect stem-loop structure was formed, the small RNA sequence sat on one arm of the stem, and asymmetric bulges within the miRNA/miRNA* duplex were minimal in size (one base at most) and infrequent (typically one or fewer). Small RNAs that met these conditions were proposed to be new cotton miRNAs.
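The numerical thresholds listed above can be read as a simple pass/fail filter. The sketch below is not Mireap's implementation; the `Candidate` record, its field names and the example values are hypothetical, and "the majority of reads" is assumed here to mean more than half.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    sequence: str            # dominant mature sequence
    read_count: int          # number of times the sequence was sequenced
    mfe_kcal_mol: float      # minimum free energy of the hairpin
    unpaired_in_duplex: int  # unpaired nt between miRNA and miRNA*
    on_stem_arm: bool        # lies within one arm of the stem-loop
    top_5p_fraction: float   # share of reads with the dominant 5' end

def passes_filters(c: Candidate) -> bool:
    """Apply the thresholds stated in the text (sketch, not Mireap itself)."""
    return (
        c.read_count > 10
        and 20 <= len(c.sequence) <= 24
        and c.mfe_kcal_mol <= -20.0
        and c.unpaired_in_duplex <= 4
        and c.on_stem_arm
        and c.top_5p_fraction > 0.5   # assumed reading of "majority"
    )

# Hypothetical candidates: same sequence, different hairpin stability.
good = Candidate("UUGGACAGAGUAAUCACGGUCG", 150, -45.2, 2, True, 0.9)
weak = Candidate("UUGGACAGAGUAAUCACGGUCG", 150, -12.0, 2, True, 0.9)
```

The structural checks on bulges made after folding with mFold are not modeled here, since they depend on the predicted secondary structure.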

Northern blotting of mature miRNAs

A total of 10 µg RNA was resolved on 15% polyacrylamide (29∶1) gels containing 8 M urea and 2×SSC (0.3 M NaCl, 30 mM sodium citrate, pH 7) under denaturing conditions. RNA was blotted onto a Hybond-N+ nylon membrane (Roche, Basel, Switzerland) using a Trans-Blot SD Semi-Dry Electrophoretic Transfer Cell (Bio-Rad, Hercules, CA, USA). The membrane was then UV cross-linked at 1200 mJ in a Stratagene UV Stratalinker 1800 for 60 sec. An antisense oligonucleotide (20 pmol) for each miRNA (Table S5) was 5′-end labeled (γ-32P-ATP) by T4 polynucleotide kinase (NEB, Ipswich, MA, USA) to detect mature miRNAs. U6 was used as the loading control. Hybridization was performed in 5×SSC, 20 mM Na2HPO4, 3×Denhardt's solution (1×Denhardt's: 0.02% Ficoll, 0.02% polyvinylpyrrolidone, and 0.02% BSA), and 0.7% SDS with competitor herring sperm DNA (0.1 mg/ml, Sigma-Aldrich, St. Louis, MO, USA). Prehybridization (2 h) and hybridization (overnight) were performed at 37°C. After hybridization, the membrane was washed twice with 2×SSC and 0.2×SDS at 37°C for 10 min. Hybridized membranes were exposed to a storage phosphor screen (GE Healthcare, Buckinghamshire, UK) for 1–4 days and the screens were scanned using a Storm 825 phosphorimager (GE Healthcare, Buckinghamshire, UK).

Target genes prediction

The targets of miRNAs were predicted with the web tool psRNATarget [26], using the Gossypium (cotton) DFCI Gene Index (CGI) Release 11 (http://compbio.dfci.harvard.edu/index.html) as the sequence library for the target search. No more than three mismatches between a miRNA and its target were allowed, and in particular no mismatches were allowed within the maximum expectation region. Gene Ontology (GO) annotation with Blast2GO [27] was used to infer functional descriptions of the target genes.
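To make the mismatch cut-off concrete, the following sketch counts positions at which a candidate target site departs from perfect Watson-Crick complementarity to a miRNA. It is not psRNATarget's scoring scheme: the function names are hypothetical, and G:U wobble pairs are simply treated as mismatches here, which is a simplifying assumption.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna):
    """Return the perfectly complementary site, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))

def count_mismatches(mirna, target_site):
    """Count non-Watson-Crick positions between a miRNA and a target site.

    Both sequences are given 5'->3'; EST (DNA) input is converted to RNA.
    """
    site = target_site.upper().replace("T", "U")
    if len(site) != len(mirna):
        raise ValueError("miRNA and target site must have the same length")
    perfect = reverse_complement(mirna.upper())
    return sum(1 for a, b in zip(perfect, site) if a != b)

mir156d = "UGACAGAAGAGAGUGAGCAC"            # mature sequence quoted in the Results
perfect_site = reverse_complement(mir156d)  # fully complementary site
```

A site would be kept under the rule above when `count_mismatches(mirna, site) <= 3`; the additional requirement on the maximum expectation region is specific to psRNATarget and is not reproduced in this sketch.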

RNA ligase-mediated 5′ RACE

To validate the cleavage sites of target genes, we performed RNA ligase-mediated (RLM) rapid amplification of 5′ complementary DNA ends (5′ RACE) using a GeneRacer™ kit (Invitrogen, Carlsbad, CA, USA). Four mg of RNA from 3 DPA ovules was ligated to a 5′ RACE RNA adapter without calf intestine alkaline phosphatase treatment. cDNA was synthesized with reverse transcriptase using the GeneRacer™ Oligo dT primer and used as the template for PCR amplification with the GeneRacer™ 5′ Primer, the GeneRacer™ 5′ Nested Primer and two gene-specific reverse primers for each RACE. In each case, the PCR products were gel purified, cloned, and sequenced. PCR primers are shown in Table S5.

Results

Distribution of small RNA in cotton fiber cells

Deep sequencing of the two libraries yielded more than thirty-three million reads: 17,027,097 for TM-1L-A and 16,441,172 for TM-1L-B (Table 1). After removal of low-quality reads and several kinds of contaminant tags, 32,665,490 (97.6%) clean reads remained, of which 8,205,472 (TM-1L-A) and 7,834,823 (TM-1L-B) were distinct reads used for further analysis. Among the clean reads, 24-nucleotide sequences were by far the most numerous, accounting for 72% and 70% of the total reads (Figure 1). The 22 nt reads ranked second (9.2% and 9.57%), followed by the 21 nt (6.38% and 6.81%) and 23 nt (6.25% and 6.51%) reads.
Table 1

Statistics of small RNA sequence reads.

                TM-1L-A                                    TM-1L-B
Category        Distinct reads       %  All reads       %  Distinct reads       %  All reads       %
Total                  8205472           16622462                 7834823           16043028
Match genome           5042549  61.45%   10842192  65.23%         4844473  61.83%   10590216  66.01%
rRNA                     48298   0.59%     465537   2.80%           56026   0.72%     673245   4.20%
siRNA                   269364   3.30%    1108447   6.67%          247443   3.16%     992828   6.19%
snRNA                      947   0.01%       1772   0.01%            1090   0.01%       2075   0.01%
snoRNA                     669   0.01%       1378   0.01%             716   0.01%       1534   0.01%
tRNA                      4051   0.05%      34052   0.20%            5577   0.07%      54078   0.34%
Unannotated            7863905  95.84%   14284880  85.94%         7506820  95.81%   13595072  84.74%
Figure 1

Sequence length distribution of cotton small RNA libraries of TM-1L-A and TM-1L-B.

24-nucleotide reads made up a significantly greater proportion (∼70%) than reads of other lengths.

The small RNA reads were mapped to the genome: more than 60% of the distinct reads, and 65% and 66% of the total reads, mapped to the Gossypium genome (DOE Joint Genome Institute: Cotton D V1.0). Approximately 4% of the distinct reads matched non-coding rRNA, scRNA, snoRNA, snRNA and tRNA, which accounted for 9.69% and 10.7% of the total reads of TM-1L-A and TM-1L-B, respectively (Table 1). The majority of the remaining reads were recorded as unannotated reads for further analysis.

miRNAs were expressed at different levels during fiber cell initiation

A total of 33 miRNA families were identified in TM-1L-A and TM-1L-B. MiR166 was the most abundant miRNA in the young ovules and fibers of cotton, with a total of 153,922 reads across the two libraries [78,159 in TM-1L-A and 75,763 in TM-1L-B (Table 2)]. Expression levels varied considerably among families, probably because the cotton ovule is a highly differentiated organ in which genes are expressed dynamically during development. For example, miR166 and miR172 were sequenced more than seventy thousand (∼42%) and sixty thousand (∼35%) times, respectively, in each of the two libraries. In contrast to these highly abundant miRNAs, miR828, miR475 and miR1023 (Table 2) were detected fewer than 10 times, which supports the view that miRNA expression may be developmentally and/or tissue specific [28]. Members of the same family were not expressed equally either. For example, among the miR156/157 members, the mature sequence miR157a/b (UUGACAGAAGAUAGAGAGCAC) accumulated to a much higher level than miR156d (UGACAGAAGAGAGUGAGCAC). The two libraries TM-1L-A and TM-1L-B largely overlapped and the majority of the families were expressed at similar levels in both, but there were some exceptions (Table 2). Six miRNAs, miR160, miR394, miR397, miR398, miR482 and miR2111, were significantly upregulated in TM-1L-B compared with TM-1L-A, which suggests that miRNA expression levels change continuously during early cotton fiber and ovule development.
Table 2

Conserved miRNA families expression in cotton.

                                                   TM-1L-A               TM-1L-B
Family      Mature sequence           Total reads  Raw reads        RPM  Raw reads        RPM   Fold
miR156/157  UUGACAGAAGAUAGAGAGCAC            8374       4743     285.34       3631     226.33   1.26
miR159/319  UUUGGAUUGAAGGGAGCUCUA            5066       3235     194.62       1831     114.13   1.71
miR160      UGCCUGGCUCCCUGUAUGCCA            1383        229      13.78       1154      71.93   0.19
miR162      UCGAUAAACCUCUGCAUCCAG            2340       1315      79.11       1025      63.89   1.24
miR164      UGGAGAAGCAGGGCACGUGCA           10285       4189     252.01       6096     379.98   0.66
miR166      UCGGACCAGGCUUCAUUCCCC          153922      78159    4702.01      75763    4722.49   1.00
miR167      UGAAGCUGCCAGCAUGAUCU            13953       7691     462.69       6262     390.33   1.19
miR168      UCGCUUGGUGCAGGUCGGGAA             790        374      22.50        416      25.93   0.87
miR169      CAGCCAAGGAUGAUUUGCCGG            1096        465      27.97        631      39.33   0.71
miR171      UGAUUGAGCCGUGCCAAUAUC            1279        618      37.18        661      41.20   0.90
miR172      AGAAUCCUGAUGAUGCUGCAG          130236      66013    3971.31      64223    4003.17   0.99
miR390      AAGCUCAGGAGGGAUAGCGCC            5370       2839     170.79       2531     157.76   1.08
miR393      UCCAAAGGGAUCGCAUUGAUC              99         54       3.25         45       2.80   1.16
miR394      UUGGCAUUCUGUCCACCUCC              135         10       0.60        125       7.79   0.08
miR395      CUGAAGUGUUUGGGGGAACUC              21         10       0.60         11       0.69   0.88
miR396      UUCCACAGCUUUCUUGAACUU             703        307      18.47        396      24.68   0.75
miR397      UCAUUGAGUGCAGCGUUGAUG             163         34       2.05        129       8.04   0.25
miR398      UUCUCAGGUCACCCCUUUGGG             791        167      10.05        624      38.90   0.26
miR399      UGCCAAAGGAGAUUUGCCCGG              30          9       0.54         21       1.31   0.41
miR403      UUAGAUUCACGCACAAACUCG            7670       3799     228.55       3871     241.29   0.95
miR408      AUGCACUGCCUCUUCCCUGGC              30          6       0.36         24       1.50   0.24
miR475      UUACAAUUCCAUUGAUUAAACCGU            2          1       0.06          1       0.06   0.97
miR482      UUGCCUACUCCACCCAUGCCAC           1707        597      35.92       1110      69.19   0.52
miR530      UGCAUUUGCACCUGCACCUUC             123         46       2.77         77       4.80   0.58
miR535      UUGACAACGAGAGAGAGCACG            2878       1375      82.72       1503      93.69   0.88
miR827      UUAGAUGACCAUCAACAAACA             156         73       4.39         83       5.17   0.85
miR828      UCUUGCUCAAAUGAGUAUUCCA              8          2       0.12          6       0.37   0.32
miR1023     ACACUCUGUCAUAUCGUCUGC               4          1       0.06          3       0.19   0.32
miR2111     UAAUCUGCAUCCUGAGGUUUG             249         82       4.93        167      10.41   0.47
miR2947     UAUACCGUGCCCAUGACU              10178       4814     289.61       5364     334.35   0.87
miR2948     UGUGGGAGAGUUGGGCAAGAAU            284        160       9.63        124       7.73   1.25
miR2949     ACUUUUGAACUGGAUUUGCCGA           4719       2435     146.49       2284     142.37   1.03
miR3476     UGAACUGGGUUUGUUGGCUGC            3095       1534      92.28       1561      97.30   0.95

RPM, reads per million. Fold, fold change of TM-1L-A/TM-1L-B.
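As a worked example of the normalization used in Table 2, the sketch below recomputes the RPM and fold values for miR156/157. It assumes that the normalizer is the total number of clean reads per library given in Table 1 (16,622,462 and 16,043,028); this assumption reproduces the tabulated values to two decimals, but the function and constant names are ours, not the authors'.

```python
def rpm(raw_reads, library_total):
    """Reads per million: raw count scaled by library size."""
    return raw_reads / library_total * 1_000_000

# Total clean reads per library, taken from Table 1 (assumed normalizer).
TOTAL_A = 16_622_462  # TM-1L-A
TOTAL_B = 16_043_028  # TM-1L-B

# miR156/157 raw reads from Table 2.
rpm_a = rpm(4743, TOTAL_A)
rpm_b = rpm(3631, TOTAL_B)
fold = rpm_a / rpm_b  # fold change of TM-1L-A / TM-1L-B

print(f"{rpm_a:.2f} {rpm_b:.2f} {fold:.2f}")
```

Under this definition a fold value below 1 indicates higher normalized abundance in TM-1L-B, as for miR160 (0.19) and miR394 (0.08).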

Identification of new miRNAs in cotton

Thanks to deep sequencing, a few new conserved and less conserved MIRNA genes were also identified. Overall, 28 MIRNA genes from 10 families survived a series of strict filtrations (Table S1 and Figure S1) and were named according to the common nomenclature rules [8]: a three-letter prefix indicates the source of the sequence, while suffixes distinguish different precursors that produce the same mature sequence. In addition, 65 precursors containing 43 small RNAs were identified that did not show sufficient similarity to any currently known miRNA, and these were labeled as novel miRNAs (Table S2 and Figure S2). Some of the novel MIRNAs produce very similar or even identical mature sequences; we reasoned that they probably evolved from the same ancestor, and different letter suffixes were added to distinguish them. The precursor sequences were numbered consecutively from MIR7234 to MIR7276, and the mature sequences were named after their precursors as miR7234 to miR7276. The majority of the miRNAs (23 of the 43, 53.5%) have a uridine at the 5′ terminus (Figure S3), consistent with a preference for AGO1 [29]. Unlike the miRNAs of the conserved families, none of the 43 novel miRNAs displayed unequal expression levels between the two libraries TM-1L-A and TM-1L-B (Table 3). MiR7234 accumulated abundantly, with more than twenty thousand reads. MiR7235, miR7236 and miR7237 were also detected at relatively high levels, with more than one thousand reads in each library. However, the majority of the novel miRNAs (26, 60.5%) were detected at very low levels, with fewer than 50 reads.
Table 3

Novel miRNAs identified in cotton.

                                                        TM-1L-A               TM-1L-B
miRNA ID Mature miRNA sequence    Arm  G+C (%)   Total  Raw reads        RPM  Raw reads        RPM   Fold
miR7234  UUGGACAGAGUAAUCACGGUCG   5′    50.00%  425743     199505   12002.37     226238   14101.95   0.85
miR7235  UUUUGGAAGAAUUUCAGCUGG    5′    38.10%   10165       4656     280.11       5509     343.39   0.82
miR7236  ACAGCUUUAGAAAUCAUCCCU    5′    38.10%    5446       2924     175.91       2522      157.2   1.12
miR7237  UUACUUUAGAUGUCUCCUUCA    3′    33.33%    2414       1372      82.54       1042      64.95   1.27
miR7238  UCCAUAUUUCACUAUCUCUUA    3′    28.57%    1022        442      26.59        580      36.15   0.74
miR7239  UGAAUAUUGUUAAAGUAGAAA    3′    19.05%     506        240      14.44        266      16.58   0.87
miR7240  AAUAAGGGGCUUAGAAAGAUG    3′    38.10%     508        264      15.88        244      15.21   1.04
miR7241  GAUUUGGGGCAAAGACGGGAU    3′    52.38%     374        212      12.75        162       10.1   1.26
miR7242  AGGCUCUUUGUAGAAUCAGGAG   3′    45.45%     364        185      11.13        179      11.16      1
miR7243  UUCAGAAACCAUCCCUUCCUU    5′    42.86%     314        108        6.5        206      12.84   0.51
miR7244  UGGACUUAGCUGCCAAGUUUG    3′    47.62%     283        150       9.02        133       8.29   1.09
miR7245  UUCCAUGUCACAGAGAUGUUG    5′    42.86%     180         99       5.96         81       5.05   1.18
miR7246  GGAAUGUUGUCUGGACCGGGG    5′    61.90%     172         89       5.35         83       5.17   1.03
miR7247  UUGUGAUGUUUGUGAGGAACA    3′    38.10%     110         60       3.61         50       3.12   1.16
miR7248  UUGAAAAGAAUCCUUCAAACGU   3′    31.82%     103         54       3.25         49       3.05   1.07
miR7249  UCUGACAGUGCACUGAAAACG    3′    47.62%     101         43       2.59         58       3.62   0.72
miR7250  UCACAGGGAUCAAAAUUGGGA    3′    42.86%      68         33       1.99         35       2.18   0.91
miR7251  UAAGUGAAGAAAGAGGUAGGUU   5′    36.36%     154         76       4.57         78       4.86   0.94
miR7252  UGCUACUUGUAGUUAUGCAUG    3′    38.10%      90         48       2.89         42       2.62    1.1
miR7253  AUCAUGCGAUCCCUUCGGAAU    3′    47.62%      60         26       1.56         34       2.12   0.74
miR7254  AGCCCGAUUUUGGGCCUAGU     3′    55.00%      48         28       1.68         20       1.25   1.34
miR7255  AUGGAUGAAAUUUUUAACAGA    3′    23.81%      90         43       2.59         47       2.93   0.88
miR7256  CUUGGUAGAGCACAGGAGACA    3′    52.38%      48         20        1.2         28       1.75   0.69
miR7257  UGAUGGAGAUAGGUAUCUGCA    5′    42.86%      37         20        1.2         17       1.06   1.13
miR7258  AUAUGAUUUGUUAAGGCAAG     5′    30.00%      32         13       0.78         19       1.18   0.66
miR7259  UUAGAUCAAAGAGUAAACUAAUU  5′    21.74%      31         13       0.78         18       1.12    0.7
miR7260  UAGAAACUCGAUCGUCUUCU     3′    40.00%      46         16       0.96         30       1.87   0.51
miR7261  UCUGUCGCAGGGGAGAUGGCUG   5′    63.64%      37          7       0.42         30       1.87   0.22
miR7262  UAGACACUCUGGGCACAAUAG    3′    47.62%      28         12       0.72         16          1   0.72
miR7263  AUUGAUCUGUAUCGAUUAUCU    3′    28.57%      24         12       0.72         12       0.75   0.96
miR7264  ACUCUCUUCCAAAGGCUUCAAG   5′    45.45%      29          6       0.36         23       1.43   0.25
miR7265  CCACCGUCGAGGGUUCGAGAUCG  3′    65.22%      23         10        0.6         13       0.81   0.74
miR7266  AAUGGCAUUGAUGUAGCAGCU    5′    42.86%      21          1       0.06         20       1.25   0.05
miR7267  UUGUACGUUAGAUUAAAGAGC    5′    33.33%      24          7       0.42         17       1.06    0.4
miR7268  UUUAAAUCUAUAAAGACUCCA    5′    23.81%      17          7       0.42         10       0.62   0.68
miR7269  AAUGGAGGAGUUGGAAAGAUU    5′    38.10%      23         11       0.66         12       0.75   0.88
miR7270  CACAAUACUUCCACCAUUGAG    3′    42.86%      12         12       0.72          -          -      -
miR7271  CAAUUCUUCAAUCGCACGUCG    3′    47.62%      11         11       0.66          -          -      -
miR7272  UUCACAUGUUGAAUUACUUGG    5′    33.33%      20         11       0.66          9       0.56   1.18
miR7273  UUGAUAUCAUACUUGAGACUC    5′    33.33%      20         10        0.6         10       0.62   0.97
miR7274  ACUAAAAAAUGGGCAAAUUAG    5′    28.57%      10          3       0.18          7       0.44   0.41
miR7275  AGGUACUAAAUUGAAUAUUGA    3′    23.81%       7          7       0.42          -          -      -
miR7276  AGUGAAUUAAGAACAAACUUU    5′    23.81%      11          5        0.3          -          -      -

RPM, Reads per million. Fold, fold change of TM-1L-A/TM-1L-B.
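The G+C column of Table 3 and the 5′-nucleotide preference discussed in the text can be recomputed directly from the mature sequences. The sketch below uses three sequences copied from Table 3; the function names are hypothetical and the subset is for illustration only, so the 5′-uridine fraction it yields is not the 23/43 figure reported for the full set.

```python
def gc_percent(seq):
    """Percentage of G and C bases in an RNA sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

def five_prime_u_fraction(seqs):
    """Fraction of sequences that start with uridine."""
    return sum(1 for s in seqs if s.startswith("U")) / len(seqs)

# Three mature sequences taken from Table 3.
examples = {
    "miR7234": "UUGGACAGAGUAAUCACGGUCG",
    "miR7239": "UGAAUAUUGUUAAAGUAGAAA",
    "miR7246": "GGAAUGUUGUCUGGACCGGGG",
}
gc_values = {name: round(gc_percent(seq), 2) for name, seq in examples.items()}
u_fraction = five_prime_u_fraction(list(examples.values()))
```

For these three sequences the computed G+C values agree with the tabulated 50.00%, 19.05% and 61.90%.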

In total, 487 miRNA precursors were identified in cotton, with an average length of 128.2 nucleotides (Figure S4), whereas in miRBase (release 19) the average length of 5,166 plant pre-miRNAs was 148.8 nt. The 93 precursors newly identified from the EST and D genome sequences had an average length of 140 nt and were longer than the precursors released in miRBase (131.4 nt). Interestingly, the precursors of the 65 novel MIRNAs (148.4 nt) were much longer than those of the conserved families (124.9 nt). As none of the novel miRNAs had homologs in miRBase (Release 19), we supposed that they were Gossypium-specific or restricted to closely related species. To test whether they are genuine miRNAs, three of the novel miRNAs, miR7235, miR7244 and miR7251, were randomly selected for Northern blotting. Total RNA from −1 DPA and 3 DPA ovules was blotted onto a Hybond-N+ membrane, and the Northern blots showed that these three novel miRNAs were expressed equally at −1 DPA and 3 DPA (Figure 2), consistent with the deep sequencing results.
Figure 2

The Northern blot analysis of three novel miRNAs in cotton ovules at −1 and 3 DPA.

Target prediction and validation of cotton miRNAs

In total, 693 EST sequences were obtained and are proposed as candidate target genes (Tables S3 and S4). The 33 known MIRNA families had 300 predicted target genes, while the 43 novel miRNAs targeted 395 EST sequences. The highly conserved miRNA families have highly conserved target genes, such as miR156 and SPL (Table S3). The targets of the novel miRNAs encoded a much broader range of proteins than those regulated by miRNAs from the more conserved families (Table S4). To validate potential targets of miRNAs that might play crucial roles in early ovule and fiber development, two predicted target genes were selected as examples and assayed using 5′ RACE. Sequencing of the 5′ RACE products of the two miRNA targets showed that most cleavage sites mapped to the miRNA-complementary sequences (Figure 3).
Figure 3

5′ RACE verification of predicted miRNA target genes.

Cleavage sites of the two selected targets of two miRNAs, as identified by 5′ RACE analysis. For each miRNA, the miRNA sequence is shown on the top and the target sequence on the bottom. Arrows indicate the cleavage site of the mRNA, and the frequency of clones is shown under the arrow.


Discussion

Two independent libraries validate each other

Cotton ‘fibers’ are unicellular trichomes derived from epidermal cells of the ovule [9]. The cotton fiber cells undergo a physiological change at −3 DPA, when the elongation potential of the cell is determined, although elongation does not start until anthesis [30], [31]. On the day of anthesis (0 DPA), hemispherical protrusions arise on the surface of cotton ovules and grow into fiber cells. The fuzz cells produce a visible morphological phenotype at 4∼5 DPA. The development of fiber cells is a rapid and continuous process, and the two libraries produced here can serve as replicate data sets that validate each other. TM-1L-A was pooled from −3, −1, 0 and 1 DPA, and TM-1L-B was pooled from −1, 0, 1, 3 and 5 DPA. The results from the two libraries show little variation across the full dynamic range in the log-log plot (Figure 4), and the Pearson's correlation coefficient is 0.99. The differences between the two data sets may reveal distinctions between fiber cell initiation and elongation, as well as fuzz cell initiation.
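A library-to-library comparison of this kind can be sketched as a Pearson correlation of log-transformed read counts. The sketch below is only illustrative: it uses the raw read counts of six conserved families from Table 2 rather than all unique small RNAs, and it assumes that the correlation is taken on log10 counts, which the text does not state explicitly. Its result is therefore not expected to equal the reported 0.99.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def log_log_correlation(counts_a, counts_b):
    """Correlate log10 counts of small RNAs detected in both libraries."""
    shared = [(a, b) for a, b in zip(counts_a, counts_b) if a > 0 and b > 0]
    log_a = [math.log10(a) for a, _ in shared]
    log_b = [math.log10(b) for _, b in shared]
    return pearson(log_a, log_b)

# Raw reads (TM-1L-A, TM-1L-B) for miR156/157, miR166, miR172,
# miR160, miR394 and miR167, taken from Table 2.
counts_a = [4743, 78159, 66013, 229, 10, 7691]
counts_b = [3631, 75763, 64223, 1154, 125, 6262]
r = log_log_correlation(counts_a, counts_b)
```

Restricting the calculation to small RNAs found in both libraries mirrors the red points of Figure 4, since a zero count has no logarithm.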
Figure 4

Sequence read counts of small RNAs obtained by sequencing the two libraries.

Each point represents a unique small RNA in this log-log scatter plot. The points in red are small RNAs found in both libraries.

In total, more than thirty-three million reads were obtained from TM-1L-A and TM-1L-B combined (Table 1), which offered great potential for discovering novel miRNAs. Among all the reads, the 24-nucleotide reads were far more numerous than all others (∼70%; Figure 1), consistent with many other reports [32]–[35]. The majority of miRNAs are 21 nt long [36], as the length of the miRNA/miRNA* duplex is determined by the “molecular ruler” property of DCL1 [37]. In miRBase (Release 19.0), 347 mature miRNAs are recorded for A. thaliana, of which 264 are 21 nt long (76.1%). In rice, 24 nt miRNAs make up 18.3% of the miRNAs currently known (132 of 721). The large number of 24 nt small RNAs found in cotton in this study might be small interfering RNAs (siRNAs). In A. thaliana, the relatively longer small RNAs are predominantly produced in floral structures (inflorescences), where DCL3, the enzyme responsible for their synthesis, is ten times more abundant than in leaves [38]. The ovules and fibers are contained within flowers, and we presume that the abundance of 24 nt small RNAs may be due to high DCL3 expression in the associated floral organs. Other reports on cotton miRNAs proposed a similar explanation for this phenomenon [32]–[35], [39], [40].

Identification of novel miRNAs in cotton

Since 2007, almost all of the growth of miRBase has been driven by deep sequencing experiments, which have identified novel miRNAs by the tens or hundreds per experiment [8]. Considering the complicated nature of plant small RNAs, a series of strict filters was used to enhance the reliability of the results presented here. No allopolyploid cotton genome sequence is currently available; therefore, ESTs from NCBI and the genome sequence of G. raimondii [21], [22], the closest living relative of the D-genome progenitor of allotetraploid cottons [41], [42], were used as reference sequences to predict the miRNA precursors. To support positive identification, mature miRNA sequences had to be detected in both libraries or have more than ten reads in at least one library. With the help of high-depth sequencing and the EST and genome sequences of G. raimondii, there is greater potential to find new miRNAs of Gossypium. In total, 93 new MIRNA genes were identified as eligible candidates for further investigation (Tables S1 and S2). A BLAST search was performed to assign them to families on the basis of conservation, which is a powerful indicator of functional relevance and ancient origin [24]. In total, 28 MIRNA genes were found to belong to 10 families (Table S1), which left the other 65 as novel miRNAs (Table S2). In miRBase, apart from a few conserved miRNA families, the majority of the families are restricted to species or subfamily lineages (miRBase, release 19). Because none of the novel miRNAs reported here has a homolog in miRBase (Release 19), we propose that they are Gossypium-specific or restricted to species closely related to cotton. A total of 416 precursors were predicted from the G. raimondii sequence. No significant regularity in their distribution was found, but there was a specific genomic region on Chr. D5 where the MIRNAs clustered (Figure S5).
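The read-support rule stated above (detected in both libraries, or more than ten reads in at least one library) can be written as a small predicate. This is a sketch of the rule as worded in the text; the function name and the example counts are hypothetical.

```python
def is_supported(count_a, count_b, min_reads=10):
    """Check the read-support rule for one mature sequence.

    count_a / count_b are raw read counts in TM-1L-A and TM-1L-B.
    """
    in_both = count_a > 0 and count_b > 0
    abundant_in_one = count_a > min_reads or count_b > min_reads
    return in_both or abundant_in_one

# Hypothetical read counts for three candidate sequences.
examples = {
    "seen in both libraries": is_supported(3, 7),
    "one library, 12 reads": is_supported(12, 0),
    "one library, 4 reads": is_supported(4, 0),
}
```

Keeping the two conditions as separate named booleans makes it easy to tighten either one independently if a stricter rule is wanted.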

Function of miRNAs in fiber and seed initial development

Cotton fibers share many similarities with Arabidopsis leaf trichomes, which could serve as a model for elucidating the genetic mechanisms that control the development of cotton fiber and seeds [10]. MiRNAs have been shown to play an important role in cotton fiber and seed development. MiR166 was the most abundant miRNA in young cotton ovules and fibers, with more than 75,000 reads in each of the two libraries used here (Table 2). The targets of miR166 were predicted to encode Class III HD-ZIP transcription factors, which function in organ polarity and morphogenesis [6]. MiR166 was also shown to be involved in the distribution of trichomes [16]. Another miRNA shown to control trichome development in Arabidopsis is miR156; together with its target, SPL9, it defines an endogenous flowering pathway, a finding that established a direct link between developmental programming and trichome distribution [15]. Thousands of miR156/157 reads accumulated during the initial development of cotton fibers and seeds, and fifteen targets of miR156/157 were predicted to encode SPL transcription factors. MiR172 was also highly expressed, with more than sixty thousand reads in each of the two libraries (Table 2). MiR172 targets the AP2 transcription factor in Arabidopsis and maize, and it has been shown that miR172 and miR156 together regulate developmental transitions in complementary patterns and guide various aspects of reproductive development [11]–[14]. The expression level of SPL9 increases in conjunction with inflorescence development, which reflects a decrease in miR156 [15]. MiR156 regulates the expression of miR172 via SPL9 which, redundantly with SPL10, directly promotes the transcription of miR172b [12].
The cotton ovules have high expression levels of miR172 and relatively low levels of miR156/157, which is consistent with their opposing regulation patterns [6] and implies that they play an important role in the development of this organ. Auxin plays a crucial role in seed development [18] and cotton fiber initiation [17]. Both miR167 and miR160 can target ARF mRNAs to regulate the accumulation of auxin; in Arabidopsis, miR160 targets ARF10, ARF16 and ARF17, while miR167 targets ARF6, ARF8 and IAR3. Plants expressing a miRNA-resistant version of ARF17 have increased ARF17 mRNA levels and reduced accumulation of auxin-inducible GH3-like mRNAs, which encode auxin-conjugating proteins [43]. However, OsGH3-2 was positively regulated by ARF8 [44]. Meanwhile, another target of miR167, IAA-Ala Resistant3 (IAR3), encodes an enzyme that hydrolyzes an inactive form of auxin (IAA-alanine) and releases bioactive auxin (IAA) [45]. In TM-1L-A and TM-1L-B, the expression levels of miR167 were 33-fold and five-fold higher than those of miR160, respectively, as miR160 was notably upregulated in TM-1L-B (Table 2). These results are consistent with high auxin accumulation during fiber initiation and with the continuous reduction of endogenous IAA levels in ovules from 1 DPA to 3 DPA [17]. MiR167 was also shown to be essential for the fertility of ovules, and a relatively high level of IAA is imperative for proper zygote polarity and development [19], [46]. Besides miR160, miR394, miR397, miR398, miR482 and miR2111 were also significantly upregulated in TM-1L-B compared with TM-1L-A, suggesting that the miRNA population changed during the early stages of cotton ovule development. MiR397 targets genes encoding a series of laccases and steroid-binding proteins by cleavage or translational repression. The targets of miR398 are Cu/Zn SUPEROXIDE DISMUTASE (SOD) mRNAs. The targets of miR394, miR482 and miR2111 are genes encoding enzymes or other proteins.
All of these target genes are known to be involved in the regulation of metabolic processes. A function for miRNAs as safeguards against unwanted gene expression is a common theme in eukaryotes [6], [7], [47]. The increase of these six miRNAs in TM-1L-B suggests that, once cell fate had been determined, it was no longer necessary to maintain the same level of gene expression after fiber initiation. All 43 novel miRNAs described here were expressed at similar levels in the TM-1L-A and TM-1L-B libraries (Table 3). MiR7234 accumulated to high levels, with more than twenty thousand reads, and targeted two ESTs, which correspond to a starch synthase and a retrotransposon. MiR7235, miR7236 and miR7237 were detected with more than one thousand reads in each library. However, the majority of the novel miRNAs (26, 60.5%) were detected fewer than 50 times. Northern blots confirmed that miR7235, miR7244 and miR7251 were expressed at similar levels between −1 DPA and 3 DPA (Figure 2). Combining the Northern blot results with the sequencing data, we infer that the expression levels of these three novel miRNAs change little during the early stage of fiber development. Unlike the targets of conserved miRNA families, which tend to be transcription factors, the targets of less conserved miRNAs are more varied and cover a wide range of functions in cotton, such as metabolism, transcription factors, structural proteins and signal transduction, which play essential roles in seed set development [18].

Secondary structure of the precursors of the new members of the known families. (TIF)
Secondary structure of the precursors of the novel miRNAs. (TIF)
Analyses of nucleotide bias at each position along the novel miRNAs. (TIF)
Length distribution of MIRNA precursors in plants. (TIF)
Distribution of MIRNAs on the 13 assembled chromosomes. (TIF)
New members of known families of microRNAs. (XLS)
Novel miRNAs identified in the present report. (XLS)
Targets of known cotton miRNAs. (XLS)
Targets of novel miRNAs identified in the present report. (XLS)
Primers and probes used in this study. (XLSX)
  41 in total

1.  MicroRNAs in plants.

Authors:  Brenda J Reinhart; Earl G Weinstein; Matthew W Rhoades; Bonnie Bartel; David P Bartel
Journal:  Genes Dev       Date:  2002-07-01       Impact factor: 11.361

Review 2.  MicroRNA networks and developmental plasticity in plants.

Authors:  Ignacio Rubio-Somoza; Detlef Weigel
Journal:  Trends Plant Sci       Date:  2011-04-04       Impact factor: 18.313

3.  Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.

Authors:  Andrew H Paterson; Jonathan F Wendel; Heidrun Gundlach; Hui Guo; Jerry Jenkins; Dianchuan Jin; Danny Llewellyn; Kurtis C Showmaker; Shengqiang Shu; Joshua Udall; Mi-jeong Yoo; Robert Byers; Wei Chen; Adi Doron-Faigenboim; Mary V Duke; Lei Gong; Jane Grimwood; Corrinne Grover; Kara Grupp; Guanjing Hu; Tae-ho Lee; Jingping Li; Lifeng Lin; Tao Liu; Barry S Marler; Justin T Page; Alison W Roberts; Elisson Romanel; William S Sanders; Emmanuel Szadkowski; Xu Tan; Haibao Tang; Chunming Xu; Jinpeng Wang; Zining Wang; Dong Zhang; Lan Zhang; Hamid Ashrafi; Frank Bedon; John E Bowers; Curt L Brubaker; Peng W Chee; Sayan Das; Alan R Gingle; Candace H Haigler; David Harker; Lucia V Hoffmann; Ran Hovav; Donald C Jones; Cornelia Lemke; Shahid Mansoor; Mehboob ur Rahman; Lisa N Rainville; Aditi Rambani; Umesh K Reddy; Jun-kang Rong; Yehoshua Saranga; Brian E Scheffler; Jodi A Scheffler; David M Stelly; Barbara A Triplett; Allen Van Deynze; Maite F S Vaslin; Vijay N Waghmare; Sally A Walford; Robert J Wright; Essam A Zaki; Tianzhen Zhang; Elizabeth S Dennis; Klaus F X Mayer; Daniel G Peterson; Daniel S Rokhsar; Xiyin Wang; Jeremy Schmutz
Journal:  Nature       Date:  2012-12-20       Impact factor: 49.962

4.  IAA-Ala Resistant3, an evolutionarily conserved target of miR167, mediates Arabidopsis root architecture changes during high osmotic stress.

Authors:  Natsuko Kinoshita; Huan Wang; Hiroyuki Kasahara; Jun Liu; Cameron Macpherson; Yasunori Machida; Yuji Kamiya; Matthew A Hannah; Nam-Hai Chua
Journal:  Plant Cell       Date:  2012-09-07       Impact factor: 11.277

5.  The GIGANTEA-regulated microRNA172 mediates photoperiodic flowering independent of CONSTANS in Arabidopsis.

Authors:  Jae-Hoon Jung; Yeon-Hee Seo; Pil Joon Seo; Jose Luis Reyes; Ju Yun; Nam-Hai Chua; Chung-Mo Park
Journal:  Plant Cell       Date:  2007-09-21       Impact factor: 11.277

6.  The draft genome of a diploid cotton Gossypium raimondii.

Authors:  Kunbo Wang; Zhiwen Wang; Fuguang Li; Wuwei Ye; Junyi Wang; Guoli Song; Zhen Yue; Lin Cong; Haihong Shang; Shilin Zhu; Changsong Zou; Qin Li; Youlu Yuan; Cairui Lu; Hengling Wei; Caiyun Gou; Zequn Zheng; Ye Yin; Xueyan Zhang; Kun Liu; Bo Wang; Chi Song; Nan Shi; Russell J Kohel; Richard G Percy; John Z Yu; Yu-Xian Zhu; Jun Wang; Shuxun Yu
Journal:  Nat Genet       Date:  2012-08-26       Impact factor: 38.330

7.  Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.

Authors:  Ana Conesa; Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Manuel Talón; Montserrat Robles
Journal:  Bioinformatics       Date:  2005-08-04       Impact factor: 6.937

8.  Genome-wide profiling of miRNAs and other small non-coding RNAs in the Verticillium dahliae-inoculated cotton roots.

Authors:  Zujun Yin; Yan Li; Xiulan Han; Fafu Shen
Journal:  PLoS One       Date:  2012-04-25       Impact factor: 3.240

9.  MicroRNA159 can act as a switch or tuning microRNA independently of its abundance in Arabidopsis.

Authors:  Maria M Alonso-Peral; Cheng Sun; Anthony A Millar
Journal:  PLoS One       Date:  2012-04-12       Impact factor: 3.240

10.  'Evidence of an auxin signal pathway, microRNA167-ARF8-GH3, and its response to exogenous auxin in cultured rice cells'.

Authors:  Ji Hyun Yang; So Jeong Han; Eun Kyung Yoon; Woo Sung Lee
Journal:  Nucleic Acids Res       Date:  2006-04-05       Impact factor: 16.971

