Literature DB >> 22493671

Identification and characterization of microRNAs in Asiatic cotton (Gossypium arboreum L.).

Min Wang1, Qinglian Wang, Baomin Wang.   

Abstract

To date, no miRNAs have been identified in the important diploid cotton species although there are several reports on miRNAs in upland cotton. In this study, we identified 73 miRNAs, belonging to 49 families, from Asiatic cotton using a well-developed comparative genome-based homologue search. Several of the predicted miRNAs were validated using quantitative real time PCR (qRT-PCR). The length of miRNAs varied from 18 to 22 nt with an average of 20 nt. The length of miRNA precursors also varied from 46 to 684 nt with an average of 138 ±120 nt. For a majority of Asiatic cotton miRNAs, there is only one member per family; however, multiple members were identified for miRNA 156, 414, 837, 838, 1044, 1533, 2902, 2868, 5021 and 5142 families. Nucleotides A and U were dominant, accounted for 62.95%, in the Asiatic cotton pre-miRNAs. The Asiatic cotton pre-miRNAs had high negative minimal folding free energy (MFE) and adjusted MFE (AMFE) and high MFE index (MFEI). Many miRNAs identified in Asiatic cotton suggest that miRNAs also play a similar regulatory mechanism in diploid cotton.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22493671      PMCID: PMC3320894          DOI: 10.1371/journal.pone.0033696

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Cotton is an important economic crop due to its role in providing a source of fiber, textile, and oils. Besides the fact that cotton is the most important fiber crop; cotton is also the second most important resource for plant protein and the fifth most important oil crop. Cotton seeds contain about 23% proteins, and it is estimated that cotton seeds could potentially provide the protein required by about half billion people annually. Because cotton seeds are rich in oil content (approximately 21%), it has become the fifth most important plant oil resource in the world.. Although cotton is an important crop, scientific research and genetic knowledge of cotton is much behind other crops, such as rice, maize, wheat and soybean. The genus Gossypium is a large family, which consists of approximately 50 species originating from different geographic areas, majorly from Australia and Latin America. However, only four of them are domesticated and produce valuable spinnable fiber. The four domesticated species are Upland cotton (Gossypium hirsutum L.), Island cotton (G. barbadense L.), African cotton (G. herbaceum L.) and Asiatic cotton (G. arboreum L.). The Upland cotton and Island cotton are allotetraploids (AADD, 2n  =  4x  =  52); African cotton (A1A1) and Asiatic cotton (A2A2) are diploids (2n  =  2x  =  26). Because tetraploid cotton (Upland Island cotton) have much higher fiber yield as well longer fibers than the diploid cotton ( African and Asiatic cotton), tetraploid cotton, particularly the Upland cotton, is currently widely cultivated around the world. Despite their poor yield and fiber quality, the diploid species have many other attractive agronomic traits [1]. For example, several investigations demonstrated that the Asiatic cotton was tolerant and even resistant to several environmental biotic stresses, such as resistance to several pests and diseases, including bollworms [2], aphids [3], leafhopper [3], rust, fungal [4], and viral [5] diseases. Asiatic cotton also contains traits tolerant to environmental abiotic stress, including drought and salinity stress [6], [7]. Thus, Asiatic cotton easily grows in dry land with low input cultivation practices. Many Asiatic cotton cultivars produce fibers with high strength and some produc seeds with high oil content. All of these traits make Asiatic cotton a valuable resource of genetic germplasm or gene pool for modern cotton cultivar improvement and the cotton industry [1]. Additionally, diploid cotton also has an advantage over tetraploid cotton in elucidating gene structure and function because their genome is much more simple as compared to the tetraploid species. Unfortunately, very few studies have focused on diploid cotton, particularly on the molecular level. Understandinggene function and their regulatory mechanisms could provide new insight into the mechanisms of cotton growth and development, particularly on cotton fiber initiation and development and the response to environmental abiotic and biotic stresses. The expected results of this study will facilitate efficient usage of the diploid cotton species for developing improved cotton cultivars with favorable fiber yield and other key agronomic traits. microRNAs (miRNAs) are a newly identified class of extensively endogenous small RNA molecules and are approximately 20–22 nucleotides in length. Increasing amounts of evidence shows that miRNAs play an essential role in almost all biological and metabolic processes, such as plant organ differentiation and development, signal transduction, phase change, and response to environmental biotic and abiotic stress [8]. For example, miR 172 controls flower development and phase change from vegetative growth to reproductive growth though targeting the apetala 2 gene [9], [10], [11], [12]. miRNAs are different from other RNAs, including small interfering RNAs (siRNAs). The important features of miRNAs include the following: 1) all miRNAs are coded by miRNA genes with unknown length; a miRNA gene is first transcribed into a long primary RNAs, called primary miRNA (pri-miRNAs). A pri-miRNA is sequentially cleaved into a short stem-loop structured miRNA precursor (pre-miRNA) and then to a mature miRNA by an enzyme called dicer-like 1 (dcl 1) with the help of other several other enzymes. 2) All pre-miRNAs can form a stem-looped hairpin secondary structure with the mature miRNA siting on one of the arms. 3) A pre-miRNA forms its secondary structure with high negative minimum folding free energy (MFE) and minimum folding free energy index (MFEI). 4) All miRNAs have their star sequence, the opposite site sequence, termed miRNA*, in the secondary hairpin structure; miRNA and miRNA* sequences are almost complementary to each other with few mismatches; therefore miRNA and miRNA* sequences always form miRNA:miRNA* complexes during miRNA biogenesis. In most cases, only miRNAs function in regulating gene expression and the miRNA* sequence is degraded by an unknown mechanism. However, in some cases, the miRNA* sequence also can function to target a specific gene. More interestingly, many miRNAs are highly evolutionarily conserved from species to species in both animal and plant kingdoms although no clue for that miRNAs were a common ancestor for plant and animal miRNAs. This conservation ranges a long distance, such as from worms to humans in animals [13], [14], [15] and from moss and ferns to high flowering plants in plants [16]. This provides a powerful strategy to identify miRNAs in a new plant species using already known miRNAs in a known plant species through the use of a comparative genome-based homologue search. However, conservation is not the only criteria used for identifying miRNAs using a homologue-based Blastn search because miRNAs are short and it is difficult to distinguish a miRNA sequence from their targeted gene sequences. When using homologue-based Blastn searches for miRNAs, other criteria, such as their secondary hairpin stem-loop structure must also be considered. Since comparative genome-based homologue searcesh were employed to identify plant miRNAs in the middle of 2000s [17], this strategy has been widely adopted by many research laboratories [18]. Based on a rough approximation, using this method, thousands of miRNAs have been systematically identified in more than 30 plant species, including many important plant and agricultural crop species, such as soybean [19], [20], maize [21], tomato [22], tobacco [23], potato [24], [25], wheat [26], Brassica napus [27], citrus [28], switchgrass [29] and apple [30]. According to miRBase, a public miRNA database (Release 18, November 2011), there are a total of 4,014 plant miRNAs that have been currently identified and deposited into this database [31]. These miRNAs are obtained from 52 plant species, ranging from ferns and mosses to high flowering plants. However, no study has been reported in Asiatic cotton, one of the most important cotton species. In this study, we employed a comparative genome-based homologue search to identify miRNAs in Asiatic cotton using current Asiatic cotton expressed sequence tags (ESTs) available in the NCBI Genbank database. Based on our findings, 73 miRNAs, belonging to 49 families have been identified for the first time in Asiatic cotton. A majority of these miRNAs also play important roles in other plant species. We will also report the major features of these newly identified Asiatic cotton miRNAs.

Materials and Methods

Reference Set of miRNAs

Currently, a total of 4,014 miRNAs from 52 plant species have been deposited in miRBase, a public available miRNA database. These miRNAs are defined as a reference set of miRNA sequences for identifying potential miRNAs in Asiatic cotton. All currently available mature miRNAs and their precursor sequences were downloaded from miRBase (http://www.mirbase.org/; Release 18, November 2011). After the miRNA sequences were downloaded, mature miRNA sequences were Blasted against each other to remove identical sequences. Only unique sequences were kept for later Blastn seaches for identifying Asiatic cotton miRNAs.

Asiatic Cotton ESTs, cDNAs, and mRNAs

Asiatic cotton expressed sequence tags (ESTs), cDNAs, and mRNAs were downloaded from the GenBank nucleotide databases at the National Center for Biotechnology Information (NCBI). Currently, a total of 41,781 Asiatic cotton ESTs are available in the NCBI EST database (dbEST release 120111, December 1, 2011; http://www.ncbi.nlm.nih.gov/dbEST/dbEST_summary.html).

Identifying Potential Asiatic Cotton miRNAs Using a Comparative Genome-based Homologue Search and Secondary Structure Analysis

A comparative genome-based homologue search is a powerful approach for identifying conserved miRNAs in both plants and animals. Since it was developed in 2005, this method has been widely employed to identify miRNAs in various plant species and also in animals. In this study, we employed the previously published method to identify Asiatic cotton miRNAs from currently available EST sequences. A comparative genome-based homologue search is based on two key parameters: conservation of miRNA sequences and the second stem-loop hairpin structure of the potential pre-miRNAs. Both factors should be strictly considered during identifying conserved miRNAs. Because only mature miRNAs are highly evolutionary conserved in plants for the majority of miRNAs, only mature miRNA sequences were used for the Blast search in this study. The mature sequences of all currently available plant miRNAs were subjected to a Blastn search against all of the currently available Asiatic cotton EST sequences. To identify all potential Asiatic cotton miRNAs, the following Blast parameters were used according to the previous studies: 1) the default word-match size was set at seven, the smallest setting that can be used for the online Blastn program between the miRNA query and the EST sequences; 2) the expected values were set at 1,000 to increase the hit chance for more potential sequences; and 3) the sequence number of the Blastn search and the sequence alignments were set to 1,000. More likely, for the majority of the Blastn search, only partial miRNA sequences were matched to the EST sequences. If this occured, the non-aligned regions of the miRNA query were manually inspected and aligned to the hit EST sequence; the number of mismatched nucleotides was recorded for determining whether the EST sequences contained a potential miRNA sequence according to the criteria described in the later section. All ESTs with no more than 3 mismatches were selected for further investigation. First, the selected ESTs were Blasted against each other to remove the repeated sequences. Then, the secondary structures of the remaining EST sequences were predicted using a web-based computational program, mfold (http://mfold.rna.albany.edu/?q=mfold/RNA-Folding-Form). All mfold outputs were recorded into an excel file, which included the number of each nucleotide (A, G, C and U), the number of arms per structure, location of the matching regions, and minimal folding free energy (MFE, ΔG kcal/mol). Then, the adjust minimal folding free energy (AMFE) and the minimal folding free energy index (MFEI) were calculated according to the previous report [32]. AMFE means the MFE of a RNA sequence with 100 nt in length, which is equal to MFE/(length of a potential pre-miRNA) * 100. MFEI is equal to MFE/(length of a potential pre-miRNA)/(The percentage of nucleotides G and C) [32]. An EST was considered a miRNA candidate when it fit all of the following criteria: 1) the predicted mature miRNAs had no more than three nucleotide substitutions compared with a known mature miRNAs; 2) the EST sequence could fold into an appropriate stem-loop hairpin secondary structure; 3) the mature miRNA was localized in one arm of the stem-loop structure; 4) there was no loop or break in the miRNA or miRNA* sequences; 5) there were no more than 6 mismatches between the predicted mature miRNA sequence and its opposite miRNA* sequence in the secondary structure; and 6) the predicted secondary structure had high negative MFE and high MFEI value.

Validation of Asiatic Cotton miRNAs

The predicted Asiatic cotton miRNAs were validated using stem-loop RT-PCR according to the previous report [33]. Total RNA was isolated from 10-day-old seedlings of Asiatic cotton following a previous report. After measuring the concentraion and quality of the RNAs, 200 ng of total RNA was used to reverse transcribe a miRNA into cDNA using a miRNA-specific stem-loop primer and the TaqMan® microRNA Reverse Transcription kit (Applied Biosystems) according to the manufacturer’s protocol. Quantitative real time PCR (qRT-PCR) was performed on an Applied Biosystems 7300 Sequence Detection System. For each reaction, 20 µL PCR reaction mixtures were prepared and each contained 2 µL of RT product from the reverse-transcription reaction (after tenfold dilution), 2 µL of miRNA-specific primers, 10 µL Universal Syber Green PCR Master Mix, and 6 µL nuclease-free water. The reactions were incubated in a 96-well plate at 95°C for 10 min, followed by 45 cycles of 95°C for 15 s and 60°C for 60 s. After the completion of the real-time reactions, the threshold was manually set and the threshold cycle (CT) was recorded. All reactions were conducted in triplicate.

Results

Identification of miRNAs in Asiatic Cotton

After carefully considering the Blastn results and the secondary structure of the sequences, we identified 73 potential miRNAs from the 41,781 currently available Asiatic cotton ESTs. These 73 miRNAs belong to 49 families (Table 1, Figure 1 and Table S1). A large number of miRNAs identified in Asiatic cotton suggests that miRNAs also widely exist in diploid cotton species and play important roles in Asiatic cotton growth and development based on the evidences in other plant species.
Table 1

Asiatic cotton miRNAs identified by homolog search and secondary structure.

FamilymiRNAMature SequenceMLLcPLACGUATGCA/UC/GMFEAMFEMFEI
156156a CGACAGAAGCAAGAGGCAC 19311926.8926.8928.5717.6544.5455.461.520.9435.329.660.535
156b GGACAGAGAGAGAGATGCACA 2139222.8321.7431.5223.9146.7453.260.950.6939.643.040.808
156c TTACAGAAGAGAAGATCAC 1958236.5915.8520.7326.8363.4136.591.360.7619.924.270.663
156d AGAGAGAGATAGAGAGACAG 2055232.6913.4623.0830.7763.4636.541.060.5816.631.920.874
158158 TCCCAAATCTGAGACAAAGA 2055726.3212.2829.8231.5857.8942.110.830.4116.629.120.692
167167 TGAAGCTGCATGCATGATGTC 2156524.6224.6220.0030.7755.3844.620.801.231421.540.483
172172 TGAACCTGATGCTGCTGCAG 2055219.2321.1538.4621.1540.3859.620.910.5522.944.040.739
399399 TGCCAAAGGTGCTGCTCTT 1936520.0023.0832.3124.6244.6255.380.810.7127.442.150.761
414414a TGATCATCATCATCATCGTCG 21538623.5818.6538.8618.9142.4957.511.250.48126.932.880.572
414b TCATCATCATCATCATCTTCA 21537325.2021.7215.8237.2762.4737.530.681.3786.623.220.619
482482 CCTTGCCTCTCCTCCCTTT 19518036.6716.1115.5631.6768.3331.671.161.0438.721.500.679
530530 AGGTGAAGAGGTAGCTGCAAC 21510426.9218.2731.7323.0850.0050.001.170.5834.332.980.660
773773 CTTGATTGCAGCTTTTGTTTC 2135320.7513.2130.1935.8556.6043.400.580.447.914.910.343
780780 TATACCAGCTTTTGAGCAGGT 2134822.9218.7527.0831.2554.1745.830.730.6918.538.540.841
821821 AATTAATAAAATAAAAGTTG 2035236.540.005.7757.6994.235.770.630.004.58.651.500
837837a CATTGTTTCTTCTTCTTTTCT 21311529.5720.8713.9135.6565.2234.780.831.5026.122.700.653
837b CTTTGTTACTTGTATTTTTCA 2154738.3010.6414.8936.1774.4725.531.060.716.914.680.575
838838a TTTTCTTCTCCTTCTTTACA 2038925.8422.4720.2231.4657.3042.700.821.1127.230.560.716
838b TTTTCTTTTCTCTTCCACA 1935536.3616.3614.5532.7369.0930.911.111.1313.524.550.794
840840 ACACTAAAGACAAACTAAC 19547028.5123.8314.4733.1961.7038.300.861.6510121.490.561
846846 TTGAATTGGTGCTTAGAATT 20317828.655.0612.9253.3782.0217.980.540.3932.918.481.028
851851 TGGATGGCAACAAAGACGAT 20512027.5026.6724.1721.6749.1750.831.271.1034.428.670.564
902902 TATGACGCAGCTTCTTAA 18520933.0121.0513.8832.0665.0734.931.031.5234.516.510.473
952952 ATCTGAGAAGCCATCTGGTG 20533725.8223.7425.8224.6350.4549.551.050.92113.733.740.681
10271027 TTTCTTTCTTCCCTTCTCAATC 22311819.4927.9726.2726.2745.7654.240.741.0648.340.930.755
10441044a TTGAGTGCTTATTTGTCTT 1956523.0816.9220.0040.0063.0836.920.580.8514.622.460.608
1044b TTGTAGTGAATATGTGATTT 2034630.4310.8723.9134.7865.2234.780.880.456.614.350.413
10921092 TGAAGGAAGAATTGGTGTT 19321735.9413.3619.8230.8866.8233.181.160.6746.821.570.650
11041104 CGCTGCTGTTTTTGTTTCCTTC 22515011.3318.0043.3327.3338.6761.330.410.4258.939.270.640
11681168 TGTGGACAGGCCAAGGTCGA 2055820.6913.7939.6625.8646.5553.450.800.3519.333.280.623
13131313 TACCATGAAATAATTGTTTG 2035733.3312.2815.7938.6071.9328.070.860.7810.618.600.663
15251525 TGGGTTATATTACTTTTTTAGT 2256728.367.4622.3941.7970.1529.850.680.3317.425.970.870
15301530 TTTACAAATAAATTAAATAT 20534535.656.9613.3344.0679.7120.290.810.5259.817.330.854
15331533a ATATATAAAAGTAATAATGC 20516532.1210.917.8849.0981.2118.790.651.382213.330.710
1533b ATAACAATAATAGTAATGA 1934636.9613.0413.0436.9673.9126.091.001.005.111.090.425
1533c ATATATAAAAATAAAATGA 1936446.883.136.2543.7590.639.381.070.5011.117.341.850
1533d CTAATAATAATAGTAATGA 1955230.775.7715.3848.0878.8521.150.640.383.26.150.291
1533e AGAATAAAAATAATAATGT 1936634.854.5524.2436.3671.2128.790.960.1912.518.940.658
15351535 CTTGGTTTTTGGTGATTCT 19313731.3912.4124.8231.3962.7737.231.000.5034.124.890.669
18461846 TTTCCGGCGCCCCAAGGAGG 2057630.2623.6825.0021.0551.3248.681.440.9519.926.180.538
18881888 TAAGTTTGATTTGTGAAGTA 20517425.8614.9424.1435.0660.9239.080.740.6247.427.240.697
20792079 GGTGTTGATGTGATGACGCA 2036426.5621.8821.8829.6956.2543.750.891.0018.528.910.661
20922092a GAACTGAAGTGGGTTTTTACT 21316830.958.3326.1934.5265.4834.520.900.3237.522.320.647
2092b CAATGAAGTAGGTGTGTACT 2034932.6510.2024.4932.6565.3134.691.000.4210.521.430.618
20932093 ACATTTTCCAATTAATGCTT 2035032.0014.0018.0036.0068.0032.000.890.7812.424.800.775
20952095 CTTCCATGTAGGATAATTAT 2058125.9322.2219.7532.1058.0241.980.811.131417.280.412
21082108 TTATATGTGTGTGTTTTGAG 2036826.4716.1825.0032.3558.8241.180.820.6517.826.180.636
22752275 AGTGTTGAGGAAAGAAACC 19337434.4913.6416.0435.8370.3229.680.960.857520.050.676
26072607 ATGTATTTTGTGATAGTGT 19310428.8517.3115.3838.4667.3132.690.751.1323.222.310.682
26302630 TGGTTTTGGTCTTTGATATTT 21312926.3610.8526.3636.4362.7937.210.720.4133.626.050.700
28682868a TTGGTTTTTGAAGGAGAAA 19315635.9017.3114.7432.0567.9532.051.121.1727.617.690.552
2868b TTGTTTTTGTGTGTAGAAG 19316330.0612.8821.4735.5865.6434.360.840.6041.525.460.741
29112911 GGGCGGGGGAGGGGCTGAGGA 21368419.1515.5045.1820.1839.3360.670.950.34232.233.950.560
29472947 TATACCGTGCCCATGACTGTAG 2238622.0915.1223.2639.5361.6338.370.560.6544.651.861.352
34473447 TTTGAGTTGTAAGTTATGAA 2035036.004.0028.0032.0068.0032.001.130.146.713.400.419
42484248 GTATTTTATTTTTTGGCAATCA 2256924.648.7027.5439.1363.7736.230.630.3211.516.670.460
44134413 AAGAGTATGTAAGCACTG 1855125.4917.6521.5735.2960.7839.220.720.8213.225.880.660
50205020 ATGGTGAAGAAGGTGAGA 18319128.2717.2828.2726.1854.4545.551.080.6163.333.140.728
50215021a AGAGAAGAAGAAGAGAAAA 19538033.4227.3717.3721.8455.2644.741.531.5810026.320.588
5021b TGATAAAAGAAGAAGAAAA 19519223.9621.8819.2734.9058.8541.150.691.1456.429.380.714
5021c AGAGAAGAAGACGAAGAAGA 20520733.3321.2620.7724.6457.9742.031.351.0259.528.740.684
5021d TGAGAAGAAAAAGAAGAAAG 20517636.9311.9323.3027.8464.7735.231.330.5148.427.500.781
5021e TGAGAGAAGAAAAGGAAAA 1936241.9411.2917.7429.0370.9729.031.440.6410.617.100.589
5021f TAAGAAAGAAAAAGAAGAAAA 21335449.727.639.3233.3383.0516.951.490.8251.614.580.860
5021g TGAAAAGAAGAAAAGAAAG 19316740.127.7820.3631.7471.8628.141.260.3834.320.540.730
5021h TTAGAAGAAGATGAAGACAA 20514624.6624.6622.6028.0852.7447.260.881.0945.331.030.657
5021i GGAGAATATGAAGAAGAAAA 2034942.864.0816.3336.7379.5920.411.170.2511.323.061.130
5021j TAAGAAGAAGAAGAAGAATA 2055534.5518.1814.5532.7367.2732.731.061.2525.746.731.428
50535053 CGCCGCTGTCGTCGCCGCCGT 21314921.4830.2020.8127.5248.9951.010.781.4548.132.280.633
51425142a ACATTGATTATAACTGAT 1857240.2811.1115.2833.3373.6126.391.210.736.69.170.347
5142b ATATGCTTGATATGTGAT 18312932.569.3020.1637.9870.5429.460.860.462821.710.737
5142c ATCTTGATTGAGAAATGAT 1957525.3313.3312.0049.3374.6725.330.511.111317.330.684
51855185 TCCTAGTTATTTTTCAAATG 2055330.195.6620.7543.4073.5826.420.700.279.317.550.664

ML: mature sequence length; Lc: the location of arm; PL: the length of pre-miRNAs.

Figure 1

Predicted stem-loop hairpin secondary structures of the selected Asiatic cotton miRNAs identified in this study.

Mature miRNA sequences are in red color. The length of the accurate miRNA precursors may be slightly longer than what is presented here.

ML: mature sequence length; Lc: the location of arm; PL: the length of pre-miRNAs.

Predicted stem-loop hairpin secondary structures of the selected Asiatic cotton miRNAs identified in this study.

Mature miRNA sequences are in red color. The length of the accurate miRNA precursors may be slightly longer than what is presented here. Although miRNA mature sequences are highly evolutionarily conserved from species to species, in this study, we found that a majority of identified mature Asiatic cotton miRNAs had 2 or 3 nucleotide changes as compared with miRNAs in other plant species. This may be caused by two reasons: 1) due to the limited EST sequences available in the database, we only identified partial miRNAs identified in other plant species. For many high evolutionally conserved miRNAs, such as miR 162 and miR 164, we did not find those in this study. As more EST sequences become available in the database, it is possible that we could identify more miRNAs that are identical with currently known miRNAs. 2) Although miRNAs are highly conserved, it does not mean the miRNAs are identical among different species and there may be mutantions in the evolution of miRNA biogenesis.

Characterization of miRNAs in Asiatic Cotton

The identified miRNAs were not evenly distributed in each miRNA family (Table 1). For a majority of the miRNA families identified, there was only one member identified in this study. However, for some miRNA families (miRNA 156, 414, 837, 838, 1044, 1533, 2902, 2868, 5021 and 5142), multiple members were identified. For example, there are four members identified in the miR156 family. Although this may be caused by the uneven EST sequences, it also suggests the potential miRNA patterns in Asiatic cotton. miR156 is a large family in other plant species, such as the model plant species Arabidopsis, which plays an important role in leaf development. A majority of identified miRNAs were obtained from the plus strand. However, there were several miRNAs identified from the minus strand of certain ESTs. The mature miRNA sequences could be located within either the 3' or 5' arm of the secondary stem-loop hairpin structures. Among the 73 identified miRNAs, 38 were located within the 3' arm and the other 35 miRNAs were located with the 5' arm. The length of mature miRNAs varied from 18 to 22 nt with an average of 19.9±1.0 nucleotides (Figure 2). A majority of mature miRNAs have 19–21 nt, which counts for 86.30% of the total identified miRNAs. The length range of the Asiatic cotton mature miRNAs is similar to those of other plant species.
Figure 2

The length distribution of mature miRNAs identified in Asiatic cotton

. The range of length is 18–22 nt with an average of 19.9±1.0 nt.

The length distribution of mature miRNAs identified in Asiatic cotton

. The range of length is 18–22 nt with an average of 19.9±1.0 nt. The length of identified miRNA precursor sequences was also varied from miRNA to miRNA. The range of pre-miRNA lengths is 46 to 684 nt with an average of 138±120 nt (Figure 3). However, a large percentage of pre-miRNAs (32.9%) had 50–70 nt. There were only 10 miRNAs (13.7%) with more than 210 nt in length.
Figure 3

The length distribution of pre-miRNAs identified in Asiatic cotton.

The range of length is 46–684 nt with an average of 138±120 nt.

The length distribution of pre-miRNAs identified in Asiatic cotton.

The range of length is 46–684 nt with an average of 138±120 nt. The composition of the four nucleotides (A, G, C, and U) is an important parameter, which is an indicator for species evolution as well as for the stabilization of one specific RNA sequence cased by their secondary structure. The percentage composition of each nucleotide was not evenly distributed in the identified Asiatic cotton pre-miRNAs (Table 2). With an unknown reason, the nucleotide uracil (U) is dominant in both mature miRNAs and pre-miRNAs in both plant and animal miRNAs. In this study, we observed that the U content varied from 17.7% to 57.7% with an average of 33.1±7.9% in the identified Asiatic cotton pre-miRNAs (Figure 4); which is significantly higher than the content of other nucleotides, particularly much higher than nucleotides C (15.4±6.8%) and G (21.7±8.0%). A majority (83.7%) of pre-miRNAs contained more than 25% of the nucleotide U (Figure 4).
Table 2

Statistics of the characterized parameters of Asiatic cotton miRNA precursors.

ParameterMeanStandard DeviationMinimalMaximal
MFE (▵G, -kcal/mol)35.1935.123.2232.2
AMFE( -kcal/mol)24.929.216.1551.86
MFEI0.700.250.291.85
Length (nt)13812046684
(G+C)%37.0511.785.7761.33
(U+A)%62.9511.7838.6794.23
A%29.866.9111.3349.72
C%15.366.810.0030.20
G%21.697.995.7745.18
U%33.097.8517.6557.69
A/U ratio0.940.260.411.53
C/G ratio0.760.380.001.65
Figure 4

The distribution of U content.

The range is 17.65–57.69% with an average of 33.09±7.85%.

The distribution of U content.

The range is 17.65–57.69% with an average of 33.09±7.85%. Nucleotides G and C contributes to the formation and stabilization of the secondary structure of stem-loop hairpins because G and C will form three hydrogen bonds with each other but A and U only form two hydrogen bonds with each other. Generally speaking, the more GC content a sequence contains, the more stable the secondary structure of a specific RNA will be. In the identified Asiatic cotton pre-miRNAs, the GC content (37.1±11.8%) was much lower than the AU content (63.0±11.8%) (Table 2, Figures 5 and 6). More than half of the total number of nucleotides were A or U. In this study, we also calculated the A/U and C/G ratio; the ratios of A/U and C/G were 0.94 and 0.76, respectively. This suggests that more of the nucleotide G existed in the pre-miRNA sequences than C however the number of A and U nucleotiudes was almost equal.
Figure 5

The distribution of GC content.

The range is 5.77–61.33% with an average of 37.05±11.78%.

Figure 6

The distribution of AU content.

The range is 38.67–94.23% with an average of 62.95±11.78%.

The distribution of GC content.

The range is 5.77–61.33% with an average of 37.05±11.78%.

The distribution of AU content.

The range is 38.67–94.23% with an average of 62.95±11.78%. One criterion for measuring the stability of a RNA or DNA secondary structure is the minimal folding free energy (MFE). Usually, the lower the MFE, the more stable the RNA or DNA molecule. The MFE of the 73 identified Asiatic cotton pre-miRNAs varied from -3.2 to -232.2 kcal/mol with an average of -35.19±35.12 kcal/mol (Figure 7). For a majority of Asiatic cotton pre-miRNAs, their MFE are ranged from -10 to -30 kcal/mol. One reason causing the large variance in MFE is the significance in their length. It is also an issue for determining the stability of a RNA or DNA using MFE because different RNA or DNA strands contains a different number of nucleotides. To better measure the stability of RNA or DNA strands, the adjusted minimal folding free energy (AMFE) strategy was developed, which is the MFE of a RNA/DNA sequence that is 100 nt in length. The AMFE of the 73 identified Asiatic cotton pre-miRNAs ranged from - 6.15 to -51.86 kcal/mol with an average of - 24.92 ±9.21 kcal/mol, which is a smaller range as compared with the MFE range. About half of the pre-miRNAs have AMFE values with –20 to –30 kcal/mol (Figure 8).
Figure 7

The distribution of MFE.

The range of MFE is -3.2 - -232.2 kcal/mol with an average of –35.19±35.12 kcal/mol.

Figure 8

The distribution of AMFE.

The range of AMFE is -6.15 - -51.86 kcal/mol with an average of - 24.92±9.21 kcal/mol.

The distribution of MFE.

The range of MFE is -3.2 - -232.2 kcal/mol with an average of –35.19±35.12 kcal/mol.

The distribution of AMFE.

The range of AMFE is -6.15 - -51.86 kcal/mol with an average of - 24.92±9.21 kcal/mol. The minimal folding free energy index (MFEI) is a new criterion for assaying miRNAs and distinguishing miRNAs from other coding and non-coding RNAs. The MFEI of the identified Asiatic cotton pre-miRNAs ranged from 0.29 to 1.85 with an average of 0.70±0.25 (Figure 9).
Figure 9

The distribution of MFEI.

The range of MFEI is 0.29–1.85 with an average of 0.70±0.25.

The distribution of MFEI.

The range of MFEI is 0.29–1.85 with an average of 0.70±0.25.

Validation of miRNAs in Asiatic Cotton

Quantitative real-time PCR (qRT-PCR) is a reliable method to determine the expression of specific miRNAs [33]. In this study, we employed qRT-PCR to detect the expression of three predicted miRNAs (miR 156, miR 172 and miR 399). The results show that all of the three predicted miRNAs exist and are highly expressed in Asiatic cotton seedlings.

Discussion

Although miRNA-related research is one of the hottest research topics in biological or biomedicinal fields [34], [35], miRNA research on cotton is much behind other plant species. Up to now, only several research projects have reported on miRNA identification and expression analysis in cotton [36], [37], [38], [39], [40]. These research projects employed computational approaches as well deep sequencing technology to identify miRNAs from cotton and to investigate their expression profile using qRT-PCR [37] and deep sequencing [40]. However, all of these studies are focused on upland cotton. Based on our best knowledge, no research has been performed on diploid cotton. In this study, our focuswas on an important diploid cotton species, Asiatic cotton, and we were able to successfully identify 73 miRNAs, belonging to 49 families, using an in silico comparative genome-based approach. Many of these identified miRNAs play an important role in model plant species. For example, miR 172 controls floral development and phase switch to reproductive growth. miR 399 is a responsive miRNA to environmental stress. We believe these miRNAs also play important roles in Asiatic cotton growth, development, and response to environmental abiotic and biotic stresses. Newly identified Asiatic cotton miRNAs have similar characteristics to the miRNAs in other plant species [32]. These major parameters include the length of mature miRNAs and miRNA precursors, the nucleotide composition, and the minimal folding free energy (Table 2). This suggests that Asiatic cotton miRNAs have a common ancestor with other plant species and that they share the same regulatory mechanisms. microRNA precursor sequences (XLS) Click here for additional data file.
  33 in total

1.  Identification and characterization of new plant microRNAs using EST analysis.

Authors:  Bao Hong Zhang; Xiao Ping Pan; Qing Lian Wang; George P Cobb; Todd A Anderson
Journal:  Cell Res       Date:  2005-05       Impact factor: 25.617

Review 2.  Plant microRNA: a small regulatory molecule with big impact.

Authors:  Baohong Zhang; Xiaoping Pan; George P Cobb; Todd A Anderson
Journal:  Dev Biol       Date:  2005-12-01       Impact factor: 3.582

3.  Identification of 188 conserved maize microRNAs and their targets.

Authors:  Baohong Zhang; Xiaoping Pan; Todd A Anderson
Journal:  FEBS Lett       Date:  2006-06-09       Impact factor: 4.124

4.  Conservation and divergence of plant microRNA genes.

Authors:  Baohong Zhang; Xiaoping Pan; Charles H Cannon; George P Cobb; Todd A Anderson
Journal:  Plant J       Date:  2006-04       Impact factor: 6.417

Review 5.  Computational identification of microRNAs and their targets.

Authors:  Baohong Zhang; Xiaoping Pan; Qinglian Wang; George P Cobb; Todd A Anderson
Journal:  Comput Biol Chem       Date:  2006-11-22       Impact factor: 2.877

6.  Evidence that miRNAs are different from other RNAs.

Authors:  B H Zhang; X P Pan; S B Cox; G P Cobb; T A Anderson
Journal:  Cell Mol Life Sci       Date:  2006-01       Impact factor: 9.261

Review 7.  MicroRNAs and their regulatory roles in animals and plants.

Authors:  Baohong Zhang; Qinglian Wang; Xiaoping Pan
Journal:  J Cell Physiol       Date:  2007-02       Impact factor: 6.384

8.  Floral patterning defects induced by Arabidopsis APETALA2 and microRNA172 expression in Nicotiana benthamiana.

Authors:  Sizolwenkosi Mlotshwa; Zhiyong Yang; Yunju Kim; Xuemei Chen
Journal:  Plant Mol Biol       Date:  2006-07       Impact factor: 4.076

9.  Computational identification of novel microRNAs and targets in Brassica napus.

Authors:  Fu Liang Xie; Si Qi Huang; Kai Guo; An Ling Xiang; Yi Yong Zhu; Li Nie; Zhi Min Yang
Journal:  FEBS Lett       Date:  2007-03-08       Impact factor: 4.124

10.  Clustering and conservation patterns of human microRNAs.

Authors:  Yael Altuvia; Pablo Landgraf; Gila Lithwick; Naama Elefant; Sébastien Pfeffer; Alexei Aravin; Michael J Brownstein; Thomas Tuschl; Hanah Margalit
Journal:  Nucleic Acids Res       Date:  2005-05-12       Impact factor: 16.971

View more
  17 in total

Review 1.  MicroRNAs in cotton: an open world needs more exploration.

Authors:  Qinglian Wang; Baohong Zhang
Journal:  Planta       Date:  2015-04-05       Impact factor: 4.116

2.  Identification and characterization of microRNAs and their targets in high-altitude stress-adaptive plant maca (Lepidium meyenii Walp).

Authors:  Sujay Paul
Journal:  3 Biotech       Date:  2017-05-30       Impact factor: 2.406

3.  Computational identification of miRNA and targets from expressed sequence tags of coffee (Coffea arabica).

Authors:  Arzuba Akter; Md Muzahidul Islam; Shakhinur Islam Mondal; Zabed Mahmud; Nurnabi Azad Jewel; Sabiha Ferdous; Md Ruhul Amin; Md Mahfuzur Rahman
Journal:  Saudi J Biol Sci       Date:  2013-05-03       Impact factor: 4.219

4.  Identification of miRNA and their target genes in Cestrum nocturnum L. and Cestrum diurnum L. in stress responses.

Authors:  Nasreen Bano; Shafquat Fakhrah; Sagar Prasad Nayak; Sumit Kumar Bag; Chandra Sekhar Mohanty
Journal:  Physiol Mol Biol Plants       Date:  2022-02-04

5.  Global expression dynamics and miRNA evolution profile govern floral/fiber architecture in the modern cotton (Gossypium).

Authors:  Sakshi Arora; Bhupendra Chaudhary
Journal:  Planta       Date:  2021-08-30       Impact factor: 4.116

6.  Wheat miRNA ancestors: evident by transcriptome analysis of A, B, and D genome donors.

Authors:  Burcu Alptekin; Hikmet Budak
Journal:  Funct Integr Genomics       Date:  2016-03-31       Impact factor: 3.410

7.  Identification and profiling of novel microRNAs in the Brassica rapa genome based on small RNA deep sequencing.

Authors:  Bumjin Kim; Hee-Ju Yu; Sin-Gi Park; Ja Young Shin; Mijin Oh; Namshin Kim; Jeong-Hwan Mun
Journal:  BMC Plant Biol       Date:  2012-11-19       Impact factor: 4.215

8.  Small RNA and degradome sequencing reveal complex miRNA regulation during cotton somatic embryogenesis.

Authors:  Xiyan Yang; Lichen Wang; Daojun Yuan; Keith Lindsey; Xianlong Zhang
Journal:  J Exp Bot       Date:  2013-02-04       Impact factor: 6.992

9.  A contig-based strategy for the genome-wide discovery of microRNAs without complete genome resources.

Authors:  Jun-Zhi Wen; Jian-You Liao; Ling-Ling Zheng; Hui Xu; Jian-Hua Yang; Dao-Gang Guan; Si-Min Zhang; Hui Zhou; Liang-Hu Qu
Journal:  PLoS One       Date:  2014-02-07       Impact factor: 3.240

10.  Virus versus host plant microRNAs: who determines the outcome of the interaction?

Authors:  Fatemeh Maghuly; Rose C Ramkat; Margit Laimer
Journal:  PLoS One       Date:  2014-06-04       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.