| Literature DB >> 30846812 |
Sarika Jaiswal1, M A Iquebal1, Vasu Arora1, Sonia Sheoran2, Pradeep Sharma2, U B Angadi1, Vikas Dahiya1, Rajender Singh2, Ratan Tiwari2, G P Singh2, Anil Rai1, Dinesh Kumar3.
Abstract
MicroRNA are 20-24 nt, non-coding, single stranded molecule regulating traits and stress response. Tissue and time specific expression limits its detection, thus is major challenge in their discovery. Wheat has limited 119 miRNAs in MiRBase due to limitation of conservation based methodology where old and new miRNA genes gets excluded. This is due to origin of hexaploid wheat by three successive hybridization, older AA, BB and younger DD subgenome. Species specific miRNA prediction (SMIRP concept) based on 152 thermodynamic features of training dataset using support vector machine learning approach has improved prediction accuracy to 97.7%. This has been implemented in TamiRPred ( http://webtom.cabgrid.res.in/tamirpred ). We also report highest number of putative miRNA genes (4464) of wheat from whole genome sequence populated in database developed in PHP and MySQL. TamiRPred has predicted 2092 (>45.10%) additional miRNA which was not predicted by miRLocator. Predicted miRNAs have been validated by miRBase, small RNA libraries, secondary structure, degradome dataset, star miRNA and binding sites in wheat coding region. This tool can accelerate miRNA polymorphism discovery to be used in wheat trait improvement. Since it predicts chromosome-wise miRNA genes with their respective physical location thus can be transferred using linked SSR markers. This prediction approach can be used as model even in other polyploid crops.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30846812 PMCID: PMC6405928 DOI: 10.1038/s41598-019-40333-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Performance of the miRNA prediction models using ANN, RF and SVM methodology.
| Methods | Se | Sp | PPV | NPV | Accuracy | Precision | Recall | F-measure | MCC |
|---|---|---|---|---|---|---|---|---|---|
| ANN | 0.809 ± 0.008 | 0.787 ± 0.010 | 0.799 ± 0.014 | 0.795 ± 0.018 | 0.797 ± 0.006 | 0.799 ± 0.014 | 0.809 ± 0.008 | 0.803 ± 0.005 | 0.830 ± 0.012 |
| RF | 0.874 ± 0.013 | 0.880 ± 0.014 | 0.899 ± 0.010 | 0.850 ± 0.019 | 0.877 ± 0.012 | 0.899 ± 0.010 | 0.874 ± 0.013 | 0.886 ± 0.010 | 0.751 ± 0.024 |
| SVM-Lin | 0.892 ± 0.007 | 0.858 ± 0.20 | 0.870 ± 0.025 | 0.877 ± 0.15 | 0.876 ± 0.009 | 0.870 ± 0.025 | 0.892 ± 0.007 | 0.880 ± 0.013 | 0.749 ± 0.019 |
| SVM-Poly | 0.917 ± 0.007 | 0.917 ± 0.008 | 0.913 ± 0.009 | 0.921 ± 0.004 | 0.917 ± 0.005 | 0.913 ± 0.009 | 0.917 ± 0.007 | 0.915 ± 0.007 | 0.834 ± 0.010 |
|
|
| ||||||||
| SVM-Sig | 0.904 ± 0.021 | 0.746 ± 0.013 | 0.803 ± 0.013 | 0.878 ± 0.023 | 0.831 ± 0.013 | 0.803 ± 0.013 | 0.904 ± 0.021 | 0.850 ± 0.012 | 0.665 ± 0.026 |
Figure 1Evaluation measures (FRP:False Positive Rate; FNR:False Negative Rate; FDR:False Discovery Rate; Inf:Informedness; Mar: Markedness) of miRNA prediction models using ANN, RF and SVM methodology.
Area Under the Curve of the various methodologies.
| Models | Area | Standard Error | Asymptotic 95% Confidence Interval | |
|---|---|---|---|---|
| Lower Bound | Upper Bound | |||
| ANN | 0.788 | 0.029 | 0.731 | 0.845 |
| RF | 0.864 | 0.024 | 0.816 | 0.912 |
| SVM-Linear | 0.864 | 0.024 | 0.816 | 0.912 |
| SVM-Polynomial3 | 0.920 | 0.019 | 0.883 | 0.958 |
| SVM-RBF | 0.973 | 0.011 | 0.951 | 0.996 |
| SVM-Sigmoid | 0.833 | 0.027 | 0.781 | 0.885 |
Figure 2ROC curves for models.
Comparative analysis of chromosome-wise 5′ mature miRNAs prediction by TamiRPred and miRLocator.
| Chromosome Number | No. of mature 5′ miRNAs (Length ≥ 17) predicted by TamiRPred | No. of mature 5′ miRNAs (Length ≥ 17) predicted by miRLocator |
|---|---|---|
| 1A | 384 | 227 |
| 1B | 325 | 175 |
| 1D | 75 | 43 |
| 2A | 416 | 259 |
| 2B | 402 | 221 |
| 2D | 77 | 35 |
| 3A | 147 | 97 |
| 3B | 614 | 347 |
| 3D | 45 | 21 |
| 4A | 280 | 173 |
| 4B | 86 | 39 |
| 4D | 19 | 8 |
| 5A | 141 | 78 |
| 5B | 340 | 211 |
| 5D | 103 | 54 |
| 6A | 204 | 111 |
| 6B | 167 | 96 |
| 6D | 126 | 71 |
| 7A | 256 | 141 |
| 7B | 52 | 23 |
| 7D | 205 | 117 |
| Total | 4464 | 2547 |
Figure 3Comparative analysis of chromosome-wise 5′ mature miRNAs prediction by TamiRPred and miRLocator.
Validation of predicted miRNAs in wheat small RNA library.
| Sr. No. | BioProject IDs | Number of detected miRNAs in wheat small RNA library |
|---|---|---|
| 1. | PRJNA218544 | 1104 |
| 2. | PRJNA232120 | 1138 |
| 3. | PRJNA244006 | 1021 |
| 4. | PRJNA297977 | 1070 |
| 5. | PRJNA326902 | 1024 |
| Total | 5357 (Unique: 1906) |
Figure 4Structure of precursor miRNAs with bulge, stem loop, hairpin and star sequence.
Figure 5Various search options of TamiRPred.
Figure 6Workflow of wheat miRNA and its target prediction.