| Literature DB >> 19208135 |
Abstract
BACKGROUND: MicroRNAs (miRNAs) are small and noncoding RNAs that play important roles in various biological processes. They regulate target mRNAs post-transcriptionally through complementary base pairing. Since the changes of miRNAs affect the expression of target genes, the expression levels of target genes in specific biological processes could be different from those of non-target genes. Here we demonstrate that gene expression profiles contain useful information in separating miRNA targets from non-targets.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19208135 PMCID: PMC2648752 DOI: 10.1186/1471-2105-10-S1-S34
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The procedure of computational prediction of miRNA targets. After the training dataset of gene expression files is trained by SVM, the test set is predicted by the decision making of SVM classifiers and the scoring method based on the sequence alignment.
Figure 2The miRNA target prediction with SVM using the gene expression dataset. Three datasets with different qualities, which were the validated target dataset, the putative target dataset, and the random dataset, were compared in terms of the prediction accuracy.
Figure 3The performance of target prediction with SVM using different gene expression sets. The performance of target prediction with SVM using developmental- or stress-related gene expression profiles, or combined expression profiles from the two datasets.
Feature selection in the gene expression dataset.
| Rank | Sample ID | Type | Tissue |
| 1 | ATGE26 | D | Leaf |
| 2 | Heat-Shoots-1.0 h | S | Shoot |
| 3 | UV-B-Roots-1.0 h | S | Root |
| 4 | ATGE73 | D | Pollen |
| 5 | ATGE91 | D | Leaf |
| 6 | ATGE34 | D | Flower |
| 7 | Drought-Roots-0.25 h | S | Root |
| 8 | Drought-Shoots-0.25 h | S | Shoot |
| 9 | UV-B-Shoots-3.0 h | S | Shoot |
| 10 | Cold-Roots-24 h | S | Root |
| 11 | ATGE97 | D | Seedling |
| 12 | Drought-Roots-24.0 h | S | Root |
| 13 | UV-B-shoots-0.5 h | S | Shoot |
| 14 | ATGE55 | D | Flower |
| 15 | ATGE101 | D | Seedling |
| 16 | Drought-Shoots-3.0 h | S | Shoot |
| 17 | Wounding-Shoots-6.0 h | S | Shoot |
| 18 | Osmotic-Shoots-1.0 h | S | Shoot |
| 19 | Oxidative-Roots-6.0 h | S | Root |
| 20 | UV-B-Roots-6.0 h | S | Root |
The top 20 ranked features for miRNA target classification. Each feature corresponds to a condition in the two expression datasets (D: developmental process and S: stress treatment).
Comparison of predictions using different methods.
| SVM+SC3 | SC1 | SC3 | |
| TP (True Positive) rate | 0.36 | 0.20 | 0.83 |
| FP (False Positive) rate | 0.00 | 0.00 | 0.03 |
| TN (True Negative) rate | 1.00 | 1.00 | 0.97 |
| FN (False Negative) rate | 0.64 | 0.80 | 0.17 |
| Sensitivity (TP/(TP+FN)) | 0.36 | 0.20 | 0.83 |
| Specificity (TN/(TN+FP)) | 1.00 | 1.00 | 0.97 |
| Precision (TP/(TP+FP)) | 1.00 | 1.00 | 0.97 |
SVM+SC3, the method combining the SVM classifier and the scoring method based on the sequence matches. SCτ indicates the score cutoff, τ. The results were obtained with 100 test sets.
Top 20 target genes associated with the developmental series.
| Locus ID | miRNA | Rate | Description |
| At1g69440* | miR854 | 0.97 | Encodes ARGONAUTE7 |
| At1g62930 | miR400 | 0.83 | Similar to pentatricopeptide (PPR) repeat-containing protein |
| At5g47250 | miR472 | 0.82 | Disease resistance protein |
| At3g15270* | miR156 | 0.78 | Squamosa promoter-binding protein-like 5 |
| At5g59000 | miR414 | 0.77 | Zinc finger family protein |
| At4g31610* | miR414 | 0.77 | REM1 (Reproductive Meristem 1) transcription factor |
| At5g58980 | miR396 | 0.77 | Ceramidase family protein |
| At5g43730 | miR472 | 0.76 | Disease resistance protein |
| At4g15430 | miR855 | 0.72 | Similar to early-responsive to dehydration protein-related |
| At5g08430 | miR414 | 0.70 | SWIB complex BAF60b domain-containing protein/plus-3 domain-containing protein |
| At2g28510 | miR829 | 0.69 | Dof-type zinc finger domain-containing protein |
| At5g48560 | miR778 | 0.69 | Basic helix-loop-helix (bHLH) family protein |
| At1g27360 | miR156 | 0.68 | Squamosa promoter-binding protein-like 11 |
| At3g53310 | miR414 | 0.65 | Transcriptional factor B3 family protein |
| At2g42200* | miR156 | 0.64 | Squamosa promoter-binding protein-like 9 |
| At1g63130 | miR400 | 0.62 | Transacting siRNA generating locus |
| At3g20910 | miR169 | 0.62 | CCAAT-binding transcription factor |
| At2g34960 | miR157 | 0.61 | Encodes a member of the cationic amino acid transporter |
| At1g62670 | miR161 | 0.61 | Pentatricopeptide (PPR) repeat-containing protein |
| At3g57670* | miR854 | 0.57 | Similar to zinc finger |
The targets were predicted with the expression dataset of the developmental series. The rate indicates the fraction of runs in which the gene was predicted as a positive in 200 runs. * indicates the gene reported to be involved in the developmental process.
Top 20 target genes associated with stress responses.
| Locus ID | miRNA | Rate | Description |
| At5g43760 | miR854 | 0.88 | A member of the 3-ketoacyl-CoA synthase family involved in the biosynthesis of VLCFA |
| At5g47250 | miR472 | 0.79 | Disease resistance protein |
| At3g20710 | miR859 | 0.79 | F-box/Kelch-repeat protein |
| At2g36890* | miR847 | 0.60 | Myb-like transcription factor MYB38 |
| At4g28310 | miR837-5p | 0.60 | Unknown protein |
| At5g41410 | miR414 | 0.55 | Homeodomain protein required for ovule identity |
| At2g25980 | miR846 | 0.53 | Jacalin lectin family protein |
| At5g57590 | miR396 | 0.52 | Mutant complemented by E coli Bio A gene encoding 7,8-diaminopelargonic acid aminotransferase |
| At1g49750 | miR854 | 0.47 | Leucine-rich repeat family protein |
| At5g39710 | miR400 | 0.47 | Similar to pentatricopeptide (PPR) repeat-containing protein |
| At3g13690 | miR419 | 0.47 | Protein kinase family protein |
| At3g18980 | miR859 | 0.45 | F-box family protein |
| At5g43730 | miR472 | 0.45 | Disease resistance protein |
| At2g32760 | miR414 | 0.43 | Unknown protein |
| At1g74840* | miR863-5p | 0.43 | Myb family transcription factor |
| At1g80340 | miR835-5p | 0.42 | Encodes a protein with gibberellin 3 β-hydroxylase activity |
| At1g26210 | miR414 | 0.41 | unknown protein |
| At2g17830 | miR859 | 0.41 | F-box family protein |
| At4g14680 | miR395 | 0.40 | ATP sulfurylase |
| At5g61480 | miR870 | 0.38 | Leucine-rich repeat transmembrane protein kinase |
The targets were predicted with the expression dataset of stress treatments. The rate indicates the fraction of runs in which the gene was predicted as a positive in 200 runs. * indicates the gene reported to be involved in the stress responses.