| Literature DB >> 25025130 |
Jingshan Huang1, Jiangbo Dang2, Glen M Borchert3, Karen Eilbeck4, He Zhang1, Min Xiong1, Weijian Jiang1, Hao Wu1, Judith A Blake5, Darren A Natale6, Ming Tan7.
Abstract
As a special class of short non-coding RNAs, microRNAs (a.k.a. miRNAs or miRs) have been reported to perform important roles in various biological processes by regulating respective target genes. However, significant barriers exist during biologists' conventional miR knowledge discovery. Emerging semantic technologies, which are based upon domain ontologies, can render critical assistance to this problem. Our previous research has investigated the construction of a miR ontology, named Ontology for MIcroRNA Target Prediction (OMIT), the very first of its kind that formally encodes miR domain knowledge. Although it is unavoidable to have a manual component contributed by domain experts when building ontologies, many challenges have been identified for a completely manual development process. The most significant issue is that a manual development process is very labor-intensive and thus extremely expensive. Therefore, we propose in this paper an innovative ontology development methodology. Our contributions can be summarized as: (i) We have continued the development and critical improvement of OMIT, solidly based on our previous research outcomes. (ii) We have explored effective and efficient algorithms with which the ontology development can be seamlessly combined with machine intelligence and be accomplished in a semi-automated manner, thus significantly reducing large amounts of human efforts. A set of experiments have been conducted to thoroughly evaluate our proposed methodology.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25025130 PMCID: PMC4099014 DOI: 10.1371/journal.pone.0100855
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A List of Current miR Target Prediction Tools.
| Prediction Tool Name | Prediction Strategy | Access | Official Website |
| deepBase | A database for annotating and discovering small and long ncRNAs (microRNAs, siRNAs, piRNAs…) from high-throughput deep sequencing data. | Both |
|
| DIANA-microT-CDS | Thermodynamic modeling. | Both |
|
| DIANA-mirGen 2.0 | A database of microRNA genomic information and regulation. | Both |
|
| GenMiR++ | Paired expression profiles of microRNAs and mRNAs; as well as Baynesian inference. | Both |
|
| mimiRNA | Expression correlation. | Both |
|
| mirBridge | Complementary and target site conservation. | Download |
|
| miRanda | Complementary and target site conservation. | Both |
|
| miRBase | A searchable database of published miRNA sequences and annotation. | Both |
|
| miRDB | Microarray corrleation training; as well as Support Vector Machine. | Both |
|
| miRecords | Validated targets and algorithm integration. | Both |
|
| miRGator | Expression correlation and algorithm integration. | Online Search |
|
| miRGen | Positional relationships target prediction integration. | Both |
|
| miRNA-Target Gene Prediction at EMBL | Complementary and target site conservation. | Online Search |
|
| miRNAMap | Genomic maps of microRNA genes and their target genes in mammalian genomes. | Both |
|
| MicroInspector | Algorithm integration. | Online Search |
|
| MiTarget | Positional relationships thermodynamic modeling; as well as Support Vector Machine. | Online Search |
|
| PicTar | Target site conservation and thermodynamic modeling. | Both |
|
| PITA | Incorporating the role of target site accessibility, as determined by base-pairing interactions within the mRNA, in microRNA target recognition. | Both |
|
| PMRD | PMRD: Plant microRNA database. | Both |
|
| RepTar | Searching for repeating 3′ UTR target sites. | both |
|
| RNA22 | Identifying patterns in cDNAs and matching to miRs. | Online search |
|
| RNAhybrid | Thermodynamics & statistical model. | Both |
|
| starBase | Argonaute CLIP-Seq and degradome sequencing data. | Both |
|
| TarBase | Experimentally validated targets. | Both |
|
| TargetScan | Seed complementary and target site conservation. | Both |
|
| ViTa | Complementary of host microRNAs to viruses. | Both |
|
Figure 1Three steps in the proposed semi-automated ontology development: (i) develop a backbone ontology; (ii) align the backbone ontology with other ontologies/schemas; and (iii) augment the backbone ontology.
Figure 2The development of a backbone ontology.
Sample Concepts in the Backbone Ontology.
| Concept Name | Created by ourselves? | Imported from | Properties Extended? | Relationships Extended? |
|
| No | SO | Yes | Yes |
|
| No | SO | No | No |
|
| No | GO | No | Yes |
|
| No | GO | No | Yes |
|
| No | PRO | Yes | Yes |
|
| No | PRO | No | Yes |
|
| No | HDO | Yes | Yes |
|
| No | FMA | Yes | Yes |
|
| No | BFO | No | Yes |
|
| No | BFO | No | Yes |
|
| No | BFO | No | Yes |
|
| No | BFO | No | Yes |
|
| No | BFO | No | Yes |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
|
| Yes | N/A | N/A | N/A |
Sample Relationships in the Backbone Ontology.
| Relationship Name | Simple Definition or Usage | miR Specific? |
|
| imported from OBO Relation Ontology (RO) | No |
|
| imported from OBO Relation Ontology (RO) | No |
|
| imported from OBO Relation Ontology (RO) | No |
|
| miRs affect numerous tumors, including cancers | Yes |
|
| each miR has some mRNA binding sites | Yes |
|
| each miR has one or more computationally predicted target genes | Yes |
|
| each miR has one or more target genes | Yes |
|
| each miR has one or more biological validations for each of its target genes | Yes |
|
| miRs are involved in some pathological events | Yes |
|
| miRs can down-regulate or up-regulate some pathological events | Yes |
Characteristics of Test Ontologies/Schemas.
| Features | SBO | GRO | TarBase |
| Number of Instances | 0 | 4 | 0 |
| Number of Concepts | 604 | 507 | 76 |
| Number of Properties | 0 | 9 | 19 |
| Number of Relationships | 1 | 24 | 23 |
| (excluding |
Figure 3Weight convergence experimental results when aligning TarBase with the backbone ontology, where was set to 0.1 in (a) and 0.3 in (b), respectively.
Pairwise Alignment Results among Four Ontologies/Schemas.
| GRO + SBO | GRO + TarBase | SBO + TarBase | GRO + Backbone | SBO + Backbone | TarBase + Backbone | |
| Initial weights | 0.25 0.25 0.25 0.25 | 0.25 0.25 0.25 0.25 | 0.25 0.25 0.25 0.25 | 0.25 0.25 0.25 0.25 | 0.25 0.25 0.25 0.25 | 0.25 0.25 0.25 0.25 |
| Training examples | 5 | 2 | 2 | 5 | 4 | 3 |
| Learned weights | 0.65 0.00 0.35 0.00 | 0.67 0.05 0.28 0.00 | 0.58 0.00 0.42 0.00 | 0.51 0.03 0.46 0.00 | 0.61 0.00 0.39 0.00 | 0.38 0.15 0.13 0.34 |
| Output equivalent | 51 | 6 | 7 | 39 | 27 | 56 |
| concept pairs ( | ||||||
| Correct equivalent | 41 | 5 | 5 | 33 | 21 | 49 |
| concept pairs ( | ||||||
| Missed equivalent | 11 | 2 | 3 | 8 | 5 | 9 |
| concept pairs ( | ||||||
|
| 80.39% | 83.33% | 71.43% | 84.62% | 77.78% | 87.50% |
|
| 78.85% | 71.43% | 62.50% | 80.49% | 80.77% | 84.48% |
|
| 79.61% | 76.92% | 66.67% | 82.50% | 79.25% | 85.96% |
|
| 59.62% | 57.14% | 37.50% | 65.85% | 57.69% | 72.41% |
Note that all concept pairs, except for those in Row 3 (“Training examples”) in the above table, have been used as actual test data.
Figure 4A screenshot from Protégé, demonstrating the concept miRNA and its parents, ancestors, descendants, and siblings in is_a hierarchy.
Figure 5A screenshot from OBO-Edit, demonstrating more details of parents, ancestors, and direct descendants of the concept miRNA.
All relationships exhibited in this figure are is_a relationships.
Figure 6Another OBO-Edit screenshot, demonstrating a subset of relationships designed for the concept miRNA.
Many of these relationships are miR domain-dependent ones.
Figure 7A two-layer, ANN designed for the learning problem.
Figure 8Pseudocode 1 — ANN Weight Learning.
Figure 9Pseudocode 2 — Agglomerative Clustering.