| Literature DB >> 26560461 |
Lila E Mullany1, Roger K Wolff1, Martha L Slattery1.
Abstract
MiRNAs are small, nonprotein-coding RNA molecules involved in gene regulation. While bioinformatics help guide miRNA research, it is less clear how they perform when studying biological pathways. We used 13 criteria to evaluate effectiveness and usability of existing bioinformatics tools. We evaluated the performance of six bioinformatics tools with a cluster of 12 differentially expressed miRNAs in colorectal tumors and three additional sets of 12 miRNAs that are not part of a known cluster. MiRPath performed the best of all the tools in linking miRNAs, with 92% of all miRNAs linked as well as the highest based on our established criteria followed by Ingenuity (58% linked). Other tools, including Empirical Gene Ontology, miRó, miRMaid, and PhenomiR, were limited by their lack of available tutorials, lack of flexibility and interpretability, and/or difficulty using the tool. In summary, we observed a lack of standardization across bioinformatic tools and a general lack of specificity in terms of pathways identified between groups of miRNAs. Hopefully, this evaluation will help guide the development of new tools.Entities:
Keywords: bioinformatics; miRNA; miRPath; pathways
Year: 2015 PMID: 26560461 PMCID: PMC4629629 DOI: 10.4137/CIN.S32716
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Coordinated expression with miRNAs.
MiRNA target gene regulation. This table shows the potential number of targets that need to be examined when analyzing a set of miRNA.
| miRNA | VALIDATED TARGETS | PREDICATED TARGETS | POTENTIAL TOTAL GENES |
|---|---|---|---|
| hsa-miR-106b-5p | 102 | 855 | 957 |
| hsa-miR-25-3p | 81 | 496 | 577 |
| hsa-miR-93-5p | 125 | 839 | 964 |
| hsa-miR-221-3p | 327 | 316 | 643 |
| hsa-miR-17-5p | 417 | 860 | 1277 |
| hsa-miR-20a-5p | 156 | 847 | 1003 |
| hsa-miR-92a-3p | 656 | 496 | 1152 |
| hsa-miR-20b-5p | 31 | 849 | 880 |
| hsa-miR-203a | 133 | 711 | 844 |
| hsa-miR-19a-3p | 108 | 788 | 896 |
| hsa-miR-7-5p | 300 | 434 | 734 |
| hsa-miR-224-5p | 118 | 441 | 559 |
Notes:
miRWalk (http://www.umm.uni-heidelberg.de/apps/zmf/mirwalk/mirnatargetpub.html).
Predicated targets from miRDB (http://mirdb.org/cgi-bin/search.cgi?searchType=miRNA&full=mirbase&searchBox=MIMAT0000081).
Validated targets are for hsa-miR-203.
Description of miRNA analysis tools evaluated. This table describes the tools objectives and methods to the best of this study’s ability based on the information given in the tool’s paper and by the tool’s site.
| BIOINFORMATICS TOOL | DESCRIPTION |
|---|---|
| Empirical GO | One approach for looking at common biological pathways is through gene-annotation enrichment analysis. Since biological processes and pathways incorporate multiple genes, gene enrichment analysis considers the expression of all of the genes in a dataset as compared to the reference genome in question; if there are more genes in the dataset that correlate to a specific process than is expected by the background genome, this process is said to be enriched in the dataset.35 Recently, Bleazard et al. |
| miRό | MiRό ( |
| miRMaid | MiRMaid ( |
| PhenomiR | PhenomiR ( |
| miRPath v.2.0 | DIANA miRPath ( |
| IPA | Ingenuity Pathway Analysis (IPA) ( |
miRNA Input Number and Content. This table portrays the different evaluation criteria categories (‘Criteria’), the operationalized question (‘Definition’), and the different tools which were evaluated. Tools have a ‘Y’ if they fulfilled the criteria question, and ‘N’ if they did not, and a ‘U’ if this criteria question was not determined. At the bottom of the table there is a tally of the total criteria that the tool fulfilled (total number of ‘Y’s).
| CRITERIA | DEFINITION | EMPIRICAL GO | miRó | miRMaid | PhenomiR | miRPath | IPA |
|---|---|---|---|---|---|---|---|
| Access | Is the tool offered as a web-server? | N | Y | Y | Y | Y | Y |
| Is the tool downloadable? | Y | N | Y | N | N | N | |
| Appropriateness | Does the tool provide pathway output for a miRNA cluster input? | Y | N | N | N | Y | Y |
| Cost | Is the tool free? | Y | Y | Y | Y | Y | N |
| Documentation | Are there peer-reviewed articles provided on the tool’s web-site? | Y | Y | Y | Y | Y | Y |
| Are there examples or tutorials provided on the web-site? | N | Y | Y | N | Y | Y | |
| Are there manual pages provided on the tool’s web-site? | Y | N | Y | N | N | Y | |
| Efficiency | Is this the only tool needed to perform the desired analysis? | N | N | N | N | Y | Y |
| Exportability | Can the user download summary results? | Y | N | U | N | Y | Y |
| Does the tool allow data to be saved for future use? | Y | N | U | N | Y | Y | |
| Can the user download raw results? | U | N | U | Y | N | Y | |
| Flexibility | Does the tool find validated targets? | N | Y | U | Y | Y | Y |
| Does the tool find predicated targets? | Y | Y | U | N | Y | Y | |
| Does the tool allow predicted and validated target to be viewed together? | N | Y | U | N | Y | Y | |
| Does the tool allow predicted and validated target to be viewed separately? | N | N | U | N | Y | Y | |
| Does the tool discriminate between predicated and validated targets? | N | N | U | N | Y | Y | |
| Does the tool allow basic text file format? | Y | N | U | N | Y | Y | |
| Does the tool allow various nomenclature? | U | Y | Y | N | Y | Y | |
| Does the tool recognize equivalent names? | U | Y | Y | N | Y | Y | |
| Does the tool allow for species filtering? | Y | N | Y | Y | Y | N | |
| Does the tool suggest different miRNAs if the nomenclature does not match? | U | Y | Y | N | Y | U | |
| Does the tool alert the user when a miRNA does not link? | U | N | Y | Y | Y | Y | |
| Does the tool alert the user of how many miRNAs linked? | U | N | Y | Y | Y | Y | |
| Interpretability | Does the tool provide any summary statistics? | U | Y | U | Y | Y | Y |
| Does the tool provide any visualization? | U | Y | U | Y | Y | Y | |
| Does the tool link to external sites? | N | Y | Y | Y | Y | N | |
| Knowledgebase | Is the knowledgebase manually curated? | N | Y | Y | Y | Y | Y |
| Is the knowledgebase comprised of various repositories? | Y | Y | N | Y | Y | Y | |
| Methodologies | Are any utilized databases updated on a regular basis? | Y | N | N | N | N | Y |
| Are any statistical methodologies used up to date? | Y | U | U | U | U | U | |
| Scalability | Does the tool allow data to be saved for future use? | Y | N | U | N | Y | Y |
| Does the tool allow for different amounts of miRNAs input without limit? | Y | N | U | Y | Y | Y | |
| Standardization | Does the tool utilize KEGG pathways? | Y | N | N | N | Y | N |
| Does the tool utilize GO terms? | Y | Y | N | Y | N | N | |
| Usability | Is the tool compatible with the Windows platform? | Y | Y | N | Y | Y | Y |
| Is the tool compatible with the Mac platform? | Y | Y | Y | Y | Y | Y | |
| Is the tool compatible with the Linux platform? | U | U | Y | U | U | U | |
| Is the tool compatible with the UNIX platform? | U | U | U | U | U | U | |
| Can the user execute the tool without acquiring specific outside software? | N | Y | N | Y | Y | Y | |
| Can the user execute the tool using only their own input miRNA file(s), without acquiring additional files? | N | Y | Y | Y | Y | Y | |
| Can the user operate the tool without any programming language knowledge? | N | Y | N | Y | Y | Y | |
| Total summary score | 7 | 4 | 8 | 2 | 28 | 25 |
Notes:
Certain criteria were evaluated using miRMaid’s web-based server’s search tool only.
When the entry is entered with a validated target, this target is displayed in the results.
Twelve miRNA entries were allows in the query, however results were not obtained.
The examples are provided on the tool site, which is not neccesarily the web site.
Filtering is available, however only TargetScanHuman (predicted targets) will narrow the species.
When miRNAs are entered individually, the tool allows the user to pick a new name; if the miRNA is entered in a cluster it will not be added.
The authors re-ran the analysis using KEGG pathways in place of GO terms.
Raw results cannot be downloaded from the server-based tool; it is possible that this criterion doesn’t reflect full use of the tool.
The web-based tool is able to run on Windows; the Ruby-based claims to be incompatible with Windows, and compatible with Mac and Linux.
For the majority of tools, these criteria are supposed to be answerable as ‘Y’, however this was not verified by attempt by the user.
The tool does provide a link to miRBase, which provides links to validated and predicted targets, however this link is out of date.
MiRPath utilizes the hypergeometric distribution, which has been suggested to be inappropriate for miRNA GSEA; it is possible though that more research needs to be conducted on this before this criterion can conclusively be determined for this tool.
MiRNA nomenclature and linking in tools. This table displays the groupings of the miRNAs used as input to the evaluated tools (which were clustered based on expression in CRC tumors). There are three groups: 1) mature miRNA exact spelling (using ‘miR’ in the name and the most current and specific version, ie, ‘-3p’); 2) gene miRNA exact spelling (using ‘mir’ in the name and the most specific version, ie, ‘-3p’); and 3) alternative spelling (this is the older version of the miRNA’s nomenclature). Each miRNA was tried individually (if available) and as a group (if available); this means that each miRNA was input twice in total and the ‘input group #’ column reflects this. For each tool, it is recorded under the tool column if the tool recognized the miRNA as a valid miRNA (‘Recognized’) and had curated pathway or process data for that miRNA (‘Curated’). A cell value of ‘Y, Y’ for example means that that miRNA (row) was both ‘Recognized’ and ‘Curated’ by that tool (column). If the tool could not be executed an ‘N/A’ is recorded in that column.
| CATEGORY | miRNA | INPUT GROUP # | RESULTS (RECOGNIZED, CURATED) | |||||
|---|---|---|---|---|---|---|---|---|
| EMPIRICAL GO | miRó | miRMaid | PhenomiR | miRPath | IPA | |||
| Mature miRNA exact spelling | hsa-miR-106b-5p | 1, 4 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y |
| hsa-miR-25-3p | 1, 5 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-93-5p | 1, 6 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-221-3p | 1, 7 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-17-5p | 1, 8 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-20a-5p | 1, 9 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-20b-5p | 1, 10 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-92a-3p | 1, 11 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-203a | 1, 12 | N/A | Y, N | N, N | N, N | N, N | Y, Y | |
| hsa-miR-19a-3p | 1, 13 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-7-5p | 1, 14 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-miR-224-5p | 1, 15 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| Gene miRNA exact spelling | hsa-mir-106b-5p | 2, 16 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y |
| hsa-mir-25-3p | 2, 17 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-93-5p | 2, 18 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-221-3p | 2, 19 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-17-5p | 2, 20 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-20a-5p | 2, 21 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-20b-5p | 2, 22 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-92a-3p | 2, 23 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-203a | 2, 24 | N/A | Y, N | N, N | N, N | N, N | Y, Y | |
| hsa-mir-19a-3p | 2, 25 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-7-5p | 2, 26 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| hsa-mir-224-5p | 2, 27 | N/A | Y, N | Y, N | N, N | Y, Y | Y, Y | |
| Alternative ID spellings | hsa-miR-106b | 3, 28 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y |
| hsa-miR-25 | 3, 29 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-93 | 3, 30 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-221 | 3, 31 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-17 | 3, 32 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-20a | 3, 33 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-20 | 3, 34 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-20b | 3, 35 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-92a | 3, 36 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-92 | 3, 37 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-203a-3p | 3, 38 | N/A | Y, N | N, N | N, N | N, N | Y, Y | |
| hsa-miR-19a | 3, 39 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-7 | 3, 40 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| hsa-miR-224 | 3, 41 | N/A | Y, Y | N, N | Y, Y | Y, Y | Y, Y | |
| Cluster 1 (Y/N, %) | N/A | N | N | N | Y, 92% | Y, | ||
| Cluster 2 (Y/N, %) | N/A | N | N | N | Y, 92% | Y, | ||
| Cluster 3 (Y/N, %) | N/A | N | N | N | N, 0% | Y, | ||
| Individual (#, %) ALL clusters | N/A | 13, 34% | 0, 0% | 13, 34% | 35, 92% | 7, 58% | ||
| Individual (#, %) ONE cluster | N/A | 13, 93% | 0, 0% | 13, 93% | 13, 93% | 7, 58% | ||
Notes:
Provided alternative options for spelling.
Questions involving the miRNA-linking of miRMaid were answered using the web-based server’s search tool.
These miRNAs were recognized as gene miRNAs (or precursors) and not mature miRNAs.
These results included any entry that has this miRNA in the title (ie, searching for ‘hsa-miR-20’ returns all entries including miRNAs such as hsa-miR-20b or -205).
Provided annotation for miRNAs with the same seed sequence, which is typically from the same family but also includes other species.
As IPA linked the miRNAs to miRNA families, only the families were counted.
MiRPath pathway results overlap by tissue type. This table displays the number of pathway results shared between any two miRNA clusters used as input. For example, the group of miRNAs that were clustered in CRC tumors and the miRNAs that were randomly selected from CRC tumors share 13 pathways in common. There were 33 pathways total for CRC clustered miRNAs and 17 total pathways for random CRC miRNAs.
| CRC TUMOR CLUSTER | CRC TUMOR RANDOM | CRC NORMAL RANDOM | BREAST TUMOR RANDOM | |||||
|---|---|---|---|---|---|---|---|---|
| N | N/TOTAL | N | N/TOTAL | N | N/TOTAL | N | N/TOTAL | |
| CRC tumor cluster | 13 | 0.76 | 5 | 1.00 | 6 | 0.86 | ||
| CRC tumor random | 13 | 0.39 | 3 | 0.60 | 5 | 0.71 | ||
| CRC normal random | 5 | 0.15 | 3 | 0.18 | 1 | 0.14 | ||
| Breast tumor random | 6 | 0.18 | 5 | 0.29 | 1 | 0.20 | ||
| Total | 33 | 17 | 5 | 7 | ||||