| Literature DB >> 30097004 |
Luqman Hakim Abdul Hadi1, Quy Xiao Xuan Lin1, Tri Tran Minh1, Marie Loh2, Hong Kiat Ng1, Agus Salim3, Richie Soong4,5, Touati Benoukraf6.
Abstract
BACKGROUND: The knowledge of miRNAs regulating the expression of sets of mRNAs has led to novel insights into numerous and diverse cellular mechanisms. While a single miRNA may regulate many genes, one gene can be regulated by multiple miRNAs, presenting a complex relationship to model for accurate predictions.Entities:
Keywords: Expectation-maximization; Gene regulation; miRNA
Mesh:
Substances:
Year: 2018 PMID: 30097004 PMCID: PMC6086043 DOI: 10.1186/s12859-018-2292-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1miREM Workflow. a The gene-list and setup parameters are entered into the input page. b Each transcript is associated to its targeted miRNA(s) using the selected prediction databases. miRNAs are selected only if their respective HP p-values reach a pre-defined enrichment level. Then, these selected miRNAs are subjected to the EM-algorithm to establish the likelihood probability of each miRNA. miRNAs with the highest likelihood probabilities are the most likely to have an influence on the DEG. Afterwards, predicted miRNAs are clustered according to their seed region sequences in order to identify duplicated predictions. c Subsequently, results are displayed in a dynamic graphical interface allowing an easy data interpretation. d Finally, a dendrogram of miRNA seed sequences is generated to help in identifying duplicated predictions (miRNAs sharing similar sequences)
Feature comparisons of five miRNA predicting tools
| miREM | CORNA | Geneset2miRNA | ChemiRs | Sylamer | |
|---|---|---|---|---|---|
| Platform | Web-based | R package | Web-based | Web-based | Web-based |
| standalone | |||||
| Installation required | No | Yes | No | No | Optional |
| GUI | Yes | No | Yes | Yes | Yes |
| Software last update | 2017 | 2013 | 2009 | 2015 | Unspecified |
| Organisms supported | Human, mouse | 22 species including | 6 species including | Human | 7 species including |
| human and mouse | human and mouse | human and mouse | |||
| Reference databases | Diana, | Miranda | mirDB, | Diana, | Unspecified |
| Miranda, | Pictar, | Miranda, | |||
| mirDB, | PITA, | mirDB, | |||
| Pictar, | TargetScan | miRWalk, | |||
| PITA, | RNA22, | ||||
| RNA22, | RNAhybrid, | ||||
| TargetScan | Pictar(4way), | ||||
| Pictar(5way), | |||||
| PITA, | |||||
| TargetScan | |||||
| Mirbase version | 21 (June 2014) | 10.0 (August 2007) | 11 (April 2008) | 21 (June 2014) | Unspecified |
| Ability to specify | Yes | No | No | Yes | No |
| database(s) | |||||
| Database last updated | 2017 | 2007 | 2008 | 2015 | Unspecified |
| Cross-database query | Yes | No | No | Yes | No |
| Conserved/ nonconserved | Yes | No | No | No | No |
| miRNA | |||||
| distinction | |||||
| Input format | List of DEG: | List of DEG: | List of DEG: | List of DEG: | Full gene list |
| text file, | Ensembl | text file, | text file, | ranked by fold | |
| Gene Symbol, | Gene Symbol, | Gene Symbol, | change | ||
| RefSeq, | RefSeq Protein ID, | Ensembl | |||
| Ensembl, | RefSeq Transcript ID, | ||||
| UCSC | Ensembl, | ||||
| UniGene, | |||||
| Entrez Gene ID, | |||||
| UniProt/Swiss-Prot, | |||||
| Output formats | Tab-delimited file | R list | CSV | CSV | jnlp |
| Scatterplot | |||||
| Heatmap | |||||
| Phylogenetic miRNA | |||||
| classification | |||||
| Show target gene list | Yes | No | Yes | No | No |
| Algorithm | Hypergeometric | Hypergeometric | Hypergeometirc | Hypergeometirc | Hypergeometirc |
| Expectation | Fisher’s exact test | ||||
| Maximization | |||||
| Chi-square | |||||
| Detection of | Yes | No | No | No | No |
| duplicated predictions |
Performance of five miRNA prediction tools using two single miRNA knock-in and one miR-144/451 double knock-out experiments
| Input data | Predictions | ||||||
|---|---|---|---|---|---|---|---|
| Datasets | miRNA | DEG | miREM (based | ChemiRsc | GeneSet2MiRNAc | CORNAc | Sylamerd |
| involved | a | on EM)b | |||||
| Cytoplasmic | hsa-miR-155 | 647 | hsa-miR-155-5p | hsa-miR- | HSA-MIR-155 | hsa-mir-155 | has-miR-155 |
| RNA-seq in | knock in | (1/160) | 155-5p | (1/23) | (1/2) | ||
| U2OS cells | (1/118) | ||||||
| Cytoplasmic | hsa-miR-1 | 743 | hsa-miR-1-3p | hsa-miR-1- | HSA-MIR-1 (1/6) | hsa-mir-1 | has-miR-1 |
| RNA-seq in | knock in | (3/9) | 3p (1/65) | (2/4) | |||
| U2OS cells | |||||||
| Microarray in | miR-144/451 | 396 | mmu-miR-144- | Not | mmu-mir-144 | mmu-mir-144 | mmu-miR- |
| mice | knock out | 3p (1/2) | Applicablee | (3/3) * | (2/4) * | 144 | |
| mmu-miR-451a | |||||||
| (2/2) | |||||||
aDEG list not applicable for Sylamer where a full gene list ranked by fold change was input
bSettings: p-value threshold = 0.01, EM convergence parameter = 0.001, common mappings from 3 or more databases and nonconserved miRNAs not included
cSettings: p-value threshold = 0.01 (for ChemiRs, the minimum number of databases is 5 out of 10)
dRanking number / full prediction result number is not available in Sylamer
eMouse databases are not provided
*P-value threshold = 0.05 (no result with p-value threshold = 0.01)
Performance of five miRNA prediction tools using a miR-181a1/b1 double knock-out experiment
| Input data | Predictions | ||||||
|---|---|---|---|---|---|---|---|
| Datasets | miRNA involved | Gene list | miREM (based on EM) * | ChemiRs | GeneSet2MiRNA ** | CORNA# | Sylamer |
| mmu-miR-181a-5p | MMU-MIR-181B | ||||||
| (rank 4 out of 4) | (rank 1 out of 9) | ||||||
| RNA-seq in mice | miR-181a/b knock out | 243 | Not applicable | No result | No result | ||
| mmu-miR-181b-5p | |||||||
| (rank 1 out of 4) | |||||||
*Settings: p-value threshold = 0.0001, EM convergence parameter = 0.001, common mappings from 4 or more databases and non-conserved miRNAs not included
**p-value threshold = 0.01
#p-value threshold = 0.05