| Literature DB >> 35609985 |
Michael Soutschek1,2, Tomás Germade1, Pierre-Luc Germain1,3,4, Gerhard Schratt1,2.
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that are among the main post-transcriptional regulators of gene expression. A number of data collections and prediction tools have gathered putative or confirmed targets of these regulators. It is often useful, for discovery and validation, to harness such collections to perform target enrichment analysis in given transcriptional signatures or gene-sets in order to predict involved miRNAs. While several methods have been proposed to this end, a flexible and user-friendly interface for such analyses using various approaches and collections is lacking. enrichMiR (https://ethz-ins.org/enrichMiR/) addresses this gap by enabling users to perform a series of enrichment tests, based on several target collections, to rank miRNAs according to their likely involvement in the control of a given transcriptional signature or gene-set. enrichMiR results can furthermore be visualised through interactive and publication-ready plots. To guide the choice of the appropriate analysis method, we benchmarked various tests across a panel of experiments involving the perturbation of known miRNAs. Finally, we showcase enrichMiR functionalities in a pair of use cases.Entities:
Year: 2022 PMID: 35609985 PMCID: PMC9252831 DOI: 10.1093/nar/gkac395
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Overview of miRNA target enrichment tools and comparison of their specific web interface options
| enrichMiR | Mienturnet | mirExTra 2.0 | miTEA | |
|---|---|---|---|---|
| Statistical tests: (see | A compendium of benchmarked statistical tests | Hyper-geometric test | Hyper-geometric test | Mutual minimum hyper-geometric test (mmHG) |
|
| ||||
| Allowed input: | Various DEA formats, custom gene lists & GO-Terms (gene symbol or Ensembl IDs) | Custom gene list (gene symbols) | Table with differentially expressed genes + background (Ensembl IDs) | Ranked gene list (gene symbols) |
| Supported Species: | 6 | 6 | 4 | 5 |
| Available binding site databases: | ScanMiR, TargetScan, miRTarBase & oRNAment | TargetScan & miRTarBase | DIANA-TarBase v7.0 & DIANA-microT-CDS | TargetScan, MicroCosm & EiMMo |
| Allow specific background: | Yes | No | Yes | Yes |
| Specify miRNA expression: | Yes | No | Yes | No |
| UTR length correction: | Yes | No | No | No |
| Plotting options: | Enrichment plots & Cumulative Distribution Plots | Network visualization | Network visualization | miRNA network visualization |
Tools which used to have a web interface that is not anymore available (e.g Sylamer) are not included here. See Supplementary Table S1 for additional information on miRNA enrichment tools in general.
Overview of the available enrichment tests
| Test | Input signal type | Annotation type | Description |
|---|---|---|---|
| overlap | binary (set) | genesets | Over-representation (ORA) of target genes among set |
| woverlap † | binary (set) | nb sites | ORA of binding sites, correcting for UTR length |
| siteoverlap | binary (set) | nb sites | ORA of binding sites |
| areamir † | continuous (DEA) | scores | Score-weighted analytic Rank Enrichment Analysis (aREA) |
| modsites | continuous (DEA) | nb sites | Linear regression of logFCs on nb of binding sites |
| modscore | continuous (DEA) | scores | Linear regression of logFCs on predicted repression scores |
| ebayes † | continuous (DEA) | scores | Linear regression of logFCs on predicted repression scores with moderated |
| lmadd †,* | continuous (DEA) | scores | As ebayes, but followed by consecutive additive linear regression of logFCs on predicted repression scores |
| regmir.cc † | continuous (DEA) | scores | Regularized regression to select miRNAs for linear regression testing |
| ks | continuous (DEA) | genesets | Kolmogorov-Smirnov (KS) test on logFCs |
| mw | continuous (DEA) | genesets | Mann-Whitney / Wilcoxon test on logFCs |
| gsea | continuous (DEA) | genesets | Gene set enrichment analysis (GSEA) |
Tests denoted by an asterisk (*) are novel, while those denoted by a dagger (†) are novel in the specific context of miRNA target enrichment analysis.
Figure 1.Benchmark of miRNA target enrichment tests, using the original datasets (A, B) as well as their conjunction with scrambled versions (C–E). (A) Rank of the true miRNA among the predicted candidates (the lower the better) using the various tests across the benchmark datasets. (B) Overall sensitivity and False Discovery Rate (FDR) at a nominal FDR threshold of 0.05, using the original datasets. (C, D) Rank of the true miRNA (C) and FDR (D) using the various tests across the original and scrambled datasets (averaged per dataset). (E) Overall sensitivity and False Discovery Rate (FDR) at a nominal FDR threshold of 0.05 across scrambled datasets.
Figure 2.Enrichment of miR-7 binding sites in genes downregulated upon Cyrano knockout in skeletal muscle. An enrichMiR analysis was conducted on the 5’000 highest expressed genes of a DEA generated from bulk RNA-sequencing data upon knockout of the lncRNA Cyrano versus control in skeletal muscle tissue (39). (A) Enrichment plot showing a significant enrichment of miR-7 binding sites in downregulated genes as revealed by the siteoverlap test using the TargetScan mouse conserved annotation. Each dot depicts one miRNA family, coloured by the expression of its top expressed member in the tissue (23). The size of the dot indicates the number of predicted targets significantly downregulated in the dataset. (B) CD plot showing a site type dependent downregulation of miR-7a-5p targets using the scanMiR mouse annotation. Genes classified as ‘no site’ do not contain a canonical 7mer or 8mer site.
Figure 3.enrichMiR analysis suggests a role of miR-129-5p upon PTX treatment in neurons. An enrichMiR analysis was performed on the 5’000 highest expressed genes of a DEA generated from RNA-sequencing data of hippocampal neurons treated with PTX versus control (44). (A) Enrichment plot depicting the ranking of miRNAs by enrichment and p-value as assessed with the siteoverlap test on the TargetScan rat conserved annotation. Each dot depicts one miRNA family, coloured by the expression of its top expressed member in hippocampal neurons (44). The size of the dot indicates the number of predicted significantly downregulated target genes of each miRNA. (B) CD plot indicating a downregulation of miR-129–5p targets using the ScanMiR rat annotation (split by best site type, ‘no site’ indicating genes without a canonical 7mer or 8mer binding site). (C) An enrichMiR analysis of genes belonging to the GO-Term ‘Calcium channel complex’ against the same background of expressed genes as used before (44) employing the siteoverlap test and the Targetscan rat conserved annotation reveals a significant enrichment of miR-129 targets in this gene set.