| Literature DB >> 32093602 |
Lucile Mégret1, Satish Sasidharan Nair2, Julia Dancourt2, Jeff Aaronson3, Jim Rosinski3, Christian Neri4.
Abstract
BACKGROUND: MicroRNA (miRNA) regulation is associated with several diseases, including neurodegenerative diseases. Several approaches can be used for modeling miRNA regulation. However, their precision may be limited for analyzing multidimensional data. Here, we addressed this question by integrating shape analysis and feature selection into miRAMINT, a methodology that we used for analyzing multidimensional RNA-seq and proteomic data from a knock-in mouse model (Hdh mice) of Huntington's disease (HD), a disease caused by CAG repeat expansion in huntingtin (htt). This dataset covers 6 CAG repeat alleles and 3 age points in the striatum and cortex of Hdh mice.Entities:
Keywords: Biological precision; Machine learning; Multidimensional data; Predictive accuracy; Shape analysis; miRNA regulation
Year: 2020 PMID: 32093602 PMCID: PMC7041117 DOI: 10.1186/s12859-020-3418-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1MiRAMINT analysis of miRNA regulation. This protocol integrates network-based analysis for reduction of data complexity followed by (i) random forest (RF) analysis for selecting explanatory variables, with a p-value computed for each predictor variable using the Altmann’s approach [25] and this RF analysis iterated (which involves different seeds in each iteration) until the number of hypotheses is stable across consecutive iterations (see also Materials and Methods), (ii) surface-matching analysis for high precision in matching the miRNA and mRNA expression profiles across conditions (herein as defined by 6 CAG repeat alleles and 3 age points), (iii) evidence for binding sites and (iv) data prioritization for selecting predicted miRNA-mRNA pairs of high interest. The number of possible miRNA-mRNA pairs retained at each step of the analysis (outcome) of multidimensional data from the brain of Hdh mice is indicated. The miRNA-mRNA pairs retained upon shape-matching analysis can be visualized at http://www.broca.inserm.fr/MiRAMINT/index.php. The whole approach, data prioritization included, retained 5 miRNA-mRNA pairs of high interest in the striatum of Hdh mice and none in the cortex
Negative correlations (FDR < 0.01) between miRNA modules and mRNA modules in Hdh mice
| Striatal data | Cortical data | ||
|---|---|---|---|
| miRNA module | Target module | miRNA module | Target module identity |
| aID#1 ( | ID#2 ( | ID#1 ( | ID#1 ( |
| ID#1 (N = 206) | ID#7 ( | ID#1 (N = 217) | ID#3 ( |
| ID#1 (N = 206) | ID#11 ( | ID#1 (N = 217) | ID#6 ( |
| ID#1 (N = 206) | ID#15 ( | ID#2 ( | ID#11 ( |
| ID#1 (N = 206) | ID#17 ( | ID#3 ( | ID#4 ( |
| ID#2 ( | ID#18 ( | ID#3 (N = 87) | ID#8 ( |
| ID#3 ( | ID#5 ( | ID#3 (N = 87) | ID#9 ( |
| ID#3 (N = 104) | ID#6 ( | ID#3 (N = 87) | ID#10 ( |
| ID#3 (N = 104) | ID#10 ( | ID#4 ( | ID#4 (N = 291) |
| ID#3 (N = 104) | ID#11 (N = 49) | ID#4 (N = 82) | ID#8 (N = 170) |
| ID#4 ( | ID#3 ( | ID#4 (N = 82) | ID#9 (N = 75) |
| ID#8 (N = 20) | ID#2 (N = 40,131) | ID#4 (N = 82) | ID#10 (N = 74) |
aIdentity number
Fig. 2Selected miRNA-target pairs in the striatum of Hdh mice. a Shown are the 31 miRAMINT miRNA-target pairs (see also Table S3 for the full list of miRAMINT miRNA-target pairs in striatum). The targets are contained in rectangles and the miRNAs in ellipses. A thick edge means that evidence for binding sites is available from at least two miRNA databases. A thin edge means that evidence for binding sites is available from only one miRNA database. A thick rectangle means that the maximal LFC of the target is greater than 0.5. A cross arrow indicates the miRNA that is best paired with a target when this target has several possible miRNA regulators. Biological annotations of miRNA targets correspond to GO Biological processes or KEGG pathways at the result of STRING analyzes using stringent criteria (i.e. STRING score > 0.7, Databases and Experiments only, 20 neighbors added on the first shell) the KEGG pathways are those with, at least, 3 genes implied, the GO Biological processes are those with, at least, 5 genes implied. b Examples of 3D-graphs for top miRNA-target pairs (LFC amplitude of the target above 0.5)
Fig. 3Selected miRNA-target pairs in the cortex of Hdh mice. Shown are the 9 miRAMINT miRNA-target pairs (see also Table S3 for the full list of miRAMINT miRNA-target pairs in cortex). The targets are contained in rectangles and the miRNAs in ellipses. A thick edge means that evidence for binding sites is available from at least two miRNA databases. A thin edge means that evidence for binding sites is available from only one miRNA database. All LFC are below 0.5. A cross arrow indicates the miRNA that is best paired with a target when this target has several possible miRNA regulators. Biological annotations of miRNA targets correspond to GO Biological processes or KEGG pathways at the result of STRING analyzes using stringent criteria (i.e. STRING score > 0.7, Databases and Experiments only, 5 neighbors added on the first shell) the KEGG pathways are those with, at