| Literature DB >> 23284279 |
Claudia Coronnello1, Ryan Hartmaier, Arshi Arora, Luai Huleihel, Kusum V Pandit, Abha S Bais, Michael Butterworth, Naftali Kaminski, Gary D Stormo, Steffi Oesterreich, Panayiotis V Benos.
Abstract
MicroRNAs (miRNAs) are post-transcriptional regulators that bind to their target mRNAs through base complementarity. Predicting miRNA targets is a challenging task and various studies showed that existing algorithms suffer from high number of false predictions and low to moderate overlap in their predictions. Until recently, very few algorithms considered the dynamic nature of the interactions, including the effect of less specific interactions, the miRNA expression level, and the effect of combinatorial miRNA binding. Addressing these issues can result in a more accurate miRNA:mRNA modeling with many applications, including efficient miRNA-related SNP evaluation. We present a novel thermodynamic model based on the Fermi-Dirac equation that incorporates miRNA expression in the prediction of target occupancy and we show that it improves the performance of two popular single miRNA target finders. Modeling combinatorial miRNA targeting is a natural extension of this model. Two other algorithms show improved prediction efficiency when combinatorial binding models were considered. ComiR (Combinatorial miRNA targeting), a novel algorithm we developed, incorporates the improved predictions of the four target finders into a single probabilistic score using ensemble learning. Combining target scores of multiple miRNAs using ComiR improves predictions over the naïve method for target combination. ComiR scoring scheme can be used for identification of SNPs affecting miRNA binding. As proof of principle, ComiR identified rs17737058 as disruptive to the miR-488-5p:NCOA1 interaction, which we confirmed in vitro. We also found rs17737058 to be significantly associated with decreased bone mineral density (BMD) in two independent cohorts indicating that the miR-488-5p/NCOA1 regulatory axis is likely critical in maintaining BMD in women. With increasing availability of comprehensive high-throughput datasets from patients ComiR is expected to become an essential tool for miRNA-related studies.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23284279 PMCID: PMC3527281 DOI: 10.1371/journal.pcbi.1002830
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1The effect of Fermi-Dirac model in miRNA target prediction.
(A) Overlap of predicted targets from PITA and miRanda using a naïve combination of energy scores. (B) Target overlap between PITA and miRanda using the Fermi-Dirac energy score combination. (C) Receiver-operating Characteristic (ROC) curves of PITA and miRanda predictions with naïve (solid lines) and Fermi-Dirac (broken lines) energy score combination. AUC: area under the curve. Positive and negative sets were derived from the Ago1 IP data (Materials and Methods).
Figure 2ComiR schema.
Figure 3Predicting efficiency of Drosophila-trained ComiR on various datasets.
(A) Self-test on the Drosophila Ago1-IP dataset consist of Set I (positive examples) and equal number of negative examples (from Set IV). (B) Performance on an external Drosophila Ago1-IP dataset consisting of Set III (positive examples) and the remaining of Set IV (negative examples). This Drosophila dataset was not used in training ComiR. (C) SN vs. threshold on an external C. elegans AIN-IP dataset (not an ROC curve due to inability to define a negative dataset). (D) Performance on an external human PAR-CLIP dataset. In all cases, TargetScan was used without the evolutionary conservation feature resulting in a binary outcome. For the human dataset the reader can find a continuous TargetScan ROC curve in Figure S3B, plotted using the context score.
Figure 4Comparison of SVM models for multiple miRNA targets.
Multiple miRNA target scores are combined using the naïve model (red dots) or the ComiR model (FD score or WSUM score). The comparison has been performed on the same datasets as in Figure 3 with the exception of the C. elegans dataset, which has no proper ROC curve. Results are arranged by the difference the ComiR combination models offers over the naïve combination model. P: PITA, M: miRanda, T: TargetScan, S: mirSVR. AUC: area under the curve.
Figure 5rs17737058 is sufficient to disrupt miR-488-5p targeting of NCOA1 (SRC-1) and it associates with BMD.
(A) Western blot of U2OS-ERα cells were transfected with either a mimic negative control (MNC) or a miR-488-5p mimic miRNA. NCOA3 was used as a negative control and β-tubulin as a loading control. (B) U2OS-ERα cells were transfected with 100 nm of MNC or miR-488-5p mimic with 500 ng of either WT or SNP (rs17737058) psiCHECK2-renilla-NCOA1 3′UTR in combination with MNC or miR-488-5p miRNA renilla luciferase values were normalized to firefly luciferase values. * represents p<0.05 by t-test. (C) Premenopausal women not receiving bisphosphonates or chemotherapy (known role on BMD) in the COBRA trial were genotyped for association with decreased BMD. * p-values<0.05 comparing WT and Het+Homo BMD for each genotype by t-test.