| Literature DB >> 23284698 |
Brian Godsey1, Diane Heiser, Curt Civin.
Abstract
MicroRNAs (miRs) are known to play an important role in mRNA regulation, often by binding to complementary sequences in "target" mRNAs. Recently, several methods have been developed by which existing sequence-based target predictions can be combined with miR and mRNA expression data to infer true miR-mRNA targeting relationships. It has been shown that the combination of these two approaches gives more reliable results than either by itself. While a few such algorithms give excellent results, none fully addresses expression data sets with a natural ordering of the samples. If the samples in an experiment can be ordered or partially ordered by their expected similarity to one another, such as for time-series or studies of development processes, stages, or types, (e.g. cell type, disease, growth, aging), there are unique opportunities to infer miR-mRNA interactions that may be specific to the underlying processes, and existing methods do not exploit this. We propose an algorithm which specifically addresses [partially] ordered expression data and takes advantage of sample similarities based on the ordering structure. This is done within a Bayesian framework which specifies posterior distributions and therefore statistical significance for each model parameter and latent variable. We apply our model to a previously published expression data set of paired miR and mRNA arrays in five partially ordered conditions, with biological replicates, related to multiple myeloma, and we show how considering potential orderings can improve the inference of miR-mRNA interactions, as measured by existing knowledge about the involved transcripts.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23284698 PMCID: PMC3526609 DOI: 10.1371/journal.pone.0051480
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Partial orderings.
The graphs above show the three different partial orderings of the data that we explore in this paper. Arrows give the direction of the ordering, where sample A can be said to precede sample B (i.e. ) if in the graph an arrow points from A to B. G-O refers to the grouped-ordered model version, in which samples of the same Durie-Salmon stage are grouped together as replicates. I-O is the individual-ordered model version, where the groups of smaller circles represent that different samples are not grouped as replicates, but the ordering of Durie-Salmon stages is the same as in the G-O ordering (i.e. if and only if the stage of A precedes the stage of B in the G-O ordering). And, I-R is the individual-reference model version, where the samples are again not grouped as replicates, but each of the Durie-Salmon stage IA samples precedes each sample of every other stage. In each partial ordering, there is a prior distribution over the samples which are not preceded by any other samples.
Enriched KEGG pathways among genes in the top 100 interactions.
| G–O | I–O | I–R | TaLasso | Neg.Cor | |
| Number of unique genes in the top 100 interactions | 41 | 58 | 53 | 85 | 56 |
| 05200 :Pathways in cancer | 3 | ||||
| 05215 :Prostate cancer | 2 | 3 | 3 | ||
| 05219 :Bladder cancer | 2 | 2 | 2 | ||
| 05222 :Small cell lung cancer | 2 | 2 | |||
| 05216 :Thyroid cancer | 2 | ||||
| 05214 :Glioma | 2 | 2 | |||
| 05218 :Melanoma | 2 | 2 | |||
| 05016 :Huntington’s disease | 3 | 4 | 3 | ||
| 05014 :Amyotrophic lateral sclerosis (ALS) | 2 | 2 | 2 | ||
| 05010 :Alzheimer’s disease | 3 | ||||
| 04976 :Bile secretion | 2 | ||||
| 04730 :Long-term depression | 2 | ||||
| 04115 :p53 signaling pathway | 2 | 3 | 4 | ||
| 04210 :Apoptosis | 2 | 2 | 2 | ||
| 04010 :MAPK signaling pathway | 3 | 3 | |||
| 04722 :Neurotrophin signaling pathway | 2 | ||||
| 04110 :Cell cycle | 2 | ||||
| 04120 :Ubiquitin mediated proteolysis | 2 | 4 | |||
| 04622 :RIG-I-like receptor signaling pathway | 2 | 2 | |||
| 04144 :Endocytosis | 4 | ||||
| 04914 :Progesterone-mediated oocyte maturation | 3 | ||||
| 04114 :Oocyte meiosis | 3 | ||||
| 04142 :Lysosome | 3 | ||||
| 03060 :Protein export | 3 | ||||
| 04141 :Protein processing in endoplasmic reticulum | 4 |
The top row gives the number of unique genes present in the top 100 miR-mRNA interactions according to each model; the remaining rows give, per column, the number of these genes annotated by KEGG pathway terms with significant enrichment (FDR corrected ) for for at least one of the models proposed. A blank entry indicates that the particular pathway was not significantly enriched in the model. The column G–O refers to the grouped-ordered model version, I–O is the individual-ordered model version, and I-R is the individual-reference model version, while Neg.Cor is the ranking by most negative Pearson correlation (between miR and mRNA expression profiles) among the predicted target pairs. The horizontal lines separate general categories of KEGG pathways, namely cancer-related pathways, other disease-related pathways, and then remaining pathways found to be enriched by at least one of the models.
Figure 2The top 10 interactions according to the G–O ordering.
In the above diagram, we show the miRs (top row) and genes (bottom row) involved in the 10 most significant targeting interactions based on the G–O ordering from Figure 1. In each case, the inferred interaction is negative, meaning that the miR inhibits the expression of the corresponding gene. A red line from an miR to an mRNA indicates that the interaction was predicted by TargetScan and a blue line indicates that the interaction was predicted by miRanda.
miRs with a significant estimate for the trend parameter.
| trending miR | direction | z-score |
| miR-18b | + | 3.88 |
| miR-367 | − | 3.80 |
| miR-18a | + | 3.61 |
| miR-194 | + | 3.57 |
| miR-133b | + | 3.54 |
| miR-92a | + | 3.38 |
| miR-554 | − | 3.24 |
| miR-551a | + | 3.23 |
Shown are the most significant trend parameter estimates (). A “+” in the table denotes that the expression of the miR increased throughout the progression of the partial ordering by disease stage (G–O), and tended to be higher during later stages. Likewise, a “−” denotes that the expression of that miR tended to be higher in early stages and lower in later stages.