| Literature DB >> 34267778 |
Yongkang Kim1, Sungyoung Lee2,3, Jin-Young Jang4, Seungyeoun Lee5, Taesung Park1,6.
Abstract
In the "personalized medicine" era, one of the most difficult problems is identification of combined markers from different omics platforms. Many methods have been developed to identify candidate markers for each type of omics data, but few methods facilitate the identification of multiple markers on multi-omics platforms. microRNAs (miRNAs) is well known to affect only indirectly phenotypes by regulating mRNA expression and/or protein translation. To take into account this knowledge into practice, we suggest a miRNA-mRNA integration model for survival time analysis, called mimi-surv, which accounts for the biological relationship, to identify such integrated markers more efficiently. Through simulation studies, we found that the statistical power of mimi-surv be better than other models. Application to real datasets from Seoul National University Hospital and The Cancer Genome Atlas demonstrated that mimi-surv successfully identified miRNA-mRNA integrations sets associated with progression-free survival of pancreatic ductal adenocarcinoma (PDAC) patients. Only mimi-surv found miR-96, a previously unidentified PDAC-related miRNA in these two real datasets. Furthermore, mimi-surv was shown to identify more PDAC related miRNAs than other methods because it used the known structure for miRNA-mRNA regularization. An implementation of mimi-surv is available at http://statgen.snu.ac.kr/software/mimi-surv.Entities:
Keywords: The Cancer Genome Atlas; miRNA-mRNA integration; pancreatic ductal adenocarcinoma; personalized medicine; statistical method
Year: 2021 PMID: 34267778 PMCID: PMC8276759 DOI: 10.3389/fgene.2021.634922
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Schematic diagram of mimi-surv model. Rectangles and circles indicate observed and latent variables, respectively. Arrows indicate conceptualized directions of effects between the variables. Each miRNA-mRNA integration set consists of one miRNA and its target mRNAs. Each miRNA-mRNA integration set j is summarized by the latent variable f which is linear combination of z and its adjusted mRNA expressions.
FIGURE 2Kaplan-Meier curves of (A) 95 samples of SNUH dataset and (B) 112 samples of TCGA dataset. Red vertical lines indicate median survival times (795 and 652 days SNUH and TCGA, respectively).
List of causal miRNAs and the numbers of target mRNAs used in simulation.
| miRNA | # target mRNAs | Regulated mRNAs in SNUH data |
| 425 | PAX5, SHISA9 | |
| 445 | HMGA2, EGR3 | |
| 9 | SLIT2, BNC2, CDH11 | |
| 172 | PRKAB2, SNX2 | |
| 125 | PLAT, SMAD2, CHRDL1 | |
| 56 | GRIN2B, HMGA2, ARNTL2, ACADL, TDRD6 | |
| 449 | LHX1, NR4A2, PKP1, SHOX, TRIM71, CAMK2A | |
| 285 | MCL1, RLF, RAB5IF, EDEM3 | |
| 149 | FLRT2, PAX6, SDHC, SERAC1, SYT5, UBXN2A | |
| 550 | FRAS1, ANKRD50, LIN28B, PDE7A, SLC4A4, TP53INP1, TRIB2, CD248 |
The number of mRNAs included in the miRNA-mRNA integration set.
| miRNA | # overlapped | # mRNAs (SNUH) | # mRNAs (TCGA) |
| 41 | 331 | 51 | |
| 3 | 10 | 281 | |
| 28 | 469 | 37 | |
| 1 | 47 | 1 | |
| 2 | 4 | 9 | |
| 10 | 336 | 15 | |
| 8 | 50 | 114 | |
| 60 | 461 | 119 | |
| 7 | 24 | 207 | |
| 3 | 32 | 14 | |
| 13 | 43 | 62 | |
| 4 | 50 | 17 | |
| 2 | 8 | 131 | |
| 3 | 36 | 43 |
FIGURE 3Result of type I error evaluation. Bars indicate estimated type I error rate with given parameters (censoring fraction δ). Note that the type I errors were evaluated by fixing all parameters to 0.
FIGURE 4Statistical powers of mimi-surv and the compared methods with different miRNA effect sizes (γ = 0.2, 0.3, and 0.4). The phenotypes were generated from two, five and ten causal miRNA-mRNA integration set and censoring fraction of 0.15 and 0.3.
FIGURE 5Statistical powers of mimi-surv and the compared methods with different mRNA effect sizes (w = 0.5, 0.6, and 0.7). The phenotypes were generated from two, five and ten causal miRNA-mRNA integration set and censoring fraction of 0.15 and 0.3.
FIGURE 6Venn diagram for the number of miRNAs detected by each method in analysis of PDAC data from SNUH. The numbers without brackets show the numbers of miRNAs found in other PDAC analyses, while those within brackets show the number of miRNAs not previously identified.
Results of statistically significant miRNA and its significant mRNAs from both datasets using mimi-surv.
| miRNA | # mRNAs | # significant mRNAs (names) | |||||
| S N U H | 5 | –0.018 | 0.015 | 0.690 | Ridge | ||
| 1 (GRIN2B) | –0.179 | 0.004 | 0.221 | Lasso | |||
| 1 (GRIN2B) | –0.142 | 0.031 | 0.490 | ||||
| –0.179 | 0.021 | 0.382 | |||||
| 901 | 9 | –0.406 | 0.012 | 0.319 | Lasso | ||
| 7 | –0.544 | 0.003 | 0.178 | ||||
| –0.544 | 0.005 | 0.259 | |||||
| 2 | 1 (PAX5) | 0.015 | 0.045 | 0.690 | Ridge | ||
| 1 (PAX5) | 0.008 | 0.033 | 0.601 | Lasso | |||
| 189 | 2 (LRRC14, PHF13) | –0.252 | 0.036 | 0.490 | |||
| –0.252 | 0.046 | 0.620 | |||||
| 46 | 0.024 | 0.045 | 0.690 | Ridge | |||
| T C G A | 281 | 2 (ELFN1, KCNJ12) | 0.679 | 0.010 | 0.218 | ||
| 0.679 | 0.002 | ||||||
| 15 | 2 (BASP1, LPAR1) | 0.131 | 0.038 | 0.154 | Lasso | ||
| 0.131 | 0.029 | 0.167 | |||||
| 109 | 2 (OXSR1, RAB43) | 0.023 | 0.040 | 0.249 | Ridge | ||
| 115 | 0.018 | 0.018 | 0.142 | ||||