| Literature DB >> 26967326 |
Maria Vila-Casadesús1,2, Meritxell Gironella2, Juan José Lozano1,2.
Abstract
MicroRNAs (miRNAs) are small RNAs that regulate the expression of target mRNAs by specific binding on the mRNA 3'UTR and promoting mRNA degradation in the majority of cases. It is often of interest to know the specific targets of a miRNA in order to study them in a particular disease context. In that sense, some databases have been designed to predict potential miRNA-mRNA interactions based on hybridization sequences. However, one of the main limitations is that these databases have too many false positives and do not take into account disease-specific interactions. We have developed an R package (miRComb) able to combine miRNA and mRNA expression data with hybridization information, in order to find potential miRNA-mRNA targets that are more reliable to occur in a specific physiological or disease context. This article summarizes the pipeline and the main outputs of this package by using as example TCGA data from five gastrointestinal cancers (colon cancer, rectal cancer, liver cancer, stomach cancer and esophageal cancer). The obtained results can be used to develop a huge number of testable hypotheses by other authors. Globally, we show that the miRComb package is a useful tool to deal with miRNA and mRNA expression data, that helps to filter the high amount of miRNA-mRNA interactions obtained from the pre-existing miRNA target prediction databases and it presents the results in a standardised way (pdf report). Moreover, an integrative analysis of the miRComb miRNA-mRNA interactions from the five digestive cancers is presented. Therefore, miRComb is a very useful tool to start understanding miRNA gene regulation in a specific context. The package can be downloaded in http://mircomb.sourceforge.net.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26967326 PMCID: PMC4788200 DOI: 10.1371/journal.pone.0151127
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flow diagram showing the main steps of an analysis using the miRComb package.
Fig 2Main findings of the LIHC report.
A) Principal Components Analysis (PCA) (based on correlation matrix) of miRNA samples. B) Volcano plot showing the miRNAs according to its logratio between cancer and control. C) Heatmap of the top 50 most deregulated miRNAs according to its FDR. D) Density plot of the Pearson Correlation Coefficients of all possible miRNA-mRNA interactions. Lines show different cutoff: p-value < 0.05, p-value < 0.01, FDR < 0.05 and FDR < 0.01. E) Correlation of miR-139-5p and CCNB1 as an example. F) Venn diagram showing the total number of sigifnicant correlations (FDR < 0.05), the total number of predicted interactions in at least one database (TargetScan or microcosm), and the intersection of both. G) Network of selected interactions. Each miRNA-mRNA interaction is negatively correlated (FDR < 10–33) and predicted at least in one database (Targetscan or MicroCosm). Circles represent miRNAs and squares mRNAs; red fill means upregulated miRNA/mRNA, while green fill means downregulated miRNA/mRNA; lines indicate the miRNA-mRNA pairs; red line means positive score and green line means negative score; arrow width is proportional to the number of appearances on the databases (TargetScan or MicroCosm). H) Pie chart showing the number of mRNAs regulated by 0, 1, 2, 3, 4, 5, and >5 miRNAs. I) Barplot showing the number of targets per miRNA and the percentage of mRNAs that are cumulatively regulated by the miRNAs. J) Circos plot of the top 45 miRNA-mRNA interactions sorted by FDR, a line means a miRNA-mRNA pair. Blue lines are the position of the miRNAs and orange lines are the position of the mRNAs.
Summary of the main miRComb computations of the five digestive cancer data sets analysed.
| COAD | ESCA | LIHC | READ | STAD | |
|---|---|---|---|---|---|
| 444 (436, 8) | 191 (178, 13) | 407 (357, 50) | 160 (157, 3) | 443 (406, 37) | |
| 325 | 338 | 343 | 325 | 330 | |
| 14860 | 18807 | 14428 | 14973 | 18565 | |
| 4829500 | 6356766 | 4948804 | 4866225 | 6126450 | |
| 823121 (17.04%) | 568914 (8.95%) | 1156839 (24.38%) | 423296 (8.70%) | 1390596 (22.70%) | |
| 47134 | 30061 | 57675 | 24941 | 71464 |
Fig 3Percentage of false positive miRNA-mRNA predicted interactions in LIHC.
Plot showing the ratios of negatively correlated predicted targets respect to all predicted targets according to the databases for each miRNA. The intensity of the grey color dot is related to the percentage of false postive miRNA-mRNA predicted interactions. In brackets, the exact percentages of false positivesfrom selected miRNAs (miR-122; miR-122*; miR-378c).
Fig 4Venn diagram for miRComb miRNA-mRNA interactions between 5 digestive cancers.
Venn diagram showing miRComb miRNA-mRNA interactions (FDR < 0.05 and predicted in at least one database) that are present in at least one cancer. 1570 miRNA-mRNA interactions appear in the 5 studied digestive cancers.
Top 10 miRNAs sorted by number of specific targets in COAD.
Target mRNAs are sorted according to its negative correlation value (top 20 are dislplayed).
| miRNA | n.targets | mRNAs |
|---|---|---|
| 264 | MRAS, ZEB2, FCER1G, CFL2, SMNDC1, KIAA1949, CLIP4, SIGLEC5, C10orf119, CHORDC1, DZIP1L, LRCH2, SAP30, INTS2, ATF1, RPRD1A, PLXNC1, HCFC2, EIF5A2, TMEFF1 | |
| 213 | TRANK1, KHNYN, PHLPP2, KIAA1370, C5orf41, ACOX1, CCDC88C, LCOR, CCNJL, SYNRG, CHD2, ZBTB34, SESN2, PDCD6IP, GCC2, MLL2, WNT5B, KIAA0317, NBR1, TNIP1 | |
| 178 | TAOK3, GBF1, ZFYVE26, PPARD, TEP1, CYP26B1, KDM6B, BTBD7, CD68, NRBP1, NCOR1, KIAA1671, GOLGA2, ARHGAP21, MINK1, ALOX15B, KIAA1522, PSEN1, ARHGEF18, SEMA7A | |
| 174 | KIAA1211, HADHB, CSF1, FAM107B, PANK3, BTG1, ADAM28, FAM78A, MIER3, MXD1, BTD, RNASEL, MOBKL2B, GZMK, B2M, HADHA, TP53INP1, TCF7L2, DCTN2, TAGAP | |
| 173 | SLC36A1, LASP1, TANC2, FGFR2, ANKFY1, BAHD1, KDM6B, SLC22A23, STAT3, CRK, C15orf17, TADA2B, ABHD5, MAP3K9, IQSEC1, ARHGEF11, NDEL1, CNNM2, KIAA1522, RCOR1 | |
| 170 | FAM107B, KIAA1211, ZDHHC7, MAML3, MXD1, MTF1, BTG1, RAB5B, PANK3, KLF3, LMTK2, MOBKL2B, NDEL1, ABHD5, C8orf4, FAM78A, LAMA3, KLHL20, HOXD1, TSPAN3 | |
| 159 | MAP3K6, ERI1, UGT8, KPNB1, SAP30, MORC4, SLC25A37, RNF125, PAX9, E2F7, ZIC2, WASF1, TUBB, PKNOX1, XKR9, MAP2K1, KITLG, XPO7, SLC25A4, C18orf55 | |
| 157 | ZNF609, WNK1, YLPM1, HIVEP2, PHF3, MED1, DDX24, SPEN, INO80D, LRP6, SP1, SH3PXD2A, C10orf118, AHR, IWS1, SETD5, HNRNPK, RNMT, KIAA1244, LCOR | |
| 146 | TM4SF19, POM121C, WDR45L, ACAN, SPP1, PROCR, ZNF207, PITPNB, EME1, STC1, RAN, MELK, EEF1E1, MRPS23, SCD, SAP30, TUBB, RNF8, CDCA4, SLC35B1 | |
| 140 | SMAD7, FAM102A, TEP1, NCOR1, SESN2, KIAA1522, PANK3, PPARD, GBF1, CRK, SLC22A23, WDR37, TRIM36, CYP26B1, ANKFY1, MYO1F, TMEM156, KIAA1671, MBNL3, MAP3K12 |
Top 10 miRNAs sorted by number of specific targets in STAD.
Target mRNAs are sorted according to its negative correlation value (top 20 are dislplayed).
| miRNA | n.targets | mRNAs |
|---|---|---|
| 390 | PRUNE2, NFIA, LMOD3, PARVA, TMEM35, KANK2, ZNF25, HCFC2, FOXP2, ATP2B4, PDE5A, TEAD1, HOXA3, DPYSL3, RNF180, NRP2, TSHZ3, SMAD9, DDR2, SHISA9 | |
| 357 | KIAA1737, UBR3, RANBP9, TMEM106B, G3BP2, KPNA6, ZNF148, STXBP4, ZYG11B, FAM8A1, HEATR5A, UBE2H, UBE2G1, RLF, PEX13, UBR1, SCAMP1, AHI1, LIMS1, FBXW2 | |
| 326 | PIGW, UHMK1, CAPRIN1, MTHFD1, NXT2, POLA1, PHF6, CMTM8, AZIN1, SMG7, HOOK1, TMED5, SLC39A9, FAF2, NUP54, IPO9, SMCR7L, PASK, SF3B3, SPTLC1 | |
| 319 | LPP, VEZF1, ETV1, RBFOX2, NEK7, SLC25A12, SLC20A2, VAMP4, SGMS2, FBXO8, ZCWPW2, TEAD1, VCL, FAT3, DIXDC1, NCAM2, SGCD, CALD1, MACF1, FBXO3 | |
| 315 | RBFOX2, SLC8A2, DMD, CPEB1, GHR, KLHL4, NEFM, HLF, WNK3, DOCK3, FGF5, LEPR, NFASC, TGFBR3, KLF8, KIAA2022, EZH1, NOVA1, PBX1, FOXN3 | |
| 314 | TMEM62, COL11A1, NXT2, C6orf223, WDFY1, FCGR1A, DTL, NOX1, TRIAP1, PRPF40A, WDR12, TGIF2, CACYBP, SLBP, ALG6, MRPL13, TPM3, RPIA, NDUFA10, E2F7 | |
| 313 | ACTR10, FGF5, MAP4K3, BACH1, PPAPDC1A, SNX6, RBFOX2, CALM1, DPH3, CALU, SESTD1, SLMAP, BAG2, CRBN, ELOVL4, SGCD, COPS4, FBXO32, PRKAB2, KPNA4 | |
| 310 | DIP2C, IL17RD, DNAL1, RMND5A, TGFB2, BACE1, FBXL20, PRICKLE2, ATP2B4, ILDR2, OXTR, SBF2, RYBP, PCYT1B, CALU, CACNA1C, C16orf72, CDKL2, KIF5A, JAZF1 | |
| 290 | FOXP2, NOS1, GRPR, KATNAL1, TEAD1, ANKRD53, GPR135, PENK, KY, WNK3, PRTG, CHIC1, TLE4, BAI1, AASS, KCNQ5, BCL2, SYDE2, PID1, BMPR1A | |
| 286 | VSTM4, TEAD1, AFF4, ABCC9, BCL6, KLF11, ZYG11B, PRKAR1A, UBE2G1, EPN2, C3orf58, ZCCHC24, CCDC6, PCDH10, SETD7, AMOTL2, YPEL2, SAMD4A, ZNF264, PHACTR2 |
Fig 5Specificity of MiRNA-mRNA interactions in LIHC.
Number of total miRNA targets in LIHC versus number of miRNA targets present only in LIHC but not in COAD, ESCA, READ or STAD. Size of the points is proportional to the mean miRNA expression on the LIHC samples included.
Fig 6Clustering and Principal Components Analysis of the five digestive cancers.
Computations are based on the correlation coefficients of the 106.426 miRNA-mRNA pairs that are expressed across all five cancer data sets. A) Heatmap showing the centers of the different clusters. Values represent the mean of the Pearson correlation coefficient of the miRNA-mRNA pairs that fall into the cluster. B) Principal Components Analysis (based on correlation matrix) of the Pearson correlation coefficient of the miRNA-mRNA pairs from the five digestive cancer data sets.
Top 10 miRNAs sorted by number of specific targets in ESCA.
Target mRNAs are sorted according to its negative correlation value (top 20 are dislplayed).
| miRNA | n.targets | mRNAs |
|---|---|---|
| 313 | SLC41A2, HNMT, LONRF3, GATA6, ARHGAP18, MGAT4A, ICA1, LPIN2, VPS13C, SLC12A2, NR3C2, HSD17B11, FOXP1, THRA, C2orf88, PTPRB, TMEM50B, C20orf112, C11orf54, SEPSECS | |
| 261 | ENPP4, SERPINA5, PTPRJ, SPATA13, BTNL8, SPRY1, ACACB, SLC4A4, PHF17, MGRN1, PTP4A2, MGAT4A, MAGI1, DOK1, EXOC6, PRDM16, NEK6, CASC4, HSD17B11, FBP1 | |
| 177 | TTLL6, TFPI, TMEM135, SFMBT1, GALC, SLC46A3, CCL23, YPEL2, MUC3A, ITGB3, C7orf58, ATP8A1, SEC14L1, INSR, GXYLT1, BHMT2, KLF9, HGF, MLXIP, MAP4K3 | |
| 141 | SAFB, VRK2, NEDD1, ABHD12B, LIMK2, AEBP2, TANC2, QSER1, RAB38, CERS3, ROBO1, MBD2, SP3, SYPL1, SCPEP1, ATP2C1, UNK, CCNA1, FKBP3, NCK1 | |
| 128 | RALGPS1, NFATC2, EEPD1, PLEKHA6, MAN2A2, SPATA13, PPM1H, KIAA1958, FOXA3, PRDM16, KBTBD11, SEMA3B, PTPRJ, RALGAPA2, TRIM2, PPFIA2, KIAA1147, GPD1, CAPN9, NKTR | |
| 115 | KLHDC7A, PTPRB, REPS2, LONRF3, ZC3H12B, ZNF420, C11orf75, FUCA1, TTC6, TBC1D12, RAB17, ZNF518A, MLPH, ZNF238, GPRC5B, C10orf68, CRBN, ZNF780B, ZNF506, ZNF253 | |
| 97 | MLPH, TM9SF3, AHCYL2, CAPN5, LONRF3, CREB3L1, MYO7B, LGR4, C10orf81, BACE2, PARP4, MGAT4A, TGFBR2, IYD, MICAL2, LRCH1, FUT8, GOLPH3L, UBR1, TM9SF2 | |
| 87 | MEGF11, NDST3, SNTB1, HNF1B, ATP6V0A2, AHI1, EPB41L1, SNED1, SLC12A3, C9orf96, ARHGDIG, C20orf112, FCGRT, TCAP, NLK, ARHGAP26, IDUA, SLC37A1, UBN2, SMPDL3B | |
| 87 | ACAA2, PEAK1, ZFP36L2, JMJD1C, ARL14, GPD1, PLCL2, CTH, PDK4, PLEKHA6, ZC3H12B, PTPRB, GPR126, FOXA3, OXER1, NR2F2, KBTBD11, SLC46A3, PAPSS2, GORASP1 | |
| 81 | PLEKHA6, GJB1, CREB3L3, ACHE, GAB2, GRK5, FZD5, GPR114, RILP, MIA, MMP15, RPH3AL, MUC5B, DENND3, MUC5AC, SEMA3B, C11orf86, BIN1, ANPEP, IGJ |
Top 10 miRNAs sorted by number of specific targets in LIHC.
Target mRNAs are sorted according to its negative correlation value (top 20 are dislplayed).
| miRNA | n.targets | mRNAs |
|---|---|---|
| 498 | SLC9A1, G6PC3, PKM2, VPS24, TBC1D10B, NCDN, ZDHHC7, C9orf86, GYS1, CHST12, GIT1, DULLARD, ALDOA, PLEKHB2, ATN1, SLC10A3, SLC25A6, TMEM87A, LMNB2, GLG1 | |
| 454 | APLN, AMIGO3, RECQL5, FAM189B, UBE2Q1, MXD3, SNRPC, BAT4, ZNHIT3, NSMCE2, TOMM20, MTX1, BCAP31, PUF60, E4F1, CDKN2A, DUS1L, NFKBIL1, TARBP1, DEDD | |
| 451 | FBXO46, RCC2, UTP18, NAT9, H2AFX, COPS7B, UBE2Z, PHF5A, MCM6, KIF18A, C17orf53, OLA1, POGK, WDR62, HNRNPH1, FAM49B, FBXL19, TPM3, ENTPD2, RFXANK | |
| 448 | BMP1, KIAA0174, ACCN2, C9orf116, CCDC103, E2F4, CDK6, RARG, SP5, OTUD5, OSR1, RALY, EIF2B4, CLDN2, PRMT2, PLSCR3, CDYL2, GTF3C5, CCDC40, PPP1R12C | |
| 444 | LASS5, DNMT3A, NAP1L1, EZH2, RIT1, UCK2, SMARCA4, SUB1, C1orf77, KIAA1841, SMARCD1, RASD2, STK19, DSTYK, ATP6V1E1, ATP5G2, UBE2D2, MFSD6, C12orf34, EED | |
| 439 | NKD1, ADAMTS9, C20orf196, CMIP, VLDLR, DNAL1, RPGRIP1L, AP2M1, CDYL2, HSPB8, MFSD5, AAK1, HIF1AN, LAMA5, WWTR1, LUZP6, TTC30A, RNASEL, CFLAR, CHMP5 | |
| 415 | ARID3A, IGF2BP1, NAP1L1, PCBP4, NPEPL1, C7orf49, ABCC5, DLGAP4, ABCC10, BAX, SLC12A9, C15orf39, IRGQ, CYB561D1, IGF2BP3, FBXL19, GGA3, DUSP9, MMP11, AARSD1 | |
| 399 | SLC26A6, RBCK1, NUP210, NEU1, THOC5, P2RX4, ARID3A, ATP5G2, STK11IP, GLTP, LIMK1, MAZ, RIT1, PLXNA1, MAN1B1, CD2BP2, C15orf39, MSI1, RFXANK, TAZ | |
| 392 | C8orf76, FKBP1A, MICAL1, DTX2, C19orf50, NME6, STK39, STOML1, DGKZ, TMC7, TTC39A, USF1, VOPP1, SEMA7A, TTC35, GNPDA1, FZD2, LENG9, AURKB, RPS19BP1 | |
| 376 | PSMD7, KIAA0513, HM13, EFNA3, WDR45, ACCN2, SLC7A11, WDR8, ATP6AP1, ELOVL1, SCAMP3, PIGT, MRPL33, BRSK1, KIAA0226, FAM21B, UNC45A, MEPCE, TSEN54, RRP12 |
Top 10 miRNAs sorted by number of specific targets in READ.
Target mRNAs are sorted according to its negative correlation value (top 20 are dislplayed).
| miRNA | n.targets | mRNAs |
|---|---|---|
| 262 | KIAA0907, MYLIP, MACC1, RBM41, EFNA3, RBBP6, ABI1, TPR, TMEM106B, MLL5, PHF14, MKLN1, SLC25A36, AFTPH, NCBP2, ZNF292, RBM39, RSBN1, ZNF485, NCOA3 | |
| 179 | WASL, UBE2D1, MTM1, PLEKHM3, TMEM87B, PPP1R12A, CBLL1, WAC, MLLT4, CDC40, PTP4A2, AEBP2, RPRD2, RBBP6, CPEB2, TSR2, BMPR2, BACH2, PURB, ZYG11B | |
| 120 | BMPR2, STYX, STON1, ZEB1, GOPC, RC3H1, RAB3GAP2, TMEM87B, PHACTR2, IQSEC1, GABPA, ZNF350, SEC63, TNRC6A, RAB11FIP2, UBE2J1, JHDM1D, VPS36, SMG1, OSTM1 | |
| 109 | BTBD7, PRKAA1, NT5C2, FBXO28, DHX32, MBNL1, HIPK1, ZMYM2, MIER1, PLEKHA1, ZNF638, C3orf63, DDX3X, RSBN1L, ZNF197, FOXN2, CCDC132, PDE5A, C9orf68, CASP3 | |
| 105 | GPBP1L1, C9orf68, CCNT2, TCF7L2, CREB1, FANCL, ZNF14, ARHGAP5, CLK4, C5orf28, NSUN6, DPY19L4, PPHLN1, EBAG9, NDFIP2, ATXN3, TBL1XR1, SLC35F5, ZNF540, SAV1 | |
| 101 | NCOA6, EEA1, ADNP, TSR2, PAPD5, TAB3, TXLNG, FAM123B, IYD, ZNF81, FMR1, UBN2, WASL, GCC1, WIPF2, XIAP, ZBTB44, PICALM, KLHL15, SIAH1 | |
| 99 | AKAP6, SORBS1, CACNB2, PBX1, ANK3, LMO3, MBNL1, ZFYVE21, BTBD7, SPPL3, TES, NBEA, MYST4, CHMP1B, ARHGAP5, CACNA1C, CASD1, KIAA1143, ADAMTSL3, RABGAP1 | |
| 93 | ZBTB6, GMCL1, CDC40, FAM3C, PHTF2, ZNF800, TBC1D15, HOOK3, PTP4A2, SLC4A7, LMBRD1, ZBTB41, CNOT6L, ITGB8, DEGS1, CMPK1, SNX16, SGTB, TMEM168, SNTB2 | |
| 68 | EGFR, STON1, CSF1, SERTAD2, MARCKS, HGSNAT, ATP2B1, SGMS1, C5orf41, SMCR8, SMCHD1, GPD2, SSH1, SEPN1, ARHGAP21, TICAM2, WIPF2, PLS1, DIRC2, C16orf54 | |
| 62 | ANKRD13C, MON2, TLK1, DYRK1A, PDE4D, FRS2, FAM129A, PDIK1L, RAB3GAP1, C9orf45, NBEA, ZBTB34, PRKAA2, USP15, ARID4B, SFRS11, ENSA, KIAA1598, BRAP, MKL2 |