| Literature DB >> 26138921 |
Hyun-Hwan Jeong1, Sangseob Leem2, Kyubum Wee3, Kyung-Ah Sohn4.
Abstract
BACKGROUND: Recent advances in high-throughput technology and the emergence of large-scale genomic datasets have enabled detection of genomic features that affect clinical outcomes. Although many previous computational studies have analysed the effect of each single gene or the additive effects of multiple genes on the clinical outcome, less attention has been devoted to the identification of gene-gene interactions of general type that are associated with the clinical outcome. Moreover, the integration of information from multiple molecular profiles adds another challenge to this problem. Recently, network-based approaches have gained huge popularity. However, previous network construction methods have been more concerned with the relationship between features only, rather than the effect of feature interactions on clinical outcome.Entities:
Mesh:
Year: 2015 PMID: 26138921 PMCID: PMC4491426 DOI: 10.1186/s13048-015-0171-1
Source DB: PubMed Journal: J Ovarian Res ISSN: 1757-2215 Impact factor: 4.234
Fig. 1Illustration of MINA
Summary of datasets used in this study
| Genomic profile | Platform | Data type |
|---|---|---|
| mRNA | Agilent microarray | Continuous |
| CNA | Affymetrix SNP 6 | Discrete |
| methylation | Illumina Infinium HumanMethylation27 | Continuous |
Fig. 2Empirical distribution of mutual information values. We show the distribution of mutual information values computed for every pair of genes in each profile of mRNA expression (red), CNA (blue) and methylation (yellow). The solid lines correspond to the values with respect to the original clinical outcome labels, and the dotted lines are with respect to the permuted labels averaged over 30 permutations
Threshold mutual information on each genomic profile
| Genomic profile | Threshold | No. of gene pairs above threshold | Percentage | |
|---|---|---|---|---|
| α | θ (1 + α) | |||
| mRNA | 0.0 | 0.0763 | 2,562,178 | 5.10 % |
| 0.1 | 0.0839 | 1,125,398 | 2.24 % | |
| 0.5 | 0.1145 | 20,219 | 0.04 % | |
| 0.8 | 0.1373 | 555 | <0.001 % | |
| 1.0 | 0.1526 | 45 | <0.001 % | |
| CNA | 0.0 | 0.0664 | 2,472,048 | 4.92 % |
| 0.1 | 0.0730 | 1,090,500 | 2.17 % | |
| 0.5 | 0.0996 | 23,143 | 0.05 % | |
| 0.8 | 0.1195 | 526 | <0.001 % | |
| 1.0 | 0.1328 | 17 | <0.001 % | |
| METH | 0.0 | 0.0782 | 554,048 | 1.10 % |
| 0.1 | 0.0860 | 221,680 | 0.44 % | |
| 0.5 | 0.1173 | 3,641 | 0.01 % | |
| 0.8 | 0.1407 | 115 | <0.001 % | |
| 1.0 | 0.1564 | 8 | <0.001 % | |
| Total | 50,215,231 | 100.00 % | ||
Top 10 gene pairs for each genomic profile
| Genomic profile | Gene pair | Chromosome | Mutual information |
| ||
|---|---|---|---|---|---|---|
| mRNA | MYO3A | SWI5 | 10p11.1 | 9q34.13 | 0.1753 | 6.62E-05 |
| CYTH3 | ZC3H14 | 7p22.1 | 14q31.3 | 0.1710 | 8.70E-08 | |
| ARHGDIA | DNMBP | 17q25.3 | 10q24.31 | 0.1688 | 1.81E-05 | |
| AK1 | THBS1 | 9q34.1 | 15q15 | 0.1670 | 3.82E-07 | |
| MCM3 | PCDHB5 | 6p12 | 5q31 | 0.1645 | 1.20E-05 | |
| CRYAB | TTPAL | 11q22.3-q23.1 | 20q13.12 | 0.1627 | 1.57E-07 | |
| CYP39A1 | NUAK1 | 6p21.1-p11.2 | 12q23.3 | 0.1627 | 2.01E-08 | |
| CMBL | KRT23 | 5p15.2 | 17q21.2 | 0.1624 | 1.67E-03 | |
| CYTH3 | FBXW8 | 7p22.1 | 12q24.23 | 0.1616 | 4.66E-06 | |
| CYTH3 | IDE | 7p22.1 | 10q23-q25 | 0.1605 | 4.16E-08 | |
| CNA | SNRPB2 | WSB2 | 20p12.1 | 12q24.23 | 0.1432 | 1.21E-04 |
| KIF16B | WSB2 | 20p11.23 | 12q24.23 | 0.1411 | 1.52E-04 | |
| SNRPB2 | TAOK3 | 20p12.1 | 12q | 0.1377 | 1.70E-04 | |
| SNRPB2 | TESC | 20p12.1 | 12q24.22 | 0.1372 | 1.22E-04 | |
| PEBP1 | SNRPB2 | 12q24 | 20p12.1 | 0.1370 | 1.70E-04 | |
| NOS1 | SNRPB2 | 12q24.22 | 20p12.1 | 0.1367 | 1.22E-04 | |
| KIF16B | TAOK3 | 20p11.23 | 12q | 0.1355 | 2.13E-04 | |
| KIF16B | TESC | 20p11.23 | 12q24.22 | 0.1352 | 1.53E-04 | |
| KIF16B | PEBP1 | 20p11.23 | 12q24 | 0.1349 | 2.13E-04 | |
| FBXW8 | SNRPB2 | 12q24.23 | 20p12.1 | 0.1348 | 1.87E-04 | |
| METH | F2RL3 | SLC7A11 | 19p12 | 4q28-q32 | 0.1670 | 1.14E-04 |
| CCM2L | TMEM129 | 20q11.21 | 4p16.3 | 0.1618 | 2.60E-04 | |
| CAND1 | YTHDC1 | 12q14 | 4q13.3 | 0.1598 | 5.86E-04 | |
| ENSA | PTHLH | 1q21.3 | 12p12.1-p11.2 | 0.1584 | 2.59E-06 | |
| CDH8 | DYRK2 | 16q22.1 | 12q15 | 0.1582 | 5.08E-11 | |
| FOXL1 | NRTN | 16q24 | 19p13.3 | 0.1575 | 1.55E-07 | |
| FOLR2 | TMEM129 | 11q13.3-q14.1 | 4p16.3 | 0.1570 | 1.04E-05 | |
| SYT8 | ZBTB1 | 11p15.5 | 14q23.3 | 0.1566 | 1.91E-04 | |
| IL23A | ZBTB1 | 12q13.13 | 14q23.3 | 0.1559 | 3.62E-06 | |
| MFAP4 | ZBTB1 | 17p11.2 | 14q23.3 | 0.1557 | 3.51E-05 | |
Fig. 3Kaplan-Meier survival plots of the gene pair with the highest mutual information value for each single profile. We show the Kaplan-Meier survival curve of the gene pair having the highest mutual information along with the ones derived by each single gene
Fig. 4Boxplots for p-values from survival analysis. The distribution of p-values from the survival analysis for the extracted gene pairs obtained from different significance levels of α is shown as a boxplot
Network Topologies for different α values
| α | Profile | Vertices | Edges | Number of components | Size of largest component |
|
|---|---|---|---|---|---|---|
| 0.0 | mRNA | 9,997 | 2,562,178 | 1 | 9,997 | 0.643 |
| CNA | 10,021 | 2,472,048 | 1 | 10,021 | 0.590 | |
| METH | 9,801 | 554,048 | 1 | 9,801 | 0.839 | |
| I∀ | 1,244 | 1,538 | 61 | 1,105 | 0.914 | |
| I∃ | 10,022 | 5,385,486 | 1 | 10,022 | 0.366 | |
| 0.1 | mRNA | 9,943 | 1,125,398 | 1 | 9,943 | 0.758 |
| CNA | 9,934 | 1,090,500 | 1 | 9,934 | 0.749 | |
| METH | 9,118 | 221,680 | 1 | 9,118 | 0.842 | |
| I∀ | 138 | 95 | 44 | 27 | 0.950 | |
| I∃ | 10,022 | 2,396,372 | 1 | 10,022 | 0.505 | |
| 0.5 | mRNA | 6,466 | 20,219 | 25 | 6,418 | 0.810 |
| CNA | 2,886 | 23,143 | 9 | 2,855 | 0.831 | |
| METH | 2,166 | 3,641 | 25 | 2,116 | 0.700 | |
| I∃ | 8,032 | 46,975 | 11 | 8,012 | 0.864 | |
| 0.8 | mRNA | 641 | 555 | 112 | 358 | 0.804 |
| CNA | 245 | 526 | 10 | 106 | 0.892 | |
| METH | 145 | 115 | 32 | 58 | 0.690 | |
| I∃ | 1,002 | 1,196 | 137 | 579 | 0.913 | |
| 1.0 | mRNA | 73 | 45 | 28 | 11 | 0.797 |
| CNA | 13 | 17 | 1 | 13 | 0.363 | |
| METH | 15 | 8 | 7 | 3 | 1.000 | |
| I∃ | 100 | 70 | 35 | 23 | 0.803 |
Fig. 5I0.8∀ of whole genomic profiles
Fig. 6I0.8∃ of whole genomic profiles
Fig. 7Four-way Venn diagram summarizing the number of shared and unique GO terms enriched in the network from each profile
Significantly enriched GO terms in the largest component of I0.8∃
| Category | ID | Description |
| Adjusted | Count | Total |
|---|---|---|---|---|---|---|
| MF | GO:0044822 | poly(A) RNA binding | 4.37E-12 | 1.35E-09 | 50 | 1180 |
| CC | GO:0005654 | nucleoplasm | 1.93E-09 | 6.54E-08 | 63 | 1745 |
| CC | GO:0070062 | extracellular vesicular exosome | 2.39E-09 | 7.09E-08 | 59 | 1589 |
| BP | GO:0006915 | apoptotic process | 1.32E-07 | 1.20E-05 | 49 | 1305 |
| BP | GO:0016567 | protein ubiquitination | 1.75E-07 | 1.52E-05 | 28 | 542 |
| CC | GO:0005730 | nucleolus | 5.61E-06 | 1.06E-04 | 38 | 1072 |
| CC | GO:0031226 | intrinsic component of plasma membrane | 1.66E-05 | 2.89E-04 | 49 | 1612 |
| BP | GO:0006366 | transcription from RNA polymerase II promoter | 1.60E-05 | 6.57E-04 | 26 | 611 |
| CC | GO:0005887 | integral component of plasma membrane | 5.07E-05 | 8.34E-04 | 46 | 1546 |
| BP | GO:0071156 | regulation of cell cycle arrest | 2.62E-05 | 9.70E-04 | 10 | 116 |
| BP | GO:0001775 | cell activation | 5.63E-05 | 1.86E-03 | 31 | 856 |
| BP | GO:0045087 | innate immune response | 7.46E-05 | 2.34E-03 | 34 | 993 |
| CC | GO:0000228 | nuclear chromosome | 1.60E-04 | 2.43E-03 | 19 | 453 |
| MF | GO:0042803 | protein homodimerization activity | 4.87E-05 | 3.26E-03 | 26 | 781 |
| MF | GO:0019901 | protein kinase binding | 4.93E-05 | 3.26E-03 | 22 | 603 |
| MF | GO:0008201 | heparin binding | 5.51E-05 | 3.40E-03 | 11 | 182 |
| BP | GO:0071901 | negative regulation of protein serine/threonine kinase activity | 1.29E-04 | 3.69E-03 | 10 | 140 |
| BP | GO:0007596 | blood coagulation | 1.36E-04 | 3.80E-03 | 22 | 541 |
| CC | GO:0005783 | endoplasmic reticulum | 3.16E-04 | 4.27E-03 | 51 | 1918 |
| BP | GO:0051222 | positive regulation of protein transport | 1.88E-04 | 4.88E-03 | 15 | 301 |
| BP | GO:0000086 | G2/M transition of mitotic cell cycle | 2.27E-04 | 5.70E-03 | 10 | 150 |
Significantly enriched GO terms in the largest component of I0.1∀
| Category | ID | Description |
| Adjusted | Count | Total |
|---|---|---|---|---|---|---|
| BP | GO:0030097 | hemopoiesis | 1.82E-05 | 6.81E-03 | 6 | 699 |
| BP | GO:0002520 | immune system development | 4.12E-05 | 6.81E-03 | 6 | 809 |
| BP | GO:0007568 | aging | 3.03E-04 | 1.36E-02 | 4 | 399 |
| BP | GO:0030217 | T cell differentiation | 4.69E-04 | 1.99E-02 | 3 | 185 |
| BP | GO:0043065 | positive regulation of apoptotic process | 7.47E-04 | 2.02E-02 | 4 | 507 |
| BP | GO:0006915 | apoptotic process | 5.92E-04 | 2.02E-02 | 6 | 1320 |
| BP | GO:0001890 | placenta development | 1.07E-03 | 2.44E-02 | 3 | 246 |
| BP | GO:0050870 | positive regulation of T cell activation | 1.08E-03 | 2.44E-02 | 3 | 247 |
| BP | GO:0023014 | signal transduction by phosphorylation | 1.49E-03 | 2.90E-02 | 3 | 276 |
| BP | GO:0071214 | cellular response to abiotic stimulus | 1.68E-03 | 2.93E-02 | 3 | 288 |
| BP | GO:0001525 | angiogenesis | 4.53E-03 | 4.90E-02 | 3 | 409 |