| Literature DB >> 19736252 |
Mitsunori Kayano1, Ichigaku Takigawa, Motoki Shiga, Koji Tsuda, Hiroshi Mamitsuka.
Abstract
MOTIVATION: We address the issue of finding a three-way gene interaction, i.e. two interacting genes in expression under the genotypes of another gene, given a dataset in which expressions and genotypes are measured at once for each individual. This issue can be a general, switching mechanism in expression of two genes, being controlled by categories of another gene, and finding this type of interaction can be a key to elucidating complex biological systems. The most suitable method for this issue is likelihood ratio test using logistic regressions, which we call interaction test, but a serious problem of this test is computational intractability at a genome-wide level.Entities:
Mesh:
Year: 2009 PMID: 19736252 PMCID: PMC2781753 DOI: 10.1093/bioinformatics/btp531
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Synthetic examples: expressions of two genes under the three classes of another gene. (a) randomly distributed, (b,c) easily categorized into three classes and (d) a switching mechanism.
Log-likelihoods and LLR by Newton–Raphson
| LLR ( | ||||
|---|---|---|---|---|
| (a) | −196.4 | −195.5 | −194.4 | 2.23 (0.45) |
| (b) | −1.86 | −0.42 | −2.36 | −3.87 (1.00) |
| (c) | −83.5 | −1.52 | −6.00 | −8.97 (1.00) |
| (d) | −197.8 | −197.4 | −126.4 | 142.12 (0.00) |
Fig. 2.Pseudocode of interaction test.
Fig. 3.LLR and its components.
MANOVA, Box's M test and Means-Covariances (MC) test on four examples in Figure 1
| Examples in | MANOVA | Box's | MC test |
|---|---|---|---|
| (a) | 0.53 (0.28) | 0.70 (0.25) | 0.60 (0.30) |
| (b) | 0.00 (0.00) | 0.68 (0.25) | 0.00 (0.00) |
| (c) | 0.00 (0.00) | 0.71 (0.25) | 0.00 (0.00) |
| (d) | 0.94 (0.09) | 0.00 (0.00) | 0.00 (0.00) |
Fig. 4.Pseudocode of MC test.
Fig. 5.Pseudocode of our entire procedure: FTGI.
Fig. 6.Computation time improvement by reducing α.
Pruning rates and pruning accuracies (top 100) at three α values of FTGI for 107 combinations
| α | 0.05 | 0.01 | 0.001 |
|---|---|---|---|
| Pruning rate | 0.7095 | 0.8611 | 0.9354 |
| Pruning accuracy (top 100) | 0.9967 | 0.9567 | 0.8467 |
Fig. 7.Expressions of two genes under three genotypes of another gene for top 10 (a–j) ranked three-way interactions out of 3 × 108 combinations.
Details of the top 10 three-way interactions in Figure 7
| SNP (GeneID and name) | Gene 1 | Gene 2 | ||||
|---|---|---|---|---|---|---|
| Name | Definition | Name | Definition | |||
| (GeneID) | (GeneID) | |||||
| 1 | −8.91108 | rs7487429 (113251, LARP4) | COX6C (1345) | Cytochrome c oxidase subunit VIc (EC:1.9.3.1) | UBA1 (7317) | Ubiquitin-like modifier activating enzyme 1 (EC:6.3.2.19) |
| 2 | −8.4901 | rs13086670 (80163, FLJ11827) | RERE (473) | Arginine-glutamic acid dipeptide (RE) repeats | TNFRSF1A (7132) | Tumor necrosis factor receptor superfamily, member 1A |
| 3 | −8.10611 | rs2175200 (439992, RPS3AP5) | ATP5D (513) | ATP synthase, H+ transporting, mitochondrial F1 complex, δ subunit (EC:3.6.1.14) | ITCH (83737) | ITCHY E3 ubiquitin protein ligase homolog (mouse) |
| 4 | −8.06076 | rs2797425 (55227, LRRC1) | ATP5G1 (516) | ATP synthase, H+ transporting, mitochondrial F0 complex, subunit C1 (subunit 9) | ATP5H (10476) | ATP synthase, H+ transporting, mitochondrial F0 complex, subunit d (EC:3.6.1.14) |
| 5 | −8.02645 | rs7116710 (440031, LOC440031) | NCSTN (23385) | Nicastrin | HSPA5 (3309) | Heat shock 70kDa protein 5 (glucose-regulated protein, 78kDa) |
| 6 | −8.02495 | rs2058619 (728730, LOC728730) | NDUFA8 (4702) | NADH dehydrogenase (ubiquinone) 1 α subcomplex, 8, 19 kDa (EC:1.6.5.3 1.6.99.3) | NDUFA6 (4700) | NADH dehydrogenase (ubiquinone) 1 α subcomplex, 6, 14kDa |
| 7 | −8.0149 | rs1893261 (25833, POU2F3) | ALS2 (57679) | Amyotrophic lateral sclerosis 2 (juvenile) | SLC25A6 (293) | Solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 6 |
| 8 | −7.86801 | rs1571176 (9044, BTAF1) | ATP5G1 (516) | ATP synthase, H+ transporting, mitochondrial F0 complex, subunit C1 (subunit 9) | ATP5J (522) | ATP synthase, H+ transporting, mitochondrial F0 complex, subunit F6 (EC:3.6.1.14) |
| 9 | −7.84081 | rs12425705 (91012, LASS5) | COX6C (1345) | Cytochrome c oxidase subunit VIc (EC:1.9.3.1) | UBA1 (7317) | Ubiquitin-like modifier activating enzyme 1 (EC:6.3.2.19) |
| 10 | −7.73205 | rs12698191 (393078, tcag7.1023) | NDUFA10 (4705) | NADH dehydrogenase (ubiquinone) 1 α subcomplex, 10, 42kDa (EC:1.6.5.3 1.6.99.3) | COX4 (1327) | Cytochrome c oxidase subunit IV isoform 1 (EC:1.9.3.1) |
Fig. 8.Distributions (left side) of P-values of the top 10 000 interactions detected by FTGI, with those (right side) of Null data (a) 1 and (b) 2.
Fig. 9.The P-values of the (a) top and (b) 10th interactions (shown by arrows) and the distributions of P-values of the corresponding null examples generated.
Results of interaction test over the datasets from GEO
| Rank | Gene pair | #datasets | GDS | #ex. | #ex. | Annotation | |
|---|---|---|---|---|---|---|---|
| from GEO | class 1 | class 2 | |||||
| 1 | {COX6C,UBA1} | 117 | GDS2960_1 | −3.9532 | 60 | 41 | Marfan syndrome: cultured skin fibroblasts |
| 2 | {RERE,TNFRSF1A} | 284 | GDS2736_25 | −5.9049 | 19 | 15 | Malignant fibrous histiocytoma and various soft tissue sarcomas |
| 3 | {ATP5D,ITCH} | 324 | GDS1875_3 | −5.1235 | 27 | 24 | Host cell response to HIV-1 Vpr-induced cell cycle arrest |
| 4 | {ATP5G1,ATP5H} | 392 | GDS2733_1 | −7.9996 | 17 | 17 | Cytosine arabinoside effect on Ewing's sarcoma cell line |
| 5 | {NCSTN,HSPA5} | 102 | GDS2545_5 | −6.4398 | 63 | 25 | Metastatic prostate cancer (HG-U95A) |
| 6 | {NDUFA8,NDUFA6} | 142 | GDS2733_4 | −4.7027 | 17 | 16 | Cytosine arabinoside effect on Ewing's sarcoma cell line |
| 7 | {ALS2,SLC25A6} | 108 | GDS1627_2 | −3.2808 | 16 | 15 | Breast cancer cell lines response to chemotherapeutic drugs |
| 8 | {ATP5G1,ATP5J} | 418 | GDS2960_1 | −3.1628 | 60 | 41 | Marfan syndrome: cultured skin fibroblasts |
| 9 | {COX6C,UBA1} | 117 | GDS2960_1 | −3.9532 | 60 | 41 | Marfan syndrome: cultured skin fibroblasts |
| 10 | {NDUFA10,COX4} | 232 | GDS2643_9 | −6.2133 | 13 | 12 | Waldenstrom's macroglobulinemia: B lymphocytes and plasma cells |
For each gene pair of 10 interactions in Table 4, the number of datasets obtained from GEO, the GDS which gave the smallest P-value, the P-value, the number of examples (ex.) in two classes of the GDS and the annotation of the GDS are shown.
Fig. 10.(a–j) Expressions of two genes which give the smallest P-value of interaction test in the corresponding GDS of GEO.