| Literature DB >> 30563156 |
Yingjie Guo1,2, Chenxi Wu3, Maozu Guo4,5, Xiaoyan Liu6, Alon Keinan7.
Abstract
Among the various statistical methods for identifying gene⁻gene interactions in qualitative genome-wide association studies (GWAS), gene-based methods have recently grown in popularity because they confer advantages in both statistical power and biological interpretability. However, most of these methods make strong assumptions about the form of the relationship between traits and single-nucleotide polymorphisms, which result in limited statistical power. In this paper, we propose a gene-based method based on the distance correlation coefficient called gene-based gene-gene interaction via distance correlation coefficient (GBDcor). The distance correlation (dCor) is a measurement of the dependency between two random vectors with arbitrary, and not necessarily equal, dimensions. We used the difference in dCor in case and control datasets as an indicator of gene⁻gene interaction, which was based on the assumption that the joint distribution of two genes in case subjects and in control subjects should not be significantly different if the two genes do not interact. We designed a permutation-based statistical test to evaluate the difference between dCor in cases and controls for a pair of genes, and we provided the p-value for the statistic to represent the significance of the interaction between the two genes. In experiments with both simulated and real-world data, our method outperformed previous approaches in detecting interactions accurately.Entities:
Keywords: distance correlation coefficient; gene–gene interaction; genome-wide association studies; qualitative trait
Year: 2018 PMID: 30563156 PMCID: PMC6316506 DOI: 10.3390/genes9120608
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Detailed information about GNPDA2 and FAIM2 used in a study of gene–gene interaction. Shown are the rsID (rs number used by researchers and databases to refer to specific SNPs)and physics position of each SNP on each gene.
| Index | SNP Name: Position | |
|---|---|---|
| GNPDA2 (chr4) | FAIM2 (chr12) | |
| 1 | rs16857402:44706453 | rs17201502:50285562 |
| 2 | rs2709:44706913 | rs905619:50286055 |
| 3 | rs10020551:44707815 | rs637871:50287592 |
| 4 | rs4484337:44711547 | rs1027711:50288032 |
| 5 | rs12643262:44714455 | rs956864:50290023 |
| 6 | rs7670601:44715341 | rs640081:50290554 |
| 7 | rs707695:50297670 | |
Figure 1Linkage disequilibrium (LD) patterns of GNPDA2 and FAIM2 used in simulation studies. Figures are LD plots produced using Haploview. GNPDA2 has 6 SNPs, and FAIM2 has 7 SNPs. The number in each square is the LD strength that was measured by , where , 0 means no LD, and 1 means complete LD.
Odds table of the recessive-recessive model.
| SNP1 | SNP2 | ||
|---|---|---|---|
| BB | Bb | bb | |
| AA |
|
|
|
| Aa |
|
|
|
| aa |
|
|
|
Penetrance table of the recessive-recessive model.
| SNP1 | SNP2 | ||
|---|---|---|---|
| BB | Bb | bb | |
| AA |
|
|
|
| Aa |
|
|
|
| aa |
|
|
|
Type-I error of the four methods in different sample sizes. AGGrEGATOr, a gene-based gene gene interaction; GBDcor, gene-based gene-gene interaction via distance correlation coefficient GBIGM, gene-based information gain method; KCCU, kernelized CCU.
| Method | Sample Size | ||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | |
| AGGrEGATOr | 0.05 | 0.06 | 0.07 | 0.04 | 0.02 |
| GBDcor | 0.05 | 0.03 | 0.04 | 0.04 | 0.06 |
| GBIGM | 0.13 | 0.06 | 0.07 | 0.07 | 0.07 |
| KCCU | 0.02 | 0.02 | 0.01 | 0.05 | 0.07 |
Figure 2Empirical, simulation-based statistical power of GBIGM, KCCU, AGGrEGATOr, and GBDcor under eight disease models, after varying the .
Figure 3Empirical, simulation-based statistical power of GBIGM, KCCU, AGGrEGATOr, and GBDcor under eight disease models, after varying the .
The p-values of the gene pairs detected to interact from different methods. The p-values with bold font mean they are significant
| Gene1 | Gene2 | Ref | ||||
|---|---|---|---|---|---|---|
| GBDcor | AGGrEGATOr | KCCU | GBIGM | |||
| AP-1 | M-CSF | ref [ |
| 0.0679 |
|
|
| CXCL12 | FLT-1 |
| 0.59 | 0.152 |
| |
| GM-CSF | VEGF | ref [ |
| 0.284 |
| 0.545 |
| CTSK | VEGF |
| 0.873 |
| 0.47 | |
| CTLA4 | TLR2 |
| 0.152 | 0.057 |
| |
| CXCL1 | RANK | ref [ |
|
| 0.147 | 0.697 |
| IL15 | MMP-3 | ref [ |
| 0.066 | 0.167 | 0.088 |
| GM-CSF | AP-1 | ref [ |
| 0.394 |
|
|
| CD86 | APRIL | ref [ |
| 0.637 |
| 0.655 |
| TGF | VEGF | ref [ |
| 1 |
| 0.632 |
| CD80 | APRIL | ref [ | 0.865 |
| 0.941 | 0.334 |
| CTSK | BLyS | 0.298 |
| 0.356 | 0.056 | |
| AP-1 | IL-6 | 0.24 |
| 0.098 | 0.287 | |
| CD80 | CTSL | 0.094 |
| 0.519 | 0.252 | |
| CXCL6 | FLT-1 | 0.441 |
|
| 0.52 | |
| CTLA4 | AP-1 | 0.075 |
|
| 0.102 | |
| FLT1 | LFA-1 | 0.645 |
| 0.063 |
| |
| CCL3 | TRAP | 0.746 |
| 0.682 |
| |
| IL-18 | TGF | 0.841 |
| 0.149 | 0.22 | |
| IL-1 | SDF-1 | ref [ | 0.618 |
| 0.116 | 0.636 |