| Literature DB >> 35180244 |
Dominic Russ1,2, John A Williams1,2, Victor Roth Cardoso1,2, Laura Bravo-Merodio1,2, Samantha C Pendleton1,2, Furqan Aziz1,2, Animesh Acharjee1,2,3, Georgios V Gkoutos1,2,3,4,5.
Abstract
BACKGROUND: Numerous approaches have been proposed for the detection of epistatic interactions within GWAS datasets in order to better understand the drivers of disease and genetics.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35180244 PMCID: PMC8856572 DOI: 10.1371/journal.pone.0263390
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1An example of the distribution of cases and controls across all possible genotypes as a result of an epistatic interaction for two loci (note that y-axes are different scales).
To the right and below are the marginal effects, shown as a ratio of cases to controls. This is an example of a pure interaction because these marginal effects are very minimal. The blue dotted line indicates numbers expected under the Hardy-Weinberg Equilibrium, with capital genotypes representing the major allele and lower case the minor allele.
Configurations used for GAMETES generated models.
In all datasets there are 1,000 cases and controls and 30 replicate.
| Loci in Interaction | Heritability | Ease of Detection Measure | Total loci |
|---|---|---|---|
| 2 | 0.02 | 2 | 2500 |
| 2 | 0.01 | 2 | 2500 |
| 2 | 0.005 | 2 | 2500 |
| 2 | 0.02 | 1 | 2500 |
| 2 | 0.01 | 1 | 2500 |
| 2 | 0.005 | 1 | 2500 |
| 3 | 0.02 | 2 | 500 |
| 3 | 0.01 | 2 | 500 |
| 3 | 0.005 | 2 | 500 |
| 3 | 0.02 | 1 | 500 |
| 3 | 0.01 | 1 | 500 |
| 3 | 0.005 | 1 | 500 |
EpiGEN Penetrance models with capital genotypes as the major allele.
|
|
|
|
|
|
|
|
|
|
| 0 | 0 | 0 |
| 0 | 0 | 0 |
|
| 0 | 1 | 1 |
| 0 | 0 | 0 |
|
| 0 | 1 | 1 |
| 0 | 0 | 1 |
|
|
|
|
|
|
|
|
|
|
| 0 | 0 | 0 |
| 0 | 0 | 1 |
|
| 0 | 0 | 1 |
| 0 | 0 | 1 |
|
| 1 | 1 | 1 |
| 1 | 1 | 0 |
|
|
|
|
| ||||
|
| 1 | 0 | 0 | ||||
|
| 0 | 1 | 0 | ||||
|
| 0 | 0 | 1 |
Fig 2Summary of the pure epistasis results for second order interactions.
The three bar charts show the number of True Positive interactions discovered and the position that the algorithm ranked it amongst combinations with noise loci. Each chart shows a different heritability for the interaction, with higher heritability explained making it more prominent against random noise. The table shows the results of a Mann-Whitney U Test comparing the non-normal distribution of True Positive ranks for a single tool against the distribution of true positive ranks for all other tools.
Fig 5Summary of the impure model results for third order interactions.
Each bar chart shows the number of True Positive interactions discovered and the position that the algorithm ranked it amongst combinations with noise loci. Each chart shows a different interaction models (see Table 2). The table shows the results of a Mann-Whitney U Test comparing the non-normal distribution of True Positive ranks for a single tool against the distribution of True Positive ranks for all other tools.
Fig 3Summary of the impure model results for second order interactions.
Each bar chart shows the number of True Positive interactions discovered and the position that the algorithm ranked it amongst combinations with noise loci. Each chart shows a different interaction models (see Table 2). The table shows the results of a Mann-Whitney U Test comparing the non-normal distribution of True Positive ranks for a single tool against the distribution of True Positive ranks for all other tools.
Fig 4Summary of the pure epistasis results for third order interactions.
The three bar charts show the number of True Positive interactions discovered and the position that the algorithm ranked it amongst combinations with noise loci. Each chart shows a different heritability for the interaction, with higher heritability explained making it more prominent against random noise. The table shows the results of a Mann-Whitney U Test comparing the non-normal distribution of True Positive ranks for a single tool against the distribution of True Positive ranks for all other tools.
Average time taken in minutes/RAM used in MB per method at different numbers of loci for a di-locus or tri-locus search.
*AntEpiSeeker performed with slightly different settings in three-locus experiments (see Methods).
| Two Locus Detection | Three Locus Detection | |||
|---|---|---|---|---|
| Tool | 500 loci | 2500 loci | 100 loci | 500 loci |
| AntEpiSeeker | 1.89/6 | 6.35/16 | 1.32*/6 | 1.79*/9 |
| CINOEDV | 16.20/205 | 15.70/272 | 48.18/211 | 422.18/1346 |
| MDR | 0.26/1 | 6.01/291 | 0.37/1 | 22.59/2092 |
| SNPRuler | 0.34/1 | 0.66/306 | 0.56/1 | 0.46/2 |
| wtest | 0.41/1 | 7.47/2096 | 6.73/199 | 851.33/14302 |
| MPI3SNP | -/- | -/- | 0.22/1 | 1.23/69 |
| Cassi | 0.50/1 | 1.76/156 | -/- | -/- |
| epiACO | 13.20/665 | 20.85/697 | -/- | -/- |
| GSS | 29.45/682 | 2791.15/843 | -/- | -/- |
| PLINK | ||||
| Fast Epistasis | 0.23/1 | 0.21/1 | -/- | -/- |
| PLINK BOOST | 0.22/1 | 0.22/1 | -/- | -/- |
| PLINK Epistasis | 0.25/1 | 2.40/1 | -/- | -/- |
Epistatic interactions predicted by each tool as most important for Atrial Fibrillation.
An * denotes that the SNP was found to be intergenic and this is the nearest gene.
| Tool | SNP1 | Gene1 | SNP2 | Gene2 |
|---|---|---|---|---|
|
| rs730072 | rs1152591 | ||
|
| rs1608994 |
| rs3809775 | |
|
| rs730072 | rs4668136 |
| |
|
| rs730072 | rs1152591 | ||
|
| rs9346918 |
| rs4342945 |
|
|
| rs3792234 |
| rs1152591 | |
|
| rs1608994 |
| rs3809775 | |
|
| 9:140746691 |
| rs56018060 | |
|
| rs9346918 |
| rs4342945 |
|
|
| rs6754266 | rs12627212 |
| |
|
| rs3792234 |
| rs1152591 |