Diane Duroux1, Héctor Climente-González2,3,4,5, Chloé-Agathe Azencott2,3,4, Kristel Van Steen1,6. 1. BIO3 - Systems Genetics, GIGA-R Medical Genomics, University of Liège, 4000 Liège, Belgium, 11 Liège 4000, Belgium. 2. Institut Curie, PSL Research University, F-75005 Paris, France. 3. INSERM, U900, F-75005 Paris, France. 4. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France. 5. High-Dimensional Statistical Modeling Team, RIKEN Center for Advanced Intelligence Project, Chuo-ku, Tokyo 103-0027, Japan. 6. BIO3 - Systems Medicine, Department of Human Genetics, KU Leuven, 3000 Leuven, Belgium, 49 3000 Leuven, Belgium.
Abstract
BACKGROUND: Detecting epistatic interactions at the gene level is essential to understanding the biological mechanisms of complex diseases. Unfortunately, genome-wide interaction association studies involve many statistical challenges that make such detection hard. We propose a multi-step protocol for epistasis detection along the edges of a gene-gene co-function network. Such an approach reduces the number of tests performed and provides interpretable interactions while keeping type I error controlled. Yet, mapping gene interactions into testable single-nucleotide polymorphism (SNP)-interaction hypotheses, as well as computing gene pair association scores from SNP pair ones, is not trivial. RESULTS: Here we compare 3 SNP-gene mappings (positional overlap, expression quantitative trait loci, and proximity in 3D structure) and use the adaptive truncated product method to compute gene pair scores. This method is non-parametric, does not require a known null distribution, and is fast to compute. We apply multiple variants of this protocol to a genome-wide association study dataset on inflammatory bowel disease. Different configurations produced different results, highlighting that various mechanisms are implicated in inflammatory bowel disease, while at the same time, results overlapped with known disease characteristics. Importantly, the proposed pipeline also differs from a conventional approach where no network is used, showing the potential for additional discoveries when prior biological knowledge is incorporated into epistasis detection.
BACKGROUND: Detecting epistatic interactions at the gene level is essential to understanding the biological mechanisms of complex diseases. Unfortunately, genome-wide interaction association studies involve many statistical challenges that make such detection hard. We propose a multi-step protocol for epistasis detection along the edges of a gene-gene co-function network. Such an approach reduces the number of tests performed and provides interpretable interactions while keeping type I error controlled. Yet, mapping gene interactions into testable single-nucleotide polymorphism (SNP)-interaction hypotheses, as well as computing gene pair association scores from SNP pair ones, is not trivial. RESULTS: Here we compare 3 SNP-gene mappings (positional overlap, expression quantitative trait loci, and proximity in 3D structure) and use the adaptive truncated product method to compute gene pair scores. This method is non-parametric, does not require a known null distribution, and is fast to compute. We apply multiple variants of this protocol to a genome-wide association study dataset on inflammatory bowel disease. Different configurations produced different results, highlighting that various mechanisms are implicated in inflammatory bowel disease, while at the same time, results overlapped with known disease characteristics. Importantly, the proposed pipeline also differs from a conventional approach where no network is used, showing the potential for additional discoveries when prior biological knowledge is incorporated into epistasis detection.
Authors: Dermot P B McGovern; Jerome I Rotter; Ling Mei; Talin Haritunians; Carol Landers; Carrie Derkowski; Deb Dutridge; Marla Dubinsky; Andy Ippoliti; Eric Vasiliauskas; Emebet Mengesha; Lily King; Sheila Pressman; Stephan R Targan; Kent D Taylor Journal: Inflamm Bowel Dis Date: 2009-06 Impact factor: 5.325
Authors: Jestinah M Mahachie John; Tom Cattaert; François Van Lishout; Elena S Gusareva; Kristel Van Steen Journal: PLoS One Date: 2012-01-05 Impact factor: 3.240
Authors: Sarah A Pendergrass; Alex Frase; John Wallace; Daniel Wolfe; Neerja Katiyar; Carrie Moore; Marylyn D Ritchie Journal: BioData Min Date: 2013-12-30 Impact factor: 2.522