Literature DB >> 33827420

Optimized permutation testing for information theoretic measures of multi-gene interactions.

James M Kunert-Graf1, Nikita A Sakhanenko2, David J Galas2.   

Abstract

BACKGROUND: Permutation testing is often considered the "gold standard" for multi-test significance analysis, as it is an exact test requiring few assumptions about the distribution being computed. However, it can be computationally very expensive, particularly in its naive form in which the full analysis pipeline is re-run after permuting the phenotype labels. This can become intractable in multi-locus genome-wide association studies (GWAS), in which the number of potential interactions to be tested is combinatorially large.
RESULTS: In this paper, we develop an approach for permutation testing in multi-locus GWAS, specifically focusing on SNP-SNP-phenotype interactions using multivariable measures that can be computed from frequency count tables, such as those based in Information Theory. We find that the computational bottleneck in this process is the construction of the count tables themselves, and that this step can be eliminated at each iteration of the permutation testing by transforming the count tables directly. This leads to a speed-up by a factor of over 103 for a typical permutation test compared to the naive approach. Additionally, this approach is insensitive to the number of samples making it suitable for datasets with large number of samples.
CONCLUSIONS: The proliferation of large-scale datasets with genotype data for hundreds of thousands of individuals enables new and more powerful approaches for the detection of multi-locus genotype-phenotype interactions. Our approach significantly improves the computational tractability of permutation testing for these studies. Moreover, our approach is insensitive to the large number of samples in these modern datasets. The code for performing these computations and replicating the figures in this paper is freely available at https://github.com/kunert/permute-counts .

Entities:  

Keywords:  Information theory; Multi-locus GWAS; Multivariable interactions; Permutation testing

Year:  2021        PMID: 33827420     DOI: 10.1186/s12859-021-04107-6

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  3 in total

1.  PERMORY: an LD-exploiting permutation test algorithm for powerful genome-wide association testing.

Authors:  Roman Pahl; Helmut Schäfer
Journal:  Bioinformatics       Date:  2010-07-06       Impact factor: 6.937

Review 2.  Prioritizing GWAS results: A review of statistical methods and recommendations for their application.

Authors:  Rita M Cantor; Kenneth Lange; Janet S Sinsheimer
Journal:  Am J Hum Genet       Date:  2010-01       Impact factor: 11.025

3.  Discovering pair-wise genetic interactions: an information theory-based approach.

Authors:  Tomasz M Ignac; Alexander Skupin; Nikita A Sakhanenko; David J Galas
Journal:  PLoS One       Date:  2014-03-26       Impact factor: 3.240

  3 in total
  1 in total

1.  Dissecting the contribution of single nucleotide polymorphisms in CCR9 and CCL25 genomic regions to the celiac disease phenotype.

Authors:  Laura Airaksinen; Juliana Xm Cerqueira; Heini Huhtala; Päivi Saavalainen; Dawit A Yohannes; Markku Mäki; Kalle Kurppa; Elina Kilpeläinen; Anastasia Shcherban; Aarno Palotie; Katri Kaukinen; Katri Lindfors
Journal:  J Transl Autoimmun       Date:  2021-10-14
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.