Literature DB >> 24581733

Fast detection of high-order epistatic interactions in genome-wide association studies using information theoretic measure.

Sangseob Leem1, Hyun-hwan Jeong2, Jungseob Lee3, Kyubum Wee4, Kyung-Ah Sohn5.   

Abstract

There are many algorithms for detecting epistatic interactions in GWAS. However, most of these algorithms are applicable only for detecting two-locus interactions. Some algorithms are designed to detect only two-locus interactions from the beginning. Others do not have limits to the order of interactions, but in practice take very long time to detect higher order interactions in real data of GWAS. Even the better ones take days to detect higher order interactions in WTCCC data. We propose a fast algorithm for detection of high order epistatic interactions in GWAS. It runs k-means clustering algorithm on the set of all SNPs. Then candidates are selected from each cluster. These candidates are examined to find the causative SNPs of k-locus interactions. We use mutual information from information theory as the measure of association between genotypes and phenotypes. We tested the power and speed of our method on extensive sets of simulated data. The results show that our method has more or equal power, and runs much faster than previously reported methods. We also applied our algorithm on each of seven diseases in WTCCC data to analyze up to 5-locus interactions. It takes only a few hours to analyze 5-locus interactions in one dataset. From the results we make some interesting and meaningful observations on each disease in WTCCC data. In this study, a simple yet powerful two-step approach is proposed for fast detection of high order epistatic interaction. Our algorithm makes it possible to detect high order epistatic interactions in GWAS in a matter of hours on a PC.
Copyright © 2014 Elsevier Ltd. All rights reserved.

Keywords:  Genome-wide association studies; High-order epistatic interactions; K-means clustering; Mutual information; WTCCC

Mesh:

Year:  2014        PMID: 24581733     DOI: 10.1016/j.compbiolchem.2014.01.005

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  12 in total

1.  Investigating the utility of clinical outcome-guided mutual information network in network-based Cox regression.

Authors:  Hyun-hwan Jeong; So Kim; Kyubum Wee; Kyung-Ah Sohn
Journal:  BMC Syst Biol       Date:  2015-01-21

2.  Relevance Epistasis Network of Gastritis for Intra-chromosomes in the Korea Associated Resource (KARE) Cohort Study.

Authors:  Hyun-Hwan Jeong; Kyung-Ah Sohn
Journal:  Genomics Inform       Date:  2014-12-31

3.  Integrative network analysis for survival-associated gene-gene interactions across multiple genomic profiles in ovarian cancer.

Authors:  Hyun-Hwan Jeong; Sangseob Leem; Kyubum Wee; Kyung-Ah Sohn
Journal:  J Ovarian Res       Date:  2015-07-03       Impact factor: 4.234

4.  Niche harmony search algorithm for detecting complex disease associated high-order SNP combinations.

Authors:  Shouheng Tuo; Junying Zhang; Xiguo Yuan; Zongzhen He; Yajun Liu; Zhaowen Liu
Journal:  Sci Rep       Date:  2017-09-14       Impact factor: 4.379

5.  Integrative information theoretic network analysis for genome-wide association study of aspirin exacerbated respiratory disease in Korean population.

Authors:  Sehee Wang; Hyun-Hwan Jeong; Dokyoon Kim; Kyubum Wee; Hae-Sim Park; Seung-Hyun Kim; Kyung-Ah Sohn
Journal:  BMC Med Genomics       Date:  2017-05-24       Impact factor: 3.063

6.  An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions.

Authors:  Sangseob Leem; Taesung Park
Journal:  BMC Genomics       Date:  2017-03-14       Impact factor: 3.969

7.  ClearF: a supervised feature scoring method to find biomarkers using class-wise embedding and reconstruction.

Authors:  Sehee Wang; Hyun-Hwan Jeong; Kyung-Ah Sohn
Journal:  BMC Med Genomics       Date:  2019-07-11       Impact factor: 3.063

Review 8.  Reaching the End-Game for GWAS: Machine Learning Approaches for the Prioritization of Complex Disease Loci.

Authors:  Hannah L Nicholls; Christopher R John; David S Watson; Patricia B Munroe; Michael R Barnes; Claudia P Cabrera
Journal:  Front Genet       Date:  2020-04-15       Impact factor: 4.599

9.  BowSaw: Inferring Higher-Order Trait Interactions Associated With Complex Biological Phenotypes.

Authors:  Demetrius DiMucci; Mark Kon; Daniel Segrè
Journal:  Front Mol Biosci       Date:  2021-06-17

10.  The Influence of Higher-Order Epistasis on Biological Fitness Landscape Topography.

Authors:  Daniel M Weinreich; Yinghong Lan; Jacob Jaffe; Robert B Heckendorn
Journal:  J Stat Phys       Date:  2018-02-07       Impact factor: 1.548

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.