Literature DB >> 24717145

Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.

Xuan Guo, Yu Meng, Ning Yu, Yi Pan1.   

Abstract

BACKGROUND: Taking the advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions consisting of multiple-locus, which are broadly existing in complex traits. In addition, statistic tests for high order epistatic interactions with more than 2 SNPs propose computational and analytical challenges because the computation increases exponentially as the cardinality of SNPs combinations gets larger.
RESULTS: In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare powers performance against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method on two real GWAS datasets, Age-related macular degeneration (AMD) and Rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors which are not shown up in detections of 2-loci epistatic interactions.
CONCLUSIONS: Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running time of the cloud implementation for our method on AMD dataset and RA dataset are roughly 2 hours and 50 hours on a cluster with forty small virtual machines for detecting two-locus interactions, respectively. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS.

Entities:  

Mesh:

Year:  2014        PMID: 24717145      PMCID: PMC4021249          DOI: 10.1186/1471-2105-15-102

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  29 in total

1.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

Authors:  M R Nelson; S L Kardia; R E Ferrell; C F Sing
Journal:  Genome Res       Date:  2001-03       Impact factor: 9.043

Review 2.  Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans.

Authors:  Heather J Cordell
Journal:  Hum Mol Genet       Date:  2002-10-01       Impact factor: 6.150

3.  Complement factor H polymorphism in age-related macular degeneration.

Authors:  Robert J Klein; Caroline Zeiss; Emily Y Chew; Jen-Yue Tsai; Richard S Sackler; Chad Haynes; Alice K Henning; John Paul SanGiovanni; Shrikant M Mane; Susan T Mayne; Michael B Bracken; Frederick L Ferris; Jurg Ott; Colin Barnstable; Josephine Hoh
Journal:  Science       Date:  2005-03-10       Impact factor: 47.728

4.  Genome-wide strategies for detecting multiple loci that influence complex diseases.

Authors:  Jonathan Marchini; Peter Donnelly; Lon R Cardon
Journal:  Nat Genet       Date:  2005-03-27       Impact factor: 38.330

5.  A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.

Authors:  Digna R Velez; Bill C White; Alison A Motsinger; William S Bush; Marylyn D Ritchie; Scott M Williams; Jason H Moore
Journal:  Genet Epidemiol       Date:  2007-05       Impact factor: 2.135

6.  Bayesian inference of epistatic interactions in case-control studies.

Authors:  Yu Zhang; Jun S Liu
Journal:  Nat Genet       Date:  2007-08-26       Impact factor: 38.330

7.  Genetic risk prediction--are we there yet?

Authors:  Peter Kraft; David J Hunter
Journal:  N Engl J Med       Date:  2009-04-15       Impact factor: 91.245

8.  A novel strategy for detecting multiple loci in Genome-Wide Association Studies of complex diseases.

Authors:  Jing Li
Journal:  Int J Bioinform Res Appl       Date:  2008

9.  Whole genome identity-by-descent determination.

Authors:  Hadi Sabaa; Zhipeng Cai; Yining Wang; Randy Goebel; Stephen Moore; Guohui Lin
Journal:  J Bioinform Comput Biol       Date:  2013-01-16       Impact factor: 1.122

10.  A random forest approach to the detection of epistatic interactions in case-control studies.

Authors:  Rui Jiang; Wanwan Tang; Xuebing Wu; Wenhui Fu
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more
  15 in total

1.  FDHE-IW: A Fast Approach for Detecting High-Order Epistasis in Genome-Wide Case-Control Studies.

Authors:  Shouheng Tuo
Journal:  Genes (Basel)       Date:  2018-08-29       Impact factor: 4.096

2.  GWAS of longevity in CHARGE consortium confirms APOE and FOXO3 candidacy.

Authors:  Linda Broer; Aron S Buchman; Joris Deelen; Daniel S Evans; Jessica D Faul; Kathryn L Lunetta; Paola Sebastiani; Jennifer A Smith; Albert V Smith; Toshiko Tanaka; Lei Yu; Alice M Arnold; Thor Aspelund; Emelia J Benjamin; Philip L De Jager; Gudny Eirkisdottir; Denis A Evans; Melissa E Garcia; Albert Hofman; Robert C Kaplan; Sharon L R Kardia; Douglas P Kiel; Ben A Oostra; Eric S Orwoll; Neeta Parimi; Bruce M Psaty; Fernando Rivadeneira; Jerome I Rotter; Sudha Seshadri; Andrew Singleton; Henning Tiemeier; André G Uitterlinden; Wei Zhao; Stefania Bandinelli; David A Bennett; Luigi Ferrucci; Vilmundur Gudnason; Tamara B Harris; David Karasik; Lenore J Launer; Thomas T Perls; P Eline Slagboom; Gregory J Tranah; David R Weir; Anne B Newman; Cornelia M van Duijn; Joanne M Murabito
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2014-09-08       Impact factor: 6.053

3.  FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.

Authors:  Shouheng Tuo; Junying Zhang; Xiguo Yuan; Yuanyuan Zhang; Zhaowen Liu
Journal:  PLoS One       Date:  2016-03-25       Impact factor: 3.240

4.  VariantSpark: population scale clustering of genotype information.

Authors:  Aidan R O'Brien; Neil F W Saunders; Yi Guo; Fabian A Buske; Rodney J Scott; Denis C Bauer
Journal:  BMC Genomics       Date:  2015-12-10       Impact factor: 3.969

5.  Combinations of genetic variants associated with bipolar disorder.

Authors:  Erling Mellerup; Ole A Andreassen; Bente Bennike; Henrik Dam; Srdjan Djurovic; Martin Balslev Jorgensen; Lars Vedel Kessing; Pernille Koefoed; Ingrid Melle; Ole Mors; Gert Lykke Moeller
Journal:  PLoS One       Date:  2017-12-21       Impact factor: 3.240

6.  Niche harmony search algorithm for detecting complex disease associated high-order SNP combinations.

Authors:  Shouheng Tuo; Junying Zhang; Xiguo Yuan; Zongzhen He; Yajun Liu; Zhaowen Liu
Journal:  Sci Rep       Date:  2017-09-14       Impact factor: 4.379

Review 7.  Combinations of Genetic Variants Occurring Exclusively in Patients.

Authors:  Erling Mellerup; Gert Lykke Møller
Journal:  Comput Struct Biotechnol J       Date:  2017-03-10       Impact factor: 7.271

8.  Integrative information theoretic network analysis for genome-wide association study of aspirin exacerbated respiratory disease in Korean population.

Authors:  Sehee Wang; Hyun-Hwan Jeong; Dokyoon Kim; Kyubum Wee; Hae-Sim Park; Seung-Hyun Kim; Kyung-Ah Sohn
Journal:  BMC Med Genomics       Date:  2017-05-24       Impact factor: 3.063

9.  An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions.

Authors:  Junliang Shang; Yan Sun; Shengjun Li; Jin-Xing Liu; Chun-Hou Zheng; Junying Zhang
Journal:  Biomed Res Int       Date:  2015-07-05       Impact factor: 3.411

10.  HiSeeker: Detecting High-Order SNP Interactions Based on Pairwise SNP Combinations.

Authors:  Jie Liu; Guoxian Yu; Yuan Jiang; Jun Wang
Journal:  Genes (Basel)       Date:  2017-05-31       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.