SPADIS: An Algorithm for Selecting Predictive and Diverse SNPs in GWAS.

Serhan Yilmaz, Oznur Tastan, A Ercument Cicek.   

Abstract

Phenotypic heritability of complex traits and diseases is seldom explained by individual genetic variants identified in genome-wide association studies (GWAS). Many methods have been developed to select a subset of variant loci that are associated with or predictive of the phenotype. Selecting connected SNPs on SNP-SNP networks has proven successful in finding biologically interpretable and predictive SNPs. However, we argue that the connectedness constraint favors selecting redundant features that affect similar biological processes and therefore does not necessarily yield better predictive performance. In this paper, we propose a novel method called SPADIS that favors the selection of remotely located SNPs in order to account for their complementary effects in explaining a phenotype. SPADIS selects a diverse set of loci on a SNP-SNP network. This is achieved by maximizing a submodular set function with a greedy algorithm that ensures a constant-factor approximation to the optimal solution. We compare SPADIS to the state-of-the-art method SConES on a dataset of Arabidopsis thaliana with continuous flowering-time phenotypes. When the same number of SNPs is selected, SPADIS has better average phenotype prediction performance on 15 of 17 phenotypes, and on average it provides consistent improvements across multiple networks and settings. Moreover, it identifies more candidate genes and runs faster.
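
The greedy scheme mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the SPADIS objective, so the set function below is a hypothetical stand-in: a weighted coverage function on a toy path network, in which each selected SNP "covers" itself and its direct neighbors. Weighted coverage is monotone and submodular, so greedy selection under a size limit is known to achieve at least a (1 - 1/e) fraction of the optimum, which is one example of a constant-factor guarantee. The names `greedy_maximize`, `coverage_value`, `scores`, and `neighbors` are assumptions made for illustration and do not come from the paper.

```python
from typing import Callable, FrozenSet, Iterable, List


def greedy_maximize(
    candidates: Iterable[int],
    set_fn: Callable[[FrozenSet[int]], float],
    k: int,
) -> List[int]:
    """Pick up to k items, each time adding the one with the largest marginal gain."""
    selected: List[int] = []
    remaining = set(candidates)
    current = set_fn(frozenset())
    for _ in range(min(k, len(remaining))):
        best, best_gain = None, 0.0
        # Sorted iteration makes tie-breaking deterministic (lowest index wins).
        for c in sorted(remaining):
            gain = set_fn(frozenset(selected) | {c}) - current
            if gain > best_gain:
                best, best_gain = c, gain
        if best is None:  # no candidate improves the objective
            break
        selected.append(best)
        remaining.remove(best)
        current += best_gain
    return selected


# Hypothetical toy data: six SNPs on a path network 0-1-2-3-4-5,
# each with an assumed association score.
scores = [2.0, 5.0, 3.0, 1.0, 3.0, 2.0]
neighbors = {i: {j for j in (i - 1, i + 1) if 0 <= j < len(scores)}
             for i in range(len(scores))}


def coverage_value(subset: FrozenSet[int]) -> float:
    """Stand-in objective (NOT the SPADIS function): total score of covered SNPs."""
    covered = set()
    for snp in subset:
        covered |= {snp} | neighbors[snp]
    return sum(scores[i] for i in covered)


selected = greedy_maximize(range(len(scores)), coverage_value, k=2)
print(selected, coverage_value(frozenset(selected)))
```

In this toy setting the second pick lands away from the first, because a neighbor of an already selected SNP adds little new coverage. That mirrors, in a simplified way, the preference for remotely located SNPs described above.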

Year:  2021        PMID: 31443041     DOI: 10.1109/TCBB.2019.2935437

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  STS-BN: An efficient Bayesian network method for detecting causal SNPs.

Authors:  Yanran Ma; Botao Fa; Xin Yuan; Yue Zhang; Zhangsheng Yu
Journal:  Front Genet       Date:  2022-09-15       Impact factor: 4.772

2.  GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.

Authors:  Seongmun Jeong; Jae-Yoon Kim; Namshin Kim
Journal:  Sci Rep       Date:  2020-11-12       Impact factor: 4.379

3.  Enhancing statistical power in temporal biomarker discovery through representative shapelet mining.

Authors:  Thomas Gumbsch; Christian Bock; Michael Moor; Bastian Rieck; Karsten Borgwardt
Journal:  Bioinformatics       Date:  2020-12-30       Impact factor: 6.937
