Literature DB >> 15961458

Tag SNP selection in genotype data for maximizing SNP prediction accuracy.

Eran Halperin1, Gad Kimmel, Ron Shamir.   

Abstract

MOTIVATION: The search for genetic regions associated with complex diseases, such as cancer or Alzheimer's disease, is an important challenge that may lead to better diagnosis and treatment. The existence of millions of DNA variations, primarily single nucleotide polymorphisms (SNPs), may allow the fine dissection of such associations. However, studies seeking disease association are limited by the cost of genotyping SNPs. Therefore, it is essential to find a small subset of informative SNPs (tag SNPs) that may be used as good representatives of the rest of the SNPs.
RESULTS: We define a new natural measure for evaluating the prediction accuracy of a set of tag SNPs, and use it to develop a new method for tag SNPs selection. Our method is based on a novel algorithm that predicts the values of the rest of the SNPs given the tag SNPs. In contrast to most previous methods, our prediction algorithm uses the genotype information and not the haplotype information of the tag SNPs. Our method is very efficient, and it does not rely on having a block partition of the genomic region. We compared our method with two state-of-the-art tag SNP selection algorithms on 58 different genotype datasets from four different sources. Our method consistently found tag SNPs with considerably better prediction ability than the other methods. AVAILABILITY: The software is available from the authors on request.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15961458     DOI: 10.1093/bioinformatics/bti1021

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  FastANOVA: an Efficient Algorithm for Genome-Wide Association Study.

Authors:  Xiang Zhang; Fei Zou; Wei Wang
Journal:  KDD       Date:  2008

2.  A fast method for computing high-significance disease association in large population-based studies.

Authors:  Gad Kimmel; Ron Shamir
Journal:  Am J Hum Genet       Date:  2006-07-24       Impact factor: 11.025

3.  Efficiently identifying significant associations in genome-wide association studies.

Authors:  Emrah Kostem; Eleazar Eskin
Journal:  J Comput Biol       Date:  2013-09-14       Impact factor: 1.479

4.  Efficient genome-wide TagSNP selection across populations via the linkage disequilibrium criterion.

Authors:  Lan Liu; Yonghui Wu; Stefano Lonardi; Tao Jiang
Journal:  J Comput Biol       Date:  2010-01       Impact factor: 1.479

5.  Increasing power of genome-wide association studies by collecting additional single-nucleotide polymorphisms.

Authors:  Emrah Kostem; Jose A Lozano; Eleazar Eskin
Journal:  Genetics       Date:  2011-04-05       Impact factor: 4.562

6.  Autophagy-related IRGM genes confer susceptibility to ankylosing spondylitis in a Chinese female population: a case-control study.

Authors:  Q Xia; M Wang; X Yang; X Li; X Zhang; S Xu; Z Shuai; J Xu; D Fan; C Ding; F Pan
Journal:  Genes Immun       Date:  2016-12-29       Impact factor: 2.676

7.  FastChi: an efficient algorithm for analyzing gene-gene interactions.

Authors:  Xiang Zhang; Fei Zou; Wei Wang
Journal:  Pac Symp Biocomput       Date:  2009

8.  Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping.

Authors:  Sivan Bercovici; Christopher Meek; Ydo Wexler; Dan Geiger
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

9.  Efficient association study design via power-optimized tag SNP selection.

Authors:  B Han; H M Kang; M S Seo; N Zaitlen; E Eskin
Journal:  Ann Hum Genet       Date:  2008-08-13       Impact factor: 1.670

10.  A statistical method for predicting classical HLA alleles from SNP data.

Authors:  Stephen Leslie; Peter Donnelly; Gil McVean
Journal:  Am J Hum Genet       Date:  2008-01       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.