Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Tag SNP selection in genotype data for maximizing SNP prediction accuracy.

Literature DB >> 15961458

Tag SNP selection in genotype data for maximizing SNP prediction accuracy.

Abstract

MOTIVATION: The search for genetic regions associated with complex diseases, such as cancer or Alzheimer's disease, is an important challenge that may lead to better diagnosis and treatment. The existence of millions of DNA variations, primarily single nucleotide polymorphisms (SNPs), may allow the fine dissection of such associations. However, studies seeking disease association are limited by the cost of genotyping SNPs. Therefore, it is essential to find a small subset of informative SNPs (tag SNPs) that may be used as good representatives of the rest of the SNPs.
RESULTS: We define a new natural measure for evaluating the prediction accuracy of a set of tag SNPs, and use it to develop a new method for tag SNPs selection. Our method is based on a novel algorithm that predicts the values of the rest of the SNPs given the tag SNPs. In contrast to most previous methods, our prediction algorithm uses the genotype information and not the haplotype information of the tag SNPs. Our method is very efficient, and it does not rely on having a block partition of the genomic region. We compared our method with two state-of-the-art tag SNP selection algorithms on 58 different genotype datasets from four different sources. Our method consistently found tag SNPs with considerably better prediction ability than the other methods. AVAILABILITY: The software is available from the authors on request.

Entities: Disease

Mesh：

Substances：
DNA

Year: 2005 PMID： 15961458 DOI： 10.1093/bioinformatics/bti1021

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

30 in total

1. FastANOVA: an Efficient Algorithm for Genome-Wide Association Study.

Authors: Xiang Zhang; Fei Zou; Wei Wang
Journal: KDD Date: 2008

2. A fast method for computing high-significance disease association in large population-based studies.

Authors: Gad Kimmel; Ron Shamir
Journal: Am J Hum Genet Date: 2006-07-24 Impact factor: 11.025

3. Efficiently identifying significant associations in genome-wide association studies.

Authors: Emrah Kostem; Eleazar Eskin
Journal: J Comput Biol Date: 2013-09-14 Impact factor: 1.479

4. Efficient genome-wide TagSNP selection across populations via the linkage disequilibrium criterion.

Authors: Lan Liu; Yonghui Wu; Stefano Lonardi; Tao Jiang
Journal: J Comput Biol Date: 2010-01 Impact factor: 1.479

5. Increasing power of genome-wide association studies by collecting additional single-nucleotide polymorphisms.

Authors: Emrah Kostem; Jose A Lozano; Eleazar Eskin
Journal: Genetics Date: 2011-04-05 Impact factor: 4.562

6. Autophagy-related IRGM genes confer susceptibility to ankylosing spondylitis in a Chinese female population: a case-control study.

Authors: Q Xia; M Wang; X Yang; X Li; X Zhang; S Xu; Z Shuai; J Xu; D Fan; C Ding; F Pan
Journal: Genes Immun Date: 2016-12-29 Impact factor: 2.676

7. FastChi: an efficient algorithm for analyzing gene-gene interactions.

Authors: Xiang Zhang; Fei Zou; Wei Wang
Journal: Pac Symp Biocomput Date: 2009

8. Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping.

Authors: Sivan Bercovici; Christopher Meek; Ydo Wexler; Dan Geiger
Journal: Bioinformatics Date: 2010-06-15 Impact factor: 6.937

9. Efficient association study design via power-optimized tag SNP selection.

Authors: B Han; H M Kang; M S Seo; N Zaitlen; E Eskin
Journal: Ann Hum Genet Date: 2008-08-13 Impact factor: 1.670

10. A statistical method for predicting classical HLA alleles from SNP data.

Authors: Stephen Leslie; Peter Donnelly; Gil McVean
Journal: Am J Hum Genet Date: 2008-01 Impact factor: 11.025