Literature DB >> 16278949

Effective algorithms for tag SNP selection.

Tie-Fei Liu1, Wing-Kin Sung, Yi Li, Jian-Jun Liu, Ankush Mittal, Pei-Lin Mao.   

Abstract

Single nucleotide polymorphisms (SNPs), due to their abundance and low mutation rate, are very useful genetic markers for genetic association studies. However, the current genotyping technology cannot afford to genotype all common SNPs in all the genes. By making use of linkage disequilibrium, we can reduce the experiment cost by genotyping a subset of SNPs, called Tag SNPs, which have a strong association with the ungenotyped SNPs, while are as independent from each other as possible. The problem of selecting Tag SNPs is NP-complete; when there are large number of SNPs, in order to avoid extremely long computational time, most of the existing Tag SNP selection methods first partition the SNPs into blocks based on certain block definitions, then Tag SNPs are selected in each block by brute-force search. The size of the Tag SNP set obtained in this way may usually be reduced further due to the inter-dependency among blocks. This paper proposes two algorithms, TSSA and TSSD, to tackle the block-independent Tag SNP selection problem. TSSA is based on A* search algorithm, and TSSD is a heuristic algorithm. Experiments show that TSSA can find the optimal solutions for medium-sized problems in reasonable time, while TSSD can handle very large problems and report approximate solutions very close to the optimal ones.

Mesh:

Substances:

Year:  2005        PMID: 16278949     DOI: 10.1142/s0219720005001521

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  2 in total

1.  CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data.

Authors:  Jun Wang; Mao-zu Guo; Chun-yu Wang
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

2.  A modified T-test feature selection method and its application on the HapMap genotype data.

Authors:  Nina Zhou; Lipo Wang
Journal:  Genomics Proteomics Bioinformatics       Date:  2007-12       Impact factor: 7.691

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.