Literature DB >> 16819782

Choosing SNPs using feature selection.

Tu Minh Phuong1, Zhen Lin, Russ B Altman.   

Abstract

A major challenge for genomewide disease association studies is the high cost of genotyping large number of single nucleotide polymorphisms (SNPs). The correlations between SNPs, however, make it possible to select a parsimonious set of informative SNPs, known as "tagging" SNPs, able to capture most variation in a population. Considerable research interest has recently focused on the development of methods for finding such SNPs. In this paper, we present an efficient method for finding tagging SNPs. The method does not involve computation-intensive search for SNP subsets but discards redundant SNPs using a feature selection algorithm. In contrast to most existing methods, the method presented here does not limit itself to using only correlations between SNPs in local groups. By using correlations that occur across different chromosomal regions, the method can reduce the number of globally redundant SNPs. Experimental results show that the number of tagging SNPs selected by our method is smaller than by using block-based methods. Supplementary website: http://htsnp.stanford.edu/FSFS/.

Mesh:

Year:  2006        PMID: 16819782     DOI: 10.1142/s0219720006001941

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  3 in total

Review 1.  A systematic review of emerging feature selection optimization methods for optimal text classification: the present state and prospective opportunities.

Authors:  Esther Omolara Abiodun; Abdulatif Alabdulatif; Oludare Isaac Abiodun; Moatsum Alawida; Abdullah Alabdulatif; Rami S Alkhawaldeh
Journal:  Neural Comput Appl       Date:  2021-08-13       Impact factor: 5.606

2.  Discovering Genome-Wide Tag SNPs Based on the Mutual Information of the Variants.

Authors:  Abdulkadir Elmas; Tai-Hsien Ou Yang; Xiaodong Wang; Dimitris Anastassiou
Journal:  PLoS One       Date:  2016-12-16       Impact factor: 3.240

3.  A random forest approach to the detection of epistatic interactions in case-control studies.

Authors:  Rui Jiang; Wanwan Tang; Xuebing Wu; Wenhui Fu
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.