Literature DB >> 16447987

Choosing SNPs using feature selection.

Tu Minh Phuong1, Zhen Lin, Russ B Altman.   

Abstract

A major challenge for genomewide disease association studies is the high cost of genotyping large number of single nucleotide polymorphisms (SNP). The correlations between SNPs, however, make it possible to select a parsimonious set of informative SNPs, known as "tagging" SNPs, able to capture most variation in a population. Considerable research interest has recently focused on the development of methods for finding such SNPs. In this paper, we present an efficient method for finding tagging SNPs. The method does not involve computation-intensive search for SNP subsets but discards redundant SNPs using a feature selection algorithm. In contrast to most existing methods, the method presented here does not limit itself to using only correlations between SNPs in local groups. By using correlations that occur across different chromosomal regions, the method can reduce the number of globally redundant SNPs. Experimental results show that the number of tagging SNPs selected by our method is smaller than by using block-based methods.

Mesh:

Year:  2005        PMID: 16447987     DOI: 10.1109/csb.2005.22

Source DB:  PubMed          Journal:  Proc IEEE Comput Syst Bioinform Conf        ISSN: 1551-7497


  10 in total

1.  CT radiomics may predict the grade of pancreatic neuroendocrine tumors: a multicenter study.

Authors:  Dongsheng Gu; Yabin Hu; Hui Ding; Jingwei Wei; Ke Chen; Hao Liu; Mengsu Zeng; Jie Tian
Journal:  Eur Radiol       Date:  2019-06-21       Impact factor: 5.315

2.  Efficient genome-wide TagSNP selection across populations via the linkage disequilibrium criterion.

Authors:  Lan Liu; Yonghui Wu; Stefano Lonardi; Tao Jiang
Journal:  J Comput Biol       Date:  2010-01       Impact factor: 1.479

Review 3.  Risk estimation and risk prediction using machine-learning methods.

Authors:  Jochen Kruppa; Andreas Ziegler; Inke R König
Journal:  Hum Genet       Date:  2012-07-03       Impact factor: 4.132

4.  Application of a spatially-weighted Relief algorithm for ranking genetic predictors of disease.

Authors:  Matthew E Stokes; Shyam Visweswaran
Journal:  BioData Min       Date:  2012-12-03       Impact factor: 2.522

5.  Effective selection of informative SNPs and classification on the HapMap genotype data.

Authors:  Nina Zhou; Lipo Wang
Journal:  BMC Bioinformatics       Date:  2007-12-20       Impact factor: 3.169

6.  CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data.

Authors:  Jun Wang; Mao-zu Guo; Chun-yu Wang
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

7.  A modified T-test feature selection method and its application on the HapMap genotype data.

Authors:  Nina Zhou; Lipo Wang
Journal:  Genomics Proteomics Bioinformatics       Date:  2007-12       Impact factor: 7.691

8.  WERFE: A Gene Selection Algorithm Based on Recursive Feature Elimination and Ensemble Strategy.

Authors:  Qi Chen; Zhaopeng Meng; Ran Su
Journal:  Front Bioeng Biotechnol       Date:  2020-05-28

9.  A data driven methodology for social science research with left-behind children as a case study.

Authors:  Chao Wu; Guolong Wang; Simon Hu; Yue Liu; Hong Mi; Ye Zhou; Yi-Ke Guo; Tongtong Song
Journal:  PLoS One       Date:  2020-11-20       Impact factor: 3.240

Review 10.  Big data in IBD: big progress for clinical practice.

Authors:  Nasim Sadat Seyed Tabib; Matthew Madgwick; Padhmanand Sudhakar; Bram Verstockt; Tamas Korcsmaros; Séverine Vermeire
Journal:  Gut       Date:  2020-02-28       Impact factor: 23.059

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.