MLR-tagging: informative SNP selection for unphased genotypes based on multiple linear regression.

Jingwu He, Alexander Zelikovsky.

Abstract

The search for the association between complex diseases and single nucleotide polymorphisms (SNPs) or haplotypes has recently received great attention. For these studies, it is essential to use a small subset of informative SNPs that accurately represents the rest of the SNPs. Informative SNP selection can achieve (1) considerable budget savings by genotyping only a limited number of SNPs and computationally inferring all other SNPs, or (2) the necessary reduction of huge SNP sets (obtained, e.g., from Affymetrix) for further fine haplotype analysis. A novel informative SNP selection method for unphased genotype data based on multiple linear regression (MLR) is implemented in the software package MLR-tagging. This software can be used for informative SNP (tag) selection and genotype prediction. The stepwise tag selection algorithm (STSA) selects positions of the given number of informative SNPs based on a genotype sample population. The MLR SNP prediction algorithm predicts a complete genotype based on the values of its informative SNPs, their positions among all SNPs, and a sample of complete genotypes. An extensive experimental study on various datasets, including 10 regions from HapMap, shows that the MLR prediction combined with stepwise tag selection uses fewer tags than the state-of-the-art method of Halperin et al. (2005).

AVAILABILITY: MLR-Tagging software package is publicly available at http://alla.cs.gsu.edu/~software/tagging/tagging.html
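
The two steps named in the abstract, regression-based genotype prediction and stepwise tag selection, can be illustrated with a minimal Python sketch. This is not the MLR-tagging package's code. It assumes genotypes are encoded as 0/1/2, that predictions are rounded to the nearest valid genotype, and that tags are chosen greedily by counting mismatches on the sample; the abstract does not specify the encoding or the selection criterion. The function names `fit_mlr`, `mlr_predict`, and `stepwise_select` are hypothetical.

```python
import numpy as np

def fit_mlr(sample, tags):
    """Least-squares coefficients mapping tag SNPs (plus an intercept) to every SNP."""
    sample = np.asarray(sample, dtype=float)
    design = np.column_stack([sample[:, tags], np.ones(len(sample))])
    coef, *_ = np.linalg.lstsq(design, sample, rcond=None)
    return design, coef

def mlr_predict(sample, tags, tag_values):
    """Predict a complete genotype from its tag values (assumed 0/1/2 encoding)."""
    _, coef = fit_mlr(sample, tags)
    x = np.append(np.asarray(tag_values, dtype=float), 1.0)
    # Round to the nearest valid genotype value (an assumption of this sketch).
    pred = np.clip(np.rint(x @ coef), 0, 2).astype(int)
    pred[tags] = tag_values  # tag positions are observed, not predicted
    return pred

def sample_errors(sample, tags):
    """Count SNP values in the sample that the tags fail to reproduce."""
    design, coef = fit_mlr(sample, tags)
    pred = np.clip(np.rint(design @ coef), 0, 2)
    return int(np.sum(pred != np.asarray(sample, dtype=float)))

def stepwise_select(sample, k):
    """Greedy forward selection of k tag positions (illustrative criterion)."""
    n_snps = np.asarray(sample).shape[1]
    tags = []
    for _ in range(k):
        candidates = [j for j in range(n_snps) if j not in tags]
        # Add the SNP whose inclusion leaves the fewest mismatches on the sample.
        best = min(candidates, key=lambda j: sample_errors(sample, tags + [j]))
        tags.append(best)
    return sorted(tags)

# Toy sample: SNP 2 copies SNP 0, and SNP 3 equals 2 minus SNP 1.
toy = [[a, b, a, 2 - b] for b in range(3) for a in range(3)]
tags = stepwise_select(toy, 2)
full = mlr_predict(toy, tags, [2, 1])
```

In the toy sample, two of the four SNPs are linear functions of the other two, so two tags are enough to reconstruct every genotype in it. Real data would need held-out genotypes to estimate prediction accuracy.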

Year:  2006        PMID: 16895924     DOI: 10.1093/bioinformatics/btl420

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  An approach to incorporate linkage disequilibrium structure into genomic association analysis.

Authors:  Fengyu Zhang; Diane Wagener
Journal:  J Genet Genomics       Date:  2008-06       Impact factor: 4.275

2.  SITDEM: a simulation tool for disease/endpoint models of association studies based on single nucleotide polymorphism genotypes.

Authors:  Jung Hun Oh; Joseph O Deasy
Journal:  Comput Biol Med       Date:  2013-12-19       Impact factor: 4.589

3.  Networks inferred from biochemical data reveal profound differences in toll-like receptor and inflammatory signaling between normal and transformed hepatocytes.

Authors:  Leonidas G Alexopoulos; Julio Saez-Rodriguez; Benjamin D Cosgrove; Douglas A Lauffenburger; Peter K Sorger
Journal:  Mol Cell Proteomics       Date:  2010-05-10       Impact factor: 5.911

4.  Methods of tagSNP selection and other variables affecting imputation accuracy in swine.

Authors:  Yvonne M Badke; Ronald O Bates; Catherine W Ernst; Clint Schwab; Justin Fix; Curtis P Van Tassell; Juan P Steibel
Journal:  BMC Genet       Date:  2013-02-21       Impact factor: 2.797

5.  CGTS: a site-clustering graph based tagSNP selection algorithm in genotype data.

Authors:  Jun Wang; Mao-zu Guo; Chun-yu Wang
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

6.  Supervised learning-based tagSNP selection for genome-wide disease classifications.

Authors:  Qingzhong Liu; Jack Yang; Zhongxue Chen; Mary Qu Yang; Andrew H Sung; Xudong Huang
Journal:  BMC Genomics       Date:  2008       Impact factor: 3.969

7.  The utility of low-density genotyping for imputation in the Thoroughbred horse.

Authors:  Laura J Corbin; Andreas Kranis; Sarah C Blott; June E Swinburne; Mark Vaudin; Stephen C Bishop; John A Woolliams
Journal:  Genet Sel Evol       Date:  2014-02-04       Impact factor: 4.297
