Literature DB >> 17646323

Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows.

Adam Roberts1, Leonard McMillan, Wei Wang, Joel Parker, Ivan Rusyn, David Threadgill.   

Abstract

MOTIVATION: Typical high-throughput genotyping techniques produce numerous missing calls that confound subsequent analyses, such as disease association studies. Common remedies for this problem include removing affected markers and/or samples or, otherwise, imputing the missing data. On small marker sets imputation is frequently based on a vote of the K-nearest-neighbor (KNN) haplotypes, but this technique is neither practical nor justifiable for large datasets.
RESULTS: We describe a data structure that supports efficient KNN queries over arbitrarily sized, sliding haplotype windows, and evaluate its use for genotype imputation. The performance of our method enables exhaustive exploration over all window sizes and known sites in large (150K, 8.3M) SNP panels. We also compare the accuracy and performance of our methods with competing imputation approaches. AVAILABILITY: A free open source software package, NPUTE, is available at http://compgen.unc.edu/software, for non-commercial uses.

Mesh:

Year:  2007        PMID: 17646323     DOI: 10.1093/bioinformatics/btm220

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  39 in total

1.  Rapid and robust resampling-based multiple-testing correction with application in a genome-wide expression quantitative trait loci study.

Authors:  Xiang Zhang; Shunping Huang; Wei Sun; Wei Wang
Journal:  Genetics       Date:  2012-01-31       Impact factor: 4.562

2.  Genome-wide genetic changes during modern breeding of maize.

Authors:  Yinping Jiao; Hainan Zhao; Longhui Ren; Weibin Song; Biao Zeng; Jinjie Guo; Baobao Wang; Zhipeng Liu; Jing Chen; Wei Li; Mei Zhang; Shaojun Xie; Jinsheng Lai
Journal:  Nat Genet       Date:  2012-06-03       Impact factor: 38.330

3.  Genome-wide association studies of 14 agronomic traits in rice landraces.

Authors:  Xuehui Huang; Xinghua Wei; Tao Sang; Qiang Zhao; Qi Feng; Yan Zhao; Canyang Li; Chuanrang Zhu; Tingting Lu; Zhiwu Zhang; Meng Li; Danlin Fan; Yunli Guo; Ahong Wang; Lu Wang; Liuwei Deng; Wenjun Li; Yiqi Lu; Qijun Weng; Kunyan Liu; Tao Huang; Taoying Zhou; Yufeng Jing; Wei Li; Zhang Lin; Edward S Buckler; Qian Qian; Qi-Fa Zhang; Jiayang Li; Bin Han
Journal:  Nat Genet       Date:  2010-10-24       Impact factor: 38.330

4.  FastANOVA: an Efficient Algorithm for Genome-Wide Association Study.

Authors:  Xiang Zhang; Fei Zou; Wei Wang
Journal:  KDD       Date:  2008

5.  An imputed genotype resource for the laboratory mouse.

Authors:  Jin P Szatkiewicz; Glen L Beane; Yueming Ding; Lucie Hutchins; Fernando Pardo-Manuel de Villena; Gary A Churchill
Journal:  Mamm Genome       Date:  2008-02-27       Impact factor: 2.957

6.  Using population mixtures to optimize the utility of genomic databases: linkage disequilibrium and association study design in India.

Authors:  T J Pemberton; M Jakobsson; D F Conrad; G Coop; J D Wall; J K Pritchard; P I Patel; N A Rosenberg
Journal:  Ann Hum Genet       Date:  2007-05-30       Impact factor: 1.670

7.  Prediction of hybrid performance in maize using molecular markers and joint analyses of hybrids and parental inbreds.

Authors:  Tobias A Schrag; Jens Möhring; Albrecht E Melchinger; Barbara Kusterer; Baldev S Dhillon; Hans-Peter Piepho; Matthias Frisch
Journal:  Theor Appl Genet       Date:  2009-11-15       Impact factor: 5.699

8.  Improved risk prediction for Crohn's disease with a multi-locus approach.

Authors:  Jia Kang; Subra Kugathasan; Michel Georges; Hongyu Zhao; Judy H Cho
Journal:  Hum Mol Genet       Date:  2011-03-22       Impact factor: 6.150

9.  Replication and narrowing of gene expression quantitative trait loci using inbred mice.

Authors:  Daniel M Gatti; Alison H Harrill; Fred A Wright; David W Threadgill; Ivan Rusyn
Journal:  Mamm Genome       Date:  2009-07-17       Impact factor: 2.957

10.  FastChi: an efficient algorithm for analyzing gene-gene interactions.

Authors:  Xiang Zhang; Fei Zou; Wei Wang
Journal:  Pac Symp Biocomput       Date:  2009
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.