Efficient genome-wide TagSNP selection across populations via the linkage disequilibrium criterion.

Lan Liu, Yonghui Wu, Stefano Lonardi, Tao Jiang.

Abstract

In this article, we studied the tag single-nucleotide polymorphism (tagSNP) selection problem on multiple populations using the pairwise r² linkage disequilibrium criterion. We proposed a novel combinatorial optimization model for the tagSNP selection problem, called the minimum common tagSNP selection (MCTS) problem, and presented efficient solutions for MCTS. Our approach consists of three main steps: (i) partitioning the SNP markers into small disjoint components, (ii) applying data reduction rules to simplify the problem, and (iii) applying either a fast greedy algorithm or a Lagrangian relaxation algorithm to solve the remaining (general) MCTS instance. These algorithms also provide lower bounds on tagging (i.e., on the minimum number of tagSNPs needed), which allow us to evaluate how far our solution is from the optimum. To the best of our knowledge, this is the first time that tagging lower bounds have been discussed in the literature. We assessed the performance of our algorithms on real HapMap data for genome-wide tagging. The experiments demonstrated that our algorithms run 3-4 orders of magnitude faster than existing single-population tagging programs such as FESTA and LD-Select, and than the multiple-population tagging method MultiPop-TagSelect. Our method also greatly reduced the number of tagSNPs required compared with LD-Select on a single population and MultiPop-TagSelect on multiple populations. Moreover, the numbers of tagSNPs selected by our algorithms are almost optimal: they are very close to the corresponding lower bounds obtained by our method.
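The greedy step for selecting common tagSNPs under a pairwise r² criterion can be illustrated with a small sketch. This is not the authors' implementation: it omits the partitioning, data reduction, and Lagrangian relaxation steps, and it treats the problem as a plain greedy set cover over (population, SNP) pairs. The function names, the toy haplotype data, and the threshold of 0.8 are assumptions made for illustration; the abstract does not state a threshold. The r² formula itself is the standard one, r² = D² / (p_A(1 - p_A) p_B(1 - p_B)) with D = p_AB - p_A p_B.

```python
from itertools import combinations

def r_squared(hap_a, hap_b):
    """Standard pairwise r^2 between two biallelic SNPs.

    hap_a, hap_b: lists of 0/1 alleles, one entry per phased haplotype.
    """
    n = len(hap_a)
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(1 for x, y in zip(hap_a, hap_b) if x == 1 and y == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0  # assumption: a monomorphic SNP is treated as having no LD
    d = p_ab - p_a * p_b
    return d * d / denom

def coverage(populations, threshold):
    """Map each SNP to the set of (population, SNP) pairs it would tag."""
    covers = {}
    for pop, haps in populations.items():
        for s in haps:
            covers.setdefault(s, set()).add((pop, s))  # a SNP tags itself
        for s, t in combinations(haps, 2):
            if r_squared(haps[s], haps[t]) >= threshold:
                covers[s].add((pop, t))
                covers[t].add((pop, s))
    return covers

def greedy_common_tags(populations, threshold=0.8):
    """Greedy set cover: repeatedly pick the SNP tagging the most
    still-untagged (population, SNP) pairs. Ties go to the smallest name."""
    covers = coverage(populations, threshold)
    uncovered = set().union(*covers.values())
    tags = []
    while uncovered:
        best = max(sorted(covers), key=lambda s: len(covers[s] & uncovered))
        tags.append(best)
        uncovered -= covers[best]
    return tags

# Hypothetical toy data: two populations, four SNPs, four haplotypes each.
populations = {
    "A": {"s1": [0, 0, 1, 1], "s2": [0, 0, 1, 1],
          "s3": [0, 1, 0, 1], "s4": [0, 1, 0, 1]},
    "B": {"s1": [0, 0, 1, 1], "s2": [0, 0, 1, 1],
          "s3": [0, 0, 1, 1], "s4": [0, 1, 0, 1]},
}
tags = greedy_common_tags(populations)
print(tags)  # ['s1', 's4']
```

In this toy case two tagSNPs cover every SNP in both populations. A greedy cover is not guaranteed to be minimal in general, which is why a lower bound of the kind the abstract describes is useful for judging how close a solution is to the optimum.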

Year: 2010; PMID: 20078395; PMCID: PMC3163390; DOI: 10.1089/cmb.2007.0228

Source DB: PubMed; Journal: J Comput Biol; ISSN: 1066-5277; Impact factor: 1.479
