Literature DB >> 15389393

Finding haplotype tagging SNPs by use of principal components analysis.

Zhen Lin1, Russ B Altman.   

Abstract

The immense volume and rapid growth of human genomic data, especially single nucleotide polymorphisms (SNPs), present special challenges for both biomedical researchers and automatic algorithms. One such challenge is to select an optimal subset of SNPs, commonly referred as "haplotype tagging SNPs" (htSNPs), to capture most of the haplotype diversity of each haplotype block or gene-specific region. This information-reduction process facilitates cost-effective genotyping and, subsequently, genotype-phenotype association studies. It also has implications for assessing the risk of identifying research subjects on the basis of SNP information deposited in public domain databases. We have investigated methods for selecting htSNPs by use of principal components analysis (PCA). These methods first identify eigenSNPs and then map them to actual SNPs. We evaluated two mapping strategies, greedy discard and varimax rotation, by assessing the ability of the selected htSNPs to reconstruct genotypes of non-htSNPs. We also compared these methods with two other htSNP finders, one of which is PCA based. We applied these methods to three experimental data sets and found that the PCA-based methods tend to select the smallest set of htSNPs to achieve a 90% reconstruction precision.

Entities:  

Mesh:

Year:  2004        PMID: 15389393      PMCID: PMC1182114          DOI: 10.1086/425587

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  32 in total

1.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

2.  Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

Authors:  N Patil; A J Berno; D A Hinds; W A Barrett; J M Doshi; C R Hacker; C R Kautzer; D H Lee; C Marjoribanks; D P McDonough; B T Nguyen; M C Norris; J B Sheehan; N Shen; D Stern; R P Stokowski; D J Thomas; M O Trulson; K R Vyas; K A Frazer; S P Fodor; D R Cox
Journal:  Science       Date:  2001-11-23       Impact factor: 47.728

3.  Haplotype variation and linkage disequilibrium in 313 human genes.

Authors:  J C Stephens; J A Schneider; D A Tanguay; J Choi; T Acharya; S E Stanley; R Jiang; C J Messer; A Chew; J H Han; J Duan; J L Carr; M S Lee; B Koshy; A M Kumar; G Zhang; W R Newell; A Windemuth; C Xu; T S Kalbfleisch; S L Shaner; K Arnold; V Schulz; C M Drysdale; K Nandabalan; R S Judson; G Ruano; G F Vovis
Journal:  Science       Date:  2001-07-12       Impact factor: 47.728

4.  Integrating genotype and phenotype information: an overview of the PharmGKB project. Pharmacogenetics Research Network and Knowledge Base.

Authors:  T E Klein; J T Chang; M K Cho; K L Easton; R Fergerson; M Hewett; Z Lin; Y Liu; S Liu; D E Oliver; D L Rubin; F Shafa; J M Stuart; R B Altman
Journal:  Pharmacogenomics J       Date:  2001       Impact factor: 3.550

5.  Haplotype tagging for the identification of common disease genes.

Authors:  G C Johnson; L Esposito; B J Barratt; A N Smith; J Heward; G Di Genova; H Ueda; H J Cordell; I A Eaves; F Dudbridge; R C Twells; F Payne; W Hughes; S Nutland; H Stevens; P Carr; E Tuomilehto-Wolf; J Tuomilehto; S C Gough; D G Clayton; J A Todd
Journal:  Nat Genet       Date:  2001-10       Impact factor: 38.330

6.  High-resolution haplotype structure in the human genome.

Authors:  M J Daly; J D Rioux; S F Schaffner; T J Hudson; E S Lander
Journal:  Nat Genet       Date:  2001-10       Impact factor: 38.330

7.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

8.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

9.  Sequence variation in the human angiotensin converting enzyme.

Authors:  M J Rieder; S L Taylor; A G Clark; D A Nickerson
Journal:  Nat Genet       Date:  1999-05       Impact factor: 38.330

10.  Principal components analysis to summarize microarray experiments: application to sporulation time series.

Authors:  S Raychaudhuri; J M Stuart; R B Altman
Journal:  Pac Symp Biocomput       Date:  2000
View more
  30 in total

1.  Allosteric drug discrimination is coupled to mechanochemical changes in the kinesin-5 motor core.

Authors:  Elizabeth D Kim; Rebecca Buckley; Sarah Learman; Jessica Richard; Courtney Parke; David K Worthylake; Edward J Wojcik; Richard A Walker; Sunyoung Kim
Journal:  J Biol Chem       Date:  2010-03-18       Impact factor: 5.157

2.  A sparse marker extension tree algorithm for selecting the best set of haplotype tagging single nucleotide polymorphisms.

Authors:  Ke Hao; Simin Liu; Tianhua Niu
Journal:  Genet Epidemiol       Date:  2005-12       Impact factor: 2.135

3.  Multilocus LD measure and tagging SNP selection with generalized mutual information.

Authors:  Zhenqiu Liu; Shili Lin
Journal:  Genet Epidemiol       Date:  2005-12       Impact factor: 2.135

Review 4.  Recent developments in genomewide association scans: a workshop summary and review.

Authors:  Duncan C Thomas; Robert W Haile; David Duggan
Journal:  Am J Hum Genet       Date:  2005-08-01       Impact factor: 11.025

5.  Multipoint linkage-disequilibrium mapping with haplotype-block structure.

Authors:  Maoxia Zheng; Mary Sara McPeek
Journal:  Am J Hum Genet       Date:  2006-11-30       Impact factor: 11.025

6.  Intra- and interpopulation genotype reconstruction from tagging SNPs.

Authors:  Peristera Paschou; Michael W Mahoney; Asif Javed; Judith R Kidd; Andrew J Pakstis; Sheng Gu; Kenneth K Kidd; Petros Drineas
Journal:  Genome Res       Date:  2006-12-06       Impact factor: 9.043

7.  Association mapping of complex trait loci with context-dependent effects and unknown context variable.

Authors:  Mikko J Sillanpää; Madhuchhanda Bhattacharjee
Journal:  Genetics       Date:  2006-10-08       Impact factor: 4.562

8.  A novel method combining linkage disequilibrium information and imputed functional knowledge for tagSNP selection.

Authors:  R H Rochat; L de las Fuentes; G Stormo; V G Davila-Roman; C Charles Gu
Journal:  Hum Hered       Date:  2007-06-22       Impact factor: 0.444

9.  Methicillin-resistant Staphylococcus aureus infection or colonization present at hospital admission: multivariable risk factor screening to increase efficiency of surveillance culturing.

Authors:  Clinton C Haley; Deepa Mittal; Amanda Laviolette; Sai Jannapureddy; Najma Parvez; Robert W Haley
Journal:  J Clin Microbiol       Date:  2007-07-11       Impact factor: 5.948

10.  Selected questions on biomechanical exposures for surveillance of upper-limb work-related musculoskeletal disorders.

Authors:  Alexis Descatha; Yves Roquelaure; Bradley Evanoff; Isabelle Niedhammer; Jean François Chastang; Camille Mariot; Catherine Ha; Ellen Imbernon; Marcel Goldberg; Annette Leclerc
Journal:  Int Arch Occup Environ Health       Date:  2007-05-03       Impact factor: 3.015

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.