Literature DB >> 21181893

Postassociation cleaning using linkage disequilibrium information.

Buhm Han1, Brian M Hackel, Eleazar Eskin.   

Abstract

In genetic association studies, quality control (QC) filters are applied to remove potentially problematic markers before the markers are tested for statistical associations. However, spurious associations can still occur after QC. We introduce Post-Association Cleaning (PAC) approach that can complement QC by capturing spurious associations using the information in the post-association results. Specifically, we propose a PAC filter based on the linkage disequilibrium (LD) information. The intuition is that if the association is caused by a true genetic effect, neighboring markers in LD should show comparably significant P-values. If not, it may be evidence of spurious association. Previous studies have applied the same idea but only manually without a formal statistical framework. Our proposed method LD-PAC provides a systematic framework to quantitatively measure the evidence of spurious associations based on the likelihood ratio. Simulations show that LD-PAC can detect spurious associations with high detection rate (84%). In addition to detecting spurious associations, our method can also be used to "rescue" candidate associations from the supposedly unclean data such as the markers excluded by QC. Although the additional associations must be treated with care, they can suggest interesting regions. The application of our method to the Wellcome Trust Case Control Consortium (WTCCC) data led to the discovery of an additional candidate association for type 1 diabetes among the QC-excluded markers. This locus turns out to be in a region recently identified as significant by a meta-analysis performed after the WTCCC study was published.
© 2010 Wiley-Liss, Inc.

Entities:  

Mesh:

Year:  2011        PMID: 21181893     DOI: 10.1002/gepi.20544

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  5 in total

1.  FAPI: Fast and accurate P-value Imputation for genome-wide association study.

Authors:  Johnny S H Kwan; Miao-Xin Li; Jia-En Deng; Pak C Sham
Journal:  Eur J Hum Genet       Date:  2015-08-26       Impact factor: 4.246

2.  DISSCO: direct imputation of summary statistics allowing covariates.

Authors:  Zheng Xu; Qing Duan; Song Yan; Wei Chen; Mingyao Li; Ethan Lange; Yun Li
Journal:  Bioinformatics       Date:  2015-03-24       Impact factor: 6.937

3.  Pitfalls of merging GWAS data: lessons learned in the eMERGE network and quality control procedures to maintain high data quality.

Authors:  Rebecca L Zuvich; Loren L Armstrong; Suzette J Bielinski; Yuki Bradford; Christopher S Carlson; Dana C Crawford; Andrew T Crenshaw; Mariza de Andrade; Kimberly F Doheny; Jonathan L Haines; M Geoffrey Hayes; Gail P Jarvik; Lan Jiang; Iftikhar J Kullo; Rongling Li; Hua Ling; Teri A Manolio; Martha E Matsumoto; Catherine A McCarty; Andrew N McDavid; Daniel B Mirel; Lana M Olson; Justin E Paschall; Elizabeth W Pugh; Luke V Rasmussen; Laura J Rasmussen-Torvik; Stephen D Turner; Russell A Wilke; Marylyn D Ritchie
Journal:  Genet Epidemiol       Date:  2011-12       Impact factor: 2.135

4.  Imputing Phenotypes for Genome-wide Association Studies.

Authors:  Farhad Hormozdiari; Eun Yong Kang; Michael Bilow; Eyal Ben-David; Chris Vulpe; Stela McLachlan; Aldons J Lusis; Buhm Han; Eleazar Eskin
Journal:  Am J Hum Genet       Date:  2016-06-09       Impact factor: 11.025

5.  Adapt-Mix: learning local genetic correlation structure improves summary statistics-based analyses.

Authors:  Danny S Park; Brielin Brown; Celeste Eng; Scott Huntsman; Donglei Hu; Dara G Torgerson; Esteban G Burchard; Noah Zaitlen
Journal:  Bioinformatics       Date:  2015-06-15       Impact factor: 6.937

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.