Literature DB >> 25810429

DISSCO: direct imputation of summary statistics allowing covariates.

Zheng Xu1, Qing Duan2, Song Yan1, Wei Chen3, Mingyao Li4, Ethan Lange5, Yun Li1.   

Abstract

BACKGROUND: Imputation of individual level genotypes at untyped markers using an external reference panel of genotyped or sequenced individuals has become standard practice in genetic association studies. Direct imputation of summary statistics can also be valuable, for example in meta-analyses where individual level genotype data are not available. Two methods (DIST and ImpG-Summary/LD), that assume a multivariate Gaussian distribution for the association summary statistics, have been proposed for imputing association summary statistics. However, both methods assume that the correlations between association summary statistics are the same as the correlations between the corresponding genotypes. This assumption can be violated in the presence of confounding covariates.
METHODS: We analytically show that in the absence of covariates, correlation among association summary statistics is indeed the same as that among the corresponding genotypes, thus serving as a theoretical justification for the recently proposed methods. We continue to prove that in the presence of covariates, correlation among association summary statistics becomes the partial correlation of the corresponding genotypes controlling for covariates. We therefore develop direct imputation of summary statistics allowing covariates (DISSCO).
RESULTS: We consider two real-life scenarios where the correlation and partial correlation likely make practical difference: (i) association studies in admixed populations; (ii) association studies in presence of other confounding covariate(s). Application of DISSCO to real datasets under both scenarios shows at least comparable, if not better, performance compared with existing correlation-based methods, particularly for lower frequency variants. For example, DISSCO can reduce the absolute deviation from the truth by 3.9-15.2% for variants with minor allele frequency <5%.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2015        PMID: 25810429      PMCID: PMC4514926          DOI: 10.1093/bioinformatics/btv168

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests.

Authors:  Karen N Conneely; Michael Boehnke
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

2.  Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples.

Authors:  Amanda F Marvelle; Leslie A Lange; Li Qin; Yunfei Wang; Ethan M Lange; Linda S Adair; Karen L Mohlke
Journal:  J Hum Genet       Date:  2007-07-18       Impact factor: 3.172

3.  Imputation of coding variants in African Americans: better performance using data from the exome sequencing project.

Authors:  Qing Duan; Eric Yi Liu; Paul L Auer; Guosheng Zhang; Ethan M Lange; Goo Jun; Chris Bizon; Shuo Jiao; Steven Buyske; Nora Franceschini; Chris S Carlson; Li Hsu; Alex P Reiner; Ulrike Peters; Jeffrey Haessler; Keith Curtis; Christina L Wassel; Jennifer G Robinson; Lisa W Martin; Christopher A Haiman; Loic Le Marchand; Tara C Matise; Lucia A Hindorff; Dana C Crawford; Themistocles L Assimes; Hyun Min Kang; Gerardo Heiss; Rebecca D Jackson; Charles Kooperberg; James G Wilson; Gonçalo R Abecasis; Kari E North; Deborah A Nickerson; Leslie A Lange; Yun Li
Journal:  Bioinformatics       Date:  2013-08-16       Impact factor: 6.937

4.  Design of the Women's Health Initiative clinical trial and observational study. The Women's Health Initiative Study Group.

Authors: 
Journal:  Control Clin Trials       Date:  1998-02

5.  USING LINEAR PREDICTORS TO IMPUTE ALLELE FREQUENCIES FROM SUMMARY OR POOLED GENOTYPE DATA.

Authors:  Xiaoquan Wen; Matthew Stephens
Journal:  Ann Appl Stat       Date:  2010-09       Impact factor: 2.083

6.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

7.  Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma.

Authors:  John C Chambers; Weihua Zhang; Joban Sehmi; Xinzhong Li; Mark N Wass; Pim Van der Harst; Hilma Holm; Serena Sanna; Maryam Kavousi; Sebastian E Baumeister; Lachlan J Coin; Guohong Deng; Christian Gieger; Nancy L Heard-Costa; Jouke-Jan Hottenga; Brigitte Kühnel; Vinod Kumar; Vasiliki Lagou; Liming Liang; Jian'an Luan; Pedro Marques Vidal; Irene Mateo Leach; Paul F O'Reilly; John F Peden; Nilufer Rahmioglu; Pasi Soininen; Elizabeth K Speliotes; Xin Yuan; Gudmar Thorleifsson; Behrooz Z Alizadeh; Larry D Atwood; Ingrid B Borecki; Morris J Brown; Pimphen Charoen; Francesco Cucca; Debashish Das; Eco J C de Geus; Anna L Dixon; Angela Döring; Georg Ehret; Gudmundur I Eyjolfsson; Martin Farrall; Nita G Forouhi; Nele Friedrich; Wolfram Goessling; Daniel F Gudbjartsson; Tamara B Harris; Anna-Liisa Hartikainen; Simon Heath; Gideon M Hirschfield; Albert Hofman; Georg Homuth; Elina Hyppönen; Harry L A Janssen; Toby Johnson; Antti J Kangas; Ido P Kema; Jens P Kühn; Sandra Lai; Mark Lathrop; Markus M Lerch; Yun Li; T Jake Liang; Jing-Ping Lin; Ruth J F Loos; Nicholas G Martin; Miriam F Moffatt; Grant W Montgomery; Patricia B Munroe; Kiran Musunuru; Yusuke Nakamura; Christopher J O'Donnell; Isleifur Olafsson; Brenda W Penninx; Anneli Pouta; Bram P Prins; Inga Prokopenko; Ralf Puls; Aimo Ruokonen; Markku J Savolainen; David Schlessinger; Jeoffrey N L Schouten; Udo Seedorf; Srijita Sen-Chowdhry; Katherine A Siminovitch; Johannes H Smit; Timothy D Spector; Wenting Tan; Tanya M Teslovich; Taru Tukiainen; Andre G Uitterlinden; Melanie M Van der Klauw; Ramachandran S Vasan; Chris Wallace; Henri Wallaschofski; H-Erich Wichmann; Gonneke Willemsen; Peter Würtz; Chun Xu; Laura M Yerges-Armstrong; Goncalo R Abecasis; Kourosh R Ahmadi; Dorret I Boomsma; Mark Caulfield; William O Cookson; Cornelia M van Duijn; Philippe Froguel; Koichi Matsuda; Mark I McCarthy; Christa Meisinger; Vincent Mooser; Kirsi H Pietiläinen; Gunter Schumann; Harold Snieder; Michael J E Sternberg; Ronald P Stolk; Howard C Thomas; Unnur Thorsteinsdottir; Manuela Uda; Gérard Waeber; Nicholas J Wareham; Dawn M Waterworth; Hugh Watkins; John B Whitfield; Jacqueline C M Witteman; Bruce H R Wolffenbuttel; Caroline S Fox; Mika Ala-Korpela; Kari Stefansson; Peter Vollenweider; Henry Völzke; Eric E Schadt; James Scott; Marjo-Riitta Järvelin; Paul Elliott; Jaspal S Kooner
Journal:  Nat Genet       Date:  2011-10-16       Impact factor: 38.330

8.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

9.  DIST: direct imputation of summary statistics for unmeasured SNPs.

Authors:  Donghyung Lee; T Bernard Bigdeli; Brien P Riley; Ayman H Fanous; Silviu-Alin Bacanu
Journal:  Bioinformatics       Date:  2013-08-28       Impact factor: 6.937

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  11 in total

1.  A powerful subset-based method identifies gene set associations and improves interpretation in UK Biobank.

Authors:  Diptavo Dutta; Peter VandeHaar; Lars G Fritsche; Sebastian Zöllner; Michael Boehnke; Laura J Scott; Seunggeun Lee
Journal:  Am J Hum Genet       Date:  2021-03-16       Impact factor: 11.025

2.  Transethnic Genetic-Correlation Estimates from Summary Statistics.

Authors:  Brielin C Brown; Chun Jimmie Ye; Alkes L Price; Noah Zaitlen
Journal:  Am J Hum Genet       Date:  2016-06-16       Impact factor: 11.025

Review 3.  Dissecting the genetics of complex traits using summary association statistics.

Authors:  Bogdan Pasaniuc; Alkes L Price
Journal:  Nat Rev Genet       Date:  2016-11-14       Impact factor: 53.242

4.  PolyGEE: a generalized estimating equation approach to the efficient and robust estimation of polygenic effects in large-scale association studies.

Authors:  Julian Hecker; Dmitry Prokopenko; Christoph Lange; Heide Loehlein Fier
Journal:  Biostatistics       Date:  2018-07-01       Impact factor: 5.899

5.  Prospects of Fine-Mapping Trait-Associated Genomic Regions by Using Summary Statistics from Genome-wide Association Studies.

Authors:  Christian Benner; Aki S Havulinna; Marjo-Riitta Järvelin; Veikko Salomaa; Samuli Ripatti; Matti Pirinen
Journal:  Am J Hum Genet       Date:  2017-09-21       Impact factor: 11.025

6.  JEPEGMIX: gene-level joint analysis of functional SNPs in cosmopolitan cohorts.

Authors:  Donghyung Lee; Vernell S Williamson; T Bernard Bigdeli; Brien P Riley; Bradley T Webb; Ayman H Fanous; Kenneth S Kendler; Vladimir I Vladimirov; Silviu-Alin Bacanu
Journal:  Bioinformatics       Date:  2015-10-01       Impact factor: 6.937

7.  Accurate and adaptive imputation of summary statistics in mixed-ethnicity cohorts.

Authors:  Matteo Togninalli; Damian Roqueiro; Karsten M Borgwardt
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

8.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics.

Authors:  David Lamparter; Daniel Marbach; Rico Rueedi; Zoltán Kutalik; Sven Bergmann
Journal:  PLoS Comput Biol       Date:  2016-01-25       Impact factor: 4.475

9.  Comparison of novel and existing methods for detecting differentially methylated regions.

Authors:  Samantha Lent; Hanfei Xu; Lan Wang; Zhe Wang; Chloé Sarnowski; Marie-France Hivert; Josée Dupuis
Journal:  BMC Genet       Date:  2018-09-17       Impact factor: 2.797

10.  Analysis of genetic and nongenetic factors influencing triglycerides-lowering drug effects based on paired observations.

Authors:  Zheng Xu; Qing Duan; Juan Cui; Yumou Qiu; Qidong Jia; Cong Wu; Jennifer Clarke
Journal:  BMC Proc       Date:  2018-09-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.