Literature DB >> 31520493

Combining sequence data from multiple studies: Impact of analysis strategies on rare variant calling and association results.

Zhongsheng Chen1, Michael Boehnke1, Christian Fuchsberger1,2,3.   

Abstract

Individual sequencing studies often have limited sample sizes and so limited power to detect trait associations with rare variants. A common strategy is to aggregate data from multiple studies. For studying rare variants, jointly calling all samples together is the gold standard strategy but can be difficult to implement due to privacy restrictions and computational burden. Here, we compare joint calling to the alternative of single-study calling in terms of variant detection sensitivity and genotype accuracy as a function of sequencing coverage and assess their impact on downstream association analysis. To do so, we analyze deep-coverage (~82×) exome and low-coverage (~5×) genome sequence data on 2,250 individuals from the Genetics of Type 2 Diabetes study jointly and separately within five geographic cohorts. For rare single nucleotide variants (SNVs): (a) ≥97% of discovered SNVs are found by both calling strategies; (b) nonreference concordance with a set of highly accurate genotypes is ≥99% for both calling strategies; (c) meta-analysis has similar power to joint analysis in deep-coverage sequence data but can be less powerful in low-coverage sequence data. Given similar data processing and quality control steps, we recommend single-study calling as a viable alternative to joint calling for analyzing SNVs of all minor allele frequency in deep-coverage data.
© 2019 Wiley Periodicals, Inc.

Entities:  

Keywords:  Sequencing studies; joint analysis; meta-analysis; rare variants

Mesh:

Year:  2019        PMID: 31520493      PMCID: PMC7231418          DOI: 10.1002/gepi.22261

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  23 in total

Review 1.  Rare-variant association analysis: study designs and statistical tests.

Authors:  Seunggeung Lee; Gonçalo R Abecasis; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2014-07-03       Impact factor: 11.025

2.  Searching for missing heritability: designing rare variant association studies.

Authors:  Or Zuk; Stephen F Schaffner; Kaitlin Samocha; Ron Do; Eliana Hechter; Sekar Kathiresan; Mark J Daly; Benjamin M Neale; Shamil R Sunyaev; Eric S Lander
Journal:  Proc Natl Acad Sci U S A       Date:  2014-01-17       Impact factor: 11.205

3.  From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

Authors:  Geraldine A Van der Auwera; Mauricio O Carneiro; Christopher Hartl; Ryan Poplin; Guillermo Del Angel; Ami Levy-Moonshine; Tadeusz Jordan; Khalid Shakir; David Roazen; Joel Thibault; Eric Banks; Kiran V Garimella; David Altshuler; Stacey Gabriel; Mark A DePristo
Journal:  Curr Protoc Bioinformatics       Date:  2013

4.  Low-, high-coverage, and two-stage DNA sequencing in the design of the genetic association study.

Authors:  Chao Xu; Kehao Wu; Ji-Gang Zhang; Hui Shen; Hong-Wen Deng
Journal:  Genet Epidemiol       Date:  2016-11-04       Impact factor: 2.135

5.  Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs.

Authors:  Zheng-Zheng Tang; Dan-Yu Lin
Journal:  Am J Hum Genet       Date:  2015-06-18       Impact factor: 11.025

6.  Recommended joint and meta-analysis strategies for case-control association testing of single low-count variants.

Authors:  Clement Ma; Tom Blackwell; Michael Boehnke; Laura J Scott
Journal:  Genet Epidemiol       Date:  2013-06-20       Impact factor: 2.135

7.  An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.

Authors:  Goo Jun; Mary Kate Wing; Gonçalo R Abecasis; Hyun Min Kang
Journal:  Genome Res       Date:  2015-04-16       Impact factor: 9.043

8.  Analysis of protein-coding genetic variation in 60,706 humans.

Authors:  Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2016-08-18       Impact factor: 49.962

9.  Data use under the NIH GWAS data sharing policy and future directions.

Authors:  Dina N Paltoo; Laura Lyman Rodriguez; Michael Feolo; Elizabeth Gillanders; Erin M Ramos; Joni L Rutter; Stephen Sherry; Vivian Ota Wang; Alice Bailey; Rebecca Baker; Mark Caulder; Emily L Harris; Kristofor Langlais; Hilary Leeds; Erin Luetkemeier; Taunton Paine; Tamar Roomian; Kimberly Tryka; Amy Patterson; Eric D Green
Journal:  Nat Genet       Date:  2014-09       Impact factor: 38.330

10.  Deep whole-genome sequencing reveals recent selection signatures linked to evolution and disease risk of Japanese.

Authors:  Yukinori Okada; Yukihide Momozawa; Saori Sakaue; Masahiro Kanai; Kazuyoshi Ishigaki; Masato Akiyama; Toshihiro Kishikawa; Yasumichi Arai; Takashi Sasaki; Kenjiro Kosaki; Makoto Suematsu; Koichi Matsuda; Kazuhiko Yamamoto; Michiaki Kubo; Nobuyoshi Hirose; Yoichiro Kamatani
Journal:  Nat Commun       Date:  2018-04-24       Impact factor: 14.919

View more
  1 in total

1.  Genetic diversity of 'Very Important Pharmacogenes' in two South-Asian populations.

Authors:  Neeraj Bharti; Ruma Banerjee; Archana Achalere; Sunitha Manjari Kasibhatla; Rajendra Joshi
Journal:  PeerJ       Date:  2021-11-10       Impact factor: 2.984

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.