Literature DB >> 27813156

Low-, high-coverage, and two-stage DNA sequencing in the design of the genetic association study.

Chao Xu1, Kehao Wu1, Ji-Gang Zhang1, Hui Shen1, Hong-Wen Deng1.   

Abstract

Next-generation sequencing-based genetic association study (GAS) is a powerful tool to identify candidate disease variants and genomic regions. Although low-coverage sequencing offers low cost but inadequacy in calling rare variants, high coverage is able to detect essentially every variant but at a high cost. Two-stage sequencing may be an economical way to conduct GAS without losing power. In two-stage sequencing, an affordable number of samples are sequenced at high coverage as the reference panel, then to impute in a larger sample is sequenced at low coverage. As unit sequencing costs continue to decrease, investigators can now conduct GAS with more flexible sequencing depths. Here, we systematically evaluate the effect of the read depth and sample size on the variant discovery power and association power for study designs using low-coverage, high-coverage, and two-stage sequencing. We consider 12 low-coverage, 12 high-coverage, and 51 two-stage design scenarios with the read depth varying from 0.5× to 80×. With state-of-the-art simulation and analysis packages and in-house scripts, we simulate the complete study process from DNA sequencing to SNP (single nucleotide polymorphism) calling and association testing. Our results show that with appropriate allocation of sequencing effort, two-stage sequencing is an effective approach for conducting GAS. We provide practical guidelines for investigators to plan the optimum sequencing-based GAS including two-stage sequencing design given their specific constraints of sequencing investment.
© 2016 WILEY PERIODICALS, INC.

Entities:  

Keywords:  next-generation sequencing; rare variant; sequencing cost; study design

Mesh:

Year:  2016        PMID: 27813156      PMCID: PMC5363279          DOI: 10.1002/gepi.22015

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  46 in total

Review 1.  Introduction to genetic association studies.

Authors:  Cathryn M Lewis; Jo Knight
Journal:  Cold Spring Harb Protoc       Date:  2012-03-01

2.  Low-coverage sequencing: implications for design of complex trait association studies.

Authors:  Yun Li; Carlo Sidore; Hyun Min Kang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genome Res       Date:  2011-04-01       Impact factor: 9.043

3.  Evaluating the heritability explained by known susceptibility variants: a survey of ten complex diseases.

Authors:  Hon-Cheong So; Allen H S Gui; Stacey S Cherny; Pak C Sham
Journal:  Genet Epidemiol       Date:  2011-03-03       Impact factor: 2.135

4.  AbCD: arbitrary coverage design for sequencing-based genetic studies.

Authors:  Jian Kang; Kuan-Chieh Huang; Zheng Xu; Yunfei Wang; Gonçalo R Abecasis; Yun Li
Journal:  Bioinformatics       Date:  2013-01-28       Impact factor: 6.937

5.  Genome sequencing identifies major causes of severe intellectual disability.

Authors:  Christian Gilissen; Jayne Y Hehir-Kwa; Djie Tjwan Thung; Maartje van de Vorst; Bregje W M van Bon; Marjolein H Willemsen; Michael Kwint; Irene M Janssen; Alexander Hoischen; Annette Schenck; Richard Leach; Robert Klein; Rick Tearle; Tan Bo; Rolph Pfundt; Helger G Yntema; Bert B A de Vries; Tjitske Kleefstra; Han G Brunner; Lisenka E L M Vissers; Joris A Veltman
Journal:  Nature       Date:  2014-06-04       Impact factor: 49.962

6.  A recurrent de novo mutation in KCNC1 causes progressive myoclonus epilepsy.

Authors:  Mikko Muona; Samuel F Berkovic; Leanne M Dibbens; Karen L Oliver; Snezana Maljevic; Marta A Bayly; Tarja Joensuu; Laura Canafoglia; Silvana Franceschetti; Roberto Michelucci; Salla Markkinen; Sarah E Heron; Michael S Hildebrand; Eva Andermann; Frederick Andermann; Antonio Gambardella; Paolo Tinuper; Laura Licchetta; Ingrid E Scheffer; Chiara Criscuolo; Alessandro Filla; Edoardo Ferlazzo; Jamil Ahmad; Adeel Ahmad; Betul Baykan; Edith Said; Meral Topcu; Patrizia Riguzzi; Mary D King; Cigdem Ozkara; Danielle M Andrade; Bernt A Engelsen; Arielle Crespel; Matthias Lindenau; Ebba Lohmann; Veronica Saletti; João Massano; Michael Privitera; Alberto J Espay; Birgit Kauffmann; Michael Duchowny; Rikke S Møller; Rachel Straussberg; Zaid Afawi; Bruria Ben-Zeev; Kaitlin E Samocha; Mark J Daly; Steven Petrou; Holger Lerche; Aarno Palotie; Anna-Elina Lehesjoki
Journal:  Nat Genet       Date:  2014-11-17       Impact factor: 38.330

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

9.  Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations.

Authors:  Brian J O'Roak; Pelagia Deriziotis; Choli Lee; Laura Vives; Jerrod J Schwartz; Santhosh Girirajan; Emre Karakoc; Alexandra P Mackenzie; Sarah B Ng; Carl Baker; Mark J Rieder; Deborah A Nickerson; Raphael Bernier; Simon E Fisher; Jay Shendure; Evan E Eichler
Journal:  Nat Genet       Date:  2011-05-15       Impact factor: 38.330

10.  The UK10K project identifies rare variants in health and disease.

Authors:  Klaudia Walter; Josine L Min; Jie Huang; Lucy Crooks; Yasin Memari; Shane McCarthy; John R B Perry; ChangJiang Xu; Marta Futema; Daniel Lawson; Valentina Iotchkova; Stephan Schiffels; Audrey E Hendricks; Petr Danecek; Rui Li; James Floyd; Louise V Wain; Inês Barroso; Steve E Humphries; Matthew E Hurles; Eleftheria Zeggini; Jeffrey C Barrett; Vincent Plagnol; J Brent Richards; Celia M T Greenwood; Nicholas J Timpson; Richard Durbin; Nicole Soranzo
Journal:  Nature       Date:  2015-09-14       Impact factor: 49.962

View more
  5 in total

1.  Combining sequence data from multiple studies: Impact of analysis strategies on rare variant calling and association results.

Authors:  Zhongsheng Chen; Michael Boehnke; Christian Fuchsberger
Journal:  Genet Epidemiol       Date:  2019-09-14       Impact factor: 2.135

2.  A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals.

Authors:  Roger Ros-Freixedes; Serap Gonen; Gregor Gorjanc; John M Hickey
Journal:  Genet Sel Evol       Date:  2017-10-25       Impact factor: 4.297

3.  Assessment of low-coverage nanopore long read sequencing for SNP genotyping in doubled haploid canola (Brassica napus L.).

Authors:  M M Malmberg; G C Spangenberg; H D Daetwyler; N O I Cogan
Journal:  Sci Rep       Date:  2019-06-18       Impact factor: 4.379

4.  Optimal sequencing depth design for whole genome re-sequencing in pigs.

Authors:  Yifan Jiang; Yao Jiang; Sheng Wang; Qin Zhang; Xiangdong Ding
Journal:  BMC Bioinformatics       Date:  2019-11-08       Impact factor: 3.169

Review 5.  Personalized or Precision Medicine? The Example of Cystic Fibrosis.

Authors:  Fernando A L Marson; Carmen S Bertuzzo; José D Ribeiro
Journal:  Front Pharmacol       Date:  2017-06-20       Impact factor: 5.810

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.