Literature DB >> 20552648

Design of association studies with pooled or un-pooled next-generation sequencing data.

Su Yeon Kim1, Yingrui Li, Yiran Guo, Ruiqiang Li, Johan Holmkvist, Torben Hansen, Oluf Pedersen, Jun Wang, Rasmus Nielsen.   

Abstract

Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (a minor allele frequency, MAF <5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing. (c) 2010 Wiley-Liss, Inc.

Entities:  

Mesh:

Year:  2010        PMID: 20552648      PMCID: PMC5001557          DOI: 10.1002/gepi.20501

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  50 in total

1.  Optimal two-stage genotyping designs for genome-wide association scans.

Authors:  Hansong Wang; Duncan C Thomas; Itsik Pe'er; Daniel O Stram
Journal:  Genet Epidemiol       Date:  2006-05       Impact factor: 2.135

2.  Direct selection of human genomic loci by microarray hybridization.

Authors:  Thomas J Albert; Michael N Molla; Donna M Muzny; Lynne Nazareth; David Wheeler; Xingzhi Song; Todd A Richmond; Chris M Middle; Matthew J Rodesch; Charles J Packard; George M Weinstock; Richard A Gibbs
Journal:  Nat Methods       Date:  2007-10-14       Impact factor: 28.547

3.  Power of deep, all-exon resequencing for discovery of human trait genes.

Authors:  Gregory V Kryukov; Alexander Shpunt; John A Stamatoyannopoulos; Shamil R Sunyaev
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-06       Impact factor: 11.205

4.  SNP frequency estimation using massively parallel sequencing of pooled DNA.

Authors:  Max Ingman; Ulf Gyllensten
Journal:  Eur J Hum Genet       Date:  2008-10-15       Impact factor: 4.246

5.  Association testing by DNA pooling: an effective initial screen.

Authors:  Aruna Bansal; Dirk van den Boom; Stefan Kammerer; Christiane Honisch; Gail Adam; Charles R Cantor; Patrick Kleyn; Andi Braun
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-10       Impact factor: 11.205

Review 6.  Human genetic variation and its contribution to complex traits.

Authors:  Kelly A Frazer; Sarah S Murray; Nicholas J Schork; Eric J Topol
Journal:  Nat Rev Genet       Date:  2009-04       Impact factor: 53.242

7.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

Review 8.  Common and rare variants in multifactorial susceptibility to common diseases.

Authors:  Walter Bodmer; Carolina Bonilla
Journal:  Nat Genet       Date:  2008-06       Impact factor: 38.330

9.  Multiple rare nonsynonymous variants in the adenomatous polyposis coli gene predispose to colorectal adenomas.

Authors:  Duncan Azzopardi; Anthony R Dallosso; Kristilyn Eliason; Brant C Hendrickson; Natalie Jones; Edward Rawstorne; James Colley; Valentina Moskvina; Cynthia Frye; Julian R Sampson; Richard Wenstrup; Thomas Scholl; Jeremy P Cheadle
Journal:  Cancer Res       Date:  2008-01-15       Impact factor: 12.701

10.  Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing.

Authors:  Andreas Gnirke; Alexandre Melnikov; Jared Maguire; Peter Rogov; Emily M LeProust; William Brockman; Timothy Fennell; Georgia Giannoukos; Sheila Fisher; Carsten Russ; Stacey Gabriel; David B Jaffe; Eric S Lander; Chad Nusbaum
Journal:  Nat Biotechnol       Date:  2009-02-01       Impact factor: 54.908

View more
  47 in total

1.  Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: efficiency and power.

Authors:  Guolian Kang; Dongyu Lin; Hakon Hakonarson; Jinbo Chen
Journal:  Hum Hered       Date:  2012-06-07       Impact factor: 0.444

2.  A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

Authors:  Li Luo; Yun Zhu; Momiao Xiong
Journal:  J Comput Biol       Date:  2012-05-31       Impact factor: 1.479

3.  Quantifying population genetic differentiation from next-generation sequencing data.

Authors:  Matteo Fumagalli; Filipe G Vieira; Thorfinn Sand Korneliussen; Tyler Linderoth; Emilia Huerta-Sánchez; Anders Albrechtsen; Rasmus Nielsen
Journal:  Genetics       Date:  2013-08-26       Impact factor: 4.562

4.  On the design and analysis of next-generation sequencing genotyping for a cohort with haplotype-informative reads.

Authors:  Degui Zhi; Nianjun Liu; Kui Zhang
Journal:  Methods       Date:  2015-01-30       Impact factor: 3.608

5.  Two-stage design of sequencing studies for testing association with rare variants.

Authors:  Fan Yang; Duncan C Thomas
Journal:  Hum Hered       Date:  2011-07-02       Impact factor: 0.444

6.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

7.  Likelihood-based complex trait association testing for arbitrary depth sequencing data.

Authors:  Song Yan; Shuai Yuan; Zheng Xu; Baqun Zhang; Bo Zhang; Guolian Kang; Andrea Byrnes; Yun Li
Journal:  Bioinformatics       Date:  2015-05-14       Impact factor: 6.937

8.  Fast and accurate site frequency spectrum estimation from low coverage sequence data.

Authors:  Eunjung Han; Janet S Sinsheimer; John Novembre
Journal:  Bioinformatics       Date:  2014-10-30       Impact factor: 6.937

9.  Identification of Sex-determining Loci in Pacific White Shrimp Litopeneaus vannamei Using Linkage and Association Analysis.

Authors:  Yang Yu; Xiaojun Zhang; Jianbo Yuan; Quanchao Wang; Shihao Li; Hao Huang; Fuhua Li; Jianhai Xiang
Journal:  Mar Biotechnol (NY)       Date:  2017-05-16       Impact factor: 3.619

10.  Rare variant discovery and calling by sequencing pooled samples with overlaps.

Authors:  Wenhui Wang; Xiaolin Yin; Yoon Soo Pyon; Matthew Hayes; Jing Li
Journal:  Bioinformatics       Date:  2012-10-27       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.