Literature DB >> 24733292

Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic.

Andriy Derkach1, Theodore Chiang1, Jiafen Gong1, Laura Addis1, Sara Dobbins1, Ian Tomlinson1, Richard Houlston1, Deb K Pal1, Lisa J Strug2.   

Abstract

MOTIVATION: Sufficiently powered case-control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data.
RESULTS: We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the 'gold standard' analysis with the true underlying genotypes for both common and rare variants.
AVAILABILITY AND IMPLEMENTATION: An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. CONTACT: lisa.strug@utoronto.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2014        PMID: 24733292      PMCID: PMC4103600          DOI: 10.1093/bioinformatics/btu196

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Rare-variant association testing for sequencing data with the sequence kernel association test.

Authors:  Michael C Wu; Seunggeun Lee; Tianxi Cai; Yun Li; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2011-07-07       Impact factor: 11.025

Review 3.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

4.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

5.  Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays.

Authors:  Radoje Drmanac; Andrew B Sparks; Matthew J Callow; Aaron L Halpern; Norman L Burns; Bahram G Kermani; Paolo Carnevali; Igor Nazarenko; Geoffrey B Nilsen; George Yeung; Fredrik Dahl; Andres Fernandez; Bryan Staker; Krishna P Pant; Jonathan Baccash; Adam P Borcherding; Anushka Brownley; Ryan Cedeno; Linsu Chen; Dan Chernikoff; Alex Cheung; Razvan Chirita; Benjamin Curson; Jessica C Ebert; Coleen R Hacker; Robert Hartlage; Brian Hauser; Steve Huang; Yuan Jiang; Vitali Karpinchyk; Mark Koenig; Calvin Kong; Tom Landers; Catherine Le; Jia Liu; Celeste E McBride; Matt Morenzoni; Robert E Morey; Karl Mutch; Helena Perazich; Kimberly Perry; Brock A Peters; Joe Peterson; Charit L Pethiyagoda; Kaliprasad Pothuraju; Claudia Richter; Abraham M Rosenbaum; Shaunak Roy; Jay Shafto; Uladzislau Sharanhovich; Karen W Shannon; Conrad G Sheppy; Michel Sun; Joseph V Thakuria; Anne Tran; Dylan Vu; Alexander Wait Zaranek; Xiaodi Wu; Snezana Drmanac; Arnold R Oliphant; William C Banyai; Bruce Martin; Dennis G Ballinger; George M Church; Clifford A Reid
Journal:  Science       Date:  2009-11-05       Impact factor: 47.728

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  Three ways of combining genotyping and resequencing in case-control association studies.

Authors:  Jeffrey A Longmate; Garrett P Larson; Theodore G Krontiris; Steve S Sommer
Journal:  PLoS One       Date:  2010-12-20       Impact factor: 3.240

8.  Testing for an unusual distribution of rare variants.

Authors:  Benjamin M Neale; Manuel A Rivas; Benjamin F Voight; David Altshuler; Bernie Devlin; Marju Orho-Melander; Sekar Kathiresan; Shaun M Purcell; Kathryn Roeder; Mark J Daly
Journal:  PLoS Genet       Date:  2011-03-03       Impact factor: 5.917

9.  Estimation of allele frequency and association mapping using next-generation sequencing data.

Authors:  Su Yeon Kim; Kirk E Lohmueller; Anders Albrechtsen; Yingrui Li; Thorfinn Korneliussen; Geng Tian; Niels Grarup; Tao Jiang; Gitte Andersen; Daniel Witte; Torben Jorgensen; Torben Hansen; Oluf Pedersen; Jun Wang; Rasmus Nielsen
Journal:  BMC Bioinformatics       Date:  2011-06-11       Impact factor: 3.169

10.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

View more
  18 in total

1.  Likelihood-based complex trait association testing for arbitrary depth sequencing data.

Authors:  Song Yan; Shuai Yuan; Zheng Xu; Baqun Zhang; Bo Zhang; Guolian Kang; Andrea Byrnes; Yun Li
Journal:  Bioinformatics       Date:  2015-05-14       Impact factor: 6.937

2.  Analysis in case-control sequencing association studies with different sequencing depths.

Authors:  Sixing Chen; Xihong Lin
Journal:  Biostatistics       Date:  2020-07-01       Impact factor: 5.899

Review 3.  Complex-Trait Prediction in the Era of Big Data.

Authors:  Gustavo de Los Campos; Ana Ines Vazquez; Stephen Hsu; Louis Lello
Journal:  Trends Genet       Date:  2018-08-20       Impact factor: 11.639

4.  Family-based genome scan for age at onset of late-onset Alzheimer's disease in whole exome sequencing data.

Authors:  M Saad; Z Brkanac; E M Wijsman
Journal:  Genes Brain Behav       Date:  2015-09-23       Impact factor: 3.449

5.  Association score testing for rare variants and binary traits in family data with shared controls.

Authors:  Mohamad Saad; Ellen M Wijsman
Journal:  Brief Bioinform       Date:  2019-01-18       Impact factor: 11.622

6.  Improving power for rare-variant tests by integrating external controls.

Authors:  Seunggeun Lee; Sehee Kim; Christian Fuchsberger
Journal:  Genet Epidemiol       Date:  2017-06-28       Impact factor: 2.135

7.  Whole exome sequencing in extended families with autism spectrum disorder implicates four candidate genes.

Authors:  Nicola H Chapman; Alejandro Q Nato; Raphael Bernier; Katy Ankenman; Harkirat Sohi; Jeff Munson; Ashok Patowary; Marilyn Archer; Elizabeth M Blue; Sara Jane Webb; Hilary Coon; Wendy H Raskind; Zoran Brkanac; Ellen M Wijsman
Journal:  Hum Genet       Date:  2015-07-24       Impact factor: 4.132

8.  Prioritizing rare variants with conditional likelihood ratios.

Authors:  Weili Li; Sara Dobbins; Ian Tomlinson; Richard Houlston; Deb K Pal; Lisa J Strug
Journal:  Hum Hered       Date:  2015-02-03       Impact factor: 0.444

9.  A data harmonization pipeline to leverage external controls and boost power in GWAS.

Authors:  Danfeng Chen; Katherine Tashman; Duncan S Palmer; Benjamin Neale; Kathryn Roeder; Alex Bloemendal; Claire Churchhouse; Zheng Tracy Ke
Journal:  Hum Mol Genet       Date:  2022-02-03       Impact factor: 5.121

10.  Novel score test to increase power in association test by integrating external controls.

Authors:  Yatong Li; Seunggeun Lee
Journal:  Genet Epidemiol       Date:  2020-11-08       Impact factor: 2.344

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.