Literature DB >> 30590456

Analysis in case-control sequencing association studies with different sequencing depths.

Sixing Chen1, Xihong Lin1,2.   

Abstract

With the advent of next-generation sequencing, investigators have access to higher quality sequencing data. However, to sequence all samples in a study using next generation sequencing can still be prohibitively expensive. One potential remedy could be to combine next generation sequencing data from cases with publicly available sequencing data for controls, but there could be a systematic difference in quality of sequenced data, such as sequencing depths, between sequenced study cases and publicly available controls. We propose a regression calibration (RC)-based method and a maximum-likelihood method for conducting an association study with such a combined sample by accounting for differential sequencing errors between cases and controls. The methods allow for adjusting for covariates, such as population stratification as confounders. Both methods control type I error and have comparable power to analysis conducted using the true genotype with sufficiently high but different sequencing depths. We show that the RC method allows for analysis using naive variance estimate (closely approximates true variance in practice) and standard software under certain circumstances. We evaluate the performance of the proposed methods using simulation studies and apply our methods to a combined data set of exome sequenced acute lung injury cases and healthy controls from the 1000 Genomes project.
© The Author 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Genetic association studies; Hypothesis testing; Maximum likelihood; Misclassification; Regression calibration; Sequencing depth

Mesh:

Year:  2020        PMID: 30590456      PMCID: PMC7308042          DOI: 10.1093/biostatistics/kxy073

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  22 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2008-08-07       Impact factor: 11.025

3.  Rare-variant association testing for sequencing data with the sequence kernel association test.

Authors:  Michael C Wu; Seunggeun Lee; Tianxi Cai; Yun Li; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2011-07-07       Impact factor: 11.025

4.  Integrative analysis of sequencing and array genotype data for discovering disease associations with rare mutations.

Authors:  Yi-Juan Hu; Yun Li; Paul L Auer; Dan-Yu Lin
Journal:  Proc Natl Acad Sci U S A       Date:  2015-01-12       Impact factor: 11.205

Review 5.  Rare-variant association analysis: study designs and statistical tests.

Authors:  Seunggeung Lee; Gonçalo R Abecasis; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2014-07-03       Impact factor: 11.025

6.  A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity.

Authors:  D J Balding; R A Nichols
Journal:  Genetica       Date:  1995       Impact factor: 1.082

7.  Binary regression with differentially misclassified response and exposure variables.

Authors:  Li Tang; Robert H Lyles; Caroline C King; David D Celentano; Yungtai Lo
Journal:  Stat Med       Date:  2015-02-04       Impact factor: 2.373

8.  Three ways of combining genotyping and resequencing in case-control association studies.

Authors:  Jeffrey A Longmate; Garrett P Larson; Theodore G Krontiris; Steve S Sommer
Journal:  PLoS One       Date:  2010-12-20       Impact factor: 3.240

9.  Fine mapping of five loci associated with low-density lipoprotein cholesterol detects variants that double the explained heritability.

Authors:  Serena Sanna; Bingshan Li; Antonella Mulas; Carlo Sidore; Hyun M Kang; Anne U Jackson; Maria Grazia Piras; Gianluca Usala; Giuseppe Maninchedda; Alessandro Sassu; Fabrizio Serra; Maria Antonietta Palmas; William H Wood; Inger Njølstad; Markku Laakso; Kristian Hveem; Jaakko Tuomilehto; Timo A Lakka; Rainer Rauramaa; Michael Boehnke; Francesco Cucca; Manuela Uda; David Schlessinger; Ramaiah Nagaraja; Gonçalo R Abecasis
Journal:  PLoS Genet       Date:  2011-07-28       Impact factor: 5.917

10.  Estimation of allele frequency and association mapping using next-generation sequencing data.

Authors:  Su Yeon Kim; Kirk E Lohmueller; Anders Albrechtsen; Yingrui Li; Thorfinn Korneliussen; Geng Tian; Niels Grarup; Tao Jiang; Gitte Andersen; Daniel Witte; Torben Jorgensen; Torben Hansen; Oluf Pedersen; Jun Wang; Rasmus Nielsen
Journal:  BMC Bioinformatics       Date:  2011-06-11       Impact factor: 3.169

View more
  5 in total

Review 1.  Opportunities and challenges for the use of common controls in sequencing studies.

Authors:  Genevieve L Wojcik; Jessica Murphy; Jacob L Edelson; Christopher R Gignoux; Alexander G Ioannidis; Alisa Manning; Manuel A Rivas; Steven Buyske; Audrey E Hendricks
Journal:  Nat Rev Genet       Date:  2022-05-17       Impact factor: 59.581

2.  Integrating external controls in case-control studies improves power for rare-variant tests.

Authors:  Yatong Li; Seunggeun Lee
Journal:  Genet Epidemiol       Date:  2022-02-16       Impact factor: 2.344

3.  A data harmonization pipeline to leverage external controls and boost power in GWAS.

Authors:  Danfeng Chen; Katherine Tashman; Duncan S Palmer; Benjamin Neale; Kathryn Roeder; Alex Bloemendal; Claire Churchhouse; Zheng Tracy Ke
Journal:  Hum Mol Genet       Date:  2022-02-03       Impact factor: 5.121

4.  Novel score test to increase power in association test by integrating external controls.

Authors:  Yatong Li; Seunggeun Lee
Journal:  Genet Epidemiol       Date:  2020-11-08       Impact factor: 2.344

5.  Current Status of Next-Generation Sequencing Approaches for Candidate Gene Discovery in Familial Parkinson´s Disease.

Authors:  Nikita Simone Pillay; Owen A Ross; Alan Christoffels; Soraya Bardien
Journal:  Front Genet       Date:  2022-03-01       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.