Literature DB >> 22570057

Association testing for next-generation sequencing data using score statistics.

Line Skotte1, Thorfinn Sand Korneliussen, Anders Albrechtsen.   

Abstract

The advances in sequencing technology have made large-scale sequencing studies for large cohorts feasible. Often, the primary goal for large-scale studies is to identify genetic variants associated with a disease or other phenotypes. Even when deep sequencing is performed, there will be many sites where there is not enough data to call genotypes accurately. Ignoring the genotype classification uncertainty by basing subsequent analyses on called genotypes leads to a loss in power. Additionally, using called genotypes can lead to spurious association signals. Some methods taking the uncertainty of genotype calls into account have been proposed; most require numerical optimization which for large-scale data is not always computationally feasible. We show that using a score statistic for the joint likelihood of observed phenotypes and observed sequencing data provides an attractive approach to association testing for next-generation sequencing data. The joint model accounts for the genotype classification uncertainty via the posterior probabilities of the genotypes given the observed sequencing data, which gives the approach higher power than methods based on called genotypes. This strategy remains computationally feasible due to the use of score statistics. As part of the joint likelihood, we model the distribution of the phenotypes using a generalized linear model framework, which works for both quantitative and discrete phenotypes. Thus, the method presented here is applicable to case-control studies as well as mapping of quantitative traits. The model allows additional covariates that enable correction for confounding factors such as population stratification or cohort effects.
© 2012 Wiley Periodicals, Inc.

Mesh:

Year:  2012        PMID: 22570057     DOI: 10.1002/gepi.21636

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  17 in total

1.  Quantifying population genetic differentiation from next-generation sequencing data.

Authors:  Matteo Fumagalli; Filipe G Vieira; Thorfinn Sand Korneliussen; Tyler Linderoth; Emilia Huerta-Sánchez; Anders Albrechtsen; Rasmus Nielsen
Journal:  Genetics       Date:  2013-08-26       Impact factor: 4.562

2.  Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes.

Authors:  Kirk E Lohmueller; Thomas Sparsø; Qibin Li; Ehm Andersson; Thorfinn Korneliussen; Anders Albrechtsen; Karina Banasik; Niels Grarup; Ingileif Hallgrimsdottir; Kristoffer Kiil; Tuomas O Kilpeläinen; Nikolaj T Krarup; Tune H Pers; Gaston Sanchez; Youna Hu; Michael Degiorgio; Torben Jørgensen; Annelli Sandbæk; Torsten Lauritzen; Søren Brunak; Karsten Kristiansen; Yingrui Li; Torben Hansen; Jun Wang; Rasmus Nielsen; Oluf Pedersen
Journal:  Am J Hum Genet       Date:  2013-11-27       Impact factor: 11.025

3.  Estimating individual admixture proportions from next generation sequencing data.

Authors:  Line Skotte; Thorfinn Sand Korneliussen; Anders Albrechtsen
Journal:  Genetics       Date:  2013-09-11       Impact factor: 4.562

4.  Likelihood-based complex trait association testing for arbitrary depth sequencing data.

Authors:  Song Yan; Shuai Yuan; Zheng Xu; Baqun Zhang; Bo Zhang; Guolian Kang; Andrea Byrnes; Yun Li
Journal:  Bioinformatics       Date:  2015-05-14       Impact factor: 6.937

5.  Sensitive detection of chromatin-altering polymorphisms reveals autoimmune disease mechanisms.

Authors:  Ricardo Cruz-Herrera del Rosario; Jeremie Poschmann; Sigrid Laure Rouam; Eileen Png; Chiea Chuen Khor; Martin Lloyd Hibberd; Shyam Prabhakar
Journal:  Nat Methods       Date:  2015-03-23       Impact factor: 28.547

6.  Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data.

Authors:  Jonas Meisner; Anders Albrechtsen
Journal:  Genetics       Date:  2018-08-21       Impact factor: 4.562

7.  Analysis in case-control sequencing association studies with different sequencing depths.

Authors:  Sixing Chen; Xihong Lin
Journal:  Biostatistics       Date:  2020-07-01       Impact factor: 5.899

8.  Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic.

Authors:  Andriy Derkach; Theodore Chiang; Jiafen Gong; Laura Addis; Sara Dobbins; Ian Tomlinson; Richard Houlston; Deb K Pal; Lisa J Strug
Journal:  Bioinformatics       Date:  2014-04-14       Impact factor: 6.937

9.  Genomic variation from an extinct species is retained in the extant radiation following speciation reversal.

Authors:  David Frei; Rishi De-Kayne; Oliver M Selz; Ole Seehausen; Philine G D Feulner
Journal:  Nat Ecol Evol       Date:  2022-02-24       Impact factor: 19.100

10.  Calculation of Tajima's D and other neutrality test statistics from low depth next-generation sequencing data.

Authors:  Thorfinn Sand Korneliussen; Ida Moltke; Anders Albrechtsen; Rasmus Nielsen
Journal:  BMC Bioinformatics       Date:  2013-10-02       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.