Literature DB >> 25395270

Hierarchical Bayesian model for rare variant association analysis integrating genotype uncertainty in human sequence data.

Liang He1, Janne Pitkäniemi, Antti-Pekka Sarin, Veikko Salomaa, Mikko J Sillanpää, Samuli Ripatti.   

Abstract

Next-generation sequencing (NGS) has led to the study of rare genetic variants, which possibly explain the missing heritability for complex diseases. Most existing methods for rare variant (RV) association detection do not account for the common presence of sequencing errors in NGS data. The errors can largely affect the power and perturb the accuracy of association tests due to rare observations of minor alleles. We developed a hierarchical Bayesian approach to estimate the association between RVs and complex diseases. Our integrated framework combines the misclassification probability with shrinkage-based Bayesian variable selection. It allows for flexibility in handling neutral and protective RVs with measurement error, and is robust enough for detecting causal RVs with a wide spectrum of minor allele frequency (MAF). Imputation uncertainty and MAF are incorporated into the integrated framework to achieve the optimal statistical power. We demonstrate that sequencing error does significantly affect the findings, and our proposed model can take advantage of it to improve statistical power in both simulated and real data. We further show that our model outperforms existing methods, such as sequence kernel association test (SKAT). Finally, we illustrate the behavior of the proposed method using a Finnish low-density lipoprotein cholesterol study, and show that it identifies an RV known as FH North Karelia in LDLR gene with three carriers in 1,155 individuals, which is missed by both SKAT and Granvil.
© 2014 WILEY PERIODICALS, INC.

Entities:  

Keywords:  LDL-C; classification error; rare variant; shrinkage-based Bayesian variable selection

Mesh:

Substances:

Year:  2014        PMID: 25395270     DOI: 10.1002/gepi.21871

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  5 in total

1.  Detection and Classification of Hard and Soft Sweeps from Unphased Genotypes by Multilocus Genotype Identity.

Authors:  Alexandre M Harris; Nandita R Garud; Michael DeGiorgio
Journal:  Genetics       Date:  2018-10-12       Impact factor: 4.562

Review 2.  The impact of rare and low-frequency genetic variants in common disease.

Authors:  Lorenzo Bomba; Klaudia Walter; Nicole Soranzo
Journal:  Genome Biol       Date:  2017-04-27       Impact factor: 13.583

3.  Efficient and flexible Integration of variant characteristics in rare variant association studies using integrated nested Laplace approximation.

Authors:  Hana Susak; Laura Serra-Saurina; German Demidov; Raquel Rabionet; Laura Domènech; Mattia Bosio; Francesc Muyas; Xavier Estivill; Geòrgia Escaramís; Stephan Ossowski
Journal:  PLoS Comput Biol       Date:  2021-02-19       Impact factor: 4.475

4.  Neuregulin signaling pathway in smoking behavior.

Authors:  R Gupta; B Qaiser; L He; T S Hiekkalinna; A B Zheutlin; S Therman; M Ollikainen; S Ripatti; M Perola; V Salomaa; L Milani; T D Cannon; P A F Madden; T Korhonen; J Kaprio; A Loukola
Journal:  Transl Psychiatry       Date:  2017-08-22       Impact factor: 6.222

5.  Allele balance bias identifies systematic genotyping errors and false disease associations.

Authors:  Francesc Muyas; Mattia Bosio; Anna Puig; Hana Susak; Laura Domènech; Georgia Escaramis; Luis Zapata; German Demidov; Xavier Estivill; Raquel Rabionet; Stephan Ossowski
Journal:  Hum Mutat       Date:  2018-11-23       Impact factor: 4.878

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.