Literature DB >> 30020412

Heritability estimation and differential analysis of count data with generalized linear mixed models in genomic sequencing studies.

Shiquan Sun1,2, Jiaqiang Zhu2, Sahar Mozaffari3, Carole Ober3, Mengjie Chen3,4, Xiang Zhou2,5.   

Abstract

Motivation: Genomic sequencing studies, including RNA sequencing and bisulfite sequencing studies, are becoming increasingly common and increasingly large. Large genomic sequencing studies open doors for accurate molecular trait heritability estimation and powerful differential analysis. Heritability estimation and differential analysis in sequencing studies requires the development of statistical methods that can properly account for the count nature of the sequencing data and that are computationally efficient for large datasets.
Results: Here, we develop such a method, PQLseq (Penalized Quasi-Likelihood for sequencing count data), to enable effective and efficient heritability estimation and differential analysis using the generalized linear mixed model framework. With extensive simulations and comparisons to previous methods, we show that PQLseq is the only method currently available that can produce unbiased heritability estimates for sequencing count data. In addition, we show that PQLseq is well suited for differential analysis in large sequencing studies, providing calibrated type I error control and more power compared to the standard linear mixed model methods. Finally, we apply PQLseq to perform gene expression heritability estimation and differential expression analysis in a large RNA sequencing study in the Hutterites. Availability and implementation: PQLseq is implemented as an R package with source code freely available at www.xzlab.org/software.html and https://cran.r-project.org/web/packages/PQLseq/index.html. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2019        PMID: 30020412      PMCID: PMC6361238          DOI: 10.1093/bioinformatics/bty644

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  49 in total

1.  Estimation using penalized quasilikelihood and quasi-pseudo-likelihood in Poisson mixed models.

Authors:  Xihong Lin
Journal:  Lifetime Data Anal       Date:  2007-12-16       Impact factor: 1.588

2.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

3.  Genetic inheritance of gene expression in human cell lines.

Authors:  S A Monks; A Leonardson; H Zhu; P Cundiff; P Pietrusiak; S Edwards; J W Phillips; A Sachs; E E Schadt
Journal:  Am J Hum Genet       Date:  2004-10-21       Impact factor: 11.025

4.  Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.

Authors: 
Journal:  Science       Date:  2015-05-07       Impact factor: 47.728

5.  The genetic architecture of gene expression levels in wild baboons.

Authors:  Jenny Tung; Xiang Zhou; Susan C Alberts; Matthew Stephens; Yoav Gilad
Journal:  Elife       Date:  2015-02-25       Impact factor: 8.140

6.  MOABS: model based analysis of bisulfite sequencing data.

Authors:  Deqiang Sun; Yuanxin Xi; Benjamin Rodriguez; Hyun Jung Park; Pan Tong; Mira Meong; Margaret A Goodell; Wei Li
Journal:  Genome Biol       Date:  2014-02-24       Impact factor: 13.583

7.  Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models.

Authors:  Ping Zeng; Xiang Zhou
Journal:  Nat Commun       Date:  2017-09-06       Impact factor: 14.919

8.  Contribution of genetic variation to transgenerational inheritance of DNA methylation.

Authors:  Allan F McRae; Joseph E Powell; Anjali K Henders; Lisa Bowdler; Gibran Hemani; Sonia Shah; Jodie N Painter; Nicholas G Martin; Peter M Visscher; Grant W Montgomery
Journal:  Genome Biol       Date:  2014-05-29       Impact factor: 13.583

Review 9.  A survey of best practices for RNA-seq data analysis.

Authors:  Ana Conesa; Pedro Madrigal; Sonia Tarazona; David Gomez-Cabrero; Alejandra Cervera; Andrew McPherson; Michał Wojciech Szcześniak; Daniel J Gaffney; Laura L Elo; Xuegong Zhang; Ali Mortazavi
Journal:  Genome Biol       Date:  2016-01-26       Impact factor: 13.583

10.  Integrated analyses of gene expression and genetic association studies in a founder population.

Authors:  Darren A Cusanovich; Minal Caliskan; Christine Billstrand; Katelyn Michelini; Claudia Chavarria; Sherryl De Leon; Amy Mitrano; Noah Lewellyn; Jack A Elias; Geoffrey L Chupp; Roberto M Lang; Sanjiv J Shah; Jeanne M Decara; Yoav Gilad; Carole Ober
Journal:  Hum Mol Genet       Date:  2016-02-29       Impact factor: 6.150

View more
  21 in total

1.  Age influences domestic dog cognitive performance independent of average breed lifespan.

Authors:  Marina M Watowich; Evan L MacLean; Brian Hare; Josep Call; Juliane Kaminski; Ádám Miklósi; Noah Snyder-Mackler
Journal:  Anim Cogn       Date:  2020-04-30       Impact factor: 3.084

2.  Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies.

Authors:  Shiquan Sun; Jiaqiang Zhu; Xiang Zhou
Journal:  Nat Methods       Date:  2020-01-27       Impact factor: 28.547

3.  Trade-offs of Linear Mixed Models in Genome-Wide Association Studies.

Authors:  Haohan Wang; Bryon Aragam; Eric P Xing
Journal:  J Comput Biol       Date:  2022-02-25       Impact factor: 1.479

4.  Multi-scale inference of genetic trait architecture using biologically annotated neural networks.

Authors:  Pinar Demetci; Wei Cheng; Gregory Darnell; Xiang Zhou; Sohini Ramachandran; Lorin Crawford
Journal:  PLoS Genet       Date:  2021-08-19       Impact factor: 5.917

5.  Spatially informed cell-type deconvolution for spatial transcriptomics.

Authors:  Ying Ma; Xiang Zhou
Journal:  Nat Biotechnol       Date:  2022-05-02       Impact factor: 68.164

Review 6.  Genetic prediction of complex traits with polygenic scores: a statistical review.

Authors:  Ying Ma; Xiang Zhou
Journal:  Trends Genet       Date:  2021-07-06       Impact factor: 11.639

7.  Effective and scalable single-cell data alignment with non-linear canonical correlation analysis.

Authors:  Jialu Hu; Mengjie Chen; Xiang Zhou
Journal:  Nucleic Acids Res       Date:  2022-02-28       Impact factor: 16.971

8.  Hybrid Stem Cell States: Insights Into the Relationship Between Mammary Development and Breast Cancer Using Single-Cell Transcriptomics.

Authors:  Tasha Thong; Yutong Wang; Michael D Brooks; Christopher T Lee; Clayton Scott; Laura Balzano; Max S Wicha; Justin A Colacino
Journal:  Front Cell Dev Biol       Date:  2020-05-08

Review 9.  Statistical methods for mediation analysis in the era of high-throughput genomics: Current successes and future challenges.

Authors:  Ping Zeng; Zhonghe Shao; Xiang Zhou
Journal:  Comput Struct Biotechnol J       Date:  2021-05-26       Impact factor: 7.271

10.  Plasma cell-free DNA methylation marks for episodic memory impairment: a pilot twin study.

Authors:  M Konki; N Lindgren; M Kyläniemi; R Venho; E Laajala; B Ghimire; R Lahesmaa; J Kaprio; J O Rinne; R J Lund
Journal:  Sci Rep       Date:  2020-08-25       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.