Literature DB >> 25641202

PLNseq: a multivariate Poisson lognormal distribution for high-throughput matched RNA-sequencing read count data.

Hong Zhang1, Jinfeng Xu, Ning Jiang, Xiaohua Hu, Zewei Luo.   

Abstract

High-throughput RNA-sequencing (RNA-seq) technology provides an attractive platform for gene expression analysis. In many experimental settings, RNA-seq read counts are measured from matched samples or taken from the same subject under multiple treatment conditions. The induced correlation therefore should be evaluated and taken into account in deriving tests of differential expression. We proposed a novel method 'PLNseq', which uses a multivariate Poisson lognormal distribution to model matched read count data. The correlation is directly modeled through Gaussian random effects, and inferences are made by likelihood methods. A three-stage numerical algorithm is developed to estimate unknown parameters and conduct differential expression analysis. Results using simulated data demonstrate that our method performs reasonably well in terms of parameter estimation, DE analysis power, and robustness. PLNseq also has better control of FDRs than the benchmarks edgeR and DESeq2 in the situations where the correlation is different across the genes but can still be accurately estimated. Furthermore, direct evaluation of correlation through PLNseq enables us to develop a new and more powerful test for DE analysis. Application to a lung cancer study is provided to illustrate the practical utilities of our method. An R package implementing the method is also publicly available.
Copyright © 2015 John Wiley & Sons, Ltd.

Entities:  

Keywords:  Poisson lognormal model; RNA-seq; differential expression analysis; matched samples

Mesh:

Year:  2015        PMID: 25641202     DOI: 10.1002/sim.6449

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  3 in total

1.  A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.

Authors:  Elizabeth A Wynn; Brian E Vestal; Tasha E Fingerlin; Camille M Moore
Journal:  BMC Med Res Methodol       Date:  2022-05-28       Impact factor: 4.612

2.  multiDE: a dimension reduced model based statistical method for differential expression analysis using RNA-sequencing data with multiple treatment conditions.

Authors:  Guangliang Kang; Li Du; Hong Zhang
Journal:  BMC Bioinformatics       Date:  2016-06-22       Impact factor: 3.169

3.  MCMSeq: Bayesian hierarchical modeling of clustered and repeated measures RNA sequencing experiments.

Authors:  Brian E Vestal; Camille M Moore; Elizabeth Wynn; Laura Saba; Tasha Fingerlin; Katerina Kechris
Journal:  BMC Bioinformatics       Date:  2020-08-28       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.