Literature DB >> 23001152

A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data.

Hao Wu1, Chi Wang, Zhijin Wu.   

Abstract

Recent developments in RNA-sequencing (RNA-seq) technology have led to a rapid increase in gene expression data in the form of counts. RNA-seq can be used for a variety of applications, however, identifying differential expression (DE) remains a key task in functional genomics. There have been a number of statistical methods for DE detection for RNA-seq data. One common feature of several leading methods is the use of the negative binomial (Gamma-Poisson mixture) model. That is, the unobserved gene expression is modeled by a gamma random variable and, given the expression, the sequencing read counts are modeled as Poisson. The distinct feature in various methods is how the variance, or dispersion, in the Gamma distribution is modeled and estimated. We evaluate several large public RNA-seq datasets and find that the estimated dispersion in existing methods does not adequately capture the heterogeneity of biological variance among samples. We present a new empirical Bayes shrinkage estimate of the dispersion parameters and demonstrate improved DE detection.

Entities:  

Mesh:

Year:  2012        PMID: 23001152      PMCID: PMC3590927          DOI: 10.1093/biostatistics/kxs033

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  21 in total

1.  Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors:  Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2004-02-12

2.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.

Authors:  Leming Shi; Laura H Reid; Wendell D Jones; Richard Shippy; Janet A Warrington; Shawn C Baker; Patrick J Collins; Francoise de Longueville; Ernest S Kawasaki; Kathleen Y Lee; Yuling Luo; Yongming Andrew Sun; James C Willey; Robert A Setterquist; Gavin M Fischer; Weida Tong; Yvonne P Dragan; David J Dix; Felix W Frueh; Frederico M Goodsaid; Damir Herman; Roderick V Jensen; Charles D Johnson; Edward K Lobenhofer; Raj K Puri; Uwe Schrf; Jean Thierry-Mieg; Charles Wang; Mike Wilson; Paul K Wolber; Lu Zhang; Shashi Amur; Wenjun Bao; Catalin C Barbacioru; Anne Bergstrom Lucas; Vincent Bertholet; Cecilie Boysen; Bud Bromley; Donna Brown; Alan Brunner; Roger Canales; Xiaoxi Megan Cao; Thomas A Cebula; James J Chen; Jing Cheng; Tzu-Ming Chu; Eugene Chudin; John Corson; J Christopher Corton; Lisa J Croner; Christopher Davies; Timothy S Davison; Glenda Delenstarr; Xutao Deng; David Dorris; Aron C Eklund; Xiao-hui Fan; Hong Fang; Stephanie Fulmer-Smentek; James C Fuscoe; Kathryn Gallagher; Weigong Ge; Lei Guo; Xu Guo; Janet Hager; Paul K Haje; Jing Han; Tao Han; Heather C Harbottle; Stephen C Harris; Eli Hatchwell; Craig A Hauser; Susan Hester; Huixiao Hong; Patrick Hurban; Scott A Jackson; Hanlee Ji; Charles R Knight; Winston P Kuo; J Eugene LeClerc; Shawn Levy; Quan-Zhen Li; Chunmei Liu; Ying Liu; Michael J Lombardi; Yunqing Ma; Scott R Magnuson; Botoul Maqsodi; Tim McDaniel; Nan Mei; Ola Myklebost; Baitang Ning; Natalia Novoradovskaya; Michael S Orr; Terry W Osborn; Adam Papallo; Tucker A Patterson; Roger G Perkins; Elizabeth H Peters; Ron Peterson; Kenneth L Philips; P Scott Pine; Lajos Pusztai; Feng Qian; Hongzu Ren; Mitch Rosen; Barry A Rosenzweig; Raymond R Samaha; Mark Schena; Gary P Schroth; Svetlana Shchegrova; Dave D Smith; Frank Staedtler; Zhenqiang Su; Hongmei Sun; Zoltan Szallasi; Zivana Tezak; Danielle Thierry-Mieg; Karol L Thompson; Irina Tikhonova; Yaron Turpaz; Beena Vallanat; Christophe Van; Stephen J Walker; Sue Jane Wang; Yonghong Wang; Russ Wolfinger; Alex Wong; Jie Wu; Chunlin Xiao; Qian Xie; Jun Xu; Wen Yang; Liang Zhang; Sheng Zhong; Yaping Zong; William Slikker
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

3.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

4.  Sex-specific and lineage-specific alternative splicing in primates.

Authors:  Ran Blekhman; John C Marioni; Paul Zumbo; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2009-12-15       Impact factor: 9.043

5.  Polymorphic cis- and trans-regulation of human gene expression.

Authors:  Vivian G Cheung; Renuka R Nayak; Isabel Xiaorong Wang; Susannah Elwyn; Sarah M Cousins; Michael Morley; Richard S Spielman
Journal:  PLoS Biol       Date:  2010-09-14       Impact factor: 8.029

6.  Sequencing technology does not eliminate biological variability.

Authors:  Kasper D Hansen; Zhijin Wu; Rafael A Irizarry; Jeffrey T Leek
Journal:  Nat Biotechnol       Date:  2011-07-11       Impact factor: 54.908

7.  Removing technical variability in RNA-seq data using conditional quantile normalization.

Authors:  Kasper D Hansen; Rafael A Irizarry; Zhijin Wu
Journal:  Biostatistics       Date:  2012-01-27       Impact factor: 5.899

8.  ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets.

Authors:  Alyssa C Frazee; Ben Langmead; Jeffrey T Leek
Journal:  BMC Bioinformatics       Date:  2011-11-16       Impact factor: 3.169

9.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  90 in total

1.  Differential methylation analysis for bisulfite sequencing using DSS.

Authors:  Hao Feng; Hao Wu
Journal:  Quant Biol       Date:  2019-12-15

2.  Count-based differential expression analysis of RNA sequencing data using R and Bioconductor.

Authors:  Simon Anders; Davis J McCarthy; Yunshun Chen; Michal Okoniewski; Gordon K Smyth; Wolfgang Huber; Mark D Robinson
Journal:  Nat Protoc       Date:  2013-08-22       Impact factor: 13.491

3.  5-Hydroxymethylcytosine alterations in the human postmortem brains of autism spectrum disorder.

Authors:  Ying Cheng; Ziyi Li; Sasicha Manupipatpong; Li Lin; Xuekun Li; Tianlei Xu; Yong-Hui Jiang; Qiang Shu; Hao Wu; Peng Jin
Journal:  Hum Mol Genet       Date:  2018-09-01       Impact factor: 6.150

4.  Robustly detecting differential expression in RNA sequencing data using observation weights.

Authors:  Xiaobei Zhou; Helen Lindsay; Mark D Robinson
Journal:  Nucleic Acids Res       Date:  2014-04-20       Impact factor: 16.971

5.  A novel statistical method for quantitative comparison of multiple ChIP-seq datasets.

Authors:  Li Chen; Chi Wang; Zhaohui S Qin; Hao Wu
Journal:  Bioinformatics       Date:  2015-02-13       Impact factor: 6.937

6.  Dissecting differential signals in high-throughput data from complex tissues.

Authors:  Ziyi Li; Zhijin Wu; Peng Jin; Hao Wu
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

7.  Statistical Modeling of High Dimensional Counts.

Authors:  Michael I Love
Journal:  Methods Mol Biol       Date:  2021

8.  Active N6-Methyladenine Demethylation by DMAD Regulates Gene Expression by Coordinating with Polycomb Protein in Neurons.

Authors:  Bing Yao; Yujing Li; Zhiqin Wang; Li Chen; Mickael Poidevin; Can Zhang; Li Lin; Feng Wang; Han Bao; Bin Jiao; Junghwa Lim; Ying Cheng; Luoxiu Huang; Brittany Lynn Phillips; Tianlei Xu; Ranhui Duan; Kenneth H Moberg; Hao Wu; Peng Jin
Journal:  Mol Cell       Date:  2018-08-02       Impact factor: 17.970

9.  Age-related epigenome-wide DNA methylation and hydroxymethylation in longitudinal mouse blood.

Authors:  Joseph Kochmanski; Elizabeth H Marchlewicz; Raymond G Cavalcante; Maureen A Sartor; Dana C Dolinoy
Journal:  Epigenetics       Date:  2018-08-23       Impact factor: 4.528

10.  DiPhiSeq: robust comparison of expression levels on RNA-Seq data with large sample sizes.

Authors:  Jun Li; Alicia T Lamere
Journal:  Bioinformatics       Date:  2019-07-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.