Literature DB >> 26519502

Bayesian inference with historical data-based informative priors improves detection of differentially expressed genes.

Ben Li1, Zhaonan Sun2, Qing He1, Yu Zhu2, Zhaohui S Qin3.   

Abstract

MOTIVATION: Modern high-throughput biotechnologies such as microarray are capable of producing a massive amount of information for each sample. However, in a typical high-throughput experiment, only limited number of samples were assayed, thus the classical 'large p, small n' problem. On the other hand, rapid propagation of these high-throughput technologies has resulted in a substantial collection of data, often carried out on the same platform and using the same protocol. It is highly desirable to utilize the existing data when performing analysis and inference on a new dataset.
RESULTS: Utilizing existing data can be carried out in a straightforward fashion under the Bayesian framework in which the repository of historical data can be exploited to build informative priors and used in new data analysis. In this work, using microarray data, we investigate the feasibility and effectiveness of deriving informative priors from historical data and using them in the problem of detecting differentially expressed genes. Through simulation and real data analysis, we show that the proposed strategy significantly outperforms existing methods including the popular and state-of-the-art Bayesian hierarchical model-based approaches. Our work illustrates the feasibility and benefits of exploiting the increasingly available genomics big data in statistical inference and presents a promising practical strategy for dealing with the 'large p, small n' problem.
AVAILABILITY AND IMPLEMENTATION: Our method is implemented in R package IPBT, which is freely available from https://github.com/benliemory/IPBT CONTACT: yuzhu@purdue.edu; zhaohui.qin@emory.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2015        PMID: 26519502      PMCID: PMC4907396          DOI: 10.1093/bioinformatics/btv631

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  24 in total

1.  Analyzing 'omics data using hierarchical models.

Authors:  Hongkai Ji; X Shirley Liu
Journal:  Nat Biotechnol       Date:  2010-04       Impact factor: 54.908

2.  TileMap: create chromosomal map of tiling array hybridizations.

Authors:  Hongkai Ji; Wing Hung Wong
Journal:  Bioinformatics       Date:  2005-07-26       Impact factor: 6.937

3.  Frozen robust multiarray analysis (fRMA).

Authors:  Matthew N McCall; Benjamin M Bolstad; Rafael A Irizarry
Journal:  Biostatistics       Date:  2010-01-22       Impact factor: 5.899

4.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

5.  Background adjustment for DNA microarrays using a database of microarray experiments.

Authors:  Yunxia Sui; Xiaoyue Zhao; Terence P Speed; Zhijin Wu
Journal:  J Comput Biol       Date:  2009-11       Impact factor: 1.479

6.  A quantum leap in the reproducibility, precision, and sensitivity of gene expression profile analysis even when sample size is extremely small.

Authors:  Kevin Lim; Zhenhua Li; Kwok Pui Choi; Limsoon Wong
Journal:  J Bioinform Comput Biol       Date:  2015-05-26       Impact factor: 1.122

7.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

8.  Bayesian modeling of differential gene expression.

Authors:  Alex Lewin; Sylvia Richardson; Clare Marshall; Anne Glazier; Tim Aitman
Journal:  Biometrics       Date:  2006-03       Impact factor: 2.571

9.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nucleic Acids Res       Date:  2008-11-25       Impact factor: 16.971

10.  Robust modeling of differential gene expression data using normal/independent distributions: a Bayesian approach.

Authors:  Mojtaba Ganjali; Taban Baghfalaki; Damon Berridge
Journal:  PLoS One       Date:  2015-04-24       Impact factor: 3.240

View more
  4 in total

1.  Improving Hierarchical Models Using Historical Data with Applications in High-Throughput Genomics Data Analysis.

Authors:  Ben Li; Yunxiao Li; Zhaohui S Qin
Journal:  Stat Biosci       Date:  2016-07-08

2.  Identifying differentially expressed genes from cross-site integrated data based on relative expression orderings.

Authors:  Hao Cai; Xiangyu Li; Jing Li; Qirui Liang; Weicheng Zheng; Qingzhou Guan; Zheng Guo; Xianlong Wang
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

3.  Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously.

Authors:  Chenchen Ma; Tieming Ji
Journal:  BMC Bioinformatics       Date:  2018-09-20       Impact factor: 3.169

4.  The prognostic and clinical significance of IFI44L aberrant downregulation in patients with oral squamous cell carcinoma.

Authors:  Deming Ou; Ying Wu
Journal:  BMC Cancer       Date:  2021-12-13       Impact factor: 4.430

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.