A Bayesian missing value estimation method for gene expression profile data.

Shigeyuki Oba, Masa-aki Sato, Ichiro Takemasa, Morito Monden, Ken-ichi Matsubara, Shin Ishii.

Abstract

MOTIVATION: Gene expression profile analyses have been used in numerous studies covering a broad range of areas in biology. When unreliable measurements are excluded, missing values are introduced in gene expression profiles. Although existing multivariate analysis methods have difficulty with the treatment of missing values, this problem has received little attention. There are many options for dealing with missing values, and they can lead to drastically different results. Ignoring missing values is the simplest method and is frequently applied. This approach, however, has its flaws. In this article, we propose an estimation method for missing values, which is based on Bayesian principal component analysis (BPCA). Although simultaneously estimating a probabilistic model and latent variables within the framework of Bayesian inference is not new in principle, an actual BPCA implementation that makes it possible to estimate arbitrary missing variables is new in terms of statistical methodology.
RESULTS: When applied to DNA microarray data from various experimental conditions, the BPCA method exhibited markedly better estimation ability than other recently proposed methods, such as singular value decomposition and K-nearest neighbors. While the estimation performance of existing methods depends on model parameters whose determination is difficult, our BPCA method is free from this difficulty. Accordingly, the BPCA method provides accurate and convenient estimation for missing values.
AVAILABILITY: The software is available at http://hawaii.aist-nara.ac.jp/~shige-o/tools/.
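
For orientation, the K-nearest-neighbors baseline named in RESULTS can be sketched in a few lines. This is not the authors' BPCA method and not their software. The function name `knn_impute`, the toy matrix, the root-mean-square distance over shared samples, and the unweighted averaging are all assumptions made for illustration. The parameter `k` is an example of the kind of model parameter whose determination the abstract describes as difficult.

```python
import math


def knn_impute(matrix, k=2):
    """Fill None entries in a genes x samples matrix from the k nearest gene rows.

    Illustrative simplification (assumed, not taken from the paper):
    the distance between two genes is the root-mean-square difference over
    the samples observed in both rows, and each missing entry is replaced
    by the unweighted mean of that sample's value in the k nearest rows.
    """
    def distance(a, b):
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf  # no overlap, so the rows cannot be compared
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    imputed = [list(row) for row in matrix]  # copy; the input is left untouched
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbours: other genes that observe sample j.
            candidates = [
                (distance(row, other), other[j])
                for idx, other in enumerate(matrix)
                if idx != i and other[j] is not None
            ]
            candidates = [c for c in candidates if math.isfinite(c[0])]
            candidates.sort(key=lambda c: c[0])
            nearest = candidates[:k]
            if nearest:  # leave the entry missing if no neighbour is usable
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
    return imputed


# Toy data, made up for illustration: 4 genes x 3 samples, one missing entry.
expression = [
    [1.0, 2.0, None],
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.2],
    [9.0, 8.0, 7.0],
]
filled_k2 = knn_impute(expression, k=2)
filled_k3 = knn_impute(expression, k=3)
```

On this toy matrix, `k=2` and `k=3` give noticeably different estimates for the same missing entry, because the third neighbour is a dissimilar gene. That sensitivity is the kind of parameter dependence the abstract says BPCA avoids.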

Year:  2003        PMID: 14594714     DOI: 10.1093/bioinformatics/btg287

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  116 in total

1.  Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data.

Authors:  Carmen D Tekwe; Raymond J Carroll; Alan R Dabney
Journal:  Bioinformatics       Date:  2012-05-24       Impact factor: 6.937

2.  Changes in S1 neural responses during tactile discrimination learning.

Authors:  Michael C Wiest; Eric Thomson; Janaina Pantoja; Miguel A L Nicolelis
Journal:  J Neurophysiol       Date:  2010-05-05       Impact factor: 2.714

3.  Biological impact of missing-value imputation on downstream analyses of gene expression profiles.

Authors:  Sunghee Oh; Dongwan D Kang; Guy N Brock; George C Tseng
Journal:  Bioinformatics       Date:  2010-11-02       Impact factor: 6.937

4.  How to improve postgenomic knowledge discovery using imputation.

Authors:  Muhammad Shoaib B Sehgal; Iqbal Gondal; Laurence S Dooley; Ross Coppel
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-02-08

5.  MSPrep--summarization, normalization and diagnostics for processing of mass spectrometry-based metabolomic data.

Authors:  Grant Hughes; Charmion Cruickshank-Quinn; Richard Reisdorph; Sharon Lutz; Irina Petrache; Nichole Reisdorph; Russell Bowler; Katerina Kechris
Journal:  Bioinformatics       Date:  2013-10-29       Impact factor: 6.937

6.  A computational strategy to analyze label-free temporal bottom-up proteomics data.

Authors:  Xiuxia Du; Stephen J Callister; Nathan P Manes; Joshua N Adkins; Roxana A Alexandridis; Xiaohua Zeng; Jung Hyeob Roh; William E Smith; Timothy J Donohue; Samuel Kaplan; Richard D Smith; Mary S Lipton
Journal:  J Proteome Res       Date:  2008-04-29       Impact factor: 4.466

7.  Impact of missing value imputation on classification for DNA microarray gene expression data--a model-based study.

Authors:  Youting Sun; Ulisses Braga-Neto; Edward R Dougherty
Journal:  EURASIP J Bioinform Syst Biol       Date:  2010-03-02

8.  Shrinkage regression-based methods for microarray missing value imputation.

Authors:  Hsiuying Wang; Chia-Chun Chiu; Yi-Ching Wu; Wei-Sheng Wu
Journal:  BMC Syst Biol       Date:  2013-12-13

9.  Reverse engineering module networks by PSO-RNN hybrid modeling.

Authors:  Yuji Zhang; Jianhua Xuan; Benildo G de los Reyes; Robert Clarke; Habtom W Ressom
Journal:  BMC Genomics       Date:  2009-07-07       Impact factor: 3.969

10.  Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors:  Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969
