
Shrinkage regression-based methods for microarray missing value imputation.

Hsiuying Wang, Chia-Chun Chiu, Yi-Ching Wu, Wei-Sheng Wu.   

Abstract

BACKGROUND: Missing values are common in microarray data, which usually contain more than 5% missing values, with up to 90% of genes affected. Inaccurate missing value estimation reduces the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, regression-based methods are very popular and have been shown to outperform the other types of methods on many test microarray datasets.
RESULTS: To further improve the performance of regression-based methods, we propose shrinkage regression-based methods. Our methods take advantage of the correlation structure in microarray data and select genes similar to the target gene by Pearson correlation coefficients. In addition, our methods incorporate the least squares principle, use a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the adjusted coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation on six test microarray datasets than the existing regression-based methods do.
CONCLUSIONS: Imputation of missing values is a very important aspect of microarray data analyses because most downstream analyses require a complete dataset. Exploring accurate and efficient methods for estimating missing values has therefore become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
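
The steps named in RESULTS (select similar genes by Pearson correlation, fit by least squares, shrink the coefficients, then estimate the missing value) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the shrinkage estimator, so the positive-part Stein-type factor used here is an assumption, as are the function name `shrinkage_impute`, the parameter `k`, the use of absolute correlation for ranking, and the simplification that only one entry of the matrix is missing.

```python
import numpy as np

def shrinkage_impute(X, target, col, k=3):
    """Estimate the single missing entry X[target, col].

    X is a genes x samples matrix whose only NaN is X[target, col]
    (a simplifying assumption for this sketch).
    Returns (estimate, shrinkage_factor).
    """
    obs = [j for j in range(X.shape[1]) if j != col]      # observed samples
    others = [g for g in range(X.shape[0]) if g != target]
    y = X[target, obs]

    # 1. Rank candidate genes by absolute Pearson correlation with the target.
    corr = np.array([abs(np.corrcoef(y, X[g, obs])[0, 1]) for g in others])
    top = [others[i] for i in np.argsort(-corr)[:k]]

    # 2. Least squares fit of the target gene on the selected genes.
    A = X[top][:, obs].T                                   # shape (n_obs, k)
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)

    # 3. Shrink the coefficients toward zero (assumed Stein-type factor).
    resid = y - A @ beta
    sigma2 = resid @ resid / max(len(obs) - len(top), 1)   # residual variance
    signal = beta @ (A.T @ A) @ beta                       # equals ||A beta||^2
    factor = 1.0 - (len(top) - 2) * sigma2 / signal if signal > 0 else 0.0
    factor = float(np.clip(factor, 0.0, 1.0))

    # 4. Predict the missing value with the shrunken coefficients.
    estimate = float(factor * (X[top, col] @ beta))
    return estimate, factor
```

When the selected genes explain the target gene almost exactly, the residual variance is near zero, the factor is close to 1, and the estimate reduces to the ordinary least squares prediction; noisier fits are pulled more strongly toward zero.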

Year:  2013        PMID: 24565159      PMCID: PMC4028886          DOI: 10.1186/1752-0509-7-S6-S11

Source DB:  PubMed          Journal:  BMC Syst Biol        ISSN: 1752-0509


