Literature DB >> 15731210

Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.

Muhammad Shoaib B Sehgal1, Iqbal Gondal, Laurence S Dooley.   

Abstract

MOTIVATION: Microarray data are used in a range of application areas in biology, although often it contains considerable numbers of missing values. These missing values can significantly affect subsequent statistical analysis and machine learning algorithms so there is a strong motivation to estimate these values as accurately as possible before using these algorithms. While many imputation algorithms have been proposed, more robust techniques need to be developed so that further analysis of biological data can be accurately undertaken. In this paper, an innovative missing value imputation algorithm called collateral missing value estimation (CMVE) is presented which uses multiple covariance-based imputation matrices for the final prediction of missing values. The matrices are computed and optimized using least square regression and linear programming methods.
RESULTS: The new CMVE algorithm has been compared with existing estimation techniques including Bayesian principal component analysis imputation (BPCA), least square impute (LSImpute) and K-nearest neighbour (KNN). All these methods were rigorously tested to estimate missing values in three separate non-time series (ovarian cancer based) and one time series (yeast sporulation) dataset. Each method was quantitatively analyzed using the normalized root mean square (NRMS) error measure, covering a wide range of randomly introduced missing value probabilities from 0.01 to 0.2. Experiments were also undertaken on the yeast dataset, which comprised 1.7% actual missing values, to test the hypothesis that CMVE performed better not only for randomly occurring but also for a real distribution of missing values. The results confirmed that CMVE consistently demonstrated superior and robust estimation capability of missing values compared with other methods for both series types of data, for the same order of computational complexity. A concise theoretical framework has also been formulated to validate the improved performance of the CMVE algorithm. AVAILABILITY: The CMVE software is available upon request from the authors.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15731210     DOI: 10.1093/bioinformatics/bti345

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  How to improve postgenomic knowledge discovery using imputation.

Authors:  Muhammad Shoaib B Sehgal; Iqbal Gondal; Laurence S Dooley; Ross Coppel
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-02-08

Review 2.  Metabolomics in the developmental origins of obesity and its cardiometabolic consequences.

Authors:  M F Hivert; W Perng; S M Watkins; C S Newgard; L C Kenny; B S Kristal; M E Patti; E Isganaitis; D L DeMeo; E Oken; M W Gillman
Journal:  J Dev Orig Health Dis       Date:  2015-01-29       Impact factor: 2.401

3.  Incorporating Nonlinear Relationships in Microarray Missing Value Imputation.

Authors:  Tianwei Yu; Hesen Peng; Wei Sun
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2011 May-Jun       Impact factor: 3.710

4.  Investigating the effects of imputation methods for modelling gene networks using a dynamic bayesian network from gene expression data.

Authors:  Lian En Chai; Chow Kuan Law; Mohd Saberi Mohamad; Chuii Khim Chong; Yee Wen Choon; Safaai Deris; Rosli Md Illias
Journal:  Malays J Med Sci       Date:  2014-03

5.  Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes.

Authors:  Guy N Brock; John R Shaffer; Richard E Blakesley; Meredith J Lotz; George C Tseng
Journal:  BMC Bioinformatics       Date:  2008-01-10       Impact factor: 3.169

6.  Quality determination and the repair of poor quality spots in array experiments.

Authors:  Brian D M Tom; Walter R Gilks; Elizabeth T Brooke-Powell; James W Ajioka
Journal:  BMC Bioinformatics       Date:  2005-09-26       Impact factor: 3.169

7.  Improving missing value imputation of microarray data by using spot quality weights.

Authors:  Peter Johansson; Jari Häkkinen
Journal:  BMC Bioinformatics       Date:  2006-06-16       Impact factor: 3.169

8.  Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors:  Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969

9.  Microarray missing data imputation based on a set theoretic framework and biological knowledge.

Authors:  Xiangchao Gan; Alan Wee-Chung Liew; Hong Yan
Journal:  Nucleic Acids Res       Date:  2006-03-20       Impact factor: 16.971

10.  A meta-data based method for DNA microarray imputation.

Authors:  Rebecka Jörnsten; Ming Ouyang; Hui-Yu Wang
Journal:  BMC Bioinformatics       Date:  2007-03-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.