Missing value estimation for DNA microarray gene expression data: local least squares imputation.

Hyunsoo Kim, Gene H Golub, Haesun Park.

Abstract

MOTIVATION: Gene expression data often contain missing expression values. Effective missing value estimation methods are needed because many algorithms for gene expression data analysis require a complete matrix of gene array values. In this paper, imputation methods based on the least squares formulation are proposed to estimate missing values in gene expression data; they exploit local similarity structures in the data as well as a least squares optimization process.
RESULTS: The proposed local least squares imputation method (LLSimpute) represents a target gene that has missing values as a linear combination of similar genes. The similar genes are chosen either as the k nearest neighbors or as the k coherent genes that have large absolute values of Pearson correlation coefficients. A non-parametric missing value estimation method for LLSimpute is designed by introducing an automatic k-value estimator. In our experiments, the proposed LLSimpute method shows competitive results when compared with other imputation methods for missing value estimation on various datasets and percentages of missing values in the data.
AVAILABILITY: The software is available at http://www.cs.umn.edu/~hskim/tools.html
CONTACT: hpark@cs.umn.edu
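
The idea described under RESULTS can be illustrated with a minimal NumPy sketch. It is not the authors' software: the function name `lls_impute_gene`, the default `k=5`, the use of Euclidean distance on the observed columns, and the restriction of candidate neighbors to genes with no missing values are all assumptions made for illustration. The Pearson-correlation variant and the automatic k-value estimator mentioned in the abstract are not implemented here.

```python
import numpy as np


def lls_impute_gene(G, target, k=5):
    """Impute NaN entries of row `target` in G (genes x experiments).

    Illustrative sketch only: neighbors are the k complete genes closest
    to the target gene in Euclidean distance over its observed columns.
    """
    G = np.asarray(G, dtype=float)
    w = G[target]
    miss = np.isnan(w)
    if not miss.any():
        return w.copy()
    obs = ~miss

    # Candidate neighbors: every other gene without missing values
    # (a simplifying assumption of this sketch).
    complete = ~np.isnan(G).any(axis=1)
    complete[target] = False
    cand = np.flatnonzero(complete)
    if cand.size == 0:
        raise ValueError("no complete genes available as neighbors")

    # Rank candidates by distance to the target on the observed columns.
    dist = np.linalg.norm(G[cand][:, obs] - w[obs], axis=1)
    nearest = cand[np.argsort(dist)[:k]]

    A = G[nearest][:, obs]   # neighbors at the target's observed columns
    B = G[nearest][:, miss]  # neighbors at the target's missing columns

    # Least squares coefficients x such that A^T x approximates w[obs].
    x, *_ = np.linalg.lstsq(A.T, w[obs], rcond=None)

    # The missing values are the same linear combination of the neighbors.
    out = w.copy()
    out[miss] = B.T @ x
    return out


if __name__ == "__main__":
    g1 = [1.0, 0.0, 2.0, 1.0]
    g2 = [0.0, 1.0, 1.0, 3.0]
    # Target gene equals 2*g1 + 3*g2, with its last value hidden.
    target = [2.0, 3.0, 7.0, np.nan]
    print(lls_impute_gene([g1, g2, target], target=2, k=2))
```

In this toy input the target gene is an exact linear combination of the two neighbor genes, so the least squares fit on the three observed columns recovers the coefficients and the hidden value is reconstructed from them. On real expression data the fit is only approximate, and the choice of k matters.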

Year:  2004        PMID: 15333461     DOI: 10.1093/bioinformatics/bth499

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

