Literature DB >> 14751970

Gaussian mixture clustering and imputation of microarray data.

Ming Ouyang1, William J Welsh, Panos Georgopoulos.   

Abstract

MOTIVATION: In microarray experiments, missing entries arise from blemishes on the chips. In large-scale studies, virtually every chip contains some missing entries and more than 90% of the genes are affected. Many analysis methods require a full set of data. Either those genes with missing entries are excluded, or the missing entries are filled with estimates prior to the analyses. This study compares methods of missing value estimation.
RESULTS: Two evaluation metrics of imputation accuracy are employed. First, the root mean squared error measures the difference between the true values and the imputed values. Second, the number of mis-clustered genes measures the difference between clustering with true values and that with imputed values; it examines the bias introduced by imputation to clustering. The Gaussian mixture clustering with model averaging imputation is superior to all other imputation methods, according to both evaluation metrics, on both time-series (correlated) and non-time series (uncorrelated) data sets.

Entities:  

Mesh:

Year:  2004        PMID: 14751970     DOI: 10.1093/bioinformatics/bth007

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  34 in total

1.  Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data.

Authors:  Carmen D Tekwe; Raymond J Carroll; Alan R Dabney
Journal:  Bioinformatics       Date:  2012-05-24       Impact factor: 6.937

2.  Biological impact of missing-value imputation on downstream analyses of gene expression profiles.

Authors:  Sunghee Oh; Dongwan D Kang; Guy N Brock; George C Tseng
Journal:  Bioinformatics       Date:  2010-11-02       Impact factor: 6.937

3.  How to improve postgenomic knowledge discovery using imputation.

Authors:  Muhammad Shoaib B Sehgal; Iqbal Gondal; Laurence S Dooley; Ross Coppel
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-02-08

4.  Bayesian model-based tight clustering for time course data.

Authors:  Yongsung Joo; G Casella; J Hobert
Journal:  Comput Stat       Date:  2010-03       Impact factor: 1.000

5.  Shrinkage regression-based methods for microarray missing value imputation.

Authors:  Hsiuying Wang; Chia-Chun Chiu; Yi-Ching Wu; Wei-Sheng Wu
Journal:  BMC Syst Biol       Date:  2013-12-13

Review 6.  Metabolomics in the developmental origins of obesity and its cardiometabolic consequences.

Authors:  M F Hivert; W Perng; S M Watkins; C S Newgard; L C Kenny; B S Kristal; M E Patti; E Isganaitis; D L DeMeo; E Oken; M W Gillman
Journal:  J Dev Orig Health Dis       Date:  2015-01-29       Impact factor: 2.401

7.  Combining Fourier and lagged k-nearest neighbor imputation for biomedical time series data.

Authors:  Shah Atiqur Rahman; Yuxiao Huang; Jan Claassen; Nathaniel Heintzman; Samantha Kleinberg
Journal:  J Biomed Inform       Date:  2015-10-21       Impact factor: 6.317

8.  Quality Weighted Mean and T-test in Microarray Analysis Lead to Improved Accuracy in Gene Expression Measurements and Reduced Type I and II Errors in Differential Expression Detection.

Authors:  Shouguo Gao; Shuang Jia; Martin Hessner; Xujing Wang
Journal:  J Comput Sci Syst Biol       Date:  2008-12-26

9.  Investigating the effects of imputation methods for modelling gene networks using a dynamic bayesian network from gene expression data.

Authors:  Lian En Chai; Chow Kuan Law; Mohd Saberi Mohamad; Chuii Khim Chong; Yee Wen Choon; Safaai Deris; Rosli Md Illias
Journal:  Malays J Med Sci       Date:  2014-03

10.  Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors:  Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.