Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reuse of imputed data in microarray analysis increases imputation efficiency.

Literature DB >> 15504240

Reuse of imputed data in microarray analysis increases imputation efficiency.

Ki-Yeol Kim¹, Byoung-Jin Kim, Gwan-Su Yi.

Abstract

BACKGROUND: The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analysis require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked.
RESULTS: We developed a new cluster-based imputation method called sequential K-nearest neighbor (SKNN) method. This imputes the missing values sequentially from the gene having least missing values, and uses the imputed values for the later imputation. Although it uses the imputed values, the efficiency of this new method is greatly improved in its accuracy and computational complexity over the conventional KNN-based method and other methods based on maximum likelihood estimation. The performance of SKNN was in particular higher than other imputation methods for the data with high missing rates and large number of experiments. Application of Expectation Maximization (EM) to the SKNN method improved the accuracy, but increased computational time proportional to the number of iterations. The Multiple Imputation (MI) method, which is well known but not applied previously to microarray data, showed a similarly high accuracy as the SKNN method, with slightly higher dependency on the types of data sets.
CONCLUSIONS: Sequential reuse of imputed data in KNN-based imputation greatly increases the efficiency of imputation. The SKNN method should be practically useful to save the data of some microarray experiments which have high amounts of missing entries. The SKNN method generates reliable imputed values which can be used for further cluster-based analysis of microarray data.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2004 PMID： 15504240 PMCID： PMC528735 DOI： 10.1186/1471-2105-5-160

Source DB: PubMed Journal: BMC Bioinformatics ISSN： 1471-2105 Impact factor: 3.169

11 in total

1. Missing value estimation methods for DNA microarrays.

Authors: O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal: Bioinformatics Date: 2001-06 Impact factor: 6.937

Review 2. Computational analysis of microarray data.

Authors: J Quackenbush
Journal: Nat Rev Genet Date: 2001-06 Impact factor: 53.242

3. Variation in gene expression patterns in follicular lymphoma and the response to rituximab.

Authors: Sean P Bohen; Olga G Troyanskaya; Orly Alter; Roger Warnke; David Botstein; Patrick O Brown; Ronald Levy
Journal: Proc Natl Acad Sci U S A Date: 2003-02-05 Impact factor: 11.205

4. Genomic expression programs in the response of yeast cells to environmental changes.

Authors: A P Gasch; P T Spellman; C M Kao; O Carmel-Harel; M B Eisen; G Storz; D Botstein; P O Brown
Journal: Mol Biol Cell Date: 2000-12 Impact factor: 4.138

5. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.

Authors: A A Alizadeh; M B Eisen; R E Davis; C Ma; I S Lossos; A Rosenwald; J C Boldrick; H Sabet; T Tran; X Yu; J I Powell; L Yang; G E Marti; T Moore; J Hudson; L Lu; D B Lewis; R Tibshirani; G Sherlock; W C Chan; T C Greiner; D D Weisenburger; J O Armitage; R Warnke; R Levy; W Wilson; M R Grever; J C Byrd; D Botstein; P O Brown; L M Staudt
Journal: Nature Date: 2000-02-03 Impact factor: 49.962

6. Diversity of gene expression in adenocarcinoma of the lung.

Authors: M E Garber; O G Troyanskaya; K Schluens; S Petersen; Z Thaesler; M Pacyna-Gengelbach; M van de Rijn; G D Rosen; C M Perou; R I Whyte; R B Altman; P O Brown; D Botstein; I Petersen
Journal: Proc Natl Acad Sci U S A Date: 2001-11-13 Impact factor: 11.205

7. Cluster analysis and display of genome-wide expression patterns.

Authors: M B Eisen; P T Spellman; P O Brown; D Botstein
Journal: Proc Natl Acad Sci U S A Date: 1998-12-08 Impact factor: 11.205

8. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors: P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal: Mol Biol Cell Date: 1998-12 Impact factor: 4.138

9. Genome-wide analysis of gene expression regulated by the calcineurin/Crz1p signaling pathway in Saccharomyces cerevisiae.

Authors: Hiroyuki Yoshimoto; Kirstie Saltsman; Audrey P Gasch; Hong Xia Li; Nobuo Ogawa; David Botstein; Patrick O Brown; Martha S Cyert
Journal: J Biol Chem Date: 2002-06-10 Impact factor: 5.157

10. Clustering gene-expression data with repeated measurements.

Authors: Ka Yee Yeung; Mario Medvedovic; Roger E Bumgarner
Journal: Genome Biol Date: 2003-04-25 Impact factor: 13.583

21 in total

1. How to improve postgenomic knowledge discovery using imputation.

Authors: Muhammad Shoaib B Sehgal; Iqbal Gondal; Laurence S Dooley; Ross Coppel
Journal: EURASIP J Bioinform Syst Biol Date: 2009-02-08

2. A flexible, interpretable, and accurate approach for imputing the expression of unmeasured genes.

Authors: Christopher A Mancuso; Jacob L Canfield; Deepak Singla; Arjun Krishnan
Journal: Nucleic Acids Res Date: 2020-12-02 Impact factor: 16.971

3. Gram-positive pathogenic bacteria induce a common early response in human monocytes.

Authors: Svetlin Tchatalbachev; Rohit Ghai; Hamid Hossain; Trinad Chakraborty
Journal: BMC Microbiol Date: 2010-11-02 Impact factor: 3.605

4. DNA methylation profiles of airway epithelial cells and PBMCs from healthy, atopic and asthmatic children.

Authors: Dorota Stefanowicz; Tillie-Louise Hackett; Farshid S Garmaroudi; Oliver P Günther; Sarah Neumann; Erika N Sutanto; Kak-Ming Ling; Michael S Kobor; Anthony Kicic; Stephen M Stick; Peter D Paré; Darryl A Knight
Journal: PLoS One Date: 2012-09-06 Impact factor: 3.240

5. Sertoli-cell-specific knockout of connexin 43 leads to multiple alterations in testicular gene expression in prepubertal mice.

Authors: Sarah Giese; Hamid Hossain; Melanie Markmann; Trinad Chakraborty; Svetlin Tchatalbachev; Florian Guillou; Martin Bergmann; Klaus Failing; Karola Weider; Ralph Brehm
Journal: Dis Model Mech Date: 2012-06-14 Impact factor: 5.758

6. Quality determination and the repair of poor quality spots in array experiments.

Authors: Brian D M Tom; Walter R Gilks; Elizabeth T Brooke-Powell; James W Ajioka
Journal: BMC Bioinformatics Date: 2005-09-26 Impact factor: 3.169

7. Improving missing value imputation of microarray data by using spot quality weights.

Authors: Peter Johansson; Jari Häkkinen
Journal: BMC Bioinformatics Date: 2006-06-16 Impact factor: 3.169

8. Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors: Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal: BMC Genomics Date: 2010-01-07 Impact factor: 3.969

9. The Application of SILAC Mouse in Human Body Fluid Proteomics Analysis Reveals Protein Patterns Associated with IgA Nephropathy.

Authors: Shilin Zhao; Rongxia Li; Xiaofan Cai; Wanjia Chen; Qingrun Li; Tao Xing; Wenjie Zhu; Y Eugene Chen; Rong Zeng; Yueyi Deng
Journal: Evid Based Complement Alternat Med Date: 2013-05-15 Impact factor: 2.629

10. A meta-data based method for DNA microarray imputation.

Authors: Rebecka Jörnsten; Ming Ouyang; Hui-Yu Wang
Journal: BMC Bioinformatics Date: 2007-03-29 Impact factor: 3.169