Literature DB >> 16377613

Improving missing value estimation in microarray data with gene ontology.

Johannes Tuikkala1, Laura Elo, Olli S Nevalainen, Tero Aittokallio.   

Abstract

MOTIVATION: Gene expression microarray experiments produce datasets with frequent missing expression values. Accurate estimation of missing values is an important prerequisite for efficient data analysis as many statistical and machine learning techniques either require a complete dataset or their results are significantly dependent on the quality of such estimates. A limitation of the existing estimation methods for microarray data is that they use no external information but the estimation is based solely on the expression data. We hypothesized that utilizing a priori information on functional similarities available from public databases facilitates the missing value estimation.
RESULTS: We investigated whether semantic similarity originating from gene ontology (GO) annotations could improve the selection of relevant genes for missing value estimation. The relative contribution of each information source was automatically estimated from the data using an adaptive weight selection procedure. Our experimental results in yeast cDNA microarray datasets indicated that by considering GO information in the k-nearest neighbor algorithm we can enhance its performance considerably, especially when the number of experimental conditions is small and the percentage of missing values is high. The increase of performance was less evident with a more sophisticated estimation method. We conclude that even a small proportion of annotated genes can provide improvements in data quality significant for the eventual interpretation of the microarray experiments. AVAILABILITY: Java and Matlab codes are available on request from the authors. SUPPLEMENTARY MATERIAL: Available online at http://users.utu.fi/jotatu/GOImpute.html.

Entities:  

Mesh:

Year:  2005        PMID: 16377613     DOI: 10.1093/bioinformatics/btk019

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  Biological impact of missing-value imputation on downstream analyses of gene expression profiles.

Authors:  Sunghee Oh; Dongwan D Kang; Guy N Brock; George C Tseng
Journal:  Bioinformatics       Date:  2010-11-02       Impact factor: 6.937

2.  How to improve postgenomic knowledge discovery using imputation.

Authors:  Muhammad Shoaib B Sehgal; Iqbal Gondal; Laurence S Dooley; Ross Coppel
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-02-08

3.  Impact of missing value imputation on classification for DNA microarray gene expression data--a model-based study.

Authors:  Youting Sun; Ulisses Braga-Neto; Edward R Dougherty
Journal:  EURASIP J Bioinform Syst Biol       Date:  2010-03-02

4.  NanoParticle Ontology for cancer nanotechnology research.

Authors:  Dennis G Thomas; Rohit V Pappu; Nathan A Baker
Journal:  J Biomed Inform       Date:  2010-03-06       Impact factor: 6.317

5.  Incorporating Nonlinear Relationships in Microarray Missing Value Imputation.

Authors:  Tianwei Yu; Hesen Peng; Wei Sun
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2011 May-Jun       Impact factor: 3.710

6.  Integrative analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: a non-linear model to predict abundance of undetected proteins.

Authors:  Wandaliz Torres-García; Weiwen Zhang; George C Runger; Roger H Johnson; Deirdre R Meldrum
Journal:  Bioinformatics       Date:  2009-05-15       Impact factor: 6.937

7.  Information theory applied to the sparse gene ontology annotation network to predict novel gene function.

Authors:  Ying Tao; Lee Sam; Jianrong Li; Carol Friedman; Yves A Lussier
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

8.  Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes.

Authors:  Guy N Brock; John R Shaffer; Richard E Blakesley; Meredith J Lotz; George C Tseng
Journal:  BMC Bioinformatics       Date:  2008-01-10       Impact factor: 3.169

Review 9.  Semantic similarity in biomedical ontologies.

Authors:  Catia Pesquita; Daniel Faria; André O Falcão; Phillip Lord; Francisco M Couto
Journal:  PLoS Comput Biol       Date:  2009-07-31       Impact factor: 4.475

10.  Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors:  Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.