Literature DB >> 15706519

On combining multiple microarray studies for improved functional classification by whole-dataset feature selection.

See-Kiong Ng1, Soon-Heng Tan, V S Sundararajan.   

Abstract

As microarray technologies become routinely applied in genome laboratories for studying gene expression, it is not uncommon that experiments on identical or similar sets of genes are conducted by multiple laboratories for various functional studies of these genes. Much of such data are often available to researchers for their data analysis, either through collaborators or from online gene expression databases. It will be useful to combine data from different microarray studies to improve the microarray data mining results. We show that the functional classification of genes from microarray data can be improved further by combining gene expression data from multiple microarray studies, even if the experimental focus or conditions for each experimental study may differ. However, blindly combining all available datasets may not always improve the analysis results---it is important to be selective of the datasets for inclusion. In our approach, we consider each dataset to be one feature, and then apply feature selection strategies to select appropriate datasets for training. With a simple hill-climbing method, we show that gene classification performances can be improved by whole-dataset feature selection.

Mesh:

Substances:

Year:  2003        PMID: 15706519

Source DB:  PubMed          Journal:  Genome Inform        ISSN: 0919-9454


  1 in total

1.  Effect of data combination on predictive modeling: a study using gene expression data.

Authors:  Melanie Osl; Stephan Dreiseitl; Jihoon Kim; Kiltesh Patel; Christian Baumgartner; Lucila Ohno-Machado
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.