Literature DB >> 15465482

A primer on gene expression and microarrays for machine learning researchers.

Winston Patrick Kuo1, Eun-Young Kim, Jeff Trimarchi, Tor-Kristian Jenssen, Staal A Vinterbo, Lucila Ohno-Machado.   

Abstract

Data originating from biomedical experiments has provided machine learning researchers with an important source of motivation for developing and evaluating new algorithms. A new wave of algorithmic development has been initiated with the publication of gene expression data derived from microarrays. Microarray data analysis is particularly challenging given the large number of measurements (typically in the order of thousands) that are reported for relatively few samples (typically in the order of dozens). Many data sets are now available on the web. It is important that machine learning researchers understand how data are obtained and which assumptions are necessary in the analysis. Microarray data have the potential to cause significant impact in machine learning research, not just as a rich and realistic source of cases for testing new algorithms, as has been the UCI machine learning repository in the past decades, but also as a main motivation for their development. In this article, we briefly review the biology underlying microarrays, the process of obtaining gene expression measurements, and the rationale behind the common types of analyses involved in a microarray experiment. We outline the main challenges and reiterate critical considerations regarding the construction of supervised learning models that use this type of data. The goal of this article is to familiarize machine learning researchers with data originated from gene expression microarrays.

Mesh:

Year:  2004        PMID: 15465482     DOI: 10.1016/j.jbi.2004.07.002

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  5 in total

Review 1.  Metabolic engineering in the -omics era: elucidating and modulating regulatory networks.

Authors:  Goutham N Vemuri; Aristos A Aristidou
Journal:  Microbiol Mol Biol Rev       Date:  2005-06       Impact factor: 11.056

2.  Effective feature selection framework for cluster analysis of microarray data.

Authors:  Gouchol Pok; Jyh-Charn Steve Liu; Keun Ho Ryu
Journal:  Bioinformation       Date:  2010-02-28

3.  Ethanol sensitivity: a central role for CREB transcription regulation in the cerebellum.

Authors:  George K Acquaah-Mensah; Vikas Misra; Shyam Biswal
Journal:  BMC Genomics       Date:  2006-12-05       Impact factor: 3.969

4.  A Marfan syndrome gene expression phenotype in cultured skin fibroblasts.

Authors:  Zizhen Yao; Jochen C Jaeger; Walter L Ruzzo; Cecile Z Morale; Mary Emond; Uta Francke; Dianna M Milewicz; Stephen M Schwartz; Eileen R Mulvihill
Journal:  BMC Genomics       Date:  2007-09-12       Impact factor: 3.969

5.  Ten simple rules for organizing a special session at a scientific conference.

Authors:  Davide Chicco; Philip E Bourne
Journal:  PLoS Comput Biol       Date:  2022-08-25       Impact factor: 4.779

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.