Literature DB >> 17945892

A knowledge driven regression model for gene expression and microarray analysis.

Rong Jin1, Luo Si, Shireesh Srivastava, Zheng Li, Christina Chan.   

Abstract

The linear regression model has been widely used in the analysis of gene expression and microarray data to identify a subset of genes that are important to a given metabolic function. One of the key challenges in applying the linear regression model to gene expression data analysis arises from the sparse data problem, in which the number of genes is significantly larger than the number of conditions. To resolve this problem, we present a knowledge driven regression model that incorporates the knowledge of genes from the Gene Ontology (GO) database into the linear regression model. It is based on the assumption that two genes are likely to be assigned similar weights when they share similar sets of GO codes. Empirical studies show that the proposed knowledge driven regression model is effective in reducing the regression errors, and furthermore effective in identifying genes that are relevant to a given metabolite.

Mesh:

Substances:

Year:  2006        PMID: 17945892     DOI: 10.1109/IEMBS.2006.260347

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  1 in total

1.  Reconstruct modular phenotype-specific gene networks by knowledge-driven matrix factorization.

Authors:  Xuerui Yang; Yang Zhou; Rong Jin; Christina Chan
Journal:  Bioinformatics       Date:  2009-06-19       Impact factor: 6.937

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.