Literature DB >> 18631025

Discovering high-order patterns of gene expression levels.

Andrew K C Wong1, Wai-Ho Au, Keith C C Chan.   

Abstract

This paper reports the discovery of statistically significant association patterns of gene expression levels from microarray data. By association patterns, we mean certain gene expression intensity intervals having statistically significant associations among themselves and with the tissue classes, such as cancerous and normal tissues. We describe how the significance of the associations among gene expression levels can be evaluated using a statistical measure in an objective manner. If an association is found to be significant based on the measure, we say that it is statistically significant. Given a gene expression data set, we first cluster the entire gene pool comprising all the genes into groups by optimizing the correlation (or more precisely, interdependence) among the gene expression levels within gene groups. From each group, we select one or several genes that are most correlated with other genes within that group to form a smaller gene pool. This gene pool then constitutes the most representative genes from the original pool. Our pattern discovery algorithm is then used, for the first time, to discover the significant association patterns of gene expression levels among the genes from the small pool. With our method, it is more effective to discover and express the associations in terms of their intensity intervals. Hence, we discretize each gene expression levels into intervals maximizing the interdependence between the gene expression and the tissue classes. From this data set of gene expression intervals, we discover the association patterns representing statistically significant associations, some positively and some negatively, with different tissue classes. We apply our pattern discovery methodology to the colon-cancer microarray gene expression data set. It consists of 2000 genes and 62 samples taken from colon cancer or normal subjects. The statistically significant combinations of gene expression levels that repress or activate colon cancer are revealed in the colon-cancer data set. The discovered association patterns are ranked according to their statistical significance and displayed for interpretation and further analysis.

Entities:  

Mesh:

Year:  2008        PMID: 18631025     DOI: 10.1089/cmb.2007.0147

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Unsupervised fuzzy pattern discovery in gene expression data.

Authors:  Gene P K Wu; Keith C C Chan; Andrew K C Wong
Journal:  BMC Bioinformatics       Date:  2011-07-27       Impact factor: 3.169

2.  Exploiting identifiability and intergene correlation for improved detection of differential expression.

Authors:  J R Deller; Hayder Radha; J Justin McCormick
Journal:  ISRN Bioinform       Date:  2013-06-03

3.  Nonlinear dependence in the discovery of differentially expressed genes.

Authors:  J R Deller; Hayder Radha; J Justin McCormick; Huiyan Wang
Journal:  ISRN Bioinform       Date:  2012-04-12
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.