Mining gene expression databases for association rules.

Chad Creighton, Samir Hanash.

Abstract

MOTIVATION: Global gene expression profiling, both at the transcript level and at the protein level, can be a valuable tool in the understanding of genes, biological networks, and cellular states. As larger and larger gene expression data sets become available, data mining techniques can be applied to identify patterns of interest in the data. Association rules, used widely in the area of market basket analysis, can be applied to the analysis of expression data as well. Association rules can reveal biologically relevant associations between different genes or between environmental effects and gene expression. An association rule has the form LHS --> RHS, where LHS and RHS are disjoint sets of items, the RHS set being likely to occur whenever the LHS set occurs. Items in gene expression data can include genes that are highly expressed or repressed, as well as relevant facts describing the cellular environment of the genes (e.g. the diagnosis of a tumor sample from which a profile was obtained).
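To make the LHS --> RHS form concrete, here is a minimal sketch in Python. It treats each expression profile as a set of items and scores a rule with the support and confidence measures that are standard in market basket analysis. The abstract does not say which measures or thresholds the authors used, so these measures, the function names, and the toy item labels (`geneA_up` and so on) are all illustrative assumptions.

```python
from typing import FrozenSet, List

# Assumption: one profile = the set of items observed in one experiment,
# e.g. "geneA_up" for a highly expressed gene, "geneC_down" for a repressed one.
Profile = FrozenSet[str]


def support(profiles: List[Profile], items: FrozenSet[str]) -> float:
    """Fraction of profiles that contain every item in `items`."""
    if not profiles:
        return 0.0
    return sum(1 for p in profiles if items <= p) / len(profiles)


def confidence(profiles: List[Profile], lhs: FrozenSet[str], rhs: FrozenSet[str]) -> float:
    """Among profiles containing LHS, the fraction that also contain RHS."""
    if lhs & rhs:
        raise ValueError("LHS and RHS must be disjoint")
    lhs_count = sum(1 for p in profiles if lhs <= p)
    if lhs_count == 0:
        return 0.0
    both_count = sum(1 for p in profiles if (lhs | rhs) <= p)
    return both_count / lhs_count


# Hypothetical toy data, not taken from the yeast data set.
profiles = [
    frozenset({"geneA_up", "geneB_up", "geneC_down"}),
    frozenset({"geneA_up", "geneB_up"}),
    frozenset({"geneA_up", "geneC_down"}),
    frozenset({"geneB_up"}),
]
rule_lhs = frozenset({"geneA_up"})
rule_rhs = frozenset({"geneB_up"})

rule_support = support(profiles, rule_lhs | rule_rhs)
rule_confidence = confidence(profiles, rule_lhs, rule_rhs)
```

In this toy example the rule {geneA_up} --> {geneB_up} holds in two of the three profiles that contain geneA_up. A miner would typically keep only rules whose support and confidence exceed chosen cutoffs.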
RESULTS: We demonstrate an algorithm for efficiently mining association rules from gene expression data, using the data set from Hughes et al. (2000, Cell, 102, 109-126) of 300 expression profiles for yeast. Using the algorithm, we find numerous rules in the data. A cursory analysis of some of these rules reveals many associations between certain genes; many of these make sense biologically, while others suggest new hypotheses that may warrant further investigation. In a data set derived from the yeast data set, but with the expression values for each transcript randomly shifted with respect to the experiments, no rules were found, indicating that nearly all of the rules mined from the actual data set are unlikely to have occurred by chance.
AVAILABILITY: An implementation of the algorithm using Microsoft SQL Server with Access 2000 is available at http://dot.ped.med.umich.edu:2000/pub/assoc_rules/assoc_rules.zip. Our results from mining the yeast data set are available at http://dot.ped.med.umich.edu:2000/pub/assoc_rules/yeast_results.zip.
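The randomized control described in RESULTS can be sketched as follows. This is one possible reading, not the authors' implementation (which used Microsoft SQL Server with Access 2000): it assumes "randomly shifted with respect to the experiments" means that each transcript's values are independently permuted across experiments. A random cyclic shift per transcript would be another plausible reading. The function name, the seed parameter, and the toy values are hypothetical.

```python
import random
from typing import Dict, List


def shuffle_per_transcript(
    expression: Dict[str, List[float]], seed: int = 0
) -> Dict[str, List[float]]:
    """Return a copy in which each transcript's values are permuted across experiments.

    Each transcript keeps its own distribution of values, but any
    co-occurrence between transcripts within an experiment is broken.
    """
    rng = random.Random(seed)  # seeded so the control is reproducible
    shuffled: Dict[str, List[float]] = {}
    for transcript, values in expression.items():
        permuted = list(values)  # copy, so the input is left untouched
        rng.shuffle(permuted)
        shuffled[transcript] = permuted
    return shuffled


# Hypothetical toy matrix: transcript -> log ratios over four experiments.
expression = {
    "geneA": [2.1, 1.8, -0.1, 0.0],
    "geneB": [1.9, 2.2, 0.1, -0.2],
    "geneC": [-1.5, 0.0, -1.7, 0.2],
}
control = shuffle_per_transcript(expression, seed=42)
```

Running the same rule miner on `control` and on the real data gives a rough sense of how many rules would appear by chance alone.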

Year:  2003        PMID: 12499296     DOI: 10.1093/bioinformatics/19.1.79

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

