Literature DB >> 17975272

High confidence rule mining for microarray analysis.

Tara McIntosh, Sanjay Chawla.   

Abstract

We present an association rule mining method for mining high confidence rules, which describe interesting gene relationships from microarray datasets. Microarray datasets typically contain an order of magnitude more genes than experiments, rendering many data mining methods impractical as they are optimised for sparse datasets. A new family of row-enumeration rule mining algorithms have emerged to facilitate mining in dense datasets. These algorithms rely on pruning infrequent relationships to reduce the search space by using the support measure. This major shortcoming results in the pruning of many potentially interesting rules with low support but high confidence. We propose a new row-enumeration rule mining method, MaxConf, to mine high confidence rules from microarray data. MaxConf is a support-free algorithm which directly uses the confidence measure to effectively prune the search space. Experiments on three microarray datasets show that MaxConf outperforms support-based rule mining with respect to scalability and rule extraction. Furthermore, detailed biological analyses demonstrate the effectiveness of our approach -- the rules discovered by MaxConf are substantially more interesting and meaningful compared with support-based methods.

Mesh:

Substances:

Year:  2007        PMID: 17975272     DOI: 10.1109/tcbb.2007.1050

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  4 in total

1.  Discovery of error-tolerant biclusters from noisy gene expression data.

Authors:  Rohit Gupta; Navneet Rao; Vipin Kumar
Journal:  BMC Bioinformatics       Date:  2011-11-24       Impact factor: 3.169

2.  MIDClass: microarray data classification by association rules and gene expression intervals.

Authors:  Rosalba Giugno; Alfredo Pulvirenti; Luciano Cascione; Giuseppe Pigola; Alfredo Ferro
Journal:  PLoS One       Date:  2013-08-06       Impact factor: 3.240

Review 3.  A primer to frequent itemset mining for bioinformatics.

Authors:  Stefan Naulaerts; Pieter Meysman; Wout Bittremieux; Trung Nghia Vu; Wim Vanden Berghe; Bart Goethals; Kris Laukens
Journal:  Brief Bioinform       Date:  2013-10-26       Impact factor: 11.622

4.  Mining differential top-k co-expression patterns from time course comparative gene expression datasets.

Authors:  Yu-Cheng Liu; Chun-Pei Cheng; Vincent S Tseng
Journal:  BMC Bioinformatics       Date:  2013-07-21       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.