Literature DB >> 25265613

RANWAR: rank-based weighted association rule mining from gene expression and methylation data.

Saurav Mallik, Anirban Mukhopadhyay, Ujjwal Maulik.   

Abstract

Ranking of association rules is currently an interesting topic in data mining and bioinformatics. The huge number of evolved rules of items (or, genes) by association rule mining (ARM) algorithms makes confusion to the decision maker. In this article, we propose a weighted rule-mining technique (say, RANWAR or rank-based weighted association rule-mining) to rank the rules using two novel rule-interestingness measures, viz., rank-based weighted condensed support (wcs) and weighted condensed confidence (wcc) measures to bypass the problem. These measures are basically depended on the rank of items (genes). Using the rank, we assign weight to each item. RANWAR generates much less number of frequent itemsets than the state-of-the-art association rule mining algorithms. Thus, it saves time of execution of the algorithm. We run RANWAR on gene expression and methylation datasets. The genes of the top rules are biologically validated by Gene Ontologies (GOs) and KEGG pathway analyses. Many top ranked rules extracted from RANWAR that hold poor ranks in traditional Apriori, are highly biologically significant to the related diseases. Finally, the top rules evolved from RANWAR, that are not in Apriori, are reported.

Mesh:

Year:  2014        PMID: 25265613     DOI: 10.1109/TNB.2014.2359494

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  9 in total

1.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

2.  ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2017-12-28       Impact factor: 4.096

3.  Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  BMC Syst Biol       Date:  2018-12-21

4.  OmicsARules: a R package for integration of multi-omics datasets via association rules mining.

Authors:  Danze Chen; Fan Zhang; Qianqian Zhao; Jianzhen Xu
Journal:  BMC Bioinformatics       Date:  2019-11-08       Impact factor: 3.169

5.  Relation Extraction of Protein Complexes from Dynamic Protein-Protein Interaction Network.

Authors:  Moslem Mohammadi Jenghara; Majid Iranpour Mobarakeh; Hossein Ebrahimpour Komleh
Journal:  J Biomed Phys Eng       Date:  2021-12-01

6.  Towards integrated oncogenic marker recognition through mutual information-based statistically significant feature extraction: an association rule mining based study on cancer expression and methylation profiles.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Quant Biol       Date:  2017-11-23

7.  Partitioning of functional gene expression data using principal points.

Authors:  Jaehee Kim; Haseong Kim
Journal:  BMC Bioinformatics       Date:  2017-10-12       Impact factor: 3.169

8.  An efficient pattern growth approach for mining fault tolerant frequent itemsets.

Authors:  Shariq Bashir
Journal:  Expert Syst Appl       Date:  2019-10-21       Impact factor: 6.954

9.  A Linear Regression and Deep Learning Approach for Detecting Reliable Genetic Alterations in Cancer Using DNA Methylation and Gene Expression Data.

Authors:  Saurav Mallik; Soumita Seth; Tapas Bhadra; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2020-08-12       Impact factor: 4.096

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.