Literature DB >> 28114033

Integrating Multiple Data Sources for Combinatorial Marker Discovery: A Study in Tumorigenesis.

Sanghamitra Bandyopadhyay, Saurav Mallik.   

Abstract

Identification of combinatorial markers from multiple data sources is a challenging task in bioinformatics. Here, we propose a novel computational framework for identifying significant combinatorial markers ( s) using both gene expression and methylation data. The gene expression and methylation data are integrated into a single continuous data as well as a (post-discretized) boolean data based on their intrinsic (i.e., inverse) relationship. A novel combined score of methylation and expression data (viz., ) is introduced which is computed on the integrated continuous data for identifying initial non-redundant set of genes. Thereafter, (maximal) frequent closed homogeneous genesets are identified using a well-known biclustering algorithm applied on the integrated boolean data of the determined non-redundant set of genes. A novel sample-based weighted support ( ) is then proposed that is consecutively calculated on the integrated boolean data of the determined non-redundant set of genes in order to identify the non-redundant significant genesets. The top few resulting genesets are identified as potential s. Since our proposed method generates a smaller number of significant non-redundant genesets than those by other popular methods, the method is much faster than the others. Application of the proposed technique on an expression and a methylation data for Uterine tumor or Prostate Carcinoma produces a set of significant combination of markers. We expect that such a combination of markers will produce lower false positives than individual markers.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 28114033     DOI: 10.1109/TCBB.2016.2636207

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  9 in total

1.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

2.  Molecular signatures identified by integrating gene expression and methylation in non-seminoma and seminoma of testicular germ cell tumours.

Authors:  Saurav Mallik; Guimin Qin; Peilin Jia; Zhongming Zhao
Journal:  Epigenetics       Date:  2020-07-13       Impact factor: 4.528

3.  ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2017-12-28       Impact factor: 4.096

4.  Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2019-08-13       Impact factor: 4.096

5.  Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  BMC Syst Biol       Date:  2018-12-21

6.  Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Sci Rep       Date:  2020-12-17       Impact factor: 4.379

7.  Comparison of five supervised feature selection algorithms leading to top features and gene signatures from multi-omics data in cancer.

Authors:  Tapas Bhadra; Saurav Mallik; Neaj Hasan; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2022-04-28       Impact factor: 3.307

8.  Towards integrated oncogenic marker recognition through mutual information-based statistically significant feature extraction: an association rule mining based study on cancer expression and methylation profiles.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Quant Biol       Date:  2017-11-23

9.  Partitioning of functional gene expression data using principal points.

Authors:  Jaehee Kim; Haseong Kim
Journal:  BMC Bioinformatics       Date:  2017-10-12       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.