Literature DB >> 28092570

Identifying Epigenetic Biomarkers using Maximal Relevance and Minimal Redundancy Based Feature Selection for Multi-Omics Data.

Saurav Mallik, Tapas Bhadra, Ujjwal Maulik.   

Abstract

Epigenetic Biomarker discovery is an important task in bioinformatics. In this article, we develop a new framework of identifying statistically significant epigenetic biomarkers using maximal-relevance and minimal-redundancy criterion based feature (gene) selection for multi-omics dataset. Firstly, we determine the genes that have both expression as well as methylation values, and follow normal distribution. Similarly, we identify the genes which consist of both expression and methylation values, but do not follow normal distribution. For each case, we utilize a gene-selection method that provides maximal-relevant, but variable-weighted minimum-redundant genes as top ranked genes. For statistical validation, we apply t-test on both the expression and methylation data consisting of only the normally distributed top ranked genes to determine how many of them are both differentially expressed andmethylated. Similarly, we utilize Limma package for performing non-parametric Empirical Bayes test on both expression and methylation data comprising only the non-normally distributed top ranked genes to identify how many of them are both differentially expressed and methylated. We finally report the top-ranking significant gene-markerswith biological validation. Moreover, our framework improves positive predictive rate and reduces false positive rate in marker identification. In addition, we provide a comparative analysis of our gene-selection method as well as othermethods based on classificationperformances obtained using several well-known classifiers.

Mesh:

Substances:

Year:  2017        PMID: 28092570     DOI: 10.1109/TNB.2017.2650217

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  11 in total

1.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

2.  ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2017-12-28       Impact factor: 4.096

Review 3.  Machine Learning and Integrative Analysis of Biomedical Big Data.

Authors:  Bilal Mirza; Wei Wang; Jie Wang; Howard Choi; Neo Christopher Chung; Peipei Ping
Journal:  Genes (Basel)       Date:  2019-01-28       Impact factor: 4.096

4.  Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Genes (Basel)       Date:  2019-08-13       Impact factor: 4.096

5.  Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  BMC Syst Biol       Date:  2018-12-21

6.  Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Sci Rep       Date:  2020-12-17       Impact factor: 4.379

7.  A Novel Biomarker Identification Approach for Gastric Cancer Using Gene Expression and DNA Methylation Dataset.

Authors:  Ge Zhang; Zijing Xue; Chaokun Yan; Jianlin Wang; Huimin Luo
Journal:  Front Genet       Date:  2021-03-25       Impact factor: 4.599

8.  Towards integrated oncogenic marker recognition through mutual information-based statistically significant feature extraction: an association rule mining based study on cancer expression and methylation profiles.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Quant Biol       Date:  2017-11-23

9.  Partitioning of functional gene expression data using principal points.

Authors:  Jaehee Kim; Haseong Kim
Journal:  BMC Bioinformatics       Date:  2017-10-12       Impact factor: 3.169

Review 10.  A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis.

Authors:  Sen Liang; Anjun Ma; Sen Yang; Yan Wang; Qin Ma
Journal:  Comput Struct Biotechnol J       Date:  2018-02-25       Impact factor: 7.271

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.