Literature DB >> 32040771

Detecting biomarkers from microarray data using distributed correlation based gene selection.

Alok Kumar Shukla1, Diwakar Tripathi2.   

Abstract

BACKGROUND: Over the past few decades, DNA microarray technology has emerged as a prevailing process for early identification of cancer subtypes. Several feature selection (FS) techniques have been widely applied for identifying cancer from microarray gene data but only very few studies have been conducted on distributing the feature selection process for detecting cancer subtypes.
OBJECTIVE: Not all the gene expressions are needed in prediction, this research article objective is to select discriminative biomarkers by using distributed FS method which helps in accurately diagnosis of cancer subtype. Traditional feature selection techniques have several drawbacks like unrelated features that could perform well in terms of classification accuracy with a suitable subset of genes will be left out of the selection.
METHOD: To overcome the issue, in this paper a new filter-based method for gene selection is introduced which can select the highly relevant genes for distinguishing tissues from the gene expression dataset. In addition, it is used to compute the relation between gene-gene and gene-class and simultaneously identify subset of essential genes. Our method is tested on Diffuse Large B cell Lymphoma (DLBCL) dataset by using well-known classification techniques such as support vector machine, naïve Bayes, k-nearest neighbor, and decision tree.
RESULTS: Results on biological DLBCL dataset demonstrate that the proposed method provides promising tools for the prediction of cancer type, with the prediction accuracy of 97.62%, precision of 94.23%, sensitivity of 94.12%, F-measure of 90.12%, and ROC value of 99.75%.
CONCLUSION: The experimental results reveal the fact that the proposed method is significantly improved classification accuracy and execution time, compared to existing standard algorithms when applied to the non-partitioned dataset. Furthermore, the extracted genes are biologically sound and agree with the outcome of relevant biomedical studies.

Entities:  

Keywords:  DLBCL; Feature selection; Information theory; Spearman’s correlation

Mesh:

Substances:

Year:  2020        PMID: 32040771     DOI: 10.1007/s13258-020-00916-w

Source DB:  PubMed          Journal:  Genes Genomics        ISSN: 1976-9571            Impact factor:   1.839


  15 in total

1.  An Extensive Empirical Comparison of Probabilistic Hierarchical Classifiers in Datasets of Ageing-Related Genes.

Authors:  Fabio Fabris; Alex A Freitas; Jennifer M A Tullet
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2015-12-03       Impact factor: 3.710

2.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

3.  Iterative RELIEF for feature weighting: algorithms, theories, and applications.

Authors:  Yijun Sun
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-06       Impact factor: 6.226

4.  A New Approach for Feature Selection from Microarray Data Based on Mutual Information.

Authors:  Jian Tang; Shuigeng Zhou
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2016-01-07       Impact factor: 3.710

5.  Novel Consensus Gene Selection Criteria for Distributed GPU Partial Least Squares-Based Gene Microarray Analysis in Diffused Large B Cell Lymphoma (DLBCL) and Related Findings.

Authors:  Ho-Chun Wu; Xi-Guang Wei; Shing-Chow Chan
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2017-10-09       Impact factor: 3.710

6.  A New Binary Particle Swarm Optimization Approach: Momentum and Dynamic Balance Between Exploration and Exploitation.

Authors:  Bach Hoai Nguyen; Bing Xue; Peter Andreae; Mengjie Zhang
Journal:  IEEE Trans Cybern       Date:  2021-01-15       Impact factor: 11.448

7.  Gene selection using iterative feature elimination random forests for survival outcomes.

Authors:  Herbert Pang; Stephen L George; Ken Hui; Tiejun Tong
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 Sep-Oct       Impact factor: 3.710

8.  Gene-expression-based cancer subtypes prediction through feature selection and transductive SVM.

Authors:  Ujjwal Maulik; Anirban Mukhopadhyay; Debasis Chakraborty
Journal:  IEEE Trans Biomed Eng       Date:  2012-10-18       Impact factor: 4.538

Review 9.  Application of microarrays to the analysis of gene expression in cancer.

Authors:  Pascale F Macgregor; Jeremy A Squire
Journal:  Clin Chem       Date:  2002-08       Impact factor: 8.327

10.  Identification of spatial expression trends in single-cell gene expression data.

Authors:  Daniel Edsgärd; Per Johnsson; Rickard Sandberg
Journal:  Nat Methods       Date:  2018-03-19       Impact factor: 28.547

View more
  3 in total

1.  Somatic copy number alterations are predictive of progression-free survival in patients with lung adenocarcinoma undergoing radiotherapy.

Authors:  Fan Kou; Lei Wu; Yan Guo; Bailu Zhang; Baihui Li; Ziqi Huang; Xiubao Ren; Lili Yang
Journal:  Cancer Biol Med       Date:  2021-08-27       Impact factor: 5.347

2.  An Efficient Cancer Classification Model Using Microarray and High-Dimensional Data.

Authors:  Hanaa Fathi; Hussain AlSalman; Abdu Gumaei; Ibrahim I M Manhrawy; Abdelazim G Hussien; Passent El-Kafrawy
Journal:  Comput Intell Neurosci       Date:  2021-12-29

3.  Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio.

Authors:  Isis Narváez-Bandera; Deiver Suárez-Gómez; Clara E Isaza; Mauricio Cabrera-Ríos
Journal:  PLoS One       Date:  2022-01-27       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.