Literature DB >> 22084149

A top-r feature selection algorithm for microarray gene expression data.

Alok Sharma1, Seiya Imoto, Satoru Miyano.   

Abstract

Most of the conventional feature selection algorithms have a drawback whereby a weakly ranked gene that could perform well in terms of classification accuracy with an appropriate subset of genes will be left out of the selection. Considering this shortcoming, we propose a feature selection algorithm in gene expression data analysis of sample classifications. The proposed algorithm first divides genes into subsets, the sizes of which are relatively small (roughly of size h), then selects informative smaller subsets of genes (of size r < h) from a subset and merges the chosen genes with another gene subset (of size r) to update the gene subset. We repeat this process until all subsets are merged into one informative subset. We illustrate the effectiveness of the proposed algorithm by analyzing three distinct gene expression data sets. Our method shows promising classification accuracy for all the test data sets. We also show the relevance of the selected genes in terms of their biological functions.

Mesh:

Year:  2012        PMID: 22084149     DOI: 10.1109/TCBB.2011.151

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  27 in total

1.  Graph-based unsupervised feature selection and multiview clustering for microarray data.

Authors:  Tripti Swarnkar; Pabitra Mitra
Journal:  J Biosci       Date:  2015-10       Impact factor: 1.826

2.  Noninvasive fetal trisomy detection by multiplexed semiconductor sequencing: a barcoding analysis strategy.

Authors:  Jiawei Shen; Zujia Wen; Xiaolan Qin; Yongyong Shi
Journal:  J Hum Genet       Date:  2015-12-10       Impact factor: 3.172

3.  A new parameter tuning approach for enhanced motor imagery EEG signal classification.

Authors:  Shiu Kumar; Alok Sharma
Journal:  Med Biol Eng Comput       Date:  2018-04-04       Impact factor: 2.602

4.  A Self-Training Subspace Clustering Algorithm under Low-Rank Representation for Cancer Classification on Gene Expression Data.

Authors:  Chun-Qiu Xia; Ke Han; Yong Qi; Yang Zhang; Dong-Jun Yu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2017-06-06       Impact factor: 3.710

5.  DeepFeature: feature selection in nonimage data using convolutional neural network.

Authors:  Alok Sharma; Artem Lysenko; Keith A Boroevich; Edwin Vans; Tatsuhiko Tsunoda
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

6.  Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy.

Authors:  Saleh Shahbeig; Akbar Rahideh; Mohammad Sadegh Helfroush; Kamran Kazemi
Journal:  IET Syst Biol       Date:  2018-08       Impact factor: 1.615

7.  Large-scale RNA-Seq Transcriptome Analysis of 4043 Cancers and 548 Normal Tissue Controls across 12 TCGA Cancer Types.

Authors:  Li Peng; Xiu Wu Bian; Di Kang Li; Chuan Xu; Guang Ming Wang; Qing You Xia; Qing Xiong
Journal:  Sci Rep       Date:  2015-08-21       Impact factor: 4.379

Review 8.  Integrating genetics and epigenetics in breast cancer: biological insights, experimental, computational methods and therapeutic potential.

Authors:  Claudia Cava; Gloria Bertoli; Isabella Castiglioni
Journal:  BMC Syst Biol       Date:  2015-09-21

9.  Recursive feature selection with significant variables of support vectors.

Authors:  Chen-An Tsai; Chien-Hsun Huang; Ching-Wei Chang; Chun-Houh Chen
Journal:  Comput Math Methods Med       Date:  2012-08-15       Impact factor: 2.238

10.  A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition.

Authors:  Alok Sharma; Kuldip K Paliwal; Abdollah Dehzangi; James Lyons; Seiya Imoto; Satoru Miyano
Journal:  BMC Bioinformatics       Date:  2013-07-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.