Literature DB >> 26390495

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.

Jun Chin Ang, Andri Mirzal, Habibollah Haron, Haza Nuzly Abdull Hamed.   

Abstract

Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26390495     DOI: 10.1109/TCBB.2015.2478454

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  34 in total

1.  Multilayer feature selection method for polyp classification via computed tomographic colonography.

Authors:  Weiguo Cao; Zhengrong Liang; Marc J Pomeroy; Kenneth Ng; Shu Zhang; Yongfeng Gao; Perry J Pickhardt; Matthew A Barish; Almas F Abbasi; Hongbing Lu
Journal:  J Med Imaging (Bellingham)       Date:  2019-12-27

2.  Identification of cancer omics commonality and difference via community fusion.

Authors:  Yifan Sun; Yu Jiang; Yang Li; Shuangge Ma
Journal:  Stat Med       Date:  2018-11-12       Impact factor: 2.373

3.  Detecting biomarkers from microarray data using distributed correlation based gene selection.

Authors:  Alok Kumar Shukla; Diwakar Tripathi
Journal:  Genes Genomics       Date:  2020-02-10       Impact factor: 1.839

4.  In silico markers: an evolutionary and statistical approach to select informative genes of human breast cancer subtypes.

Authors:  Shib Sankar Bhowmick; Debotosh Bhattacharjee; Luis Rato
Journal:  Genes Genomics       Date:  2019-04-19       Impact factor: 1.839

5.  High dimensionality reduction by matrix factorization for systems pharmacology.

Authors:  Adel Mehrpooya; Farid Saberi-Movahed; Najmeh Azizizadeh; Mohammad Rezaei-Ravari; Farshad Saberi-Movahed; Mahdi Eftekhari; Iman Tavassoly
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

6.  Simple strategies for semi-supervised feature selection.

Authors:  Konstantinos Sechidis; Gavin Brown
Journal:  Mach Learn       Date:  2017-07-17       Impact factor: 2.940

7.  The Unsupervised Feature Selection Algorithms Based on Standard Deviation and Cosine Similarity for Genomic Data Analysis.

Authors:  Juanying Xie; Mingzhao Wang; Shengquan Xu; Zhao Huang; Philip W Grant
Journal:  Front Genet       Date:  2021-05-13       Impact factor: 4.599

8.  HFS-SLPEE: A Novel Hierarchical Feature Selection and Second Learning Probability Error Ensemble Model for Precision Cancer Diagnosis.

Authors:  Yajie Meng; Min Jin
Journal:  Front Cell Dev Biol       Date:  2021-06-30

9.  Integrating ensemble systems biology feature selection and bimodal deep neural network for breast cancer prognosis prediction.

Authors:  Li-Hsin Cheng; Te-Cheng Hsu; Che Lin
Journal:  Sci Rep       Date:  2021-07-21       Impact factor: 4.379

Review 10.  Selecting gene features for unsupervised analysis of single-cell gene expression data.

Authors:  Jie Sheng; Wei Vivian Li
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.