Literature DB >> 36187051

Algorithmic Stability and Generalization of an Unsupervised Feature Selection Algorithm.

Xinxing Wu1, Qiang Cheng1.   

Abstract

Feature selection, as a vital dimension reduction technique, reduces data dimension by identifying an essential subset of input features, which can facilitate interpretable insights into learning and inference processes. Algorithmic stability is a key characteristic of an algorithm regarding its sensitivity to perturbations of input samples. In this paper, we propose an innovative unsupervised feature selection algorithm attaining this stability with provable guarantees. The architecture of our algorithm consists of a feature scorer and a feature selector. The scorer trains a neural network (NN) to globally score all the features, and the selector adopts a dependent sub-NN to locally evaluate the representation abilities for selecting features. Further, we present algorithmic stability analysis and show that our algorithm has a performance guarantee via a generalization error bound. Extensive experimental results on real-world datasets demonstrate superior generalization performance of our proposed algorithm to strong baseline methods. Also, the properties revealed by our theoretical analysis and the stability of our algorithm-selected features are empirically confirmed.

Entities:  

Year:  2021        PMID: 36187051      PMCID: PMC9524443     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  10 in total

1.  An overview of statistical learning theory.

Authors:  V N Vapnik
Journal:  IEEE Trans Neural Netw       Date:  1999

2.  The Fisher-Markov selector: fast selecting maximally separable feature subset for multiclass classification with applications to high-dimensional data.

Authors:  Qiang Cheng; Hongbo Zhou; Jie Cheng
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2011-06       Impact factor: 6.226

3.  FREL: A Stable Feature Selection Algorithm.

Authors:  Yun Li; Jennie Si; Guojing Zhou; Shasha Huang; Songcan Chen
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2014-08-12       Impact factor: 10.451

4.  Algorithm-Dependent Generalization Bounds for Multi-Task Learning.

Authors:  Tongliang Liu; Dacheng Tao; Mingli Song; Stephen J Maybank
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-03-21       Impact factor: 6.226

5.  Stability-Based Generalization Analysis of Distributed Learning Algorithms for Big Data.

Authors:  Xinxing Wu; Junping Zhang; Fei-Yue Wang
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2019-05-08       Impact factor: 10.451

6.  Fractal Autoencoders for Feature Selection.

Authors:  Xinxing Wu; Qiang Cheng
Journal:  Proc Conf AAAI Artif Intell       Date:  2021-02

7.  CYCLOPS reveals human transcriptional rhythms in health and disease.

Authors:  Ron C Anafi; Lauren J Francey; John B Hogenesch; Junhyong Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2017-04-24       Impact factor: 11.205

8.  Random forest-integrated analysis in AD and LATE brain transcriptome-wide data to identify disease-specific gene expression.

Authors:  Xinxing Wu; Chong Peng; Peter T Nelson; Qiang Cheng
Journal:  PLoS One       Date:  2021-09-07       Impact factor: 3.240

9.  Feature Selection for high Dimensional DNA Microarray data using hybrid approaches.

Authors:  Ammu Prasanna Kumar; Preeja Valsala
Journal:  Bioinformation       Date:  2013-09-23

Review 10.  A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis.

Authors:  Sen Liang; Anjun Ma; Sen Yang; Yan Wang; Qin Ma
Journal:  Comput Struct Biotechnol J       Date:  2018-02-25       Impact factor: 7.271

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.