Literature DB >> 33385830

Deep-gKnock: Nonlinear group-feature selection with deep neural networks.

Guangyu Zhu1, Tingting Zhao2.   

Abstract

Feature selection is central to contemporary high-dimensional data analysis. Group structure among features arises naturally in various scientific problems. Many methods have been proposed to incorporate the group structure information into feature selection. However, these methods are normally restricted to a linear regression setting. To relax the linear constraint, we design a new Deep Neural Network (DNN) architecture and integrating it with the recently proposed knockoff technique to perform nonlinear group-feature selection with controlled group-wise False Discovery Rate (gFDR). Experimental results on high-dimensional synthetic data demonstrate that our method achieves the highest power and accurate gFDR control compared with state-of-the-art methods. The performance of Deep-gKnock is especially superior in the following five situations: (1) nonlinearity relationship; (2) dimension p greater than sample size n; (3) high between-group correlation; (4) high within-group correlation; (5) large number of associated groups. And Deep-gKnock is also demonstrated to be robust to the misspecification of the feature distribution and the change of network architecture. Moreover, Deep-gKnock achieves scientifically meaningful group-feature selection results for cutting-edge real world datasets.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Deep neural networks; False discovery rate; Group feature selection; Knockoffs

Mesh:

Year:  2020        PMID: 33385830     DOI: 10.1016/j.neunet.2020.12.004

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  2 in total

1.  Effective Cancer Subtype and Stage Prediction via Dropfeature-DNNs.

Authors:  Zhong Chen; Wensheng Zhang; Hongwen Deng; Kun Zhang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2022-02-03       Impact factor: 3.710

2.  Explainable deep transfer learning model for disease risk prediction using high-dimensional genomic data.

Authors:  Long Liu; Qingyu Meng; Cherry Weng; Qing Lu; Tong Wang; Yalu Wen
Journal:  PLoS Comput Biol       Date:  2022-07-15       Impact factor: 4.779

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.