Literature DB >> 36010742

Information Theoretic Methods for Variable Selection-A Review.

Jan Mielniczuk1,2.   

Abstract

We review the principal information theoretic tools and their use for feature selection, with the main emphasis on classification problems with discrete features. Since it is known that empirical versions of conditional mutual information perform poorly for high-dimensional problems, we focus on various ways of constructing its counterparts and the properties and limitations of such methods. We present a unified way of constructing such measures based on truncation, or truncation and weighing, for the Möbius expansion of conditional mutual information. We also discuss the main approaches to feature selection which apply the introduced measures of conditional dependence, together with the ways of assessing the quality of the obtained vector of predictors. This involves discussion of recent results on asymptotic distributions of empirical counterparts of criteria, as well as advances in resampling.

Entities:  

Keywords:  Markov blanket; Möbius expansion; conditional independence; feature selection; interaction information

Year:  2022        PMID: 36010742      PMCID: PMC9407310          DOI: 10.3390/e24081079

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.738


  13 in total

1.  Estimating mutual information.

Authors:  Alexander Kraskov; Harald Stögbauer; Peter Grassberger
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-23

2.  BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies.

Authors:  Xiang Wan; Can Yang; Qiang Yang; Hong Xue; Xiaodan Fan; Nelson L S Tang; Weichuan Yu
Journal:  Am J Hum Genet       Date:  2010-09-10       Impact factor: 11.025

3.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

4.  The use of the restricted partition method with case-control data.

Authors:  R Culverhouse
Journal:  Hum Hered       Date:  2007-02-02       Impact factor: 0.444

5.  Using mutual information for selecting features in supervised neural net learning.

Authors:  R Battiti
Journal:  IEEE Trans Neural Netw       Date:  1994

6.  Normalized mutual information feature selection.

Authors:  Pablo A Estévez; Michel Tesmer; Claudio A Perez; Jacek M Zurada
Journal:  IEEE Trans Neural Netw       Date:  2009-01-13

7.  AMBIENCE: a novel approach and efficient algorithm for identifying informative genetic and environmental associations with complex phenotypes.

Authors:  Pritam Chanda; Lara Sucheston; Aidong Zhang; Daniel Brazeau; Jo L Freudenheim; Christine Ambrosone; Murali Ramanathan
Journal:  Genetics       Date:  2008-09-09       Impact factor: 4.562

8.  Efficient Markov Blanket Discovery and Its Application.

Authors:  Tian Gao; Qiang Ji
Journal:  IEEE Trans Cybern       Date:  2016-03-24       Impact factor: 11.448

9.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

10.  A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.

Authors:  Jan Mielniczuk; Paweł Teisseyre
Journal:  Genet Epidemiol       Date:  2017-12-18       Impact factor: 2.135

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.