Literature DB >> 34720675

Extending greedy feature selection algorithms to multiple solutions.

Giorgos Borboudakis1, Ioannis Tsamardinos1,2,3.   

Abstract

Most feature selection methods identify only a single solution. This is acceptable for predictive purposes, but is not sufficient for knowledge discovery if multiple solutions exist. We propose a strategy to extend a class of greedy methods to efficiently identify multiple solutions, and show under which conditions it identifies all solutions. We also introduce a taxonomy of features that takes the existence of multiple solutions into account. Furthermore, we explore different definitions of statistical equivalence of solutions, as well as methods for testing equivalence. A novel algorithm for compactly representing and visualizing multiple solutions is also introduced. In experiments we show that (a) the proposed algorithm is significantly more computationally efficient than the TIE* algorithm, the only alternative approach with similar theoretical guarantees, while identifying similar solutions to it, and (b) that the identified solutions have similar predictive performance.
© The Author(s) 2021.

Entities:  

Keywords:  Feature selection; Multiple feature selection; Multiple solutions; Stepwise selection

Year:  2021        PMID: 34720675      PMCID: PMC8550441          DOI: 10.1007/s10618-020-00731-7

Source DB:  PubMed          Journal:  Data Min Knowl Discov        ISSN: 1384-5810            Impact factor:   3.670


  12 in total

1.  HITON: a novel Markov Blanket algorithm for optimal variable selection.

Authors:  C F Aliferis; I Tsamardinos; A Statnikov
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  2D image registration in CT images using radial image descriptors.

Authors:  Franz Graf; Hans-Peter Kriegel; Matthias Schubert; Sebastian Pölsterl; Alexander Cavallaro
Journal:  Med Image Comput Comput Assist Interv       Date:  2011

3.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

4.  Ensemble gene selection by grouping for microarray data classification.

Authors:  Huawen Liu; Lei Liu; Huijie Zhang
Journal:  J Biomed Inform       Date:  2009-08-20       Impact factor: 6.317

5.  T-ReCS: stable selection of dynamically formed groups of features with application to prediction of clinical outcomes.

Authors:  Grace T Huang; Ioannis Tsamardinos; Vineet Raghu; Naftali Kaminski; Panayiotis V Benos
Journal:  Pac Symp Biocomput       Date:  2015

6.  Multiple robust signatures for detecting lymph node metastasis in head and neck cancer.

Authors:  Paul Roepman; Patrick Kemmeren; Lodewijk F A Wessels; Piet J Slootweg; Frank C P Holstege
Journal:  Cancer Res       Date:  2006-02-15       Impact factor: 12.701

7.  Algorithms for Discovery of Multiple Markov Boundaries.

Authors:  Alexander Statnikov; Nikita I Lytkin; Jan Lemeire; Constantin F Aliferis
Journal:  J Mach Learn Res       Date:  2013-02       Impact factor: 3.654

8.  Bridging a translational gap: using machine learning to improve the prediction of PTSD.

Authors:  Karen-Inge Karstoft; Isaac R Galatzer-Levy; Alexander Statnikov; Zhiguo Li; Arieh Y Shalev
Journal:  BMC Psychiatry       Date:  2015-03-16       Impact factor: 3.630

9.  A multi-marker association method for genome-wide association studies without the need for population structure correction.

Authors:  Jonas R Klasen; Elke Barbez; Lukas Meier; Nicolai Meinshausen; Peter Bühlmann; Maarten Koornneef; Wolfgang Busch; Korbinian Schneeberger
Journal:  Nat Commun       Date:  2016-11-10       Impact factor: 14.919

10.  On the number of close-to-optimal feature sets.

Authors:  Edward R Dougherty; Marcel Brun
Journal:  Cancer Inform       Date:  2007-02-16
View more
  3 in total

1.  Just Add Data: automated predictive modeling for knowledge discovery and feature selection.

Authors:  Ioannis Tsamardinos; Paulos Charonyktakis; Georgios Papoutsoglou; Giorgos Borboudakis; Kleanthi Lakiotaki; Jean Claude Zenklusen; Hartmut Juhl; Ekaterini Chatzaki; Vincenzo Lagani
Journal:  NPJ Precis Oncol       Date:  2022-06-16

2.  A Machine Learning Model to Predict Knee Osteoarthritis Cartilage Volume Changes over Time Using Baseline Bone Curvature.

Authors:  Hossein Bonakdari; Jean-Pierre Pelletier; François Abram; Johanne Martel-Pelletier
Journal:  Biomedicines       Date:  2022-05-26

3.  Liquid Biopsy in Type 2 Diabetes Mellitus Management: Building Specific Biosignatures via Machine Learning.

Authors:  Makrina Karaglani; Maria Panagopoulou; Christina Cheimonidi; Ioannis Tsamardinos; Efstratios Maltezos; Nikolaos Papanas; Dimitrios Papazoglou; George Mastorakos; Ekaterini Chatzaki
Journal:  J Clin Med       Date:  2022-02-17       Impact factor: 4.241

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.