Literature DB >> 30031057

Relief-based feature selection: Introduction and review.

Ryan J Urbanowicz1, Melissa Meeker2, William La Cava3, Randal S Olson4, Jason H Moore5.   

Abstract

Feature selection plays a critical role in biomedical data mining, driven by increasing feature dimensionality in target problems and growing interest in advanced but computationally expensive methodologies able to model complex associations. Specifically, there is a need for feature selection methods that are computationally efficient, yet sensitive to complex patterns of association, e.g. interactions, so that informative features are not mistakenly eliminated prior to downstream modeling. This paper focuses on Relief-based algorithms (RBAs), a unique family of filter-style feature selection algorithms that have gained appeal by striking an effective balance between these objectives while flexibly adapting to various data characteristics, e.g. classification vs. regression. First, this work broadly examines types of feature selection and defines RBAs within that context. Next, we introduce the original Relief algorithm and associated concepts, emphasizing the intuition behind how it works, how feature weights generated by the algorithm can be interpreted, and why it is sensitive to feature interactions without evaluating combinations of features. Lastly, we include an expansive review of RBA methodological research beyond Relief and its popular descendant, ReliefF. In particular, we characterize branches of RBA research, and provide comparative summaries of RBA algorithms including contributions, strategies, functionality, time complexity, adaptation to key data characteristics, and software availability.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Epistasis; Feature interaction; Feature selection; Feature weighting; Filter; ReliefF

Mesh:

Year:  2018        PMID: 30031057      PMCID: PMC6299836          DOI: 10.1016/j.jbi.2018.07.014

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  19 in total

1.  Iterative RELIEF for feature weighting: algorithms, theories, and applications.

Authors:  Yijun Sun
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-06       Impact factor: 6.226

2.  Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting.

Authors:  Kai Ye; K Anton Feenstra; Jaap Heringa; Adriaan P Ijzerman; Elena Marchiori
Journal:  Bioinformatics       Date:  2007-11-17       Impact factor: 6.937

3.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

4.  Local-learning-based feature selection for high-dimensional data analysis.

Authors:  Yijun Sun; Sinisa Todorovic; Steve Goodison
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2010-09       Impact factor: 6.226

5.  Benchmarking relief-based feature selection methods for bioinformatics data mining.

Authors:  Ryan J Urbanowicz; Randal S Olson; Peter Schmitt; Melissa Meeker; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-17       Impact factor: 6.317

6.  ExSTraCS 2.0: Description and Evaluation of a Scalable Learning Classifier System.

Authors:  Ryan J Urbanowicz; Jason H Moore
Journal:  Evol Intell       Date:  2015-04-03

7.  Differential privacy-based evaporative cooling feature selection and classification with relief-F and random forests.

Authors:  Trang T Le; W Kyle Simmons; Masaya Misaki; Jerzy Bodurka; Bill C White; Jonathan Savitz; Brett A McKinney
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

8.  PMLB: a large benchmark suite for machine learning evaluation and comparison.

Authors:  Randal S Olson; William La Cava; Patryk Orzechowski; Ryan J Urbanowicz; Jason H Moore
Journal:  BioData Min       Date:  2017-12-11       Impact factor: 2.522

9.  Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis.

Authors:  Brett A McKinney; James E Crowe; Jingyu Guo; Dehua Tian
Journal:  PLoS Genet       Date:  2009-03-20       Impact factor: 5.917

10.  A comparative analysis of biomarker selection techniques.

Authors:  Nicoletta Dessì; Emanuele Pascariello; Barbara Pes
Journal:  Biomed Res Int       Date:  2013-11-10       Impact factor: 3.411

View more
  74 in total

1.  Interpretation of machine learning predictions for patient outcomes in electronic health records.

Authors:  William La Cava; Christopher Bauer; Jason H Moore; Sarah A Pendergrass
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

2.  Disentangled-Multimodal Adversarial Autoencoder: Application to Infant Age Prediction With Incomplete Multimodal Neuroimages.

Authors:  Dan Hu; Han Zhang; Zhengwang Wu; Fan Wang; Li Wang; J Keith Smith; Weili Lin; Gang Li; Dinggang Shen
Journal:  IEEE Trans Med Imaging       Date:  2020-11-30       Impact factor: 10.048

3.  Consensus features nested cross-validation.

Authors:  Saeid Parvandeh; Hung-Wen Yeh; Martin P Paulus; Brett A McKinney
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

4.  Theoretical properties of distance distributions and novel metrics for nearest-neighbor feature selection.

Authors:  Bryan A Dawkins; Trang T Le; Brett A McKinney
Journal:  PLoS One       Date:  2021-02-08       Impact factor: 3.240

5.  Benchmarking relief-based feature selection methods for bioinformatics data mining.

Authors:  Ryan J Urbanowicz; Randal S Olson; Peter Schmitt; Melissa Meeker; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-17       Impact factor: 6.317

Review 6.  Reinventing polysomnography in the age of precision medicine.

Authors:  Diane C Lim; Diego R Mazzotti; Kate Sutherland; Jesse W Mindel; Jinyoung Kim; Peter A Cistulli; Ulysses J Magalang; Allan I Pack; Philip de Chazal; Thomas Penzel
Journal:  Sleep Med Rev       Date:  2020-03-20       Impact factor: 11.609

7.  Chaotic emperor penguin optimised extreme learning machine for microarray cancer classification.

Authors:  Santos Kumar Baliarsingh; Swati Vipsita
Journal:  IET Syst Biol       Date:  2020-04       Impact factor: 1.615

Review 8.  Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.

Authors:  Emre Arslan; Jonathan Schulz; Kunal Rai
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2021-07-07       Impact factor: 10.680

9.  Computer-aided diagnosis of masses in breast computed tomography imaging: deep learning model with combined handcrafted and convolutional radiomic features.

Authors:  Marco Caballo; Andrew M Hernandez; Su Hyun Lyu; Jonas Teuwen; Ritse M Mann; Bram van Ginneken; John M Boone; Ioannis Sechopoulos
Journal:  J Med Imaging (Bellingham)       Date:  2021-03-29

10.  Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments.

Authors:  Muhammad Hamraz; Naz Gul; Mushtaq Raza; Dost Muhammad Khan; Umair Khalil; Seema Zubair; Zardad Khan
Journal:  PeerJ Comput Sci       Date:  2021-06-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.