Literature DB >> 17431301

Iterative RELIEF for feature weighting: algorithms, theories, and applications.

Yijun Sun1.   

Abstract

RELIEF is considered one of the most successful algorithms for assessing the quality of features. In this paper, we propose a set of new feature weighting algorithms that perform significantly better than RELIEF, without introducing a large increase in computational complexity. Our work starts from a mathematical interpretation of the seemingly heuristic RELIEF algorithm as an online method solving a convex optimization problem with a margin-based objective function. This interpretation explains the success of RELIEF in real application and enables us to identify and address its following weaknesses. RELIEF makes an implicit assumption that the nearest neighbors found in the original feature space are the ones in the weighted space and RELIEF lacks a mechanism to deal with outlier data. We propose an iterative RELIEF (I-RELIEF) algorithm to alleviate the deficiencies of RELIEF by exploring the framework of the Expectation-Maximization algorithm. We extend I-RELIEF to multiclass settings by using a new multiclass margin definition. To reduce computational costs, an online learning algorithm is also developed. Convergence analysis of the proposed algorithms is presented. The results of large-scale experiments on the UCI and microarray data sets are reported, which demonstrate the effectiveness of the proposed algorithms, and verify the presented theoretical results.

Entities:  

Mesh:

Year:  2007        PMID: 17431301     DOI: 10.1109/TPAMI.2007.1093

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  33 in total

1.  Cancer progression modeling using static sample data.

Authors:  Yijun Sun; Jin Yao; Norma J Nowak; Steve Goodison
Journal:  Genome Biol       Date:  2014-08-26       Impact factor: 13.583

2.  Human communication dynamics in digital footsteps: a study of the agreement between self-reported ties and email networks.

Authors:  Stefan Wuchty; Brian Uzzi
Journal:  PLoS One       Date:  2011-11-17       Impact factor: 3.240

3.  Detecting biomarkers from microarray data using distributed correlation based gene selection.

Authors:  Alok Kumar Shukla; Diwakar Tripathi
Journal:  Genes Genomics       Date:  2020-02-10       Impact factor: 1.839

Review 4.  Derivation of cancer diagnostic and prognostic signatures from gene expression data.

Authors:  Steve Goodison; Yijun Sun; Virginia Urquidi
Journal:  Bioanalysis       Date:  2010-05       Impact factor: 2.681

5.  Benchmarking relief-based feature selection methods for bioinformatics data mining.

Authors:  Ryan J Urbanowicz; Randal S Olson; Peter Schmitt; Melissa Meeker; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-17       Impact factor: 6.317

Review 6.  Relief-based feature selection: Introduction and review.

Authors:  Ryan J Urbanowicz; Melissa Meeker; William La Cava; Randal S Olson; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-18       Impact factor: 6.317

7.  Derivation of molecular signatures for breast cancer recurrence prediction using a two-way validation approach.

Authors:  Yijun Sun; Virginia Urquidi; Steve Goodison
Journal:  Breast Cancer Res Treat       Date:  2009-03-17       Impact factor: 4.872

8.  Optimizing molecular signatures for predicting prostate cancer recurrence.

Authors:  Yijun Sun; Steve Goodison
Journal:  Prostate       Date:  2009-07-01       Impact factor: 4.104

9.  Feature weight estimation for gene selection: a local hyperlinear learning approach.

Authors:  Hongmin Cai; Peiying Ruan; Michael Ng; Tatsuya Akutsu
Journal:  BMC Bioinformatics       Date:  2014-03-14       Impact factor: 3.169

10.  A decision support system to improve medical diagnosis using a combination of k-medoids clustering based attribute weighting and SVM.

Authors:  Musa Peker
Journal:  J Med Syst       Date:  2016-03-21       Impact factor: 4.460

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.