Literature DB >> 29990221

Cost-Sensitive Feature Selection by Optimizing F-Measures.

.   

Abstract

Feature selection is beneficial for improving the performance of general machine learning tasks by extracting an informative subset from the high-dimensional features. Conventional feature selection methods usually ignore the class imbalance problem, thus the selected features will be biased towards the majority class. Considering that F-measure is a more reasonable performance measure than accuracy for imbalanced data, this paper presents an effective feature selection algorithm that explores the class imbalance issue by optimizing F-measures. Since F-measure optimization can be decomposed into a series of cost-sensitive classification problems, we investigate the cost-sensitive feature selection by generating and assigning different costs to each class with rigorous theory guidance. After solving a series of cost-sensitive feature selection problems, features corresponding to the best F-measure will be selected. In this way, the selected features will fully represent the properties of all classes. Experimental results on popular benchmarks and challenging real-world data sets demonstrate the significance of cost-sensitive feature selection for the imbalanced data setting and validate the effectiveness of the proposed method.

Year:  2017        PMID: 29990221     DOI: 10.1109/TIP.2017.2781298

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  2 in total

1.  A semi-supervised model to predict regulatory effects of genetic variants at single nucleotide resolution using massively parallel reporter assays.

Authors:  Zikun Yang; Chen Wang; Stephanie Erjavec; Lynn Petukhova; Angela Christiano; Iuliana Ionita-Laza
Journal:  Bioinformatics       Date:  2021-01-30       Impact factor: 6.937

2.  Individual-patient prediction of meningioma malignancy and survival using the Surveillance, Epidemiology, and End Results database.

Authors:  Jeremy T Moreau; Todd C Hankinson; Sylvain Baillet; Roy W R Dudley
Journal:  NPJ Digit Med       Date:  2020-01-30
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.