Literature DB >> 29993889

A Distributed Feature Selection Algorithm Based on Distance Correlation with an Application to Microarrays.

Aida Brankovic, Marjan Hosseini, Luigi Piroddi.   

Abstract

DNA microarray datasets are characterized by a large number of features with very few samples, which is a typical cause of overfitting and poor generalization in the classification task. Here, we introduce a novel feature selection (FS) approach which employs the distance correlation (dCor) as a criterion for evaluating the dependence of the class on a given feature subset. The dCor index provides a reliable dependence measure among random vectors of arbitrary dimension, without any assumption on their distribution. Moreover, it is sensitive to the presence of redundant terms. The proposed FS method is based on a probabilistic representation of the feature subset model, which is progressively refined by a repeated process of model extraction and evaluation. A key element of the approach is a distributed optimization scheme based on a vertical partitioning of the dataset, which alleviates the negative effects of its unbalanced dimensions. The proposed method has been tested on several microarray datasets, resulting in quite compact and accurate models obtained at a reasonable computational cost.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29993889     DOI: 10.1109/TCBB.2018.2833482

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  Use of radiomics based on 18F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach.

Authors:  Yi Zhou; Xue-Lei Ma; Ting Zhang; Jian Wang; Tao Zhang; Rong Tian
Journal:  Eur J Nucl Med Mol Imaging       Date:  2021-02-05       Impact factor: 9.236

2.  An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.

Authors:  Jamshid Pirgazi; Mohsen Alimoradi; Tahereh Esmaeili Abharian; Mohammad Hossein Olyaee
Journal:  Sci Rep       Date:  2019-12-09       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.