Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A Distributed Feature Selection Algorithm Based on Distance Correlation with an Application to Microarrays.

Literature DB >> 29993889

A Distributed Feature Selection Algorithm Based on Distance Correlation with an Application to Microarrays.

Aida Brankovic, Marjan Hosseini, Luigi Piroddi.

Abstract

DNA microarray datasets are characterized by a large number of features with very few samples, which is a typical cause of overfitting and poor generalization in the classification task. Here, we introduce a novel feature selection (FS) approach which employs the distance correlation (dCor) as a criterion for evaluating the dependence of the class on a given feature subset. The dCor index provides a reliable dependence measure among random vectors of arbitrary dimension, without any assumption on their distribution. Moreover, it is sensitive to the presence of redundant terms. The proposed FS method is based on a probabilistic representation of the feature subset model, which is progressively refined by a repeated process of model extraction and evaluation. A key element of the approach is a distributed optimization scheme based on a vertical partitioning of the dataset, which alleviates the negative effects of its unbalanced dimensions. The proposed method has been tested on several microarray datasets, resulting in quite compact and accurate models obtained at a reasonable computational cost.

Entities: Gene

Mesh：

Substances：
Biomarkers, Tumor

Year: 2018 PMID： 29993889 DOI： 10.1109/TCBB.2018.2833482

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN： 1545-5963 Impact factor: 3.710

Keyword Cloud
Cited

2 in total

1. Use of radiomics based on ¹⁸F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach.

Authors: Yi Zhou; Xue-Lei Ma; Ting Zhang; Jian Wang; Tao Zhang; Rong Tian
Journal: Eur J Nucl Med Mol Imaging Date: 2021-02-05 Impact factor: 9.236

2. An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.

Authors: Jamshid Pirgazi; Mohsen Alimoradi; Tahereh Esmaeili Abharian; Mohammad Hossein Olyaee
Journal: Sci Rep Date: 2019-12-09 Impact factor: 4.379

2 in total

A Distributed Feature Selection Algorithm Based on Distance Correlation with an Application to Microarrays.

1. Use of radiomics based on 18F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach.

2. An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.

1. Use of radiomics based on ¹⁸F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach.