Literature DB >> 18048398

Monte Carlo feature selection for supervised classification.

Michal Draminski1, Alvaro Rada-Iglesias, Stefan Enroth, Claes Wadelius, Jacek Koronacki, Jan Komorowski.   

Abstract

MOTIVATION: Pre-selection of informative features for supervised classification is a crucial, albeit delicate, task. It is desirable that feature selection provides the features that contribute most to the classification task per se and which should therefore be used by any classifier later used to produce classification rules. In this article, a conceptually simple but computer-intensive approach to this task is proposed. The reliability of the approach rests on multiple construction of a tree classifier for many training sets randomly chosen from the original sample set, where samples in each training set consist of only a fraction of all of the observed features.
RESULTS: The resulting ranking of features may then be used to advantage for classification via a classifier of any type. The approach was validated using Golub et al. leukemia data and the Alizadeh et al. lymphoma data. Not surprisingly, we obtained a significantly different list of genes. Biological interpretation of the genes selected by our method showed that several of them are involved in precursors to different types of leukemia and lymphoma rather than being genes that are common to several forms of cancers, which is the case for the other methods. AVAILABILITY: Prototype available upon request.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18048398     DOI: 10.1093/bioinformatics/btm486

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  63 in total

1.  Comprehensive detection and discrimination of Campylobacter species by use of confocal micro-Raman spectroscopy and multilocus sequence typing.

Authors:  Xiaonan Lu; Qian Huang; William G Miller; D Eric Aston; Jie Xu; Feng Xue; Hongwei Zhang; Barbara A Rasco; Shuo Wang; Michael E Konkel
Journal:  J Clin Microbiol       Date:  2012-06-27       Impact factor: 5.948

2.  MIST: Maximum Information Spanning Trees for dimension reduction of biological data sets.

Authors:  Bracken M King; Bruce Tidor
Journal:  Bioinformatics       Date:  2009-03-04       Impact factor: 6.937

3.  Discriminating cirRNAs from other lncRNAs using a hierarchical extreme learning machine (H-ELM) algorithm with feature selection.

Authors:  Lei Chen; Yu-Hang Zhang; Guohua Huang; Xiaoyong Pan; ShaoPeng Wang; Tao Huang; Yu-Dong Cai
Journal:  Mol Genet Genomics       Date:  2017-09-14       Impact factor: 3.291

4.  A Rough Set-Based Model of HIV-1 Reverse Transcriptase Resistome.

Authors:  Marcin Kierczak; Krzysztof Ginalski; Michał Dramiński; Jacek Koronacki; Witold Rudnicki; Jan Komorowski
Journal:  Bioinform Biol Insights       Date:  2009-10-05

5.  Identification of leukemia stem cell expression signatures through Monte Carlo feature selection strategy and support vector machine.

Authors:  JiaRui Li; Lin Lu; Yu-Hang Zhang; YaoChen Xu; Min Liu; KaiYan Feng; Lei Chen; XiangYin Kong; Tao Huang; Yu-Dong Cai
Journal:  Cancer Gene Ther       Date:  2019-05-29       Impact factor: 5.987

6.  Win percentage: a novel measure for assessing the suitability of machine classifiers for biological problems.

Authors:  R Mitchell Parry; John H Phan; May D Wang
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

7.  Computational Analysis of Molecular Interaction Networks Underlying Change of HIV-1 Resistance to Selected Reverse Transcriptase Inhibitors.

Authors:  Marcin Kierczak; Michał Dramiński; Jacek Koronacki; Jan Komorowski
Journal:  Bioinform Biol Insights       Date:  2010-12-12

8.  Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments.

Authors:  Muhammad Hamraz; Naz Gul; Mushtaq Raza; Dost Muhammad Khan; Umair Khalil; Seema Zubair; Zardad Khan
Journal:  PeerJ Comput Sci       Date:  2021-06-01

9.  Stable feature selection and classification algorithms for multiclass microarray data.

Authors:  Sebastian Student; Krzysztof Fujarewicz
Journal:  Biol Direct       Date:  2012-10-02       Impact factor: 4.540

10.  Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition.

Authors:  Michal Marczyk; Roman Jaksik; Andrzej Polanski; Joanna Polanska
Journal:  BMC Bioinformatics       Date:  2013-03-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.