MOTIVATION: Feature selection is one of the main challenges in analyzing high-throughput genomic data. Minimum redundancy maximum relevance (mRMR) is a particularly fast feature selection method for finding a set of both relevant and complementary features. Here we describe the mRMRe R package, in which the mRMR technique is extended by using an ensemble approach to better explore the feature space and build more robust predictors. To deal with the computational complexity of the ensemble approach, the main functions of the package are implemented and parallelized in C using the openMP Application Programming Interface. RESULTS: Our ensemble mRMR implementations outperform the classical mRMR approach in terms of prediction accuracy. They identify genes more relevant to the biological context and may lead to richer biological interpretations. The parallelized functions included in the package show significant gains in terms of run-time speed when compared with previously released packages. AVAILABILITY: The R package mRMRe is available on Comprehensive R Archive Network and is provided open source under the Artistic-2.0 License. The code used to generate all the results reported in this application note is available from Supplementary File 1. CONTACT: bhaibeka@ircm.qc.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Feature selection is one of the main challenges in analyzing high-throughput genomic data. Minimum redundancy maximum relevance (mRMR) is a particularly fast feature selection method for finding a set of both relevant and complementary features. Here we describe the mRMRe R package, in which the mRMR technique is extended by using an ensemble approach to better explore the feature space and build more robust predictors. To deal with the computational complexity of the ensemble approach, the main functions of the package are implemented and parallelized in C using the openMP Application Programming Interface. RESULTS: Our ensemble mRMR implementations outperform the classical mRMR approach in terms of prediction accuracy. They identify genes more relevant to the biological context and may lead to richer biological interpretations. The parallelized functions included in the package show significant gains in terms of run-time speed when compared with previously released packages. AVAILABILITY: The R package mRMRe is available on Comprehensive R Archive Network and is provided open source under the Artistic-2.0 License. The code used to generate all the results reported in this application note is available from Supplementary File 1. CONTACT: bhaibeka@ircm.qc.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: George Adam; Ladislav Rampášek; Zhaleh Safikhani; Petr Smirnov; Benjamin Haibe-Kains; Anna Goldenberg Journal: NPJ Precis Oncol Date: 2020-06-15
Authors: Thibaud P Coroller; Patrick Grossmann; Ying Hou; Emmanuel Rios Velazquez; Ralph T H Leijenaar; Gretchen Hermann; Philippe Lambin; Benjamin Haibe-Kains; Raymond H Mak; Hugo J W L Aerts Journal: Radiother Oncol Date: 2015-03-04 Impact factor: 6.280
Authors: Daniel S Tylee; Jonathan L Hess; Thomas P Quinn; Rahul Barve; Hailiang Huang; Yanli Zhang-James; Jeffrey Chang; Boryana S Stamova; Frank R Sharp; Irva Hertz-Picciotto; Stephen V Faraone; Sek Won Kong; Stephen J Glatt Journal: Am J Med Genet B Neuropsychiatr Genet Date: 2016-11-11 Impact factor: 3.568