Literature DB >> 35265835

Feature selection for kernel methods in systems biology.

Céline Brouard1, Jérôme Mariette1, Rémi Flamary2, Nathalie Vialaneix1.   

Abstract

The substantial development of high-throughput biotechnologies has rendered large-scale multi-omics datasets increasingly available. New challenges have emerged to process and integrate this large volume of information, often obtained from widely heterogeneous sources. Kernel methods have proven successful to handle the analysis of different types of datasets obtained on the same individuals. However, they usually suffer from a lack of interpretability since the original description of the individuals is lost due to the kernel embedding. We propose novel feature selection methods that are adapted to the kernel framework and go beyond the well-established work in supervised learning by addressing the more difficult tasks of unsupervised learning and kernel output learning. The method is expressed under the form of a non-convex optimization problem with a ℓ1 penalty, which is solved with a proximal gradient descent approach. It is tested on several systems biology datasets and shows good performances in selecting relevant and less redundant features compared to existing alternatives. It also proved relevant for identifying important governmental measures best explaining the time series of Covid-19 reproducing number evolution during the first months of 2020. The proposed feature selection method is embedded in the R package mixKernel version 0.8, published on CRAN. Installation instructions are available at http://mixkernel.clementine.wf/.
© The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2022        PMID: 35265835      PMCID: PMC8900155          DOI: 10.1093/nargab/lqac014

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  22 in total

1.  A structural alignment kernel for protein structures.

Authors:  Jian Qiu; Martial Hue; Asa Ben-Hur; Jean-Philippe Vert; William Stafford Noble
Journal:  Bioinformatics       Date:  2007-01-18       Impact factor: 6.937

2.  Joint embedding learning and sparse regression: a framework for unsupervised feature selection.

Authors:  Chenping Hou; Feiping Nie; Xuelong Li; Dongyun Yi; Yi Wu
Journal:  IEEE Trans Cybern       Date:  2013-07-22       Impact factor: 11.448

3.  Adaptive Unsupervised Feature Selection With Structure Regularization.

Authors:  Minnan Luo; Feiping Nie; Xiaojun Chang; Yi Yang; Alexander G Hauptmann; Qinghua Zheng
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2017-01-27       Impact factor: 10.451

4.  A variable selection approach in the multivariate linear model: an application to LC-MS metabolomics data.

Authors:  Marie Perrot-Dockès; Céline Lévy-Leduc; Julien Chiquet; Laure Sansonnet; Margaux Brégère; Marie-Pierre Étienne; Stéphane Robin; Grégory Genta-Jouve
Journal:  Stat Appl Genet Mol Biol       Date:  2018-09-08

5.  Unsupervised multiple kernel learning for heterogeneous data integration.

Authors:  Jérôme Mariette; Nathalie Villa-Vialaneix
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

6.  Novel aspects of PPARalpha-mediated regulation of lipid and xenobiotic metabolism revealed through a nutrigenomic study.

Authors:  Pascal G P Martin; Hervé Guillou; Frédéric Lasserre; Sébastien Déjean; Annaig Lan; Jean-Marc Pascussi; Magali Sancristobal; Philippe Legrand; Philippe Besse; Thierry Pineau
Journal:  Hepatology       Date:  2007-03       Impact factor: 17.425

7.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

8.  Classification of microarray data using gene networks.

Authors:  Franck Rapaport; Andrei Zinovyev; Marie Dutreix; Emmanuel Barillot; Jean-Philippe Vert
Journal:  BMC Bioinformatics       Date:  2007-02-01       Impact factor: 3.169

9.  Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery.

Authors:  Nora K Speicher; Nico Pfeifer
Journal:  Bioinformatics       Date:  2015-06-15       Impact factor: 6.937

10.  Block HSIC Lasso: model-free biomarker detection for ultra-high dimensional data.

Authors:  Héctor Climente-González; Chloé-Agathe Azencott; Samuel Kaski; Makoto Yamada
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.