Literature DB >> 30205662

A variable selection approach in the multivariate linear model: an application to LC-MS metabolomics data.

Marie Perrot-Dockès1, Céline Lévy-Leduc1, Julien Chiquet1, Laure Sansonnet1, Margaux Brégère1, Marie-Pierre Étienne1, Stéphane Robin1, Grégory Genta-Jouve2.   

Abstract

Omic data are characterized by the presence of strong dependence structures that result either from data acquisition or from some underlying biological processes. Applying statistical procedures that do not adjust the variable selection step to the dependence pattern may result in a loss of power and the selection of spurious variables. The goal of this paper is to propose a variable selection procedure within the multivariate linear model framework that accounts for the dependence between the multiple responses. We shall focus on a specific type of dependence which consists in assuming that the responses of a given individual can be modelled as a time series. We propose a novel Lasso-based approach within the framework of the multivariate linear model taking into account the dependence structure by using different types of stationary processes covariance structures for the random error matrix. Our numerical experiments show that including the estimation of the covariance matrix of the random error matrix in the Lasso criterion dramatically improves the variable selection performance. Our approach is successfully applied to an untargeted LC-MS (Liquid Chromatography-Mass Spectrometry) data set made of African copals samples. Our methodology is implemented in the R package MultiVarSel which is available from the Comprehensive R Archive Network (CRAN).

Keywords:  metabolomics; multivariate linear model; time series; variable selection

Mesh:

Substances:

Year:  2018        PMID: 30205662     DOI: 10.1515/sagmb-2017-0077

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  2 in total

1.  Feature selection for kernel methods in systems biology.

Authors:  Céline Brouard; Jérôme Mariette; Rémi Flamary; Nathalie Vialaneix
Journal:  NAR Genom Bioinform       Date:  2022-03-07

Review 2.  The metaRbolomics Toolbox in Bioconductor and beyond.

Authors:  Jan Stanstrup; Corey D Broeckling; Rick Helmus; Nils Hoffmann; Ewy Mathé; Thomas Naake; Luca Nicolotti; Kristian Peters; Johannes Rainer; Reza M Salek; Tobias Schulze; Emma L Schymanski; Michael A Stravs; Etienne A Thévenot; Hendrik Treutler; Ralf J M Weber; Egon Willighagen; Michael Witting; Steffen Neumann
Journal:  Metabolites       Date:  2019-09-23
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.