
A case study of normalization, missing data and variable selection methods in lipidomics.

M Kujala, J Nevalainen.

Abstract

Lipidomics is an emerging field of science that holds the potential to provide a readout of biomarkers for the early detection of disease. Our objective was to identify an efficient statistical methodology for lipidomics, especially for finding interpretable and predictive biomarkers useful in clinical practice. In two case studies, we address the need for data preprocessing for regression modeling of a binary response. The preprocessing consists of a normalization step, to remove experimental variability, and a multiple imputation step, to make full use of the incompletely observed data with potentially informative missingness. Finally, by cross-validation, we compare stepwise variable selection with penalized regression models on stacked multiply imputed data sets, and we propose a permutation test as a global test of association. Our results show that, depending on the design of the study, these data preprocessing methods modestly improve the precision of classification, and no clear winner emerges among the variable selection methods. Lipidomics profiles are found to be highly important predictors in both case studies.
Copyright © 2014 John Wiley & Sons, Ltd.

Keywords:  left censoring; lipidomics; multiple imputation; normalization; penalized logistic regression; permutation tests
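
The permutation test mentioned in the abstract as a global test of association can be illustrated with a minimal sketch. The abstract does not specify the test statistic; in the paper's setting it would presumably be a cross-validated performance measure of the fitted model. The sketch below substitutes a much simpler stand-in statistic (the largest absolute difference in per-lipid means between the two classes) purely for illustration. The function names `max_mean_difference` and `permutation_test`, the parameters, and the toy data are all hypothetical and are not taken from the paper.

```python
import random
from statistics import mean


def max_mean_difference(X, y):
    """Stand-in statistic (an assumption, not the paper's choice):
    largest absolute difference in per-feature means between classes."""
    cases = [row for row, label in zip(X, y) if label == 1]
    controls = [row for row, label in zip(X, y) if label == 0]
    n_features = len(X[0])
    return max(
        abs(mean(r[j] for r in cases) - mean(r[j] for r in controls))
        for j in range(n_features)
    )


def permutation_test(X, y, statistic, n_permutations=999, seed=0):
    """Global test of association between the rows of X and binary labels y.

    The labels are shuffled to break any link with X, the statistic is
    recomputed on each shuffle, and the p-value is the share of shuffles
    (plus the observed data itself) that reach the observed value.
    """
    rng = random.Random(seed)
    observed = statistic(X, y)
    y_perm = list(y)  # copy so the caller's labels are left untouched
    exceed = 0
    for _ in range(n_permutations):
        rng.shuffle(y_perm)
        if statistic(X, y_perm) >= observed:
            exceed += 1
    # The +1 terms count the observed labelling as one of the permutations.
    p_value = (exceed + 1) / (n_permutations + 1)
    return observed, p_value


# Toy data (hypothetical): 20 cases and 20 controls, 5 "lipids";
# only the first lipid is shifted upwards in the cases.
data_rng = random.Random(42)
y_demo = [1] * 20 + [0] * 20
X_demo = [
    [data_rng.gauss(3.0 if (label == 1 and j == 0) else 0.0, 1.0) for j in range(5)]
    for label in y_demo
]
observed_demo, p_demo = permutation_test(X_demo, y_demo, max_mean_difference)
print(f"observed statistic = {observed_demo:.3f}, permutation p-value = {p_demo:.4f}")
```

Because the statistic is passed in as a function, the same loop could in principle wrap a cross-validated measure from a penalized logistic regression; each permutation would then require refitting the model, which is considerably more expensive than this stand-in.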

Year: 2014   PMID: 25185878   DOI: 10.1002/sim.6296

Source DB: PubMed   Journal: Stat Med   ISSN: 0277-6715   Impact factor: 2.373


  1 in total

1.  Penalized Variable Selection for Lipid-Environment Interactions in a Longitudinal Lipidomics Study.

Authors:  Fei Zhou; Jie Ren; Gengxin Li; Yu Jiang; Xiaoxi Li; Weiqun Wang; Cen Wu
Journal:  Genes (Basel)       Date:  2019-12-03       Impact factor: 4.096

