Literature DB >> 23440969

EMLasso: logistic lasso with missing data.

N Sabbe1, O Thas, J-P Ottoy.   

Abstract

In clinical settings, missing data in the covariates occur frequently. For example, some markers are expensive or hard to measure. When this sort of data is used for model selection, the missingness is often resolved through a complete case analysis or a form of single imputation. An alternative sometimes comes in the form of leaving the most damaged covariates out. All these strategies jeopardise the goal of model selection. In earlier work, we have applied the logistic Lasso in combination with multiple imputation to obtain results in such settings, but we only provided heuristic arguments to advocate the method. In this paper, we propose an improved method that builds on firm statistical arguments and that is developed along the lines of the stochastic expectation-maximisation algorithm. We show that our method can be used to handle missing data in both categorical and continuous predictors, as well as in a nonpenalised regression. We demonstrate the method by applying it to data of 273 lung cancer patients. The objective is to select a model for the prediction of acute dysphagia, starting from a large set of potential predictors, including clinical and treatment covariates as well as a set of single-nucleotide polymorphisms.
Copyright © 2013 John Wiley & Sons, Ltd.

Entities:  

Keywords:  EM; Lasso; missing data; model selection

Mesh:

Substances:

Year:  2013        PMID: 23440969     DOI: 10.1002/sim.5760

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  6 in total

Review 1.  Radiogenomics: Identification of Genomic Predictors for Radiation Toxicity.

Authors:  Barry S Rosenstein
Journal:  Semin Radiat Oncol       Date:  2017-10       Impact factor: 5.934

Review 2.  The Prediction of Radiotherapy Toxicity Using Single Nucleotide Polymorphism-Based Models: A Step Toward Prevention.

Authors:  Sarah L Kerns; Suman Kundu; Jung Hun Oh; Sandeep K Singhal; Michelle Janelsins; Lois B Travis; Joseph O Deasy; A Cecile J E Janssens; Harry Ostrer; Matthew Parliament; Nawaid Usmani; Barry S Rosenstein
Journal:  Semin Radiat Oncol       Date:  2015-05-15       Impact factor: 5.934

3.  Evolutionary methods for variable selection in the epidemiological modeling of cardiovascular diseases.

Authors:  Christina Brester; Jussi Kauhanen; Tomi-Pekka Tuomainen; Sari Voutilainen; Mauno Rönkkö; Kimmo Ronkainen; Eugene Semenkin; Mikko Kolehmainen
Journal:  BioData Min       Date:  2018-08-14       Impact factor: 2.522

4.  In silico Prediction on the PI3K/AKT/mTOR Pathway of the Antiproliferative Effect of O. joconostle in Breast Cancer Models.

Authors:  Alejandra Ortiz-González; Pedro Pablo González-Pérez; Maura Cárdenas-García; María Guadalupe Hernández-Linares
Journal:  Cancer Inform       Date:  2022-03-25

5.  Improved Variable Selection Algorithm Using a LASSO-Type Penalty, with an Application to Assessing Hepatitis B Infection Relevant Factors in Community Residents.

Authors:  Pi Guo; Fangfang Zeng; Xiaomin Hu; Dingmei Zhang; Shuming Zhu; Yu Deng; Yuantao Hao
Journal:  PLoS One       Date:  2015-07-27       Impact factor: 3.240

6.  Association between biomarkers and clinical characteristics in chronic subdural hematoma patients assessed with lasso regression.

Authors:  Are Hugo Pripp; Milo Stanišić
Journal:  PLoS One       Date:  2017-11-06       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.