The clinical consequences of variable selection in multiple regression models: a case study of the Norwegian Opioid Maintenance Treatment program.

Marianne Riksheim Stavseth, Thomas Clausen, Jo Røislien.

Abstract

Background: Selecting which variables to include in multiple regression models is a pervasive problem in medical research.
Objectives: Based on questionnaire data (n = 18538, 69.9% men) from the Norwegian Opioid Maintenance Treatment Program, this study aims to compare the performance of different variable selection methods and the potential clinical consequences of choice of method. The effect of missing data is also explored.
Methods: The dependent variable was engagement in criminal behavior while in treatment. Twenty-nine potential covariates on demographics, psychosocial factors and drug use were tested for inclusion in a multiple logistic regression model. Both complete case and multiply imputed data were considered. We compared the results from variable selection methods ranging from expert-based and purposeful variable selection, through stepwise methods, to more recently developed penalized regression using the Least Absolute Shrinkage and Selection Operator (LASSO).
Results: The various variable selection methods resulted in regression models including from 9 to 22 covariates. The stepwise selection procedures generated the models with the most covariates included. The choice of variable selection method directly affected the estimated regression coefficients, both in effect size and statistical significance. For several variables the expert-based approach disagreed with all data-driven methods.
Conclusions: The choice of variable selection method may strongly affect the resulting regression model, along with accompanying effect sizes and confidence intervals. This may affect clinical conclusions. The process should consequently be given sufficient consideration in model building. We recommend combining expert knowledge with a data-driven variable selection method to explore the models' robustness.
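Illustration: the abstract does not include code, so the following is only a minimal sketch of how a LASSO-penalized logistic regression can drop covariates, and of how the selected set depends on the penalty strength. It is not the authors' implementation. The data are synthetic (six hypothetical covariates, of which only the first two affect the outcome), the fitting routine is a plain proximal gradient loop, and the penalty values, step size, and function names (`lasso_logistic`, `selected`) are assumptions chosen for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(v, t):
    # Proximal operator of the L1 penalty: shrink v toward zero by t.
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def lasso_logistic(X, y, lam, step=0.5, iters=300):
    """Fit an L1-penalized logistic regression by proximal gradient descent.

    Minimizes the mean log-loss plus lam * sum(|w_j|).
    The intercept b is left unpenalized.
    """
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(iters):
        # Residuals: predicted probability minus observed outcome.
        resid = [sigmoid(b + sum(wj * xj for wj, xj in zip(w, row))) - yi
                 for row, yi in zip(X, y)]
        grad_b = sum(resid) / n
        grad_w = [sum(r * row[j] for r, row in zip(resid, X)) / n
                  for j in range(p)]
        b -= step * grad_b
        # Gradient step followed by soft-thresholding (the LASSO shrinkage).
        w = [soft_threshold(wj - step * gj, step * lam)
             for wj, gj in zip(w, grad_w)]
    return w, b


def selected(w):
    # Indices of covariates whose coefficient was not shrunk to exactly zero.
    return [j for j, wj in enumerate(w) if wj != 0.0]


# Synthetic data (assumption): only covariates 0 and 1 drive the outcome.
rng = random.Random(0)
n, p = 300, 6
X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [1 if rng.random() < sigmoid(2.0 * row[0] - 1.5 * row[1]) else 0
     for row in X]

# Fit with a weak, a moderate, and a very strong penalty (arbitrary values).
fits = {}
for lam in (0.01, 0.1, 5.0):
    w, _ = lasso_logistic(X, y, lam)
    fits[lam] = w
    print(f"lambda={lam}: selected covariates {selected(w)}")
```

A stronger penalty shrinks more coefficients to exactly zero, so the selected set changes with the penalty. In practice the penalty is usually chosen by cross-validation, and a sketch like this says nothing about the complete case versus multiply imputed comparison made in the study.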

Keywords:  Logistic regression; crime; missing data; opioid maintenance treatment; variable selection

Year:  2019        PMID: 31603346     DOI: 10.1080/00952990.2019.1648484

Source DB:  PubMed          Journal:  Am J Drug Alcohol Abuse        ISSN: 0095-2990            Impact factor:   3.829


  2 in total

1.  Correlates of days of medication for opioid use disorder exposure among people living with HIV in Northern Vietnam.

Authors:  Dana Button; Ryan Cook; Caroline King; Tong Thi Khuyen; Lynn Kunkel; Gavin Bart; Dinh Thanh Thuy; Diep Bich Nguyen; Christopher K Blazes; Le Minh Giang; P Todd Korthuis
Journal:  Int J Drug Policy       Date:  2021-11-09

2.  Determinants of suboptimal immune recovery among a Chinese Yi ethnicity population with sustained HIV suppression.

Authors:  Liyu Chen; Chang-Hai Liu; Shuang Kang; Lingyao Du; Fanghua Ma; Changmin Li; Lang Bai; Hong Li; Hong Tang
Journal:  BMC Infect Dis       Date:  2022-02-08       Impact factor: 3.090