
Efficient Exploration of Many Variables and Interactions Using Regularized Regression.

Tyson S Barrett, Ginger Lockhart.

Abstract

The prevention sciences often face several situations that can compromise the statistical power and validity of a study. Among these, research can (1) have data with many variables, sometimes with low sample sizes, (2) have highly correlated predictors, (3) have unclear theory or empirical evidence related to the research questions, and/or (4) have difficulty selecting the proper covariates in observational studies. Modeling in these situations is difficult, and at times impossible, with conventional methods. Fortunately, regularized regression, a machine learning technique, can aid in exploring datasets that are otherwise difficult to analyze, allowing researchers to draw insights from these data. Although many of these methods have existed for several decades, prevention researchers rarely use them. As a gentle introduction, we discuss the utility of regularized regression to the field of prevention science and apply the technique to a real dataset. The data (n = 7979) for the demonstration consisted of 76 variables (151 including the modeled interactions) from the Youth Risk-Behavior Surveillance System (YRBSS) from 2015. Overall, it is clear that regularized regression can be an important tool in analyzing and gaining insight from data in the prevention sciences.
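
To make the idea of exploring many variables and interactions concrete, here is a minimal pure-Python sketch of one form of regularized regression, the lasso, fitted by cyclic coordinate descent. It is an illustration only, not the authors' analysis: the data are synthetic rather than YRBSS, the helper names (`soft_threshold`, `standardize`, `lasso_cd`) are hypothetical, and the penalty `lam=0.3` is an arbitrary choice that would normally be tuned by cross-validation.

```python
import random
from itertools import combinations

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values within gamma of zero become 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def standardize(col):
    """Center a column and scale it to unit (population) variance."""
    n = len(col)
    mean = sum(col) / n
    sd = (sum((v - mean) ** 2 for v in col) / n) ** 0.5
    return [(v - mean) / sd for v in col]

def lasso_cd(columns, y, lam, n_iter=50):
    """Minimize (1/2n)*||y - Xb||^2 + lam*||b||_1 by cyclic coordinate descent.

    Assumes every column is standardized and y is centered.
    """
    n = len(y)
    beta = [0.0] * len(columns)
    resid = list(y)  # residual equals y while all coefficients are zero
    for _ in range(n_iter):
        for j, col in enumerate(columns):
            # Covariance of column j with the residual, adding back its own fit.
            rho = sum(c * r for c, r in zip(col, resid)) / n + beta[j]
            new = soft_threshold(rho, lam)
            delta = new - beta[j]
            if delta != 0.0:
                resid = [r - delta * c for r, c in zip(resid, col)]
                beta[j] = new
    return beta

# Synthetic data (an assumption for illustration): 6 predictors, 400 rows.
random.seed(1)
n, p = 400, 6
X = [[random.gauss(0, 1) for _ in range(n)] for _ in range(p)]

# Candidate terms: 6 main effects plus all 15 pairwise interactions.
names = [f"x{j}" for j in range(p)]
columns = list(X)
for a, b in combinations(range(p), 2):
    names.append(f"x{a}:x{b}")
    columns.append([X[a][i] * X[b][i] for i in range(n)])

# Only one main effect and one interaction truly matter.
y = [2.0 * X[0][i] - 1.5 * X[1][i] * X[2][i] + random.gauss(0, 1)
     for i in range(n)]
y_mean = sum(y) / n
y_centered = [v - y_mean for v in y]

columns = [standardize(c) for c in columns]
beta = lasso_cd(columns, y_centered, lam=0.3)

# Terms whose coefficients were not shrunk exactly to zero.
selected = {name for name, b in zip(names, beta) if b != 0.0}
print(sorted(selected))
```

The penalty shrinks most of the 21 candidate coefficients exactly to zero, which is what allows this kind of model to screen many main effects and interactions at once. In applied work an established implementation with cross-validated penalty selection would be preferable to this hand-rolled loop.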

Keywords:  Adolescent development; Drug/substance abuse; Health; Regularized regression

Year:  2019        PMID: 30506295     DOI: 10.1007/s11121-018-0963-9

Source DB:  PubMed          Journal:  Prev Sci        ISSN: 1389-4986


