Thanthirige Lakshika Maduwanthi Ruberu, Emily A Kenyon, Karen A Hudson, Francesca Filbey, Sarah W Feldstein Ewing, Swati Biswas, Pankaj K Choudhary.
Abstract
For some, substance use during adolescence may be a stepping stone on the way to substance use disorders in adulthood. Risk prediction models may help identify adolescent users at elevated risk for hazardous substance use. This preliminary analysis used cross-sectional data (n = 270, ages 13-18) from the baseline dataset of a randomized controlled trial intervening with adolescent alcohol and/or cannabis use. Models were developed for jointly predicting quantitative scores on three measures of hazardous substance use (Rutgers Alcohol Problems Index, Adolescent Cannabis Problems Questionnaire, and Hooked on Nicotine Checklist) based on personal risk factors using two statistical and machine learning methods: multivariate covariance generalized linear models (MCGLM) and penalized multivariate regression with a lasso penalty. The predictive accuracy of each model was evaluated using root mean squared error computed via leave-one-out cross-validation. The final proposed model was an MCGLM. It has eleven risk factors: age, early life stress, age of first tobacco use, age of first cannabis use, lifetime use of other substances, age of first use of other substances, maternal education, parental attachment, family cigarette use, family history of hazardous alcohol use, and family history of hazardous cannabis use. Different subsets of these risk factors feature in the three outcome-specific components of this joint model. The quantitative risk estimate provided by the proposed model may help identify adolescent users of cannabis, alcohol, and tobacco who may be at an elevated risk of developing hazardous substance use.
Keywords: Adolescents; Machine learning; MCGLM; Multiple outcomes; Multivariate lasso; Risk prediction; Statistical learning; Substance use
Abbreviations: CPQ-A, Adolescent Cannabis Problems Questionnaire; HONC, Hooked on Nicotine Checklist; LOOCV, Leave-one-out cross-validation; MCGLM, Multivariate Covariance Generalized Linear Model; RAPI, Rutgers Alcohol Problems Index; RMSE, Root mean squared error; SD, Standard deviation; SE, Standard error
Year: 2021 PMID: 35127353 PMCID: PMC8800066 DOI: 10.1016/j.pmedr.2021.101674
Source DB: PubMed Journal: Prev Med Rep ISSN: 2211-3355
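The evaluation criterion named in the abstract, RMSE computed via leave-one-out cross-validation (LOOCV), can be illustrated with a minimal sketch. The sketch makes several assumptions that are not taken from the paper: it uses ordinary least squares on a single outcome as a stand-in for the MCGLM and multivariate lasso fits, the data are synthetic, and all function names (`fit_ols`, `loocv_rmse`, and so on) are hypothetical. For several outcomes, the same function could be applied to each outcome in turn.

```python
import math
import random


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    # Work on an augmented copy so the inputs are left untouched.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_ols(xs, ys):
    """Least-squares coefficients (intercept first) via the normal equations.

    Stand-in model only: the paper fits MCGLM and a multivariate lasso.
    """
    rows = [[1.0] + list(x) for x in xs]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(p)]
    return solve_linear_system(xtx, xty)


def predict(beta, x):
    """Linear prediction: intercept plus the weighted sum of predictors."""
    return beta[0] + sum(b * v for b, v in zip(beta[1:], x))


def loocv_rmse(xs, ys):
    """RMSE of held-out predictions, leaving out one observation at a time."""
    squared_errors = []
    for i in range(len(xs)):
        # Refit on all observations except the i-th, then predict the i-th.
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        beta = fit_ols(train_x, train_y)
        squared_errors.append((ys[i] - predict(beta, xs[i])) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Synthetic illustration (not study data): two made-up predictors, one outcome.
rng = random.Random(0)
xs = [[rng.uniform(13, 18), rng.uniform(0, 5)] for _ in range(40)]
ys = [1.0 + 0.5 * x[0] + 2.0 * x[1] + rng.gauss(0, 1) for x in xs]
rmse = loocv_rmse(xs, ys)
print(f"LOOCV RMSE on synthetic data: {rmse:.3f}")
```

Because each observation is predicted by a model that never saw it, the resulting RMSE estimates out-of-sample error, which is what allows candidate models to be compared on predictive accuracy rather than in-sample fit.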
Summary of results for the joint MCGLM model.
| Variable | Hazardous alcohol use (RAPI): Estimate (SE) | Hazardous alcohol use (RAPI): P-value | Hazardous cannabis use (CPQ-A): Estimate (SE) | Hazardous cannabis use (CPQ-A): P-value | Hazardous tobacco use (HONC): Estimate (SE) | Hazardous tobacco use (HONC): P-value |
|---|---|---|---|---|---|---|
| Intercept | 0.196 (1.26) | 0.876 | 2.406 (0.41) | <0.001 | −0.051 (9.77) | 0.996 |
| Age | 0.214 (0.08) | 0.007 | 0.914 (0.17) | <0.001 | ||
| Maternal education | −0.068 (0.02) | 0.002 | ||||
| Parental attachment | −0.021 (0.01) | 0.083 | −0.074 (0.04) | 0.096 | ||
| Early life stress | 0.354 (0.13) | 0.005 | 1.178 (0.46) | 0.010 | ||
| Lifetime use of other substances | 2.557 (0.76) | 0.001 | 0.701 (0.17) | <0.001 | ||
| Age of first use of other substances | −0.138 (0.05) | 0.008 | ||||
| Age of first use of cannabis | 0.051 (0.02) | 0.034 | −0.174 (0.09) | 0.051 | ||
| Age of first use of tobacco | −0.095 (0.03) | 0.005 | −0.065 (0.02) | 0.002 | −3.212 (2.80) | 0.251 |
| (Age of first use of tobacco)^2 | 0.375 (0.26) | 0.144 | ||||
| (Age of first use of tobacco)^3 | −0.014 (0.01) | 0.067 | ||||
| Family use of cigarette | 1.748 (0.47) | <0.001 | ||||
| Family history of hazardous alcohol use | −0.402 (0.19) | 0.039 | ||||
| Family history of hazardous cannabis use | 0.589 (0.19) | 0.002 | −0.925 (0.40) | 0.022 | ||
Estimated coefficients for the variables retained by the multivariate lasso model.
| Variable | Hazardous alcohol use (RAPI) | Hazardous cannabis use (CPQ-A) | Hazardous tobacco use (HONC) |
|---|---|---|---|
| Intercept | 1.138 | 2.428 | −2.968 |
| Age | 0.108 | −0.023 | 0.450 |
| Parental attachment | −0.024 | −0.016 | −0.045 |
| Early life stress | 0.124 | 0.263 | 0.949 |
| Lifetime use of other substances | 0.500 | 0.431 | 1.048 |
| Age of first use of other substances | 0.000† | 0.010 | 0.037 |
| Age of first use of cannabis | −0.002 | 0.016 | −0.082 |
| Age of first use of tobacco | 0.000 | 0.000 | 0.000 |
| (Age of first use of tobacco)^2 | 0.000 | 0.000 | 0.000 |
| (Age of first use of tobacco)^3 | 0.000† | 0.000† | −0.001 |
| Family use of cigarette | 0.323 | 0.141 | 1.315 |
| Family history of hazardous cannabis use | 0.283 | −0.005 | −0.506 |
† Non-zero coefficient with absolute value < 0.001.
Fig. 1. Distributions of the predicted scores estimated using MCGLM. The corresponding means (SDs) are 9.64 (3.58) for RAPI, 7.03 (2.00) for CPQ-A, and 4.18 (1.88) for HONC.