Literature DB >> 35909578

Health and household environment factors linked with early alcohol use in adolescence: a record-linked, data-driven, longitudinal cohort study.

Amrita Bandyopadhyay1,2, Sinead Brophy1,2,3, Ashley Akbari1,3, Joanne Demmler3, Jonathan Kennedy2, Shantini Paranjothy4,5, Ronan A Lyons1,2,3, Simon Moore4,6.   

Abstract

Introduction: Early alcohol use has significant association with poor health outcomes. Individual risk factors around early alcohol use have been identified, but a holistic, data-driven investigation into health and household environmental factors on early alcohol use is yet to be undertaken.
Objectives: This study aims to investigate the relationship between preceding health events, household exposures and early alcohol use during adolescence using a two-stage data-driven approach.
Methods: In stage one, a study population (N = 1,072) were derived from the Millennium Cohort Study (MCS) Wales (born between 2000-2002). MCS data were first linked with electronic-health records. Factors associated with early (<=eleven years old) alcohol use were identified using feature selection and stepwise logistic regression. In stage two, analogous risk factors from MCS were recreated for whole population (N = 59,231) of children (born between 1998-2002 in the Welsh Demographic Service Dataset) using routine data to predict the alcohol-related health events in hospital or GP records.
Results: Significant risk factors from stage two included poor maternal mental (adjusted odds ratio [aOR] = 1.31) and physical health (aOR = 1.25), living with someone with alcohol-related problem (aOR = 2.16), single-adult household (aOR = 1.45), ever in deprivation (aOR = 1.66), child's high hyperactivity (aOR = 3.57), and conduct disorder (aOR = 3.26). Children with health events, whose health needs are supported (e.g., are taken to the doctor), are at lower risk of early alcohol use.
Conclusion: Health events of the family members and the child can act as modifiable exposures and may therefore inform the development of prevention initiatives. Families with known alcohol problems, living in deprivation, experiencing child behavioural problems and those who are not taken to the doctor are at higher risk of early drinking behaviour and should be prioritised for early years support and interventions to target problem drinking in young people.

Entities:  

Keywords:  adolescent; alcohol; cohort study; data linkage; electronic health records (EHRS)

Mesh:

Year:  2022        PMID: 35909578      PMCID: PMC9284510          DOI: 10.23889/ijpds.v7i1.1717

Source DB:  PubMed          Journal:  Int J Popul Data Sci        ISSN: 2399-4908


Introduction

Alcohol use in childhood is associated with the risk of later alcohol abuse, alcohol dependence [1] and several negative outcomes including poor educational achievement, death and disability [2-5]. Known factors that predict early alcohol use include a child’s hyperactivity and conduct disorder [6, 7], lack of family support, household dysfunction, parental alcohol drinking pattern, parental indifference towards young persons’ alcohol use [8-11] and adverse childhood experiences (ACEs) (e.g., child abuse and parental discord) [12]. Current research has largely focused on the family environment, individual level socio-demographic, neurocognitive, behavioural or emotional features, individually or in combination [13-15]. Although it is known that ACEs have a detrimental impact on a child’s health in early life [16, 17], it is not known whether a child’s own health status is associated with subsequent alcohol use and alcohol-related health outcomes. Child health is a broad term that includes maintaining and protecting physical, mental and social health [18]. Broadly, there are two dominant methodological approaches in the investigation of child alcohol use that are increasingly regarded as complementary [19]. First, survey methodology allows researchers to focus on specific exposures and outcomes, such as volume of alcohol consumed, and to tailor validated [20] instruments to address preconceived study hypothesis [2]. Limitations include relatively small sample size, non-response, selection and volunteer bias [21]. Second, the analysis of routinely collected electronic health records (EHRs) facilitates the inclusion of a greater number of individuals, even entire populations, than is feasible using surveys. The analysis of whole population EHRs, however, imposes challenges relating to the processing and management of data, including addressing missing data on informative variables [22]. For example, EHRs are unlikely to capture occasional alcohol consumption but would be expected to capture health outcomes relating to hazardous alcohol use. Existing literature on this topic has predominantly focused on preconceived study hypothesis [2], however this increases the chance of missing risk factors which have not already been identified. In contrast to this, a data-driven framework would avoid the limits of a pre-defined and hypothesis-bound investigation and significantly open up the exploration of the variable space. We anticipate that this will provide new insights and will ultimately help to develop a better understanding of the research problem under investigation. Hence, the current study does not focus on an explicit causal analysis, rather we aim to merge hypothesis-based knowledge with data-driven insights to investigate the risk factors associated with early alcohol use. In this study we assess the relationship between childhood health factors, household environment and alcohol-related outcomes during adolescence using a two-stage data-driven approach. These broad categories of risk factors were based on hypothesis-based knowledge as discussed above. This method brings together a hypothesis-based study design followed by a data-driven approach which complements and minimises the limitation of both study designs.

Methods

A two-stage data-driven approach has been undertaken to investigate the association between the specific risk factors and the outcome in this study. In stage one, a machine learning feature selection algorithm and a classifier were used to identify the health conditions and socio-demographic factors associated with early alcohol use from linked EHRs and Millennium Cohort Study (MCS) survey data. In stage two, analogous risk factors identified from stage one were then sought in routine data and an analytic approach was used to determine the prediction model. The linked routinely collected EHRs and vast volume of administrative data from the whole population of Wales was analysed to determine the effect of the risk factors identified in the MCS data analysis as predictors to target alcohol-related health outcomes in the general adolescent population.

Stage one – Millennium Cohort Study (MCS)

Participants

The MCS is a longitudinal birth cohort of children born in the UK between the years 2000 and 2002 [23]. Parents of the original 18,819 singleton children were interviewed from all parts of UK when their child was nine months old, of those 1,951 were interviewed in Wales. Subsequent interviews took place at ages three, five, seven and eleven years of age. Written consent to link MCS children with their routine EHRs up to age fourteen years was obtained from their parents at the interview undertaken when children were seven years of age. Data of the 1,838 consented singleton children resident in Wales was subsequently linked with their EHRs. The study population included children who also participated in the interview at age eleven years, as the primary outcome data were collected at that point. The current study excluded participants who did not have a general practitioner (GP) record in the Welsh Longitudinal General Practice (WLGP) dataset before they were eleven years of age (Supplementary Figure 1).

Exposure

The study included parent reported socio-demographic and family-related variables for children from MCS interviews which took place between the age of nine months and seven years of the children. These include child’s sex, mother’s socio-economic classification (SEC), household poverty level (whether the household income was above/below 60% of national median using a modified Organisation for Economic Co-operation and Development scale), living area (based on 2005 Rural/Urban Area Classification), mother’s alcohol use during and post pregnancy, lone parent carer, and number of children. Based on lone parent status, the total number of siblings at household and total number of household members, the study derived a binary variable to identify whether the child was residing with any other additional household members. Using both parents’ responses on alcohol consumption, guardian alcohol use variables were derived. Children’s emotional and behavioural difficulties were measured using the parent completed Strength and Difficulty Questionnaire (SDQ) [24]. Since most of these variables are time varying (and collected from MCS at ages nine months until age eleven years) aggregated summary variables were derived based on average values. These variables include SDQ, mother’s SEC, lone parent status, guardians’ alcohol use, living area, poverty indicator, additional household member and mother’s alcohol use after their child was born. The exposure variables from MCS have been described in Table 1.
Table 1: Socio-demographic characteristics of the MCS population (following imputation) and whole population sample with descriptive statistics
MCS Whole Population
Child Sexn%n%
Female52148.60Female28,77048.57
Male55151.40Male30,46151.43
Deprivation
Mother Socio economic classification (SEC)Overall deprivation
Always managerial or intermediate37735.17Low (WIMD quintile >=3)29,10249.13
Always semi-employed, self-employed, semi-routine or routine28026.12High (WIMD quintile <3)24,70141.70
Unknown41538.71Borderline (ever belong to high group but not always)5,4289.16
Poverty indicatorEmployment deprivation
Above poverty level53950.28Low (WIMD quintile >= 3)29,39449.63
Below poverty level27025.19High (WIMD quintile < 3)24,77441.83
Ever been below poverty level26324.53Borderline (ever belong to high group but not always)5,0638.55
Household alcohol use
Mother’s alcohol use during pregnancyMother’s alcohol-related health condition during pregnancy
Never75270.15No55,25193.28
Low (less than once a month or 1–2 times a month)21820.34Yes3,9806.72
High (more than 1–2 times a month)1029.51
Mother’s alcohol use after child was born
Never827.65
Low50046.64
High49045.71
Guardian alcohol useHousehold member identified with alcohol-related hospital admission
Low24723.04No57,79997.58
Moderate52448.88Yes1,4322.42
High23321.74
Variable686.34
Living area
Rural23822.2014,76024.92
Urban77972.6741,90770.75
Ever been urban555.132,5644.33
Maternal age at child’s birth
Less than 20 years1029.517,11112.01
20 to 24 years20218.849,26615.64
25 to 29 years30528.4517,38929.36
30 to 34 years32430.2217,00528.71
35 years and over13912.978,46014.28
Gestational age
Not term524.851,3172.22
Term1,02095.1557,91497.78
Household composition
Siblings at homeLiving with single adult
No sibling12912.03No33,66256.83
One sibling always or at some point49345.99Yes8,42514.22
More than one sibling ever45041.98Ever been17,14428.94
Lone parent
No75470.34
Yes13012.13
Ever been18817.54
Additional household member
No79273.88
Yes11811.01
Ever had16215.11
Mother’s health
Longstanding illnessMother’s any comorbidity
No58954.94No46,17077.95
Yes17015.86Yes13,06122.05
Varies31329.20Mother’s psychosis disorder
No58,92499.48
Yes3070.52
Mother’s common mental health condition
No28,60348.29
Yes30,62851.71
The health records of the children were also considered as the exposures for risk of early alcohol use. EHRs of the MCS children obtained from hospital admission record and primary care events within the Patient Episode Database for Wales (PEDW) and the WLGP dataset. A broad list of explanatory health codes was constructed using the three-digit ICD-10 codes and Read Code Version 2 recorded in PEDW and WLGP from birth until age ten (one year before the alcohol data were collected). Wales Electronic Cohort for Children (WECC) [25] containing further details on child health in Wales, were used to obtain age and maternal age at birth.

Outcome

Alcohol data for MCS children were obtained from a self-report questionnaire at age eleven (Supplementary Table 1). Based on the responses to the questionnaire the children were classified into two groups: those who had consumed alcohol (case) and those who had not (non-case). Those who did not answer or provided contradictory responses were removed from analyses (Supplementary Figure 1).

Statistical analysis

In the cohort exposure dataset, the participants with more than 10 missing variables (out of 13) were removed from analyses to ensure the accuracy of the data. An explanatory variable with less than 10% missing data had been imputed using a predictive mean matching (PMM) imputation method [26, 27]. To identify the health codes that were associated with early alcohol use from the large volume of linked EHRs spanning 10 years, a chi-square (X2) feature selection method was applied [28]. A critical threshold value X2 ≥ 2.706 (one degree of freedom, p ≤ 0.1) was applied and health codes with a X2 above this threshold were retained in subsequent analyses. A multivariate stepwise logistic regression with bidirectional (forward and backward) search was then performed for the exposure variables to obtain the best-fit model [29]. In stepwise model the variables with least significance were removed at each iteration step and the final model was selected based on the minimum Akaike Information Criterion (AIC) value. From the final model, only the statistically significant (p ≤ 0.05) variables were selected as significant predictors associated with the risk of early alcohol use leading to a further reduction in variable space. This is justified due to the following reasons. The variable selection process facilitates the choice of best model by incorporating the interdependence between the explanatory variables. The approach only considers the statistically significant variables for the stage two analysis which reduces the variable space and optimises the time to recreate analogous variables.

Stage two – whole population

All children born between 1 January 1998 and 31 December 2002 and were resident in Wales during the first fourteen years of their life were included in the whole population dataset. The study population was selected from the Welsh Demographic Service Dataset (WDSD), which is an administrative dataset of individuals living in Wales registered with a GP. The participants without continuous record in the WLGP from age six months to fourteen years were excluded to ensure a complete follow-up period. Analogous risk factors to those identified in the MCS analysis were created using the WDSD, WLGP and PEDW data. The study used an encrypted household identifier known as residential anonymised linking field (RALF) which enabled the participants to be linked with other household members and related records [30]. Each RALF is associated with the smallest geographical representation known as lower super output area (LSOA) which again is associated with a Welsh Index of Multiple Deprivation (WIMD) rank aggregated into a quintile or decile scale. Overall and employment WIMD scores were used as the measure of deprivation from routine data in the study. The main explanatory variables derived from routine data for the whole population analysis include child’s sex, employment deprivation and overall deprivation, living with single adult, mother’s alcohol-related condition during pregnancy, living with household member with alcohol-related condition, living area, maternal age, gestational age, and child mental and physical health. To be consistent with the MCS data, primary exposure data were collected for children up to age seven years. For time varying variables, the study used the same time points as MCS (birth to nine months, nine months to three years, three to five years, and five to seven years) and derived aggregated summary variables for the risk factors. Detailed descriptions of the variables are available in Supplementary Table 2. Alcohol-related health events across the whole population cohort were obtained from ICD-10 codes in PEDW (Supplementary Table 3) and Read codes in WLGP (Supplementary Table 4) between the age seven and fourteen years [31]. As the case (alcohol-related EHRs) to non-case (no alcohol-related EHRs) ratio was 1:99 in the whole population cohort and unbalanced, to improve the efficiency and the sensitivity of model performance case-control selection was undertaken by randomly selecting 20 non-cases for each sex matched case [32]. The dataset was randomly split into a training (70%) and test set (30%). Logistic regression was used to obtain the best-fit model on the training data. Model prediction on the test data provided a predictive probability of the expected outcome associated with each individual. Model prediction was quantified by performance accuracy, sensitivity, specificity, positive predictive value, and negative predictive value. MCS and routine EHRs were anonymously linked and accessed within the Secure Anonymised Information Linkage (SAIL) Databank. Linkage was completed using an encrypted person-based identifier known as the anonymised linkage field (ALF), generated by the Digital Health and Care Wales (DHCW) [33, 34]. Data preparation (extraction, cleaning, and linkage) was performed in Structured Query Language (SQL) on an IBM DB2 platform, with subsequent analyses performed in R v3.3.2 [35].

Results

Stage one – MCS

Among the consented singleton children 1,838 were assigned an ALF, with 82% of the children having a GP registration record in SAIL before age eleven years (Supplementary Figure 1). Individual and household characteristics (following imputation) are described in Table 1. 7.6% of the MCS children were considered as ‘case’ based on their response. Health codes (256 ICD-10 and Read codes) were obtained after merging the first ten years of EHRs from PEDW and WLGP. Feature selection method reduced this to 13 health features (Table 2).
Table 2: Health codes identified as risk factors for early alcohol use by chi-square feature selection method in the MCS cohort and the percent of sample with these codes present in whole population (WP) following selection
Health code Description of the code Type of code chi-square MCS (%) WP (%)
Read code H05%Upper respiratory infectionsDiagnosis.6062.5059.95
Read code K2%Male genital organ diseasesDiagnosis7.7712.418.46
Read code 919%Child health surveillance related administrative codeAdministrative6.0725.5630.70
Read code 64N%Child physical health examinationAdministrative4.6317.3515.56
Read code 656%Tetanus vaccinationAdministrative4.1128.2634.21
ICD-10 code Z%Factors influencing health status and contact with health servicesDiagnosis3.9027.9920.68
Read code 654%Diphtheria vaccinationAdministrative3.6927.71-
Read code 655%Pertussis vaccinationAdministrative3.3529.94-
Read code F%Nervous system and/or sense organ diseasesDiagnosis3.0470.24-
Read code F4%Disorders of eye and adnexaDiagnosis3.0046.27-
Read code K27%Disorders of penisDiagnosis2.999.42
Read code etc.%Trimethoprim, an antibiotic used mainly in the treatment of bladder infectionsMedication2.9316.70
Read code 4%Laboratory test and procedures (e.g. urine culture, blood test)Administrative2.8960.73

codes were not selected by the logistic regression models, hence were not selected for WP analysis

codes were not selected by the logistic regression models, hence were not selected for WP analysis After merging health and socio-demographic variables, 31 main explanatory variables (13 health codes and 18 socio-demographic variables) were available for the two-way logistic model. The final 19 features with significant p values were considered to be significantly associated with the risk profile of early alcohol use (Table 3).
Table 3: The explanatory variables associated with higher and lower risk of early alcohol use for the MCS children (Stage one analysis) with the adjusted Odds Ratio (OR) and 95% confidence interval (CI)
Feature Adjusted OR (95%CI)
Child’s sex
Female1
Male3.06 (2.35 to 3.99)***
Mother’s Socio-economic classification (SEC)
Always Managerial or intermediate1
Always semi-employed, self-employed, semi-routine or routine1.30 (0.93 to 1.81)
Unknown1.94 (1.37 to 2.74)***
Lone parent
Never lone parent1
Lone parent1.68 (1.07 to 2.65)*
Ever been1.77 (1.27 to 2.49)**
Mother alcohol use during pregnancy
Never1
Low (less than once a month, 1–2 times a month)2.48 (1.83 to 3.38)***
High5.38 (3.58 to 8.15)***
Mother alcohol use after child was born
Never1
Low1.15 (0.70–1.92)
High0.70 (0.04 to 1.24)
Guardian alcohol use
Low1
Moderate1.73 (1.22 to 2.25)**
High1.07 (0.70 to 1.64)
Variable0.91 (0.48 to 1.70)
Living area
Rural1
Urban1.61 (1.17 to 2.23)**
Ever been urban4.54 (2.69 to 7.75)***
Poverty indicator
Above poverty level1
Below poverty level0.93 (0.60 to 1.45)
Ever been below poverty level1.33 (0.95 to 1.86)
Maternal age at child’s birth
Less than 20 years1
20 to 24 years1.57 (0.97 to 2.58)
25 to 29 years3.28 (2.03 to 5.36)***
30 to 34 years2.68 (1.64 to 4.43)***
35 years or over0.65 (0.35 to 1.21)
Gestational age
Not term1
Term9.42 (4.22 to 23.03)***
Additional household member
No1
Yes0.69 (0.45 to 1.06)
Ever had0.57 (0.39 to 0.81)**
Hyperactivity
Always normal1
Any mention of higher level of hyperactivity1.84 (1.37 to 2.47)***
Conduct disorder
Always normal1
Any mention of higher level of CP2.10 (1.57 to 2.82)***
Emotional difficulty
Always normal1
Any mention of higher level of ED0.68 (0.48–0.97)*
Total Difficulty Score
Always normal1
Any mention of higher level of TDS0.45 (0.31 to 0.66)***
Mother longstanding illness
No1
Yes1.53 (1.09 to 2.16)*
Varies1.25 (0.96 to 1.65)
Other acute upper respiratory infections (Read code H05%)
No1
Yes0.43 (0.34–0.55)***
Male genital organ diseases (Read code K2%)
No
Yes2.77 (1.58–4.94)***
Child surveillance administration (Read code 919%)
No
Yes1.38 (1.06 to 1.81)*
Child exam (Read code 64N%)
No
Yes0.51 (0.35 to 0.75)**
Tetanus vaccination (Read code 656%)
No
Yes0.60 (0.45 to 0.79)***
General examination (ICD10 code Z%)
No
Yes0.73 (0.55 to 0.99)*
Disorders of penis (Read code K27%)
No
Yes0.63 (0.33 to 1.19)

Note: * p < 0.05, ** p < 0.01, *** p < 0.001.

Note: * p < 0.05, ** p < 0.01, *** p < 0.001. In Wales, 207,114 children were born in between 1 January 1998 and 31 December 2002, and their records were obtained from WDSD. After applying exclusion criteria there were 59,231 children as the study population (Supplementary Figure 2). Of the study population, 591 (0.99%) children had at least one alcohol-related event between seven and 14 years of age (Supplementary Figure 3) who were the cases from the whole population subset. After applying case control selection, the dataset had 591cases and 11,820 non-cases, which were further split into training and test set. There were 8,688 (417 cases and 8,271 non-cases) children in the training dataset. The variables identified as significantly associated with early alcohol use using MCS data were mapped into the whole population cohort (Supplementary Table 2). Table 1 presents descriptive statistics for this population. Mothers of 6.72% of the children had an alcohol-related event reported in PEDW or WLGP while pregnant. 2.42% children lived with a household member who had alcohol-related inpatient hospital admission. The adjusted odds ratio of the features with 95% confidence interval are presented in Table 4 (also see Supplementary Figure 4).
Table 4: The explanatory variables associated with higher and lower risk of early alcohol-related health outcomes for the whole population (Stage two analysis) with the adjusted Odds Ratio (OR) and 95% confidence interval (CI)
Feature Adjusted OR (95% CI)
Child’s Sex
Female1
Male1.09 (1.02 to 1.17)**
Overall deprivation:
Low1
High1.11 (0.98 to 1.25)
Borderline1.66 (1.41 to 1.95)***
Employment deprivation:
Low1
High0.84 (0.75 to 0.95)**
Borderline0.82 (0.69 to 0.97)*
Living with single adult:
No1
Yes1.45 (1.32 to 1.59)***
Ever been1.17 (1.08 to 1.26)***
Mother’s alcohol-related condition during pregnancy
No1
Yes0.88 (0.77 to 1.00)*
Household member with alcohol-related condition
No1
Yes2.16 (1.80 to 2.60)***
Living area
Rural1
Urban0.99 (0.92 to 1.08)
Ever in urban2.42 (2.08 to 2.81)***
Maternal age at birth
Less than 20 years1
20 to 24 years0.88 (0.79 to 0.99)*
25 to 29 years0.79 (0.71 to 0.87)***
30 to 34 years0.68 (0.61 to 0.76)***
35 years or over0.53 (0.46 to 0.60)***
Gestational age
Not-term1
Term1.11 (0.89 to 1.40)
Child – Attention deficit hyperactive disorder (ADHD)
No1
Yes3.57 (2.52 to 5.15)***
Child - Conduct disorder
No1
Yes3.26 (2.14 to 5.07)***
Child – Depression/Anxiety
No1
Yes0.75 (0.34 to 1.69)
Mother’s any comorbidity
No1
Yes1.25 (1.16 to 1.34)***
Mother’s common mental health condition
No1
Yes1.31 (1.23 to 1.40)***
Mother’s psychosis disorder
No1
Yes3.12 (2.04 to 4.90)***
Other acute upper respiratory infections (Read code H05%)
No1
Yes0.97 (0.91 to 1.04)
Male genital organ diseases (Read code K27%)
No1
Yes0.90 (0.79 to 1.02)
Child surveillance administration (Read code 919%)
No1
Yes0.80 (0.75 to 0.86)***
Tetanus vaccination (Read code 656%)
No1
Yes0.47 (0.44 to 0.51)***
Child exam (Read code 64N%)
No1
Yes0.59 (0.53 to 0.65)***
General examination (ICD10 code Z%)
No1
Yes0.84 (0.78 to 0.92)***

Note: * p < 0.05, ** p < 0.01, *** p < 0.001.

Note: * p < 0.05, ** p < 0.01, *** p < 0.001. The model was run on the test dataset. The accuracy of the model was 61.32% with a sensitivity of 58.05% and specificity of 68.48% (additional details are provided in Supplementary Tables 5, 6). Out of 174 cases, the model was able to predict 101 (58%) children who had an alcohol-related health event recorded in the healthcare system between ages seven and fourteen.

Discussion

This study has developed a two-stage data-driven framework that can create a profile of the characteristics of children who end up with an alcohol problem in adolescence. The study undertook data linkage between a longitudinal survey data (MCS) and routine EHRs in stage one to select the significant risk factors associated with early alcohol use. Stage two built the analogous risk factors using only the linked routine data and based this, a prediction model was developed. Hybridisation of these two powerful data sources (routine and survey) enabled us to create a data-driven risk profile. The risk factors were significantly associated across both MCS and whole population analyses, but effect estimates varied. Children whose health needs are supported are at lower risk of early alcohol use, evidenced by protective effect of receiving vaccinations, attending routine health examinations with their GP, and contact with health services recorded in primary and secondary care were consistent across MCS and whole population analyses. Similarly, children with health codes relating to acute upper respiratory infections may have more protective guardians willing to consult medical professionals for mild conditions. Together, this suggests that the avoidance of regular healthcare contact is an indicator that increases the risk of early alcohol use. However, the trends relating to the two codes, the child surveillance administration code and the chapter heading linked to male genitals, differed between the whole population and the MCS analysis. The code linked to male genitals showed an association with higher risk of alcohol use in MCS but was statistically inconclusive for the whole population analysis. The child surveillance administration code was associated with higher risk for the MCS cohort in contrast to the whole population which can be attributed to the differential support received by two cohorts which was not captured by the data and hence this requires further investigation. Also, the proportion of cases obtained from MCS data (stage one) were higher than those obtained from the whole population data (stage two). This can be attributed to the fact that cases from stage one were based on the self-reported alcohol consumption data whereas the stage two routine data highlighted the most severe cases caused by alcohol among the adolescents and recorded on the healthcare system. The overall risk profile obtained from MCS and whole population analyses were broadly consistent with each other and the research literature generally both in the UK and internationally. Similar risk factors include being male [13], ever living in an urban environment where there is a greater density of alcohol outlets [36], ever living in conditions of social deprivation, living in a household with higher level of alcohol use by household members [9]. Studies from USA highlighted that early onset of alcohol use was significantly associated with parental drinking pattern and living in a lone parent household [11], child’s attention deficit hyperactivity disorder (ADHD) and conduct disorder [6, 7]. The stage one MCS analysis in this study revealed that emotional difficulty and a higher level of behavioural difficulty (as assessed by parents) were associated with a reduced risk of alcohol use. However, diagnosis of clinically relevant behavioural/emotional problems was protective in the population model. Poor maternal mental health was linked with adverse outcomes, consistent with family-level risk factors that promote children’s alcohol use [12, 17]. A difference was observed in regards to the effect of maternal age at birth on the risk of a child’s early alcohol use. The protective effect of higher maternal age was observed for the whole population but the finding on MCS data differed and requires further investigation. Further, employment deprivation in the whole population analysis was associated with lower risk of a child’s early alcohol use after adjusting for overall deprivation. This finding is similar to the existing literature [15, 37], which found that early alcohol use is more common in higher income families. This suggests that reliance on employment indicators is not sufficient to understand the socio-economic factors influencing a child’s early alcohol use, the overall deprivation (also measured by education, health, access to the service, physical environment of living) plays an important role as well. The result of this study needs to be interpreted in conjunction with a number of limitations. Firstly, mapping the MCS survey to the routine data was challenging, not all MCS variables were available in the routine data. In some instances, multiple variables had to be merged to derive summary variables. This may result in a degree of uncertainty about the information captured in the summary variables. Secondly, it was necessary to aggregate some time-varying variables into a single point estimate and, as such, the analyses are unable to capture how the recency of some events might influence results. Thirdly, due to unavailability of continuous GP records of some participants between six months and fourteen years (if the participants changed their GP and the their registered GP was not contributing to SAIL), they were removed from the whole population analysis. Similarly, the follow-up of children was not possible where they who moved out of the study area (Wales, UK), or died under age fourteen, because of which their exposure (sociodemographic and health related data) and outcome (alcohol data) data were not available. This resulted in a large reduction of the number of children in the study population. However, this did not contribute to selection bias as this happened randomly and the losses had no direct relationship with alcohol-related outcome. Fourthly, the EHRs did not include Emergency Department (ED) attendance data (but does include admissions into hospital via the ED) as there are no uniformly applicable codes for alcohol-related attendances in ED, and even when available, these are sparsely populated [38]. Lastly, in this study the model performance, measured by sensitivity and specificity, was moderate. However, even if we had a sensitivity and specificity of 90% the maximum positive predictive value, we can get is 31%, given the low prevalence of alcohol-related medical contact, as the prevalence influences the positive and negative predictive value of a model performance [39]. Machine learning approaches generally aim to achieve the best predictive models from the available data. The low positive predictive value, obtained here, suggests that the variables needed to improve model performance are not available in the data (e.g., genetic information, peer alcohol-related data). Routine EHRs and administrative data are available to healthcare professionals and are used by policy makers and commissioners to determine how resources are best utilised to manage preventive interventions. However, the bulk of research considering early alcohol use and related outcomes has relied on self-report surveys. It has been shown that linking survey and routine data can offer new insights [40]. The results presented here are novel in that our approach generalised results from an established survey to a whole population analysis using predictive analytic techniques. This provides in-depth knowledge about the profile of the children susceptible to early alcohol use and can feasibly be used to inform population health strategies designed to reduce the prevalence of early alcohol use in children and related health outcomes.

Conclusions

The hybridisation of data of different nature, as carried out in this study, is a novel approach that combines the complementary advantages of EHRs with more personal insights from questionnaire-based cohort data. This provides a robust resource on which findings can be based and generalised to the wider population. The identified risk factors such as living with a single parent, alcohol problem in the household, social deprivation and children receiving poor support from the healthcare system indicate that involvement and support for the family is important in breaking cycles and improving children’s outcomes.
  29 in total

1.  How should we define health?

Authors:  Machteld Huber; J André Knottnerus; Lawrence Green; Henriëtte van der Horst; Alejandro R Jadad; Daan Kromhout; Brian Leonard; Kate Lorig; Maria Isabel Loureiro; Jos W M van der Meer; Paul Schnabel; Richard Smith; Chris van Weel; Henk Smid
Journal:  BMJ       Date:  2011-07-26

2.  Multiple imputation using chained equations: Issues and guidance for practice.

Authors:  Ian R White; Patrick Royston; Angela M Wood
Journal:  Stat Med       Date:  2010-11-30       Impact factor: 2.373

Review 3.  Magnitude of and trends in alcohol-related mortality and morbidity among U.S. college students ages 18-24, 1998-2005.

Authors:  Ralph W Hingson; Wenxing Zha; Elissa R Weitzman
Journal:  J Stud Alcohol Drugs Suppl       Date:  2009-07

4.  Adverse childhood experiences and the association with ever using alcohol and initiating alcohol use during adolescence.

Authors:  Shanta R Dube; Jacqueline W Miller; David W Brown; Wayne H Giles; Vincent J Felitti; Maxia Dong; Robert F Anda
Journal:  J Adolesc Health       Date:  2006-04       Impact factor: 5.012

5.  Strengths and difficulties questionnaire as a dimensional measure of child mental health.

Authors:  Anna Goodman; Robert Goodman
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2009-04       Impact factor: 8.829

6.  Impacts of adverse childhood experiences on health, mental health, and substance use in early adulthood: a cohort study of an urban, minority sample in the U.S.

Authors:  J P Mersky; J Topitzes; A J Reynolds
Journal:  Child Abuse Negl       Date:  2013-08-24

7.  Adolescent alcohol and tobacco use and early socioeconomic position: the ALSPAC birth cohort.

Authors:  Roberto Melotti; Jon Heron; Matthew Hickman; John Macleod; Ricardo Araya; Glyn Lewis
Journal:  Pediatrics       Date:  2011-03-14       Impact factor: 7.124

8.  Alcohol affordability: implications for alcohol price policies. A cross-sectional analysis in middle and older adults from UK Biobank.

Authors:  Simon C Moore; Bella Orpen; Jesse Smith; Chinmoy Sarkar; Chenlu Li; Jonathan Shepherd; Sarah Bauermeister
Journal:  J Public Health (Oxf)       Date:  2022-06-27       Impact factor: 5.058

9.  Electronic Longitudinal Alcohol Study in Communities (ELAStiC) Wales - protocol for platform development.

Authors:  L Trefan; A Akbari; S Paranjothy; D M Farewell; A Gartner; D Fone; J Greene; A Evans; A Smith; V Adekanmbi; J Kennedy; R A Lyons; S C Moore
Journal:  Int J Popul Data Sci       Date:  2019-05-20

10.  Risk of emergency hospital admission in children associated with mental disorders and alcohol misuse in the household: an electronic birth cohort study.

Authors:  Shantini Paranjothy; Annette Evans; Amrita Bandyopadhyay; David Fone; Behnaz Schofield; Ann John; Mark A Bellis; Ronan A Lyons; Daniel Farewell; Sara Jayne Long
Journal:  Lancet Public Health       Date:  2018-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.