Literature DB >> 33492009

Is Cohort Representativeness Passé? Poststratified Associations of Lifestyle Risk Factors with Mortality in the UK Biobank.

Emmanuel Stamatakis1, Katherine B Owen2, Leah Shepherd2, Bradley Drayton2, Mark Hamer3, Adrian E Bauman2.   

Abstract

BACKGROUND: The UK Biobank (UKB) has been used widely to examine associations between lifestyle risk factors and mortality outcomes. It is unknown whether the extremely low UKB response rate (5.5%) and lack of representativeness materially affects the magnitude and direction of effect estimates.
METHODS: We used poststratification to match the UKB sample to the target population in terms of sociodemographic characteristics and prevalence of lifestyle risk factors (physical inactivity, alcohol intake, smoking, and poor diet). We compared unweighted and poststratified associations between each lifestyle risk factor and a lifestyle index score with all-cause, cardiovascular disease (CVD), and cancer mortality. We also calculated the unweighted to poststratified ratio of HR (RHR) and 95% confidence interval as a marker of effect-size difference.
RESULTS: Of 371,974 UKB participants with no missing data, 302,009 had no history of CVD or cancer, corresponding to 3,298,958 person years of follow-up. Protective associations between alcohol use and CVD mortality observed in the unweighted UKB were substantially altered after poststratification, for example, from a hazard ratio (HR) of 0.63 (0.45-0.87) unweighted to 0.99 (0.65-1.50) poststratified for drinking ≥5 times/week versus never drinking. The magnitude of the poststratified all-cause mortality hazard ratio comparing least healthy with healthiest tertile of lifestyle risk factor index was 9% higher (95% confidence interval: 4%, 14%) than the unweighted estimates.
CONCLUSIONS: Lack of representativeness may distort the associations of alcohol with CVD mortality, and may underestimate health hazards among those with cumulatively the least healthy lifestyles.
Copyright © 2021 The Author(s). Published by Wolters Kluwer Health, Inc.

Entities:  

Mesh:

Year:  2021        PMID: 33492009      PMCID: PMC7850587          DOI: 10.1097/EDE.0000000000001316

Source DB:  PubMed          Journal:  Epidemiology        ISSN: 1044-3983            Impact factor:   4.860


Lifestyle risk factors such as physical inactivity, poor diet, and smoking have established associations with chronic disease,[1] premature mortality,[2] and health-related quality of life. Because of the chronic nature of such behavioral exposures and the near absence of long-term randomized controlled trials due to ethical or feasibility hurdles, much of our current knowledge on how they affect health comes from observational studies. For example, almost the entirety of the evidence used to develop guidelines on alcohol drinking[3] and physical activity[4] comes from observational cohort studies with mortality outcomes. Such guidelines are often translated into clinical practice and policy, and are used in clinical trials that involve lifestyle modification.[5] With rare exceptions, the samples of such observational studies are unrepresentative of the general population.[6] Unrepresentativeness is often rooted in the very low response rates these studies achieve, such as the Australian 45 and Up Study[7] (18% response rate) and the UK Biobank (UKB)[8] (5.5% response rate). The markedly low response rate of the influential UKB resource (>1000 peer-reviewed publications), in particular, has ignited debate on how (lack of) sample representativeness in observational cohorts affects the magnitude, direction, and generalizability of the associations between behavioral exposures and disease or mortality outcomes. Compared with the general UK population, UKB participants were considerably more likely to be older[6] and less likely to be physically inactive[8] or obese, smoke, or drink alcohol on a daily basis.[6] They had fewer chronic conditions, and markedly lower mortality and cancer incidence rates.[6] The UKB investigators previously stated that “UK Biobank is not representative of the general population on a variety of sociodemographic, physical, lifestyle and health-related characteristics, ….. As a result, UK Biobank is not a suitable resource for deriving generalizable disease prevalence and incidence rates,”[9] but have also informally recalled a previous statement[10] claiming that valid measures of association of lifestyle exposures with disease and mortality outcomes can be safely generalized as they do “not require participants to be representative of the population at large.” As both statements have been withdrawn from the UKB website, there is currently no UKB guidance on the role of representativeness and low response rates on interpreting environment (including lifestyle) – disease associations, a topic which has attracted substantial theoretical discussions before[11] and after the launch of the UKB data resource,[10,12] but surprisingly little empirical assessment.[12-15] A recent simulation by Keyes and Westreich[10] in the Lancet illustrated how the “healthy volunteer effect” present in the UKB[6] may grossly distort relative risk estimates of environmental and lifestyle exposures with chronic disease. Among the very few attempts – to our knowledge – to empirically evaluate the role of unrepresentativeness, a recent study compared lifestyle risk factor and cardiovascular disease (CVD) mortality estimates in the UKB and a pooled series of health surveillance cohorts from 1993 to 2008 in England and Scotland that had high response rates (69% on average).[14] Results were inconsistent as, despite the authors’ conclusions that estimates were comparable, the magnitude of the age and sex adjusted estimates of CVD mortality risk for physical inactivity (physically inactive vs. the rest) and alcohol drinking (noncurrent drinker vs. the rest) were markedly larger in the UKB than the pooled cohorts, for example, the HR for physical inactivity was 3.40 (95% confidence interval [CI]: 3.04, 3.80) versus 2.33 (95% CI: 2.02, 2.68). These results offer very limited insights on the influence of poor cohort representativeness on the associations between lifestyle risk factor and mortality risk. Among other reasons, pooling representative datasets from two different countries (e.g., England and Scotland) results in a dataset that is representative of neither country. Age and sex adjusted estimates are rarely (if ever) used in policy and guideline development; multivariable adjusted models are necessary to reduce or eliminate confounding by socioeconomic and other lifestyle risk factors. Cohort unrepresentativeness may also be affected by analytic decisions to exclude study participants with prevalent disease at baseline to minimize the possibility of reverse causation (i.e., spurious associations between abstinence from alcohol or low physical activity levels and mortality risk due to existing illness). When the target population is well defined, weighting methods can be used to restore the results in a nonrepresentative study sample to reflect the target population.[16] To the best of our knowledge, no study has calculated the multivariable adjusted associations of lifestyle risk factors with mortality in the UKB study (or any other large cohort), after restoring the cohort’s socioeconomic, demographic, and health behavior profiles to closely match the target population. The aim of this study was to examine how sample representativeness affects the multivariable adjusted associations of lifestyle risk factors with all-cause and cause-specific (CVD and cancer) mortality.

METHODS

UK Biobank

This research has been conducted using the UKB Resource under Application Number 25813.[8] The UKB is a prospective cohort study including 502,600 participants 40–69 years of age who were recruited in 22 centers across the United Kingdom between 2006 and 2010. This sample was drawn from over 9 million people initially approached (response rate 5.45%) who were registered with the UK NHS, were between 40 and 69 years of age, and lived within 40 km (25 miles) from an assessment center in England, Wales, and Scotland. Full details of the study methods have been published elsewhere.[17] All participants consented to the use of their de-identified data, including access to their health-related records, for research.[17] The UKB has been approved by the National Research Ethics Service (Ref 11/NW/0382).

The Health Survey for England

The Health Survey for England 2008 (HSE)[18] served as the poststratification reference for lifestyle risk factors’ prevalence. We used HSE and the already available corresponding nonresponse weights to estimate the total number of people with combinations of lifestyle risk factors for adults aged 40-69 years in the general UK population. We chose 2008 as the approximate mid-point year of the UKB baseline data collection (2006–2010). HSE is a household-based population surveillance study in which a multistage, stratified probability design was used to select households representative of the target populations of England.[19,20] The overall response rate in HSE 2008 was 64%.[21] To obtain a truly representative dataset of the target population in terms of key characteristics (age, sex, household type, geographical region, and social class), the survey team developed nonresponse weights using methods that are described in detail elsewhere.[21] In brief, the HSE team fitted a logistic regression model for all adults in participating households, excluding single-adult households. They calculated adult nonresponse weights as the inverse of the predicted probabilities of response estimated from the regression model, and trimmed the nonresponse weights for adults at the 1% tails to remove extreme values.[21] The HSE contained records on 7721 participants 40–69 years of age in 2008. We excluded HSE participants who were missing any poststratification variables (smoking n = 21, highest qualification achieved n = 22, body mass index [BMI] n = 1,048, total excluded n = 1,055), leaving the total of 6666 participants to be included in the calculation of poststratification weights.

Outcomes (UKB and HSE)

Date of death was obtained through linkage with national datasets from the National Health Service (NHS) Information Centre (England and Wales) and the NHS Central Register Scotland (Scotland). Participants were followed until April 2020. Primary cause of death was recorded using the International Classification of Diseases 10th revision (ICD10). CVD deaths included codes I01.0–I199. Cancer deaths included codes C00.0–C97.

Lifestyle Risk Factors

The choice and categorization of UKB lifestyle risk factors was determined by the availability of comparable information in HSE 2008[18] data. Alcohol consumption was categorized using the number of days alcohol was consumed per week[3]: (1) never drinker; (2) previous drinkers; (3) current, drinking less than 5 times/week); 4) current, drinking ≥5 times/week). Physical activity was assessed using the short-form International Physical Activity Questionnaire (IPAQ).[22] IPAQ assesses physical activity across leisure time, domestic activities, occupational activity, and transport-related activity.[23] Physical activity was quantified using the metabolic equivalent task (MET) hours of physical activity/week, calculated by multiplying the MET value of activity by the number of hours/week. We then classified participants’ physical activity as low (no physical activity), medium (>0, <7.5 MET hours/week), high (roughly equivalent to current public health PA guidelines; ≥7.5 MET hours/week).[8,24] Smoking was grouped as: never smoker, ex-smoker, and current smoker. To classify diet, we calculated average daily fruit and vegetable consumption as the sum of servings of cooked vegetables (1 serving = 2 tablespoons), salad and raw vegetables (1 serving = 2 tablespoons), fresh fruit and dried fruit consumed (1 serving = 1 piece) per day.

Composite Lifestyle Index

Several recent publications from the UKB[25-27] use composite lifestyle risk factor scores instead of a single lifestyle health behavior exposure. To examine the influence of sample representativeness on the associations of overall lifestyle and mortality, we categorized the three lifestyle risk factors (physical inactivity, fruit and vegetable consumption, and smoking) into three groups each as described above. We then applied the following scoring: least healthy = score of 2, medium = 1, most healthy = 0) to derive a composite variable (“lifestyle index”) with eight groups. Alcohol was scored as never drinker = 0; previous drinker = 1; current (<5 time and >5 times combined) = 2. We then grouped the resultant score into tertiles: 5 – 8 (least healthy lifestyle), 4 (middle tertile), 0–3 (healthiest lifestyle),

Poststratification

We used poststratification to weight underrepresented and overrepresented groups in the UKB to that of the HSE data, which is generalizable to the English population.[28] We chose a source from the largest UK constituent country (England, 84% of total UK population) because there are no nationally representative UK-wide data on lifestyle risk factors. The HSE data were divided into mutually exclusive groups, according to the six-way cross tabulation of age group (40–49, 50–59, and 60–70 years); sex: (male and female); highest qualification (college or university degree, high school diploma, other/none), smoking (ever, never smoker), physical activity (≥7.5 MET hours/week, <7.5 MET hours/week), and BMI (not overweight, overweight, or obese). We then calculated the sampling weights for each cell such that the weighted totals from the UKB sum to the totals in the UK population. We also considered alcohol and fruit and vegetable consumption for inclusion; however, it was not possible due to lower cell counts. The variable groupings listed earlier were selected to preserve adequate unweighted frequencies in the mutually exclusive cells in both the UKB and the HSE.

Statistical Analyses

We present unweighted and poststratified UKB estimates and compared the latter with actual Census 2011 UK (age and sex) and HSE (lifestyle risk factors) data. Consistent with current practice, we excluded those with a history of CVD and cancer at baseline from the main multivariable analyses. We used Cox proportional hazard regression models to estimate hazard ratios (HRs) for the association between lifestyle risk factors/unhealthy index and all-cause mortality both before and after poststratification. We measured survival using age as the time scale, from age of assessment to age at death or censoring date.[29] We mutually adjusted models adjusted for lifestyle risk factors and additionally adjusted for age, gender, and highest qualification. We used the ratio of the HRs (RHRs), calculated as the poststratified HR divided by the unweighted HR) () to quantify the relative change in estimates following poststratification. Percentile CIs for the RHR were estimated using bootstrapping with 1000 iterations.[30] We ran a set of sensitivity analyses to establish the robustness of the results. We repeated the analysis including those with history of major CVD (coronary heart disease or stroke) or cancer and adjusting for history of CVD and history of cancer. We did an additional series of sensitivity analyses to address the high proportion of missing data for poststratification variables. These data were assumed to be missing at random, where the probability of missing variables depends on the observed values of other variables, rather than the missing values.[31] We imputed missing data using the multiple imputation by chained equations approach to create five datasets.[32] We used logistic regression for binary variables and polytomous regression for categorical variables and included all covariates in the imputation models. Cox proportional hazard regression models were performed on all five datasets and estimates were combined using Rubin’s rules.[31] All analyses were performed using R software version 3.6.1 (R Foundation for Statistical Computing, Vienna, Austria).[33] All analytic code can be found in eAppendix 1; http://links.lww.com/EDE/B762.

RESULTS

Sample

From the full UKB sample of 502,600, we excluded participants who were missing any poststratification variables (missing: smoking n = 2,951, physical activity n = 121,167, highest qualification achieved n = 10,141, BMI n = 10,138, total excluded = 130,626), leaving a total of 371,974 participants to be considered, corresponding to 3,298,958 person years of follow-up. For the main analyses, we excluded participants with a history of major cardiovascular events (n = 56,345) or who had been diagnosed with cancer (n = 18,782) before baseline, leaving a total of n = 302,009 individuals with no CVD or cancer at baseline (n = 5162 participants had history of both CVD and cancer). Of the 302,009 individuals in the main analyses, there were 11,875 all-cause deaths (5.82%), of which we classified 3,580 as CVD and 7,217 as cancer deaths.

Fit of the Poststratified Dataset to the Target Population

eTable 1; http://links.lww.com/EDE/B763 presents the unweighted and poststratified distribution of key characteristics in the UKB Study. We noted considerable corrections in the distribution of the poststratified sample for age (e.g., percentage of participants in the 40- to 49-years age group increased from 24.9% in the unweighted to 38.0% in the poststratified), educational qualification (e.g., from 47.9 to 32.4% college/university educated), and physical activity (from 87.2% to 69.2% meeting the recommendations). Table 1 shows the key characteristics of the UKB Study after excluding those with a history of CVD or cancer (n after exclusion = 302,009), and eTable 2; http://links.lww.com/EDE/B763 shows the characteristics of the sample imputed exposures and covariates (n = 400,793). eTable 3; http://links.lww.com/EDE/B763 compares the distributions of the poststratified UKB and the HSE 2008 samples key characteristics. With the exception of alcohol intake, where some relatively modest differences were present, and fruit and vegetable consumption, the poststratification in the UKB achieved identical distributions across sociodemographic and lifestyle risk factors. For both men and women, poststratification normalized the age distribution toward to the actual UK population, especially in the groups of 40–44, 45–49, 60–64, and 65–69 years where the UKB sample was markedly unrepresentative (eFigure 1; http://links.lww.com/EDE/B763). We also calculated mortality rates per 1000 person–years of follow-up for the unweighted and poststratified UKB samples. These are compared to the UK annual mid-year mortality rates over the period of the UKB follow-up in eTable 4; http://links.lww.com/EDE/B763. Poststratified mortality rates were consistently higher than the unweighted rates, and closer to actual UK mortality rates.
TABLE 1.

Frequencies of Participant Demographic and Health-related Variables in the Health Survey for England and UK Biobank, Excluding People in the UK Biobank with History of Cancer or Cardiovascular Disease (n = 302,009 After Exclusion)

VariableUK Biobank N Unweighted nUK Biobank % UnweightedUK Biobank % Poststratified
Age group (years)40–4984,1582842
50–59105,3113532
60–70112,5403726
SexFemale159,8275351
Male142,1824749
Education qualificationCollege or university degree147,7864934
Highschool diploma118,0803941
Other/None36,1431225
Physical activity≥7.5 MET hours/week264,6018871
>0, <7.5 MET hours/week33,7031126
No physical activity3,70513
Fruit and vegetable consumptionAt least 10 portions/day25,65587
5–9 portions/day133,0124439
under 5 portions/day140,3964652
Unknown2,94611
Alcohol use frequencyNever10,68744
Previous8,89533
Current: < almost daily216,4057273
Current: ≥ almost daily65,8912219
Unknown131<1<1
Smoking statusNever170,4175652
Previous101,2273434
Current30,3651114
Lifestyle indexLowest tertile 1 most healthy100,8343326
Middle tertile111,4733732
Highest tertile: least healthy86,6372940
BMI category (kg/m2)Underweight (<18.5)1,450<1<1
Normal (18.5–24.9)104,9453530
Overweight (25.0–29.9)130,8884344
Obese I (30.0–34.9)48,0661618
Obese II (35.0–39.9)12,35545
Obese III ≥ 40.04,30512

BMI indicates body mass index; MET, metabolic equivalent task.

Frequencies of Participant Demographic and Health-related Variables in the Health Survey for England and UK Biobank, Excluding People in the UK Biobank with History of Cancer or Cardiovascular Disease (n = 302,009 After Exclusion) BMI indicates body mass index; MET, metabolic equivalent task.

Individual Lifestyle Risk Factors

All HRs presented in tables, figures, or text are multivariable adjusted. Figures 1–3 present the unweighted and poststratified HRs for each lifestyle risk factor against all-cause, CVD, and cancer mortality. Table 2 shows the unweighted to poststratified RHR and corresponding 95% CIs for each lifestyle risk factor. With the exception of the CVD mortality alcohol use estimates and all-cause mortality smoking estimates, the unweighted and poststratified estimates for the remaining lifestyle risk factors were similar across the three mortality outcomes. Specifically, the protective associations between increased alcohol use and CVD mortality in the unweighted dataset were not present following poststratification, for example, from an HR of 0.63 (0.45–0.87) unweighted to 0.99 (0.65–1.50) poststratified for drinking ≥5 times/week versus never drinking. These alcohol findings were also reflected by the RHR (poststratified to unweighted): 1.52, 95% CI: 1.23, 1.86 for drinking <5 times/week; and 1.55, 1.23–1.86 for drinking ≥5 times/week) (Figure 2; Table 2). The poststratified all-cause mortality risk for current smokers was higher than the unweighted estimates (ratio of HRs 1.13, 95% CI: 1.06, 1.20) (Figure 2; Table 2). The pattern of the cancer mortality estimates comparisons was broadly similar to all-cause mortality with less evidence for differences between unweighted and poststratified estimates (Figure 3; Table 2). Including participants who had a history of major CVD or cancer at baseline affected minimally the poststratified versus unweighted comparisons of lifestyle risk factor estimates across all three mortality outcomes (eFigure 2; http://links.lww.com/EDE/B763 shows the CVD mortality results as an example).
FIGURE 1.

Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for all-cause mortality, excluding people with history of cancer or cardiovascular disease (n = 302,009 after exclusions). MET indicates metabolic equivalents.

FIGURE 3.

Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for cancer mortality, excluding people with history of cancer or CVD (n = 302,009 after exclusions). CVD indicates cardiovascular disease; MET, metabolic equivalents.

TABLE 2.

Adjusted RHR of Each Lifestyle Risk Factor for All-cause, CVD, and Cancer Mortality, Excluding People with History of Cancer or CVD (n = 302,009 After Exclusion)

RHRs (Poststratified/Unweighted)
VariableLevelAll-cause MortalityCVD MortalityCancer Mortality
Physical activity level≥7.5 MET hours/weekReferenceReferenceReference
>0, <7.5 MET hours/week1.04 (0.99, 1.08)0.92 (0.82, 1.03)1.06 (0.99, 1.19)
No PA1.01 (0.90, 1.13)0.83 (0.57, 1.13)1.21 (0.98, 1.45)
Fruit and vegetable consumptionAt least 10 portions/dayReferenceReferenceReference
5–9 portions/day0.97 (0.89, 1.07)0.93 (0.75, 1.19)1.02 (0.86, 1.20)
Under 5 portions/day0.95 (0.87, 1.04)0.96 (0.77, 1.22)1.00 (0.84, 1.18)
Alcohol use frequencyNeverReferenceReferenceReference
Previous1.15 (0.98, 1.35)1.63 (1.07, 2.34)1.00 (0.73, 1.36)
Current: < almost daily1.04 (0.92, 1.18)1.52 (1.23, 1.86)1.00 (0.79, 1.29)
Current: ≥ almost daily1.08 (0.95, 1.24)1.55 (1.22, 1.99)1.02 (0.79, 1.33)
Smoking statusNeverReferenceReferenceReference
Previous1.03 (0.98, 1.08)0.99 (0.87, 1.14)1.06 (0.98, 1.16)
Current1.13 (1.06, 1.20)0.98 (0.82, 1.14)1.06 (0.94, 1.20)

aTo be equivalent to Health Survey for England weighed estimates.

CVD indicates cardiovascular disease; MET, metabolic equivalent task; RHR, ratio of hazard ratio.

FIGURE 2.

Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for all cardiovascular disease (CVD) mortality, excluding people with history of cancer or CVD (n = 302,009 after exclusion). MET indicates metabolic equivalents.

Adjusted RHR of Each Lifestyle Risk Factor for All-cause, CVD, and Cancer Mortality, Excluding People with History of Cancer or CVD (n = 302,009 After Exclusion) aTo be equivalent to Health Survey for England weighed estimates. CVD indicates cardiovascular disease; MET, metabolic equivalent task; RHR, ratio of hazard ratio. Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for all-cause mortality, excluding people with history of cancer or cardiovascular disease (n = 302,009 after exclusions). MET indicates metabolic equivalents. Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for all cardiovascular disease (CVD) mortality, excluding people with history of cancer or CVD (n = 302,009 after exclusion). MET indicates metabolic equivalents. Adjusted (model mutually adjusted for all lifestyle risk factors, and additionally adjusted for age, sex, and highest educational qualification) hazard ratio of each lifestyle risk factor for cancer mortality, excluding people with history of cancer or CVD (n = 302,009 after exclusions). CVD indicates cardiovascular disease; MET, metabolic equivalents.

Lifestyle Index

No consistent differences existed between the unweighted and poststratified all-cause mortality HRs in the lower and middle range values of the lifestyle index scores (eFigure 3; http://links.lww.com/EDE/B763). Figure 4A corroborates this pattern as the unweighted and poststratified estimates for all-cause mortality were very similar in the middle lifestyle index tertile but the magnitude of the poststratified estimates was higher in the least healthy lifestyle index tertile (ratio of HRs in the top lifestyle risk factor index tertile: 1.09, 95% CI: 1.04, 1.14, Table 3). The unweighted and poststratified estimates for CVD and cancer mortality were similar across tertiles of the lifestyle index (Figure 4B and C; Table 3. None of these findings were affected materially by the inclusion of participants with history of CVD or cancer (see eFigure 4; http://links.lww.com/EDE/B763, as an example).
FIGURE 4.

Adjusted (adjusted for age, sex, and highest educational qualification) hazard ratio of lifestyle index tertiles for all-cause mortality (A)/CVD mortality (B)/cancer mortality (C), excluding people with history of cancer or CVD (n = 302,009 after exclusions). CVD indicates cardiovascular disease.

TABLE 3.

Adjusted RHRs of Lifestyle Index for All-cause, CVD, and Cancer Mortality, Excluding People with History of Cancer or CVD (n = 302,009)

RHRs (Poststratified/Unweighted)
VariableLevelAll-cause MortalityCVD MortalityCancer Mortality
Lifestyle indexLowest tertile: most healthyReferenceReferenceReference
Middle tertile1.02 (0.98, 1.07)0.91 (0.78, 1.05)1.00 (0.90, 1.09)
Highest tertile: least healthy1.09 (1.04, 1.14)0.96 (0.81, 1.13)1.08 (0.97, 1.19)
Missing1.08 (0.90, 1.29)0.81 (0.52, 1.29)1.05 (0.67, 1.54)

aTo be equivalent to Health Survey for England weighed estimates.

CVD indicates cardiovascular disease; RHR, ratio of hazard ratio.

Adjusted RHRs of Lifestyle Index for All-cause, CVD, and Cancer Mortality, Excluding People with History of Cancer or CVD (n = 302,009) aTo be equivalent to Health Survey for England weighed estimates. CVD indicates cardiovascular disease; RHR, ratio of hazard ratio. Adjusted (adjusted for age, sex, and highest educational qualification) hazard ratio of lifestyle index tertiles for all-cause mortality (A)/CVD mortality (B)/cancer mortality (C), excluding people with history of cancer or CVD (n = 302,009 after exclusions). CVD indicates cardiovascular disease.

Imputation

eFigures 5–7 (http://links.lww.com/EDE/B763) present the unweighted and poststratified HRs for each lifestyle risk factor against all-cause, CVD, and cancer mortality in the imputed dataset (n = 400,793). There were no appreciable differences with the unimputed data presented in Figures 1–3.

DISCUSSION

We used poststratification based on a nationally representative sample to restore the UKB’s socioeconomic and behavioral profiles to the target population; and we compared the associations of lifestyle risk factors and mortality in the original and poststratified UKB samples. We found that the associations of physical activity, smoking, and diet with all-cause, CVD, and cancer mortality were broadly similar in the two sets of analyses. Poststratification eliminated the protective associations between alcohol use and CVD mortality we observed in the original UKB analyses. We also found that the all-cause mortality risk of current smokers and those with cumulatively the least healthy lifestyles may be underestimated due to poor sample representativeness. Nevertheless, the absolute difference in these estimates was 13% (smoking) and 9% (least healthy lifestyle index score) respectively, and the practical importance of such risk underestimation is likely to be small. Our results suggest that the protective associations of alcohol intake with CVD outcomes observed in previous studies[8,34] may be spurious. Although we only used weekly frequency of alcohol intake in the current analyses, recent UKB results[8] suggested that alcohol volume also shows protective associations, for example, drinking within guidelines was associated with lower risk for CVD mortality compared to never drinkers (HR: 0.73, 95% CI: 0.56, 0.95) while drinking even more than double the recommended amounts was not associated with elevated CVD risk (HR: 0.86, 95% CI: 0.63, 1.17). The link of low response rates to spurious cardioprotective effect estimates of alcohol intake[34] is further supported by the absence of such findings in studies involving nationally representative cohorts. For example, an analysis of eight pooled British (England and Scotland) cohorts with high response rates (68%–77%) found no association between moderate drinking volume and CVD mortality.[24] It is not possible to directly compare our alcohol use results to the recent study with similar aims to ours that compared the UKB to the pooled 1993–2008 dataset from England and Scotland.[14] Being noncurrent drinker (vs. the rest) showed a 22% (1%–48%) higher risk in the latter dataset compared to the Biobank. However, the “noncurrent drinker” group pooled lifetime abstainers and exdrinkers who might have quit due to health reasons and as such it offers little information on the risks of alcohol drinking. In the same study, physical inactivity (which was not clearly defined in the article) estimates were 46% (22%–75%) higher in the UKB than the pooled 1993–2008 cohorts. This contrasts our results where the poststratified and unweighted estimates for physical inactivity were closely aligned across all three mortality outcomes. Compared to the sparse previous literature,[14] our study has a number of notable strengths. We calculated multivariable adjusted estimates that are usually used in policy and guideline development. We assessed differences between unweighted and poststratified estimates in the UKB using data handling methods commonly employed in the field of lifestyle risk factors, such as exclusion of participants with history of major chronic disease at baseline. Our HSE 2008 reference dataset was temporally consistent to the UKB baseline (2006–2010); and was weighed for nonresponse to give a reference that is truly representative of the population of adults living in the largest constituent country of the United Kingdom. Another unique strength of our study is that poststratification allowed us to correct the UKB’s distribution not only for socioeconomic and demographic variables, but also for health behavior profiles. This is an innovative and more policy relevant method than previous approaches. Our study is relevant to both individual lifestyle risk factors and composite lifestyle risk factors indices that are used increasingly in large-scale observational research.[25-27] Our study has some limitations. In the absence of a nationally representative UK-wide data resource we used an English reference. This is unlikely to have had a major impact on our results as England corresponds to 84% of the total UK population, although we acknowledge that there are some differences in lifestyle risk factor prevalence between UK contries.[35] As in previous UKB physical activity publications,[36] we were not able to make use of the entire dataset due to missing data on lifestyle risk factors and covariates. However, our sensitivity analyses where we imputed all such data show that our comparisons, conclusions, and underlying study principle were unlikely to be affected by missing data. We cannot eliminate entirely the possibility that the chosen categories of lifestyle risk factors, necessitated by the requirements of poststratification weights development, are not sufficient to correct for complex selection biases. Equally, the variables we used in the development of the poststratification weight may not have captured differences between HSE and UKB participants in terms of unmeasured factors such as genetic and psychosocial characteristics. Our approach assumes that measurement errors of lifestyle risk factors in HSE and the UKB are comparable between studies. We did not seek to employ best-practice casual inference modeling strategies, such as providing a sound structural framework to identify confounders and mediators based on directed acyclic graphs, or taking into account competing risks in the cause-specific mortality analyses. These omissions were intentional because our primary goal was to replicate common practices in the published UKB (and analogous) literature of lifestyle risk factors and mortality. For example, such literature[25,37-39] does not typically take into account competing risks, while the use of a predefined structural framework for choice of confounders and evaluation of selection biases is still the exception rather than the rule. Accordingly, we acknowledge that both the unweighted and poststratified estimates we reported may be biased and/or imprecise. In conclusion, lack of cohort representativeness in the UKB may lead to spurious cardioprotective effects estimates for alcohol intake and may underestimate health hazards among those with the least healthy lifestyles. Although physical inactivity, smoking, and dietary estimates appeared to be minimally affected, our findings suggest that the extremely low response rates in cohort studies may distort policy relevant research findings on the health effects of specific exposures. Our results suggest that future UKB (and analogous cohort) users should exercise caution when examining associations between established risk factors and mortality outcomes as poor cohort sample representativeness might influence materially some estimates. Further studies empirically evaluating the influence of unrepresentativeness across other categories of risk factors estimates are warranted.
  29 in total

1.  International physical activity questionnaire: 12-country reliability and validity.

Authors:  Cora L Craig; Alison L Marshall; Michael Sjöström; Adrian E Bauman; Michael L Booth; Barbara E Ainsworth; Michael Pratt; Ulf Ekelund; Agneta Yngve; James F Sallis; Pekka Oja
Journal:  Med Sci Sports Exerc       Date:  2003-08       Impact factor: 5.411

2.  Multiple imputation by chained equations: what is it and how does it work?

Authors:  Melissa J Azur; Elizabeth A Stuart; Constantine Frangakis; Philip J Leaf
Journal:  Int J Methods Psychiatr Res       Date:  2011-03       Impact factor: 4.035

3.  Why representativeness should be avoided.

Authors:  Kenneth J Rothman; John E J Gallacher; Elizabeth E Hatch
Journal:  Int J Epidemiol       Date:  2013-08       Impact factor: 7.196

4.  Transportability of Trial Results Using Inverse Odds of Sampling Weights.

Authors:  Daniel Westreich; Jessie K Edwards; Catherine R Lesko; Elizabeth Stuart; Stephen R Cole
Journal:  Am J Epidemiol       Date:  2017-10-15       Impact factor: 4.897

5.  Alcohol Consumption and Mortality From Coronary Heart Disease: An Updated Meta-Analysis of Cohort Studies.

Authors:  Jinhui Zhao; Tim Stockwell; Audra Roemer; Timothy Naimi; Tanya Chikritzhs
Journal:  J Stud Alcohol Drugs       Date:  2017-05       Impact factor: 2.582

6.  Does a physically active lifestyle attenuate the association between alcohol consumption and mortality risk? Findings from the UK biobank.

Authors:  Lauren Powell; Yingyu Feng; Mitch J Duncan; Mark Hamer; Emmanuel Stamatakis
Journal:  Prev Med       Date:  2019-11-12       Impact factor: 4.018

7.  Cohort profile: the health survey for England.

Authors:  Jennifer Mindell; Jane P Biddulph; Vasant Hirani; Emanuel Stamatakis; Rachel Craig; Susan Nunn; Nicola Shelton
Journal:  Int J Epidemiol       Date:  2012-01-09       Impact factor: 7.196

8.  UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age.

Authors:  Cathie Sudlow; John Gallacher; Naomi Allen; Valerie Beral; Paul Burton; John Danesh; Paul Downey; Paul Elliott; Jane Green; Martin Landray; Bette Liu; Paul Matthews; Giok Ong; Jill Pell; Alan Silman; Alan Young; Tim Sprosen; Tim Peakman; Rory Collins
Journal:  PLoS Med       Date:  2015-03-31       Impact factor: 11.069

9.  The impact of physical activity and an additional behavioural risk factor on cardiovascular disease, cancer and all-cause mortality: a systematic review.

Authors:  Jason Lacombe; Miranda E G Armstrong; F Lucy Wright; Charlie Foster
Journal:  BMC Public Health       Date:  2019-07-08       Impact factor: 3.295

10.  Genetic risk, incident stroke, and the benefits of adhering to a healthy lifestyle: cohort study of 306 473 UK Biobank participants.

Authors:  Loes Ca Rutten-Jacobs; Susanna C Larsson; Rainer Malik; Kristiina Rannikmäe; Cathie L Sudlow; Martin Dichgans; Hugh S Markus; Matthew Traylor
Journal:  BMJ       Date:  2018-10-24
View more
  15 in total

1.  Inpatient care utilisation and expenditure associated with objective physical activity: econometric analysis of the UK Biobank.

Authors:  Leonie Heron; Mark A Tully; Frank Kee; Ciaran O'Neill
Journal:  Eur J Health Econ       Date:  2022-06-24

2.  Education and Income Show Heterogeneous Relationships to Lifespan Brain and Cognitive Differences Across European and US Cohorts.

Authors:  Kristine B Walhovd; Anders M Fjell; Yunpeng Wang; Inge K Amlien; Athanasia M Mowinckel; Ulman Lindenberger; Sandra Düzel; David Bartrés-Faz; Klaus P Ebmeier; Christian A Drevon; William F C Baaré; Paolo Ghisletta; Louise Baruël Johansen; Rogier A Kievit; Richard N Henson; Kathrine Skak Madsen; Lars Nyberg; Jennifer R Harris; Cristina Solé-Padullés; Sara Pudas; Øystein Sørensen; René Westerhausen; Enikő Zsoldos; Laura Nawijn; Torkild Hovde Lyngstad; Sana Suri; Brenda Penninx; Ole J Rogeberg; Andreas M Brandmaier
Journal:  Cereb Cortex       Date:  2022-02-08       Impact factor: 5.357

3.  Prospective Associations of Daily Step Counts and Intensity With Cancer and Cardiovascular Disease Incidence and Mortality and All-Cause Mortality.

Authors:  Borja Del Pozo Cruz; Matthew N Ahmadi; I-Min Lee; Emmanuel Stamatakis
Journal:  JAMA Intern Med       Date:  2022-09-12       Impact factor: 44.409

4.  Visualizing the Geography of HIV Observational Cohorts With Density-Adjusted Cartograms.

Authors:  Daniel E Sack; Stephen J Gange; Keri N Althoff; April C Pettit; Asghar N Kheshti; Imani S Ransby; Jeff J Nelson; Megan M Turner; Timothy R Sterling; Peter F Rebeiro
Journal:  J Acquir Immune Defic Syndr       Date:  2022-04-15       Impact factor: 3.771

5.  Cross-classification between self-rated health and health status: longitudinal analyses of all-cause mortality and leading causes of death in the UK.

Authors:  Julian Mutz; Cathryn M Lewis
Journal:  Sci Rep       Date:  2022-01-10       Impact factor: 4.996

6.  Reproductive factors and the risk of incident dementia: A cohort study of UK Biobank participants.

Authors:  Jessica Gong; Katie Harris; Sanne A E Peters; Mark Woodward
Journal:  PLoS Med       Date:  2022-04-05       Impact factor: 11.069

7.  The value of combining individual and small area sociodemographic data for assessing and handling selective participation in cohort studies: Evidence from the Swedish CardioPulmonary bioImage Study.

Authors:  Carl Bonander; Anton Nilsson; Jonas Björk; Anders Blomberg; Gunnar Engström; Tomas Jernberg; Johan Sundström; Carl Johan Östgren; Göran Bergström; Ulf Strömberg
Journal:  PLoS One       Date:  2022-03-08       Impact factor: 3.240

8.  Impact of replacing sedentary behaviour with other movement behaviours on depression and anxiety symptoms: a prospective cohort study in the UK Biobank.

Authors:  A A Kandola; B Del Pozo Cruz; D P J Osborn; B Stubbs; K W Choi; J F Hayes
Journal:  BMC Med       Date:  2021-06-17       Impact factor: 11.150

9.  Correcting pandemic data analysis through environmental fluid dynamics.

Authors:  Talib Dbouk; Dimitris Drikakis
Journal:  Phys Fluids (1994)       Date:  2021-06-22       Impact factor: 3.521

10.  Polygenic risk for coronary artery disease in the Scottish and English population.

Authors:  Chuhua Yang; Fabian Starnecker; Shichao Pang; Zhifen Chen; Ulrich Güldener; Ling Li; Matthias Heinig; Heribert Schunkert
Journal:  BMC Cardiovasc Disord       Date:  2021-12-07       Impact factor: 2.174

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.