Literature DB >> 20202213

Statistical modeling of volume of alcohol exposure for epidemiological studies of population health: the US example.

Jürgen Rehm1, Tara Kehoe, Gerrit Gmel, Fred Stinson, Bridget Grant, Gerhard Gmel.   

Abstract

BACKGROUND: Alcohol consumption is a major risk factor in the global burden of disease, with overall volume of exposure as the principal underlying dimension. Two main sources of data on volume of alcohol exposure are available: surveys and per capita consumption derived from routine statistics such as taxation. As both sources have significant problems, this paper presents an approach that triangulates information from both sources into disaggregated estimates in line with the overall level of per capita consumption.
METHODS: A modeling approach was applied to the US using data from a large and representative survey, the National Epidemiologic Survey on Alcohol and Related Conditions. Different distributions (log-normal, gamma, Weibull) were used to model consumption among drinkers in subgroups defined by sex, age, and ethnicity. The gamma distribution was used to shift the fitted distributions in line with the overall volume as derived from per capita estimates. Implications for alcohol-attributable fractions were presented, using liver cirrhosis as an example.
RESULTS: The triangulation of survey data with aggregated per capita consumption data proved feasible and allowed for modeling of alcohol exposure disaggregated by sex, age, and ethnicity. These models can be used in combination with risk relations for burden of disease calculations. Sensitivity analyses showed that the gamma distribution chosen yielded very similar results in terms of fit and alcohol-attributable mortality as the other tested distributions.
CONCLUSIONS: Modeling alcohol consumption via the gamma distribution was feasible. To further refine this approach, research should focus on the main assumptions underlying the approach to explore differences between volume estimates derived from surveys and per capita consumption figures.

Entities:  

Year:  2010        PMID: 20202213      PMCID: PMC2841092          DOI: 10.1186/1478-7954-8-3

Source DB:  PubMed          Journal:  Popul Health Metr        ISSN: 1478-7954


Introduction

The volume of alcohol consumed has been shown to be causally related to more than 230 International Classification of Disease, version 10, disease codes [1-3]. Most of these relationships follow the component cause model [4], in which only a fraction (the so-called attributable fraction [5]), of the incidence of a disease would disappear if the causal component, in this case alcohol use [6], could be eliminated. Liver cirrhosis may be used as an example: alcohol use has been shown to have a causal impact on liver cirrhosis [7-9]; however, there are also cases of liver cirrhosis where alcohol use has not been involved, such as those stemming from HCV infection in nondrinkers. The proportion of most diseases caused by alcohol in the component cause model in a population is determined by: • The distribution of the volume of exposure • The relative risk associated with each level of exposure, i.e., dose-response relationship [10] For most disease categories, the dose-response relationship is nonlinear and varies by sex as well as age in some cases. Thus, to calculate the alcohol-attributable fractions (AAFs) for estimating a population's burden of disease attributable to alcohol, we need to characterize the volume of alcohol exposure by sex and age. There is one main problem with the characterization of the volume of alcohol exposure in populations: the best indicator, adult per capita consumption [11], is not available by sex and age. It is derived mainly from production, sales, export, and import figures, which are almost never disaggregated [12]. As an alternative, alcohol exposure can be measured from surveys, yet this has the disadvantage that adult per capita figures, the best indicator for consumption, are often severely underestimated [11]. To give but one recent example: the national Canadian Addiction Survey [13] had a coverage rate of between 30% and 40% of the adult per capita consumption. Thus, estimating overall volume of consumption in Canada based on answers from a representative survey results in a figure that is 60% to 70% lower than the figure derived from aggregate statistics mainly based on sales and taxation. The exact magnitude of underestimation depends on the alcohol measure used in the survey [14]. This underestimation of population exposure leads to two problems: first, the absolute level of exposure is incorrectly estimated, usually underestimated. And second, inter- and intrapopulation distributions based on different surveys are not adequately comparable as the degree of incorrect estimation will vary. Recent large, nationally representative surveys have shown coverage rates between 25% and more than 100% [15]. Studies have thus had to explore additional means for achieving comparability. In the case of the Comparative Risk Assessment (CRA) for alcohol within the Global Burden of Disease (GBD) 2000 study, this was achieved by triangulating survey and per capita information based on sales or production [12,15]. In short, the distribution of volume by sex and age was taken from surveys, while the overall exposure was taken from adult per capita figures [15]. There is, however, more than one way to triangulate such data. The CRA 2000, for instance, used a categorical approach based on the standard categories of volume of drinking from English and colleagues [16]. For shifting the distribution of the survey to correspond to the per capita consumption, however, an additional assumption had to be made to obtain a unique solution, and the assumption chosen at the time was that the highest drinking category was a constant fraction of the percentage of the next highest category [15]. However, no empirical evidence supports this specific assumption. Therefore, a more evidence-based approach was sought for the ongoing CRA of the GBD 2005 study. Different distributions (gamma, log-normal, Weibull) were fitted to empirical data of surveys from 66 countries with the aim of identifying associations between distribution parameters that would enable shifting survey distributions to fit the volume of drinking indicated by adult per capita consumption data [17]. The two-parameter gamma distribution proved to be best suited for triangulating survey and per capita data by shifting distributions upward because: 1) As an inherent characteristic of this distribution, the means of fitted distributions are equal to those of the empirical distributions. Thus, there is no error involved in estimating means from the fitted distribution and no need for erroneous and complicated back-transformations from fitted distributions to original scales (as in the case of the log-normal distribution). 2) There was a very high correlation between means and standard deviations of the sex- and age-specific, fitted two-parameter gamma distributions (r = 0.923; N = 851; p < 0.01). We then conducted a linear regression with the standard deviation as the dependent variable, and mean and sex as the independent variables, and could predict the standard deviation with precision (r = 0.971; N = 851; p < 0.01; for the derived prediction equation, see below). Thus, the shifted distributions could easily and reliably be derived from the mean (= per capita consumption) and the standard deviation as estimated by regression methods [17]. This article discusses the modeling approach described above in its application to US data, with three main objectives: 1. To model the volume of alcohol exposure in the US with three different distributions: log-normal, gamma, and Weibull distribution for different strata by sex, age and ethnicity. The volume of alcohol use was obtained from a large representative survey, the National Epidemiologic Survey on Alcohol and Related Conditions (NESARC). 2. To shift the alcohol use distribution to the level of adult per capita consumption. 3. To show the impact of shifting the alcohol use distribution on AAFs, using liver cirrhosis as an example.

Methods

Description of underlying survey (NESARC)

This analysis is based on data from the 2001-2002 NESARC, which was designed and sponsored by the National Institute on Alcohol Abuse and Alcoholism. The fieldwork for the NESARC was conducted by the US Census Bureau, with data collected in face-to-face, computer-assisted, in-home interviews. The NESARC sample represents the civilian, noninstitutionalized adult population of the United States, including the District of Columbia, Alaska, and Hawaii, and includes people living in households and military personnel living on and off base [18]. The NESARC oversampled African Americans, Hispanics, and adults aged 18 to 24. One sample adult (age 18 or older) was selected for an interview in each household. The overall response rate was 81% (N = 43,093). The volume of ethanol consumption reflected consumption in the 12 months preceding the interview. The volume of ethanol intake was based on data summed over a separate series of questions for coolers, beer, wine, and distilled spirits. For each beverage, volume was estimated on the basis of: overall frequency of drinking; typical and largest quantities consumed; frequency of consuming the largest quantity; frequency of consuming five-plus drinks; typical drink size; and ethanol content by volume of the brand usually consumed. The test-retest reliabilities for the various measures of alcohol consumption from the 2001 to 2002 NESARC were good to excellent, with intraclass correlation coefficients ranging from 0.68 to 0.84 [19].

Methods for fitting the distributions

To find an appropriate model for alcohol consumption, we examined three distributions that were unimodal, had a density with only one maximum, and could be used to fit right-skewed empirical data: log-normal, gamma, and Weibull. The log-normal, gamma, and Weibull densities are similar in shape and mainly differ at the tails (i.e., at high levels of consumption). Alcohol consumption has been more commonly modeled using the log-normal distribution, mostly for historical reasons related to the so-called single distribution theory [20,21]. But the log-normal distribution also has been favored because it is easy to use as a transformation and results in sufficiently accurate values that permit fitting and testing hypotheses [22]. Although doubts regarding the single parametric log-normal distribution and its justification as the best approximation for the distribution of consumption have been previously raised, [23,24] it has appeared to provide good approximations for most applications [25,26]. Later developments on modeling alcohol exposure have favored more complex distributions such as gamma [27], or revealed that mixing distributions is needed to fit separate distributions for frequency of drinking and quantity of drinking [28]. The log-normal distribution, with parameters μ and σ, describes a random variable X where log x is normally distributed with mean μ and standard deviation σ. The log-normal distribution function, with parameters μ and σ, is given by: Although alcohol consumption is frequently modeled using the log-normal distribution, empirical distributions often deviate considerably from the log-normal model [27-29]. The gamma distribution has two parameters, a scale parameter θ and a shape parameter k. The gamma distribution is more adaptable than the log-normal distribution because it has the effect of stretching or compressing its range by changing the scale parameter θ. The gamma distribution has the following probability density function: The Weibull distribution is one of the most widely used distributions in applied statistics, especially life data analysis, because of its versatility in fitting a variety of distributions. The probability density function of a random variable X having a two-parameter Weibull distribution with shape parameter γ and scale parameter θ is given by: The shape parameter γ in the Weibull distribution gives this distribution its flexibility. The maximum likelihood method of estimation was used to fit all three models -- log-normal, gamma, and Weibull -- to the data using the R language [30]. All missing values on volume of drinking -- 298 in total (men 185; women 113) -- were excluded from the fitted models. All numerical integration used the trapezoidal rule with many subintervals to obtain more accurate estimates. The trapezoidal rule uses trapezoids instead of rectangles for approximating the definite interval over closed bounded intervals. The Newton-Raphson algorithm, having a quadratic rate of convergence, was used to optimize the likelihood equations solving for the unknown parameters with maximum likelihood estimates. To compare different fitted distributions, we used chi-square tests by comparing expected frequencies (derived from the fitted distributions) and observed frequencies from the empirical distributions, using a bandwidth of 10 grams of ethanol for the frequencies. The use of chi-square tests to compare two distributions is a standard method [31].

Method for shifting the distribution

As indicated above, the gamma distribution has two parameters k and θ, which can be expressed in terms of means and standard deviations using the intrinsic properties of the gamma function, namely: hence The gamma distribution also has the welcome property of its mean being the same as that of the empirical distribution. To shift a gamma distribution, the mean and standard deviation of the shifted distribution must be known. However, per capita consumption only indicates the mean of the (up-)shifted distribution. To derive the standard deviation of this distribution, we had to find a way to predict the standard deviation based on the mean. This was achieved via regression from the large global dataset. To shift the gamma functions to fit the adult per capita level, two crucial assumptions had to be made: 1. The proportion of abstainer categories as derived from the survey reflected the true proportion of current abstainers (lifetime abstainers plus ex-drinkers) in the population. 2. The overall coverage rate for the survey (i.e., the total volume of alcohol exposure derived from the survey divided by the adult per capita consumption from sales or other statistics) applied to all subpopulations as defined by age and sex. We found a coverage rate of 0.529% between the NESARC survey and adult per capita alcohol consumption for the US, estimated at 8.75 liters per capita for 2001-2002 when NESARC took place based on the Global Information System for Alcohol and Health http://apps.who.int/globalatlas/default.asp. Using this value, the shifted means of the drinkers for different subpopulations as described by age, sex, and ethnicity were derived as follows with the same constant for all subpopulations: As described above, the shifted standard deviation was derived empirically via regression analyses [17]: where sex was coded 0 for men and 1 for women. In the regression, N = 851 sex and age subpopulations from 66 countries, individually modeled to derive the above equation [17]. The multiple linear regression based on all these surveys explained 94% of variation of the dependent variable.

Method for deriving AAFs

Knowing the percentages of abstainers and former drinkers, as well as the risk relation, the AAFs based on continuous distributions were obtained using the following formula: while the categorical value of the AAFs were obtained using: where Pabs represents the proportion of lifetime abstainers, Pform the proportion of former drinkers, and P(x) the probability distribution function of drinkers. RRform represents the relative risk for former drinkers, and RR(x) the relative risk function for a given alcohol consumption in grams per day. The subscript i denotes the groups as characterized by different categories for volume of drinking. We conducted a sensitivity analysis with consumption capped at 150 grams of pure alcohol per day. Confidence intervals were based on simulations using the bootstrapping method [32]. For each AAF, 10,000 simulations were run.

Results

Additional File 1 gives an overview of the volume of drinking for different groups as defined by sex, age, and ethnicity. Overall, as expected, the younger the age group among adults, the higher the volume. Men consumed more than women, and Native Americans consumed on average more than other ethnicities. Table 1 and Figures 1 and 2 show the fitting of different distributions to the data for non-Hispanic whites as an example. All three distributions fit the data reasonably well, with some deviations mainly in the tails of the distributions. For consumption levels up to 100 grams, the Weibull and gamma distributions were very similar and fit well, but the log-normal distribution underestimated drinking at lower levels of drinking.
Table 1

Example for fitting exposure distributions for white non-Hispanic Americans

Goodness of fitChi Square
Volume of drinking in g/dayCountEmpirical distribution %log-normal fit %gamma fit %Weibull fit %log-normalgammaWeibull
Men
0 - 10452457.364.652.257.373.053.30.5
10 - 20.116314.711.715.214.650.40.20.5
20 - 306668.45.89.27.967.52.63.3
30 - 404595.83.56.15.070.90.510.3
40 - 502383.02.44.33.49.438.23.2
50 - 601562.01.83.12.41.747.07.3
60 - 701191.51.42.31.81.229.83.8
70 - 80951.21.11.71.41.115.81.3
80 - 90851.10.91.31.13.13.10.1
90 - 100480.60.71.00.81.717.85.7
100 +3364.36.53.54.096.913.31.9
TOTAL7889100.0100.0100.0100.0377.0221.637.7
Women
0 - 10690879.782.874.579.610.760.13.2
10 - 20.8489.87.313.810.553.0116.22.4
20 - 304435.13.16.04.369.58.314.4
30 - 401691.91.72.92.22.535.01.2
40 - 50921.11.11.51.20.113.21.4
50 - 60650.70.80.80.70.00.10.1
60 - 70370.40.50.40.53.10.00.2
70 - 80160.20.40.20.325.51.26.6
80 - 90190.20.30.10.24.63.10.1
90 - 100170.20.30.10.12.06.61.2
100 +560.61.80.10.4191.640.37.2
TOTAL8670100.0100.0100.0100.0362.4284.037.8
Figure 1

Histogram of alcohol exposure and fitted distributions for non-Hispanic white men.

Figure 2

Histogram of alcohol exposure and fitted distributions for non-Hispanic white women.

Example for fitting exposure distributions for white non-Hispanic Americans Histogram of alcohol exposure and fitted distributions for non-Hispanic white men. Histogram of alcohol exposure and fitted distributions for non-Hispanic white women. Table 2 gives an overview of all the chi-square deviations for the different subgroups as defined by sex, age, and ethnicity. Clearly, the Weibull distribution fits best, but there are no options for easily shifting this distribution [17]. However, for descriptive purposes only, the Weibull distribution seems to best fit the various distributions. For shifting distributions, the gamma distribution was chosen mainly for feasibility. However, it offered a relatively good fit for descriptive purposes as well. Overall, for several subgroups, gamma had a better fit than log-normal, whereas for others, log-normal showed the better fit (Table 2).
Table 2

Fit indices for different distributions

SexEthnicityAge categoryChi-square totalChi-square up to 100 g pure alcohol per day
log-normalgammaWeibulllog-normalgammaWeibull
MenWhite18 - 3483.289.722.968.089.322.6
MenWhite35 - 54108.1136.424.093.0123.518.9
MenWhite55+174.523.625.0136.021.523.6
MenBlack18 - 3419.747.813.317.747.813.3
MenBlack35 - 5442.456.622.738.656.422.7
MenBlack55+14.213.20.712.513.00.6
MenNative18 - 348.627.615.08.625.814.9
MenNative35 - 549.621.511.98.721.111.2
MenNative55+3.85.73.33.05.02.5
MenAsian/Pac. Islander18 - 344.318.96.84.216.85.6
MenAsian/Pac. Islander35 - 547.128.012.66.626.411.5
MenAsian/Pac. Islander55+5.011.26.24.510.75.7
MenHispanic18 - 3421.294.420.421.288.716.9
MenHispanic35 - 5442.963.914.227.362.113.8
MenHispanic55+8.921.35.58.518.74.7
WomenWhite18 - 3442.4243.053.029.5221.743.6
WomenWhite35 - 54206.081.027.1122.668.525.7
WomenWhite55+220.944.628.2102.939.827.5
WomenBlack18 - 349.9116.815.99.8111.113.1
WomenBlack35 - 5433.558.58.126.753.87.2
WomenBlack55+9.324.04.79.222.44.2
WomenNative18 - 344.218.16.23.918.16.0
WomenNative35 - 547.426.511.76.825.710.9
WomenNative55+3.45.63.22.95.12.7
WomenAsian/Pac. Islander18 - 343.249.514.03.048.813.1
WomenAsian/Pac. Islander35 - 546.04.34.85.53.84.3
WomenAsian/Pac. Islander55+5.44.85.14.94.34.6
WomenHispanic18 - 349.1140.317.68.4133.814.4
WomenHispanic35 - 5413.734.46.612.929.52.8
WomenHispanic55+6.710.23.56.48.32.0
Fit indices for different distributions Table 3 lists the parameters of the original and the shifted gamma distributions. Again, the unshifted and shifted distributions for non-Hispanic whites are given for illustration in Figures 3 and 4.
Table 3

Parameter estimates for original and shifted gamma distributions

SexEthnicityAgeOriginal NESARC distributionUpshifted distribution
kThetaMeanSDKTheta
MenWhiteAll Ages0.43648.13240.26847.2750.72655.501
MenWhite18 - 340.45058.55050.58659.3880.72669.721
MenWhite35 - 540.44146.95138.99345.7780.72653.743
MenWhite55+0.43037.48629.71334.8830.72640.953
MenBlackAll Ages0.38765.82149.57058.1950.72668.321
MenBlack18 - 340.38065.85047.95756.3010.72666.098
MenBlack35 - 540.40665.15552.86062.0580.72672.856
MenBlack55+0.36266.53044.64252.4100.72661.529
MenNativeAll Ages0.35488.39956.52866.3630.72677.911
MenNative18 - 340.354111.13070.68782.9870.72697.427
MenNative35 - 540.37687.40058.11168.2230.72680.094
MenNative Asian/Pac.55+0.34546.58427.41232.1810.72637.781
MenIslander Asian/Pac.All Ages0.41728.22422.54426.4670.72631.072
MenIslander Asian/Pac.18 - 340.46027.78426.02230.5490.72635.865
MenIslander Asian/Pac.35 - 540.41230.00622.78426.7490.72631.403
MenIslander55+0.34517.66511.80113.8540.72616.265
MenHispanicAll Ages0.42640.75640.72647.8120.72656.131
MenHispanic18 - 340.42243.41736.14842.4380.72649.823
MenHispanic35 - 540.43340.63352.21561.3000.72671.967
MenHispanic55+0.42631.67823.51727.6090.72632.413
WomenWhiteAll Ages0.39119.77814.78518.3610.64822.801
WomenWhite18 - 340.39923.00917.35321.3760.65926.330
WomenWhite35 - 540.41018.28214.22417.7020.64622.030
WomenWhite55+0.36617.85212.49515.6720.63619.657
WomenBlackAll Ages0.31627.72920.06924.5650.66830.066
WomenBlack18 - 340.31130.99619.84424.2990.66729.756
WomenBlack35 - 540.32627.46122.64227.5850.67433.607
WomenBlack55+0.31019.49412.31315.4590.63419.408
WomenNativeAll Ages0.29554.51433.40540.2210.69048.427
WomenNative18 - 340.29787.19049.13558.6880.70170.098
WomenNative35 - 540.30442.47227.34733.1090.68240.084
WomenNative Asian/Pac.55+0.33620.35019.86624.3260.66729.787
WomenIslander Asian/Pac.All Ages0.31824.85514.46417.9840.64722.360
WomenIslander Asian/Pac.18 - 340.31142.17224.42829.6820.67736.065
WomenIslander Asian/Pac.35 - 540.4416.7025.4607.4130.54210.065
WomenIslander55+0.3605.8352.4303.8560.3976.118
WomenHispanicAll Ages0.34117.85518.48222.7010.66327.883
WomenHispanic18 - 340.32523.54515.42319.1090.65123.677
WomenHispanic35 - 540.37712.81524.68029.9770.67836.411
WomenHispanic55+0.33013.51310.68213.5440.62217.173
Figure 3

Original and shifted gamma distributions for non-Hispanic white men.

Figure 4

Original and shifted gamma distributions for non-Hispanic white women.

Parameter estimates for original and shifted gamma distributions Original and shifted gamma distributions for non-Hispanic white men. Original and shifted gamma distributions for non-Hispanic white women. Finally, the implications of the distributional shift for the AAFs of liver cirrhosis are displayed in Table 4. Clearly, the shift in distribution results in changes in AAFs that are public health-relevant. On the other hand, Table 4 also shows that the different distributions did not have a marked influence on AAFs, further justifying the choice of the gamma distribution.
Table 4

AAFs (in %) for liver cirrhosis for subpopulations as defined by sex and ethnicity based on different distributions

SexEthnicityContinuous AAF from NESARC using log normalContinuous AAF from NESARC using WeibullContinuous AAF from NESARC using gammaContinuous AAF from shifted distribution*Categorical AAF from NESARC**Categorical AAF from shifted distribution**
MenWhite51.252.655.172.355.171.9
MenBlack49.152.457.073.254.672.2
MenNative51.155.962.276.158.774.9
MenAsian/Pac. Islander34.731.430.949.737.552.3
MenHispanic45.044.947.371.348.370.9
WomenWhite66.265.265.773.573.677.2
WomenBlack65.165.366.975.172.077.2
WomenNative70.371.674.682.376.883.1
WomenAsian/Pac. Islander53.653.756.463.463.167.5
WomenHispanic56.856.257.871.767.474.6

* The gamma distributions were shifted upward to the level of the per capita data.

** Based on four drinking categories: > 0 - 30 g/day; > 30 - 60 g/day; > 60 - 90 g/day; > 90 g/day.

AAFs (in %) for liver cirrhosis for subpopulations as defined by sex and ethnicity based on different distributions * The gamma distributions were shifted upward to the level of the per capita data. ** Based on four drinking categories: > 0 - 30 g/day; > 30 - 60 g/day; > 60 - 90 g/day; > 90 g/day. Furthermore, the differences between the continuous and the categorical approach were not pronounced. The results for the confidence intervals are listed in Table 5.
Table 5

AAFs (in %) for liver cirrhosis and corresponding 95% confidence intervals for subpopulations as defined by sex and ethnicity based on different distributions

SexEthnicityContinuous AAF from shifted distribution*Standard Error95% Confidence interval
Lower BoundUpper Bound
MenWhite72.30.4571.473.2
MenBlack73.20.4972.274.2
MenNative76.10.8374.577.7
MenAsian/Pac. Islander49.70.9847.851.6
MenHispanic71.30.4770.472.2
WomenWhite73.50.2473.074.0
WomenBlack75.10.2674.675.6
WomenNative82.30.4581.483.2
WomenAsian/Pac. Islander63.40.8161.865.0
WomenHispanic71.70.3471.072.4

* The gamma distributions were shifted upward to the level of the per capita data.

AAFs (in %) for liver cirrhosis and corresponding 95% confidence intervals for subpopulations as defined by sex and ethnicity based on different distributions * The gamma distributions were shifted upward to the level of the per capita data.

Discussion

A procedure for triangulating survey and per capita data for deriving population exposure based on the gamma distribution for drinkers was presented and explored. This procedure proved feasible for modeling US drinking and generating AAFs based on the per capita consumption. It also allowed for the quantitative comparability of data on alcohol exposure from surveys with different coverage rates and a necessary correction for varying coverage rates. One crucial assumption made in the triangulation process was a constant factor of underreporting for all subpopulations as defined by sex, age, and ethnicity (see formula for the shifted mean above). There is no conclusive literature on differential underreporting by different subpopulations; even the literature on underreporting by volume of drinking is not conclusive [11]. It may be hypothesized that the more irregular the occasions of heavy drinking, the more underreporting there is as such occasions are difficult to report with our standard instruments. However, research is necessary to test this hypothesis before differential upshifting factors can be used for subpopulations. The methodological framework presented here certainly allows for such modifications. One argument against upshifting the distribution of alcohol consumed to the level of per capita data is that the risk relations usually are also derived from self-reports on alcohol exposure, i.e., that the subject's responses in epidemiological studies also underestimate real drinking. There are, however, several counterarguments. First, alcohol exposure measurement in medical epidemiological studies and in general appears to yield valid individual consumption levels [33,34]. Second, as typically found in medical epidemiological studies [33], there are higher intercorrelations with external standards when alcohol is embedded into a series of other food items. Third, it has been shown that embedded alcohol items yield higher levels of consumption compared to questionnaires specifically targeting alcohol use [35]. Thus, there are indications that questions on alcohol in medical epidemiological studies yield more consistent and higher alcohol exposure compared to those in typical national alcohol surveys. However, the degree of difference between these approaches is not clear. There are three principal ways to explain the undercoverage of per capita consumption in surveys: • Measurement error in surveys due to sampling; • Measurement error in surveys due to respondents' behavior, such as underreporting, problems in averaging, forgetting, or dropping out of the survey [11]; • Measurement error in per capita consumption. It is beyond the scope of this article to give a full review on the types of measurement error in alcohol surveys. The literature is diverse and highly speculative, and there are few systematic studies on reasons for undercoverage. In high-income countries such as the US, there are groups left out of sampling frames who are very high alcohol consumers, such as the homeless or those living in institutions. Consumption is skewed, with a small segment of the population consuming a high proportion of alcohol. In the NESARC sample, 6.7% of the heaviest white male drinkers consume 33% of the overall consumption; in the upshifted distribution for the same group, 10.2% consume this proportion [36]. It is thus possible that a large part of the undercoverage is due to sampling schemes, and hence there might be less systematic underreporting among respondents than it appears. However, as laid out above, it is unlikely that sampling explains all of the undercoverage. There is some evidence for individual underreporting as well [11]. The third explanation is that the assumption that the sales/production figures are the gold standard might be wrong. This explanation seems implausible except in circumstances where there is large unrecorded consumption. Where there is much unrecorded consumption, including that in the sales/production estimate would of course increase the differences with estimates from surveys. It is unlikely that there is much alcohol measured in the sales/production figures that is not in fact consumed. Why would consumers pay for goods that would be wasted, e.g., as in the case of Canada cited above, a wastage of 60% to 70%? Such behavior would be contradictory to evidence from economics. Clearly, some alcohol bought is not consumed due to spillage. However, according to industry experts, this spillage should amount to less than 10%. Other alcohol may also be stocked rather than consumed in the year of purchase, but overall across regions and years, this should cancel out. Thus, the assumption of per capita consumption derived from aggregate statistics such as sales and/or production being the best estimate for overall volume of consumption seems justified. Exploring the factors involved in undercoverage by surveys should be a research priority in coming years. It seems wasteful to conduct hundreds of annual surveys with questions about alcohol in high-income countries without knowledge of why these surveys typically cover only proportions between 30% to 60% of per capita consumption. The assumption made in the present analysis of constant undercoverage in different population segments also needs to be tested and replaced by empirical estimates of differential undercoverage. However, irrespective of reasons for undercoverage, triangulation is necessary for the comparison of alcohol exposure. If surveys are compared to each other, they should have their proportions of coverage standardized in the same way that disease rates are standardized to correct for population distributions. It would also appear irrational to continue the comparisons of surveys in which the underlying coverage rates starkly differ as the results of such comparisons are not interpretable. This reasoning is independent of the level of upshifting chosen. Based on the uncertainty about the degree of underreporting in medical epidemiological studies, we suggest the routine application of sensitivity analyses using 100%, 90%, and 80% of per capita consumption as the target levels when the actual level of population consumption is important. This method will be used not only in the CRA 2005, but, together with new empirically determined disability weights, also in the ongoing US Burden of Disease study [37]. A final point concerns the assumptions made in the upshifting. First, we assumed the proportion of abstainers and ex-drinkers to be exactly as assessed by the survey. Unfortunately, there is no better information available on which different estimates could be based. The medical epidemiological literature is of no help here, as cohorts get selected based on their potential to be followed up, and this may yield some proportions of abstainers that are not at all representative of the general population. Second, we assumed that proportions by sex and age are correctly estimated by the survey. Again, there are no better data currently available. We can only speculate if and how the inclusion of nonhousehold members shifts the proportions of alcohol consumed. More research is necessary on such populations to estimate the bias introduced by relying only on households in the sampling frame.

Conclusion

Overall, the chosen methods for estimating alcohol exposure for population health proved feasible and seemed justified based on current knowledge. Further work is needed for refinement of methods and validation of assumptions.

Conflict of interest

The authors declare that they have no competing interests.

Authors' contributions

JR supervised all aspects of the work and wrote the first draft. TK programmed most of the statistical tests and contributed to the writing. GG Jr contributed to the programming, did part of the statistical analyses, and contributed to the writing. FS and BG did part of the statistical analyses and contributed to the writing. GG Sr. contributed to the programming, helped design the study, and contributed to the writing. All authors read and approved the final manuscript.

Additional file 1

Description of key parameters on volume of alcohol exposure from NESARC (weighted to be representative for the population). Click here for file
  20 in total

1.  A heuristic approach to the formulas for population attributable fraction.

Authors:  J A Hanley
Journal:  J Epidemiol Community Health       Date:  2001-07       Impact factor: 3.710

2.  The estimation and interpretation of attributable risk in health research.

Authors:  S D Walter
Journal:  Biometrics       Date:  1976-12       Impact factor: 2.571

3.  The relation between different dimensions of alcohol consumption and burden of disease: an overview.

Authors:  Jürgen Rehm; Dolly Baliunas; Guilherme L G Borges; Kathryn Graham; Hyacinth Irving; Tara Kehoe; Charles D Parry; Jayadeep Patra; Svetlana Popova; Vladimir Poznyak; Michael Roerecke; Robin Room; Andriy V Samokhvalov; Benjamin Taylor
Journal:  Addiction       Date:  2010-03-15       Impact factor: 6.526

4.  Avoidable burden of disease: conceptual and methodological issues in substance abuse epidemiology.

Authors:  Jürgen Rehm; Benjamin Taylor; Jayadeep Patra; Gerhard Gmel
Journal:  Int J Methods Psychiatr Res       Date:  2006       Impact factor: 4.035

5.  The distribution of alcohol consumption--30 years on.

Authors:  J C Duffy
Journal:  Br J Addict       Date:  1986-12

6.  Drinking in California: theoretical and empirical analyses of alcohol consumption patterns.

Authors:  P J Gruenewald; T Nephew
Journal:  Addiction       Date:  1994-06       Impact factor: 6.526

Review 7.  Valuation of health states in the US study to establish disability weights: lessons from the literature.

Authors:  Jürgen Rehm; Ulrich Frick
Journal:  Int J Methods Psychiatr Res       Date:  2010-03       Impact factor: 4.035

Review 8.  Alcohol as a risk factor for liver cirrhosis: a systematic review and meta-analysis.

Authors:  Jürgen Rehm; Benjamin Taylor; Satya Mohapatra; Hyacinth Irving; Dolly Baliunas; Jayadeep Patra; Michael Roerecke
Journal:  Drug Alcohol Rev       Date:  2010-07

9.  Alcohol-attributable mortality and potential years of life lost in Canada 2001: implications for prevention and policy.

Authors:  Jürgen Rehm; Jayadeep Patra; Svetlana Popova
Journal:  Addiction       Date:  2006-03       Impact factor: 6.526

10.  The Alcohol Use Disorder and Associated Disabilities Interview Schedule-IV (AUDADIS-IV): reliability of alcohol consumption, tobacco use, family history of depression and psychiatric diagnostic modules in a general population sample.

Authors:  Bridget F Grant; Deborah A Dawson; Frederick S Stinson; Patricia S Chou; Ward Kay; Roger Pickering
Journal:  Drug Alcohol Depend       Date:  2003-07-20       Impact factor: 4.492

View more
  73 in total

Review 1.  Using administrative health data for the surveillance of interventions for alcohol-related harm among young people.

Authors:  Steve Kisely; Mark Asbridge; Jason Connor; Angela White; Joanne Pais; Elizabeth Lin
Journal:  CMAJ       Date:  2011-06-13       Impact factor: 8.262

2.  Assessment of alcoholic standard drinks using the Munich composite international diagnostic interview (M-CIDI): An evaluation and subsequent revision.

Authors:  Sören Kuitunen-Paul; Jürgen Rehm; Dirk W Lachenmeier; Firdeus Kadrić; Paula T Kuitunen; Hans-Ulrich Wittchen; Jakob Manthey
Journal:  Int J Methods Psychiatr Res       Date:  2017-03-28       Impact factor: 4.035

3.  Global burden of injuries attributable to alcohol consumption in 2004: a novel way of calculating the burden of injuries attributable to alcohol consumption.

Authors:  Kevin D Shield; Gerrit Gmel; Jayadeep Patra; Jürgen Rehm
Journal:  Popul Health Metr       Date:  2012-05-18

4.  Estimation of the impacts of substance use on workplace productivity: a hybrid human capital and prevalence-based approach applied to Canada.

Authors:  Justin T Sorge; Matthew Young; Bridget Maloney-Hall; Adam Sherk; Pam Kent; Jinhui Zhao; Tim Stockwell; Katerina Perlova; Scott Macdonald; Brian Ferguson
Journal:  Can J Public Health       Date:  2019-12-02

5.  Estimating relative risk of a log-transformed exposure measured in pools.

Authors:  Emily M Mitchell; Torie C Plowden; Enrique F Schisterman
Journal:  Stat Med       Date:  2016-08-16       Impact factor: 2.373

6.  Missing value imputation in longitudinal measures of alcohol consumption.

Authors:  Ulrike Grittner; Gerhard Gmel; Samuli Ripatti; Kim Bloomfield; Matthias Wicki
Journal:  Int J Methods Psychiatr Res       Date:  2011-03       Impact factor: 4.035

7.  A new methodological approach to adjust alcohol exposure distributions to improve the estimation of alcohol-attributable fractions.

Authors:  William J Parish; Arnie Aldridge; Benjamin Allaire; Donatus U Ekwueme; Diana Poehler; Gery P Guy; Cheryll C Thomas; Justin G Trogdon
Journal:  Addiction       Date:  2017-06-23       Impact factor: 6.526

8.  Distribution of Drinks Consumed by U.S. Adults by Average Daily Alcohol Consumption: A Comparison of 2 Nationwide Surveys.

Authors:  Marissa B Esser; Jeffrey J Sacks; Adam Sherk; Katherine J Karriker-Jaffe; Thomas K Greenfield; Carol Pierannunzi; Robert D Brewer
Journal:  Am J Prev Med       Date:  2020-08-01       Impact factor: 5.043

9.  How should prevalence of alcohol use disorders be assessed globally?

Authors:  Jürgen Rehm
Journal:  Int J Methods Psychiatr Res       Date:  2016-05-01       Impact factor: 4.035

10.  Alcohol-attributable cancer deaths and years of potential life lost in the United States.

Authors:  David E Nelson; Dwayne W Jarman; Jürgen Rehm; Thomas K Greenfield; Grégoire Rey; William C Kerr; Paige Miller; Kevin D Shield; Yu Ye; Timothy S Naimi
Journal:  Am J Public Health       Date:  2013-02-14       Impact factor: 9.308

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.