Literature DB >> 35172753

Inverse probability weighting to handle attrition in cohort studies: some guidance and a call for caution.

Marie-Astrid Metten1, Nathalie Costet2, Luc Multigner2, Jean-François Viel1, Guillaume Chauvet3.   

Abstract

BACKGROUND: Attrition in cohort studies challenges causal inference. Although inverse probability weighting (IPW) has been proposed to handle attrition in association analyses, its relevance has been little studied in this context. We aimed to investigate its ability to correct for selection bias in exposure-outcome estimation by addressing an important methodological issue: the specification of the response model.
METHODS: A simulation study compared the IPW method with complete-case analysis (CCA) for nine response-mechanism scenarios (3 missing at random - MAR and 6 missing not at random - MNAR). Eighteen response models differing by the type of variables included were assessed.
RESULTS: The IPW method was equivalent to CCA in terms of bias and consistently less efficient in all scenarios, regardless of the response model tested. The most effective response model included only the confounding factors of the association model.
CONCLUSION: Our study questions the ability of the IPW method to correct for selection bias in situations of attrition leading to missing outcomes. If the method is to be used, we encourage including only the confounding variables of the association of interest in the response model.
© 2022. The Author(s).

Entities:  

Keywords:  Attrition; Cohort studies; Complete-case analysis; Inverse probability weighting; Missing outcome; Selection bias

Mesh:

Year:  2022        PMID: 35172753      PMCID: PMC8848672          DOI: 10.1186/s12874-022-01533-9

Source DB:  PubMed          Journal:  BMC Med Res Methodol        ISSN: 1471-2288            Impact factor:   4.615


Cohort studies are essential for investigating associations between exposure and health outcomes thanks to their prospective design. The repeated collection of information in successive follow-ups (also called survey waves) allows studying the effects of past exposures on health outcomes occurring at inclusion or thereafter. However, such studies are known to be affected by partial and total non-response, which can invalidate the causal inference that can be drawn from them. Partial non-response refers to missing data that occasionally occurs for certain variables during a survey wave when some individuals fail or refuse to answer some of the questions. Total non-response (or attrition) occurs when a subset of individuals does not participate in one specific survey wave or quit the study completely [1]. Only the latter (drop-outs) was considered in this study. Missing data resulting from non-response can be classified according to their postulated underlying mechanism [2]. In situations of the missing completely at random mechanism (MCAR), the probability of missing data does not depend on either the observed or unobserved values. In situations of the missing at random mechanism (MAR), it depends on the observed data but not the unobserved data. Finally, in situations of the missing not at random mechanism (MNAR), it depends on the unobserved data. The simplest and most widely used approach to handle total non-response in cohort studies is complete-case analysis (CCA). This method assumes a MCAR mechanism and consists of studying the exposure-outcome association in the subset of respondents only. However, total non-response is generally considered to result from MAR or MNAR mechanisms. Several methodological publications have suggested the use of the inverse probability weighting (IPW) method in situations of the MAR mechanism of attrition [3, 4]. It aims to recreate a representative sample of the initial cohort by differentially weighting the so-called “complete individuals” (i.e. those who participate in the survey wave under consideration). More precisely, when modeling the association between exposure and outcome, respondents are weighted by the inverse of their probability to participate (hereinafter referred to as the “response probability” or “probability of response”). This response probability depends on some of the respondent’s characteristics. The use of the inverse of this probability implies that a respondent with a high probability of response (e.g. an individual with a high socio-economic level [5]) is given a comparatively lower weight in the analysis. The approach can be summarized as: “the respondents carry the weight of the non-respondents”. The probability of response is unknown and needs to be estimated from the data. The first step is therefore to build a response model (logistic regression model) to obtain weights that will be used in a second step in the association model. Because the association model is only fitted among respondents or complete individuals, the method is also called “weighted complete-case analysis” [2]. It is also referred to as “inverse probability of participation/attrition weighting” (IPPW/IPAW) in the literature [6-8]. Originally developed for reducing the effects of confounding in observational studies (propensity score method) [9], the IPW method was extended to correct for selection biases in situations of attrition. Although researchers have already adopted the method in association studies [10, 11], guidance on its correct use is still needed, in particular regarding the specification of the response model (i.e. variables to be introduced into the response model). In this article, we will focus on attrition resulting in a missing outcome of interest. This situation is commonly encountered in mother-child cohorts for example, where the effects of prenatal medical conditions or exposures on the future health of the children are studied. At the time point of interest (6 year old, for example), some children do not participate in the follow-up. Depending on the attrition mechanism (MAR, MNAR), restricting the analysis to the participating children may result in a biased estimation of the association between the exposure and the outcome. Our work aimed at evaluating through simulations the ability of the IPW method to correct for a selection bias under various missingness mechanisms and specifications of the response model. Response model specifications were compared in terms of bias, variance and mean square error of the association estimates between the exposure and the outcome. In all scenarios tested, we assumed that the exposure variable and the other covariates were fully observed at preceding waves or at baseline.

Which variables should be introduced into the response model?

Relatively few authors have addressed the question of which variables should be introduced in the response model from which the weighting is derived. In 2004, Hernan et al. recommended including the exposure variable and all variables that independently predict both response and outcome [3]. In 2013, Seaman and White advised not including variables that are exclusively related to the response without being related to the outcome and exposure variables. They suggested adding confounding variables (i.e. associated with both exposure and outcome) and prognostic variables (i.e. exclusively associated with the outcome) of the association studied [4]. Seaman and White’s recommendations are consistent with simulation studies performed in the propensity score literature [12, 13]. In this context, including variables that are related to the exposure but not the outcome is discouraged. Variables unrelated to the exposure but related to the outcome should instead be included. The literature on the IPEW method considers only two variables: the exposure and the outcome, whereas the IPPW method involves the response as a third variable. Thus, there is still uncertainty as to whether the inclusion of confounding and prognostic variables in the response model depends on whether or not they are associated with the response. Furthermore, if the exposure variable X is itself associated with the response, should it be included in the response model? A quick glance at the applied literature shows that researchers usually build the response and association models independently. They often fit a logistic regression model by including the presumed predictors of response, whether or not they are related to exposure or outcome [14, 15]. Their approach is thus primarily to fit a model that perfectly predicts the response and not to optimize the response model in relation to the association model. None of the proposed strategies in the literature has been tested through simulations and they do not appear to be applied by researchers. Therefore, we propose here to provide insight on this issue by studying the impact of the type of variables included in the response model on the bias and variance of the exposure regression coefficient in the association model.

Simulation study

We conducted a Monte-Carlo simulation study under several MAR and MNAR scenarios. We aimed to evaluate i) the relative performance of the IPPW method relative to CCA and ii) how the specification of the response model in the IPPW method affects the bias of the exposure regression coefficient , its variance and mean square error, and the coverage rate of confidence intervals. The SAS code to implement the simulation study is available in Additional file 1: e-Appendix 1. We focused on the case of a linear regression model in which a continuous outcome is explained by continuous exposure and covariates.

Data-generating process

We first created a sample of size n = 1,000, containing seven covariates z1, …, z7 generated independently according to standard normal distributions. We then generated an exposure variable according to the following model:where ϵ is generated according to a standard normal distribution. In the exposure model (1), the coefficients were chosen as α1 = α2 = α5 = α6 = 0.218, so that the correlation between x and each of the covariates z was approximately 0.2. We generated an outcome variable according to the following model:where ϵ′ is generated according to a standard normal distribution. In the outcome model (2), the coefficients were chosen as β =0.5, β1 = β5 = 0.170, and β3 = β7 = 0.230, such that the correlation between y and x was approximately 0.3,and the correlation between y and any of the covariates z was approximately 0.2. Finally, we generated response probabilities according to the following logistic model: We used the values γ = 0.0, 0.2 or 0.5 and γ = 0.0, 0.2 or 0.5. The case in which γ =0.0 corresponds to a MAR situation (i.e. the response probability does not depend on y). The cases in which γ =0.2 and 0.5 correspond to MNAR situations (i.e. the response probability depends on y). In the response model (3), the coefficients γ1, γ2, γ3, and γ4 were chosen to be equal to 0.1. The coefficient γ0 was chosen such that the average response rate was approximately 60% for all cases. In the sample, the individuals responded independently with the probabilities p. The data-generation model is presented in Fig. 1 and the nine response mechanism scenarios are summarized in Table 1.
Fig. 1

Scheme of the data-generation model for the simulation experiments. Seven covariates, differing in their association with the variables of interest (exposure and outcome variables) and the response variable, were generated. The strength of the associations (dashed arrows) between the variables of interest and the response variable (γx and γy) varied according to the scenarios described in Table 1

Table 1

Response mechanism scenarios (data generation)

Scenarioγxγy\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\boldsymbol{\gamma}}_{\mathbf{1}},{\boldsymbol{\gamma}}_{\mathbf{2}},{\boldsymbol{\gamma}}_{\mathbf{3}},{\boldsymbol{\gamma}}_{\begin{array}{c}\mathbf{4}\\ {}\ \end{array}}$$\end{document}γ1,γ2,γ3,γ4Description
MAR 10.00.00.1Response depending only on covariates
MAR 20.20.00.1Response depending on covariates and exposure
MAR 30.50.00.1Response depending on covariates and exposure
MNAR 10.00.20.1Response depending on outcome and covariates
MNAR 20.20.20.1Response depending on outcome, exposure, and covariates
MNAR 30.50.20.1Response depending on outcome, exposure, and covariates
MNAR 40.00.50.1Response depending on outcome and covariates
MNAR 50.20.50.1Response depending on outcome, exposure, and covariates
MNAR 60.50.50.1Response depending on outcome, exposure, and covariates

:regression coefficients of the generated response models (logit(p) = γ0 + γ y + γ x + γ1 z1 + γ2 z2 + γ3 z3 + γ4 z4)

Scheme of the data-generation model for the simulation experiments. Seven covariates, differing in their association with the variables of interest (exposure and outcome variables) and the response variable, were generated. The strength of the associations (dashed arrows) between the variables of interest and the response variable (γx and γy) varied according to the scenarios described in Table 1 Response mechanism scenarios (data generation) :regression coefficients of the generated response models (logit(p) = γ0 + γ y + γ x + γ1 z1 + γ2 z2 + γ3 z3 + γ4 z4)

Simulation parameters and performance criteria

We compared the IPPW method to CCA for a parsimonious association model, i.e. including only the confounding variables Z1 and Z5, which corresponds to standard epidemiological practice: . Several response models were tested (see Table 2) to determine the impact of the type of variables included on the regression coefficient of the exposure variable and its variance in the association model. Briefly, we first evaluated the “well-specified” response model, i.e. the one that included all the variables really related to the response (X, Z1, Z2, Z3, Z4), as simulated (Eq. 1). We also initially included the exposure variable X, although this variable was not associated with the response in certain tested scenarios (MAR 1, MNAR 1, MNAR 4). We then tested a model including all available variables. Then, we assessed the proposals by Hernan et al. (2004) and by Seaman and White (2013), described above [3, 4]. Finally, we evaluated parsimonious strategies: including only the confounding variable associated with the response, including only the confounding variable not associated with the response, including both, including both with the addition of a prognostic variable not associated with the response, and finally, including both confounding and prognostic variables not associated with the response. All these response models were then evaluated without the exposure variable X.
Table 2

Response models tested

Response modelSet of variablesDescription
1X, Z1, Z2, Z3, Z4All variables associated with the responsea
2X, Z1, Z2, Z3, Z4, Z5, Z6, Z7The exposure variable X and all covariates
3X, Z1, Z3The exposure variable X and variables associated with both response and outcome (strategy proposed by Hernan et al. [3])
4X, Z1, Z2, Z3, Z5, Z7All variables associated with the response*, except Z4 only associated with the response; Adding Z5 a confounding variable (Z5) and a prognostic variable (Z7), neither associated with the response (strategy proposed by Seaman and White [4])
5X, Z1The exposure variable X and the confounding variable associated with the response (Z1)
6X, Z5The exposure variable X and the confounding variable not associated with the response (Z5)
7X, Z1, Z5The exposure variable X and both confounding variables, that associated with the response (Z1) the other not (Z5)
8X, Z1, Z5, Z7The exposure variable X, both confounding variables (Z1, Z5) and a prognostic variable not associated with the response (Z7)
9X, Z5, Z7The exposure variable X and a confounding variable and prognostic variable, neither associated with the response (Z5, Z7)
10Z1, Z2, Z3, Z4Previous response models without the exposure variable X
11Z1, Z2, Z3, Z4, Z5, Z6, Z7
12Z1, Z3
13Z1, Z2, Z3, Z5, Z7
14Z1
15Z5
16Z1, Z5
17Z1, Z5, Z7
18Z5, Z7

aX was not associated with the response in scenarios MAR 1, MNAR 1, or MNAR 4

Response models tested aX was not associated with the response in scenarios MAR 1, MNAR 1, or MNAR 4 The generation of the sample and variables was repeated B = 10,000 times. For each sample, we computed the regression coefficient and its variance according to the 18 possible response models. The simulations were conducted using SAS version 9.4. The results were assessed according to the following criteria: The Monte Carlo bias: The Monte Carlo variance: The mean square error: The relative root mean square error: The Monte Carlo variance is the variance of the estimates over 10,000 replications. Therefore, it accounts for the entire variability of the estimators, including the fact that the weights are estimated. We have also computed the coverage rates for the normality-based confidence intervals for , with nominal rates of 2.5% in each tail.

Results of the simulation study

The simulation results are reported in Tables 3, 4 and 5 for the Monte Carlo bias, variance, mean square error and relative root mean square error, and in Table 6 for the coverage rates.
Table 3

Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MAR response mechanism scenarios

ScenarioaγxγyCCAIPPW method
Response models
(X), Z1, Z2, Z3, Z4(X), Z1, Z2, Z3, Z4, Z5, Z6, Z7(X), Z1, Z3(X), Z1, Z2, Z3, Z5, Z7(X), Z1(X), Z5(X), Z1, Z5(X), Z1, Z5, Z7(X), Z5, Z7
MAR 10.00.0Bias0.00Xb0.000.000.000.000.000.000.000.000.00
0.000.000.000.000.000.000.000.000.00
Variance (10-3)1.670X1.6861.6911.6791.6831.6761.6761.6781.6791.677
1.6841.6871.6761.6801.6721.6711.6741.6751.672
MSE (10-3)1.670X1.6861.6911.6791.6831.6761.6761.6781.6791.677
1.6841.6871.6761.6801.6721.6711.6741.6751.672
RRMSE (%)16.3X16.416.416.416.416.416.416.416.416.4
16.416.416.416.416.416.416.416.416.4
MAR 20.20.0Bias0.00Xb0.000.000.000.000.000.000.000.000.00
0.000.000.000.000.000.000.000.000.00
Variance (10-3)1.668X1.7151.7201.7091.7111.7041.7051.7061.7071.705
1.6841.6891.6781.6821.6731.6681.6741.6761.670
MSE (10-3)1.669X1.7151.7201.7091.7111.7051.7061.7071.7081.706
1.6851.6891.6791.6821.6741.6701.6751.6771.671
RRMSE (%)16.3X16.616.616.516.516.516.516.516.516.5
16.416.416.416.416.416.316.416.416.4
MAR 30.50.0Bias0.00Xb0.000.000.000.000.000.000.000.000.00
0.000.000.000.000.000.000.000.000.00
Variance (10-3)1.761X1.9922.0021.9811.9851.9731.9771.9781.9771.976
1.7871.7981.7771.7871.7731.7631.7761.7781.765
MSE (10-3)1.768X1.9922.0021.9811.9851.9811.9841.9861.9851.983
1.7931.8041.7841.7931.7811.7711.7841.7861.773
RRMSE (%)16.8X17.917.917.817.817.817.817.817.817.8
16.917.016.916.916.916.816.916.916.8

Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error

aScenario MAR 1: response depending only on covariates; Scenarios MAR 2, 3: response depending on exposure and covariates

bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model

Table 4

Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MNAR response mechanism scenarios

ScenarioaγxγyCCAIPPW method
Response models
(X), Z1, Z2, Z3, Z4(X), Z1, Z2, Z3, Z4, Z5, Z6, Z7(X), Z1, Z3(X), Z1, Z2, Z3, Z5, Z7(X), Z1(X), Z5(X), Z1, Z5(X), Z1, Z5, Z7(X), Z5, Z7
MNAR 10.00.2Bias0.00Xb0.000.000.000.000.000.000.000.000.00
0.000.000.000.000.000.000.000.000.00
Variance (10-3)1.676X1.7011.7071.6941.6981.6881.6851.6891.6901.686
1.6971.7031.6891.6951.6821.6771.6841.6851.679
MSE (10-3)1.688X1.7111.7151.7031.7061.7001.6981.7021.7021.697
1.7091.7141.7021.7061.6951.6901.6971.6981.691
RRMSE (%)16.4X16.516.616.516.516.516.516.516.516.5
16.516.616.516.516.516.416.516.516.5
MNAR 20.20.2Bias−0.01Xb−0.01−0.01−0.01−0.01−0.01−0.01−0.01−0.01−0.01
− 0.01− 0.01− 0.01− 0.01− 0.01− 0.01− 0.01− 0.01− 0.01
Variance (10-3)1.668X1.7461.7511.7371.7401.7281.7271.7301.7301.728
1.6951.7031.6861.6941.6781.6711.6811.6821.672
MSE (10-3)1.881X1.9071.8971.8961.8851.9421.9401.9431.9251.920
1.9041.9071.9001.9001.8921.8841.8951.8961.885
RRMSE (%)17.3X17.517.417.417.417.617.617.617.517.5
17.517.517.417.417.417.417.417.417.4
MNAR 30.50.2Bias−0.03Xb−0.03−0.02−0.03−0.02−0.03−0.03−0.03−0.03− 0.03
−0.03− 0.03− 0.03− 0.03−0.03− 0.03−0.03− 0.03−0.03
Variance (10-3)1.756X2.0442.0502.0282.0322.0132.0132.0142.0132.013
1.7931.8061.7801.7941.7721.7601.7771.7781.761
MSE (10-3)2.572X2.7082.6502.6852.6302.8852.8822.8852.8112.806
2.6112.6132.6052.6072.5962.5782.6042.6042.579
RRMSE (%)20.3X20.820.620.720.521.521.521.521.221.2
20.420.420.420.420.420.320.420.420.3

Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error

aScenarios MNAR 1: response depending on outcome and covariates; Scenario MNAR 2, 3: response depending on outcome, exposure, and covariates

bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model

Table 5

Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MNAR response mechanism scenarios

ScenarioaγxγyCCAIPPW method
Response models
(X), Z1, Z2, Z3, Z4(X), Z1, Z2, Z3, Z4, Z5, Z6, Z7(X), Z1, Z3(X), Z1, Z2, Z3, Z5, Z7(X), Z1(X), Z5(X), Z1, Z5(X), Z1, Z5, Z7(X), Z5, Z7
MNAR 40.00.5Bias−0.02Xb− 0.02−0.01− 0.01− 0.01− 0.02− 0.02−0.02− 0.02−0.02
− 0.02−0.02− 0.02− 0.02−0.02− 0.02−0.02− 0.02−0.02
Variance (10-3)1.591X1.6331.6411.6251.6331.6161.6111.6181.6211.614
1.6201.6301.6131.6231.6031.5941.6071.6101.597
MSE (10-3)1.857X1.8591.8491.8491.8401.8851.8781.8871.8661.852
1.8831.8881.8831.8821.8721.8611.8761.8801.864
RRMSE (%)17.2X17.217.217.217.217.417.317.417.317.2
17.417.417.417.417.317.317.317.317.3
MNAR 50.20.5Bias−0.04Xb−0.04−0.03−0.04−0.03−0.04−0.04−0.04− 0.04− 0.04
− 0.04− 0.04− 0.04− 0.04− 0.04− 0.04− 0.04− 0.04−0.04
Variance (10-3)1.621X1.7371.7431.7271.7311.7121.7081.7111.7121.709
1.6561.6671.6481.6591.6361.6241.6411.6431.626
MSE (10-3)3.094X3.0382.9333.0132.9143.2303.2193.2313.1033.081
3.1423.1373.1443.1383.1253.1043.1363.1403.108
RRMSE (%)22.3X22.021.722.021.622.722.722.722.322.2
22.422.422.422.422.422.322.422.422.3
MNAR 60.50.5Bias−0.07Xb−0.06−0.06−0.06−0.06−0.07−0.07−0.07−0.07−0.07
−0.07− 0.07−0.07− 0.07−0.07− 0.07−0.07− 0.07−0.07
Variance (10-3)1.734X2.1362.1502.1182.1292.0822.0792.0782.0822.084
1.7861.8101.7701.7981.7551.7421.7641.7671.744
MSE (10-3)6.211X6.2845.9386.2295.8966.9036.8876.9016.5036.473
6.3506.3606.3416.3686.2956.2516.3366.3466.262
RRMSE (%)31.5X31.730.831.630.733.233.233.232.332.2
31.931.931.931.931.731.631.831.931.7

Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error

aScenarios MNAR 4: response depending on outcome and covariates; Scenario MNAR 5, 6: response depending on outcome, exposure, and covariates

bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model

Table 6

Simulation study results: coverage rate of the normality-based confidence interval for the  regression coefficient for CCA and the IPPW method (18 response models), for nine response mechanism scenarios

ScenarioaγxγyCCAIPPW method
Response models
(X), Z1, Z2, Z3, Z4(X), Z1, Z2, Z3, Z4, Z5, Z6, Z7(X), Z1, Z3(X), Z1, Z2, Z3, Z5, Z7(X), Z1(X), Z5(X), Z1, Z5(X), Z1, Z5, Z7(X), Z5, Z7
MAR 10.00.094.9Xb95.195.094.995.195.095.095.095.095.0
95.195.094.995.094.995.095.094.995.0
MAR 20.20.095.1Xb95.195.195.195.095.295.295.195.195.2
95.095.095.095.095.095.295.095.095.1
MAR 30.50.094.9Xb95.094.995.095.094.894.994.894.994.9
94.894.994.894.994.994.994.994.995.0
MNAR 10.00.295.0Xb95.295.295.195.195.095.095.095.095.1
95.295.295.195.194.995.095.095.095.0
MNAR 20.20.293.8Xb94.194.394.294.393.893.893.794.093.9
93.894.093.993.893.893.893.893.993.8
MNAR 30.50.289.9Xb91.591.991.592.090.290.390.190.690.8
89.990.089.890.089.790.089.689.689.9
MNAR 40.00.593.2Xb93.693.693.793.793.393.393.393.393.4
93.493.393.393.393.293.193.293.293.2
MNAR 50.20.584.3Xb86.287.086.287.084.484.584.485.485.5
84.384.684.184.384.384.384.284.284.3
MNAR 60.50.563.9Xb71.273.471.273.566.566.866.569.069.4
64.264.763.964.363.663.663.763.563.4
Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MAR response mechanism scenarios Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error aScenario MAR 1: response depending only on covariates; Scenarios MAR 2, 3: response depending on exposure and covariates bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MNAR response mechanism scenarios Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error aScenarios MNAR 1: response depending on outcome and covariates; Scenario MNAR 2, 3: response depending on outcome, exposure, and covariates bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model Simulation study results: bias, variance, mean square error and related root mean square error in the  regression coefficient for CCA and the IPPW method (18 response models), for three MNAR response mechanism scenarios Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, MSE Mean square error, RRMSE Relative root mean square error aScenarios MNAR 4: response depending on outcome and covariates; Scenario MNAR 5, 6: response depending on outcome, exposure, and covariates bWhether for the bias or the variance, the first line represents the result obtained when the exposure variable X is included in the response model, whereas the second line represents the result obtained without the exposure variable X in the response model Simulation study results: coverage rate of the normality-based confidence interval for the  regression coefficient for CCA and the IPPW method (18 response models), for nine response mechanism scenarios

Bias in the regression coefficient

We observed no bias with either CCA or the IPPW method for the three MAR scenarios and MNAR scenario 1 ( = 0.0,  = 0.2). A bias occurred with both methods for the five other MNAR scenarios, with a greater amplitude for MNAR scenarios 5 ( = 0.2,  = 0.5) and 6 ( = 0.5,  = 0.5). The bias was globally equivalent between CCA and the IPPW method for these five scenarios. Within the IPPW method, all response models tested showed the same bias pattern across all MAR and MNAR scenarios. For MNAR scenarios 1 to 3, the absolute bias increases as increases. Similarly, the absolute bias increases as increases for MNAR scenarios 4 to 6.

Variance of the regression coefficient

The IPPW method was less efficient than CCA for all scenarios. We observed an increase in variance with increasing correlation between the exposure variable X and the response (illustrated by Figs. 2 and 3). The loss of efficiency of the IPPW method was thus particularly pronounced in MAR scenario 3 and MNAR scenarios 3 and 6 (all three characterized by  = 0.5).
Fig. 2

Monte-Carlo variance obtained with CCA and the IPPW method (response models 1 and 10, see Table 2) for the nine response mechanism scenarios. The variance increased as the correlation between the exposure variable and the response variable increased for both methods. The variance was consistently higher with the IPPW method than with CCA in all scenarios. With the IPPW method, variance inflation was particularly observed when the exposure variable X was put into the response model

Fig. 3

Monte-Carlo variance obtained with CCA and the IPPW method (response models 6 and 15, see Table 2) for the nine response mechanism scenarios. The variance increased as the correlation between the exposure variable and the response variable increased for both methods. The variance was consistently higher with the IPPW method than with CCA in all scenarios. With the IPPW method, variance inflation was particularly observed when the exposure variable X was put in the response model. On the other hand, removal of variable X (covariate Z5 only) resulted in the variance obtained with the IPPW method being very close to that obtained by CCA

Monte-Carlo variance obtained with CCA and the IPPW method (response models 1 and 10, see Table 2) for the nine response mechanism scenarios. The variance increased as the correlation between the exposure variable and the response variable increased for both methods. The variance was consistently higher with the IPPW method than with CCA in all scenarios. With the IPPW method, variance inflation was particularly observed when the exposure variable X was put into the response model Monte-Carlo variance obtained with CCA and the IPPW method (response models 6 and 15, see Table 2) for the nine response mechanism scenarios. The variance increased as the correlation between the exposure variable and the response variable increased for both methods. The variance was consistently higher with the IPPW method than with CCA in all scenarios. With the IPPW method, variance inflation was particularly observed when the exposure variable X was put in the response model. On the other hand, removal of variable X (covariate Z5 only) resulted in the variance obtained with the IPPW method being very close to that obtained by CCA The response models that enabled a reduction in the variance were those in which the exposure variable X was removed (see Figs. 2 and 3). This was particularly observed in scenarios in which the exposure variable X was associated with the response, but it was also observed in MAR scenario 1 and MNAR scenarios 1 and 4 (all three characterized by   0.0). Among the response models without the exposure variable X, the response model that further reduced the variance was that which included only the variable Z5 (confounding variable not associated with the response). Nevertheless, response models including the variables Z5, Z7 (a confounding variable and a prognostic variable, neither associated with the response), and only Z1 (confounding variable associated with the response) also showed good performance in terms of precision. Overall, the gain in precision obtained with all these response models did not enable us to reach the level of precision obtained using CCA.

Mean square error of the regression coefficient

For the MAR scenarios, all the tested estimators are unbiased and there is therefore no difference between the variance and the mean square error (see Table 3). For MNAR scenarios 1 to 3, the mean square error increases with , i.e. when the correlation between the exposure variable and the response increases. This also holds true for MNAR scenarios 4 to 6.

Coverage rates

The coverage rates are well respected for all the MAR scenarios and for MNAR scenario 1. For MNAR scenarios 1 to 3, the coverage decreases as the correlation between the exposure variable and the response increases. This also holds true for MNAR scenarios 4 to 6. The coverage rates are poorly respected for MNAR scenarios 5 and 6.

Illustrative example

As an example, we analyzed the association between pre-pregnancy maternal BMI with the child’s BMI at age 7 in TIMOUN, a prospective mother-child cohort study conducted in the Guadeloupe archipelago (French West Indies) [16].

Study population and data collection

Between November 2004 and December 2007, 1068 pregnant women were enrolled in TIMOUN by obstetricians during their second- or third-trimester prenatal visit at public hospitals or at a local dispensary. At inclusion, women were interviewed by trained midwives to assess their medical history, socioeconomic conditions, and lifestyle. At birth, information concerning maternal diseases during pregnancy, health status of the newborn, and details of the delivery was also collected [17]. In total, 1033 single live births were registered. Several follow-ups were organized within a selected subsample of the children at 3, 7, and 18 months of age [18, 19]. When the children were 7 years of age, all the mothers initially included were invited to participate in a new follow-up which consisted of an interview of the mothers and a medical examination of the children. Among the 1033 mother-child couples initially included, 592 participated in this second wave, representing 57% of the initial sample. Weight was not measured for two children examined at age 7, resulting in a final population of 590 for the association studied (see detailed flow-chart in Additional file 1: e-Appendix 2).

Outcome and exposure

The exposure of interest was the pre-pregnancy maternal BMI (kg/m2). It was calculated from the mothers’ self-reported weight and height before pregnancy at inclusion in the cohort. The outcome of interest was the child’s BMI at 7 years. It was calculated from measurements performed during a medical examination at 7 years.

Covariates

The covariates considered in the analysis were maternal age at birth (continuous), maternal educational level (< 5 years, 5–12 years, > 12 years), maternal place of birth (French West Indies, other Caribbean island, Europe), non-gestational maternal diabetes (yes, no), enrollment site (university hospital, local hospital, antenatal care dispensary), maternal alcohol consumption during pregnancy (yes, no), maternal smoking during pregnancy (yes, no), sex of the child (boy, girl). The proportion of missing data within these covariates did not exceed 3%, except for maternal alcohol consumption during pregnancy (5.6%). For the variables with missing values, a single imputation by the modal value was previously performed. The directed acyclic graph (DAG) on which we based our analyses is presented in Fig. 4. All arrows were placed according to a priori knowledge. In our study, the DAG approach did not identify all the types of covariates Z presented in the simulation study: no variables of type Z3, Z5, or Z6 were present in our example. In this didactic example, we assume a situation equivalent to the MAR 1 scenario in the simulation study (i.e. the response at 7 years depends neither on the exposure nor the outcome, but only on the covariates).
Fig. 4

Directed acyclic graph (DAG) of the known or assumed associations between variables of the illustrative example. For the sake of simplicity and clarity, the arrows representing the associations between the covariates are not drawn. Not all types of variables considered in the simulation study were suitable for this illustrated example. The covariates ‘maternal educational level’ and ‘maternal place of birth’ were considered to be confounding factors in the relationship between pre-pregnancy maternal BMI and child BMI at age 7. The association models were therefore adjusted for these covariates

Directed acyclic graph (DAG) of the known or assumed associations between variables of the illustrative example. For the sake of simplicity and clarity, the arrows representing the associations between the covariates are not drawn. Not all types of variables considered in the simulation study were suitable for this illustrated example. The covariates ‘maternal educational level’ and ‘maternal place of birth’ were considered to be confounding factors in the relationship between pre-pregnancy maternal BMI and child BMI at age 7. The association models were therefore adjusted for these covariates A linear regression model was fitted with an a priori adjustment for maternal education and maternal place of birth (confounding variables). Both CCA and the IPPW method were applied, the latter using several response models. The analyses were performed using R 3.3.2 (R Foundation for Statistical Computing, Vienna, Austria). The standard errors were computed taking into account the weight estimation phase, according to the method described by Metten et al. [20]. The R code to implement the CCA and IPPW analyses and a training dataset are available in Additional file 1: e-Appendix 3.

Results

The β coefficients related to the exposure of interest were very similar between CCA and the IPPW method (Table 7). Within the IPPW results, the most effective response model strategy was the one including only Z1 variables (maternal educational level and maternal place of birth).
Table 7

Adjusted association between pre-pregnancy maternal BMI and child BMI at age 7 (CCA and IPPW method)

βSE
CCA (N = 590)a0.1420.0197
IPPW method (N = 590)a
Response models
  Z1, Z2, Z40.1370.0229
  Z1, Z2, Z4, Z70.1380.0231
  Z1, Z2, Z70.1400.0234
  Z10.1400.0224

Z1: maternal educational level and maternal place of birth

Z2: maternal tobacco smoking during pregnancy, maternal age at birth, and non-gestational maternal diabetes

Z4: enrollment site and maternal alcohol consumption during pregnancy

Z7: sex of the child

Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, β Beta coefficient (regression estimate), SE Standard error

aAdjustment for maternal educational level and maternal place of birth

Adjusted association between pre-pregnancy maternal BMI and child BMI at age 7 (CCA and IPPW method) Z1: maternal educational level and maternal place of birth Z2: maternal tobacco smoking during pregnancy, maternal age at birth, and non-gestational maternal diabetes Z4: enrollment site and maternal alcohol consumption during pregnancy Z7: sex of the child Abbreviations: CCA Complete case analysis, IPPW Inverse probability of participation weighting, β Beta coefficient (regression estimate), SE Standard error aAdjustment for maternal educational level and maternal place of birth

Discussion

Attrition is a major methodological issue in cohort studies. It challenges the validity of association analyses because its occurrence is generally not completely at random. Several authors have proposed the IPPW method to correct for potential selection biases [3, 4]. However, little evaluation of the method has been performed and there is little guidance for researchers who wish to apply it, in particular for the specification of the response model. Our simulation study showed no superiority of the IPPW method over CCA in terms of bias, and it even led to a loss of efficiency. Both were similarly unbiased in the MAR scenarios and similarly biased in most MNAR scenarios. For the MNAR scenarios, the absolute bias increased as the correlation between the exposure and the response increased. As a result, the mean square error is high for these scenarios when  = 0.5. In addition, because the bias is negative, the confidence intervals are shifted to the left and the nominal error rates are poorly respected. These results are consistent with those observed in a study comparing several methods of handling attrition in a simulated cohort of 300 subjects [21]. In this study, the authors concluded that CCA produces results as valid as those obtained with the other compared methods, which included the IPPW method. It is worth noting that the IPPW method consists in reweighting the study population with complete data, meaning that both CCA and IPPW methods are based on the same sub-population. Therefore, a difference in efficiency cannot be attributed to a varying sample size. One explanation for the loss of efficiency observed with the IPPW method lies in the fact that adding covariates in the response model tends to increase the variability of estimated weights. We chose to solely compare the IPPW method to CCA. However, there are also other approaches, including imputation methods, which consist of replacing missing values with plausible ones. Multiple imputation (MI) is an advanced imputation method that has steadily improved and gained popularity in recent years [22, 23]. It consists of imputing the dataset several times by using adapted models that include the collected variables. However, imputation methods are mainly used for missing covariates in situations of partial non-response. Seaman and White emphasized that it may be potentially dangerous to use MI in situations of total non-response [4]. The risk of mis-specifying the imputation model would be high because it requires the imputation of all missing variables of a given survey wave, without auxiliary information at the time of the survey wave. The results of Lewin et al. also showed that MI was no better than CCA in situations of attrition that lead to a missing outcome [24]. This is also consistent with the findings of Kristman et al. [21]. Our study aimed also to assess the impact of the choice of the variables included in the response model on the bias of the exposure regression coefficient and its variance. The various response models tested did not change the bias patterns, which is consistent with what has been observed in the literature on the propensity score method. Indeed, Brookhart et al. showed that the issue of the choice of variables resided essentially in the variance, not in the bias [12]. The strategy of not including variables associated with the exposure in the propensity score, but rather confounding and prognostic variables, improved the precision of the estimates without increasing the bias. In our study, we show that it is preferable not to include the exposure variable in the response model. Otherwise variance inflation would be observed, which is not in line with the proposal of Hernan et al. [3]. Paradoxically, this phenomenon was particularly pronounced in scenarios in which the exposure variable was associated with the response. This can possibly be explained by over-fitting because the exposure variable is present in the association model. Within response models without the exposure variable, the minimalist strategy, consisting of including only the confounding variable unrelated to the response, resulted in the lowest estimated variance. Close response models (in order of best precision: inclusion of both confounding and prognostic variables unrelated to the response; inclusion of the confounding variable related to the response) also performed well in terms of precision. Thus, parsimonious strategies using the same variables as the association model (except the exposure variable) were the most effective. This was also observed in our illustrated example. The strategy to optimize the response model when using the IPW method to limit a selection bias (IPPW) is thus the same as that recommended in situations in which the IPW method is used to limit a confounding bias (propensity scores). The strategy for constructing the response model requires clear identification of the role played by the variables. This can be based on a structural approach using DAGs, as we did in our example in Section 5 [3]. DAGs are causal analysis tools originally designed to assist in the selection of variables in an association model [25]. They make it possible to control for a confusion bias and avoid over-adjustment in the association model. Within the framework of the propensity score method, Austin and Stuart recommended using them to identify sets of variables to be included in the propensity score [26]. Similarly, it can be useful to guide the variable selection in the response model. Indeed, it allows researchers to better visualize the relationships between all the variables involved in the association of interest (exposure, outcome, response, covariates) and thus enables optimization of the specification of the response and association models. DAGs are based on a priori knowledge and thus do not protect against misidentification of the role played by the variables. In surveys, rather than weighting individuals by their individual probability of response, the sample is often partitioned into response homogeneity groups (RHGs), i.e. groups that are homogeneous in terms of response probability. The parameters of interest are estimated in each group and then pooled across the groups to obtain an overall parameter. This strategy, while improving the precision of the estimates, protects against possible misspecification of the response model. The RHG method is quite similar to what is referred to as stratification in the context of the propensity score used to reduce a confounding bias. In the context of attrition, Seaman and White proposed a stratified IPPW method, but the stratification was not based on response probability but rather response patterns to survey waves [4]. Once the strata were defined, a response model was fitted independently in each stratum. We are not aware of any use of stratification on the probability of response in association studies based on cohort data, but this may be a new application of an existing method in other contexts (surveys, and propensity scores).

Strengths

The first strength of our study is that we tested through simulations nine response mechanism scenarios, corresponding to three degrees of correlation between the response variable and our interest variables (exposure, outcome). The parameters chosen were consistent with those observed in the TIMOUN cohort to represent a realistic setting. In addition, we evaluated the impact of several response models on the estimated exposure effect. This has not been previously performed in the literature when the IPW method has been used to reduce a selection bias.

Limitations

This study also had limitations. First, our simulation framework did not consider binary outcomes, although this is a common situation in epidemiology. However, recent literature indicates that CCA is potentially much less prone to give biased estimates of the exposure coefficient in a logistic regression [27], making the linear regression framework more challenging for evaluating the IPPW method. Second, in our simulations the level of attrition was kept constant, at a quite high but realistic level (40%) for cohort studies. The influence of the attrition rate is however well known, with the expected conclusion that bias and variance increase with the percentage of non-respondents [21]. Moreover, the attrition level we chose was realistic for cohort studies and thus quite high (40%). Third, we did not vary the degree of correlation between the covariates Z and the response or our variables of interest (exposure, outcome). However, Lewin et al. showed that a strong correlation between the outcome and a variable of type Z3 (i.e. associated with the outcome and response) could increase the magnitude of the bias [24]. Finally, our study only considered attrition leading to missing outcomes and fully observed exposure and covariates. However, these variables may also be affected by partial non-response in everyday practice. Seaman proposed a mixed approach to address this problem, combining MI and the IPPW method [28]. Although it has already been used by several epidemiological researchers [29-31], such an approach should be further explored, especially for the evaluation of its superiority over a method combining IPPW and single imputation. Finally, we did not address the consequence of using estimated weights (inverse response probability) in the association models. The usual statistical packages and procedures estimate the standard error of the effect of exposure as if weights are a priori known, ignoring the extra-variability due to their estimation. Consequently, the standard error is biased and may mislead the conclusions about the significance of the effect. We proposed an exact estimation of the variance (linearized variance) that should be used when IPW is implemented (as we did in our illustrative example). The details of the calculations of this variance are available in Metten et al. [20].

Conclusion

Our study suggests that using IPPW to handle attrition in cohort studies does not reduce bias and may result in a loss of efficiency. These results therefore raise questions about the contribution of the IPW method to correcting possible selection bias that occurs in situations of attrition that lead to a missing outcome in association analyses. If the method is to be used, we encourage use of only the confounding variables of the association of interest in the response model. Additional file 1: e-Appendix 1. SAS code to implement the simulation study. e-Appendix 2. Detailed flow-chart of the TIMOUN cohort. e-Appendix 3. R Script to implement an IPPW analysis - Illustrative example (demo dataset “dt.csv”).
  27 in total

1.  Causal diagrams for epidemiologic research.

Authors:  S Greenland; J Pearl; J M Robins
Journal:  Epidemiology       Date:  1999-01       Impact factor: 4.822

Review 2.  Review of inverse probability weighting for dealing with missing data.

Authors:  Shaun R Seaman; Ian R White
Journal:  Stat Methods Med Res       Date:  2011-01-10       Impact factor: 3.021

3.  Chlordecone exposure, length of gestation, and risk of preterm birth.

Authors:  Philippe Kadhel; Christine Monfort; Nathalie Costet; Florence Rouget; Jean-Pierre Thomé; Luc Multigner; Sylvaine Cordier
Journal:  Am J Epidemiol       Date:  2014-01-08       Impact factor: 4.897

4.  Gestational weight gain and obesity, adiposity and body size in African-American and Dominican children in the Bronx and Northern Manhattan.

Authors:  Elizabeth M Widen; Robin M Whyatt; Lori A Hoepner; Noel T Mueller; Judyth Ramirez-Carvey; Sharon E Oberfield; Abeer Hassoun; Frederica P Perera; Dympna Gallagher; Andrew G Rundle
Journal:  Matern Child Nutr       Date:  2015-03-05       Impact factor: 3.092

5.  Multiple imputation using chained equations: Issues and guidance for practice.

Authors:  Ian R White; Patrick Royston; Angela M Wood
Journal:  Stat Med       Date:  2010-11-30       Impact factor: 2.373

6.  Exposure to an organochlorine pesticide (chlordecone) and development of 18-month-old infants.

Authors:  Olivier Boucher; Marie-Noëlle Simard; Gina Muckle; Florence Rouget; Philippe Kadhel; Henri Bataille; Véronique Chajès; Renée Dallaire; Christine Monfort; Jean-Pierre Thomé; Luc Multigner; Sylvaine Cordier
Journal:  Neurotoxicology       Date:  2013-01-29       Impact factor: 4.294

7.  A novel application of the Intent to Attend assessment to reduce bias due to missing data in a randomized controlled clinical trial.

Authors:  Dustin J Rabideau; Andrew A Nierenberg; Louisa G Sylvia; Edward S Friedman; Charles L Bowden; Michael E Thase; Terence A Ketter; Michael J Ostacher; Noreen Reilly-Harrington; Dan V Iosifescu; Joseph R Calabrese; Andrew C Leon; David A Schoenfeld
Journal:  Clin Trials       Date:  2014-05-28       Impact factor: 2.486

8.  The association between depressive symptoms from early to late adolescence and later use and harmful use of alcohol.

Authors:  Alexis C Edwards; Carol Joinson; Danielle M Dick; Kenneth S Kendler; John Macleod; Marcus Munafò; Matthew Hickman; Glyn Lewis; Jon Heron
Journal:  Eur Child Adolesc Psychiatry       Date:  2014-08-18       Impact factor: 4.785

9.  Alcohol Use in Adolescence and Later Working Memory: Findings From a Large Population-Based Birth Cohort.

Authors:  Liam Mahedy; Matt Field; Suzanne Gage; Gemma Hammerton; Jon Heron; Matt Hickman; Marcus R Munafò
Journal:  Alcohol Alcohol       Date:  2018-05-01       Impact factor: 2.826

10.  Association of Serum Lipids and Obstructive Lung Disease in Hispanic/Latino Adults of Diverse Backgrounds.

Authors:  Majid Afshar; Donghong Wu; Ramon Durazo-Arvizu; Frank G Aguilar; Ravi Kalhan; Sonia M Davis; Robert Kaplan; Oana L Klein; Eliana P Mende; Maria S Pattany; Martha L Daviglus
Journal:  J Pulm Respir Med       Date:  2017-08-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.