Kosuke Inoue1,2, Atsushi Goto3, Naoki Kondo4,5, Tomohiro Shinozaki6. 1. Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Floor 2, Science Frontier Laboratory, Yoshida-konoe-cho, Sakyo-ku, Kyoto, 604-8146, Japan. koinoue@ucla.edu. 2. Department of Epidemiology, UCLA Fielding School of Public Health, Los Angeles, CA, USA. koinoue@ucla.edu. 3. Department of Health Data Science, Graduate School of Data Science, Yokohama City University, Yokohama, Kanagawa, Japan. 4. Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Floor 2, Science Frontier Laboratory, Yoshida-konoe-cho, Sakyo-ku, Kyoto, 604-8146, Japan. 5. Institute for Future Initiatives, The University of Tokyo, Tokyo, Japan. 6. Department of Information and Computer Technology, Faculty of Engineering, Tokyo University of Science, Tokyo, Japan.
Abstract
BACKGROUND: It is often challenging to determine which variables need to be included in the g-computation algorithm under the time-varying setting. Conditioning on instrumental variables (IVs) is known to introduce greater bias when there is unmeasured confounding in the point-treatment settings, and this is also true for near-IVs which are weakly associated with the outcome not through the treatment. However, it is unknown whether adjusting for (near-)IVs amplifies bias in the g-computation algorithm estimators for time-varying treatments compared to the estimators ignoring such variables. We thus aimed to compare the magnitude of bias by adjusting for (near-)IVs across their different relationships with treatments in the time-varying settings. METHODS: After showing a case study of the association between the receipt of industry payments and physicians' opioid prescribing rate in the US, we demonstrated Monte Carlo simulation to investigate the extent to which the bias due to unmeasured confounders is amplified by adjusting for (near-)IV across several g-computation algorithms. RESULTS: In our simulation study, adjusting for a perfect IV of time-varying treatments in the g-computation algorithm increased bias due to unmeasured confounding, particularly when the IV had a strong relationship with the treatment. We also found the increase in bias even adjusting for near-IV when such variable had a very weak association with unmeasured confounders between the treatment and the outcome compared to its association with the time-varying treatments. Instead, this bias amplifying feature was not observed (i.e., bias due to unmeasured confounders decreased) by adjusting for near-IV when it had a stronger association with the unmeasured confounders (≥0.1 correlation coefficient in our multivariate normal setting). CONCLUSION: It would be recommended to avoid adjusting for perfect IV in the g-computation algorithm to obtain a less biased estimate of the time-varying treatment effect. On the other hand, it may be recommended to include near-IV in the algorithm unless their association with unmeasured confounders is very weak. These findings would help researchers to consider the magnitude of bias when adjusting for (near-)IVs and select variables in the g-computation algorithm for the time-varying setting when they are aware of the presence of unmeasured confounding.
BACKGROUND: It is often challenging to determine which variables need to be included in the g-computation algorithm under the time-varying setting. Conditioning on instrumental variables (IVs) is known to introduce greater bias when there is unmeasured confounding in the point-treatment settings, and this is also true for near-IVs which are weakly associated with the outcome not through the treatment. However, it is unknown whether adjusting for (near-)IVs amplifies bias in the g-computation algorithm estimators for time-varying treatments compared to the estimators ignoring such variables. We thus aimed to compare the magnitude of bias by adjusting for (near-)IVs across their different relationships with treatments in the time-varying settings. METHODS: After showing a case study of the association between the receipt of industry payments and physicians' opioid prescribing rate in the US, we demonstrated Monte Carlo simulation to investigate the extent to which the bias due to unmeasured confounders is amplified by adjusting for (near-)IV across several g-computation algorithms. RESULTS: In our simulation study, adjusting for a perfect IV of time-varying treatments in the g-computation algorithm increased bias due to unmeasured confounding, particularly when the IV had a strong relationship with the treatment. We also found the increase in bias even adjusting for near-IV when such variable had a very weak association with unmeasured confounders between the treatment and the outcome compared to its association with the time-varying treatments. Instead, this bias amplifying feature was not observed (i.e., bias due to unmeasured confounders decreased) by adjusting for near-IV when it had a stronger association with the unmeasured confounders (≥0.1 correlation coefficient in our multivariate normal setting). CONCLUSION: It would be recommended to avoid adjusting for perfect IV in the g-computation algorithm to obtain a less biased estimate of the time-varying treatment effect. On the other hand, it may be recommended to include near-IV in the algorithm unless their association with unmeasured confounders is very weak. These findings would help researchers to consider the magnitude of bias when adjusting for (near-)IVs and select variables in the g-computation algorithm for the time-varying setting when they are aware of the presence of unmeasured confounding.
In most epidemiologic studies, there is a dilemma between including only true confounders or all possible confounders. It is often believed that including as many covariates as possible would reduce the bias due to unmeasured confounding between the treatment and outcome. However, as suggested in prior literature, this is not always true, and conditioning on variables that are associated with the outcome only through the treatment, called instrumental variables (IVs), can increase the bias of treatment effect estimates in the point-treatment settings under the presence of unmeasured confounding [1-5]. This is also the case for near-IVs, variables that are weakly associated with the outcome not through the treatment, and these covariates are often called ‘bias amplifiers’ [1, 2]. For example, a previous study demonstrated that adjusting for glaucoma diagnosis (by including it in the propensity score model)—a near-IV of statin use vs. glaucoma drugs use—moved the estimated effect of statin (vs. glaucoma drug) on mortality or hip fracture risk away from the expected effect based on the results from randomized controlled trials [6]. Some previous studies have emphasized the practical challenges to determine which variables are confounders or IVs [2, 3], suggesting the need for careful consideration of including strong predictors of the treatment in the model. If the variables are weakly associated with the outcome not through the treatment (i.e., near-IVs), it might be better to present the findings from both models with and without such variables in the point-treatment settings. However, it has been still unclear whether and the extent to which this bias amplifying feature of IVs and near-IVs can be applied to the time-varying treatment settings.G-computation, which is the computational algorithm of g-formula, is one of the methods to estimate the causal effect of time-varying treatments accounting for time-varying confounders that are affected by the treatment [7-9]. In practice, we need the fits of 1) the regression model of the outcome on the time-varying treatments and time-varying covariates and 2) the regression models of the time-varying covariates on previous treatments and covariates, as well as the Monte Carlo integration from these model fits in the g-computation algorithm. Note that, for time-varying treatment settings, there can be IV or near-IV for each time-point which may influence the g-computation estimator in an unpredictable way through distinct regression models in the presence of unmeasured (time-fixed or time-varying) confounders. Hence, it is crucial to quantify the influence of including IV or near-IV in the regression models on the g-computation estimates.The goal of our study is to compare the magnitude of bias due to the adjustment of IV or near-IV across different relationships between (near-)IV and treatments in the time-varying settings. After describing a case study of industry marketing payments and prescriptions of opioid products, we conducted Monte Carlo simulation studies to investigate the extent to which the bias due to unmeasured confounders is amplified by adjusting for IV or near-IV (which could also reduce bias as a proxy of an unmeasured confounder). Our study will guide readers to consider the possible magnitude of bias due to adjustment for potential IVs, and thus help them to select covariates in their g-computation algorithms under the time-varying setting in future epidemiological research.
Methods
A case study of industry payments and prescriptions of opioid products
Opioid overdose is a major public health crisis in the United States [10]. As initial exposure to opioid prescriptions by physicians is known as a risk factor of opioid misuse and dependence [11-13], it is important to understand the upstream determinants of physician prescription of opioids. The financial relationship between physicians and the pharmaceutical industry has received substantial attention as it may affect physicians’ clinical practice [14, 15]. Since the launch of the Open Payments program under the Physician Payment Sunshine Act which requires the pharmaceutical industry to publicly report data on all payments and ownership interests made to licensed physicians and teaching hospitals in 2013 [16], a growing body of literature linking the financial physician-industry relationship with physicians’ prescriptions of opioids has been published [17-21]. All previous studies consistently showed the association between the receipt (or the number of encounters) of industry marketing payments at a single time point and an increased number of opioid prescriptions. This association was observed even among physicians who already received industry payments in the previous year [21], generating a hypothesis that physicians who received the payments at multiple time points would prescribe opioids than physicians who never received the payments. However, the evidence is lacking about the potential impact of industry marketing at multiple time points on physicians’ prescription of opioid products.In addition, while the receipt of industry payments for non-opioids has not been adjusted for in some previous studies, it is strongly correlated with the receipt of opioid-related industry payments [21] and may also be associated with opioid prescriptions through patient-level characteristics that are not available in the Open payments data (i.e., near-IV). For example, physicians who treat patients with severe cancer may seek an educational opportunity through industry marketing of chemotherapy, and they would also have a high chance to prescribe opioids to control pain due to cancer. This possible link poses a question of whether the results change by adjusting for the receipt of industry payments for non-opioids.Therefore, we employed a g-computation algorithm to account for the time-varying confounding due to change in opioid prescription pattern over time that was partially affected by industry marketing (i.e., physicians who received industry marketing might prescribe more opioids which would motivate the pharmaceutical industry to conduct further marketing in the following year). We then compared estimated effects between models with and without the receipt of industry payments for non-opioids, a potential (near-)IV.
Data sources and causal structure
We used data from the Centers for Medicare and Medicaid Services (CMS) Open Payments database 2015, 2016, and 2017 [22] linked with the CMS National Plan & Provider Enumeration System (NPPES) database [23], the CMS Physician Compare database [24], and the CMS Medicare Provider Utilization and Payment Data 2016, 2017, and 2018 [25]. We restricted physicians to those who treated Medicare beneficiaries and had physician-level characteristics, resulting in the final analytical sample of 250,944 physicians. Detailed information on each database can be found in previous literature [20, 21].Throughout this paper, we let T1 and T2 denote the treatment at time 1 and time 2, respectively. We let Y denote the outcome of interest. We let X1 denote the common cause of T1, T2, and Y, let X2 denote the common cause of T2 and Y affected by T1, and let X3 denote the common cause of T1 and T2 (i.e., IV). We let Yt1,t2 denote the potential outcome if the treatment had taken values T1 = t1 and T2 = t2.Causal diagram is shown in Fig. 1. Our treatments of interest (T1 and T2) are the receipt of general payments (all forms of non-research payment including meals, speaker compensation, honoraria, travel and lodging, consulting fees, gifts, and education materials) related to opioids in 2016 (T1) and 2017 (T2). Our outcome of interest (Y) was the opioid prescribing rate (the percentage of the total claims represented by opioid claims) in 2018. Covariates at baseline (X1) included physicians’ sex, years in practice, specialty (30 categories listed in Supplementary Table 1), attended medical school (top-20 U.S. medical schools, U.S. medical schools ranked between 21 and 50, or others based on the U.S. News & World Report research ranking), the average age of beneficiaries, the proportion of male beneficiaries, average hierarchical condition category score of beneficiaries, and opioid prescribing rate in 2016. Time-varying confounder (X2) was the opioid prescribing rate in 2017. We considered the receipt of industry payments for non-opioids in 2016 (X3) as a near-IV that was strongly associated with X1 and X2 but was weakly associated with Y only through X1 and characteristics of patients (e.g., comorbidities, socioeconomic status, etc.) who each physician treated. Because such patient-level characteristics were not available in the Open Payments data while it may influence the physicians’ receipt of industry payments in general, we evaluated whether the results changed between models with and without adjustment of the receipt of industry payments for non-opioids in 2016. More detailed information on treatment, outcome, and covariates can also be found in previous literature [20, 21].
Fig. 1
Causal diagram assumed for example about the effect of industry marketing for opioid products on physicians’ opioid prescribing rate. Patient-level characteristics were not available in the Open Payments data
Causal diagram assumed for example about the effect of industry marketing for opioid products on physicians’ opioid prescribing rate. Patient-level characteristics were not available in the Open Payments data
Statistical analysis
We employed the following steps of the g-computation algorithm to estimate the mean difference in the opioid prescribing rate in 2018 according to the receipt of opioid-related industry marketing payments in 2016 and 2017. Analytical steps in this case study are shown as follows:Fit a linear regression model to predict opioid prescribing rate in 2017 (X2) given the receipt of opioid-related payments in 2016 (T1) and baseline covariates (X1).Fit a linear regression model to predict opioid prescribing rate in 2018 (Y) given the receipt of opioid-related payments in 2016 (T1) and 2017 (T2), baseline covariates (X1), and opioid prescribing rate in 2017 (X2).Randomly assign the treatment status in 2016 (new T1) and 2017 (new T2) based on the proportion in the original dataset (i.e., new T1 and new T2 are marginally independent of all covariates [26]).Predict opioid prescribing rate in 2017 (new X2) using the fitted regression model in step 1, newly assigned treatment status (new T1), and baseline covariates (X1).Predict opioid prescribing rate in 2018 (new Y) using the fitted regression model in step 2, newly assigned treatment status (new T1 and new T2), newly predicted time-varying confounder (new X2), and baseline covariates (X1).Estimate mean difference between distinct counterfactual marginal expectations of Y: E [Yt1, t2] – E [Y0, 0] for (t1, t2) = (1, 0), (0, 1), and (1, 1) where t1 and t2 were 1 when physicians received opioid-related payments in 2016 and 2017, respectively, and were 0 when they did not receive the payments in 2016 and 2017, respectively.Calculate the 95% confidence interval by 200 bootstrapped samples (step 1–6).Then, we compared the results with those when the g-computation algorithm in steps 1 and 2 additionally included the receipt of industry payments for non-opioids (X3).
Monte Carlo simulation study
Causal structure and data-generating process
We conducted two Monte Carlo simulation studies to obtain quantitative results adjusting for IV or near-IV. In these simulation studies, we aimed to evaluate the difference in biases due to unmeasured confounding between models with and without adjustment of perfect IV (scenario A) or near-IV (scenario B) of the time-varying treatments. In both scenarios, we simulated 10,000 datasets of sample size (N) = 500, 10,000, and 200,000, respectively.In scenario A, we assumed that X3As (X3A_1, X3A_2, X3A_3, X3A_4, X3A_5, X3A_6, X3A_7, X3A_8, and X3A_9) are associated with Y only through T1 or T2, and therefore, perfect IVs (Fig. 2A). Each X3As has a different magnitude of association with T1 and T2 as shown in Table 1. We first generated X1 and X3As for subject i (=1, …, N). X1 was drawn from independent Bernoulli distributions with parameter 0.5. X3As were drawn from independent standard normal distributions; N (0, 1). Under the guidance of causal structure in Fig. 2A, we generated the status of T1, X2, T2, and Y for subject i (=1, …, N) using the following equations and values of each parameter in Table 1:
Fig. 2
Causal diagrams for simulation studies. T1: treatment at time 1; T2: treatment at time 2; X1: Common cause of treatment at time 1, treatment at time 2, and outcome; X2: Time-varying confounder (i.e., confounder between treatment at time 2 and outcome affected by treatment at time 1); X3: Common cause of treatment at time 1 and treatment at time 2 (X3As, IV in scenario A; and X3B, near-IV in scenario B)
Table 1
Assigned values of parameters in the data-generating process of scenario A
Association with Treatment at time 2 (T2), γ3A_s
log (2.0) i.e., OR = 2.0
log (5.0) i.e., OR = 5.0
log (10.0) i.e., OR = 10.0
Association with Treatment at time 1 (T1),α3A_s
log (2.0) i.e., OR = 2.0
X3A_1
X3A_4
X3A_7
log (5.0) i.e., OR = 5.0
X3A_2
X3A_5
X3A_8
log (10.0) i.e., OR = 10.0
X3A_3
X3A_6
X3A_9
OR, odds ratio
Assigned values of parameters in the data-generating process of scenario AOR, odds ratioCausal diagrams for simulation studies. T1: treatment at time 1; T2: treatment at time 2; X1: Common cause of treatment at time 1, treatment at time 2, and outcome; X2: Time-varying confounder (i.e., confounder between treatment at time 2 and outcome affected by treatment at time 1); X3: Common cause of treatment at time 1 and treatment at time 2 (X3As, IV in scenario A; and X3B, near-IV in scenario B)In scenario B, we assumed that X3B is associated with X1 in addition to T1 and T2 (Fig. 2B). In this causal structure, as X3B is associated with Y through X1, X3B is not a perfect instrument (i.e., near-IV or proxy confounder). X1 was drawn from independent Bernoulli distributions with parameter 0.5. Under the guidance of causal structure in Fig. 2B, we generated the status of T1, X2, T2, and Y for subject (i=1,…,N) using the following equations. To detect the possible impact of adjusting for near-IV, we assumed that X3B is strongly associated with T1 and T2 (OR = 10.0) and varied its association with X1 as follows; β1 = 0.01, 0.05, 0.1, 0.2, and 0.3 and 0.3.Using the g-computation algorithm, we estimated mean difference between distinct counterfactual marginal expectations of Y:for (t1, t2) = (1, 0), (0, 1), and (1, 1). We first fit two logistic regression models; 1) a model to predict X2 given T1 and an IV (one of X3As in scenario A) or a near-IV (X3B in scenario B) and 2) a model to predict Y given T1, T2, X2, and the same covariate used in the first regression model. Next, we used the regression coefficients obtained from these models to predict the values of the potential X2 and subsequently of the potential Y under a hypothetical intervention on T1 and T2.True values of marginal expectations were approximately obtained in large (N = 10,000,000) sample generated with (Y1, 1, Y1, 0, Y0, 1, Y0, 0). We evaluated the biases of g-computation estimates across the 10,000 datasets. Then, we compared them with biases obtained in the model without adjusting for IV (in scenario A) or near-IV (in scenario B) under the presence of unmeasured confounding.
Results
Among 250,944 physicians included in this study, 10,826 (4.3%) physicians received opioid-related industry payments in 2016 only, 5773 (2.3%) physicians received opioid-related industry payments in 2017 only, and 12,558 (5.0%) physicians received opioid-related industry payments in both 2016 and 2017. Physicians’ demographic characteristics are shown in Supplementary Table 1. In Model 1 (without adjusting for the receipt of industry payments for non-opioids, X3), physicians who received industry payments for opioids in either 2016 or 2017 had a higher opioid prescribing rate in 2018 than physicians who did not receive the payments in 2016 and 2017 (physicians who received payments only in 2016, + 11.32% [95% CI, 9.87 to 12.78]; those who received payments only in 2017, + 7.43% [95% CI, 5.51 to 9.35]; and those who received payments in both 2016 and 2017, + 15.96% [95% CI, 15.24 to 16.69]; Table 2). We also found the association between the receipt of industry payments and the increased opioid prescribing rate in Model 2 (with adjusting for X3), but the estimated effect was smaller than those in Model 1 (physicians who received payments only in 2016, + 7.21% [95% CI, 3.95 to 10.47]; those who received payments only in 2017, + 3.63% [95% CI, 1.54 to 5.71]; and those who received payments in both 2016 and 2017, + 13.47% [95% CI, 12.20 to 14.73]).
Table 2
Estimated effect of industry marketing for opioid products on physicians’ opioid prescribing rate using g-computation model adjusting for time-varying confounders
Receipt of industry payments for opioids in 2016
Receipt of industry payments for opioids in 2017
Number of physicians
Adjusted mean difference (95% CI) in opioid prescribing rate in 2018
Model 1
Model 2
No
No
221,787
Ref
Ref
Yes
No
10,826
+ 11.32% (9.87 to 12.78)
+ 7.21% (3.95 to 10.47)
No
Yes
5773
7.43% (5.51 to 9.35)
+ 3.63% (1.54 to 5.71)
Yes
Yes
12,558
+ 15.96% (15.24 to 16.69)
+ 13.47% (12.20 to 14.73)
Model 1 includes physician characteristics (years in practice, sex, specialty, the medical school graduated), patients’ characteristics (average age of beneficiaries, proportion of male beneficiaries, average hierarchical condition category score of beneficiaries), and opioid prescribing rate in 2016. Model 2 included receipt of industry marketing for non-opioids in 2016 in addition to covariates in Model 1. Both models adjusted for time-varying confounder (i.e., opioid prescribing rate in 2017) using the g-computation algorithm. The 95% CIs were estimated by repeating the analyses on 200 bootstrapped samples
Estimated effect of industry marketing for opioid products on physicians’ opioid prescribing rate using g-computation model adjusting for time-varying confoundersModel 1 includes physician characteristics (years in practice, sex, specialty, the medical school graduated), patients’ characteristics (average age of beneficiaries, proportion of male beneficiaries, average hierarchical condition category score of beneficiaries), and opioid prescribing rate in 2016. Model 2 included receipt of industry marketing for non-opioids in 2016 in addition to covariates in Model 1. Both models adjusted for time-varying confounder (i.e., opioid prescribing rate in 2017) using the g-computation algorithm. The 95% CIs were estimated by repeating the analyses on 200 bootstrapped samplesIn scenario A, the approximate true mean difference between distinct counterfactual marginal expectation of Y under (t1, t2) = (1, 0), (0, 1), and (1, 1) were 8.4, 8.0, and 6.4 percentage point. Respectively. When we did not include IV in the model, the biases due to the presence of unmeasured confounding were 5.5 for E [Y1, 0] – E [Y0, 0], 4.3 for E [Y0, 1] – E [Y0, 0], and − 4.0 for E [Y1, 1] – E [Y0, 0]. The biases generally increased (i.e., away from the true value) when additionally adjusting for IV (Fig. 3, Supplementary Table 2). For example, we found the largest bias for E [Y1, 0] – E [Y0, 0] when the g-computation algorithms included IV which was strongly associated with T1. Likewise, we found the largest bias for E [Y0, 1] – E [Y0, 0] when the g-computation algorithms included IV which was strongly associated with T2. For E [Y1, 1] – E [Y0, 0], we found the largest bias when the g-computation algorithms included IV which was strongly associated with both T1 and T2. These trends were less clear when the sample size was 500 compared to when the sample size was 10,000 or 200,000. We did not find a clear trend in standard error in the models adjusting for IV in the g-computation algorithm (Supplementary Table 2).
Fig. 3
Scenario A: Increase in biases of treatment effects in the model adjusting for IV (X3As) compared to those in the model without adjusting for IV under the presence of unmeasured confounder (X1). Bias of treatment effects was calculated by subtracting true values of marginal expectations obtained in a large (N = 10,000,000) sample from g-computation estimates across the 10,000 datasets in each situation. Each panel shows the magnitude of increase in biases when the g-computation algorithm additionally included IV (X3As) which is associated with T1 and T2 (odds ratio = 2.0, 5.0, or 10.0) under the presence of an unmeasured confounder (X1). Biases of [Y1, 1] – E [Y0, 0] (the right column) were multiplied by − 1 to provide intuitive information on the gap from the true estimates because the true estimates of [Y1, 1] – E [Y0, 0] were negative
Scenario A: Increase in biases of treatment effects in the model adjusting for IV (X3As) compared to those in the model without adjusting for IV under the presence of unmeasured confounder (X1). Bias of treatment effects was calculated by subtracting true values of marginal expectations obtained in a large (N = 10,000,000) sample from g-computation estimates across the 10,000 datasets in each situation. Each panel shows the magnitude of increase in biases when the g-computation algorithm additionally included IV (X3As) which is associated with T1 and T2 (odds ratio = 2.0, 5.0, or 10.0) under the presence of an unmeasured confounder (X1). Biases of [Y1, 1] – E [Y0, 0] (the right column) were multiplied by − 1 to provide intuitive information on the gap from the true estimates because the true estimates of [Y1, 1] – E [Y0, 0] were negativeIn scenario B, we found the bias amplification for all estimates by adjusting for near-IV (X3B) in the g-computation algorithm when X3B had a very weak association with unmeasured confounder X1 (β = 0.01 or 0.05) compared with its association with T1 and T2 (OR = 10) (Fig. 4, Supplementary Table 3). However, when the association between X3B and X1 was larger (β ≥ 0.1), we rather found the bias reduction by adjusting for X3B (as a proxy of an unmeasured confounder X1). This trend was observed in all estimates; E [Y1, 0] – E [Y0, 0], E [Y0, 1] – E [Y0, 0], and E [Y1, 1] – E [Y0, 0].
Fig. 4
Scenario B: Comparison of bias of treatment effects between models with and without adjusting for near-IV (X3B) varying its relationship with unmeasured confounder (X1). Beta (β in X ~ N (βX, 1)) ranged from 0.01 to 0.3. Y-axis shows bias which was calculated by subtracting true values of marginal expectations obtained in a large (N = 10,000,000) sample from g-computation estimates across the 10,000 datasets in each situation. Biases of [Y1, 1] – E [Y0, 0] were multiplied by − 1 to provide intuitive information on the gap from the true estimates because the true estimates of [Y1, 1] – E [Y0, 0] were negative
Scenario B: Comparison of bias of treatment effects between models with and without adjusting for near-IV (X3B) varying its relationship with unmeasured confounder (X1). Beta (β in X ~ N (βX, 1)) ranged from 0.01 to 0.3. Y-axis shows bias which was calculated by subtracting true values of marginal expectations obtained in a large (N = 10,000,000) sample from g-computation estimates across the 10,000 datasets in each situation. Biases of [Y1, 1] – E [Y0, 0] were multiplied by − 1 to provide intuitive information on the gap from the true estimates because the true estimates of [Y1, 1] – E [Y0, 0] were negative
Discussion
Our simulation study showed that adjusting for IV of time-varying treatments in the g-computation algorithm increased bias due to unmeasured confounding, particularly when the IV has a strong relationship with the treatment of interest (i.e., T1 for E [Y1, 0] – E [Y0, 0], T2 for E [Y0, 1] – E [Y0, 0], and both T1 and T2 for E [Y1, 1] – E [Y0, 0]). Of note, we found the increase in bias even adjusting for near-IV when such variable had a very weak association with unmeasured confounders between the treatment and the outcome. Instead, the bias decreased after adjusting for near-IV when it had a stronger association with the unmeasured confounders (β ≥ 0.1, or having ≥0.1 correlation coefficient in our multivariate normal setting). Our findings were not stable with a small sample size (N = 500), indicating the importance of additional consideration for small sample bias.Taken together, if there is a variable considered as a perfect IV for the time-varying exposure, we recommend not to include it in the model to obtain a less biased estimate. However, it is challenging to decide if the variable is a perfect IV or near-IV. Given that exposure is a collider between the candidate variable for IV and unmeasured exposure-outcome confounders, investigating the relationship between the candidate variable and the outcome conditional on the exposure would not overcome this challenging issue because a perfect IV can also be associated with the outcome through opened collider path due to conditioning on the exposure. Therefore, we need subject matter knowledge to determine whether the variable is a perfect IV or near-IV. If the candidate variable is likely to be near-IV for time-varying exposures, we would recommend including the variable in the g-computation algorithm because possible residual confounding would largely outweigh bias amplification due to including such a variable in the model, unless it has a very strong association with the exposure (e.g., OR > 10 with a unit standard-deviation increase) and a very weak association with the outcome (e.g., a difference of 5% standard-deviation between the presence and the absence of an unmeasured confounder). This net bias reduction is expected to be larger when the relationship between the near-IV and the unmeasured confounders gets stronger. Conducting analysis both with and without adjusting for the variable would also be an option to transparently convey the results [27, 28]. Because our simulation results were obtained through a specific data-generating process based on a case study, future studies are needed to validate our findings in other time-varying settings (e.g., different parameter values, different scales, continuous treatments, etc.).In our case study, consistent with prior findings [17-21], we found that opioid-related industry marketing payments to physicians were associated with a higher rate of prescribing opioids in clinical practice. Moreover, using g-computation algorithms, we found that the receipt of opioid-related industry payments in 2016 was associated with opioid prescribing rate in 2018 regardless of whether they received opioid-related industry payments in 2017 or not. Although there are multiple factors that may influence opioid prescriptions such as insurance coverage, State laws, and the advent of abuse-deterrent opioid analgesics, our findings would extend the ongoing discussion about how industry marketing affects clinical practice to the time-varying setting (i.e., the possible ‘legacy effect’ of the industry payments on clinical practice) for not only opioids [17-21] but also other drugs such as cardiovascular drugs [29, 30] and insulin [31]. Our case study was based on data of licensed physicians in the US and did not include other professionals such as nurse practitioners who might have prescribed opioids during the study period. In addition, because we focused on the influence of industry marketing on overall opioid prescribing rates among physicians, whether the relationship varies across health practitioners and opioid types should be the subject of future research.We also found that the estimated effects were generally larger in Model 1 (model without adjustment for the receipt of industry payments for non-opioids) than Model 2 (model with adjustment for the receipt of industry payments for non-opioids). If we assume there is no relationship between unmeasured confounders (e.g., patient-level characteristics, State laws, the development of abuse-deterrent opioid analgesics over the study period, etc.) and the receipt of industry payments for non-opioids, Model 1 would be preferred to Model 2. However, given that the association between the receipt of industry payments for non-opioids and the above-mentioned unmeasured confounders may not be very weak in reality, we assume that the estimated effects in Model 2 would be less biased estimates than those in Model 1 based on our simulation study.
Conclusions
In summary, we found that adjusting for a perfect IV may amplify the bias even under the setting of time-varying treatments. This was also the case for near-IVs only when the magnitude of its association with unmeasured confounders is much weaker than that with the time-varying treatments. These findings would help researchers to consider the magnitude of bias when adjusting for (near-)IVs and select variables in the g-computation algorithm for the time-varying setting when they are aware of the presence of unmeasured confounding.Additional file 1.
Authors: Jessica A Myers; Jeremy A Rassen; Joshua J Gagne; Krista F Huybrechts; Sebastian Schneeweiss; Kenneth J Rothman; Marshall M Joffe; Robert J Glynn Journal: Am J Epidemiol Date: 2011-10-24 Impact factor: 4.897
Authors: Scott E Hadland; Magdalena Cerdá; Yu Li; Maxwell S Krieger; Brandon D L Marshall Journal: JAMA Intern Med Date: 2018-06-01 Impact factor: 21.873
Authors: Colette DeJong; Thomas Aguilar; Chien-Wen Tseng; Grace A Lin; W John Boscardin; R Adams Dudley Journal: JAMA Intern Med Date: 2016-08-01 Impact factor: 21.873
Authors: Geoffrey K Spurling; Peter R Mansfield; Brett D Montgomery; Joel Lexchin; Jenny Doust; Noordin Othman; Agnes I Vitry Journal: PLoS Med Date: 2010-10-19 Impact factor: 11.069