Literature DB >> 29892204

The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study.

Elisa Martín-Merino¹, Amaia Calderón-Larrañaga^2,3, Samuel Hawley⁴, Beatriz Poblador-Plou³, Ana Llorente-García¹, Irene Petersen^5,6, Daniel Prieto-Alhambra^4,7.

Abstract

BACKGROUND: Missing data are often an issue in electronic medical records (EMRs) research. However, there are many ways that people deal with missing data in drug safety studies. AIM: To compare the risk estimates resulting from different strategies for the handling of missing data in the study of venous thromboembolism (VTE) risk associated with antiosteoporotic medications (AOM).
METHODS: New users of AOM (alendronic acid, other bisphosphonates, strontium ranelate, selective estrogen receptor modulators, teriparatide, or denosumab) aged ≥50 years during 1998-2014 were identified in two Spanish (the Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria [BIFAP] and EpiChron cohort) and one UK (Clinical Practice Research Datalink [CPRD]) EMR. Hazard ratios (HRs) according to AOM (with alendronic acid as reference) were calculated adjusting for VTE risk factors, body mass index (that was missing in 61% of patients included in the three databases), and smoking (that was missing in 23% of patients) in the year of AOM therapy initiation. HRs and standard errors obtained using cross-sectional multiple imputation (MI) (reference method) were compared to complete case (CC) analysis - using only patients with complete data - and longitudinal MI - adding to the cross-sectional MI model the body mass index/smoking values as recorded in the year before and after therapy initiation.
RESULTS: Overall, 422/95,057 (0.4%), 19/12,688 (0.1%), and 2,051/161,202 (1.3%) VTE cases/participants were seen in BIFAP, EpiChron, and CPRD, respectively. HRs moved from 100.00% underestimation to 40.31% overestimation in CC compared with cross-sectional MI, while longitudinal MI methods provided similar risk estimates compared with cross-sectional MI. Precision for HR improved in cross-sectional MI versus CC by up to 160.28%, while longitudinal MI improved precision (compared with cross-sectional) only minimally (up to 0.80%).
CONCLUSION: CC may substantially affect relative risk estimation in EMR-based drug safety studies, since missing data are not often completely at random. Little improvement was seen in these data in terms of power with the inclusion of longitudinal MI compared with cross-sectional MI. The strategy for handling missing data in drug safety studies can have a large impact on both risk estimates and precision.

Entities: Chemical

Keywords: complete case analysis; electronic medical records; longitudinal data; missing data; multiple imputation; pharmacoepidemiology

Year: 2018 PMID： 29892204 PMCID： PMC5993167 DOI： 10.2147/CLEP.S154914

Source DB: PubMed Journal: Clin Epidemiol ISSN： 1179-1349 Impact factor: 4.790

Background

Electronic medical record (EMR) databases are a commonly used data source for drug safety research.1 They provide exceptional value for the study of suspected unwanted effects not tested in randomized clinical trials (RCT) and for the high proportion of drug users in the community who are deemed not eligible for participation in RCTs.2 Notwithstanding these benefits, the lack of full records on key variables (ie, missing data) can pose a real challenge to conducting EMR research. Missing data are a well-recognized issue that can compromise statistical power, as “complete case (CC)” analyses will automatically exclude subjects with missing information for any of the variables included in multivariable models. More importantly, if data are missing at random (MAR), but not completely at random, analyses based on CCs may be biased,3 and researchers may not be aware of this. Likewise, including a “missing data indicator” variable in the analysis is a widely used strategy likely to introduce bias.4,5 Such biases can be overcome by using methods such as multiple imputation (MI), which allow individuals with incomplete data to be included in analyses.3 However, MI methods are still seldom used in EMR-based post-authorization safety studies.6 In this study, our aim was to compare the risk estimates observed when applying CC analysis versus MI and give the reader an intuitive understanding and practical overview of the potential issues and limitations of using CC and longitudinal MI by taking the cross-sectional MI as the reference. As a part of a cohort study on the risk of venous thromboembolism (VTE) associated with the use of different antiosteoporotic medications (AOM),7 we explored the issues of missing data and compared different methods using data from three different EMR databases from Spain and the UK. Our secondary aim was to describe a step-by-step approach for setting up and conducting MIs in drug safety studies using EMR data and to list practical challenges faced during the MI process. We followed the guidelines for this process provided by previous authors.3,5,8

Methods

Sources of data

We obtained data from primary care deidentified computerized medical records from two different EMR databases in Spain (the Base de datos para la Investigación Farmacoepide-miológica en Atención Primaria [BIFAP]9 and the EpiChron Cohort10) and one from the UK (Clinical Practice Research Datalink [CPRD]).11 These databases include information on patient age, sex, life-style factors, clinical events, drug prescriptions, specialist referrals, and laboratory test results for millions of primary care patients.

Study population and exposure cohorts

The study population was formed by all patients aged ≥50 years with at least 1 year of available recorded data and a prescription or dispensation of AOM (date for therapy initiation) during each database study period, that is, 2000–2014 (CPRD), 2001–2013 (BIFAP), and 2010–2011 (EpiChron). Patients with a prescription or dispensation of AOM recorded during the year before therapy initiation were considered as prevalent users and thus excluded from the study. Six exposure cohorts were identified according to the first AOM prescribed as follows: 1) alendronic acid (Anatomical Therapeutic Chemical classifications M05BA04 and M05BB03); 2) other oral bisphosphonates (etidronic acid [M05BA01], ibandronic acid [M05BA06], risedronic acid [M05BA07], clodronic acid [M05BA02], and tiludronic acid [M05BA05] – the two latter available only in Spain-); 3) selective estrogen receptor modulators (bazedoxifene [G03XC01] and raloxifene [G03XC02]); 4) strontium ranelate (M05BX03); 5) denosumab (M05BX04); and 6) teriparatide (H05AA02). We considered treatment episodes of prescribing including prescription durations with <180 days gap.

Case ascertainment and follow-up

Participants from exposure cohorts were followed from the day after therapy initiation to the earliest of the following: a record of treated VTE diagnosis (cases), end of first AOM treatment episode (end of supply of the last prescription before a gap of 180 days), switching to an alternative cohort exposure, lost to follow-up, death, or end of study period. A list of Read and International Classification of Primary Care (ICPC) codes for identification of VTE diagnosis recording in CPRD and BIFAP/EpiChron, respectively, is reported elsewhere.7 Read codes were developed by Dr James Read, and are standard clinical terminology in UK primary care practice.

Confounders

Factors associated with VTE risk according to The National Institute for Health and Care Excellence guidelines12 were collected as potential confounders: 1) history of VTE, cancer, peripheral arterial disease, vein insufficiency as recorded at any time before the date of therapy initiation; 2) use of hormone replacement therapy (HRT) during the year before therapy initiation; 3) fractures recorded in the 2 months before therapy initiation (used as a proxy for bed-bound patients in the target population of incident AOM drug users); and 4) body mass index (BMI) in kg/m2 (as a continuous variable) and smoking status, as recorded in the year of therapy initiation (the closest value to January 1 in case of repeated records in that same year). Smoking status was assessed as a binary variable (current smoker yes/no) where nonsmokers included both ex and never smokers. Patients with recorded status of exclusively “never smoker” at the age of ≥40 years were considered as nonsmokers even in the presence of missing data in the year of therapy initiation. This is because it is unlikely that many people aged >40 years will take up smoking later in life.13 Also, the Charlson comorbidity index was calculated14 (CPRD) or an equivalent list of comorbidities included in the latter index (EpiChron/BIFAP). Alcohol abuse (“yes/no” as ICPC symptom/complaint recorded by the primary care practitioner in EpiChron/BIFAP) and use of other AOM (ie, parathyroid hormone, calcitonin, and elcatonin), calcium–vitamin D supplements, systemic corticosteroids, heparins, and oral anticoagulant drugs were also included in the analyses.

Statistical analyses

Cox regression models adjusted for all confounders listed above were applied in each EMR database separately (ie, substantive model) to quantify the risk of VTE (hazard ratios [HRs] and 95% confidence intervals [CIs]) associated with the different AOM compared with alendronic acid. Stata® program by StataCorp LP was used for statistical analysis. Regarding the MI of missing data, we followed a step-by-step strategy based on the guidance provided by Sterne et al,3 White et al,8 and Pedersen et al5 as described below:

1. Reflecting on the nature of missing data

In the proposed scenario, two variables had missing information in the year of therapy initiation: smoking status and BMI. Most software that implements MI assumes that missing data are MAR, meaning, given the observed data, data are missing independent of unobserved data.15 The assumption of the existence of such variables is relatively easy to accept in EMR-based research, as for example, women are often more likely to see their general practitioners on a regular basis16 leading to a higher chance of recording of lifestyle variables such as BMI or smoking in women than in men. We examined the presence of such variables in our data as follows: We fitted multivariable logistic regression models to identify predictors of missingness for each individual variable (one model for BMI missingness and a separate one for smoking missingness) using the whole dataset. We fitted logistic and linear regression models for binary variables (ie, current smoking status) and continuous variables (ie, BMI), respectively, to identify predictors of variable values based on the CCs dataset. Automatic backward stepwise selection methods (with a p-exit<0.1 and p-entry<0.05) were used to identify key variables in all the above models. A number of variables, in addition to those included in the substantive model, were included for each database depending on the availability in the extracted analytical dataset as mentioned above. Table S1 provides a list of variables that were examined as associated with missingness and the values of BMI or smoking data.

2. Specification of the imputation model

According to the MI guidelines,17 the MI model should include all variables planned for the substantive model; in our case, potential confounders, exposure and outcome (including both VTE yes/no and the Nelson–Aalen estimate of the cumulative hazard rate function) as well as auxiliary variables that are associated with both missingness and variable values at the same time.17 The auxiliary variables may increase the likelihood of the MAR.3

3. Multiple imputation models

In this study, we applied MI using chained equations.18 A total of 15 datasets were imputed for each of the MI models. We evaluated different applications of MI where we used data recorded at different times in the databases: A) cross-sectional MI model (CSMI), including predictors detected exclusively using BMI and smoking data recorded in the year of therapy initiation; B) longitudinal MI model (MI−1y), including the variables in A and the value of BMI and smoking as recorded (when available) in the year before therapy initiation year; and C) longitudinal MI (MI±1y), including the predictors in A and B, but also the values of BMI and smoking as recorded (when available) in the year after therapy initiation year. Stata statement: mi impute chained (logit, omit(smoking predictors)) smoking (regress,omit(BMI predictors)) bmi= substantive model and all predictors, add(number of imputations) rseed(random number) mi impute chained (logit, omit(smoking predictors)) smoking smoking−1y smoking+1y (regress,omit(BMI predictors)) bmi bmi−1y bmi+1y = substantive model and all predictors without missing, add(number of imputations) rseed(random number)

4. Running the substantive model in the imputed datasets

The final step was fitting the proposed substantive model above to the imputed datasets. In our case, comparative risk of VTE across AOM status was estimated using Cox regression models. Stata statement: mi stset Time,failure(censoring==1) scale(365.25) mi estimate:stcox i.exposure i.var1 i.var2 i.var3

Comparing different strategies for handling missing data in drug safety research

In addition to the three MI approaches, we also analyzed the data using CC analysis, in which patients without BMI and smoking status were excluded from the analysis. In total, four strategies were compared, with CSMI being the reference standard. Baseline characteristics and incidence rates (IRs) of VTE in patients participating in CC compared with the whole study population were provided as an overall assessment of the potential bias underlying CC analysis. Then, differences in point estimates of each model compared to CSMI were calculated for each exposure cohort as (HRAlternative−HRCSMI)/HRCSMI. Improvement in precision of each model (as a function of standard error [SE] size) relative to CSMI was finally estimated for all exposure cohorts as (SEAlternative−SECSMI)/SECSMI.

Patient involvement

No patient(s) or public representatives have been involved as part of this work.

Ethics and scientific committees’ approval

The study protocol was approved by the UK Independent Scientific Advisory Committee ISAC (REF 14_110R), CEIC Aragón (Number CP20/2014), and BIFAP Scientific Committee (Number 02_2015).

Results

The study populations were made up of 95,057 (BIFAP), 12,688 (EpiChron), and 161,202 (CPRD) new users of AOM aged ≥50 years during 1998–2014. Among them, 422, 19, and 2,051 VTE cases during the first AOM treatment episode were detected, respectively, during a median time of follow-up between 1.04 and 1.71 years, depending on the EMR database. Teriparatide in CPRD as well as denosumab and selective estrogen receptor modulator (SERM) in EpiChron were not assessed due to scarce numbers of exposed patients. No estimates were obtained in CC for denosumab in BIFAP and for teriparatide in EpiChron for the same reason. Table 1 shows the availability of BMI and smoking data within up to 1 year before and after year of therapy initiation. Depending on the database, between 37.4% and 40.3% of new AOM users had BMI records, and between 55.8% and 90.7% had smoking records in the year of therapy initiation. Among those patients with information, mean BMI varied from 28.65 (BIFAP) to 29.11 (EpiChron) kg/m2 and prevalence of current smokers ranged from 4.6% (EpiChron) to 23.1% (BIFAP). Among patients without information in the year of therapy initiation, few had BMI or smoking data recorded the year before (10.8% and 9.6% of total patients in BIFAP, 7.7% and 1.7% in EpiChron, and 14.7% and 3.8% in CPRD, respectively). Even fewer additional patients had BMI and smoking records only the year after therapy initiation year (3.0% and 4.8% in BIFAP, 11.0% and 2.1% in EpiChron, and 8.5% and 1.5% in CPRD, respectively).

Table 1

Distribution of patients with BMI and smoking records by calendar year-blocks up to ±1 year from year of therapy initiation.

	BIFAP, N=95,057				EpiChron Cohort, N=12,688				CPRD, N=161,202
	BMI records		Smoking recordsa		BMI records		Smoking recordsa		BMI records		Smoking recordsa
	N	%	N	%	N	%	N	%	N	%	N	%
Available information 1 year before	36,608	38.50	52,277	55.00	3,974	31.32	8,937	70.44	59,260	36.76	144,602	89.70
Information only 1 year before (data lacking on year of therapy initiation)	10,283	10.80	9,117	9.59	970	7.65	219	1.73	23,740	14.73	6,085	3.77
Information on year of therapy initiation	38,307	40.30	53,017	55.77	4,745	37.40	9,084	71.6	63,283	39.26	146,276	90.74
Available information 1 year after	33,908	35.70	50,139	52.70	4,792	37.77	9,032	71.19	56,514	35.06	143,781	89.19
Information only 1 year after (data lacking on year of therapy initiation and 1 year before)	2,844	2.99	4,538	4.77	1,401	11.04	270	2.13	13,649	8.47	2,410	1.50

Notes:

Smoking values after assumption of considering those patients with recorded status of exclusively never smoker at age ≥40 years as non-smokers. Text and values in bold refer to the year of interest to collect BMI and smoking values (year of therapy initiation) and when missing data were needed to impute.

Abbreviations: BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; BMI, body mass index; CPRD, Clinical Practice Research Datalink.

Table 2 shows the distribution of baseline characteristics in the whole study populations and in patients participating in the CC analysis, as well as the global IR of VTE during the first treatment episode of AOM. Depending on the database, between 31% and 38% of new AOM users, that is 32%–41% of total VTE cases, had complete records in the year of therapy initiation in relation to the substantive model and could, therefore, be included for the CC analysis. CC yielded a nonsignificant higher IR of VTE than for the whole study population in BIFAP and CPRD, but lower in EpiChron. The distribution of AOM cohorts was slightly different in CC versus the whole study populations, that is, SERM was slightly underrepresented in CC in BIFAP and EpiChron, as was strontium in CPRD. Restricting to CC also impacted the distribution of potential confounders, such as gender and age. For example, in BIFAP, 18% of those with complete records were between 50 and 59 years while in those with missing data 32% were between 50 and 59 years.

Table 2

Global IR of VTE during first treatment episode of AOM, and the distribution of baseline characteristics in the whole study population and in patients participating in complete case analysis

	BIFAP				EpiChron Cohort				CPRD
	Complete case participants (N=29,609)		All eligible patients (N=95,057)		Complete case participants (N=4,516)		All eligible patients (N=12,688)		Complete case participants (N=60,923)		All eligible patients (N=161,202)
VTE cases, N	163		422		6		19		848		2,051
IR/100,000 person-years (95% CI)	245.61 (210.66–286.37)		215.96 (196.31–237.58)		113.34 (50.92–252.29)		132.81 (84.71–208.21)		533.55 (498.82–570.70)		485.20 (464.65–506.66)
	N	%	N	%	N	%	N	%	N	%	N	%
Exposure cohort
Alendronic acid	11,634	39.30	36,182	38.10	1,457	32.26	3,852	30.40	48,713	79.96	127,121	78.86
Other bisphosphonates	11,823	39.90	37,594	39.50	1,964	43.49	5,644	44.50	10,487	17.21	29,007	17.99
Strontium ranelate	2,507	8.50	7,978	8.40	558	12.36	1,495	11.80	551	0.90	1,993	1.24
SERM	3,165	10.70	11,723	12.30	331	7.33	1,127	8.90	1,150	1.89	3,045	1.89
Teriparatide	393	1.30	1,287	1.40	205	4.54	569	4.50	4	0.01	7	<0.01
Denosumab	87	0.30	293	0.30	1	0.02	1	0	18	0.03	29	0.02
Females	27,010	91.20	86,187	90.70	4,101	90.81	11,414	89.96	47,624	78.17	129,661	80.43
Age at year of therapy initiation
50–59 years	5,441	18.40	26,160	27.50	701	15.52	2,894	22.81	6,538	10.70	19,579	12.15
60–69 years	8,743	29.50	27,593	29.00	1,417	31.38	3,832	30.2	14,647	24.0	37,499	23.26
70–79 years	11,011	37.20	28,026	29.50	1,736	38.44	3,844	30.3	22,095	36.3	52,897	32.81
≥80 years	4,414	14.90	13,278	14.00	662	14.66	2,118	16.69	17,643	29.0	51,227	31.78

Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; CPRD, Clinical Practice Research Datalink; IR, incidence rates; SERM, selective estrogen receptor modulator; VTE, venous thromboembolism.

Table S1 shows the variables associated with the missingness and/or the values of BMI or smoking data in the three databases, as well as those finally included in the MI model. In BIFAP, predictors of BMI values were the calendar year, alcohol abuse, most of the diseases assessed (including aneurysm, COPD, asthma, dementia, diabetes, heart failure, liver disease, rheumatoid arthritis, peptic ulcer, renal diseases, treated VTE [outcome], a history of VTE, recent fractures, vein insufficiency or phlebitis, peripheral arterial disease, and cancer), the AOM cohort (exposure of interest), and the prescription of other antiosteoporotic drugs, calcium–vitamin D, corticosteroid drugs, heparin, and HRT. Predictors of current smoking were the calendar year, alcohol abuse, COPD, diabetes, rheumatoid arthritis, and prescription of other anti-osteoporotic drugs, calcium–vitamin D, and corticosteroid drugs. Age and sex predicted both BMI and smoking values. In EpiChron, predictors of BMI values were the calendar year, asthma, dementia, diabetes, vein insufficiency or phlebitis, cancer, and the prescription of other antiosteoporotic drugs, calcium–vitamin D, corticosteroids, and heparin. Predictors of smoking values were alcohol abuse, COPD, diabetes, heart failure, and other antiosteoporotic drugs, age, and sex. In CPRD, predictors of BMI and smoking status were the calendar year, the AOM cohort (exposure), a history of VTE, recent fractures, vein insufficiency or phlebitis, the use of calcium–vitamin D, corticosteroids, and anticoagulant drugs. Rheumatoid arthritis, treated VTE (outcome), and prescription of heparins and HRT were also predictors of BMI value, whereas hypnotic drugs predicted smoking status. Figure S1 shows the distribution of the BMI and current smoking values as recorded in the database and as imputed in the 15 datasets for the three databases.

Change in risk estimation

Table 3 reports the HR (and 95%CI) produced for each AOM versus alendronic acid in each method by database. The risk of VTE associated with other oral bisphosphonates, strontium ranelate, teriparatide, and denosumab versus alendronic acid obtained in the CSMI for CPRD and BIFAP was published in a previous article focused on the clinical interpretation of those HRs.7

Table 3

HR (and 95% CI) of VTE associated with each AOM versus alendronic acid according to each method (CC, CSMI, MI−1y, and MI±1y) and database

	Alendronic acid	Other oral bisphosphonates	Strontium ranelate	SERM	Teriparatide	Denosumab
BIFAP
Complete cases, N	11,634	11,823	2,507	3,165	393	87
HR (95% CI)	Ref	0.85 (0.61–1.18)	0.81 (0.40–1.61)	0.80 (0.35–1.80)	0.45 (0.06–3.25)	–
All eligible patients, N	36,182	37,594	7,978	11,723	1,287	293
HR CSMIa (95% CI)	Ref	0.96 (0.78–1.18)	1.19 (0.82–1.74)	0.93 (0.58–1.50)	1.27 (0.59–2.71)	1.77 (0.25–12.66)
HR MI−1y (95% CI)	Ref	0.96 (0.78–1.18)	1.19 (0.82–1.74)	0.93 (0.58–1.50)	1.26 (0.59–2.71)	1.78 (0.25–12.79)
HR MI±1y (95% CI)	Ref	0.96 (0.78–1.18)	1.19 (0.82–1.74)	0.94 (0.58–1.50)	1.27 (0.59–2.71)	1.75 (0.24–12.51)
EpiChron Cohort
Complete cases, N	1,457	1,964	558	331	205	1
HR (95% CI)	Ref	0.31 (0.03–3.52)	–	–	–	–
All eligible patients, N	3,852	5,644	1,495	1,127	569	1
HR CSMI (95% CI)	Ref	0.88 (0.33–2.35)	0.38 (0.05–3.20)	–	0.69 (0.08–5.81)	–
HR MI−1y (95% CI)	Ref	0.87 (0.32–2.33)	0.38 (0.05–3.26)	–	0.65 (0.08–5.60)	–
HR MI±1y (95% CI)	Ref	0.87 (0.32–2.34)	0.39 (0.05–3.27)	–	0.66 (0.08–5.69)	–
CPRD
Complete cases, N	48,713	10,487	551	1,150	4	18
HR (95% CI)	Ref	1.09 (0.92–1.30)	0.60 (0.28–1.26)	1.02 (0.46–2.30)	–	4.84 (0.68–34.49)
All eligible patients, N	127,121	29,007	1,993	3,045	7	29
HR CSMIa (95% CI)	Ref	1.05 (0.94–1.18)	0.90 (0.61–1.34)	0.82 (0.50–1.35)	–	3.47 (0.49–24.65)
HR MI−1y (95% CI)	Ref	1.05 (0.94–1.17)	0.89 (0.60–1.33)	0.82 (0.50–1.35)	–	3.46 (0.49–24.62)
HR MI±1y (95% CI)	Ref	1.05 (0.94–1.18)	0.90 (0.60–1.33)	0.83 (0.51–1.37)	–	3.50 (0.49–24.88)

Notes:

HR (95% CI) of VTE associated with other oral bisphosphonates, strontium ranelate, teriparatide, and denosumab versus alendronic acid obtained in the CSMI for CPRD and BIFAP was published in a previous article focused on the clinical interpretation of those HRs.7 En dashes indicate that no estimates were obtained due to small cohort sizes.

Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; CC, complete case analysis; CPRD, Clinical Practice Research Datalink; CSMI, cross-sectional MI model; HR, hazard ratio; MI, multiple imputation; SERM, selective estrogen receptor modulator; VTE, venous thromboembolism.

Table 4 shows HR differences produced in each method relative to CSMI (prespecified reference group) for each exposure cohort and database. Negative percentages indicate underestimation in HR with the alternative method versus CSMI. Big differences were found when comparing CC versus CSMI, where the HR moved in both directions, from 100.00% underestimation (in BIFAP, EpiChron, and strontium in CPRD) to 40.31% overestimation (for all other AOM in CPRD).

Table 4

HR differences according to each method relative to CSMI for each exposure cohort and database

		HR differences versus CSMI
Database	AOM cohorta	Complete case (%)	MI–1y (%)	MI±1y (%)
CPRD	Other bisphosphonates	3.92	−0.29	−0.14
BIFAP		−11.57	−0.24	−0.07
EpiChron		−64.81	−1.40	−0.98
CPRD	Strontium ranelate	−33.65	−0.84	−0.63
BIFAP		−32.50	−0.23	0.00
EpiChron		−100.00	1.01	2.26
CPRD	SERM	24.58	0.25	1.32
BIFAP	SERM	−14.34	0.03	0.25
BIFAP	Teriparatide	−64.57	−0.05	0.13
EpiChron	Teriparatide	−100.00	−5.14	−3.74
CPRD	Denosumab	40.31	−0.17	0.94
BIFAP	Denosumab	b	1.02	−1.14

Notes: Percentages quantify the higher (positive %) or lower (negative %) HR obtained with respect to CSMI.

Teriparatide in CPRD as well as denosumab and SERM in EpiChron were not assessed due to small cohort sizes.

No estimates were obtained due to small cohort sizes.

Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; CPRD, Clinical Practice Research Datalink; CSMI, cross-sectional MI model; HR, hazard ratio; MI, multiple imputation; SERM, selective estrogen receptor modulator.

Little difference was found when comparing the HR produced in longitudinal MI−1y and MI±1y versus CSMI. Estimates were higher or lower depending on exposure cohorts. Differences ranged from −5.14% to 2.26% and the highest discrepancies were found in the smallest exposure cohorts (all EpiChron exposure cohorts and denosumab in BIFAP).

Statistical power/accuracy

Table 5 shows SE differences for each method relative to CSMI by AOM cohort and database. Positive percentages mean precision improvement with CSMI. Precision improved always in CSMI versus CC (up to 160.28%). Relatively low or no improvement in power (ie, SE size reduction) was observed when using longitudinal MI analysis compared with CSMI (range 0.02%–0.80%).

Table 5

Standard error differences produced in each method relative to CSMI for each AOM cohort and database

		Standard error differences versus CSMI
Database	AOM cohorta	Complete case (%)	MI–1y (%)	MI±1y (%)
CPRD	Other bisphosphonates	54.38	−0.09	−0.03
BIFAP		57.80	0.00	−0.07
EpiChron		147.31	0.52	0.43
CPRD	Strontium ranelate	88.01	−0.09	−0.09
BIFAP		83.25	−0.01	−0.05
EpiChron		86.17	0.40	−0.13
CPRD	SERM	63.12	−0.03	−0.01
BIFAP	SERM	72.13	0.00	−0.05
BIFAP	Teriparatide	160.28	−0.04	−0.07
EpiChron	Teriparatide	b	0.79	0.80
CPRD	Denosumab	0.05	0.02	−0.01
BIFAP	Denosumab	b	0.00	−0.02

Notes: Positive percentages mean improvement with CSMI while negative percentages mean improvement with the alternative method.

Teriparatide in CPRD as well as denosumab and SERM in EpiChron were not assessed due to small cohort sizes.

No standard errors were obtained due to small cohort sizes.

Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; CPRD, Clinical Practice Research Datalink; CSMI, cross-sectional MI model; MI, multiple imputation; SERM, selective estrogen receptor modulator.

Discussion

Our purpose was to compare four methods used for handling BMI and smoking missing data as potential confounders in a clinical use case – a comparative drug safety study of the risk of VTE associated with different AOM. We compared results from CC analysis and MI based only on cross-sectional records (reference method as based on previous simulation studies), or after adding within-patient longitudinal records. The study was performed in three electronic primary care databases from the UK and Spain. Our findings can be summarized as follows. First, we observed that missing BMI and smoking values in the three databases were not completely at random since multiple predictors were associated with the values and missingness in the three databases. For example, in BIFAP, 18% of those with complete records were between 50 and 59 years while in those with missing data 32% were between 50 and 59 years. Age was also associated with the values of BMI and smoking and, consequently, the risk of VTE associated with different AOM obtained in CC analysis is likely to be biased. Indeed, we found substantial differences in the risk estimates based on CC and CSMI. The sample sizes were also reduced in the CC analyses, which led to a significant power loss of up to 160%, compared with base-case CSMI. Second, an extra level of imputation was added to CSMI, which took advantage of the information on BMI and smoking values recorded 1 year apart (either before or after index year) as predictors. In our study, these longitudinal MI models did not improve precision substantially when compared with CSMI-based risk estimates. The intermittent pattern of records completeness in EMR is common. Previous authors using measurements collected at different time points in MI strategies have failed to reveal the best-performing method, suggesting further research to compare different imputation methods.19,20 Some further strategies used the data by time blocks and their temporal ordering.13,21,22 A further simulation study considering correlation over time suggested an optimal use of data available, the gain relative to baseline MI being dependent on the strength of the correlations among variables collected over time.13 A potential explanation for the lack of gain in precision in our study is that the large study cohorts provided enough cross-sectional information for predicting and imputing values to such an extent that records within 1 extra year were not as informative as expected. Also, few patients had values of BMI or smoking records 1 year apart, therefore, adding very little data to that available in the year of therapy initiation. Furthermore, a slight loss of precision was observed when adding longitudinal information to the analysis of study cohorts that were smallest in size and shortest in time contribution. A possible reason for this could be the availability in the time of smoking data, which was used for assuming noncurrent smoking status whenever patients had consistent registries of noncurrent smoking throughout their life. This was previously done by other authors,13 entailing correlation within time blocks. This deterministic assumption was based on the fewer and therefore more uncertain records of smoking status in the EpiChron database compared with the other databases covering longer periods and wider populations. CSMI analysis preserves power using the whole study population while accounting for variance of imputed values (Rubin’s rule23). In the current study, BMI and smoking values reached realistic distributions in the three databases after imputation based on multiple predictors (Figure S1). Under those distributions and based on simulation studies,13 CSMI showed accurate risk estimates after adjusting by imputed BMI and smoking values based on multiple predictors of missing and observed values on the basic principle of replacement.4 We identified some practical challenges during the MI process, which are worth mentioning. First, we observed that the complexity of identification of potential variables to be included in the MI model increases with the increasing number of variables to impute, especially when mutual adjustment for all variables with missing data is preferred. In the current study, two variables with missing data were imputed (ie, BMI and smoking) and mutually adjusted for each other in a way that we kept those value predictors that were significant in individuals who had complete records of both BMI and smoking, that is, predictors of BMI values were selected through a model adjusted by current smoking and, vice versa, predictors of current smoking were selected in a model adjusted by BMI values. However, eight different regression models for prediction examination were possible, where missingness and values of both variables acted as outcomes and potential predictors of each other. Still, an alternative would have been to select variables based on models without mutual adjustment for the other variable with missing data that probably simplifies the process. Some limitations should be mentioned. First, the “MAR” distribution is an assumption that cannot be formally tested, and if not valid, the violation of such assumption would question the validity of CSMI as a “reference” method in our analysis. However, this is the challenge often faced by researchers in the field of drug safety research, where no reference risk estimate is available; in contrast, methodological research using simulation studies13 has demonstrated the validity of CSMI. Finally, CSMI multivariable models suggested that BMI and current smoking were actually weak confounders in the studied association between AOM and VTE. It is, therefore, plausible that the observed change in risk estimation observed in alternative (non-CSMI) methods might differ in other clinical scenarios where missingness affects stronger confounders; indeed, bigger differences among methods are expected when data are missing in strong confounders. In summary, MI requires a preliminary evaluation of missing data and the patterns of observed values and predictors. The complexity and time requirements increase with the number of variables with missing data to impute. If the initial explorative analyses of the missing data suggest that MAR is plausible and information is available within the dataset to account for MAR, then MI is recommended. However, among MI strategies, we observed a relatively small gain in precision when using longitudinal MI versus CSMI. Still, our model was based only on longitudinal data relatively close to the baseline (±1 year), but the simulation study of longitudinal MI by Welch et al13 demonstrated that for some variables with a strong correlation between values over time, the gain in precision can be substantial. In situations of small study cohorts or few cross-sectional predictors, we hypothesize that the inclusion of longitudinal records may be superior to using only cross-sectional records for MI, but further work is needed to evaluate this hypothesis. There may, however, be situations where it is impossible to perform MI as there may be insufficient data available to inform the MI model. In these cases, an option might be to exclude such an incomplete variable from the substantive model rather than exclude patients with missing data. Fortunately, MI is now supported by most common statistical software through fairly simple commands, despite being a relatively sophisticated method. CC may result in biased estimates calculation when data are missing at random, which may be highly frequent in EMR, and our case study illustrates the superior benefits and applicability of CSMI compared with CC in a real-life drug safety study. In some cases, it may, however, be reasonable to use CC analyses, such as when working with large datasets with few missing observations, because the risk of bias is minimal and the precision is still good. Likewise, if the missingness is not associated with the outcome, a CC analysis will provide unbiased estimates.24 In this study, we were able to demonstrate that the missing data were dependent on other observed variables. Hence, under the assumption that we have specified the imputation model correctly, the CSMI will provide unbiased estimates and, therefore, form the basis for the further analysis rather than the CC. Distribution of the BMI and current smoking values as recorded in the year of therapy initiation in the database (0), and as imputed in the 15 datasets (1–15) for the three databases from top to bottom (A) BIFAP, (B) EpiChron and (C) CPRD. Abbreviations: BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; BMI, body mass index; CPRD, Clinical Practice Research Datalink. Association of the studied factors with the missingness and value of BMI and current smoking status in the three databases, as well as those finally used for the multiple imputation of BMI and current smoking (grey columns) Notes: NA since the Charlson comorbidity index was calculated (CPRD), or an equivalent list of comorbidities included in the latter index (EpiChron/BIFAP). Bold text highlights the variables finally included in the MI models. Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; BMI, body mass index; CPRD, Clinical Practice Research Datalink; HIV, human immunodeficiency virus; MI, multiple imputation; NA, not applicable; VTE, venous thromboembolism.

Table S1

Association of the studied factors with the missingness and value of BMI and current smoking status in the three databases, as well as those finally used for the multiple imputation of BMI and current smoking (grey columns)

BIFAP

	BMI missing	BMI value	For BMI MI	Smoking missing	Smoking value	For smoking value MI

	Adjusted by smoking value			Adjusted by BMI value
Number of patients	53017	29609		38307	29608
Year of therapy initiation	+	+	X	+	+	X
Charlson index	NA	NA		NA	NA
Alcohol abuse	−	+	X	−	+	X
Aneurysm	−	+	X	−	−
COPD	−	+	X	−	+	X
Asthma	+	+	X	−	−	X
Cerebrovascular disease	−	−		−	−
Dementia	+	+	X	−	−
Diabetes	+	+	X	+	+	X
Hemiplegia	+	−		−	−
Heart failure	−	+	X	−	−
HIV	−	−		−	−
Liver disease	+	+	X	+	−
Rheumatoid arthritis	−	+	X	−	+	X
Myocardial infarction	−	−		−	−
Peptic ulcer	−	+	X	−	−
Renal disease	+	+	X	−	−
Other anti–osteoporotic medication	−	+	X	−	+	X
Calcium–Vitamin D	−	+	X	−	+	X
Corticosteroids	+	+	X	−	+	X
Hypnotic drugs	NA	NA		NA	NA
Heparin	+	+	X	+	−
Oral anticoagulant drugs	−	−		−	−

Variables included in substantive model

Treated VTE (outcome)	−	+	X	−	−	X
AOM Cohort (exposure)	−	+	X	−	−	X
Sex	+	+	X	+	+	X
Age	+	+	X	+	+	X
History of VTE	−	+	X	−	−	X
Recent fractures	+	+	X	−	−	X
Vein insufficiency or phlebitis	+	+	X	−	−	X
Peripheral arterial disease	−	+	X	−	−	X
Cancer	−	+	X	−	−	X
Hormone replacement therapy (last year)	+	+	X	−	−	X

EpiChron Cohort

Number of patients	9083	4516		4564	8941
Year of therapy initiation	−	+	X	+	−	−
Charlson index	NA	NA		NA	NA
Alcohol abuse	−	−		+	+	X
Aneurysm	+	−		−	−
COPD	+	−		−	+	X
Asthma	−	+	X	−	−
Cerebrovascular disease	−	−		−	−
Dementia	+	+	X	−	−
Diabetes	+	+	X	−	+	X
Hemiplegia	−	−		−	−
Heart failure	−	−		−	+	X
HIV	−	−		−	−
Liver disease	+	−		−	−
Rheumatoid arthritis	−	−		−	−
Myocardial infarction	−	−		−	−
Peptic ulcer	−	−		−	−
Renal disease	−	−		−	−
Other anti–osteoporotic medication	−	+	X	−	+	X
Calcium–Vitamin D	−	+	X	−	−
Corticosteroids	−	+	X	−	−	−
Hypnotic drugs	NA	NA		NA	NA
Heparin	−	+	X	−	−	−
Oral anticoagulant drugs	−	−		−	−

Variables included in substantive model

Treated VTE (outcome)	−	−	X	−	−	X
AOM Cohort (exposure)	−	−	X	−	−	X
Sex	−	−	X	+	+	X
Age	+	+	X	+	+	X
History of VTE	−	−	X	−	−	X
Recent fractures	−	−	X	−	−	X
Vein insufficiency or phlebitis	+	+	X	−	−	X
Peripheral arterial disease	−	−	X	−	−	X
Cancer	+	+	X	−	−	X
Hormone replacement therapy (last year)	+	−	X	−	−	X

CPRD

Number of patients	84,847	49,598		63,283	49,592
Year of therapy initiation	+	+		+	+
Charlson index	+	+	X	+	+	X
Alcohol abuse	NA	NA		NA	NA
Aneurysm	NA	NA		NA	NA
COPD	NA	NA		NA	NA
Asthma	NA	NA		NA	NA
Cerebrovascular disease	NA	NA		NA	NA
Dementia	NA	NA		NA	NA
Diabetes	NA	NA		NA	NA
Hemiplegia	NA	NA		NA	NA
Heart failure	NA	NA		NA	NA
HIV	NA	NA		NA	NA
Liver disease	NA	NA		NA	NA
Rheumatoid arthritis	+	+	X	+	−	X
Myocardial infarction	NA	NA		NA	NA
Peptic ulcer	NA	NA		NA	NA
Renal disease	NA	NA		NA	NA
Other anti–osteoporotic medication	−	−		−	−
Calcium–Vitamin D	−	+	X	+	+	X
Corticosteroids	+	+	X	+	+	X
Hypnotic drugs	+	−	X	−	+	X
Heparin	−	−		+	+
Oral anticoagulant drugs	+	+	X	−	+	X

Variables included in substantive model

Treated VTE (outcome)	+	+	X	−	−	X
AOM Cohort (exposure)	+	+	X	−	+	X
Sex	−	+	X	+	+	X
Age	+	+	X	+	+	X
History of VTE	+	+	X	−	+	X
Recent fractures	+	+	X	−	+	X
Vein insufficiency or phlebitis	+	+	X	−	+	X
Peripheral arterial disease	NA	NA		NA	NA
Cancer	NA	NA		NA	NA
Hormone replacement therapy (last year)	−	+	X	−	−	X

Notes: NA since the Charlson comorbidity index was calculated (CPRD), or an equivalent list of comorbidities included in the latter index (EpiChron/BIFAP). Bold text highlights the variables finally included in the MI models.

Abbreviations: AOM, antiosteoporotic medications; BIFAP, Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria; BMI, body mass index; CPRD, Clinical Practice Research Datalink; HIV, human immunodeficiency virus; MI, multiple imputation; NA, not applicable; VTE, venous thromboembolism.

14 in total

1. Seeking cancer-related information from media and family/friends increases fruit and vegetable consumption among cancer patients.

Authors: Nehama Lewis; Lourdes S Martinez; Derek R Freres; J Sanford Schwartz; Katrina Armstrong; Stacy W Gray; Taressa Fraze; Rebekah H Nagler; Angel Bourgoin; Robert C Hornik
Journal: Health Commun Date: 2011-09-20

Review 2. Review: a gentle introduction to imputation of missing values.

Authors: A Rogier T Donders; Geert J M G van der Heijden; Theo Stijnen; Karel G M Moons
Journal: J Clin Epidemiol Date: 2006-07-11 Impact factor: 6.437

3. Non-steroidal anti-inflammatory drugs and risk of heart failure in four European countries: nested case-control study.

Authors: Andrea Arfè; Lorenza Scotti; Cristina Varas-Lorenzo; Federica Nicotra; Antonella Zambon; Bianca Kollhorst; Tania Schink; Edeltraut Garbe; Ron Herings; Huub Straatman; René Schade; Marco Villa; Silvia Lucchi; Vera Valkhoff; Silvana Romio; Frantz Thiessard; Martijn Schuemie; Antoine Pariente; Miriam Sturkenboom; Giovanni Corrao
Journal: BMJ Date: 2016-09-28

4. Missing value imputation in longitudinal measures of alcohol consumption.

Authors: Ulrike Grittner; Gerhard Gmel; Samuli Ripatti; Kim Bloomfield; Matthias Wicki
Journal: Int J Methods Psychiatr Res Date: 2011-03 Impact factor: 4.035

5. Application of multiple imputation using the two-fold fully conditional specification algorithm in longitudinal clinical data.

Authors: Catherine Welch; Jonathan Bartlett; Irene Petersen
Journal: Stata J Date: 2014-04-01 Impact factor: 2.637

6. Multiple imputation using chained equations: Issues and guidance for practice.

Authors: Ian R White; Patrick Royston; Angela M Wood
Journal: Stat Med Date: 2010-11-30 Impact factor: 2.373

7. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls.

Authors: Jonathan A C Sterne; Ian R White; John B Carlin; Michael Spratt; Patrick Royston; Michael G Kenward; Angela M Wood; James R Carpenter
Journal: BMJ Date: 2009-06-29

8. Do men consult less than women? An analysis of routinely collected UK general practice data.

Authors: Yingying Wang; Kate Hunt; Irwin Nazareth; Nick Freemantle; Irene Petersen
Journal: BMJ Open Date: 2013-08-19 Impact factor: 2.692

9. Adaptation and validation of the Charlson Index for Read/OXMIS coded databases.

Authors: Nada F Khan; Rafael Perera; Stephen Harper; Peter W Rose
Journal: BMC Fam Pract Date: 2010-01-05 Impact factor: 2.497

10. Real-Life and RCT Participants: Alendronate Users Versus FITs' Trial Eligibility Criterion.

Authors: Carlen Reyes; Anton Pottegård; Peter Schwarz; M Kassim Javaid; Tjeerd P Van Staa; Cyrus Cooper; Adolfo Diez-Perez; Bo Abrahamsen; Daniel Prieto-Alhambra
Journal: Calcif Tissue Int Date: 2016-04-20 Impact factor: 4.333

4 in total

1. Artificial Intelligence in Pharmacovigilance: An Introduction to Terms, Concepts, Applications, and Limitations.

Authors: Jeffrey K Aronson
Journal: Drug Saf Date: 2022-05-17 Impact factor: 5.606

2. A multi-step approach to managing missing data in time and patient variant electronic health records.

Authors: Nina Cesare; Lawrence P O Were
Journal: BMC Res Notes Date: 2022-02-17

3. Completeness and representativeness of small area socioeconomic data linked with the UK Clinical Practice Research Datalink (CPRD).

Authors: Preveina Mahadevan; Mia Harley; Stuart Fordyce; Susan Hodgson; Rebecca Ghosh; Puja Myles; Helen Booth; Eleanor Axson
Journal: J Epidemiol Community Health Date: 2022-07-28 Impact factor: 6.286

4. The role of prenatal exposure to antidepressants, anxiolytic, and hypnotics and its underlying illness on the risk of miscarriage using BIFAP database.

Authors: Álvaro Kitchin; Consuelo Huerta; Ana Llorente-García; David Martínez; Paloma Ortega; Lucía Cea-Soriano
Journal: Pharmacoepidemiol Drug Saf Date: 2022-06-18 Impact factor: 2.732

4 in total