| Literature DB >> 31894164 |
Alessandro Gasparini1, Keith R Abrams1, Jessica K Barrett2, Rupert W Major1,3, Michael J Sweeting1,4, Nigel J Brunskill3,5, Michael J Crowther1.
Abstract
Electronic health records are being increasingly used in medical research to answer more relevant and detailed clinical questions; however, they pose new and significant methodological challenges. For instance, observation times are likely correlated with the underlying disease severity: Patients with worse conditions utilise health care more and may have worse biomarker values recorded. Traditional methods for analysing longitudinal data assume independence between observation times and disease severity; yet, with health care data, such assumptions unlikely hold. Through Monte Carlo simulation, we compare different analytical approaches proposed to account for an informative visiting process to assess whether they lead to unbiased results. Furthermore, we formalise a joint model for the observation process and the longitudinal outcome within an extended joint modelling framework. We illustrate our results using data from a pragmatic trial on enhanced care for individuals with chronic kidney disease, and we introduce user-friendly software that can be used to fit the joint model for the observation process and a longitudinal outcome.Entities:
Keywords: Monte Carlo simulation; electronic health records; informative visiting process; inverse intensity of visiting weighting; longitudinal data; mixed‐effects models; recurrent‐events models; selection bias
Year: 2019 PMID: 31894164 PMCID: PMC6919310 DOI: 10.1111/stan.12188
Source DB: PubMed Journal: Stat Neerl ISSN: 0039-0402 Impact factor: 1.190
Figure 1Simplified directed acyclic graph depicting a joint model for a longitudinal outcome and its observation process
Figure 2Bias (a), coverage (b), and mean squared error (c) of the estimated treatment effect α 1. The orange colour identifies scenarios where the summary statistics were significantly different than the target value (0 for bias, 95% for coverage) using Z‐tests based on estimated Monte Carlo standard errors
Summary characteristics of simulated data under each data‐generating mechanism
| Data‐generating mechanism | Sample size | No. of measurements | Gap time |
|---|---|---|---|
| Γ distribution not | 938 (918–957) | 4 (3–6) | 1.31 (0.74–2.17) |
| depending on treatment | |||
| JM ( | 666 (634–705) | 2 (1–4) | 0.91 (0.33–2.12) |
| JM ( | 1,564 (1,475–1,667) | 5 (2–9) | 0.37 (0.13–0.94) |
| JM ( | 4,489 (4,188–4,815) | 13 (6–27) | 0.13 (0.04–0.33) |
| Γ distribution depending | 3,444 (3,296–3,606) | 11 (4–28) | 0.23 (0.12–0.41) |
| on treatment | |||
| Γ distribution depending on Y | 2,564 (2,457–2,670) | 9 (4–20) | 0.31 (0.16–0.60) |
| treatment and previous | |||
| JM ( | 669 (637–707) | 2 (1–4) | 0.90 (0.33–2.11) |
| JM ( | 1,556 (1,461–1,654) | 5 (2–9) | 0.37 (0.13–0.94) |
| JM ( | 4,482 (4,218–4,794) | 13 (6–26) | 0.13 (0.04–0.33) |
| JM ( | 1,842 (1,818–1,867) | 9 (7–10) | 1.00 (1.00–1.00) |
| with regular visits |
Note. Values are median with interquartile interval.
Figure 3Forest plot with estimated coefficients for the longitudinal component, models fit to the application data from the PSP‐CKD study. Each estimated coefficient is included as text placed on the leftmost side of each subplot. PSP‐CKD = Primary–Secondary Care Partnership to Prevent Adverse Outcomes in Chronic Kidney Disease
Figure 4Predicted longitudinal trajectories from the models fit to the application data from the PSP‐CKD study. The solid lines represent estimated trajectories for males, whereas the dashed lines represent trajectories for females. Colours identify the model. PSP‐CKD = Primary–Secondary Care Partnership to Prevent Adverse Outcomes in Chronic Kidney Disease; eGFR = estimated glomerular filtration rate