Literature DB >> 35582814

Modeling the underlying biological processes in Alzheimer's disease using a multivariate competing risk joint model.

Floor M van Oudenhoven^1,2,3, Sophie H N Swinkels³, Tobias Hartmann^4,5, Dimitris Rizopoulos^1,2.

Abstract

Many clinical trials repeatedly measure several longitudinal outcomes on patients. Patient follow-up can discontinue due to an outcome-dependent event, such as clinical diagnosis, death, or dropout. Joint modeling is a popular choice for the analysis of this type of data. Using example data from a prodromal Alzheimer's disease trial, we propose a new type of multivariate joint model in which longitudinal brain imaging outcomes and memory impairment ratings are allowed to be associated both with time to open-label medication and dropout, and where the brain imaging outcomes may also directly affect the memory impairment ratings. Existing joint models for multivariate longitudinal outcomes account for the correlation between the longitudinal outcomes through the random effects, often by assuming a multivariate normal distribution. However, for these models, it is difficult to interpret how the longitudinal outcomes affect each other. We model the dependence between the longitudinal outcomes differently so that a first longitudinal outcome affects a second one. Specifically, for each longitudinal outcome, we use a linear mixed-effects model to estimate its trajectory, where, for the second longitudinal outcome, we include the linear predictor of the first outcome as a time-varying covariate. This facilitates an easy and direct interpretation of the association between the longitudinal outcomes and provides a framework for latent mediation analysis to understand the underlying biological processes. For the trial considered here, we found that part of the intervention effect is mediated through hippocampal brain atrophy. The proposed joint models are fitted using a Bayesian framework via MCMC simulation.

Entities: Chemical

Keywords: competing risks; joint model; latent mediation analysis; serial multiple mediator model

Mesh：

Year: 2022 PMID： 35582814 PMCID： PMC9545329 DOI： 10.1002/sim.9425

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.497

INTRODUCTION

In randomized clinical trials, the focus is often on whether an intervention has a beneficial effect on the outcome of interest rather than the mechanism through which the intervention exerts its effects. However, identifying the mechanism can strengthen the validity of findings and can generate ideas to improve the effectiveness. This is particularly useful for interventions in complex diseases. Alzheimer's disease (AD) is such a complex disease as it is a slowly progressing neurodegenerative disease affecting several domains, including memory, cognition, executive function, and brain atrophy. Consequently, AD trials typically collect several longitudinal outcomes repeatedly measured for months or even years. An example for our work is a long‐term AD trial, namely the LipiDiDiet trial. , The LipiDiDiet trial aimed to investigate the effects of medical nutrition (Souvenaid) on cognition and related measures in prodromal AD individuals. Various outcomes, including scores on cognitive performance, memory, and brain atrophy measures, were collected over time. During the trial, subjects that progressed to dementia were allowed to remain in the trial while using open‐label medication, defined as active study product and/or AD medication. However, as the trial was designed to investigate the effects of the intervention on drug‐naïve individuals with prodromal AD, data collected after subjects started open‐label medication were prespecified to be excluded from the efficacy analysis. In addition to the exclusion of data after the start of open‐label medication, some subjects dropped out during the trial. This article focuses on two longitudinal outcomes: memory domain measured based on a neuropsychological test battery (NTB memory domain) and hippocampal brain atrophy. The hippocampal brain atrophy outcome is expected to affect the memory impairment ratings, as the hippocampus's importance in memory functioning has been demonstrated. For verbal memory tasks, stronger associations in AD patients have been reported for the volume of the left hippocampus than for the volume of the right hippocampus. , Therefore, in this article, we concentrate on the volume of the left hippocampus. The memory impairment ratings, in turn, are expected to affect the time to open‐label medication. Importantly, in this research, we use the time to open‐label medication as a proxy for disease progression. Also, hippocampal brain atrophy might affect the time to open‐label medication, and the same types of relationships might exist for the time to dropout. Furthermore, the intervention is expected to affect both hippocampal brain atrophy and memory. The latter effect might be through hippocampal volume or a different pathway. Figure 1 provides a schematic representation of the potential underlying biological processes in the data. As can be seen, several paths link the intervention to time to open‐label medication and dropout. Additionally, the intervention effect can be partitioned into direct and indirect paths of influence. Our goal is to estimate these different paths to better understand the underlying biological mechanisms, and in particular, to understand how the intervention affects the time to open‐label medication.

FIGURE 1

A conceptual diagram of the potential underlying biological processes in the LipiDiDiet data

A conceptual diagram of the potential underlying biological processes in the LipiDiDiet data Both the use of open‐label medication and dropout resulted in missing data for the longitudinal outcomes. We can expect different (missing) longitudinal trajectories for subjects whose data collected after the start of open‐label medication were censored than for subjects who dropped out of the trial. Both these events, but particularly the use of open‐label medication following progression to dementia, could be related to disease progression. Therefore, the missing data might be nonrandom (i.e., the probability of dropout depends on unobserved longitudinal responses). When analyzing this type of longitudinal data, we rely on methods that jointly estimate the longitudinal outcome and the dropout process, such as the joint model for longitudinal and time‐to‐event data. , , Using the joint modeling framework, it is also possible to deal with the competing events of open‐label medication use and dropout. , , Joint models for two or more longitudinal outcomes have been considered before. For example, Brown et al proposed a multivariate joint model with a B‐spline formulation for the longitudinal part, and Rizopoulos and Ghosh proposed a semiparametric multivariate joint model. In these existing joint models for multiple longitudinal outcomes, longitudinal outcomes are assumed to be correlated but not to causally influence one another. Often, in these models, the interdependencies between the longitudinal outcomes are captured by the random effects, typically by assuming a multivariate normal distribution; for other examples, see Chi and Ibrahim and Andrinopoulou et al. However, these models are not suitable when in the hypothesized working mechanism one longitudinal outcome is assumed to influence another. We propose a new type of multivariate competing risk joint model in which, for each of the longitudinal outcomes, we use a linear mixed‐effects model to estimate its evolution over time, where for the second longitudinal outcome, we include the predictor of the first outcome as a time‐varying covariate. The proposed model provides a framework for latent mediation analysis, which we use to estimate the different direct and indirect effects to understand the underlying biological processes. Our approach is different from traditional mediation analysis as we use the predicted values (ie, latent processes) of the longitudinal outcome instead of the observed outcomes. This article is organized as follows: Section 2 presents the proposed multivariate competing risk joint model. Section 3 describes how the proposed model can be used to investigate mediation. Section 4 shows the results for the proposed model on the LipiDiDiet data. Finally, in Section 5, we evaluate the proposed model's performance.

MODEL FORMULATION

For subject (), let and denote the longitudinal values at time point for respectively the first and second longitudinal outcome. These values are observed at specific time points that can differ for each subject and each longitudinal outcome. We use mixed‐effects models to describe the trajectories of the longitudinal outcomes over time. In particular, to model the first longitudinal outcome, we postulate where denotes the vector for the fixed regression coefficients, denotes the vector of random effects, and are design vectors for the fixed and random effects, respectively, and denotes the measurement error term for which we assume . Further, the random effects are assumed normally distributed with mean zero and covariance matrix , being independent of . The linear predictor denotes the true but unobserved trajectory, that is, the latent process, of the first longitudinal outcome over time. We postulate that affects , that is, the latent process of the second longitudinal outcome, and we assume that the intervention and other covariates affect both In this way, using the parameter , we can directly interpret the association between the two longitudinal outcomes as it denotes the change in the predicted value of the second longitudinal outcome for a one unit increase in the predicted value of the first longitudinal outcome, holding the other factors constant. Further, , , , and denote, respectively, the fixed regression coefficients, the random effects, and the design vectors, similar as before but now for the second longitudinal outcome. The random effects are assumed to be normally distributed, with mean zero and covariance matrix . Note that the random effects and are not assumed to be correlated. For subject , let denote the true event time for event , and let denote the censoring time. Moreover, let be the observed survival time and the event indicator, where in case of the competing events and for censoring. The hazard function for subject for competing event is modeled as follows where denotes the baseline hazard for the th event, which we approximate in a flexible manner, using B‐splines . Further, is the parameter vector for the th survival outcome, and is a design vector of baseline covariates with a corresponding vector of regression coefficients . The latent processes of the first and second longitudinal outcome are allowed to be associated with both competing events, quantified by the association parameters and . The parameters are estimated under the Bayesian framework, using Markov chain Monte Carlo (MCMC) methods and using the JAGS software.

MEDIATION

Different intervention effects

The proposed joint model can be used to understand the process(es) of how the intervention affects the competing risks. A schematic representation of the proposed model is presented in Figure 2, based on which we can distinguish different paths that link the intervention to the competing events, either passing through none, one, or both of the longitudinal outcomes (i.e., their latent processes). Our goal is to estimate each of these paths. For ease of illustration, let and denote the regression coefficients involving the intervention effect in, respectively, the first and second longitudinal submodel. These are typically time‐dependent due to the specification of an interaction of intervention by time in the longitudinal models. Based on Figure 2, we can distinguish a direct path, quantified by , and three different indirect paths linking the intervention to the first competing event, passing through one or both of the longitudinal outcomes. In Supplementary Figures 7a‐c, we highlighted the indirect paths one by one.

FIGURE 2

Schematic representation of the proposed joint model. Note that this represents a simplified version of the model, as next to the intervention, also other covariates may affect m (t), m (t), and the competing events. The first indirect path we can distinguish (see Supplementary Figure 7a) is the path that links the intervention to the first event, passing through the first longitudinal outcome. The parameter describes the intervention effect on the first longitudinal outcome over time, and the parameter quantifies the effect of this longitudinal outcome on the first event. Therefore, the indirect effect of the intervention on the first event mediated through the first longitudinal outcome is quantified as the product . Second, we observe an indirect path (see Supplementary Figure 7b) quantifying the intervention effect on the first event mediated through the second longitudinal outcome. This path involves the parameters and . The third indirect path (see Supplementary Figure 7c) quantifies the mediation through the first and second longitudinal outcome on the first competing event. That is, the intervention may affect the first longitudinal outcome, quantified by the parameter , which in turn is expected to affect the second longitudinal outcome. The second longitudinal outcome, in turn, may affect the risk of the first event. This path is quantified as . As is the case for the Cox model, these quantities' exponent expresses the effects as multiplicative changes on the hazard. For example, denotes the increase in the risk of the first event due to the indirect effect of the intervention through both longitudinal outcomes. Lastly, the total intervention effect on the first event in the joint model is the hazard ratio of the active vs the control group, with all other covariates held constant. For the proposed joint model, this hazard ratio is a combination of the direct path, which is time‐independent, and the three, typically time‐dependent, indirect paths. Therefore for the proposed joint model, the time‐varying hazard ratio for the overall intervention effect can be expressed as: Note that the hazard ratios for the direct, indirect, and total effect are conditional on the random effects and therefore have subject‐specific interpretations. For more information and a method to obtain the marginal overall treatment effect, we refer to Reference 15.

Traditional mediation analysis and our approach

There are two major general approaches for calculating the point estimate of the mediated effect in single‐mediator models: the difference in coefficients and the product of coefficients methods. Consider the following three regression equations: where is the independent variable, is the mediator, is the dependent variable, represents the relation between the independent and dependent variable, represents the relation between the independent and dependent variables adjusted for the effect of the mediator, represents the relation between the mediator and the dependent variable adjusted for the effect of the independent variable, and represents the relation between the independent variable and the mediator. With the difference in coefficients methods, the difference between the regression coefficients before and after adjustment for the mediator is calculated: . The product of coefficients method computes the product and is based on the rationale that the mediated effect is equal to the effect of the independent variable on the mediator times the effect of the mediator on the dependent variable. MacKinnon et al have shown that the two estimators, and , are mathematically equivalent when the dependent variable is continuous and ordinary regression is used. However, for multilevel models, logistic or probit regression, and survival analysis, the estimators are not always equivalent. , , , The traditional single‐mediator model can be extended to models with more than one mediator. Two forms of multiple mediator models are serial multiple mediator models, in which the mediators are linked together in a causal chain, and parallel multiple mediator models, in which the mediators are merely allowed to correlate but not to causally influence another mediator in the model. The parallel and serial mediator model are shown in Figure 3. The existing multivariate joint models belong to the class of parallel multiple mediator models. In these models, there is no direct link between the mediators; the mediators are allowed to correlate but not to causally influence one another.

FIGURE 3

Two multiple mediator models with two mediators

Two multiple mediator models with two mediators This article's proposed model has the form of the serial multiple mediator model with two mediators. As can be seen in the lower panel of Figure 3, is modeled as affecting through one direct and three indirect paths. The four regression equations that represent the two‐mediator serial mediator model are: where the total indirect effect can either be calculated by the difference in coefficients method, quantified as , or by the product of coefficients methods, quantified as the sum of the three indirect effects , , and . Although the proposed model has the form of the serial multiple mediator model, our approach to assessing latent mediation analysis differs from traditional mediation analysis for a few reasons. First, we assume that the observed longitudinal outcomes are realizations with error from latent processes, and we postulate that the latent process of the first longitudinal may affect the latent process of the second longitudinal outcome. Using the latent processes is advantageous as it allows to distinguish the true process of interest from its noisy observations. For the estimation of the mediated effect on the risk of the first competing event through the longitudinal outcomes, based on the product of coefficients method, we need the following equation: The total indirect effect based on the product of coefficients methods can be calculated by taking the sum of the three indirect effects , , and . As such, another advantage of using a joint model compared to traditional mediation analysis is that the path coefficients for the product of coefficients method can be estimated based on a single equation. In other words, they can be estimated simultaneously based on the same model while controlling for one another. To estimate the mediated effect based on the difference in coefficients method, we would need a second equation to obtain the unadjusted effect of the intervention on the risk of the first event (). For this reason, we do not recommend using the difference in coefficients method for our approach, as it is unclear whether we can combine estimates based on different models in this setting. Furthermore, modeling an interaction of the intervention by time in the longitudinal submodel(s) results in a time‐dependent intervention effect on the survival outcome. Using the difference in coefficients method it is not possible to capture such a time‐dependent mediated effect, as the estimator () is time‐independent by definition. Using the product of coefficients methods, it is straightforward to estimate a time‐dependent mediated effect because the estimator includes the time‐dependent intervention effects on the longitudinal outcomes and .

ANALYSIS OF THE LIPIDIDIET DATA

This section presents the analysis of the LipiDiDiet trial, briefly introduced in Section 1, using the proposed multivariate competing risk joint model. The LipiDiDiet trial is a randomized, controlled, multicenter trial performed at study sites in Finland, Germany, the Netherlands, and Sweden. The trial had an initial 24‐month intervention period and up to 72‐month double‐blind extension periods and showed positive effects on longitudinal measures of cognition, functioning, and brain atrophy. , Here, we focus on the first 36 months of intervention, which currently is the maximum reported intervention period. Our primary interest is in estimating the different effects of how the intervention affects the risk of starting open‐label medication use, both directly and indirectly, through one or both of the longitudinal outcomes. In the model, we assume that there is a true underlying profile for each subject, that is, the latent process that dictates how the observed values of hippocampal volume change over time. Likewise, we assume that the NTB memory domain instrument measures the true underlying memory profile . We model that the latent process of hippocampal volume has a direct effect on the latent process of NTB memory domain . We assume that the intervention and other covariates affect both latent processes, and we allow both latent processes to be associated with the risks of open‐label medication and dropout. To model hippocampal volume, we use a mixed‐effects submodel with random intercepts and slopes. We include both a main effect of the intervention and an interaction of intervention by time to allow the trajectory of the intervention groups to be different over time. Further, following References 1 and 2, we correct for the effect of baseline MMSE and site. The mixed‐effects submodel for hippocampal volume is The model for NTB memory domain takes the same form, except we include the latent process of hippocampal volume as a time‐varying covariate. For these models, and , defined as the regression coefficients involving the intervention effects denote and for the first longitudinal submodel and and for the second longitudinal submodel. To facilitate the comparison among the longitudinal outcomes, we scaled both longitudinal outcomes to zero‐mean and unit variances using the mean and standard deviation (SD) among all subjects and time points. To model the two competing events, we use cause‐specific hazard regression with separate Cox proportional hazards models for each of the competing events. We include the intervention, baseline MMSE, and site as baseline covariates (with for open‐label medication and for dropout) The association parameters and measure the strength of the association between the longitudinal outcomes and the risk of the corresponding events. For example, the quantity denotes the hazard ratio for open‐label medication use at time point for one SD increase in the longitudinal trajectory of hippocampal volume at the same time point. We ran the MCMC for 100 000 iterations and discarded the first 50 000 iterations as burn‐in. Detailed information on the estimation procedure, such as the prior distributions, can be found in the supplementary material. Convergence was assessed based on trace plots, based on which we concluded that the fit is satisfactory. Next to the different mediated effects, we are also interested in the overall intervention effect on the risk of open‐label medication use. Here, the overall intervention effect can be expressed as: Table 1 shows the results using the proposed multivariate joint model with hippocampal volume and NTB memory domain as longitudinal outcomes. Only the results for the most relevant parameters are shown here (all results are shown in the supplementary material).

TABLE 1

Posterior means, standard deviation, and the 95% credibility intervals for the multivariate joint model using left hippocampal volume and NTB memory domain as longitudinal outcomes

	Mean	SD	95% credibility interval
Longitudinal process (hippocampal volume)
β11	−0.183	0.021	−0.224 to −0.143
β12(ℬ1(t))	−0.064	0.118	−0.296 to 0.166
β13(ℬ1(t))	0.071	0.030	0.011 to 0.129
Longitudinal process (NTB memory domain)
β21	−0.088	0.034	−0.155 to −0.022
β22(ℬ2(t))	0.060	0.091	−0.119 to 0.239
β23(ℬ2(t))	0.051	0.046	−0.038 to 0.140
ξ	0.354	0.057	0.244 to 0.465
Survival process (open‐label medication)
γ11	0.153	0.210	−0.259 to 0.563
α11	−0.416	0.151	−0.710 to −0.118
α12	−1.073	0.166	−1.409 to −0.758
Survival process (dropout)
γ21	−0.241	0.179	−0.597 to 0.114
α21	0.047	0.142	−0.230 to 0.327
α22	−0.242	0.136	−0.509 to 0.025

Posterior means, standard deviation, and the 95% credibility intervals for the multivariate joint model using left hippocampal volume and NTB memory domain as longitudinal outcomes The parameter quantifies the direct effect on the risk of open‐label medication use. However, the 95% credible interval for does include 0, indicating there is much uncertainty about this effect. For the first indirect path (see Supplementary Figure 7a), that is, the path that quantifies the intervention effect on open‐label medication mediated through hippocampal volume, we observe a strong intervention effect on hippocampal volume (). In particular, we observe 0.071 SD (95% CI: 0.011 to 0.129) less reduction in hippocampal volume per year in the active group than in the control group. Furthermore, we observe a strong association () between hippocampal volume and the risk of open‐label medication. The sign of the ‐coefficient indicates the direction of the association. For hippocampal volume, the sign of the coefficient is negative, indicating that a higher value is found to be associated with a lower risk of open‐label medication use. Specifically an increase of one SD (0.62 units) in the trajectory of hippocampal volume is estimated with a risk reduction of 34% for open‐label medication use ( = 0.34). The effect of the intervention on open‐label medication mediated through this path expressed as the time‐varying hazard ratio is shown in Figure 4. This figure also shows the results of the other mediated effects and the total intervention effect.

FIGURE 4

The total intervention effect, the direct effect, and the different mediated effects for the LipiDiDiet data. The colors correspond with the mediation paths in Figure 7a‐c. The combined mediation through hippocampal volume denotes the total mediation through hippocampal volume. It is the sum of the mediation through only hippocampal volume and through both hippocampal volume and NTB memory domain. Further, the total intervention effect is the sum of the direct effect and the different mediated effects The second indirect path (see Supplementary Figure 7b) quantifies the intervention effect on open‐label medication through NTB memory domain. In this path, the parameter describes the intervention effect on NTB memory domain over time. The average decrease is estimated to be 0.051 SD (95% CI: 0.038 to 0.140) per year less in the active group than in the control group. Importantly, this effect does not quantify the full treatment effect on NTB memory as part of the effect is mediated through hippocampal volume. In its turn, the predicted value of NTB memory domain is strongly associated with the risk of open‐label medication, quantified by the parameter . Also for NTB memory domain, a higher value is associated with a lower risk of open‐label medication use. Specifically, an increase of one SD (0.90 units) in the trajectory of NTB memory is estimated with a risk reduction of 66.0% ( = 0.66). The parameter reflects the association between the two longitudinal outcomes. We observe a strong association; an increase of one SD (0.62 units) in the trajectory of hippocampal volume is estimated to increase the trajectory of NTB memory domain with 0.354 SD (95% CI: 0.244 to 0.465).

SIMULATION STUDY

We performed four simulation studies. The first simulation study aimed to verify that the proposed method results in unbiased estimates. In the second simulation study, we investigated the bias when the actual underlying data mechanism was based on the proposed model, with a first longitudinal outcome affecting a second one, but when only one of them was taken into account. Specifically, we used a univariate competing risk joint model for the first longitudinal outcome in the analysis step. In the third and fourth simulation study, we respectively compared the proposed model with the existing multivariate joint model and investigated the robustness of the direct and indirect effects when the model is slightly misspecified. For information on these simulation studies, we refer to the supplementary material.

Design

Data for the first three simulation studies were simulated under the same model structure, which we will now describe. For each simulation study and each scenario, we simulated 200 datasets based on the proposed multivariate competing risk joint model. The longitudinal submodels included an intercept, a linear time effect, an interaction of treatment by time, and random intercepts and slopes. In the second longitudinal submodel, we included the linear predictor of the first outcome as a time‐varying covariate, where the parameter quantified the association between the longitudinal outcomes. For the first and third simulation study we used . For the second simulation study, we considered four simulation scenarios, for which we used parameter values for with an increasing magnitude: (I) , (II) , (III) , and (IV) . For the survival submodels, we used cause‐specific hazard regression with separate Cox proportional hazards models with the intervention as the only covariate. In the first three simulation studies, the remaining parameters were , , , , , , , , , , , . The baseline hazards were simulated using B‐splines, and we used the same knots and spline coefficients for both survival submodels. For each dataset, we simulated 500 subjects and 10 equally spaced measurements per subject. The maximum follow‐up was 3.2, and the mean of the exponential distribution for the censoring mechanism was 5.

Analysis and results

Simulation study 1

In the first simulation study, we analyzed each simulated dataset with the same model structure as in the simulation step. The results are shown in Table 2, where the mean, bias, and coverage rate are given for each parameter. The bias is the mean difference between the estimate of the simulation and the true value. The coverage rate is the percentage of times the true parameter value falls in each simulation's credible interval. The proposed model seems to be performing well as means are close to the true values, the bias is small, and coverage rates are high. Supplementary Figure 9 also compares the average overall intervention effect with the true intervention effect. As can be seen, they are nearly identical.

TABLE 2

Results of simulation study 1 including the true parameter values, the mean of the MCMC sample means, the bias, and coverage for each parameter

	True values	Mean	Bias	Coverage
β10	0.070	0.066	−0.004	94%
β11	−0.250	−0.251	−0.001	97%
β12	0.120	0.120	0.000	98%
β20	0.150	0.149	−0.001	94%
β21	0.450	0.452	0.002	96%
β22	−0.100	−0.100	0.000	96%
γ11	−0.200	−0.231	−0.031	95%
γ21	0.100	0.085	−0.015	97%
α11	−0.600	−0.599	0.001	96%
α21	−0.100	−0.096	0.004	95%
α12	0.400	0.390	−0.010	95%
α22	0.100	0.093	−0.007	94%
ξ	−0.500	−0.497	0.003	95%

Note: Results are based on 200 simulated datasets.

Results of simulation study 1 including the true parameter values, the mean of the MCMC sample means, the bias, and coverage for each parameter Note: Results are based on 200 simulated datasets.

Simulation study 2

In the second simulation study, we analyzed each simulated dataset with a model in which the second longitudinal outcome was not included. We simulated the data under the proposed multivariate competing risk model but analyzed the data with a univariate competing risk joint model for the first longitudinal outcome. Supplementary Figure 8 presents a visualization of this simulation study. The gray parts denote the parameters that are omitted from the model. The results for each simulation scenario are presented in Figure 5. As can be seen, the overall intervention effect as estimated by the univariate joint model is biased in this situation, and the bias increases for larger values of .

FIGURE 5

Average time‐varying overall intervention effect (solid line) based on 200 simulated datasets with corresponding 95% percentile confidence interval for simulation scenarios I) ξ = −0.01, II) ξ = −1, III) ξ = −2, and IV) ξ = −3. The dashed line denotes the true overall intervention effect

DISCUSSION

Motivated by data from a prodromal AD trial, we proposed a multivariate competing risk joint model in which a first longitudinal outcome is allowed to influence a second longitudinal outcome. Existing multivariate joint models have the form of so‐called parallel mediator models, in which the longitudinal outcomes are merely allowed to be correlated but not to causally influence one another. Using the proposed model, we could model a direct effect of hippocampal brain atrophy on the memory impairment ratings. We allowed both hippocampal brain atrophy and memory to be associated with the risk of open‐label medication use and dropout. The intervention was also allowed to affect both hippocampal brain atrophy and memory, with the latter effect being direct or indirect through hippocampal volume. The proposed method provided a framework for latent mediation analysis, which helped understand the intervention's underlying biological processes. For the LipiDiDiet trial, we found that part of the intervention effect is mediated through hippocampal brain atrophy. However, there could be other effects of the underlying biological processes that we were not aware of, and therefore did not consider in our model. For the data we considered here, the choice on the directionality was based on biological grounds. As the intervention (Souvenaid) was designed to provide neuroprotection (by supplying rate‐limiting compounds for brain phospholipid synthesis and addressing multiple AD‐related pathological processes ), our hypothesized working mechanism was that the intervention affects the brain, which is noticeable with brain imaging outcomes of, for example, hippocampal volume. In turn, changes in the brain may affect how well patients perform on cognitive tests, such as on NTB memory domain. For other datasets where the predominant order between the longitudinal outcomes might be less clear, a decision on the directionality should be made with care and together with a clinical expert. However, we also argue that the proposed model is developed explicitly for situations in which the longitudinal outcomes are hypothesized to be linked together in a causal chain, where a first longitudinal outcome is expected to affect a second one. If no direct link or clear order between the outcomes is assumed, the existing multivariate joint models are perhaps more suitable as the outcomes are only allowed to be correlated in these models. Also, the estimation process here cannot be entirely separated for the two longitudinal outcomes due to their simultaneous estimation (for more information, we refer to the supplementary material). A first simulation study showed good performance of the proposed method. A second simulation study showed the bias in the overall intervention effect when the data was analyzed with a standard univariate competing risk joint model, while the actual underlying data mechanism was based on the proposed model. We argue that the joint model, and particularly the proposed type of joint model, is a very suitable and intuitive method for latent mediation analysis with a survival outcome and has certain advantages compared to traditional mediation analysis. First, the different effects that form the total indirect effect, that is, the total mediated effect, can be estimated based on the same joint model while controlling for one another. Second, it is straightforward to model a time‐varying effect on the survival outcome using a joint model. Third, in the proposed method, we assume that the latent process rather than the observed values of the first longitudinal outcome affects the latent process of the second longitudinal outcome, which allows us to focus on the true process of interest instead of its noisy observations. This is different from traditional mediation analysis, where the observed outcomes are used. Another difference is that the hazard ratios for the direct, indirect, and total effect presented here are conditional hazard ratios, whereas often marginal hazard ratios are considered. In this article, we have assumed that the first longitudinal outcome's underlying value at time affects the second longitudinal outcome's underlying value at the same time point. However, similar to the association structure between the longitudinal outcome and the survival outcome, , we could use a more complex association structure for the association between the longitudinal outcomes. For example, the second longitudinal outcome's value at time point could be related to the slope at or the cumulative value up to of the first longitudinal outcome. A simulation study (not shown here) confirmed that the proposed method also works for these types of dependencies between the longitudinal outcomes. Checking for model misspecification is always important, but it is even more important for our approach as the latent process of the second longitudinal outcome (i.e., NTB memory) depends on the model for the first longitudinal outcome and whether this model is correctly specified. It is recommended to investigate the model's fit using figures, such as comparing the fitted vs predicted trajectories for different subjects and examining the models' assumptions using QQ and residuals vs fitted values plots. These figures can often show if a model adequately fits the data and guide analysts to adapt it if it does not fit the data. Here, we have used linear models for both longitudinal outcomes, but it is also possible to use more complex functions of time such as quadratic or higher‐order polynomials, such as splines. We suggest fitting several plausible models as a sensitivity analysis. In the supplementary material, we included such a sensitivity analysis for the data considered in this article. We modeled a constant intervention effect across subjects and across time within subjects. One should be aware that this is a simplification that might be more or less permissible depending on the nature of the intervention, disease, and time frame under investigation. However, when this assumption is not plausible, the model could easily be extended with a random treatment effect and a random treatment by time interaction (and potentially more complex forms of this interaction), allowing each subject to respond differently to the treatment, that is, having its own time‐varying treatment effect. Finally, it is important to emphasize that although we have been able to perform latent mediation analysis, we did not prove the pattern of causation, as shown in Figure 1. One of the main assumptions in causal mediation analysis is the assumption of sequential ignorability, which, for a single mediator, assumes that (i) given the observed covariates, the intervention assignment is ignorable of the mediator process and the outcome of interest, and (ii) given the observed covariates and the intervention, the mediator is ignorable of the outcome. Usually, the randomization can only guarantee the ignorability of the intervention assignment, whereas the second part of this assumption also requires that, conditional on the intervention and the observed covariates, the mediator can be treated as if it were randomized. In other words, the ignorability of the mediator implies that there is no unmeasured confounding between the mediator and the outcome, which excludes the possibility of both unobserved pretreatment confounding or any post‐treatment confounding. In our case, with multiple (causally) dependent mediators, this ignorability assumption is violated as one mediator acts as a post‐treatment confounder for the other mediator on the outcome of interest. , Moreover, because the mediators are repeatedly measured, the association between a mediator at a given time point and the outcome may be confounded by other previously measured assessments of the mediator. It is of our future interest to investigate how we can extend the proposed model to allow for a causal interpretation. However, even for the joint model with one longitudinal outcome, research on its causal (latent) mediation mechanism is very limited , and has yet to be investigated. The code to fit the proposed multivariate competing risk joint model is available in the supplementary material.

CONFLICT OF INTEREST

The authors Floor M. van Oudenhoven and Sophie H. N. Swinkels are employees of Danone Nutricia Research. Data S1 Supplementary material Click here for additional data file.

22 in total

1. Joint modelling of longitudinal measurements and event time data.

Authors: R Henderson; P Diggle; A Dobson
Journal: Biostatistics Date: 2000-12 Impact factor: 5.899

2. Multilevel mediation modeling in group-based intervention studies.

Authors: J L Krull; D P MacKinnon
Journal: Eval Rev Date: 1999-08

3. Joint models for multivariate longitudinal and multivariate survival data.

Authors: Yueh-Yun Chi; Joseph G Ibrahim
Journal: Biometrics Date: 2006-06 Impact factor: 2.571

4. A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time-to-event.

Authors: Dimitris Rizopoulos; Pulak Ghosh
Journal: Stat Med Date: 2011-02-21 Impact factor: 2.373

5. A Simulation Study of Mediated Effect Measures.

Authors: David P Mackinnon; Ghulam Warsi; James H Dwyer
Journal: Multivariate Behav Res Date: 1995-01-01 Impact factor: 5.923

6. A joint model for survival and longitudinal data measured with error.

Authors: M S Wulfsohn; A A Tsiatis
Journal: Biometrics Date: 1997-03 Impact factor: 2.571

7. Joint modeling of two longitudinal outcomes and competing risk data.

Authors: Eleni-Rosalina Andrinopoulou; Dimitris Rizopoulos; Johanna J M Takkenberg; Emmanuel Lesaffre
Journal: Stat Med Date: 2014-03-27 Impact factor: 2.373

8. Joint modelling of longitudinal and competing risks data.

Authors: P R Williamson; R Kolamunnage-Dona; P Philipson; A G Marson
Journal: Stat Med Date: 2008-12-30 Impact factor: 2.373

9. A joint model for longitudinal measurements and survival data in the presence of multiple failure types.

Authors: Robert M Elashoff; Gang Li; Ning Li
Journal: Biometrics Date: 2007-12-20 Impact factor: 1.701

10. Accommodating informative dropout and death: a joint modelling approach for longitudinal and semi-competing risks data.

Authors: Qiuju Li; Li Su
Journal: J R Stat Soc Ser C Appl Stat Date: 2017-01-30 Impact factor: 1.864

1 in total

1. Modeling the underlying biological processes in Alzheimer's disease using a multivariate competing risk joint model.

Authors: Floor M van Oudenhoven; Sophie H N Swinkels; Tobias Hartmann; Dimitris Rizopoulos
Journal: Stat Med Date: 2022-05-18 Impact factor: 2.497

1 in total