Eleni-Rosalina Andrinopoulou1, Kazem Nasserinejad2, Rhonda Szczesniak3,4, Dimitris Rizopoulos1. 1. Department of Biostatistics, Erasmus MC, Rotterdam, The Netherlands. 2. Department of Hematology, Erasmus MC, Rotterdam, The Netherlands. 3. Division of Biostatistics & Epidemiology and Division of Pulmonary Medicine, Cincinnati Children's Hospital Medical Center, Cincinnati, USA. 4. Department of Pediatrics, University of Cincinnati, Cincinnati, USA.
Abstract
Cystic fibrosis is a chronic lung disease requiring frequent lung-function monitoring to track acute respiratory events (pulmonary exacerbations). The association between lung-function trajectory and time-to-first exacerbation can be characterized using joint longitudinal-survival modeling. Joint models specified through the shared parameter framework quantify the strength of association between such outcomes but do not incorporate latent sub-populations reflective of heterogeneous disease progression. Conversely, latent class joint models explicitly postulate the existence of sub-populations but do not directly quantify the strength of association. Furthermore, choosing the optimal number of classes using established metrics like deviance information criterion is computationally intensive in complex models. To overcome these limitations, we integrate latent classes in the shared parameter joint model through a fully Bayesian approach. To choose the optimal number of classes, we construct a mixture model assuming more latent classes than present in the data, thereby asymptotically "emptying" superfluous latent classes, provided the Dirichlet prior on class proportions is sufficiently uninformative. Model properties are evaluated in simulation studies. Application to data from the US Cystic Fibrosis Registry supports the existence of three sub-populations corresponding to lung-function trajectories with high initial forced expiratory volume in 1 s (FEV1), rapid FEV1 decline, and low but steady FEV1 progression. The association between FEV1 and hazard of exacerbation was negative in each class, but magnitude varied.
Cystic fibrosis is a chronic lung disease requiring frequent lung-function monitoring to track acute respiratory events (pulmonary exacerbations). The association between lung-function trajectory and time-to-first exacerbation can be characterized using joint longitudinal-survival modeling. Joint models specified through the shared parameter framework quantify the strength of association between such outcomes but do not incorporate latent sub-populations reflective of heterogeneous disease progression. Conversely, latent class joint models explicitly postulate the existence of sub-populations but do not directly quantify the strength of association. Furthermore, choosing the optimal number of classes using established metrics like deviance information criterion is computationally intensive in complex models. To overcome these limitations, we integrate latent classes in the shared parameter joint model through a fully Bayesian approach. To choose the optimal number of classes, we construct a mixture model assuming more latent classes than present in the data, thereby asymptotically "emptying" superfluous latent classes, provided the Dirichlet prior on class proportions is sufficiently uninformative. Model properties are evaluated in simulation studies. Application to data from the US Cystic Fibrosis Registry supports the existence of three sub-populations corresponding to lung-function trajectories with high initial forced expiratory volume in 1 s (FEV1), rapid FEV1 decline, and low but steady FEV1 progression. The association between FEV1 and hazard of exacerbation was negative in each class, but magnitude varied.
Entities:
Keywords:
Classification; Dirichlet prior; clustering; joint model; latent class model; longitudinal outcome; medical monitoring; survival outcome
Cystic fibrosis (CF) is a lethal genetic disorder that primarily affects the lungs.
The clinical course of CF is marked by progressive loss of lung function and
typically results in respiratory failure. Forced expiratory volume in 1 s
(hereafter, FEV1) is the most important clinical
indicator in monitoring lung function decline in patients with CF. Patients during
follow-up might experience acute respiratory events referred to as pulmonary
exacerbations. It is, therefore, of clinical interest to characterize the
association between the longitudinal outcome FEV1 and
time-to-first exacerbation. The motivation for our research comes from the US CF
Foundation Patient Registry that consists of patients that were monitored from 2003
until 2015. In particular, we examined a subset of the Registry, which consists of
1016 patients. These patients were six years and older and were observed with a
median number of follow-up visits equal to six (with a range of 1–93 visits). The
average age at baseline is 15 years (with a range of 6–21).Several authors have studied the evolution of lung function over time, as summarized
in a recent review;[1] however, to our knowledge, little work has been done regarding the
association of the lung function such as FEV1 with
time-to-event outcomes. In particular, joint modeling of longitudinal
FEV1 and survival outcomes in CF was introduced
several years ago,[2] but has not been further used in CF epidemiology due to the computational
burden of this approach. Furthermore, it is well recognized that different
unobserved sub-groups of the biomarker FEV1 exhibit
different longitudinal profiles.[3] Patients can be categorized in several sub-groups (latent classes) with
different trajectories. It is, therefore, of high clinical interest to measure the
strength of association between FEV1 with the risk of
first exacerbation accounting for the latent trajectories.The joint model of longitudinal and survival data constitutes a popular framework to
analyze longitudinal and survival outcomes simultaneously.[4,5] In particular, two paradigms
within this framework are the shared parameter joint models and the joint latent
class models. The former paradigm links the longitudinal and the survival process
via the random effects; however, it does not allow for latent classes.[6-11] The latter paradigm, which
associates the two processes through latent classes, explicitly postulates the
existence of sub-populations.[12-14] The main disadvantage of this
approach is that there is no clear interpretation for the association of the two
outcomes. In particular, it is not possible to obtain a parameter that quantifies
the relationship between FEV1 and time-to-first
exacerbation. The use of latent classes in the shared parameter model has been
previously proposed in the framework of joint models of longitudinal and survival
data. In particular, Jacqmin-Gadda et al.[15] focused on the evaluation of the conditional independence assumption by
proposing a score test; however, the relationship between the longitudinal and
survival outcome per class is not further discussed.In our clinical application, we are mainly interested in quantifying the association
between the longitudinal outcome FEV1 and the survival
outcome time-to-first exacerbation using the shared parameter model. From the
literature, it has been shown that sub-populations exist for the evolution of
FEV1.[3] The aim of the paper is twofold. Firstly, to model the relationship between
FEV1 and time-to-first exacerbation. For this
purpose, we propose a Bayesian shared parameter joint model that integrates latent
classes inherent in this heterogeneous population. This model will assess the
strength of the association between the two outcomes while allowing for latent
classes. Secondly, to address a problem that arises in latent class models, which is
the selection of the optimal number of classes. Several approaches have been
proposed in the literature both in frequentist and Bayesian frameworks, including
among others the use of information criterion, Bayes factors, and reversible jump
Markov chain Monte Carlo (MCMC). These approaches are computationally intensive and
can require the fit of several models with different numbers of classes, which can
be time-consuming. To overcome this problem, we will implement the method of
Nasserinejad et al.[16] to our joint model. This method is a pragmatic extension of Rousseau and Mengersen[17] criterion that showed that when we overfit a mixture model by assuming more
latent classes than present in the data, the superfluous latent classes will
asymptotically become empty if the Dirichlet prior on the class proportions is
sufficiently uninformative. Nasserinejad et al.[16] performed an extensive simulation study to further investigate this approach
and used it as a criterion also in longitudinal studies for obtaining the optimal
number of classes by simply excluding latent classes that are negligible in
proportion.
Joint model estimation
Longitudinal submodel
To account for the fact that the population is heterogeneous and consists of
G possible unobserved sub-groups, we postulate a latent
class mixed-effects model.[18-20] We let denote the longitudinal response vector for the
ith patient () obtained at different time points
t > 0 . In particular, we have where presents the latent class indicator, denotes the design vector for the fixed effects regression
coefficients , and is the design vector for the random effects . Moreover, . For the corresponding random effects, we assume a
multivariate normal distribution, namely where N denotes the normal distribution and Σ is the variance diagonal matrix of the random effects. An individual
has a probability of belonging to latent class g. Using a
multinomial distribution we obtain the class of each individual asAccording to the specification of the latent class mixed-effects submodel (1),
both fixed and random effects are class-specific, whereas the measurement error
and the variance diagonal matrix of the random effects Σ are not.
Survival submodel
We let denote the true failure time for the ith
individual and C the censoring time. Moreover,
denotes the observed failure time and is the event indicator where zero corresponds to censoring. We
postulate a joint model for the relationship between the survival and the
longitudinal outcome. Specifically, we have where is a vector of baseline covariates with a corresponding vector
of regression coefficients and is the baseline hazard. Specifically, the B-splines baseline
hazard function is assumed as , where denotes the qth basis function of a B-spline
with knots and is the vector of spline coefficients. The knots are placed at
the corresponding percentiles of the observed event times. Furthermore,
α denotes the association parameter for the
gth class. According to the specification of the survival
submodel (2) the baseline covariates, the baseline hazard, and the association
parameter are class-specific parameters. The proposed model goes beyond the
standard joint model and joint latent class model where a single or no
association parameter is assumed and provides a class-specific association. This
is a more realistic assumption for the motivating data set since it is
clinically expected that the risk of the first exacerbation will be higher when
the rate of FEV1 decline is faster. Accounting for
these latent classes will lead to improved estimates of association arising from
the joint model.
Bayesian estimation
We employ a Bayesian approach where inference is based on the posterior distribution
of the parameters in the model. We use MCMC methods to estimate the parameters of
the proposed model. The likelihood of the model is derived under the assumption that
the longitudinal and survival processes are independent given the random effects and
the latent classes. Moreover, the longitudinal responses of each subject are assumed
independent given the random effects and the latent classes. The likelihood
contribution for the ith patient is written as where with and .The likelihood contribution of the longitudinal outcome takes the formThe likelihood contribution of the survival model is given byThe posterior distribution is written as
where
and denotes the prior distributions.A commonly used prior in mixture models for the class probability is a Dirichlet
distribution. In particularSmall values of correspond to a less informative prior and a flat prior
distribution is obtained when each a is equal to 1. The
selection of is an important task and will be discussed in the next section. Standard
priors can be assumed for the rest of the parameters. In particular, for the
coefficients of the longitudinal fixed effects, the survival covariates, and the
baseline hazard, normal priors can be taken. For the elements of the variance
diagonal matrix of the random effects and the variance parameter of the longitudinal
outcome, we can assume a gamma prior.
Selection of number of classes
An important task in latent class models is to identify the optimal number of
classes. Several approaches have been previously proposed for choosing the optimal
number of classes in both frequentist and Bayesian settings. Common examples are the
Bayesian information criterion (BIC),[21] deviance information criterion (DIC),[22] and other Bayesian approaches such as Bayes factor and reversible jump MCMC algorithm.[23] A drawback of the approaches above is that they are computationally intensive
and some require the fit of models assuming different numbers of classes, which
might be time-consuming for complex models such as the joint models of longitudinal
and survival outcomes.Furthermore, it has been previously discussed in the literature that a problem that
arises in the calculation of the DIC in mixture models is that the posterior mean of
the parameters may not lead to good predictive estimates. The MCMC parameters suffer
from label switching, making the DIC (which is based on averaging over MCMC draws)
unstable. A more appropriate choice for the estimates would be the mode of the
posterior distribution. A wide range of options for constructing an appropriate DIC,
including also mixture models, has been proposed by Celeux et al.[22] However, these are not straightforward to apply with existing software.An interesting alternative was proposed by Rousseau and Mengersen,[17] where they proved that in overfitted mixture models (with more latent classes
than present in the data), the superfluous latent classes will asymptomatically
become empty if the Dirichlet prior on the class proportion is sufficiently
uninformative. Recently, Nasserinejad et al.[16] used this approach and proposed a latent class selection procedure for
longitudinal models. An overfitted mixture model converged to the true mixture by
assigning a small portion of individuals to empty classes, if the parameters of the
Dirichlet prior are smaller than , where d is the number of class-specific
parameters (excluding the random effects). Furthermore, non-informative priors for
the rest of the parameters are required. The steps are described as follows: where G is the total number of classes, k
represents the iteration, is the number of patients in class g at iteration
k, n is the total number of patients, and
ψ is a predefined value.First, a latent class model with a large enough number of latent classes
is fitted.Then, the number of non-empty classes at each iteration is calculated asAfter obtaining the non-empty classes per iteration, the posterior mode
of the non-empty classes is calculated.Finally, the model with the optimal number of classes, which are the
non-empty classes, is refitted.Advantages of this approach are that it is easy to implement even in such complex
models, and it is not influenced by the label switching problem since we observe the
non-empty classes at each iteration. The only time that we need to correct for label
switching is when we fit the final model with the optimal number of classes.
Furthermore, this approach requires us to fit the model only two times (namely one
with the high number of classes and one with the optimal number of classes) instead
of assuming all possible number of classes, therefore decreasing the computational
burden. It has been shown through extensive simulations in the longitudinal setting
that this method performs better than alternative model selection criteria such as
BIC and DIC.[16]
Simulations
We performed a series of simulations to validate the proposed model, to examine the
class selection method on the joint modeling framework, and to explore the
appropriate threshold that indicates a class as empty.
Design
We assumed around 1000 patients with maximum number of repeated measurements
equal to 10. For simplicity, in this section, we ignore the notation for the
latent classes (g). To simulate the continuous longitudinal
outcome, we used the following linear mixed-effects model per data set. In
particular where and . We adopted a linear effect of time for both the fixed and the
random part, and corrected for a binary variable (). Time t was simulated from a uniform
distribution between zero and 19.5. For the survival part, we assumed the
following modelThe baseline risk was simulated from a Weibull distribution . For the simulation of the censoring times, an exponential
censoring distribution was chosen so that the censoring rate was between 40% and
60%. Age was simulated from a normal distribution with mean 45 and standard
deviation 15.7.Under this setting, we simulated three different data sets that have different
parameters for the fixed effects in the longitudinal submodel, the baseline
covariates and baseline hazard in the survival submodel, the variance diagonal
matrix of the random effects and the association parameter (more details are
presented in Table
1). Figure 1
illustrates the evolution of the longitudinal outcome per gender from the
simulation parameters for each one of the three data sets.
Table 1.
Simulation parameters for the three data sets.
β
σy
diag{Σb}
ξ
μc
γ
α
Data set 1
(Intercept) = 8.03
0.69
0.87
1.8
10
(Intercept) = –4.85
0.38
Male = –5.86
0.02
Age = –0.02
Time = –0.16
Data set 2
(Intercept) = –8.03
0.69
0.02
1.4
10
(Intercept) = –4.85
0.08
Male = 12.20
0.91
Age = 0.09
Time = 0.46
Data set 3
(Intercept) = 0.03
0.69
0.28
1.8
10
(Intercept) = 2.85
0.58
Male = –1.96
0.31
Age = –0.12
Time = –0.01
Figure 1.
Evolution of the longitudinal outcome per gender from the simulation
parameters for each one of the three data sets.
Simulation parameters for the three data sets.Evolution of the longitudinal outcome per gender from the simulation
parameters for each one of the three data sets.
Analysis
In order to investigate the proposed methodology, we assumed the three following
scenarios. For Scenario I, we combined all three data sets assuming
, and individuals. For Scenario II, we combined the two data sets
with and . Finally, for Scenario III, we used one data set with
. The separate data sets represent the sub-populations. We
first used Scenarios II and III and fitted the proposed joint model (2) assuming
two and one class, respectively, to evaluate the model. We, then, assumed a
variety number of classes to investigate the proposed class selection method. In
the fitted models, all parameters were class-specific except for the measurement
error in the mixed-effects submodel and the variance diagonal matrix of the
random effects. These include the fixed effects from the longitudinal submodel
(three parameters), the baseline covariates from the survival model (two
parameters), the baseline hazard (eight parameters) from the survival submodel,
and the association parameter (one parameter). We assumed , which is smaller than the total number of the class-specific
parameters divided by two. The priors that we used are: where denotes the inverse gamma distribution. For the variance of
the association parameter, no large value was required to ensure that we have a
non-informative prior since we simulated this parameter to be smaller than 0.6.
We ran the MCMC using a single chain with 50,000 iterations, 25,000 burn-in, and
10 thinning. We performed 150 simulations per scenario.,,,,diag{,We compared our proposed class selection methodology with the DIC criterion.
Several adaptations have been proposed for mixture models by Celeux et al.[22] The parameters are not always identifiable, and the posterior mean of the
parameters can be a poor estimator; therefore, in our simulation study, we
assumed DIC3. A frequently used package, when the
focus is on the association between a longitudinal and a survival outcome while
taking into account that different sub-populations exist, is the
lcmm in R developed by Proust-Lima et al.[24] We used the function Jointlcmm() and assumed the
BIC criterion to obtain the optimal number of classes and compare it with the
proposed approach. The same specifications, as described for the data
simulation, were assumed for fitting the models. A difference can be found in
the Jointlcmm() function, where the longitudinal and the
survival outcomes are only connected via the latent classes, and a cubic
M-splines baseline risk function was assumed. We present an overview of the
simulation study in Table
2.
Table 2.
Simulation study overview.
Evaluating the proposed model
Simulate
Fit
1 data set
1 class
2 data sets
2 classes
Evaluating the proposed class selection method (DIC)[a]
Simulate
Fit
3 data sets
2, 3, 4 classes
2 data sets
1, 2, 3 classes
1 data set
1, 2 classes
Evaluating the proposed class selection method
(BIC)
Simulate
Fit
3 data sets
1–6 class
2 data sets
1–6 class
1 data set
1–6 classes
aNote that for the DIC criterion, we did not assume all
classes (1–6) since it is computationally intensive to run so many
complex models.
Simulation study overview.aNote that for the DIC criterion, we did not assume all
classes (1–6) since it is computationally intensive to run so many
complex models.
Results
The results from the different scenarios for the validation of the proposed model
are presented in Table
1 in the Supplementary Material. We obtain that the true parameters
are close to the model parameters. The results from the different scenarios for
the investigation of the proposed class selection approach are illustrated in
Table 3. In
particular, we present for different ψ the percentage of the
true number of classes and the mode of the number of classes.
Table 3.
Simulation results: cut-off ψ, percentage of true number
of classes, mode of the number of classes.
ψ (%)
True # of classes (%)
Mode of # of classes
Scenario I:
150 simulations
3 classes simulated
1
0
6
2
3
6
5
12
5
8
39
4
10
54
3
12
66
3
15
72
3
Scenario II:
150 simulations
2 classes simulated
1
10
6
2
27
2
5
30
3
8
35
2
10
37
2
12
44
2
15
49
2
Scenario III:
150 simulations
1 class simulated
1
0
5
2
2
4
5
8
2
8
15
2
10
20
2
12
22
2
15
30
2
Simulation results: cut-off ψ, percentage of true number
of classes, mode of the number of classes.For Scenario I, we obtain the highest percentage when assuming ψ
to be between 10 and 15%. In particular, assuming that the ψ is
equal to 15% we obtain 72% of the time the correct number of classes and a mode
equal to the correct number of classes (three). On the other hand, using the DIC
and BIC criterion, we obtain 7% and 13% of the time the correct number of
classes, respectively. Furthermore, both methods seem to underestimate the true
number of classes (mode equal to one).For Scenario II, we obtain the highest percentage when ψ is
between 8 and 15%. In particular, assuming that the ψ is equal
to 15% we obtain 49% of the time the correct number of classes and a mode equal
to the correct number of classes (two). On the other hand, using the DIC and BIC
criterion, we obtain 0% and 5% of the time the correct number of classes,
respectively. Similar to Scenario I, both methods seem to underestimate the true
number of classes (mode equal to one).Finally, for Scenario III, the DIC and BIC criteria seem to perform better than
the proposed approach, where it almost always selects the correct number of
classes (one). This is not surprising since those criteria always underestimated
the true number of classes in the previous scenarios. These results are also in
line with previous work by Nasserinejad et al.[16] Using the proposed approach and assuming that the ψ is
equal to 15%, we obtain 30% of the time, the correct number of class and a mode
equal to two.
Analysis of the CF data
In this section, we present the analysis of the motivating data set. Our primary
focus is to investigate the association between FEV1 and
time-to-first exacerbation by taking into account that we have sub-groups with
different evolution over time for FEV1. The first step
is to obtain the optimal number of classes that can explain the heterogeneity of the
population. From the literature, it is known that two or three classes are observed
for the evolution of FEV1 outcome.[3] Therefore, for the selection process, we fitted a joint model assuming six
classes. For the longitudinal outcome, we assumed a linear mixed-effects submodel
including natural cubic splines for time (modeled as age, in years) with one
internal knot at 15.84 years (corresponding to 50% of the observed follow-up times)
in both the fixed and random effects parts. The DIC criterion and subject-specific
plots (illustrating the observed and predicted values) were used to investigate the
need for a non-linear evolution over time. Based on the DIC criterion, the best fit
was obtained when using cubic splines with two knots. However, the observed versus
predicted value plots did not show any drastic difference when assuming one knot;
therefore, we decided to continue with the simplified model. Furthermore, we
corrected for some baseline characteristics which were mainly based on clinical
relevance and the literature. These variables, together with descriptive statistics,
are presented in Table
4. In Figure 2, the
FEV1 evolutions of 25 randomly selected patients
with more than one repeated measurements are presented.
Table 4.
Descriptive statistics of the variables that were used in the model.
Percentage
Gender
Males
43
Females
57
Number of F508del alleles (genotype)
Homozygous
53
Heterozygous
32
Neither
6
Missing
9
SESlow (using state/federal or having no insurance
is a marker of low socioeconomic status)
PancEnzymes (taking a pancreatic enzyme
supplement,
marks pancreatic insufficiency)
Yes
40
No
60
Mean (standard deviation)
Numvisityr
(number of visits at the last follow-up within the prior
year)
5 (3)
Figure 2.
Individual FEV1 evolutions of 25 randomly
selected patients with more than two repeated measurements.
Descriptive statistics of the variables that were used in the model.Individual FEV1 evolutions of 25 randomly
selected patients with more than two repeated measurements.Specifically, the model takes the formTo investigate the association between FEV1 and
time-to-first exacerbation, we postulated the proposed joint latent class modelFor the baseline hazard, we assumed a quadratic B-splines basis with five internal
knots placed at the corresponding percentiles of the observed event times ranging
from 11.7 until 20.8 years.In the Dirichlet distribution for the prior of the class probability, following the
recommendation in Nasserinejad et al.,[16] we assumed smaller than (where d is the number of class-specific
parameters). To ensure that we have the same scale for the coefficients of the
covariates in order to easier select non-informative priors, we standardized the
FEV1 outcome and the continuous variables (age and
numVisityr). Relatively uninformative priors were selected for the parameters in the
model. These priors are as follows:,,,,diag{.For the variance of the association parameter, no large value was required to ensure
that we have a uninformative prior since with the standard joint model we obtained
an association parameter smaller than 0.1. We ran the MCMC using a single chain with
500,000 iterations, 450,000 burn-in, and 100 thinning. The results indicate the
presence of three or four classes, assuming that a class is empty if it contains 10
to 15% of the patients (10% 15%). Since it is established in the literature that two or three
classes are present in such populations, we decided to continue with three classes.[3],[25]We reran the model assuming three classes and the normal scale of the continuous
covariate age, numVisitsyr and FEV1 outcome. We ran the
MCMC using 500,000 iterations, 450,000 burn-in, and 100 thinning. We, moreover,
assumed two chains with different initial values to investigate the convergence
towards local maximum. Density plots are presented in the Supplementary Material
(Figures 1
to 5). We assumed the same
priors as before except for , where we used a lower variance due to convergence problems, in
particular we assumed N(0, 100). We used a method proposed by Stephens[26] to handle the label switching problem. In particular, this method uses
relabelling algorithms to perform a k-means type clustering of the MCMC samples. We
should also note that when a lot of observations are available (like in our
application), the label switching problem occurs less. Convergence was monitored by
trace plots which are presented in the Supplementary Material (Figures 6 to 10). To
investigate whether we have weak identifiability of the model parameters, we
compared the prior and posterior distributions. The posterior was distinguishable
from the prior indicating that our model is identifiable. The figures are presented
in the Supplementary Material (Figures 11 to 15). Table 2 in the Supplementary Material shows
the mean and standard deviation of age (at baseline),
FEV1 (at baseline), and number of visits (at last
follow-up) per class, while Table 3 in the Supplementary Material shows the percentage of the
categorical variables (at baseline) per class. In Figure 3, we illustrate the evolution of the
longitudinal outcome in each class assuming patients who are females, are F508del
homozygotes, without low SES, are not infections with MRSA, MSSA, or Aspergillus, do
not use pancreatic enzyme, do not have Pseudomonas aeruginosa and
had five visits within the prior year (which is the mean value of all observations).
In general, we obtain a faster progression in class two. Patients in class three
have a more stable evolution in the beginning of the follow-up and patients in class
one seem to have an increased evolution in the beginning (this could be explained by
the measurement error or possible transplantations). In addition, patients in class
two start from a higher FEV1 compared to patients in
classes one and three. In Figure
4, we illustrate the evolution of the longitudinal outcome in each class
assuming patients who are females, are F508del homozygotes, have low SES, are
infections with MRSA, MSSA, Aspergillus, use pancreatic enzymes, have
Pseudomonas aeruginosa, and had eight visits within the prior
year. We obtain that patients in these classes start from a lower
FEV1 value compared to Figure 3. The mean and the credible interval
of the MCMC samples of the association parameters per class are presented in Figure 5. In our proposed
model, two of the three subgroups did not have a strong effect. In particular, we
obtain a weak association between FEV1 and time-to-first
exacerbation for the first and third classes, while a strong negative association
for class two. The association parameter α can be
interpreted as the log hazard ratio of the time-to-first exacerbation for class
g when the FEV1 value is increased
by one unit, given that the gender is the same. In particular, for class two, a
10-unit decrease in the FEV1 value (which is a clinical
meaningful difference) results in a hazard ratio of 1.49.
Figure 3.
Evolution of the longitudinal outcome FEV1 per
class assuming patients who are females, are F508del homozygotes, without
low SES, are not infections with MRSA, MSSA, or Aspergillus, do not use
pancreatic enzyme, do not have Pseudomonas aeruginosa and
had five visits within the prior year which is the mean value of all
observations. The plots illustrate the posterior mean and credible interval
for each class.
Figure 4.
Evolution of the longitudinal outcome FEV1 per
class assuming patients who are females, are F508del homozygotes, have low
SES, are infections with MRSA, MSSA, Aspergillus, use pancreatic enzymes,
have Pseudomonas aeruginosa and had eight visits within the
prior year. The plots illustrate the posterior mean and credible interval
for each class.
Figure 5.
Mean and credible interval of the association parameter per class.
Evolution of the longitudinal outcome FEV1 per
class assuming patients who are females, are F508del homozygotes, without
low SES, are not infections with MRSA, MSSA, or Aspergillus, do not use
pancreatic enzyme, do not have Pseudomonas aeruginosa and
had five visits within the prior year which is the mean value of all
observations. The plots illustrate the posterior mean and credible interval
for each class.Evolution of the longitudinal outcome FEV1 per
class assuming patients who are females, are F508del homozygotes, have low
SES, are infections with MRSA, MSSA, Aspergillus, use pancreatic enzymes,
have Pseudomonas aeruginosa and had eight visits within the
prior year. The plots illustrate the posterior mean and credible interval
for each class.Mean and credible interval of the association parameter per class.We, furthermore, examined this association parameter while ignoring the latent
sub-populations. In that case, the association parameter is smaller than the largest
parameter from our proposed model. This indicates that the use of a common
association parameter for all sub-populations would lead to an
underestimated/overestimated parameter for each group. Since it is expected that
patients with a constant lung-function trajectory are less likely to experience
exacerbation compared to the patients with a steeper decline, it is not realistic to
assume a common association between the FEV1 evolution
and time-to-first exacerbation for those group of patients.
Discussion
In this paper, we proposed a shared parameter joint model incorporating latent
classes. Applying it to CF data, this model accounted for patient heterogeneity
inherent in the progression of FEV1. Compared to
previously proposed joint latent class models,[13] we obtained the strength of the association between
FEV1 and time-to-first exacerbation per group of
patients. Finally, we focused on the selection of the optimal number of classes and
used an overfitted mixture model (high number of classes) to obtain the non-empty
classes.A limitation of this approach is that it requires an intensive computational effort.
In particular, for the class selection, where a model with a high number of classes
is required, the number of parameters increases drastically. This, in combination
with the high number of observations in the CF application increases the
computational time that is required. Considering the difficulty of this model, it is
almost impossible to obtain the optimal number of classes with other Bayesian
criterion. Implementing the proposed criterion is straightforward; however, due to
the complexity of the model, it is computationally expensive to fit a model with a
larger number of classes, e.g. 10. It was shown in the simulation analysis that the
DIC and BIC criteria always underestimated the true number of parameters, and it,
therefore, performed better when the true number of classes was one. Even though in
that scenario, the proposed method did not work perfectly, it seems to be better
than other criteria and easier to perform.The main limitation and challenge of the proposed approach is the choice of the
threshold that indicates whether a class is empty or not. A simulation study was
performed to investigate whether the proposed class selection method would be
appropriate for the shared parameter models. An interesting finding was that the
threshold was higher compared to simple approaches, such as mixed models. In
particular, in our shared parameter model with integrated latent classes, the
threshold was between 10% and 15%. In more simple settings, such as a linear mixed
model that was extensively investigated with simulations by Nasserinejad et al.,[16] this threshold was lower. Since this threshold highly depends on the
complexity of the model, it is advisable to perform simulation studies to decide on
a realistic cut-off when a different model is used. When no clear decision can be
taken, our proposed criterion could be combined with other criteria to investigate
the optimal number of classes in fewer models. Caution, however, is needed when
using other criteria since we showed that the DIC and BIC performed worse in finding
the correct number of classes. These results were in line with previous work by
Nasserinejad et al.[16] The proposed approach led to the same number of classes as discussed in the
CF literature on FEV1 trajectory classification;[3],[25] therefore, we did not investigate further the number of classes using other
criteria. A further limitation of the manuscript is that we did not investigate
which functional form describes best the association between
FEV1 and time-to-first exacerbation. In this work,
we assumed the underlying values of FEV1 to be related
to time-to-first exacerbation as a first step to establishing an association between
the two outcomes;[2,27,28] however, our future research focuses on investigating which
functional form of FEV1 is associated with the survival
outcome exacerbations. Furthermore, in our model, recurrent exacerbations and death
are not taken into account. Although there is a large database available in the US
Registry, we used only a subset in order to make it feasible to run the proposed
model. This subset has particular characteristics, and it cannot be generalized to
all patients in the Registry. Therefore, the presented results do not reflect the
diversity of the whole database.Possible extensions would be to include more covariates also in the survival submodel
in order to take into account extra information regarding the patients. Furthermore,
using the proposed model for obtaining future FEV1
measurement and time-to-first exacerbation probabilities could lead to more
efficient treatment prioritization and clinical management for patients with CF. In
this paper, we used an overfitted mixture model and a non-informative Dirichlet
prior for the class proportion in order to obtain the optimal number of classes.
Overviews of mixture modeling, mixtures of Dirichlet processes, and how to identify
the optimal number of classes using a Dirichlet prior can be found in Escobar and
West and in Frühwirth-Schnatter and Malsiner-Walli.[29,30]Click here for additional data file.Supplemental material, sj-pdf-1-smm-10.1177_0962280220924680 for Integrating
latent classes in the Bayesian shared parameter joint model of longitudinal and
survival outcomes by Eleni-Rosalina Andrinopoulou, Kazem Nasserinejad, Rhonda
Szczesniak and Dimitris Rizopoulos in Statistical Methods in Medical
Research
Authors: Jessica Barrett; Peter Diggle; Robin Henderson; David Taylor-Robinson Journal: J R Stat Soc Series B Stat Methodol Date: 2014-04-08 Impact factor: 4.488
Authors: Graeme L Hickey; Pete Philipson; Andrea Jorgensen; Ruwanthi Kolamunnage-Dona Journal: BMC Med Res Methodol Date: 2016-09-07 Impact factor: 4.615