Literature DB >> 35668699

MEGH: A parametric class of general hazard models for clustered survival data.

Francisco Javier Rubio¹, Reza Drikvandi².

Abstract

In many applications of survival data analysis, the individuals are treated in different medical centres or belong to different clusters defined by geographical or administrative regions. The analysis of such data requires accounting for between-cluster variability. Ignoring such variability would impose unrealistic assumptions in the analysis and could affect the inference on the statistical models. We develop a novel parametric mixed-effects general hazard (MEGH) model that is particularly suitable for the analysis of clustered survival data. The proposed structure generalises the mixed-effects proportional hazards and mixed-effects accelerated failure time structures, among other structures, which are obtained as special cases of the MEGH structure. We develop a likelihood-based algorithm for parameter estimation in general subclasses of the MEGH model, which is implemented in our R package MEGH. We propose diagnostic tools for assessing the random effects and their distributional assumption in the proposed MEGH model. We investigate the performance of the MEGH model using theoretical and simulation studies, as well as a real data application on leukaemia.

Entities: Chemical

Keywords: Accelerated failure time; diagnostic tool; proportional hazards; random effects; survival data

Mesh：

Year: 2022 PMID： 35668699 PMCID： PMC9315191 DOI： 10.1177/09622802221102620

Source DB: PubMed Journal: Stat Methods Med Res ISSN： 0962-2802 Impact factor: 2.494

1 Introduction

In the analysis of time-to-event data, it is common to obtain clustered samples. For instance, in medical statistics, one may be interested in modelling the survival times of a sample of patients who receive attention in different health centres with varying characteristics or when the patients live in different geographic or administrative regions. This clustering or grouping information needs to be incorporated in any survival model to avoid under-estimation of the standard errors of the parameter estimators. Moreover, ignoring unobserved heterogeneity, due to clustering, in hazard regression models has an effect on model misspecification as the marginal model obtained by integrating out random effects may not coincide with the model ignoring clustering.[1-3] A simple way to account for clustered survival data consists of fitting a stratified survival model. This is typically done by fitting a proportional hazards (PHs) model for each strata level with different baseline hazards but sharing the same regression coefficients. Alternatively, when the number of clusters is small, the cluster indicator can be coded as a categorical variable. The main disadvantage of these approaches is that they do not account for between-cluster variability. Modelling the clustering effect is more formally done by incorporating random effects, which are random variables that capture the hierarchical structure of the data and account for unobserved variability or heterogeneity between clusters. Regression models, including survival models, that incorporate random effects are often referred to as mixed-effects models (we refer the reader to McCulloch et al. and the references therein for an extensive review of this kind of models). The most common mixed-effects models for time-to-event data correspond to extensions of common regression survival models. For example, the mixed-effects PHs (MEPHs) model[6,7] is an extension of the semiparametric Cox PHs models. This model is obtained by adding random effects, indexed by the cluster indicator, to the linear predictor function in the Cox model. Another common strategy to capture clustering consists of adding random effects to the linear predictor function in parametric and semiparametric accelerated failure time (AFT) models, leading to the so-called mixed-effects AFT (MEAFT) model (see Rubio and Steel and Do Ha and Lee for a review on these models). A related class of mixed-effects models is constructed by adding random effects or Gaussian processes to the linear predictor function in hazard-based (parametric and semiparametric) regression models.[10,11] Some examples of these kinds of models are the mixed-effects spline regression models formulated at the cumulative hazard level by Crowther et al.,[12,13] Wood, or those formulated at the hazard level by Vaida and Xu and Charvat et al. In a related vein, Zhou and Hanson proposed mixed-effects models by adding random effects to PHs, proportional odds and AFT models in a Bayesian semiparametric framework. These classes of models are thus characterised by aiming at capturing the clustering effect via the incorporation of random effects into the linear predictor of different sorts of fixed effects hazard-based regression models. A natural question about mixed-effects models is the effect of misspecification of the random effects structure on the estimation of quantities of interest. It has been found that misspecifying the random-effects distribution has a negligible effect on the point estimation of the fixed effects parameters of some classes of random effects models. However, there are more tangible effects of this misspecification on interval estimation of the parameters and predictions[8,19] as well as point and interval estimation of the parameters of the baseline hazard. More crucially, ignoring a significant random effect could affect both parameter estimation and inference. Tools for diagnosing misspecification of the random-effects part have been developed for mixed-effects models (without censoring) in recent years (see the authors in[22-24]). We develop a novel mixed-effects general hazard (MEGH) structure which incorporates random effects into a hazard-based regression model[25,26] at the hazard scale and the time scale. The resulting model can capture clustering affecting the hazard scale and the time scale, and contains the MEPH and MEAFT models as particular cases. We focus our attention on two tractable subclasses of mixed-effects models that generalise the MEPH and MEAFT models. The specification of the MEGH model requires modelling a baseline hazard, for which we employ several flexible parametric distributions. We provide diagnostic tools for the random-effects part of the proposed classes of models in the presence of censoring. We conduct extensive simulation studies that reveal that not only misspecifying the random-effects distribution is problematic, but also misspecifying the role of the random effects in the hazard structure has adverse effects. The remainder of the paper is organised as follows. In Section 2, we describe the proposed MEGH model and discuss two submodels of interest. In Section 3, we present the expressions for the conditional and marginal likelihood functions, and define the marginal maximum likelihood estimator (MMLE), which defines the estimation approach followed in this paper. We also present theoretical results regarding the consistency and asymptotic normality of the MMLE under standard regularity conditions. Section 4 presents a test for the inclusion of random effects, as well as a graphical diagnostic tool to assess the goodness-of-fit of the random-effects distribution. Section 5 presents an extensive simulation study which illustrates the performance of the proposed methodology as well as an exploration of the effects of misspecifying the distribution and the role of the random effects. Section 6 presents an application with real data in the context of cancer survival with a spatial component. We conclude with a general discussion and possible extensions of our work in Section 7.

2 MEGH: Mixed-effects general hazard models

In this section, we describe the proposed MEGH model, and discuss two general subclasess of interest. To this end, let us first introduce some notation. Let , indicates the cluster, denotes the individuals, be a sample of times-to-event of interest, and be a vector of covariates associated with the th individual. Let be the right-censoring times, and be the observed survival times. Let be the vital status indicators (1 - dead, 0 - alive), and be the total sample size across clusters. We propose the MEGHs model, which is defined through the individual hazard function: where is a baseline hazard, , , is a vector of covariates affecting the hazard scale, is a vector of covariates affecting the time scale (typically, ), and , where is a continuous distribution with support on and zero mean. Denote by , , the design matrices, and by the sub-matrices with the rows associated with the uncensored individuals. To avoid collinearity problems, we assume that the matrices associated with uncensored individuals are full column rank. The MEGH model represents an extension of the general hazard (GH) structure by Chen and Jewell, which is obtained by removing the random effects from (1). Clearly, the GH structure contains the PHs model for , the AFT model for , and the accelerated hazards (AHs) model for . Consequently, we also need to impose that the baseline hazard does not belong to the Weibull family, as in this case the GH structure, and consequently the MEGH structure, becomes non-identifiable. Random effects are typically incorporated at the hazard scale (MEPH) or simultaneously at the hazard and time scales (MEAFT). The MEGH structure allows for incorporating random effects in both scales separately. The MEGH structure can also be seen as a generalisation of shared frailty survival models, which aim at accounting for unobserved heterogeneity between clusters.[27,16] Another appealing feature of the MEGH formulation is that the cumulative hazard function can be written in closed-form as follows The MEGH structure (1) is a very rich mixed hazard structure that contains, as particular cases, a number of hazard models of interest. In this paper, we particularly focus on the following two subclasses.

2.1 MEGH-I: Mixed Structure I

The first subclass, MEGH-I, is defined by the hazard and cumulative hazard functions The mixed hazard structure (2) contains the MEPH model ( ), while allowing for the inclusion of time-dependent effects through the parameter .

2.2 MEGH-II: Mixed Structure II

The second subclass, MEGH-II, is defined by the hazard and cumulative hazard functions The MEGH-II structure (3) generalises the MEAFT structure ( ). Figure 1 shows the link between the MEGH and some submodels of interest, while the proposed MEGH structure is much richer than illustrated in this figure.

Figure 1.

Diagram of the link between the MEGH and some particular hazard structures of interest. (AH: accelerated hazards; AFT: accelerated failure time; PH: proportional hazards; GH: general hazard; MEPH: mixed-effects proportional hazards; MEAFT: mixed-effects accelerated failure time; MEGH: mixed-effects general hazard). We will adopt parametric baseline hazards for models (2)–(3), based on flexible parametric distributions. This is an important step as the choice of the parametric baseline hazard determines the hazard shapes one can obtain. For instance, the log-normal hazard function is unimodal (up-then-down), while the Gamma hazard function can be increasing, decreasing or flat. There exists other (three-parameter) distributions that can capture the basic shapes of the hazard (increasing, decreasing, unimodal, and bathtub), such as the Exponentiated Weibull, Generalised Gamma and Power Generalised Weibull (PGW). However, it is important to keep in mind that an efficient estimation of the parameters of more flexible distributions typically requires larger sample sizes.[26,28] On the other hand, parametric models have been found to perform well, compared to semiparametric counterparts, in the context of clustered survival data.

3 Likelihood function and parameter estimation

We here describe the conditional and marginal likelihood functions, which will be used to calculate the marginal maximum likelihood estimators as well as to develop the diagnostic tools presented in the next section. We discuss the details behind the calculation of the marginal likelihoods and the optimisation methods. Finally, we present a theoretical result on the consistency and asymptotic normality of the marginal maximum likelihood estimators.

3.1 Likelihood and marginal likelihood functions

Let be the sample associated with the th cluster and denote the corresponding matrix of covariates. The cluster-specific marginal likelihood function associated with the th cluster, after integrating out the random effects and , is given by For the case where the random-effects distribution is assumed to be a multivariate distribution with mean and finite variance, with parameters , we can write down the marginal likelihood function of the MEGH model (1) in terms of the hazard and cumulative hazard functions as follows. To simplify notation, let us denote and let be the parameter space. The marginal likelihood function is where represents the cluster-specific marginal likelihood associated with the th cluster and denotes the log-likelihood function conditional on the random effects and . The marginal maximum likelihood estimator (MMLE) is then defined as .

3.2 Computations

To calculate the MMLE , we need to evaluate the marginal likelihood function (4), which is a product of integrals that define the cluster-specific marginal likelihoods (5). The integrand in (5) can be small when the number of individuals belonging to that cluster is moderate or large. To deal with this numerical challenge, we scale the integrand by using where , for fixed values of . This allows for a more stable numerical integration, as the integrand is now bounded between (always taking the value at the maximum). Maximisation over the random effects is simpler as their variance is rarely large. Numerical integration in our empirical examples, which include only one hierarchical level, is done by using the R command integrate with default options. Nonetheless, other methods, such as Laplace approximations or Monte Carlo integration, could also be used. The optimisation step is carried out using standard R routines, such as those implemented in the commands optim and nlminb. The R package MEGH (https://github.com/FJRubio67/MEGH) implements the MEGH-I and MEGH-II models under these specifications.

3.3 Theory

Here, we prove the consistency and asymptotic normality of the MMLE, when (which implicitly implies ), under Assumptions A1–A7 given in the online Supplemental Material. We denote by the unknown true value of the parameters, which is assumed to be an interior point of the parameter space . Consider the MEGH model (1). Let be a pre-specified parametric baseline hazard as in Section 2. Under Assumptions A1–A7 given in the online Supplemental Material, it follows that where is the expectation matrix evaluated at . Consistency: as , Asymptotic normality: as , Regarding the required Assumptions A1–A7, we note that Assumption A1 is a restriction to a compact parameter space, which can be assumed to be a large compact set. Assumption A2 concerns the censoring mechanism, which is assumed to be non-informative. Assumption A3 guarantees identifiability and continuity of the MEGH model. Assumption A4 togheter with Assumptions A5–A7 represent standard integrability and differentiability conditions on the marginal likelihood function, which implicitly impose conditions on the choice of the baseline hazard. Assumption A6 assumes the existence and non-singularity of the information matrix. Verifying such condition in practice for specific choices of the parametric baseline hazard is complicated. However, it has been shown that certain smoothness and identifiability conditions, which are easier to verify in practice, guarantee the non-singularity of this matrix (see Asgharian ). Similar asymptotic results have also been obtained for parametric copula survival models for clustered data in Prenen et al., where parametric models were found to perform better than their semiparametric counterparts.

4 Diagnostic tools for random effects in the MEGH model

As already discussed, the proposed MEGH model (1) incorporates random effects to capture the unknown variability between clusters or groups in the survival data. Since random effects are latent and unobservable variables, their assumed distribution can be subject to misspecification, so it is important to check their distributional assumption to identify potential model misspecification. It is also important to test which random effects are statistically significant and should be included in the model. These two problems are crucial especially for practical use, so we study them for the MEGH model in the following two subsections, respectively.

4.1 Detecting misspecification of the random-effects distribution

We here present a diagnostic tool for detecting misspecification of the random-effects distribution in the MEGH model. The assumed random-effects distribution is typically a multivariate normal distribution. To check the appropriateness of an assumed random-effects distribution, suggested to use the so-called gradient function for models with random effects. Their diagnostic tool based on a gradient function is not directly applicable to the MEGH model (1). We extend the gradient function approach for the MEGH model. For this purpose, we use the key idea of the gradient function, that is, to check if the marginal log-likelihood of the model can be increased considerably by replacing with another random-effects distribution, say . If so, then the assumed distribution may not be adequate for the random effects because another random-effects distribution produces a considerably larger marginal likelihood. Recall that is the combined vector of the model parameters. We here write the marginal likelihood as , instead of , to emphasise its dependence on the assumed random-effects distribution . To check if the marginal log-likelihood can be increased considerably by replacing with another random-effects distribution , we use the directional derivative of the marginal log-likelihood at into the direction as follows Then, there is no better random-effects distribution than if for all . We can write that in which is the gradient function for the MEGH model, defined as Hence, we have for all , if . The gradient function (6) has useful properties. The calculation of the gradient function does not require specifying an alternative distribution . Also, since , we must have for all random effects points in the support of . As a graphical diagnostic tool, we plot the gradient function versus random effects points , and if the gradient plot does not exceed then the assumed random-effects distribution will be deemed to be adequate for random effects; otherwise, the random-effects distribution is deemed to be misspecified. See and who applied a similar approach to generalised linear and nonlinear mixed models. The calculation of the gradient function (6) is straightforward as it only requires the calculation of the marginal and conditional distributions for all clusters. Note that the dependence of the gradient function on the parameters is suppressed for simplicity of the notation, however, we replace these parameters with their estimates when calculating the gradient function.

4.2 Tests for the need of random effects in the MEGH model

Since random effects are latent and unobservable variables, it is not obvious in practice which random effects must be included in the model. As an initial check, one can use a plot of the survival curves for all clusters to get some understanding of the variability between clusters and the potential need of random effects (see an example in Figure 4). We provide formal tests for verifying which random effects are statistically significant and must be present in the MEGH model. If the test confirms that all the random effects are not significant, then the extended model (1) will not be essential and one could use the reduced models without random effects as in Chen and Jewell and Rubio et al. The test for inclusion or exclusion of random effects can be carried out via a test for zero random effects. From a statistical perspective, to test for zero random effects is equivalent to testing if their variance parameters are equal to . We want to test whether or not the random effects and can be excluded from the MEGH model (1). This hypothesis testing problem can be expressed as follows

Figure 4.

Leukaemia data: Kaplan-Meier estimators of survival by district.

Leukaemia data: Kaplan-Meier estimators of survival by district. This is a non-standard testing problem because the null hypothesis places the true variance parameters on the boundary of the parameter space.[33-36] Classical tests such as the likelihood ratio, Wald and score tests suffer from testing on the boundary of the parameter space, because the regularity conditions do not hold under such situations. As a consequence, the usual asymptotic chi-squared distribution of the likelihood ratio or score statistic is incorrect here. The correct asymptotic distribution is well understood to be a mixture of chi-squared distributions when under some mild conditions.[33,34] The hypotheses and in (7) correspond to Case 7 of Self and Liang. Thus, under , the correct asymptotic distribution of the likelihood ratio statistic for testing (7) would be , where is a point mass at , and and are the chi-squared distributions with one and two degrees of freedom, respectively. The -value of the likelihood ratio test for the hypothesis test (7) is as follows (see also Verbeke and Molenberghs ) where denotes the observed likelihood ratio statistic. For the MEGH model with the mixed hazard structures (2)–(3), since and are the same, the hypothesis test (7) collapses to The correct asymptotic distribution of the likelihood ratio statistic for this hypothesis test is , as it corresponds to Case 5 of Self and Liang. We will apply the above diagnostic tools to our case study in Section 6. We note that there are also non-asymptotic tests available for testing random effects, which avoid the boundary issue. Bootstrap and permutation tests as well as Bayesian tests do not rely on asymptotic results and hence avoid boundary issues when testing random effects. Those tests have been well studied for linear and generalised linear mixed models[24,36,38]; however, such tests may not be easily applied to the case of MEGH model, due to the presence of censored and clustered data which makes bootstrap or permutation challenging. Furthermore, Verbeke and Molenberghs and Claeskens et al. provide asymptotic score tests that can be applied in this context.

5 Simulation studies

In this section, we conduct simulations to evaluate the performance of the proposed MEGH model. In the simulations, we first examine the accuracy of parameter estimation as well as statistical inference with the MEGH model under different scenarios. We investigate how the MEGH model compares with the model ignoring random effects. Such comparisons are crucial in demonstrating the advantages of the proposed MEGH model over the models ignoring random effects. We then assess the impact of misspecifying the mixed structure as well as misspecifying the random-effects distribution. We also examine the finite-sample performance of the diagnostic tests for testing random effects in the MEGH model. Some of the simulation results are reported in the online Supplemental Material for the sake of space. In the simulations, we study both mixed hazard structures (2)–(3) and consider the following MEGH model, according to our real data application in Section 6, where the covariates are in accordance with the leukaemia data in Section 6, and is the baseline hazard for which we here consider the PGW baseline hazard with parameters as in the online Supplemental Material. We also conduct simulations with the log-logistic baseline hazard, which are presented in the Supplemental Material. Recall that for the mixed hazard structure MEGH-I, and for the mixed hazard structure MEGH-II. In the simulations, we choose the true parameter values according to the estimates obtained from the leukaemia data application. The number of clusters is 24. We simulate 250 data sets from the above model with each of the hazard structures MEGH-I and MEGH-II, each with the same size as of the real data application (i.e. ), using our R package MEGH available in the online Supplemental Material. The random effects are generated from a normal distribution with mean and two standard deviation values of and . The censoring rate is set to . We fit the MEGH model with both mixed hazard structures (2)–(3) separately to the 250 simulated data sets. We also fit the model ignoring random effects to the simulated data sets. We calculate the bias of the parameter estimates from each of these three models. Figures 2 and 3 show the estimation bias for these three models. It can be seen that both MEGH models with mixed hazard structures (2)–(3) produce estimates with considerably smaller bias compared to the model ignoring random effects. The bias of the model ignoring random effects for most parameters is higher due to neglecting the presence of random effects. However, we note that the estimates obtained from models with or without random effects are not generally comparable like with like, as discussed by Lee and Nelder. This is because the model with random effects is conditional on random effects, while the model without random effects is marginal (or population-average).

Figure 2.

Figure 3.

The bias of the estimates from the three methods: MEGH-I, MEGH-II and MLE, for all the parameters based on simulation replications when the simulated data are generated from model (9) with the mixed structure I and PGW baseline hazard, and a normal distribution for the generated random effects with . The bias of the estimates from the three methods: MEGH-I, MEGH-II and MLE, for all the parameters based on simulation replications when the simulated data are generated from model (9) with the mixed structure II and PGW baseline hazard, and a normal distribution for the generated random effects with . We also observe from the simulation results presented in Figures 2 and 3 that when the mixed hazard structure is misspecified it leads to relatively higher bias compared to the model with the correct mixed hazard structure. This suggests that the choice of the mixed hazard structure is crucial for achieving a reliable model fit and results. We note that the bias of the MEGH model with the correct mixed hazard structure is very small in the simulations. Table 1 presents the confidence intervals for all the regression parameters obtained from the three models considered for the simulation case when the true mixed structure is MEGH-I. The results show that the model with the correct mixed structure MEGH-I produces reliable confidence intervals, but both the model ignoring random effects and the model with the incorrect mixed structure MEGH-II tend to produce inaccurate confidence intervals. In particular, the confidence intervals from the model ignoring random effects mostly do not cover the true parameter values.

Table 1.

Parameter	True value	Method	95% confidence interval
		MLE	(0.0880,0.0933)
η	0.20	MEGH-I	(0.1999,0.2089)
		MEGH-II	(0.2815,0.3017)
		MLE	(1.6734,1.7146)
ν	1.50	MEGH-I	(1.5012,1.5244)
		MEGH-II	(1.3492,1.3762)
		MLE	(5.4804,5.7141)
δ	3.00	MEGH-I	(2.9988,3.0882)
		MEGH-II	(2.3658,2.4574)
		MLE	(0.9812,1.0106)
α	0.96	MEGH-I	(0.9307,0.9643)
		MEGH-II	(1.1294,1.1829)
		MLE	(0.9847,1.0010)
β1	1.00	MEGH-I	(0.9951,1.0054)
		MEGH-II	(0.9676,0.9782)
		MLE	(0.0596,0.0785)
β2	0.08	MEGH-I	(0.0741,0.0919)
		MEGH-II	(0.0713,0.0891)
		MLE	(0.1540,0.1633)
β3	0.22	MEGH-I	(0.2204,0.2289)
		MEGH-II	(0.2172,0.2258)
		MLE	(0.0641,0.0872)
β4	0.10	MEGH-I	(0.0914,0.1009)
		MEGH-II	(0.0901,0.1001)

The confidence intervals for all the regression parameters from the three methods: MEGH-I, MEGH-II and MLE, for the case when the true mixed structure is MEGH-I and the random effects are generated from a normal distribution with . Table 2 shows the average AIC of the model fit across the simulation replications for the three models considered. The results indicate that the MEGH model with the correct mixed hazard structure has the smallest AIC, while the model ignoring random effects tends to produce a very large AIC. Table 2 also presents the average power of the likelihood ratio test for random effects across the simulation replications, when the random effects are generated from a normal distribution with . It can be seen that the test provides a high power with this sample size. Further simulation results on the performance of the test for random effects under different number of clusters, censoring rates and variance values can be found in the Supplemental Material.

Table 2.

Average AIC of the model fit and average power of the test for random effects across simulation replications, when the random effects are generated from a normal distribution with .

True structure	Fitted model	AIC	Power
	MLE	1390.66	−
MEGH-I	MEGH-I	927.84	1.0
	MEGH-II	975.79	1.0
	MLE	1383.18	−
MEGH-II	MEGH-I	996.35	1.0
	MEGH-II	962.57	1.0

Average AIC of the model fit and average power of the test for random effects across simulation replications, when the random effects are generated from a normal distribution with .

6 Real data application

In this section, we apply the MEGH model to analyse the data set LeukSurv, which contains information on the survival of patients of acute myeloid leukaemia in northwest England, recorded between 1982 and 1998. This data set is available in the R package spBayesSurv. Previous analyses of this data set have suggested that there is evidence of variation in survival across this region, and we can see that this is also suggested by the nonparametric Kaplan-Meier estimators of the survival curves by district shown in Figure 4. We use information about the survival time (in years), vital status at the end of follow-up (0 – right-censored, 1 – dead), age (in years), sex (0 – female, 1 – male), white blood cell count at diagnosis (wbc, truncated at 500), the Townsend score (tpi, higher values indicates less affluent areas), and administrative district of residence (district, 24 districts). We fit the MEGH models with hazard structures MEGH-I (2) and MEGH-II (3), and with the PGW and log-logistic baseline hazards. For the random-effects distribution, we use the normal, Student- and two-piece normal distributions (to account for heavier tails than normal and asymmetry). In all fitted models, we consider the time dependent effect ( ) of age (standardised), and the hazard-level effects ( ) of age (standardised), sex, white blood cell count at diagnosis (wbc, standardised) and the Townsend score (tpi, standardised). The variable district is used to define the random effects in the two mixed hazard structures considered. In addition, we fit the models ignoring random effects for comparison. We also assess the need for including random effects using the diagnostic tests in Section 4.2. The best model selected using AIC (1553.725) is the model with the mixed structure MEGH-I, log-logistic baseline hazard (with log-location and scale parameters ) and normal random effects. The AIC for the model ignoring random effects with log-logistic baseline hazard is 1556.366, which is fairly close to the AIC value for the best model. The reason for this is that the estimate of the variance of random effects is relatively small ( ), and the number of clusters ( ) is not large in this data set. However, the p-value for testing is , which suggests that the random effect is significant and must be present in the model. This implies that there is significant between-cluster variability after incorporating the information on the observed covariates (see also the Box and Whisker plot in the online Supplemental Material). Figure 5 shows the maps of aggregated summaries at district level. We notice from Figure 5(c) and (d) that the marginal survival functions (which is obtained as the average of cluster-specific marginal survival functions using Monte Carlo integration) at years exhibit some variability by district. Figure 5(a) and (b) shows that the distribution of mean age and mean Townsend score is also quite different for the different districts. Consequently, survival gaps between different districts are explained by a combination of complex factors, including different age and Townsend distribution, a conclusion that could be harder to obtain with fully nonparametric methods that require data stratification.

Figure 5.

Leukaemia data: Maps showing the aggregated summaries for each district: (a) mean age, (b) mean Townsend score, (c) marginal 1-year survival and (d) marginal 5-year survival.

Leukaemia data: Maps showing the aggregated summaries for each district: (a) mean age, (b) mean Townsend score, (c) marginal 1-year survival and (d) marginal 5-year survival. The gradient function plot for the model MEGH-I fitted under normal random effects is presented in Figure 6(a), together with the confidence bands similar to Verbeke and Molenberghs, which indicates that the normality assumption on random effects is valid since the gradient values remain under or close to . To double check whether the fit can be improved by using a distribution for random effects which allows to capture asymmetry, we calculate the gradient function plot with the two-piece normal distribution for random effects. The gradient plot with the confidence bands for the two-piece normal distribution, shown in Figure 6(b), is very similar, suggesting that the usual normal distribution is adequate for random effects.

Figure 6.

Leukaemia data: (a) Gradient function plot with the confidence bands for the MEGH model (2) with normal random effects fitted to the leukaemia data. (b) Gradient function plot with the confidence bands for the MEGH model (2) with a two-piece normal distribution for the random effects.

7 Discussion

We have introduced a general mixed-effects hazard structure (MEGH), as well as two general subclasses of interest (MEGH -I and MEGH-II). The proposed structures generalise classical survival regression models of interest in practice, such as the MEPH and MEAFT. We have adopted a flexible parametric approach, which allowed us to prove asymptotic results under standard regularity conditions. Our simulation studies show that the estimates of the parameters of the proposed models, based on the marginal likelihood, have good frequentist properties. Another contribution of this work consists of evaluating the impact of misspecifying the role of the random effects and potentially the random-effects distribution. Our simulation study shows that not only misspecifying the distribution of the random effects has an impact on the quality of the inference on the parameters, but also misspecifying the role of the random effects in the hazard regression model, a topic that has received little attention in survival analysis. The routines used to produce our results are implemented in the R package MEGH which is available, along with the real data application, at https://github.com/FJRubio67/MEGH. In our simulations and applications, we have focused on the use of the MEGH-I and MEGH-II structures, as these represent generalisations of the most common mixed models in survival analysis. However, the implementation of the general structure (1), with two random effects, represents a possible extension of this work. Such extension does not only bring computational challenges, as one requires double numerical integration, but also the modelling of the dependence between the two random effects. The latter can be explored by using copulas to capture different types of dependencies, and a variety of parametric marginal distributions. The tractability of the MEGH model allows for several extensions, such as accounting for left-censored or interval censored times. A more substantial extension corresponds to the case where the distribution of the random effects reflects a spatial multilevel structure. Another possible extension consists of modelling the baseline hazard using nonparametric (frequentist or Bayesian) methods. Extensions to the inclusion of more than one hierarchical level would also be of practical interest. This could be done simply by considering multivariate random effects where and are random effects covariates associated with the hazard and time scales respectively, and , where is a continuous distribution with support on and zero mean. We emphasise that, although interesting, this strategy requires a careful study of the identifiability of parameters. Our formulation remains valid for these extensions, but the main challenges relate to the development of computational methods for addressing those questions. Click here for additional data file. Supplemental material, sj-pdf-1-smm-10.1177_09622802221102620 for MEGH: A parametric class of general hazard models for clustered survival data by Francisco Javier Rubio and Reza Drikvandi in Statistical Methods in Medical Research

18 in total

1. Modeling spatial survival data using semiparametric frailty models.

Authors: Yi Li; Louise Ryan
Journal: Biometrics Date: 2002-06 Impact factor: 2.571

2. The use of score tests for inference on variance components.

Authors: Geert Verbeke; Geert Molenberghs
Journal: Biometrics Date: 2003-06 Impact factor: 2.571

3. A goodness-of-fit test for the random-effects distribution in mixed models.

Authors: Achmad Efendi; Reza Drikvandi; Geert Verbeke; Geert Molenberghs
Journal: Stat Methods Med Res Date: 2014-12-24 Impact factor: 3.021

4. On a general structure for hazard-based regression models: An application to population-based cancer research.

Authors: Francisco J Rubio; Laurent Remontet; Nicholas P Jewell; Aurélien Belot
Journal: Stat Methods Med Res Date: 2018-08-01 Impact factor: 3.021

5. Testing multiple variance components in linear mixed-effects models.

Authors: Reza Drikvandi; Geert Verbeke; Ahmad Khodadadi; Vahid Partovi Nia
Journal: Biostatistics Date: 2012-08-28 Impact factor: 5.899

6. The gradient function as an exploratory goodness-of-fit assessment of the random-effects distribution in mixed models.

Authors: Geert Verbeke; Geert Molenberghs
Journal: Biostatistics Date: 2013-01-31 Impact factor: 5.899

7. Nonlinear mixed-effects models with misspecified random-effects distribution.

Authors: Reza Drikvandi
Journal: Pharm Stat Date: 2019-10-29 Impact factor: 1.894

8. Testing random effects in the linear mixed model using approximate bayes factors.

Authors: Benjamin R Saville; Amy H Herring
Journal: Biometrics Date: 2009-06 Impact factor: 2.571

9. A multilevel excess hazard model to estimate net survival on hierarchical data allowing for non-linear and non-proportional effects of covariates.

Authors: Hadrien Charvat; Laurent Remontet; Nadine Bossard; Laurent Roche; Olivier Dejardin; Bernard Rachet; Guy Launoy; Aurélien Belot
Journal: Stat Med Date: 2016-02-28 Impact factor: 2.373

10. A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications.

Authors: Peter C Austin
Journal: Int Stat Rev Date: 2017-03-24 Impact factor: 2.217