Literature DB >> 33104698

Bayesian variable selection in linear quantile mixed models for longitudinal data with application to macular degeneration.

Abstract

This paper presents a Bayesian analysis of linear mixed models for quantile regression based on a Cholesky decomposition for the covariance matrix of random effects. We develop a Bayesian shrinkage approach to quantile mixed regression models using a Bayesian adaptive lasso and an extended Bayesian adaptive group lasso. We also consider variable selection procedures for both fixed and random effects in a linear quantile mixed model via the Bayesian adaptive lasso and extended Bayesian adaptive group lasso with spike and slab priors. To improve mixing of the Markov chains, a simple and efficient partially collapsed Gibbs sampling algorithm is developed for posterior inference. Simulation experiments and an application to the Age-Related Macular Degeneration Trial data to demonstrate the proposed methods.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Year: 2020 PMID： 33104698 PMCID： PMC7588124 DOI： 10.1371/journal.pone.0241197

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Quantile regression(QR) in longitudinal or panel data models have received increasing attention in recent years. For example, [1] proposed a general method to estimate the QR coefficients by using a L1 penalized likelihood approach. Later [2] generalized the work of [1] to be more flexible model with endogenous variables. [3] offered a estimation of individual effects for panel data QR models, which is widely applied by practitioners because of computationally simple. [4] studied quantile panel models with correlated random effects. [5] proposed a Stochastic Approximation of the EM (SAEM) algorithm to analyze linear quantile mixed regressions(LQMMs) via the asymmetric Laplace distribution. Other related papers include [6-9], among many others. Like all regression issues, when there are many covariates in longitudinal or panel QR models, variable selection becomes necessary to avoid overfitting and multicollinearity. [10] presented a penalized quantile regression model for random intercept using the Bayesian lasso priors and Bayesian adaptive lasso priors, respectivly. [11] also considered a Bayesian Lasso approach to jointly estimate a vector of covariate effects and a vector of random effects by introducing an l1 penalty. However, for the random effects, their method mainly focus on the penalty of diagonal elements in the covariance matrix of random effects after a modified Cholesky decomposition. In addition, they neglect the nonnegative constraint on the diagonal terms in the covariance matrix of random effects after a modified Cholesky decomposition, which can cause non-uniqueness in the decomposition [12]. Then, we first propose a new Bayesian shrinkage approach to solve these problems in the LQMMs. In particular, we develop an extended Bayesian adaptive group lasso which can accommodate the nonnegative constraint of the diagonal terms to attempt to identify the non-zero random effects. Unfortunately, the shrinkage approach need some ad hoc methods to do variable selection similar to [10] and [11] for the well known reason. An alternative to the Bayesian variable selection is to use spike and slab priors. We then develop a general variable selection technic via the Bayesian adaptive lasso and the extended Bayesian adaptive group lasso with spike and slab priors to identify the significant fixed and random effects simultaneously. The paper is organized as follows. Section 2 presents the reparameterized LQMMs based on a Cholesky decomposition of covariance matrix of random effects. Section 3 describes the structure of our hierarchical Bayesian LQMMs and discuss our prior specifications. We also develop a Partially collapsed Gibbs (PCG) sampling algorithm for posterior inference. In section 4, we will develop a variable selection procedure for both fixed and random effects in a LQMM with spike and slab priors and an efficient PCG. We illustrate the performance of our method by simulation studies and a real data example in section 5 and 6. Results show that the proposed approach performs very well. In Section 7, we conclude the paper.

The re-parameterized LQMMs

Consider n subjects containing n observations, , for j = 1, …, n and , where y is the jth response for subject i, is the jth row of a known n × p matrix and is the jth row of a known n × q matrix . The canonical linear mixed model is as follows: where is a p × 1 vector of unknown population fixed parameters, is a q × 1 vector of unobservable random effects with covariance matrix . In order to guarantee the positive semidefinite of , we adopt the Cholesky decomposition to it, resulting in where Γ = (γ) is a q × q lower triangular matrix with nonnegative diagonal entries. Similar to [13], the model (1) can be re-expressed as follows: Here is a 1 × q(q + 1)/2 vector, where γ consists of the non-zero lower triangular elements of the kth row of Γ, i.e., , k = 1, …, q, is a transformation matrix with size q2 × q(q + 1)/2 such that vec(Γ) = γ, I denotes the q × q identity matrix. For example, if q = 2 the transformation matrix 2 would have the following form With model (2), the linear conditional τth (0 < τ < 1) mixed quantile estimator can be calculated by In a Bayesian setup, this is equivalent to assuming that the error terms ε in (2) follow the asymmetric Laplace distribution (ALD), of which the probability density function is given by where σ−1 and τ are the scale and skewness parameters, respectively. [14] first used the ALD to establish Bayesian QR in a linear model for independent data and proposed a random walk Metropolis-Hastings algorithm to draw samples. Since then, there has been a great deal of literature on the extension and application of the QR based on the ALD such as [15-18]. However, direct sampling using the ALD is often inconvenient and not easy to generalize to more complex scenarios. We employ a mixture representation of ALD proposed by [19] to facilitate Bayesian inference in LQMMs, which was also used by many authors (e.g., see [11, 20–22]), where v ∼ exp(1/σ) has an exponential prior with mean 1/σ, ζ ∼ N(0, 1) has a standard normal prior and is independent of v, and . Let = (v: i = 1, …, n; j = 1, …, n), = (ζ: i = 1, …, n; j = 1, …, n). Hence, based on the Cholesky decomposition and the mixture representation of ALD, we obtain the following hierarchal model,

Shrinkage estimation of linear quantile mixed regression

We first propose a new Bayesian shrinkage approach in LQMMs by using the Bayesian adaptive lasso and an extended Bayesian adaptive group lasso. A regularized τth quantile estimates of the fixed and random effect coefficients can be defined by Motivated by [23], we put a Laplace priors on β, where , a Laplace priors , where , and suppose that the error terms ε come from the ALD (4). Then, the posterior distribution of and is proportional to So it can be easily observed that maximizing the posterior distribution (7) is equivalent to minimizing the target function (6) if we ignore the nonnegative constraints γ ≥ 0, k = 1, …q. However, it is not clear how to incorporate this constraints within the Bayesian Penalized LQMMs framework. Using truncated normal priors, [24] and [25] solved a similar problem on the selection of fixed effects, which is non-group structure in linear and generalized mixed model, respectively. We extend the idea to the Bayesian Group lasso on the shrinkage of random effects with group structure. We specify the conditional prior distributions of as . To facilitate posterior inference, we utilize a mixture representation to yield where . Then, can be expressed hierarchically as for k = 1, …, q. Moving to the fixed effects parameters, can also be written hierarchically as follows for s = 1, …, p. There are two main ways to estimate the tuning parameter in Bayesian regularization: first, a fully Bayesian method, which specifies a hyperprior on it, such as a conjugate Gamma distribution [23], and second, an empirical Bayesian method, which estimates it using maximum marginal likelihood method [26]. Here we use the fully Bayesian approach and set Gamma priors on the parameters , and σ to complete the Bayesian structure. We develop a Partially collapsed Gibbs (PCG) sampling sampler by integrating the full conditional distribution of with respect to {b} to sample from posterior distributions. In order to improve the convergence behavior of the Gibbs sampler, [27] proposed PCG which is a generalization of blocking and collapsing method [28]. The key idea of the PCG is to replace some of the posterior distributions with marginal posterior distribution while preserving the target distribution. Next, we present the conditional distribution of unknown parameters. (1) Conditional distribution of : Let = diag(t, s = 1, …, p) and . The conditional distribution of is then a , where (2) Conditional distribution of : The conditional distribution of is a multivariate normal distribution with mean and variance where (3) Conditional distribution of : The full conditional distribution of follows a Inverse Gaussian(μ′, λ′) with p.d.f. Here and , where . (4) Conditional distributions of and : The full conditional distributions of and are independent Gamma distributions, (5) Conditional distributions of t and η: The full conditional distributions of t and η are independent Inverse Gaussian, (6) Conditional distribution of : Denote , i = 1, …, n; j = 1, …, n, and let be the covariate vector corresponding to , k = 1, …, q. The full conditional distribution of is then a truncated normal distribution where where and represent the covariate matrix corresponding to γ(. (7) Conditional distribution of σ: The full conditional distribution of σ is

Bayesian model selection in LQMMs

The shrinkage approach proposed in the above section has potential for variable selection in LQMMs. However, the estimators are never exact 0 due to the continuous priors on and . This section develops a general model selection method with spike and slab priors to identify the relative important effects. We first discuss the prior specification and propose the hierarchal model. Subsequently, we consider a Partially collapsed Gibbs sampling algorithm for posterior computation, and derive the relevant conditional distributions. We assume the following hierarchical Bayesian lasso with independent spike and slab type priors for : where δ0(⋅) denotes a point mass at 0, is the prior probability of excluding the sth fixed effect in the model, which is assigned a beta prior with parameters and , resulting in a noninformative uniform prior on (0, 1). Furthermore, we assume the following extended hierarchical adaptive Bayesian group lasso with independent spike and slab type priors for : where π0 is the prior probability of excluding the kth random effect in the model, which is also assumed to be a noninformative uniform prior on (0, 1). We also present the conditional distribution of unknown parameters. (1) Conditional distribution of : Let , and . The conditional distribution of β is then a spike and slab distribution, where (2) Conditional distribution of : The conditional distribution of is a multivariate normal distribution with mean and variance where (3) Conditional distribution of : The full conditional distribution of follows a Inverse Gaussian(μ′, λ′) with and , where . (4) Conditional distributions of λ1 and λ2: The full conditional distributions of λ1 and λ2 are independent Gamma distributions, (5) Conditional distributions of t and η: where InverseGamma(a, b) denotes a Inverse Gamma distribution with shape parameter a and scale parameter b. (6) Conditional distribution of and π0: The full conditional distributions of and π0 are independent Beta distribution. (7) Conditional distribution of : Denote , , , and let be the covariate vector corresponding to be the covariate vector corresponding to , k = 1, …, q. The full conditional distribution of is then a a spike and slab distribution, where Here, Φ(⋅) indicates the cumulative distribution function of the standard normal distribution, is the kth element of and is the kth diagonal element of . (8) Conditional distribution of σ: The full conditional distribution of σ is

Simulation studies

In this section we check the performance of the proposed extended Bayesian adaptive group lasso approach (adGL) and the Bayesian adaptive group lasso with independent spike and slab type priors approach (adSpikeGL) through simulations. We also compare them with other models: the PMQ method reported by [10], the BL and BAL methods proposed by [11] and the ordinary quantile regression estimator (QR) focusing solely on independent regression. The test data are generated from the following model: for i = 1, …, 50, j = 1, …, 5, where ε has the τth quantile equal to 0, random effects ∼ N(0, ), where Similar formulations of in mixed models have been used by [29] and [30]. We set and , where x, k = 1, …, 8 are drawn independently from the uniform [−2, 2]. The fixed effects parameters = (β1, …, β8) are set as follows: Simulation 1: = (3, 1.5, 0, 0, 2, 0, 0, 0)′ to illustrate a sparse case; Simulation 2: = (0.85, 0.85, 0.85, 0.85, 0.85, 0.85, 0.85, 0.85)′ to illustrate a dense case; Simulation 3: = (5, 0, 0, 0, 0, 0, 0, 0)′ to illustrate a very sparse case. With each simulation of , we generate data under three different quantiles i.e. τ = 0.1, 0.3 and 0.5, and four different error distributions: normal, normal mixture, laplace, and laplace mixture as follows (1) ε ∼ N(0, 9), i = 1, …, n, j = 1, …, n; (2) ε ∼ 0.1N(0, 1) + 0.9N(0, 5), i = 1, …, n, j = 1, …, n; (3) ε ∼ Laplace(0, 3), i = 1, …, n, j = 1, …, n; (4) ; We consider three sample sizes n = 30, n = 5; n = 50, n = 5; n = 100, n = 5. Therefore, we have a total of 36 simulation setups for each of the sample sizes. We simulate 100 data sets for simulation setups. Priors for the Bayesian methods are taken to be weak prior. The hyperparameters in the Gamma priors for tuning and scale parameters are all set to be 0.1. We run our PCG sampler for 20000 following initial burn-in 10000 cycles. The convergence of the proposed PCG is monitored by using the multivariate potential scale reduction factor (MPSRF) introduced by [31]. Figs 1–6 show that the MPSRF generally become stable and get close to 1 after about 10000 iterations under normal mixture error distribution, suggesting that the above burn-in size is large enough to ensure the convergence of PCG. The results for other error distributions are similar and omitted.

Fig 1

MPSRF for adGL in Simulation 1, when normal mixture error distribution.

Fig 6

MPSRF for adSpikeGL in Simulation 3, when normal mixture error distribution.

We use the posterior median estimates of and as our point estimator, denoted as and , respectively, and we consider two error measures including the mean absolute deviations (MAD), root mean squared errors (RMSE). To check the performance of the parameter estimation, we also calculate the median of mean square error for () and the median of quadratic loss error for (). More specifically, where is the predicted value of y, and the median is taken over the 100 simulations. The results of MAD and RMSE are summarized in Figs 7–12, meanwhile and are listed in Tables 1–9. Because BL, BAL, and QR neglect the covariance structure of random effects, we just report the results of the other three techniques for comparison.

Fig 7

Boxplots summarizing the MAD for three sample sizes in Simulation 1.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Fig 12

Boxplots summarizing the RMSE for three sample sizes in Simulation 3.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Table 1

Median of mean square errors () and quadratic loss errors() at τ = 0.1 for Simulation 1.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	3.17(11.314)	1.783(9.209)	4.034(14.133)	2.273(11.656)
	adSpikeGL	3.535(11.089)	1.475(9.02)	6.26(20.161)	2.409(12.694)
	PMQ	5.017(11.124)	4.127(9.899)	5.494(14.108)	4.364(12.485)
	BL	5.635(-)	5.919(-)	6.404(-)	7.011(-)
	BAL	5.377(-)	5.708(-)	5.61(-)	6.155(-)
	QR	9.964(-)	11.953(-)	10.803(-)	12.641(-)
n = 50	adGL	2.645(9.065)	1.508(7.383)	3.508(12.752)	2.002(9.291)
	adSpikeGL	2.102(9.102)	0.944(6.497)	3.443(14.761)	1.627(9.121)
	PMQ	2.643(8.517)	1.796(6.588)	3.151(11.198)	2.089(8.981)
	BL	4.422(-)	5.452(-)	4.114(-)	5.35(-)
	BAL	4.011(-)	4.763(-)	3.661(-)	4.736(-)
	QR	7.875(-)	10.489(-)	8.198(-)	10.686(-)
n = 100	adGL	2.125(7.452)	1.133(5.182)	3.094(11.117)	1.481(7.841)
	adSpikeGL	1.601(6.809)	0.802(4.374)	2.692(12.212)	1.164(7.423)
	PMQ	1.825(6.597)	1.019(4.745)	2.526(10.248)	1.192(6.914)
	BL	3.627(-)	4.737(-)	3.462(-)	4.885(-)
	BAL	3.2(-)	4.125(-)	2.923(-)	4.21(-)
	QR	7.128(-)	9.408(-)	7.445(-)	9.278(-)

Table 9

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 3.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	0.586(11.256)	0.415(9.501)	0.822(12.716)	0.564(11.175)
	adSpikeGL	0.195(7.985)	0.172(6.351)	0.246(9.296)	0.183(8.295)
	PMQ	0.912(9.032)	0.737(7.37)	0.952(11.751)	0.918(8.996)
	BL	1.733(-)	1.303(-)	1.955(-)	1.401(-)
	BAL	1.729(-)	1.282(-)	1.953(-)	1.445(-)
	QR	2.024(-)	1.479(-)	2.306(-)	1.686(-)
n = 50	adGL	0.428(8.755)	0.309(7.215)	0.453(10.334)	0.394(8.34)
	adSpikeGL	0.162(5.735)	0.127(5.397)	0.164(6.863)	0.101(5.69)
	PMQ	0.447(6.472)	0.4(5.933)	0.663(7.408)	0.404(5.941)
	BL	0.991(-)	0.886(-)	1.09(-)	0.925(-)
	BAL	0.99(-)	0.899(-)	1.111(-)	0.892(-)
	QR	1.193(-)	1.006(-)	1.314(-)	1.022(-)
n = 100	adGL	0.217(6.604)	0.143(5.139)	0.239(7.822)	0.166(5.982)
	adSpikeGL	0.068(4.448)	0.032(3.543)	0.053(6.163)	0.047(4.268)
	PMQ	0.231(4.655)	0.133(3.832)	0.256(5.194)	0.169(4.527)
	BL	0.463(-)	0.374(-)	0.53(-)	0.445(-)
	BAL	0.496(-)	0.377(-)	0.53(-)	0.452(-)
	QR	0.523(-)	0.413(-)	0.647(-)	0.481(-)

Boxplots summarizing the MAD for three sample sizes in Simulation 1.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Boxplots summarizing the RMSE for three sample sizes in Simulation 1.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Boxplots summarizing the MAD for three sample sizes in Simulation 2.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Boxplots summarizing the RMSE for three sample sizes in Simulation 2.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Boxplots summarizing the MAD for three sample sizes in Simulation 3.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Boxplots summarizing the RMSE for three sample sizes in Simulation 3.

The rows from top to bottom correspond to normal, normal mixture, laplace, and laplace mixture error distributions respectively.

Median of mean square errors () and quadratic loss errors() at τ = 0.1 for Simulation 1.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 1.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 1.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 2.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 2.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 2.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 3.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 3.

are in parentheses.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 3.

are in parentheses. A few observations can be seen from the results. Firstly, it can be seen from Figs 7–12 that the proposed approaches perform well in general, the adGL generally outperforms the other methods in terms of MAD and RMSE, which indicate explicitly penalizing the off-diagonal covariance components can improve the model selection procedure. Similar findings were also obtained by [32]. On the other hand, the adSpikeGL also has good performance, yet does not perform as well as the adGL. Secondly. Tables 1–9 show that adSpikeGL performs the best in terms of and . Take n = 50 for an example, adSpikeGL has the smallest in 22 out of 36 simulation setups, and the smallest in 24 out of 36 simulation setups. Though PMQ performs good for τ = 0.1, its performance drops when τ = 0.3 and τ = 0.5, especially in dense case (Simulation 2). Thirdly, Tables 1–9 also display that our approaches work well in terms of MAD, RMSE, and even when the true error term is not ALD, indicating that the LQMMs using ALD in model (2) is merely a working model with artificial assumptions that aim to achieve equivalence between the problems in maximizing ALD and minimizing (3). Finally, as we can observe from Figs 7–12 and Tables 1–9 that, in terms of the four criterions, adGL, adSpikeGl and PMQ all have an overall decreasing trend as the sample size increases, whereas BA, BAL and QR generally fail to estimate accurately, possibly because they almost completely ignore the random effects, which describe within-subject dependence among data, and their performance doesn’t get better as the sample size grows. For adSpikeGL, the median posterior probability model (MPPM) criteria [33] and the highest posterior probability model (HPPM) criteria are used for model selection, respectively. Tables 10–14 and 11–15 summarize the number of correct and wrong zero coefficients for all cases based on MPPM and HPPM, respectively. On the whole, adSpikeGL generally yield promising results in most cases, even for extreme quantile when τ = 0.1 with notoriously challenging. As the sample size grows from 30 to 100, the number of the wrong zero coefficients goes to zero at all considered quantiles, and the number of the correct zero coefficients also become large at moderate and median quantiles. These all indicates good performance of our method in variable selection.

Table 10

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 1.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	4.45(4)	4.61(4.77)	4.54(2.15)	4.57(3.71)
	p = .1	wrong	0.14(0.4)	0.09(0.33)	0.28(0.3)	0.09(0.33)
	p = .3	correct	5.29(4.92)	5.35(5.52)	5.24(3.58)	5.16(4.69)
	p = .3	wrong	0.12(0.41)	0.09(0.29)	0.2(0.43)	0.09(0.39)
	p = .5	correct	5.29(5.42)	5.37(5.79)	5.27(5.31)	5.45(5.57)
	p = .5	wrong	0.11(0.45)	0.1(0.43)	0.18(0.73)	0.07(0.49)
n = 50	p = .1	correct	4.44(4.13)	4.53(4.6)	4.47(2.92)	4.63(3.86)
	p = .1	wrong	0(0.16)	0.01(0.13)	0.03(0.31)	0.02(0.29)
	p = .3	correct	5.48(5.45)	5.38(5.65)	5.25(4.77)	5.23(5.4)
	p = .3	wrong	0.01(0.21)	0.01(0.13)	0.01(0.4)	0.01(0.35)
	p = .5	correct	5.58(5.78)	5.5(5.85)	5.55(5.68)	5.51(5.87)
	p = .5	wrong	0(0.35)	0.01(0.18)	0(0.55)	0.01(0.44)
n = 100	p = .1	correct	4.53(3.5)	4.55(3.74)	4.51(2.42)	4.68(2.87)
	p = .1	wrong	0(0.08)	0(0.01)	0(0.1)	0(0.05)
	p = .3	correct	5.43(5.42)	5.47(5.59)	5.05(5.12)	4.93(5.68)
	p = .3	wrong	0(0.11)	0(0.02)	0(0.28)	0(0.04)
	p = .5	correct	5.4(5.78)	5.59(5.92)	5.61(5.9)	5.7(5.96)
	p = .5	wrong	0(0.16)	0(0)	0(0.49)	0(0.13)

Table 14

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 3.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	6.11(4.04)	6.51(4.91)	6.18(2.5)	6.27(4.09)
	p = .1	wrong	0(0.43)	0(0.25)	0(0.29)	0(0.43)
	p = .3	correct	6.96(4.8)	7.24(5.5)	7(3.77)	6.97(5.04)
	p = .3	wrong	0(0.37)	0(0.21)	0(0.39)	0(0.39)
	p = .5	correct	6.99(5.37)	7.32(5.83)	7.13(5.31)	7.2(5.76)
	p = .5	wrong	0(0.47)	0(0.25)	0(0.66)	0(0.55)
n = 50	p = .1	correct	6.22(4)	6.1(4.73)	6.32(3.05)	6.39(3.83)
	p = .1	wrong	0(0.21)	0(0.12)	0(0.41)	0(0.27)
	p = .3	correct	7.12(5.32)	7.08(5.73)	6.92(5.07)	6.91(5.51)
	p = .3	wrong	0(0.22)	0(0.1)	0(0.46)	0(0.31)
	p = .5	correct	7.15(5.63)	7.25(5.88)	7.41(5.83)	7.39(5.95)
	p = .5	wrong	0(0.38)	0(0.2)	0(0.62)	0(0.37)
n = 100	p = .1	correct	6.36(3.34)	6.47(3.73)	6.49(2.37)	6.54(2.94)
	p = .1	wrong	0(0.05)	0(0.01)	0(0.16)	0(0.11)
	p = .3	correct	7.27(5.29)	7.42(5.65)	6.95(5.17)	6.96(5.52)
	p = .3	wrong	0(0.1)	0(0)	0(0.3)	0(0.15)
	p = .5	correct	7.45(5.7)	7.53(5.89)	7.61(5.93)	7.55(5.92)
	p = .5	wrong	0(0.16)	0(0.04)	0(0.46)	0(0.14)

Table 11

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 1.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	4.43(4.12)	4.61(4.83)	4.5(2.27)	4.53(3.78)
	p = .1	wrong	0.13(0.38)	0.08(0.35)	0.22(0.32)	0.08(0.35)
	p = .3	correct	5.27(5.07)	5.34(5.52)	5.28(3.8)	5.19(4.8)
	p = .3	wrong	0.11(0.47)	0.09(0.33)	0.16(0.47)	0.09(0.41)
	p = .5	correct	5.25(5.4)	5.42(5.8)	5.33(5.42)	5.48(5.59)
	p = .5	wrong	0.11(0.5)	0.11(0.43)	0.17(0.76)	0.07(0.53)
n = 50	p = .1	correct	4.46(4.13)	4.53(4.63)	4.47(2.97)	4.66(3.87)
	p = .1	wrong	0(0.18)	0.01(0.13)	0.04(0.32)	0.02(0.29)
	p = .3	correct	5.47(5.42)	5.37(5.65)	5.25(4.86)	5.2(5.46)
	p = .3	wrong	0(0.23)	0.01(0.14)	0.01(0.44)	0.01(0.37)
	p = .5	correct	5.57(5.76)	5.43(5.84)	5.59(5.7)	5.48(5.89)
	p = .5	wrong	0(0.36)	0.01(0.18)	0(0.59)	0(0.46)
n = 100	p = .1	correct	4.54(3.54)	4.52(3.82)	4.52(2.44)	4.67(2.9)
	p = .1	wrong	0(0.07)	0(0.01)	0(0.1)	0(0.06)
	p = .3	correct	5.41(5.43)	5.46(5.59)	5.05(5.18)	4.95(5.68)
	p = .3	wrong	0(0.11)	0(0.01)	0(0.28)	0(0.06)
	p = .5	correct	5.4(5.78)	5.55(5.92)	5.6(5.9)	5.68(5.97)
	p = .5	wrong	0(0.2)	0(0)	0(0.5)	0(0.13)

Table 15

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 3.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	6.05(4.11)	6.48(4.94)	6.15(2.69)	6.26(4.14)
	p = .1	wrong	0(0.43)	0(0.27)	0(0.26)	0(0.39)
	p = .3	correct	6.92(4.84)	7.21(5.58)	6.92(3.94)	6.89(5.07)
	p = .3	wrong	0(0.37)	0(0.25)	0(0.44)	0(0.42)
	p = .5	correct	6.95(5.46)	7.25(5.83)	7.09(5.39)	7.18(5.83)
	p = .5	wrong	0(0.51)	0(0.27)	0(0.75)	0(0.59)
n = 50	p = .1	correct	6.2(3.96)	6.24(4.77)	6.26(3.07)	6.38(3.82)
	p = .1	wrong	0(0.21)	0(0.1)	0(0.37)	0(0.28)
	p = .3	correct	7.08(5.32)	6.99(5.76)	6.92(5.17)	6.88(5.58)
	p = .3	wrong	0(0.23)	0(0.11)	0(0.51)	0(0.33)
	p = .5	correct	7.13(5.63)	7.25(5.86)	7.4(5.86)	7.38(5.96)
	p = .5	wrong	0(0.38)	0(0.21)	0(0.66)	0(0.38)
n = 100	p = .1	correct	6.34(3.33)	6.48(3.74)	6.45(2.39)	6.5(2.94)
	p = .1	wrong	0(0.05)	0(0.01)	0(0.17)	0(0.13)
	p = .3	correct	7.3(5.29)	7.39(5.64)	7.01(5.21)	6.97(5.54)
	p = .3	wrong	0(0.1)	0(0)	0(0.31)	0(0.15)
	p = .5	correct	7.47(5.72)	7.55(5.88)	7.59(5.96)	7.53(5.92)
	p = .5	wrong	0(0.17)	0(0.04)	0(0.48)	0(0.15)

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 1.

Random effects are in parentheses.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 1.

Random effects are in parentheses.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 2.

Random effects are in parentheses.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 2.

Random effects are in parentheses.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 3.

Random effects are in parentheses.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 3.

Random effects are in parentheses. Next, in order to evaluate the sensitivity of our methods, we consider the following four different priors of tuning and scale parameters for Simulation 1: case(1) , case(2) , case(3) , case(4) . From Table 16, it can be observed that the differences between the four different priors is quite small in general. Hence our methods are not sensitivity to the choice of the hyper-parameters for the priors.

Table 16

MAD, RMSE, and for different priors in Simulation 1, when normal mixture error distribution.

Method	τ	Prior	MAD	RMSE	MME(β^)	MMR(D^)
adGL	0.1	case (1)	1.957	2.491	1.516	7.439
		case (2)	2.214	2.849	0.839	6.342
		case (3)	2.4	3.073	0.578	6.352
		case (4)	2.439	3.11	0.532	6.632
	0.3	case (1)	1.729	2.239	0.354	7.773
		case (2)	1.795	2.358	0.306	6.353
		case (3)	1.882	2.482	0.265	6.577
		case (4)	1.881	2.513	0.231	6.49
	0.5	case (1)	1.554	1.984	0.324	7.787
		case (2)	1.624	2.124	0.28	6.708
		case (3)	1.685	2.208	0.253	6.799
		case (4)	1.713	2.256	0.259	7.011
adSpikeGL	0.1	case (1)	2.231	2.835	0.987	6.518
		case (2)	2.412	3.058	0.656	6.18
		case (3)	2.461	3.126	0.607	6.389
		case (4)	2.496	3.158	0.498	6.76
	0.3	case (1)	1.84	2.407	0.24	4.762
		case (2)	1.849	2.432	0.24	5.451
		case (3)	1.905	2.494	0.246	5.381
		case (4)	1.946	2.532	0.232	5.765
	0.5	case (1)	1.672	2.212	0.232	5.226
		case (2)	1.741	2.256	0.241	5.996
		case (3)	1.696	2.222	0.24	5.445
		case (4)	1.757	2.263	0.233	5.571

Real data example

In this section, we use the Age-Related Macular Degeneration Trial (ARMD) data to assess the behavior of the proposed model under different quantiles. This dataset consists of 4 variables, available in the R package nlme. Visual acuity of 188 patients with ARMD were measured at 4 time points, resulting in 752 observations. We consider the relationship between the variable of the visual acuity (VISUAL) and three other explanatory variables, including the value of visual acuity measured at baseline(VISUAL0), the time of the measurement(TIME), the treatment indicator(Treat). A detailed description is given in [34]. [34] analyzed the data set using a linear mean mixed model and found that there exist possible interactions between time and treat effects, we then consider the following LQMM (i = 1, …, 188, j = 1, …, 4): Here we choose five different values of τ, i.e. 0.1, 0.3, 0.5, 0.7, 0.9, to thoroughly describe the response distribution. We implement the adGL, adSpikeGL, PMQ, BL, BAL and QR models on this data set. Table 17 shows the summaries of the MAD and the RMSE, where The results in Table 17 suggest that PMQ provides smaller MAD and the RMSE than other methods in upper quantiles(τ = 0.7 and τ = 09), however, adGL and adSpikeGL perform better in lower and median quantile (τ = 0.1, τ = 0.3 and τ = 0.5). We also report the posterior median estimations and 95% credible intervals for fixed effects and random effect variances in Table, respectively. As in simulations, adGL, adSpikeGL and PMQ are inclined to have similar performance, meanwhile, BAL, BL and QR also are likely to behave similarly, and all methods except for the adSpikeGL do not compress any fixed and random effects to zero.

Table 17

MAD and RMSE for the ARMD data.

Criterion	Method	τ = 0.1	τ = 0.3	τ = 0.5	τ = 0.7	τ = 0.9
MADs	adGL	7.140	5.306	5.078	4.935	6.805
	adSpikeGL	5.931	4.459	4.717	4.970	6.321
	PMQ	7.170	5.959	6.056	4.251	5.871
	BL	9.138	6.011	5.166	5.936	7.961
	BAL	9.189	6.052	5.211	6.009	7.987
	QR	16.665	10.025	8.712	9.618	13.764
RMSEs	adGL	9.385	7.162	7.256	7.725	9.871
	adSpikeGL	8.461	6.339	6.975	7.726	9.137
	PMQ	9.490	7.820	8.370	6.870	8.909
	BL	12.786	8.769	8.252	9.739	12.081
	BAL	12.828	8.896	8.351	9.877	12.136
	QR	19.948	12.816	12.120	13.624	17.975

The results of adSpikeGL in Tables 18 and 19 show that (i) for fixed effects, VISUAL0, TIME and TREAT are all identified as effective variables of Visual acuity at all quantiles except for TREAT at extreme quantile (τ = 0.1), this may be because treatment has no significant effect on the patients with poor visual acuity, (ii) for random effects, VISUAL0 and TREAT are deemed to be significant predictors of Visual acuity at all quantiles, however, TIME to be insignificant predictor at lower and median quantiles, and (iii) it is worth noticing that the fixed and random effect of the interactions between time and treat effects (TIME×TREAT) is not selected by adSpikeGL across all quantiles, suggesting that the effect of the treatment does not change over time no matter what the patient’s vision is.

Table 18

Posterior median and 95% credible intervals for the fixed effects of ARMD data.

quantile	Parameters	adGL	adSpikeGL	PMQ	BL	BAL	QR
τ = 0.1	β₀	-1.59(-7.493,1.045)	-0.111(-7.424,0.952)	-0.12(-2.21,0.796)	-1.184(-7.773,4.749)	-1.585(-6.614,3.445)	2.897(-)
	β₁	0.896(0.834,1.005)	0.881(0.841,0.999)	0.864(0.831,0.915)	0.915(0.803,1.029)	0.941(0.849,1.027)	0.812(-)
	β₂	-0.21(-0.274,-0.145)	-0.228(-0.282,-0.162)	-0.207(-0.266,-0.139)	-0.222(-0.27,-0.172)	-0.226(-0.274,-0.174)	-0.533(-)
	β₃	-0.426(-3.262,1.11)	0(-3.039,0.749)	-0.125(-1.79,0.695)	-2.492(-6.143,1.073)	-2.443(-5.618,1.039)	-4.374(-)
	β₄	-0.051(-0.147,0.04)	0(-0.119,0.021)	-0.057(-0.156,0.029)	-0.065(-0.138,0.007)	-0.064(-0.136,0.006)	0.031(-)
τ = 0.3	β₀	-0.117(-3.049,2.088)	0(-3.025,1.604)	-0.014(-1.191,1.01)	1.552(-3.475,6.39)	1.338(-2.966,5.848)	2.500(-)
	β₁	0.95(0.905,1.004)	0.946(0.913,0.998)	0.943(0.915,0.971)	0.93(0.844,1.013)	0.944(0.868,1.017)	0.900(-)
	β₂	-0.223(-0.285,-0.162)	-0.24(-0.291,-0.177)	-0.224(-0.282,-0.164)	-0.194(-0.245,-0.145)	-0.192(-0.243,-0.144)	-0.265(-)
	β₃	-1.207(-3.612,0.394)	-0.249(-3.319,0.394)	-0.227(-1.866,0.465)	-1.751(-4.568,1.149)	-1.687(-4.404,1.029)	-0.323(-)
	β₄	-0.048(-0.14,0.04)	0(-0.115,0.018)	-0.052(-0.142,0.036)	-0.061(-0.129,0.006)	-0.066(-0.133,0.002)	-0.126(-)
τ = 0.5	β₀	0.975(-0.965,5.282)	0.004(-0.935,4.763)	0.171(-0.931,2.569)	4.185(0.052,8.845)	3.765(-0.251,8.174)	2.839(-)
	β₁	0.975(0.901,1.018)	0.983(0.909,1.014)	0.986(0.945,1.013)	0.926(0.847,0.994)	0.937(0.866,1.001)	0.946(-)
	β₂	-0.197(-0.256,-0.136)	-0.219(-0.268,-0.157)	-0.197(-0.255,-0.14)	-0.143(-0.185,-0.104)	-0.142(-0.185,-0.102)	-0.151(-)
	β₃	-0.872(-3.133,0.507)	-0.072(-2.622,0.507)	-0.235(-1.903,0.615)	-1.391(-3.917,1.045)	-1.204(-3.729,1.169)	-0.656(-)
	β₄	-0.058(-0.147,0.028)	0(-0.118,0.012)	-0.057(-0.145,0.026)	-0.073(-0.131,-0.015)	-0.073(-0.132,-0.013)	-0.075(-)
τ = 0.7	β₀	6.253(1.701,10.526)	4.644(-0.128,10.815)	3.118(-0.276,8.154)	6.422(1.923,11.117)	5.304(1.124,9.614)	6.749(-)
	β₁	0.929(0.855,1.008)	0.949(0.852,1.032)	0.977(0.897,1.036)	0.924(0.847,0.997)	0.942(0.873,1.008)	0.929(-)
	β₂	-0.176(-0.236,-0.118)	-0.197(-0.247,-0.135)	-0.181(-0.241,-0.122)	-0.11(-0.145,-0.075)	-0.109(-0.143,-0.074)	-0.074(-)
	β₃	-1.09(-3.435,0.401)	-0.045(-2.521,0.616)	-0.52(-2.538,0.964)	-1.155(-3.589,1.19)	-0.891(-3.226,1.363)	-1.451(-)
	β₄	-0.056(-0.14,0.029)	0(-0.122,0.012)	-0.05(-0.139,0.036)	-0.063(-0.115,-0.01)	-0.06(-0.115,-0.007)	-0.063(-)
τ = 0.9	β₀	12.515(7.598,18.005)	9.273(3.135,14.157)	7.479(2.442,12.155)	8.299(3.633,13.256)	6.382(2.174,10.956)	17.773(-)
	β₁	0.87(0.775,0.952)	0.912(0.834,1.007)	0.94(0.87,1.021)	0.938(0.858,1.015)	0.966(0.89,1.033)	0.818(-)
	β₂	-0.174(-0.237,-0.108)	-0.189(-0.238,-0.128)	-0.18(-0.235,-0.121)	-0.091(-0.124,-0.058)	-0.089(-0.123,-0.051)	-0.011(-)
	β₃	-2.257(-5.291,0.092)	-0.252(-3.101,0.51)	-0.576(-2.655,1.234)	-1.087(-3.599,1.17)	-0.873(-2.955,1.401)	-2.636(-)
	β₄	-0.043(-0.134,0.045)	0(-0.117,0.016)	-0.041(-0.12,0.048)	-0.051(-0.102,0.003)	-0.048(-0.099,0.007)	0.023(-)

Table 19

Posterior median and 95% credible intervals for the random effect variances of ARMD data.

τ	Parameters	adGL	adSpikeGL	PMQ
τ = 0.1	σ₀₀	3.182(0.016,14.82)	91.361(62.646,130.536)	0.251(0,22.157)
	σ₁₁	0.023(0.007,0.035)	0(0,0)	0.029(0.01,0.043)
	σ₂₂	0.076(0.051,0.109)	0.077(0.058,0.102)	0.073(0.05,0.106)
	σ₃₃	1.236(0.031,8.159)	247.861(113.119,422.548)	1.249(0.002,85.935)
	σ₄₄	0.025(0.001,0.121)	0(0,0)	0.065(0,0.253)
τ = 0.3	σ₀₀	0.981(0.002,8.525)	30.736(15.537,56.308)	0.087(0,12.763)
	σ₁₁	0.008(0.002,0.014)	0(0,0.003)	0.007(0.001,0.013)
	σ₂₂	0.072(0.049,0.103)	0.072(0.053,0.096)	0.072(0.049,0.102)
	σ₃₃	4.622(0.218,13.325)	137.31(35.006,217.77)	20.295(0.957,144.869)
	σ₄₄	0.026(0.002,0.104)	0(0,0)	0.007(0,0.102)
τ = 0.5	σ₀₀	0.591(0,10.282)	27.488(15.813,46.503)	1.387(0,51.397)
	σ₁₁	0.007(0.001,0.014)	0(0,0)	0.004(0,0.011)
	σ₂₂	0.068(0.046,0.104)	0.066(0.048,0.09)	0.067(0.047,0.098)
	σ₃₃	3.363(0.1,10.958)	81.255(2.149,172.215)	14.283(0.082,97.629)
	σ₄₄	0.017(0.001,0.153)	0(0,0)	0.006(0,0.114)
τ = 0.7	σ₀₀	10.218(2.272,24.638)	143.485(78.765,280.103)	90.912(28.974,222.612)
	σ₁₁	0.002(0,0.007)	0.013(0.004,0.048)	0.005(0,0.028)
	σ₂₂	0.062(0.04,0.088)	0.063(0.045,0.087)	0.064(0.042,0.094)
	σ₃₃	1.795(0.045,9.234)	54.378(4.052,162.822)	2.581(0.005,80.103)
	σ₄₄	0.016(0.001,0.093)	0(0,0)	0.006(0,0.115)
τ = 0.9	σ₀₀	21.142(0.001,40.1)	162.243(104.606,339.58)	170.374(92.769,301.983)
	σ₁₁	0.002(0,0.03)	0.013(0.004,0.067)	0.013(0.003,0.036)
	σ₂₂	0.065(0.039,0.379)	0.066(0.048,0.089)	0.074(0.048,0.109)
	σ₃₃	1.168(0.026,8.957)	50.279(5.654,124.126)	2.636(0.004,74.622)
	σ₄₄	0.017(0.001,0.389)	0(0,0)	0.028(0,0.199)

Conclusion and discussion

In this work, we presented a novel shrinkage method to linear quantile mixed model for analysing and interpreting longitudinal data. We also consider the variable selection based on a Bayesian adaptive lasso and an extended Bayesian adaptive group lasso with spike and slab priors. Unlike other approaches for LQMMs, which need some ad hoc methods to do model selection, adSpikeGL can directly identify the significant fixed and random effects simultaneously. Several criterions were adopted to evaluate the finite sample performance of the proposed methods thoroughly by simulations and real dat example. The results reveal that the proposed methods is very competitive with the existing methods. On the other hand, as the sample size gets large, adGL and adGLSpike generally tend to provide smaller MMAD, MRMSE, and across all scenarios, and adGLSpike also has good performance in variable selection in most cases. The only exception is at τ = 0.1 where the number of the correct zero coefficients of adGLSpike did not show an increasing trend with sample size. The problem can be caused by the drawbacks of the ALD that do not allow the skewness, kurtosis and tails of distribution to vary. To overcome this problem [35] recently introduced a generalized asymmetric Laplace distribution (GALD) to establish QR in linear regression. In the future, we would like to extend the GALD to the linear mixed effect models. Moreover, we would also like to consider some other Bayesian shrinkage priors, such as the double-Pareto prior [36], horseshoe prior [37], the normal-gamma prior [38], etc, though the lasso-type priors is the most widely used shrinkage priors in the literature. (RAR) Click here for additional data file.

Table 2

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 1.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	0.958(11.408)	0.607(9.013)	1.447(13.115)	0.949(10.876)
	adSpikeGL	1.394(7.827)	0.85(7.501)	2.456(11.005)	1.572(9.061)
	PMQ	4.366(10.095)	3.825(10.022)	5.127(13.1)	4.299(11.501)
	BL	2.838(-)	2.763(-)	3.785(-)	3.697(-)
	BAL	2.576(-)	2.64(-)	3.761(-)	3.615(-)
	QR	3.001(-)	3.068(-)	3.866(-)	3.842(-)
n = 50	adGL	0.543(8.77)	0.357(7.495)	0.736(10.75)	0.516(8.6)
	adSpikeGL	0.337(5.917)	0.241(4.7)	0.521(7.668)	0.432(6.034)
	PMQ	1.395(6.996)	0.975(5.921)	1.402(8.938)	1.441(6.762)
	BL	1.743(-)	1.974(-)	2.748(-)	2.54(-)
	BAL	1.736(-)	1.863(-)	2.637(-)	2.506(-)
	QR	1.978(-)	2.062(-)	2.862(-)	2.606(-)
n = 100	adGL	0.3(6.429)	0.191(5.149)	0.441(8.083)	0.313(6.55)
	adSpikeGL	0.198(4.973)	0.133(3.585)	0.298(6.177)	0.28(3.93)
	PMQ	0.373(5.299)	0.305(3.939)	0.576(6.255)	0.388(4.797)
	BL	1.289(-)	1.567(-)	1.951(-)	2.109(-)
	BAL	1.243(-)	1.469(-)	1.859(-)	1.984(-)
	QR	1.368(-)	1.544(-)	2.095(-)	2.1(-)

Table 3

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 1.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	0.924(11.166)	0.581(8.938)	1.278(13.494)	0.862(11.367)
	adSpikeGL	1.377(7.44)	0.732(6.637)	1.736(8.771)	1.181(7.687)
	PMQ	4.216(10.914)	3.594(10.373)	4.813(13.136)	4.121(10.363)
	BL	1.729(-)	1.276(-)	1.968(-)	1.596(-)
	BAL	1.734(-)	1.289(-)	1.923(-)	1.633(-)
	QR	2.032(-)	1.517(-)	2.223(-)	1.88(-)
n = 50	adGL	0.421(8.836)	0.326(7.586)	0.605(10.555)	0.433(8.434)
	adSpikeGL	0.317(6.044)	0.238(5.088)	0.399(6.996)	0.283(6.042)
	PMQ	1.327(7.136)	0.993(5.648)	1.324(7.431)	1.409(6.689)
	BL	0.982(-)	0.823(-)	1.086(-)	0.912(-)
	BAL	1.007(-)	0.831(-)	1.109(-)	0.906(-)
	QR	1.157(-)	0.923(-)	1.294(-)	1.102(-)
n = 100	adGL	0.286(6.651)	0.18(5.065)	0.31(7.512)	0.207(6.058)
	adSpikeGL	0.197(4.55)	0.128(3.223)	0.22(5.378)	0.134(4.589)
	PMQ	0.344(5.167)	0.296(3.711)	0.472(5.622)	0.3(4.821)
	BL	0.471(-)	0.402(-)	0.573(-)	0.469(-)
	BAL	0.5(-)	0.423(-)	0.582(-)	0.478(-)
	QR	0.541(-)	0.456(-)	0.634(-)	0.515(-)

Table 4

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 2.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	3.28(12.016)	2.091(9.127)	4.671(14.802)	2.492(10.868)
	adSpikeGL	4.802(13.526)	2.63(8.487)	7.702(18.516)	3.663(12.025)
	PMQ	2.796(9.328)	1.919(7.995)	3.461(11.497)	2.477(8.897)
	BL	5.359(-)	6.499(-)	5.447(-)	6.328(-)
	BAL	5.214(-)	5.98(-)	5.255(-)	5.842(-)
	QR	9.121(-)	11.535(-)	10.766(-)	11.09(-)
n = 50	adGL	2.639(9.979)	1.541(7.654)	3.914(13.945)	2.098(10.088)
	adSpikeGL	3.117(9.197)	1.909(6.214)	4.98(16.776)	2.641(10.728)
	PMQ	2.19(8.009)	1.426(6.192)	2.884(12.507)	1.769(8.396)
	BL	4.36(-)	5.097(-)	4.458(-)	5.872(-)
	BAL	3.97(-)	4.407(-)	4.165(-)	5.476(-)
	QR	8.105(-)	9.584(-)	9.089(-)	10.387(-)
n = 100	adGL	2.23(7.876)	1.279(5.596)	3.163(12.262)	1.446(7.97)
	adSpikeGL	2.364(6.951)	1.296(5.048)	3.708(13.513)	1.772(8.228)
	PMQ	1.796(6.807)	1.097(4.709)	2.515(10.873)	1.247(7.199)
	BL	3.476(-)	5.193(-)	3.526(-)	4.909(-)
	BAL	3.106(-)	4.431(-)	2.995(-)	4.17(-)
	QR	7.104(-)	9.498(-)	7.716(-)	9.371(-)

Table 5

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 2.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	1.337(11.777)	0.948(8.895)	1.836(13.974)	1.212(10.574)
	adSpikeGL	3.095(10.405)	1.849(7.039)	4.022(13.495)	2.494(8.874)
	PMQ	1.985(10.15)	1.423(7.629)	2.495(11.006)	1.765(7.828)
	BL	2.722(-)	2.548(-)	3.531(-)	3.35(-)
	BAL	2.678(-)	2.477(-)	3.514(-)	3.323(-)
	QR	3.2(-)	2.831(-)	3.648(-)	3.539(-)
n = 50	adGL	0.773(9.367)	0.51(7.751)	1.092(10.676)	0.837(8.88)
	adSpikeGL	1.78(6.842)	1.363(5.378)	2.569(8.632)	1.692(5.98)
	PMQ	1.091(7.026)	0.819(5.406)	1.614(8.468)	1.113(6.712)
	BL	1.831(-)	1.923(-)	2.918(-)	2.76(-)
	BAL	1.787(-)	1.874(-)	2.848(-)	2.702(-)
	QR	1.989(-)	1.976(-)	3.115(-)	2.905(-)
n = 100	adGL	0.37(6.871)	0.225(5.369)	0.561(7.846)	0.328(6.736)
	adSpikeGL	0.871(4.78)	0.546(3.421)	1.137(5.862)	0.789(4.451)
	PMQ	0.515(5.01)	0.347(3.739)	0.703(6.179)	0.468(5.179)
	BL	1.253(-)	1.61(-)	1.952(-)	2.022(-)
	BAL	1.183(-)	1.483(-)	1.848(-)	1.956(-)
	QR	1.339(-)	1.627(-)	2.083(-)	2.098(-)

Table 6

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 2.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	1.242(11.082)	0.903(9.625)	1.544(13.627)	1.089(10.873)
	adSpikeGL	2.824(8.786)	1.797(6.892)	3.649(10.828)	2.247(8.175)
	PMQ	1.894(9.806)	1.379(7.22)	2.419(11.003)	1.6(7.938)
	BL	1.668(-)	1.266(-)	2.046(-)	1.502(-)
	BAL	1.648(-)	1.267(-)	2.005(-)	1.508(-)
	QR	2.074(-)	1.406(-)	2.384(-)	1.864(-)
n = 50	adGL	0.74(9.271)	0.502(7.87)	0.953(10.308)	0.667(8.587)
	adSpikeGL	1.636(6.982)	1.288(5.22)	2.221(7.434)	1.581(6.003)
	PMQ	1.03(6.806)	0.802(5.529)	1.352(7.545)	0.99(6.257)
	BL	1.023(-)	0.845(-)	1.298(-)	0.945(-)
	BAL	1.008(-)	0.871(-)	1.312(-)	0.923(-)
	QR	1.121(-)	0.959(-)	1.528(-)	1.007(-)
n = 100	adGL	0.33(6.845)	0.213(5.253)	0.422(7.898)	0.265(6.053)
	adSpikeGL	0.843(4.707)	0.644(3.548)	0.948(5.46)	0.77(4.39)
	PMQ	0.504(5.029)	0.334(3.65)	0.58(5.615)	0.412(4.861)
	BL	0.517(-)	0.363(-)	0.57(-)	0.429(-)
	BAL	0.536(-)	0.389(-)	0.554(-)	0.43(-)
	QR	0.55(-)	0.446(-)	0.632(-)	0.469(-)

Table 7

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 3.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	2.922(11.216)	1.577(9.453)	3.801(14.215)	1.962(11.422)
	adSpikeGL	1.284(11.385)	0.509(7.829)	3.764(15.632)	1.044(10.661)
	PMQ	2.045(9.833)	1.358(8.179)	3.375(12.93)	1.861(9.465)
	BL	5.425(-)	6.815(-)	6.204(-)	6.326(-)
	BAL	5.426(-)	6.208(-)	6.357(-)	5.936(-)
	QR	9.733(-)	11.759(-)	11.056(-)	10.68(-)
n = 50	adGL	2.584(9.042)	1.231(7.481)	2.929(12.619)	1.639(9.513)
	adSpikeGL	1.545(8.405)	0.616(6.177)	2.372(14.616)	1.079(9.174)
	PMQ	1.931(7.836)	1.098(6.659)	2.336(10.67)	1.295(8.154)
	BL	4.411(-)	5.516(-)	4.747(-)	5.774(-)
	BAL	3.947(-)	5.067(-)	4.248(-)	5.313(-)
	QR	8.331(-)	10.489(-)	9.016(-)	11.041(-)
n = 100	adGL	2.09(7.332)	1.033(5.836)	2.619(11.603)	1.399(7.948)
	adSpikeGL	1.538(6.93)	0.632(5.038)	2.104(13.169)	1.03(7.819)
	PMQ	1.662(6.521)	0.842(4.998)	2.018(10.587)	1.097(7.047)
	BL	3.592(-)	5.101(-)	3.939(-)	4.656(-)
	BAL	3.106(-)	4.404(-)	3.438(-)	4.084(-)
	QR	6.751(-)	9.933(-)	8.013(-)	9.714(-)

Table 8

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 3.

are in parentheses.

Sample size	Method	Error distribution
Sample size	Method	Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	adGL	0.693(11.078)	0.426(9.632)	0.935(13.41)	0.608(11.588)
	adSpikeGL	0.207(8.668)	0.196(6.897)	0.326(9.916)	0.236(8.558)
	PMQ	0.916(9.637)	0.757(7.408)	1.352(11.942)	1.012(8.875)
	BL	2.838(-)	2.522(-)	3.815(-)	3.22(-)
	BAL	2.659(-)	2.559(-)	3.799(-)	3.089(-)
	QR	2.977(-)	2.633(-)	3.83(-)	3.15(-)
n = 50	adGL	0.489(8.883)	0.325(7.26)	0.609(10.14)	0.479(8.714)
	adSpikeGL	0.166(5.899)	0.13(5.429)	0.265(6.91)	0.179(5.89)
	PMQ	0.523(6.773)	0.414(5.967)	0.826(7.842)	0.587(6.097)
	BL	1.971(-)	2.242(-)	2.728(-)	2.757(-)
	BAL	1.899(-)	2.106(-)	2.666(-)	2.663(-)
	QR	2.092(-)	2.444(-)	3.06(-)	2.78(-)
n = 100	adGL	0.261(6.807)	0.15(5.202)	0.394(7.977)	0.254(6.402)
	adSpikeGL	0.066(4.502)	0.031(3.74)	0.182(5.552)	0.123(4.076)
	PMQ	0.251(5.041)	0.152(3.931)	0.403(5.937)	0.289(4.513)
	BL	1.231(-)	1.557(-)	2.219(-)	2.06(-)
	BAL	1.123(-)	1.454(-)	2.092(-)	1.823(-)
	QR	1.338(-)	1.717(-)	2.297(-)	2.1(-)

Table 12

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 2.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	0.11(3.68)	0.25(4.86)	0.01(1.85)	0.21(3.72)
	p = .1	wrong	2.93(0.27)	1.78(0.29)	3.72(0.29)	2.64(0.31)
	p = .3	correct	0.87(4.27)	0.88(5.46)	0.88(2.32)	0.64(4.54)
	p = .3	wrong	2.54(0.31)	1.54(0.29)	3.59(0.24)	2.27(0.26)
	p = .5	correct	0.9(4.89)	0.92(5.76)	0.9(4.05)	0.92(5.4)
	p = .5	wrong	2.27(0.43)	1.48(0.36)	3.07(0.48)	1.99(0.46)
n = 50	p = .1	correct	0.04(3.99)	0.14(4.71)	0.04(2.63)	0.15(3.51)
	p = .1	wrong	1.45(0.26)	1.07(0.12)	2.33(0.23)	1.55(0.22)
	p = .3	correct	0.87(5.36)	0.86(5.64)	0.61(4.21)	0.49(5.41)
	p = .3	wrong	1.31(0.29)	1(0.12)	1.88(0.36)	1.31(0.26)
	p = .5	correct	0.93(5.61)	0.94(5.89)	0.93(5.44)	0.94(5.91)
	p = .5	wrong	1.26(0.35)	0.93(0.18)	1.55(0.52)	1.16(0.39)
n = 100	p = .1	correct	0.01(3.19)	0(3.67)	0(2.25)	0.02(2.93)
	p = .1	wrong	0.5(0.12)	0.47(0.01)	0.79(0.16)	0.43(0.05)
	p = .3	correct	0.9(5.41)	0.9(5.7)	0.48(5.05)	0.32(5.59)
	p = .3	wrong	0.5(0.1)	0.48(0.01)	0.61(0.27)	0.35(0.07)
	p = .5	correct	0.93(5.76)	0.95(5.89)	0.93(5.91)	0.91(5.92)
	p = .5	wrong	0.51(0.22)	0.48(0)	0.55(0.4)	0.3(0.16)

Table 13

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 2.

Random effects are in parentheses.

Sample size	τ		Error distribution
Sample size	τ		Normal	Normalmixtrue	Laplace	Laplacemixtrue
n = 30	p = .1	correct	0.1(3.66)	0.25(4.81)	0.01(1.82)	0.2(3.71)
	p = .1	wrong	2.97(0.3)	1.64(0.28)	3.65(0.29)	2.59(0.32)
	p = .3	correct	0.89(4.32)	0.88(5.4)	0.83(2.35)	0.66(4.63)
	p = .3	wrong	2.38(0.34)	1.5(0.3)	3.47(0.26)	2.16(0.34)
	p = .5	correct	0.9(4.95)	0.94(5.78)	0.88(4.01)	0.9(5.46)
	p = .5	wrong	2.35(0.51)	1.5(0.39)	2.96(0.52)	1.95(0.49)
n = 50	p = .1	correct	0.04(4.03)	0.14(4.72)	0.04(2.65)	0.14(3.56)
	p = .1	wrong	1.34(0.25)	0.97(0.13)	2.24(0.23)	1.56(0.21)
	p = .3	correct	0.85(5.39)	0.88(5.67)	0.6(4.26)	0.49(5.48)
	p = .3	wrong	1.26(0.31)	0.89(0.11)	1.79(0.36)	1.28(0.3)
	p = .5	correct	0.93(5.66)	0.93(5.89)	0.93(5.46)	0.93(5.92)
	p = .5	wrong	1.17(0.37)	0.85(0.18)	1.47(0.57)	1.15(0.4)
n = 100	p = .1	correct	0.01(3.2)	0(3.66)	0(2.31)	0.02(3)
	p = .1	wrong	0.45(0.12)	0.46(0.01)	0.75(0.16)	0.34(0.06)
	p = .3	correct	0.9(5.36)	0.9(5.7)	0.49(5.11)	0.32(5.6)
	p = .3	wrong	0.46(0.13)	0.44(0.01)	0.61(0.3)	0.32(0.07)
	p = .5	correct	0.93(5.77)	0.95(5.9)	0.93(5.93)	0.91(5.92)
	p = .5	wrong	0.44(0.22)	0.44(0)	0.54(0.42)	0.25(0.16)

7 in total

Bayesian variable selection in linear quantile mixed models for longitudinal data with application to macular degeneration.

Introduction

The re-parameterized LQMMs

Shrinkage estimation of linear quantile mixed regression

Bayesian model selection in LQMMs

Simulation studies

Boxplots summarizing the MAD for three sample sizes in Simulation 1.

Boxplots summarizing the RMSE for three sample sizes in Simulation 1.

Boxplots summarizing the MAD for three sample sizes in Simulation 2.

Boxplots summarizing the RMSE for three sample sizes in Simulation 2.

Boxplots summarizing the MAD for three sample sizes in Simulation 3.

Boxplots summarizing the RMSE for three sample sizes in Simulation 3.

Median of mean square errors () and quadratic loss errors() at τ = 0.1 for Simulation 1.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 1.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 1.

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 2.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 2.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 2.

Median of mean square errors() and quadratic loss errors() at τ = 0.1 for Simulation 3.

Median of mean square errors() and quadratic loss errors() at τ = 0.3 for Simulation 3.

Median of mean square errors() and quadratic loss errors() at τ = 0.5 for Simulation 3.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 1.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 1.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 2.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 2.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of MPPM for Simulation 3.

Average numbers of correct and wrong zero coefficients for fixed effects and random effects in terms of HPPM for Simulation 3.

Real data example

Conclusion and discussion

1. Random effects selection in linear mixed models.

2. Quantile regression for longitudinal data using the asymmetric Laplace distribution.

3. Fixed and random effects selection in linear and logistic models.

4. Fixed and random effects selection in mixed effects models.

5. Joint variable selection for fixed and random effects in linear mixed-effects models.

6. GENERALIZED DOUBLE PARETO SHRINKAGE.

7. Quantile regression in linear mixed models: a stochastic approximation EM approach.