Literature DB >> 35799315

Defining R-squared measures for mixed-effects location scale models.

Abstract

Ecological momentary assessment and other modern data collection technologies facilitate research on both within-subject and between-subject variability of health outcomes and behaviors. For such intensively measured longitudinal data, Hedeker et al extended the usual two-level mixed-effects model to a two-level mixed-effects location scale (MELS) model to accommodate covariates' influence as well as random subject effects on both mean (location) and variability (scale) of the outcome. However, there is a lack of existing standardized effect size measures for the MELS model. To fill this gap, our study extends Rights and Sterba's framework of R 2 $$ {R}^2 $$ measures for multilevel models, which is based on model-implied variances, to MELS models. Our proposed framework applies to two different specifications of the random location effects, namely, through covariate-influenced random intercepts and through random intercepts combined with random slopes of observation-level covariates. We also provide an R function, R2MELS, that outputs summary tables and visualization for values of our R 2 $$ {R}^2 $$ measures. This framework is validated through a simulation study, and data from a health behaviors study and a depression study are used as examples to demonstrate this framework. These R 2 $$ {R}^2 $$ measures can help researchers provide greater interpretation of their findings using MELS models.

Entities: Chemical

Keywords: EMA; R-squared; mixed-effects location scale model; standardized effect size

Mesh：

Year: 2022 PMID： 35799315 PMCID： PMC9481677 DOI： 10.1002/sim.9521

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.497

INTRODUCTION

Modern data collection methods such as ecological momentary assessments (EMA) have allowed more detailed examination of subjects' heterogeneity both at the between‐subject (BS) level (also known as level 2) and the within‐subject (WS) level (also known as level 1). Hedeker et al extended the commonly used mixed‐effects regression model (MRM) into the mixed‐effects location scale (MELS) model that includes both random location effects and random scale effects. Random location effects refer to random subject effects on the mean of the response variable, and random scale effects refer to random subject effects on the WS variability of the response variable. While scale sometimes only refers to standard deviation, here it is on variance metric. Increasingly, researchers are encouraged to report effect sizes in addition to ‐values for their study results. Standardized effect size measures are of particular interest as they allow direct comparison of different models. However, to the best of our knowledge, there are no existing standardized effect size measures specifically for MELS models. Standardized effect size measures have been developed for MRMs. Earlier pseudo‐ measures evaluate a model's reduction in residual variance from the null model. A problem with this approach is that it can result in negative values and thus become meaningless as shown by Snijders and Bosker. Snijders and Bosker then resolved this problem by constructing using model‐implied variances. Recently, Rights and Sterba introduced a comprehensive framework of measures for multilevel models using model‐implied variances that measure both the total variance of the response variable explained and level‐specific variances of the response variable explained. While this work primarily discusses cross‐sectional multilevel models, their later work accommodates specific features of longitudinal multilevel models. Rights and Sterba also introduced supplementary visualization and R functions to help researchers implement their proposed framework. Our study extends Rights and Sterba's framework to two‐level MELS models. We develop frameworks of measures for two forms of MELS models, which differ in their characterization of the random location effects. In the first form, the model includes random subject intercepts and allows their variance to be influenced by both subject‐level and observation‐level covariates. Alternatively, the second form includes random subject intercepts and slopes of observation‐level covariates. For both forms, measures are constructed for both the location model and the scale model. We also develop an R function, R2MELS, that allows calculation and visualization of measures specifically for MELS models.

Mixed‐effects location‐scale (MELS) models with random intercepts with covariate‐influenced variance

To begin, we review the MELS model for a two‐level continuous response variable ( subjects, observations) proposed by Hedeker et al: where is the fixed intercept of the location model, is the vector of fixed location effect covariates, and is the corresponding vector of fixed location effects. BS heterogeneity is included via random intercepts , also recognized as the random location effects. is the observation‐level residuals and incorporates WS heterogeneity. and are assumed to be normally distributed with mean 0 and variances and , respectively. is further modeled in log‐linear form to account for different BS heterogeneity at different values of covariates: where is the variance of when the covariates equal zero, or if the covariates have no influence on the BS variance. is the vector of covariates influencing , which can contain both subject‐level and observation‐level covariates. represents the vector of fixed effects associated with on . For , both heteroskedasticity at different covariate values and heteroskedasticity between subjects are allowed. The heteroskedasticity of between subjects is included via a random scale effect , which is assumed to follow a normal distribution with a mean of 0 and a variance of . is again modeled in log‐linear form to ensure positive variance values: where is the value of when the covariates and the random scale effect equal zero, or when there is neither any covariate influencing nor subject heteroskedasticity of . is the vector of covariates influencing . Similar to , can contain both subject‐level and observation‐level covariates. is the vector of fixed effects associated with on . For convenience, Equation (2) can be rewritten in terms of standardized random location effects, denoted as : and Equation (3) can also be written in terms of standardized random scale effects, denoted as : Since the random location effects and random scale effects are not necessarily uncorrelated, and are assumed to follow the following bivariate normal distribution: where is the correlation between the random location effect and the random scale effect .

MELS models with random slopes of observation‐level covariates

Instead of having random intercepts and allowing covariates to influence their variance, random location effects can be modeled by random intercepts with constant variance and random slopes of observation‐level covariates: where we use to represent an vector with the first element of for the random intercept followed by observation‐level covariates for random slopes. , the corresponding vector of the random location effects, follows the following distribution: The first element of is the random intercept, and the following elements are random slopes associated with covariates in . The variances of the random intercept and the random slopes, , ,…, , are scalars not influenced by any covariates. In this form of MELS models, the fixed intercept, fixed location effects, and variance of observation‐level residuals are modeled the same way as in Section 1.1. Again, we can express the random effects as standardized random effects for convenience. Namely, and where , and follow a multivariate normal distribution with mean 0, and their variance‐covariance matrix is a matrix given as: where is the correlation between and , and is the correlation between and .

THE PROPOSED METHOD

Decomposition of observation‐level covariates

As mentioned, , , and can contain both subject‐level covariates and observation‐level covariates, and is solely composed of observation‐level covariates. Since an observation‐level covariate not centered at the subject level contains both BS variation and WS variation, and we assume that the covariates are multivariate normally distributed at each level, we first decompose each of , , , and into a BS component and a WS component before deriving their contribution to the variance of the response. For example, where is the vector of subject‐level fixed location effect covariates, which characterizes the BS components of observation‐level fixed location effect covariates. Elements of are multivariate normally distributed with mean and variance . Analogously, is the vector of WS components of observation‐level fixed location effect covariates, and its elements are multivariate normally distributed with mean 0 and variance . Similarly, decomposition of covariates influencing the variance of the random intercepts (ie, the BS variance) is given by , where , and . Decomposition of covariates influencing the WS variance is similarly: , where , and . Finally, decomposition of covariates for random slopes is given by , where , and . The recurring superscript in the variance matrices represents BS components, and represents WS components. Note that if one includes a random slope for an observation‐level covariate in , the same covariate will also occur in , and the decomposition of this variable will be the same in and .

Variance partitioning

Variance partitioning for MELS models with random intercepts with covariate‐influenced variance

For the model described in Section 1.1, since , , , and are independent. The WS variance of the response variable explained by fixed location effects of WS components of observation‐level covariates, denoted as f1, can be derived based on the property of the multivariate normal distribution: Similarly, the BS variance of the response variable explained by fixed location effects of subject‐level covariates and BS components of observation‐level covariates, denoted as f2, can be expressed as: In applying these formulas, we substitute and with their sample estimators. As shown in Appendix A, the variance of the random intercepts is denoted as v. Let m represent the variance of the random intercepts at the mean (both on the BS level and the WS level) of all covariates influencing the variance of the random intercepts. Note that the mean of the WS component of a covariate is zero. Then, represents the variance of the random intercepts explained by covariates. To further decompose , logarithmic transformations are taken: where represents the log‐transformed variance of the random intercepts explained by fixed effects of WS components of observation‐level covariates. , denoted as v2, is the log‐transformed variance of the random intercepts explained by fixed effects of subject‐level covariates and BS components of observation‐level covariates. We use sample estimators of , , and in place of these population parameters themselves in practice. The variance of the observation‐level residuals, also known as the scale of the response variable, is denoted as e. is the variance of the observation‐level residuals at the mean (both on the BS level and the WS level) of all covariates in the scale model, which is also the unexplained scale of the response variable, and is the variance of the observation‐level residuals explained by covariates and the random scale effects. Similar to the decomposition of , can be further decomposed on the logarithmic scale: where is the log‐transformed scale of the response variable explained by fixed effects of WS components of observation‐level covariates, is the log‐transformed scale of the response variable explained by fixed effects of subject‐level covariates and BS components of observation‐level covariates, and is the log‐transformed scale of the response variable explained by the random scale effects. When applied, the sample estimators of , , and are used in Equations 18 and 19. For simplicity, here the coefficients , , and are assumed to be the same for the BS and WS components of covariates. However, in some cases, it may be desirable to allow the BS and the WS component of a covariate to have different effects. For this, one can substitute the corresponding coefficients of distinct BS and WS effects in the appropriate equations. Our R function described in Section 2.4 allows for this possibility. Users would need to specify the BS component and the WS component of a covariate as two distinct variables in their dataset and input their corresponding effect estimates into the function separately.

Variance partitioning for MELS models with random slopes of observation‐level covariates

Here, we decompose the variance of the response variable based on the model described in Section 1.2. given the independence of , , , and as well as the independence of and . The interpretations and derivations for (f1), (f2), and (e) are the same as in Section 2.2.1, while the variance partitioning for the random location effects are given by, with representing the trace of a matrix: denoted as v1, which is the WS variance of the response variable explained by random slopes of WS components of observation‐level covariates, and Here, is denoted as v2, which corresponds to the BS variance of the response variable explained by random slopes of BS components of observation‐level covariates. Also, is denoted as m, which represents the BS variance of the response variable explained by the random intercepts at the mean of BS components of all covariates for random location effects. The derivations of Equation (21) and Equation (22) can be found in Rights and Sterba's work.

Defining measures

We develop measures for the total variance of the response variable, the level‐specific variances of the response variable, and the scale of the response variable. Table 1 details measures for the location part of the model described in Section 1.1, and Table 2 describes measures for the location part of the model presented in Section 1.2. measures for the scale model are illustrated in Table 3. The superscripts in parentheses denote the source(s) of variation. Also, the subscripts represent the denominators of the measures, meaning which part of the variance of the response variable that one is trying to explain. Namely, subscript indicates total variance of the response variable and is calculated as for MELS models with random intercepts (with covariate‐influenced variance), and for MELS models with random slopes of observation‐level covariates. Subscript and subscript represent WS variance of the response variable and BS variance of the response variable, respectively. For the model described in Section 1.1, the model‐implied WS variance of the response variable is , and the model‐implied BS variance of the response variable is . For the Section 1.2 model, the WS variance of the response variable equals while the BS variance of the response variable is calculated as . Lastly, the subscript represents variance of the observation‐level residuals, which is denoted as e in both specifications of MELS models.

TABLE 1

Definitions and interpretations of measures for the location part of an MELS model with random intercepts with covariate‐influenced variance

Definition	Coefficients ^a	Covariates ^a	Interpretation
R2 Measures for total variance of the response variable
Rt2(f1)=f1f1 + f2 + v + e	β	(xij−x‾i)	Proportion of total variance of the response variable explained by fixed location effects of WS components of observation‐level covariates
Rt2(f2)=f2f1 + f2 + v + e	β	x‾i	Proportion of total variance of the response variable explained by fixed location effects of subject‐level covariates and BS components of observation‐level covariates
Rt2(f)=f1 + f2f1 + f2 + v + e	β	(xij−x‾i), x‾i	Proportion of total variance of the response variable explained by fixed location effects
Rt2(v1)=v1v1+v2(v−m)f1 + f2 + v + e	α	(uij−ūi)	Proportion of total variance of the response variable explained by fixed effects of WS components of observation‐level covariates on the variance of the random intercepts
Rt2(v2)=v2v1+v2(v−m)f1 + f2 + v + e	α	ūi	Proportion of total variance of the response variable explained by fixed effects of subject‐level covariates and BS components of observation‐level covariates on the variance of the random intercepts
Rt2(m)=mf1 + f2 + v + e	α0,α	ūi	Proportion of total variance of the response variable explained by random intercepts at the mean of all covariates influencing the variance of the random intercepts
Rt2(v)=vf1 + f2 + v + e	α0,α	(uij−ūi), ūi	Proportion of total variance of the response variable explained by random location effects
Rt2(fv)=f1 + f2 + vf1 + f2 + v + e	β,α0,α	(xij−x‾i), x‾i, (uij−ūi), ūi	Proportion of total variance of the response variable explained by both fixed location effects and random location effects
Rt2(f2v)=f2 + vf1 + f2 + v + e	β,α0,α	x‾i, (uij−ūi), ūi	Proportion of total variance of the response variable explained by BS location effects
R2 Measures for BS variance of the response variable
Rb2(f2)=f2f2 + v	β	x‾i	Proportion of BS variance of the response variable explained by fixed location effects of subject‐level covariates and BS components of observation‐level covariates
Rb2(v1)=v1v1 + v2(v−m)f2 + v	α	(uij−ūi)	Proportion of BS variance of the response variable explained by fixed effects of WS components of observation‐level covariates on the variance of the random intercepts
Rb2(v2)=v2v1 + v2(v−m)f2 + v	α	ūi	Proportion of BS variance of the response variable explained by fixed effects of subject‐level covariates and BS components of observation‐level covariates on the variance of the random intercepts
Rb2(m)=mf2 + v	α0,α	ūi	Proportion of BS variance of the response variable explained by random intercepts at the mean of all covariates influencing the variance of the random intercepts
Rb2(v)=vf2 + v	α0,α	(uij−ūi), ūi	Proportion of BS variance of the response variable explained by random location effects
R2 Measures for WS variance of the response variable
Rw2(f1)=f1f1 + e	β	(xij−x‾i)	Proportion of WS variance of the response variable explained by fixed location effects of WS components of observation‐level covariates

The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript.

TABLE 2

Definitions and interpretations of measures for the location part of an MELS model with random slopes of observation‐level covariates

Definition	Coefficients ^a	Covariates ^a	Interpretation
R2 Measures for total variance of the response variable
Rt2(f1)=f1f1 + f2 + v1 + v2 + m + e	β	(xij−x‾i)	Proportion of total variance of the response variable explained by fixed location effects of WS components of observation‐level covariates
Rt2(f2)=f2f1 + f2 + v1 + v2 + m + e	β	x‾i	Proportion of total variance of the response variable explained by fixed location effects of subject‐level covariates and BS components of observation‐level covariates
Rt2(f)=f1 + f2f1 + f2 + v1 + v2 + m + e	β	(xij−x‾i), x‾i	Proportion of total variance of the response variable explained by fixed location effects
Rt2(v1)=v1f1 + f2 + v1 + v2 + m + e	vi	(zij−z‾i)	Proportion of total variance of the response variable explained by random slopes of WS components of observation‐level covariates
Rt2(v2)=v2f1 + f2 + v1 + v2 + m + e	vi	z‾i	Proportion of total variance of the response variable explained by random slopes of BS components of observation‐level covariates
Rt2(m)=mf1 + f2 + v1 + v2 + m + e	vi	z‾i	Proportion of total variance of the response variable explained by random intercepts at the mean of BS components of all covariates for random location effects
Rt2(vm)=v1 + v2 + mf1 + f2 + v1 + v2 + m + e	vi	(zij−z‾i),z‾i	Proportion of total variance of the response variable explained by random location effects
Rt2(fvm)=f1 + f2 + v1 + v2 + mf1 + f2 + v1 + v2 + m + e	β, vi	(xij−x‾i), x‾i, (zij−z‾i), z‾i	Proportion of total variance of the response variable explained by both fixed location effects and random location effects
Rt2(f2v2m)=f2 + v2 + mf1 + f2 + v1 + v2 + m + e	β, vi	x‾i, z‾i	Proportion of total variance of the response variable explained by BS location effects
R2 Measures for BS variance of the response variable
Rb2(f2)=f2f2 + v2 + m	β	x‾i	Proportion of BS variance of the response variable explained by fixed location effects of subject‐level covariates and BS components of observation‐level covariates
Rb2(v2)=v2f2 + v2 + m	vi	z‾i	Proportion of BS variance of the response variable explained by random slopes of BS components of observation‐level covariates
Rb2(m)=mf2 + v2 + m	vi	z‾i	Proportion of BS variance of the response variable explained by random intercepts at the mean of BS components of all covariates for random location effects
Rb2(v2m)=v2 + mf2 + v2 + m	vi	z‾i	Proportion of BS variance of the response variable explained by random location effects
R2 Measures for WS variance of the response variable
Rw2(f1)=f1f1 + v1 + e	β	(xij−x‾i)	Proportion of WS variance of the response variable explained by fixed location effects of WS components of observation‐level covariates
Rw2(v1)=v1f1 + v1 + e	vi	(zij−z‾i)	Proportion of WS variance of the response variable explained by random slopes of WS components of observation‐level covariates
Rw2(f1v1)=f1 + v1f1 + v1 + e	β, vi	(xij−x‾i), (zij−z‾i)	Proportion of WS variance of the response variable explained by both fixed location effects and random slopes of WS components of observation‐level covariates

The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript.

TABLE 3

Definitions and interpretations of measures for the scale part of an MELS model

Definition	Coeficients ^a	Covariates ^a	Interpretation
Rs2(e1)=e1e1 + e2 + d(e−e0)e	τ	(wij−w‾i)	Proportion of variance of observation‐level residuals explained by WS components of observation‐level covariates
Rs2(e2)=e2e1 + e2 + d(e−e0)e	τ	w‾i	Proportion of variance of observation‐level residuals explained by subject‐level covariates and BS components of observation‐level covariates
Rs2(e1e2)=e1 + e2e1 + e2 + d(e−e0)e	τ	(wij−w‾i),w‾i	Proportion of variance of observation‐level residuals explained by covariates
Rs2(d)=de1 + e2 + d(e−e0)e	σω	N/A	Proportion of variance of observation‐level residuals explained by random scale effects

The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript.

Definitions and interpretations of measures for the location part of an MELS model with random intercepts with covariate‐influenced variance The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript. Definitions and interpretations of measures for the location part of an MELS model with random slopes of observation‐level covariates The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript. Definitions and interpretations of measures for the scale part of an MELS model The coefficients and covariates refer to elements of the model needed to calculate the source of variation in the specific measure, that is, what is labeled in the parenthesized superscript. The s defined can measure the variance of the response variable explained by single sources of variation. Namely, , , ,,, , , , , and in Table 1, , , , , , , , , , and in Table 2, as well as , , and in Table 3 are single‐source measures. We also define s that measure joint effects of multiple parts of the models. Specifically, represents the variance of the response variable explained by fixed location effects of both subject‐level covariates and observation‐level covariates, v in Table 1 represents the variance of the response variable explained by the random intercepts, and in Table 2 represents the variance of the response variable explained by the random slopes of both the WS components and BS components of observation‐level covariates. Since the proportion of the total variance of the response variable that is BS can be of interest to researchers applying MELS models, we add in Table 1 and in Table 2. These two s measure the proportion of total variance of the response variable explained by BS location effects.

Implementation in R

The commented code for an R function named R2MELS and descriptions of its arguments are provided in the Supporting Information. The function is developed based on Rights and Sterba's r2MLMlong function. Users input their parameter estimates of a MELS model, and the function will output two tables of variance partitioning results (one for the location part of the model, and the other for the scale part of the model), two tables of values (one for the location part of the model, and the other for the scale part of the model) as well as a stacked bar plot of the single‐source values.

SIMULATION STUDY

To assess the validity of the proposed method, we conducted a small simulation study. Specifically, we fitted MELS models to 500 simulated datasets (200 subjects in each sample, 50 observations of each subject). For each simulated dataset, we allowed for two subject‐level covariates, , and three observation‐level covariates that vary purely within‐subjects, . The location model was specified as follows: For the variance of the random intercept , For the variance of the observation‐level residuals, and The generating parameters and average estimates from SAS PROC NLMIXED are summarized in Table 4, and the corresponding theoretical values of measures and average simulated values can be found in Table 5. As can be seen, all parameter estimates were well recovered, and all average simulated values resemble their corresponding theoretical values. Namely, all differences between the average simulated values and their corresponding theoretical values are lower than 0.004.

TABLE 4

Generating parameters and mean parameter estimates from 500 simulations

Parameter	True value	Simulated values mean (SD)
β0	1	1.000(0.069)
β1	−0.5	−0.497(0.020)
β2	2	1.999(0.024)
β3	1	1.000(0.013)
β4	−2	−2.000(0.007)
β5	3	3.000(0.010)
α0	0.1	0.098(0.103)
α1	0.4	0.401(0.012)
τ0	0.2	0.199(0.061)
τ1	0.3	0.300(0.013)
τ2	−0.1	−0.101(0.013)
τ3	0.5	0.502(0.018)
σω2	0.7	0.697(0.076)
ρvω	0.1	0.106 (0.075)

TABLE 5

Theoretical values of measures and average simulated values from 500 simulations

R2 Measure	Theoretical value	Simulated values mean (SD)
R2 Measures for total variance of the response variable
Rt2(f1)=f1f1 + f2 + v + e	0.661	0.663(0.015)
Rt2(f2)=f2f1 + f2 + v + e	0.203	0.201(0.017)
Rt2(f)=f1 + f2f1 + f2 + v + e	0.864	0.864(0.008)
Rt2(v1)=v‐mf1 + f2 + v + e	0.007	0.007(0.001)
Rt2(m)=mf1 + f2 + v + e	0.041	0.041(0.004)
Rt2(v)=vf1 + f2 + v + e	0.048	0.048(0.005)
Rt2(fv)=f1 + f2 + vf1 + f2 + v + e	0.912	0.912(0.006)
Rt2(f2v)=f2 + vf1 + f2 + v + e	0.251	0.249(0.017)
R2 Measures for BS variance of the response variable
Rb2(f2)=f2f2 + v	0.809	0.806(0.023)
Rb2(v1)=v−mf2 + v	0.028	0.029(0.004)
Rb2(m)=mf2 + v	0.163	0.165(0.020)
Rb2(v)=vf2 + v	0.191	0.194(0.023)
R2 Measures for WS variance of the response variable
Rb2(f1)=f1f1 + e	0.883	0.883(0.008)
R2 Measures for scale of the response variable
Rs2(e1)=e1e1+d(e−e0)e	0.231	0.232(0.009)
Rs2(d)=de1+d(e−e0)e	0.256	0.254(0.024)

Generating parameters and mean parameter estimates from 500 simulations Theoretical values of measures and average simulated values from 500 simulations

EXAMPLES

Example 1: Health behaviors data

Flueckiger et al collected intensive longitudinal data on 72 first‐year psychology students from the University of Basel regarding their sleep quality, physical activity, positive and negative affect, learning goal achievement, and examination grades. We fit a MELS model on these data to examine how negative affect (NA) of the students and survey day influenced their mean positive affect (PA), how survey day influenced the BS variance of PA, as well as how survey day influenced the WS variance of PA. PA and NA were measured on 7‐point Likert scales in which 1 means “not at all” and 7 means “extremely”. We used a grand‐mean‐centered and scaled version of survey day (Day_c), for which 1 unit indicates 1 week. The location model is as follows: The variance of the random subject intercepts and the variance of the observation‐level residuals are modeled as and respectively. The random effects and follow the same bivariate normal distribution as specified in Equation (6). The parameter estimates from SAS PROC NLMIXED are included in Table 6, and a visualization of the proposed measures for this example is shown in Figure 1.

TABLE 6

Parameter estimates of the MELS model on health behaviors data

Parameter	Estimate	SE	T‐value	P‐value
β0	5.855	0.109	53.631	<0.001
βNA	−0.644	0.016	−39.900	<0.001
βDay_c	−0.077	0.015	−5.247	<0.001
α0	−0.376	0.176	−2.144	0.036
αDay_c	0.139	0.031	4.531	<0.001
τ0	−0.603	0.093	−6.509	<0.001
τDay_c	0.073	0.025	2.888	0.005
σω	0.735	0.070	10.451	<0.001
ρvω	−0.500	0.100	−5.009	<0.001

FIGURE 1

Variance partitioning for Example 1: application to health behaviors data

Variance partitioning for Example 1: application to health behaviors data Parameter estimates of the MELS model on health behaviors data The first three bars on the left of Figure 1 correspond to the total variance, the BS variance, and the WS variance of PA, respectively. Within the bars, the red blocks represent proportion of the variance of PA explained by fixed location effects of WS components of observation‐level covariates, and , while the orange blocks can be interpreted as the proportion of the variance of PA, that is, the variance of observation‐level residuals. Blue blocks correspond to the BS variance of PA. Specifically, the darkest blue blocks indicate the proportion of the variance of PA explained by the effect of on the variance of the random intercepts, and the lightest blue represents the variance of the random intercepts at the means of both the BS component and the WS component of Day_c. The mid‐blue blocks show the proportion of the variance of PA explained by the fixed location effects of the BS components of NA and Day_c. Grey blocks, which represent the proportion of explained by the variance of random intercepts explained by , are too small and thus almost invisible in the plot. We can see that most (53.5%) of the variance of PA is within‐subject, as represented by the red and orange blocks in the first column of Figure 1. For the WS variance of PA specifically, 42.3% is attributed to the fixed location effects of the WS components of NA and Day_c, which is visualized by the red block in the third column of Figure 1. In terms of the BS variance of PA, the random subject intercepts at the mean of both the BS component and the WS component of Day_c, which explain 63.1% of the BS variance of PA as indicated by the light blue portion of the middle column of Figure 1, are of particular importance. The measures for the scale model are summarized in the rightmost bar of Figure 1. As shown by the proportion of the bottom dark olive green block in this bar, 23.6% of the scale of PA is explained by random scale effects. Less than 0.5% of the scale of PA is explained by covariates, which is made clear by the almost negligible proportion of the two green blocks in the middle of this bar.

Example 2: Depression study data

While the example in Section 4.1 presents a random intercepts model, in which its variance is modeled in terms of covariates, here we examine the application of our proposed framework to a MELS model with random slopes, as described in Section 1.2. The data are from Reisby et al's study on the clinical responses of 66 depressed inpatients treated with anti‐depressant medication. Here, we are interested in how patients' Hamilton depression score (HamD) changed following their weeks in the study (week). Additionally, endog is a subject‐level dummy code that is coded as 1 if the patient had endogenous depression. The location model with response variable HamD and predictor week, controlled for endog, is specified as follows: where is the individual deviation from the average intercept , and is the individual deviation from the average weekly change . The variance of the observation‐level residuals is assumed to change along weeks and across subjects as well. The distribution of the standardized random effects is given by: where is the correlation between the random intercepts and the random slopes. Likewise, and are the correlations between the random intercepts and the random scale effects, and between the random slopes and the random scale effects, respectively. Table 7 lists the parameter estimates for this model, and Figure 2 is the visualization of the proposed measures for this example. The meaning of each bar in Figure 2 can be interpreted as in Section 4.1, but for the variance of HamD instead of PA. The red blocks now represent the proportion of the variance of HamD explained by fixed location effects of the WS component of week. The orange blocks can be interpreted as the proportion of the variance of HamD that corresponds to observation‐level residuals, and the mid‐blue blocks indicate the proportion of the variance of HamD explained by fixed location effects of endog and the BS component of week. The newly added brown blocks correspond to the proportion of the variance of HamD explained by random slopes of WS components of week, and the dark blue blocks represent the proportion of the variance of HamD explained by random slopes of BS components of week. The light blue blocks still indicate the proportion of the variance of the response variable captured by the random intercepts but at the mean of .

TABLE 7

Parameter estimates of the MELS model on depression study data

Parameter	Estimate	SE	T‐value	P‐value
β0	22.657	0.702	32.27	<0.001
βweek	−2.357	0.201	−11.74	<0.001
βendog	1.533	0.924	1.66	0.102
τ0	2.062	0.212	9.72	<0.001
τweek	0.100	0.067	1.51	0.137
σω	0.538	0.116	4.65	<0.001
σv0	3.154	0.481	6.56	<0.001
σvweek	1.379	0.174	7.92	<0.001
ρv0ivweek,i	−0.207	0.180	−1.15	0.255
ρv0iω	0.464	0.235	1.98	0.053
ρvweek,iω	−0.485	0.213	−2.28	0.026

FIGURE 2

Variance partitioning for Example 2: application to depression study data

Variance partitioning for Example 2: application to depression study data Parameter estimates of the MELS model on depression study data As represented by the total proportion of the blue blocks in the first column of Figure 2, 35.8% of the variance of HamD is between‐subjects. While random slopes of BS components of week explain very little BS variance of HamD, as varies very little across subjects, the relative size of the light blue block in the second bar of Figure 2 shows that random subject intercepts at the mean of explain 94.3% of BS variance of HamD. Fixed location effects of the WS component of week explain the most (47.5%, the relative space of the red block in the third bar of Figure 2) WS variance of HamD while another 16.3% is attributed to random slopes of the WS component of week as indicated by the brown block in that bar. 13.4% of the scale of HamD is explained by random scale effects while another 1.3% is explained by WS variation in Week. Less than 0.03% of the scale of HamD is explained by the BS component of Week.

DISCUSSION

Our work extends Rights and Sterba's framework of defining measures for multilevel models to MELS models proposed by Hedeker et al and Nordgren et al. The extended framework accommodates two special features of MELS modeling: (1) observation‐level residual heteroskedasticity at different covariate values and across subjects; (2) inclusion of random location effects through either heteroskedastic random subject intercepts depending on covariates, or random subject intercepts and random subject slopes of observation‐level covariates. We believe that this standardized effect size framework can facilitate the interpretation of MELS models and encourage wider use of this type of model. In this article, our defined measures assume a two‐level MELS model. Future work can extend these measures to three‐level models, as developed by Lin et al in which WS variation of the response variable is further divided into WS variation between waves and WS variation within waves. Also, the proposed measures can be expanded to other kinds of outcomes, for example, count outcomes and ordinal outcomes. An application of a MELS model for ordinal data was discussed by Hedeker et al. Furthermore, future research might take into consideration the autocorrelation of observation‐level residuals. A recent development by Nestler extends the MELS models to include AR(1) autocorrelation influenced by subject‐level covariates and a random subject effect for the autocorrelation. Currently, our framework focuses on point estimates of measures. While reporting the point estimates is conventional for measures, researchers interested in coverage and confidence intervals of these measures can apply bootstrapping methods to calculate these quantities. There is no existing literature on bootstrapping in MELS models to our knowledge, but researchers can refer to Goldstein's discussion on bootstrapping in multilevel models. Lastly, this work focuses on defining 's for a single model, and comparisons of 's between different models are beyond the scope of this study. Researchers can refer to Rights and Sterba's recommendations on the use of differences in multilevel model comparisons for interpretation of differences between measures. Appedix S1 Supporting Information Click here for additional data file.

13 in total

1. Using Effect Size-or Why the P Value Is Not Enough.

Authors: Gail M Sullivan; Richard Feinn
Journal: J Grad Med Educ Date: 2012-09

2. A mixed ordinal location scale model for analysis of Ecological Momentary Assessment (EMA) data.

Authors: Donald Hedeker; Hakan Demirtas; Robin J Mermelstein
Journal: Stat Interface Date: 2009 Impact factor: 0.582

3. Effect size measures for longitudinal growth analyses: Extending a framework of multilevel model R-squareds to accommodate heteroscedasticity, autocorrelation, nonlinearity, and alternative centering strategies.

Authors: Jason D Rights; Sonya K Sterba
Journal: New Dir Child Adolesc Dev Date: 2021-01-29

9. An extension of the mixed-effects growth model that considers between-person differences in the within-subject variance and the autocorrelation.

Authors: Steffen Nestler
Journal: Stat Med Date: 2021-12-26 Impact factor: 2.373

10. How health behaviors relate to academic performance via affect: an intensive longitudinal study.

Authors: Lavinia Flueckiger; Roselind Lieb; Andrea H Meyer; Jutta Mata
Journal: PLoS One Date: 2014-10-29 Impact factor: 3.240

1 in total

1. Defining R-squared measures for mixed-effects location scale models.

Authors: Xingruo Zhang; Donald Hedeker
Journal: Stat Med Date: 2022-07-07 Impact factor: 2.497

1 in total

Defining R-squared measures for mixed-effects location scale models.

INTRODUCTION

Mixed‐effects location‐scale (MELS) models with random intercepts with covariate‐influenced variance

MELS models with random slopes of observation‐level covariates

THE PROPOSED METHOD

Decomposition of observation‐level covariates

Variance partitioning

Variance partitioning for MELS models with random intercepts with covariate‐influenced variance

Variance partitioning for MELS models with random slopes of observation‐level covariates

Defining measures

Implementation in R

SIMULATION STUDY

EXAMPLES

Example 1: Health behaviors data

Example 2: Depression study data

DISCUSSION

1. Using Effect Size-or Why the P Value Is Not Enough.

2. A mixed ordinal location scale model for analysis of Ecological Momentary Assessment (EMA) data.

3. Effect size measures for longitudinal growth analyses: Extending a framework of multilevel model R-squareds to accommodate heteroscedasticity, autocorrelation, nonlinearity, and alternative centering strategies.

4. Quantifying explained variance in multilevel models: An integrative framework for defining R-squared measures.

5. Imipramine: clinical effects and pharmacokinetic variability.

6. Extending the mixed-effects model to consider within-subject variance for Ecological Momentary Assessment data.

7. New Recommendations on the Use of R-Squared Differences in Multilevel Model Comparisons.

8. A 3-level Bayesian mixed effects location scale model with an application to ecological momentary assessment data.

9. An extension of the mixed-effects growth model that considers between-person differences in the within-subject variance and the autocorrelation.

10. How health behaviors relate to academic performance via affect: an intensive longitudinal study.

1. Defining R-squared measures for mixed-effects location scale models.