Literature DB >> 32300374

Correlation Coefficients for a Study with Repeated Measures.

Abstract

Repeated measures are increasingly collected in a study to investigate the trajectory of measures over time. One of the first research questions is to determine the correlation between two measures. The following five methods for correlation calculation are compared: (1) Pearson correlation; (2) correlation of subject means; (3) partial correlation for subject effect; (4) partial correlation for visit effect; and (5) a mixed model approach. Pearson correlation coefficient is traditionally used in a cross-sectional study. Pearson correlation is close to the correlations computed from mixed-effects models that consider the correlation structure, but Pearson correlation may not be theoretically appropriate in a repeated-measure study as it ignores the correlation of the outcomes from multiple visits within the same subject. We compare these methods with regard to the average of correlation and the mean squared error. In general, correlation under the mixed-effects model with the compound symmetric structure is recommended as its correlation is close to the nominal level with small mean square error.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2020 PMID： 32300374 PMCID： PMC7136761 DOI： 10.1155/2020/7398324

Source DB: PubMed Journal: Comput Math Methods Med ISSN： 1748-670X Impact factor: 2.238

1. Introduction

Repeated-measure designs are increasingly used in practice to evaluate the trajectory of measures. The Alzheimer's Disease Neuroimaging Initiative (ADNI) study is a longitudinal study to investigate the progression of Alzheimer's disease (AD) [1, 2]. This study evaluates the normal cognitive aging with the focus on mild cognitive impairment (MCI) and early AD. Brain structure and function are two research areas of interest in the ADNI study. As expected, brain structure volumes are often highly associated with results from cognitive tests [3-5]. In a longitudinal study, correlation for repeated measures should be calculated and reported. However, recent articles still only reported the Pearson correlation coefficient that ignores the correlation of outcomes from the same subject. For these reasons, it is important to compare the existing correlations for repeated measures and make recommendations for other researchers to use. Bland and Altman [6, 7] discussed several approaches to compute correlations for repeated measures. They proposed calculating subject means to compute the Pearson correlation, where subject means eliminate the correlation of outcomes from the same subject. The second approach is to fit a linear regression model with one measure as the dependent variable and the other measure and the subject as the predictor variables. The second approach is similar to the one proposed by Christensen [8] who suggested computing correlation after adjusting for the subject effect [9-12]. In a repeated-measure study, the visit effect is the correlation within the subject. Lipsitz et al. [13] proposed computing partial correlation adjusting the visit effect. When data are correlated, mixed-effects models may be utilized to analyze data while controlling for these additional correlations. Lam et al. [14] were among the first to propose computing correlation between repeated measures under the compound symmetric (CS) correlation structure. Later, Hamlett et al. [15] developed programs to compute correlation under the CS structure by using the commercially available statistical software, SAS. In the work by Lam et al. [14], they also computed the correlation under the autoregressive correlation structure, AR(1). After that, Roy [16] developed SAS macros to compute correlation under the AR(1) structure and compared the correlations for repeated measures under these two correlation structures with limited simulation studies. The objective of this manuscript is to conduct extensive simulation studies to compare the existing correlations for repeated measures with regard to the average of correlation and the mean squared error (MSE) and identify the correlation method that has the best performance to be used in practice. In addition to the parameter of interest (correlation for repeated measures), there are several nuisance parameters in the variance-covariance matrix: variances, correlations within each outcome, and correlation between outcomes from different visits [17-20]. It is computationally intensive for these comparisons. We have to use supercomputers for simulation studies. However, it is computationally feasible to calculate correlations for an observed data set. We use one example from the ADNI study to illustrate the application of the considered methods to calculate correlation between hippocampal volumes and a neuropsychological assessment to evaluate verbal memory. We organize this article as follows. In Section 2, we introduce the existing methods to calculate correlations for repeated measures. In Section 3, we conduct extensive Monte Carlo simulation studies to compare the performance of the considered correlations with regard to the average of correlation and the MSE. A real example from the ADNI study is then used to illustrate the application of these correlations. Lastly, we provide conclusions in Section 4 on computing correlation for repeated measures when heterogeneity of correlation is observed across visits.

2. Methods

For a repeated-measure study with n participants, each participant has several scheduled visits (m visits for the i-th subject). Suppose U and W are the two measures in a repeated-measure study and U and W are the outcomes of the i-th subject at the j-th visit, where i = 1, 2,…, n and j = 1, 2,…, m. The correlation between U and W, ρ, is the parameter of interest to quantify a relationship between them. Several methods have been proposed to calculate ρ, including independence models, partial correlation models, and mixed-effects models.

2.1. Independent Assumption

Bland and Altman [6, 7] were among the first to provide methods to compute longitudinal correlation coefficient. One of their approaches assumes the independence between outcomes from the same subject: U ⊥ U and W⊥W. The longitudinal correlation ρ is computed as the Pearson correlation by ignoring the correlation structure from repeated measures. This approach is referred to as the I approach, with the computed correlation as ρ. This is a naive approach that is easy to apply. Irimata and Li [21] found that ρ for a pharmacokinetics data set is very close to other correlations computed from other complicated models.

2.2. Subject Means

As suggested by Bland and Altman [6], the correlation can be computed by using the averages at the subject level to eliminate the subject effect in repeated measures. This correlation is able to address the research question whether the average of one measure is related to the average of another. When correlation within each measure is large, ρ at different visits should be similar to each other, and this average correlation model would have good performance. We refer to this correlation approach as the M approach with the notation of ρ. These two correlations for repeated measures, ρ and ρ, are the Pearson correlation and can be computed by using many statistical software: such as the Proc corr procedure in SAS and the function cor or cor.test in R [22]. The next five correlations are computed from regression models (e.g., mixed-effects models), and we would like to suggest using SAS Proc mixed procedure for implementation. Detailed SAS programs are provided in the Appendix.

2.3. Correlation Adjusting for the Subject Effect

Christensen [8] proposed computing correlation for repeated measures by partialling out the subject effect. The subject effect can be removed from the two measures by fitting a multivariate regression model with both measures being the outcomes and the subject ID as the only covariate. The residuals are used to compute the final correlation, which is essentially a partial correlation method for repeated data. This correlation is referred to as the PS correlation that partials out the subject effect, ρ.

2.4. Correlation Adjusting for the Visit Effect

In the ρ calculation, the correlation between the two measures is included in the multivariate model. In addition to that correlation, another correlation between measures at different visits may be considered. Lipsitz et al. [13] proposed computing partial correlation between outcome and one of the covariates by using this approach. When one of the two measures (e.g., measure U) is considered as the dependent variable, the other measure (W) is considered as the covariate. The correlation structure between visits is assumed to be compound symmetric. We refer this correlation as the ρ correlation. We use ρ for another correlation when W is considered as the dependent variable in the model. One of the properties for correlation is ρ=ρ, but this property is not met here: ρ is generally not equal to ρ.

2.5. Mixed-Effects Model

Let Y=(U, W, U, W,…, U, W) be the outcomes from the i-th subject, with the vector length of 2m. The complete data can be reorganized in a long format, with the columns subject ID, visit, mtype, and outcome, where mtype = “U” for the U measure and mtype = “W” for the W measure. The long format utilizes 2m rows for the outcomes from Y. The linear mixed-effects model is presented aswhere X and Z are the design matrices for the fixed effect and the random effect, respectively. The random effect b follows a multivariate normal distribution N (0, D), and the measurement error ϵ follows a multivariate normal distribution N (0, R). The detailed formula for D and R may be found in the article by Hamlett et al. [15]. The fixed effect is β = (β0, β, β)′, where β0 is the intercept, and β and β are the fixed effects of U and W, respectively. Correlation between U and W is computed aswhich is assumed to be independent of the visit. Each subject has multiple visits, correlation within U is Corr(U, U)=ρ, and the correlation within W is Corr(W, W)=ρ, where d (j − j′) = 1 for the CS structure and d (j − j′) = |j − j′| for the AR(1) structure. Since W is correlated with both U and W, therefore, U and W are correlated and their correlation is assumed to be δρUW, where δ is a factor which is generally less than 1. Let σ2 and σ2 be the variances of U and W, respectively. These variances and covariances are used to derive the variance-covariance matrix under the CS structure (see Lam et al. [14] and Hamlett et al. [15]) and that under the AR(1) structure (see Lam et al. [14] and Roy [16]).

3. Results

We conduct simulation studies to compare the performance of the considered 7 methods for the correlation between repeated measures for a study with four visits. The mean values of U and W are assumed to be (2.0, 1.9, 1.7, 1.4) and (0.8, 0.7, 0.6, 0.5), with both measures decreasing as time goes. Such data are commonly available from cognitive tests on elderly population and other studies. The prespecified correlation for repeated measures is ρ=0.2, 0.5, and 0.8. In the simulation studies for the AR(1) structure for the visit effect, the correlation within U is Corr(U, U)=ρ|, with ρ=0.2, 0.5, and 0.8, and the correlation within W is Corr(W, W)=ρ|, with ρ=0.2, 0.5, and 0.8. The factor δ in the correlation between U and W is assumed to be 0.6 in all simulations. The considered variances are σ2=1 and 3 and σ2=0.5 and 1. The variance-covariance matrix can be separated into two parts: ZDZ′ and R. We assume that a quarter of variance is from R and the remaining is from ZDZ′. This weight is needed in order to calculate the covariances. For each configuration, we simulate B = 2,000 data sets. Under the AR(1) structure for the visit effect, Figure 1 presents the average of correlation ρ and the MSE when ρ=0.2, σ2=1, and n = 60 subjects. The MSE is defined aswhere is the estimator of ρ by using the b-th simulated data set. It can be seen that the correlations adjusting the visit effect, ρ and ρ, often underestimate the correlation, while the correlation adjusting the subject effect, ρ, always overestimate the correlation. The remaining methods have correlations close to the nominal level. Although ρ is the best with the correlation around the nominal level, its MSE is much larger than the ones that have the correlations close to the nominal level. In the calculation of ρ, each subject only has one outcome for each measure, as compared to multiple outcomes in other correlation calculations. Due to the reduced number of outcomes, the variance of ρ is much large that leads to a large MSE. It is noted that ρ or ρ could have the lowest MSE in some cases, but their estimated correlations are generally much below the nominal level. For this reason, we exclude ρ and ρ in the following simulation studies. When a study has the same number of visits for each subject, the estimated correlation by using the mixed-effects model with the CS structure, ρ, is very similar to ρ under the independent assumption. The other mixed-effects model correlation ρ has a similar correlation as ρ and ρ. The MSE of ρ is slightly smaller than the MSEs of ρ and ρ when the correlations within U or W are small, and this trend is reversed when ρ and ρ are large. Similar results are observed when σ2 is increased to 3.

Figure 1

Average correlation and the MSE for the 7 methods under the AR(1) correlation structure when ρ = 0.2, σ2=1, and n = 60.

When ρ is increased to 0.5 (the top plot in Figure 2), the averages of ρ, ρ, and ρ are generally above the nominal level, and the first two correlations are closer to the nominal level as compared to the third correlation ρ. We also present the correlation estimates when sample size n is 100 in Figure 2. It can be seen that the MSEs become smaller as compared to the MSEs in the top plot (Figure 2) when sample size is 60.

Figure 2

Average correlation and the MSE under the AR(1) correlation structure with n = 60 (top) and n = 100 (bottom) when ρ = 0.5 and σ2=1.

Figure 3 shows the results when data sets are simulated under the CS structure given ρ=0.5, σ2=1, and n = 60. Correlation ρ does not perform well with the average correlations much below the nominal level in many configurations. We also found that ρ is likely to overestimate the correlation. It seems that ρ and ρ have different trajectories as ρ increases. Both of these methods do not have satisfactory performance with regard to correlation under the CS structure, although ρ has very good correlation estimates under the AR(1) structure. The other three correlations (ρ, ρ, and ρ) have similar good performance with regard to correlation and the MSE. It should be noted that the variance-covariance matrix is not positively defined when ρ=ρ=0.8. Therefore, data sets cannot be generated for that configuration. We also simulate data under the unstructured correlation structure and found that ρ, ρ, and ρ are still the best correlation estimates.

Figure 3

Average correlation and the MSE under the CS correlation structure when ρ = 0.5, σ2=1, and n = 60.

The aforementioned simulations have data sets that each subject has the same number of visits. In practice, it is possible that the number of visits may not be exactly the same for all subjects. We assume the number of visits is either 2, 3, or 4. Each subject is randomly assigned to have 2, 3, or 4 visits with the same probability. We present the results with n = 60 in Figure 4 when variances are small (σ2=1 and σ2=0.5 and 1) and large (σ2=20 and σ2=10 and 30). The MSE of ρ is slightly smaller than that of ρ, and their biggest difference occurs when both ρ and ρ are large. ρ is more likely to overestimate the correlation. Although ρ has the correlation very close to the nominal level, it has the largest MSE as compared to other correlations. When variance is large, ρ and ρ are the best correlations with the estimated correlations much closer to the nominal level as to the configurations with small variances. The mixed-effects model correlation ρ performs slightly better than ρ with regard to the average of correlation and the MSE.

Figure 4

Average correlation and the MSE under the AR(1) correlation structure with a small variance σ2=1 (top) and a large variance σ2=20 (bottom) when ρ = 0.5 and n = 60 for a study with unequal numbers of visits (2, 3, or 4 visits).

3.1. Example

We use one data set from the ADNI study to illustrate the application of the considered correlation methods, with 47 participants who had 5-year visits and completed imaging volumes and memory scores. Hippocampal volumes are found to be highly associated with the delayed recall scores from the Rey Auditory Verbal Learning Test (RAVLT delayed recall) [23]. The RAVLT delayed recall has the possible integer score from 0 to 15, which is often used to assess verbal memory. The higher the score is, the better the memory is. The computed correlations are presented in Table 1. Participants in this data set have the same number of visits. For this reason, ρ is very similar to ρ. ρ is slightly larger than them, and ρ is smaller than them. Correlation adjusted by the subject effect ρ is much smaller than ρ. Correlations adjusted by the visit effect highly depend on which variable is considered as the dependent variable in the linear regression model. When hippocampal volumes are used as the dependent variable, the estimated correlation is high (0.686), and it becomes too low (0.016) when RAVLT delayed recalls are considered as the dependent variable.

Table 1

Correlation between hippocampal volumes and RAVLT delayed recall scores using 47 participants with 5 visits from the ADNI study.

	ρ _I	ρ _M	ρ _PS	ρ _PVa	ρ _PVb	ρ _CS	ρ _AR
Left hippocampal and RAVLT delayed recall scores	0.421	0.468	0.151	0.016	0.686	0.421	0.392
Left hippocampal and RAVLT immediate recall scores	0.352	0.421	0.208	0.023	0.447	0.365	0.399
Right hippocampal and RAVLT delayed recall scores	0.361	0.398	0.149	0.014	0.652	0.361	0.327
Right hippocampal and RAVLT immediate recall scores	0.316	0.373	0.211	0.021	0.443	0.335	0.343

It was reported by Wang et al. [23] that the Pearson correlation ρ between hippocampal volumes and RAVLT delayed recall scores is slightly above 0.4. They also provided the Pearson correlations for each group (AD, MCI, and control) which are all below the correlation using combined samples. The correlation within the control group is the lowest. From Table 1, RAVLT delayed recall scores always have a larger correlation with left hippocampal volumes than the correlation with right hippocampal volumes for each correlation method. We also add RAVLT immediate recall scores to further illustrate the application of the considered methods. Its correlation with left hippocampal volumes is often larger than its correlation with right hippocampal volumes. The estimated ρ between hippocampal volumes and RAVLT delayed recalls is larger than that between hippocampal volumes and RAVLT immediate recalls.

4. Conclusions

From the simulation studies, ρ under the independence assumption and ρ using the mixed-effects model with CS variance-covariance structure are shown to have similar correlation estimates when subjects have the same number of visits. But, ρ is appropriate as it models the data properly. The mixed-effects model correlation ρ is recommended for use as its correlation is close to the nominal level with small mean square error.

5. Discussions

Lam et al. [14] derived the detailed variance and covariance. The variances σ2 and σ2 and covariance σ are used to calculate ρ. These variances and covariance estimates are not exactly the same from the independent model and the mixed-effects model with the CS structure: σ2=16.6846 in the ρ calculation and 16.6136 from the CS model. Because these estimated variances and covariance are very close between these two methods, the final estimated correlations are very similar. When a study has different number of follow-up for each participant, ρ and ρ differ from each other [18, 24–26]. For a study with some possible outliers as seen in the data testing association between pH and PaCO2 [6], their difference is substantial. We provide the SAS programs by using that example in the Appendix. When CS or AR(1) correlation structure for the visit effect is applied in the mixed-effects models [10, 25, 27], the computed correlation is the same at different visits. In the observation of the heterogeneity of correlations at different visits, the unstructured correlation may be considered for the visit effect. Under the heterogeneity assumption, correlation can be computed at each visit from a mixed-effects model [28-30]. This brings some challenges to explain the results, such as the overall correlation and the trend of correlation. Alternatively, when a study has a monotonic relationship between correlation and visit, one may include an additional predictor: visit, in the statistical model, to calculate a monotonic correlation for repeated measures.

19 in total

1. Estimating correlation coefficient between two variables with repeated observations using mixed effects model.

Authors: Anuradha Roy
Journal: Biom J Date: 2006-04 Impact factor: 2.207

2. A better confidence interval for the sensitivity at a fixed level of specificity for diagnostic tests with continuous endpoints.

Authors: Guogen Shan
Journal: Stat Methods Med Res Date: 2016-09-30 Impact factor: 3.021

3. Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

Authors: Guogen Shan; Weizhen Wang
Journal: Stat Methods Med Res Date: 2014-10-06 Impact factor: 3.021

4. Exact p-values for Simon's two-stage designs in clinical trials.

Authors: Guogen Shan; Hua Zhang; Tao Jiang; Hanna Peterson; Daniel Young; Changxing Ma
Journal: Stat Biosci Date: 2016-06-16

5. Exact Unconditional Tests for Dichotomous Data When Comparing Multiple Treatments With a Single Control.

Authors: Guogen Shan; Carolee Dodge-Francis; Gregory E Wilding
Journal: Ther Innov Regul Sci Date: 2020-01-06 Impact factor: 1.778

6. Calculating correlation coefficients with repeated observations: Part 1--Correlation within subjects.

Authors: J M Bland; D G Altman
Journal: BMJ Date: 1995-02-18

7. Exact confidence limits for the probability of response in two-stage designs.

Authors: Guogen Shan
Journal: Statistics (Ber) Date: 2018-05-08 Impact factor: 1.051

8. The longitudinal associations between cognition, mood and striatal dopaminergic binding in Parkinson's Disease.

Authors: Ece Bayram; Nikki Kaplan; Guogen Shan; Jessica Z K Caldwell
Journal: Neuropsychol Dev Cogn B Aging Neuropsychol Cogn Date: 2019-08-14

9. The Relationship Between Hippocampal Volumes and Delayed Recall Is Modified by APOE ε4 in Mild Cognitive Impairment.

Authors: Xiwu Wang; Wenjun Zhou; Teng Ye; Xiaodong Lin; Jie Zhang
Journal: Front Aging Neurosci Date: 2019-02-26 Impact factor: 5.750

10. Fisher's exact approach for post hoc analysis of a chi-squared test.

Authors: Guogen Shan; Shawn Gerstenberger
Journal: PLoS One Date: 2017-12-20 Impact factor: 3.240

7 in total

1. Methods for characterizing ovarian and adrenal hormone variability and mood relationships in peripubertal females.

Authors: Elizabeth Andersen; Serena Fiacco; Jennifer Gordon; Rachel Kozik; Kayla Baresich; David Rubinow; Susan Girdler
Journal: Psychoneuroendocrinology Date: 2022-03-25 Impact factor: 4.693

2. Self-Reported Social Relationship Capacities Predict Motor, Functional and Cognitive Decline in Huntington's Disease.

Authors: Pablo Lemercier; Laurent Cleret de Langavant; Jennifer Hamet Bagnou; Katia Youssov; Laurie Lemoine; Etienne Audureau; Renaud Massart; Anne-Catherine Bachoud-Lévi
Journal: J Pers Med Date: 2022-01-27

3. Sex-dependent jugular vein optical attenuation and distension during head-down tilt and lower body negative pressure.

Authors: Courtney A Patterson; Robert Amelard; Essi Saarikoski; Hannah Heigold; Richard L Hughson; Andrew D Robertson
Journal: Physiol Rep Date: 2022-02

4. Respiratory Motion and Airflow Estimation During Sleep Using Tracheal Movement and Sound.

Authors: Nasim Montazeri Ghahjaverestan; Wei Fan; Cristiano Aguiar; Jackson Yu; T Douglas Bradley
Journal: Nat Sci Sleep Date: 2022-07-01

5. Mobile sonouroflowmetry using voiding sound and volume.

Authors: Elie El Helou; Joy Naba; Karim Youssef; Georges Mjaess; Ghassan Sleilaty; Samar Helou
Journal: Sci Rep Date: 2021-05-27 Impact factor: 4.379

6. Testing Clinical Intuitions About Barriers to Improvement in Cognitive-Behavioral Therapy for Panic Disorder.

Authors: Rachel A Schwartz; Dianne L Chambless; Jacques P Barber; Barbara Milrod
Journal: Behav Ther Date: 2021-01-01

7. Remodeling the Skeletal Muscle Extracellular Matrix in Older Age-Effects of Acute Exercise Stimuli on Gene Expression.

Authors: Matthias Gumpenberger; Barbara Wessner; Alexandra Graf; Marco V Narici; Christian Fink; Sepp Braun; Christian Hoser; Anthony J Blazevich; Robert Csapo
Journal: Int J Mol Sci Date: 2020-09-25 Impact factor: 5.923

7 in total