| Literature DB >> 29970000 |
Annie M Racine1,2,3, Yun Gou4, Tamara G Fong4,5,6, Edward R Marcantonio5,7, Eva M Schmitt4, Thomas G Travison4,5, Sharon K Inouye4,5,7, Richard N Jones8.
Abstract
BACKGROUND: Few studies have compared methods to correct for retest effects or practice effects in settings where an acute event could influence test performance, such as major surgery. Our goal in this study was to evaluate the use of different methods to correct for the effects of practice or retest on repeated test administration in the context of an observational study of older adults undergoing elective surgery.Entities:
Keywords: Cognitive decline; Delirium; Post-operative; Practice; Repeated measures; Retest; Surgery
Mesh:
Year: 2018 PMID: 29970000 PMCID: PMC6029140 DOI: 10.1186/s12874-018-0530-x
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Description of retest correction approaches
| Model | Analysis Sample | Retest Correction Method** | Additional Specifications to Basic Model* | Strengths | Weaknesses |
|---|---|---|---|---|---|
| 1: No correction | SAGES surgery sample (delirium-positive and delirium-negative) | No correction for retest effect | Main effect: delirium group | -Does not make manipulations to observed data | -Difficult to separate retest effects from effects due to delirium/surgery |
| 2: Mean difference correction | SAGES surgery sample (delirium-positive and delirium-negative) | Step 1: Calculate mean retest effect in NSC group: | Main effect: delirium group | -Straightforward, constant (within occasion) transformation for all people | -Potential variability due to the fact that precision of retest correction estimation is not accounted for |
| 3: Predicted difference correction | SAGES surgery sample (delirium-positive and delirium-negative), N = 560 | Step 1: Estimate linear regression equations in NSC group expressing the dependence of GCP score at follow-up on baseline GCP, centered at the overall mean of the GCP at baseline in the NSC group; The regression is estimated separately for each observation time point (follow-up month) | Main effect: delirium group | -Retest effects will be more appropriately modeled if a baseline variable is known to predict retest effects | -The degree of correction could differ by group if predictors of the retest effect differ by group |
| 4: Model-based correction | SAGES surgery sample (delirium-positive and delirium-negative), | Raw GCP is utilized for both the NSC and surgical groups, but this method differs from Model 1 (no correction) in that NSC data is modeled as the comparison group for both the delirium+ and delirium- groups. Relative differences between the delirium+ and delirium- groups are then calculated with post hoc tests. | Main effect: NSC, delirium group | -Standard errors are appropriately conservative | -Model estimates are reported in reference to the NSC group |
SAGES=Successful AGing after Elective Surgery; NSC = non-surgical controls; GCP = general cognitive performance
*Basic model: Linear mixed effects model with random intercept and time piece from 2 to 18 months; fixed time indicator variables for months 1 and 2 and time piece from 2 to 18 months; fixed covariates: baseline age, sex, non-white race
**The Approach 2 Step 1 equation defines the normative retest or practice effect [R(NSC)] as the mean difference in general cognitive performance (GCP) score from baseline among the non-surgical comparison (NSC) group. This effect is computed assuming no true change occurs within a six month time frame, or is vanishingly small relative to the practice or retest effect. The retest effect is simply the mean within-person difference between the time t follow-up and baseline observed cognitive test score (GCP). These are computed for months 1, 2 and 6 (per-protocol observation time points). Step 2 defines the retest corrected cognitive performance score (GCPRC) for a person (i) at time t as their observed score at time t minus the mean retest effect in the NSC group at time t. We set the 12 and 18 month follow-up equal to the six-month retest effect to reflect our assumption that practice or retest effects are constant following the six month follow-up
Sample characteristics at baseline by group
| Baseline sample characteristic | Non-surgical comparison ( | Successful Aging after Elective Surgery | ||
|---|---|---|---|---|
| Total ( | Delirium- ( | Delirium+ ( | ||
| Age – years, mean (SD) | 77 (5.2) | 77 (5.2) | 76 (5.2) | 77 (5.0) |
| Sex – n (%) female | 52 (44%) | 326 (58%) | 245 (58%) | 81 (60%) |
| Race/ethnicity – n (%) non-white or Hispanic | 16 (14%) | 37 (7%) | 28 (7%) | 9 (7%) |
| Marital status – n (%) married or living with partner | 73 (62%) | 332 (59%) | 253 (59%) | 79 (59%) |
| Education – years, mean (SD) | 16 (3.2) | 15 (2.9) | 15 (2.9) | 15 (3.0) |
| 3MS Score (0–100) – mean (SD) | 93.4 (5.6) | 93.5 (5.4) | 94.1 (5.1) | 91.6 (5.8) |
| Baseline GCP – mean (SD) | 58.1 (9.7) | 57.6 (7.2) | 57.5 (7.3) | 54.7 (6.5) |
| IQCODE (0–5) – mean (SD)* | 3.1 (0.19) | 3.1 (0.19) | 3.1 (0.16) | 3.1 (0.25) |
| Geriatric Depression Scale (0–15) – mean (SD)* | 1.3 (1.76) | 2.5 (2.5) | 2.3 (2.4) | 3.0 (0.3) |
| Total ADL impairment (0–14) – mean (SD) | 0.02 (0.16) | 0.81 (1.6) | 0.75 (1.5) | 1.0 (1.9) |
| Total IADL impairment (0–14) – mean (SD) | 0.12 (0.53) | 0.54 (1.18) | 0.45 (1.1) | 0.78 (1.5) |
| Physical impairment composite – mean (SD) | 50.6 (8.0) | 38.7 (10.3) | 39.3 (10.3) | 36.8 (10.0) |
| Visual impairment – n (%)* | 1 (< 1%) | 3 (< 1%) | 1 (< 1%) | 2 (< 2%) |
| Hearing impairment – n (%)* | 39 (33%) | 182 (33%) | 133 (31%) | 49 (37%) |
| Body mass index – mean (SD)* | 27 (4.6) | 29 (5.5) | 28.3 (5.6) | 29.3 (5.2) |
Continuous variables are presented as mean (SD) and dichotomous variables are presented as n (%). Successful Aging after Elective Surgery (SAGES) is grouped as the total cohort sample, the group who did not have delirium (Delirium-) and the group who developed delirium (Delirium+). Body mass index is calculated from self-reported height and weight. Visual impairment = < 20/70 corrected binocular vision. GCP = general cognitive performance; 3MS = Modified Mini Mental State Examination; IQCODE = Informant Questionnaire on Cognitive Decline in the Elderly; ADL = Activities of Daily Living; IADL = Instrumental Activities of Daily Living
*Indicates some missing data for the SAGES surgical and/or NSC samples: 2 missing Geriatric Depression Scale, 1 missing IQCODE, 8 missing body mass index (2 from NSC), 1 missing hearing impairment, 3 missing visual impairment
Fig. 1Spaghetti plots of raw general cognitive performance over time. Spaghetti plots of random samples (n = 20) per group of raw general cognitive performance (GCP) over time in the non-surgical comparison group (top panel), surgery delirium-negative group (middle panel), and the surgery delirium-positive group (bottom panel). Raw time-series data for all three groups generally show a plateauing of cognitive performance after month 6, with the most variability in the delirium group
Fig. 2Mean general cognitive performance (GCP) in the surgical vs. non-surgical comparison (NSC) groups. Raw GCP performance in the surgery group (n = 560, dashed line, 95% C.I.s in light gray) and NSC group (n = 118, solid line, 95% C.I.s in dark gray) estimated using marginal means derived from a linear mixed effects model with random intercept and fixed effects for assessment time points coded as dichotomous “dummy” variables, surgery vs. NSC dichotomous grouping variable, and no additional covariables. X-axis: assessment time point (months since surgery); Y-axis: model-estimated GCP performance
Comparison of model outputs by retest correction method
| Model 1: No Correction | Model 2: Mean Difference Correction | Model 3: Predicted Difference Correction | Model 4: Model-based Correction | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model Parameter | Del- | Del+ | Net Diff. | Del- | Del+ | Net Diff. | Del- | Del+ | Net Diff. | NSC | Del- | Del+ | Net Diff. |
| Intercept | 58.32 | 55.33 | −3.00* | 58.32 | 55.33 | − 3.00* | 58.31 | 55.35 | −2.97* | 57.86 | 58.32 | 55.41 | −2.91* |
| Change at month 1 | 1.08 | −0.20 | −1.28* | −0.73 | −2.01 | −1.28* | −0.67 | −2.27 | −1.60* | 1.80 | 1.08 | −0.20 | −1.28* |
| Change at month 2 | 1.39 | 2.33 | 0.94* | 1.08 | 2.02 | 0.94* | 1.04 | 2.17 | 1.14* | 0.32 | 1.39 | 2.33 (0.25) | 0.94* |
| Change from months 2–18 | 0.23 | −0.11 | − 0.34 | 0.40 | 0.06 | −0.34 | 0.39 | 0.07 | −0.32 | −0.10 | 0.23 | −0.11 | − 0.34 |
| Model | 0.32 | 0.27 | 0.31 | 0.25 | 0.31 | 0.25 | 0.47 | 0.32 | 0.27 | ||||
| Model | 0.35 | 0.33 | 0.33 | 0.37 | |||||||||
Comparison of model outputs for baseline GCP (intercept), change in GCP from baseline to 1 month, change in GCP from month 1 to 2, and estimated GCP slope from 2 to 18 months by retest correction method. Model outputs are presented as parameter estimates (and standard errors) for intercept, time indicator variables at months 1 and 2, and time-slope from months 2–18 by group for each retest correction method
Del- = surgery Delirium-negative group (n = 426), Del + = surgery delirium-positive group (n = 134), NSC = non-surgical comparison group (n = 118). GCP = General Cognitive Performance. For models 1–3, Del- is the reference group; for model 4, NSC is the reference group. Net diff = net difference calculated as the β-coefficient in the surgery delirium- group subtracted from the β-coefficient in the surgery delirium+ group. R is the squared correlations of observed and model-implied outcome values
*Indicates a significant difference between delirium groups at α = 0.05
Fig. 3General cognitive performance (GCP) over time using the four retest correction methods. Models 1-3) SAGES surgical sample by delirium group: Delirium-negative (n = 426, solid line, 95% C.I.s in dark gray) and Delirium-positive (n = 134, dashed line, 95% C.I.s in medium gray); Model 4) SAGES surgical sample by delirium group and non-surgical comparison (NSC) sample (n = 118, dotted line, 95% C.I.s in light gray). X-axis: assessment time point (months since surgery); Y-axis: model-based estimated GCP at the means of covariates (age = 77 years, 56% female, 1% nonwhite race/ethnicity, education = 15 years)
Fig. 4Absolute mean general cognitive performance by delirium status. Absolute mean general cognitive performance (GCP) in the (a) Delirium-negative and (b) Delirium-positive groups using raw/uncorrected GCP (dark gray squares, used in Models 1 and 4), mean difference-corrected GCP (black diamonds, used in Model 2), and predicted difference correction (light gray circles, used in Model 3). X-axis: assessment time point (months since surgery); Y-axis: mean GCP performance. Means and standard deviations for each time point by group are provided in Table 4
Raw and corrected mean (standard deviation) GCP at each visit by group
| NSC | Delirium-Negative | Delirium-Positive | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Visit | Raw GCP | Mean Difference-Corrected GCP | Predicted Difference-Corrected GCP | Raw GCP | Mean Difference-Corrected GCP | Predicted Difference-Corrected GCP | Raw GCP | Mean Difference-Corrected GCP | Predicted Difference-Corrected GCP |
| Month 0 | 58.14 | N/A | N/A | 58.51 | 58.51 | 58.51 | 54.73 | 54.73 | 54.73 |
| Month 1 | 59.60 | N/A | N/A | 59.58 | 57.90 | 57.96 | 54.54 | 52.84 | 52.58 |
| Month 2 | 60.20 | N/A | N/A | 61.01 | 58.86 | 58.88 | 56.82 | 54.72 | 54.61 |
| Month 6 | 60.49 | N/A | N/A | 61.10 | 59.26 | 59.26 | 57.31 | 55.48 | 55.38 |
| Month 12 | N/A | N/A | N/A | 61.44 | 59.56 | 59.56 | 57.10 | 55.26 | 55.17 |
| Month 18 | 60.59 | N/A | N/A | 61.46 | 59.64 | 59.64 | 56.77 | 54.92 | 54.82 |
This table contrasts the mean absolute scores of the raw and retest-corrected General Cognitive Performance (GCP) across the three clinical groups of interest: non-surgical comparison group (NSC; n = 118), surgical patients who did not develop delirium (Delirium-negative; n = 426), and surgical patients who developed post-operative delirium (Delirium-positive, n = 134). Approaches 1 and 4 both utilized raw GCP data; however, Approach 1 only compared the delirium-positive and delirium-negative groups while Approach 4 first compared both surgical groups to the NSC group and subsequently performed post hoc tests to compare the delirium-positive and delirium-negative groups. Approach 2 utilized mean difference-corrected GCP and Approach 3 utilized predicted difference-corrected GCP