| Literature DB >> 22824413 |
Iben Axén1, Lennart Bodin, Alice Kongsted, Niels Wedderkopp, Irene Jensen, Gunnar Bergström.
Abstract
BACKGROUND: Repeated data collection is desirable when monitoring fluctuating conditions. Mobile phones can be used to gather such data from large groups of respondents by sending and receiving frequently repeated short questions and answers as text messages.The analysis of repeated data involves some challenges. Vital issues to consider are the within-subject correlation, the between measurement occasion correlation and the presence of missing values.The overall aim of this commentary is to describe different methods of analyzing repeated data. It is meant to give an overview for the clinical researcher in order for complex outcome measures to be interpreted in a clinically meaningful way.Entities:
Mesh:
Year: 2012 PMID: 22824413 PMCID: PMC3434072 DOI: 10.1186/1471-2288-12-105
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Percentage of patients recovered (LBP days = 0) and unrecovered in each of 18 weeks following a first visit to a chiropractor (n = 212 week one; n = 186 week 18).
Figure 2Incidence of “recovery” (reporting zero or one pain days in two consecutive weeks), stratified by previous duration (> 30 days in pink, < 30 days in blue) for the full data set.
Categories used to classify individual pain patterns by visual analysis
| Improved-Mainly recovered | 25 (12) | 8.92 (6.01) | 76 |
| Improved-Stays in the category | 64 (30) | 26.83 (14.65) | 33 |
| Improved-Fluctuating | 23 (11) | 37.43 (21.17) | 30 |
| Improved-Moves towards mainly worsened | 2 (1) | 95.00 (11.31) | 0 |
| Unchanged-Mainly recovered | 13 (6) | 5.54 (5.68) | 54 |
| Unchanged-Moves towards mainly improved | 18 (8) | 38.11 (21.81) | 50 |
| Unchanged-Stays in the category | 10 (5) | 45.60 (51.21) | 33 |
| Unchanged-Fluctuating | 40 (19) | 51.28 (30.95) | 38 |
| Unchanged-Moves towards mainly worsened | 2 (1) | 84.00 (29.70) | 0 |
| Worsened-Mainly recovered | 1 (0.5) | 12 | 0 |
| Worsened-Moves towards mainly improved | 2 (1) | 37.5 (17.68) | 50 |
| Worsened-Fluctuating | 13 (6) | 55.00 (20.29) | 0 |
| Worsened-Stays in the category | 2 (1) | 111.5 (17.68) | 50 |
The first step is to categorize the pattern in relation to the early course (improved, unchanged or worsened). Afterwards that category is combined with the relevant late course.
Figure 3A dendrogram obtained with Ward’s method, describing the formation of clusters.
DESCRIPTIVE measures, variable- oriented hypotheses
| 1A: Crude outcome | Total number of 1: days with pain from 18 weekly measures, 2: weeks reported | Summaries. | 1: Mean 33.0, Range 0 – 126 Short duration: Mean 24.5, Range 0-124 Long duration: 41.1, Range 0-126 2: Mean 15.2. Range 2-18 | 1: Mean 36.4. Range 0-126 Short duration: Mean 27.4, Range 0-124 Long duration: mean 45.4, Range 4-126 2: Mean 17.3, Range 15-18 |
| 1B: Difference in weekly outcome between groups | Average number of pain days per week | Student’s | Short duration: 1.6 Long duration: 2.8 p < 0.001 | Short duration: 1.6 Long duration: 2.6 p < 0.001 |
| 2A: Proportion with different levels of the condition | Incidence of recovered = reporting 0 pain days week by week | Proportion, i.e. percentage of subjects who are recovered compared to those who are not | Illustrated in Figure | Illustrated in Figure |
| 2B: Incidence at a prespecified time point | Proportion of patients recovered = reporting 0 or 1 pain days at the chosen time, e.g. the 5th week | Logistic regression (or other generalised linear regression models) | Short duration: 58.7% Long duration: 27.7% OR = 3.71 (2.1-6.6) RR = 1.75 (1.4 – 2.3) Long duration reference category | Short duration: 58.2% Long duration: 28.0% OR = 3.58 (1.8 - 7.0) RR = 1.72 (1.34 – 2.3) Long duration reference category |
INCIDENCE measures, variable- oriented hypotheses
| 3A: Incidence during the full study period for the whole sample and for subgroups | Recovery, i.e. reporting 0 or 1 pain days in 2 consecutive weeks = Event | Time to event analysis, with Kaplan Meier curves. Log rank test for differences between groups | Illustrated in Figure | Logrank testfor effect of previous duration: p = 0.002 |
| 3B: Incidence for the full study period in relation to the selected predictive variables | Recovery, i.e. reporting 0 or 1 pain days in 2 consecutive weeks = Event | Time to event analysis with a) Cox proportional hazard regression or b) Discrete hazard regression | Hazard ratio (HR) showing recovery, long duration reference, estimate and 95% CI: a) 1.95 (95% CI: 1.4-2.6), b) 2.03 (95% CI: 1.5-2.7). | Hazard ratio (HR) showing recovery, long duration reference, estimate and 95% CI: a) 1.95 (95% CI: 1.4-2.6), b) 2.03 (95% CI: 1.5-2.7). |
| 3 C: Time point for an event during the pain course | The time point of change in the course of pain = Event | Spline regressions, the event defined as the intersection of linear regression lines (the knot). | Short duration: knot at 4.5 weeks Long duration: knot at 5.9 weeks | Short duration: knot at 4.4 weeks Long duration: knot at 5.8 weeks |
LINEAR MODELS, variable- oriented hypotheses
| 4A: Association of baseline variables with outcome | Weekly recorded pain days, count variable, assuming a binominal distribution | Multilevel mixed-effects logistic regression or generalized estimating equation assuming a logit link function (Long previous duration reference category) | Subject specific OR = 3.31 (95% CI: 2.1-5.1) Population average OR = 1.96 (95% CI 1.4-2.6) (Note: Interaction duration*week significant) | Subject specific OR = 2.67 (95% CI: 1.6-4.5) Population average OR =1.52 (95% CI 1.1- 2.2) (Note: Interaction duration*week significant) |
| 4B: Association of baseline variables with outcome | Weekly recorded pain days, count variable, assuming a Poisson distribution | Multilevel mixed-effects Poisson regression assuming a log link function (Long previous duration reference category) | Subject specific IRR = 1.92 (95% CI: 1.5 – 2.4) (Note: Interaction duration*week significant) | Subject specific IRR = 1.82 (95% CI: 1.4 – 2.4) (Note: Interaction duration*week significant) |
| 4 C: Association of baseline variables with outcome | Weekly recorded pain days, considered a count variable and assuming a normal distribution | Generalized linear regression or mixed linear model assuming an identity link function | Average difference in pain days for Long duration – Short duration 1.20 (95% CI: 0.8 – 1.5) (Note: Interaction duration*week significant) | Average difference in pain days for Long duration –Short duration 0.95 (95% CI: 0.6-1.4) Note: Interaction duration*week significant) |
SUBGROUPS, person-based hypotheses
| 5: Are there subgroups of patients? | Subgroups as clusters with low within-cluster variation and high between-cluster variation | A. Visual inspection based on plots of the course of pain in a graphical presentation where predefined criteria of directions in early and late phases, a qualitative approach | A: Illustrated in Table | Not applied | Not applied |
| | | B: Regression coefficients from spline regression (1 knot) derived from each subject were used in Wards’ hierarchical cluster analysis. Optionally this analysis was followed by K-means cluster analysis. Inspection of number of clusters based on the Calinski-Harabasz criterion and the criteria by Duda & Hart | B: Not done due to lack of degrees of freedom in spline regression of some individual subjects | B: 4 clusters suggested. | Not applied |
| | | C: Wards’ hierarchical cluster analysis, optionally followed by K-means cluster analysis, applied directly on the weekly number of pain days for the first 8 weeks. Cluster criteria as in B. | Cluster 2: 79 | Percentage with short duration in these clusters: | C: 6 clusters suggested. Percentage with short duration the previous year in these clusters: |
| | | | C: Not applied | Cluster 1: 39 | Cluster 1: 58 |
| | | | | Cluster 2: 49 | Cluster 3: 20 |
| | | | | Cluster 3: 85 | Cluster 4: 33 |
| | | | | Cluster 4: 37 | Cluster 5: 52 |
| C: Not applied | Cluster 6: 50 | ||||