| Literature DB >> 29433533 |
Morten Wærsted1, Taran Svenssen Børnick2, Jos W R Twisk3,4, Kaj Bo Veiersted2.
Abstract
OBJECTIVE: Missing data in longitudinal studies may constitute a source of bias. We suggest three simple missing data indicators for the initial phase of getting an overview of the missingness pattern in a dataset with a high number of follow-ups. Possible use of the indicators is exemplified in two datasets allowing wave nonresponse; a Norwegian dataset of 420 subjects examined at 21 occasions during 6.5 years and a Dutch dataset of 350 subjects with ten repeated measurements over a period of 35 years.Entities:
Keywords: Attrition; Longitudinal study; Missing data; Missing data indicators; Patterns of missingness; Wave nonresponse
Mesh:
Year: 2018 PMID: 29433533 PMCID: PMC5809924 DOI: 10.1186/s13104-018-3228-6
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Bivariate relationships in the two sample datasets
| Missing data indicators | Last response | Retention | Dispersion | |||
|---|---|---|---|---|---|---|
| Median (percentiles 25–75%) | p | Median (percentiles 25–75%) | p | Median (percentiles 25–75%) | p | |
|
| ||||||
| Gender | ||||||
| Men (n = 153) | 85 (25–100) | .43 | 30 (10–75) | .016 | 65 (18–88) | 1.0 |
| Woman (n = 267) | 90 (35–100) | 50 (20–80) | 50 (22–89) | |||
| Parents’ country of origin | ||||||
| Both western (n = 368) | 95 (35–100) | .015 | 50 (20–80) | < .001 | 64 (21–90) | .050 |
| One or both non-western (n = 52) | 53 (20–100) | 20 (5–44) | 43 (5–78) | |||
| Smoking (n = 419) | ||||||
| Never/former/sometimes (n = 279) | 100 (35–100) | .005 | 55 (20–83) | .001 | 67 (22–92) | .017 |
| Every day (n = 140) | 65 (30–100) | 35 (10–60) | 50 (19–76) | |||
| Self–reported health | ||||||
| Good/very good (n = 306) | 95 (30–100) | .41 | 40 (15–80) | .84 | 54 (19–88) | .71 |
| Not quite good/poor (n = 114) | 78 (34–100) | 45 (20–80) | 51 (26–88) | |||
| Neck and shoulder pain last 4 weeks | ||||||
| No (0–1) (n = 279) | 95 (30–100) | .73 | 40 (15–80) | .99 | 58 (19–90) | .92 |
| Yes (2–12) (n = 141) | 80 (35–100 | 50 (18–75) | 64 (28–85) | |||
|
| ||||||
| Gender | ||||||
| Men (n = 169) | 100 (56–100) | .93 | 56 (33–100) | .25 | 50 (0–100) | .42 |
| Woman (n = 267) | 100 (56–100) | 67 (33–100) | 62 (0–100) | |||
Fig. 1Examples on response patterns and missing data indicator scores. Each line represents the response pattern of one subject, listed according to increasing scores on the Retention indicator. The scores are standardized between 0 and 100. A subject responding only at baseline will get a score of zero on all indicators (subject 1); while a subject responding to all follow-ups will get a score of 100 on all indicators (subject 23). The 23 response patterns illustrated in this figure are all drawn from observed response patterns among the 420 subjects of the Norwegian sample dataset, with the exception for subject 16 who is added for illustrative purposes. See main text for more detailed description of the three missing data indicators
Fig. 2Distribution of the three missing data indicators. The scores of the 420 participants of the Norwegian dataset (a) and the 350 participants of the Dutch dataset (b). In each dataset the scores of the Dispersion indicator are for comparison put in bins centered on the same values as the possible values of the other two indicators. The pairwise Spearman correlation coefficients for the Last response (LR), Retention (Ret) and Dispersion (Dis) indicators were LR-Ret 0.82, LR-Dis 0.79, Ret-Dis 0.76 (Norwegian dataset) and LR-Ret 0.68, LR-Dis 0.82, Ret-Dis 0.74 (Dutch dataset). All these correlation coefficients were highly significant (p < .001)