Aline Dugravot, Severine Sabia, Martin J Shipley, Catherine Welch, Mika Kivimaki, Archana Singh-Manoux.
Abstract
BACKGROUND: Participants' non-adherence to protocol affects data quality. In longitudinal studies, this leads to outliers that can be present at the level of the population or the individual. The purpose of the present study is to elaborate a method for detection of outliers in a study of cognitive ageing.
Year: 2015 PMID: 26161552 PMCID: PMC4498688 DOI: 10.1371/journal.pone.0132110
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Sample selection, Whitehall II Study.
Comparison of annual change between 1997–99 and 2007–09 with that between 1997–99 and 2012–13.
| | N | 1997–99 to 2007–09, M (SD) | 1997–99 to 2012–13, M (SD) | p-value |
|---|---|---|---|---|
| | | | | |
| Memory | 5067 | -0.07 (0.27) | -0.07 (0.18) | 0.875 |
| Reasoning | 5104 | -0.28 (0.65) | -0.24 (0.45) | <0.001 |
| Phonemic fluency | 5104 | -0.16 (0.42) | -0.13 (0.27) | <0.001 |
| Semantic fluency | 5108 | -0.13 (0.37) | -0.12 (0.24) | 0.02 |
| Vocabulary | 5109 | 0.02 (0.26) | 0.01 (0.18) | 0.01 |
| | | | | |
| Memory | 4279 | -0.07 (0.26) | -0.07 (0.18) | 0.29 |
| Reasoning | 4300 | -0.28 (0.63) | -0.22 (0.44) | <0.001 |
| Phonemic fluency | 4300 | -0.16 (0.40) | -0.12 (0.26) | <0.001 |
| Semantic fluency | 4304 | -0.12 (0.36) | -0.11 (0.24) | 0.02 |
| Vocabulary | 4304 | 0.02 (0.24) | 0.01 (0.17) | 0.17 |
| | | | | |
| Memory | 788 | -0.09 (0.33) | -0.11 (0.20) | 0.10 |
| Reasoning (AH4-I) | 804 | -0.29 (0.73) | -0.31 (0.50) | 0.16 |
| Phonemic fluency | 804 | -0.15 (0.50) | -0.15 (0.31) | 0.64 |
| Semantic fluency | 804 | -0.15 (0.42) | -0.14 (0.27) | 0.75 |
| Vocabulary | 805 | 0.01 (0.34) | -0.02 (0.23) | 0.01 |
Abbreviations: M mean, SD standard deviation.
Comparison of residuals: from 1997–99 to 2007–09 compared with 2012–13.
| | Range | Across first 3 waves (1997–99 to 2007–09), M (SD) | At fourth wave (2012–13), M (SD) | Threshold used to define outliers* |
|---|---|---|---|---|
| | | | | |
| Reasoning (AH4-I) | 0–65 | 0.00 (2.96) | 1.68 (4.69) | 5.92 |
| Phonemic fluency | 0–35 | 0.00 (1.92) | 0.54 (2.93) | 3.84 |
| Semantic fluency | 0–35 | 0.00 (1.71) | 0.34 (2.64) | 3.42 |
| | | | | |
| Reasoning (AH4-I) | 0–65 | 0.00 (2.98) | 0.44 (4.75) | 5.96 |
| Phonemic fluency | 0–35 | 0.00 (1.94) | 0.22 (2.86) | 3.88 |
| Semantic fluency | 0–35 | 0.00 (1.77) | 0.09 (2.58) | 3.54 |
*Represents 2SD of the residual from the prediction model across first 3 waves.
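The 2 SD rule described in this footnote can be illustrated with a minimal sketch. It is not the authors' code: the function names, the example scores, the use of the sample SD, and the choice to flag only scores that exceed the prediction (rather than deviations in either direction) are all assumptions made for illustration.

```python
from statistics import stdev


def outlier_threshold(residuals, k=2.0):
    """Return k times the SD of residuals from a prediction model.

    Assumption: the sample SD is used; the source does not say which
    SD estimator was applied.
    """
    return k * stdev(residuals)


def flag_outliers(observed, predicted, threshold):
    """Flag scores that exceed their predicted value by more than threshold.

    Assumption: only positive residuals (observed above predicted) are
    flagged; the direction is not stated in the table footnote.
    """
    return [(obs - pred) > threshold for obs, pred in zip(observed, predicted)]


# Reported residual SD for reasoning (first block of the table): 2.96,
# which gives the tabulated threshold of 5.92.
reasoning_threshold = 2 * 2.96

# Hypothetical fourth-wave scores and model predictions (not study data).
observed_scores = [50, 45, 40]
predicted_scores = [43, 44, 41]
flags = flag_outliers(observed_scores, predicted_scores, reasoning_threshold)
```

In this sketch only the first hypothetical participant, whose score is 7 points above the prediction, would be flagged, because 7 exceeds the 5.92 threshold.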
Characteristics of outliers compared to others assessed at the clinic in 2012–13.
| | Reasoning (N = 5516) | | | Phonemic fluency (N = 5513) | | | Semantic fluency (N = 5516) | | |
|---|---|---|---|---|---|---|---|---|---|
| 2012–13 characteristics | Outliers | Others | Age-adjusted p-value* | Outliers | Others | Age-adjusted p-value* | Outliers | Others | Age-adjusted p-value* |
| N (%) | 434 (7.87) | 5082 (92.13) | | 377 (6.84) | 5136 (93.16) | | 370 (6.71) | 5146 (93.29) | |
| Age (years) | 70.43 (5.58) | 69.41 (5.79) | <0.001 | 68.63 (5.53) | 69.56 (5.79) | 0.003 | 69.15 (5.77) | 69.52 (5.78) | 0.23 |
| Men, N (%) | 298 (68.66) | 3692 (72.65) | 0.08 | 269 (71.35) | 3720 (72.43) | 0.62 | 286 (77.30) | 3704 (71.98) | 0.03 |
| Education, N (%) | | | | | | | | | |
| Lower secondary | 199 (45.85) | 2083 (40.99) | | 125 (33.16) | 2155 (41.96) | | 125 (33.78) | 2157 (41.92) | |
| Secondary school | 116 (26.73) | 1387 (27.29) | 0.31 | 113 (29.97) | 1390 (27.06) | 0.02 | 104 (28.11) | 1399 (27.19) | 0.01 |
| University | 119 (27.42) | 1612 (31.72) | | 136 (36.87) | 1591 (30.98) | | 141 (38.11) | 1590 (30.90) | |
| | 48.45 (10.25) | 42.85 (11.34) | <0.001 | 21.44 (4.20) | 14.70 (3.94) | <0.001 | 20.33 (3.17) | 14.55 (3.73) | <0.001 |
| | | | | | | | | | |
| | -0.43 (0.77) | -0.27 (0.63) | <0.001 | -0.24 (0.52) | -0.15 (0.41) | <0.001 | -0.21 (0.43) | -0.12 (0.36) | <0.001 |
| | 0.25 (0.60) | -0.26 (0.48) | <0.001 | 0.21 (0.35) | -0.15 (0.29) | <0.001 | 0.15 (0.31) | -0.14 (0.27) | <0.001 |
| | | | | | | | | | |
| Memory | 5.55 (2.19) | 6.05 (2.39) | 0.002 | 6.27 (2.47) | 5.99 (2.37) | 0.21 | 6.29 (2.36) | 5.99 (2.38) | 0.04 |
| Vocabulary | 24.22 (6.20) | 25.26 (4.59) | <0.001 | 25.38 (50.4) | 25.17 (4.71) | 0.59 | 26.05 (4.29) | 25.12 (4.77) | <0.001 |
| MMSE | 28.14 (1.60) | 28.35 (1.58) | 0.08 | 28.35 (1.37) | 28.33 (1.59) | 0.65 | 28.45 (1.36) | 28.32 (1.59) | 0.26 |
| Trail Making Test A | 37.87 (14.99) | 39.04 (18.17) | 0.01 | 34.78 (11.10) | 39.25 (18.27) | <0.001 | 34.30 (11.54) | 39.28 (18.27) | <0.001 |
| Trail Making Test B | 81.42 (37.41) | 85.22 (39.02) | 0.001 | 76.26 (33.21) | 85.54 (39.22) | <0.001 | 73.15 (27.55) | 85.77 (39.46) | <0.001 |
*For difference between outliers and non-outliers
Figures are means (SD) unless stated otherwise.
Mean annual change over the first 3 waves (1997/99 to 2007/09) and over all 4 waves (1997/99 to 2012/13) before and after exclusion of outliers.
| | Before exclusion of outliers | | | | After exclusion of outliers | | | |
|---|---|---|---|---|---|---|---|---|
| | N | 1997–99 to 2007–09, M (SD) | 1997–99 to 2012–13, M (SD) | p-value | N | 1997–99 to 2007–09, M (SD) | 1997–99 to 2012–13, M (SD) | p-value |
| Reasoning | 5104 | -0.28 (0.65) | -0.24 (0.45) | <0.001 | 4725 | -0.27 (0.63) | -0.27 (0.44) | 0.46 |
| Phonemic fluency | 5104 | -0.16 (0.42) | -0.13 (0.27) | <0.001 | 4757 | -0.15 (0.41) | -0.15 (0.25) | 0.96 |
| Semantic fluency | 5108 | -0.13 (0.37) | -0.12 (0.24) | 0.02 | 4763 | -0.12 (0.36) | -0.13 (0.24) | 0.001 |
Abbreviations: M mean, SD standard deviation.
Estimated difference in cognitive decline over 10 years, as a function of diabetes status in 1997/99.
| | Before exclusion, Beta | After exclusion, Beta |
|---|---|---|
| | | |
| | Nobs = 20847, N = 6269 | Nobs = 20482, N = 6269 |
| Normoglycaemia | | |
| Diabetes | -0.79 (-1.38, -0.19) | -0.84 (-1.45, -0.24) |
| | | |
| | Nobs = 20881, N = 6281 | Nobs = 20556, N = 6281 |
| Normoglycaemia | | |
| Diabetes | -0.29 (-0.62, 0.05) | -0.46 (-0.79, -0.13) |
| | | |
| | Nobs = 20882, N = 6282 | Nobs = 20561, N = 6282 |
| Normoglycaemia | | |
| Diabetes | -0.15 (-0.46, 0.15) | -0.16 (-0.46, 0.14) |
*P<0.01.
†Models with age as time-scale and adjusted for year of birth, sex, education, and their interaction with age. Betas are for difference in cognitive change over 10 years.
‡ 10-year cognitive change in the reference group.