| Literature DB >> 22363411 |
Rochelle E Tractenberg1, Futoshi Yumoto, Paul S Aisen, Jeffrey A Kaye, Robert J Mislevy.
Abstract
We used a Guttman model to represent responses to test items over time as an approximation of what is often referred to as "points lost" in studies of cognitive decline or interventions. To capture this meaning of "point loss", over four successive assessments, we assumed that once an item is incorrect, it cannot be correct at a later visit. If the loss of a point represents actual decline, then failure of an item to fit the Guttman model over time can be considered measurement error. This representation and definition of measurement error also permits testing the hypotheses that measurement error is constant for items in a test, and that error is independent of "true score", which are two key consequences of the definition of "measurement error"--and thereby, reliability--under Classical Test Theory. We tested the hypotheses by fitting our model to, and comparing our results from, four consecutive annual evaluations in three groups of elderly persons: a) cognitively normal (NC, N = 149); b) diagnosed with possible or probable AD (N = 78); and c) cognitively normal initially and a later diagnosis of AD (converters, N = 133). Of 16 items that converged, error-free measurement of "cognitive loss" was observed for 10 items in NC, eight in converters, and two in AD. We found that measurement error, as we defined it, was inconsistent over time and across cognitive functioning levels, violating the theory underlying reliability and other psychometric characteristics, and key regression assumptions.Entities:
Mesh:
Year: 2012 PMID: 22363411 PMCID: PMC3281811 DOI: 10.1371/journal.pone.0030019
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Example Guttman Scale response patterns for one item over four visits.
| Observed response pattern on one item | Time 1 | Time 2 | Time 3 | Time 4 |
| Pattern 1 | 1 | 1 | 1 | 1 |
| Pattern 2 | 1 | 1 | 1 | 0 |
| Pattern 3 | 1 | 1 | 0 | 0 |
| Pattern 4 | 1 | 0 | 0 | 0 |
| Pattern 5 | 0 | 0 | 0 | 0 |
| Pattern 1 of 11 | 0 | 1 | 0 | 0 |
Notes: 1 indicates the item was answered correctly; 0 indicates it was incorrect. Patterns in the first five rows are consistent with the Guttman scale. NB: the first and fifth patterns (1111, 0000) do not represent decline since individuals with either pattern of responses to this item over the four visits either always or never exhibited the ability to answer correctly (respectively). Both patterns are consistent with a Guttman Model because each shows the expected consistency in what an item reflects about the individual's state/ability.
*indicates one example pattern of the 11 other possible outcomes for one item over four visits; none of these other patterns is consistent with a Guttman Model since the item is shown to have been correct after not being correct at an earlier visit. There are a total of 16 (24) patterns of right (1) and wrong (0) responses on this item, but only the first five response patterns in this table represent error-free measurement of decline for the item. The proportion of the sample that does not exhibit one of these five patterns over four years is the estimated measurement error for the item.
Descriptive Statistics (% or Mean (SD)) for three cohorts of elderly MMSE respondents with four consecutive visits.
| NC (N = 149) | Converters (N = 133) | AD (N = 78) | |
| Age (Time 1) | 83.6 (6.7) | 84.3 (6.9) | 70.8 (9.3) |
| % Female | 63% | 62% | 46% |
| Education (yrs) | 13.9 (2.7) | 14.0 (2.8) | 13.7 (3.3) |
| MMSE Total: Time 1 | 28.6 (1.3) | 27.8 (1.7) | 22.2 (4.6) |
| MMSE Total: Time 2 | 28.4 (1.3) | 27.8 (1.8) | 20.9 (5.6) |
| MMSE Total: Time 3 | 28.4 (1.3) | 27.4 (2.2) | 17.8 (6.8) |
| MMSE Total: Time 4 | 28.6 (1.3) | 27.1 (2.6) | 14.5 (7.7) |
| MMSE 16 items: Time 1 | 14.9 (0.9) | 14.4 (1.2) | 11.2 (2.6) |
| MMSE 16 items: Time 2 | 14.7 (1.0) | 14.4 (1.3) | 10.6 (3.1) |
| MMSE 16 items: Time 3 | 14.8 (1.0) | 14.3 (1.4) | 8.8 (3.7) |
| MMSE 16 items: Time 4 | 14.8 (1.1) | 14.1 (1.6) | 7.0 (4.1) |
MMSE Total: range from 0–30. MMSE 16 items: sum of 0/1 score on the 16 items shown in Tables 3 and 4.
π* statistics (standard error), reflecting badness of fit of a Guttman model to each modeled MMSE item over four years.
| MMSE Item | Over All Groups | NC | Converters | AD |
| Year |
|
|
| 0.103 (.035) |
| Season | 0.164 (.020) | 0.101 (.025) | 0.152 (.031) | 0.308 (.053) |
| Date | 0.269 (.039) | 0.235 (.035) | 0.152 (.031) | 0.231 (.041) |
| Day | 0.103 (.016) |
|
| 0.282 (.052) |
| Month |
|
|
| 0.231 (.048) |
| State |
|
|
|
|
| City |
|
|
| 0.128 (.038) |
| WORLD | 0.264 (.025) | 0.241 (.036) | 0.299 (.043) | 0.245 (.063) |
| 3 word recall | 0.431 (.026) | 0.403 (.090) | 0.451 (.043) |
|
| Paper on floor | 0.144 (.019) |
| 0.158 (.032) | 0.333 (.054) |
| Pencil |
|
|
|
|
| Watch |
|
|
|
|
| No ifs/ands/buts | 0.302 (.024) | 0.302 (.038) | 0.371 (.042) | 0.182 (.045) |
| Read |
|
|
|
|
| Write |
|
|
| 0.179 (.044) |
| Copy | 0.248 (.023) | 0.255 (.036) | 0.278 (.039) | 0.182 (.045) |
π* estimates the proportion of observations that are inconsistent with the model under investigation. Low values of π* suggest that very little (100%×π*) of the data do not fit the model under investigation. Bold values of π* indicate acceptably LOW (<10%) levels of misfit; that is, bold values indicate consistency of the item with the Guttman (‘real’ loss) Model.
This item was recoded so that all possible points right = 1 and any mistakes = 0.
These items were each assigned one point (i.e., not treated as one point together). Items not represented in this table did not have 0/1 coding (name 3 items), had too much missing data (what floor are we on? What county are we in?) or failed to converge (take this paper, fold it in half) in all 3 groups (and over all responses) so estimates of π* were not computable.
Dissimilarity Index (DI) values per item, over all participants and separately by cohort (* indicates that the solution for DI did not converge so no index value was calculated).
| MMSE Item | ALL | NC | Converters | AD |
| Year |
|
|
| 0.077 |
| Season | 0.072 |
| 0.065 | 0.123 |
| Date |
| 0.120 | 0.067 | 0.110 |
| Day | 0.059 |
|
| 0.125 |
| Month | 0.068 |
|
| 0.131 |
| State |
|
|
| 0.056 |
| City | 0.079 |
|
| 0.100 |
| WORLD | 0.111 | 0.093 | 0.150 | 0.125 |
| 3 word recall | 0.144 | 0.119 | 0.088 |
|
| Paper on floor | 0.084 |
| 0.074 | 0.109 |
| Pencil |
|
|
|
|
| Watch |
|
|
|
|
| No ifs/ands/buts | 0.060 | 0.121 | 0.103 | 0.084 |
| Read |
|
|
|
|
| Write |
|
|
| 0.275 |
| Copy |
| 0.060 | 0.136 | 0.150 |
Note: Dissimilarity indices computed for each item represent how well the model under investigation produces expected distributions of response patterns (e.g., from Table 1) that are consistent with observed response patterns for each item. Higher values suggest less consistency between observed and expected values; one recommended cutoff for the index is 0.05 (Dayton, 1998) but this is essentially an arbitrary index value cutoff. Bold items have values below 0.055.
*indicates additional convergence problems when DI was computed.
This item was recoded so that all possible points right = 1 and any mistakes = 0.
These items were each assigned one point (i.e., not treated as one point together). Items not represented in this table did not have 0/1 coding (name 3 items), had too much missing data (what floor are we on? What county are we in?) or failed to converge (take this paper, fold it in half) in all 3 groups (and over all responses) so estimates of π* were not computable.