| Literature DB >> 27893845 |
Ulrich Schroeders1, Oliver Wilhelm2, Gabriel Olaru2.
Abstract
The advent of large-scale assessment, but also the more frequent use of longitudinal and multivariate approaches to measurement in psychological, educational, and sociological research, caused an increased demand for psychometrically sound short scales. Shortening scales economizes on valuable administration time, but might result in inadequate measures because reducing an item set could: a) change the internal structure of the measure, b) result in poorer reliability and measurement precision, c) deliver measures that cannot effectively discriminate between persons on the intended ability spectrum, and d) reduce test-criterion relations. Different approaches to abbreviate measures fare differently with respect to the above-mentioned problems. Therefore, we compare the quality and efficiency of three item selection strategies to derive short scales from an existing long version: a Stepwise COnfirmatory Factor Analytical approach (SCOFA) that maximizes factor loadings and two metaheuristics, specifically an Ant Colony Optimization (ACO) with a tailored user-defined optimization function and a Genetic Algorithm (GA) with an unspecific cost-reduction function. SCOFA compiled short versions were highly reliable, but had poor validity. In contrast, both metaheuristics outperformed SCOFA and produced efficient and psychometrically sound short versions (unidimensional, reliable, sensitive, and valid). We discuss under which circumstances ACO and GA produce equivalent results and provide recommendations for conditions in which it is advisable to use a metaheuristic with an unspecific out-of-the-box optimization function.Entities:
Mesh:
Year: 2016 PMID: 27893845 PMCID: PMC5125670 DOI: 10.1371/journal.pone.0167110
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Illustration of the Genetic Algorithm.
In the first iteration, the Genetic Algorithm randomly selects four item sets, which are highlighted in color, from an initial pool of 10 item sets. Each two of these sets are used to produce a new item set based on two principles: 1) subsetting and recombination (= crossover) and 2) random changes to the item set (= mutation). The newly assembled short versions are evaluated and ordered according to an optimization function (= fitness evaluation). The ordering influences the selection probability of an item to be assembled in a short version in the next iteration.
Description of Cognitive and Motivational Measures.
| Covariate | Variable label | Description |
|---|---|---|
| Reading speed | rsg9_sc3/ rsci_sc3 | Based on the test construction principles of the two |
| Math competence | mag9_sc1 | Math competence was assessed in four content areas: a) quantity, b) space and shape, c) change and relationships, and d) data and chance. The test consisted of 22 items with a simple multiple-choice, a complex multiple-choice, or a short constructed response format. Testing time was 28 minutes [ |
| Reading competence | reg9_sc1 | The reading competence test assessed students’ abilities to find relevant information in a given text, draw text-related conclusions, and reflect on and evaluate these information. Students were asked to read 5 texts of different genres (informational, argumentative, literary, instructional, and advertising texts) and answer 5 item sets with a total number of 31 items. The response format of the items were mostly multiple choice. Testing time was 28 minutes [ |
| Perceptual speed | dgg9_sc3a | Perceptual speed was assessed with a symbol-digit- test. Under severe time constraints participants had to assign the correct digit to the corresponding symbol. Secondary school students and adults worked on 93 items with a time limit of 90 seconds. [ |
| Reasoning | dgg9_sc3b | Reasoning ability was measured with a matrices test; students were asked to detect the regularities by which geometric figures change and to choose a missing figure out of six possible response alternatives. The test included 12 matrices; testing time was 3 minutes [ |
| Interest German | t66208a –t66208d | Students had to evaluate the extent to which statements regarding their interest in the German language applied to them (e.g., “I enjoy reading and writing texts. . .”) on a four-point rating scale (“does not apply at all”, “does not really apply”, “applies to some extent”, “applies completely”). |
| Motivation German | t66400a –t66400d | To assess student’s motivation in German, they had to evaluate four statements on a four-point scale ("does not apply at all", "does not really apply", "applies to some extent", "applies completely"). Example item: "I study in German class because I enjoy the subject matter" |
| German native language | t413000_g1D | Students select if German is their native language. |
| Grade German and math | t724101 and t724102 | Refers to their grade on last year’s final report card in German/math classes, ranging in accordance with the German grading system from “very good (1)”, “good (2)”, “satisfactory (3)”, “passing (4)”, “poor (5)”, to “failing (6)”. In other words, lower values represent better grades. |
Additional information on the items and constructs can be found in the codebook of the scientific user file (Leibniz Institute for Educational Trajectories, 2016) or through the references given at the end of the short descriptions. The second column gives the original variable labels in the data set.
Model Fit of Original and Shortened Versions.
| Model | χ2WLSMV | CFI | RMSEA | |
|---|---|---|---|---|
| SCOFA|15 | 410.94 | 90 | .993 | .016 |
| ACO|15 | 606.62 | 90 | .986 | .020 |
| GA|15 | 1,006.44 | 90 | .979 | .027 |
| SCOFA|20 | 777.51 | 170 | .990 | .016 |
| ACO|20 | 1,410.10 | 170 | .979 | .022 |
| GA|20 | 1,576.96 | 170 | .977 | .024 |
| SCOFA|25 | 1,409.38 | 275 | .986 | .017 |
| ACO|25 | 1,923.25 | 275 | .978 | .020 |
| GA|25 | 2,482.45 | 275 | .973 | .024 |
| 89 | 22,944.35 | 3,827 | .933 | .019 |
With respect to the models: Letters indicate the item selection algorithm: SCOFA = Stepwise Confirmatory Factor Analysis; ACO = Ant Colony Optimization; GA = Genetic Algorithm; Numbers refer to the number of items in the abridged version; 89 = Original version with 89 items. WLSMV = Weighted Least Squares Mean and Variance adjusted. CFI = Comparative Fit Index. RMSEA = Root Mean Square Error of Approximation.
Fig 2Distribution of Factor Loadings for the Original and Shortened Versions.
A boxplot covers the interquartile range; the solid line refers to the median. Letters indicate the item selection algorithm: S = Stepwise Confirmatory Factor Analysis; A = Ant Colony Optimization; G = Genetic Algorithm; Numbers refer to the number of items in the abridged version; 89 = Original version with 89 items.
Fig 4Correlation of the Original and Shortened Versions to Covariates.
Letters indicate the item selection algorithm: S = Stepwise Confirmatory Factor Analysis; A = Ant Colony Optimization; G = Genetic Algorithm; Numbers refer to the number of items in the abridged version; 89 = Original version with 89 items.
Fig 3Distributions of the Item Difficulties of the Original and Shortened Versions.
A boxplot covers the interquartile range; the solid line within a boxplot refers to the median. The gray solid line points out the guessing probability; the gray dashed line refers to the optimum item difficulty of .625. Letters indicate the item selection algorithm: S = Stepwise Confirmatory Factor Analysis; A = Ant Colony Optimization; G = Genetic Algorithm; Numbers refer to the number of items in the abridged version; 89 = Original version with 89 items. The horizontal gray line indicates the guessing probability (P(x) = .25).