| Literature DB >> 28572781 |
Yue Xiao1, Hongyun Liu1,2, Hui Li1.
Abstract
The Thurstonian item response theory (IRT) model allows estimating the latent trait scores of respondents directly through their responses in forced-choice questionnaires. It solves a part of problems brought by the traditional scoring methods of this kind of questionnaires. However, the forced-choice designs may still have their own limitations: The model may encounter underidentification and non-convergence and the test may show low test reliability in simple test designs (e.g., test designs with only a small number of traits measured or short length). To overcome these weaknesses, the present study applied the Thurstonian IRT model and the Graded Response Model to a different test format that comprises both forced-choice blocks and Likert-type items. And the Likert items should have low social desirability. A Monte Carlo simulation study is used to investigate how the mixed response format performs under various conditions. Four factors are considered: the number of traits, test length, the percentage of Likert items, and the proportion of pairs composed of items keyed in opposite directions. Results reveal that the mixed response format can be superior to the forced-choice format, especially in simple designs where the latter performs poorly. Besides the number of Likert items needed is small. One point to note is that researchers need to choose Likert items cautiously as Likert items may bring other response biases to the test. Discussion and suggestions are given to construct personality tests that can resist faking as much as possible and have acceptable reliability.Entities:
Keywords: Likert scale; forced-choice questionnaire; mixed test format; personality test; simulation
Year: 2017 PMID: 28572781 PMCID: PMC5435816 DOI: 10.3389/fpsyg.2017.00806
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1The combination of the Thurstonian IRT model and the Graded Response Model.
Model convergence rates (%) under all conditions.
| 2 | 5:1 | 0 | 83 | 94 | 100 | 100 |
| 20 | 95 | 100 | 100 | 100 | ||
| 10:1 | 0 | 98 | 100 | 100 | 100 | |
| 20 | 100 | 100 | 100 | 100 | ||
| 5 | 5:1 | 0 | 100 | 100 | 100 | 100 |
| 20 | 100 | 100 | 100 | 100 | ||
| 10:1 | 0 | 100 | 100 | 100 | 100 | |
| 20 | 100 | 100 | 100 | 100 | ||
| Average | 97 | 99 | 100 | 100 | ||
The test length is expressed as the ratio of the number of all questions in the test to the number of traits.
Conditions with extreme estimated SEs and the corresponding numbers of replications in which some estimated SEs were larger than 10.
| 5:1 | 0 | 0 | 10 | 13 | 0 |
| 0 | 20 | 0 | 0 | 8 | |
| 20 | 0 | 5 | 8 | 0 | |
| 10:1 | 0 | 0 | 3 | 4 | 0 |
| 20 | 0 | 2 | 2 | 0 | |
The five designs listed all measure two traits.
The test length is expressed as the ratio of the number of all questions in the test to the number of traits.
Figure 2RMSE-values for two type of parameter estimates (thresholds and factor loadings of forced-choice items) under different conditions. Each column presents the change tendency of one type of parameter estimates as the percentage of Likert items increases in different conditions. The panels (A–C) correspond to the RMSE of thresholds as a function of percentage of Likert items in different conditions of three factors. The panels (D–F) correspond to the RMSE of forced-choice items' loadings as a function of percentage of Likert items in different conditions of three factors.
Figure 3RMSE-values for another two types of parameter estimates (factor loadings of Likert items and intertrait correlations) under different conditions. Each column presents the change tendency of one type of parameter estimates as the percentage of Likert items increases in different conditions. The panels (A–C) correspond to the RMSE of Likert items' loadings as a function of percentage of Likert items in different conditions of three factors. The panels (D–F) correspond to the RMSE of intertrait correlations as a function of percentage of Likert items in different conditions of three factors.
Figure 4RMSE-values for estimated standard errors (. Each column presents the change tendency of one type of estimated SEs as the percentage of Likert items increases in different conditions.The panels (A–C) correspond to the RMSE for thresholds' SEs as a function of percentage of Likert items in different conditions of three factors. The panels (D–F) correspond to the RMSE for SEs of forced-choice items' loadings as a function of percentage of Likert items in different conditions of three factors.
Figure 5RMSE-values for estimated standard errors (. Each column presents the change tendency of one type of estimated SEs as the percentage of Likert items increases in different conditions.The panels (A–C) correspond to the RMSE for SEs of Likert items' loadings as a function of percentage of Likert items in different conditions of three factors. The panels (D–F) correspond to the RMSE for intertrait correlations' SEs as a function of percentage of Likert items in different conditions of three factors.
Average actual reliabilities under all conditions.
| 2 | 5:1 | 0 | 0.238 | 0.477 | 0.586 | 0.619 |
| 20 | 0.489 | 0.568 | 0.607 | 0.662 | ||
| 10:1 | 0 | 0.266 | 0.616 | 0.746 | 0.797 | |
| 20 | 0.556 | 0.753 | 0.797 | 0.807 | ||
| 5 | 5:1 | 0 | 0.550 | 0.559 | 0.611 | 0.659 |
| 20 | 0.539 | 0.599 | 0.644 | 0.676 | ||
| 10:1 | 0 | 0.634 | 0.699 | 0.760 | 0.799 | |
| 20 | 0.720 | 0.760 | 0.771 | 0.786 | ||
| Average | 0.499 | 0.629 | 0.690 | 0.726 | ||
The test length is expressed as the ratio of the number of all questions in the test to the number of traits.
Figure 6Average actual reliabilities in different test design conditions. The panels (A–C) correspond to average test reliability as a function of percentage of Likert items in different conditions of three factors.