| Literature DB >> 26903916 |
Shengyu Jiang1, Chun Wang1, David J Weiss1.
Abstract
Likert types of rating scales in which a respondent chooses a response from an ordered set of response options are used to measure a wide variety of psychological, educational, and medical outcome variables. The most appropriate item response theory model for analyzing and scoring these instruments when they provide scores on multiple scales is the multidimensional graded response model (MGRM) A simulation study was conducted to investigate the variables that might affect item parameter recovery for the MGRM. Data were generated based on different sample sizes, test lengths, and scale intercorrelations. Parameter estimates were obtained through the flexMIRT software. The quality of parameter recovery was assessed by the correlation between true and estimated parameters as well as bias and root-mean-square-error. Results indicated that for the vast majority of cases studied a sample size of N = 500 provided accurate parameter estimates, except for tests with 240 items when 1000 examinees were necessary to obtain accurate parameter estimates. Increasing sample size beyond N = 1000 did not increase the accuracy of MGRM parameter estimates.Entities:
Keywords: graded response model; item parameters; multidimensionality; parameter recovery; sample size
Year: 2016 PMID: 26903916 PMCID: PMC4746434 DOI: 10.3389/fpsyg.2016.00109
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Bias, RMSE, and correlations of parameter estimates for .
Effect sizes (η.
| Test Length | 0.281 | 0.307 | 0.268 | 0.244 | 0.000 | 0.236 |
| Sample Size | 0.005 | 0.016 | 0.014 | 0.003 | 0.001 | 0.000 |
| Correlation | 0.001 | 0.005 | 0.001 | 0.001 | 0.000 | 0.003 |
| Test Length × Sample Size | 0.007 | 0.008 | 0.004 | 0.015 | 0.003 | 0.002 |
| Test Length × Correlation | 0.008 | 0.010 | 0.004 | 0.001 | 0.004 | 0.014 |
| Sample Size × Correlation | 0.000 | 0.000 | 0.001 | 0.001 | 0.000 | 0.000 |
| Test Length × Sample Size × Correlation | 0.000 | 0.002 | 0.000 | 0.001 | 0.001 | 0.000 |
| Residual | 0.698 | 0.652 | 0.708 | 0.734 | 0.991 | 0.746 |
All effect size η where SS.
Figure 2Bias of parameter estimates for three test lengths. (A) Discrimination parameters. (B) Boundary parameters.
Effect sizes (η.
| Test Length | 0.055 | 0.048 | 0.049 | 0.226 | 0.218 | 0.180 |
| Sample Size | 0.295 | 0.335 | 0.336 | 0.215 | 0.266 | 0.244 |
| Correlation | 0.005 | 0.002 | 0.002 | 0.000 | 0.001 | 0.005 |
| Test Length × Sample Size | 0.030 | 0.049 | 0.024 | 0.030 | 0.008 | 0.012 |
| Test Length × Correlation | 0.006 | 0.020 | 0.008 | 0.004 | 0.000 | 0.006 |
| Sample Size × Correlation | 0.000 | 0.000 | 0.001 | 0.001 | 0.000 | 0.000 |
| Test Length × Sample Size × Correlation | 0.000 | 0.001 | 0.000 | 0.001 | 0.001 | 0.000 |
| Residual | 0.609 | 0.545 | 0.579 | 0.524 | 0.506 | 0.552 |
Figure 3RMSE of parameter estimates for four sample sizes. (A) Discrimination parameters. (B) Boundary parameters.
Figure 4RMSE of boundary parameters for three test lengths.
Effect sizes (η.
| Test Length | 0.031 | 0.024 | 0.027 | 0.001 | 0.029 | 0.001 |
| Sample Size | 0.345 | 0.276 | 0.367 | 0.511 | 0.459 | 0.519 |
| Correlation | 0.000 | 0.002 | 0.000 | 0.016 | 0.025 | 0.009 |
| Test Length × Sample Size | 0.009 | 0.028 | 0.009 | 0.011 | 0.001 | 0.013 |
| Test Length × Correlation | 0.001 | 0.001 | 0.001 | 0.004 | 0.021 | 0.005 |
| Sample Size × Correlation | 0.000 | 0.001 | 0.000 | 0.003 | 0.002 | 0.000 |
| Test Length × Sample Size × Correlation | 0.000 | 0.002 | 0.001 | 0.000 | 0.001 | 0.001 |
| Residual | 0.614 | 0.665 | 0.595 | 0.456 | 0.462 | 0.452 |
Figure 5Correlation between true and estimated parameters for four sample sizes. (A) Discrimination parameters. (B) Boundary parameters.