| Literature DB >> 28490822 |
W W Koczkodaj1, T Kakiashvili2, A Szymańska3, J Montero-Marin4, R Araya5, J Garcia-Campayo6, K Rutkowski7, D Strzałka8.
Abstract
Rating scales are used to elicit data about qualitative entities (e.g., research collaboration). This study presents an innovative method for reducing the number of rating scale items without the predictability loss. The "area under the receiver operator curve method" (AUC ROC) is used. The presented method has reduced the number of rating scale items (variables) to 28.57% (from 21 to 6) making over 70% of collected data unnecessary. Results have been verified by two methods of analysis: Graded Response Model (GRM) and Confirmatory Factor Analysis (CFA). GRM revealed that the new method differentiates observations of high and middle scores. CFA proved that the reliability of the rating scale has not deteriorated by the scale item reduction. Both statistical analysis evidenced usefulness of the AUC ROC reduction method.Entities:
Keywords: Prediction; Rating scale; Receiver operator characteristic; Reduction
Year: 2017 PMID: 28490822 PMCID: PMC5400800 DOI: 10.1007/s11192-017-2283-4
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.238
Fig. 1AUC for the running total of all variables
The confusion matrix
| True positives | False positives |
| False negative | True negative |
AUC of running variable totals
|
|
|
|
|
|
|
|
| 0.725 | 0.777 | 0.795 | 0.810 | 0.813 | 0.822 | 0.821 |
|
|
|
|
|
|
|
|
| 0.819 | 0.820 | 0.821 | 0.821 | 0.821 | 0.821 | 0.820 |
|
|
|
|
|
|
|
|
| 0.819 | 0.818 | 0.816 | 0.814 | 0.812 | 0.811 | 0.812 |
AUC of individual variables in the original data
| Var | AUC | Var | AUC | Var | AUC |
|---|---|---|---|---|---|
| 21 | 0.587468 | 12 | 0.636791 | 17 | 0.674283 |
| 11 | 0.597342 | 13 | 0.648917 | 15 | 0.692064 |
| 16 | 0.605937 | 4 | 0.651187 | 10 | 0.697225 |
| 6 | 0.610004 | 3 | 0.655666 | 9 | 0.700461 |
| 18 | 0.610028 | 5 | 0.658478 | 7 | 0.701489 |
| 19 | 0.629285 | 20 | 0.666999 | 14 | 0.707401 |
| 2 | 0.631205 | 8 | 0.667983 | 1 | 0.725009 |
Fig. 2System R code for GRM models
Likelihood ratio for the full GRM model
| AIC | BIC | log.Lik | LRT |
|
| |
|---|---|---|---|---|---|---|
| Fit1 | 25494.12 | 25772.35 | −12683.06 | |||
| Fit2 | 25367.63 | 25732.81 | −12599.81 | 166.49 | 20 |
|
Unconstrained GRM model results for the full rating scale and the item discrimination power
| Extrmt1 | Extrmt2 | Extrmt3 | Dscrmn | |
|---|---|---|---|---|
| V1 |
|
|
|
|
| V2 | 0.214 | 1.536 | 2.485 | 1.315 |
| V3 | –0.094 | 1.306 | 3.077 | 1.528 |
| V4 | –0.447 | 1.673 | 3.511 | 1.268 |
| V5 | –0.709 | 1.903 | 2.717 | 1.440 |
| V6 | –0.073 | 1.679 | 2.194 | 0.860 |
| V7 | – |
|
|
|
| V8 | –0.641 | 0.876 | 2.157 | 1.461 |
| V9 |
|
|
|
|
| V10 | – |
|
|
|
| V11 | –0.881 | 2.187 | 3.280 | 0.767 |
| V12 | –0.242 | 1.282 | 2.221 | 1.271 |
| V13 | –0.351 | 1.631 | 2.660 | 1.054 |
| V14 | – |
|
|
|
| V15 | – |
|
|
|
| V16 | –2.364 | 0.634 | 2.388 | 0.764 |
| V17 | –0.482 | 1.287 | 2.366 | 1.296 |
| V18 | –1.685 | 0.847 | 2.024 | 0.902 |
| V19 | –1.623 | 0.366 | 2.648 | 1.078 |
| V20 | –0.575 | 1.227 | 2.066 | 1.643 |
| V21 | 1.271 | 2.240 | 3.531 | 0.870 |
Test information curve
| Total information = 56.21 | |
| Information in (−4, 4) = 52 (92.51%) | |
| Based on all the items | |
| Total information = 19.62 | |
| Information in (−4, 4) = 18.97 (96.65%) | |
| Based on items 1, 7, 9, 10, 14, 15 |
Fig. 3System R code for CFA
Fig. 4CFA model for the rating scale with all items presented in AMOS graphics
Parameter estimates of the full rating scale
| Parameters | Standardized | Non-standardized | Standardized error |
|---|---|---|---|
|
|
| 1.000 | |
|
| 0.604 | 0.844 | 0.048 |
|
| 0.655 | 0.914 | 0.047 |
|
| 0.585 | 0.818 | 0.049 |
|
| 0.633 | 0.884 | 0.049 |
|
| 0.454 | 0.634 | 0.052 |
|
|
| 0.889 | 0.048 |
|
| 0.645 | 0.901 | 0.045 |
|
|
| 0.883 | 0.049 |
|
|
| 0.731 | 0.053 |
|
| 0.394 | 0.550 | 0.053 |
|
| 0.594 | 0.830 | 0.048 |
|
| 0.514 | 0.718 | 0.056 |
|
|
| 1.029 | 0.049 |
|
|
| 0.913 | 0.050 |
|
| 0.413 | 0.578 | 0.055 |
|
| 0.589 | 0.822 | 0.047 |
|
| 0.468 | 0.653 | 0.053 |
|
| 0.520 | 0.726 | 0.050 |
|
| 0.681 | 0.952 | 0.047 |
|
| 0.433 | 0.605 | 0.067 |
Fig. 5CFA Model with a reduced number of items presented in AMOS graphics
Parameter estimates for the reduced scale
| Parameters | Standardized | Non-standardized | Standardized error |
|---|---|---|---|
|
| 0.748 | 1.000 | |
|
| 0.614 | 0.821 | 0.054 |
|
| 0.703 | 0.940 | 0.059 |
|
| 0.534 | 0.714 | 0.061 |
|
| 0.736 | 0.984 | 0.059 |
|
| 0.816 | 0.816 | 0.057 |
Results of fit statistics for two rating scale models
| Statistics for the full and reduced rating scale models | |
|---|---|
| Chi2 = 437.899 | Chi2 = 30.883 |
|
|
|
| CFI = .950 | CFI = .983 |
| RMSEA = .048 | RMSEA = .065 |
Results of CR and VE of two models
| Rating scale | CR | VE |
|---|---|---|
| Full rating scale | 0.929 | 0.394 |
| Reduced scale | 0.822 | 0.483 |