| Literature DB >> 34366694 |
Seyyed Morteza Hashemi Toroujeni1.
Abstract
Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on computerized testing may be used to investigate the interchangeability of CFLT and PPBT mean scores to corroborate if test takers' testing performance is influenced by the effects of testing administration mode; specifically, if validity and reliability of two versions of the same test are affected. This research was conducted to probe not only score interchangeability across testing modes but also to explore the role of age and gender stereotypes, item review, ICT literacy and attitudes towards computer use as moderator variables in test takers' reading achievement in CFLT. Fifty-eight EFL learners homogeneous in both general English and reading skills assigned into one testing group participated in this study. Three different versions of TOEFL reading comprehension test, Computer Attitude Scale (CAS), and ICT literacy Scale of TOEFL Examinees were used in this crossover quasi-controlled empirical study with a common-person and pretest-posttest design to collect data. The findings demonstrated that although the reading scores of test takers were interchangeable in both CFLT and PPBT versions regarding testing administration modes, they were different regarding item review. Furthermore, no significant interaction was found between age, gender, and ICT literacy and CFLT performance. However, attitudes towards the use of computer led to a significant change in testing achievement on CFLT.Entities:
Keywords: Computer attitudes; Computer-based testing; ICT literacy; Item review; Score interchangeability; Testing administration mode effect
Year: 2021 PMID: 34366694 PMCID: PMC8329632 DOI: 10.1007/s10639-021-10584-2
Source DB: PubMed Journal: Educ Inf Technol (Dordr) ISSN: 1360-2357
Box design of the research
| Pre-test | Treatment and Variables | Post-test 1 | Treatment | Post-test 2 | ||
| Testing Group | PPBT version | Administration Mode | CFLT1with no item review option | Implementing the Equivalent Test after two weeks interval | Item Review | CFLT2 with item review option |
| Age | ||||||
| Gender | ||||||
| ICT literacy | ||||||
| Computer Attitude |
Gender and age frequency distribution
| Frequency | Percentage | Frequency | Percentage | Total | |||
|---|---|---|---|---|---|---|---|
| Gender | Male | 32 | 55.17 | Female | 26 | 44.83 | 58/100 |
| Age | Younger | 22 | 73.33 | Younger | 8 | 26.67 | 30/100 |
| Older | 10 | 35.71 | Older | 18 | 64.29 | 28/100 |
Testing Sphericity Assumption
| Measure: Reading Comprehension Performance | |||||||
|---|---|---|---|---|---|---|---|
| Within-Subjects Effect | Mauchly's W | Approx. Chi-Square | df | Sig | Epsilon | ||
| Greenhouse–Geisser | Huynh–Feldt | Lower-bound | |||||
| Testing Administration Mode | .990 | .564 | 2 | .754 | .990 | 1.000 | .500 |
Distribution scores in PPBT, CFLT1 & CFLT2 (Estimates)
| Measure: Reading Comprehension Performance | ||||
|---|---|---|---|---|
| Testing Administration Mode | Mean | Std. Error | 95% Confidence Interval | |
| Lower Bound | Upper Bound | |||
| PPBT | 41.492 | .618 | 40.255 | 42.728 |
| CFLT1 | 43.268 | .507 | 42.252 | 44.284 |
| CFLT2 | 40.404 | .611 | 39.182 | 41.627 |
RM ANOVA for finding an overall significant difference (Tests of Within-Subjects Effects)
| Measure: Reading Comprehension Performance | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Source | Type III Sum of Squares | df | Mean Square | F | Sig | Partial Eta Squared | Noncent. Parameter | Observed Power | |
| Testing Administration Mode | Sphericity Assumed | 242.35 | 2 | 121.17 | 6.76 | .002 | .106 | 13.53 | .912 |
| Greenhouse–Geisser | 242.35 | 1.98 | 122.39 | 6.76 | .002 | .106 | 13.40 | .910 | |
| Huynh–Feldt | 242.35 | 2.00 | 121.17 | 6.76 | .002 | .106 | 13.53 | .912 | |
| Lower-bound | 242.35 | 1.00 | 242.35 | 6.76 | .012 | .106 | 6.76 | .725 | |
| Error (Testing Administration Mode) | Sphericity Assumed | 2041.07 | 114 | 17.90 | |||||
| Greenhouse–Geisser | 2041.07 | 112.86 | 18.08 | ||||||
| Huynh–Feldt | 2041.07 | 114.00 | 17.90 | ||||||
| Lower-bound | 2041.07 | 57.00 | 35.80 | ||||||
Post-Hoc test (Pairwise Comparisons of three mean score differences)
| Measure: Reading Comprehension Performance | ||||||
|---|---|---|---|---|---|---|
| (I) Testing Administration Mode | (J) Testing Administration Mode | Mean Difference (I-J) | Std. Error | Sig. | 95% Confidence Interval for Difference | |
| Lower Bound | Upper Bound | |||||
| PPBT | CFLT1 | -1.776 | .752 | .065 | -3.632 | .080 |
| CFLT2 | 1.087 | .820 | .571 | -.936 | 3.111 | |
| CFLT2 | CFLT1 | -2.863* | .783 | .002 | -4.795 | -.932 |
Distribution of male/female test takers’ scores (Group Statistics)
| Gender | N | Mean | Std. Deviation | Std. Error Mean | |
|---|---|---|---|---|---|
| PPBT | male | 32 | 44.75 | 2.68 | .474 |
| female | 26 | 37.48 | 3.34 | .655 | |
| CFLT1 | male | 32 | 43.76 | 3.87 | .684 |
| female | 26 | 42.65 | 3.84 | .753 | |
| CFLT2 | male | 32 | 40.79 | 4.28 | .758 |
| female | 26 | 39.92 | 5.10 | 1.001 |
Paired Samples Test to compare the mean difference between gender subgroups across PPBT & CFLT1
| Paired Differences | t | df | Sig. (2-tailed) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | Std. Deviation | Std. Error Mean | 95% Confidence Interval of the Difference | ||||||
| Lower | Upper | ||||||||
| Pair 1 | Male PPBT vs Male CFLT1 | .862 | 4.6 | .813 | -.796 | 2.521 | 1.061 | 31 | .297 |
| Pair 2 | Female PPBT vs Female CFLT1 | -5.158 | 5.249 | 1.029 | -7.279 | -3.038 | -5.011 | 25 | .000 |
Distribution of younger/older test takers’ scores (Group Statistics)
| Age | N | Mean | Std. Deviation | Std. Error Mean | |
|---|---|---|---|---|---|
| PPBT | younger | 40 | 40.51 | 4.87 | .77 |
| older | 18 | 43.65 | 3.54 | .83 | |
| CFLT1 | younger | 40 | 42.62 | 3.92 | .62 |
| older | 18 | 44.69 | 3.40 | .8 | |
| CFLT2 | younger | 40 | 40.46 | 4.55 | .72 |
| older | 18 | 40.27 | 4.97 | 1.17 |
Testing Normality Assumption
| One-Sample Kolmogorov–Smirnov Test | CAS Total Scale Score | TOEFL Familiarity Total Scale Score | |
|---|---|---|---|
| N | 58 | 58 | |
| Normal Parameters | Mean | 92.0862 | 66.2069 |
| Std. Deviation | 10.98370 | 7.98630 | |
| Most Extreme Differences | Absolute | .133 | .084 |
| Positive | .103 | .084 | |
| Negative | -.133 | -.074 | |
| Kolmogorov–Smirnov Z | 1.013 | .637 | |
| Asymp. Sig. (2-tailed) | .256 | .812 | |
Paired-Samples Test to compare the mean difference of age subgroups across PPBT & CFLT1
| Paired Differences | t | df | Sig. (2-tailed) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | Std. Deviation | Std. Error Mean | 95% Confidence Interval of the Difference | ||||||
| Lower | Upper | ||||||||
| Pair 1 | Younger PPBT vs Younger CFLT1 | -2.12 | 6.1 | .96 | -4.07 | -.17 | -2.2 | 39 | .034 |
| Pair 2 | Older PPBT vs OlderCFLT1 | -18.41 | 22.15 | 5.22 | -29.43 | -7.39 | -3.5 | 17 | .003 |
Fig. 1Normality Assumption Graphs
Fig. 2Homoscedasticity Assumption Graph
Pearson Correlation Coefficients between variable pairs
| CFLT 1 | CAS Total Scale Score | TOEFL Familiarity Total Scale Score |
|---|---|---|
| Pearson Correlation | .319 | .150 |
| Sig. (1-tailed) | .007 | .130 |
| N | 58 | 58 |
Fig. 3Linearity Assumption Graph
ANOVA output, Sig. for Deviation from Linearity (ANOVA Table)
| Sum of Squares | df | Mean Square | F | Sig | |||
|---|---|---|---|---|---|---|---|
CFLT 1 CAS Total Scale Score | Between Groups | (Combined) | 397.10 | 28 | 14.18 | .932 | .573 |
| Linearity | 85.13 | 1 | 85.13 | 5.596 | .025 | ||
| Deviation from Linearity | 311.97 | 27 | 11.55 | .760 | .762 | ||
| Within Groups | 441.16 | 29 | 15.21 | ||||
| Total | 838.27 | 57 | |||||
| Sum of Squares | df | Mean Square | F | Sig | |||
CFLT 1 TOEFL Familiarity Total Scale Score (LS) | Between Groups | (Combined) | 325.80 | 25 | 13.03 | .814 | .699 |
| Linearity | 18.87 | 1 | 18.87 | 1.178 | .286 | ||
| Deviation from Linearity | 306.93 | 24 | 12.78 | .799 | .713 | ||
| Within Groups | 512.46 | 32 | 16.01 | ||||
| Total | 838.27 | 57 | |||||
The Coefficients table presenting which variable is significant (Coefficients)
| Modela | Unstandardized Coefficients | Standardized Coefficients | t | Sig | Collinearity Statistics | ||
|---|---|---|---|---|---|---|---|
| B | Std. Error | Beta | Tolerance | VIF | |||
| (Constant) | 29.20 | 5.543 | 5.269 | .000 | |||
| CAS Total Scale Score | .108 | .044 | .309 | 2.433 | .018 | .994 | 1.006 |
| TOEFL Familiarity Total Scale Score (LS) | .061 | .061 | .127 | .997 | .323 | .994 | 1.006 |
a Dependent Variable: CFLT1
A Model Summary table presenting R and R2 values for CAS & LS (Model Summary)
| Modelb | R | R Square | Adjusted R Square | Std. The error of the Estimate | Durbin-Watson |
|---|---|---|---|---|---|
| 1 | .343a | .118 | .085 | 3.66746 | 2.084 |
a Predictors: (Constant), TOEFL Familiarity Total Scale Score, CAS Total Scale Score
b Dependent Variable: CFLT1
Fig. 4Heteroscedasticity Assumption Graph
The ANOVA table predicting RV (ANOVA)
| Modela | Sum of Squares | df | Mean Square | F | Sig |
|---|---|---|---|---|---|
| Regression | 98.509 | 2 | 49.255 | 3.662 | .032b |
| Residual | 739.766 | 55 | 13.450 | ||
| Total | 838.276 | 57 |
a Dependent Variable: CFLT1
b Predictors: (Constant), TOEFL Familiarity Total Scale Score, CAS Total Scale Score
Model Summary table presenting R and R2 values for CAS (Model Summary)
| Modelb | R | R Square | Adjusted R Square | Std. The error of the Estimate | Durbin-Watson |
|---|---|---|---|---|---|
| 1 | .319a | .102 | .086 | 3.66728 | 1.932 |
a Predictors: (Constant), CAS Total Scale Score
b Dependent Variable: CFLT1
The ANOVA table predicting RV by CAS (ANOVA)
| Modela | Sum of Squares | df | Mean Square | F | Sig |
|---|---|---|---|---|---|
| Regression | 85.135 | 1 | 85.135 | 6.330 | .015b |
| Residual | 753.141 | 56 | 13.449 | ||
| Total | 838.276 | 57 |
a Dependent Variable: CFLT1
b Predictors: (Constant), CAS Total Scale Score
The Coefficients table presenting if the predictor (PV) is significant (Coefficients)
| Modela | Unstandardized Coefficients | Standardized Coefficients Beta | t | Sig | Collinearity Statistics | ||
|---|---|---|---|---|---|---|---|
| B | Std. Error | Tolerance | VIF | ||||
| (Constant) | 32.926 | 4.101 | 8.029 | 0 | |||
| CAS Total Scale Score | 0.111 | 0.044 | 0.319 | 2.516 | 0.015 | 1.000 | 1.000 |
a Dependent Variable: CFLT1
Fig. 5Normality Assumption Graph