| Literature DB >> 29739405 |
Katrin Schüttpelz-Brauns1, Martina Kadmon2, Claudia Kiessling3, Yassin Karay4, Margarita Gestmann5, Juliane E Kämmer6,7.
Abstract
BACKGROUND: Low-stakes tests are becoming increasingly important in international assessments of educational progress, and the validity of these results is essential especially as these results are often used for benchmarking. Test scores in these tests not only mirror students' ability but also depend on their test-taking effort. One way to obtain more valid scores from participating samples is to identify test-takers with low test-taking effort and to exclude them from further analyses. Self-assessment is a convenient and quick way of measuring test-taking effort. We present the newly developed Test-taking Effort Short Scale (TESS), which comprises three items measuring attainment value/intrinsic value, utility value, and perceived benefits, respectively.Entities:
Keywords: Nonconsequential progress testing; Psychometrics; Self-assessment; Short scale; Test-taking effort; Validation study
Mesh:
Year: 2018 PMID: 29739405 PMCID: PMC5941641 DOI: 10.1186/s12909-018-1196-0
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Self-report measures of test-taking effort
| Instrument | No. items | Subscales | What is measured | Psychometrics |
|---|---|---|---|---|
| Effort Thermometer [ | 3 10-point Likert items | No subscales | Individual test effort, anchored against a personal situation in which maximum effort was applied | Not reported |
| Online Motivation Questionnaire (OMQ) [ | 32 4-point Likert items | Mood scale | Test-taking effort in the context of performance assessment (part 1: pre-test, part 2: post-test) | Confirmatory factor analysis revealed that seven factors could be distinguished empirically; proven validity and acceptable reliability |
| Questionnaire of Current Motivation (QCM) [ | 18 7-point Likert items | Situational interest | Current motivation during a learning situation | Sufficient to excellent reliability2 |
| Student Opinion Scale (SOS) | 10 5-point Likert items | Importance | Motivation, administered as a post-test after students have completed achievement tests [ | Proven validity and good to excellent reliability [ |
Item statistics
| Item |
|
|
|
|
|
|---|---|---|---|---|---|
| 1. I want to achieve the best possible results in the test. [German: Ich möchte beim PTM die bestmöglichen Ergebnisse erreichen.] | 3.14 | 1.36 | 0.47 | 0.70 | 0.68 |
| 2. I think the progress test is useful. [German: Ich finde den PTM sinnvoll.] | 3.27 | 1.35 | 0.50 | 0.76 | 0.72 |
| 3. The test is a valuable part of my education. [German: Der PTM ist ein wertvoller Teil meines Studiums.] | 2.48 | 1.25 | 0.31 | 0.75 | 0.73 |
M mean, SD standard deviation, p difficulty, r discriminatory power (part–whole corrected), H Mokken homogeneity coefficient of item with scale
Correlations of single TESS items and the TESS score with external criteria
| Internal criterion | External criterion |
|
|
|
|
|---|---|---|---|---|---|
| Convergent validity | |||||
| TESS item 1 | Short Scale of Intrinsic Motivation [ | 1333 | .52 | <.001 | .27 |
| TESS item 2 | Perceived Usefulness Scale [ | 1377 | .57 | < .001 | .32 |
| TESS item 3 | Cost# | 1380 | −.34 | < .001 | .12 |
| Discriminant validity | |||||
| TESS score | Repetition scale of the LIST [ | 1195 | .06 | <.05 | .00 |
N number of participants, r product-moment correlation, p p-value, r2 effect size, # item “To what extent do you feel that sitting the BPT keeps you from your other duties?” (reverse coded)
TESS scores in extreme groups
| TESS values | ||||
|---|---|---|---|---|
| External criterion | Group |
| ANOVA | |
| BPT test time | Fast performersa | 2.29 (1.06) | 257 | |
| Slow performersb | 3.70 (0.87) | 237 | ||
| BPT test score | Poor performersc | 2.66 (1.25) | 100 | |
| High performersd | 3.50 (1.08) | 110 | ||
N number of participants (the difference to the Ns reported in the text is due to the fact that not all participants belonging to the extreme groups also reported their test time and test score), M mean TESS scores; SD: standard deviation, BPT Berlin Progress Test
aM = 0:41, SD = 0:21 to take the test
bM = 2:40, SD = 0:17 to take the test
ctest score of M = 1.46, SD = 2.87
dtest score of M = 92.46, SD = 30.11
Percentile ranks of TESS scores and corresponding stanine values
| TESS score |
| Percentile rank | Stanine | Interpretation |
|---|---|---|---|---|
| 1 | 157 | 11.4 | 1, 2 | Low TTE |
| 1.33 | 55 | 15.4 | 3 | |
| 1.67 | 67 | 20.3 | 3 | |
| 2 | 95 | 27.2 | 4 | |
| 2.33 | 88 | 33.6 | 4 | |
| 2.67 | 119 | 42.3 | 5 | |
| 3 | 141 | 52.6 | 5 | |
| 3.33 | 157 | 64 | 6 | |
| 3.67 | 136 | 73.9 | 6 | |
| 4 | 144 | 84.4 | 7 | |
| 4.33 | 81 | 90.3 | 8 | High TTE |
| 4.67 | 60 | 94.7 | 8 | High TTE |
| 5 | 73 | 100 | 9 | High TTE |
TTE test-taking effort
Fig. 1Flowchart showing participant numbers separately for computer- and paper-based administration; TESS responders are defined as participants who answered all three TESS items; non-responders are defined as participants who did not answer any TESS items (despite participating in the study)
Comparison of different groups of (non)-responding in TESS
| External criterion | Group |
| ANOVA | Scheffé test | |
|---|---|---|---|---|---|
| (A) BPT test time (hours:minutes) | Non-responders (G1) | 122 | 1:22 (0:53) | F(2) = 85.40; | G2 < G1 < G3 |
| Low TTE (G2) | 100 | 0:58 (0:40) | |||
| High TTE (G3) | 198 | 2:01 (0:34) | |||
| (B) BPT test score | Non-responders (G1) | 71 | 35.77 (45.14) | F(2) = 9.31, p < 0.01, η2 = 0.08 | G1 = G2 and G1 = G3 |
| Low TTE (G2) | 52 | 22.04 (36.08) | |||
| High TTE (G3) | 106 | 49.27 (33.44) |
Comparison of non-responders in TESS (G1) with responders with low test-taking effort (G2) and responders with high test-taking effort (G3), as defined by (A) BPT test scores or (B) BPT test time
N number of participants, M mean, SD Standard deviation, TTE test-taking effort, low vs. high TTE determined by stanine standardization, see Table 5