Victoria Leong, Kausar Raheel, Jia Yi Sim, Kriti Kacker, Vasilis M Karlaftis, Chrysoula Vassiliu, Kastoori Kalaivanan, S H Annabel Chen, Trevor W Robbins, Barbara J Sahakian, Zoe Kourtzi.
Abstract
BACKGROUND: The global COVID-19 pandemic has triggered a fundamental reexamination of how human psychological research can be conducted safely and robustly in a new era of digital working and physical distancing. Online web-based testing has risen to the forefront as a promising solution for the rapid mass collection of cognitive data without requiring human contact. However, a long-standing debate exists over the data quality and validity of web-based studies. This study examines the opportunities and challenges afforded by the societal shift toward web-based testing and highlights an urgent need to establish a standard data quality assurance framework for online studies.
Keywords: COVID-19; executive functions; learning; neurocognitive assessment; web-based testing
Year: 2022 PMID: 34989691 PMCID: PMC8778570 DOI: 10.2196/28368
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Examples of data exclusion statistics reported for lab-based and web-based cognitive studies.
| Study type and citation | Task(s) | Data excluded |
|---|---|---|
| Lab-based studies | | |
| Kim et al [ | Lab, psycholinguistic task | 5/42 (11.9%) participants excluded for high error rates or falling outside the target demographic. Reaction time outlier removal=0.75% of total data |
| Von Gunten et al [ | Lab, inhibition tasks (antisaccade, go/no go, and Stop Signal) | 37/463 (7.99%) participants excluded |
| Backx et al [ | Lab, CANTAB^a tasks^b | No exclusions, no distractions reported |
| Hicks et al [ | Lab (experiments 1 and 3), working memory tasks | Experiment 1: 0/58 (0%) participants excluded, although 10% of participants reported cheating; experiment 3: 10/112 (8.9%) participants excluded due to excessive missing data |
| Ruiz et al [ | Lab, working memory^c, nondeclarative/declarative memory tasks | (a) OSpan^d, 0% excluded; (b) MLAT5^e, 0% excluded; (c) CVMT^f, 1/50 (2%) participants excluded |
| Baniqued et al [ | Cognitive video training | 27/219 (12.3%) participants excluded or withdrew |
| Web-based studies | | |
| Kim et al [ | Online, psycholinguistic task | 3/39 (7.7%) participants excluded for high error rates or falling outside the target demographic. Reaction time outlier removal=0.75% of total data |
| Eisenberg et al [ | Online (using Amazon Mechanical Turk), inhibition tasks (go/no go, Stop Signal) | 102/662 (15.4%) participants excluded for noncompletion of task battery; 38/560 (6.8%) participants further excluded for failing 4 or more tasks |
| Backx et al [ | Online, CANTAB tasks | 2/18 (11.1%) participants excluded, high SWM^g errors; |
| Hicks et al [ | Online (experiments 2 and 4), working memory tasks | Experiment 2: 12/100 (12%) participants excluded for failure to complete test battery within 24 hours; experiment 4: 28/112 (25%) participants excluded due to noncompletion of task battery |
| Ruiz et al [ | Online, working memory, nondeclarative/declarative memory tasks | (a) OSpan, 7/50 (14%) participants excluded; (b) MLAT5, 8/50 (16%) participants excluded; (c) CVMT, 10/50 (20%) participants excluded |
| Buitenweg et al [ | Cognitive flexibility training | 91/249 (36.5%) participants excluded for not meeting criteria (n=11) or withdrawing from the study (n=80) |

^a CANTAB: Cambridge Neuropsychological Test Automated Battery.
^b CANTAB tasks include SWM, PAL^h, ERT^i, OTS^j, PRM-I^k, RVP^l, and PRM-D^m.
^c Memory tasks include OSpan, MLAT, and CVMT.
^d OSpan: automated operation span task.
^e MLAT: modern language aptitude test.
^f CVMT: continuous visual memory task.
^g SWM: spatial working memory.
^h PAL: paired associates learning.
^i ERT: emotion recognition task.
^j OTS: One Touch Stockings of Cambridge.
^k PRM-I: pattern recognition memory-immediate.
^l RVP: rapid visual processing.
^m PRM-D: pattern recognition memory-delayed.
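
The exclusion percentages in the table above are simple ratios of excluded to enrolled participants, so they can be rechecked directly from the reported counts. The following is a minimal sketch of such a check; the function name `exclusion_rate` and the dictionary layout are illustrative assumptions, not part of the original study, and only a few rows whose counts appear in the table are used.

```python
def exclusion_rate(excluded: int, total: int, ndigits: int = 1) -> float:
    """Return the percentage of participants excluded, rounded to ndigits."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= excluded <= total:
        raise ValueError("excluded must lie between 0 and total")
    return round(100 * excluded / total, ndigits)


# (excluded, total, reported %) copied from the table; labels are illustrative.
reported = {
    "Eisenberg et al, noncompletion": (102, 662, 15.4),
    "Backx et al, online": (2, 18, 11.1),
    "Baniqued et al": (27, 219, 12.3),
    "Buitenweg et al": (91, 249, 36.5),
}

for label, (excluded, total, pct) in reported.items():
    computed = exclusion_rate(excluded, total)
    # Allow for rounding in the reported figure.
    status = "ok" if abs(computed - pct) < 0.051 else "check"
    print(f"{label}: {excluded}/{total} = {computed}% (reported {pct}%) {status}")
```

A check of this kind is useful mainly for catching transcription slips when exclusion statistics are compiled from several papers with different denominators.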
Summary of participant demographics by testing modality.
| Demographic variable | F2F^a (n=41) | RGT^b (n=44) | Total (N=85) |
|---|---|---|---|
| Age (years) | | | |
| Mean (SD) | 21.54 (2.26) | 22.14 (2.05) | 21.85 (2.16) |
| Range | 18.11-29.22 | 18.51-26.83 | 18.11-29.22 |
| Sex, n (%) | | | |
| Female | 29 (70.7) | 33 (75) | 62 (72.9) |
| Male | 12 (29.3) | 11 (25) | 23 (27.1) |
| Ethnicity, n (%) | | | |
| Chinese | 34 (82.9) | 36 (81.8) | 70 (82.4) |
| Malay | 4 (9.8) | 6 (13.6) | 10 (11.8) |
| Indian | 2 (4.9) | 2 (4.5) | 4 (4.7) |
| Not reported | 1 (2.4) | 0 (0) | 1 (1.2) |
| | | | |
| Lower | 13 (31.7) | 16 (36.4) | 29 (36.3) |
| Higher | 24 (58.5) | 27 (61.4) | 51 (63.7) |
| Not reported | 4 (9.8) | 1 (2.3) | 5 (5.9) |
| Education, n (%) | | | |
| Secondary School | 27 (65.9) | 23 (52.3) | 50 (58.8) |
| Bachelor’s Degree | 12 (29.3) | 16 (36.4) | 28 (32.9) |
| Not reported | 2 (4.9) | 5 (11.4) | 7 (8.2) |
| Handedness, n (%) | | | |
| Right-handed | 38 (92.7) | 42 (95.5) | 80 (94.1) |
| Left-handed | 2 (4.9) | 2 (4.5) | 4 (4.7) |
| Not reported | 1 (2.4) | 0 (0) | 1 (1.2) |

^a F2F: face-to-face.
^b RGT: remote guided testing.
Figure 1. Overview of remote guided and face-to-face testing processes.
Summary of experimental tasks administered and respective delivery platforms.
| Domains and tasks | i-ABC | CANTAB^a | Inquisit | Verbal |
|---|---|---|---|---|
| Cognitive flexibility | | | | |
| WCST^b | ✓ | —^c | — | — |
| PR^d | ✓ | — | — | — |
| TMT^e | — | — | ✓ | — |
| IED^f | — | ✓ | — | — |
| Working memory | | | | |
| SWM^g | — | ✓ | — | — |
| WAIS-IV BDS^h | — | — | — | ✓ |
| Inhibition | | | | |
| Stroop Task (Stroop) | — | — | ✓ | — |
| SST^i | — | — | ✓ | — |
| Learning | | | | |
| SL^j | ✓ | — | — | — |
| IQ^k | | | | |
| WASI-II^l vocabulary (vocab) | — | — | — | ✓ |

^a CANTAB: Cambridge Neuropsychological Test Automated Battery.
^b WCST: Wisconsin Card Sort Test.
^c A dash indicates that the task was not administered via that delivery platform.
^d PR: probabilistic learning and reversal.
^e TMT: trail making task.
^f IED: intra-extra dimensional set shift.
^g SWM: spatial working memory.
^h WAIS-IV BDS: Wechsler Adult Intelligence Scale–Fourth Edition Backwards Digit Span.
^i SST: Stop Signal Task.
^j SL: structure learning.
^k IQ: intelligence quotient.
^l WASI-II: Wechsler Abbreviated Scale of Intelligence–Second Edition.
Figure 2. Hardware specifications for remote guided participants (total n=44), including computer (a) brand; (b) operating system; (c) screen size (in inches); (d) screen resolution (in pixels); (e) processor; and (f) RAM (in GB).
Figure 3. Web capability for remote guided participants (total n=44), including (a) internet download/upload speed (higher=better); and (b) internet latency (shorter=better).
Summary of hardware and web capability specifications for remote guided participants, compared to the standard testing equipment used for the face-to-face group.
| Hardware specifications | RGT^a, n (%) or mean (SD) | F2F^b standard |
|---|---|---|
| Computer brand | | |
| Acer | 6 (13.6%) | HP ProBook |
| Apple | 14 (31.8%) | —^c |
| Asus | 9 (20.5%) | — |
| Dell | 3 (6.8%) | — |
| HP | 7 (15.9%) | — |
| Lenovo | 5 (11.4%) | — |
| Operating system | | |
| Windows | 30 (68.2%) | Windows 10 |
| Mac OS | 14 (31.8%) | — |
| Processor | | |
| Intel Core i3 | 2 (4.5%) | Intel Core i7 2/2.4 GHz |
| Intel Core i5 | 21 (47.7%) | — |
| Intel Core i6 | 1 (2.3%) | — |
| Intel Core i7 | 17 (38.6%) | — |
| Intel Core i8 | 1 (2.3%) | — |
| Intel Core i9 | 1 (2.3%) | — |
| Other | 1 (2.3%) | — |
| RAM (GB) | 9.73 (4.35) | 8.0 |
| Total hard disk space (GB) | 417 (229) | 500 HDD (+256 SSD) |
| Free hard disk space (GB) | 270 (223) | 108 |
| Screen size (inches) | 13.8 (1.74) | 13.3 |
| Screen resolution (pixels) | | |
| 1280 x 800 | 1 (2.3%) | 1920 x 1080 |
| 1366 x 768 | 6 (13.6%) | — |
| 1440 x 900 | 2 (4.6%) | — |
| 1920 x 1080 | 19 (43.2%) | — |
| 1920 x 1280 | 1 (2.3%) | — |
| 2560 x 1600 | 10 (22.7%) | — |
| 3200 x 1800 | 3 (6.8%) | — |
| Unspecified | 2 (4.6%) | — |
| Peripherals | | |
| Mouse (wireless) | 27 (61.4%) | Wired mouse |
| Mouse (wired) | 15 (34.1%) | — |
| Mouse (integrated) | 2 (4.6%) | — |
| Keyboard (wireless) | 2 (4.6%) | Integrated keyboard |
| Keyboard (integrated) | 42 (95.5%) | N/A^d |
| Webcam (integrated) | 43 (97.7%) | Integrated webcam |
| Webcam (separate) | 1 (2.3%) | N/A |
| Microphone (integrated) | 35 (79.6%) | Integrated microphone |
| Microphone (separate) | 9 (20.5%) | N/A |
| Web capability | | |
| Download speed (Mb/s) | 77.9 (88.6) | 44.6 |
| Upload speed (Mb/s) | 70.4 (96.1) | 48.1 |
| Internet latency (ms) | 10.6 (12.3) | 5 |
| Browser | | |
| Google Chrome | 38 (86.4%) | Google Chrome |
| Mozilla Firefox | 1 (2.3%) | N/A |
| Safari | 5 (11.4%) | N/A |

^a RGT: remote guided testing.
^b F2F: face-to-face.
^c We used one set of standard equipment for testing the F2F participants; hence, only one value is reported for each subheading under the F2F column.
^d N/A: not applicable.
Summary of data quality indices for all tasks.
| Delivery platform and task | (1) Missed trials (%), mean (SD): F2F^a | (1) Missed trials (%), mean (SD): RGT^b | (2) Data exclusion, trial level (%), mean (SD): F2F | (2) Data exclusion, trial level (%), mean (SD): RGT | (2) Data exclusion, task level (N), tech/perf: F2F | (2) Data exclusion, task level (N), tech/perf: RGT | (3) Reaction time (sec), mean (SD): F2F | (3) Reaction time (sec), mean (SD): RGT |
|---|---|---|---|---|---|---|---|---|
| i-ABC | | | | | | | | |
| Wisconsin Card Sort Test (WCST) | 0.73 (1.3) | 1.02 (1.9) | 3.50 (3.2) | 4.92 (5.4) | 0/0 | 0/ | 1.33 (0.18) | 1.39 (0.22) |
| Probabilistic learning and reversal (PR) | 0.30 (0.6) | 0.74 (1.5) | 3.06 (3.1) | 5.80 (5.9) | 0/1 | 1/1 | 0.90 (0.16) | 1.01 (0.21) |
| Structure learning (SL) | 3.41 (2.6) | 3.27 (2.6) | 0.99 (0.7) | 1.72 (3.5) | 0/0 | 1/1 | 1.07 (0.15) | 1.04 (0.16) |
| Inquisit | | | | | | | | |
| Color-Word Stroop | N/A^c | N/A | 3.38 (4.5) | 3.28 (4.7) | 0/0 | 1/1 | 0.84 (0.13) | 0.87 (0.14) |
| Stop Signal Task (SST) | 0.98 (1.8) | 1.59 (3.3) | 1.41 (2.4) | 1.14 (2.7) | 0/1 | 1/1 | 0.47 (0.08) | 0.42 (0.09) |
| Trails A and B | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0/0 | 2/0 | 40.9 (10.7) | 40.4 (10.2) |
| CANTAB^d | | | | | | | | |
| Intra/extra-dimensional set shift (IED) | N/A | N/A | N/A | N/A | 0/0 | 0/0 | N/A | N/A |
| Spatial working memory (SWM) | N/A | N/A | N/A | N/A | 0/0 | 0/0 | N/A | N/A |
| Verbal | | | | | | | | |
| Backwards Digit Span | N/A | N/A | N/A | N/A | 0/0 | 0/0 | N/A | N/A |
| WASI^e vocabulary | N/A | N/A | N/A | N/A | 0/0 | 0/0 | N/A | N/A |

^a F2F: face-to-face.
^b RGT: remote guided testing.
^c N/A: not applicable.
^d CANTAB: Cambridge Neuropsychological Test Automated Battery.
^e WASI: Wechsler Abbreviated Scale of Intelligence.
Figure 4. Plot of performance indices for (a) i-ABC; (b) Inquisit; (c) CANTAB; and (d) verbally delivered tasks. Face-to-face participants are shown in dark grey bars; remote guided participants are shown in light grey bars. Error bars indicate the standard error of the mean. ***P<.001.
Summary of task performance indices.
| Delivery platform, task, and performance index | F2F^b, mean (SD) | RGT^c, mean (SD) | GLM^a modality effects |
|---|---|---|---|
| i-ABC | | | |
| WCST: nonperseverative errors | 10.1 (5.5) | 10.2 (6.8) | |
| WCST: perseverative errors | 9.3 (2.6) | 10.0 (4.1) | |
| PR: perseveration | 3.1 (1.7) | 3.7 (2.8) | |
| PR: switching probability | 6.6 (2.5) | 6.7 (2.6) | |
| SL: PI mean | 0.15 (0.22) | 0.06 (0.18) | |
| SL: PI change | 0.18 (0.30) | 0.16 (0.31) | |
| Inquisit | | | |
| Stroop: interference (reaction time) | 0.23 (0.10) | 0.22 (0.11) | |
| Stroop: interference (accuracy) | –0.08 (0.07) | –0.10 (0.09) | |
| SST: Stop Signal reaction time | 0.24 (0.19) | 0.28 (0.27) | |
| TMT: Trails B/A time ratio | 1.26 (0.43) | 1.17 (0.31) | |
| CANTAB^d | | | |
| IED: extra dimensional shift errors | 5.6 (7.2) | 4.7 (5.2) | |
| IED: pre-extra dimensional shift errors | 7.3 (2.6) | 9.6 (7.7) | |
| SWM: between errors | 25.3 (16.7) | 32.3 (21.3) | |
| SWM: strategy | 13.3 (4.6) | 14.4 (4.1) | |
| Verbal | | | |
| WASI^e vocabulary (standardized score) | 50.1 (7.2) | 56.0 (7.6) | |
| Backwards Digit Span (total score) | 8.7 (3.1) | 9.7 (3.2) | |

^a GLM: general linear model.
^b F2F: face-to-face.
^c RGT: remote guided testing.
^d CANTAB: Cambridge Neuropsychological Test Automated Battery.
^e WASI: Wechsler Abbreviated Scale of Intelligence.
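
For readers who want a rough sense of the size of a group difference from the summary statistics alone, a standardized mean difference (Cohen's d with a pooled SD) can be computed from the means, SDs, and group sizes. The sketch below is an illustration only and is not the GLM analysis referred to in the table; it assumes the full group sizes (F2F n=41, RGT n=44) apply to the WASI vocabulary score, which holds only if no participants were excluded for that task, and the function name `cohens_d` is hypothetical.

```python
import math


def cohens_d(mean1: float, sd1: float, n1: int,
             mean2: float, sd2: float, n2: int) -> float:
    """Standardized mean difference (group 2 minus group 1) using the pooled SD."""
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least 2 observations")
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    return (mean2 - mean1) / math.sqrt(pooled_var)


# WASI vocabulary (standardized score), values taken from the table.
# Group sizes are an assumption: full F2F and RGT samples.
d = cohens_d(50.1, 7.2, 41, 56.0, 7.6, 44)
print(f"Cohen's d (RGT minus F2F): {d:.2f}")
```

Because this ignores any covariates a GLM would include, it should be read as a descriptive effect size rather than a test of the modality effect.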
Figure 5. Summary of considerations for the suitability of unsupervised web testing, supervised web testing, and in-person methodologies for cognitive testing. RT: reaction time.