Literature DB >> 32322612

Self-reported data for mental workload modelling in human-computer interaction and third-level education.

Lucas Rizzo1, Luca Longo1.   

Abstract

Mental workload (MWL) is an imprecise construct, with distinct definitions and no predominant measurement technique. It can be intuitively seen as the amount of mental activity devoted to a certain task over time. Several approaches have been proposed in the literature for the modelling and assessment of MWL. In this paper, data related to two sets of tasks performed by participants under different conditions is reported. This data was gathered from different sets of questionnaires answered by these participants. These questionnaires were aimed at assessing the features believed by domain experts to influence overall mental workload. In total, 872 records are reported, each representing the answers given by a user after performing a task. On the one hand, collected data might support machine learning researchers interested in using predictive analytics for the assessment of mental workload. On the other hand, data, if exploited by a set of rules/arguments (as in [3]), may serve as knowledge-bases for researchers in the field of knowledge-based systems and automated reasoning. Lastly, data might serve as a source of information for mental workload designers interested in investigating the features reported here for mental workload modelling. This article was co-submitted from a research journal "An empirical evaluation of the inferential capacity of defeasible argumentation, non-monotonic fuzzy reasoning and expert systems" [3]. The reader is referred to it for the interpretation of the data.
© 2020 The Author(s).

Entities:  

Keywords:  Argumentation theory; Automated reasoning; Expert systems; Fuzzy reasoning; Knowledge-based systems; Mental workload

Year:  2020        PMID: 32322612      PMCID: PMC7160424          DOI: 10.1016/j.dib.2020.105433

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications table

Value of the data

These datasets provide the answers related to a set of questionnaires proposed in the literature of MWL (Tables 2, 3 and 5) aimed at assessing the mental workload imposed on participants by a set of designed tasks (Tables 1 and 7). In total 872 records are reported, each representing the answers given by a user after performing a task.
Table 2

The questionnaire of the Nasa Task Load Index [1].

FeatureQuestion
Mental demandHow much mental and perceptual activity was required (e.g. thinking, deciding, calculating, remembering, looking, searching, etc.)? Was the task easy or demanding, simple or complex, exacting or forgiving?
Physical demandHow much physical activity was required (e.g. pushing, pulling, turning, controlling, activating, etc.)? Was the task easy or demanding, slow or brisk, slack or strenuous, restful or laborious?
Temporal demandHow much time pressure did you feel due to the rate or pace at which the tasks or task elements occurred? Was the pace slow and leisurely or rapid and frantic?
EffortHow hard did you have to work (mentally and physically) to accomplish your level of performance?
PerformanceHow successful do you think you were in accomplishing the goals, of the task set by the experimenter (or yourself)? How satisfied were you with your performance in accomplishing these goals?
FrustrationHow insecure, discouraged, irritated, stressed and annoyed versus secure, gratified, content, relaxed and complacent did you feel during the task?
Table 3

The questionnaire of the pairwise comparison procedure of the Nasa Task Load Index [1].

PairFactor 1Factor 2
1Temporal demand⬜ OR ⬜Frustration
2Performance⬜ OR ⬜Mental
3Mental demand⬜ OR ⬜Physical demand
4Frustration⬜ OR ⬜Performance
5Temporal demand⬜ OR ⬜Effort
6Physical demand⬜ OR ⬜Frustration
7Performance⬜ OR ⬜Temporal demand
8Mental demand⬜ OR ⬜Effort
9Physical demand⬜ OR ⬜Temporal demand
10Frustration⬜ OR ⬜Effort
11Physical demand⬜ OR ⬜Performance
12Temporal demand⬜ OR ⬜Mental demand
13Effort⬜ OR ⬜Physical demand
14Frustration⬜ OR ⬜Performance
15Performance⬜ OR ⬜Mental demand
Table 5

Features and experimental study questionnaire designed by Longo [2].

Feature [Source]Question
Mental demand [1]How much mental and perceptual activity was required (e.g., thinking, deciding, calculating, remembering, looking, searching, etc.)? Was the task easy (low mental demand) or complex (high mental demand)?
Temporal demand [1]How much time pressure did you feel due to the rate or pace at which the tasks or task elements occurred? Was the pace slow and leisurely (low temporal demand) or rapid and frantic (high temporal demand)?
Effort [1]How much conscious mental effort or concentration was required? Was the task almost automatic (low effort) or it required total attention (high effort)?
Performance [1]How successful do you think you were in accomplishing the goal of the task? How satisfied were you with your performance in accomplishing the goal?
Frustration [1]How secure, gratified, content, relaxed and complacent (low psychological stress) versus insecure, discouraged, irritated, stressed and annoyed (high psychological stress) did you feel during the task?
Physical demand [1]How much physical activity was required (e.g. pushing, pulling, turning, controlling, activating, etc.)? Was the task easy or demanding, slow or brisk, slack or strenuous, restful or laborious?
Solving and deciding [4]How much attention was required for activities like remembering, problem-solving, decision-making and perceiving (e.g. detecting, recognizing and identifying objects)?
Selection of response [4]How much attention was required for selecting the proper response channel and its execution? (manual - keyboard/mouse, or speech - voice)
Task and space [4]How much attention was required for spatial processing (spatially pay attention around you)?
Verbal material [4]How much attention was required for verbal material (eg. reading or processing linguistic material or listening to verbal conversations)?
Visual resources [4]How much attention was required for executing the task based on the information visually received (through eyes)?
Auditory resources [4]How much attention was required for executing the task based on the information auditorily received (ears)?
Manual Response [4]How much attention was required for manually respond to the task (e.g. keyboard/mouse usage)?
Speech response [4]How much attention was required for producing the speech response (e.g. engaging in a conversation or talk or answering questions)?
Context bias [2]How often interruptions on the task occurred? Were distractions (mobile, questions, noise, etc.) not important (low context bias) or did they influence your task (high context bias)?
Past knowledge [2]How much experience do you have in performing the task or similar tasks on the same website?
Skill [2]Did your skills have no influence (low) or did they help to execute the task (high)?
Motivation [2]Were you motivated to complete the task?
Parallelism [2]Did you perform just this task (low parallelism) or were you doing other parallel tasks (high parallelism) (e.g. multiple tabs / windows / programs)?
Arousal [2]Were you aroused during the task? Were you sleepy, tired (low arousal) or fully awake and activated (high arousal)?
Task difficult [2]1 / 8 * [(solving and deciding) + (auditory resources) + (manual response) + (speech response) + (selection of response) + (task and space) + (verbal material) + (visual resources)]
Table 1

List of tasks given in the form of third-level classes to students at Technological University Dublin, School of Computer Science, Dublin, Ireland. Four topics of the module ‘Research Methods’ in the Master of Science were delivered by three distinct approaches.

Topics1. Science
2. Scientific method
3. Research planning
4. Literature review

Instructional condition1. Traditional direct instruction, using slides projected to a white board;
2. Multimedia video of content. Transformation of the content of the slides of 1 into a  multimedia video projected to a white board;
3. Constructivist collaborative activity added to 2.
Table 7

List of seeking web-based tasks of varying difficulty and demand. These were first designed in [2].

TaskDescriptionTask conditionWeb-site
1Find out how many people live in SidneySimple searchWikipedia
2Read simple.wikipedia.org/wiki/GrammarNo goals, no time pressureWikipedia
3Find out the difference (in years) between the year of the foundation of the Apple Computer Inc. and the year of the 14th FIFA world cupDual-task and mental arithmetical calculationsGoogle
4Find out the difference (in years) between the foundation of the Microsoft Corp. and the year of the 23rd Olympic gamesDual-task and mental arithmetical calculationsGoogle
5Find out the year of birth of the 1st wife of the founder of playboySingle task + time pressure (2-min limit). Each 30 secs user is warned of time leftGoogle
6Find out the name of the man (interpreted by Johnny Deep) in the video www.youtube.com/watch?v=FfTPS-TFQ_cConstant demand on visual and auditory modalities. Participant can replay the video if requiredYouTube
7a) Play the song www.youtube.com/watch?v=Rb5G1eRIj6c While listening to it, b) ind out the result of the polynomial equation p(x), with x = 7 contained in the wikipedia article http://it.wikipedia.org/wiki/PolinomiDemand on visual modality and inference on auditory modality. The song is extremely irritatingWikipedia
8Find out how many times Stewie jumps in the video www.youtube.com/watch?v=TSe9gbdkQ8sDemand on visual resource + external interference: user is distracted twice and can replay videoYouTube
9Find out the age of the blue fish in the video www.youtube.com/watch?v=H4BNbHBcnDIDemand on visual and auditory modality, plus time-pressure: 150 s limit. User can replay the video. There is no answer.YouTube
These are important to the field of human mental workload and knowledge representation and reasoning. They may instantiate knowledge-bases created by researches in the field of knowledge-based systems and automated reasoning. It might also support machine learning researchers interested in using predictive analytics for the assessment of mental workload. Lastly, they may serve as a source of information for mental workload designers interested in investigating the features reported here for mental workload modelling. They may serve as baselines for comparison against newly developed models of similar purpose. Reported values by three state-of-the-art measurement techniques of mental workload are also displayed: the NASA – Task Load Index [1], the Workload Profile [4] and the Raw Task Load Index [5]. A self-report indicating the participants perceived mental workload is also provided. This can be applied for triangulation purposes when developing new methods of mental workload assessment.

Data

In this section three datasets in the field of human mental workload are described. These datasets are built upon subjective measures of mental workload. In other words, they rely on the subjective feedback - in this case questionnaires - provided by humans engaging with an underlying task. These questionnaires (Tables 2, 3 and 5) are proposed in the literature of MWL [1,2,4]. For each dataset these are defined: A set of tasks performed (Tables 1 or 7). A set of questionnaires proposed in the literature of MWL aimed at measuring the mental workload imposed by tasks in 1. A dataset in .csv format containing one row per answers of questions in 2 given by each participant who performed one task listed in 1. In [3], these datasets are also employed for the construction of fuzzy rule-based systems and argument-based systems. Such systems are built upon a set of rules/arguments able to infer a numerical MWL scalar from the data described in this article.

Dataset A

The set of performed tasks in this dataset is listed in Table 1. The set of questionnaires employed for measuring the MWL imposed by these tasks is listed in Table 2 and 3. Finally, the description of collected associated to this dataset is listed in Table 4.
Table 4

Description of columns in attached data table in supplementary attachment A.

Column headerDescription
class_descriptionType of class attended by students of the “Research methods” module (Table 3). Four possible values: literature_review, planning_research, science and the_scientific_method
delivery_methodDelivery method employed during the class (Table 3). Three possible values: pdf, video and video_and_collaborative_activity
nationalityNationality of the participant
AgeAge in years of the participant
DateDate relative to the day the record was collect
mental_workloadSelf-assessment mental workload reported by Fig. 1. Integer between 1 and 20
NASA_TLXThe NASA-TLX score [1]. Real number between 0 and 100.
mental_demandMental demand reported according to question of Table 2. Integer between 1 and 20.
physical_demandPhysical demand reported according to question of Table 2. Integer between 1 and 20.
temporal_demandTemporal demand reported according to question of Table 2. Integer between 1 and 20.
performancePerformance reported according to question of Table 2. Integer between 1 and 20.
frustrationFrustration reported according to question of Table 2. Integer between 1 and 20.
effortEffort reported according to question of Table 2. Integer between 1 and 20.
factor1_vs_factor2 (columns N to AB)Pairwise comparison of the 15 pairs of Table 3. Possible values are 0 (factor 1 was chosen) or 1 (factor 2 was chosen).
List of tasks given in the form of third-level classes to students at Technological University Dublin, School of Computer Science, Dublin, Ireland. Four topics of the module ‘Research Methods’ in the Master of Science were delivered by three distinct approaches. The questionnaire of the Nasa Task Load Index [1]. The questionnaire of the pairwise comparison procedure of the Nasa Task Load Index [1]. Description of columns in attached data table in supplementary attachment A.

Dataset B

Tasks employed for the construction of this dataset are the same ones described in Table 1. Remaining information for the construction of this dataset is listed below. The questionnaire employed for measuring the MWL imposed by these tasks is listed in Table 5, while the description of collected data is listed in Table 6
Table 6

Description of columns in attached data table in supplementary attachment B.

Column headerDescription
class_descriptionFour possible values (Table 3): literature_review, planning_research, science and the_scientific_method
delivery_methodThree possible values (Table 3): pdf, video and video_and_collaborative_activity
DateDate relative to the day the record was collect
nationalityNationality of the participant
AgeAge in years of the participant
mental_workloadSelf-assessment mental workload reported by Fig. 1. Integer between 1 and 20
MentalDemandMental demand reported according to question of Table 5. Integer between 1 and 20.
ParallelismParallelism reported according to question of Table 5. Integer between 1 and 20.
TemporalDemandTemporal demand reported according to question of Table 5. Integer between 1 and 20.
ManualResponseManual response reported according to question of Table 5. Integer between 1 and 20.
VisualResourcesVisual resources reported according to question of Table 5. Integer between 1 and 20.
EffortEffort reported according to question of Table 5. Integer between 1 and 20.
SolvingAndDecidingSolving and deciding reported according to question of Table 5. Integer between 1 and 20.
FrustrationFrustration reported according to question of Table 5. Integer between 1 and 20.
ContextBiasContext bias reported according to question of Table 5. Integer between 1 and 20.
TaskAndSpaceTask and space reported according to question of Table 5. Integer between 1 and 20.
MotivationMotivation reported according to question of Table 5. Integer between 1 and 20.
VerbalMaterialVerbal material reported according to question of Table 5. Integer between 1 and 20.
SkillSkill reported according to question of Table 5. Integer between 1 and 20.
AuditoryResourcesAuditory resources reported according to question of Table 5. Integer between 1 and 20.
PhysicalDemandPhysical demand reported according to question of Table 5. Integer between 1 and 20.
Selection Of ResponseSelection of response reported according to question of Table 5. Integer between 1 and 20.
SpeechResponseSpeech response reported according to question of Table 5. Integer between 1 and 20.
PastKnowledgePast knowledge reported according to question of Table 5. Integer between 1 and 20.
ArousalArousal reported according to question of Table 5. Integer between 1 and 20.
PerformancePerformance reported according to question of Table 5. Integer between 1 and 20.
TaskDifficultTask difficult reported according to question of Table 5. Real number between 1 and 20.
RAW_TLXThe Raw TLX score [5]. Real number between 0 and 100.
WorkloadProfileThe Workload Profile score [4]. Real number between 0 and 100.
Features and experimental study questionnaire designed by Longo [2]. Description of columns in attached data table in supplementary attachment B.

Dataset C

The set of tasks performed by participants in the construction of this dataset is listed in Table 7. Questionnaires answered by participants after performing a task can be seen in Tables 5 and 3. Remaining information for the construction of this dataset is listed below. The description of data collected is listed in Table 8.
Table 8

Description of columns in attached data table in supplementary attachment C.

Column headerDescription
TaskNumberNumber of the task performed by volunteer. List of tasks can be seen in Table 4.
InterfaceVersionInterface of the website in which the task was performed. Two options are possible: original and modified.
CompletionTimeInSecondsTime taken for completion of the task.
GenderCatGender of the volunteer: male or female.
BornYearThe year of birth of the volunteer
LangCatThe main language spoken by the user (however all the users were almost fluent in English). Options are: Chinese, Czech, English, French, German, Italian, Polish, Portuguese, Spanish and other.
ContextBiasContext bias reported according to question of Table 5. Integer between 1 and 100
PastKnowledgePast knowledge reported according to question of Table 5. Integer between 1 and 100
SkillSkill reported according to question of Table 5. Integer between 1 and 100
MotivationMotivation reported according to question of Table 5. Integer between 1 and 100
ParallelismParallelism reported according to question of Table 5. Integer between 1 and 100
ArousalArousal reported according to question of Table 5. Integer between 1 and 100
SolvingAndDecidingContext bias reported according to question of Table 5. Integer between 1 and 100
SelectionOfResponseSolving and deciding reported according to question of Table 5. Integer between 1 and 100
VerbalMaterialVerbal material reported according to question of Table 5. Integer between 1 and 100
VisualResourcesVisual resources reported according to question of Table 5. Integer between 1 and 100
AuditoryResourcesAuditory resources reported according to question of Table 5. Integer between 1 and 100
ManualResponseManual response according to question of Table 5. Integer between 1 and 100
SpeechResponseSpeech response reported according to question of Table 5. Integer between 1 and 100
TaskDifficultTask difficult reported according to question of Table 5. Integer between 1 and 100
MentalDemandMental demand reported according to question of Table 5. Integer between 1 and 100
TemporalDemandTemporal demand reported according to question of Table 5. Integer between 1 and 100
FrustrationFrustration reported according to question of Table 5. Integer between 1 and 100
EffortEffort reported according to question of Table 5. Integer between 1 and 100
PerformancePerformance reported according to question of Table 5. Integer between 1 and 100
PhysicalDemandPhysical demand was considered 0 for all instances of this dataset.
factor1_vs_factor2 (columns AB to AP)Pairwise comparison of the 15 pairs of Table 3. Possible values are 0 (factor 1 was chosen) or 1 (factor 2 was chosen).
NasaTLXRawThe Raw TLX score [5]. Real number between 0 and 100.
NASATLXThe Workload Profile score [4]. Real number between 0 and 100.
WPThe NASA-TLX score [1]. Real number between 0 and 100.
List of seeking web-based tasks of varying difficulty and demand. These were first designed in [2]. Description of columns in attached data table in supplementary attachment C.

Experimental design, materials, and methods

Mental workload (MWL) is an imprecise construct, with distinct definitions and no predominant measurement technique. It can be intuitively seen as the amount of mental activity devoted to a certain task over time. Several approaches have been proposed in the literature [1,2,4,5] for the modelling and assessment of MWL. Data reported here is relative to two sets of tasks (third-level classes in Table 1 and seeking web-based information in Table 7) performed by several participants. Subjects were briefed about the study and they were requested to sign a consent form that included data protection and treatment. Privacy and anonymity of participants were in all respects protected by the authors. The goal was to collect the information asked in Fig. 1 and in Tables 2, 3 and 5. These contained features believed by domain experts to influence overall imposed mental workload by performed tasks. MWL is an undefined psychological construct. Therefore, the goal of the data is to help scholars to understand and develop new measurements techniques of MWL. No participant performed/answered the same task/questionnaire more than once, avoiding ambiguous data. Hence, each record can be employed for a case-by-case analysis of the MWL imposed by the respective task.
Fig. 1

Baseline self-reporting measure of Mental Workload [3].

Baseline self-reporting measure of Mental Workload [3].

Third-level classes at Dublin Institute of Technology

Students attended third level classes in the Technological University Dublin and filled either questionnaires in Tables 2 and 3 or only the questionnaire in Table 7 after each class (Table 1). The set of questionnaires were related to the features perceived by different mental workload designers to influence the imposed MWL by the performed task. In Tables 2 and 3 only features of the NASA-TLX [1] measurement technique were being investigated, while in Table 7 a larger set of features [2] was being considered for MWL modelling and assessment. Therefore, two distinct sets of data were generated and reported in the data tables of supplementary attachments A and B. In total, students were from 24 distinct countries (age 19–74, mean 30.9, std = 7.63). In general, four topics of the module ‘Research Methods’ were delivered in three different forms (Table 1) during the semesters of the academic terms 2015–2018. Some group of students received the first instructional condition (PDF slides presented by lecturer to students), some received the second instructional condition (same content of PDF slides presented through video and no lecturer), and some received the third instructional condition (same as the second instructional condition with a collaborative group activity added at the end). The number of students who attended each class is described in Table 9.
Table 9

Number of students across topics and delivery methods.

TopicDuration (mins)Delivery method
123
Science[18, 62]317019
Scientific method[20, 46]393641
Research planning[10, 68]434541
Literature review[18, 57]414318
Number of students across topics and delivery methods.

Information seeking web-based tasks

Nine information seeking web-based tasks of varying difficulty and demand (Table 7), were performed by participants over three websites: Google, Wikipedia and YouTube. These websites were selected due to their popularity and assumption that participants were familiar with their interfaces. The original interface of each web-site was slightly manipulated to impose different MWL demands on participants interacting with them, leading to 9 tasks on the original websites and 9 tasks on the modified websites (18 in total). 46 volunteers performed all the tasks in a random order on different days, over 2 or 3 sessions of approximately 45/70 min each. Afterwards, the questions of Tables 3 and 7 were answered using a paper-based scale in the range [0..100] ∈ א, partitioned in 3 regions delimited at 33 and 66. 405 valid instances were generated.
SubjectArtificial Intelligence
Specific subject areaKnowledge-based systems.Human mental workload.
Type of dataTable
How data were acquiredSurvey.
Data formatRaw
Parameters for data collectionData was collected according to a set of tasks believed to impose different levels of mental workload on participants.
Description of data collectionOne part of data was collected through a set of surveys applied to students who attended classes at the Technological University Dublin (Table 1). The other part of data was also collected through a set of surveys, but this time answered by volunteers who performed a set of designed web-based tasks (Table 7).
Data source locationCity/Town/Region: DublinCountry: Ireland
Data accessibilityWith the article and as a supplementary attachment.
Related research articleAuthor's name: Lucas Rizzo and Luca LongoTitle: An empirical evaluation of the inferential capacity of defeasible argumentation, non-monotonic fuzzy reasoning and expert systems [3]Journal: Expert Systems with Applications.
  1 in total

1.  Diagnosticity and multidimensional subjective workload ratings.

Authors:  P S Tsang; V L Velazquez
Journal:  Ergonomics       Date:  1996-03       Impact factor: 2.778

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.