Literature DB >> 35078276

Using very short answer errors to guide teaching.

Oliver Putt1, Rachel Westacott2, Amir H Sam3, Mark Gurnell4, Celia A Brown1.   

Abstract

BACKGROUND: Student performance in examinations reflects on both teaching and student learning. Very short answer questions require students to provide a self-generated response to a question of between one and five words, which removes the cueing effects of single best answer format examinations while still enabling efficient machine marking. The aim of this study was to pilot a method of analysing student errors in an applied knowledge test consisting of very short answer questions, which would enable identification of common areas that could potentially guide future teaching.
METHODS: We analysed the incorrect answers given by 1417 students from 20 UK medical schools in a formative very short answer question assessment delivered online.
FINDINGS: The analysis identified four predominant types of error: inability to identify the most important abnormal value, over or unnecessary investigation, lack of specificity of radiology requesting and over-reliance on trigger words.
CONCLUSIONS: We provide evidence that an additional benefit to the very short answer question format examination is that analysis of errors is possible. Further assessment is required to determine if altering teaching based on the error analysis can lead to improvements in student performance.
© 2022 The Authors. The Clinical Teacher published by Association for the Study of Medical Education and John Wiley & Sons Ltd.

Entities:  

Mesh:

Year:  2022        PMID: 35078276      PMCID: PMC9303801          DOI: 10.1111/tct.13458

Source DB:  PubMed          Journal:  Clin Teach        ISSN: 1743-4971


BACKGROUND

Single best answer (SBA) examinations are frequently used to assess knowledge in medicine. Studies have shown that some cognitive processes used to answer SBAs are different to those used in real‐life scenarios. , Very short answer (VSA) questions were developed to provide a more thorough test of knowledge as described by Sam et al. VSA examinations provide a clinical vignette with an associated question from which a student must generate an answer consisting of between one and five words. This format can be delivered online with predominantly rapid electronic marking using pre‐specified answers. With manual marking kept to a minimum, the cost and time incurred by more open‐ended questions are reduced, while the benefits of free‐response answers are retained. While feedback on examinations is sought‐after by students , and it has been shown that feedback can improve student performance, , we were only able to identify one PUBMED‐indexed paper that used student examination performance to improve faculty teaching. If clinical teachers want to enhance their teaching, one approach is for them to seek feedback on student performance. Because of the free text answers of VSA questions, we hypothesised that common incorrect answers could highlight areas of misunderstanding, providing an insight into student misconceptions and hence opportunities for improving clinical teaching. Such insights cannot be gained through the analysis of incorrect SBA answers because students must select from a list of pre‐defined answer options, none of which may have been the student's initial response to the question. Our aim was to consider, with examples, if an analysis of common incorrect answers in VSA format examinations was possible and if it could identify common areas of misunderstanding amongst students. Our ultimate hope would be that by informing clinical teachers of the common errors identified, it could help them reflect on and improve their teaching.

METHODS

We undertook a secondary analysis of data from a study comparing VSA questions and SBA questions. All UK medical schools with final year students (N = 32) were invited to participate between September and November of 2018, and 20 schools enrolled. The students were invited to sit a formative examination. The examination consisted of 50 questions based on the core knowledge required of a medical graduate. The examination was taken twice in succession, first in VSA format and second in SBA format. The VSA format required students to write a one‐ to five‐word answer in response to a clinical vignette (see Examples 1 to 4 in Figure 1). The SBA version gave five options, one of which was the best answer. Information regarding the study was given to the students prior to enrolment. Participation in the examination was deemed as informed consent. Ethical approval was provided by the Medical Education Ethics Committee (Reference MEEC‐1718‐100) at Imperial College London.
FIGURE 1

Example questions from the very short answer paper

Example questions from the very short answer paper Incorrect VSA responses for each question were collated and reviewed by a single researcher who performed a thematic analysis of errors. Incorrect answers for the SBA questions were not analysed in this secondary analysis. Types and range of error varied between questions; thus, categorisation was subjective and question specific. The processed data were subsequently analysed and categorised by four researchers: three practicing doctors of mixed clinical experience and one lay person. Categorisation was achieved through discussion of errors identified and if/how they could be improved through altered teaching.

FINDINGS

The 42,670 incorrect answers to the 50 VSAs given by 1417 final year students from 20 medical schools within the UK were analysed. The number of students participating per medical school ranged from 3 to 256. A variety of errors were observed. Many were attributed to a lack of knowledge. Lack of knowledge errors were specific to the question, making generalisable lessons difficult; however, errors were also identified that we believe could be preventable by changing teaching practices. Four types of error were identified in multiple questions, which could be used to inform teaching and learning. These errors were as follows: Inability to identify the most important abnormal value. Over or unnecessary investigation. Lack of specificity of radiology requesting. Over‐reliance on trigger words. Four types of error were identified in multiple questions, which could be used to inform teaching and learning. These errors were as follows: Inability to identify the most important abnormal value. Over or unnecessary investigation. Lack of specificity of radiology requesting. Over‐reliance on trigger words. Examples of each type of error—related to the questions shown in Figure 1—are further described below. Table 1 summarises the most common incorrect responses to each question.
TABLE 1

Question details and findings

Example questionCorrect answer a % Students answering incorrectly (from N = 1417)Three most common errors (frequency as % of incorrect answers from previous column)Learning point for clinical teachers
Example 1: Inability to identify the most important abnormal value Carboxyhaemoglobin 89.3%

Observations (29.2%) (e.g., heart rate and blood pressure)

Carbon monoxide (28.5%)

Arterial blood gas (26.5%)

Teach students how to identify which investigation result is most important to guide diagnosis or treatment when presented with a list of investigation results.
Example 2: Unnecessary or over‐investigationCT of abdomen and pelvis97.7%

CT pulmonary angiogram (35.2%)

d‐Dimer (27.8%)

ECG (7.7%)

Teach students about the judicious and step‐wise use of investigations alongside the avoidance of unnecessary tests.
Example 3: Lack of specificity of radiology requesting Ultrasonography of neck 53.4%

Imaging—location not specified (31.6%)

FNA (18.8%)

Thyroid scintigraphy (15.1%)

Teach students how to be specific when requesting radiology investigations, including requesting the modality and anatomical region for all types of imaging.
Example 4: Over‐reliance on trigger words Osteoarthritis 39.5%

Gout (28.6%)

Rheumatoid arthritis (20.4%)

Fracture (12.3%)

Teach students to avoid using trigger words as the main determinant of pattern recognition and to be cognisant of when they are using pattern recognition to ensure they consider alternative hypotheses prior to ‘closure’.

Note that other answers would have been accepted as correct had the context of the answer remained the same.

Question details and findings Observations (29.2%) (e.g., heart rate and blood pressure) Carbon monoxide (28.5%) Arterial blood gas (26.5%) CT pulmonary angiogram (35.2%) d‐Dimer (27.8%) ECG (7.7%) Imaging—location not specified (31.6%) FNA (18.8%) Thyroid scintigraphy (15.1%) Gout (28.6%) Rheumatoid arthritis (20.4%) Fracture (12.3%) Note that other answers would have been accepted as correct had the context of the answer remained the same.

Inability to identify the most important abnormal value (Example 1)

Clinicians must identify key investigations required for diagnosis and management. Many candidates answered with either an incorrect parameter or something that was not a parameter (e.g., arterial blood gas [ABG] and carbon monoxide). While modern ABG analysers routinely provide carboxyhaemoglobin results, it is important that clinicians can understand and explain both what is being measured and its significance.

Unnecessary or over‐investigation (Example 2)

Judicious use of investigations is important for patient safety and economic reasons. Unprovoked deep vein thrombosis (DVT) in a 60‐year‐old man should prompt a clinician to look for an underlying malignancy. A significant number of candidates requested a CT pulmonary angiogram (CTPA) or d‐dimer as described in our previous study. The d‐dimer is not helpful in a patient with confirmed DVT, and a CTPA is not indicated in the absence of clinical features of pulmonary embolism.

Lack of specificity of radiology requesting (Example 3)

Clinicians often request radiological investigations that require asking for the correct test. Marks were frequently lost because candidates wrote vague responses such as ‘imaging’ or ‘ultrasound’ without specifying what type of scan or which anatomical region should be investigated.

Over‐reliance on trigger words (Example 4)

These errors resulted from candidates incorrectly focusing on trigger words and instigating erroneous pattern recognition. The example shows how focusing on ‘base of thumb pain’ without considering the additional information provided has resulted in an incorrect diagnosis of gout or a scaphoid fracture.

DISCUSSION

There is minimal literature discussing the impact of using student errors to alter our clinical teaching practices. Analysis of SBA questions is challenging because cueing can mask students' misunderstanding. VSA questions were developed to create a more authentic measure of knowledge while allowing electronic delivery and predominantly machine marking. In this study, we have shown that analysis of errors is possible and affords greater insight into students' knowledge and understanding. In this study, we have shown that analysis of errors is possible and affords greater insight into students' knowledge and understanding. Reviewing the incorrect answers given in a large‐scale VSA question examination generated some general and repeated errors. If these errors were made in real life, they could impact patient safety, so addressing them is not merely about teaching ‘to the test’. Providing information to clinical teachers about errors could therefore serve as a prompt for reflection. Our reflections on these errors helped us to identify a learning point for each of the four errors, as shown in Table 1. All of these learning points could be implemented by clinical teachers outside of the ‘examination preparation’ arena. Providing information to clinical teachers about errors could therefore serve as a prompt for reflection. Our reflections on these errors helped us to identify a learning point for each of the four errors. This study is limited by being retrospective in nature. Discussion groups were not undertaken; thus, we were unable to obtain candidates' reflections on why incorrect answers were made or their actual cognitive reasoning processes. The examination was not critical to educational progression; thus, errors could be due to lack of engagement rather than a lack of knowledge or understanding. VSA questions were a new format for most participants, and previous studies have suggested that students may adapt learning and revision strategies if this type of assessment was used summatively. The errors we identified may therefore not be reproducible if widespread adoption of this assessment method occurs and may vary locally (we were unable to investigate error patterns made by students at different medical schools, for example). While there is no evidence that errors made in examinations are carried over to clinical practice, there is evidence that similar errors including over‐reliance on a clinical finding or investigation, over‐investigating and premature closure can all result in errors in clinical practice. Finally, the frequency of each type of error in our dataset depends, in part, on the content of the questions used, and therefore, our discussion may not represent the most prevalent errors across the entire domain of applied medical knowledge.

CONCLUSION

Analysis of student errors in answering VSA questions could provide information to guide clinical teachers. There is a dearth of literature looking at how examination results can be used to improve teaching. We propose that analysis of errors could be beneficial to both the student and teacher. We have identified four types of error that resulted in repeated mistakes and suggest ways in which teaching could be adapted to mitigate them. This was a descriptive retrospective analysis, and thus, further work needs to be undertaken to determine optimal approaches to using the feedback identified by this, or a similar review to enhance teaching, and then to prospectively evaluate the impact of such changes to teaching.

CONFLICT OF INTEREST

At the time of the study, AHS, RW, MG and CB were members of the Board of the MSCAA. The authors declare they have no other conflicts of interest.

ETHICS STATEMENT

Ethical approval was provided by the Medical Education Ethics Committee (Reference MEEC‐1718‐100) at Imperial College London.
  9 in total

1.  Errors in clinical reasoning: causes and remedial strategies.

Authors:  Ian A Scott
Journal:  BMJ       Date:  2009-06-08

2.  A closer look at cueing effects in multiple-choice questions.

Authors:  L W Schuwirth; C P van der Vleuten; H H Donkers
Journal:  Med Educ       Date:  1996-01       Impact factor: 6.251

3.  Exploring examinee behaviours as validity evidence for multiple-choice question examinations.

Authors:  Luke T Surry; Dario Torre; Steven J Durning
Journal:  Med Educ       Date:  2017-07-31       Impact factor: 6.251

4.  Do medical students learn from multiple choice examinations?

Authors:  P J Rees
Journal:  Med Educ       Date:  1986-03       Impact factor: 6.251

5.  Very-short-answer questions: reliability, discrimination and acceptability.

Authors:  Amir H Sam; Samantha M Field; Carlos F Collares; Cees P M van der Vleuten; Val J Wass; Colin Melville; Joanne Harris; Karim Meeran
Journal:  Med Educ       Date:  2018-02-01       Impact factor: 6.251

6.  The use of objective structured clinical examination (OSCE) for detecting and correcting teaching-learning errors in physical examination.

Authors:  K R Sethuraman
Journal:  Med Teach       Date:  1993       Impact factor: 3.650

7.  The importance of feedback for medical students' development.

Authors:  Rahul Rajesh Amin; Rohin Patel; Prashant Bamania
Journal:  Adv Med Educ Pract       Date:  2017-03-22

8.  Comparing single-best-answer and very-short-answer questions for the assessment of applied medical knowledge in 20 UK medical schools: Cross-sectional study.

Authors:  Amir H Sam; Rachel Westacott; Mark Gurnell; Rebecca Wilson; Karim Meeran; Celia Brown
Journal:  BMJ Open       Date:  2019-09-26       Impact factor: 2.692

9.  Using very short answer errors to guide teaching.

Authors:  Oliver Putt; Rachel Westacott; Amir H Sam; Mark Gurnell; Celia A Brown
Journal:  Clin Teach       Date:  2022-01-25
  9 in total
  1 in total

1.  Using very short answer errors to guide teaching.

Authors:  Oliver Putt; Rachel Westacott; Amir H Sam; Mark Gurnell; Celia A Brown
Journal:  Clin Teach       Date:  2022-01-25
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.