Literature DB >> 31528647

The effect of test-enhanced spaced learning on the otolaryngology board and annual examination results: A quasi-experimental study.

Sasan Dabiri1, Aeen Mohammadi2, Rita Mojtahedzadeh2.   

Abstract

INTRODUCTION: In the test-enhanced spaced learning, educational contents are presented in small packages of well-developed test questions with a defined frequency to the learners. It is not clear that applying this educational style might have a positive impact on the summative assessment. Therefore, in this study, we assessed the effect of the test-enhanced spaced learning on the otolaryngology board and annual examinations of residency training.
METHODS: In a quasi-experimental study with consecutive sampling, all forty-four residents of otolaryngology in four levels of training in 2016 at Tehran University of Medical Sciences (TUMS) received daily-standardized multiple-choice questions with a twice-repeated frequency of 10 days. Individual feedback according to one's response to each test was provided. The results of national board and annual exam were compared with the same results of all residents of other universities and previous year TUMS' residents for whom spaced learning were not applied and they were considered as the control groups. The board exam had two parts, multiple-choice questions, and computer-based clinical examination. The annual exam format was multiple choice questions. The total score for each one was 150. Student's t-test and Mann-Whitney U test were used for comparative analyses.
RESULTS: The mean of the board exam results showed statistically significant improvement compared to other medical schools (113.6±10.7 vs. 102.9±13.4 in multiple-choice questions, p-value= 0.048, 118.7±12.5 vs. 54.1±60.0 in the computer-based clinical examination, p-value<0.001), while similar comparison results in the previous year did not show any significant difference.
CONCLUSION: Spaced learning with testing effect may be useful in the clinical education setting to improve the learning outcomes.

Entities:  

Keywords:  Cognition ; Internship ; Otolaryngology; Residency ; Learning

Year:  2019        PMID: 31528647      PMCID: PMC6664287          DOI: 10.30476/JAMP.2019.74696

Source DB:  PubMed          Journal:  J Adv Med Educ Prof        ISSN: 2322-2220


Introduction

The spaced effect that uses repeated exposures to a content or a practice to improve memorization was defined by Ebbinghaus in the late 19th century (1). Recently, biological studies have shown that repetition with the gap between exposures has significant positive effects on memory (2,3). Spaced learning, which is also known as distributed practice, is based on this effect and is effective in various types of learning in humans (4-6). Repeating the content presentation to the learner seems to improve their long-term memorization. The term “spaced” is used to distinguish it from the “massed” learning. In massed learning, all educational contents are presented in one stage, but in spaced learning, contents are fractionated to small parts and delivered repeatedly in diverse time intervals. As effective memorizing of information is very critical for clinical practitioners, some researchers in medical education have focused on the impact of the spaced learning in recent two decades (7-12). Repetitive presentation of small packages instead of massed exposure is the key and critical element in this setting for the improvement of long-term memory. On the other hand, students could remember most parts of the delivered material when they use tests during educational courses. This phenomenon, which is known as the testing effect increases the percentage of correct retrieval of information (13,14). The potential of the test, being a small package of information, could be helpful in spaced learning, especially in medical education (15). On the other hand, social networks could be an effective means of transferring small packages of information. Generally, the emergence of e-learning in surgical education, mainly as a blended approach, could enhance the results of learning (16). Social networks, especially in the context of mobile learning, seem to improve learning in higher education as a new e-learning tool (17,18). Telegram is one of these social networks that are popular in Iran and has overgrowing but user-friendly capabilities. One of the acceptable tools for evaluation of the educational programs and any change in them are summative assessments (19). A few studies focused on the summative assessments to evaluate the results of the test-enhanced spaced learning as a new educational structure in medical students (20). In this study, we delivered test questions based on spaced learning strategy via Telegram and assessed its effect on the results of annual and board examinations in otolaryngology residents.

Methods

In this quasi-experimental research with consecutive sampling, all 44 residents of Otolaryngology-Head and Neck Surgery at Tehran University of Medical Sciences (TUMS) participated in the study in 2016. The duration of the residency program in Iran is four years. Residents should participate in the annual exams to upgrade from the previous level of residency (levels 1 to 3) or be nominated for board examination (level 4). These annual exams have two parts: clinical work assessment (total score of 150 with minimum pass level of 105) and written multiple-choice question (MCQ) exam (total score of 150 and the minimum pass level depends on the level of residency). In this quasi-experimental research with consecutive sampling, all 44 residents of Otolaryngology-Head and Neck Surgery at Tehran University of Medical Sciences (TUMS) participated in the study in 2016. The duration of the residency program in Iran is four years. Residents should participate in the annual exams to upgrade from the previous level of residency (levels 1 to 3) or be nominated for board examination (level 4). These annual exams have two parts: clinical work assessment (total score of 150 with minimum pass level of 105) and written multiple-choice question (MCQ) exam (total score of 150 and the minimum pass level depends on the level of residency). Seven last-year residents were the candidates for the national board exam. The board exam of Otolaryngology-Head and Neck Surgery specialty has two parts: the first one is written MCQ exam, and the second step is a computer-based clinical examination (CCE). The MCQs exam contains 150 questions with four choices. The CCE includes various types of questions that assess mainly the first four levels of Bloom’s taxonomy in the cognitive domain (remembering, understanding, applying, analyzing). Each part has a score of 150, and anyone who obtains ‏‏the minimum of 105 in MCQ exam would be eligible to participate in the CCE. All the mentioned participants were invited to a private electronic group in Telegram. As mentioned earlier, it is a phone-based media network with an instant message service that was released in 2013. Its real-time performance, the possibility of easy file transfer, simple group creation with multiple administrating and participating options, browsing media types like text, photos, and videos, and compatibility with various platforms benders it a favored social media network in Iran. This collaborative program was added to the other educational programs of the department. Participation was not mandatory for the residents, and it was assured that participation would not affect their annual clinical work assessment scores. Residents received one similar question daily, responded to it until the end of that day, and received ‏‏feedback individually according to the response. Also, there was an opportunity for discussion in the mentioned private group. There was not a uniform time interval for the repetition of the questions in spaced learning in the literature. On the other hand, there was a large amount of medical content as a reference study for annual and board examination. As a result, it was decided to perform two repetitions in this study. For each question, this process was repeated on day 11th and day 21st after the first time of exposure to the question (for example, the question of day one was repeated on day 11 and 21; question 2 on day 12 and 22; and so on). One of the authors (SD) gave all feedbacks and managed the discussions in the private group. For covering the most parts of the reference content of examinations, the total 120 standardized MCQs were designed based on the reference book of the annual and board exams (Cummings Otolaryngology - Head and Neck Surgery, 6th edition, 2015, Elsevier, Saunders Co.) by the expert members of the national otolaryngology specialty board committee and validated. These questions were delivered to the residents for five months. Also, an organized answer for each question was developed in the portable document format from the main reference book and other related educational resources. Selection of the control group within the university was not possible because the participants could communicate with each other and separation of the groups was not earned. Thus, all residents of other universities and previous year TUMS’ residents, none of whom had received educational contents with spaced structure, were selected as the control groups and the results of the annual and board exams were compared. Participants’ satisfaction was assessed with a researcher made scale which was designed and delivered by Google form builder. Participants responded to the questionnaire anonymously to increase the reliability of the responses. The questionnaire was validated by ten experts in the field of Otolaryngology or medical education. Its Cronbach alpha was 0.94. Other than demographic questions such as the level of residency or total time of participation in the project, the questionnaire included six questions with a five-point Likert scale from strongly agree to strongly disagree. Half of these questions were about the social media used for education and its impact on the learning of the students (satisfaction of social media use in education, usefulness of social media in education, agreement with social media use in education in the future), while other questions were mentioned to the direct point of the spaced learning structure (satisfaction from spaced learning in education, usefulness of spaced learning, agreement with spaced learning in the future).

Statistics

Student’s independent t-test was applied for comparisons. For the groups in which the distribution was not normal, calculation of the differences with Mann-Whitney U test was also done. P-value of less than 0.05 was considered a statistically significant difference. For improving the methodologic design of the study, postgraduate students of the other universities had access to the questions and comprehensive related responses in a same daily release for the intervention group, but without the repeated structure. As there was a limitation for selecting the control group within the intervention university group (due to the risk of the mixture of the participants and the impacts on the outcomes), results of the other universities in that year were considered as the control group. Comparison of the participants’ results with the counterpart results of the otolaryngology residents in other universities in the same year was considered as a horizontal comparison. There were 11 universities other than TUMS that were training otolaryngology residency. The horizontal comparison was made with the mean score of the assessments of all other universities globally. Also, for better controlling of confounding factors and better evaluation of the results, a vertical comparison was performed as well, i.e. comparing the participants’ results of otolaryngology residents in the previous year from the same university.

Results

The total number of postgraduate students in the otolaryngology specialty training in the two years of 2015 and 2016 in Iran was 578 in 12 universities that were categorized into two groups: TUMS (40 and 44 students in the two years of 2015 and 2016, respectively), and other medical schools (241 and 253 students, respectively). All TUMS residents participated in answering the questions according to their daily free time during the five months of the intervention. However, their participation was not a diurnal activity, and the average active days in a spaced learning interactive process were not similar for different levels of residency. The mean percentages of participation in each level for the total 120 question were 0%, 40%, 33% and 22% for level 1 to 4 residents, respectively.

Annual examination

Descriptive results of the analysis of the annual examination are shown in Table 1. Horizontal comparisons of the mean MCQ exam scores did not reach any significant differences in the two years of the study. Also, no significant difference was found in the vertical comparison of the mean MCQ exam scores in the TUMS group. However, this comparison in other universities showed a significant reduction in the MCQ exam score. Comparisons of the annual clinical work assessment scores as a covariant in vertical and horizontal directions did not show statistically significant differences unless in the vertical comparisons of the other universities group.
Table 1

Annual examination results of the otolaryngology specialty in TUMS and other medical schools

Type of examMedical School (number of residents)2015p-value2016
between schools - 2015within a school groupbetween schools - 2016
MCQ score: >mean±SD TUMS (31)91.5 ± 10.00.1930.9900.50691.6 ± 12.2
Other medical schools (180)94.9 ± 13.60.001*89.8 ± 14.6
CWA score: mean±SD TUMS (37)122.1 ± 23.90.4610.6860.678124.0 ± 11.5
Other medical schools (192)118.8 ± 23.60.040*123.0 ± 13.3

TUMS: Tehran University of Medical Sciences, MCQ: Multiple Choice Question, CWA: Clinical Work Assessment, SD: Standard Deviation

Statistically significant

Annual examination results of the otolaryngology specialty in TUMS and other medical schools TUMS: Tehran University of Medical Sciences, MCQ: Multiple Choice Question, CWA: Clinical Work Assessment, SD: Standard Deviation Statistically significant

Board examination

Descriptive results of the analysis of the board examination are shown in Table 2. The analysis showed that there was a statistically significant better result in MCQ exam in the spaced education group in comparison to other universities. There was no significant difference when the horizontal comparison was done in the previous year. Analysis of the CCE scores in horizontal comparisons was also accompanied by the same results (Figure 1). A significant difference was found with the intervention group but not in the previous year.
Table 2

Board examination results of the otolaryngology specialty in TUMS and other medical schools

Type of examMedical School (number of residents)2015p-value2016
between schools - 2015within school groupsbetween schools - 2016
MCQ score: mean±SD TUMS (9)107.3 ± 14.70.3910.3620.048*113.6 ± 10.7
Other medical schools (61)102.6 ± 15.40.905102.9 ± 13.4
CCE score: mean±SD TUMS (7)75.2 ± 58.20.4600.058<0.001*118.7 ± 12.5
Other medical schools (61)60.7 ± 54.00.52054.1 ± 60.0

TUMS: Tehran University of Medical sciences, MCQ: multiple choice question exam, CCE: computer-based clinical examination, SD: standard deviation

statistically significant

Figure1

The simplified structure of the horizontal and vertical comparisons of the summative assessment results.

In vertical comparison, CCE scores were better in the intervention group, but this difference was not statistically significant (p-value: 0.058). The non-parametric analyses with Man Whitney-U test did not reach a significant difference (p-value: 0.142). The observed power was 0.435. MCQ exam results also did not show a significant result in vertical comparison in the TUMS group. Vertical comparisons in other universities did not show significant differences in CCE and MCQ scores. These vertical comparisons with Mann-Whitney U test also did not show significant differences. Board examination results of the otolaryngology specialty in TUMS and other medical schools TUMS: Tehran University of Medical sciences, MCQ: multiple choice question exam, CCE: computer-based clinical examination, SD: standard deviation statistically significant

Satisfaction Survey

From the total of 44 residents in the TUMS, 75% filled the satisfaction survey form. 97% agreed that spaced learning was effective or very effective (two top points from 5-point Likert scale) and others believed that it was neither good nor bad. While 85% completely agreed or agreed with the usefulness of the spaced learning system, 9% did not have an opinion, and 6% disagreed. 49% preferred to continue this educational system without any changes for the future, while 36% agreed with its continuation with little changes. Others did not have any opinion or agreed with the continuation but with significant changes. Overall, there was nobody with severe disappointment on various questions of the survey, noting that even the selection of disagreement choices in the Likert scale was less than 10%, if present in different questions.

Discussion

Despite more than a century after Ebbinghaus presented the spacing effect, the recent electronic world gives a good infrastructure for applying this method. However, few studies have evaluated its impact on clinical education, especially on summative evaluations like the annual examinations. This study tried to assess the effect of the test-enhanced spaced learning on the final summative evaluation in the Otolaryngology residents. There are not any statistically significant differences between the results of TUMS and other medical schools in annual and board examination results in the year before the intervention. Therefore, it could be assumed that these two groups did not have significant differences in competency and they were probably similar in educational structure and personal potencies for these examinations. Figure 1 simplifies the structure of the comparisons used in this study. As the Figure shows, the analysis was performed in vertical and horizontal directions. The simplified structure of the horizontal and vertical comparisons of the summative assessment results. Statistically significant differences were seen in the board examination results between TUMS and other medical schools in 2016, while such a difference was not seen in 2015 )horizontal comparison(. As there was not a significant difference between the two groups in one year earlier, this superiority of the results in the target group might be assumed secondary to the intervention. On the other hand, the vertical comparison could not show any significant difference in both groups. In annual examination results, no significant changes were seen in horizontal comparisons of both. Also, no significant differences were seen in two TUMS groups in vertical comparison, while a significant reduction in MCQ exam score of the other medical schools was found in the second year (2016). This might be secondary to the difference between the competencies of the participants. However, statistical analyses of the ward score did not show significant differences, and it could be assumed that the personal factors do not have a significant impact. Another reason might be the changes in the difficulty index of the examination. Maintenance of the mean score in TUMS after spaced education might be indirectly interpreted as the effectiveness of the spaced learning. In a cohort study by Smeds and colleagues (19), the results of the final annual examination of the undergraduate medical students in the rotation of surgery were improved significantly in comparison to the previous year. The materials they used for spaced learning were MCQs designed according to educational objectives for the surgery ward rotation and were accessible via smartphone app., email, or online. This difference, i.e. the current study could not show a significant improvement in the results, might be due to the sample size as the observed power analysis of the current study was 44%. Two important questions in the spaced learning are the time duration among the exposures, and the number of the exposures to reach long-term memorization. Cepeda et al. (3) showed that despite the effectiveness of duration from 1 second to more than a month, a few days of separations between the exposures had better results in comparison to exposures within a day. In the study by Smeds et al. (19), three and seven days for incorrect and correct responses respectively seemed to be an effective spaced time. Larsen (20) emphasized that intervals around a week to a month had very long stability of memorization of the information for months to years. A ten-day fixed interval with three repetitions in the current study showed some beneficial effects on summative assessment. Participation in this program was not mandatory for residents. Most of the participants were the second and third-year residents that were 40% and 33%, respectively. In a similar research by Blazek et al. (10), email was used for spaced learning in medical students, and the participation rate was 75%. Task and responsibility overload in the first year of residency in a referral non-private teaching hospital may have a significant impact on the residents’ participation. On the other hand, postgraduate residents usually are very busy in comparison to undergraduate medical students. As emphasized by Mammarella and the colleagues (12), the effectiveness of the spaced learning could be affected by emotional status. Finally, it is worth mentioning that despite the absence of active participation (responding to the messages), residents could answer the questions for themselves without any visible activity in the social media network. Residents were very satisfied with this new educational program, and 87.5% declared that it was effective or very effective. About 97% or even more agreed that it was favorable. Half of the participants preferred to continue this program without any change in design. Similar results were achieved by Smeds in a student survey which showed that helpfulness of the intervention for final course exam, annual exam and preparation for the oral exam were 95%, 90%, and 60%, respectively (20). Spaced learning could be effective in surgical skills. Cecilio-Fernandes and colleagues (21) have reviewed the literature systematically for the role of spacing on the acquisition of surgical skills. The results showed that spacing of the education of the skills had improved the final score, and this point was helpful even for simple procedures like suturing. Nevertheless, the optimal gap time, which was between 5 minutes and one month and was not clear. Kim et al. (22) tried to reach the optimal time for spacing and retrieval in the real world. They analyzed the data collected using the learning management system (LMS) from 10514 employees who had answered to some work-related questions. The results showed a significant direct relationship between spacing intervals and retention intervals using a developed dataset model. One of the limitations of this study was the low sample size, especially for participants of the board exam. Lack of a concurrent control group may also be important. Different questions in two years of annual and board examination might have a confounding impact on the results. On the other hand, randomization was also impossible due to ethical educational issues. Also, the participation of residents was not mandatory. The only factor that motivated them to be active in this program was their incentive along with a load of their obligatory tasks. The time interval duration and number of repetitions have remained a challenge in the literature (23). Comparison of two groups with different time durations and frequency of repetition in a randomized trial or a multi-center trial could be helpful for more clarification of the effectiveness of spaced education. Spaced learning could also be assessed in the psychomotor abilities that are important in surgical skills, especially for fine surgical skills like otolaryngology surgeries.

Conclusion

According to the results of this research on the postgraduate students of Otolaryngology, the test-enhanced spaced learning may improve learning considering the changes in the summative assessment results. However, further research on this subject is recommended.
  16 in total

1.  Making memories stick.

Authors:  R Douglas Fields
Journal:  Sci Am       Date:  2005-02       Impact factor: 2.142

Review 2.  Distributed practice in verbal recall tasks: A review and quantitative synthesis.

Authors:  Nicholas J Cepeda; Harold Pashler; Edward Vul; John T Wixted; Doug Rohrer
Journal:  Psychol Bull       Date:  2006-05       Impact factor: 17.737

3.  Spacing effects in learning: a temporal ridgeline of optimal retention.

Authors:  Nicholas J Cepeda; Edward Vul; Doug Rohrer; John T Wixted; Harold Pashler
Journal:  Psychol Sci       Date:  2008-11

4.  Test-enhanced learning in medical education.

Authors:  Douglas P Larsen; Andrew C Butler; Henry L Roediger
Journal:  Med Educ       Date:  2008-10       Impact factor: 6.251

5.  The Power of Testing Memory: Basic Research and Implications for Educational Practice.

Authors:  Henry L Roediger; Jeffrey D Karpicke
Journal:  Perspect Psychol Sci       Date:  2006-09

6.  The theoretical base of e-learning and its role in surgical education.

Authors:  Evgenios Evgeniou; Peter Loizou
Journal:  J Surg Educ       Date:  2012-07-15       Impact factor: 2.891

7.  Picking the right dose: the challenges of applying spaced testing to education.

Authors:  Douglas P Larsen
Journal:  J Grad Med Educ       Date:  2014-06

8.  Randomized controlled trial of spaced education for pediatric residency education.

Authors:  Erin F Mathes; Ilona J Frieden; Christine S Cho; Christy Kim Boscardin
Journal:  J Grad Med Educ       Date:  2014-06

9.  Learning Curves of Virtual Mastoidectomy in Distributed and Massed Practice.

Authors:  Steven Arild Wuyts Andersen; Lars Konge; Per Cayé-Thomasen; Mads Sølvsten Sørensen
Journal:  JAMA Otolaryngol Head Neck Surg       Date:  2015-10       Impact factor: 6.223

10.  Interactive spaced education versus web based modules for teaching urology to medical students: a randomized controlled trial.

Authors:  B Price Kerfoot
Journal:  J Urol       Date:  2008-04-18       Impact factor: 7.450

View more
  2 in total

1.  Developing a national e-learning course in otorhinolaryngology: the Danish experience.

Authors:  Helena Fagö-Olsen; Charlotte Duch Lynggaard; Kasper Aanæs; Per Cayé-Thomasen; Steven Arild Wuyts Andersen
Journal:  Eur Arch Otorhinolaryngol       Date:  2020-03-13       Impact factor: 2.503

2.  Attitudes of an international student cohort to the Quizlet study system employed in an advanced clinical health care review course.

Authors:  Benjamin D Zeitlin; Nishanth D Sadhak
Journal:  Educ Inf Technol (Dordr)       Date:  2022-10-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.