Literature DB >> 32628297

Rethinking gross anatomy in a compressed time frame: Clinical symptoms, not case studies, as the basis for introductory instruction.

Sasha N Zill1.   

Abstract

The goal of this observational study was to develop effective approaches to introduce first year medical students to gross anatomy/embryology in a compressed time frame. Pedagogical reorganization of anatomy instruction in the regions of Lower Extremity and Head and Neck was based upon core clinical conditions taught in second-year and USMLE Step 1 board review courses. These conditions were not presented as clinical problems, as many students had limited prior training in medical terminology, but focused upon clinical symptoms, allowing for direct correlation of structure and function. Instruction stressed vocabulary acquisition and was extended to prepare for laboratory dissections. Overall methodology was multimodal, including "flipped" and traditional lectures, study of prosections/radiographs and small group laboratory review sessions. Content was significantly reduced: knowledge of muscle actions and innervations was required, not muscle origins and insertions. Performance was evaluated by criterion-based written examinations that included a set of questions (34) asked repetitively over an 8 year period (n = 606 students) and by regional practical exams. Mean scores in all areas were sustained or numerically improved, despite the compression of instruction duration. Analysis showed no significant differences based upon question format or instructional modality. Despite the high performance levels, students needing assistance in learning could be identified by score distributions. A survey of students indicated that these changes effectively decreased stress and facilitated review for the USMLE Step 1 Board examination. These results suggest that training in gross anatomy can be modified to a compressed duration by instruction in the context of clinical symptomatology.
© 2020 The Authors. Clinical Anatomy published by Wiley Periodicals LLC on behalf of American Association of Clinical Anatomists.

Entities:  

Keywords:  curriculum; dissection; learning; medical education; vocabulary

Mesh:

Year:  2020        PMID: 32628297      PMCID: PMC7754134          DOI: 10.1002/ca.23645

Source DB:  PubMed          Journal:  Clin Anat        ISSN: 0897-3806            Impact factor:   2.414


INTRODUCTION

Recent reforms in curricula in many medical schools have led to a compression and reduction of the duration of instruction in gross anatomy (AAMC‐HHMI, 2009; Halliday, O'Donoghue, Klump, & Thompson, 2015; Hołda et al., 2019; McBride & Drake, 2018; Papa & Vaccarezza, 2013; Sbayeh et al., 2016). A number of schools have adopted integrated systems‐based courses that reduce the duration of study of basic science in many disciplines, including anatomy (Klement, Paulsen, & Wineski, 2011; Schwartz, Ajjarapu, Stamy, & Schwinn, 2018; Sugand, Abrahams, & Khurana, 2010; White et al., 2014). Medical schools have also substantially decreased or eliminated traditional lectures, including anatomy (Kamei, Cook, Puthucheary, & Starmer, 2012; Kerby, Shukur, & Shalhoub, 2011; Mazur, 2009; Prober & Heath, 2012; Vasan, DeFouw, & Compton, 2011). Instruction has been replaced by “flipped” lectures and methods such as problem‐based instruction, team‐based and independent learning that can engender “active” learning (Koles, Stolfi, Borges, Nelson, & Paremelee, 2010; Morton & Colbert‐Getz, 2017; Neville, 2009). These methods are “cost effective” and do not require faculty extensively trained in gross anatomy (Vasan et al., 2011; Yammine, 2014). Most of these changes have resulted in reduced time allotted to anatomy instruction (Drake, 2014; Drake, McBride, Lachman, & Pawlina, 2009; McBride & Drake, 2018; Rizzolo et al., 2010). There have been few specific assessments of the effects of these curricular changes on knowledge of gross anatomy and published studies have reached diverse conclusions (Clunie, Morris, Joynes, & Pickering, 2018; Hołda et al., 2019). Student knowledge of anatomy as a whole has been reported as somewhat higher (Findlater, Kristmundsdottir, Parson, & Gillingwater, 2012), equivalent (Bergman, Prince, Drukker, van der Vleuten, & Scherpbier, 2008) or substantially lower (Hinduja, Samuel, & Mitchell, 2005; Hołda et al., 2019; Papa & Vaccarezza, 2013) in integrated/problem based curricula compared to traditional anatomy instruction. Understanding of surface anatomy (McKeown et al., 2003) and confidence in knowledge of anatomy (Farey et al., 2018) have also been reported as low following training in systems‐based curricula. Comparison of scores on United States Medical Licensing Examination (USMLE) Step 1 and Step 2 examinations showed no significant differences between students trained in traditional or integrated anatomy courses (Cuddy, Swanson, Drake, & Pawlina, 2013), but those examinations provide only limited assessment of overall knowledge of anatomy. Recent changes in medical education have also generated new problems related to student preparedness in gross anatomy and stress (Slavin, Schindler, & Chibnall, 2014). As anticipated in the Carnegie report (a major impetus for curriculum revision), the expansion of class sizes in many medical schools has increased “the diversity of prior knowledge, skills, and abilities that students bring with them to medical school” (Irby, Cooke, & O'Brien, 2010). Compression of the duration of instruction in anatomical sciences has been particularly disadvantageous and stressful for students with limited prior training as it requires simultaneous acquisition of the vocabulary and “grammar” of medicine (Bowen & ten Cate, 2017; Wilson‐Anstey, Lambert, & Krog, 2019). The increase in student stress has recently been recognized as a major problem, with consequences in rising rates of attrition, alcohol dependency and even suicide among medical students (Dyrbye, Lipscomb, & Thibault, 2019; Dyrbye, Thomas, & Shanafelt, 2005; Hill, Goicochea, & Merlo, 2018; Jackson, Shanafelt, Hasan, Satele, & Dyrbye, 2016; Laitman & Muller, 2019). However, few studies have proposed specific methods for modifying medical curricula to decrease student stress. The adverse effect on student well‐being was also cited as a major impetus in the recent change in the USMLE Step 1 examination from numerical to pass/fail scoring (Committee to Evaluate the USMLE Program, 2020; Humphrey & Woodruff, 2020). Originally intended to inform a binary decision on licensure, the exam has, instead, been extensively utilized in ranking students for residency programs (Chaudhry, Katsufrakis, & Tallia, 2020; McGaghie, Cohen, & Wayne, 2011). A number of studies have found that this inadvertently led to the development of a self‐directed, “parallel” curriculum, in which students focused, through third‐party sources, upon clinical topics that were tested on the USMLE Step 1 exam, rather than basic science course material (Burk‐Rafel, Santen, & Purkiss, 2017; Kumar et al., 2015). In confronting these problems, many schools have adopted new approaches for establishing a strong basis of knowledge of anatomy within a clinical context (Bains & Kaliski, 2020; Brooks, Woodley, Jackson, & Hoesley, 2015; Willey, Lim, & Kwiatkowski, 2018). A number of studies have demonstrated the effectiveness of use of multimodal pedagogies (Estai & Bunt, 2016; Johnson, Charchanti, & Toupis, 2012; Klement, Paulsen, & Wineski, 2017) rather than single instructional methods. Multimodal teaching can increase the efficiency of instruction in students with diverse prior training in anatomy (Houser & Kondrashov, 2018) and may also aid to ameliorate some components of stress derived from compressed curricula (Slavin et al., 2014). Similar changes in the duration of gross anatomy instruction have occurred at the Joan C. Edwards School of Medicine. This observational paper describes pedagogical changes that were developed in preparation for a compression of instruction duration in a new, fully integrated curriculum. A goal of these changes was to integrate content in Anatomy in the Pedagogy in the areas of Head and Neck and Lower Extremity based upon understanding the symptoms of core clinical conditions rather than full analysis of clinical problems, as many students lacked the background for meaningful diagnostic analysis (Al‐Jamal, 2018; Bowen & ten Cate, 2017). These changes also sought to integrate the structure and format of the self‐directed “parallel” curriculum into formal training in anatomy and clinical science (Committee to Evaluate the USMLE Program, 2020). The major question (hypothesis) of this study was: can pedagogical changes maintain and consolidate learning in a compressed time frame, while decreasing student stress? In the following, data are reviewed that support this hypothesis. Furthermore, this method provided the context for instruction in anatomy further in the curriculum, could aid students in preparation for board examinations and permit identification of students needing assistance in learning (Gullo, McCarthy, Shapiro, & Miller, 2015; Zill, 2019).

METHODS

Gross Anatomy Instruction and Evaluation—Gross anatomy has been consistently taught by body region at the Joan C. Edwards School of Medicine and evaluated by in‐house (custom) written multiple‐choice and regional laboratory practical examinations. Examinations were given at the end of study (or midpoint and end for Head and Neck) of each body region. Student scores and point biserial values of questions were obtained from ExamSoft records for the years 2014–2019 or from a website maintained by our school (OME Question Bank) for 2010–2013. Data on student performance on earlier written examinations (2005–2009) were derived from departmental grading reports and the accuracy of the data has been confirmed through records kept by other course faculty. These records were assembled into a long‐term database on performance in a spreadsheet (Excel) file and mean percentage scores and standard deviations were calculated. Mann–Whitney‐Wilcoxon and Paired t tests were used to evaluate the significance of differences in performance using SPSS (v26 IBM) and SigmaStat software (Jandel Scientific). Data for figures were plotted as histograms and scatter plots in SigmaPlot (Jandel Scientific). As written examinations after 2012 also include questions in other courses that were taught simultaneously, more specific evaluation of performance in gross anatomy was obtained through a database of select repeat questions (Joncas, St‐Onge, Bourque, & Farand, 2018). These questions (34) were multiple‐choice in format and were asked on examinations in the regions of Lower Extremity and Head and Neck in either identical form or with minor modifications throughout the period of 2012–2019. No consistent changes in performance were observed after the minor modifications. Repeat questions were considered to be Clinical Vignette in format if they referred to the symptoms of a specific patient or Basic Science if they focused upon knowledge of anatomy without directly referring to a clinical case. Student scores on regional practical examinations were assembled from course grading reports. Practical examinations were taken in groups in the gross anatomy laboratory (2005–2019). Tests were timed and students moved through stations with questions on dissections or prosections. Answers were hand‐written, fill‐in‐the‐blank format and were graded by the principal instructor in each region. Survey of Students on Effectiveness of Curricular Revision—A survey was sent by email immediately after completion of instruction to the class in 2019–2020. Students were asked to rate numerically (Likert Scale format: 5 strongly agree, 1 strongly disagree) statements on (a) the effectiveness of the format of Clinical presentation of anatomy, (b) whether the format aided in preparation for the USMLE Step 1 board examination, (c) the effects of the Clinical/Anatomical format on stress in learning. Replies to the survey questions were kept anonymous to faculty as they were returned to the class president (Luke Hamm, MUSOM Class of 2022) who tabulated the results and removed any individual identifiers. The response rate to the survey was 51.5% (41/80 students responding).

RESULTS

Adapting to changes in curricula and decreased duration of instruction in gross anatomy

In the years prior to 2005, gross anatomy instruction at the Joan C. Edwards School of Medicine was initiated at the start of the first year of medical school. The course was first given over the entire year and then limited to the first semester, concurrent with courses in biochemistry, cell biology and introduction to clinical medicine. The course consisted of lectures (7–10 per week) and laboratory sessions (2–3 per week, each 3–4 hr in duration, 6–8 students per cadaver) with full cadaver dissection, as well as study of prosections and images. After a series of curricular revisions, integration of first year basic sciences was initiated in 2012 and anatomy was taught and tested in short (typically 2–3 week) blocks with other disciplines. The number of disciplines taught simultaneously increased (beginning in 2014–2015, for example, Neuroanatomy/Neuroscience was taught in blocks concurrent with Head and Neck gross anatomy). The number of lectures and laboratory sessions in anatomy initially remained the same but decreased (~20%) in the last academic year (2019–2020) of this study. During the period of 2010–2019, the duration of anatomy instruction has been substantially compressed: the histograms in Figure 1 plot the duration of instruction (number of days from start to completion of instruction) for Lower Extremity (left) and Head and Neck for years 2010 and 2017–2019. There has been a 65.2% decrease in duration for the region of Lower Extremity and 41.0% for Head and Neck instruction.
FIGURE 1

Duration of gross anatomy instruction—These plots compare the duration of instruction (number of days from start to completion of instruction) for Lower Extremity (left) and Head and Neck for years 2010 and 2017–2019 and reflect a substantial compression in the duration of study [Color figure can be viewed at wileyonlinelibrary.com]

Duration of gross anatomy instruction—These plots compare the duration of instruction (number of days from start to completion of instruction) for Lower Extremity (left) and Head and Neck for years 2010 and 2017–2019 and reflect a substantial compression in the duration of study [Color figure can be viewed at wileyonlinelibrary.com]

Pedagogy based on clinical symptoms

Pedagogical changes were made with the goal of modifying the emphasis and content to provide a basic foundation of knowledge in anatomy within the compressed duration while preparing students for the USMLE Step 1 Board Exam. Specific clinical conditions were identified for each anatomical region (Figure 2) that were a subset of those taught in second‐year courses (diseases and therapeutics) at the Joan C. Edwards School of Medicine. Many of these conditions were also found in sources (board review books, websites) used for study for the USMLE Step 1 examination. The conditions were selected based upon their association (cause, symptom) with specific anatomical changes and many had previously been presented as clinical correlates of anatomy.
FIGURE 2

Core clinical diseases syndromes used as a context for anatomy instruction—Instruction in the regions of Lower Extremity and Head and Neck was reorganized based upon a core set of clinical conditions. These conditions have been taught in second‐year courses (diseases and therapeutics) and/or studied in USMLE Step 1 board review

Core clinical diseases syndromes used as a context for anatomy instruction—Instruction in the regions of Lower Extremity and Head and Neck was reorganized based upon a core set of clinical conditions. These conditions have been taught in second‐year courses (diseases and therapeutics) and/or studied in USMLE Step 1 board review Anatomy teaching was reorganized as logical extensions of the symptoms of the clinical syndromes. Clinical/anatomical charts (Figure 3a) were introduced in 2014 and initially used as review material. However, these charts and clinical conditions were progressively incorporated as primary teaching material in the period of 2015–2019. For example, circulation of the Lower Extremity was presented and then discussed in the clinical contexts (Figure 2, Left column): understanding of the symptomatology of varicose veins was established by elucidating the anatomical organization of venous drainage (superficial and deep veins, “muscular pump”) and the presence of valves and anastomoses (perforating veins). The specific causes of the pathology (e.g., thrombophlebitis) were not elucidated at that time but discussion was extended to understand the adaptation of anatomy to resist the physical effects of gravity in standing and walking. Other conditions also served as mechanisms for understanding the relationship of structure and function of the circulatory system: treatment of bleeding (laceration) of the femoral artery formed the basis for discussion of the function of arterio‐arterial anastomoses in allowing normal compression of arteries associated with joint movements (flexions).
FIGURE 3

Clinical/anatomical format and multimodal pedagogy—(a) Anatomical basis of symptoms of clinical syndromes—Predominant symptoms were identified for each clinical syndrome. The anatomical basis of that symptom was utilized to introduce discussion of the related body structures/systems. The detailed pathology was not discussed but provided a foundation for understanding the relationship between anatomical structure and function. (b) Laboratory dissection, prosections—All students participated in cadaver dissections, augmented by independent study of prosections with photographic and radiological images. Dissections were modified (e.g., brainstem) to maximize cognizance of three‐dimensional structure [Color figure can be viewed at wileyonlinelibrary.com]

Clinical/anatomical format and multimodal pedagogy—(a) Anatomical basis of symptoms of clinical syndromes—Predominant symptoms were identified for each clinical syndrome. The anatomical basis of that symptom was utilized to introduce discussion of the related body structures/systems. The detailed pathology was not discussed but provided a foundation for understanding the relationship between anatomical structure and function. (b) Laboratory dissection, prosections—All students participated in cadaver dissections, augmented by independent study of prosections with photographic and radiological images. Dissections were modified (e.g., brainstem) to maximize cognizance of three‐dimensional structure [Color figure can be viewed at wileyonlinelibrary.com] The shift to a clinical focus did not limit the extent of anatomy instruction. For example, the symptoms of Horner syndrome formed the basis for discussion of the structure of the autonomic nervous system (Figure 3a). The pathways of sympathetics (divergent two neuron arcs) to the skin allow for synchronous activation of sweat glands in thermoregulation, with the consequence that damage produced dry skin (anhydrosis). The requirements for control through descending pathways from the hypothalamus and activation and distribution through spinal and cranial nerves led to an understanding that Horner syndrome could result from damage at a number of anatomical locations, including cervical lesions (e.g., Pancost tumors) that can produce symptoms in the eye and face (ptosis, miosis). The format of “flipped” instruction was also introduced in the academic year 2019–2020. Students were given short videos (typically less than 20 min) as an introduction to the anatomy of particular regions prior to lecture/discussion. In class, the content of the videos was discussed within the context of clinical conditions. For example, the basic anatomy of the spinal cord was introduced in video, including discussion of the sensory input and motor output pathways and extended to a description of the anatomy and normal functions reflex pathways. In class, the symptom of hyperreflexia formed the basis for extending learning to an understanding of the need for descending and local modulation of reflexes. Thus, the symptomatology of the clinical condition (spinal cord injury) formed the basis for discussion of structure and function, even when the order was arranged as basic science followed by clinical context. However, some content was de‐emphasized or eliminated to adapt to the time compression. The retention of rote memorization is strongly dependent upon the duration of study (Dempster, 1988). As instruction in Lower Extremity was limited to 6 days in 2019 (Figure 1), rote memorization was decreased and students were required to learn only muscle actions and innervations, not their origins and insertions (although identification of individual muscles on cadavers was still required on practical examinations). Knowledge of some origins/insertions was required when clinically or conceptually relevant: for example, in discussion of the symptoms of Gluteal Gait, the origin of the Gluteus medius muscle (lateral aspect of the ilium) was stressed as that contributed to understanding of the muscle action in shifting support of body load. In Head and Neck, students were no longer required to memorize all the branches of the maxillary artery, only those directly contributing to major clinical conditions (such as the middle [and accessory] meningeal arteries to epidural hematoma). However, instruction did include identification of the major branches of the external and internal carotid arteries which were tested on radiological images. Most laboratory dissection sessions have been maintained during this period, augmented by a multimodal approach utilizing prosections and radiological images (Figure 3b). Students still performed extensive dissections of the Lower Extremity, with required identification of major structures on practical examinations. In Head and Neck, full dissections of the cranial cavity (preserving the brainstem), orbit, neck, and oral and nasal cavities (head bisection) were performed by students. However, other regions were studied only in prosection (e.g., joints of lower extremity, infratemporal fossa, suboccipital region) as dissection of these structures is more complex and time consuming. Study of prosections was facilitated by distribution of labeled photographic images of specimens although knowledge was tested in practical exams on the original specimens.

Multiyear data base for evaluating performance

A multiyear data base of student exam scores was used to assess performance although long term comparisons were complicated by the introduction of other courses that were taught and tested simultaneously (gross anatomy was still a major focus). Figure 4a shows mean scores and standard deviations (n = 55 exams taken by 592 students) of gross anatomy written examinations (in all body region) and comprehensive final examinations from 2005 to 2011 (before the last curriculum revision). The written examinations specifically tested only knowledge in gross anatomy in specific body regions. Figure 4b shows mean scores on block examinations (for the regions of Lower Extremity and Head and Neck) that included other subjects after curriculum revision. Student performance on those examinations remained high and did not differ from the mean scores in which gross anatomy was specifically tested by body region (p = .45, Students t test). Figure 4c is a plot of the mean scores on block examinations in Lower Extremity, Head and Neck, in which pedagogy was changed, and in other body regions (Upper Extremity, Thorax), in which teaching format remained constant, for the years 2010 and 2017–19. Comparison of data within this period showed that the mean scores for exams in the years 2013–2016 were Lower Extremity 89.7 ± 2.2 SD, Head and Neck 88.7 ± 2.2 SD, Upper Extremity 89.3 ± 3.6 SD, Thorax 83.0 ± 3.0 SD while scores in the period of 2017–2019 were Lower Extremity 90.6 ± 3.0 SD, Head and Neck 92.0 ± 0.9 SD, Upper Extremity 86.2 ± 2.0, Thorax 78.9 ± 3.4 SD (Note: data on the Thorax exam for 2017 was not available). Scores in Thorax and Upper Extremity showed a small decline over this period (although variations in teaching of other disciplines may have occurred) while performance Lower Extremity was unchanged and scores in Head and Neck improved numerically. Figure 4d is a plot of the mean scores on practical examinations in the areas of Lower Extremity and Head and Neck for the years 2005–2019 (n = 37 examinations, total = 1,178 students). Mean scores on laboratory exams showed numerical improvement during the period of 2014–2019.
FIGURE 4

Extended database for evaluation of student performance—(a) Data base for long term evaluation. In the period of 2005–2012, examinations in gross anatomy were given at the end of study of each body region and comprehensive final exams were given at the end of study of gross anatomy. Histogram of mean scores (+SD) of students (n = 592 students) indicate that performance was high and relatively constant. (b) In the period after curriculum revision (2013–2019), students were given block examinations in which gross anatomy was a major component but other courses were tested simultaneously. Student performance on block exams (n = 586 students) remained high. (c) Plot comparing block exam scores in different body regions for years 2013–2014 and 2018–2019. Scores in Head and Neck and Lower Extremity remained relatively constant while scores in other body regions (upper extremity, thorax), which did not change pedagogical format, declined numerically. (d) Scores on gross anatomy practical exams (Lower Extremity, Head Neck combined) throughout the period of 2005–2019 (total = 1,178 students) [Color figure can be viewed at wileyonlinelibrary.com]

Extended database for evaluation of student performance—(a) Data base for long term evaluation. In the period of 2005–2012, examinations in gross anatomy were given at the end of study of each body region and comprehensive final exams were given at the end of study of gross anatomy. Histogram of mean scores (+SD) of students (n = 592 students) indicate that performance was high and relatively constant. (b) In the period after curriculum revision (2013–2019), students were given block examinations in which gross anatomy was a major component but other courses were tested simultaneously. Student performance on block exams (n = 586 students) remained high. (c) Plot comparing block exam scores in different body regions for years 2013–2014 and 2018–2019. Scores in Head and Neck and Lower Extremity remained relatively constant while scores in other body regions (upper extremity, thorax), which did not change pedagogical format, declined numerically. (d) Scores on gross anatomy practical exams (Lower Extremity, Head Neck combined) throughout the period of 2005–2019 (total = 1,178 students) [Color figure can be viewed at wileyonlinelibrary.com]

Analysis of repeat questions

Specific data were obtained by evaluation of performance on a subset of 34 questions in gross anatomy in the areas of Lower Extremity and Head and Neck that were asked repetitively in successive years. The subject and correct answers of all questions remained constant but minor modifications in some years were made the to the text, format or distractor answers. Figure 5a is a plot of the mean scores on 17 questions in Lower Extremity anatomy that were asked repeatedly from 2012 to 2019 (8 examinations taken by n = 643 students). Figure 5b is a similar plot for questions in Head and Neck Anatomy (16 examinations taken by 606 students) over the same years. Scores on all repeat questions remained high and numerically increased in the years 2016–2,109. Comparison of pooled data from 2016/2017 and 2018/2019 showed a small, statistically significant increase in performance (Wilcoxon Signed Rank Test, p ≤ .01).
FIGURE 5

Analysis of repeat questions in gross anatomy—(a) Histogram of mean performance on questions in Lower Extremity anatomy (n = 17) asked repeatedly (or with minor modification) over an 8 year period (2012–2019; n = 643 students), including the curriculum revision in 2013. (b) Similar histograms of performance on questions (n = 17) in Head and Neck Anatomy over the same period (n = 606 students). (c) Types of repeat questions—This histogram plots mean scores on different types of questions (data from both anatomical body regions). Performance remained high regardless of the question type or body region. (d) Flipped lectures were introduced in 2019 in several areas. This histogram plots student performance on repeat questions (n = 6) in areas of spinal cord structure and reflex function for the flipped lectures (2019) and for the same questions in previous years (total n = 490 students): performance was statistically unchanged [Color figure can be viewed at wileyonlinelibrary.com]

Analysis of repeat questions in gross anatomy—(a) Histogram of mean performance on questions in Lower Extremity anatomy (n = 17) asked repeatedly (or with minor modification) over an 8 year period (2012–2019; n = 643 students), including the curriculum revision in 2013. (b) Similar histograms of performance on questions (n = 17) in Head and Neck Anatomy over the same period (n = 606 students). (c) Types of repeat questions—This histogram plots mean scores on different types of questions (data from both anatomical body regions). Performance remained high regardless of the question type or body region. (d) Flipped lectures were introduced in 2019 in several areas. This histogram plots student performance on repeat questions (n = 6) in areas of spinal cord structure and reflex function for the flipped lectures (2019) and for the same questions in previous years (total n = 490 students): performance was statistically unchanged [Color figure can be viewed at wileyonlinelibrary.com] Student performance was also analyzed according to the format of the repeat questions (Basic Science or Clinical Vignette, see Section 2). Figure 5c shows plots of student performance on the types of questions from 2012 to 2019 (same data set as Figure 5a,b). There were no statistically significant differences in the mean scores (Vignette mean 91.5 ± 2.2 SD; Basic Science 93.3 ± 1.8 SD) or in comparison of individual questions over this period. In addition, changes in scores were calculated for successive years after their introduction to test whether students had prior knowledge of question repetition (Wood, 2009). The mean changes in scores of all 34 questions in the first 2 years was −1.5 ± 6.2 SD and only gradually increased as mean scores rose in successive years. Thus, students apparently had no prior knowledge of question repetition.

Effects of instructional modality

The effects of “Flipped” lectures were also evaluated by using “repeat” questions on written examinations. Figure 5d is a plot of student performance on questions (n = 6) in areas of nervous system anatomy and reflex functions for the flipped lectures (2019) and for the same questions in previous years (2014–2018) in which the material was presented by conventional lectures (total n = 490 students): The mean performance was slightly lower for the “flipped” lecture year when compared to conventional lectures (mean flipped 92.2 ± 5.7 SD; conventional 94.8 ± 4.9 SD) although this difference was not statistically significant. Mean scores on all “flipped” lecture questions were similar to the mean of all other repeat questions (91.8 ± 1.99 SD). Thus, student scores showed no correlation with pedagogical format.

Quantification of student performance

As student performance was uniformly high, we examined the biserial point values of repeat questions (calculated in ExamSoft or by our school's Office of Medical Education) to assess their effect in detecting students with academic difficulties. The mean point biserial value for repeat questions in the areas of Lower Extremity and Head and Neck given in the years 2012–2019 was (0.287 ± 0.074). Figure 6a is a plot of the mean biserial value (+SD averaged over) for each questions versus the mean student score on those questions. The point biserial values showed large variations and regression analysis showed only a weak correlation with the student scores (r 2 = .28).
FIGURE 6

Quantification of repeat questions and survey of students—(a) Biserial values—This graph plots the mean point biserial values (+SD) for repeat questions in both the areas of Lower Extremity and Head and Neck. The point biserial values showed considerable variability and were only weakly correlated with the mean performance (r 2 = .28). (b) Bimodal distribution of exam scores—This histogram plots the distribution on practical exams scores (mean Lower Extremity and Head and Neck, years 2017–2019). The distribution is bimodal and indicates that, despite the high mean scores, these examinations can still be used to identify students having difficulty in gross anatomy. (c) Survey of students—These histograms show responses of students to a survey (in Likert format, 5 strongly agree, 1 strongly disagree) on the positive effects of clinical/anatomical format (left), evaluating preparation for the USMLE Step 1 examination (middle) and effectiveness in reducing stress in learning (right) [Color figure can be viewed at wileyonlinelibrary.com]

Quantification of repeat questions and survey of students—(a) Biserial values—This graph plots the mean point biserial values (+SD) for repeat questions in both the areas of Lower Extremity and Head and Neck. The point biserial values showed considerable variability and were only weakly correlated with the mean performance (r 2 = .28). (b) Bimodal distribution of exam scores—This histogram plots the distribution on practical exams scores (mean Lower Extremity and Head and Neck, years 2017–2019). The distribution is bimodal and indicates that, despite the high mean scores, these examinations can still be used to identify students having difficulty in gross anatomy. (c) Survey of students—These histograms show responses of students to a survey (in Likert format, 5 strongly agree, 1 strongly disagree) on the positive effects of clinical/anatomical format (left), evaluating preparation for the USMLE Step 1 examination (middle) and effectiveness in reducing stress in learning (right) [Color figure can be viewed at wileyonlinelibrary.com]

Identifying students needing assistance in learning

Previous studies have indicated that the scores on examinations in the first year of medical school can be used as a measure to identify students needing assistance in learning (Gullo et al., 2015). Study of student performance in gross anatomy indicated that scores on practical examinations could fulfill that function. Figure 6b is a plot of the number of students and the ranges of scores for practical examinations in Lower Extremity and Head and Neck for the years 2017–2019 (n = 242 students in 9 examinations). This plot shows a discrete group of students who did not achieve passing grades and the distribution of exam scores was strongly bimodal in some years. These data indicate that, despite the high performance levels, a subset of students needing additional assistance in learning can still be identified.

Survey of effects of curricular changes on learning and stress

To gain an initial measure of students' views of the effectiveness of the curricular changes, a survey was distributed to students by email with questions related to (a) beneficial effects of the format of data presented as Clinical Symptom/Anatomy charts, (b) potential benefits of the format in preparation for the USMLE Step 1 examinations and (c) the overall effects on student stress. A histogram of student responses is shown in Figure 6c. The responses of students were quite positive about the curricular format: 90.2% of students strongly agreed/agreed that the clinical format of anatomy reduced stress (mean score 4.61 ± 0.97 SD of 5) and 95.1% felt that the format facilitated their preparation for the USMLE Step 1 examination (mean score 4.76 ± 0.54 SD).

DISCUSSION

The goal of the pedagogical changes described in this article was to adapt teaching of gross anatomy/embryology to a compressed duration of instruction while still preparing students for licensing examinations and subsequent training in medicine. Two new approaches were utilized in the pedagogy and its evaluation: first, instruction was based upon correlating the symptoms associated with a core of clinical conditions with their structural and functional foundation in anatomy; second, an extended data base (2005–2019) was developed to evaluate these changes, based upon on examinations that included specific questions asked repetitively over a number of years (2012–2019), spanning the curricular revisions. The effects of these changes in instructional method and evaluation are discussed below.

Instruction based upon symptoms, not clinical cases

Instruction in anatomy was reorganized and based upon the symptoms of clinical conditions rather than full case studies (Fan, Jiang, Shi, Wang, & Li, 2018; Neville, 2009). The conditions were specifically associated with or directly resulted from changes in anatomy at a macroscopic level. Instruction based upon symptoms was a useful way to “parse out” learning, in a number of ways: (a) this method permitted development of a basic vocabulary and knowledge of the specific definitions of terms was stressed and reiterated in written and lecture material and in discussions; (b) the causes of selected clinical conditions could be immediately related to anatomy and provided a context for learning the structural organization of the body; (c) analysis of symptoms that reflect pathological or developmental changes permitted direct discussion of the relationship between structure and function. The change in instructional method represents a shift in emphasis rather than a novel approach as study of clinical symptoms is a component of many other pedagogies (Morton & Colbert‐Getz, 2017; Neville, 2009). In addition, a number of schools have organized instruction within blocks based upon disease processes, although the utilization of clinical symptoms as a primary pedagogical method has not been explicitly stated (Halliday et al., 2015; Rizzolo et al., 2010; White et al., 2014).

Advantage: Learning the vocabulary of medicine and reduction of student stress

Limiting discussion to the anatomical basis of symptoms had important advantages over pedagogies based upon complete clinical problems. The effectiveness and efficiency of problem‐based approaches depend upon the background and prior training of the student, which have become increasingly diverse. Anatomy is a language that is essential in developing the vocabulary of clinical medicine (Al‐Jamal, 2018; Bowen & ten Cate, 2017). Clinical terminology is highly specific and widely utilized because, as noted by Bowen and ten Cate (2017) it is efficient in describing complex concepts with combinations of features and provides uniformity in communications among physicians. However, the medical lexicon (“talking like a physician,” Bowen & ten Cate, 2017, p. 49) is often only acquired in medical school while vocabulary instruction is rarely discussed in the medical education literature. For many students, medical terminology is effectively a “foreign” language and, in teaching at a primary or undergraduate level, language instruction typically begins by establishing and defining a vocabulary in simple contexts and then building up to complex thoughts (Larson, Dixon, & Townsend, 2013). In the present study, lectures often utilized a “word of the day” (e.g., hernia) and precisely defined its meaning (displacement of a structure from its normal position) (Kearney & Levine, 2019). This basic but essential element is often neglected in other formats, such as “flipped” lectures or independent learning. Omitting vocabulary instruction produces considerable disparity between students that have prior training and students who are stressed by the requirement of looking up and learning many terms, particularly if the volume of material is increased while the duration of learning is decreased. It is also important to note that an inherent limitation in multiple‐choice examinations (such as the USMLE Step 1) is that the questions provide all the words rather than relying upon the students to choose and assemble those words. Insufficient understanding of the meanings of words may not be assessed but could be manifested in clinical choices later in training and practice (Hołda et al., 2019). In addition, repetition of presentation of clinical content (repeated case studies) does not insure comprehension of vocabulary. The duration of older, semester‐length courses in Anatomy was effective in providing the repetition needed for language acquisition. The survey that we distributed indicated that students perceive these changes as effectively preparing them for the LCME Step 1 examination. Furthermore, although responses were more diverse, a majority of students indicated that the format adopted reduced stress in learning. We plan to repeat and extend this survey in future years but, even within the limitations of parametric analysis of the Likert format (Sullivan & Artino, 2013), these data provide support for the idea that a Clinical/Anatomical approach and reduction of detail in initial learning can mitigate the deleterious effects of student stress (Dyrbye et al., 2019).

Disadvantage: Elimination/reduction of content in anatomy

The decrease in duration of instruction presented a considerable obstacle to students' retaining the details of body structure. The retention of rote memorization has long been known to be strongly dependent upon the duration of study and spacing of instruction (Delaney, Godbole, Holden, & Chang, 2018; Dempster, 1988; Larsen, 2018; Morin et al., 2019). To adapt to the decreased duration of instruction (6 days for Lower Extremity) learning of muscle anatomy was reduced (only actions/innervations, not origins/insertions). Although this had an immediate benefit in decreasing the informational load, it proved to be a “two‐edged sword,” as students reported difficulty of initial identification of muscles in laboratory dissection, which is based upon muscle origin/insertion as well as position. Also, learning of origins and insertions was still required for some muscles when that provided insight into the symptoms of clinical syndromes (e.g., Gluteal gait). However, knowledge of muscle actions and innervations still provided understanding of the relationship between muscle structure/function and also facilitated understanding of syndromes of nerve damage (Ghosh, 2017; Miller, Perrotti, Silverthorn, Dalley, & Rarey, 2002). Detail in content was also decreased in teaching of the circulatory system by requiring learning only arterial branches of major clinical relevance (ex only meningeal branches of the maxillary artery). However, identification of all branches was still required in laboratory dissection and in the interpretation and testing of arterial angiograms. It is important to note that, concomitant with the decline in anatomy teaching, there has been a dramatic increase in the use and informational content of imaging techniques that require a detailed knowledge of three‐dimensional body structure, although questions based upon images of anatomy have apparently been reduced in the USMLE Step 1 examination. The reduction of content in anatomy may clearly be an impediment to a number of aspects of medical education, particularly training in specialties in which knowledge of anatomical detail is essential (e.g., orthopedics, surgery, neurosurgery, radiology). This could be resolved through supplementary instruction in more advanced courses in Anatomy introduced later into the curricula (Blunt & Blizard, 1975). This was not reflected in test scores in the present study, as questions related to deleted content were also eliminated in assessment.

Analysis of performance using an extended baseline and “repeat” questions

The multiyear database of student exam scores indicated that student performance in gross anatomy remained high even after curriculum changes that increased the number of courses taught simultaneously. The development of long‐term databases is important as errors in analysis of test scores may occur when performance is evaluated over a limited period (Gullo, 2016). Many older studies examined data over single or two year periods (e.g., Minhas, Ghosh, & Swanzy, 2012; Zhang, Fenderson, Schmidt, & Veloski, 2013). The differences in performance may not only reflect changes in pedagogy but other, more complex parameters that are only apparent when data are evaluated over longer time periods (e.g., Nieder & Borges, 2012). This is particularly important in comparing some forms of assessment, such as practical examinations, that are inherently variable as they often use student dissections as test material (discussed by Yaqinuddin, Zafar, Ikram, & Ganguly, 2013). These errors may not be apparent but may be detected if data are acquired and evaluated over longer time periods. More specific analysis was possible using the subset of repeat questions (Joncas et al., 2018). As the goal was to optimize and assess establishment of a basic foundation of gross anatomy and the medical vocabulary, no changes were made to make these questions more discriminative. Analysis of test scores after introduction of the use of repeat questions showed small variations in performance that paralleled the scores on nonrepeat questions, indicating that students had no prior knowledge of question repetition (Wood, 2009). Changes were made in the text and distractors for some questions to insure their reliability (Deepak, Al‐Umran, AI‐Sheikh, Dkoli, & Al‐Rubaish, 2015) but no consistent changes in performance were found after these changes. Consistently high performance was also evident in student scores on practical examinations which can test some aspects of anatomical knowledge more directly than written multiple choice tests (Ghosh, 2017; Schuwirth & van der Vleuten, 2004). In addition, we did not attempt to correlate these results with subsequent performance on USMLE Step 1 examinations as those tests are limited in their assessment of knowledge of anatomy and do not include item analysis of topics that could be specifically related to our curricular changes. However, the sustained performance of students in repeat questions and practical examinations supports the conclusion that the pedagogical changes were effective in establishing a base of knowledge of anatomy even in a decrease period of instruction.

Multimodal pedagogies

An important element in sustaining student performance in gross anatomy in a compressed duration was the use of multimodal pedagogies to reiterate terminology and increase the depth of knowledge of body structure. The benefit of use of diverse methods of instruction has now been demonstrated in a wide variety of studies (Estai & Bunt, 2016; Hu, Wattchow, & de Fontgalland, 2018; Klement et al., 2017; Johnson et al., 2012; Moro, Smith, & Stromberga, 2019). In the present study, both “flipped” and conventional lectures presented the same anatomical structures repeatedly in photographs and diagrams, as well as radiographic (X‐ray), radiological (CT) and MRI images. In addition, “active” learning regularly was utilized as many of the same structures were then dissected in laboratory sessions, often on the same day (Ghosh, 2017). Laboratory sessions were essential in developing appreciation for anatomical variation. Spacing of repetition was also aided by subsequent small group laboratory review sessions that occurred prior to each examination. These multimodal methods facilitated learning by students with diverse backgrounds, as evidence by the high performance on both written and practical examinations. Student feedback in course evaluations regularly cited these approaches as making subjects “easier to understand.” In contrast, use of single pedagogies such as problem‐solving or team based learning approaches have potential benefits in developing student cognition but at the expense of increasing stress (White et al., 2014). While changes in methods of assessment have been indicated to improve student well‐being (Bloodgood, Short, Jackson, & Martindale, 2009; Wilkinson, 2011) there have been few proposals for changes in pedagogy to reduce stress (D'Eon, 2014; Lyndon et al., 2017). The present study suggests that establishing and directly assessing a vocabulary and base of anatomical knowledge can facilitate learning that can be applied to more complex clinical problems, with a concomitant and overall decrease in stress.

Early identification of students needing assistance in learning

A goal of this study was to examine whether performance in gross anatomy could be used to identify students needing assistance in learning. Analysis of examination scores showed that, despite high overall performance levels, it was possible to consistently identify a discrete group that had not achieved a passing grade. The specific causes of difficulties encountered by some students are unclear but may reflect their academic background and the lack prior training in Anatomy, which can impede acquisition of steric knowledge and formation of an internalized image of body structure needed for interpretation of radiographs, CT and MRI images (Houser & Kondrashov, 2018; Wish‐Baratz, Gubatina, Enterline, & Griswold, 2019; Zill, 2016). Our school has developed methods for identification of students needing assistance in learning and the present results suggest that analysis of early performance in gross anatomy may be of value in that process (Gullo et al., 2015).

Limitations of this study

“Ceiling Effect”—Student scores on recent gross anatomy tests were high (increasing from averages in the high 80s to low 90s) and potentially subject to a “ceiling effect” (Taylor, 2012), limiting their applicability in discriminating performance among students who achieve a passing grade (Cramer & Howitt, 2005). Ranking student performance was not our goal as our method has been to establish a core of knowledge and assess performance through questions that are criterion‐referenced (evaluating a specific body of anatomical knowledge as a fixed criterion) rather than norm referenced (assessing differences in abilities among test‐takers) (Glaser, 1963). Due to the changes in our pedagogy, examinations have essentially become screening or threshold tests rather than rating tools, with the goal of aiding students in establishing a base of knowledge of the anatomical basis of clinical medicine which could then be expanded in further training (Shepard, 2018). While scores prior to 2016 showed normal distributions, analysis of repeat questions required use of nonparametric tests that do not require normal distributions (e.g., Wilcoxon test) to demonstrate that performance was consistent. In addition, a significant but small percentage of students did not pass these tests, indicating that some individuals found them challenging. We plan to re‐evaluate the repeat questions (Boone, 2016) to increase our ability to identify students needing assistance in learning by increasing the number of clinical conditions studied and by introducing other measures of assessment, such as grading laboratory dissections (Nwachukwu, Lachman, & Pawlina, 2015). Use of multiple metrics of assessment has been suggested as a method increasing the reliability of pass/fail examinations (Möltner, Tımbıl, & Jünger, 2015) and has also been discussed as applicable to the consequences of the proposed changes in the USMLE Step 1 examination to a pass/fail format (Swails, Aibana, & Stoll, 2019). Limitations of observational studies—In addition, this article was an observational study done without a control group (Rosenbaum, 2009), due mainly to constraints in the number of anatomy faculty. Although this study has shown that student performance was consistently high and did not decrease through curricular changes that compressed the duration of gross anatomy instruction, we did not demonstrate a direct correlation between the modifications of pedagogy and test scores, as the teaching methods were applied to all students. The accuracy of measurement of changes in knowledge of anatomy was aided by the use of repeat questions over an eight‐year period, which eliminated variations due to question format, distractors, etc. and supported by the finding that comparable changes occurred in both written and practical examinations (McKillip, 1992). We believe that the consistency (not improvement) in those data support the idea that instruction in gross anatomy was effective despite the compression of duration. In addition, comparison of block exam scores in Head and Neck and Lower Extremity with assessment of teaching of anatomy in other regions, in which pedagogy was not comparably changed, supports the idea that methods we used were beneficial. However, further confirmation is needed and studies are planned to assess the effects of integration by other methods, such as tests of retention of anatomical knowledge in students at more advanced stages of undergraduate training. Last, we also plan to repeat and expand the survey on student stress to confirm the effects of curricular changes on student well‐being and to correlate the effects with specific pedagogical methods. In sum, we believe our data support the idea that pedagogical changes can be effective in teaching gross anatomy in a limited time frame and these methods may benefit others confronting the changes in anatomy instruction in medical education.
  76 in total

1.  Education. Farewell, lecture?

Authors:  Eric Mazur
Journal:  Science       Date:  2009-01-02       Impact factor: 47.728

2.  Calls for reform of medical education by the Carnegie Foundation for the Advancement of Teaching: 1910 and 2010.

Authors:  David M Irby; Molly Cooke; Bridget C O'Brien
Journal:  Acad Med       Date:  2010-02       Impact factor: 6.893

3.  A change to pass/fail grading in the first two years at one medical school results in improved psychological well-being.

Authors:  Robert A Bloodgood; Jerry G Short; John M Jackson; James R Martindale
Journal:  Acad Med       Date:  2009-05       Impact factor: 6.893

4.  Preparing to take the USMLE Step 1: a survey on medical students' self-reported study habits.

Authors:  Andre D Kumar; Monisha K Shah; Jason H Maley; Joshua Evron; Alex Gyftopoulos; Chad Miller
Journal:  Postgrad Med J       Date:  2015-04-24       Impact factor: 2.401

5.  The Conundrum of the United States Medical Licensing Examination Score Reporting Structure.

Authors:  Jennifer L Swails; Omowunmi Aibana; Barbara J Stoll
Journal:  JAMA       Date:  2019-08-20       Impact factor: 56.272

6.  An anatomy workshop for improving anatomy self-efficacy and competency when transitioning into a problem-based learning, Doctor of Physical Therapy program.

Authors:  Mona Bains; Debora Z Kaliski
Journal:  Adv Physiol Educ       Date:  2020-03-01       Impact factor: 2.288

7.  Alarming decline in recognition of anatomical structures amongst medical students and physicians.

Authors:  Mateusz K Hołda; Tomasz Stefura; Mateusz Koziej; Oksana Skomarovska; Katarzyna A Jasińska; Wojciech Sałabun; Wiesława Klimek-Piotrowska
Journal:  Ann Anat       Date:  2018-09-20       Impact factor: 2.698

8.  The reliability of the pass/fail decision for assessments comprised of multiple components.

Authors:  Andreas Möltner; Sevgi Tımbıl; Jana Jünger
Journal:  GMS Z Med Ausbild       Date:  2015-10-15

9.  Human structure in six and one-half weeks: one approach to providing foundational anatomical competency in an era of compressed medical school anatomy curricula.

Authors:  Nancy Halliday; Daniel O'Donoghue; Kathryn E Klump; Britta Thompson
Journal:  Anat Sci Educ       Date:  2014-07-03       Impact factor: 5.958

10.  Is medical education hazardous to your health?

Authors:  Marcel D'Eon
Journal:  Can Med Educ J       Date:  2014-12-17
View more
  1 in total

1.  Rethinking gross anatomy in a compressed time frame: Clinical symptoms, not case studies, as the basis for introductory instruction.

Authors:  Sasha N Zill
Journal:  Clin Anat       Date:  2020-07-20       Impact factor: 2.414

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.