Literature DB >> 31750362

Combination of different clinical reasoning tests in a national exam.

Anahita Sadeghi¹, Ali Ali Asgari¹, Nezarali Moulaei², Vahid Mohammadkarimi³, Somayeh Delavari⁴, Mitra Amini⁵, Setareh Nasiri⁶, Roghayeh Akbari⁷, Mojgan Sanjari⁸, Iraj Sedighi⁹, Parisa Khoshnevisasl¹⁰, Manouchehr Khoshbaten¹¹, Saeed Safari¹², Leily Mohajerzadeh¹³, Parisa Nabeiei⁵, Bernard Charlin¹⁴.

Abstract

INTRODUCTION: Clinical reasoning as a critical and high level of clinical competency should be acquired during medical education, and medical educators should attempt to assess this ability in medical students. Nowadays, there are several ways to evaluate medical students' clinical reasoning ability in different countries worldwide. There are some well-known clinical reasoning tests such as Key Feature (KF), Clinical Reasoning Problem (CRP), Script Concordance Test (SCT), and Comprehensive Integrative Puzzle (CIP). Each of these tests has its advantages and disadvantages. In this study, we evaluated the reliability of combination of clinical reasoning tests SCT, KF, CIP, and CRP in one national exam and the correlation between the subtest scores of these tests together with the total score of the exam.
METHODS: In this cross sectional study, a total number of 339 high ranked medical students from 60 medical schools in Iran participated in a national exam named "Medical Olympiad". The ninth Medical Olympiad was held in Shahid Beheshti University of Medical Sciences, Tehran, Iran, under the direct supervision of the Ministry of Health and Medical Education in summer 2017. The expert group designed a combination of four types of clinical reasoning tests to assess both analytical and non-analytical clinical reasoning. Mean scores of SCT, CRP, KF, and CIP were measured using descriptive statistics. Reliability was calculated for each test and the combination of tests using Cronbach's alpha. Spearman's correlation coefficient was used to evaluate the correlation between the score of each subtest and the total score. SPSS version 21 was used for data analysis and the level of significance was considered <0.05.
RESULTS: The reliability of the combination of tests was 0.815. The reliability of KF was 0.81 and 0.76, 0.80, and 0.92 for SCT, CRP, and CIP, respectively. The mean total score was 169.921±41.54 from 240. All correlations between each clinical reasoning test and total score were significant (P<0.001). The highest correlation (0.887) was seen between CIP score and total score.
CONCLUSION: The study showed that combining different clinical reasoning tests can be a reliable way of measuring this ability. Copyright: © Shiraz University of Medical Sciences.

Entities: Chemical

Keywords: Medical assessment ; Medical students; Education

Year: 2019 PMID： 31750362 PMCID： PMC6820014 DOI： 10.30476/jamp.2019.83101.1083

Source DB: PubMed Journal: J Adv Med Educ Prof ISSN： 2322-2220

Introduction

Clinical reasoning is the method by which clinicians collect information about the patients’ problems and develop a plan to solve the problem and manage the patients (1-3). Clinical reasoning is a critical and high level clinical competency, and it should be acquired during medical education, and medical educators should attempt to teach this ability to medical students (4,5). Although special medical care is provided by specialists, most of the general medical care is performed by general practitioners; Then, improving the ability of clinical reasoning and decision-making can play a major role in reduction of the incidence of adverse events in the clinical care and the promotion of health indicators (6). Assessing this ability is also important; different clinical reasoning tests developed for measuring this ability include Key Features (KF), Script Concordance Test (SCT), Comprehensive Integrative Puzzle (CIP), and Clinical Reasoning Problem (CRP) (4). One of the tests used for assessing clinical reasoning is clinical reasoning problems (CRP). In this test, a scenario is presented, and students should choose the two diagnoses they consider most likely for the scenario; they should also mention the features of the case that are important for the correct diagnosis and indicate whether these features positively or negatively predict each diagnosis (7-10). KF test was introduced by Bordage and Page (10); its questions focused on critical steps in clinical problem solving and may pertain to aspects that learners generally find difficult or that are necessary for the patients’ management. Its focus is on the key features of diseases (4). Several studies investigated the reliability, generalizability, construct, and predictive validity of the KF test (11). SCT was introduced by Bernard Charlin in 2000 (12,13). In SCT, a short patient scenario ensues with three questions. Every scenario has three columns, which include diagnostic hypothesis, new clinical finding, and scale from −2 to +2 (4,12,13). Scoring in this test is according to an expert panel answering the question (4,12). Another clinical reasoning test is the CIP Test that introduced Case and Swanson for assessing clinical reasoning in routine situations (14,15). . This test was developed according to illness script theory (16). CIP is a kind of pattern recognition test such that its format is similar to extended matching test (14,17). Each of the clinical reasoning tests measures a different aspect in the clinical reasoning domain. Sometimes, it seems necessary to measure all abilities by using all the above-mentioned tests. In this study, we evaluated the reliability of combining various clinical reasoning tests (SCT, KF, CIP, and CRP) in one national exam. We also evaluated the correlation between these tests and the total score of the exam.

Methods

In this cross sectional study, a total number of 339 high rank medical students from 60 medical schools participated in a national exam named Medical Olympiad. The ninth Medical Olympiad was held in Tehran, Iran, in Shahid Beheshti University of Medical Sciences under the direct supervision of the Ministry of Health and Medical Education in summer 2017. An expert group with members from all Iranian medical schools developed a question bank of specialized clinical reasoning test items in Internal medicine, General Surgery, Pediatrics, Obstetrics, and Gynecology. The expert group designed a combination of four types of clinical reasoning tests to assess the clinical reasoning based on the methodology described in former articles (8,10,13,16,18-22). Fifteen experts (the expert panel) in the fields of Internal medicine, Surgery, Pediatric, Obstetrics, and Gynecologists were asked to answer the tests without using textbooks or consulting with each other. Then, each answer was weighted according to their scores. In each KF test, a case was described and subsequently followed by 16 questions. Students would choose four correct answers for each KF test. The answers were weighted by the expert panel’s (answers to the same questions). To increase the discriminating power, we calculated the partial credit score (18,23-27). To measure clinical data interpretation in ill-defined cases, we presented the clinical situations as vignette cases that did not include all the data necessary to provide a diagnosis. Then, a series of related items with different formats (diagnosis, investigation, or treatment) in three parts were designed. The first part included a clinical scenario or hypothesis, the second part succinctly gave more information (clinical or paraclinical data) that might have positive or negative effects on the first, and the third part was a 5-point Likert scale type. SCT was introduced by Bernard Charlin in 2000 (12,13). In it, a short patient scenario follows three questions. Every scenario has three columns, which include diagnostic hypothesis, new clinical finding, and scale from −2 to +2 (1,12,13). Scoring in this test was according to the expert panel’s answers to the question (1,12,15). The CRP tests were scored based on the binary method in which the correct answer had one score, and the wrong answer had zero score. By using the summative method, the total score (sum of the scores of questions) was calculated (9). In this test, the answers were not weighted, and a combination of items in four parts (patient's history, physical examination, paramedic(s) and treatment) was considered as the correct answer. CIP scores were calculated from the answers given by the reference panel. For each of the four columns, four correct responses out of 4 (4/4) questions were graded as 100%, 3/4 as 75%, 2/4 as 50%, and 1/4 was graded as 0%, respectively. The grade of the CIP exam was determined by the sum of grades (15). The total exam score obtained by the summation of four test scores; each test accounted for 25 percent of the total score. The total score was 240, and each test score was 60. Data analysis was performed using SPSS software version 21. Mean scores of SCT, CRP, KF, and CIP were expressed using descriptive statistics. Reliability was calculated using Cronbach's alpha. Spearman's correlation coefficient was used to evaluate the correlation between the score of each clinical reasoning test and the total score. Student t-test was used to measure the difference between the scores of male and female students. The significance level was considered <0.05.

Results

Reliability was calculated through Cronbach's alpha for the combination of tests, which was 0.815. The reliability of KF was 0.81, and this measure was 0.76 for SCT, 0.80 for CRP, and 0.92 for CIP. 56% of the participants were female (190), while 149 (44 %) people were male. The mean total score was 169.92±41.54 from 240, and the total mean scores of female and male students were 164.87±42.85 and 176.353±9.01, respectively. Table 1 shows clinical reasoning subsets scores according to gender. Male students obtained better total scores than female students, and this difference was statistically significant (p<0.001).

Table 1

Clinical reasoning total and subset scores according to gender

Sex	Mean±SD Total score (from240)	Mean±SD KF (from 60)	Mean±SD SCT (from 60)	Mean±SD CRP (from 60)	Mean±SD CIP (from 60)	N
Female	164.87 ±42.85	37.39 ±7.93	18.77 ± 5.87	50.80± 17.65	57.90± 17.38	190
Male	176.35 ±39.01	38.76 7.43	20.89 ±4.64	55.67±16.65	61.03 ±16.05	149
Total	169.92± 41.54	37.99 ±7.73	19.70 ±5.46	52.94 ±17.36	59.28 ±16.86	339

Clinical reasoning total and subset scores according to gender Table 2 shows the correlation between each of the clinical reasoning and total score. According to this Table, all of the correlations were significant (P=0.001). The highest correlation (0.887) was between the CIP score and total score, and the lowest correlation (0.473) was between CRP and SCT.

Table 2

Correlation of clinical reasoning tests with each other and with the total score

	Variable	KF	SCT	CRP	CIP
Spearman's rho	KF	1	0.545**	0.675**	0.640**
	SCT	0.545**	1	0.473**	0.523**
	CRP	0.675**	0.473**	1	0.647**
	CIP	0.640**	0.523**	0.647**	1
	Total Score	0.803**	0.631**	0.885**	0.887**

Correlation is significant at the 0.01 level (2-tailed).

Correlation of clinical reasoning tests with each other and with the total score Correlation is significant at the 0.01 level (2-tailed).

Discussion

Clinical reasoning is a cognitive process for the diagnosis of the patient problem, and it has an important role in clinical problem solving and patient management (1,2,19). Good assessment methods for measuring this ability are important to determine weak points in this field and try to improve this ability (4). There are various methods for assessing this ability, including PT, KF, SCT, and CRP (19,20). Clinical reasoning assessment methods addressed the underlying construct of the clinical reasoning process and focused on specific sub-tasks, such as data gathering, activating diagnostic hypotheses, and prioritizing diagnostic alternatives (19). This study was designed and implemented to evaluate the reliability of the combination test of clinical reasoning (SCT, KF, PT, and CRP), and the correlation between the score of each test and with the total score. The reliability of combination of these four tests was excellent and equal to 0.815. Amini et al. in their study used four methods of clinical reasoning (KF, SCT, CRP, and CIP) and reported the reliability 0.91 for a combination of the tests (15). Khsohbaten et al. in their study used four methods of clinical reasoning (KF, Scenario, CRP, and puzzle); the combined reliability was equal to 0.86 (21). KF reliability in this article was 0.81; previous studies reported the different range of reliability for the KF: 0.75-0.95 (1,15,20,22-24). A good reliability range for SCT was 0.7 to 0.8 (25-27); the present study showed the reliability of 0.76 for SCT. CRP reliability in this article was 0.81; previous studies reported a different range of reliability for the CRP: 0.72 (9), 0.61 (8), 0.83 (7). However, acceptable reliability for CIP was 0.43-0.73 with a mean of 0.60 (28). The reliability of CIP in the Amini et al.’s study was 0.91 (15); the reliability of CIP in the present study was similar to Amini et al.’s (15) results and was equal to 0.92. In this study, the highest and lowest mean scores were related to CIP and SCT, respectively. These results are contrary to Amini et al.’s study that the highest and lowest mean scores were related to KF and CRP, respectively (15). In Groves et al.’s study, there was a clear difference between the CRP and SCT means (8). The evaluation of the relationship between gender and the scores of the sub-tests and the total score showed that male participants in all four subsets had a higher score than the female participants; the highest and lowest difference was related to CRP and SCT subset, respectively. In this study, we investigated each clinical reasoning test and the total score; the correlation between each of the clinical reasoning tests and the total score was positive. SCT examined the degree of coherence between the panel judgment of experts and student responses (1,29), and it was suitable for assessing the depth and breadth of the students’ knowledge. Therefore, it focuses on the structure and organization of the knowledge base (12). The main emphasis was on assessing the students' ability to evaluate the diagnostic assumptions by providing new information (13); although CRP and SCT assess data interpretation, clinical reasoning assessment using CRP test provided a more comprehensive vignette by assessing individual ability in the generation of diagnostic hypothesis and information synthesis, as well as data interpretation (8). On this basis, we expected the correlation between the SCT and CRP to be low; the result of the present study verified this assumption. Therefore, the lowest correlation was related to the correlation between CRP and SCT. The philosophy of developing the KF test is solving a clinical problem (15). The results of this study showed that the correlation between each clinical reasoning test and the total score was positive, and it was above 0.473. The highest correlation between the subsets was related to that between CRP and KF. CIP was developed based on illness script theory (16), and it is a kind of pattern recognition (14,17). The scoring system for CIP is not dependent on experts’ panel judgment and each puzzle has definite answers. Therefore, we expected the high correlation between CIP and other subsets. Results of the present study showed that positive and high correlation existed between CIP and each of the other clinical reasoning tests (CRP, SCT, KF). We expected a high correlation between the total score and all of the clinical reasoning tests that confirmed our expectation by results. The findings of this study and other studies that used combination methods suggested that using different and complementary methods of assessing clinical reasoning provided a more detailed and qualitative evaluation than either clinical reasoning assessment method alone (8) The strength of the present study is that the study is a national study that was done by the participation of all medical students from all over the country. The limitation of the present study was the fact that data were gathered from top students that may not be a good representative of all medical students.

Conclusion

The combination of different clinical reasoning tests will be a reliable method for measuring clinical reasoning ability in high stake examinations. The use of combination of these clinical reasoning tests is recommended for high stake examinations in medical schools.

24 in total

1. The Script Concordance test: a tool to assess the reflective clinician.

Authors: B Charlin; L Roy; C Brailovsky; F Goulet; C van der Vleuten
Journal: Teach Learn Med Date: 2000 Impact factor: 2.414

2. The CIP (comprehensive integrative puzzle) assessment method.

Authors: Rosalie Ber
Journal: Med Teach Date: 2003-03 Impact factor: 3.650

3. An innovative method to assess clinical reasoning skills: Clinical reasoning tests in the second national medical science Olympiad in Iran.

Authors: Mitra Amini; Mohsen Moghadami; Javad Kojuri; Hamidreza Abbasi; Ali Arhami Doolat Abadi; Nezar Ali Molaee; Elham Pishbin; Hamid Reza Javadzade; Vahid Monsef Kasmaee; Hasan Vakili; Mohamad Ali Reis Sadat; Roghaye Akbari; Bita Omidvar; Afshin Shafaghi; Marzie Dehbozorgian; Mohammad Morad Jafari; Alireza Monajemi; Kamran Soltani Arabshahi; Peyman Adibi; Bernard Charlin
Journal: BMC Res Notes Date: 2011-10-17

4. Reliability and validity of key feature cases for the self-assessment of colon and rectal surgeons.

Authors: Judith L Trudel; Georges Bordage; Steven M Downing
Journal: Ann Surg Date: 2008-08 Impact factor: 12.969

5. Effects of a supplementary final year curriculum on students' clinical reasoning skills as assessed by key-feature examination.

Authors: C Nikendei; S Mennin; P Weyrich; B Kraus; S Zipfel; M Schrauth; J Jünger
Journal: Med Teach Date: 2009-09 Impact factor: 3.650

6. Electronic assessment of clinical reasoning in clerkships: A mixed-methods comparison of long-menu key-feature problems with context-rich single best answer questions.

Authors: Sören Huwendiek; Friedrich Reichert; Cecilia Duncker; Bas A de Leng; Cees P M van der Vleuten; Arno M M Muijtjens; Hans-Martin Bosse; Martin Haag; Georg F Hoffmann; Burkhard Tönshoff; Diana Dolmans
Journal: Med Teach Date: 2017-03-10 Impact factor: 3.650

Review 7. Key-feature questions for assessment of clinical reasoning: a literature review.

Authors: Patricia Hrynchak; Susan Glover Takahashi; Marla Nayer
Journal: Med Educ Date: 2014-09 Impact factor: 6.251

8. The clinical integrative puzzle for teaching and assessing clinical reasoning: preliminary feasibility, reliability, and validity evidence.

Authors: Vincent F Capaldi; Steven J Durning; Louis N Pangaro; Rosalie Ber
Journal: Mil Med Date: 2015-04 Impact factor: 1.437

9. The Medical Council of Canada's key features project: a more valid written examination of clinical decision-making skills.

Authors: G Page; G Bordage
Journal: Acad Med Date: 1995-02 Impact factor: 6.893

10. The effect of short-term workshop on improving clinical reasoning skill of medical students.

Authors: Parsa Yousefichaijan; Farshad Jafari; Manijeh Kahbazi; Mohammad Rafiei; AbdolGhader Pakniyat
Journal: Med J Islam Repub Iran Date: 2016-07-12

4 in total

1. Teaching clinical reasoning to undergraduate medical students by illness script method: a randomized controlled trial.

Authors: Mana Moghadami; Mitra Amini; Mohsen Moghadami; Bhavin Dalal; Bernard Charlin
Journal: BMC Med Educ Date: 2021-02-02 Impact factor: 2.463

2. The Relationship between Experiences Level and Clinical Decision-Making Skill in Midwifery Students: A Cross-Sectional Study.

Authors: Somayeh Delavari; Kamran Soltani Arabshahi; Mitra Amini; Maryam Aalaa; Ghadir Pourbairamian; Niloofar Bahoosh; Nasrin Asadi; Bhavin Dalal; Javad Kojuri; Hadi Hamidi; Sajad Delavari
Journal: Med J Islam Repub Iran Date: 2022-07-18

3. How to develop clinical reasoning in medical students and interns based on illness script theory: An experimental study.

Authors: Somayeh Delavari; Alireza Monajemi; Hamid Reza Baradaran; Phyo Kyaw Myint; Minoo Yaghmaei; Seyed Kamran Soltani Arabshahi
Journal: Med J Islam Repub Iran Date: 2020-02-20

Review 4. A scoping review of clinical reasoning research with Asian healthcare professionals.

Authors: Ching-Yi Lee; Chang-Chyi Jenq; Madawa Chandratilake; Julie Chen; Mi-Mi Chen; Hiroshi Nishigori; Gohar Wajid; Pai-Hsuang Yang; Muhamad Saiful Bahri Yusoff; Lynn Monrouxe
Journal: Adv Health Sci Educ Theory Pract Date: 2021-07-12 Impact factor: 3.853

4 in total