Fatemeh Keshmiri1, Sari Ponzer2, AmirAli Sohrabpour3, Shervin Farahmand4, Farhad Shahi5, Shahram Bagheri-Hariri6, Kamran Soltani-Arabshahi7, Mandana Shirazi8, Italo Masiello9. 1. PhD Candidate in Medical Education, Faculty of Medicine, Tehran University of Medical Sciences, Tehran, Iran. keshmiri_f@razi.tums.ac.ir. 2. Professor of Department of Clinical Science and Education, Södersjukhuset, Karolinska Institutet, Stockholm, Sweden. sari.ponzer@ki.se. 3. Assistant Professor, Department of Internal Medicine, Tehran University of Medical Sciences, Tehran, Iran, aasohrabpour@tums.ac.ir. 4. Associate Professor, Department of Emergency Medicine, Imam Khomeini Hospital, Tehran University of Medical Sciences, Tehran, Iran. shfarahmand@tums.ac.ir. 5. Assistant Professor, Department of Internal Medicine, Oncology Department, Tehran University of Medical Sciences, Tehran, Iran. fshahi@yahoo.com. 6. Assistant Professor, Department of Emergency Medicine, Imam Khomeini Hospital, Tehran University of Medical Sciences, Tehran, Iran. hariri.shahram@gmail.com. 7. Professor, Department of Internal Medicine, Department of Medical Education, Iran University of Medical Sciences, Tehran, Iran. soltarab34@gmail.com. 8. Post-doctoral Researcher, Department of Clinical Science and Education, Södersjukhuset, Karolinska Institute, Stockholm, Sweden, & Associate Professor, Educational Development Center, Department of Medical Education, Tehran University of Medical Sciences, Tehran, Iran. mandana.shirazi@ki.se. 9. Associate Professor, Department of Clinical Science and Education, Södersjukhuset,, Karolinska Institute, Stockholm, Sweden. italo.Masiello@ki.se.
Abstract
BACKGROUND: Simulation can be used for educating, evaluating and assessing psychometric properties of an instrument. The aim of this study was to contextualize and assess the validity and reliability of the Interprofessional Collaborative Assessment tool (ICAR) in an Iranian context using simulation. METHODS: In this descriptive study, contextualization of the ICAR was assessed through several steps. Firstly, validity assessment was approved through expert panels and Delphi rounds. Secondly, reliability assessment was done by arranging a simulation video and assessing reproducibility, test-retest (ICC), internal consistency (Cronbach's Alpha) and inter-rater reliability (Kappa).The participants included 26 experts, 27 students and 6 staff of the Standardized Simulation Office of Teheran University of Medical Sciences. RESULTS: Contextualization and validity of the ICAR were approved in an Iranian context. The reliability of the tool was computed to be 0.71 according to Cronbach´s Alpha. The test-retest was calculated to be 0.76. CONCLUSION: The Iranian ICAR can be a useful tool for evaluating interprofessional collaborative competencies. The development of the instrument through a simulation scenario has been a positive prospect for researchers.
BACKGROUND: Simulation can be used for educating, evaluating and assessing psychometric properties of an instrument. The aim of this study was to contextualize and assess the validity and reliability of the Interprofessional Collaborative Assessment tool (ICAR) in an Iranian context using simulation. METHODS: In this descriptive study, contextualization of the ICAR was assessed through several steps. Firstly, validity assessment was approved through expert panels and Delphi rounds. Secondly, reliability assessment was done by arranging a simulation video and assessing reproducibility, test-retest (ICC), internal consistency (Cronbach's Alpha) and inter-rater reliability (Kappa).The participants included 26 experts, 27 students and 6 staff of the Standardized Simulation Office of Teheran University of Medical Sciences. RESULTS: Contextualization and validity of the ICAR were approved in an Iranian context. The reliability of the tool was computed to be 0.71 according to Cronbach´s Alpha. The test-retest was calculated to be 0.76. CONCLUSION: The Iranian ICAR can be a useful tool for evaluating interprofessional collaborative competencies. The development of the instrument through a simulation scenario has been a positive prospect for researchers.
Learning to work interprofessionally is fundamental to patient safety and a patient-centered approach (1). According to the World Health Organization’s definition, interprofessional education (IPE) is the process by which a group of students or health care providers of different professions learn with, from and about each other (1).In IPE programs, a competency-based approach is often used to assess the achievements and quality of interprofessional work (2-3). The components of a competency-based assessment instrument should include objectives of assessment and its guidelines (4). Some instruments, designed to assess interprofessional competences, have been developed and validated (5-8). In his study, Oates states that Interprofessional Collaborator Assessment Rubric (ICAR) can be used to assess knowledge/skills and behavioral acquisition (8). The aim of this study was to validate the ICAR (6,9-10) as an interprofessional performance assessment tool in an Iranian context.The properties of the ICAR can be favorable to examiners, instructors and learners because they can increase the quality of instruction, and their functionality as a guideline with respect to the learning expectations and fostering critical reflection on performance (6). However, to assess interprofessional competencies, the researcher has to assess teamwork skills and communication in interaction with other professional members. This remains to be difficult since those assessed have to show effective collaboration and information exchange (11,12). Hence, several issues can affect the assessment, validity and reliability of a tool. Therefore, the quality assurance of a tool should be meticulously run and adapted to different contexts.The validation process is usually adapted in a real educational context (6), and it seldom is applied in a simulation setting (13-17). Considering the challenges of assessing the reliability of interprofessional collaboration (10,13-14), the aim of this study was to contextualize and assess the validity of the ICAR in an Iranian context, and estimate the reliability of the ICAR reproducibility, internal consistency and inter-rater reliability (6) through simulation.
Methods
In this descriptive study, the ICAR was contextualized and validated from December 2011 to January 2013 (Fig. 1).The original instrument, which is based on a 4-point Likert scale, includes 31 items within six domains: Communication, Collaboration, Roles and Responsibilities, Collaborative Patient/Client-Family Centered Approach, Team Functioning and Conflict Resolution/Management domains. Scores indicate how often specific interprofessional competencies occur; i.e., score of 1 indicates minimally, score of 2 indicates developing (occasionally), score of 3 designates competent (frequently) and score of 4 represents mastery (consistently) (6).
Study Area
This study was conducted at Tehran University of Medical Sciences in Iran, a developing country in the Middle East. In Iran, medical education systems are centralized and medical curriculums are developed by Ministry of Health. However, interprofessional education programs are not considered in the curriculum. Based on the authors’ Knowledge, interprofessional education and collaboration are not considered in the curriculum of Middle Eastern counties. In Iran, running IPE is not an easy task due to the cultural boundaries to the doctor-center discipline. Running IPE in Iran needs some preparation of context; therefore, this study, which contextualized tools and developed educational materials, may be used as a foundation for establishing new IPE system in Iran. This study was conducted to prepare the infrastructure of education and evaluate interprofessional education in Iran.
Participants
The total number of participants in this study during the contextualization and validation phases were 59; of whom, 26 were experts, 27 students, two simulation educators, two standardized patients and two were film producers. In this study, experts were individuals who had at least 10 years of experience in clinical education, non-technical skills training (such as communication and teamwork) and were familiar with IPE.The process is described in Figure 1.
Fig. 1
Contextualization and Validation of ICAR in Iranian Context: Process
Study Design
Contextualization of the instrument: The process was based on the “Toolkit on Translating and Adapting Instruments” (18). The instrument was adapted to the Iranian context through the following process:Validity Assessment of the ICAR, Part 1: English to Farsi: The expert panel confirmed the external validity of the tool following the translation from English to Farsi and through applying the guidelines of the Human Sciences Research Institute (HSRI) (18). Content and face validity were assessed through two Delphi rounds. In the first round, the translated tool was sent to the experts (response rate was 80%). The results of the first round were gathered after seven days. Then, the experts’ opinions were analyzed through the content analysis approach. Two items were suggested to be added in the domains of collaboration and challenge management, which were related to problem solving and providing feedback.The suggested items were approved in the second Delphi round. The agreement between experts was higher than 90%, and no further suggestions were made. Hence, in the final version, the total number of items was 33 within six domains. The total maximum score was 99, and the results were reviewed and confirmed by an expert panel in the Iranian context.Simulated Scenario Development and Video Production: The reliability of the translated instrument was confirmed through a simulation scenario:The simulated scenario was developed based on the objectives of the ICAR and the core competencies for interprofessional collaborative practice (3,6,19). The scenario was about a case of multiple traumas in an emergency ward with emphasis on interprofessional collaborative competency in the health care team. The scenario was developed during 18 sessions (36 hours).The expert panel members confirmed the validity of the scenario to increase the scenario reality.The Standardized Simulation Office at TUMS conducted the simulated team training sessions to produce a video. The simulation training sessions were conducted and all the simulated team members exercised their role- play. Simulation educators watched their final exercise, and confirmed quality of their role- play.The simulation situation of the emergency ward was conducted at TUMS skill lab.The simulated scenario was recorded and a video was produced. The duration of the video was 30 minutes, and its production took 12 hours of teamwork.The validity of the video and authenticity of the simulation were approved in terms of interprofessional collaboration skills.Reliability Assessment of the ICAR: In this study, the reliability of the tool was assessed by performing reproducibility (test-retest), internal consistency and inter-rater reliability analyses. To do so, two rater-training sessions were provided. In the first session, raters gained familiarity with the reliability assessment process, the ICAR and its guideline. Then, the raters watched the video produced in the earlier phase and completed the Iranians’ ICAR. Afterwards, they received feedback from the main investigator of the research (M SH).The test-retest approach was done by the raters filling out the ICAR with one-week interval between the first and second occasions (20). Internal consistency was calculated per domain through Cronbach’s Alpha. The agreement between the 23 raters and experts’ score was assessed through correlation (21).Inter-rater reliability was approved by assessing the correlation between four expert raters after watching the video and applying non- parametric analysis tests (Kappa coefficient) (22,23).External Validity Assessment, Part 2: Farsi to English: After demonstrating the validity and reliability of the ICAR in an Iranian context, the tool was back translated separately from Farsi to English. The translated version was sent to the main developer, Professor Curran, who confirmed the consistency of the final ICAR. The reproducibility of the ICAR was assessed by ICC analysis, the most appropriate and commonly utilized test for assessing parametric data (21,24,25).In this study, the test-retest approach was measured by the ICC, internal consistency was measured by Cronbach’s Alpha and inter-rater reliability was assessed by Kappa coefficient. The agreement between the raters and experts’ scores was assessed using descriptive statistical analysis. SPSS Version 16 was used for data analysis.
Ethical Considerations
The Ethical Committee of Teheran University of Medical Sciences (ID: 91-01-30-17052) approved thisstudy. Prof. Curran gave permission to use the ICAR instrument. Inform consent was taken for all the raters who participated in this study.
Results
Demographic Characteristics of the Study Participants
The mean±SD age of the expert panel members was 43.5±6.5yrs., and their mean±SD work experience was 10±8.5yrs. In this study, 50% (n=6) of the participants of Delphi rounds were men. The participants' mean±SD age was 41.5±8.5 yrs., and their mean±SD of work experience was 12±2.5yrs. The mean (Mean±(SD)) age of the video production committee was 37.5±9.6yrs., and their mean±SD of work experience was 10±8.5yrs. The mean±SD age of the four raters in inter-rater reliability assessment was 33.2±6.9yrs. The majority of the raters in reproducibility assessment were female (52%) and their mean±SD age was 28.4±2yrs.
Validation of the ICAR
The content and face validity of the Iranian ICAR was approved through two Delphi rounds. The validation of scenario and video production was approved through consensus.
Psychometric Properties
The internal consistency according to Cronbach´s Alpha was 0.71, and the test–retest reliability was 0.76 (Table 1).
Table 1
Psychometric Properties of the Inter-professional Collaborator Assessment Rubric (ICAR)
ICAR domains
Test- Retest:ICC
Internal Consistency:Cronbach`s Alpha
Agreement between raters’ scores (23) and experts’ score (%)
Communication
0.73
0.65
75.12
Roles and responsibilities
0.80
0.71
72.65
Collaboration
0.75
0.74
84.34
Collaborative patient centered approach
0.75
0.71
77.17
Team function
0.83
0.75
74.78
Conflict management
0.75
0.75
67.80
ICAR Domains
0.76
0.71
75.31
The highest test-retest reliability was in the team function domain (ICC=0.83), and the lowest in the domain of communication (ICC=0.73).The agreement between the raters and the experts’ score was calculated to be 67.8 and 84.3(Table 1).The inter-rater reliability was calculated by kappa coefficient and turned out to be K=0.7 (Table 2).
Table 2
Inter-rater Reliability between the Raters
Raters
Test
1 vs 2
Kappa=0.7P<0.001
1 vs 3
Kappa=0.71P<0.001
1vs 4
Kappa=0.72P<0.001
2 vs 3
Kappa=0.72P<0.001
2 vs 4
Kappa=0.71P<0.001
3 vs 4
Kappa=0.78P<0.001
1,2 ,3 and 4
Intra class coefficient )ICC(=0.87P<0.001
Discussion
Measuring interprofessional collaboration competencies is still a challenge; therefore, applying a valid and reliable instrument remains a significant issue. The findings of this study confirmed the innovative validation process by endorsing the reliability of the ICAR within a simulated setting. In this study, several professionals were engaged in the research to be in line with the assumption of multidisciplinary attribution of IPE.In our study, the tools' validity was strongly agreed upon by applying the accepted HSRI’s guideline (18), Delphi rounds, an expert panel, and receiving feedback from the instrument’s main author (6), which was in line with the research procedure of other studies (26,27). We followed the research process by Curran et al. (6) and applied two rounds to compile the instrument in a rigorous fashion just as other researchers have done (28).We used simulation to assess the psychometric properties (reliability) of the instrument in the interprofessional education domain. The controlled setting allowed the researchers to assess the interprofessional skills by controlling for confounding factors and possible Hawthorne effect that may occur in a real setting. In their study, Hayward et al. (10) assessed the reliability of another interprofessional performance tool in a real context, but despite the extended observation, they were not able to assess all the domains of their instrument. In this study, to avoid the challenges, reproducibility was assessed within a simulated situation. Shirazi et al. (29) conducted a research to evaluate the reliability of standardized patients (SP) and to assess communication skills as opposed to standard raters. Despite the limited number of assessors, which was a limitation, the researchers found an acceptable correlation among all raters (29). In this study, to increase the statistical power and obviatepotential biases, the number of raters was increased to 23.The highest internal consistency was in the domain of team function and challenge management. The lowest internal consistency was computed in the communication domain. This could be interpreted as an inconsistency between items because of different issues such as ‘respect’, which are more subjective and more dependent on raters’ perceptions and experience. However, in accordance with this latest finding, studies from Shyne et al. (30) and Najafi et al. (31) found similar low internal consistency with the Team STEPPS Teamwork Attitudes Questionnaire (T-TAQ) and SDOT, respectively. In contrast with our findings, Hayward et al. study (10) found internal consistency of communication to have the highest domain. This may be due to its use of a multi-source feedback (MSF) process to assess collaboration competencies (10).The highest agreement between the 23 raters and the experts’ scores was in the field of collaboration (84.3%), which could be related to the raters’ familiarity with these concepts and gaining experience in the group work. Moreover, the items’ construct in this domain consisted of simplicity, clarity and relevance of items for the raters. The lowest accordance was in the domain of conflict management (67.8%). These findings may be due to the possible limited involvement of the 4th-year medical students in health care management issues or lack of training. However, this information could not be confirmed.The correlation between experts’ rating was approved by non-parametric statistical tests. The Kappa coefficient (32) demonstrated propriety agreement among each pair raters and between all raters’ (four) scores. In line with our study, findings from Schmitz demonstrated acceptable or highly acceptable coefficients (33). However, in contrast to our findings, Boulet et al. found a low correlation among their raters (0.09 and 0.29), which can be due to the lack of training or an assessment guide in their study (34). In this study, to inhibit the possible low inter-rater reliability among the raters, the research group compiled a guideline for reliability assessment and conducted training sessions.
Limitations
The validity assessment of an interprofessional assessment tool in a real setting is a difficult task as there are many confounders. Moreover, In Iran, we do not have interprofessional wards where we could run such studies. Therefore, in this study, we used a video of a simulated setting to assess the ICAR psychometric properties. Although this was a psychometric study, one of the limitations was the reproducibility assessment of the instrument by medical students due to the restriction in the implementation of the interprofessional meetings. Limited generalizability across institutions, countries and even within the setting of an institution (i.e., beyond the trauma team) might be a limitation.
Conclusion
Assessing IPE is a complex task due to the difficult task of assessing team competencies. This study confirmed the validity and reliability of the modified ICAR to evaluate interprofessional collaborative competencies in an Iranian context, which was done using a novel process of using a simulated scenario. IPE in an Iranian context is a new endeavor and we recommend conducting more research on this topic.
Acknowledgements
Sincere thanks go to Professor Curran for his comments and kind contribution and also to the medical students at TUMS who took part in this research.
Authors: Philip Shayne; Fiona Gallahue; Stephan Rinnert; Craig L Anderson; Gene Hern; Eric Katz Journal: Acad Emerg Med Date: 2006-04-24 Impact factor: 3.451
Authors: John R Boulet; Marta van Zanten; André de Champlain; Richard E Hawkins; Steven J Peitzman Journal: Adv Health Sci Educ Theory Pract Date: 2006-07-27 Impact factor: 3.853
Authors: S Gorter; J J Rethans; A Scherpbier; D van der Heijde; H Houben; C van der Vleuten; S van der Linden Journal: Acad Med Date: 2000-11 Impact factor: 6.893
Authors: Melanie C Wright; Noa Segall; Gene Hobbs; Barbara Phillips-Bute; Laura Maynard; Jeffrey M Taekman Journal: Simul Healthc Date: 2013-10 Impact factor: 1.929