| Literature DB >> 31435411 |
Abstract
OBJECTIVES: Multiple Mini Interviews (MMI) have been conducted across the globe in the student selection process, particularly in health profession education. This paper reported the validity evidence of MMI in various educational settings.Entities:
Keywords: Admission method; Health profession education; Higher education; Multiple mini interview; Validity
Year: 2019 PMID: 31435411 PMCID: PMC6695046 DOI: 10.1016/j.jtumed.2019.03.006
Source DB: PubMed Journal: J Taibah Univ Med Sci ISSN: 1658-3612
Figure 1Study flow chart.
The content of MMI stations.
| Sources, university, country | Number of MMI station | Number of days taken | Times required per station | Competencies assessed |
|---|---|---|---|---|
| Eva K. W. et al., | 10 | 4 | 8 min (iv), 2 min (break) | Critical thinking, ethical decision making, communication skills, knowledge of healthcare system |
| Eva K. W. et al., | 10 | – | 8 min | Not available |
| Eva K. W. et al., | 9 | 2 | 8 min (iv), 2 min (break) | Scholarship-critical thinking, healthcare advocacy, professionalism-ethical decision making, collaboration |
| Moreau K. et al., | 12 | 2 | 8 min (iv), 2 min (break) | Communication, collaboration, professional ethics |
| Reiter H. I. et al., | Study 1 : 9 | – | – | Not available |
| Brownell K et al., | 10 | 2 | – | Compassion and Empathy, Honesty and Integrity, Ability To Tolerate Ambiguity, Reflective, Respect For Others |
| Harris S. and Owen C., | 10 | – | 5.5 min | Giving instruction, taking instruction, emotional communication, problem solving, resilience & maturity, enthusiasm for medicine, ethics, awareness of common issues in medicine |
| Lemay J.F. et al., | 10 | 2 | 8 min (iv), 2 min (break) | Advocacy, ambiguity, collegiality & collaboration, cultural sensitivity, empathy, ethics, honesty & integrity, responsibility & reliability, self-assessment |
| Reiter H. I. et al., | 8 | – | 8 min | Not available |
| Hofmeister M. et al., | 10 | – | 8 min (iv), 2 min (break) | Teamwork, honesty, ability to accept feedback about one's self, ability to accept self-limitations, caring & compassion, responsibility taking, time management, the ability to accept professional limitations, cultural sensitivity, motivation for family medicine, goal setting |
| Humphrey S. et al., | 3 | – | 5 min (iv), 1 min (break) | Not available |
| Roberts C. et al., | 8 | 8 | 7 min (iv), 2 min (break) | Not available |
| Rosenfeld J. M. et al., | 12 | 2 | 8 min (iv), 2 min (break) | Discussion, interpersonal skills, cooperation |
| Eva K. W. et al., | 9 | – | 10 min | Not available |
| Hofmeister M. et al., | 12 | 1 | 8 min (iv), 2 min (break) | Family medicine attributes, teamwork, disclosure of error, ethical behavior, ability to accept feedback, ability to accept self-limitations, caring, taking responsibility, time management, ability to accept professional-limitations, cultural sensitivity, motivation & goal-setting, ability to handle specific situation |
| Kumar K et al., | 8 | 8 | 7 min (iv), 2 min (break) | Not available |
| Razack S. et al., | 10 | – | 8 min (iv), 2 min (break) | Professionalism, empathy, communication skills |
| Roberts C. et al., | 8 | 9 | 7 min | Not available |
| Dore K. L. et al., | 7 | – | 8 min (iv), 2 min (break) | CanMEDs competencies: Medical expert, Communicator, Collaborator, Manager, Health advocate, Scholar, Professional |
| Kulasegaram K. et al., | 12 | – | 8 min | Communication, collaboration, critical thinking, ethics, personal statement, understanding of the health care system |
| Roberts C. et al., | 8 | – | – | Not available |
| Jones, P. E. & Forister, J. G., | 8 | – | 7 min | Boundary recognition, responsibility, honesty, integrity, professionalism |
| O'Brien A. et al., | 4 | 2 | 5 min | Presentation skills, work experience, ethical thinking, professionalism |
| Uijtdehaage S. et al., | Study 1 : 13 (1 rest station) | Study 1 : 3 days | 8 min (iv), 2 min (break) | Commitment to & experience with underserved populations, cultural sensitivity, leadership potential, maturity, being an effective team member |
| Yen W. et al., | 8 | – | – | Communication skills, taking responsibility on own action, ethical decision making, inter-professional collaboration, problem solving skills, reflective practice, time & resource management skills, resolves conflict |
| Cameron A. J. and MacKeigan L. D., | 10 | – | 8 min (5 station), 6 min (5 station) | Commitment to care, critical thinking, problem solving & creativity, ethical reasoning & integrity, interpersonal skills, motivation, communication skills, self-awareness, team player |
| J. Dowell et al., | 4 (2007) | – | – | Interpersonal skills & communication (including empathy), logical reasoning & critical thinking, moral & ethical reasoning, motivation & preparation to study medicine, teamwork & leadership, honesty & integrity |
| Eva K. W. et al., | 12 | – | 8 min (iv), 2 min (break) | Ethical issues, communication, collaborative tasks |
| Griffin B. and Wilson I., | 9 | – | 8 min | Verbal communication, empathy, motivation to study medicine |
| Jerant A. et al., | 10 | – | 8 min (iv), 2 min (break) | Integrity/ethics, professionalism, interpersonal communication, diversity/cultural, awareness, teamwork, ability to handle stress, problem solving |
| McAndrew R. and Ellis J., | 10 | 5 | 5 min (iv), 2 min (break) | Dentistry as a career, logic, reasoning, ethics & plagiarism, manual dexterity, breaking bad news, research & data interpretation |
| Fraga J. D. et al., | 6 (5 MMI; 1 TI) | – | 8 min (MMI), 20 (TI), | Professionalism, communication skills, critical thinking, ethical behavior, tolerance for uncertainty, teamwork |
| Husbands A. & Dowell J., | 10 | – | – | Interpersonal skills & communication (including empathy), logical reasoning & critical thinking, moral & ethical reasoning, motivation & preparation to study medicine, teamwork & leadership, honesty & integrity |
| McAndrew R and Ellis J, | 10 | – | – | Not available |
| Perkins A. et al., | 5 | – | 5 + 5 min (1st station), | NMC standard of competence: professional values, communication & interpersonal skills, nursing practice& decision making, & leadership, management, team working |
| Raghavan M et al., | 11 | – | 8 min (iv), 2 min (break) | Not available |
| Says F. E. et al., | 6 | – | 8–10 min | Personal, professionalism, motivation, moral & bioethics, teamwork & communication skills, behaviors |
| Tavares W. and Mausz J., | 10 | 2 | 8 min (iv), 2 min (break) | Self-awareness, responsibility, communication, ethical & moral judgment, teamwork, conflict resolution, problem solving, critical thinking, management skills |
| Till H et al., | 10 | 10 | – | Interpersonal skills & communication (including empathy); logical reasoning & critical thinking; moral and ethical reasoning; motivation & preparation to study medicine; teamwork & leadership; honesty & integrity |
| Tiller D et al., | 9 (iMMI) | 4 | 7 min (iv), 2 min (break) | Not available |
| Ahmed A. et al., | 8 | 2 | 8 min (iv), 2 min (break) | Responsibility, ethical & moral judgment, communication skills, management skills, problem solving, self-awareness, teamwork, conflict resolution |
| Andrades M. et al., | 8 | – | 7 min | Safe doctor, communication skills, professionalism, problem solving, team approach, ethical issues, reasons for selecting family medicine, commitment to the program |
| Barbour M. E. et al., | 10 | 7 | 5 min (iv), 2 min (break) | Communication skills, teamwork, work experience, community contributions, reason to study at bristol, reason to study in dentistry, numeracy & data-handling, ethics, research in dentistry, dexterity, professionalism |
| Callwood A. et al., | 8 | – | 5 | Motivation to become a midwife, awareness of midwifery philosophy and the role of the midwife, respect for difference & diversity, honesty & integrity, kindness, compassion & empathy, intellectual curiosity & reflective nature, advocacy, respect for privacy & dignity, initiative, problem solving & teamwork |
| Eva K. W. and Macala C., | 12 | – | 8 min (iv), 2 min (1st station), | CanMEDs competencies: medical expert, communicator, collaborator, manager, health advocate, scholar, professional |
| Hissbach J. C. et al., | 12 (2009) | 1 | 5 min (iv), 1.5 min (break) | Empathy, communication skills, self-regulation |
| Hopson L. R. et al., | 8 | – | – | Adaptability, hardworking problem solving communication skills, teamwork, altruism ethical, aware of issues facing medicine, compassionate, drive to excel |
| Kelly M. et al., | 10 | 2 | 7 min | Irish medical council's eight domains of professional practice: patient safety and quality of patient care, communication & interpersonal skills, collaboration & teamwork, management (including self-management), scholarship, professionalism, clinical skills, relating to patients |
| Kelly M. E. et al., | 10 | – | 7 min | Not available |
| Liao SC. et al., | 7 | – | – | Empathy; respect for life; crisis management; initiative; insightfulness; integrity, communication skills |
| Oliver T. et al., | 8 | – | 10 min | Ethical & moral, interpersonal, intrapersonal, professional |
| Oyler D. R. et al., | 4 | – | 7 min | Critical thinking, teamwork, ethical reasoning & integrity, communication & interpersonal skills |
| Roberts C. et al., | 6 | – | 8 min (iv), 2 min (break) | Vocation/motivation, communication, organisation/personal management, personal attributes |
| Sebok S. S. et al., | 8 | – | – | Communication, critical thinking, maturity, effectiveness, empathy, professionalism, resolution, integrity |
| Stowe C. D. et al., | 5 (pilot) | 1 | 8 min (iv), 2 min (break) | Pilot: critical-thinking, rapport/empathy, ethics/professionalism, knowledge of pharmacy, personal attributes |
| 3 (full implementation) | 1 | |||
| Alweis R. L. et al., | 6 (5 MMI; 1 TI) | – | 8 min (MMI), 16 min (TI), | Professionalism, team player, constructive response to stress, capacity for self-reflection, capacity for empathy, adaptability/tolerance of uncertainty, and the ability to incorporate feedback |
| Burkhardt J. C. et al., | 8 | – | – | Patient care, medical knowledge, diagnostic skills, communication skills, procedural skills, professionalism |
| Cox W. C. et al., | 7 | 3 | 6 min (iv), 2 min (break) | Integrity, adaptability, empathy, critical thinking, teamwork |
| Sebok S. S and Syer M. D., | 8 | – | 8 min (iv), 2 min (break) | Communication, critical thinking, maturity, effectiveness, empathy, professionalism, resolution, integrity |
| Yoshimura H. et al., | 5 | 3 | 10 min (5 min for PBQ; | Patient care and procedural skills, practice-based learning & improvement, interpersonal & communication skills, professionalism |
| Abdul Rahim & Yusoff, 2016 | 5 | 3 | 5 min (with judges) and 2 min (preparation) | Language proficiency, general conduct, critical thinking, ethical awareness, communication skills, knowledge of health care system, standard interview question |
Data synthesis of the selected studies.
| Source and year | Study design, Location, field of study, Number candidate | Objective & instruments | Results | Validity evidence |
|---|---|---|---|---|
| Eva K. W. et al., | Pilot study design McMaster University (Canada) N = 117 Medical School | Development of an innovative admission protocol (MMI) that is intended to take advantage of this lesson in the context of student admission | Both applicants and examiners were positive about the experience and the potential for this protocol The reliability of the MMI was observed to be 0.65. | Response process Internal structure |
| Eva K. W. et al., | Cohort study design* McMaster University (Canada) Medical School N = 45 | Assess the validity of this new admission protocol by examining the relationship between pre-clerkship performance, the MMI and the traditional admission protocol used by the Undergraduate MD program at McMaster University Evaluation tools used: OSCE, PPI | MMI was the best predictor of OSCE performance and GPA was the most consistent predictor of performance on multiple-choice question examinations of medical knowledge. | Relation to other variable |
| Eva K. W. et al., | Experimental study design* McMaster University (Canada) N = 54 Health Sciences Faculty | Assess the consistency of ratings assigned by health sciences faculty members relative to community members during an innovative admissions protocol called the MMI | Overall test reliability was found to be 0.78 Study suggested that admissions committees should distribute their resources by increasing the number of interviews to which candidates are exposed rather than increasing the number of interviewers within each interview | Internal structure |
| Moreau K. et al., | Observational Study Design* McMaster University (Canada) N = 12 applicants (5 aboriginal, 7 non-aboriginal) Medical School | Evaluate whether any suggestion of bias existed in application of the MMI in its assessment of aboriginal medical school applicants. | Recommended that MMI stations be vetted by aboriginally sensitive personnel, but neither aboriginal-specific rater training nor aboriginal rater assignment is required Interviewer type and interviewee type were both non-significant contributors to the scores observed, Overall reliability of this 12-station MMI was 0.70 | Response process Internal structure |
| Reiter H. I. et al., | Case Study Design* McMaster University (Canada) N= Study 1 : 57 Medical applicant (Study 1 and Study 2); Rehabilitation Sciences applicant (Study 3) | Determine the impact of security violations on perceived competence levels in 3 different studies (low-stakes in research study, high-stakes in admission, high-stakes in dual application) | Overall test generalizability of this 9-station MMI with 2 examiners per station was 0.78 Overall test-retest reliability of this 12-station MMI with 1 examiner was 0.70. Mean performance on exposed station equaled 4.92 (SD = 1.36), mean performance to which applicants were naive equaled to 4.94 (SD = 0.65): revealed no effect of prior exposure. Overall test-retest reliability of this 7-station MMI with 1 examiner per station was 0.70 for OTs and 0.68 for PTs. No statistically significant differences in MMI performances were detected. Predictable violations of MMI security do not unduly influence applicant performance ratings. | Response process Internal structure |
| Brownell K et al., | Acceptability and Feasibility design University of Calgary (Canada) N = 281 Medical Admission | Introducing the MMI as a replacement for the more traditional interview process and its acceptability by applicants and interviewer | Interviewers indicated that they had adequate time to assess applicant, the MMI was a fair assessment and scoring sheet allowed them to differentiate between applicants. Applicants indicated that they received well information before-hand for the MMI, MMI was free of gender and cultural bias and there was sufficient time to present ideas at the stations. | Response process |
| Harris S. and Owen C., | Australian National University (Australia) N = 115 candidates Medical Admission | Describe the development and pilot testing of a set of admissions instruments based on the McMaster University MMI. Designed to assess desirable, non-cognitive characteristic in order to inform final decisions on candidate selection for entry to medical school | 97 candidates were deemed satisfactory and 18 were excluded on the basis of their interview from further consideration for admission This method proved to be an efficient process to determine suitability of candidates to the course. For the retained and rejected candidates, they had significantly different total scores and mean scores for each station. | Response process Consequences |
| Lemay J.F. et al., | Observational Study design* University of Calgary (Alberta, Canada) N = 281 Medicine School | Develop an MMI assessment that would measure several non-cognitive attributes | Cronbach's alpha for the stations ranged from 0.97 to 0.98 The correlations between stations, based on total scores, ranged from 0.042 to 0.360. | Internal structure |
| Reiter H. I. et al., | Observational Study design Michael G DeGroote School of Medicine at McMaster University N = 117 candidates for MMI (Spring 2002); 45 enrolled in clerkship evaluation and MCCQE (2005/2006) | Comparing performance on the various admissions measures used by McMaster University's medical school with in-program performance during clerkship evaluation and national licensing examination. | Correlations between 5 admission tools and in-program assessment exercise confirmed that: Only the MMI was statistically predictive of OSCE performance (standardized β = 0.4, Only the MMI was statistically predictive of clerkship performance, measured with both the average ratings assigned by clerkship directors (standardized β = 0.7, Correlation between each admissions tool and performance on the subscales of the MCCQE Part I revealed that: Only the MMI was statistically predictive of CLEO or PHELO performance (standardized β > 0.4, Only the MMI was predictive of CDM performance (standardized β = 0.35, The MMI and uGPA were equally predictive of overall test performance (standardized β > 0.3, | • Consequences |
| Hofmeister M. et al., | University of Alberta, Edmonton, Canada N = 71 Physician, medical educators | Assess the acceptability of the MMI to both international medical graduate (IMG) applicants to family medicine residency training in Alberta, Canada | 69 applicants responded to applicant acceptability survey. 13 (18.8%) indicated that they had been interviewed for residency training position before and 26 (37.7%) indicated they had not; 30 (43.5%) did not respond. Five theme were identified: expression of appreciation (n = 16), content (n = 15), format (n = 14), requests for more information on MMI (n = 5) and miscellaneous (n = 5) | Response process |
| Humphrey S. et al., | West Midlands Deanery, UK N = 96 Senior House Officers (SHOs) to a UK regional paediatric training programme | Assess candidates' and interviewers' perceptions of the use of a MMI for selection of senior house officers (SHOs) to a UK regional paediatric training programme | Both candidates and interviewers were positive about the fairness of MMI (Mean score of 4.0 and 4.4. respectively) Gender, age and previous experience of MMIs did not account for differences in candidate responses (P > 0.05) Total of 86% candidates were international medical graduates who preferred the format more than UK graduates did (P = 0.01) Interviewers mainly experienced consultants who agreed that the multi-station format was better than traditional interview (mean score 4.8) and represented a reliable process (mean score 4.4). | • Response process |
| Roberts C. et al., | University of Sydney N = 485 candidates; 155 interviewer and 21 questions taken from a pre-prepared bank Medical candidates | Establish whether interviewers can make reliable and valid decisions about applicants when selecting candidates for entry to a graduate-entry medical program, using a pre-professionalism framework and the MMI format. Wanted to know which features of the MMI were most useful in guiding admissions committees to focus their resources in making robust decisions about candidates. | For a single MMI question and 1 interviewer, 22% of the variance between scores reflected candidate-to-candidate variation. The remaining 78% would reflect unwanted factors. The generalizability coefficient with this test format was 0.70 The reliability for an 8-question MMI was 0.7; to achieve 0.8 would require 14 questions. A disattenuated correlation with the Graduate Australian Medical School Admissions Test (GAMSAT) subsection ‘Reasoning in Humanities and Social Sciences’ was 0.26 The mean inter-question correlation was 0.23 (SD = 0.15), giving a typical range of inter-question correlations of 0.08–0.38 (range – 0.22 to 0.58) | Internal structure Relation to other variable |
| Rosenfeld J. M. et al., | Observational Study design* McMaster University (Canada) N = 400 applicant for MMI N = 400 applicant for Traditional Interview Medical School | Determine the cost to ensure that the tools can be feasibly implemented Provide answers to frequently asked questions regarding the logistics involved in mounting a MMI | MMI: 72 h of creation time for 24 MMI Stations, $50 per station = $1200 MMI: N = 192 (96 per each of 2 days of interviews) thus 66.7 h per 400 candidates TI: N = 192 (48 per each of 4 days of interviews), thus 400 h per 400 candidates MMI: Admissions coordinator 154 to 278 staff hours (11–19 staff spread over 2 days of interview) and 8 actors for each 2 days MMI: 8 clinics with 12 rooms provided in kind 4800 score sheets ($240) TI: 16 clinics rooms provided in kind 1200 score sheets ($60) MMI is at least as cost efficient as many other personal interviews format even though MMI's disadvantage is the requirement of a greater number of rooms. | Response process |
| Eva K. W. et al., (20) 2009 | McMaster University, Canada Undergraduate N = 34 Postgraduate N = 22 Medical School | Tests of the validity of the MMI selection process, comparing MMI scores with those achieved on a national high-stakes clinical skills examination Explore the stability of candidate performance and the extent to which so-called ‘cognitive’ and ‘non-cognitive’ qualities should be deemed independent of one another | Median reliability of eight administrations of the MMI in various cohorts was 0.73 Correlation between performance on them MMI and number of stations passes on an objective structured clinical examination-based licensing examination was r = 0.43 (P < 0.05)) in postgraduate sample and r = 0.35 (P < 0.05) in undergraduate sample Correlation between ‘cognitive’ and ‘non-cognitive’ assessment instruments increased with time in training | Internal structure Relation with other variable |
| Hofmeister M. et al., (21) 2009 | University of Calgary and University of Alberta N = 71 International medical graduate Family Medicine Residency | Develop and assess the evidence for the validity and reliability of the MMI in the assessment of professionalism in IMG candidates for family medicine residency education at the Universities of Calgary and Alberta. | The reliability as indicated by the generalizability coefficient associated with average station scores was 0.70 with one interviewer per station There were no statistically significant differences in total MMI scores or mean station sum scores based on session, track, applicant age, gender, years since medical school completion or language of medical school. There were low, non-significant correlations with OSCE overall (r = 0.15), MCCEE (r = 0.01) and MCCQE I (r = 0.06) scores and a higher non-significant correlation with MCCQE II scores (r = 0.33) | Internal structure Relation to other variable |
| Kumar K et al., (22) 2009 | University of Sydney, Australia (6 Days), University of British Columbia, Vancouver, Canada (2 Days) N = 442 Candidate; 75 Interviewer Medical selection | Reports on a qualitative analysis of participants' experiences and perceptions of a high-stakes process for selection into graduate-entry medical school | One-to-one interviews Improved the quality of candidates–interviewer interaction Reduced the stress Multiple assessment opportunities Ease interviewer anxiety associated with having to judge candidates unfavorably (I) Gave better chance of discriminating amongst candidates (I) Gave candidates chance to redeem a ‘bad first impression with one person’ and regain their composure (C) Standardized scenario-based interviews Make it harder for rehearsal and coaching of responses and thus gave insight into the ‘genuine’ ability of candidates (Candidates) Potentially reduced interviewer subjectivity (Interviewers) The mini-interview Short time (C) Limits the quality of interaction within the interview and lacks opportunity to impress the interviewers (C) What is being measured? Assessed ‘reasoning skills and the ability to unpick a dilemma/situation’ (C) Assessing communication skills and were concerned that decisions based on social interaction skills rather than the reasoning capabilities of the candidate (I) What else should be measured? Lack of opportunity to discuss specific personal qualities of candidates (C) Include one station to explore the candidate's commitment to medicine and to university which they had chosen (C) | Response process |
| Razack S. et al., (23) 2009 | Pilot Study design* McGill University Medical School, Canada Faculty of Medicine N = 100 | Assess the acceptability of the MMI in our context and to compare this new tool with our current selection tools, including the traditional interview | MMI rated more highly than traditional interview on fairness, imposition of stress and effectiveness as a measurement tool. MMI also i) allowed them to be competitive; ii) was enjoyable and iii) was often a favourite part of their interview experience. | Response process |
| Roberts C. et al., (24) 2009 | University of Sydney, Australia Medicine and Dentistry Admission N = 207 Interviewers; 686 candidates | Establish the conceptual equivalence (concept in this case being entry-level reasoning skills in professionalism) Investigate whether there were any systematic differences in outcome among equally able candidates from different subgroups caused by DIF 4-facet Rasch Model | SD by facets were: interviewer stringency or leniency, 0.52; candidate ability, 0.75; and MMI question difficulty, 0.27. The spread of candidates was 1.44 times that of the interviewers, therefore variance was 2.08 times that of the interviewers All the questions were well within the predetermined range of 0.5–1.7 All 39 questions had a good fit to the IRT model Of 195 checklist items, none were found to have significant DIF after visual inspection of expected score curves. | Response process Internal structure |
| Dore K. L. et al., (25) 2010 | Observational Study design* McMaster University: obstetrics-gynecology and pediatrics University of Alberta (internal medicine) N = 484 candidates (across 2 years) | Investigates the reliability and acceptability of the MMI at the postgraduate level, given the differences in the applicant pool and process | Overall reliability of the 7-station MMI across all years and program was at least moderately acceptable, 0.55–0.72 This 7-station were used for feasibility and until acceptability was established Estimating overall reliability for 10-station, the range of reliability would increase to 0.64–0.79 88% candidates believed they could accurately portray themselves during the MMI 77% indicated that specialized medical knowledge was not needy to complete the stations. While 90% of interviewers believed they could reasonably judge candidates' abilities. | Response process Internal structure |
| Kulasegaram K. et al., (26) 2010 | Observational Study design* Michael G. DeGroote School of Medicine, McMaster, Hamilton, Canada N = 152 applicants Medical School | Examine the possible association between personality and non-cognitive performance since there is ample evidence that the MMI is a valid predictor of non-cognitive outcomes in medical school | Significant differences in GPA scores in favour of the non-volunteers, [F = 15.68 (p < 0.0001)], the difference was only 0.05 (is not sufficient to bias computed correlation). Correlation among the subscores of the Neo-5: ranged from −0.40 to +0.30 (Mean = 0.035) with the largest negative correlation between neuroticism and conscientiousness No personality variable correlated significantly with MMI total score. The mean correlation was 0.09 and the largest (non-significant) correlation was 0.20. | Relation to other variable |
| Roberts C. et al., (27) 2010 | Historical Study design* University of Sydney, Australia N = 207 interviewer; 686 candidates Medicine and Dentistry Admission Using the same candidates and interviewer in Questions Bank in MMI article (Roberts C. et al., 2009) | Focuses on the performance of interviewers and how this impacts on the reliability and acceptability of the MMI, and discusses possible strategies to moderate any negative influences Many-Facet Rasch Model (MFRM) has the capability to both identify these sources of error and partially adjust for them within a measurement model that may be fairer to the candidate FACETS Software (Many-Facet Rasch Model) | Decision of 207 interviewers had an acceptable fit to the MFRM model. Interviewer stringency/leniency and question difficulty were anchored by the measurement model at 0.00 logits and candidate ability was allowed to float Interviewers are more variable than MMI questions and the spread of interviewers is nearly 3.5 times that of MMI questions. Reliability of separation index in terms of stringency/leniency gave a reliability of 0.91 High reliability indicates that the interviewers are meaningfully separated according to their levels of severity from lenient to stringent with a high degree of confidence | Response process Internal structure |
| Jones, P. E. & Forister, J. G., (28) | Cohort study design McMaster University (Canada) N = 176 (N = 93: Behavioral Interview Format, N = 83: MMI) Physician Assistant Program | Compare the experiences with behavioral-based and MMI outcomes with two consecutive cohorts of applicants at one public US PA program. Rasch Analysis Software | Item reliability for four behavioral interviews was 0.64 compared to 0.77 for ten MMI stations Person reliability was 0.72 for 85 non-extreme scoring persons behavioral interview as compared to 0.57 for 83 in MMI BI and MMI had similar model fit. BI did not adequately measure differences in applicant characteristics but MMI measured more variation in non-cognitive traits. | Internal structure Relation to other variable |
| O'Brien A. et al., (29) 2011 | Pilot Study design** St George's University of London, UK Undergraduate streams for medicine N = 47 candidates | Design a modified version of an MMI and to evaluate its potential reliability, feasibility and acceptability, with a view to assessing whether such an approach could be justified at St George's Assess its concurrent validity compared to the standard interviews (Sis) | No difference in performance between 5-year streams and 4-years stream: MBBS 5 mean score was 3.75 (SD 0.54) MBBS 4 mean score was 3.73 (SD 0.74) No difference for the interview scores: MBBS 5 mean 3.51 (SD 0.52) and MBBS 4 mean 3.56 (SD 0.57) MBBS 4 candidates, Cronbach's alpha for MMI was 0.69 MBBS 5 candidates, Cronbach's alpha was 0.73 MBBS 4 applicants performed just as well on the MMI as they did on the traditional interview, with MBBS 5 applicants performing better on the MMI | • Internal structure |
| Uijtdehaage S. et al., (30) 2011 | Observational Study design* David Geffen School of Medicine, University of California, Los Angeles N = 76 applicants (2009); N = 78 applicants (2010) Medical undergraduate | Determine the reliability of the MMI, potential bias in scores, and the degree of acceptance by the interviewers and applicants | 2010 cohort rated the MMI process as less stressful (average 3.7 in 2010 versus 4.2 in 2009) The preliminary reliability of the MMI in 2009 was 0.58 – lower than reported in previous studies – but improved in 2010 to 0.71 after an easy station was replaced with a more challenging. Applicants indicated that MMI process was free from cultural bias (average 6.3) or gender bias (average 6.6) in 2009 | Response process Internal structure |
| Yen W. et al., (31) 2011 | Michener Institute for Applied Health Sciences, Toronto, Canada N = 196 candidates Health Science | Investigate the potential for the EQ-i to serve as a proxy measure to the MMI Bar-On EQ-i emotional intelligent instrument | MMI had a moderate reliability estimate of 0.75, and EQ-i had a lower estimate of 0.65 MMI total score and the EQ-i total score were not found to be significantly correlated (p = 0.14) nor were relationships found at the subscale level (p > 0.003) Correlational analysis suggests that a relationship does not exist between the MMI and the EQ-i | Internal structure Relation to other variable |
| Cameron A. J. and MacKeigan L. D., (32) 2012 | Pilot Study design Leslie Dan Faculty of Pharmacy (LDFP), University of Toronto N = 30 Candidates Pharmacy | Determine specific nonacademic attributes to be assessed in an MMI designed for admission to the Leslie Dan Faculty of Pharmacy Assess the feasibility (resources and procedures) and acceptability of the MMI to candidates (interviewees) and interviewers Determine optimal station duration Assess the discriminant validity of the MMI Assess the reliability of the overall MMI score | The ICC for the overall 10-station score was 0.77 (95% CI, 0.63–0.88); for five 6-min stations was 0.66 (95% CI, 0.42–0.82) The Pearson r coefficient for MMI and PPA was −0.025 MMI and PCAT composite score was 0.042 Coefficient for PPA and PCAT composite was 0.370 (p = 0.048) Station scores provided by student interviewers were slightly higher than those of faculty member or practitioner interviewers The ICC for the MMI was 0.77 and correlations with PPA and PCAT composite were negligible | Response process Internal structure Relation to other variable |
| Dowell J. et al., (33) 2012 | Dundee University, UK Dundee Medical School N = 452 (2009); 477 (2010) | Describe Dundee medical school's progression from a traditional interview to a full-scale MMI, psychometric properties, views of applicants and assessors and to identify areas of further research | Questionnaire confirmed that the process was acceptable to all parties Cronbach's alpha reliability was satisfactory and consistent The range of correlations between station scores for 2009 and 2010 were 0.057–0.363 and −0.061 to 0.308, respectively Using MFRM fair scores would change the selection outcome of 6.2% and 9.6% mod candidates in 2009 and 2010 respectively. Student was less lenient, made more use of the full range of the rating scales and were just as reliable as staff | Response process Internal structure |
| Eva K. W. et al., (34) 2012 | • Cohort Study design | Determine whether students deemed acceptable through a revised admission protocol using 12-station MMI outperform others on the 2 parts of the Canadian national licensing examination (MCCQE) | Candidates accepted by the MMI-admissions process had higher scores in the Canadian national licensing examinations than those who were rejected for: Part I (mean total score, 531 [95% CI, 524–537] vs. 515 [95% CI, 507–522]; Part II (mean total score, 563 [95% CI, 556–570] vs. 544 [95% CI, 534–554]; Those who matriculated at McMaster did not outperform those who matriculated elsewhere for: Part I (mean total score, 524 [95% CI, 515–533] vs. 546 [95% CI, 535–557]; Part II (mean total score, 557 [95% CI, 548–566] vs. 582 [95% CI, 569–594]; | Consequences |
| Griffin B. and Wilson I., (35) 2012 | Cross-sectional Study design* Australian School of Medicine 2006: N = 364 2007: N = 336 2008: N = 337 Medical School | Assess how MMI scores relate to both the five factors and their associated facets. Using UMAT (a cognitive ability test used by medical schools across Australia and New Zealand); UAI (a percentile score based on high school academic performance and similar construct to GPA) | Extraversion and conscientiousness were correlated with MMI scores in all 3 years, and agreeableness was significantly related in 2 of the 3 years (p < 0.05) 4 of 6 facets of extraversion and conscientiousness were consistently correlated with MMI scores, but 2 facets (excitement seeking and cheerfulness under extraversion, and orderliness and dutifulness under conscientious) were unrelated. Suggest that MMIs and personality questionnaires are not interchangeable so we would therefore not recommend that personality tests to be used as a screening tool for MMIs. | Relation to other variable |
| Jerant A. et al., (36) 2012 | Observational Study design University of California, Davis (UCD), California N = 444 candidates School of Medicine | Examine relationship among applicant personality, MMI performance and medical school acceptance offers | Those with extraversion scores (among 444 candidates) in the top (versus bottom) quartile had significantly higher MMI scores (adjusted parameter estimate = 5.93 higher, 95% CI: 4.27–7.59; P < 0.01) In a model excluding MMI score, top (versus bottom) quartile agreeableness (AOR = 3.22; 95% CI 1.57–6.58; P < 0.01) and extraversion (AOR = 3.61; 95% CI 1.91–6.82; P < 0.01) were associated with acceptance offers. After adding MMI score to the model, high agreeableness (AOR = 4.77; 95% CI 1.95–11.65; P < 0.01) and MMI score (AOR 1.33; 95% CI 1.26–1.42; P < 0.01) were associated with acceptance offers. | Relation to other variable |
| McAndrew R. and Ellis J., (37) 2012 | Observational Study design Cardiff University, UK N = 190 Candidates Cardiff Dental School | Evaluate the MMI process as part of the admissions procedure for dental school Gain a greater insight into the applicants' and participations' experiences of the MMI process and inform future research in this field | 127 (67%) felt they had perform well with 24 (19.3%) unsure and 36 (12.8%) saying their performance was not ideal MMI experience was considered favourable and from 137 written comments received by candidates, 39 were most positive with reference to the MMIs | • Response process |
| Reiter H. I. et al., (67) 2012 | 6 schools that is: Data for admissions ending in 2008 and 2009. N = 5253 (across six schools) Medical students | Investigates whether the MMI is diversity-neutral and if so, whether applying it with greater weight would dilute the anticipated negative impact of diversity-limiting admissions measures. | There was a positive correlation between MMI scores and age (P < 0.05); the correlation was neutral with respect to gender, size of community of origin and income level, and there was a negative correlation (P < 0.05) for those with self-declared aboriginal status. This means that MMI scores were unrelated to gender, size of community of origin and income level. They correlated positively with age and negatively with aboriginal status. GPA and MCAT correlated negatively with age and aboriginal status, GPA correlated positively with income level, and MCAT correlated positively with size of community of origin. | Response process Relation to other variable |
| Fraga J. D. et al., (38) 2013 | Retrospective Cohort study Reading Hospital Institutional Review Board N = 237 applicants; 17 faculty interviewer Internal medical residency | Investigate reliability and acceptability of five-station MMI model for resident selection into an internal medicine residency program in the Unites States | G-coefficient are reported for each station and ranged from a minimum of 0.97 for the Last Call station to a high of 0.98 for Overloaded Census station G-coefficient for combinations of candidate within station = 0.96 G-coefficient for station within interviewer = 0.98 G-coefficient for interviewer within candidate = 0.95 Applicants indicated that they agreed with the statements that ‘the MMI was fair’ more strongly than with ‘a traditional interview is fair’ (5.12 vs. 4.07, Applicants had higher agreement that ‘the MMI is effective at evaluating non-cognitive skills’ than for ‘the traditional interview is effective evaluating non-cognitive skills’ (5.05 vs. 3.41, No difference in perceived stressfulness of the MMI compared to the traditional interview (3.06 vs. 3.18, Generalizability data showed that even with only five stations, the reliability of the process was high enough for high-stakes decisions such as admissions (>0.9 for candidate within station) | Response process Internal structure |
| Husbands A. & Dowell J., (39) 2013 | • Cohort Study design* Dundee Medical School, UK N = 140 (1st year students; 128 (2nd students) in 2009, N = 150 (1st year students) in 2010 | Address which aspects of the selection process can be justified in terms of predictive validity of knowledge-based and OSCE examinations in early medical school Establish if MMIs are useful in the UK | MMI scores significantly correlated with six of 10 examination sittings, with magnitudes ranging from 0.24 to 0.50 (unrestricted), accounting for between 5.70% and 25.00% of variance in students' examination scores. Multiple regressions also confirmed that the MMIs remained the most consistent predictor of success, accounting between 5% and 17% of the variance in assessment scores alone or in combination with candidates' gender. | • Relation to other variable |
| McAndrew R. and Ellis, J. (40) 2013 | Observational Study design* Cardiff University Dental Hospital, UK N = 235 (BDS interviewees); N = 62 (Therapy and hygiene interviewees) | Identify any new themes for consideration during the MMI process highlighted by a post-interview evaluation questionnaire in a new cohort of undergraduate dental students and undergraduate dental care professionals (hygienist and therapist) Compare and contrast the findings between groups Further detailing and quantification of applicants' perception towards the MMI as a selection tools by analyzing the responses from the different applicants cohorts | Binomial statistically calculated and chi-squared tests of independence of categorical variables on nominally scaled data revealed statistically significant differences (p > 0.001) and both groups of students were positive regarding the selection process (p > 0.001) 54 separate free text comments recorded but qualitative analysis failed to identify any themes that had not been previously identified. | • Response process |
| Perkins A. et al., (41) 2013 | Kingston University and St George's University of London, UK N = 890 candidates Nursing Programme | Assist in identifying those candidates who demonstrate the potential to achieve the Nursing and Midwifery Council (NMC) domains | Positive response from candidates with 65% replying that it was “a better experience” compared with traditional interviews. Interviewers likewise responded positively with 71% noting “a better experience” Unsolicited feedback indicated that some would have preferred to have had greater opportunity to discuss nursing issues with their interviewees. | Response process |
| Raghavan M. et al., (42) 2013 | Observational Study design* University of Manitoba, Canada Faculty of Medicine N = 1257 applicants (2008–2011) | Seek an association between MMI scores and applicants' place of high school graduation Find associations between MMI scores and applicant attributes in the following three domains: rural connections, employment in rural areas and rural community service such as volunteer and leadership activities Understand MMI performance of applicants from Aboriginal communities | Rural high school graduates scored significantly lower (mean of 4.4 on scale of 1–7; p,0.05) than urban high school graduates (4.6) Among rural-attribute domains, those with rural community service alone had the highest MMI scores (4.9) while those with rural connections alone had the lowest scores (4.3; p = 0.016) GPA (mean of 4.0) of rural high school graduates was similar to that of graduates of urban high schools (4.1) GPA and MCAT scores in a multiple linear regression model, rural-attribute domains were not significant predictors of an applicant's MMI score | Response process Relation to other variable |
| Says F. E. et al., (43) 2013 | Observational Study design* King Abdulaziz University, KSA N = 352 participant (Men = 174; women = 178) Faculty of Medicine | Evaluate the applicant's non-cognitive strength and area for improvement identify the outstanding students to be selected for different faculty committees, students support unit, as their skills will be looked after and developed more through courses and workshops Identify below-average students who will receive the same attention in order to enhance and develop their non-cognitive skills | Percentage of men students performed well in station (personal character) about 95% Percentage for women is about 90% were excellent, 9% were acceptable and about 1% was poor. 94% and 71% of women and men students, were excellent in team work and communication skills respectively 94% and 73% of women and men students, were excellent in behaviors respectively Reliability analysis using the Cronbach alpha revealed moderate reliability index (0.38) among the performance results of men, whereas women is 0.76 (high reliability index) The reliability index among whole student performance was 0.61 | Response process Internal structure |
| Tavares W. and Mausz J., (44) 2013 | Observational Study design* Centennial College Simulation Centre (Toronto, Ontario, Canada) N = 30 candidates Paramedicine | Seek evidence of reliability and validity for the assessment of non-clinical attributes using the MMI in paramedic contexts Explore the association between non-clinical attributes and clinical skills in paramedicine | Inter-station reliability for the MMI reached 0.77 Pearson's correlations between the overall MMI score and mean Simulation-Based Assessment (SBA) global rating scores reached r = 0.31 (r = 0.48) and ranged by dimension from r = −0.11 (−0.17) (procedural skills) to r = 0.54 (r = 0.83) (communication) | Internal structure Relation to other variable |
| Till H. et al., (45) 2013 | Observational Study design* University of Dundee Medical School N = 452 candidate assessed in 2009 | Explore whether the MMI as used at the University of Dundee Medical School is an effective instrument that can reliably separate the more able candidates form the less able candidates Explore whether the three groups of examiners who carried out the assessment process exhibited any systematic differences in their rating patterns Multifaceted Rasch Modeling | MMI reliably (0.89) separated the candidates into four statistically distinct levels of non-cognitive ability. The Rasch measures accounted for 31.69% of the total variance in the ratings (candidates 16.01%, examiners 11.32% and stations 4.36%) Students rated more severely than staff and also had more unexpected ratings. Adjusting scores for examiner severity/leniency and station difficulty would have changed the selection outcomes for 9.6% of the candidates. | Response process Internal structure |
| Tiller D. et al., (46) 2013 | Cohort Study design* University of Sydney N = 293 international candidates (iMMI) N = 571 local candidates (MMI) Medical and dentistry | Determine whether the scores derived from the iMMI process were both equivalent to and as reliable as those scores derived from the in-person MMI Wants to describe the feasibility, acceptability and cost-effectiveness of the iMMI iMMI using Skype | Comparison of MMI and iMMI scores for international candidates in 2009 and 2011, respectively, was no significant in terms of variance (p = 0.338) No significant difference between the MMI scores for local and international candidates in 2011 (p > 0.05); the MMI scores for international candidates had greater variation (p < 0.01) The reliability of the nine-question iMMI was 0.76 and for MMI was 0.70 Interviewer satisfaction (iMMI circuits and the technology): rated highly with 61 (78%) giving the two most positive ratings for the interview process, 71% for satisfaction with the technology Estimated administrative cost of conducting the iMMIs was AU$10 145, representing a saving of AU$51 742 or 84% from estimated cost when conducting in Vancouver in 2009 (AU$61 887) | Response process Internal structure |
| Ahmed A. et al., (47) 2014 | Observational Study Dubai Health Authority, United Arab Emirates N = 187 candidates Dubai Residency Training Program Family Medicine | Explore the implementation of MMI in different cultural settings | A MMI with 8 stations, produced absolute and relative reliability of 0.8 and 0.81, respectively | • Internal structure |
| Andrades M. et al., (48) 2014 | Aga Khan University, Pakistan N = 16 (interviews); N = 14 (MMI) Family Medicine Residency Programme | Compare MMI versus semi-structured interviews for assessing non-cognitive domains in the selection of Family Medicine residents. Determine perceptions of the interviewers and candidates for the acceptability and feasibility of MMI as a selection tool | Majority (87%) of the interviewers believed that they were able to get an accurate portrayal of the candidates. 50% (5 out of 8) interviewers were unsure of the feasibility of conducting an MMI compared to the interviews. All but one of the interviewers thought that interviews can be replaced by MMI. | Response process |
| Barbour M. E. et al., (49) 2014 | Observational Study design* University of Bristol, United Kingdom Dentistry and medicine programs N = 414 candidates (213 candidate in 2011–2012; 201 in 2012–2013) | Establish whether starting station influenced total score Determine whether gender influenced total score or MMI outcome Assess candidate and interviewer acceptance of MMI | MMI provided an efficient means to discriminate between the performance of applicants who were all academically highly qualified, with total scores ranging from 35% to 87% of the maximum possible score Female candidates performed significantly better than male candidates when assessed by total score (p = 0.011; mean score 94.4 for female applicants and 91.9 for male applicants) and by outcome (offer/decline; p = 0.016; 58.6% of female and 46.4% of male interviewees received an offer of study following interview) Stakeholder acceptance was good, with 75% of candidates and 95% of assessors preferring MMI over traditional interviews | Response process Internal structure |
| Callwood A. et al., (50) 2014 | Mixed Method Study design* University in the South East of England, UK N = 62 participants Midwifery Studies programme | Develop, pilot and examine the reliability of MMIs in pre-registration student midwife selection in the UK | Station specific attributes assessed (with Cronbach's alpha scores) included: compassion and empathy (0.93); respect for difference and diversity (0.96); honesty and integrity (0.97); intellectual curiosity and reflective nature (0.94); advocacy (0.91); respect for privacy and dignity (0.95); team working and initiative (0.96); the role of the midwife and motivation to become a midwife (0.95) Cronbach's alpha scores measuring internal consistency ranged from 0.91 to 0.97 | Internal structure |
| Eva K. W. and Macala C., (51) 2014 | Observational Study design* University of British Columbia, Vancouver, Canada N = 41 medical school candidates Faculty of Medicine | Explore the impact of MMI station design on the utility of the process for making selection decisions | Behavioral interview (BI) stations more reliably differentiated between candidates (G = 0.77) than did the other station types (SJ, G – 0.69; FF, G = 0.66) The correlation between actual MMI scores and BI stations was also greatest (BI, Candidates considered the FF stations to be more challenging and more anxiety-provoking than either the SJ or BJ stations | Response process Internal structure |
| Hissbach J. C. et al., (52) 2014 | Pilot test in 2009 Hamburg University, Germany N = 80 (2009); N = 200 (2010) Medical school admission | Improve reliability and reduce costs of the subsequent procedure in 2010 | The mean inter-station correlation in 2009 was 0.20 (min: −0.11; max: 0.44) and 0.19 in 2010 (min: 0.07; max: 0.32) The overall reliability of the initial 2009 HAM-Int procedure with 12 station and an average of 2.33 raters per station was ICC = 0.75 In 2010 the ICC remained stable at 0.76, despite the reduction of the process to nine stations and 2.17 raters per station Costs were cut down from $915 to $495 per candidate. With the 2010 modalities, we could have reached an ICC of 0.80 with 16 single rater stations ($570 per candidate). | Response process Internal structure |
| Hopson L. R. et al., (53) 2014 | University of Michigan, Ann Arbor, Michigan N = 71 participant Emergency Medicine Resident Selection | Determine the perspective of participants on the MMI in a stimulated EM interview situation Determine if the MMI would provide new information to assess residency candidates | Participant responses showed a negative effect on a decision to interview at this program on both pre (mean 2.7, SD 0.7) and post (mean 2.8, SD 0.9) EM grades correlated with MMI performance ( Participants preferred a traditional interview (mean difference = 1.36; Mixed format (traditional interview and MMI) was preferred over a MMI alone (mean difference = 1.1; | Response process Relation to other variable |
| Joshi N. K. et al., (68) 2014 | Media social such as Twitter, ALiEM Blog, YouTube Data is taken for the first 14 days of the event Able to attract 712 unique readers from 41 countries | Organize and summarize the responses from the global education community and propose potential solutions and recommendations | Non-cognitive abilities are already adequately elicited with the current interview process Several comments centered on the time and resources that are required to recruit and train MMI facilitators, select and execute the MMI stations and assess applicant performance. Contributors' comments were consistent in expressing the importance of an interview process that is fair, unbiased, equitable and reliable, and elicits information about non-cognitive qualities. | Response process |
| Kelly M. E. et al., (55) 2014 | Mixed method study design National University of Ireland Galway N = 109 participate (EU origin = 64, Non-EU = 45) International Medical students | Aims of this study were to run an experimental MMI in an internationally diverse student population to establish its Establish the fairness with respect to age, gender, socioeconomic group and candidate background Predictive validity in year one assessment outcomes Stakeholder (MMI candidates and assessors) acceptability | Non-EU students and those for whom English was not a first language achieved significantly lower scores on MMI than their EU and English speaking counterparts (difference in mean 11.9% and 12.2% respectively, P < 0.001) MMI score was associated with English Language proficiency (IELTS) (r = 0.5, P < 0.01) Correlations emerged between First Year results and IELTS (r = 0.44; p = 0.006; n = 38) and EU school exit exam (r = 0.52; p < 0.001; n = 56) MMI predicted EU student OSCE performance (r = 0.27; p = 0.03; n = 64) MMI was considered a highly authentic assessment that offered a deeper understanding of the applicant than traditional tools, with an immediate relevance to clinical practice. Cultural specificity of some stations and English language proficiency were seen to disadvantage international students. | Response process Relation to other variable |
| Kelly M. et al., (54) 2014 | Feasibility Study design* Clinical Science Institute, National University of Ireland, Galway, Ireland N = 241 eligible students Medical program | Establish the feasibility of running a MMI in an Irish setting | Cronbach Alphas for the 10 individual stations range from 0.74 to 0.80. Overall Cronbach Alpha of MMI was 0.78 90% either agreed or strongly agreed that the content of the MMI was relevant to their understanding of the practice of medicine MMI was considered almost on a par with academic achievement as suitable grounds for selecting medical students MMI 73%; Academic achievement 79%; whereas other tools were less favored. | Response process Internal structure |
| Liao SC. et al., (56) 2014 | National Cheng Kuang University (NCKU), Taiwan N = 122 Department of Medicine | Strengthen the evaluation of applicants' interpersonal skills Construct a more holistic and valid medical student admission interview | Correlation between each of the seven MMI stations and the group interview were all positive, ranging from 0.15 to 0.42 The correlation coefficients were all significant (all Cronbach's alpha for the seven MMI stations was 0.54 Cronbach's alpha for the MMI combined with the group interview rose to 0.63 Combining the group interview with the MMI increased the internal consistency of the entire interview and made it a more valid interview | Internal structure Relation to other variable |
| Oliver T. et al., (57) 2014 | Ontario Veterinary College (OVC) N = 186 candidates Doctor of Veterinary Medicine (DVM) program | Investigate if different MMI scores measure distinct non-cognitive attributes Determine if MMI scores (construct specific or total MMI scores) are related to conceptually relevant personality measures and conceptually relevant scores in a standardized clinical communication interview | The reliability for the MMI was Ep2 = 0.73, suggesting that scores for participants on oral communication and problem evaluation were generally consistent across raters and stations. Standardized factor loadings ranged from 0.55 to 0.71 for oral communication, and from 0.32 to 0.58 for problem evaluation (all with p < 0.001). The correlated uniqueness values for scores within the same station ranged from 0.48 to 0.67, and more importantly the correlation between the oral communication and problem evaluation factors was extremely high (r = 0.87) These result suggest that, while there is a good fit and rational for a 2-factor model over a 1-factor model, there are method (station) and trait (attributes measured) effects present which limit the ability to conclude we are assessing two independent factors Total MMI score had a weak but significant correlation with extraversion, and significant correlations with building the relationship and explaining and planning. | • Internal structure |
| Oyler D. R. et al., | Pilot Study Design University of Kentucky (United States) N = 38 Pharmacy Admission | Develop and implement a new interview process designed to assess the soft and hard skills necessary for success in postgraduate year 1 (PGY1) Assess candidate and interviewer acceptance of the MMI as a part of the traditional pharmacy residency interview process Research Electronic Data Capture (REDCap) | Candidates agreed that MMI allowed them to convey their abilities effectively but did not agree that it was more effective than traditional interview Candidate did not agree that MMI caused less anxiety than traditional interviews MMI was more effective than traditional interview in assessing candidates' abilities, skills and thought processes. | Response process |
| Roberts C. et al., (58) 2014 | Experimental Study design** University of Sydney, Sydney, Australia N = 1382 candidates Medical school | Reliability and validity of non-cognitive characteristics of candidates when selecting them into general practice specialty training using MMI Explored the concurrent validity of the MMI with the Situational Judgment Test (SJT) | For a single MMI question and one assessor, 28% of the variance between scores was due to candidate-to-candidate variation. The varying views that interviewer had for particular candidates accounted for 40% of the variance in scores. The generalizability co-efficient for a six question MMI was 0.7; to achieve 0.8 would require ten questions. A disattenuated correlation with the SJT ( | Internal structure Relation to other variable |
| Sebok S. S. et al., (59) 2014 | Experimental Study design (Nested design) Queen's University, Kingston, Ontario, Canada N = 455 school applicants Medical Program | Examine the psychometric properties of the MMI as employed at another Canadian medical school. Descriptive analyses is using Generalizability theory (G theory) and Many-Facet Rasch Model (MFRM) | Generalizability analyses: Estimated variance component for applicants was 0.47 (16.3%) The largest variance component was Large residual error variance ( There was little variability amongst stations (0.04, 1.3%) and raters nested within stations (0.01, 0.4%), indicating the stations had similar variability and the raters were scoring examinees consistently Many-Facet Rasch analyses Most of the applicants are situated in the 0–1 region on the logit scale, indicating that they were all fairly proficient Both faculty and student raters are positioned around the 0 logit mark, which verify that they were equivalent in terms of their severity Applicant reliability coefficient for the medical school applicants across the eight stations range from 0.84 to 0.92, which indicates heterogeneity among the applicants Applicant reliability coefficients for the included attributes five (communication, critical thinking, maturity, empathy and professionalism) ranged from 0.66 to 0.76 MMI meets some of the requirements for psychometric quality; however, there are still ongoing issues that need to be addressed. The analyses were also able to identify problematic applicants, raters, stations and items | Internal structure |
| Stowe C. D. et al., (60) 2014 | Pilot study University of Arkansas for Medical Sciences College of Pharmacy, United States N = 13 volunteered (pilot phase); N = 224 (Full implementation) Pharmacy | Outline the development and implementation of the MMI at the University of Arkansas for Medical Sciences (UAMS) College of Pharmacy | A significant degree of inter-rater reliability (Pearson r ≥ 0.60) was found for all the themes evaluated expect the personal attribute scenario | Internal structure |
| Zaidi N. B. et al., (69) 2014 | Experimental Design Study (Nested design) One of United States Medical School N = 15 applicants | This study explore whether items, defined as specific attributes on an MMI evaluation form, are assessed consistently across MMI stations regardless of station scenario. Generalizability (G) Theory | Applicant ( The estimated variance components from the G study suggest that the greatest amount of variance is attributable to the main effect of the scenario ( These two variance ( The The scenario Low estimate of variance attributable to the | Internal structure |
| Alweis R. L. et al., (61) 2015 | Historical Study design** at Northeastern United States Internal Medical Residency N = 13 interviewers; N = 240 candidates Qualitative study | Aimed to gain a better understanding of these biases from the perspective of the interviewer | Five major themes regarding interviewer perceptions were sub grouped into specific applicant characteristics, personality factors, cultural factors, perception of prior preparation and concerns with norming. Subjects noted that they felt that applicants with introverted personalities may have fared less well in the MMI process. Many felt that the system itself could be biased toward extroverts who may be able to respond more quickly in situations that require spontaneous answers. Seven interviewers noted that applicants unfamiliar with national systems and cultural norms may have been at a disadvantage. Five interviewers voiced concern that scores might be biased by norm-referencing, or comparing the applicants to those coming before or after them on a specific interview day. | Response process |
| Burkhardt J. C. et al., (62) 2015 | Historical Study design** University of Michigan, Ann Arbor, Michigan N = 71 out of 72 interns Emergency Medicine Residency Admission | Investigate whether it provides additional information regarding future first-year resident performance that can be useful in resident selection | An individual's score on the MMI correlated with overall performance ( MMI correlated with ACGME individual competencies patient care and procedural skills at a less robust level ( | Relation to other variable |
| Cox W. C. et al., (63) 2015 | Observational Design Study University of North Carolina (UNC) N = 253 Chapel Hill Eshelman School of Pharmacy | Describe the development, implementation and evaluation of the Multiple Mini-Interview (MMI) within a doctor of pharmacy (PharmD) admissions model | Analyses suggest that MMI stations assessed different attributes as designed, with Cronbach's alpha for each station ranging from 0.90 to 0.95. All correlations between MMI station scores and academic indicators (GPA & PCAT) were negligible. No significant differences in average station scores were found based on age, gender, or race. | Internal structure Relation to other variable |
| Sebok S. S and Syer M. D., (64) 2015 | Canadian Medical School N = 455 Applicants who were shortlisted; N = 6 raters (3 student raters, 3 faculty raters) | Investigated how raters distinguish between different non-cognitive attributes in the context of performance-based assessments to better understand unexplained rater variance Tools: Many Facet Rasch Model; Hierarchical Clustering | At times, raters were unable to distinguish between the various non-cognitive attributes Applicant reliability values ranged from 0.84 to 0.92, which suggests heterogeneity within the sample. Station 4 had a Rasch reliability of separation value of 0.94, which denotes substantial differences between the faculty and student raters. Both faculty and student raters show the non-cognitive attribute of critical thinking as more distinct from communication and empathy. Thus the placement of the attribute clusters against the center scale, which in this instance is approximately 0.30, indicates overall lower levels of dissimilarity and less variability within this station compared with station 4. | Response process Internal structure |
| Yoshimura H. et al., (65) 2015 | Observational Study design Tokyo Bay Urayasu-Ichikawa Medical Centre's, Urayasu City, Japan and Gifu University N = 26 medical graduates | Investigated reliability and acceptability of the post-graduates admissions MMI with PBQ and SQ interview formats within MMI stations. | PBQ and SQ formats had generalizability coefficients of 0.822 and 0.821, respectively One examiner per station, seven stations could produce a reliability of more than 0.80 in both PBQ and SQ formats. More than 60% of both candidates and examiners felt positive about the overall candidates' ability. All participants liked the fairness of this MMI when compared with the previously experienced SSPI. SQs were perceived more favorable by candidates; in contrast, PBQs were perceived more relevant by examiners | Response process Internal structure |
| Abdul Rahim & Yusoff, 2016 (66) | School of Medical Sciences, Universiti Sains Malaysia N = 447 Medical Student Admission | This paper described the implementation of the MMI and reported the preliminary evaluation data on its validity evidence. | Overall reliability of 5 manned stations was 0.94 CFA confirmed unidimensional of MMI Positive rating on feasibility and acceptability of MMI by interviewers and candidates Area for improvement was the variation of difficulty and discrimination of the stations between sessions. Might indicate the quality of the question, assuming that the standards of the interviewers are similar. Or might indicate varying interviewer standards between sessions. | Response process Internal structure |