| Literature DB >> 20525220 |
Jane Tighe1, I C McManus, Neil G Dewhurst, Liliana Chis, John Mucklow.
Abstract
BACKGROUND: Cronbach's alpha is widely used as the preferred index of reliability for medical postgraduate examinations. A value of 0.8-0.9 is seen by providers and regulators alike as an adequate demonstration of acceptable reliability for any assessment. Of the other statistical parameters, Standard Error of Measurement (SEM) is mainly seen as useful only in determining the accuracy of a pass mark. However the alpha coefficient depends both on SEM and on the ability range (standard deviation, SD) of candidates taking an exam. This study investigated the extent to which the necessarily narrower ability range in candidates taking the second of the three part MRCP(UK) diploma examinations, biases assessment of reliability and SEM.Entities:
Mesh:
Year: 2010 PMID: 20525220 PMCID: PMC2893515 DOI: 10.1186/1472-6920-10-40
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Figure 1In a Monte Carlo analysis, a simulated group of 10,000 candidates take an examination with a true mean of 50%, a true SD of 10%, a true reliability of 0.9, and a pass mark of 60%. Figure 1a shows the candidates' marks on the first attempt (horizontal axis), with the pass mark shown as the vertical dashed grey line, the failing candidates shown in red and the passing candidates shown in black. All of the simulated candidates then take the examination again, and their marks on that second occasion are shown on the vertical axis, with the horizontal dashed line showing the same pass mark as was used on the first occasion. Figure 1b is restricted to the 1565 candidates who passed the examination on the first assessment, and shows the marks they obtained when they took the examination for the second time (horizontal axis), and then again on taking it for a third time (vertical axis). Once again the notional pass mark of 60% is indicated by the vertical and horizontal grey dashed lines.
Reliability of the MRCP(UK) Part 1 and Part 2 examinations.
| Part 1 | Part 2 | |||||||
|---|---|---|---|---|---|---|---|---|
| 2002/3 | - | - | - | - | 149 | .79 | 7.67% | 3.51% |
| 2003/1 | - | - | - | - | 146 | .76 | 7.43% | 3.66% |
| 2003/2 | - | - | - | - | 150 | .73 | 6.94% | 3.58% |
| 2003/3 | 199 | .89 | 9.23% | 3.09% | 152 | .76 | 7.24% | 3.52% |
| 2004/1 | 200 | .89 | 9.70% | 3.10% | 149 | .75 | 7.10% | 3.55% |
| 2004/2 | 200 | .89 | 10.46% | 3.14% | 177 | .83 | 8.05% | 3.28% |
| 2004/3 | 200 | .91 | 9.68% | 3.14% | 183 | .78 | 6.94% | 3.26% |
| 2005/1 | 200 | .89 | 10.67% | 3.16% | 181 | .76 | 6.77% | 3.30% |
| 2005/2 | 200 | .92 | 9.27% | 3.08% | 180 | .80 | 7.33% | 3.25% |
| 2005/3 | 195 | .90 | 10.19% | 3.21% | 253 | .83 | 6.73% | 2.78% |
| 2006/1 | 194 | .92 | 11.08% | 3.23% | 250 | .81 | 6.46% | 2.82% |
| 2006/2 | 193 | .90 | 10.09% | 3.24% | 251 | .85 | 7.20% | 2.75% |
| 2006/3 | 195 | .89 | 9.83% | 3.27% | 253 | .82 | 6.52% | 2.80% |
| 2007/1 | 195 | .92 | 11.49% | 3.25% | 249 | .77 | 5.84% | 2.83% |
| 2007/2 | 195 | .91 | 10.59% | 3.25% | 263 | .84 | 6.89% | 2.72% |
| 2007/3 | 195 | .92 | 11.51% | 3.26% | 262 | .85 | 7.13% | 2.76% |
| 2008/1 | 184 | .93 | 11.90% | 3.15% | 264 | .82 | 6.52% | 2.76% |
| 2008/2 | 185 | .91 | 11.13% | 3.34% | 266 | .85 | 6.95% | 2.73% |
| 2008/3 | 185 | .92 | 11.59% | 3.28% | 259 | .84 | 6.99% | 2.77% |
The Part 1 papers consist entirely of Best-of-Five questions. The Part 2 papers are mostly Best-of-Five questions, with two or three >Several-from-Many (questions in each diet. Negative marking is not used in either examination.
Reliability of the first eight Specialty Certificate Examinations.
| Year | Specialty | Candidates | Number of scored items | Alpha | SD | SEM |
|---|---|---|---|---|---|---|
| 2008 | Gastroenterology | 8 | 200 | .84 | 7.00% | 2.80% |
| 2009 | Dermatology | 39 | 200 | .88 | 7.27% | 2.52% |
| 2009 | Endocrinology and Diabetes | 39 | 200 | .89 | 9.03% | 2.99% |
| 2009 | Geriatric Medicine | 15 | 200 | .48 | 3.97% | 2.86% |
| 2009 | Infectious Diseases | 6 | 200 | .94 | 12.13% | 2.97% |
| 2009 | Neurology | 25 | 200 | .89 | 9.13% | 3.03% |
| 2009 | Nephrology | 33 | 200 | .86 | 7.80% | 2.92% |
| 2009 | Respiratory Medicine | 25 | 200 | .85 | 7.47% | 2.89% |
Results are scored as percentage of answers correct, and therefore are directly comparable with value shown in Table 1.