| Literature DB >> 27625722 |
Teresa M Chan1, Brent Thoma2, Keeth Krishnan3, Michelle Lin4, Christopher R Carpenter5, Matt Astin6, Kulamakan Kulasegaram7.
Abstract
INTRODUCTION: Online education resources (OERs), like blogs and podcasts, increasingly augment or replace traditional medical education resources such as textbooks and lectures. Trainees' ability to evaluate these resources is poor, and few quality assessment aids have been developed to assist them. This study aimed to derive a quality evaluation instrument for this purpose.Entities:
Mesh:
Year: 2016 PMID: 27625722 PMCID: PMC5017842 DOI: 10.5811/westjem.2016.6.30825
Source DB: PubMed Journal: West J Emerg Med ISSN: 1936-900X
Figure 1Flow diagram of study design. Phase 1 depicts the quality-indicator (QI) selection process, Phase 2 depicts the score derivation process based on the reduced list of QIs, and Phase 3 describes the reliability and validity testing data for the two derived instruments for scoring the quality of medical blogs and podcasts.
QI, quality indicator; ITC, item total correlation; ICC, intraclass correlation coefficient; ALiEM, Academic Life in Emergency Medicine; AIR, approved instructional resources.
Parent websites and distribution of the 39 selected blog or podcast online educational resources (OER), from which the gestalt score was derived (Phase 2).
| Website name | Number of rated posts |
|---|---|
| Academic Life in Emergency Medicine | 12 |
| BoringEM | 1 |
| Clinical Monster | 1 |
| Dr. Smith’s ECG blog | 2 |
| Don’t Forget The Bubbles | 2 |
| Emergency Medicine Ireland | 1 |
| EM Lyceum | 3 |
| EM Basic | 1 |
| EMCrit | 1 |
| EM Literature of Note | 1 |
| ERCast | 3 |
| Life in the Fast Lane | 2 |
| Pediatric EM Morsels | 4 |
| R.E.B.E.L EM | 1 |
| The NNT | 1 |
| The Poison Review | 2 |
| The Skeptics Guide to Emergency Medicine | 1 |
NB: For a complete listing of all the rated blog posts, please refer to Appendix.
Figure 2Final list of 13 quality indicators rated by trainee raters on a 7-point Likert scale.
Demographics of raters who evaluated online educational resources.
| Instrument development trainee raters (n=40) | Expert gestalt educator raters (n=20) | |||
|---|---|---|---|---|
| % by country of origin | 2.5% United States of America | 75% United States of America | ||
| 97.5% Canada | 25% Canada | |||
| Year of training or years in practice at the time of their enrollment | 0 years in practice (All are trainees) | 10.3 years in practice (SD 10.2) | ||
| Academic affiliation | Year 1 medical student | 40% | Full professor | 10% |
| Year 2 medical student | 30% | Associate professor | 15% | |
| Year 3 medical student | 18% | Assistant professor | 65% | |
| Year 4 medical student | 3% | Clinical appointment | 10% | |
| Year 1 resident | 5% | None | 5% | |
| Year 2 resident | 3% | |||
| Year 3 resident | 3% | |||
| % current or past official medical education position within institution | N/A | 90% total | ||
| Breakdown | ||||
| Dean/chair | 15% | |||
| Residency PD | 40% | |||
| Residency APD | 45% | |||
| Other GME role | 30% | |||
| Clerkship director/UGME role | 30% | |||
| Research/quality Improvement role | 20% | |||
PD, program director; APD, associate or assistant program director; GME, graduate medical education; UGME, undergraduate medical education
Educator gestalt rating scale of blogs and podcasts for trainee learning.
| Would you recommend this to a learner? | |||||||
|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
Correlations between the scores by subjects in the first and second rating incidence.
| Question item number | Pearson’s | p-value |
|---|---|---|
| Q1 | 0.92 | <0.001 |
| Q2 | 0.84 | <0.001 |
| Q3 | 0.37 | 0.05 |
| Q4 | 0.63 | <0.001 |
| Q5 | 0.33 | 0.08 |
| Q6 | 0.45 | 0.02 |
| Q7 | 0.93 | <0.001 |
| Q8 | 0.57 | 0.001 |
| Q9 | 0.74 | <0.001 |
| Q10 | 0.71 | <0.001 |
| Q11 | 0.79 | <0.001 |
| Q12 | 0.81 | <0.001 |
| Q13 | 0.85 | <0.001 |
Inter-rater agreement on the quality indicator subscore components, calculated using a 2-way random effects model for consistency to calculate the ICCs (interclass correlation coefficient).
| Question item number | Single measure ICC | Average measure ICC | Number of missing data points | % Missing |
|---|---|---|---|---|
| Q1 | 0.04 (0.02–0.08) | 0.64 (0.47–0.79) | 202 | 13% |
| Q2 | 0.03 (0.01–0.07) | 0.56 (0.35–0.74) | 193 | 12% |
| Q3 | 0.17 (0.12–0.26) | 0.89 (0.84–0.94) | 206 | 13% |
| Q4 | 0.12 (0.07–0.19) | 0.84 (0.76–0.90) | 208 | 13% |
| Q5 | 0.10 (0.06–0.16) | 0.81 (0.71–0.89) | 713 | 45% |
| Q6 | 0.28 (0.20–0.39) | 0.94 (0.91–0.96) | 476 | 30% |
| Q7 | 0.38 (0.28–0.50) | 0.96 (0.94–0.98) | 216 | 14% |
| Q8 | 0.22 (0.15–0.32) | 0.92 (0.89–0.95) | 773 | 48% |
| Q9 | 0.16 (0.11–0.25) | 0.88 (0.82–0.93) | 465 | 29% |
| Q10 | 0.22 (0.14–0.32) | 0.92 (0.87–0.95) | 287 | 18% |
| Q11 | 0.17 (0.11–0.26) | 0.89 (0.83–0.93) | 290 | 18% |
| Q12 | 0.29 (0.21–0.41) | 0.95 (0.92–0.97) | 319 | 20% |
| Q13 | 0.14 (0.09–0.22) | 0.87 (0.80–0.92) | 285 | 18% |
Eliminated in Score Models 1 and 2 due to alpha <0.85 or single measure ICC <0.15
Eliminated in Score Model 2 since trainees were unsure too often (>25% missing data)
p-value was <0.001 for all ICC calculated
Figure 3Two proposed online educational resources evaluation instruments.
A comparison of the reliability calculations of the two proposed online educational resources evaluation instruments using different missing data procedures.
| METRIQ-8 score | METRIQ-5 score | |||
|---|---|---|---|---|
|
|
| |||
| Pragmatic analysis | Imputation analysis | Pragmatic analysis | Imputation analysis | |
|
|
| |||
| Single measure ICC (95% CI) | 0.30 (0.22–0.42) | 0.38 (0.29–0.51) | 0.22 (0.15–0.32) | 0.35 (0.26–0.47) |
| Average measure ICC (95% CI) | 0.94 (0.92–0.97) | 0.96 (0.94–0.98) | 0.92 (0.88–0.95) | 0.96 (0.93–0.97) |
ICC, intraclass correlation coefficient
NB: The pragmatic analysis awards a zero value to any missing data points. The imputation analysis substitutes the grand mean for the missing data points (any items which were not rated by the trainee raters).
Relationships between average METRIQ-8 and METRIQ-5 Scores with other comparative instruments (average educator gestalt score, ALiEM AIR certification).
| METRIQ-8 score pragmatic score | METRIQ-5 score pragmatic score | |
|---|---|---|
| Pearson correlation (r) to educator gestalt score for recommending resource to a trainee | r=0.35 | r=0.41 |
| p=0.03 | p<0.01 | |
| Logistic regression for ALiEM AIR certification status | Odds ratio 1.28 (1.09–1.50) | OR = 1.5 (1.14–2.20) |
| Wald test | Wald test | |
| (1,38)=8.8 | (1,38)=8.4 | |
| p=0.003 | p=0.004 |