Literature DB >> 30122457

Automation bias in medicine: The influence of automated diagnoses on interpreter accuracy and uncertainty when reading electrocardiograms.

Raymond R Bond1, Tomas Novotny2, Irena Andrsova2, Lumir Koc2, Martina Sisakova2, Dewar Finlay3, Daniel Guldenring3, James McLaughlin3, Aaron Peace4, Victoria McGilligan5, Stephen J Leslie6, Hui Wang3, Marek Malik7.   

Abstract

INTRODUCTION: Interpretation of the 12‑lead Electrocardiogram (ECG) is normally assisted with an automated diagnosis (AD), which can facilitate an 'automation bias' where interpreters can be anchored. In this paper, we studied, 1) the effect of an incorrect AD on interpretation accuracy and interpreter confidence (a proxy for uncertainty), and 2) whether confidence and other interpreter features can predict interpretation accuracy using machine learning.
METHODS: This study analysed 9000 ECG interpretations from cardiology and non-cardiology fellows (CFs and non-CFs). One third of the ECGs involved no ADs, one third with ADs (half as incorrect) and one third had multiple ADs. Interpretations were scored and interpreter confidence was recorded for each interpretation and subsequently standardised using sigma scaling. Spearman coefficients were used for correlation analysis and C5.0 decision trees were used for predicting interpretation accuracy using basic interpreter features such as confidence, age, experience and designation.
RESULTS: Interpretation accuracies achieved by CFs and non-CFs dropped by 43.20% and 58.95% respectively when an incorrect AD was presented (p < 0.001). Overall correlation between scaled confidence and interpretation accuracy was higher amongst CFs. However, correlation between confidence and interpretation accuracy decreased for both groups when an incorrect AD was presented. We found that an incorrect AD disturbs the reliability of interpreter confidence in predicting accuracy. An incorrect AD has a greater effect on the confidence of non-CFs (although this is not statistically significant it is close to the threshold, p = 0.065). The best C5.0 decision tree achieved an accuracy rate of 64.67% (p < 0.001), however this is only 6.56% greater than the no-information-rate.
CONCLUSION: Incorrect ADs reduce the interpreter's diagnostic accuracy indicating an automation bias. Non-CFs tend to agree more with the ADs in comparison to CFs, hence less expert physicians are more effected by automation bias. Incorrect ADs reduce the interpreter's confidence and also reduces the predictive power of confidence for predicting accuracy (even more so for non-CFs). Whilst a statistically significant model was developed, it is difficult to predict interpretation accuracy using machine learning on basic features such as interpreter confidence, age, reader experience and designation.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2018        PMID: 30122457     DOI: 10.1016/j.jelectrocard.2018.08.007

Source DB:  PubMed          Journal:  J Electrocardiol        ISSN: 0022-0736            Impact factor:   1.438


  14 in total

1.  Reader Perceptions and Impact of AI on CT Assessment of Air Trapping.

Authors:  Tara A Retson; Kyle A Hasenstab; Seth J Kligerman; Kathleen E Jacobs; Andrew C Yen; Sharon S Brouha; Lewis D Hahn; Albert Hsiao
Journal:  Radiol Artif Intell       Date:  2021-11-10

Review 2.  Artificial Intelligence in Medical Imaging and its Impact on the Rare Disease Community: Threats, Challenges and Opportunities.

Authors:  Navid Hasani; Faraz Farhadi; Michael A Morris; Moozhan Nikpanah; Arman Rhamim; Yanji Xu; Anne Pariser; Michael T Collins; Ronald M Summers; Elizabeth Jones; Eliot Siegel; Babak Saboury
Journal:  PET Clin       Date:  2022-01

Review 3.  The Importance of Incorporating Human Factors in the Design and Implementation of Artificial Intelligence for Skin Cancer Diagnosis in the Real World.

Authors:  Claire M Felmingham; Nikki R Adler; Zongyuan Ge; Rachael L Morton; Monika Janda; Victoria J Mar
Journal:  Am J Clin Dermatol       Date:  2021-03       Impact factor: 7.403

4.  Unity Is Intelligence: A Collective Intelligence Experiment on ECG Reading to Improve Diagnostic Performance in Cardiology.

Authors:  Luca Ronzio; Andrea Campagner; Federico Cabitza; Gian Franco Gensini
Journal:  J Intell       Date:  2021-04-01

5.  The effect of uncertainty in patient classification on diagnostic performance estimations.

Authors:  Leo C McHugh; Kevin Snyder; Thomas D Yager
Journal:  PLoS One       Date:  2019-05-22       Impact factor: 3.240

6.  General Practitioners' Attitudes Toward Artificial Intelligence-Enabled Systems: Interview Study.

Authors:  Christoph Buck; Eileen Doctor; Jasmin Hennrich; Jan Jöhnk; Torsten Eymann
Journal:  J Med Internet Res       Date:  2022-01-27       Impact factor: 5.428

7.  WaSP-ECG: A Wave Segmentation Pretraining Toolkit for Electrocardiogram Analysis.

Authors:  Rob Brisk; Raymond R Bond; Dewar Finlay; James A D McLaughlin; Alicja J Piadlo; David J McEneaney
Journal:  Front Physiol       Date:  2022-03-17       Impact factor: 4.566

8.  Toward a hemorrhagic trauma severity score: fusing five physiological biomarkers.

Authors:  Ankita Bhat; Daria Podstawczyk; Brandon K Walther; John R Aggas; David Machado-Aranda; Kevin R Ward; Anthony Guiseppi-Elie
Journal:  J Transl Med       Date:  2020-09-14       Impact factor: 5.531

9.  How cognitive psychology changed the face of medical education research.

Authors:  Henk G Schmidt; Silvia Mamede
Journal:  Adv Health Sci Educ Theory Pract       Date:  2020-11-26       Impact factor: 3.853

10.  Systematic Review of Clinical Decision Support Systems for Prehospital Acute Coronary Syndrome Identification.

Authors:  Charles Richard Knoery; Janet Heaton; Rob Polson; Raymond Bond; Aleeha Iftikhar; Khaled Rjoob; Victoria McGilligan; Aaron Peace; Stephen James Leslie
Journal:  Crit Pathw Cardiol       Date:  2020-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.