PURPOSE: Recent studies on diabetic retinopathy (DR) screening in fundus photographs suggest that disagreements between algorithms and clinicians are now comparable to disagreements among clinicians. The purpose of this study is to (1) determine whether this observation also holds for automated DR severity assessment algorithms, and (2) show the interest of such algorithms in clinical practice. METHODS: A dataset of 85 consecutive DR examinations (168 eyes, 1176 multimodal eye fundus photographs) was collected at Brest University Hospital (Brest, France). Two clinicians with different experience levels determined DR severity in each eye, according to the International Clinical Diabetic Retinopathy Disease Severity (ICDRS) scale. Based on Cohen's kappa (κ) measurements, the performance of clinicians at assessing DR severity was compared to the performance of state-of-the-art content-based image retrieval (CBIR) algorithms from our group. RESULTS: At assessing DR severity in each patient, intraobserver agreement was κ = 0.769 for the most experienced clinician. Interobserver agreement between clinicians was κ = 0.526. Interobserver agreement between the most experienced clinicians and the most advanced algorithm was κ = 0.592. Besides, the most advanced algorithm was often able to predict agreements and disagreements between clinicians. CONCLUSIONS: Automated DR severity assessment algorithms, trained to imitate experienced clinicians, can be used to predict when young clinicians would agree or disagree with their more experienced fellow members. Such algorithms may thus be used in clinical practice to help validate or invalidate their diagnoses. CBIR algorithms, in particular, may also be used for pooling diagnostic knowledge among peers, with applications in training and coordination of clinicians' prescriptions.
PURPOSE: Recent studies on diabetic retinopathy (DR) screening in fundus photographs suggest that disagreements between algorithms and clinicians are now comparable to disagreements among clinicians. The purpose of this study is to (1) determine whether this observation also holds for automated DR severity assessment algorithms, and (2) show the interest of such algorithms in clinical practice. METHODS: A dataset of 85 consecutive DR examinations (168 eyes, 1176 multimodal eye fundus photographs) was collected at Brest University Hospital (Brest, France). Two clinicians with different experience levels determined DR severity in each eye, according to the International Clinical Diabetic Retinopathy Disease Severity (ICDRS) scale. Based on Cohen's kappa (κ) measurements, the performance of clinicians at assessing DR severity was compared to the performance of state-of-the-art content-based image retrieval (CBIR) algorithms from our group. RESULTS: At assessing DR severity in each patient, intraobserver agreement was κ = 0.769 for the most experienced clinician. Interobserver agreement between clinicians was κ = 0.526. Interobserver agreement between the most experienced clinicians and the most advanced algorithm was κ = 0.592. Besides, the most advanced algorithm was often able to predict agreements and disagreements between clinicians. CONCLUSIONS: Automated DR severity assessment algorithms, trained to imitate experienced clinicians, can be used to predict when young clinicians would agree or disagree with their more experienced fellow members. Such algorithms may thus be used in clinical practice to help validate or invalidate their diagnoses. CBIR algorithms, in particular, may also be used for pooling diagnostic knowledge among peers, with applications in training and coordination of clinicians' prescriptions.
Authors: Emanuele Trucco; Alfredo Ruggeri; Thomas Karnowski; Luca Giancardo; Edward Chaum; Jean Pierre Hubschman; Bashir Al-Diri; Carol Y Cheung; Damon Wong; Michael Abràmoff; Gilbert Lim; Dinesh Kumar; Philippe Burlina; Neil M Bressler; Herbert F Jelinek; Fabrice Meriaudeau; Gwénolé Quellec; Tom Macgillivray; Bal Dhillon Journal: Invest Ophthalmol Vis Sci Date: 2013-05-01 Impact factor: 4.799
Authors: Mark B Horton; Christopher J Brady; Jerry Cavallerano; Michael Abramoff; Gail Barker; Michael F Chiang; Charlene H Crockett; Seema Garg; Peter Karth; Yao Liu; Clark D Newman; Siddarth Rathi; Veeral Sheth; Paolo Silva; Kristen Stebbins; Ingrid Zimmer-Galler Journal: Telemed J E Health Date: 2020-03-25 Impact factor: 3.536
Authors: Michael D Abràmoff; Theodore Leng; Daniel S W Ting; Kyu Rhee; Mark B Horton; Christopher J Brady; Michael F Chiang Journal: Telemed J E Health Date: 2020-03-25 Impact factor: 3.536
Authors: Dawn A Sim; Pearse A Keane; Adnan Tufail; Catherine A Egan; Lloyd Paul Aiello; Paolo S Silva Journal: Curr Diab Rep Date: 2015-03 Impact factor: 5.430
Authors: Ramon Casanova; Santiago Saldana; Emily Y Chew; Ronald P Danis; Craig M Greven; Walter T Ambrosius Journal: PLoS One Date: 2014-06-18 Impact factor: 3.240