Jon N Marsh1,2, Ta-Chiang Liu1, Parker C Wilson1, S Joshua Swamidass1,2, Joseph P Gaut1,3. 1. Department of Pathology and Immunology, Washington University School of Medicine in St Louis, St Louis, Missouri. 2. Institute for Informatics (I 2 ), Washington University School of Medicine in St Louis, St Louis, Missouri. 3. Department of Medicine, Washington University School of Medicine in St Louis, St Louis, Missouri.
Abstract
Importance: A chronic shortage of donor kidneys is compounded by a high discard rate, and this rate is directly associated with biopsy specimen evaluation, which shows poor reproducibility among pathologists. A deep learning algorithm for measuring percent global glomerulosclerosis (an important predictor of outcome) on images of kidney biopsy specimens could enable pathologists to more reproducibly and accurately quantify percent global glomerulosclerosis, potentially saving organs that would have been discarded. Objective: To compare the performances of pathologists with a deep learning model on quantification of percent global glomerulosclerosis in whole-slide images of donor kidney biopsy specimens, and to determine the potential benefit of a deep learning model on organ discard rates. Design, Setting, and Participants: This prognostic study used whole-slide images acquired from 98 hematoxylin-eosin-stained frozen and 51 permanent donor biopsy specimen sections retrieved from 83 kidneys. Serial annotation by 3 board-certified pathologists served as ground truth for model training and for evaluation. Images of kidney biopsy specimens were obtained from the Washington University database (retrieved between June 2015 and June 2017). Cases were selected randomly from a database of more than 1000 cases to include biopsy specimens representing an equitable distribution within 0% to 5%, 6% to 10%, 11% to 15%, 16% to 20%, and more than 20% global glomerulosclerosis. Main Outcomes and Measures: Correlation coefficient (r) and root-mean-square error (RMSE) with respect to annotations were computed for cross-validated model predictions and on-call pathologists' estimates of percent global glomerulosclerosis when using individual and pooled slide results. Data were analyzed from March 2018 to August 2020. Results: The cross-validated model results of section images retrieved from 83 donor kidneys showed higher correlation with annotations (r = 0.916; 95% CI, 0.886-0.939) than on-call pathologists (r = 0.884; 95% CI, 0.825-0.923) that was enhanced when pooling glomeruli counts from multiple levels (r = 0.933; 95% CI, 0.898-0.956). Model prediction error for single levels (RMSE, 5.631; 95% CI, 4.735-6.517) was 14% lower than on-call pathologists (RMSE, 6.523; 95% CI, 5.191-7.783), improving to 22% with multiple levels (RMSE, 5.094; 95% CI, 3.972-6.301). The model decreased the likelihood of unnecessary organ discard by 37% compared with pathologists. Conclusions and Relevance: The findings of this prognostic study suggest that this deep learning model provided a scalable and robust method to quantify percent global glomerulosclerosis in whole-slide images of donor kidneys. The model performance improved by analyzing multiple levels of a section, surpassing the capacity of pathologists in the time-sensitive setting of examining donor biopsy specimens. The results indicate the potential of a deep learning model to prevent erroneous donor organ discard.
Importance: A chronic shortage of donor kidneys is compounded by a high discard rate, and this rate is directly associated with biopsy specimen evaluation, which shows poor reproducibility among pathologists. A deep learning algorithm for measuring percent global glomerulosclerosis (an important predictor of outcome) on images of kidney biopsy specimens could enable pathologists to more reproducibly and accurately quantify percent global glomerulosclerosis, potentially saving organs that would have been discarded. Objective: To compare the performances of pathologists with a deep learning model on quantification of percent global glomerulosclerosis in whole-slide images of donor kidney biopsy specimens, and to determine the potential benefit of a deep learning model on organ discard rates. Design, Setting, and Participants: This prognostic study used whole-slide images acquired from 98 hematoxylin-eosin-stained frozen and 51 permanent donor biopsy specimen sections retrieved from 83 kidneys. Serial annotation by 3 board-certified pathologists served as ground truth for model training and for evaluation. Images of kidney biopsy specimens were obtained from the Washington University database (retrieved between June 2015 and June 2017). Cases were selected randomly from a database of more than 1000 cases to include biopsy specimens representing an equitable distribution within 0% to 5%, 6% to 10%, 11% to 15%, 16% to 20%, and more than 20% global glomerulosclerosis. Main Outcomes and Measures: Correlation coefficient (r) and root-mean-square error (RMSE) with respect to annotations were computed for cross-validated model predictions and on-call pathologists' estimates of percent global glomerulosclerosis when using individual and pooled slide results. Data were analyzed from March 2018 to August 2020. Results: The cross-validated model results of section images retrieved from 83 donor kidneys showed higher correlation with annotations (r = 0.916; 95% CI, 0.886-0.939) than on-call pathologists (r = 0.884; 95% CI, 0.825-0.923) that was enhanced when pooling glomeruli counts from multiple levels (r = 0.933; 95% CI, 0.898-0.956). Model prediction error for single levels (RMSE, 5.631; 95% CI, 4.735-6.517) was 14% lower than on-call pathologists (RMSE, 6.523; 95% CI, 5.191-7.783), improving to 22% with multiple levels (RMSE, 5.094; 95% CI, 3.972-6.301). The model decreased the likelihood of unnecessary organ discard by 37% compared with pathologists. Conclusions and Relevance: The findings of this prognostic study suggest that this deep learning model provided a scalable and robust method to quantify percent global glomerulosclerosis in whole-slide images of donor kidneys. The model performance improved by analyzing multiple levels of a section, surpassing the capacity of pathologists in the time-sensitive setting of examining donor biopsy specimens. The results indicate the potential of a deep learning model to prevent erroneous donor organ discard.
Authors: Pooja Singh; John L Farber; Cataldo Doria; George C Francos; Rakesh Gulati; Carlo B Ramirez; Warren R Maley; Adam M Frank Journal: Clin Transplant Date: 2012-01-29 Impact factor: 2.863
Authors: A Hart; J M Smith; M A Skeans; S K Gustafson; A R Wilk; S Castro; A Robinson; J L Wainright; J J Snyder; B L Kasiske; A K Israni Journal: Am J Transplant Date: 2019-02 Impact factor: 8.086
Authors: John D Bukowy; Alex Dayton; Dustin Cloutier; Anna D Manis; Alexander Staruschenko; Julian H Lombard; Leah C Solberg Woods; Daniel A Beard; Allen W Cowley Journal: J Am Soc Nephrol Date: 2018-06-19 Impact factor: 10.121
Authors: Andre Esteva; Brett Kuprel; Roberto A Novoa; Justin Ko; Susan M Swetter; Helen M Blau; Sebastian Thrun Journal: Nature Date: 2017-01-25 Impact factor: 49.962
Authors: Syed Ali Husain; Mariana C Chiles; Samnang Lee; Stephen O Pastan; Rachel E Patzer; Bekir Tanriover; Lloyd E Ratner; Sumit Mohan Journal: Clin J Am Soc Nephrol Date: 2017-12-07 Impact factor: 8.237
Authors: D Serón; M Carrera; J M Griño; A M Castelao; M A Lopez-Costea; L Riera; J Alsina Journal: Nephrol Dial Transplant Date: 1993 Impact factor: 5.992
Authors: Ahmed Serag; Adrian Ion-Margineanu; Hammad Qureshi; Ryan McMillan; Marie-Judith Saint Martin; Jim Diamond; Paul O'Reilly; Peter Hamilton Journal: Front Med (Lausanne) Date: 2019-10-01
Authors: Geert Litjens; Clara I Sánchez; Nadya Timofeeva; Meyke Hermsen; Iris Nagtegaal; Iringo Kovacs; Christina Hulsbergen-van de Kaa; Peter Bult; Bram van Ginneken; Jeroen van der Laak Journal: Sci Rep Date: 2016-05-23 Impact factor: 4.379
Authors: Roman D Büllow; Jon N Marsh; S Joshua Swamidass; Joseph P Gaut; Peter Boor Journal: Curr Opin Nephrol Hypertens Date: 2022-02-14 Impact factor: 3.416
Authors: Xiang Li; Richard C Davis; Yuemei Xu; Zehan Wang; Nao Souma; Gina Sotolongo; Jonathan Bell; Matthew Ellis; David Howell; Xiling Shen; Kyle J Lafata; Laura Barisoni Journal: J Med Imaging (Bellingham) Date: 2021-12-20
Authors: Darren E Stewart; Julia Foutz; Layla Kamal; Samantha Weiss; Harrison S McGehee; Matthew Cooper; Gaurav Gupta Journal: Kidney Int Rep Date: 2022-05-30
Authors: Brendon Lutnick; David Manthey; Jan U Becker; Brandon Ginley; Katharina Moos; Jonathan E Zuckerman; Luis Rodrigues; Alexander J Gallan; Laura Barisoni; Charles E Alpers; Xiaoxin X Wang; Komuraiah Myakala; Bryce A Jones; Moshe Levi; Jeffrey B Kopp; Teruhiko Yoshida; Jarcy Zee; Seung Seok Han; Sanjay Jain; Avi Z Rosenberg; Kuang Yu Jen; Pinaki Sarder Journal: Commun Med (Lond) Date: 2022-08-19