Gene Kitamura1, Christopher Deible2. 1. University of Pittsburgh Medical Center (UPMC) and University of Pittsburgh, UPMC Department of Radiology, 200 Lothrop St., Pittsburgh, PA 15213, USA. Electronic address: kitamurag@upmc.edu. 2. University of Pittsburgh Medical Center (UPMC) and University of Pittsburgh, UPMC Department of Radiology, 200 Lothrop St., Pittsburgh, PA 15213, USA. Electronic address: deiblec@upmc.edu.
Abstract
PURPOSE: To validate a machine learning model trained on an open source dataset and subsequently optimize it to chest X-rays with large pneumothoraces from our institution. METHODS: The study was retrospective in nature. The open-source chest X-ray (CXR8) dataset was dichotomized to cases with pneumothorax (PTX) and all other cases (non-PTX), resulting in 41,946 non-PTX and 4696 PTX cases for the training set and 11,120 non-PTX and 541 PTX cases for the validation set. A limited supervision machine learning model was constructed to incorporate both localized and unlocalized pathology. Cases were then queried from our health system from 2013 to 2017. A total of 159 pneumothorax and 682 non-pneumothorax cases were available for the training set. For the validation set, 48 pneumothorax and 1287 non-pneumothorax cases were available. The model was trained, a receiver operator curve (ROC) was created, and output metrics, including area under the curve (AUC), sensitivity and specificity were calculated. RESULTS: Initial training of the model using the CXR8 dataset resulted in an AUC of 0.90 for pneumothorax detection. Naively inferring our own validation dataset on the CXR8 trained model output an AUC of 0.59. After re-training the model with our own training dataset, the validation dataset inference output an AUC of 0.90. CONCLUSION: Our study showed that even though you may get great results on open-source datasets, those models may not translate well to real world data without an intervening retraining process.
PURPOSE: To validate a machine learning model trained on an open source dataset and subsequently optimize it to chest X-rays with large pneumothoraces from our institution. METHODS: The study was retrospective in nature. The open-source chest X-ray (CXR8) dataset was dichotomized to cases with pneumothorax (PTX) and all other cases (non-PTX), resulting in 41,946 non-PTX and 4696 PTX cases for the training set and 11,120 non-PTX and 541 PTX cases for the validation set. A limited supervision machine learning model was constructed to incorporate both localized and unlocalized pathology. Cases were then queried from our health system from 2013 to 2017. A total of 159 pneumothorax and 682 non-pneumothorax cases were available for the training set. For the validation set, 48 pneumothorax and 1287 non-pneumothorax cases were available. The model was trained, a receiver operator curve (ROC) was created, and output metrics, including area under the curve (AUC), sensitivity and specificity were calculated. RESULTS: Initial training of the model using the CXR8 dataset resulted in an AUC of 0.90 for pneumothorax detection. Naively inferring our own validation dataset on the CXR8 trained model output an AUC of 0.59. After re-training the model with our own training dataset, the validation dataset inference output an AUC of 0.90. CONCLUSION: Our study showed that even though you may get great results on open-source datasets, those models may not translate well to real world data without an intervening retraining process.
Authors: Stéfan van der Walt; Johannes L Schönberger; Juan Nunez-Iglesias; François Boulogne; Joshua D Warner; Neil Yager; Emmanuelle Gouillart; Tony Yu Journal: PeerJ Date: 2014-06-19 Impact factor: 2.984
Authors: Sulantha Mathotaarachchi; Tharick A Pascoal; Monica Shin; Andrea L Benedet; Min Su Kang; Thomas Beaudry; Vladimir S Fonov; Serge Gauthier; Pedro Rosa-Neto Journal: Neurobiol Aging Date: 2017-07-11 Impact factor: 4.673
Authors: Manisha Bahl; Regina Barzilay; Adam B Yedidia; Nicholas J Locascio; Lili Yu; Constance D Lehman Journal: Radiology Date: 2017-10-17 Impact factor: 11.105
Authors: David M Schnyer; Peter C Clasen; Christopher Gonzalez; Christopher G Beevers Journal: Psychiatry Res Neuroimaging Date: 2017-03-23 Impact factor: 2.376
Authors: Germán González; Samuel Y Ash; Gonzalo Vegas-Sánchez-Ferrero; Jorge Onieva Onieva; Farbod N Rahaghi; James C Ross; Alejandro Díaz; Raúl San José Estépar; George R Washko Journal: Am J Respir Crit Care Med Date: 2018-01-15 Impact factor: 21.405
Authors: Jakub Olczak; Niklas Fahlberg; Atsuto Maki; Ali Sharif Razavian; Anthony Jilert; André Stark; Olof Sköldenberg; Max Gordon Journal: Acta Orthop Date: 2017-07-06 Impact factor: 3.717
Authors: Johannes Rueckel; Christian Huemmer; Andreas Fieselmann; Florin-Cristian Ghesu; Awais Mansoor; Balthasar Schachtner; Philipp Wesp; Lena Trappmann; Basel Munawwar; Jens Ricke; Michael Ingrisch; Bastian O Sabel Journal: Eur Radiol Date: 2021-03-27 Impact factor: 5.315
Authors: Yee Liang Thian; Dianwen Ng; James Thomas Patrick Decourcy Hallinan; Pooja Jagmohan; Soon Yiew Sia; Cher Heng Tan; Yong Han Ting; Pin Lin Kei; Geoiphy George Pulickal; Vincent Tze Yang Tiong; Swee Tian Quek; Mengling Feng Journal: Radiol Artif Intell Date: 2021-04-14