Literature DB >> 35239091

Effect of Training Data Volume on Performance of Convolutional Neural Network Pneumothorax Classifiers.

Yee Liang Thian1, Dian Wen Ng2,3, James Thomas Patrick Decourcy Hallinan2, Pooja Jagmohan2, Soon Yiew Sia2, Jalila Sayed Adnan Mohamed2,4, Swee Tian Quek2, Mengling Feng3.   

Abstract

Large datasets with high-quality labels required to train deep neural networks are challenging to obtain in the radiology domain. This work investigates the effect of training dataset size on the performance of deep learning classifiers, focusing on chest radiograph pneumothorax detection as a proxy visual task in the radiology domain. Two open-source datasets (ChestX-ray14 and CheXpert) comprising 291,454 images were merged and convolutional neural networks trained with stepwise increase in training dataset sizes. Model iterations at each dataset volume were evaluated on an external test set of 525 emergency department chest radiographs. Learning curve analysis was performed to fit the observed AUCs for all models generated. For all three network architectures tested, model AUCs and accuracy increased rapidly from 2 × 103 to 20 × 103 training samples, with more gradual increase until the maximum training dataset size of 291 × 103 images. AUCs for models trained with the maximum tested dataset size of 291 × 103 images were significantly higher than models trained with 20 × 103 images: ResNet-50: AUC20k = 0.86, AUC291k = 0.95, p < 0.001; DenseNet-121 AUC20k = 0.85, AUC291k = 0.93, p < 0.001; EfficientNet AUC20k = 0.92, AUC 291 k = 0.98, p < 0.001. Our study established learning curves describing the relationship between dataset training size and model performance of deep learning convolutional neural networks applied to a typical radiology binary classification task. These curves suggest a point of diminishing performance returns for increasing training data volumes, which algorithm developers should consider given the high costs of obtaining and labelling radiology data.
© 2022. The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine.

Entities:  

Keywords:  Convolutional neural network; Dataset size; Deep learning; Pneumothorax; Volume

Mesh:

Year:  2022        PMID: 35239091      PMCID: PMC9485337          DOI: 10.1007/s10278-022-00594-y

Source DB:  PubMed          Journal:  J Digit Imaging        ISSN: 0897-1889            Impact factor:   4.903


  12 in total

1.  Challenges Related to Artificial Intelligence Research in Medical Imaging and the Importance of Image Analysis Competitions.

Authors:  Luciano M Prevedello; Safwan S Halabi; George Shih; Carol C Wu; Marc D Kohli; Falgun H Chokshi; Bradley J Erickson; Jayashree Kalpathy-Cramer; Katherine P Andriole; Adam E Flanders
Journal:  Radiol Artif Intell       Date:  2019-01-30

2.  Chest Radiograph Interpretation with Deep Learning Models: Assessment with Radiologist-adjudicated Reference Standards and Population-adjusted Evaluation.

Authors:  Anna Majkowska; Sid Mittal; David F Steiner; Joshua J Reicher; Scott Mayer McKinney; Gavin E Duggan; Krish Eswaran; Po-Hsuan Cameron Chen; Yun Liu; Sreenivasa Raju Kalidindi; Alexander Ding; Greg S Corrado; Daniel Tse; Shravya Shetty
Journal:  Radiology       Date:  2019-12-03       Impact factor: 11.105

3.  Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review.

Authors:  Indranil Balki; Afsaneh Amirabadi; Jacob Levman; Anne L Martel; Ziga Emersic; Blaz Meden; Angel Garcia-Pedrero; Saul C Ramirez; Dehan Kong; Alan R Moody; Pascal N Tyrrell
Journal:  Can Assoc Radiol J       Date:  2019-09-12       Impact factor: 2.248

4.  Deep-Learning-Based Neural Tissue Segmentation of MRI in Multiple Sclerosis: Effect of Training Set Size.

Authors:  Ponnada A Narayana; Ivan Coronado; Sheeba J Sujit; Jerry S Wolinsky; Fred D Lublin; Refaat E Gabr
Journal:  J Magn Reson Imaging       Date:  2019-10-18       Impact factor: 4.813

Review 5.  Privacy in the age of medical big data.

Authors:  W Nicholson Price; I Glenn Cohen
Journal:  Nat Med       Date:  2019-01-07       Impact factor: 87.241

6.  Assessment of Convolutional Neural Networks for Automated Classification of Chest Radiographs.

Authors:  Jared A Dunnmon; Darvin Yi; Curtis P Langlotz; Christopher Ré; Daniel L Rubin; Matthew P Lungren
Journal:  Radiology       Date:  2018-11-13       Impact factor: 29.146

7.  Preparing Medical Imaging Data for Machine Learning.

Authors:  Martin J Willemink; Wojciech A Koszek; Cailin Hardell; Jie Wu; Dominik Fleischmann; Hugh Harvey; Les R Folio; Ronald M Summers; Daniel L Rubin; Matthew P Lungren
Journal:  Radiology       Date:  2020-02-18       Impact factor: 11.105

8.  Predicting sample size required for classification performance.

Authors:  Rosa L Figueroa; Qing Zeng-Treitler; Sasikiran Kandula; Long H Ngo
Journal:  BMC Med Inform Decis Mak       Date:  2012-02-15       Impact factor: 2.796

9.  Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session.

Authors:  Marc D Kohli; Ronald M Summers; J Raymond Geis
Journal:  J Digit Imaging       Date:  2017-08       Impact factor: 4.056

10.  Key challenges for delivering clinical impact with artificial intelligence.

Authors:  Christopher J Kelly; Alan Karthikesalingam; Mustafa Suleyman; Greg Corrado; Dominic King
Journal:  BMC Med       Date:  2019-10-29       Impact factor: 8.775

View more
  1 in total

1.  Global Research Trends of Artificial Intelligence on Histopathological Images: A 20-Year Bibliometric Analysis.

Authors:  Wentong Zhou; Ziheng Deng; Yong Liu; Hui Shen; Hongwen Deng; Hongmei Xiao
Journal:  Int J Environ Res Public Health       Date:  2022-09-15       Impact factor: 4.614

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.