Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Effect of Training Data Volume on Performance of Convolutional Neural Network Pneumothorax Classifiers.

Literature DB >> 35239091

Effect of Training Data Volume on Performance of Convolutional Neural Network Pneumothorax Classifiers.

Yee Liang Thian¹, Dian Wen Ng^2,3, James Thomas Patrick Decourcy Hallinan², Pooja Jagmohan², Soon Yiew Sia², Jalila Sayed Adnan Mohamed^2,4, Swee Tian Quek², Mengling Feng³.

Abstract

Large datasets with high-quality labels required to train deep neural networks are challenging to obtain in the radiology domain. This work investigates the effect of training dataset size on the performance of deep learning classifiers, focusing on chest radiograph pneumothorax detection as a proxy visual task in the radiology domain. Two open-source datasets (ChestX-ray14 and CheXpert) comprising 291,454 images were merged and convolutional neural networks trained with stepwise increase in training dataset sizes. Model iterations at each dataset volume were evaluated on an external test set of 525 emergency department chest radiographs. Learning curve analysis was performed to fit the observed AUCs for all models generated. For all three network architectures tested, model AUCs and accuracy increased rapidly from 2 × 103 to 20 × 103 training samples, with more gradual increase until the maximum training dataset size of 291 × 103 images. AUCs for models trained with the maximum tested dataset size of 291 × 103 images were significantly higher than models trained with 20 × 103 images: ResNet-50: AUC20k = 0.86, AUC291k = 0.95, p < 0.001; DenseNet-121 AUC20k = 0.85, AUC291k = 0.93, p < 0.001; EfficientNet AUC20k = 0.92, AUC 291 k = 0.98, p < 0.001. Our study established learning curves describing the relationship between dataset training size and model performance of deep learning convolutional neural networks applied to a typical radiology binary classification task. These curves suggest a point of diminishing performance returns for increasing training data volumes, which algorithm developers should consider given the high costs of obtaining and labelling radiology data.

Entities: Chemical

Keywords: Convolutional neural network; Dataset size; Deep learning; Pneumothorax; Volume

Mesh：

Year: 2022 PMID： 35239091 PMCID： PMC9485337 DOI： 10.1007/s10278-022-00594-y

Source DB: PubMed Journal: J Digit Imaging ISSN： 0897-1889 Impact factor: 4.903

12 in total

1. Challenges Related to Artificial Intelligence Research in Medical Imaging and the Importance of Image Analysis Competitions.

Authors: Luciano M Prevedello; Safwan S Halabi; George Shih; Carol C Wu; Marc D Kohli; Falgun H Chokshi; Bradley J Erickson; Jayashree Kalpathy-Cramer; Katherine P Andriole; Adam E Flanders
Journal: Radiol Artif Intell Date: 2019-01-30

2. Chest Radiograph Interpretation with Deep Learning Models: Assessment with Radiologist-adjudicated Reference Standards and Population-adjusted Evaluation.

Authors: Anna Majkowska; Sid Mittal; David F Steiner; Joshua J Reicher; Scott Mayer McKinney; Gavin E Duggan; Krish Eswaran; Po-Hsuan Cameron Chen; Yun Liu; Sreenivasa Raju Kalidindi; Alexander Ding; Greg S Corrado; Daniel Tse; Shravya Shetty
Journal: Radiology Date: 2019-12-03 Impact factor: 11.105

3. Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review.

Authors: Indranil Balki; Afsaneh Amirabadi; Jacob Levman; Anne L Martel; Ziga Emersic; Blaz Meden; Angel Garcia-Pedrero; Saul C Ramirez; Dehan Kong; Alan R Moody; Pascal N Tyrrell
Journal: Can Assoc Radiol J Date: 2019-09-12 Impact factor: 2.248

4. Deep-Learning-Based Neural Tissue Segmentation of MRI in Multiple Sclerosis: Effect of Training Set Size.

Authors: Ponnada A Narayana; Ivan Coronado; Sheeba J Sujit; Jerry S Wolinsky; Fred D Lublin; Refaat E Gabr
Journal: J Magn Reson Imaging Date: 2019-10-18 Impact factor: 4.813

Review 5. Privacy in the age of medical big data.

Authors: W Nicholson Price; I Glenn Cohen
Journal: Nat Med Date: 2019-01-07 Impact factor: 87.241

6. Assessment of Convolutional Neural Networks for Automated Classification of Chest Radiographs.

Authors: Jared A Dunnmon; Darvin Yi; Curtis P Langlotz; Christopher Ré; Daniel L Rubin; Matthew P Lungren
Journal: Radiology Date: 2018-11-13 Impact factor: 29.146

7. Preparing Medical Imaging Data for Machine Learning.

Authors: Martin J Willemink; Wojciech A Koszek; Cailin Hardell; Jie Wu; Dominik Fleischmann; Hugh Harvey; Les R Folio; Ronald M Summers; Daniel L Rubin; Matthew P Lungren
Journal: Radiology Date: 2020-02-18 Impact factor: 11.105

8. Predicting sample size required for classification performance.

Authors: Rosa L Figueroa; Qing Zeng-Treitler; Sasikiran Kandula; Long H Ngo
Journal: BMC Med Inform Decis Mak Date: 2012-02-15 Impact factor: 2.796

9. Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session.

Authors: Marc D Kohli; Ronald M Summers; J Raymond Geis
Journal: J Digit Imaging Date: 2017-08 Impact factor: 4.056

10. Key challenges for delivering clinical impact with artificial intelligence.

Authors: Christopher J Kelly; Alan Karthikesalingam; Mustafa Suleyman; Greg Corrado; Dominic King
Journal: BMC Med Date: 2019-10-29 Impact factor: 8.775

1 in total

1. Global Research Trends of Artificial Intelligence on Histopathological Images: A 20-Year Bibliometric Analysis.

Authors: Wentong Zhou; Ziheng Deng; Yong Liu; Hui Shen; Hongwen Deng; Hongmei Xiao
Journal: Int J Environ Res Public Health Date: 2022-09-15 Impact factor: 4.614

1 in total