| Literature DB >> 34109197 |
Nicolas Schneider1, Keywan Sohrabi2, Henning Schneider1, Klaus-Peter Zimmer3, Patrick Fischer1, Jan de Laffolie3.
Abstract
Introduction: The rising incidence of pediatric inflammatory bowel diseases (PIBD) facilitates the need for new methods of improving diagnosis latency, quality of care and documentation. Machine learning models have shown to be applicable to classifying PIBD when using histological data or extensive serology. This study aims to evaluate the performance of algorithms based on promptly available data more suited to clinical applications.Entities:
Keywords: CEDATA-GPGE registry; convolutional neural network; diagnostic assistance; machine learning; pediatric inflammatory bowel disease
Year: 2021 PMID: 34109197 PMCID: PMC8180568 DOI: 10.3389/fmed.2021.666190
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Workflow and Datasets displays the outline of the applied methodology as well as the resulting datasets and their segmentation into CD and UC diagnosis.
Figure 2Heatmap of inflammatory locations: displays the correlation of observed inflammations in the given locations. Correlation is visualized using Pearson's “r” coefficient. Additionally, the diagonal line of values shows the frequency of inflammation observed at the location in percent.
Achieved accuracy per method for 2018 follow-up dataset: table showing the executed machine learning methods and the achieved accuracy in percent as well as standard deviation during cross validation.
| SVM | 85.49 | 0.83 |
| RF | 86.53 | 0.87 |
| XGBoost | 83.94 | 0.87 |
| Dense NN | 88.78 | 5.43 |
| Conv. NN | 90.02 | 4.01 |
| Conv. NN optimized | 90.57 | 3.45 |
Of the naive implementations, the convolutional neural network perfomes best at 90.02% which is topped by its optimized implementation reaching 90.57 percent. The underliing data stems from registry inputs of 2018 and includes laboratory values.
Figure 3CNN performance without laboratory values: plots the performance of the convolutional neural network (CNN) when using sub datasets selected by time of input. None of the datasets contain laboratory values.
Figure 4CNN performance with laboratory values: illustrates the performance of the convolutional neural network (CNN) on datasets containing laboratory values. The datasets are sampled by year of collection.
Achieved accuracy per method for initial visitation dataset: table showing the executed ML-methods and the achieved accuracy on Initial-Total dataset in percent.
| SVM | 85.89 |
| RF | 83.33 |
| XGBoost | 82.89 |
| Dense NN | 86.89 |
| Conv. NN | 86.94 |
| Conv. NN optimized | 87.06 |
Of the naive implementations, the convolutional neural network perfomes best at 86.94% which is topped by its optimized implementation reaching 87.06%. The underliing data includes laboratory values.
Sensitivity and specificity of CNN per diagnosis: this table displays the sensitivity and specificity of the convolutional neural network (CNN) trained and tested on the data from the Follow-Up-2018 dataset.
| Crohn's disease | 91.4 | 89.73 |
| Ulcerative cloitis | 89.73 | 91.4 |