| Literature DB >> 31239486 |
Florentino Luciano Caetano Dos Santos1, Irmina Maria Michalek2, Kaija Laurila3, Katri Kaukinen3,4, Jari Hyttinen5, Katri Lindfors3.
Abstract
Widespread use of endomysial autoantibody (EmA) test in diagnostics of celiac disease is limited due to its subjectivity and its requirement of an expert evaluator. The study aimed to determine whether machine learning can be applied to create a new observer-independent method of automatic assessment and classification of the EmA test for celiac disease. The study material comprised of 2597 high-quality IgA-class EmA images collected in 2017-2018. According to standard procedure, highly-experienced professional classified samples into the following four classes: I - positive, II - negative, III - IgA deficient, and IV - equivocal. Machine learning was deployed to create a classification model. The sensitivity and specificity of the model were 82.84% and 99.40%, respectively. The accuracy was 96.80%. The classification error was 3.20%. The area under the curve was 99.67%, 99.61%, 100%, and 99.89%, for I, II, III, and IV class, respectively. The mean assessment time per image was 16.11 seconds. This is the first study deploying machine learning for the automatic classification of IgA-class EmA test for celiac disease. The results indicate that using machine learning enables quick and precise EmA test analysis that can be further developed to simplify EmA analysis.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31239486 PMCID: PMC6592927 DOI: 10.1038/s41598-019-45679-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1IgA-class EmA test classes.
Description of the dataset.
| Class | Number of samples | Percent of the total dataset | |
|---|---|---|---|
| I | Positive | 274 | 10.55% |
| II | Negative | 2260 | 87.02% |
| III | IgA deficient | 13 | 0.50% |
| IV | Equivocal | 50 | 1.93% |
Figure 2Confusion matrix of the classification model.
Analysis of performance of Model 1 (based on the whole sample size) and Model 2 (supplemental, randomly under-sampled).
| Measure of performance | Model 1 | Model 2 |
|---|---|---|
| Accuracy | 96.80% | 98.85% |
| Classification error | 3.20% | 1.15% |
| Sensitivity | 82.84% | 98.91% |
| Specificity | 99.40% | 98.81% |
| F1 score | 0.65 | 0.75 |
| Cohen’s kappa coefficient | 0.85 | 0.98 |
Figure 3Receiver operating characteristic curves for the classification model.
Figure 4Development and usage of the classification model.