| Literature DB >> 31978198 |
Wan-Tzu Chang1,2, Yen-Po Yeh3,4, Hong-Yi Wu2, Yu-Fen Lin3, Thai Son Dinh2, Ie-Bin Lian1,2.
Abstract
BACKGROUND: Invoices had been used in food product traceability, however, none have addressed the automated alarm system for food safety by utilizing electronic invoice big data. In this paper, we present an alarm system for edible oil manufacture that can prevent a food safety crisis rather than trace problematic sources post-crisis.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31978198 PMCID: PMC6980643 DOI: 10.1371/journal.pone.0228035
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Performance of various classifiers on identifying correct invoices using different choice of m (number of topics).
| Topic | Accuracy | KNN (k = 5) | SVM (linear) | Logistic Regression | Neural Network | Random Forest |
|---|---|---|---|---|---|---|
| m:30 | Sensitivity | 91.6% | 81.1% | 81.2% | 88.3% | 93.8% |
| Specificity | 97.9% | 95.7% | 96.3% | 97% | 98.1% | |
| Error rate | 3.8% | 8.1% | 7.7% | 5.3% | 3% | |
| m:60 | Sensitivity | 92% | 84.5% | 85.1% | 91.6% | 95.4% |
| Specificity | 97.2% | 95.8% | 96.1% | 95.2% | 98.4% | |
| Error rate | 4.1% | 7.2% | 6.8% | 5.1% | 2.4% | |
| m:90 | Sensitivity | 91.8% | 87.7% | 89.3% | 91.2% | 95.2% |
| Specificity | 97.2% | 96.2% | 96.4% | 96.8% | 98.4% | |
| Error rate | 4.2% | 6% | 5.4% | 4.7% | 2.4% | |
| Sensitivity | 92.3% | 90.1% | 91% | 93.5% | 95.2% | |
| m:120 | Specificity | 97.5% | 96.2% | 96.9% | 97.1% | 98.5% |
| Error rate | 3.9% | 5.4% | 4.6% | 3.9% | 2.4% | |
| Sensitivity | 92.4% | 90.4% | 91.4% | 93.2% | 95.2% | |
| m:150 | Specificity | 97.3% | 96.2% | 97% | 97.1% | 98.6% |
| Error rate | 4% | 5.3% | 4.4% | 4% | 2.3% | |
| Sensitivity | 94.1% | 92.1% | 93.1% | 93.6% | 95.2% | |
| m:180 | Specificity | 97.9% | 96.7% | 97.6% | 97.9% | 98.5% |
| Error rate | 3.1% | 4.5% | 3.6% | 3.2% | 2.4% |
*Sensitivity: probability to identify related e-invoice
Specificity: probability to identify the non-related e-invoice
Error rate: total proportion of accuracy
Performance of various classifiers on identifying correct invoices using customized keywords.
| Custom | Accuracy | KNN (k = 5) | SVM (linear) | Logistic Regression | Neural Network | Random Forest |
|---|---|---|---|---|---|---|
| (no feature selection)m:60 | sensitivity | 91.7% | 92.7% | 93.2% | 93.6% | 93.9% |
| specificity | 97.9% | 98.2% | 98.4% | 98.3% | 98.1% | |
| error rate | 3.8% | 3.2% | 3% | 3% | 3% | |
| (feature selection)m:60→31 | sensitivity | 89.7% | 89.8% | 90.2% | 91% | 90.5% |
| specificity | 97.7% | 97.7% | 97.8% | 97.8% | 97.9% | |
| error rate | 4.4% | 4.4% | 4.2% | 4.4% | 4.1% |
Performance on identifying problematic manufacturers.
| Error rate | Sensitivity | Specificity | |
|---|---|---|---|
| 2.65% | 66.67% | 99.29% | |
| 2.21% | 77.78% | 99.06% | |
| 2.43% | 66.67% | 99.53% | |
| 0.44% | 96.30% | 99.76% | |
| 5.97% | 0.00% | 100.00% | |
| 3.97% | 70.37% | 99.53% |
*Sensitivity: probability to identify B as suspicious manufactures
Specificity: probability to identify A as benchmark manufactures.
Fig 4(A) Scatter plot of A- and B-labeled manufacturer, (B) Scatter plot of A-, B- and C-labeled manufacturer, with 95% and 99% prediction regions.