| Literature DB >> 31984365 |
Adyasha Maharana1, Kunlin Cai2, Joseph Hellerstein3, Yulin Hswen4,5,6, Michael Munsell7, Valentina Staneva3, Miki Verma8, Cynthia Vint9, Derry Wijaya2, Elaine O Nsoesie10,11.
Abstract
OBJECTIVES: Access to safe and nutritious food is essential for good health. However, food can become unsafe due to contamination with pathogens, chemicals or toxins, or mislabeling of allergens. Illness resulting from the consumption of unsafe foods is a global health problem. Here, we develop a machine learning approach for detecting reports of unsafe food products in consumer product reviews from Amazon.com.Entities:
Keywords: artificial intelligence; consumer product safety; food and drug administration; food safety; machine learning
Year: 2019 PMID: 31984365 PMCID: PMC6951857 DOI: 10.1093/jamiaopen/ooz030
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Database for linking Amazon reviews to FDA recalls.
Figure 2.Distribution of consumer reviews across recalled product categories.
Figure 3.Features of Amazon reviews for the study period. Temporal trends (a) and distribution of customer ratings (b) of Amazon reviews.
Figure 4.Reasons for FDA food product recalls.
Recall reasons not captured in Figure 3
| Contamination (others) contains a raw material that may contain >0.3 ppb chloramphenicol contaminated with undeclared steroids products in vacuum packages were undercooked. |
| Illegallevels of Aflatoxin above legal limit does not meet pH standard of 10 for boiled/preserved eggs found a chemical which does not have a tolerance level in US FDA testing found unapproved pesticides/not permitted in US pesticide not allowed in US but approved for usage in EU violative levels of lead. |
| Issues with manufacturing/transport liquid containing vessel may leach lead firm was manufacturing acidified foods without license may not have been transported at a safe temperature recalling firm lacked adequate Good Manufacturing Practices packet may have an incomplete seal which could allow air to enter the packet causing oxidation improperly pasteurized faulty screen at flour mill. |
| Voluntary recallnotification of opportunity to initiate voluntary recall - letter from FDA |
Performance of the various machine learning approaches employed for identifying unsafe food products
| Classifier description | Precision | Recall | F1 score |
|---|---|---|---|
| Linear SVM (Feature selection using Chi2 | 0.61 | 0.64 | 0.62 |
| Multinomial Naive Bayes (Feature selection using Chi2, | 0.66 | 0.66 | 0.66 |
| Weighted logistic regression (Feature selection using Chi2, | 0.58 | 0.74 | 0.65 |
| Weighted logistic regression (Feature selection using Chi2, | 0.64 | 0.71 | 0.67 |
| Weighted logistic regression (Feature selection using mutual information, | 0.60 | 0.68 | 0.64 |
| Weighted logistic regression with SMOTE (ratio = 1: 5) (tested on real data points only) | 0.62 | 0.68 | 0.65 |
| Weighted logistic regression with SMOTE (ratio = 1: 3) (tested on real data points only) | 0.62 | 0.71 | 0.66 |
| Weighted logistic regression with SMOTE (ratio = 1: 2) (tested on real data points only) | 0.62 | 0.70 | 0.66 |
| Weighted logistic regression with SMOTE (ratio = 1: 1) (tested on real data points only) | 0.63 | 0.66 | 0.64 |
| BERT (epoch = 10, max sequence length = 128) | 0.76 | 0.67 | 0.71 |
| BERT (epoch = 10, max sequence length = 128) with focal loss for dealing with imbalanced data ( | 0.75 | 0.74 | 0.73 |
| BERT (epoch = 20, max sequence length = 256) | 0.79 | 0.67 | 0.72 |
| BERT (epoch = 30, max sequence length = 256) | 0.78 | 0.71 | 0.74 |
| BERT (epoch = 30, max sequence length = 256) with focal loss for dealing with imbalanced data ( | 0.77 | 0.71 | 0.74 |
BERT is the best performing classifier. Chi2 refers to Chi-square. The accuracy ([true positives + true negative]/total reviews), precision (also known as positive predictive value = true positives/predicted positive condition), recall (also known as sensitivity = [true positive/[true positives + false negatives]), and F1-score (the harmonic mean of the precision and recall) are discussed.
Figure 5.Plot of the reconstruction error. This shows that sentences from unsafe and safe product reviews are not significantly different. This might explain the difficulty in the classification process.