| Literature DB >> 35071732 |
Nafiseh Jafari1, Mohammad Reza Besharati2, Mohammad Izadi2, Alireza Talebpour3.
Abstract
A self-report questionnaire survey was conducted online to collect big data from over 16000 Iranian families (who were the residents of 1000 urban and rural areas of Iran). The resulting data storage contained over 1 M records of data and over 1G records of automatically inferred information. Based on this data storage, a series of machine learning experiments was conducted to investigate the relationship between nutrition and the risk of contracting COVID-19. With highly accurate scores, the findings strongly suggest that foods and water sources containing certain natural bioactive and phytochemical agents may help to reduce the risk of apparent COVID-19 infection.Entities:
Keywords: Big data; COVID-19; Diet; Machine learning; Multilayer perceptron; Nutrition; Random forest
Year: 2022 PMID: 35071732 PMCID: PMC8767975 DOI: 10.1016/j.imu.2022.100857
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1Learning Performance vs. Window Size for the Running Average (an averaging filter for inputs).
Fig. 2Histogram for Feature 25 with Class-Tag Coloring (Daily Tea Drinking as a habit in lifestyle). The greater value indicates micro-communities with a higher prevalence of tea consumption.
Results of random forest with 10-fold cross-validation.
| Random | Window | # Of Features | # Of Instances | # Of Classes | Accuracy | Time (Computational Complexity) |
|---|---|---|---|---|---|---|
| EXP-1 | 1 | 9 | 2540 | 4 | 67 | 20 seconds |
| EXP-2 | 20 | 9 | 2540 | 4 | 47 | 20 seconds |
| EXP-3 | 20 | 83 | 16227 | 4 | 85.17 | 2 minutes |
| EXP-4 | 20 | 122 | 16227 | 4 | 86.31 | 5 minutes |
| EXP-5 | 1 | 125 | 16227 | 2 | 87.39 | 5 minutes |
| EXP-6 | 1 | 125 | 16227 | 4 | 74.35 | 5 minutes |
| EXP-7 | 20 | 125 | 16227 | 4 | 86.40 | 5 minutes |
| EXP-8 | 50 | 125 | 16227 | 4 | 94.33 | 5 minutes |
| EXP-9 | 100 | 125 | 16227 | 4 | 96.96 | 5 minutes |
| EXP-10 | 200 | 125 | 16227 | 4 | 98.18 | 5 minutes |
| EXP-11 | 400 | 125 | 16227 | 4 | 99.04 | 5 minutes |
Results of multilayered perceptron with 10-fold cross-validation.
| Multilayer | Window | # Of Features | # Of Instances | # Of Classes | Accuracy | Time (Computational Complexity) |
|---|---|---|---|---|---|---|
| EXP-12 | 1 | 9 | 2540 | 4 | 71 | 1 minutes |
| EXP-13 | 20 | 9 | 2540 | 4 | 37 | 10 minutes |
| EXP-14 | 20 | 83 | 16227 | 4 | 81.26 | 2 hours |
| EXP-15 | 20 | 122 | 16227 | 4 | 76.00 | 3 hours |
| EXP-16 | 1 | 125 | 16227 | 2 | 84.51 | 3 hours |
| EXP-17 | 1 | 125 | 16227 | 4 | 67.25 | 3 hours |
| EXP-18 | 20 | 125 | 16227 | 4 | 76.43 | 3 hours |
| EXP-19 | 50 | 125 | 16227 | 4 | 92.22 | 3 hours |
| EXP-20 | 100 | 125 | 16227 | 4 | 94.99 | 3 hours |
Deep Neural Network.
Using Colab.research.google.com.
Fig. 3The above diagram was plotted for the citizens of Tehran in the research dataset for 330K dietary conditions associated with a reduction in the risk of COVID-19. Each point represents a distinct group of dietary conditions, and each condition is further subdivided into four subparts (e.g., daily coffee consumption, daily dairy consumption, weekly consumption of fish, and high consumption of fast foods).
Results of metabolites data experiments.
| Task | Classification | Precision | Recall | ROC | |
|---|---|---|---|---|---|
| EXP-M1 | COVID-19 Fatality Prediction | J48 | 85 | 78 | 0.84 |
| EXP-M2 | COVID-19 Fatality Prediction | Dl4jMlp (Deep Neural Network) | 86 | 77 | 0.88 |
| EXP-M3 | COVID-19 Fatality Prediction | Multilayer Perceptron | 90 | 97 | 0.989 |
| EXP-M4 | COVID-19 Fatality Prediction | Logistic Regression | 90 | 97 | 0.994 |
| EXP-M5 | COVID-19 Fatality Prediction | Random Forest | 82 | 100 | 0.98 |
Fig. 4ID3 for 2540 instances of data with 9 features.
Fig. 5The J48 algorithm's decision tree suggests that the key control variables for "death" and "survival" in severe COVID-19 cases were the level of T3 thyroid hormone in the blood.
Results of dietary data experiments results.
| Task | Classification | Window Size for Running Average | Accuracy | |
|---|---|---|---|---|
| EXP-C1 | COVID-19 Mortality Rate Prediction | Random Forest | 1 | 64.65 |
| EXP-C2 | COVID-19 Mortality Rate Prediction | Random Forest | 10 | 92.3 |
a digest of results.