| Literature DB >> 35356732 |
Insu Choi1, Jihye Kim2, Woo Chang Kim1.
Abstract
In this study, we observed the changes in dietary patterns among Korean adults in the previous decade. We evaluated dietary intake using 24-h recall data from the fourth (2007-2009) and seventh (2016-2018) Korea National Health and Nutrition Examination Survey. Machine learning-based methodologies were used to extract these dietary patterns. Particularly, we observed three dietary patterns from each survey similar to the traditional and Western dietary patterns in 2007-2009 and 2016-2018, respectively. Our results reveal a considerable increase in the number of Western dietary patterns compared with the previous decade. Thus, our study contributes to the use of novel methods using natural language processing (NLP) techniques for dietary pattern extraction to obtain more useful dietary information, unlike the traditional methodology.Entities:
Keywords: dietary pattern; machine learning; natural language processing (NLP); topic modeling; word embedding
Year: 2022 PMID: 35356732 PMCID: PMC8959352 DOI: 10.3389/fnut.2022.765794
Source DB: PubMed Journal: Front Nutr ISSN: 2296-861X
Characteristics of survey participants.
|
|
|
|
|---|---|---|
| All | 16,187 | 16,809 |
|
| ||
| Male | 6,592 (40.7) | 7,144 (42.5) |
| Female | 9,595 (59.3) | 9,665 (57.5) |
|
| ||
| 19–39 | 4,643 (27.5) | 5,296 (32.7) |
| 40–59 | 6,189 (36.8) | 5,814 (35.9) |
| 60+ | 5,997 (35.7) | 5,077 (31.4) |
|
| ||
| General | 9,750 (60.2) | 7,831 (46.6) |
| Apartment | 6,437 (39.8) | 8,978 (53.4) |
|
| ||
| Elementary School | 4,589 (28.3) | 3,288 (19.6) |
| High School | 6,830 (42.2) | 6,342 (37.8) |
| Over Associate Degree/Bachelor Degree | 3,666 (20.7) | 5,639 (33.5) |
The difference between the total number of respondents and the number of respondents per category means no response.
Figure 1Data preprocessing process.
Figure 2Two-dimensional scatter plot of food name vectors.
Figure 3Intuition behind LDA.
Figure 4Calculated perplexity (left) and C (right) for selecting optimal number (2007–2009 dataset).
Figure 5Calculated perplexity (left) and C (right) for selecting optimal number (2016–2018 dataset).
Topic-based dietary patterns of 2007–2009.
|
|
|
|
|
|---|---|---|---|
|
|
|
| |
|
|
|
| |
| 1 | Americano | Instant coffee | Soybean paste soup (Doenjang Soup) |
| 2 | Kimchi | White rice | Mix of red pepper paste and soybean paste (Ssamjang) |
| 3 | White rice | Ramen | Cabbage |
| 4 | Egg | Kimchi | Kimchi |
| 5 | Multigrain rice | Pumpkin | White rice |
| 6 | Stir-fried anchovy | Cutlassfish | Lettuce |
| 7 | Bean sprout | Cucumber | Garlic |
| 8 | Sweet potato | Snack | Sesame leaf |
| 9 | Marinated meat (Jangjorim) | Cabbage | Red pepper |
| 10 | Spinach | Beer | Multigrain rice |
Topic-based dietary patterns of 2016–2018.
|
|
|
|
|
|---|---|---|---|
|
|
|
| |
|
|
|
| |
| 1 | Mix of red pepper paste and soybean paste (Ssamjang) | Kimchi | Americano |
| 2 | Pork belly | Instant coffee | Fried chicken |
| 3 | Lettuce | Milk | Mayonnaise |
| 4 | Red pepper | White rice | Fish cake soup |
| 5 | Cold noodle (Naengmyeon) | Multigrain rice | Ramen |
| 6 | Onion | Soybean paste soup | Snack |
| 7 | Soju | Kimchi stew | Chicken breast |
| 8 | Grilled mushrooms | Apple | Soda |
| 9 | Orange juice | Roasted seaweed | Sausage |
| 10 | Duck meat | Stir-fried anchovy | Beer |
Figure 6Number of tokens and proportions of entire tokens for the top 10 menus (2007–2009).
Figure 7Number of tokens and proportions of entire tokens for the top 10 menus (2016–2018).
Figure 8Visualization results of inter-topic distance map via multidimensional scaling (MDS) (2007–2009) (14).
Figure 9Visualization results of inter-topic distance map via multidimensional scaling (MDS) (2016–2018) (14).