| Literature DB >> 31775234 |
Cheong Kim1,2, Francis Joseph Costello1, Kun Chang Lee1,3,4, Yuan Li1, Chenyao Li1.
Abstract
With the remarkable improvement in people's socioeconomic living standards around the world, adolescent obesity has increasingly become an important public health issue that cannot be ignored. Thus, we have implemented its use in an attempt to explore the viability of scenario-based simulations through the use of a data mining approach. In doing so, we wanted to explore the merits of using a General Bayesian Network (GBN) with What-If analysis while exploring how it can be utilized in other areas of public health. We analyzed data from the 2017 Korean Youth Health Behavior Survey conducted directly by the Korea Centers for Disease Control & Prevention, including 19 attributes and 11,206 individual data points. Our simulations found that by manipulating the amount of pocket money-between $60 and $80-coupled with a low-income background, it has a high potential to increase obesity compared with other simulated factors. Additionally, when we manipulated an increase in studying time with a mediocre academic performance, it was found to potentially increase pressure on adolescents, which subsequently led to an increased obesity outcome. Lastly, we found that when we manipulated an increase in a father's education level while manipulating a decrease in mother's education level, this had a large effect on the potential adolescent obesity level. Although obesity was the chosen case, this paper acts more as a proof of concept in analyzing public health through GBN and What-If analysis. Therefore, it aims to guide health professionals into potentially expanding their ability to simulate certain outcomes based on predicted changes in certain factors concerning future public health issues.Entities:
Keywords: General Bayesian Network; adolescent obesity; data mining; health informatics; public health; what-if analysis
Mesh:
Year: 2019 PMID: 31775234 PMCID: PMC6926973 DOI: 10.3390/ijerph16234684
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Growth chart for children and adolescents in 2007.
| Age | Gender | BMI Percentile | ||
|---|---|---|---|---|
| 5th | 85th | 95th | ||
| 12 | Boy | 15.35 | 23.32 | 26.35 |
| Girl | 15.20 | 22.22 | 24.77 | |
| 13 | Boy | 15.82 | 23.93 | 27.02 |
| Girl | 15.71 | 22.83 | 25.38 | |
| 14 | Boy | 16.32 | 24.40 | 27.48 |
| Girl | 16.25 | 23.31 | 25.83 | |
| 15 | Boy | 16.83 | 24.74 | 27.77 |
| Girl | 16.78 | 23.67 | 26.11 | |
| 16 | Boy | 17.33 | 24.95 | 27.89 |
| Girl | 17.27 | 23.89 | 26.24 | |
| 17 | Boy | 17.80 | 25.08 | 27.89 |
| Girl | 17.68 | 23.99 | 26.24 | |
| 18 | Boy | 18.20 | 25.18 | 27.85 |
| Girl | 17.96 | 23.98 | 26.15 | |
Performance evaluation of classifiers.
| Accuracy | F-Measure | AUC | |
|---|---|---|---|
| GBN-MB |
|
|
|
| GBN | 52.579% | 0.523 | 0.743 |
| LR | 46.689% | 0.436 | 0.688 |
| DT | 46.779% | 0.467 | 0.632 |
| SVM | 45.431% | 0.134 | 0.661 |
| NN | 50.535% | 0.495 | 0.723 |
| NB | 45.627% | 0.451 | 0.674 |
| BA | 52.588% | 0.521 | 0.742 |
| RSS | 51.776% | 0.489 | 0.736 |
| RF | 52.570% | 0.518 | 0.743 |
Sensitivity of ‘Obesity_Level’ to findings at other nodes.
| Nodes | Mutual Information | Entropy (%) | Variance of Beliefs |
|---|---|---|---|
| Obesity_Level | 1.80976 | 100 | 0.4895308 |
| Pocket_Money (KRW) | 0.22196 | 12.3 | 0.0246610 |
| Sleeping_Quality | 0.03398 | 1.89 | 0.0025873 |
| Sitting_Time_Study (min) | 0.02494 | 1.38 | 0.0027259 |
| Academic_Performance | 0.02423 | 1.35 | 0.0014365 |
| Wealth | 0.02413 | 1.34 | 0.0017131 |
| Pressure | 0.01420 | 0.766 | 0.0010950 |
| Education_Mother | 0.00766 | 0.425 | 0.0007854 |
| Smartphone_Service | 0.00645 | 0.357 | 0.0006391 |
| Smartphone_Time (min) | 0.00478 | 0.283 | 0.0001776 |
| Education_Father | 0.00396 | 0.221 | 0.0003561 |
What-If analysis scenario of factors affecting adolescent obesity.
| What-If Analysis Scenario | Obesity Result (%) | |
|---|---|---|
| Scenario 1 | Select the middle level of Pocket_Money | 22.8 → 59.7 |
| Scenario 2 | Set Sleeping_Quality to ‘Very low’ | 22.8 → 29.9 |
| Scenario 3 | Maximize Sitting_Time_Study | 22.8 → 31.9 |
| Scenario 4 | Select the middle level of Pocket_Money and set Wealth to “Low” | 22.8 → 74.7 |
| Scenario 5 | Maximize Sitting_Time_Study and select the middle level of Academic_Performance | 22.8 → 35.3 |
| Scenario 6 | Maximize Smartphone _Time & set Smartphone_Service to “Study” | 22.8 → 32.8 |
| Scenario 7 | Maximize Education_Father and minimize Education_Mother | 22.8 → 32.8 |
Note: obesity result (%) stands for the probability of causing obesity that was calculated by our General Bayesian Network (GBN) models.
Figure 1What-If analysis: this is the initial state of the predicted data we extracted from the GBN analysis. From this canvas, we are able to manipulate certain parameters to find out what effect this has on the relationship with other nodes, especially with the dependent variable node obesity.
Figure 2What-If analysis when Scenario 1 is applied: note that the grey variable represents the variable that has been manipulated and, thus, through this, we can try to understand what effect this has on the other relationships in this What-If analysis graph, including the effect it has on the final outcome obesity. The graphs that follow on from this one have the same methodological approach.
Figure 3What-If analysis: when scenario 2 is applied.
Figure 4What-If analysis: when scenario 3 is applied.
Figure 5What-If analysis: when scenario 4 is applied.
Figure 6What-If analysis: when scenario 5 is applied.
Figure 7What-If analysis: when scenario 6 is applied.
Figure 8What-If analysis: when scenario 7 is applied.
All attributes included within this study.
| Revised Attributes | Original Attributes | Type | Description | Source |
|---|---|---|---|---|
| Region | CTYPE | Nominal | City type (County, Big city, Small city) | Lee, Kang, Kim, Son, Lee and Ham [ |
| Academic_Performance | E_S_RCRD | Academic performance | ||
| Pressure | M_STR | How often do you feel under pressure? | ||
| Suicide_Thought | M_SUI_CON | In the last 12 months, have you ever had suicide thought? | ||
| Sleeping_Quality | M_SLP_EN | In the last 7 days, do you think that sleeping time is sufficient? | ||
| Drinking | DRINKING | Have you ever had a drink? | ||
| Smoking | SMOKING | Have you ever smoked cigarettes? | ||
| Education_Father | E_EDU_F | Father’s educational level | Lee, Kang, Kim, Son, Lee and Ham [ | |
| Education_Mother | E_EDU_M | Mother’s educational level | ||
| Wealth | E_SES | Family economic level | Jang, Oh, Kim and Shin [ | |
| Pocket_Money (KRW) | E_ALLWN | Average pocket money per week | ||
| Healthy_Eating | F_BR | Numeric | During the last 7 days, how many days did you eat breakfast? | Lee, Kang, Kim, Son, Lee and Ham [ |
| F_FRUIT | During the last 7 days, how often did you eat fruit? | |||
| F_VEG | During the last 7 days, how often did you eat vegetables? | |||
| F_MILK | During the last 7 days, how often did you drink milk? | |||
| Unhealthy_Eating | F_FASTFOOD | During the last 7 days, how often did you eat fast food? | ||
| F_INSTND | During the last 7 days, how often did you eat instant noodles? | |||
| F_CRACK | During the last 7 days, how often did you eat snake? | |||
| F_SODA | During the last 7 days, how often did you drink carbonated drinks? | |||
| F_CAFFEINE | During the last 7 days, how often did you drink coffee or energy drinks? | |||
| F_SWDRINK | During the last 7 days, how often did you drink sweet drinks? | |||
| Exercise_60 min | PA_TOT | In the past 7 days, the number of days that take more than 60 min of exercise. | Lee, Kang, Kim, Son, Lee and Ham [ | |
| Exercise_20 min | PA_VIG | In the past 7 days, the number of days that take more than 20 min of strenuous exercise. | ||
| Sitting_Time_Study (min) | PA_SWD_S | In the last 7 days, how many hours do you spend sitting for study per day on weekday? | Lee, Kang, Kim, Son, Lee and Ham [ | |
| PA_SWK_S | In the last 7 days, how many hours do you spend sitting for study per day on weekend? | |||
| Smartphone_Time (min) | INT_SP_WD | In the last 7 days, how many hours do you spend using smartphone per day on weekday? | Park and Song [ | |
| INT_SP_WK | In the last 7 days, how many hours do you spend using smartphone per day on weekend? | |||
| Smartphone_Service | INT_SP_ITEM | Nominal | During the last 30 days, which kind of smartphone service is mainly used? | |
| Obesity_Level | Obesity | >=95th BMI Percentile | Lee, Kang, Kim, Son, Lee and Ham [ | |
| Overweight | 85th~95th BMI Percentile | |||
| Normal | 5th~85th BMI Percentile | |||
| Underweight | <=5th BMI Percentile |