| Literature DB >> 30045712 |
Alessio Rossi1,2, Giovanna Calogiuri3.
Abstract
BACKGROUND: As other westerns countries, a large portion of Norwegians do not meet the minimum recommendations for weekly physical activity (PA). One of the primary targets of the WHO's Global action plan for the prevention and control of noncommunicable diseases is to reduce insufficient PA by 10% within 2025. In order to effectively increase the PA levels in the population, an in-depth understanding of PA habits within different sub-groups is therefore vital. Using a machine learning (ML) approach, the aim of this study was to investigate patterns and correlates of PA in adult Norwegians, as well as to construct a predictive model of future PA.Entities:
Keywords: Machine learning; Physical activity correlates; Physical activity patters; Prediction
Mesh:
Year: 2018 PMID: 30045712 PMCID: PMC6060515 DOI: 10.1186/s12889-018-5854-2
Source DB: PubMed Journal: BMC Public Health ISSN: 1471-2458 Impact factor: 3.295
Fig. 1Number of survey respondents for each year by gender. This figure shows the number of survey respondents recorded for each PA component in each year by gender
Structure of the training dataset
| h1 | h2 | … | hk | C1 | C2 | C3 | |
|---|---|---|---|---|---|---|---|
| s1 | 2 | 3 | … | 2 | 2 | 1 | 3 |
| s2 | 1 | 1 | … | 2 | 2 | 2 | 1 |
| s3 | 3 | 2 | … | 1 | 1 | 1 | 3 |
| ⁞ | ⁞ | ⁞ | ⁞ | ⁞ | ⁞ | ⁞ | ⁞ |
| s | 1 | 2 | … | 2 | 3 | 3 | 2 |
Each example s describes a vector of items m consisting of k features (h) and three labels c ∈ {1, 2, 3}. The light black reflects the vectors of features; otherwise, the dark black reflects the labels. Label 1, 2 and 3 reflects the item Frequency, Duration and Intensity
Correlation between PA characteristics
| Male | Female | ||||
|---|---|---|---|---|---|
| Frequency | Duration | Frequency | Duration | ||
| < 25 | Duration | 0.22 | – | 0.28 | – |
| Intensity | 0.27 | 0.37 | 0.23 | 0.34 | |
| 25–44 | Duration | 0.15 | – | 0.15 | – |
| Intensity | 0.24 | 0.30 | 0.18 | 0.32 | |
| 45–64 | Duration | 0.12 | – | 0.13 | |
| Intensity | 0.22 | 0.30 | 0.12 | 0.29 | |
| > 65 | Duration | 0.14 | – | 0.16 | – |
| Intensity | 0.17 | 0.32 | 0.11 | 0.32 | |
Spearman’s rank correlation coefficient from 1999 to 2013 based on the original 6- or 8-point component scales between the independent features (Frequency, Duration and Intensity) in both male and females and in the four age groups (i.e., < 25, 25–44, 45–64, > 64 years old)
Fig. 2PA differences among age groups. This figure shows the differences between males and females among age groups computed by using Tukey’s HSD post-hoc test. The values provided in this figure are based on the original 6- or 8-point component scales. * refers to the statistical difference vs < 25 age group; & refers to the statistical difference vs 25–44 age group; $ refers to the statistical difference vs 45–64 age group; + refers to the difference vs > 65 age group
Fig. 3PA differences among years. In this figure we present: i) the evolutions of the Frequency in both males and females from 1985 to 2013; ii) the evolutions of Duration a week in both males and females from 1999 to 2013; iii) the Evolutions of Intensity in both males and females from 1999 to 2013. Int, BG and WG refer to the p-value of Interaction, Between Groups difference and Within Groups difference, respectively. The means for the different PA components are grouped in 3 ordinal classes
Classification performances
| < 25 | 25–44 | 45–64 | > 65 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Prec | Rec | F1 | Prec | Rec | F1 | Prec | Rec | F1 | Prec | Rec | F1 | |||
| Frequency | Male | Ord | 0.85 | 0.82 | 0.82 | 0.57 | 0.54 | 0.53 | 0.55 | 0.55 | 0.55 | 0.93 | 0.92 | 0.92 |
| RF | 0.54 | 0.53 | 0.53 | 0.49 | 0.48 | 0.49 | 0.47 | 0.57 | 0.51 | 0.46 | 0.55 | 0.5 | ||
| B1 | 0.46 | 0.45 | 0.46 | 0.38 | 0.37 | 0.37 | 0.4 | 0.4 | 0.4 | 0.42 | 0.41 | 0.41 | ||
| B2 | 0.22 | 0.46 | 0.29 | 0.19 | 0.44 | 0.27 | 0.18 | 0.43 | 0.26 | 0.21 | 0.46 | 0.29 | ||
| Female | Ord | 0.71 | 0.70 | 0.70 | 0.69 | 0.65 | 0.67 | 0.82 | 0.77 | 0.79 | 0.93 | 0.92 | 0.92 | |
| RF | 0.62 | 0.67 | 0.6 | 0.51 | 0.63 | 0.54 | 0.75 | 0.77 | 0.71 | 0.57 | 0.66 | 0.59 | ||
| B1 | 0.52 | 0.54 | 0.53 | 0.44 | 0.44 | 0.44 | 0.59 | 0.57 | 0.58 | 0.49 | 0.48 | 0.48 | ||
| B2 | 0.45 | 0.67 | 0.54 | 0.4 | 0.63 | 0.49 | 0.57 | 0.76 | 0.65 | 0.38 | 0.62 | 0.48 | ||
| Duration | Male | Ord | 0.94 | 0.95 | 0.94 | 0.54 | 0.61 | 0.54 | 0.55 | 0.57 | 0.56 | 0.80 | 0.77 | 0.76 |
| RF | 0.67 | 0.72 | 0.69 | 0.54 | 0.61 | 0.54 | 0.55 | 0.57 | 0.55 | 0.44 | 0.5 | 0.47 | ||
| B1 | 0.55 | 0.56 | 0.7 | 0.42 | 0.42 | 0.42 | 0.34 | 0.35 | 0.35 | 0.42 | 0.42 | 0.42 | ||
| B2 | 0.41 | 0.64 | 0.5 | 0.33 | 0.57 | 0.42 | 0.25 | 0.5 | 0.33 | 0.22 | 0.47 | 0.3 | ||
| Female | Ord | 0.72 | 0.71 | 0.72 | 0.63 | 0.69 | 0.65 | 0.49 | 0.58 | 0.5 | 0.81 | 0.78 | 0.76 | |
| RF | 0.72 | 0.61 | 0.49 | 0.46 | 0.68 | 0.55 | 0.47 | 0.51 | 0.49 | 0.38 | 0.62 | 0.48 | ||
| B1 | 0.53 | 0.52 | 0.52 | 0.53 | 0.54 | 0.53 | 0.43 | 0.41 | 0.42 | 0.44 | 0.48 | 0.46 | ||
| B2 | 0.33 | 0.57 | 0.42 | 0.46 | 0.68 | 0.54 | 0.33 | 0.58 | 0.42 | 0.38 | 0.62 | 0.47 | ||
| Intensity | Male | Ord | 0.61 | 0.70 | 0.63 | 0.78 | 0.77 | 0.78 | 0.51 | 0.48 | 0.49 | 0.86 | 0.81 | 0.82 |
| RF | 0.62 | 0.67 | 0.65 | 0.53 | 0.56 | 0.53 | 0.38 | 0.51 | 0.42 | 0.45 | 0.55 | 0.47 | ||
| B1 | 0.52 | 0.54 | 0.53 | 0.48 | 0.47 | 0.47 | 0.39 | 0.4 | 0.4 | 0.39 | 0.39 | 0.39 | ||
| B2 | 0.45 | 0.67 | 0.54 | 0.33 | 0.57 | 0.42 | 0.23 | 0.48 | 0.31 | 0.26 | 0.51 | 0.34 | ||
| Female | Ord | 0.67 | 0.71 | 0.69 | 0.59 | 0.65 | 0.62 | 0.49 | 0.51 | 0.53 | 0.80 | 0.72 | 0.7 | |
| RF | 0.58 | 0.66 | 0.59 | 0.61 | 0.62 | 0.61 | 0.44 | 0.51 | 0.5 | 0.41 | 0.48 | 0.44 | ||
| B1 | 0.49 | 0.49 | 0.49 | 0.48 | 0.48 | 0.48 | 0.41 | 0.42 | 0.42 | 0.34 | 0.34 | 0.34 | ||
| B2 | 0.46 | 0.68 | 0.55 | 0.24 | 0.49 | 0.31 | 0.27 | 0.52 | 0.36 | 0.23 | 0.48 | 0.31 | ||
Models metrics of PA Frequency, Duration and Intensity in all the age groups and in both males and females. Prec, Rec and F1 refer to precision, recall and f1-score, respectively. In the table, the models with higher F1 score are highlighted
Individual characteristics and sociocultural factors associated with the respondents’ PA
| Male | Female | ||||
|---|---|---|---|---|---|
| Feature | Coef | Feature | Coef | ||
| Frequency | < 25 | PA habit - Strength exercise | 0.97 | Values - Healthy life | 0.71 |
| Values – Debts | 0.78 | PA Habit - Handball | 0.50 | ||
| PA Facilities - Sport hall | 0.77 | PA Facilities - Illuminated track | 0.49 | ||
| PA Motive – Challenge | 0.73 | ||||
| Climate Change | −0.70 | ||||
| 25–44 | Values - Healthy life | 0.78 | Values - Healthy life | 0.98 | |
| PA Facilities - Illuminated track | 0.62 | ||||
| PA Facilities - Fitness Centre | 0.49 | ||||
| 45–64 | PA Facilities - Walking trial | 0.95 | PA Facilities - Walking trial | 0.60 | |
| Values - Healthy life | 0.77 | PA Habit - Shooting | 0.58 | ||
| PA barriers - Lack of enjoyment | −0.52 | ||||
| > 65 | Environmental behaviours - Active Transport | 0.77 | Duration | 0.53 | |
| PA Intensity | 0.61 | Environmental behaviours - Active Transport | −0.51 | ||
| PA Barriers - Lack of time | −0.58 | Values – Strikes | 0.49 | ||
| Comfort with divergences | −0.57 | Close relationships with neighbours | −0.48 | ||
| Education field | 0.55 | Values - Healthy life | −0.44 | ||
| Duration | < 25 | PA Intensity | 0.94 | PA Habits – Sailings | 0.75 |
| Values – Honesty | 0.76 | ||||
| Childhood in a farm | 0.73 | ||||
| Health benefits – Snus | −0.58 | ||||
| Values - Brands quality | 0.55 | ||||
| 25–44 | PA Facilities- Walking trail | 0.68 | PA Facilities- Walking trail | 0.57 | |
| 45–64 | PA Facilities- Walking trail | 0.66 | PA Habit – Cycling | 0.47 | |
| > 65 | PA Habit – Cycling | −0.62 | PA Facilities - Outdoor area | 0.54 | |
| Language identity | −0.58 | Values - Mothers with disabilities | 0.53 | ||
| PA Facilities - Track and field stadium | 0.52 | Values - Healthy food | −0.52 | ||
| Religion inquiry | −0.50 | PA Habits – Hiking | 0.52 | ||
| Environmental organizations | 0.49 | Values - Personal liberty | 0.49 | ||
| Intensity | < 25 | Disagreements with neighbour’s | −1.00 | PA facilities - Fitness Centre | 0.67 |
| PA Facilities - Walking trial | 0.91 | PA Habits – Hiking | 0.53 | ||
| National pride | −0.89 | PA Facilities – Motorsport | 0.53 | ||
| Values - Children obedience | −0.87 | PA Habits – Jogging | 0.53 | ||
| Values - Countryside life | −0.86 | Attitude to State Church | 0.48 | ||
| 25–44 | View on social security | 1.00 | PA facilities - Fitness Centre | 1.00 | |
| PA Frequency | 0.73 | ||||
| PA Motive – Appearance | 0.71 | ||||
| Work field | 0.69 | ||||
| Values - Economic equality | −0.58 | ||||
| 45–64 | PA Motive - Health benefits | 0.97 | PA Motive - Health benefits | 1.00 | |
| > 65 | PA Habit – Cycling | 0.72 | Values – Gambling | −0.41 | |
| Environmental concerns | −0.52 | PA Frequency | 0.40 | ||
| Values - Brands name | 0.51 | View on children number | 0.40 | ||
| Religion identity – Christianity | −0.50 | View on pension & holidays | −0.39 | ||
| PA Frequency | 0.50 | Values – Marriage | −0.39 | ||
The coefficient indicates the importance for the feature (as computed by Gini Coefficient) in predicting the different PA components Frequency, Duration and Intensity in all the age groups (only five features with highest coefficient are shown)
PA components prediction
| Frequency | Duration | Intensity | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1985 | 2013 | 2025 | 1999 | 2013 | 2025 | 1999 | 2013 | 2025 | ||
|
|
| 1.02 | 1.27 | 1.44 ± 0.06 | 1.62 | 1.42 | 0.92 ± 0.04 | 1.63 | 1.65 | 2.45 ± 0.18 |
|
| 0.75 | 1.10 | 1.22 ± 0.04 | 1.31 | 1.27 | 1.21 ± 0.07 | 1.40 | 1.51 | 1.53 ± 0.14 | |
|
| 0.76 | 1.10 | 1.36 ± 0.08 | 1.25 | 1.22 | 1.15 ± 0.02 | 0.98 | 1.12 | 1.19 ± 0.09 | |
|
| 0.88 | 1.22 | 1.35 ± 0.08 | 1.13 | 1.19 | 1.03 ± 0.04 | 0.64 | 0.87 | 0.66 ± 0.08 | |
|
|
| 0.89 | 1.30 | 2.07 ± 0.18 | 1.37 | 1.31 | 1.30 ± 0.08 | 1.46 | 1.60 | 1.62 ± 0.12 |
|
| 0.69 | 1.12 | 1.80 ± 0.17 | 1.18 | 1.11 | 1.03 ± 0.06 | 1.14 | 1.32 | 1.23 ± 0.08 | |
|
| 0.90 | 1.26 | 1.66 ± 0.11 | 1.19 | 1.18 | 1.13 ± 0.04 | 0.77 | 1.03 | 0.84 ± 0.02 | |
|
| 0.69 | 1.26 | 1.42 ± 0.09 | 1.00 | 1.08 | 1.13 ± 0.05 | 0.49 | 0.74 | 1.10 ± 0.05 | |
Frequency: Values lower than 1 refer to Frequency less than twice every 14 days; values from 1 to 2 refer to Frequency from once to twice a week; values higher than 2 refer to Frequency more than three times a week
Duration: Values lower than 1 refer to duration less than 30 min; values from 1 to 2 refer to duration between 30 and 60 min; values higher than 2 refer to duration higher than 60 min
Intensity: Values lower than 1 refer to intensity ‘I feel that my body becomes warm’; values from 1 to 2 refer to intensity from ‘I feel that my body becomes warm’ to ‘I feel I breathe harder and get sweaty’; values higher than 2 refer to intensity close to ‘maximum exertion’
Mean of PA components class answers from values recorded until 2013 and mean ± standard deviation predicted upon 2025