| Literature DB >> 35614408 |
Tao Xie1, Ning Zhang1, Ying Mao2, Bin Zhu3.
Abstract
BACKGROUND: The internet has become an important resource for the public to obtain health information. Therefore, the ability to obtain and use such resources has become important for health literacy. This study aimed to establish a prediction model of Chinese students' electronic health literacy (EHL) to guide government policymaking and parental interventions, identify the predictors of EHL in Chinese students using random forests, and establish a corresponding prediction model to help policymakers and parents determine whether primary and secondary school students have high EHL.Entities:
Keywords: Chinese students; Electronic health literacy; Lasso; Random forest; Web nomograms
Mesh:
Year: 2022 PMID: 35614408 PMCID: PMC9132355 DOI: 10.1186/s12889-022-13421-4
Source DB: PubMed Journal: BMC Public Health ISSN: 1471-2458 Impact factor: 4.135
Overview of the characteristics of all the datasets
| Factor | Testing = 0 | Training = 1 | EHL = 0 | EHL = 1 | |
|---|---|---|---|---|---|
| Sex | 0.81 | ||||
| Male | 188 (51.8%) | 423 (48.5%) | 408 (49.8%) | 203 (48.9%) | |
| Female | 175 (48.2%) | 449 (51.5%) | 412 (50.2%) | 212 (51.1%) | |
| Age | 14.0 (11.0, 17.0) | 14.0 (11.0, 17.0) | 14.0 (11.0, 17.0) | 16.0 (13.0, 17.0) | < 0.01 |
| Race | 0.41 | ||||
| Han | 360 (99.2%) | 861 (98.7%) | 809 (98.7%) | 412 (99.3%) | |
| Nhan | 3 (0.8%) | 11 (1.3%) | 11 (1.3%) | 3 (0.7%) | |
| Grade | < 0.01 | ||||
| 2 | 105 (28.9%) | 243 (27.9%) | 259 (31.6%) | 89 (21.4%) | |
| 3 | 102 (28.1%) | 246 (28.2%) | 234 (28.5%) | 114 (27.5%) | |
| 4 | 152 (41.9%) | 375 (43.0%) | 318 (38.8%) | 209 (50.4%) | |
| 5 | 0 (0.0%) | 1 (0.1%) | 1 (0.1%) | 0 (0.0%) | |
| 7 | 2 (0.6%) | 7 (0.8%) | 7 (0.9%) | 2 (0.5%) | |
| 8 | 2 (0.6%) | 0 (0.0%) | 1 (0.1%) | 1 (0.2%) | |
| Family size | 4.0 (4.0, 4.0) | 4.0 (4.0, 4.0) | 4.0 (4.0, 4.0) | 4.0 (4.0, 4.0) | 0.81 |
| Only child | 0.84 | ||||
| Yes | 29 (8.0%) | 99 (11.4%) | 84 (10.2%) | 44 (10.6%) | |
| No | 334 (92.0%) | 773 (88.6%) | 736 (89.8%) | 371 (89.4%) | |
| Employment status | < 0.01 | ||||
| Father working | 97 (26.7%) | 243 (27.9%) | 244 (29.8%) | 96 (23.1%) | |
| Mother working | 6 (1.7%) | 27 (3.1%) | 21 (2.6%) | 12 (2.9%) | |
| Parents both working | 42 (11.6%) | 109 (12.5%) | 113 (13.8%) | 38 (9.2%) | |
| No parent working | 218 (60.1%) | 493 (56.5%) | 442 (53.9%) | 269 (64.8%) | |
| Location of the household | < 0.01 | ||||
| City | 138 (38.0%) | 312 (35.8%) | 324 (39.5%) | 126 (30.4%) | |
| Rural | 225 (62.0%) | 560 (64.2%) | 496 (60.5%) | 289 (69.6%) | |
| Mother education | 2.0 (1.0, 3.0) | 2.0 (1.0, 3.0) | 2.0 (1.0, 3.0) | 2.0 (1.0, 3.0) | 0.07 |
| Father education | 2.0 (2.0, 3.0) | 2.0 (2.0, 3.0) | 2.0 (2.0, 3.0) | 2.0 (2.0, 4.0) | 0.55 |
| Game time of online | 2.0 (1.0, 3.0) | 2.0 (0.0, 3.0) | 2.0 (0.0, 3.0) | 2.0 (1.0, 3.0) | 0.91 |
| Parental phubbing behavior | 22.0 (17.0, 27.0) | 22.0 (17.0, 27.0) | 21.0 (17.0, 26.0) | 24.0 (18.0, 28.0) | < 0.01 |
| General self-efficacy | 24.0 (20.0, 28.0) | 24.0 (20.0, 28.0) | 23.0 (20.0, 26.0) | 27.0 (23.0, 31.0) | < 0.01 |
Overview of the characteristics of the training dataset
| Factor | EHL = 0 | EHL = 1 | |
|---|---|---|---|
| N | 585 | 287 | |
| Sex | 0.89 | ||
| Male | 285 (48.7%) | 138 (48.1%) | |
| Female | 300 (51.3%) | 149 (51.9%) | |
| Age | 14.0 (11.0, 17.0) | 16.0 (13.0, 17.0) | < 0.01 |
| Race | 0.52 | ||
| Han | 576 (98.5%) | 285 (99.3%) | |
| Nhan | 9 (1.5%) | 2 (0.7%) | |
| Grade | < 0.01 | ||
| 2 | 190 (32.5%) | 53 (18.5%) | |
| 3 | 166 (28.4%) | 80 (27.9%) | |
| 4 | 223 (38.1%) | 152 (53.0%) | |
| 5 | 1 (0.2%) | 0 (0.0%) | |
| 7 | 5 (0.9%) | 2 (0.7%) | |
| Family size | 4.0 (4.0, 4.0) | 4.0 (4.0, 4.0) | 0.44 |
| Only child | 1.00 | ||
| Yes | 67 (11.5%) | 32 (11.1%) | |
| No | 518 (88.5%) | 255 (88.9%) | |
| Employment status | 0.05 | ||
| Father working | 172 (29.4%) | 71 (24.7%) | |
| Mother working | 18 (3.1%) | 9 (3.1%) | |
| Parents both working | 82 (14.0%) | 27 (9.4%) | |
| No parent working | 313 (53.5%) | 180 (62.7%) | |
| Location of the household | < 0.01 | ||
| City | 233 (39.8%) | 79 (27.5%) | |
| Rural | 352 (60.2%) | 208 (72.5%) | |
| Mother education | 2.0 (1.0, 3.0) | 2.0 (1.0, 3.0) | 0.04 |
| Father education | 2.0 (2.0, 3.0) | 2.0 (2.0, 4.0) | 0.76 |
| Game time of online | 2.0 (0.0, 3.0) | 2.0 (1.0, 3.0) | 0.33 |
| Parental phubbing behavior | 21.0 (17.0, 25.0) | 24.0 (19.0, 28.0) | < 0.01 |
| General self-efficacy | 23.0 (20.0, 26.0) | 27.0 (23.0, 31.0) | < 0.01 |
Overview of the characteristics of the testing dataset
| Factor | EHL = 0 | EHL = 1 | |
|---|---|---|---|
| N | 235 | 128 | |
| Sex | 0.83 | ||
| Male | 123 (52.3%) | 65 (50.8%) | |
| Female | 112 (47.7%) | 63 (49.2%) | |
| Age | 14.0 (10.0, 17.0) | 15.0 (11.0, 16.0) | 0.73 |
| Race | 1.00 | ||
| Han | 233 (99.1%) | 127 (99.2%) | |
| Nhan | 2 (0.9%) | 1 (0.8%) | |
| Grade | 0.83 | ||
| 2 | 69 (29.4%) | 36 (28.1%) | |
| 3 | 68 (28.9%) | 34 (26.6%) | |
| 4 | 95 (40.4%) | 57 (44.5%) | |
| 7 | 2 (0.9%) | 0 (0.0%) | |
| 8 | 1 (0.4%) | 1 (0.8%) | |
| Family size | 4.0 (4.0, 4.0) | 4.0 (4.0, 4.0) | 0.47 |
| Only child | 0.54 | ||
| Yes | 17 (7.2%) | 12 (9.4%) | |
| No | 218 (92.8%) | 116 (90.6%) | |
| Employment status | 0.02 | ||
| Father working | 72 (30.6%) | 25 (19.5%) | |
| Mother working | 3 (1.3%) | 3 (2.3%) | |
| Parents working | 31 (13.2%) | 11 (8.6%) | |
| No parent working | 129 (54.9%) | 89 (69.5%) | |
| Location of the household | 0.74 | ||
| City | 91 (38.7%) | 47 (36.7%) | |
| Rural | 144 (61.3%) | 81 (63.3%) | |
| Mother education | 2.0 (1.0, 3.0) | 2.0 (1.0, 3.0) | 0.89 |
| Father education | 2.0 (2.0, 3.0) | 2.0 (2.0, 4.0) | 0.11 |
| Game of online | 2.0 (1.0, 3.0) | 1.0 (0.0, 2.5) | 0.19 |
| Parental phubbing behavior | 22.0 (17.0, 27.0) | 23.5 (17.0, 28.0) | 0.38 |
| General self-efficacy | 23.0 (20.0, 26.0) | 27.5 (22.0, 31.0) | < 0.01 |
Fig. 1Feature selection and ranking by random forest. A The plot shows a boxplot of all variables. B History of decisions of rejecting or accepting features by random forest
Fig. 2Feature selection using LASSO regression. A LASSO coefficient profiles of the clinical features. B Optimal penalization coefficient lambda (λ)
Fig. 3ROC for evaluating the model’s discrimination performance in the training and testing datasets. Red: Training dataset; a Green: Testing dataset
Fig. 4Nomogram of EHL
Fig. 5DCA focusing on the relationship between the benefits and risks brought by different models. Benefits and risks of models on training (A) /test (B) dataset