| Literature DB >> 34925484 |
Wenxiu Xie1, Meng Ji2, Mengdan Zhao2, Kam-Yiu Lam1, Chi-Yin Chow1, Tianyong Hao3.
Abstract
BACKGROUND: From Ebola, Zika, to the latest COVID-19 pandemic, outbreaks of highly infectious diseases continue to reveal severe consequences of social and health inequalities. People from low socioeconomic and educational backgrounds as well as low health literacy tend to be affected by the uncertainty, complexity, volatility, and progressiveness of public health crises and emergencies. A key lesson that governments have taken from the ongoing coronavirus pandemic is the importance of developing and disseminating highly accessible, actionable, inclusive, coherent public health advice, which represent a critical tool to help people with diverse cultural, educational backgrounds and varying abilities to effectively implement health policies at the grassroots level.Entities:
Mesh:
Year: 2021 PMID: 34925484 PMCID: PMC8683224 DOI: 10.1155/2021/1916690
Source DB: PubMed Journal: Comput Intell Neurosci
Mann-Whitney U test.
| MLS features (1–26) POS (27–72) | EHR mean | EHR std. | RHR mean | RHR std. | Asymp. sig. (2-tailed) | Effect size | Common language effect size CLES | 95% CI | Mann-Whitney U | Wilcoxon W | Z |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Average sentences per paragraph | 0.601 | 0.345 | 3.316 | 1.163 | 0.000 | 2.588 | 0.966 | 2.301 to 2.874 | 52.00 | 4238.00 | −14.669 |
| TTR | 0.461 | 0.119 | 0.623 | 0.079 | 0.000 | 1.828 | 0.902 | 1.568 to 2.088 | 4389.00 | 8575.00 | −10.680 |
| Difficult words | 141.813 | 96.381 | 70.872 | 35.986 | 0.000 | −1.311 | 0.823 | −1.558 to −1.065 | 8769.50 | 70897.50 | −6.657 |
| Low-stroke characters | 641.593 | 444.264 | 284.707 | 140.930 | 0.000 | −1.507 | 0.857 | -1.758 to−1.256 | 8108.00 | 70236.00 | −7.264 |
| Middle-stroke characters | 120.396 | 80.348 | 51.810 | 27.695 | 0.000 | −1.562 | 0.865 | −1.814 to−1.31 | 7357.00 | 69485.00 | −7.955 |
| High-stroke characters | 0.055 | 0.229 | 0.452 | 1.604 | 0.005 | 0.277 | 0.578 | 0.045 to 0.508 | 14144.00 | 18330.00 | −2.815 |
| Average strokes per character | 7.677 | 0.274 | 7.711 | 0.342 | 0.195 | 0.103 | 0.529 | −0.127 to 0.334 | 14604.00 | 18790.00 | −1.297 |
| 2-character words | 256.637 | 176.783 | 114.341 | 56.262 | 0.000 | −1.509 | 0.857 | −1.76 to −1.258 | 8100.50 | 70228.50 | −7.271 |
| 3-character words | 13.000 | 11.824 | 8.298 | 6.372 | 0.008 | −0.603 | 0.665 | −0.837 to −0.369 | 13139.50 | 75267.50 | −2.647 |
| Average words per sentences | 11.592 | 4.963 | 11.893 | 1.962 | 0.018 | 0.106 | 0.530 | -0.125 to 0.336 | 13438.00 | 17624.00 | −2.368 |
| Single sentences | 0.883 | 0.114 | 0.460 | 0.191 | 0.000 | −2.376 | 0.954 | −2.655 to −2.098 | 949.00 | 63077.00 | −13.851 |
| Ratio of noun phrases | 0.267 | 0.104 | 0.415 | 0.198 | 0.000 | 0.810 | 0.717 | 0.573 to 1.046 | 8396.50 | 12582.50 | −6.999 |
| Frequency of noun phrases per 10K | 322.369 | 37.240 | 314.664 | 37.002 | 0.028 | −0.208 | 0.558 | −0.439 to 0.023 | 13620.50 | 75748.50 | −2.200 |
| Average idioms per sentences | 0.001 | 0.004 | 0.012 | 0.029 | 0.004 | 0.424 | 0.618 | 0.192 to 0.656 | 13969.00 | 18155.00 | −2.894 |
| Content words | 392.582 | 265.421 | 163.074 | 80.538 | 0.000 | −1.642 | 0.877 | −1.896 to −1.387 | 6988.50 | 69116.50 | −8.293 |
| Adverbs of negation | 2.758 | 3.067 | 0.935 | 1.228 | 0.000 | −1.032 | 0.767 | −1.272 to −0.792 | 9522.50 | 71650.50 | −6.283 |
| Sentences with complex semantic categories | 25.396 | 22.039 | 7.614 | 4.758 | 0.000 | −1.643 | 0.877 | −1.898 to −1.388 | 6696.50 | 68824.50 | −8.578 |
| Density of content words | 0.828 | 0.026 | 0.815 | 0.031 | 0.000 | −0.433 | 0.620 | −0.665 to −0.2 | 11857.50 | 73985.50 | −3.820 |
| Average logarithmic frequency of content words | 1.738 | 0.169 | 1.337 | 0.183 | 0.000 | −2.225 | 0.942 | −2.498 to −1.952 | 1854.00 | 63982.00 | −13.009 |
| Idioms | 0.077 | 0.268 | 0.224 | 0.510 | 0.009 | 0.312 | 0.587 | 0.081 to 0.544 | 14168.00 | 18354.00 | −2.623 |
| Pronouns | 41.692 | 35.043 | 1.469 | 1.652 | 0.000 | −2.530 | 0.963 | −2.814 to −2.245 | 620.50 | 62748.50 | −14.398 |
| Personal pronouns | 37.868 | 31.838 | 0.705 | 1.139 | 0.000 | −2.577 | 0.966 | −2.864 to −2.291 | 326.00 | 62454.00 | −15.334 |
| Conjunctions | 18.967 | 16.440 | 11.520 | 6.407 | 0.001 | −0.795 | 0.713 | −1.031 to −0.558 | 12507.50 | 74635.50 | −3.228 |
| Positive conjunctions | 16.824 | 14.159 | 9.000 | 5.193 | 0.000 | −0.991 | 0.758 | −1.23 to -0.751 | 10808.00 | 72936.00 | −4.795 |
| Negative conjunctions | 0.846 | 1.584 | 1.440 | 1.393 | 0.000 | 0.414 | 0.615 | 0.182 to 0.646 | 10710.50 | 14896.50 | −5.070 |
| Difficult words ratio | 30.384 | 5.724 | 35.847 | 8.175 | 0.000 | 0.706 | 0.691 | 0.471 to 0.941 | 9400.50 | 13586.50 | −6.077 |
| A | 3.099 | 3.774 | 3.026 | 2.969 | 0.236 | −0.023 | 0.507 | −0.254 to 0.207 | 14740.50 | 18926.50 | −1.184 |
| VI | 0.143 | 0.485 | 0.159 | 0.424 | 0.348 | 0.037 | 0.510 | −0.194 to 0.267 | 15418.50 | 19604.50 | −0.938 |
| Dk | 0.011 | 0.105 | 0.017 | 0.130 | 0.680 | 0.048 | 0.514 | −0.183 to 0.278 | 15919.00 | 20105.00 | −0.413 |
| VG | 2.154 | 2.454 | 2.054 | 1.919 | 0.505 | −0.049 | 0.514 | −0.28 to 0.181 | 15305.00 | 19491.00 | -0.666 |
| Nv | 9.967 | 10.390 | 9.290 | 6.711 | 0.083 | −0.089 | 0.525 | −0.32 to 0.142 | 14132.00 | 18318.00 | −1.734 |
| Neqb | 0.033 | 0.180 | 0.057 | 0.277 | 0.590 | 0.092 | 0.526 | −0.138 to 0.323 | 15810.00 | 19996.00 | −0.539 |
| Cab | 0.121 | 0.390 | 0.398 | 0.799 | 0.000 | 0.377 | 0.605 | 0.145 to 0.609 | 13151.00 | 17337.00 | −3.534 |
| I | 0.033 | 0.180 | 0.000 | 0.000 | 0.001 | −0.406 | 0.613 | −0.638 to −0.174 | 15488.00 | 77616.00 | −3.414 |
| VAC | 0.154 | 0.392 | 0.048 | 0.215 | 0.001 | −0.406 | 0.613 | −0.638 to −0.174 | 14493.00 | 76621.00 | −3.214 |
| Nd | 4.253 | 7.872 | 2.298 | 3.408 | 0.002 | −0.418 | 0.616 | -0.65 to −0.186 | 12764.50 | 74892.50 | −3.050 |
| Nb | 1.451 | 2.423 | 0.739 | 1.420 | 0.000 | −0.425 | 0.618 | −0.657 to −0.193 | 12369.50 | 74497.50 | −3.789 |
| Dfb | 0.066 | 0.291 | 0.003 | 0.053 | 0.000 | −0.451 | 0.625 | −0.683 to −0.219 | 15181.00 | 77309.00 | −3.831 |
| Neu | 4.505 | 6.339 | 2.395 | 3.209 | 0.001 | −0.521 | 0.644 | −0.754 to −0.288 | 12438.00 | 74566.00 | −3.347 |
| VJ | 10.011 | 8.836 | 6.455 | 5.125 | 0.000 | −0.586 | 0.661 | −0.82 to −0.352 | 11896.00 | 74024.00 | −3.797 |
| VL | 2.912 | 2.946 | 1.710 | 1.741 | 0.001 | −0.588 | 0.661 | −0.821 to −0.354 | 12457.00 | 74585.00 | −3.343 |
| Cba | 0.110 | 0.379 | 0.003 | 0.053 | 0.000 | −0.602 | 0.665 | −0.836 to −0.369 | 14652.50 | 76780.50 | −5.125 |
| Caa | 12.846 | 11.117 | 8.705 | 5.150 | 0.103 | −0.608 | 0.666 | −0.842 to −0.374 | 14244.50 | 76372.50 | −1.631 |
| VB | 0.912 | 1.488 | 0.321 | 0.722 | 0.000 | −0.635 | 0.673 | −0.869 to −0.401 | 12839.50 | 74967.50 | −3.789 |
| VHC | 0.527 | 1.089 | 1.705 | 1.958 | 0.000 | 0.649 | 0.677 | 0.415 to 0.884 | 9100.00 | 13286.00 | −6.641 |
| Da | 0.516 | 1.058 | 0.125 | 0.348 | 0.000 | −0.686 | 0.686 | −0.921 to −0.451 | 12813.00 | 74941.00 | -4.647 |
| Nes | 2.033 | 2.100 | 0.960 | 1.278 | 0.000 | −0.723 | 0.696 | −0.959 to -0.488 | 10783.50 | 72911.50 | −5.088 |
| Dfa | 3.473 | 3.854 | 1.608 | 1.752 | 0.000 | −0.797 | 0.713 | −1.033 to −0.561 | 10952.00 | 73080.00 | −4.769 |
| VH | 22.154 | 15.913 | 13.767 | 8.222 | 0.000 | −0.817 | 0.718 | −1.053 to −0.58 | 11401.00 | 73529.00 | −4.243 |
| Nf | 7.418 | 8.071 | 3.094 | 3.956 | 0.000 | −0.852 | 0.727 | −1.089 to −0.615 | 9110.50 | 71238.50 | −6.401 |
| Nc | 7.780 | 7.781 | 2.688 | 5.303 | 0.000 | −0.864 | 0.729 | −1.101 to −0.627 | 8121.00 | 70249.00 | −7.430 |
| Nep | 3.527 | 4.388 | 1.327 | 1.653 | 0.000 | −0.890 | 0.736 | −1.128 to −0.653 | 12279.50 | 74407.50 | −3.559 |
| T | 1.165 | 1.662 | 0.261 | 0.649 | 0.000 | −0.953 | 0.750 | −1.192 to −0.715 | 10192.00 | 72320.00 | −6.926 |
| Di | 2.901 | 3.169 | 0.912 | 1.538 | 0.000 | −1.003 | 0.761 | −1.243 to −0.763 | 9200.50 | 71328.50 | −6.755 |
| SHI | 4.396 | 4.942 | 1.676 | 1.709 | 0.000 | −1.006 | 0.762 | −1.246 to −0.766 | 10776.00 | 72904.00 | −4.917 |
| VD | 1.703 | 2.355 | 0.375 | 0.861 | 0.000 | −1.012 | 0.763 | −1.252 to −0.772 | 9587.50 | 71715.50 | −7.241 |
| Cbb | 5.890 | 6.402 | 2.415 | 2.001 | 0.000 | −1.022 | 0.765 | −1.263 to −0.782 | 10325.50 | 72453.50 | −5.297 |
| Neqa | 6.011 | 5.997 | 2.537 | 2.227 | 0.000 | −1.034 | 0.768 | −1.274 to −0.794 | 10036.50 | 72164.50 | −5.553 |
| Na | 113.209 | 77.721 | 64.639 | 32.606 | 0.000 | −1.065 | 0.774 | −1.306 to −0.824 | 11326.50 | 73454.50 | −4.308 |
| Ng | 6.473 | 8.048 | 2.068 | 1.980 | 0.000 | −1.090 | 0.780 | −1.331 to −0.848 | 8874.50 | 71002.50 | −6.668 |
| V_2 | 4.560 | 5.879 | 1.128 | 1.214 | 0.000 | −1.197 | 0.801 | −1.44 to −0.953 | 9003.50 | 71131.50 | −6.676 |
| VE | 7.824 | 7.207 | 2.670 | 2.913 | 0.000 | −1.237 | 0.809 | −1.482 to −0.993 | 8111.50 | 70239.50 | −7.335 |
| DE | 27.396 | 21.300 | 11.653 | 7.618 | 0.000 | −1.336 | 0.828 | −1.583 to −1.09 | 8606.50 | 70734.50 | −6.816 |
| P | 22.571 | 16.570 | 9.330 | 6.188 | 0.000 | −1.424 | 0.843 | −1.672 to −1.175 | 8358.00 | 70486.00 | −7.045 |
| Ncd | 6.747 | 5.960 | 1.869 | 1.933 | 0.000 | −1.526 | 0.86 | −1.777 to −1.274 | 7167.50 | 69295.50 | −8.263 |
| VCL | 5.242 | 5.730 | 0.693 | 1.033 | 0.000 | −1.656 | 0.879 | −1.911 to −1.401 | 5248.50 | 67376.50 | −10.615 |
| VC | 48.791 | 35.985 | 14.759 | 10.551 | 0.000 | −1.812 | 0.900 | −2.071 to −1.552 | 5672.50 | 67800.50 | −9.506 |
| D | 44.110 | 31.491 | 14.489 | 8.247 | 0.000 | −1.849 | 0.905 | −2.11 to −1.589 | 5603.00 | 67731.00 | −9.573 |
| VK | 9.736 | 7.688 | 2.293 | 2.086 | 0.000 | −1.889 | 0.909 | −2.151 to −1.627 | 5158.50 | 67286.50 | −10.096 |
| VA | 11.659 | 10.140 | 2.088 | 2.214 | 0.000 | −1.919 | 0.913 | −2.181 to −1.656 | 4616.50 | 66744.50 | −10.605 |
| VF | 4.198 | 3.964 | 0.222 | 0.629 | 0.000 | −2.119 | 0.933 | −2.388 to −1.849 | 4026.50 | 66154.50 | −13.780 |
| Nh | 41.692 | 35.043 | 1.469 | 1.652 | 0.000 | −2.530 | 0.963 | −2.814 to −2.245 | 620.50 | 62748.50 | −14.398 |
Continuous feature distribution in regular health resources (RHR) translated to Chinese.
| Continuous MLS features | Min | Max | Mean | SE. | 95% confidence interval for mean | SD. | Skewness | SE. | Kurtosis | SE. | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASPP | 0.122 | 8.857 | 2.759 | 0.072 | 2.617 | 2.90 | 1.518 | 0.145 | 0.116 | −0.194 | 0.231 |
| DCW | 0.691 | 0.896 | 0.817 | 0.001 | 0.814 | 0.820 | 0.030 | −0.210 | 0.116 | 0.382 | 0.231 |
| RNP | 0.000 | 1.271 | 0.385 | 0.009 | 0.367 | 0.403 | 0.192 | 0.826 | 0.116 | 1.207 | 0.231 |
| ALFCW | 0.808 | 2.228 | 1.420 | 0.012 | 1.397 | 1.442 | 0.242 | 0.365 | 0.116 | -0.18 | 0.231 |
| NFNP | 200.00 | 458.33 | 316.25 | 1.765 | 312.78 | 319.71 | 37.14 | 0.268 | 0.116 | 0.616 | 0.231 |
| ASPC | 6.505 | 8.705 | 7.704 | 0.016 | 7.673 | 7.734 | 0.329 | -0.264 | 0.116 | 1.059 | 0.231 |
ASPP: average sentences per paragraph; DCW density of content words; RNP: ratio of noun phrases; ALFCW: average logarithmic frequency of content words; NFNP: normalised frequency of noun phrases; ASPC average strokes per character.
Figure 1Continuous feature distribution in regular health resources (RHR) translated into Chinese.
Performance of Gaussian Naïve Bayes (GNB) classifiers with different feature sets.
| Model | Techniques | Training (5-fold CV) | Testing | ||||
|---|---|---|---|---|---|---|---|
| AUC mean (SD) | AUC | Accuracy | Macro F1 | Sensitivity | Specificity | ||
| 1 | MLS + POS full (69) | 0.971 (0.0212) | 0.940 | 0.921 | 0.992 | 0.944 | 0.8824 |
| 2 | MLS + POS jointly optimised (6) | 0.998 (0.0026) | 0.993 | 0.940 | 0.943 | 0.963 | 0.9118 |
| 3 | MLS full (26 features) | 0.997 (0.003) | 1.0 | 0.966 | 0.963 | 1.0 | 0.9118 |
| 4 | MLS optimised (2 features) | 0.998 (0.004) | 1.0 | 0.943 | 0.938 | 1.0 | 0.8529 |
| 5 | POS full (46 features) | 0.959 (0.0238) | 0.907 | 0.852 | 0.843 | 0.889 | 0.7941 |
| 6 | POS optimised (8 features) | 0.982 (0.0166) | 0.968 | 0.955 | 0.951 | 1.0 | 0.8824 |
| 7 | MLS + POS separately optimised (10) | 1.0 (0) | 1.0 | 0.955 | 0.951 | 1.0 | 0.8824 |
| 8 | Refined MLS + POS separately optimised (2) | 0.995 (0.0079) | 0.999 | 0.989 | 0.988 | 0.982 | 1.0 |
Criteria of classifier optimisation.
| Techniques | Cross-validation accuracy (CVA) | Minimal classification error (MCE) = 1-CVA |
|---|---|---|
| Joint optimisation (6) | 0.9853 | 0.0147 |
| Optimised MLS features (2) | 0.9902 | 0.0098 |
| Optimised POS features (8) | 0.9854 | 0.0146 |
Figure 2Recursive feature elimination using SVM as base estimator.
Figure 3Percentage of vulnerable people-oriented (VPO) materials and regular health resources (RHR) assigned by GNB to 10 probability bins.
Thresholds, positive/negative likelihood ratio, and 95% CI of the best-performing GNB classifier on the test data.
| Probability thresholds | Sensitivity (95% CI) | Specificity (95% CI) | Positive likelihood ratio (LR+) (95% CI) | Negative likelihood ratio (LR-) (95% CI) |
|---|---|---|---|---|
| 0.1 | 0.982 (0.95, 1.0) | 0.9118 (0.82, 1.0) | 11.12 (3.77, 32.79) | 0.02031 (0.003, 0.142) |
|
|
|
|
|
|
| 0.5 | 0.963 (0.91, 1.0) | 0.941 (0.86, 1.0) | 16.37 (4.26, 62.87) | 0.039 (0.010, 0.154) |
| 0.6 | 0.963 (0.91, 1.0) | 1.0 (1.0, 1.0) | Infinity | 0.037 (0.010, 0.144) |
| 0.7 | 0.9444 (0.883, 1.0) | 1.0 (1.0, 1.0) | Infinity | 0.0556 (0.019, 0.167) |
| 0.9 | 0.9074 (0.830, 0.985) | 1.0 (1.0, 1.0) | Infinity | 0.09259 (0.0402, 0.213) |
| Machine learning classifier: refined GNB classifier (with ALFCW and Nh features) | ||||
Figure 4Receiver operating characteristic (ROC) of Gaussian Naïve Bayes classifiers.
Figure 5Percentage of vulnerable people-oriented (VPO) or regular health resources (RHR) assigned by CHAAAT to each 10-score bin.
Thresholds of transformed scores of CHAAAT regression formula.
| Score thresholds | Sensitivity | Specificity | Score thresholds | Sensitivity | Specificity |
|---|---|---|---|---|---|
| 1.828 | 1.000 | 0.853 | 64.118 | 0.926 | 0.941 |
| 7.685 | 1.000 | 0.882 | 66.222 | 0.907 | 0.941 |
| 16.462 | 0.981 | 0.882 | 68.542 | 0.889 | 0.941 |
| 25.144 | 0.981 | 0.912 | 70.964 | 0.870 | 0.941 |
|
|
|
| 72.994 | 0.870 | 0.971 |
| 43.160 | 0.963 | 0.941 | 73.327 | 0.852 | 0.971 |
| 59.899 | 0.944 | 0.941 | 76.001 | 0.852 | 1.000 |