| Literature DB >> 35983527 |
Qingqun Li1, Xiuli Zhang1, Chuxin Zhang1, Ying Li1, Shaorong Zhang1.
Abstract
Objective: To establish a risk prediction model of nonalcoholic fatty liver disease (NAFLD) and provide management strategies for preventing this disease.Entities:
Mesh:
Year: 2022 PMID: 35983527 PMCID: PMC9381194 DOI: 10.1155/2022/8793659
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Figure 1Data set preprocessing process.
Figure 2Flow chart of random forest construction.
Figure 3Data processing process based on machine learning.
General data comparison between training set and verification set.
| Variable | Training set | Validation set |
|
|
|---|---|---|---|---|
| Gender (male/female) | 60/80 | 17/43 | 3.742 | 0.053 |
| Age (years) | 39.1 ± 9.34 | 39.2 ± 9.41 | 0.069 | 0.945 |
| WC (cm) | 79.56 ± 9.28 | 79.48 ± 9.32 | -0.056 | 0.956 |
| BMI (kg/m2) | 23.15 ± 2.36 | 23.21 ± 2.24 | 0.167 | 0.867 |
| SBP (mmHg) | 123.26 ± 16.32 | 121.35 ± 15.68 | -0.767 | 0.444 |
| DBP (mmHg) | 76.48 ± 10.36 | 75.89 ± 10.43 | -0.368 | 0.713 |
| NAFLD (%) | 31 (22.1%) | 13 (21.7%) | 0.006 | 0.941 |
| FPG (mmol/L) | 5.30 (5.00-5.61) | 5.34 (5.01-5.62) | 1.832 | 0.067 |
| TC (mmol/L) | 4.84 ± 0.94 | 4.86 ± 0.92 | 0.139 | 0.890 |
| TG (mmol/L) | 0.95 (0.71-1.36) | 0.95 (0.72-1.38) | 0.552 | 0.581 |
| HDL-C (mmol/L) | 1.35 ± 0.31 | 1.36 ± 0.30 | 0.211 | 0.833 |
| LDL-C (mmol/L) | 2.52 ± 0.68 | 2.51 ± 0.66 | -0.096 | 0.924 |
| ALT (U/L) | 17.01 (13.11-23.53) | 17.01 (13.15-23.56) | 0.238 | 0.812 |
| AST (U/L) | 19.03 (16.10-22.23) | 19.03 (16.08-22.23) | 0.029 | 0.977 |
| GGT (U/L) | 15.00 (11.02-21.45) | 15.00 (11.00-23.30) | 0.985 | 0.325 |
| UA (umol/L) | 314.15 ± 79.56 | 312.36 ± 80.49 | -0.145 | 0.885 |
| MTTP mutations | 25 (17.9%) | 11 (18.3%) | 0.006 | 0.936 |
Multivariate logistic regression analysis of NAFLD risk in training set.
| Variable |
|
| Wald |
|
|
|---|---|---|---|---|---|
| BMI | 0.198 | 0.023 | 90.564 | 1.219 (1.170~1.273) | <0.01 |
| TG | 0.113 | 0.032 | 13.846 | 1.116 (1.050~1.183) | <0.01 |
| HDL-C | -1.087 | 0.316 | 20.982 | 0.332 (0.205~0.536) | <0.01 |
| LDL-C | 0.214 | 0.079 | 6.538 | 1.250 (1.050~1.491) | <0.05 |
| ALT | 0.016 | 0.005 | 11.195 | 1.015 (1.006~1.025) | <0.05 |
| UA | 0.005 | 0.003 | 12.184 | 1.004 (1.001~1.005) | <0.01 |
| MTTP | 0.997 | 0.056 | 8.671 | 2.71 (1.38~5.27) | <0.05 |
Figure 4Importance rating of characteristic variables based on random forest.
Figure 5ROC curves obtained by the prediction models based on logistic regression and random forest.