| Literature DB >> 35365931 |
Mazen Noureddin1, Fady Ntanios2, Deepa Malhotra2, Katherine Hoover2, Birol Emir2, Euan McLeod3, Naim Alkhouri4.
Abstract
This cohort analysis investigated the prevalence of nonalcoholic fatty liver disease (NAFLD) and NAFLD with fibrosis at different stages, associated clinical characteristics, and comorbidities in the general United States population and a subpopulation with type 2 diabetes mellitus (T2DM), using the National Health and Nutrition Examination Survey (NHANES) database (2017-2018). Machine learning was explored to predict NAFLD identified by transient elastography (FibroScan® ). Adults ≥20 years of age with valid transient elastography measurements were included; those with high alcohol consumption, viral hepatitis, or human immunodeficiency virus were excluded. Controlled attenuation parameter ≥302 dB/m using Youden's index defined NAFLD; vibration-controlled transient elastography liver stiffness cutoffs were ≤8.2, ≤9.7, ≤13.6, and >13.6 kPa for F0-F1, F2, F3, and F4, respectively. Predictive modeling, using six different machine-learning approaches with demographic and clinical data from NHANES, was applied. Age-adjusted prevalence of NAFLD and of NAFLD with F0-F1 and F2-F4 fibrosis was 25.3%, 18.9%, and 4.4%, respectively, in the overall population and 54.6%, 32.6%, and 18.3% in those with T2DM. The highest prevalence was among Mexican American participants. Test performance for all six machine-learning models was similar (area under the receiver operating characteristic curve, 0.79-0.84). Machine learning using logistic regression identified male sex, hemoglobin A1c, age, and body mass index among significant predictors of NAFLD (P ≤ 0.01).Entities:
Mesh:
Year: 2022 PMID: 35365931 PMCID: PMC9234676 DOI: 10.1002/hep4.1935
Source DB: PubMed Journal: Hepatol Commun ISSN: 2471-254X
NAFLD populations and definitions
| Population | CAP (dB/m) | Fibrosis score, VCTE (kPa) | Overall population, n | T2DM subpopulation, n |
|---|---|---|---|---|
| Non‐NAFLD (simple steatosis) | <302 | <8.2 | 2605 | 353 |
| NAFLD | ≥302 | – | 1226 | 468 |
| NAFLD F0–F1 | ≥302 | ≤8.2 | 875 | 310 |
| NAFLD F2 | ≥302 | 8.2–9.7 | 78 | 40 |
| NAFLD F3 | ≥302 | 9.7–13.6 | 84 | 42 |
| NAFLD F4 (cirrhosis) | ≥302 | >13.6 | 57 | 38 |
| Cryptogenic cirrhosis | <302 | >13.6 | 33 | 11 |
| Borderline steatosis | 274–<302 | 8.2–13.6 | 33 | 11 |
| Control | <274 | 8.2–13.6 | 66 | 16 |
Abbreviations: CAP, controlled attenuation parameter; NAFLD, nonalcoholic fatty liver disease; T2DM, type 2 diabetes mellitus; VCTE, vibration‐controlled transient elastography.
Unweighted.
Did not meet criteria for NAFLD using CAP but met criteria for fibrosis using VCTE.
FIGURE 1Participant disposition. aComplete transient elastography (FibroScan®) examination was defined as a fasting time of ≥3 hours, ≥10 complete stiffness (E) measures, and a liver stiffness interquartile range/median E < 30%. bHeavy drinker was defined as drinking an average of ≥20 g/day and ≥30 g/day for women or men, respectively, based on the NHANES alcohol use survey. cNAFLD was defined as CAP ≥302 and VCTE ≥8.2, including cryptogenic cirrhosis (CAP <302, VCTE >13.6), borderline steatosis (CAP 274–302, VCTE 8.2–13.6), and control (CAP <274, VCTE 8.2–13.6). dNon‐NAFLD was defined as simple steatosis, CAP <302, and VCTE <8.2. Abbreviations: CAP, controlled attenuation parameter; E, ≥10 complete stiffness measures; HIV, human immunodeficiency virus; NAFLD, nonalcoholic fatty liver disease; NHANES, National Health and Nutrition Examination Survey; T2DM, type 2 diabetes mellitus; VCTE, vibration‐controlled transient elastography
FIGURE 2Age‐adjusted prevalence of NAFLD and fibrosis. (A) NAFLD and fibrosis stages in the overall population and (B) participants with T2DM. (C) Fibrosis among participants with NAFLD in the overall population and (D) participants with T2DM. (E) NAFLD by ethnicity. Abbreviations: CI, confidence interval; NAFLD, nonalcoholic fatty liver disease; T2DM, type 2 diabetes mellitus
Age‐adjusted demographics, clinical characteristics, and metabolic comorbitities in the overall population and T2DM subpopulation
| Overall population | T2DM subpopulation | |||||||
|---|---|---|---|---|---|---|---|---|
| Non‐NAFLD CAP <302 and VCTE <8.2 | NAFLD CAP ≥302 | NAFLD F0–F1 CAP ≥302 and VCTE ≤8.2 | NAFLD F2–F3 CAP ≥302 and 8.2< VCTE ≤13.6 | Non‐NAFLD CAP <302 and VCTE <8.2 | NAFLD CAP ≥302 | NAFLD F0–F1 CAP ≥302 and VCTE ≤8.2 | NAFLD F2–F3 CAP ≥302 and 8.2< VCTE ≤13.6 | |
| n | 2605 | 1226 | 875 | 162 | 353 | 468 | 310 | 82 |
| Demographics | ||||||||
| Age, years; mean (SE) | 47.2 (0.18) | 48.0 (0.28) | 48.1 (0.25) | 47.9 (0.74) | 50.1 (0.38) | 49.3 (0.46) | 49.8 (0.56) | 48.2 (1.26) |
| Sex, male; % | 44.1 | 57.2 | 57.3 | 66.5 | 39.0 | 44.8 | 44.9 | 61.6 |
| Race; % | ||||||||
| White (non‐Hispanic) | 60.2 | 61.0 | 61.8 | 57.9 | 40.5 | 51.3 | 43.6 | 48.6 |
| Black (non‐Hispanic) | 12.8 | 8.8 | 7.5 | 9.1 | 20.8 | 11.7 | 16.4 | 9.5 |
| Mexican American | 8.0 | 13.1 | 13.9 | 17.4 | 18.1 | 18.4 | 21.2 | 22.1 |
| Other Hispanic | 7.3 | 7.0 | 6.8 | 8.2 | 6.4 | 6.8 | 6.5 | 10.4 |
| Asian (non‐Hispanic) | 6.9 | 5.4 | 5.8 | 5.2 | 8.3 | 5.7 | 8.1 | 6.9 |
| Other | 4.8 | 4.7 | 4.1 | 2.1 | 5.9 | 6.1 | 4.2 | 2.5 |
| Clinical characteristic | ||||||||
| BMI ≥30 kg/m2; % | 29.1 | 73.9 | 73.2 | 93.7 | 48.4 | 81.9 | 76.7 | 91.6 |
| Waist circumference, cm; mean (SE) | 94.6 (0.67) | 113.5 (1.05) | 112.2 (1.03) | 122.0 (1.35) | 105.3 (1.85) | 121.2 (1.67) | 117.1 (1.94) | 123.9 (2.84) |
| ALT, U/L; mean (SE) | 19.9 (0.32) | 29.1 (0.83) | 28.8 (0.94) | 37.0 (2.47) | 25.3 (1.43) | 32.8 (2.26) | 29.3 (2.21) | 40.0 (3.86) |
| AST, U/L; mean (SE) | 20.4 (0.25) | 23.7 (0.51) | 22.4 (0.32) | 27.4 (1.83) | 21.4 (0.98) | 25.4 (1.34) | 21.7 (1.06) | 29.5 (2.85) |
| Alkaline phosphatase, U/L; mean (SE) | 74.4 (0.97) | 81.8 (0.96) | 79.9 (0.96) | 84.3 (3.95) | 79.6 (3.10) | 85.6 (2.33) | 85.4 (2.61) | 87.6 (4.56) |
| GGT, U/L; mean (SE) | 23.7 (0.51) | 38.3 (1.78) | 34.4 (1.78) | 49.8 (3.67) | 29.1 (1.65) | 43.0 (2.03) | 33.7 (2.79) | 54.8 (5.33) |
| HbA1c; %; mean (SE) | 5.5 (0.02) | 6.1 (0.03) | 6.0 (0.04) | 6.4 (0.14) | 7.1 (0.16) | 7.5 (0.14) | 7.7 (0.20) | 7.6 (0.30) |
| HDL cholesterol, mg/dL; mean (SE) | 55.6 (0.39) | 45.8 (0.61) | 45.8 (0.57) | 43.7 (1.60) | 49.4 (1.12) | 43.4 (0.93) | 43.8 (1.15) | 42.4 (2.74) |
| Triglycerides, mg/dL; mean (SE) | 122.0 (2.05) | 192.4 (6.51) | 194.6 (7.37) | 223.6 (29.37) | 155.0 (10.01) | 222.4 (14.53) | 225.6 (14.87) | 262.7 (53.74) |
| Fib‐4 index; mean (SE) | 1.01 (0.02) | 0.94 (0.02) | 0.87 (0.02) | 1.07 (0.09) | 0.97 (0.04) | 0.97 (0.04) | 0.84 (0.04) | 1.13 (0.15) |
| NAFLD fibrosis score; mean (SE) | −1.98 (0.04) | −1.32 (0.05) | −1.54 (0.06) | −0.84 (0.12) | −0.51 (0.09) | −0.22 (0.13) | −0.49 (0.20) | −0.16 (0.18) |
| FibroScan® CAP, dB/m; mean (SE) | 232.8 (1.32) | 333.5 (2.27) | 339.4 (1.71) | 351.0 (2.78) | 252.8 (4.71) | 347.8 (2.74) | 351.8 (3.48) | 361.4 (4.38) |
| FibroScan® fibrosis, kPa; mean (SE) | 4.6 (0.05) | 7.9 (0.30) | 5.4 (0.07) | 10.1 (0.16) | 6.6 (1.29) | 10.3 (1.74) | 5.7 (0.20) | 10.2 (0.26) |
| FAST score; mean (SE) | 0.06 (0.00) | 0.21 (0.01) | 0.17 (0.01) | 0.35 (0.03) | 0.11 (0.01) | 0.28 (0.02) | 0.18 (0.02) | 0.41 (0.04) |
| Comorbidities | ||||||||
| Diabetes; % | 8.4 | 29.6 | 23.5 | 44.5 | 100.0 | 100.0 | 100.0 | 100.0 |
| Hypercholesterolemia; % | 32.6 | 43.1 | 43.0 | 50.2 | 44.9 | 52.6 | 52.7 | 61.0 |
| Hypertension; % | 24.3 | 44.6 | 44.0 | 43.4 | 47.4 | 50.6 | 45.2 | 41.9 |
| Ischemic heart disease; % | 4.4 | 7.2 | 6.0 | 10.7 | 8.2 | 11.4 | 10.5 | 17.0 |
Abbreviations: ALT, alanine aminotransferase; AST, aspartate aminotransferase; BMI, body mass index; CAP, controlled attenuation parameter; FAST, FibroScan®‐aspartate aminotransferase; Fib‐4, fibrosis‐4 index; GGT, gamma‐glutamyl transpeptidase; HbA1c, hemoglobin A1c; HDL, high‐density lipoprotein; NAFLD, nonalcoholic fatty liver disease; SE, standard error; T2DM, type 2 diabetes mellitus; VCTE, vibration‐controlled transient elastography.
Unweighted.
FIGURE 3Test performance by AUROC for the six machine‐learning methods. Abbreviations: AUC, area under the curve; AUROC, area under the receiver operating characteristic curve; Ctree, classification tree; ElasticNet, elastic network; LogReg, logistic regression; NeuralNet, neural network; RF, random forest; SVM, support vector machine
Predictive performance of the six machine‐learning models
| Model | AUROC (95% CI) | Accuracy | Sensitivity | Specificity | Predictive value | |
|---|---|---|---|---|---|---|
| Positive | Negative | |||||
| LogReg | 0.83 (0.81, 0.86) | 0.78 | 0.55 | 0.89 | 0.70 | 0.81 |
| Ctree | 0.79 (0.76, 0.82) | 0.75 | 0.53 | 0.85 | 0.63 | 0.80 |
| ElasticNet | 0.84 (0.81, 0.86) | 0.78 | 0.56 | 0.89 | 0.70 | 0.81 |
| RF | 0.83 (0.80, 0.86) | 0.79 | 0.61 | 0.88 | 0.70 | 0.83 |
| SVM | 0.83 (0.80, 0.85) | 0.78 | 0.52 | 0.90 | 0.72 | 0.80 |
| NeuralNet | 0.83 (0.80, 0.85) | 0.75 | 0.71 | 0.78 | 0.60 | 0.85 |
Abbreviations: AUROC, area under the receiver operating characteristic curve; CI, confidence interval; Ctree, classification tree; ElasticNet, elastic network; LogReg, logistic regression; NeuralNet, neural network; RF, random forest; SVM, support vector machines.
Machine‐learning predictors of NAFLD using logistic regression
| Clinical characteristic | OR | 95% CI |
|
|---|---|---|---|
| Sex; male versus female | 1.33 | 1.07–1.66 | 0.010 |
| HbA1c | 1.33 | 1.21–1.46 | <0.001 |
| BMI | 1.06 | 1.03–1.09 | <0.001 |
| Waist circumference | 1.05 | 1.03–1.06 | <0.001 |
| AST | 1.03 | 1.02–1.04 | <0.001 |
| Age | 1.01 | 1.01–1.02 | <0.001 |
| Diastolic blood pressure | 1.01 | 1.00–1.02 | 0.010 |
| Alkaline phosphatase | 1.00 | 1.00–1.01 | 0.018 |
| Triglycerides | 1.00 | 1.00–1.00 | <0.001 |
| HDL | 0.99 | 0.98–1.00 | 0.008 |
Abbreviations: AST, aspartate aminotransferase; BMI, body mass index; CI, confidence interval; HbA1c, hemoglobin A1C; HDL, high‐density lipoprotein; NAFLD, nonalcoholic fatty liver disease; OR, odds ratio.
Based on a 1‐point increase for all numeric covariates.